Amino acid dipeptide frequency for Human papillomavirus type 34

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
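As an illustration of the definition above, here is a minimal Python sketch of how dipeptide frequencies could be counted. It is not the code used to build this page: the function name `dipeptide_per_mille`, the mapping of every non-standard letter to `X`, and the choice to count overlapping pairs only within each protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands in for Xaa (assumption)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: overlapping pairs are counted inside each protein
    only, and any letter outside the 20 standard codes is treated as X.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


freqs = dipeptide_per_mille(["MKLLA", "LLB"])  # toy sequences, not real data
print(len(freqs))             # 441 entries, one per possible pair
print(round(freqs["LL"], 3))  # share of Leu-Leu pairs in per mille
```

Under a uniform model over the 400 standard pairs, each entry would be 1000 / 400 = 2.5‰, which is the baseline the text refers to.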

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
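The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the simple threshold against 2.5‰ is an assumption: the exact rule used for the red and blue marking on this page is not stated.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25% for 400 pairs


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation.

    Assumption: a plain comparison with the expected value; the error
    estimates are ignored here.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Two values taken from the table
print(per_mille_to_fraction(10.508), classify(10.508))  # LeuLeu
print(per_mille_to_fraction(0.438), classify(0.438))    # AlaHis
```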

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.189 ± 1.049
AlaCys: 1.313 ± 0.439
AlaAsp: 4.816 ± 1.314
AlaGlu: 2.627 ± 0.714
AlaPhe: 2.189 ± 1.051
AlaGly: 3.94 ± 1.107
AlaHis: 0.438 ± 0.349
AlaIle: 3.94 ± 0.659
AlaLys: 3.503 ± 1.111
AlaLeu: 6.13 ± 1.247
AlaMet: 0.438 ± 0.373
AlaAsn: 1.751 ± 0.928
AlaPro: 3.065 ± 1.021
AlaGln: 2.189 ± 0.714
AlaArg: 2.189 ± 0.563
AlaSer: 4.378 ± 1.679
AlaThr: 3.94 ± 1.794
AlaVal: 2.627 ± 0.996
AlaTrp: 0.876 ± 0.788
AlaTyr: 0.438 ± 0.349
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.751 ± 0.583
CysCys: 0.0 ± 0.0
CysAsp: 1.313 ± 0.85
CysGlu: 3.065 ± 2.25
CysPhe: 0.876 ± 0.923
CysGly: 1.751 ± 0.662
CysHis: 0.0 ± 0.0
CysIle: 0.876 ± 0.411
CysLys: 2.189 ± 1.075
CysLeu: 3.503 ± 1.473
CysMet: 0.876 ± 0.537
CysAsn: 0.876 ± 0.724
CysPro: 2.189 ± 0.836
CysGln: 2.627 ± 1.28
CysArg: 0.438 ± 0.349
CysSer: 1.751 ± 1.175
CysThr: 0.876 ± 0.806
CysVal: 3.065 ± 1.67
CysTrp: 1.313 ± 0.673
CysTyr: 0.438 ± 0.536
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.378 ± 1.098
AspCys: 2.627 ± 1.215
AspAsp: 3.065 ± 1.994
AspGlu: 2.189 ± 1.432
AspPhe: 2.627 ± 0.583
AspGly: 2.627 ± 1.193
AspHis: 0.876 ± 0.923
AspIle: 3.94 ± 1.302
AspLys: 2.189 ± 0.814
AspLeu: 5.692 ± 2.156
AspMet: 0.876 ± 0.411
AspAsn: 3.94 ± 1.075
AspPro: 3.065 ± 1.283
AspGln: 1.313 ± 0.678
AspArg: 1.313 ± 0.67
AspSer: 6.13 ± 1.165
AspThr: 6.13 ± 2.011
AspVal: 3.065 ± 1.332
AspTrp: 0.438 ± 0.349
AspTyr: 1.313 ± 0.802
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.189 ± 0.711
GluCys: 0.876 ± 0.746
GluAsp: 3.94 ± 1.815
GluGlu: 3.503 ± 1.236
GluPhe: 0.876 ± 0.413
GluGly: 2.189 ± 0.923
GluHis: 0.438 ± 0.373
GluIle: 2.627 ± 0.767
GluLys: 1.313 ± 0.779
GluLeu: 3.503 ± 0.611
GluMet: 0.876 ± 0.457
GluAsn: 3.94 ± 1.091
GluPro: 0.438 ± 0.349
GluGln: 2.189 ± 0.746
GluArg: 3.503 ± 1.552
GluSer: 3.503 ± 1.608
GluThr: 4.816 ± 0.845
GluVal: 4.378 ± 1.079
GluTrp: 0.876 ± 0.457
GluTyr: 1.751 ± 1.048
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.189 ± 0.563
PheCys: 0.0 ± 0.0
PheAsp: 1.751 ± 0.966
PheGlu: 0.438 ± 0.377
PhePhe: 1.751 ± 0.827
PheGly: 3.065 ± 0.931
PheHis: 0.438 ± 0.689
PheIle: 2.189 ± 1.009
PheLys: 2.627 ± 0.899
PheLeu: 3.94 ± 0.88
PheMet: 1.313 ± 0.439
PheAsn: 2.627 ± 0.668
PhePro: 1.313 ± 0.673
PheGln: 0.876 ± 0.464
PheArg: 0.876 ± 0.704
PheSer: 2.627 ± 0.794
PheThr: 2.189 ± 1.098
PheVal: 3.94 ± 1.295
PheTrp: 0.876 ± 0.411
PheTyr: 3.065 ± 1.468
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.313 ± 0.635
GlyCys: 0.876 ± 0.411
GlyAsp: 3.503 ± 1.271
GlyGlu: 1.751 ± 0.822
GlyPhe: 1.751 ± 0.861
GlyGly: 3.503 ± 0.899
GlyHis: 0.876 ± 0.537
GlyIle: 3.94 ± 0.846
GlyLys: 4.378 ± 1.65
GlyLeu: 4.378 ± 1.469
GlyMet: 1.313 ± 0.703
GlyAsn: 3.94 ± 1.547
GlyPro: 2.627 ± 0.99
GlyGln: 1.313 ± 0.753
GlyArg: 4.816 ± 1.631
GlySer: 4.378 ± 1.422
GlyThr: 5.254 ± 2.776
GlyVal: 3.94 ± 0.987
GlyTrp: 0.438 ± 0.349
GlyTyr: 1.751 ± 0.803
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.313 ± 0.804
HisCys: 0.0 ± 0.0
HisAsp: 0.438 ± 0.377
HisGlu: 0.876 ± 0.724
HisPhe: 1.313 ± 0.666
HisGly: 0.876 ± 0.588
HisHis: 0.0 ± 0.0
HisIle: 2.189 ± 0.855
HisLys: 1.751 ± 0.914
HisLeu: 3.503 ± 1.238
HisMet: 0.0 ± 0.0
HisAsn: 2.189 ± 1.324
HisPro: 1.751 ± 1.105
HisGln: 0.438 ± 0.689
HisArg: 1.313 ± 0.441
HisSer: 2.627 ± 0.914
HisThr: 0.438 ± 0.462
HisVal: 1.751 ± 1.311
HisTrp: 1.313 ± 0.93
HisTyr: 0.876 ± 0.545
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.065 ± 0.861
IleCys: 1.313 ± 0.802
IleAsp: 3.503 ± 0.962
IleGlu: 2.627 ± 0.979
IlePhe: 2.189 ± 1.29
IleGly: 2.189 ± 1.201
IleHis: 1.751 ± 0.662
IleIle: 2.627 ± 1.686
IleLys: 1.751 ± 0.822
IleLeu: 3.94 ± 1.106
IleMet: 0.876 ± 0.505
IleAsn: 0.876 ± 0.464
IlePro: 5.254 ± 2.323
IleGln: 0.876 ± 0.464
IleArg: 3.065 ± 0.691
IleSer: 3.94 ± 0.859
IleThr: 4.378 ± 1.213
IleVal: 6.13 ± 1.545
IleTrp: 0.438 ± 0.462
IleTyr: 2.627 ± 0.526
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.065 ± 1.163
LysCys: 2.627 ± 1.173
LysAsp: 1.313 ± 0.439
LysGlu: 2.627 ± 1.406
LysPhe: 2.189 ± 1.089
LysGly: 3.503 ± 2.155
LysHis: 2.189 ± 1.289
LysIle: 1.751 ± 0.9
LysLys: 1.751 ± 0.9
LysLeu: 2.189 ± 0.657
LysMet: 0.876 ± 0.411
LysAsn: 1.313 ± 0.439
LysPro: 2.627 ± 1.984
LysGln: 4.378 ± 1.314
LysArg: 7.005 ± 1.115
LysSer: 2.627 ± 1.652
LysThr: 2.627 ± 0.902
LysVal: 4.816 ± 1.298
LysTrp: 0.438 ± 0.462
LysTyr: 2.627 ± 0.911
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.378 ± 1.243
LeuCys: 3.503 ± 2.58
LeuAsp: 5.692 ± 1.345
LeuGlu: 6.567 ± 2.129
LeuPhe: 2.627 ± 1.0
LeuGly: 3.94 ± 1.822
LeuHis: 3.94 ± 1.461
LeuIle: 2.627 ± 1.458
LeuLys: 6.567 ± 1.708
LeuLeu: 10.508 ± 3.082
LeuMet: 1.751 ± 0.962
LeuAsn: 1.313 ± 0.441
LeuPro: 2.627 ± 1.16
LeuGln: 8.319 ± 2.069
LeuArg: 2.189 ± 0.855
LeuSer: 3.94 ± 1.193
LeuThr: 5.692 ± 1.851
LeuVal: 2.627 ± 0.856
LeuTrp: 0.0 ± 0.0
LeuTyr: 4.816 ± 1.435
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.189 ± 0.94
MetCys: 1.313 ± 0.754
MetAsp: 0.876 ± 0.411
MetGlu: 1.313 ± 0.67
MetPhe: 1.751 ± 0.714
MetGly: 1.751 ± 0.949
MetHis: 0.876 ± 0.692
MetIle: 1.313 ± 0.836
MetLys: 0.438 ± 0.349
MetLeu: 0.438 ± 0.349
MetMet: 0.438 ± 0.462
MetAsn: 0.438 ± 0.373
MetPro: 0.0 ± 0.0
MetGln: 1.313 ± 0.67
MetArg: 0.876 ± 0.464
MetSer: 2.189 ± 0.714
MetThr: 1.313 ± 0.703
MetVal: 0.876 ± 0.411
MetTrp: 0.876 ± 0.746
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.254 ± 1.055
AsnCys: 0.876 ± 0.588
AsnAsp: 2.189 ± 1.051
AsnGlu: 1.751 ± 0.914
AsnPhe: 1.313 ± 1.119
AsnGly: 2.627 ± 0.519
AsnHis: 0.0 ± 0.0
AsnIle: 3.94 ± 1.179
AsnLys: 3.065 ± 1.669
AsnLeu: 1.313 ± 0.779
AsnMet: 0.438 ± 0.349
AsnAsn: 3.065 ± 1.033
AsnPro: 4.378 ± 1.307
AsnGln: 1.313 ± 0.779
AsnArg: 0.876 ± 0.411
AsnSer: 3.94 ± 1.044
AsnThr: 5.254 ± 1.005
AsnVal: 3.503 ± 1.229
AsnTrp: 1.313 ± 1.048
AsnTyr: 0.438 ± 0.349
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.503 ± 1.698
ProCys: 1.751 ± 0.662
ProAsp: 4.816 ± 1.493
ProGlu: 1.313 ± 0.757
ProPhe: 1.751 ± 0.98
ProGly: 2.627 ± 1.239
ProHis: 0.438 ± 0.349
ProIle: 3.065 ± 0.933
ProLys: 3.065 ± 0.807
ProLeu: 7.443 ± 1.748
ProMet: 0.876 ± 0.545
ProAsn: 1.313 ± 0.992
ProPro: 7.005 ± 2.142
ProGln: 0.438 ± 0.373
ProArg: 1.751 ± 0.576
ProSer: 6.13 ± 1.895
ProThr: 6.567 ± 3.226
ProVal: 2.627 ± 1.432
ProTrp: 0.0 ± 0.0
ProTyr: 3.065 ± 1.718
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.503 ± 1.229
GlnCys: 3.065 ± 1.752
GlnAsp: 3.503 ± 1.052
GlnGlu: 0.876 ± 0.699
GlnPhe: 3.503 ± 1.166
GlnGly: 0.876 ± 0.411
GlnHis: 0.876 ± 0.413
GlnIle: 1.751 ± 0.569
GlnLys: 1.751 ± 0.662
GlnLeu: 3.94 ± 1.429
GlnMet: 1.751 ± 0.98
GlnAsn: 1.313 ± 0.703
GlnPro: 3.503 ± 1.377
GlnGln: 3.065 ± 1.246
GlnArg: 1.751 ± 0.569
GlnSer: 3.065 ± 0.814
GlnThr: 1.751 ± 0.583
GlnVal: 2.627 ± 1.075
GlnTrp: 2.189 ± 0.717
GlnTyr: 1.751 ± 0.662
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.627 ± 1.393
ArgCys: 2.627 ± 2.288
ArgAsp: 2.189 ± 0.928
ArgGlu: 1.313 ± 0.802
ArgPhe: 0.876 ± 0.724
ArgGly: 0.438 ± 0.373
ArgHis: 3.065 ± 0.463
ArgIle: 1.751 ± 0.84
ArgLys: 5.692 ± 1.191
ArgLeu: 6.13 ± 1.469
ArgMet: 0.876 ± 0.457
ArgAsn: 0.876 ± 0.699
ArgPro: 4.378 ± 1.229
ArgGln: 2.627 ± 1.619
ArgArg: 2.189 ± 1.048
ArgSer: 2.627 ± 0.519
ArgThr: 2.189 ± 0.657
ArgVal: 1.751 ± 0.647
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.065 ± 1.23
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.503 ± 0.815
SerCys: 0.438 ± 0.373
SerAsp: 3.94 ± 1.603
SerGlu: 2.189 ± 1.002
SerPhe: 0.876 ± 0.464
SerGly: 7.443 ± 2.041
SerHis: 2.189 ± 0.975
SerIle: 3.94 ± 1.241
SerLys: 3.065 ± 0.887
SerLeu: 6.13 ± 1.832
SerMet: 1.313 ± 1.119
SerAsn: 7.005 ± 3.157
SerPro: 3.065 ± 1.003
SerGln: 3.503 ± 1.426
SerArg: 3.065 ± 0.998
SerSer: 10.508 ± 2.086
SerThr: 8.319 ± 1.401
SerVal: 6.13 ± 2.188
SerTrp: 0.876 ± 0.457
SerTyr: 2.189 ± 0.72
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.189 ± 1.201
ThrCys: 2.189 ± 0.702
ThrAsp: 4.378 ± 1.652
ThrGlu: 4.816 ± 0.963
ThrPhe: 3.94 ± 1.012
ThrGly: 3.94 ± 1.151
ThrHis: 2.627 ± 1.102
ThrIle: 3.503 ± 0.907
ThrLys: 0.438 ± 0.349
ThrLeu: 3.94 ± 1.328
ThrMet: 2.189 ± 0.68
ThrAsn: 6.13 ± 1.345
ThrPro: 5.692 ± 1.743
ThrGln: 5.692 ± 0.683
ThrArg: 3.065 ± 1.332
ThrSer: 5.692 ± 0.963
ThrThr: 8.319 ± 3.669
ThrVal: 5.692 ± 0.94
ThrTrp: 1.313 ± 0.67
ThrTyr: 2.627 ± 0.779
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.751 ± 0.682
ValCys: 3.503 ± 1.769
ValAsp: 3.503 ± 1.233
ValGlu: 4.378 ± 1.277
ValPhe: 4.378 ± 1.087
ValGly: 3.503 ± 2.057
ValHis: 1.751 ± 1.129
ValIle: 3.065 ± 1.014
ValLys: 2.189 ± 0.72
ValLeu: 3.065 ± 1.08
ValMet: 0.876 ± 0.593
ValAsn: 1.751 ± 0.861
ValPro: 5.254 ± 1.344
ValGln: 3.065 ± 1.625
ValArg: 2.627 ± 1.299
ValSer: 7.881 ± 2.693
ValThr: 5.692 ± 2.011
ValVal: 3.94 ± 0.964
ValTrp: 0.876 ± 0.537
ValTyr: 4.378 ± 2.982
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.876 ± 0.411
TrpCys: 0.0 ± 0.0
TrpAsp: 0.438 ± 0.462
TrpGlu: 1.313 ± 0.439
TrpPhe: 0.876 ± 0.411
TrpGly: 0.876 ± 0.411
TrpHis: 0.876 ± 0.537
TrpIle: 0.876 ± 0.699
TrpLys: 1.751 ± 0.966
TrpLeu: 1.751 ± 0.596
TrpMet: 0.0 ± 0.0
TrpAsn: 0.438 ± 0.373
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.876 ± 0.704
TrpSer: 0.438 ± 0.349
TrpThr: 2.189 ± 1.397
TrpVal: 0.876 ± 0.457
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.313 ± 0.641
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.751 ± 0.736
TyrCys: 0.876 ± 0.724
TyrAsp: 2.627 ± 0.583
TyrGlu: 2.189 ± 0.88
TyrPhe: 0.438 ± 0.377
TyrGly: 4.378 ± 0.559
TyrHis: 1.313 ± 0.615
TyrIle: 3.503 ± 1.152
TyrLys: 3.065 ± 0.675
TyrLeu: 2.189 ± 1.089
TyrMet: 2.189 ± 0.689
TyrAsn: 1.751 ± 1.07
TyrPro: 1.313 ± 1.119
TyrGln: 0.876 ± 0.699
TyrArg: 3.503 ± 2.13
TyrSer: 1.313 ± 0.757
TyrThr: 0.438 ± 0.462
TyrVal: 3.503 ± 0.62
TyrTrp: 1.313 ± 0.439
TyrTyr: 2.627 ± 1.612
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2285 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
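A protein-level bootstrap of this kind can be sketched in Python as below. This is an illustrative reconstruction, not the database's own code: the function names are hypothetical, and reporting the standard deviation of the resampled estimates as the error is an assumption, since the page does not say which statistic is used.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Per mille frequency of one dipeptide (one-letter codes) in a protein set."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Resample whole proteins with replacement n_rounds times.

    Assumption: the error is taken as the standard deviation of the
    frequency across the resampled proteomes.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


toy_proteome = ["MKLLAS", "LLSSTT", "MSTLLK"]  # toy sequences, not real data
print(dipeptide_freq(toy_proteome, "LL"), bootstrap_error(toy_proteome, "LL"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which matters for a proteome of only 7 proteins where a single long protein can dominate the counts.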

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski