Amino acid dipeptide frequency for Human papillomavirus 199

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with every amino acid equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones in blue.
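As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping residue pairs within each protein (never across protein boundaries) and that the denominator is the total number of pairs; the page does not state either detail, so both are assumptions. The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard dipeptides.

    Assumption: overlapping pairs are counted within each protein, and the
    denominator is the total number of pairs observed in all proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(AMINO_ACIDS, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


# Toy sequences for illustration only, not the real proteome.
freqs = dipeptide_per_mille(["MAAL", "AALK"])

# Uniform expectation: 1 / 400 of all dipeptides, i.e. 2.5 per mille.
expected = 1000.0 / len(AMINO_ACIDS) ** 2
```

With the toy input there are six pairs in total, two of which are `AA`, so `freqs["AA"]` is one third of 1000‰, far above the uniform expectation stored in `expected`.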

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
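The unit conversion and the comparison against the uniform expectation can be sketched in a few lines of Python. The helper names `to_fraction` and `representation` are hypothetical, and the simple greater-than/less-than rule is an assumption: the page does not say what threshold it uses for the red and blue marking.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5


def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3


def representation(per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a value relative to the uniform expectation (assumed rule)."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "expected"


# Two values taken from the table: GluGlu and HisCys.
glu_glu = 8.901
his_cys = 0.445

glu_glu_fraction = to_fraction(glu_glu)   # about 0.0089, i.e. roughly 0.89%
glu_glu_label = representation(glu_glu)   # above 2.5 per mille
his_cys_label = representation(his_cys)   # below 2.5 per mille
```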

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency ± estimated error in ‰.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.45 ± 1.471 | 1.335 ± 0.784 | 2.67 ± 0.431 | 7.121 ± 0.734 | 4.005 ± 0.908 | 1.78 ± 0.662 | 1.78 ± 0.703 | 2.67 ± 0.95 | 2.67 ± 0.851 | 6.231 ± 1.741 | 1.335 ± 0.439 | 3.115 ± 0.994 | 4.895 ± 2.994 | 1.335 ± 0.726 | 3.56 ± 1.707 | 3.115 ± 1.016 | 4.45 ± 1.474 | 3.56 ± 1.423 | 0.0 ± 0.0 | 3.56 ± 1.285 | 0.0 ± 0.0
Cys | 1.335 ± 0.784 | 1.335 ± 0.784 | 1.335 ± 0.867 | 0.89 ± 0.548 | 1.335 ± 0.717 | 0.445 ± 0.354 | 0.445 ± 0.554 | 1.335 ± 0.569 | 3.115 ± 1.232 | 2.67 ± 1.392 | 0.89 ± 0.586 | 0.89 ± 0.548 | 1.335 ± 0.822 | 0.0 ± 0.0 | 0.89 ± 0.763 | 1.335 ± 0.902 | 0.89 ± 0.707 | 0.89 ± 0.636 | 2.225 ± 0.854 | 0.89 ± 0.548 | 0.0 ± 0.0
Asp | 2.67 ± 0.641 | 2.225 ± 0.354 | 2.67 ± 0.714 | 5.34 ± 0.897 | 3.115 ± 1.224 | 2.225 ± 0.676 | 1.78 ± 0.969 | 3.56 ± 1.218 | 2.225 ± 1.051 | 6.231 ± 1.362 | 0.89 ± 0.468 | 5.34 ± 0.791 | 5.34 ± 1.222 | 0.445 ± 0.354 | 2.225 ± 0.43 | 5.34 ± 1.595 | 4.45 ± 1.379 | 4.895 ± 2.058 | 0.89 ± 0.707 | 0.89 ± 0.763 | 0.0 ± 0.0
Glu | 3.56 ± 1.32 | 1.335 ± 0.784 | 4.005 ± 0.802 | 8.901 ± 2.372 | 2.225 ± 0.66 | 3.115 ± 1.108 | 1.335 ± 0.803 | 3.115 ± 0.872 | 3.115 ± 1.258 | 4.005 ± 1.174 | 0.89 ± 0.509 | 3.115 ± 0.847 | 4.45 ± 2.177 | 2.67 ± 1.033 | 2.67 ± 0.542 | 4.005 ± 1.876 | 3.56 ± 1.601 | 1.335 ± 0.737 | 0.89 ± 0.499 | 2.67 ± 1.25 | 0.0 ± 0.0
Phe | 5.785 ± 0.675 | 1.78 ± 0.741 | 4.005 ± 0.812 | 3.56 ± 1.771 | 1.78 ± 0.741 | 2.225 ± 0.975 | 1.78 ± 1.002 | 2.67 ± 1.08 | 5.34 ± 2.9 | 6.231 ± 1.555 | 0.445 ± 0.394 | 1.78 ± 0.606 | 1.335 ± 0.391 | 3.115 ± 1.29 | 2.225 ± 0.854 | 2.67 ± 1.033 | 3.115 ± 0.652 | 1.78 ± 1.17 | 1.335 ± 0.439 | 2.67 ± 0.947 | 0.0 ± 0.0
Gly | 1.78 ± 0.67 | 0.445 ± 0.354 | 7.121 ± 1.807 | 2.67 ± 1.879 | 1.78 ± 0.567 | 2.67 ± 1.95 | 0.89 ± 0.586 | 2.67 ± 0.759 | 2.67 ± 1.073 | 2.225 ± 1.303 | 1.335 ± 0.737 | 4.005 ± 0.613 | 3.115 ± 1.023 | 0.445 ± 0.401 | 4.45 ± 1.124 | 4.45 ± 1.979 | 3.115 ± 0.784 | 2.225 ± 1.011 | 0.0 ± 0.0 | 1.335 ± 0.726 | 0.0 ± 0.0
His | 1.335 ± 0.619 | 0.445 ± 0.486 | 0.445 ± 0.353 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.335 ± 0.723 | 0.445 ± 0.554 | 1.78 ± 0.794 | 0.89 ± 0.548 | 2.225 ± 1.115 | 0.89 ± 0.636 | 0.0 ± 0.0 | 1.335 ± 0.726 | 0.445 ± 0.353 | 0.0 ± 0.0 | 3.115 ± 0.909 | 1.335 ± 0.439 | 0.445 ± 0.452 | 0.89 ± 0.972 | 0.89 ± 0.417 | 0.0 ± 0.0
Ile | 3.56 ± 1.808 | 0.89 ± 0.404 | 2.67 ± 0.412 | 5.785 ± 1.078 | 3.115 ± 0.591 | 3.115 ± 0.868 | 0.0 ± 0.0 | 3.115 ± 1.18 | 1.335 ± 0.661 | 3.115 ± 1.363 | 0.0 ± 0.0 | 1.78 ± 0.567 | 2.225 ± 1.035 | 2.225 ± 0.43 | 1.78 ± 0.758 | 2.67 ± 1.65 | 4.45 ± 1.035 | 4.45 ± 1.149 | 0.0 ± 0.0 | 2.67 ± 1.065 | 0.0 ± 0.0
Lys | 3.115 ± 0.558 | 1.335 ± 0.619 | 1.78 ± 0.67 | 4.005 ± 1.653 | 4.005 ± 1.197 | 3.115 ± 0.849 | 0.89 ± 0.548 | 1.78 ± 0.777 | 4.005 ± 0.768 | 3.115 ± 1.212 | 1.335 ± 0.653 | 6.676 ± 3.049 | 1.78 ± 0.846 | 2.225 ± 0.712 | 4.45 ± 0.929 | 3.115 ± 0.888 | 3.115 ± 0.758 | 3.56 ± 1.456 | 0.0 ± 0.0 | 1.335 ± 0.569 | 0.0 ± 0.0
Leu | 6.231 ± 1.0 | 2.67 ± 1.709 | 5.785 ± 0.843 | 3.56 ± 1.074 | 4.45 ± 1.678 | 5.785 ± 2.309 | 3.115 ± 1.0 | 3.56 ± 1.148 | 4.45 ± 1.126 | 8.901 ± 2.413 | 1.335 ± 0.708 | 2.225 ± 0.716 | 4.005 ± 1.491 | 6.231 ± 1.495 | 4.45 ± 1.641 | 5.34 ± 1.422 | 4.45 ± 1.079 | 5.785 ± 1.259 | 1.335 ± 0.837 | 4.005 ± 1.544 | 0.0 ± 0.0
Met | 1.335 ± 0.673 | 0.0 ± 0.0 | 0.445 ± 0.353 | 0.89 ± 0.643 | 2.67 ± 0.641 | 1.335 ± 0.673 | 0.0 ± 0.0 | 0.445 ± 0.353 | 0.0 ± 0.0 | 0.89 ± 0.548 | 0.0 ± 0.0 | 0.89 ± 0.404 | 1.335 ± 0.439 | 0.0 ± 0.0 | 0.89 ± 0.636 | 1.78 ± 0.567 | 0.89 ± 0.404 | 0.89 ± 0.404 | 0.445 ± 0.353 | 1.78 ± 0.798 | 0.0 ± 0.0
Asn | 4.45 ± 0.982 | 1.335 ± 1.0 | 2.67 ± 0.929 | 1.335 ± 0.897 | 2.225 ± 0.777 | 1.335 ± 0.439 | 0.445 ± 0.353 | 4.45 ± 1.782 | 4.005 ± 0.677 | 1.78 ± 1.096 | 0.89 ± 0.657 | 3.56 ± 0.802 | 4.45 ± 1.671 | 2.225 ± 0.354 | 3.115 ± 1.213 | 4.005 ± 0.929 | 4.005 ± 0.812 | 4.45 ± 1.228 | 0.445 ± 0.353 | 1.335 ± 0.381 | 0.0 ± 0.0
Pro | 2.67 ± 0.861 | 0.89 ± 0.586 | 4.895 ± 1.795 | 0.89 ± 0.801 | 2.67 ± 0.861 | 0.445 ± 0.401 | 0.0 ± 0.0 | 1.78 ± 1.17 | 3.56 ± 1.269 | 6.676 ± 2.066 | 0.89 ± 0.485 | 2.225 ± 1.048 | 4.895 ± 1.182 | 1.78 ± 0.621 | 2.67 ± 1.076 | 8.011 ± 3.751 | 7.121 ± 2.047 | 3.115 ± 1.812 | 0.0 ± 0.0 | 1.78 ± 0.998 | 0.0 ± 0.0
Gln | 2.225 ± 0.87 | 0.445 ± 0.486 | 1.78 ± 0.703 | 2.67 ± 0.878 | 3.115 ± 1.145 | 2.225 ± 0.87 | 0.0 ± 0.0 | 0.89 ± 0.404 | 0.89 ± 0.904 | 5.34 ± 1.923 | 1.335 ± 0.793 | 1.335 ± 0.391 | 1.335 ± 0.764 | 2.67 ± 0.676 | 1.78 ± 1.297 | 1.78 ± 0.621 | 2.225 ± 0.464 | 4.45 ± 1.507 | 1.335 ± 0.717 | 2.225 ± 0.43 | 0.0 ± 0.0
Arg | 1.78 ± 0.67 | 2.225 ± 1.166 | 1.78 ± 0.735 | 2.225 ± 0.716 | 2.225 ± 0.854 | 3.56 ± 1.726 | 1.78 ± 0.879 | 0.445 ± 0.354 | 5.34 ± 1.282 | 6.231 ± 1.334 | 0.89 ± 0.404 | 1.78 ± 1.086 | 3.115 ± 1.812 | 2.67 ± 1.306 | 4.45 ± 1.865 | 5.34 ± 0.897 | 2.225 ± 0.776 | 4.005 ± 1.45 | 0.89 ± 0.904 | 2.225 ± 0.962 | 0.0 ± 0.0
Ser | 4.45 ± 1.387 | 0.89 ± 0.417 | 4.895 ± 1.437 | 4.005 ± 1.151 | 4.45 ± 1.364 | 4.895 ± 1.258 | 0.89 ± 0.904 | 4.45 ± 1.108 | 2.67 ± 0.893 | 6.676 ± 1.077 | 1.78 ± 1.413 | 4.005 ± 1.472 | 4.005 ± 1.256 | 4.005 ± 0.508 | 4.005 ± 1.027 | 7.121 ± 2.013 | 5.34 ± 2.105 | 6.231 ± 1.3 | 0.0 ± 0.0 | 0.89 ± 0.417 | 0.0 ± 0.0
Thr | 5.785 ± 1.437 | 1.78 ± 0.906 | 5.785 ± 1.293 | 3.56 ± 0.916 | 4.895 ± 1.558 | 4.005 ± 1.031 | 1.335 ± 1.06 | 2.67 ± 0.947 | 1.335 ± 0.672 | 5.785 ± 1.194 | 0.445 ± 0.354 | 3.115 ± 1.533 | 4.005 ± 1.439 | 2.225 ± 0.712 | 4.005 ± 1.514 | 3.115 ± 1.352 | 5.34 ± 1.958 | 6.231 ± 1.0 | 0.0 ± 0.0 | 1.335 ± 0.884 | 0.0 ± 0.0
Val | 4.005 ± 1.406 | 0.89 ± 0.763 | 6.231 ± 1.514 | 2.225 ± 0.863 | 4.895 ± 1.868 | 3.56 ± 1.209 | 0.89 ± 0.417 | 4.895 ± 2.055 | 3.115 ± 1.108 | 5.34 ± 1.657 | 0.0 ± 0.0 | 2.67 ± 1.68 | 3.115 ± 0.753 | 2.67 ± 0.936 | 3.56 ± 0.879 | 5.785 ± 1.206 | 4.895 ± 0.941 | 2.225 ± 1.129 | 0.445 ± 0.354 | 0.89 ± 0.484 | 0.0 ± 0.0
Trp | 1.335 ± 0.672 | 0.445 ± 0.353 | 0.89 ± 0.499 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.225 ± 1.035 | 0.445 ± 0.353 | 1.335 ± 0.673 | 0.0 ± 0.0 | 1.335 ± 0.737 | 0.0 ± 0.0 | 0.89 ± 0.499 | 1.78 ± 1.05 | 0.445 ± 0.353 | 0.89 ± 0.904 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 1.78 ± 1.086 | 1.78 ± 0.952 | 0.89 ± 0.417 | 1.335 ± 0.641 | 3.115 ± 0.417 | 2.225 ± 0.716 | 0.445 ± 0.353 | 0.445 ± 0.354 | 3.56 ± 0.923 | 2.67 ± 0.918 | 0.89 ± 0.404 | 2.67 ± 1.42 | 0.89 ± 0.499 | 1.78 ± 0.777 | 2.225 ± 0.772 | 3.115 ± 1.481 | 0.89 ± 0.484 | 1.78 ± 0.606 | 0.89 ± 0.708 | 1.78 ± 0.972 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 6 proteins (2248 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
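A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the reported error is taken to be the standard deviation of those estimates. The page does not specify which spread statistic it reports, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency (in per mille).

    Assumption: each round draws len(proteins) whole proteins with
    replacement, and the error is the standard deviation over rounds.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # Counter gives 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy sequences for illustration only, not the real proteome.
toy = ["MAAL", "AALK", "GGGG"]
err_aa = bootstrap_error(toy, "AA")
err_ww = bootstrap_error(toy, "WW")  # never observed, so every estimate is 0
```

Resampling whole proteins rather than individual residues keeps the pairs within each protein intact. A dipeptide that never occurs gets an error of zero under this scheme, which is consistent with the `0.0 ± 0.0` entries in the table.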

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski