Amino acid dipeptide frequency for Human papillomavirus type 137

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one should be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
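
The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the function names, the one-letter residue codes, the choice to count only pairs within a single protein, and the use of the uniform 2.5‰ expectation as the over/under threshold are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes (assumption: the page itself
# uses three-letter codes; one-letter codes keep the sketch short).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"  # X stands for Xaa (non-standard or unknown)

# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille.
EXPECTED = 1000.0 / 400


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the standard alphabet to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; no pair spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq):
    """Label a frequency relative to the uniform expectation (hypothetical rule)."""
    if freq > EXPECTED:
        return "over"
    if freq < EXPECTED:
        return "under"
    return "expected"


# Toy input, not real HPV sequences.
freqs = dipeptide_frequencies(["ALLA", "LLK"])
for pair in ("LL", "AL", "KK"):
    print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

With only a handful of proteins most of the 441 cells are based on very few observations, which is why the table reports an error estimate next to each value.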

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
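
As a quick worked example of the unit conversion, the snippet below turns a per mille value into a plain fraction and into a percentage. The helper names are hypothetical; the sample value 10.788 is the LeuLeu entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


leu_leu = 10.788  # LeuLeu frequency from the table, in per mille
print(per_mille_to_fraction(leu_leu))
print(per_mille_to_percent(leu_leu))
# The uniform expectation of 0.25 % corresponds to 2.5 per mille.
print(per_mille_to_percent(2.5))
```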

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.32 ± 1.148
AlaCys: 1.66 ± 1.148
AlaAsp: 4.979 ± 1.333
AlaGlu: 4.979 ± 1.21
AlaPhe: 2.49 ± 0.792
AlaGly: 2.075 ± 1.126
AlaHis: 0.415 ± 0.359
AlaIle: 3.32 ± 0.747
AlaLys: 4.564 ± 0.712
AlaLeu: 6.224 ± 2.074
AlaMet: 0.83 ± 0.43
AlaAsn: 2.905 ± 0.632
AlaPro: 2.49 ± 0.53
AlaGln: 1.245 ± 0.619
AlaArg: 2.905 ± 1.013
AlaSer: 4.149 ± 1.322
AlaThr: 1.66 ± 0.701
AlaVal: 3.734 ± 0.638
AlaTrp: 0.415 ± 0.359
AlaTyr: 1.245 ± 0.753
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.83 ± 0.517
CysCys: 1.245 ± 0.991
CysAsp: 0.83 ± 0.564
CysGlu: 0.83 ± 0.752
CysPhe: 2.49 ± 1.501
CysGly: 0.0 ± 0.0
CysHis: 0.415 ± 0.389
CysIle: 1.245 ± 0.991
CysLys: 1.245 ± 1.078
CysLeu: 1.66 ± 1.46
CysMet: 0.0 ± 0.0
CysAsn: 0.83 ± 0.752
CysPro: 1.66 ± 1.039
CysGln: 0.83 ± 0.487
CysArg: 1.66 ± 0.583
CysSer: 2.49 ± 1.411
CysThr: 2.075 ± 0.97
CysVal: 0.83 ± 0.517
CysTrp: 0.415 ± 0.376
CysTyr: 1.66 ± 1.033
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.32 ± 0.779
AspCys: 1.245 ± 0.753
AspAsp: 4.979 ± 2.106
AspGlu: 3.734 ± 1.05
AspPhe: 2.075 ± 0.771
AspGly: 2.905 ± 0.801
AspHis: 0.83 ± 0.707
AspIle: 3.32 ± 1.959
AspLys: 2.49 ± 1.338
AspLeu: 9.129 ± 1.9
AspMet: 0.415 ± 0.359
AspAsn: 4.979 ± 1.202
AspPro: 5.809 ± 1.756
AspGln: 1.66 ± 1.059
AspArg: 1.66 ± 1.039
AspSer: 4.979 ± 1.087
AspThr: 4.149 ± 1.03
AspVal: 4.564 ± 1.019
AspTrp: 1.245 ± 1.128
AspTyr: 3.32 ± 0.58
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.734 ± 0.942
GluCys: 0.83 ± 0.564
GluAsp: 6.224 ± 1.618
GluGlu: 7.469 ± 2.887
GluPhe: 3.32 ± 1.541
GluGly: 2.075 ± 1.2
GluHis: 2.905 ± 0.846
GluIle: 5.809 ± 1.911
GluLys: 2.075 ± 0.987
GluLeu: 5.394 ± 1.211
GluMet: 1.245 ± 0.643
GluAsn: 3.734 ± 1.323
GluPro: 2.905 ± 0.693
GluGln: 2.905 ± 0.772
GluArg: 2.905 ± 0.755
GluSer: 5.394 ± 1.014
GluThr: 2.905 ± 1.622
GluVal: 2.905 ± 0.936
GluTrp: 0.415 ± 0.374
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.905 ± 0.864
PheCys: 0.83 ± 0.564
PheAsp: 4.149 ± 0.997
PheGlu: 3.734 ± 2.133
PhePhe: 2.49 ± 1.162
PheGly: 2.905 ± 1.259
PheHis: 0.83 ± 0.575
PheIle: 2.075 ± 0.431
PheLys: 3.32 ± 1.392
PheLeu: 3.734 ± 1.038
PheMet: 1.245 ± 0.751
PheAsn: 1.245 ± 0.699
PhePro: 1.66 ± 0.734
PheGln: 3.734 ± 1.73
PheArg: 3.32 ± 0.509
PheSer: 0.415 ± 0.359
PheThr: 2.49 ± 1.374
PheVal: 2.075 ± 0.458
PheTrp: 0.83 ± 0.458
PheTyr: 1.66 ± 0.582
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.49 ± 0.902
GlyCys: 1.245 ± 0.445
GlyAsp: 2.075 ± 1.103
GlyGlu: 2.905 ± 1.375
GlyPhe: 0.83 ± 0.752
GlyGly: 4.149 ± 2.104
GlyHis: 0.83 ± 0.718
GlyIle: 2.49 ± 0.981
GlyLys: 3.32 ± 1.104
GlyLeu: 6.639 ± 2.358
GlyMet: 0.415 ± 0.407
GlyAsn: 4.564 ± 1.107
GlyPro: 0.83 ± 0.43
GlyGln: 2.075 ± 0.825
GlyArg: 2.905 ± 1.234
GlySer: 4.979 ± 1.307
GlyThr: 5.394 ± 3.532
GlyVal: 1.66 ± 1.629
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.49 ± 1.34
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.245 ± 0.792
HisCys: 0.415 ± 0.376
HisAsp: 1.245 ± 0.63
HisGlu: 1.245 ± 0.445
HisPhe: 1.66 ± 0.804
HisGly: 0.83 ± 0.517
HisHis: 0.0 ± 0.0
HisIle: 0.415 ± 0.407
HisLys: 1.245 ± 0.753
HisLeu: 0.83 ± 0.487
HisMet: 0.0 ± 0.0
HisAsn: 1.245 ± 0.443
HisPro: 1.66 ± 0.921
HisGln: 0.83 ± 0.458
HisArg: 1.66 ± 1.188
HisSer: 2.075 ± 0.839
HisThr: 0.0 ± 0.0
HisVal: 1.245 ± 0.698
HisTrp: 1.245 ± 0.443
HisTyr: 1.66 ± 0.549
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.734 ± 1.102
IleCys: 0.83 ± 0.987
IleAsp: 3.734 ± 1.218
IleGlu: 4.149 ± 1.017
IlePhe: 2.49 ± 1.011
IleGly: 2.905 ± 1.179
IleHis: 0.83 ± 0.385
IleIle: 2.905 ± 0.69
IleLys: 2.075 ± 0.908
IleLeu: 4.149 ± 1.008
IleMet: 0.83 ± 0.385
IleAsn: 3.32 ± 0.999
IlePro: 2.905 ± 1.483
IleGln: 0.83 ± 0.517
IleArg: 1.66 ± 0.984
IleSer: 3.734 ± 0.578
IleThr: 2.075 ± 1.051
IleVal: 4.149 ± 1.768
IleTrp: 0.0 ± 0.0
IleTyr: 2.075 ± 0.825
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.66 ± 0.681
LysCys: 2.075 ± 0.985
LysAsp: 2.905 ± 1.052
LysGlu: 3.734 ± 1.451
LysPhe: 1.66 ± 0.984
LysGly: 2.49 ± 0.461
LysHis: 1.66 ± 1.148
LysIle: 1.245 ± 0.643
LysLys: 2.49 ± 0.777
LysLeu: 5.394 ± 2.176
LysMet: 0.415 ± 0.429
LysAsn: 2.905 ± 0.849
LysPro: 3.734 ± 1.049
LysGln: 2.905 ± 0.813
LysArg: 5.394 ± 0.97
LysSer: 4.149 ± 1.539
LysThr: 3.32 ± 1.438
LysVal: 4.979 ± 0.853
LysTrp: 0.415 ± 0.359
LysTyr: 1.245 ± 0.643
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.979 ± 1.146
LeuCys: 1.245 ± 0.54
LeuAsp: 6.224 ± 1.019
LeuGlu: 6.224 ± 1.652
LeuPhe: 6.224 ± 2.264
LeuGly: 4.979 ± 2.304
LeuHis: 2.49 ± 0.731
LeuIle: 3.734 ± 0.915
LeuLys: 4.564 ± 1.013
LeuLeu: 10.788 ± 2.465
LeuMet: 2.075 ± 0.435
LeuAsn: 4.149 ± 1.19
LeuPro: 5.394 ± 1.057
LeuGln: 5.809 ± 2.13
LeuArg: 3.32 ± 0.982
LeuSer: 9.129 ± 1.789
LeuThr: 6.639 ± 2.236
LeuVal: 5.809 ± 1.629
LeuTrp: 0.415 ± 0.376
LeuTyr: 6.639 ± 0.82
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.245 ± 0.732
MetCys: 0.415 ± 0.359
MetAsp: 0.83 ± 0.458
MetGlu: 0.83 ± 0.615
MetPhe: 0.0 ± 0.0
MetGly: 0.83 ± 0.458
MetHis: 0.415 ± 0.376
MetIle: 0.0 ± 0.0
MetLys: 0.83 ± 0.385
MetLeu: 0.83 ± 0.752
MetMet: 0.0 ± 0.0
MetAsn: 1.66 ± 0.571
MetPro: 0.83 ± 0.385
MetGln: 0.415 ± 0.407
MetArg: 0.83 ± 0.517
MetSer: 1.66 ± 0.701
MetThr: 1.245 ± 0.443
MetVal: 2.075 ± 0.908
MetTrp: 0.0 ± 0.0
MetTyr: 0.415 ± 0.376
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.49 ± 1.382
AsnCys: 0.83 ± 0.461
AsnAsp: 1.66 ± 0.922
AsnGlu: 3.734 ± 1.196
AsnPhe: 2.075 ± 0.99
AsnGly: 4.564 ± 1.679
AsnHis: 0.83 ± 0.461
AsnIle: 5.394 ± 0.909
AsnLys: 4.564 ± 1.198
AsnLeu: 2.905 ± 0.893
AsnMet: 0.415 ± 0.376
AsnAsn: 4.564 ± 1.742
AsnPro: 2.075 ± 1.152
AsnGln: 3.32 ± 1.283
AsnArg: 4.149 ± 1.204
AsnSer: 2.49 ± 1.044
AsnThr: 3.32 ± 1.468
AsnVal: 2.075 ± 0.763
AsnTrp: 1.245 ± 0.443
AsnTyr: 0.415 ± 0.407
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.394 ± 2.412
ProCys: 0.83 ± 0.575
ProAsp: 4.149 ± 1.325
ProGlu: 2.075 ± 0.742
ProPhe: 2.49 ± 0.651
ProGly: 2.905 ± 2.093
ProHis: 0.0 ± 0.0
ProIle: 1.66 ± 0.971
ProLys: 3.734 ± 1.064
ProLeu: 8.299 ± 2.045
ProMet: 0.415 ± 0.376
ProAsn: 2.905 ± 1.142
ProPro: 7.054 ± 2.634
ProGln: 2.49 ± 1.019
ProArg: 0.83 ± 0.385
ProSer: 4.149 ± 1.344
ProThr: 4.979 ± 2.037
ProVal: 2.905 ± 0.864
ProTrp: 0.0 ± 0.0
ProTyr: 2.49 ± 0.559
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.245 ± 0.386
GlnCys: 0.415 ± 0.493
GlnAsp: 2.49 ± 0.756
GlnGlu: 1.245 ± 0.63
GlnPhe: 2.49 ± 0.756
GlnGly: 2.075 ± 0.784
GlnHis: 1.66 ± 0.427
GlnIle: 3.32 ± 1.066
GlnLys: 2.075 ± 1.288
GlnLeu: 7.054 ± 1.786
GlnMet: 0.83 ± 0.458
GlnAsn: 1.245 ± 0.699
GlnPro: 1.66 ± 0.876
GlnGln: 1.66 ± 0.981
GlnArg: 2.905 ± 1.469
GlnSer: 1.66 ± 0.692
GlnThr: 2.905 ± 0.805
GlnVal: 1.66 ± 0.981
GlnTrp: 0.83 ± 0.458
GlnTyr: 0.83 ± 0.43
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.734 ± 0.657
ArgCys: 1.66 ± 0.828
ArgAsp: 4.149 ± 1.497
ArgGlu: 2.075 ± 0.482
ArgPhe: 1.245 ± 0.719
ArgGly: 2.905 ± 1.238
ArgHis: 1.245 ± 0.823
ArgIle: 0.83 ± 0.815
ArgLys: 4.564 ± 0.847
ArgLeu: 5.809 ± 1.786
ArgMet: 0.415 ± 0.359
ArgAsn: 2.49 ± 1.41
ArgPro: 3.32 ± 1.171
ArgGln: 1.66 ± 1.132
ArgArg: 7.884 ± 3.179
ArgSer: 2.905 ± 0.541
ArgThr: 2.075 ± 0.458
ArgVal: 3.32 ± 1.548
ArgTrp: 0.415 ± 0.374
ArgTyr: 1.245 ± 0.628
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.32 ± 1.223
SerCys: 1.66 ± 0.823
SerAsp: 4.979 ± 0.947
SerGlu: 7.054 ± 1.385
SerPhe: 2.905 ± 1.002
SerGly: 2.075 ± 0.458
SerHis: 1.66 ± 1.193
SerIle: 3.32 ± 1.272
SerLys: 4.564 ± 0.734
SerLeu: 7.054 ± 2.519
SerMet: 1.245 ± 0.525
SerAsn: 4.564 ± 2.536
SerPro: 4.979 ± 1.264
SerGln: 2.075 ± 0.83
SerArg: 3.32 ± 1.925
SerSer: 8.714 ± 2.638
SerThr: 7.054 ± 1.662
SerVal: 2.49 ± 0.974
SerTrp: 0.0 ± 0.0
SerTyr: 2.075 ± 0.924
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.564 ± 1.462
ThrCys: 1.245 ± 0.823
ThrAsp: 4.149 ± 1.963
ThrGlu: 4.564 ± 1.435
ThrPhe: 1.66 ± 0.702
ThrGly: 5.809 ± 2.041
ThrHis: 1.66 ± 0.624
ThrIle: 4.564 ± 1.215
ThrLys: 2.49 ± 0.626
ThrLeu: 4.979 ± 1.39
ThrMet: 2.49 ± 1.264
ThrAsn: 2.905 ± 0.813
ThrPro: 3.32 ± 1.874
ThrGln: 0.415 ± 0.407
ThrArg: 2.905 ± 0.877
ThrSer: 5.394 ± 0.824
ThrThr: 3.734 ± 1.362
ThrVal: 4.979 ± 1.72
ThrTrp: 0.415 ± 0.407
ThrTyr: 1.66 ± 0.647
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.075 ± 1.052
ValCys: 1.66 ± 0.793
ValAsp: 5.394 ± 1.672
ValGlu: 2.905 ± 1.002
ValPhe: 2.075 ± 0.909
ValGly: 4.149 ± 2.018
ValHis: 1.245 ± 0.535
ValIle: 1.245 ± 0.386
ValLys: 2.905 ± 1.142
ValLeu: 5.809 ± 0.805
ValMet: 0.83 ± 0.523
ValAsn: 1.245 ± 0.75
ValPro: 5.809 ± 1.553
ValGln: 3.32 ± 1.754
ValArg: 1.66 ± 0.738
ValSer: 2.905 ± 0.486
ValThr: 3.734 ± 1.203
ValVal: 2.905 ± 1.279
ValTrp: 2.49 ± 1.207
ValTyr: 2.49 ± 0.559
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.83 ± 0.385
TrpCys: 0.415 ± 0.359
TrpAsp: 0.415 ± 0.359
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.83 ± 0.461
TrpGly: 0.415 ± 0.359
TrpHis: 0.415 ± 0.374
TrpIle: 0.83 ± 0.752
TrpLys: 0.415 ± 0.376
TrpLeu: 1.245 ± 0.757
TrpMet: 0.0 ± 0.0
TrpAsn: 0.415 ± 0.374
TrpPro: 0.415 ± 0.359
TrpGln: 0.415 ± 0.359
TrpArg: 0.83 ± 0.575
TrpSer: 0.0 ± 0.0
TrpThr: 1.66 ± 0.681
TrpVal: 1.245 ± 0.443
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.415 ± 0.374
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.49 ± 0.53
TyrCys: 2.49 ± 1.334
TyrAsp: 1.245 ± 0.443
TyrGlu: 1.66 ± 1.236
TyrPhe: 4.149 ± 0.923
TyrGly: 1.245 ± 0.386
TyrHis: 0.415 ± 0.376
TyrIle: 1.66 ± 1.039
TyrLys: 1.66 ± 0.701
TyrLeu: 2.905 ± 0.708
TyrMet: 0.83 ± 0.504
TyrAsn: 1.245 ± 0.732
TyrPro: 1.245 ± 0.805
TyrGln: 1.66 ± 0.662
TyrArg: 1.245 ± 0.722
TyrSer: 3.734 ± 1.8
TyrThr: 2.49 ± 0.946
TyrVal: 1.245 ± 0.643
TyrTrp: 0.415 ± 0.359
TyrTyr: 2.075 ± 1.204
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2411 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
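
A protein-level bootstrap of this kind might look like the sketch below. It is an illustration under stated assumptions rather than the database's actual procedure: the function names are hypothetical, proteins are resampled with replacement as whole units, and the reported error is taken to be the standard deviation of the resampled estimates, which the page does not specify.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not the real HPV proteome.
toy_proteome = ["ALLA", "LLK", "GGGLL", "MKVLL"]
print(dipeptide_per_mille(toy_proteome, "LL"))
print(bootstrap_error(toy_proteome, "LL"))
```

Resampling whole proteins instead of individual residues keeps each protein's internal composition intact, so with only 7 proteins a single unusual protein can shift an estimate noticeably, which is consistent with the relatively large errors in the table.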

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski