Amino acid dipeptide frequency for Rusa timorensis papillomavirus type 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
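As a rough illustration of what such a calculation might look like, the sketch below counts overlapping residue pairs in a list of one-letter protein sequences and reports them in per mille. The function name `dipeptide_permille`, the pooling of non-standard residues into `X`, and the choice to count pairs within each protein and normalize by the total number of pairs are assumptions made for this example; the exact counting convention used by the database is not stated on this page.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; anything else is pooled as 'X' (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Hypothetical helper: pairs are counted within each protein and
    normalized by the total number of pairs (an assumption).
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to 'X'.
        clean = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(clean, clean[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 ordered pairs, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }
```

For example, `dipeptide_permille(["AAC"])` sees the two pairs `AA` and `AC`, so each gets 500 ‰ and every other pair gets 0.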

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
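The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and the simple greater-than/less-than rule is only an assumption about how the marking might be decided; the page does not state the exact threshold it uses.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def to_fraction(permille):
    """Convert a per mille value to a plain fraction."""
    return permille * 1e-3


def classify(freq_permille, expected=None):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if expected is None:
        expected = expected_permille()
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"
```

With 20 residue types the expectation is 2.5 ‰ (0.25%); with Xaa included as a 21st type it drops to roughly 2.27 ‰. Under this rule the SerSer value from the table (10.263 ‰) would be labeled over-represented and the CysPhe value (0.411 ‰) under-represented.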

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.337 ± 2.549
AlaCys: 1.642 ± 0.892
AlaAsp: 2.463 ± 0.984
AlaGlu: 6.979 ± 2.189
AlaPhe: 2.463 ± 0.47
AlaGly: 3.284 ± 1.04
AlaHis: 1.642 ± 0.193
AlaIle: 0.821 ± 0.412
AlaLys: 3.695 ± 1.181
AlaLeu: 2.874 ± 1.633
AlaMet: 1.642 ± 0.193
AlaAsn: 0.821 ± 0.371
AlaPro: 5.337 ± 1.553
AlaGln: 1.642 ± 0.884
AlaArg: 4.516 ± 1.388
AlaSer: 4.105 ± 1.668
AlaThr: 4.516 ± 1.063
AlaVal: 3.695 ± 1.211
AlaTrp: 1.232 ± 0.565
AlaTyr: 1.642 ± 0.776
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.232 ± 0.818
CysCys: 1.642 ± 1.587
CysAsp: 1.232 ± 0.698
CysGlu: 1.232 ± 0.664
CysPhe: 0.411 ± 0.371
CysGly: 1.642 ± 1.666
CysHis: 0.411 ± 0.534
CysIle: 1.642 ± 1.039
CysLys: 1.232 ± 0.698
CysLeu: 1.232 ± 0.742
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 2.053 ± 0.758
CysGln: 0.821 ± 0.614
CysArg: 1.642 ± 1.584
CysSer: 2.053 ± 1.085
CysThr: 0.821 ± 0.571
CysVal: 1.642 ± 1.201
CysTrp: 0.411 ± 0.529
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.874 ± 0.435
AspCys: 1.642 ± 0.643
AspAsp: 4.926 ± 1.366
AspGlu: 4.516 ± 1.542
AspPhe: 4.105 ± 1.337
AspGly: 4.926 ± 1.556
AspHis: 0.0 ± 0.0
AspIle: 7.389 ± 1.25
AspLys: 2.053 ± 1.015
AspLeu: 5.337 ± 2.151
AspMet: 0.821 ± 0.442
AspAsn: 2.463 ± 0.512
AspPro: 4.516 ± 1.541
AspGln: 1.642 ± 0.667
AspArg: 2.463 ± 1.324
AspSer: 3.695 ± 0.813
AspThr: 5.747 ± 1.192
AspVal: 2.874 ± 0.693
AspTrp: 1.642 ± 0.776
AspTyr: 1.232 ± 0.714
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.926 ± 2.036
GluCys: 0.821 ± 0.762
GluAsp: 6.568 ± 0.86
GluGlu: 7.389 ± 1.053
GluPhe: 2.874 ± 1.056
GluGly: 4.105 ± 1.36
GluHis: 2.053 ± 0.644
GluIle: 0.821 ± 0.442
GluLys: 2.053 ± 0.644
GluLeu: 4.516 ± 1.516
GluMet: 0.821 ± 0.673
GluAsn: 2.053 ± 0.723
GluPro: 2.053 ± 0.393
GluGln: 2.053 ± 0.762
GluArg: 4.105 ± 1.398
GluSer: 2.874 ± 0.578
GluThr: 3.695 ± 1.001
GluVal: 3.695 ± 0.922
GluTrp: 0.0 ± 0.0
GluTyr: 1.232 ± 0.676
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.284 ± 0.709
PheCys: 1.642 ± 2.115
PheAsp: 3.695 ± 0.56
PheGlu: 2.053 ± 0.676
PhePhe: 2.053 ± 0.846
PheGly: 2.874 ± 0.753
PheHis: 0.411 ± 0.381
PheIle: 2.463 ± 0.736
PheLys: 2.463 ± 0.849
PheLeu: 5.337 ± 1.029
PheMet: 2.053 ± 0.846
PheAsn: 0.411 ± 0.371
PhePro: 3.284 ± 0.751
PheGln: 2.874 ± 1.167
PheArg: 2.053 ± 0.405
PheSer: 2.463 ± 0.714
PheThr: 1.642 ± 0.559
PheVal: 2.874 ± 1.16
PheTrp: 1.642 ± 1.035
PheTyr: 0.821 ± 0.692
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.337 ± 1.094
GlyCys: 1.642 ± 1.142
GlyAsp: 6.568 ± 1.429
GlyGlu: 3.695 ± 0.684
GlyPhe: 3.284 ± 1.663
GlyGly: 5.747 ± 2.627
GlyHis: 1.642 ± 0.527
GlyIle: 2.874 ± 0.753
GlyLys: 2.874 ± 1.16
GlyLeu: 4.516 ± 1.44
GlyMet: 0.411 ± 0.346
GlyAsn: 3.284 ± 0.974
GlyPro: 3.284 ± 1.004
GlyGln: 2.874 ± 0.911
GlyArg: 7.8 ± 2.113
GlySer: 7.389 ± 2.926
GlyThr: 4.516 ± 1.895
GlyVal: 4.105 ± 1.753
GlyTrp: 0.821 ± 0.64
GlyTyr: 3.284 ± 1.073
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.232 ± 0.714
HisCys: 0.0 ± 0.0
HisAsp: 0.411 ± 0.371
HisGlu: 2.053 ± 0.71
HisPhe: 1.232 ± 0.565
HisGly: 0.821 ± 0.371
HisHis: 0.0 ± 0.0
HisIle: 0.821 ± 0.413
HisLys: 0.821 ± 0.762
HisLeu: 1.642 ± 0.836
HisMet: 0.0 ± 0.0
HisAsn: 2.463 ± 0.942
HisPro: 0.821 ± 0.442
HisGln: 1.232 ± 0.875
HisArg: 0.821 ± 0.692
HisSer: 2.874 ± 1.135
HisThr: 1.642 ± 0.193
HisVal: 0.821 ± 0.441
HisTrp: 0.411 ± 0.371
HisTyr: 0.821 ± 0.413
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.874 ± 0.572
IleCys: 0.821 ± 0.673
IleAsp: 4.105 ± 1.932
IleGlu: 2.463 ± 0.572
IlePhe: 1.642 ± 0.602
IleGly: 4.105 ± 1.871
IleHis: 0.411 ± 0.319
IleIle: 2.874 ± 0.849
IleLys: 1.232 ± 0.399
IleLeu: 2.053 ± 1.065
IleMet: 0.411 ± 0.529
IleAsn: 1.232 ± 0.728
IlePro: 2.874 ± 1.43
IleGln: 1.232 ± 0.664
IleArg: 2.053 ± 1.041
IleSer: 3.284 ± 0.685
IleThr: 2.053 ± 0.723
IleVal: 5.747 ± 1.192
IleTrp: 0.411 ± 0.529
IleTyr: 2.463 ± 0.684
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.463 ± 0.659
LysCys: 0.0 ± 0.0
LysAsp: 3.284 ± 1.633
LysGlu: 2.463 ± 0.659
LysPhe: 2.874 ± 1.212
LysGly: 3.695 ± 0.765
LysHis: 1.232 ± 0.875
LysIle: 1.232 ± 0.955
LysLys: 3.695 ± 1.445
LysLeu: 4.105 ± 0.654
LysMet: 0.821 ± 0.423
LysAsn: 0.821 ± 0.413
LysPro: 0.411 ± 0.381
LysGln: 2.463 ± 1.341
LysArg: 5.337 ± 0.922
LysSer: 3.695 ± 1.397
LysThr: 3.284 ± 0.827
LysVal: 3.695 ± 1.507
LysTrp: 0.821 ± 0.413
LysTyr: 1.232 ± 0.399
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.747 ± 1.363
LeuCys: 1.232 ± 1.089
LeuAsp: 6.158 ± 1.39
LeuGlu: 5.747 ± 1.238
LeuPhe: 2.874 ± 1.33
LeuGly: 6.158 ± 1.293
LeuHis: 4.105 ± 0.651
LeuIle: 3.695 ± 2.054
LeuLys: 4.105 ± 0.654
LeuLeu: 5.337 ± 2.061
LeuMet: 1.642 ± 0.592
LeuAsn: 2.463 ± 0.847
LeuPro: 4.516 ± 1.323
LeuGln: 5.337 ± 1.514
LeuArg: 4.105 ± 0.785
LeuSer: 4.926 ± 1.426
LeuThr: 2.874 ± 1.633
LeuVal: 3.284 ± 1.109
LeuTrp: 0.821 ± 0.412
LeuTyr: 4.105 ± 1.465
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.232 ± 0.668
MetCys: 1.642 ± 0.839
MetAsp: 0.411 ± 0.381
MetGlu: 1.232 ± 0.676
MetPhe: 0.411 ± 0.319
MetGly: 0.411 ± 0.371
MetHis: 0.411 ± 0.529
MetIle: 0.821 ± 0.673
MetLys: 0.411 ± 0.319
MetLeu: 1.232 ± 0.882
MetMet: 0.0 ± 0.0
MetAsn: 0.411 ± 0.371
MetPro: 0.411 ± 0.381
MetGln: 0.0 ± 0.0
MetArg: 1.232 ± 0.714
MetSer: 0.821 ± 0.614
MetThr: 1.642 ± 0.509
MetVal: 0.821 ± 0.413
MetTrp: 0.0 ± 0.0
MetTyr: 0.411 ± 0.319
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.284 ± 1.58
AsnCys: 1.232 ± 0.647
AsnAsp: 1.232 ± 0.616
AsnGlu: 2.053 ± 0.644
AsnPhe: 2.463 ± 0.868
AsnGly: 2.463 ± 0.761
AsnHis: 0.821 ± 0.762
AsnIle: 1.642 ± 0.961
AsnLys: 2.463 ± 0.847
AsnLeu: 0.411 ± 0.381
AsnMet: 0.411 ± 0.319
AsnAsn: 1.642 ± 0.884
AsnPro: 3.695 ± 1.27
AsnGln: 1.642 ± 0.596
AsnArg: 2.463 ± 1.053
AsnSer: 1.232 ± 1.143
AsnThr: 1.642 ± 0.587
AsnVal: 2.463 ± 0.735
AsnTrp: 0.411 ± 0.381
AsnTyr: 2.053 ± 1.11
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.695 ± 0.806
ProCys: 0.821 ± 0.742
ProAsp: 4.105 ± 1.444
ProGlu: 2.053 ± 0.389
ProPhe: 2.053 ± 0.898
ProGly: 5.747 ± 2.113
ProHis: 1.232 ± 0.616
ProIle: 1.642 ± 0.712
ProLys: 3.284 ± 1.133
ProLeu: 4.516 ± 0.715
ProMet: 0.0 ± 0.0
ProAsn: 3.284 ± 0.735
ProPro: 6.568 ± 1.6
ProGln: 0.821 ± 0.412
ProArg: 4.105 ± 1.583
ProSer: 5.337 ± 1.637
ProThr: 6.158 ± 1.366
ProVal: 5.337 ± 1.186
ProTrp: 0.411 ± 0.346
ProTyr: 2.053 ± 1.139
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.411 ± 0.381
GlnCys: 1.232 ± 0.818
GlnAsp: 2.053 ± 1.13
GlnGlu: 2.053 ± 1.041
GlnPhe: 1.642 ± 0.562
GlnGly: 3.695 ± 0.954
GlnHis: 1.232 ± 0.714
GlnIle: 4.105 ± 0.397
GlnLys: 0.0 ± 0.0
GlnLeu: 5.747 ± 1.596
GlnMet: 0.411 ± 0.381
GlnAsn: 2.463 ± 1.037
GlnPro: 1.642 ± 0.587
GlnGln: 2.053 ± 0.707
GlnArg: 0.821 ± 0.614
GlnSer: 2.463 ± 0.851
GlnThr: 1.642 ± 0.596
GlnVal: 1.642 ± 0.651
GlnTrp: 1.232 ± 0.608
GlnTyr: 1.642 ± 1.082
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.516 ± 1.525
ArgCys: 2.874 ± 1.435
ArgAsp: 2.874 ± 1.029
ArgGlu: 1.642 ± 0.651
ArgPhe: 2.053 ± 0.76
ArgGly: 6.568 ± 2.656
ArgHis: 2.053 ± 1.119
ArgIle: 0.821 ± 0.614
ArgLys: 4.926 ± 1.698
ArgLeu: 6.979 ± 1.471
ArgMet: 0.821 ± 0.606
ArgAsn: 2.874 ± 1.059
ArgPro: 6.979 ± 1.434
ArgGln: 2.874 ± 1.041
ArgArg: 5.747 ± 2.01
ArgSer: 9.442 ± 6.01
ArgThr: 3.695 ± 1.556
ArgVal: 3.695 ± 0.729
ArgTrp: 0.411 ± 0.346
ArgTyr: 2.463 ± 0.763
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.284 ± 0.974
SerCys: 0.411 ± 0.529
SerAsp: 4.926 ± 0.673
SerGlu: 2.874 ± 0.909
SerPhe: 3.284 ± 0.721
SerGly: 6.979 ± 2.64
SerHis: 0.821 ± 0.413
SerIle: 6.158 ± 1.454
SerLys: 2.463 ± 0.849
SerLeu: 6.979 ± 1.888
SerMet: 1.232 ± 0.736
SerAsn: 2.874 ± 1.056
SerPro: 6.158 ± 2.458
SerGln: 2.874 ± 1.378
SerArg: 9.031 ± 4.042
SerSer: 10.263 ± 3.026
SerThr: 6.568 ± 1.524
SerVal: 2.053 ± 0.389
SerTrp: 1.232 ± 0.608
SerTyr: 2.463 ± 1.446
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.053 ± 1.242
ThrCys: 1.232 ± 0.714
ThrAsp: 2.874 ± 0.397
ThrGlu: 2.053 ± 1.119
ThrPhe: 3.695 ± 1.447
ThrGly: 5.747 ± 0.963
ThrHis: 1.232 ± 0.605
ThrIle: 1.642 ± 0.759
ThrLys: 4.105 ± 1.82
ThrLeu: 5.747 ± 2.238
ThrMet: 0.821 ± 0.423
ThrAsn: 2.463 ± 1.021
ThrPro: 3.695 ± 1.16
ThrGln: 1.232 ± 0.757
ThrArg: 6.158 ± 1.359
ThrSer: 6.568 ± 1.135
ThrThr: 2.874 ± 0.397
ThrVal: 4.105 ± 1.214
ThrTrp: 0.821 ± 0.371
ThrTyr: 2.463 ± 0.888
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.105 ± 0.678
ValCys: 0.821 ± 0.614
ValAsp: 4.926 ± 1.355
ValGlu: 3.695 ± 0.809
ValPhe: 2.874 ± 1.02
ValGly: 3.284 ± 1.051
ValHis: 0.411 ± 0.319
ValIle: 2.053 ± 0.676
ValLys: 2.463 ± 1.426
ValLeu: 5.747 ± 1.862
ValMet: 0.411 ± 0.381
ValAsn: 0.411 ± 0.381
ValPro: 3.284 ± 0.721
ValGln: 3.695 ± 1.019
ValArg: 5.337 ± 1.679
ValSer: 6.568 ± 2.479
ValThr: 4.105 ± 1.652
ValVal: 2.053 ± 0.82
ValTrp: 0.411 ± 0.371
ValTyr: 2.463 ± 0.606
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.411 ± 0.381
TrpCys: 0.0 ± 0.0
TrpAsp: 0.411 ± 0.346
TrpGlu: 0.821 ± 0.441
TrpPhe: 1.232 ± 0.65
TrpGly: 0.821 ± 0.742
TrpHis: 0.0 ± 0.0
TrpIle: 0.411 ± 0.381
TrpLys: 0.821 ± 0.614
TrpLeu: 2.053 ± 1.11
TrpMet: 0.0 ± 0.0
TrpAsn: 1.232 ± 0.698
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 2.874 ± 1.629
TrpSer: 0.411 ± 0.346
TrpThr: 1.232 ± 0.701
TrpVal: 1.642 ± 0.562
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.411 ± 0.381
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.053 ± 0.628
TyrCys: 0.0 ± 0.0
TyrAsp: 1.642 ± 0.587
TyrGlu: 1.642 ± 0.791
TyrPhe: 2.874 ± 0.835
TyrGly: 2.874 ± 0.667
TyrHis: 0.821 ± 0.442
TyrIle: 0.411 ± 0.381
TyrLys: 2.463 ± 0.814
TyrLeu: 2.874 ± 1.174
TyrMet: 0.821 ± 0.705
TyrAsn: 2.053 ± 0.644
TyrPro: 1.642 ± 0.602
TyrGln: 0.821 ± 0.413
TyrArg: 1.642 ± 0.712
TyrSer: 2.874 ± 1.159
TyrThr: 1.232 ± 0.676
TyrVal: 2.874 ± 1.404
TyrTrp: 1.642 ± 0.527
TyrTyr: 3.284 ± 1.058
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2437 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
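One way such a protein-level bootstrap could be carried out is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the replicate values is reported. The function names, the use of the sample standard deviation as the error, and the fixed seed are all assumptions for illustration; the page does not say which spread measure or counting convention the database uses.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples.

    Assumed definition of the error; requires n_replicates >= 2.
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(sample, pair))
    return statistics.stdev(replicates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only a handful of proteins the estimate can vary considerably between runs that use different seeds.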

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski