Amino acid dipepetide frequency for Bos grunniens papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.831AlaAla: 5.831 ± 0.909
2.499AlaCys: 2.499 ± 1.169
2.499AlaAsp: 2.499 ± 0.8
5.831AlaGlu: 5.831 ± 1.11
2.499AlaPhe: 2.499 ± 0.841
6.247AlaGly: 6.247 ± 1.359
0.416AlaHis: 0.416 ± 0.35
2.082AlaIle: 2.082 ± 0.759
4.581AlaLys: 4.581 ± 1.185
7.497AlaLeu: 7.497 ± 2.185
0.833AlaMet: 0.833 ± 0.477
3.332AlaAsn: 3.332 ± 1.032
4.165AlaPro: 4.165 ± 1.583
4.165AlaGln: 4.165 ± 1.311
5.831AlaArg: 5.831 ± 1.592
4.998AlaSer: 4.998 ± 0.818
2.915AlaThr: 2.915 ± 1.557
4.581AlaVal: 4.581 ± 1.234
0.416AlaTrp: 0.416 ± 0.38
2.499AlaTyr: 2.499 ± 1.208
0.0AlaXaa: 0.0 ± 0.0
Cys
1.249CysAla: 1.249 ± 0.8
1.249CysCys: 1.249 ± 1.246
0.0CysAsp: 0.0 ± 0.0
1.249CysGlu: 1.249 ± 0.589
0.833CysPhe: 0.833 ± 0.7
1.666CysGly: 1.666 ± 1.133
0.416CysHis: 0.416 ± 0.515
0.416CysIle: 0.416 ± 0.593
1.249CysLys: 1.249 ± 0.644
3.332CysLeu: 3.332 ± 2.234
0.0CysMet: 0.0 ± 0.0
0.416CysAsn: 0.416 ± 0.341
1.666CysPro: 1.666 ± 0.917
0.416CysGln: 0.416 ± 0.35
1.666CysArg: 1.666 ± 1.443
3.748CysSer: 3.748 ± 1.295
2.499CysThr: 2.499 ± 0.904
0.833CysVal: 0.833 ± 0.645
0.416CysTrp: 0.416 ± 0.341
1.249CysTyr: 1.249 ± 1.2
0.0CysXaa: 0.0 ± 0.0
Asp
3.748AspAla: 3.748 ± 0.785
1.249AspCys: 1.249 ± 0.759
3.332AspAsp: 3.332 ± 0.8
3.332AspGlu: 3.332 ± 1.108
3.748AspPhe: 3.748 ± 0.791
4.998AspGly: 4.998 ± 1.324
2.082AspHis: 2.082 ± 0.737
2.499AspIle: 2.499 ± 1.236
1.666AspLys: 1.666 ± 0.518
5.831AspLeu: 5.831 ± 1.733
0.833AspMet: 0.833 ± 0.36
2.499AspAsn: 2.499 ± 0.918
2.499AspPro: 2.499 ± 1.115
0.833AspGln: 0.833 ± 0.484
3.748AspArg: 3.748 ± 1.099
4.581AspSer: 4.581 ± 1.301
3.748AspThr: 3.748 ± 1.497
0.833AspVal: 0.833 ± 0.403
0.416AspTrp: 0.416 ± 0.341
0.416AspTyr: 0.416 ± 0.38
0.0AspXaa: 0.0 ± 0.0
Glu
3.332GluAla: 3.332 ± 1.041
1.249GluCys: 1.249 ± 0.785
4.998GluAsp: 4.998 ± 1.354
7.497GluGlu: 7.497 ± 2.623
1.666GluPhe: 1.666 ± 0.863
3.748GluGly: 3.748 ± 1.01
0.416GluHis: 0.416 ± 0.515
3.332GluIle: 3.332 ± 1.79
3.332GluLys: 3.332 ± 0.948
3.748GluLeu: 3.748 ± 0.795
0.416GluMet: 0.416 ± 0.35
3.332GluAsn: 3.332 ± 0.937
2.915GluPro: 2.915 ± 1.04
3.332GluGln: 3.332 ± 1.079
3.332GluArg: 3.332 ± 1.08
4.165GluSer: 4.165 ± 1.723
4.998GluThr: 4.998 ± 0.86
2.082GluVal: 2.082 ± 0.721
0.416GluTrp: 0.416 ± 0.341
1.666GluTyr: 1.666 ± 1.007
0.0GluXaa: 0.0 ± 0.0
Phe
2.082PheAla: 2.082 ± 0.777
0.416PheCys: 0.416 ± 0.593
1.249PheAsp: 1.249 ± 0.666
3.332PheGlu: 3.332 ± 1.159
2.499PhePhe: 2.499 ± 0.94
5.414PheGly: 5.414 ± 0.576
2.499PheHis: 2.499 ± 1.367
2.082PheIle: 2.082 ± 0.802
3.332PheLys: 3.332 ± 0.905
5.831PheLeu: 5.831 ± 1.696
0.833PheMet: 0.833 ± 0.541
2.499PheAsn: 2.499 ± 0.98
0.833PhePro: 0.833 ± 0.484
0.416PheGln: 0.416 ± 0.341
3.332PheArg: 3.332 ± 0.891
2.082PheSer: 2.082 ± 1.061
1.249PheThr: 1.249 ± 0.681
1.249PheVal: 1.249 ± 0.663
1.249PheTrp: 1.249 ± 0.583
0.833PheTyr: 0.833 ± 0.76
0.0PheXaa: 0.0 ± 0.0
Gly
4.998GlyAla: 4.998 ± 0.66
2.082GlyCys: 2.082 ± 0.709
4.165GlyAsp: 4.165 ± 1.473
3.332GlyGlu: 3.332 ± 1.015
1.666GlyPhe: 1.666 ± 0.87
7.08GlyGly: 7.08 ± 1.448
2.082GlyHis: 2.082 ± 0.924
3.332GlyIle: 3.332 ± 1.221
2.499GlyLys: 2.499 ± 1.423
6.664GlyLeu: 6.664 ± 1.377
0.833GlyMet: 0.833 ± 0.484
1.666GlyAsn: 1.666 ± 0.536
4.165GlyPro: 4.165 ± 1.971
2.499GlyGln: 2.499 ± 0.665
3.332GlyArg: 3.332 ± 0.793
10.829GlySer: 10.829 ± 1.459
5.831GlyThr: 5.831 ± 1.054
4.165GlyVal: 4.165 ± 0.716
0.833GlyTrp: 0.833 ± 0.7
0.833GlyTyr: 0.833 ± 0.697
0.0GlyXaa: 0.0 ± 0.0
His
1.666HisAla: 1.666 ± 0.598
0.416HisCys: 0.416 ± 0.35
0.833HisAsp: 0.833 ± 0.409
0.833HisGlu: 0.833 ± 0.403
2.499HisPhe: 2.499 ± 0.74
1.666HisGly: 1.666 ± 1.232
0.416HisHis: 0.416 ± 0.342
1.666HisIle: 1.666 ± 0.777
1.249HisLys: 1.249 ± 1.022
2.499HisLeu: 2.499 ± 0.871
0.416HisMet: 0.416 ± 0.38
0.416HisAsn: 0.416 ± 0.38
3.332HisPro: 3.332 ± 1.235
0.833HisGln: 0.833 ± 1.194
2.499HisArg: 2.499 ± 1.303
1.666HisSer: 1.666 ± 0.635
0.416HisThr: 0.416 ± 0.342
2.499HisVal: 2.499 ± 1.196
0.0HisTrp: 0.0 ± 0.0
0.833HisTyr: 0.833 ± 0.409
0.0HisXaa: 0.0 ± 0.0
Ile
3.748IleAla: 3.748 ± 1.442
0.416IleCys: 0.416 ± 0.38
3.332IleAsp: 3.332 ± 1.015
3.748IleGlu: 3.748 ± 1.194
1.666IlePhe: 1.666 ± 0.797
2.915IleGly: 2.915 ± 1.195
0.416IleHis: 0.416 ± 0.341
2.082IleIle: 2.082 ± 1.263
0.833IleLys: 0.833 ± 0.681
5.831IleLeu: 5.831 ± 1.109
0.0IleMet: 0.0 ± 0.0
1.666IleAsn: 1.666 ± 0.518
1.249IlePro: 1.249 ± 0.72
2.082IleGln: 2.082 ± 1.033
2.499IleArg: 2.499 ± 1.309
2.499IleSer: 2.499 ± 0.653
3.332IleThr: 3.332 ± 0.72
1.249IleVal: 1.249 ± 1.027
0.416IleTrp: 0.416 ± 0.38
0.833IleTyr: 0.833 ± 0.431
0.0IleXaa: 0.0 ± 0.0
Lys
2.499LysAla: 2.499 ± 1.333
0.833LysCys: 0.833 ± 0.484
0.833LysAsp: 0.833 ± 0.645
4.581LysGlu: 4.581 ± 2.312
2.082LysPhe: 2.082 ± 1.082
3.748LysGly: 3.748 ± 0.509
2.915LysHis: 2.915 ± 0.697
2.915LysIle: 2.915 ± 0.927
5.831LysLys: 5.831 ± 1.636
3.748LysLeu: 3.748 ± 1.22
0.416LysMet: 0.416 ± 0.601
3.332LysAsn: 3.332 ± 1.032
0.416LysPro: 0.416 ± 0.597
2.499LysGln: 2.499 ± 0.552
4.998LysArg: 4.998 ± 1.463
5.414LysSer: 5.414 ± 2.577
2.499LysThr: 2.499 ± 0.896
2.082LysVal: 2.082 ± 1.376
0.0LysTrp: 0.0 ± 0.0
1.249LysTyr: 1.249 ± 0.644
0.0LysXaa: 0.0 ± 0.0
Leu
7.08LeuAla: 7.08 ± 2.044
3.748LeuCys: 3.748 ± 1.585
7.08LeuAsp: 7.08 ± 1.863
4.998LeuGlu: 4.998 ± 1.326
4.998LeuPhe: 4.998 ± 1.815
5.831LeuGly: 5.831 ± 0.798
2.915LeuHis: 2.915 ± 0.986
3.332LeuIle: 3.332 ± 1.064
7.497LeuLys: 7.497 ± 1.259
12.495LeuLeu: 12.495 ± 3.589
1.249LeuMet: 1.249 ± 0.583
1.666LeuAsn: 1.666 ± 0.761
4.998LeuPro: 4.998 ± 1.193
4.581LeuGln: 4.581 ± 1.02
2.082LeuArg: 2.082 ± 0.403
6.664LeuSer: 6.664 ± 1.703
5.414LeuThr: 5.414 ± 1.627
3.748LeuVal: 3.748 ± 1.198
2.499LeuTrp: 2.499 ± 1.32
4.165LeuTyr: 4.165 ± 1.325
0.0LeuXaa: 0.0 ± 0.0
Met
2.082MetAla: 2.082 ± 0.648
0.0MetCys: 0.0 ± 0.0
0.416MetAsp: 0.416 ± 0.593
1.249MetGlu: 1.249 ± 0.797
0.416MetPhe: 0.416 ± 0.38
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.666MetLeu: 1.666 ± 1.2
0.416MetMet: 0.416 ± 0.38
0.833MetAsn: 0.833 ± 0.484
0.833MetPro: 0.833 ± 0.514
1.666MetGln: 1.666 ± 0.811
0.833MetArg: 0.833 ± 0.409
0.833MetSer: 0.833 ± 0.403
0.0MetThr: 0.0 ± 0.0
1.249MetVal: 1.249 ± 0.8
0.0MetTrp: 0.0 ± 0.0
0.416MetTyr: 0.416 ± 0.38
0.0MetXaa: 0.0 ± 0.0
Asn
4.165AsnAla: 4.165 ± 1.628
0.833AsnCys: 0.833 ± 0.409
0.416AsnAsp: 0.416 ± 0.341
3.332AsnGlu: 3.332 ± 1.415
0.416AsnPhe: 0.416 ± 0.38
1.666AsnGly: 1.666 ± 0.642
1.249AsnHis: 1.249 ± 0.666
2.915AsnIle: 2.915 ± 0.544
0.833AsnLys: 0.833 ± 0.76
2.915AsnLeu: 2.915 ± 0.945
0.833AsnMet: 0.833 ± 0.356
2.082AsnAsn: 2.082 ± 1.115
1.666AsnPro: 1.666 ± 0.833
2.082AsnGln: 2.082 ± 0.984
1.666AsnArg: 1.666 ± 0.734
2.082AsnSer: 2.082 ± 0.962
4.165AsnThr: 4.165 ± 0.603
1.249AsnVal: 1.249 ± 0.653
2.082AsnTrp: 2.082 ± 0.519
0.833AsnTyr: 0.833 ± 0.403
0.0AsnXaa: 0.0 ± 0.0
Pro
6.247ProAla: 6.247 ± 1.488
1.666ProCys: 1.666 ± 1.055
4.998ProAsp: 4.998 ± 1.957
2.082ProGlu: 2.082 ± 0.921
2.499ProPhe: 2.499 ± 1.366
1.666ProGly: 1.666 ± 0.775
0.833ProHis: 0.833 ± 0.588
0.833ProIle: 0.833 ± 0.684
2.082ProLys: 2.082 ± 0.403
6.664ProLeu: 6.664 ± 1.377
0.416ProMet: 0.416 ± 0.551
2.082ProAsn: 2.082 ± 0.605
6.247ProPro: 6.247 ± 2.654
1.249ProGln: 1.249 ± 0.834
3.748ProArg: 3.748 ± 1.908
4.165ProSer: 4.165 ± 1.977
5.414ProThr: 5.414 ± 2.013
4.165ProVal: 4.165 ± 1.252
0.416ProTrp: 0.416 ± 0.35
1.666ProTyr: 1.666 ± 1.151
0.0ProXaa: 0.0 ± 0.0
Gln
3.748GlnAla: 3.748 ± 1.641
0.416GlnCys: 0.416 ± 0.341
1.666GlnAsp: 1.666 ± 1.003
2.499GlnGlu: 2.499 ± 1.395
0.833GlnPhe: 0.833 ± 0.76
4.165GlnGly: 4.165 ± 1.208
0.0GlnHis: 0.0 ± 0.0
2.082GlnIle: 2.082 ± 0.648
1.249GlnLys: 1.249 ± 0.583
3.332GlnLeu: 3.332 ± 0.797
1.249GlnMet: 1.249 ± 0.679
1.249GlnAsn: 1.249 ± 0.586
3.748GlnPro: 3.748 ± 0.603
2.082GlnGln: 2.082 ± 1.376
2.082GlnArg: 2.082 ± 0.463
1.666GlnSer: 1.666 ± 0.863
4.581GlnThr: 4.581 ± 1.253
3.748GlnVal: 3.748 ± 1.194
0.833GlnTrp: 0.833 ± 0.681
0.416GlnTyr: 0.416 ± 0.38
0.0GlnXaa: 0.0 ± 0.0
Arg
5.831ArgAla: 5.831 ± 1.161
2.915ArgCys: 2.915 ± 1.966
2.082ArgAsp: 2.082 ± 1.299
1.249ArgGlu: 1.249 ± 0.8
2.499ArgPhe: 2.499 ± 1.058
3.748ArgGly: 3.748 ± 0.974
2.915ArgHis: 2.915 ± 1.169
0.416ArgIle: 0.416 ± 0.35
6.664ArgLys: 6.664 ± 1.412
4.581ArgLeu: 4.581 ± 1.092
0.0ArgMet: 0.0 ± 0.0
3.332ArgAsn: 3.332 ± 1.013
4.165ArgPro: 4.165 ± 1.29
2.082ArgGln: 2.082 ± 1.082
2.499ArgArg: 2.499 ± 0.55
2.082ArgSer: 2.082 ± 0.464
3.332ArgThr: 3.332 ± 0.821
4.581ArgVal: 4.581 ± 1.307
0.416ArgTrp: 0.416 ± 0.597
4.165ArgTyr: 4.165 ± 0.839
0.0ArgXaa: 0.0 ± 0.0
Ser
4.998SerAla: 4.998 ± 1.207
0.833SerCys: 0.833 ± 0.549
2.915SerAsp: 2.915 ± 0.725
4.581SerGlu: 4.581 ± 0.892
3.332SerPhe: 3.332 ± 1.948
7.08SerGly: 7.08 ± 1.467
2.499SerHis: 2.499 ± 0.936
4.998SerIle: 4.998 ± 1.786
2.915SerLys: 2.915 ± 0.699
7.497SerLeu: 7.497 ± 1.491
1.249SerMet: 1.249 ± 0.583
2.915SerAsn: 2.915 ± 1.118
5.831SerPro: 5.831 ± 2.169
2.915SerGln: 2.915 ± 0.888
4.165SerArg: 4.165 ± 0.478
7.497SerSer: 7.497 ± 1.972
4.581SerThr: 4.581 ± 1.8
5.414SerVal: 5.414 ± 0.969
0.833SerTrp: 0.833 ± 0.409
0.833SerTyr: 0.833 ± 0.403
0.0SerXaa: 0.0 ± 0.0
Thr
4.998ThrAla: 4.998 ± 1.199
1.666ThrCys: 1.666 ± 0.806
4.165ThrAsp: 4.165 ± 1.125
2.499ThrGlu: 2.499 ± 0.745
3.332ThrPhe: 3.332 ± 0.725
6.664ThrGly: 6.664 ± 1.514
1.249ThrHis: 1.249 ± 0.843
2.082ThrIle: 2.082 ± 0.95
2.082ThrLys: 2.082 ± 1.489
4.581ThrLeu: 4.581 ± 0.967
2.082ThrMet: 2.082 ± 0.825
2.915ThrAsn: 2.915 ± 0.646
5.414ThrPro: 5.414 ± 1.21
2.499ThrGln: 2.499 ± 1.242
3.332ThrArg: 3.332 ± 0.926
4.581ThrSer: 4.581 ± 1.3
4.581ThrThr: 4.581 ± 1.08
5.414ThrVal: 5.414 ± 2.923
1.249ThrTrp: 1.249 ± 0.652
2.082ThrTyr: 2.082 ± 0.75
0.0ThrXaa: 0.0 ± 0.0
Val
2.499ValAla: 2.499 ± 1.087
0.833ValCys: 0.833 ± 0.588
3.748ValAsp: 3.748 ± 0.915
2.499ValGlu: 2.499 ± 0.549
2.915ValPhe: 2.915 ± 1.357
2.915ValGly: 2.915 ± 1.305
0.833ValHis: 0.833 ± 0.76
2.082ValIle: 2.082 ± 0.908
3.332ValLys: 3.332 ± 0.954
3.332ValLeu: 3.332 ± 0.595
0.0ValMet: 0.0 ± 0.0
0.416ValAsn: 0.416 ± 0.38
3.748ValPro: 3.748 ± 0.96
4.165ValGln: 4.165 ± 1.656
4.165ValArg: 4.165 ± 1.097
4.581ValSer: 4.581 ± 1.677
4.581ValThr: 4.581 ± 1.741
2.082ValVal: 2.082 ± 0.815
0.833ValTrp: 0.833 ± 0.484
3.748ValTyr: 3.748 ± 1.228
0.0ValXaa: 0.0 ± 0.0
Trp
0.416TrpAla: 0.416 ± 0.341
0.416TrpCys: 0.416 ± 0.593
1.249TrpAsp: 1.249 ± 0.585
0.416TrpGlu: 0.416 ± 0.38
1.666TrpPhe: 1.666 ± 0.531
0.833TrpGly: 0.833 ± 0.356
0.416TrpHis: 0.416 ± 0.597
0.416TrpIle: 0.416 ± 0.341
1.249TrpLys: 1.249 ± 0.843
1.666TrpLeu: 1.666 ± 0.536
0.0TrpMet: 0.0 ± 0.0
0.833TrpAsn: 0.833 ± 0.76
0.0TrpPro: 0.0 ± 0.0
0.833TrpGln: 0.833 ± 0.463
0.416TrpArg: 0.416 ± 0.341
0.833TrpSer: 0.833 ± 0.431
1.666TrpThr: 1.666 ± 0.826
1.249TrpVal: 1.249 ± 0.681
0.0TrpTrp: 0.0 ± 0.0
0.416TrpTyr: 0.416 ± 0.35
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.499TyrAla: 2.499 ± 0.549
0.833TyrCys: 0.833 ± 0.655
2.915TyrAsp: 2.915 ± 0.922
0.833TyrGlu: 0.833 ± 0.409
1.249TyrPhe: 1.249 ± 0.665
0.833TyrGly: 0.833 ± 0.64
2.082TyrHis: 2.082 ± 0.898
1.249TyrIle: 1.249 ± 0.381
0.833TyrLys: 0.833 ± 0.76
3.332TyrLeu: 3.332 ± 1.479
0.416TyrMet: 0.416 ± 0.35
0.0TyrAsn: 0.0 ± 0.0
1.249TyrPro: 1.249 ± 0.586
0.416TyrGln: 0.416 ± 0.341
3.332TyrArg: 3.332 ± 1.125
2.915TyrSer: 2.915 ± 1.296
1.666TyrThr: 1.666 ± 1.008
0.833TyrVal: 0.833 ± 0.76
1.666TyrTrp: 1.666 ± 0.856
1.666TyrTyr: 1.666 ± 0.73
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2402 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski