Amino acid dipeptide frequency for Human polyomavirus 9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
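The counting described above can be sketched in a few lines of Python. This is only a minimal illustration, not the code used by the database: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein are assumptions made here. Sequences use one-letter codes, with `X` standing for Xaa.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues (one-letter codes)


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: dipeptides are overlapping residue pairs counted within each
    protein; any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400

toy_proteins = ["MALLLS", "LLSSK"]  # hypothetical sequences, not the real proteome
freqs = dipeptide_per_mille(toy_proteins)
overrepresented = sorted(d for d, f in freqs.items() if f > UNIFORM_EXPECTATION)
```

With a real proteome, `overrepresented` would correspond to the entries marked in red and the remaining non-zero entries below 2.5‰ to those marked in blue, assuming the page uses the uniform expectation as its threshold.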

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.031 ± 2.584
AlaCys: 1.096 ± 0.425
AlaAsp: 0.548 ± 0.519
AlaGlu: 2.741 ± 1.347
AlaPhe: 1.096 ± 0.639
AlaGly: 1.645 ± 0.944
AlaHis: 0.0 ± 0.0
AlaIle: 3.838 ± 1.323
AlaLys: 1.645 ± 0.656
AlaLeu: 9.32 ± 3.513
AlaMet: 0.548 ± 0.357
AlaAsn: 1.645 ± 0.581
AlaPro: 2.741 ± 0.876
AlaGln: 1.645 ± 0.581
AlaArg: 2.193 ± 1.455
AlaSer: 3.838 ± 1.093
AlaThr: 2.741 ± 1.391
AlaVal: 8.224 ± 1.25
AlaTrp: 2.741 ± 1.18
AlaTyr: 1.645 ± 0.653
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.548 ± 0.411
CysCys: 1.096 ± 1.29
CysAsp: 0.548 ± 0.519
CysGlu: 1.096 ± 0.823
CysPhe: 1.096 ± 0.639
CysGly: 1.645 ± 0.653
CysHis: 0.0 ± 0.0
CysIle: 1.096 ± 1.29
CysLys: 2.741 ± 0.415
CysLeu: 2.193 ± 1.278
CysMet: 0.0 ± 0.0
CysAsn: 0.548 ± 0.411
CysPro: 1.645 ± 0.854
CysGln: 2.193 ± 0.532
CysArg: 0.0 ± 0.0
CysSer: 1.096 ± 0.823
CysThr: 1.645 ± 1.234
CysVal: 1.096 ± 0.823
CysTrp: 0.0 ± 0.0
CysTyr: 1.096 ± 1.29
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.096 ± 0.66
AspCys: 0.548 ± 0.645
AspAsp: 2.193 ± 1.645
AspGlu: 2.741 ± 0.892
AspPhe: 3.289 ± 1.939
AspGly: 2.741 ± 1.394
AspHis: 0.548 ± 0.411
AspIle: 2.741 ± 1.391
AspLys: 2.741 ± 1.441
AspLeu: 3.289 ± 1.278
AspMet: 1.096 ± 1.038
AspAsn: 1.645 ± 0.656
AspPro: 3.838 ± 1.426
AspGln: 2.741 ± 0.953
AspArg: 1.096 ± 0.425
AspSer: 1.096 ± 0.823
AspThr: 0.548 ± 0.519
AspVal: 2.193 ± 1.161
AspTrp: 2.193 ± 1.455
AspTyr: 1.096 ± 0.728
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.386 ± 1.712
GluCys: 1.645 ± 0.653
GluAsp: 3.838 ± 1.657
GluGlu: 7.127 ± 1.25
GluPhe: 2.193 ± 0.908
GluGly: 2.193 ± 0.849
GluHis: 0.0 ± 0.0
GluIle: 6.031 ± 1.494
GluLys: 8.224 ± 2.71
GluLeu: 8.224 ± 1.106
GluMet: 0.548 ± 0.411
GluAsn: 3.838 ± 0.598
GluPro: 2.741 ± 0.415
GluGln: 3.838 ± 1.44
GluArg: 2.193 ± 1.186
GluSer: 2.193 ± 0.532
GluThr: 1.096 ± 0.823
GluVal: 3.289 ± 2.561
GluTrp: 0.0 ± 0.0
GluTyr: 4.934 ± 1.744
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.096 ± 0.823
PheCys: 2.741 ± 0.953
PheAsp: 1.645 ± 1.234
PheGlu: 2.193 ± 0.908
PhePhe: 1.096 ± 0.528
PheGly: 1.645 ± 0.643
PheHis: 2.193 ± 0.696
PheIle: 1.645 ± 1.234
PheLys: 1.645 ± 1.234
PheLeu: 3.838 ± 1.657
PheMet: 0.548 ± 0.569
PheAsn: 1.645 ± 1.395
PhePro: 3.838 ± 1.088
PheGln: 3.838 ± 0.916
PheArg: 0.548 ± 0.645
PheSer: 3.838 ± 0.757
PheThr: 3.838 ± 0.708
PheVal: 0.548 ± 0.411
PheTrp: 0.548 ± 0.645
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.838 ± 1.733
GlyCys: 0.548 ± 0.411
GlyAsp: 2.193 ± 1.214
GlyGlu: 4.934 ± 1.697
GlyPhe: 1.645 ± 1.307
GlyGly: 7.675 ± 1.545
GlyHis: 1.096 ± 0.728
GlyIle: 3.838 ± 0.93
GlyLys: 4.386 ± 2.018
GlyLeu: 7.127 ± 2.94
GlyMet: 0.0 ± 0.0
GlyAsn: 3.289 ± 1.278
GlyPro: 4.386 ± 1.064
GlyGln: 2.193 ± 0.849
GlyArg: 2.193 ± 1.455
GlySer: 4.386 ± 0.386
GlyThr: 4.386 ± 1.57
GlyVal: 4.386 ± 1.118
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.838 ± 0.53
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.193 ± 0.696
HisCys: 1.096 ± 0.823
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 1.645 ± 1.234
HisGly: 1.096 ± 0.823
HisHis: 0.548 ± 0.411
HisIle: 1.645 ± 0.581
HisLys: 1.096 ± 0.639
HisLeu: 1.096 ± 0.823
HisMet: 1.645 ± 0.856
HisAsn: 0.0 ± 0.0
HisPro: 1.096 ± 0.639
HisGln: 0.548 ± 0.519
HisArg: 0.548 ± 0.411
HisSer: 0.548 ± 0.411
HisThr: 0.548 ± 0.519
HisVal: 1.096 ± 0.823
HisTrp: 0.0 ± 0.0
HisTyr: 2.741 ± 1.251
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.645 ± 1.076
IleCys: 1.096 ± 0.823
IleAsp: 0.548 ± 0.411
IleGlu: 3.838 ± 1.282
IlePhe: 2.193 ± 1.056
IleGly: 3.838 ± 2.148
IleHis: 2.741 ± 0.415
IleIle: 0.548 ± 0.645
IleLys: 2.741 ± 1.025
IleLeu: 6.579 ± 2.821
IleMet: 1.645 ± 0.581
IleAsn: 2.193 ± 0.706
IlePro: 0.548 ± 0.519
IleGln: 2.741 ± 0.415
IleArg: 1.096 ± 0.639
IleSer: 1.096 ± 0.425
IleThr: 4.386 ± 0.751
IleVal: 1.645 ± 0.653
IleTrp: 1.096 ± 0.728
IleTyr: 4.934 ± 0.529
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.934 ± 0.529
LysCys: 1.096 ± 0.639
LysAsp: 1.645 ± 0.656
LysGlu: 4.386 ± 1.434
LysPhe: 1.645 ± 0.653
LysGly: 3.289 ± 0.974
LysHis: 2.741 ± 2.057
LysIle: 3.289 ± 1.719
LysLys: 8.224 ± 1.561
LysLeu: 5.482 ± 0.731
LysMet: 1.096 ± 0.639
LysAsn: 3.838 ± 1.078
LysPro: 3.838 ± 1.485
LysGln: 1.645 ± 0.86
LysArg: 3.838 ± 1.651
LysSer: 3.289 ± 1.939
LysThr: 3.838 ± 1.2
LysVal: 1.645 ± 1.234
LysTrp: 1.096 ± 0.823
LysTyr: 1.096 ± 1.038
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.838 ± 1.44
LeuCys: 0.548 ± 0.519
LeuAsp: 7.127 ± 1.32
LeuGlu: 8.772 ± 2.129
LeuPhe: 6.579 ± 0.535
LeuGly: 5.482 ± 1.679
LeuHis: 2.741 ± 1.025
LeuIle: 2.741 ± 0.911
LeuLys: 1.645 ± 0.86
LeuLeu: 15.351 ± 4.396
LeuMet: 3.838 ± 1.855
LeuAsn: 6.031 ± 0.486
LeuPro: 6.579 ± 2.691
LeuGln: 3.838 ± 1.485
LeuArg: 4.386 ± 0.901
LeuSer: 10.417 ± 3.148
LeuThr: 4.934 ± 2.316
LeuVal: 3.838 ± 1.114
LeuTrp: 1.645 ± 1.216
LeuTyr: 4.386 ± 1.367
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.645 ± 0.856
MetCys: 0.548 ± 0.411
MetAsp: 2.741 ± 1.833
MetGlu: 1.645 ± 0.656
MetPhe: 1.096 ± 0.425
MetGly: 1.645 ± 1.076
MetHis: 0.548 ± 0.411
MetIle: 0.0 ± 0.0
MetLys: 1.645 ± 0.653
MetLeu: 2.741 ± 1.502
MetMet: 0.0 ± 0.0
MetAsn: 1.096 ± 0.823
MetPro: 0.548 ± 0.519
MetGln: 1.645 ± 0.653
MetArg: 1.645 ± 0.912
MetSer: 1.645 ± 0.581
MetThr: 1.096 ± 0.425
MetVal: 0.548 ± 0.519
MetTrp: 0.548 ± 0.519
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.386 ± 1.599
AsnCys: 2.193 ± 1.186
AsnAsp: 0.548 ± 0.519
AsnGlu: 5.482 ± 0.847
AsnPhe: 0.548 ± 0.645
AsnGly: 1.645 ± 0.856
AsnHis: 0.0 ± 0.0
AsnIle: 3.838 ± 1.719
AsnLys: 3.838 ± 2.031
AsnLeu: 4.386 ± 0.751
AsnMet: 1.096 ± 0.832
AsnAsn: 2.741 ± 0.72
AsnPro: 5.482 ± 1.309
AsnGln: 2.741 ± 1.338
AsnArg: 0.0 ± 0.0
AsnSer: 2.741 ± 1.025
AsnThr: 2.193 ± 0.532
AsnVal: 1.096 ± 1.038
AsnTrp: 1.096 ± 0.425
AsnTyr: 2.193 ± 0.706
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.096 ± 0.425
ProCys: 1.645 ± 0.912
ProAsp: 3.838 ± 1.596
ProGlu: 3.838 ± 2.031
ProPhe: 1.096 ± 0.823
ProGly: 4.934 ± 1.022
ProHis: 0.0 ± 0.0
ProIle: 2.741 ± 1.251
ProLys: 3.289 ± 0.632
ProLeu: 3.838 ± 0.757
ProMet: 1.645 ± 0.854
ProAsn: 1.645 ± 0.854
ProPro: 6.579 ± 1.263
ProGln: 2.193 ± 1.214
ProArg: 2.741 ± 1.66
ProSer: 3.289 ± 0.354
ProThr: 5.482 ± 2.2
ProVal: 4.934 ± 3.203
ProTrp: 1.096 ± 0.728
ProTyr: 1.096 ± 0.425
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.289 ± 0.632
GlnCys: 0.548 ± 0.411
GlnAsp: 1.096 ± 0.728
GlnGlu: 3.289 ± 0.632
GlnPhe: 2.741 ± 0.415
GlnGly: 3.838 ± 2.197
GlnHis: 2.741 ± 1.518
GlnIle: 4.386 ± 1.064
GlnLys: 2.193 ± 1.186
GlnLeu: 4.386 ± 1.159
GlnMet: 0.548 ± 0.48
GlnAsn: 1.645 ± 0.854
GlnPro: 1.645 ± 0.656
GlnGln: 3.838 ± 1.088
GlnArg: 4.386 ± 1.519
GlnSer: 1.096 ± 0.823
GlnThr: 3.289 ± 0.844
GlnVal: 4.934 ± 1.218
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.096 ± 0.728
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.741 ± 0.657
ArgCys: 0.0 ± 0.0
ArgAsp: 0.548 ± 0.411
ArgGlu: 3.838 ± 1.963
ArgPhe: 1.096 ± 0.823
ArgGly: 4.934 ± 2.912
ArgHis: 0.548 ± 0.411
ArgIle: 1.645 ± 0.653
ArgLys: 2.193 ± 0.849
ArgLeu: 3.289 ± 1.1
ArgMet: 1.096 ± 0.425
ArgAsn: 3.289 ± 1.1
ArgPro: 0.0 ± 0.0
ArgGln: 2.741 ± 1.502
ArgArg: 2.193 ± 1.455
ArgSer: 9.868 ± 4.486
ArgThr: 1.645 ± 0.653
ArgVal: 3.289 ± 1.202
ArgTrp: 1.645 ± 0.581
ArgTyr: 1.645 ± 1.557
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.482 ± 1.533
SerCys: 1.645 ± 0.912
SerAsp: 2.193 ± 1.009
SerGlu: 2.741 ± 0.415
SerPhe: 4.934 ± 1.136
SerGly: 4.934 ± 0.906
SerHis: 1.096 ± 0.823
SerIle: 0.548 ± 0.411
SerLys: 6.579 ± 1.029
SerLeu: 7.127 ± 1.585
SerMet: 3.289 ± 1.416
SerAsn: 3.289 ± 0.354
SerPro: 1.096 ± 0.425
SerGln: 6.031 ± 0.945
SerArg: 4.386 ± 2.911
SerSer: 12.61 ± 1.277
SerThr: 1.645 ± 0.869
SerVal: 4.386 ± 2.429
SerTrp: 1.096 ± 0.823
SerTyr: 1.645 ± 0.581
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 0.548 ± 0.502
ThrCys: 2.193 ± 0.849
ThrAsp: 1.645 ± 0.854
ThrGlu: 2.741 ± 0.904
ThrPhe: 1.096 ± 0.528
ThrGly: 2.741 ± 1.683
ThrHis: 0.0 ± 0.0
ThrIle: 3.838 ± 1.111
ThrLys: 0.548 ± 0.645
ThrLeu: 7.675 ± 2.32
ThrMet: 1.096 ± 0.823
ThrAsn: 3.289 ± 1.712
ThrPro: 6.579 ± 1.327
ThrGln: 2.193 ± 0.532
ThrArg: 4.386 ± 0.901
ThrSer: 2.741 ± 0.643
ThrThr: 6.031 ± 2.389
ThrVal: 4.934 ± 0.873
ThrTrp: 1.096 ± 0.728
ThrTyr: 2.193 ± 0.991
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.386 ± 1.163
ValCys: 0.548 ± 0.645
ValAsp: 4.934 ± 0.967
ValGlu: 5.482 ± 1.145
ValPhe: 0.548 ± 0.411
ValGly: 3.289 ± 1.338
ValHis: 0.0 ± 0.0
ValIle: 2.193 ± 1.009
ValLys: 3.838 ± 1.653
ValLeu: 3.289 ± 0.841
ValMet: 1.096 ± 1.038
ValAsn: 4.934 ± 1.149
ValPro: 1.645 ± 1.076
ValGln: 2.193 ± 0.532
ValArg: 4.934 ± 2.005
ValSer: 5.482 ± 0.258
ValThr: 4.386 ± 1.163
ValVal: 1.645 ± 0.453
ValTrp: 1.096 ± 0.832
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.096 ± 0.728
TrpCys: 0.0 ± 0.0
TrpAsp: 1.096 ± 0.639
TrpGlu: 1.645 ± 0.856
TrpPhe: 1.645 ± 0.912
TrpGly: 3.289 ± 1.1
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.548 ± 0.411
TrpLeu: 0.548 ± 0.502
TrpMet: 1.645 ± 0.581
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.548 ± 0.411
TrpArg: 2.193 ± 0.532
TrpSer: 0.548 ± 0.519
TrpThr: 1.096 ± 1.29
TrpVal: 0.0 ± 0.0
TrpTrp: 0.548 ± 0.411
TrpTyr: 1.645 ± 0.86
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.645 ± 0.581
TyrCys: 0.548 ± 0.411
TyrAsp: 0.548 ± 0.411
TyrGlu: 0.0 ± 0.0
TyrPhe: 2.193 ± 0.706
TyrGly: 3.838 ± 2.036
TyrHis: 1.645 ± 0.86
TyrIle: 0.548 ± 0.519
TyrLys: 2.193 ± 1.278
TyrLeu: 4.934 ± 0.707
TyrMet: 0.0 ± 0.0
TyrAsn: 1.645 ± 0.86
TyrPro: 2.193 ± 1.691
TyrGln: 1.645 ± 0.856
TyrArg: 3.289 ± 1.162
TyrSer: 4.934 ± 0.906
TyrThr: 2.741 ± 0.985
TyrVal: 2.193 ± 0.532
TyrTrp: 0.548 ± 0.645
TyrTyr: 3.289 ± 1.202
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1825 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
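A protein-level bootstrap of this kind could look roughly like the sketch below. The page does not describe the procedure in detail, so the resampling scheme (drawing whole proteins with replacement, 100 replicates, standard deviation of the replicate frequencies as the error) and all names such as `bootstrap_error` are assumptions made for illustration; the sequences are toy data.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement,
    so that the spread reflects protein-to-protein variation.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy_proteins = ["MALLLS", "LLSSK", "MKKLL"]  # hypothetical sequences
mean_ll, err_ll = bootstrap_error(toy_proteins, "LL")
mean_ww, err_ww = bootstrap_error(toy_proteins, "WW")
```

Under this scheme, a dipeptide that is absent from every protein (here `WW`) gives 0.0 with an error of 0.0, which matches the form of the `0.0 ± 0.0` entries in the table.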

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski