Amino acid dipepetide frequency for Human polyomavirus IPPyV

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.031AlaAla: 6.031 ± 2.653
1.096AlaCys: 1.096 ± 0.457
0.548AlaAsp: 0.548 ± 0.544
2.741AlaGlu: 2.741 ± 1.373
1.096AlaPhe: 1.096 ± 0.546
1.645AlaGly: 1.645 ± 0.986
0.0AlaHis: 0.0 ± 0.0
3.838AlaIle: 3.838 ± 1.374
1.645AlaLys: 1.645 ± 0.635
9.32AlaLeu: 9.32 ± 3.376
0.548AlaMet: 0.548 ± 0.367
1.645AlaAsn: 1.645 ± 0.58
2.741AlaPro: 2.741 ± 0.822
1.645AlaGln: 1.645 ± 0.58
2.193AlaArg: 2.193 ± 1.407
3.838AlaSer: 3.838 ± 1.193
2.741AlaThr: 2.741 ± 1.273
8.224AlaVal: 8.224 ± 1.14
2.741AlaTrp: 2.741 ± 0.99
1.645AlaTyr: 1.645 ± 0.594
0.0AlaXaa: 0.0 ± 0.0
Cys
0.548CysAla: 0.548 ± 0.375
1.096CysCys: 1.096 ± 1.085
0.548CysAsp: 0.548 ± 0.544
1.096CysGlu: 1.096 ± 0.75
1.096CysPhe: 1.096 ± 0.546
1.645CysGly: 1.645 ± 0.594
0.0CysHis: 0.0 ± 0.0
1.096CysIle: 1.096 ± 1.085
2.741CysLys: 2.741 ± 0.324
2.193CysLeu: 2.193 ± 1.093
0.0CysMet: 0.0 ± 0.0
0.548CysAsn: 0.548 ± 0.375
1.645CysPro: 1.645 ± 0.933
2.193CysGln: 2.193 ± 0.458
0.0CysArg: 0.0 ± 0.0
1.096CysSer: 1.096 ± 0.75
1.645CysThr: 1.645 ± 1.126
1.096CysVal: 1.096 ± 0.75
0.0CysTrp: 0.0 ± 0.0
1.096CysTyr: 1.096 ± 1.085
0.0CysXaa: 0.0 ± 0.0
Asp
1.096AspAla: 1.096 ± 0.641
0.548AspCys: 0.548 ± 0.542
2.193AspAsp: 2.193 ± 1.501
2.741AspGlu: 2.741 ± 0.912
3.289AspPhe: 3.289 ± 1.77
2.741AspGly: 2.741 ± 1.28
0.548AspHis: 0.548 ± 0.375
2.741AspIle: 2.741 ± 1.273
2.741AspLys: 2.741 ± 1.497
3.289AspLeu: 3.289 ± 1.1
1.096AspMet: 1.096 ± 1.089
1.645AspAsn: 1.645 ± 0.635
3.838AspPro: 3.838 ± 1.478
2.741AspGln: 2.741 ± 0.867
1.096AspArg: 1.096 ± 0.457
1.096AspSer: 1.096 ± 0.75
0.548AspThr: 0.548 ± 0.544
2.193AspVal: 2.193 ± 1.11
2.193AspTrp: 2.193 ± 1.407
1.096AspTyr: 1.096 ± 0.703
0.0AspXaa: 0.0 ± 0.0
Glu
4.386GluAla: 4.386 ± 1.739
1.645GluCys: 1.645 ± 0.594
3.838GluAsp: 3.838 ± 1.508
7.127GluGlu: 7.127 ± 1.264
2.193GluPhe: 2.193 ± 0.925
2.193GluGly: 2.193 ± 0.915
0.0GluHis: 0.0 ± 0.0
6.031GluIle: 6.031 ± 1.581
8.224GluLys: 8.224 ± 2.602
8.224GluLeu: 8.224 ± 1.013
0.548GluMet: 0.548 ± 0.375
3.838GluAsn: 3.838 ± 0.593
2.741GluPro: 2.741 ± 0.324
3.838GluGln: 3.838 ± 1.458
2.193GluArg: 2.193 ± 1.073
2.193GluSer: 2.193 ± 0.458
1.096GluThr: 1.096 ± 0.75
3.289GluVal: 3.289 ± 2.637
0.0GluTrp: 0.0 ± 0.0
4.934GluTyr: 4.934 ± 1.74
0.0GluXaa: 0.0 ± 0.0
Phe
1.096PheAla: 1.096 ± 0.75
2.741PheCys: 2.741 ± 0.867
1.645PheAsp: 1.645 ± 1.126
2.193PheGlu: 2.193 ± 0.925
1.096PhePhe: 1.096 ± 0.548
1.645PheGly: 1.645 ± 0.565
2.193PheHis: 2.193 ± 0.678
1.645PheIle: 1.645 ± 1.126
1.645PheLys: 1.645 ± 1.126
3.838PheLeu: 3.838 ± 1.508
1.096PheMet: 1.096 ± 0.532
1.645PheAsn: 1.645 ± 1.202
3.838PhePro: 3.838 ± 1.181
3.838PheGln: 3.838 ± 0.974
0.548PheArg: 0.548 ± 0.542
3.838PheSer: 3.838 ± 0.694
3.838PheThr: 3.838 ± 0.697
0.548PheVal: 0.548 ± 0.375
0.548PheTrp: 0.548 ± 0.542
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.838GlyAla: 3.838 ± 1.571
0.548GlyCys: 0.548 ± 0.375
2.193GlyAsp: 2.193 ± 1.136
4.934GlyGlu: 4.934 ± 1.61
1.645GlyPhe: 1.645 ± 1.075
7.675GlyGly: 7.675 ± 1.429
1.096GlyHis: 1.096 ± 0.703
3.838GlyIle: 3.838 ± 0.981
4.386GlyLys: 4.386 ± 1.876
7.127GlyLeu: 7.127 ± 2.652
0.0GlyMet: 0.0 ± 0.0
3.289GlyAsn: 3.289 ± 1.1
4.386GlyPro: 4.386 ± 0.916
2.193GlyGln: 2.193 ± 0.915
2.193GlyArg: 2.193 ± 1.407
4.386GlySer: 4.386 ± 0.405
4.386GlyThr: 4.386 ± 1.612
4.386GlyVal: 4.386 ± 1.139
0.0GlyTrp: 0.0 ± 0.0
3.838GlyTyr: 3.838 ± 0.518
0.0GlyXaa: 0.0 ± 0.0
His
2.193HisAla: 2.193 ± 0.678
1.096HisCys: 1.096 ± 0.75
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.645HisPhe: 1.645 ± 1.126
1.096HisGly: 1.096 ± 0.75
0.548HisHis: 0.548 ± 0.375
1.645HisIle: 1.645 ± 0.58
1.096HisLys: 1.096 ± 0.546
1.096HisLeu: 1.096 ± 0.75
1.645HisMet: 1.645 ± 0.772
0.0HisAsn: 0.0 ± 0.0
1.096HisPro: 1.096 ± 0.546
0.548HisGln: 0.548 ± 0.544
0.548HisArg: 0.548 ± 0.375
0.548HisSer: 0.548 ± 0.375
0.548HisThr: 0.548 ± 0.544
1.096HisVal: 1.096 ± 0.75
0.0HisTrp: 0.0 ± 0.0
2.741HisTyr: 2.741 ± 1.233
0.0HisXaa: 0.0 ± 0.0
Ile
1.645IleAla: 1.645 ± 1.054
1.096IleCys: 1.096 ± 0.75
0.548IleAsp: 0.548 ± 0.375
3.838IleGlu: 3.838 ± 1.267
2.193IlePhe: 2.193 ± 1.096
3.838IleGly: 3.838 ± 2.118
2.741IleHis: 2.741 ± 0.324
0.548IleIle: 0.548 ± 0.542
2.741IleLys: 2.741 ± 1.042
6.579IleLeu: 6.579 ± 2.703
1.645IleMet: 1.645 ± 0.58
2.193IleAsn: 2.193 ± 0.641
0.548IlePro: 0.548 ± 0.544
2.741IleGln: 2.741 ± 0.324
1.096IleArg: 1.096 ± 0.546
1.096IleSer: 1.096 ± 0.457
4.386IleThr: 4.386 ± 0.699
1.645IleVal: 1.645 ± 0.594
1.096IleTrp: 1.096 ± 0.703
4.934IleTyr: 4.934 ± 0.508
0.0IleXaa: 0.0 ± 0.0
Lys
4.934LysAla: 4.934 ± 0.508
1.096LysCys: 1.096 ± 0.546
1.645LysAsp: 1.645 ± 0.635
4.386LysGlu: 4.386 ± 1.461
1.645LysPhe: 1.645 ± 0.594
3.289LysGly: 3.289 ± 0.971
2.741LysHis: 2.741 ± 1.876
3.289LysIle: 3.289 ± 1.529
8.224LysLys: 8.224 ± 1.656
5.482LysLeu: 5.482 ± 0.787
1.096LysMet: 1.096 ± 0.546
3.838LysAsn: 3.838 ± 1.084
3.838LysPro: 3.838 ± 1.302
1.645LysGln: 1.645 ± 0.764
3.838LysArg: 3.838 ± 1.558
3.289LysSer: 3.289 ± 1.77
3.838LysThr: 3.838 ± 1.156
1.645LysVal: 1.645 ± 1.126
1.096LysTrp: 1.096 ± 0.75
1.096LysTyr: 1.096 ± 1.089
0.0LysXaa: 0.0 ± 0.0
Leu
3.838LeuAla: 3.838 ± 1.458
0.548LeuCys: 0.548 ± 0.544
7.127LeuAsp: 7.127 ± 1.314
8.772LeuGlu: 8.772 ± 1.943
6.579LeuPhe: 6.579 ± 0.496
5.482LeuGly: 5.482 ± 1.554
2.741LeuHis: 2.741 ± 1.042
2.741LeuIle: 2.741 ± 0.939
1.645LeuLys: 1.645 ± 0.764
15.351LeuLeu: 15.351 ± 3.865
3.838LeuMet: 3.838 ± 1.607
6.031LeuAsn: 6.031 ± 0.532
6.579LeuPro: 6.579 ± 2.42
3.838LeuGln: 3.838 ± 1.302
4.386LeuArg: 4.386 ± 0.912
10.417LeuSer: 10.417 ± 3.174
4.934LeuThr: 4.934 ± 2.337
3.838LeuVal: 3.838 ± 1.158
1.645LeuTrp: 1.645 ± 1.022
4.386LeuTyr: 4.386 ± 1.217
0.0LeuXaa: 0.0 ± 0.0
Met
1.645MetAla: 1.645 ± 0.772
0.548MetCys: 0.548 ± 0.375
2.741MetAsp: 2.741 ± 1.546
1.645MetGlu: 1.645 ± 0.635
1.096MetPhe: 1.096 ± 0.457
1.645MetGly: 1.645 ± 1.076
0.548MetHis: 0.548 ± 0.375
0.0MetIle: 0.0 ± 0.0
1.645MetLys: 1.645 ± 0.594
2.741MetLeu: 2.741 ± 1.373
0.0MetMet: 0.0 ± 0.0
1.096MetAsn: 1.096 ± 0.75
0.548MetPro: 0.548 ± 0.544
1.645MetGln: 1.645 ± 0.594
1.645MetArg: 1.645 ± 0.896
1.645MetSer: 1.645 ± 0.58
1.096MetThr: 1.096 ± 0.457
0.548MetVal: 0.548 ± 0.544
0.548MetTrp: 0.548 ± 0.544
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.386AsnAla: 4.386 ± 1.437
2.193AsnCys: 2.193 ± 1.073
0.548AsnAsp: 0.548 ± 0.544
5.482AsnGlu: 5.482 ± 0.762
0.548AsnPhe: 0.548 ± 0.542
1.645AsnGly: 1.645 ± 0.772
0.0AsnHis: 0.0 ± 0.0
3.838AsnIle: 3.838 ± 1.587
3.838AsnLys: 3.838 ± 1.825
4.386AsnLeu: 4.386 ± 0.699
1.096AsnMet: 1.096 ± 0.759
2.741AsnAsn: 2.741 ± 0.785
5.482AsnPro: 5.482 ± 1.391
2.741AsnGln: 2.741 ± 1.253
0.0AsnArg: 0.0 ± 0.0
2.741AsnSer: 2.741 ± 1.042
2.193AsnThr: 2.193 ± 0.458
1.096AsnVal: 1.096 ± 1.089
1.096AsnTrp: 1.096 ± 0.457
2.193AsnTyr: 2.193 ± 0.641
0.0AsnXaa: 0.0 ± 0.0
Pro
1.096ProAla: 1.096 ± 0.457
1.645ProCys: 1.645 ± 0.896
3.838ProAsp: 3.838 ± 1.553
3.838ProGlu: 3.838 ± 1.825
1.096ProPhe: 1.096 ± 0.75
4.934ProGly: 4.934 ± 0.912
0.0ProHis: 0.0 ± 0.0
2.741ProIle: 2.741 ± 1.233
3.289ProLys: 3.289 ± 0.531
3.838ProLeu: 3.838 ± 0.694
1.645ProMet: 1.645 ± 0.933
1.645ProAsn: 1.645 ± 0.933
6.579ProPro: 6.579 ± 1.062
2.193ProGln: 2.193 ± 1.136
2.741ProArg: 2.741 ± 1.605
3.289ProSer: 3.289 ± 0.356
5.482ProThr: 5.482 ± 2.025
4.934ProVal: 4.934 ± 3.449
1.096ProTrp: 1.096 ± 0.703
1.096ProTyr: 1.096 ± 0.457
0.0ProXaa: 0.0 ± 0.0
Gln
3.289GlnAla: 3.289 ± 0.531
0.548GlnCys: 0.548 ± 0.375
1.096GlnAsp: 1.096 ± 0.703
3.289GlnGlu: 3.289 ± 0.531
2.741GlnPhe: 2.741 ± 0.324
3.838GlnGly: 3.838 ± 2.386
2.741GlnHis: 2.741 ± 1.517
4.386GlnIle: 4.386 ± 0.916
2.193GlnLys: 2.193 ± 1.073
4.386GlnLeu: 4.386 ± 1.184
0.548GlnMet: 0.548 ± 0.508
1.645GlnAsn: 1.645 ± 0.933
1.645GlnPro: 1.645 ± 0.635
3.838GlnGln: 3.838 ± 1.181
4.386GlnArg: 4.386 ± 1.46
1.096GlnSer: 1.096 ± 0.75
3.289GlnThr: 3.289 ± 0.778
4.934GlnVal: 4.934 ± 1.206
0.0GlnTrp: 0.0 ± 0.0
1.096GlnTyr: 1.096 ± 0.703
0.0GlnXaa: 0.0 ± 0.0
Arg
2.741ArgAla: 2.741 ± 0.703
0.0ArgCys: 0.0 ± 0.0
0.548ArgAsp: 0.548 ± 0.375
3.838ArgGlu: 3.838 ± 1.922
1.096ArgPhe: 1.096 ± 0.75
4.934ArgGly: 4.934 ± 2.726
0.548ArgHis: 0.548 ± 0.375
1.645ArgIle: 1.645 ± 0.594
2.193ArgLys: 2.193 ± 0.915
3.289ArgLeu: 3.289 ± 1.057
1.096ArgMet: 1.096 ± 0.457
3.289ArgAsn: 3.289 ± 1.057
0.0ArgPro: 0.0 ± 0.0
2.741ArgGln: 2.741 ± 1.373
2.193ArgArg: 2.193 ± 1.407
9.868ArgSer: 9.868 ± 4.381
1.645ArgThr: 1.645 ± 0.594
3.289ArgVal: 3.289 ± 1.095
1.645ArgTrp: 1.645 ± 0.58
1.645ArgTyr: 1.645 ± 1.633
0.0ArgXaa: 0.0 ± 0.0
Ser
5.482SerAla: 5.482 ± 1.558
1.645SerCys: 1.645 ± 0.896
2.193SerAsp: 2.193 ± 0.938
2.741SerGlu: 2.741 ± 0.324
4.934SerPhe: 4.934 ± 1.045
4.934SerGly: 4.934 ± 0.891
1.096SerHis: 1.096 ± 0.75
0.548SerIle: 0.548 ± 0.375
6.579SerLys: 6.579 ± 0.95
7.127SerLeu: 7.127 ± 1.309
2.741SerMet: 2.741 ± 1.439
3.289SerAsn: 3.289 ± 0.356
1.096SerPro: 1.096 ± 0.457
6.031SerGln: 6.031 ± 0.948
4.386SerArg: 4.386 ± 2.813
12.61SerSer: 12.61 ± 1.325
1.645SerThr: 1.645 ± 0.743
4.386SerVal: 4.386 ± 2.272
1.096SerTrp: 1.096 ± 0.75
1.645SerTyr: 1.645 ± 0.58
0.0SerXaa: 0.0 ± 0.0
Thr
0.548ThrAla: 0.548 ± 0.506
2.193ThrCys: 2.193 ± 0.915
1.645ThrAsp: 1.645 ± 0.933
2.741ThrGlu: 2.741 ± 0.859
1.096ThrPhe: 1.096 ± 0.548
2.741ThrGly: 2.741 ± 1.624
0.0ThrHis: 0.0 ± 0.0
3.838ThrIle: 3.838 ± 1.096
0.548ThrLys: 0.548 ± 0.542
7.675ThrLeu: 7.675 ± 2.028
1.096ThrMet: 1.096 ± 0.75
3.289ThrAsn: 3.289 ± 1.544
6.579ThrPro: 6.579 ± 1.143
2.193ThrGln: 2.193 ± 0.458
4.386ThrArg: 4.386 ± 0.912
2.741ThrSer: 2.741 ± 0.606
6.031ThrThr: 6.031 ± 2.564
4.934ThrVal: 4.934 ± 0.861
1.096ThrTrp: 1.096 ± 0.703
2.193ThrTyr: 2.193 ± 1.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.386ValAla: 4.386 ± 1.162
0.548ValCys: 0.548 ± 0.542
4.934ValAsp: 4.934 ± 0.942
5.482ValGlu: 5.482 ± 1.088
0.548ValPhe: 0.548 ± 0.375
3.289ValGly: 3.289 ± 1.317
0.0ValHis: 0.0 ± 0.0
2.193ValIle: 2.193 ± 0.938
3.838ValLys: 3.838 ± 1.81
3.289ValLeu: 3.289 ± 0.876
1.096ValMet: 1.096 ± 1.089
4.934ValAsn: 4.934 ± 1.017
1.645ValPro: 1.645 ± 1.076
2.193ValGln: 2.193 ± 0.458
4.934ValArg: 4.934 ± 1.804
5.482ValSer: 5.482 ± 0.253
4.386ValThr: 4.386 ± 1.162
1.645ValVal: 1.645 ± 0.476
1.096ValTrp: 1.096 ± 0.759
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.096TrpAla: 1.096 ± 0.703
0.0TrpCys: 0.0 ± 0.0
1.096TrpAsp: 1.096 ± 0.546
1.645TrpGlu: 1.645 ± 0.772
1.645TrpPhe: 1.645 ± 0.896
3.289TrpGly: 3.289 ± 1.057
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.548TrpLys: 0.548 ± 0.375
0.548TrpLeu: 0.548 ± 0.506
1.645TrpMet: 1.645 ± 0.58
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.548TrpGln: 0.548 ± 0.375
2.193TrpArg: 2.193 ± 0.458
0.548TrpSer: 0.548 ± 0.544
1.096TrpThr: 1.096 ± 1.085
0.0TrpVal: 0.0 ± 0.0
0.548TrpTrp: 0.548 ± 0.375
1.645TrpTyr: 1.645 ± 0.764
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.645TyrAla: 1.645 ± 0.58
0.548TyrCys: 0.548 ± 0.375
0.548TyrAsp: 0.548 ± 0.375
0.0TyrGlu: 0.0 ± 0.0
2.193TyrPhe: 2.193 ± 0.641
3.838TyrGly: 3.838 ± 1.864
1.645TyrHis: 1.645 ± 0.764
0.548TyrIle: 0.548 ± 0.544
2.193TyrLys: 2.193 ± 1.093
4.934TyrLeu: 4.934 ± 0.757
0.0TyrMet: 0.0 ± 0.0
1.645TyrAsn: 1.645 ± 0.764
2.193TyrPro: 2.193 ± 1.709
1.645TyrGln: 1.645 ± 0.772
3.289TyrArg: 3.289 ± 1.16
4.934TyrSer: 4.934 ± 0.891
2.741TyrThr: 2.741 ± 0.929
2.193TyrVal: 2.193 ± 0.458
0.548TyrTrp: 0.548 ± 0.542
3.289TyrTyr: 3.289 ± 1.095
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1825 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski