Amino acid dipeptide frequency for STL polyomavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
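
As a rough illustration of how such a frequency could be computed, here is a minimal sketch. It assumes one-letter sequence codes, overlapping pairs counted within each protein, and normalization by the total number of pairs; the function name and the toy sequences are hypothetical, and the exact procedure used by Proteome-pI may differ.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille over all given proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not the STL polyomavirus proteome).
toy = ["MAAL", "ALL"]
freqs = dipeptide_frequencies(toy)
```

In the toy input, the pair AL occurs twice among five pairs in total.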

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
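
The arithmetic behind these numbers can be sketched as follows. The comparison against a uniform 2.5‰ baseline is an assumption made for illustration; the page does not state the exact threshold used for the red and blue marking, and `classify` is a hypothetical helper.

```python
N_STANDARD = 20

n_pairs = N_STANDARD ** 2                 # 20 x 20 ordered pairs
n_pairs_xaa = (N_STANDARD + 1) ** 2       # 21 x 21 when Xaa is included
expected_permille = 1000.0 / n_pairs      # uniform expectation in per mille

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed baseline)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

leu_leu = classify(14.269)   # LeuLeu value from the table
ala_cys = classify(0.571)    # AlaCys value from the table
```

Under this assumed baseline, LeuLeu would count as overrepresented and AlaCys as underrepresented.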

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
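
A short sketch of this unit conversion, using the LeuLeu value from the table; the helper names are hypothetical.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

leu_leu_fraction = permille_to_fraction(14.269)
leu_leu_percent = permille_to_percent(14.269)
```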

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.137 ± 2.388
AlaCys: 0.571 ± 0.383
AlaAsp: 2.283 ± 1.52
AlaGlu: 3.995 ± 1.197
AlaPhe: 2.854 ± 1.641
AlaGly: 2.854 ± 1.072
AlaHis: 1.142 ± 0.85
AlaIle: 5.137 ± 2.778
AlaLys: 3.995 ± 1.298
AlaLeu: 3.995 ± 1.197
AlaMet: 0.571 ± 0.503
AlaAsn: 0.571 ± 0.562
AlaPro: 3.425 ± 2.132
AlaGln: 2.283 ± 1.2
AlaArg: 2.854 ± 0.875
AlaSer: 1.712 ± 1.052
AlaThr: 1.712 ± 1.686
AlaVal: 5.708 ± 1.763
AlaTrp: 2.854 ± 0.927
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.283 ± 0.7
CysCys: 1.712 ± 0.653
CysAsp: 1.712 ± 0.653
CysGlu: 2.854 ± 1.327
CysPhe: 1.712 ± 1.306
CysGly: 0.571 ± 0.562
CysHis: 0.571 ± 0.383
CysIle: 1.142 ± 0.679
CysLys: 0.571 ± 0.682
CysLeu: 1.142 ± 0.766
CysMet: 0.0 ± 0.0
CysAsn: 1.142 ± 0.486
CysPro: 2.283 ± 1.52
CysGln: 1.712 ± 0.979
CysArg: 1.712 ± 0.653
CysSer: 2.283 ± 1.533
CysThr: 1.712 ± 0.979
CysVal: 2.283 ± 1.243
CysTrp: 0.0 ± 0.0
CysTyr: 1.142 ± 0.679
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.283 ± 1.615
AspCys: 0.571 ± 0.682
AspAsp: 1.712 ± 1.15
AspGlu: 4.566 ± 1.15
AspPhe: 1.142 ± 0.766
AspGly: 2.854 ± 2.073
AspHis: 0.571 ± 0.383
AspIle: 3.995 ± 0.661
AspLys: 1.712 ± 0.653
AspLeu: 6.279 ± 1.821
AspMet: 0.0 ± 0.0
AspAsn: 1.712 ± 0.785
AspPro: 5.137 ± 2.936
AspGln: 1.142 ± 0.679
AspArg: 0.571 ± 0.562
AspSer: 6.849 ± 1.458
AspThr: 3.995 ± 1.127
AspVal: 2.854 ± 1.15
AspTrp: 1.712 ± 0.756
AspTyr: 1.142 ± 0.85
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.279 ± 2.211
GluCys: 1.142 ± 0.679
GluAsp: 5.137 ± 2.943
GluGlu: 4.566 ± 1.15
GluPhe: 2.283 ± 1.533
GluGly: 2.283 ± 0.979
GluHis: 0.571 ± 0.383
GluIle: 4.566 ± 1.822
GluLys: 3.425 ± 1.69
GluLeu: 7.42 ± 2.483
GluMet: 1.142 ± 0.723
GluAsn: 9.132 ± 2.3
GluPro: 0.571 ± 0.562
GluGln: 1.142 ± 0.766
GluArg: 1.712 ± 0.855
GluSer: 2.854 ± 1.072
GluThr: 3.995 ± 0.492
GluVal: 3.425 ± 0.834
GluTrp: 0.0 ± 0.0
GluTyr: 1.142 ± 0.486
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.142 ± 0.486
PheCys: 2.854 ± 0.92
PheAsp: 1.712 ± 0.746
PheGlu: 3.425 ± 1.019
PhePhe: 1.142 ± 0.85
PheGly: 1.142 ± 0.838
PheHis: 2.283 ± 0.979
PheIle: 1.142 ± 0.679
PheLys: 2.854 ± 0.92
PheLeu: 2.283 ± 0.84
PheMet: 1.712 ± 1.109
PheAsn: 2.283 ± 1.533
PhePro: 2.854 ± 0.815
PheGln: 2.854 ± 1.378
PheArg: 2.283 ± 1.156
PheSer: 3.425 ± 0.995
PheThr: 1.142 ± 0.81
PheVal: 0.571 ± 0.383
PheTrp: 1.142 ± 0.85
PheTyr: 0.571 ± 0.562
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.142 ± 1.124
GlyCys: 2.283 ± 0.7
GlyAsp: 2.854 ± 1.378
GlyGlu: 2.283 ± 1.358
GlyPhe: 0.571 ± 0.562
GlyGly: 3.425 ± 0.995
GlyHis: 1.142 ± 1.124
GlyIle: 4.566 ± 1.187
GlyLys: 2.283 ± 1.533
GlyLeu: 8.562 ± 2.634
GlyMet: 2.283 ± 0.979
GlyAsn: 3.425 ± 1.29
GlyPro: 2.854 ± 0.927
GlyGln: 4.566 ± 1.187
GlyArg: 2.854 ± 0.875
GlySer: 5.137 ± 2.564
GlyThr: 1.712 ± 0.671
GlyVal: 2.283 ± 1.52
GlyTrp: 1.142 ± 0.85
GlyTyr: 1.712 ± 0.855
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.142 ± 0.679
HisCys: 0.0 ± 0.0
HisAsp: 1.142 ± 0.679
HisGlu: 1.142 ± 0.766
HisPhe: 1.712 ± 0.979
HisGly: 0.0 ± 0.0
HisHis: 0.571 ± 0.383
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 1.142 ± 0.766
HisMet: 0.0 ± 0.0
HisAsn: 2.283 ± 0.844
HisPro: 1.142 ± 0.679
HisGln: 0.0 ± 0.0
HisArg: 1.142 ± 0.766
HisSer: 1.142 ± 0.766
HisThr: 0.571 ± 0.562
HisVal: 0.571 ± 0.682
HisTrp: 0.0 ± 0.0
HisTyr: 1.142 ± 0.486
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.283 ± 0.912
IleCys: 1.712 ± 0.671
IleAsp: 2.854 ± 1.562
IleGlu: 1.712 ± 0.855
IlePhe: 1.142 ± 0.679
IleGly: 2.283 ± 0.844
IleHis: 0.571 ± 0.383
IleIle: 3.425 ± 0.69
IleLys: 0.571 ± 0.682
IleLeu: 8.562 ± 1.433
IleMet: 1.142 ± 0.714
IleAsn: 3.995 ± 0.99
IlePro: 1.712 ± 1.271
IleGln: 2.854 ± 0.517
IleArg: 2.283 ± 0.844
IleSer: 2.854 ± 0.781
IleThr: 3.425 ± 0.671
IleVal: 4.566 ± 2.299
IleTrp: 1.712 ± 1.032
IleTyr: 3.425 ± 1.019
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.0 ± 0.0
LysCys: 1.712 ± 1.306
LysAsp: 1.712 ± 0.979
LysGlu: 2.283 ± 1.156
LysPhe: 1.142 ± 0.766
LysGly: 3.995 ± 1.19
LysHis: 2.854 ± 1.488
LysIle: 2.854 ± 0.647
LysLys: 4.566 ± 1.479
LysLeu: 1.142 ± 0.766
LysMet: 0.0 ± 0.0
LysAsn: 2.854 ± 1.327
LysPro: 2.854 ± 0.92
LysGln: 1.712 ± 0.867
LysArg: 5.137 ± 1.589
LysSer: 3.995 ± 1.19
LysThr: 4.566 ± 1.919
LysVal: 3.425 ± 1.734
LysTrp: 0.0 ± 0.0
LysTyr: 0.571 ± 0.383
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.712 ± 0.671
LeuCys: 2.283 ± 0.972
LeuAsp: 9.703 ± 1.541
LeuGlu: 3.425 ± 1.81
LeuPhe: 2.283 ± 1.533
LeuGly: 6.849 ± 1.56
LeuHis: 1.142 ± 0.679
LeuIle: 5.708 ± 0.304
LeuLys: 4.566 ± 1.919
LeuLeu: 14.269 ± 3.156
LeuMet: 3.425 ± 1.734
LeuAsn: 6.279 ± 1.574
LeuPro: 5.708 ± 1.921
LeuGln: 9.132 ± 4.449
LeuArg: 5.137 ± 0.738
LeuSer: 10.845 ± 4.674
LeuThr: 7.42 ± 1.768
LeuVal: 2.854 ± 1.2
LeuTrp: 1.712 ± 0.653
LeuTyr: 2.283 ± 0.7
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.283 ± 0.593
MetCys: 0.571 ± 0.383
MetAsp: 1.712 ± 0.653
MetGlu: 2.283 ± 0.972
MetPhe: 1.712 ± 0.867
MetGly: 3.995 ± 0.663
MetHis: 0.0 ± 0.0
MetIle: 0.571 ± 0.383
MetLys: 1.142 ± 0.679
MetLeu: 1.142 ± 0.679
MetMet: 0.0 ± 0.0
MetAsn: 1.142 ± 0.766
MetPro: 0.571 ± 0.562
MetGln: 2.283 ± 0.7
MetArg: 1.142 ± 0.85
MetSer: 0.0 ± 0.0
MetThr: 1.142 ± 0.679
MetVal: 1.712 ± 1.15
MetTrp: 0.571 ± 0.383
MetTyr: 1.712 ± 1.306
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.995 ± 2.399
AsnCys: 2.854 ± 1.916
AsnAsp: 0.571 ± 0.562
AsnGlu: 8.562 ± 2.761
AsnPhe: 2.283 ± 0.979
AsnGly: 2.283 ± 0.593
AsnHis: 0.571 ± 0.383
AsnIle: 3.995 ± 1.556
AsnLys: 2.283 ± 1.156
AsnLeu: 5.708 ± 1.931
AsnMet: 2.854 ± 1.2
AsnAsn: 2.854 ± 1.327
AsnPro: 3.425 ± 2.132
AsnGln: 1.142 ± 0.766
AsnArg: 0.0 ± 0.0
AsnSer: 3.995 ± 0.661
AsnThr: 2.854 ± 0.92
AsnVal: 3.995 ± 0.733
AsnTrp: 0.571 ± 0.383
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.566 ± 1.014
ProCys: 1.142 ± 0.81
ProAsp: 7.42 ± 1.944
ProGlu: 2.854 ± 0.92
ProPhe: 2.854 ± 0.517
ProGly: 4.566 ± 1.187
ProHis: 0.0 ± 0.0
ProIle: 0.571 ± 0.383
ProLys: 3.995 ± 1.574
ProLeu: 7.42 ± 2.226
ProMet: 1.142 ± 1.124
ProAsn: 0.0 ± 0.0
ProPro: 7.991 ± 4.304
ProGln: 1.712 ± 1.032
ProArg: 2.283 ± 2.248
ProSer: 1.142 ± 1.124
ProThr: 5.137 ± 2.192
ProVal: 6.279 ± 2.359
ProTrp: 0.0 ± 0.0
ProTyr: 1.142 ± 0.486
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.566 ± 1.479
GlnCys: 0.0 ± 0.0
GlnAsp: 1.712 ± 1.15
GlnGlu: 1.142 ± 0.562
GlnPhe: 2.854 ± 0.868
GlnGly: 1.142 ± 1.124
GlnHis: 0.571 ± 0.682
GlnIle: 3.995 ± 1.864
GlnLys: 3.425 ± 1.224
GlnLeu: 1.712 ± 1.032
GlnMet: 1.142 ± 1.124
GlnAsn: 3.995 ± 1.472
GlnPro: 2.854 ± 1.439
GlnGln: 1.712 ± 0.671
GlnArg: 1.712 ± 0.855
GlnSer: 4.566 ± 2.302
GlnThr: 5.137 ± 2.071
GlnVal: 1.142 ± 0.486
GlnTrp: 0.571 ± 0.383
GlnTyr: 3.995 ± 1.42
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.854 ± 0.647
ArgCys: 1.712 ± 1.686
ArgAsp: 1.142 ± 0.766
ArgGlu: 1.712 ± 0.756
ArgPhe: 2.283 ± 0.7
ArgGly: 1.712 ± 1.686
ArgHis: 0.571 ± 0.562
ArgIle: 2.283 ± 0.593
ArgLys: 1.142 ± 0.486
ArgLeu: 3.995 ± 2.428
ArgMet: 4.566 ± 1.94
ArgAsn: 1.712 ± 1.216
ArgPro: 3.425 ± 1.511
ArgGln: 4.566 ± 1.652
ArgArg: 3.995 ± 1.913
ArgSer: 1.712 ± 0.867
ArgThr: 5.137 ± 1.041
ArgVal: 1.712 ± 0.671
ArgTrp: 1.142 ± 0.85
ArgTyr: 2.283 ± 0.7
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.708 ± 1.123
SerCys: 1.712 ± 0.671
SerAsp: 2.854 ± 0.517
SerGlu: 1.142 ± 0.486
SerPhe: 4.566 ± 1.166
SerGly: 7.991 ± 2.441
SerHis: 0.571 ± 0.383
SerIle: 2.283 ± 0.979
SerLys: 3.995 ± 1.363
SerLeu: 9.132 ± 2.673
SerMet: 0.571 ± 0.383
SerAsn: 3.425 ± 1.019
SerPro: 2.283 ± 0.844
SerGln: 3.425 ± 0.866
SerArg: 6.849 ± 1.954
SerSer: 7.991 ± 1.252
SerThr: 3.425 ± 1.458
SerVal: 3.425 ± 1.933
SerTrp: 1.142 ± 0.679
SerTyr: 3.995 ± 2.393
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.712 ± 1.1
ThrCys: 3.425 ± 1.364
ThrAsp: 2.854 ± 1.5
ThrGlu: 3.995 ± 1.476
ThrPhe: 1.712 ± 1.306
ThrGly: 1.142 ± 1.124
ThrHis: 0.0 ± 0.0
ThrIle: 1.142 ± 0.486
ThrLys: 1.712 ± 0.867
ThrLeu: 9.132 ± 3.345
ThrMet: 3.425 ± 1.224
ThrAsn: 1.712 ± 0.473
ThrPro: 6.279 ± 1.513
ThrGln: 2.283 ± 1.533
ThrArg: 0.571 ± 0.383
ThrSer: 5.708 ± 0.627
ThrThr: 2.854 ± 0.517
ThrVal: 8.562 ± 2.395
ThrTrp: 0.571 ± 0.682
ThrTyr: 3.425 ± 1.342
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.854 ± 0.874
ValCys: 1.142 ± 0.679
ValAsp: 0.571 ± 0.557
ValGlu: 6.849 ± 3.315
ValPhe: 2.283 ± 1.099
ValGly: 2.854 ± 1.378
ValHis: 0.0 ± 0.0
ValIle: 3.995 ± 0.492
ValLys: 3.425 ± 2.299
ValLeu: 9.132 ± 2.808
ValMet: 0.571 ± 0.682
ValAsn: 4.566 ± 0.845
ValPro: 3.425 ± 0.428
ValGln: 2.283 ± 0.972
ValArg: 3.995 ± 1.926
ValSer: 4.566 ± 1.65
ValThr: 3.995 ± 1.275
ValVal: 1.712 ± 0.756
ValTrp: 0.0 ± 0.0
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.142 ± 0.85
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 2.854 ± 0.517
TrpPhe: 1.142 ± 0.81
TrpGly: 2.854 ± 1.373
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.142 ± 0.679
TrpLeu: 1.712 ± 0.756
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.571 ± 0.682
TrpGln: 1.142 ± 0.766
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 1.142 ± 0.85
TrpVal: 0.571 ± 0.383
TrpTrp: 1.142 ± 0.766
TrpTyr: 0.571 ± 0.562
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.571 ± 0.557
TyrCys: 1.142 ± 0.81
TyrAsp: 1.142 ± 1.124
TyrGlu: 1.142 ± 0.562
TyrPhe: 1.712 ± 0.979
TyrGly: 2.283 ± 0.593
TyrHis: 1.142 ± 0.679
TyrIle: 1.712 ± 1.271
TyrLys: 0.571 ± 0.562
TyrLeu: 2.854 ± 0.517
TyrMet: 0.571 ± 0.383
TyrAsn: 1.712 ± 0.867
TyrPro: 2.854 ± 1.5
TyrGln: 0.0 ± 0.0
TyrArg: 3.425 ± 1.313
TyrSer: 5.708 ± 2.1
TyrThr: 1.142 ± 0.766
TyrVal: 0.571 ± 0.383
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.854 ± 1.562
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1753 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
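
A minimal sketch of what a protein-level bootstrap could look like: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. Using the standard deviation as the error, the fixed seed, the function names, and the toy sequences are all assumptions for illustration; the source does not specify these details.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency over n_boot protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is the standard deviation across replicates.
    return statistics.pstdev(estimates)

# Toy input (hypothetical sequences, not the STL polyomavirus proteome).
toy = ["MAAL", "ALL", "LLKA"]
error = bootstrap_error(toy, "AL")
```

With only a handful of proteins, as here, each resample can differ considerably from the original set, which is one reason the errors can be large relative to the values.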

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski