Amino acid dipeptide frequency for Human polyomavirus 12

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
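As a rough illustration of how such frequencies could be computed, the following Python sketch counts overlapping residue pairs and compares each value with the uniform expectation of 2.5‰. It is not the database's own code: the function names, the one-letter input alphabet, the choice not to count pairs that span two proteins, and the mapping of every non-standard letter to X (Xaa) are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands in for Xaa (assumption)
UNIFORM_EXPECTED = 1000.0 / 400       # 2.5 per mille for 400 equally likely dipeptides


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        cleaned = [aa if aa in AMINO_ACIDS else "X" for aa in seq.upper()]
        # Overlapping pairs inside one protein; pairs never span two proteins
        # (an assumption of this sketch).
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found; sequences are too short")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(value, expected=UNIFORM_EXPECTED):
    """Label a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"    # would be shown in red
    if value < expected:
        return "under"   # would be shown in blue
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["MKLLA", "MKXA"])  # toy sequences, not real data
    print(freqs["MK"], classify(freqs["MK"]))
```

Multiplying a returned value by 0.001 gives the plain fraction of all counted dipeptides.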

All values are given in per mille (‰); multiply by 10^-3 to obtain plain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.807AlaAla: 6.807 ± 2.768
0.567AlaCys: 0.567 ± 0.386
4.538AlaAsp: 4.538 ± 1.453
6.807AlaGlu: 6.807 ± 2.111
2.269AlaPhe: 2.269 ± 0.756
3.403AlaGly: 3.403 ± 1.442
1.134AlaHis: 1.134 ± 0.772
3.971AlaIle: 3.971 ± 2.657
6.239AlaLys: 6.239 ± 0.795
5.105AlaLeu: 5.105 ± 2.503
2.836AlaMet: 2.836 ± 0.69
0.567AlaAsn: 0.567 ± 0.386
1.702AlaPro: 1.702 ± 0.703
2.269AlaGln: 2.269 ± 0.639
5.672AlaArg: 5.672 ± 1.618
2.269AlaSer: 2.269 ± 1.08
2.836AlaThr: 2.836 ± 1.049
4.538AlaVal: 4.538 ± 0.907
0.0AlaTrp: 0.0 ± 0.0
1.702AlaTyr: 1.702 ± 0.847
0.0AlaXaa: 0.0 ± 0.0
Cys
1.702CysAla: 1.702 ± 0.703
1.702CysCys: 1.702 ± 0.704
1.702CysAsp: 1.702 ± 0.703
0.0CysGlu: 0.0 ± 0.0
1.134CysPhe: 1.134 ± 1.514
1.134CysGly: 1.134 ± 0.772
0.0CysHis: 0.0 ± 0.0
1.134CysIle: 1.134 ± 0.704
1.134CysLys: 1.134 ± 0.439
1.702CysLeu: 1.702 ± 0.703
0.567CysMet: 0.567 ± 0.757
2.269CysAsn: 2.269 ± 0.756
1.134CysPro: 1.134 ± 0.439
1.702CysGln: 1.702 ± 1.158
0.567CysArg: 0.567 ± 0.757
2.269CysSer: 2.269 ± 0.756
1.702CysThr: 1.702 ± 0.847
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.702CysTyr: 1.702 ± 1.411
0.0CysXaa: 0.0 ± 0.0
Asp
1.134AspAla: 1.134 ± 0.54
0.567AspCys: 0.567 ± 0.386
2.269AspAsp: 2.269 ± 1.046
3.403AspGlu: 3.403 ± 1.317
3.971AspPhe: 3.971 ± 1.448
3.971AspGly: 3.971 ± 0.633
0.567AspHis: 0.567 ± 0.386
2.836AspIle: 2.836 ± 0.82
3.403AspLys: 3.403 ± 1.272
3.971AspLeu: 3.971 ± 0.809
1.134AspMet: 1.134 ± 0.87
2.269AspAsn: 2.269 ± 1.046
4.538AspPro: 4.538 ± 0.907
2.269AspGln: 2.269 ± 0.639
1.134AspArg: 1.134 ± 0.439
3.403AspSer: 3.403 ± 1.272
1.134AspThr: 1.134 ± 0.704
3.971AspVal: 3.971 ± 1.224
3.403AspTrp: 3.403 ± 1.588
1.134AspTyr: 1.134 ± 0.439
0.0AspXaa: 0.0 ± 0.0
Glu
5.105GluAla: 5.105 ± 1.091
0.567GluCys: 0.567 ± 0.386
1.134GluAsp: 1.134 ± 0.439
2.836GluGlu: 2.836 ± 0.475
0.567GluPhe: 0.567 ± 0.435
3.971GluGly: 3.971 ± 0.912
2.269GluHis: 2.269 ± 1.109
1.702GluIle: 1.702 ± 0.703
2.269GluLys: 2.269 ± 1.544
3.971GluLeu: 3.971 ± 1.433
2.836GluMet: 2.836 ± 0.73
3.971GluAsn: 3.971 ± 1.74
1.702GluPro: 1.702 ± 0.703
3.403GluGln: 3.403 ± 1.714
3.403GluArg: 3.403 ± 0.5
5.105GluSer: 5.105 ± 1.578
5.105GluThr: 5.105 ± 1.453
4.538GluVal: 4.538 ± 2.078
1.702GluTrp: 1.702 ± 1.137
0.567GluTyr: 0.567 ± 0.386
0.0GluXaa: 0.0 ± 0.0
Phe
5.672PheAla: 5.672 ± 2.03
0.567PheCys: 0.567 ± 0.757
1.134PheAsp: 1.134 ± 0.772
3.971PheGlu: 3.971 ± 1.448
0.567PhePhe: 0.567 ± 0.386
2.836PheGly: 2.836 ± 1.51
0.567PheHis: 0.567 ± 0.386
0.567PheIle: 0.567 ± 0.386
3.971PheLys: 3.971 ± 2.141
5.105PheLeu: 5.105 ± 0.442
1.702PheMet: 1.702 ± 0.665
2.269PheAsn: 2.269 ± 1.165
1.702PhePro: 1.702 ± 1.158
1.702PheGln: 1.702 ± 0.847
2.269PheArg: 2.269 ± 0.636
1.134PheSer: 1.134 ± 0.439
3.971PheThr: 3.971 ± 0.947
0.0PheVal: 0.0 ± 0.0
0.567PheTrp: 0.567 ± 0.386
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.105GlyAla: 5.105 ± 2.586
0.567GlyCys: 0.567 ± 0.386
5.672GlyAsp: 5.672 ± 1.412
2.836GlyGlu: 2.836 ± 1.049
3.403GlyPhe: 3.403 ± 2.158
9.075GlyGly: 9.075 ± 0.625
3.403GlyHis: 3.403 ± 1.471
3.403GlyIle: 3.403 ± 0.615
2.836GlyLys: 2.836 ± 0.973
5.672GlyLeu: 5.672 ± 2.315
0.567GlyMet: 0.567 ± 0.579
5.672GlyAsn: 5.672 ± 0.933
3.971GlyPro: 3.971 ± 1.226
5.105GlyGln: 5.105 ± 1.243
2.269GlyArg: 2.269 ± 0.639
5.672GlySer: 5.672 ± 1.172
1.702GlyThr: 1.702 ± 0.703
4.538GlyVal: 4.538 ± 0.887
0.567GlyTrp: 0.567 ± 0.435
1.134GlyTyr: 1.134 ± 0.772
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.567HisCys: 0.567 ± 0.386
1.134HisAsp: 1.134 ± 0.439
3.403HisGlu: 3.403 ± 1.037
1.134HisPhe: 1.134 ± 0.88
0.567HisGly: 0.567 ± 0.386
0.567HisHis: 0.567 ± 0.386
1.702HisIle: 1.702 ± 0.939
0.0HisLys: 0.0 ± 0.0
2.269HisLeu: 2.269 ± 0.756
2.269HisMet: 2.269 ± 0.592
0.567HisAsn: 0.567 ± 0.386
2.269HisPro: 2.269 ± 1.395
1.702HisGln: 1.702 ± 0.939
2.269HisArg: 2.269 ± 1.112
0.567HisSer: 0.567 ± 0.386
0.0HisThr: 0.0 ± 0.0
1.702HisVal: 1.702 ± 0.665
0.0HisTrp: 0.0 ± 0.0
2.269HisTyr: 2.269 ± 0.639
0.0HisXaa: 0.0 ± 0.0
Ile
1.134IleAla: 1.134 ± 0.772
1.134IleCys: 1.134 ± 0.772
3.971IleAsp: 3.971 ± 0.873
6.239IleGlu: 6.239 ± 1.218
2.836IlePhe: 2.836 ± 1.433
2.836IleGly: 2.836 ± 1.341
0.567IleHis: 0.567 ± 0.386
0.567IleIle: 0.567 ± 0.435
1.702IleLys: 1.702 ± 0.493
9.075IleLeu: 9.075 ± 2.121
0.567IleMet: 0.567 ± 0.386
2.269IleAsn: 2.269 ± 1.544
2.836IlePro: 2.836 ± 0.87
3.971IleGln: 3.971 ± 1.557
0.567IleArg: 0.567 ± 0.386
5.672IleSer: 5.672 ± 1.941
3.971IleThr: 3.971 ± 1.518
2.836IleVal: 2.836 ± 1.411
2.269IleTrp: 2.269 ± 1.759
1.134IleTyr: 1.134 ± 0.704
0.0IleXaa: 0.0 ± 0.0
Lys
4.538LysAla: 4.538 ± 2.544
1.702LysCys: 1.702 ± 1.158
2.836LysAsp: 2.836 ± 0.973
1.134LysGlu: 1.134 ± 0.439
1.702LysPhe: 1.702 ± 0.847
3.971LysGly: 3.971 ± 1.249
2.269LysHis: 2.269 ± 1.112
3.971LysIle: 3.971 ± 2.022
5.672LysLys: 5.672 ± 1.269
7.941LysLeu: 7.941 ± 1.087
2.836LysMet: 2.836 ± 0.691
2.836LysAsn: 2.836 ± 0.82
1.134LysPro: 1.134 ± 0.704
0.567LysGln: 0.567 ± 0.386
6.239LysArg: 6.239 ± 1.087
2.269LysSer: 2.269 ± 0.878
5.672LysThr: 5.672 ± 2.214
1.134LysVal: 1.134 ± 0.772
0.0LysTrp: 0.0 ± 0.0
2.269LysTyr: 2.269 ± 0.636
0.0LysXaa: 0.0 ± 0.0
Leu
6.239LeuAla: 6.239 ± 3.603
1.702LeuCys: 1.702 ± 0.704
7.374LeuAsp: 7.374 ± 2.208
3.971LeuGlu: 3.971 ± 0.873
5.105LeuPhe: 5.105 ± 1.174
5.672LeuGly: 5.672 ± 2.982
4.538LeuHis: 4.538 ± 1.271
8.508LeuIle: 8.508 ± 1.374
3.971LeuLys: 3.971 ± 2.793
10.777LeuLeu: 10.777 ± 2.844
3.971LeuMet: 3.971 ± 0.628
3.403LeuAsn: 3.403 ± 1.016
5.672LeuPro: 5.672 ± 1.208
5.105LeuGln: 5.105 ± 2.081
3.971LeuArg: 3.971 ± 1.191
4.538LeuSer: 4.538 ± 1.936
5.105LeuThr: 5.105 ± 0.864
2.836LeuVal: 2.836 ± 2.278
2.269LeuTrp: 2.269 ± 0.858
2.269LeuTyr: 2.269 ± 0.639
0.0LeuXaa: 0.0 ± 0.0
Met
3.403MetAla: 3.403 ± 0.5
0.0MetCys: 0.0 ± 0.0
2.836MetAsp: 2.836 ± 0.69
0.567MetGlu: 0.567 ± 0.435
1.702MetPhe: 1.702 ± 0.703
2.269MetGly: 2.269 ± 0.756
0.0MetHis: 0.0 ± 0.0
6.239MetIle: 6.239 ± 2.95
2.836MetLys: 2.836 ± 1.433
2.269MetLeu: 2.269 ± 1.109
1.702MetMet: 1.702 ± 0.776
2.269MetAsn: 2.269 ± 0.636
1.134MetPro: 1.134 ± 0.87
0.0MetGln: 0.0 ± 0.0
2.269MetArg: 2.269 ± 0.858
1.134MetSer: 1.134 ± 0.658
1.702MetThr: 1.702 ± 0.784
2.836MetVal: 2.836 ± 1.64
1.702MetTrp: 1.702 ± 0.703
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.134AsnAla: 1.134 ± 0.772
1.702AsnCys: 1.702 ± 0.847
2.269AsnAsp: 2.269 ± 1.046
3.971AsnGlu: 3.971 ± 1.41
1.134AsnPhe: 1.134 ± 0.88
2.269AsnGly: 2.269 ± 0.878
0.0AsnHis: 0.0 ± 0.0
4.538AsnIle: 4.538 ± 2.509
5.672AsnLys: 5.672 ± 2.699
3.403AsnLeu: 3.403 ± 2.113
2.836AsnMet: 2.836 ± 1.049
1.134AsnAsn: 1.134 ± 0.772
2.836AsnPro: 2.836 ± 1.107
0.0AsnGln: 0.0 ± 0.0
0.567AsnArg: 0.567 ± 0.579
2.269AsnSer: 2.269 ± 1.112
3.403AsnThr: 3.403 ± 2.146
5.672AsnVal: 5.672 ± 1.618
0.0AsnTrp: 0.0 ± 0.0
1.134AsnTyr: 1.134 ± 0.439
0.0AsnXaa: 0.0 ± 0.0
Pro
3.403ProAla: 3.403 ± 0.718
2.269ProCys: 2.269 ± 1.544
5.672ProAsp: 5.672 ± 0.901
2.269ProGlu: 2.269 ± 0.636
1.702ProPhe: 1.702 ± 1.158
3.971ProGly: 3.971 ± 1.872
0.567ProHis: 0.567 ± 0.386
1.134ProIle: 1.134 ± 0.439
5.105ProLys: 5.105 ± 1.277
4.538ProLeu: 4.538 ± 1.396
1.702ProMet: 1.702 ± 1.158
0.0ProAsn: 0.0 ± 0.0
7.941ProPro: 7.941 ± 1.484
0.567ProGln: 0.567 ± 0.435
2.269ProArg: 2.269 ± 1.19
4.538ProSer: 4.538 ± 1.371
0.567ProThr: 0.567 ± 0.386
3.971ProVal: 3.971 ± 1.954
0.0ProTrp: 0.0 ± 0.0
3.403ProTyr: 3.403 ± 0.649
0.0ProXaa: 0.0 ± 0.0
Gln
2.269GlnAla: 2.269 ± 0.639
2.836GlnCys: 2.836 ± 1.34
0.0GlnAsp: 0.0 ± 0.0
1.702GlnGlu: 1.702 ± 0.939
1.134GlnPhe: 1.134 ± 0.772
2.269GlnGly: 2.269 ± 0.636
2.269GlnHis: 2.269 ± 1.444
2.836GlnIle: 2.836 ± 0.475
3.403GlnLys: 3.403 ± 1.33
6.239GlnLeu: 6.239 ± 1.815
2.269GlnMet: 2.269 ± 0.636
0.567GlnAsn: 0.567 ± 0.386
2.269GlnPro: 2.269 ± 1.17
4.538GlnGln: 4.538 ± 1.214
1.134GlnArg: 1.134 ± 0.439
1.134GlnSer: 1.134 ± 0.439
1.702GlnThr: 1.702 ± 0.738
3.403GlnVal: 3.403 ± 0.649
0.0GlnTrp: 0.0 ± 0.0
0.567GlnTyr: 0.567 ± 0.435
0.0GlnXaa: 0.0 ± 0.0
Arg
2.836ArgAla: 2.836 ± 0.809
1.134ArgCys: 1.134 ± 1.514
1.702ArgAsp: 1.702 ± 1.158
2.836ArgGlu: 2.836 ± 0.973
2.269ArgPhe: 2.269 ± 1.046
3.403ArgGly: 3.403 ± 0.587
2.269ArgHis: 2.269 ± 0.636
2.269ArgIle: 2.269 ± 1.046
2.836ArgLys: 2.836 ± 0.863
3.403ArgLeu: 3.403 ± 1.3
2.269ArgMet: 2.269 ± 1.17
4.538ArgAsn: 4.538 ± 1.085
1.702ArgPro: 1.702 ± 0.939
2.836ArgGln: 2.836 ± 1.475
6.807ArgArg: 6.807 ± 3.176
0.0ArgSer: 0.0 ± 0.0
2.269ArgThr: 2.269 ± 0.636
3.971ArgVal: 3.971 ± 1.226
1.134ArgTrp: 1.134 ± 0.88
1.702ArgTyr: 1.702 ± 1.114
0.0ArgXaa: 0.0 ± 0.0
Ser
5.672SerAla: 5.672 ± 1.519
1.134SerCys: 1.134 ± 0.87
1.702SerAsp: 1.702 ± 1.158
2.269SerGlu: 2.269 ± 1.331
2.836SerPhe: 2.836 ± 1.93
7.374SerGly: 7.374 ± 1.172
0.567SerHis: 0.567 ± 0.435
2.836SerIle: 2.836 ± 1.34
1.702SerLys: 1.702 ± 0.703
5.105SerLeu: 5.105 ± 1.345
1.134SerMet: 1.134 ± 1.159
1.702SerAsn: 1.702 ± 0.703
3.971SerPro: 3.971 ± 1.226
2.836SerGln: 2.836 ± 0.488
3.971SerArg: 3.971 ± 1.226
6.807SerSer: 6.807 ± 0.662
4.538SerThr: 4.538 ± 1.94
2.836SerVal: 2.836 ± 1.194
0.0SerTrp: 0.0 ± 0.0
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.538ThrAla: 4.538 ± 2.522
2.269ThrCys: 2.269 ± 1.464
3.971ThrAsp: 3.971 ± 1.406
2.836ThrGlu: 2.836 ± 0.973
3.403ThrPhe: 3.403 ± 1.394
2.269ThrGly: 2.269 ± 0.592
1.702ThrHis: 1.702 ± 0.703
2.269ThrIle: 2.269 ± 1.046
0.0ThrLys: 0.0 ± 0.0
6.239ThrLeu: 6.239 ± 1.254
1.134ThrMet: 1.134 ± 0.772
2.836ThrAsn: 2.836 ± 1.418
4.538ThrPro: 4.538 ± 1.272
1.134ThrGln: 1.134 ± 0.439
2.269ThrArg: 2.269 ± 0.878
5.672ThrSer: 5.672 ± 1.146
3.971ThrThr: 3.971 ± 0.998
4.538ThrVal: 4.538 ± 1.544
1.702ThrTrp: 1.702 ± 1.114
1.134ThrTyr: 1.134 ± 0.848
0.0ThrXaa: 0.0 ± 0.0
Val
4.538ValAla: 4.538 ± 2.349
0.0ValCys: 0.0 ± 0.0
0.567ValAsp: 0.567 ± 0.386
3.403ValGlu: 3.403 ± 1.571
0.567ValPhe: 0.567 ± 0.386
3.971ValGly: 3.971 ± 1.601
1.134ValHis: 1.134 ± 0.439
5.105ValIle: 5.105 ± 1.448
4.538ValLys: 4.538 ± 1.544
7.941ValLeu: 7.941 ± 2.224
1.702ValMet: 1.702 ± 0.703
3.971ValAsn: 3.971 ± 0.947
3.403ValPro: 3.403 ± 1.19
1.702ValGln: 1.702 ± 0.953
1.134ValArg: 1.134 ± 0.87
2.269ValSer: 2.269 ± 0.756
6.807ValThr: 6.807 ± 1.341
3.403ValVal: 3.403 ± 0.5
0.567ValTrp: 0.567 ± 0.757
1.134ValTyr: 1.134 ± 0.439
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.702TrpGlu: 1.702 ± 0.939
1.702TrpPhe: 1.702 ± 0.704
3.971TrpGly: 3.971 ± 2.716
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.567TrpLeu: 0.567 ± 0.757
1.134TrpMet: 1.134 ± 0.88
1.134TrpAsn: 1.134 ± 0.704
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.702TrpArg: 1.702 ± 1.137
0.567TrpSer: 0.567 ± 0.435
0.567TrpThr: 0.567 ± 0.386
1.134TrpVal: 1.134 ± 0.88
0.0TrpTrp: 0.0 ± 0.0
1.702TrpTyr: 1.702 ± 0.703
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
2.269TyrCys: 2.269 ± 1.409
0.0TyrAsp: 0.0 ± 0.0
0.567TyrGlu: 0.567 ± 0.435
1.134TyrPhe: 1.134 ± 0.87
3.971TyrGly: 3.971 ± 1.557
0.567TyrHis: 0.567 ± 0.435
0.567TyrIle: 0.567 ± 0.386
3.403TyrLys: 3.403 ± 1.78
1.702TyrLeu: 1.702 ± 1.158
1.134TyrMet: 1.134 ± 0.439
1.702TyrAsn: 1.702 ± 0.847
1.702TyrPro: 1.702 ± 1.305
1.134TyrGln: 1.134 ± 0.772
1.702TyrArg: 1.702 ± 1.137
1.702TyrSer: 1.702 ± 0.703
1.702TyrThr: 1.702 ± 1.137
0.0TyrVal: 0.0 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
2.836TyrTyr: 2.836 ± 1.341
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1764 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
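A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. The page does not say which spread measure is reported, so the use of the sample standard deviation here is an assumption, as are the function names and the fixed random seed.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    # Sample standard deviation of the estimates (an assumption of this sketch).
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKLLA", "MKAA", "AALLK"]  # toy sequences, not real data
    print(dipeptide_freq(toy, "LL"), bootstrap_error(toy, "LL"))
```

With only a handful of proteins, as here, each resample can differ substantially from the original set, which is one reason the reported errors are large relative to the values.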

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski