Amino acid dipeptide frequency for LI polyomavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
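The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own implementation: the function name dipeptide_frequencies, the one-letter input sequences, and the choice to count pairs only within each protein (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted within each protein only, and any
    non-standard or unknown residue is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_frequencies(["MKLLA", "AAB"])
```

In the toy input, the non-standard residue B is mapped to X, so the second sequence contributes the pairs AA and AX.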

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
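As a rough illustration of this reasoning, the sketch below computes the uniform expectation and labels an observed value as over- or underrepresented. The function names are hypothetical, and whether the page's colouring uses the 20-letter threshold (2.5‰) or the 21-letter one (about 2.27‰) is not stated in the text, so the alphabet size is left as a parameter.

```python
def expected_uniform_permille(alphabet_size=20):
    """Expected frequency (per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label an observed frequency relative to the uniform expectation."""
    expected = expected_uniform_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# 400 dipeptides -> 2.5 per mille each, i.e. 0.25%.
uniform_20 = expected_uniform_permille(20)
# 441 dipeptides (with Xaa) -> roughly 2.27 per mille each.
uniform_21 = expected_uniform_permille(21)

# Two values taken from the table: LeuLeu and CysAla.
leu_leu = classify(12.561)
cys_ala = classify(0.546)
```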

All values are presented in per mille (‰); to obtain fractions, multiply them by 10^-3.
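For readers who prefer fractions or percentages, the unit conversion is a single multiplication. The helper names below are made up for this small sketch.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value_permille * 0.1


# LeuLeu from the table: 12.561 per mille.
leu_leu_fraction = permille_to_fraction(12.561)
leu_leu_percent = permille_to_percent(12.561)
```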

For more information, see the sequence space article on Wikipedia.

Residue order (first and second position): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by first residue; each entry reads FirstSecond: frequency ± error (‰).

Ala
AlaAla: 7.646 ± 1.776
AlaCys: 2.185 ± 0.981
AlaAsp: 3.277 ± 1.314
AlaGlu: 5.461 ± 2.459
AlaPhe: 2.185 ± 0.981
AlaGly: 2.185 ± 0.659
AlaHis: 2.185 ± 1.325
AlaIle: 3.823 ± 1.388
AlaLys: 2.185 ± 0.88
AlaLeu: 3.823 ± 1.521
AlaMet: 0.0 ± 0.0
AlaAsn: 1.638 ± 0.749
AlaPro: 2.731 ± 1.181
AlaGln: 1.638 ± 0.677
AlaArg: 3.277 ± 0.723
AlaSer: 5.461 ± 1.074
AlaThr: 2.185 ± 1.214
AlaVal: 2.731 ± 0.565
AlaTrp: 1.092 ± 0.662
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.546 ± 0.331
CysCys: 0.0 ± 0.0
CysAsp: 1.092 ± 0.662
CysGlu: 0.0 ± 0.0
CysPhe: 1.638 ± 0.824
CysGly: 1.092 ± 0.438
CysHis: 0.0 ± 0.0
CysIle: 1.092 ± 1.753
CysLys: 2.185 ± 0.88
CysLeu: 4.369 ± 2.386
CysMet: 0.0 ± 0.761
CysAsn: 2.185 ± 0.88
CysPro: 3.277 ± 1.248
CysGln: 0.546 ± 0.331
CysArg: 1.638 ± 0.824
CysSer: 2.731 ± 0.746
CysThr: 1.092 ± 0.662
CysVal: 0.546 ± 0.479
CysTrp: 0.0 ± 0.0
CysTyr: 1.092 ± 1.753
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.731 ± 0.913
AspCys: 0.546 ± 0.479
AspAsp: 1.638 ± 0.994
AspGlu: 1.638 ± 0.611
AspPhe: 1.638 ± 0.994
AspGly: 3.823 ± 1.063
AspHis: 2.185 ± 0.88
AspIle: 2.731 ± 0.746
AspLys: 1.638 ± 0.824
AspLeu: 4.369 ± 1.087
AspMet: 1.092 ± 0.438
AspAsn: 4.369 ± 1.954
AspPro: 6.008 ± 0.703
AspGln: 1.092 ± 0.879
AspArg: 1.638 ± 0.749
AspSer: 2.185 ± 0.88
AspThr: 1.638 ± 0.856
AspVal: 3.277 ± 1.088
AspTrp: 1.638 ± 0.749
AspTyr: 1.092 ± 0.879
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.461 ± 1.329
GluCys: 1.092 ± 0.783
GluAsp: 2.731 ± 0.724
GluGlu: 5.461 ± 1.247
GluPhe: 3.277 ± 0.621
GluGly: 2.185 ± 0.563
GluHis: 2.185 ± 1.125
GluIle: 2.731 ± 0.881
GluLys: 5.461 ± 1.71
GluLeu: 4.369 ± 0.972
GluMet: 1.638 ± 0.824
GluAsn: 6.008 ± 1.268
GluPro: 3.277 ± 1.495
GluGln: 2.731 ± 0.797
GluArg: 2.731 ± 0.724
GluSer: 3.277 ± 1.123
GluThr: 1.638 ± 0.856
GluVal: 4.369 ± 1.8
GluTrp: 0.546 ± 0.331
GluTyr: 2.185 ± 0.981
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.823 ± 1.851
PheCys: 1.638 ± 0.994
PheAsp: 1.092 ± 0.662
PheGlu: 3.277 ± 0.939
PhePhe: 2.185 ± 0.876
PheGly: 2.731 ± 0.724
PheHis: 1.092 ± 0.958
PheIle: 0.546 ± 0.537
PheLys: 3.277 ± 0.939
PheLeu: 4.915 ± 1.807
PheMet: 1.638 ± 0.62
PheAsn: 1.092 ± 0.958
PhePro: 2.731 ± 0.899
PheGln: 1.092 ± 0.875
PheArg: 1.092 ± 0.662
PheSer: 2.731 ± 0.915
PheThr: 4.369 ± 1.087
PheVal: 2.185 ± 0.887
PheTrp: 0.546 ± 0.877
PheTyr: 1.638 ± 1.32
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.638 ± 1.279
GlyCys: 1.638 ± 0.994
GlyAsp: 3.823 ± 1.013
GlyGlu: 3.823 ± 1.624
GlyPhe: 2.731 ± 0.915
GlyGly: 6.554 ± 0.878
GlyHis: 1.092 ± 0.875
GlyIle: 6.008 ± 2.724
GlyLys: 3.277 ± 0.996
GlyLeu: 6.008 ± 0.514
GlyMet: 2.731 ± 0.875
GlyAsn: 3.277 ± 1.734
GlyPro: 4.369 ± 0.949
GlyGln: 4.369 ± 1.237
GlyArg: 0.546 ± 0.479
GlySer: 3.823 ± 0.831
GlyThr: 4.369 ± 1.237
GlyVal: 2.185 ± 0.659
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.731 ± 1.409
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.092 ± 0.438
HisCys: 1.092 ± 1.753
HisAsp: 1.092 ± 0.958
HisGlu: 1.638 ± 0.994
HisPhe: 1.638 ± 0.611
HisGly: 0.0 ± 0.0
HisHis: 1.092 ± 0.783
HisIle: 0.0 ± 0.0
HisLys: 2.185 ± 0.659
HisLeu: 0.0 ± 0.0
HisMet: 0.546 ± 0.331
HisAsn: 1.092 ± 0.662
HisPro: 2.731 ± 0.959
HisGln: 1.638 ± 0.611
HisArg: 1.092 ± 0.662
HisSer: 1.638 ± 1.32
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 1.092 ± 0.662
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.185 ± 1.138
IleCys: 1.092 ± 0.438
IleAsp: 2.731 ± 1.011
IleGlu: 3.823 ± 2.478
IlePhe: 1.638 ± 0.994
IleGly: 2.731 ± 1.485
IleHis: 0.0 ± 0.0
IleIle: 2.731 ± 0.548
IleLys: 4.369 ± 0.395
IleLeu: 2.731 ± 1.393
IleMet: 0.546 ± 0.479
IleAsn: 4.369 ± 1.954
IlePro: 5.461 ± 0.403
IleGln: 1.092 ± 0.438
IleArg: 0.546 ± 0.331
IleSer: 1.638 ± 0.869
IleThr: 2.731 ± 1.01
IleVal: 2.731 ± 1.523
IleTrp: 1.092 ± 0.879
IleTyr: 1.638 ± 0.749
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.823 ± 0.869
LysCys: 2.185 ± 0.623
LysAsp: 2.731 ± 1.273
LysGlu: 4.369 ± 2.138
LysPhe: 2.731 ± 1.573
LysGly: 6.008 ± 0.936
LysHis: 2.185 ± 0.981
LysIle: 1.092 ± 0.783
LysLys: 4.369 ± 2.113
LysLeu: 6.554 ± 1.831
LysMet: 1.092 ± 0.662
LysAsn: 6.008 ± 1.976
LysPro: 2.731 ± 1.273
LysGln: 1.092 ± 0.438
LysArg: 4.369 ± 1.043
LysSer: 4.369 ± 1.26
LysThr: 5.461 ± 1.814
LysVal: 3.823 ± 1.815
LysTrp: 0.0 ± 0.0
LysTyr: 0.546 ± 0.331
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.915 ± 2.197
LeuCys: 3.277 ± 0.939
LeuAsp: 4.915 ± 1.17
LeuGlu: 7.1 ± 0.832
LeuPhe: 7.1 ± 0.853
LeuGly: 5.461 ± 2.838
LeuHis: 1.092 ± 0.783
LeuIle: 4.915 ± 2.114
LeuLys: 7.646 ± 2.556
LeuLeu: 12.561 ± 2.317
LeuMet: 4.369 ± 1.443
LeuAsn: 6.008 ± 1.691
LeuPro: 6.008 ± 1.532
LeuGln: 5.461 ± 1.466
LeuArg: 3.277 ± 1.495
LeuSer: 9.285 ± 4.059
LeuThr: 5.461 ± 1.27
LeuVal: 4.369 ± 2.276
LeuTrp: 2.731 ± 1.011
LeuTyr: 1.638 ± 0.994
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.546 ± 0.479
MetCys: 0.546 ± 0.331
MetAsp: 2.731 ± 1.393
MetGlu: 0.546 ± 0.331
MetPhe: 0.546 ± 0.331
MetGly: 1.092 ± 0.666
MetHis: 0.546 ± 0.331
MetIle: 1.092 ± 0.438
MetLys: 1.638 ± 0.824
MetLeu: 2.185 ± 0.758
MetMet: 1.638 ± 0.686
MetAsn: 1.092 ± 0.875
MetPro: 0.0 ± 0.0
MetGln: 1.638 ± 0.824
MetArg: 0.546 ± 0.479
MetSer: 1.092 ± 0.875
MetThr: 2.185 ± 1.459
MetVal: 1.638 ± 0.749
MetTrp: 1.092 ± 0.438
MetTyr: 0.546 ± 0.479
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.185 ± 0.981
AsnCys: 1.638 ± 0.824
AsnAsp: 2.185 ± 1.317
AsnGlu: 2.185 ± 0.876
AsnPhe: 2.731 ± 0.763
AsnGly: 1.638 ± 0.887
AsnHis: 1.092 ± 0.875
AsnIle: 4.369 ± 0.621
AsnLys: 4.915 ± 1.4
AsnLeu: 8.738 ± 2.703
AsnMet: 1.638 ± 0.887
AsnAsn: 4.369 ± 2.251
AsnPro: 2.185 ± 0.887
AsnGln: 3.277 ± 1.486
AsnArg: 2.185 ± 0.88
AsnSer: 4.915 ± 2.255
AsnThr: 3.277 ± 2.261
AsnVal: 5.461 ± 1.814
AsnTrp: 1.092 ± 0.438
AsnTyr: 3.277 ± 0.621
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.185 ± 1.325
ProCys: 0.546 ± 0.479
ProAsp: 4.369 ± 0.395
ProGlu: 5.461 ± 1.642
ProPhe: 1.092 ± 0.783
ProGly: 4.915 ± 1.302
ProHis: 1.638 ± 0.824
ProIle: 2.731 ± 1.01
ProLys: 7.1 ± 3.397
ProLeu: 9.831 ± 2.31
ProMet: 1.092 ± 0.958
ProAsn: 3.277 ± 0.696
ProPro: 6.554 ± 1.393
ProGln: 4.915 ± 0.223
ProArg: 3.823 ± 1.063
ProSer: 4.915 ± 1.833
ProThr: 0.546 ± 0.479
ProVal: 4.369 ± 1.774
ProTrp: 0.0 ± 0.0
ProTyr: 0.546 ± 0.479
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.823 ± 0.587
GlnCys: 0.0 ± 0.0
GlnAsp: 4.915 ± 1.629
GlnGlu: 3.277 ± 0.996
GlnPhe: 2.185 ± 0.858
GlnGly: 2.185 ± 1.317
GlnHis: 1.092 ± 0.783
GlnIle: 1.092 ± 0.438
GlnLys: 3.823 ± 1.086
GlnLeu: 4.369 ± 1.433
GlnMet: 0.0 ± 0.0
GlnAsn: 3.277 ± 1.123
GlnPro: 2.185 ± 1.317
GlnGln: 2.185 ± 0.758
GlnArg: 0.546 ± 0.331
GlnSer: 1.092 ± 0.662
GlnThr: 3.823 ± 1.326
GlnVal: 0.546 ± 0.479
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.546 ± 0.479
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.638 ± 0.824
ArgCys: 1.092 ± 0.879
ArgAsp: 1.638 ± 0.611
ArgGlu: 1.638 ± 0.611
ArgPhe: 0.546 ± 0.331
ArgGly: 3.823 ± 0.831
ArgHis: 0.546 ± 0.331
ArgIle: 0.546 ± 0.479
ArgLys: 3.277 ± 1.222
ArgLeu: 2.185 ± 0.758
ArgMet: 0.546 ± 0.479
ArgAsn: 3.277 ± 0.621
ArgPro: 2.731 ± 1.211
ArgGln: 1.638 ± 0.824
ArgArg: 2.731 ± 1.211
ArgSer: 4.369 ± 1.68
ArgThr: 1.092 ± 0.438
ArgVal: 2.185 ± 0.876
ArgTrp: 2.185 ± 0.659
ArgTyr: 2.731 ± 1.787
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.185 ± 0.828
SerCys: 1.638 ± 0.994
SerAsp: 1.092 ± 0.783
SerGlu: 2.185 ± 1.325
SerPhe: 4.369 ± 1.103
SerGly: 7.646 ± 1.76
SerHis: 0.546 ± 0.331
SerIle: 3.823 ± 1.527
SerLys: 2.185 ± 0.623
SerLeu: 10.377 ± 3.794
SerMet: 1.092 ± 0.875
SerAsn: 3.823 ± 0.88
SerPro: 4.915 ± 1.194
SerGln: 4.369 ± 1.76
SerArg: 3.823 ± 0.831
SerSer: 7.1 ± 1.963
SerThr: 7.646 ± 2.172
SerVal: 2.185 ± 0.563
SerTrp: 0.0 ± 0.0
SerTyr: 1.092 ± 0.958
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.823 ± 1.045
ThrCys: 2.185 ± 0.659
ThrAsp: 2.185 ± 0.876
ThrGlu: 2.731 ± 0.565
ThrPhe: 2.185 ± 0.623
ThrGly: 2.731 ± 1.409
ThrHis: 0.0 ± 0.0
ThrIle: 1.638 ± 0.677
ThrLys: 1.092 ± 0.958
ThrLeu: 9.285 ± 2.914
ThrMet: 1.638 ± 0.603
ThrAsn: 1.638 ± 0.887
ThrPro: 6.554 ± 0.569
ThrGln: 1.092 ± 0.438
ThrArg: 2.185 ± 0.88
ThrSer: 4.915 ± 1.488
ThrThr: 3.823 ± 1.38
ThrVal: 4.369 ± 1.707
ThrTrp: 1.638 ± 1.32
ThrTyr: 2.731 ± 0.746
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.915 ± 1.276
ValCys: 1.638 ± 0.686
ValAsp: 1.092 ± 1.022
ValGlu: 2.731 ± 1.181
ValPhe: 0.546 ± 0.331
ValGly: 1.638 ± 1.438
ValHis: 0.546 ± 0.479
ValIle: 2.731 ± 0.724
ValLys: 2.185 ± 1.317
ValLeu: 8.738 ± 1.076
ValMet: 1.092 ± 0.662
ValAsn: 3.277 ± 1.713
ValPro: 4.369 ± 3.214
ValGln: 0.0 ± 0.0
ValArg: 1.638 ± 0.887
ValSer: 4.369 ± 1.4
ValThr: 5.461 ± 2.471
ValVal: 1.092 ± 0.438
ValTrp: 0.0 ± 0.0
ValTyr: 1.638 ± 0.887
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.546 ± 0.479
TrpAsp: 0.546 ± 0.331
TrpGlu: 4.369 ± 2.768
TrpPhe: 0.546 ± 0.877
TrpGly: 1.638 ± 1.629
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.546 ± 0.331
TrpLeu: 1.638 ± 0.749
TrpMet: 0.0 ± 0.0
TrpAsn: 0.546 ± 0.331
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.546 ± 0.331
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 1.092 ± 0.438
TrpTrp: 0.546 ± 0.331
TrpTyr: 1.638 ± 0.611
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 0.546 ± 0.479
TyrCys: 1.092 ± 1.753
TyrAsp: 1.092 ± 0.958
TyrGlu: 1.638 ± 0.611
TyrPhe: 2.185 ± 1.125
TyrGly: 4.915 ± 0.654
TyrHis: 0.546 ± 0.479
TyrIle: 2.185 ± 0.758
TyrLys: 2.185 ± 0.981
TyrLeu: 0.546 ± 0.331
TyrMet: 0.0 ± 0.0
TyrAsn: 2.731 ± 1.011
TyrPro: 1.092 ± 0.958
TyrGln: 1.092 ± 0.783
TyrArg: 2.185 ± 0.876
TyrSer: 2.185 ± 0.876
TyrThr: 1.638 ± 1.279
TyrVal: 0.546 ± 0.479
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.092 ± 0.879
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 5 proteins (1832 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
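One plausible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The page does not give the exact procedure, so the use of the standard deviation, the function names, the fixed seed, and the toy sequences are all assumptions of this example.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome of five short sequences (one-letter codes).
toy = ["MKLLA", "LLLLG", "AAGKT", "MLLKA", "GGGLL"]
err = bootstrap_error(toy, "LL")
```

With only a handful of proteins, as in this proteome, each resample can differ markedly from the original set, which is consistent with the relatively large errors shown in the table.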

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski