Amino acid dipepetide frequency for Taro bacilliform CH virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.332AlaAla: 3.332 ± 1.076
0.416AlaCys: 0.416 ± 0.222
2.915AlaAsp: 2.915 ± 1.043
2.082AlaGlu: 2.082 ± 1.111
2.499AlaPhe: 2.499 ± 1.058
2.082AlaGly: 2.082 ± 0.839
2.082AlaHis: 2.082 ± 4.027
7.08AlaIle: 7.08 ± 2.156
2.082AlaLys: 2.082 ± 0.839
5.831AlaLeu: 5.831 ± 3.124
2.082AlaMet: 2.082 ± 0.978
1.249AlaAsn: 1.249 ± 0.925
1.666AlaPro: 1.666 ± 1.306
3.332AlaGln: 3.332 ± 2.849
3.748AlaArg: 3.748 ± 1.468
4.998AlaSer: 4.998 ± 2.666
3.748AlaThr: 3.748 ± 1.401
3.332AlaVal: 3.332 ± 1.283
0.833AlaTrp: 0.833 ± 0.444
3.332AlaTyr: 3.332 ± 1.778
0.0AlaXaa: 0.0 ± 0.0
Cys
1.249CysAla: 1.249 ± 0.667
1.249CysCys: 1.249 ± 1.013
1.249CysAsp: 1.249 ± 0.667
0.416CysGlu: 0.416 ± 0.222
1.249CysPhe: 1.249 ± 1.013
0.416CysGly: 0.416 ± 0.222
0.833CysHis: 0.833 ± 0.444
0.416CysIle: 0.416 ± 0.222
2.499CysLys: 2.499 ± 1.058
1.249CysLeu: 1.249 ± 1.36
0.833CysMet: 0.833 ± 0.749
1.249CysAsn: 1.249 ± 0.667
0.833CysPro: 0.833 ± 0.444
0.833CysGln: 0.833 ± 0.444
0.833CysArg: 0.833 ± 0.444
0.416CysSer: 0.416 ± 0.222
1.249CysThr: 1.249 ± 0.909
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.833CysTyr: 0.833 ± 0.444
0.0CysXaa: 0.0 ± 0.0
Asp
2.082AspAla: 2.082 ± 0.979
2.499AspCys: 2.499 ± 1.333
5.414AspAsp: 5.414 ± 2.278
3.748AspGlu: 3.748 ± 2.0
1.666AspPhe: 1.666 ± 0.889
2.915AspGly: 2.915 ± 1.024
0.416AspHis: 0.416 ± 1.06
2.082AspIle: 2.082 ± 1.111
2.082AspLys: 2.082 ± 0.922
4.998AspLeu: 4.998 ± 0.906
0.833AspMet: 0.833 ± 0.444
4.581AspAsn: 4.581 ± 1.804
2.082AspPro: 2.082 ± 1.111
2.082AspGln: 2.082 ± 0.839
2.499AspArg: 2.499 ± 1.85
0.416AspSer: 0.416 ± 0.222
2.915AspThr: 2.915 ± 1.11
0.833AspVal: 0.833 ± 0.444
0.416AspTrp: 0.416 ± 0.222
1.666AspTyr: 1.666 ± 0.889
0.0AspXaa: 0.0 ± 0.0
Glu
9.163GluAla: 9.163 ± 1.628
1.249GluCys: 1.249 ± 0.909
2.915GluAsp: 2.915 ± 1.114
9.163GluGlu: 9.163 ± 3.085
0.833GluPhe: 0.833 ± 0.444
3.332GluGly: 3.332 ± 1.778
2.082GluHis: 2.082 ± 0.922
6.247GluIle: 6.247 ± 1.252
5.831GluLys: 5.831 ± 1.705
7.08GluLeu: 7.08 ± 2.913
2.082GluMet: 2.082 ± 2.971
3.748GluAsn: 3.748 ± 1.216
0.416GluPro: 0.416 ± 0.222
2.915GluGln: 2.915 ± 1.817
4.165GluArg: 4.165 ± 2.095
2.915GluSer: 2.915 ± 1.808
2.915GluThr: 2.915 ± 1.032
4.165GluVal: 4.165 ± 1.168
2.082GluTrp: 2.082 ± 1.917
2.915GluTyr: 2.915 ± 1.011
0.0GluXaa: 0.0 ± 0.0
Phe
2.915PheAla: 2.915 ± 1.165
1.249PheCys: 1.249 ± 1.013
1.249PheAsp: 1.249 ± 0.667
1.666PheGlu: 1.666 ± 0.889
0.416PhePhe: 0.416 ± 0.222
2.082PheGly: 2.082 ± 0.979
1.249PheHis: 1.249 ± 0.909
3.748PheIle: 3.748 ± 1.424
3.748PheLys: 3.748 ± 1.303
2.082PheLeu: 2.082 ± 2.578
0.833PheMet: 0.833 ± 0.444
1.666PheAsn: 1.666 ± 0.889
1.666PhePro: 1.666 ± 0.889
2.082PheGln: 2.082 ± 1.111
2.499PheArg: 2.499 ± 1.086
1.666PheSer: 1.666 ± 0.908
1.249PheThr: 1.249 ± 0.667
0.0PheVal: 0.0 ± 0.0
0.833PheTrp: 0.833 ± 0.444
0.833PheTyr: 0.833 ± 0.444
0.0PheXaa: 0.0 ± 0.0
Gly
2.082GlyAla: 2.082 ± 1.111
0.833GlyCys: 0.833 ± 0.444
3.748GlyAsp: 3.748 ± 1.517
3.748GlyGlu: 3.748 ± 1.417
1.249GlyPhe: 1.249 ± 0.667
3.332GlyGly: 3.332 ± 1.144
0.416GlyHis: 0.416 ± 0.222
1.666GlyIle: 1.666 ± 0.929
4.581GlyLys: 4.581 ± 1.864
4.165GlyLeu: 4.165 ± 1.196
1.666GlyMet: 1.666 ± 2.22
2.499GlyAsn: 2.499 ± 0.942
3.748GlyPro: 3.748 ± 3.207
1.249GlyGln: 1.249 ± 0.755
2.915GlyArg: 2.915 ± 1.555
1.666GlySer: 1.666 ± 0.897
5.414GlyThr: 5.414 ± 1.002
3.748GlyVal: 3.748 ± 0.996
0.833GlyTrp: 0.833 ± 0.444
3.748GlyTyr: 3.748 ± 2.0
0.0GlyXaa: 0.0 ± 0.0
His
2.915HisAla: 2.915 ± 1.619
0.416HisCys: 0.416 ± 0.222
1.666HisAsp: 1.666 ± 0.889
0.833HisGlu: 0.833 ± 0.962
0.833HisPhe: 0.833 ± 1.099
1.249HisGly: 1.249 ± 0.667
0.833HisHis: 0.833 ± 1.099
3.748HisIle: 3.748 ± 1.073
1.249HisLys: 1.249 ± 1.36
3.748HisLeu: 3.748 ± 1.216
0.416HisMet: 0.416 ± 1.251
1.666HisAsn: 1.666 ± 0.897
1.666HisPro: 1.666 ± 2.331
0.416HisGln: 0.416 ± 0.222
2.082HisArg: 2.082 ± 1.111
1.666HisSer: 1.666 ± 2.198
0.833HisThr: 0.833 ± 0.962
0.833HisVal: 0.833 ± 1.003
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.414IleAla: 5.414 ± 2.187
1.249IleCys: 1.249 ± 0.667
3.332IleAsp: 3.332 ± 1.778
4.998IleGlu: 4.998 ± 1.917
2.082IlePhe: 2.082 ± 0.979
2.915IleGly: 2.915 ± 1.555
0.833IleHis: 0.833 ± 0.444
4.165IleIle: 4.165 ± 1.457
3.332IleLys: 3.332 ± 1.283
6.664IleLeu: 6.664 ± 1.594
0.833IleMet: 0.833 ± 0.962
3.332IleAsn: 3.332 ± 1.283
5.414IlePro: 5.414 ± 1.611
5.414IleGln: 5.414 ± 2.104
2.915IleArg: 2.915 ± 1.134
5.414IleSer: 5.414 ± 2.278
5.414IleThr: 5.414 ± 1.659
2.915IleVal: 2.915 ± 1.555
0.416IleTrp: 0.416 ± 0.222
1.666IleTyr: 1.666 ± 0.889
0.0IleXaa: 0.0 ± 0.0
Lys
2.082LysAla: 2.082 ± 1.547
1.666LysCys: 1.666 ± 0.767
2.499LysAsp: 2.499 ± 1.058
6.664LysGlu: 6.664 ± 3.432
2.915LysPhe: 2.915 ± 1.12
3.332LysGly: 3.332 ± 1.533
1.249LysHis: 1.249 ± 0.667
4.581LysIle: 4.581 ± 1.523
4.998LysLys: 4.998 ± 3.03
5.831LysLeu: 5.831 ± 4.1
2.082LysMet: 2.082 ± 1.111
5.831LysAsn: 5.831 ± 1.705
5.414LysPro: 5.414 ± 2.433
3.332LysGln: 3.332 ± 1.135
2.082LysArg: 2.082 ± 1.111
4.581LysSer: 4.581 ± 2.117
2.082LysThr: 2.082 ± 2.102
3.748LysVal: 3.748 ± 1.216
0.416LysTrp: 0.416 ± 0.222
1.249LysTyr: 1.249 ± 0.667
0.0LysXaa: 0.0 ± 0.0
Leu
5.414LeuAla: 5.414 ± 3.697
1.249LeuCys: 1.249 ± 0.667
2.082LeuAsp: 2.082 ± 0.922
10.412LeuGlu: 10.412 ± 5.612
2.082LeuPhe: 2.082 ± 0.909
6.247LeuGly: 6.247 ± 2.726
2.915LeuHis: 2.915 ± 1.803
3.748LeuIle: 3.748 ± 1.592
6.664LeuLys: 6.664 ± 3.005
7.497LeuLeu: 7.497 ± 9.096
1.666LeuMet: 1.666 ± 1.416
2.499LeuAsn: 2.499 ± 1.221
4.581LeuPro: 4.581 ± 1.855
5.414LeuGln: 5.414 ± 3.059
4.998LeuArg: 4.998 ± 1.7
8.33LeuSer: 8.33 ± 3.506
3.748LeuThr: 3.748 ± 2.647
4.581LeuVal: 4.581 ± 1.294
0.416LeuTrp: 0.416 ± 1.22
4.165LeuTyr: 4.165 ± 1.01
0.0LeuXaa: 0.0 ± 0.0
Met
0.833MetAla: 0.833 ± 0.962
0.0MetCys: 0.0 ± 0.0
0.833MetAsp: 0.833 ± 0.444
4.165MetGlu: 4.165 ± 3.717
0.833MetPhe: 0.833 ± 1.099
1.249MetGly: 1.249 ± 1.013
1.249MetHis: 1.249 ± 0.667
2.082MetIle: 2.082 ± 0.909
2.499MetLys: 2.499 ± 4.709
1.249MetLeu: 1.249 ± 0.667
0.833MetMet: 0.833 ± 1.11
1.666MetAsn: 1.666 ± 0.889
0.833MetPro: 0.833 ± 0.444
2.499MetGln: 2.499 ± 1.036
0.833MetArg: 0.833 ± 0.444
2.082MetSer: 2.082 ± 2.53
2.499MetThr: 2.499 ± 2.51
1.249MetVal: 1.249 ± 1.702
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.581AsnAla: 4.581 ± 1.523
0.416AsnCys: 0.416 ± 0.222
1.666AsnAsp: 1.666 ± 0.889
2.082AsnGlu: 2.082 ± 0.839
2.499AsnPhe: 2.499 ± 0.998
1.249AsnGly: 1.249 ± 0.667
0.833AsnHis: 0.833 ± 1.003
1.666AsnIle: 1.666 ± 0.889
2.499AsnLys: 2.499 ± 0.959
7.913AsnLeu: 7.913 ± 2.045
0.416AsnMet: 0.416 ± 0.222
1.666AsnAsn: 1.666 ± 0.767
2.915AsnPro: 2.915 ± 1.165
1.249AsnGln: 1.249 ± 0.667
1.666AsnArg: 1.666 ± 1.508
3.748AsnSer: 3.748 ± 1.063
3.748AsnThr: 3.748 ± 1.786
2.082AsnVal: 2.082 ± 1.111
0.833AsnTrp: 0.833 ± 1.11
3.332AsnTyr: 3.332 ± 1.817
0.0AsnXaa: 0.0 ± 0.0
Pro
2.915ProAla: 2.915 ± 1.506
0.0ProCys: 0.0 ± 0.0
2.915ProAsp: 2.915 ± 1.11
2.915ProGlu: 2.915 ± 1.114
1.666ProPhe: 1.666 ± 0.889
2.915ProGly: 2.915 ± 1.555
0.833ProHis: 0.833 ± 1.099
3.332ProIle: 3.332 ± 1.778
1.666ProLys: 1.666 ± 1.205
6.247ProLeu: 6.247 ± 4.694
1.666ProMet: 1.666 ± 1.758
3.332ProAsn: 3.332 ± 1.135
2.499ProPro: 2.499 ± 1.333
3.748ProGln: 3.748 ± 1.291
2.082ProArg: 2.082 ± 1.338
3.332ProSer: 3.332 ± 1.259
2.082ProThr: 2.082 ± 1.111
0.833ProVal: 0.833 ± 0.444
0.0ProTrp: 0.0 ± 0.0
2.915ProTyr: 2.915 ± 1.972
0.0ProXaa: 0.0 ± 0.0
Gln
2.915GlnAla: 2.915 ± 1.011
0.833GlnCys: 0.833 ± 0.962
2.915GlnAsp: 2.915 ± 1.642
6.247GlnGlu: 6.247 ± 1.55
1.249GlnPhe: 1.249 ± 0.755
3.748GlnGly: 3.748 ± 1.424
2.499GlnHis: 2.499 ± 1.058
3.748GlnIle: 3.748 ± 1.073
3.748GlnLys: 3.748 ± 2.228
6.247GlnLeu: 6.247 ± 2.71
2.082GlnMet: 2.082 ± 0.961
1.666GlnAsn: 1.666 ± 0.767
2.082GlnPro: 2.082 ± 0.922
4.581GlnGln: 4.581 ± 1.071
2.915GlnArg: 2.915 ± 1.11
1.666GlnSer: 1.666 ± 1.543
2.915GlnThr: 2.915 ± 1.11
3.332GlnVal: 3.332 ± 1.135
0.416GlnTrp: 0.416 ± 0.222
1.249GlnTyr: 1.249 ± 0.909
0.0GlnXaa: 0.0 ± 0.0
Arg
0.833ArgAla: 0.833 ± 1.099
0.833ArgCys: 0.833 ± 0.444
2.082ArgAsp: 2.082 ± 0.979
3.748ArgGlu: 3.748 ± 2.444
1.666ArgPhe: 1.666 ± 1.527
2.082ArgGly: 2.082 ± 1.111
2.082ArgHis: 2.082 ± 1.366
4.581ArgIle: 4.581 ± 1.071
2.915ArgLys: 2.915 ± 1.043
4.165ArgLeu: 4.165 ± 1.168
2.082ArgMet: 2.082 ± 0.909
2.082ArgAsn: 2.082 ± 1.181
2.499ArgPro: 2.499 ± 1.333
2.915ArgGln: 2.915 ± 1.555
4.581ArgArg: 4.581 ± 1.893
4.998ArgSer: 4.998 ± 1.986
4.165ArgThr: 4.165 ± 1.211
4.165ArgVal: 4.165 ± 2.86
2.082ArgTrp: 2.082 ± 1.376
1.666ArgTyr: 1.666 ± 0.889
0.0ArgXaa: 0.0 ± 0.0
Ser
2.499SerAla: 2.499 ± 0.959
0.833SerCys: 0.833 ± 0.444
3.748SerAsp: 3.748 ± 1.073
3.332SerGlu: 3.332 ± 1.793
2.915SerPhe: 2.915 ± 1.11
4.581SerGly: 4.581 ± 4.159
1.249SerHis: 1.249 ± 0.755
5.414SerIle: 5.414 ± 1.923
5.414SerLys: 5.414 ± 1.714
3.748SerLeu: 3.748 ± 1.291
1.249SerMet: 1.249 ± 0.787
2.082SerAsn: 2.082 ± 0.909
2.499SerPro: 2.499 ± 1.333
4.581SerGln: 4.581 ± 3.588
4.165SerArg: 4.165 ± 1.066
5.414SerSer: 5.414 ± 1.957
5.831SerThr: 5.831 ± 0.9
3.332SerVal: 3.332 ± 2.071
0.416SerTrp: 0.416 ± 0.222
2.915SerTyr: 2.915 ± 1.366
0.0SerXaa: 0.0 ± 0.0
Thr
2.915ThrAla: 2.915 ± 1.532
1.249ThrCys: 1.249 ± 0.667
2.082ThrAsp: 2.082 ± 1.111
3.332ThrGlu: 3.332 ± 2.297
0.833ThrPhe: 0.833 ± 0.444
4.581ThrGly: 4.581 ± 1.295
1.666ThrHis: 1.666 ± 0.929
5.414ThrIle: 5.414 ± 2.294
3.332ThrLys: 3.332 ± 1.353
4.165ThrLeu: 4.165 ± 4.242
2.915ThrMet: 2.915 ± 3.226
1.666ThrAsn: 1.666 ± 0.971
2.082ThrPro: 2.082 ± 0.909
5.831ThrGln: 5.831 ± 3.087
3.748ThrArg: 3.748 ± 2.879
4.581ThrSer: 4.581 ± 2.565
6.247ThrThr: 6.247 ± 4.099
1.666ThrVal: 1.666 ± 0.929
0.416ThrTrp: 0.416 ± 0.222
2.915ThrTyr: 2.915 ± 1.583
0.0ThrXaa: 0.0 ± 0.0
Val
2.499ValAla: 2.499 ± 1.333
0.833ValCys: 0.833 ± 1.003
1.249ValAsp: 1.249 ± 0.667
3.332ValGlu: 3.332 ± 2.071
4.581ValPhe: 4.581 ± 1.864
2.499ValGly: 2.499 ± 2.026
2.915ValHis: 2.915 ± 2.161
1.666ValIle: 1.666 ± 0.889
1.666ValLys: 1.666 ± 2.399
1.666ValLeu: 1.666 ± 0.889
1.666ValMet: 1.666 ± 2.22
0.833ValAsn: 0.833 ± 0.444
2.499ValPro: 2.499 ± 1.333
3.748ValGln: 3.748 ± 0.971
3.748ValArg: 3.748 ± 1.148
4.998ValSer: 4.998 ± 1.917
1.249ValThr: 1.249 ± 0.999
2.915ValVal: 2.915 ± 1.808
0.416ValTrp: 0.416 ± 0.222
1.666ValTyr: 1.666 ± 0.897
0.0ValXaa: 0.0 ± 0.0
Trp
0.416TrpAla: 0.416 ± 0.222
0.416TrpCys: 0.416 ± 0.222
0.833TrpAsp: 0.833 ± 0.444
1.249TrpGlu: 1.249 ± 0.925
0.416TrpPhe: 0.416 ± 0.222
0.833TrpGly: 0.833 ± 0.962
0.0TrpHis: 0.0 ± 0.0
0.833TrpIle: 0.833 ± 0.444
0.833TrpLys: 0.833 ± 0.444
0.833TrpLeu: 0.833 ± 0.444
0.0TrpMet: 0.0 ± 0.0
0.833TrpAsn: 0.833 ± 1.099
0.416TrpPro: 0.416 ± 0.222
0.0TrpGln: 0.0 ± 0.0
1.249TrpArg: 1.249 ± 1.013
0.416TrpSer: 0.416 ± 0.222
1.249TrpThr: 1.249 ± 1.671
1.249TrpVal: 1.249 ± 0.667
0.416TrpTrp: 0.416 ± 0.222
0.416TrpTyr: 0.416 ± 1.12
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.666TyrAla: 1.666 ± 0.889
0.416TyrCys: 0.416 ± 1.06
0.833TyrAsp: 0.833 ± 1.003
2.082TyrGlu: 2.082 ± 1.111
1.249TyrPhe: 1.249 ± 0.909
1.666TyrGly: 1.666 ± 0.889
0.833TyrHis: 0.833 ± 1.099
2.499TyrIle: 2.499 ± 1.333
4.998TyrLys: 4.998 ± 1.404
3.332TyrLeu: 3.332 ± 1.057
0.833TyrMet: 0.833 ± 0.444
2.915TyrAsn: 2.915 ± 1.555
2.082TyrPro: 2.082 ± 1.111
0.833TyrGln: 0.833 ± 0.444
2.499TyrArg: 2.499 ± 1.036
3.332TyrSer: 3.332 ± 1.365
2.082TyrThr: 2.082 ± 2.412
1.666TyrVal: 1.666 ± 0.889
1.666TyrTrp: 1.666 ± 0.908
2.082TyrTyr: 2.082 ± 1.111
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2402 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski