Amino acid dipeptide frequency for Hubei tombus-like virus 17

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
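
The counting described above can be sketched in Python. This is a minimal illustration, not the code used by Proteome-pI: the function name `dipeptide_per_mille` is hypothetical, and the sketch assumes that dipeptides are counted as overlapping adjacent pairs within each protein (never across protein boundaries) and that any letter outside the 20 standard one-letter codes is treated as Xaa (X).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_PER_MILLE = 1000.0 / len(STANDARD) ** 2  # 2.5 per mille = 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard letter to X before counting.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }
```

A dipeptide whose value exceeds `UNIFORM_PER_MILLE` would be overrepresented relative to the uniform expectation, and one below it underrepresented.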

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
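
As a small worked example of the unit conversion, the following Python sketch turns a per mille value into a fraction and compares it with the uniform expectation for 400 dipeptides. Both function names are hypothetical helpers introduced only for illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def fold_over_uniform(value_per_mille, n_amino_acids=20):
    """Ratio of an observed frequency to the uniform expectation.

    With 20 amino acids the uniform expectation is 1000 / 400 = 2.5 per mille.
    """
    expected = 1000.0 / n_amino_acids ** 2
    return value_per_mille / expected
```

For instance, a table value of 8.357‰ corresponds to a fraction of 0.008357, which is roughly 3.3 times the uniform expectation of 2.5‰.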

For more information see sequence space article on Wikipedia.

Rows: first amino acid of the dipeptide; columns: second amino acid. Each cell: frequency ± error (‰).

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 8.357 ± 5.553 | 0.929 ± 0.585 | 2.786 ± 1.756 | 0.929 ± 0.585 | 7.428 ± 2.493 | 3.714 ± 1.339 | 0.929 ± 0.585 | 2.786 ± 0.835 | 1.857 ± 1.171 | 1.857 ± 0.706 | 5.571 ± 1.337 | 1.857 ± 1.115 | 1.857 ± 0.585 | 5.571 ± 1.512 | 4.643 ± 2.155 | 1.857 ± 0.706 | 3.714 ± 2.49 | 3.714 ± 1.411 | 1.857 ± 0.706 | 0.929 ± 0.82 | 0.0 ± 0.0
Cys | 0.929 ± 0.585 | 0.929 ± 0.82 | 0.929 ± 0.883 | 1.857 ± 1.171 | 1.857 ± 0.585 | 0.0 ± 0.0 | 0.929 ± 0.585 | 2.786 ± 1.756 | 0.0 ± 0.0 | 1.857 ± 0.585 | 0.0 ± 0.0 | 3.714 ± 1.17 | 1.857 ± 0.585 | 1.857 ± 1.171 | 0.929 ± 0.585 | 0.929 ± 0.585 | 0.0 ± 0.0 | 1.857 ± 0.706 | 0.929 ± 0.585 | 0.929 ± 0.82 | 0.0 ± 0.0
Asp | 0.929 ± 0.585 | 1.857 ± 1.171 | 1.857 ± 1.171 | 2.786 ± 0.835 | 0.929 ± 0.82 | 3.714 ± 1.411 | 1.857 ± 1.171 | 6.5 ± 1.583 | 1.857 ± 1.171 | 4.643 ± 0.692 | 0.0 ± 0.0 | 1.857 ± 0.585 | 1.857 ± 1.171 | 0.929 ± 0.82 | 2.786 ± 0.949 | 1.857 ± 1.171 | 3.714 ± 2.341 | 2.786 ± 0.835 | 0.929 ± 0.585 | 0.0 ± 0.0 | 0.0 ± 0.0
Glu | 1.857 ± 0.585 | 0.929 ± 0.585 | 4.643 ± 1.975 | 4.643 ± 1.317 | 4.643 ± 1.567 | 1.857 ± 1.171 | 0.929 ± 0.82 | 1.857 ± 0.585 | 3.714 ± 1.192 | 3.714 ± 0.134 | 0.0 ± 0.0 | 1.857 ± 0.585 | 3.714 ± 2.092 | 3.714 ± 1.17 | 0.0 ± 0.0 | 3.714 ± 2.341 | 0.0 ± 0.0 | 7.428 ± 1.092 | 0.0 ± 0.0 | 0.929 ± 0.82 | 0.0 ± 0.0
Phe | 1.857 ± 1.766 | 0.929 ± 0.585 | 3.714 ± 2.341 | 2.786 ± 0.835 | 0.929 ± 0.585 | 2.786 ± 1.746 | 0.0 ± 0.0 | 2.786 ± 1.756 | 5.571 ± 2.495 | 0.0 ± 0.0 | 2.786 ± 1.259 | 3.714 ± 0.134 | 0.929 ± 0.82 | 0.929 ± 0.585 | 3.714 ± 1.17 | 4.643 ± 0.948 | 3.714 ± 0.134 | 3.714 ± 1.192 | 0.0 ± 0.0 | 0.929 ± 0.585 | 0.0 ± 0.0
Gly | 0.929 ± 0.585 | 0.929 ± 0.585 | 2.786 ± 0.949 | 1.857 ± 1.115 | 2.786 ± 1.756 | 2.786 ± 1.488 | 0.0 ± 0.0 | 4.643 ± 1.861 | 3.714 ± 1.318 | 3.714 ± 0.134 | 0.0 ± 0.0 | 2.786 ± 0.949 | 2.786 ± 1.836 | 4.643 ± 2.204 | 11.142 ± 8.366 | 4.643 ± 2.155 | 6.5 ± 2.84 | 3.714 ± 3.532 | 0.0 ± 0.0 | 1.857 ± 0.585 | 0.0 ± 0.0
His | 2.786 ± 1.299 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.929 ± 0.585 | 1.857 ± 1.171 | 1.857 ± 0.585 | 1.857 ± 1.64 | 2.786 ± 0.835 | 0.929 ± 0.585 | 2.786 ± 0.835 | 0.0 ± 0.0 | 0.929 ± 0.585 | 1.857 ± 1.115 | 1.857 ± 1.64 | 2.786 ± 0.536 | 1.857 ± 0.585 | 1.857 ± 1.64 | 0.0 ± 0.0 | 0.929 ± 0.585 | 2.786 ± 0.835 | 0.0 ± 0.0
Ile | 2.786 ± 0.949 | 2.786 ± 1.299 | 3.714 ± 1.411 | 2.786 ± 1.756 | 3.714 ± 0.134 | 1.857 ± 0.706 | 3.714 ± 1.17 | 2.786 ± 1.299 | 5.571 ± 1.239 | 2.786 ± 1.299 | 2.786 ± 0.536 | 1.857 ± 1.171 | 1.857 ± 1.64 | 2.786 ± 0.949 | 1.857 ± 1.64 | 6.5 ± 2.332 | 5.571 ± 1.239 | 3.714 ± 2.341 | 0.929 ± 0.585 | 1.857 ± 1.171 | 0.0 ± 0.0
Lys | 1.857 ± 1.171 | 0.929 ± 0.82 | 1.857 ± 1.171 | 3.714 ± 2.092 | 1.857 ± 1.171 | 4.643 ± 1.861 | 0.0 ± 0.0 | 4.643 ± 1.84 | 3.714 ± 2.341 | 4.643 ± 0.659 | 3.714 ± 1.192 | 2.786 ± 0.536 | 0.929 ± 0.585 | 1.857 ± 1.171 | 2.786 ± 0.835 | 4.643 ± 1.567 | 2.786 ± 0.949 | 3.714 ± 1.411 | 0.0 ± 0.0 | 8.357 ± 2.992 | 0.0 ± 0.0
Leu | 9.285 ± 1.317 | 4.643 ± 2.927 | 2.786 ± 1.756 | 3.714 ± 1.192 | 2.786 ± 1.746 | 5.571 ± 2.117 | 0.929 ± 0.585 | 3.714 ± 1.318 | 4.643 ± 1.317 | 2.786 ± 2.46 | 1.857 ± 1.64 | 1.857 ± 1.64 | 6.5 ± 2.933 | 3.714 ± 2.49 | 4.643 ± 3.271 | 4.643 ± 1.317 | 0.929 ± 0.82 | 1.857 ± 1.115 | 1.857 ± 0.585 | 1.857 ± 1.115 | 0.0 ± 0.0
Met | 0.929 ± 0.82 | 0.929 ± 0.585 | 0.929 ± 0.883 | 0.929 ± 0.82 | 0.929 ± 0.585 | 0.929 ± 0.883 | 0.929 ± 0.82 | 1.857 ± 1.115 | 2.786 ± 0.536 | 0.929 ± 0.585 | 0.929 ± 0.883 | 3.714 ± 1.339 | 0.929 ± 0.82 | 0.929 ± 0.585 | 2.786 ± 1.299 | 0.929 ± 0.585 | 1.857 ± 1.64 | 1.857 ± 1.171 | 0.0 ± 0.0 | 0.929 ± 0.883 | 0.0 ± 0.0
Asn | 0.929 ± 0.585 | 0.929 ± 0.585 | 0.0 ± 0.0 | 0.929 ± 0.585 | 2.786 ± 0.536 | 1.857 ± 1.171 | 4.643 ± 2.9 | 1.857 ± 0.706 | 1.857 ± 0.585 | 7.428 ± 3.151 | 0.0 ± 0.0 | 1.857 ± 0.585 | 4.643 ± 1.567 | 2.786 ± 0.949 | 0.0 ± 0.0 | 3.714 ± 0.134 | 1.857 ± 0.585 | 3.714 ± 1.339 | 0.929 ± 0.82 | 0.0 ± 0.0 | 0.0 ± 0.0
Pro | 3.714 ± 1.192 | 0.0 ± 0.0 | 2.786 ± 1.299 | 1.857 ± 0.585 | 2.786 ± 1.836 | 2.786 ± 1.746 | 0.0 ± 0.0 | 2.786 ± 1.756 | 2.786 ± 0.536 | 1.857 ± 1.171 | 2.786 ± 1.299 | 1.857 ± 0.706 | 3.714 ± 1.192 | 2.786 ± 1.836 | 7.428 ± 3.682 | 4.643 ± 1.648 | 5.571 ± 1.827 | 1.857 ± 1.115 | 0.0 ± 0.0 | 1.857 ± 0.706 | 0.0 ± 0.0
Gln | 2.786 ± 0.835 | 0.929 ± 0.585 | 2.786 ± 1.299 | 3.714 ± 1.17 | 0.929 ± 0.82 | 0.0 ± 0.0 | 0.929 ± 0.82 | 3.714 ± 0.134 | 0.929 ± 0.82 | 4.643 ± 1.648 | 1.857 ± 1.115 | 0.929 ± 0.82 | 2.786 ± 2.649 | 0.0 ± 0.0 | 5.571 ± 1.67 | 3.714 ± 0.134 | 0.929 ± 0.883 | 6.5 ± 1.657 | 0.929 ± 0.883 | 4.643 ± 1.94 | 0.0 ± 0.0
Arg | 4.643 ± 1.317 | 0.0 ± 0.0 | 0.929 ± 0.585 | 1.857 ± 1.64 | 2.786 ± 0.536 | 9.285 ± 8.831 | 2.786 ± 1.299 | 1.857 ± 1.115 | 4.643 ± 1.861 | 9.285 ± 3.95 | 2.786 ± 1.756 | 1.857 ± 0.585 | 5.571 ± 3.344 | 1.857 ± 1.64 | 5.571 ± 2.233 | 5.571 ± 0.831 | 0.929 ± 0.82 | 1.857 ± 0.706 | 0.929 ± 0.585 | 6.5 ± 1.822 | 0.0 ± 0.0
Ser | 2.786 ± 0.949 | 3.714 ± 0.134 | 3.714 ± 1.411 | 4.643 ± 0.692 | 4.643 ± 1.317 | 11.142 ± 3.735 | 2.786 ± 0.949 | 4.643 ± 1.94 | 5.571 ± 2.495 | 5.571 ± 1.67 | 0.929 ± 0.585 | 0.0 ± 0.0 | 3.714 ± 1.318 | 0.0 ± 0.0 | 2.786 ± 0.835 | 6.5 ± 2.332 | 1.857 ± 0.585 | 0.929 ± 0.585 | 2.786 ± 0.835 | 1.857 ± 1.766 | 0.0 ± 0.0
Thr | 5.571 ± 2.117 | 0.929 ± 0.82 | 1.857 ± 0.585 | 2.786 ± 1.746 | 0.0 ± 0.0 | 1.857 ± 0.706 | 1.857 ± 1.171 | 2.786 ± 0.835 | 1.857 ± 1.171 | 1.857 ± 1.115 | 0.0 ± 0.0 | 2.786 ± 0.536 | 4.643 ± 2.204 | 3.714 ± 1.339 | 3.714 ± 0.134 | 2.786 ± 1.299 | 5.571 ± 2.495 | 4.643 ± 0.692 | 0.929 ± 0.82 | 1.857 ± 1.171 | 0.0 ± 0.0
Val | 8.357 ± 6.162 | 1.857 ± 1.115 | 2.786 ± 1.756 | 5.571 ± 0.831 | 0.0 ± 0.0 | 3.714 ± 1.411 | 1.857 ± 1.171 | 4.643 ± 1.94 | 3.714 ± 1.192 | 4.643 ± 1.648 | 0.0 ± 0.645 | 3.714 ± 0.134 | 1.857 ± 1.115 | 1.857 ± 1.115 | 2.786 ± 0.835 | 3.714 ± 2.341 | 2.786 ± 0.536 | 5.571 ± 1.755 | 1.857 ± 0.585 | 0.929 ± 0.585 | 0.0 ± 0.0
Trp | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.857 ± 1.171 | 0.929 ± 0.585 | 0.929 ± 0.82 | 0.0 ± 0.0 | 1.857 ± 1.115 | 3.714 ± 1.17 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.929 ± 0.585 | 2.786 ± 0.835 | 0.929 ± 0.82 | 1.857 ± 0.706 | 0.929 ± 0.585 | 0.929 ± 0.883 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 1.857 ± 1.171 | 0.929 ± 0.585 | 2.786 ± 1.756 | 1.857 ± 0.706 | 0.929 ± 0.585 | 0.929 ± 0.585 | 2.786 ± 0.536 | 2.786 ± 0.536 | 1.857 ± 1.171 | 1.857 ± 0.585 | 0.0 ± 0.0 | 2.786 ± 0.835 | 1.857 ± 1.171 | 6.5 ± 0.807 | 2.786 ± 1.488 | 2.786 ± 1.756 | 0.0 ± 0.0 | 2.786 ± 1.836 | 0.929 ± 0.82 | 0.929 ± 0.82 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 3 proteins (1078 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
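
A protein-level bootstrap of this kind can be sketched in Python as follows. This is an illustration under stated assumptions, not the Proteome-pI implementation: the function names are hypothetical, whole proteins are resampled with replacement 100 times, and the error is taken as the sample standard deviation of the resampled frequencies (the page does not say which estimator is used).

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the standard deviation across n_boot replicates,
    each built by drawing len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)
```

With only 3 proteins, as here, each replicate contains few distinct combinations, which is one plausible reason why many of the errors in the table are large relative to the values themselves.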

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski