Amino acid dipepetide frequency for Hubei toti-like virus 24

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.075AlaAla: 9.075 ± 0.401
1.134AlaCys: 1.134 ± 0.052
6.807AlaAsp: 6.807 ± 2.134
2.269AlaGlu: 2.269 ± 0.711
2.269AlaPhe: 2.269 ± 1.733
10.777AlaGly: 10.777 ± 0.73
1.134AlaHis: 1.134 ± 0.763
5.105AlaIle: 5.105 ± 0.174
3.971AlaLys: 3.971 ± 2.218
10.21AlaLeu: 10.21 ± 0.466
0.567AlaMet: 0.567 ± 0.433
2.269AlaAsn: 2.269 ± 0.711
4.538AlaPro: 4.538 ± 1.422
3.403AlaGln: 3.403 ± 2.289
9.643AlaArg: 9.643 ± 0.847
11.912AlaSer: 11.912 ± 0.679
3.403AlaThr: 3.403 ± 1.474
7.374AlaVal: 7.374 ± 2.515
1.702AlaTrp: 1.702 ± 0.485
1.702AlaTyr: 1.702 ± 0.33
0.0AlaXaa: 0.0 ± 0.0
Cys
1.134CysAla: 1.134 ± 0.052
1.134CysCys: 1.134 ± 0.052
1.702CysAsp: 1.702 ± 0.485
0.567CysGlu: 0.567 ± 0.433
0.567CysPhe: 0.567 ± 0.433
0.0CysGly: 0.0 ± 0.0
1.134CysHis: 1.134 ± 0.052
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
1.134CysLeu: 1.134 ± 0.052
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.567CysPro: 0.567 ± 0.381
0.0CysGln: 0.0 ± 0.0
1.134CysArg: 1.134 ± 0.052
1.702CysSer: 1.702 ± 0.33
1.702CysThr: 1.702 ± 0.485
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.567CysTyr: 0.567 ± 0.381
0.0CysXaa: 0.0 ± 0.0
Asp
4.538AspAla: 4.538 ± 0.608
0.567AspCys: 0.567 ± 0.381
3.403AspAsp: 3.403 ± 0.659
3.403AspGlu: 3.403 ± 0.155
1.702AspPhe: 1.702 ± 1.3
4.538AspGly: 4.538 ± 0.207
1.134AspHis: 1.134 ± 0.866
4.538AspIle: 4.538 ± 0.207
1.134AspLys: 1.134 ± 0.052
5.672AspLeu: 5.672 ± 0.556
1.134AspMet: 1.134 ± 0.866
0.0AspAsn: 0.0 ± 0.0
4.538AspPro: 4.538 ± 1.422
2.836AspGln: 2.836 ± 1.907
0.567AspArg: 0.567 ± 0.433
3.971AspSer: 3.971 ± 1.041
3.403AspThr: 3.403 ± 1.474
4.538AspVal: 4.538 ± 0.207
1.134AspTrp: 1.134 ± 0.052
2.836AspTyr: 2.836 ± 2.166
0.0AspXaa: 0.0 ± 0.0
Glu
5.105GluAla: 5.105 ± 0.989
1.134GluCys: 1.134 ± 0.763
2.269GluAsp: 2.269 ± 0.711
3.403GluGlu: 3.403 ± 0.97
2.269GluPhe: 2.269 ± 0.104
0.567GluGly: 0.567 ± 0.433
1.134GluHis: 1.134 ± 0.866
1.134GluIle: 1.134 ± 0.866
1.702GluLys: 1.702 ± 0.485
4.538GluLeu: 4.538 ± 0.608
2.269GluMet: 2.269 ± 0.104
0.0GluAsn: 0.0 ± 0.0
0.567GluPro: 0.567 ± 0.381
1.702GluGln: 1.702 ± 0.485
2.269GluArg: 2.269 ± 0.918
4.538GluSer: 4.538 ± 1.022
1.134GluThr: 1.134 ± 0.763
5.672GluVal: 5.672 ± 0.259
1.134GluTrp: 1.134 ± 0.052
1.134GluTyr: 1.134 ± 0.866
0.567GluXaa: 0.567 ± 0.433
Phe
3.971PheAla: 3.971 ± 0.226
0.0PheCys: 0.0 ± 0.0
2.269PheAsp: 2.269 ± 0.918
3.403PheGlu: 3.403 ± 0.155
2.836PhePhe: 2.836 ± 0.537
2.836PheGly: 2.836 ± 0.537
0.0PheHis: 0.0 ± 0.0
1.702PheIle: 1.702 ± 0.33
1.134PheLys: 1.134 ± 0.866
3.971PheLeu: 3.971 ± 1.403
0.0PheMet: 0.0 ± 0.0
0.567PheAsn: 0.567 ± 0.381
1.702PhePro: 1.702 ± 0.33
1.134PheGln: 1.134 ± 0.763
1.134PheArg: 1.134 ± 0.052
4.538PheSer: 4.538 ± 1.836
3.971PheThr: 3.971 ± 0.226
3.971PheVal: 3.971 ± 0.589
0.567PheTrp: 0.567 ± 0.433
1.134PheTyr: 1.134 ± 0.866
0.0PheXaa: 0.0 ± 0.0
Gly
6.239GlyAla: 6.239 ± 1.752
0.567GlyCys: 0.567 ± 0.433
3.403GlyAsp: 3.403 ± 0.659
2.269GlyGlu: 2.269 ± 0.104
2.836GlyPhe: 2.836 ± 0.278
8.508GlyGly: 8.508 ± 0.796
1.702GlyHis: 1.702 ± 0.33
1.702GlyIle: 1.702 ± 0.485
1.134GlyLys: 1.134 ± 0.866
7.374GlyLeu: 7.374 ± 0.071
1.702GlyMet: 1.702 ± 1.144
1.702GlyAsn: 1.702 ± 1.144
5.105GlyPro: 5.105 ± 0.64
3.971GlyGln: 3.971 ± 2.67
3.971GlyArg: 3.971 ± 0.589
5.105GlySer: 5.105 ± 0.174
4.538GlyThr: 4.538 ± 1.022
7.374GlyVal: 7.374 ± 0.744
1.702GlyTrp: 1.702 ± 0.485
1.134GlyTyr: 1.134 ± 0.866
0.0GlyXaa: 0.0 ± 0.0
His
2.269HisAla: 2.269 ± 0.104
0.567HisCys: 0.567 ± 0.381
1.134HisAsp: 1.134 ± 0.052
0.0HisGlu: 0.0 ± 0.0
2.269HisPhe: 2.269 ± 0.104
1.702HisGly: 1.702 ± 0.33
1.134HisHis: 1.134 ± 0.052
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
2.836HisLeu: 2.836 ± 0.537
0.567HisMet: 0.567 ± 0.433
0.567HisAsn: 0.567 ± 0.433
1.702HisPro: 1.702 ± 0.485
0.567HisGln: 0.567 ± 0.381
0.567HisArg: 0.567 ± 0.381
1.134HisSer: 1.134 ± 0.052
1.702HisThr: 1.702 ± 0.485
0.567HisVal: 0.567 ± 0.381
0.567HisTrp: 0.567 ± 0.433
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.971IleAla: 3.971 ± 1.041
0.0IleCys: 0.0 ± 0.0
1.702IleAsp: 1.702 ± 0.485
0.567IleGlu: 0.567 ± 0.381
1.134IlePhe: 1.134 ± 0.866
4.538IleGly: 4.538 ± 1.422
0.0IleHis: 0.0 ± 0.0
1.702IleIle: 1.702 ± 0.485
2.269IleLys: 2.269 ± 0.711
3.971IleLeu: 3.971 ± 1.041
0.567IleMet: 0.567 ± 0.381
0.567IleAsn: 0.567 ± 0.381
1.702IlePro: 1.702 ± 1.144
0.567IleGln: 0.567 ± 0.433
2.269IleArg: 2.269 ± 0.104
3.403IleSer: 3.403 ± 0.97
2.836IleThr: 2.836 ± 1.093
2.269IleVal: 2.269 ± 0.918
0.567IleTrp: 0.567 ± 0.433
2.269IleTyr: 2.269 ± 0.918
0.0IleXaa: 0.0 ± 0.0
Lys
2.269LysAla: 2.269 ± 0.918
0.567LysCys: 0.567 ± 0.433
3.403LysAsp: 3.403 ± 0.155
1.702LysGlu: 1.702 ± 0.33
1.134LysPhe: 1.134 ± 0.052
5.672LysGly: 5.672 ± 1.074
0.567LysHis: 0.567 ± 0.433
0.567LysIle: 0.567 ± 0.381
2.269LysLys: 2.269 ± 0.104
2.836LysLeu: 2.836 ± 2.166
0.567LysMet: 0.567 ± 0.381
0.0LysAsn: 0.0 ± 0.0
1.702LysPro: 1.702 ± 0.485
0.567LysGln: 0.567 ± 0.381
1.702LysArg: 1.702 ± 0.485
2.269LysSer: 2.269 ± 0.918
2.269LysThr: 2.269 ± 1.733
3.403LysVal: 3.403 ± 2.289
0.567LysTrp: 0.567 ± 0.433
1.702LysTyr: 1.702 ± 0.33
0.0LysXaa: 0.0 ± 0.0
Leu
10.21LeuAla: 10.21 ± 1.164
2.836LeuCys: 2.836 ± 1.351
1.702LeuAsp: 1.702 ± 1.3
6.239LeuGlu: 6.239 ± 0.692
6.807LeuPhe: 6.807 ± 1.125
7.941LeuGly: 7.941 ± 0.362
0.567LeuHis: 0.567 ± 0.381
5.105LeuIle: 5.105 ± 2.619
2.269LeuLys: 2.269 ± 1.733
9.075LeuLeu: 9.075 ± 2.044
0.0LeuMet: 0.0 ± 0.0
3.403LeuAsn: 3.403 ± 2.289
6.807LeuPro: 6.807 ± 1.319
2.269LeuGln: 2.269 ± 0.711
10.21LeuArg: 10.21 ± 2.095
8.508LeuSer: 8.508 ± 0.796
3.971LeuThr: 3.971 ± 0.589
6.239LeuVal: 6.239 ± 0.123
0.567LeuTrp: 0.567 ± 0.433
2.836LeuTyr: 2.836 ± 0.278
0.0LeuXaa: 0.0 ± 0.0
Met
1.702MetAla: 1.702 ± 0.33
1.134MetCys: 1.134 ± 0.052
0.0MetAsp: 0.0 ± 0.0
0.567MetGlu: 0.567 ± 0.433
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.567MetHis: 0.567 ± 0.433
1.134MetIle: 1.134 ± 0.866
0.567MetLys: 0.567 ± 0.381
1.702MetLeu: 1.702 ± 1.144
1.134MetMet: 1.134 ± 0.335
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
1.134MetArg: 1.134 ± 0.866
1.134MetSer: 1.134 ± 0.052
2.836MetThr: 2.836 ± 1.093
2.269MetVal: 2.269 ± 0.104
0.0MetTrp: 0.0 ± 0.0
1.134MetTyr: 1.134 ± 0.763
0.567MetXaa: 0.567 ± 0.381
Asn
1.134AsnAla: 1.134 ± 0.763
0.567AsnCys: 0.567 ± 0.381
0.567AsnAsp: 0.567 ± 0.381
0.0AsnGlu: 0.0 ± 0.0
0.567AsnPhe: 0.567 ± 0.433
1.134AsnGly: 1.134 ± 0.763
0.0AsnHis: 0.0 ± 0.0
1.134AsnIle: 1.134 ± 0.763
1.134AsnLys: 1.134 ± 0.763
2.269AsnLeu: 2.269 ± 0.918
1.134AsnMet: 1.134 ± 0.763
1.702AsnAsn: 1.702 ± 0.33
2.269AsnPro: 2.269 ± 0.711
0.0AsnGln: 0.0 ± 0.0
2.269AsnArg: 2.269 ± 1.526
0.567AsnSer: 0.567 ± 0.381
1.702AsnThr: 1.702 ± 0.33
3.403AsnVal: 3.403 ± 0.659
0.567AsnTrp: 0.567 ± 0.433
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
5.105ProAla: 5.105 ± 2.619
0.0ProCys: 0.0 ± 0.0
5.105ProAsp: 5.105 ± 0.64
1.702ProGlu: 1.702 ± 0.33
3.971ProPhe: 3.971 ± 1.041
1.702ProGly: 1.702 ± 0.33
2.269ProHis: 2.269 ± 0.104
1.134ProIle: 1.134 ± 0.763
3.403ProLys: 3.403 ± 0.659
7.941ProLeu: 7.941 ± 2.082
1.134ProMet: 1.134 ± 0.866
1.134ProAsn: 1.134 ± 0.052
7.374ProPro: 7.374 ± 0.886
0.0ProGln: 0.0 ± 0.0
2.269ProArg: 2.269 ± 0.918
4.538ProSer: 4.538 ± 1.022
2.836ProThr: 2.836 ± 1.093
7.374ProVal: 7.374 ± 2.515
1.702ProTrp: 1.702 ± 0.33
0.567ProTyr: 0.567 ± 0.381
0.0ProXaa: 0.0 ± 0.0
Gln
2.836GlnAla: 2.836 ± 1.907
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
1.134GlnGlu: 1.134 ± 0.763
2.269GlnPhe: 2.269 ± 0.711
1.702GlnGly: 1.702 ± 0.485
1.702GlnHis: 1.702 ± 1.144
0.567GlnIle: 0.567 ± 0.381
0.567GlnLys: 0.567 ± 0.381
1.702GlnLeu: 1.702 ± 0.33
1.134GlnMet: 1.134 ± 0.052
1.134GlnAsn: 1.134 ± 0.052
1.134GlnPro: 1.134 ± 0.763
2.836GlnGln: 2.836 ± 1.093
1.702GlnArg: 1.702 ± 0.33
1.134GlnSer: 1.134 ± 0.052
1.134GlnThr: 1.134 ± 0.763
1.702GlnVal: 1.702 ± 0.33
2.269GlnTrp: 2.269 ± 1.526
0.567GlnTyr: 0.567 ± 0.433
0.0GlnXaa: 0.0 ± 0.0
Arg
9.643ArgAla: 9.643 ± 4.106
1.702ArgCys: 1.702 ± 0.485
1.702ArgAsp: 1.702 ± 0.485
3.971ArgGlu: 3.971 ± 0.226
3.403ArgPhe: 3.403 ± 0.97
2.836ArgGly: 2.836 ± 0.537
1.702ArgHis: 1.702 ± 0.485
1.702ArgIle: 1.702 ± 0.33
1.702ArgLys: 1.702 ± 0.33
5.105ArgLeu: 5.105 ± 0.989
1.702ArgMet: 1.702 ± 0.243
2.836ArgAsn: 2.836 ± 0.537
2.269ArgPro: 2.269 ± 0.104
1.134ArgGln: 1.134 ± 0.763
3.403ArgArg: 3.403 ± 0.155
5.105ArgSer: 5.105 ± 1.455
5.105ArgThr: 5.105 ± 0.174
6.807ArgVal: 6.807 ± 2.948
2.836ArgTrp: 2.836 ± 2.166
2.269ArgTyr: 2.269 ± 1.733
0.0ArgXaa: 0.0 ± 0.0
Ser
10.777SerAla: 10.777 ± 0.73
0.0SerCys: 0.0 ± 0.0
3.971SerAsp: 3.971 ± 1.041
3.971SerGlu: 3.971 ± 0.589
2.836SerPhe: 2.836 ± 0.537
9.643SerGly: 9.643 ± 0.782
1.702SerHis: 1.702 ± 0.485
2.269SerIle: 2.269 ± 0.104
2.836SerLys: 2.836 ± 0.537
7.941SerLeu: 7.941 ± 3.621
0.567SerMet: 0.567 ± 0.381
0.0SerAsn: 0.0 ± 0.0
5.105SerPro: 5.105 ± 0.989
1.702SerGln: 1.702 ± 0.485
5.672SerArg: 5.672 ± 1.074
5.672SerSer: 5.672 ± 1.074
5.672SerThr: 5.672 ± 0.259
9.075SerVal: 9.075 ± 2.044
2.836SerTrp: 2.836 ± 0.537
1.134SerTyr: 1.134 ± 0.866
0.0SerXaa: 0.0 ± 0.0
Thr
5.105ThrAla: 5.105 ± 0.64
0.0ThrCys: 0.0 ± 0.0
2.836ThrAsp: 2.836 ± 1.093
1.134ThrGlu: 1.134 ± 0.052
0.567ThrPhe: 0.567 ± 0.381
0.567ThrGly: 0.567 ± 0.381
2.269ThrHis: 2.269 ± 0.104
2.836ThrIle: 2.836 ± 0.278
2.836ThrLys: 2.836 ± 0.278
5.105ThrLeu: 5.105 ± 1.455
2.269ThrMet: 2.269 ± 0.711
2.269ThrAsn: 2.269 ± 1.526
4.538ThrPro: 4.538 ± 1.422
1.702ThrGln: 1.702 ± 0.33
3.403ThrArg: 3.403 ± 0.155
7.941ThrSer: 7.941 ± 1.177
7.941ThrThr: 7.941 ± 0.362
7.374ThrVal: 7.374 ± 1.559
2.269ThrTrp: 2.269 ± 0.711
1.702ThrTyr: 1.702 ± 0.33
0.0ThrXaa: 0.0 ± 0.0
Val
6.239ValAla: 6.239 ± 1.752
1.134ValCys: 1.134 ± 0.052
10.777ValAsp: 10.777 ± 1.545
6.239ValGlu: 6.239 ± 3.136
2.269ValPhe: 2.269 ± 0.104
5.672ValGly: 5.672 ± 0.556
0.567ValHis: 0.567 ± 0.381
3.403ValIle: 3.403 ± 0.155
5.105ValLys: 5.105 ± 1.455
9.075ValLeu: 9.075 ± 2.845
0.0ValMet: 0.0 ± 0.0
2.836ValAsn: 2.836 ± 1.093
7.941ValPro: 7.941 ± 0.452
0.567ValGln: 0.567 ± 0.433
9.075ValArg: 9.075 ± 1.215
6.239ValSer: 6.239 ± 1.507
5.105ValThr: 5.105 ± 0.174
6.807ValVal: 6.807 ± 0.504
0.567ValTrp: 0.567 ± 0.381
1.702ValTyr: 1.702 ± 0.485
0.0ValXaa: 0.0 ± 0.0
Trp
4.538TrpAla: 4.538 ± 0.608
0.0TrpCys: 0.0 ± 0.0
1.134TrpAsp: 1.134 ± 0.866
0.567TrpGlu: 0.567 ± 0.381
0.567TrpPhe: 0.567 ± 0.433
1.134TrpGly: 1.134 ± 0.052
0.0TrpHis: 0.0 ± 0.0
1.134TrpIle: 1.134 ± 0.866
0.567TrpLys: 0.567 ± 0.381
3.403TrpLeu: 3.403 ± 0.97
0.0TrpMet: 0.0 ± 0.0
0.567TrpAsn: 0.567 ± 0.381
1.134TrpPro: 1.134 ± 0.052
1.134TrpGln: 1.134 ± 0.052
1.134TrpArg: 1.134 ± 0.052
2.269TrpSer: 2.269 ± 0.104
0.0TrpThr: 0.0 ± 0.0
2.836TrpVal: 2.836 ± 1.351
0.0TrpTrp: 0.0 ± 0.0
1.134TrpTyr: 1.134 ± 0.763
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.836TyrAla: 2.836 ± 0.537
0.0TyrCys: 0.0 ± 0.0
1.702TyrAsp: 1.702 ± 1.3
1.134TyrGlu: 1.134 ± 0.052
0.567TyrPhe: 0.567 ± 0.381
0.567TyrGly: 0.567 ± 0.381
0.567TyrHis: 0.567 ± 0.433
0.0TyrIle: 0.0 ± 0.0
1.134TyrLys: 1.134 ± 0.866
1.702TyrLeu: 1.702 ± 0.33
0.567TyrMet: 0.567 ± 0.433
0.567TyrAsn: 0.567 ± 0.433
0.567TyrPro: 0.567 ± 0.433
1.134TyrGln: 1.134 ± 0.052
3.403TyrArg: 3.403 ± 1.785
1.702TyrSer: 1.702 ± 1.144
3.403TyrThr: 3.403 ± 1.785
2.269TyrVal: 2.269 ± 0.918
1.702TyrTrp: 1.702 ± 1.144
0.567TyrTyr: 0.567 ± 0.433
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.567XaaAla: 0.567 ± 0.433
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.567XaaMet: 0.567 ± 0.381
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1764 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski