Amino acid dipepetide frequency for Wuhan house centipede virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.818AlaAla: 3.818 ± 0.0
1.018AlaCys: 1.018 ± 0.0
1.782AlaAsp: 1.782 ± 0.0
4.072AlaGlu: 4.072 ± 0.0
2.036AlaPhe: 2.036 ± 0.0
2.291AlaGly: 2.291 ± 0.0
1.273AlaHis: 1.273 ± 0.0
3.818AlaIle: 3.818 ± 0.0
3.563AlaLys: 3.563 ± 0.0
4.072AlaLeu: 4.072 ± 0.0
2.036AlaMet: 2.036 ± 0.0
3.818AlaAsn: 3.818 ± 0.0
1.527AlaPro: 1.527 ± 0.0
1.782AlaGln: 1.782 ± 0.0
2.291AlaArg: 2.291 ± 0.0
2.545AlaSer: 2.545 ± 0.0
4.072AlaThr: 4.072 ± 0.0
6.617AlaVal: 6.617 ± 0.0
1.782AlaTrp: 1.782 ± 0.0
3.054AlaTyr: 3.054 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.527CysAla: 1.527 ± 0.0
0.509CysCys: 0.509 ± 0.0
0.255CysAsp: 0.255 ± 0.0
1.527CysGlu: 1.527 ± 0.0
0.255CysPhe: 0.255 ± 0.0
1.018CysGly: 1.018 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.018CysIle: 1.018 ± 0.0
1.273CysLys: 1.273 ± 0.0
2.8CysLeu: 2.8 ± 0.0
0.255CysMet: 0.255 ± 0.0
0.764CysAsn: 0.764 ± 0.0
0.509CysPro: 0.509 ± 0.0
0.255CysGln: 0.255 ± 0.0
0.255CysArg: 0.255 ± 0.0
0.509CysSer: 0.509 ± 0.0
1.018CysThr: 1.018 ± 0.0
0.509CysVal: 0.509 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.255CysTyr: 0.255 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.545AspAla: 2.545 ± 0.0
1.527AspCys: 1.527 ± 0.0
4.327AspAsp: 4.327 ± 0.0
3.818AspGlu: 3.818 ± 0.0
3.818AspPhe: 3.818 ± 0.0
3.563AspGly: 3.563 ± 0.0
0.764AspHis: 0.764 ± 0.0
4.072AspIle: 4.072 ± 0.0
4.327AspLys: 4.327 ± 0.0
4.581AspLeu: 4.581 ± 0.0
1.018AspMet: 1.018 ± 0.0
3.563AspAsn: 3.563 ± 0.0
2.291AspPro: 2.291 ± 0.0
2.036AspGln: 2.036 ± 0.0
2.291AspArg: 2.291 ± 0.0
4.581AspSer: 4.581 ± 0.0
3.309AspThr: 3.309 ± 0.0
4.581AspVal: 4.581 ± 0.0
0.764AspTrp: 0.764 ± 0.0
0.764AspTyr: 0.764 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.036GluAla: 2.036 ± 0.0
1.782GluCys: 1.782 ± 0.0
3.309GluAsp: 3.309 ± 0.0
5.09GluGlu: 5.09 ± 0.0
3.818GluPhe: 3.818 ± 0.0
1.527GluGly: 1.527 ± 0.0
1.273GluHis: 1.273 ± 0.0
4.327GluIle: 4.327 ± 0.0
3.309GluLys: 3.309 ± 0.0
6.617GluLeu: 6.617 ± 0.0
2.036GluMet: 2.036 ± 0.0
2.291GluAsn: 2.291 ± 0.0
2.291GluPro: 2.291 ± 0.0
3.563GluGln: 3.563 ± 0.0
1.273GluArg: 1.273 ± 0.0
5.345GluSer: 5.345 ± 0.0
3.054GluThr: 3.054 ± 0.0
2.036GluVal: 2.036 ± 0.0
0.764GluTrp: 0.764 ± 0.0
3.309GluTyr: 3.309 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.545PheAla: 2.545 ± 0.0
0.764PheCys: 0.764 ± 0.0
2.291PheAsp: 2.291 ± 0.0
2.545PheGlu: 2.545 ± 0.0
1.018PhePhe: 1.018 ± 0.0
3.563PheGly: 3.563 ± 0.0
1.273PheHis: 1.273 ± 0.0
3.309PheIle: 3.309 ± 0.0
3.054PheLys: 3.054 ± 0.0
2.545PheLeu: 2.545 ± 0.0
1.018PheMet: 1.018 ± 0.0
2.545PheAsn: 2.545 ± 0.0
1.527PhePro: 1.527 ± 0.0
1.273PheGln: 1.273 ± 0.0
1.273PheArg: 1.273 ± 0.0
2.545PheSer: 2.545 ± 0.0
3.054PheThr: 3.054 ± 0.0
2.8PheVal: 2.8 ± 0.0
0.509PheTrp: 0.509 ± 0.0
0.255PheTyr: 0.255 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.054GlyAla: 3.054 ± 0.0
0.764GlyCys: 0.764 ± 0.0
3.054GlyAsp: 3.054 ± 0.0
2.545GlyGlu: 2.545 ± 0.0
1.273GlyPhe: 1.273 ± 0.0
1.018GlyGly: 1.018 ± 0.0
0.255GlyHis: 0.255 ± 0.0
5.09GlyIle: 5.09 ± 0.0
2.8GlyLys: 2.8 ± 0.0
4.836GlyLeu: 4.836 ± 0.0
1.018GlyMet: 1.018 ± 0.0
2.8GlyAsn: 2.8 ± 0.0
1.527GlyPro: 1.527 ± 0.0
1.273GlyGln: 1.273 ± 0.0
1.782GlyArg: 1.782 ± 0.0
4.581GlySer: 4.581 ± 0.0
5.854GlyThr: 5.854 ± 0.0
2.291GlyVal: 2.291 ± 0.0
0.509GlyTrp: 0.509 ± 0.0
3.563GlyTyr: 3.563 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.509HisAla: 0.509 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.273HisAsp: 1.273 ± 0.0
0.509HisGlu: 0.509 ± 0.0
1.018HisPhe: 1.018 ± 0.0
1.018HisGly: 1.018 ± 0.0
0.764HisHis: 0.764 ± 0.0
2.036HisIle: 2.036 ± 0.0
1.782HisLys: 1.782 ± 0.0
0.509HisLeu: 0.509 ± 0.0
0.255HisMet: 0.255 ± 0.0
0.764HisAsn: 0.764 ± 0.0
0.255HisPro: 0.255 ± 0.0
0.509HisGln: 0.509 ± 0.0
1.018HisArg: 1.018 ± 0.0
0.255HisSer: 0.255 ± 0.0
0.509HisThr: 0.509 ± 0.0
1.018HisVal: 1.018 ± 0.0
0.255HisTrp: 0.255 ± 0.0
0.509HisTyr: 0.509 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.327IleAla: 4.327 ± 0.0
1.527IleCys: 1.527 ± 0.0
4.836IleAsp: 4.836 ± 0.0
5.854IleGlu: 5.854 ± 0.0
1.273IlePhe: 1.273 ± 0.0
4.581IleGly: 4.581 ± 0.0
0.509IleHis: 0.509 ± 0.0
7.381IleIle: 7.381 ± 0.0
7.89IleLys: 7.89 ± 0.0
5.09IleLeu: 5.09 ± 0.0
2.291IleMet: 2.291 ± 0.0
6.872IleAsn: 6.872 ± 0.0
3.309IlePro: 3.309 ± 0.0
3.054IleGln: 3.054 ± 0.0
3.054IleArg: 3.054 ± 0.0
7.636IleSer: 7.636 ± 0.0
5.345IleThr: 5.345 ± 0.0
5.854IleVal: 5.854 ± 0.0
1.273IleTrp: 1.273 ± 0.0
2.8IleTyr: 2.8 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
1.273LysAla: 1.273 ± 0.0
0.509LysCys: 0.509 ± 0.0
3.563LysAsp: 3.563 ± 0.0
3.563LysGlu: 3.563 ± 0.0
3.309LysPhe: 3.309 ± 0.0
2.8LysGly: 2.8 ± 0.0
0.764LysHis: 0.764 ± 0.0
5.599LysIle: 5.599 ± 0.0
4.327LysLys: 4.327 ± 0.0
5.599LysLeu: 5.599 ± 0.0
1.273LysMet: 1.273 ± 0.0
4.327LysAsn: 4.327 ± 0.0
3.309LysPro: 3.309 ± 0.0
3.818LysGln: 3.818 ± 0.0
2.545LysArg: 2.545 ± 0.0
5.599LysSer: 5.599 ± 0.0
5.09LysThr: 5.09 ± 0.0
5.09LysVal: 5.09 ± 0.0
0.255LysTrp: 0.255 ± 0.0
4.581LysTyr: 4.581 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
4.072LeuAla: 4.072 ± 0.0
1.018LeuCys: 1.018 ± 0.0
6.363LeuAsp: 6.363 ± 0.0
5.09LeuGlu: 5.09 ± 0.0
4.836LeuPhe: 4.836 ± 0.0
6.617LeuGly: 6.617 ± 0.0
1.527LeuHis: 1.527 ± 0.0
5.345LeuIle: 5.345 ± 0.0
6.363LeuLys: 6.363 ± 0.0
8.654LeuLeu: 8.654 ± 0.0
1.273LeuMet: 1.273 ± 0.0
6.617LeuAsn: 6.617 ± 0.0
3.309LeuPro: 3.309 ± 0.0
3.563LeuGln: 3.563 ± 0.0
3.309LeuArg: 3.309 ± 0.0
5.345LeuSer: 5.345 ± 0.0
5.599LeuThr: 5.599 ± 0.0
4.581LeuVal: 4.581 ± 0.0
0.764LeuTrp: 0.764 ± 0.0
3.563LeuTyr: 3.563 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.291MetAla: 2.291 ± 0.0
0.255MetCys: 0.255 ± 0.0
0.509MetAsp: 0.509 ± 0.0
0.764MetGlu: 0.764 ± 0.0
0.764MetPhe: 0.764 ± 0.0
0.509MetGly: 0.509 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.018MetIle: 1.018 ± 0.0
1.018MetLys: 1.018 ± 0.0
3.054MetLeu: 3.054 ± 0.0
0.255MetMet: 0.255 ± 0.0
1.782MetAsn: 1.782 ± 0.0
1.782MetPro: 1.782 ± 0.0
0.764MetGln: 0.764 ± 0.0
1.782MetArg: 1.782 ± 0.0
2.545MetSer: 2.545 ± 0.0
1.018MetThr: 1.018 ± 0.0
1.273MetVal: 1.273 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.764MetTyr: 0.764 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.072AsnAla: 4.072 ± 0.0
1.527AsnCys: 1.527 ± 0.0
4.072AsnAsp: 4.072 ± 0.0
3.309AsnGlu: 3.309 ± 0.0
2.036AsnPhe: 2.036 ± 0.0
1.527AsnGly: 1.527 ± 0.0
1.018AsnHis: 1.018 ± 0.0
6.363AsnIle: 6.363 ± 0.0
2.8AsnLys: 2.8 ± 0.0
8.399AsnLeu: 8.399 ± 0.0
1.273AsnMet: 1.273 ± 0.0
4.836AsnAsn: 4.836 ± 0.0
3.309AsnPro: 3.309 ± 0.0
1.782AsnGln: 1.782 ± 0.0
1.018AsnArg: 1.018 ± 0.0
4.836AsnSer: 4.836 ± 0.0
3.818AsnThr: 3.818 ± 0.0
4.581AsnVal: 4.581 ± 0.0
0.255AsnTrp: 0.255 ± 0.0
3.309AsnTyr: 3.309 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
4.836ProAla: 4.836 ± 0.0
0.0ProCys: 0.0 ± 0.0
2.036ProAsp: 2.036 ± 0.0
2.036ProGlu: 2.036 ± 0.0
1.018ProPhe: 1.018 ± 0.0
2.036ProGly: 2.036 ± 0.0
0.0ProHis: 0.0 ± 0.0
3.309ProIle: 3.309 ± 0.0
2.8ProLys: 2.8 ± 0.0
2.8ProLeu: 2.8 ± 0.0
0.509ProMet: 0.509 ± 0.0
1.782ProAsn: 1.782 ± 0.0
1.018ProPro: 1.018 ± 0.0
1.782ProGln: 1.782 ± 0.0
2.291ProArg: 2.291 ± 0.0
3.309ProSer: 3.309 ± 0.0
3.309ProThr: 3.309 ± 0.0
1.018ProVal: 1.018 ± 0.0
0.255ProTrp: 0.255 ± 0.0
3.818ProTyr: 3.818 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.8GlnAla: 2.8 ± 0.0
0.509GlnCys: 0.509 ± 0.0
2.291GlnAsp: 2.291 ± 0.0
2.036GlnGlu: 2.036 ± 0.0
2.291GlnPhe: 2.291 ± 0.0
1.527GlnGly: 1.527 ± 0.0
1.018GlnHis: 1.018 ± 0.0
3.563GlnIle: 3.563 ± 0.0
2.036GlnLys: 2.036 ± 0.0
4.072GlnLeu: 4.072 ± 0.0
0.255GlnMet: 0.255 ± 0.0
3.309GlnAsn: 3.309 ± 0.0
2.036GlnPro: 2.036 ± 0.0
3.563GlnGln: 3.563 ± 0.0
2.545GlnArg: 2.545 ± 0.0
2.036GlnSer: 2.036 ± 0.0
2.545GlnThr: 2.545 ± 0.0
1.782GlnVal: 1.782 ± 0.0
0.509GlnTrp: 0.509 ± 0.0
1.018GlnTyr: 1.018 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.8ArgAla: 2.8 ± 0.0
0.255ArgCys: 0.255 ± 0.0
1.527ArgAsp: 1.527 ± 0.0
3.054ArgGlu: 3.054 ± 0.0
1.527ArgPhe: 1.527 ± 0.0
2.545ArgGly: 2.545 ± 0.0
1.273ArgHis: 1.273 ± 0.0
3.818ArgIle: 3.818 ± 0.0
1.527ArgLys: 1.527 ± 0.0
5.09ArgLeu: 5.09 ± 0.0
0.764ArgMet: 0.764 ± 0.0
3.054ArgAsn: 3.054 ± 0.0
2.291ArgPro: 2.291 ± 0.0
1.018ArgGln: 1.018 ± 0.0
0.255ArgArg: 0.255 ± 0.0
2.036ArgSer: 2.036 ± 0.0
2.036ArgThr: 2.036 ± 0.0
2.8ArgVal: 2.8 ± 0.0
0.255ArgTrp: 0.255 ± 0.0
1.782ArgTyr: 1.782 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.327SerAla: 4.327 ± 0.0
0.764SerCys: 0.764 ± 0.0
3.818SerAsp: 3.818 ± 0.0
3.054SerGlu: 3.054 ± 0.0
3.309SerPhe: 3.309 ± 0.0
4.327SerGly: 4.327 ± 0.0
0.764SerHis: 0.764 ± 0.0
6.872SerIle: 6.872 ± 0.0
5.345SerLys: 5.345 ± 0.0
5.09SerLeu: 5.09 ± 0.0
2.545SerMet: 2.545 ± 0.0
4.327SerAsn: 4.327 ± 0.0
1.527SerPro: 1.527 ± 0.0
2.8SerGln: 2.8 ± 0.0
4.581SerArg: 4.581 ± 0.0
5.599SerSer: 5.599 ± 0.0
4.836SerThr: 4.836 ± 0.0
4.836SerVal: 4.836 ± 0.0
0.255SerTrp: 0.255 ± 0.0
2.8SerTyr: 2.8 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.072ThrAla: 4.072 ± 0.0
0.764ThrCys: 0.764 ± 0.0
4.072ThrAsp: 4.072 ± 0.0
2.291ThrGlu: 2.291 ± 0.0
2.036ThrPhe: 2.036 ± 0.0
4.836ThrGly: 4.836 ± 0.0
0.509ThrHis: 0.509 ± 0.0
8.399ThrIle: 8.399 ± 0.0
4.581ThrLys: 4.581 ± 0.0
6.108ThrLeu: 6.108 ± 0.0
2.036ThrMet: 2.036 ± 0.0
2.8ThrAsn: 2.8 ± 0.0
3.054ThrPro: 3.054 ± 0.0
3.054ThrGln: 3.054 ± 0.0
2.036ThrArg: 2.036 ± 0.0
3.563ThrSer: 3.563 ± 0.0
7.126ThrThr: 7.126 ± 0.0
4.581ThrVal: 4.581 ± 0.0
0.255ThrTrp: 0.255 ± 0.0
4.072ThrTyr: 4.072 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
3.818ValAla: 3.818 ± 0.0
0.764ValCys: 0.764 ± 0.0
4.581ValAsp: 4.581 ± 0.0
4.327ValGlu: 4.327 ± 0.0
1.782ValPhe: 1.782 ± 0.0
3.054ValGly: 3.054 ± 0.0
1.018ValHis: 1.018 ± 0.0
5.599ValIle: 5.599 ± 0.0
4.836ValLys: 4.836 ± 0.0
4.072ValLeu: 4.072 ± 0.0
0.764ValMet: 0.764 ± 0.0
4.836ValAsn: 4.836 ± 0.0
3.054ValPro: 3.054 ± 0.0
1.782ValGln: 1.782 ± 0.0
3.309ValArg: 3.309 ± 0.0
4.581ValSer: 4.581 ± 0.0
5.09ValThr: 5.09 ± 0.0
3.818ValVal: 3.818 ± 0.0
0.509ValTrp: 0.509 ± 0.0
1.782ValTyr: 1.782 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.255TrpAsp: 0.255 ± 0.0
0.255TrpGlu: 0.255 ± 0.0
0.255TrpPhe: 0.255 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.255TrpHis: 0.255 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.255TrpLys: 0.255 ± 0.0
0.764TrpLeu: 0.764 ± 0.0
0.255TrpMet: 0.255 ± 0.0
0.764TrpAsn: 0.764 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.273TrpGln: 1.273 ± 0.0
0.764TrpArg: 0.764 ± 0.0
1.273TrpSer: 1.273 ± 0.0
0.255TrpThr: 0.255 ± 0.0
1.527TrpVal: 1.527 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.018TrpTyr: 1.018 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.545TyrAla: 2.545 ± 0.0
0.255TyrCys: 0.255 ± 0.0
4.327TyrAsp: 4.327 ± 0.0
3.309TyrGlu: 3.309 ± 0.0
2.036TyrPhe: 2.036 ± 0.0
1.527TyrGly: 1.527 ± 0.0
0.509TyrHis: 0.509 ± 0.0
3.818TyrIle: 3.818 ± 0.0
3.309TyrLys: 3.309 ± 0.0
3.309TyrLeu: 3.309 ± 0.0
0.764TyrMet: 0.764 ± 0.0
2.291TyrAsn: 2.291 ± 0.0
2.036TyrPro: 2.036 ± 0.0
2.8TyrGln: 2.8 ± 0.0
2.036TyrArg: 2.036 ± 0.0
3.054TyrSer: 3.054 ± 0.0
3.309TyrThr: 3.309 ± 0.0
1.782TyrVal: 1.782 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
1.273TyrTyr: 1.273 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3930 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski