Amino acid dipepetide frequency for Wuhan centipede virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.802AlaAla: 1.802 ± 0.0
1.287AlaCys: 1.287 ± 0.0
1.287AlaAsp: 1.287 ± 0.0
3.604AlaGlu: 3.604 ± 0.0
2.96AlaPhe: 2.96 ± 0.0
2.059AlaGly: 2.059 ± 0.0
0.644AlaHis: 0.644 ± 0.0
2.831AlaIle: 2.831 ± 0.0
2.059AlaLys: 2.059 ± 0.0
4.247AlaLeu: 4.247 ± 0.0
1.544AlaMet: 1.544 ± 0.0
1.416AlaAsn: 1.416 ± 0.0
1.802AlaPro: 1.802 ± 0.0
1.673AlaGln: 1.673 ± 0.0
2.703AlaArg: 2.703 ± 0.0
3.475AlaSer: 3.475 ± 0.0
3.089AlaThr: 3.089 ± 0.0
4.118AlaVal: 4.118 ± 0.0
0.386AlaTrp: 0.386 ± 0.0
1.802AlaTyr: 1.802 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.772CysAla: 0.772 ± 0.0
1.03CysCys: 1.03 ± 0.0
1.802CysAsp: 1.802 ± 0.0
1.416CysGlu: 1.416 ± 0.0
1.287CysPhe: 1.287 ± 0.0
2.317CysGly: 2.317 ± 0.0
0.515CysHis: 0.515 ± 0.0
1.416CysIle: 1.416 ± 0.0
2.188CysLys: 2.188 ± 0.0
2.188CysLeu: 2.188 ± 0.0
0.772CysMet: 0.772 ± 0.0
1.544CysAsn: 1.544 ± 0.0
1.03CysPro: 1.03 ± 0.0
0.772CysGln: 0.772 ± 0.0
1.802CysArg: 1.802 ± 0.0
1.802CysSer: 1.802 ± 0.0
2.188CysThr: 2.188 ± 0.0
1.416CysVal: 1.416 ± 0.0
0.515CysTrp: 0.515 ± 0.0
1.931CysTyr: 1.931 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.317AspAla: 2.317 ± 0.0
1.544AspCys: 1.544 ± 0.0
3.218AspAsp: 3.218 ± 0.0
4.247AspGlu: 4.247 ± 0.0
3.99AspPhe: 3.99 ± 0.0
2.188AspGly: 2.188 ± 0.0
1.802AspHis: 1.802 ± 0.0
3.861AspIle: 3.861 ± 0.0
4.505AspLys: 4.505 ± 0.0
6.049AspLeu: 6.049 ± 0.0
1.287AspMet: 1.287 ± 0.0
2.831AspAsn: 2.831 ± 0.0
1.287AspPro: 1.287 ± 0.0
1.03AspGln: 1.03 ± 0.0
1.287AspArg: 1.287 ± 0.0
3.604AspSer: 3.604 ± 0.0
2.188AspThr: 2.188 ± 0.0
5.019AspVal: 5.019 ± 0.0
0.772AspTrp: 0.772 ± 0.0
3.346AspTyr: 3.346 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.317GluAla: 2.317 ± 0.0
0.901GluCys: 0.901 ± 0.0
4.118GluAsp: 4.118 ± 0.0
5.405GluGlu: 5.405 ± 0.0
3.604GluPhe: 3.604 ± 0.0
3.346GluGly: 3.346 ± 0.0
1.158GluHis: 1.158 ± 0.0
5.792GluIle: 5.792 ± 0.0
8.623GluLys: 8.623 ± 0.0
6.95GluLeu: 6.95 ± 0.0
1.416GluMet: 1.416 ± 0.0
3.346GluAsn: 3.346 ± 0.0
1.158GluPro: 1.158 ± 0.0
1.673GluGln: 1.673 ± 0.0
2.703GluArg: 2.703 ± 0.0
4.633GluSer: 4.633 ± 0.0
3.475GluThr: 3.475 ± 0.0
4.891GluVal: 4.891 ± 0.0
0.515GluTrp: 0.515 ± 0.0
2.96GluTyr: 2.96 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.574PheAla: 2.574 ± 0.0
2.831PheCys: 2.831 ± 0.0
3.604PheAsp: 3.604 ± 0.0
4.376PheGlu: 4.376 ± 0.0
5.277PhePhe: 5.277 ± 0.0
4.633PheGly: 4.633 ± 0.0
1.931PheHis: 1.931 ± 0.0
2.831PheIle: 2.831 ± 0.0
3.861PheLys: 3.861 ± 0.0
5.277PheLeu: 5.277 ± 0.0
0.901PheMet: 0.901 ± 0.0
1.544PheAsn: 1.544 ± 0.0
1.673PhePro: 1.673 ± 0.0
1.802PheGln: 1.802 ± 0.0
2.317PheArg: 2.317 ± 0.0
5.92PheSer: 5.92 ± 0.0
2.445PheThr: 2.445 ± 0.0
4.376PheVal: 4.376 ± 0.0
0.901PheTrp: 0.901 ± 0.0
2.188PheTyr: 2.188 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.218GlyAla: 3.218 ± 0.0
1.802GlyCys: 1.802 ± 0.0
2.574GlyAsp: 2.574 ± 0.0
2.188GlyGlu: 2.188 ± 0.0
3.861GlyPhe: 3.861 ± 0.0
2.317GlyGly: 2.317 ± 0.0
1.03GlyHis: 1.03 ± 0.0
2.317GlyIle: 2.317 ± 0.0
3.99GlyLys: 3.99 ± 0.0
4.376GlyLeu: 4.376 ± 0.0
1.673GlyMet: 1.673 ± 0.0
3.475GlyAsn: 3.475 ± 0.0
1.802GlyPro: 1.802 ± 0.0
1.03GlyGln: 1.03 ± 0.0
1.931GlyArg: 1.931 ± 0.0
3.475GlySer: 3.475 ± 0.0
2.188GlyThr: 2.188 ± 0.0
3.861GlyVal: 3.861 ± 0.0
1.158GlyTrp: 1.158 ± 0.0
1.802GlyTyr: 1.802 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.544HisAla: 1.544 ± 0.0
1.544HisCys: 1.544 ± 0.0
2.059HisAsp: 2.059 ± 0.0
1.544HisGlu: 1.544 ± 0.0
0.386HisPhe: 0.386 ± 0.0
1.673HisGly: 1.673 ± 0.0
1.673HisHis: 1.673 ± 0.0
1.416HisIle: 1.416 ± 0.0
2.188HisLys: 2.188 ± 0.0
1.931HisLeu: 1.931 ± 0.0
0.257HisMet: 0.257 ± 0.0
0.901HisAsn: 0.901 ± 0.0
0.515HisPro: 0.515 ± 0.0
0.515HisGln: 0.515 ± 0.0
0.772HisArg: 0.772 ± 0.0
0.901HisSer: 0.901 ± 0.0
0.772HisThr: 0.772 ± 0.0
2.059HisVal: 2.059 ± 0.0
0.257HisTrp: 0.257 ± 0.0
0.772HisTyr: 0.772 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.188IleAla: 2.188 ± 0.0
2.703IleCys: 2.703 ± 0.0
3.861IleAsp: 3.861 ± 0.0
5.405IleGlu: 5.405 ± 0.0
5.663IlePhe: 5.663 ± 0.0
2.96IleGly: 2.96 ± 0.0
0.901IleHis: 0.901 ± 0.0
3.218IleIle: 3.218 ± 0.0
4.247IleLys: 4.247 ± 0.0
5.148IleLeu: 5.148 ± 0.0
0.644IleMet: 0.644 ± 0.0
3.346IleAsn: 3.346 ± 0.0
2.445IlePro: 2.445 ± 0.0
1.673IleGln: 1.673 ± 0.0
3.604IleArg: 3.604 ± 0.0
4.762IleSer: 4.762 ± 0.0
3.475IleThr: 3.475 ± 0.0
4.762IleVal: 4.762 ± 0.0
0.386IleTrp: 0.386 ± 0.0
3.218IleTyr: 3.218 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.604LysAla: 3.604 ± 0.0
1.673LysCys: 1.673 ± 0.0
3.861LysAsp: 3.861 ± 0.0
6.821LysGlu: 6.821 ± 0.0
4.247LysPhe: 4.247 ± 0.0
3.475LysGly: 3.475 ± 0.0
2.445LysHis: 2.445 ± 0.0
7.207LysIle: 7.207 ± 0.0
9.91LysLys: 9.91 ± 0.0
6.306LysLeu: 6.306 ± 0.0
2.188LysMet: 2.188 ± 0.0
4.376LysAsn: 4.376 ± 0.0
2.317LysPro: 2.317 ± 0.0
2.317LysGln: 2.317 ± 0.0
5.534LysArg: 5.534 ± 0.0
5.148LysSer: 5.148 ± 0.0
3.861LysThr: 3.861 ± 0.0
5.148LysVal: 5.148 ± 0.0
1.287LysTrp: 1.287 ± 0.0
2.574LysTyr: 2.574 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
4.891LeuAla: 4.891 ± 0.0
2.96LeuCys: 2.96 ± 0.0
5.405LeuAsp: 5.405 ± 0.0
6.306LeuGlu: 6.306 ± 0.0
4.762LeuPhe: 4.762 ± 0.0
3.861LeuGly: 3.861 ± 0.0
2.059LeuHis: 2.059 ± 0.0
6.049LeuIle: 6.049 ± 0.0
6.95LeuLys: 6.95 ± 0.0
7.851LeuLeu: 7.851 ± 0.0
2.317LeuMet: 2.317 ± 0.0
5.019LeuAsn: 5.019 ± 0.0
3.218LeuPro: 3.218 ± 0.0
3.218LeuGln: 3.218 ± 0.0
2.96LeuArg: 2.96 ± 0.0
7.851LeuSer: 7.851 ± 0.0
3.475LeuThr: 3.475 ± 0.0
5.792LeuVal: 5.792 ± 0.0
1.673LeuTrp: 1.673 ± 0.0
3.346LeuTyr: 3.346 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
0.901MetAla: 0.901 ± 0.0
0.644MetCys: 0.644 ± 0.0
1.287MetAsp: 1.287 ± 0.0
2.059MetGlu: 2.059 ± 0.0
0.644MetPhe: 0.644 ± 0.0
1.158MetGly: 1.158 ± 0.0
0.515MetHis: 0.515 ± 0.0
1.802MetIle: 1.802 ± 0.0
0.901MetLys: 0.901 ± 0.0
2.445MetLeu: 2.445 ± 0.0
0.515MetMet: 0.515 ± 0.0
1.03MetAsn: 1.03 ± 0.0
0.644MetPro: 0.644 ± 0.0
1.03MetGln: 1.03 ± 0.0
0.644MetArg: 0.644 ± 0.0
2.188MetSer: 2.188 ± 0.0
1.158MetThr: 1.158 ± 0.0
1.802MetVal: 1.802 ± 0.0
0.386MetTrp: 0.386 ± 0.0
0.386MetTyr: 0.386 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.317AsnAla: 2.317 ± 0.0
1.03AsnCys: 1.03 ± 0.0
2.445AsnAsp: 2.445 ± 0.0
2.96AsnGlu: 2.96 ± 0.0
2.831AsnPhe: 2.831 ± 0.0
2.445AsnGly: 2.445 ± 0.0
1.673AsnHis: 1.673 ± 0.0
3.218AsnIle: 3.218 ± 0.0
4.118AsnLys: 4.118 ± 0.0
6.692AsnLeu: 6.692 ± 0.0
0.515AsnMet: 0.515 ± 0.0
2.188AsnAsn: 2.188 ± 0.0
1.416AsnPro: 1.416 ± 0.0
1.287AsnGln: 1.287 ± 0.0
1.416AsnArg: 1.416 ± 0.0
2.188AsnSer: 2.188 ± 0.0
3.604AsnThr: 3.604 ± 0.0
3.99AsnVal: 3.99 ± 0.0
0.386AsnTrp: 0.386 ± 0.0
3.218AsnTyr: 3.218 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.158ProAla: 1.158 ± 0.0
1.03ProCys: 1.03 ± 0.0
2.059ProAsp: 2.059 ± 0.0
3.089ProGlu: 3.089 ± 0.0
1.673ProPhe: 1.673 ± 0.0
0.772ProGly: 0.772 ± 0.0
0.386ProHis: 0.386 ± 0.0
1.931ProIle: 1.931 ± 0.0
3.346ProLys: 3.346 ± 0.0
2.831ProLeu: 2.831 ± 0.0
0.129ProMet: 0.129 ± 0.0
1.544ProAsn: 1.544 ± 0.0
1.158ProPro: 1.158 ± 0.0
0.386ProGln: 0.386 ± 0.0
0.901ProArg: 0.901 ± 0.0
2.574ProSer: 2.574 ± 0.0
1.673ProThr: 1.673 ± 0.0
1.802ProVal: 1.802 ± 0.0
0.644ProTrp: 0.644 ± 0.0
0.901ProTyr: 0.901 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.802GlnAla: 1.802 ± 0.0
0.386GlnCys: 0.386 ± 0.0
0.772GlnAsp: 0.772 ± 0.0
1.03GlnGlu: 1.03 ± 0.0
2.317GlnPhe: 2.317 ± 0.0
1.673GlnGly: 1.673 ± 0.0
0.386GlnHis: 0.386 ± 0.0
1.158GlnIle: 1.158 ± 0.0
4.118GlnLys: 4.118 ± 0.0
1.931GlnLeu: 1.931 ± 0.0
0.901GlnMet: 0.901 ± 0.0
2.317GlnAsn: 2.317 ± 0.0
0.644GlnPro: 0.644 ± 0.0
0.901GlnGln: 0.901 ± 0.0
1.287GlnArg: 1.287 ± 0.0
1.673GlnSer: 1.673 ± 0.0
0.644GlnThr: 0.644 ± 0.0
1.673GlnVal: 1.673 ± 0.0
0.901GlnTrp: 0.901 ± 0.0
1.416GlnTyr: 1.416 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
1.931ArgAla: 1.931 ± 0.0
0.901ArgCys: 0.901 ± 0.0
1.544ArgAsp: 1.544 ± 0.0
3.218ArgGlu: 3.218 ± 0.0
3.475ArgPhe: 3.475 ± 0.0
2.574ArgGly: 2.574 ± 0.0
1.673ArgHis: 1.673 ± 0.0
2.703ArgIle: 2.703 ± 0.0
4.247ArgLys: 4.247 ± 0.0
4.118ArgLeu: 4.118 ± 0.0
1.03ArgMet: 1.03 ± 0.0
2.574ArgAsn: 2.574 ± 0.0
1.287ArgPro: 1.287 ± 0.0
1.03ArgGln: 1.03 ± 0.0
2.317ArgArg: 2.317 ± 0.0
2.703ArgSer: 2.703 ± 0.0
1.544ArgThr: 1.544 ± 0.0
3.475ArgVal: 3.475 ± 0.0
0.386ArgTrp: 0.386 ± 0.0
2.188ArgTyr: 2.188 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.732SerAla: 3.732 ± 0.0
2.188SerCys: 2.188 ± 0.0
4.376SerAsp: 4.376 ± 0.0
4.247SerGlu: 4.247 ± 0.0
5.148SerPhe: 5.148 ± 0.0
4.376SerGly: 4.376 ± 0.0
1.158SerHis: 1.158 ± 0.0
5.92SerIle: 5.92 ± 0.0
5.534SerLys: 5.534 ± 0.0
6.178SerLeu: 6.178 ± 0.0
2.059SerMet: 2.059 ± 0.0
3.218SerAsn: 3.218 ± 0.0
1.673SerPro: 1.673 ± 0.0
1.802SerGln: 1.802 ± 0.0
4.376SerArg: 4.376 ± 0.0
4.505SerSer: 4.505 ± 0.0
2.445SerThr: 2.445 ± 0.0
5.792SerVal: 5.792 ± 0.0
0.772SerTrp: 0.772 ± 0.0
4.376SerTyr: 4.376 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
2.317ThrAla: 2.317 ± 0.0
0.772ThrCys: 0.772 ± 0.0
2.831ThrAsp: 2.831 ± 0.0
2.574ThrGlu: 2.574 ± 0.0
2.188ThrPhe: 2.188 ± 0.0
3.089ThrGly: 3.089 ± 0.0
1.03ThrHis: 1.03 ± 0.0
2.445ThrIle: 2.445 ± 0.0
3.861ThrLys: 3.861 ± 0.0
3.346ThrLeu: 3.346 ± 0.0
1.416ThrMet: 1.416 ± 0.0
2.317ThrAsn: 2.317 ± 0.0
0.901ThrPro: 0.901 ± 0.0
2.059ThrGln: 2.059 ± 0.0
2.574ThrArg: 2.574 ± 0.0
4.633ThrSer: 4.633 ± 0.0
2.96ThrThr: 2.96 ± 0.0
3.604ThrVal: 3.604 ± 0.0
0.386ThrTrp: 0.386 ± 0.0
1.931ThrTyr: 1.931 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
3.218ValAla: 3.218 ± 0.0
2.188ValCys: 2.188 ± 0.0
4.376ValAsp: 4.376 ± 0.0
5.019ValGlu: 5.019 ± 0.0
4.247ValPhe: 4.247 ± 0.0
2.831ValGly: 2.831 ± 0.0
1.158ValHis: 1.158 ± 0.0
3.99ValIle: 3.99 ± 0.0
5.405ValLys: 5.405 ± 0.0
6.821ValLeu: 6.821 ± 0.0
2.059ValMet: 2.059 ± 0.0
3.089ValAsn: 3.089 ± 0.0
2.831ValPro: 2.831 ± 0.0
2.188ValGln: 2.188 ± 0.0
2.445ValArg: 2.445 ± 0.0
6.306ValSer: 6.306 ± 0.0
4.247ValThr: 4.247 ± 0.0
7.079ValVal: 7.079 ± 0.0
1.158ValTrp: 1.158 ± 0.0
2.703ValTyr: 2.703 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.386TrpAla: 0.386 ± 0.0
0.257TrpCys: 0.257 ± 0.0
0.901TrpAsp: 0.901 ± 0.0
0.772TrpGlu: 0.772 ± 0.0
0.901TrpPhe: 0.901 ± 0.0
0.129TrpGly: 0.129 ± 0.0
0.257TrpHis: 0.257 ± 0.0
1.287TrpIle: 1.287 ± 0.0
1.287TrpLys: 1.287 ± 0.0
1.544TrpLeu: 1.544 ± 0.0
0.257TrpMet: 0.257 ± 0.0
1.03TrpAsn: 1.03 ± 0.0
0.644TrpPro: 0.644 ± 0.0
0.129TrpGln: 0.129 ± 0.0
0.772TrpArg: 0.772 ± 0.0
1.416TrpSer: 1.416 ± 0.0
0.515TrpThr: 0.515 ± 0.0
0.515TrpVal: 0.515 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.901TrpTyr: 0.901 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.802TyrAla: 1.802 ± 0.0
1.03TyrCys: 1.03 ± 0.0
3.99TyrAsp: 3.99 ± 0.0
2.703TyrGlu: 2.703 ± 0.0
1.802TyrPhe: 1.802 ± 0.0
2.188TyrGly: 2.188 ± 0.0
1.287TyrHis: 1.287 ± 0.0
3.346TyrIle: 3.346 ± 0.0
2.96TyrLys: 2.96 ± 0.0
3.732TyrLeu: 3.732 ± 0.0
0.129TyrMet: 0.129 ± 0.0
2.831TyrAsn: 2.831 ± 0.0
1.544TyrPro: 1.544 ± 0.0
1.416TyrGln: 1.416 ± 0.0
2.574TyrArg: 2.574 ± 0.0
4.118TyrSer: 4.118 ± 0.0
1.287TyrThr: 1.287 ± 0.0
2.188TyrVal: 2.188 ± 0.0
1.03TyrTrp: 1.03 ± 0.0
2.059TyrTyr: 2.059 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (7771 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski