Amino acid dipeptide frequency for Changjiang picorna-like virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
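The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function names, the toy sequence, and the choice to count overlapping adjacent pairs are assumptions. The last assumption does appear consistent with the table, since a 2278-residue protein has 2277 adjacent pairs and 1000/2277 ≈ 0.439‰ is the smallest non-zero value listed.

```python
from collections import Counter
from itertools import product

# The 20 standard residues as one-letter codes (Xaa / non-standard residues are left out here).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(sequence):
    """Return {dipeptide: frequency in per mille}, counting overlapping adjacent pairs.

    Hypothetical helper: pairs containing non-standard letters are counted in the
    total but not reported, which mirrors treating them as Xaa.
    """
    pairs = [sequence[i:i + 2] for i in range(len(sequence) - 1)]
    if not pairs:
        raise ValueError("sequence must contain at least two residues")
    counts = Counter(pairs)
    total = len(pairs)
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq_permille, expected_permille=1000.0 / 400):
    """Compare a frequency with the uniform expectation of 2.5 per mille (0.25%)."""
    if freq_permille > expected_permille:
        return "over"
    if freq_permille < expected_permille:
        return "under"
    return "expected"


# Toy example (hypothetical sequence, not the viral polyprotein).
toy = "MKTAYIAKQR"
freqs = dipeptide_permille(toy)
```

Applied to values from the table, `classify(4.392)` reports an overrepresented dipeptide and `classify(2.196)` an underrepresented one, relative to the uniform 2.5‰ baseline.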

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second. Values are frequencies in ‰; the estimated error is ± 0.0 for every dipeptide.

     Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  2.196  1.318  4.392  3.513  1.757  5.709  2.635  4.392  2.196  3.074  1.757  4.831  1.318  1.318  3.074  3.074  4.392  2.635  0.439  2.196  0.0
Cys  1.318  0.878  1.318  1.318  0.439  3.513  0.0    1.757  1.318  2.196  0.878  1.318  1.318  0.0    1.318  0.878  1.318  0.439  0.439  0.439  0.0
Asp  2.196  2.196  4.392  4.831  4.831  5.27   0.878  4.831  2.635  2.196  2.196  1.318  3.513  2.196  3.513  2.635  1.318  6.148  1.318  3.953  0.0
Glu  3.074  2.196  2.635  2.635  3.513  3.513  1.318  3.513  3.074  3.513  0.878  3.513  1.757  2.196  1.318  1.757  2.196  1.318  0.878  1.757  0.0
Phe  1.757  2.196  3.953  3.513  0.439  2.196  1.757  3.953  3.074  3.074  1.318  4.392  1.318  1.318  1.318  2.635  3.953  2.635  0.0    1.318  0.0
Gly  4.392  0.439  3.513  3.953  1.757  1.318  1.757  6.148  6.148  3.513  1.757  1.757  2.635  1.757  2.635  7.027  3.953  4.831  0.878  3.953  0.0
His  0.878  0.439  0.439  0.0    0.878  1.318  0.439  2.196  1.757  2.196  0.0    1.757  1.757  1.318  3.074  5.27   0.439  1.318  0.0    1.318  0.0
Ile  6.588  1.318  4.831  3.953  2.196  3.513  2.196  3.074  4.392  3.513  1.757  6.148  1.757  4.831  2.635  5.709  3.513  6.148  1.318  2.196  0.0
Lys  1.757  0.878  5.27   2.196  1.318  2.196  2.196  5.709  3.513  4.831  2.196  2.196  1.757  0.878  3.513  6.148  4.831  2.635  0.439  2.196  0.0
Leu  6.148  1.318  7.027  3.513  1.757  3.953  2.196  7.027  2.635  4.392  2.196  3.513  3.074  2.635  4.392  4.831  5.709  4.831  0.878  2.635  0.0
Met  1.318  0.0    2.635  1.318  2.196  3.074  0.439  2.635  1.757  2.196  0.878  1.757  0.0    0.878  2.196  1.318  1.318  0.878  0.439  1.757  0.0
Asn  2.635  0.878  2.635  2.635  4.392  4.392  0.878  2.196  1.318  4.831  3.513  4.831  3.513  1.318  2.196  3.513  4.392  2.635  0.878  4.831  0.0
Pro  1.757  0.439  1.757  1.318  2.635  2.635  1.757  3.074  1.757  4.831  0.878  2.196  0.878  0.878  1.318  3.513  1.757  4.831  0.878  2.196  0.0
Gln  1.757  0.439  1.757  2.635  1.757  2.635  0.878  1.757  1.757  3.953  0.0    1.757  0.439  0.439  1.757  3.513  2.196  0.439  0.439  2.196  0.0
Arg  3.074  1.318  1.757  2.196  1.757  4.392  1.318  3.953  3.953  1.757  1.318  3.074  2.635  0.878  1.757  3.513  2.196  3.953  0.0    2.635  0.0
Ser  5.27   1.757  3.074  2.196  3.953  3.953  1.318  4.392  5.27   8.783  2.196  4.392  2.635  1.757  3.074  4.831  7.905  5.27   0.439  3.074  0.0
Thr  2.196  1.318  2.196  0.439  2.196  3.513  2.196  4.831  3.074  3.074  1.318  3.074  5.27   2.196  2.196  6.148  1.318  4.831  1.318  6.148  0.0
Val  4.392  1.757  3.513  2.196  3.953  3.953  0.439  3.953  3.513  5.709  1.757  4.392  3.953  3.513  2.635  4.392  3.513  4.831  0.0    2.635  0.0
Trp  0.878  0.0    2.196  0.439  1.318  0.0    0.439  0.878  1.318  0.878  0.439  0.0    0.0    0.439  0.878  1.318  0.0    0.0    0.0    0.878  0.0
Tyr  2.635  1.318  3.074  1.757  3.074  3.074  1.757  1.318  2.196  6.148  1.318  2.196  1.757  1.757  2.635  3.513  2.635  4.831  1.318  3.953  0.0
Xaa  0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (2278 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
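A minimal sketch of what bootstrapping at the protein level could look like is given below. It is an assumption about the general technique, not the database's actual procedure: the function names, the pooling of dipeptide counts across the resampled proteins, and the use of the population standard deviation as the error are all illustrative choices. It also shows why every error in the table is ± 0.0: with a single protein, each resample contains that same protein, so all 100 estimates are identical.

```python
import random
import statistics


def pooled_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide, pooled over all proteins in the list."""
    count = 0
    total = 0
    for seq in proteins:
        n_pairs = max(len(seq) - 1, 0)
        total += n_pairs
        count += sum(1 for i in range(n_pairs) if seq[i:i + 2] == dipeptide)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples.

    Each resample draws len(proteins) proteins with replacement (hypothetical setup).
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pooled_permille(sample, dipeptide))
    return statistics.pstdev(estimates)


# One-protein proteome (toy sequence): every resample is the same protein.
single = ["MKTAYIAKQR"]
error_single = bootstrap_error(single, "AK")

# Several toy proteins: resamples differ, so the error is generally non-zero.
several = ["MKTAYIAKQR", "AKAKAKAK", "GGSGGSGG"]
error_several = bootstrap_error(several, "AK")
```

For proteomes with many proteins the same procedure yields a non-zero spread, which is what the ± column is meant to convey.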

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski