Amino acid dipepetide frequency for Changjiang picorna-like virus 5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.106AlaAla: 6.106 ± 0.0
0.339AlaCys: 0.339 ± 0.0
3.731AlaAsp: 3.731 ± 0.0
3.053AlaGlu: 3.053 ± 0.0
2.374AlaPhe: 2.374 ± 0.0
5.088AlaGly: 5.088 ± 0.0
0.678AlaHis: 0.678 ± 0.0
2.714AlaIle: 2.714 ± 0.0
4.749AlaLys: 4.749 ± 0.0
6.106AlaLeu: 6.106 ± 0.0
2.035AlaMet: 2.035 ± 0.0
4.071AlaAsn: 4.071 ± 0.0
5.427AlaPro: 5.427 ± 0.0
1.696AlaGln: 1.696 ± 0.0
2.714AlaArg: 2.714 ± 0.0
4.071AlaSer: 4.071 ± 0.0
5.088AlaThr: 5.088 ± 0.0
3.053AlaVal: 3.053 ± 0.0
1.018AlaTrp: 1.018 ± 0.0
2.374AlaTyr: 2.374 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.678CysAla: 0.678 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.678CysAsp: 0.678 ± 0.0
1.018CysGlu: 1.018 ± 0.0
1.018CysPhe: 1.018 ± 0.0
1.357CysGly: 1.357 ± 0.0
0.339CysHis: 0.339 ± 0.0
1.357CysIle: 1.357 ± 0.0
0.678CysLys: 0.678 ± 0.0
1.357CysLeu: 1.357 ± 0.0
0.0CysMet: 0.0 ± 0.0
2.035CysAsn: 2.035 ± 0.0
0.339CysPro: 0.339 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.339CysArg: 0.339 ± 0.0
1.696CysSer: 1.696 ± 0.0
1.018CysThr: 1.018 ± 0.0
2.374CysVal: 2.374 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.696AspAla: 1.696 ± 0.0
0.678AspCys: 0.678 ± 0.0
4.41AspAsp: 4.41 ± 0.0
5.767AspGlu: 5.767 ± 0.0
3.392AspPhe: 3.392 ± 0.0
0.678AspGly: 0.678 ± 0.0
1.696AspHis: 1.696 ± 0.0
3.392AspIle: 3.392 ± 0.0
2.714AspLys: 2.714 ± 0.0
5.088AspLeu: 5.088 ± 0.0
1.696AspMet: 1.696 ± 0.0
2.374AspAsn: 2.374 ± 0.0
2.714AspPro: 2.714 ± 0.0
1.357AspGln: 1.357 ± 0.0
1.696AspArg: 1.696 ± 0.0
2.714AspSer: 2.714 ± 0.0
3.392AspThr: 3.392 ± 0.0
4.41AspVal: 4.41 ± 0.0
0.339AspTrp: 0.339 ± 0.0
1.696AspTyr: 1.696 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.071GluAla: 4.071 ± 0.0
2.035GluCys: 2.035 ± 0.0
2.714GluAsp: 2.714 ± 0.0
4.071GluGlu: 4.071 ± 0.0
2.714GluPhe: 2.714 ± 0.0
1.696GluGly: 1.696 ± 0.0
1.696GluHis: 1.696 ± 0.0
3.731GluIle: 3.731 ± 0.0
3.392GluLys: 3.392 ± 0.0
6.106GluLeu: 6.106 ± 0.0
3.392GluMet: 3.392 ± 0.0
3.053GluAsn: 3.053 ± 0.0
4.071GluPro: 4.071 ± 0.0
2.714GluGln: 2.714 ± 0.0
1.357GluArg: 1.357 ± 0.0
4.41GluSer: 4.41 ± 0.0
3.053GluThr: 3.053 ± 0.0
4.071GluVal: 4.071 ± 0.0
0.0GluTrp: 0.0 ± 0.0
1.357GluTyr: 1.357 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
5.088PheAla: 5.088 ± 0.0
1.696PheCys: 1.696 ± 0.0
1.357PheAsp: 1.357 ± 0.0
2.035PheGlu: 2.035 ± 0.0
1.696PhePhe: 1.696 ± 0.0
2.374PheGly: 2.374 ± 0.0
0.678PheHis: 0.678 ± 0.0
3.053PheIle: 3.053 ± 0.0
3.392PheLys: 3.392 ± 0.0
3.053PheLeu: 3.053 ± 0.0
0.678PheMet: 0.678 ± 0.0
1.696PheAsn: 1.696 ± 0.0
1.357PhePro: 1.357 ± 0.0
3.053PheGln: 3.053 ± 0.0
2.374PheArg: 2.374 ± 0.0
3.053PheSer: 3.053 ± 0.0
3.392PheThr: 3.392 ± 0.0
2.714PheVal: 2.714 ± 0.0
0.339PheTrp: 0.339 ± 0.0
2.035PheTyr: 2.035 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.749GlyAla: 4.749 ± 0.0
1.018GlyCys: 1.018 ± 0.0
3.053GlyAsp: 3.053 ± 0.0
1.357GlyGlu: 1.357 ± 0.0
2.035GlyPhe: 2.035 ± 0.0
4.071GlyGly: 4.071 ± 0.0
1.018GlyHis: 1.018 ± 0.0
2.714GlyIle: 2.714 ± 0.0
2.714GlyLys: 2.714 ± 0.0
2.374GlyLeu: 2.374 ± 0.0
1.696GlyMet: 1.696 ± 0.0
3.392GlyAsn: 3.392 ± 0.0
2.374GlyPro: 2.374 ± 0.0
1.018GlyGln: 1.018 ± 0.0
1.696GlyArg: 1.696 ± 0.0
6.445GlySer: 6.445 ± 0.0
5.767GlyThr: 5.767 ± 0.0
2.714GlyVal: 2.714 ± 0.0
1.696GlyTrp: 1.696 ± 0.0
2.374GlyTyr: 2.374 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.678HisAla: 0.678 ± 0.0
0.339HisCys: 0.339 ± 0.0
0.0HisAsp: 0.0 ± 0.0
2.035HisGlu: 2.035 ± 0.0
1.018HisPhe: 1.018 ± 0.0
1.018HisGly: 1.018 ± 0.0
0.339HisHis: 0.339 ± 0.0
1.357HisIle: 1.357 ± 0.0
1.018HisLys: 1.018 ± 0.0
3.392HisLeu: 3.392 ± 0.0
1.357HisMet: 1.357 ± 0.0
0.339HisAsn: 0.339 ± 0.0
2.374HisPro: 2.374 ± 0.0
0.339HisGln: 0.339 ± 0.0
1.018HisArg: 1.018 ± 0.0
1.696HisSer: 1.696 ± 0.0
1.018HisThr: 1.018 ± 0.0
2.035HisVal: 2.035 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.696HisTyr: 1.696 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.714IleAla: 2.714 ± 0.0
0.678IleCys: 0.678 ± 0.0
3.392IleAsp: 3.392 ± 0.0
3.053IleGlu: 3.053 ± 0.0
3.392IlePhe: 3.392 ± 0.0
3.392IleGly: 3.392 ± 0.0
1.696IleHis: 1.696 ± 0.0
2.714IleIle: 2.714 ± 0.0
2.374IleLys: 2.374 ± 0.0
4.41IleLeu: 4.41 ± 0.0
1.696IleMet: 1.696 ± 0.0
3.731IleAsn: 3.731 ± 0.0
3.053IlePro: 3.053 ± 0.0
3.053IleGln: 3.053 ± 0.0
2.374IleArg: 2.374 ± 0.0
4.749IleSer: 4.749 ± 0.0
4.749IleThr: 4.749 ± 0.0
4.41IleVal: 4.41 ± 0.0
0.0IleTrp: 0.0 ± 0.0
2.714IleTyr: 2.714 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.392LysAla: 3.392 ± 0.0
0.339LysCys: 0.339 ± 0.0
2.035LysAsp: 2.035 ± 0.0
2.714LysGlu: 2.714 ± 0.0
3.053LysPhe: 3.053 ± 0.0
3.053LysGly: 3.053 ± 0.0
1.696LysHis: 1.696 ± 0.0
4.071LysIle: 4.071 ± 0.0
5.427LysLys: 5.427 ± 0.0
5.767LysLeu: 5.767 ± 0.0
2.035LysMet: 2.035 ± 0.0
2.714LysAsn: 2.714 ± 0.0
3.392LysPro: 3.392 ± 0.0
2.714LysGln: 2.714 ± 0.0
3.053LysArg: 3.053 ± 0.0
3.392LysSer: 3.392 ± 0.0
5.427LysThr: 5.427 ± 0.0
3.053LysVal: 3.053 ± 0.0
0.678LysTrp: 0.678 ± 0.0
1.357LysTyr: 1.357 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
3.731LeuAla: 3.731 ± 0.0
3.731LeuCys: 3.731 ± 0.0
2.714LeuAsp: 2.714 ± 0.0
5.767LeuGlu: 5.767 ± 0.0
3.731LeuPhe: 3.731 ± 0.0
5.427LeuGly: 5.427 ± 0.0
1.696LeuHis: 1.696 ± 0.0
5.088LeuIle: 5.088 ± 0.0
6.784LeuLys: 6.784 ± 0.0
7.463LeuLeu: 7.463 ± 0.0
2.035LeuMet: 2.035 ± 0.0
5.427LeuAsn: 5.427 ± 0.0
3.053LeuPro: 3.053 ± 0.0
2.714LeuGln: 2.714 ± 0.0
2.374LeuArg: 2.374 ± 0.0
7.463LeuSer: 7.463 ± 0.0
6.784LeuThr: 6.784 ± 0.0
5.427LeuVal: 5.427 ± 0.0
0.678LeuTrp: 0.678 ± 0.0
1.018LeuTyr: 1.018 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.714MetAla: 2.714 ± 0.0
0.0MetCys: 0.0 ± 0.0
2.374MetAsp: 2.374 ± 0.0
2.035MetGlu: 2.035 ± 0.0
1.357MetPhe: 1.357 ± 0.0
2.035MetGly: 2.035 ± 0.0
0.678MetHis: 0.678 ± 0.0
1.357MetIle: 1.357 ± 0.0
1.357MetLys: 1.357 ± 0.0
0.678MetLeu: 0.678 ± 0.0
0.0MetMet: 0.0 ± 0.0
1.696MetAsn: 1.696 ± 0.0
2.714MetPro: 2.714 ± 0.0
1.696MetGln: 1.696 ± 0.0
2.035MetArg: 2.035 ± 0.0
2.374MetSer: 2.374 ± 0.0
2.374MetThr: 2.374 ± 0.0
1.357MetVal: 1.357 ± 0.0
0.678MetTrp: 0.678 ± 0.0
1.357MetTyr: 1.357 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.731AsnAla: 3.731 ± 0.0
1.696AsnCys: 1.696 ± 0.0
3.392AsnAsp: 3.392 ± 0.0
3.053AsnGlu: 3.053 ± 0.0
1.018AsnPhe: 1.018 ± 0.0
3.053AsnGly: 3.053 ± 0.0
1.357AsnHis: 1.357 ± 0.0
3.731AsnIle: 3.731 ± 0.0
2.035AsnLys: 2.035 ± 0.0
4.071AsnLeu: 4.071 ± 0.0
2.374AsnMet: 2.374 ± 0.0
2.035AsnAsn: 2.035 ± 0.0
2.714AsnPro: 2.714 ± 0.0
3.731AsnGln: 3.731 ± 0.0
2.714AsnArg: 2.714 ± 0.0
4.41AsnSer: 4.41 ± 0.0
2.714AsnThr: 2.714 ± 0.0
4.41AsnVal: 4.41 ± 0.0
0.0AsnTrp: 0.0 ± 0.0
1.696AsnTyr: 1.696 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.053ProAla: 3.053 ± 0.0
0.0ProCys: 0.0 ± 0.0
3.053ProAsp: 3.053 ± 0.0
2.374ProGlu: 2.374 ± 0.0
2.374ProPhe: 2.374 ± 0.0
2.035ProGly: 2.035 ± 0.0
1.018ProHis: 1.018 ± 0.0
3.053ProIle: 3.053 ± 0.0
2.714ProLys: 2.714 ± 0.0
3.731ProLeu: 3.731 ± 0.0
1.357ProMet: 1.357 ± 0.0
1.357ProAsn: 1.357 ± 0.0
3.731ProPro: 3.731 ± 0.0
0.678ProGln: 0.678 ± 0.0
1.696ProArg: 1.696 ± 0.0
2.374ProSer: 2.374 ± 0.0
4.071ProThr: 4.071 ± 0.0
5.427ProVal: 5.427 ± 0.0
0.339ProTrp: 0.339 ± 0.0
3.392ProTyr: 3.392 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.053GlnAla: 3.053 ± 0.0
0.339GlnCys: 0.339 ± 0.0
2.714GlnAsp: 2.714 ± 0.0
1.696GlnGlu: 1.696 ± 0.0
1.357GlnPhe: 1.357 ± 0.0
1.696GlnGly: 1.696 ± 0.0
2.374GlnHis: 2.374 ± 0.0
1.696GlnIle: 1.696 ± 0.0
1.357GlnLys: 1.357 ± 0.0
3.392GlnLeu: 3.392 ± 0.0
0.678GlnMet: 0.678 ± 0.0
2.714GlnAsn: 2.714 ± 0.0
1.357GlnPro: 1.357 ± 0.0
3.392GlnGln: 3.392 ± 0.0
1.696GlnArg: 1.696 ± 0.0
1.696GlnSer: 1.696 ± 0.0
2.714GlnThr: 2.714 ± 0.0
2.714GlnVal: 2.714 ± 0.0
1.018GlnTrp: 1.018 ± 0.0
2.714GlnTyr: 2.714 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.731ArgAla: 3.731 ± 0.0
0.0ArgCys: 0.0 ± 0.0
2.035ArgAsp: 2.035 ± 0.0
3.392ArgGlu: 3.392 ± 0.0
1.357ArgPhe: 1.357 ± 0.0
1.018ArgGly: 1.018 ± 0.0
1.357ArgHis: 1.357 ± 0.0
1.696ArgIle: 1.696 ± 0.0
1.696ArgLys: 1.696 ± 0.0
4.071ArgLeu: 4.071 ± 0.0
0.339ArgMet: 0.339 ± 0.0
3.392ArgAsn: 3.392 ± 0.0
2.035ArgPro: 2.035 ± 0.0
0.678ArgGln: 0.678 ± 0.0
4.071ArgArg: 4.071 ± 0.0
1.357ArgSer: 1.357 ± 0.0
4.41ArgThr: 4.41 ± 0.0
2.714ArgVal: 2.714 ± 0.0
0.678ArgTrp: 0.678 ± 0.0
2.035ArgTyr: 2.035 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
6.106SerAla: 6.106 ± 0.0
0.678SerCys: 0.678 ± 0.0
2.714SerAsp: 2.714 ± 0.0
3.731SerGlu: 3.731 ± 0.0
3.053SerPhe: 3.053 ± 0.0
4.071SerGly: 4.071 ± 0.0
1.696SerHis: 1.696 ± 0.0
5.427SerIle: 5.427 ± 0.0
4.071SerLys: 4.071 ± 0.0
6.106SerLeu: 6.106 ± 0.0
3.053SerMet: 3.053 ± 0.0
2.035SerAsn: 2.035 ± 0.0
2.374SerPro: 2.374 ± 0.0
4.071SerGln: 4.071 ± 0.0
2.714SerArg: 2.714 ± 0.0
4.41SerSer: 4.41 ± 0.0
4.749SerThr: 4.749 ± 0.0
6.784SerVal: 6.784 ± 0.0
1.357SerTrp: 1.357 ± 0.0
1.696SerTyr: 1.696 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
5.427ThrAla: 5.427 ± 0.0
1.018ThrCys: 1.018 ± 0.0
4.41ThrAsp: 4.41 ± 0.0
4.071ThrGlu: 4.071 ± 0.0
4.41ThrPhe: 4.41 ± 0.0
5.767ThrGly: 5.767 ± 0.0
1.696ThrHis: 1.696 ± 0.0
5.767ThrIle: 5.767 ± 0.0
3.731ThrLys: 3.731 ± 0.0
6.445ThrLeu: 6.445 ± 0.0
2.374ThrMet: 2.374 ± 0.0
3.731ThrAsn: 3.731 ± 0.0
2.035ThrPro: 2.035 ± 0.0
2.035ThrGln: 2.035 ± 0.0
2.374ThrArg: 2.374 ± 0.0
5.767ThrSer: 5.767 ± 0.0
6.445ThrThr: 6.445 ± 0.0
5.088ThrVal: 5.088 ± 0.0
0.678ThrTrp: 0.678 ± 0.0
2.714ThrTyr: 2.714 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.749ValAla: 4.749 ± 0.0
0.0ValCys: 0.0 ± 0.0
4.749ValAsp: 4.749 ± 0.0
4.749ValGlu: 4.749 ± 0.0
3.392ValPhe: 3.392 ± 0.0
4.071ValGly: 4.071 ± 0.0
1.018ValHis: 1.018 ± 0.0
3.731ValIle: 3.731 ± 0.0
3.053ValLys: 3.053 ± 0.0
6.445ValLeu: 6.445 ± 0.0
2.035ValMet: 2.035 ± 0.0
4.41ValAsn: 4.41 ± 0.0
2.714ValPro: 2.714 ± 0.0
2.374ValGln: 2.374 ± 0.0
2.374ValArg: 2.374 ± 0.0
5.427ValSer: 5.427 ± 0.0
5.088ValThr: 5.088 ± 0.0
6.445ValVal: 6.445 ± 0.0
1.696ValTrp: 1.696 ± 0.0
3.053ValTyr: 3.053 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.339TrpAla: 0.339 ± 0.0
0.678TrpCys: 0.678 ± 0.0
0.339TrpAsp: 0.339 ± 0.0
0.678TrpGlu: 0.678 ± 0.0
0.678TrpPhe: 0.678 ± 0.0
0.678TrpGly: 0.678 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.678TrpIle: 0.678 ± 0.0
1.696TrpLys: 1.696 ± 0.0
0.339TrpLeu: 0.339 ± 0.0
0.339TrpMet: 0.339 ± 0.0
1.696TrpAsn: 1.696 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.018TrpArg: 1.018 ± 0.0
0.678TrpSer: 0.678 ± 0.0
1.696TrpThr: 1.696 ± 0.0
0.678TrpVal: 0.678 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.678TyrAla: 0.678 ± 0.0
1.018TyrCys: 1.018 ± 0.0
2.035TyrAsp: 2.035 ± 0.0
3.053TyrGlu: 3.053 ± 0.0
1.696TyrPhe: 1.696 ± 0.0
1.696TyrGly: 1.696 ± 0.0
0.339TyrHis: 0.339 ± 0.0
1.018TyrIle: 1.018 ± 0.0
4.41TyrLys: 4.41 ± 0.0
2.714TyrLeu: 2.714 ± 0.0
1.357TyrMet: 1.357 ± 0.0
2.035TyrAsn: 2.035 ± 0.0
0.339TyrPro: 0.339 ± 0.0
3.053TyrGln: 3.053 ± 0.0
2.374TyrArg: 2.374 ± 0.0
2.714TyrSer: 2.714 ± 0.0
2.035TyrThr: 2.035 ± 0.0
1.696TyrVal: 1.696 ± 0.0
1.018TyrTrp: 1.018 ± 0.0
1.357TyrTyr: 1.357 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2949 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski