Amino acid dipeptide frequency for Hubei picorna-like virus 34

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
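As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping pairs of adjacent residues in a sequence and reports each as per mille of all pairs. The function name `dipeptide_frequencies`, the use of one-letter codes (the table uses three-letter codes), and the choice of overlapping windows are assumptions made for illustration; the page does not state how the database computes its values.

```python
from collections import Counter


def dipeptide_frequencies(sequence: str) -> dict[str, float]:
    """Return the per mille frequency of each overlapping dipeptide.

    Hypothetical helper: assumes one-letter residue codes and
    overlapping windows (positions 0-1, 1-2, 2-3, ...).
    """
    pairs = [sequence[i:i + 2] for i in range(len(sequence) - 1)]
    if not pairs:
        return {}
    counts = Counter(pairs)
    total = len(pairs)
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequence, not taken from the virus proteome.
freqs = dipeptide_frequencies("MAAVLA")
for pair, value in sorted(freqs.items()):
    print(pair, value)
```

A sequence of n residues yields n - 1 overlapping pairs, so the toy sequence above has five dipeptides.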

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would appear with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red and those that are underrepresented are marked in blue in the table.
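The arithmetic behind the expected value can be sketched as follows. The helper names `expected_permille` and `classify` are hypothetical, and comparing each value directly against the uniform expectation is an assumption about how the red and blue marking works; the page does not give the exact rule.

```python
def expected_permille(alphabet_size: int) -> float:
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille: float, expected: float) -> str:
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "overrepresented"
    if observed_permille < expected:
        return "underrepresented"
    return "as expected"


uniform_20 = expected_permille(20)   # 400 possible dipeptides
uniform_21 = expected_permille(21)   # 441 when Xaa is included

print(uniform_20)                    # 2.5
print(round(uniform_21, 3))          # 2.268
print(classify(9.391, uniform_20))   # ValVal from the table
print(classify(0.348, uniform_20))   # ValTrp from the table
```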

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.
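For illustration, the unit conversion can be written as two small helpers. The function names are hypothetical and the example value is the AlaAla entry from the table.

```python
def permille_to_fraction(value: float) -> float:
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def permille_to_percent(value: float) -> float:
    """Convert a per mille value to percent (1% = 10 per mille)."""
    return value / 10.0


ala_ala = 6.609  # AlaAla entry, in per mille
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```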

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰). Rows give the first residue of the dipeptide and columns the second; every value carries an estimated error of ± 0.0.

     Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  6.609  0.696  1.391  2.435  3.13   3.478  0.696  3.478  2.087  3.478  3.13   3.826  4.174  3.826  1.043  4.174  5.913  4.522  1.391  1.391  0.0
Cys  2.783  0.696  1.043  1.043  1.043  2.435  0.0    0.696  0.348  1.739  0.696  1.043  1.391  0.0    0.348  0.348  1.043  3.478  0.348  0.696  0.0
Asp  2.087  2.087  4.87   3.478  4.522  2.783  1.391  2.087  1.739  5.565  0.696  1.043  2.435  0.696  1.739  3.13   3.13   2.435  1.739  1.043  0.0
Glu  3.478  1.391  2.087  4.87   2.783  4.87   1.043  4.522  4.174  6.609  0.696  3.13   1.043  2.087  4.174  2.435  3.826  5.913  2.783  0.696  0.0
Phe  1.043  1.391  1.043  3.826  1.043  3.13   1.043  1.391  2.783  4.522  1.391  2.087  2.783  2.435  2.783  3.478  3.478  5.913  0.348  1.391  0.0
Gly  4.522  0.696  3.478  1.739  4.174  5.913  0.696  3.826  2.087  6.261  2.087  2.435  1.391  1.739  2.087  6.957  3.826  7.652  1.043  2.087  0.0
His  0.348  1.043  0.696  0.696  0.348  1.043  0.696  1.043  1.391  2.435  0.696  0.348  0.348  0.696  0.348  1.043  1.043  2.087  0.0    1.043  0.0
Ile  4.174  1.391  1.739  3.13   1.391  2.783  1.043  2.435  4.174  4.174  2.435  2.435  4.522  1.739  3.13   1.391  3.13   4.174  1.043  1.391  0.0
Lys  2.435  1.739  4.174  3.13   1.391  4.174  0.348  3.478  2.435  5.565  3.13   2.435  1.043  1.043  2.435  3.478  5.217  2.087  0.696  2.087  0.0
Leu  4.87   4.174  6.261  5.217  4.174  5.913  1.043  4.522  5.565  6.261  2.783  3.478  3.826  3.478  5.913  5.217  3.826  6.609  1.043  4.522  0.0
Met  2.435  0.0    1.739  2.435  0.696  1.391  1.391  1.739  1.391  1.739  1.739  1.739  2.435  1.043  2.087  3.13   3.826  2.435  0.348  1.391  0.0
Asn  1.739  1.043  1.391  3.478  3.13   2.783  0.696  2.783  2.783  3.13   1.043  0.696  3.478  1.043  2.783  5.565  1.043  3.13   1.391  2.435  0.0
Pro  2.087  0.0    0.348  2.087  3.13   2.435  1.391  2.435  2.783  3.826  1.391  2.435  1.739  1.739  3.826  2.087  3.13   3.826  0.348  3.826  0.0
Gln  2.435  0.0    1.739  1.043  2.087  2.435  0.696  1.043  2.087  6.261  0.696  0.696  0.348  1.391  1.391  1.739  2.087  2.783  0.348  2.783  0.0
Arg  3.13   0.696  3.826  6.609  3.13   3.13   2.087  2.087  4.174  2.435  1.391  2.435  3.13   1.043  3.826  2.435  2.087  4.174  1.043  0.696  0.0
Ser  2.783  0.696  4.522  5.565  4.87   3.478  0.348  4.174  1.739  5.217  2.783  4.174  3.478  2.087  3.478  5.565  2.435  5.217  1.391  2.783  0.0
Thr  5.565  1.043  2.435  3.826  2.087  3.13   1.043  2.435  1.391  6.261  2.783  4.522  2.087  1.391  2.087  5.565  3.478  6.261  0.0    3.478  0.0
Val  5.217  1.391  3.478  4.522  2.783  4.522  1.043  4.87   5.565  7.304  3.13   4.174  3.13   5.217  5.565  5.565  4.87   9.391  0.348  3.826  0.0
Trp  1.043  0.696  0.696  1.043  0.696  2.087  0.0    1.391  1.043  1.391  0.348  1.043  0.0    1.043  1.043  1.739  0.696  0.348  0.696  0.348  0.0
Tyr  2.087  0.348  1.739  2.435  0.696  2.783  0.696  1.391  2.783  4.522  1.739  1.391  2.087  0.348  3.478  2.087  2.783  3.478  1.043  2.087  0.0
Xaa  0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (2876 amino acids)
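Because the statistics come from a single protein of 2876 residues, the per mille values can be turned back into approximate dipeptide counts. The sketch below assumes overlapping pairs along one chain, which gives 2875 dipeptides; this is consistent with the smallest non-zero table value of 0.348‰ (about 1/2875), but it remains an assumption rather than a documented method. The name `dipeptide_count` is hypothetical.

```python
def dipeptide_count(permille: float, n_residues: int) -> int:
    """Estimate the raw count behind a per mille value.

    Assumes a single chain read as overlapping pairs,
    so n_residues - 1 dipeptides in total.
    """
    n_pairs = n_residues - 1
    return round(permille / 1000.0 * n_pairs)


print(dipeptide_count(9.391, 2876))  # ValVal -> 27
print(dipeptide_count(6.609, 2876))  # AlaAla -> 19
print(dipeptide_count(0.348, 2876))  # smallest non-zero entry -> 1
```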

Note: The error has been estimated by bootstrapping (×100) at the protein level.
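One way such a protein-level bootstrap might look is sketched below: whole proteins are resampled with replacement, the pooled frequency of a dipeptide is recomputed for each replicate, and the spread of the replicates is taken as the error. The function names, the use of the population standard deviation, and the fixed seed are all assumptions; the page does not describe the actual procedure. Under this sketch, a proteome with a single protein always resamples the same sequence, so the error is 0.0, which matches the ± 0.0 reported here.

```python
import random
import statistics


def pair_permille(proteins: list[str], pair: str) -> float:
    """Pooled per mille frequency of one dipeptide over several proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins: list[str], pair: str,
                    replicates: int = 100, seed: int = 0) -> float:
    """Spread of the frequency when proteins are resampled with replacement."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy data, not the virus proteome: one protein gives zero spread.
print(bootstrap_error(["MAAVLA"], "AA"))
```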

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski