Amino acid dipepetide frequency for Jingmen picorna-like virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.943AlaAla: 4.943 ± 0.0
0.618AlaCys: 0.618 ± 0.0
3.398AlaAsp: 3.398 ± 0.0
5.252AlaGlu: 5.252 ± 0.0
3.089AlaPhe: 3.089 ± 0.0
5.561AlaGly: 5.561 ± 0.0
2.162AlaHis: 2.162 ± 0.0
4.634AlaIle: 4.634 ± 0.0
2.78AlaLys: 2.78 ± 0.0
4.634AlaLeu: 4.634 ± 0.0
1.545AlaMet: 1.545 ± 0.0
1.854AlaAsn: 1.854 ± 0.0
3.707AlaPro: 3.707 ± 0.0
2.162AlaGln: 2.162 ± 0.0
3.707AlaArg: 3.707 ± 0.0
3.398AlaSer: 3.398 ± 0.0
3.398AlaThr: 3.398 ± 0.0
7.414AlaVal: 7.414 ± 0.0
0.309AlaTrp: 0.309 ± 0.0
2.162AlaTyr: 2.162 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.309CysAla: 0.309 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.927CysAsp: 0.927 ± 0.0
0.309CysGlu: 0.309 ± 0.0
0.927CysPhe: 0.927 ± 0.0
1.545CysGly: 1.545 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.927CysIle: 0.927 ± 0.0
1.545CysLys: 1.545 ± 0.0
2.162CysLeu: 2.162 ± 0.0
0.309CysMet: 0.309 ± 0.0
0.309CysAsn: 0.309 ± 0.0
0.927CysPro: 0.927 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.309CysArg: 0.309 ± 0.0
1.545CysSer: 1.545 ± 0.0
0.927CysThr: 0.927 ± 0.0
1.545CysVal: 1.545 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.309CysTyr: 0.309 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.943AspAla: 4.943 ± 0.0
1.545AspCys: 1.545 ± 0.0
4.943AspAsp: 4.943 ± 0.0
3.398AspGlu: 3.398 ± 0.0
2.162AspPhe: 2.162 ± 0.0
4.325AspGly: 4.325 ± 0.0
1.545AspHis: 1.545 ± 0.0
2.162AspIle: 2.162 ± 0.0
4.634AspLys: 4.634 ± 0.0
7.414AspLeu: 7.414 ± 0.0
1.236AspMet: 1.236 ± 0.0
1.545AspAsn: 1.545 ± 0.0
4.016AspPro: 4.016 ± 0.0
3.398AspGln: 3.398 ± 0.0
1.854AspArg: 1.854 ± 0.0
4.325AspSer: 4.325 ± 0.0
0.927AspThr: 0.927 ± 0.0
8.341AspVal: 8.341 ± 0.0
0.309AspTrp: 0.309 ± 0.0
3.398AspTyr: 3.398 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.016GluAla: 4.016 ± 0.0
0.0GluCys: 0.0 ± 0.0
4.634GluAsp: 4.634 ± 0.0
3.089GluGlu: 3.089 ± 0.0
4.325GluPhe: 4.325 ± 0.0
3.089GluGly: 3.089 ± 0.0
1.854GluHis: 1.854 ± 0.0
3.398GluIle: 3.398 ± 0.0
5.561GluLys: 5.561 ± 0.0
4.016GluLeu: 4.016 ± 0.0
2.78GluMet: 2.78 ± 0.0
1.854GluAsn: 1.854 ± 0.0
2.471GluPro: 2.471 ± 0.0
1.854GluGln: 1.854 ± 0.0
1.854GluArg: 1.854 ± 0.0
2.78GluSer: 2.78 ± 0.0
2.78GluThr: 2.78 ± 0.0
4.943GluVal: 4.943 ± 0.0
2.162GluTrp: 2.162 ± 0.0
2.471GluTyr: 2.471 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.162PheAla: 2.162 ± 0.0
1.545PheCys: 1.545 ± 0.0
4.634PheAsp: 4.634 ± 0.0
4.325PheGlu: 4.325 ± 0.0
1.854PhePhe: 1.854 ± 0.0
3.398PheGly: 3.398 ± 0.0
1.854PheHis: 1.854 ± 0.0
1.236PheIle: 1.236 ± 0.0
1.854PheLys: 1.854 ± 0.0
5.252PheLeu: 5.252 ± 0.0
0.618PheMet: 0.618 ± 0.0
3.089PheAsn: 3.089 ± 0.0
2.162PhePro: 2.162 ± 0.0
2.162PheGln: 2.162 ± 0.0
3.398PheArg: 3.398 ± 0.0
3.398PheSer: 3.398 ± 0.0
1.854PheThr: 1.854 ± 0.0
4.634PheVal: 4.634 ± 0.0
0.309PheTrp: 0.309 ± 0.0
2.162PheTyr: 2.162 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.325GlyAla: 4.325 ± 0.0
1.854GlyCys: 1.854 ± 0.0
5.561GlyAsp: 5.561 ± 0.0
3.398GlyGlu: 3.398 ± 0.0
3.398GlyPhe: 3.398 ± 0.0
3.707GlyGly: 3.707 ± 0.0
0.309GlyHis: 0.309 ± 0.0
6.179GlyIle: 6.179 ± 0.0
4.325GlyLys: 4.325 ± 0.0
3.398GlyLeu: 3.398 ± 0.0
1.236GlyMet: 1.236 ± 0.0
2.162GlyAsn: 2.162 ± 0.0
1.854GlyPro: 1.854 ± 0.0
2.471GlyGln: 2.471 ± 0.0
5.252GlyArg: 5.252 ± 0.0
2.471GlySer: 2.471 ± 0.0
4.943GlyThr: 4.943 ± 0.0
5.561GlyVal: 5.561 ± 0.0
1.545GlyTrp: 1.545 ± 0.0
4.016GlyTyr: 4.016 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.545HisAla: 1.545 ± 0.0
0.927HisCys: 0.927 ± 0.0
0.927HisAsp: 0.927 ± 0.0
1.236HisGlu: 1.236 ± 0.0
0.927HisPhe: 0.927 ± 0.0
1.545HisGly: 1.545 ± 0.0
0.927HisHis: 0.927 ± 0.0
1.545HisIle: 1.545 ± 0.0
0.0HisLys: 0.0 ± 0.0
2.471HisLeu: 2.471 ± 0.0
0.309HisMet: 0.309 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.927HisPro: 0.927 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
2.162HisSer: 2.162 ± 0.0
0.927HisThr: 0.927 ± 0.0
1.854HisVal: 1.854 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.927HisTyr: 0.927 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.707IleAla: 3.707 ± 0.0
0.309IleCys: 0.309 ± 0.0
4.016IleAsp: 4.016 ± 0.0
1.854IleGlu: 1.854 ± 0.0
0.927IlePhe: 0.927 ± 0.0
4.634IleGly: 4.634 ± 0.0
1.236IleHis: 1.236 ± 0.0
1.854IleIle: 1.854 ± 0.0
2.78IleLys: 2.78 ± 0.0
4.634IleLeu: 4.634 ± 0.0
0.618IleMet: 0.618 ± 0.0
2.162IleAsn: 2.162 ± 0.0
1.854IlePro: 1.854 ± 0.0
1.854IleGln: 1.854 ± 0.0
0.927IleArg: 0.927 ± 0.0
4.325IleSer: 4.325 ± 0.0
2.471IleThr: 2.471 ± 0.0
4.016IleVal: 4.016 ± 0.0
0.927IleTrp: 0.927 ± 0.0
3.398IleTyr: 3.398 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.471LysAla: 2.471 ± 0.0
0.618LysCys: 0.618 ± 0.0
3.398LysAsp: 3.398 ± 0.0
3.089LysGlu: 3.089 ± 0.0
4.634LysPhe: 4.634 ± 0.0
2.78LysGly: 2.78 ± 0.0
1.236LysHis: 1.236 ± 0.0
2.78LysIle: 2.78 ± 0.0
5.87LysLys: 5.87 ± 0.0
4.634LysLeu: 4.634 ± 0.0
0.309LysMet: 0.309 ± 0.0
3.398LysAsn: 3.398 ± 0.0
1.854LysPro: 1.854 ± 0.0
1.854LysGln: 1.854 ± 0.0
3.398LysArg: 3.398 ± 0.0
2.471LysSer: 2.471 ± 0.0
4.325LysThr: 4.325 ± 0.0
4.634LysVal: 4.634 ± 0.0
0.927LysTrp: 0.927 ± 0.0
1.236LysTyr: 1.236 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.796LeuAla: 6.796 ± 0.0
1.236LeuCys: 1.236 ± 0.0
5.252LeuAsp: 5.252 ± 0.0
6.179LeuGlu: 6.179 ± 0.0
2.78LeuPhe: 2.78 ± 0.0
4.016LeuGly: 4.016 ± 0.0
1.854LeuHis: 1.854 ± 0.0
3.089LeuIle: 3.089 ± 0.0
2.78LeuLys: 2.78 ± 0.0
5.87LeuLeu: 5.87 ± 0.0
2.471LeuMet: 2.471 ± 0.0
3.089LeuAsn: 3.089 ± 0.0
4.325LeuPro: 4.325 ± 0.0
3.707LeuGln: 3.707 ± 0.0
4.634LeuArg: 4.634 ± 0.0
7.414LeuSer: 7.414 ± 0.0
4.016LeuThr: 4.016 ± 0.0
6.179LeuVal: 6.179 ± 0.0
1.236LeuTrp: 1.236 ± 0.0
2.471LeuTyr: 2.471 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.471MetAla: 2.471 ± 0.0
0.0MetCys: 0.0 ± 0.0
1.545MetAsp: 1.545 ± 0.0
1.236MetGlu: 1.236 ± 0.0
1.236MetPhe: 1.236 ± 0.0
0.927MetGly: 0.927 ± 0.0
0.309MetHis: 0.309 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.545MetLys: 1.545 ± 0.0
1.236MetLeu: 1.236 ± 0.0
0.927MetMet: 0.927 ± 0.0
2.78MetAsn: 2.78 ± 0.0
1.236MetPro: 1.236 ± 0.0
1.236MetGln: 1.236 ± 0.0
2.78MetArg: 2.78 ± 0.0
2.162MetSer: 2.162 ± 0.0
2.162MetThr: 2.162 ± 0.0
1.236MetVal: 1.236 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.236MetTyr: 1.236 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.707AsnAla: 3.707 ± 0.0
0.0AsnCys: 0.0 ± 0.0
3.089AsnAsp: 3.089 ± 0.0
0.927AsnGlu: 0.927 ± 0.0
2.162AsnPhe: 2.162 ± 0.0
3.089AsnGly: 3.089 ± 0.0
0.927AsnHis: 0.927 ± 0.0
2.78AsnIle: 2.78 ± 0.0
0.309AsnLys: 0.309 ± 0.0
4.016AsnLeu: 4.016 ± 0.0
0.309AsnMet: 0.309 ± 0.0
2.471AsnAsn: 2.471 ± 0.0
1.236AsnPro: 1.236 ± 0.0
1.545AsnGln: 1.545 ± 0.0
1.545AsnArg: 1.545 ± 0.0
4.634AsnSer: 4.634 ± 0.0
1.854AsnThr: 1.854 ± 0.0
2.78AsnVal: 2.78 ± 0.0
0.309AsnTrp: 0.309 ± 0.0
2.471AsnTyr: 2.471 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.398ProAla: 3.398 ± 0.0
0.0ProCys: 0.0 ± 0.0
3.089ProAsp: 3.089 ± 0.0
2.78ProGlu: 2.78 ± 0.0
3.089ProPhe: 3.089 ± 0.0
3.089ProGly: 3.089 ± 0.0
0.927ProHis: 0.927 ± 0.0
2.471ProIle: 2.471 ± 0.0
2.471ProLys: 2.471 ± 0.0
5.252ProLeu: 5.252 ± 0.0
0.618ProMet: 0.618 ± 0.0
1.236ProAsn: 1.236 ± 0.0
2.162ProPro: 2.162 ± 0.0
0.927ProGln: 0.927 ± 0.0
2.471ProArg: 2.471 ± 0.0
1.854ProSer: 1.854 ± 0.0
4.325ProThr: 4.325 ± 0.0
3.398ProVal: 3.398 ± 0.0
0.309ProTrp: 0.309 ± 0.0
1.854ProTyr: 1.854 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.162GlnAla: 2.162 ± 0.0
0.0GlnCys: 0.0 ± 0.0
1.854GlnAsp: 1.854 ± 0.0
1.236GlnGlu: 1.236 ± 0.0
3.398GlnPhe: 3.398 ± 0.0
2.78GlnGly: 2.78 ± 0.0
1.236GlnHis: 1.236 ± 0.0
0.927GlnIle: 0.927 ± 0.0
5.252GlnLys: 5.252 ± 0.0
1.236GlnLeu: 1.236 ± 0.0
0.618GlnMet: 0.618 ± 0.0
1.854GlnAsn: 1.854 ± 0.0
0.309GlnPro: 0.309 ± 0.0
0.927GlnGln: 0.927 ± 0.0
1.236GlnArg: 1.236 ± 0.0
3.089GlnSer: 3.089 ± 0.0
3.398GlnThr: 3.398 ± 0.0
3.398GlnVal: 3.398 ± 0.0
0.927GlnTrp: 0.927 ± 0.0
2.78GlnTyr: 2.78 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.398ArgAla: 3.398 ± 0.0
0.618ArgCys: 0.618 ± 0.0
2.162ArgAsp: 2.162 ± 0.0
2.78ArgGlu: 2.78 ± 0.0
2.162ArgPhe: 2.162 ± 0.0
3.089ArgGly: 3.089 ± 0.0
0.309ArgHis: 0.309 ± 0.0
2.471ArgIle: 2.471 ± 0.0
3.089ArgLys: 3.089 ± 0.0
2.78ArgLeu: 2.78 ± 0.0
2.162ArgMet: 2.162 ± 0.0
2.471ArgAsn: 2.471 ± 0.0
1.854ArgPro: 1.854 ± 0.0
2.162ArgGln: 2.162 ± 0.0
2.162ArgArg: 2.162 ± 0.0
3.707ArgSer: 3.707 ± 0.0
2.162ArgThr: 2.162 ± 0.0
4.943ArgVal: 4.943 ± 0.0
0.618ArgTrp: 0.618 ± 0.0
1.854ArgTyr: 1.854 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.016SerAla: 4.016 ± 0.0
0.927SerCys: 0.927 ± 0.0
4.634SerAsp: 4.634 ± 0.0
5.561SerGlu: 5.561 ± 0.0
4.325SerPhe: 4.325 ± 0.0
6.487SerGly: 6.487 ± 0.0
0.618SerHis: 0.618 ± 0.0
4.325SerIle: 4.325 ± 0.0
3.707SerLys: 3.707 ± 0.0
5.252SerLeu: 5.252 ± 0.0
3.398SerMet: 3.398 ± 0.0
1.545SerAsn: 1.545 ± 0.0
3.707SerPro: 3.707 ± 0.0
2.162SerGln: 2.162 ± 0.0
1.854SerArg: 1.854 ± 0.0
4.325SerSer: 4.325 ± 0.0
2.471SerThr: 2.471 ± 0.0
7.414SerVal: 7.414 ± 0.0
1.236SerTrp: 1.236 ± 0.0
3.089SerTyr: 3.089 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
1.854ThrAla: 1.854 ± 0.0
0.309ThrCys: 0.309 ± 0.0
2.78ThrAsp: 2.78 ± 0.0
3.089ThrGlu: 3.089 ± 0.0
2.78ThrPhe: 2.78 ± 0.0
5.561ThrGly: 5.561 ± 0.0
0.618ThrHis: 0.618 ± 0.0
4.016ThrIle: 4.016 ± 0.0
1.545ThrLys: 1.545 ± 0.0
4.634ThrLeu: 4.634 ± 0.0
1.545ThrMet: 1.545 ± 0.0
2.471ThrAsn: 2.471 ± 0.0
2.162ThrPro: 2.162 ± 0.0
1.854ThrGln: 1.854 ± 0.0
2.471ThrArg: 2.471 ± 0.0
4.943ThrSer: 4.943 ± 0.0
2.78ThrThr: 2.78 ± 0.0
4.943ThrVal: 4.943 ± 0.0
1.545ThrTrp: 1.545 ± 0.0
2.471ThrTyr: 2.471 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
8.032ValAla: 8.032 ± 0.0
1.545ValCys: 1.545 ± 0.0
6.179ValAsp: 6.179 ± 0.0
7.414ValGlu: 7.414 ± 0.0
4.634ValPhe: 4.634 ± 0.0
5.561ValGly: 5.561 ± 0.0
0.618ValHis: 0.618 ± 0.0
1.545ValIle: 1.545 ± 0.0
4.016ValLys: 4.016 ± 0.0
5.561ValLeu: 5.561 ± 0.0
2.471ValMet: 2.471 ± 0.0
2.471ValAsn: 2.471 ± 0.0
5.87ValPro: 5.87 ± 0.0
4.943ValGln: 4.943 ± 0.0
3.707ValArg: 3.707 ± 0.0
7.414ValSer: 7.414 ± 0.0
4.943ValThr: 4.943 ± 0.0
6.179ValVal: 6.179 ± 0.0
0.0ValTrp: 0.0 ± 0.0
4.016ValTyr: 4.016 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.309TrpAla: 0.309 ± 0.0
0.927TrpCys: 0.927 ± 0.0
0.618TrpAsp: 0.618 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.309TrpPhe: 0.309 ± 0.0
0.309TrpGly: 0.309 ± 0.0
0.309TrpHis: 0.309 ± 0.0
0.927TrpIle: 0.927 ± 0.0
0.927TrpLys: 0.927 ± 0.0
1.545TrpLeu: 1.545 ± 0.0
0.618TrpMet: 0.618 ± 0.0
2.162TrpAsn: 2.162 ± 0.0
0.309TrpPro: 0.309 ± 0.0
0.618TrpGln: 0.618 ± 0.0
0.309TrpArg: 0.309 ± 0.0
1.236TrpSer: 1.236 ± 0.0
0.618TrpThr: 0.618 ± 0.0
0.927TrpVal: 0.927 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.927TrpTyr: 0.927 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.162TyrAla: 2.162 ± 0.0
1.854TyrCys: 1.854 ± 0.0
2.78TyrAsp: 2.78 ± 0.0
2.78TyrGlu: 2.78 ± 0.0
2.78TyrPhe: 2.78 ± 0.0
2.471TyrGly: 2.471 ± 0.0
0.309TyrHis: 0.309 ± 0.0
1.236TyrIle: 1.236 ± 0.0
0.927TyrLys: 0.927 ± 0.0
3.398TyrLeu: 3.398 ± 0.0
1.854TyrMet: 1.854 ± 0.0
1.236TyrAsn: 1.236 ± 0.0
2.78TyrPro: 2.78 ± 0.0
2.78TyrGln: 2.78 ± 0.0
3.089TyrArg: 3.089 ± 0.0
3.398TyrSer: 3.398 ± 0.0
3.089TyrThr: 3.089 ± 0.0
2.78TyrVal: 2.78 ± 0.0
1.545TyrTrp: 1.545 ± 0.0
1.854TyrTyr: 1.854 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3238 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski