Amino acid dipeptide frequency for Corey virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
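As a rough illustration of the calculation described above, the following Python sketch counts overlapping residue pairs and reports them in per mille. The function name dipeptide_per_mille, the use of one-letter residue codes, the toy sequence, and the choice not to count pairs across protein boundaries are all assumptions made for this example; the page does not state how Proteome-pI implements the computation.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Hypothetical helper: pairs are counted within each sequence only,
    never across the boundary between two proteins (an assumption).
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs at positions (0,1), (1,2), ..., (n-2,n-1)
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequence in one-letter code, not taken from the proteome
freqs = dipeptide_per_mille(["AAKAA"])

# Uniform expectation for the 400 standard dipeptides: 2.5 per mille (0.25%)
uniform_per_mille = 1000.0 / 400

# Dipeptides that exceed the uniform expectation in the toy data
overrepresented = sorted(p for p, f in freqs.items() if f > uniform_per_mille)
```

Under the same assumption, a protein of 2830 residues contains 2829 overlapping pairs, so a dipeptide seen exactly once would come out at 1000/2829 ≈ 0.353‰, which matches the smallest non-zero value in the table.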

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second (for example, row Ala, column Cys is AlaCys). Every entry has an estimated error of ± 0.0.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  7.777  0.353  3.181  5.302  3.535  2.828   1.06  8.484  6.009  6.363   1.06  2.121  5.302  2.121  5.656  3.181  6.363  2.474    0.0  1.767    0.0
Cys  2.828    0.0    0.0   1.06  0.353   1.06   1.06  0.353  0.353  0.707    0.0  0.353    0.0   1.06  0.707  0.707  0.707  0.353    0.0  0.707    0.0
Asp  3.181  0.707  2.474   7.07  4.949  4.242   1.06  3.535  4.242  4.242  1.414  2.121  1.414  3.181   1.06  1.767  1.414  3.535  1.767  2.121    0.0
Glu  3.181  1.767  5.302  8.484  2.474  3.181  0.707  4.242  4.242  6.716   1.06  2.474  2.474  3.888  5.656  2.121  3.181  4.242  2.828  2.474    0.0
Phe  2.474    0.0  4.242  3.535  1.767  2.828  2.828  4.595  3.181  4.242  0.353  1.767  2.474  1.767  2.121  3.535  0.707  2.474  0.707  2.121    0.0
Gly  4.595  0.707  3.888  6.363  2.474  3.535  0.707  4.242  5.302  4.242  0.353  2.121  2.474  2.121   1.06  3.535  4.595  6.716  0.353  3.181    0.0
His  1.414    0.0   1.06  0.707  2.121  1.767  0.353  0.353   1.06  2.121  0.353  0.707  0.707  2.121  0.707  1.414  1.414  0.707  0.353  2.121    0.0
Ile  5.302  0.353  4.242  4.949  3.535  2.828  1.767  2.828  3.181  4.595   1.06  6.009  3.181  1.767  3.888  3.888  5.656  4.595  0.353  1.767    0.0
Lys  3.535  0.353  3.888  6.363  3.181  3.888  2.121  1.767  5.302  5.656  1.767  2.474  3.181  1.414  3.535  4.595  5.656  6.009   1.06  3.888    0.0
Leu  7.777  2.121  3.535  4.949  1.767  8.484  1.414  3.888  8.484  5.656  1.414  3.888  4.242  2.474  3.535  3.181  5.656  4.242  0.353  2.828    0.0
Met  2.121  0.707  2.474   1.06   1.06    0.0  1.414  0.707  2.474    0.0  0.353   1.06  1.767  0.353  0.353  1.414  1.414    0.0    0.0  0.707    0.0
Asn  1.767  0.353   1.06  1.767  1.767  3.181  0.707  3.535  1.767  4.949  1.767  1.767  5.302   1.06  1.414  1.414  3.181  3.535  1.414  2.474    0.0
Pro  2.121    0.0  3.181  2.474  1.414  4.242   1.06  4.242  2.121  6.009  0.353  1.767  1.414  1.767  2.828  2.828  4.242  3.535  1.414  1.414    0.0
Gln  3.181    0.0   1.06  1.767   1.06  3.888  1.414  2.121  2.828  5.302  2.828  2.474  1.767  2.474  1.414   1.06  2.474  2.121  0.707  0.707    0.0
Arg  2.474  1.414  2.474  3.181  4.949  3.181  0.707  2.828  4.242  4.949  0.707  0.707  2.121  1.414  3.181  4.242  3.181   1.06  0.353  1.767    0.0
Ser  4.242  0.707  1.414  3.535  2.121  2.474  0.353  4.595  4.595  3.181   1.06  4.949  2.121  1.767  2.121  2.474  4.949  3.888   1.06  1.767    0.0
Thr   7.07  0.353  3.888  2.121  2.474  4.595  0.353  3.535  2.474  5.656  1.414  2.828  3.888  2.828  3.181  6.009  4.595  4.595  1.767  2.474    0.0
Val  5.302  1.767  5.302  3.888  3.888  3.888  0.707  4.595  3.535  3.181  0.707  2.828  3.181  2.474  2.828  2.474  3.888  4.242  2.121   1.06    0.0
Trp  1.767  0.353  0.707   1.06  0.707  0.353  0.353  1.767  2.828    0.0    0.0  0.353  0.353  2.121   1.06  1.767   1.06   1.06  0.353  0.707    0.0
Tyr  2.828  0.353  2.121   1.06  2.121  2.121  1.414  3.535  1.414  2.121   1.06  1.414   1.06  2.474  2.828  1.767  1.767  2.828  1.767  1.414    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (2830 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
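One way to picture a protein-level bootstrap is sketched below in Python. This is only an assumed reading of the note: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. The function names, the use of the population standard deviation, the fixed seed, and the toy sequences are all hypothetical and not taken from Proteome-pI.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)

# With a single protein, every resample is the same protein again
single_protein_error = bootstrap_error(["MKAAAL"], "AA")

# With several different proteins, the resamples differ from each other
multi_protein_error = bootstrap_error(["AAAA", "KKKK", "AKAK"], "AA")
```

Under this reading, a proteome consisting of one protein always yields identical resamples, so the estimated error is zero, which is consistent with the ± 0.0 reported for every dipeptide here.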

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski