Amino acid dipepetide frequency for Shahe picorna-like virus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.044AlaAla: 6.044 ± 0.0
0.336AlaCys: 0.336 ± 0.0
3.358AlaAsp: 3.358 ± 0.0
3.022AlaGlu: 3.022 ± 0.0
3.694AlaPhe: 3.694 ± 0.0
3.694AlaGly: 3.694 ± 0.0
1.679AlaHis: 1.679 ± 0.0
5.709AlaIle: 5.709 ± 0.0
2.351AlaLys: 2.351 ± 0.0
6.044AlaLeu: 6.044 ± 0.0
3.358AlaMet: 3.358 ± 0.0
3.022AlaAsn: 3.022 ± 0.0
3.022AlaPro: 3.022 ± 0.0
1.679AlaGln: 1.679 ± 0.0
2.686AlaArg: 2.686 ± 0.0
5.709AlaSer: 5.709 ± 0.0
4.701AlaThr: 4.701 ± 0.0
5.037AlaVal: 5.037 ± 0.0
0.336AlaTrp: 0.336 ± 0.0
3.022AlaTyr: 3.022 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.007CysAla: 1.007 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.336CysAsp: 0.336 ± 0.0
0.672CysGlu: 0.672 ± 0.0
0.0CysPhe: 0.0 ± 0.0
2.351CysGly: 2.351 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.672CysLys: 0.672 ± 0.0
0.336CysLeu: 0.336 ± 0.0
0.336CysMet: 0.336 ± 0.0
0.336CysAsn: 0.336 ± 0.0
1.007CysPro: 1.007 ± 0.0
1.007CysGln: 1.007 ± 0.0
0.672CysArg: 0.672 ± 0.0
0.672CysSer: 0.672 ± 0.0
1.679CysThr: 1.679 ± 0.0
1.679CysVal: 1.679 ± 0.0
0.336CysTrp: 0.336 ± 0.0
1.343CysTyr: 1.343 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.351AspAla: 2.351 ± 0.0
0.672AspCys: 0.672 ± 0.0
3.694AspAsp: 3.694 ± 0.0
5.037AspGlu: 5.037 ± 0.0
2.351AspPhe: 2.351 ± 0.0
2.015AspGly: 2.015 ± 0.0
1.007AspHis: 1.007 ± 0.0
5.037AspIle: 5.037 ± 0.0
4.03AspLys: 4.03 ± 0.0
3.694AspLeu: 3.694 ± 0.0
1.343AspMet: 1.343 ± 0.0
2.015AspAsn: 2.015 ± 0.0
2.351AspPro: 2.351 ± 0.0
1.343AspGln: 1.343 ± 0.0
2.351AspArg: 2.351 ± 0.0
3.022AspSer: 3.022 ± 0.0
3.022AspThr: 3.022 ± 0.0
3.022AspVal: 3.022 ± 0.0
1.679AspTrp: 1.679 ± 0.0
2.686AspTyr: 2.686 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.686GluAla: 2.686 ± 0.0
0.336GluCys: 0.336 ± 0.0
4.03GluAsp: 4.03 ± 0.0
3.694GluGlu: 3.694 ± 0.0
1.679GluPhe: 1.679 ± 0.0
1.679GluGly: 1.679 ± 0.0
1.679GluHis: 1.679 ± 0.0
5.709GluIle: 5.709 ± 0.0
3.022GluLys: 3.022 ± 0.0
4.03GluLeu: 4.03 ± 0.0
1.007GluMet: 1.007 ± 0.0
3.694GluAsn: 3.694 ± 0.0
2.351GluPro: 2.351 ± 0.0
0.336GluGln: 0.336 ± 0.0
3.022GluArg: 3.022 ± 0.0
3.022GluSer: 3.022 ± 0.0
3.694GluThr: 3.694 ± 0.0
3.022GluVal: 3.022 ± 0.0
1.343GluTrp: 1.343 ± 0.0
3.694GluTyr: 3.694 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
3.022PheAla: 3.022 ± 0.0
1.679PheCys: 1.679 ± 0.0
1.679PheAsp: 1.679 ± 0.0
3.358PheGlu: 3.358 ± 0.0
3.694PhePhe: 3.694 ± 0.0
1.343PheGly: 1.343 ± 0.0
2.015PheHis: 2.015 ± 0.0
3.358PheIle: 3.358 ± 0.0
2.015PheLys: 2.015 ± 0.0
3.022PheLeu: 3.022 ± 0.0
2.015PheMet: 2.015 ± 0.0
3.022PheAsn: 3.022 ± 0.0
1.343PhePro: 1.343 ± 0.0
1.343PheGln: 1.343 ± 0.0
3.358PheArg: 3.358 ± 0.0
3.694PheSer: 3.694 ± 0.0
2.686PheThr: 2.686 ± 0.0
3.694PheVal: 3.694 ± 0.0
0.672PheTrp: 0.672 ± 0.0
2.015PheTyr: 2.015 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
1.679GlyAla: 1.679 ± 0.0
1.007GlyCys: 1.007 ± 0.0
3.022GlyAsp: 3.022 ± 0.0
2.686GlyGlu: 2.686 ± 0.0
3.358GlyPhe: 3.358 ± 0.0
3.694GlyGly: 3.694 ± 0.0
0.672GlyHis: 0.672 ± 0.0
3.694GlyIle: 3.694 ± 0.0
3.022GlyLys: 3.022 ± 0.0
3.358GlyLeu: 3.358 ± 0.0
2.351GlyMet: 2.351 ± 0.0
3.022GlyAsn: 3.022 ± 0.0
3.358GlyPro: 3.358 ± 0.0
1.343GlyGln: 1.343 ± 0.0
3.358GlyArg: 3.358 ± 0.0
6.044GlySer: 6.044 ± 0.0
5.373GlyThr: 5.373 ± 0.0
5.037GlyVal: 5.037 ± 0.0
1.343GlyTrp: 1.343 ± 0.0
0.672GlyTyr: 0.672 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.007HisAla: 1.007 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.336HisAsp: 0.336 ± 0.0
0.336HisGlu: 0.336 ± 0.0
2.015HisPhe: 2.015 ± 0.0
1.343HisGly: 1.343 ± 0.0
0.336HisHis: 0.336 ± 0.0
1.343HisIle: 1.343 ± 0.0
0.672HisLys: 0.672 ± 0.0
2.015HisLeu: 2.015 ± 0.0
1.343HisMet: 1.343 ± 0.0
1.343HisAsn: 1.343 ± 0.0
2.351HisPro: 2.351 ± 0.0
0.0HisGln: 0.0 ± 0.0
1.007HisArg: 1.007 ± 0.0
0.672HisSer: 0.672 ± 0.0
0.672HisThr: 0.672 ± 0.0
0.672HisVal: 0.672 ± 0.0
0.0HisTrp: 0.0 ± 0.0
2.351HisTyr: 2.351 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.358IleAla: 3.358 ± 0.0
0.336IleCys: 0.336 ± 0.0
3.694IleAsp: 3.694 ± 0.0
2.686IleGlu: 2.686 ± 0.0
2.351IlePhe: 2.351 ± 0.0
4.03IleGly: 4.03 ± 0.0
1.679IleHis: 1.679 ± 0.0
1.679IleIle: 1.679 ± 0.0
4.701IleLys: 4.701 ± 0.0
2.686IleLeu: 2.686 ± 0.0
2.351IleMet: 2.351 ± 0.0
5.373IleAsn: 5.373 ± 0.0
4.365IlePro: 4.365 ± 0.0
1.679IleGln: 1.679 ± 0.0
3.694IleArg: 3.694 ± 0.0
6.716IleSer: 6.716 ± 0.0
5.373IleThr: 5.373 ± 0.0
3.358IleVal: 3.358 ± 0.0
2.015IleTrp: 2.015 ± 0.0
2.015IleTyr: 2.015 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.022LysAla: 3.022 ± 0.0
1.007LysCys: 1.007 ± 0.0
1.343LysAsp: 1.343 ± 0.0
3.022LysGlu: 3.022 ± 0.0
1.679LysPhe: 1.679 ± 0.0
2.351LysGly: 2.351 ± 0.0
0.672LysHis: 0.672 ± 0.0
4.701LysIle: 4.701 ± 0.0
3.358LysLys: 3.358 ± 0.0
2.351LysLeu: 2.351 ± 0.0
0.336LysMet: 0.336 ± 0.0
2.686LysAsn: 2.686 ± 0.0
2.351LysPro: 2.351 ± 0.0
0.336LysGln: 0.336 ± 0.0
3.022LysArg: 3.022 ± 0.0
4.365LysSer: 4.365 ± 0.0
2.351LysThr: 2.351 ± 0.0
4.03LysVal: 4.03 ± 0.0
0.336LysTrp: 0.336 ± 0.0
3.022LysTyr: 3.022 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
8.059LeuAla: 8.059 ± 0.0
1.343LeuCys: 1.343 ± 0.0
3.358LeuAsp: 3.358 ± 0.0
4.701LeuGlu: 4.701 ± 0.0
3.694LeuPhe: 3.694 ± 0.0
4.03LeuGly: 4.03 ± 0.0
2.351LeuHis: 2.351 ± 0.0
3.358LeuIle: 3.358 ± 0.0
3.358LeuLys: 3.358 ± 0.0
7.388LeuLeu: 7.388 ± 0.0
1.679LeuMet: 1.679 ± 0.0
5.037LeuAsn: 5.037 ± 0.0
4.03LeuPro: 4.03 ± 0.0
2.015LeuGln: 2.015 ± 0.0
2.686LeuArg: 2.686 ± 0.0
6.716LeuSer: 6.716 ± 0.0
7.388LeuThr: 7.388 ± 0.0
3.022LeuVal: 3.022 ± 0.0
2.015LeuTrp: 2.015 ± 0.0
3.358LeuTyr: 3.358 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
3.022MetAla: 3.022 ± 0.0
0.336MetCys: 0.336 ± 0.0
2.015MetAsp: 2.015 ± 0.0
1.679MetGlu: 1.679 ± 0.0
0.672MetPhe: 0.672 ± 0.0
3.022MetGly: 3.022 ± 0.0
0.672MetHis: 0.672 ± 0.0
1.679MetIle: 1.679 ± 0.0
1.343MetLys: 1.343 ± 0.0
3.694MetLeu: 3.694 ± 0.0
0.672MetMet: 0.672 ± 0.0
2.015MetAsn: 2.015 ± 0.0
2.015MetPro: 2.015 ± 0.0
0.336MetGln: 0.336 ± 0.0
2.351MetArg: 2.351 ± 0.0
2.351MetSer: 2.351 ± 0.0
2.015MetThr: 2.015 ± 0.0
0.336MetVal: 0.336 ± 0.0
0.336MetTrp: 0.336 ± 0.0
0.672MetTyr: 0.672 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.694AsnAla: 3.694 ± 0.0
1.007AsnCys: 1.007 ± 0.0
2.015AsnAsp: 2.015 ± 0.0
3.022AsnGlu: 3.022 ± 0.0
1.679AsnPhe: 1.679 ± 0.0
4.03AsnGly: 4.03 ± 0.0
1.007AsnHis: 1.007 ± 0.0
5.037AsnIle: 5.037 ± 0.0
1.679AsnLys: 1.679 ± 0.0
6.044AsnLeu: 6.044 ± 0.0
3.358AsnMet: 3.358 ± 0.0
3.022AsnAsn: 3.022 ± 0.0
3.694AsnPro: 3.694 ± 0.0
4.03AsnGln: 4.03 ± 0.0
1.343AsnArg: 1.343 ± 0.0
3.358AsnSer: 3.358 ± 0.0
3.694AsnThr: 3.694 ± 0.0
4.701AsnVal: 4.701 ± 0.0
1.007AsnTrp: 1.007 ± 0.0
3.022AsnTyr: 3.022 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
6.38ProAla: 6.38 ± 0.0
0.672ProCys: 0.672 ± 0.0
2.015ProAsp: 2.015 ± 0.0
3.022ProGlu: 3.022 ± 0.0
3.358ProPhe: 3.358 ± 0.0
1.679ProGly: 1.679 ± 0.0
0.336ProHis: 0.336 ± 0.0
5.037ProIle: 5.037 ± 0.0
3.358ProLys: 3.358 ± 0.0
5.037ProLeu: 5.037 ± 0.0
1.343ProMet: 1.343 ± 0.0
1.007ProAsn: 1.007 ± 0.0
4.701ProPro: 4.701 ± 0.0
1.343ProGln: 1.343 ± 0.0
1.007ProArg: 1.007 ± 0.0
3.694ProSer: 3.694 ± 0.0
5.373ProThr: 5.373 ± 0.0
2.351ProVal: 2.351 ± 0.0
0.672ProTrp: 0.672 ± 0.0
2.015ProTyr: 2.015 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.022GlnAla: 3.022 ± 0.0
1.007GlnCys: 1.007 ± 0.0
1.679GlnAsp: 1.679 ± 0.0
1.343GlnGlu: 1.343 ± 0.0
1.007GlnPhe: 1.007 ± 0.0
2.351GlnGly: 2.351 ± 0.0
0.336GlnHis: 0.336 ± 0.0
1.679GlnIle: 1.679 ± 0.0
1.007GlnLys: 1.007 ± 0.0
2.015GlnLeu: 2.015 ± 0.0
0.672GlnMet: 0.672 ± 0.0
1.007GlnAsn: 1.007 ± 0.0
1.007GlnPro: 1.007 ± 0.0
0.672GlnGln: 0.672 ± 0.0
0.672GlnArg: 0.672 ± 0.0
3.358GlnSer: 3.358 ± 0.0
1.007GlnThr: 1.007 ± 0.0
2.015GlnVal: 2.015 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
1.007GlnTyr: 1.007 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.686ArgAla: 2.686 ± 0.0
0.336ArgCys: 0.336 ± 0.0
3.022ArgAsp: 3.022 ± 0.0
2.015ArgGlu: 2.015 ± 0.0
3.694ArgPhe: 3.694 ± 0.0
1.679ArgGly: 1.679 ± 0.0
0.672ArgHis: 0.672 ± 0.0
2.351ArgIle: 2.351 ± 0.0
1.679ArgLys: 1.679 ± 0.0
5.037ArgLeu: 5.037 ± 0.0
0.336ArgMet: 0.336 ± 0.0
2.686ArgAsn: 2.686 ± 0.0
1.343ArgPro: 1.343 ± 0.0
2.351ArgGln: 2.351 ± 0.0
1.007ArgArg: 1.007 ± 0.0
2.015ArgSer: 2.015 ± 0.0
2.686ArgThr: 2.686 ± 0.0
5.037ArgVal: 5.037 ± 0.0
0.672ArgTrp: 0.672 ± 0.0
1.679ArgTyr: 1.679 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.03SerAla: 4.03 ± 0.0
0.672SerCys: 0.672 ± 0.0
6.38SerAsp: 6.38 ± 0.0
3.022SerGlu: 3.022 ± 0.0
4.701SerPhe: 4.701 ± 0.0
6.38SerGly: 6.38 ± 0.0
1.343SerHis: 1.343 ± 0.0
5.037SerIle: 5.037 ± 0.0
3.358SerLys: 3.358 ± 0.0
6.38SerLeu: 6.38 ± 0.0
2.351SerMet: 2.351 ± 0.0
4.03SerAsn: 4.03 ± 0.0
3.694SerPro: 3.694 ± 0.0
2.015SerGln: 2.015 ± 0.0
3.022SerArg: 3.022 ± 0.0
5.373SerSer: 5.373 ± 0.0
8.731SerThr: 8.731 ± 0.0
4.365SerVal: 4.365 ± 0.0
2.015SerTrp: 2.015 ± 0.0
3.022SerTyr: 3.022 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.365ThrAla: 4.365 ± 0.0
1.343ThrCys: 1.343 ± 0.0
2.686ThrAsp: 2.686 ± 0.0
2.686ThrGlu: 2.686 ± 0.0
3.358ThrPhe: 3.358 ± 0.0
5.037ThrGly: 5.037 ± 0.0
1.343ThrHis: 1.343 ± 0.0
5.037ThrIle: 5.037 ± 0.0
1.343ThrLys: 1.343 ± 0.0
5.373ThrLeu: 5.373 ± 0.0
2.686ThrMet: 2.686 ± 0.0
6.38ThrAsn: 6.38 ± 0.0
4.701ThrPro: 4.701 ± 0.0
1.343ThrGln: 1.343 ± 0.0
2.351ThrArg: 2.351 ± 0.0
9.738ThrSer: 9.738 ± 0.0
7.052ThrThr: 7.052 ± 0.0
5.037ThrVal: 5.037 ± 0.0
1.343ThrTrp: 1.343 ± 0.0
2.015ThrTyr: 2.015 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
5.037ValAla: 5.037 ± 0.0
0.672ValCys: 0.672 ± 0.0
3.694ValAsp: 3.694 ± 0.0
3.358ValGlu: 3.358 ± 0.0
3.358ValPhe: 3.358 ± 0.0
4.03ValGly: 4.03 ± 0.0
1.007ValHis: 1.007 ± 0.0
1.343ValIle: 1.343 ± 0.0
2.351ValLys: 2.351 ± 0.0
6.38ValLeu: 6.38 ± 0.0
1.343ValMet: 1.343 ± 0.0
6.044ValAsn: 6.044 ± 0.0
4.365ValPro: 4.365 ± 0.0
1.679ValGln: 1.679 ± 0.0
3.022ValArg: 3.022 ± 0.0
5.709ValSer: 5.709 ± 0.0
4.365ValThr: 4.365 ± 0.0
2.351ValVal: 2.351 ± 0.0
1.007ValTrp: 1.007 ± 0.0
2.351ValTyr: 2.351 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.007TrpAla: 1.007 ± 0.0
0.672TrpCys: 0.672 ± 0.0
0.336TrpAsp: 0.336 ± 0.0
1.343TrpGlu: 1.343 ± 0.0
0.336TrpPhe: 0.336 ± 0.0
1.007TrpGly: 1.007 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.679TrpIle: 1.679 ± 0.0
1.007TrpLys: 1.007 ± 0.0
2.015TrpLeu: 2.015 ± 0.0
1.007TrpMet: 1.007 ± 0.0
2.351TrpAsn: 2.351 ± 0.0
0.672TrpPro: 0.672 ± 0.0
1.007TrpGln: 1.007 ± 0.0
0.672TrpArg: 0.672 ± 0.0
0.0TrpSer: 0.0 ± 0.0
1.007TrpThr: 1.007 ± 0.0
1.343TrpVal: 1.343 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.022TyrAla: 3.022 ± 0.0
1.007TyrCys: 1.007 ± 0.0
4.03TyrAsp: 4.03 ± 0.0
3.022TyrGlu: 3.022 ± 0.0
2.351TyrPhe: 2.351 ± 0.0
2.351TyrGly: 2.351 ± 0.0
1.007TyrHis: 1.007 ± 0.0
0.672TyrIle: 0.672 ± 0.0
2.015TyrLys: 2.015 ± 0.0
2.686TyrLeu: 2.686 ± 0.0
1.007TyrMet: 1.007 ± 0.0
2.351TyrAsn: 2.351 ± 0.0
1.679TyrPro: 1.679 ± 0.0
1.007TyrGln: 1.007 ± 0.0
1.679TyrArg: 1.679 ± 0.0
3.694TyrSer: 3.694 ± 0.0
2.351TyrThr: 2.351 ± 0.0
3.358TyrVal: 3.358 ± 0.0
0.336TyrTrp: 0.336 ± 0.0
2.015TyrTyr: 2.015 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2979 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski