Amino acid dipepetide frequency for Shahe picorna-like virus 13

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.647AlaAla: 8.647 ± 0.0
0.376AlaCys: 0.376 ± 0.0
4.135AlaAsp: 4.135 ± 0.0
2.632AlaGlu: 2.632 ± 0.0
4.887AlaPhe: 4.887 ± 0.0
3.759AlaGly: 3.759 ± 0.0
1.128AlaHis: 1.128 ± 0.0
4.887AlaIle: 4.887 ± 0.0
3.759AlaLys: 3.759 ± 0.0
8.647AlaLeu: 8.647 ± 0.0
1.88AlaMet: 1.88 ± 0.0
1.128AlaAsn: 1.128 ± 0.0
3.383AlaPro: 3.383 ± 0.0
3.008AlaGln: 3.008 ± 0.0
3.383AlaArg: 3.383 ± 0.0
7.143AlaSer: 7.143 ± 0.0
7.143AlaThr: 7.143 ± 0.0
4.887AlaVal: 4.887 ± 0.0
0.752AlaTrp: 0.752 ± 0.0
1.504AlaTyr: 1.504 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.752CysAla: 0.752 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.376CysAsp: 0.376 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.504CysGly: 1.504 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.376CysIle: 0.376 ± 0.0
0.752CysLys: 0.752 ± 0.0
0.752CysLeu: 0.752 ± 0.0
0.376CysMet: 0.376 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.128CysPro: 1.128 ± 0.0
0.376CysGln: 0.376 ± 0.0
0.376CysArg: 0.376 ± 0.0
1.128CysSer: 1.128 ± 0.0
0.376CysThr: 0.376 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.128CysTyr: 1.128 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.383AspAla: 3.383 ± 0.0
0.0AspCys: 0.0 ± 0.0
4.887AspAsp: 4.887 ± 0.0
3.383AspGlu: 3.383 ± 0.0
4.887AspPhe: 4.887 ± 0.0
3.008AspGly: 3.008 ± 0.0
0.752AspHis: 0.752 ± 0.0
3.759AspIle: 3.759 ± 0.0
1.504AspLys: 1.504 ± 0.0
6.015AspLeu: 6.015 ± 0.0
0.0AspMet: 0.0 ± 0.0
1.128AspAsn: 1.128 ± 0.0
4.135AspPro: 4.135 ± 0.0
3.759AspGln: 3.759 ± 0.0
3.008AspArg: 3.008 ± 0.0
3.759AspSer: 3.759 ± 0.0
1.88AspThr: 1.88 ± 0.0
2.256AspVal: 2.256 ± 0.0
2.256AspTrp: 2.256 ± 0.0
1.88AspTyr: 1.88 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.008GluAla: 3.008 ± 0.0
1.128GluCys: 1.128 ± 0.0
4.135GluAsp: 4.135 ± 0.0
3.008GluGlu: 3.008 ± 0.0
2.632GluPhe: 2.632 ± 0.0
2.256GluGly: 2.256 ± 0.0
0.376GluHis: 0.376 ± 0.0
4.887GluIle: 4.887 ± 0.0
1.128GluLys: 1.128 ± 0.0
3.759GluLeu: 3.759 ± 0.0
1.128GluMet: 1.128 ± 0.0
2.632GluAsn: 2.632 ± 0.0
2.256GluPro: 2.256 ± 0.0
0.752GluGln: 0.752 ± 0.0
3.008GluArg: 3.008 ± 0.0
3.008GluSer: 3.008 ± 0.0
5.263GluThr: 5.263 ± 0.0
1.88GluVal: 1.88 ± 0.0
0.0GluTrp: 0.0 ± 0.0
1.504GluTyr: 1.504 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.511PheAla: 4.511 ± 0.0
0.376PheCys: 0.376 ± 0.0
3.008PheAsp: 3.008 ± 0.0
3.008PheGlu: 3.008 ± 0.0
2.256PhePhe: 2.256 ± 0.0
6.391PheGly: 6.391 ± 0.0
0.752PheHis: 0.752 ± 0.0
1.88PheIle: 1.88 ± 0.0
2.256PheLys: 2.256 ± 0.0
3.759PheLeu: 3.759 ± 0.0
2.256PheMet: 2.256 ± 0.0
3.759PheAsn: 3.759 ± 0.0
3.008PhePro: 3.008 ± 0.0
2.256PheGln: 2.256 ± 0.0
2.256PheArg: 2.256 ± 0.0
6.391PheSer: 6.391 ± 0.0
2.632PheThr: 2.632 ± 0.0
1.504PheVal: 1.504 ± 0.0
0.752PheTrp: 0.752 ± 0.0
2.256PheTyr: 2.256 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
6.767GlyAla: 6.767 ± 0.0
0.376GlyCys: 0.376 ± 0.0
1.128GlyAsp: 1.128 ± 0.0
3.008GlyGlu: 3.008 ± 0.0
3.008GlyPhe: 3.008 ± 0.0
2.632GlyGly: 2.632 ± 0.0
0.752GlyHis: 0.752 ± 0.0
4.511GlyIle: 4.511 ± 0.0
4.887GlyLys: 4.887 ± 0.0
7.519GlyLeu: 7.519 ± 0.0
0.752GlyMet: 0.752 ± 0.0
4.135GlyAsn: 4.135 ± 0.0
1.88GlyPro: 1.88 ± 0.0
0.752GlyGln: 0.752 ± 0.0
3.008GlyArg: 3.008 ± 0.0
4.511GlySer: 4.511 ± 0.0
3.759GlyThr: 3.759 ± 0.0
4.135GlyVal: 4.135 ± 0.0
0.752GlyTrp: 0.752 ± 0.0
1.88GlyTyr: 1.88 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.128HisAla: 1.128 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.376HisAsp: 0.376 ± 0.0
1.128HisGlu: 1.128 ± 0.0
1.128HisPhe: 1.128 ± 0.0
1.128HisGly: 1.128 ± 0.0
0.376HisHis: 0.376 ± 0.0
0.752HisIle: 0.752 ± 0.0
0.752HisLys: 0.752 ± 0.0
1.88HisLeu: 1.88 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.376HisAsn: 0.376 ± 0.0
1.88HisPro: 1.88 ± 0.0
0.376HisGln: 0.376 ± 0.0
0.376HisArg: 0.376 ± 0.0
2.256HisSer: 2.256 ± 0.0
2.256HisThr: 2.256 ± 0.0
0.752HisVal: 0.752 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.376HisTyr: 0.376 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.639IleAla: 5.639 ± 0.0
1.88IleCys: 1.88 ± 0.0
3.008IleAsp: 3.008 ± 0.0
1.88IleGlu: 1.88 ± 0.0
4.135IlePhe: 4.135 ± 0.0
4.511IleGly: 4.511 ± 0.0
1.504IleHis: 1.504 ± 0.0
1.504IleIle: 1.504 ± 0.0
2.256IleLys: 2.256 ± 0.0
6.015IleLeu: 6.015 ± 0.0
0.752IleMet: 0.752 ± 0.0
4.887IleAsn: 4.887 ± 0.0
4.135IlePro: 4.135 ± 0.0
2.256IleGln: 2.256 ± 0.0
3.008IleArg: 3.008 ± 0.0
4.135IleSer: 4.135 ± 0.0
3.759IleThr: 3.759 ± 0.0
3.383IleVal: 3.383 ± 0.0
1.88IleTrp: 1.88 ± 0.0
1.128IleTyr: 1.128 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
1.504LysAla: 1.504 ± 0.0
1.504LysCys: 1.504 ± 0.0
3.383LysAsp: 3.383 ± 0.0
0.752LysGlu: 0.752 ± 0.0
3.008LysPhe: 3.008 ± 0.0
1.504LysGly: 1.504 ± 0.0
0.376LysHis: 0.376 ± 0.0
6.015LysIle: 6.015 ± 0.0
3.383LysLys: 3.383 ± 0.0
2.632LysLeu: 2.632 ± 0.0
1.88LysMet: 1.88 ± 0.0
2.256LysAsn: 2.256 ± 0.0
1.128LysPro: 1.128 ± 0.0
1.504LysGln: 1.504 ± 0.0
1.128LysArg: 1.128 ± 0.0
2.256LysSer: 2.256 ± 0.0
2.632LysThr: 2.632 ± 0.0
2.256LysVal: 2.256 ± 0.0
0.752LysTrp: 0.752 ± 0.0
1.88LysTyr: 1.88 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
7.895LeuAla: 7.895 ± 0.0
0.376LeuCys: 0.376 ± 0.0
4.135LeuAsp: 4.135 ± 0.0
4.511LeuGlu: 4.511 ± 0.0
2.632LeuPhe: 2.632 ± 0.0
4.887LeuGly: 4.887 ± 0.0
2.256LeuHis: 2.256 ± 0.0
6.015LeuIle: 6.015 ± 0.0
6.391LeuLys: 6.391 ± 0.0
8.271LeuLeu: 8.271 ± 0.0
3.008LeuMet: 3.008 ± 0.0
5.639LeuAsn: 5.639 ± 0.0
5.263LeuPro: 5.263 ± 0.0
3.759LeuGln: 3.759 ± 0.0
3.759LeuArg: 3.759 ± 0.0
8.271LeuSer: 8.271 ± 0.0
7.519LeuThr: 7.519 ± 0.0
4.135LeuVal: 4.135 ± 0.0
0.0LeuTrp: 0.0 ± 0.0
2.256LeuTyr: 2.256 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.88MetAla: 1.88 ± 0.0
0.376MetCys: 0.376 ± 0.0
1.88MetAsp: 1.88 ± 0.0
0.752MetGlu: 0.752 ± 0.0
1.88MetPhe: 1.88 ± 0.0
0.752MetGly: 0.752 ± 0.0
0.376MetHis: 0.376 ± 0.0
3.383MetIle: 3.383 ± 0.0
1.128MetLys: 1.128 ± 0.0
0.752MetLeu: 0.752 ± 0.0
0.752MetMet: 0.752 ± 0.0
2.256MetAsn: 2.256 ± 0.0
1.504MetPro: 1.504 ± 0.0
1.128MetGln: 1.128 ± 0.0
0.752MetArg: 0.752 ± 0.0
1.88MetSer: 1.88 ± 0.0
2.256MetThr: 2.256 ± 0.0
1.88MetVal: 1.88 ± 0.0
0.376MetTrp: 0.376 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.511AsnAla: 4.511 ± 0.0
0.0AsnCys: 0.0 ± 0.0
1.504AsnAsp: 1.504 ± 0.0
3.383AsnGlu: 3.383 ± 0.0
3.759AsnPhe: 3.759 ± 0.0
1.88AsnGly: 1.88 ± 0.0
1.128AsnHis: 1.128 ± 0.0
4.511AsnIle: 4.511 ± 0.0
1.88AsnLys: 1.88 ± 0.0
4.887AsnLeu: 4.887 ± 0.0
2.632AsnMet: 2.632 ± 0.0
1.128AsnAsn: 1.128 ± 0.0
3.008AsnPro: 3.008 ± 0.0
1.504AsnGln: 1.504 ± 0.0
3.008AsnArg: 3.008 ± 0.0
4.135AsnSer: 4.135 ± 0.0
3.008AsnThr: 3.008 ± 0.0
3.759AsnVal: 3.759 ± 0.0
1.128AsnTrp: 1.128 ± 0.0
0.752AsnTyr: 0.752 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
4.887ProAla: 4.887 ± 0.0
0.752ProCys: 0.752 ± 0.0
3.383ProAsp: 3.383 ± 0.0
1.88ProGlu: 1.88 ± 0.0
4.135ProPhe: 4.135 ± 0.0
3.759ProGly: 3.759 ± 0.0
2.632ProHis: 2.632 ± 0.0
5.263ProIle: 5.263 ± 0.0
1.504ProLys: 1.504 ± 0.0
3.759ProLeu: 3.759 ± 0.0
1.504ProMet: 1.504 ± 0.0
1.504ProAsn: 1.504 ± 0.0
3.008ProPro: 3.008 ± 0.0
1.88ProGln: 1.88 ± 0.0
3.008ProArg: 3.008 ± 0.0
6.391ProSer: 6.391 ± 0.0
3.383ProThr: 3.383 ± 0.0
3.008ProVal: 3.008 ± 0.0
2.256ProTrp: 2.256 ± 0.0
2.632ProTyr: 2.632 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.383GlnAla: 3.383 ± 0.0
0.376GlnCys: 0.376 ± 0.0
1.88GlnAsp: 1.88 ± 0.0
3.008GlnGlu: 3.008 ± 0.0
1.88GlnPhe: 1.88 ± 0.0
3.383GlnGly: 3.383 ± 0.0
0.0GlnHis: 0.0 ± 0.0
1.88GlnIle: 1.88 ± 0.0
1.128GlnLys: 1.128 ± 0.0
2.632GlnLeu: 2.632 ± 0.0
2.632GlnMet: 2.632 ± 0.0
2.256GlnAsn: 2.256 ± 0.0
3.383GlnPro: 3.383 ± 0.0
1.88GlnGln: 1.88 ± 0.0
0.0GlnArg: 0.0 ± 0.0
2.256GlnSer: 2.256 ± 0.0
3.383GlnThr: 3.383 ± 0.0
2.256GlnVal: 2.256 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
1.128GlnTyr: 1.128 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.256ArgAla: 2.256 ± 0.0
0.0ArgCys: 0.0 ± 0.0
2.632ArgAsp: 2.632 ± 0.0
3.008ArgGlu: 3.008 ± 0.0
1.504ArgPhe: 1.504 ± 0.0
3.008ArgGly: 3.008 ± 0.0
0.752ArgHis: 0.752 ± 0.0
4.135ArgIle: 4.135 ± 0.0
1.88ArgLys: 1.88 ± 0.0
5.639ArgLeu: 5.639 ± 0.0
0.752ArgMet: 0.752 ± 0.0
1.88ArgAsn: 1.88 ± 0.0
2.256ArgPro: 2.256 ± 0.0
2.256ArgGln: 2.256 ± 0.0
2.256ArgArg: 2.256 ± 0.0
3.383ArgSer: 3.383 ± 0.0
4.135ArgThr: 4.135 ± 0.0
3.759ArgVal: 3.759 ± 0.0
0.376ArgTrp: 0.376 ± 0.0
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
6.767SerAla: 6.767 ± 0.0
0.376SerCys: 0.376 ± 0.0
5.263SerAsp: 5.263 ± 0.0
4.511SerGlu: 4.511 ± 0.0
4.511SerPhe: 4.511 ± 0.0
6.015SerGly: 6.015 ± 0.0
1.504SerHis: 1.504 ± 0.0
3.383SerIle: 3.383 ± 0.0
1.504SerLys: 1.504 ± 0.0
11.654SerLeu: 11.654 ± 0.0
0.752SerMet: 0.752 ± 0.0
3.759SerAsn: 3.759 ± 0.0
6.391SerPro: 6.391 ± 0.0
4.135SerGln: 4.135 ± 0.0
3.383SerArg: 3.383 ± 0.0
8.271SerSer: 8.271 ± 0.0
6.391SerThr: 6.391 ± 0.0
5.639SerVal: 5.639 ± 0.0
1.88SerTrp: 1.88 ± 0.0
0.752SerTyr: 0.752 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.135ThrAla: 4.135 ± 0.0
0.752ThrCys: 0.752 ± 0.0
3.008ThrAsp: 3.008 ± 0.0
4.135ThrGlu: 4.135 ± 0.0
4.135ThrPhe: 4.135 ± 0.0
4.135ThrGly: 4.135 ± 0.0
1.504ThrHis: 1.504 ± 0.0
2.256ThrIle: 2.256 ± 0.0
1.504ThrLys: 1.504 ± 0.0
5.263ThrLeu: 5.263 ± 0.0
3.008ThrMet: 3.008 ± 0.0
5.263ThrAsn: 5.263 ± 0.0
4.511ThrPro: 4.511 ± 0.0
3.008ThrGln: 3.008 ± 0.0
3.383ThrArg: 3.383 ± 0.0
7.519ThrSer: 7.519 ± 0.0
5.263ThrThr: 5.263 ± 0.0
6.015ThrVal: 6.015 ± 0.0
0.376ThrTrp: 0.376 ± 0.0
2.632ThrTyr: 2.632 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
3.759ValAla: 3.759 ± 0.0
0.376ValCys: 0.376 ± 0.0
5.263ValAsp: 5.263 ± 0.0
2.256ValGlu: 2.256 ± 0.0
2.632ValPhe: 2.632 ± 0.0
4.887ValGly: 4.887 ± 0.0
0.376ValHis: 0.376 ± 0.0
1.504ValIle: 1.504 ± 0.0
1.88ValLys: 1.88 ± 0.0
3.383ValLeu: 3.383 ± 0.0
1.128ValMet: 1.128 ± 0.0
4.135ValAsn: 4.135 ± 0.0
4.887ValPro: 4.887 ± 0.0
3.008ValGln: 3.008 ± 0.0
2.632ValArg: 2.632 ± 0.0
6.391ValSer: 6.391 ± 0.0
3.383ValThr: 3.383 ± 0.0
4.887ValVal: 4.887 ± 0.0
0.376ValTrp: 0.376 ± 0.0
1.504ValTyr: 1.504 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.376TrpAla: 0.376 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.376TrpAsp: 0.376 ± 0.0
1.128TrpGlu: 1.128 ± 0.0
0.376TrpPhe: 0.376 ± 0.0
0.376TrpGly: 0.376 ± 0.0
0.376TrpHis: 0.376 ± 0.0
0.376TrpIle: 0.376 ± 0.0
1.504TrpLys: 1.504 ± 0.0
1.128TrpLeu: 1.128 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.376TrpAsn: 0.376 ± 0.0
0.752TrpPro: 0.752 ± 0.0
0.376TrpGln: 0.376 ± 0.0
1.128TrpArg: 1.128 ± 0.0
1.88TrpSer: 1.88 ± 0.0
1.128TrpThr: 1.128 ± 0.0
0.376TrpVal: 0.376 ± 0.0
0.376TrpTrp: 0.376 ± 0.0
2.256TrpTyr: 2.256 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.88TyrAla: 1.88 ± 0.0
0.376TyrCys: 0.376 ± 0.0
1.88TyrAsp: 1.88 ± 0.0
0.376TyrGlu: 0.376 ± 0.0
1.88TyrPhe: 1.88 ± 0.0
0.376TyrGly: 0.376 ± 0.0
0.376TyrHis: 0.376 ± 0.0
0.752TyrIle: 0.752 ± 0.0
0.752TyrLys: 0.752 ± 0.0
2.632TyrLeu: 2.632 ± 0.0
0.376TyrMet: 0.376 ± 0.0
3.008TyrAsn: 3.008 ± 0.0
2.632TyrPro: 2.632 ± 0.0
0.752TyrGln: 0.752 ± 0.0
3.008TyrArg: 3.008 ± 0.0
1.88TyrSer: 1.88 ± 0.0
2.256TyrThr: 2.256 ± 0.0
1.88TyrVal: 1.88 ± 0.0
0.376TyrTrp: 0.376 ± 0.0
1.128TyrTyr: 1.128 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2661 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski