Amino acid dipepetide frequency for Beihai picorna-like virus 57

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.315AlaAla: 6.315 ± 0.0
1.486AlaCys: 1.486 ± 0.0
2.6AlaAsp: 2.6 ± 0.0
5.201AlaGlu: 5.201 ± 0.0
2.229AlaPhe: 2.229 ± 0.0
6.686AlaGly: 6.686 ± 0.0
1.486AlaHis: 1.486 ± 0.0
4.458AlaIle: 4.458 ± 0.0
2.972AlaLys: 2.972 ± 0.0
5.944AlaLeu: 5.944 ± 0.0
3.343AlaMet: 3.343 ± 0.0
4.086AlaAsn: 4.086 ± 0.0
7.801AlaPro: 7.801 ± 0.0
3.343AlaGln: 3.343 ± 0.0
4.829AlaArg: 4.829 ± 0.0
4.829AlaSer: 4.829 ± 0.0
6.686AlaThr: 6.686 ± 0.0
7.429AlaVal: 7.429 ± 0.0
1.114AlaTrp: 1.114 ± 0.0
2.6AlaTyr: 2.6 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.743CysAsp: 0.743 ± 0.0
0.743CysGlu: 0.743 ± 0.0
0.371CysPhe: 0.371 ± 0.0
2.229CysGly: 2.229 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.371CysIle: 0.371 ± 0.0
0.371CysLys: 0.371 ± 0.0
2.972CysLeu: 2.972 ± 0.0
0.371CysMet: 0.371 ± 0.0
0.371CysAsn: 0.371 ± 0.0
0.371CysPro: 0.371 ± 0.0
0.371CysGln: 0.371 ± 0.0
0.371CysArg: 0.371 ± 0.0
1.114CysSer: 1.114 ± 0.0
0.743CysThr: 0.743 ± 0.0
1.114CysVal: 1.114 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.371CysTyr: 0.371 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.343AspAla: 3.343 ± 0.0
1.114AspCys: 1.114 ± 0.0
3.715AspAsp: 3.715 ± 0.0
4.458AspGlu: 4.458 ± 0.0
2.229AspPhe: 2.229 ± 0.0
4.458AspGly: 4.458 ± 0.0
2.229AspHis: 2.229 ± 0.0
2.229AspIle: 2.229 ± 0.0
2.6AspLys: 2.6 ± 0.0
4.458AspLeu: 4.458 ± 0.0
1.857AspMet: 1.857 ± 0.0
1.857AspAsn: 1.857 ± 0.0
2.972AspPro: 2.972 ± 0.0
1.486AspGln: 1.486 ± 0.0
0.743AspArg: 0.743 ± 0.0
4.458AspSer: 4.458 ± 0.0
3.343AspThr: 3.343 ± 0.0
2.6AspVal: 2.6 ± 0.0
0.743AspTrp: 0.743 ± 0.0
1.486AspTyr: 1.486 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
6.686GluAla: 6.686 ± 0.0
1.114GluCys: 1.114 ± 0.0
3.715GluAsp: 3.715 ± 0.0
6.686GluGlu: 6.686 ± 0.0
2.6GluPhe: 2.6 ± 0.0
2.6GluGly: 2.6 ± 0.0
0.743GluHis: 0.743 ± 0.0
4.458GluIle: 4.458 ± 0.0
3.343GluLys: 3.343 ± 0.0
5.944GluLeu: 5.944 ± 0.0
1.857GluMet: 1.857 ± 0.0
1.486GluAsn: 1.486 ± 0.0
3.715GluPro: 3.715 ± 0.0
2.6GluGln: 2.6 ± 0.0
1.857GluArg: 1.857 ± 0.0
3.343GluSer: 3.343 ± 0.0
2.972GluThr: 2.972 ± 0.0
4.086GluVal: 4.086 ± 0.0
0.0GluTrp: 0.0 ± 0.0
1.486GluTyr: 1.486 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.6PheAla: 2.6 ± 0.0
0.371PheCys: 0.371 ± 0.0
1.857PheAsp: 1.857 ± 0.0
2.229PheGlu: 2.229 ± 0.0
0.371PhePhe: 0.371 ± 0.0
4.458PheGly: 4.458 ± 0.0
0.371PheHis: 0.371 ± 0.0
1.486PheIle: 1.486 ± 0.0
1.114PheLys: 1.114 ± 0.0
3.343PheLeu: 3.343 ± 0.0
0.743PheMet: 0.743 ± 0.0
1.486PheAsn: 1.486 ± 0.0
2.229PhePro: 2.229 ± 0.0
2.6PheGln: 2.6 ± 0.0
3.715PheArg: 3.715 ± 0.0
2.972PheSer: 2.972 ± 0.0
4.829PheThr: 4.829 ± 0.0
3.715PheVal: 3.715 ± 0.0
0.371PheTrp: 0.371 ± 0.0
1.114PheTyr: 1.114 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.829GlyAla: 4.829 ± 0.0
0.743GlyCys: 0.743 ± 0.0
4.086GlyAsp: 4.086 ± 0.0
2.229GlyGlu: 2.229 ± 0.0
2.6GlyPhe: 2.6 ± 0.0
3.343GlyGly: 3.343 ± 0.0
0.743GlyHis: 0.743 ± 0.0
4.458GlyIle: 4.458 ± 0.0
4.086GlyLys: 4.086 ± 0.0
7.429GlyLeu: 7.429 ± 0.0
2.6GlyMet: 2.6 ± 0.0
1.114GlyAsn: 1.114 ± 0.0
3.343GlyPro: 3.343 ± 0.0
2.972GlyGln: 2.972 ± 0.0
2.972GlyArg: 2.972 ± 0.0
5.201GlySer: 5.201 ± 0.0
1.857GlyThr: 1.857 ± 0.0
6.315GlyVal: 6.315 ± 0.0
0.371GlyTrp: 0.371 ± 0.0
2.6GlyTyr: 2.6 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.857HisAla: 1.857 ± 0.0
1.114HisCys: 1.114 ± 0.0
1.114HisAsp: 1.114 ± 0.0
1.114HisGlu: 1.114 ± 0.0
0.0HisPhe: 0.0 ± 0.0
1.486HisGly: 1.486 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.371HisIle: 0.371 ± 0.0
0.0HisLys: 0.0 ± 0.0
1.114HisLeu: 1.114 ± 0.0
0.743HisMet: 0.743 ± 0.0
1.114HisAsn: 1.114 ± 0.0
2.229HisPro: 2.229 ± 0.0
1.857HisGln: 1.857 ± 0.0
1.857HisArg: 1.857 ± 0.0
0.743HisSer: 0.743 ± 0.0
1.114HisThr: 1.114 ± 0.0
2.229HisVal: 2.229 ± 0.0
1.114HisTrp: 1.114 ± 0.0
1.857HisTyr: 1.857 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.6IleAla: 2.6 ± 0.0
0.371IleCys: 0.371 ± 0.0
2.6IleAsp: 2.6 ± 0.0
2.972IleGlu: 2.972 ± 0.0
2.6IlePhe: 2.6 ± 0.0
2.6IleGly: 2.6 ± 0.0
1.857IleHis: 1.857 ± 0.0
1.114IleIle: 1.114 ± 0.0
2.972IleLys: 2.972 ± 0.0
2.229IleLeu: 2.229 ± 0.0
1.486IleMet: 1.486 ± 0.0
1.857IleAsn: 1.857 ± 0.0
4.086IlePro: 4.086 ± 0.0
2.972IleGln: 2.972 ± 0.0
1.486IleArg: 1.486 ± 0.0
4.458IleSer: 4.458 ± 0.0
3.343IleThr: 3.343 ± 0.0
1.114IleVal: 1.114 ± 0.0
1.114IleTrp: 1.114 ± 0.0
1.857IleTyr: 1.857 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.972LysAla: 2.972 ± 0.0
0.0LysCys: 0.0 ± 0.0
2.972LysAsp: 2.972 ± 0.0
3.343LysGlu: 3.343 ± 0.0
1.857LysPhe: 1.857 ± 0.0
1.857LysGly: 1.857 ± 0.0
1.486LysHis: 1.486 ± 0.0
1.857LysIle: 1.857 ± 0.0
2.972LysLys: 2.972 ± 0.0
3.715LysLeu: 3.715 ± 0.0
1.486LysMet: 1.486 ± 0.0
1.857LysAsn: 1.857 ± 0.0
4.086LysPro: 4.086 ± 0.0
0.743LysGln: 0.743 ± 0.0
2.972LysArg: 2.972 ± 0.0
3.715LysSer: 3.715 ± 0.0
1.857LysThr: 1.857 ± 0.0
3.343LysVal: 3.343 ± 0.0
1.486LysTrp: 1.486 ± 0.0
0.371LysTyr: 0.371 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
8.915LeuAla: 8.915 ± 0.0
1.486LeuCys: 1.486 ± 0.0
5.572LeuAsp: 5.572 ± 0.0
4.086LeuGlu: 4.086 ± 0.0
4.086LeuPhe: 4.086 ± 0.0
4.458LeuGly: 4.458 ± 0.0
1.857LeuHis: 1.857 ± 0.0
4.086LeuIle: 4.086 ± 0.0
4.086LeuLys: 4.086 ± 0.0
6.315LeuLeu: 6.315 ± 0.0
0.743LeuMet: 0.743 ± 0.0
3.343LeuAsn: 3.343 ± 0.0
4.829LeuPro: 4.829 ± 0.0
4.829LeuGln: 4.829 ± 0.0
3.343LeuArg: 3.343 ± 0.0
5.201LeuSer: 5.201 ± 0.0
6.315LeuThr: 6.315 ± 0.0
5.944LeuVal: 5.944 ± 0.0
1.486LeuTrp: 1.486 ± 0.0
2.6LeuTyr: 2.6 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.229MetAla: 2.229 ± 0.0
1.114MetCys: 1.114 ± 0.0
1.486MetAsp: 1.486 ± 0.0
1.486MetGlu: 1.486 ± 0.0
1.486MetPhe: 1.486 ± 0.0
0.371MetGly: 0.371 ± 0.0
0.371MetHis: 0.371 ± 0.0
0.743MetIle: 0.743 ± 0.0
1.486MetLys: 1.486 ± 0.0
0.743MetLeu: 0.743 ± 0.0
0.371MetMet: 0.371 ± 0.0
1.114MetAsn: 1.114 ± 0.0
1.486MetPro: 1.486 ± 0.0
0.743MetGln: 0.743 ± 0.0
2.229MetArg: 2.229 ± 0.0
1.486MetSer: 1.486 ± 0.0
2.229MetThr: 2.229 ± 0.0
2.229MetVal: 2.229 ± 0.0
0.743MetTrp: 0.743 ± 0.0
0.371MetTyr: 0.371 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.086AsnAla: 4.086 ± 0.0
0.371AsnCys: 0.371 ± 0.0
1.857AsnAsp: 1.857 ± 0.0
1.857AsnGlu: 1.857 ± 0.0
2.229AsnPhe: 2.229 ± 0.0
2.6AsnGly: 2.6 ± 0.0
0.743AsnHis: 0.743 ± 0.0
1.114AsnIle: 1.114 ± 0.0
1.114AsnLys: 1.114 ± 0.0
4.086AsnLeu: 4.086 ± 0.0
1.857AsnMet: 1.857 ± 0.0
1.114AsnAsn: 1.114 ± 0.0
3.715AsnPro: 3.715 ± 0.0
0.371AsnGln: 0.371 ± 0.0
1.486AsnArg: 1.486 ± 0.0
1.114AsnSer: 1.114 ± 0.0
0.743AsnThr: 0.743 ± 0.0
3.343AsnVal: 3.343 ± 0.0
0.743AsnTrp: 0.743 ± 0.0
0.743AsnTyr: 0.743 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.6ProAla: 2.6 ± 0.0
0.0ProCys: 0.0 ± 0.0
2.6ProAsp: 2.6 ± 0.0
5.944ProGlu: 5.944 ± 0.0
4.086ProPhe: 4.086 ± 0.0
3.343ProGly: 3.343 ± 0.0
1.486ProHis: 1.486 ± 0.0
2.6ProIle: 2.6 ± 0.0
2.972ProLys: 2.972 ± 0.0
5.944ProLeu: 5.944 ± 0.0
1.114ProMet: 1.114 ± 0.0
2.6ProAsn: 2.6 ± 0.0
5.201ProPro: 5.201 ± 0.0
0.743ProGln: 0.743 ± 0.0
4.829ProArg: 4.829 ± 0.0
6.315ProSer: 6.315 ± 0.0
2.229ProThr: 2.229 ± 0.0
5.944ProVal: 5.944 ± 0.0
1.486ProTrp: 1.486 ± 0.0
2.972ProTyr: 2.972 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.6GlnAla: 2.6 ± 0.0
0.0GlnCys: 0.0 ± 0.0
2.229GlnAsp: 2.229 ± 0.0
1.857GlnGlu: 1.857 ± 0.0
1.114GlnPhe: 1.114 ± 0.0
3.715GlnGly: 3.715 ± 0.0
0.371GlnHis: 0.371 ± 0.0
0.743GlnIle: 0.743 ± 0.0
0.743GlnLys: 0.743 ± 0.0
3.715GlnLeu: 3.715 ± 0.0
0.371GlnMet: 0.371 ± 0.0
1.114GlnAsn: 1.114 ± 0.0
1.486GlnPro: 1.486 ± 0.0
0.743GlnGln: 0.743 ± 0.0
1.857GlnArg: 1.857 ± 0.0
4.458GlnSer: 4.458 ± 0.0
3.715GlnThr: 3.715 ± 0.0
2.6GlnVal: 2.6 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
1.486GlnTyr: 1.486 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
5.201ArgAla: 5.201 ± 0.0
0.371ArgCys: 0.371 ± 0.0
2.972ArgAsp: 2.972 ± 0.0
4.086ArgGlu: 4.086 ± 0.0
0.743ArgPhe: 0.743 ± 0.0
2.6ArgGly: 2.6 ± 0.0
2.229ArgHis: 2.229 ± 0.0
1.857ArgIle: 1.857 ± 0.0
2.6ArgLys: 2.6 ± 0.0
5.944ArgLeu: 5.944 ± 0.0
1.114ArgMet: 1.114 ± 0.0
1.114ArgAsn: 1.114 ± 0.0
3.343ArgPro: 3.343 ± 0.0
2.229ArgGln: 2.229 ± 0.0
3.715ArgArg: 3.715 ± 0.0
3.715ArgSer: 3.715 ± 0.0
2.6ArgThr: 2.6 ± 0.0
2.972ArgVal: 2.972 ± 0.0
1.114ArgTrp: 1.114 ± 0.0
2.972ArgTyr: 2.972 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
9.287SerAla: 9.287 ± 0.0
0.371SerCys: 0.371 ± 0.0
1.857SerAsp: 1.857 ± 0.0
4.829SerGlu: 4.829 ± 0.0
3.343SerPhe: 3.343 ± 0.0
3.715SerGly: 3.715 ± 0.0
2.6SerHis: 2.6 ± 0.0
4.829SerIle: 4.829 ± 0.0
4.086SerLys: 4.086 ± 0.0
6.686SerLeu: 6.686 ± 0.0
1.114SerMet: 1.114 ± 0.0
1.486SerAsn: 1.486 ± 0.0
3.715SerPro: 3.715 ± 0.0
1.486SerGln: 1.486 ± 0.0
2.972SerArg: 2.972 ± 0.0
4.086SerSer: 4.086 ± 0.0
4.829SerThr: 4.829 ± 0.0
8.915SerVal: 8.915 ± 0.0
2.6SerTrp: 2.6 ± 0.0
1.857SerTyr: 1.857 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.715ThrAla: 3.715 ± 0.0
0.743ThrCys: 0.743 ± 0.0
3.343ThrAsp: 3.343 ± 0.0
3.343ThrGlu: 3.343 ± 0.0
3.715ThrPhe: 3.715 ± 0.0
3.343ThrGly: 3.343 ± 0.0
2.6ThrHis: 2.6 ± 0.0
3.715ThrIle: 3.715 ± 0.0
2.229ThrLys: 2.229 ± 0.0
2.972ThrLeu: 2.972 ± 0.0
1.857ThrMet: 1.857 ± 0.0
2.229ThrAsn: 2.229 ± 0.0
1.486ThrPro: 1.486 ± 0.0
2.229ThrGln: 2.229 ± 0.0
4.829ThrArg: 4.829 ± 0.0
5.201ThrSer: 5.201 ± 0.0
5.572ThrThr: 5.572 ± 0.0
5.944ThrVal: 5.944 ± 0.0
1.857ThrTrp: 1.857 ± 0.0
2.229ThrTyr: 2.229 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
10.401ValAla: 10.401 ± 0.0
0.0ValCys: 0.0 ± 0.0
4.458ValAsp: 4.458 ± 0.0
3.715ValGlu: 3.715 ± 0.0
3.343ValPhe: 3.343 ± 0.0
4.829ValGly: 4.829 ± 0.0
1.114ValHis: 1.114 ± 0.0
2.972ValIle: 2.972 ± 0.0
2.6ValLys: 2.6 ± 0.0
5.944ValLeu: 5.944 ± 0.0
0.743ValMet: 0.743 ± 0.0
4.458ValAsn: 4.458 ± 0.0
7.429ValPro: 7.429 ± 0.0
1.486ValGln: 1.486 ± 0.0
3.715ValArg: 3.715 ± 0.0
8.172ValSer: 8.172 ± 0.0
5.572ValThr: 5.572 ± 0.0
4.458ValVal: 4.458 ± 0.0
1.114ValTrp: 1.114 ± 0.0
3.715ValTyr: 3.715 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.857TrpAla: 1.857 ± 0.0
0.743TrpCys: 0.743 ± 0.0
1.114TrpAsp: 1.114 ± 0.0
1.114TrpGlu: 1.114 ± 0.0
1.114TrpPhe: 1.114 ± 0.0
1.114TrpGly: 1.114 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.114TrpIle: 1.114 ± 0.0
1.486TrpLys: 1.486 ± 0.0
0.371TrpLeu: 0.371 ± 0.0
0.371TrpMet: 0.371 ± 0.0
0.371TrpAsn: 0.371 ± 0.0
0.371TrpPro: 0.371 ± 0.0
0.371TrpGln: 0.371 ± 0.0
2.229TrpArg: 2.229 ± 0.0
1.114TrpSer: 1.114 ± 0.0
1.486TrpThr: 1.486 ± 0.0
2.229TrpVal: 2.229 ± 0.0
0.371TrpTrp: 0.371 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.343TyrAla: 3.343 ± 0.0
1.486TyrCys: 1.486 ± 0.0
1.486TyrAsp: 1.486 ± 0.0
0.0TyrGlu: 0.0 ± 0.0
1.857TyrPhe: 1.857 ± 0.0
4.086TyrGly: 4.086 ± 0.0
0.743TyrHis: 0.743 ± 0.0
1.114TyrIle: 1.114 ± 0.0
1.114TyrLys: 1.114 ± 0.0
3.715TyrLeu: 3.715 ± 0.0
0.0TyrMet: 0.0 ± 0.0
1.114TyrAsn: 1.114 ± 0.0
1.114TyrPro: 1.114 ± 0.0
0.743TyrGln: 0.743 ± 0.0
1.486TyrArg: 1.486 ± 0.0
2.972TyrSer: 2.972 ± 0.0
1.114TyrThr: 1.114 ± 0.0
3.715TyrVal: 3.715 ± 0.0
1.114TyrTrp: 1.114 ± 0.0
2.229TyrTyr: 2.229 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2693 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski