Amino acid dipepetide frequency for Beihai picorna-like virus 96

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.042AlaAla: 2.042 ± 0.0
0.681AlaCys: 0.681 ± 0.0
1.361AlaAsp: 1.361 ± 0.0
2.042AlaGlu: 2.042 ± 0.0
1.361AlaPhe: 1.361 ± 0.0
2.383AlaGly: 2.383 ± 0.0
0.681AlaHis: 0.681 ± 0.0
3.744AlaIle: 3.744 ± 0.0
1.702AlaLys: 1.702 ± 0.0
4.765AlaLeu: 4.765 ± 0.0
0.681AlaMet: 0.681 ± 0.0
2.723AlaAsn: 2.723 ± 0.0
2.383AlaPro: 2.383 ± 0.0
1.021AlaGln: 1.021 ± 0.0
2.383AlaArg: 2.383 ± 0.0
3.063AlaSer: 3.063 ± 0.0
4.084AlaThr: 4.084 ± 0.0
3.404AlaVal: 3.404 ± 0.0
0.0AlaTrp: 0.0 ± 0.0
3.063AlaTyr: 3.063 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.021CysAla: 1.021 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.021CysAsp: 1.021 ± 0.0
1.361CysGlu: 1.361 ± 0.0
0.681CysPhe: 0.681 ± 0.0
0.681CysGly: 0.681 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.021CysIle: 1.021 ± 0.0
1.021CysLys: 1.021 ± 0.0
1.021CysLeu: 1.021 ± 0.0
0.681CysMet: 0.681 ± 0.0
1.361CysAsn: 1.361 ± 0.0
1.361CysPro: 1.361 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.021CysArg: 1.021 ± 0.0
1.021CysSer: 1.021 ± 0.0
1.702CysThr: 1.702 ± 0.0
2.042CysVal: 2.042 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.681CysTyr: 0.681 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.383AspAla: 2.383 ± 0.0
1.021AspCys: 1.021 ± 0.0
5.786AspAsp: 5.786 ± 0.0
3.744AspGlu: 3.744 ± 0.0
4.425AspPhe: 4.425 ± 0.0
2.723AspGly: 2.723 ± 0.0
1.361AspHis: 1.361 ± 0.0
4.425AspIle: 4.425 ± 0.0
3.744AspLys: 3.744 ± 0.0
5.446AspLeu: 5.446 ± 0.0
0.34AspMet: 0.34 ± 0.0
2.042AspAsn: 2.042 ± 0.0
2.042AspPro: 2.042 ± 0.0
2.042AspGln: 2.042 ± 0.0
3.063AspArg: 3.063 ± 0.0
4.425AspSer: 4.425 ± 0.0
0.34AspThr: 0.34 ± 0.0
3.744AspVal: 3.744 ± 0.0
1.021AspTrp: 1.021 ± 0.0
4.084AspTyr: 4.084 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.383GluAla: 2.383 ± 0.0
1.021GluCys: 1.021 ± 0.0
3.744GluAsp: 3.744 ± 0.0
4.425GluGlu: 4.425 ± 0.0
6.807GluPhe: 6.807 ± 0.0
2.723GluGly: 2.723 ± 0.0
0.0GluHis: 0.0 ± 0.0
5.106GluIle: 5.106 ± 0.0
4.425GluLys: 4.425 ± 0.0
5.786GluLeu: 5.786 ± 0.0
1.702GluMet: 1.702 ± 0.0
2.042GluAsn: 2.042 ± 0.0
2.723GluPro: 2.723 ± 0.0
1.702GluGln: 1.702 ± 0.0
3.063GluArg: 3.063 ± 0.0
4.084GluSer: 4.084 ± 0.0
3.063GluThr: 3.063 ± 0.0
6.127GluVal: 6.127 ± 0.0
1.021GluTrp: 1.021 ± 0.0
2.383GluTyr: 2.383 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
3.744PheAla: 3.744 ± 0.0
1.361PheCys: 1.361 ± 0.0
3.744PheAsp: 3.744 ± 0.0
4.084PheGlu: 4.084 ± 0.0
3.404PhePhe: 3.404 ± 0.0
2.042PheGly: 2.042 ± 0.0
1.702PheHis: 1.702 ± 0.0
4.084PheIle: 4.084 ± 0.0
2.723PheLys: 2.723 ± 0.0
5.106PheLeu: 5.106 ± 0.0
1.361PheMet: 1.361 ± 0.0
1.361PheAsn: 1.361 ± 0.0
1.702PhePro: 1.702 ± 0.0
0.681PheGln: 0.681 ± 0.0
1.702PheArg: 1.702 ± 0.0
7.148PheSer: 7.148 ± 0.0
3.744PheThr: 3.744 ± 0.0
5.446PheVal: 5.446 ± 0.0
0.0PheTrp: 0.0 ± 0.0
2.723PheTyr: 2.723 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.404GlyAla: 3.404 ± 0.0
0.681GlyCys: 0.681 ± 0.0
2.042GlyAsp: 2.042 ± 0.0
4.084GlyGlu: 4.084 ± 0.0
3.063GlyPhe: 3.063 ± 0.0
3.063GlyGly: 3.063 ± 0.0
2.042GlyHis: 2.042 ± 0.0
7.148GlyIle: 7.148 ± 0.0
4.425GlyLys: 4.425 ± 0.0
3.744GlyLeu: 3.744 ± 0.0
0.681GlyMet: 0.681 ± 0.0
1.021GlyAsn: 1.021 ± 0.0
2.383GlyPro: 2.383 ± 0.0
3.063GlyGln: 3.063 ± 0.0
1.361GlyArg: 1.361 ± 0.0
5.106GlySer: 5.106 ± 0.0
1.702GlyThr: 1.702 ± 0.0
4.425GlyVal: 4.425 ± 0.0
1.021GlyTrp: 1.021 ± 0.0
3.404GlyTyr: 3.404 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.681HisAla: 0.681 ± 0.0
0.681HisCys: 0.681 ± 0.0
1.361HisAsp: 1.361 ± 0.0
0.681HisGlu: 0.681 ± 0.0
2.723HisPhe: 2.723 ± 0.0
0.681HisGly: 0.681 ± 0.0
0.34HisHis: 0.34 ± 0.0
1.361HisIle: 1.361 ± 0.0
2.723HisLys: 2.723 ± 0.0
1.361HisLeu: 1.361 ± 0.0
0.681HisMet: 0.681 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.021HisPro: 1.021 ± 0.0
1.361HisGln: 1.361 ± 0.0
1.361HisArg: 1.361 ± 0.0
1.021HisSer: 1.021 ± 0.0
1.702HisThr: 1.702 ± 0.0
2.042HisVal: 2.042 ± 0.0
0.34HisTrp: 0.34 ± 0.0
1.021HisTyr: 1.021 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.383IleAla: 2.383 ± 0.0
1.702IleCys: 1.702 ± 0.0
4.765IleAsp: 4.765 ± 0.0
7.148IleGlu: 7.148 ± 0.0
4.084IlePhe: 4.084 ± 0.0
5.446IleGly: 5.446 ± 0.0
2.042IleHis: 2.042 ± 0.0
4.425IleIle: 4.425 ± 0.0
5.446IleLys: 5.446 ± 0.0
5.786IleLeu: 5.786 ± 0.0
0.681IleMet: 0.681 ± 0.0
4.765IleAsn: 4.765 ± 0.0
5.106IlePro: 5.106 ± 0.0
3.404IleGln: 3.404 ± 0.0
4.084IleArg: 4.084 ± 0.0
5.786IleSer: 5.786 ± 0.0
3.404IleThr: 3.404 ± 0.0
3.404IleVal: 3.404 ± 0.0
1.021IleTrp: 1.021 ± 0.0
2.042IleTyr: 2.042 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.042LysAla: 2.042 ± 0.0
1.361LysCys: 1.361 ± 0.0
4.425LysAsp: 4.425 ± 0.0
6.127LysGlu: 6.127 ± 0.0
2.383LysPhe: 2.383 ± 0.0
3.404LysGly: 3.404 ± 0.0
1.702LysHis: 1.702 ± 0.0
5.446LysIle: 5.446 ± 0.0
5.446LysLys: 5.446 ± 0.0
6.127LysLeu: 6.127 ± 0.0
1.361LysMet: 1.361 ± 0.0
3.744LysAsn: 3.744 ± 0.0
2.723LysPro: 2.723 ± 0.0
1.702LysGln: 1.702 ± 0.0
1.021LysArg: 1.021 ± 0.0
3.063LysSer: 3.063 ± 0.0
4.084LysThr: 4.084 ± 0.0
5.786LysVal: 5.786 ± 0.0
0.681LysTrp: 0.681 ± 0.0
4.084LysTyr: 4.084 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
3.744LeuAla: 3.744 ± 0.0
1.361LeuCys: 1.361 ± 0.0
6.127LeuAsp: 6.127 ± 0.0
5.106LeuGlu: 5.106 ± 0.0
4.084LeuPhe: 4.084 ± 0.0
4.425LeuGly: 4.425 ± 0.0
2.383LeuHis: 2.383 ± 0.0
5.446LeuIle: 5.446 ± 0.0
5.786LeuLys: 5.786 ± 0.0
8.509LeuLeu: 8.509 ± 0.0
2.383LeuMet: 2.383 ± 0.0
1.702LeuAsn: 1.702 ± 0.0
5.106LeuPro: 5.106 ± 0.0
3.404LeuGln: 3.404 ± 0.0
4.084LeuArg: 4.084 ± 0.0
8.509LeuSer: 8.509 ± 0.0
4.425LeuThr: 4.425 ± 0.0
3.744LeuVal: 3.744 ± 0.0
2.042LeuTrp: 2.042 ± 0.0
3.744LeuTyr: 3.744 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.361MetAla: 1.361 ± 0.0
0.34MetCys: 0.34 ± 0.0
0.34MetAsp: 0.34 ± 0.0
0.681MetGlu: 0.681 ± 0.0
0.34MetPhe: 0.34 ± 0.0
2.042MetGly: 2.042 ± 0.0
0.681MetHis: 0.681 ± 0.0
1.361MetIle: 1.361 ± 0.0
0.681MetLys: 0.681 ± 0.0
0.681MetLeu: 0.681 ± 0.0
0.0MetMet: 0.0 ± 0.0
1.361MetAsn: 1.361 ± 0.0
1.021MetPro: 1.021 ± 0.0
0.681MetGln: 0.681 ± 0.0
0.681MetArg: 0.681 ± 0.0
0.681MetSer: 0.681 ± 0.0
1.021MetThr: 1.021 ± 0.0
1.702MetVal: 1.702 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.021MetTyr: 1.021 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.702AsnAla: 1.702 ± 0.0
0.681AsnCys: 0.681 ± 0.0
2.723AsnAsp: 2.723 ± 0.0
3.063AsnGlu: 3.063 ± 0.0
1.702AsnPhe: 1.702 ± 0.0
3.063AsnGly: 3.063 ± 0.0
0.34AsnHis: 0.34 ± 0.0
5.786AsnIle: 5.786 ± 0.0
3.063AsnLys: 3.063 ± 0.0
3.404AsnLeu: 3.404 ± 0.0
0.0AsnMet: 0.0 ± 0.0
3.404AsnAsn: 3.404 ± 0.0
2.723AsnPro: 2.723 ± 0.0
1.361AsnGln: 1.361 ± 0.0
2.383AsnArg: 2.383 ± 0.0
4.084AsnSer: 4.084 ± 0.0
1.021AsnThr: 1.021 ± 0.0
3.063AsnVal: 3.063 ± 0.0
0.681AsnTrp: 0.681 ± 0.0
2.383AsnTyr: 2.383 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.042ProAla: 2.042 ± 0.0
1.021ProCys: 1.021 ± 0.0
2.723ProAsp: 2.723 ± 0.0
2.723ProGlu: 2.723 ± 0.0
3.404ProPhe: 3.404 ± 0.0
2.042ProGly: 2.042 ± 0.0
0.34ProHis: 0.34 ± 0.0
3.063ProIle: 3.063 ± 0.0
3.744ProLys: 3.744 ± 0.0
6.807ProLeu: 6.807 ± 0.0
0.34ProMet: 0.34 ± 0.0
1.702ProAsn: 1.702 ± 0.0
1.702ProPro: 1.702 ± 0.0
2.723ProGln: 2.723 ± 0.0
2.042ProArg: 2.042 ± 0.0
2.042ProSer: 2.042 ± 0.0
4.765ProThr: 4.765 ± 0.0
2.042ProVal: 2.042 ± 0.0
0.34ProTrp: 0.34 ± 0.0
1.702ProTyr: 1.702 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.702GlnAla: 1.702 ± 0.0
1.021GlnCys: 1.021 ± 0.0
2.723GlnAsp: 2.723 ± 0.0
3.404GlnGlu: 3.404 ± 0.0
1.361GlnPhe: 1.361 ± 0.0
3.744GlnGly: 3.744 ± 0.0
0.681GlnHis: 0.681 ± 0.0
2.723GlnIle: 2.723 ± 0.0
2.723GlnLys: 2.723 ± 0.0
2.042GlnLeu: 2.042 ± 0.0
0.34GlnMet: 0.34 ± 0.0
2.723GlnAsn: 2.723 ± 0.0
0.34GlnPro: 0.34 ± 0.0
0.0GlnGln: 0.0 ± 0.0
3.063GlnArg: 3.063 ± 0.0
1.702GlnSer: 1.702 ± 0.0
1.702GlnThr: 1.702 ± 0.0
2.042GlnVal: 2.042 ± 0.0
0.34GlnTrp: 0.34 ± 0.0
0.681GlnTyr: 0.681 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.042ArgAla: 2.042 ± 0.0
0.34ArgCys: 0.34 ± 0.0
1.021ArgAsp: 1.021 ± 0.0
3.063ArgGlu: 3.063 ± 0.0
3.744ArgPhe: 3.744 ± 0.0
4.084ArgGly: 4.084 ± 0.0
1.361ArgHis: 1.361 ± 0.0
3.744ArgIle: 3.744 ± 0.0
2.042ArgLys: 2.042 ± 0.0
4.084ArgLeu: 4.084 ± 0.0
0.34ArgMet: 0.34 ± 0.0
1.361ArgAsn: 1.361 ± 0.0
1.361ArgPro: 1.361 ± 0.0
2.042ArgGln: 2.042 ± 0.0
3.404ArgArg: 3.404 ± 0.0
3.063ArgSer: 3.063 ± 0.0
2.042ArgThr: 2.042 ± 0.0
1.702ArgVal: 1.702 ± 0.0
0.34ArgTrp: 0.34 ± 0.0
2.042ArgTyr: 2.042 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
2.383SerAla: 2.383 ± 0.0
1.361SerCys: 1.361 ± 0.0
2.723SerAsp: 2.723 ± 0.0
3.063SerGlu: 3.063 ± 0.0
6.467SerPhe: 6.467 ± 0.0
4.765SerGly: 4.765 ± 0.0
1.361SerHis: 1.361 ± 0.0
6.467SerIle: 6.467 ± 0.0
5.106SerLys: 5.106 ± 0.0
6.467SerLeu: 6.467 ± 0.0
0.681SerMet: 0.681 ± 0.0
6.467SerAsn: 6.467 ± 0.0
2.723SerPro: 2.723 ± 0.0
3.744SerGln: 3.744 ± 0.0
3.063SerArg: 3.063 ± 0.0
8.169SerSer: 8.169 ± 0.0
5.786SerThr: 5.786 ± 0.0
4.765SerVal: 4.765 ± 0.0
0.681SerTrp: 0.681 ± 0.0
3.063SerTyr: 3.063 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
2.383ThrAla: 2.383 ± 0.0
1.361ThrCys: 1.361 ± 0.0
2.042ThrAsp: 2.042 ± 0.0
3.404ThrGlu: 3.404 ± 0.0
3.744ThrPhe: 3.744 ± 0.0
3.063ThrGly: 3.063 ± 0.0
1.361ThrHis: 1.361 ± 0.0
3.404ThrIle: 3.404 ± 0.0
3.063ThrLys: 3.063 ± 0.0
5.786ThrLeu: 5.786 ± 0.0
1.021ThrMet: 1.021 ± 0.0
2.383ThrAsn: 2.383 ± 0.0
5.446ThrPro: 5.446 ± 0.0
1.361ThrGln: 1.361 ± 0.0
1.021ThrArg: 1.021 ± 0.0
4.425ThrSer: 4.425 ± 0.0
2.383ThrThr: 2.383 ± 0.0
4.084ThrVal: 4.084 ± 0.0
0.34ThrTrp: 0.34 ± 0.0
2.042ThrTyr: 2.042 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
3.744ValAla: 3.744 ± 0.0
1.702ValCys: 1.702 ± 0.0
4.084ValAsp: 4.084 ± 0.0
4.084ValGlu: 4.084 ± 0.0
1.702ValPhe: 1.702 ± 0.0
3.744ValGly: 3.744 ± 0.0
2.042ValHis: 2.042 ± 0.0
4.765ValIle: 4.765 ± 0.0
6.127ValLys: 6.127 ± 0.0
4.765ValLeu: 4.765 ± 0.0
1.361ValMet: 1.361 ± 0.0
3.404ValAsn: 3.404 ± 0.0
3.744ValPro: 3.744 ± 0.0
3.063ValGln: 3.063 ± 0.0
2.042ValArg: 2.042 ± 0.0
7.148ValSer: 7.148 ± 0.0
3.744ValThr: 3.744 ± 0.0
5.446ValVal: 5.446 ± 0.0
1.021ValTrp: 1.021 ± 0.0
1.702ValTyr: 1.702 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.021TrpAsp: 1.021 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.681TrpPhe: 0.681 ± 0.0
0.34TrpGly: 0.34 ± 0.0
0.34TrpHis: 0.34 ± 0.0
1.361TrpIle: 1.361 ± 0.0
0.34TrpLys: 0.34 ± 0.0
1.021TrpLeu: 1.021 ± 0.0
0.34TrpMet: 0.34 ± 0.0
0.34TrpAsn: 0.34 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.34TrpGln: 0.34 ± 0.0
0.34TrpArg: 0.34 ± 0.0
1.702TrpSer: 1.702 ± 0.0
1.021TrpThr: 1.021 ± 0.0
2.723TrpVal: 2.723 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.34TrpTyr: 0.34 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.383TyrAla: 2.383 ± 0.0
0.0TyrCys: 0.0 ± 0.0
4.425TyrAsp: 4.425 ± 0.0
2.042TyrGlu: 2.042 ± 0.0
2.042TyrPhe: 2.042 ± 0.0
3.404TyrGly: 3.404 ± 0.0
2.383TyrHis: 2.383 ± 0.0
2.042TyrIle: 2.042 ± 0.0
2.383TyrLys: 2.383 ± 0.0
3.404TyrLeu: 3.404 ± 0.0
1.361TyrMet: 1.361 ± 0.0
2.723TyrAsn: 2.723 ± 0.0
1.702TyrPro: 1.702 ± 0.0
1.361TyrGln: 1.361 ± 0.0
2.042TyrArg: 2.042 ± 0.0
3.063TyrSer: 3.063 ± 0.0
2.383TyrThr: 2.383 ± 0.0
1.702TyrVal: 1.702 ± 0.0
1.361TyrTrp: 1.361 ± 0.0
1.361TyrTyr: 1.361 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2939 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski