Amino acid dipeptide frequency for Wenzhou shrimp virus 8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent amino acids.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
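The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it uses one-letter residue codes rather than the three-letter codes of the table, counts overlapping pairs within a single sequence, maps every non-standard character to X (Xaa), and compares against the uniform expectation of 1/400. The function names `dipeptide_permille` and `classify` are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"      # 20 standard residues (one-letter codes)
UNIFORM_EXPECTED_PERMILLE = 1000.0 / 400  # 0.25 % expressed in per mille


def dipeptide_permille(sequence):
    """Return {dipeptide: frequency in per mille} for all 441 pairs (incl. X)."""
    # Assumption: anything outside the 20 standard residues is treated as Xaa.
    seq = "".join(c if c in AMINO_ACIDS else "X" for c in sequence.upper())
    if len(seq) < 2:
        raise ValueError("need at least two residues to form a dipeptide")
    # Overlapping pairs: a sequence of length n yields n - 1 dipeptides.
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    counts = Counter(pairs)
    alphabet = AMINO_ACIDS + "X"
    return {a + b: 1000.0 * counts[a + b] / len(pairs)
            for a, b in product(alphabet, repeat=2)}


def classify(freq_permille, expected=UNIFORM_EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


freqs = dipeptide_permille("ACACA")  # toy sequence, not the viral protein
print(freqs["AC"], classify(freqs["AC"]))
```

Whether the red/blue marking on the page uses exactly this uniform threshold is an assumption based on the paragraph above.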

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
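As a small worked example of the unit conversion, the following sketch turns a per mille value into a plain fraction and into a percentage. The helper names are hypothetical; the sample value 6.075 is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 6.075  # AlaAla frequency in per mille, taken from the table
print(permille_to_fraction(ala_ala))  # about 0.006075
print(permille_to_percent(ala_ala))   # about 0.6075
```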

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Every value has an estimated error of ± 0.0.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  6.075  1.215  4.253   4.86  1.519  5.468  0.911  3.038  6.075  8.202  2.126  3.038  4.557  3.038  5.164  6.075  7.594  5.164  1.215  3.038    0.0
Cys  0.304    0.0  1.215  0.911  0.911  0.304  0.304  1.519  1.215  1.519  0.304    0.0  0.608  0.304  0.608  1.823  1.519  0.304    0.0  1.215    0.0
Asp  5.772  0.608  3.645  4.557  2.734  1.215  2.126  3.038  2.126  7.898  0.911  1.823  2.734  1.519  1.823  4.253  3.341  2.734  0.608  1.823    0.0
Glu  2.734    0.0  1.519  3.949  3.949  3.038   2.43  3.645  3.949  5.772  1.519  1.215  2.734  2.126  1.823  5.164  2.734  3.038  0.608  2.734    0.0
Phe   4.86  0.608   2.43  1.519  1.215  2.734  1.519  1.823  2.126   2.43  0.304  2.126  3.038  1.215  1.215  2.126  2.734  3.645    0.0  0.911    0.0
Gly  4.253  0.911  2.126  1.823  1.823  2.126  1.823  3.949  5.164   2.43  1.215  3.341  1.823   2.43  3.038  3.038  5.164  2.126    0.0  0.911    0.0
His  1.519  0.304  2.126  1.519  0.911  1.519  0.304   2.43  1.215  3.341    0.0  0.608   2.43  1.823  0.608  1.823   2.43  1.519  0.608  0.911    0.0
Ile  3.645  0.608  3.645  3.949   2.43   2.43  1.215  2.126   2.43  4.557  1.215  2.734  3.949  3.038   2.43  4.557  3.645  3.949    0.0  1.519    0.0
Lys  6.379  1.215  3.949  3.645  1.823  2.126  1.519  2.734  3.645  6.075    0.0  3.341  2.734  3.949  3.341  7.594  3.341  3.645  0.608  0.608    0.0
Leu  7.898  1.519  5.164  3.341  3.038  5.164  3.038  3.341   7.29  6.379  1.823  3.949  6.379  4.557  4.557  6.683  6.379  4.557  0.304  3.341    0.0
Met  0.911    0.0  0.911  0.911  0.304  0.608    0.0  1.519  1.519  0.911    0.0  0.608  2.126   2.43  0.911  2.126  1.823  0.911    0.0  0.304    0.0
Asn  2.734  0.608  1.823   2.43  1.823  4.557  0.911  1.519  1.519  5.468  1.823  0.911  2.734  0.911  1.519  3.645  3.038  0.608  0.608  0.911    0.0
Pro  6.987  1.519  3.949  5.468  1.823   2.43  0.911  4.253   2.43  5.164  0.911  1.823  3.949  3.341  2.734  4.557  3.038  3.949    0.0  1.215    0.0
Gln  2.734  0.304  2.126  1.519  2.126  3.341  1.823  3.038  3.038   2.43  2.126  2.734  3.341  3.038  2.126   4.86  3.038  3.038  0.304  0.608    0.0
Arg  3.949  1.519   2.43  2.126  2.126  1.823  1.823   2.43   2.43  3.949  0.608  1.215  2.734  2.126  4.253  3.341  3.949  4.253  2.126  2.126    0.0
Ser  8.505  1.215  4.253  6.987  2.126  4.253  3.038  4.557  6.683  6.075  0.911  3.645  3.949   2.43  6.683  8.202  5.772  3.645  0.911   2.43    0.0
Thr  5.164  1.215  3.341   2.43  3.949  3.341  1.215  3.949  3.949  6.987  1.823   2.43  5.772  4.253  4.253   7.29  6.379  3.949  0.608  0.608    0.0
Val  6.379  0.608  3.341  1.519   2.43  1.519  1.823  3.038  4.557   4.86  0.304   2.43  3.038  3.645  3.645  5.468  4.557  3.645    0.0  0.911    0.0
Trp  0.608  0.304    0.0  0.608  0.304    0.0  0.304  0.304  0.304  0.304    0.0  1.215    0.0  1.215    0.0  1.519  0.911  1.215  0.304    0.0    0.0
Tyr  1.215  0.608  3.038  0.608  1.215  1.519  0.911  1.823  0.608  3.949  0.608  0.911  1.823  0.304  0.911  2.126  1.519  1.823  0.608  0.608    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (3293 amino acids)
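The per mille values can be related back to raw dipeptide counts. The sketch below assumes that dipeptides are counted as overlapping pairs within each protein, so that one protein of 3293 residues yields 3293 - 1 = 3292 dipeptides; this denominator is an inference that happens to reproduce the tabulated values, not something the page states. The helper name `permille_to_count` is hypothetical.

```python
N_RESIDUES = 3293   # from the statistics line
N_PROTEINS = 1      # from the statistics line
# Assumption: each protein of length n contributes n - 1 overlapping dipeptides.
N_DIPEPTIDES = N_RESIDUES - N_PROTEINS


def permille_to_count(value_permille):
    """Recover the approximate raw count behind a per mille frequency."""
    return round(value_permille * N_DIPEPTIDES / 1000.0)


# Sample values taken from the table (first residue, second residue).
for name, value in [("AlaAla", 6.075), ("SerAla", 8.505), ("CysAla", 0.304)]:
    print(name, permille_to_count(value))
```

Under this assumption the smallest non-zero table entry, 0.304‰, corresponds to a single occurrence.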

Note: The error has been estimated by bootstrapping (×100) at the protein level.
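A protein-level bootstrap can be sketched as follows. This is not the Proteome-pI implementation: it assumes that whole proteins are resampled with replacement 100 times, that the dipeptide frequency is recomputed for each resample, and that the reported error is the standard deviation of those estimates. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation (per mille) of a dipeptide frequency over resamples."""
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# With a single protein every resample is identical, so the spread is zero.
print(bootstrap_error(["ACACA"], "AC"))
```

Under these assumptions, a proteome with only one protein always gives an error of zero, which is consistent with the ± 0.0 reported for every dipeptide here.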

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski