Amino acid dipeptide frequency for Wenzhou bivalvia virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
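As a minimal sketch of what such a calculation could look like, the snippet below counts overlapping residue pairs in a single sequence and reports them in per mille. The function name, the one-letter input alphabet and the use of overlapping pairs are assumptions made for illustration; the page does not state how the database itself counts dipeptides.

```python
from collections import Counter


def dipeptide_per_mille(sequence: str) -> dict[str, float]:
    """Return overlapping dipeptide frequencies in per mille.

    Assumption: dipeptides are counted as overlapping pairs, so a
    sequence of length n contributes n - 1 dipeptides.
    """
    pairs = [sequence[i:i + 2] for i in range(len(sequence) - 1)]
    if not pairs:
        return {}
    counts = Counter(pairs)
    total = len(pairs)
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequence (hypothetical, not taken from the proteome above).
freqs = dipeptide_per_mille("MKLLAAG")
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The toy sequence has six overlapping pairs, each occurring once, so every pair gets the same share of 1000.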

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
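The uniform baseline described above can be sketched as follows. The function names are hypothetical, and the simple greater-than or less-than comparison is an assumption: the page does not state the exact threshold used for the red and blue marking.

```python
def uniform_expectation_per_mille(alphabet_size: int = 20) -> float:
    """Expected per mille frequency if all dipeptides were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(value_per_mille: float, alphabet_size: int = 20) -> str:
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    expected = uniform_expectation_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


print(uniform_expectation_per_mille(20))  # 400 possible dipeptides
print(uniform_expectation_per_mille(21))  # 441 possible dipeptides with Xaa
print(classify(9.42))   # LeuLeu value from the table
print(classify(0.589))  # AlaTrp value from the table
```

With 20 amino acids the baseline is 2.5‰, which matches the 0.25% quoted in the text.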

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
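For illustration, the unit conversion amounts to the following. The helper names are hypothetical and only show the arithmetic.

```python
def per_mille_to_fraction(value: float) -> float:
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value: float) -> float:
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 4.71 per mille in the table.
print(per_mille_to_fraction(4.71))
print(per_mille_to_percent(4.71))
```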

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of each dipeptide, columns the second. Every value in the table has an estimated error of ± 0.0.

      Ala   Cys   Asp   Glu   Phe   Gly   His   Ile   Lys   Leu   Met   Asn   Pro   Gln   Arg   Ser   Thr   Val   Trp   Tyr   Xaa
Ala  4.71 1.472 4.121 3.238 3.533 5.593 1.472 3.533 2.649 7.359 2.649 3.238 3.533 2.061 2.944  4.71  4.71 5.299 0.589 1.472   0.0
Cys 0.883   0.0 0.589 0.294 1.178 1.472   0.0 0.294 0.883 1.472 0.589 1.472 0.883 0.883 0.589 1.472 1.178 0.883   0.0 0.589   0.0
Asp 2.355 0.589 4.121 2.944 2.649 3.533 0.294  4.71 2.944 4.416 2.355 1.472 2.355 3.533 3.238 6.182 3.533 3.533 1.178 2.649   0.0
Glu 3.238 0.294 2.355  4.71 3.238 4.416 0.294 2.649 2.944 7.654 1.472 1.766 2.355 3.238 2.061 4.416 1.766 5.299   0.0 1.178   0.0
Phe 3.827 0.294 3.827 1.766 1.766 3.827 0.294 3.533 1.766 5.004 1.178 0.883 1.178 0.883 2.061 3.533 3.827 3.238 1.472 1.472   0.0
Gly 6.476 0.294  4.71 5.004 2.649 4.416 1.178 4.121 3.827 6.476 1.472 3.827 2.061 2.355 3.827 5.299 5.888 5.299 1.178 2.649   0.0
His 1.178 0.294 0.883 1.472   0.0 0.589 0.294 0.883 0.883 1.472 0.294 0.294 0.589   0.0 0.294 2.061 1.178 0.294 0.294 0.883   0.0
Ile 3.533 1.178 6.476 2.355 2.944  4.71 1.472 3.533 1.472 2.355 0.589 2.355 3.238 2.355 2.944 5.593 3.533 3.533 0.589 1.472   0.0
Lys 4.121 0.883 3.827 1.766  4.71  4.71 1.472 2.944 2.061 3.533 1.178 3.238 2.649 1.766 1.178 3.827 2.944 3.827   0.0 2.944   0.0
Leu 5.299 2.061 3.827 6.771 2.355  4.71 1.766 3.827 5.299  9.42 2.944  4.71 3.533 5.593  4.71 4.121 7.359 6.182   0.0 1.766   0.0
Met 2.944 0.294 0.883 1.178 0.589 2.944   0.0 0.883 1.178 2.355 1.178 0.883 1.472 1.178 2.355 2.355 2.649 1.178   0.0 0.883   0.0
Asn 2.355 0.294 1.766 1.178 1.178 3.827   0.0 2.944 2.649 3.533 1.472 0.589 4.416 1.178 2.944 3.238 3.533 3.238 1.472 2.649   0.0
Pro 2.355 0.294 2.649 3.827 2.061 3.827 0.294 3.238 2.355 4.416 1.766 2.355 1.472 1.178 1.766 4.416 2.944 5.299 0.294 2.061   0.0
Gln 2.649 1.766 1.472 2.061 2.355 3.533 0.589 2.649 3.533 2.061 0.883 1.472 0.883 2.944 1.178 3.827 1.178 4.416   0.0 1.472   0.0
Arg 3.533 0.883 2.355 2.649 2.355 2.649 0.294 2.944 2.355 3.238 1.178 1.766 1.472 2.061 2.649 2.944 4.121 5.299   0.0 1.766   0.0
Ser 7.065 0.883 4.121 5.299 5.299 7.065 2.649 3.827 7.065 5.004 0.294 2.944 5.004 2.061 4.121 6.182 3.533 3.533 0.883 4.121   0.0
Thr 5.593 1.178 2.649 3.238 2.944 4.121 0.589 3.238 3.238  4.71 1.766 3.827 3.238 2.355 2.944 6.476 6.476 5.004 0.294 2.061   0.0
Val 3.827 1.178 4.121 4.121 2.355  4.71 1.178  4.71  4.71 7.359 1.178 5.004 6.182 3.827 2.944 6.182 2.944 5.004 1.472 0.883   0.0
Trp 0.883 0.294 0.883   0.0 0.589   0.0 0.294 0.589 0.589 1.178 0.294   0.0 0.294   0.0 0.883 0.589 0.589 1.178   0.0 0.589   0.0
Tyr 2.061 1.178 2.944 1.472 0.883 2.355   0.0 1.178 1.178 3.238 2.355 2.355 2.061 1.472 0.883 3.533 2.061 2.355   0.0 0.589   0.0
Xaa   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0

Statistics based on 1 protein (3398 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
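A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the database's actual procedure: the function names, the overlapping-pair counting, the pooling of counts across proteins and the use of the population standard deviation as the error are all assumptions.

```python
import random
import statistics


def pair_per_mille(proteins: list[str], pair: str) -> float:
    """Per mille frequency of one dipeptide pooled over all proteins."""
    count = 0
    total = 0
    for seq in proteins:
        total += max(len(seq) - 1, 0)
        count += sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins: list[str], pair: str,
                    rounds: int = 100, seed: int = 0) -> float:
    """Spread of the estimate over resampled protein sets.

    Each round draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# With a single protein every resample is identical, so the spread is zero.
print(bootstrap_error(["MKLLAAG"], "LL"))
```

Under these assumptions a proteome with only one protein always yields an error of 0.0, because resampling one item with replacement can only return that same item.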

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski