Amino acid dipeptide frequency for Gastropod associated circular ssDNA virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
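As an illustration of how such frequencies could be obtained, here is a minimal Python sketch. It assumes overlapping residue pairs are counted within each protein (never across protein boundaries) and that any non-standard letter is mapped to X (Xaa). The function name `dipeptide_permille` and the toy sequences are hypothetical; the exact counting procedure used by Proteome-pI is not stated on this page.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2, never spanning two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Toy input (not the real proteome): "B" is non-standard and is counted as X.
toy = ["MKKA", "AKKB"]
freqs = dipeptide_permille(toy)

# Uniform expectation over the 400 standard dipeptides, in per mille.
uniform_expectation = 1000.0 / 400
```

With the toy input, six pairs are counted in total and KK occurs twice, so its frequency is one third of all pairs, far above the uniform expectation of 2.5‰.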

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
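The unit conversion and the comparison with the random expectation can be sketched as follows. The helper `describe` is hypothetical, and the baseline is assumed to be the uniform 2.5‰ mentioned in the text; the page may use a different rule for its colouring.

```python
# Assumed baseline: uniform expectation over 400 dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400

def describe(permille):
    """Convert a per mille value to a fraction and compare it with the baseline."""
    fraction = permille * 1e-3              # per mille -> fraction
    ratio = permille / EXPECTED_PERMILLE    # fold difference versus the baseline
    if permille > EXPECTED_PERMILLE:
        label = "over"
    elif permille < EXPECTED_PERMILLE:
        label = "under"
    else:
        label = "as expected"
    return fraction, ratio, label

# GlyThr is listed at 16.447 per mille in the table.
gly_thr = describe(16.447)
# AlaCys is listed at 1.645 per mille.
ala_cys = describe(1.645)
```

Under this assumption GlyThr (16.447‰, a fraction of about 0.0164) is overrepresented by a factor of roughly 6.6, while AlaCys (1.645‰) falls below the baseline.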

For more information, see the sequence space article on Wikipedia.

Frequency ± error (‰), grouped by first residue. Second-residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 1.645 ± 1.157
AlaAsp: 8.224 ± 0.942
AlaGlu: 4.934 ± 1.372
AlaPhe: 3.289 ± 2.313
AlaGly: 4.934 ± 3.47
AlaHis: 0.0 ± 0.0
AlaIle: 4.934 ± 1.372
AlaLys: 8.224 ± 0.942
AlaLeu: 4.934 ± 3.47
AlaMet: 1.645 ± 1.157
AlaAsn: 3.289 ± 0.107
AlaPro: 1.645 ± 1.157
AlaGln: 6.579 ± 0.215
AlaArg: 3.289 ± 0.107
AlaSer: 6.579 ± 2.206
AlaThr: 1.645 ± 1.157
AlaVal: 0.0 ± 0.0
AlaTrp: 4.934 ± 3.792
AlaTyr: 3.289 ± 2.313
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 1.645 ± 1.157
CysHis: 1.645 ± 1.264
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 3.289 ± 0.107
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 1.645 ± 1.264
CysArg: 3.289 ± 2.528
CysSer: 0.0 ± 0.0
CysThr: 1.645 ± 1.264
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 11.513 ± 5.675
AspCys: 0.0 ± 0.0
AspAsp: 0.0 ± 0.0
AspGlu: 1.645 ± 1.157
AspPhe: 3.289 ± 2.313
AspGly: 8.224 ± 1.479
AspHis: 0.0 ± 0.0
AspIle: 1.645 ± 1.264
AspLys: 4.934 ± 1.049
AspLeu: 1.645 ± 1.264
AspMet: 1.645 ± 1.264
AspAsn: 1.645 ± 1.157
AspPro: 1.645 ± 1.264
AspGln: 0.0 ± 0.0
AspArg: 6.579 ± 2.636
AspSer: 4.934 ± 1.372
AspThr: 3.289 ± 2.313
AspVal: 4.934 ± 1.372
AspTrp: 0.0 ± 0.0
AspTyr: 3.289 ± 0.107
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.0 ± 0.0
GluCys: 3.289 ± 2.528
GluAsp: 3.289 ± 0.107
GluGlu: 4.934 ± 3.792
GluPhe: 1.645 ± 1.264
GluGly: 9.868 ± 2.743
GluHis: 0.0 ± 0.0
GluIle: 0.0 ± 0.0
GluLys: 1.645 ± 1.157
GluLeu: 1.645 ± 1.157
GluMet: 1.645 ± 1.264
GluAsn: 4.934 ± 1.049
GluPro: 3.289 ± 2.313
GluGln: 4.934 ± 1.372
GluArg: 0.0 ± 0.0
GluSer: 3.289 ± 0.107
GluThr: 4.934 ± 1.372
GluVal: 6.579 ± 0.215
GluTrp: 1.645 ± 1.264
GluTyr: 3.289 ± 2.528
GluXaa: 0.0 ± 0.0
Phe
PheAla: 6.579 ± 0.215
PheCys: 0.0 ± 0.0
PheAsp: 4.934 ± 1.049
PheGlu: 1.645 ± 1.264
PhePhe: 1.645 ± 1.264
PheGly: 3.289 ± 2.313
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 1.645 ± 1.264
PheLeu: 1.645 ± 1.264
PheMet: 0.0 ± 0.0
PheAsn: 1.645 ± 1.264
PhePro: 4.934 ± 3.47
PheGln: 1.645 ± 1.157
PheArg: 3.289 ± 0.107
PheSer: 1.645 ± 1.157
PheThr: 3.289 ± 0.107
PheVal: 1.645 ± 1.264
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.645 ± 1.157
GlyCys: 0.0 ± 0.0
GlyAsp: 6.579 ± 0.215
GlyGlu: 6.579 ± 2.636
GlyPhe: 4.934 ± 3.792
GlyGly: 6.579 ± 0.215
GlyHis: 1.645 ± 1.264
GlyIle: 1.645 ± 1.157
GlyLys: 6.579 ± 5.056
GlyLeu: 6.579 ± 0.215
GlyMet: 3.289 ± 2.941
GlyAsn: 3.289 ± 2.313
GlyPro: 6.579 ± 2.206
GlyGln: 3.289 ± 2.313
GlyArg: 3.289 ± 0.107
GlySer: 4.934 ± 1.049
GlyThr: 16.447 ± 0.537
GlyVal: 3.289 ± 0.107
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.289 ± 2.313
GlyXaa: 0.0 ± 0.0
His
HisAla: 6.579 ± 2.206
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 1.645 ± 1.264
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 6.579 ± 2.636
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 3.289 ± 2.528
HisGln: 0.0 ± 0.0
HisArg: 1.645 ± 1.157
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 1.645 ± 1.264
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.289 ± 0.107
IleCys: 0.0 ± 0.0
IleAsp: 6.579 ± 0.215
IleGlu: 4.934 ± 1.049
IlePhe: 0.0 ± 0.0
IleGly: 8.224 ± 0.942
IleHis: 0.0 ± 0.0
IleIle: 1.645 ± 1.264
IleLys: 0.0 ± 0.0
IleLeu: 3.289 ± 2.528
IleMet: 0.0 ± 0.0
IleAsn: 0.0 ± 0.0
IlePro: 0.0 ± 0.0
IleGln: 1.645 ± 1.157
IleArg: 1.645 ± 1.264
IleSer: 0.0 ± 0.0
IleThr: 3.289 ± 0.107
IleVal: 6.579 ± 2.636
IleTrp: 1.645 ± 1.157
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.934 ± 1.372
LysCys: 0.0 ± 0.0
LysAsp: 0.0 ± 0.0
LysGlu: 0.0 ± 0.0
LysPhe: 3.289 ± 2.528
LysGly: 6.579 ± 2.206
LysHis: 1.645 ± 1.264
LysIle: 0.0 ± 0.0
LysLys: 4.934 ± 1.372
LysLeu: 1.645 ± 1.157
LysMet: 0.0 ± 0.0
LysAsn: 1.645 ± 1.264
LysPro: 0.0 ± 0.0
LysGln: 6.579 ± 2.206
LysArg: 4.934 ± 1.049
LysSer: 0.0 ± 0.0
LysThr: 4.934 ± 1.372
LysVal: 4.934 ± 1.049
LysTrp: 1.645 ± 1.264
LysTyr: 3.289 ± 0.107
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.645 ± 1.157
LeuCys: 0.0 ± 0.0
LeuAsp: 3.289 ± 2.313
LeuGlu: 1.645 ± 1.264
LeuPhe: 0.0 ± 0.0
LeuGly: 6.579 ± 2.206
LeuHis: 0.0 ± 0.0
LeuIle: 6.579 ± 2.206
LeuLys: 3.289 ± 2.528
LeuLeu: 6.579 ± 5.056
LeuMet: 1.645 ± 1.157
LeuAsn: 8.224 ± 1.479
LeuPro: 3.289 ± 0.107
LeuGln: 3.289 ± 0.107
LeuArg: 3.289 ± 2.528
LeuSer: 9.868 ± 4.519
LeuThr: 6.579 ± 2.636
LeuVal: 0.0 ± 0.0
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.289 ± 0.107
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.934 ± 1.372
MetCys: 0.0 ± 0.0
MetAsp: 1.645 ± 1.157
MetGlu: 0.0 ± 0.0
MetPhe: 1.645 ± 1.157
MetGly: 1.645 ± 1.157
MetHis: 0.0 ± 0.0
MetIle: 1.645 ± 1.264
MetLys: 0.0 ± 0.0
MetLeu: 1.645 ± 1.264
MetMet: 0.0 ± 0.0
MetAsn: 1.645 ± 1.157
MetPro: 4.934 ± 3.47
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 0.0 ± 0.0
MetThr: 3.289 ± 2.313
MetVal: 1.645 ± 1.157
MetTrp: 1.645 ± 1.157
MetTyr: 1.645 ± 1.157
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.289 ± 0.107
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 3.289 ± 0.107
AsnPhe: 0.0 ± 0.0
AsnGly: 6.579 ± 0.215
AsnHis: 3.289 ± 2.313
AsnIle: 1.645 ± 1.157
AsnLys: 1.645 ± 1.157
AsnLeu: 3.289 ± 2.313
AsnMet: 3.289 ± 2.313
AsnAsn: 4.934 ± 1.372
AsnPro: 1.645 ± 1.264
AsnGln: 1.645 ± 1.157
AsnArg: 1.645 ± 1.264
AsnSer: 1.645 ± 1.157
AsnThr: 1.645 ± 1.264
AsnVal: 0.0 ± 0.0
AsnTrp: 4.934 ± 3.792
AsnTyr: 1.645 ± 1.157
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.934 ± 1.049
ProCys: 1.645 ± 1.157
ProAsp: 1.645 ± 1.264
ProGlu: 8.224 ± 0.942
ProPhe: 4.934 ± 1.049
ProGly: 0.0 ± 0.0
ProHis: 1.645 ± 1.264
ProIle: 0.0 ± 0.0
ProLys: 1.645 ± 1.157
ProLeu: 3.289 ± 0.107
ProMet: 0.0 ± 0.0
ProAsn: 0.0 ± 0.0
ProPro: 1.645 ± 1.157
ProGln: 3.289 ± 2.313
ProArg: 0.0 ± 0.0
ProSer: 1.645 ± 1.264
ProThr: 4.934 ± 3.792
ProVal: 1.645 ± 1.157
ProTrp: 0.0 ± 0.0
ProTyr: 3.289 ± 2.313
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.579 ± 2.206
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.645 ± 1.157
GlnPhe: 3.289 ± 2.313
GlnGly: 4.934 ± 1.372
GlnHis: 0.0 ± 0.0
GlnIle: 3.289 ± 2.313
GlnLys: 1.645 ± 1.157
GlnLeu: 6.579 ± 4.626
GlnMet: 3.289 ± 2.313
GlnAsn: 1.645 ± 1.157
GlnPro: 0.0 ± 0.0
GlnGln: 1.645 ± 1.157
GlnArg: 3.289 ± 0.107
GlnSer: 3.289 ± 0.107
GlnThr: 3.289 ± 2.313
GlnVal: 0.0 ± 0.0
GlnTrp: 3.289 ± 2.528
GlnTyr: 4.934 ± 3.792
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.224 ± 3.362
ArgCys: 3.289 ± 2.528
ArgAsp: 6.579 ± 2.636
ArgGlu: 1.645 ± 1.264
ArgPhe: 1.645 ± 1.264
ArgGly: 3.289 ± 2.528
ArgHis: 0.0 ± 0.0
ArgIle: 0.0 ± 0.0
ArgLys: 1.645 ± 1.264
ArgLeu: 1.645 ± 1.157
ArgMet: 3.289 ± 0.107
ArgAsn: 1.645 ± 1.157
ArgPro: 0.0 ± 0.0
ArgGln: 1.645 ± 1.157
ArgArg: 3.289 ± 2.528
ArgSer: 1.645 ± 1.264
ArgThr: 3.289 ± 0.107
ArgVal: 8.224 ± 3.9
ArgTrp: 1.645 ± 1.157
ArgTyr: 6.579 ± 5.056
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.289 ± 0.107
SerCys: 0.0 ± 0.0
SerAsp: 3.289 ± 0.107
SerGlu: 4.934 ± 1.049
SerPhe: 0.0 ± 0.0
SerGly: 3.289 ± 2.313
SerHis: 0.0 ± 0.0
SerIle: 3.289 ± 2.528
SerLys: 1.645 ± 1.157
SerLeu: 1.645 ± 1.157
SerMet: 1.645 ± 1.157
SerAsn: 1.645 ± 1.264
SerPro: 4.934 ± 1.049
SerGln: 3.289 ± 0.107
SerArg: 1.645 ± 1.157
SerSer: 0.0 ± 0.0
SerThr: 3.289 ± 2.313
SerVal: 3.289 ± 0.107
SerTrp: 0.0 ± 0.0
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.645 ± 1.157
ThrCys: 1.645 ± 1.264
ThrAsp: 4.934 ± 1.372
ThrGlu: 3.289 ± 2.528
ThrPhe: 4.934 ± 3.47
ThrGly: 8.224 ± 1.479
ThrHis: 1.645 ± 1.157
ThrIle: 6.579 ± 0.215
ThrLys: 4.934 ± 1.049
ThrLeu: 6.579 ± 2.636
ThrMet: 1.645 ± 1.157
ThrAsn: 3.289 ± 0.107
ThrPro: 4.934 ± 3.792
ThrGln: 3.289 ± 2.313
ThrArg: 1.645 ± 1.264
ThrSer: 0.0 ± 0.0
ThrThr: 8.224 ± 5.783
ThrVal: 6.579 ± 2.206
ThrTrp: 1.645 ± 1.157
ThrTyr: 3.289 ± 0.107
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.645 ± 1.264
ValCys: 0.0 ± 0.0
ValAsp: 1.645 ± 1.157
ValGlu: 8.224 ± 1.479
ValPhe: 3.289 ± 2.528
ValGly: 4.934 ± 1.372
ValHis: 4.934 ± 1.049
ValIle: 3.289 ± 0.107
ValLys: 1.645 ± 1.264
ValLeu: 3.289 ± 0.107
ValMet: 3.289 ± 0.768
ValAsn: 4.934 ± 1.049
ValPro: 0.0 ± 0.0
ValGln: 3.289 ± 0.107
ValArg: 3.289 ± 2.528
ValSer: 0.0 ± 0.0
ValThr: 1.645 ± 1.157
ValVal: 1.645 ± 1.157
ValTrp: 1.645 ± 1.157
ValTyr: 3.289 ± 0.107
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.645 ± 1.157
TrpCys: 0.0 ± 0.0
TrpAsp: 1.645 ± 1.264
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 3.289 ± 2.528
TrpHis: 0.0 ± 0.0
TrpIle: 1.645 ± 1.264
TrpLys: 1.645 ± 1.157
TrpLeu: 1.645 ± 1.264
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 1.645 ± 1.264
TrpGln: 1.645 ± 1.264
TrpArg: 4.934 ± 1.372
TrpSer: 0.0 ± 0.0
TrpThr: 1.645 ± 1.157
TrpVal: 1.645 ± 1.264
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.645 ± 1.157
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.645 ± 1.264
TyrCys: 1.645 ± 1.264
TyrAsp: 6.579 ± 2.206
TyrGlu: 3.289 ± 2.528
TyrPhe: 1.645 ± 1.157
TyrGly: 0.0 ± 0.0
TyrHis: 3.289 ± 2.528
TyrIle: 4.934 ± 1.372
TyrLys: 3.289 ± 0.107
TyrLeu: 1.645 ± 1.157
TyrMet: 0.0 ± 0.0
TyrAsn: 1.645 ± 1.157
TyrPro: 0.0 ± 0.0
TyrGln: 3.289 ± 2.313
TyrArg: 8.224 ± 1.479
TyrSer: 1.645 ± 1.157
TyrThr: 1.645 ± 1.264
TyrVal: 1.645 ± 1.157
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (609 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
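One plausible reading of bootstrapping at the protein level is sketched below: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption for illustration only; the function names, the toy sequences, the fixed seed and the use of the population standard deviation are not taken from the source.

```python
import random
import statistics
from collections import Counter

def permille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(permille(sample, dipeptide))
    return statistics.pstdev(estimates)

# Toy two-protein "proteome" (not the real sequences).
toy = ["MKKAGT", "AGTGTK"]
err = bootstrap_error(toy, "GT")
```

With only two proteins, each resample is one of a handful of combinations, which would explain why the same few error values recur throughout the list above. A dipeptide that never occurs gets an error of zero under this scheme.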

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski