Amino acid dipepetide frequency for Mongoose feces-associated gemycircularvirus c

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.279AlaAla: 3.279 ± 2.147
0.0AlaCys: 0.0 ± 0.0
8.197AlaAsp: 8.197 ± 1.081
3.279AlaGlu: 3.279 ± 0.004
1.639AlaPhe: 1.639 ± 1.07
8.197AlaGly: 8.197 ± 3.224
1.639AlaHis: 1.639 ± 1.07
3.279AlaIle: 3.279 ± 0.004
1.639AlaLys: 1.639 ± 1.074
1.639AlaLeu: 1.639 ± 1.074
0.0AlaMet: 0.0 ± 0.0
3.279AlaAsn: 3.279 ± 0.004
1.639AlaPro: 1.639 ± 1.074
1.639AlaGln: 1.639 ± 1.074
11.475AlaArg: 11.475 ± 3.228
4.918AlaSer: 4.918 ± 1.077
1.639AlaThr: 1.639 ± 1.07
4.918AlaVal: 4.918 ± 1.066
1.639AlaTrp: 1.639 ± 1.07
3.279AlaTyr: 3.279 ± 0.004
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
3.279CysGly: 3.279 ± 2.14
0.0CysHis: 0.0 ± 0.0
6.557CysIle: 6.557 ± 2.136
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.639CysPro: 1.639 ± 1.074
0.0CysGln: 0.0 ± 0.0
1.639CysArg: 1.639 ± 1.074
1.639CysSer: 1.639 ± 1.07
1.639CysThr: 1.639 ± 1.07
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.639CysTyr: 1.639 ± 1.074
0.0CysXaa: 0.0 ± 0.0
Asp
4.918AspAla: 4.918 ± 1.066
3.279AspCys: 3.279 ± 2.14
3.279AspAsp: 3.279 ± 0.004
3.279AspGlu: 3.279 ± 2.147
1.639AspPhe: 1.639 ± 1.07
6.557AspGly: 6.557 ± 0.007
3.279AspHis: 3.279 ± 2.14
3.279AspIle: 3.279 ± 2.14
0.0AspLys: 0.0 ± 0.0
1.639AspLeu: 1.639 ± 1.074
1.639AspMet: 1.639 ± 1.074
4.918AspAsn: 4.918 ± 3.221
1.639AspPro: 1.639 ± 1.07
1.639AspGln: 1.639 ± 1.07
4.918AspArg: 4.918 ± 1.066
0.0AspSer: 0.0 ± 0.0
3.279AspThr: 3.279 ± 2.147
4.918AspVal: 4.918 ± 3.21
3.279AspTrp: 3.279 ± 0.004
3.279AspTyr: 3.279 ± 0.004
0.0AspXaa: 0.0 ± 0.0
Glu
3.279GluAla: 3.279 ± 0.004
0.0GluCys: 0.0 ± 0.0
1.639GluAsp: 1.639 ± 1.07
3.279GluGlu: 3.279 ± 2.14
4.918GluPhe: 4.918 ± 1.066
6.557GluGly: 6.557 ± 2.151
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
1.639GluLys: 1.639 ± 1.07
1.639GluLeu: 1.639 ± 1.07
0.0GluMet: 0.0 ± 0.0
0.0GluAsn: 0.0 ± 0.0
1.639GluPro: 1.639 ± 1.07
1.639GluGln: 1.639 ± 1.07
3.279GluArg: 3.279 ± 0.004
1.639GluSer: 1.639 ± 1.074
3.279GluThr: 3.279 ± 2.147
1.639GluVal: 1.639 ± 1.074
1.639GluTrp: 1.639 ± 1.07
1.639GluTyr: 1.639 ± 1.07
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
1.639PheCys: 1.639 ± 1.074
3.279PheAsp: 3.279 ± 2.14
1.639PheGlu: 1.639 ± 1.074
1.639PhePhe: 1.639 ± 1.07
4.918PheGly: 4.918 ± 1.066
1.639PheHis: 1.639 ± 1.07
0.0PheIle: 0.0 ± 0.0
1.639PheLys: 1.639 ± 1.07
3.279PheLeu: 3.279 ± 0.004
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
3.279PhePro: 3.279 ± 0.004
1.639PheGln: 1.639 ± 1.07
6.557PheArg: 6.557 ± 0.007
8.197PheSer: 8.197 ± 1.063
4.918PheThr: 4.918 ± 1.077
3.279PheVal: 3.279 ± 0.004
0.0PheTrp: 0.0 ± 0.0
1.639PheTyr: 1.639 ± 1.07
0.0PheXaa: 0.0 ± 0.0
Gly
1.639GlyAla: 1.639 ± 1.074
0.0GlyCys: 0.0 ± 0.0
9.836GlyAsp: 9.836 ± 2.133
1.639GlyGlu: 1.639 ± 1.07
4.918GlyPhe: 4.918 ± 1.066
6.557GlyGly: 6.557 ± 4.28
0.0GlyHis: 0.0 ± 0.0
1.639GlyIle: 1.639 ± 1.074
6.557GlyLys: 6.557 ± 2.136
8.197GlyLeu: 8.197 ± 1.063
1.639GlyMet: 1.639 ± 1.074
3.279GlyAsn: 3.279 ± 0.004
3.279GlyPro: 3.279 ± 0.004
4.918GlyGln: 4.918 ± 1.066
6.557GlyArg: 6.557 ± 2.151
13.115GlySer: 13.115 ± 0.014
9.836GlyThr: 9.836 ± 4.298
4.918GlyVal: 4.918 ± 3.221
3.279GlyTrp: 3.279 ± 0.004
3.279GlyTyr: 3.279 ± 0.004
0.0GlyXaa: 0.0 ± 0.0
His
8.197HisAla: 8.197 ± 1.081
0.0HisCys: 0.0 ± 0.0
1.639HisAsp: 1.639 ± 1.07
1.639HisGlu: 1.639 ± 1.074
0.0HisPhe: 0.0 ± 0.0
1.639HisGly: 1.639 ± 1.07
0.0HisHis: 0.0 ± 0.0
1.639HisIle: 1.639 ± 1.07
0.0HisLys: 0.0 ± 0.0
3.279HisLeu: 3.279 ± 2.14
0.0HisMet: 0.0 ± 0.0
1.639HisAsn: 1.639 ± 1.07
1.639HisPro: 1.639 ± 1.07
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
3.279HisSer: 3.279 ± 2.14
1.639HisThr: 1.639 ± 1.07
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.918IleAla: 4.918 ± 1.077
0.0IleCys: 0.0 ± 0.0
1.639IleAsp: 1.639 ± 1.07
1.639IleGlu: 1.639 ± 1.074
1.639IlePhe: 1.639 ± 1.07
3.279IleGly: 3.279 ± 0.004
1.639IleHis: 1.639 ± 1.074
3.279IleIle: 3.279 ± 2.14
1.639IleLys: 1.639 ± 1.074
1.639IleLeu: 1.639 ± 1.07
0.0IleMet: 0.0 ± 0.0
1.639IleAsn: 1.639 ± 1.074
0.0IlePro: 0.0 ± 0.0
1.639IleGln: 1.639 ± 1.07
3.279IleArg: 3.279 ± 0.004
3.279IleSer: 3.279 ± 2.14
0.0IleThr: 0.0 ± 0.0
9.836IleVal: 9.836 ± 0.011
1.639IleTrp: 1.639 ± 1.07
3.279IleTyr: 3.279 ± 0.004
0.0IleXaa: 0.0 ± 0.0
Lys
0.0LysAla: 0.0 ± 0.0
1.639LysCys: 1.639 ± 1.074
1.639LysAsp: 1.639 ± 1.07
0.0LysGlu: 0.0 ± 0.0
3.279LysPhe: 3.279 ± 2.14
3.279LysGly: 3.279 ± 2.147
3.279LysHis: 3.279 ± 2.14
0.0LysIle: 0.0 ± 0.0
4.918LysLys: 4.918 ± 1.066
1.639LysLeu: 1.639 ± 1.074
0.0LysMet: 0.0 ± 0.0
3.279LysAsn: 3.279 ± 0.004
3.279LysPro: 3.279 ± 0.004
0.0LysGln: 0.0 ± 0.0
3.279LysArg: 3.279 ± 2.147
1.639LysSer: 1.639 ± 1.07
3.279LysThr: 3.279 ± 0.004
1.639LysVal: 1.639 ± 1.074
0.0LysTrp: 0.0 ± 0.0
1.639LysTyr: 1.639 ± 1.07
0.0LysXaa: 0.0 ± 0.0
Leu
1.639LeuAla: 1.639 ± 1.07
0.0LeuCys: 0.0 ± 0.0
6.557LeuAsp: 6.557 ± 4.294
4.918LeuGlu: 4.918 ± 3.21
11.475LeuPhe: 11.475 ± 1.084
4.918LeuGly: 4.918 ± 3.21
6.557LeuHis: 6.557 ± 4.28
1.639LeuIle: 1.639 ± 1.07
0.0LeuLys: 0.0 ± 0.0
1.639LeuLeu: 1.639 ± 1.074
0.0LeuMet: 0.0 ± 0.0
3.279LeuAsn: 3.279 ± 0.004
1.639LeuPro: 1.639 ± 1.074
1.639LeuGln: 1.639 ± 1.074
6.557LeuArg: 6.557 ± 4.28
3.279LeuSer: 3.279 ± 2.14
1.639LeuThr: 1.639 ± 1.074
0.0LeuVal: 0.0 ± 0.0
3.279LeuTrp: 3.279 ± 2.147
1.639LeuTyr: 1.639 ± 1.07
0.0LeuXaa: 0.0 ± 0.0
Met
1.639MetAla: 1.639 ± 1.074
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
1.639MetPhe: 1.639 ± 1.074
4.918MetGly: 4.918 ± 1.077
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
0.0MetLeu: 0.0 ± 0.0
1.639MetMet: 1.639 ± 1.074
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
1.639MetSer: 1.639 ± 1.07
0.0MetThr: 0.0 ± 0.0
1.639MetVal: 1.639 ± 1.07
1.639MetTrp: 1.639 ± 1.07
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.279AsnAla: 3.279 ± 2.14
1.639AsnCys: 1.639 ± 1.07
1.639AsnAsp: 1.639 ± 1.074
1.639AsnGlu: 1.639 ± 1.074
1.639AsnPhe: 1.639 ± 1.074
4.918AsnGly: 4.918 ± 1.077
1.639AsnHis: 1.639 ± 1.07
4.918AsnIle: 4.918 ± 1.077
0.0AsnLys: 0.0 ± 0.0
1.639AsnLeu: 1.639 ± 1.07
3.279AsnMet: 3.279 ± 0.004
4.918AsnAsn: 4.918 ± 1.066
4.918AsnPro: 4.918 ± 1.077
0.0AsnGln: 0.0 ± 0.0
1.639AsnArg: 1.639 ± 1.074
1.639AsnSer: 1.639 ± 1.074
1.639AsnThr: 1.639 ± 1.07
3.279AsnVal: 3.279 ± 2.147
0.0AsnTrp: 0.0 ± 0.0
1.639AsnTyr: 1.639 ± 1.074
0.0AsnXaa: 0.0 ± 0.0
Pro
1.639ProAla: 1.639 ± 1.074
1.639ProCys: 1.639 ± 1.07
1.639ProAsp: 1.639 ± 1.07
1.639ProGlu: 1.639 ± 1.07
1.639ProPhe: 1.639 ± 1.074
3.279ProGly: 3.279 ± 0.004
0.0ProHis: 0.0 ± 0.0
1.639ProIle: 1.639 ± 1.074
0.0ProLys: 0.0 ± 0.0
1.639ProLeu: 1.639 ± 1.074
1.639ProMet: 1.639 ± 1.07
3.279ProAsn: 3.279 ± 0.004
1.639ProPro: 1.639 ± 1.074
0.0ProGln: 0.0 ± 0.0
1.639ProArg: 1.639 ± 1.074
9.836ProSer: 9.836 ± 2.133
3.279ProThr: 3.279 ± 2.147
3.279ProVal: 3.279 ± 2.147
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.639GlnAla: 1.639 ± 1.07
1.639GlnCys: 1.639 ± 1.07
1.639GlnAsp: 1.639 ± 1.07
1.639GlnGlu: 1.639 ± 1.074
0.0GlnPhe: 0.0 ± 0.0
3.279GlnGly: 3.279 ± 2.14
0.0GlnHis: 0.0 ± 0.0
1.639GlnIle: 1.639 ± 1.07
0.0GlnLys: 0.0 ± 0.0
4.918GlnLeu: 4.918 ± 1.066
1.639GlnMet: 1.639 ± 0.818
1.639GlnAsn: 1.639 ± 1.07
3.279GlnPro: 3.279 ± 2.147
1.639GlnGln: 1.639 ± 1.07
1.639GlnArg: 1.639 ± 1.07
1.639GlnSer: 1.639 ± 1.074
1.639GlnThr: 1.639 ± 1.07
0.0GlnVal: 0.0 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
1.639ArgAla: 1.639 ± 1.074
1.639ArgCys: 1.639 ± 1.074
6.557ArgAsp: 6.557 ± 0.007
3.279ArgGlu: 3.279 ± 2.14
4.918ArgPhe: 4.918 ± 1.077
1.639ArgGly: 1.639 ± 1.074
0.0ArgHis: 0.0 ± 0.0
3.279ArgIle: 3.279 ± 0.004
9.836ArgLys: 9.836 ± 0.011
1.639ArgLeu: 1.639 ± 1.07
0.0ArgMet: 0.0 ± 0.0
1.639ArgAsn: 1.639 ± 1.074
3.279ArgPro: 3.279 ± 2.14
3.279ArgGln: 3.279 ± 2.14
21.311ArgArg: 21.311 ± 9.669
16.393ArgSer: 16.393 ± 4.305
6.557ArgThr: 6.557 ± 0.007
1.639ArgVal: 1.639 ± 1.074
1.639ArgTrp: 1.639 ± 1.074
8.197ArgTyr: 8.197 ± 1.081
0.0ArgXaa: 0.0 ± 0.0
Ser
9.836SerAla: 9.836 ± 0.011
1.639SerCys: 1.639 ± 1.074
1.639SerAsp: 1.639 ± 1.074
1.639SerGlu: 1.639 ± 1.074
1.639SerPhe: 1.639 ± 1.07
9.836SerGly: 9.836 ± 4.298
0.0SerHis: 0.0 ± 0.0
6.557SerIle: 6.557 ± 2.151
0.0SerLys: 0.0 ± 0.0
14.754SerLeu: 14.754 ± 5.343
0.0SerMet: 0.0 ± 0.0
6.557SerAsn: 6.557 ± 2.136
1.639SerPro: 1.639 ± 1.07
3.279SerGln: 3.279 ± 2.14
4.918SerArg: 4.918 ± 1.077
6.557SerSer: 6.557 ± 2.136
4.918SerThr: 4.918 ± 1.066
4.918SerVal: 4.918 ± 3.221
3.279SerTrp: 3.279 ± 2.14
3.279SerTyr: 3.279 ± 0.004
0.0SerXaa: 0.0 ± 0.0
Thr
6.557ThrAla: 6.557 ± 4.294
0.0ThrCys: 0.0 ± 0.0
4.918ThrAsp: 4.918 ± 1.077
3.279ThrGlu: 3.279 ± 0.004
1.639ThrPhe: 1.639 ± 1.07
1.639ThrGly: 1.639 ± 1.07
1.639ThrHis: 1.639 ± 1.07
3.279ThrIle: 3.279 ± 0.004
3.279ThrLys: 3.279 ± 0.004
3.279ThrLeu: 3.279 ± 0.004
1.639ThrMet: 1.639 ± 1.07
1.639ThrAsn: 1.639 ± 1.074
0.0ThrPro: 0.0 ± 0.0
1.639ThrGln: 1.639 ± 1.07
8.197ThrArg: 8.197 ± 1.063
1.639ThrSer: 1.639 ± 1.074
1.639ThrThr: 1.639 ± 1.074
4.918ThrVal: 4.918 ± 3.221
0.0ThrTrp: 0.0 ± 0.0
4.918ThrTyr: 4.918 ± 1.077
0.0ThrXaa: 0.0 ± 0.0
Val
1.639ValAla: 1.639 ± 1.074
1.639ValCys: 1.639 ± 1.07
1.639ValAsp: 1.639 ± 1.07
3.279ValGlu: 3.279 ± 2.14
3.279ValPhe: 3.279 ± 0.004
6.557ValGly: 6.557 ± 0.007
3.279ValHis: 3.279 ± 2.147
0.0ValIle: 0.0 ± 0.0
0.0ValLys: 0.0 ± 0.0
1.639ValLeu: 1.639 ± 1.074
0.0ValMet: 0.0 ± 0.0
3.279ValAsn: 3.279 ± 2.147
1.639ValPro: 1.639 ± 1.074
4.918ValGln: 4.918 ± 3.221
6.557ValArg: 6.557 ± 2.151
4.918ValSer: 4.918 ± 3.221
4.918ValThr: 4.918 ± 1.066
1.639ValVal: 1.639 ± 1.07
1.639ValTrp: 1.639 ± 1.07
4.918ValTyr: 4.918 ± 1.077
0.0ValXaa: 0.0 ± 0.0
Trp
1.639TrpAla: 1.639 ± 1.07
0.0TrpCys: 0.0 ± 0.0
3.279TrpAsp: 3.279 ± 2.14
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
3.279TrpGly: 3.279 ± 2.14
1.639TrpHis: 1.639 ± 1.074
1.639TrpIle: 1.639 ± 1.07
1.639TrpLys: 1.639 ± 1.074
6.557TrpLeu: 6.557 ± 2.136
0.0TrpMet: 0.0 ± 0.0
1.639TrpAsn: 1.639 ± 1.074
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.639TrpArg: 1.639 ± 1.074
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.639TrpVal: 1.639 ± 1.074
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
9.836TyrAla: 9.836 ± 2.154
1.639TyrCys: 1.639 ± 1.07
0.0TyrAsp: 0.0 ± 0.0
1.639TyrGlu: 1.639 ± 1.07
1.639TyrPhe: 1.639 ± 1.07
6.557TyrGly: 6.557 ± 2.151
0.0TyrHis: 0.0 ± 0.0
1.639TyrIle: 1.639 ± 1.074
4.918TyrLys: 4.918 ± 1.077
1.639TyrLeu: 1.639 ± 1.074
0.0TyrMet: 0.0 ± 0.816
0.0TyrAsn: 0.0 ± 0.0
3.279TyrPro: 3.279 ± 2.147
0.0TyrGln: 0.0 ± 0.0
3.279TyrArg: 3.279 ± 2.14
1.639TyrSer: 1.639 ± 1.07
0.0TyrThr: 0.0 ± 0.0
3.279TyrVal: 3.279 ± 2.14
1.639TyrTrp: 1.639 ± 1.074
3.279TyrTyr: 3.279 ± 2.147
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (611 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski