Amino acid dipeptide frequency for Chicken stool-associated gemycircularvirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
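The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function names, the toy sequences, and the choice to pool counts over all proteins and to compare against a flat 2.5‰ expectation are assumptions made here.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(proteins):
    """Pooled dipeptide frequencies in per mille (hypothetical helper).

    Any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the assumed uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input, not the viral proteome: 'B' is non-standard and becomes 'X'.
toy = ["MARRSRA", "MRRAB"]
freqs = dipeptide_permille(toy)
# RR occurs in 2 of the 10 pairs, so its frequency is 200.0 per mille.
print(freqs["RR"], classify(freqs["RR"]))
```

Dipeptides that never occur are simply missing from the returned dictionary and correspond to the 0.0 entries of a table like the one on this page.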

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± bootstrap error), one row per first residue, with the second residue in the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa:

Ala: AlaAla 3.868 ± 2.287; AlaCys 0.0 ± 0.0; AlaAsp 5.803 ± 0.525; AlaGlu 3.868 ± 3.524; AlaPhe 3.868 ± 2.287; AlaGly 5.803 ± 0.525; AlaHis 1.934 ± 1.762; AlaIle 3.868 ± 3.524; AlaLys 1.934 ± 1.762; AlaLeu 1.934 ± 1.144; AlaMet 0.0 ± 0.0; AlaAsn 1.934 ± 1.762; AlaPro 9.671 ± 2.813; AlaGln 0.0 ± 0.0; AlaArg 1.934 ± 1.144; AlaSer 7.737 ± 1.236; AlaThr 3.868 ± 2.287; AlaVal 0.0 ± 0.0; AlaTrp 1.934 ± 1.144; AlaTyr 1.934 ± 1.144; AlaXaa 0.0 ± 0.0
Cys: CysAla 0.0 ± 0.0; CysCys 0.0 ± 0.0; CysAsp 0.0 ± 0.0; CysGlu 0.0 ± 0.0; CysPhe 1.934 ± 1.144; CysGly 0.0 ± 0.0; CysHis 0.0 ± 0.0; CysIle 0.0 ± 0.0; CysLys 0.0 ± 0.0; CysLeu 0.0 ± 0.0; CysMet 0.0 ± 0.0; CysAsn 0.0 ± 0.0; CysPro 1.934 ± 1.144; CysGln 0.0 ± 0.0; CysArg 0.0 ± 0.0; CysSer 0.0 ± 0.0; CysThr 0.0 ± 0.0; CysVal 1.934 ± 1.762; CysTrp 0.0 ± 0.0; CysTyr 0.0 ± 0.0; CysXaa 0.0 ± 0.0
Asp: AspAla 1.934 ± 1.762; AspCys 0.0 ± 0.0; AspAsp 5.803 ± 3.431; AspGlu 1.934 ± 1.762; AspPhe 1.934 ± 1.144; AspGly 3.868 ± 3.524; AspHis 0.0 ± 0.0; AspIle 3.868 ± 0.618; AspLys 3.868 ± 2.287; AspLeu 5.803 ± 0.525; AspMet 0.0 ± 0.0; AspAsn 0.0 ± 0.0; AspPro 3.868 ± 0.618; AspGln 0.0 ± 0.0; AspArg 3.868 ± 2.287; AspSer 5.803 ± 0.525; AspThr 1.934 ± 1.144; AspVal 1.934 ± 1.762; AspTrp 0.0 ± 0.0; AspTyr 5.803 ± 2.38; AspXaa 0.0 ± 0.0
Glu: GluAla 3.868 ± 2.287; GluCys 0.0 ± 0.0; GluAsp 5.803 ± 2.38; GluGlu 5.803 ± 0.525; GluPhe 5.803 ± 0.525; GluGly 1.934 ± 1.762; GluHis 0.0 ± 0.0; GluIle 1.934 ± 1.144; GluLys 0.0 ± 0.0; GluLeu 3.868 ± 3.524; GluMet 1.934 ± 1.144; GluAsn 7.737 ± 4.574; GluPro 0.0 ± 0.0; GluGln 3.868 ± 2.287; GluArg 5.803 ± 5.285; GluSer 3.868 ± 0.618; GluThr 1.934 ± 1.762; GluVal 1.934 ± 1.762; GluTrp 0.0 ± 0.0; GluTyr 1.934 ± 1.762; GluXaa 0.0 ± 0.0
Phe: PheAla 1.934 ± 1.144; PheCys 0.0 ± 0.0; PheAsp 1.934 ± 1.762; PheGlu 3.868 ± 0.618; PhePhe 9.671 ± 0.093; PheGly 3.868 ± 2.287; PheHis 0.0 ± 0.0; PheIle 3.868 ± 0.618; PheLys 0.0 ± 0.0; PheLeu 3.868 ± 0.618; PheMet 0.0 ± 0.0; PheAsn 1.934 ± 1.762; PhePro 0.0 ± 0.0; PheGln 0.0 ± 0.0; PheArg 11.605 ± 1.855; PheSer 7.737 ± 4.574; PheThr 1.934 ± 1.144; PheVal 3.868 ± 2.287; PheTrp 0.0 ± 0.0; PheTyr 1.934 ± 1.144; PheXaa 0.0 ± 0.0
Gly: GlyAla 7.737 ± 1.236; GlyCys 0.0 ± 0.0; GlyAsp 5.803 ± 2.38; GlyGlu 5.803 ± 2.38; GlyPhe 0.0 ± 0.0; GlyGly 9.671 ± 8.809; GlyHis 0.0 ± 0.0; GlyIle 0.0 ± 0.0; GlyLys 3.868 ± 0.618; GlyLeu 1.934 ± 1.144; GlyMet 0.0 ± 0.0; GlyAsn 3.868 ± 2.287; GlyPro 1.934 ± 1.762; GlyGln 1.934 ± 1.144; GlyArg 1.934 ± 1.762; GlySer 7.737 ± 1.236; GlyThr 7.737 ± 4.142; GlyVal 5.803 ± 3.431; GlyTrp 3.868 ± 0.618; GlyTyr 5.803 ± 0.525; GlyXaa 0.0 ± 0.0
His: HisAla 3.868 ± 0.618; HisCys 1.934 ± 1.762; HisAsp 1.934 ± 1.144; HisGlu 5.803 ± 0.525; HisPhe 1.934 ± 1.762; HisGly 0.0 ± 0.0; HisHis 1.934 ± 1.144; HisIle 0.0 ± 0.0; HisLys 1.934 ± 1.144; HisLeu 1.934 ± 1.762; HisMet 0.0 ± 0.0; HisAsn 0.0 ± 0.0; HisPro 3.868 ± 0.618; HisGln 1.934 ± 1.762; HisArg 0.0 ± 0.0; HisSer 1.934 ± 1.144; HisThr 0.0 ± 0.0; HisVal 0.0 ± 0.0; HisTrp 0.0 ± 0.0; HisTyr 0.0 ± 0.0; HisXaa 0.0 ± 0.0
Ile: IleAla 3.868 ± 0.618; IleCys 0.0 ± 0.0; IleAsp 1.934 ± 1.762; IleGlu 3.868 ± 3.524; IlePhe 3.868 ± 0.618; IleGly 1.934 ± 1.762; IleHis 1.934 ± 1.762; IleIle 1.934 ± 1.762; IleLys 3.868 ± 3.524; IleLeu 1.934 ± 1.144; IleMet 1.934 ± 1.144; IleAsn 1.934 ± 1.144; IlePro 1.934 ± 1.144; IleGln 1.934 ± 1.762; IleArg 1.934 ± 1.762; IleSer 0.0 ± 0.0; IleThr 1.934 ± 1.144; IleVal 3.868 ± 0.618; IleTrp 0.0 ± 0.0; IleTyr 1.934 ± 1.762; IleXaa 0.0 ± 0.0
Lys: LysAla 0.0 ± 0.0; LysCys 0.0 ± 0.0; LysAsp 1.934 ± 1.762; LysGlu 1.934 ± 1.144; LysPhe 0.0 ± 0.0; LysGly 1.934 ± 1.144; LysHis 1.934 ± 1.144; LysIle 3.868 ± 3.524; LysLys 3.868 ± 0.618; LysLeu 1.934 ± 1.144; LysMet 1.934 ± 1.144; LysAsn 0.0 ± 0.0; LysPro 3.868 ± 3.524; LysGln 0.0 ± 0.0; LysArg 1.934 ± 1.144; LysSer 3.868 ± 0.618; LysThr 3.868 ± 2.287; LysVal 0.0 ± 0.0; LysTrp 0.0 ± 0.0; LysTyr 0.0 ± 0.0; LysXaa 0.0 ± 0.0
Leu: LeuAla 0.0 ± 0.0; LeuCys 0.0 ± 0.0; LeuAsp 3.868 ± 2.287; LeuGlu 1.934 ± 1.144; LeuPhe 3.868 ± 3.524; LeuGly 9.671 ± 5.904; LeuHis 5.803 ± 0.525; LeuIle 0.0 ± 0.0; LeuLys 0.0 ± 0.0; LeuLeu 3.868 ± 0.618; LeuMet 0.0 ± 0.0; LeuAsn 0.0 ± 0.0; LeuPro 3.868 ± 2.287; LeuGln 0.0 ± 0.0; LeuArg 7.737 ± 1.236; LeuSer 7.737 ± 1.236; LeuThr 5.803 ± 0.525; LeuVal 1.934 ± 1.762; LeuTrp 1.934 ± 1.144; LeuTyr 0.0 ± 0.0; LeuXaa 0.0 ± 0.0
Met: MetAla 3.868 ± 0.618; MetCys 1.934 ± 1.144; MetAsp 0.0 ± 0.0; MetGlu 0.0 ± 0.0; MetPhe 0.0 ± 0.0; MetGly 1.934 ± 1.144; MetHis 0.0 ± 0.0; MetIle 0.0 ± 0.0; MetLys 0.0 ± 0.0; MetLeu 0.0 ± 0.0; MetMet 1.934 ± 1.144; MetAsn 0.0 ± 0.0; MetPro 0.0 ± 0.0; MetGln 0.0 ± 0.0; MetArg 1.934 ± 1.144; MetSer 0.0 ± 0.0; MetThr 5.803 ± 0.525; MetVal 0.0 ± 0.0; MetTrp 1.934 ± 1.144; MetTyr 1.934 ± 1.144; MetXaa 0.0 ± 0.0
Asn: AsnAla 3.868 ± 2.287; AsnCys 0.0 ± 0.0; AsnAsp 0.0 ± 0.0; AsnGlu 1.934 ± 1.144; AsnPhe 1.934 ± 1.144; AsnGly 5.803 ± 0.525; AsnHis 0.0 ± 0.0; AsnIle 3.868 ± 0.618; AsnLys 3.868 ± 0.618; AsnLeu 0.0 ± 0.0; AsnMet 0.0 ± 0.0; AsnAsn 3.868 ± 2.287; AsnPro 3.868 ± 2.287; AsnGln 1.934 ± 1.144; AsnArg 1.934 ± 1.762; AsnSer 3.868 ± 0.618; AsnThr 1.934 ± 1.144; AsnVal 1.934 ± 1.144; AsnTrp 0.0 ± 0.0; AsnTyr 5.803 ± 0.525; AsnXaa 0.0 ± 0.0
Pro: ProAla 5.803 ± 3.431; ProCys 0.0 ± 0.0; ProAsp 0.0 ± 0.0; ProGlu 0.0 ± 0.0; ProPhe 1.934 ± 1.144; ProGly 3.868 ± 2.287; ProHis 0.0 ± 0.0; ProIle 0.0 ± 0.0; ProLys 1.934 ± 1.144; ProLeu 3.868 ± 2.287; ProMet 0.0 ± 0.0; ProAsn 9.671 ± 0.093; ProPro 0.0 ± 0.0; ProGln 1.934 ± 1.144; ProArg 7.737 ± 4.142; ProSer 5.803 ± 0.525; ProThr 3.868 ± 3.524; ProVal 3.868 ± 2.287; ProTrp 1.934 ± 1.762; ProTyr 0.0 ± 0.0; ProXaa 0.0 ± 0.0
Gln: GlnAla 0.0 ± 0.0; GlnCys 0.0 ± 0.0; GlnAsp 5.803 ± 0.525; GlnGlu 1.934 ± 1.762; GlnPhe 1.934 ± 1.144; GlnGly 0.0 ± 0.0; GlnHis 5.803 ± 0.525; GlnIle 3.868 ± 0.618; GlnLys 0.0 ± 0.0; GlnLeu 5.803 ± 0.525; GlnMet 0.0 ± 0.0; GlnAsn 1.934 ± 1.144; GlnPro 0.0 ± 0.0; GlnGln 0.0 ± 0.0; GlnArg 0.0 ± 0.0; GlnSer 1.934 ± 1.762; GlnThr 0.0 ± 0.0; GlnVal 0.0 ± 0.0; GlnTrp 0.0 ± 0.0; GlnTyr 1.934 ± 1.762; GlnXaa 0.0 ± 0.0
Arg: ArgAla 3.868 ± 3.524; ArgCys 0.0 ± 0.0; ArgAsp 0.0 ± 0.0; ArgGlu 5.803 ± 2.38; ArgPhe 9.671 ± 2.998; ArgGly 7.737 ± 4.142; ArgHis 0.0 ± 0.0; ArgIle 1.934 ± 1.762; ArgLys 5.803 ± 0.525; ArgLeu 5.803 ± 5.285; ArgMet 5.803 ± 1.647; ArgAsn 3.868 ± 2.287; ArgPro 5.803 ± 3.431; ArgGln 0.0 ± 0.0; ArgArg 30.948 ± 15.392; ArgSer 9.671 ± 0.093; ArgThr 3.868 ± 2.287; ArgVal 7.737 ± 1.669; ArgTrp 3.868 ± 0.618; ArgTyr 0.0 ± 0.0; ArgXaa 0.0 ± 0.0
Ser: SerAla 3.868 ± 0.618; SerCys 0.0 ± 0.0; SerAsp 5.803 ± 2.38; SerGlu 7.737 ± 4.574; SerPhe 1.934 ± 1.144; SerGly 1.934 ± 1.762; SerHis 5.803 ± 5.285; SerIle 3.868 ± 3.524; SerLys 0.0 ± 0.0; SerLeu 3.868 ± 0.618; SerMet 3.868 ± 2.287; SerAsn 3.868 ± 2.287; SerPro 3.868 ± 0.618; SerGln 5.803 ± 2.38; SerArg 9.671 ± 0.093; SerSer 3.868 ± 2.287; SerThr 9.671 ± 2.813; SerVal 1.934 ± 1.144; SerTrp 0.0 ± 0.0; SerTyr 9.671 ± 2.813; SerXaa 0.0 ± 0.0
Thr: ThrAla 1.934 ± 1.144; ThrCys 0.0 ± 0.0; ThrAsp 0.0 ± 0.0; ThrGlu 1.934 ± 1.762; ThrPhe 1.934 ± 1.144; ThrGly 9.671 ± 2.813; ThrHis 0.0 ± 0.0; ThrIle 5.803 ± 0.525; ThrLys 0.0 ± 0.0; ThrLeu 1.934 ± 1.144; ThrMet 1.934 ± 1.144; ThrAsn 0.0 ± 0.0; ThrPro 7.737 ± 1.236; ThrGln 5.803 ± 2.38; ThrArg 5.803 ± 0.525; ThrSer 5.803 ± 0.525; ThrThr 0.0 ± 0.0; ThrVal 5.803 ± 3.431; ThrTrp 1.934 ± 1.144; ThrTyr 1.934 ± 1.762; ThrXaa 0.0 ± 0.0
Val: ValAla 0.0 ± 0.0; ValCys 1.934 ± 1.144; ValAsp 1.934 ± 1.144; ValGlu 1.934 ± 1.144; ValPhe 1.934 ± 1.144; ValGly 5.803 ± 0.525; ValHis 0.0 ± 0.0; ValIle 1.934 ± 1.762; ValLys 0.0 ± 0.0; ValLeu 3.868 ± 3.524; ValMet 0.0 ± 0.0; ValAsn 3.868 ± 0.618; ValPro 0.0 ± 0.0; ValGln 5.803 ± 0.525; ValArg 9.671 ± 2.813; ValSer 0.0 ± 0.0; ValThr 1.934 ± 1.144; ValVal 1.934 ± 1.762; ValTrp 0.0 ± 0.0; ValTyr 5.803 ± 3.431; ValXaa 0.0 ± 0.0
Trp: TrpAla 1.934 ± 1.144; TrpCys 0.0 ± 0.0; TrpAsp 0.0 ± 0.0; TrpGlu 0.0 ± 0.0; TrpPhe 0.0 ± 0.0; TrpGly 0.0 ± 0.0; TrpHis 3.868 ± 2.287; TrpIle 0.0 ± 0.0; TrpLys 1.934 ± 1.144; TrpLeu 1.934 ± 1.762; TrpMet 0.0 ± 0.0; TrpAsn 3.868 ± 0.618; TrpPro 0.0 ± 0.0; TrpGln 0.0 ± 0.0; TrpArg 1.934 ± 1.144; TrpSer 1.934 ± 1.762; TrpThr 1.934 ± 1.144; TrpVal 0.0 ± 0.0; TrpTrp 0.0 ± 0.0; TrpTyr 0.0 ± 0.0; TrpXaa 0.0 ± 0.0
Tyr: TyrAla 9.671 ± 2.998; TyrCys 0.0 ± 0.0; TyrAsp 3.868 ± 2.287; TyrGlu 1.934 ± 1.144; TyrPhe 3.868 ± 2.287; TyrGly 0.0 ± 0.0; TyrHis 0.0 ± 0.0; TyrIle 1.934 ± 1.144; TyrLys 0.0 ± 0.0; TyrLeu 3.868 ± 2.287; TyrMet 0.0 ± 0.0; TyrAsn 0.0 ± 0.0; TyrPro 0.0 ± 0.0; TyrGln 0.0 ± 0.0; TyrArg 5.803 ± 5.285; TyrSer 7.737 ± 1.669; TyrThr 1.934 ± 1.762; TyrVal 3.868 ± 0.618; TyrTrp 1.934 ± 1.144; TyrTyr 1.934 ± 1.144; TyrXaa 0.0 ± 0.0
Xaa: XaaAla 0.0 ± 0.0; XaaCys 0.0 ± 0.0; XaaAsp 0.0 ± 0.0; XaaGlu 0.0 ± 0.0; XaaPhe 0.0 ± 0.0; XaaGly 0.0 ± 0.0; XaaHis 0.0 ± 0.0; XaaIle 0.0 ± 0.0; XaaLys 0.0 ± 0.0; XaaLeu 0.0 ± 0.0; XaaMet 0.0 ± 0.0; XaaAsn 0.0 ± 0.0; XaaPro 0.0 ± 0.0; XaaGln 0.0 ± 0.0; XaaArg 0.0 ± 0.0; XaaSer 0.0 ± 0.0; XaaThr 0.0 ± 0.0; XaaVal 0.0 ± 0.0; XaaTrp 0.0 ± 0.0; XaaTyr 0.0 ± 0.0; XaaXaa 0.0 ± 0.0
Statistics based on 2 proteins (518 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
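How such a protein-level bootstrap could work is sketched below. The page does not describe the exact procedure, so this is an assumption: whole proteins are resampled with replacement, the pooled frequency of one dipeptide is recomputed for each resample, and the standard deviation of those values is reported. The function names and the fixed seed are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Counts of overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread (in per mille) of one dipeptide frequency over resampled protein sets."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy two-protein set, not the viral proteome.
toy = ["RRRR", "AAAA"]
print(bootstrap_error(toy, "RR"))  # positive: RR occurs in only one protein
print(bootstrap_error(toy, "WW"))  # 0.0: WW is absent from every protein
```

With only two proteins each resample is one of a handful of combinations, which is why a scheme like this yields a small set of recurring error values. A dipeptide that is absent from every protein is zero in every resample, so its error is 0.0.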

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski