Amino acid dipepetide frequency for Faeces associated gemycircularvirus 12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.211AlaAla: 6.211 ± 3.237
0.0AlaCys: 0.0 ± 0.0
1.553AlaAsp: 1.553 ± 0.947
7.764AlaGlu: 7.764 ± 0.053
3.106AlaPhe: 3.106 ± 1.895
7.764AlaGly: 7.764 ± 4.631
0.0AlaHis: 0.0 ± 0.0
1.553AlaIle: 1.553 ± 0.947
4.658AlaLys: 4.658 ± 0.5
4.658AlaLeu: 4.658 ± 4.184
0.0AlaMet: 0.0 ± 0.0
3.106AlaAsn: 3.106 ± 1.895
1.553AlaPro: 1.553 ± 0.947
3.106AlaGln: 3.106 ± 0.447
6.211AlaArg: 6.211 ± 3.79
4.658AlaSer: 4.658 ± 1.842
1.553AlaThr: 1.553 ± 0.947
3.106AlaVal: 3.106 ± 0.447
0.0AlaTrp: 0.0 ± 0.0
1.553AlaTyr: 1.553 ± 1.395
0.0AlaXaa: 0.0 ± 0.0
Cys
3.106CysAla: 3.106 ± 0.447
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.553CysGlu: 1.553 ± 0.947
1.553CysPhe: 1.553 ± 0.947
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.553CysIle: 1.553 ± 0.947
0.0CysLys: 0.0 ± 0.0
1.553CysLeu: 1.553 ± 0.947
0.0CysMet: 0.0 ± 0.0
4.658CysAsn: 4.658 ± 2.842
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.553CysTyr: 1.553 ± 0.947
0.0CysXaa: 0.0 ± 0.0
Asp
3.106AspAla: 3.106 ± 0.447
0.0AspCys: 0.0 ± 0.0
1.553AspAsp: 1.553 ± 0.947
4.658AspGlu: 4.658 ± 0.5
7.764AspPhe: 7.764 ± 4.737
7.764AspGly: 7.764 ± 2.395
1.553AspHis: 1.553 ± 0.947
4.658AspIle: 4.658 ± 1.842
3.106AspLys: 3.106 ± 1.895
3.106AspLeu: 3.106 ± 0.447
0.0AspMet: 0.0 ± 0.0
1.553AspAsn: 1.553 ± 0.947
3.106AspPro: 3.106 ± 1.895
0.0AspGln: 0.0 ± 0.0
0.0AspArg: 0.0 ± 0.0
4.658AspSer: 4.658 ± 2.842
0.0AspThr: 0.0 ± 0.0
4.658AspVal: 4.658 ± 0.5
3.106AspTrp: 3.106 ± 0.447
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.106GluAla: 3.106 ± 0.447
1.553GluCys: 1.553 ± 0.947
3.106GluAsp: 3.106 ± 1.895
0.0GluGlu: 0.0 ± 0.0
4.658GluPhe: 4.658 ± 0.5
0.0GluGly: 0.0 ± 0.0
1.553GluHis: 1.553 ± 0.947
1.553GluIle: 1.553 ± 0.947
1.553GluLys: 1.553 ± 0.947
4.658GluLeu: 4.658 ± 0.5
0.0GluMet: 0.0 ± 0.0
1.553GluAsn: 1.553 ± 1.395
3.106GluPro: 3.106 ± 0.447
0.0GluGln: 0.0 ± 0.0
6.211GluArg: 6.211 ± 3.79
4.658GluSer: 4.658 ± 0.5
1.553GluThr: 1.553 ± 1.395
3.106GluVal: 3.106 ± 1.895
1.553GluTrp: 1.553 ± 0.947
4.658GluTyr: 4.658 ± 0.5
0.0GluXaa: 0.0 ± 0.0
Phe
3.106PheAla: 3.106 ± 1.895
1.553PheCys: 1.553 ± 0.947
9.317PheAsp: 9.317 ± 1.0
3.106PheGlu: 3.106 ± 0.447
0.0PhePhe: 0.0 ± 0.0
1.553PheGly: 1.553 ± 0.947
3.106PheHis: 3.106 ± 1.895
1.553PheIle: 1.553 ± 0.947
3.106PheLys: 3.106 ± 0.447
6.211PheLeu: 6.211 ± 1.447
3.106PheMet: 3.106 ± 0.447
3.106PheAsn: 3.106 ± 0.447
0.0PhePro: 0.0 ± 0.0
1.553PheGln: 1.553 ± 0.947
6.211PheArg: 6.211 ± 1.447
3.106PheSer: 3.106 ± 2.789
1.553PheThr: 1.553 ± 0.947
4.658PheVal: 4.658 ± 2.842
0.0PheTrp: 0.0 ± 0.0
4.658PheTyr: 4.658 ± 1.842
0.0PheXaa: 0.0 ± 0.0
Gly
3.106GlyAla: 3.106 ± 2.789
0.0GlyCys: 0.0 ± 0.0
3.106GlyAsp: 3.106 ± 1.895
4.658GlyGlu: 4.658 ± 2.842
3.106GlyPhe: 3.106 ± 0.447
4.658GlyGly: 4.658 ± 0.5
0.0GlyHis: 0.0 ± 0.0
0.0GlyIle: 0.0 ± 0.0
6.211GlyLys: 6.211 ± 0.895
4.658GlyLeu: 4.658 ± 0.5
1.553GlyMet: 1.553 ± 1.511
3.106GlyAsn: 3.106 ± 2.789
4.658GlyPro: 4.658 ± 2.842
4.658GlyGln: 4.658 ± 0.5
3.106GlyArg: 3.106 ± 0.447
1.553GlySer: 1.553 ± 1.395
6.211GlyThr: 6.211 ± 0.895
0.0GlyVal: 0.0 ± 0.0
0.0GlyTrp: 0.0 ± 0.0
1.553GlyTyr: 1.553 ± 1.395
0.0GlyXaa: 0.0 ± 0.0
His
1.553HisAla: 1.553 ± 0.947
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
3.106HisPhe: 3.106 ± 1.895
1.553HisGly: 1.553 ± 0.947
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
3.106HisLys: 3.106 ± 0.447
1.553HisLeu: 1.553 ± 0.947
1.553HisMet: 1.553 ± 1.395
0.0HisAsn: 0.0 ± 0.0
3.106HisPro: 3.106 ± 0.447
0.0HisGln: 0.0 ± 0.0
3.106HisArg: 3.106 ± 1.895
3.106HisSer: 3.106 ± 2.789
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
1.553HisTrp: 1.553 ± 0.947
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
1.553IleAla: 1.553 ± 0.947
0.0IleCys: 0.0 ± 0.0
1.553IleAsp: 1.553 ± 0.947
4.658IleGlu: 4.658 ± 2.842
3.106IlePhe: 3.106 ± 1.895
1.553IleGly: 1.553 ± 0.947
6.211IleHis: 6.211 ± 3.237
4.658IleIle: 4.658 ± 1.842
3.106IleLys: 3.106 ± 0.447
6.211IleLeu: 6.211 ± 3.79
3.106IleMet: 3.106 ± 2.154
0.0IleAsn: 0.0 ± 0.0
3.106IlePro: 3.106 ± 0.447
0.0IleGln: 0.0 ± 0.0
6.211IleArg: 6.211 ± 3.237
1.553IleSer: 1.553 ± 1.395
3.106IleThr: 3.106 ± 0.447
6.211IleVal: 6.211 ± 0.895
1.553IleTrp: 1.553 ± 0.947
3.106IleTyr: 3.106 ± 0.447
0.0IleXaa: 0.0 ± 0.0
Lys
6.211LysAla: 6.211 ± 3.237
0.0LysCys: 0.0 ± 0.0
4.658LysAsp: 4.658 ± 0.5
3.106LysGlu: 3.106 ± 1.895
7.764LysPhe: 7.764 ± 0.053
3.106LysGly: 3.106 ± 2.789
0.0LysHis: 0.0 ± 0.0
3.106LysIle: 3.106 ± 0.447
9.317LysLys: 9.317 ± 3.684
3.106LysLeu: 3.106 ± 0.447
0.0LysMet: 0.0 ± 0.0
3.106LysAsn: 3.106 ± 2.789
4.658LysPro: 4.658 ± 2.842
1.553LysGln: 1.553 ± 0.947
6.211LysArg: 6.211 ± 0.895
3.106LysSer: 3.106 ± 1.895
3.106LysThr: 3.106 ± 2.789
6.211LysVal: 6.211 ± 5.579
0.0LysTrp: 0.0 ± 0.0
7.764LysTyr: 7.764 ± 2.289
0.0LysXaa: 0.0 ± 0.0
Leu
0.0LeuAla: 0.0 ± 0.0
3.106LeuCys: 3.106 ± 0.447
9.317LeuAsp: 9.317 ± 3.342
0.0LeuGlu: 0.0 ± 0.0
1.553LeuPhe: 1.553 ± 0.947
6.211LeuGly: 6.211 ± 3.79
3.106LeuHis: 3.106 ± 0.447
4.658LeuIle: 4.658 ± 0.5
3.106LeuLys: 3.106 ± 0.447
0.0LeuLeu: 0.0 ± 0.0
3.106LeuMet: 3.106 ± 0.447
4.658LeuAsn: 4.658 ± 0.5
3.106LeuPro: 3.106 ± 2.789
4.658LeuGln: 4.658 ± 1.842
3.106LeuArg: 3.106 ± 2.789
3.106LeuSer: 3.106 ± 0.447
1.553LeuThr: 1.553 ± 0.947
3.106LeuVal: 3.106 ± 0.447
0.0LeuTrp: 0.0 ± 0.0
1.553LeuTyr: 1.553 ± 0.947
0.0LeuXaa: 0.0 ± 0.0
Met
1.553MetAla: 1.553 ± 1.395
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
1.553MetPhe: 1.553 ± 0.947
1.553MetGly: 1.553 ± 0.947
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.553MetLeu: 1.553 ± 0.947
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.553MetPro: 1.553 ± 0.947
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
4.658MetSer: 4.658 ± 4.184
1.553MetThr: 1.553 ± 1.395
0.0MetVal: 0.0 ± 0.0
1.553MetTrp: 1.553 ± 0.947
3.106MetTyr: 3.106 ± 2.789
0.0MetXaa: 0.0 ± 0.0
Asn
1.553AsnAla: 1.553 ± 1.395
0.0AsnCys: 0.0 ± 0.0
4.658AsnAsp: 4.658 ± 0.5
4.658AsnGlu: 4.658 ± 0.5
3.106AsnPhe: 3.106 ± 2.789
1.553AsnGly: 1.553 ± 0.947
1.553AsnHis: 1.553 ± 1.395
6.211AsnIle: 6.211 ± 1.447
7.764AsnLys: 7.764 ± 0.053
1.553AsnLeu: 1.553 ± 0.947
0.0AsnMet: 0.0 ± 0.0
1.553AsnAsn: 1.553 ± 0.947
4.658AsnPro: 4.658 ± 2.842
3.106AsnGln: 3.106 ± 0.447
1.553AsnArg: 1.553 ± 1.395
6.211AsnSer: 6.211 ± 0.895
1.553AsnThr: 1.553 ± 0.947
3.106AsnVal: 3.106 ± 1.895
0.0AsnTrp: 0.0 ± 0.0
3.106AsnTyr: 3.106 ± 0.447
0.0AsnXaa: 0.0 ± 0.0
Pro
3.106ProAla: 3.106 ± 1.895
0.0ProCys: 0.0 ± 0.0
0.0ProAsp: 0.0 ± 0.0
3.106ProGlu: 3.106 ± 0.447
3.106ProPhe: 3.106 ± 0.447
1.553ProGly: 1.553 ± 1.395
1.553ProHis: 1.553 ± 0.947
3.106ProIle: 3.106 ± 1.895
0.0ProLys: 0.0 ± 0.0
1.553ProLeu: 1.553 ± 1.395
0.0ProMet: 0.0 ± 0.0
7.764ProAsn: 7.764 ± 2.395
1.553ProPro: 1.553 ± 0.947
0.0ProGln: 0.0 ± 0.0
4.658ProArg: 4.658 ± 2.842
3.106ProSer: 3.106 ± 1.895
3.106ProThr: 3.106 ± 0.447
1.553ProVal: 1.553 ± 1.395
3.106ProTrp: 3.106 ± 0.447
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
1.553GlnGlu: 1.553 ± 0.947
3.106GlnPhe: 3.106 ± 2.789
3.106GlnGly: 3.106 ± 2.789
1.553GlnHis: 1.553 ± 0.947
3.106GlnIle: 3.106 ± 0.447
1.553GlnLys: 1.553 ± 0.947
0.0GlnLeu: 0.0 ± 0.0
1.553GlnMet: 1.553 ± 0.947
1.553GlnAsn: 1.553 ± 0.947
0.0GlnPro: 0.0 ± 0.0
1.553GlnGln: 1.553 ± 0.947
3.106GlnArg: 3.106 ± 1.895
1.553GlnSer: 1.553 ± 0.947
4.658GlnThr: 4.658 ± 1.842
1.553GlnVal: 1.553 ± 0.947
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.106ArgAla: 3.106 ± 1.895
3.106ArgCys: 3.106 ± 1.895
3.106ArgAsp: 3.106 ± 1.895
4.658ArgGlu: 4.658 ± 0.5
3.106ArgPhe: 3.106 ± 1.895
3.106ArgGly: 3.106 ± 1.895
3.106ArgHis: 3.106 ± 1.895
7.764ArgIle: 7.764 ± 2.289
7.764ArgLys: 7.764 ± 2.289
3.106ArgLeu: 3.106 ± 0.447
1.553ArgMet: 1.553 ± 1.395
4.658ArgAsn: 4.658 ± 2.842
0.0ArgPro: 0.0 ± 0.0
3.106ArgGln: 3.106 ± 0.447
10.87ArgArg: 10.87 ± 0.395
3.106ArgSer: 3.106 ± 0.447
4.658ArgThr: 4.658 ± 0.5
3.106ArgVal: 3.106 ± 1.895
1.553ArgTrp: 1.553 ± 1.395
1.553ArgTyr: 1.553 ± 1.395
0.0ArgXaa: 0.0 ± 0.0
Ser
3.106SerAla: 3.106 ± 0.447
0.0SerCys: 0.0 ± 0.0
3.106SerAsp: 3.106 ± 0.447
3.106SerGlu: 3.106 ± 0.447
0.0SerPhe: 0.0 ± 0.0
1.553SerGly: 1.553 ± 1.395
0.0SerHis: 0.0 ± 0.0
1.553SerIle: 1.553 ± 0.947
6.211SerLys: 6.211 ± 3.237
3.106SerLeu: 3.106 ± 2.789
1.553SerMet: 1.553 ± 1.395
7.764SerAsn: 7.764 ± 0.053
0.0SerPro: 0.0 ± 0.0
6.211SerGln: 6.211 ± 1.447
3.106SerArg: 3.106 ± 0.447
6.211SerSer: 6.211 ± 0.895
13.975SerThr: 13.975 ± 5.526
6.211SerVal: 6.211 ± 0.895
1.553SerTrp: 1.553 ± 0.947
4.658SerTyr: 4.658 ± 1.842
0.0SerXaa: 0.0 ± 0.0
Thr
3.106ThrAla: 3.106 ± 2.789
1.553ThrCys: 1.553 ± 0.947
4.658ThrAsp: 4.658 ± 0.5
1.553ThrGlu: 1.553 ± 0.947
0.0ThrPhe: 0.0 ± 0.0
1.553ThrGly: 1.553 ± 1.395
0.0ThrHis: 0.0 ± 0.0
4.658ThrIle: 4.658 ± 4.184
4.658ThrLys: 4.658 ± 1.842
3.106ThrLeu: 3.106 ± 1.895
0.0ThrMet: 0.0 ± 0.0
3.106ThrAsn: 3.106 ± 2.789
4.658ThrPro: 4.658 ± 1.842
0.0ThrGln: 0.0 ± 0.0
3.106ThrArg: 3.106 ± 1.895
4.658ThrSer: 4.658 ± 4.184
1.553ThrThr: 1.553 ± 1.395
3.106ThrVal: 3.106 ± 1.895
1.553ThrTrp: 1.553 ± 0.947
7.764ThrTyr: 7.764 ± 0.053
0.0ThrXaa: 0.0 ± 0.0
Val
10.87ValAla: 10.87 ± 4.29
4.658ValCys: 4.658 ± 2.842
1.553ValAsp: 1.553 ± 0.947
0.0ValGlu: 0.0 ± 0.0
3.106ValPhe: 3.106 ± 1.895
6.211ValGly: 6.211 ± 0.895
0.0ValHis: 0.0 ± 0.0
3.106ValIle: 3.106 ± 0.447
1.553ValLys: 1.553 ± 1.395
4.658ValLeu: 4.658 ± 0.5
0.0ValMet: 0.0 ± 0.0
1.553ValAsn: 1.553 ± 1.395
1.553ValPro: 1.553 ± 0.947
0.0ValGln: 0.0 ± 0.0
6.211ValArg: 6.211 ± 1.447
6.211ValSer: 6.211 ± 3.237
1.553ValThr: 1.553 ± 0.947
4.658ValVal: 4.658 ± 2.842
3.106ValTrp: 3.106 ± 2.789
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.553TrpAla: 1.553 ± 0.947
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
4.658TrpPhe: 4.658 ± 0.5
1.553TrpGly: 1.553 ± 0.947
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
3.106TrpLys: 3.106 ± 0.447
4.658TrpLeu: 4.658 ± 1.842
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
1.553TrpPro: 1.553 ± 1.395
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.553TrpSer: 1.553 ± 0.947
1.553TrpThr: 1.553 ± 1.395
3.106TrpVal: 3.106 ± 1.895
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.106TyrAla: 3.106 ± 0.447
0.0TyrCys: 0.0 ± 0.0
3.106TyrAsp: 3.106 ± 0.447
0.0TyrGlu: 0.0 ± 0.0
3.106TyrPhe: 3.106 ± 2.789
1.553TyrGly: 1.553 ± 0.947
0.0TyrHis: 0.0 ± 0.0
7.764TyrIle: 7.764 ± 0.053
6.211TyrLys: 6.211 ± 5.579
0.0TyrLeu: 0.0 ± 0.0
0.0TyrMet: 0.0 ± 0.0
4.658TyrAsn: 4.658 ± 0.5
0.0TyrPro: 0.0 ± 0.0
0.0TyrGln: 0.0 ± 0.0
3.106TyrArg: 3.106 ± 2.789
6.211TyrSer: 6.211 ± 0.895
1.553TyrThr: 1.553 ± 0.947
3.106TyrVal: 3.106 ± 0.447
3.106TyrTrp: 3.106 ± 0.447
3.106TyrTyr: 3.106 ± 0.447
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (645 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski