Amino acid dipeptide frequency for Faeces associated gemycircularvirus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.
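The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to be given in one-letter code, and pairs are counted within each protein only (how the database treats protein boundaries is not stated here).

```python
from collections import Counter

# The 20 standard residues in one-letter code; anything else is treated as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: overlapping pairs are counted inside each sequence only.
    """
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to X before counting.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs: positions (0,1), (1,2), ...
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    # Toy input, not the real proteome.
    freqs = dipeptide_permille(["MKTSTS", "MAAG"])
    for pair, value in sorted(freqs.items()):
        print(pair, round(value, 3), classify(value))
```

With the table's values, `classify(7.764)` (AlaAla) gives "over" and `classify(1.553)` (AlaCys) gives "under", which corresponds to the red and blue marking described above.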

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
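As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a plain fraction and a percentage. The helper names are hypothetical and only serve the illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


if __name__ == "__main__":
    ala_ala = 7.764  # AlaAla value from the table, in per mille
    print(f"{permille_to_fraction(ala_ala):.6f}")  # prints 0.007764
    print(f"{permille_to_percent(ala_ala):.4f}")   # prints 0.7764
```

On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.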

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.764 ± 3.718
AlaCys: 1.553 ± 1.119
AlaAsp: 6.211 ± 2.148
AlaGlu: 6.211 ± 4.476
AlaPhe: 1.553 ± 1.119
AlaGly: 4.658 ± 1.029
AlaHis: 0.0 ± 0.0
AlaIle: 3.106 ± 0.09
AlaLys: 7.764 ± 1.39
AlaLeu: 6.211 ± 2.148
AlaMet: 1.553 ± 1.209
AlaAsn: 7.764 ± 1.39
AlaPro: 1.553 ± 1.209
AlaGln: 4.658 ± 1.3
AlaArg: 9.317 ± 0.271
AlaSer: 1.553 ± 1.119
AlaThr: 6.211 ± 0.181
AlaVal: 3.106 ± 2.238
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.106 ± 0.09
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 3.106 ± 0.09
CysCys: 0.0 ± 0.0
CysAsp: 1.553 ± 1.119
CysGlu: 0.0 ± 0.0
CysPhe: 1.553 ± 1.209
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.553 ± 1.119
CysLys: 1.553 ± 1.119
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.553 ± 1.119
CysPro: 1.553 ± 1.119
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.553 ± 1.209
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.106 ± 0.09
AspCys: 0.0 ± 0.0
AspAsp: 6.211 ± 2.509
AspGlu: 4.658 ± 1.029
AspPhe: 1.553 ± 1.119
AspGly: 7.764 ± 3.266
AspHis: 1.553 ± 1.119
AspIle: 9.317 ± 4.385
AspLys: 3.106 ± 2.419
AspLeu: 6.211 ± 0.181
AspMet: 3.106 ± 0.09
AspAsn: 0.0 ± 0.0
AspPro: 3.106 ± 2.238
AspGln: 1.553 ± 1.119
AspArg: 0.0 ± 0.0
AspSer: 1.553 ± 1.209
AspThr: 4.658 ± 3.628
AspVal: 4.658 ± 3.357
AspTrp: 1.553 ± 1.209
AspTyr: 6.211 ± 0.181
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.553 ± 1.209
GluCys: 1.553 ± 1.119
GluAsp: 1.553 ± 1.119
GluGlu: 1.553 ± 1.119
GluPhe: 1.553 ± 1.119
GluGly: 1.553 ± 1.119
GluHis: 3.106 ± 0.09
GluIle: 0.0 ± 0.0
GluLys: 1.553 ± 1.209
GluLeu: 4.658 ± 1.029
GluMet: 1.553 ± 0.761
GluAsn: 3.106 ± 2.238
GluPro: 1.553 ± 1.119
GluGln: 1.553 ± 1.119
GluArg: 4.658 ± 3.357
GluSer: 4.658 ± 1.3
GluThr: 0.0 ± 0.0
GluVal: 0.0 ± 0.0
GluTrp: 1.553 ± 1.119
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 1.553 ± 1.119
PheAsp: 1.553 ± 1.119
PheGlu: 1.553 ± 1.119
PhePhe: 3.106 ± 0.09
PheGly: 7.764 ± 3.266
PheHis: 1.553 ± 1.119
PheIle: 1.553 ± 1.119
PheLys: 1.553 ± 1.209
PheLeu: 3.106 ± 0.09
PheMet: 0.0 ± 0.0
PheAsn: 1.553 ± 1.209
PhePro: 4.658 ± 3.357
PheGln: 0.0 ± 0.0
PheArg: 3.106 ± 0.09
PheSer: 1.553 ± 1.209
PheThr: 3.106 ± 0.09
PheVal: 3.106 ± 0.09
PheTrp: 1.553 ± 1.209
PheTyr: 1.553 ± 1.209
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 13.975 ± 0.757
GlyCys: 0.0 ± 0.0
GlyAsp: 7.764 ± 1.39
GlyGlu: 0.0 ± 0.0
GlyPhe: 3.106 ± 2.238
GlyGly: 12.422 ± 0.362
GlyHis: 0.0 ± 0.0
GlyIle: 3.106 ± 0.09
GlyLys: 3.106 ± 2.238
GlyLeu: 7.764 ± 0.938
GlyMet: 6.211 ± 4.837
GlyAsn: 6.211 ± 0.181
GlyPro: 0.0 ± 0.0
GlyGln: 1.553 ± 1.119
GlyArg: 6.211 ± 0.181
GlySer: 4.658 ± 1.029
GlyThr: 6.211 ± 2.509
GlyVal: 3.106 ± 2.238
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 4.658 ± 3.357
HisCys: 0.0 ± 0.0
HisAsp: 1.553 ± 1.209
HisGlu: 1.553 ± 1.209
HisPhe: 1.553 ± 1.119
HisGly: 1.553 ± 1.209
HisHis: 0.0 ± 0.0
HisIle: 3.106 ± 0.09
HisLys: 0.0 ± 0.0
HisLeu: 3.106 ± 2.238
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.553 ± 1.119
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 1.553 ± 1.119
HisThr: 0.0 ± 0.0
HisVal: 1.553 ± 1.119
HisTrp: 1.553 ± 1.119
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.553 ± 1.119
IleCys: 3.106 ± 0.09
IleAsp: 6.211 ± 2.509
IleGlu: 1.553 ± 1.119
IlePhe: 4.658 ± 1.029
IleGly: 4.658 ± 1.3
IleHis: 0.0 ± 0.0
IleIle: 4.658 ± 1.3
IleLys: 4.658 ± 1.029
IleLeu: 6.211 ± 0.181
IleMet: 0.0 ± 0.0
IleAsn: 3.106 ± 2.419
IlePro: 1.553 ± 1.209
IleGln: 3.106 ± 0.09
IleArg: 3.106 ± 0.09
IleSer: 3.106 ± 2.238
IleThr: 1.553 ± 1.119
IleVal: 6.211 ± 2.148
IleTrp: 3.106 ± 0.09
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.0 ± 0.0
LysCys: 0.0 ± 0.0
LysAsp: 3.106 ± 2.238
LysGlu: 4.658 ± 1.3
LysPhe: 4.658 ± 3.357
LysGly: 7.764 ± 3.718
LysHis: 0.0 ± 0.0
LysIle: 3.106 ± 0.09
LysLys: 3.106 ± 2.419
LysLeu: 1.553 ± 1.209
LysMet: 0.0 ± 0.794
LysAsn: 3.106 ± 2.419
LysPro: 1.553 ± 1.119
LysGln: 1.553 ± 1.119
LysArg: 6.211 ± 2.509
LysSer: 4.658 ± 1.029
LysThr: 3.106 ± 0.09
LysVal: 1.553 ± 1.209
LysTrp: 3.106 ± 2.238
LysTyr: 3.106 ± 0.09
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.317 ± 4.385
LeuCys: 0.0 ± 0.0
LeuAsp: 4.658 ± 1.029
LeuGlu: 1.553 ± 1.119
LeuPhe: 3.106 ± 2.419
LeuGly: 4.658 ± 1.029
LeuHis: 1.553 ± 1.119
LeuIle: 4.658 ± 3.357
LeuLys: 3.106 ± 2.419
LeuLeu: 1.553 ± 1.209
LeuMet: 0.0 ± 0.0
LeuAsn: 4.658 ± 1.3
LeuPro: 1.553 ± 1.209
LeuGln: 1.553 ± 1.209
LeuArg: 3.106 ± 0.09
LeuSer: 1.553 ± 1.209
LeuThr: 6.211 ± 2.509
LeuVal: 9.317 ± 4.385
LeuTrp: 1.553 ± 1.209
LeuTyr: 6.211 ± 2.148
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.553 ± 1.209
MetCys: 0.0 ± 0.0
MetAsp: 1.553 ± 1.209
MetGlu: 1.553 ± 1.119
MetPhe: 1.553 ± 1.209
MetGly: 1.553 ± 1.209
MetHis: 1.553 ± 1.119
MetIle: 0.0 ± 0.0
MetLys: 3.106 ± 0.09
MetLeu: 3.106 ± 0.09
MetMet: 0.0 ± 0.0
MetAsn: 1.553 ± 1.209
MetPro: 0.0 ± 0.0
MetGln: 1.553 ± 1.209
MetArg: 0.0 ± 0.0
MetSer: 1.553 ± 1.209
MetThr: 1.553 ± 1.119
MetVal: 1.553 ± 1.119
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 7.764 ± 0.938
AsnCys: 0.0 ± 0.0
AsnAsp: 3.106 ± 2.238
AsnGlu: 1.553 ± 1.209
AsnPhe: 0.0 ± 0.0
AsnGly: 3.106 ± 2.419
AsnHis: 4.658 ± 1.029
AsnIle: 1.553 ± 1.119
AsnLys: 3.106 ± 2.419
AsnLeu: 4.658 ± 3.628
AsnMet: 0.0 ± 0.0
AsnAsn: 4.658 ± 1.3
AsnPro: 3.106 ± 2.238
AsnGln: 0.0 ± 0.0
AsnArg: 4.658 ± 1.3
AsnSer: 1.553 ± 1.209
AsnThr: 4.658 ± 1.3
AsnVal: 1.553 ± 1.209
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.553 ± 1.209
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.553 ± 1.119
ProCys: 0.0 ± 0.0
ProAsp: 1.553 ± 1.119
ProGlu: 1.553 ± 1.119
ProPhe: 0.0 ± 0.0
ProGly: 4.658 ± 1.029
ProHis: 3.106 ± 2.238
ProIle: 3.106 ± 2.419
ProLys: 0.0 ± 0.0
ProLeu: 1.553 ± 1.119
ProMet: 0.0 ± 0.0
ProAsn: 3.106 ± 0.09
ProPro: 0.0 ± 0.0
ProGln: 1.553 ± 1.119
ProArg: 3.106 ± 2.238
ProSer: 0.0 ± 0.0
ProThr: 3.106 ± 2.419
ProVal: 0.0 ± 0.0
ProTrp: 3.106 ± 0.09
ProTyr: 1.553 ± 1.119
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.106 ± 0.09
GlnCys: 1.553 ± 1.119
GlnAsp: 4.658 ± 1.3
GlnGlu: 0.0 ± 0.0
GlnPhe: 0.0 ± 0.0
GlnGly: 0.0 ± 0.0
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 0.0 ± 0.0
GlnLeu: 1.553 ± 1.119
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 1.553 ± 1.119
GlnGln: 0.0 ± 0.0
GlnArg: 3.106 ± 0.09
GlnSer: 4.658 ± 1.029
GlnThr: 1.553 ± 1.209
GlnVal: 1.553 ± 1.119
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.553 ± 1.209
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.658 ± 1.3
ArgCys: 0.0 ± 0.0
ArgAsp: 6.211 ± 4.476
ArgGlu: 3.106 ± 2.238
ArgPhe: 1.553 ± 1.209
ArgGly: 3.106 ± 0.09
ArgHis: 0.0 ± 0.0
ArgIle: 6.211 ± 2.509
ArgLys: 7.764 ± 1.39
ArgLeu: 3.106 ± 0.09
ArgMet: 1.553 ± 1.119
ArgAsn: 3.106 ± 0.09
ArgPro: 1.553 ± 1.209
ArgGln: 1.553 ± 1.119
ArgArg: 9.317 ± 4.928
ArgSer: 7.764 ± 1.39
ArgThr: 6.211 ± 2.509
ArgVal: 0.0 ± 0.0
ArgTrp: 0.0 ± 0.0
ArgTyr: 6.211 ± 0.181
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.106 ± 0.09
SerCys: 1.553 ± 1.209
SerAsp: 3.106 ± 0.09
SerGlu: 3.106 ± 0.09
SerPhe: 1.553 ± 1.209
SerGly: 7.764 ± 1.39
SerHis: 1.553 ± 1.119
SerIle: 6.211 ± 0.181
SerLys: 3.106 ± 0.09
SerLeu: 4.658 ± 3.357
SerMet: 1.553 ± 1.119
SerAsn: 3.106 ± 2.419
SerPro: 0.0 ± 0.0
SerGln: 1.553 ± 1.119
SerArg: 7.764 ± 1.39
SerSer: 3.106 ± 0.09
SerThr: 12.422 ± 9.675
SerVal: 1.553 ± 1.209
SerTrp: 3.106 ± 2.238
SerTyr: 3.106 ± 2.419
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.553 ± 1.209
ThrCys: 0.0 ± 0.0
ThrAsp: 0.0 ± 0.0
ThrGlu: 1.553 ± 1.119
ThrPhe: 1.553 ± 1.119
ThrGly: 7.764 ± 1.39
ThrHis: 1.553 ± 1.119
ThrIle: 4.658 ± 3.628
ThrLys: 3.106 ± 0.09
ThrLeu: 7.764 ± 1.39
ThrMet: 0.0 ± 0.0
ThrAsn: 1.553 ± 1.209
ThrPro: 1.553 ± 1.209
ThrGln: 0.0 ± 0.0
ThrArg: 1.553 ± 1.209
ThrSer: 21.739 ± 14.602
ThrThr: 3.106 ± 0.09
ThrVal: 3.106 ± 2.419
ThrTrp: 3.106 ± 0.09
ThrTyr: 3.106 ± 0.09
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.764 ± 1.39
ValCys: 1.553 ± 1.119
ValAsp: 4.658 ± 1.029
ValGlu: 0.0 ± 0.0
ValPhe: 1.553 ± 1.119
ValGly: 1.553 ± 1.119
ValHis: 0.0 ± 0.0
ValIle: 1.553 ± 1.119
ValLys: 3.106 ± 2.238
ValLeu: 1.553 ± 1.119
ValMet: 1.553 ± 1.209
ValAsn: 1.553 ± 1.119
ValPro: 1.553 ± 1.119
ValGln: 3.106 ± 2.419
ValArg: 1.553 ± 1.119
ValSer: 4.658 ± 3.357
ValThr: 3.106 ± 0.09
ValVal: 0.0 ± 0.0
ValTrp: 0.0 ± 0.0
ValTyr: 1.553 ± 1.119
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 4.658 ± 1.029
TrpCys: 1.553 ± 1.209
TrpAsp: 1.553 ± 1.119
TrpGlu: 0.0 ± 0.0
TrpPhe: 3.106 ± 0.09
TrpGly: 1.553 ± 1.119
TrpHis: 3.106 ± 2.419
TrpIle: 3.106 ± 0.09
TrpLys: 3.106 ± 2.238
TrpLeu: 0.0 ± 0.0
TrpMet: 1.553 ± 1.119
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 3.106 ± 0.09
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.106 ± 2.238
TyrCys: 0.0 ± 0.0
TyrAsp: 3.106 ± 2.419
TyrGlu: 1.553 ± 1.119
TyrPhe: 6.211 ± 0.181
TyrGly: 1.553 ± 1.119
TyrHis: 0.0 ± 0.0
TyrIle: 1.553 ± 1.209
TyrLys: 1.553 ± 1.119
TyrLeu: 1.553 ± 1.209
TyrMet: 3.106 ± 0.09
TyrAsn: 1.553 ± 1.119
TyrPro: 4.658 ± 1.3
TyrGln: 0.0 ± 0.0
TyrArg: 3.106 ± 2.419
TyrSer: 3.106 ± 0.09
TyrThr: 1.553 ± 1.209
TyrVal: 0.0 ± 0.0
TyrTrp: 3.106 ± 0.09
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (645 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
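One way to picture a protein-level bootstrap is sketched below. It is an illustration under stated assumptions, not the database's implementation: proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. The function names, the use of the population standard deviation, and the fixed seed are all assumptions made for this example.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    total = 0
    hits = 0
    for seq in sequences:
        # Overlapping pairs inside each sequence.
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Spread of the per mille estimate over protein-level resamples.

    Each round draws len(sequences) whole proteins with replacement
    (assumed procedure) and recomputes the frequency.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(rounds):
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    # Toy sequences in one-letter code, not the real proteome.
    proteins = ["MKTSTSAA", "MAAGGTS"]
    value = pair_permille(proteins, "TS")
    error = bootstrap_error(proteins, "TS")
    print(f"ThrSer: {value:.3f} ± {error:.3f}")
```

With only 2 proteins, each resample can take just a few distinct compositions, which would explain why the same handful of error values recurs throughout the table.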

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski