Amino acid dipeptide frequency for Faeces associated gemycircularvirus 22

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
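As an illustration of the counting described above, here is a minimal Python sketch. It is not the code used by Proteome-pI: the function names, the one-letter alphabet (with X standing for Xaa), the choice to count overlapping pairs within each protein only, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# One-letter codes in the column order of the table; X stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues is mapped to X.
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        # Assumption: pairs are counted within each protein, not across proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one protein with two or more residues")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


toy = ["MKKA", "GGDG"]  # made-up sequences, not the real proteome
freqs = dipeptide_per_mille(toy)
```

With this sketch, classify(6.849) yields "over" and classify(1.142) yields "under", which mirrors the red and blue marking described above.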

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
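The unit conversion can be checked with a few lines of Python. This is only a worked example; the helper name per_mille_to_fraction is hypothetical, and the input is the AlaAla value from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = per_mille_to_fraction(6.849)  # AlaAla value from the table
uniform = 1 / 400                       # uniform expectation as a fraction

print(f"AlaAla: {ala_ala:.6f} as a fraction, {ala_ala * 100:.4f}%")
print(f"uniform expectation: {uniform * 1000:.1f} per mille")
```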

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.849 ± 2.922
AlaCys: 2.283 ± 0.974
AlaAsp: 6.849 ± 1.487
AlaGlu: 5.708 ± 1.186
AlaPhe: 1.142 ± 0.821
AlaGly: 3.425 ± 2.672
AlaHis: 1.142 ± 0.821
AlaIle: 5.708 ± 0.577
AlaLys: 6.849 ± 1.487
AlaLeu: 6.849 ± 1.487
AlaMet: 1.142 ± 0.891
AlaAsn: 2.283 ± 0.974
AlaPro: 6.849 ± 1.684
AlaGln: 4.566 ± 0.688
AlaArg: 6.849 ± 1.487
AlaSer: 1.142 ± 0.821
AlaThr: 0.0 ± 0.0
AlaVal: 1.142 ± 0.821
AlaTrp: 3.425 ± 0.235
AlaTyr: 1.142 ± 0.821
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 2.283 ± 0.974
CysGlu: 0.0 ± 0.0
CysPhe: 3.425 ± 0.235
CysGly: 2.283 ± 0.974
CysHis: 0.0 ± 0.0
CysIle: 3.425 ± 1.483
CysLys: 0.0 ± 0.0
CysLeu: 1.142 ± 1.022
CysMet: 1.142 ± 0.821
CysAsn: 5.708 ± 2.735
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.142 ± 1.022
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.142 ± 0.891
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.283 ± 0.974
AspCys: 0.0 ± 0.0
AspAsp: 5.708 ± 2.057
AspGlu: 2.283 ± 1.781
AspPhe: 2.283 ± 0.974
AspGly: 12.557 ± 5.6
AspHis: 2.283 ± 0.974
AspIle: 5.708 ± 0.577
AspLys: 2.283 ± 1.781
AspLeu: 1.142 ± 0.891
AspMet: 0.0 ± 0.0
AspAsn: 1.142 ± 0.891
AspPro: 9.132 ± 2.418
AspGln: 2.283 ± 0.974
AspArg: 3.425 ± 0.235
AspSer: 2.283 ± 0.796
AspThr: 1.142 ± 0.821
AspVal: 9.132 ± 2.418
AspTrp: 5.708 ± 1.102
AspTyr: 4.566 ± 0.865
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.425 ± 2.462
GluCys: 2.283 ± 0.974
GluAsp: 0.0 ± 0.0
GluGlu: 2.283 ± 0.974
GluPhe: 3.425 ± 0.235
GluGly: 0.0 ± 0.0
GluHis: 3.425 ± 0.235
GluIle: 3.425 ± 1.483
GluLys: 2.283 ± 1.781
GluLeu: 4.566 ± 2.191
GluMet: 0.0 ± 0.0
GluAsn: 2.283 ± 1.781
GluPro: 3.425 ± 1.483
GluGln: 0.0 ± 0.0
GluArg: 3.425 ± 0.235
GluSer: 2.283 ± 0.974
GluThr: 2.283 ± 0.992
GluVal: 3.425 ± 0.235
GluTrp: 1.142 ± 0.821
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.283 ± 0.974
PheCys: 0.0 ± 0.0
PheAsp: 4.566 ± 0.688
PheGlu: 0.0 ± 0.0
PhePhe: 1.142 ± 0.821
PheGly: 4.566 ± 1.948
PheHis: 3.425 ± 0.235
PheIle: 4.566 ± 0.865
PheLys: 2.283 ± 0.796
PheLeu: 0.0 ± 0.0
PheMet: 1.142 ± 0.891
PheAsn: 0.0 ± 0.0
PhePro: 1.142 ± 0.821
PheGln: 0.0 ± 0.0
PheArg: 6.849 ± 1.487
PheSer: 3.425 ± 0.235
PheThr: 1.142 ± 0.891
PheVal: 3.425 ± 1.483
PheTrp: 3.425 ± 0.235
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.132 ± 2.418
GlyCys: 0.0 ± 0.0
GlyAsp: 11.416 ± 0.636
GlyGlu: 1.142 ± 0.821
GlyPhe: 2.283 ± 0.974
GlyGly: 14.84 ± 4.983
GlyHis: 0.0 ± 0.0
GlyIle: 7.991 ± 3.308
GlyLys: 3.425 ± 1.476
GlyLeu: 7.991 ± 1.483
GlyMet: 2.283 ± 0.796
GlyAsn: 1.142 ± 0.891
GlyPro: 1.142 ± 1.022
GlyGln: 3.425 ± 0.235
GlyArg: 11.416 ± 4.743
GlySer: 2.283 ± 1.781
GlyThr: 7.991 ± 2.175
GlyVal: 4.566 ± 2.372
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.283 ± 0.974
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.283 ± 0.974
HisCys: 2.283 ± 0.974
HisAsp: 0.0 ± 0.0
HisGlu: 1.142 ± 0.891
HisPhe: 7.991 ± 2.067
HisGly: 0.0 ± 0.0
HisHis: 2.283 ± 0.974
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 5.708 ± 1.102
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 3.425 ± 0.235
HisGln: 0.0 ± 0.0
HisArg: 1.142 ± 0.891
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 1.142 ± 0.821
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.566 ± 0.865
IleCys: 3.425 ± 0.235
IleAsp: 4.566 ± 0.865
IleGlu: 3.425 ± 0.235
IlePhe: 7.991 ± 1.37
IleGly: 3.425 ± 1.82
IleHis: 2.283 ± 0.974
IleIle: 1.142 ± 0.891
IleLys: 4.566 ± 2.191
IleLeu: 1.142 ± 0.891
IleMet: 0.0 ± 0.0
IleAsn: 2.283 ± 1.781
IlePro: 1.142 ± 0.891
IleGln: 1.142 ± 0.891
IleArg: 4.566 ± 0.688
IleSer: 3.425 ± 1.476
IleThr: 7.991 ± 0.619
IleVal: 2.283 ± 0.974
IleTrp: 1.142 ± 0.891
IleTyr: 1.142 ± 0.891
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.566 ± 0.865
LysCys: 0.0 ± 0.0
LysAsp: 5.708 ± 1.102
LysGlu: 2.283 ± 1.641
LysPhe: 4.566 ± 0.688
LysGly: 2.283 ± 1.781
LysHis: 0.0 ± 0.0
LysIle: 1.142 ± 0.891
LysLys: 4.566 ± 2.305
LysLeu: 1.142 ± 0.891
LysMet: 1.142 ± 0.711
LysAsn: 1.142 ± 0.891
LysPro: 3.425 ± 0.235
LysGln: 1.142 ± 0.891
LysArg: 6.849 ± 1.334
LysSer: 0.0 ± 0.0
LysThr: 3.425 ± 1.476
LysVal: 1.142 ± 0.821
LysTrp: 0.0 ± 0.0
LysTyr: 3.425 ± 1.483
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.566 ± 1.252
LeuCys: 0.0 ± 0.0
LeuAsp: 5.708 ± 1.102
LeuGlu: 3.425 ± 1.483
LeuPhe: 0.0 ± 0.0
LeuGly: 6.849 ± 1.487
LeuHis: 2.283 ± 0.974
LeuIle: 6.849 ± 0.471
LeuLys: 1.142 ± 0.821
LeuLeu: 0.0 ± 0.0
LeuMet: 0.0 ± 0.0
LeuAsn: 2.283 ± 1.781
LeuPro: 1.142 ± 1.022
LeuGln: 1.142 ± 0.891
LeuArg: 3.425 ± 2.672
LeuSer: 3.425 ± 1.585
LeuThr: 1.142 ± 0.891
LeuVal: 6.849 ± 2.313
LeuTrp: 2.283 ± 0.796
LeuTyr: 2.283 ± 1.641
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.283 ± 1.781
MetCys: 0.0 ± 0.0
MetAsp: 3.425 ± 0.235
MetGlu: 2.283 ± 0.796
MetPhe: 0.0 ± 0.0
MetGly: 1.142 ± 0.891
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 2.283 ± 1.781
MetMet: 0.0 ± 0.0
MetAsn: 2.283 ± 1.781
MetPro: 2.283 ± 0.974
MetGln: 1.142 ± 0.891
MetArg: 1.142 ± 0.891
MetSer: 1.142 ± 0.821
MetThr: 0.0 ± 0.0
MetVal: 1.142 ± 0.821
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 6.849 ± 2.011
AsnCys: 2.283 ± 0.974
AsnAsp: 1.142 ± 0.821
AsnGlu: 1.142 ± 0.891
AsnPhe: 0.0 ± 0.0
AsnGly: 4.566 ± 3.563
AsnHis: 0.0 ± 0.0
AsnIle: 4.566 ± 0.865
AsnLys: 1.142 ± 0.891
AsnLeu: 1.142 ± 0.891
AsnMet: 2.283 ± 1.781
AsnAsn: 2.283 ± 1.781
AsnPro: 1.142 ± 0.891
AsnGln: 0.0 ± 0.0
AsnArg: 0.0 ± 0.0
AsnSer: 3.425 ± 2.672
AsnThr: 4.566 ± 3.563
AsnVal: 0.0 ± 0.0
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.283 ± 0.974
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.283 ± 1.641
ProCys: 0.0 ± 0.0
ProAsp: 1.142 ± 1.022
ProGlu: 3.425 ± 0.235
ProPhe: 2.283 ± 0.974
ProGly: 3.425 ± 0.235
ProHis: 0.0 ± 0.0
ProIle: 1.142 ± 0.891
ProLys: 2.283 ± 0.974
ProLeu: 4.566 ± 0.865
ProMet: 1.142 ± 0.891
ProAsn: 4.566 ± 0.865
ProPro: 3.425 ± 1.82
ProGln: 1.142 ± 1.022
ProArg: 5.708 ± 1.102
ProSer: 5.708 ± 1.102
ProThr: 1.142 ± 0.891
ProVal: 3.425 ± 0.235
ProTrp: 4.566 ± 1.948
ProTyr: 3.425 ± 1.585
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.142 ± 0.821
GlnCys: 2.283 ± 0.974
GlnAsp: 0.0 ± 0.0
GlnGlu: 0.0 ± 0.0
GlnPhe: 0.0 ± 0.0
GlnGly: 3.425 ± 1.585
GlnHis: 0.0 ± 0.0
GlnIle: 1.142 ± 0.891
GlnLys: 0.0 ± 0.0
GlnLeu: 2.283 ± 0.974
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 0.0 ± 0.0
GlnGln: 1.142 ± 0.891
GlnArg: 1.142 ± 0.891
GlnSer: 2.283 ± 0.974
GlnThr: 1.142 ± 0.891
GlnVal: 2.283 ± 1.781
GlnTrp: 1.142 ± 0.891
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.142 ± 0.821
ArgCys: 2.283 ± 0.974
ArgAsp: 4.566 ± 1.948
ArgGlu: 5.708 ± 2.371
ArgPhe: 3.425 ± 0.235
ArgGly: 4.566 ± 0.688
ArgHis: 2.283 ± 0.974
ArgIle: 2.283 ± 1.781
ArgLys: 5.708 ± 1.741
ArgLeu: 4.566 ± 0.865
ArgMet: 2.283 ± 0.553
ArgAsn: 1.142 ± 0.891
ArgPro: 4.566 ± 0.688
ArgGln: 0.0 ± 0.0
ArgArg: 9.132 ± 4.404
ArgSer: 11.416 ± 2.204
ArgThr: 3.425 ± 0.235
ArgVal: 6.849 ± 1.487
ArgTrp: 1.142 ± 0.821
ArgTyr: 6.849 ± 2.626
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.283 ± 1.781
SerCys: 0.0 ± 0.0
SerAsp: 2.283 ± 0.796
SerGlu: 0.0 ± 0.0
SerPhe: 0.0 ± 0.0
SerGly: 12.557 ± 1.442
SerHis: 1.142 ± 0.891
SerIle: 0.0 ± 0.0
SerLys: 4.566 ± 0.865
SerLeu: 4.566 ± 0.688
SerMet: 0.0 ± 0.0
SerAsn: 3.425 ± 1.476
SerPro: 4.566 ± 1.252
SerGln: 1.142 ± 0.891
SerArg: 6.849 ± 0.471
SerSer: 5.708 ± 3.214
SerThr: 3.425 ± 2.672
SerVal: 4.566 ± 0.865
SerTrp: 1.142 ± 0.821
SerTyr: 2.283 ± 0.974
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.425 ± 2.672
ThrCys: 0.0 ± 0.0
ThrAsp: 5.708 ± 1.741
ThrGlu: 3.425 ± 1.483
ThrPhe: 0.0 ± 0.0
ThrGly: 1.142 ± 0.891
ThrHis: 0.0 ± 0.0
ThrIle: 5.708 ± 1.741
ThrLys: 3.425 ± 0.235
ThrLeu: 2.283 ± 0.796
ThrMet: 1.142 ± 0.891
ThrAsn: 4.566 ± 0.865
ThrPro: 5.708 ± 1.186
ThrGln: 0.0 ± 0.0
ThrArg: 1.142 ± 0.891
ThrSer: 7.991 ± 3.935
ThrThr: 1.142 ± 0.891
ThrVal: 3.425 ± 1.349
ThrTrp: 1.142 ± 0.891
ThrTyr: 2.283 ± 0.974
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.425 ± 1.476
ValCys: 2.283 ± 0.992
ValAsp: 4.566 ± 1.948
ValGlu: 1.142 ± 0.821
ValPhe: 2.283 ± 0.796
ValGly: 6.849 ± 2.967
ValHis: 4.566 ± 1.948
ValIle: 2.283 ± 0.974
ValLys: 3.425 ± 1.476
ValLeu: 1.142 ± 0.821
ValMet: 3.425 ± 1.483
ValAsn: 1.142 ± 0.891
ValPro: 2.283 ± 1.781
ValGln: 0.0 ± 0.0
ValArg: 2.283 ± 0.974
ValSer: 2.283 ± 1.641
ValThr: 7.991 ± 2.194
ValVal: 0.0 ± 0.0
ValTrp: 1.142 ± 0.821
ValTyr: 4.566 ± 0.688
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.142 ± 0.821
TrpCys: 2.283 ± 0.974
TrpAsp: 2.283 ± 0.974
TrpGlu: 2.283 ± 0.974
TrpPhe: 1.142 ± 0.891
TrpGly: 1.142 ± 0.821
TrpHis: 2.283 ± 1.781
TrpIle: 1.142 ± 0.821
TrpLys: 0.0 ± 0.0
TrpLeu: 2.283 ± 1.641
TrpMet: 0.0 ± 0.0
TrpAsn: 1.142 ± 0.891
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 4.566 ± 1.948
TrpSer: 2.283 ± 1.781
TrpThr: 3.425 ± 0.235
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 11.416 ± 3.373
TyrCys: 0.0 ± 0.0
TyrAsp: 3.425 ± 0.235
TyrGlu: 2.283 ± 0.974
TyrPhe: 0.0 ± 0.0
TyrGly: 5.708 ± 2.371
TyrHis: 0.0 ± 0.0
TyrIle: 3.425 ± 2.672
TyrLys: 0.0 ± 0.0
TyrLeu: 0.0 ± 0.0
TyrMet: 2.283 ± 1.781
TyrAsn: 0.0 ± 0.0
TyrPro: 0.0 ± 0.0
TyrGln: 1.142 ± 0.891
TyrArg: 3.425 ± 1.476
TyrSer: 0.0 ± 0.0
TyrThr: 2.283 ± 0.974
TyrVal: 2.283 ± 0.796
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.142 ± 0.891
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (877 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
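A protein-level bootstrap of this kind could look roughly like the sketch below. The exact procedure used by Proteome-pI is not described here, so the details are assumptions: whole proteins are resampled with replacement, the proteome size is kept fixed, and the error is taken as the population standard deviation across replicates. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the reported error is a standard deviation over replicates.
    return statistics.pstdev(estimates)


toy = ["MKKAGG", "GGDGKK", "AKKDGG"]  # made-up sequences
err = bootstrap_error(toy, "KK")
```

With only 3 proteins, as in this proteome, each resample is drawn from very few units, so error estimates obtained this way should be read with caution.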

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski