Amino acid dipeptide frequency for Sewage-associated circular DNA virus-30

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
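The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build this page: the function names `dipeptide_permille` and `classify` and the toy sequences are made up for the example, sequences use one-letter codes rather than the three-letter codes of the table, and the 2.5‰ threshold is simply the uniform expectation for 400 dipeptides.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences.

    Hypothetical helper: counts overlapping adjacent residue pairs within
    each protein; pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input (made-up sequences, one-letter amino acid codes).
toy = ["MKKLA", "AKKA"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(f"{pair}: {freqs[pair]:.3f} per mille ({classify(freqs[pair])})")
```

Multiplying any printed value by 10⁻³ gives the corresponding fraction. For example, a table entry of 7.022‰ corresponds to a fraction of 0.007022, which is above the uniform expectation of 0.0025.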

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.022 ± 1.599
AlaCys: 0.0 ± 0.0
AlaAsp: 1.404 ± 0.757
AlaGlu: 1.404 ± 0.757
AlaPhe: 1.404 ± 0.757
AlaGly: 4.213 ± 2.102
AlaHis: 0.0 ± 0.0
AlaIle: 1.404 ± 0.757
AlaLys: 1.404 ± 0.757
AlaLeu: 2.809 ± 2.859
AlaMet: 1.404 ± 0.757
AlaAsn: 4.213 ± 2.102
AlaPro: 4.213 ± 2.271
AlaGln: 2.809 ± 1.514
AlaArg: 7.022 ± 1.599
AlaSer: 8.427 ± 4.205
AlaThr: 4.213 ± 4.289
AlaVal: 4.213 ± 2.102
AlaTrp: 1.404 ± 0.757
AlaTyr: 7.022 ± 0.588
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 2.809 ± 0.673
CysGlu: 1.404 ± 0.757
CysPhe: 1.404 ± 0.757
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.404 ± 1.43
CysLys: 0.0 ± 0.0
CysLeu: 1.404 ± 0.757
CysMet: 0.0 ± 0.0
CysAsn: 1.404 ± 0.757
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 1.404 ± 0.757
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.404 ± 0.757
AspCys: 0.0 ± 0.0
AspAsp: 5.618 ± 3.028
AspGlu: 5.618 ± 0.842
AspPhe: 2.809 ± 1.514
AspGly: 2.809 ± 1.514
AspHis: 1.404 ± 1.43
AspIle: 1.404 ± 1.43
AspLys: 4.213 ± 0.084
AspLeu: 5.618 ± 0.842
AspMet: 0.0 ± 0.0
AspAsn: 5.618 ± 0.842
AspPro: 5.618 ± 3.028
AspGln: 1.404 ± 0.757
AspArg: 2.809 ± 1.514
AspSer: 0.0 ± 0.0
AspThr: 2.809 ± 2.859
AspVal: 4.213 ± 0.084
AspTrp: 1.404 ± 0.757
AspTyr: 4.213 ± 2.271
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.809 ± 1.514
GluCys: 4.213 ± 0.084
GluAsp: 1.404 ± 0.757
GluGlu: 4.213 ± 2.271
GluPhe: 4.213 ± 2.271
GluGly: 5.618 ± 0.842
GluHis: 1.404 ± 0.757
GluIle: 2.809 ± 1.514
GluLys: 5.618 ± 0.842
GluLeu: 7.022 ± 0.588
GluMet: 0.0 ± 0.0
GluAsn: 4.213 ± 2.271
GluPro: 1.404 ± 0.757
GluGln: 0.0 ± 0.0
GluArg: 4.213 ± 2.271
GluSer: 1.404 ± 1.43
GluThr: 1.404 ± 0.757
GluVal: 0.0 ± 0.0
GluTrp: 1.404 ± 1.43
GluTyr: 1.404 ± 0.757
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.213 ± 2.102
PheCys: 0.0 ± 0.0
PheAsp: 11.236 ± 3.87
PheGlu: 0.0 ± 0.0
PhePhe: 0.0 ± 0.0
PheGly: 2.809 ± 0.673
PheHis: 2.809 ± 0.673
PheIle: 0.0 ± 0.0
PheLys: 2.809 ± 0.673
PheLeu: 5.618 ± 3.028
PheMet: 1.404 ± 0.757
PheAsn: 2.809 ± 0.673
PhePro: 5.618 ± 3.028
PheGln: 0.0 ± 0.0
PheArg: 7.022 ± 2.775
PheSer: 1.404 ± 1.43
PheThr: 1.404 ± 0.757
PheVal: 2.809 ± 1.514
PheTrp: 0.0 ± 0.0
PheTyr: 1.404 ± 0.757
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.213 ± 2.102
GlyCys: 1.404 ± 1.43
GlyAsp: 5.618 ± 3.028
GlyGlu: 1.404 ± 1.43
GlyPhe: 0.0 ± 0.0
GlyGly: 4.213 ± 2.102
GlyHis: 1.404 ± 1.43
GlyIle: 5.618 ± 3.532
GlyLys: 4.213 ± 2.271
GlyLeu: 0.0 ± 0.0
GlyMet: 2.809 ± 1.514
GlyAsn: 2.809 ± 1.514
GlyPro: 2.809 ± 1.514
GlyGln: 2.809 ± 0.673
GlyArg: 8.427 ± 4.205
GlySer: 2.809 ± 0.673
GlyThr: 4.213 ± 4.289
GlyVal: 5.618 ± 3.532
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.404 ± 0.757
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.404 ± 0.757
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 4.213 ± 2.271
HisHis: 0.0 ± 0.0
HisIle: 1.404 ± 0.757
HisLys: 1.404 ± 1.43
HisLeu: 2.809 ± 0.673
HisMet: 0.0 ± 0.884
HisAsn: 0.0 ± 0.0
HisPro: 1.404 ± 0.757
HisGln: 2.809 ± 1.514
HisArg: 1.404 ± 0.757
HisSer: 1.404 ± 0.757
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 1.404 ± 0.757
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.213 ± 0.084
IleCys: 1.404 ± 0.757
IleAsp: 1.404 ± 0.757
IleGlu: 2.809 ± 0.673
IlePhe: 5.618 ± 0.842
IleGly: 2.809 ± 0.673
IleHis: 0.0 ± 0.0
IleIle: 2.809 ± 0.673
IleLys: 1.404 ± 0.757
IleLeu: 4.213 ± 0.084
IleMet: 2.809 ± 1.253
IleAsn: 2.809 ± 2.859
IlePro: 1.404 ± 1.43
IleGln: 2.809 ± 0.673
IleArg: 1.404 ± 0.757
IleSer: 4.213 ± 0.084
IleThr: 2.809 ± 0.673
IleVal: 4.213 ± 2.271
IleTrp: 0.0 ± 0.0
IleTyr: 2.809 ± 0.673
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.404 ± 0.757
LysCys: 1.404 ± 0.757
LysAsp: 4.213 ± 2.271
LysGlu: 7.022 ± 1.599
LysPhe: 2.809 ± 0.673
LysGly: 2.809 ± 0.673
LysHis: 4.213 ± 2.271
LysIle: 1.404 ± 0.757
LysLys: 2.809 ± 0.673
LysLeu: 7.022 ± 1.599
LysMet: 1.404 ± 0.757
LysAsn: 2.809 ± 1.514
LysPro: 0.0 ± 0.0
LysGln: 2.809 ± 0.673
LysArg: 4.213 ± 0.084
LysSer: 4.213 ± 0.084
LysThr: 1.404 ± 0.757
LysVal: 4.213 ± 0.084
LysTrp: 0.0 ± 0.0
LysTyr: 1.404 ± 1.43
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.404 ± 0.757
LeuCys: 0.0 ± 0.0
LeuAsp: 5.618 ± 1.345
LeuGlu: 4.213 ± 2.271
LeuPhe: 2.809 ± 1.514
LeuGly: 0.0 ± 0.0
LeuHis: 5.618 ± 3.028
LeuIle: 2.809 ± 0.673
LeuLys: 7.022 ± 1.599
LeuLeu: 5.618 ± 0.842
LeuMet: 1.404 ± 1.43
LeuAsn: 4.213 ± 0.084
LeuPro: 5.618 ± 3.028
LeuGln: 1.404 ± 0.757
LeuArg: 7.022 ± 4.962
LeuSer: 4.213 ± 2.271
LeuThr: 9.831 ± 1.261
LeuVal: 4.213 ± 2.102
LeuTrp: 2.809 ± 0.673
LeuTyr: 4.213 ± 0.084
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.809 ± 0.673
MetCys: 0.0 ± 0.0
MetAsp: 2.809 ± 2.859
MetGlu: 1.404 ± 0.757
MetPhe: 1.404 ± 1.43
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 1.404 ± 0.757
MetLys: 1.404 ± 0.757
MetLeu: 5.618 ± 3.028
MetMet: 4.213 ± 2.271
MetAsn: 0.0 ± 0.0
MetPro: 1.404 ± 1.43
MetGln: 1.404 ± 0.757
MetArg: 0.0 ± 0.0
MetSer: 2.809 ± 1.514
MetThr: 1.404 ± 1.43
MetVal: 0.0 ± 0.0
MetTrp: 1.404 ± 1.43
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.213 ± 0.084
AsnCys: 0.0 ± 0.0
AsnAsp: 2.809 ± 1.514
AsnGlu: 4.213 ± 2.271
AsnPhe: 5.618 ± 0.842
AsnGly: 4.213 ± 4.289
AsnHis: 1.404 ± 0.757
AsnIle: 5.618 ± 0.842
AsnLys: 4.213 ± 0.084
AsnLeu: 4.213 ± 2.271
AsnMet: 2.809 ± 2.859
AsnAsn: 5.618 ± 0.842
AsnPro: 5.618 ± 3.532
AsnGln: 0.0 ± 0.0
AsnArg: 4.213 ± 2.102
AsnSer: 1.404 ± 0.757
AsnThr: 1.404 ± 1.43
AsnVal: 0.0 ± 0.0
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.404 ± 0.757
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.618 ± 0.842
ProCys: 0.0 ± 0.0
ProAsp: 0.0 ± 0.0
ProGlu: 5.618 ± 3.028
ProPhe: 2.809 ± 1.514
ProGly: 5.618 ± 3.028
ProHis: 0.0 ± 0.0
ProIle: 7.022 ± 1.599
ProLys: 2.809 ± 1.514
ProLeu: 5.618 ± 0.842
ProMet: 1.404 ± 1.43
ProAsn: 2.809 ± 0.673
ProPro: 11.236 ± 6.057
ProGln: 4.213 ± 2.271
ProArg: 1.404 ± 1.43
ProSer: 0.0 ± 0.0
ProThr: 5.618 ± 3.532
ProVal: 1.404 ± 0.757
ProTrp: 0.0 ± 0.0
ProTyr: 1.404 ± 0.757
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.213 ± 0.084
GlnCys: 1.404 ± 0.757
GlnAsp: 1.404 ± 0.757
GlnGlu: 2.809 ± 0.673
GlnPhe: 1.404 ± 0.757
GlnGly: 1.404 ± 0.757
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 4.213 ± 2.271
GlnLeu: 1.404 ± 0.757
GlnMet: 1.404 ± 0.757
GlnAsn: 0.0 ± 0.0
GlnPro: 4.213 ± 0.084
GlnGln: 2.809 ± 1.514
GlnArg: 0.0 ± 0.0
GlnSer: 4.213 ± 0.084
GlnThr: 4.213 ± 0.084
GlnVal: 1.404 ± 0.757
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.404 ± 0.757
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.427 ± 2.018
ArgCys: 0.0 ± 0.0
ArgAsp: 2.809 ± 1.514
ArgGlu: 4.213 ± 2.271
ArgPhe: 5.618 ± 1.345
ArgGly: 5.618 ± 3.532
ArgHis: 0.0 ± 0.0
ArgIle: 4.213 ± 0.084
ArgLys: 2.809 ± 0.673
ArgLeu: 0.0 ± 0.0
ArgMet: 0.0 ± 0.0
ArgAsn: 4.213 ± 0.084
ArgPro: 1.404 ± 0.757
ArgGln: 4.213 ± 2.271
ArgArg: 9.831 ± 5.634
ArgSer: 2.809 ± 1.514
ArgThr: 5.618 ± 5.719
ArgVal: 1.404 ± 0.757
ArgTrp: 1.404 ± 0.757
ArgTyr: 5.618 ± 3.532
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.618 ± 3.532
SerCys: 0.0 ± 0.0
SerAsp: 0.0 ± 0.0
SerGlu: 1.404 ± 0.757
SerPhe: 2.809 ± 0.673
SerGly: 4.213 ± 2.102
SerHis: 0.0 ± 0.0
SerIle: 2.809 ± 0.673
SerLys: 4.213 ± 2.271
SerLeu: 5.618 ± 0.842
SerMet: 1.404 ± 0.757
SerAsn: 2.809 ± 1.514
SerPro: 0.0 ± 0.0
SerGln: 5.618 ± 0.842
SerArg: 7.022 ± 3.785
SerSer: 1.404 ± 1.43
SerThr: 8.427 ± 4.205
SerVal: 1.404 ± 1.43
SerTrp: 2.809 ± 0.673
SerTyr: 1.404 ± 1.43
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.618 ± 3.532
ThrCys: 0.0 ± 0.0
ThrAsp: 1.404 ± 1.43
ThrGlu: 2.809 ± 0.673
ThrPhe: 4.213 ± 2.102
ThrGly: 2.809 ± 0.673
ThrHis: 0.0 ± 0.0
ThrIle: 5.618 ± 1.345
ThrLys: 1.404 ± 1.43
ThrLeu: 4.213 ± 4.289
ThrMet: 0.0 ± 0.0
ThrAsn: 7.022 ± 2.775
ThrPro: 5.618 ± 1.345
ThrGln: 2.809 ± 0.673
ThrArg: 1.404 ± 1.43
ThrSer: 9.831 ± 3.448
ThrThr: 2.809 ± 0.673
ThrVal: 0.0 ± 0.0
ThrTrp: 0.0 ± 0.0
ThrTyr: 4.213 ± 2.102
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 1.404 ± 0.757
ValAsp: 2.809 ± 1.514
ValGlu: 2.809 ± 0.673
ValPhe: 1.404 ± 0.757
ValGly: 2.809 ± 2.859
ValHis: 0.0 ± 0.0
ValIle: 2.809 ± 1.514
ValLys: 5.618 ± 0.842
ValLeu: 4.213 ± 2.102
ValMet: 4.213 ± 0.084
ValAsn: 1.404 ± 1.43
ValPro: 2.809 ± 1.514
ValGln: 0.0 ± 0.0
ValArg: 1.404 ± 1.43
ValSer: 4.213 ± 2.102
ValThr: 1.404 ± 0.757
ValVal: 5.618 ± 3.028
ValTrp: 1.404 ± 1.43
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 2.809 ± 0.673
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 1.404 ± 1.43
TrpHis: 0.0 ± 0.0
TrpIle: 1.404 ± 0.757
TrpLys: 0.0 ± 0.0
TrpLeu: 1.404 ± 0.757
TrpMet: 0.0 ± 0.0
TrpAsn: 4.213 ± 2.102
TrpPro: 1.404 ± 0.757
TrpGln: 0.0 ± 0.0
TrpArg: 1.404 ± 0.757
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 1.404 ± 1.43
TrpTrp: 1.404 ± 0.757
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.404 ± 0.757
TyrCys: 0.0 ± 0.0
TyrAsp: 2.809 ± 0.673
TyrGlu: 1.404 ± 0.757
TyrPhe: 7.022 ± 0.588
TyrGly: 4.213 ± 2.102
TyrHis: 2.809 ± 1.514
TyrIle: 0.0 ± 0.0
TyrLys: 0.0 ± 0.0
TyrLeu: 5.618 ± 1.345
TyrMet: 1.404 ± 1.43
TyrAsn: 0.0 ± 0.0
TyrPro: 2.809 ± 0.673
TyrGln: 0.0 ± 0.0
TyrArg: 0.0 ± 0.0
TyrSer: 2.809 ± 1.514
TyrThr: 2.809 ± 0.673
TyrVal: 4.213 ± 0.084
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (713 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
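One plausible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The exact procedure behind the values on this page is not documented here, so the function name `bootstrap_error`, the use of the population standard deviation, and the toy sequences are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Hypothetical helper: resamples whole proteins with replacement
    n_resamples times and returns the standard deviation of the
    per-resample frequencies.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy input (made-up sequences, one-letter amino acid codes).
toy = ["MKKLA", "AKKA"]
print(f"KK error: {bootstrap_error(toy, 'KK'):.3f} per mille")
```

With only two proteins, as in this proteome, each resample can take just three distinct compositions (the first protein twice, the second twice, or one of each), so the estimated errors fall on a small set of recurring values.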

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski