Amino acid dipepetide frequency for Sewage-associated circular DNA virus-3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.211AlaAla: 6.211 ± 4.282
0.0AlaCys: 0.0 ± 0.0
1.553AlaAsp: 1.553 ± 1.121
1.553AlaGlu: 1.553 ± 1.07
6.211AlaPhe: 6.211 ± 2.09
3.106AlaGly: 3.106 ± 2.141
0.0AlaHis: 0.0 ± 0.0
4.658AlaIle: 4.658 ± 3.211
1.553AlaLys: 1.553 ± 1.121
3.106AlaLeu: 3.106 ± 0.051
0.0AlaMet: 0.0 ± 0.0
1.553AlaAsn: 1.553 ± 1.121
6.211AlaPro: 6.211 ± 4.282
0.0AlaGln: 0.0 ± 0.0
4.658AlaArg: 4.658 ± 3.364
1.553AlaSer: 1.553 ± 1.07
3.106AlaThr: 3.106 ± 2.141
4.658AlaVal: 4.658 ± 3.211
1.553AlaTrp: 1.553 ± 1.07
3.106AlaTyr: 3.106 ± 2.243
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.553CysAsp: 1.553 ± 1.121
3.106CysGlu: 3.106 ± 2.243
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.553CysIle: 1.553 ± 1.121
0.0CysLys: 0.0 ± 0.0
3.106CysLeu: 3.106 ± 0.051
0.0CysMet: 0.0 ± 0.0
1.553CysAsn: 1.553 ± 1.121
0.0CysPro: 0.0 ± 0.0
3.106CysGln: 3.106 ± 2.243
0.0CysArg: 0.0 ± 0.0
3.106CysSer: 3.106 ± 2.243
0.0CysThr: 0.0 ± 0.0
1.553CysVal: 1.553 ± 1.121
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.553AspAla: 1.553 ± 1.121
0.0AspCys: 0.0 ± 0.0
3.106AspAsp: 3.106 ± 0.051
4.658AspGlu: 4.658 ± 3.364
0.0AspPhe: 0.0 ± 0.0
4.658AspGly: 4.658 ± 1.173
0.0AspHis: 0.0 ± 0.0
4.658AspIle: 4.658 ± 3.364
3.106AspLys: 3.106 ± 2.243
1.553AspLeu: 1.553 ± 1.07
4.658AspMet: 4.658 ± 1.019
4.658AspAsn: 4.658 ± 3.211
3.106AspPro: 3.106 ± 0.051
4.658AspGln: 4.658 ± 3.211
1.553AspArg: 1.553 ± 1.07
4.658AspSer: 4.658 ± 1.173
3.106AspThr: 3.106 ± 2.243
4.658AspVal: 4.658 ± 1.173
0.0AspTrp: 0.0 ± 0.0
4.658AspTyr: 4.658 ± 1.173
0.0AspXaa: 0.0 ± 0.0
Glu
1.553GluAla: 1.553 ± 1.07
0.0GluCys: 0.0 ± 0.0
7.764GluAsp: 7.764 ± 0.968
1.553GluGlu: 1.553 ± 1.121
6.211GluPhe: 6.211 ± 0.102
1.553GluGly: 1.553 ± 1.07
0.0GluHis: 0.0 ± 0.0
1.553GluIle: 1.553 ± 1.121
1.553GluLys: 1.553 ± 1.121
1.553GluLeu: 1.553 ± 1.121
3.106GluMet: 3.106 ± 0.749
1.553GluAsn: 1.553 ± 1.07
7.764GluPro: 7.764 ± 3.415
3.106GluGln: 3.106 ± 0.051
3.106GluArg: 3.106 ± 2.243
1.553GluSer: 1.553 ± 1.121
1.553GluThr: 1.553 ± 1.121
4.658GluVal: 4.658 ± 1.019
4.658GluTrp: 4.658 ± 3.364
1.553GluTyr: 1.553 ± 1.121
0.0GluXaa: 0.0 ± 0.0
Phe
1.553PheAla: 1.553 ± 1.07
1.553PheCys: 1.553 ± 1.121
6.211PheAsp: 6.211 ± 2.294
1.553PheGlu: 1.553 ± 1.121
3.106PhePhe: 3.106 ± 0.051
6.211PheGly: 6.211 ± 0.102
1.553PheHis: 1.553 ± 1.121
0.0PheIle: 0.0 ± 0.0
3.106PheLys: 3.106 ± 2.141
3.106PheLeu: 3.106 ± 0.051
0.0PheMet: 0.0 ± 0.0
1.553PheAsn: 1.553 ± 1.121
3.106PhePro: 3.106 ± 2.243
0.0PheGln: 0.0 ± 0.0
9.317PheArg: 9.317 ± 4.231
4.658PheSer: 4.658 ± 1.019
6.211PheThr: 6.211 ± 0.102
1.553PheVal: 1.553 ± 1.07
0.0PheTrp: 0.0 ± 0.0
1.553PheTyr: 1.553 ± 1.07
0.0PheXaa: 0.0 ± 0.0
Gly
4.658GlyAla: 4.658 ± 1.019
3.106GlyCys: 3.106 ± 0.051
1.553GlyAsp: 1.553 ± 1.07
3.106GlyGlu: 3.106 ± 0.051
1.553GlyPhe: 1.553 ± 1.121
4.658GlyGly: 4.658 ± 3.364
0.0GlyHis: 0.0 ± 0.0
3.106GlyIle: 3.106 ± 2.141
0.0GlyLys: 0.0 ± 0.0
4.658GlyLeu: 4.658 ± 1.173
1.553GlyMet: 1.553 ± 1.121
3.106GlyAsn: 3.106 ± 2.141
3.106GlyPro: 3.106 ± 2.243
3.106GlyGln: 3.106 ± 2.141
6.211GlyArg: 6.211 ± 4.486
0.0GlySer: 0.0 ± 0.0
6.211GlyThr: 6.211 ± 4.282
4.658GlyVal: 4.658 ± 3.211
3.106GlyTrp: 3.106 ± 0.051
1.553GlyTyr: 1.553 ± 1.07
0.0GlyXaa: 0.0 ± 0.0
His
1.553HisAla: 1.553 ± 1.121
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
3.106HisPhe: 3.106 ± 2.243
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
1.553HisLeu: 1.553 ± 1.121
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.553HisPro: 1.553 ± 1.121
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.553HisSer: 1.553 ± 1.121
0.0HisThr: 0.0 ± 0.0
1.553HisVal: 1.553 ± 1.121
1.553HisTrp: 1.553 ± 1.121
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
6.211IleAla: 6.211 ± 0.102
0.0IleCys: 0.0 ± 0.0
3.106IleAsp: 3.106 ± 2.243
3.106IleGlu: 3.106 ± 0.051
1.553IlePhe: 1.553 ± 1.121
3.106IleGly: 3.106 ± 2.141
0.0IleHis: 0.0 ± 0.0
1.553IleIle: 1.553 ± 1.07
1.553IleLys: 1.553 ± 1.121
4.658IleLeu: 4.658 ± 1.173
0.0IleMet: 0.0 ± 0.0
1.553IleAsn: 1.553 ± 1.07
1.553IlePro: 1.553 ± 1.07
6.211IleGln: 6.211 ± 2.294
3.106IleArg: 3.106 ± 2.141
4.658IleSer: 4.658 ± 3.211
1.553IleThr: 1.553 ± 1.07
1.553IleVal: 1.553 ± 1.07
3.106IleTrp: 3.106 ± 2.243
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
1.553LysAla: 1.553 ± 1.121
0.0LysCys: 0.0 ± 0.0
3.106LysAsp: 3.106 ± 0.051
3.106LysGlu: 3.106 ± 0.051
0.0LysPhe: 0.0 ± 0.0
1.553LysGly: 1.553 ± 1.07
0.0LysHis: 0.0 ± 0.0
0.0LysIle: 0.0 ± 0.0
1.553LysLys: 1.553 ± 1.121
1.553LysLeu: 1.553 ± 1.07
0.0LysMet: 0.0 ± 0.0
0.0LysAsn: 0.0 ± 0.0
1.553LysPro: 1.553 ± 1.121
1.553LysGln: 1.553 ± 1.07
6.211LysArg: 6.211 ± 2.09
3.106LysSer: 3.106 ± 0.051
1.553LysThr: 1.553 ± 1.07
1.553LysVal: 1.553 ± 1.07
1.553LysTrp: 1.553 ± 1.07
9.317LysTyr: 9.317 ± 2.345
0.0LysXaa: 0.0 ± 0.0
Leu
3.106LeuAla: 3.106 ± 2.141
1.553LeuCys: 1.553 ± 1.121
1.553LeuAsp: 1.553 ± 1.121
4.658LeuGlu: 4.658 ± 3.364
4.658LeuPhe: 4.658 ± 1.173
4.658LeuGly: 4.658 ± 1.173
4.658LeuHis: 4.658 ± 3.364
7.764LeuIle: 7.764 ± 1.224
3.106LeuLys: 3.106 ± 2.141
4.658LeuLeu: 4.658 ± 3.364
0.0LeuMet: 0.0 ± 0.0
1.553LeuAsn: 1.553 ± 1.07
1.553LeuPro: 1.553 ± 1.121
3.106LeuGln: 3.106 ± 0.051
7.764LeuArg: 7.764 ± 1.224
6.211LeuSer: 6.211 ± 2.294
4.658LeuThr: 4.658 ± 1.173
3.106LeuVal: 3.106 ± 2.141
0.0LeuTrp: 0.0 ± 0.0
3.106LeuTyr: 3.106 ± 0.051
0.0LeuXaa: 0.0 ± 0.0
Met
4.658MetAla: 4.658 ± 1.019
0.0MetCys: 0.0 ± 0.0
3.106MetAsp: 3.106 ± 0.051
0.0MetGlu: 0.0 ± 0.0
1.553MetPhe: 1.553 ± 1.121
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
3.106MetPro: 3.106 ± 2.141
3.106MetGln: 3.106 ± 2.141
0.0MetArg: 0.0 ± 0.0
1.553MetSer: 1.553 ± 1.07
0.0MetThr: 0.0 ± 0.0
1.553MetVal: 1.553 ± 1.121
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
1.553AsnCys: 1.553 ± 1.121
4.658AsnAsp: 4.658 ± 1.019
3.106AsnGlu: 3.106 ± 0.051
1.553AsnPhe: 1.553 ± 1.121
6.211AsnGly: 6.211 ± 0.102
0.0AsnHis: 0.0 ± 0.0
1.553AsnIle: 1.553 ± 1.121
6.211AsnLys: 6.211 ± 4.282
4.658AsnLeu: 4.658 ± 1.173
0.0AsnMet: 0.0 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
6.211AsnPro: 6.211 ± 2.09
0.0AsnGln: 0.0 ± 0.0
1.553AsnArg: 1.553 ± 1.07
0.0AsnSer: 0.0 ± 0.0
3.106AsnThr: 3.106 ± 0.051
4.658AsnVal: 4.658 ± 1.019
1.553AsnTrp: 1.553 ± 1.121
1.553AsnTyr: 1.553 ± 1.07
0.0AsnXaa: 0.0 ± 0.0
Pro
1.553ProAla: 1.553 ± 1.07
3.106ProCys: 3.106 ± 2.243
3.106ProAsp: 3.106 ± 0.051
3.106ProGlu: 3.106 ± 0.051
3.106ProPhe: 3.106 ± 2.141
1.553ProGly: 1.553 ± 1.07
1.553ProHis: 1.553 ± 1.121
1.553ProIle: 1.553 ± 1.07
1.553ProLys: 1.553 ± 1.07
6.211ProLeu: 6.211 ± 0.102
1.553ProMet: 1.553 ± 0.803
4.658ProAsn: 4.658 ± 1.173
4.658ProPro: 4.658 ± 1.019
3.106ProGln: 3.106 ± 2.243
9.317ProArg: 9.317 ± 2.345
0.0ProSer: 0.0 ± 0.0
1.553ProThr: 1.553 ± 1.07
0.0ProVal: 0.0 ± 0.0
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.106GlnAla: 3.106 ± 2.141
3.106GlnCys: 3.106 ± 2.243
0.0GlnAsp: 0.0 ± 0.0
1.553GlnGlu: 1.553 ± 1.07
0.0GlnPhe: 0.0 ± 0.0
4.658GlnGly: 4.658 ± 1.019
1.553GlnHis: 1.553 ± 1.121
1.553GlnIle: 1.553 ± 1.07
1.553GlnLys: 1.553 ± 1.121
3.106GlnLeu: 3.106 ± 0.051
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
0.0GlnPro: 0.0 ± 0.0
0.0GlnGln: 0.0 ± 0.0
3.106GlnArg: 3.106 ± 2.243
3.106GlnSer: 3.106 ± 0.051
3.106GlnThr: 3.106 ± 2.141
4.658GlnVal: 4.658 ± 1.019
1.553GlnTrp: 1.553 ± 1.121
1.553GlnTyr: 1.553 ± 1.07
0.0GlnXaa: 0.0 ± 0.0
Arg
1.553ArgAla: 1.553 ± 1.07
0.0ArgCys: 0.0 ± 0.0
7.764ArgAsp: 7.764 ± 3.415
7.764ArgGlu: 7.764 ± 1.224
9.317ArgPhe: 9.317 ± 2.039
4.658ArgGly: 4.658 ± 1.173
0.0ArgHis: 0.0 ± 0.0
0.0ArgIle: 0.0 ± 0.0
4.658ArgLys: 4.658 ± 1.019
4.658ArgLeu: 4.658 ± 1.173
1.553ArgMet: 1.553 ± 1.07
3.106ArgAsn: 3.106 ± 2.243
1.553ArgPro: 1.553 ± 1.07
1.553ArgGln: 1.553 ± 1.07
23.292ArgArg: 23.292 ± 7.289
9.317ArgSer: 9.317 ± 0.153
10.87ArgThr: 10.87 ± 3.467
9.317ArgVal: 9.317 ± 4.231
0.0ArgTrp: 0.0 ± 0.0
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.658SerAla: 4.658 ± 1.173
0.0SerCys: 0.0 ± 0.0
3.106SerAsp: 3.106 ± 0.051
4.658SerGlu: 4.658 ± 1.019
6.211SerPhe: 6.211 ± 2.09
3.106SerGly: 3.106 ± 0.051
0.0SerHis: 0.0 ± 0.0
4.658SerIle: 4.658 ± 1.173
6.211SerLys: 6.211 ± 0.102
9.317SerLeu: 9.317 ± 0.153
1.553SerMet: 1.553 ± 1.07
6.211SerAsn: 6.211 ± 2.09
3.106SerPro: 3.106 ± 2.243
0.0SerGln: 0.0 ± 0.0
7.764SerArg: 7.764 ± 0.968
9.317SerSer: 9.317 ± 2.039
4.658SerThr: 4.658 ± 1.019
1.553SerVal: 1.553 ± 1.07
0.0SerTrp: 0.0 ± 0.0
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.658ThrAla: 4.658 ± 3.211
0.0ThrCys: 0.0 ± 0.0
3.106ThrAsp: 3.106 ± 0.051
7.764ThrGlu: 7.764 ± 3.415
1.553ThrPhe: 1.553 ± 1.121
3.106ThrGly: 3.106 ± 0.051
3.106ThrHis: 3.106 ± 2.243
3.106ThrIle: 3.106 ± 2.141
0.0ThrLys: 0.0 ± 0.0
4.658ThrLeu: 4.658 ± 1.173
0.0ThrMet: 0.0 ± 0.0
9.317ThrAsn: 9.317 ± 2.039
1.553ThrPro: 1.553 ± 1.07
4.658ThrGln: 4.658 ± 1.173
4.658ThrArg: 4.658 ± 3.211
12.422ThrSer: 12.422 ± 1.988
1.553ThrThr: 1.553 ± 1.07
0.0ThrVal: 0.0 ± 0.0
0.0ThrTrp: 0.0 ± 0.0
4.658ThrTyr: 4.658 ± 1.173
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
1.553ValCys: 1.553 ± 1.121
4.658ValAsp: 4.658 ± 1.019
1.553ValGlu: 1.553 ± 1.07
1.553ValPhe: 1.553 ± 1.07
3.106ValGly: 3.106 ± 2.141
0.0ValHis: 0.0 ± 0.0
7.764ValIle: 7.764 ± 0.968
1.553ValLys: 1.553 ± 1.07
1.553ValLeu: 1.553 ± 1.121
1.553ValMet: 1.553 ± 1.07
6.211ValAsn: 6.211 ± 0.102
0.0ValPro: 0.0 ± 0.0
0.0ValGln: 0.0 ± 0.0
4.658ValArg: 4.658 ± 1.019
4.658ValSer: 4.658 ± 3.211
7.764ValThr: 7.764 ± 3.16
3.106ValVal: 3.106 ± 0.051
4.658ValTrp: 4.658 ± 1.019
4.658ValTyr: 4.658 ± 1.173
0.0ValXaa: 0.0 ± 0.0
Trp
4.658TrpAla: 4.658 ± 1.019
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.553TrpIle: 1.553 ± 1.121
1.553TrpLys: 1.553 ± 1.121
3.106TrpLeu: 3.106 ± 2.243
1.553TrpMet: 1.553 ± 1.07
1.553TrpAsn: 1.553 ± 1.121
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.553TrpArg: 1.553 ± 1.121
3.106TrpSer: 3.106 ± 0.051
4.658TrpThr: 4.658 ± 1.173
1.553TrpVal: 1.553 ± 1.121
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.553TyrAla: 1.553 ± 1.07
3.106TyrCys: 3.106 ± 2.243
0.0TyrAsp: 0.0 ± 0.0
0.0TyrGlu: 0.0 ± 0.0
4.658TyrPhe: 4.658 ± 1.019
3.106TyrGly: 3.106 ± 0.051
0.0TyrHis: 0.0 ± 0.0
1.553TyrIle: 1.553 ± 1.07
0.0TyrLys: 0.0 ± 0.0
4.658TyrLeu: 4.658 ± 1.173
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
3.106TyrPro: 3.106 ± 0.051
0.0TyrGln: 0.0 ± 0.0
3.106TyrArg: 3.106 ± 0.051
1.553TyrSer: 1.553 ± 1.07
4.658TyrThr: 4.658 ± 3.364
4.658TyrVal: 4.658 ± 1.173
1.553TyrTrp: 1.553 ± 1.121
1.553TyrTyr: 1.553 ± 1.07
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (645 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski