Amino acid dipeptide frequency for Sewage-associated circular DNA virus-33

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
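The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the function names `dipeptide_permille` and `classify` are hypothetical, it assumes overlapping pairs are counted within each protein, and it assumes any non-standard letter is mapped to X (Xaa).

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes

# Uniform expectation: 1000 / 400 = 2.5 per mille (0.25%)
EXPECTED_PERMILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: pairs are counted inside each protein only, never across
    the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"    # would be shown in red
    if freq_permille < expected:
        return "under"   # would be shown in blue
    return "expected"
```

For example, `classify(12.0)` (the ArgSer value in the table) returns `"over"`, while `classify(0.0)` returns `"under"`.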

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
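As a quick check of the unit conversion, the snippet below turns one table value into a fraction and a percentage. The helper name `permille_to_fraction` is hypothetical; the input 12.0 is the ArgSer entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


arg_ser = permille_to_fraction(12.0)  # ArgSer entry from the table
print(f"{arg_ser:.3f}")               # 0.012
print(f"{arg_ser * 100:.1f}%")        # 1.2%
```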

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell is the frequency ± error, in ‰.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.667 ± 0.543 | 1.333 ± 0.829 | 1.333 ± 1.373 | 6.667 ± 4.661 | 8.0 ± 1.63 | 5.333 ± 3.318 | 1.333 ± 0.829 | 9.333 ± 3.002 | 9.333 ± 3.002 | 4.0 ± 0.286 | 1.333 ± 0.829 | 5.333 ± 1.086 | 4.0 ± 2.488 | 0.0 ± 0.0 | 6.667 ± 2.459 | 4.0 ± 0.286 | 1.333 ± 0.829 | 4.0 ± 0.286 | 0.0 ± 0.0 | 2.667 ± 0.543 | 0.0 ± 0.0
Cys | 1.333 ± 1.373 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.333 ± 1.373 | 2.667 ± 1.659 | 2.667 ± 2.745 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.333 ± 0.829 | 0.0 ± 0.0 | 1.333 ± 0.829 | 1.333 ± 0.829 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 1.333 ± 1.373 | 0.0 ± 0.0 | 4.0 ± 4.118 | 0.0 ± 0.0 | 2.667 ± 2.745 | 2.667 ± 0.543 | 0.0 ± 0.0 | 1.333 ± 0.829 | 1.333 ± 1.373 | 1.333 ± 0.829 | 1.333 ± 0.829 | 4.0 ± 1.916 | 2.667 ± 1.659 | 2.667 ± 0.543 | 0.0 ± 0.0 | 5.333 ± 1.086 | 4.0 ± 1.916 | 5.333 ± 1.116 | 1.333 ± 1.373 | 1.333 ± 0.829 | 0.0 ± 0.0
Glu | 2.667 ± 0.543 | 0.0 ± 0.0 | 4.0 ± 4.118 | 2.667 ± 0.543 | 5.333 ± 1.116 | 2.667 ± 0.543 | 1.333 ± 1.373 | 4.0 ± 1.916 | 2.667 ± 1.659 | 2.667 ± 0.543 | 0.0 ± 0.0 | 4.0 ± 0.286 | 2.667 ± 0.543 | 0.0 ± 0.0 | 1.333 ± 0.829 | 0.0 ± 0.0 | 2.667 ± 0.543 | 4.0 ± 0.286 | 0.0 ± 0.0 | 1.333 ± 1.373 | 0.0 ± 0.0
Phe | 0.0 ± 0.0 | 2.667 ± 0.543 | 1.333 ± 1.373 | 5.333 ± 1.086 | 2.667 ± 1.659 | 6.667 ± 1.945 | 0.0 ± 0.0 | 4.0 ± 1.916 | 5.333 ± 1.086 | 1.333 ± 0.829 | 1.333 ± 0.799 | 2.667 ± 0.543 | 5.333 ± 1.116 | 1.333 ± 0.829 | 1.333 ± 0.829 | 4.0 ± 2.488 | 1.333 ± 0.829 | 2.667 ± 2.745 | 0.0 ± 0.0 | 1.333 ± 0.829 | 0.0 ± 0.0
Gly | 8.0 ± 1.63 | 0.0 ± 0.0 | 1.333 ± 0.829 | 2.667 ± 1.659 | 4.0 ± 2.488 | 8.0 ± 2.775 | 2.667 ± 0.543 | 2.667 ± 0.543 | 2.667 ± 0.543 | 1.333 ± 0.829 | 0.0 ± 0.0 | 2.667 ± 0.543 | 4.0 ± 1.916 | 5.333 ± 1.116 | 5.333 ± 3.318 | 5.333 ± 3.318 | 9.333 ± 3.604 | 6.667 ± 0.257 | 0.0 ± 0.0 | 4.0 ± 1.916 | 0.0 ± 0.0
His | 1.333 ± 1.373 | 0.0 ± 0.0 | 1.333 ± 0.829 | 1.333 ± 0.829 | 1.333 ± 0.829 | 0.0 ± 0.0 | 1.333 ± 1.373 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.667 ± 0.543 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.333 ± 0.829 | 1.333 ± 1.373 | 1.333 ± 0.829 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.333 ± 1.373 | 0.0 ± 0.0 | 0.0 ± 0.0
Ile | 6.667 ± 2.459 | 1.333 ± 0.829 | 5.333 ± 1.086 | 0.0 ± 0.0 | 1.333 ± 1.373 | 4.0 ± 0.286 | 0.0 ± 0.0 | 2.667 ± 0.543 | 5.333 ± 1.086 | 2.667 ± 0.543 | 0.0 ± 0.0 | 9.333 ± 0.8 | 2.667 ± 0.543 | 4.0 ± 0.286 | 5.333 ± 1.086 | 0.0 ± 0.0 | 2.667 ± 0.543 | 1.333 ± 1.373 | 2.667 ± 2.745 | 2.667 ± 1.659 | 0.0 ± 0.0
Lys | 5.333 ± 1.116 | 1.333 ± 1.373 | 2.667 ± 0.543 | 6.667 ± 2.459 | 1.333 ± 1.373 | 4.0 ± 0.286 | 1.333 ± 0.829 | 5.333 ± 1.086 | 6.667 ± 4.661 | 8.0 ± 4.977 | 2.667 ± 0.543 | 2.667 ± 1.659 | 1.333 ± 1.373 | 2.667 ± 2.745 | 2.667 ± 0.543 | 9.333 ± 3.604 | 4.0 ± 1.916 | 0.0 ± 0.0 | 1.333 ± 1.373 | 5.333 ± 1.086 | 0.0 ± 0.0
Leu | 6.667 ± 0.257 | 0.0 ± 0.0 | 4.0 ± 1.916 | 2.667 ± 1.659 | 6.667 ± 2.459 | 6.667 ± 0.257 | 0.0 ± 0.0 | 1.333 ± 0.829 | 5.333 ± 1.086 | 4.0 ± 0.286 | 1.333 ± 1.304 | 4.0 ± 2.488 | 1.333 ± 0.829 | 1.333 ± 0.829 | 5.333 ± 1.086 | 6.667 ± 1.945 | 5.333 ± 1.116 | 1.333 ± 0.829 | 1.333 ± 0.829 | 4.0 ± 2.488 | 0.0 ± 0.0
Met | 1.333 ± 0.829 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.333 ± 0.829 | 0.0 ± 0.0 | 1.333 ± 0.829 | 4.0 ± 1.916 | 1.333 ± 0.829 | 1.333 ± 0.829 | 0.0 ± 0.0 | 1.333 ± 1.373 | 1.333 ± 1.373 | 2.667 ± 1.659 | 1.333 ± 0.829 | 2.667 ± 1.659 | 1.333 ± 0.829 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 4.0 ± 0.286 | 0.0 ± 0.0 | 2.667 ± 0.543 | 1.333 ± 1.373 | 2.667 ± 1.659 | 2.667 ± 1.659 | 1.333 ± 0.829 | 4.0 ± 1.916 | 2.667 ± 0.543 | 1.333 ± 1.373 | 4.0 ± 0.286 | 0.0 ± 0.0 | 1.333 ± 0.829 | 0.0 ± 0.0 | 5.333 ± 1.086 | 1.333 ± 1.373 | 8.0 ± 3.832 | 2.667 ± 0.543 | 2.667 ± 1.659 | 2.667 ± 0.543 | 0.0 ± 0.0
Pro | 5.333 ± 1.116 | 0.0 ± 0.0 | 1.333 ± 1.373 | 2.667 ± 1.659 | 1.333 ± 0.829 | 6.667 ± 0.257 | 0.0 ± 0.0 | 2.667 ± 1.659 | 1.333 ± 1.373 | 2.667 ± 0.543 | 1.333 ± 0.829 | 0.0 ± 0.0 | 4.0 ± 1.916 | 1.333 ± 0.829 | 2.667 ± 1.659 | 5.333 ± 1.086 | 1.333 ± 1.373 | 1.333 ± 1.373 | 1.333 ± 1.373 | 4.0 ± 2.488 | 0.0 ± 0.0
Gln | 1.333 ± 1.373 | 0.0 ± 0.0 | 1.333 ± 0.829 | 0.0 ± 0.0 | 2.667 ± 1.659 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.667 ± 0.543 | 1.333 ± 1.373 | 8.0 ± 1.63 | 0.0 ± 0.0 | 4.0 ± 1.916 | 0.0 ± 0.0 | 2.667 ± 1.659 | 1.333 ± 0.829 | 0.0 ± 0.0 | 2.667 ± 1.659 | 1.333 ± 0.829 | 1.333 ± 1.373 | 2.667 ± 1.659 | 0.0 ± 0.0
Arg | 5.333 ± 1.086 | 0.0 ± 0.0 | 2.667 ± 1.659 | 2.667 ± 2.745 | 4.0 ± 1.916 | 5.333 ± 1.116 | 0.0 ± 0.0 | 4.0 ± 2.488 | 5.333 ± 3.318 | 4.0 ± 1.916 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.333 ± 1.373 | 1.333 ± 0.829 | 2.667 ± 0.543 | 12.0 ± 3.061 | 5.333 ± 3.318 | 4.0 ± 0.286 | 0.0 ± 0.0 | 2.667 ± 0.543 | 0.0 ± 0.0
Ser | 8.0 ± 4.977 | 0.0 ± 0.0 | 2.667 ± 1.659 | 4.0 ± 1.916 | 0.0 ± 0.0 | 4.0 ± 2.488 | 1.333 ± 1.373 | 5.333 ± 1.086 | 10.667 ± 2.232 | 9.333 ± 5.806 | 2.667 ± 1.659 | 4.0 ± 1.916 | 2.667 ± 2.745 | 1.333 ± 0.829 | 10.667 ± 4.434 | 4.0 ± 0.286 | 4.0 ± 2.488 | 4.0 ± 0.286 | 1.333 ± 1.373 | 0.0 ± 0.0 | 0.0 ± 0.0
Thr | 6.667 ± 0.257 | 1.333 ± 0.829 | 4.0 ± 1.916 | 0.0 ± 0.0 | 1.333 ± 0.829 | 4.0 ± 0.286 | 1.333 ± 0.829 | 5.333 ± 3.289 | 1.333 ± 1.373 | 5.333 ± 1.116 | 1.333 ± 0.829 | 2.667 ± 2.745 | 4.0 ± 2.488 | 0.0 ± 0.0 | 4.0 ± 0.286 | 8.0 ± 2.775 | 4.0 ± 2.488 | 6.667 ± 4.147 | 0.0 ± 0.0 | 1.333 ± 0.829 | 0.0 ± 0.0
Val | 2.667 ± 0.543 | 1.333 ± 1.373 | 1.333 ± 0.829 | 0.0 ± 0.0 | 1.333 ± 1.373 | 6.667 ± 1.945 | 1.333 ± 1.373 | 1.333 ± 1.373 | 5.333 ± 1.116 | 6.667 ± 1.945 | 0.0 ± 0.0 | 1.333 ± 0.829 | 6.667 ± 1.945 | 4.0 ± 0.286 | 1.333 ± 1.373 | 5.333 ± 1.116 | 1.333 ± 0.829 | 8.0 ± 4.977 | 0.0 ± 0.0 | 2.667 ± 0.543 | 0.0 ± 0.0
Trp | 2.667 ± 2.745 | 1.333 ± 1.373 | 0.0 ± 0.0 | 1.333 ± 0.829 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.333 ± 0.829 | 0.0 ± 0.0 | 2.667 ± 2.745 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.333 ± 1.373 | 0.0 ± 0.0 | 2.667 ± 0.543 | 1.333 ± 1.373 | 0.0 ± 0.0 | 1.333 ± 1.373 | 1.333 ± 1.373 | 0.0 ± 0.0
Tyr | 5.333 ± 1.086 | 1.333 ± 0.829 | 1.333 ± 0.829 | 2.667 ± 1.659 | 1.333 ± 0.829 | 2.667 ± 0.543 | 1.333 ± 0.829 | 1.333 ± 1.373 | 4.0 ± 2.488 | 1.333 ± 0.829 | 1.333 ± 1.373 | 4.0 ± 0.286 | 0.0 ± 0.0 | 1.333 ± 1.373 | 1.333 ± 0.829 | 2.667 ± 0.543 | 1.333 ± 0.829 | 4.0 ± 0.286 | 1.333 ± 1.373 | 4.0 ± 1.916 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 2 proteins (751 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
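A protein-level bootstrap of this kind can be sketched as follows. The page does not spell out the exact procedure, so this is only one plausible reading: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the use of the population standard deviation are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each round draws len(proteins) whole proteins with
    replacement; individual residues are never resampled.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)
```

With only two proteins, as in this proteome, each resample is one of very few distinct combinations, so error estimates obtained this way are coarse.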

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski