Amino acid dipeptide frequency for Sewage-associated circular DNA virus-9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
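As a rough illustration of the counting described above, the following Python sketch computes dipeptide frequencies in per mille and compares them with the uniform expectation of 1/400. It is not the database's own code: the function name `dipeptide_permille`, the toy sequences, and the choices to count pairs only within each protein and to map every non-standard letter to X (Xaa) are assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues is treated as X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Sliding window of width 2; pairs never span two proteins (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 0.25 % = 2.5 per mille.
EXPECTED_UNIFORM = 1000.0 / 400

toy = ["MKVLAAGT", "MAAGKVL"]  # made-up sequences, not the virus proteome
freqs = dipeptide_permille(toy)
over = sorted(dp for dp, f in freqs.items() if f > EXPECTED_UNIFORM)
under = sorted(dp for dp, f in freqs.items() if f < EXPECTED_UNIFORM)
print(len(freqs), len(over), len(under))
```

In a proteome as small as this one (602 amino acids), most of the 441 pairs are necessarily absent or seen only once or twice, so a comparison against 2.5‰ mostly separates observed pairs from unobserved ones.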

All values are presented in per mille (‰); to convert them to fractions, multiply by 10⁻³.
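The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical; the example value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 % equals 10 per mille)."""
    return value_permille / 10.0


ala_ala = 1.664  # AlaAla frequency from the table, in per mille
uniform = 2.5    # uniform expectation, 0.25 % expressed in per mille

print(permille_to_fraction(ala_ala))  # fraction of all dipeptides
print(permille_to_percent(ala_ala))   # the same value in percent
print(ala_ala < uniform)              # below the uniform expectation?
```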

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.664 ± 0.889
AlaCys: 6.656 ± 2.644
AlaAsp: 3.328 ± 0.289
AlaGlu: 0.0 ± 0.0
AlaPhe: 4.992 ± 1.466
AlaGly: 6.656 ± 1.489
AlaHis: 0.0 ± 0.0
AlaIle: 0.0 ± 0.0
AlaLys: 1.664 ± 0.889
AlaLeu: 1.664 ± 0.889
AlaMet: 8.319 ± 2.377
AlaAsn: 4.992 ± 0.6
AlaPro: 4.992 ± 0.6
AlaGln: 0.0 ± 0.0
AlaArg: 0.0 ± 0.0
AlaSer: 3.328 ± 0.289
AlaThr: 9.983 ± 3.266
AlaVal: 3.328 ± 0.289
AlaTrp: 1.664 ± 1.178
AlaTyr: 3.328 ± 2.355
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.664 ± 0.889
CysCys: 0.0 ± 0.0
CysAsp: 3.328 ± 0.289
CysGlu: 1.664 ± 1.178
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 1.664 ± 1.178
CysMet: 0.0 ± 0.0
CysAsn: 1.664 ± 1.178
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 3.328 ± 2.355
CysThr: 0.0 ± 0.0
CysVal: 1.664 ± 1.178
CysTrp: 0.0 ± 0.0
CysTyr: 1.664 ± 0.889
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.328 ± 2.355
AspCys: 0.0 ± 0.0
AspAsp: 3.328 ± 2.355
AspGlu: 4.992 ± 1.466
AspPhe: 4.992 ± 1.466
AspGly: 3.328 ± 1.777
AspHis: 0.0 ± 0.0
AspIle: 3.328 ± 1.777
AspLys: 0.0 ± 0.0
AspLeu: 8.319 ± 0.311
AspMet: 4.992 ± 0.6
AspAsn: 1.664 ± 1.178
AspPro: 3.328 ± 1.777
AspGln: 0.0 ± 0.0
AspArg: 6.656 ± 4.71
AspSer: 9.983 ± 5.332
AspThr: 4.992 ± 1.466
AspVal: 6.656 ± 1.489
AspTrp: 3.328 ± 2.355
AspTyr: 4.992 ± 1.466
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.664 ± 0.889
GluCys: 0.0 ± 0.0
GluAsp: 1.664 ± 1.178
GluGlu: 0.0 ± 0.0
GluPhe: 3.328 ± 0.289
GluGly: 6.656 ± 2.644
GluHis: 0.0 ± 0.0
GluIle: 4.992 ± 0.6
GluLys: 0.0 ± 0.0
GluLeu: 1.664 ± 0.889
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 0.0 ± 0.0
GluGln: 1.664 ± 1.178
GluArg: 6.656 ± 4.71
GluSer: 4.992 ± 1.466
GluThr: 4.992 ± 1.466
GluVal: 0.0 ± 0.0
GluTrp: 1.664 ± 1.178
GluTyr: 3.328 ± 2.355
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.664 ± 1.178
PheCys: 1.664 ± 1.178
PheAsp: 6.656 ± 0.578
PheGlu: 4.992 ± 3.533
PhePhe: 1.664 ± 1.178
PheGly: 1.664 ± 0.889
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 1.664 ± 0.889
PheLeu: 1.664 ± 0.889
PheMet: 0.0 ± 0.0
PheAsn: 3.328 ± 1.777
PhePro: 0.0 ± 0.0
PheGln: 4.992 ± 0.6
PheArg: 6.656 ± 4.71
PheSer: 1.664 ± 1.178
PheThr: 3.328 ± 0.289
PheVal: 1.664 ± 0.889
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.0 ± 0.0
GlyCys: 0.0 ± 0.0
GlyAsp: 4.992 ± 0.6
GlyGlu: 0.0 ± 0.0
GlyPhe: 3.328 ± 0.289
GlyGly: 1.664 ± 0.889
GlyHis: 0.0 ± 0.0
GlyIle: 3.328 ± 1.777
GlyLys: 6.656 ± 0.578
GlyLeu: 8.319 ± 0.311
GlyMet: 4.992 ± 2.666
GlyAsn: 6.656 ± 1.489
GlyPro: 3.328 ± 1.777
GlyGln: 1.664 ± 0.889
GlyArg: 4.992 ± 1.466
GlySer: 3.328 ± 1.777
GlyThr: 1.664 ± 0.889
GlyVal: 8.319 ± 0.311
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.328 ± 2.355
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 1.664 ± 1.178
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 3.328 ± 0.289
HisLeu: 0.0 ± 0.0
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 0.0 ± 0.0
HisThr: 1.664 ± 0.889
HisVal: 3.328 ± 2.355
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.664 ± 0.889
IleCys: 0.0 ± 0.0
IleAsp: 3.328 ± 2.355
IleGlu: 4.992 ± 1.466
IlePhe: 0.0 ± 0.0
IleGly: 6.656 ± 3.555
IleHis: 1.664 ± 0.889
IleIle: 3.328 ± 0.289
IleLys: 0.0 ± 0.0
IleLeu: 3.328 ± 1.777
IleMet: 1.664 ± 0.889
IleAsn: 1.664 ± 0.889
IlePro: 1.664 ± 1.178
IleGln: 4.992 ± 2.666
IleArg: 0.0 ± 0.0
IleSer: 6.656 ± 1.489
IleThr: 1.664 ± 1.178
IleVal: 3.328 ± 0.289
IleTrp: 1.664 ± 1.178
IleTyr: 3.328 ± 1.777
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.656 ± 1.489
LysCys: 0.0 ± 0.0
LysAsp: 3.328 ± 2.355
LysGlu: 0.0 ± 0.0
LysPhe: 0.0 ± 0.0
LysGly: 3.328 ± 1.777
LysHis: 0.0 ± 0.0
LysIle: 1.664 ± 0.889
LysLys: 1.664 ± 0.889
LysLeu: 1.664 ± 0.889
LysMet: 1.664 ± 0.889
LysAsn: 1.664 ± 1.178
LysPro: 0.0 ± 0.0
LysGln: 0.0 ± 0.0
LysArg: 1.664 ± 1.178
LysSer: 0.0 ± 0.0
LysThr: 1.664 ± 1.178
LysVal: 4.992 ± 0.6
LysTrp: 1.664 ± 1.178
LysTyr: 1.664 ± 0.889
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.328 ± 0.289
LeuCys: 1.664 ± 1.178
LeuAsp: 4.992 ± 0.6
LeuGlu: 1.664 ± 1.178
LeuPhe: 1.664 ± 1.178
LeuGly: 4.992 ± 1.466
LeuHis: 1.664 ± 1.178
LeuIle: 1.664 ± 0.889
LeuLys: 4.992 ± 1.466
LeuLeu: 3.328 ± 1.777
LeuMet: 1.664 ± 0.889
LeuAsn: 4.992 ± 2.666
LeuPro: 8.319 ± 0.311
LeuGln: 4.992 ± 2.666
LeuArg: 3.328 ± 1.777
LeuSer: 8.319 ± 0.311
LeuThr: 0.0 ± 0.0
LeuVal: 4.992 ± 1.466
LeuTrp: 3.328 ± 0.289
LeuTyr: 4.992 ± 2.666
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.328 ± 0.289
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 4.992 ± 0.6
MetPhe: 1.664 ± 0.889
MetGly: 1.664 ± 0.889
MetHis: 0.0 ± 0.0
MetIle: 1.664 ± 0.889
MetLys: 0.0 ± 0.0
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 4.992 ± 2.666
MetGln: 1.664 ± 0.889
MetArg: 3.328 ± 1.777
MetSer: 4.992 ± 2.666
MetThr: 1.664 ± 0.889
MetVal: 3.328 ± 1.777
MetTrp: 0.0 ± 0.0
MetTyr: 1.664 ± 1.178
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.992 ± 0.6
AsnCys: 0.0 ± 0.0
AsnAsp: 4.992 ± 2.666
AsnGlu: 0.0 ± 0.0
AsnPhe: 3.328 ± 1.777
AsnGly: 1.664 ± 1.178
AsnHis: 0.0 ± 0.0
AsnIle: 4.992 ± 0.6
AsnLys: 1.664 ± 0.889
AsnLeu: 3.328 ± 1.777
AsnMet: 0.0 ± 0.0
AsnAsn: 1.664 ± 0.889
AsnPro: 6.656 ± 3.555
AsnGln: 1.664 ± 0.889
AsnArg: 0.0 ± 0.0
AsnSer: 6.656 ± 0.578
AsnThr: 4.992 ± 0.6
AsnVal: 3.328 ± 0.289
AsnTrp: 1.664 ± 0.889
AsnTyr: 1.664 ± 0.889
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.992 ± 2.666
ProCys: 1.664 ± 0.889
ProAsp: 3.328 ± 1.777
ProGlu: 6.656 ± 4.71
ProPhe: 1.664 ± 1.178
ProGly: 0.0 ± 0.0
ProHis: 1.664 ± 1.178
ProIle: 4.992 ± 2.666
ProLys: 1.664 ± 0.889
ProLeu: 6.656 ± 3.555
ProMet: 0.0 ± 0.0
ProAsn: 1.664 ± 0.889
ProPro: 6.656 ± 0.578
ProGln: 4.992 ± 0.6
ProArg: 4.992 ± 1.466
ProSer: 1.664 ± 0.889
ProThr: 1.664 ± 1.178
ProVal: 0.0 ± 0.0
ProTrp: 0.0 ± 0.0
ProTyr: 1.664 ± 0.889
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.328 ± 1.777
GlnCys: 0.0 ± 0.0
GlnAsp: 4.992 ± 0.6
GlnGlu: 1.664 ± 0.889
GlnPhe: 4.992 ± 0.6
GlnGly: 0.0 ± 0.0
GlnHis: 1.664 ± 0.889
GlnIle: 3.328 ± 1.777
GlnLys: 1.664 ± 0.889
GlnLeu: 4.992 ± 1.466
GlnMet: 1.664 ± 0.889
GlnAsn: 1.664 ± 0.889
GlnPro: 0.0 ± 0.0
GlnGln: 0.0 ± 0.0
GlnArg: 1.664 ± 1.178
GlnSer: 1.664 ± 0.889
GlnThr: 0.0 ± 0.0
GlnVal: 3.328 ± 0.289
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.664 ± 1.178
ArgCys: 0.0 ± 0.0
ArgAsp: 8.319 ± 3.821
ArgGlu: 3.328 ± 2.355
ArgPhe: 3.328 ± 0.289
ArgGly: 11.647 ± 0.022
ArgHis: 1.664 ± 1.178
ArgIle: 1.664 ± 1.178
ArgLys: 0.0 ± 0.0
ArgLeu: 3.328 ± 1.777
ArgMet: 3.328 ± 1.485
ArgAsn: 3.328 ± 2.355
ArgPro: 3.328 ± 2.355
ArgGln: 0.0 ± 0.0
ArgArg: 1.664 ± 1.178
ArgSer: 3.328 ± 0.289
ArgThr: 1.664 ± 1.178
ArgVal: 4.992 ± 1.466
ArgTrp: 0.0 ± 0.0
ArgTyr: 6.656 ± 4.71
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.992 ± 0.6
SerCys: 3.328 ± 0.289
SerAsp: 4.992 ± 0.6
SerGlu: 4.992 ± 0.6
SerPhe: 1.664 ± 1.178
SerGly: 3.328 ± 0.289
SerHis: 0.0 ± 0.0
SerIle: 6.656 ± 1.489
SerLys: 0.0 ± 0.0
SerLeu: 6.656 ± 1.489
SerMet: 3.328 ± 1.777
SerAsn: 11.647 ± 6.221
SerPro: 3.328 ± 2.355
SerGln: 1.664 ± 0.889
SerArg: 3.328 ± 0.289
SerSer: 1.664 ± 1.178
SerThr: 1.664 ± 0.889
SerVal: 6.656 ± 0.578
SerTrp: 1.664 ± 1.178
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.319 ± 0.311
ThrCys: 0.0 ± 0.0
ThrAsp: 4.992 ± 0.6
ThrGlu: 1.664 ± 0.889
ThrPhe: 0.0 ± 0.0
ThrGly: 1.664 ± 1.178
ThrHis: 1.664 ± 1.178
ThrIle: 4.992 ± 1.466
ThrLys: 0.0 ± 0.0
ThrLeu: 3.328 ± 0.289
ThrMet: 1.664 ± 0.889
ThrAsn: 1.664 ± 0.889
ThrPro: 6.656 ± 0.578
ThrGln: 1.664 ± 0.889
ThrArg: 4.992 ± 0.6
ThrSer: 0.0 ± 0.0
ThrThr: 3.328 ± 1.777
ThrVal: 1.664 ± 1.178
ThrTrp: 1.664 ± 0.889
ThrTyr: 3.328 ± 1.777
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.992 ± 1.466
ValCys: 0.0 ± 0.0
ValAsp: 4.992 ± 2.666
ValGlu: 0.0 ± 0.0
ValPhe: 3.328 ± 0.289
ValGly: 4.992 ± 2.666
ValHis: 0.0 ± 0.0
ValIle: 0.0 ± 0.0
ValLys: 4.992 ± 1.466
ValLeu: 6.656 ± 4.71
ValMet: 0.0 ± 0.0
ValAsn: 3.328 ± 1.777
ValPro: 1.664 ± 0.889
ValGln: 1.664 ± 1.178
ValArg: 6.656 ± 2.644
ValSer: 4.992 ± 2.666
ValThr: 6.656 ± 1.489
ValVal: 3.328 ± 1.777
ValTrp: 3.328 ± 2.355
ValTyr: 8.319 ± 1.755
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.328 ± 2.355
TrpCys: 1.664 ± 1.178
TrpAsp: 3.328 ± 2.355
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 1.664 ± 0.889
TrpHis: 0.0 ± 0.0
TrpIle: 1.664 ± 1.178
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.664 ± 1.178
TrpArg: 3.328 ± 0.289
TrpSer: 1.664 ± 1.178
TrpThr: 1.664 ± 0.889
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.664 ± 1.178
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.992 ± 0.6
TyrCys: 0.0 ± 0.0
TyrAsp: 3.328 ± 0.289
TyrGlu: 1.664 ± 1.178
TyrPhe: 1.664 ± 1.178
TyrGly: 4.992 ± 0.6
TyrHis: 0.0 ± 0.0
TyrIle: 3.328 ± 2.355
TyrLys: 3.328 ± 0.289
TyrLeu: 8.319 ± 3.821
TyrMet: 0.0 ± 0.0
TyrAsn: 1.664 ± 0.889
TyrPro: 1.664 ± 0.889
TyrGln: 3.328 ± 0.289
TyrArg: 3.328 ± 0.289
TyrSer: 3.328 ± 0.289
TyrThr: 0.0 ± 0.0
TyrVal: 6.656 ± 0.578
TyrTrp: 0.0 ± 0.0
TyrTyr: 3.328 ± 1.777
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (602 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
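A minimal sketch of what protein-level bootstrapping could look like is given below. The page does not spell out the exact procedure, so the details here are assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the population standard deviation of those estimates is taken as the error. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, spread) of a dipeptide frequency in per mille.

    Assumption: each round draws len(proteins) proteins with replacement
    and recomputes the frequency on that resampled proteome.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.pstdev(estimates)


toy = ["MKVLAAGT", "MAAGKVLW"]  # made-up sequences, not the virus proteome
print(bootstrap_permille(toy, "AA"))  # present in both proteins
print(bootstrap_permille(toy, "WW"))  # absent everywhere: (0.0, 0.0)
```

With only 2 proteins there are very few distinct resamples, and a dipeptide that occurs in neither protein has zero frequency in every resample, so its estimated error is 0.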

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski