Amino acid dipeptide frequency for Circovirus-like genome CB-A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
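As an illustration of the idea, the following minimal Python sketch counts overlapping adjacent pairs in each protein and reports them in per mille. The function name, the toy sequences and the choice of denominator (total number of dipeptides counted) are assumptions made for this example; the exact normalization used by the database is not stated on this page.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Two made-up sequences in one-letter code (not the CB-A proteins).
toy = ["MKKA", "AKK"]
freqs = dipeptide_frequencies(toy)
```

Here the toy input contains five dipeptides (MK, KK, KA, AK, KK), so KK accounts for two of the five.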

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
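The comparison with the uniform expectation can be sketched as follows. The uniform expectation of 1/400 corresponds to 0.25%, or 2.5‰. The simple greater-than/less-than rule and the names used here are assumptions for illustration; the page does not specify the exact threshold behind the red and blue marking. The four example values are taken from the table.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # would be marked red under this assumed rule
    if freq_per_mille < expected:
        return "under"   # would be marked blue under this assumed rule
    return "expected"

# Values (in per mille) copied from the table.
examples = {"ProPro": 12.945, "AlaAla": 4.854, "CysPhe": 1.618, "AlaCys": 0.0}
labels = {pair: classify(value) for pair, value in examples.items()}

# Converting per mille to a plain fraction means multiplying by 1e-3.
fraction_propro = examples["ProPro"] * 1e-3
```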

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.854 ± 3.082
AlaCys: 0.0 ± 0.0
AlaAsp: 4.854 ± 0.778
AlaGlu: 3.236 ± 2.055
AlaPhe: 3.236 ± 0.25
AlaGly: 3.236 ± 2.055
AlaHis: 0.0 ± 0.0
AlaIle: 0.0 ± 0.0
AlaLys: 3.236 ± 2.055
AlaLeu: 3.236 ± 0.25
AlaMet: 3.236 ± 2.555
AlaAsn: 3.236 ± 0.25
AlaPro: 3.236 ± 2.055
AlaGln: 3.236 ± 0.25
AlaArg: 3.236 ± 0.25
AlaSer: 3.236 ± 0.25
AlaThr: 9.709 ± 1.555
AlaVal: 4.854 ± 0.778
AlaTrp: 0.0 ± 0.0
AlaTyr: 4.854 ± 1.527
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 1.618 ± 1.027
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.618 ± 1.277
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 1.618 ± 1.027
CysAsn: 1.618 ± 1.277
CysPro: 0.0 ± 0.0
CysGln: 1.618 ± 1.027
CysArg: 1.618 ± 1.027
CysSer: 1.618 ± 1.027
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.618 ± 1.277
AspCys: 1.618 ± 1.277
AspAsp: 4.854 ± 0.778
AspGlu: 3.236 ± 0.25
AspPhe: 1.618 ± 1.027
AspGly: 4.854 ± 3.082
AspHis: 4.854 ± 3.082
AspIle: 3.236 ± 2.055
AspLys: 1.618 ± 1.277
AspLeu: 1.618 ± 1.277
AspMet: 0.0 ± 0.0
AspAsn: 3.236 ± 2.555
AspPro: 0.0 ± 0.0
AspGln: 1.618 ± 1.027
AspArg: 1.618 ± 1.027
AspSer: 1.618 ± 1.027
AspThr: 4.854 ± 3.832
AspVal: 6.472 ± 4.11
AspTrp: 0.0 ± 0.0
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.091 ± 2.833
GluCys: 0.0 ± 0.0
GluAsp: 6.472 ± 1.805
GluGlu: 4.854 ± 3.082
GluPhe: 4.854 ± 0.778
GluGly: 4.854 ± 3.082
GluHis: 1.618 ± 1.027
GluIle: 0.0 ± 0.0
GluLys: 3.236 ± 2.055
GluLeu: 1.618 ± 1.027
GluMet: 1.618 ± 1.027
GluAsn: 0.0 ± 0.0
GluPro: 4.854 ± 1.527
GluGln: 0.0 ± 0.0
GluArg: 0.0 ± 0.0
GluSer: 6.472 ± 0.5
GluThr: 1.618 ± 1.027
GluVal: 6.472 ± 0.5
GluTrp: 0.0 ± 0.0
GluTyr: 3.236 ± 2.055
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.0 ± 0.0
PheAsp: 3.236 ± 2.055
PheGlu: 3.236 ± 0.25
PhePhe: 1.618 ± 1.027
PheGly: 4.854 ± 0.778
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 9.709 ± 1.555
PheLeu: 0.0 ± 0.0
PheMet: 0.0 ± 0.865
PheAsn: 1.618 ± 1.277
PhePro: 1.618 ± 1.277
PheGln: 0.0 ± 0.0
PheArg: 4.854 ± 3.832
PheSer: 4.854 ± 0.778
PheThr: 3.236 ± 0.25
PheVal: 4.854 ± 0.778
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.236 ± 0.25
GlyCys: 0.0 ± 0.0
GlyAsp: 3.236 ± 0.25
GlyGlu: 8.091 ± 2.833
GlyPhe: 4.854 ± 1.527
GlyGly: 6.472 ± 1.805
GlyHis: 1.618 ± 1.027
GlyIle: 1.618 ± 1.027
GlyLys: 4.854 ± 3.082
GlyLeu: 0.0 ± 0.0
GlyMet: 0.0 ± 0.0
GlyAsn: 3.236 ± 0.25
GlyPro: 1.618 ± 1.027
GlyGln: 0.0 ± 0.0
GlyArg: 1.618 ± 1.027
GlySer: 1.618 ± 1.277
GlyThr: 6.472 ± 1.805
GlyVal: 1.618 ± 1.277
GlyTrp: 0.0 ± 0.0
GlyTyr: 4.854 ± 3.082
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.618 ± 1.027
HisCys: 0.0 ± 0.0
HisAsp: 1.618 ± 1.277
HisGlu: 3.236 ± 2.055
HisPhe: 1.618 ± 1.027
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 1.618 ± 1.027
HisLys: 1.618 ± 1.277
HisLeu: 1.618 ± 1.027
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.618 ± 1.027
HisGln: 1.618 ± 1.277
HisArg: 0.0 ± 0.0
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 3.236 ± 2.055
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.236 ± 2.555
IleCys: 1.618 ± 1.027
IleAsp: 1.618 ± 1.027
IleGlu: 3.236 ± 2.055
IlePhe: 4.854 ± 1.527
IleGly: 0.0 ± 0.0
IleHis: 0.0 ± 0.0
IleIle: 1.618 ± 1.027
IleLys: 8.091 ± 1.777
IleLeu: 1.618 ± 1.027
IleMet: 3.236 ± 0.25
IleAsn: 0.0 ± 0.0
IlePro: 3.236 ± 0.25
IleGln: 1.618 ± 1.027
IleArg: 6.472 ± 4.11
IleSer: 1.618 ± 1.277
IleThr: 1.618 ± 1.277
IleVal: 4.854 ± 0.778
IleTrp: 3.236 ± 0.25
IleTyr: 1.618 ± 1.027
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.618 ± 1.027
LysCys: 1.618 ± 1.027
LysAsp: 3.236 ± 0.25
LysGlu: 6.472 ± 1.805
LysPhe: 1.618 ± 1.277
LysGly: 3.236 ± 0.25
LysHis: 1.618 ± 1.277
LysIle: 9.709 ± 0.75
LysLys: 8.091 ± 4.082
LysLeu: 0.0 ± 0.0
LysMet: 1.618 ± 1.027
LysAsn: 4.854 ± 0.778
LysPro: 6.472 ± 1.805
LysGln: 0.0 ± 0.0
LysArg: 8.091 ± 1.777
LysSer: 9.709 ± 1.555
LysThr: 3.236 ± 2.055
LysVal: 6.472 ± 2.805
LysTrp: 1.618 ± 1.027
LysTyr: 3.236 ± 0.25
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 0.0 ± 0.0
LeuCys: 0.0 ± 0.0
LeuAsp: 0.0 ± 0.0
LeuGlu: 3.236 ± 0.25
LeuPhe: 0.0 ± 0.0
LeuGly: 1.618 ± 1.277
LeuHis: 0.0 ± 0.0
LeuIle: 1.618 ± 1.027
LeuLys: 9.709 ± 3.054
LeuLeu: 0.0 ± 0.0
LeuMet: 0.0 ± 0.0
LeuAsn: 6.472 ± 2.805
LeuPro: 6.472 ± 2.805
LeuGln: 1.618 ± 1.027
LeuArg: 0.0 ± 0.0
LeuSer: 1.618 ± 1.277
LeuThr: 8.091 ± 4.082
LeuVal: 1.618 ± 1.277
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.618 ± 1.027
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.618 ± 1.277
MetCys: 0.0 ± 0.0
MetAsp: 1.618 ± 1.027
MetGlu: 0.0 ± 0.0
MetPhe: 1.618 ± 1.277
MetGly: 1.618 ± 1.277
MetHis: 0.0 ± 0.0
MetIle: 1.618 ± 1.027
MetLys: 3.236 ± 2.055
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 3.236 ± 2.555
MetGln: 0.0 ± 0.0
MetArg: 1.618 ± 1.277
MetSer: 1.618 ± 1.027
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 3.236 ± 0.25
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.854 ± 1.527
AsnCys: 1.618 ± 1.277
AsnAsp: 1.618 ± 1.027
AsnGlu: 1.618 ± 1.277
AsnPhe: 3.236 ± 0.25
AsnGly: 1.618 ± 1.277
AsnHis: 0.0 ± 0.0
AsnIle: 1.618 ± 1.277
AsnLys: 6.472 ± 1.805
AsnLeu: 6.472 ± 5.109
AsnMet: 1.618 ± 1.277
AsnAsn: 4.854 ± 3.082
AsnPro: 1.618 ± 1.277
AsnGln: 1.618 ± 1.027
AsnArg: 0.0 ± 0.0
AsnSer: 1.618 ± 1.277
AsnThr: 4.854 ± 1.527
AsnVal: 3.236 ± 2.555
AsnTrp: 3.236 ± 2.055
AsnTyr: 6.472 ± 1.805
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 9.709 ± 6.165
ProCys: 1.618 ± 1.027
ProAsp: 3.236 ± 2.555
ProGlu: 4.854 ± 0.778
ProPhe: 1.618 ± 1.027
ProGly: 3.236 ± 0.25
ProHis: 3.236 ± 0.25
ProIle: 3.236 ± 0.25
ProLys: 4.854 ± 1.527
ProLeu: 1.618 ± 1.277
ProMet: 0.0 ± 0.0
ProAsn: 3.236 ± 0.25
ProPro: 12.945 ± 3.61
ProGln: 3.236 ± 0.25
ProArg: 1.618 ± 1.027
ProSer: 4.854 ± 0.778
ProThr: 6.472 ± 5.109
ProVal: 0.0 ± 0.0
ProTrp: 1.618 ± 1.277
ProTyr: 1.618 ± 1.277
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.618 ± 1.027
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 0.0 ± 0.0
GlnPhe: 0.0 ± 0.0
GlnGly: 1.618 ± 1.027
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 3.236 ± 2.055
GlnLeu: 1.618 ± 1.277
GlnMet: 3.236 ± 0.948
GlnAsn: 3.236 ± 2.055
GlnPro: 3.236 ± 0.25
GlnGln: 1.618 ± 1.027
GlnArg: 0.0 ± 0.0
GlnSer: 3.236 ± 0.25
GlnThr: 0.0 ± 0.0
GlnVal: 0.0 ± 0.0
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.236 ± 0.25
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.091 ± 4.082
ArgCys: 0.0 ± 0.0
ArgAsp: 0.0 ± 0.0
ArgGlu: 3.236 ± 0.25
ArgPhe: 3.236 ± 2.055
ArgGly: 3.236 ± 0.25
ArgHis: 1.618 ± 1.027
ArgIle: 4.854 ± 1.527
ArgLys: 1.618 ± 1.277
ArgLeu: 0.0 ± 0.0
ArgMet: 3.236 ± 0.25
ArgAsn: 1.618 ± 1.027
ArgPro: 4.854 ± 0.778
ArgGln: 3.236 ± 0.25
ArgArg: 11.327 ± 2.027
ArgSer: 4.854 ± 1.527
ArgThr: 0.0 ± 0.0
ArgVal: 3.236 ± 0.25
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.236 ± 0.25
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.854 ± 0.778
SerCys: 0.0 ± 0.0
SerAsp: 4.854 ± 1.527
SerGlu: 3.236 ± 0.25
SerPhe: 3.236 ± 2.555
SerGly: 4.854 ± 3.082
SerHis: 0.0 ± 0.0
SerIle: 3.236 ± 0.25
SerLys: 4.854 ± 1.527
SerLeu: 4.854 ± 1.527
SerMet: 0.0 ± 0.0
SerAsn: 3.236 ± 2.055
SerPro: 4.854 ± 3.082
SerGln: 1.618 ± 1.027
SerArg: 4.854 ± 0.778
SerSer: 6.472 ± 2.805
SerThr: 4.854 ± 1.527
SerVal: 4.854 ± 1.527
SerTrp: 0.0 ± 0.0
SerTyr: 4.854 ± 3.832
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 0.0 ± 0.0
ThrCys: 0.0 ± 0.0
ThrAsp: 3.236 ± 2.055
ThrGlu: 1.618 ± 1.027
ThrPhe: 0.0 ± 0.0
ThrGly: 3.236 ± 0.25
ThrHis: 0.0 ± 0.0
ThrIle: 4.854 ± 0.778
ThrLys: 3.236 ± 2.055
ThrLeu: 11.327 ± 6.637
ThrMet: 0.0 ± 0.0
ThrAsn: 6.472 ± 5.109
ThrPro: 8.091 ± 1.777
ThrGln: 0.0 ± 0.0
ThrArg: 4.854 ± 1.527
ThrSer: 3.236 ± 0.25
ThrThr: 3.236 ± 0.25
ThrVal: 3.236 ± 0.25
ThrTrp: 0.0 ± 0.0
ThrTyr: 8.091 ± 1.777
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.709 ± 3.86
ValCys: 0.0 ± 0.0
ValAsp: 1.618 ± 1.277
ValGlu: 0.0 ± 0.0
ValPhe: 0.0 ± 0.0
ValGly: 3.236 ± 0.25
ValHis: 3.236 ± 0.25
ValIle: 8.091 ± 2.833
ValLys: 1.618 ± 1.027
ValLeu: 4.854 ± 0.778
ValMet: 0.0 ± 0.0
ValAsn: 4.854 ± 3.832
ValPro: 3.236 ± 2.555
ValGln: 3.236 ± 0.25
ValArg: 4.854 ± 1.527
ValSer: 3.236 ± 2.555
ValThr: 1.618 ± 1.027
ValVal: 9.709 ± 0.75
ValTrp: 0.0 ± 0.0
ValTyr: 4.854 ± 1.527
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.618 ± 1.027
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 3.236 ± 0.25
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 3.236 ± 2.055
TrpThr: 0.0 ± 0.0
TrpVal: 1.618 ± 1.027
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.618 ± 1.277
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 1.618 ± 1.027
TyrAsp: 3.236 ± 2.055
TyrGlu: 6.472 ± 4.11
TyrPhe: 3.236 ± 0.25
TyrGly: 3.236 ± 2.055
TyrHis: 1.618 ± 1.027
TyrIle: 4.854 ± 1.527
TyrLys: 3.236 ± 2.555
TyrLeu: 3.236 ± 0.25
TyrMet: 0.0 ± 0.0
TyrAsn: 6.472 ± 0.5
TyrPro: 1.618 ± 1.027
TyrGln: 0.0 ± 0.0
TyrArg: 4.854 ± 3.832
TyrSer: 4.854 ± 1.527
TyrThr: 4.854 ± 1.527
TyrVal: 4.854 ± 1.527
TyrTrp: 1.618 ± 1.027
TyrTyr: 4.854 ± 3.832
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (619 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
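A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The function names, the fixed seed, the toy sequences and the use of the population standard deviation are all assumptions for illustration; the page does not describe the database's actual procedure beyond the note above.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent pairs in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation (per mille) of one dipeptide's frequency
    across n_boot resamples of whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins (not residues) with replacement.
        sample = [rng.choice(per_protein) for _ in proteins]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)

# Two made-up sequences in one-letter code (not the CB-A proteins).
toy = ["MKKA", "AKK"]
err_kk = bootstrap_error(toy, "KK")
err_absent = bootstrap_error(toy, "WW")
```

Under this scheme, a dipeptide that occurs in none of the proteins has a frequency of zero in every replicate, so its estimated error is zero as well.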

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski