Amino acid dipepetide frequency for Circoviridae 9 LDMD-2013

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.112AlaAla: 7.112 ± 3.284
0.0AlaCys: 0.0 ± 0.0
7.112AlaAsp: 7.112 ± 4.612
4.267AlaGlu: 4.267 ± 1.41
1.422AlaPhe: 1.422 ± 0.922
5.69AlaGly: 5.69 ± 2.487
2.845AlaHis: 2.845 ± 2.409
2.845AlaIle: 2.845 ± 0.932
4.267AlaLys: 4.267 ± 2.921
5.69AlaLeu: 5.69 ± 3.69
0.0AlaMet: 0.0 ± 0.0
2.845AlaAsn: 2.845 ± 0.932
1.422AlaPro: 1.422 ± 1.205
2.845AlaGln: 2.845 ± 1.447
4.267AlaArg: 4.267 ± 0.782
5.69AlaSer: 5.69 ± 0.612
5.69AlaThr: 5.69 ± 2.193
4.267AlaVal: 4.267 ± 1.41
1.422AlaTrp: 1.422 ± 1.612
1.422AlaTyr: 1.422 ± 1.612
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
1.422CysPhe: 1.422 ± 0.922
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
2.845CysLys: 2.845 ± 2.409
0.0CysLeu: 0.0 ± 0.0
2.845CysMet: 2.845 ± 0.896
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.422CysArg: 1.422 ± 1.205
1.422CysSer: 1.422 ± 0.922
1.422CysThr: 1.422 ± 1.205
0.0CysVal: 0.0 ± 0.0
1.422CysTrp: 1.422 ± 0.922
1.422CysTyr: 1.422 ± 0.922
0.0CysXaa: 0.0 ± 0.0
Asp
5.69AspAla: 5.69 ± 3.69
0.0AspCys: 0.0 ± 0.0
1.422AspAsp: 1.422 ± 1.205
5.69AspGlu: 5.69 ± 3.101
1.422AspPhe: 1.422 ± 0.922
1.422AspGly: 1.422 ± 1.205
0.0AspHis: 0.0 ± 0.0
1.422AspIle: 1.422 ± 1.612
4.267AspLys: 4.267 ± 1.41
2.845AspLeu: 2.845 ± 2.409
1.422AspMet: 1.422 ± 0.922
4.267AspAsn: 4.267 ± 1.41
1.422AspPro: 1.422 ± 1.612
0.0AspGln: 0.0 ± 0.0
2.845AspArg: 2.845 ± 1.597
5.69AspSer: 5.69 ± 2.193
9.957AspThr: 9.957 ± 1.958
2.845AspVal: 2.845 ± 0.932
0.0AspTrp: 0.0 ± 0.0
2.845AspTyr: 2.845 ± 2.409
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
1.422GluCys: 1.422 ± 1.205
4.267GluAsp: 4.267 ± 0.782
4.267GluGlu: 4.267 ± 3.614
1.422GluPhe: 1.422 ± 1.205
1.422GluGly: 1.422 ± 1.205
1.422GluHis: 1.422 ± 1.205
1.422GluIle: 1.422 ± 1.205
2.845GluLys: 2.845 ± 1.447
8.535GluLeu: 8.535 ± 3.894
0.0GluMet: 0.0 ± 0.0
0.0GluAsn: 0.0 ± 0.0
1.422GluPro: 1.422 ± 0.922
1.422GluGln: 1.422 ± 1.205
1.422GluArg: 1.422 ± 0.922
2.845GluSer: 2.845 ± 0.932
2.845GluThr: 2.845 ± 0.932
2.845GluVal: 2.845 ± 1.845
1.422GluTrp: 1.422 ± 0.922
2.845GluTyr: 2.845 ± 0.932
0.0GluXaa: 0.0 ± 0.0
Phe
4.267PheAla: 4.267 ± 0.782
0.0PheCys: 0.0 ± 0.0
0.0PheAsp: 0.0 ± 0.0
1.422PheGlu: 1.422 ± 1.612
0.0PhePhe: 0.0 ± 0.0
2.845PheGly: 2.845 ± 1.597
1.422PheHis: 1.422 ± 0.922
2.845PheIle: 2.845 ± 1.447
1.422PheLys: 1.422 ± 0.922
1.422PheLeu: 1.422 ± 0.922
1.422PheMet: 1.422 ± 0.922
2.845PheAsn: 2.845 ± 0.932
0.0PhePro: 0.0 ± 0.0
0.0PheGln: 0.0 ± 0.0
1.422PheArg: 1.422 ± 1.612
4.267PheSer: 4.267 ± 0.782
5.69PheThr: 5.69 ± 1.865
1.422PheVal: 1.422 ± 0.922
0.0PheTrp: 0.0 ± 0.0
1.422PheTyr: 1.422 ± 1.205
0.0PheXaa: 0.0 ± 0.0
Gly
9.957GlyAla: 9.957 ± 3.389
2.845GlyCys: 2.845 ± 2.409
7.112GlyAsp: 7.112 ± 4.282
2.845GlyGlu: 2.845 ± 1.447
0.0GlyPhe: 0.0 ± 0.0
2.845GlyGly: 2.845 ± 1.597
1.422GlyHis: 1.422 ± 1.612
4.267GlyIle: 4.267 ± 2.767
11.38GlyLys: 11.38 ± 2.851
2.845GlyLeu: 2.845 ± 0.932
0.0GlyMet: 0.0 ± 0.0
2.845GlyAsn: 2.845 ± 1.845
1.422GlyPro: 1.422 ± 0.922
1.422GlyGln: 1.422 ± 0.922
4.267GlyArg: 4.267 ± 0.782
9.957GlySer: 9.957 ± 1.058
7.112GlyThr: 7.112 ± 4.282
5.69GlyVal: 5.69 ± 2.193
1.422GlyTrp: 1.422 ± 0.922
5.69GlyTyr: 5.69 ± 2.193
0.0GlyXaa: 0.0 ± 0.0
His
1.422HisAla: 1.422 ± 0.922
0.0HisCys: 0.0 ± 0.0
1.422HisAsp: 1.422 ± 1.205
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
1.422HisGly: 1.422 ± 1.205
1.422HisHis: 1.422 ± 1.205
2.845HisIle: 2.845 ± 2.409
1.422HisLys: 1.422 ± 0.922
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
2.845HisGln: 2.845 ± 1.447
0.0HisArg: 0.0 ± 0.0
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
4.267HisTrp: 4.267 ± 3.614
1.422HisTyr: 1.422 ± 1.612
0.0HisXaa: 0.0 ± 0.0
Ile
4.267IleAla: 4.267 ± 2.767
0.0IleCys: 0.0 ± 0.0
1.422IleAsp: 1.422 ± 1.205
1.422IleGlu: 1.422 ± 1.205
0.0IlePhe: 0.0 ± 0.0
8.535IleGly: 8.535 ± 2.016
0.0IleHis: 0.0 ± 0.0
1.422IleIle: 1.422 ± 0.922
5.69IleLys: 5.69 ± 0.612
1.422IleLeu: 1.422 ± 1.205
0.0IleMet: 0.0 ± 0.0
2.845IleAsn: 2.845 ± 3.224
1.422IlePro: 1.422 ± 1.612
4.267IleGln: 4.267 ± 1.41
7.112IleArg: 7.112 ± 2.206
4.267IleSer: 4.267 ± 2.767
2.845IleThr: 2.845 ± 1.447
4.267IleVal: 4.267 ± 1.41
1.422IleTrp: 1.422 ± 0.922
4.267IleTyr: 4.267 ± 1.41
0.0IleXaa: 0.0 ± 0.0
Lys
1.422LysAla: 1.422 ± 1.612
2.845LysCys: 2.845 ± 1.845
2.845LysAsp: 2.845 ± 0.932
2.845LysGlu: 2.845 ± 2.409
5.69LysPhe: 5.69 ± 4.49
8.535LysGly: 8.535 ± 3.568
1.422LysHis: 1.422 ± 1.205
2.845LysIle: 2.845 ± 1.845
8.535LysLys: 8.535 ± 2.88
1.422LysLeu: 1.422 ± 0.922
1.422LysMet: 1.422 ± 1.612
0.0LysAsn: 0.0 ± 0.0
4.267LysPro: 4.267 ± 2.974
7.112LysGln: 7.112 ± 2.206
9.957LysArg: 9.957 ± 1.958
8.535LysSer: 8.535 ± 3.627
5.69LysThr: 5.69 ± 3.193
1.422LysVal: 1.422 ± 0.922
0.0LysTrp: 0.0 ± 0.0
5.69LysTyr: 5.69 ± 4.49
0.0LysXaa: 0.0 ± 0.0
Leu
4.267LeuAla: 4.267 ± 1.41
0.0LeuCys: 0.0 ± 0.0
0.0LeuAsp: 0.0 ± 0.0
7.112LeuGlu: 7.112 ± 3.055
1.422LeuPhe: 1.422 ± 1.205
2.845LeuGly: 2.845 ± 0.932
2.845LeuHis: 2.845 ± 1.447
4.267LeuIle: 4.267 ± 1.41
2.845LeuLys: 2.845 ± 1.845
2.845LeuLeu: 2.845 ± 0.932
0.0LeuMet: 0.0 ± 0.0
4.267LeuAsn: 4.267 ± 1.947
7.112LeuPro: 7.112 ± 6.082
2.845LeuGln: 2.845 ± 0.932
0.0LeuArg: 0.0 ± 0.0
2.845LeuSer: 2.845 ± 0.932
5.69LeuThr: 5.69 ± 1.426
4.267LeuVal: 4.267 ± 1.41
0.0LeuTrp: 0.0 ± 0.0
7.112LeuTyr: 7.112 ± 4.282
0.0LeuXaa: 0.0 ± 0.0
Met
2.845MetAla: 2.845 ± 1.447
0.0MetCys: 0.0 ± 0.0
1.422MetAsp: 1.422 ± 1.205
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
4.267MetGly: 4.267 ± 1.41
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.422MetLys: 1.422 ± 0.922
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
1.422MetAsn: 1.422 ± 1.205
1.422MetPro: 1.422 ± 1.612
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
1.422MetSer: 1.422 ± 1.205
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.845AsnAla: 2.845 ± 1.845
1.422AsnCys: 1.422 ± 0.922
2.845AsnAsp: 2.845 ± 1.447
4.267AsnGlu: 4.267 ± 1.41
0.0AsnPhe: 0.0 ± 0.0
4.267AsnGly: 4.267 ± 1.41
0.0AsnHis: 0.0 ± 0.0
1.422AsnIle: 1.422 ± 0.922
1.422AsnLys: 1.422 ± 1.205
4.267AsnLeu: 4.267 ± 1.41
0.0AsnMet: 0.0 ± 0.0
1.422AsnAsn: 1.422 ± 1.205
4.267AsnPro: 4.267 ± 2.767
1.422AsnGln: 1.422 ± 0.922
2.845AsnArg: 2.845 ± 1.447
4.267AsnSer: 4.267 ± 0.782
0.0AsnThr: 0.0 ± 0.0
0.0AsnVal: 0.0 ± 0.0
1.422AsnTrp: 1.422 ± 0.922
2.845AsnTyr: 2.845 ± 0.932
0.0AsnXaa: 0.0 ± 0.0
Pro
0.0ProAla: 0.0 ± 0.0
0.0ProCys: 0.0 ± 0.0
2.845ProAsp: 2.845 ± 1.447
0.0ProGlu: 0.0 ± 0.0
0.0ProPhe: 0.0 ± 0.0
4.267ProGly: 4.267 ± 1.41
1.422ProHis: 1.422 ± 1.205
2.845ProIle: 2.845 ± 1.845
8.535ProLys: 8.535 ± 9.671
2.845ProLeu: 2.845 ± 1.845
0.0ProMet: 0.0 ± 0.0
1.422ProAsn: 1.422 ± 0.922
4.267ProPro: 4.267 ± 4.835
1.422ProGln: 1.422 ± 1.612
5.69ProArg: 5.69 ± 4.509
2.845ProSer: 2.845 ± 3.224
5.69ProThr: 5.69 ± 3.101
1.422ProVal: 1.422 ± 0.922
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.845GlnAla: 2.845 ± 1.597
0.0GlnCys: 0.0 ± 0.0
2.845GlnAsp: 2.845 ± 0.932
1.422GlnGlu: 1.422 ± 1.205
2.845GlnPhe: 2.845 ± 1.845
1.422GlnGly: 1.422 ± 1.205
1.422GlnHis: 1.422 ± 1.205
0.0GlnIle: 0.0 ± 0.0
1.422GlnLys: 1.422 ± 1.612
1.422GlnLeu: 1.422 ± 1.205
0.0GlnMet: 0.0 ± 0.0
2.845GlnAsn: 2.845 ± 1.845
2.845GlnPro: 2.845 ± 1.845
0.0GlnGln: 0.0 ± 0.0
0.0GlnArg: 0.0 ± 0.0
2.845GlnSer: 2.845 ± 1.845
0.0GlnThr: 0.0 ± 0.0
2.845GlnVal: 2.845 ± 1.597
1.422GlnTrp: 1.422 ± 0.922
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.267ArgAla: 4.267 ± 3.614
0.0ArgCys: 0.0 ± 0.0
5.69ArgAsp: 5.69 ± 0.612
1.422ArgGlu: 1.422 ± 0.922
2.845ArgPhe: 2.845 ± 2.409
1.422ArgGly: 1.422 ± 1.612
0.0ArgHis: 0.0 ± 0.0
7.112ArgIle: 7.112 ± 1.356
7.112ArgLys: 7.112 ± 3.96
2.845ArgLeu: 2.845 ± 1.447
1.422ArgMet: 1.422 ± 1.16
0.0ArgAsn: 0.0 ± 0.0
0.0ArgPro: 0.0 ± 0.0
0.0ArgGln: 0.0 ± 0.0
5.69ArgArg: 5.69 ± 3.193
7.112ArgSer: 7.112 ± 3.96
11.38ArgThr: 11.38 ± 3.1
0.0ArgVal: 0.0 ± 0.0
1.422ArgTrp: 1.422 ± 1.205
1.422ArgTyr: 1.422 ± 1.205
0.0ArgXaa: 0.0 ± 0.0
Ser
4.267SerAla: 4.267 ± 1.813
0.0SerCys: 0.0 ± 0.0
1.422SerAsp: 1.422 ± 1.205
2.845SerGlu: 2.845 ± 0.932
5.69SerPhe: 5.69 ± 2.487
15.647SerGly: 15.647 ± 0.816
0.0SerHis: 0.0 ± 0.0
7.112SerIle: 7.112 ± 1.356
7.112SerLys: 7.112 ± 3.148
9.957SerLeu: 9.957 ± 2.424
1.422SerMet: 1.422 ± 1.047
2.845SerAsn: 2.845 ± 1.845
2.845SerPro: 2.845 ± 1.597
0.0SerGln: 0.0 ± 0.0
0.0SerArg: 0.0 ± 0.0
8.535SerSer: 8.535 ± 7.682
7.112SerThr: 7.112 ± 1.356
2.845SerVal: 2.845 ± 1.845
0.0SerTrp: 0.0 ± 0.0
8.535SerTyr: 8.535 ± 1.563
0.0SerXaa: 0.0 ± 0.0
Thr
5.69ThrAla: 5.69 ± 2.193
2.845ThrCys: 2.845 ± 0.932
2.845ThrAsp: 2.845 ± 2.409
1.422ThrGlu: 1.422 ± 1.205
5.69ThrPhe: 5.69 ± 3.193
8.535ThrGly: 8.535 ± 3.894
0.0ThrHis: 0.0 ± 0.0
4.267ThrIle: 4.267 ± 1.813
2.845ThrLys: 2.845 ± 0.932
4.267ThrLeu: 4.267 ± 2.325
1.422ThrMet: 1.422 ± 1.205
4.267ThrAsn: 4.267 ± 2.767
8.535ThrPro: 8.535 ± 4.79
1.422ThrGln: 1.422 ± 1.205
8.535ThrArg: 8.535 ± 4.79
5.69ThrSer: 5.69 ± 2.193
5.69ThrThr: 5.69 ± 1.865
5.69ThrVal: 5.69 ± 2.193
0.0ThrTrp: 0.0 ± 0.0
1.422ThrTyr: 1.422 ± 1.612
0.0ThrXaa: 0.0 ± 0.0
Val
4.267ValAla: 4.267 ± 2.767
0.0ValCys: 0.0 ± 0.0
5.69ValAsp: 5.69 ± 3.69
1.422ValGlu: 1.422 ± 1.205
1.422ValPhe: 1.422 ± 1.205
5.69ValGly: 5.69 ± 3.69
1.422ValHis: 1.422 ± 1.205
2.845ValIle: 2.845 ± 1.597
1.422ValLys: 1.422 ± 1.205
1.422ValLeu: 1.422 ± 0.922
0.0ValMet: 0.0 ± 0.0
4.267ValAsn: 4.267 ± 1.813
2.845ValPro: 2.845 ± 1.845
0.0ValGln: 0.0 ± 0.0
2.845ValArg: 2.845 ± 0.932
4.267ValSer: 4.267 ± 1.41
1.422ValThr: 1.422 ± 0.922
4.267ValVal: 4.267 ± 2.767
1.422ValTrp: 1.422 ± 1.205
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.422TrpAla: 1.422 ± 0.922
0.0TrpCys: 0.0 ± 0.0
4.267TrpAsp: 4.267 ± 2.325
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
1.422TrpGly: 1.422 ± 1.205
1.422TrpHis: 1.422 ± 0.922
4.267TrpIle: 4.267 ± 1.813
0.0TrpLys: 0.0 ± 0.0
1.422TrpLeu: 1.422 ± 1.205
0.0TrpMet: 0.0 ± 0.0
2.845TrpAsn: 2.845 ± 1.845
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.422TrpVal: 1.422 ± 1.205
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.267TyrAla: 4.267 ± 1.947
2.845TyrCys: 2.845 ± 0.932
0.0TyrAsp: 0.0 ± 0.0
1.422TyrGlu: 1.422 ± 1.205
2.845TyrPhe: 2.845 ± 1.845
2.845TyrGly: 2.845 ± 0.932
0.0TyrHis: 0.0 ± 0.0
2.845TyrIle: 2.845 ± 2.409
5.69TyrLys: 5.69 ± 0.612
7.112TyrLeu: 7.112 ± 2.339
1.422TyrMet: 1.422 ± 1.205
0.0TyrAsn: 0.0 ± 0.0
0.0TyrPro: 0.0 ± 0.0
1.422TyrGln: 1.422 ± 1.205
2.845TyrArg: 2.845 ± 1.447
7.112TyrSer: 7.112 ± 1.356
2.845TyrThr: 2.845 ± 1.597
1.422TyrVal: 1.422 ± 1.612
1.422TyrTrp: 1.422 ± 1.612
1.422TyrTyr: 1.422 ± 1.612
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (704 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski