Amino acid dipeptide frequency for Avon-Heathcote Estuary associated circular virus 26

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally frequent, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
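The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function name `dipeptide_permille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: overlapping pairs are counted inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Map every letter outside the standard alphabet to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input (hypothetical sequences, not the proteome described on this page).
freqs = dipeptide_permille(["MKKSA", "MKT"])

# Uniform expectation over the 400 standard dipeptides, in per mille.
expected = 1000.0 / 400

over = sorted(p for p, f in freqs.items() if f > expected)
print(len(freqs), expected, over)
```

Pairs whose value exceeds `expected` correspond to the overrepresented cells in the table, and pairs below it to the underrepresented ones.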

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.422AlaAla: 1.422 ± 0.812
0.0AlaCys: 0.0 ± 0.0
1.422AlaAsp: 1.422 ± 0.812
0.0AlaGlu: 0.0 ± 0.0
4.267AlaPhe: 4.267 ± 0.349
2.845AlaGly: 2.845 ± 1.623
2.845AlaHis: 2.845 ± 0.463
1.422AlaIle: 1.422 ± 0.812
5.69AlaLys: 5.69 ± 3.012
1.422AlaLeu: 1.422 ± 1.275
1.422AlaMet: 1.422 ± 0.812
0.0AlaAsn: 0.0 ± 0.0
1.422AlaPro: 1.422 ± 1.275
0.0AlaGln: 0.0 ± 0.0
4.267AlaArg: 4.267 ± 0.349
5.69AlaSer: 5.69 ± 1.16
2.845AlaThr: 2.845 ± 1.623
4.267AlaVal: 4.267 ± 2.435
0.0AlaTrp: 0.0 ± 0.0
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
2.845CysAla: 2.845 ± 2.549
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
1.422CysHis: 1.422 ± 1.275
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.422CysAsn: 1.422 ± 0.812
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
1.422CysThr: 1.422 ± 1.275
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.422AspAla: 1.422 ± 0.812
0.0AspCys: 0.0 ± 0.0
2.845AspAsp: 2.845 ± 0.463
2.845AspGlu: 2.845 ± 2.549
2.845AspPhe: 2.845 ± 2.549
2.845AspGly: 2.845 ± 2.549
1.422AspHis: 1.422 ± 0.812
1.422AspIle: 1.422 ± 1.275
2.845AspLys: 2.845 ± 1.623
4.267AspLeu: 4.267 ± 1.738
1.422AspMet: 1.422 ± 0.812
1.422AspAsn: 1.422 ± 0.812
4.267AspPro: 4.267 ± 0.349
2.845AspGln: 2.845 ± 0.463
1.422AspArg: 1.422 ± 0.812
4.267AspSer: 4.267 ± 0.349
5.69AspThr: 5.69 ± 3.247
2.845AspVal: 2.845 ± 0.463
1.422AspTrp: 1.422 ± 1.275
1.422AspTyr: 1.422 ± 1.275
0.0AspXaa: 0.0 ± 0.0
Glu
1.422GluAla: 1.422 ± 1.275
0.0GluCys: 0.0 ± 0.0
0.0GluAsp: 0.0 ± 0.0
4.267GluGlu: 4.267 ± 3.824
5.69GluPhe: 5.69 ± 3.012
1.422GluGly: 1.422 ± 1.275
0.0GluHis: 0.0 ± 0.0
4.267GluIle: 4.267 ± 1.738
4.267GluLys: 4.267 ± 3.824
5.69GluLeu: 5.69 ± 1.16
0.0GluMet: 0.0 ± 0.0
1.422GluAsn: 1.422 ± 1.275
1.422GluPro: 1.422 ± 0.812
4.267GluGln: 4.267 ± 1.738
0.0GluArg: 0.0 ± 0.0
4.267GluSer: 4.267 ± 2.435
2.845GluThr: 2.845 ± 0.463
4.267GluVal: 4.267 ± 1.738
1.422GluTrp: 1.422 ± 0.812
1.422GluTyr: 1.422 ± 0.812
0.0GluXaa: 0.0 ± 0.0
Phe
2.845PheAla: 2.845 ± 0.463
0.0PheCys: 0.0 ± 0.0
2.845PheAsp: 2.845 ± 2.549
8.535PheGlu: 8.535 ± 1.389
2.845PhePhe: 2.845 ± 1.623
5.69PheGly: 5.69 ± 1.16
2.845PheHis: 2.845 ± 1.623
2.845PheIle: 2.845 ± 0.463
2.845PheLys: 2.845 ± 0.463
1.422PheLeu: 1.422 ± 1.275
1.422PheMet: 1.422 ± 0.812
5.69PheAsn: 5.69 ± 3.247
0.0PhePro: 0.0 ± 0.0
2.845PheGln: 2.845 ± 0.463
4.267PheArg: 4.267 ± 0.349
1.422PheSer: 1.422 ± 0.812
5.69PheThr: 5.69 ± 3.012
0.0PheVal: 0.0 ± 0.0
0.0PheTrp: 0.0 ± 0.0
1.422PheTyr: 1.422 ± 1.275
0.0PheXaa: 0.0 ± 0.0
Gly
5.69GlyAla: 5.69 ± 0.926
0.0GlyCys: 0.0 ± 0.0
5.69GlyAsp: 5.69 ± 0.926
2.845GlyGlu: 2.845 ± 0.463
4.267GlyPhe: 4.267 ± 0.349
0.0GlyGly: 0.0 ± 0.0
0.0GlyHis: 0.0 ± 0.0
1.422GlyIle: 1.422 ± 0.812
4.267GlyLys: 4.267 ± 1.738
0.0GlyLeu: 0.0 ± 0.0
1.422GlyMet: 1.422 ± 0.812
4.267GlyAsn: 4.267 ± 2.435
1.422GlyPro: 1.422 ± 0.812
9.957GlyGln: 9.957 ± 4.75
2.845GlyArg: 2.845 ± 1.623
7.112GlySer: 7.112 ± 1.972
5.69GlyThr: 5.69 ± 0.926
2.845GlyVal: 2.845 ± 0.463
0.0GlyTrp: 0.0 ± 0.0
2.845GlyTyr: 2.845 ± 2.549
0.0GlyXaa: 0.0 ± 0.0
His
1.422HisAla: 1.422 ± 0.812
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
1.422HisHis: 1.422 ± 1.275
1.422HisIle: 1.422 ± 0.812
2.845HisLys: 2.845 ± 0.463
4.267HisLeu: 4.267 ± 1.738
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
1.422HisArg: 1.422 ± 0.812
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
1.422HisVal: 1.422 ± 0.812
1.422HisTrp: 1.422 ± 1.275
4.267HisTyr: 4.267 ± 1.738
0.0HisXaa: 0.0 ± 0.0
Ile
1.422IleAla: 1.422 ± 0.812
1.422IleCys: 1.422 ± 1.275
5.69IleAsp: 5.69 ± 3.012
4.267IleGlu: 4.267 ± 0.349
0.0IlePhe: 0.0 ± 0.0
5.69IleGly: 5.69 ± 0.926
0.0IleHis: 0.0 ± 0.0
5.69IleIle: 5.69 ± 0.926
5.69IleLys: 5.69 ± 1.16
0.0IleLeu: 0.0 ± 0.0
0.0IleMet: 0.0 ± 0.0
0.0IleAsn: 0.0 ± 0.0
2.845IlePro: 2.845 ± 0.463
4.267IleGln: 4.267 ± 2.435
2.845IleArg: 2.845 ± 0.463
4.267IleSer: 4.267 ± 0.349
7.112IleThr: 7.112 ± 2.201
5.69IleVal: 5.69 ± 3.247
0.0IleTrp: 0.0 ± 0.0
2.845IleTyr: 2.845 ± 0.463
0.0IleXaa: 0.0 ± 0.0
Lys
5.69LysAla: 5.69 ± 0.926
0.0LysCys: 0.0 ± 0.0
2.845LysAsp: 2.845 ± 0.463
1.422LysGlu: 1.422 ± 1.275
1.422LysPhe: 1.422 ± 0.812
4.267LysGly: 4.267 ± 1.738
0.0LysHis: 0.0 ± 0.0
2.845LysIle: 2.845 ± 0.463
14.225LysLys: 14.225 ± 0.229
0.0LysLeu: 0.0 ± 0.0
4.267LysMet: 4.267 ± 1.206
2.845LysAsn: 2.845 ± 1.623
4.267LysPro: 4.267 ± 3.824
4.267LysGln: 4.267 ± 0.349
5.69LysArg: 5.69 ± 0.926
9.957LysSer: 9.957 ± 1.509
7.112LysThr: 7.112 ± 0.114
5.69LysVal: 5.69 ± 3.247
4.267LysTrp: 4.267 ± 3.824
7.112LysTyr: 7.112 ± 1.972
0.0LysXaa: 0.0 ± 0.0
Leu
0.0LeuAla: 0.0 ± 0.0
0.0LeuCys: 0.0 ± 0.0
4.267LeuAsp: 4.267 ± 0.349
2.845LeuGlu: 2.845 ± 2.549
1.422LeuPhe: 1.422 ± 0.812
2.845LeuGly: 2.845 ± 1.623
0.0LeuHis: 0.0 ± 0.0
1.422LeuIle: 1.422 ± 1.275
4.267LeuLys: 4.267 ± 1.738
1.422LeuLeu: 1.422 ± 1.275
1.422LeuMet: 1.422 ± 0.812
1.422LeuAsn: 1.422 ± 1.275
4.267LeuPro: 4.267 ± 0.349
4.267LeuGln: 4.267 ± 2.435
1.422LeuArg: 1.422 ± 1.275
2.845LeuSer: 2.845 ± 1.623
2.845LeuThr: 2.845 ± 0.463
1.422LeuVal: 1.422 ± 1.275
0.0LeuTrp: 0.0 ± 0.0
1.422LeuTyr: 1.422 ± 0.812
0.0LeuXaa: 0.0 ± 0.0
Met
1.422MetAla: 1.422 ± 0.812
0.0MetCys: 0.0 ± 0.0
1.422MetAsp: 1.422 ± 0.812
0.0MetGlu: 0.0 ± 0.0
1.422MetPhe: 1.422 ± 0.812
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.422MetIle: 1.422 ± 0.812
2.845MetLys: 2.845 ± 1.623
1.422MetLeu: 1.422 ± 0.812
1.422MetMet: 1.422 ± 1.275
1.422MetAsn: 1.422 ± 0.812
4.267MetPro: 4.267 ± 1.738
0.0MetGln: 0.0 ± 0.0
1.422MetArg: 1.422 ± 1.275
1.422MetSer: 1.422 ± 0.812
1.422MetThr: 1.422 ± 0.812
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.845AsnAla: 2.845 ± 1.623
0.0AsnCys: 0.0 ± 0.0
4.267AsnAsp: 4.267 ± 2.435
2.845AsnGlu: 2.845 ± 1.623
1.422AsnPhe: 1.422 ± 0.812
1.422AsnGly: 1.422 ± 0.812
1.422AsnHis: 1.422 ± 0.812
4.267AsnIle: 4.267 ± 1.738
1.422AsnLys: 1.422 ± 1.275
5.69AsnLeu: 5.69 ± 3.247
0.0AsnMet: 0.0 ± 0.0
5.69AsnAsn: 5.69 ± 0.926
2.845AsnPro: 2.845 ± 1.623
2.845AsnGln: 2.845 ± 1.623
1.422AsnArg: 1.422 ± 0.812
1.422AsnSer: 1.422 ± 0.812
8.535AsnThr: 8.535 ± 2.784
5.69AsnVal: 5.69 ± 3.012
1.422AsnTrp: 1.422 ± 0.812
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
4.267ProAla: 4.267 ± 0.349
1.422ProCys: 1.422 ± 1.275
1.422ProAsp: 1.422 ± 1.275
1.422ProGlu: 1.422 ± 1.275
4.267ProPhe: 4.267 ± 0.349
7.112ProGly: 7.112 ± 4.059
0.0ProHis: 0.0 ± 0.0
1.422ProIle: 1.422 ± 1.275
5.69ProLys: 5.69 ± 3.247
0.0ProLeu: 0.0 ± 0.0
0.0ProMet: 0.0 ± 0.0
1.422ProAsn: 1.422 ± 0.812
5.69ProPro: 5.69 ± 3.247
0.0ProGln: 0.0 ± 0.0
2.845ProArg: 2.845 ± 0.463
5.69ProSer: 5.69 ± 0.926
2.845ProThr: 2.845 ± 0.463
5.69ProVal: 5.69 ± 1.16
0.0ProTrp: 0.0 ± 0.0
1.422ProTyr: 1.422 ± 1.275
0.0ProXaa: 0.0 ± 0.0
Gln
1.422GlnAla: 1.422 ± 0.812
0.0GlnCys: 0.0 ± 0.0
1.422GlnAsp: 1.422 ± 1.275
2.845GlnGlu: 2.845 ± 0.463
5.69GlnPhe: 5.69 ± 1.16
5.69GlnGly: 5.69 ± 0.926
0.0GlnHis: 0.0 ± 0.0
4.267GlnIle: 4.267 ± 1.738
4.267GlnLys: 4.267 ± 1.738
0.0GlnLeu: 0.0 ± 0.0
0.0GlnMet: 0.0 ± 0.0
1.422GlnAsn: 1.422 ± 0.812
2.845GlnPro: 2.845 ± 1.623
1.422GlnGln: 1.422 ± 0.812
1.422GlnArg: 1.422 ± 0.812
0.0GlnSer: 0.0 ± 0.0
5.69GlnThr: 5.69 ± 1.16
5.69GlnVal: 5.69 ± 3.012
0.0GlnTrp: 0.0 ± 0.0
4.267GlnTyr: 4.267 ± 0.349
0.0GlnXaa: 0.0 ± 0.0
Arg
0.0ArgAla: 0.0 ± 0.0
1.422ArgCys: 1.422 ± 1.275
2.845ArgAsp: 2.845 ± 0.463
1.422ArgGlu: 1.422 ± 1.275
5.69ArgPhe: 5.69 ± 1.16
4.267ArgGly: 4.267 ± 1.738
1.422ArgHis: 1.422 ± 1.275
7.112ArgIle: 7.112 ± 1.972
5.69ArgLys: 5.69 ± 1.16
0.0ArgLeu: 0.0 ± 0.0
0.0ArgMet: 0.0 ± 0.0
2.845ArgAsn: 2.845 ± 1.623
1.422ArgPro: 1.422 ± 0.812
2.845ArgGln: 2.845 ± 0.463
4.267ArgArg: 4.267 ± 0.349
2.845ArgSer: 2.845 ± 0.463
4.267ArgThr: 4.267 ± 2.435
2.845ArgVal: 2.845 ± 0.463
1.422ArgTrp: 1.422 ± 1.275
4.267ArgTyr: 4.267 ± 0.349
0.0ArgXaa: 0.0 ± 0.0
Ser
2.845SerAla: 2.845 ± 1.623
0.0SerCys: 0.0 ± 0.0
4.267SerAsp: 4.267 ± 0.349
1.422SerGlu: 1.422 ± 0.812
4.267SerPhe: 4.267 ± 0.349
4.267SerGly: 4.267 ± 1.738
0.0SerHis: 0.0 ± 0.0
5.69SerIle: 5.69 ± 3.247
2.845SerLys: 2.845 ± 1.623
1.422SerLeu: 1.422 ± 0.812
2.845SerMet: 2.845 ± 1.623
4.267SerAsn: 4.267 ± 0.349
2.845SerPro: 2.845 ± 1.623
2.845SerGln: 2.845 ± 0.463
5.69SerArg: 5.69 ± 0.926
14.225SerSer: 14.225 ± 6.031
7.112SerThr: 7.112 ± 1.972
8.535SerVal: 8.535 ± 2.784
0.0SerTrp: 0.0 ± 0.0
2.845SerTyr: 2.845 ± 0.463
0.0SerXaa: 0.0 ± 0.0
Thr
2.845ThrAla: 2.845 ± 1.623
0.0ThrCys: 0.0 ± 0.0
1.422ThrAsp: 1.422 ± 0.812
4.267ThrGlu: 4.267 ± 2.435
2.845ThrPhe: 2.845 ± 1.623
9.957ThrGly: 9.957 ± 2.664
1.422ThrHis: 1.422 ± 1.275
2.845ThrIle: 2.845 ± 1.623
8.535ThrLys: 8.535 ± 1.389
4.267ThrLeu: 4.267 ± 0.349
1.422ThrMet: 1.422 ± 1.313
7.112ThrAsn: 7.112 ± 1.972
1.422ThrPro: 1.422 ± 0.812
1.422ThrGln: 1.422 ± 0.812
5.69ThrArg: 5.69 ± 3.012
4.267ThrSer: 4.267 ± 0.349
2.845ThrThr: 2.845 ± 1.623
5.69ThrVal: 5.69 ± 0.926
1.422ThrTrp: 1.422 ± 0.812
4.267ThrTyr: 4.267 ± 0.349
0.0ThrXaa: 0.0 ± 0.0
Val
1.422ValAla: 1.422 ± 0.812
1.422ValCys: 1.422 ± 0.812
4.267ValAsp: 4.267 ± 0.349
2.845ValGlu: 2.845 ± 0.463
2.845ValPhe: 2.845 ± 2.549
2.845ValGly: 2.845 ± 1.623
2.845ValHis: 2.845 ± 0.463
4.267ValIle: 4.267 ± 0.349
5.69ValLys: 5.69 ± 3.012
4.267ValLeu: 4.267 ± 0.349
2.845ValMet: 2.845 ± 0.463
5.69ValAsn: 5.69 ± 0.926
7.112ValPro: 7.112 ± 1.972
2.845ValGln: 2.845 ± 0.463
4.267ValArg: 4.267 ± 2.435
5.69ValSer: 5.69 ± 3.247
0.0ValThr: 0.0 ± 0.0
7.112ValVal: 7.112 ± 2.201
0.0ValTrp: 0.0 ± 0.0
5.69ValTyr: 5.69 ± 0.926
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.422TrpAsp: 1.422 ± 1.275
0.0TrpGlu: 0.0 ± 0.0
1.422TrpPhe: 1.422 ± 1.275
1.422TrpGly: 1.422 ± 1.275
0.0TrpHis: 0.0 ± 0.0
1.422TrpIle: 1.422 ± 1.275
2.845TrpLys: 2.845 ± 1.623
1.422TrpLeu: 1.422 ± 1.275
0.0TrpMet: 0.0 ± 0.0
1.422TrpAsn: 1.422 ± 0.812
0.0TrpPro: 0.0 ± 0.0
1.422TrpGln: 1.422 ± 1.275
1.422TrpArg: 1.422 ± 0.812
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.422TrpVal: 1.422 ± 0.812
1.422TrpTrp: 1.422 ± 1.275
1.422TrpTyr: 1.422 ± 1.275
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
1.422TyrCys: 1.422 ± 1.275
1.422TyrAsp: 1.422 ± 0.812
4.267TyrGlu: 4.267 ± 3.824
2.845TyrPhe: 2.845 ± 2.549
0.0TyrGly: 0.0 ± 0.0
2.845TyrHis: 2.845 ± 0.463
4.267TyrIle: 4.267 ± 0.349
1.422TyrLys: 1.422 ± 0.812
2.845TyrLeu: 2.845 ± 0.463
0.0TyrMet: 0.0 ± 0.0
5.69TyrAsn: 5.69 ± 1.16
4.267TyrPro: 4.267 ± 1.738
0.0TyrGln: 0.0 ± 0.0
4.267TyrArg: 4.267 ± 0.349
2.845TyrSer: 2.845 ± 0.463
1.422TyrThr: 1.422 ± 0.812
2.845TyrVal: 2.845 ± 0.463
4.267TyrTrp: 4.267 ± 0.349
1.422TyrTyr: 1.422 ± 0.812
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (704 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
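Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resampled set, and the spread of those estimates is taken as the error. This is a minimal sketch under stated assumptions, not the database's actual procedure: the helper names, the toy sequences, the fixed seed, and the use of the population standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over n_rounds protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not the proteome described on this page).
proteins = ["MKKSA", "MKT"]
err = bootstrap_error(proteins, "KK")
print(pair_permille(proteins, "KK"), "+/-", err)
```

With only two proteins, as here, each resample can take just a few distinct compositions, so the estimated error is coarse.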

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski