Amino acid dipeptide frequency for Avon-Heathcote Estuary associated circular virus 27

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
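The calculation described above can be illustrated with a minimal Python sketch. It is not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping residue pairs within each protein (never across protein boundaries) are all assumptions made for illustration.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: overlapping pairs are counted inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            a, b = seq[i], seq[i + 1]
            # Anything outside the 20 standard residues is pooled as X (Xaa).
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Uniform expectation for the 400 standard dipeptides: 2.5 per mille (0.25%).
EXPECTED_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2

# Toy sequences, not the real proteome.
freqs = dipeptide_per_mille(["MKKLA", "AAKL"])
overrepresented = {dp for dp, f in freqs.items() if f > EXPECTED_PER_MILLE}
```

Multiplying any value in `freqs` by 10^-3 gives the corresponding fraction; comparing it with `EXPECTED_PER_MILLE` mirrors the red/blue marking in the table.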

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.702AlaAla: 4.702 ± 3.246
0.0AlaCys: 0.0 ± 0.0
3.135AlaAsp: 3.135 ± 0.088
3.135AlaGlu: 3.135 ± 2.164
4.702AlaPhe: 4.702 ± 3.51
7.837AlaGly: 7.837 ± 0.905
1.567AlaHis: 1.567 ± 1.17
4.702AlaIle: 4.702 ± 1.258
4.702AlaLys: 4.702 ± 1.258
6.27AlaLeu: 6.27 ± 0.176
0.0AlaMet: 0.0 ± 0.0
1.567AlaAsn: 1.567 ± 1.17
4.702AlaPro: 4.702 ± 0.994
3.135AlaGln: 3.135 ± 0.088
4.702AlaArg: 4.702 ± 0.994
3.135AlaSer: 3.135 ± 2.164
1.567AlaThr: 1.567 ± 1.17
4.702AlaVal: 4.702 ± 1.258
0.0AlaTrp: 0.0 ± 0.0
1.567AlaTyr: 1.567 ± 1.17
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
3.135CysPhe: 3.135 ± 2.34
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
1.567CysMet: 1.567 ± 1.17
1.567CysAsn: 1.567 ± 1.082
1.567CysPro: 1.567 ± 1.17
0.0CysGln: 0.0 ± 0.0
1.567CysArg: 1.567 ± 1.082
0.0CysSer: 0.0 ± 0.0
1.567CysThr: 1.567 ± 1.17
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.702AspAla: 4.702 ± 0.994
1.567AspCys: 1.567 ± 1.082
4.702AspAsp: 4.702 ± 0.994
1.567AspGlu: 1.567 ± 1.17
6.27AspPhe: 6.27 ± 2.076
3.135AspGly: 3.135 ± 2.164
1.567AspHis: 1.567 ± 1.17
4.702AspIle: 4.702 ± 3.51
1.567AspLys: 1.567 ± 1.17
4.702AspLeu: 4.702 ± 3.51
1.567AspMet: 1.567 ± 1.17
1.567AspAsn: 1.567 ± 1.17
3.135AspPro: 3.135 ± 2.164
0.0AspGln: 0.0 ± 0.0
3.135AspArg: 3.135 ± 0.088
6.27AspSer: 6.27 ± 4.328
3.135AspThr: 3.135 ± 0.088
4.702AspVal: 4.702 ± 0.994
1.567AspTrp: 1.567 ± 1.17
6.27AspTyr: 6.27 ± 2.076
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
1.567GluCys: 1.567 ± 1.17
3.135GluAsp: 3.135 ± 2.34
3.135GluGlu: 3.135 ± 2.34
1.567GluPhe: 1.567 ± 1.17
1.567GluGly: 1.567 ± 1.17
1.567GluHis: 1.567 ± 1.17
1.567GluIle: 1.567 ± 1.17
3.135GluLys: 3.135 ± 2.34
3.135GluLeu: 3.135 ± 2.164
0.0GluMet: 0.0 ± 0.0
3.135GluAsn: 3.135 ± 0.088
3.135GluPro: 3.135 ± 2.34
1.567GluGln: 1.567 ± 1.17
1.567GluArg: 1.567 ± 1.17
9.404GluSer: 9.404 ± 0.265
3.135GluThr: 3.135 ± 0.088
6.27GluVal: 6.27 ± 2.429
1.567GluTrp: 1.567 ± 1.17
3.135GluTyr: 3.135 ± 2.164
0.0GluXaa: 0.0 ± 0.0
Phe
6.27PheAla: 6.27 ± 2.429
0.0PheCys: 0.0 ± 0.0
7.837PheAsp: 7.837 ± 3.599
4.702PheGlu: 4.702 ± 3.51
0.0PhePhe: 0.0 ± 0.0
0.0PheGly: 0.0 ± 0.0
3.135PheHis: 3.135 ± 0.088
3.135PheIle: 3.135 ± 0.088
6.27PheLys: 6.27 ± 2.429
0.0PheLeu: 0.0 ± 0.0
3.135PheMet: 3.135 ± 1.214
6.27PheAsn: 6.27 ± 2.076
0.0PhePro: 0.0 ± 0.0
0.0PheGln: 0.0 ± 0.0
0.0PheArg: 0.0 ± 0.0
3.135PheSer: 3.135 ± 0.088
6.27PheThr: 6.27 ± 0.176
6.27PheVal: 6.27 ± 0.176
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
1.567GlyAla: 1.567 ± 1.17
0.0GlyCys: 0.0 ± 0.0
4.702GlyAsp: 4.702 ± 0.994
1.567GlyGlu: 1.567 ± 1.17
3.135GlyPhe: 3.135 ± 2.164
1.567GlyGly: 1.567 ± 1.082
0.0GlyHis: 0.0 ± 0.0
0.0GlyIle: 0.0 ± 0.0
6.27GlyLys: 6.27 ± 4.681
4.702GlyLeu: 4.702 ± 0.994
0.0GlyMet: 0.0 ± 0.0
6.27GlyAsn: 6.27 ± 0.176
1.567GlyPro: 1.567 ± 1.082
3.135GlyGln: 3.135 ± 0.088
9.404GlyArg: 9.404 ± 2.517
4.702GlySer: 4.702 ± 3.246
1.567GlyThr: 1.567 ± 1.082
1.567GlyVal: 1.567 ± 1.17
0.0GlyTrp: 0.0 ± 0.0
3.135GlyTyr: 3.135 ± 0.088
0.0GlyXaa: 0.0 ± 0.0
His
1.567HisAla: 1.567 ± 1.17
0.0HisCys: 0.0 ± 0.0
4.702HisAsp: 4.702 ± 0.994
1.567HisGlu: 1.567 ± 1.17
1.567HisPhe: 1.567 ± 1.17
3.135HisGly: 3.135 ± 2.34
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
1.567HisLys: 1.567 ± 1.17
1.567HisLeu: 1.567 ± 1.17
1.567HisMet: 1.567 ± 1.17
0.0HisAsn: 0.0 ± 0.0
3.135HisPro: 3.135 ± 2.164
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.567HisSer: 1.567 ± 1.082
0.0HisThr: 0.0 ± 0.0
3.135HisVal: 3.135 ± 0.088
0.0HisTrp: 0.0 ± 0.0
1.567HisTyr: 1.567 ± 1.082
0.0HisXaa: 0.0 ± 0.0
Ile
10.972IleAla: 10.972 ± 0.817
0.0IleCys: 0.0 ± 0.0
1.567IleAsp: 1.567 ± 1.082
3.135IleGlu: 3.135 ± 2.34
6.27IlePhe: 6.27 ± 0.176
1.567IleGly: 1.567 ± 1.082
1.567IleHis: 1.567 ± 1.17
6.27IleIle: 6.27 ± 2.076
4.702IleLys: 4.702 ± 1.258
0.0IleLeu: 0.0 ± 0.0
1.567IleMet: 1.567 ± 1.082
1.567IleAsn: 1.567 ± 1.082
3.135IlePro: 3.135 ± 2.164
4.702IleGln: 4.702 ± 0.994
1.567IleArg: 1.567 ± 1.17
3.135IleSer: 3.135 ± 0.088
0.0IleThr: 0.0 ± 0.0
1.567IleVal: 1.567 ± 1.082
0.0IleTrp: 0.0 ± 0.0
1.567IleTyr: 1.567 ± 1.082
0.0IleXaa: 0.0 ± 0.0
Lys
4.702LysAla: 4.702 ± 1.258
0.0LysCys: 0.0 ± 0.0
1.567LysAsp: 1.567 ± 1.082
0.0LysGlu: 0.0 ± 0.0
0.0LysPhe: 0.0 ± 0.0
4.702LysGly: 4.702 ± 3.51
1.567LysHis: 1.567 ± 1.082
4.702LysIle: 4.702 ± 1.258
4.702LysLys: 4.702 ± 3.51
6.27LysLeu: 6.27 ± 2.429
1.567LysMet: 1.567 ± 1.17
3.135LysAsn: 3.135 ± 0.088
1.567LysPro: 1.567 ± 1.17
3.135LysGln: 3.135 ± 2.34
4.702LysArg: 4.702 ± 3.246
4.702LysSer: 4.702 ± 3.51
6.27LysThr: 6.27 ± 0.176
3.135LysVal: 3.135 ± 2.34
0.0LysTrp: 0.0 ± 0.0
6.27LysTyr: 6.27 ± 2.429
0.0LysXaa: 0.0 ± 0.0
Leu
3.135LeuAla: 3.135 ± 2.164
0.0LeuCys: 0.0 ± 0.0
1.567LeuAsp: 1.567 ± 1.17
4.702LeuGlu: 4.702 ± 1.258
3.135LeuPhe: 3.135 ± 2.164
1.567LeuGly: 1.567 ± 1.17
1.567LeuHis: 1.567 ± 1.082
3.135LeuIle: 3.135 ± 2.164
3.135LeuLys: 3.135 ± 2.164
4.702LeuLeu: 4.702 ± 0.994
1.567LeuMet: 1.567 ± 1.17
3.135LeuAsn: 3.135 ± 2.164
1.567LeuPro: 1.567 ± 1.082
3.135LeuGln: 3.135 ± 0.088
6.27LeuArg: 6.27 ± 0.176
4.702LeuSer: 4.702 ± 3.51
1.567LeuThr: 1.567 ± 1.082
3.135LeuVal: 3.135 ± 2.34
0.0LeuTrp: 0.0 ± 0.0
3.135LeuTyr: 3.135 ± 0.088
0.0LeuXaa: 0.0 ± 0.0
Met
1.567MetAla: 1.567 ± 1.17
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
3.135MetPhe: 3.135 ± 2.34
1.567MetGly: 1.567 ± 1.082
1.567MetHis: 1.567 ± 1.17
0.0MetIle: 0.0 ± 0.0
3.135MetLys: 3.135 ± 0.088
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
4.702MetPro: 4.702 ± 3.51
0.0MetGln: 0.0 ± 0.0
3.135MetArg: 3.135 ± 2.164
0.0MetSer: 0.0 ± 0.0
0.0MetThr: 0.0 ± 0.0
3.135MetVal: 3.135 ± 2.34
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.567AsnAla: 1.567 ± 1.17
0.0AsnCys: 0.0 ± 0.0
6.27AsnAsp: 6.27 ± 2.429
3.135AsnGlu: 3.135 ± 2.164
4.702AsnPhe: 4.702 ± 1.258
1.567AsnGly: 1.567 ± 1.17
3.135AsnHis: 3.135 ± 0.088
1.567AsnIle: 1.567 ± 1.082
1.567AsnLys: 1.567 ± 1.082
1.567AsnLeu: 1.567 ± 1.082
1.567AsnMet: 1.567 ± 1.17
1.567AsnAsn: 1.567 ± 1.082
0.0AsnPro: 0.0 ± 0.0
7.837AsnGln: 7.837 ± 3.157
3.135AsnArg: 3.135 ± 2.164
1.567AsnSer: 1.567 ± 1.082
3.135AsnThr: 3.135 ± 0.088
4.702AsnVal: 4.702 ± 0.994
1.567AsnTrp: 1.567 ± 1.082
3.135AsnTyr: 3.135 ± 2.164
0.0AsnXaa: 0.0 ± 0.0
Pro
3.135ProAla: 3.135 ± 0.088
0.0ProCys: 0.0 ± 0.0
3.135ProAsp: 3.135 ± 2.34
1.567ProGlu: 1.567 ± 1.17
6.27ProPhe: 6.27 ± 0.176
0.0ProGly: 0.0 ± 0.0
1.567ProHis: 1.567 ± 1.17
1.567ProIle: 1.567 ± 1.082
1.567ProLys: 1.567 ± 1.082
3.135ProLeu: 3.135 ± 2.164
0.0ProMet: 0.0 ± 0.0
3.135ProAsn: 3.135 ± 0.088
0.0ProPro: 0.0 ± 0.0
1.567ProGln: 1.567 ± 1.082
1.567ProArg: 1.567 ± 1.082
6.27ProSer: 6.27 ± 2.076
4.702ProThr: 4.702 ± 0.994
3.135ProVal: 3.135 ± 2.164
0.0ProTrp: 0.0 ± 0.0
3.135ProTyr: 3.135 ± 2.164
0.0ProXaa: 0.0 ± 0.0
Gln
3.135GlnAla: 3.135 ± 2.34
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
6.27GlnGlu: 6.27 ± 2.429
1.567GlnPhe: 1.567 ± 1.17
3.135GlnGly: 3.135 ± 0.088
1.567GlnHis: 1.567 ± 1.17
1.567GlnIle: 1.567 ± 1.082
0.0GlnLys: 0.0 ± 0.0
3.135GlnLeu: 3.135 ± 0.088
0.0GlnMet: 0.0 ± 0.0
3.135GlnAsn: 3.135 ± 2.164
1.567GlnPro: 1.567 ± 1.082
0.0GlnGln: 0.0 ± 0.0
4.702GlnArg: 4.702 ± 0.994
3.135GlnSer: 3.135 ± 2.164
4.702GlnThr: 4.702 ± 1.258
1.567GlnVal: 1.567 ± 1.082
0.0GlnTrp: 0.0 ± 0.0
4.702GlnTyr: 4.702 ± 3.246
0.0GlnXaa: 0.0 ± 0.0
Arg
1.567ArgAla: 1.567 ± 1.082
1.567ArgCys: 1.567 ± 1.17
6.27ArgAsp: 6.27 ± 2.076
6.27ArgGlu: 6.27 ± 0.176
0.0ArgPhe: 0.0 ± 0.0
1.567ArgGly: 1.567 ± 1.17
0.0ArgHis: 0.0 ± 0.0
3.135ArgIle: 3.135 ± 0.088
1.567ArgLys: 1.567 ± 1.17
1.567ArgLeu: 1.567 ± 1.082
1.567ArgMet: 1.567 ± 1.17
1.567ArgAsn: 1.567 ± 1.17
6.27ArgPro: 6.27 ± 4.328
1.567ArgGln: 1.567 ± 1.17
4.702ArgArg: 4.702 ± 0.994
9.404ArgSer: 9.404 ± 1.987
3.135ArgThr: 3.135 ± 0.088
4.702ArgVal: 4.702 ± 0.994
1.567ArgTrp: 1.567 ± 1.17
3.135ArgTyr: 3.135 ± 2.164
0.0ArgXaa: 0.0 ± 0.0
Ser
7.837SerAla: 7.837 ± 0.905
1.567SerCys: 1.567 ± 1.082
3.135SerAsp: 3.135 ± 2.164
1.567SerGlu: 1.567 ± 1.17
3.135SerPhe: 3.135 ± 2.34
12.539SerGly: 12.539 ± 4.151
0.0SerHis: 0.0 ± 0.0
4.702SerIle: 4.702 ± 3.246
6.27SerLys: 6.27 ± 2.076
3.135SerLeu: 3.135 ± 0.088
1.567SerMet: 1.567 ± 1.082
3.135SerAsn: 3.135 ± 2.164
0.0SerPro: 0.0 ± 0.0
6.27SerGln: 6.27 ± 4.328
3.135SerArg: 3.135 ± 0.088
6.27SerSer: 6.27 ± 4.328
4.702SerThr: 4.702 ± 1.258
6.27SerVal: 6.27 ± 2.076
1.567SerTrp: 1.567 ± 1.17
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
1.567ThrAla: 1.567 ± 1.17
1.567ThrCys: 1.567 ± 1.17
4.702ThrAsp: 4.702 ± 3.246
3.135ThrGlu: 3.135 ± 0.088
1.567ThrPhe: 1.567 ± 1.082
6.27ThrGly: 6.27 ± 0.176
3.135ThrHis: 3.135 ± 0.088
4.702ThrIle: 4.702 ± 1.258
6.27ThrLys: 6.27 ± 4.681
4.702ThrLeu: 4.702 ± 3.246
0.0ThrMet: 0.0 ± 0.0
3.135ThrAsn: 3.135 ± 2.164
3.135ThrPro: 3.135 ± 2.164
1.567ThrGln: 1.567 ± 1.17
3.135ThrArg: 3.135 ± 2.34
1.567ThrSer: 1.567 ± 1.082
7.837ThrThr: 7.837 ± 3.157
1.567ThrVal: 1.567 ± 1.17
0.0ThrTrp: 0.0 ± 0.0
1.567ThrTyr: 1.567 ± 1.082
0.0ThrXaa: 0.0 ± 0.0
Val
6.27ValAla: 6.27 ± 0.176
1.567ValCys: 1.567 ± 1.17
6.27ValAsp: 6.27 ± 2.076
3.135ValGlu: 3.135 ± 2.34
4.702ValPhe: 4.702 ± 3.51
1.567ValGly: 1.567 ± 1.082
1.567ValHis: 1.567 ± 1.082
4.702ValIle: 4.702 ± 0.994
1.567ValLys: 1.567 ± 1.17
3.135ValLeu: 3.135 ± 2.164
0.0ValMet: 0.0 ± 0.0
3.135ValAsn: 3.135 ± 2.34
3.135ValPro: 3.135 ± 2.34
3.135ValGln: 3.135 ± 0.088
3.135ValArg: 3.135 ± 0.088
3.135ValSer: 3.135 ± 2.164
7.837ValThr: 7.837 ± 3.157
6.27ValVal: 6.27 ± 0.176
0.0ValTrp: 0.0 ± 0.0
4.702ValTyr: 4.702 ± 3.51
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.567TrpAsp: 1.567 ± 1.17
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
3.135TrpLys: 3.135 ± 2.34
1.567TrpLeu: 1.567 ± 1.17
1.567TrpMet: 1.567 ± 1.082
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.567TrpGln: 1.567 ± 1.17
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.567TrpTyr: 1.567 ± 1.17
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.567TyrAla: 1.567 ± 1.082
1.567TyrCys: 1.567 ± 1.17
1.567TyrAsp: 1.567 ± 1.082
4.702TyrGlu: 4.702 ± 1.258
3.135TyrPhe: 3.135 ± 0.088
1.567TyrGly: 1.567 ± 1.17
1.567TyrHis: 1.567 ± 1.082
4.702TyrIle: 4.702 ± 3.246
1.567TyrLys: 1.567 ± 1.082
1.567TyrLeu: 1.567 ± 1.17
1.567TyrMet: 1.567 ± 1.17
6.27TyrAsn: 6.27 ± 2.076
3.135TyrPro: 3.135 ± 2.164
1.567TyrGln: 1.567 ± 1.082
1.567TyrArg: 1.567 ± 1.082
4.702TyrSer: 4.702 ± 3.246
0.0TyrThr: 0.0 ± 0.0
3.135TyrVal: 3.135 ± 0.088
3.135TyrTrp: 3.135 ± 2.34
1.567TyrTyr: 1.567 ± 1.082
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (639 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
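As a rough illustration of protein-level bootstrapping, the sketch below resamples whole proteins with replacement 100 times and reports the spread of one dipeptide's frequency. It is an assumption-laden example, not the database's implementation: the helper names `per_mille` and `bootstrap_error`, the use of the population standard deviation as the error, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins):
    """Dipeptide frequencies in per mille, counting overlapping pairs per protein."""
    counts = Counter(
        seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()} if total else {}


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level resamples.

    Assumption: the error is the standard deviation across the resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        resampled = [rng.choice(proteins) for _ in proteins]
        samples.append(per_mille(resampled).get(dipeptide, 0.0))
    return statistics.pstdev(samples)


# Toy sequences, not the real proteome.
err = bootstrap_error(["MKKLA", "AAKL"], "KL")
```

With only two proteins, as in this proteome, each resample can contain only a few distinct combinations, which may explain why the same error values recur so often in the table.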

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski