Amino acid dipepetide frequency for Avon-Heathcote Estuary associated circular virus 8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.629AlaAla: 1.629 ± 1.084
0.0AlaCys: 0.0 ± 0.0
1.629AlaAsp: 1.629 ± 1.084
3.257AlaGlu: 3.257 ± 0.18
1.629AlaPhe: 1.629 ± 1.084
3.257AlaGly: 3.257 ± 2.168
0.0AlaHis: 0.0 ± 0.0
1.629AlaIle: 1.629 ± 1.084
6.515AlaLys: 6.515 ± 1.988
3.257AlaLeu: 3.257 ± 0.18
3.257AlaMet: 3.257 ± 2.168
6.515AlaAsn: 6.515 ± 4.336
0.0AlaPro: 0.0 ± 0.0
0.0AlaGln: 0.0 ± 0.0
4.886AlaArg: 4.886 ± 0.904
1.629AlaSer: 1.629 ± 1.264
4.886AlaThr: 4.886 ± 3.791
6.515AlaVal: 6.515 ± 0.359
0.0AlaTrp: 0.0 ± 0.0
4.886AlaTyr: 4.886 ± 0.904
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.629CysTyr: 1.629 ± 1.264
0.0CysXaa: 0.0 ± 0.0
Asp
3.257AspAla: 3.257 ± 0.18
0.0AspCys: 0.0 ± 0.0
1.629AspAsp: 1.629 ± 1.264
3.257AspGlu: 3.257 ± 2.528
6.515AspPhe: 6.515 ± 0.359
3.257AspGly: 3.257 ± 2.168
0.0AspHis: 0.0 ± 0.0
8.143AspIle: 8.143 ± 1.623
4.886AspLys: 4.886 ± 3.791
6.515AspLeu: 6.515 ± 0.359
3.257AspMet: 3.257 ± 2.528
3.257AspAsn: 3.257 ± 2.168
3.257AspPro: 3.257 ± 0.18
1.629AspGln: 1.629 ± 1.084
1.629AspArg: 1.629 ± 1.264
3.257AspSer: 3.257 ± 0.18
1.629AspThr: 1.629 ± 1.264
1.629AspVal: 1.629 ± 1.084
0.0AspTrp: 0.0 ± 0.0
1.629AspTyr: 1.629 ± 1.264
0.0AspXaa: 0.0 ± 0.0
Glu
1.629GluAla: 1.629 ± 1.264
0.0GluCys: 0.0 ± 0.0
3.257GluAsp: 3.257 ± 2.528
3.257GluGlu: 3.257 ± 2.528
4.886GluPhe: 4.886 ± 1.443
4.886GluGly: 4.886 ± 1.443
3.257GluHis: 3.257 ± 0.18
1.629GluIle: 1.629 ± 1.264
1.629GluLys: 1.629 ± 1.264
1.629GluLeu: 1.629 ± 1.264
1.629GluMet: 1.629 ± 1.264
1.629GluAsn: 1.629 ± 1.084
3.257GluPro: 3.257 ± 0.18
0.0GluGln: 0.0 ± 0.0
1.629GluArg: 1.629 ± 1.264
4.886GluSer: 4.886 ± 0.904
4.886GluThr: 4.886 ± 1.443
8.143GluVal: 8.143 ± 1.623
1.629GluTrp: 1.629 ± 1.264
1.629GluTyr: 1.629 ± 1.264
0.0GluXaa: 0.0 ± 0.0
Phe
3.257PheAla: 3.257 ± 0.18
0.0PheCys: 0.0 ± 0.0
4.886PheAsp: 4.886 ± 0.904
1.629PheGlu: 1.629 ± 1.084
0.0PhePhe: 0.0 ± 0.0
0.0PheGly: 0.0 ± 0.0
3.257PheHis: 3.257 ± 2.168
1.629PheIle: 1.629 ± 1.084
9.772PheLys: 9.772 ± 0.539
1.629PheLeu: 1.629 ± 1.084
1.629PheMet: 1.629 ± 1.722
1.629PheAsn: 1.629 ± 1.264
0.0PhePro: 0.0 ± 0.0
3.257PheGln: 3.257 ± 2.528
3.257PheArg: 3.257 ± 0.18
3.257PheSer: 3.257 ± 2.528
1.629PheThr: 1.629 ± 1.264
3.257PheVal: 3.257 ± 0.18
0.0PheTrp: 0.0 ± 0.0
3.257PheTyr: 3.257 ± 0.18
0.0PheXaa: 0.0 ± 0.0
Gly
0.0GlyAla: 0.0 ± 0.0
0.0GlyCys: 0.0 ± 0.0
8.143GlyAsp: 8.143 ± 6.319
3.257GlyGlu: 3.257 ± 0.18
4.886GlyPhe: 4.886 ± 3.252
6.515GlyGly: 6.515 ± 4.336
0.0GlyHis: 0.0 ± 0.0
4.886GlyIle: 4.886 ± 0.904
3.257GlyLys: 3.257 ± 0.18
6.515GlyLeu: 6.515 ± 0.359
1.629GlyMet: 1.629 ± 1.264
3.257GlyAsn: 3.257 ± 2.168
3.257GlyPro: 3.257 ± 0.18
3.257GlyGln: 3.257 ± 2.168
8.143GlyArg: 8.143 ± 0.725
6.515GlySer: 6.515 ± 4.336
4.886GlyThr: 4.886 ± 0.904
4.886GlyVal: 4.886 ± 0.904
0.0GlyTrp: 0.0 ± 0.0
1.629GlyTyr: 1.629 ± 1.264
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.629HisAsp: 1.629 ± 1.084
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
1.629HisGly: 1.629 ± 1.084
0.0HisHis: 0.0 ± 0.0
3.257HisIle: 3.257 ± 0.18
0.0HisLys: 0.0 ± 0.0
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
3.257HisPro: 3.257 ± 2.168
0.0HisGln: 0.0 ± 0.0
1.629HisArg: 1.629 ± 1.084
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
3.257HisTyr: 3.257 ± 0.18
0.0HisXaa: 0.0 ± 0.0
Ile
4.886IleAla: 4.886 ± 0.904
0.0IleCys: 0.0 ± 0.0
3.257IleAsp: 3.257 ± 0.18
1.629IleGlu: 1.629 ± 1.084
3.257IlePhe: 3.257 ± 2.528
4.886IleGly: 4.886 ± 0.904
1.629IleHis: 1.629 ± 1.084
3.257IleIle: 3.257 ± 0.18
6.515IleLys: 6.515 ± 5.055
6.515IleLeu: 6.515 ± 0.359
0.0IleMet: 0.0 ± 0.0
3.257IleAsn: 3.257 ± 2.168
3.257IlePro: 3.257 ± 0.18
3.257IleGln: 3.257 ± 2.168
1.629IleArg: 1.629 ± 1.264
1.629IleSer: 1.629 ± 1.084
1.629IleThr: 1.629 ± 1.084
3.257IleVal: 3.257 ± 0.18
0.0IleTrp: 0.0 ± 0.0
1.629IleTyr: 1.629 ± 1.264
0.0IleXaa: 0.0 ± 0.0
Lys
8.143LysAla: 8.143 ± 3.072
0.0LysCys: 0.0 ± 0.0
6.515LysAsp: 6.515 ± 2.707
3.257LysGlu: 3.257 ± 2.528
0.0LysPhe: 0.0 ± 0.0
11.401LysGly: 11.401 ± 4.151
3.257LysHis: 3.257 ± 0.18
4.886LysIle: 4.886 ± 3.791
6.515LysLys: 6.515 ± 0.359
6.515LysLeu: 6.515 ± 2.707
1.629LysMet: 1.629 ± 0.847
4.886LysAsn: 4.886 ± 0.904
3.257LysPro: 3.257 ± 0.18
3.257LysGln: 3.257 ± 2.168
6.515LysArg: 6.515 ± 1.988
4.886LysSer: 4.886 ± 3.791
3.257LysThr: 3.257 ± 0.18
1.629LysVal: 1.629 ± 1.264
1.629LysTrp: 1.629 ± 1.264
4.886LysTyr: 4.886 ± 3.252
0.0LysXaa: 0.0 ± 0.0
Leu
6.515LeuAla: 6.515 ± 0.359
0.0LeuCys: 0.0 ± 0.0
3.257LeuAsp: 3.257 ± 2.168
9.772LeuGlu: 9.772 ± 5.235
8.143LeuPhe: 8.143 ± 0.725
4.886LeuGly: 4.886 ± 1.443
0.0LeuHis: 0.0 ± 0.0
1.629LeuIle: 1.629 ± 1.264
9.772LeuLys: 9.772 ± 5.235
4.886LeuLeu: 4.886 ± 1.443
3.257LeuMet: 3.257 ± 2.528
3.257LeuAsn: 3.257 ± 2.168
3.257LeuPro: 3.257 ± 0.18
0.0LeuGln: 0.0 ± 0.0
4.886LeuArg: 4.886 ± 1.443
6.515LeuSer: 6.515 ± 1.988
3.257LeuThr: 3.257 ± 2.168
1.629LeuVal: 1.629 ± 1.084
0.0LeuTrp: 0.0 ± 0.0
3.257LeuTyr: 3.257 ± 0.18
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
1.629MetAsp: 1.629 ± 1.264
1.629MetGlu: 1.629 ± 1.264
0.0MetPhe: 0.0 ± 0.0
3.257MetGly: 3.257 ± 2.528
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
3.257MetLys: 3.257 ± 2.528
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
6.515MetPro: 6.515 ± 1.988
1.629MetGln: 1.629 ± 1.084
1.629MetArg: 1.629 ± 1.264
3.257MetSer: 3.257 ± 2.528
1.629MetThr: 1.629 ± 1.084
3.257MetVal: 3.257 ± 2.168
0.0MetTrp: 0.0 ± 0.0
1.629MetTyr: 1.629 ± 1.264
0.0MetXaa: 0.0 ± 0.0
Asn
6.515AsnAla: 6.515 ± 0.359
0.0AsnCys: 0.0 ± 0.0
3.257AsnAsp: 3.257 ± 2.168
4.886AsnGlu: 4.886 ± 0.904
1.629AsnPhe: 1.629 ± 1.084
1.629AsnGly: 1.629 ± 1.264
1.629AsnHis: 1.629 ± 1.084
1.629AsnIle: 1.629 ± 1.084
3.257AsnLys: 3.257 ± 2.168
1.629AsnLeu: 1.629 ± 1.084
0.0AsnMet: 0.0 ± 0.0
1.629AsnAsn: 1.629 ± 1.084
3.257AsnPro: 3.257 ± 2.168
4.886AsnGln: 4.886 ± 3.252
1.629AsnArg: 1.629 ± 1.264
4.886AsnSer: 4.886 ± 3.252
3.257AsnThr: 3.257 ± 0.18
1.629AsnVal: 1.629 ± 1.264
0.0AsnTrp: 0.0 ± 0.0
1.629AsnTyr: 1.629 ± 1.264
0.0AsnXaa: 0.0 ± 0.0
Pro
3.257ProAla: 3.257 ± 2.168
1.629ProCys: 1.629 ± 1.264
0.0ProAsp: 0.0 ± 0.0
1.629ProGlu: 1.629 ± 1.264
1.629ProPhe: 1.629 ± 1.084
0.0ProGly: 0.0 ± 0.0
0.0ProHis: 0.0 ± 0.0
4.886ProIle: 4.886 ± 1.443
4.886ProLys: 4.886 ± 3.252
3.257ProLeu: 3.257 ± 2.168
0.0ProMet: 0.0 ± 0.0
1.629ProAsn: 1.629 ± 1.264
4.886ProPro: 4.886 ± 1.443
3.257ProGln: 3.257 ± 2.168
6.515ProArg: 6.515 ± 0.359
4.886ProSer: 4.886 ± 3.252
6.515ProThr: 6.515 ± 1.988
0.0ProVal: 0.0 ± 0.0
1.629ProTrp: 1.629 ± 1.264
3.257ProTyr: 3.257 ± 2.528
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
0.0GlnCys: 0.0 ± 0.0
1.629GlnAsp: 1.629 ± 1.084
3.257GlnGlu: 3.257 ± 0.18
0.0GlnPhe: 0.0 ± 0.0
6.515GlnGly: 6.515 ± 1.988
0.0GlnHis: 0.0 ± 0.0
0.0GlnIle: 0.0 ± 0.0
1.629GlnLys: 1.629 ± 1.084
9.772GlnLeu: 9.772 ± 0.539
1.629GlnMet: 1.629 ± 1.084
1.629GlnAsn: 1.629 ± 1.084
0.0GlnPro: 0.0 ± 0.0
0.0GlnGln: 0.0 ± 0.0
3.257GlnArg: 3.257 ± 0.18
1.629GlnSer: 1.629 ± 1.084
6.515GlnThr: 6.515 ± 4.336
1.629GlnVal: 1.629 ± 1.084
1.629GlnTrp: 1.629 ± 1.264
1.629GlnTyr: 1.629 ± 1.084
0.0GlnXaa: 0.0 ± 0.0
Arg
1.629ArgAla: 1.629 ± 1.084
0.0ArgCys: 0.0 ± 0.0
0.0ArgAsp: 0.0 ± 0.0
0.0ArgGlu: 0.0 ± 0.0
1.629ArgPhe: 1.629 ± 1.264
4.886ArgGly: 4.886 ± 1.443
0.0ArgHis: 0.0 ± 0.0
3.257ArgIle: 3.257 ± 2.168
6.515ArgLys: 6.515 ± 2.707
6.515ArgLeu: 6.515 ± 0.359
4.886ArgMet: 4.886 ± 0.904
3.257ArgAsn: 3.257 ± 0.18
8.143ArgPro: 8.143 ± 1.623
1.629ArgGln: 1.629 ± 1.084
1.629ArgArg: 1.629 ± 1.264
1.629ArgSer: 1.629 ± 1.084
3.257ArgThr: 3.257 ± 0.18
6.515ArgVal: 6.515 ± 1.988
1.629ArgTrp: 1.629 ± 1.264
4.886ArgTyr: 4.886 ± 1.443
0.0ArgXaa: 0.0 ± 0.0
Ser
0.0SerAla: 0.0 ± 0.0
0.0SerCys: 0.0 ± 0.0
4.886SerAsp: 4.886 ± 0.904
3.257SerGlu: 3.257 ± 2.528
0.0SerPhe: 0.0 ± 0.0
11.401SerGly: 11.401 ± 5.241
1.629SerHis: 1.629 ± 1.084
1.629SerIle: 1.629 ± 1.084
9.772SerLys: 9.772 ± 1.809
8.143SerLeu: 8.143 ± 3.971
1.629SerMet: 1.629 ± 1.264
4.886SerAsn: 4.886 ± 3.252
0.0SerPro: 0.0 ± 0.0
3.257SerGln: 3.257 ± 0.18
1.629SerArg: 1.629 ± 1.084
1.629SerSer: 1.629 ± 1.084
6.515SerThr: 6.515 ± 1.988
3.257SerVal: 3.257 ± 2.168
0.0SerTrp: 0.0 ± 0.0
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.886ThrAla: 4.886 ± 3.252
0.0ThrCys: 0.0 ± 0.0
3.257ThrAsp: 3.257 ± 0.18
4.886ThrGlu: 4.886 ± 0.904
8.143ThrPhe: 8.143 ± 0.725
1.629ThrGly: 1.629 ± 1.084
0.0ThrHis: 0.0 ± 0.0
3.257ThrIle: 3.257 ± 2.168
3.257ThrLys: 3.257 ± 2.528
4.886ThrLeu: 4.886 ± 1.443
0.0ThrMet: 0.0 ± 0.0
1.629ThrAsn: 1.629 ± 1.264
4.886ThrPro: 4.886 ± 0.904
3.257ThrGln: 3.257 ± 0.18
4.886ThrArg: 4.886 ± 1.443
4.886ThrSer: 4.886 ± 0.904
4.886ThrThr: 4.886 ± 0.904
1.629ThrVal: 1.629 ± 1.084
0.0ThrTrp: 0.0 ± 0.0
1.629ThrTyr: 1.629 ± 1.084
0.0ThrXaa: 0.0 ± 0.0
Val
8.143ValAla: 8.143 ± 0.725
0.0ValCys: 0.0 ± 0.0
1.629ValAsp: 1.629 ± 1.264
1.629ValGlu: 1.629 ± 1.264
4.886ValPhe: 4.886 ± 3.791
3.257ValGly: 3.257 ± 2.168
0.0ValHis: 0.0 ± 0.0
8.143ValIle: 8.143 ± 0.725
4.886ValLys: 4.886 ± 0.904
4.886ValLeu: 4.886 ± 0.904
0.0ValMet: 0.0 ± 0.0
1.629ValAsn: 1.629 ± 1.084
3.257ValPro: 3.257 ± 2.168
3.257ValGln: 3.257 ± 0.18
4.886ValArg: 4.886 ± 3.252
1.629ValSer: 1.629 ± 1.084
0.0ValThr: 0.0 ± 0.0
0.0ValVal: 0.0 ± 0.0
3.257ValTrp: 3.257 ± 2.528
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.629TrpAsp: 1.629 ± 1.264
1.629TrpGlu: 1.629 ± 1.264
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
1.629TrpMet: 1.629 ± 1.264
1.629TrpAsn: 1.629 ± 1.264
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
1.629TrpThr: 1.629 ± 1.264
3.257TrpVal: 3.257 ± 2.528
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.629TyrAla: 1.629 ± 1.084
0.0TyrCys: 0.0 ± 0.0
6.515TyrAsp: 6.515 ± 2.707
3.257TyrGlu: 3.257 ± 2.528
1.629TyrPhe: 1.629 ± 1.084
1.629TyrGly: 1.629 ± 1.084
0.0TyrHis: 0.0 ± 0.0
1.629TyrIle: 1.629 ± 1.264
0.0TyrLys: 0.0 ± 0.0
1.629TyrLeu: 1.629 ± 1.264
1.629TyrMet: 1.629 ± 1.264
3.257TyrAsn: 3.257 ± 2.528
0.0TyrPro: 0.0 ± 0.0
6.515TyrGln: 6.515 ± 0.359
1.629TyrArg: 1.629 ± 1.264
6.515TyrSer: 6.515 ± 1.988
1.629TyrThr: 1.629 ± 1.084
3.257TyrVal: 3.257 ± 0.18
0.0TyrTrp: 0.0 ± 0.0
1.629TyrTyr: 1.629 ± 1.084
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (615 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski