Amino acid dipeptide frequency for Avon-Heathcote Estuary associated circular virus 6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
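
As a rough illustration of how such frequencies can be obtained, the Python sketch below counts overlapping residue pairs in one-letter sequences and compares each frequency with the uniform expectation of 2.5‰. It is only a sketch under stated assumptions: the function names and the toy sequences are hypothetical, pairs are assumed not to span protein boundaries, and the exact counting and colouring rules used for the table are not described on this page.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes
EXPECTED = 1000.0 / 400               # uniform expectation: 2.5 per mille (0.25%)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues is pooled as X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"
    if freq < EXPECTED:
        return "under"
    return "expected"


if __name__ == "__main__":
    # Toy sequences for illustration only, not the proteins behind the table.
    toy = ["MKRTGAAR", "MSTRRGNL"]
    freqs = dipeptide_per_mille(toy)
    for pair in ("RT", "AA", "WW"):
        print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

The returned dictionary always has 441 entries, so pairs that never occur show up with a frequency of 0.0, as in the table.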

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
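
The unit conversion can be sketched in Python as follows. The helper names are hypothetical; the example values are the ArgThr entry from the table and the 2.5‰ uniform expectation.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    print(per_mille_to_fraction(11.327))  # ArgThr as a fraction
    print(per_mille_to_percent(2.5))      # uniform expectation in percent
```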

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.091 ± 4.546
AlaCys: 0.0 ± 0.0
AlaAsp: 0.0 ± 0.0
AlaGlu: 4.854 ± 0.603
AlaPhe: 1.618 ± 1.215
AlaGly: 4.854 ± 2.728
AlaHis: 1.618 ± 1.215
AlaIle: 3.236 ± 1.818
AlaLys: 1.618 ± 0.909
AlaLeu: 3.236 ± 1.818
AlaMet: 1.618 ± 0.909
AlaAsn: 4.854 ± 0.603
AlaPro: 3.236 ± 1.818
AlaGln: 1.618 ± 1.215
AlaArg: 3.236 ± 0.306
AlaSer: 3.236 ± 1.818
AlaThr: 8.091 ± 0.297
AlaVal: 4.854 ± 0.603
AlaTrp: 0.0 ± 0.0
AlaTyr: 4.854 ± 1.521
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 1.618 ± 1.215
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 3.236 ± 2.43
CysLeu: 1.618 ± 0.909
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.618 ± 1.215
CysGln: 0.0 ± 0.0
CysArg: 1.618 ± 1.215
CysSer: 1.618 ± 1.215
CysThr: 1.618 ± 0.909
CysVal: 1.618 ± 1.215
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.618 ± 1.215
AspCys: 3.236 ± 2.43
AspAsp: 6.472 ± 4.861
AspGlu: 0.0 ± 0.0
AspPhe: 1.618 ± 1.215
AspGly: 3.236 ± 0.306
AspHis: 0.0 ± 0.0
AspIle: 8.091 ± 6.076
AspLys: 0.0 ± 0.0
AspLeu: 0.0 ± 0.0
AspMet: 6.472 ± 2.736
AspAsn: 6.472 ± 0.612
AspPro: 6.472 ± 1.513
AspGln: 1.618 ± 0.909
AspArg: 3.236 ± 0.306
AspSer: 0.0 ± 0.0
AspThr: 4.854 ± 0.603
AspVal: 1.618 ± 1.215
AspTrp: 1.618 ± 0.909
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.618 ± 0.909
GluCys: 1.618 ± 1.215
GluAsp: 4.854 ± 1.521
GluGlu: 0.0 ± 0.0
GluPhe: 3.236 ± 0.306
GluGly: 1.618 ± 1.215
GluHis: 0.0 ± 0.0
GluIle: 8.091 ± 6.076
GluLys: 0.0 ± 0.0
GluLeu: 1.618 ± 0.909
GluMet: 1.618 ± 0.909
GluAsn: 4.854 ± 0.603
GluPro: 1.618 ± 0.909
GluGln: 1.618 ± 1.215
GluArg: 3.236 ± 0.306
GluSer: 0.0 ± 0.0
GluThr: 1.618 ± 0.909
GluVal: 4.854 ± 2.728
GluTrp: 0.0 ± 0.0
GluTyr: 1.618 ± 0.909
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.0 ± 0.0
PheAsp: 1.618 ± 1.215
PheGlu: 0.0 ± 0.0
PhePhe: 0.0 ± 0.0
PheGly: 0.0 ± 0.0
PheHis: 3.236 ± 2.43
PheIle: 1.618 ± 0.909
PheLys: 3.236 ± 2.43
PheLeu: 3.236 ± 2.43
PheMet: 1.618 ± 1.215
PheAsn: 4.854 ± 2.728
PhePro: 0.0 ± 0.0
PheGln: 1.618 ± 0.909
PheArg: 1.618 ± 0.909
PheSer: 3.236 ± 1.818
PheThr: 3.236 ± 0.306
PheVal: 0.0 ± 0.0
PheTrp: 0.0 ± 0.0
PheTyr: 1.618 ± 1.215
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.472 ± 1.513
GlyCys: 0.0 ± 0.0
GlyAsp: 6.472 ± 2.736
GlyGlu: 0.0 ± 0.0
GlyPhe: 0.0 ± 0.0
GlyGly: 4.854 ± 0.603
GlyHis: 0.0 ± 0.0
GlyIle: 3.236 ± 1.818
GlyLys: 3.236 ± 2.43
GlyLeu: 4.854 ± 0.603
GlyMet: 3.236 ± 1.818
GlyAsn: 6.472 ± 3.637
GlyPro: 3.236 ± 0.306
GlyGln: 6.472 ± 1.513
GlyArg: 4.854 ± 0.603
GlySer: 6.472 ± 0.612
GlyThr: 3.236 ± 0.306
GlyVal: 1.618 ± 0.909
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.236 ± 2.43
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.618 ± 1.215
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.618 ± 1.215
HisPhe: 1.618 ± 0.909
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 1.618 ± 1.215
HisLys: 0.0 ± 0.0
HisLeu: 4.854 ± 3.645
HisMet: 0.0 ± 0.0
HisAsn: 3.236 ± 0.306
HisPro: 1.618 ± 1.215
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 1.618 ± 0.909
HisThr: 3.236 ± 0.306
HisVal: 1.618 ± 1.215
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.472 ± 0.612
IleCys: 1.618 ± 1.215
IleAsp: 4.854 ± 1.521
IleGlu: 1.618 ± 1.215
IlePhe: 3.236 ± 0.306
IleGly: 1.618 ± 0.909
IleHis: 3.236 ± 2.43
IleIle: 8.091 ± 6.076
IleLys: 3.236 ± 2.43
IleLeu: 1.618 ± 1.215
IleMet: 0.0 ± 0.0
IleAsn: 1.618 ± 0.909
IlePro: 4.854 ± 0.603
IleGln: 1.618 ± 1.215
IleArg: 6.472 ± 3.637
IleSer: 4.854 ± 0.603
IleThr: 4.854 ± 3.645
IleVal: 6.472 ± 1.513
IleTrp: 0.0 ± 0.0
IleTyr: 3.236 ± 0.306
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.0 ± 0.0
LysCys: 1.618 ± 1.215
LysAsp: 6.472 ± 2.736
LysGlu: 4.854 ± 1.521
LysPhe: 1.618 ± 1.215
LysGly: 3.236 ± 0.306
LysHis: 0.0 ± 0.0
LysIle: 3.236 ± 0.306
LysLys: 3.236 ± 0.306
LysLeu: 4.854 ± 0.603
LysMet: 1.618 ± 0.691
LysAsn: 3.236 ± 0.306
LysPro: 1.618 ± 1.215
LysGln: 4.854 ± 0.603
LysArg: 8.091 ± 0.297
LysSer: 1.618 ± 0.909
LysThr: 4.854 ± 1.521
LysVal: 4.854 ± 0.603
LysTrp: 0.0 ± 0.0
LysTyr: 4.854 ± 1.521
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.854 ± 0.603
LeuCys: 0.0 ± 0.0
LeuAsp: 6.472 ± 0.612
LeuGlu: 6.472 ± 0.612
LeuPhe: 0.0 ± 0.0
LeuGly: 1.618 ± 1.215
LeuHis: 1.618 ± 1.215
LeuIle: 1.618 ± 0.909
LeuLys: 8.091 ± 0.297
LeuLeu: 3.236 ± 2.43
LeuMet: 3.236 ± 0.983
LeuAsn: 6.472 ± 3.637
LeuPro: 3.236 ± 0.306
LeuGln: 1.618 ± 1.215
LeuArg: 1.618 ± 0.909
LeuSer: 4.854 ± 2.728
LeuThr: 3.236 ± 1.818
LeuVal: 0.0 ± 0.0
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.618 ± 0.909
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.618 ± 1.215
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 3.236 ± 1.818
MetHis: 0.0 ± 0.0
MetIle: 1.618 ± 1.215
MetLys: 1.618 ± 1.215
MetLeu: 1.618 ± 0.909
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 3.236 ± 1.818
MetGln: 0.0 ± 0.0
MetArg: 6.472 ± 2.736
MetSer: 3.236 ± 0.306
MetThr: 3.236 ± 1.818
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 4.854 ± 0.603
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.854 ± 2.728
AsnCys: 0.0 ± 0.0
AsnAsp: 1.618 ± 0.909
AsnGlu: 1.618 ± 0.909
AsnPhe: 1.618 ± 0.909
AsnGly: 8.091 ± 0.297
AsnHis: 0.0 ± 0.0
AsnIle: 4.854 ± 0.603
AsnLys: 4.854 ± 0.603
AsnLeu: 6.472 ± 3.637
AsnMet: 1.618 ± 0.909
AsnAsn: 6.472 ± 3.637
AsnPro: 3.236 ± 1.818
AsnGln: 4.854 ± 0.603
AsnArg: 3.236 ± 1.818
AsnSer: 1.618 ± 0.909
AsnThr: 1.618 ± 0.909
AsnVal: 4.854 ± 0.603
AsnTrp: 1.618 ± 1.215
AsnTyr: 1.618 ± 1.215
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.618 ± 0.909
ProCys: 1.618 ± 0.909
ProAsp: 3.236 ± 1.818
ProGlu: 0.0 ± 0.0
ProPhe: 3.236 ± 1.818
ProGly: 1.618 ± 0.909
ProHis: 6.472 ± 1.513
ProIle: 1.618 ± 0.909
ProLys: 8.091 ± 1.827
ProLeu: 1.618 ± 0.909
ProMet: 0.0 ± 0.0
ProAsn: 1.618 ± 0.909
ProPro: 3.236 ± 1.818
ProGln: 1.618 ± 0.909
ProArg: 1.618 ± 0.909
ProSer: 0.0 ± 0.0
ProThr: 6.472 ± 0.612
ProVal: 4.854 ± 2.728
ProTrp: 0.0 ± 0.0
ProTyr: 1.618 ± 1.215
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.618 ± 0.909
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.618 ± 0.909
GlnPhe: 3.236 ± 2.43
GlnGly: 8.091 ± 0.297
GlnHis: 3.236 ± 0.306
GlnIle: 1.618 ± 0.909
GlnLys: 3.236 ± 0.306
GlnLeu: 4.854 ± 0.603
GlnMet: 0.0 ± 0.0
GlnAsn: 3.236 ± 1.818
GlnPro: 0.0 ± 0.0
GlnGln: 3.236 ± 0.306
GlnArg: 1.618 ± 1.215
GlnSer: 1.618 ± 1.215
GlnThr: 0.0 ± 0.0
GlnVal: 1.618 ± 0.909
GlnTrp: 3.236 ± 2.43
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.472 ± 3.637
ArgCys: 0.0 ± 0.0
ArgAsp: 1.618 ± 1.215
ArgGlu: 3.236 ± 0.306
ArgPhe: 1.618 ± 0.909
ArgGly: 8.091 ± 0.297
ArgHis: 1.618 ± 1.215
ArgIle: 1.618 ± 1.215
ArgLys: 3.236 ± 1.818
ArgLeu: 3.236 ± 1.818
ArgMet: 3.236 ± 0.306
ArgAsn: 4.854 ± 0.603
ArgPro: 0.0 ± 0.0
ArgGln: 4.854 ± 1.521
ArgArg: 9.709 ± 1.207
ArgSer: 3.236 ± 0.306
ArgThr: 11.327 ± 2.133
ArgVal: 1.618 ± 0.909
ArgTrp: 0.0 ± 0.0
ArgTyr: 6.472 ± 0.612
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.854 ± 0.603
SerCys: 0.0 ± 0.0
SerAsp: 4.854 ± 0.603
SerGlu: 4.854 ± 0.603
SerPhe: 0.0 ± 0.0
SerGly: 0.0 ± 0.0
SerHis: 0.0 ± 0.0
SerIle: 6.472 ± 3.637
SerLys: 1.618 ± 1.215
SerLeu: 1.618 ± 0.909
SerMet: 1.618 ± 0.909
SerAsn: 1.618 ± 1.215
SerPro: 1.618 ± 0.909
SerGln: 0.0 ± 0.0
SerArg: 6.472 ± 2.736
SerSer: 1.618 ± 1.215
SerThr: 3.236 ± 1.818
SerVal: 4.854 ± 1.521
SerTrp: 0.0 ± 0.0
SerTyr: 4.854 ± 2.728
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.472 ± 0.612
ThrCys: 0.0 ± 0.0
ThrAsp: 6.472 ± 2.736
ThrGlu: 1.618 ± 0.909
ThrPhe: 1.618 ± 0.909
ThrGly: 11.327 ± 0.009
ThrHis: 0.0 ± 0.0
ThrIle: 4.854 ± 0.603
ThrLys: 4.854 ± 0.603
ThrLeu: 6.472 ± 0.612
ThrMet: 1.618 ± 1.215
ThrAsn: 0.0 ± 0.0
ThrPro: 6.472 ± 3.637
ThrGln: 1.618 ± 0.909
ThrArg: 6.472 ± 0.612
ThrSer: 6.472 ± 0.612
ThrThr: 6.472 ± 0.612
ThrVal: 1.618 ± 0.909
ThrTrp: 1.618 ± 1.215
ThrTyr: 4.854 ± 0.603
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.236 ± 1.818
ValCys: 1.618 ± 0.909
ValAsp: 0.0 ± 0.0
ValGlu: 4.854 ± 0.603
ValPhe: 1.618 ± 1.215
ValGly: 3.236 ± 1.818
ValHis: 0.0 ± 0.0
ValIle: 3.236 ± 0.306
ValLys: 8.091 ± 0.297
ValLeu: 3.236 ± 0.306
ValMet: 0.0 ± 0.0
ValAsn: 1.618 ± 0.909
ValPro: 4.854 ± 2.728
ValGln: 3.236 ± 2.43
ValArg: 1.618 ± 1.215
ValSer: 1.618 ± 0.909
ValThr: 4.854 ± 2.728
ValVal: 4.854 ± 2.728
ValTrp: 1.618 ± 0.909
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.618 ± 1.215
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.618 ± 1.215
TrpPhe: 1.618 ± 1.215
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 3.236 ± 0.306
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 1.618 ± 0.909
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 1.618 ± 1.215
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.236 ± 0.306
TyrCys: 1.618 ± 1.215
TyrAsp: 1.618 ± 1.215
TyrGlu: 4.854 ± 1.521
TyrPhe: 1.618 ± 0.909
TyrGly: 3.236 ± 2.43
TyrHis: 1.618 ± 1.215
TyrIle: 4.854 ± 3.645
TyrLys: 1.618 ± 0.909
TyrLeu: 1.618 ± 0.909
TyrMet: 1.618 ± 0.909
TyrAsn: 3.236 ± 1.818
TyrPro: 0.0 ± 0.0
TyrGln: 1.618 ± 0.909
TyrArg: 4.854 ± 2.728
TyrSer: 1.618 ± 1.215
TyrThr: 4.854 ± 1.521
TyrVal: 1.618 ± 0.909
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (619 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
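
A protein-level bootstrap of this kind can be sketched in Python as below. This is a minimal sketch under assumptions, not the database's actual procedure: the function names, the fixed seed, the toy sequences, and the use of the standard deviation of the replicate estimates as the error are all choices made for illustration.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    hits = total = 0
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Spread of the frequency over `rounds` resamples of whole proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        # Resample proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    # Toy sequences for illustration only.
    toy = ["MKRTGAAR", "MSTRRGNL"]
    print(dipeptide_freq(toy, "RT"), bootstrap_error(toy, "RT"))
```

With only two proteins, each replicate is one of four possible draws, so the estimated errors can take only a small set of values.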

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski