Amino acid dipeptide frequency for HCBI8.215 virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
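The comparison against the uniform expectation can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build the table: the function names, the toy sequences, and the choices to count overlapping pairs within each protein only and to map every non-standard letter to X (Xaa) are all assumptions.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard residues
UNIFORM_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"


# Toy input, not the HCBI8.215 proteome.
toy = ["MARRS", "GGRR"]
freqs = dipeptide_permille(toy)
labels = {pair: classify(f) for pair, f in freqs.items()}
```

With real data, a value such as 24.59‰ would be labelled "over" and 0.0‰ "under", which corresponds to the red and blue marking described above.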

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
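As a small worked example of the unit conversion, the snippet below converts one tabulated value (ArgArg, 24.59‰) into a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper names are illustrative only.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


argarg = 24.59                             # ArgArg value from the table, in per mille
fraction = permille_to_fraction(argarg)    # about 0.02459
percent = permille_to_percent(argarg)      # about 2.459 %
fold_over_uniform = argarg / (1000 / 400)  # about 9.8 times the uniform 2.5 per mille
```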

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.279 ± 2.768
AlaCys: 0.0 ± 0.0
AlaAsp: 3.279 ± 2.768
AlaGlu: 3.279 ± 2.768
AlaPhe: 3.279 ± 0.884
AlaGly: 9.836 ± 0.153
AlaHis: 1.639 ± 1.384
AlaIle: 4.918 ± 1.295
AlaLys: 3.279 ± 1.662
AlaLeu: 1.639 ± 1.384
AlaMet: 1.639 ± 0.891
AlaAsn: 1.639 ± 0.992
AlaPro: 4.918 ± 1.27
AlaGln: 3.279 ± 0.884
AlaArg: 4.918 ± 1.295
AlaSer: 14.754 ± 6.922
AlaThr: 4.918 ± 1.27
AlaVal: 4.918 ± 3.094
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.639 ± 1.917
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 1.639 ± 1.384
CysHis: 0.0 ± 0.0
CysIle: 4.918 ± 3.854
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.639 ± 0.992
CysSer: 0.0 ± 0.0
CysThr: 1.639 ± 0.992
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.639 ± 0.992
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.557 ± 1.767
AspCys: 0.0 ± 0.0
AspAsp: 8.197 ± 3.438
AspGlu: 3.279 ± 1.983
AspPhe: 4.918 ± 1.295
AspGly: 4.918 ± 2.1
AspHis: 0.0 ± 0.0
AspIle: 1.639 ± 1.917
AspLys: 3.279 ± 3.834
AspLeu: 4.918 ± 2.1
AspMet: 0.0 ± 0.0
AspAsn: 1.639 ± 0.992
AspPro: 11.475 ± 7.564
AspGln: 1.639 ± 1.917
AspArg: 1.639 ± 0.992
AspSer: 0.0 ± 0.0
AspThr: 3.279 ± 0.884
AspVal: 6.557 ± 2.103
AspTrp: 4.918 ± 1.295
AspTyr: 3.279 ± 2.768
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.279 ± 2.768
GluCys: 1.639 ± 1.384
GluAsp: 3.279 ± 2.17
GluGlu: 1.639 ± 1.917
GluPhe: 3.279 ± 2.768
GluGly: 8.197 ± 3.438
GluHis: 0.0 ± 0.0
GluIle: 3.279 ± 1.983
GluLys: 1.639 ± 1.384
GluLeu: 1.639 ± 1.384
GluMet: 0.0 ± 0.0
GluAsn: 1.639 ± 1.384
GluPro: 0.0 ± 0.0
GluGln: 0.0 ± 0.0
GluArg: 3.279 ± 0.884
GluSer: 3.279 ± 1.662
GluThr: 4.918 ± 2.975
GluVal: 1.639 ± 1.917
GluTrp: 0.0 ± 0.0
GluTyr: 3.279 ± 2.17
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.0 ± 0.0
PheAsp: 6.557 ± 2.103
PheGlu: 1.639 ± 1.384
PhePhe: 4.918 ± 1.295
PheGly: 3.279 ± 2.17
PheHis: 1.639 ± 1.384
PheIle: 1.639 ± 0.992
PheLys: 0.0 ± 0.0
PheLeu: 0.0 ± 0.0
PheMet: 0.0 ± 0.0
PheAsn: 1.639 ± 0.992
PhePro: 4.918 ± 3.094
PheGln: 6.557 ± 2.101
PheArg: 8.197 ± 3.03
PheSer: 6.557 ± 2.614
PheThr: 3.279 ± 0.884
PheVal: 4.918 ± 1.295
PheTrp: 0.0 ± 0.0
PheTyr: 4.918 ± 1.295
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.918 ± 2.975
GlyCys: 0.0 ± 0.0
GlyAsp: 4.918 ± 2.1
GlyGlu: 4.918 ± 1.27
GlyPhe: 3.279 ± 0.884
GlyGly: 13.115 ± 6.352
GlyHis: 0.0 ± 0.0
GlyIle: 1.639 ± 0.992
GlyLys: 3.279 ± 1.662
GlyLeu: 9.836 ± 2.59
GlyMet: 0.0 ± 1.18
GlyAsn: 3.279 ± 1.662
GlyPro: 1.639 ± 0.992
GlyGln: 1.639 ± 0.992
GlyArg: 8.197 ± 1.95
GlySer: 0.0 ± 0.0
GlyThr: 11.475 ± 4.961
GlyVal: 6.557 ± 0.782
GlyTrp: 3.279 ± 0.884
GlyTyr: 1.639 ± 0.992
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.279 ± 2.17
HisCys: 0.0 ± 0.0
HisAsp: 1.639 ± 1.384
HisGlu: 1.639 ± 0.992
HisPhe: 0.0 ± 0.0
HisGly: 1.639 ± 0.992
HisHis: 1.639 ± 1.384
HisIle: 1.639 ± 1.384
HisLys: 0.0 ± 0.0
HisLeu: 1.639 ± 1.384
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.639 ± 1.384
HisGln: 1.639 ± 1.384
HisArg: 0.0 ± 0.0
HisSer: 1.639 ± 1.384
HisThr: 0.0 ± 0.0
HisVal: 1.639 ± 0.992
HisTrp: 0.0 ± 0.0
HisTyr: 4.918 ± 2.1
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.279 ± 1.662
IleCys: 1.639 ± 0.992
IleAsp: 1.639 ± 0.992
IleGlu: 0.0 ± 0.0
IlePhe: 3.279 ± 1.662
IleGly: 8.197 ± 2.909
IleHis: 0.0 ± 0.0
IleIle: 3.279 ± 0.884
IleLys: 3.279 ± 2.17
IleLeu: 1.639 ± 0.992
IleMet: 0.0 ± 0.0
IleAsn: 1.639 ± 0.992
IlePro: 0.0 ± 0.0
IleGln: 3.279 ± 1.983
IleArg: 3.279 ± 1.983
IleSer: 3.279 ± 2.17
IleThr: 3.279 ± 2.17
IleVal: 4.918 ± 2.1
IleTrp: 1.639 ± 1.917
IleTyr: 1.639 ± 0.992
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.279 ± 1.662
LysCys: 3.279 ± 1.983
LysAsp: 1.639 ± 1.384
LysGlu: 0.0 ± 0.0
LysPhe: 3.279 ± 1.662
LysGly: 0.0 ± 0.0
LysHis: 4.918 ± 1.295
LysIle: 0.0 ± 0.0
LysLys: 4.918 ± 5.751
LysLeu: 0.0 ± 0.0
LysMet: 0.0 ± 0.0
LysAsn: 1.639 ± 1.917
LysPro: 1.639 ± 1.917
LysGln: 0.0 ± 0.0
LysArg: 1.639 ± 1.384
LysSer: 3.279 ± 1.662
LysThr: 0.0 ± 0.0
LysVal: 0.0 ± 0.0
LysTrp: 1.639 ± 1.917
LysTyr: 4.918 ± 3.854
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.557 ± 0.782
LeuCys: 0.0 ± 0.0
LeuAsp: 6.557 ± 2.103
LeuGlu: 9.836 ± 2.59
LeuPhe: 6.557 ± 2.614
LeuGly: 3.279 ± 2.17
LeuHis: 1.639 ± 1.384
LeuIle: 0.0 ± 0.0
LeuLys: 1.639 ± 1.917
LeuLeu: 6.557 ± 3.445
LeuMet: 0.0 ± 0.0
LeuAsn: 1.639 ± 1.384
LeuPro: 0.0 ± 0.0
LeuGln: 0.0 ± 0.0
LeuArg: 0.0 ± 0.0
LeuSer: 1.639 ± 0.992
LeuThr: 3.279 ± 2.768
LeuVal: 0.0 ± 0.0
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.279 ± 1.662
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.639 ± 0.992
MetCys: 0.0 ± 0.0
MetAsp: 3.279 ± 1.662
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 1.639 ± 0.992
MetHis: 0.0 ± 0.0
MetIle: 1.639 ± 0.992
MetLys: 0.0 ± 0.0
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 1.639 ± 1.384
MetThr: 1.639 ± 0.992
MetVal: 0.0 ± 0.0
MetTrp: 1.639 ± 1.917
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.279 ± 1.983
AsnCys: 1.639 ± 1.917
AsnAsp: 1.639 ± 0.992
AsnGlu: 1.639 ± 1.917
AsnPhe: 0.0 ± 0.0
AsnGly: 1.639 ± 0.992
AsnHis: 0.0 ± 0.0
AsnIle: 3.279 ± 0.884
AsnLys: 1.639 ± 1.384
AsnLeu: 1.639 ± 0.992
AsnMet: 3.279 ± 1.983
AsnAsn: 1.639 ± 0.992
AsnPro: 0.0 ± 0.0
AsnGln: 3.279 ± 2.17
AsnArg: 3.279 ± 1.983
AsnSer: 1.639 ± 0.992
AsnThr: 0.0 ± 0.0
AsnVal: 3.279 ± 1.983
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.639 ± 1.384
ProCys: 1.639 ± 1.917
ProAsp: 1.639 ± 1.384
ProGlu: 3.279 ± 2.768
ProPhe: 1.639 ± 1.384
ProGly: 3.279 ± 1.983
ProHis: 1.639 ± 0.992
ProIle: 3.279 ± 1.662
ProLys: 0.0 ± 0.0
ProLeu: 1.639 ± 1.917
ProMet: 0.0 ± 0.0
ProAsn: 3.279 ± 0.884
ProPro: 0.0 ± 0.0
ProGln: 4.918 ± 1.295
ProArg: 6.557 ± 2.101
ProSer: 3.279 ± 0.884
ProThr: 3.279 ± 0.884
ProVal: 1.639 ± 0.992
ProTrp: 1.639 ± 0.992
ProTyr: 3.279 ± 0.884
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.279 ± 1.983
GlnCys: 1.639 ± 1.384
GlnAsp: 1.639 ± 1.917
GlnGlu: 0.0 ± 0.0
GlnPhe: 3.279 ± 0.884
GlnGly: 1.639 ± 0.992
GlnHis: 0.0 ± 0.0
GlnIle: 1.639 ± 1.384
GlnLys: 0.0 ± 0.0
GlnLeu: 3.279 ± 1.662
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 1.639 ± 0.992
GlnGln: 1.639 ± 0.992
GlnArg: 1.639 ± 0.992
GlnSer: 4.918 ± 1.27
GlnThr: 3.279 ± 1.983
GlnVal: 1.639 ± 1.384
GlnTrp: 1.639 ± 1.384
GlnTyr: 1.639 ± 1.917
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.279 ± 2.768
ArgCys: 0.0 ± 0.0
ArgAsp: 4.918 ± 2.1
ArgGlu: 1.639 ± 1.384
ArgPhe: 6.557 ± 1.767
ArgGly: 4.918 ± 1.27
ArgHis: 3.279 ± 0.884
ArgIle: 1.639 ± 0.992
ArgLys: 3.279 ± 1.983
ArgLeu: 3.279 ± 0.884
ArgMet: 0.0 ± 0.0
ArgAsn: 4.918 ± 2.975
ArgPro: 3.279 ± 0.884
ArgGln: 0.0 ± 0.0
ArgArg: 24.59 ± 10.9
ArgSer: 11.475 ± 5.263
ArgThr: 4.918 ± 2.975
ArgVal: 8.197 ± 3.03
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.918 ± 1.27
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.557 ± 2.101
SerCys: 0.0 ± 0.0
SerAsp: 4.918 ± 1.27
SerGlu: 1.639 ± 0.992
SerPhe: 4.918 ± 1.27
SerGly: 6.557 ± 3.967
SerHis: 1.639 ± 1.384
SerIle: 4.918 ± 1.27
SerLys: 3.279 ± 1.983
SerLeu: 8.197 ± 5.052
SerMet: 3.279 ± 1.318
SerAsn: 1.639 ± 1.917
SerPro: 4.918 ± 2.975
SerGln: 3.279 ± 0.884
SerArg: 6.557 ± 2.101
SerSer: 4.918 ± 2.975
SerThr: 6.557 ± 0.782
SerVal: 1.639 ± 0.992
SerTrp: 1.639 ± 1.917
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.557 ± 3.967
ThrCys: 0.0 ± 0.0
ThrAsp: 3.279 ± 1.662
ThrGlu: 3.279 ± 1.662
ThrPhe: 3.279 ± 0.884
ThrGly: 1.639 ± 0.992
ThrHis: 1.639 ± 1.384
ThrIle: 4.918 ± 1.27
ThrLys: 1.639 ± 1.384
ThrLeu: 0.0 ± 0.0
ThrMet: 0.0 ± 0.0
ThrAsn: 3.279 ± 1.983
ThrPro: 9.836 ± 4.333
ThrGln: 3.279 ± 1.983
ThrArg: 4.918 ± 1.27
ThrSer: 4.918 ± 2.1
ThrThr: 4.918 ± 1.953
ThrVal: 3.279 ± 0.884
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.639 ± 1.384
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.639 ± 1.384
ValCys: 0.0 ± 0.0
ValAsp: 6.557 ± 2.103
ValGlu: 6.557 ± 0.782
ValPhe: 6.557 ± 0.782
ValGly: 1.639 ± 1.384
ValHis: 1.639 ± 1.384
ValIle: 1.639 ± 0.992
ValLys: 1.639 ± 1.917
ValLeu: 3.279 ± 0.884
ValMet: 1.639 ± 0.992
ValAsn: 1.639 ± 0.992
ValPro: 0.0 ± 0.0
ValGln: 0.0 ± 0.0
ValArg: 9.836 ± 2.54
ValSer: 3.279 ± 0.884
ValThr: 1.639 ± 1.917
ValVal: 3.279 ± 0.884
ValTrp: 1.639 ± 0.992
ValTyr: 3.279 ± 1.983
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 4.918 ± 1.295
TrpCys: 0.0 ± 0.0
TrpAsp: 1.639 ± 1.384
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 3.279 ± 1.662
TrpHis: 3.279 ± 0.884
TrpIle: 1.639 ± 1.917
TrpLys: 0.0 ± 0.0
TrpLeu: 1.639 ± 1.917
TrpMet: 1.639 ± 1.917
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.639 ± 0.992
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 1.639 ± 0.992
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 9.836 ± 6.866
TyrCys: 0.0 ± 0.0
TyrAsp: 4.918 ± 1.27
TyrGlu: 1.639 ± 1.384
TyrPhe: 1.639 ± 1.917
TyrGly: 4.918 ± 2.975
TyrHis: 0.0 ± 0.0
TyrIle: 1.639 ± 1.384
TyrLys: 3.279 ± 3.834
TyrLeu: 1.639 ± 0.992
TyrMet: 0.0 ± 0.0
TyrAsn: 1.639 ± 0.992
TyrPro: 1.639 ± 1.917
TyrGln: 0.0 ± 0.0
TyrArg: 3.279 ± 2.768
TyrSer: 6.557 ± 3.324
TyrThr: 0.0 ± 0.0
TyrVal: 1.639 ± 0.992
TyrTrp: 1.639 ± 0.992
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (611 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
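A protein-level bootstrap of this kind can be sketched as follows. The exact procedure behind the tabulated errors is not described here, so this is only one plausible reading: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of the 100 resamples, and the spread of those estimates is reported. The function names, the fixed seed, the use of the sample standard deviation, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_sd(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not the HCBI8.215 proteome.
toy = ["MARRSRR", "GGRRA", "MKTAYIAK"]
rr_error = bootstrap_sd(toy, "RR")
ww_error = bootstrap_sd(toy, "WW")  # dipeptide absent from every protein
```

With only three proteins, each resample can differ strongly from the original set, which is consistent with the large errors relative to the means seen in many cells of the table.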

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski