Amino acid dipeptide frequency for Beihai weivirus-like virus 9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
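As a minimal sketch of how such frequencies could be computed: overlapping pairs of adjacent residues are counted within each protein and divided by the total number of pairs. The function name, the toy sequences and the exact normalisation are assumptions made for illustration, not the Proteome-pI implementation.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: sequences use one-letter codes and the denominator is the
    total number of overlapping pairs over all proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input: AA occurs twice, AC twice, CA once (5 pairs).
freqs = dipeptide_per_mille(["AAAC", "ACA"])
# AA -> 400.0, AC -> 400.0, CA -> 200.0 (per mille)
```

Counting within each protein separately keeps the last residue of one protein from being paired with the first residue of the next.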

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
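The uniform expectation can be worked out in a few lines. The sketch below is illustrative only: the function names are hypothetical, and the simple greater-than / less-than comparison is an assumption about how the red and blue marking could be derived, not a description of the page's actual rule.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"         # would be marked red
    if observed_per_mille < expected:
        return "under"        # would be marked blue
    return "as expected"


# 20 amino acids -> 400 dipeptides -> 2.5 per mille (0.25%) each.
uniform_20 = expected_per_mille(20)
# With Xaa as a 21st symbol there are 441 dipeptides, about 2.27 per mille each.
uniform_21 = expected_per_mille(21)

# Values taken from the table: AlaAla = 12.831, AlaCys = 0.855.
labels = (classify(12.831), classify(0.855))
```

Under this rule AlaAla (12.831‰) counts as overrepresented and AlaCys (0.855‰) as underrepresented.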

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
12.831AlaAla: 12.831 ± 2.864
0.855AlaCys: 0.855 ± 1.182
1.711AlaAsp: 1.711 ± 0.712
5.988AlaGlu: 5.988 ± 1.638
3.422AlaPhe: 3.422 ± 1.425
4.277AlaGly: 4.277 ± 0.698
0.855AlaHis: 0.855 ± 0.47
7.699AlaIle: 7.699 ± 0.925
7.699AlaLys: 7.699 ± 0.925
5.133AlaLeu: 5.133 ± 0.485
2.566AlaMet: 2.566 ± 0.287
2.566AlaAsn: 2.566 ± 1.41
2.566AlaPro: 2.566 ± 1.895
0.855AlaGln: 0.855 ± 0.47
5.133AlaArg: 5.133 ± 2.137
5.133AlaSer: 5.133 ± 1.168
4.277AlaThr: 4.277 ± 2.607
1.711AlaVal: 1.711 ± 0.712
2.566AlaTrp: 2.566 ± 1.41
1.711AlaTyr: 1.711 ± 0.94
0.0AlaXaa: 0.0 ± 0.0
Cys
0.855CysAla: 0.855 ± 0.47
0.855CysCys: 0.855 ± 0.47
0.855CysAsp: 0.855 ± 0.47
0.855CysGlu: 0.855 ± 1.182
1.711CysPhe: 1.711 ± 0.94
1.711CysGly: 1.711 ± 0.94
0.0CysHis: 0.0 ± 0.0
3.422CysIle: 3.422 ± 0.228
0.0CysLys: 0.0 ± 0.0
2.566CysLeu: 2.566 ± 0.242
0.0CysMet: 0.0 ± 0.0
0.855CysAsn: 0.855 ± 0.47
0.855CysPro: 0.855 ± 1.182
1.711CysGln: 1.711 ± 0.94
0.0CysArg: 0.0 ± 0.0
2.566CysSer: 2.566 ± 0.242
1.711CysThr: 1.711 ± 0.712
3.422CysVal: 3.422 ± 3.077
0.855CysTrp: 0.855 ± 0.47
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.277AspAla: 4.277 ± 0.955
0.855AspCys: 0.855 ± 0.47
4.277AspAsp: 4.277 ± 0.955
2.566AspGlu: 2.566 ± 1.41
0.855AspPhe: 0.855 ± 1.182
3.422AspGly: 3.422 ± 1.88
0.855AspHis: 0.855 ± 0.47
1.711AspIle: 1.711 ± 0.94
4.277AspLys: 4.277 ± 2.607
5.133AspLeu: 5.133 ± 2.82
1.711AspMet: 1.711 ± 0.712
0.855AspAsn: 0.855 ± 0.47
2.566AspPro: 2.566 ± 0.242
1.711AspGln: 1.711 ± 0.712
5.988AspArg: 5.988 ± 0.015
2.566AspSer: 2.566 ± 1.41
2.566AspThr: 2.566 ± 1.895
4.277AspVal: 4.277 ± 0.955
0.855AspTrp: 0.855 ± 0.47
1.711AspTyr: 1.711 ± 0.712
0.0AspXaa: 0.0 ± 0.0
Glu
2.566GluAla: 2.566 ± 1.41
1.711GluCys: 1.711 ± 0.712
7.699GluAsp: 7.699 ± 0.727
7.699GluGlu: 7.699 ± 2.578
5.133GluPhe: 5.133 ± 0.485
5.988GluGly: 5.988 ± 3.29
0.855GluHis: 0.855 ± 0.47
5.988GluIle: 5.988 ± 3.29
3.422GluLys: 3.422 ± 0.228
3.422GluLeu: 3.422 ± 1.88
0.0GluMet: 0.0 ± 0.0
4.277GluAsn: 4.277 ± 2.35
2.566GluPro: 2.566 ± 0.242
0.0GluGln: 0.0 ± 0.0
6.843GluArg: 6.843 ± 2.108
5.988GluSer: 5.988 ± 3.29
1.711GluThr: 1.711 ± 0.94
2.566GluVal: 2.566 ± 1.41
1.711GluTrp: 1.711 ± 0.712
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.277PheAla: 4.277 ± 0.698
0.0PheCys: 0.0 ± 0.0
0.855PheAsp: 0.855 ± 0.47
2.566PheGlu: 2.566 ± 1.41
0.0PhePhe: 0.0 ± 0.0
0.855PheGly: 0.855 ± 0.47
1.711PheHis: 1.711 ± 2.365
0.855PheIle: 0.855 ± 0.47
5.133PheLys: 5.133 ± 1.168
3.422PheLeu: 3.422 ± 3.077
0.0PheMet: 0.0 ± 0.0
1.711PheAsn: 1.711 ± 0.712
1.711PhePro: 1.711 ± 2.365
0.0PheGln: 0.0 ± 0.0
1.711PheArg: 1.711 ± 0.94
2.566PheSer: 2.566 ± 1.41
3.422PheThr: 3.422 ± 1.425
0.855PheVal: 0.855 ± 0.47
0.0PheTrp: 0.0 ± 0.0
0.855PheTyr: 0.855 ± 0.47
0.0PheXaa: 0.0 ± 0.0
Gly
2.566GlyAla: 2.566 ± 1.41
0.855GlyCys: 0.855 ± 0.47
5.133GlyAsp: 5.133 ± 1.168
2.566GlyGlu: 2.566 ± 1.41
4.277GlyPhe: 4.277 ± 0.698
6.843GlyGly: 6.843 ± 1.197
2.566GlyHis: 2.566 ± 0.242
1.711GlyIle: 1.711 ± 2.365
5.133GlyLys: 5.133 ± 2.137
5.133GlyLeu: 5.133 ± 1.168
1.711GlyMet: 1.711 ± 0.712
3.422GlyAsn: 3.422 ± 1.425
1.711GlyPro: 1.711 ± 0.712
3.422GlyGln: 3.422 ± 0.228
2.566GlyArg: 2.566 ± 1.895
3.422GlySer: 3.422 ± 0.228
2.566GlyThr: 2.566 ± 1.41
8.554GlyVal: 8.554 ± 0.257
3.422GlyTrp: 3.422 ± 0.228
2.566GlyTyr: 2.566 ± 0.242
0.0GlyXaa: 0.0 ± 0.0
His
1.711HisAla: 1.711 ± 0.712
0.855HisCys: 0.855 ± 0.47
0.0HisAsp: 0.0 ± 0.0
0.855HisGlu: 0.855 ± 1.182
0.855HisPhe: 0.855 ± 1.182
0.855HisGly: 0.855 ± 0.47
0.0HisHis: 0.0 ± 0.0
0.855HisIle: 0.855 ± 0.47
1.711HisLys: 1.711 ± 2.365
3.422HisLeu: 3.422 ± 0.228
0.855HisMet: 0.855 ± 0.47
0.855HisAsn: 0.855 ± 1.182
1.711HisPro: 1.711 ± 0.712
0.0HisGln: 0.0 ± 0.0
1.711HisArg: 1.711 ± 0.712
1.711HisSer: 1.711 ± 0.94
2.566HisThr: 2.566 ± 0.242
0.855HisVal: 0.855 ± 0.47
0.0HisTrp: 0.0 ± 0.0
0.855HisTyr: 0.855 ± 1.182
0.0HisXaa: 0.0 ± 0.0
Ile
5.133IleAla: 5.133 ± 2.137
1.711IleCys: 1.711 ± 0.712
3.422IleAsp: 3.422 ± 0.228
3.422IleGlu: 3.422 ± 1.88
3.422IlePhe: 3.422 ± 1.88
2.566IleGly: 2.566 ± 0.242
0.0IleHis: 0.0 ± 0.0
2.566IleIle: 2.566 ± 1.41
3.422IleLys: 3.422 ± 1.88
3.422IleLeu: 3.422 ± 0.228
1.711IleMet: 1.711 ± 0.94
2.566IleAsn: 2.566 ± 0.242
1.711IlePro: 1.711 ± 0.712
1.711IleGln: 1.711 ± 0.712
3.422IleArg: 3.422 ± 1.88
4.277IleSer: 4.277 ± 0.955
5.133IleThr: 5.133 ± 0.485
3.422IleVal: 3.422 ± 0.228
0.855IleTrp: 0.855 ± 0.47
1.711IleTyr: 1.711 ± 0.94
0.0IleXaa: 0.0 ± 0.0
Lys
1.711LysAla: 1.711 ± 0.712
1.711LysCys: 1.711 ± 0.94
0.855LysAsp: 0.855 ± 0.47
4.277LysGlu: 4.277 ± 2.35
1.711LysPhe: 1.711 ± 0.94
4.277LysGly: 4.277 ± 2.607
1.711LysHis: 1.711 ± 0.712
2.566LysIle: 2.566 ± 0.242
5.133LysLys: 5.133 ± 0.485
5.133LysLeu: 5.133 ± 1.168
5.133LysMet: 5.133 ± 1.574
2.566LysAsn: 2.566 ± 0.242
5.133LysPro: 5.133 ± 2.137
1.711LysGln: 1.711 ± 0.94
1.711LysArg: 1.711 ± 0.94
3.422LysSer: 3.422 ± 0.228
3.422LysThr: 3.422 ± 1.425
5.133LysVal: 5.133 ± 0.485
0.855LysTrp: 0.855 ± 0.47
2.566LysTyr: 2.566 ± 0.242
0.0LysXaa: 0.0 ± 0.0
Leu
4.277LeuAla: 4.277 ± 0.955
0.855LeuCys: 0.855 ± 0.47
5.133LeuAsp: 5.133 ± 2.82
1.711LeuGlu: 1.711 ± 0.94
2.566LeuPhe: 2.566 ± 0.242
7.699LeuGly: 7.699 ± 0.727
1.711LeuHis: 1.711 ± 0.712
4.277LeuIle: 4.277 ± 0.955
6.843LeuLys: 6.843 ± 2.108
4.277LeuLeu: 4.277 ± 2.35
0.855LeuMet: 0.855 ± 0.47
1.711LeuAsn: 1.711 ± 0.712
6.843LeuPro: 6.843 ± 0.455
1.711LeuGln: 1.711 ± 0.712
5.133LeuArg: 5.133 ± 2.82
7.699LeuSer: 7.699 ± 2.578
4.277LeuThr: 4.277 ± 2.607
0.855LeuVal: 0.855 ± 0.47
0.0LeuTrp: 0.0 ± 0.0
0.855LeuTyr: 0.855 ± 1.182
0.0LeuXaa: 0.0 ± 0.0
Met
2.566MetAla: 2.566 ± 0.242
0.855MetCys: 0.855 ± 1.182
0.855MetAsp: 0.855 ± 1.182
4.277MetGlu: 4.277 ± 0.698
0.855MetPhe: 0.855 ± 0.47
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.855MetIle: 0.855 ± 0.47
0.0MetLys: 0.0 ± 0.0
0.855MetLeu: 0.855 ± 0.47
0.855MetMet: 0.855 ± 1.182
0.855MetAsn: 0.855 ± 0.47
4.277MetPro: 4.277 ± 0.955
1.711MetGln: 1.711 ± 0.94
1.711MetArg: 1.711 ± 0.94
3.422MetSer: 3.422 ± 0.228
1.711MetThr: 1.711 ± 0.712
3.422MetVal: 3.422 ± 1.425
0.855MetTrp: 0.855 ± 0.47
0.855MetTyr: 0.855 ± 0.47
0.0MetXaa: 0.0 ± 0.0
Asn
3.422AsnAla: 3.422 ± 0.228
0.855AsnCys: 0.855 ± 1.182
2.566AsnAsp: 2.566 ± 0.242
3.422AsnGlu: 3.422 ± 0.228
2.566AsnPhe: 2.566 ± 1.895
4.277AsnGly: 4.277 ± 0.698
0.0AsnHis: 0.0 ± 0.0
0.855AsnIle: 0.855 ± 0.47
1.711AsnLys: 1.711 ± 0.712
1.711AsnLeu: 1.711 ± 0.712
2.566AsnMet: 2.566 ± 1.895
0.0AsnAsn: 0.0 ± 0.0
1.711AsnPro: 1.711 ± 0.94
0.855AsnGln: 0.855 ± 0.47
0.855AsnArg: 0.855 ± 0.47
0.0AsnSer: 0.0 ± 0.0
3.422AsnThr: 3.422 ± 1.425
1.711AsnVal: 1.711 ± 0.712
0.855AsnTrp: 0.855 ± 0.47
1.711AsnTyr: 1.711 ± 2.365
0.0AsnXaa: 0.0 ± 0.0
Pro
2.566ProAla: 2.566 ± 3.547
0.855ProCys: 0.855 ± 0.47
2.566ProAsp: 2.566 ± 0.242
5.133ProGlu: 5.133 ± 1.168
0.0ProPhe: 0.0 ± 0.0
4.277ProGly: 4.277 ± 0.955
0.855ProHis: 0.855 ± 1.182
1.711ProIle: 1.711 ± 2.365
3.422ProLys: 3.422 ± 1.88
5.988ProLeu: 5.988 ± 3.32
2.566ProMet: 2.566 ± 0.242
1.711ProAsn: 1.711 ± 0.94
5.988ProPro: 5.988 ± 0.015
0.855ProGln: 0.855 ± 0.47
5.133ProArg: 5.133 ± 0.485
3.422ProSer: 3.422 ± 1.425
4.277ProThr: 4.277 ± 0.698
4.277ProVal: 4.277 ± 0.955
0.855ProTrp: 0.855 ± 1.182
1.711ProTyr: 1.711 ± 2.365
0.0ProXaa: 0.0 ± 0.0
Gln
2.566GlnAla: 2.566 ± 1.41
0.0GlnCys: 0.0 ± 0.0
0.855GlnAsp: 0.855 ± 0.47
0.855GlnGlu: 0.855 ± 0.47
0.0GlnPhe: 0.0 ± 0.0
0.0GlnGly: 0.0 ± 0.0
0.855GlnHis: 0.855 ± 0.47
1.711GlnIle: 1.711 ± 0.712
0.855GlnLys: 0.855 ± 0.47
0.855GlnLeu: 0.855 ± 1.182
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
2.566GlnPro: 2.566 ± 0.242
0.0GlnGln: 0.0 ± 0.0
4.277GlnArg: 4.277 ± 2.35
4.277GlnSer: 4.277 ± 0.698
0.0GlnThr: 0.0 ± 0.0
1.711GlnVal: 1.711 ± 0.712
0.0GlnTrp: 0.0 ± 0.0
3.422GlnTyr: 3.422 ± 0.228
0.0GlnXaa: 0.0 ± 0.0
Arg
7.699ArgAla: 7.699 ± 0.925
2.566ArgCys: 2.566 ± 1.41
4.277ArgAsp: 4.277 ± 0.698
2.566ArgGlu: 2.566 ± 0.242
2.566ArgPhe: 2.566 ± 0.242
3.422ArgGly: 3.422 ± 3.077
1.711ArgHis: 1.711 ± 2.365
3.422ArgIle: 3.422 ± 0.228
3.422ArgLys: 3.422 ± 1.425
5.133ArgLeu: 5.133 ± 2.82
4.277ArgMet: 4.277 ± 0.698
0.855ArgAsn: 0.855 ± 0.47
1.711ArgPro: 1.711 ± 0.94
2.566ArgGln: 2.566 ± 0.242
4.277ArgArg: 4.277 ± 0.698
4.277ArgSer: 4.277 ± 2.35
5.133ArgThr: 5.133 ± 1.168
3.422ArgVal: 3.422 ± 1.88
3.422ArgTrp: 3.422 ± 0.228
1.711ArgTyr: 1.711 ± 0.712
0.0ArgXaa: 0.0 ± 0.0
Ser
6.843SerAla: 6.843 ± 0.455
3.422SerCys: 3.422 ± 1.88
2.566SerAsp: 2.566 ± 1.895
7.699SerGlu: 7.699 ± 4.23
0.855SerPhe: 0.855 ± 0.47
7.699SerGly: 7.699 ± 2.38
1.711SerHis: 1.711 ± 0.712
3.422SerIle: 3.422 ± 1.88
2.566SerLys: 2.566 ± 1.41
5.133SerLeu: 5.133 ± 2.82
1.711SerMet: 1.711 ± 0.94
0.855SerAsn: 0.855 ± 1.182
5.133SerPro: 5.133 ± 2.137
0.855SerGln: 0.855 ± 0.47
3.422SerArg: 3.422 ± 1.88
4.277SerSer: 4.277 ± 0.955
5.988SerThr: 5.988 ± 3.29
4.277SerVal: 4.277 ± 2.607
0.855SerTrp: 0.855 ± 0.47
0.855SerTyr: 0.855 ± 0.47
0.0SerXaa: 0.0 ± 0.0
Thr
5.133ThrAla: 5.133 ± 0.485
3.422ThrCys: 3.422 ± 1.425
1.711ThrAsp: 1.711 ± 2.365
4.277ThrGlu: 4.277 ± 2.35
0.855ThrPhe: 0.855 ± 1.182
5.988ThrGly: 5.988 ± 0.015
1.711ThrHis: 1.711 ± 0.712
6.843ThrIle: 6.843 ± 2.108
4.277ThrLys: 4.277 ± 2.607
2.566ThrLeu: 2.566 ± 0.242
0.855ThrMet: 0.855 ± 0.47
5.133ThrAsn: 5.133 ± 2.137
5.133ThrPro: 5.133 ± 0.485
2.566ThrGln: 2.566 ± 0.242
5.133ThrArg: 5.133 ± 0.485
1.711ThrSer: 1.711 ± 0.94
5.988ThrThr: 5.988 ± 1.638
2.566ThrVal: 2.566 ± 1.41
0.0ThrTrp: 0.0 ± 0.0
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.277ValAla: 4.277 ± 0.698
0.855ValCys: 0.855 ± 0.47
3.422ValAsp: 3.422 ± 0.228
5.133ValGlu: 5.133 ± 0.485
0.855ValPhe: 0.855 ± 0.47
4.277ValGly: 4.277 ± 0.698
4.277ValHis: 4.277 ± 0.698
3.422ValIle: 3.422 ± 1.425
1.711ValLys: 1.711 ± 0.712
4.277ValLeu: 4.277 ± 0.955
0.0ValMet: 0.0 ± 0.0
2.566ValAsn: 2.566 ± 3.547
3.422ValPro: 3.422 ± 1.425
1.711ValGln: 1.711 ± 0.94
4.277ValArg: 4.277 ± 0.955
7.699ValSer: 7.699 ± 2.38
3.422ValThr: 3.422 ± 0.228
5.988ValVal: 5.988 ± 1.667
0.855ValTrp: 0.855 ± 0.47
0.855ValTyr: 0.855 ± 0.47
0.0ValXaa: 0.0 ± 0.0
Trp
2.566TrpAla: 2.566 ± 1.41
0.0TrpCys: 0.0 ± 0.0
2.566TrpAsp: 2.566 ± 1.41
1.711TrpGlu: 1.711 ± 0.94
0.0TrpPhe: 0.0 ± 0.0
0.855TrpGly: 0.855 ± 1.182
0.855TrpHis: 0.855 ± 0.47
0.855TrpIle: 0.855 ± 0.47
0.0TrpLys: 0.0 ± 0.0
0.855TrpLeu: 0.855 ± 0.47
0.855TrpMet: 0.855 ± 0.47
0.855TrpAsn: 0.855 ± 1.182
0.855TrpPro: 0.855 ± 0.47
0.0TrpGln: 0.0 ± 0.0
2.566TrpArg: 2.566 ± 0.242
0.0TrpSer: 0.0 ± 0.0
1.711TrpThr: 1.711 ± 0.94
1.711TrpVal: 1.711 ± 0.712
1.711TrpTrp: 1.711 ± 0.94
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.566TyrAla: 2.566 ± 0.242
1.711TyrCys: 1.711 ± 2.365
1.711TyrAsp: 1.711 ± 0.712
2.566TyrGlu: 2.566 ± 0.242
0.0TyrPhe: 0.0 ± 0.0
0.855TyrGly: 0.855 ± 0.47
0.0TyrHis: 0.0 ± 0.0
0.855TyrIle: 0.855 ± 0.47
0.0TyrLys: 0.0 ± 0.0
0.855TyrLeu: 0.855 ± 0.47
1.711TyrMet: 1.711 ± 0.94
0.855TyrAsn: 0.855 ± 1.182
0.0TyrPro: 0.0 ± 0.0
0.855TyrGln: 0.855 ± 0.47
2.566TyrArg: 2.566 ± 1.895
1.711TyrSer: 1.711 ± 0.712
2.566TyrThr: 2.566 ± 0.242
2.566TyrVal: 2.566 ± 0.242
0.0TyrTrp: 0.0 ± 0.0
0.855TyrTyr: 0.855 ± 0.47
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1170 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
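A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The function names, the fixed seed and the use of the standard deviation as the error measure are assumptions for illustration; the page does not state which estimator is used.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins (not residues) with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome with two very different proteins.
error = bootstrap_error(["AAAA", "CCCC"], "AA")
```

With only two proteins, as in this proteome, each replicate can contain either protein twice or both once, so the estimated errors can be large relative to the values themselves.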

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski