Amino acid dipeptide frequency for Hubei sobemo-like virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
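The definitions above can be illustrated with a minimal Python sketch. It counts overlapping residue pairs in a set of one-letter protein sequences, maps any non-standard letter to X (Xaa), and reports frequencies in per mille next to the uniform expectation of 2.5‰. The function name `dipeptide_permille` and the toy sequences are hypothetical, and the sketch assumes that pairs are counted within each protein only; the page does not state how protein boundaries were handled in the database.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping windows of length 2; assumption: pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400

toy = ["MKTAYIAKQR", "AAXGT"]  # hypothetical sequences, not the viral proteome
freqs = dipeptide_permille(toy)
overrepresented = sorted(d for d, f in freqs.items() if f > EXPECTED_PERMILLE)
```

Multiplying any value in `freqs` by 10^-3 gives the plain fraction, and comparing it with `EXPECTED_PERMILLE` mirrors the red/blue marking described above.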

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.276 ± 2.805
AlaCys: 1.855 ± 0.563
AlaAsp: 0.0 ± 0.0
AlaGlu: 8.04 ± 3.247
AlaPhe: 1.855 ± 0.904
AlaGly: 8.04 ± 0.91
AlaHis: 1.237 ± 0.456
AlaIle: 3.711 ± 1.6
AlaLys: 4.947 ± 1.1
AlaLeu: 5.566 ± 0.56
AlaMet: 1.237 ± 0.385
AlaAsn: 4.329 ± 0.255
AlaPro: 4.947 ± 1.727
AlaGln: 4.947 ± 1.486
AlaArg: 1.237 ± 0.468
AlaSer: 9.895 ± 2.3
AlaThr: 8.658 ± 2.742
AlaVal: 9.895 ± 3.65
AlaTrp: 2.474 ± 0.072
AlaTyr: 4.947 ± 0.68
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.237 ± 0.468
CysCys: 0.0 ± 0.0
CysAsp: 1.237 ± 0.799
CysGlu: 0.0 ± 0.0
CysPhe: 1.237 ± 1.293
CysGly: 0.618 ± 0.4
CysHis: 1.237 ± 1.039
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.618 ± 0.4
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 3.711 ± 0.49
CysThr: 1.237 ± 0.468
CysVal: 1.855 ± 0.327
CysTrp: 0.618 ± 0.519
CysTyr: 0.618 ± 0.4
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.855 ± 1.274
AspCys: 1.237 ± 0.468
AspAsp: 2.474 ± 0.739
AspGlu: 2.474 ± 1.398
AspPhe: 1.855 ± 1.558
AspGly: 4.329 ± 0.752
AspHis: 0.618 ± 0.4
AspIle: 0.0 ± 0.0
AspLys: 1.855 ± 0.563
AspLeu: 4.329 ± 0.892
AspMet: 1.855 ± 1.045
AspAsn: 0.0 ± 0.0
AspPro: 3.711 ± 1.403
AspGln: 0.618 ± 0.519
AspArg: 1.237 ± 0.468
AspSer: 3.092 ± 1.971
AspThr: 3.092 ± 0.944
AspVal: 5.566 ± 1.879
AspTrp: 0.0 ± 0.0
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.566 ± 2.439
GluCys: 1.237 ± 1.039
GluAsp: 0.618 ± 0.519
GluGlu: 3.092 ± 1.407
GluPhe: 0.0 ± 0.0
GluGly: 5.566 ± 0.455
GluHis: 1.855 ± 0.904
GluIle: 2.474 ± 0.739
GluLys: 4.329 ± 0.892
GluLeu: 4.329 ± 1.566
GluMet: 3.092 ± 1.249
GluAsn: 3.711 ± 1.403
GluPro: 1.855 ± 0.698
GluGln: 1.237 ± 0.727
GluArg: 2.474 ± 0.912
GluSer: 6.803 ± 2.483
GluThr: 4.329 ± 1.566
GluVal: 3.092 ± 0.483
GluTrp: 2.474 ± 0.935
GluTyr: 0.618 ± 0.647
GluXaa: 0.0 ± 0.0
Phe
PheAla: 6.184 ± 0.944
PheCys: 1.237 ± 0.468
PheAsp: 1.855 ± 1.274
PheGlu: 2.474 ± 1.539
PhePhe: 0.0 ± 0.0
PheGly: 1.855 ± 0.904
PheHis: 0.618 ± 0.519
PheIle: 1.855 ± 0.563
PheLys: 1.855 ± 0.327
PheLeu: 3.711 ± 1.395
PheMet: 0.0 ± 0.0
PheAsn: 0.618 ± 0.519
PhePro: 1.855 ± 0.327
PheGln: 1.855 ± 0.698
PheArg: 1.855 ± 0.327
PheSer: 4.947 ± 2.09
PheThr: 2.474 ± 0.912
PheVal: 4.329 ± 0.612
PheTrp: 0.0 ± 0.0
PheTyr: 1.237 ± 0.456
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.092 ± 1.054
GlyCys: 0.618 ± 0.519
GlyAsp: 8.04 ± 1.204
GlyGlu: 1.855 ± 0.904
GlyPhe: 4.329 ± 0.892
GlyGly: 5.566 ± 2.07
GlyHis: 1.855 ± 0.698
GlyIle: 3.092 ± 0.483
GlyLys: 4.329 ± 2.18
GlyLeu: 4.329 ± 1.208
GlyMet: 1.855 ± 1.94
GlyAsn: 3.711 ± 3.151
GlyPro: 2.474 ± 0.072
GlyGln: 1.855 ± 0.563
GlyArg: 4.947 ± 1.376
GlySer: 8.04 ± 0.91
GlyThr: 8.04 ± 2.353
GlyVal: 3.092 ± 1.054
GlyTrp: 1.855 ± 0.327
GlyTyr: 2.474 ± 0.912
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.855 ± 0.904
HisCys: 0.618 ± 0.4
HisAsp: 1.237 ± 1.039
HisGlu: 1.855 ± 0.698
HisPhe: 0.618 ± 0.4
HisGly: 2.474 ± 0.912
HisHis: 0.0 ± 0.0
HisIle: 0.618 ± 0.519
HisLys: 0.0 ± 0.0
HisLeu: 1.237 ± 0.456
HisMet: 0.618 ± 0.4
HisAsn: 0.0 ± 0.0
HisPro: 1.237 ± 1.039
HisGln: 0.0 ± 0.0
HisArg: 0.618 ± 0.4
HisSer: 1.855 ± 0.698
HisThr: 1.237 ± 0.799
HisVal: 0.618 ± 0.4
HisTrp: 0.0 ± 0.0
HisTyr: 0.618 ± 0.647
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.947 ± 2.072
IleCys: 0.0 ± 0.0
IleAsp: 0.618 ± 0.647
IleGlu: 1.237 ± 1.293
IlePhe: 1.237 ± 0.456
IleGly: 4.329 ± 1.018
IleHis: 0.618 ± 0.4
IleIle: 1.855 ± 0.563
IleLys: 2.474 ± 1.398
IleLeu: 3.092 ± 1.342
IleMet: 0.0 ± 0.0
IleAsn: 0.618 ± 0.4
IlePro: 1.237 ± 0.468
IleGln: 1.237 ± 0.468
IleArg: 1.855 ± 0.563
IleSer: 0.618 ± 0.4
IleThr: 4.329 ± 1.371
IleVal: 4.329 ± 1.921
IleTrp: 0.0 ± 0.0
IleTyr: 0.618 ± 0.4
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.658 ± 4.067
LysCys: 1.855 ± 0.904
LysAsp: 2.474 ± 0.935
LysGlu: 8.04 ± 3.107
LysPhe: 3.711 ± 0.654
LysGly: 1.855 ± 1.558
LysHis: 0.618 ± 0.4
LysIle: 2.474 ± 1.598
LysLys: 3.092 ± 0.6
LysLeu: 8.04 ± 3.037
LysMet: 1.855 ± 0.244
LysAsn: 3.092 ± 1.118
LysPro: 1.855 ± 1.085
LysGln: 3.092 ± 1.222
LysArg: 2.474 ± 1.453
LysSer: 3.092 ± 0.472
LysThr: 3.711 ± 0.456
LysVal: 3.711 ± 0.654
LysTrp: 0.0 ± 0.0
LysTyr: 2.474 ± 0.912
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.947 ± 1.64
LeuCys: 0.618 ± 0.4
LeuAsp: 1.855 ± 0.327
LeuGlu: 3.711 ± 1.403
LeuPhe: 5.566 ± 1.967
LeuGly: 7.421 ± 1.764
LeuHis: 0.618 ± 0.647
LeuIle: 3.711 ± 1.791
LeuLys: 3.092 ± 0.472
LeuLeu: 7.421 ± 1.357
LeuMet: 1.237 ± 0.799
LeuAsn: 2.474 ± 0.739
LeuPro: 4.947 ± 1.1
LeuGln: 2.474 ± 1.598
LeuArg: 3.092 ± 0.483
LeuSer: 8.04 ± 2.79
LeuThr: 6.184 ± 1.201
LeuVal: 3.711 ± 1.744
LeuTrp: 1.237 ± 0.799
LeuTyr: 1.855 ± 0.327
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.237 ± 0.799
MetCys: 1.237 ± 0.799
MetAsp: 1.237 ± 0.727
MetGlu: 0.618 ± 0.519
MetPhe: 0.0 ± 0.0
MetGly: 0.618 ± 0.647
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 2.474 ± 0.739
MetLeu: 1.237 ± 0.468
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.618 ± 0.519
MetArg: 1.855 ± 0.904
MetSer: 0.0 ± 0.0
MetThr: 2.474 ± 1.678
MetVal: 1.237 ± 0.456
MetTrp: 0.0 ± 0.0
MetTyr: 1.237 ± 0.468
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 6.184 ± 2.918
AsnCys: 0.0 ± 0.0
AsnAsp: 2.474 ± 0.072
AsnGlu: 1.237 ± 0.468
AsnPhe: 2.474 ± 0.739
AsnGly: 1.855 ± 1.274
AsnHis: 1.237 ± 0.468
AsnIle: 0.0 ± 0.0
AsnLys: 2.474 ± 0.912
AsnLeu: 2.474 ± 0.912
AsnMet: 0.0 ± 0.0
AsnAsn: 0.0 ± 0.0
AsnPro: 2.474 ± 0.072
AsnGln: 0.618 ± 0.519
AsnArg: 3.092 ± 1.547
AsnSer: 2.474 ± 0.863
AsnThr: 2.474 ± 0.072
AsnVal: 3.092 ± 1.547
AsnTrp: 1.237 ± 1.039
AsnTyr: 1.237 ± 0.456
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.803 ± 2.781
ProCys: 0.0 ± 0.0
ProAsp: 2.474 ± 0.739
ProGlu: 2.474 ± 1.036
ProPhe: 1.237 ± 0.468
ProGly: 3.092 ± 0.6
ProHis: 0.0 ± 0.0
ProIle: 2.474 ± 0.912
ProLys: 3.711 ± 0.871
ProLeu: 3.092 ± 1.118
ProMet: 1.855 ± 0.904
ProAsn: 1.237 ± 0.456
ProPro: 2.474 ± 1.036
ProGln: 1.855 ± 0.563
ProArg: 1.855 ± 0.698
ProSer: 5.566 ± 2.078
ProThr: 4.329 ± 0.255
ProVal: 3.092 ± 0.483
ProTrp: 0.0 ± 0.0
ProTyr: 1.855 ± 0.327
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.566 ± 1.16
GlnCys: 0.618 ± 0.647
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.855 ± 1.199
GlnPhe: 1.855 ± 0.327
GlnGly: 2.474 ± 1.598
GlnHis: 0.618 ± 0.4
GlnIle: 0.618 ± 0.4
GlnLys: 2.474 ± 0.912
GlnLeu: 2.474 ± 0.072
GlnMet: 0.618 ± 0.519
GlnAsn: 0.618 ± 0.647
GlnPro: 0.0 ± 0.0
GlnGln: 0.618 ± 0.4
GlnArg: 3.711 ± 0.49
GlnSer: 1.237 ± 0.456
GlnThr: 0.618 ± 0.647
GlnVal: 3.711 ± 1.6
GlnTrp: 1.237 ± 0.468
GlnTyr: 0.618 ± 0.647
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.237 ± 0.799
ArgCys: 0.0 ± 0.0
ArgAsp: 2.474 ± 1.539
ArgGlu: 1.237 ± 0.468
ArgPhe: 3.092 ± 1.407
ArgGly: 3.092 ± 1.478
ArgHis: 0.0 ± 0.0
ArgIle: 1.855 ± 1.558
ArgLys: 5.566 ± 1.16
ArgLeu: 5.566 ± 2.471
ArgMet: 0.618 ± 0.519
ArgAsn: 3.092 ± 1.547
ArgPro: 4.947 ± 0.928
ArgGln: 1.237 ± 0.456
ArgArg: 2.474 ± 1.453
ArgSer: 4.947 ± 0.68
ArgThr: 3.092 ± 1.222
ArgVal: 1.855 ± 0.698
ArgTrp: 0.618 ± 0.519
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 9.276 ± 2.831
SerCys: 1.237 ± 0.456
SerAsp: 4.329 ± 0.255
SerGlu: 4.947 ± 1.87
SerPhe: 2.474 ± 0.912
SerGly: 8.04 ± 3.188
SerHis: 3.711 ± 1.403
SerIle: 3.092 ± 1.547
SerLys: 6.184 ± 1.449
SerLeu: 7.421 ± 1.604
SerMet: 0.0 ± 0.0
SerAsn: 4.947 ± 1.381
SerPro: 5.566 ± 2.369
SerGln: 2.474 ± 1.598
SerArg: 4.329 ± 2.18
SerSer: 10.513 ± 3.74
SerThr: 9.895 ± 2.33
SerVal: 4.947 ± 1.381
SerTrp: 1.237 ± 0.799
SerTyr: 1.855 ± 0.327
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.803 ± 2.058
ThrCys: 0.618 ± 0.4
ThrAsp: 3.092 ± 1.342
ThrGlu: 4.947 ± 1.486
ThrPhe: 4.947 ± 0.909
ThrGly: 3.711 ± 1.126
ThrHis: 1.237 ± 0.799
ThrIle: 1.855 ± 0.327
ThrLys: 8.04 ± 2.505
ThrLeu: 4.947 ± 3.478
ThrMet: 0.618 ± 0.519
ThrAsn: 2.474 ± 1.678
ThrPro: 4.947 ± 1.486
ThrGln: 2.474 ± 0.912
ThrArg: 4.329 ± 0.255
ThrSer: 9.276 ± 2.649
ThrThr: 11.75 ± 2.278
ThrVal: 4.329 ± 1.402
ThrTrp: 1.237 ± 0.456
ThrTyr: 3.711 ± 2.091
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.04 ± 3.18
ValCys: 1.237 ± 0.727
ValAsp: 3.092 ± 0.472
ValGlu: 4.947 ± 1.1
ValPhe: 3.711 ± 0.456
ValGly: 6.184 ± 0.521
ValHis: 1.237 ± 1.293
ValIle: 2.474 ± 0.072
ValLys: 5.566 ± 2.07
ValLeu: 3.092 ± 0.944
ValMet: 0.0 ± 0.0
ValAsn: 3.092 ± 2.318
ValPro: 2.474 ± 1.885
ValGln: 2.474 ± 0.739
ValArg: 3.092 ± 1.054
ValSer: 8.04 ± 0.617
ValThr: 4.947 ± 0.145
ValVal: 4.329 ± 1.015
ValTrp: 0.618 ± 0.4
ValTyr: 1.855 ± 1.045
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.618 ± 0.4
TrpCys: 0.0 ± 0.0
TrpAsp: 0.618 ± 0.519
TrpGlu: 1.237 ± 1.039
TrpPhe: 0.618 ± 0.4
TrpGly: 1.237 ± 0.456
TrpHis: 0.0 ± 0.0
TrpIle: 1.237 ± 0.468
TrpLys: 1.855 ± 0.904
TrpLeu: 1.237 ± 1.039
TrpMet: 0.0 ± 0.0
TrpAsn: 1.855 ± 0.327
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.237 ± 0.456
TrpSer: 1.237 ± 0.799
TrpThr: 0.0 ± 0.0
TrpVal: 1.237 ± 0.799
TrpTrp: 1.855 ± 0.698
TrpTyr: 1.237 ± 0.799
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.711 ± 2.189
TyrCys: 0.0 ± 0.0
TyrAsp: 0.0 ± 0.0
TyrGlu: 2.474 ± 1.398
TyrPhe: 0.0 ± 0.0
TyrGly: 3.092 ± 0.944
TyrHis: 0.0 ± 0.0
TyrIle: 1.855 ± 1.94
TyrLys: 2.474 ± 0.863
TyrLeu: 0.618 ± 0.647
TyrMet: 0.0 ± 0.0
TyrAsn: 1.237 ± 0.799
TyrPro: 1.855 ± 0.904
TyrGln: 1.855 ± 0.327
TyrArg: 1.237 ± 0.468
TyrSer: 3.092 ± 1.478
TyrThr: 2.474 ± 0.863
TyrVal: 2.474 ± 2.587
TyrTrp: 0.618 ± 0.4
TyrTyr: 0.618 ± 0.4
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1618 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
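As a rough illustration of protein-level bootstrapping, the sketch below resamples whole proteins with replacement 100 times, recomputes one dipeptide's per mille frequency for each resample, and reports the spread. The names `pair_permille` and `bootstrap_error`, the toy sequences, and the use of the sample standard deviation as the error measure are all assumptions; the page only states that bootstrapping (x100) was done at the protein level.

```python
import random
from collections import Counter
from statistics import stdev


def pair_permille(sequences, pair):
    """Frequency of one dipeptide, in per mille, pooled over the sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2 within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        replicates.append(pair_permille(sample, pair))
    return stdev(replicates)  # assumption: std. dev. is the reported error


toy = ["MKTAYIAKQR", "AAXGTAA", "GGSAAK"]  # hypothetical, not the real proteome
err = bootstrap_error(toy, "AA")
```

With only a few proteins, as in this proteome, each resample can differ strongly from the original set, which is consistent with the large relative errors seen for many dipeptides in the table.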

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski