Amino acid dipepetide frequency for Hubei virga-like virus 21

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.491AlaAla: 7.491 ± 1.247
2.185AlaCys: 2.185 ± 0.607
3.745AlaAsp: 3.745 ± 1.793
2.497AlaGlu: 2.497 ± 0.869
2.497AlaPhe: 2.497 ± 0.819
5.618AlaGly: 5.618 ± 1.767
1.561AlaHis: 1.561 ± 1.022
2.809AlaIle: 2.809 ± 0.801
1.873AlaLys: 1.873 ± 0.896
8.427AlaLeu: 8.427 ± 2.418
1.561AlaMet: 1.561 ± 0.727
1.561AlaAsn: 1.561 ± 1.546
2.497AlaPro: 2.497 ± 0.869
3.121AlaGln: 3.121 ± 0.998
4.994AlaArg: 4.994 ± 1.546
4.37AlaSer: 4.37 ± 1.009
4.994AlaThr: 4.994 ± 1.937
7.803AlaVal: 7.803 ± 2.381
0.0AlaTrp: 0.0 ± 0.0
3.121AlaTyr: 3.121 ± 1.417
0.0AlaXaa: 0.0 ± 0.0
Cys
1.561CysAla: 1.561 ± 0.727
0.624CysCys: 0.624 ± 0.466
1.248CysAsp: 1.248 ± 0.598
0.624CysGlu: 0.624 ± 0.466
0.936CysPhe: 0.936 ± 0.598
1.561CysGly: 1.561 ± 0.854
0.0CysHis: 0.0 ± 0.0
0.624CysIle: 0.624 ± 1.104
0.312CysLys: 0.312 ± 0.149
1.873CysLeu: 1.873 ± 0.499
0.624CysMet: 0.624 ± 0.299
0.624CysAsn: 0.624 ± 0.299
2.185CysPro: 2.185 ± 0.607
0.624CysGln: 0.624 ± 0.299
2.185CysArg: 2.185 ± 1.317
1.248CysSer: 1.248 ± 0.835
1.248CysThr: 1.248 ± 0.932
1.561CysVal: 1.561 ± 0.747
0.0CysTrp: 0.0 ± 0.0
0.936CysTyr: 0.936 ± 0.398
0.0CysXaa: 0.0 ± 0.0
Asp
4.682AspAla: 4.682 ± 0.86
0.624AspCys: 0.624 ± 0.299
4.37AspAsp: 4.37 ± 0.796
0.936AspGlu: 0.936 ± 0.448
3.121AspPhe: 3.121 ± 0.835
2.809AspGly: 2.809 ± 1.169
1.561AspHis: 1.561 ± 0.417
5.618AspIle: 5.618 ± 0.848
4.682AspLys: 4.682 ± 1.209
6.866AspLeu: 6.866 ± 1.666
0.624AspMet: 0.624 ± 0.299
1.873AspAsn: 1.873 ± 0.896
4.057AspPro: 4.057 ± 1.423
1.248AspGln: 1.248 ± 0.598
4.057AspArg: 4.057 ± 1.138
2.809AspSer: 2.809 ± 1.299
2.809AspThr: 2.809 ± 1.345
5.618AspVal: 5.618 ± 1.153
0.312AspTrp: 0.312 ± 0.149
2.497AspTyr: 2.497 ± 0.539
0.0AspXaa: 0.0 ± 0.0
Glu
2.185GluAla: 2.185 ± 0.607
0.312GluCys: 0.312 ± 0.149
2.497GluAsp: 2.497 ± 1.67
2.185GluGlu: 2.185 ± 0.674
4.682GluPhe: 4.682 ± 1.514
2.497GluGly: 2.497 ± 0.759
1.561GluHis: 1.561 ± 0.732
2.809GluIle: 2.809 ± 0.548
3.745GluLys: 3.745 ± 1.591
5.618GluLeu: 5.618 ± 1.813
1.561GluMet: 1.561 ± 0.713
1.873GluAsn: 1.873 ± 0.602
1.561GluPro: 1.561 ± 0.854
2.185GluGln: 2.185 ± 1.046
4.057GluArg: 4.057 ± 0.886
4.057GluSer: 4.057 ± 0.976
3.121GluThr: 3.121 ± 0.531
4.057GluVal: 4.057 ± 1.095
0.624GluTrp: 0.624 ± 0.466
2.497GluTyr: 2.497 ± 0.73
0.0GluXaa: 0.0 ± 0.0
Phe
2.809PheAla: 2.809 ± 0.548
1.873PheCys: 1.873 ± 0.896
3.433PheAsp: 3.433 ± 1.18
1.248PheGlu: 1.248 ± 0.932
1.248PhePhe: 1.248 ± 0.598
3.121PheGly: 3.121 ± 1.177
0.936PheHis: 0.936 ± 0.398
4.057PheIle: 4.057 ± 2.51
3.121PheLys: 3.121 ± 1.494
4.994PheLeu: 4.994 ± 2.14
0.624PheMet: 0.624 ± 0.466
1.248PheAsn: 1.248 ± 0.598
1.873PhePro: 1.873 ± 0.896
0.936PheGln: 0.936 ± 0.448
4.057PheArg: 4.057 ± 1.401
2.809PheSer: 2.809 ± 0.783
1.561PheThr: 1.561 ± 0.563
4.37PheVal: 4.37 ± 2.368
0.0PheTrp: 0.0 ± 0.0
1.873PheTyr: 1.873 ± 1.623
0.0PheXaa: 0.0 ± 0.0
Gly
4.682GlyAla: 4.682 ± 2.958
0.312GlyCys: 0.312 ± 0.149
4.37GlyAsp: 4.37 ± 1.214
4.682GlyGlu: 4.682 ± 1.514
1.873GlyPhe: 1.873 ± 0.627
5.618GlyGly: 5.618 ± 2.689
0.936GlyHis: 0.936 ± 1.027
1.873GlyIle: 1.873 ± 0.896
4.994GlyLys: 4.994 ± 1.217
3.745GlyLeu: 3.745 ± 1.279
1.561GlyMet: 1.561 ± 0.581
0.936GlyAsn: 0.936 ± 0.448
1.561GlyPro: 1.561 ± 0.732
1.873GlyGln: 1.873 ± 0.499
2.809GlyArg: 2.809 ± 1.345
2.809GlySer: 2.809 ± 0.801
2.497GlyThr: 2.497 ± 0.759
4.682GlyVal: 4.682 ± 1.099
0.312GlyTrp: 0.312 ± 0.567
2.497GlyTyr: 2.497 ± 0.73
0.0GlyXaa: 0.0 ± 0.0
His
2.497HisAla: 2.497 ± 1.195
0.312HisCys: 0.312 ± 0.149
0.624HisAsp: 0.624 ± 0.299
1.561HisGlu: 1.561 ± 0.727
0.936HisPhe: 0.936 ± 0.448
0.624HisGly: 0.624 ± 0.299
0.624HisHis: 0.624 ± 0.466
1.873HisIle: 1.873 ± 0.753
1.873HisLys: 1.873 ± 0.627
1.873HisLeu: 1.873 ± 0.499
1.248HisMet: 1.248 ± 0.379
0.936HisAsn: 0.936 ± 0.448
0.936HisPro: 0.936 ± 1.565
0.624HisGln: 0.624 ± 0.299
1.561HisArg: 1.561 ± 0.417
2.497HisSer: 2.497 ± 0.539
1.561HisThr: 1.561 ± 0.747
2.185HisVal: 2.185 ± 0.607
0.0HisTrp: 0.0 ± 0.0
1.248HisTyr: 1.248 ± 0.598
0.0HisXaa: 0.0 ± 0.0
Ile
5.618IleAla: 5.618 ± 1.548
1.561IleCys: 1.561 ± 1.491
3.433IleAsp: 3.433 ± 0.908
4.682IleGlu: 4.682 ± 0.614
1.873IlePhe: 1.873 ± 1.691
1.561IleGly: 1.561 ± 0.563
1.873IleHis: 1.873 ± 0.896
2.809IleIle: 2.809 ± 0.493
3.121IleLys: 3.121 ± 1.293
6.242IleLeu: 6.242 ± 1.227
1.248IleMet: 1.248 ± 0.598
1.873IleAsn: 1.873 ± 0.753
3.433IlePro: 3.433 ± 0.605
0.936IleGln: 0.936 ± 0.954
1.873IleArg: 1.873 ± 0.896
2.497IleSer: 2.497 ± 0.768
3.745IleThr: 3.745 ± 0.453
5.306IleVal: 5.306 ± 1.46
0.0IleTrp: 0.0 ± 0.0
1.561IleTyr: 1.561 ± 1.022
0.0IleXaa: 0.0 ± 0.0
Lys
4.682LysAla: 4.682 ± 0.294
1.561LysCys: 1.561 ± 1.279
2.809LysAsp: 2.809 ± 1.539
1.248LysGlu: 1.248 ± 0.598
4.057LysPhe: 4.057 ± 0.976
2.185LysGly: 2.185 ± 0.802
0.0LysHis: 0.0 ± 0.0
3.745LysIle: 3.745 ± 1.279
1.561LysLys: 1.561 ± 0.854
4.37LysLeu: 4.37 ± 1.092
1.248LysMet: 1.248 ± 0.561
0.936LysAsn: 0.936 ± 0.448
1.873LysPro: 1.873 ± 0.896
3.121LysGln: 3.121 ± 0.603
2.809LysArg: 2.809 ± 0.964
4.057LysSer: 4.057 ± 1.423
3.433LysThr: 3.433 ± 0.525
3.433LysVal: 3.433 ± 0.885
0.936LysTrp: 0.936 ± 0.448
2.809LysTyr: 2.809 ± 0.861
0.0LysXaa: 0.0 ± 0.0
Leu
6.242LeuAla: 6.242 ± 1.825
2.185LeuCys: 2.185 ± 0.607
5.306LeuAsp: 5.306 ± 1.08
9.363LeuGlu: 9.363 ± 3.345
4.994LeuPhe: 4.994 ± 0.89
4.994LeuGly: 4.994 ± 1.826
2.185LeuHis: 2.185 ± 0.607
3.121LeuIle: 3.121 ± 1.893
4.682LeuLys: 4.682 ± 0.738
10.3LeuLeu: 10.3 ± 3.185
3.121LeuMet: 3.121 ± 0.998
3.433LeuAsn: 3.433 ± 0.605
5.93LeuPro: 5.93 ± 2.182
3.433LeuGln: 3.433 ± 0.525
5.93LeuArg: 5.93 ± 1.303
4.682LeuSer: 4.682 ± 1.06
4.994LeuThr: 4.994 ± 0.155
8.427LeuVal: 8.427 ± 5.4
0.936LeuTrp: 0.936 ± 0.398
3.121LeuTyr: 3.121 ± 1.459
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.624MetCys: 0.624 ± 0.299
1.873MetAsp: 1.873 ± 0.499
0.936MetGlu: 0.936 ± 0.448
1.248MetPhe: 1.248 ± 0.379
0.0MetGly: 0.0 ± 0.0
0.624MetHis: 0.624 ± 0.299
1.248MetIle: 1.248 ± 0.598
1.561MetLys: 1.561 ± 0.417
2.497MetLeu: 2.497 ± 0.73
0.936MetMet: 0.936 ± 0.465
0.936MetAsn: 0.936 ± 0.448
0.624MetPro: 0.624 ± 0.299
1.248MetGln: 1.248 ± 0.598
1.873MetArg: 1.873 ± 0.602
2.809MetSer: 2.809 ± 0.548
2.185MetThr: 2.185 ± 1.046
1.561MetVal: 1.561 ± 1.729
0.0MetTrp: 0.0 ± 0.0
0.312MetTyr: 0.312 ± 0.149
0.0MetXaa: 0.0 ± 0.0
Asn
2.185AsnAla: 2.185 ± 1.046
0.312AsnCys: 0.312 ± 0.149
2.497AsnAsp: 2.497 ± 1.195
2.497AsnGlu: 2.497 ± 0.539
0.312AsnPhe: 0.312 ± 0.149
1.873AsnGly: 1.873 ± 0.753
0.624AsnHis: 0.624 ± 0.299
1.248AsnIle: 1.248 ± 0.835
2.185AsnLys: 2.185 ± 1.046
1.873AsnLeu: 1.873 ± 0.636
0.624AsnMet: 0.624 ± 0.299
1.248AsnAsn: 1.248 ± 0.598
0.624AsnPro: 0.624 ± 0.842
1.248AsnGln: 1.248 ± 0.598
0.624AsnArg: 0.624 ± 0.299
3.745AsnSer: 3.745 ± 1.254
1.561AsnThr: 1.561 ± 0.747
2.809AsnVal: 2.809 ± 0.878
0.936AsnTrp: 0.936 ± 0.398
0.624AsnTyr: 0.624 ± 0.299
0.0AsnXaa: 0.0 ± 0.0
Pro
3.433ProAla: 3.433 ± 2.642
0.0ProCys: 0.0 ± 0.0
3.121ProAsp: 3.121 ± 0.835
2.185ProGlu: 2.185 ± 0.763
2.185ProPhe: 2.185 ± 0.607
3.745ProGly: 3.745 ± 1.138
0.936ProHis: 0.936 ± 0.448
1.248ProIle: 1.248 ± 0.598
2.185ProLys: 2.185 ± 1.046
4.057ProLeu: 4.057 ± 0.588
2.185ProMet: 2.185 ± 0.469
0.936ProAsn: 0.936 ± 0.448
3.121ProPro: 3.121 ± 1.125
0.624ProGln: 0.624 ± 0.299
2.497ProArg: 2.497 ± 0.768
2.809ProSer: 2.809 ± 0.493
2.497ProThr: 2.497 ± 0.874
4.682ProVal: 4.682 ± 1.035
0.312ProTrp: 0.312 ± 0.567
1.248ProTyr: 1.248 ± 0.379
0.0ProXaa: 0.0 ± 0.0
Gln
2.185GlnAla: 2.185 ± 0.674
0.312GlnCys: 0.312 ± 0.149
2.809GlnAsp: 2.809 ± 0.861
0.936GlnGlu: 0.936 ± 0.779
1.873GlnPhe: 1.873 ± 0.896
1.873GlnGly: 1.873 ± 0.602
1.561GlnHis: 1.561 ± 0.417
2.185GlnIle: 2.185 ± 1.046
1.873GlnLys: 1.873 ± 0.602
2.809GlnLeu: 2.809 ± 1.781
0.312GlnMet: 0.312 ± 0.149
0.312GlnAsn: 0.312 ± 0.149
1.561GlnPro: 1.561 ± 0.417
1.248GlnGln: 1.248 ± 0.598
3.433GlnArg: 3.433 ± 0.525
1.561GlnSer: 1.561 ± 0.747
1.248GlnThr: 1.248 ± 0.598
2.185GlnVal: 2.185 ± 0.763
0.624GlnTrp: 0.624 ± 0.299
3.121GlnTyr: 3.121 ± 0.595
0.0GlnXaa: 0.0 ± 0.0
Arg
3.745ArgAla: 3.745 ± 1.383
0.312ArgCys: 0.312 ± 0.149
3.745ArgAsp: 3.745 ± 0.773
4.37ArgGlu: 4.37 ± 1.348
2.809ArgPhe: 2.809 ± 1.169
2.809ArgGly: 2.809 ± 2.126
2.497ArgHis: 2.497 ± 0.768
4.682ArgIle: 4.682 ± 0.738
3.121ArgLys: 3.121 ± 1.105
6.554ArgLeu: 6.554 ± 1.296
1.561ArgMet: 1.561 ± 0.747
3.121ArgAsn: 3.121 ± 1.067
3.433ArgPro: 3.433 ± 0.673
1.873ArgGln: 1.873 ± 0.796
5.618ArgArg: 5.618 ± 1.743
5.306ArgSer: 5.306 ± 1.08
4.37ArgThr: 4.37 ± 1.633
4.682ArgVal: 4.682 ± 1.173
1.248ArgTrp: 1.248 ± 0.741
1.561ArgTyr: 1.561 ± 0.854
0.0ArgXaa: 0.0 ± 0.0
Ser
3.121SerAla: 3.121 ± 0.625
3.121SerCys: 3.121 ± 1.707
3.121SerAsp: 3.121 ± 0.595
3.745SerGlu: 3.745 ± 1.279
2.497SerPhe: 2.497 ± 1.332
3.745SerGly: 3.745 ± 1.138
1.873SerHis: 1.873 ± 0.602
3.745SerIle: 3.745 ± 1.196
2.809SerLys: 2.809 ± 0.493
9.051SerLeu: 9.051 ± 3.95
1.561SerMet: 1.561 ± 0.747
1.873SerAsn: 1.873 ± 0.636
1.248SerPro: 1.248 ± 1.334
1.561SerGln: 1.561 ± 0.417
5.618SerArg: 5.618 ± 0.808
4.057SerSer: 4.057 ± 1.423
2.497SerThr: 2.497 ± 0.874
7.803SerVal: 7.803 ± 1.518
0.312SerTrp: 0.312 ± 0.149
3.121SerTyr: 3.121 ± 1.494
0.0SerXaa: 0.0 ± 0.0
Thr
3.745ThrAla: 3.745 ± 1.383
1.561ThrCys: 1.561 ± 0.727
2.497ThrAsp: 2.497 ± 0.539
2.497ThrGlu: 2.497 ± 0.73
2.809ThrPhe: 2.809 ± 0.861
4.057ThrGly: 4.057 ± 1.549
1.873ThrHis: 1.873 ± 0.499
2.809ThrIle: 2.809 ± 0.878
3.745ThrLys: 3.745 ± 1.205
2.809ThrLeu: 2.809 ± 0.493
0.312ThrMet: 0.312 ± 0.149
2.185ThrAsn: 2.185 ± 0.607
1.873ThrPro: 1.873 ± 0.499
2.809ThrGln: 2.809 ± 0.548
3.745ThrArg: 3.745 ± 0.998
3.121ThrSer: 3.121 ± 0.807
4.37ThrThr: 4.37 ± 0.941
6.866ThrVal: 6.866 ± 1.233
0.0ThrTrp: 0.0 ± 0.0
3.433ThrTyr: 3.433 ± 0.908
0.0ThrXaa: 0.0 ± 0.0
Val
8.739ValAla: 8.739 ± 7.125
1.873ValCys: 1.873 ± 0.636
4.057ValAsp: 4.057 ± 1.138
5.306ValGlu: 5.306 ± 1.453
4.37ValPhe: 4.37 ± 4.95
3.433ValGly: 3.433 ± 1.644
2.497ValHis: 2.497 ± 0.73
4.37ValIle: 4.37 ± 0.789
1.561ValLys: 1.561 ± 0.417
9.363ValLeu: 9.363 ± 5.305
0.312ValMet: 0.312 ± 0.149
2.809ValAsn: 2.809 ± 0.704
3.745ValPro: 3.745 ± 1.257
3.745ValGln: 3.745 ± 0.998
6.554ValArg: 6.554 ± 2.79
6.554ValSer: 6.554 ± 0.734
6.554ValThr: 6.554 ± 1.822
8.739ValVal: 8.739 ± 6.305
0.312ValTrp: 0.312 ± 0.567
4.37ValTyr: 4.37 ± 1.043
0.0ValXaa: 0.0 ± 0.0
Trp
0.624TrpAla: 0.624 ± 0.466
0.312TrpCys: 0.312 ± 0.567
0.624TrpAsp: 0.624 ± 0.299
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.312TrpGly: 0.312 ± 0.149
0.312TrpHis: 0.312 ± 0.149
0.312TrpIle: 0.312 ± 0.149
0.312TrpLys: 0.312 ± 0.149
1.248TrpLeu: 1.248 ± 0.932
0.312TrpMet: 0.312 ± 0.149
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.248TrpSer: 1.248 ± 0.846
0.312TrpThr: 0.312 ± 0.567
0.312TrpVal: 0.312 ± 0.149
0.312TrpTrp: 0.312 ± 0.149
1.248TrpTyr: 1.248 ± 0.598
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.497TyrAla: 2.497 ± 0.73
0.624TyrCys: 0.624 ± 0.466
4.994TyrAsp: 4.994 ± 1.136
2.185TyrGlu: 2.185 ± 1.046
1.873TyrPhe: 1.873 ± 0.636
2.497TyrGly: 2.497 ± 1.195
1.561TyrHis: 1.561 ± 0.732
4.057TyrIle: 4.057 ± 1.942
1.561TyrLys: 1.561 ± 1.022
2.809TyrLeu: 2.809 ± 0.493
1.248TyrMet: 1.248 ± 0.598
0.936TyrAsn: 0.936 ± 0.448
1.873TyrPro: 1.873 ± 0.499
1.561TyrGln: 1.561 ± 0.563
3.121TyrArg: 3.121 ± 1.156
3.433TyrSer: 3.433 ± 2.148
1.561TyrThr: 1.561 ± 0.73
2.185TyrVal: 2.185 ± 1.481
0.624TyrTrp: 0.624 ± 0.299
1.873TyrTyr: 1.873 ± 0.602
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3205 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski