Amino acid dipepetide frequency for Hipposideros bat coronavirus HKU10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.066AlaAla: 5.066 ± 1.032
2.479AlaCys: 2.479 ± 0.516
2.587AlaAsp: 2.587 ± 0.611
2.802AlaGlu: 2.802 ± 0.79
4.527AlaPhe: 4.527 ± 1.079
3.126AlaGly: 3.126 ± 0.461
0.647AlaHis: 0.647 ± 0.341
4.958AlaIle: 4.958 ± 0.711
3.449AlaLys: 3.449 ± 1.369
5.497AlaLeu: 5.497 ± 0.831
1.401AlaMet: 1.401 ± 0.537
4.419AlaAsn: 4.419 ± 0.573
2.587AlaPro: 2.587 ± 1.017
2.479AlaGln: 2.479 ± 0.903
2.156AlaArg: 2.156 ± 0.41
3.665AlaSer: 3.665 ± 0.545
3.88AlaThr: 3.88 ± 0.757
6.251AlaVal: 6.251 ± 0.964
0.431AlaTrp: 0.431 ± 0.122
2.263AlaTyr: 2.263 ± 0.885
0.0AlaXaa: 0.0 ± 0.0
Cys
1.725CysAla: 1.725 ± 0.396
0.97CysCys: 0.97 ± 0.383
1.725CysAsp: 1.725 ± 0.49
1.293CysGlu: 1.293 ± 0.339
2.156CysPhe: 2.156 ± 0.294
1.832CysGly: 1.832 ± 0.596
0.539CysHis: 0.539 ± 0.299
1.401CysIle: 1.401 ± 0.389
1.832CysLys: 1.832 ± 0.596
2.048CysLeu: 2.048 ± 0.577
0.323CysMet: 0.323 ± 0.128
2.156CysAsn: 2.156 ± 0.622
0.97CysPro: 0.97 ± 0.275
0.97CysGln: 0.97 ± 0.588
1.186CysArg: 1.186 ± 0.299
2.695CysSer: 2.695 ± 0.719
1.94CysThr: 1.94 ± 0.534
3.449CysVal: 3.449 ± 1.172
0.431CysTrp: 0.431 ± 0.227
2.587CysTyr: 2.587 ± 1.176
0.0CysXaa: 0.0 ± 0.0
Asp
3.449AspAla: 3.449 ± 0.801
1.832AspCys: 1.832 ± 0.501
1.509AspAsp: 1.509 ± 0.462
2.802AspGlu: 2.802 ± 0.593
3.233AspPhe: 3.233 ± 0.683
4.096AspGly: 4.096 ± 1.193
0.97AspHis: 0.97 ± 0.511
3.126AspIle: 3.126 ± 0.617
2.479AspLys: 2.479 ± 0.277
4.311AspLeu: 4.311 ± 1.246
1.186AspMet: 1.186 ± 0.223
2.91AspAsn: 2.91 ± 0.807
1.509AspPro: 1.509 ± 0.376
1.617AspGln: 1.617 ± 0.813
1.078AspArg: 1.078 ± 0.378
2.91AspSer: 2.91 ± 0.531
2.371AspThr: 2.371 ± 0.506
6.144AspVal: 6.144 ± 1.022
0.647AspTrp: 0.647 ± 0.341
3.341AspTyr: 3.341 ± 0.561
0.0AspXaa: 0.0 ± 0.0
Glu
2.371GluAla: 2.371 ± 0.452
1.078GluCys: 1.078 ± 0.37
2.263GluAsp: 2.263 ± 0.676
2.048GluGlu: 2.048 ± 0.727
3.233GluPhe: 3.233 ± 1.499
2.587GluGly: 2.587 ± 0.939
1.186GluHis: 1.186 ± 0.47
2.048GluIle: 2.048 ± 0.706
2.695GluLys: 2.695 ± 0.907
3.772GluLeu: 3.772 ± 0.813
0.647GluMet: 0.647 ± 0.286
2.587GluAsn: 2.587 ± 0.38
1.832GluPro: 1.832 ± 0.499
1.617GluGln: 1.617 ± 0.3
1.832GluArg: 1.832 ± 0.408
2.479GluSer: 2.479 ± 0.884
1.725GluThr: 1.725 ± 0.628
3.557GluVal: 3.557 ± 0.863
0.754GluTrp: 0.754 ± 0.2
1.401GluTyr: 1.401 ± 0.391
0.0GluXaa: 0.0 ± 0.0
Phe
3.665PheAla: 3.665 ± 1.002
2.156PheCys: 2.156 ± 0.934
4.635PheAsp: 4.635 ± 1.227
2.587PheGlu: 2.587 ± 0.634
2.048PhePhe: 2.048 ± 0.687
3.988PheGly: 3.988 ± 1.555
0.647PheHis: 0.647 ± 0.37
2.802PheIle: 2.802 ± 0.799
3.341PheLys: 3.341 ± 0.657
3.988PheLeu: 3.988 ± 0.659
0.97PheMet: 0.97 ± 0.417
3.233PheAsn: 3.233 ± 0.63
0.97PhePro: 0.97 ± 0.703
0.97PheGln: 0.97 ± 0.724
0.97PheArg: 0.97 ± 0.491
4.527PheSer: 4.527 ± 1.672
2.695PheThr: 2.695 ± 0.627
7.868PheVal: 7.868 ± 1.467
1.401PheTrp: 1.401 ± 0.486
2.371PheTyr: 2.371 ± 0.274
0.0PheXaa: 0.0 ± 0.0
Gly
3.449GlyAla: 3.449 ± 0.852
2.156GlyCys: 2.156 ± 0.622
4.419GlyAsp: 4.419 ± 0.662
2.587GlyGlu: 2.587 ± 0.569
3.988GlyPhe: 3.988 ± 0.731
4.203GlyGly: 4.203 ± 1.047
0.647GlyHis: 0.647 ± 0.266
2.587GlyIle: 2.587 ± 0.492
4.527GlyLys: 4.527 ± 1.219
5.174GlyLeu: 5.174 ± 0.714
1.617GlyMet: 1.617 ± 0.601
2.91GlyAsn: 2.91 ± 0.375
1.293GlyPro: 1.293 ± 0.556
1.725GlyGln: 1.725 ± 0.401
1.94GlyArg: 1.94 ± 1.182
4.203GlySer: 4.203 ± 1.086
3.88GlyThr: 3.88 ± 0.521
6.682GlyVal: 6.682 ± 0.517
0.754GlyTrp: 0.754 ± 0.422
2.263GlyTyr: 2.263 ± 0.256
0.0GlyXaa: 0.0 ± 0.0
His
1.94HisAla: 1.94 ± 0.634
0.862HisCys: 0.862 ± 0.272
0.862HisAsp: 0.862 ± 0.455
0.647HisGlu: 0.647 ± 0.341
1.078HisPhe: 1.078 ± 0.737
0.647HisGly: 0.647 ± 0.178
0.323HisHis: 0.323 ± 0.47
0.647HisIle: 0.647 ± 0.256
1.401HisLys: 1.401 ± 0.522
1.509HisLeu: 1.509 ± 0.841
0.0HisMet: 0.0 ± 0.0
1.078HisAsn: 1.078 ± 0.449
0.431HisPro: 0.431 ± 0.227
0.539HisGln: 0.539 ± 0.284
0.539HisArg: 0.539 ± 0.284
1.509HisSer: 1.509 ± 0.588
1.617HisThr: 1.617 ± 0.431
2.048HisVal: 2.048 ± 0.566
0.216HisTrp: 0.216 ± 0.114
0.862HisTyr: 0.862 ± 0.386
0.0HisXaa: 0.0 ± 0.0
Ile
4.527IleAla: 4.527 ± 0.578
1.293IleCys: 1.293 ± 0.511
2.371IleAsp: 2.371 ± 0.565
2.479IleGlu: 2.479 ± 1.183
2.156IlePhe: 2.156 ± 0.848
2.263IleGly: 2.263 ± 0.475
0.539IleHis: 0.539 ± 0.284
1.401IleIle: 1.401 ± 0.61
3.772IleLys: 3.772 ± 0.586
5.174IleLeu: 5.174 ± 1.402
1.725IleMet: 1.725 ± 0.745
2.91IleAsn: 2.91 ± 0.533
2.802IlePro: 2.802 ± 0.969
1.832IleGln: 1.832 ± 0.37
1.186IleArg: 1.186 ± 0.625
3.557IleSer: 3.557 ± 0.994
4.527IleThr: 4.527 ± 1.615
5.281IleVal: 5.281 ± 1.326
0.431IleTrp: 0.431 ± 0.227
1.94IleTyr: 1.94 ± 1.013
0.0IleXaa: 0.0 ± 0.0
Lys
2.91LysAla: 2.91 ± 0.949
2.048LysCys: 2.048 ± 0.577
3.557LysAsp: 3.557 ± 1.062
2.587LysGlu: 2.587 ± 0.82
3.449LysPhe: 3.449 ± 1.226
2.587LysGly: 2.587 ± 1.095
2.695LysHis: 2.695 ± 0.99
2.587LysIle: 2.587 ± 0.629
1.401LysLys: 1.401 ± 0.465
5.066LysLeu: 5.066 ± 0.93
1.078LysMet: 1.078 ± 0.299
2.91LysAsn: 2.91 ± 1.382
3.665LysPro: 3.665 ± 0.78
3.126LysGln: 3.126 ± 0.817
1.94LysArg: 1.94 ± 0.652
3.988LysSer: 3.988 ± 1.077
4.203LysThr: 4.203 ± 1.247
5.281LysVal: 5.281 ± 1.274
0.862LysTrp: 0.862 ± 0.703
1.94LysTyr: 1.94 ± 0.441
0.0LysXaa: 0.0 ± 0.0
Leu
6.036LeuAla: 6.036 ± 1.131
2.91LeuCys: 2.91 ± 0.67
3.665LeuAsp: 3.665 ± 0.807
3.341LeuGlu: 3.341 ± 0.884
5.174LeuPhe: 5.174 ± 0.709
4.85LeuGly: 4.85 ± 1.052
2.695LeuHis: 2.695 ± 0.473
3.018LeuIle: 3.018 ± 1.231
5.712LeuLys: 5.712 ± 1.75
9.054LeuLeu: 9.054 ± 2.738
1.725LeuMet: 1.725 ± 1.113
5.82LeuAsn: 5.82 ± 1.67
3.988LeuPro: 3.988 ± 1.545
4.419LeuGln: 4.419 ± 0.78
2.263LeuArg: 2.263 ± 0.567
7.114LeuSer: 7.114 ± 0.837
4.742LeuThr: 4.742 ± 0.972
5.712LeuVal: 5.712 ± 2.154
1.401LeuTrp: 1.401 ± 0.967
4.527LeuTyr: 4.527 ± 1.134
0.0LeuXaa: 0.0 ± 0.0
Met
1.293MetAla: 1.293 ± 0.406
1.401MetCys: 1.401 ± 0.586
0.754MetAsp: 0.754 ± 0.398
0.431MetGlu: 0.431 ± 0.227
1.401MetPhe: 1.401 ± 0.822
1.509MetGly: 1.509 ± 0.288
0.431MetHis: 0.431 ± 0.122
1.293MetIle: 1.293 ± 0.571
0.754MetLys: 0.754 ± 0.336
2.156MetLeu: 2.156 ± 0.755
0.539MetMet: 0.539 ± 0.299
0.323MetAsn: 0.323 ± 0.17
0.754MetPro: 0.754 ± 0.2
0.754MetGln: 0.754 ± 0.345
1.078MetArg: 1.078 ± 0.347
1.832MetSer: 1.832 ± 0.9
1.401MetThr: 1.401 ± 0.848
1.617MetVal: 1.617 ± 0.48
0.108MetTrp: 0.108 ± 0.057
1.617MetTyr: 1.617 ± 0.727
0.0MetXaa: 0.0 ± 0.0
Asn
3.341AsnAla: 3.341 ± 1.639
2.263AsnCys: 2.263 ± 0.795
2.371AsnAsp: 2.371 ± 0.405
2.479AsnGlu: 2.479 ± 0.328
3.018AsnPhe: 3.018 ± 1.161
6.036AsnGly: 6.036 ± 1.377
0.539AsnHis: 0.539 ± 0.284
3.018AsnIle: 3.018 ± 0.975
3.665AsnLys: 3.665 ± 0.675
4.419AsnLeu: 4.419 ± 1.569
1.293AsnMet: 1.293 ± 0.358
4.203AsnAsn: 4.203 ± 1.843
1.293AsnPro: 1.293 ± 0.28
1.94AsnGln: 1.94 ± 1.165
1.509AsnArg: 1.509 ± 0.426
4.742AsnSer: 4.742 ± 1.473
3.988AsnThr: 3.988 ± 1.196
6.144AsnVal: 6.144 ± 1.107
0.862AsnTrp: 0.862 ± 1.054
1.617AsnTyr: 1.617 ± 0.425
0.0AsnXaa: 0.0 ± 0.0
Pro
2.371ProAla: 2.371 ± 0.47
0.862ProCys: 0.862 ± 0.262
1.617ProAsp: 1.617 ± 0.421
2.048ProGlu: 2.048 ± 0.54
1.617ProPhe: 1.617 ± 0.641
2.048ProGly: 2.048 ± 0.566
0.647ProHis: 0.647 ± 0.209
1.94ProIle: 1.94 ± 0.348
1.725ProLys: 1.725 ± 0.419
4.203ProLeu: 4.203 ± 1.397
0.539ProMet: 0.539 ± 0.284
1.186ProAsn: 1.186 ± 0.614
1.293ProPro: 1.293 ± 0.434
1.725ProGln: 1.725 ± 0.912
1.94ProArg: 1.94 ± 1.395
3.341ProSer: 3.341 ± 1.034
2.371ProThr: 2.371 ± 1.691
3.557ProVal: 3.557 ± 0.905
0.431ProTrp: 0.431 ± 0.122
1.293ProTyr: 1.293 ± 0.59
0.0ProXaa: 0.0 ± 0.0
Gln
3.126GlnAla: 3.126 ± 0.738
0.431GlnCys: 0.431 ± 0.227
1.293GlnAsp: 1.293 ± 0.394
1.186GlnGlu: 1.186 ± 0.551
1.186GlnPhe: 1.186 ± 0.299
1.94GlnGly: 1.94 ± 0.358
0.539GlnHis: 0.539 ± 0.508
1.725GlnIle: 1.725 ± 0.707
1.725GlnLys: 1.725 ± 0.815
4.419GlnLeu: 4.419 ± 0.555
0.754GlnMet: 0.754 ± 0.231
1.725GlnAsn: 1.725 ± 0.95
2.587GlnPro: 2.587 ± 0.987
1.617GlnGln: 1.617 ± 0.668
2.156GlnArg: 2.156 ± 0.661
2.479GlnSer: 2.479 ± 1.017
2.479GlnThr: 2.479 ± 0.418
2.587GlnVal: 2.587 ± 1.062
0.216GlnTrp: 0.216 ± 0.155
1.509GlnTyr: 1.509 ± 0.282
0.0GlnXaa: 0.0 ± 0.0
Arg
2.695ArgAla: 2.695 ± 0.336
1.401ArgCys: 1.401 ± 0.399
1.293ArgAsp: 1.293 ± 0.525
0.97ArgGlu: 0.97 ± 0.402
2.263ArgPhe: 2.263 ± 0.598
2.263ArgGly: 2.263 ± 0.34
0.539ArgHis: 0.539 ± 0.282
2.695ArgIle: 2.695 ± 1.016
1.725ArgLys: 1.725 ± 0.804
3.018ArgLeu: 3.018 ± 1.099
1.293ArgMet: 1.293 ± 0.541
1.94ArgAsn: 1.94 ± 0.466
1.293ArgPro: 1.293 ± 1.382
1.078ArgGln: 1.078 ± 0.816
1.617ArgArg: 1.617 ± 0.541
1.725ArgSer: 1.725 ± 1.611
1.725ArgThr: 1.725 ± 0.457
2.695ArgVal: 2.695 ± 0.529
0.431ArgTrp: 0.431 ± 0.365
1.832ArgTyr: 1.832 ± 0.246
0.0ArgXaa: 0.0 ± 0.0
Ser
4.958SerAla: 4.958 ± 0.804
1.401SerCys: 1.401 ± 0.535
4.419SerAsp: 4.419 ± 0.805
2.263SerGlu: 2.263 ± 0.389
5.066SerPhe: 5.066 ± 0.573
4.635SerGly: 4.635 ± 0.947
1.078SerHis: 1.078 ± 0.315
4.311SerIle: 4.311 ± 1.151
3.449SerLys: 3.449 ± 1.062
4.958SerLeu: 4.958 ± 1.227
1.617SerMet: 1.617 ± 0.697
4.742SerAsn: 4.742 ± 2.404
2.048SerPro: 2.048 ± 0.683
2.263SerGln: 2.263 ± 0.823
3.449SerArg: 3.449 ± 2.504
4.635SerSer: 4.635 ± 1.784
4.85SerThr: 4.85 ± 0.527
6.575SerVal: 6.575 ± 0.842
1.078SerTrp: 1.078 ± 0.543
3.88SerTyr: 3.88 ± 0.663
0.0SerXaa: 0.0 ± 0.0
Thr
3.341ThrAla: 3.341 ± 0.77
1.617ThrCys: 1.617 ± 0.358
2.587ThrAsp: 2.587 ± 0.284
2.048ThrGlu: 2.048 ± 1.124
3.126ThrPhe: 3.126 ± 0.841
4.203ThrGly: 4.203 ± 1.12
1.293ThrHis: 1.293 ± 0.356
3.772ThrIle: 3.772 ± 0.989
3.449ThrLys: 3.449 ± 0.828
7.006ThrLeu: 7.006 ± 1.116
2.156ThrMet: 2.156 ± 0.624
3.772ThrAsn: 3.772 ± 0.943
2.371ThrPro: 2.371 ± 0.859
1.725ThrGln: 1.725 ± 0.58
2.156ThrArg: 2.156 ± 0.506
5.281ThrSer: 5.281 ± 0.732
4.419ThrThr: 4.419 ± 0.73
5.497ThrVal: 5.497 ± 0.832
0.323ThrTrp: 0.323 ± 0.17
2.263ThrTyr: 2.263 ± 0.948
0.0ThrXaa: 0.0 ± 0.0
Val
4.958ValAla: 4.958 ± 0.658
3.557ValCys: 3.557 ± 0.978
5.497ValAsp: 5.497 ± 0.793
4.742ValGlu: 4.742 ± 1.045
4.311ValPhe: 4.311 ± 1.03
4.742ValGly: 4.742 ± 1.224
1.725ValHis: 1.725 ± 0.527
5.82ValIle: 5.82 ± 0.42
7.114ValLys: 7.114 ± 1.766
7.976ValLeu: 7.976 ± 2.412
1.832ValMet: 1.832 ± 0.805
5.389ValAsn: 5.389 ± 0.63
3.126ValPro: 3.126 ± 1.084
4.203ValGln: 4.203 ± 0.839
3.341ValArg: 3.341 ± 0.752
6.79ValSer: 6.79 ± 1.187
6.144ValThr: 6.144 ± 0.857
9.916ValVal: 9.916 ± 1.687
0.862ValTrp: 0.862 ± 0.272
4.096ValTyr: 4.096 ± 0.66
0.0ValXaa: 0.0 ± 0.0
Trp
0.754TrpAla: 0.754 ± 0.695
0.216TrpCys: 0.216 ± 0.114
0.97TrpAsp: 0.97 ± 0.511
0.323TrpGlu: 0.323 ± 0.17
0.647TrpPhe: 0.647 ± 0.178
0.216TrpGly: 0.216 ± 0.278
0.216TrpHis: 0.216 ± 0.256
0.539TrpIle: 0.539 ± 0.302
0.539TrpLys: 0.539 ± 0.497
1.509TrpLeu: 1.509 ± 0.653
0.108TrpMet: 0.108 ± 0.057
1.078TrpAsn: 1.078 ± 0.78
0.539TrpPro: 0.539 ± 0.446
0.108TrpGln: 0.108 ± 0.057
0.647TrpArg: 0.647 ± 0.209
1.186TrpSer: 1.186 ± 0.547
0.647TrpThr: 0.647 ± 0.266
1.078TrpVal: 1.078 ± 0.388
0.323TrpTrp: 0.323 ± 0.17
0.862TrpTyr: 0.862 ± 0.229
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.695TyrAla: 2.695 ± 0.418
1.293TyrCys: 1.293 ± 0.356
3.126TyrAsp: 3.126 ± 1.262
2.048TyrGlu: 2.048 ± 0.6
2.156TyrPhe: 2.156 ± 0.597
3.233TyrGly: 3.233 ± 0.599
0.862TyrHis: 0.862 ± 0.272
2.479TyrIle: 2.479 ± 1.298
3.126TyrLys: 3.126 ± 0.572
2.91TyrLeu: 2.91 ± 0.372
0.539TyrMet: 0.539 ± 0.142
3.557TyrAsn: 3.557 ± 0.95
1.401TyrPro: 1.401 ± 0.391
0.97TyrGln: 0.97 ± 0.516
1.509TyrArg: 1.509 ± 0.617
3.018TyrSer: 3.018 ± 0.546
2.802TyrThr: 2.802 ± 0.887
4.203TyrVal: 4.203 ± 1.12
0.539TyrTrp: 0.539 ± 0.433
3.772TyrTyr: 3.772 ± 0.59
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (9279 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski