Amino acid dipeptide frequency for Rhinolophus bat coronavirus HKU2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
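
As an illustration of the definitions above, the following is a minimal Python sketch of how dipeptide frequencies in per mille could be computed and compared with the uniform expectation of 2.5‰. It is not the code used to build this table: the function names, the one-letter residue codes, the toy sequences, and the choice to count overlapping pairs within each protein only are all assumptions made for the example.

```python
from collections import Counter

# Uniform expectation for 400 possible dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are overlapping adjacent residue pairs, counted
    within each protein (pairs never span two proteins).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input in one-letter code (hypothetical, not real HKU2 sequences).
toy = ["AAVVL", "VVA"]
freqs = dipeptide_per_mille(toy)
```

With the values in the table, `representation(5.573)` labels AlaAla as over-represented and `representation(0.929)` labels AlaTrp as under-represented relative to 2.5‰.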

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.573 ± 0.709
AlaCys: 3.019 ± 0.75
AlaAsp: 3.56 ± 0.556
AlaGlu: 1.935 ± 0.399
AlaPhe: 4.799 ± 0.597
AlaGly: 3.793 ± 0.362
AlaHis: 1.935 ± 0.29
AlaIle: 3.56 ± 0.738
AlaLys: 4.102 ± 0.425
AlaLeu: 6.269 ± 0.294
AlaMet: 2.012 ± 0.259
AlaAsn: 3.638 ± 0.623
AlaPro: 2.632 ± 1.19
AlaGln: 2.012 ± 0.526
AlaArg: 2.632 ± 0.673
AlaSer: 5.495 ± 0.651
AlaThr: 3.947 ± 0.481
AlaVal: 7.276 ± 0.799
AlaTrp: 0.929 ± 0.213
AlaTyr: 2.709 ± 0.582
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.471 ± 0.33
CysCys: 1.161 ± 0.446
CysAsp: 2.399 ± 0.498
CysGlu: 0.697 ± 0.302
CysPhe: 2.245 ± 0.187
CysGly: 3.406 ± 0.537
CysHis: 0.387 ± 0.269
CysIle: 1.393 ± 0.21
CysLys: 2.864 ± 0.618
CysLeu: 2.554 ± 0.312
CysMet: 0.619 ± 0.238
CysAsn: 2.09 ± 0.378
CysPro: 1.238 ± 0.276
CysGln: 0.387 ± 0.158
CysArg: 1.316 ± 0.18
CysSer: 1.393 ± 0.442
CysThr: 1.625 ± 0.505
CysVal: 3.328 ± 0.762
CysTrp: 0.697 ± 0.225
CysTyr: 2.709 ± 0.551
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.495 ± 0.991
AspCys: 2.012 ± 0.324
AspAsp: 3.328 ± 0.661
AspGlu: 2.322 ± 0.252
AspPhe: 4.489 ± 0.502
AspGly: 5.805 ± 0.342
AspHis: 1.471 ± 0.354
AspIle: 3.251 ± 0.616
AspLys: 3.406 ± 0.781
AspLeu: 3.715 ± 0.595
AspMet: 1.316 ± 0.309
AspAsn: 3.173 ± 0.765
AspPro: 1.238 ± 0.403
AspGln: 1.238 ± 0.313
AspArg: 1.548 ± 0.225
AspSer: 4.18 ± 0.802
AspThr: 3.251 ± 0.697
AspVal: 5.573 ± 1.057
AspTrp: 0.774 ± 0.174
AspTyr: 2.709 ± 0.443
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.09 ± 0.512
GluCys: 1.703 ± 0.369
GluAsp: 1.858 ± 0.367
GluGlu: 1.625 ± 0.486
GluPhe: 2.399 ± 0.385
GluGly: 2.399 ± 0.545
GluHis: 1.161 ± 0.395
GluIle: 2.786 ± 0.467
GluLys: 2.245 ± 0.471
GluLeu: 3.56 ± 0.502
GluMet: 0.155 ± 0.177
GluAsn: 1.316 ± 0.243
GluPro: 1.625 ± 0.301
GluGln: 1.084 ± 0.561
GluArg: 2.245 ± 0.458
GluSer: 1.78 ± 0.467
GluThr: 1.935 ± 0.764
GluVal: 3.019 ± 0.429
GluTrp: 1.084 ± 0.313
GluTyr: 1.858 ± 0.387
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.793 ± 0.72
PheCys: 1.935 ± 0.363
PheAsp: 5.65 ± 0.962
PheGlu: 3.328 ± 0.532
PhePhe: 3.173 ± 1.099
PheGly: 5.341 ± 0.672
PheHis: 0.464 ± 0.147
PheIle: 3.56 ± 0.527
PheLys: 3.483 ± 0.519
PheLeu: 5.108 ± 0.867
PheMet: 0.619 ± 0.191
PheAsn: 3.483 ± 0.744
PhePro: 0.619 ± 0.495
PheGln: 1.161 ± 0.723
PheArg: 2.012 ± 0.437
PheSer: 5.186 ± 1.053
PheThr: 2.941 ± 0.364
PheVal: 6.889 ± 0.48
PheTrp: 1.006 ± 0.333
PheTyr: 2.399 ± 0.407
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.037 ± 0.839
GlyCys: 2.167 ± 0.296
GlyAsp: 5.805 ± 1.058
GlyGlu: 1.548 ± 0.484
GlyPhe: 4.721 ± 0.829
GlyGly: 4.489 ± 0.546
GlyHis: 1.316 ± 0.258
GlyIle: 3.328 ± 0.936
GlyLys: 4.334 ± 0.578
GlyLeu: 5.65 ± 0.708
GlyMet: 1.238 ± 0.294
GlyAsn: 2.554 ± 0.504
GlyPro: 2.012 ± 0.349
GlyGln: 1.316 ± 0.206
GlyArg: 2.322 ± 0.905
GlySer: 5.65 ± 1.152
GlyThr: 3.019 ± 0.416
GlyVal: 8.514 ± 0.536
GlyTrp: 0.697 ± 0.322
GlyTyr: 2.399 ± 0.265
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.316 ± 0.4
HisCys: 1.084 ± 0.337
HisAsp: 1.006 ± 0.288
HisGlu: 0.542 ± 0.184
HisPhe: 1.393 ± 0.326
HisGly: 1.238 ± 0.338
HisHis: 0.31 ± 0.173
HisIle: 0.464 ± 0.147
HisLys: 1.316 ± 0.17
HisLeu: 1.316 ± 1.058
HisMet: 0.0 ± 0.0
HisAsn: 2.322 ± 0.457
HisPro: 0.31 ± 0.113
HisGln: 0.387 ± 0.114
HisArg: 0.619 ± 0.324
HisSer: 1.006 ± 0.329
HisThr: 0.774 ± 0.24
HisVal: 2.632 ± 0.62
HisTrp: 0.077 ± 0.188
HisTyr: 0.542 ± 0.332
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.793 ± 1.609
IleCys: 1.238 ± 0.316
IleAsp: 2.864 ± 0.79
IleGlu: 2.012 ± 0.527
IlePhe: 2.941 ± 0.412
IleGly: 3.251 ± 0.733
IleHis: 0.155 ± 0.122
IleIle: 3.019 ± 1.708
IleLys: 2.864 ± 0.763
IleLeu: 3.947 ± 1.109
IleMet: 0.929 ± 0.345
IleAsn: 4.025 ± 0.667
IlePro: 1.935 ± 0.497
IleGln: 1.703 ± 0.309
IleArg: 1.625 ± 0.776
IleSer: 3.173 ± 0.418
IleThr: 3.793 ± 0.838
IleVal: 4.334 ± 0.753
IleTrp: 0.232 ± 0.074
IleTyr: 1.78 ± 0.902
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.644 ± 1.015
LysCys: 2.012 ± 0.348
LysAsp: 3.251 ± 0.437
LysGlu: 1.78 ± 0.365
LysPhe: 4.18 ± 0.734
LysGly: 2.864 ± 0.686
LysHis: 1.78 ± 0.305
LysIle: 1.858 ± 0.352
LysLys: 2.012 ± 0.34
LysLeu: 6.734 ± 0.684
LysMet: 1.084 ± 0.311
LysAsn: 2.554 ± 0.487
LysPro: 4.257 ± 0.6
LysGln: 2.554 ± 0.682
LysArg: 2.167 ± 0.626
LysSer: 2.012 ± 0.541
LysThr: 3.483 ± 0.763
LysVal: 5.108 ± 0.898
LysTrp: 0.464 ± 0.212
LysTyr: 3.251 ± 0.654
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.728 ± 0.831
LeuCys: 2.941 ± 0.348
LeuAsp: 4.644 ± 1.051
LeuGlu: 3.251 ± 0.371
LeuPhe: 5.263 ± 1.362
LeuGly: 6.424 ± 0.782
LeuHis: 2.399 ± 0.418
LeuIle: 4.567 ± 0.654
LeuLys: 6.502 ± 1.414
LeuLeu: 8.591 ± 0.607
LeuMet: 1.471 ± 0.979
LeuAsn: 4.025 ± 0.282
LeuPro: 2.786 ± 0.413
LeuGln: 3.56 ± 0.608
LeuArg: 3.56 ± 0.279
LeuSer: 7.663 ± 0.909
LeuThr: 4.025 ± 1.346
LeuVal: 5.805 ± 2.183
LeuTrp: 1.161 ± 0.93
LeuTyr: 4.489 ± 1.395
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.006 ± 0.177
MetCys: 1.238 ± 0.337
MetAsp: 1.316 ± 0.206
MetGlu: 0.929 ± 0.317
MetPhe: 1.316 ± 0.893
MetGly: 0.619 ± 0.225
MetHis: 0.697 ± 0.225
MetIle: 0.774 ± 0.256
MetLys: 0.774 ± 0.369
MetLeu: 2.477 ± 0.7
MetMet: 0.31 ± 0.191
MetAsn: 1.084 ± 0.168
MetPro: 1.084 ± 0.32
MetGln: 0.387 ± 0.185
MetArg: 0.619 ± 0.147
MetSer: 1.703 ± 0.458
MetThr: 1.316 ± 0.416
MetVal: 0.929 ± 0.183
MetTrp: 0.232 ± 0.38
MetTyr: 1.084 ± 0.266
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.483 ± 1.153
AsnCys: 2.09 ± 0.413
AsnAsp: 2.399 ± 0.244
AsnGlu: 2.399 ± 0.486
AsnPhe: 3.638 ± 0.23
AsnGly: 6.502 ± 0.91
AsnHis: 0.697 ± 0.221
AsnIle: 3.251 ± 0.589
AsnLys: 2.322 ± 0.564
AsnLeu: 4.18 ± 0.405
AsnMet: 1.393 ± 0.541
AsnAsn: 2.709 ± 0.763
AsnPro: 0.929 ± 0.611
AsnGln: 1.084 ± 0.312
AsnArg: 1.238 ± 0.319
AsnSer: 3.406 ± 0.863
AsnThr: 2.864 ± 0.717
AsnVal: 6.656 ± 1.012
AsnTrp: 0.542 ± 1.033
AsnTyr: 2.167 ± 0.488
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.173 ± 1.365
ProCys: 0.619 ± 0.242
ProAsp: 1.006 ± 0.31
ProGlu: 1.78 ± 0.366
ProPhe: 2.167 ± 0.254
ProGly: 2.245 ± 0.294
ProHis: 0.542 ± 0.174
ProIle: 1.78 ± 0.305
ProLys: 1.316 ± 0.492
ProLeu: 3.406 ± 0.412
ProMet: 0.619 ± 0.327
ProAsn: 1.316 ± 0.411
ProPro: 1.006 ± 0.332
ProGln: 0.464 ± 0.455
ProArg: 1.316 ± 0.864
ProSer: 3.173 ± 0.421
ProThr: 2.09 ± 0.813
ProVal: 2.477 ± 1.006
ProTrp: 0.542 ± 0.114
ProTyr: 1.316 ± 0.309
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.245 ± 0.737
GlnCys: 0.619 ± 0.123
GlnAsp: 1.393 ± 0.226
GlnGlu: 0.697 ± 0.24
GlnPhe: 1.316 ± 0.362
GlnGly: 2.012 ± 0.354
GlnHis: 0.697 ± 0.226
GlnIle: 0.929 ± 0.338
GlnLys: 1.78 ± 0.324
GlnLeu: 4.025 ± 0.753
GlnMet: 0.697 ± 0.209
GlnAsn: 1.084 ± 0.399
GlnPro: 1.084 ± 0.471
GlnGln: 0.542 ± 0.52
GlnArg: 1.238 ± 0.319
GlnSer: 2.399 ± 1.213
GlnThr: 1.471 ± 0.254
GlnVal: 1.703 ± 0.214
GlnTrp: 0.542 ± 0.33
GlnTyr: 0.619 ± 0.388
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.709 ± 0.404
ArgCys: 1.625 ± 0.552
ArgAsp: 0.232 ± 0.137
ArgGlu: 1.238 ± 0.148
ArgPhe: 3.096 ± 0.4
ArgGly: 2.399 ± 0.648
ArgHis: 0.697 ± 0.225
ArgIle: 1.471 ± 0.969
ArgLys: 2.012 ± 0.835
ArgLeu: 3.328 ± 0.616
ArgMet: 1.006 ± 0.293
ArgAsn: 2.399 ± 1.019
ArgPro: 0.774 ± 0.305
ArgGln: 0.542 ± 0.184
ArgArg: 1.316 ± 0.264
ArgSer: 3.638 ± 1.575
ArgThr: 1.471 ± 0.651
ArgVal: 3.638 ± 0.639
ArgTrp: 0.619 ± 0.157
ArgTyr: 0.851 ± 0.464
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.954 ± 0.388
SerCys: 1.625 ± 0.305
SerAsp: 5.418 ± 0.862
SerGlu: 2.167 ± 0.35
SerPhe: 4.954 ± 0.876
SerGly: 4.412 ± 0.371
SerHis: 1.316 ± 0.336
SerIle: 3.251 ± 0.835
SerLys: 4.412 ± 0.302
SerLeu: 5.418 ± 0.323
SerMet: 1.316 ± 0.247
SerAsn: 3.947 ± 0.729
SerPro: 1.548 ± 0.337
SerGln: 2.709 ± 1.325
SerArg: 1.858 ± 1.742
SerSer: 4.954 ± 0.445
SerThr: 5.263 ± 0.535
SerVal: 6.037 ± 0.75
SerTrp: 0.697 ± 0.485
SerTyr: 3.715 ± 0.439
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.251 ± 0.667
ThrCys: 1.161 ± 0.225
ThrAsp: 3.096 ± 0.339
ThrGlu: 2.167 ± 1.186
ThrPhe: 3.406 ± 0.458
ThrGly: 3.715 ± 1.15
ThrHis: 1.006 ± 0.373
ThrIle: 3.638 ± 0.8
ThrLys: 2.632 ± 0.313
ThrLeu: 5.728 ± 0.715
ThrMet: 1.548 ± 0.464
ThrAsn: 2.399 ± 0.726
ThrPro: 2.477 ± 0.853
ThrGln: 1.238 ± 0.22
ThrArg: 1.703 ± 0.593
ThrSer: 2.941 ± 0.554
ThrThr: 4.18 ± 0.721
ThrVal: 7.353 ± 0.875
ThrTrp: 0.619 ± 0.45
ThrTyr: 2.012 ± 0.36
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.198 ± 0.682
ValCys: 3.019 ± 0.643
ValAsp: 6.734 ± 1.059
ValGlu: 5.495 ± 0.911
ValPhe: 4.334 ± 0.587
ValGly: 5.263 ± 0.965
ValHis: 0.774 ± 0.28
ValIle: 4.025 ± 1.065
ValLys: 7.121 ± 1.583
ValLeu: 9.056 ± 1.123
ValMet: 2.167 ± 0.515
ValAsn: 5.882 ± 0.509
ValPro: 3.251 ± 0.509
ValGln: 3.251 ± 0.377
ValArg: 2.941 ± 0.2
ValSer: 5.728 ± 0.721
ValThr: 5.728 ± 0.451
ValVal: 11.765 ± 2.051
ValTrp: 1.006 ± 0.17
ValTyr: 3.56 ± 0.514
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.464 ± 0.781
TrpCys: 0.387 ± 0.114
TrpAsp: 0.774 ± 0.256
TrpGlu: 0.464 ± 0.095
TrpPhe: 0.542 ± 0.47
TrpGly: 0.31 ± 0.211
TrpHis: 0.232 ± 0.177
TrpIle: 0.31 ± 0.095
TrpLys: 0.542 ± 0.174
TrpLeu: 1.548 ± 0.96
TrpMet: 0.232 ± 0.074
TrpAsn: 1.006 ± 0.727
TrpPro: 0.387 ± 0.358
TrpGln: 0.464 ± 0.393
TrpArg: 0.851 ± 0.229
TrpSer: 1.78 ± 0.333
TrpThr: 0.464 ± 0.095
TrpVal: 0.697 ± 0.327
TrpTrp: 0.619 ± 0.486
TrpTyr: 0.929 ± 0.163
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.406 ± 0.332
TyrCys: 2.322 ± 0.389
TyrAsp: 3.56 ± 1.0
TyrGlu: 1.548 ± 0.396
TyrPhe: 1.78 ± 0.228
TyrGly: 2.632 ± 0.56
TyrHis: 0.619 ± 0.157
TyrIle: 2.399 ± 0.607
TyrLys: 2.012 ± 0.283
TyrLeu: 2.245 ± 0.338
TyrMet: 1.084 ± 0.33
TyrAsn: 2.941 ± 0.516
TyrPro: 1.161 ± 0.42
TyrGln: 1.006 ± 0.204
TyrArg: 1.935 ± 0.27
TyrSer: 2.709 ± 0.887
TyrThr: 2.632 ± 0.39
TyrVal: 4.644 ± 0.79
TyrTrp: 0.464 ± 0.178
TyrTyr: 2.554 ± 0.314
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (12921 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
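
To make the note above concrete, here is a minimal Python sketch of a protein-level bootstrap for one dipeptide frequency. It rests on several assumptions that the page does not confirm: that whole proteins are resampled with replacement, that 100 resamples are drawn, and that the reported ± value is the standard deviation of the resampled frequencies. The function names, the one-letter residue codes, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: each resample draws len(proteins) whole proteins with
    replacement, and the error is the population standard deviation
    of the resulting frequencies.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy input in one-letter code (hypothetical, not real HKU2 sequences).
toy = ["AAVVL", "VVA", "LLAA"]
error_vv = bootstrap_error(toy, "VV")
```

Because resampling is done over whole proteins, a proteome with only a few proteins (8 in this case) can give relatively wide error estimates for dipeptides concentrated in one or two of them.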

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski