Amino acid dipepetide frequency for Miniopterus bat coronavirus HKU8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.54AlaAla: 5.54 ± 0.422
3.207AlaCys: 3.207 ± 0.608
2.77AlaAsp: 2.77 ± 0.427
2.478AlaGlu: 2.478 ± 0.286
5.176AlaPhe: 5.176 ± 0.463
3.791AlaGly: 3.791 ± 0.597
1.385AlaHis: 1.385 ± 0.154
4.374AlaIle: 4.374 ± 0.446
4.228AlaLys: 4.228 ± 0.803
7.581AlaLeu: 7.581 ± 0.868
1.385AlaMet: 1.385 ± 0.261
3.28AlaAsn: 3.28 ± 0.224
2.26AlaPro: 2.26 ± 0.937
1.239AlaGln: 1.239 ± 0.391
3.062AlaArg: 3.062 ± 0.529
5.832AlaSer: 5.832 ± 0.704
4.374AlaThr: 4.374 ± 0.36
8.237AlaVal: 8.237 ± 0.768
1.166AlaTrp: 1.166 ± 0.375
3.645AlaTyr: 3.645 ± 0.422
0.0AlaXaa: 0.0 ± 0.0
Cys
1.968CysAla: 1.968 ± 0.245
1.021CysCys: 1.021 ± 0.221
2.406CysAsp: 2.406 ± 0.443
0.583CysGlu: 0.583 ± 0.189
2.697CysPhe: 2.697 ± 0.386
3.135CysGly: 3.135 ± 0.672
0.51CysHis: 0.51 ± 0.19
1.021CysIle: 1.021 ± 0.219
2.114CysLys: 2.114 ± 0.577
1.822CysLeu: 1.822 ± 0.282
0.219CysMet: 0.219 ± 0.096
1.968CysAsn: 1.968 ± 0.434
1.166CysPro: 1.166 ± 0.371
1.166CysGln: 1.166 ± 0.22
1.385CysArg: 1.385 ± 0.196
2.26CysSer: 2.26 ± 0.407
2.406CysThr: 2.406 ± 0.365
3.572CysVal: 3.572 ± 0.558
0.656CysTrp: 0.656 ± 0.218
2.114CysTyr: 2.114 ± 0.588
0.0CysXaa: 0.0 ± 0.0
Asp
4.811AspAla: 4.811 ± 0.528
1.385AspCys: 1.385 ± 0.319
2.041AspAsp: 2.041 ± 0.27
2.26AspGlu: 2.26 ± 0.399
3.718AspPhe: 3.718 ± 0.431
6.05AspGly: 6.05 ± 1.183
1.312AspHis: 1.312 ± 0.249
2.187AspIle: 2.187 ± 0.427
2.114AspLys: 2.114 ± 0.306
4.811AspLeu: 4.811 ± 0.641
1.093AspMet: 1.093 ± 0.18
2.697AspAsn: 2.697 ± 0.305
1.968AspPro: 1.968 ± 0.203
1.312AspGln: 1.312 ± 0.57
1.312AspArg: 1.312 ± 0.204
3.28AspSer: 3.28 ± 0.451
1.968AspThr: 1.968 ± 0.347
5.321AspVal: 5.321 ± 0.728
0.802AspTrp: 0.802 ± 0.266
2.551AspTyr: 2.551 ± 0.441
0.0AspXaa: 0.0 ± 0.0
Glu
3.426GluAla: 3.426 ± 0.649
1.75GluCys: 1.75 ± 0.459
1.458GluAsp: 1.458 ± 0.193
1.822GluGlu: 1.822 ± 0.381
3.572GluPhe: 3.572 ± 0.957
3.426GluGly: 3.426 ± 0.285
1.093GluHis: 1.093 ± 0.691
1.895GluIle: 1.895 ± 0.236
1.895GluLys: 1.895 ± 0.377
2.989GluLeu: 2.989 ± 0.771
0.51GluMet: 0.51 ± 0.168
1.895GluAsn: 1.895 ± 0.16
2.624GluPro: 2.624 ± 0.216
1.458GluGln: 1.458 ± 0.654
1.604GluArg: 1.604 ± 0.207
1.239GluSer: 1.239 ± 0.381
2.551GluThr: 2.551 ± 0.627
3.572GluVal: 3.572 ± 0.465
0.51GluTrp: 0.51 ± 0.144
1.531GluTyr: 1.531 ± 0.188
0.0GluXaa: 0.0 ± 0.0
Phe
3.499PheAla: 3.499 ± 0.443
1.75PheCys: 1.75 ± 0.459
3.936PheAsp: 3.936 ± 0.531
2.916PheGlu: 2.916 ± 0.575
2.77PhePhe: 2.77 ± 0.533
5.03PheGly: 5.03 ± 0.569
0.656PheHis: 0.656 ± 0.249
2.916PheIle: 2.916 ± 0.499
4.301PheLys: 4.301 ± 0.904
4.228PheLeu: 4.228 ± 0.794
1.166PheMet: 1.166 ± 0.386
4.082PheAsn: 4.082 ± 0.586
0.656PhePro: 0.656 ± 0.127
1.093PheGln: 1.093 ± 0.5
1.166PheArg: 1.166 ± 0.579
3.353PheSer: 3.353 ± 0.853
3.28PheThr: 3.28 ± 0.419
7.873PheVal: 7.873 ± 0.831
1.166PheTrp: 1.166 ± 0.292
2.989PheTyr: 2.989 ± 0.39
0.0PheXaa: 0.0 ± 0.0
Gly
5.249GlyAla: 5.249 ± 1.169
2.114GlyCys: 2.114 ± 0.372
5.249GlyAsp: 5.249 ± 1.112
2.406GlyGlu: 2.406 ± 0.308
4.374GlyPhe: 4.374 ± 0.594
5.613GlyGly: 5.613 ± 0.545
0.437GlyHis: 0.437 ± 0.08
2.989GlyIle: 2.989 ± 0.91
3.426GlyLys: 3.426 ± 0.656
4.811GlyLeu: 4.811 ± 0.302
0.948GlyMet: 0.948 ± 0.27
3.28GlyAsn: 3.28 ± 0.4
1.822GlyPro: 1.822 ± 0.204
0.875GlyGln: 0.875 ± 0.175
2.187GlyArg: 2.187 ± 0.394
5.832GlySer: 5.832 ± 0.882
4.155GlyThr: 4.155 ± 0.579
9.695GlyVal: 9.695 ± 1.257
0.583GlyTrp: 0.583 ± 0.264
3.353GlyTyr: 3.353 ± 0.294
0.0GlyXaa: 0.0 ± 0.0
His
1.968HisAla: 1.968 ± 0.408
0.583HisCys: 0.583 ± 0.203
0.729HisAsp: 0.729 ± 0.162
1.166HisGlu: 1.166 ± 0.369
0.729HisPhe: 0.729 ± 0.374
0.729HisGly: 0.729 ± 0.207
0.146HisHis: 0.146 ± 0.185
0.802HisIle: 0.802 ± 0.154
0.729HisLys: 0.729 ± 0.268
1.385HisLeu: 1.385 ± 0.548
0.219HisMet: 0.219 ± 0.096
1.385HisAsn: 1.385 ± 0.355
0.364HisPro: 0.364 ± 0.18
0.583HisGln: 0.583 ± 0.209
0.364HisArg: 0.364 ± 0.177
1.239HisSer: 1.239 ± 0.181
1.531HisThr: 1.531 ± 0.389
2.041HisVal: 2.041 ± 0.549
0.219HisTrp: 0.219 ± 0.075
1.021HisTyr: 1.021 ± 0.21
0.0HisXaa: 0.0 ± 0.0
Ile
2.77IleAla: 2.77 ± 0.238
1.458IleCys: 1.458 ± 0.731
3.207IleAsp: 3.207 ± 0.651
2.041IleGlu: 2.041 ± 0.51
2.989IlePhe: 2.989 ± 0.519
2.77IleGly: 2.77 ± 0.551
0.292IleHis: 0.292 ± 0.209
2.114IleIle: 2.114 ± 0.982
2.989IleLys: 2.989 ± 0.436
3.499IleLeu: 3.499 ± 1.191
1.166IleMet: 1.166 ± 0.211
2.041IleAsn: 2.041 ± 0.282
2.478IlePro: 2.478 ± 0.875
1.458IleGln: 1.458 ± 0.373
1.822IleArg: 1.822 ± 0.245
2.551IleSer: 2.551 ± 0.63
3.135IleThr: 3.135 ± 0.437
5.321IleVal: 5.321 ± 0.422
0.364IleTrp: 0.364 ± 0.173
1.312IleTyr: 1.312 ± 0.431
0.0IleXaa: 0.0 ± 0.0
Lys
4.301LysAla: 4.301 ± 0.701
2.114LysCys: 2.114 ± 0.577
2.989LysAsp: 2.989 ± 0.589
1.968LysGlu: 1.968 ± 0.781
2.916LysPhe: 2.916 ± 0.467
2.916LysGly: 2.916 ± 0.593
2.114LysHis: 2.114 ± 0.69
2.333LysIle: 2.333 ± 0.397
1.312LysLys: 1.312 ± 0.782
4.52LysLeu: 4.52 ± 0.914
1.166LysMet: 1.166 ± 0.274
2.187LysAsn: 2.187 ± 0.44
3.135LysPro: 3.135 ± 0.697
1.822LysGln: 1.822 ± 0.27
2.916LysArg: 2.916 ± 0.395
3.645LysSer: 3.645 ± 0.986
2.843LysThr: 2.843 ± 0.4
4.301LysVal: 4.301 ± 0.724
0.51LysTrp: 0.51 ± 0.306
2.551LysTyr: 2.551 ± 0.293
0.0LysXaa: 0.0 ± 0.0
Leu
6.561LeuAla: 6.561 ± 0.524
2.77LeuCys: 2.77 ± 0.523
3.572LeuAsp: 3.572 ± 0.31
3.572LeuGlu: 3.572 ± 0.487
4.593LeuPhe: 4.593 ± 0.585
4.957LeuGly: 4.957 ± 0.573
1.895LeuHis: 1.895 ± 0.38
3.28LeuIle: 3.28 ± 1.117
4.593LeuLys: 4.593 ± 1.128
7.727LeuLeu: 7.727 ± 1.773
1.531LeuMet: 1.531 ± 0.702
4.738LeuAsn: 4.738 ± 0.649
3.645LeuPro: 3.645 ± 1.853
3.572LeuGln: 3.572 ± 0.755
2.624LeuArg: 2.624 ± 0.221
7.29LeuSer: 7.29 ± 0.97
4.957LeuThr: 4.957 ± 0.561
6.269LeuVal: 6.269 ± 0.811
0.948LeuTrp: 0.948 ± 0.893
4.082LeuTyr: 4.082 ± 0.781
0.0LeuXaa: 0.0 ± 0.0
Met
1.093MetAla: 1.093 ± 0.213
0.948MetCys: 0.948 ± 0.348
1.166MetAsp: 1.166 ± 0.384
0.729MetGlu: 0.729 ± 0.238
1.604MetPhe: 1.604 ± 0.349
1.312MetGly: 1.312 ± 0.181
0.219MetHis: 0.219 ± 0.075
0.656MetIle: 0.656 ± 0.326
0.292MetLys: 0.292 ± 0.316
2.77MetLeu: 2.77 ± 0.458
0.437MetMet: 0.437 ± 0.149
0.729MetAsn: 0.729 ± 0.181
1.021MetPro: 1.021 ± 0.262
0.802MetGln: 0.802 ± 0.269
1.166MetArg: 1.166 ± 0.254
1.677MetSer: 1.677 ± 0.523
0.802MetThr: 0.802 ± 0.225
1.458MetVal: 1.458 ± 0.251
0.146MetTrp: 0.146 ± 0.052
1.093MetTyr: 1.093 ± 0.24
0.0MetXaa: 0.0 ± 0.0
Asn
3.572AsnAla: 3.572 ± 0.651
2.333AsnCys: 2.333 ± 0.55
1.458AsnAsp: 1.458 ± 0.335
1.822AsnGlu: 1.822 ± 0.367
2.697AsnPhe: 2.697 ± 0.316
6.123AsnGly: 6.123 ± 1.328
0.875AsnHis: 0.875 ± 0.166
3.645AsnIle: 3.645 ± 0.395
2.406AsnLys: 2.406 ± 0.498
3.28AsnLeu: 3.28 ± 0.226
1.166AsnMet: 1.166 ± 0.174
2.843AsnAsn: 2.843 ± 0.434
1.968AsnPro: 1.968 ± 0.525
0.875AsnGln: 0.875 ± 0.279
1.166AsnArg: 1.166 ± 0.291
4.593AsnSer: 4.593 ± 1.339
2.916AsnThr: 2.916 ± 0.324
6.342AsnVal: 6.342 ± 0.563
0.802AsnTrp: 0.802 ± 0.441
1.968AsnTyr: 1.968 ± 0.252
0.0AsnXaa: 0.0 ± 0.0
Pro
2.916ProAla: 2.916 ± 0.471
1.239ProCys: 1.239 ± 0.225
1.968ProAsp: 1.968 ± 0.476
2.26ProGlu: 2.26 ± 0.122
1.531ProPhe: 1.531 ± 0.408
2.187ProGly: 2.187 ± 0.449
1.093ProHis: 1.093 ± 0.257
1.677ProIle: 1.677 ± 0.289
1.822ProLys: 1.822 ± 0.941
3.353ProLeu: 3.353 ± 0.35
0.729ProMet: 0.729 ± 0.217
1.895ProAsn: 1.895 ± 0.715
2.624ProPro: 2.624 ± 0.351
1.385ProGln: 1.385 ± 0.777
1.458ProArg: 1.458 ± 0.701
3.426ProSer: 3.426 ± 0.378
2.697ProThr: 2.697 ± 0.642
4.374ProVal: 4.374 ± 0.943
0.437ProTrp: 0.437 ± 0.08
0.948ProTyr: 0.948 ± 0.277
0.0ProXaa: 0.0 ± 0.0
Gln
2.114GlnAla: 2.114 ± 0.307
0.656GlnCys: 0.656 ± 0.121
1.385GlnAsp: 1.385 ± 0.256
0.948GlnGlu: 0.948 ± 0.153
1.385GlnPhe: 1.385 ± 0.281
1.677GlnGly: 1.677 ± 0.499
0.51GlnHis: 0.51 ± 0.148
0.729GlnIle: 0.729 ± 0.321
1.312GlnLys: 1.312 ± 0.318
3.936GlnLeu: 3.936 ± 1.027
0.802GlnMet: 0.802 ± 0.285
1.239GlnAsn: 1.239 ± 0.34
1.822GlnPro: 1.822 ± 0.526
1.239GlnGln: 1.239 ± 0.839
1.458GlnArg: 1.458 ± 0.292
2.333GlnSer: 2.333 ± 0.801
1.458GlnThr: 1.458 ± 0.406
1.604GlnVal: 1.604 ± 0.712
0.219GlnTrp: 0.219 ± 0.075
1.021GlnTyr: 1.021 ± 0.779
0.0GlnXaa: 0.0 ± 0.0
Arg
2.624ArgAla: 2.624 ± 0.311
1.312ArgCys: 1.312 ± 0.35
1.385ArgAsp: 1.385 ± 0.248
1.021ArgGlu: 1.021 ± 0.487
2.406ArgPhe: 2.406 ± 0.345
2.041ArgGly: 2.041 ± 0.509
0.802ArgHis: 0.802 ± 0.185
1.385ArgIle: 1.385 ± 0.446
2.551ArgLys: 2.551 ± 0.97
3.135ArgLeu: 3.135 ± 1.161
0.656ArgMet: 0.656 ± 0.201
1.895ArgAsn: 1.895 ± 0.278
1.531ArgPro: 1.531 ± 0.179
0.948ArgGln: 0.948 ± 0.264
1.312ArgArg: 1.312 ± 0.756
1.968ArgSer: 1.968 ± 0.621
2.333ArgThr: 2.333 ± 0.534
4.082ArgVal: 4.082 ± 0.539
0.364ArgTrp: 0.364 ± 0.144
1.895ArgTyr: 1.895 ± 0.198
0.0ArgXaa: 0.0 ± 0.0
Ser
5.467SerAla: 5.467 ± 0.391
1.968SerCys: 1.968 ± 0.483
3.864SerAsp: 3.864 ± 0.783
3.135SerGlu: 3.135 ± 0.501
3.864SerPhe: 3.864 ± 0.661
5.249SerGly: 5.249 ± 0.817
1.166SerHis: 1.166 ± 0.303
3.645SerIle: 3.645 ± 1.473
3.936SerLys: 3.936 ± 0.885
4.884SerLeu: 4.884 ± 0.721
1.895SerMet: 1.895 ± 0.415
3.207SerAsn: 3.207 ± 0.472
1.75SerPro: 1.75 ± 0.458
1.968SerGln: 1.968 ± 1.191
2.333SerArg: 2.333 ± 1.203
4.082SerSer: 4.082 ± 0.646
5.249SerThr: 5.249 ± 0.764
8.019SerVal: 8.019 ± 0.784
0.656SerTrp: 0.656 ± 0.483
2.406SerTyr: 2.406 ± 0.347
0.0SerXaa: 0.0 ± 0.0
Thr
4.52ThrAla: 4.52 ± 0.7
1.822ThrCys: 1.822 ± 0.516
3.791ThrAsp: 3.791 ± 0.45
2.916ThrGlu: 2.916 ± 0.305
4.228ThrPhe: 4.228 ± 0.6
2.843ThrGly: 2.843 ± 0.645
0.364ThrHis: 0.364 ± 0.191
3.135ThrIle: 3.135 ± 0.5
2.041ThrLys: 2.041 ± 0.493
5.832ThrLeu: 5.832 ± 0.64
1.822ThrMet: 1.822 ± 0.399
2.478ThrAsn: 2.478 ± 0.366
3.135ThrPro: 3.135 ± 0.719
1.968ThrGln: 1.968 ± 0.115
2.916ThrArg: 2.916 ± 0.334
4.738ThrSer: 4.738 ± 0.477
5.54ThrThr: 5.54 ± 0.943
5.905ThrVal: 5.905 ± 0.535
0.437ThrTrp: 0.437 ± 0.173
1.75ThrTyr: 1.75 ± 0.325
0.0ThrXaa: 0.0 ± 0.0
Val
8.966ValAla: 8.966 ± 0.847
3.718ValCys: 3.718 ± 0.609
6.342ValAsp: 6.342 ± 1.051
4.374ValGlu: 4.374 ± 0.56
4.665ValPhe: 4.665 ± 0.561
6.342ValGly: 6.342 ± 0.903
1.75ValHis: 1.75 ± 0.369
4.52ValIle: 4.52 ± 0.812
7.727ValLys: 7.727 ± 1.68
7.727ValLeu: 7.727 ± 0.377
1.895ValMet: 1.895 ± 0.601
7.29ValAsn: 7.29 ± 0.546
4.593ValPro: 4.593 ± 0.398
2.697ValGln: 2.697 ± 0.367
3.207ValArg: 3.207 ± 0.857
6.415ValSer: 6.415 ± 0.496
6.634ValThr: 6.634 ± 0.418
12.32ValVal: 12.32 ± 1.756
1.166ValTrp: 1.166 ± 0.219
3.936ValTyr: 3.936 ± 0.354
0.0ValXaa: 0.0 ± 0.0
Trp
0.948TrpAla: 0.948 ± 0.634
0.146TrpCys: 0.146 ± 0.106
0.729TrpAsp: 0.729 ± 0.209
0.219TrpGlu: 0.219 ± 0.075
0.948TrpPhe: 0.948 ± 0.248
0.073TrpGly: 0.073 ± 0.049
0.51TrpHis: 0.51 ± 0.145
0.656TrpIle: 0.656 ± 0.441
0.292TrpLys: 0.292 ± 0.206
1.604TrpLeu: 1.604 ± 0.291
0.292TrpMet: 0.292 ± 0.171
0.656TrpAsn: 0.656 ± 0.477
0.656TrpPro: 0.656 ± 0.29
0.146TrpGln: 0.146 ± 0.106
0.729TrpArg: 0.729 ± 0.172
0.802TrpSer: 0.802 ± 0.244
0.583TrpThr: 0.583 ± 0.353
1.239TrpVal: 1.239 ± 0.223
0.292TrpTrp: 0.292 ± 0.194
0.729TrpTyr: 0.729 ± 0.178
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.77TyrAla: 2.77 ± 0.295
1.895TyrCys: 1.895 ± 0.274
2.551TyrAsp: 2.551 ± 0.883
2.187TyrGlu: 2.187 ± 0.42
1.968TyrPhe: 1.968 ± 0.266
3.135TyrGly: 3.135 ± 0.597
0.802TyrHis: 0.802 ± 0.209
1.822TyrIle: 1.822 ± 0.384
2.478TyrLys: 2.478 ± 0.296
3.28TyrLeu: 3.28 ± 0.302
0.948TyrMet: 0.948 ± 0.23
3.062TyrAsn: 3.062 ± 0.627
0.802TyrPro: 0.802 ± 0.342
1.239TyrGln: 1.239 ± 0.291
1.458TyrArg: 1.458 ± 0.583
2.187TyrSer: 2.187 ± 0.567
2.77TyrThr: 2.77 ± 0.401
4.738TyrVal: 4.738 ± 0.583
0.802TyrTrp: 0.802 ± 0.13
2.478TyrTyr: 2.478 ± 0.259
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (13719 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski