Amino acid dipeptide frequency for Bat coronavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
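The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names, the toy sequences, the choice to count overlapping pairs within each protein only, and the plain comparison against the uniform 2.5‰ expectation are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues (one-letter codes)
ALPHABET = STANDARD + "X"            # X stands in for Xaa
EXPECTED_PER_MILLE = 1000.0 / 400    # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping pairs are counted within each sequence only,
    and every non-standard residue is mapped to X.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def representation(per_mille):
    """Label a frequency relative to the uniform expectation (hypothetical rule)."""
    if per_mille > EXPECTED_PER_MILLE:
        return "over"
    if per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "as expected"


# Toy input, not real Bat coronavirus proteins.
toy = ["MKVLAAGV", "MVVLSAXA"]
freqs = dipeptide_per_mille(toy)
print(freqs["VL"], representation(freqs["VL"]))
```

Applied to the table, a value such as ValVal (10.097‰) would be labelled "over" and MetLys (0.107‰) "under" by this simple rule; the colouring on the original page may use a different criterion.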

For more information see sequence space article on Wikipedia.

| 1st / 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 5.693 ± 0.815 | 2.256 ± 0.52 | 3.652 ± 0.757 | 2.148 ± 0.717 | 4.404 ± 1.347 | 3.652 ± 0.544 | 0.967 ± 0.278 | 5.156 ± 1.172 | 4.404 ± 0.806 | 5.693 ± 0.942 | 2.47 ± 0.658 | 4.834 ± 0.425 | 2.793 ± 0.632 | 2.148 ± 0.653 | 3.545 ± 0.574 | 5.371 ± 0.711 | 4.082 ± 0.883 | 6.445 ± 1.035 | 0.752 ± 0.376 | 3.759 ± 0.593 | 0.0 ± 0.0 |
| Cys | 2.041 ± 0.393 | 0.752 ± 0.356 | 2.363 ± 0.872 | 0.644 ± 0.588 | 2.041 ± 0.927 | 1.719 ± 0.47 | 0.537 ± 0.253 | 0.967 ± 0.394 | 1.396 ± 0.571 | 2.148 ± 0.487 | 0.537 ± 0.158 | 1.504 ± 0.295 | 0.859 ± 0.263 | 0.859 ± 0.328 | 1.074 ± 0.296 | 2.041 ± 0.472 | 2.578 ± 0.592 | 2.793 ± 0.676 | 0.322 ± 0.176 | 1.826 ± 0.62 | 0.0 ± 0.0 |
| Asp | 4.189 ± 0.864 | 0.967 ± 0.407 | 3.437 ± 0.674 | 2.148 ± 0.672 | 2.793 ± 0.549 | 4.082 ± 0.962 | 0.537 ± 0.265 | 3.115 ± 0.611 | 3.008 ± 0.812 | 4.189 ± 0.742 | 1.182 ± 0.439 | 2.685 ± 0.473 | 1.826 ± 0.512 | 1.074 ± 0.405 | 1.182 ± 0.324 | 3.008 ± 0.924 | 3.759 ± 0.562 | 5.048 ± 1.033 | 0.752 ± 0.352 | 3.437 ± 0.933 | 0.0 ± 0.0 |
| Glu | 2.685 ± 0.671 | 0.752 ± 0.316 | 1.504 ± 0.428 | 2.578 ± 1.122 | 1.611 ± 0.302 | 2.578 ± 1.441 | 0.967 ± 0.368 | 1.289 ± 0.371 | 1.396 ± 0.384 | 4.082 ± 0.777 | 0.644 ± 0.397 | 2.256 ± 0.679 | 2.148 ± 0.419 | 0.967 ± 0.512 | 1.182 ± 0.48 | 3.867 ± 0.749 | 1.719 ± 0.594 | 3.545 ± 0.897 | 0.215 ± 0.196 | 0.859 ± 0.309 | 0.0 ± 0.0 |
| Phe | 3.115 ± 1.291 | 1.396 ± 0.28 | 2.9 ± 1.079 | 1.504 ± 0.431 | 1.504 ± 0.203 | 3.222 ± 0.731 | 0.537 ± 0.363 | 3.115 ± 0.973 | 3.33 ± 0.891 | 3.867 ± 0.612 | 1.396 ± 0.302 | 3.437 ± 1.127 | 1.289 ± 0.733 | 0.967 ± 0.577 | 1.826 ± 0.471 | 3.974 ± 0.504 | 4.296 ± 1.063 | 5.156 ± 1.806 | 0.537 ± 0.363 | 3.33 ± 0.781 | 0.0 ± 0.0 |
| Gly | 3.974 ± 0.705 | 1.611 ± 0.443 | 3.545 ± 0.552 | 1.826 ± 0.461 | 3.759 ± 0.55 | 4.189 ± 0.794 | 0.859 ± 0.37 | 2.793 ± 0.673 | 2.256 ± 0.553 | 3.545 ± 0.764 | 1.182 ± 0.208 | 2.9 ± 1.037 | 2.685 ± 0.321 | 1.826 ± 0.978 | 2.363 ± 0.975 | 4.726 ± 1.615 | 4.941 ± 0.639 | 7.734 ± 1.248 | 1.182 ± 0.38 | 2.47 ± 0.493 | 0.0 ± 0.0 |
| His | 1.504 ± 0.614 | 0.644 ± 0.74 | 0.644 ± 0.256 | 0.644 ± 0.278 | 0.859 ± 0.634 | 1.182 ± 0.342 | 0.107 ± 0.096 | 0.752 ± 0.415 | 0.967 ± 0.439 | 1.611 ± 0.436 | 0.322 ± 0.148 | 0.644 ± 0.31 | 0.859 ± 0.443 | 0.43 ± 0.155 | 1.074 ± 0.459 | 0.967 ± 0.301 | 1.182 ± 0.497 | 2.041 ± 0.657 | 0.322 ± 0.13 | 0.752 ± 0.356 | 0.0 ± 0.0 |
| Ile | 3.008 ± 0.719 | 0.644 ± 0.614 | 1.933 ± 0.327 | 1.396 ± 0.663 | 1.611 ± 0.331 | 3.222 ± 0.867 | 0.322 ± 0.107 | 1.182 ± 0.318 | 2.148 ± 0.492 | 3.437 ± 2.048 | 1.074 ± 0.332 | 2.256 ± 0.657 | 2.148 ± 0.887 | 1.182 ± 0.423 | 2.041 ± 0.547 | 4.082 ± 1.059 | 4.082 ± 0.727 | 5.048 ± 0.911 | 0.537 ± 0.554 | 2.041 ± 0.447 | 0.0 ± 0.0 |
| Lys | 3.652 ± 1.263 | 1.396 ± 0.51 | 2.578 ± 0.544 | 2.47 ± 0.761 | 2.47 ± 0.697 | 2.685 ± 0.669 | 2.148 ± 0.884 | 0.537 ± 0.372 | 1.933 ± 1.077 | 5.048 ± 0.893 | 1.074 ± 0.412 | 1.074 ± 0.426 | 3.33 ± 0.588 | 2.148 ± 0.473 | 2.256 ± 0.48 | 2.578 ± 0.853 | 2.148 ± 0.984 | 4.726 ± 1.125 | 0.537 ± 0.277 | 2.256 ± 0.85 | 0.0 ± 0.0 |
| Leu | 8.593 ± 1.072 | 3.867 ± 0.493 | 4.189 ± 0.772 | 3.222 ± 0.96 | 4.189 ± 0.834 | 4.082 ± 0.562 | 1.074 ± 0.271 | 3.008 ± 0.485 | 5.156 ± 1.647 | 9.989 ± 2.387 | 1.933 ± 0.463 | 3.974 ± 0.633 | 5.693 ± 0.851 | 4.941 ± 0.595 | 4.726 ± 0.896 | 7.411 ± 0.603 | 4.511 ± 1.062 | 7.197 ± 1.278 | 1.719 ± 0.705 | 3.867 ± 1.066 | 0.0 ± 0.0 |
| Met | 1.611 ± 0.38 | 0.859 ± 0.174 | 1.289 ± 0.555 | 0.859 ± 0.252 | 0.859 ± 0.681 | 1.182 ± 0.333 | 1.074 ± 0.408 | 0.967 ± 0.225 | 0.107 ± 0.215 | 3.222 ± 0.661 | 0.43 ± 0.313 | 0.859 ± 0.37 | 1.182 ± 0.802 | 1.074 ± 0.308 | 0.967 ± 0.382 | 1.504 ± 0.657 | 1.182 ± 0.263 | 2.148 ± 0.508 | 0.215 ± 0.359 | 0.967 ± 0.27 | 0.0 ± 0.0 |
| Asn | 4.296 ± 0.555 | 1.504 ± 0.413 | 2.363 ± 0.779 | 1.933 ± 0.567 | 2.578 ± 0.473 | 3.974 ± 0.695 | 0.537 ± 0.413 | 2.363 ± 0.457 | 2.793 ± 0.357 | 4.619 ± 0.691 | 1.074 ± 0.494 | 2.9 ± 1.166 | 2.793 ± 0.656 | 1.504 ± 0.535 | 1.182 ± 0.572 | 3.115 ± 1.101 | 3.008 ± 0.855 | 5.371 ± 0.64 | 0.752 ± 0.305 | 2.256 ± 0.665 | 0.0 ± 0.0 |
| Pro | 3.437 ± 0.496 | 1.289 ± 0.717 | 2.578 ± 0.426 | 1.933 ± 0.72 | 2.363 ± 0.743 | 2.9 ± 0.528 | 1.074 ± 0.332 | 2.9 ± 0.871 | 1.611 ± 0.693 | 5.048 ± 1.417 | 1.074 ± 0.572 | 1.933 ± 0.702 | 2.148 ± 0.525 | 1.933 ± 0.395 | 2.363 ± 1.588 | 2.256 ± 0.684 | 3.115 ± 0.518 | 4.082 ± 0.853 | 0.644 ± 0.173 | 2.041 ± 0.209 | 0.0 ± 0.0 |
| Gln | 2.47 ± 0.624 | 0.752 ± 0.786 | 1.719 ± 0.563 | 1.826 ± 0.572 | 2.148 ± 0.748 | 2.578 ± 0.626 | 0.752 ± 0.56 | 0.967 ± 0.411 | 1.289 ± 0.249 | 3.867 ± 0.789 | 1.182 ± 0.306 | 1.611 ± 0.596 | 1.933 ± 0.609 | 2.256 ± 0.691 | 2.041 ± 1.342 | 1.719 ± 0.808 | 2.041 ± 0.433 | 2.363 ± 0.769 | 0.752 ± 0.344 | 1.396 ± 0.35 | 0.0 ± 0.0 |
| Arg | 2.363 ± 0.595 | 1.611 ± 0.435 | 1.504 ± 0.356 | 1.289 ± 0.411 | 2.363 ± 0.586 | 2.793 ± 1.102 | 1.289 ± 0.915 | 1.826 ± 0.404 | 1.504 ± 0.351 | 4.082 ± 0.463 | 0.859 ± 0.263 | 2.793 ± 1.708 | 1.611 ± 0.362 | 1.826 ± 1.14 | 2.578 ± 0.964 | 2.793 ± 0.844 | 3.008 ± 1.066 | 3.115 ± 0.679 | 0.537 ± 0.176 | 2.148 ± 0.607 | 0.0 ± 0.0 |
| Ser | 5.908 ± 0.542 | 2.041 ± 0.476 | 4.511 ± 0.952 | 2.685 ± 0.454 | 3.437 ± 0.719 | 3.759 ± 0.755 | 1.504 ± 1.041 | 3.974 ± 1.081 | 3.545 ± 0.911 | 7.841 ± 1.224 | 1.182 ± 0.447 | 2.793 ± 0.902 | 2.256 ± 0.43 | 1.933 ± 0.703 | 2.9 ± 1.535 | 7.197 ± 1.73 | 4.619 ± 0.814 | 7.197 ± 0.809 | 1.074 ± 0.425 | 2.041 ± 0.974 | 0.0 ± 0.0 |
| Thr | 4.941 ± 0.891 | 2.256 ± 0.705 | 3.115 ± 0.345 | 1.719 ± 0.451 | 3.759 ± 1.07 | 5.048 ± 0.635 | 1.289 ± 0.402 | 3.222 ± 0.668 | 3.652 ± 0.94 | 6.122 ± 1.711 | 1.826 ± 0.521 | 3.115 ± 0.816 | 3.652 ± 1.213 | 2.363 ± 0.721 | 2.9 ± 0.709 | 4.189 ± 0.704 | 5.371 ± 1.029 | 6.66 ± 1.135 | 0.644 ± 0.268 | 3.437 ± 1.08 | 0.0 ± 0.0 |
| Val | 6.767 ± 0.621 | 3.115 ± 1.06 | 4.619 ± 1.092 | 4.082 ± 0.944 | 4.726 ± 0.864 | 4.296 ± 0.681 | 1.182 ± 0.366 | 3.33 ± 1.144 | 4.082 ± 1.027 | 9.345 ± 1.651 | 2.148 ± 0.573 | 5.585 ± 1.208 | 4.619 ± 0.91 | 3.759 ± 0.809 | 3.652 ± 0.588 | 7.841 ± 1.297 | 7.841 ± 1.444 | 10.097 ± 1.336 | 0.967 ± 0.293 | 4.082 ± 0.837 | 0.0 ± 0.0 |
| Trp | 0.644 ± 0.327 | 0.322 ± 0.337 | 1.074 ± 0.53 | 0.43 ± 0.192 | 0.967 ± 0.433 | 0.752 ± 0.559 | 0.0 ± 0.0 | 0.322 ± 0.13 | 0.43 ± 0.164 | 2.256 ± 0.871 | 0.107 ± 0.215 | 0.43 ± 0.357 | 0.322 ± 0.458 | 0.752 ± 0.325 | 0.43 ± 0.117 | 0.967 ± 0.395 | 0.537 ± 0.246 | 1.289 ± 0.294 | 0.215 ± 0.371 | 0.859 ± 0.401 | 0.0 ± 0.0 |
| Tyr | 3.33 ± 0.593 | 0.967 ± 0.214 | 2.9 ± 0.808 | 1.504 ± 0.453 | 2.256 ± 0.415 | 2.47 ± 0.407 | 0.752 ± 0.232 | 1.611 ± 0.419 | 2.041 ± 0.32 | 3.437 ± 0.665 | 0.752 ± 0.256 | 3.33 ± 0.706 | 2.578 ± 0.618 | 1.719 ± 0.663 | 1.611 ± 0.533 | 2.793 ± 0.604 | 5.263 ± 0.833 | 4.082 ± 1.329 | 0.43 ± 0.203 | 2.685 ± 0.371 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 8 proteins (9311 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
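A protein-level bootstrap of this kind can be sketched as follows. The page states only that the error comes from 100 bootstrap rounds at the protein level; resampling whole proteins with replacement and reporting the sample standard deviation of the resampled frequencies is an assumption of this sketch, as are the function names and the toy sequences.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        hits += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Assumption: the error is the sample standard deviation across
    resamples; the database may use a different summary.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy proteome, not the 8 Bat coronavirus proteins.
toy_proteome = ["MKVLAAGVVL", "MVVLSAAA", "MAAVLKK"]
print(dipeptide_freq(toy_proteome, "VL"), bootstrap_error(toy_proteome, "VL"))
```

With only 8 proteins, each resample can differ considerably from the original proteome, which is consistent with some errors in the table being comparable to the values themselves (for example MetLys: 0.107 ± 0.215).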

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski