Amino acid dipepetide frequency for Bat coronavirus CDPHE15/USA/2006

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.317AlaAla: 6.317 ± 0.744
2.557AlaCys: 2.557 ± 0.399
3.234AlaAsp: 3.234 ± 0.231
3.384AlaGlu: 3.384 ± 0.529
4.888AlaPhe: 4.888 ± 0.692
4.061AlaGly: 4.061 ± 0.516
0.902AlaHis: 0.902 ± 0.262
5.414AlaIle: 5.414 ± 0.498
4.362AlaLys: 4.362 ± 0.41
6.618AlaLeu: 6.618 ± 0.773
2.03AlaMet: 2.03 ± 0.367
4.362AlaAsn: 4.362 ± 0.106
2.331AlaPro: 2.331 ± 0.244
1.805AlaGln: 1.805 ± 0.512
2.482AlaArg: 2.482 ± 0.388
4.813AlaSer: 4.813 ± 0.873
4.211AlaThr: 4.211 ± 0.273
5.79AlaVal: 5.79 ± 0.638
0.752AlaTrp: 0.752 ± 0.222
3.008AlaTyr: 3.008 ± 0.573
0.0AlaXaa: 0.0 ± 0.0
Cys
1.805CysAla: 1.805 ± 0.2
1.955CysCys: 1.955 ± 0.235
2.482CysAsp: 2.482 ± 0.446
0.602CysGlu: 0.602 ± 0.167
2.181CysPhe: 2.181 ± 0.293
2.331CysGly: 2.331 ± 0.291
0.602CysHis: 0.602 ± 0.216
1.203CysIle: 1.203 ± 0.174
2.858CysLys: 2.858 ± 0.418
2.406CysLeu: 2.406 ± 0.413
0.602CysMet: 0.602 ± 0.196
2.03CysAsn: 2.03 ± 0.249
0.677CysPro: 0.677 ± 0.16
0.602CysGln: 0.602 ± 0.095
0.978CysArg: 0.978 ± 0.271
2.482CysSer: 2.482 ± 0.375
2.406CysThr: 2.406 ± 0.373
3.685CysVal: 3.685 ± 0.565
0.602CysTrp: 0.602 ± 0.167
1.955CysTyr: 1.955 ± 0.297
0.0CysXaa: 0.0 ± 0.0
Asp
3.459AspAla: 3.459 ± 0.419
1.504AspCys: 1.504 ± 0.202
3.986AspAsp: 3.986 ± 0.69
2.782AspGlu: 2.782 ± 0.45
4.587AspPhe: 4.587 ± 0.8
4.813AspGly: 4.813 ± 0.334
1.278AspHis: 1.278 ± 0.225
2.782AspIle: 2.782 ± 0.692
1.955AspLys: 1.955 ± 0.311
3.76AspLeu: 3.76 ± 0.415
1.278AspMet: 1.278 ± 0.355
2.256AspAsn: 2.256 ± 0.339
1.73AspPro: 1.73 ± 0.355
2.03AspGln: 2.03 ± 0.324
1.504AspArg: 1.504 ± 0.175
5.114AspSer: 5.114 ± 0.609
2.782AspThr: 2.782 ± 0.204
5.866AspVal: 5.866 ± 0.459
0.827AspTrp: 0.827 ± 0.231
2.482AspTyr: 2.482 ± 0.61
0.0AspXaa: 0.0 ± 0.0
Glu
3.083GluAla: 3.083 ± 0.638
0.752GluCys: 0.752 ± 0.19
2.03GluAsp: 2.03 ± 0.282
2.858GluGlu: 2.858 ± 0.388
2.632GluPhe: 2.632 ± 0.533
3.083GluGly: 3.083 ± 0.181
1.128GluHis: 1.128 ± 0.191
1.278GluIle: 1.278 ± 0.363
1.504GluLys: 1.504 ± 0.29
2.707GluLeu: 2.707 ± 0.368
0.376GluMet: 0.376 ± 0.077
2.181GluAsn: 2.181 ± 0.483
2.632GluPro: 2.632 ± 0.538
1.88GluGln: 1.88 ± 0.393
1.504GluArg: 1.504 ± 0.376
2.406GluSer: 2.406 ± 0.305
1.203GluThr: 1.203 ± 0.171
3.008GluVal: 3.008 ± 0.327
0.526GluTrp: 0.526 ± 0.11
1.354GluTyr: 1.354 ± 0.417
0.0GluXaa: 0.0 ± 0.0
Phe
2.707PheAla: 2.707 ± 0.212
1.955PheCys: 1.955 ± 0.257
3.61PheAsp: 3.61 ± 0.329
2.632PheGlu: 2.632 ± 0.321
2.406PhePhe: 2.406 ± 0.489
4.738PheGly: 4.738 ± 0.459
0.376PheHis: 0.376 ± 0.152
2.933PheIle: 2.933 ± 0.236
3.384PheLys: 3.384 ± 0.383
5.49PheLeu: 5.49 ± 0.896
1.429PheMet: 1.429 ± 0.402
3.61PheAsn: 3.61 ± 0.672
0.902PhePro: 0.902 ± 0.153
0.526PheGln: 0.526 ± 0.18
1.88PheArg: 1.88 ± 0.312
3.459PheSer: 3.459 ± 0.275
3.91PheThr: 3.91 ± 1.196
7.294PheVal: 7.294 ± 0.602
0.978PheTrp: 0.978 ± 0.124
3.384PheTyr: 3.384 ± 0.328
0.0PheXaa: 0.0 ± 0.0
Gly
6.242GlyAla: 6.242 ± 0.697
3.234GlyCys: 3.234 ± 0.418
4.813GlyAsp: 4.813 ± 0.159
1.955GlyGlu: 1.955 ± 0.18
4.362GlyPhe: 4.362 ± 0.481
5.264GlyGly: 5.264 ± 0.593
1.278GlyHis: 1.278 ± 0.419
2.782GlyIle: 2.782 ± 0.471
3.91GlyLys: 3.91 ± 0.698
6.317GlyLeu: 6.317 ± 0.151
0.752GlyMet: 0.752 ± 0.234
2.858GlyAsn: 2.858 ± 0.401
1.955GlyPro: 1.955 ± 0.194
1.805GlyGln: 1.805 ± 0.192
1.354GlyArg: 1.354 ± 0.705
5.264GlySer: 5.264 ± 0.475
4.061GlyThr: 4.061 ± 0.495
9.174GlyVal: 9.174 ± 1.105
0.602GlyTrp: 0.602 ± 0.269
2.557GlyTyr: 2.557 ± 0.239
0.0GlyXaa: 0.0 ± 0.0
His
1.955HisAla: 1.955 ± 0.419
0.602HisCys: 0.602 ± 0.211
0.827HisAsp: 0.827 ± 0.273
1.128HisGlu: 1.128 ± 0.327
0.526HisPhe: 0.526 ± 0.291
1.278HisGly: 1.278 ± 0.172
0.376HisHis: 0.376 ± 0.13
1.053HisIle: 1.053 ± 0.099
0.677HisLys: 0.677 ± 0.25
1.504HisLeu: 1.504 ± 0.33
0.376HisMet: 0.376 ± 0.18
1.053HisAsn: 1.053 ± 0.099
0.376HisPro: 0.376 ± 0.104
0.827HisGln: 0.827 ± 0.1
0.376HisArg: 0.376 ± 0.145
1.579HisSer: 1.579 ± 0.164
1.579HisThr: 1.579 ± 0.159
1.88HisVal: 1.88 ± 0.345
0.15HisTrp: 0.15 ± 0.049
1.354HisTyr: 1.354 ± 0.328
0.0HisXaa: 0.0 ± 0.0
Ile
2.256IleAla: 2.256 ± 1.294
1.128IleCys: 1.128 ± 0.157
2.106IleAsp: 2.106 ± 0.258
1.203IleGlu: 1.203 ± 0.564
2.181IlePhe: 2.181 ± 0.314
2.707IleGly: 2.707 ± 0.284
1.278IleHis: 1.278 ± 0.178
2.782IleIle: 2.782 ± 0.652
3.234IleLys: 3.234 ± 0.554
4.587IleLeu: 4.587 ± 0.499
1.429IleMet: 1.429 ± 0.289
3.083IleAsn: 3.083 ± 0.449
1.579IlePro: 1.579 ± 0.129
1.203IleGln: 1.203 ± 0.205
2.256IleArg: 2.256 ± 0.378
3.61IleSer: 3.61 ± 0.329
2.707IleThr: 2.707 ± 0.218
3.835IleVal: 3.835 ± 0.27
0.602IleTrp: 0.602 ± 0.128
2.03IleTyr: 2.03 ± 0.696
0.0IleXaa: 0.0 ± 0.0
Lys
3.158LysAla: 3.158 ± 0.388
1.429LysCys: 1.429 ± 0.243
2.632LysAsp: 2.632 ± 0.244
1.504LysGlu: 1.504 ± 0.379
3.158LysPhe: 3.158 ± 0.482
2.858LysGly: 2.858 ± 0.366
1.955LysHis: 1.955 ± 0.595
2.181LysIle: 2.181 ± 0.288
2.03LysLys: 2.03 ± 0.419
3.835LysLeu: 3.835 ± 0.713
1.053LysMet: 1.053 ± 0.1
1.805LysAsn: 1.805 ± 0.358
4.362LysPro: 4.362 ± 0.591
2.03LysGln: 2.03 ± 0.445
1.73LysArg: 1.73 ± 0.216
3.986LysSer: 3.986 ± 0.581
3.158LysThr: 3.158 ± 0.589
4.286LysVal: 4.286 ± 0.55
0.526LysTrp: 0.526 ± 0.232
2.632LysTyr: 2.632 ± 0.485
0.0LysXaa: 0.0 ± 0.0
Leu
7.219LeuAla: 7.219 ± 0.705
3.459LeuCys: 3.459 ± 0.758
4.662LeuAsp: 4.662 ± 0.613
3.309LeuGlu: 3.309 ± 0.561
4.813LeuPhe: 4.813 ± 0.611
6.016LeuGly: 6.016 ± 0.458
1.88LeuHis: 1.88 ± 0.3
2.256LeuIle: 2.256 ± 0.849
5.189LeuLys: 5.189 ± 1.005
8.197LeuLeu: 8.197 ± 1.176
1.128LeuMet: 1.128 ± 0.24
5.264LeuAsn: 5.264 ± 0.764
3.234LeuPro: 3.234 ± 0.914
4.512LeuGln: 4.512 ± 0.746
3.76LeuArg: 3.76 ± 0.203
6.994LeuSer: 6.994 ± 0.654
4.061LeuThr: 4.061 ± 0.937
6.918LeuVal: 6.918 ± 0.811
1.654LeuTrp: 1.654 ± 0.566
3.685LeuTyr: 3.685 ± 0.517
0.0LeuXaa: 0.0 ± 0.0
Met
2.106MetAla: 2.106 ± 0.32
0.677MetCys: 0.677 ± 0.346
0.752MetAsp: 0.752 ± 0.233
0.752MetGlu: 0.752 ± 0.164
2.03MetPhe: 2.03 ± 0.191
1.354MetGly: 1.354 ± 0.138
0.451MetHis: 0.451 ± 0.131
0.526MetIle: 0.526 ± 0.248
0.301MetLys: 0.301 ± 0.145
2.406MetLeu: 2.406 ± 0.32
0.451MetMet: 0.451 ± 0.199
0.677MetAsn: 0.677 ± 0.128
0.902MetPro: 0.902 ± 0.128
0.301MetGln: 0.301 ± 0.138
0.978MetArg: 0.978 ± 0.096
1.278MetSer: 1.278 ± 0.245
0.827MetThr: 0.827 ± 0.184
2.03MetVal: 2.03 ± 0.318
0.226MetTrp: 0.226 ± 0.065
1.203MetTyr: 1.203 ± 0.338
0.0MetXaa: 0.0 ± 0.0
Asn
3.234AsnAla: 3.234 ± 0.384
1.805AsnCys: 1.805 ± 0.385
1.73AsnAsp: 1.73 ± 0.2
1.88AsnGlu: 1.88 ± 0.208
2.707AsnPhe: 2.707 ± 0.364
6.467AsnGly: 6.467 ± 1.016
1.354AsnHis: 1.354 ± 0.138
1.88AsnIle: 1.88 ± 0.146
1.73AsnLys: 1.73 ± 0.538
4.888AsnLeu: 4.888 ± 0.407
1.128AsnMet: 1.128 ± 0.16
3.384AsnAsn: 3.384 ± 0.274
1.579AsnPro: 1.579 ± 0.155
0.978AsnGln: 0.978 ± 0.582
1.429AsnArg: 1.429 ± 0.369
3.61AsnSer: 3.61 ± 1.02
3.459AsnThr: 3.459 ± 0.137
6.843AsnVal: 6.843 ± 0.347
0.526AsnTrp: 0.526 ± 0.271
1.73AsnTyr: 1.73 ± 0.171
0.0AsnXaa: 0.0 ± 0.0
Pro
2.933ProAla: 2.933 ± 0.582
0.526ProCys: 0.526 ± 0.054
1.88ProAsp: 1.88 ± 0.405
1.579ProGlu: 1.579 ± 0.294
2.106ProPhe: 2.106 ± 0.447
2.406ProGly: 2.406 ± 0.299
0.978ProHis: 0.978 ± 0.157
2.406ProIle: 2.406 ± 0.231
1.805ProLys: 1.805 ± 0.795
3.91ProLeu: 3.91 ± 0.305
0.376ProMet: 0.376 ± 0.104
1.278ProAsn: 1.278 ± 0.314
1.805ProPro: 1.805 ± 0.167
0.978ProGln: 0.978 ± 0.374
1.73ProArg: 1.73 ± 0.76
2.406ProSer: 2.406 ± 0.167
2.256ProThr: 2.256 ± 0.454
3.61ProVal: 3.61 ± 0.506
0.451ProTrp: 0.451 ± 0.043
0.902ProTyr: 0.902 ± 0.34
0.0ProXaa: 0.0 ± 0.0
Gln
2.181GlnAla: 2.181 ± 0.166
1.429GlnCys: 1.429 ± 0.256
1.203GlnAsp: 1.203 ± 0.192
0.677GlnGlu: 0.677 ± 0.137
1.504GlnPhe: 1.504 ± 0.707
1.579GlnGly: 1.579 ± 0.123
0.376GlnHis: 0.376 ± 0.152
1.278GlnIle: 1.278 ± 0.545
0.752GlnLys: 0.752 ± 0.101
4.211GlnLeu: 4.211 ± 0.482
0.827GlnMet: 0.827 ± 0.111
1.278GlnAsn: 1.278 ± 0.124
1.504GlnPro: 1.504 ± 0.688
0.677GlnGln: 0.677 ± 0.543
2.482GlnArg: 2.482 ± 0.192
1.955GlnSer: 1.955 ± 0.272
1.429GlnThr: 1.429 ± 0.366
1.73GlnVal: 1.73 ± 0.705
0.376GlnTrp: 0.376 ± 0.104
1.73GlnTyr: 1.73 ± 0.653
0.0GlnXaa: 0.0 ± 0.0
Arg
3.309ArgAla: 3.309 ± 0.598
1.805ArgCys: 1.805 ± 0.432
1.053ArgAsp: 1.053 ± 0.126
0.677ArgGlu: 0.677 ± 0.186
2.106ArgPhe: 2.106 ± 0.209
2.106ArgGly: 2.106 ± 0.513
1.128ArgHis: 1.128 ± 0.157
1.429ArgIle: 1.429 ± 0.401
1.955ArgLys: 1.955 ± 0.411
3.986ArgLeu: 3.986 ± 0.539
1.203ArgMet: 1.203 ± 0.208
1.955ArgAsn: 1.955 ± 0.455
1.053ArgPro: 1.053 ± 0.234
0.978ArgGln: 0.978 ± 0.584
2.181ArgArg: 2.181 ± 0.252
2.858ArgSer: 2.858 ± 0.523
2.933ArgThr: 2.933 ± 0.609
3.309ArgVal: 3.309 ± 0.33
0.226ArgTrp: 0.226 ± 0.131
1.654ArgTyr: 1.654 ± 0.125
0.0ArgXaa: 0.0 ± 0.0
Ser
6.994SerAla: 6.994 ± 0.731
2.106SerCys: 2.106 ± 0.343
4.963SerAsp: 4.963 ± 0.398
2.557SerGlu: 2.557 ± 0.538
4.662SerPhe: 4.662 ± 0.536
5.264SerGly: 5.264 ± 0.538
1.354SerHis: 1.354 ± 0.307
4.362SerIle: 4.362 ± 0.962
2.707SerLys: 2.707 ± 0.494
5.339SerLeu: 5.339 ± 0.325
0.677SerMet: 0.677 ± 0.168
3.384SerAsn: 3.384 ± 0.632
1.73SerPro: 1.73 ± 0.458
2.632SerGln: 2.632 ± 0.531
3.083SerArg: 3.083 ± 0.687
4.738SerSer: 4.738 ± 0.966
5.941SerThr: 5.941 ± 0.356
6.392SerVal: 6.392 ± 0.312
0.526SerTrp: 0.526 ± 0.156
4.437SerTyr: 4.437 ± 0.645
0.0SerXaa: 0.0 ± 0.0
Thr
3.008ThrAla: 3.008 ± 0.295
1.654ThrCys: 1.654 ± 0.25
3.91ThrAsp: 3.91 ± 0.171
2.256ThrGlu: 2.256 ± 0.294
2.933ThrPhe: 2.933 ± 0.445
4.286ThrGly: 4.286 ± 0.645
0.376ThrHis: 0.376 ± 0.152
3.459ThrIle: 3.459 ± 0.498
2.181ThrLys: 2.181 ± 0.294
5.414ThrLeu: 5.414 ± 0.727
1.88ThrMet: 1.88 ± 0.548
2.782ThrAsn: 2.782 ± 0.816
2.782ThrPro: 2.782 ± 0.688
2.181ThrGln: 2.181 ± 0.21
1.88ThrArg: 1.88 ± 0.426
4.963ThrSer: 4.963 ± 0.375
4.512ThrThr: 4.512 ± 0.632
6.918ThrVal: 6.918 ± 0.662
0.301ThrTrp: 0.301 ± 0.106
3.008ThrTyr: 3.008 ± 0.45
0.0ThrXaa: 0.0 ± 0.0
Val
7.294ValAla: 7.294 ± 0.835
3.534ValCys: 3.534 ± 0.334
6.768ValAsp: 6.768 ± 0.28
3.685ValGlu: 3.685 ± 0.324
4.061ValPhe: 4.061 ± 0.258
6.618ValGly: 6.618 ± 0.931
1.504ValHis: 1.504 ± 0.297
3.76ValIle: 3.76 ± 0.646
7.219ValLys: 7.219 ± 1.494
8.347ValLeu: 8.347 ± 0.73
2.03ValMet: 2.03 ± 0.279
5.64ValAsn: 5.64 ± 0.424
3.234ValPro: 3.234 ± 0.295
2.482ValGln: 2.482 ± 0.56
3.986ValArg: 3.986 ± 0.504
8.122ValSer: 8.122 ± 0.853
6.091ValThr: 6.091 ± 1.079
10.378ValVal: 10.378 ± 0.873
0.902ValTrp: 0.902 ± 0.153
3.083ValTyr: 3.083 ± 0.549
0.0ValXaa: 0.0 ± 0.0
Trp
0.376TrpAla: 0.376 ± 0.126
0.451TrpCys: 0.451 ± 0.131
1.053TrpAsp: 1.053 ± 0.109
0.376TrpGlu: 0.376 ± 0.104
0.752TrpPhe: 0.752 ± 0.107
0.15TrpGly: 0.15 ± 0.1
0.376TrpHis: 0.376 ± 0.111
0.226TrpIle: 0.226 ± 0.065
0.451TrpLys: 0.451 ± 0.205
1.504TrpLeu: 1.504 ± 0.237
0.226TrpMet: 0.226 ± 0.065
0.827TrpAsn: 0.827 ± 0.256
0.602TrpPro: 0.602 ± 0.216
0.075TrpGln: 0.075 ± 0.05
0.752TrpArg: 0.752 ± 0.233
1.203TrpSer: 1.203 ± 0.219
0.526TrpThr: 0.526 ± 0.054
1.429TrpVal: 1.429 ± 0.194
0.376TrpTrp: 0.376 ± 0.075
0.15TrpTyr: 0.15 ± 0.1
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.685TyrAla: 3.685 ± 0.425
1.88TyrCys: 1.88 ± 0.17
3.083TyrAsp: 3.083 ± 0.781
2.331TyrGlu: 2.331 ± 0.503
3.008TyrPhe: 3.008 ± 0.375
3.158TyrGly: 3.158 ± 0.454
0.526TyrHis: 0.526 ± 0.168
1.805TyrIle: 1.805 ± 0.146
2.106TyrLys: 2.106 ± 0.288
2.782TyrLeu: 2.782 ± 0.426
0.902TyrMet: 0.902 ± 0.2
2.632TyrAsn: 2.632 ± 0.514
1.354TyrPro: 1.354 ± 0.304
0.677TyrGln: 0.677 ± 0.335
1.579TyrArg: 1.579 ± 0.117
3.083TyrSer: 3.083 ± 0.732
2.632TyrThr: 2.632 ± 0.163
4.362TyrVal: 4.362 ± 0.619
0.752TyrTrp: 0.752 ± 0.21
1.955TyrTyr: 1.955 ± 0.723
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (13299 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski