Amino acid dipepetide frequency for Murine coronavirus (strain 2) (MHV-2) (Murine hepatitis virus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.23AlaAla: 5.23 ± 0.303
2.898AlaCys: 2.898 ± 0.525
4.24AlaAsp: 4.24 ± 0.492
2.261AlaGlu: 2.261 ± 0.357
4.382AlaPhe: 4.382 ± 0.596
3.746AlaGly: 3.746 ± 0.511
1.272AlaHis: 1.272 ± 0.264
4.452AlaIle: 4.452 ± 0.329
5.018AlaLys: 5.018 ± 0.513
4.806AlaLeu: 4.806 ± 0.484
1.555AlaMet: 1.555 ± 0.287
4.311AlaAsn: 4.311 ± 0.528
2.544AlaPro: 2.544 ± 0.709
1.908AlaGln: 1.908 ± 0.605
1.979AlaArg: 1.979 ± 0.198
5.3AlaSer: 5.3 ± 0.591
3.534AlaThr: 3.534 ± 0.341
6.078AlaVal: 6.078 ± 1.176
1.131AlaTrp: 1.131 ± 0.338
2.544AlaTyr: 2.544 ± 0.274
0.0AlaXaa: 0.0 ± 0.0
Cys
1.908CysAla: 1.908 ± 0.248
1.837CysCys: 1.837 ± 0.2
2.544CysAsp: 2.544 ± 0.33
1.131CysGlu: 1.131 ± 0.217
2.191CysPhe: 2.191 ± 0.202
2.898CysGly: 2.898 ± 0.434
0.495CysHis: 0.495 ± 0.069
1.767CysIle: 1.767 ± 0.389
2.544CysLys: 2.544 ± 0.237
2.827CysLeu: 2.827 ± 0.316
0.353CysMet: 0.353 ± 0.523
2.403CysAsn: 2.403 ± 0.214
0.989CysPro: 0.989 ± 0.157
0.989CysGln: 0.989 ± 0.309
1.625CysArg: 1.625 ± 0.281
3.534CysSer: 3.534 ± 0.548
2.049CysThr: 2.049 ± 0.527
3.322CysVal: 3.322 ± 0.353
0.636CysTrp: 0.636 ± 0.203
2.261CysTyr: 2.261 ± 0.474
0.0CysXaa: 0.0 ± 0.0
Asp
4.24AspAla: 4.24 ± 0.46
2.191AspCys: 2.191 ± 0.465
3.463AspAsp: 3.463 ± 0.569
2.756AspGlu: 2.756 ± 0.249
3.322AspPhe: 3.322 ± 0.473
4.664AspGly: 4.664 ± 0.313
0.565AspHis: 0.565 ± 0.175
2.049AspIle: 2.049 ± 0.963
3.392AspLys: 3.392 ± 0.222
5.442AspLeu: 5.442 ± 0.465
1.979AspMet: 1.979 ± 0.155
2.049AspAsn: 2.049 ± 0.285
1.625AspPro: 1.625 ± 0.329
1.555AspGln: 1.555 ± 0.423
1.201AspArg: 1.201 ± 0.236
4.594AspSer: 4.594 ± 0.388
2.12AspThr: 2.12 ± 0.158
7.845AspVal: 7.845 ± 0.9
0.353AspTrp: 0.353 ± 0.174
2.473AspTyr: 2.473 ± 0.445
0.0AspXaa: 0.0 ± 0.0
Glu
3.604GluAla: 3.604 ± 0.187
1.201GluCys: 1.201 ± 0.23
3.322GluAsp: 3.322 ± 0.25
2.686GluGlu: 2.686 ± 0.759
2.756GluPhe: 2.756 ± 0.366
1.908GluGly: 1.908 ± 0.319
0.424GluHis: 0.424 ± 0.135
1.979GluIle: 1.979 ± 0.486
2.191GluLys: 2.191 ± 0.22
4.311GluLeu: 4.311 ± 0.976
0.777GluMet: 0.777 ± 0.285
1.272GluAsn: 1.272 ± 0.216
1.555GluPro: 1.555 ± 0.375
0.919GluGln: 0.919 ± 0.193
1.484GluArg: 1.484 ± 0.221
1.908GluSer: 1.908 ± 0.275
2.12GluThr: 2.12 ± 0.359
4.17GluVal: 4.17 ± 0.289
0.565GluTrp: 0.565 ± 0.37
1.413GluTyr: 1.413 ± 0.274
0.0GluXaa: 0.0 ± 0.0
Phe
3.039PheAla: 3.039 ± 0.273
2.332PheCys: 2.332 ± 0.448
3.392PheAsp: 3.392 ± 0.412
2.049PheGlu: 2.049 ± 0.282
2.12PhePhe: 2.12 ± 0.273
3.534PheGly: 3.534 ± 0.55
0.707PheHis: 0.707 ± 0.139
2.261PheIle: 2.261 ± 0.287
3.816PheLys: 3.816 ± 0.464
3.958PheLeu: 3.958 ± 0.312
1.06PheMet: 1.06 ± 0.322
4.311PheAsn: 4.311 ± 0.479
1.484PhePro: 1.484 ± 0.122
1.413PheGln: 1.413 ± 0.202
1.696PheArg: 1.696 ± 0.409
3.887PheSer: 3.887 ± 0.285
3.11PheThr: 3.11 ± 0.361
6.29PheVal: 6.29 ± 0.668
0.565PheTrp: 0.565 ± 0.175
3.534PheTyr: 3.534 ± 0.61
0.0PheXaa: 0.0 ± 0.0
Gly
3.18GlyAla: 3.18 ± 0.24
3.251GlyCys: 3.251 ± 0.415
3.392GlyAsp: 3.392 ± 0.306
1.555GlyGlu: 1.555 ± 0.265
3.887GlyPhe: 3.887 ± 0.513
3.604GlyGly: 3.604 ± 0.379
1.343GlyHis: 1.343 ± 0.177
2.615GlyIle: 2.615 ± 1.187
4.17GlyLys: 4.17 ± 0.538
5.088GlyLeu: 5.088 ± 0.351
1.625GlyMet: 1.625 ± 0.318
3.392GlyAsn: 3.392 ± 0.402
1.201GlyPro: 1.201 ± 0.301
1.625GlyGln: 1.625 ± 0.548
1.625GlyArg: 1.625 ± 0.531
5.018GlySer: 5.018 ± 0.896
4.028GlyThr: 4.028 ± 0.347
6.855GlyVal: 6.855 ± 0.52
0.777GlyTrp: 0.777 ± 0.114
3.18GlyTyr: 3.18 ± 0.277
0.0GlyXaa: 0.0 ± 0.0
His
1.413HisAla: 1.413 ± 0.274
0.353HisCys: 0.353 ± 0.198
1.06HisAsp: 1.06 ± 0.161
0.919HisGlu: 0.919 ± 0.186
1.484HisPhe: 1.484 ± 0.234
0.424HisGly: 0.424 ± 0.139
0.0HisHis: 0.0 ± 0.0
0.777HisIle: 0.777 ± 0.207
1.131HisLys: 1.131 ± 0.267
1.413HisLeu: 1.413 ± 0.384
0.495HisMet: 0.495 ± 0.194
1.06HisAsn: 1.06 ± 0.22
0.565HisPro: 0.565 ± 0.129
0.707HisGln: 0.707 ± 0.283
0.424HisArg: 0.424 ± 0.063
0.777HisSer: 0.777 ± 0.134
0.565HisThr: 0.565 ± 0.23
2.403HisVal: 2.403 ± 0.637
0.141HisTrp: 0.141 ± 0.046
0.495HisTyr: 0.495 ± 0.232
0.0HisXaa: 0.0 ± 0.0
Ile
2.403IleAla: 2.403 ± 0.113
1.201IleCys: 1.201 ± 0.188
1.979IleAsp: 1.979 ± 0.664
1.625IleGlu: 1.625 ± 0.207
1.908IlePhe: 1.908 ± 0.202
3.675IleGly: 3.675 ± 0.7
0.565IleHis: 0.565 ± 0.166
2.473IleIle: 2.473 ± 0.667
3.039IleLys: 3.039 ± 0.452
4.099IleLeu: 4.099 ± 0.382
0.989IleMet: 0.989 ± 0.294
2.898IleAsn: 2.898 ± 0.662
1.484IlePro: 1.484 ± 0.346
1.837IleGln: 1.837 ± 0.381
1.979IleArg: 1.979 ± 0.436
2.12IleSer: 2.12 ± 0.811
2.756IleThr: 2.756 ± 0.222
4.099IleVal: 4.099 ± 0.529
0.636IleTrp: 0.636 ± 0.273
0.848IleTyr: 0.848 ± 0.241
0.0IleXaa: 0.0 ± 0.0
Lys
3.392LysAla: 3.392 ± 0.346
2.544LysCys: 2.544 ± 0.33
1.908LysAsp: 1.908 ± 0.382
2.756LysGlu: 2.756 ± 0.249
3.251LysPhe: 3.251 ± 0.619
4.523LysGly: 4.523 ± 0.673
1.201LysHis: 1.201 ± 0.261
2.615LysIle: 2.615 ± 0.403
2.12LysLys: 2.12 ± 0.23
6.572LysLeu: 6.572 ± 0.657
1.06LysMet: 1.06 ± 0.152
1.908LysAsn: 1.908 ± 0.227
2.827LysPro: 2.827 ± 0.439
2.898LysGln: 2.898 ± 0.521
2.332LysArg: 2.332 ± 0.166
3.887LysSer: 3.887 ± 0.355
2.473LysThr: 2.473 ± 0.239
6.36LysVal: 6.36 ± 0.828
1.272LysTrp: 1.272 ± 0.176
3.039LysTyr: 3.039 ± 0.677
0.0LysXaa: 0.0 ± 0.0
Leu
6.714LeuAla: 6.714 ± 0.626
4.382LeuCys: 4.382 ± 0.643
4.876LeuAsp: 4.876 ± 0.434
4.099LeuGlu: 4.099 ± 0.47
5.159LeuPhe: 5.159 ± 0.503
4.947LeuGly: 4.947 ± 0.536
0.989LeuHis: 0.989 ± 0.268
3.18LeuIle: 3.18 ± 0.295
4.17LeuLys: 4.17 ± 0.268
7.35LeuLeu: 7.35 ± 0.584
1.413LeuMet: 1.413 ± 0.307
4.947LeuAsn: 4.947 ± 0.614
3.958LeuPro: 3.958 ± 0.589
4.452LeuGln: 4.452 ± 0.498
3.534LeuArg: 3.534 ± 0.491
7.845LeuSer: 7.845 ± 0.563
5.654LeuThr: 5.654 ± 0.416
8.41LeuVal: 8.41 ± 0.891
1.343LeuTrp: 1.343 ± 0.365
5.23LeuTyr: 5.23 ± 0.594
0.0LeuXaa: 0.0 ± 0.0
Met
2.332MetAla: 2.332 ± 0.631
0.919MetCys: 0.919 ± 0.367
1.625MetAsp: 1.625 ± 0.235
0.353MetGlu: 0.353 ± 0.144
1.343MetPhe: 1.343 ± 0.187
0.848MetGly: 0.848 ± 0.335
0.919MetHis: 0.919 ± 0.287
0.565MetIle: 0.565 ± 0.108
0.424MetLys: 0.424 ± 0.249
3.18MetLeu: 3.18 ± 0.337
0.565MetMet: 0.565 ± 0.173
0.989MetAsn: 0.989 ± 0.272
1.625MetPro: 1.625 ± 0.358
1.343MetGln: 1.343 ± 0.114
0.848MetArg: 0.848 ± 0.258
1.555MetSer: 1.555 ± 0.247
1.343MetThr: 1.343 ± 0.275
1.131MetVal: 1.131 ± 0.302
0.495MetTrp: 0.495 ± 0.177
1.272MetTyr: 1.272 ± 0.227
0.0MetXaa: 0.0 ± 0.0
Asn
3.675AsnAla: 3.675 ± 0.8
1.837AsnCys: 1.837 ± 0.375
1.696AsnAsp: 1.696 ± 0.189
2.403AsnGlu: 2.403 ± 0.35
2.898AsnPhe: 2.898 ± 0.625
3.746AsnGly: 3.746 ± 0.469
0.989AsnHis: 0.989 ± 0.411
1.555AsnIle: 1.555 ± 0.289
2.968AsnLys: 2.968 ± 0.512
3.534AsnLeu: 3.534 ± 0.758
1.201AsnMet: 1.201 ± 0.235
2.403AsnAsn: 2.403 ± 0.941
1.979AsnPro: 1.979 ± 0.303
1.767AsnGln: 1.767 ± 0.629
2.544AsnArg: 2.544 ± 0.513
3.18AsnSer: 3.18 ± 0.264
2.615AsnThr: 2.615 ± 0.556
5.442AsnVal: 5.442 ± 0.674
0.707AsnTrp: 0.707 ± 0.141
1.837AsnTyr: 1.837 ± 0.555
0.0AsnXaa: 0.0 ± 0.0
Pro
3.039ProAla: 3.039 ± 0.484
0.919ProCys: 0.919 ± 0.125
1.767ProAsp: 1.767 ± 0.317
1.908ProGlu: 1.908 ± 0.304
1.272ProPhe: 1.272 ± 0.229
2.403ProGly: 2.403 ± 0.387
0.989ProHis: 0.989 ± 0.626
1.272ProIle: 1.272 ± 0.185
2.403ProLys: 2.403 ± 0.376
3.251ProLeu: 3.251 ± 0.325
0.283ProMet: 0.283 ± 0.104
1.555ProAsn: 1.555 ± 0.943
1.201ProPro: 1.201 ± 0.429
1.131ProGln: 1.131 ± 0.199
1.555ProArg: 1.555 ± 0.371
2.191ProSer: 2.191 ± 0.642
3.18ProThr: 3.18 ± 0.228
3.11ProVal: 3.11 ± 0.368
0.565ProTrp: 0.565 ± 0.182
1.484ProTyr: 1.484 ± 0.274
0.0ProXaa: 0.0 ± 0.0
Gln
1.767GlnAla: 1.767 ± 0.552
1.06GlnCys: 1.06 ± 0.207
1.413GlnAsp: 1.413 ± 0.321
1.979GlnGlu: 1.979 ± 0.365
1.908GlnPhe: 1.908 ± 0.673
2.191GlnGly: 2.191 ± 0.324
1.06GlnHis: 1.06 ± 0.233
1.837GlnIle: 1.837 ± 0.238
1.625GlnLys: 1.625 ± 0.486
4.382GlnLeu: 4.382 ± 0.68
0.283GlnMet: 0.283 ± 0.104
1.272GlnAsn: 1.272 ± 0.302
0.989GlnPro: 0.989 ± 0.386
1.201GlnGln: 1.201 ± 0.328
0.848GlnArg: 0.848 ± 0.273
2.827GlnSer: 2.827 ± 0.295
1.696GlnThr: 1.696 ± 0.245
3.039GlnVal: 3.039 ± 0.595
1.272GlnTrp: 1.272 ± 0.291
1.272GlnTyr: 1.272 ± 0.552
0.0GlnXaa: 0.0 ± 0.0
Arg
3.251ArgAla: 3.251 ± 0.525
1.06ArgCys: 1.06 ± 0.265
2.403ArgAsp: 2.403 ± 0.349
1.413ArgGlu: 1.413 ± 0.117
2.12ArgPhe: 2.12 ± 0.202
1.979ArgGly: 1.979 ± 0.517
0.777ArgHis: 0.777 ± 0.155
1.06ArgIle: 1.06 ± 0.422
2.191ArgLys: 2.191 ± 0.444
3.887ArgLeu: 3.887 ± 0.366
0.777ArgMet: 0.777 ± 0.148
1.343ArgAsn: 1.343 ± 0.489
1.131ArgPro: 1.131 ± 0.268
0.848ArgGln: 0.848 ± 0.649
1.625ArgArg: 1.625 ± 0.606
3.675ArgSer: 3.675 ± 1.001
1.696ArgThr: 1.696 ± 0.38
3.11ArgVal: 3.11 ± 0.408
0.071ArgTrp: 0.071 ± 0.127
1.696ArgTyr: 1.696 ± 0.299
0.0ArgXaa: 0.0 ± 0.0
Ser
6.219SerAla: 6.219 ± 0.758
2.332SerCys: 2.332 ± 0.41
4.311SerAsp: 4.311 ± 0.361
2.473SerGlu: 2.473 ± 0.176
3.816SerPhe: 3.816 ± 0.296
4.735SerGly: 4.735 ± 1.049
1.272SerHis: 1.272 ± 0.371
4.17SerIle: 4.17 ± 0.64
3.534SerLys: 3.534 ± 0.322
7.703SerLeu: 7.703 ± 0.42
2.403SerMet: 2.403 ± 0.26
2.615SerAsn: 2.615 ± 0.818
2.191SerPro: 2.191 ± 0.747
1.908SerGln: 1.908 ± 0.339
2.756SerArg: 2.756 ± 0.691
5.088SerSer: 5.088 ± 0.831
2.968SerThr: 2.968 ± 0.278
7.067SerVal: 7.067 ± 0.924
0.777SerTrp: 0.777 ± 0.25
3.18SerTyr: 3.18 ± 0.468
0.0SerXaa: 0.0 ± 0.0
Thr
3.392ThrAla: 3.392 ± 0.358
1.484ThrCys: 1.484 ± 0.261
4.099ThrAsp: 4.099 ± 0.435
2.191ThrGlu: 2.191 ± 0.234
3.463ThrPhe: 3.463 ± 0.44
4.664ThrGly: 4.664 ± 1.275
1.131ThrHis: 1.131 ± 0.346
2.12ThrIle: 2.12 ± 0.72
3.11ThrLys: 3.11 ± 0.492
5.442ThrLeu: 5.442 ± 0.418
2.473ThrMet: 2.473 ± 0.261
1.979ThrAsn: 1.979 ± 0.314
1.908ThrPro: 1.908 ± 0.423
1.908ThrGln: 1.908 ± 0.203
1.837ThrArg: 1.837 ± 0.434
3.322ThrSer: 3.322 ± 0.814
3.746ThrThr: 3.746 ± 0.686
4.311ThrVal: 4.311 ± 0.441
0.565ThrTrp: 0.565 ± 0.112
2.827ThrTyr: 2.827 ± 0.301
0.0ThrXaa: 0.0 ± 0.0
Val
6.36ValAla: 6.36 ± 0.732
4.028ValCys: 4.028 ± 0.524
7.279ValAsp: 7.279 ± 0.901
4.028ValGlu: 4.028 ± 0.506
3.604ValPhe: 3.604 ± 0.348
3.887ValGly: 3.887 ± 0.369
0.636ValHis: 0.636 ± 0.203
4.17ValIle: 4.17 ± 1.03
8.198ValLys: 8.198 ± 0.943
9.682ValLeu: 9.682 ± 1.112
2.827ValMet: 2.827 ± 0.348
4.947ValAsn: 4.947 ± 0.668
4.452ValPro: 4.452 ± 0.555
3.392ValGln: 3.392 ± 0.495
3.322ValArg: 3.322 ± 0.368
6.643ValSer: 6.643 ± 0.384
5.583ValThr: 5.583 ± 0.656
12.367ValVal: 12.367 ± 2.405
0.919ValTrp: 0.919 ± 0.18
4.735ValTyr: 4.735 ± 0.667
0.0ValXaa: 0.0 ± 0.0
Trp
0.848TrpAla: 0.848 ± 0.188
0.495TrpCys: 0.495 ± 0.202
0.353TrpAsp: 0.353 ± 0.218
0.283TrpGlu: 0.283 ± 0.083
1.131TrpPhe: 1.131 ± 0.201
0.353TrpGly: 0.353 ± 0.107
0.424TrpHis: 0.424 ± 0.13
0.424TrpIle: 0.424 ± 0.182
0.141TrpLys: 0.141 ± 0.119
2.473TrpLeu: 2.473 ± 0.397
0.283TrpMet: 0.283 ± 0.093
0.919TrpAsn: 0.919 ± 0.181
0.565TrpPro: 0.565 ± 0.123
0.495TrpGln: 0.495 ± 0.089
0.777TrpArg: 0.777 ± 0.236
1.131TrpSer: 1.131 ± 0.187
0.495TrpThr: 0.495 ± 0.135
0.919TrpVal: 0.919 ± 0.08
0.071TrpTrp: 0.071 ± 0.092
0.777TrpTyr: 0.777 ± 0.371
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.18TyrAla: 3.18 ± 0.528
2.049TyrCys: 2.049 ± 0.433
2.756TyrAsp: 2.756 ± 0.469
1.696TyrGlu: 1.696 ± 0.129
2.544TyrPhe: 2.544 ± 0.475
2.686TyrGly: 2.686 ± 0.271
0.848TyrHis: 0.848 ± 0.282
1.272TyrIle: 1.272 ± 0.201
2.756TyrLys: 2.756 ± 0.693
3.251TyrLeu: 3.251 ± 0.313
1.201TyrMet: 1.201 ± 0.163
2.403TyrAsn: 2.403 ± 0.399
1.201TyrPro: 1.201 ± 0.228
1.625TyrGln: 1.625 ± 0.177
2.12TyrArg: 2.12 ± 0.256
3.039TyrSer: 3.039 ± 0.268
4.24TyrThr: 4.24 ± 0.549
4.735TyrVal: 4.735 ± 0.294
0.495TyrTrp: 0.495 ± 0.125
3.11TyrTyr: 3.11 ± 0.459
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (14151 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski