Amino acid dipepetide frequency for Murine coronavirus (strain A59) (MHV-A59) (Murine hepatitis virus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.313AlaAla: 5.313 ± 0.233
2.892AlaCys: 2.892 ± 0.398
4.372AlaAsp: 4.372 ± 0.387
2.421AlaGlu: 2.421 ± 0.204
4.372AlaPhe: 4.372 ± 0.279
4.305AlaGly: 4.305 ± 0.323
1.412AlaHis: 1.412 ± 0.18
4.17AlaIle: 4.17 ± 0.254
4.439AlaLys: 4.439 ± 0.393
5.112AlaLeu: 5.112 ± 0.453
1.48AlaMet: 1.48 ± 0.173
4.708AlaAsn: 4.708 ± 0.243
2.22AlaPro: 2.22 ± 0.608
2.085AlaGln: 2.085 ± 0.481
1.816AlaArg: 1.816 ± 0.229
5.515AlaSer: 5.515 ± 0.382
3.43AlaThr: 3.43 ± 0.304
6.995AlaVal: 6.995 ± 0.64
1.076AlaTrp: 1.076 ± 0.259
2.421AlaTyr: 2.421 ± 0.26
0.0AlaXaa: 0.0 ± 0.0
Cys
2.354CysAla: 2.354 ± 0.275
1.749CysCys: 1.749 ± 0.17
1.681CysAsp: 1.681 ± 0.138
1.278CysGlu: 1.278 ± 0.22
2.287CysPhe: 2.287 ± 0.123
2.69CysGly: 2.69 ± 0.262
0.404CysHis: 0.404 ± 0.108
2.489CysIle: 2.489 ± 0.327
2.489CysLys: 2.489 ± 0.268
3.094CysLeu: 3.094 ± 0.305
0.471CysMet: 0.471 ± 0.234
2.421CysAsn: 2.421 ± 0.211
1.345CysPro: 1.345 ± 0.181
1.143CysGln: 1.143 ± 0.303
1.48CysArg: 1.48 ± 0.156
3.565CysSer: 3.565 ± 0.561
2.018CysThr: 2.018 ± 0.213
2.556CysVal: 2.556 ± 0.372
0.605CysTrp: 0.605 ± 0.162
2.152CysTyr: 2.152 ± 0.385
0.0CysXaa: 0.0 ± 0.0
Asp
4.439AspAla: 4.439 ± 0.289
1.681AspCys: 1.681 ± 0.126
3.228AspAsp: 3.228 ± 0.282
2.892AspGlu: 2.892 ± 0.225
3.161AspPhe: 3.161 ± 0.487
4.708AspGly: 4.708 ± 0.23
0.538AspHis: 0.538 ± 0.201
2.018AspIle: 2.018 ± 0.459
3.228AspLys: 3.228 ± 0.277
4.977AspLeu: 4.977 ± 0.418
1.749AspMet: 1.749 ± 0.244
2.287AspAsn: 2.287 ± 0.408
1.749AspPro: 1.749 ± 0.301
1.681AspGln: 1.681 ± 0.287
1.547AspArg: 1.547 ± 0.198
4.103AspSer: 4.103 ± 0.357
2.018AspThr: 2.018 ± 0.17
6.928AspVal: 6.928 ± 0.928
0.404AspTrp: 0.404 ± 0.193
2.623AspTyr: 2.623 ± 0.285
0.0AspXaa: 0.0 ± 0.0
Glu
4.641GluAla: 4.641 ± 0.334
1.345GluCys: 1.345 ± 0.257
2.892GluAsp: 2.892 ± 0.172
2.758GluGlu: 2.758 ± 0.369
2.489GluPhe: 2.489 ± 0.296
2.421GluGly: 2.421 ± 0.38
0.404GluHis: 0.404 ± 0.108
1.749GluIle: 1.749 ± 0.273
2.287GluLys: 2.287 ± 0.235
4.036GluLeu: 4.036 ± 0.49
0.874GluMet: 0.874 ± 0.291
1.278GluAsn: 1.278 ± 0.237
1.95GluPro: 1.95 ± 0.277
0.874GluGln: 0.874 ± 0.146
1.547GluArg: 1.547 ± 0.235
2.018GluSer: 2.018 ± 0.223
2.287GluThr: 2.287 ± 0.269
4.17GluVal: 4.17 ± 0.34
0.471GluTrp: 0.471 ± 0.159
1.614GluTyr: 1.614 ± 0.292
0.0GluXaa: 0.0 ± 0.0
Phe
2.892PheAla: 2.892 ± 0.208
2.018PheCys: 2.018 ± 0.227
3.43PheAsp: 3.43 ± 0.385
1.883PheGlu: 1.883 ± 0.17
1.681PhePhe: 1.681 ± 0.178
3.228PheGly: 3.228 ± 0.414
0.874PheHis: 0.874 ± 0.247
2.758PheIle: 2.758 ± 0.536
3.901PheLys: 3.901 ± 0.482
3.363PheLeu: 3.363 ± 0.334
1.211PheMet: 1.211 ± 0.165
4.036PheAsn: 4.036 ± 0.263
1.48PhePro: 1.48 ± 0.1
1.278PheGln: 1.278 ± 0.223
1.883PheArg: 1.883 ± 0.43
3.699PheSer: 3.699 ± 0.354
3.027PheThr: 3.027 ± 0.47
6.255PheVal: 6.255 ± 0.833
0.605PheTrp: 0.605 ± 0.195
3.766PheTyr: 3.766 ± 0.423
0.0PheXaa: 0.0 ± 0.0
Gly
3.161GlyAla: 3.161 ± 0.459
3.296GlyCys: 3.296 ± 0.286
3.565GlyAsp: 3.565 ± 0.403
1.211GlyGlu: 1.211 ± 0.128
3.834GlyPhe: 3.834 ± 0.723
3.43GlyGly: 3.43 ± 0.359
1.614GlyHis: 1.614 ± 0.342
2.825GlyIle: 2.825 ± 0.494
3.834GlyLys: 3.834 ± 0.348
5.044GlyLeu: 5.044 ± 0.41
1.278GlyMet: 1.278 ± 0.203
3.497GlyAsn: 3.497 ± 0.541
1.614GlyPro: 1.614 ± 0.442
1.48GlyGln: 1.48 ± 0.37
1.883GlyArg: 1.883 ± 0.415
5.313GlySer: 5.313 ± 0.308
3.766GlyThr: 3.766 ± 0.248
7.197GlyVal: 7.197 ± 0.398
0.74GlyTrp: 0.74 ± 0.101
3.228GlyTyr: 3.228 ± 0.288
0.0GlyXaa: 0.0 ± 0.0
His
1.278HisAla: 1.278 ± 0.348
0.471HisCys: 0.471 ± 0.234
1.143HisAsp: 1.143 ± 0.12
1.009HisGlu: 1.009 ± 0.155
1.48HisPhe: 1.48 ± 0.167
0.538HisGly: 0.538 ± 0.354
0.135HisHis: 0.135 ± 0.167
0.673HisIle: 0.673 ± 0.086
1.076HisLys: 1.076 ± 0.174
1.749HisLeu: 1.749 ± 0.319
0.471HisMet: 0.471 ± 0.144
1.009HisAsn: 1.009 ± 0.207
0.538HisPro: 0.538 ± 0.111
0.538HisGln: 0.538 ± 0.062
0.404HisArg: 0.404 ± 0.047
0.807HisSer: 0.807 ± 0.12
0.874HisThr: 0.874 ± 0.14
2.287HisVal: 2.287 ± 0.422
0.269HisTrp: 0.269 ± 0.102
0.538HisTyr: 0.538 ± 0.14
0.0HisXaa: 0.0 ± 0.0
Ile
2.623IleAla: 2.623 ± 0.251
1.816IleCys: 1.816 ± 0.356
2.22IleAsp: 2.22 ± 0.339
1.681IleGlu: 1.681 ± 0.206
1.749IlePhe: 1.749 ± 0.249
3.43IleGly: 3.43 ± 0.427
0.471IleHis: 0.471 ± 0.139
2.085IleIle: 2.085 ± 0.486
3.632IleLys: 3.632 ± 0.425
4.574IleLeu: 4.574 ± 0.405
0.942IleMet: 0.942 ± 0.257
2.22IleAsn: 2.22 ± 0.393
1.345IlePro: 1.345 ± 0.191
1.48IleGln: 1.48 ± 0.293
1.681IleArg: 1.681 ± 0.375
2.085IleSer: 2.085 ± 0.763
2.892IleThr: 2.892 ± 0.218
4.506IleVal: 4.506 ± 0.433
0.404IleTrp: 0.404 ± 0.129
1.009IleTyr: 1.009 ± 0.188
0.067IleXaa: 0.067 ± 0.119
Lys
4.506LysAla: 4.506 ± 0.426
2.22LysCys: 2.22 ± 0.254
2.018LysAsp: 2.018 ± 0.485
2.623LysGlu: 2.623 ± 0.107
3.296LysPhe: 3.296 ± 0.487
4.305LysGly: 4.305 ± 0.371
1.211LysHis: 1.211 ± 0.31
2.758LysIle: 2.758 ± 0.278
1.95LysLys: 1.95 ± 0.252
6.322LysLeu: 6.322 ± 0.49
0.874LysMet: 0.874 ± 0.099
1.816LysAsn: 1.816 ± 0.188
3.43LysPro: 3.43 ± 0.349
2.892LysGln: 2.892 ± 0.443
2.421LysArg: 2.421 ± 0.276
3.027LysSer: 3.027 ± 0.26
2.22LysThr: 2.22 ± 0.185
5.784LysVal: 5.784 ± 0.666
1.211LysTrp: 1.211 ± 0.148
2.758LysTyr: 2.758 ± 0.471
0.0LysXaa: 0.0 ± 0.0
Leu
6.928LeuAla: 6.928 ± 0.613
3.968LeuCys: 3.968 ± 0.416
5.112LeuAsp: 5.112 ± 0.424
4.305LeuGlu: 4.305 ± 0.358
5.313LeuPhe: 5.313 ± 0.293
5.112LeuGly: 5.112 ± 0.432
1.278LeuHis: 1.278 ± 0.128
3.094LeuIle: 3.094 ± 0.319
3.968LeuLys: 3.968 ± 0.376
8.206LeuLeu: 8.206 ± 0.966
1.547LeuMet: 1.547 ± 0.265
4.708LeuAsn: 4.708 ± 0.858
4.439LeuPro: 4.439 ± 0.637
4.506LeuGln: 4.506 ± 0.389
3.43LeuArg: 3.43 ± 0.376
7.197LeuSer: 7.197 ± 0.415
5.381LeuThr: 5.381 ± 0.207
7.6LeuVal: 7.6 ± 0.684
1.278LeuTrp: 1.278 ± 0.28
4.843LeuTyr: 4.843 ± 0.359
0.0LeuXaa: 0.0 ± 0.0
Met
2.152MetAla: 2.152 ± 0.383
0.874MetCys: 0.874 ± 0.15
1.211MetAsp: 1.211 ± 0.183
0.673MetGlu: 0.673 ± 0.222
1.143MetPhe: 1.143 ± 0.159
0.942MetGly: 0.942 ± 0.236
0.874MetHis: 0.874 ± 0.198
0.404MetIle: 0.404 ± 0.106
0.404MetLys: 0.404 ± 0.167
3.363MetLeu: 3.363 ± 0.313
0.605MetMet: 0.605 ± 0.162
0.807MetAsn: 0.807 ± 0.145
1.547MetPro: 1.547 ± 0.255
1.278MetGln: 1.278 ± 0.177
0.807MetArg: 0.807 ± 0.179
1.412MetSer: 1.412 ± 0.219
1.278MetThr: 1.278 ± 0.205
1.345MetVal: 1.345 ± 0.273
0.404MetTrp: 0.404 ± 0.112
1.211MetTyr: 1.211 ± 0.196
0.0MetXaa: 0.0 ± 0.0
Asn
3.632AsnAla: 3.632 ± 0.421
1.883AsnCys: 1.883 ± 0.294
1.681AsnAsp: 1.681 ± 0.25
1.883AsnGlu: 1.883 ± 0.136
2.623AsnPhe: 2.623 ± 0.334
3.901AsnGly: 3.901 ± 0.501
0.807AsnHis: 0.807 ± 0.137
1.681AsnIle: 1.681 ± 0.388
2.69AsnLys: 2.69 ± 0.332
3.363AsnLeu: 3.363 ± 0.374
1.278AsnMet: 1.278 ± 0.134
2.69AsnAsn: 2.69 ± 0.712
2.085AsnPro: 2.085 ± 0.223
1.95AsnGln: 1.95 ± 0.498
2.22AsnArg: 2.22 ± 0.348
3.632AsnSer: 3.632 ± 0.472
2.623AsnThr: 2.623 ± 0.293
5.717AsnVal: 5.717 ± 0.376
0.605AsnTrp: 0.605 ± 0.081
1.95AsnTyr: 1.95 ± 0.443
0.0AsnXaa: 0.0 ± 0.0
Pro
2.892ProAla: 2.892 ± 0.293
1.143ProCys: 1.143 ± 0.137
2.152ProAsp: 2.152 ± 0.252
2.018ProGlu: 2.018 ± 0.224
1.412ProPhe: 1.412 ± 0.203
2.421ProGly: 2.421 ± 0.37
1.009ProHis: 1.009 ± 0.358
1.547ProIle: 1.547 ± 0.294
2.489ProLys: 2.489 ± 0.531
3.296ProLeu: 3.296 ± 0.342
0.471ProMet: 0.471 ± 0.152
1.547ProAsn: 1.547 ± 0.558
1.143ProPro: 1.143 ± 0.282
1.345ProGln: 1.345 ± 0.264
1.749ProArg: 1.749 ± 0.199
2.758ProSer: 2.758 ± 0.599
3.497ProThr: 3.497 ± 0.346
3.43ProVal: 3.43 ± 0.335
0.538ProTrp: 0.538 ± 0.114
1.345ProTyr: 1.345 ± 0.271
0.0ProXaa: 0.0 ± 0.0
Gln
1.547GlnAla: 1.547 ± 0.197
1.211GlnCys: 1.211 ± 0.164
1.614GlnAsp: 1.614 ± 0.17
2.085GlnGlu: 2.085 ± 0.246
2.085GlnPhe: 2.085 ± 0.449
2.018GlnGly: 2.018 ± 0.231
1.076GlnHis: 1.076 ± 0.124
2.018GlnIle: 2.018 ± 0.268
1.883GlnLys: 1.883 ± 0.549
4.237GlnLeu: 4.237 ± 0.591
0.269GlnMet: 0.269 ± 0.095
1.211GlnAsn: 1.211 ± 0.228
1.076GlnPro: 1.076 ± 0.484
1.076GlnGln: 1.076 ± 0.269
0.874GlnArg: 0.874 ± 0.28
2.825GlnSer: 2.825 ± 0.364
1.95GlnThr: 1.95 ± 0.329
3.027GlnVal: 3.027 ± 0.359
1.143GlnTrp: 1.143 ± 0.256
1.211GlnTyr: 1.211 ± 0.299
0.0GlnXaa: 0.0 ± 0.0
Arg
2.825ArgAla: 2.825 ± 0.582
1.143ArgCys: 1.143 ± 0.158
2.354ArgAsp: 2.354 ± 0.247
1.614ArgGlu: 1.614 ± 0.164
1.681ArgPhe: 1.681 ± 0.224
2.354ArgGly: 2.354 ± 0.495
0.942ArgHis: 0.942 ± 0.261
0.807ArgIle: 0.807 ± 0.257
2.287ArgLys: 2.287 ± 0.369
3.968ArgLeu: 3.968 ± 0.368
1.009ArgMet: 1.009 ± 0.163
1.412ArgAsn: 1.412 ± 0.304
1.278ArgPro: 1.278 ± 0.262
1.076ArgGln: 1.076 ± 0.597
1.345ArgArg: 1.345 ± 0.442
3.766ArgSer: 3.766 ± 0.975
1.883ArgThr: 1.883 ± 0.203
3.094ArgVal: 3.094 ± 0.455
0.202ArgTrp: 0.202 ± 0.302
1.614ArgTyr: 1.614 ± 0.139
0.0ArgXaa: 0.0 ± 0.0
Ser
6.053SerAla: 6.053 ± 0.407
2.489SerCys: 2.489 ± 0.191
3.834SerAsp: 3.834 ± 0.264
3.228SerGlu: 3.228 ± 0.217
3.43SerPhe: 3.43 ± 0.193
4.237SerGly: 4.237 ± 0.877
1.345SerHis: 1.345 ± 0.233
3.766SerIle: 3.766 ± 0.34
3.766SerLys: 3.766 ± 0.305
7.264SerLeu: 7.264 ± 0.516
2.22SerMet: 2.22 ± 0.418
2.152SerAsn: 2.152 ± 0.18
2.152SerPro: 2.152 ± 0.288
2.085SerGln: 2.085 ± 0.222
2.69SerArg: 2.69 ± 0.484
5.179SerSer: 5.179 ± 0.765
3.363SerThr: 3.363 ± 0.324
7.735SerVal: 7.735 ± 0.686
1.009SerTrp: 1.009 ± 0.24
2.959SerTyr: 2.959 ± 0.424
0.0SerXaa: 0.0 ± 0.0
Thr
3.699ThrAla: 3.699 ± 0.678
1.614ThrCys: 1.614 ± 0.236
3.766ThrAsp: 3.766 ± 0.251
2.018ThrGlu: 2.018 ± 0.239
3.901ThrPhe: 3.901 ± 0.388
4.305ThrGly: 4.305 ± 0.314
1.143ThrHis: 1.143 ± 0.171
2.085ThrIle: 2.085 ± 0.563
3.094ThrLys: 3.094 ± 0.23
5.179ThrLeu: 5.179 ± 0.515
2.152ThrMet: 2.152 ± 0.426
2.421ThrAsn: 2.421 ± 0.212
2.623ThrPro: 2.623 ± 0.481
1.95ThrGln: 1.95 ± 0.191
1.816ThrArg: 1.816 ± 0.408
3.43ThrSer: 3.43 ± 0.254
3.901ThrThr: 3.901 ± 0.339
4.17ThrVal: 4.17 ± 0.344
0.605ThrTrp: 0.605 ± 0.12
2.959ThrTyr: 2.959 ± 0.189
0.0ThrXaa: 0.0 ± 0.0
Val
6.053ValAla: 6.053 ± 0.815
3.699ValCys: 3.699 ± 0.46
6.928ValAsp: 6.928 ± 0.731
4.237ValGlu: 4.237 ± 0.364
3.632ValPhe: 3.632 ± 0.242
4.036ValGly: 4.036 ± 0.288
0.673ValHis: 0.673 ± 0.16
4.103ValIle: 4.103 ± 0.357
7.331ValLys: 7.331 ± 0.88
9.483ValLeu: 9.483 ± 0.661
2.489ValMet: 2.489 ± 0.423
5.582ValAsn: 5.582 ± 0.571
4.439ValPro: 4.439 ± 0.333
3.699ValGln: 3.699 ± 0.411
4.17ValArg: 4.17 ± 0.524
6.591ValSer: 6.591 ± 0.346
5.448ValThr: 5.448 ± 0.602
11.232ValVal: 11.232 ± 1.604
1.076ValTrp: 1.076 ± 0.242
4.574ValTyr: 4.574 ± 0.592
0.0ValXaa: 0.0 ± 0.0
Trp
0.605TrpAla: 0.605 ± 0.136
0.336TrpCys: 0.336 ± 0.144
0.336TrpAsp: 0.336 ± 0.228
0.269TrpGlu: 0.269 ± 0.052
1.345TrpPhe: 1.345 ± 0.165
0.336TrpGly: 0.336 ± 0.077
0.404TrpHis: 0.404 ± 0.104
0.471TrpIle: 0.471 ± 0.186
0.269TrpLys: 0.269 ± 0.052
2.354TrpLeu: 2.354 ± 0.289
0.202TrpMet: 0.202 ± 0.102
0.807TrpAsn: 0.807 ± 0.234
0.605TrpPro: 0.605 ± 0.153
0.471TrpGln: 0.471 ± 0.073
0.807TrpArg: 0.807 ± 0.181
1.278TrpSer: 1.278 ± 0.101
0.74TrpThr: 0.74 ± 0.166
0.74TrpVal: 0.74 ± 0.102
0.067TrpTrp: 0.067 ± 0.095
0.673TrpTyr: 0.673 ± 0.217
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.959TyrAla: 2.959 ± 0.238
2.354TyrCys: 2.354 ± 0.32
2.421TyrAsp: 2.421 ± 0.401
2.018TyrGlu: 2.018 ± 0.344
2.354TyrPhe: 2.354 ± 0.219
3.094TyrGly: 3.094 ± 0.404
0.673TyrHis: 0.673 ± 0.223
1.412TyrIle: 1.412 ± 0.202
2.623TyrLys: 2.623 ± 0.405
3.296TyrLeu: 3.296 ± 0.275
1.076TyrMet: 1.076 ± 0.159
2.287TyrAsn: 2.287 ± 0.422
1.143TyrPro: 1.143 ± 0.263
1.547TyrGln: 1.547 ± 0.175
2.085TyrArg: 2.085 ± 0.194
2.758TyrSer: 2.758 ± 0.341
4.237TyrThr: 4.237 ± 0.515
4.708TyrVal: 4.708 ± 0.325
0.404TyrTrp: 0.404 ± 0.114
3.161TyrTyr: 3.161 ± 0.52
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.067XaaCys: 0.067 ± 0.119
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (14869 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski