Amino acid dipeptide frequency for Murine coronavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
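As an illustration of what a dipeptide frequency is, the following minimal sketch counts overlapping adjacent residue pairs and reports them in per mille. It is not the database's own code: the function name, the toy sequences, and the choice to normalize by the total number of counted pairs are assumptions made for this example, and one-letter codes are used instead of the three-letter codes in the table.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for one-letter protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping adjacent pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the number of counted pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not the real proteome.
toy = ["MAAV", "AVV"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Note that the pairs are ordered, so "AV" and "VA" are counted separately, just as AlaVal and ValAla have separate entries in the table.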

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
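The uniform expectation can be worked out directly. The sketch below assumes the simple rule described above (compare each value with 1/400) and checks a few values taken from the table; the function name and the "red"/"blue" labels are illustrative choices, and the real page may apply a different threshold.

```python
N_STANDARD = 20  # standard amino acids, ignoring Xaa

# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD ** 2


def classify(observed_per_mille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_per_mille > EXPECTED_PER_MILLE:
        return "red"   # more frequent than expected
    if observed_per_mille < EXPECTED_PER_MILLE:
        return "blue"  # underrepresented
    return "none"


# A few values copied from the table (per mille).
examples = {"ValVal": 12.131, "AlaAla": 5.229, "HisHis": 0.139, "TrpTrp": 0.07}
for name, value in examples.items():
    print(name, value, classify(value))
```

A uniform baseline ignores the fact that single amino acids are themselves unevenly frequent, so a dipeptide of two common residues (such as ValVal) will tend to exceed 2.5‰ for that reason alone.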

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.
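As a small worked example of the unit conversion, the sketch below turns per mille values into fractions and approximate raw counts. It assumes that the denominator is the 14344 amino acids reported in the statistics line below the table; this is an inference that happens to reproduce the listed values (for example 2/14344 ≈ 0.139‰), not something the page states.

```python
TOTAL_AMINO_ACIDS = 14344  # from the statistics line below the table


def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction."""
    return per_mille * 1e-3


def approx_count(per_mille, total=TOTAL_AMINO_ACIDS):
    """Approximate raw count, assuming `total` is the denominator (an assumption)."""
    return round(per_mille / 1000 * total)


print(to_fraction(12.131))   # fraction for ValVal
print(approx_count(12.131))  # about 174 occurrences of ValVal
print(approx_count(5.229))   # about 75 occurrences of AlaAla
print(approx_count(0.139))   # about 2 occurrences of TrpXaa
```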

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.229 ± 0.379
AlaCys: 3.068 ± 0.746
AlaAsp: 4.323 ± 0.49
AlaGlu: 1.952 ± 0.325
AlaPhe: 4.044 ± 0.583
AlaGly: 3.765 ± 0.464
AlaHis: 1.394 ± 0.341
AlaIle: 4.532 ± 0.385
AlaLys: 4.602 ± 0.419
AlaLeu: 4.95 ± 0.353
AlaMet: 1.604 ± 0.273
AlaAsn: 4.392 ± 0.51
AlaPro: 2.37 ± 1.187
AlaGln: 2.022 ± 0.365
AlaArg: 1.813 ± 0.282
AlaSer: 5.647 ± 0.52
AlaThr: 3.765 ± 0.448
AlaVal: 6.623 ± 0.567
AlaTrp: 1.046 ± 0.355
AlaTyr: 2.51 ± 0.475
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.161 ± 0.253
CysCys: 1.882 ± 0.23
CysAsp: 2.022 ± 0.196
CysGlu: 0.976 ± 0.28
CysPhe: 2.37 ± 0.148
CysGly: 2.859 ± 0.386
CysHis: 0.418 ± 0.139
CysIle: 1.952 ± 0.436
CysLys: 2.58 ± 0.343
CysLeu: 2.998 ± 0.448
CysMet: 0.418 ± 0.188
CysAsn: 2.301 ± 0.308
CysPro: 1.046 ± 0.223
CysGln: 1.046 ± 0.448
CysArg: 1.604 ± 0.276
CysSer: 3.207 ± 0.679
CysThr: 1.813 ± 0.344
CysVal: 3.137 ± 0.337
CysTrp: 0.627 ± 0.208
CysTyr: 2.161 ± 0.553
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.044 ± 0.616
AspCys: 2.022 ± 0.21
AspAsp: 3.207 ± 0.509
AspGlu: 2.649 ± 0.251
AspPhe: 2.998 ± 0.589
AspGly: 4.532 ± 0.557
AspHis: 0.418 ± 0.192
AspIle: 1.882 ± 0.291
AspLys: 3.137 ± 0.307
AspLeu: 4.811 ± 0.653
AspMet: 1.882 ± 0.275
AspAsn: 2.44 ± 0.411
AspPro: 1.952 ± 0.469
AspGln: 1.534 ± 0.348
AspArg: 1.673 ± 0.238
AspSer: 4.392 ± 0.442
AspThr: 2.301 ± 0.299
AspVal: 6.693 ± 1.192
AspTrp: 0.418 ± 0.241
AspTyr: 2.58 ± 0.242
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.392 ± 0.231
GluCys: 0.837 ± 0.19
GluAsp: 2.859 ± 0.219
GluGlu: 2.58 ± 0.249
GluPhe: 2.44 ± 0.326
GluGly: 2.161 ± 0.288
GluHis: 0.558 ± 0.178
GluIle: 1.673 ± 0.221
GluLys: 2.161 ± 0.245
GluLeu: 3.695 ± 0.797
GluMet: 0.767 ± 0.321
GluAsn: 1.325 ± 0.244
GluPro: 1.604 ± 0.285
GluGln: 0.976 ± 0.311
GluArg: 1.534 ± 0.219
GluSer: 1.952 ± 0.236
GluThr: 2.022 ± 0.438
GluVal: 4.602 ± 0.472
GluTrp: 0.488 ± 0.312
GluTyr: 1.743 ± 0.233
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.719 ± 0.235
PheCys: 2.022 ± 0.334
PheAsp: 3.556 ± 0.588
PheGlu: 1.743 ± 0.208
PhePhe: 1.673 ± 0.222
PheGly: 3.277 ± 0.468
PheHis: 0.697 ± 0.261
PheIle: 2.51 ± 0.922
PheLys: 4.044 ± 0.363
PheLeu: 3.556 ± 0.514
PheMet: 1.185 ± 0.3
PheAsn: 4.253 ± 0.544
PhePro: 1.255 ± 0.105
PheGln: 1.534 ± 0.233
PheArg: 1.743 ± 0.366
PheSer: 3.695 ± 0.341
PheThr: 2.719 ± 0.518
PheVal: 6.205 ± 0.925
PheTrp: 0.558 ± 0.184
PheTyr: 3.486 ± 0.34
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.486 ± 0.511
GlyCys: 3.277 ± 0.473
GlyAsp: 3.137 ± 0.373
GlyGlu: 1.325 ± 0.219
GlyPhe: 3.765 ± 0.689
GlyGly: 3.765 ± 0.258
GlyHis: 1.673 ± 0.332
GlyIle: 2.301 ± 0.608
GlyLys: 4.183 ± 0.69
GlyLeu: 4.88 ± 0.272
GlyMet: 1.534 ± 0.349
GlyAsn: 3.625 ± 0.765
GlyPro: 1.604 ± 0.884
GlyGln: 1.534 ± 0.538
GlyArg: 1.394 ± 0.438
GlySer: 5.02 ± 0.642
GlyThr: 4.114 ± 0.413
GlyVal: 7.39 ± 0.418
GlyTrp: 0.697 ± 0.221
GlyTyr: 3.068 ± 0.462
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.394 ± 0.486
HisCys: 0.209 ± 0.069
HisAsp: 1.116 ± 0.285
HisGlu: 0.837 ± 0.242
HisPhe: 1.534 ± 0.237
HisGly: 0.697 ± 0.257
HisHis: 0.139 ± 0.222
HisIle: 0.627 ± 0.081
HisLys: 1.116 ± 0.21
HisLeu: 1.604 ± 0.346
HisMet: 0.418 ± 0.144
HisAsn: 0.837 ± 0.144
HisPro: 0.558 ± 0.158
HisGln: 0.627 ± 0.143
HisArg: 0.349 ± 0.111
HisSer: 0.697 ± 0.179
HisThr: 0.906 ± 0.12
HisVal: 2.161 ± 0.65
HisTrp: 0.209 ± 0.069
HisTyr: 0.418 ± 0.149
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.58 ± 0.318
IleCys: 1.882 ± 0.315
IleAsp: 1.813 ± 0.547
IleGlu: 2.022 ± 0.152
IlePhe: 1.394 ± 0.184
IleGly: 2.998 ± 0.71
IleHis: 0.627 ± 0.162
IleIle: 2.37 ± 1.046
IleLys: 3.347 ± 0.469
IleLeu: 4.253 ± 0.55
IleMet: 0.767 ± 0.337
IleAsn: 2.51 ± 0.593
IlePro: 1.394 ± 0.29
IleGln: 1.534 ± 0.56
IleArg: 1.604 ± 0.615
IleSer: 2.37 ± 1.18
IleThr: 2.789 ± 0.228
IleVal: 4.253 ± 0.453
IleTrp: 0.558 ± 0.292
IleTyr: 0.837 ± 0.186
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.392 ± 0.549
LysCys: 2.44 ± 0.409
LysAsp: 1.813 ± 0.5
LysGlu: 2.789 ± 0.16
LysPhe: 3.347 ± 0.602
LysGly: 3.625 ± 0.473
LysHis: 1.185 ± 0.379
LysIle: 2.37 ± 0.162
LysLys: 2.092 ± 0.39
LysLeu: 6.275 ± 0.908
LysMet: 0.837 ± 0.174
LysAsn: 1.673 ± 0.16
LysPro: 3.416 ± 0.487
LysGln: 2.928 ± 0.667
LysArg: 2.58 ± 0.333
LysSer: 3.556 ± 0.36
LysThr: 2.51 ± 0.309
LysVal: 6.205 ± 0.62
LysTrp: 1.255 ± 0.169
LysTyr: 2.51 ± 0.644
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.345 ± 0.766
LeuCys: 4.044 ± 0.344
LeuAsp: 5.02 ± 0.387
LeuGlu: 3.556 ± 0.626
LeuPhe: 5.02 ± 0.335
LeuGly: 4.811 ± 0.46
LeuHis: 1.116 ± 0.285
LeuIle: 3.207 ± 0.314
LeuLys: 3.695 ± 0.416
LeuLeu: 8.366 ± 1.71
LeuMet: 1.813 ± 0.36
LeuAsn: 4.88 ± 1.306
LeuPro: 4.532 ± 1.067
LeuGln: 4.183 ± 0.401
LeuArg: 3.486 ± 0.511
LeuSer: 7.53 ± 0.584
LeuThr: 5.787 ± 0.457
LeuVal: 7.46 ± 1.456
LeuTrp: 1.325 ± 0.305
LeuTyr: 5.09 ± 0.431
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.161 ± 0.416
MetCys: 0.837 ± 0.149
MetAsp: 1.325 ± 0.332
MetGlu: 0.906 ± 0.269
MetPhe: 1.394 ± 0.272
MetGly: 0.697 ± 0.246
MetHis: 0.906 ± 0.296
MetIle: 0.488 ± 0.311
MetLys: 0.349 ± 0.205
MetLeu: 3.068 ± 0.563
MetMet: 0.488 ± 0.175
MetAsn: 0.906 ± 0.173
MetPro: 1.673 ± 0.311
MetGln: 1.325 ± 0.235
MetArg: 0.837 ± 0.266
MetSer: 1.325 ± 0.257
MetThr: 1.673 ± 0.382
MetVal: 1.534 ± 0.41
MetTrp: 0.488 ± 0.266
MetTyr: 1.255 ± 0.17
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.625 ± 0.827
AsnCys: 1.673 ± 0.298
AsnAsp: 1.673 ± 0.235
AsnGlu: 2.37 ± 0.194
AsnPhe: 2.649 ± 0.506
AsnGly: 4.183 ± 0.778
AsnHis: 0.837 ± 0.109
AsnIle: 1.813 ± 0.667
AsnLys: 2.58 ± 0.392
AsnLeu: 3.347 ± 0.772
AsnMet: 1.325 ± 0.232
AsnAsn: 2.859 ± 1.473
AsnPro: 1.882 ± 0.241
AsnGln: 1.882 ± 0.596
AsnArg: 2.37 ± 0.599
AsnSer: 3.765 ± 0.483
AsnThr: 2.58 ± 0.336
AsnVal: 5.647 ± 0.276
AsnTrp: 0.627 ± 0.081
AsnTyr: 2.022 ± 0.962
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.928 ± 0.436
ProCys: 0.976 ± 0.175
ProAsp: 2.022 ± 0.36
ProGlu: 2.092 ± 0.225
ProPhe: 1.464 ± 0.226
ProGly: 2.44 ± 0.349
ProHis: 0.767 ± 0.216
ProIle: 1.534 ± 0.337
ProLys: 2.44 ± 0.702
ProLeu: 3.277 ± 0.43
ProMet: 0.558 ± 0.1
ProAsn: 1.743 ± 0.718
ProPro: 1.394 ± 0.479
ProGln: 1.394 ± 0.461
ProArg: 1.673 ± 0.21
ProSer: 2.58 ± 0.819
ProThr: 3.137 ± 0.453
ProVal: 3.625 ± 0.403
ProTrp: 0.627 ± 0.301
ProTyr: 1.743 ± 0.384
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.394 ± 0.347
GlnCys: 1.255 ± 0.258
GlnAsp: 1.604 ± 0.146
GlnGlu: 1.952 ± 0.223
GlnPhe: 1.743 ± 0.5
GlnGly: 2.092 ± 0.437
GlnHis: 0.976 ± 0.311
GlnIle: 2.301 ± 0.681
GlnLys: 2.022 ± 0.784
GlnLeu: 4.114 ± 0.557
GlnMet: 0.279 ± 0.107
GlnAsn: 1.325 ± 0.414
GlnPro: 1.116 ± 0.495
GlnGln: 1.255 ± 0.518
GlnArg: 0.837 ± 0.344
GlnSer: 2.928 ± 0.247
GlnThr: 1.882 ± 0.361
GlnVal: 2.859 ± 0.596
GlnTrp: 1.185 ± 0.267
GlnTyr: 1.255 ± 0.245
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.928 ± 0.506
ArgCys: 0.976 ± 0.198
ArgAsp: 2.44 ± 0.346
ArgGlu: 1.604 ± 0.286
ArgPhe: 2.022 ± 0.286
ArgGly: 2.51 ± 0.625
ArgHis: 0.837 ± 0.224
ArgIle: 0.767 ± 0.408
ArgLys: 2.092 ± 0.346
ArgLeu: 3.416 ± 0.347
ArgMet: 0.837 ± 0.145
ArgAsn: 1.464 ± 0.361
ArgPro: 1.325 ± 0.356
ArgGln: 0.906 ± 0.664
ArgArg: 1.604 ± 0.519
ArgSer: 3.486 ± 0.792
ArgThr: 1.882 ± 0.26
ArgVal: 3.765 ± 0.476
ArgTrp: 0.209 ± 0.32
ArgTyr: 1.743 ± 0.163
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.926 ± 0.546
SerCys: 2.58 ± 0.516
SerAsp: 3.695 ± 0.438
SerGlu: 3.137 ± 0.39
SerPhe: 3.556 ± 0.276
SerGly: 4.741 ± 1.193
SerHis: 0.976 ± 0.201
SerIle: 3.207 ± 0.335
SerLys: 3.625 ± 0.275
SerLeu: 7.53 ± 0.426
SerMet: 2.022 ± 0.515
SerAsn: 2.44 ± 0.221
SerPro: 2.161 ± 0.187
SerGln: 1.743 ± 0.157
SerArg: 2.859 ± 0.672
SerSer: 5.09 ± 0.693
SerThr: 3.904 ± 0.434
SerVal: 8.088 ± 0.823
SerTrp: 0.976 ± 0.318
SerTyr: 3.068 ± 0.405
SerXaa: 0.07 ± 0.137
Thr
ThrAla: 3.765 ± 0.959
ThrCys: 1.604 ± 0.246
ThrAsp: 4.044 ± 0.482
ThrGlu: 1.882 ± 0.24
ThrPhe: 3.695 ± 0.344
ThrGly: 4.602 ± 0.689
ThrHis: 1.185 ± 0.312
ThrIle: 2.092 ± 1.147
ThrLys: 2.928 ± 0.588
ThrLeu: 5.438 ± 0.554
ThrMet: 2.231 ± 0.567
ThrAsn: 2.44 ± 0.56
ThrPro: 2.44 ± 0.56
ThrGln: 1.882 ± 0.245
ThrArg: 1.952 ± 0.458
ThrSer: 3.416 ± 0.654
ThrThr: 3.835 ± 0.505
ThrVal: 4.114 ± 0.45
ThrTrp: 0.697 ± 0.174
ThrTyr: 3.277 ± 0.218
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.414 ± 0.965
ValCys: 3.695 ± 0.473
ValAsp: 7.111 ± 1.028
ValGlu: 3.765 ± 0.398
ValPhe: 3.904 ± 0.404
ValGly: 4.392 ± 0.499
ValHis: 0.767 ± 0.19
ValIle: 4.183 ± 0.782
ValLys: 8.088 ± 1.303
ValLeu: 9.9 ± 1.22
ValMet: 3.068 ± 0.58
ValAsn: 4.811 ± 0.737
ValPro: 4.741 ± 0.408
ValGln: 4.044 ± 0.553
ValArg: 3.486 ± 0.351
ValSer: 6.623 ± 0.452
ValThr: 5.229 ± 0.612
ValVal: 12.131 ± 2.239
ValTrp: 0.906 ± 0.206
ValTyr: 4.811 ± 0.663
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.697 ± 0.176
TrpCys: 0.279 ± 0.14
TrpAsp: 0.349 ± 0.225
TrpGlu: 0.279 ± 0.082
TrpPhe: 1.046 ± 0.188
TrpGly: 0.349 ± 0.111
TrpHis: 0.349 ± 0.079
TrpIle: 0.349 ± 0.127
TrpLys: 0.279 ± 0.082
TrpLeu: 2.44 ± 0.396
TrpMet: 0.209 ± 0.124
TrpAsn: 0.976 ± 0.256
TrpPro: 0.627 ± 0.229
TrpGln: 0.488 ± 0.084
TrpArg: 1.116 ± 0.298
TrpSer: 1.046 ± 0.227
TrpThr: 0.697 ± 0.132
TrpVal: 0.906 ± 0.108
TrpTrp: 0.07 ± 0.166
TrpTyr: 0.697 ± 0.315
TrpXaa: 0.139 ± 0.048
Tyr
TyrAla: 3.068 ± 0.298
TyrCys: 2.022 ± 0.534
TyrAsp: 2.44 ± 0.428
TyrGlu: 1.952 ± 0.335
TyrPhe: 2.37 ± 0.335
TyrGly: 3.277 ± 0.694
TyrHis: 0.627 ± 0.222
TyrIle: 1.743 ± 0.343
TyrLys: 2.44 ± 0.444
TyrLeu: 3.347 ± 0.216
TyrMet: 1.325 ± 0.246
TyrAsn: 2.51 ± 0.577
TyrPro: 1.325 ± 0.244
TyrGln: 1.534 ± 0.178
TyrArg: 2.231 ± 0.392
TyrSer: 3.068 ± 0.339
TyrThr: 3.974 ± 0.486
TyrVal: 4.602 ± 0.52
TyrTrp: 0.418 ± 0.151
TyrTyr: 3.416 ± 0.656
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.07 ± 0.137
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.139 ± 0.048
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (14344 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
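To illustrate what bootstrapping at the protein level can look like, here is a minimal sketch: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is taken as the error. The function names, the seed, the use of the sample standard deviation, and the toy sequences are all assumptions; the database's exact procedure is not described on this page.

```python
import random
import statistics
from collections import Counter


def per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the real sequences.
toy_proteome = ["MAAVVL", "AVVKL", "GGSVV", "MKLLA"]
print(round(per_mille(toy_proteome, "VV"), 3), "±",
      round(bootstrap_error(toy_proteome, "VV"), 3))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is why a dipeptide concentrated in one or two proteins can show a large error relative to its frequency.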

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski