Amino acid dipepetide frequency for Murine hepatitis virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.109AlaAla: 5.109 ± 0.485
2.452AlaCys: 2.452 ± 1.105
4.394AlaAsp: 4.394 ± 0.74
2.555AlaGlu: 2.555 ± 0.397
4.19AlaPhe: 4.19 ± 0.507
4.394AlaGly: 4.394 ± 0.416
1.328AlaHis: 1.328 ± 0.402
4.087AlaIle: 4.087 ± 0.573
4.087AlaLys: 4.087 ± 0.907
4.905AlaLeu: 4.905 ± 0.415
1.226AlaMet: 1.226 ± 0.498
4.394AlaAsn: 4.394 ± 0.663
2.452AlaPro: 2.452 ± 0.937
1.942AlaGln: 1.942 ± 0.659
1.635AlaArg: 1.635 ± 0.443
5.212AlaSer: 5.212 ± 0.698
3.679AlaThr: 3.679 ± 0.513
6.54AlaVal: 6.54 ± 1.09
1.124AlaTrp: 1.124 ± 0.552
2.35AlaTyr: 2.35 ± 0.429
0.0AlaXaa: 0.0 ± 0.0
Cys
2.35CysAla: 2.35 ± 0.59
1.839CysCys: 1.839 ± 0.314
1.533CysAsp: 1.533 ± 0.386
1.328CysGlu: 1.328 ± 0.284
2.248CysPhe: 2.248 ± 0.228
2.555CysGly: 2.555 ± 0.637
0.409CysHis: 0.409 ± 0.172
2.452CysIle: 2.452 ± 0.456
2.248CysLys: 2.248 ± 0.59
2.657CysLeu: 2.657 ± 0.64
0.511CysMet: 0.511 ± 0.419
2.555CysAsn: 2.555 ± 0.456
1.124CysPro: 1.124 ± 0.327
1.124CysGln: 1.124 ± 0.399
1.533CysArg: 1.533 ± 0.279
3.372CysSer: 3.372 ± 0.833
2.248CysThr: 2.248 ± 0.469
2.657CysVal: 2.657 ± 0.787
0.613CysTrp: 0.613 ± 0.258
2.146CysTyr: 2.146 ± 0.525
0.0CysXaa: 0.0 ± 0.0
Asp
4.087AspAla: 4.087 ± 0.838
1.737AspCys: 1.737 ± 0.223
3.27AspAsp: 3.27 ± 0.42
2.861AspGlu: 2.861 ± 0.389
3.372AspPhe: 3.372 ± 0.871
4.905AspGly: 4.905 ± 0.423
0.715AspHis: 0.715 ± 0.38
2.35AspIle: 2.35 ± 0.998
3.27AspLys: 3.27 ± 0.352
5.314AspLeu: 5.314 ± 0.772
1.635AspMet: 1.635 ± 0.398
2.146AspAsn: 2.146 ± 0.483
1.635AspPro: 1.635 ± 0.4
1.431AspGln: 1.431 ± 0.528
1.737AspArg: 1.737 ± 0.401
4.087AspSer: 4.087 ± 0.77
2.146AspThr: 2.146 ± 0.38
6.336AspVal: 6.336 ± 1.778
0.307AspTrp: 0.307 ± 0.163
2.555AspTyr: 2.555 ± 0.49
0.0AspXaa: 0.0 ± 0.0
Glu
4.292GluAla: 4.292 ± 0.664
1.328GluCys: 1.328 ± 0.402
2.963GluAsp: 2.963 ± 0.34
2.861GluGlu: 2.861 ± 0.747
2.452GluPhe: 2.452 ± 0.541
2.146GluGly: 2.146 ± 0.455
0.409GluHis: 0.409 ± 0.172
2.044GluIle: 2.044 ± 0.633
2.35GluLys: 2.35 ± 0.423
4.496GluLeu: 4.496 ± 0.927
1.022GluMet: 1.022 ± 0.57
1.431GluAsn: 1.431 ± 0.43
1.839GluPro: 1.839 ± 0.497
0.817GluGln: 0.817 ± 0.266
1.635GluArg: 1.635 ± 0.605
1.839GluSer: 1.839 ± 0.357
2.044GluThr: 2.044 ± 0.338
4.087GluVal: 4.087 ± 0.69
0.511GluTrp: 0.511 ± 0.334
1.737GluTyr: 1.737 ± 0.555
0.0GluXaa: 0.0 ± 0.0
Phe
3.27PheAla: 3.27 ± 0.625
1.942PheCys: 1.942 ± 0.388
3.577PheAsp: 3.577 ± 0.45
1.942PheGlu: 1.942 ± 0.191
1.737PhePhe: 1.737 ± 0.359
3.27PheGly: 3.27 ± 0.656
0.817PheHis: 0.817 ± 0.221
2.861PheIle: 2.861 ± 1.06
3.985PheLys: 3.985 ± 0.867
3.168PheLeu: 3.168 ± 0.554
1.022PheMet: 1.022 ± 0.308
3.985PheAsn: 3.985 ± 0.558
1.431PhePro: 1.431 ± 0.204
1.533PheGln: 1.533 ± 0.49
1.737PheArg: 1.737 ± 0.545
3.577PheSer: 3.577 ± 0.67
3.372PheThr: 3.372 ± 0.949
5.927PheVal: 5.927 ± 1.423
0.817PheTrp: 0.817 ± 0.424
3.474PheTyr: 3.474 ± 0.581
0.0PheXaa: 0.0 ± 0.0
Gly
3.372GlyAla: 3.372 ± 0.676
3.168GlyCys: 3.168 ± 0.54
3.474GlyAsp: 3.474 ± 0.451
1.226GlyGlu: 1.226 ± 0.23
3.781GlyPhe: 3.781 ± 1.0
3.474GlyGly: 3.474 ± 0.699
1.328GlyHis: 1.328 ± 0.306
2.759GlyIle: 2.759 ± 0.766
3.679GlyLys: 3.679 ± 0.667
4.803GlyLeu: 4.803 ± 0.669
1.226GlyMet: 1.226 ± 0.356
3.27GlyAsn: 3.27 ± 0.588
1.737GlyPro: 1.737 ± 0.791
1.635GlyGln: 1.635 ± 0.625
2.146GlyArg: 2.146 ± 0.807
5.518GlySer: 5.518 ± 0.422
3.883GlyThr: 3.883 ± 0.529
6.847GlyVal: 6.847 ± 1.007
0.715GlyTrp: 0.715 ± 0.163
3.474GlyTyr: 3.474 ± 0.543
0.0GlyXaa: 0.0 ± 0.0
His
1.431HisAla: 1.431 ± 0.638
0.511HisCys: 0.511 ± 0.367
1.124HisAsp: 1.124 ± 0.195
1.328HisGlu: 1.328 ± 0.469
1.431HisPhe: 1.431 ± 0.179
0.307HisGly: 0.307 ± 0.153
0.102HisHis: 0.102 ± 0.152
0.715HisIle: 0.715 ± 0.16
1.226HisLys: 1.226 ± 0.35
1.737HisLeu: 1.737 ± 0.56
0.409HisMet: 0.409 ± 0.265
0.92HisAsn: 0.92 ± 0.407
0.511HisPro: 0.511 ± 0.244
0.511HisGln: 0.511 ± 0.133
0.511HisArg: 0.511 ± 0.101
0.715HisSer: 0.715 ± 0.289
0.92HisThr: 0.92 ± 0.181
2.248HisVal: 2.248 ± 0.607
0.409HisTrp: 0.409 ± 0.386
0.715HisTyr: 0.715 ± 0.358
0.0HisXaa: 0.0 ± 0.0
Ile
2.248IleAla: 2.248 ± 0.465
1.737IleCys: 1.737 ± 0.634
2.248IleAsp: 2.248 ± 0.669
1.839IleGlu: 1.839 ± 0.363
1.942IlePhe: 1.942 ± 0.574
3.679IleGly: 3.679 ± 0.798
0.613IleHis: 0.613 ± 0.263
2.146IleIle: 2.146 ± 1.046
3.372IleLys: 3.372 ± 0.764
4.803IleLeu: 4.803 ± 1.149
1.022IleMet: 1.022 ± 0.496
2.555IleAsn: 2.555 ± 0.793
1.328IlePro: 1.328 ± 0.354
1.635IleGln: 1.635 ± 0.649
2.248IleArg: 2.248 ± 1.051
2.35IleSer: 2.35 ± 1.312
2.963IleThr: 2.963 ± 0.248
4.292IleVal: 4.292 ± 0.652
0.409IleTrp: 0.409 ± 0.315
1.124IleTyr: 1.124 ± 0.321
0.0IleXaa: 0.0 ± 0.0
Lys
3.883LysAla: 3.883 ± 0.973
2.044LysCys: 2.044 ± 0.559
2.248LysAsp: 2.248 ± 1.061
2.657LysGlu: 2.657 ± 0.228
3.474LysPhe: 3.474 ± 0.588
4.19LysGly: 4.19 ± 0.658
1.328LysHis: 1.328 ± 0.569
2.657LysIle: 2.657 ± 0.515
2.044LysLys: 2.044 ± 0.399
6.131LysLeu: 6.131 ± 0.809
0.715LysMet: 0.715 ± 0.165
1.942LysAsn: 1.942 ± 0.267
3.577LysPro: 3.577 ± 0.788
2.861LysGln: 2.861 ± 0.696
2.452LysArg: 2.452 ± 0.567
3.066LysSer: 3.066 ± 0.4
2.452LysThr: 2.452 ± 0.292
5.416LysVal: 5.416 ± 0.945
1.124LysTrp: 1.124 ± 0.324
2.861LysTyr: 2.861 ± 0.808
0.0LysXaa: 0.0 ± 0.0
Leu
6.131LeuAla: 6.131 ± 1.3
3.883LeuCys: 3.883 ± 0.714
4.905LeuAsp: 4.905 ± 0.519
4.19LeuGlu: 4.19 ± 0.679
5.314LeuPhe: 5.314 ± 0.697
5.007LeuGly: 5.007 ± 0.875
1.226LeuHis: 1.226 ± 0.311
3.27LeuIle: 3.27 ± 0.626
4.19LeuLys: 4.19 ± 0.832
7.868LeuLeu: 7.868 ± 1.398
1.737LeuMet: 1.737 ± 0.506
4.905LeuAsn: 4.905 ± 1.164
4.496LeuPro: 4.496 ± 0.97
4.292LeuGln: 4.292 ± 0.462
3.372LeuArg: 3.372 ± 0.495
7.255LeuSer: 7.255 ± 0.873
5.416LeuThr: 5.416 ± 0.592
7.766LeuVal: 7.766 ± 0.9
1.226LeuTrp: 1.226 ± 0.406
4.496LeuTyr: 4.496 ± 0.939
0.0LeuXaa: 0.0 ± 0.0
Met
1.533MetAla: 1.533 ± 0.531
0.92MetCys: 0.92 ± 0.314
1.124MetAsp: 1.124 ± 0.341
0.511MetGlu: 0.511 ± 0.244
1.328MetPhe: 1.328 ± 0.332
0.92MetGly: 0.92 ± 0.39
0.715MetHis: 0.715 ± 0.396
0.511MetIle: 0.511 ± 0.233
0.511MetLys: 0.511 ± 0.401
3.168MetLeu: 3.168 ± 0.467
0.613MetMet: 0.613 ± 0.258
0.92MetAsn: 0.92 ± 0.225
1.328MetPro: 1.328 ± 0.573
1.328MetGln: 1.328 ± 0.394
0.715MetArg: 0.715 ± 0.268
1.533MetSer: 1.533 ± 0.349
1.124MetThr: 1.124 ± 0.461
1.226MetVal: 1.226 ± 0.452
0.409MetTrp: 0.409 ± 0.18
1.124MetTyr: 1.124 ± 0.308
0.0MetXaa: 0.0 ± 0.0
Asn
3.781AsnAla: 3.781 ± 0.791
1.942AsnCys: 1.942 ± 0.636
1.839AsnAsp: 1.839 ± 0.45
1.942AsnGlu: 1.942 ± 0.266
2.555AsnPhe: 2.555 ± 0.585
4.087AsnGly: 4.087 ± 0.878
0.817AsnHis: 0.817 ± 0.312
1.737AsnIle: 1.737 ± 0.515
2.759AsnLys: 2.759 ± 0.714
3.985AsnLeu: 3.985 ± 0.964
1.226AsnMet: 1.226 ± 0.23
2.963AsnAsn: 2.963 ± 1.122
2.044AsnPro: 2.044 ± 0.438
2.146AsnGln: 2.146 ± 0.771
2.35AsnArg: 2.35 ± 0.53
3.781AsnSer: 3.781 ± 0.617
2.657AsnThr: 2.657 ± 0.362
5.825AsnVal: 5.825 ± 0.666
0.613AsnTrp: 0.613 ± 0.115
1.942AsnTyr: 1.942 ± 0.556
0.0AsnXaa: 0.0 ± 0.0
Pro
2.861ProAla: 2.861 ± 0.622
1.124ProCys: 1.124 ± 0.231
2.146ProAsp: 2.146 ± 0.392
2.044ProGlu: 2.044 ± 0.328
1.635ProPhe: 1.635 ± 0.297
2.452ProGly: 2.452 ± 0.794
1.226ProHis: 1.226 ± 0.795
1.635ProIle: 1.635 ± 0.481
2.452ProLys: 2.452 ± 0.682
3.066ProLeu: 3.066 ± 0.462
0.409ProMet: 0.409 ± 0.133
1.839ProAsn: 1.839 ± 0.958
1.328ProPro: 1.328 ± 0.613
1.431ProGln: 1.431 ± 0.514
1.839ProArg: 1.839 ± 0.377
2.759ProSer: 2.759 ± 1.01
3.168ProThr: 3.168 ± 0.58
2.963ProVal: 2.963 ± 0.652
0.511ProTrp: 0.511 ± 0.279
1.431ProTyr: 1.431 ± 0.494
0.0ProXaa: 0.0 ± 0.0
Gln
1.533GlnAla: 1.533 ± 0.4
1.226GlnCys: 1.226 ± 0.263
1.737GlnAsp: 1.737 ± 0.325
1.839GlnGlu: 1.839 ± 0.526
2.35GlnPhe: 2.35 ± 1.215
1.942GlnGly: 1.942 ± 0.375
1.022GlnHis: 1.022 ± 0.266
1.942GlnIle: 1.942 ± 0.394
2.248GlnLys: 2.248 ± 1.121
4.087GlnLeu: 4.087 ± 1.002
0.307GlnMet: 0.307 ± 0.163
1.533GlnAsn: 1.533 ± 0.486
1.226GlnPro: 1.226 ± 0.818
1.226GlnGln: 1.226 ± 0.459
1.022GlnArg: 1.022 ± 0.567
2.555GlnSer: 2.555 ± 0.534
2.044GlnThr: 2.044 ± 0.446
2.759GlnVal: 2.759 ± 0.771
1.124GlnTrp: 1.124 ± 0.367
1.328GlnTyr: 1.328 ± 0.498
0.0GlnXaa: 0.0 ± 0.0
Arg
3.168ArgAla: 3.168 ± 1.139
1.328ArgCys: 1.328 ± 0.314
2.248ArgAsp: 2.248 ± 0.37
1.635ArgGlu: 1.635 ± 0.282
1.635ArgPhe: 1.635 ± 0.4
2.452ArgGly: 2.452 ± 1.033
1.226ArgHis: 1.226 ± 0.532
1.022ArgIle: 1.022 ± 0.504
2.146ArgLys: 2.146 ± 0.625
4.292ArgLeu: 4.292 ± 0.552
0.92ArgMet: 0.92 ± 0.323
1.635ArgAsn: 1.635 ± 0.592
1.226ArgPro: 1.226 ± 0.402
1.226ArgGln: 1.226 ± 0.77
1.533ArgArg: 1.533 ± 0.937
3.781ArgSer: 3.781 ± 1.232
2.044ArgThr: 2.044 ± 0.696
3.372ArgVal: 3.372 ± 0.743
0.102ArgTrp: 0.102 ± 0.179
1.533ArgTyr: 1.533 ± 0.273
0.0ArgXaa: 0.0 ± 0.0
Ser
6.029SerAla: 6.029 ± 0.898
2.452SerCys: 2.452 ± 0.398
3.577SerAsp: 3.577 ± 0.564
3.168SerGlu: 3.168 ± 0.401
3.168SerPhe: 3.168 ± 0.418
4.087SerGly: 4.087 ± 1.076
1.431SerHis: 1.431 ± 0.377
3.883SerIle: 3.883 ± 0.673
3.474SerLys: 3.474 ± 0.227
6.949SerLeu: 6.949 ± 1.036
2.044SerMet: 2.044 ± 0.549
2.044SerAsn: 2.044 ± 0.347
2.452SerPro: 2.452 ± 0.524
2.146SerGln: 2.146 ± 0.464
2.759SerArg: 2.759 ± 0.752
5.212SerSer: 5.212 ± 0.945
3.577SerThr: 3.577 ± 0.674
8.073SerVal: 8.073 ± 0.91
1.124SerTrp: 1.124 ± 0.383
3.372SerTyr: 3.372 ± 0.805
0.0SerXaa: 0.0 ± 0.0
Thr
3.781ThrAla: 3.781 ± 1.142
1.737ThrCys: 1.737 ± 0.435
3.679ThrAsp: 3.679 ± 0.53
1.942ThrGlu: 1.942 ± 0.394
3.474ThrPhe: 3.474 ± 0.86
4.496ThrGly: 4.496 ± 0.737
0.92ThrHis: 0.92 ± 0.446
2.452ThrIle: 2.452 ± 1.291
3.066ThrLys: 3.066 ± 0.526
5.007ThrLeu: 5.007 ± 0.561
2.146ThrMet: 2.146 ± 0.592
2.657ThrAsn: 2.657 ± 0.345
2.452ThrPro: 2.452 ± 0.805
2.146ThrGln: 2.146 ± 0.451
2.35ThrArg: 2.35 ± 0.956
3.577ThrSer: 3.577 ± 0.513
4.19ThrThr: 4.19 ± 0.679
4.292ThrVal: 4.292 ± 0.663
0.613ThrTrp: 0.613 ± 0.263
2.861ThrTyr: 2.861 ± 0.306
0.0ThrXaa: 0.0 ± 0.0
Val
5.825ValAla: 5.825 ± 1.304
3.679ValCys: 3.679 ± 0.717
6.744ValAsp: 6.744 ± 1.235
4.087ValGlu: 4.087 ± 0.733
3.679ValPhe: 3.679 ± 0.475
3.883ValGly: 3.883 ± 0.492
0.817ValHis: 0.817 ± 0.436
3.985ValIle: 3.985 ± 0.62
6.847ValLys: 6.847 ± 1.398
8.89ValLeu: 8.89 ± 1.476
2.248ValMet: 2.248 ± 0.748
5.927ValAsn: 5.927 ± 0.983
4.292ValPro: 4.292 ± 0.64
3.679ValGln: 3.679 ± 0.848
3.985ValArg: 3.985 ± 0.837
6.642ValSer: 6.642 ± 0.461
5.007ValThr: 5.007 ± 1.341
10.321ValVal: 10.321 ± 2.905
0.92ValTrp: 0.92 ± 0.415
5.109ValTyr: 5.109 ± 1.174
0.0ValXaa: 0.0 ± 0.0
Trp
0.613TrpAla: 0.613 ± 0.261
0.409TrpCys: 0.409 ± 0.357
0.511TrpAsp: 0.511 ± 0.461
0.307TrpGlu: 0.307 ± 0.097
1.124TrpPhe: 1.124 ± 0.308
0.409TrpGly: 0.409 ± 0.19
0.409TrpHis: 0.409 ± 0.236
0.511TrpIle: 0.511 ± 0.466
0.307TrpLys: 0.307 ± 0.097
2.146TrpLeu: 2.146 ± 0.658
0.102TrpMet: 0.102 ± 0.072
1.022TrpAsn: 1.022 ± 0.455
0.511TrpPro: 0.511 ± 0.235
0.409TrpGln: 0.409 ± 0.198
0.715TrpArg: 0.715 ± 0.317
1.226TrpSer: 1.226 ± 0.2
0.715TrpThr: 0.715 ± 0.238
0.817TrpVal: 0.817 ± 0.172
0.102TrpTrp: 0.102 ± 0.202
0.817TrpTyr: 0.817 ± 0.379
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.759TyrAla: 2.759 ± 0.565
2.044TyrCys: 2.044 ± 0.573
2.555TyrAsp: 2.555 ± 0.851
2.044TyrGlu: 2.044 ± 0.324
2.555TyrPhe: 2.555 ± 0.513
2.861TyrGly: 2.861 ± 0.444
0.92TyrHis: 0.92 ± 0.575
1.533TyrIle: 1.533 ± 0.34
2.861TyrLys: 2.861 ± 0.743
3.474TyrLeu: 3.474 ± 0.558
0.92TyrMet: 0.92 ± 0.334
2.35TyrAsn: 2.35 ± 0.758
1.328TyrPro: 1.328 ± 0.425
1.533TyrGln: 1.533 ± 0.264
2.248TyrArg: 2.248 ± 0.347
2.963TyrSer: 2.963 ± 0.691
4.19TyrThr: 4.19 ± 1.176
4.905TyrVal: 4.905 ± 0.698
0.409TyrTrp: 0.409 ± 0.181
3.372TyrTyr: 3.372 ± 0.834
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (9787 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski