Amino acid dipeptide frequency for Murid herpesvirus 1 (strain Smith) (MuHV-1) (Mouse cytomegalovirus)

In addition to single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions; for example, 12.302‰ corresponds to 0.012302, or about 1.23%.
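As a rough illustration of how such per mille values can be obtained, the following Python sketch counts overlapping adjacent pairs in a list of sequences and compares each frequency with the uniform expectation of 2.5‰. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the choice not to count pairs across protein boundaries, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
UNIFORM_EXPECTATION = 1000.0 / 400    # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs.

    Assumption: dipeptides are overlapping adjacent pairs, and a pair never
    spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(AMINO_ACIDS, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


# Toy input (invented for illustration, not taken from the proteome).
toy_proteins = ["MAAAGK", "MSSAA"]
freqs = dipeptide_per_mille(toy_proteins)

# Dipeptides above the uniform expectation would be the "red" cells.
overrepresented = sorted(dp for dp, f in freqs.items() if f > UNIFORM_EXPECTATION)
print(overrepresented)
```

With only nine dipeptides in the toy input, every observed pair is far above 2.5‰; on a real proteome the comparison becomes meaningful.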

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell shows frequency ± error in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 12.302 ± 2.428 | 1.144 ± 0.23 | 4.768 ± 0.678 | 3.91 ± 0.47 | 2.766 ± 0.562 | 6.008 ± 0.992 | 1.526 ± 0.433 | 3.242 ± 0.444 | 2.766 ± 0.457 | 6.294 ± 0.509 | 2.575 ± 0.277 | 1.812 ± 0.529 | 4.768 ± 0.686 | 1.717 ± 0.299 | 4.482 ± 0.708 | 5.722 ± 0.55 | 6.294 ± 0.974 | 7.152 ± 1.251 | 0.668 ± 0.309 | 1.621 ± 0.263 | 0.0 ± 0.0 |
| Cys | 1.621 ± 0.517 | 0.191 ± 0.105 | 2.003 ± 0.425 | 1.43 ± 0.361 | 0.763 ± 0.285 | 1.812 ± 0.611 | 0.572 ± 0.213 | 0.572 ± 0.336 | 1.049 ± 0.207 | 2.479 ± 0.415 | 0.858 ± 0.235 | 0.954 ± 0.266 | 0.858 ± 0.329 | 0.572 ± 0.24 | 2.098 ± 0.323 | 1.812 ± 0.481 | 0.668 ± 0.21 | 2.384 ± 0.494 | 0.095 ± 0.073 | 1.144 ± 0.277 | 0.0 ± 0.0 |
| Asp | 4.864 ± 0.643 | 0.858 ± 0.206 | 5.34 ± 0.743 | 7.057 ± 0.926 | 2.193 ± 0.52 | 3.147 ± 0.455 | 1.335 ± 0.372 | 3.147 ± 0.415 | 1.526 ± 0.398 | 5.34 ± 0.691 | 2.384 ± 0.512 | 1.621 ± 0.372 | 2.861 ± 0.485 | 1.907 ± 0.309 | 4.005 ± 0.463 | 4.482 ± 0.774 | 2.861 ± 0.677 | 4.959 ± 0.779 | 0.477 ± 0.154 | 2.193 ± 0.417 | 0.0 ± 0.0 |
| Glu | 4.482 ± 0.629 | 1.717 ± 0.366 | 5.34 ± 0.732 | 5.627 ± 1.85 | 2.193 ± 0.362 | 2.861 ± 0.538 | 1.335 ± 0.417 | 2.193 ± 0.641 | 2.766 ± 0.514 | 4.864 ± 0.598 | 1.717 ± 0.322 | 3.147 ± 0.487 | 2.384 ± 0.605 | 1.717 ± 0.448 | 3.91 ± 0.516 | 4.864 ± 0.868 | 3.91 ± 0.627 | 2.384 ± 0.658 | 0.191 ± 0.121 | 2.479 ± 0.495 | 0.0 ± 0.0 |
| Phe | 2.575 ± 0.522 | 0.858 ± 0.252 | 2.67 ± 0.522 | 1.812 ± 0.341 | 2.67 ± 0.409 | 2.575 ± 0.551 | 0.858 ± 0.26 | 1.812 ± 0.336 | 1.621 ± 0.291 | 4.005 ± 0.558 | 0.954 ± 0.28 | 1.43 ± 0.344 | 1.907 ± 0.363 | 1.43 ± 0.456 | 2.003 ± 0.357 | 3.147 ± 0.855 | 2.289 ± 0.504 | 2.766 ± 0.626 | 0.477 ± 0.183 | 2.003 ± 0.521 | 0.0 ± 0.0 |
| Gly | 5.817 ± 0.753 | 1.049 ± 0.331 | 4.196 ± 0.777 | 3.147 ± 0.63 | 2.193 ± 0.601 | 8.011 ± 1.27 | 1.621 ± 0.4 | 2.003 ± 0.388 | 2.098 ± 0.541 | 4.578 ± 0.722 | 1.144 ± 0.252 | 1.717 ± 0.477 | 3.052 ± 0.441 | 2.003 ± 0.39 | 4.864 ± 0.861 | 4.005 ± 0.789 | 3.242 ± 0.464 | 3.91 ± 0.572 | 0.381 ± 0.183 | 2.67 ± 0.49 | 0.0 ± 0.0 |
| His | 1.526 ± 0.31 | 0.381 ± 0.184 | 1.43 ± 0.44 | 1.144 ± 0.183 | 0.858 ± 0.217 | 1.812 ± 0.355 | 1.24 ± 0.411 | 1.43 ± 0.196 | 0.954 ± 0.496 | 2.766 ± 0.591 | 0.381 ± 0.198 | 1.049 ± 0.3 | 0.954 ± 0.332 | 0.477 ± 0.214 | 1.43 ± 0.378 | 1.621 ± 0.587 | 1.335 ± 0.305 | 1.717 ± 0.347 | 0.095 ± 0.086 | 0.668 ± 0.261 | 0.0 ± 0.0 |
| Ile | 3.052 ± 0.474 | 0.954 ± 0.222 | 1.907 ± 0.435 | 1.717 ± 0.409 | 2.479 ± 0.498 | 2.193 ± 0.554 | 0.763 ± 0.284 | 1.526 ± 0.381 | 2.193 ± 0.436 | 4.387 ± 0.434 | 1.717 ± 0.287 | 2.766 ± 0.423 | 2.575 ± 0.578 | 0.954 ± 0.27 | 3.338 ± 0.547 | 2.861 ± 0.429 | 2.575 ± 0.69 | 2.193 ± 0.299 | 0.381 ± 0.197 | 2.098 ± 0.499 | 0.0 ± 0.0 |
| Lys | 2.766 ± 0.578 | 0.858 ± 0.256 | 1.43 ± 0.358 | 2.098 ± 0.415 | 1.24 ± 0.148 | 1.717 ± 0.544 | 1.049 ± 0.393 | 3.147 ± 0.486 | 3.624 ± 0.754 | 4.387 ± 0.558 | 1.43 ± 0.403 | 2.003 ± 0.475 | 1.43 ± 0.402 | 1.621 ± 0.396 | 3.338 ± 0.388 | 2.575 ± 0.44 | 2.003 ± 0.392 | 2.67 ± 0.732 | 0.191 ± 0.121 | 1.43 ± 0.349 | 0.0 ± 0.0 |
| Leu | 7.629 ± 0.89 | 3.052 ± 0.526 | 5.054 ± 0.772 | 4.959 ± 0.585 | 3.529 ± 0.728 | 5.34 ± 0.792 | 2.575 ± 0.401 | 3.719 ± 0.677 | 4.196 ± 0.531 | 7.248 ± 1.188 | 2.956 ± 0.436 | 2.861 ± 0.445 | 4.864 ± 0.895 | 2.479 ± 0.552 | 5.627 ± 0.86 | 7.152 ± 0.611 | 4.673 ± 0.84 | 4.387 ± 0.903 | 0.477 ± 0.204 | 3.147 ± 0.805 | 0.0 ± 0.0 |
| Met | 2.766 ± 0.558 | 0.572 ± 0.187 | 1.812 ± 0.361 | 1.24 ± 0.258 | 1.621 ± 0.411 | 0.954 ± 0.389 | 0.286 ± 0.21 | 1.907 ± 0.512 | 1.24 ± 0.332 | 2.479 ± 0.678 | 0.858 ± 0.344 | 1.144 ± 0.298 | 1.812 ± 0.466 | 0.858 ± 0.29 | 1.526 ± 0.33 | 2.766 ± 0.461 | 1.43 ± 0.278 | 1.526 ± 0.285 | 0.191 ± 0.105 | 0.858 ± 0.348 | 0.0 ± 0.0 |
| Asn | 3.719 ± 0.595 | 0.668 ± 0.287 | 1.621 ± 0.295 | 1.717 ± 0.465 | 0.954 ± 0.287 | 2.575 ± 0.346 | 0.858 ± 0.166 | 1.907 ± 0.468 | 1.43 ± 0.5 | 4.196 ± 0.512 | 1.049 ± 0.302 | 2.003 ± 0.505 | 1.717 ± 0.334 | 1.049 ± 0.326 | 2.479 ± 0.406 | 2.479 ± 0.464 | 3.052 ± 0.583 | 2.766 ± 0.635 | 0.381 ± 0.175 | 1.43 ± 0.338 | 0.0 ± 0.0 |
| Pro | 4.864 ± 0.84 | 1.43 ± 0.347 | 2.766 ± 0.467 | 4.196 ± 0.812 | 2.384 ± 0.421 | 2.289 ± 0.409 | 1.144 ± 0.206 | 2.289 ± 0.41 | 1.812 ± 0.417 | 4.101 ± 0.618 | 1.049 ± 0.316 | 2.003 ± 0.576 | 4.673 ± 1.152 | 2.003 ± 0.473 | 3.242 ± 0.586 | 6.008 ± 1.14 | 2.479 ± 0.574 | 4.387 ± 0.534 | 0.381 ± 0.153 | 1.43 ± 0.314 | 0.0 ± 0.0 |
| Gln | 1.526 ± 0.452 | 0.763 ± 0.225 | 0.954 ± 0.267 | 1.621 ± 0.496 | 1.24 ± 0.373 | 1.43 ± 0.317 | 0.477 ± 0.199 | 1.526 ± 0.321 | 1.335 ± 0.374 | 2.956 ± 0.46 | 0.381 ± 0.232 | 1.24 ± 0.304 | 1.144 ± 0.473 | 3.719 ± 1.131 | 2.766 ± 0.287 | 2.575 ± 0.353 | 2.193 ± 0.505 | 1.621 ± 0.447 | 0.0 ± 0.0 | 0.858 ± 0.253 | 0.0 ± 0.0 |
| Arg | 3.91 ± 0.695 | 2.575 ± 0.241 | 5.436 ± 0.752 | 4.482 ± 0.493 | 2.67 ± 0.576 | 4.005 ± 0.511 | 2.479 ± 0.584 | 3.147 ± 0.471 | 3.147 ± 0.548 | 4.482 ± 0.47 | 2.289 ± 0.526 | 2.098 ± 0.673 | 4.196 ± 0.829 | 1.717 ± 0.429 | 7.725 ± 1.326 | 6.58 ± 0.91 | 3.91 ± 0.592 | 5.15 ± 0.731 | 0.286 ± 0.182 | 2.289 ± 0.283 | 0.0 ± 0.0 |
| Ser | 4.959 ± 0.838 | 2.003 ± 0.276 | 4.768 ± 0.845 | 4.482 ± 0.76 | 3.529 ± 0.715 | 5.245 ± 0.901 | 1.621 ± 0.435 | 3.147 ± 0.607 | 2.67 ± 0.711 | 6.008 ± 0.748 | 1.621 ± 0.536 | 2.67 ± 0.372 | 5.627 ± 1.182 | 2.003 ± 0.481 | 5.817 ± 0.813 | 10.586 ± 3.119 | 5.15 ± 0.637 | 6.58 ± 1.099 | 0.477 ± 0.154 | 2.956 ± 0.377 | 0.0 ± 0.0 |
| Thr | 5.436 ± 0.805 | 1.43 ± 0.34 | 3.338 ± 0.696 | 3.147 ± 0.673 | 1.717 ± 0.468 | 3.338 ± 0.572 | 1.144 ± 0.348 | 2.003 ± 0.4 | 1.907 ± 0.401 | 4.578 ± 0.748 | 1.812 ± 0.288 | 2.193 ± 0.49 | 4.673 ± 0.787 | 1.526 ± 0.25 | 4.291 ± 0.809 | 4.482 ± 0.626 | 3.529 ± 0.89 | 5.436 ± 0.938 | 0.381 ± 0.182 | 2.384 ± 0.601 | 0.0 ± 0.0 |
| Val | 4.768 ± 0.659 | 2.289 ± 0.353 | 4.578 ± 0.79 | 4.101 ± 0.549 | 3.433 ± 0.745 | 3.815 ± 0.612 | 1.43 ± 0.354 | 2.956 ± 0.589 | 3.052 ± 0.4 | 7.725 ± 1.14 | 1.144 ± 0.396 | 3.433 ± 0.513 | 3.529 ± 0.765 | 1.43 ± 0.298 | 4.578 ± 0.594 | 5.34 ± 0.847 | 4.005 ± 0.82 | 6.58 ± 1.059 | 1.144 ± 0.348 | 1.907 ± 0.384 | 0.0 ± 0.0 |
| Trp | 0.381 ± 0.149 | 0.477 ± 0.227 | 0.286 ± 0.155 | 0.191 ± 0.18 | 0.286 ± 0.141 | 0.286 ± 0.146 | 0.0 ± 0.0 | 0.095 ± 0.09 | 0.286 ± 0.13 | 0.477 ± 0.178 | 0.477 ± 0.183 | 0.286 ± 0.167 | 0.572 ± 0.192 | 0.477 ± 0.22 | 0.572 ± 0.183 | 0.572 ± 0.187 | 0.572 ± 0.203 | 0.191 ± 0.17 | 0.095 ± 0.073 | 0.286 ± 0.144 | 0.0 ± 0.0 |
| Tyr | 2.575 ± 0.488 | 0.668 ± 0.202 | 2.861 ± 0.371 | 2.003 ± 0.374 | 1.144 ± 0.31 | 2.289 ± 0.425 | 0.954 ± 0.356 | 0.858 ± 0.286 | 1.335 ± 0.333 | 2.575 ± 0.495 | 0.954 ± 0.225 | 1.621 ± 0.539 | 1.335 ± 0.278 | 0.763 ± 0.249 | 4.482 ± 0.643 | 2.289 ± 0.411 | 2.575 ± 0.428 | 2.479 ± 0.366 | 0.095 ± 0.092 | 1.144 ± 0.217 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 15 proteins (10487 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
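To make the idea of protein-level bootstrapping concrete, here is a minimal Python sketch. The page only states that the error comes from 100 bootstrap rounds at the protein level; the rest is assumed for illustration: whole proteins are resampled with replacement, the error is taken as the standard deviation of the resampled frequencies, and the names `dipeptide_freq` and `bootstrap_error` as well as the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples.

    Assumption: each resample draws len(proteins) whole proteins with
    replacement, so residues within a protein are never shuffled.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input (invented for illustration, not taken from the proteome).
toy_proteins = ["MAAAGK", "MSSAA", "MKKLV"]
point = dipeptide_freq(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {point:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps adjacent pairs intact, which is presumably why the error is reported "at the protein level". A dipeptide that never occurs, such as every Xaa pair in this table, has a frequency of 0 in every resample and therefore an error of 0.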

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski