Amino acid dipepetide frequency for Moraxella phage Mcat6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.956AlaAla: 2.956 ± 0.652
0.879AlaCys: 0.879 ± 0.383
5.432AlaAsp: 5.432 ± 0.651
3.355AlaGlu: 3.355 ± 0.527
3.595AlaPhe: 3.595 ± 0.54
4.953AlaGly: 4.953 ± 0.721
2.077AlaHis: 2.077 ± 0.367
6.231AlaIle: 6.231 ± 0.631
8.388AlaLys: 8.388 ± 1.117
7.429AlaLeu: 7.429 ± 0.821
2.956AlaMet: 2.956 ± 0.746
4.553AlaAsn: 4.553 ± 0.782
2.636AlaPro: 2.636 ± 0.449
5.033AlaGln: 5.033 ± 0.714
3.116AlaArg: 3.116 ± 0.505
6.071AlaSer: 6.071 ± 0.841
5.911AlaThr: 5.911 ± 0.935
6.471AlaVal: 6.471 ± 0.629
1.678AlaTrp: 1.678 ± 0.354
2.876AlaTyr: 2.876 ± 0.403
0.0AlaXaa: 0.0 ± 0.0
Cys
0.559CysAla: 0.559 ± 0.226
0.32CysCys: 0.32 ± 0.193
0.639CysAsp: 0.639 ± 0.214
0.719CysGlu: 0.719 ± 0.23
0.399CysPhe: 0.399 ± 0.171
1.278CysGly: 1.278 ± 0.384
0.32CysHis: 0.32 ± 0.162
0.32CysIle: 0.32 ± 0.149
0.32CysLys: 0.32 ± 0.152
0.959CysLeu: 0.959 ± 0.31
0.16CysMet: 0.16 ± 0.119
0.32CysAsn: 0.32 ± 0.167
0.479CysPro: 0.479 ± 0.18
0.32CysGln: 0.32 ± 0.17
0.559CysArg: 0.559 ± 0.259
0.479CysSer: 0.479 ± 0.192
0.559CysThr: 0.559 ± 0.263
0.559CysVal: 0.559 ± 0.202
0.0CysTrp: 0.0 ± 0.0
0.559CysTyr: 0.559 ± 0.244
0.0CysXaa: 0.0 ± 0.0
Asp
4.314AspAla: 4.314 ± 0.539
0.479AspCys: 0.479 ± 0.289
6.471AspAsp: 6.471 ± 0.798
5.592AspGlu: 5.592 ± 0.614
2.237AspPhe: 2.237 ± 0.552
6.391AspGly: 6.391 ± 0.789
0.559AspHis: 0.559 ± 0.248
3.914AspIle: 3.914 ± 0.486
6.311AspLys: 6.311 ± 0.799
5.113AspLeu: 5.113 ± 0.649
1.278AspMet: 1.278 ± 0.411
3.595AspAsn: 3.595 ± 0.548
1.358AspPro: 1.358 ± 0.385
1.358AspGln: 1.358 ± 0.311
1.917AspArg: 1.917 ± 0.409
2.397AspSer: 2.397 ± 0.488
4.074AspThr: 4.074 ± 0.618
2.636AspVal: 2.636 ± 0.487
0.959AspTrp: 0.959 ± 0.265
2.876AspTyr: 2.876 ± 0.603
0.0AspXaa: 0.0 ± 0.0
Glu
3.355GluAla: 3.355 ± 0.591
0.719GluCys: 0.719 ± 0.265
0.959GluAsp: 0.959 ± 0.355
1.198GluGlu: 1.198 ± 0.402
3.036GluPhe: 3.036 ± 0.502
2.476GluGly: 2.476 ± 0.521
1.678GluHis: 1.678 ± 0.376
4.873GluIle: 4.873 ± 0.667
3.195GluLys: 3.195 ± 0.657
7.349GluLeu: 7.349 ± 0.695
1.678GluMet: 1.678 ± 0.388
3.036GluAsn: 3.036 ± 0.53
1.997GluPro: 1.997 ± 0.523
4.074GluGln: 4.074 ± 0.574
3.755GluArg: 3.755 ± 0.538
2.956GluSer: 2.956 ± 0.537
2.157GluThr: 2.157 ± 0.347
2.556GluVal: 2.556 ± 0.444
1.039GluTrp: 1.039 ± 0.263
1.997GluTyr: 1.997 ± 0.458
0.0GluXaa: 0.0 ± 0.0
Phe
3.675PheAla: 3.675 ± 0.405
0.799PheCys: 0.799 ± 0.255
2.716PheAsp: 2.716 ± 0.562
2.636PheGlu: 2.636 ± 0.41
0.799PhePhe: 0.799 ± 0.26
3.515PheGly: 3.515 ± 0.38
1.278PheHis: 1.278 ± 0.311
2.876PheIle: 2.876 ± 0.578
2.077PheLys: 2.077 ± 0.338
2.237PheLeu: 2.237 ± 0.596
1.039PheMet: 1.039 ± 0.291
1.757PheAsn: 1.757 ± 0.276
0.559PhePro: 0.559 ± 0.217
0.399PheGln: 0.399 ± 0.18
1.678PheArg: 1.678 ± 0.373
1.917PheSer: 1.917 ± 0.381
1.278PheThr: 1.278 ± 0.281
1.757PheVal: 1.757 ± 0.388
0.479PheTrp: 0.479 ± 0.219
1.598PheTyr: 1.598 ± 0.383
0.0PheXaa: 0.0 ± 0.0
Gly
5.033GlyAla: 5.033 ± 1.058
0.399GlyCys: 0.399 ± 0.188
4.474GlyAsp: 4.474 ± 0.558
4.873GlyGlu: 4.873 ± 0.766
2.397GlyPhe: 2.397 ± 0.628
4.713GlyGly: 4.713 ± 0.742
1.118GlyHis: 1.118 ± 0.246
4.553GlyIle: 4.553 ± 0.684
5.352GlyLys: 5.352 ± 0.605
5.752GlyLeu: 5.752 ± 0.806
2.317GlyMet: 2.317 ± 0.367
2.876GlyAsn: 2.876 ± 0.546
0.32GlyPro: 0.32 ± 0.212
2.876GlyGln: 2.876 ± 0.55
3.355GlyArg: 3.355 ± 0.524
2.956GlySer: 2.956 ± 0.416
3.355GlyThr: 3.355 ± 0.693
5.432GlyVal: 5.432 ± 0.763
1.278GlyTrp: 1.278 ± 0.347
2.876GlyTyr: 2.876 ± 0.606
0.0GlyXaa: 0.0 ± 0.0
His
2.556HisAla: 2.556 ± 0.477
0.08HisCys: 0.08 ± 0.077
1.278HisAsp: 1.278 ± 0.282
1.518HisGlu: 1.518 ± 0.296
1.198HisPhe: 1.198 ± 0.306
1.917HisGly: 1.917 ± 0.365
1.039HisHis: 1.039 ± 0.339
1.118HisIle: 1.118 ± 0.331
1.278HisLys: 1.278 ± 0.312
1.917HisLeu: 1.917 ± 0.293
0.479HisMet: 0.479 ± 0.193
0.719HisAsn: 0.719 ± 0.286
0.959HisPro: 0.959 ± 0.245
0.799HisGln: 0.799 ± 0.262
1.198HisArg: 1.198 ± 0.281
1.518HisSer: 1.518 ± 0.262
2.556HisThr: 2.556 ± 0.477
0.719HisVal: 0.719 ± 0.194
0.16HisTrp: 0.16 ± 0.115
0.879HisTyr: 0.879 ± 0.239
0.0HisXaa: 0.0 ± 0.0
Ile
7.03IleAla: 7.03 ± 0.779
0.639IleCys: 0.639 ± 0.233
4.633IleAsp: 4.633 ± 0.6
3.515IleGlu: 3.515 ± 0.568
1.757IlePhe: 1.757 ± 0.417
4.394IleGly: 4.394 ± 0.685
0.879IleHis: 0.879 ± 0.261
3.834IleIle: 3.834 ± 0.629
5.113IleLys: 5.113 ± 0.728
4.553IleLeu: 4.553 ± 0.667
1.118IleMet: 1.118 ± 0.282
4.633IleAsn: 4.633 ± 0.782
2.876IlePro: 2.876 ± 0.5
2.716IleGln: 2.716 ± 0.416
3.116IleArg: 3.116 ± 0.581
5.512IleSer: 5.512 ± 0.741
5.752IleThr: 5.752 ± 0.747
2.956IleVal: 2.956 ± 0.545
0.639IleTrp: 0.639 ± 0.23
2.476IleTyr: 2.476 ± 0.538
0.0IleXaa: 0.0 ± 0.0
Lys
6.551LysAla: 6.551 ± 0.737
0.24LysCys: 0.24 ± 0.137
3.834LysAsp: 3.834 ± 0.521
4.234LysGlu: 4.234 ± 0.637
2.317LysPhe: 2.317 ± 0.594
3.595LysGly: 3.595 ± 0.442
1.438LysHis: 1.438 ± 0.274
5.672LysIle: 5.672 ± 0.704
4.474LysLys: 4.474 ± 0.642
5.033LysLeu: 5.033 ± 0.555
1.757LysMet: 1.757 ± 0.397
3.275LysAsn: 3.275 ± 0.659
2.636LysPro: 2.636 ± 0.545
3.914LysGln: 3.914 ± 0.504
3.515LysArg: 3.515 ± 0.64
4.793LysSer: 4.793 ± 0.803
4.793LysThr: 4.793 ± 0.717
4.873LysVal: 4.873 ± 0.611
0.639LysTrp: 0.639 ± 0.226
2.157LysTyr: 2.157 ± 0.419
0.0LysXaa: 0.0 ± 0.0
Leu
7.988LeuAla: 7.988 ± 1.188
1.438LeuCys: 1.438 ± 0.423
6.95LeuAsp: 6.95 ± 0.718
3.994LeuGlu: 3.994 ± 0.577
2.397LeuPhe: 2.397 ± 0.5
6.151LeuGly: 6.151 ± 0.805
1.678LeuHis: 1.678 ± 0.369
6.151LeuIle: 6.151 ± 0.817
5.432LeuLys: 5.432 ± 0.612
5.911LeuLeu: 5.911 ± 0.879
2.077LeuMet: 2.077 ± 0.447
4.873LeuAsn: 4.873 ± 0.683
3.595LeuPro: 3.595 ± 0.514
4.314LeuGln: 4.314 ± 0.539
3.275LeuArg: 3.275 ± 0.57
7.03LeuSer: 7.03 ± 0.688
6.471LeuThr: 6.471 ± 0.725
4.713LeuVal: 4.713 ± 0.593
0.479LeuTrp: 0.479 ± 0.171
2.077LeuTyr: 2.077 ± 0.378
0.0LeuXaa: 0.0 ± 0.0
Met
2.556MetAla: 2.556 ± 0.582
0.08MetCys: 0.08 ± 0.068
1.757MetAsp: 1.757 ± 0.354
0.08MetGlu: 0.08 ± 0.091
0.799MetPhe: 0.799 ± 0.227
2.077MetGly: 2.077 ± 0.393
0.399MetHis: 0.399 ± 0.184
1.198MetIle: 1.198 ± 0.266
1.438MetLys: 1.438 ± 0.311
1.757MetLeu: 1.757 ± 0.479
0.559MetMet: 0.559 ± 0.191
1.678MetAsn: 1.678 ± 0.408
1.118MetPro: 1.118 ± 0.281
1.039MetGln: 1.039 ± 0.335
1.198MetArg: 1.198 ± 0.308
2.317MetSer: 2.317 ± 0.459
2.556MetThr: 2.556 ± 0.523
1.039MetVal: 1.039 ± 0.235
0.24MetTrp: 0.24 ± 0.157
0.479MetTyr: 0.479 ± 0.2
0.0MetXaa: 0.0 ± 0.0
Asn
4.873AsnAla: 4.873 ± 0.685
0.24AsnCys: 0.24 ± 0.129
2.397AsnAsp: 2.397 ± 0.445
2.476AsnGlu: 2.476 ± 0.421
1.997AsnPhe: 1.997 ± 0.322
3.355AsnGly: 3.355 ± 0.531
1.837AsnHis: 1.837 ± 0.334
2.077AsnIle: 2.077 ± 0.49
3.036AsnLys: 3.036 ± 0.558
5.113AsnLeu: 5.113 ± 0.699
1.118AsnMet: 1.118 ± 0.267
2.636AsnAsn: 2.636 ± 0.46
3.595AsnPro: 3.595 ± 0.709
3.275AsnGln: 3.275 ± 0.743
1.598AsnArg: 1.598 ± 0.36
3.755AsnSer: 3.755 ± 0.579
2.636AsnThr: 2.636 ± 0.604
2.556AsnVal: 2.556 ± 0.536
0.639AsnTrp: 0.639 ± 0.198
2.077AsnTyr: 2.077 ± 0.457
0.0AsnXaa: 0.0 ± 0.0
Pro
3.275ProAla: 3.275 ± 0.53
0.399ProCys: 0.399 ± 0.195
1.837ProAsp: 1.837 ± 0.375
1.598ProGlu: 1.598 ± 0.406
1.198ProPhe: 1.198 ± 0.262
0.08ProGly: 0.08 ± 0.084
0.639ProHis: 0.639 ± 0.216
2.796ProIle: 2.796 ± 0.503
2.796ProLys: 2.796 ± 0.542
3.515ProLeu: 3.515 ± 0.597
0.559ProMet: 0.559 ± 0.202
2.397ProAsn: 2.397 ± 0.421
1.518ProPro: 1.518 ± 0.38
1.358ProGln: 1.358 ± 0.354
1.438ProArg: 1.438 ± 0.263
2.556ProSer: 2.556 ± 0.607
3.515ProThr: 3.515 ± 0.547
1.757ProVal: 1.757 ± 0.344
0.24ProTrp: 0.24 ± 0.139
1.118ProTyr: 1.118 ± 0.33
0.0ProXaa: 0.0 ± 0.0
Gln
5.592GlnAla: 5.592 ± 0.702
0.479GlnCys: 0.479 ± 0.179
2.956GlnAsp: 2.956 ± 0.535
3.036GlnGlu: 3.036 ± 0.519
1.837GlnPhe: 1.837 ± 0.309
3.515GlnGly: 3.515 ± 0.516
0.719GlnHis: 0.719 ± 0.26
3.755GlnIle: 3.755 ± 0.634
3.435GlnLys: 3.435 ± 0.571
3.515GlnLeu: 3.515 ± 0.558
1.278GlnMet: 1.278 ± 0.283
2.556GlnAsn: 2.556 ± 0.512
1.518GlnPro: 1.518 ± 0.393
1.997GlnGln: 1.997 ± 0.538
1.757GlnArg: 1.757 ± 0.429
3.675GlnSer: 3.675 ± 0.745
2.956GlnThr: 2.956 ± 0.39
2.476GlnVal: 2.476 ± 0.464
0.24GlnTrp: 0.24 ± 0.155
1.198GlnTyr: 1.198 ± 0.328
0.0GlnXaa: 0.0 ± 0.0
Arg
4.474ArgAla: 4.474 ± 0.612
0.32ArgCys: 0.32 ± 0.138
2.237ArgAsp: 2.237 ± 0.461
2.317ArgGlu: 2.317 ± 0.322
1.757ArgPhe: 1.757 ± 0.396
1.358ArgGly: 1.358 ± 0.314
1.678ArgHis: 1.678 ± 0.333
3.275ArgIle: 3.275 ± 0.528
2.556ArgLys: 2.556 ± 0.485
4.234ArgLeu: 4.234 ± 0.553
1.039ArgMet: 1.039 ± 0.26
1.358ArgAsn: 1.358 ± 0.32
1.438ArgPro: 1.438 ± 0.38
2.796ArgGln: 2.796 ± 0.522
2.237ArgArg: 2.237 ± 0.426
2.636ArgSer: 2.636 ± 0.359
2.636ArgThr: 2.636 ± 0.409
2.636ArgVal: 2.636 ± 0.416
0.399ArgTrp: 0.399 ± 0.15
1.837ArgTyr: 1.837 ± 0.418
0.0ArgXaa: 0.0 ± 0.0
Ser
5.672SerAla: 5.672 ± 0.712
0.559SerCys: 0.559 ± 0.239
4.793SerAsp: 4.793 ± 0.704
4.394SerGlu: 4.394 ± 0.55
2.397SerPhe: 2.397 ± 0.438
4.394SerGly: 4.394 ± 0.654
2.397SerHis: 2.397 ± 0.513
3.675SerIle: 3.675 ± 0.661
3.755SerLys: 3.755 ± 0.421
5.432SerLeu: 5.432 ± 0.661
1.438SerMet: 1.438 ± 0.323
3.116SerAsn: 3.116 ± 0.762
1.358SerPro: 1.358 ± 0.399
4.314SerGln: 4.314 ± 0.83
2.636SerArg: 2.636 ± 0.427
3.435SerSer: 3.435 ± 0.973
3.116SerThr: 3.116 ± 0.469
5.193SerVal: 5.193 ± 0.841
0.799SerTrp: 0.799 ± 0.283
1.358SerTyr: 1.358 ± 0.344
0.0SerXaa: 0.0 ± 0.0
Thr
7.589ThrAla: 7.589 ± 0.868
0.32ThrCys: 0.32 ± 0.155
4.234ThrAsp: 4.234 ± 0.59
3.275ThrGlu: 3.275 ± 0.574
1.678ThrPhe: 1.678 ± 0.397
4.633ThrGly: 4.633 ± 0.718
1.997ThrHis: 1.997 ± 0.53
3.515ThrIle: 3.515 ± 0.468
5.272ThrLys: 5.272 ± 0.582
7.669ThrLeu: 7.669 ± 0.716
1.358ThrMet: 1.358 ± 0.322
3.355ThrAsn: 3.355 ± 0.657
3.116ThrPro: 3.116 ± 0.525
2.157ThrGln: 2.157 ± 0.44
1.757ThrArg: 1.757 ± 0.407
3.834ThrSer: 3.834 ± 0.794
3.994ThrThr: 3.994 ± 0.551
3.435ThrVal: 3.435 ± 0.526
0.559ThrTrp: 0.559 ± 0.175
1.278ThrTyr: 1.278 ± 0.29
0.0ThrXaa: 0.0 ± 0.0
Val
4.234ValAla: 4.234 ± 0.692
0.719ValCys: 0.719 ± 0.21
3.116ValAsp: 3.116 ± 0.425
2.476ValGlu: 2.476 ± 0.374
1.997ValPhe: 1.997 ± 0.408
4.553ValGly: 4.553 ± 0.643
0.879ValHis: 0.879 ± 0.274
5.512ValIle: 5.512 ± 0.798
3.116ValLys: 3.116 ± 0.537
5.033ValLeu: 5.033 ± 0.719
1.278ValMet: 1.278 ± 0.347
2.556ValAsn: 2.556 ± 0.423
1.917ValPro: 1.917 ± 0.459
2.796ValGln: 2.796 ± 0.405
3.036ValArg: 3.036 ± 0.518
3.834ValSer: 3.834 ± 0.535
3.994ValThr: 3.994 ± 0.777
3.275ValVal: 3.275 ± 0.677
1.118ValTrp: 1.118 ± 0.25
2.556ValTyr: 2.556 ± 0.371
0.0ValXaa: 0.0 ± 0.0
Trp
1.198TrpAla: 1.198 ± 0.246
0.32TrpCys: 0.32 ± 0.202
0.879TrpAsp: 0.879 ± 0.273
0.879TrpGlu: 0.879 ± 0.224
0.479TrpPhe: 0.479 ± 0.189
0.719TrpGly: 0.719 ± 0.247
0.16TrpHis: 0.16 ± 0.111
0.32TrpIle: 0.32 ± 0.163
0.639TrpLys: 0.639 ± 0.184
0.959TrpLeu: 0.959 ± 0.255
0.08TrpMet: 0.08 ± 0.079
0.479TrpAsn: 0.479 ± 0.162
0.0TrpPro: 0.0 ± 0.0
1.278TrpGln: 1.278 ± 0.477
0.559TrpArg: 0.559 ± 0.235
0.639TrpSer: 0.639 ± 0.266
0.639TrpThr: 0.639 ± 0.175
1.438TrpVal: 1.438 ± 0.446
0.24TrpTrp: 0.24 ± 0.131
0.32TrpTyr: 0.32 ± 0.138
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.116TyrAla: 3.116 ± 0.484
0.399TyrCys: 0.399 ± 0.217
2.237TyrAsp: 2.237 ± 0.465
1.917TyrGlu: 1.917 ± 0.357
0.799TyrPhe: 0.799 ± 0.242
2.237TyrGly: 2.237 ± 0.417
1.118TyrHis: 1.118 ± 0.265
2.476TyrIle: 2.476 ± 0.556
1.757TyrLys: 1.757 ± 0.264
3.515TyrLeu: 3.515 ± 0.622
0.719TyrMet: 0.719 ± 0.245
1.598TyrAsn: 1.598 ± 0.39
1.598TyrPro: 1.598 ± 0.403
1.678TyrGln: 1.678 ± 0.413
1.518TyrArg: 1.518 ± 0.298
1.917TyrSer: 1.917 ± 0.418
2.077TyrThr: 2.077 ± 0.466
1.518TyrVal: 1.518 ± 0.307
0.399TyrTrp: 0.399 ± 0.181
1.518TyrTyr: 1.518 ± 0.526
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (12519 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski