Amino acid dipepetide frequency for Mycobacterium phage Dreamboat

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.455AlaAla: 14.455 ± 1.611
0.935AlaCys: 0.935 ± 0.231
6.916AlaAsp: 6.916 ± 0.794
5.857AlaGlu: 5.857 ± 0.695
3.115AlaPhe: 3.115 ± 0.556
7.788AlaGly: 7.788 ± 0.695
1.62AlaHis: 1.62 ± 0.345
4.673AlaIle: 4.673 ± 0.69
4.112AlaLys: 4.112 ± 0.525
9.283AlaLeu: 9.283 ± 0.873
2.368AlaMet: 2.368 ± 0.469
2.492AlaAsn: 2.492 ± 0.416
4.86AlaPro: 4.86 ± 0.744
2.928AlaGln: 2.928 ± 0.495
6.48AlaArg: 6.48 ± 0.59
5.109AlaSer: 5.109 ± 0.647
6.293AlaThr: 6.293 ± 0.619
9.097AlaVal: 9.097 ± 0.759
1.869AlaTrp: 1.869 ± 0.333
2.617AlaTyr: 2.617 ± 0.384
0.0AlaXaa: 0.0 ± 0.0
Cys
0.872CysAla: 0.872 ± 0.237
0.062CysCys: 0.062 ± 0.072
0.374CysAsp: 0.374 ± 0.138
0.81CysGlu: 0.81 ± 0.215
0.125CysPhe: 0.125 ± 0.088
0.748CysGly: 0.748 ± 0.225
0.125CysHis: 0.125 ± 0.088
0.312CysIle: 0.312 ± 0.163
0.249CysLys: 0.249 ± 0.16
0.374CysLeu: 0.374 ± 0.164
0.062CysMet: 0.062 ± 0.059
0.374CysAsn: 0.374 ± 0.145
0.312CysPro: 0.312 ± 0.146
0.187CysGln: 0.187 ± 0.113
0.498CysArg: 0.498 ± 0.19
0.436CysSer: 0.436 ± 0.161
0.374CysThr: 0.374 ± 0.165
0.312CysVal: 0.312 ± 0.133
0.187CysTrp: 0.187 ± 0.109
0.125CysTyr: 0.125 ± 0.094
0.0CysXaa: 0.0 ± 0.0
Asp
6.231AspAla: 6.231 ± 0.776
0.561AspCys: 0.561 ± 0.174
4.05AspAsp: 4.05 ± 0.502
3.676AspGlu: 3.676 ± 0.5
2.305AspPhe: 2.305 ± 0.382
6.417AspGly: 6.417 ± 0.614
1.059AspHis: 1.059 ± 0.26
2.741AspIle: 2.741 ± 0.419
2.43AspLys: 2.43 ± 0.371
6.978AspLeu: 6.978 ± 0.784
0.997AspMet: 0.997 ± 0.195
1.682AspAsn: 1.682 ± 0.347
4.735AspPro: 4.735 ± 0.522
1.62AspGln: 1.62 ± 0.394
3.364AspArg: 3.364 ± 0.416
3.24AspSer: 3.24 ± 0.539
3.863AspThr: 3.863 ± 0.448
4.174AspVal: 4.174 ± 0.563
1.869AspTrp: 1.869 ± 0.307
1.869AspTyr: 1.869 ± 0.346
0.0AspXaa: 0.0 ± 0.0
Glu
6.355GluAla: 6.355 ± 0.778
0.187GluCys: 0.187 ± 0.143
4.673GluAsp: 4.673 ± 0.673
4.922GluGlu: 4.922 ± 0.59
2.305GluPhe: 2.305 ± 0.37
3.988GluGly: 3.988 ± 0.473
1.682GluHis: 1.682 ± 0.354
3.614GluIle: 3.614 ± 0.511
2.43GluLys: 2.43 ± 0.43
7.04GluLeu: 7.04 ± 0.631
1.433GluMet: 1.433 ± 0.252
1.745GluAsn: 1.745 ± 0.317
2.741GluPro: 2.741 ± 0.488
2.056GluGln: 2.056 ± 0.342
4.361GluArg: 4.361 ± 0.626
3.364GluSer: 3.364 ± 0.436
3.988GluThr: 3.988 ± 0.546
4.984GluVal: 4.984 ± 0.597
1.745GluTrp: 1.745 ± 0.355
2.43GluTyr: 2.43 ± 0.545
0.0GluXaa: 0.0 ± 0.0
Phe
2.679PheAla: 2.679 ± 0.393
0.249PheCys: 0.249 ± 0.136
2.804PheAsp: 2.804 ± 0.344
2.243PheGlu: 2.243 ± 0.394
0.498PhePhe: 0.498 ± 0.161
3.364PheGly: 3.364 ± 0.472
0.623PheHis: 0.623 ± 0.277
1.495PheIle: 1.495 ± 0.307
1.433PheLys: 1.433 ± 0.318
2.555PheLeu: 2.555 ± 0.567
0.561PheMet: 0.561 ± 0.204
1.184PheAsn: 1.184 ± 0.247
1.433PhePro: 1.433 ± 0.353
0.997PheGln: 0.997 ± 0.282
1.62PheArg: 1.62 ± 0.394
1.994PheSer: 1.994 ± 0.475
2.056PheThr: 2.056 ± 0.378
1.869PheVal: 1.869 ± 0.34
0.561PheTrp: 0.561 ± 0.19
1.059PheTyr: 1.059 ± 0.246
0.0PheXaa: 0.0 ± 0.0
Gly
6.978GlyAla: 6.978 ± 0.957
0.748GlyCys: 0.748 ± 0.236
5.545GlyAsp: 5.545 ± 0.46
4.984GlyGlu: 4.984 ± 0.524
3.178GlyPhe: 3.178 ± 0.548
8.847GlyGly: 8.847 ± 2.204
1.807GlyHis: 1.807 ± 0.393
4.361GlyIle: 4.361 ± 0.762
3.863GlyLys: 3.863 ± 0.556
7.539GlyLeu: 7.539 ± 0.879
1.682GlyMet: 1.682 ± 0.303
3.551GlyAsn: 3.551 ± 0.457
3.988GlyPro: 3.988 ± 0.627
2.617GlyGln: 2.617 ± 0.381
4.548GlyArg: 4.548 ± 0.502
6.355GlySer: 6.355 ± 0.903
4.673GlyThr: 4.673 ± 0.676
5.732GlyVal: 5.732 ± 0.661
2.492GlyTrp: 2.492 ± 0.38
2.866GlyTyr: 2.866 ± 0.517
0.0GlyXaa: 0.0 ± 0.0
His
1.62HisAla: 1.62 ± 0.334
0.187HisCys: 0.187 ± 0.097
0.997HisAsp: 0.997 ± 0.254
1.745HisGlu: 1.745 ± 0.404
0.872HisPhe: 0.872 ± 0.187
1.682HisGly: 1.682 ± 0.369
0.748HisHis: 0.748 ± 0.216
0.81HisIle: 0.81 ± 0.215
0.997HisLys: 0.997 ± 0.31
1.308HisLeu: 1.308 ± 0.324
0.187HisMet: 0.187 ± 0.133
0.187HisAsn: 0.187 ± 0.11
1.433HisPro: 1.433 ± 0.293
1.246HisGln: 1.246 ± 0.306
1.246HisArg: 1.246 ± 0.277
0.685HisSer: 0.685 ± 0.226
0.997HisThr: 0.997 ± 0.241
1.558HisVal: 1.558 ± 0.325
0.374HisTrp: 0.374 ± 0.152
0.81HisTyr: 0.81 ± 0.252
0.0HisXaa: 0.0 ± 0.0
Ile
6.106IleAla: 6.106 ± 0.77
0.187IleCys: 0.187 ± 0.11
3.551IleAsp: 3.551 ± 0.405
3.988IleGlu: 3.988 ± 0.508
0.81IlePhe: 0.81 ± 0.255
4.174IleGly: 4.174 ± 0.526
0.872IleHis: 0.872 ± 0.206
1.745IleIle: 1.745 ± 0.392
2.056IleLys: 2.056 ± 0.376
3.489IleLeu: 3.489 ± 0.457
0.81IleMet: 0.81 ± 0.213
1.745IleAsn: 1.745 ± 0.331
3.178IlePro: 3.178 ± 0.433
1.62IleGln: 1.62 ± 0.392
3.489IleArg: 3.489 ± 0.468
3.178IleSer: 3.178 ± 0.509
3.24IleThr: 3.24 ± 0.448
3.302IleVal: 3.302 ± 0.551
0.623IleTrp: 0.623 ± 0.182
1.62IleTyr: 1.62 ± 0.271
0.0IleXaa: 0.0 ± 0.0
Lys
4.112LysAla: 4.112 ± 0.581
0.312LysCys: 0.312 ± 0.144
2.368LysAsp: 2.368 ± 0.421
1.994LysGlu: 1.994 ± 0.405
1.558LysPhe: 1.558 ± 0.339
2.43LysGly: 2.43 ± 0.408
0.997LysHis: 0.997 ± 0.269
2.492LysIle: 2.492 ± 0.446
1.62LysLys: 1.62 ± 0.375
3.676LysLeu: 3.676 ± 0.487
0.872LysMet: 0.872 ± 0.181
1.62LysAsn: 1.62 ± 0.336
2.555LysPro: 2.555 ± 0.444
1.433LysGln: 1.433 ± 0.333
3.364LysArg: 3.364 ± 0.455
2.305LysSer: 2.305 ± 0.426
1.994LysThr: 1.994 ± 0.411
2.991LysVal: 2.991 ± 0.49
0.935LysTrp: 0.935 ± 0.242
0.81LysTyr: 0.81 ± 0.269
0.0LysXaa: 0.0 ± 0.0
Leu
9.72LeuAla: 9.72 ± 0.955
0.436LeuCys: 0.436 ± 0.173
5.607LeuAsp: 5.607 ± 0.676
5.483LeuGlu: 5.483 ± 0.666
1.931LeuPhe: 1.931 ± 0.447
7.539LeuGly: 7.539 ± 0.947
1.682LeuHis: 1.682 ± 0.357
5.296LeuIle: 5.296 ± 0.588
3.801LeuLys: 3.801 ± 0.524
5.358LeuLeu: 5.358 ± 0.586
1.931LeuMet: 1.931 ± 0.331
2.741LeuAsn: 2.741 ± 0.381
5.483LeuPro: 5.483 ± 0.567
2.804LeuGln: 2.804 ± 0.444
5.981LeuArg: 5.981 ± 0.625
5.421LeuSer: 5.421 ± 0.46
6.355LeuThr: 6.355 ± 0.526
4.299LeuVal: 4.299 ± 0.649
1.121LeuTrp: 1.121 ± 0.313
2.305LeuTyr: 2.305 ± 0.39
0.0LeuXaa: 0.0 ± 0.0
Met
2.492MetAla: 2.492 ± 0.371
0.062MetCys: 0.062 ± 0.066
1.308MetAsp: 1.308 ± 0.289
1.308MetGlu: 1.308 ± 0.329
0.685MetPhe: 0.685 ± 0.205
1.62MetGly: 1.62 ± 0.324
0.249MetHis: 0.249 ± 0.133
0.498MetIle: 0.498 ± 0.194
0.935MetLys: 0.935 ± 0.243
1.308MetLeu: 1.308 ± 0.276
0.249MetMet: 0.249 ± 0.114
1.059MetAsn: 1.059 ± 0.162
1.308MetPro: 1.308 ± 0.292
0.498MetGln: 0.498 ± 0.168
1.433MetArg: 1.433 ± 0.303
2.118MetSer: 2.118 ± 0.348
2.492MetThr: 2.492 ± 0.413
1.059MetVal: 1.059 ± 0.233
0.187MetTrp: 0.187 ± 0.111
0.374MetTyr: 0.374 ± 0.159
0.0MetXaa: 0.0 ± 0.0
Asn
3.24AsnAla: 3.24 ± 0.562
0.249AsnCys: 0.249 ± 0.132
2.181AsnAsp: 2.181 ± 0.374
1.931AsnGlu: 1.931 ± 0.314
0.872AsnPhe: 0.872 ± 0.263
3.364AsnGly: 3.364 ± 0.517
0.872AsnHis: 0.872 ± 0.272
1.495AsnIle: 1.495 ± 0.335
0.748AsnLys: 0.748 ± 0.218
2.118AsnLeu: 2.118 ± 0.346
0.623AsnMet: 0.623 ± 0.173
0.748AsnAsn: 0.748 ± 0.191
2.804AsnPro: 2.804 ± 0.392
1.059AsnGln: 1.059 ± 0.274
1.433AsnArg: 1.433 ± 0.312
1.682AsnSer: 1.682 ± 0.431
2.118AsnThr: 2.118 ± 0.346
2.368AsnVal: 2.368 ± 0.446
0.748AsnTrp: 0.748 ± 0.193
1.433AsnTyr: 1.433 ± 0.275
0.0AsnXaa: 0.0 ± 0.0
Pro
5.981ProAla: 5.981 ± 0.663
0.374ProCys: 0.374 ± 0.156
4.299ProAsp: 4.299 ± 0.432
4.735ProGlu: 4.735 ± 0.608
2.118ProPhe: 2.118 ± 0.43
4.984ProGly: 4.984 ± 0.696
0.872ProHis: 0.872 ± 0.233
2.368ProIle: 2.368 ± 0.377
2.056ProLys: 2.056 ± 0.321
4.548ProLeu: 4.548 ± 0.588
0.935ProMet: 0.935 ± 0.252
1.62ProAsn: 1.62 ± 0.307
2.741ProPro: 2.741 ± 0.599
1.371ProGln: 1.371 ± 0.299
2.679ProArg: 2.679 ± 0.439
3.801ProSer: 3.801 ± 0.551
3.863ProThr: 3.863 ± 0.448
4.237ProVal: 4.237 ± 0.497
0.685ProTrp: 0.685 ± 0.282
1.682ProTyr: 1.682 ± 0.373
0.0ProXaa: 0.0 ± 0.0
Gln
2.866GlnAla: 2.866 ± 0.592
0.062GlnCys: 0.062 ± 0.064
1.371GlnAsp: 1.371 ± 0.371
1.495GlnGlu: 1.495 ± 0.279
1.308GlnPhe: 1.308 ± 0.278
2.118GlnGly: 2.118 ± 0.352
0.498GlnHis: 0.498 ± 0.149
2.866GlnIle: 2.866 ± 0.5
1.059GlnLys: 1.059 ± 0.28
3.738GlnLeu: 3.738 ± 0.53
0.935GlnMet: 0.935 ± 0.303
0.623GlnAsn: 0.623 ± 0.179
1.869GlnPro: 1.869 ± 0.366
1.807GlnGln: 1.807 ± 0.38
1.745GlnArg: 1.745 ± 0.397
1.558GlnSer: 1.558 ± 0.335
1.807GlnThr: 1.807 ± 0.332
2.555GlnVal: 2.555 ± 0.415
0.623GlnTrp: 0.623 ± 0.166
0.561GlnTyr: 0.561 ± 0.163
0.0GlnXaa: 0.0 ± 0.0
Arg
6.106ArgAla: 6.106 ± 0.752
0.685ArgCys: 0.685 ± 0.189
2.928ArgAsp: 2.928 ± 0.385
4.984ArgGlu: 4.984 ± 0.724
1.745ArgPhe: 1.745 ± 0.369
5.047ArgGly: 5.047 ± 0.632
1.246ArgHis: 1.246 ± 0.283
2.866ArgIle: 2.866 ± 0.438
2.928ArgLys: 2.928 ± 0.51
5.607ArgLeu: 5.607 ± 0.697
2.368ArgMet: 2.368 ± 0.395
2.492ArgAsn: 2.492 ± 0.473
2.679ArgPro: 2.679 ± 0.425
1.869ArgGln: 1.869 ± 0.329
5.296ArgArg: 5.296 ± 0.68
3.738ArgSer: 3.738 ± 0.424
2.928ArgThr: 2.928 ± 0.509
5.047ArgVal: 5.047 ± 0.554
1.371ArgTrp: 1.371 ± 0.307
1.807ArgTyr: 1.807 ± 0.308
0.0ArgXaa: 0.0 ± 0.0
Ser
6.355SerAla: 6.355 ± 0.918
0.436SerCys: 0.436 ± 0.185
2.928SerAsp: 2.928 ± 0.398
3.551SerGlu: 3.551 ± 0.51
1.994SerPhe: 1.994 ± 0.311
7.04SerGly: 7.04 ± 0.972
1.371SerHis: 1.371 ± 0.353
2.741SerIle: 2.741 ± 0.449
2.617SerLys: 2.617 ± 0.47
5.109SerLeu: 5.109 ± 0.66
1.433SerMet: 1.433 ± 0.262
2.056SerAsn: 2.056 ± 0.403
3.24SerPro: 3.24 ± 0.553
1.869SerGln: 1.869 ± 0.315
3.115SerArg: 3.115 ± 0.368
3.676SerSer: 3.676 ± 0.692
3.24SerThr: 3.24 ± 0.506
3.676SerVal: 3.676 ± 0.485
1.308SerTrp: 1.308 ± 0.354
1.059SerTyr: 1.059 ± 0.238
0.0SerXaa: 0.0 ± 0.0
Thr
6.168ThrAla: 6.168 ± 0.808
0.249ThrCys: 0.249 ± 0.136
4.05ThrAsp: 4.05 ± 0.537
4.299ThrGlu: 4.299 ± 0.554
1.994ThrPhe: 1.994 ± 0.401
6.667ThrGly: 6.667 ± 0.734
1.059ThrHis: 1.059 ± 0.307
3.115ThrIle: 3.115 ± 0.555
2.741ThrLys: 2.741 ± 0.391
5.545ThrLeu: 5.545 ± 0.591
1.308ThrMet: 1.308 ± 0.26
1.807ThrAsn: 1.807 ± 0.325
3.863ThrPro: 3.863 ± 0.529
1.682ThrGln: 1.682 ± 0.339
3.489ThrArg: 3.489 ± 0.676
3.364ThrSer: 3.364 ± 0.526
4.361ThrThr: 4.361 ± 0.694
4.798ThrVal: 4.798 ± 0.602
1.371ThrTrp: 1.371 ± 0.329
1.994ThrTyr: 1.994 ± 0.35
0.0ThrXaa: 0.0 ± 0.0
Val
7.04ValAla: 7.04 ± 0.7
0.561ValCys: 0.561 ± 0.177
5.421ValAsp: 5.421 ± 0.571
4.922ValGlu: 4.922 ± 0.577
2.243ValPhe: 2.243 ± 0.345
4.486ValGly: 4.486 ± 0.617
1.184ValHis: 1.184 ± 0.233
3.24ValIle: 3.24 ± 0.406
3.115ValLys: 3.115 ± 0.561
5.607ValLeu: 5.607 ± 0.504
1.246ValMet: 1.246 ± 0.294
2.741ValAsn: 2.741 ± 0.407
4.237ValPro: 4.237 ± 0.51
1.994ValGln: 1.994 ± 0.372
4.984ValArg: 4.984 ± 0.735
4.237ValSer: 4.237 ± 0.496
5.483ValThr: 5.483 ± 0.595
5.358ValVal: 5.358 ± 0.705
1.121ValTrp: 1.121 ± 0.283
2.243ValTyr: 2.243 ± 0.366
0.0ValXaa: 0.0 ± 0.0
Trp
1.371TrpAla: 1.371 ± 0.321
0.125TrpCys: 0.125 ± 0.08
1.433TrpAsp: 1.433 ± 0.317
1.059TrpGlu: 1.059 ± 0.239
0.997TrpPhe: 0.997 ± 0.258
1.558TrpGly: 1.558 ± 0.295
0.498TrpHis: 0.498 ± 0.191
1.371TrpIle: 1.371 ± 0.245
0.374TrpLys: 0.374 ± 0.201
1.682TrpLeu: 1.682 ± 0.302
0.498TrpMet: 0.498 ± 0.175
0.374TrpAsn: 0.374 ± 0.14
0.81TrpPro: 0.81 ± 0.282
0.748TrpGln: 0.748 ± 0.197
1.558TrpArg: 1.558 ± 0.326
1.121TrpSer: 1.121 ± 0.257
1.62TrpThr: 1.62 ± 0.469
2.056TrpVal: 2.056 ± 0.329
0.498TrpTrp: 0.498 ± 0.188
0.312TrpTyr: 0.312 ± 0.131
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.118TyrAla: 2.118 ± 0.374
0.249TyrCys: 0.249 ± 0.132
1.059TyrAsp: 1.059 ± 0.294
2.056TyrGlu: 2.056 ± 0.37
0.623TyrPhe: 0.623 ± 0.181
2.555TyrGly: 2.555 ± 0.392
0.623TyrHis: 0.623 ± 0.188
1.682TyrIle: 1.682 ± 0.304
0.872TyrLys: 0.872 ± 0.206
2.43TyrLeu: 2.43 ± 0.378
0.685TyrMet: 0.685 ± 0.165
1.246TyrAsn: 1.246 ± 0.319
1.682TyrPro: 1.682 ± 0.366
1.059TyrGln: 1.059 ± 0.248
3.053TyrArg: 3.053 ± 0.449
1.433TyrSer: 1.433 ± 0.285
2.181TyrThr: 2.181 ± 0.406
1.994TyrVal: 1.994 ± 0.356
0.374TyrTrp: 0.374 ± 0.147
0.748TyrTyr: 0.748 ± 0.23
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 94 proteins (16051 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski