Amino acid dipepetide frequency for Enterobacteria phage HK140

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.393AlaAla: 10.393 ± 1.433
0.879AlaCys: 0.879 ± 0.272
7.195AlaAsp: 7.195 ± 0.626
6.875AlaGlu: 6.875 ± 0.75
3.198AlaPhe: 3.198 ± 0.513
7.515AlaGly: 7.515 ± 0.675
1.919AlaHis: 1.919 ± 0.441
7.754AlaIle: 7.754 ± 0.644
5.036AlaLys: 5.036 ± 0.721
6.555AlaLeu: 6.555 ± 0.732
3.517AlaMet: 3.517 ± 0.588
4.157AlaAsn: 4.157 ± 0.469
1.599AlaPro: 1.599 ± 0.306
4.477AlaGln: 4.477 ± 0.634
4.317AlaArg: 4.317 ± 0.585
6.875AlaSer: 6.875 ± 0.772
4.876AlaThr: 4.876 ± 0.843
5.836AlaVal: 5.836 ± 0.681
1.439AlaTrp: 1.439 ± 0.304
2.638AlaTyr: 2.638 ± 0.442
0.0AlaXaa: 0.0 ± 0.0
Cys
1.119CysAla: 1.119 ± 0.339
0.24CysCys: 0.24 ± 0.145
0.959CysAsp: 0.959 ± 0.263
0.959CysGlu: 0.959 ± 0.282
0.08CysPhe: 0.08 ± 0.077
0.799CysGly: 0.799 ± 0.274
0.32CysHis: 0.32 ± 0.185
0.879CysIle: 0.879 ± 0.26
0.959CysLys: 0.959 ± 0.259
0.959CysLeu: 0.959 ± 0.268
0.4CysMet: 0.4 ± 0.196
0.879CysAsn: 0.879 ± 0.244
0.08CysPro: 0.08 ± 0.088
0.32CysGln: 0.32 ± 0.151
0.879CysArg: 0.879 ± 0.309
0.959CysSer: 0.959 ± 0.37
0.48CysThr: 0.48 ± 0.178
0.56CysVal: 0.56 ± 0.248
0.32CysTrp: 0.32 ± 0.218
0.32CysTyr: 0.32 ± 0.149
0.0CysXaa: 0.0 ± 0.0
Asp
6.236AspAla: 6.236 ± 0.667
0.719AspCys: 0.719 ± 0.254
3.517AspAsp: 3.517 ± 0.67
3.438AspGlu: 3.438 ± 0.576
2.318AspPhe: 2.318 ± 0.497
5.276AspGly: 5.276 ± 0.719
0.4AspHis: 0.4 ± 0.186
3.278AspIle: 3.278 ± 0.473
3.278AspLys: 3.278 ± 0.454
5.356AspLeu: 5.356 ± 0.839
1.759AspMet: 1.759 ± 0.358
2.878AspAsn: 2.878 ± 0.499
1.839AspPro: 1.839 ± 0.332
1.919AspGln: 1.919 ± 0.399
3.278AspArg: 3.278 ± 0.549
3.517AspSer: 3.517 ± 0.533
2.318AspThr: 2.318 ± 0.52
3.597AspVal: 3.597 ± 0.638
1.199AspTrp: 1.199 ± 0.333
2.158AspTyr: 2.158 ± 0.419
0.0AspXaa: 0.0 ± 0.0
Glu
4.876GluAla: 4.876 ± 0.632
1.279GluCys: 1.279 ± 0.374
2.478GluAsp: 2.478 ± 0.37
3.917GluGlu: 3.917 ± 0.758
1.999GluPhe: 1.999 ± 0.417
3.757GluGly: 3.757 ± 0.688
1.199GluHis: 1.199 ± 0.283
3.597GluIle: 3.597 ± 0.598
4.157GluLys: 4.157 ± 0.594
5.196GluLeu: 5.196 ± 0.663
2.158GluMet: 2.158 ± 0.442
3.198GluAsn: 3.198 ± 0.43
2.238GluPro: 2.238 ± 0.353
3.757GluGln: 3.757 ± 0.527
4.717GluArg: 4.717 ± 0.744
4.237GluSer: 4.237 ± 0.412
2.398GluThr: 2.398 ± 0.435
3.837GluVal: 3.837 ± 0.573
1.039GluTrp: 1.039 ± 0.354
1.759GluTyr: 1.759 ± 0.377
0.0GluXaa: 0.0 ± 0.0
Phe
3.677PheAla: 3.677 ± 0.608
0.64PheCys: 0.64 ± 0.185
2.798PheAsp: 2.798 ± 0.459
1.839PheGlu: 1.839 ± 0.365
0.64PhePhe: 0.64 ± 0.22
3.278PheGly: 3.278 ± 0.427
0.32PheHis: 0.32 ± 0.134
1.519PheIle: 1.519 ± 0.432
1.679PheLys: 1.679 ± 0.334
1.439PheLeu: 1.439 ± 0.421
0.799PheMet: 0.799 ± 0.213
1.199PheAsn: 1.199 ± 0.276
1.599PhePro: 1.599 ± 0.393
0.879PheGln: 0.879 ± 0.25
1.679PheArg: 1.679 ± 0.367
2.478PheSer: 2.478 ± 0.474
2.718PheThr: 2.718 ± 0.518
1.839PheVal: 1.839 ± 0.308
0.719PheTrp: 0.719 ± 0.217
0.4PheTyr: 0.4 ± 0.157
0.0PheXaa: 0.0 ± 0.0
Gly
6.395GlyAla: 6.395 ± 0.843
0.799GlyCys: 0.799 ± 0.269
4.717GlyAsp: 4.717 ± 0.72
4.317GlyGlu: 4.317 ± 0.503
2.798GlyPhe: 2.798 ± 0.509
5.836GlyGly: 5.836 ± 0.863
0.64GlyHis: 0.64 ± 0.26
4.477GlyIle: 4.477 ± 0.513
5.996GlyLys: 5.996 ± 0.74
6.156GlyLeu: 6.156 ± 0.824
2.238GlyMet: 2.238 ± 0.45
4.237GlyAsn: 4.237 ± 0.759
1.279GlyPro: 1.279 ± 0.289
2.638GlyGln: 2.638 ± 0.453
4.397GlyArg: 4.397 ± 0.55
4.797GlySer: 4.797 ± 0.625
4.797GlyThr: 4.797 ± 0.713
5.196GlyVal: 5.196 ± 0.777
1.119GlyTrp: 1.119 ± 0.307
2.318GlyTyr: 2.318 ± 0.33
0.0GlyXaa: 0.0 ± 0.0
His
1.359HisAla: 1.359 ± 0.397
0.08HisCys: 0.08 ± 0.077
1.039HisAsp: 1.039 ± 0.313
0.48HisGlu: 0.48 ± 0.248
0.48HisPhe: 0.48 ± 0.224
1.679HisGly: 1.679 ± 0.396
0.16HisHis: 0.16 ± 0.134
0.879HisIle: 0.879 ± 0.24
0.799HisLys: 0.799 ± 0.328
0.799HisLeu: 0.799 ± 0.302
0.24HisMet: 0.24 ± 0.142
0.56HisAsn: 0.56 ± 0.208
0.879HisPro: 0.879 ± 0.279
0.799HisGln: 0.799 ± 0.303
1.199HisArg: 1.199 ± 0.342
1.119HisSer: 1.119 ± 0.361
0.48HisThr: 0.48 ± 0.209
0.56HisVal: 0.56 ± 0.23
0.48HisTrp: 0.48 ± 0.186
0.799HisTyr: 0.799 ± 0.219
0.0HisXaa: 0.0 ± 0.0
Ile
6.555IleAla: 6.555 ± 0.715
0.48IleCys: 0.48 ± 0.202
3.997IleAsp: 3.997 ± 0.655
3.997IleGlu: 3.997 ± 0.571
1.519IlePhe: 1.519 ± 0.306
3.358IleGly: 3.358 ± 0.616
1.199IleHis: 1.199 ± 0.277
3.917IleIle: 3.917 ± 0.547
2.958IleLys: 2.958 ± 0.461
4.956IleLeu: 4.956 ± 0.641
1.279IleMet: 1.279 ± 0.384
3.438IleAsn: 3.438 ± 0.43
2.478IlePro: 2.478 ± 0.348
3.677IleGln: 3.677 ± 0.489
3.517IleArg: 3.517 ± 0.555
5.036IleSer: 5.036 ± 0.67
4.157IleThr: 4.157 ± 0.517
2.798IleVal: 2.798 ± 0.515
1.039IleTrp: 1.039 ± 0.393
1.519IleTyr: 1.519 ± 0.313
0.0IleXaa: 0.0 ± 0.0
Lys
5.436LysAla: 5.436 ± 0.732
0.48LysCys: 0.48 ± 0.256
3.038LysAsp: 3.038 ± 0.515
3.757LysGlu: 3.757 ± 0.576
1.119LysPhe: 1.119 ± 0.344
3.198LysGly: 3.198 ± 0.496
0.719LysHis: 0.719 ± 0.273
2.958LysIle: 2.958 ± 0.53
2.878LysLys: 2.878 ± 0.745
6.156LysLeu: 6.156 ± 0.73
1.519LysMet: 1.519 ± 0.392
2.079LysAsn: 2.079 ± 0.377
2.478LysPro: 2.478 ± 0.554
2.958LysGln: 2.958 ± 0.502
3.997LysArg: 3.997 ± 0.718
4.477LysSer: 4.477 ± 0.741
3.438LysThr: 3.438 ± 0.534
3.597LysVal: 3.597 ± 0.561
1.119LysTrp: 1.119 ± 0.295
1.919LysTyr: 1.919 ± 0.348
0.0LysXaa: 0.0 ± 0.0
Leu
8.394LeuAla: 8.394 ± 0.888
1.279LeuCys: 1.279 ± 0.382
4.637LeuAsp: 4.637 ± 0.583
5.196LeuGlu: 5.196 ± 0.599
2.238LeuPhe: 2.238 ± 0.401
4.237LeuGly: 4.237 ± 0.722
1.039LeuHis: 1.039 ± 0.316
4.637LeuIle: 4.637 ± 0.727
4.157LeuLys: 4.157 ± 0.513
5.756LeuLeu: 5.756 ± 0.687
1.599LeuMet: 1.599 ± 0.367
3.837LeuAsn: 3.837 ± 0.547
3.517LeuPro: 3.517 ± 0.471
3.597LeuGln: 3.597 ± 0.733
5.356LeuArg: 5.356 ± 0.672
6.236LeuSer: 6.236 ± 0.866
5.356LeuThr: 5.356 ± 0.509
4.797LeuVal: 4.797 ± 0.584
0.56LeuTrp: 0.56 ± 0.221
2.238LeuTyr: 2.238 ± 0.482
0.0LeuXaa: 0.0 ± 0.0
Met
3.198MetAla: 3.198 ± 0.488
0.48MetCys: 0.48 ± 0.197
1.359MetAsp: 1.359 ± 0.279
1.359MetGlu: 1.359 ± 0.35
0.719MetPhe: 0.719 ± 0.327
1.759MetGly: 1.759 ± 0.301
0.32MetHis: 0.32 ± 0.148
1.279MetIle: 1.279 ± 0.34
2.079MetLys: 2.079 ± 0.4
2.398MetLeu: 2.398 ± 0.433
0.56MetMet: 0.56 ± 0.176
0.879MetAsn: 0.879 ± 0.27
1.599MetPro: 1.599 ± 0.317
1.359MetGln: 1.359 ± 0.366
1.839MetArg: 1.839 ± 0.389
1.919MetSer: 1.919 ± 0.355
2.238MetThr: 2.238 ± 0.327
1.599MetVal: 1.599 ± 0.343
0.24MetTrp: 0.24 ± 0.132
0.4MetTyr: 0.4 ± 0.168
0.0MetXaa: 0.0 ± 0.0
Asn
5.036AsnAla: 5.036 ± 0.674
0.32AsnCys: 0.32 ± 0.162
2.318AsnAsp: 2.318 ± 0.401
2.718AsnGlu: 2.718 ± 0.476
0.959AsnPhe: 0.959 ± 0.362
4.876AsnGly: 4.876 ± 0.622
0.719AsnHis: 0.719 ± 0.244
2.878AsnIle: 2.878 ± 0.416
2.558AsnLys: 2.558 ± 0.472
3.038AsnLeu: 3.038 ± 0.418
1.199AsnMet: 1.199 ± 0.341
1.999AsnAsn: 1.999 ± 0.447
2.638AsnPro: 2.638 ± 0.505
2.238AsnGln: 2.238 ± 0.4
2.238AsnArg: 2.238 ± 0.487
2.878AsnSer: 2.878 ± 0.409
3.198AsnThr: 3.198 ± 0.566
2.798AsnVal: 2.798 ± 0.616
0.56AsnTrp: 0.56 ± 0.185
1.279AsnTyr: 1.279 ± 0.383
0.0AsnXaa: 0.0 ± 0.0
Pro
3.677ProAla: 3.677 ± 0.575
0.4ProCys: 0.4 ± 0.157
1.839ProAsp: 1.839 ± 0.37
2.798ProGlu: 2.798 ± 0.538
1.519ProPhe: 1.519 ± 0.386
2.798ProGly: 2.798 ± 0.365
0.64ProHis: 0.64 ± 0.215
1.439ProIle: 1.439 ± 0.395
2.318ProLys: 2.318 ± 0.478
2.558ProLeu: 2.558 ± 0.69
1.439ProMet: 1.439 ± 0.419
1.519ProAsn: 1.519 ± 0.379
1.279ProPro: 1.279 ± 0.333
1.999ProGln: 1.999 ± 0.453
2.079ProArg: 2.079 ± 0.458
2.158ProSer: 2.158 ± 0.409
1.919ProThr: 1.919 ± 0.395
3.038ProVal: 3.038 ± 0.501
0.48ProTrp: 0.48 ± 0.246
1.039ProTyr: 1.039 ± 0.272
0.0ProXaa: 0.0 ± 0.0
Gln
4.876GlnAla: 4.876 ± 0.724
0.56GlnCys: 0.56 ± 0.192
1.439GlnAsp: 1.439 ± 0.331
2.318GlnGlu: 2.318 ± 0.326
1.839GlnPhe: 1.839 ± 0.372
2.878GlnGly: 2.878 ± 0.466
0.48GlnHis: 0.48 ± 0.212
3.038GlnIle: 3.038 ± 0.417
2.958GlnLys: 2.958 ± 0.64
4.237GlnLeu: 4.237 ± 0.491
1.119GlnMet: 1.119 ± 0.302
2.478GlnAsn: 2.478 ± 0.511
1.759GlnPro: 1.759 ± 0.344
2.958GlnGln: 2.958 ± 0.485
2.478GlnArg: 2.478 ± 0.42
2.718GlnSer: 2.718 ± 0.547
2.878GlnThr: 2.878 ± 0.567
2.638GlnVal: 2.638 ± 0.444
0.48GlnTrp: 0.48 ± 0.163
1.679GlnTyr: 1.679 ± 0.447
0.0GlnXaa: 0.0 ± 0.0
Arg
4.956ArgAla: 4.956 ± 0.616
0.799ArgCys: 0.799 ± 0.258
3.917ArgAsp: 3.917 ± 0.593
4.317ArgGlu: 4.317 ± 0.661
1.999ArgPhe: 1.999 ± 0.345
3.358ArgGly: 3.358 ± 0.492
1.039ArgHis: 1.039 ± 0.296
4.397ArgIle: 4.397 ± 0.593
4.317ArgLys: 4.317 ± 0.609
4.397ArgLeu: 4.397 ± 0.58
1.759ArgMet: 1.759 ± 0.4
3.038ArgAsn: 3.038 ± 0.532
1.999ArgPro: 1.999 ± 0.512
2.638ArgGln: 2.638 ± 0.441
3.038ArgArg: 3.038 ± 0.612
2.798ArgSer: 2.798 ± 0.533
3.278ArgThr: 3.278 ± 0.503
3.517ArgVal: 3.517 ± 0.632
1.519ArgTrp: 1.519 ± 0.327
2.158ArgTyr: 2.158 ± 0.453
0.0ArgXaa: 0.0 ± 0.0
Ser
5.676SerAla: 5.676 ± 0.628
0.48SerCys: 0.48 ± 0.182
4.157SerAsp: 4.157 ± 0.63
4.157SerGlu: 4.157 ± 0.428
2.558SerPhe: 2.558 ± 0.406
6.715SerGly: 6.715 ± 0.823
0.799SerHis: 0.799 ± 0.293
3.358SerIle: 3.358 ± 0.65
3.278SerLys: 3.278 ± 0.653
6.315SerLeu: 6.315 ± 0.731
1.839SerMet: 1.839 ± 0.417
2.878SerAsn: 2.878 ± 0.393
3.118SerPro: 3.118 ± 0.471
3.597SerGln: 3.597 ± 0.577
3.757SerArg: 3.757 ± 0.505
4.477SerSer: 4.477 ± 0.821
3.118SerThr: 3.118 ± 0.517
4.876SerVal: 4.876 ± 0.74
1.759SerTrp: 1.759 ± 0.318
1.519SerTyr: 1.519 ± 0.319
0.0SerXaa: 0.0 ± 0.0
Thr
6.236ThrAla: 6.236 ± 0.782
0.719ThrCys: 0.719 ± 0.257
2.798ThrAsp: 2.798 ± 0.506
3.278ThrGlu: 3.278 ± 0.495
2.558ThrPhe: 2.558 ± 0.485
5.436ThrGly: 5.436 ± 0.826
0.959ThrHis: 0.959 ± 0.304
3.917ThrIle: 3.917 ± 0.512
2.718ThrLys: 2.718 ± 0.44
4.077ThrLeu: 4.077 ± 0.524
1.039ThrMet: 1.039 ± 0.269
2.398ThrAsn: 2.398 ± 0.487
2.558ThrPro: 2.558 ± 0.481
2.158ThrGln: 2.158 ± 0.447
2.798ThrArg: 2.798 ± 0.438
3.677ThrSer: 3.677 ± 0.504
3.198ThrThr: 3.198 ± 0.595
4.557ThrVal: 4.557 ± 0.682
1.199ThrTrp: 1.199 ± 0.329
1.679ThrTyr: 1.679 ± 0.441
0.0ThrXaa: 0.0 ± 0.0
Val
5.116ValAla: 5.116 ± 0.576
0.799ValCys: 0.799 ± 0.244
3.597ValAsp: 3.597 ± 0.432
4.077ValGlu: 4.077 ± 0.55
2.318ValPhe: 2.318 ± 0.33
4.397ValGly: 4.397 ± 0.818
0.56ValHis: 0.56 ± 0.207
4.876ValIle: 4.876 ± 0.629
3.757ValLys: 3.757 ± 0.548
4.237ValLeu: 4.237 ± 0.558
2.079ValMet: 2.079 ± 0.411
3.038ValAsn: 3.038 ± 0.462
1.999ValPro: 1.999 ± 0.423
1.679ValGln: 1.679 ± 0.338
3.517ValArg: 3.517 ± 0.523
5.196ValSer: 5.196 ± 0.664
4.397ValThr: 4.397 ± 0.638
3.677ValVal: 3.677 ± 0.625
1.279ValTrp: 1.279 ± 0.276
1.439ValTyr: 1.439 ± 0.286
0.0ValXaa: 0.0 ± 0.0
Trp
1.279TrpAla: 1.279 ± 0.32
0.64TrpCys: 0.64 ± 0.231
1.119TrpAsp: 1.119 ± 0.277
0.64TrpGlu: 0.64 ± 0.229
0.48TrpPhe: 0.48 ± 0.169
1.519TrpGly: 1.519 ± 0.309
0.799TrpHis: 0.799 ± 0.259
0.959TrpIle: 0.959 ± 0.265
0.56TrpLys: 0.56 ± 0.217
1.759TrpLeu: 1.759 ± 0.474
0.56TrpMet: 0.56 ± 0.178
0.719TrpAsn: 0.719 ± 0.211
0.4TrpPro: 0.4 ± 0.163
0.879TrpGln: 0.879 ± 0.237
1.439TrpArg: 1.439 ± 0.301
0.879TrpSer: 0.879 ± 0.283
1.279TrpThr: 1.279 ± 0.37
1.119TrpVal: 1.119 ± 0.27
0.4TrpTrp: 0.4 ± 0.168
0.32TrpTyr: 0.32 ± 0.129
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.558TyrAla: 2.558 ± 0.446
0.4TyrCys: 0.4 ± 0.171
1.359TyrAsp: 1.359 ± 0.246
1.359TyrGlu: 1.359 ± 0.296
0.799TyrPhe: 0.799 ± 0.296
2.558TyrGly: 2.558 ± 0.505
0.64TyrHis: 0.64 ± 0.247
1.839TyrIle: 1.839 ± 0.442
0.879TyrLys: 0.879 ± 0.226
2.318TyrLeu: 2.318 ± 0.429
0.4TyrMet: 0.4 ± 0.191
1.039TyrAsn: 1.039 ± 0.227
1.599TyrPro: 1.599 ± 0.437
1.279TyrGln: 1.279 ± 0.29
2.718TyrArg: 2.718 ± 0.433
1.999TyrSer: 1.999 ± 0.481
1.439TyrThr: 1.439 ± 0.437
1.599TyrVal: 1.599 ± 0.341
0.799TyrTrp: 0.799 ± 0.258
0.64TyrTyr: 0.64 ± 0.201
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (12510 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski