Amino acid dipepetide frequency for Enterobacteria phage T7M

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.267AlaAla: 10.267 ± 1.134
0.674AlaCys: 0.674 ± 0.229
5.845AlaAsp: 5.845 ± 0.71
5.92AlaGlu: 5.92 ± 0.579
3.147AlaPhe: 3.147 ± 0.569
7.944AlaGly: 7.944 ± 0.785
1.424AlaHis: 1.424 ± 0.218
4.796AlaIle: 4.796 ± 0.443
6.22AlaLys: 6.22 ± 0.734
8.993AlaLeu: 8.993 ± 0.798
3.447AlaMet: 3.447 ± 0.527
4.122AlaAsn: 4.122 ± 0.629
2.698AlaPro: 2.698 ± 0.481
3.597AlaGln: 3.597 ± 0.546
5.396AlaArg: 5.396 ± 0.554
4.871AlaSer: 4.871 ± 0.69
4.122AlaThr: 4.122 ± 0.64
5.321AlaVal: 5.321 ± 0.765
1.948AlaTrp: 1.948 ± 0.316
2.548AlaTyr: 2.548 ± 0.386
0.0AlaXaa: 0.0 ± 0.0
Cys
0.749CysAla: 0.749 ± 0.292
0.225CysCys: 0.225 ± 0.132
0.974CysAsp: 0.974 ± 0.387
0.45CysGlu: 0.45 ± 0.21
0.6CysPhe: 0.6 ± 0.248
0.824CysGly: 0.824 ± 0.226
0.6CysHis: 0.6 ± 0.239
0.674CysIle: 0.674 ± 0.279
0.899CysLys: 0.899 ± 0.269
1.199CysLeu: 1.199 ± 0.429
0.075CysMet: 0.075 ± 0.071
0.3CysAsn: 0.3 ± 0.136
0.3CysPro: 0.3 ± 0.185
0.6CysGln: 0.6 ± 0.247
0.6CysArg: 0.6 ± 0.228
0.6CysSer: 0.6 ± 0.256
0.375CysThr: 0.375 ± 0.229
0.6CysVal: 0.6 ± 0.237
0.225CysTrp: 0.225 ± 0.161
0.525CysTyr: 0.525 ± 0.198
0.0CysXaa: 0.0 ± 0.0
Asp
5.621AspAla: 5.621 ± 0.512
0.899AspCys: 0.899 ± 0.279
4.272AspAsp: 4.272 ± 0.556
4.272AspGlu: 4.272 ± 0.585
2.548AspPhe: 2.548 ± 0.337
7.044AspGly: 7.044 ± 0.734
1.124AspHis: 1.124 ± 0.253
2.698AspIle: 2.698 ± 0.441
4.721AspLys: 4.721 ± 0.536
3.522AspLeu: 3.522 ± 0.679
2.023AspMet: 2.023 ± 0.493
2.698AspAsn: 2.698 ± 0.327
3.222AspPro: 3.222 ± 0.575
1.874AspGln: 1.874 ± 0.485
2.998AspArg: 2.998 ± 0.684
3.672AspSer: 3.672 ± 0.666
3.672AspThr: 3.672 ± 0.553
4.197AspVal: 4.197 ± 0.471
0.45AspTrp: 0.45 ± 0.214
1.799AspTyr: 1.799 ± 0.323
0.0AspXaa: 0.0 ± 0.0
Glu
7.269GluAla: 7.269 ± 0.718
0.674GluCys: 0.674 ± 0.209
4.721GluAsp: 4.721 ± 0.673
5.096GluGlu: 5.096 ± 0.6
2.398GluPhe: 2.398 ± 0.401
4.421GluGly: 4.421 ± 0.611
1.424GluHis: 1.424 ± 0.329
3.447GluIle: 3.447 ± 0.529
4.272GluLys: 4.272 ± 0.626
5.995GluLeu: 5.995 ± 0.724
2.173GluMet: 2.173 ± 0.463
2.248GluAsn: 2.248 ± 0.367
1.649GluPro: 1.649 ± 0.328
4.197GluGln: 4.197 ± 0.799
4.197GluArg: 4.197 ± 0.48
4.347GluSer: 4.347 ± 0.452
2.623GluThr: 2.623 ± 0.488
4.197GluVal: 4.197 ± 0.472
1.499GluTrp: 1.499 ± 0.363
2.698GluTyr: 2.698 ± 0.442
0.0GluXaa: 0.0 ± 0.0
Phe
2.398PheAla: 2.398 ± 0.358
0.375PheCys: 0.375 ± 0.21
3.073PheAsp: 3.073 ± 0.557
2.398PheGlu: 2.398 ± 0.38
1.124PhePhe: 1.124 ± 0.29
3.073PheGly: 3.073 ± 0.375
1.049PheHis: 1.049 ± 0.368
1.874PheIle: 1.874 ± 0.404
2.773PheLys: 2.773 ± 0.48
2.998PheLeu: 2.998 ± 0.385
1.124PheMet: 1.124 ± 0.209
1.499PheAsn: 1.499 ± 0.393
1.574PhePro: 1.574 ± 0.392
1.349PheGln: 1.349 ± 0.259
1.799PheArg: 1.799 ± 0.353
1.574PheSer: 1.574 ± 0.421
2.848PheThr: 2.848 ± 0.441
1.724PheVal: 1.724 ± 0.353
0.15PheTrp: 0.15 ± 0.101
1.199PheTyr: 1.199 ± 0.312
0.0PheXaa: 0.0 ± 0.0
Gly
7.044GlyAla: 7.044 ± 0.976
1.274GlyCys: 1.274 ± 0.414
5.171GlyAsp: 5.171 ± 0.623
5.246GlyGlu: 5.246 ± 0.574
3.522GlyPhe: 3.522 ± 0.618
5.096GlyGly: 5.096 ± 0.777
1.274GlyHis: 1.274 ± 0.318
4.421GlyIle: 4.421 ± 0.621
5.845GlyLys: 5.845 ± 0.858
6.82GlyLeu: 6.82 ± 0.745
2.098GlyMet: 2.098 ± 0.332
2.848GlyAsn: 2.848 ± 0.529
0.899GlyPro: 0.899 ± 0.347
3.147GlyGln: 3.147 ± 0.43
4.496GlyArg: 4.496 ± 0.579
4.571GlySer: 4.571 ± 0.77
2.998GlyThr: 2.998 ± 0.507
4.347GlyVal: 4.347 ± 0.459
1.499GlyTrp: 1.499 ± 0.411
2.548GlyTyr: 2.548 ± 0.45
0.0GlyXaa: 0.0 ± 0.0
His
1.724HisAla: 1.724 ± 0.439
0.225HisCys: 0.225 ± 0.133
0.749HisAsp: 0.749 ± 0.201
1.199HisGlu: 1.199 ± 0.32
0.824HisPhe: 0.824 ± 0.224
1.499HisGly: 1.499 ± 0.249
0.6HisHis: 0.6 ± 0.208
1.499HisIle: 1.499 ± 0.31
1.124HisLys: 1.124 ± 0.291
2.248HisLeu: 2.248 ± 0.434
0.749HisMet: 0.749 ± 0.216
0.674HisAsn: 0.674 ± 0.193
0.45HisPro: 0.45 ± 0.204
0.075HisGln: 0.075 ± 0.079
1.049HisArg: 1.049 ± 0.265
1.424HisSer: 1.424 ± 0.347
0.824HisThr: 0.824 ± 0.233
1.349HisVal: 1.349 ± 0.271
0.525HisTrp: 0.525 ± 0.172
0.6HisTyr: 0.6 ± 0.2
0.0HisXaa: 0.0 ± 0.0
Ile
4.197IleAla: 4.197 ± 0.475
0.674IleCys: 0.674 ± 0.205
3.597IleAsp: 3.597 ± 0.423
3.597IleGlu: 3.597 ± 0.688
1.199IlePhe: 1.199 ± 0.287
3.222IleGly: 3.222 ± 0.428
1.499IleHis: 1.499 ± 0.37
2.998IleIle: 2.998 ± 0.319
3.522IleLys: 3.522 ± 0.528
3.597IleLeu: 3.597 ± 0.502
0.824IleMet: 0.824 ± 0.227
2.698IleAsn: 2.698 ± 0.638
2.848IlePro: 2.848 ± 0.443
2.098IleGln: 2.098 ± 0.471
2.698IleArg: 2.698 ± 0.447
2.923IleSer: 2.923 ± 0.436
1.948IleThr: 1.948 ± 0.383
3.147IleVal: 3.147 ± 0.381
0.6IleTrp: 0.6 ± 0.201
1.724IleTyr: 1.724 ± 0.277
0.0IleXaa: 0.0 ± 0.0
Lys
7.644LysAla: 7.644 ± 0.876
0.45LysCys: 0.45 ± 0.163
3.147LysAsp: 3.147 ± 0.325
5.171LysGlu: 5.171 ± 0.754
2.473LysPhe: 2.473 ± 0.473
4.721LysGly: 4.721 ± 0.56
1.874LysHis: 1.874 ± 0.38
1.649LysIle: 1.649 ± 0.295
4.796LysLys: 4.796 ± 0.909
5.321LysLeu: 5.321 ± 0.76
1.799LysMet: 1.799 ± 0.39
2.173LysAsn: 2.173 ± 0.34
3.297LysPro: 3.297 ± 0.555
2.023LysGln: 2.023 ± 0.544
4.496LysArg: 4.496 ± 0.612
3.672LysSer: 3.672 ± 0.601
3.522LysThr: 3.522 ± 0.531
5.546LysVal: 5.546 ± 0.582
0.974LysTrp: 0.974 ± 0.277
2.023LysTyr: 2.023 ± 0.35
0.0LysXaa: 0.0 ± 0.0
Leu
8.243LeuAla: 8.243 ± 1.31
0.45LeuCys: 0.45 ± 0.209
4.871LeuAsp: 4.871 ± 0.471
6.295LeuGlu: 6.295 ± 0.727
2.548LeuPhe: 2.548 ± 0.516
4.496LeuGly: 4.496 ± 0.692
0.824LeuHis: 0.824 ± 0.294
3.822LeuIle: 3.822 ± 0.565
5.995LeuLys: 5.995 ± 0.687
4.946LeuLeu: 4.946 ± 0.746
2.548LeuMet: 2.548 ± 0.313
4.122LeuAsn: 4.122 ± 0.475
3.672LeuPro: 3.672 ± 0.477
3.747LeuGln: 3.747 ± 0.528
6.37LeuArg: 6.37 ± 0.702
4.796LeuSer: 4.796 ± 0.587
5.845LeuThr: 5.845 ± 0.821
5.396LeuVal: 5.396 ± 0.576
1.274LeuTrp: 1.274 ± 0.362
2.548LeuTyr: 2.548 ± 0.44
0.0LeuXaa: 0.0 ± 0.0
Met
2.998MetAla: 2.998 ± 0.401
0.225MetCys: 0.225 ± 0.14
2.023MetAsp: 2.023 ± 0.459
1.874MetGlu: 1.874 ± 0.531
0.974MetPhe: 0.974 ± 0.245
2.623MetGly: 2.623 ± 0.502
0.45MetHis: 0.45 ± 0.192
1.349MetIle: 1.349 ± 0.273
1.124MetLys: 1.124 ± 0.301
3.672MetLeu: 3.672 ± 0.515
1.124MetMet: 1.124 ± 0.303
1.499MetAsn: 1.499 ± 0.258
1.724MetPro: 1.724 ± 0.29
1.124MetGln: 1.124 ± 0.352
1.349MetArg: 1.349 ± 0.277
1.349MetSer: 1.349 ± 0.293
1.874MetThr: 1.874 ± 0.385
2.173MetVal: 2.173 ± 0.364
0.075MetTrp: 0.075 ± 0.071
0.6MetTyr: 0.6 ± 0.243
0.0MetXaa: 0.0 ± 0.0
Asn
3.522AsnAla: 3.522 ± 0.448
0.525AsnCys: 0.525 ± 0.258
2.323AsnAsp: 2.323 ± 0.443
2.773AsnGlu: 2.773 ± 0.372
1.574AsnPhe: 1.574 ± 0.309
3.972AsnGly: 3.972 ± 0.691
0.375AsnHis: 0.375 ± 0.175
2.473AsnIle: 2.473 ± 0.524
2.098AsnLys: 2.098 ± 0.311
3.372AsnLeu: 3.372 ± 0.444
0.899AsnMet: 0.899 ± 0.27
1.948AsnAsn: 1.948 ± 0.438
2.248AsnPro: 2.248 ± 0.433
2.173AsnGln: 2.173 ± 0.393
2.398AsnArg: 2.398 ± 0.576
2.398AsnSer: 2.398 ± 0.634
2.098AsnThr: 2.098 ± 0.395
2.623AsnVal: 2.623 ± 0.57
0.6AsnTrp: 0.6 ± 0.206
2.023AsnTyr: 2.023 ± 0.491
0.0AsnXaa: 0.0 ± 0.0
Pro
2.848ProAla: 2.848 ± 0.393
0.674ProCys: 0.674 ± 0.264
3.147ProAsp: 3.147 ± 0.432
3.147ProGlu: 3.147 ± 0.661
1.199ProPhe: 1.199 ± 0.288
1.649ProGly: 1.649 ± 0.34
0.6ProHis: 0.6 ± 0.188
1.649ProIle: 1.649 ± 0.381
2.773ProLys: 2.773 ± 0.467
2.323ProLeu: 2.323 ± 0.511
1.124ProMet: 1.124 ± 0.324
2.248ProAsn: 2.248 ± 0.474
0.899ProPro: 0.899 ± 0.324
1.124ProGln: 1.124 ± 0.304
1.948ProArg: 1.948 ± 0.403
2.023ProSer: 2.023 ± 0.308
2.398ProThr: 2.398 ± 0.419
2.248ProVal: 2.248 ± 0.38
0.749ProTrp: 0.749 ± 0.161
1.349ProTyr: 1.349 ± 0.354
0.0ProXaa: 0.0 ± 0.0
Gln
3.747GlnAla: 3.747 ± 0.648
0.525GlnCys: 0.525 ± 0.227
2.248GlnAsp: 2.248 ± 0.414
3.297GlnGlu: 3.297 ± 0.554
1.799GlnPhe: 1.799 ± 0.3
2.698GlnGly: 2.698 ± 0.326
0.674GlnHis: 0.674 ± 0.191
1.799GlnIle: 1.799 ± 0.436
2.098GlnLys: 2.098 ± 0.303
3.822GlnLeu: 3.822 ± 0.435
1.124GlnMet: 1.124 ± 0.271
1.124GlnAsn: 1.124 ± 0.287
1.199GlnPro: 1.199 ± 0.393
1.874GlnGln: 1.874 ± 0.281
2.023GlnArg: 2.023 ± 0.434
2.323GlnSer: 2.323 ± 0.442
2.098GlnThr: 2.098 ± 0.418
2.548GlnVal: 2.548 ± 0.504
0.899GlnTrp: 0.899 ± 0.251
1.274GlnTyr: 1.274 ± 0.379
0.0GlnXaa: 0.0 ± 0.0
Arg
5.171ArgAla: 5.171 ± 0.623
1.049ArgCys: 1.049 ± 0.301
3.897ArgAsp: 3.897 ± 0.573
4.946ArgGlu: 4.946 ± 0.74
2.773ArgPhe: 2.773 ± 0.485
3.747ArgGly: 3.747 ± 0.4
1.499ArgHis: 1.499 ± 0.365
2.623ArgIle: 2.623 ± 0.496
3.522ArgLys: 3.522 ± 0.564
5.321ArgLeu: 5.321 ± 0.564
2.098ArgMet: 2.098 ± 0.488
2.623ArgAsn: 2.623 ± 0.548
1.124ArgPro: 1.124 ± 0.264
1.874ArgGln: 1.874 ± 0.423
3.147ArgArg: 3.147 ± 0.443
4.197ArgSer: 4.197 ± 0.598
3.147ArgThr: 3.147 ± 0.458
3.672ArgVal: 3.672 ± 0.539
0.974ArgTrp: 0.974 ± 0.283
1.199ArgTyr: 1.199 ± 0.164
0.0ArgXaa: 0.0 ± 0.0
Ser
6.07SerAla: 6.07 ± 0.655
0.824SerCys: 0.824 ± 0.325
4.946SerAsp: 4.946 ± 0.798
2.698SerGlu: 2.698 ± 0.356
2.173SerPhe: 2.173 ± 0.4
4.796SerGly: 4.796 ± 0.58
1.124SerHis: 1.124 ± 0.279
3.297SerIle: 3.297 ± 0.609
3.222SerLys: 3.222 ± 0.597
3.972SerLeu: 3.972 ± 0.55
1.424SerMet: 1.424 ± 0.366
2.173SerAsn: 2.173 ± 0.321
1.574SerPro: 1.574 ± 0.352
2.398SerGln: 2.398 ± 0.376
3.672SerArg: 3.672 ± 0.594
3.672SerSer: 3.672 ± 0.605
3.672SerThr: 3.672 ± 0.572
4.122SerVal: 4.122 ± 0.503
0.749SerTrp: 0.749 ± 0.175
2.248SerTyr: 2.248 ± 0.492
0.0SerXaa: 0.0 ± 0.0
Thr
4.571ThrAla: 4.571 ± 0.77
0.6ThrCys: 0.6 ± 0.235
2.548ThrAsp: 2.548 ± 0.412
3.747ThrGlu: 3.747 ± 0.579
1.948ThrPhe: 1.948 ± 0.37
5.396ThrGly: 5.396 ± 0.871
1.199ThrHis: 1.199 ± 0.264
3.222ThrIle: 3.222 ± 0.486
4.047ThrLys: 4.047 ± 0.418
5.246ThrLeu: 5.246 ± 0.632
2.098ThrMet: 2.098 ± 0.341
1.799ThrAsn: 1.799 ± 0.346
2.848ThrPro: 2.848 ± 0.523
1.574ThrGln: 1.574 ± 0.324
2.023ThrArg: 2.023 ± 0.294
3.897ThrSer: 3.897 ± 0.574
2.998ThrThr: 2.998 ± 0.448
3.222ThrVal: 3.222 ± 0.488
0.45ThrTrp: 0.45 ± 0.253
1.274ThrTyr: 1.274 ± 0.286
0.0ThrXaa: 0.0 ± 0.0
Val
5.471ValAla: 5.471 ± 0.519
0.375ValCys: 0.375 ± 0.166
3.147ValAsp: 3.147 ± 0.432
4.347ValGlu: 4.347 ± 0.651
1.874ValPhe: 1.874 ± 0.394
4.646ValGly: 4.646 ± 0.654
1.049ValHis: 1.049 ± 0.238
3.073ValIle: 3.073 ± 0.442
4.796ValLys: 4.796 ± 0.709
5.021ValLeu: 5.021 ± 0.687
1.948ValMet: 1.948 ± 0.361
3.073ValAsn: 3.073 ± 0.517
2.848ValPro: 2.848 ± 0.386
2.098ValGln: 2.098 ± 0.369
5.021ValArg: 5.021 ± 0.58
3.372ValSer: 3.372 ± 0.595
4.571ValThr: 4.571 ± 0.399
5.621ValVal: 5.621 ± 0.738
1.499ValTrp: 1.499 ± 0.319
1.948ValTyr: 1.948 ± 0.506
0.0ValXaa: 0.0 ± 0.0
Trp
0.674TrpAla: 0.674 ± 0.221
0.45TrpCys: 0.45 ± 0.195
0.525TrpAsp: 0.525 ± 0.189
1.124TrpGlu: 1.124 ± 0.256
0.3TrpPhe: 0.3 ± 0.171
1.049TrpGly: 1.049 ± 0.324
0.3TrpHis: 0.3 ± 0.198
0.674TrpIle: 0.674 ± 0.328
1.424TrpLys: 1.424 ± 0.379
1.724TrpLeu: 1.724 ± 0.405
0.525TrpMet: 0.525 ± 0.188
1.424TrpAsn: 1.424 ± 0.34
0.3TrpPro: 0.3 ± 0.153
0.674TrpGln: 0.674 ± 0.209
0.749TrpArg: 0.749 ± 0.2
1.049TrpSer: 1.049 ± 0.327
0.974TrpThr: 0.974 ± 0.245
1.499TrpVal: 1.499 ± 0.368
0.375TrpTrp: 0.375 ± 0.161
0.3TrpTyr: 0.3 ± 0.147
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.073TyrAla: 3.073 ± 0.471
0.225TyrCys: 0.225 ± 0.148
2.098TyrAsp: 2.098 ± 0.481
1.948TyrGlu: 1.948 ± 0.398
0.974TyrPhe: 0.974 ± 0.299
2.248TyrGly: 2.248 ± 0.443
0.45TyrHis: 0.45 ± 0.188
1.574TyrIle: 1.574 ± 0.427
1.499TyrLys: 1.499 ± 0.343
2.698TyrLeu: 2.698 ± 0.413
0.899TyrMet: 0.899 ± 0.232
1.124TyrAsn: 1.124 ± 0.226
0.749TyrPro: 0.749 ± 0.187
1.574TyrGln: 1.574 ± 0.363
2.173TyrArg: 2.173 ± 0.429
2.098TyrSer: 2.098 ± 0.331
2.098TyrThr: 2.098 ± 0.374
2.248TyrVal: 2.248 ± 0.413
0.674TyrTrp: 0.674 ± 0.25
0.3TyrTyr: 0.3 ± 0.15
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (13345 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski