Amino acid dipepetide frequency for Escherichia phage 2725-N35

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.64AlaAla: 8.64 ± 1.116
0.521AlaCys: 0.521 ± 0.202
4.245AlaAsp: 4.245 ± 0.544
5.214AlaGlu: 5.214 ± 0.661
2.979AlaPhe: 2.979 ± 0.409
6.182AlaGly: 6.182 ± 0.741
1.192AlaHis: 1.192 ± 0.342
6.703AlaIle: 6.703 ± 0.644
6.182AlaLys: 6.182 ± 0.989
7.225AlaLeu: 7.225 ± 0.902
2.607AlaMet: 2.607 ± 0.491
4.767AlaAsn: 4.767 ± 0.773
2.011AlaPro: 2.011 ± 0.344
3.426AlaGln: 3.426 ± 0.497
4.767AlaArg: 4.767 ± 0.718
5.586AlaSer: 5.586 ± 0.768
4.767AlaThr: 4.767 ± 0.711
5.363AlaVal: 5.363 ± 0.646
1.192AlaTrp: 1.192 ± 0.291
2.383AlaTyr: 2.383 ± 0.478
0.0AlaXaa: 0.0 ± 0.0
Cys
0.596CysAla: 0.596 ± 0.241
0.223CysCys: 0.223 ± 0.117
0.819CysAsp: 0.819 ± 0.229
0.819CysGlu: 0.819 ± 0.236
0.372CysPhe: 0.372 ± 0.193
1.266CysGly: 1.266 ± 0.424
0.298CysHis: 0.298 ± 0.134
0.521CysIle: 0.521 ± 0.186
0.67CysLys: 0.67 ± 0.229
0.894CysLeu: 0.894 ± 0.252
0.596CysMet: 0.596 ± 0.213
0.447CysAsn: 0.447 ± 0.186
0.298CysPro: 0.298 ± 0.137
0.223CysGln: 0.223 ± 0.118
1.043CysArg: 1.043 ± 0.325
1.043CysSer: 1.043 ± 0.331
0.819CysThr: 0.819 ± 0.235
0.67CysVal: 0.67 ± 0.203
0.149CysTrp: 0.149 ± 0.114
0.372CysTyr: 0.372 ± 0.193
0.0CysXaa: 0.0 ± 0.0
Asp
5.437AspAla: 5.437 ± 0.651
0.596AspCys: 0.596 ± 0.225
3.203AspAsp: 3.203 ± 0.662
3.799AspGlu: 3.799 ± 0.521
2.234AspPhe: 2.234 ± 0.452
7.597AspGly: 7.597 ± 0.746
1.192AspHis: 1.192 ± 0.414
3.65AspIle: 3.65 ± 0.44
3.799AspLys: 3.799 ± 0.489
4.022AspLeu: 4.022 ± 0.576
1.266AspMet: 1.266 ± 0.354
2.607AspAsn: 2.607 ± 0.376
1.415AspPro: 1.415 ± 0.341
1.192AspGln: 1.192 ± 0.258
2.16AspArg: 2.16 ± 0.518
3.352AspSer: 3.352 ± 0.504
3.426AspThr: 3.426 ± 0.529
4.618AspVal: 4.618 ± 0.482
1.192AspTrp: 1.192 ± 0.309
3.352AspTyr: 3.352 ± 0.522
0.0AspXaa: 0.0 ± 0.0
Glu
6.108GluAla: 6.108 ± 0.613
0.894GluCys: 0.894 ± 0.287
3.203GluAsp: 3.203 ± 0.451
4.171GluGlu: 4.171 ± 0.724
3.873GluPhe: 3.873 ± 0.497
3.203GluGly: 3.203 ± 0.435
0.298GluHis: 0.298 ± 0.15
5.288GluIle: 5.288 ± 0.599
3.799GluLys: 3.799 ± 0.731
5.363GluLeu: 5.363 ± 0.631
2.607GluMet: 2.607 ± 0.615
3.724GluAsn: 3.724 ± 0.471
1.862GluPro: 1.862 ± 0.343
2.905GluGln: 2.905 ± 0.647
2.979GluArg: 2.979 ± 0.493
4.692GluSer: 4.692 ± 0.623
3.352GluThr: 3.352 ± 0.47
5.288GluVal: 5.288 ± 0.733
0.596GluTrp: 0.596 ± 0.186
2.681GluTyr: 2.681 ± 0.492
0.0GluXaa: 0.0 ± 0.0
Phe
2.16PheAla: 2.16 ± 0.412
0.67PheCys: 0.67 ± 0.2
3.65PheAsp: 3.65 ± 0.523
3.426PheGlu: 3.426 ± 0.613
1.266PhePhe: 1.266 ± 0.294
3.426PheGly: 3.426 ± 0.446
0.67PheHis: 0.67 ± 0.241
2.532PheIle: 2.532 ± 0.45
2.83PheLys: 2.83 ± 0.589
1.937PheLeu: 1.937 ± 0.36
0.819PheMet: 0.819 ± 0.239
2.086PheAsn: 2.086 ± 0.353
1.043PhePro: 1.043 ± 0.272
1.937PheGln: 1.937 ± 0.353
2.16PheArg: 2.16 ± 0.519
2.458PheSer: 2.458 ± 0.437
2.383PheThr: 2.383 ± 0.356
2.756PheVal: 2.756 ± 0.476
0.298PheTrp: 0.298 ± 0.138
1.341PheTyr: 1.341 ± 0.305
0.0PheXaa: 0.0 ± 0.0
Gly
5.884GlyAla: 5.884 ± 0.79
1.564GlyCys: 1.564 ± 0.422
3.873GlyAsp: 3.873 ± 0.548
4.692GlyGlu: 4.692 ± 0.708
2.83GlyPhe: 2.83 ± 0.455
6.182GlyGly: 6.182 ± 0.985
0.67GlyHis: 0.67 ± 0.258
5.214GlyIle: 5.214 ± 0.628
5.586GlyLys: 5.586 ± 0.651
6.554GlyLeu: 6.554 ± 0.682
2.011GlyMet: 2.011 ± 0.447
3.799GlyAsn: 3.799 ± 0.563
0.67GlyPro: 0.67 ± 0.172
2.234GlyGln: 2.234 ± 0.392
3.501GlyArg: 3.501 ± 0.432
5.512GlySer: 5.512 ± 0.675
4.32GlyThr: 4.32 ± 0.797
5.884GlyVal: 5.884 ± 0.599
0.968GlyTrp: 0.968 ± 0.293
4.022GlyTyr: 4.022 ± 0.563
0.0GlyXaa: 0.0 ± 0.0
His
1.043HisAla: 1.043 ± 0.292
0.298HisCys: 0.298 ± 0.135
0.819HisAsp: 0.819 ± 0.228
0.521HisGlu: 0.521 ± 0.21
0.819HisPhe: 0.819 ± 0.243
1.043HisGly: 1.043 ± 0.321
0.223HisHis: 0.223 ± 0.158
1.117HisIle: 1.117 ± 0.331
1.043HisLys: 1.043 ± 0.254
0.819HisLeu: 0.819 ± 0.28
0.298HisMet: 0.298 ± 0.19
0.521HisAsn: 0.521 ± 0.184
0.298HisPro: 0.298 ± 0.14
0.372HisGln: 0.372 ± 0.141
0.67HisArg: 0.67 ± 0.219
0.521HisSer: 0.521 ± 0.191
0.745HisThr: 0.745 ± 0.238
1.117HisVal: 1.117 ± 0.308
0.149HisTrp: 0.149 ± 0.107
0.372HisTyr: 0.372 ± 0.166
0.0HisXaa: 0.0 ± 0.0
Ile
6.033IleAla: 6.033 ± 0.773
0.447IleCys: 0.447 ± 0.167
6.108IleAsp: 6.108 ± 0.533
3.948IleGlu: 3.948 ± 0.498
1.49IlePhe: 1.49 ± 0.254
4.32IleGly: 4.32 ± 0.66
0.894IleHis: 0.894 ± 0.241
3.575IleIle: 3.575 ± 0.527
5.288IleLys: 5.288 ± 0.605
3.575IleLeu: 3.575 ± 0.491
1.639IleMet: 1.639 ± 0.452
4.32IleAsn: 4.32 ± 0.706
2.458IlePro: 2.458 ± 0.517
2.309IleGln: 2.309 ± 0.539
2.681IleArg: 2.681 ± 0.353
5.139IleSer: 5.139 ± 0.608
5.065IleThr: 5.065 ± 0.526
4.097IleVal: 4.097 ± 0.555
0.819IleTrp: 0.819 ± 0.201
3.128IleTyr: 3.128 ± 0.588
0.0IleXaa: 0.0 ± 0.0
Lys
6.182LysAla: 6.182 ± 0.801
0.447LysCys: 0.447 ± 0.185
3.948LysAsp: 3.948 ± 0.526
5.661LysGlu: 5.661 ± 0.842
2.905LysPhe: 2.905 ± 0.532
3.426LysGly: 3.426 ± 0.47
0.968LysHis: 0.968 ± 0.261
3.575LysIle: 3.575 ± 0.401
3.948LysLys: 3.948 ± 0.599
4.692LysLeu: 4.692 ± 0.571
2.83LysMet: 2.83 ± 0.571
3.277LysAsn: 3.277 ± 0.459
1.639LysPro: 1.639 ± 0.318
2.979LysGln: 2.979 ± 0.45
2.681LysArg: 2.681 ± 0.575
3.948LysSer: 3.948 ± 0.588
3.426LysThr: 3.426 ± 0.503
4.767LysVal: 4.767 ± 0.639
0.745LysTrp: 0.745 ± 0.204
3.128LysTyr: 3.128 ± 0.465
0.0LysXaa: 0.0 ± 0.0
Leu
6.48LeuAla: 6.48 ± 0.673
0.819LeuCys: 0.819 ± 0.253
4.097LeuAsp: 4.097 ± 0.463
4.245LeuGlu: 4.245 ± 0.565
2.234LeuPhe: 2.234 ± 0.348
4.171LeuGly: 4.171 ± 0.729
0.819LeuHis: 0.819 ± 0.303
4.99LeuIle: 4.99 ± 0.637
3.873LeuLys: 3.873 ± 0.511
3.724LeuLeu: 3.724 ± 0.43
1.49LeuMet: 1.49 ± 0.343
2.979LeuAsn: 2.979 ± 0.377
2.83LeuPro: 2.83 ± 0.411
2.979LeuGln: 2.979 ± 0.717
3.426LeuArg: 3.426 ± 0.457
5.884LeuSer: 5.884 ± 0.66
3.948LeuThr: 3.948 ± 0.612
5.735LeuVal: 5.735 ± 0.434
0.521LeuTrp: 0.521 ± 0.21
2.011LeuTyr: 2.011 ± 0.365
0.0LeuXaa: 0.0 ± 0.0
Met
3.277MetAla: 3.277 ± 0.449
0.298MetCys: 0.298 ± 0.161
1.415MetAsp: 1.415 ± 0.339
1.117MetGlu: 1.117 ± 0.261
1.043MetPhe: 1.043 ± 0.275
1.117MetGly: 1.117 ± 0.251
0.447MetHis: 0.447 ± 0.167
1.937MetIle: 1.937 ± 0.365
2.086MetLys: 2.086 ± 0.464
1.937MetLeu: 1.937 ± 0.45
0.521MetMet: 0.521 ± 0.244
1.192MetAsn: 1.192 ± 0.318
0.596MetPro: 0.596 ± 0.193
0.968MetGln: 0.968 ± 0.259
1.49MetArg: 1.49 ± 0.334
1.788MetSer: 1.788 ± 0.371
2.309MetThr: 2.309 ± 0.453
1.266MetVal: 1.266 ± 0.431
0.447MetTrp: 0.447 ± 0.172
0.447MetTyr: 0.447 ± 0.163
0.0MetXaa: 0.0 ± 0.0
Asn
3.65AsnAla: 3.65 ± 0.679
0.521AsnCys: 0.521 ± 0.22
3.054AsnAsp: 3.054 ± 0.373
3.799AsnGlu: 3.799 ± 0.524
2.234AsnPhe: 2.234 ± 0.443
6.257AsnGly: 6.257 ± 0.93
0.67AsnHis: 0.67 ± 0.224
3.054AsnIle: 3.054 ± 0.479
2.383AsnLys: 2.383 ± 0.417
3.352AsnLeu: 3.352 ± 0.635
0.894AsnMet: 0.894 ± 0.226
2.979AsnAsn: 2.979 ± 0.476
1.788AsnPro: 1.788 ± 0.26
2.532AsnGln: 2.532 ± 0.545
2.011AsnArg: 2.011 ± 0.351
4.245AsnSer: 4.245 ± 0.638
2.234AsnThr: 2.234 ± 0.347
4.097AsnVal: 4.097 ± 0.487
0.745AsnTrp: 0.745 ± 0.209
1.415AsnTyr: 1.415 ± 0.274
0.0AsnXaa: 0.0 ± 0.0
Pro
2.905ProAla: 2.905 ± 0.454
0.372ProCys: 0.372 ± 0.219
2.011ProAsp: 2.011 ± 0.479
2.979ProGlu: 2.979 ± 0.396
1.788ProPhe: 1.788 ± 0.346
1.937ProGly: 1.937 ± 0.381
0.372ProHis: 0.372 ± 0.163
1.713ProIle: 1.713 ± 0.308
1.043ProLys: 1.043 ± 0.313
1.564ProLeu: 1.564 ± 0.296
0.372ProMet: 0.372 ± 0.162
1.341ProAsn: 1.341 ± 0.31
0.596ProPro: 0.596 ± 0.223
1.341ProGln: 1.341 ± 0.367
1.49ProArg: 1.49 ± 0.333
1.564ProSer: 1.564 ± 0.31
1.564ProThr: 1.564 ± 0.289
3.352ProVal: 3.352 ± 0.484
0.298ProTrp: 0.298 ± 0.185
1.49ProTyr: 1.49 ± 0.364
0.0ProXaa: 0.0 ± 0.0
Gln
4.171GlnAla: 4.171 ± 0.861
0.372GlnCys: 0.372 ± 0.211
1.564GlnAsp: 1.564 ± 0.329
2.979GlnGlu: 2.979 ± 0.525
1.341GlnPhe: 1.341 ± 0.235
2.458GlnGly: 2.458 ± 0.388
0.223GlnHis: 0.223 ± 0.123
3.948GlnIle: 3.948 ± 0.473
2.16GlnLys: 2.16 ± 0.301
2.905GlnLeu: 2.905 ± 0.583
0.894GlnMet: 0.894 ± 0.247
1.564GlnAsn: 1.564 ± 0.409
1.341GlnPro: 1.341 ± 0.362
2.458GlnGln: 2.458 ± 0.759
1.266GlnArg: 1.266 ± 0.355
2.979GlnSer: 2.979 ± 0.407
2.011GlnThr: 2.011 ± 0.416
2.309GlnVal: 2.309 ± 0.364
0.447GlnTrp: 0.447 ± 0.201
1.415GlnTyr: 1.415 ± 0.391
0.0GlnXaa: 0.0 ± 0.0
Arg
4.32ArgAla: 4.32 ± 0.459
0.819ArgCys: 0.819 ± 0.283
1.713ArgAsp: 1.713 ± 0.329
3.203ArgGlu: 3.203 ± 0.457
2.756ArgPhe: 2.756 ± 0.383
2.383ArgGly: 2.383 ± 0.451
0.223ArgHis: 0.223 ± 0.127
3.575ArgIle: 3.575 ± 0.529
3.948ArgLys: 3.948 ± 0.525
3.277ArgLeu: 3.277 ± 0.493
1.49ArgMet: 1.49 ± 0.386
2.681ArgAsn: 2.681 ± 0.495
1.713ArgPro: 1.713 ± 0.407
1.415ArgGln: 1.415 ± 0.394
2.905ArgArg: 2.905 ± 0.447
2.383ArgSer: 2.383 ± 0.378
2.011ArgThr: 2.011 ± 0.463
3.873ArgVal: 3.873 ± 0.414
0.372ArgTrp: 0.372 ± 0.17
1.713ArgTyr: 1.713 ± 0.349
0.0ArgXaa: 0.0 ± 0.0
Ser
5.884SerAla: 5.884 ± 0.655
0.447SerCys: 0.447 ± 0.182
4.767SerAsp: 4.767 ± 0.615
5.065SerGlu: 5.065 ± 0.639
2.086SerPhe: 2.086 ± 0.467
6.778SerGly: 6.778 ± 0.63
1.266SerHis: 1.266 ± 0.295
4.32SerIle: 4.32 ± 0.528
3.799SerLys: 3.799 ± 0.498
5.288SerLeu: 5.288 ± 0.493
1.49SerMet: 1.49 ± 0.294
3.352SerAsn: 3.352 ± 0.51
2.607SerPro: 2.607 ± 0.38
2.383SerGln: 2.383 ± 0.411
2.979SerArg: 2.979 ± 0.463
5.214SerSer: 5.214 ± 0.87
4.394SerThr: 4.394 ± 0.651
5.065SerVal: 5.065 ± 0.751
0.596SerTrp: 0.596 ± 0.186
2.234SerTyr: 2.234 ± 0.409
0.0SerXaa: 0.0 ± 0.0
Thr
4.469ThrAla: 4.469 ± 0.603
0.745ThrCys: 0.745 ± 0.207
2.905ThrAsp: 2.905 ± 0.404
3.426ThrGlu: 3.426 ± 0.565
2.607ThrPhe: 2.607 ± 0.403
6.331ThrGly: 6.331 ± 0.771
0.521ThrHis: 0.521 ± 0.192
4.394ThrIle: 4.394 ± 0.518
3.352ThrLys: 3.352 ± 0.525
2.905ThrLeu: 2.905 ± 0.48
0.968ThrMet: 0.968 ± 0.25
3.054ThrAsn: 3.054 ± 0.48
2.532ThrPro: 2.532 ± 0.355
2.16ThrGln: 2.16 ± 0.365
2.086ThrArg: 2.086 ± 0.293
4.097ThrSer: 4.097 ± 0.596
3.501ThrThr: 3.501 ± 0.644
4.692ThrVal: 4.692 ± 0.656
0.67ThrTrp: 0.67 ± 0.207
2.756ThrTyr: 2.756 ± 0.407
0.0ThrXaa: 0.0 ± 0.0
Val
5.288ValAla: 5.288 ± 0.88
1.117ValCys: 1.117 ± 0.287
4.618ValAsp: 4.618 ± 0.574
4.543ValGlu: 4.543 ± 0.651
3.054ValPhe: 3.054 ± 0.411
4.32ValGly: 4.32 ± 0.549
1.043ValHis: 1.043 ± 0.24
4.394ValIle: 4.394 ± 0.555
5.586ValLys: 5.586 ± 0.702
3.575ValLeu: 3.575 ± 0.481
1.788ValMet: 1.788 ± 0.373
4.99ValAsn: 4.99 ± 0.581
2.458ValPro: 2.458 ± 0.343
2.756ValGln: 2.756 ± 0.539
4.171ValArg: 4.171 ± 0.47
5.81ValSer: 5.81 ± 0.732
4.99ValThr: 4.99 ± 0.678
6.033ValVal: 6.033 ± 0.757
0.894ValTrp: 0.894 ± 0.179
2.905ValTyr: 2.905 ± 0.54
0.0ValXaa: 0.0 ± 0.0
Trp
0.596TrpAla: 0.596 ± 0.183
0.298TrpCys: 0.298 ± 0.137
1.117TrpAsp: 1.117 ± 0.267
0.745TrpGlu: 0.745 ± 0.185
0.819TrpPhe: 0.819 ± 0.22
0.894TrpGly: 0.894 ± 0.245
0.223TrpHis: 0.223 ± 0.121
0.745TrpIle: 0.745 ± 0.232
1.341TrpLys: 1.341 ± 0.304
0.745TrpLeu: 0.745 ± 0.194
0.223TrpMet: 0.223 ± 0.14
0.298TrpAsn: 0.298 ± 0.115
0.447TrpPro: 0.447 ± 0.164
0.298TrpGln: 0.298 ± 0.13
0.596TrpArg: 0.596 ± 0.212
0.67TrpSer: 0.67 ± 0.331
0.447TrpThr: 0.447 ± 0.179
1.043TrpVal: 1.043 ± 0.25
0.074TrpTrp: 0.074 ± 0.074
0.223TrpTyr: 0.223 ± 0.122
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.83TyrAla: 2.83 ± 0.32
0.67TyrCys: 0.67 ± 0.261
2.979TyrAsp: 2.979 ± 0.465
2.532TyrGlu: 2.532 ± 0.539
1.117TyrPhe: 1.117 ± 0.317
2.681TyrGly: 2.681 ± 0.446
0.596TyrHis: 0.596 ± 0.203
2.086TyrIle: 2.086 ± 0.311
2.458TyrLys: 2.458 ± 0.497
2.979TyrLeu: 2.979 ± 0.359
0.819TyrMet: 0.819 ± 0.288
1.937TyrAsn: 1.937 ± 0.354
1.49TyrPro: 1.49 ± 0.31
1.937TyrGln: 1.937 ± 0.342
1.713TyrArg: 1.713 ± 0.324
3.128TyrSer: 3.128 ± 0.525
2.458TyrThr: 2.458 ± 0.369
2.309TyrVal: 2.309 ± 0.402
0.67TyrTrp: 0.67 ± 0.239
1.49TyrTyr: 1.49 ± 0.323
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 74 proteins (13427 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski