Amino acid dipepetide frequency for Escherichia phage vB_EcoS_PHB17

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.683AlaAla: 8.683 ± 1.313
0.668AlaCys: 0.668 ± 0.223
4.275AlaAsp: 4.275 ± 0.599
5.878AlaGlu: 5.878 ± 0.682
2.605AlaPhe: 2.605 ± 0.425
6.011AlaGly: 6.011 ± 0.576
1.069AlaHis: 1.069 ± 0.288
5.143AlaIle: 5.143 ± 0.742
7.347AlaLys: 7.347 ± 1.042
7.414AlaLeu: 7.414 ± 0.747
2.137AlaMet: 2.137 ± 0.343
3.206AlaAsn: 3.206 ± 0.603
1.336AlaPro: 1.336 ± 0.261
4.408AlaGln: 4.408 ± 0.816
4.876AlaArg: 4.876 ± 0.553
6.412AlaSer: 6.412 ± 0.837
4.074AlaThr: 4.074 ± 0.66
6.011AlaVal: 6.011 ± 0.701
1.069AlaTrp: 1.069 ± 0.249
3.273AlaTyr: 3.273 ± 0.461
0.0AlaXaa: 0.0 ± 0.0
Cys
0.935CysAla: 0.935 ± 0.288
0.2CysCys: 0.2 ± 0.112
1.002CysAsp: 1.002 ± 0.29
1.135CysGlu: 1.135 ± 0.291
0.534CysPhe: 0.534 ± 0.205
1.469CysGly: 1.469 ± 0.33
0.067CysHis: 0.067 ± 0.056
0.735CysIle: 0.735 ± 0.242
1.002CysLys: 1.002 ± 0.352
0.601CysLeu: 0.601 ± 0.21
0.334CysMet: 0.334 ± 0.144
0.668CysAsn: 0.668 ± 0.215
0.601CysPro: 0.601 ± 0.198
0.0CysGln: 0.0 ± 0.0
0.601CysArg: 0.601 ± 0.273
1.002CysSer: 1.002 ± 0.227
0.868CysThr: 0.868 ± 0.237
0.868CysVal: 0.868 ± 0.217
0.534CysTrp: 0.534 ± 0.193
0.534CysTyr: 0.534 ± 0.153
0.0CysXaa: 0.0 ± 0.0
Asp
4.475AspAla: 4.475 ± 0.585
1.069AspCys: 1.069 ± 0.248
2.738AspAsp: 2.738 ± 0.504
4.675AspGlu: 4.675 ± 0.46
2.271AspPhe: 2.271 ± 0.364
6.412AspGly: 6.412 ± 0.694
0.935AspHis: 0.935 ± 0.266
3.34AspIle: 3.34 ± 0.454
3.674AspLys: 3.674 ± 0.431
4.007AspLeu: 4.007 ± 0.512
1.336AspMet: 1.336 ± 0.27
2.204AspAsn: 2.204 ± 0.449
1.937AspPro: 1.937 ± 0.305
2.137AspGln: 2.137 ± 0.538
2.805AspArg: 2.805 ± 0.418
3.54AspSer: 3.54 ± 0.465
3.34AspThr: 3.34 ± 0.45
3.74AspVal: 3.74 ± 0.479
1.002AspTrp: 1.002 ± 0.239
2.605AspTyr: 2.605 ± 0.4
0.0AspXaa: 0.0 ± 0.0
Glu
5.811GluAla: 5.811 ± 0.673
0.935GluCys: 0.935 ± 0.206
2.805GluAsp: 2.805 ± 0.416
4.408GluGlu: 4.408 ± 0.558
4.141GluPhe: 4.141 ± 0.538
3.139GluGly: 3.139 ± 0.37
1.403GluHis: 1.403 ± 0.334
4.809GluIle: 4.809 ± 0.616
4.074GluLys: 4.074 ± 0.506
5.944GluLeu: 5.944 ± 0.814
3.206GluMet: 3.206 ± 0.425
3.473GluAsn: 3.473 ± 0.463
1.803GluPro: 1.803 ± 0.378
3.406GluGln: 3.406 ± 0.503
2.538GluArg: 2.538 ± 0.483
3.74GluSer: 3.74 ± 0.455
4.675GluThr: 4.675 ± 0.579
6.145GluVal: 6.145 ± 0.689
0.735GluTrp: 0.735 ± 0.222
2.538GluTyr: 2.538 ± 0.354
0.0GluXaa: 0.0 ± 0.0
Phe
3.072PheAla: 3.072 ± 0.383
0.735PheCys: 0.735 ± 0.241
3.206PheAsp: 3.206 ± 0.529
2.872PheGlu: 2.872 ± 0.384
1.536PhePhe: 1.536 ± 0.338
3.607PheGly: 3.607 ± 0.463
0.735PheHis: 0.735 ± 0.21
2.471PheIle: 2.471 ± 0.451
3.072PheLys: 3.072 ± 0.489
2.271PheLeu: 2.271 ± 0.431
1.069PheMet: 1.069 ± 0.268
2.137PheAsn: 2.137 ± 0.441
1.536PhePro: 1.536 ± 0.326
1.536PheGln: 1.536 ± 0.274
1.87PheArg: 1.87 ± 0.326
2.071PheSer: 2.071 ± 0.366
2.738PheThr: 2.738 ± 0.394
2.872PheVal: 2.872 ± 0.402
0.735PheTrp: 0.735 ± 0.216
0.868PheTyr: 0.868 ± 0.225
0.0PheXaa: 0.0 ± 0.0
Gly
5.21GlyAla: 5.21 ± 0.643
1.403GlyCys: 1.403 ± 0.276
4.208GlyAsp: 4.208 ± 0.625
5.009GlyGlu: 5.009 ± 0.565
3.139GlyPhe: 3.139 ± 0.481
6.078GlyGly: 6.078 ± 0.977
1.202GlyHis: 1.202 ± 0.268
4.007GlyIle: 4.007 ± 0.487
7.013GlyLys: 7.013 ± 0.51
5.944GlyLeu: 5.944 ± 0.374
1.937GlyMet: 1.937 ± 0.393
4.141GlyAsn: 4.141 ± 0.457
0.267GlyPro: 0.267 ± 0.147
2.271GlyGln: 2.271 ± 0.394
3.273GlyArg: 3.273 ± 0.397
5.811GlySer: 5.811 ± 0.531
2.471GlyThr: 2.471 ± 0.406
6.078GlyVal: 6.078 ± 0.605
1.336GlyTrp: 1.336 ± 0.311
3.206GlyTyr: 3.206 ± 0.491
0.0GlyXaa: 0.0 ± 0.0
His
1.202HisAla: 1.202 ± 0.367
0.2HisCys: 0.2 ± 0.133
1.269HisAsp: 1.269 ± 0.299
1.269HisGlu: 1.269 ± 0.278
0.735HisPhe: 0.735 ± 0.224
1.336HisGly: 1.336 ± 0.284
0.534HisHis: 0.534 ± 0.203
1.202HisIle: 1.202 ± 0.282
1.336HisLys: 1.336 ± 0.319
1.737HisLeu: 1.737 ± 0.444
0.468HisMet: 0.468 ± 0.172
0.868HisAsn: 0.868 ± 0.226
0.668HisPro: 0.668 ± 0.257
0.468HisGln: 0.468 ± 0.164
0.935HisArg: 0.935 ± 0.261
1.002HisSer: 1.002 ± 0.274
0.868HisThr: 0.868 ± 0.27
1.135HisVal: 1.135 ± 0.237
0.067HisTrp: 0.067 ± 0.077
1.336HisTyr: 1.336 ± 0.337
0.0HisXaa: 0.0 ± 0.0
Ile
5.143IleAla: 5.143 ± 0.508
0.868IleCys: 0.868 ± 0.302
4.141IleAsp: 4.141 ± 0.571
4.074IleGlu: 4.074 ± 0.537
2.004IlePhe: 2.004 ± 0.335
3.273IleGly: 3.273 ± 0.49
1.403IleHis: 1.403 ± 0.286
2.872IleIle: 2.872 ± 0.472
4.675IleLys: 4.675 ± 0.562
3.006IleLeu: 3.006 ± 0.408
2.071IleMet: 2.071 ± 0.384
3.139IleAsn: 3.139 ± 0.621
2.672IlePro: 2.672 ± 0.434
2.204IleGln: 2.204 ± 0.437
3.54IleArg: 3.54 ± 0.448
4.074IleSer: 4.074 ± 0.509
4.408IleThr: 4.408 ± 0.532
4.208IleVal: 4.208 ± 0.48
1.202IleTrp: 1.202 ± 0.288
2.338IleTyr: 2.338 ± 0.319
0.0IleXaa: 0.0 ± 0.0
Lys
6.88LysAla: 6.88 ± 0.89
0.601LysCys: 0.601 ± 0.195
4.809LysAsp: 4.809 ± 0.48
6.011LysGlu: 6.011 ± 0.574
2.137LysPhe: 2.137 ± 0.415
4.275LysGly: 4.275 ± 0.55
1.603LysHis: 1.603 ± 0.4
3.473LysIle: 3.473 ± 0.355
4.675LysLys: 4.675 ± 0.654
5.076LysLeu: 5.076 ± 0.531
2.939LysMet: 2.939 ± 0.42
3.206LysAsn: 3.206 ± 0.537
3.34LysPro: 3.34 ± 0.493
2.805LysGln: 2.805 ± 0.588
3.674LysArg: 3.674 ± 0.542
3.406LysSer: 3.406 ± 0.431
4.742LysThr: 4.742 ± 0.613
6.011LysVal: 6.011 ± 0.724
1.069LysTrp: 1.069 ± 0.284
2.137LysTyr: 2.137 ± 0.365
0.0LysXaa: 0.0 ± 0.0
Leu
6.612LeuAla: 6.612 ± 0.847
0.801LeuCys: 0.801 ± 0.203
3.674LeuAsp: 3.674 ± 0.498
4.943LeuGlu: 4.943 ± 0.645
2.338LeuPhe: 2.338 ± 0.463
4.542LeuGly: 4.542 ± 0.758
1.202LeuHis: 1.202 ± 0.313
4.609LeuIle: 4.609 ± 0.459
3.941LeuLys: 3.941 ± 0.517
3.34LeuLeu: 3.34 ± 0.435
1.803LeuMet: 1.803 ± 0.298
3.807LeuAsn: 3.807 ± 0.422
2.939LeuPro: 2.939 ± 0.48
2.538LeuGln: 2.538 ± 0.445
4.074LeuArg: 4.074 ± 0.488
5.076LeuSer: 5.076 ± 0.427
4.408LeuThr: 4.408 ± 0.456
4.609LeuVal: 4.609 ± 0.443
1.269LeuTrp: 1.269 ± 0.232
2.271LeuTyr: 2.271 ± 0.369
0.0LeuXaa: 0.0 ± 0.0
Met
2.872MetAla: 2.872 ± 0.456
0.334MetCys: 0.334 ± 0.146
1.002MetAsp: 1.002 ± 0.253
1.67MetGlu: 1.67 ± 0.32
0.935MetPhe: 0.935 ± 0.219
1.202MetGly: 1.202 ± 0.271
0.668MetHis: 0.668 ± 0.202
2.338MetIle: 2.338 ± 0.445
2.672MetLys: 2.672 ± 0.494
2.071MetLeu: 2.071 ± 0.346
1.336MetMet: 1.336 ± 0.314
0.935MetAsn: 0.935 ± 0.235
0.935MetPro: 0.935 ± 0.205
1.603MetGln: 1.603 ± 0.367
1.87MetArg: 1.87 ± 0.393
1.603MetSer: 1.603 ± 0.331
1.67MetThr: 1.67 ± 0.271
2.204MetVal: 2.204 ± 0.311
0.134MetTrp: 0.134 ± 0.091
0.935MetTyr: 0.935 ± 0.204
0.0MetXaa: 0.0 ± 0.0
Asn
4.275AsnAla: 4.275 ± 0.497
0.601AsnCys: 0.601 ± 0.221
3.072AsnAsp: 3.072 ± 0.402
2.872AsnGlu: 2.872 ± 0.467
2.071AsnPhe: 2.071 ± 0.293
5.143AsnGly: 5.143 ± 0.621
0.868AsnHis: 0.868 ± 0.235
2.471AsnIle: 2.471 ± 0.379
3.473AsnLys: 3.473 ± 0.435
2.939AsnLeu: 2.939 ± 0.562
1.002AsnMet: 1.002 ± 0.259
2.137AsnAsn: 2.137 ± 0.428
1.336AsnPro: 1.336 ± 0.295
1.536AsnGln: 1.536 ± 0.314
2.071AsnArg: 2.071 ± 0.314
3.34AsnSer: 3.34 ± 0.539
1.803AsnThr: 1.803 ± 0.456
3.74AsnVal: 3.74 ± 0.428
0.868AsnTrp: 0.868 ± 0.209
1.336AsnTyr: 1.336 ± 0.28
0.0AsnXaa: 0.0 ± 0.0
Pro
2.338ProAla: 2.338 ± 0.356
0.334ProCys: 0.334 ± 0.192
1.87ProAsp: 1.87 ± 0.347
3.473ProGlu: 3.473 ± 0.674
2.004ProPhe: 2.004 ± 0.265
2.738ProGly: 2.738 ± 0.501
0.735ProHis: 0.735 ± 0.215
1.803ProIle: 1.803 ± 0.372
1.737ProLys: 1.737 ± 0.346
1.737ProLeu: 1.737 ± 0.34
0.935ProMet: 0.935 ± 0.247
1.536ProAsn: 1.536 ± 0.327
0.668ProPro: 0.668 ± 0.223
1.069ProGln: 1.069 ± 0.238
1.536ProArg: 1.536 ± 0.301
1.87ProSer: 1.87 ± 0.381
1.269ProThr: 1.269 ± 0.291
2.271ProVal: 2.271 ± 0.395
0.401ProTrp: 0.401 ± 0.163
1.269ProTyr: 1.269 ± 0.311
0.0ProXaa: 0.0 ± 0.0
Gln
3.206GlnAla: 3.206 ± 0.498
0.735GlnCys: 0.735 ± 0.176
2.605GlnAsp: 2.605 ± 0.379
2.137GlnGlu: 2.137 ± 0.608
1.403GlnPhe: 1.403 ± 0.258
2.605GlnGly: 2.605 ± 0.415
0.868GlnHis: 0.868 ± 0.234
2.872GlnIle: 2.872 ± 0.462
2.805GlnLys: 2.805 ± 0.433
2.471GlnLeu: 2.471 ± 0.442
1.002GlnMet: 1.002 ± 0.206
1.603GlnAsn: 1.603 ± 0.294
1.336GlnPro: 1.336 ± 0.289
1.67GlnGln: 1.67 ± 0.422
2.404GlnArg: 2.404 ± 0.465
2.204GlnSer: 2.204 ± 0.351
2.137GlnThr: 2.137 ± 0.365
2.471GlnVal: 2.471 ± 0.373
0.735GlnTrp: 0.735 ± 0.212
1.403GlnTyr: 1.403 ± 0.302
0.0GlnXaa: 0.0 ± 0.0
Arg
4.475ArgAla: 4.475 ± 0.602
0.601ArgCys: 0.601 ± 0.242
1.937ArgAsp: 1.937 ± 0.426
3.807ArgGlu: 3.807 ± 0.551
2.672ArgPhe: 2.672 ± 0.483
3.273ArgGly: 3.273 ± 0.387
1.202ArgHis: 1.202 ± 0.368
3.273ArgIle: 3.273 ± 0.322
4.208ArgLys: 4.208 ± 0.453
4.074ArgLeu: 4.074 ± 0.537
1.536ArgMet: 1.536 ± 0.255
2.137ArgAsn: 2.137 ± 0.378
1.269ArgPro: 1.269 ± 0.352
2.338ArgGln: 2.338 ± 0.501
3.273ArgArg: 3.273 ± 0.56
2.805ArgSer: 2.805 ± 0.396
1.87ArgThr: 1.87 ± 0.295
4.208ArgVal: 4.208 ± 0.781
0.735ArgTrp: 0.735 ± 0.251
1.803ArgTyr: 1.803 ± 0.361
0.0ArgXaa: 0.0 ± 0.0
Ser
5.677SerAla: 5.677 ± 0.646
0.468SerCys: 0.468 ± 0.21
3.74SerAsp: 3.74 ± 0.59
4.408SerGlu: 4.408 ± 0.609
2.471SerPhe: 2.471 ± 0.396
5.878SerGly: 5.878 ± 0.544
1.069SerHis: 1.069 ± 0.377
4.609SerIle: 4.609 ± 0.72
4.275SerLys: 4.275 ± 0.581
4.475SerLeu: 4.475 ± 0.677
1.002SerMet: 1.002 ± 0.222
2.137SerAsn: 2.137 ± 0.425
2.471SerPro: 2.471 ± 0.43
3.206SerGln: 3.206 ± 0.647
3.139SerArg: 3.139 ± 0.44
3.139SerSer: 3.139 ± 0.506
2.872SerThr: 2.872 ± 0.456
4.475SerVal: 4.475 ± 0.505
0.735SerTrp: 0.735 ± 0.274
1.737SerTyr: 1.737 ± 0.329
0.0SerXaa: 0.0 ± 0.0
Thr
5.61ThrAla: 5.61 ± 0.771
0.534ThrCys: 0.534 ± 0.244
3.006ThrAsp: 3.006 ± 0.386
2.939ThrGlu: 2.939 ± 0.483
2.939ThrPhe: 2.939 ± 0.328
5.944ThrGly: 5.944 ± 0.598
0.468ThrHis: 0.468 ± 0.148
3.607ThrIle: 3.607 ± 0.512
3.072ThrLys: 3.072 ± 0.547
3.674ThrLeu: 3.674 ± 0.448
1.135ThrMet: 1.135 ± 0.236
3.406ThrAsn: 3.406 ± 0.483
2.404ThrPro: 2.404 ± 0.335
1.403ThrGln: 1.403 ± 0.329
2.004ThrArg: 2.004 ± 0.367
3.34ThrSer: 3.34 ± 0.844
3.139ThrThr: 3.139 ± 0.496
3.54ThrVal: 3.54 ± 0.449
0.735ThrTrp: 0.735 ± 0.183
1.937ThrTyr: 1.937 ± 0.447
0.0ThrXaa: 0.0 ± 0.0
Val
5.61ValAla: 5.61 ± 0.593
1.469ValCys: 1.469 ± 0.358
5.277ValAsp: 5.277 ± 0.481
5.21ValGlu: 5.21 ± 0.694
3.006ValPhe: 3.006 ± 0.433
4.141ValGly: 4.141 ± 0.489
1.002ValHis: 1.002 ± 0.238
4.943ValIle: 4.943 ± 0.565
5.677ValLys: 5.677 ± 0.795
4.876ValLeu: 4.876 ± 0.596
2.338ValMet: 2.338 ± 0.421
3.607ValAsn: 3.607 ± 0.609
2.204ValPro: 2.204 ± 0.429
2.204ValGln: 2.204 ± 0.502
3.807ValArg: 3.807 ± 0.595
4.341ValSer: 4.341 ± 0.636
4.141ValThr: 4.141 ± 0.456
4.208ValVal: 4.208 ± 0.594
1.069ValTrp: 1.069 ± 0.244
2.137ValTyr: 2.137 ± 0.397
0.0ValXaa: 0.0 ± 0.0
Trp
1.002TrpAla: 1.002 ± 0.323
0.468TrpCys: 0.468 ± 0.154
0.601TrpAsp: 0.601 ± 0.158
0.668TrpGlu: 0.668 ± 0.219
0.601TrpPhe: 0.601 ± 0.197
0.735TrpGly: 0.735 ± 0.203
0.668TrpHis: 0.668 ± 0.24
1.002TrpIle: 1.002 ± 0.232
1.603TrpLys: 1.603 ± 0.294
1.069TrpLeu: 1.069 ± 0.294
0.534TrpMet: 0.534 ± 0.212
0.935TrpAsn: 0.935 ± 0.258
0.334TrpPro: 0.334 ± 0.168
0.401TrpGln: 0.401 ± 0.149
1.135TrpArg: 1.135 ± 0.288
1.135TrpSer: 1.135 ± 0.281
0.868TrpThr: 0.868 ± 0.275
0.601TrpVal: 0.601 ± 0.172
0.134TrpTrp: 0.134 ± 0.095
0.468TrpTyr: 0.468 ± 0.161
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.805TyrAla: 2.805 ± 0.478
0.668TyrCys: 0.668 ± 0.254
3.006TyrAsp: 3.006 ± 0.425
2.271TyrGlu: 2.271 ± 0.368
1.469TyrPhe: 1.469 ± 0.358
2.204TyrGly: 2.204 ± 0.313
0.935TyrHis: 0.935 ± 0.187
1.737TyrIle: 1.737 ± 0.292
2.538TyrLys: 2.538 ± 0.373
2.071TyrLeu: 2.071 ± 0.367
0.801TyrMet: 0.801 ± 0.196
1.737TyrAsn: 1.737 ± 0.399
1.536TyrPro: 1.536 ± 0.309
1.469TyrGln: 1.469 ± 0.342
2.071TyrArg: 2.071 ± 0.335
2.071TyrSer: 2.071 ± 0.423
2.471TyrThr: 2.471 ± 0.399
1.87TyrVal: 1.87 ± 0.291
0.401TyrTrp: 0.401 ± 0.155
0.935TyrTyr: 0.935 ± 0.268
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 85 proteins (14973 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski