Amino acid dipepetide frequency for Escherichia phage IMM-002

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.526AlaAla: 9.526 ± 1.022
0.847AlaCys: 0.847 ± 0.219
5.01AlaAsp: 5.01 ± 0.546
5.715AlaGlu: 5.715 ± 0.677
2.611AlaPhe: 2.611 ± 0.448
8.397AlaGly: 8.397 ± 1.118
1.058AlaHis: 1.058 ± 0.252
5.08AlaIle: 5.08 ± 0.658
5.715AlaLys: 5.715 ± 0.577
6.633AlaLeu: 6.633 ± 0.909
2.681AlaMet: 2.681 ± 0.419
3.599AlaAsn: 3.599 ± 0.411
2.681AlaPro: 2.681 ± 0.448
2.681AlaGln: 2.681 ± 0.475
3.881AlaArg: 3.881 ± 0.479
5.574AlaSer: 5.574 ± 0.756
4.163AlaThr: 4.163 ± 0.641
6.351AlaVal: 6.351 ± 0.874
1.058AlaTrp: 1.058 ± 0.339
2.47AlaTyr: 2.47 ± 0.429
0.0AlaXaa: 0.0 ± 0.0
Cys
0.988CysAla: 0.988 ± 0.256
0.212CysCys: 0.212 ± 0.151
1.058CysAsp: 1.058 ± 0.314
0.706CysGlu: 0.706 ± 0.272
0.776CysPhe: 0.776 ± 0.217
0.917CysGly: 0.917 ± 0.279
0.423CysHis: 0.423 ± 0.199
0.282CysIle: 0.282 ± 0.158
0.917CysLys: 0.917 ± 0.304
1.129CysLeu: 1.129 ± 0.306
0.282CysMet: 0.282 ± 0.218
0.494CysAsn: 0.494 ± 0.188
0.564CysPro: 0.564 ± 0.173
0.564CysGln: 0.564 ± 0.218
0.706CysArg: 0.706 ± 0.256
0.917CysSer: 0.917 ± 0.315
0.706CysThr: 0.706 ± 0.237
1.058CysVal: 1.058 ± 0.373
0.212CysTrp: 0.212 ± 0.135
0.141CysTyr: 0.141 ± 0.105
0.0CysXaa: 0.0 ± 0.0
Asp
5.222AspAla: 5.222 ± 0.776
0.847AspCys: 0.847 ± 0.37
3.74AspAsp: 3.74 ± 0.552
3.74AspGlu: 3.74 ± 0.441
2.329AspPhe: 2.329 ± 0.442
5.574AspGly: 5.574 ± 0.588
1.058AspHis: 1.058 ± 0.269
3.246AspIle: 3.246 ± 0.369
2.822AspLys: 2.822 ± 0.496
4.657AspLeu: 4.657 ± 0.58
1.623AspMet: 1.623 ± 0.302
2.258AspAsn: 2.258 ± 0.385
2.47AspPro: 2.47 ± 0.417
1.976AspGln: 1.976 ± 0.372
2.046AspArg: 2.046 ± 0.349
3.316AspSer: 3.316 ± 0.515
3.74AspThr: 3.74 ± 0.434
3.951AspVal: 3.951 ± 0.535
0.706AspTrp: 0.706 ± 0.208
2.117AspTyr: 2.117 ± 0.336
0.0AspXaa: 0.0 ± 0.0
Glu
6.633GluAla: 6.633 ± 0.847
0.847GluCys: 0.847 ± 0.325
4.093GluAsp: 4.093 ± 0.491
3.881GluGlu: 3.881 ± 0.624
2.258GluPhe: 2.258 ± 0.368
4.798GluGly: 4.798 ± 0.811
0.988GluHis: 0.988 ± 0.232
2.046GluIle: 2.046 ± 0.348
2.47GluLys: 2.47 ± 0.44
5.504GluLeu: 5.504 ± 0.647
1.764GluMet: 1.764 ± 0.331
2.046GluAsn: 2.046 ± 0.43
2.399GluPro: 2.399 ± 0.331
2.681GluGln: 2.681 ± 0.381
4.022GluArg: 4.022 ± 0.55
3.528GluSer: 3.528 ± 0.568
3.316GluThr: 3.316 ± 0.404
4.163GluVal: 4.163 ± 0.597
1.058GluTrp: 1.058 ± 0.205
2.54GluTyr: 2.54 ± 0.481
0.0GluXaa: 0.0 ± 0.0
Phe
2.54PheAla: 2.54 ± 0.413
0.423PheCys: 0.423 ± 0.192
2.611PheAsp: 2.611 ± 0.498
1.835PheGlu: 1.835 ± 0.394
1.341PhePhe: 1.341 ± 0.348
2.681PheGly: 2.681 ± 0.448
0.847PheHis: 0.847 ± 0.307
1.835PheIle: 1.835 ± 0.333
2.47PheLys: 2.47 ± 0.434
3.458PheLeu: 3.458 ± 0.451
0.564PheMet: 0.564 ± 0.194
2.046PheAsn: 2.046 ± 0.325
1.27PhePro: 1.27 ± 0.29
1.058PheGln: 1.058 ± 0.285
1.623PheArg: 1.623 ± 0.259
2.822PheSer: 2.822 ± 0.423
2.117PheThr: 2.117 ± 0.311
2.399PheVal: 2.399 ± 0.437
0.564PheTrp: 0.564 ± 0.215
1.058PheTyr: 1.058 ± 0.189
0.0PheXaa: 0.0 ± 0.0
Gly
6.421GlyAla: 6.421 ± 0.766
0.917GlyCys: 0.917 ± 0.367
4.516GlyAsp: 4.516 ± 0.873
4.375GlyGlu: 4.375 ± 0.549
2.681GlyPhe: 2.681 ± 0.571
5.433GlyGly: 5.433 ± 0.74
1.129GlyHis: 1.129 ± 0.293
4.375GlyIle: 4.375 ± 0.525
5.292GlyLys: 5.292 ± 0.688
6.703GlyLeu: 6.703 ± 0.754
2.258GlyMet: 2.258 ± 0.438
3.387GlyAsn: 3.387 ± 0.814
2.187GlyPro: 2.187 ± 0.402
3.599GlyGln: 3.599 ± 0.467
5.433GlyArg: 5.433 ± 0.416
5.857GlySer: 5.857 ± 0.989
4.163GlyThr: 4.163 ± 0.52
5.998GlyVal: 5.998 ± 0.649
1.058GlyTrp: 1.058 ± 0.267
3.599GlyTyr: 3.599 ± 0.607
0.0GlyXaa: 0.0 ± 0.0
His
0.776HisAla: 0.776 ± 0.252
0.212HisCys: 0.212 ± 0.115
1.129HisAsp: 1.129 ± 0.363
1.058HisGlu: 1.058 ± 0.305
0.494HisPhe: 0.494 ± 0.185
0.988HisGly: 0.988 ± 0.283
0.706HisHis: 0.706 ± 0.259
0.988HisIle: 0.988 ± 0.22
0.635HisLys: 0.635 ± 0.187
2.47HisLeu: 2.47 ± 0.486
0.635HisMet: 0.635 ± 0.214
0.706HisAsn: 0.706 ± 0.19
0.988HisPro: 0.988 ± 0.337
0.564HisGln: 0.564 ± 0.21
1.411HisArg: 1.411 ± 0.341
0.988HisSer: 0.988 ± 0.272
1.341HisThr: 1.341 ± 0.294
0.776HisVal: 0.776 ± 0.242
0.353HisTrp: 0.353 ± 0.176
0.635HisTyr: 0.635 ± 0.236
0.0HisXaa: 0.0 ± 0.0
Ile
3.669IleAla: 3.669 ± 0.494
0.776IleCys: 0.776 ± 0.282
2.822IleAsp: 2.822 ± 0.42
2.611IleGlu: 2.611 ± 0.375
1.129IlePhe: 1.129 ± 0.284
4.445IleGly: 4.445 ± 0.572
1.058IleHis: 1.058 ± 0.209
1.835IleIle: 1.835 ± 0.374
3.387IleLys: 3.387 ± 0.488
3.175IleLeu: 3.175 ± 0.5
0.988IleMet: 0.988 ± 0.277
2.399IleAsn: 2.399 ± 0.411
2.117IlePro: 2.117 ± 0.404
1.623IleGln: 1.623 ± 0.435
3.669IleArg: 3.669 ± 0.476
2.47IleSer: 2.47 ± 0.404
3.387IleThr: 3.387 ± 0.482
3.74IleVal: 3.74 ± 0.521
0.635IleTrp: 0.635 ± 0.166
1.27IleTyr: 1.27 ± 0.246
0.0IleXaa: 0.0 ± 0.0
Lys
6.562LysAla: 6.562 ± 0.639
0.635LysCys: 0.635 ± 0.281
3.175LysAsp: 3.175 ± 0.447
3.105LysGlu: 3.105 ± 0.497
2.046LysPhe: 2.046 ± 0.429
3.74LysGly: 3.74 ± 0.467
0.988LysHis: 0.988 ± 0.348
1.764LysIle: 1.764 ± 0.305
3.246LysLys: 3.246 ± 0.762
4.375LysLeu: 4.375 ± 0.54
1.764LysMet: 1.764 ± 0.301
1.693LysAsn: 1.693 ± 0.358
2.611LysPro: 2.611 ± 0.511
1.835LysGln: 1.835 ± 0.338
3.528LysArg: 3.528 ± 0.514
4.587LysSer: 4.587 ± 0.529
3.74LysThr: 3.74 ± 0.484
4.798LysVal: 4.798 ± 0.476
0.988LysTrp: 0.988 ± 0.278
1.623LysTyr: 1.623 ± 0.297
0.0LysXaa: 0.0 ± 0.0
Leu
7.621LeuAla: 7.621 ± 0.819
0.564LeuCys: 0.564 ± 0.231
4.728LeuAsp: 4.728 ± 0.46
5.504LeuGlu: 5.504 ± 0.667
2.611LeuPhe: 2.611 ± 0.45
4.939LeuGly: 4.939 ± 0.581
1.27LeuHis: 1.27 ± 0.277
3.599LeuIle: 3.599 ± 0.443
6.844LeuLys: 6.844 ± 0.742
6.139LeuLeu: 6.139 ± 0.767
2.47LeuMet: 2.47 ± 0.416
4.516LeuAsn: 4.516 ± 0.547
3.528LeuPro: 3.528 ± 0.405
4.304LeuGln: 4.304 ± 0.577
7.268LeuArg: 7.268 ± 0.815
6.351LeuSer: 6.351 ± 0.613
5.01LeuThr: 5.01 ± 0.693
6.421LeuVal: 6.421 ± 0.885
1.2LeuTrp: 1.2 ± 0.325
1.764LeuTyr: 1.764 ± 0.358
0.0LeuXaa: 0.0 ± 0.0
Met
2.611MetAla: 2.611 ± 0.426
0.353MetCys: 0.353 ± 0.166
1.411MetAsp: 1.411 ± 0.27
1.341MetGlu: 1.341 ± 0.218
0.917MetPhe: 0.917 ± 0.28
2.258MetGly: 2.258 ± 0.412
0.212MetHis: 0.212 ± 0.128
1.482MetIle: 1.482 ± 0.262
0.917MetLys: 0.917 ± 0.232
2.187MetLeu: 2.187 ± 0.369
0.494MetMet: 0.494 ± 0.16
0.988MetAsn: 0.988 ± 0.319
0.988MetPro: 0.988 ± 0.305
0.917MetGln: 0.917 ± 0.235
1.27MetArg: 1.27 ± 0.331
2.681MetSer: 2.681 ± 0.43
2.187MetThr: 2.187 ± 0.344
2.54MetVal: 2.54 ± 0.418
0.212MetTrp: 0.212 ± 0.109
0.917MetTyr: 0.917 ± 0.197
0.0MetXaa: 0.0 ± 0.0
Asn
3.881AsnAla: 3.881 ± 0.512
0.635AsnCys: 0.635 ± 0.241
1.764AsnAsp: 1.764 ± 0.371
2.117AsnGlu: 2.117 ± 0.375
1.623AsnPhe: 1.623 ± 0.241
4.234AsnGly: 4.234 ± 0.709
0.635AsnHis: 0.635 ± 0.237
2.187AsnIle: 2.187 ± 0.355
1.764AsnLys: 1.764 ± 0.352
3.387AsnLeu: 3.387 ± 0.669
1.129AsnMet: 1.129 ± 0.249
1.976AsnAsn: 1.976 ± 0.477
2.611AsnPro: 2.611 ± 0.442
1.905AsnGln: 1.905 ± 0.303
2.681AsnArg: 2.681 ± 0.475
2.187AsnSer: 2.187 ± 0.42
2.47AsnThr: 2.47 ± 0.53
2.822AsnVal: 2.822 ± 0.418
0.212AsnTrp: 0.212 ± 0.155
1.411AsnTyr: 1.411 ± 0.35
0.0AsnXaa: 0.0 ± 0.0
Pro
3.387ProAla: 3.387 ± 0.544
0.988ProCys: 0.988 ± 0.371
2.046ProAsp: 2.046 ± 0.276
3.034ProGlu: 3.034 ± 0.395
1.693ProPhe: 1.693 ± 0.368
1.905ProGly: 1.905 ± 0.364
0.635ProHis: 0.635 ± 0.18
1.623ProIle: 1.623 ± 0.347
2.681ProLys: 2.681 ± 0.572
3.387ProLeu: 3.387 ± 0.619
1.411ProMet: 1.411 ± 0.362
2.117ProAsn: 2.117 ± 0.391
1.976ProPro: 1.976 ± 0.54
1.693ProGln: 1.693 ± 0.313
1.905ProArg: 1.905 ± 0.397
3.105ProSer: 3.105 ± 0.475
3.175ProThr: 3.175 ± 0.517
2.964ProVal: 2.964 ± 0.391
0.776ProTrp: 0.776 ± 0.261
1.2ProTyr: 1.2 ± 0.281
0.0ProXaa: 0.0 ± 0.0
Gln
3.246GlnAla: 3.246 ± 0.529
0.494GlnCys: 0.494 ± 0.201
3.105GlnAsp: 3.105 ± 0.665
2.47GlnGlu: 2.47 ± 0.397
1.693GlnPhe: 1.693 ± 0.355
3.246GlnGly: 3.246 ± 0.495
0.423GlnHis: 0.423 ± 0.178
1.2GlnIle: 1.2 ± 0.262
1.976GlnLys: 1.976 ± 0.47
4.728GlnLeu: 4.728 ± 0.531
1.129GlnMet: 1.129 ± 0.295
1.341GlnAsn: 1.341 ± 0.409
1.552GlnPro: 1.552 ± 0.287
1.835GlnGln: 1.835 ± 0.456
2.47GlnArg: 2.47 ± 0.525
2.752GlnSer: 2.752 ± 0.424
2.822GlnThr: 2.822 ± 0.533
2.681GlnVal: 2.681 ± 0.419
0.776GlnTrp: 0.776 ± 0.231
0.988GlnTyr: 0.988 ± 0.218
0.0GlnXaa: 0.0 ± 0.0
Arg
4.587ArgAla: 4.587 ± 0.547
1.058ArgCys: 1.058 ± 0.292
4.022ArgAsp: 4.022 ± 0.415
3.81ArgGlu: 3.81 ± 0.467
2.611ArgPhe: 2.611 ± 0.354
4.022ArgGly: 4.022 ± 0.471
0.847ArgHis: 0.847 ± 0.273
3.175ArgIle: 3.175 ± 0.545
2.681ArgLys: 2.681 ± 0.504
6.703ArgLeu: 6.703 ± 0.654
1.482ArgMet: 1.482 ± 0.381
2.399ArgAsn: 2.399 ± 0.435
2.752ArgPro: 2.752 ± 0.543
2.611ArgGln: 2.611 ± 0.339
2.752ArgArg: 2.752 ± 0.506
3.81ArgSer: 3.81 ± 0.546
2.822ArgThr: 2.822 ± 0.491
3.951ArgVal: 3.951 ± 0.587
1.27ArgTrp: 1.27 ± 0.282
1.411ArgTyr: 1.411 ± 0.257
0.0ArgXaa: 0.0 ± 0.0
Ser
5.504SerAla: 5.504 ± 0.636
0.988SerCys: 0.988 ± 0.246
4.093SerAsp: 4.093 ± 0.448
3.105SerGlu: 3.105 ± 0.411
2.893SerPhe: 2.893 ± 0.327
5.786SerGly: 5.786 ± 0.701
2.329SerHis: 2.329 ± 0.448
2.964SerIle: 2.964 ± 0.45
3.175SerLys: 3.175 ± 0.38
5.998SerLeu: 5.998 ± 0.664
1.129SerMet: 1.129 ± 0.257
2.117SerAsn: 2.117 ± 0.402
3.387SerPro: 3.387 ± 0.541
2.822SerGln: 2.822 ± 0.47
3.881SerArg: 3.881 ± 0.553
6.139SerSer: 6.139 ± 0.993
4.234SerThr: 4.234 ± 0.662
5.504SerVal: 5.504 ± 0.66
1.129SerTrp: 1.129 ± 0.25
2.611SerTyr: 2.611 ± 0.467
0.0SerXaa: 0.0 ± 0.0
Thr
3.458ThrAla: 3.458 ± 0.581
0.706ThrCys: 0.706 ± 0.232
2.752ThrAsp: 2.752 ± 0.386
4.728ThrGlu: 4.728 ± 0.498
2.258ThrPhe: 2.258 ± 0.306
5.927ThrGly: 5.927 ± 0.979
0.917ThrHis: 0.917 ± 0.24
3.599ThrIle: 3.599 ± 0.488
3.105ThrLys: 3.105 ± 0.384
5.715ThrLeu: 5.715 ± 0.623
1.552ThrMet: 1.552 ± 0.332
2.046ThrAsn: 2.046 ± 0.346
2.893ThrPro: 2.893 ± 0.36
2.611ThrGln: 2.611 ± 0.499
2.47ThrArg: 2.47 ± 0.429
4.304ThrSer: 4.304 ± 0.789
3.387ThrThr: 3.387 ± 0.527
5.01ThrVal: 5.01 ± 0.609
0.635ThrTrp: 0.635 ± 0.166
1.693ThrTyr: 1.693 ± 0.301
0.0ThrXaa: 0.0 ± 0.0
Val
5.786ValAla: 5.786 ± 0.638
0.847ValCys: 0.847 ± 0.282
3.105ValAsp: 3.105 ± 0.424
5.363ValGlu: 5.363 ± 0.643
2.117ValPhe: 2.117 ± 0.352
5.927ValGly: 5.927 ± 0.58
1.341ValHis: 1.341 ± 0.452
3.951ValIle: 3.951 ± 0.452
3.81ValLys: 3.81 ± 0.5
6.421ValLeu: 6.421 ± 0.773
2.117ValMet: 2.117 ± 0.413
3.599ValAsn: 3.599 ± 0.596
3.246ValPro: 3.246 ± 0.444
3.528ValGln: 3.528 ± 0.564
3.951ValArg: 3.951 ± 0.472
5.08ValSer: 5.08 ± 0.603
4.445ValThr: 4.445 ± 0.584
5.433ValVal: 5.433 ± 0.834
0.776ValTrp: 0.776 ± 0.214
2.752ValTyr: 2.752 ± 0.381
0.0ValXaa: 0.0 ± 0.0
Trp
0.423TrpAla: 0.423 ± 0.147
0.282TrpCys: 0.282 ± 0.177
0.564TrpAsp: 0.564 ± 0.175
0.847TrpGlu: 0.847 ± 0.234
0.494TrpPhe: 0.494 ± 0.184
0.988TrpGly: 0.988 ± 0.301
0.494TrpHis: 0.494 ± 0.196
0.353TrpIle: 0.353 ± 0.177
1.2TrpLys: 1.2 ± 0.322
2.046TrpLeu: 2.046 ± 0.395
0.212TrpMet: 0.212 ± 0.113
0.564TrpAsn: 0.564 ± 0.166
0.212TrpPro: 0.212 ± 0.127
0.564TrpGln: 0.564 ± 0.22
1.27TrpArg: 1.27 ± 0.305
1.341TrpSer: 1.341 ± 0.352
0.776TrpThr: 0.776 ± 0.241
1.27TrpVal: 1.27 ± 0.319
0.212TrpTrp: 0.212 ± 0.123
0.423TrpTyr: 0.423 ± 0.19
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.893TyrAla: 2.893 ± 0.432
0.423TyrCys: 0.423 ± 0.166
1.693TyrAsp: 1.693 ± 0.32
1.693TyrGlu: 1.693 ± 0.362
0.988TyrPhe: 0.988 ± 0.312
3.175TyrGly: 3.175 ± 0.494
0.494TyrHis: 0.494 ± 0.183
1.623TyrIle: 1.623 ± 0.373
1.764TyrLys: 1.764 ± 0.362
1.835TyrLeu: 1.835 ± 0.353
1.129TyrMet: 1.129 ± 0.214
1.341TyrAsn: 1.341 ± 0.377
1.27TyrPro: 1.27 ± 0.345
1.552TyrGln: 1.552 ± 0.429
2.54TyrArg: 2.54 ± 0.417
1.976TyrSer: 1.976 ± 0.35
1.764TyrThr: 1.764 ± 0.344
1.764TyrVal: 1.764 ± 0.346
0.706TyrTrp: 0.706 ± 0.22
1.058TyrTyr: 1.058 ± 0.266
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 83 proteins (14173 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski