Amino acid dipepetide frequency for Bacillus phage 049ML003

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.118AlaAla: 8.118 ± 1.801
0.206AlaCys: 0.206 ± 0.127
3.921AlaAsp: 3.921 ± 0.507
6.467AlaGlu: 6.467 ± 1.046
3.165AlaPhe: 3.165 ± 0.623
4.403AlaGly: 4.403 ± 0.452
1.032AlaHis: 1.032 ± 0.263
4.472AlaIle: 4.472 ± 0.636
5.366AlaLys: 5.366 ± 0.741
6.604AlaLeu: 6.604 ± 0.804
1.857AlaMet: 1.857 ± 0.383
3.027AlaAsn: 3.027 ± 0.438
1.72AlaPro: 1.72 ± 0.419
2.614AlaGln: 2.614 ± 0.427
3.371AlaArg: 3.371 ± 0.477
4.059AlaSer: 4.059 ± 0.704
3.577AlaThr: 3.577 ± 0.588
4.953AlaVal: 4.953 ± 0.951
1.376AlaTrp: 1.376 ± 0.338
2.683AlaTyr: 2.683 ± 0.433
0.0AlaXaa: 0.0 ± 0.0
Cys
0.688CysAla: 0.688 ± 0.231
0.206CysCys: 0.206 ± 0.155
0.619CysAsp: 0.619 ± 0.231
0.55CysGlu: 0.55 ± 0.21
0.206CysPhe: 0.206 ± 0.111
0.55CysGly: 0.55 ± 0.253
0.206CysHis: 0.206 ± 0.12
0.413CysIle: 0.413 ± 0.212
0.619CysLys: 0.619 ± 0.23
0.206CysLeu: 0.206 ± 0.179
0.0CysMet: 0.0 ± 0.0
0.482CysAsn: 0.482 ± 0.178
0.138CysPro: 0.138 ± 0.101
0.275CysGln: 0.275 ± 0.13
0.413CysArg: 0.413 ± 0.157
0.138CysSer: 0.138 ± 0.092
0.069CysThr: 0.069 ± 0.059
0.619CysVal: 0.619 ± 0.231
0.0CysTrp: 0.0 ± 0.0
0.206CysTyr: 0.206 ± 0.098
0.0CysXaa: 0.0 ± 0.0
Asp
4.403AspAla: 4.403 ± 0.634
0.826AspCys: 0.826 ± 0.211
4.678AspAsp: 4.678 ± 0.743
5.228AspGlu: 5.228 ± 0.742
2.752AspPhe: 2.752 ± 0.405
5.228AspGly: 5.228 ± 0.689
0.963AspHis: 0.963 ± 0.257
4.334AspIle: 4.334 ± 0.779
4.128AspLys: 4.128 ± 0.527
4.54AspLeu: 4.54 ± 0.574
1.238AspMet: 1.238 ± 0.3
3.371AspAsn: 3.371 ± 0.679
2.064AspPro: 2.064 ± 0.504
1.17AspGln: 1.17 ± 0.34
2.752AspArg: 2.752 ± 0.424
3.096AspSer: 3.096 ± 0.403
3.233AspThr: 3.233 ± 0.503
3.233AspVal: 3.233 ± 0.514
0.963AspTrp: 0.963 ± 0.265
2.339AspTyr: 2.339 ± 0.345
0.0AspXaa: 0.0 ± 0.0
Glu
5.366GluAla: 5.366 ± 0.828
0.482GluCys: 0.482 ± 0.16
2.958GluAsp: 2.958 ± 0.446
7.636GluGlu: 7.636 ± 0.842
3.302GluPhe: 3.302 ± 0.416
5.985GluGly: 5.985 ± 0.545
1.582GluHis: 1.582 ± 0.378
5.504GluIle: 5.504 ± 0.616
6.192GluLys: 6.192 ± 0.796
8.531GluLeu: 8.531 ± 0.919
3.646GluMet: 3.646 ± 0.441
4.747GluAsn: 4.747 ± 0.609
2.545GluPro: 2.545 ± 0.361
3.784GluGln: 3.784 ± 0.532
4.54GluArg: 4.54 ± 0.555
3.577GluSer: 3.577 ± 0.538
3.302GluThr: 3.302 ± 0.465
5.71GluVal: 5.71 ± 0.653
1.238GluTrp: 1.238 ± 0.316
3.233GluTyr: 3.233 ± 0.449
0.0GluXaa: 0.0 ± 0.0
Phe
3.027PheAla: 3.027 ± 0.445
0.344PheCys: 0.344 ± 0.18
2.614PheAsp: 2.614 ± 0.454
3.096PheGlu: 3.096 ± 0.5
1.789PhePhe: 1.789 ± 0.307
2.821PheGly: 2.821 ± 0.414
0.619PheHis: 0.619 ± 0.199
2.683PheIle: 2.683 ± 0.398
3.509PheLys: 3.509 ± 0.448
2.958PheLeu: 2.958 ± 0.503
1.376PheMet: 1.376 ± 0.257
2.064PheAsn: 2.064 ± 0.389
0.963PhePro: 0.963 ± 0.234
1.582PheGln: 1.582 ± 0.312
1.101PheArg: 1.101 ± 0.229
1.789PheSer: 1.789 ± 0.315
2.958PheThr: 2.958 ± 0.489
2.201PheVal: 2.201 ± 0.372
0.688PheTrp: 0.688 ± 0.171
1.445PheTyr: 1.445 ± 0.259
0.0PheXaa: 0.0 ± 0.0
Gly
5.435GlyAla: 5.435 ± 0.728
0.688GlyCys: 0.688 ± 0.24
3.646GlyAsp: 3.646 ± 0.557
6.054GlyGlu: 6.054 ± 0.717
2.545GlyPhe: 2.545 ± 0.361
7.086GlyGly: 7.086 ± 0.615
1.101GlyHis: 1.101 ± 0.356
4.128GlyIle: 4.128 ± 0.669
5.779GlyLys: 5.779 ± 0.608
4.334GlyLeu: 4.334 ± 0.637
2.064GlyMet: 2.064 ± 0.357
3.44GlyAsn: 3.44 ± 0.476
1.995GlyPro: 1.995 ± 0.349
3.165GlyGln: 3.165 ± 0.502
3.921GlyArg: 3.921 ± 0.525
4.816GlySer: 4.816 ± 0.678
3.99GlyThr: 3.99 ± 0.579
4.196GlyVal: 4.196 ± 0.419
0.482GlyTrp: 0.482 ± 0.162
3.577GlyTyr: 3.577 ± 0.584
0.0GlyXaa: 0.0 ± 0.0
His
0.963HisAla: 0.963 ± 0.355
0.138HisCys: 0.138 ± 0.095
1.307HisAsp: 1.307 ± 0.318
1.101HisGlu: 1.101 ± 0.279
0.757HisPhe: 0.757 ± 0.225
1.376HisGly: 1.376 ± 0.336
0.413HisHis: 0.413 ± 0.186
1.17HisIle: 1.17 ± 0.299
1.376HisLys: 1.376 ± 0.309
1.238HisLeu: 1.238 ± 0.274
0.757HisMet: 0.757 ± 0.225
0.55HisAsn: 0.55 ± 0.175
0.688HisPro: 0.688 ± 0.219
0.275HisGln: 0.275 ± 0.132
1.17HisArg: 1.17 ± 0.278
1.032HisSer: 1.032 ± 0.243
0.55HisThr: 0.55 ± 0.177
1.101HisVal: 1.101 ± 0.261
0.482HisTrp: 0.482 ± 0.228
0.619HisTyr: 0.619 ± 0.218
0.0HisXaa: 0.0 ± 0.0
Ile
3.233IleAla: 3.233 ± 0.552
0.275IleCys: 0.275 ± 0.135
4.472IleAsp: 4.472 ± 0.668
5.366IleGlu: 5.366 ± 0.684
1.926IlePhe: 1.926 ± 0.394
4.128IleGly: 4.128 ± 0.648
1.032IleHis: 1.032 ± 0.23
4.816IleIle: 4.816 ± 0.503
6.879IleLys: 6.879 ± 0.656
4.747IleLeu: 4.747 ± 0.544
2.339IleMet: 2.339 ± 0.466
3.715IleAsn: 3.715 ± 0.436
2.545IlePro: 2.545 ± 0.447
2.545IleGln: 2.545 ± 0.434
2.545IleArg: 2.545 ± 0.408
4.403IleSer: 4.403 ± 0.521
4.403IleThr: 4.403 ± 0.646
4.472IleVal: 4.472 ± 0.479
0.688IleTrp: 0.688 ± 0.258
2.477IleTyr: 2.477 ± 0.476
0.0IleXaa: 0.0 ± 0.0
Lys
5.848LysAla: 5.848 ± 0.687
0.206LysCys: 0.206 ± 0.108
4.403LysAsp: 4.403 ± 0.537
7.98LysGlu: 7.98 ± 0.934
2.201LysPhe: 2.201 ± 0.506
5.228LysGly: 5.228 ± 0.561
1.995LysHis: 1.995 ± 0.371
5.504LysIle: 5.504 ± 0.628
8.324LysLys: 8.324 ± 1.007
5.366LysLeu: 5.366 ± 0.673
3.027LysMet: 3.027 ± 0.449
6.054LysAsn: 6.054 ± 0.68
2.683LysPro: 2.683 ± 0.543
2.477LysGln: 2.477 ± 0.45
4.816LysArg: 4.816 ± 0.552
4.128LysSer: 4.128 ± 0.832
4.884LysThr: 4.884 ± 0.671
4.54LysVal: 4.54 ± 0.597
1.101LysTrp: 1.101 ± 0.264
2.408LysTyr: 2.408 ± 0.47
0.0LysXaa: 0.0 ± 0.0
Leu
5.228LeuAla: 5.228 ± 0.713
0.619LeuCys: 0.619 ± 0.231
4.265LeuAsp: 4.265 ± 0.484
7.292LeuGlu: 7.292 ± 0.787
2.477LeuPhe: 2.477 ± 0.427
5.022LeuGly: 5.022 ± 0.777
1.238LeuHis: 1.238 ± 0.276
4.265LeuIle: 4.265 ± 0.532
6.879LeuLys: 6.879 ± 0.884
4.678LeuLeu: 4.678 ± 0.58
2.201LeuMet: 2.201 ± 0.392
3.646LeuAsn: 3.646 ± 0.467
2.339LeuPro: 2.339 ± 0.418
3.233LeuGln: 3.233 ± 0.573
3.99LeuArg: 3.99 ± 0.505
4.128LeuSer: 4.128 ± 0.664
4.403LeuThr: 4.403 ± 0.58
3.99LeuVal: 3.99 ± 0.551
0.688LeuTrp: 0.688 ± 0.217
2.477LeuTyr: 2.477 ± 0.47
0.0LeuXaa: 0.0 ± 0.0
Met
3.096MetAla: 3.096 ± 0.765
0.206MetCys: 0.206 ± 0.111
1.513MetAsp: 1.513 ± 0.267
1.926MetGlu: 1.926 ± 0.418
0.894MetPhe: 0.894 ± 0.22
1.582MetGly: 1.582 ± 0.299
0.482MetHis: 0.482 ± 0.215
2.133MetIle: 2.133 ± 0.331
2.752MetLys: 2.752 ± 0.436
1.101MetLeu: 1.101 ± 0.257
1.101MetMet: 1.101 ± 0.335
2.545MetAsn: 2.545 ± 0.35
1.307MetPro: 1.307 ± 0.231
1.238MetGln: 1.238 ± 0.296
1.789MetArg: 1.789 ± 0.316
2.201MetSer: 2.201 ± 0.342
1.857MetThr: 1.857 ± 0.326
0.894MetVal: 0.894 ± 0.314
0.413MetTrp: 0.413 ± 0.208
1.101MetTyr: 1.101 ± 0.244
0.0MetXaa: 0.0 ± 0.0
Asn
3.853AsnAla: 3.853 ± 0.618
0.413AsnCys: 0.413 ± 0.194
3.784AsnAsp: 3.784 ± 0.58
4.403AsnGlu: 4.403 ± 0.705
2.477AsnPhe: 2.477 ± 0.352
4.472AsnGly: 4.472 ± 0.637
0.894AsnHis: 0.894 ± 0.23
4.265AsnIle: 4.265 ± 0.674
4.059AsnLys: 4.059 ± 0.557
4.059AsnLeu: 4.059 ± 0.441
1.651AsnMet: 1.651 ± 0.315
3.165AsnAsn: 3.165 ± 0.667
2.614AsnPro: 2.614 ± 0.467
1.376AsnGln: 1.376 ± 0.377
2.133AsnArg: 2.133 ± 0.379
2.201AsnSer: 2.201 ± 0.329
2.821AsnThr: 2.821 ± 0.368
2.614AsnVal: 2.614 ± 0.545
0.55AsnTrp: 0.55 ± 0.196
1.857AsnTyr: 1.857 ± 0.294
0.0AsnXaa: 0.0 ± 0.0
Pro
2.339ProAla: 2.339 ± 0.446
0.069ProCys: 0.069 ± 0.067
2.27ProAsp: 2.27 ± 0.318
3.165ProGlu: 3.165 ± 0.398
1.72ProPhe: 1.72 ± 0.331
2.752ProGly: 2.752 ± 0.462
0.688ProHis: 0.688 ± 0.193
1.995ProIle: 1.995 ± 0.315
2.614ProLys: 2.614 ± 0.424
2.201ProLeu: 2.201 ± 0.32
0.688ProMet: 0.688 ± 0.154
1.101ProAsn: 1.101 ± 0.236
1.376ProPro: 1.376 ± 0.32
1.17ProGln: 1.17 ± 0.271
0.826ProArg: 0.826 ± 0.268
2.064ProSer: 2.064 ± 0.376
1.513ProThr: 1.513 ± 0.379
3.027ProVal: 3.027 ± 0.445
0.344ProTrp: 0.344 ± 0.163
1.238ProTyr: 1.238 ± 0.291
0.0ProXaa: 0.0 ± 0.0
Gln
3.027GlnAla: 3.027 ± 0.514
0.069GlnCys: 0.069 ± 0.064
2.064GlnAsp: 2.064 ± 0.437
2.477GlnGlu: 2.477 ± 0.385
0.963GlnPhe: 0.963 ± 0.25
1.72GlnGly: 1.72 ± 0.373
0.482GlnHis: 0.482 ± 0.154
1.513GlnIle: 1.513 ± 0.283
3.096GlnLys: 3.096 ± 0.429
3.509GlnLeu: 3.509 ± 0.481
1.307GlnMet: 1.307 ± 0.413
1.995GlnAsn: 1.995 ± 0.394
1.513GlnPro: 1.513 ± 0.279
1.582GlnGln: 1.582 ± 0.33
1.17GlnArg: 1.17 ± 0.294
2.064GlnSer: 2.064 ± 0.342
2.339GlnThr: 2.339 ± 0.412
2.27GlnVal: 2.27 ± 0.37
0.344GlnTrp: 0.344 ± 0.136
1.307GlnTyr: 1.307 ± 0.269
0.0GlnXaa: 0.0 ± 0.0
Arg
2.752ArgAla: 2.752 ± 0.417
0.55ArgCys: 0.55 ± 0.174
2.614ArgAsp: 2.614 ± 0.366
3.44ArgGlu: 3.44 ± 0.415
2.683ArgPhe: 2.683 ± 0.462
3.784ArgGly: 3.784 ± 0.451
0.757ArgHis: 0.757 ± 0.206
4.403ArgIle: 4.403 ± 0.514
4.059ArgLys: 4.059 ± 0.49
3.853ArgLeu: 3.853 ± 0.589
1.513ArgMet: 1.513 ± 0.302
2.133ArgAsn: 2.133 ± 0.364
1.17ArgPro: 1.17 ± 0.298
1.238ArgGln: 1.238 ± 0.277
2.408ArgArg: 2.408 ± 0.436
1.376ArgSer: 1.376 ± 0.293
3.165ArgThr: 3.165 ± 0.473
3.509ArgVal: 3.509 ± 0.522
0.688ArgTrp: 0.688 ± 0.248
1.995ArgTyr: 1.995 ± 0.433
0.0ArgXaa: 0.0 ± 0.0
Ser
4.472SerAla: 4.472 ± 0.492
0.413SerCys: 0.413 ± 0.148
3.921SerAsp: 3.921 ± 0.609
3.646SerGlu: 3.646 ± 0.467
2.27SerPhe: 2.27 ± 0.487
4.953SerGly: 4.953 ± 0.525
0.619SerHis: 0.619 ± 0.196
3.44SerIle: 3.44 ± 0.487
3.509SerLys: 3.509 ± 0.449
3.715SerLeu: 3.715 ± 0.761
1.72SerMet: 1.72 ± 0.347
1.789SerAsn: 1.789 ± 0.341
1.445SerPro: 1.445 ± 0.354
1.513SerGln: 1.513 ± 0.328
3.302SerArg: 3.302 ± 0.471
2.339SerSer: 2.339 ± 0.571
3.44SerThr: 3.44 ± 0.449
3.99SerVal: 3.99 ± 0.616
0.894SerTrp: 0.894 ± 0.22
1.789SerTyr: 1.789 ± 0.333
0.0SerXaa: 0.0 ± 0.0
Thr
4.678ThrAla: 4.678 ± 0.876
0.413ThrCys: 0.413 ± 0.196
3.577ThrAsp: 3.577 ± 0.536
4.334ThrGlu: 4.334 ± 0.5
3.509ThrPhe: 3.509 ± 0.419
4.816ThrGly: 4.816 ± 0.849
0.894ThrHis: 0.894 ± 0.269
4.128ThrIle: 4.128 ± 0.418
3.509ThrLys: 3.509 ± 0.544
3.44ThrLeu: 3.44 ± 0.46
1.238ThrMet: 1.238 ± 0.292
2.477ThrAsn: 2.477 ± 0.513
2.545ThrPro: 2.545 ± 0.397
1.72ThrGln: 1.72 ± 0.321
2.201ThrArg: 2.201 ± 0.421
2.683ThrSer: 2.683 ± 0.502
2.958ThrThr: 2.958 ± 0.446
4.678ThrVal: 4.678 ± 0.606
0.619ThrTrp: 0.619 ± 0.191
2.133ThrTyr: 2.133 ± 0.332
0.0ThrXaa: 0.0 ± 0.0
Val
3.165ValAla: 3.165 ± 0.525
0.138ValCys: 0.138 ± 0.094
4.059ValAsp: 4.059 ± 0.641
5.16ValGlu: 5.16 ± 0.79
1.995ValPhe: 1.995 ± 0.386
2.821ValGly: 2.821 ± 0.44
0.963ValHis: 0.963 ± 0.264
4.816ValIle: 4.816 ± 0.482
6.604ValLys: 6.604 ± 0.703
3.99ValLeu: 3.99 ± 0.56
1.307ValMet: 1.307 ± 0.322
3.921ValAsn: 3.921 ± 0.591
2.27ValPro: 2.27 ± 0.363
2.27ValGln: 2.27 ± 0.316
2.821ValArg: 2.821 ± 0.456
4.128ValSer: 4.128 ± 0.543
4.953ValThr: 4.953 ± 0.76
3.165ValVal: 3.165 ± 0.445
1.789ValTrp: 1.789 ± 0.6
2.27ValTyr: 2.27 ± 0.358
0.0ValXaa: 0.0 ± 0.0
Trp
0.894TrpAla: 0.894 ± 0.241
0.138TrpCys: 0.138 ± 0.104
0.826TrpAsp: 0.826 ± 0.261
0.757TrpGlu: 0.757 ± 0.257
0.482TrpPhe: 0.482 ± 0.228
1.17TrpGly: 1.17 ± 0.268
0.482TrpHis: 0.482 ± 0.223
0.963TrpIle: 0.963 ± 0.288
1.032TrpLys: 1.032 ± 0.231
0.826TrpLeu: 0.826 ± 0.269
0.138TrpMet: 0.138 ± 0.086
1.651TrpAsn: 1.651 ± 0.63
0.138TrpPro: 0.138 ± 0.109
0.413TrpGln: 0.413 ± 0.147
0.894TrpArg: 0.894 ± 0.268
0.894TrpSer: 0.894 ± 0.205
0.55TrpThr: 0.55 ± 0.185
0.894TrpVal: 0.894 ± 0.218
0.069TrpTrp: 0.069 ± 0.06
0.757TrpTyr: 0.757 ± 0.207
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.408TyrAla: 2.408 ± 0.37
0.206TyrCys: 0.206 ± 0.118
3.027TyrAsp: 3.027 ± 0.496
3.784TyrGlu: 3.784 ± 0.726
1.513TyrPhe: 1.513 ± 0.301
2.545TyrGly: 2.545 ± 0.363
0.619TyrHis: 0.619 ± 0.151
2.064TyrIle: 2.064 ± 0.347
2.889TyrLys: 2.889 ± 0.6
3.165TyrLeu: 3.165 ± 0.431
0.894TyrMet: 0.894 ± 0.232
1.789TyrAsn: 1.789 ± 0.378
1.101TyrPro: 1.101 ± 0.361
1.17TyrGln: 1.17 ± 0.319
1.926TyrArg: 1.926 ± 0.416
2.064TyrSer: 2.064 ± 0.395
1.582TyrThr: 1.582 ± 0.336
2.545TyrVal: 2.545 ± 0.456
0.55TyrTrp: 0.55 ± 0.198
1.651TyrTyr: 1.651 ± 0.396
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 81 proteins (14537 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski