Amino acid dipepetide frequency for Escherichia phage Shashou

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.066AlaAla: 11.066 ± 1.561
1.15AlaCys: 1.15 ± 0.342
5.533AlaAsp: 5.533 ± 0.722
7.114AlaGlu: 7.114 ± 0.933
3.737AlaPhe: 3.737 ± 0.576
5.964AlaGly: 5.964 ± 0.837
1.437AlaHis: 1.437 ± 0.35
4.743AlaIle: 4.743 ± 0.839
5.605AlaLys: 5.605 ± 0.795
7.833AlaLeu: 7.833 ± 0.843
1.653AlaMet: 1.653 ± 0.37
3.88AlaAsn: 3.88 ± 0.451
3.737AlaPro: 3.737 ± 0.533
3.377AlaGln: 3.377 ± 0.715
4.743AlaArg: 4.743 ± 0.612
5.677AlaSer: 5.677 ± 0.888
5.533AlaThr: 5.533 ± 0.818
6.396AlaVal: 6.396 ± 0.627
1.725AlaTrp: 1.725 ± 0.307
3.737AlaTyr: 3.737 ± 0.5
0.0AlaXaa: 0.0 ± 0.0
Cys
1.078CysAla: 1.078 ± 0.281
0.072CysCys: 0.072 ± 0.079
0.934CysAsp: 0.934 ± 0.227
1.006CysGlu: 1.006 ± 0.291
0.216CysPhe: 0.216 ± 0.107
1.006CysGly: 1.006 ± 0.259
0.144CysHis: 0.144 ± 0.11
0.431CysIle: 0.431 ± 0.208
0.719CysLys: 0.719 ± 0.22
0.647CysLeu: 0.647 ± 0.268
0.072CysMet: 0.072 ± 0.067
0.359CysAsn: 0.359 ± 0.188
0.359CysPro: 0.359 ± 0.163
0.216CysGln: 0.216 ± 0.119
0.934CysArg: 0.934 ± 0.218
0.647CysSer: 0.647 ± 0.229
0.862CysThr: 0.862 ± 0.311
0.719CysVal: 0.719 ± 0.228
0.216CysTrp: 0.216 ± 0.131
0.359CysTyr: 0.359 ± 0.145
0.0CysXaa: 0.0 ± 0.0
Asp
5.749AspAla: 5.749 ± 0.542
0.503AspCys: 0.503 ± 0.193
4.527AspAsp: 4.527 ± 0.565
3.952AspGlu: 3.952 ± 0.634
3.09AspPhe: 3.09 ± 0.41
5.964AspGly: 5.964 ± 0.742
0.934AspHis: 0.934 ± 0.252
3.809AspIle: 3.809 ± 0.406
2.803AspLys: 2.803 ± 0.457
4.455AspLeu: 4.455 ± 0.536
1.653AspMet: 1.653 ± 0.328
2.731AspAsn: 2.731 ± 0.483
1.653AspPro: 1.653 ± 0.362
1.006AspGln: 1.006 ± 0.337
2.371AspArg: 2.371 ± 0.476
3.377AspSer: 3.377 ± 0.538
4.24AspThr: 4.24 ± 0.653
5.03AspVal: 5.03 ± 0.669
0.862AspTrp: 0.862 ± 0.289
1.796AspTyr: 1.796 ± 0.381
0.0AspXaa: 0.0 ± 0.0
Glu
6.036GluAla: 6.036 ± 0.932
0.503GluCys: 0.503 ± 0.205
3.737GluAsp: 3.737 ± 0.596
5.174GluGlu: 5.174 ± 0.915
2.659GluPhe: 2.659 ± 0.659
4.168GluGly: 4.168 ± 0.523
0.934GluHis: 0.934 ± 0.268
3.665GluIle: 3.665 ± 0.536
4.168GluLys: 4.168 ± 0.681
5.892GluLeu: 5.892 ± 0.612
2.946GluMet: 2.946 ± 0.443
2.731GluAsn: 2.731 ± 0.519
1.94GluPro: 1.94 ± 0.484
3.737GluGln: 3.737 ± 0.876
3.809GluArg: 3.809 ± 0.647
2.587GluSer: 2.587 ± 0.565
3.09GluThr: 3.09 ± 0.498
5.246GluVal: 5.246 ± 0.527
1.078GluTrp: 1.078 ± 0.292
2.371GluTyr: 2.371 ± 0.469
0.0GluXaa: 0.0 ± 0.0
Phe
2.803PheAla: 2.803 ± 0.365
0.431PheCys: 0.431 ± 0.185
3.521PheAsp: 3.521 ± 0.561
2.803PheGlu: 2.803 ± 0.488
1.006PhePhe: 1.006 ± 0.252
3.018PheGly: 3.018 ± 0.398
0.503PheHis: 0.503 ± 0.203
2.587PheIle: 2.587 ± 0.468
2.156PheLys: 2.156 ± 0.353
1.94PheLeu: 1.94 ± 0.394
0.575PheMet: 0.575 ± 0.196
1.94PheAsn: 1.94 ± 0.352
1.581PhePro: 1.581 ± 0.409
1.15PheGln: 1.15 ± 0.322
1.868PheArg: 1.868 ± 0.286
2.515PheSer: 2.515 ± 0.476
3.162PheThr: 3.162 ± 0.576
3.018PheVal: 3.018 ± 0.436
0.503PheTrp: 0.503 ± 0.222
1.581PheTyr: 1.581 ± 0.326
0.0PheXaa: 0.0 ± 0.0
Gly
7.186GlyAla: 7.186 ± 0.806
1.222GlyCys: 1.222 ± 0.281
4.383GlyAsp: 4.383 ± 0.578
5.174GlyGlu: 5.174 ± 0.576
3.593GlyPhe: 3.593 ± 0.517
5.318GlyGly: 5.318 ± 0.7
1.437GlyHis: 1.437 ± 0.353
2.659GlyIle: 2.659 ± 0.338
4.886GlyLys: 4.886 ± 0.608
5.03GlyLeu: 5.03 ± 0.683
2.084GlyMet: 2.084 ± 0.372
4.168GlyAsn: 4.168 ± 0.62
1.725GlyPro: 1.725 ± 0.345
2.371GlyGln: 2.371 ± 0.337
3.809GlyArg: 3.809 ± 0.458
5.174GlySer: 5.174 ± 0.786
4.024GlyThr: 4.024 ± 0.571
5.892GlyVal: 5.892 ± 0.773
1.293GlyTrp: 1.293 ± 0.257
2.587GlyTyr: 2.587 ± 0.455
0.0GlyXaa: 0.0 ± 0.0
His
1.078HisAla: 1.078 ± 0.25
0.287HisCys: 0.287 ± 0.126
0.647HisAsp: 0.647 ± 0.221
1.006HisGlu: 1.006 ± 0.328
0.503HisPhe: 0.503 ± 0.202
1.078HisGly: 1.078 ± 0.366
0.359HisHis: 0.359 ± 0.174
1.365HisIle: 1.365 ± 0.333
1.796HisLys: 1.796 ± 0.424
1.078HisLeu: 1.078 ± 0.357
0.287HisMet: 0.287 ± 0.14
0.934HisAsn: 0.934 ± 0.229
0.79HisPro: 0.79 ± 0.184
1.15HisGln: 1.15 ± 0.282
0.862HisArg: 0.862 ± 0.257
0.719HisSer: 0.719 ± 0.211
0.79HisThr: 0.79 ± 0.343
1.078HisVal: 1.078 ± 0.328
0.0HisTrp: 0.0 ± 0.0
0.647HisTyr: 0.647 ± 0.223
0.0HisXaa: 0.0 ± 0.0
Ile
5.174IleAla: 5.174 ± 0.692
1.078IleCys: 1.078 ± 0.263
3.665IleAsp: 3.665 ± 0.659
3.234IleGlu: 3.234 ± 0.401
1.437IlePhe: 1.437 ± 0.35
2.587IleGly: 2.587 ± 0.437
0.431IleHis: 0.431 ± 0.167
2.587IleIle: 2.587 ± 0.407
3.162IleLys: 3.162 ± 0.495
3.018IleLeu: 3.018 ± 0.436
0.934IleMet: 0.934 ± 0.305
3.377IleAsn: 3.377 ± 0.473
2.659IlePro: 2.659 ± 0.335
1.653IleGln: 1.653 ± 0.27
2.874IleArg: 2.874 ± 0.339
3.377IleSer: 3.377 ± 0.511
4.815IleThr: 4.815 ± 0.672
3.809IleVal: 3.809 ± 0.561
0.862IleTrp: 0.862 ± 0.286
1.725IleTyr: 1.725 ± 0.394
0.0IleXaa: 0.0 ± 0.0
Lys
6.611LysAla: 6.611 ± 1.05
0.503LysCys: 0.503 ± 0.235
3.234LysAsp: 3.234 ± 0.528
4.024LysGlu: 4.024 ± 0.733
2.084LysPhe: 2.084 ± 0.301
3.737LysGly: 3.737 ± 0.462
1.437LysHis: 1.437 ± 0.348
2.874LysIle: 2.874 ± 0.627
3.09LysLys: 3.09 ± 0.459
4.743LysLeu: 4.743 ± 0.58
3.234LysMet: 3.234 ± 0.656
2.371LysAsn: 2.371 ± 0.493
2.515LysPro: 2.515 ± 0.44
1.868LysGln: 1.868 ± 0.323
4.24LysArg: 4.24 ± 0.586
2.874LysSer: 2.874 ± 0.571
3.737LysThr: 3.737 ± 0.529
3.665LysVal: 3.665 ± 0.551
0.575LysTrp: 0.575 ± 0.206
2.371LysTyr: 2.371 ± 0.405
0.0LysXaa: 0.0 ± 0.0
Leu
6.827LeuAla: 6.827 ± 0.726
0.719LeuCys: 0.719 ± 0.227
4.096LeuAsp: 4.096 ± 0.536
5.174LeuGlu: 5.174 ± 0.674
2.659LeuPhe: 2.659 ± 0.446
5.389LeuGly: 5.389 ± 0.502
1.437LeuHis: 1.437 ± 0.407
4.24LeuIle: 4.24 ± 0.467
4.527LeuLys: 4.527 ± 0.75
5.677LeuLeu: 5.677 ± 0.59
1.653LeuMet: 1.653 ± 0.304
3.88LeuAsn: 3.88 ± 0.528
3.593LeuPro: 3.593 ± 0.547
2.515LeuGln: 2.515 ± 0.327
5.174LeuArg: 5.174 ± 0.731
4.168LeuSer: 4.168 ± 0.508
5.533LeuThr: 5.533 ± 0.72
4.599LeuVal: 4.599 ± 0.672
0.934LeuTrp: 0.934 ± 0.309
1.94LeuTyr: 1.94 ± 0.395
0.0LeuXaa: 0.0 ± 0.0
Met
2.731MetAla: 2.731 ± 0.476
0.287MetCys: 0.287 ± 0.134
0.503MetAsp: 0.503 ± 0.194
1.365MetGlu: 1.365 ± 0.309
0.719MetPhe: 0.719 ± 0.2
1.725MetGly: 1.725 ± 0.323
0.216MetHis: 0.216 ± 0.122
1.222MetIle: 1.222 ± 0.368
1.796MetLys: 1.796 ± 0.367
1.653MetLeu: 1.653 ± 0.347
0.647MetMet: 0.647 ± 0.227
1.078MetAsn: 1.078 ± 0.321
1.15MetPro: 1.15 ± 0.236
1.078MetGln: 1.078 ± 0.267
1.293MetArg: 1.293 ± 0.303
1.868MetSer: 1.868 ± 0.35
2.012MetThr: 2.012 ± 0.437
2.371MetVal: 2.371 ± 0.382
0.216MetTrp: 0.216 ± 0.138
0.862MetTyr: 0.862 ± 0.297
0.0MetXaa: 0.0 ± 0.0
Asn
4.24AsnAla: 4.24 ± 0.593
0.503AsnCys: 0.503 ± 0.165
2.587AsnAsp: 2.587 ± 0.419
2.803AsnGlu: 2.803 ± 0.423
1.509AsnPhe: 1.509 ± 0.33
4.958AsnGly: 4.958 ± 0.691
0.719AsnHis: 0.719 ± 0.247
2.3AsnIle: 2.3 ± 0.403
2.3AsnLys: 2.3 ± 0.358
3.377AsnLeu: 3.377 ± 0.492
0.647AsnMet: 0.647 ± 0.267
2.659AsnAsn: 2.659 ± 0.534
2.084AsnPro: 2.084 ± 0.425
1.509AsnGln: 1.509 ± 0.479
2.443AsnArg: 2.443 ± 0.473
2.443AsnSer: 2.443 ± 0.351
3.521AsnThr: 3.521 ± 0.57
4.024AsnVal: 4.024 ± 0.561
0.647AsnTrp: 0.647 ± 0.202
1.293AsnTyr: 1.293 ± 0.287
0.0AsnXaa: 0.0 ± 0.0
Pro
3.018ProAla: 3.018 ± 0.443
0.287ProCys: 0.287 ± 0.125
3.09ProAsp: 3.09 ± 0.546
3.306ProGlu: 3.306 ± 0.59
1.365ProPhe: 1.365 ± 0.374
2.587ProGly: 2.587 ± 0.459
0.719ProHis: 0.719 ± 0.207
1.653ProIle: 1.653 ± 0.369
1.725ProLys: 1.725 ± 0.344
2.946ProLeu: 2.946 ± 0.532
0.719ProMet: 0.719 ± 0.249
1.509ProAsn: 1.509 ± 0.312
1.293ProPro: 1.293 ± 0.287
1.222ProGln: 1.222 ± 0.317
1.653ProArg: 1.653 ± 0.363
2.443ProSer: 2.443 ± 0.311
2.874ProThr: 2.874 ± 0.633
4.527ProVal: 4.527 ± 0.503
0.287ProTrp: 0.287 ± 0.132
1.581ProTyr: 1.581 ± 0.382
0.0ProXaa: 0.0 ± 0.0
Gln
3.377GlnAla: 3.377 ± 0.407
0.431GlnCys: 0.431 ± 0.202
2.084GlnAsp: 2.084 ± 0.323
2.371GlnGlu: 2.371 ± 0.579
1.509GlnPhe: 1.509 ± 0.371
1.796GlnGly: 1.796 ± 0.374
0.862GlnHis: 0.862 ± 0.337
1.94GlnIle: 1.94 ± 0.376
2.084GlnLys: 2.084 ± 0.403
3.593GlnLeu: 3.593 ± 0.578
0.862GlnMet: 0.862 ± 0.211
1.796GlnAsn: 1.796 ± 0.405
1.365GlnPro: 1.365 ± 0.274
2.084GlnGln: 2.084 ± 0.54
2.228GlnArg: 2.228 ± 0.392
1.796GlnSer: 1.796 ± 0.419
1.725GlnThr: 1.725 ± 0.357
2.156GlnVal: 2.156 ± 0.455
0.862GlnTrp: 0.862 ± 0.237
1.868GlnTyr: 1.868 ± 0.332
0.0GlnXaa: 0.0 ± 0.0
Arg
4.024ArgAla: 4.024 ± 0.506
0.431ArgCys: 0.431 ± 0.157
2.946ArgAsp: 2.946 ± 0.391
3.593ArgGlu: 3.593 ± 0.568
2.012ArgPhe: 2.012 ± 0.327
3.377ArgGly: 3.377 ± 0.538
1.222ArgHis: 1.222 ± 0.28
2.587ArgIle: 2.587 ± 0.482
3.952ArgLys: 3.952 ± 0.521
4.383ArgLeu: 4.383 ± 0.394
1.509ArgMet: 1.509 ± 0.302
3.449ArgAsn: 3.449 ± 0.505
1.725ArgPro: 1.725 ± 0.329
3.162ArgGln: 3.162 ± 0.524
4.743ArgArg: 4.743 ± 0.63
2.587ArgSer: 2.587 ± 0.484
2.731ArgThr: 2.731 ± 0.431
4.815ArgVal: 4.815 ± 0.551
0.719ArgTrp: 0.719 ± 0.251
1.796ArgTyr: 1.796 ± 0.338
0.0ArgXaa: 0.0 ± 0.0
Ser
5.174SerAla: 5.174 ± 0.951
0.431SerCys: 0.431 ± 0.16
3.306SerAsp: 3.306 ± 0.451
2.803SerGlu: 2.803 ± 0.452
2.587SerPhe: 2.587 ± 0.417
5.821SerGly: 5.821 ± 0.722
0.934SerHis: 0.934 ± 0.276
3.377SerIle: 3.377 ± 0.587
3.306SerLys: 3.306 ± 0.489
3.88SerLeu: 3.88 ± 0.445
1.222SerMet: 1.222 ± 0.425
3.09SerAsn: 3.09 ± 0.523
2.228SerPro: 2.228 ± 0.455
2.3SerGln: 2.3 ± 0.426
2.587SerArg: 2.587 ± 0.453
2.874SerSer: 2.874 ± 0.684
3.809SerThr: 3.809 ± 0.531
4.671SerVal: 4.671 ± 0.473
0.647SerTrp: 0.647 ± 0.198
2.587SerTyr: 2.587 ± 0.469
0.0SerXaa: 0.0 ± 0.0
Thr
6.467ThrAla: 6.467 ± 0.889
0.862ThrCys: 0.862 ± 0.246
3.521ThrAsp: 3.521 ± 0.501
3.737ThrGlu: 3.737 ± 0.516
2.874ThrPhe: 2.874 ± 0.458
6.539ThrGly: 6.539 ± 0.88
0.934ThrHis: 0.934 ± 0.24
3.162ThrIle: 3.162 ± 0.388
3.88ThrLys: 3.88 ± 0.397
4.671ThrLeu: 4.671 ± 0.451
1.509ThrMet: 1.509 ± 0.333
2.084ThrAsn: 2.084 ± 0.472
3.737ThrPro: 3.737 ± 0.604
2.084ThrGln: 2.084 ± 0.409
3.377ThrArg: 3.377 ± 0.417
3.449ThrSer: 3.449 ± 0.444
4.312ThrThr: 4.312 ± 0.496
4.743ThrVal: 4.743 ± 0.634
0.575ThrTrp: 0.575 ± 0.232
2.443ThrTyr: 2.443 ± 0.491
0.0ThrXaa: 0.0 ± 0.0
Val
6.97ValAla: 6.97 ± 0.726
0.79ValCys: 0.79 ± 0.23
4.886ValAsp: 4.886 ± 0.57
5.174ValGlu: 5.174 ± 0.638
2.659ValPhe: 2.659 ± 0.41
4.455ValGly: 4.455 ± 0.44
1.006ValHis: 1.006 ± 0.281
5.102ValIle: 5.102 ± 0.711
4.383ValLys: 4.383 ± 0.519
5.677ValLeu: 5.677 ± 0.74
1.293ValMet: 1.293 ± 0.326
3.162ValAsn: 3.162 ± 0.518
2.946ValPro: 2.946 ± 0.557
2.515ValGln: 2.515 ± 0.362
3.665ValArg: 3.665 ± 0.506
5.533ValSer: 5.533 ± 0.595
5.892ValThr: 5.892 ± 0.651
5.389ValVal: 5.389 ± 0.708
1.15ValTrp: 1.15 ± 0.295
3.162ValTyr: 3.162 ± 0.617
0.0ValXaa: 0.0 ± 0.0
Trp
1.222TrpAla: 1.222 ± 0.389
0.144TrpCys: 0.144 ± 0.088
0.719TrpAsp: 0.719 ± 0.209
0.431TrpGlu: 0.431 ± 0.169
1.006TrpPhe: 1.006 ± 0.32
1.365TrpGly: 1.365 ± 0.365
0.287TrpHis: 0.287 ± 0.155
0.359TrpIle: 0.359 ± 0.164
0.79TrpLys: 0.79 ± 0.247
1.653TrpLeu: 1.653 ± 0.309
0.431TrpMet: 0.431 ± 0.183
0.575TrpAsn: 0.575 ± 0.207
0.359TrpPro: 0.359 ± 0.228
0.503TrpGln: 0.503 ± 0.235
0.79TrpArg: 0.79 ± 0.207
0.719TrpSer: 0.719 ± 0.252
0.503TrpThr: 0.503 ± 0.169
1.293TrpVal: 1.293 ± 0.304
0.287TrpTrp: 0.287 ± 0.162
0.431TrpTyr: 0.431 ± 0.142
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.88TyrAla: 3.88 ± 0.467
0.359TyrCys: 0.359 ± 0.179
2.587TyrAsp: 2.587 ± 0.527
2.228TyrGlu: 2.228 ± 0.31
1.437TyrPhe: 1.437 ± 0.298
3.162TyrGly: 3.162 ± 0.443
0.503TyrHis: 0.503 ± 0.179
1.796TyrIle: 1.796 ± 0.308
2.659TyrLys: 2.659 ± 0.549
2.587TyrLeu: 2.587 ± 0.412
0.79TyrMet: 0.79 ± 0.212
0.79TyrAsn: 0.79 ± 0.209
1.365TyrPro: 1.365 ± 0.425
1.293TyrGln: 1.293 ± 0.294
2.3TyrArg: 2.3 ± 0.433
2.803TyrSer: 2.803 ± 0.492
1.796TyrThr: 1.796 ± 0.394
2.371TyrVal: 2.371 ± 0.456
0.431TyrTrp: 0.431 ± 0.178
1.15TyrTyr: 1.15 ± 0.194
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (13917 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski