Amino acid dipepetide frequency for Shigella phage POCJ13

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.754AlaAla: 8.754 ± 0.932
1.577AlaCys: 1.577 ± 0.393
5.274AlaAsp: 5.274 ± 0.693
8.482AlaGlu: 8.482 ± 0.837
3.643AlaPhe: 3.643 ± 0.534
8.211AlaGly: 8.211 ± 1.184
1.305AlaHis: 1.305 ± 0.249
4.024AlaIle: 4.024 ± 0.446
4.622AlaLys: 4.622 ± 0.507
5.981AlaLeu: 5.981 ± 0.488
3.643AlaMet: 3.643 ± 0.463
2.936AlaAsn: 2.936 ± 0.364
2.991AlaPro: 2.991 ± 0.374
4.948AlaGln: 4.948 ± 0.863
6.09AlaArg: 6.09 ± 0.6
6.144AlaSer: 6.144 ± 0.544
5.274AlaThr: 5.274 ± 0.518
6.199AlaVal: 6.199 ± 0.609
1.414AlaTrp: 1.414 ± 0.277
1.903AlaTyr: 1.903 ± 0.316
0.0AlaXaa: 0.0 ± 0.0
Cys
1.033CysAla: 1.033 ± 0.269
0.272CysCys: 0.272 ± 0.12
0.381CysAsp: 0.381 ± 0.144
0.707CysGlu: 0.707 ± 0.215
0.435CysPhe: 0.435 ± 0.178
0.924CysGly: 0.924 ± 0.291
0.381CysHis: 0.381 ± 0.162
0.272CysIle: 0.272 ± 0.126
0.707CysLys: 0.707 ± 0.227
0.924CysLeu: 0.924 ± 0.298
0.272CysMet: 0.272 ± 0.113
0.381CysAsn: 0.381 ± 0.131
0.598CysPro: 0.598 ± 0.186
0.217CysGln: 0.217 ± 0.103
1.414CysArg: 1.414 ± 0.388
1.196CysSer: 1.196 ± 0.302
0.652CysThr: 0.652 ± 0.195
1.142CysVal: 1.142 ± 0.282
0.109CysTrp: 0.109 ± 0.087
0.435CysTyr: 0.435 ± 0.187
0.0CysXaa: 0.0 ± 0.0
Asp
5.709AspAla: 5.709 ± 0.621
0.707AspCys: 0.707 ± 0.212
4.187AspAsp: 4.187 ± 0.505
4.676AspGlu: 4.676 ± 0.499
2.175AspPhe: 2.175 ± 0.347
5.22AspGly: 5.22 ± 0.427
1.033AspHis: 1.033 ± 0.31
3.48AspIle: 3.48 ± 0.436
3.806AspLys: 3.806 ± 0.423
3.969AspLeu: 3.969 ± 0.398
1.468AspMet: 1.468 ± 0.231
2.882AspAsn: 2.882 ± 0.416
2.719AspPro: 2.719 ± 0.425
1.359AspGln: 1.359 ± 0.387
3.317AspArg: 3.317 ± 0.395
3.426AspSer: 3.426 ± 0.434
2.882AspThr: 2.882 ± 0.467
3.589AspVal: 3.589 ± 0.44
1.359AspTrp: 1.359 ± 0.355
1.74AspTyr: 1.74 ± 0.334
0.0AspXaa: 0.0 ± 0.0
Glu
6.253GluAla: 6.253 ± 0.547
0.924GluCys: 0.924 ± 0.225
2.719GluAsp: 2.719 ± 0.342
4.459GluGlu: 4.459 ± 0.476
2.284GluPhe: 2.284 ± 0.306
4.404GluGly: 4.404 ± 0.55
0.761GluHis: 0.761 ± 0.208
4.241GluIle: 4.241 ± 0.467
4.948GluLys: 4.948 ± 0.581
6.09GluLeu: 6.09 ± 0.583
2.121GluMet: 2.121 ± 0.323
3.099GluAsn: 3.099 ± 0.485
2.175GluPro: 2.175 ± 0.54
4.731GluGln: 4.731 ± 0.625
5.329GluArg: 5.329 ± 0.575
3.371GluSer: 3.371 ± 0.321
3.861GluThr: 3.861 ± 0.568
4.35GluVal: 4.35 ± 0.596
1.087GluTrp: 1.087 ± 0.214
2.447GluTyr: 2.447 ± 0.354
0.0GluXaa: 0.0 ± 0.0
Phe
2.664PheAla: 2.664 ± 0.398
0.544PheCys: 0.544 ± 0.192
1.957PheAsp: 1.957 ± 0.314
1.577PheGlu: 1.577 ± 0.361
0.979PhePhe: 0.979 ± 0.273
2.066PheGly: 2.066 ± 0.316
0.652PheHis: 0.652 ± 0.162
1.903PheIle: 1.903 ± 0.311
1.359PheLys: 1.359 ± 0.231
2.012PheLeu: 2.012 ± 0.318
0.924PheMet: 0.924 ± 0.246
1.631PheAsn: 1.631 ± 0.299
1.251PhePro: 1.251 ± 0.272
0.652PheGln: 0.652 ± 0.196
2.773PheArg: 2.773 ± 0.33
2.882PheSer: 2.882 ± 0.42
2.284PheThr: 2.284 ± 0.305
2.719PheVal: 2.719 ± 0.412
0.652PheTrp: 0.652 ± 0.159
1.087PheTyr: 1.087 ± 0.237
0.0PheXaa: 0.0 ± 0.0
Gly
6.036GlyAla: 6.036 ± 0.916
0.652GlyCys: 0.652 ± 0.271
4.731GlyAsp: 4.731 ± 0.638
6.307GlyGlu: 6.307 ± 1.232
2.447GlyPhe: 2.447 ± 0.404
5.002GlyGly: 5.002 ± 0.698
0.979GlyHis: 0.979 ± 0.257
4.187GlyIle: 4.187 ± 0.595
5.329GlyLys: 5.329 ± 0.811
5.166GlyLeu: 5.166 ± 0.45
2.066GlyMet: 2.066 ± 0.323
2.882GlyAsn: 2.882 ± 0.373
4.567GlyPro: 4.567 ± 2.447
2.556GlyGln: 2.556 ± 0.407
4.513GlyArg: 4.513 ± 0.552
3.861GlySer: 3.861 ± 0.557
3.643GlyThr: 3.643 ± 0.502
5.383GlyVal: 5.383 ± 0.491
0.924GlyTrp: 0.924 ± 0.218
2.066GlyTyr: 2.066 ± 0.304
0.0GlyXaa: 0.0 ± 0.0
His
2.121HisAla: 2.121 ± 0.287
0.272HisCys: 0.272 ± 0.139
0.979HisAsp: 0.979 ± 0.203
0.979HisGlu: 0.979 ± 0.25
0.652HisPhe: 0.652 ± 0.227
1.251HisGly: 1.251 ± 0.345
0.435HisHis: 0.435 ± 0.136
0.924HisIle: 0.924 ± 0.201
0.652HisLys: 0.652 ± 0.242
1.849HisLeu: 1.849 ± 0.399
0.326HisMet: 0.326 ± 0.122
1.196HisAsn: 1.196 ± 0.205
0.707HisPro: 0.707 ± 0.233
0.489HisGln: 0.489 ± 0.181
1.196HisArg: 1.196 ± 0.207
0.924HisSer: 0.924 ± 0.225
0.979HisThr: 0.979 ± 0.199
0.707HisVal: 0.707 ± 0.208
0.217HisTrp: 0.217 ± 0.182
0.761HisTyr: 0.761 ± 0.229
0.0HisXaa: 0.0 ± 0.0
Ile
5.329IleAla: 5.329 ± 0.526
0.924IleCys: 0.924 ± 0.257
3.806IleAsp: 3.806 ± 0.459
2.827IleGlu: 2.827 ± 0.464
0.979IlePhe: 0.979 ± 0.236
2.664IleGly: 2.664 ± 0.581
0.707IleHis: 0.707 ± 0.167
2.664IleIle: 2.664 ± 0.486
3.154IleLys: 3.154 ± 0.457
3.426IleLeu: 3.426 ± 0.556
1.142IleMet: 1.142 ± 0.226
2.392IleAsn: 2.392 ± 0.355
2.882IlePro: 2.882 ± 0.457
2.284IleGln: 2.284 ± 0.288
4.622IleArg: 4.622 ± 0.476
3.969IleSer: 3.969 ± 0.455
3.371IleThr: 3.371 ± 0.614
2.066IleVal: 2.066 ± 0.348
0.326IleTrp: 0.326 ± 0.158
2.012IleTyr: 2.012 ± 0.384
0.0IleXaa: 0.0 ± 0.0
Lys
5.655LysAla: 5.655 ± 0.552
0.544LysCys: 0.544 ± 0.204
3.208LysAsp: 3.208 ± 0.361
3.752LysGlu: 3.752 ± 0.55
1.522LysPhe: 1.522 ± 0.213
4.296LysGly: 4.296 ± 0.909
1.196LysHis: 1.196 ± 0.266
3.317LysIle: 3.317 ± 0.528
4.459LysLys: 4.459 ± 0.766
5.002LysLeu: 5.002 ± 0.624
1.251LysMet: 1.251 ± 0.261
3.426LysAsn: 3.426 ± 0.427
2.61LysPro: 2.61 ± 0.304
2.61LysGln: 2.61 ± 0.372
3.154LysArg: 3.154 ± 0.563
3.154LysSer: 3.154 ± 0.362
3.48LysThr: 3.48 ± 0.386
3.589LysVal: 3.589 ± 0.377
0.707LysTrp: 0.707 ± 0.205
1.577LysTyr: 1.577 ± 0.326
0.0LysXaa: 0.0 ± 0.0
Leu
8.319LeuAla: 8.319 ± 0.756
1.468LeuCys: 1.468 ± 0.282
3.806LeuAsp: 3.806 ± 0.52
4.241LeuGlu: 4.241 ± 0.518
2.338LeuPhe: 2.338 ± 0.391
4.078LeuGly: 4.078 ± 0.537
1.794LeuHis: 1.794 ± 0.291
3.371LeuIle: 3.371 ± 0.519
4.894LeuLys: 4.894 ± 0.522
7.232LeuLeu: 7.232 ± 0.605
1.957LeuMet: 1.957 ± 0.361
4.296LeuAsn: 4.296 ± 0.574
3.861LeuPro: 3.861 ± 0.497
3.317LeuGln: 3.317 ± 0.687
5.057LeuArg: 5.057 ± 0.628
5.546LeuSer: 5.546 ± 0.55
4.731LeuThr: 4.731 ± 0.501
3.861LeuVal: 3.861 ± 0.355
0.652LeuTrp: 0.652 ± 0.187
2.175LeuTyr: 2.175 ± 0.314
0.0LeuXaa: 0.0 ± 0.0
Met
3.208MetAla: 3.208 ± 0.451
0.109MetCys: 0.109 ± 0.075
1.414MetAsp: 1.414 ± 0.285
1.577MetGlu: 1.577 ± 0.288
0.707MetPhe: 0.707 ± 0.215
1.414MetGly: 1.414 ± 0.308
0.435MetHis: 0.435 ± 0.147
1.087MetIle: 1.087 ± 0.216
1.957MetLys: 1.957 ± 0.343
1.903MetLeu: 1.903 ± 0.283
1.087MetMet: 1.087 ± 0.266
1.849MetAsn: 1.849 ± 0.402
1.414MetPro: 1.414 ± 0.288
1.251MetGln: 1.251 ± 0.215
1.522MetArg: 1.522 ± 0.286
2.664MetSer: 2.664 ± 0.277
1.903MetThr: 1.903 ± 0.283
1.468MetVal: 1.468 ± 0.345
0.163MetTrp: 0.163 ± 0.086
0.707MetTyr: 0.707 ± 0.257
0.0MetXaa: 0.0 ± 0.0
Asn
4.731AsnAla: 4.731 ± 0.562
0.598AsnCys: 0.598 ± 0.209
2.392AsnAsp: 2.392 ± 0.436
3.317AsnGlu: 3.317 ± 0.373
1.468AsnPhe: 1.468 ± 0.246
3.534AsnGly: 3.534 ± 0.432
1.033AsnHis: 1.033 ± 0.298
2.773AsnIle: 2.773 ± 0.453
2.284AsnLys: 2.284 ± 0.286
3.262AsnLeu: 3.262 ± 0.381
1.196AsnMet: 1.196 ± 0.272
2.501AsnAsn: 2.501 ± 0.526
2.338AsnPro: 2.338 ± 0.35
1.794AsnGln: 1.794 ± 0.285
2.664AsnArg: 2.664 ± 0.496
2.501AsnSer: 2.501 ± 0.445
2.066AsnThr: 2.066 ± 0.285
2.066AsnVal: 2.066 ± 0.389
0.707AsnTrp: 0.707 ± 0.178
1.686AsnTyr: 1.686 ± 0.29
0.0AsnXaa: 0.0 ± 0.0
Pro
4.132ProAla: 4.132 ± 0.965
0.326ProCys: 0.326 ± 0.133
4.785ProAsp: 4.785 ± 0.494
5.437ProGlu: 5.437 ± 0.611
1.359ProPhe: 1.359 ± 0.243
3.643ProGly: 3.643 ± 0.816
0.652ProHis: 0.652 ± 0.183
1.087ProIle: 1.087 ± 0.259
2.012ProLys: 2.012 ± 0.284
2.501ProLeu: 2.501 ± 0.293
0.598ProMet: 0.598 ± 0.16
0.979ProAsn: 0.979 ± 0.224
1.903ProPro: 1.903 ± 0.323
2.719ProGln: 2.719 ± 0.697
1.522ProArg: 1.522 ± 0.37
2.827ProSer: 2.827 ± 0.365
1.849ProThr: 1.849 ± 0.29
4.622ProVal: 4.622 ± 0.505
0.598ProTrp: 0.598 ± 0.209
1.196ProTyr: 1.196 ± 0.274
0.0ProXaa: 0.0 ± 0.0
Gln
4.187GlnAla: 4.187 ± 0.6
0.544GlnCys: 0.544 ± 0.186
2.284GlnAsp: 2.284 ± 0.373
2.664GlnGlu: 2.664 ± 0.498
1.414GlnPhe: 1.414 ± 0.31
3.426GlnGly: 3.426 ± 1.051
0.761GlnHis: 0.761 ± 0.197
2.121GlnIle: 2.121 ± 0.461
3.154GlnLys: 3.154 ± 0.528
3.045GlnLeu: 3.045 ± 0.443
1.305GlnMet: 1.305 ± 0.268
1.849GlnAsn: 1.849 ± 0.45
1.903GlnPro: 1.903 ± 0.424
3.915GlnGln: 3.915 ± 0.919
3.208GlnArg: 3.208 ± 0.544
2.61GlnSer: 2.61 ± 0.346
2.284GlnThr: 2.284 ± 0.294
2.664GlnVal: 2.664 ± 0.46
0.489GlnTrp: 0.489 ± 0.162
1.305GlnTyr: 1.305 ± 0.268
0.0GlnXaa: 0.0 ± 0.0
Arg
4.35ArgAla: 4.35 ± 0.434
0.435ArgCys: 0.435 ± 0.182
3.861ArgAsp: 3.861 ± 0.505
5.002ArgGlu: 5.002 ± 0.735
2.284ArgPhe: 2.284 ± 0.324
4.894ArgGly: 4.894 ± 0.888
1.631ArgHis: 1.631 ± 0.267
3.752ArgIle: 3.752 ± 0.614
4.296ArgLys: 4.296 ± 0.508
5.166ArgLeu: 5.166 ± 0.452
2.284ArgMet: 2.284 ± 0.354
3.208ArgAsn: 3.208 ± 0.404
1.957ArgPro: 1.957 ± 0.36
3.208ArgGln: 3.208 ± 0.375
5.709ArgArg: 5.709 ± 0.657
3.534ArgSer: 3.534 ± 0.455
3.371ArgThr: 3.371 ± 0.57
4.35ArgVal: 4.35 ± 0.456
1.251ArgTrp: 1.251 ± 0.242
2.447ArgTyr: 2.447 ± 0.486
0.0ArgXaa: 0.0 ± 0.0
Ser
6.09SerAla: 6.09 ± 0.68
0.652SerCys: 0.652 ± 0.218
4.078SerAsp: 4.078 ± 0.451
4.024SerGlu: 4.024 ± 0.429
2.229SerPhe: 2.229 ± 0.365
6.036SerGly: 6.036 ± 0.595
0.979SerHis: 0.979 ± 0.19
2.284SerIle: 2.284 ± 0.4
2.61SerLys: 2.61 ± 0.446
5.981SerLeu: 5.981 ± 0.662
1.849SerMet: 1.849 ± 0.334
2.719SerAsn: 2.719 ± 0.422
3.154SerPro: 3.154 ± 0.372
2.664SerGln: 2.664 ± 0.46
3.806SerArg: 3.806 ± 0.539
2.882SerSer: 2.882 ± 0.566
2.991SerThr: 2.991 ± 0.453
4.296SerVal: 4.296 ± 0.662
0.816SerTrp: 0.816 ± 0.198
1.522SerTyr: 1.522 ± 0.3
0.0SerXaa: 0.0 ± 0.0
Thr
5.383ThrAla: 5.383 ± 0.545
0.544ThrCys: 0.544 ± 0.197
3.317ThrAsp: 3.317 ± 0.49
4.024ThrGlu: 4.024 ± 0.411
1.522ThrPhe: 1.522 ± 0.385
5.655ThrGly: 5.655 ± 0.793
1.033ThrHis: 1.033 ± 0.204
3.48ThrIle: 3.48 ± 0.487
2.882ThrLys: 2.882 ± 0.358
4.567ThrLeu: 4.567 ± 0.512
0.761ThrMet: 0.761 ± 0.175
1.903ThrAsn: 1.903 ± 0.315
3.48ThrPro: 3.48 ± 0.426
2.175ThrGln: 2.175 ± 0.38
2.664ThrArg: 2.664 ± 0.318
3.589ThrSer: 3.589 ± 0.527
3.643ThrThr: 3.643 ± 0.418
3.317ThrVal: 3.317 ± 0.486
1.033ThrTrp: 1.033 ± 0.191
1.142ThrTyr: 1.142 ± 0.387
0.0ThrXaa: 0.0 ± 0.0
Val
6.09ValAla: 6.09 ± 0.598
0.924ValCys: 0.924 ± 0.268
3.969ValAsp: 3.969 ± 0.362
3.861ValGlu: 3.861 ± 0.396
2.556ValPhe: 2.556 ± 0.374
3.317ValGly: 3.317 ± 0.415
0.979ValHis: 0.979 ± 0.204
4.078ValIle: 4.078 ± 0.492
3.099ValLys: 3.099 ± 0.484
5.655ValLeu: 5.655 ± 0.566
1.849ValMet: 1.849 ± 0.279
2.882ValAsn: 2.882 ± 0.323
2.338ValPro: 2.338 ± 0.328
2.121ValGln: 2.121 ± 0.379
4.513ValArg: 4.513 ± 0.876
4.35ValSer: 4.35 ± 0.503
4.513ValThr: 4.513 ± 0.65
4.513ValVal: 4.513 ± 0.48
0.544ValTrp: 0.544 ± 0.183
1.522ValTyr: 1.522 ± 0.218
0.0ValXaa: 0.0 ± 0.0
Trp
0.816TrpAla: 0.816 ± 0.18
0.163TrpCys: 0.163 ± 0.087
0.544TrpAsp: 0.544 ± 0.199
0.598TrpGlu: 0.598 ± 0.163
0.598TrpPhe: 0.598 ± 0.146
0.761TrpGly: 0.761 ± 0.203
0.163TrpHis: 0.163 ± 0.106
0.598TrpIle: 0.598 ± 0.154
0.979TrpLys: 0.979 ± 0.232
1.468TrpLeu: 1.468 ± 0.367
0.761TrpMet: 0.761 ± 0.169
0.489TrpAsn: 0.489 ± 0.177
0.435TrpPro: 0.435 ± 0.189
1.142TrpGln: 1.142 ± 0.189
1.522TrpArg: 1.522 ± 0.292
0.652TrpSer: 0.652 ± 0.186
0.598TrpThr: 0.598 ± 0.253
0.924TrpVal: 0.924 ± 0.232
0.217TrpTrp: 0.217 ± 0.151
0.435TrpTyr: 0.435 ± 0.18
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.556TyrAla: 2.556 ± 0.409
0.272TyrCys: 0.272 ± 0.137
2.012TyrAsp: 2.012 ± 0.274
1.142TyrGlu: 1.142 ± 0.232
0.979TyrPhe: 0.979 ± 0.205
2.501TyrGly: 2.501 ± 0.421
0.707TyrHis: 0.707 ± 0.216
1.903TyrIle: 1.903 ± 0.322
1.305TyrLys: 1.305 ± 0.31
1.794TyrLeu: 1.794 ± 0.354
0.924TyrMet: 0.924 ± 0.216
1.414TyrAsn: 1.414 ± 0.344
1.468TyrPro: 1.468 ± 0.261
0.979TyrGln: 0.979 ± 0.216
2.392TyrArg: 2.392 ± 0.428
1.577TyrSer: 1.577 ± 0.249
1.74TyrThr: 1.74 ± 0.454
1.849TyrVal: 1.849 ± 0.317
0.598TyrTrp: 0.598 ± 0.194
1.196TyrTyr: 1.196 ± 0.252
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (18392 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski