Amino acid dipeptide frequency for Escherichia phage PTXU04

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
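As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. It is not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter codes, the mapping of non-standard residues to X (Xaa), and the normalization by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Hypothetical helper: counts overlapping pairs and normalizes by the
    total number of pairs, which may differ from the database's convention.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs; a pair never spans two different proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real phage proteins.
toy_proteins = ["MAAK", "MAC"]
freqs = dipeptide_per_mille(toy_proteins)
```

For the toy input there are five dipeptides in total (MA, AA, AK, MA, AC), so MA accounts for two of the five.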

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
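The uniform expectation and the per mille units can be tied together in a few lines. The sketch below is only an illustration: the helper names are hypothetical, and the plain greater-than/less-than comparison against the uniform expectation is an assumption, since the page does not state the exact rule used for the colouring.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected):
    """Label a per mille value relative to the expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


expected = expected_per_mille()  # 20 x 20 = 400 dipeptides
# Three values copied from the table, in per mille.
sample = {"AlaAla": 16.241, "CysCys": 0.152, "LysLys": 2.588}
labels = {pair: classify(value, expected) for pair, value in sample.items()}
```

With 21 symbols (including Xaa) the same function gives 1000/441, roughly 2.27‰, instead of 2.5‰.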

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 16.241 ± 2.404
AlaCys: 0.609 ± 0.188
AlaAsp: 5.329 ± 0.634
AlaGlu: 6.699 ± 0.782
AlaPhe: 2.741 ± 0.33
AlaGly: 7.004 ± 0.853
AlaHis: 1.32 ± 0.27
AlaIle: 5.278 ± 0.547
AlaLys: 5.583 ± 0.581
AlaLeu: 7.816 ± 0.656
AlaMet: 3.147 ± 0.378
AlaAsn: 5.684 ± 0.852
AlaPro: 5.126 ± 0.788
AlaGln: 5.025 ± 0.768
AlaArg: 5.786 ± 0.548
AlaSer: 7.004 ± 1.305
AlaThr: 7.156 ± 1.049
AlaVal: 6.446 ± 0.534
AlaTrp: 1.269 ± 0.282
AlaTyr: 3.553 ± 0.421
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.863 ± 0.211
CysCys: 0.152 ± 0.073
CysAsp: 0.761 ± 0.214
CysGlu: 0.305 ± 0.109
CysPhe: 0.457 ± 0.177
CysGly: 1.066 ± 0.307
CysHis: 0.152 ± 0.084
CysIle: 0.66 ± 0.192
CysLys: 0.457 ± 0.171
CysLeu: 0.609 ± 0.176
CysMet: 0.355 ± 0.125
CysAsn: 0.457 ± 0.165
CysPro: 0.558 ± 0.18
CysGln: 0.203 ± 0.103
CysArg: 0.812 ± 0.223
CysSer: 0.863 ± 0.238
CysThr: 0.609 ± 0.177
CysVal: 0.508 ± 0.161
CysTrp: 0.203 ± 0.099
CysTyr: 0.203 ± 0.098
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.446 ± 0.653
AspCys: 0.66 ± 0.179
AspAsp: 4.314 ± 0.466
AspGlu: 3.807 ± 0.488
AspPhe: 1.878 ± 0.341
AspGly: 4.111 ± 0.481
AspHis: 1.32 ± 0.329
AspIle: 2.994 ± 0.368
AspLys: 3.248 ± 0.484
AspLeu: 4.872 ± 0.378
AspMet: 1.523 ± 0.262
AspAsn: 2.842 ± 0.474
AspPro: 2.385 ± 0.325
AspGln: 2.182 ± 0.33
AspArg: 2.994 ± 0.314
AspSer: 4.263 ± 0.492
AspThr: 3.451 ± 0.503
AspVal: 4.06 ± 0.541
AspTrp: 0.812 ± 0.202
AspTyr: 2.994 ± 0.443
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.461 ± 0.649
GluCys: 0.406 ± 0.163
GluAsp: 2.487 ± 0.372
GluGlu: 3.959 ± 0.604
GluPhe: 1.675 ± 0.29
GluGly: 3.654 ± 0.396
GluHis: 1.218 ± 0.29
GluIle: 3.096 ± 0.397
GluLys: 3.705 ± 0.524
GluLeu: 5.278 ± 0.581
GluMet: 2.487 ± 0.384
GluAsn: 2.182 ± 0.305
GluPro: 2.081 ± 0.378
GluGln: 2.791 ± 0.398
GluArg: 3.451 ± 0.477
GluSer: 3.35 ± 0.439
GluThr: 2.233 ± 0.357
GluVal: 3.248 ± 0.535
GluTrp: 1.167 ± 0.27
GluTyr: 2.182 ± 0.325
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.233 ± 0.287
PheCys: 0.609 ± 0.226
PheAsp: 2.791 ± 0.377
PheGlu: 1.929 ± 0.306
PhePhe: 1.167 ± 0.301
PheGly: 2.436 ± 0.344
PheHis: 0.558 ± 0.173
PheIle: 1.827 ± 0.308
PheLys: 2.182 ± 0.429
PheLeu: 1.929 ± 0.258
PheMet: 0.914 ± 0.22
PheAsn: 1.929 ± 0.337
PhePro: 1.523 ± 0.37
PheGln: 1.269 ± 0.239
PheArg: 1.421 ± 0.259
PheSer: 2.233 ± 0.3
PheThr: 2.69 ± 0.369
PheVal: 1.878 ± 0.299
PheTrp: 0.406 ± 0.172
PheTyr: 1.523 ± 0.324
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.613 ± 0.604
GlyCys: 0.761 ± 0.213
GlyAsp: 4.771 ± 0.501
GlyGlu: 3.502 ± 0.399
GlyPhe: 2.538 ± 0.355
GlyGly: 5.583 ± 0.719
GlyHis: 1.624 ± 0.273
GlyIle: 3.604 ± 0.402
GlyLys: 4.365 ± 0.496
GlyLeu: 5.634 ± 0.582
GlyMet: 2.385 ± 0.382
GlyAsn: 4.06 ± 0.505
GlyPro: 1.37 ± 0.225
GlyGln: 2.893 ± 0.332
GlyArg: 3.147 ± 0.468
GlySer: 3.807 ± 0.757
GlyThr: 6.141 ± 0.849
GlyVal: 4.72 ± 0.612
GlyTrp: 1.523 ± 0.272
GlyTyr: 3.096 ± 0.495
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.573 ± 0.335
HisCys: 0.711 ± 0.208
HisAsp: 1.218 ± 0.25
HisGlu: 0.508 ± 0.154
HisPhe: 0.508 ± 0.172
HisGly: 0.964 ± 0.21
HisHis: 0.558 ± 0.178
HisIle: 1.066 ± 0.248
HisLys: 0.964 ± 0.197
HisLeu: 1.726 ± 0.34
HisMet: 0.558 ± 0.149
HisAsn: 1.32 ± 0.227
HisPro: 0.761 ± 0.182
HisGln: 0.457 ± 0.162
HisArg: 1.117 ± 0.297
HisSer: 1.269 ± 0.325
HisThr: 2.03 ± 0.742
HisVal: 1.269 ± 0.24
HisTrp: 0.152 ± 0.077
HisTyr: 1.066 ± 0.241
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.822 ± 0.506
IleCys: 0.508 ± 0.174
IleAsp: 3.4 ± 0.423
IleGlu: 3.147 ± 0.403
IlePhe: 1.472 ± 0.267
IleGly: 3.147 ± 0.362
IleHis: 1.066 ± 0.2
IleIle: 3.045 ± 0.457
IleLys: 3.197 ± 0.374
IleLeu: 2.994 ± 0.336
IleMet: 1.421 ± 0.262
IleAsn: 2.994 ± 0.439
IlePro: 1.776 ± 0.329
IleGln: 1.472 ± 0.26
IleArg: 2.893 ± 0.506
IleSer: 2.639 ± 0.404
IleThr: 2.994 ± 0.393
IleVal: 3.756 ± 0.451
IleTrp: 0.609 ± 0.19
IleTyr: 1.523 ± 0.285
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.243 ± 0.727
LysCys: 0.457 ± 0.169
LysAsp: 2.944 ± 0.537
LysGlu: 3.096 ± 0.572
LysPhe: 1.573 ± 0.224
LysGly: 4.568 ± 0.582
LysHis: 0.761 ± 0.237
LysIle: 2.03 ± 0.391
LysLys: 2.588 ± 0.499
LysLeu: 4.923 ± 0.546
LysMet: 2.436 ± 0.362
LysAsn: 1.827 ± 0.383
LysPro: 2.538 ± 0.419
LysGln: 2.791 ± 0.441
LysArg: 3.045 ± 0.377
LysSer: 3.299 ± 0.365
LysThr: 2.791 ± 0.4
LysVal: 3.147 ± 0.398
LysTrp: 0.66 ± 0.21
LysTyr: 2.538 ± 0.357
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.258 ± 0.462
LeuCys: 0.914 ± 0.212
LeuAsp: 4.416 ± 0.528
LeuGlu: 5.075 ± 0.476
LeuPhe: 2.385 ± 0.346
LeuGly: 4.06 ± 0.506
LeuHis: 1.472 ± 0.359
LeuIle: 3.705 ± 0.443
LeuLys: 4.872 ± 0.506
LeuLeu: 4.213 ± 0.441
LeuMet: 1.979 ± 0.312
LeuAsn: 4.01 ± 0.399
LeuPro: 4.314 ± 0.446
LeuGln: 2.639 ± 0.331
LeuArg: 4.568 ± 0.486
LeuSer: 4.771 ± 0.548
LeuThr: 5.025 ± 0.449
LeuVal: 4.416 ± 0.463
LeuTrp: 1.167 ± 0.329
LeuTyr: 1.878 ± 0.372
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.908 ± 0.489
MetCys: 0.254 ± 0.139
MetAsp: 1.776 ± 0.251
MetGlu: 1.218 ± 0.267
MetPhe: 0.711 ± 0.173
MetGly: 2.538 ± 0.37
MetHis: 0.406 ± 0.174
MetIle: 1.218 ± 0.271
MetLys: 1.827 ± 0.251
MetLeu: 2.182 ± 0.372
MetMet: 1.218 ± 0.225
MetAsn: 1.218 ± 0.212
MetPro: 1.066 ± 0.26
MetGln: 1.523 ± 0.312
MetArg: 1.32 ± 0.282
MetSer: 2.639 ± 0.353
MetThr: 1.878 ± 0.307
MetVal: 1.979 ± 0.334
MetTrp: 0.558 ± 0.169
MetTyr: 0.863 ± 0.216
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.329 ± 0.686
AsnCys: 0.254 ± 0.148
AsnAsp: 3.857 ± 0.445
AsnGlu: 2.791 ± 0.364
AsnPhe: 1.726 ± 0.364
AsnGly: 4.06 ± 0.422
AsnHis: 1.675 ± 0.287
AsnIle: 2.487 ± 0.44
AsnLys: 2.944 ± 0.355
AsnLeu: 3.451 ± 0.427
AsnMet: 1.32 ± 0.286
AsnAsn: 2.741 ± 0.607
AsnPro: 2.03 ± 0.29
AsnGln: 2.233 ± 0.368
AsnArg: 2.842 ± 0.399
AsnSer: 3.147 ± 0.464
AsnThr: 2.588 ± 0.405
AsnVal: 3.248 ± 0.392
AsnTrp: 0.406 ± 0.134
AsnTyr: 1.015 ± 0.285
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.974 ± 0.938
ProCys: 0.457 ± 0.16
ProAsp: 3.147 ± 0.457
ProGlu: 2.944 ± 0.54
ProPhe: 1.421 ± 0.222
ProGly: 3.807 ± 0.466
ProHis: 0.761 ± 0.201
ProIle: 1.32 ± 0.241
ProLys: 2.385 ± 0.461
ProLeu: 2.081 ± 0.374
ProMet: 0.914 ± 0.222
ProAsn: 1.929 ± 0.284
ProPro: 2.284 ± 0.402
ProGln: 2.639 ± 0.382
ProArg: 1.827 ± 0.382
ProSer: 2.03 ± 0.319
ProThr: 1.929 ± 0.323
ProVal: 4.466 ± 0.556
ProTrp: 0.508 ± 0.181
ProTyr: 1.32 ± 0.334
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.887 ± 0.756
GlnCys: 0.558 ± 0.194
GlnAsp: 1.37 ± 0.287
GlnGlu: 1.827 ± 0.416
GlnPhe: 1.878 ± 0.355
GlnGly: 2.893 ± 0.439
GlnHis: 0.761 ± 0.182
GlnIle: 1.878 ± 0.383
GlnLys: 1.827 ± 0.25
GlnLeu: 3.299 ± 0.537
GlnMet: 1.979 ± 0.25
GlnAsn: 1.37 ± 0.296
GlnPro: 2.03 ± 0.301
GlnGln: 3.857 ± 1.199
GlnArg: 2.335 ± 0.303
GlnSer: 2.385 ± 0.461
GlnThr: 2.69 ± 0.404
GlnVal: 2.944 ± 0.724
GlnTrp: 0.66 ± 0.199
GlnTyr: 2.03 ± 0.242
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.162 ± 0.417
ArgCys: 0.761 ± 0.229
ArgAsp: 3.807 ± 0.466
ArgGlu: 3.553 ± 0.472
ArgPhe: 1.726 ± 0.256
ArgGly: 3.604 ± 0.441
ArgHis: 1.117 ± 0.245
ArgIle: 2.944 ± 0.415
ArgLys: 3.045 ± 0.453
ArgLeu: 4.06 ± 0.481
ArgMet: 1.827 ± 0.295
ArgAsn: 3.096 ± 0.394
ArgPro: 1.929 ± 0.391
ArgGln: 2.284 ± 0.397
ArgArg: 2.639 ± 0.321
ArgSer: 2.487 ± 0.297
ArgThr: 2.182 ± 0.366
ArgVal: 3.959 ± 0.438
ArgTrp: 0.761 ± 0.205
ArgTyr: 2.335 ± 0.349
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.055 ± 1.628
SerCys: 0.102 ± 0.07
SerAsp: 3.197 ± 0.501
SerGlu: 3.045 ± 0.544
SerPhe: 2.588 ± 0.365
SerGly: 5.228 ± 0.505
SerHis: 1.675 ± 0.58
SerIle: 3.096 ± 0.374
SerLys: 2.588 ± 0.458
SerLeu: 5.025 ± 0.463
SerMet: 1.523 ± 0.279
SerAsn: 3.248 ± 0.438
SerPro: 2.944 ± 0.517
SerGln: 2.994 ± 0.671
SerArg: 2.639 ± 0.322
SerSer: 4.365 ± 0.773
SerThr: 4.314 ± 0.696
SerVal: 3.299 ± 0.536
SerTrp: 0.457 ± 0.127
SerTyr: 1.421 ± 0.252
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.446 ± 0.91
ThrCys: 0.711 ± 0.215
ThrAsp: 3.705 ± 0.446
ThrGlu: 3.451 ± 0.34
ThrPhe: 2.741 ± 0.362
ThrGly: 5.329 ± 0.571
ThrHis: 1.117 ± 0.291
ThrIle: 3.299 ± 0.356
ThrLys: 2.436 ± 0.355
ThrLeu: 4.771 ± 0.55
ThrMet: 1.37 ± 0.226
ThrAsn: 2.487 ± 0.489
ThrPro: 3.553 ± 0.474
ThrGln: 2.284 ± 0.477
ThrArg: 2.69 ± 0.346
ThrSer: 4.365 ± 0.868
ThrThr: 4.263 ± 0.658
ThrVal: 4.619 ± 0.54
ThrTrp: 0.609 ± 0.169
ThrTyr: 2.182 ± 0.314
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.547 ± 0.791
ValCys: 0.609 ± 0.201
ValAsp: 4.517 ± 0.614
ValGlu: 4.111 ± 0.432
ValPhe: 2.284 ± 0.373
ValGly: 4.72 ± 0.442
ValHis: 1.117 ± 0.245
ValIle: 2.994 ± 0.424
ValLys: 3.502 ± 0.392
ValLeu: 4.416 ± 0.548
ValMet: 1.421 ± 0.266
ValAsn: 4.263 ± 0.453
ValPro: 2.994 ± 0.388
ValGln: 2.69 ± 0.369
ValArg: 3.807 ± 0.443
ValSer: 3.147 ± 0.332
ValThr: 4.771 ± 0.465
ValVal: 3.959 ± 0.561
ValTrp: 1.015 ± 0.235
ValTyr: 2.538 ± 0.419
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.914 ± 0.145
TrpCys: 0.355 ± 0.135
TrpAsp: 0.812 ± 0.216
TrpGlu: 0.914 ± 0.18
TrpPhe: 0.66 ± 0.224
TrpGly: 1.167 ± 0.246
TrpHis: 0.152 ± 0.09
TrpIle: 0.508 ± 0.149
TrpLys: 0.711 ± 0.167
TrpLeu: 1.066 ± 0.256
TrpMet: 0.609 ± 0.195
TrpAsn: 0.406 ± 0.15
TrpPro: 0.457 ± 0.197
TrpGln: 0.812 ± 0.211
TrpArg: 0.863 ± 0.21
TrpSer: 0.711 ± 0.198
TrpThr: 0.406 ± 0.162
TrpVal: 1.523 ± 0.269
TrpTrp: 0.508 ± 0.203
TrpTyr: 0.406 ± 0.146
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.842 ± 0.377
TyrCys: 0.406 ± 0.157
TyrAsp: 2.538 ± 0.409
TyrGlu: 2.233 ± 0.309
TyrPhe: 1.32 ± 0.265
TyrGly: 3.197 ± 0.372
TyrHis: 0.964 ± 0.199
TyrIle: 1.827 ± 0.319
TyrLys: 1.878 ± 0.334
TyrLeu: 2.791 ± 0.424
TyrMet: 0.863 ± 0.19
TyrAsn: 2.081 ± 0.308
TyrPro: 1.421 ± 0.323
TyrGln: 1.421 ± 0.255
TyrArg: 2.081 ± 0.364
TyrSer: 2.03 ± 0.302
TyrThr: 2.284 ± 0.327
TyrVal: 1.979 ± 0.323
TyrTrp: 0.508 ± 0.192
TyrTyr: 1.421 ± 0.314
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 92 proteins (19704 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
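A protein-level bootstrap of this kind could look roughly like the sketch below. It rests on several assumptions not confirmed by the page: that whole proteins are resampled with replacement, that "×100" means 100 resamples, that the reported error is the standard deviation of the resampled estimates, and that frequencies are normalized by the number of dipeptides in each resample. The function name and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille
    frequency over n_boot protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    # Count overlapping dipeptides once per protein.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, not real phage proteins.
toy_proteins = ["MAAK", "MAC", "MKK"]
mean_ma, err_ma = bootstrap_error(toy_proteins, "MA")
```

Resampling at the protein level rather than at the residue level keeps each protein's dipeptides together, so proteins with an unusual composition contribute to the spread as a unit.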

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski