Amino acid dipepetide frequency for Escherichia phage PA28

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.844AlaAla: 8.844 ± 0.78
0.995AlaCys: 0.995 ± 0.258
5.362AlaAsp: 5.362 ± 0.5
8.623AlaGlu: 8.623 ± 0.778
3.427AlaPhe: 3.427 ± 0.614
7.849AlaGly: 7.849 ± 1.082
1.603AlaHis: 1.603 ± 0.32
3.814AlaIle: 3.814 ± 0.458
4.698AlaLys: 4.698 ± 0.451
7.352AlaLeu: 7.352 ± 0.713
3.206AlaMet: 3.206 ± 0.435
2.653AlaAsn: 2.653 ± 0.38
3.206AlaPro: 3.206 ± 0.38
5.085AlaGln: 5.085 ± 0.728
6.523AlaArg: 6.523 ± 0.535
5.804AlaSer: 5.804 ± 0.558
5.804AlaThr: 5.804 ± 0.717
6.467AlaVal: 6.467 ± 0.561
1.99AlaTrp: 1.99 ± 0.303
2.543AlaTyr: 2.543 ± 0.378
0.0AlaXaa: 0.0 ± 0.0
Cys
1.382CysAla: 1.382 ± 0.345
0.387CysCys: 0.387 ± 0.154
0.442CysAsp: 0.442 ± 0.172
0.608CysGlu: 0.608 ± 0.215
0.332CysPhe: 0.332 ± 0.134
0.94CysGly: 0.94 ± 0.311
0.276CysHis: 0.276 ± 0.139
0.663CysIle: 0.663 ± 0.191
0.387CysLys: 0.387 ± 0.195
0.719CysLeu: 0.719 ± 0.25
0.111CysMet: 0.111 ± 0.077
0.276CysAsn: 0.276 ± 0.115
0.497CysPro: 0.497 ± 0.16
0.608CysGln: 0.608 ± 0.185
1.271CysArg: 1.271 ± 0.335
0.995CysSer: 0.995 ± 0.291
0.553CysThr: 0.553 ± 0.203
0.774CysVal: 0.774 ± 0.233
0.111CysTrp: 0.111 ± 0.089
0.442CysTyr: 0.442 ± 0.193
0.0CysXaa: 0.0 ± 0.0
Asp
5.749AspAla: 5.749 ± 0.67
0.829AspCys: 0.829 ± 0.254
3.648AspAsp: 3.648 ± 0.425
4.643AspGlu: 4.643 ± 0.542
1.658AspPhe: 1.658 ± 0.282
4.312AspGly: 4.312 ± 0.475
0.995AspHis: 0.995 ± 0.306
3.372AspIle: 3.372 ± 0.382
4.201AspLys: 4.201 ± 0.438
4.256AspLeu: 4.256 ± 0.479
1.935AspMet: 1.935 ± 0.273
2.709AspAsn: 2.709 ± 0.34
2.266AspPro: 2.266 ± 0.419
1.382AspGln: 1.382 ± 0.275
3.151AspArg: 3.151 ± 0.343
3.206AspSer: 3.206 ± 0.36
2.819AspThr: 2.819 ± 0.456
4.09AspVal: 4.09 ± 0.441
1.271AspTrp: 1.271 ± 0.288
1.548AspTyr: 1.548 ± 0.384
0.0AspXaa: 0.0 ± 0.0
Glu
6.578GluAla: 6.578 ± 0.579
0.94GluCys: 0.94 ± 0.279
2.266GluAsp: 2.266 ± 0.319
3.814GluGlu: 3.814 ± 0.465
2.432GluPhe: 2.432 ± 0.442
3.703GluGly: 3.703 ± 0.619
1.382GluHis: 1.382 ± 0.246
3.427GluIle: 3.427 ± 0.432
4.864GluLys: 4.864 ± 0.581
6.412GluLeu: 6.412 ± 0.649
2.266GluMet: 2.266 ± 0.383
3.151GluAsn: 3.151 ± 0.526
1.99GluPro: 1.99 ± 0.406
4.256GluGln: 4.256 ± 0.508
6.08GluArg: 6.08 ± 0.599
3.814GluSer: 3.814 ± 0.424
3.869GluThr: 3.869 ± 0.716
4.035GluVal: 4.035 ± 0.573
1.161GluTrp: 1.161 ± 0.245
1.935GluTyr: 1.935 ± 0.253
0.0GluXaa: 0.0 ± 0.0
Phe
3.095PheAla: 3.095 ± 0.46
0.332PheCys: 0.332 ± 0.133
1.548PheAsp: 1.548 ± 0.311
1.382PheGlu: 1.382 ± 0.339
0.719PhePhe: 0.719 ± 0.207
2.266PheGly: 2.266 ± 0.297
0.553PheHis: 0.553 ± 0.171
1.935PheIle: 1.935 ± 0.303
1.714PheLys: 1.714 ± 0.322
1.658PheLeu: 1.658 ± 0.318
0.94PheMet: 0.94 ± 0.232
1.437PheAsn: 1.437 ± 0.342
1.271PhePro: 1.271 ± 0.275
0.774PheGln: 0.774 ± 0.196
2.377PheArg: 2.377 ± 0.371
2.432PheSer: 2.432 ± 0.451
2.377PheThr: 2.377 ± 0.313
2.377PheVal: 2.377 ± 0.34
0.774PheTrp: 0.774 ± 0.234
1.106PheTyr: 1.106 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
5.693GlyAla: 5.693 ± 0.994
1.05GlyCys: 1.05 ± 0.288
5.085GlyAsp: 5.085 ± 0.83
5.804GlyGlu: 5.804 ± 1.59
2.487GlyPhe: 2.487 ± 0.377
4.975GlyGly: 4.975 ± 0.657
1.106GlyHis: 1.106 ± 0.224
4.256GlyIle: 4.256 ± 0.637
4.975GlyLys: 4.975 ± 0.918
4.975GlyLeu: 4.975 ± 0.452
2.377GlyMet: 2.377 ± 0.302
2.819GlyAsn: 2.819 ± 0.367
3.98GlyPro: 3.98 ± 2.422
2.598GlyGln: 2.598 ± 0.415
3.814GlyArg: 3.814 ± 0.591
4.09GlySer: 4.09 ± 0.465
3.151GlyThr: 3.151 ± 0.499
5.251GlyVal: 5.251 ± 0.426
1.216GlyTrp: 1.216 ± 0.256
2.432GlyTyr: 2.432 ± 0.376
0.0GlyXaa: 0.0 ± 0.0
His
1.714HisAla: 1.714 ± 0.224
0.221HisCys: 0.221 ± 0.104
1.05HisAsp: 1.05 ± 0.197
0.774HisGlu: 0.774 ± 0.218
0.608HisPhe: 0.608 ± 0.195
1.327HisGly: 1.327 ± 0.332
0.553HisHis: 0.553 ± 0.201
0.774HisIle: 0.774 ± 0.209
0.94HisLys: 0.94 ± 0.258
1.382HisLeu: 1.382 ± 0.382
0.332HisMet: 0.332 ± 0.13
0.719HisAsn: 0.719 ± 0.176
0.884HisPro: 0.884 ± 0.249
0.719HisGln: 0.719 ± 0.195
0.995HisArg: 0.995 ± 0.262
1.05HisSer: 1.05 ± 0.233
0.995HisThr: 0.995 ± 0.203
0.829HisVal: 0.829 ± 0.217
0.442HisTrp: 0.442 ± 0.195
0.774HisTyr: 0.774 ± 0.174
0.0HisXaa: 0.0 ± 0.0
Ile
5.141IleAla: 5.141 ± 0.56
1.106IleCys: 1.106 ± 0.273
3.98IleAsp: 3.98 ± 0.431
2.653IleGlu: 2.653 ± 0.397
0.995IlePhe: 0.995 ± 0.22
2.709IleGly: 2.709 ± 0.437
0.94IleHis: 0.94 ± 0.199
2.377IleIle: 2.377 ± 0.348
2.764IleLys: 2.764 ± 0.412
2.819IleLeu: 2.819 ± 0.497
0.995IleMet: 0.995 ± 0.223
3.206IleAsn: 3.206 ± 0.499
2.322IlePro: 2.322 ± 0.375
1.935IleGln: 1.935 ± 0.28
4.09IleArg: 4.09 ± 0.364
4.09IleSer: 4.09 ± 0.633
3.261IleThr: 3.261 ± 0.483
2.377IleVal: 2.377 ± 0.417
0.608IleTrp: 0.608 ± 0.241
1.271IleTyr: 1.271 ± 0.302
0.0IleXaa: 0.0 ± 0.0
Lys
6.412LysAla: 6.412 ± 0.626
0.442LysCys: 0.442 ± 0.163
2.985LysAsp: 2.985 ± 0.406
4.312LysGlu: 4.312 ± 0.536
1.05LysPhe: 1.05 ± 0.259
5.417LysGly: 5.417 ± 1.215
0.774LysHis: 0.774 ± 0.168
3.593LysIle: 3.593 ± 0.468
3.869LysLys: 3.869 ± 0.546
5.528LysLeu: 5.528 ± 0.573
2.156LysMet: 2.156 ± 0.374
3.593LysAsn: 3.593 ± 0.394
2.764LysPro: 2.764 ± 0.522
2.598LysGln: 2.598 ± 0.408
2.985LysArg: 2.985 ± 0.403
2.764LysSer: 2.764 ± 0.383
3.482LysThr: 3.482 ± 0.534
2.819LysVal: 2.819 ± 0.434
0.884LysTrp: 0.884 ± 0.249
1.548LysTyr: 1.548 ± 0.271
0.0LysXaa: 0.0 ± 0.0
Leu
8.899LeuAla: 8.899 ± 0.737
1.05LeuCys: 1.05 ± 0.281
3.593LeuAsp: 3.593 ± 0.48
4.312LeuGlu: 4.312 ± 0.444
2.653LeuPhe: 2.653 ± 0.417
4.533LeuGly: 4.533 ± 0.389
1.327LeuHis: 1.327 ± 0.237
3.759LeuIle: 3.759 ± 0.592
4.809LeuLys: 4.809 ± 0.499
6.467LeuLeu: 6.467 ± 0.643
2.211LeuMet: 2.211 ± 0.348
4.035LeuAsn: 4.035 ± 0.484
3.538LeuPro: 3.538 ± 0.499
2.819LeuGln: 2.819 ± 0.512
5.528LeuArg: 5.528 ± 0.589
5.362LeuSer: 5.362 ± 0.549
4.864LeuThr: 4.864 ± 0.429
4.643LeuVal: 4.643 ± 0.443
0.995LeuTrp: 0.995 ± 0.308
2.543LeuTyr: 2.543 ± 0.42
0.0LeuXaa: 0.0 ± 0.0
Met
3.206MetAla: 3.206 ± 0.363
0.055MetCys: 0.055 ± 0.063
1.382MetAsp: 1.382 ± 0.28
1.492MetGlu: 1.492 ± 0.241
0.719MetPhe: 0.719 ± 0.19
1.658MetGly: 1.658 ± 0.355
0.387MetHis: 0.387 ± 0.117
0.829MetIle: 0.829 ± 0.197
2.045MetLys: 2.045 ± 0.282
1.99MetLeu: 1.99 ± 0.313
0.774MetMet: 0.774 ± 0.191
1.548MetAsn: 1.548 ± 0.314
1.603MetPro: 1.603 ± 0.347
1.382MetGln: 1.382 ± 0.267
1.714MetArg: 1.714 ± 0.284
2.1MetSer: 2.1 ± 0.343
2.487MetThr: 2.487 ± 0.301
1.769MetVal: 1.769 ± 0.328
0.276MetTrp: 0.276 ± 0.127
0.553MetTyr: 0.553 ± 0.198
0.0MetXaa: 0.0 ± 0.0
Asn
4.809AsnAla: 4.809 ± 0.615
0.442AsnCys: 0.442 ± 0.163
2.377AsnAsp: 2.377 ± 0.401
2.709AsnGlu: 2.709 ± 0.448
1.271AsnPhe: 1.271 ± 0.321
3.482AsnGly: 3.482 ± 0.397
1.437AsnHis: 1.437 ± 0.318
2.709AsnIle: 2.709 ± 0.355
2.377AsnLys: 2.377 ± 0.362
3.04AsnLeu: 3.04 ± 0.402
1.327AsnMet: 1.327 ± 0.233
1.935AsnAsn: 1.935 ± 0.365
1.603AsnPro: 1.603 ± 0.286
1.935AsnGln: 1.935 ± 0.308
3.372AsnArg: 3.372 ± 0.539
2.377AsnSer: 2.377 ± 0.376
2.211AsnThr: 2.211 ± 0.359
2.045AsnVal: 2.045 ± 0.403
0.553AsnTrp: 0.553 ± 0.166
1.05AsnTyr: 1.05 ± 0.244
0.0AsnXaa: 0.0 ± 0.0
Pro
3.427ProAla: 3.427 ± 0.613
0.276ProCys: 0.276 ± 0.125
4.201ProAsp: 4.201 ± 0.524
5.251ProGlu: 5.251 ± 0.848
1.382ProPhe: 1.382 ± 0.257
2.985ProGly: 2.985 ± 0.614
0.442ProHis: 0.442 ± 0.157
0.94ProIle: 0.94 ± 0.276
2.764ProLys: 2.764 ± 0.785
2.598ProLeu: 2.598 ± 0.404
0.884ProMet: 0.884 ± 0.209
1.05ProAsn: 1.05 ± 0.221
1.327ProPro: 1.327 ± 0.245
2.156ProGln: 2.156 ± 0.631
1.99ProArg: 1.99 ± 0.399
2.598ProSer: 2.598 ± 0.393
1.769ProThr: 1.769 ± 0.312
4.754ProVal: 4.754 ± 0.553
0.663ProTrp: 0.663 ± 0.21
1.437ProTyr: 1.437 ± 0.299
0.0ProXaa: 0.0 ± 0.0
Gln
4.533GlnAla: 4.533 ± 0.526
0.774GlnCys: 0.774 ± 0.241
2.377GlnAsp: 2.377 ± 0.339
2.764GlnGlu: 2.764 ± 0.385
1.658GlnPhe: 1.658 ± 0.29
3.261GlnGly: 3.261 ± 0.748
0.719GlnHis: 0.719 ± 0.191
1.769GlnIle: 1.769 ± 0.406
3.04GlnLys: 3.04 ± 0.514
3.427GlnLeu: 3.427 ± 0.421
0.829GlnMet: 0.829 ± 0.219
1.714GlnAsn: 1.714 ± 0.344
2.156GlnPro: 2.156 ± 0.452
3.814GlnGln: 3.814 ± 0.796
3.427GlnArg: 3.427 ± 0.645
2.377GlnSer: 2.377 ± 0.386
2.045GlnThr: 2.045 ± 0.388
2.432GlnVal: 2.432 ± 0.429
0.884GlnTrp: 0.884 ± 0.207
1.769GlnTyr: 1.769 ± 0.29
0.0GlnXaa: 0.0 ± 0.0
Arg
4.477ArgAla: 4.477 ± 0.491
0.387ArgCys: 0.387 ± 0.136
4.256ArgAsp: 4.256 ± 0.694
5.472ArgGlu: 5.472 ± 0.739
2.156ArgPhe: 2.156 ± 0.371
4.477ArgGly: 4.477 ± 0.989
1.437ArgHis: 1.437 ± 0.251
3.593ArgIle: 3.593 ± 0.55
4.588ArgLys: 4.588 ± 0.528
5.638ArgLeu: 5.638 ± 0.442
1.603ArgMet: 1.603 ± 0.322
2.874ArgAsn: 2.874 ± 0.41
2.156ArgPro: 2.156 ± 0.402
3.538ArgGln: 3.538 ± 0.409
5.859ArgArg: 5.859 ± 0.656
4.201ArgSer: 4.201 ± 0.481
3.427ArgThr: 3.427 ± 0.563
4.809ArgVal: 4.809 ± 0.617
1.327ArgTrp: 1.327 ± 0.207
2.266ArgTyr: 2.266 ± 0.404
0.0ArgXaa: 0.0 ± 0.0
Ser
6.025SerAla: 6.025 ± 0.585
0.608SerCys: 0.608 ± 0.209
3.593SerAsp: 3.593 ± 0.475
4.09SerGlu: 4.09 ± 0.374
1.658SerPhe: 1.658 ± 0.302
5.583SerGly: 5.583 ± 0.584
0.719SerHis: 0.719 ± 0.169
2.764SerIle: 2.764 ± 0.353
2.764SerLys: 2.764 ± 0.528
5.859SerLeu: 5.859 ± 0.645
1.658SerMet: 1.658 ± 0.373
2.322SerAsn: 2.322 ± 0.421
3.261SerPro: 3.261 ± 0.456
3.372SerGln: 3.372 ± 0.437
3.869SerArg: 3.869 ± 0.523
3.04SerSer: 3.04 ± 0.713
3.538SerThr: 3.538 ± 0.44
3.869SerVal: 3.869 ± 0.475
1.05SerTrp: 1.05 ± 0.222
1.879SerTyr: 1.879 ± 0.395
0.0SerXaa: 0.0 ± 0.0
Thr
5.251ThrAla: 5.251 ± 0.554
0.442ThrCys: 0.442 ± 0.163
3.759ThrAsp: 3.759 ± 0.477
3.703ThrGlu: 3.703 ± 0.353
1.879ThrPhe: 1.879 ± 0.371
5.638ThrGly: 5.638 ± 0.834
0.94ThrHis: 0.94 ± 0.243
3.261ThrIle: 3.261 ± 0.408
2.598ThrLys: 2.598 ± 0.506
5.085ThrLeu: 5.085 ± 0.458
0.829ThrMet: 0.829 ± 0.199
1.714ThrAsn: 1.714 ± 0.231
3.372ThrPro: 3.372 ± 0.377
1.769ThrGln: 1.769 ± 0.411
2.653ThrArg: 2.653 ± 0.308
3.814ThrSer: 3.814 ± 0.5
3.261ThrThr: 3.261 ± 0.577
3.869ThrVal: 3.869 ± 0.494
0.884ThrTrp: 0.884 ± 0.213
1.327ThrTyr: 1.327 ± 0.305
0.0ThrXaa: 0.0 ± 0.0
Val
6.578ValAla: 6.578 ± 0.601
0.829ValCys: 0.829 ± 0.231
3.427ValAsp: 3.427 ± 0.315
3.869ValGlu: 3.869 ± 0.347
1.99ValPhe: 1.99 ± 0.312
3.593ValGly: 3.593 ± 0.511
0.608ValHis: 0.608 ± 0.177
3.372ValIle: 3.372 ± 0.527
3.869ValLys: 3.869 ± 0.506
5.472ValLeu: 5.472 ± 0.58
1.935ValMet: 1.935 ± 0.331
3.261ValAsn: 3.261 ± 0.382
2.819ValPro: 2.819 ± 0.339
2.487ValGln: 2.487 ± 0.456
4.864ValArg: 4.864 ± 0.876
4.809ValSer: 4.809 ± 0.603
3.814ValThr: 3.814 ± 0.446
3.869ValVal: 3.869 ± 0.402
1.106ValTrp: 1.106 ± 0.25
1.99ValTyr: 1.99 ± 0.376
0.0ValXaa: 0.0 ± 0.0
Trp
1.216TrpAla: 1.216 ± 0.302
0.276TrpCys: 0.276 ± 0.12
0.608TrpAsp: 0.608 ± 0.23
0.608TrpGlu: 0.608 ± 0.159
0.774TrpPhe: 0.774 ± 0.165
0.719TrpGly: 0.719 ± 0.223
0.387TrpHis: 0.387 ± 0.139
0.829TrpIle: 0.829 ± 0.224
1.382TrpLys: 1.382 ± 0.227
1.769TrpLeu: 1.769 ± 0.384
0.663TrpMet: 0.663 ± 0.19
0.774TrpAsn: 0.774 ± 0.233
0.608TrpPro: 0.608 ± 0.242
1.216TrpGln: 1.216 ± 0.223
1.603TrpArg: 1.603 ± 0.31
0.995TrpSer: 0.995 ± 0.227
0.608TrpThr: 0.608 ± 0.189
1.382TrpVal: 1.382 ± 0.268
0.497TrpTrp: 0.497 ± 0.192
0.553TrpTyr: 0.553 ± 0.154
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.819TyrAla: 2.819 ± 0.329
0.332TyrCys: 0.332 ± 0.142
2.1TyrAsp: 2.1 ± 0.375
1.492TyrGlu: 1.492 ± 0.312
1.271TyrPhe: 1.271 ± 0.249
2.709TyrGly: 2.709 ± 0.477
0.497TyrHis: 0.497 ± 0.183
1.603TyrIle: 1.603 ± 0.278
1.327TyrLys: 1.327 ± 0.329
1.492TyrLeu: 1.492 ± 0.293
0.884TyrMet: 0.884 ± 0.223
1.271TyrAsn: 1.271 ± 0.25
1.271TyrPro: 1.271 ± 0.219
1.327TyrGln: 1.327 ± 0.253
2.377TyrArg: 2.377 ± 0.447
1.603TyrSer: 1.603 ± 0.267
1.714TyrThr: 1.714 ± 0.49
2.1TyrVal: 2.1 ± 0.321
0.774TyrTrp: 0.774 ± 0.185
1.327TyrTyr: 1.327 ± 0.222
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 90 proteins (18092 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski