Amino acid dipepetide frequency for Escherichia phage LL5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.446AlaAla: 7.446 ± 0.843
0.784AlaCys: 0.784 ± 0.277
4.833AlaAsp: 4.833 ± 0.56
5.617AlaGlu: 5.617 ± 0.719
3.331AlaPhe: 3.331 ± 0.445
5.421AlaGly: 5.421 ± 0.735
0.784AlaHis: 0.784 ± 0.225
4.964AlaIle: 4.964 ± 0.588
6.923AlaLys: 6.923 ± 1.177
7.184AlaLeu: 7.184 ± 0.743
2.678AlaMet: 2.678 ± 0.385
3.396AlaAsn: 3.396 ± 0.549
1.763AlaPro: 1.763 ± 0.365
4.441AlaGln: 4.441 ± 0.576
4.245AlaArg: 4.245 ± 0.544
4.115AlaSer: 4.115 ± 0.454
3.984AlaThr: 3.984 ± 0.615
5.552AlaVal: 5.552 ± 0.746
0.914AlaTrp: 0.914 ± 0.229
3.004AlaTyr: 3.004 ± 0.508
0.0AlaXaa: 0.0 ± 0.0
Cys
0.849CysAla: 0.849 ± 0.282
0.196CysCys: 0.196 ± 0.109
1.241CysAsp: 1.241 ± 0.219
0.653CysGlu: 0.653 ± 0.207
0.392CysPhe: 0.392 ± 0.16
1.372CysGly: 1.372 ± 0.336
0.327CysHis: 0.327 ± 0.125
0.784CysIle: 0.784 ± 0.261
1.045CysLys: 1.045 ± 0.285
1.11CysLeu: 1.11 ± 0.336
0.523CysMet: 0.523 ± 0.189
0.588CysAsn: 0.588 ± 0.2
0.327CysPro: 0.327 ± 0.131
0.131CysGln: 0.131 ± 0.1
0.718CysArg: 0.718 ± 0.292
0.523CysSer: 0.523 ± 0.157
0.718CysThr: 0.718 ± 0.239
0.98CysVal: 0.98 ± 0.244
0.653CysTrp: 0.653 ± 0.214
0.523CysTyr: 0.523 ± 0.198
0.0CysXaa: 0.0 ± 0.0
Asp
4.703AspAla: 4.703 ± 0.591
0.718AspCys: 0.718 ± 0.236
3.723AspAsp: 3.723 ± 0.529
4.507AspGlu: 4.507 ± 0.426
2.482AspPhe: 2.482 ± 0.366
5.748AspGly: 5.748 ± 0.812
1.045AspHis: 1.045 ± 0.251
3.462AspIle: 3.462 ± 0.453
4.898AspLys: 4.898 ± 0.483
5.16AspLeu: 5.16 ± 0.496
1.502AspMet: 1.502 ± 0.291
3.527AspAsn: 3.527 ± 0.446
2.613AspPro: 2.613 ± 0.407
1.568AspGln: 1.568 ± 0.321
2.743AspArg: 2.743 ± 0.396
3.266AspSer: 3.266 ± 0.474
3.462AspThr: 3.462 ± 0.371
3.723AspVal: 3.723 ± 0.529
0.98AspTrp: 0.98 ± 0.194
3.462AspTyr: 3.462 ± 0.474
0.0AspXaa: 0.0 ± 0.0
Glu
5.094GluAla: 5.094 ± 0.537
1.176GluCys: 1.176 ± 0.286
3.135GluAsp: 3.135 ± 0.457
4.768GluGlu: 4.768 ± 0.782
3.919GluPhe: 3.919 ± 0.505
3.2GluGly: 3.2 ± 0.431
1.633GluHis: 1.633 ± 0.357
5.094GluIle: 5.094 ± 0.618
4.833GluLys: 4.833 ± 0.571
5.094GluLeu: 5.094 ± 0.532
2.678GluMet: 2.678 ± 0.36
3.462GluAsn: 3.462 ± 0.454
1.894GluPro: 1.894 ± 0.401
3.919GluGln: 3.919 ± 0.7
3.396GluArg: 3.396 ± 0.491
4.311GluSer: 4.311 ± 0.498
3.788GluThr: 3.788 ± 0.541
4.245GluVal: 4.245 ± 0.692
0.653GluTrp: 0.653 ± 0.217
3.266GluTyr: 3.266 ± 0.46
0.0GluXaa: 0.0 ± 0.0
Phe
2.808PheAla: 2.808 ± 0.424
0.653PheCys: 0.653 ± 0.224
2.743PheAsp: 2.743 ± 0.53
2.482PheGlu: 2.482 ± 0.438
1.372PhePhe: 1.372 ± 0.346
3.07PheGly: 3.07 ± 0.601
0.718PheHis: 0.718 ± 0.195
2.417PheIle: 2.417 ± 0.366
3.135PheLys: 3.135 ± 0.556
2.286PheLeu: 2.286 ± 0.303
1.241PheMet: 1.241 ± 0.384
2.613PheAsn: 2.613 ± 0.293
1.698PhePro: 1.698 ± 0.316
1.176PheGln: 1.176 ± 0.3
1.437PheArg: 1.437 ± 0.271
2.351PheSer: 2.351 ± 0.339
3.004PheThr: 3.004 ± 0.482
2.613PheVal: 2.613 ± 0.34
0.523PheTrp: 0.523 ± 0.182
1.633PheTyr: 1.633 ± 0.395
0.0PheXaa: 0.0 ± 0.0
Gly
4.507GlyAla: 4.507 ± 0.636
1.176GlyCys: 1.176 ± 0.308
4.572GlyAsp: 4.572 ± 0.654
4.898GlyGlu: 4.898 ± 0.614
2.939GlyPhe: 2.939 ± 0.432
6.009GlyGly: 6.009 ± 0.802
1.11GlyHis: 1.11 ± 0.284
4.115GlyIle: 4.115 ± 0.463
5.617GlyLys: 5.617 ± 0.589
3.788GlyLeu: 3.788 ± 0.512
2.613GlyMet: 2.613 ± 0.35
3.853GlyAsn: 3.853 ± 0.45
0.131GlyPro: 0.131 ± 0.125
1.763GlyGln: 1.763 ± 0.437
3.331GlyArg: 3.331 ± 0.406
5.29GlySer: 5.29 ± 0.636
3.853GlyThr: 3.853 ± 0.82
5.486GlyVal: 5.486 ± 0.562
1.502GlyTrp: 1.502 ± 0.265
3.658GlyTyr: 3.658 ± 0.44
0.0GlyXaa: 0.0 ± 0.0
His
1.045HisAla: 1.045 ± 0.293
0.196HisCys: 0.196 ± 0.119
1.372HisAsp: 1.372 ± 0.369
0.98HisGlu: 0.98 ± 0.278
0.914HisPhe: 0.914 ± 0.241
1.763HisGly: 1.763 ± 0.362
0.523HisHis: 0.523 ± 0.24
0.588HisIle: 0.588 ± 0.195
1.437HisLys: 1.437 ± 0.328
1.502HisLeu: 1.502 ± 0.367
0.392HisMet: 0.392 ± 0.17
0.523HisAsn: 0.523 ± 0.192
0.784HisPro: 0.784 ± 0.27
0.718HisGln: 0.718 ± 0.233
0.914HisArg: 0.914 ± 0.28
1.241HisSer: 1.241 ± 0.311
1.045HisThr: 1.045 ± 0.285
1.437HisVal: 1.437 ± 0.281
0.131HisTrp: 0.131 ± 0.088
0.718HisTyr: 0.718 ± 0.226
0.0HisXaa: 0.0 ± 0.0
Ile
6.205IleAla: 6.205 ± 0.761
1.045IleCys: 1.045 ± 0.3
5.682IleAsp: 5.682 ± 0.655
4.245IleGlu: 4.245 ± 0.494
2.482IlePhe: 2.482 ± 0.33
3.853IleGly: 3.853 ± 0.433
1.241IleHis: 1.241 ± 0.284
4.245IleIle: 4.245 ± 0.563
5.943IleLys: 5.943 ± 0.54
3.723IleLeu: 3.723 ± 0.461
2.417IleMet: 2.417 ± 0.428
3.462IleAsn: 3.462 ± 0.446
2.221IlePro: 2.221 ± 0.328
2.09IleGln: 2.09 ± 0.386
3.004IleArg: 3.004 ± 0.425
4.376IleSer: 4.376 ± 0.486
4.507IleThr: 4.507 ± 0.533
4.115IleVal: 4.115 ± 0.359
0.98IleTrp: 0.98 ± 0.22
1.894IleTyr: 1.894 ± 0.421
0.0IleXaa: 0.0 ± 0.0
Lys
7.38LysAla: 7.38 ± 0.861
0.784LysCys: 0.784 ± 0.239
4.898LysAsp: 4.898 ± 0.544
6.858LysGlu: 6.858 ± 0.819
2.155LysPhe: 2.155 ± 0.344
3.331LysGly: 3.331 ± 0.538
1.437LysHis: 1.437 ± 0.328
4.572LysIle: 4.572 ± 0.423
4.441LysLys: 4.441 ± 0.651
5.421LysLeu: 5.421 ± 0.581
3.462LysMet: 3.462 ± 0.507
4.311LysAsn: 4.311 ± 0.448
3.266LysPro: 3.266 ± 0.499
2.417LysGln: 2.417 ± 0.406
3.723LysArg: 3.723 ± 0.536
3.331LysSer: 3.331 ± 0.562
4.637LysThr: 4.637 ± 0.574
5.094LysVal: 5.094 ± 0.537
1.176LysTrp: 1.176 ± 0.259
2.417LysTyr: 2.417 ± 0.381
0.0LysXaa: 0.0 ± 0.0
Leu
7.446LeuAla: 7.446 ± 1.189
0.784LeuCys: 0.784 ± 0.245
3.853LeuAsp: 3.853 ± 0.492
3.984LeuGlu: 3.984 ± 0.572
2.221LeuPhe: 2.221 ± 0.324
3.266LeuGly: 3.266 ± 0.513
1.372LeuHis: 1.372 ± 0.349
4.703LeuIle: 4.703 ± 0.512
5.16LeuLys: 5.16 ± 0.544
4.311LeuLeu: 4.311 ± 0.376
2.417LeuMet: 2.417 ± 0.457
4.115LeuAsn: 4.115 ± 0.549
3.07LeuPro: 3.07 ± 0.51
1.763LeuGln: 1.763 ± 0.372
3.396LeuArg: 3.396 ± 0.464
4.115LeuSer: 4.115 ± 0.441
5.16LeuThr: 5.16 ± 0.639
3.984LeuVal: 3.984 ± 0.531
1.045LeuTrp: 1.045 ± 0.25
2.155LeuTyr: 2.155 ± 0.33
0.0LeuXaa: 0.0 ± 0.0
Met
3.2MetAla: 3.2 ± 0.476
0.523MetCys: 0.523 ± 0.165
1.437MetAsp: 1.437 ± 0.288
1.502MetGlu: 1.502 ± 0.347
1.306MetPhe: 1.306 ± 0.328
1.306MetGly: 1.306 ± 0.278
0.849MetHis: 0.849 ± 0.211
2.743MetIle: 2.743 ± 0.44
2.351MetLys: 2.351 ± 0.495
2.613MetLeu: 2.613 ± 0.39
1.11MetMet: 1.11 ± 0.292
1.698MetAsn: 1.698 ± 0.289
0.914MetPro: 0.914 ± 0.218
1.829MetGln: 1.829 ± 0.292
1.829MetArg: 1.829 ± 0.35
1.633MetSer: 1.633 ± 0.376
1.633MetThr: 1.633 ± 0.351
2.09MetVal: 2.09 ± 0.357
0.588MetTrp: 0.588 ± 0.176
0.98MetTyr: 0.98 ± 0.242
0.0MetXaa: 0.0 ± 0.0
Asn
4.311AsnAla: 4.311 ± 0.522
0.653AsnCys: 0.653 ± 0.195
3.331AsnAsp: 3.331 ± 0.504
3.462AsnGlu: 3.462 ± 0.465
1.763AsnPhe: 1.763 ± 0.315
5.552AsnGly: 5.552 ± 0.804
0.849AsnHis: 0.849 ± 0.26
3.266AsnIle: 3.266 ± 0.404
3.658AsnLys: 3.658 ± 0.522
2.743AsnLeu: 2.743 ± 0.436
0.98AsnMet: 0.98 ± 0.232
2.482AsnAsn: 2.482 ± 0.369
1.763AsnPro: 1.763 ± 0.32
2.025AsnGln: 2.025 ± 0.35
2.351AsnArg: 2.351 ± 0.368
3.2AsnSer: 3.2 ± 0.392
1.829AsnThr: 1.829 ± 0.337
3.788AsnVal: 3.788 ± 0.411
0.457AsnTrp: 0.457 ± 0.142
1.568AsnTyr: 1.568 ± 0.313
0.0AsnXaa: 0.0 ± 0.0
Pro
2.155ProAla: 2.155 ± 0.403
0.392ProCys: 0.392 ± 0.178
2.613ProAsp: 2.613 ± 0.413
3.266ProGlu: 3.266 ± 0.587
1.568ProPhe: 1.568 ± 0.317
2.547ProGly: 2.547 ± 0.434
0.523ProHis: 0.523 ± 0.174
1.959ProIle: 1.959 ± 0.415
1.763ProLys: 1.763 ± 0.274
1.633ProLeu: 1.633 ± 0.298
0.914ProMet: 0.914 ± 0.286
1.829ProAsn: 1.829 ± 0.305
1.372ProPro: 1.372 ± 0.366
1.176ProGln: 1.176 ± 0.294
1.241ProArg: 1.241 ± 0.329
1.437ProSer: 1.437 ± 0.269
1.372ProThr: 1.372 ± 0.242
3.004ProVal: 3.004 ± 0.389
0.261ProTrp: 0.261 ± 0.111
1.241ProTyr: 1.241 ± 0.223
0.0ProXaa: 0.0 ± 0.0
Gln
2.547GlnAla: 2.547 ± 0.442
0.457GlnCys: 0.457 ± 0.162
2.221GlnAsp: 2.221 ± 0.384
2.613GlnGlu: 2.613 ± 0.379
1.241GlnPhe: 1.241 ± 0.325
1.437GlnGly: 1.437 ± 0.332
0.457GlnHis: 0.457 ± 0.175
3.396GlnIle: 3.396 ± 0.476
3.07GlnLys: 3.07 ± 0.5
3.853GlnLeu: 3.853 ± 0.659
1.11GlnMet: 1.11 ± 0.254
1.437GlnAsn: 1.437 ± 0.311
1.11GlnPro: 1.11 ± 0.261
2.743GlnGln: 2.743 ± 0.771
2.221GlnArg: 2.221 ± 0.485
1.959GlnSer: 1.959 ± 0.418
1.698GlnThr: 1.698 ± 0.361
2.417GlnVal: 2.417 ± 0.411
0.523GlnTrp: 0.523 ± 0.148
1.372GlnTyr: 1.372 ± 0.298
0.0GlnXaa: 0.0 ± 0.0
Arg
3.984ArgAla: 3.984 ± 0.533
0.914ArgCys: 0.914 ± 0.325
2.808ArgAsp: 2.808 ± 0.368
3.919ArgGlu: 3.919 ± 0.532
2.155ArgPhe: 2.155 ± 0.376
2.874ArgGly: 2.874 ± 0.452
0.784ArgHis: 0.784 ± 0.229
4.245ArgIle: 4.245 ± 0.501
3.788ArgLys: 3.788 ± 0.584
2.547ArgLeu: 2.547 ± 0.5
1.829ArgMet: 1.829 ± 0.307
1.633ArgAsn: 1.633 ± 0.358
1.568ArgPro: 1.568 ± 0.344
1.502ArgGln: 1.502 ± 0.344
2.808ArgArg: 2.808 ± 0.555
2.417ArgSer: 2.417 ± 0.36
1.306ArgThr: 1.306 ± 0.287
4.376ArgVal: 4.376 ± 0.51
0.457ArgTrp: 0.457 ± 0.15
2.155ArgTyr: 2.155 ± 0.39
0.0ArgXaa: 0.0 ± 0.0
Ser
4.768SerAla: 4.768 ± 0.587
0.653SerCys: 0.653 ± 0.224
4.18SerAsp: 4.18 ± 0.678
4.049SerGlu: 4.049 ± 0.528
2.025SerPhe: 2.025 ± 0.298
5.617SerGly: 5.617 ± 0.569
0.98SerHis: 0.98 ± 0.322
4.311SerIle: 4.311 ± 0.676
3.788SerLys: 3.788 ± 0.451
3.396SerLeu: 3.396 ± 0.501
1.698SerMet: 1.698 ± 0.41
2.025SerAsn: 2.025 ± 0.364
1.829SerPro: 1.829 ± 0.426
2.221SerGln: 2.221 ± 0.561
2.417SerArg: 2.417 ± 0.439
2.547SerSer: 2.547 ± 0.507
2.678SerThr: 2.678 ± 0.452
4.18SerVal: 4.18 ± 0.475
0.914SerTrp: 0.914 ± 0.248
2.482SerTyr: 2.482 ± 0.434
0.0SerXaa: 0.0 ± 0.0
Thr
4.18ThrAla: 4.18 ± 0.641
0.523ThrCys: 0.523 ± 0.224
2.939ThrAsp: 2.939 ± 0.508
2.939ThrGlu: 2.939 ± 0.529
3.331ThrPhe: 3.331 ± 0.412
5.29ThrGly: 5.29 ± 0.509
0.914ThrHis: 0.914 ± 0.227
3.658ThrIle: 3.658 ± 0.355
3.723ThrLys: 3.723 ± 0.68
3.658ThrLeu: 3.658 ± 0.436
1.306ThrMet: 1.306 ± 0.262
2.417ThrAsn: 2.417 ± 0.395
2.547ThrPro: 2.547 ± 0.388
2.155ThrGln: 2.155 ± 0.43
1.763ThrArg: 1.763 ± 0.302
3.396ThrSer: 3.396 ± 0.554
2.351ThrThr: 2.351 ± 0.453
4.115ThrVal: 4.115 ± 0.492
0.784ThrTrp: 0.784 ± 0.2
1.698ThrTyr: 1.698 ± 0.345
0.0ThrXaa: 0.0 ± 0.0
Val
4.703ValAla: 4.703 ± 0.609
0.98ValCys: 0.98 ± 0.262
3.919ValAsp: 3.919 ± 0.459
5.748ValGlu: 5.748 ± 0.589
2.155ValPhe: 2.155 ± 0.475
4.507ValGly: 4.507 ± 0.426
1.306ValHis: 1.306 ± 0.295
5.617ValIle: 5.617 ± 0.572
5.813ValLys: 5.813 ± 0.726
4.507ValLeu: 4.507 ± 0.456
2.221ValMet: 2.221 ± 0.391
4.18ValAsn: 4.18 ± 0.521
1.829ValPro: 1.829 ± 0.351
1.959ValGln: 1.959 ± 0.297
3.462ValArg: 3.462 ± 0.532
3.984ValSer: 3.984 ± 0.629
4.049ValThr: 4.049 ± 0.53
4.18ValVal: 4.18 ± 0.577
1.11ValTrp: 1.11 ± 0.22
2.547ValTyr: 2.547 ± 0.352
0.0ValXaa: 0.0 ± 0.0
Trp
0.784TrpAla: 0.784 ± 0.224
0.457TrpCys: 0.457 ± 0.178
0.849TrpAsp: 0.849 ± 0.194
0.784TrpGlu: 0.784 ± 0.261
0.653TrpPhe: 0.653 ± 0.227
1.045TrpGly: 1.045 ± 0.246
0.392TrpHis: 0.392 ± 0.153
0.914TrpIle: 0.914 ± 0.25
0.849TrpLys: 0.849 ± 0.205
1.502TrpLeu: 1.502 ± 0.288
0.261TrpMet: 0.261 ± 0.125
0.849TrpAsn: 0.849 ± 0.17
0.261TrpPro: 0.261 ± 0.135
0.392TrpGln: 0.392 ± 0.141
0.98TrpArg: 0.98 ± 0.271
0.914TrpSer: 0.914 ± 0.31
0.849TrpThr: 0.849 ± 0.225
0.914TrpVal: 0.914 ± 0.204
0.131TrpTrp: 0.131 ± 0.099
0.392TrpTyr: 0.392 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.266TyrAla: 3.266 ± 0.477
0.588TyrCys: 0.588 ± 0.182
2.939TyrAsp: 2.939 ± 0.363
2.286TyrGlu: 2.286 ± 0.412
1.568TyrPhe: 1.568 ± 0.339
3.004TyrGly: 3.004 ± 0.404
0.98TyrHis: 0.98 ± 0.244
2.678TyrIle: 2.678 ± 0.438
2.874TyrLys: 2.874 ± 0.478
2.286TyrLeu: 2.286 ± 0.331
0.718TyrMet: 0.718 ± 0.224
1.633TyrAsn: 1.633 ± 0.333
1.306TyrPro: 1.306 ± 0.265
1.894TyrGln: 1.894 ± 0.304
2.09TyrArg: 2.09 ± 0.501
2.417TyrSer: 2.417 ± 0.369
1.763TyrThr: 1.763 ± 0.266
2.482TyrVal: 2.482 ± 0.379
0.392TyrTrp: 0.392 ± 0.185
1.633TyrTyr: 1.633 ± 0.414
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 88 proteins (15312 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski