Amino acid dipepetide frequency for Escherichia phage vB_EcoS_AKS96

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.934AlaAla: 7.934 ± 1.102
0.572AlaCys: 0.572 ± 0.224
4.146AlaAsp: 4.146 ± 0.65
4.861AlaGlu: 4.861 ± 0.723
2.788AlaPhe: 2.788 ± 0.427
6.862AlaGly: 6.862 ± 0.733
0.858AlaHis: 0.858 ± 0.248
6.791AlaIle: 6.791 ± 0.77
6.147AlaLys: 6.147 ± 1.117
6.719AlaLeu: 6.719 ± 0.85
2.287AlaMet: 2.287 ± 0.421
4.146AlaAsn: 4.146 ± 0.491
1.287AlaPro: 1.287 ± 0.254
3.574AlaGln: 3.574 ± 0.483
4.289AlaArg: 4.289 ± 0.703
5.933AlaSer: 5.933 ± 0.653
3.931AlaThr: 3.931 ± 0.533
5.432AlaVal: 5.432 ± 0.836
1.215AlaTrp: 1.215 ± 0.278
2.716AlaTyr: 2.716 ± 0.437
0.0AlaXaa: 0.0 ± 0.0
Cys
0.572CysAla: 0.572 ± 0.237
0.143CysCys: 0.143 ± 0.121
0.715CysAsp: 0.715 ± 0.202
1.001CysGlu: 1.001 ± 0.295
0.643CysPhe: 0.643 ± 0.242
1.072CysGly: 1.072 ± 0.305
0.214CysHis: 0.214 ± 0.155
0.572CysIle: 0.572 ± 0.251
0.929CysLys: 0.929 ± 0.288
1.144CysLeu: 1.144 ± 0.316
0.286CysMet: 0.286 ± 0.152
0.643CysAsn: 0.643 ± 0.202
0.429CysPro: 0.429 ± 0.172
0.286CysGln: 0.286 ± 0.147
0.929CysArg: 0.929 ± 0.275
1.144CysSer: 1.144 ± 0.344
0.572CysThr: 0.572 ± 0.199
1.001CysVal: 1.001 ± 0.226
0.214CysTrp: 0.214 ± 0.136
0.357CysTyr: 0.357 ± 0.171
0.0CysXaa: 0.0 ± 0.0
Asp
4.718AspAla: 4.718 ± 0.661
0.715AspCys: 0.715 ± 0.266
4.217AspAsp: 4.217 ± 0.554
4.646AspGlu: 4.646 ± 0.696
2.716AspPhe: 2.716 ± 0.481
7.291AspGly: 7.291 ± 0.78
0.786AspHis: 0.786 ± 0.292
3.574AspIle: 3.574 ± 0.498
5.004AspLys: 5.004 ± 0.616
3.717AspLeu: 3.717 ± 0.501
1.001AspMet: 1.001 ± 0.36
2.716AspAsn: 2.716 ± 0.538
1.858AspPro: 1.858 ± 0.31
1.287AspGln: 1.287 ± 0.365
2.43AspArg: 2.43 ± 0.489
4.789AspSer: 4.789 ± 0.611
2.859AspThr: 2.859 ± 0.4
4.217AspVal: 4.217 ± 0.556
0.643AspTrp: 0.643 ± 0.196
3.074AspTyr: 3.074 ± 0.453
0.0AspXaa: 0.0 ± 0.0
Glu
5.361GluAla: 5.361 ± 0.661
0.5GluCys: 0.5 ± 0.18
3.217GluAsp: 3.217 ± 0.542
4.003GluGlu: 4.003 ± 0.708
3.645GluPhe: 3.645 ± 0.502
3.145GluGly: 3.145 ± 0.463
0.858GluHis: 0.858 ± 0.284
5.289GluIle: 5.289 ± 0.645
3.645GluLys: 3.645 ± 0.639
5.647GluLeu: 5.647 ± 0.634
2.502GluMet: 2.502 ± 0.558
3.145GluAsn: 3.145 ± 0.615
1.287GluPro: 1.287 ± 0.317
2.573GluGln: 2.573 ± 0.433
2.859GluArg: 2.859 ± 0.558
4.146GluSer: 4.146 ± 0.638
3.503GluThr: 3.503 ± 0.583
5.075GluVal: 5.075 ± 0.779
0.572GluTrp: 0.572 ± 0.196
2.859GluTyr: 2.859 ± 0.474
0.0GluXaa: 0.0 ± 0.0
Phe
2.859PheAla: 2.859 ± 0.507
0.858PheCys: 0.858 ± 0.299
3.288PheAsp: 3.288 ± 0.467
2.716PheGlu: 2.716 ± 0.551
0.929PhePhe: 0.929 ± 0.251
4.36PheGly: 4.36 ± 0.587
0.5PheHis: 0.5 ± 0.261
2.144PheIle: 2.144 ± 0.355
2.287PheLys: 2.287 ± 0.448
2.287PheLeu: 2.287 ± 0.483
1.215PheMet: 1.215 ± 0.313
2.716PheAsn: 2.716 ± 0.488
1.358PhePro: 1.358 ± 0.312
1.358PheGln: 1.358 ± 0.367
1.716PheArg: 1.716 ± 0.258
2.502PheSer: 2.502 ± 0.351
2.502PheThr: 2.502 ± 0.512
1.93PheVal: 1.93 ± 0.316
0.5PheTrp: 0.5 ± 0.185
1.501PheTyr: 1.501 ± 0.386
0.0PheXaa: 0.0 ± 0.0
Gly
4.789GlyAla: 4.789 ± 0.727
1.287GlyCys: 1.287 ± 0.312
4.432GlyAsp: 4.432 ± 0.697
5.289GlyGlu: 5.289 ± 0.508
3.574GlyPhe: 3.574 ± 0.815
6.147GlyGly: 6.147 ± 1.144
0.858GlyHis: 0.858 ± 0.337
5.504GlyIle: 5.504 ± 0.647
5.075GlyLys: 5.075 ± 0.607
6.004GlyLeu: 6.004 ± 0.598
2.359GlyMet: 2.359 ± 0.497
3.217GlyAsn: 3.217 ± 0.484
1.072GlyPro: 1.072 ± 0.465
1.787GlyGln: 1.787 ± 0.339
3.217GlyArg: 3.217 ± 0.518
6.362GlySer: 6.362 ± 0.674
4.36GlyThr: 4.36 ± 0.67
6.576GlyVal: 6.576 ± 0.729
1.43GlyTrp: 1.43 ± 0.417
4.003GlyTyr: 4.003 ± 0.541
0.0GlyXaa: 0.0 ± 0.0
His
0.643HisAla: 0.643 ± 0.267
0.214HisCys: 0.214 ± 0.116
0.929HisAsp: 0.929 ± 0.306
0.929HisGlu: 0.929 ± 0.317
0.357HisPhe: 0.357 ± 0.155
1.072HisGly: 1.072 ± 0.349
0.357HisHis: 0.357 ± 0.166
1.001HisIle: 1.001 ± 0.297
1.716HisLys: 1.716 ± 0.39
1.144HisLeu: 1.144 ± 0.393
0.143HisMet: 0.143 ± 0.104
0.643HisAsn: 0.643 ± 0.284
0.286HisPro: 0.286 ± 0.16
0.214HisGln: 0.214 ± 0.133
0.929HisArg: 0.929 ± 0.287
0.786HisSer: 0.786 ± 0.239
0.858HisThr: 0.858 ± 0.258
1.43HisVal: 1.43 ± 0.374
0.143HisTrp: 0.143 ± 0.106
0.715HisTyr: 0.715 ± 0.231
0.0HisXaa: 0.0 ± 0.0
Ile
6.219IleAla: 6.219 ± 0.535
1.144IleCys: 1.144 ± 0.294
5.575IleAsp: 5.575 ± 0.798
5.289IleGlu: 5.289 ± 0.768
1.858IlePhe: 1.858 ± 0.29
4.289IleGly: 4.289 ± 0.667
1.072IleHis: 1.072 ± 0.285
4.003IleIle: 4.003 ± 0.588
4.074IleLys: 4.074 ± 0.48
3.074IleLeu: 3.074 ± 0.557
0.929IleMet: 0.929 ± 0.353
3.931IleAsn: 3.931 ± 0.52
2.573IlePro: 2.573 ± 0.439
2.073IleGln: 2.073 ± 0.488
3.503IleArg: 3.503 ± 0.514
3.86IleSer: 3.86 ± 0.52
4.646IleThr: 4.646 ± 0.509
3.717IleVal: 3.717 ± 0.413
0.572IleTrp: 0.572 ± 0.205
2.43IleTyr: 2.43 ± 0.393
0.0IleXaa: 0.0 ± 0.0
Lys
6.719LysAla: 6.719 ± 0.982
0.715LysCys: 0.715 ± 0.353
3.145LysAsp: 3.145 ± 0.454
4.074LysGlu: 4.074 ± 0.75
2.716LysPhe: 2.716 ± 0.408
3.645LysGly: 3.645 ± 0.586
1.287LysHis: 1.287 ± 0.317
3.645LysIle: 3.645 ± 0.493
4.36LysLys: 4.36 ± 0.826
5.718LysLeu: 5.718 ± 0.769
2.931LysMet: 2.931 ± 0.667
2.216LysAsn: 2.216 ± 0.389
1.43LysPro: 1.43 ± 0.444
2.645LysGln: 2.645 ± 0.468
2.502LysArg: 2.502 ± 0.587
4.575LysSer: 4.575 ± 0.577
3.788LysThr: 3.788 ± 0.57
4.861LysVal: 4.861 ± 0.733
0.715LysTrp: 0.715 ± 0.245
2.216LysTyr: 2.216 ± 0.39
0.0LysXaa: 0.0 ± 0.0
Leu
6.219LeuAla: 6.219 ± 0.735
0.643LeuCys: 0.643 ± 0.26
4.932LeuAsp: 4.932 ± 0.666
4.146LeuGlu: 4.146 ± 0.492
1.93LeuPhe: 1.93 ± 0.376
4.36LeuGly: 4.36 ± 0.501
1.43LeuHis: 1.43 ± 0.329
4.36LeuIle: 4.36 ± 0.629
4.36LeuLys: 4.36 ± 0.588
4.217LeuLeu: 4.217 ± 0.687
1.93LeuMet: 1.93 ± 0.398
3.503LeuAsn: 3.503 ± 0.305
3.288LeuPro: 3.288 ± 0.499
2.645LeuGln: 2.645 ± 0.608
3.931LeuArg: 3.931 ± 0.541
6.004LeuSer: 6.004 ± 0.746
4.861LeuThr: 4.861 ± 0.529
4.646LeuVal: 4.646 ± 0.637
0.429LeuTrp: 0.429 ± 0.18
2.216LeuTyr: 2.216 ± 0.416
0.0LeuXaa: 0.0 ± 0.0
Met
3.145MetAla: 3.145 ± 0.452
0.214MetCys: 0.214 ± 0.139
0.858MetAsp: 0.858 ± 0.256
1.43MetGlu: 1.43 ± 0.367
1.215MetPhe: 1.215 ± 0.323
1.001MetGly: 1.001 ± 0.233
0.429MetHis: 0.429 ± 0.182
2.001MetIle: 2.001 ± 0.498
1.787MetLys: 1.787 ± 0.387
1.644MetLeu: 1.644 ± 0.414
1.072MetMet: 1.072 ± 0.33
1.644MetAsn: 1.644 ± 0.395
0.786MetPro: 0.786 ± 0.226
1.072MetGln: 1.072 ± 0.331
1.43MetArg: 1.43 ± 0.332
1.716MetSer: 1.716 ± 0.382
2.144MetThr: 2.144 ± 0.465
2.001MetVal: 2.001 ± 0.408
0.214MetTrp: 0.214 ± 0.119
0.715MetTyr: 0.715 ± 0.223
0.0MetXaa: 0.0 ± 0.0
Asn
3.645AsnAla: 3.645 ± 0.535
0.572AsnCys: 0.572 ± 0.265
4.003AsnAsp: 4.003 ± 0.5
3.36AsnGlu: 3.36 ± 0.578
1.144AsnPhe: 1.144 ± 0.33
5.933AsnGly: 5.933 ± 0.887
0.929AsnHis: 0.929 ± 0.281
2.645AsnIle: 2.645 ± 0.401
2.573AsnLys: 2.573 ± 0.364
4.503AsnLeu: 4.503 ± 0.508
1.215AsnMet: 1.215 ± 0.349
3.717AsnAsn: 3.717 ± 0.618
1.787AsnPro: 1.787 ± 0.338
1.93AsnGln: 1.93 ± 0.469
1.93AsnArg: 1.93 ± 0.372
3.36AsnSer: 3.36 ± 0.661
2.645AsnThr: 2.645 ± 0.476
3.503AsnVal: 3.503 ± 0.528
0.715AsnTrp: 0.715 ± 0.217
2.073AsnTyr: 2.073 ± 0.432
0.0AsnXaa: 0.0 ± 0.0
Pro
3.074ProAla: 3.074 ± 0.43
0.429ProCys: 0.429 ± 0.213
1.287ProAsp: 1.287 ± 0.466
2.788ProGlu: 2.788 ± 0.494
1.787ProPhe: 1.787 ± 0.383
1.93ProGly: 1.93 ± 0.496
0.5ProHis: 0.5 ± 0.207
1.215ProIle: 1.215 ± 0.296
1.215ProLys: 1.215 ± 0.282
1.858ProLeu: 1.858 ± 0.443
0.5ProMet: 0.5 ± 0.206
1.501ProAsn: 1.501 ± 0.294
0.929ProPro: 0.929 ± 0.278
2.144ProGln: 2.144 ± 0.485
1.43ProArg: 1.43 ± 0.307
1.358ProSer: 1.358 ± 0.309
1.43ProThr: 1.43 ± 0.356
3.288ProVal: 3.288 ± 0.478
0.357ProTrp: 0.357 ± 0.167
1.001ProTyr: 1.001 ± 0.242
0.0ProXaa: 0.0 ± 0.0
Gln
4.074GlnAla: 4.074 ± 0.725
0.5GlnCys: 0.5 ± 0.212
1.93GlnAsp: 1.93 ± 0.397
2.573GlnGlu: 2.573 ± 0.486
1.358GlnPhe: 1.358 ± 0.271
2.073GlnGly: 2.073 ± 0.405
0.357GlnHis: 0.357 ± 0.143
3.431GlnIle: 3.431 ± 0.766
1.93GlnLys: 1.93 ± 0.442
3.217GlnLeu: 3.217 ± 0.55
0.643GlnMet: 0.643 ± 0.2
1.93GlnAsn: 1.93 ± 0.43
1.287GlnPro: 1.287 ± 0.284
2.573GlnGln: 2.573 ± 0.799
1.573GlnArg: 1.573 ± 0.427
2.931GlnSer: 2.931 ± 0.579
1.644GlnThr: 1.644 ± 0.421
2.716GlnVal: 2.716 ± 0.441
0.429GlnTrp: 0.429 ± 0.19
1.573GlnTyr: 1.573 ± 0.43
0.0GlnXaa: 0.0 ± 0.0
Arg
3.217ArgAla: 3.217 ± 0.564
1.072ArgCys: 1.072 ± 0.382
2.788ArgAsp: 2.788 ± 0.455
3.074ArgGlu: 3.074 ± 0.434
2.216ArgPhe: 2.216 ± 0.376
2.931ArgGly: 2.931 ± 0.535
1.001ArgHis: 1.001 ± 0.341
3.36ArgIle: 3.36 ± 0.521
3.788ArgLys: 3.788 ± 0.63
3.288ArgLeu: 3.288 ± 0.579
0.858ArgMet: 0.858 ± 0.247
2.502ArgAsn: 2.502 ± 0.442
2.073ArgPro: 2.073 ± 0.414
2.502ArgGln: 2.502 ± 0.434
2.931ArgArg: 2.931 ± 0.447
2.502ArgSer: 2.502 ± 0.498
1.858ArgThr: 1.858 ± 0.474
3.574ArgVal: 3.574 ± 0.47
0.5ArgTrp: 0.5 ± 0.191
2.43ArgTyr: 2.43 ± 0.398
0.0ArgXaa: 0.0 ± 0.0
Ser
6.29SerAla: 6.29 ± 0.869
0.5SerCys: 0.5 ± 0.234
5.504SerAsp: 5.504 ± 0.574
4.646SerGlu: 4.646 ± 0.589
3.288SerPhe: 3.288 ± 0.482
7.148SerGly: 7.148 ± 0.786
0.572SerHis: 0.572 ± 0.215
3.86SerIle: 3.86 ± 0.576
3.86SerLys: 3.86 ± 0.541
5.289SerLeu: 5.289 ± 0.557
1.573SerMet: 1.573 ± 0.392
4.217SerAsn: 4.217 ± 0.537
2.43SerPro: 2.43 ± 0.465
3.002SerGln: 3.002 ± 0.421
2.788SerArg: 2.788 ± 0.548
5.79SerSer: 5.79 ± 1.148
3.574SerThr: 3.574 ± 0.601
5.147SerVal: 5.147 ± 0.611
0.643SerTrp: 0.643 ± 0.23
2.573SerTyr: 2.573 ± 0.443
0.0SerXaa: 0.0 ± 0.0
Thr
5.004ThrAla: 5.004 ± 0.696
0.858ThrCys: 0.858 ± 0.243
2.502ThrAsp: 2.502 ± 0.425
2.788ThrGlu: 2.788 ± 0.446
1.858ThrPhe: 1.858 ± 0.403
5.075ThrGly: 5.075 ± 0.607
0.858ThrHis: 0.858 ± 0.215
4.074ThrIle: 4.074 ± 0.523
2.287ThrLys: 2.287 ± 0.434
3.717ThrLeu: 3.717 ± 0.52
1.501ThrMet: 1.501 ± 0.345
3.217ThrAsn: 3.217 ± 0.503
2.287ThrPro: 2.287 ± 0.388
2.43ThrGln: 2.43 ± 0.422
2.359ThrArg: 2.359 ± 0.377
3.717ThrSer: 3.717 ± 0.494
3.074ThrThr: 3.074 ± 0.564
4.36ThrVal: 4.36 ± 0.582
1.001ThrTrp: 1.001 ± 0.263
2.43ThrTyr: 2.43 ± 0.429
0.0ThrXaa: 0.0 ± 0.0
Val
5.289ValAla: 5.289 ± 0.735
0.858ValCys: 0.858 ± 0.272
5.147ValAsp: 5.147 ± 0.466
3.217ValGlu: 3.217 ± 0.607
3.145ValPhe: 3.145 ± 0.526
5.004ValGly: 5.004 ± 0.659
0.858ValHis: 0.858 ± 0.237
4.217ValIle: 4.217 ± 0.561
5.79ValLys: 5.79 ± 0.972
4.074ValLeu: 4.074 ± 0.453
2.216ValMet: 2.216 ± 0.41
3.645ValAsn: 3.645 ± 0.553
2.216ValPro: 2.216 ± 0.509
2.788ValGln: 2.788 ± 0.528
4.146ValArg: 4.146 ± 0.626
7.291ValSer: 7.291 ± 0.544
3.431ValThr: 3.431 ± 0.444
5.504ValVal: 5.504 ± 0.802
1.072ValTrp: 1.072 ± 0.241
2.716ValTyr: 2.716 ± 0.474
0.0ValXaa: 0.0 ± 0.0
Trp
0.429TrpAla: 0.429 ± 0.162
0.357TrpCys: 0.357 ± 0.151
0.643TrpAsp: 0.643 ± 0.201
0.5TrpGlu: 0.5 ± 0.209
0.572TrpPhe: 0.572 ± 0.241
0.929TrpGly: 0.929 ± 0.252
0.286TrpHis: 0.286 ± 0.131
0.929TrpIle: 0.929 ± 0.249
0.929TrpLys: 0.929 ± 0.262
1.072TrpLeu: 1.072 ± 0.234
0.357TrpMet: 0.357 ± 0.148
0.5TrpAsn: 0.5 ± 0.198
0.357TrpPro: 0.357 ± 0.148
0.214TrpGln: 0.214 ± 0.186
0.858TrpArg: 0.858 ± 0.24
0.929TrpSer: 0.929 ± 0.337
1.001TrpThr: 1.001 ± 0.321
0.858TrpVal: 0.858 ± 0.262
0.357TrpTrp: 0.357 ± 0.162
0.214TrpTyr: 0.214 ± 0.121
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.287TyrAla: 2.287 ± 0.47
0.715TyrCys: 0.715 ± 0.263
3.074TyrAsp: 3.074 ± 0.518
2.502TyrGlu: 2.502 ± 0.468
1.787TyrPhe: 1.787 ± 0.351
3.074TyrGly: 3.074 ± 0.483
0.5TyrHis: 0.5 ± 0.209
2.216TyrIle: 2.216 ± 0.364
2.144TyrLys: 2.144 ± 0.597
1.716TyrLeu: 1.716 ± 0.275
0.929TyrMet: 0.929 ± 0.259
2.43TyrAsn: 2.43 ± 0.361
1.001TyrPro: 1.001 ± 0.271
1.644TyrGln: 1.644 ± 0.249
2.716TyrArg: 2.716 ± 0.416
3.002TyrSer: 3.002 ± 0.494
2.716TyrThr: 2.716 ± 0.405
2.788TyrVal: 2.788 ± 0.394
0.572TyrTrp: 0.572 ± 0.213
1.644TyrTyr: 1.644 ± 0.369
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 74 proteins (13991 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski