Amino acid dipeptide frequency for Enterococcus phage vB_EfaS_IME196

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
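
The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: it assumes one-letter residue codes, overlapping dipeptides counted within each protein, and normalisation by the total number of dipeptides. The function name `dipeptide_frequencies` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues (one-letter codes)

def dipeptide_frequencies(proteins):
    """Return the per-mille frequency of each of the 400 standard dipeptides.

    Assumption: dipeptides overlap (window of 2, step 1) and never span
    two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    return {a + b: 1000 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}

toy = ["MKKLAE", "AEKK"]          # hypothetical sequences, not phage proteins
freqs = dipeptide_frequencies(toy)
expected = 1000 / len(freqs)      # uniform expectation in per mille (1/400)
print(freqs["KK"], expected)
```

With real proteomes, residues outside the 20 standard letters would need to be mapped to Xaa first, which extends the alphabet to 21 symbols and the table to 441 cells.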

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
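
As a small worked example of the unit conversion, the sketch below turns a few per-mille values from the table into fractions and compares them with the uniform expectation of 2.5‰ (0.25%). The helper names are hypothetical, and using exactly 2.5‰ as the over/under threshold is an assumption about how the colouring was chosen.

```python
EXPECTED_PER_MILLE = 1000 / 400  # uniform expectation: 2.5 per mille = 0.25 %

def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def representation(value, expected=EXPECTED_PER_MILLE):
    """Classify a per-mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"

# Values copied from the table for this proteome
sample = {"AlaAla": 0.707, "LysGlu": 9.187, "GluLeu": 8.127, "ProGly": 0.0}

for name, value in sample.items():
    print(name, per_mille_to_fraction(value), representation(value))
```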

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.707 ± 0.231
AlaCys: 0.442 ± 0.235
AlaAsp: 3.004 ± 0.662
AlaGlu: 4.24 ± 0.59
AlaPhe: 2.915 ± 0.677
AlaGly: 3.004 ± 0.469
AlaHis: 1.502 ± 0.402
AlaIle: 5.389 ± 0.907
AlaLys: 6.095 ± 0.792
AlaLeu: 5.83 ± 0.657
AlaMet: 2.473 ± 0.622
AlaAsn: 4.329 ± 0.65
AlaPro: 2.032 ± 0.367
AlaGln: 1.943 ± 0.343
AlaArg: 1.678 ± 0.275
AlaSer: 2.827 ± 0.447
AlaThr: 4.329 ± 0.66
AlaVal: 4.682 ± 0.648
AlaTrp: 0.353 ± 0.148
AlaTyr: 2.562 ± 0.432
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.177 ± 0.114
CysCys: 0.0 ± 0.0
CysAsp: 0.265 ± 0.147
CysGlu: 0.53 ± 0.228
CysPhe: 0.265 ± 0.166
CysGly: 0.353 ± 0.177
CysHis: 0.088 ± 0.082
CysIle: 0.442 ± 0.224
CysLys: 0.707 ± 0.285
CysLeu: 0.353 ± 0.17
CysMet: 0.265 ± 0.144
CysAsn: 0.618 ± 0.234
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.353 ± 0.182
CysThr: 0.088 ± 0.084
CysVal: 0.353 ± 0.2
CysTrp: 0.177 ± 0.124
CysTyr: 0.088 ± 0.083
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.445 ± 0.559
AspCys: 0.265 ± 0.149
AspAsp: 2.12 ± 0.501
AspGlu: 3.975 ± 0.698
AspPhe: 3.004 ± 0.53
AspGly: 4.594 ± 0.809
AspHis: 0.795 ± 0.267
AspIle: 4.505 ± 0.682
AspLys: 6.184 ± 0.753
AspLeu: 5.212 ± 0.728
AspMet: 2.297 ± 0.387
AspAsn: 4.594 ± 0.847
AspPro: 2.12 ± 0.531
AspGln: 1.502 ± 0.272
AspArg: 1.767 ± 0.346
AspSer: 3.622 ± 0.436
AspThr: 3.445 ± 0.566
AspVal: 3.534 ± 0.598
AspTrp: 0.795 ± 0.242
AspTyr: 3.887 ± 0.749
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.919 ± 0.828
GluCys: 0.353 ± 0.177
GluAsp: 4.682 ± 0.945
GluGlu: 7.42 ± 1.608
GluPhe: 3.357 ± 0.622
GluGly: 4.329 ± 0.604
GluHis: 1.148 ± 0.308
GluIle: 3.445 ± 0.601
GluLys: 6.36 ± 0.781
GluLeu: 8.127 ± 0.86
GluMet: 2.032 ± 0.524
GluAsn: 3.445 ± 0.592
GluPro: 2.65 ± 0.632
GluGln: 3.799 ± 0.682
GluArg: 3.269 ± 0.567
GluSer: 4.064 ± 0.591
GluThr: 4.505 ± 0.522
GluVal: 7.067 ± 0.938
GluTrp: 1.767 ± 0.325
GluTyr: 3.357 ± 0.702
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.502 ± 0.308
PheCys: 0.353 ± 0.179
PheAsp: 3.004 ± 0.475
PheGlu: 3.18 ± 0.538
PhePhe: 1.06 ± 0.376
PheGly: 3.18 ± 0.732
PheHis: 0.265 ± 0.193
PheIle: 4.505 ± 0.7
PheLys: 4.947 ± 0.741
PheLeu: 2.208 ± 0.363
PheMet: 1.148 ± 0.306
PheAsn: 3.71 ± 0.693
PhePro: 0.707 ± 0.216
PheGln: 1.59 ± 0.271
PheArg: 1.502 ± 0.448
PheSer: 2.385 ± 0.599
PheThr: 3.799 ± 0.726
PheVal: 2.385 ± 0.412
PheTrp: 0.53 ± 0.241
PheTyr: 1.06 ± 0.305
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.417 ± 1.178
GlyCys: 0.177 ± 0.12
GlyAsp: 4.064 ± 0.723
GlyGlu: 3.887 ± 0.5
GlyPhe: 3.534 ± 0.467
GlyGly: 4.947 ± 1.725
GlyHis: 0.795 ± 0.313
GlyIle: 5.212 ± 0.777
GlyLys: 5.83 ± 0.66
GlyLeu: 6.625 ± 1.086
GlyMet: 1.148 ± 0.311
GlyAsn: 4.152 ± 0.765
GlyPro: 0.707 ± 0.309
GlyGln: 1.855 ± 0.433
GlyArg: 2.473 ± 0.416
GlySer: 2.827 ± 0.561
GlyThr: 3.18 ± 0.754
GlyVal: 4.594 ± 0.791
GlyTrp: 0.972 ± 0.287
GlyTyr: 2.12 ± 0.51
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.972 ± 0.309
HisCys: 0.088 ± 0.085
HisAsp: 0.883 ± 0.255
HisGlu: 1.148 ± 0.3
HisPhe: 0.883 ± 0.303
HisGly: 1.06 ± 0.313
HisHis: 0.177 ± 0.108
HisIle: 0.972 ± 0.291
HisLys: 1.237 ± 0.277
HisLeu: 0.883 ± 0.25
HisMet: 0.265 ± 0.156
HisAsn: 1.148 ± 0.313
HisPro: 0.177 ± 0.145
HisGln: 0.707 ± 0.22
HisArg: 0.972 ± 0.289
HisSer: 0.442 ± 0.201
HisThr: 0.883 ± 0.369
HisVal: 1.325 ± 0.344
HisTrp: 0.177 ± 0.145
HisTyr: 0.883 ± 0.327
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.24 ± 0.531
IleCys: 0.353 ± 0.188
IleAsp: 5.124 ± 0.622
IleGlu: 7.155 ± 0.988
IlePhe: 2.915 ± 0.65
IleGly: 3.357 ± 0.646
IleHis: 0.883 ± 0.295
IleIle: 3.887 ± 0.576
IleLys: 5.919 ± 0.73
IleLeu: 5.3 ± 0.668
IleMet: 1.767 ± 0.487
IleAsn: 4.77 ± 0.608
IlePro: 2.915 ± 0.335
IleGln: 2.562 ± 0.446
IleArg: 2.297 ± 0.39
IleSer: 3.269 ± 0.515
IleThr: 3.887 ± 0.445
IleVal: 3.975 ± 0.608
IleTrp: 0.53 ± 0.203
IleTyr: 1.59 ± 0.425
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.802 ± 0.91
LysCys: 0.442 ± 0.237
LysAsp: 5.919 ± 0.906
LysGlu: 9.187 ± 1.063
LysPhe: 3.004 ± 0.556
LysGly: 4.77 ± 0.828
LysHis: 1.678 ± 0.395
LysIle: 4.24 ± 0.641
LysLys: 5.742 ± 0.71
LysLeu: 7.42 ± 0.828
LysMet: 3.004 ± 0.419
LysAsn: 4.505 ± 0.567
LysPro: 3.799 ± 0.673
LysGln: 3.887 ± 0.586
LysArg: 3.622 ± 0.518
LysSer: 3.799 ± 0.797
LysThr: 4.859 ± 0.488
LysVal: 5.742 ± 0.633
LysTrp: 1.59 ± 0.356
LysTyr: 3.357 ± 0.572
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.947 ± 0.64
LeuCys: 0.177 ± 0.191
LeuAsp: 6.184 ± 0.648
LeuGlu: 8.392 ± 0.997
LeuPhe: 3.357 ± 0.534
LeuGly: 5.919 ± 1.18
LeuHis: 1.06 ± 0.274
LeuIle: 3.975 ± 0.759
LeuLys: 6.449 ± 0.858
LeuLeu: 6.714 ± 0.951
LeuMet: 1.678 ± 0.33
LeuAsn: 5.654 ± 0.756
LeuPro: 2.65 ± 0.477
LeuGln: 4.064 ± 0.526
LeuArg: 2.385 ± 0.368
LeuSer: 5.389 ± 0.564
LeuThr: 4.77 ± 0.604
LeuVal: 5.742 ± 0.727
LeuTrp: 1.148 ± 0.265
LeuTyr: 3.269 ± 0.59
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.943 ± 0.501
MetCys: 0.088 ± 0.096
MetAsp: 1.502 ± 0.434
MetGlu: 1.678 ± 0.459
MetPhe: 1.502 ± 0.361
MetGly: 1.502 ± 0.448
MetHis: 0.088 ± 0.096
MetIle: 2.208 ± 0.401
MetLys: 2.385 ± 0.505
MetLeu: 1.502 ± 0.404
MetMet: 0.618 ± 0.278
MetAsn: 2.032 ± 0.376
MetPro: 0.618 ± 0.237
MetGln: 1.237 ± 0.344
MetArg: 1.678 ± 0.379
MetSer: 1.855 ± 0.331
MetThr: 1.502 ± 0.347
MetVal: 1.943 ± 0.583
MetTrp: 0.795 ± 0.322
MetTyr: 1.325 ± 0.413
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.329 ± 0.661
AsnCys: 0.177 ± 0.129
AsnAsp: 2.915 ± 0.567
AsnGlu: 6.184 ± 0.741
AsnPhe: 1.855 ± 0.463
AsnGly: 6.184 ± 0.736
AsnHis: 1.148 ± 0.312
AsnIle: 4.152 ± 0.635
AsnLys: 6.007 ± 0.663
AsnLeu: 4.859 ± 0.62
AsnMet: 1.855 ± 0.439
AsnAsn: 4.152 ± 0.653
AsnPro: 2.297 ± 0.582
AsnGln: 1.855 ± 0.291
AsnArg: 1.325 ± 0.26
AsnSer: 3.445 ± 0.619
AsnThr: 4.329 ± 0.702
AsnVal: 4.417 ± 0.858
AsnTrp: 0.972 ± 0.294
AsnTyr: 3.357 ± 0.68
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.678 ± 0.447
ProCys: 0.088 ± 0.084
ProAsp: 2.385 ± 0.658
ProGlu: 3.269 ± 0.495
ProPhe: 0.972 ± 0.264
ProGly: 0.0 ± 0.0
ProHis: 0.177 ± 0.119
ProIle: 2.032 ± 0.292
ProLys: 2.473 ± 0.934
ProLeu: 3.269 ± 0.468
ProMet: 0.883 ± 0.226
ProAsn: 2.473 ± 0.528
ProPro: 0.353 ± 0.159
ProGln: 1.59 ± 0.368
ProArg: 1.06 ± 0.369
ProSer: 2.032 ± 0.346
ProThr: 2.65 ± 0.52
ProVal: 1.678 ± 0.291
ProTrp: 0.265 ± 0.147
ProTyr: 1.59 ± 0.523
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.855 ± 0.439
GlnCys: 0.265 ± 0.172
GlnAsp: 1.502 ± 0.282
GlnGlu: 2.385 ± 0.43
GlnPhe: 1.59 ± 0.431
GlnGly: 1.767 ± 0.39
GlnHis: 0.707 ± 0.253
GlnIle: 2.915 ± 0.429
GlnLys: 2.385 ± 0.486
GlnLeu: 3.357 ± 0.583
GlnMet: 1.325 ± 0.323
GlnAsn: 1.855 ± 0.34
GlnPro: 1.59 ± 0.39
GlnGln: 2.297 ± 0.511
GlnArg: 1.502 ± 0.411
GlnSer: 2.385 ± 0.371
GlnThr: 2.032 ± 0.3
GlnVal: 3.004 ± 0.434
GlnTrp: 0.353 ± 0.208
GlnTyr: 1.943 ± 0.372
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.943 ± 0.324
ArgCys: 0.265 ± 0.145
ArgAsp: 2.12 ± 0.472
ArgGlu: 1.502 ± 0.492
ArgPhe: 1.943 ± 0.277
ArgGly: 1.943 ± 0.574
ArgHis: 0.442 ± 0.235
ArgIle: 2.208 ± 0.39
ArgLys: 3.534 ± 0.615
ArgLeu: 3.092 ± 0.547
ArgMet: 1.06 ± 0.254
ArgAsn: 2.208 ± 0.45
ArgPro: 1.06 ± 0.33
ArgGln: 1.237 ± 0.387
ArgArg: 1.325 ± 0.302
ArgSer: 1.943 ± 0.394
ArgThr: 1.855 ± 0.314
ArgVal: 2.208 ± 0.559
ArgTrp: 0.353 ± 0.149
ArgTyr: 1.325 ± 0.376
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.562 ± 0.542
SerCys: 0.177 ± 0.128
SerAsp: 2.739 ± 0.374
SerGlu: 3.534 ± 0.619
SerPhe: 2.915 ± 0.432
SerGly: 4.682 ± 0.782
SerHis: 1.413 ± 0.35
SerIle: 3.534 ± 0.53
SerLys: 4.77 ± 0.666
SerLeu: 4.329 ± 0.629
SerMet: 1.59 ± 0.371
SerAsn: 4.505 ± 0.706
SerPro: 0.795 ± 0.281
SerGln: 2.12 ± 0.532
SerArg: 1.237 ± 0.344
SerSer: 3.004 ± 0.59
SerThr: 3.534 ± 0.808
SerVal: 3.092 ± 0.571
SerTrp: 0.795 ± 0.233
SerTyr: 2.032 ± 0.509
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.24 ± 0.587
ThrCys: 0.265 ± 0.17
ThrAsp: 3.799 ± 0.593
ThrGlu: 3.799 ± 0.545
ThrPhe: 2.208 ± 0.45
ThrGly: 4.859 ± 0.768
ThrHis: 1.678 ± 0.444
ThrIle: 4.505 ± 0.646
ThrLys: 6.449 ± 0.661
ThrLeu: 4.859 ± 0.717
ThrMet: 1.237 ± 0.351
ThrAsn: 3.622 ± 0.494
ThrPro: 2.473 ± 0.414
ThrGln: 1.678 ± 0.391
ThrArg: 2.473 ± 0.466
ThrSer: 2.208 ± 0.412
ThrThr: 4.329 ± 0.744
ThrVal: 3.71 ± 0.546
ThrTrp: 0.53 ± 0.178
ThrTyr: 2.385 ± 0.755
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.095 ± 0.699
ValCys: 0.353 ± 0.169
ValAsp: 5.477 ± 0.561
ValGlu: 4.77 ± 0.683
ValPhe: 3.18 ± 0.468
ValGly: 4.859 ± 0.745
ValHis: 0.795 ± 0.28
ValIle: 4.417 ± 0.555
ValLys: 5.212 ± 1.048
ValLeu: 5.124 ± 0.662
ValMet: 2.12 ± 0.309
ValAsn: 3.975 ± 0.502
ValPro: 2.562 ± 0.444
ValGln: 1.678 ± 0.401
ValArg: 1.59 ± 0.432
ValSer: 4.505 ± 0.455
ValThr: 3.71 ± 0.529
ValVal: 4.505 ± 0.624
ValTrp: 0.972 ± 0.261
ValTyr: 2.208 ± 0.426
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.707 ± 0.306
TrpCys: 0.265 ± 0.166
TrpAsp: 0.795 ± 0.386
TrpGlu: 1.325 ± 0.315
TrpPhe: 0.972 ± 0.283
TrpGly: 0.707 ± 0.258
TrpHis: 0.088 ± 0.087
TrpIle: 0.707 ± 0.276
TrpLys: 0.883 ± 0.266
TrpLeu: 1.413 ± 0.331
TrpMet: 0.177 ± 0.119
TrpAsn: 0.53 ± 0.242
TrpPro: 0.0 ± 0.0
TrpGln: 0.353 ± 0.157
TrpArg: 0.618 ± 0.223
TrpSer: 0.53 ± 0.17
TrpThr: 0.53 ± 0.163
TrpVal: 2.208 ± 0.463
TrpTrp: 0.177 ± 0.105
TrpTyr: 0.265 ± 0.151
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.855 ± 0.431
TyrCys: 0.53 ± 0.221
TyrAsp: 3.18 ± 0.59
TyrGlu: 3.534 ± 0.631
TyrPhe: 1.59 ± 0.356
TyrGly: 1.502 ± 0.304
TyrHis: 0.353 ± 0.153
TyrIle: 3.445 ± 0.534
TyrLys: 3.887 ± 0.545
TyrLeu: 3.445 ± 0.657
TyrMet: 0.972 ± 0.262
TyrAsn: 3.534 ± 0.644
TyrPro: 1.413 ± 0.423
TyrGln: 1.06 ± 0.27
TyrArg: 0.795 ± 0.296
TyrSer: 2.473 ± 0.507
TyrThr: 3.092 ± 0.685
TyrVal: 1.943 ± 0.534
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.032 ± 0.496
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (11321 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
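
The page does not spell out the bootstrap procedure, so the following is only a sketch of one common reading: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement; the source only states 'x100 at the protein level'.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resample, dipeptide))
    return statistics.stdev(estimates)

toy = ["MKKLAE", "AEKK", "KKKK"]   # hypothetical sequences
print(dipeptide_per_mille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Under this reading, a dipeptide that never occurs in any protein gets an error of exactly 0, which matches the `0.0 ± 0.0` entries in the table.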

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski