Amino acid dipepetide frequency for Escherichia phage Rac-SA53

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.847AlaAla: 8.847 ± 1.149
0.941AlaCys: 0.941 ± 0.245
4.518AlaAsp: 4.518 ± 0.573
5.459AlaGlu: 5.459 ± 0.751
3.012AlaPhe: 3.012 ± 0.483
5.459AlaGly: 5.459 ± 0.524
1.38AlaHis: 1.38 ± 0.281
5.271AlaIle: 5.271 ± 0.514
4.832AlaLys: 4.832 ± 0.673
8.471AlaLeu: 8.471 ± 0.772
2.761AlaMet: 2.761 ± 0.36
4.141AlaAsn: 4.141 ± 0.461
2.196AlaPro: 2.196 ± 0.359
3.89AlaGln: 3.89 ± 0.559
4.706AlaArg: 4.706 ± 0.569
4.957AlaSer: 4.957 ± 0.503
5.396AlaThr: 5.396 ± 0.524
5.522AlaVal: 5.522 ± 0.697
1.38AlaTrp: 1.38 ± 0.287
2.259AlaTyr: 2.259 ± 0.356
0.0AlaXaa: 0.0 ± 0.0
Cys
1.004CysAla: 1.004 ± 0.295
0.188CysCys: 0.188 ± 0.104
1.129CysAsp: 1.129 ± 0.273
0.878CysGlu: 0.878 ± 0.22
0.565CysPhe: 0.565 ± 0.203
0.878CysGly: 0.878 ± 0.26
0.502CysHis: 0.502 ± 0.153
0.627CysIle: 0.627 ± 0.195
0.439CysLys: 0.439 ± 0.183
0.941CysLeu: 0.941 ± 0.226
0.251CysMet: 0.251 ± 0.122
0.69CysAsn: 0.69 ± 0.219
0.627CysPro: 0.627 ± 0.236
0.251CysGln: 0.251 ± 0.125
0.941CysArg: 0.941 ± 0.293
1.192CysSer: 1.192 ± 0.239
0.439CysThr: 0.439 ± 0.143
0.816CysVal: 0.816 ± 0.22
0.439CysTrp: 0.439 ± 0.154
0.251CysTyr: 0.251 ± 0.119
0.0CysXaa: 0.0 ± 0.0
Asp
4.832AspAla: 4.832 ± 0.633
1.129AspCys: 1.129 ± 0.28
3.012AspAsp: 3.012 ± 0.364
4.079AspGlu: 4.079 ± 0.544
2.071AspPhe: 2.071 ± 0.306
4.706AspGly: 4.706 ± 0.521
0.627AspHis: 0.627 ± 0.198
4.079AspIle: 4.079 ± 0.496
2.698AspLys: 2.698 ± 0.431
3.514AspLeu: 3.514 ± 0.439
1.82AspMet: 1.82 ± 0.305
2.071AspAsn: 2.071 ± 0.402
2.322AspPro: 2.322 ± 0.391
1.38AspGln: 1.38 ± 0.29
1.631AspArg: 1.631 ± 0.363
3.451AspSer: 3.451 ± 0.512
3.137AspThr: 3.137 ± 0.394
4.267AspVal: 4.267 ± 0.528
1.129AspTrp: 1.129 ± 0.252
2.133AspTyr: 2.133 ± 0.308
0.0AspXaa: 0.0 ± 0.0
Glu
5.898GluAla: 5.898 ± 0.786
0.502GluCys: 0.502 ± 0.168
2.635GluAsp: 2.635 ± 0.428
5.208GluGlu: 5.208 ± 0.825
2.573GluPhe: 2.573 ± 0.313
4.267GluGly: 4.267 ± 0.36
1.318GluHis: 1.318 ± 0.303
5.459GluIle: 5.459 ± 0.615
5.647GluLys: 5.647 ± 0.567
6.902GluLeu: 6.902 ± 0.732
1.569GluMet: 1.569 ± 0.318
3.137GluAsn: 3.137 ± 0.573
2.259GluPro: 2.259 ± 0.479
3.263GluGln: 3.263 ± 0.495
3.075GluArg: 3.075 ± 0.462
3.263GluSer: 3.263 ± 0.45
3.263GluThr: 3.263 ± 0.605
3.451GluVal: 3.451 ± 0.366
1.192GluTrp: 1.192 ± 0.27
2.573GluTyr: 2.573 ± 0.546
0.0GluXaa: 0.0 ± 0.0
Phe
2.447PheAla: 2.447 ± 0.403
0.816PheCys: 0.816 ± 0.235
1.882PheAsp: 1.882 ± 0.29
1.82PheGlu: 1.82 ± 0.401
1.318PhePhe: 1.318 ± 0.312
2.886PheGly: 2.886 ± 0.47
0.816PheHis: 0.816 ± 0.23
2.322PheIle: 2.322 ± 0.409
1.694PheLys: 1.694 ± 0.307
2.322PheLeu: 2.322 ± 0.337
0.502PheMet: 0.502 ± 0.224
1.882PheAsn: 1.882 ± 0.354
1.067PhePro: 1.067 ± 0.284
1.192PheGln: 1.192 ± 0.392
2.196PheArg: 2.196 ± 0.294
3.388PheSer: 3.388 ± 0.433
2.447PheThr: 2.447 ± 0.304
2.51PheVal: 2.51 ± 0.273
0.439PheTrp: 0.439 ± 0.136
1.443PheTyr: 1.443 ± 0.309
0.0PheXaa: 0.0 ± 0.0
Gly
4.769GlyAla: 4.769 ± 0.627
0.941GlyCys: 0.941 ± 0.261
3.514GlyAsp: 3.514 ± 0.447
4.894GlyGlu: 4.894 ± 0.466
3.137GlyPhe: 3.137 ± 0.412
5.773GlyGly: 5.773 ± 0.71
0.941GlyHis: 0.941 ± 0.227
4.518GlyIle: 4.518 ± 0.412
5.02GlyLys: 5.02 ± 0.594
5.773GlyLeu: 5.773 ± 0.741
1.757GlyMet: 1.757 ± 0.307
3.514GlyAsn: 3.514 ± 0.482
1.82GlyPro: 1.82 ± 0.384
3.075GlyGln: 3.075 ± 0.358
4.016GlyArg: 4.016 ± 0.483
4.016GlySer: 4.016 ± 0.519
3.89GlyThr: 3.89 ± 0.593
5.459GlyVal: 5.459 ± 0.616
1.067GlyTrp: 1.067 ± 0.235
2.384GlyTyr: 2.384 ± 0.368
0.0GlyXaa: 0.0 ± 0.0
His
1.38HisAla: 1.38 ± 0.352
0.125HisCys: 0.125 ± 0.108
0.753HisAsp: 0.753 ± 0.197
0.816HisGlu: 0.816 ± 0.2
0.941HisPhe: 0.941 ± 0.313
1.318HisGly: 1.318 ± 0.246
0.753HisHis: 0.753 ± 0.294
1.443HisIle: 1.443 ± 0.298
0.753HisLys: 0.753 ± 0.219
1.945HisLeu: 1.945 ± 0.387
0.251HisMet: 0.251 ± 0.125
0.502HisAsn: 0.502 ± 0.194
1.129HisPro: 1.129 ± 0.24
0.314HisGln: 0.314 ± 0.182
1.192HisArg: 1.192 ± 0.284
0.69HisSer: 0.69 ± 0.198
0.878HisThr: 0.878 ± 0.233
1.004HisVal: 1.004 ± 0.25
0.125HisTrp: 0.125 ± 0.09
0.627HisTyr: 0.627 ± 0.188
0.0HisXaa: 0.0 ± 0.0
Ile
6.4IleAla: 6.4 ± 0.544
1.255IleCys: 1.255 ± 0.247
3.514IleAsp: 3.514 ± 0.533
4.33IleGlu: 4.33 ± 0.521
1.757IlePhe: 1.757 ± 0.322
3.012IleGly: 3.012 ± 0.63
1.255IleHis: 1.255 ± 0.329
4.016IleIle: 4.016 ± 0.508
2.573IleLys: 2.573 ± 0.471
3.263IleLeu: 3.263 ± 0.407
1.067IleMet: 1.067 ± 0.211
4.016IleAsn: 4.016 ± 0.657
4.204IlePro: 4.204 ± 0.493
1.757IleGln: 1.757 ± 0.252
3.577IleArg: 3.577 ± 0.531
4.455IleSer: 4.455 ± 0.586
4.079IleThr: 4.079 ± 0.519
3.451IleVal: 3.451 ± 0.505
1.004IleTrp: 1.004 ± 0.255
2.698IleTyr: 2.698 ± 0.498
0.0IleXaa: 0.0 ± 0.0
Lys
4.455LysAla: 4.455 ± 0.634
0.439LysCys: 0.439 ± 0.16
2.949LysAsp: 2.949 ± 0.411
3.451LysGlu: 3.451 ± 0.557
1.882LysPhe: 1.882 ± 0.306
3.89LysGly: 3.89 ± 0.484
0.878LysHis: 0.878 ± 0.253
3.828LysIle: 3.828 ± 0.458
3.639LysLys: 3.639 ± 0.581
5.459LysLeu: 5.459 ± 0.586
1.067LysMet: 1.067 ± 0.266
2.447LysAsn: 2.447 ± 0.447
2.698LysPro: 2.698 ± 0.395
2.196LysGln: 2.196 ± 0.331
3.2LysArg: 3.2 ± 0.485
4.518LysSer: 4.518 ± 0.528
3.765LysThr: 3.765 ± 0.428
2.635LysVal: 2.635 ± 0.386
0.753LysTrp: 0.753 ± 0.179
1.757LysTyr: 1.757 ± 0.355
0.0LysXaa: 0.0 ± 0.0
Leu
8.659LeuAla: 8.659 ± 0.802
1.506LeuCys: 1.506 ± 0.324
3.577LeuAsp: 3.577 ± 0.421
4.706LeuGlu: 4.706 ± 0.519
2.322LeuPhe: 2.322 ± 0.459
4.455LeuGly: 4.455 ± 0.677
1.255LeuHis: 1.255 ± 0.355
5.083LeuIle: 5.083 ± 0.559
5.334LeuLys: 5.334 ± 0.57
7.341LeuLeu: 7.341 ± 0.848
2.322LeuMet: 2.322 ± 0.342
4.267LeuAsn: 4.267 ± 0.455
5.334LeuPro: 5.334 ± 0.473
2.635LeuGln: 2.635 ± 0.484
5.71LeuArg: 5.71 ± 0.598
5.584LeuSer: 5.584 ± 0.727
5.71LeuThr: 5.71 ± 0.469
5.584LeuVal: 5.584 ± 0.684
0.816LeuTrp: 0.816 ± 0.206
2.824LeuTyr: 2.824 ± 0.405
0.0LeuXaa: 0.0 ± 0.0
Met
2.949MetAla: 2.949 ± 0.43
0.188MetCys: 0.188 ± 0.11
1.129MetAsp: 1.129 ± 0.276
1.38MetGlu: 1.38 ± 0.268
0.878MetPhe: 0.878 ± 0.236
0.627MetGly: 0.627 ± 0.196
0.314MetHis: 0.314 ± 0.12
1.129MetIle: 1.129 ± 0.259
1.694MetLys: 1.694 ± 0.293
2.384MetLeu: 2.384 ± 0.466
0.439MetMet: 0.439 ± 0.175
0.753MetAsn: 0.753 ± 0.198
1.443MetPro: 1.443 ± 0.275
1.694MetGln: 1.694 ± 0.324
1.694MetArg: 1.694 ± 0.32
1.757MetSer: 1.757 ± 0.299
1.945MetThr: 1.945 ± 0.327
1.129MetVal: 1.129 ± 0.312
0.439MetTrp: 0.439 ± 0.131
0.376MetTyr: 0.376 ± 0.227
0.0MetXaa: 0.0 ± 0.0
Asn
5.208AsnAla: 5.208 ± 0.548
0.439AsnCys: 0.439 ± 0.143
2.886AsnAsp: 2.886 ± 0.44
3.137AsnGlu: 3.137 ± 0.428
1.631AsnPhe: 1.631 ± 0.289
4.267AsnGly: 4.267 ± 0.607
1.004AsnHis: 1.004 ± 0.256
2.886AsnIle: 2.886 ± 0.405
2.259AsnLys: 2.259 ± 0.407
5.02AsnLeu: 5.02 ± 0.528
0.941AsnMet: 0.941 ± 0.241
1.82AsnAsn: 1.82 ± 0.306
2.949AsnPro: 2.949 ± 0.462
2.322AsnGln: 2.322 ± 0.332
2.008AsnArg: 2.008 ± 0.331
2.447AsnSer: 2.447 ± 0.33
2.698AsnThr: 2.698 ± 0.464
2.384AsnVal: 2.384 ± 0.389
0.565AsnTrp: 0.565 ± 0.181
1.882AsnTyr: 1.882 ± 0.403
0.0AsnXaa: 0.0 ± 0.0
Pro
3.075ProAla: 3.075 ± 0.343
0.565ProCys: 0.565 ± 0.192
4.079ProAsp: 4.079 ± 0.511
4.455ProGlu: 4.455 ± 0.632
1.882ProPhe: 1.882 ± 0.431
3.828ProGly: 3.828 ± 0.463
0.627ProHis: 0.627 ± 0.202
2.322ProIle: 2.322 ± 0.347
2.008ProLys: 2.008 ± 0.363
3.953ProLeu: 3.953 ± 0.479
0.941ProMet: 0.941 ± 0.196
2.008ProAsn: 2.008 ± 0.326
1.694ProPro: 1.694 ± 0.326
1.882ProGln: 1.882 ± 0.298
1.255ProArg: 1.255 ± 0.318
2.761ProSer: 2.761 ± 0.388
2.259ProThr: 2.259 ± 0.379
3.953ProVal: 3.953 ± 0.509
0.188ProTrp: 0.188 ± 0.119
1.694ProTyr: 1.694 ± 0.269
0.0ProXaa: 0.0 ± 0.0
Gln
3.702GlnAla: 3.702 ± 0.413
0.376GlnCys: 0.376 ± 0.149
1.694GlnAsp: 1.694 ± 0.317
2.761GlnGlu: 2.761 ± 0.581
1.004GlnPhe: 1.004 ± 0.244
2.322GlnGly: 2.322 ± 0.58
1.004GlnHis: 1.004 ± 0.174
2.196GlnIle: 2.196 ± 0.328
2.886GlnLys: 2.886 ± 0.497
3.451GlnLeu: 3.451 ± 0.457
1.192GlnMet: 1.192 ± 0.284
2.635GlnAsn: 2.635 ± 0.421
2.51GlnPro: 2.51 ± 0.376
2.761GlnGln: 2.761 ± 0.393
2.008GlnArg: 2.008 ± 0.344
2.573GlnSer: 2.573 ± 0.376
1.945GlnThr: 1.945 ± 0.35
2.949GlnVal: 2.949 ± 0.429
0.439GlnTrp: 0.439 ± 0.185
1.631GlnTyr: 1.631 ± 0.351
0.0GlnXaa: 0.0 ± 0.0
Arg
3.326ArgAla: 3.326 ± 0.452
0.753ArgCys: 0.753 ± 0.232
3.639ArgAsp: 3.639 ± 0.54
4.267ArgGlu: 4.267 ± 0.684
1.694ArgPhe: 1.694 ± 0.255
3.451ArgGly: 3.451 ± 0.481
0.816ArgHis: 0.816 ± 0.22
4.079ArgIle: 4.079 ± 0.491
3.388ArgLys: 3.388 ± 0.504
4.267ArgLeu: 4.267 ± 0.4
1.067ArgMet: 1.067 ± 0.258
3.137ArgAsn: 3.137 ± 0.523
1.569ArgPro: 1.569 ± 0.26
3.263ArgGln: 3.263 ± 0.514
3.263ArgArg: 3.263 ± 0.496
1.945ArgSer: 1.945 ± 0.309
3.012ArgThr: 3.012 ± 0.49
3.577ArgVal: 3.577 ± 0.439
0.816ArgTrp: 0.816 ± 0.2
2.259ArgTyr: 2.259 ± 0.358
0.0ArgXaa: 0.0 ± 0.0
Ser
5.459SerAla: 5.459 ± 0.586
0.627SerCys: 0.627 ± 0.154
4.141SerAsp: 4.141 ± 0.4
4.141SerGlu: 4.141 ± 0.696
2.008SerPhe: 2.008 ± 0.413
7.09SerGly: 7.09 ± 0.762
1.004SerHis: 1.004 ± 0.229
2.384SerIle: 2.384 ± 0.361
3.075SerLys: 3.075 ± 0.345
5.71SerLeu: 5.71 ± 0.585
1.38SerMet: 1.38 ± 0.281
2.698SerAsn: 2.698 ± 0.46
3.326SerPro: 3.326 ± 0.53
2.196SerGln: 2.196 ± 0.369
3.577SerArg: 3.577 ± 0.47
5.02SerSer: 5.02 ± 0.718
3.075SerThr: 3.075 ± 0.442
5.334SerVal: 5.334 ± 0.557
0.753SerTrp: 0.753 ± 0.261
2.51SerTyr: 2.51 ± 0.472
0.0SerXaa: 0.0 ± 0.0
Thr
4.643ThrAla: 4.643 ± 0.435
0.69ThrCys: 0.69 ± 0.211
2.949ThrAsp: 2.949 ± 0.481
4.894ThrGlu: 4.894 ± 0.503
1.882ThrPhe: 1.882 ± 0.319
4.769ThrGly: 4.769 ± 0.488
0.439ThrHis: 0.439 ± 0.189
2.949ThrIle: 2.949 ± 0.536
2.51ThrLys: 2.51 ± 0.415
5.145ThrLeu: 5.145 ± 0.527
1.631ThrMet: 1.631 ± 0.338
2.51ThrAsn: 2.51 ± 0.418
4.079ThrPro: 4.079 ± 0.52
2.133ThrGln: 2.133 ± 0.346
2.008ThrArg: 2.008 ± 0.296
3.765ThrSer: 3.765 ± 0.513
4.079ThrThr: 4.079 ± 0.438
4.769ThrVal: 4.769 ± 0.611
1.506ThrTrp: 1.506 ± 0.269
1.631ThrTyr: 1.631 ± 0.351
0.0ThrXaa: 0.0 ± 0.0
Val
5.208ValAla: 5.208 ± 0.471
0.565ValCys: 0.565 ± 0.169
3.577ValAsp: 3.577 ± 0.474
3.702ValGlu: 3.702 ± 0.454
3.012ValPhe: 3.012 ± 0.329
2.824ValGly: 2.824 ± 0.43
0.941ValHis: 0.941 ± 0.218
3.89ValIle: 3.89 ± 0.511
3.263ValLys: 3.263 ± 0.415
4.706ValLeu: 4.706 ± 0.436
2.322ValMet: 2.322 ± 0.358
3.702ValAsn: 3.702 ± 0.46
2.51ValPro: 2.51 ± 0.364
3.326ValGln: 3.326 ± 0.512
4.016ValArg: 4.016 ± 0.484
6.463ValSer: 6.463 ± 0.715
3.953ValThr: 3.953 ± 0.644
3.828ValVal: 3.828 ± 0.483
1.004ValTrp: 1.004 ± 0.356
2.824ValTyr: 2.824 ± 0.343
0.0ValXaa: 0.0 ± 0.0
Trp
0.941TrpAla: 0.941 ± 0.261
0.188TrpCys: 0.188 ± 0.102
0.878TrpAsp: 0.878 ± 0.239
1.004TrpGlu: 1.004 ± 0.235
0.314TrpPhe: 0.314 ± 0.176
1.067TrpGly: 1.067 ± 0.271
0.188TrpHis: 0.188 ± 0.103
1.255TrpIle: 1.255 ± 0.29
0.251TrpLys: 0.251 ± 0.117
1.318TrpLeu: 1.318 ± 0.289
0.314TrpMet: 0.314 ± 0.149
1.067TrpAsn: 1.067 ± 0.267
0.439TrpPro: 0.439 ± 0.159
0.69TrpGln: 0.69 ± 0.187
1.129TrpArg: 1.129 ± 0.21
0.941TrpSer: 0.941 ± 0.242
0.816TrpThr: 0.816 ± 0.228
1.067TrpVal: 1.067 ± 0.211
0.125TrpTrp: 0.125 ± 0.091
0.627TrpTyr: 0.627 ± 0.21
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.196TyrAla: 2.196 ± 0.306
0.816TyrCys: 0.816 ± 0.187
1.631TyrAsp: 1.631 ± 0.397
2.071TyrGlu: 2.071 ± 0.399
1.506TyrPhe: 1.506 ± 0.261
3.388TyrGly: 3.388 ± 0.393
0.878TyrHis: 0.878 ± 0.293
1.945TyrIle: 1.945 ± 0.409
1.945TyrLys: 1.945 ± 0.444
3.2TyrLeu: 3.2 ± 0.401
0.69TyrMet: 0.69 ± 0.237
1.757TyrAsn: 1.757 ± 0.327
1.255TyrPro: 1.255 ± 0.319
1.82TyrGln: 1.82 ± 0.324
2.384TyrArg: 2.384 ± 0.345
2.322TyrSer: 2.322 ± 0.382
2.196TyrThr: 2.196 ± 0.317
1.945TyrVal: 1.945 ± 0.397
0.439TyrTrp: 0.439 ± 0.164
1.506TyrTyr: 1.506 ± 0.293
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (15938 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski