Amino acid dipepetide frequency for Escherichia phage aaroes

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.028AlaAla: 7.028 ± 0.876
0.781AlaCys: 0.781 ± 0.248
5.01AlaAsp: 5.01 ± 0.703
5.206AlaGlu: 5.206 ± 0.667
3.058AlaPhe: 3.058 ± 0.514
5.141AlaGly: 5.141 ± 0.61
1.301AlaHis: 1.301 ± 0.342
5.401AlaIle: 5.401 ± 0.731
6.702AlaLys: 6.702 ± 0.94
6.442AlaLeu: 6.442 ± 0.646
3.123AlaMet: 3.123 ± 0.415
3.904AlaAsn: 3.904 ± 0.665
1.822AlaPro: 1.822 ± 0.287
3.384AlaGln: 3.384 ± 0.648
4.23AlaArg: 4.23 ± 0.661
5.531AlaSer: 5.531 ± 0.623
4.164AlaThr: 4.164 ± 0.861
5.921AlaVal: 5.921 ± 0.596
1.171AlaTrp: 1.171 ± 0.236
2.668AlaTyr: 2.668 ± 0.455
0.0AlaXaa: 0.0 ± 0.0
Cys
1.041CysAla: 1.041 ± 0.285
0.26CysCys: 0.26 ± 0.234
1.692CysAsp: 1.692 ± 0.353
1.041CysGlu: 1.041 ± 0.286
0.39CysPhe: 0.39 ± 0.182
1.627CysGly: 1.627 ± 0.413
0.325CysHis: 0.325 ± 0.143
0.716CysIle: 0.716 ± 0.24
0.976CysLys: 0.976 ± 0.317
0.846CysLeu: 0.846 ± 0.265
0.521CysMet: 0.521 ± 0.177
0.651CysAsn: 0.651 ± 0.206
0.325CysPro: 0.325 ± 0.137
0.325CysGln: 0.325 ± 0.138
0.976CysArg: 0.976 ± 0.294
0.521CysSer: 0.521 ± 0.174
0.781CysThr: 0.781 ± 0.206
0.781CysVal: 0.781 ± 0.232
0.39CysTrp: 0.39 ± 0.149
0.455CysTyr: 0.455 ± 0.131
0.0CysXaa: 0.0 ± 0.0
Asp
5.401AspAla: 5.401 ± 0.571
0.846AspCys: 0.846 ± 0.283
3.709AspAsp: 3.709 ± 0.513
4.164AspGlu: 4.164 ± 0.426
2.798AspPhe: 2.798 ± 0.445
5.986AspGly: 5.986 ± 0.766
0.976AspHis: 0.976 ± 0.27
2.993AspIle: 2.993 ± 0.448
4.034AspLys: 4.034 ± 0.51
4.36AspLeu: 4.36 ± 0.533
1.887AspMet: 1.887 ± 0.348
2.473AspAsn: 2.473 ± 0.456
2.343AspPro: 2.343 ± 0.381
2.017AspGln: 2.017 ± 0.384
2.538AspArg: 2.538 ± 0.397
3.254AspSer: 3.254 ± 0.494
3.123AspThr: 3.123 ± 0.419
3.319AspVal: 3.319 ± 0.437
1.106AspTrp: 1.106 ± 0.31
2.668AspTyr: 2.668 ± 0.443
0.0AspXaa: 0.0 ± 0.0
Glu
5.661GluAla: 5.661 ± 0.79
1.236GluCys: 1.236 ± 0.33
3.188GluAsp: 3.188 ± 0.435
3.644GluGlu: 3.644 ± 0.544
3.254GluPhe: 3.254 ± 0.44
3.644GluGly: 3.644 ± 0.444
1.301GluHis: 1.301 ± 0.325
5.401GluIle: 5.401 ± 0.529
4.23GluLys: 4.23 ± 0.603
5.01GluLeu: 5.01 ± 0.71
2.993GluMet: 2.993 ± 0.427
3.058GluAsn: 3.058 ± 0.53
1.692GluPro: 1.692 ± 0.324
2.603GluGln: 2.603 ± 0.477
3.384GluArg: 3.384 ± 0.642
4.685GluSer: 4.685 ± 0.474
3.579GluThr: 3.579 ± 0.549
4.555GluVal: 4.555 ± 0.637
0.716GluTrp: 0.716 ± 0.254
2.928GluTyr: 2.928 ± 0.461
0.0GluXaa: 0.0 ± 0.0
Phe
2.408PheAla: 2.408 ± 0.38
0.325PheCys: 0.325 ± 0.142
2.928PheAsp: 2.928 ± 0.485
2.277PheGlu: 2.277 ± 0.366
1.106PhePhe: 1.106 ± 0.328
3.188PheGly: 3.188 ± 0.466
0.586PheHis: 0.586 ± 0.197
2.538PheIle: 2.538 ± 0.474
3.188PheLys: 3.188 ± 0.392
2.343PheLeu: 2.343 ± 0.429
1.301PheMet: 1.301 ± 0.342
2.538PheAsn: 2.538 ± 0.415
1.432PhePro: 1.432 ± 0.378
1.301PheGln: 1.301 ± 0.334
1.952PheArg: 1.952 ± 0.306
2.147PheSer: 2.147 ± 0.427
2.863PheThr: 2.863 ± 0.44
2.863PheVal: 2.863 ± 0.443
0.846PheTrp: 0.846 ± 0.254
1.106PheTyr: 1.106 ± 0.336
0.0PheXaa: 0.0 ± 0.0
Gly
4.945GlyAla: 4.945 ± 0.866
1.497GlyCys: 1.497 ± 0.31
4.62GlyAsp: 4.62 ± 0.646
5.075GlyGlu: 5.075 ± 0.727
3.449GlyPhe: 3.449 ± 0.511
6.572GlyGly: 6.572 ± 1.063
1.692GlyHis: 1.692 ± 0.341
3.969GlyIle: 3.969 ± 0.551
6.702GlyLys: 6.702 ± 0.56
4.945GlyLeu: 4.945 ± 0.477
2.798GlyMet: 2.798 ± 0.41
3.579GlyAsn: 3.579 ± 0.461
0.325GlyPro: 0.325 ± 0.141
2.147GlyGln: 2.147 ± 0.356
2.668GlyArg: 2.668 ± 0.332
5.726GlySer: 5.726 ± 0.648
3.319GlyThr: 3.319 ± 0.554
5.986GlyVal: 5.986 ± 0.688
1.301GlyTrp: 1.301 ± 0.289
3.384GlyTyr: 3.384 ± 0.412
0.0GlyXaa: 0.0 ± 0.0
His
0.716HisAla: 0.716 ± 0.27
0.325HisCys: 0.325 ± 0.152
1.236HisAsp: 1.236 ± 0.323
1.041HisGlu: 1.041 ± 0.32
0.716HisPhe: 0.716 ± 0.24
1.627HisGly: 1.627 ± 0.422
0.586HisHis: 0.586 ± 0.261
1.301HisIle: 1.301 ± 0.337
1.497HisLys: 1.497 ± 0.363
1.171HisLeu: 1.171 ± 0.315
0.26HisMet: 0.26 ± 0.158
1.041HisAsn: 1.041 ± 0.346
0.716HisPro: 0.716 ± 0.242
0.846HisGln: 0.846 ± 0.256
1.236HisArg: 1.236 ± 0.335
0.976HisSer: 0.976 ± 0.245
1.106HisThr: 1.106 ± 0.328
1.301HisVal: 1.301 ± 0.297
0.195HisTrp: 0.195 ± 0.103
1.432HisTyr: 1.432 ± 0.318
0.0HisXaa: 0.0 ± 0.0
Ile
6.182IleAla: 6.182 ± 0.837
1.236IleCys: 1.236 ± 0.315
4.815IleAsp: 4.815 ± 0.673
4.685IleGlu: 4.685 ± 0.453
2.147IlePhe: 2.147 ± 0.385
4.49IleGly: 4.49 ± 0.538
1.432IleHis: 1.432 ± 0.388
3.969IleIle: 3.969 ± 0.54
5.141IleLys: 5.141 ± 0.631
2.798IleLeu: 2.798 ± 0.357
1.822IleMet: 1.822 ± 0.343
3.384IleAsn: 3.384 ± 0.602
2.147IlePro: 2.147 ± 0.459
2.733IleGln: 2.733 ± 0.569
2.993IleArg: 2.993 ± 0.489
4.034IleSer: 4.034 ± 0.58
4.164IleThr: 4.164 ± 0.485
4.099IleVal: 4.099 ± 0.516
0.911IleTrp: 0.911 ± 0.287
2.408IleTyr: 2.408 ± 0.383
0.0IleXaa: 0.0 ± 0.0
Lys
6.963LysAla: 6.963 ± 1.149
1.041LysCys: 1.041 ± 0.273
4.49LysAsp: 4.49 ± 0.595
5.921LysGlu: 5.921 ± 0.63
3.058LysPhe: 3.058 ± 0.485
3.839LysGly: 3.839 ± 0.516
1.366LysHis: 1.366 ± 0.375
3.969LysIle: 3.969 ± 0.62
4.164LysLys: 4.164 ± 0.496
5.336LysLeu: 5.336 ± 0.583
3.514LysMet: 3.514 ± 0.482
3.123LysAsn: 3.123 ± 0.602
3.904LysPro: 3.904 ± 0.56
3.384LysGln: 3.384 ± 0.519
4.295LysArg: 4.295 ± 0.602
3.188LysSer: 3.188 ± 0.438
4.62LysThr: 4.62 ± 0.587
5.141LysVal: 5.141 ± 0.482
1.041LysTrp: 1.041 ± 0.287
1.952LysTyr: 1.952 ± 0.352
0.0LysXaa: 0.0 ± 0.0
Leu
5.466LeuAla: 5.466 ± 0.648
0.716LeuCys: 0.716 ± 0.196
2.993LeuAsp: 2.993 ± 0.471
4.164LeuGlu: 4.164 ± 0.56
1.887LeuPhe: 1.887 ± 0.381
3.774LeuGly: 3.774 ± 0.589
1.106LeuHis: 1.106 ± 0.289
5.141LeuIle: 5.141 ± 0.579
5.01LeuLys: 5.01 ± 0.632
3.449LeuLeu: 3.449 ± 0.432
1.952LeuMet: 1.952 ± 0.364
3.709LeuAsn: 3.709 ± 0.458
3.188LeuPro: 3.188 ± 0.511
1.757LeuGln: 1.757 ± 0.391
4.034LeuArg: 4.034 ± 0.51
4.295LeuSer: 4.295 ± 0.549
4.62LeuThr: 4.62 ± 0.455
3.644LeuVal: 3.644 ± 0.487
1.041LeuTrp: 1.041 ± 0.267
2.473LeuTyr: 2.473 ± 0.328
0.0LeuXaa: 0.0 ± 0.0
Met
3.644MetAla: 3.644 ± 0.462
0.39MetCys: 0.39 ± 0.139
1.366MetAsp: 1.366 ± 0.352
1.757MetGlu: 1.757 ± 0.359
1.041MetPhe: 1.041 ± 0.281
1.171MetGly: 1.171 ± 0.259
0.651MetHis: 0.651 ± 0.223
2.473MetIle: 2.473 ± 0.373
3.058MetLys: 3.058 ± 0.467
2.212MetLeu: 2.212 ± 0.436
1.041MetMet: 1.041 ± 0.277
1.432MetAsn: 1.432 ± 0.369
1.171MetPro: 1.171 ± 0.249
1.301MetGln: 1.301 ± 0.281
1.692MetArg: 1.692 ± 0.308
2.473MetSer: 2.473 ± 0.348
1.952MetThr: 1.952 ± 0.367
1.887MetVal: 1.887 ± 0.301
0.26MetTrp: 0.26 ± 0.138
1.041MetTyr: 1.041 ± 0.248
0.0MetXaa: 0.0 ± 0.0
Asn
4.685AsnAla: 4.685 ± 0.514
0.651AsnCys: 0.651 ± 0.253
2.473AsnAsp: 2.473 ± 0.432
3.579AsnGlu: 3.579 ± 0.464
1.692AsnPhe: 1.692 ± 0.337
5.791AsnGly: 5.791 ± 0.581
1.041AsnHis: 1.041 ± 0.335
3.254AsnIle: 3.254 ± 0.405
3.254AsnLys: 3.254 ± 0.56
2.473AsnLeu: 2.473 ± 0.474
1.106AsnMet: 1.106 ± 0.31
2.538AsnAsn: 2.538 ± 0.419
1.627AsnPro: 1.627 ± 0.294
1.822AsnGln: 1.822 ± 0.37
1.627AsnArg: 1.627 ± 0.323
2.538AsnSer: 2.538 ± 0.442
1.757AsnThr: 1.757 ± 0.466
2.863AsnVal: 2.863 ± 0.336
0.651AsnTrp: 0.651 ± 0.164
1.887AsnTyr: 1.887 ± 0.362
0.0AsnXaa: 0.0 ± 0.0
Pro
2.212ProAla: 2.212 ± 0.387
0.586ProCys: 0.586 ± 0.228
2.408ProAsp: 2.408 ± 0.52
3.254ProGlu: 3.254 ± 0.587
1.562ProPhe: 1.562 ± 0.271
2.733ProGly: 2.733 ± 0.471
0.521ProHis: 0.521 ± 0.171
2.017ProIle: 2.017 ± 0.512
1.497ProLys: 1.497 ± 0.298
1.822ProLeu: 1.822 ± 0.367
0.846ProMet: 0.846 ± 0.225
1.366ProAsn: 1.366 ± 0.351
1.236ProPro: 1.236 ± 0.284
1.171ProGln: 1.171 ± 0.294
1.952ProArg: 1.952 ± 0.437
1.562ProSer: 1.562 ± 0.322
1.822ProThr: 1.822 ± 0.344
2.733ProVal: 2.733 ± 0.408
0.455ProTrp: 0.455 ± 0.155
1.497ProTyr: 1.497 ± 0.324
0.0ProXaa: 0.0 ± 0.0
Gln
3.188GlnAla: 3.188 ± 0.528
0.586GlnCys: 0.586 ± 0.188
1.887GlnAsp: 1.887 ± 0.453
2.017GlnGlu: 2.017 ± 0.382
1.497GlnPhe: 1.497 ± 0.299
2.408GlnGly: 2.408 ± 0.351
0.781GlnHis: 0.781 ± 0.239
2.668GlnIle: 2.668 ± 0.479
2.343GlnLys: 2.343 ± 0.44
2.538GlnLeu: 2.538 ± 0.576
0.716GlnMet: 0.716 ± 0.305
1.366GlnAsn: 1.366 ± 0.341
1.432GlnPro: 1.432 ± 0.384
2.798GlnGln: 2.798 ± 0.732
2.277GlnArg: 2.277 ± 0.313
2.473GlnSer: 2.473 ± 0.458
1.822GlnThr: 1.822 ± 0.462
3.123GlnVal: 3.123 ± 0.373
0.455GlnTrp: 0.455 ± 0.165
1.757GlnTyr: 1.757 ± 0.36
0.0GlnXaa: 0.0 ± 0.0
Arg
4.034ArgAla: 4.034 ± 0.567
0.911ArgCys: 0.911 ± 0.318
2.668ArgAsp: 2.668 ± 0.471
3.384ArgGlu: 3.384 ± 0.394
1.757ArgPhe: 1.757 ± 0.328
3.254ArgGly: 3.254 ± 0.344
0.651ArgHis: 0.651 ± 0.215
3.904ArgIle: 3.904 ± 0.429
4.295ArgLys: 4.295 ± 0.564
2.928ArgLeu: 2.928 ± 0.499
1.562ArgMet: 1.562 ± 0.312
1.952ArgAsn: 1.952 ± 0.503
1.952ArgPro: 1.952 ± 0.426
1.887ArgGln: 1.887 ± 0.402
3.254ArgArg: 3.254 ± 0.532
2.863ArgSer: 2.863 ± 0.448
2.082ArgThr: 2.082 ± 0.342
4.49ArgVal: 4.49 ± 0.651
0.586ArgTrp: 0.586 ± 0.176
2.277ArgTyr: 2.277 ± 0.381
0.0ArgXaa: 0.0 ± 0.0
Ser
5.075SerAla: 5.075 ± 0.684
0.911SerCys: 0.911 ± 0.272
3.774SerAsp: 3.774 ± 0.464
4.815SerGlu: 4.815 ± 0.492
2.733SerPhe: 2.733 ± 0.425
5.531SerGly: 5.531 ± 0.611
1.301SerHis: 1.301 ± 0.319
4.36SerIle: 4.36 ± 0.685
4.36SerLys: 4.36 ± 0.471
4.23SerLeu: 4.23 ± 0.439
1.301SerMet: 1.301 ± 0.283
2.343SerAsn: 2.343 ± 0.429
1.952SerPro: 1.952 ± 0.361
2.017SerGln: 2.017 ± 0.445
3.058SerArg: 3.058 ± 0.405
3.319SerSer: 3.319 ± 0.648
3.254SerThr: 3.254 ± 0.521
4.295SerVal: 4.295 ± 0.524
0.976SerTrp: 0.976 ± 0.3
2.473SerTyr: 2.473 ± 0.361
0.0SerXaa: 0.0 ± 0.0
Thr
4.555ThrAla: 4.555 ± 0.72
0.651ThrCys: 0.651 ± 0.307
3.058ThrAsp: 3.058 ± 0.394
2.863ThrGlu: 2.863 ± 0.563
2.343ThrPhe: 2.343 ± 0.39
6.117ThrGly: 6.117 ± 0.858
0.455ThrHis: 0.455 ± 0.146
3.709ThrIle: 3.709 ± 0.546
3.384ThrLys: 3.384 ± 0.687
4.034ThrLeu: 4.034 ± 0.574
1.236ThrMet: 1.236 ± 0.285
3.254ThrAsn: 3.254 ± 0.483
2.408ThrPro: 2.408 ± 0.412
2.277ThrGln: 2.277 ± 0.401
1.497ThrArg: 1.497 ± 0.327
4.295ThrSer: 4.295 ± 0.609
2.343ThrThr: 2.343 ± 0.452
3.904ThrVal: 3.904 ± 0.46
0.846ThrTrp: 0.846 ± 0.227
1.757ThrTyr: 1.757 ± 0.364
0.0ThrXaa: 0.0 ± 0.0
Val
4.88ValAla: 4.88 ± 0.55
0.976ValCys: 0.976 ± 0.256
4.62ValAsp: 4.62 ± 0.515
4.815ValGlu: 4.815 ± 0.602
2.603ValPhe: 2.603 ± 0.4
3.904ValGly: 3.904 ± 0.479
1.236ValHis: 1.236 ± 0.333
4.75ValIle: 4.75 ± 0.557
6.442ValLys: 6.442 ± 0.72
4.425ValLeu: 4.425 ± 0.481
2.343ValMet: 2.343 ± 0.336
3.254ValAsn: 3.254 ± 0.416
1.757ValPro: 1.757 ± 0.498
2.212ValGln: 2.212 ± 0.494
3.514ValArg: 3.514 ± 0.515
4.555ValSer: 4.555 ± 0.699
4.685ValThr: 4.685 ± 0.541
4.62ValVal: 4.62 ± 0.457
0.846ValTrp: 0.846 ± 0.212
2.082ValTyr: 2.082 ± 0.378
0.0ValXaa: 0.0 ± 0.0
Trp
0.846TrpAla: 0.846 ± 0.267
0.26TrpCys: 0.26 ± 0.188
0.716TrpAsp: 0.716 ± 0.2
0.651TrpGlu: 0.651 ± 0.209
0.846TrpPhe: 0.846 ± 0.306
1.106TrpGly: 1.106 ± 0.224
0.716TrpHis: 0.716 ± 0.196
1.041TrpIle: 1.041 ± 0.227
0.846TrpLys: 0.846 ± 0.2
1.171TrpLeu: 1.171 ± 0.247
0.39TrpMet: 0.39 ± 0.134
0.586TrpAsn: 0.586 ± 0.197
0.325TrpPro: 0.325 ± 0.154
0.325TrpGln: 0.325 ± 0.151
1.236TrpArg: 1.236 ± 0.267
0.976TrpSer: 0.976 ± 0.256
0.781TrpThr: 0.781 ± 0.223
0.521TrpVal: 0.521 ± 0.173
0.13TrpTrp: 0.13 ± 0.093
0.651TrpTyr: 0.651 ± 0.181
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.058TyrAla: 3.058 ± 0.602
0.455TyrCys: 0.455 ± 0.161
2.343TyrAsp: 2.343 ± 0.353
2.408TyrGlu: 2.408 ± 0.387
1.301TyrPhe: 1.301 ± 0.329
2.798TyrGly: 2.798 ± 0.463
1.301TyrHis: 1.301 ± 0.282
2.017TyrIle: 2.017 ± 0.387
3.254TyrLys: 3.254 ± 0.465
1.887TyrLeu: 1.887 ± 0.337
1.236TyrMet: 1.236 ± 0.288
1.757TyrAsn: 1.757 ± 0.312
1.497TyrPro: 1.497 ± 0.341
1.822TyrGln: 1.822 ± 0.339
2.277TyrArg: 2.277 ± 0.372
2.668TyrSer: 2.668 ± 0.399
2.147TyrThr: 2.147 ± 0.324
2.473TyrVal: 2.473 ± 0.331
0.195TyrTrp: 0.195 ± 0.106
1.041TyrTyr: 1.041 ± 0.275
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (15369 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski