Amino acid dipeptide frequency for Escherichia phage P AB-2017

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
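As a rough illustration of such a calculation, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. It is not the code used to build this page: the function name `dipeptide_permille`, the use of one-letter codes, the mapping of non-standard letters to X (Xaa), and the choice of the total number of counted dipeptides as the denominator are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: frequencies are normalised by the total number of
    dipeptides counted; the page does not state its denominator.
    """
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard letters is treated as X (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs, never spanning two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences (hypothetical, not from this proteome).
freqs = dipeptide_permille(["MKAAL", "AAC"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Counting pairs per protein, rather than over one concatenated string, avoids creating artificial dipeptides at the junctions between proteins.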

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
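The uniform expectation and the over/under comparison can be sketched as follows. The page does not state the exact rule behind the colouring (for instance whether the baseline uses 400 or 441 dipeptides, or whether the error is taken into account), so the plain comparison against 1000/400 = 2.5‰ and the helper names `expected_permille` and `classify` are assumptions.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation in per mille: 1000 / (number of residue types)^2."""
    return 1000.0 / n_residue_types ** 2


def classify(value_permille, expected=None):
    """Label a dipeptide as 'over', 'under', or 'as expected'.

    Assumption: a simple comparison with the uniform expectation,
    ignoring the bootstrap error reported in the table.
    """
    if expected is None:
        expected = expected_permille()
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


print(expected_permille())    # 2.5
print(classify(11.789))       # AlaAla from the table: over
print(classify(0.159))        # CysCys from the table: under
```

Passing `expected_permille(21)` as the baseline would instead use the 441-dipeptide alphabet that includes Xaa.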

All values are given in per mille (‰); multiply a value by 10⁻³ to obtain the corresponding fraction.
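A small sketch of the unit conversion, using the AlaAla value from the table as input. The helper names are hypothetical and only serve the example.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 11.789  # AlaAla, in per mille
print(f"{permille_to_fraction(ala_ala):.6f}")  # 0.011789
print(f"{permille_to_percent(ala_ala):.4f}")   # 1.1789
```

In other words, roughly 12 out of every 1000 dipeptides in this proteome are AlaAla.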

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.789 ± 1.721
AlaCys: 1.434 ± 0.376
AlaAsp: 6.134 ± 0.754
AlaGlu: 6.691 ± 0.886
AlaPhe: 4.142 ± 0.651
AlaGly: 7.328 ± 0.78
AlaHis: 1.434 ± 0.358
AlaIle: 5.257 ± 0.794
AlaLys: 6.054 ± 0.85
AlaLeu: 8.842 ± 0.854
AlaMet: 1.832 ± 0.45
AlaAsn: 3.585 ± 0.555
AlaPro: 3.903 ± 0.49
AlaGln: 3.186 ± 0.715
AlaArg: 4.142 ± 0.501
AlaSer: 5.576 ± 0.782
AlaThr: 5.895 ± 0.834
AlaVal: 7.647 ± 0.691
AlaTrp: 1.832 ± 0.431
AlaTyr: 3.266 ± 0.382
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.956 ± 0.306
CysCys: 0.159 ± 0.109
CysAsp: 1.115 ± 0.352
CysGlu: 0.956 ± 0.335
CysPhe: 0.319 ± 0.147
CysGly: 0.876 ± 0.271
CysHis: 0.239 ± 0.147
CysIle: 0.478 ± 0.199
CysLys: 0.637 ± 0.232
CysLeu: 0.637 ± 0.235
CysMet: 0.08 ± 0.067
CysAsn: 0.319 ± 0.194
CysPro: 0.08 ± 0.079
CysGln: 0.159 ± 0.108
CysArg: 0.637 ± 0.241
CysSer: 0.797 ± 0.294
CysThr: 0.558 ± 0.246
CysVal: 0.797 ± 0.256
CysTrp: 0.239 ± 0.144
CysTyr: 0.478 ± 0.196
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.372 ± 0.798
AspCys: 0.717 ± 0.245
AspAsp: 5.018 ± 0.679
AspGlu: 4.381 ± 0.553
AspPhe: 2.23 ± 0.384
AspGly: 6.452 ± 0.905
AspHis: 1.036 ± 0.258
AspIle: 4.54 ± 0.511
AspLys: 2.947 ± 0.451
AspLeu: 4.54 ± 0.656
AspMet: 1.673 ± 0.306
AspAsn: 2.39 ± 0.385
AspPro: 1.274 ± 0.355
AspGln: 1.036 ± 0.33
AspArg: 2.708 ± 0.464
AspSer: 2.549 ± 0.418
AspThr: 4.54 ± 0.469
AspVal: 4.222 ± 0.533
AspTrp: 0.876 ± 0.201
AspTyr: 2.469 ± 0.316
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.895 ± 0.627
GluCys: 0.398 ± 0.257
GluAsp: 3.505 ± 0.598
GluGlu: 5.178 ± 0.925
GluPhe: 2.708 ± 0.508
GluGly: 4.461 ± 0.688
GluHis: 0.876 ± 0.235
GluIle: 2.947 ± 0.553
GluLys: 4.301 ± 0.633
GluLeu: 5.895 ± 0.677
GluMet: 2.151 ± 0.475
GluAsn: 1.912 ± 0.39
GluPro: 2.071 ± 0.484
GluGln: 3.505 ± 0.928
GluArg: 3.744 ± 0.584
GluSer: 3.107 ± 0.497
GluThr: 3.425 ± 0.635
GluVal: 4.859 ± 0.503
GluTrp: 1.354 ± 0.326
GluTyr: 2.868 ± 0.485
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.23 ± 0.4
PheCys: 0.637 ± 0.235
PheAsp: 3.027 ± 0.5
PheGlu: 2.39 ± 0.36
PhePhe: 1.195 ± 0.318
PheGly: 3.266 ± 0.442
PheHis: 0.398 ± 0.16
PheIle: 2.708 ± 0.522
PheLys: 2.629 ± 0.402
PheLeu: 2.071 ± 0.422
PheMet: 0.717 ± 0.186
PheAsn: 1.991 ± 0.387
PhePro: 1.274 ± 0.383
PheGln: 1.115 ± 0.319
PheArg: 1.991 ± 0.335
PheSer: 3.027 ± 0.546
PheThr: 2.868 ± 0.388
PheVal: 2.23 ± 0.387
PheTrp: 0.478 ± 0.183
PheTyr: 1.513 ± 0.35
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.249 ± 0.684
GlyCys: 1.115 ± 0.267
GlyAsp: 4.939 ± 0.815
GlyGlu: 5.257 ± 0.678
GlyPhe: 3.505 ± 0.546
GlyGly: 5.337 ± 0.678
GlyHis: 1.593 ± 0.487
GlyIle: 2.469 ± 0.422
GlyLys: 5.417 ± 0.754
GlyLeu: 5.496 ± 0.585
GlyMet: 2.31 ± 0.593
GlyAsn: 3.346 ± 0.495
GlyPro: 1.991 ± 0.379
GlyGln: 3.346 ± 0.557
GlyArg: 3.425 ± 0.488
GlySer: 5.576 ± 0.786
GlyThr: 4.381 ± 0.488
GlyVal: 6.213 ± 0.983
GlyTrp: 1.195 ± 0.235
GlyTyr: 2.469 ± 0.441
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.195 ± 0.243
HisCys: 0.478 ± 0.166
HisAsp: 1.036 ± 0.254
HisGlu: 0.797 ± 0.224
HisPhe: 0.797 ± 0.214
HisGly: 1.115 ± 0.342
HisHis: 0.717 ± 0.286
HisIle: 0.956 ± 0.282
HisLys: 1.195 ± 0.381
HisLeu: 1.434 ± 0.429
HisMet: 0.398 ± 0.16
HisAsn: 0.558 ± 0.194
HisPro: 0.956 ± 0.279
HisGln: 0.558 ± 0.189
HisArg: 1.274 ± 0.321
HisSer: 0.717 ± 0.227
HisThr: 0.717 ± 0.317
HisVal: 1.354 ± 0.412
HisTrp: 0.08 ± 0.084
HisTyr: 0.319 ± 0.151
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.859 ± 0.531
IleCys: 0.717 ± 0.243
IleAsp: 3.346 ± 0.461
IleGlu: 3.425 ± 0.452
IlePhe: 1.195 ± 0.363
IleGly: 2.708 ± 0.45
IleHis: 0.637 ± 0.241
IleIle: 2.868 ± 0.366
IleLys: 3.027 ± 0.539
IleLeu: 3.266 ± 0.54
IleMet: 1.115 ± 0.317
IleAsn: 3.027 ± 0.483
IlePro: 2.947 ± 0.481
IleGln: 1.912 ± 0.385
IleArg: 1.673 ± 0.363
IleSer: 4.062 ± 0.51
IleThr: 4.62 ± 0.614
IleVal: 3.983 ± 0.485
IleTrp: 1.036 ± 0.285
IleTyr: 1.195 ± 0.299
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.213 ± 0.933
LysCys: 0.398 ± 0.166
LysAsp: 4.62 ± 0.719
LysGlu: 4.54 ± 0.742
LysPhe: 1.673 ± 0.265
LysGly: 4.062 ± 0.548
LysHis: 1.354 ± 0.314
LysIle: 1.991 ± 0.388
LysLys: 3.186 ± 0.527
LysLeu: 4.859 ± 0.589
LysMet: 2.629 ± 0.454
LysAsn: 1.832 ± 0.357
LysPro: 2.39 ± 0.447
LysGln: 1.912 ± 0.45
LysArg: 3.823 ± 0.641
LysSer: 2.708 ± 0.477
LysThr: 4.062 ± 0.524
LysVal: 3.585 ± 0.555
LysTrp: 0.637 ± 0.244
LysTyr: 2.549 ± 0.355
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.32 ± 0.817
LeuCys: 0.876 ± 0.249
LeuAsp: 3.664 ± 0.576
LeuGlu: 4.939 ± 0.779
LeuPhe: 2.629 ± 0.61
LeuGly: 5.656 ± 0.549
LeuHis: 1.115 ± 0.299
LeuIle: 4.062 ± 0.379
LeuLys: 4.301 ± 0.566
LeuLeu: 5.815 ± 0.717
LeuMet: 1.912 ± 0.311
LeuAsn: 3.903 ± 0.784
LeuPro: 3.823 ± 0.591
LeuGln: 2.708 ± 0.463
LeuArg: 4.381 ± 0.531
LeuSer: 4.939 ± 0.543
LeuThr: 5.815 ± 0.774
LeuVal: 4.62 ± 0.613
LeuTrp: 0.956 ± 0.313
LeuTyr: 2.868 ± 0.538
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.027 ± 0.526
MetCys: 0.398 ± 0.136
MetAsp: 0.717 ± 0.3
MetGlu: 1.195 ± 0.308
MetPhe: 0.876 ± 0.302
MetGly: 2.071 ± 0.4
MetHis: 0.319 ± 0.191
MetIle: 1.832 ± 0.411
MetLys: 1.832 ± 0.447
MetLeu: 1.832 ± 0.378
MetMet: 0.398 ± 0.173
MetAsn: 0.558 ± 0.198
MetPro: 1.673 ± 0.446
MetGln: 0.637 ± 0.225
MetArg: 1.593 ± 0.299
MetSer: 2.23 ± 0.301
MetThr: 1.832 ± 0.393
MetVal: 1.673 ± 0.342
MetTrp: 0.319 ± 0.133
MetTyr: 0.319 ± 0.169
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.062 ± 0.444
AsnCys: 0.398 ± 0.164
AsnAsp: 2.39 ± 0.305
AsnGlu: 2.23 ± 0.385
AsnPhe: 1.115 ± 0.256
AsnGly: 3.983 ± 0.526
AsnHis: 0.478 ± 0.172
AsnIle: 1.752 ± 0.441
AsnLys: 2.071 ± 0.357
AsnLeu: 2.788 ± 0.408
AsnMet: 1.115 ± 0.327
AsnAsn: 2.071 ± 0.39
AsnPro: 1.752 ± 0.362
AsnGln: 1.673 ± 0.381
AsnArg: 2.31 ± 0.516
AsnSer: 2.708 ± 0.358
AsnThr: 2.868 ± 0.54
AsnVal: 3.903 ± 0.445
AsnTrp: 0.797 ± 0.218
AsnTyr: 1.354 ± 0.27
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.505 ± 0.542
ProCys: 0.398 ± 0.159
ProAsp: 3.107 ± 0.45
ProGlu: 3.585 ± 0.56
ProPhe: 1.832 ± 0.444
ProGly: 3.027 ± 0.437
ProHis: 0.478 ± 0.192
ProIle: 1.832 ± 0.363
ProLys: 1.593 ± 0.452
ProLeu: 2.788 ± 0.488
ProMet: 0.876 ± 0.344
ProAsn: 1.991 ± 0.363
ProPro: 1.195 ± 0.362
ProGln: 0.956 ± 0.193
ProArg: 1.752 ± 0.414
ProSer: 2.39 ± 0.381
ProThr: 2.788 ± 0.573
ProVal: 3.585 ± 0.528
ProTrp: 0.398 ± 0.167
ProTyr: 1.115 ± 0.297
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.903 ± 0.812
GlnCys: 0.319 ± 0.216
GlnAsp: 1.752 ± 0.342
GlnGlu: 1.912 ± 0.386
GlnPhe: 1.195 ± 0.311
GlnGly: 1.832 ± 0.377
GlnHis: 0.558 ± 0.212
GlnIle: 2.151 ± 0.317
GlnLys: 2.549 ± 0.614
GlnLeu: 3.186 ± 0.514
GlnMet: 1.434 ± 0.327
GlnAsn: 1.593 ± 0.432
GlnPro: 1.752 ± 0.388
GlnGln: 2.39 ± 0.625
GlnArg: 2.708 ± 0.443
GlnSer: 1.832 ± 0.352
GlnThr: 1.832 ± 0.374
GlnVal: 2.39 ± 0.398
GlnTrp: 0.797 ± 0.29
GlnTyr: 1.434 ± 0.362
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.983 ± 0.512
ArgCys: 0.319 ± 0.189
ArgAsp: 2.469 ± 0.391
ArgGlu: 3.425 ± 0.53
ArgPhe: 1.991 ± 0.314
ArgGly: 3.744 ± 0.523
ArgHis: 1.274 ± 0.272
ArgIle: 3.027 ± 0.438
ArgLys: 3.744 ± 0.494
ArgLeu: 4.62 ± 0.585
ArgMet: 1.593 ± 0.304
ArgAsn: 2.629 ± 0.356
ArgPro: 1.593 ± 0.343
ArgGln: 3.186 ± 0.6
ArgArg: 4.461 ± 0.628
ArgSer: 2.868 ± 0.368
ArgThr: 2.868 ± 0.449
ArgVal: 3.346 ± 0.435
ArgTrp: 0.637 ± 0.24
ArgTyr: 1.274 ± 0.292
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.815 ± 0.596
SerCys: 0.319 ± 0.143
SerAsp: 3.505 ± 0.508
SerGlu: 2.868 ± 0.557
SerPhe: 2.39 ± 0.473
SerGly: 7.01 ± 0.725
SerHis: 1.354 ± 0.301
SerIle: 2.788 ± 0.662
SerLys: 3.505 ± 0.547
SerLeu: 4.779 ± 0.514
SerMet: 1.274 ± 0.367
SerAsn: 2.549 ± 0.483
SerPro: 2.868 ± 0.569
SerGln: 2.469 ± 0.534
SerArg: 3.186 ± 0.477
SerSer: 4.062 ± 0.927
SerThr: 4.222 ± 0.473
SerVal: 4.142 ± 0.493
SerTrp: 0.398 ± 0.2
SerTyr: 2.23 ± 0.48
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.488 ± 0.959
ThrCys: 0.478 ± 0.162
ThrAsp: 4.301 ± 0.538
ThrGlu: 3.744 ± 0.509
ThrPhe: 3.107 ± 0.54
ThrGly: 5.496 ± 0.968
ThrHis: 1.115 ± 0.354
ThrIle: 3.425 ± 0.444
ThrLys: 3.346 ± 0.526
ThrLeu: 5.576 ± 0.495
ThrMet: 1.036 ± 0.282
ThrAsn: 2.31 ± 0.425
ThrPro: 4.222 ± 0.666
ThrGln: 1.912 ± 0.402
ThrArg: 2.39 ± 0.311
ThrSer: 3.585 ± 0.473
ThrThr: 3.983 ± 0.658
ThrVal: 3.823 ± 0.577
ThrTrp: 1.115 ± 0.271
ThrTyr: 2.23 ± 0.409
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.328 ± 0.728
ValCys: 0.478 ± 0.19
ValAsp: 3.903 ± 0.536
ValGlu: 4.939 ± 0.705
ValPhe: 1.991 ± 0.395
ValGly: 3.983 ± 0.635
ValHis: 0.797 ± 0.222
ValIle: 4.859 ± 0.729
ValLys: 4.381 ± 0.639
ValLeu: 5.576 ± 0.79
ValMet: 1.513 ± 0.329
ValAsn: 3.107 ± 0.607
ValPro: 1.434 ± 0.332
ValGln: 2.549 ± 0.503
ValArg: 4.062 ± 0.435
ValSer: 5.576 ± 0.6
ValThr: 5.098 ± 0.825
ValVal: 4.939 ± 0.657
ValTrp: 0.876 ± 0.283
ValTyr: 2.788 ± 0.552
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.354 ± 0.442
TrpCys: 0.08 ± 0.068
TrpAsp: 1.036 ± 0.252
TrpGlu: 0.558 ± 0.231
TrpPhe: 1.036 ± 0.354
TrpGly: 0.797 ± 0.225
TrpHis: 0.239 ± 0.15
TrpIle: 0.398 ± 0.189
TrpLys: 0.717 ± 0.187
TrpLeu: 1.832 ± 0.342
TrpMet: 0.558 ± 0.206
TrpAsn: 0.637 ± 0.266
TrpPro: 0.637 ± 0.272
TrpGln: 0.637 ± 0.279
TrpArg: 0.956 ± 0.296
TrpSer: 0.956 ± 0.336
TrpThr: 0.478 ± 0.186
TrpVal: 0.956 ± 0.238
TrpTrp: 0.08 ± 0.072
TrpTyr: 0.478 ± 0.199
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.425 ± 0.559
TyrCys: 0.398 ± 0.199
TyrAsp: 2.39 ± 0.648
TyrGlu: 2.23 ± 0.396
TyrPhe: 1.832 ± 0.363
TyrGly: 3.425 ± 0.542
TyrHis: 0.637 ± 0.218
TyrIle: 1.274 ± 0.275
TyrLys: 1.593 ± 0.304
TyrLeu: 2.469 ± 0.369
TyrMet: 0.637 ± 0.249
TyrAsn: 1.513 ± 0.318
TyrPro: 1.354 ± 0.43
TyrGln: 1.673 ± 0.295
TyrArg: 1.832 ± 0.355
TyrSer: 2.469 ± 0.343
TyrThr: 1.832 ± 0.439
TyrVal: 1.912 ± 0.362
TyrTrp: 0.319 ± 0.144
TyrTyr: 1.274 ± 0.265
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (12555 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
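One way such a protein-level bootstrap could look is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is taken as the error. The function names, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions; the page does not specify which statistic was used.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    count = total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            total += 1
            count += (a + b == pair)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples.

    Assumption: the error is the sample standard deviation of the
    per-resample frequencies.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, not from this phage).
toy = ["MKAAL", "AAC", "MKVLAA", "GGSAAK"]
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that co-occur within the same protein are drawn together.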

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski