Amino acid dipepetide frequency for Erwinia phage phiEa21-4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.859AlaAla: 6.859 ± 0.768
1.07AlaCys: 1.07 ± 0.228
4.123AlaAsp: 4.123 ± 0.384
5.273AlaGlu: 5.273 ± 0.401
2.577AlaPhe: 2.577 ± 0.31
6.066AlaGly: 6.066 ± 0.865
1.626AlaHis: 1.626 ± 0.253
4.639AlaIle: 4.639 ± 0.425
5.273AlaLys: 5.273 ± 0.491
6.224AlaLeu: 6.224 ± 0.586
2.855AlaMet: 2.855 ± 0.352
3.489AlaAsn: 3.489 ± 0.478
1.863AlaPro: 1.863 ± 0.226
2.855AlaGln: 2.855 ± 0.367
3.33AlaArg: 3.33 ± 0.391
4.877AlaSer: 4.877 ± 0.52
5.273AlaThr: 5.273 ± 0.699
6.026AlaVal: 6.026 ± 0.39
0.991AlaTrp: 0.991 ± 0.219
3.33AlaTyr: 3.33 ± 0.343
0.0AlaXaa: 0.0 ± 0.0
Cys
0.714CysAla: 0.714 ± 0.18
0.159CysCys: 0.159 ± 0.087
0.714CysAsp: 0.714 ± 0.174
0.872CysGlu: 0.872 ± 0.179
0.793CysPhe: 0.793 ± 0.166
0.634CysGly: 0.634 ± 0.173
0.396CysHis: 0.396 ± 0.109
0.555CysIle: 0.555 ± 0.156
0.793CysLys: 0.793 ± 0.151
0.833CysLeu: 0.833 ± 0.167
0.238CysMet: 0.238 ± 0.093
0.595CysAsn: 0.595 ± 0.14
0.555CysPro: 0.555 ± 0.183
0.515CysGln: 0.515 ± 0.121
0.396CysArg: 0.396 ± 0.128
0.793CysSer: 0.793 ± 0.152
0.436CysThr: 0.436 ± 0.158
0.714CysVal: 0.714 ± 0.154
0.04CysTrp: 0.04 ± 0.04
0.674CysTyr: 0.674 ± 0.162
0.0CysXaa: 0.0 ± 0.0
Asp
5.352AspAla: 5.352 ± 0.45
0.674AspCys: 0.674 ± 0.162
4.321AspAsp: 4.321 ± 0.634
5.035AspGlu: 5.035 ± 0.57
3.291AspPhe: 3.291 ± 0.337
4.678AspGly: 4.678 ± 0.494
0.753AspHis: 0.753 ± 0.164
3.529AspIle: 3.529 ± 0.373
4.044AspLys: 4.044 ± 0.358
4.678AspLeu: 4.678 ± 0.424
1.546AspMet: 1.546 ± 0.217
3.885AspAsn: 3.885 ± 0.47
1.665AspPro: 1.665 ± 0.315
1.031AspGln: 1.031 ± 0.163
2.181AspArg: 2.181 ± 0.32
4.916AspSer: 4.916 ± 0.436
3.291AspThr: 3.291 ± 0.453
4.559AspVal: 4.559 ± 0.392
1.031AspTrp: 1.031 ± 0.186
2.973AspTyr: 2.973 ± 0.371
0.0AspXaa: 0.0 ± 0.0
Glu
4.837GluAla: 4.837 ± 0.487
0.674GluCys: 0.674 ± 0.169
3.529GluAsp: 3.529 ± 0.334
4.321GluGlu: 4.321 ± 0.509
3.291GluPhe: 3.291 ± 0.318
4.044GluGly: 4.044 ± 0.344
1.586GluHis: 1.586 ± 0.22
4.044GluIle: 4.044 ± 0.393
4.599GluLys: 4.599 ± 0.476
5.313GluLeu: 5.313 ± 0.418
2.101GluMet: 2.101 ± 0.319
3.092GluAsn: 3.092 ± 0.3
1.546GluPro: 1.546 ± 0.317
2.379GluGln: 2.379 ± 0.335
3.132GluArg: 3.132 ± 0.386
4.361GluSer: 4.361 ± 0.305
3.449GluThr: 3.449 ± 0.355
3.846GluVal: 3.846 ± 0.404
0.793GluTrp: 0.793 ± 0.203
1.943GluTyr: 1.943 ± 0.266
0.0GluXaa: 0.0 ± 0.0
Phe
3.41PheAla: 3.41 ± 0.45
0.278PheCys: 0.278 ± 0.108
2.815PheAsp: 2.815 ± 0.315
2.498PheGlu: 2.498 ± 0.357
2.141PhePhe: 2.141 ± 0.255
2.855PheGly: 2.855 ± 0.346
0.317PheHis: 0.317 ± 0.1
2.775PheIle: 2.775 ± 0.343
3.013PheLys: 3.013 ± 0.317
2.855PheLeu: 2.855 ± 0.314
1.388PheMet: 1.388 ± 0.256
2.181PheAsn: 2.181 ± 0.327
1.943PhePro: 1.943 ± 0.272
1.269PheGln: 1.269 ± 0.221
1.665PheArg: 1.665 ± 0.245
2.696PheSer: 2.696 ± 0.307
3.251PheThr: 3.251 ± 0.451
2.101PheVal: 2.101 ± 0.324
0.436PheTrp: 0.436 ± 0.124
1.467PheTyr: 1.467 ± 0.203
0.0PheXaa: 0.0 ± 0.0
Gly
5.63GlyAla: 5.63 ± 0.521
0.991GlyCys: 0.991 ± 0.215
4.44GlyAsp: 4.44 ± 0.433
4.242GlyGlu: 4.242 ± 0.424
2.973GlyPhe: 2.973 ± 0.406
3.925GlyGly: 3.925 ± 0.668
1.388GlyHis: 1.388 ± 0.237
4.282GlyIle: 4.282 ± 0.401
5.709GlyLys: 5.709 ± 0.52
6.423GlyLeu: 6.423 ± 0.564
1.586GlyMet: 1.586 ± 0.23
3.172GlyAsn: 3.172 ± 0.432
0.198GlyPro: 0.198 ± 0.1
2.458GlyGln: 2.458 ± 0.343
2.855GlyArg: 2.855 ± 0.33
4.321GlySer: 4.321 ± 0.478
3.727GlyThr: 3.727 ± 0.4
5.63GlyVal: 5.63 ± 0.425
0.753GlyTrp: 0.753 ± 0.243
3.33GlyTyr: 3.33 ± 0.362
0.0GlyXaa: 0.0 ± 0.0
His
1.269HisAla: 1.269 ± 0.279
0.278HisCys: 0.278 ± 0.112
1.467HisAsp: 1.467 ± 0.28
1.07HisGlu: 1.07 ± 0.171
0.714HisPhe: 0.714 ± 0.142
1.507HisGly: 1.507 ± 0.272
0.515HisHis: 0.515 ± 0.164
0.912HisIle: 0.912 ± 0.211
1.546HisLys: 1.546 ± 0.253
1.388HisLeu: 1.388 ± 0.252
0.634HisMet: 0.634 ± 0.15
1.15HisAsn: 1.15 ± 0.258
0.793HisPro: 0.793 ± 0.19
0.793HisGln: 0.793 ± 0.194
1.11HisArg: 1.11 ± 0.21
1.546HisSer: 1.546 ± 0.22
0.991HisThr: 0.991 ± 0.229
1.388HisVal: 1.388 ± 0.228
0.198HisTrp: 0.198 ± 0.08
0.991HisTyr: 0.991 ± 0.188
0.0HisXaa: 0.0 ± 0.0
Ile
5.154IleAla: 5.154 ± 0.426
0.555IleCys: 0.555 ± 0.142
4.084IleAsp: 4.084 ± 0.413
4.123IleGlu: 4.123 ± 0.366
1.982IlePhe: 1.982 ± 0.207
3.965IleGly: 3.965 ± 0.428
1.15IleHis: 1.15 ± 0.203
3.33IleIle: 3.33 ± 0.406
4.44IleLys: 4.44 ± 0.424
4.52IleLeu: 4.52 ± 0.461
1.348IleMet: 1.348 ± 0.262
3.806IleAsn: 3.806 ± 0.404
2.022IlePro: 2.022 ± 0.264
2.498IleGln: 2.498 ± 0.314
2.339IleArg: 2.339 ± 0.278
3.965IleSer: 3.965 ± 0.404
3.568IleThr: 3.568 ± 0.382
3.806IleVal: 3.806 ± 0.305
0.753IleTrp: 0.753 ± 0.137
1.982IleTyr: 1.982 ± 0.252
0.0IleXaa: 0.0 ± 0.0
Lys
6.938LysAla: 6.938 ± 0.538
0.595LysCys: 0.595 ± 0.149
4.48LysAsp: 4.48 ± 0.457
3.766LysGlu: 3.766 ± 0.402
2.22LysPhe: 2.22 ± 0.25
5.471LysGly: 5.471 ± 0.431
0.991LysHis: 0.991 ± 0.2
3.608LysIle: 3.608 ± 0.357
4.916LysLys: 4.916 ± 0.493
5.035LysLeu: 5.035 ± 0.447
1.863LysMet: 1.863 ± 0.242
3.806LysAsn: 3.806 ± 0.327
2.379LysPro: 2.379 ± 0.364
2.617LysGln: 2.617 ± 0.279
3.529LysArg: 3.529 ± 0.395
4.916LysSer: 4.916 ± 0.476
4.401LysThr: 4.401 ± 0.421
5.352LysVal: 5.352 ± 0.414
0.634LysTrp: 0.634 ± 0.155
2.696LysTyr: 2.696 ± 0.36
0.0LysXaa: 0.0 ± 0.0
Leu
6.819LeuAla: 6.819 ± 0.402
0.912LeuCys: 0.912 ± 0.223
5.63LeuAsp: 5.63 ± 0.495
5.392LeuGlu: 5.392 ± 0.53
3.211LeuPhe: 3.211 ± 0.391
3.647LeuGly: 3.647 ± 0.487
1.586LeuHis: 1.586 ± 0.271
4.242LeuIle: 4.242 ± 0.359
5.947LeuLys: 5.947 ± 0.55
6.106LeuLeu: 6.106 ± 0.552
2.577LeuMet: 2.577 ± 0.287
4.48LeuAsn: 4.48 ± 0.477
3.41LeuPro: 3.41 ± 0.348
2.577LeuGln: 2.577 ± 0.398
4.084LeuArg: 4.084 ± 0.605
5.352LeuSer: 5.352 ± 0.503
5.55LeuThr: 5.55 ± 0.406
4.678LeuVal: 4.678 ± 0.397
1.427LeuTrp: 1.427 ± 0.219
2.498LeuTyr: 2.498 ± 0.279
0.0LeuXaa: 0.0 ± 0.0
Met
2.537MetAla: 2.537 ± 0.318
0.278MetCys: 0.278 ± 0.094
1.586MetAsp: 1.586 ± 0.265
1.546MetGlu: 1.546 ± 0.227
0.674MetPhe: 0.674 ± 0.157
1.903MetGly: 1.903 ± 0.304
0.317MetHis: 0.317 ± 0.108
1.824MetIle: 1.824 ± 0.229
2.379MetLys: 2.379 ± 0.278
2.22MetLeu: 2.22 ± 0.332
0.753MetMet: 0.753 ± 0.145
1.982MetAsn: 1.982 ± 0.311
0.872MetPro: 0.872 ± 0.157
1.15MetGln: 1.15 ± 0.279
1.189MetArg: 1.189 ± 0.239
2.26MetSer: 2.26 ± 0.289
2.26MetThr: 2.26 ± 0.257
1.388MetVal: 1.388 ± 0.294
0.0MetTrp: 0.0 ± 0.0
1.07MetTyr: 1.07 ± 0.159
0.0MetXaa: 0.0 ± 0.0
Asn
3.41AsnAla: 3.41 ± 0.516
0.515AsnCys: 0.515 ± 0.132
2.656AsnAsp: 2.656 ± 0.373
2.418AsnGlu: 2.418 ± 0.289
1.824AsnPhe: 1.824 ± 0.264
4.599AsnGly: 4.599 ± 0.443
1.467AsnHis: 1.467 ± 0.319
3.925AsnIle: 3.925 ± 0.339
3.172AsnLys: 3.172 ± 0.295
4.758AsnLeu: 4.758 ± 0.428
1.348AsnMet: 1.348 ± 0.215
2.339AsnAsn: 2.339 ± 0.323
3.211AsnPro: 3.211 ± 0.381
1.943AsnGln: 1.943 ± 0.287
2.498AsnArg: 2.498 ± 0.289
3.806AsnSer: 3.806 ± 0.411
2.696AsnThr: 2.696 ± 0.316
2.894AsnVal: 2.894 ± 0.354
0.833AsnTrp: 0.833 ± 0.187
1.784AsnTyr: 1.784 ± 0.291
0.0AsnXaa: 0.0 ± 0.0
Pro
2.22ProAla: 2.22 ± 0.266
0.238ProCys: 0.238 ± 0.091
2.617ProAsp: 2.617 ± 0.285
2.855ProGlu: 2.855 ± 0.371
2.022ProPhe: 2.022 ± 0.326
0.436ProGly: 0.436 ± 0.18
0.793ProHis: 0.793 ± 0.191
1.427ProIle: 1.427 ± 0.186
1.982ProLys: 1.982 ± 0.284
2.775ProLeu: 2.775 ± 0.36
0.714ProMet: 0.714 ± 0.149
1.863ProAsn: 1.863 ± 0.284
0.833ProPro: 0.833 ± 0.161
1.11ProGln: 1.11 ± 0.2
1.348ProArg: 1.348 ± 0.235
1.903ProSer: 1.903 ± 0.29
2.22ProThr: 2.22 ± 0.361
2.537ProVal: 2.537 ± 0.413
0.396ProTrp: 0.396 ± 0.16
1.427ProTyr: 1.427 ± 0.212
0.0ProXaa: 0.0 ± 0.0
Gln
2.775GlnAla: 2.775 ± 0.296
0.436GlnCys: 0.436 ± 0.133
1.507GlnAsp: 1.507 ± 0.243
1.863GlnGlu: 1.863 ± 0.257
1.863GlnPhe: 1.863 ± 0.286
2.537GlnGly: 2.537 ± 0.364
0.714GlnHis: 0.714 ± 0.213
2.656GlnIle: 2.656 ± 0.292
2.418GlnLys: 2.418 ± 0.282
3.053GlnLeu: 3.053 ± 0.413
1.388GlnMet: 1.388 ± 0.262
2.022GlnAsn: 2.022 ± 0.34
1.427GlnPro: 1.427 ± 0.233
1.586GlnGln: 1.586 ± 0.315
1.467GlnArg: 1.467 ± 0.211
2.101GlnSer: 2.101 ± 0.254
2.736GlnThr: 2.736 ± 0.46
1.665GlnVal: 1.665 ± 0.234
0.555GlnTrp: 0.555 ± 0.152
1.546GlnTyr: 1.546 ± 0.212
0.0GlnXaa: 0.0 ± 0.0
Arg
2.855ArgAla: 2.855 ± 0.299
0.634ArgCys: 0.634 ± 0.164
2.537ArgAsp: 2.537 ± 0.343
2.458ArgGlu: 2.458 ± 0.329
1.784ArgPhe: 1.784 ± 0.279
2.775ArgGly: 2.775 ± 0.395
1.507ArgHis: 1.507 ± 0.225
2.934ArgIle: 2.934 ± 0.315
3.41ArgLys: 3.41 ± 0.386
3.568ArgLeu: 3.568 ± 0.433
1.546ArgMet: 1.546 ± 0.252
2.022ArgAsn: 2.022 ± 0.297
0.952ArgPro: 0.952 ± 0.154
2.062ArgGln: 2.062 ± 0.316
2.418ArgArg: 2.418 ± 0.329
2.181ArgSer: 2.181 ± 0.329
2.101ArgThr: 2.101 ± 0.257
3.568ArgVal: 3.568 ± 0.383
0.991ArgTrp: 0.991 ± 0.19
2.141ArgTyr: 2.141 ± 0.362
0.0ArgXaa: 0.0 ± 0.0
Ser
4.956SerAla: 4.956 ± 0.573
0.833SerCys: 0.833 ± 0.177
4.48SerAsp: 4.48 ± 0.425
4.123SerGlu: 4.123 ± 0.405
2.934SerPhe: 2.934 ± 0.346
5.709SerGly: 5.709 ± 0.539
1.269SerHis: 1.269 ± 0.236
3.291SerIle: 3.291 ± 0.36
4.48SerLys: 4.48 ± 0.461
5.313SerLeu: 5.313 ± 0.465
1.903SerMet: 1.903 ± 0.312
3.013SerAsn: 3.013 ± 0.379
2.101SerPro: 2.101 ± 0.302
2.855SerGln: 2.855 ± 0.332
2.617SerArg: 2.617 ± 0.37
4.758SerSer: 4.758 ± 0.492
3.449SerThr: 3.449 ± 0.354
5.432SerVal: 5.432 ± 0.398
0.634SerTrp: 0.634 ± 0.169
3.053SerTyr: 3.053 ± 0.398
0.0SerXaa: 0.0 ± 0.0
Thr
4.995ThrAla: 4.995 ± 0.472
0.674ThrCys: 0.674 ± 0.157
4.163ThrAsp: 4.163 ± 0.478
3.172ThrGlu: 3.172 ± 0.373
2.418ThrPhe: 2.418 ± 0.333
5.194ThrGly: 5.194 ± 0.584
1.308ThrHis: 1.308 ± 0.228
4.084ThrIle: 4.084 ± 0.338
3.251ThrLys: 3.251 ± 0.406
5.63ThrLeu: 5.63 ± 0.568
0.952ThrMet: 0.952 ± 0.168
2.855ThrAsn: 2.855 ± 0.407
2.181ThrPro: 2.181 ± 0.297
2.26ThrGln: 2.26 ± 0.297
2.537ThrArg: 2.537 ± 0.309
4.004ThrSer: 4.004 ± 0.429
4.52ThrThr: 4.52 ± 0.667
5.154ThrVal: 5.154 ± 0.471
0.714ThrTrp: 0.714 ± 0.214
2.458ThrTyr: 2.458 ± 0.288
0.0ThrXaa: 0.0 ± 0.0
Val
4.678ValAla: 4.678 ± 0.455
0.833ValCys: 0.833 ± 0.179
4.52ValAsp: 4.52 ± 0.407
4.678ValGlu: 4.678 ± 0.565
2.379ValPhe: 2.379 ± 0.227
4.242ValGly: 4.242 ± 0.421
1.15ValHis: 1.15 ± 0.224
4.639ValIle: 4.639 ± 0.405
5.114ValLys: 5.114 ± 0.355
4.639ValLeu: 4.639 ± 0.483
2.26ValMet: 2.26 ± 0.333
3.766ValAsn: 3.766 ± 0.443
2.141ValPro: 2.141 ± 0.304
2.26ValGln: 2.26 ± 0.259
3.053ValArg: 3.053 ± 0.448
4.599ValSer: 4.599 ± 0.44
5.035ValThr: 5.035 ± 0.541
5.511ValVal: 5.511 ± 0.607
0.595ValTrp: 0.595 ± 0.2
2.894ValTyr: 2.894 ± 0.312
0.0ValXaa: 0.0 ± 0.0
Trp
0.912TrpAla: 0.912 ± 0.178
0.119TrpCys: 0.119 ± 0.066
0.912TrpAsp: 0.912 ± 0.247
0.714TrpGlu: 0.714 ± 0.162
0.595TrpPhe: 0.595 ± 0.14
0.515TrpGly: 0.515 ± 0.192
0.317TrpHis: 0.317 ± 0.115
0.674TrpIle: 0.674 ± 0.235
1.308TrpLys: 1.308 ± 0.251
1.269TrpLeu: 1.269 ± 0.262
0.357TrpMet: 0.357 ± 0.096
0.753TrpAsn: 0.753 ± 0.146
0.238TrpPro: 0.238 ± 0.094
0.357TrpGln: 0.357 ± 0.109
0.476TrpArg: 0.476 ± 0.154
0.912TrpSer: 0.912 ± 0.223
0.674TrpThr: 0.674 ± 0.202
0.753TrpVal: 0.753 ± 0.162
0.317TrpTrp: 0.317 ± 0.127
0.476TrpTyr: 0.476 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.299TyrAla: 2.299 ± 0.329
0.674TyrCys: 0.674 ± 0.131
2.855TyrAsp: 2.855 ± 0.347
2.458TyrGlu: 2.458 ± 0.305
1.665TyrPhe: 1.665 ± 0.288
3.33TyrGly: 3.33 ± 0.314
1.11TyrHis: 1.11 ± 0.188
2.062TyrIle: 2.062 ± 0.335
2.418TyrLys: 2.418 ± 0.312
3.529TyrLeu: 3.529 ± 0.298
0.793TyrMet: 0.793 ± 0.188
1.824TyrAsn: 1.824 ± 0.235
1.388TyrPro: 1.388 ± 0.249
1.744TyrGln: 1.744 ± 0.285
2.22TyrArg: 2.22 ± 0.252
2.934TyrSer: 2.934 ± 0.39
2.894TyrThr: 2.894 ± 0.354
1.982TyrVal: 1.982 ± 0.284
0.555TyrTrp: 0.555 ± 0.148
1.427TyrTyr: 1.427 ± 0.255
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 118 proteins (25224 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski