Amino acid dipepetide frequency for Escherichia phage phAPEC8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.815AlaAla: 4.815 ± 0.355
0.91AlaCys: 0.91 ± 0.142
3.417AlaAsp: 3.417 ± 0.231
5.125AlaGlu: 5.125 ± 0.38
2.773AlaPhe: 2.773 ± 0.243
4.948AlaGly: 4.948 ± 0.381
1.287AlaHis: 1.287 ± 0.175
3.816AlaIle: 3.816 ± 0.31
4.97AlaLys: 4.97 ± 0.305
5.946AlaLeu: 5.946 ± 0.389
1.997AlaMet: 1.997 ± 0.204
3.617AlaAsn: 3.617 ± 0.316
2.197AlaPro: 2.197 ± 0.257
2.041AlaGln: 2.041 ± 0.205
2.751AlaArg: 2.751 ± 0.24
3.883AlaSer: 3.883 ± 0.438
4.792AlaThr: 4.792 ± 0.351
4.837AlaVal: 4.837 ± 0.36
1.398AlaTrp: 1.398 ± 0.182
3.328AlaTyr: 3.328 ± 0.285
0.0AlaXaa: 0.0 ± 0.0
Cys
0.932CysAla: 0.932 ± 0.148
0.2CysCys: 0.2 ± 0.068
0.799CysAsp: 0.799 ± 0.139
0.954CysGlu: 0.954 ± 0.165
1.087CysPhe: 1.087 ± 0.168
0.976CysGly: 0.976 ± 0.141
0.355CysHis: 0.355 ± 0.092
0.71CysIle: 0.71 ± 0.132
1.198CysLys: 1.198 ± 0.204
1.132CysLeu: 1.132 ± 0.158
0.555CysMet: 0.555 ± 0.126
0.532CysAsn: 0.532 ± 0.114
0.488CysPro: 0.488 ± 0.115
0.488CysGln: 0.488 ± 0.111
0.643CysArg: 0.643 ± 0.139
0.91CysSer: 0.91 ± 0.142
0.621CysThr: 0.621 ± 0.113
1.132CysVal: 1.132 ± 0.176
0.311CysTrp: 0.311 ± 0.095
0.599CysTyr: 0.599 ± 0.108
0.0CysXaa: 0.0 ± 0.0
Asp
4.016AspAla: 4.016 ± 0.294
0.732AspCys: 0.732 ± 0.127
3.794AspAsp: 3.794 ± 0.304
4.282AspGlu: 4.282 ± 0.335
3.128AspPhe: 3.128 ± 0.281
4.97AspGly: 4.97 ± 0.385
1.309AspHis: 1.309 ± 0.184
4.105AspIle: 4.105 ± 0.285
3.972AspLys: 3.972 ± 0.307
5.658AspLeu: 5.658 ± 0.369
1.731AspMet: 1.731 ± 0.201
3.372AspAsn: 3.372 ± 0.273
2.485AspPro: 2.485 ± 0.212
2.019AspGln: 2.019 ± 0.172
2.374AspArg: 2.374 ± 0.251
3.35AspSer: 3.35 ± 0.254
3.151AspThr: 3.151 ± 0.264
4.526AspVal: 4.526 ± 0.333
1.198AspTrp: 1.198 ± 0.168
3.55AspTyr: 3.55 ± 0.267
0.0AspXaa: 0.0 ± 0.0
Glu
5.613GluAla: 5.613 ± 0.392
0.932GluCys: 0.932 ± 0.144
5.857GluAsp: 5.857 ± 0.413
7.921GluGlu: 7.921 ± 0.562
2.662GluPhe: 2.662 ± 0.214
4.682GluGly: 4.682 ± 0.311
1.265GluHis: 1.265 ± 0.21
4.548GluIle: 4.548 ± 0.326
4.881GluLys: 4.881 ± 0.368
5.258GluLeu: 5.258 ± 0.382
2.685GluMet: 2.685 ± 0.283
3.883GluAsn: 3.883 ± 0.32
1.842GluPro: 1.842 ± 0.192
2.818GluGln: 2.818 ± 0.29
3.106GluArg: 3.106 ± 0.284
3.284GluSer: 3.284 ± 0.247
4.26GluThr: 4.26 ± 0.298
4.97GluVal: 4.97 ± 0.368
1.731GluTrp: 1.731 ± 0.192
3.262GluTyr: 3.262 ± 0.25
0.0GluXaa: 0.0 ± 0.0
Phe
3.084PheAla: 3.084 ± 0.256
0.577PheCys: 0.577 ± 0.118
3.195PheAsp: 3.195 ± 0.239
3.062PheGlu: 3.062 ± 0.284
2.041PhePhe: 2.041 ± 0.206
2.973PheGly: 2.973 ± 0.293
0.71PheHis: 0.71 ± 0.111
2.463PheIle: 2.463 ± 0.22
3.151PheLys: 3.151 ± 0.298
3.239PheLeu: 3.239 ± 0.273
1.087PheMet: 1.087 ± 0.149
2.596PheAsn: 2.596 ± 0.221
1.664PhePro: 1.664 ± 0.176
1.265PheGln: 1.265 ± 0.189
1.819PheArg: 1.819 ± 0.222
2.64PheSer: 2.64 ± 0.259
2.685PheThr: 2.685 ± 0.268
2.973PheVal: 2.973 ± 0.281
0.621PheTrp: 0.621 ± 0.109
2.174PheTyr: 2.174 ± 0.223
0.0PheXaa: 0.0 ± 0.0
Gly
3.883GlyAla: 3.883 ± 0.293
1.132GlyCys: 1.132 ± 0.176
4.482GlyAsp: 4.482 ± 0.491
4.881GlyGlu: 4.881 ± 0.289
3.284GlyPhe: 3.284 ± 0.311
4.304GlyGly: 4.304 ± 0.32
1.398GlyHis: 1.398 ± 0.181
4.06GlyIle: 4.06 ± 0.282
4.748GlyLys: 4.748 ± 0.312
5.059GlyLeu: 5.059 ± 0.303
1.686GlyMet: 1.686 ± 0.201
3.417GlyAsn: 3.417 ± 0.256
0.91GlyPro: 0.91 ± 0.241
1.908GlyGln: 1.908 ± 0.207
2.374GlyArg: 2.374 ± 0.232
3.772GlySer: 3.772 ± 0.299
4.171GlyThr: 4.171 ± 0.593
5.17GlyVal: 5.17 ± 0.327
1.242GlyTrp: 1.242 ± 0.165
3.617GlyTyr: 3.617 ± 0.239
0.0GlyXaa: 0.0 ± 0.0
His
1.021HisAla: 1.021 ± 0.15
0.399HisCys: 0.399 ± 0.09
1.087HisAsp: 1.087 ± 0.148
1.065HisGlu: 1.065 ± 0.162
1.021HisPhe: 1.021 ± 0.135
1.265HisGly: 1.265 ± 0.179
0.311HisHis: 0.311 ± 0.087
1.242HisIle: 1.242 ± 0.178
1.198HisLys: 1.198 ± 0.187
1.42HisLeu: 1.42 ± 0.166
0.666HisMet: 0.666 ± 0.147
1.043HisAsn: 1.043 ± 0.153
0.821HisPro: 0.821 ± 0.17
0.488HisGln: 0.488 ± 0.118
0.865HisArg: 0.865 ± 0.143
0.932HisSer: 0.932 ± 0.134
0.976HisThr: 0.976 ± 0.169
1.553HisVal: 1.553 ± 0.199
0.288HisTrp: 0.288 ± 0.086
1.176HisTyr: 1.176 ± 0.184
0.0HisXaa: 0.0 ± 0.0
Ile
4.46IleAla: 4.46 ± 0.388
1.021IleCys: 1.021 ± 0.161
3.794IleAsp: 3.794 ± 0.283
4.504IleGlu: 4.504 ± 0.359
2.33IlePhe: 2.33 ± 0.221
3.417IleGly: 3.417 ± 0.268
0.998IleHis: 0.998 ± 0.158
3.794IleIle: 3.794 ± 0.283
4.127IleLys: 4.127 ± 0.311
4.992IleLeu: 4.992 ± 0.333
1.553IleMet: 1.553 ± 0.218
3.816IleAsn: 3.816 ± 0.282
2.862IlePro: 2.862 ± 0.324
2.174IleGln: 2.174 ± 0.212
2.596IleArg: 2.596 ± 0.25
3.461IleSer: 3.461 ± 0.276
4.659IleThr: 4.659 ± 0.324
4.548IleVal: 4.548 ± 0.332
0.599IleTrp: 0.599 ± 0.123
2.374IleTyr: 2.374 ± 0.232
0.0IleXaa: 0.0 ± 0.0
Lys
5.391LysAla: 5.391 ± 0.366
0.599LysCys: 0.599 ± 0.113
4.415LysAsp: 4.415 ± 0.363
5.658LysGlu: 5.658 ± 0.448
2.374LysPhe: 2.374 ± 0.228
4.615LysGly: 4.615 ± 0.48
1.575LysHis: 1.575 ± 0.19
4.105LysIle: 4.105 ± 0.319
4.482LysLys: 4.482 ± 0.421
4.682LysLeu: 4.682 ± 0.331
2.818LysMet: 2.818 ± 0.262
3.328LysAsn: 3.328 ± 0.298
2.219LysPro: 2.219 ± 0.214
2.507LysGln: 2.507 ± 0.246
2.729LysArg: 2.729 ± 0.278
3.617LysSer: 3.617 ± 0.308
3.838LysThr: 3.838 ± 0.32
4.504LysVal: 4.504 ± 0.372
0.932LysTrp: 0.932 ± 0.13
2.574LysTyr: 2.574 ± 0.25
0.0LysXaa: 0.0 ± 0.0
Leu
6.257LeuAla: 6.257 ± 0.395
1.287LeuCys: 1.287 ± 0.165
5.547LeuAsp: 5.547 ± 0.278
6.346LeuGlu: 6.346 ± 0.376
2.951LeuPhe: 2.951 ± 0.253
4.837LeuGly: 4.837 ± 0.308
1.353LeuHis: 1.353 ± 0.181
4.415LeuIle: 4.415 ± 0.33
5.857LeuLys: 5.857 ± 0.375
5.924LeuLeu: 5.924 ± 0.348
2.241LeuMet: 2.241 ± 0.244
4.082LeuAsn: 4.082 ± 0.313
3.306LeuPro: 3.306 ± 0.312
2.418LeuGln: 2.418 ± 0.215
3.528LeuArg: 3.528 ± 0.302
5.48LeuSer: 5.48 ± 0.351
5.125LeuThr: 5.125 ± 0.374
5.103LeuVal: 5.103 ± 0.311
1.065LeuTrp: 1.065 ± 0.164
3.195LeuTyr: 3.195 ± 0.255
0.0LeuXaa: 0.0 ± 0.0
Met
2.174MetAla: 2.174 ± 0.212
0.377MetCys: 0.377 ± 0.089
1.509MetAsp: 1.509 ± 0.185
1.686MetGlu: 1.686 ± 0.228
1.087MetPhe: 1.087 ± 0.148
1.731MetGly: 1.731 ± 0.206
0.688MetHis: 0.688 ± 0.124
2.041MetIle: 2.041 ± 0.196
2.13MetLys: 2.13 ± 0.246
2.063MetLeu: 2.063 ± 0.224
0.732MetMet: 0.732 ± 0.126
0.91MetAsn: 0.91 ± 0.173
0.954MetPro: 0.954 ± 0.171
1.176MetGln: 1.176 ± 0.184
1.464MetArg: 1.464 ± 0.163
2.396MetSer: 2.396 ± 0.265
1.93MetThr: 1.93 ± 0.204
1.886MetVal: 1.886 ± 0.205
0.288MetTrp: 0.288 ± 0.081
0.998MetTyr: 0.998 ± 0.142
0.0MetXaa: 0.0 ± 0.0
Asn
3.528AsnAla: 3.528 ± 0.247
0.621AsnCys: 0.621 ± 0.121
2.751AsnAsp: 2.751 ± 0.242
2.862AsnGlu: 2.862 ± 0.233
2.174AsnPhe: 2.174 ± 0.205
4.171AsnGly: 4.171 ± 0.391
0.887AsnHis: 0.887 ± 0.148
4.193AsnIle: 4.193 ± 0.354
3.617AsnLys: 3.617 ± 0.28
4.393AsnLeu: 4.393 ± 0.296
1.487AsnMet: 1.487 ± 0.183
3.173AsnAsn: 3.173 ± 0.322
2.884AsnPro: 2.884 ± 0.276
1.353AsnGln: 1.353 ± 0.179
2.063AsnArg: 2.063 ± 0.227
3.417AsnSer: 3.417 ± 0.33
3.372AsnThr: 3.372 ± 0.255
3.284AsnVal: 3.284 ± 0.274
0.932AsnTrp: 0.932 ± 0.168
2.552AsnTyr: 2.552 ± 0.255
0.0AsnXaa: 0.0 ± 0.0
Pro
2.108ProAla: 2.108 ± 0.206
0.799ProCys: 0.799 ± 0.125
2.529ProAsp: 2.529 ± 0.273
3.239ProGlu: 3.239 ± 0.258
2.041ProPhe: 2.041 ± 0.239
1.908ProGly: 1.908 ± 0.239
0.688ProHis: 0.688 ± 0.118
1.819ProIle: 1.819 ± 0.204
2.019ProLys: 2.019 ± 0.241
2.485ProLeu: 2.485 ± 0.272
0.71ProMet: 0.71 ± 0.124
2.263ProAsn: 2.263 ± 0.259
1.065ProPro: 1.065 ± 0.145
1.021ProGln: 1.021 ± 0.187
1.331ProArg: 1.331 ± 0.16
2.463ProSer: 2.463 ± 0.212
2.463ProThr: 2.463 ± 0.247
2.729ProVal: 2.729 ± 0.234
0.51ProTrp: 0.51 ± 0.101
1.642ProTyr: 1.642 ± 0.202
0.0ProXaa: 0.0 ± 0.0
Gln
2.307GlnAla: 2.307 ± 0.243
0.577GlnCys: 0.577 ± 0.13
1.997GlnAsp: 1.997 ± 0.226
2.973GlnGlu: 2.973 ± 0.238
1.22GlnPhe: 1.22 ± 0.149
1.686GlnGly: 1.686 ± 0.278
0.599GlnHis: 0.599 ± 0.095
2.618GlnIle: 2.618 ± 0.264
1.575GlnLys: 1.575 ± 0.237
2.13GlnLeu: 2.13 ± 0.213
0.976GlnMet: 0.976 ± 0.174
1.775GlnAsn: 1.775 ± 0.223
1.154GlnPro: 1.154 ± 0.153
1.021GlnGln: 1.021 ± 0.153
1.265GlnArg: 1.265 ± 0.2
1.575GlnSer: 1.575 ± 0.17
1.753GlnThr: 1.753 ± 0.186
2.063GlnVal: 2.063 ± 0.236
0.621GlnTrp: 0.621 ± 0.106
1.353GlnTyr: 1.353 ± 0.161
0.0GlnXaa: 0.0 ± 0.0
Arg
2.529ArgAla: 2.529 ± 0.208
0.843ArgCys: 0.843 ± 0.155
2.84ArgAsp: 2.84 ± 0.268
2.685ArgGlu: 2.685 ± 0.331
1.642ArgPhe: 1.642 ± 0.208
2.596ArgGly: 2.596 ± 0.256
0.643ArgHis: 0.643 ± 0.117
2.729ArgIle: 2.729 ± 0.279
2.995ArgLys: 2.995 ± 0.237
3.528ArgLeu: 3.528 ± 0.318
1.376ArgMet: 1.376 ± 0.195
2.507ArgAsn: 2.507 ± 0.249
1.176ArgPro: 1.176 ± 0.164
1.42ArgGln: 1.42 ± 0.177
1.997ArgArg: 1.997 ± 0.251
2.818ArgSer: 2.818 ± 0.225
1.753ArgThr: 1.753 ± 0.188
2.441ArgVal: 2.441 ± 0.226
0.488ArgTrp: 0.488 ± 0.109
1.464ArgTyr: 1.464 ± 0.2
0.0ArgXaa: 0.0 ± 0.0
Ser
4.06SerAla: 4.06 ± 0.296
0.865SerCys: 0.865 ± 0.139
3.395SerAsp: 3.395 ± 0.252
3.727SerGlu: 3.727 ± 0.297
3.04SerPhe: 3.04 ± 0.233
4.06SerGly: 4.06 ± 0.386
0.998SerHis: 0.998 ± 0.174
3.195SerIle: 3.195 ± 0.272
3.861SerLys: 3.861 ± 0.279
5.369SerLeu: 5.369 ± 0.342
1.42SerMet: 1.42 ± 0.167
3.55SerAsn: 3.55 ± 0.276
2.396SerPro: 2.396 ± 0.209
1.864SerGln: 1.864 ± 0.236
2.529SerArg: 2.529 ± 0.226
3.639SerSer: 3.639 ± 0.329
3.528SerThr: 3.528 ± 0.356
4.038SerVal: 4.038 ± 0.318
0.865SerTrp: 0.865 ± 0.128
2.441SerTyr: 2.441 ± 0.261
0.0SerXaa: 0.0 ± 0.0
Thr
4.082ThrAla: 4.082 ± 0.379
0.666ThrCys: 0.666 ± 0.108
2.951ThrAsp: 2.951 ± 0.275
4.349ThrGlu: 4.349 ± 0.338
3.239ThrPhe: 3.239 ± 0.295
4.682ThrGly: 4.682 ± 0.431
1.065ThrHis: 1.065 ± 0.131
3.927ThrIle: 3.927 ± 0.328
4.038ThrLys: 4.038 ± 0.279
5.658ThrLeu: 5.658 ± 0.405
1.132ThrMet: 1.132 ± 0.151
2.973ThrAsn: 2.973 ± 0.243
3.151ThrPro: 3.151 ± 0.228
1.686ThrGln: 1.686 ± 0.21
2.063ThrArg: 2.063 ± 0.183
3.417ThrSer: 3.417 ± 0.374
3.883ThrThr: 3.883 ± 0.39
4.26ThrVal: 4.26 ± 0.326
0.998ThrTrp: 0.998 ± 0.142
2.64ThrTyr: 2.64 ± 0.249
0.0ThrXaa: 0.0 ± 0.0
Val
4.748ValAla: 4.748 ± 0.311
1.22ValCys: 1.22 ± 0.189
4.526ValAsp: 4.526 ± 0.321
5.636ValGlu: 5.636 ± 0.375
3.04ValPhe: 3.04 ± 0.302
4.304ValGly: 4.304 ± 0.375
1.154ValHis: 1.154 ± 0.148
4.77ValIle: 4.77 ± 0.347
4.815ValLys: 4.815 ± 0.339
5.791ValLeu: 5.791 ± 0.409
1.708ValMet: 1.708 ± 0.187
3.461ValAsn: 3.461 ± 0.3
1.93ValPro: 1.93 ± 0.216
1.708ValGln: 1.708 ± 0.201
2.374ValArg: 2.374 ± 0.228
3.972ValSer: 3.972 ± 0.325
4.415ValThr: 4.415 ± 0.36
5.902ValVal: 5.902 ± 0.479
0.799ValTrp: 0.799 ± 0.14
3.395ValTyr: 3.395 ± 0.338
0.0ValXaa: 0.0 ± 0.0
Trp
0.71TrpAla: 0.71 ± 0.123
0.222TrpCys: 0.222 ± 0.073
1.353TrpAsp: 1.353 ± 0.181
1.398TrpGlu: 1.398 ± 0.177
0.998TrpPhe: 0.998 ± 0.148
0.732TrpGly: 0.732 ± 0.13
0.466TrpHis: 0.466 ± 0.093
1.109TrpIle: 1.109 ± 0.152
0.91TrpLys: 0.91 ± 0.158
1.686TrpLeu: 1.686 ± 0.196
0.51TrpMet: 0.51 ± 0.096
0.821TrpAsn: 0.821 ± 0.118
0.288TrpPro: 0.288 ± 0.077
0.333TrpGln: 0.333 ± 0.093
0.666TrpArg: 0.666 ± 0.14
0.932TrpSer: 0.932 ± 0.185
0.754TrpThr: 0.754 ± 0.12
0.998TrpVal: 0.998 ± 0.132
0.355TrpTrp: 0.355 ± 0.083
0.577TrpTyr: 0.577 ± 0.101
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.128TyrAla: 3.128 ± 0.217
0.643TyrCys: 0.643 ± 0.104
3.483TyrAsp: 3.483 ± 0.276
3.106TyrGlu: 3.106 ± 0.263
1.886TyrPhe: 1.886 ± 0.211
2.64TyrGly: 2.64 ± 0.227
1.087TyrHis: 1.087 ± 0.174
2.418TyrIle: 2.418 ± 0.215
2.418TyrLys: 2.418 ± 0.242
3.905TyrLeu: 3.905 ± 0.256
1.132TyrMet: 1.132 ± 0.162
2.596TyrAsn: 2.596 ± 0.222
2.041TyrPro: 2.041 ± 0.201
1.531TyrGln: 1.531 ± 0.184
1.93TyrArg: 1.93 ± 0.197
2.907TyrSer: 2.907 ± 0.269
2.729TyrThr: 2.729 ± 0.243
2.751TyrVal: 2.751 ± 0.225
0.577TyrTrp: 0.577 ± 0.127
2.063TyrTyr: 2.063 ± 0.249
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 269 proteins (45072 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski