Amino acid dipepetide frequency for Enterobacter phage PG7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.021AlaAla: 6.021 ± 0.432
0.486AlaCys: 0.486 ± 0.11
3.702AlaAsp: 3.702 ± 0.281
5.385AlaGlu: 5.385 ± 0.382
2.449AlaPhe: 2.449 ± 0.196
5.067AlaGly: 5.067 ± 0.423
1.29AlaHis: 1.29 ± 0.169
4.749AlaIle: 4.749 ± 0.312
5.329AlaLys: 5.329 ± 0.29
6.058AlaLeu: 6.058 ± 0.317
1.758AlaMet: 1.758 ± 0.163
3.384AlaAsn: 3.384 ± 0.297
2.206AlaPro: 2.206 ± 0.224
2.655AlaGln: 2.655 ± 0.242
3.422AlaArg: 3.422 ± 0.246
4.039AlaSer: 4.039 ± 0.329
3.983AlaThr: 3.983 ± 0.373
4.637AlaVal: 4.637 ± 0.316
1.122AlaTrp: 1.122 ± 0.136
2.805AlaTyr: 2.805 ± 0.187
0.0AlaXaa: 0.0 ± 0.0
Cys
0.785CysAla: 0.785 ± 0.123
0.168CysCys: 0.168 ± 0.064
0.711CysAsp: 0.711 ± 0.109
0.86CysGlu: 0.86 ± 0.126
0.467CysPhe: 0.467 ± 0.096
0.673CysGly: 0.673 ± 0.122
0.224CysHis: 0.224 ± 0.069
0.524CysIle: 0.524 ± 0.116
0.598CysLys: 0.598 ± 0.132
0.673CysLeu: 0.673 ± 0.141
0.449CysMet: 0.449 ± 0.1
0.505CysAsn: 0.505 ± 0.101
0.486CysPro: 0.486 ± 0.098
0.28CysGln: 0.28 ± 0.079
0.654CysArg: 0.654 ± 0.11
0.654CysSer: 0.654 ± 0.111
0.748CysThr: 0.748 ± 0.129
0.561CysVal: 0.561 ± 0.105
0.206CysTrp: 0.206 ± 0.058
0.467CysTyr: 0.467 ± 0.091
0.0CysXaa: 0.0 ± 0.0
Asp
4.188AspAla: 4.188 ± 0.305
0.542AspCys: 0.542 ± 0.101
3.571AspAsp: 3.571 ± 0.286
5.067AspGlu: 5.067 ± 0.366
3.272AspPhe: 3.272 ± 0.258
5.067AspGly: 5.067 ± 0.345
1.066AspHis: 1.066 ± 0.175
4.656AspIle: 4.656 ± 0.258
4.282AspLys: 4.282 ± 0.302
4.525AspLeu: 4.525 ± 0.306
1.72AspMet: 1.72 ± 0.214
2.973AspAsn: 2.973 ± 0.244
2.094AspPro: 2.094 ± 0.215
1.571AspGln: 1.571 ± 0.22
2.075AspArg: 2.075 ± 0.176
3.515AspSer: 3.515 ± 0.247
3.272AspThr: 3.272 ± 0.275
4.375AspVal: 4.375 ± 0.247
1.103AspTrp: 1.103 ± 0.135
3.197AspTyr: 3.197 ± 0.281
0.0AspXaa: 0.0 ± 0.0
Glu
5.722GluAla: 5.722 ± 0.316
0.841GluCys: 0.841 ± 0.129
4.824GluAsp: 4.824 ± 0.322
5.235GluGlu: 5.235 ± 0.373
3.609GluPhe: 3.609 ± 0.269
3.945GluGly: 3.945 ± 0.249
1.477GluHis: 1.477 ± 0.172
5.31GluIle: 5.31 ± 0.292
4.805GluLys: 4.805 ± 0.334
6.563GluLeu: 6.563 ± 0.395
2.337GluMet: 2.337 ± 0.216
3.31GluAsn: 3.31 ± 0.26
2.188GluPro: 2.188 ± 0.199
2.319GluGln: 2.319 ± 0.202
2.879GluArg: 2.879 ± 0.229
4.301GluSer: 4.301 ± 0.301
3.87GluThr: 3.87 ± 0.291
5.609GluVal: 5.609 ± 0.365
1.084GluTrp: 1.084 ± 0.12
3.085GluTyr: 3.085 ± 0.247
0.0GluXaa: 0.0 ± 0.0
Phe
2.487PheAla: 2.487 ± 0.228
0.561PheCys: 0.561 ± 0.107
2.973PheAsp: 2.973 ± 0.287
3.646PheGlu: 3.646 ± 0.308
1.776PhePhe: 1.776 ± 0.203
3.085PheGly: 3.085 ± 0.245
0.823PheHis: 0.823 ± 0.119
2.954PheIle: 2.954 ± 0.255
3.833PheLys: 3.833 ± 0.302
2.393PheLeu: 2.393 ± 0.218
1.29PheMet: 1.29 ± 0.155
2.506PheAsn: 2.506 ± 0.183
1.477PhePro: 1.477 ± 0.18
1.44PheGln: 1.44 ± 0.174
1.832PheArg: 1.832 ± 0.174
2.805PheSer: 2.805 ± 0.226
2.562PheThr: 2.562 ± 0.219
3.01PheVal: 3.01 ± 0.261
0.636PheTrp: 0.636 ± 0.112
1.645PheTyr: 1.645 ± 0.174
0.0PheXaa: 0.0 ± 0.0
Gly
4.17GlyAla: 4.17 ± 0.43
0.748GlyCys: 0.748 ± 0.125
4.263GlyAsp: 4.263 ± 0.347
4.17GlyGlu: 4.17 ± 0.247
2.954GlyPhe: 2.954 ± 0.236
3.459GlyGly: 3.459 ± 0.414
1.197GlyHis: 1.197 ± 0.158
3.908GlyIle: 3.908 ± 0.228
4.394GlyLys: 4.394 ± 0.295
4.674GlyLeu: 4.674 ± 0.301
1.982GlyMet: 1.982 ± 0.187
3.347GlyAsn: 3.347 ± 0.32
1.702GlyPro: 1.702 ± 0.214
2.543GlyGln: 2.543 ± 0.24
2.879GlyArg: 2.879 ± 0.278
4.469GlySer: 4.469 ± 0.351
4.413GlyThr: 4.413 ± 0.365
4.693GlyVal: 4.693 ± 0.295
1.29GlyTrp: 1.29 ± 0.171
2.655GlyTyr: 2.655 ± 0.228
0.0GlyXaa: 0.0 ± 0.0
His
1.253HisAla: 1.253 ± 0.164
0.262HisCys: 0.262 ± 0.075
1.122HisAsp: 1.122 ± 0.158
0.972HisGlu: 0.972 ± 0.135
0.879HisPhe: 0.879 ± 0.142
1.271HisGly: 1.271 ± 0.155
0.449HisHis: 0.449 ± 0.1
1.384HisIle: 1.384 ± 0.198
1.29HisLys: 1.29 ± 0.189
1.496HisLeu: 1.496 ± 0.175
0.561HisMet: 0.561 ± 0.106
0.748HisAsn: 0.748 ± 0.131
1.084HisPro: 1.084 ± 0.146
0.748HisGln: 0.748 ± 0.114
0.711HisArg: 0.711 ± 0.128
0.972HisSer: 0.972 ± 0.128
0.879HisThr: 0.879 ± 0.118
1.346HisVal: 1.346 ± 0.15
0.28HisTrp: 0.28 ± 0.075
0.879HisTyr: 0.879 ± 0.132
0.0HisXaa: 0.0 ± 0.0
Ile
4.843IleAla: 4.843 ± 0.316
0.785IleCys: 0.785 ± 0.119
4.712IleAsp: 4.712 ± 0.275
5.665IleGlu: 5.665 ± 0.314
2.431IlePhe: 2.431 ± 0.197
3.665IleGly: 3.665 ± 0.258
1.234IleHis: 1.234 ± 0.17
4.45IleIle: 4.45 ± 0.281
5.179IleLys: 5.179 ± 0.317
4.226IleLeu: 4.226 ± 0.258
1.945IleMet: 1.945 ± 0.221
4.226IleAsn: 4.226 ± 0.316
2.861IlePro: 2.861 ± 0.235
2.169IleGln: 2.169 ± 0.2
3.908IleArg: 3.908 ± 0.273
4.057IleSer: 4.057 ± 0.323
4.338IleThr: 4.338 ± 0.29
4.936IleVal: 4.936 ± 0.348
0.935IleTrp: 0.935 ± 0.179
2.468IleTyr: 2.468 ± 0.206
0.0IleXaa: 0.0 ± 0.0
Lys
5.74LysAla: 5.74 ± 0.343
0.823LysCys: 0.823 ± 0.14
4.506LysAsp: 4.506 ± 0.263
5.179LysGlu: 5.179 ± 0.324
3.478LysPhe: 3.478 ± 0.291
4.057LysGly: 4.057 ± 0.27
1.402LysHis: 1.402 ± 0.164
4.805LysIle: 4.805 ± 0.325
4.431LysLys: 4.431 ± 0.333
6.208LysLeu: 6.208 ± 0.337
2.543LysMet: 2.543 ± 0.217
3.571LysAsn: 3.571 ± 0.213
2.375LysPro: 2.375 ± 0.196
2.375LysGln: 2.375 ± 0.235
3.796LysArg: 3.796 ± 0.27
4.282LysSer: 4.282 ± 0.285
4.319LysThr: 4.319 ± 0.267
4.768LysVal: 4.768 ± 0.264
1.215LysTrp: 1.215 ± 0.159
3.029LysTyr: 3.029 ± 0.265
0.0LysXaa: 0.0 ± 0.0
Leu
5.217LeuAla: 5.217 ± 0.371
0.748LeuCys: 0.748 ± 0.143
4.955LeuAsp: 4.955 ± 0.352
5.497LeuGlu: 5.497 ± 0.353
3.029LeuPhe: 3.029 ± 0.246
4.45LeuGly: 4.45 ± 0.274
1.44LeuHis: 1.44 ± 0.164
4.712LeuIle: 4.712 ± 0.29
6.17LeuLys: 6.17 ± 0.397
4.562LeuLeu: 4.562 ± 0.323
2.15LeuMet: 2.15 ± 0.229
4.618LeuAsn: 4.618 ± 0.268
3.179LeuPro: 3.179 ± 0.225
2.524LeuGln: 2.524 ± 0.231
3.796LeuArg: 3.796 ± 0.262
4.506LeuSer: 4.506 ± 0.338
4.282LeuThr: 4.282 ± 0.286
4.581LeuVal: 4.581 ± 0.272
0.954LeuTrp: 0.954 ± 0.147
2.823LeuTyr: 2.823 ± 0.238
0.0LeuXaa: 0.0 ± 0.0
Met
2.113MetAla: 2.113 ± 0.198
0.224MetCys: 0.224 ± 0.07
1.29MetAsp: 1.29 ± 0.13
1.776MetGlu: 1.776 ± 0.189
1.309MetPhe: 1.309 ± 0.198
1.458MetGly: 1.458 ± 0.18
0.449MetHis: 0.449 ± 0.088
1.926MetIle: 1.926 ± 0.218
3.216MetLys: 3.216 ± 0.292
2.393MetLeu: 2.393 ± 0.188
1.028MetMet: 1.028 ± 0.152
2.132MetAsn: 2.132 ± 0.186
0.767MetPro: 0.767 ± 0.113
1.028MetGln: 1.028 ± 0.141
1.159MetArg: 1.159 ± 0.157
1.963MetSer: 1.963 ± 0.178
2.15MetThr: 2.15 ± 0.205
1.627MetVal: 1.627 ± 0.156
0.243MetTrp: 0.243 ± 0.061
0.86MetTyr: 0.86 ± 0.121
0.0MetXaa: 0.0 ± 0.0
Asn
3.534AsnAla: 3.534 ± 0.314
0.561AsnCys: 0.561 ± 0.101
3.066AsnAsp: 3.066 ± 0.231
3.983AsnGlu: 3.983 ± 0.277
2.524AsnPhe: 2.524 ± 0.212
4.431AsnGly: 4.431 ± 0.338
0.804AsnHis: 0.804 ± 0.118
3.796AsnIle: 3.796 ± 0.264
3.721AsnLys: 3.721 ± 0.285
3.646AsnLeu: 3.646 ± 0.272
1.253AsnMet: 1.253 ± 0.167
2.786AsnAsn: 2.786 ± 0.255
2.431AsnPro: 2.431 ± 0.246
1.851AsnGln: 1.851 ± 0.202
2.506AsnArg: 2.506 ± 0.21
3.048AsnSer: 3.048 ± 0.244
2.73AsnThr: 2.73 ± 0.246
3.066AsnVal: 3.066 ± 0.25
0.748AsnTrp: 0.748 ± 0.128
2.001AsnTyr: 2.001 ± 0.235
0.0AsnXaa: 0.0 ± 0.0
Pro
2.468ProAla: 2.468 ± 0.244
0.505ProCys: 0.505 ± 0.108
2.506ProAsp: 2.506 ± 0.251
3.44ProGlu: 3.44 ± 0.282
1.664ProPhe: 1.664 ± 0.171
2.524ProGly: 2.524 ± 0.208
0.711ProHis: 0.711 ± 0.117
2.206ProIle: 2.206 ± 0.202
2.356ProLys: 2.356 ± 0.261
2.281ProLeu: 2.281 ± 0.216
0.823ProMet: 0.823 ± 0.133
1.832ProAsn: 1.832 ± 0.227
1.028ProPro: 1.028 ± 0.188
1.178ProGln: 1.178 ± 0.162
1.645ProArg: 1.645 ± 0.164
2.206ProSer: 2.206 ± 0.195
2.244ProThr: 2.244 ± 0.197
2.543ProVal: 2.543 ± 0.217
0.897ProTrp: 0.897 ± 0.147
1.253ProTyr: 1.253 ± 0.158
0.0ProXaa: 0.0 ± 0.0
Gln
2.094GlnAla: 2.094 ± 0.256
0.262GlnCys: 0.262 ± 0.065
1.627GlnAsp: 1.627 ± 0.191
2.206GlnGlu: 2.206 ± 0.198
1.702GlnPhe: 1.702 ± 0.19
1.87GlnGly: 1.87 ± 0.167
0.542GlnHis: 0.542 ± 0.106
2.823GlnIle: 2.823 ± 0.236
2.188GlnLys: 2.188 ± 0.195
3.328GlnLeu: 3.328 ± 0.281
1.234GlnMet: 1.234 ± 0.157
1.627GlnAsn: 1.627 ± 0.178
1.178GlnPro: 1.178 ± 0.158
1.047GlnGln: 1.047 ± 0.136
1.72GlnArg: 1.72 ± 0.146
2.038GlnSer: 2.038 ± 0.205
1.851GlnThr: 1.851 ± 0.181
2.543GlnVal: 2.543 ± 0.26
0.449GlnTrp: 0.449 ± 0.106
1.627GlnTyr: 1.627 ± 0.197
0.0GlnXaa: 0.0 ± 0.0
Arg
2.992ArgAla: 2.992 ± 0.264
0.636ArgCys: 0.636 ± 0.113
2.468ArgAsp: 2.468 ± 0.218
3.235ArgGlu: 3.235 ± 0.245
1.776ArgPhe: 1.776 ± 0.198
3.048ArgGly: 3.048 ± 0.244
0.935ArgHis: 0.935 ± 0.134
3.74ArgIle: 3.74 ± 0.243
3.59ArgLys: 3.59 ± 0.287
3.665ArgLeu: 3.665 ± 0.242
1.608ArgMet: 1.608 ± 0.143
2.487ArgAsn: 2.487 ± 0.22
1.608ArgPro: 1.608 ± 0.174
1.814ArgGln: 1.814 ± 0.213
2.113ArgArg: 2.113 ± 0.214
2.674ArgSer: 2.674 ± 0.222
2.468ArgThr: 2.468 ± 0.192
3.272ArgVal: 3.272 ± 0.218
0.729ArgTrp: 0.729 ± 0.112
1.571ArgTyr: 1.571 ± 0.213
0.0ArgXaa: 0.0 ± 0.0
Ser
4.057SerAla: 4.057 ± 0.257
0.636SerCys: 0.636 ± 0.108
3.627SerAsp: 3.627 ± 0.248
4.226SerGlu: 4.226 ± 0.298
2.954SerPhe: 2.954 ± 0.259
4.899SerGly: 4.899 ± 0.399
0.804SerHis: 0.804 ± 0.125
4.394SerIle: 4.394 ± 0.299
4.712SerLys: 4.712 ± 0.278
4.039SerLeu: 4.039 ± 0.247
1.702SerMet: 1.702 ± 0.173
2.898SerAsn: 2.898 ± 0.264
2.225SerPro: 2.225 ± 0.229
1.963SerGln: 1.963 ± 0.188
2.917SerArg: 2.917 ± 0.236
3.814SerSer: 3.814 ± 0.315
3.366SerThr: 3.366 ± 0.325
4.301SerVal: 4.301 ± 0.289
0.748SerTrp: 0.748 ± 0.117
2.337SerTyr: 2.337 ± 0.222
0.0SerXaa: 0.0 ± 0.0
Thr
3.945ThrAla: 3.945 ± 0.352
0.524ThrCys: 0.524 ± 0.109
2.954ThrAsp: 2.954 ± 0.241
3.833ThrGlu: 3.833 ± 0.296
2.599ThrPhe: 2.599 ± 0.245
4.394ThrGly: 4.394 ± 0.361
1.141ThrHis: 1.141 ± 0.121
4.244ThrIle: 4.244 ± 0.314
3.74ThrLys: 3.74 ± 0.249
4.674ThrLeu: 4.674 ± 0.424
1.328ThrMet: 1.328 ± 0.155
2.692ThrAsn: 2.692 ± 0.229
3.253ThrPro: 3.253 ± 0.278
2.038ThrGln: 2.038 ± 0.236
2.674ThrArg: 2.674 ± 0.269
3.833ThrSer: 3.833 ± 0.305
3.01ThrThr: 3.01 ± 0.283
4.039ThrVal: 4.039 ± 0.339
0.785ThrTrp: 0.785 ± 0.124
2.169ThrTyr: 2.169 ± 0.24
0.0ThrXaa: 0.0 ± 0.0
Val
5.123ValAla: 5.123 ± 0.323
0.58ValCys: 0.58 ± 0.096
5.011ValAsp: 5.011 ± 0.315
5.31ValGlu: 5.31 ± 0.328
2.674ValPhe: 2.674 ± 0.226
3.235ValGly: 3.235 ± 0.252
1.346ValHis: 1.346 ± 0.152
4.731ValIle: 4.731 ± 0.284
5.366ValLys: 5.366 ± 0.294
4.712ValLeu: 4.712 ± 0.288
1.888ValMet: 1.888 ± 0.212
3.74ValAsn: 3.74 ± 0.266
2.262ValPro: 2.262 ± 0.217
2.618ValGln: 2.618 ± 0.249
3.066ValArg: 3.066 ± 0.221
4.114ValSer: 4.114 ± 0.253
4.076ValThr: 4.076 ± 0.368
4.674ValVal: 4.674 ± 0.328
0.972ValTrp: 0.972 ± 0.141
3.029ValTyr: 3.029 ± 0.23
0.0ValXaa: 0.0 ± 0.0
Trp
0.729TrpAla: 0.729 ± 0.133
0.28TrpCys: 0.28 ± 0.069
1.028TrpAsp: 1.028 ± 0.131
1.066TrpGlu: 1.066 ± 0.15
0.711TrpPhe: 0.711 ± 0.115
0.785TrpGly: 0.785 ± 0.147
0.318TrpHis: 0.318 ± 0.077
0.954TrpIle: 0.954 ± 0.139
1.402TrpLys: 1.402 ± 0.182
1.253TrpLeu: 1.253 ± 0.162
0.486TrpMet: 0.486 ± 0.104
0.748TrpAsn: 0.748 ± 0.103
0.505TrpPro: 0.505 ± 0.093
0.542TrpGln: 0.542 ± 0.098
0.785TrpArg: 0.785 ± 0.12
0.804TrpSer: 0.804 ± 0.135
0.935TrpThr: 0.935 ± 0.133
0.935TrpVal: 0.935 ± 0.139
0.262TrpTrp: 0.262 ± 0.067
0.785TrpTyr: 0.785 ± 0.116
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.066TyrAla: 3.066 ± 0.257
0.524TyrCys: 0.524 ± 0.096
3.235TyrAsp: 3.235 ± 0.244
2.562TyrGlu: 2.562 ± 0.245
1.477TyrPhe: 1.477 ± 0.162
2.524TyrGly: 2.524 ± 0.235
0.935TyrHis: 0.935 ± 0.129
2.767TyrIle: 2.767 ± 0.287
2.3TyrLys: 2.3 ± 0.196
2.823TyrLeu: 2.823 ± 0.288
1.01TyrMet: 1.01 ± 0.133
2.356TyrAsn: 2.356 ± 0.192
1.402TyrPro: 1.402 ± 0.144
1.253TyrGln: 1.253 ± 0.137
1.888TyrArg: 1.888 ± 0.177
2.524TyrSer: 2.524 ± 0.285
2.375TyrThr: 2.375 ± 0.212
3.01TyrVal: 3.01 ± 0.291
0.654TyrTrp: 0.654 ± 0.108
1.645TyrTyr: 1.645 ± 0.225
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 294 proteins (53483 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski