Amino acid dipeptide frequency for Delftia phage PhiW-14

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
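As a rough illustration of how such frequencies could be computed, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter alphabet, the folding of non-standard letters into `X`, and the normalisation by the total number of overlapping pairs are all assumptions made for this example, and the database may normalise differently.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: counts overlapping pairs inside each protein and
    normalises by the total number of pairs (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        # Overlapping windows of length 2; a pair never spans two proteins.
        for first, second in zip(seq, seq[1:]):
            # Anything outside the 20 standard letters is folded into X (Xaa).
            first = first if first in AMINO_ACIDS else "X"
            second = second if second in AMINO_ACIDS else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not real phage proteins.
freqs = dipeptide_per_mille(["MAAK", "MKAA"])
print(len(freqs), round(freqs["AA"], 2))
```

The returned dictionary always has 441 keys (21 × 21), so pairs that never occur are reported explicitly as 0.0, as in the Xaa row and column of the table.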

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
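The expected value and the over/under-representation rule described above can be sketched in a few lines of Python. The names `expected_400`, `expected_441` and `classify` are hypothetical, and the comparison against a uniform expectation is only one reading of the colouring rule; the observed values are taken from the table on this page.

```python
N_STANDARD = 20

# Uniform expectation in per mille: 1000 / number of possible dipeptides.
expected_400 = 1000.0 / N_STANDARD ** 2          # 20 x 20 dipeptides
expected_441 = 1000.0 / (N_STANDARD + 1) ** 2    # 21 x 21 when Xaa is included


def classify(observed_per_mille, expected_per_mille=expected_400):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


# A few observed frequencies (in per mille) copied from the table.
observed = {"AlaAla": 7.689, "LeuVal": 6.217, "CysCys": 0.123, "TrpTrp": 0.409}
labels = {name: classify(value) for name, value in observed.items()}
print(expected_400, round(expected_441, 3), labels)
```

With these inputs AlaAla and LeuVal fall above the 2.5‰ expectation, while CysCys and TrpTrp fall below it.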

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency ± error in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 7.689 ± 0.61 | 0.961 ± 0.152 | 4.703 ± 0.297 | 5.562 ± 0.351 | 3.027 ± 0.225 | 5.91 ± 0.388 | 1.513 ± 0.17 | 5.01 ± 0.306 | 5.48 ± 0.394 | 6.81 ± 0.424 | 3.047 ± 0.266 | 3.599 ± 0.281 | 3.579 ± 0.304 | 3.436 ± 0.318 | 4.151 ± 0.282 | 4.519 ± 0.323 | 4.213 ± 0.267 | 5.215 ± 0.385 | 1.268 ± 0.181 | 2.209 ± 0.194 | 0.0 ± 0.0 |
| Cys | 0.532 ± 0.109 | 0.123 ± 0.041 | 0.593 ± 0.104 | 0.736 ± 0.125 | 0.429 ± 0.106 | 1.166 ± 0.169 | 0.368 ± 0.085 | 0.429 ± 0.088 | 0.511 ± 0.096 | 1.022 ± 0.126 | 0.47 ± 0.1 | 0.532 ± 0.095 | 0.491 ± 0.102 | 0.184 ± 0.066 | 0.613 ± 0.113 | 0.634 ± 0.105 | 0.736 ± 0.111 | 0.654 ± 0.091 | 0.286 ± 0.07 | 0.266 ± 0.079 | 0.0 ± 0.0 |
| Asp | 4.458 ± 0.277 | 0.552 ± 0.095 | 3.333 ± 0.337 | 4.253 ± 0.302 | 2.454 ± 0.24 | 4.253 ± 0.346 | 1.595 ± 0.17 | 3.476 ± 0.261 | 3.456 ± 0.313 | 5.419 ± 0.365 | 2.065 ± 0.193 | 2.883 ± 0.235 | 2.965 ± 0.254 | 2.699 ± 0.241 | 3.108 ± 0.259 | 3.067 ± 0.265 | 3.19 ± 0.221 | 4.397 ± 0.291 | 1.207 ± 0.157 | 2.065 ± 0.205 | 0.0 ± 0.0 |
| Glu | 6.298 ± 0.442 | 0.716 ± 0.13 | 4.09 ± 0.294 | 3.804 ± 0.311 | 2.454 ± 0.237 | 4.253 ± 0.266 | 2.086 ± 0.197 | 3.108 ± 0.269 | 3.047 ± 0.288 | 5.603 ± 0.37 | 2.27 ± 0.228 | 2.086 ± 0.208 | 2.106 ± 0.236 | 3.251 ± 0.307 | 4.029 ± 0.328 | 3.476 ± 0.309 | 3.395 ± 0.242 | 3.926 ± 0.291 | 1.084 ± 0.142 | 2.781 ± 0.221 | 0.0 ± 0.0 |
| Phe | 2.29 ± 0.23 | 0.552 ± 0.122 | 2.577 ± 0.269 | 2.311 ± 0.201 | 1.595 ± 0.186 | 2.352 ± 0.182 | 0.92 ± 0.121 | 2.024 ± 0.211 | 2.863 ± 0.241 | 3.027 ± 0.288 | 1.186 ± 0.145 | 2.597 ± 0.236 | 1.452 ± 0.171 | 1.268 ± 0.159 | 2.249 ± 0.228 | 2.433 ± 0.211 | 2.515 ± 0.203 | 2.413 ± 0.218 | 0.613 ± 0.126 | 1.493 ± 0.151 | 0.0 ± 0.0 |
| Gly | 5.399 ± 0.371 | 0.798 ± 0.143 | 4.54 ± 0.362 | 4.438 ± 0.337 | 3.027 ± 0.238 | 5.031 ± 0.384 | 1.902 ± 0.185 | 3.66 ± 0.3 | 4.274 ± 0.356 | 6.073 ± 0.413 | 2.352 ± 0.204 | 2.618 ± 0.236 | 2.27 ± 0.24 | 2.638 ± 0.203 | 3.804 ± 0.283 | 4.581 ± 0.377 | 3.681 ± 0.346 | 5.071 ± 0.327 | 1.616 ± 0.202 | 2.024 ± 0.162 | 0.0 ± 0.0 |
| His | 1.677 ± 0.165 | 0.368 ± 0.085 | 1.411 ± 0.183 | 1.452 ± 0.196 | 0.777 ± 0.137 | 1.554 ± 0.2 | 0.695 ± 0.116 | 1.472 ± 0.225 | 1.493 ± 0.163 | 1.922 ± 0.187 | 0.818 ± 0.143 | 1.063 ± 0.15 | 1.35 ± 0.164 | 0.838 ± 0.113 | 1.166 ± 0.175 | 1.391 ± 0.151 | 1.288 ± 0.143 | 1.534 ± 0.165 | 0.695 ± 0.12 | 1.166 ± 0.167 | 0.0 ± 0.0 |
| Ile | 4.478 ± 0.359 | 0.736 ± 0.131 | 3.926 ± 0.263 | 4.11 ± 0.273 | 1.329 ± 0.166 | 3.088 ± 0.245 | 1.431 ± 0.167 | 2.945 ± 0.279 | 3.906 ± 0.305 | 3.967 ± 0.262 | 1.35 ± 0.155 | 3.66 ± 0.267 | 2.658 ± 0.217 | 2.229 ± 0.208 | 3.129 ± 0.289 | 2.72 ± 0.208 | 3.006 ± 0.26 | 2.986 ± 0.246 | 0.573 ± 0.109 | 1.84 ± 0.163 | 0.0 ± 0.0 |
| Lys | 5.951 ± 0.405 | 0.532 ± 0.091 | 3.354 ± 0.322 | 4.478 ± 0.31 | 2.945 ± 0.242 | 4.581 ± 0.327 | 1.697 ± 0.175 | 2.393 ± 0.213 | 2.822 ± 0.309 | 4.478 ± 0.345 | 2.127 ± 0.204 | 1.943 ± 0.179 | 3.149 ± 0.313 | 3.251 ± 0.226 | 3.088 ± 0.274 | 3.313 ± 0.23 | 3.456 ± 0.308 | 4.09 ± 0.341 | 1.104 ± 0.168 | 2.045 ± 0.246 | 0.0 ± 0.0 |
| Leu | 7.137 ± 0.351 | 0.879 ± 0.149 | 5.051 ± 0.307 | 5.583 ± 0.31 | 2.986 ± 0.225 | 5.542 ± 0.336 | 1.82 ± 0.211 | 4.417 ± 0.292 | 5.562 ± 0.338 | 6.176 ± 0.442 | 2.168 ± 0.214 | 3.845 ± 0.259 | 3.456 ± 0.287 | 2.842 ± 0.246 | 4.253 ± 0.355 | 5.501 ± 0.356 | 4.478 ± 0.318 | 6.217 ± 0.342 | 0.941 ± 0.162 | 2.761 ± 0.259 | 0.0 ± 0.0 |
| Met | 2.658 ± 0.238 | 0.266 ± 0.065 | 1.84 ± 0.18 | 1.82 ± 0.217 | 1.329 ± 0.142 | 2.27 ± 0.222 | 0.348 ± 0.093 | 2.229 ± 0.235 | 1.984 ± 0.189 | 2.699 ± 0.226 | 0.818 ± 0.128 | 1.8 ± 0.19 | 1.309 ± 0.169 | 0.92 ± 0.151 | 1.697 ± 0.17 | 2.556 ± 0.194 | 2.413 ± 0.215 | 2.352 ± 0.232 | 0.45 ± 0.085 | 0.92 ± 0.135 | 0.0 ± 0.0 |
| Asn | 3.251 ± 0.298 | 0.409 ± 0.095 | 2.904 ± 0.215 | 2.045 ± 0.202 | 1.963 ± 0.188 | 3.947 ± 0.334 | 1.207 ± 0.163 | 2.474 ± 0.201 | 3.006 ± 0.232 | 3.763 ± 0.233 | 1.309 ± 0.157 | 2.168 ± 0.194 | 2.413 ± 0.245 | 1.534 ± 0.178 | 3.517 ± 0.28 | 2.495 ± 0.243 | 2.229 ± 0.234 | 3.047 ± 0.225 | 0.982 ± 0.151 | 1.37 ± 0.186 | 0.0 ± 0.0 |
| Pro | 3.497 ± 0.283 | 0.429 ± 0.077 | 3.027 ± 0.263 | 3.067 ± 0.258 | 1.984 ± 0.19 | 3.272 ± 0.262 | 0.92 ± 0.14 | 2.147 ± 0.186 | 2.761 ± 0.27 | 2.863 ± 0.225 | 1.207 ± 0.151 | 2.147 ± 0.241 | 1.84 ± 0.235 | 1.227 ± 0.168 | 1.738 ± 0.211 | 2.679 ± 0.213 | 2.822 ± 0.262 | 3.456 ± 0.286 | 0.736 ± 0.118 | 1.493 ± 0.189 | 0.0 ± 0.0 |
| Gln | 3.967 ± 0.339 | 0.327 ± 0.089 | 1.984 ± 0.216 | 2.106 ± 0.173 | 1.35 ± 0.168 | 2.679 ± 0.205 | 0.9 ± 0.141 | 2.147 ± 0.204 | 1.943 ± 0.176 | 3.681 ± 0.334 | 1.534 ± 0.171 | 1.452 ± 0.162 | 1.493 ± 0.165 | 1.84 ± 0.195 | 2.679 ± 0.296 | 2.393 ± 0.231 | 2.209 ± 0.266 | 3.108 ± 0.245 | 0.9 ± 0.145 | 1.288 ± 0.174 | 0.0 ± 0.0 |
| Arg | 4.417 ± 0.299 | 0.593 ± 0.121 | 2.883 ± 0.233 | 3.66 ± 0.287 | 2.413 ± 0.236 | 3.681 ± 0.323 | 1.452 ± 0.185 | 2.883 ± 0.249 | 3.333 ± 0.292 | 4.765 ± 0.328 | 1.922 ± 0.217 | 2.413 ± 0.238 | 2.311 ± 0.233 | 2.331 ± 0.245 | 3.374 ± 0.306 | 3.272 ± 0.234 | 2.658 ± 0.239 | 3.783 ± 0.243 | 1.084 ± 0.16 | 2.209 ± 0.217 | 0.0 ± 0.0 |
| Ser | 4.622 ± 0.328 | 0.552 ± 0.095 | 3.579 ± 0.221 | 3.067 ± 0.238 | 1.922 ± 0.204 | 4.213 ± 0.347 | 1.431 ± 0.166 | 3.742 ± 0.271 | 4.069 ± 0.319 | 5.337 ± 0.365 | 2.188 ± 0.254 | 2.761 ± 0.234 | 2.27 ± 0.243 | 2.372 ± 0.204 | 3.108 ± 0.265 | 3.456 ± 0.319 | 3.17 ± 0.256 | 4.54 ± 0.305 | 1.002 ± 0.148 | 1.881 ± 0.195 | 0.0 ± 0.0 |
| Thr | 4.274 ± 0.324 | 0.389 ± 0.074 | 2.781 ± 0.232 | 3.681 ± 0.277 | 2.352 ± 0.218 | 4.213 ± 0.286 | 1.186 ± 0.171 | 3.231 ± 0.269 | 3.149 ± 0.24 | 4.683 ± 0.269 | 1.84 ± 0.172 | 2.781 ± 0.234 | 3.64 ± 0.261 | 2.372 ± 0.215 | 3.108 ± 0.263 | 2.965 ± 0.295 | 3.19 ± 0.354 | 3.824 ± 0.269 | 1.002 ± 0.128 | 2.045 ± 0.25 | 0.0 ± 0.0 |
| Val | 5.337 ± 0.378 | 1.084 ± 0.154 | 4.642 ± 0.346 | 4.724 ± 0.319 | 2.495 ± 0.234 | 4.253 ± 0.359 | 1.166 ± 0.151 | 3.865 ± 0.345 | 4.54 ± 0.325 | 5.133 ± 0.315 | 2.249 ± 0.195 | 3.19 ± 0.297 | 2.924 ± 0.236 | 2.72 ± 0.24 | 3.436 ± 0.277 | 4.622 ± 0.305 | 4.397 ± 0.333 | 4.56 ± 0.317 | 1.145 ± 0.148 | 2.127 ± 0.21 | 0.0 ± 0.0 |
| Trp | 1.431 ± 0.181 | 0.307 ± 0.077 | 1.084 ± 0.163 | 0.961 ± 0.112 | 0.818 ± 0.159 | 1.125 ± 0.147 | 0.429 ± 0.1 | 0.654 ± 0.116 | 0.941 ± 0.163 | 1.82 ± 0.212 | 0.47 ± 0.104 | 0.593 ± 0.106 | 0.389 ± 0.101 | 0.45 ± 0.086 | 1.186 ± 0.15 | 1.288 ± 0.154 | 1.268 ± 0.141 | 1.35 ± 0.162 | 0.409 ± 0.109 | 0.695 ± 0.124 | 0.0 ± 0.0 |
| Tyr | 2.536 ± 0.227 | 0.204 ± 0.062 | 2.618 ± 0.218 | 2.004 ± 0.208 | 1.084 ± 0.172 | 2.413 ± 0.208 | 1.063 ± 0.168 | 1.759 ± 0.169 | 1.656 ± 0.189 | 2.372 ± 0.191 | 1.268 ± 0.143 | 1.902 ± 0.215 | 1.247 ± 0.167 | 1.575 ± 0.191 | 2.004 ± 0.24 | 1.922 ± 0.179 | 2.393 ± 0.268 | 2.086 ± 0.176 | 0.573 ± 0.092 | 1.043 ± 0.137 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 236 proteins (48902 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
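For readers unfamiliar with protein-level bootstrapping, the idea can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is only a sketch under stated assumptions, not the procedure used by Proteome-pI: the helper names, the use of the sample standard deviation as the error, and the normalisation by the number of overlapping pairs are all choices made for this example.

```python
import random
import statistics


def dipeptide_per_mille_single(proteins, dipeptide):
    """Per-mille frequency of one dipeptide given in one-letter codes, e.g. "AA"."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += int(seq[i:i + 2] == dipeptide)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille_single(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences, not real phage proteins.
proteins = ["MAAK", "MKAA", "MAAAA", "MKKK"]
err = bootstrap_error(proteins, "AA")
print(round(dipeptide_per_mille_single(proteins, "AA"), 3), "±", round(err, 3))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimated error reflects variation between proteins.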

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski