Amino acid dipepetide frequency for Klebsiella phage KMI9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.955AlaAla: 3.955 ± 0.338
0.45AlaCys: 0.45 ± 0.093
3.749AlaAsp: 3.749 ± 0.293
4.78AlaGlu: 4.78 ± 0.355
2.774AlaPhe: 2.774 ± 0.192
4.03AlaGly: 4.03 ± 0.384
1.218AlaHis: 1.218 ± 0.172
5.155AlaIle: 5.155 ± 0.335
5.342AlaLys: 5.342 ± 0.354
5.492AlaLeu: 5.492 ± 0.299
2.118AlaMet: 2.118 ± 0.239
3.561AlaAsn: 3.561 ± 0.252
2.062AlaPro: 2.062 ± 0.188
2.268AlaGln: 2.268 ± 0.244
3.093AlaArg: 3.093 ± 0.247
3.768AlaSer: 3.768 ± 0.299
4.274AlaThr: 4.274 ± 0.456
4.367AlaVal: 4.367 ± 0.256
0.769AlaTrp: 0.769 ± 0.144
3.205AlaTyr: 3.205 ± 0.231
0.0AlaXaa: 0.0 ± 0.0
Cys
0.862CysAla: 0.862 ± 0.141
0.169CysCys: 0.169 ± 0.057
0.694CysAsp: 0.694 ± 0.117
0.619CysGlu: 0.619 ± 0.111
0.562CysPhe: 0.562 ± 0.105
0.9CysGly: 0.9 ± 0.124
0.262CysHis: 0.262 ± 0.074
0.6CysIle: 0.6 ± 0.099
0.993CysLys: 0.993 ± 0.136
0.918CysLeu: 0.918 ± 0.106
0.337CysMet: 0.337 ± 0.079
0.562CysAsn: 0.562 ± 0.126
0.562CysPro: 0.562 ± 0.108
0.281CysGln: 0.281 ± 0.078
0.581CysArg: 0.581 ± 0.099
0.712CysSer: 0.712 ± 0.127
0.712CysThr: 0.712 ± 0.111
0.75CysVal: 0.75 ± 0.122
0.15CysTrp: 0.15 ± 0.054
0.394CysTyr: 0.394 ± 0.087
0.0CysXaa: 0.0 ± 0.0
Asp
4.18AspAla: 4.18 ± 0.299
0.731AspCys: 0.731 ± 0.102
3.955AspAsp: 3.955 ± 0.336
4.855AspGlu: 4.855 ± 0.358
3.074AspPhe: 3.074 ± 0.245
4.461AspGly: 4.461 ± 0.353
0.937AspHis: 0.937 ± 0.12
5.08AspIle: 5.08 ± 0.323
4.311AspLys: 4.311 ± 0.299
5.061AspLeu: 5.061 ± 0.321
1.856AspMet: 1.856 ± 0.201
3.205AspAsn: 3.205 ± 0.27
2.83AspPro: 2.83 ± 0.217
1.743AspGln: 1.743 ± 0.143
3.074AspArg: 3.074 ± 0.225
3.805AspSer: 3.805 ± 0.272
3.936AspThr: 3.936 ± 0.26
4.33AspVal: 4.33 ± 0.287
1.312AspTrp: 1.312 ± 0.151
3.374AspTyr: 3.374 ± 0.236
0.0AspXaa: 0.0 ± 0.0
Glu
5.267GluAla: 5.267 ± 0.381
0.937GluCys: 0.937 ± 0.13
3.561GluAsp: 3.561 ± 0.281
5.117GluGlu: 5.117 ± 0.462
2.83GluPhe: 2.83 ± 0.273
3.768GluGly: 3.768 ± 0.268
1.575GluHis: 1.575 ± 0.17
5.492GluIle: 5.492 ± 0.376
5.323GluLys: 5.323 ± 0.391
5.867GluLeu: 5.867 ± 0.327
2.099GluMet: 2.099 ± 0.216
3.486GluAsn: 3.486 ± 0.247
1.387GluPro: 1.387 ± 0.148
2.306GluGln: 2.306 ± 0.256
3.149GluArg: 3.149 ± 0.293
4.03GluSer: 4.03 ± 0.305
4.049GluThr: 4.049 ± 0.271
4.33GluVal: 4.33 ± 0.275
1.106GluTrp: 1.106 ± 0.149
2.924GluTyr: 2.924 ± 0.278
0.0GluXaa: 0.0 ± 0.0
Phe
2.418PheAla: 2.418 ± 0.244
0.487PheCys: 0.487 ± 0.086
3.655PheAsp: 3.655 ± 0.301
2.98PheGlu: 2.98 ± 0.303
1.293PhePhe: 1.293 ± 0.165
3.112PheGly: 3.112 ± 0.239
0.581PheHis: 0.581 ± 0.105
2.587PheIle: 2.587 ± 0.273
3.112PheLys: 3.112 ± 0.261
2.549PheLeu: 2.549 ± 0.218
1.2PheMet: 1.2 ± 0.146
2.268PheAsn: 2.268 ± 0.173
1.35PhePro: 1.35 ± 0.163
1.181PheGln: 1.181 ± 0.175
1.743PheArg: 1.743 ± 0.165
2.474PheSer: 2.474 ± 0.251
2.68PheThr: 2.68 ± 0.223
2.98PheVal: 2.98 ± 0.239
0.6PheTrp: 0.6 ± 0.093
1.668PheTyr: 1.668 ± 0.196
0.0PheXaa: 0.0 ± 0.0
Gly
3.974GlyAla: 3.974 ± 0.415
0.937GlyCys: 0.937 ± 0.121
4.592GlyAsp: 4.592 ± 0.332
3.824GlyGlu: 3.824 ± 0.274
3.018GlyPhe: 3.018 ± 0.254
4.068GlyGly: 4.068 ± 0.597
1.05GlyHis: 1.05 ± 0.14
3.974GlyIle: 3.974 ± 0.264
4.855GlyLys: 4.855 ± 0.297
4.499GlyLeu: 4.499 ± 0.26
1.724GlyMet: 1.724 ± 0.193
3.505GlyAsn: 3.505 ± 0.353
0.937GlyPro: 0.937 ± 0.137
1.65GlyGln: 1.65 ± 0.21
2.962GlyArg: 2.962 ± 0.213
4.705GlySer: 4.705 ± 0.394
3.899GlyThr: 3.899 ± 0.451
4.949GlyVal: 4.949 ± 0.285
0.862GlyTrp: 0.862 ± 0.127
3.43GlyTyr: 3.43 ± 0.233
0.0GlyXaa: 0.0 ± 0.0
His
1.162HisAla: 1.162 ± 0.148
0.281HisCys: 0.281 ± 0.08
1.125HisAsp: 1.125 ± 0.163
1.143HisGlu: 1.143 ± 0.156
0.75HisPhe: 0.75 ± 0.106
1.05HisGly: 1.05 ± 0.145
0.412HisHis: 0.412 ± 0.099
1.312HisIle: 1.312 ± 0.164
1.162HisLys: 1.162 ± 0.168
1.462HisLeu: 1.462 ± 0.139
0.356HisMet: 0.356 ± 0.092
0.862HisAsn: 0.862 ± 0.126
1.031HisPro: 1.031 ± 0.136
0.656HisGln: 0.656 ± 0.112
0.9HisArg: 0.9 ± 0.152
0.993HisSer: 0.993 ± 0.171
0.806HisThr: 0.806 ± 0.104
1.143HisVal: 1.143 ± 0.125
0.225HisTrp: 0.225 ± 0.062
0.787HisTyr: 0.787 ± 0.115
0.0HisXaa: 0.0 ± 0.0
Ile
4.874IleAla: 4.874 ± 0.399
0.637IleCys: 0.637 ± 0.094
5.436IleAsp: 5.436 ± 0.331
4.817IleGlu: 4.817 ± 0.313
2.287IlePhe: 2.287 ± 0.209
3.786IleGly: 3.786 ± 0.279
1.218IleHis: 1.218 ± 0.158
3.936IleIle: 3.936 ± 0.285
5.08IleLys: 5.08 ± 0.337
4.03IleLeu: 4.03 ± 0.313
2.174IleMet: 2.174 ± 0.2
3.899IleAsn: 3.899 ± 0.249
2.83IlePro: 2.83 ± 0.25
2.324IleGln: 2.324 ± 0.213
3.899IleArg: 3.899 ± 0.274
3.918IleSer: 3.918 ± 0.296
4.667IleThr: 4.667 ± 0.267
5.211IleVal: 5.211 ± 0.343
0.675IleTrp: 0.675 ± 0.097
2.812IleTyr: 2.812 ± 0.244
0.0IleXaa: 0.0 ± 0.0
Lys
5.717LysAla: 5.717 ± 0.36
0.769LysCys: 0.769 ± 0.15
4.199LysAsp: 4.199 ± 0.296
5.698LysGlu: 5.698 ± 0.445
3.018LysPhe: 3.018 ± 0.24
4.574LysGly: 4.574 ± 0.332
1.575LysHis: 1.575 ± 0.185
4.761LysIle: 4.761 ± 0.302
5.248LysLys: 5.248 ± 0.415
5.755LysLeu: 5.755 ± 0.385
2.268LysMet: 2.268 ± 0.195
4.255LysAsn: 4.255 ± 0.252
2.605LysPro: 2.605 ± 0.223
2.943LysGln: 2.943 ± 0.255
3.711LysArg: 3.711 ± 0.308
3.711LysSer: 3.711 ± 0.277
4.442LysThr: 4.442 ± 0.311
4.442LysVal: 4.442 ± 0.296
0.918LysTrp: 0.918 ± 0.128
3.599LysTyr: 3.599 ± 0.283
0.0LysXaa: 0.0 ± 0.0
Leu
5.005LeuAla: 5.005 ± 0.332
0.975LeuCys: 0.975 ± 0.15
5.455LeuAsp: 5.455 ± 0.423
4.892LeuGlu: 4.892 ± 0.348
3.093LeuPhe: 3.093 ± 0.276
4.068LeuGly: 4.068 ± 0.273
1.312LeuHis: 1.312 ± 0.135
4.517LeuIle: 4.517 ± 0.319
5.455LeuLys: 5.455 ± 0.388
4.911LeuLeu: 4.911 ± 0.301
2.643LeuMet: 2.643 ± 0.212
4.292LeuAsn: 4.292 ± 0.286
3.337LeuPro: 3.337 ± 0.202
2.381LeuGln: 2.381 ± 0.242
3.974LeuArg: 3.974 ± 0.244
5.417LeuSer: 5.417 ± 0.317
4.405LeuThr: 4.405 ± 0.328
4.536LeuVal: 4.536 ± 0.299
0.769LeuTrp: 0.769 ± 0.111
3.411LeuTyr: 3.411 ± 0.255
0.0LeuXaa: 0.0 ± 0.0
Met
1.893MetAla: 1.893 ± 0.187
0.244MetCys: 0.244 ± 0.062
1.874MetAsp: 1.874 ± 0.219
1.724MetGlu: 1.724 ± 0.181
1.106MetPhe: 1.106 ± 0.149
1.537MetGly: 1.537 ± 0.155
0.469MetHis: 0.469 ± 0.087
2.287MetIle: 2.287 ± 0.194
2.905MetLys: 2.905 ± 0.249
2.231MetLeu: 2.231 ± 0.228
0.9MetMet: 0.9 ± 0.126
2.212MetAsn: 2.212 ± 0.211
0.731MetPro: 0.731 ± 0.118
1.256MetGln: 1.256 ± 0.14
1.125MetArg: 1.125 ± 0.145
1.912MetSer: 1.912 ± 0.202
1.593MetThr: 1.593 ± 0.18
1.687MetVal: 1.687 ± 0.178
0.469MetTrp: 0.469 ± 0.093
1.162MetTyr: 1.162 ± 0.142
0.0MetXaa: 0.0 ± 0.0
Asn
3.843AsnAla: 3.843 ± 0.315
0.6AsnCys: 0.6 ± 0.113
3.262AsnAsp: 3.262 ± 0.255
3.711AsnGlu: 3.711 ± 0.236
2.043AsnPhe: 2.043 ± 0.242
4.461AsnGly: 4.461 ± 0.336
1.012AsnHis: 1.012 ± 0.148
3.468AsnIle: 3.468 ± 0.231
3.543AsnLys: 3.543 ± 0.317
4.292AsnLeu: 4.292 ± 0.293
1.443AsnMet: 1.443 ± 0.131
3.093AsnAsn: 3.093 ± 0.27
2.737AsnPro: 2.737 ± 0.234
1.874AsnGln: 1.874 ± 0.189
2.68AsnArg: 2.68 ± 0.195
3.618AsnSer: 3.618 ± 0.272
3.318AsnThr: 3.318 ± 0.286
3.843AsnVal: 3.843 ± 0.301
0.356AsnTrp: 0.356 ± 0.082
1.724AsnTyr: 1.724 ± 0.189
0.0AsnXaa: 0.0 ± 0.0
Pro
2.081ProAla: 2.081 ± 0.229
0.431ProCys: 0.431 ± 0.085
3.13ProAsp: 3.13 ± 0.24
2.755ProGlu: 2.755 ± 0.299
1.575ProPhe: 1.575 ± 0.203
2.268ProGly: 2.268 ± 0.3
0.525ProHis: 0.525 ± 0.107
2.062ProIle: 2.062 ± 0.199
2.381ProLys: 2.381 ± 0.251
2.605ProLeu: 2.605 ± 0.204
0.712ProMet: 0.712 ± 0.111
1.706ProAsn: 1.706 ± 0.181
1.162ProPro: 1.162 ± 0.165
1.05ProGln: 1.05 ± 0.115
1.5ProArg: 1.5 ± 0.174
2.156ProSer: 2.156 ± 0.198
2.231ProThr: 2.231 ± 0.223
3.13ProVal: 3.13 ± 0.268
0.45ProTrp: 0.45 ± 0.094
1.556ProTyr: 1.556 ± 0.152
0.0ProXaa: 0.0 ± 0.0
Gln
2.493GlnAla: 2.493 ± 0.257
0.375GlnCys: 0.375 ± 0.09
1.668GlnAsp: 1.668 ± 0.192
2.118GlnGlu: 2.118 ± 0.196
1.35GlnPhe: 1.35 ± 0.157
1.612GlnGly: 1.612 ± 0.186
0.431GlnHis: 0.431 ± 0.093
2.849GlnIle: 2.849 ± 0.248
2.699GlnLys: 2.699 ± 0.272
2.774GlnLeu: 2.774 ± 0.25
1.05GlnMet: 1.05 ± 0.135
1.406GlnAsn: 1.406 ± 0.162
0.862GlnPro: 0.862 ± 0.112
1.312GlnGln: 1.312 ± 0.165
1.837GlnArg: 1.837 ± 0.185
1.668GlnSer: 1.668 ± 0.165
1.856GlnThr: 1.856 ± 0.174
2.418GlnVal: 2.418 ± 0.18
0.412GlnTrp: 0.412 ± 0.108
1.668GlnTyr: 1.668 ± 0.159
0.0GlnXaa: 0.0 ± 0.0
Arg
2.624ArgAla: 2.624 ± 0.199
0.544ArgCys: 0.544 ± 0.105
3.168ArgAsp: 3.168 ± 0.246
3.355ArgGlu: 3.355 ± 0.284
1.874ArgPhe: 1.874 ± 0.201
3.28ArgGly: 3.28 ± 0.254
0.694ArgHis: 0.694 ± 0.098
3.693ArgIle: 3.693 ± 0.256
3.655ArgLys: 3.655 ± 0.311
3.636ArgLeu: 3.636 ± 0.223
1.387ArgMet: 1.387 ± 0.152
2.999ArgAsn: 2.999 ± 0.231
1.443ArgPro: 1.443 ± 0.169
1.575ArgGln: 1.575 ± 0.169
2.174ArgArg: 2.174 ± 0.209
2.887ArgSer: 2.887 ± 0.24
2.474ArgThr: 2.474 ± 0.23
3.243ArgVal: 3.243 ± 0.264
0.844ArgTrp: 0.844 ± 0.149
2.062ArgTyr: 2.062 ± 0.195
0.0ArgXaa: 0.0 ± 0.0
Ser
3.768SerAla: 3.768 ± 0.245
0.694SerCys: 0.694 ± 0.104
3.955SerAsp: 3.955 ± 0.254
3.524SerGlu: 3.524 ± 0.278
2.68SerPhe: 2.68 ± 0.211
4.799SerGly: 4.799 ± 0.469
1.106SerHis: 1.106 ± 0.145
4.292SerIle: 4.292 ± 0.325
4.011SerLys: 4.011 ± 0.271
4.836SerLeu: 4.836 ± 0.316
1.912SerMet: 1.912 ± 0.237
2.905SerAsn: 2.905 ± 0.251
2.549SerPro: 2.549 ± 0.213
2.024SerGln: 2.024 ± 0.213
2.699SerArg: 2.699 ± 0.241
3.674SerSer: 3.674 ± 0.367
3.468SerThr: 3.468 ± 0.35
4.48SerVal: 4.48 ± 0.272
0.825SerTrp: 0.825 ± 0.123
2.512SerTyr: 2.512 ± 0.219
0.0SerXaa: 0.0 ± 0.0
Thr
4.011ThrAla: 4.011 ± 0.33
0.562ThrCys: 0.562 ± 0.104
3.561ThrAsp: 3.561 ± 0.302
3.805ThrGlu: 3.805 ± 0.271
2.437ThrPhe: 2.437 ± 0.214
4.424ThrGly: 4.424 ± 0.426
1.162ThrHis: 1.162 ± 0.136
4.086ThrIle: 4.086 ± 0.278
3.918ThrLys: 3.918 ± 0.31
5.342ThrLeu: 5.342 ± 0.319
1.256ThrMet: 1.256 ± 0.161
3.168ThrAsn: 3.168 ± 0.327
2.812ThrPro: 2.812 ± 0.267
1.799ThrGln: 1.799 ± 0.216
2.737ThrArg: 2.737 ± 0.229
3.468ThrSer: 3.468 ± 0.332
3.337ThrThr: 3.337 ± 0.305
4.761ThrVal: 4.761 ± 0.428
0.806ThrTrp: 0.806 ± 0.132
2.324ThrTyr: 2.324 ± 0.2
0.0ThrXaa: 0.0 ± 0.0
Val
4.461ValAla: 4.461 ± 0.304
1.087ValCys: 1.087 ± 0.148
4.93ValAsp: 4.93 ± 0.296
5.155ValGlu: 5.155 ± 0.358
2.887ValPhe: 2.887 ± 0.212
3.918ValGly: 3.918 ± 0.306
0.956ValHis: 0.956 ± 0.133
4.592ValIle: 4.592 ± 0.315
5.848ValLys: 5.848 ± 0.381
4.574ValLeu: 4.574 ± 0.333
1.968ValMet: 1.968 ± 0.189
3.918ValAsn: 3.918 ± 0.298
2.249ValPro: 2.249 ± 0.21
2.118ValGln: 2.118 ± 0.193
2.924ValArg: 2.924 ± 0.22
4.555ValSer: 4.555 ± 0.271
4.068ValThr: 4.068 ± 0.321
4.555ValVal: 4.555 ± 0.297
0.993ValTrp: 0.993 ± 0.127
3.486ValTyr: 3.486 ± 0.29
0.0ValXaa: 0.0 ± 0.0
Trp
0.937TrpAla: 0.937 ± 0.128
0.187TrpCys: 0.187 ± 0.059
0.918TrpAsp: 0.918 ± 0.124
0.787TrpGlu: 0.787 ± 0.134
0.619TrpPhe: 0.619 ± 0.101
0.637TrpGly: 0.637 ± 0.116
0.187TrpHis: 0.187 ± 0.064
0.862TrpIle: 0.862 ± 0.128
1.181TrpLys: 1.181 ± 0.151
1.031TrpLeu: 1.031 ± 0.128
0.619TrpMet: 0.619 ± 0.123
0.731TrpAsn: 0.731 ± 0.108
0.206TrpPro: 0.206 ± 0.059
0.562TrpGln: 0.562 ± 0.102
0.75TrpArg: 0.75 ± 0.115
0.581TrpSer: 0.581 ± 0.105
0.694TrpThr: 0.694 ± 0.108
0.881TrpVal: 0.881 ± 0.14
0.112TrpTrp: 0.112 ± 0.041
0.694TrpTyr: 0.694 ± 0.104
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.905TyrAla: 2.905 ± 0.205
0.525TyrCys: 0.525 ± 0.094
3.224TyrAsp: 3.224 ± 0.239
2.943TyrGlu: 2.943 ± 0.228
1.65TyrPhe: 1.65 ± 0.197
2.662TyrGly: 2.662 ± 0.246
0.937TyrHis: 0.937 ± 0.144
2.905TyrIle: 2.905 ± 0.224
3.205TyrLys: 3.205 ± 0.233
2.98TyrLeu: 2.98 ± 0.228
1.275TyrMet: 1.275 ± 0.165
2.887TyrAsn: 2.887 ± 0.219
1.781TyrPro: 1.781 ± 0.197
1.556TyrGln: 1.556 ± 0.176
2.099TyrArg: 2.099 ± 0.21
2.662TyrSer: 2.662 ± 0.246
2.868TyrThr: 2.868 ± 0.276
3.205TyrVal: 3.205 ± 0.239
0.525TyrTrp: 0.525 ± 0.107
1.837TyrTyr: 1.837 ± 0.185
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 252 proteins (53350 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski