Amino acid dipeptide frequency for Klebsiella phage vB_KpnM_KpS110

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
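As a minimal sketch of how such a frequency could be computed (this is not the Proteome-pI code; the function name `dipeptide_permille`, the one-letter codes, the toy sequences, and the choice to normalise by the total number of pairs are all assumptions made for illustration), overlapping residue pairs can be counted within each protein and expressed in per mille:

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Pairs are counted within each protein only, never across protein
    boundaries. Normalising by the total number of pairs is an assumption;
    the database may use a different denominator.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Hypothetical toy input, not taken from the phage proteome.
freqs = dipeptide_permille(["MAAK", "AAL"])
```

With the toy input, "AA" accounts for two of the five counted pairs. The table on this page uses three-letter codes, so a real implementation would map one-letter codes to them before reporting.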

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
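The comparison against the uniform expectation can be sketched as follows. The function name `classify` and the exact threshold rule are assumptions: the page only states that over- and underrepresented dipeptides are coloured, not how ties or the error margins are handled. The three example values are copied from the table.

```python
# Uniform expectation over the 20 x 20 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # = 2.5


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # would be marked red
    if freq_permille < expected:
        return "under"  # would be marked blue
    return "as expected"


# Values taken from the table (per mille); errors are ignored in this sketch.
examples = {"LeuLys": 6.091, "AlaPhe": 2.59, "TrpHis": 0.124}
labels = {dp: classify(value) for dp, value in examples.items()}
```

A stricter rule might require the difference from 2.5‰ to exceed the reported error before a dipeptide is labelled.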

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (for example, 5.511‰ corresponds to 0.005511, or 0.5511%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.511 ± 0.411
AlaCys: 0.559 ± 0.105
AlaAsp: 4.268 ± 0.334
AlaGlu: 4.392 ± 0.4
AlaPhe: 2.59 ± 0.217
AlaGly: 4.807 ± 0.381
AlaHis: 1.16 ± 0.162
AlaIle: 4.392 ± 0.29
AlaLys: 4.289 ± 0.366
AlaLeu: 5.18 ± 0.322
AlaMet: 1.782 ± 0.208
AlaAsn: 3.273 ± 0.271
AlaPro: 2.383 ± 0.31
AlaGln: 2.528 ± 0.272
AlaArg: 3.315 ± 0.257
AlaSer: 4.019 ± 0.324
AlaThr: 3.75 ± 0.284
AlaVal: 4.848 ± 0.297
AlaTrp: 0.912 ± 0.135
AlaTyr: 2.486 ± 0.234
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.787 ± 0.12
CysCys: 0.186 ± 0.061
CysAsp: 0.725 ± 0.139
CysGlu: 0.746 ± 0.137
CysPhe: 0.331 ± 0.093
CysGly: 0.849 ± 0.14
CysHis: 0.477 ± 0.117
CysIle: 0.849 ± 0.102
CysLys: 0.767 ± 0.156
CysLeu: 0.891 ± 0.158
CysMet: 0.373 ± 0.094
CysAsn: 0.622 ± 0.113
CysPro: 0.518 ± 0.102
CysGln: 0.331 ± 0.081
CysArg: 0.663 ± 0.107
CysSer: 0.932 ± 0.136
CysThr: 0.539 ± 0.125
CysVal: 1.015 ± 0.177
CysTrp: 0.186 ± 0.061
CysTyr: 0.394 ± 0.099
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.827 ± 0.331
AspCys: 0.829 ± 0.146
AspAsp: 4.309 ± 0.392
AspGlu: 3.688 ± 0.322
AspPhe: 3.253 ± 0.233
AspGly: 4.599 ± 0.346
AspHis: 1.222 ± 0.175
AspIle: 4.247 ± 0.363
AspLys: 3.916 ± 0.316
AspLeu: 5.49 ± 0.376
AspMet: 2.113 ± 0.238
AspAsn: 2.963 ± 0.241
AspPro: 3.128 ± 0.284
AspGln: 2.093 ± 0.224
AspArg: 2.238 ± 0.242
AspSer: 3.936 ± 0.273
AspThr: 3.377 ± 0.269
AspVal: 4.309 ± 0.298
AspTrp: 1.057 ± 0.16
AspTyr: 3.273 ± 0.309
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.123 ± 0.398
GluCys: 0.746 ± 0.134
GluAsp: 4.247 ± 0.305
GluGlu: 4.185 ± 0.37
GluPhe: 2.797 ± 0.282
GluGly: 4.226 ± 0.316
GluHis: 1.45 ± 0.193
GluIle: 4.599 ± 0.324
GluLys: 4.33 ± 0.301
GluLeu: 5.905 ± 0.367
GluMet: 2.32 ± 0.215
GluAsn: 2.631 ± 0.248
GluPro: 1.968 ± 0.185
GluGln: 2.569 ± 0.257
GluArg: 3.771 ± 0.313
GluSer: 3.543 ± 0.265
GluThr: 3.584 ± 0.249
GluVal: 4.185 ± 0.309
GluTrp: 0.994 ± 0.12
GluTyr: 2.983 ± 0.274
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.424 ± 0.231
PheCys: 0.58 ± 0.116
PheAsp: 2.88 ± 0.278
PheGlu: 2.901 ± 0.27
PhePhe: 1.699 ± 0.234
PheGly: 2.983 ± 0.222
PheHis: 0.725 ± 0.109
PheIle: 2.569 ± 0.233
PheLys: 2.673 ± 0.214
PheLeu: 3.108 ± 0.291
PheMet: 1.388 ± 0.198
PheAsn: 2.59 ± 0.239
PhePro: 1.533 ± 0.191
PheGln: 1.678 ± 0.177
PheArg: 2.258 ± 0.219
PheSer: 3.273 ± 0.263
PheThr: 2.838 ± 0.205
PheVal: 2.631 ± 0.216
PheTrp: 0.849 ± 0.133
PheTyr: 1.45 ± 0.148
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.791 ± 0.28
GlyCys: 1.098 ± 0.157
GlyAsp: 3.791 ± 0.283
GlyGlu: 4.724 ± 0.319
GlyPhe: 2.901 ± 0.263
GlyGly: 4.599 ± 0.428
GlyHis: 1.119 ± 0.153
GlyIle: 4.889 ± 0.356
GlyLys: 5.573 ± 0.356
GlyLeu: 4.869 ± 0.334
GlyMet: 1.865 ± 0.179
GlyAsn: 3.273 ± 0.275
GlyPro: 1.367 ± 0.163
GlyGln: 2.631 ± 0.209
GlyArg: 3.066 ± 0.26
GlySer: 4.081 ± 0.312
GlyThr: 3.709 ± 0.295
GlyVal: 5.117 ± 0.383
GlyTrp: 1.45 ± 0.169
GlyTyr: 2.631 ± 0.245
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.725 ± 0.143
HisCys: 0.331 ± 0.09
HisAsp: 1.036 ± 0.163
HisGlu: 0.849 ± 0.132
HisPhe: 0.912 ± 0.116
HisGly: 1.243 ± 0.191
HisHis: 0.373 ± 0.098
HisIle: 1.326 ± 0.168
HisLys: 1.388 ± 0.228
HisLeu: 1.72 ± 0.166
HisMet: 0.622 ± 0.115
HisAsn: 0.767 ± 0.135
HisPro: 1.036 ± 0.156
HisGln: 0.746 ± 0.129
HisArg: 0.932 ± 0.141
HisSer: 0.994 ± 0.161
HisThr: 0.974 ± 0.153
HisVal: 1.243 ± 0.189
HisTrp: 0.166 ± 0.061
HisTyr: 0.829 ± 0.129
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.144 ± 0.262
IleCys: 0.829 ± 0.123
IleAsp: 5.076 ± 0.383
IleGlu: 5.034 ± 0.337
IlePhe: 1.72 ± 0.233
IleGly: 4.123 ± 0.364
IleHis: 1.098 ± 0.164
IleIle: 3.709 ± 0.317
IleLys: 3.957 ± 0.288
IleLeu: 4.517 ± 0.31
IleMet: 1.492 ± 0.155
IleAsn: 3.605 ± 0.295
IlePro: 3.232 ± 0.271
IleGln: 2.652 ± 0.243
IleArg: 2.963 ± 0.222
IleSer: 4.185 ± 0.335
IleThr: 3.916 ± 0.346
IleVal: 3.75 ± 0.297
IleTrp: 0.974 ± 0.142
IleTyr: 2.383 ± 0.234
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.33 ± 0.391
LysCys: 0.642 ± 0.136
LysAsp: 4.061 ± 0.288
LysGlu: 4.33 ± 0.331
LysPhe: 3.211 ± 0.225
LysGly: 4.019 ± 0.309
LysHis: 1.015 ± 0.138
LysIle: 4.164 ± 0.281
LysLys: 4.123 ± 0.335
LysLeu: 4.475 ± 0.226
LysMet: 2.631 ± 0.234
LysAsn: 3.046 ± 0.245
LysPro: 2.528 ± 0.226
LysGln: 2.631 ± 0.235
LysArg: 3.149 ± 0.26
LysSer: 4.185 ± 0.319
LysThr: 4.226 ± 0.265
LysVal: 4.289 ± 0.331
LysTrp: 0.994 ± 0.143
LysTyr: 2.859 ± 0.247
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.49 ± 0.32
LeuCys: 0.767 ± 0.129
LeuAsp: 5.159 ± 0.329
LeuGlu: 5.428 ± 0.367
LeuPhe: 3.564 ± 0.269
LeuGly: 5.097 ± 0.349
LeuHis: 1.347 ± 0.179
LeuIle: 4.206 ± 0.304
LeuLys: 6.091 ± 0.407
LeuLeu: 5.884 ± 0.369
LeuMet: 1.906 ± 0.195
LeuAsn: 4.579 ± 0.312
LeuPro: 3.232 ± 0.267
LeuGln: 2.652 ± 0.232
LeuArg: 3.895 ± 0.254
LeuSer: 5.925 ± 0.359
LeuThr: 4.641 ± 0.319
LeuVal: 5.78 ± 0.316
LeuTrp: 0.849 ± 0.149
LeuTyr: 3.087 ± 0.235
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.424 ± 0.193
MetCys: 0.249 ± 0.081
MetAsp: 1.699 ± 0.191
MetGlu: 1.575 ± 0.179
MetPhe: 1.554 ± 0.191
MetGly: 1.512 ± 0.156
MetHis: 0.394 ± 0.087
MetIle: 1.844 ± 0.192
MetLys: 2.445 ± 0.227
MetLeu: 2.238 ± 0.194
MetMet: 0.767 ± 0.116
MetAsn: 1.595 ± 0.207
MetPro: 0.932 ± 0.155
MetGln: 0.994 ± 0.135
MetArg: 1.678 ± 0.215
MetSer: 2.175 ± 0.191
MetThr: 1.575 ± 0.209
MetVal: 1.761 ± 0.21
MetTrp: 0.331 ± 0.074
MetTyr: 0.932 ± 0.136
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.916 ± 0.288
AsnCys: 0.663 ± 0.116
AsnAsp: 2.88 ± 0.209
AsnGlu: 2.465 ± 0.262
AsnPhe: 2.238 ± 0.177
AsnGly: 3.895 ± 0.268
AsnHis: 0.932 ± 0.162
AsnIle: 3.273 ± 0.268
AsnLys: 3.336 ± 0.262
AsnLeu: 4.04 ± 0.356
AsnMet: 1.72 ± 0.207
AsnAsn: 3.646 ± 0.285
AsnPro: 2.445 ± 0.216
AsnGln: 1.968 ± 0.186
AsnArg: 2.673 ± 0.24
AsnSer: 3.418 ± 0.281
AsnThr: 3.128 ± 0.266
AsnVal: 3.543 ± 0.3
AsnTrp: 0.684 ± 0.135
AsnTyr: 1.616 ± 0.206
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.735 ± 0.229
ProCys: 0.559 ± 0.1
ProAsp: 2.88 ± 0.236
ProGlu: 3.232 ± 0.291
ProPhe: 1.637 ± 0.197
ProGly: 2.528 ± 0.225
ProHis: 0.725 ± 0.146
ProIle: 1.948 ± 0.218
ProLys: 2.32 ± 0.211
ProLeu: 3.17 ± 0.256
ProMet: 0.912 ± 0.164
ProAsn: 1.72 ± 0.199
ProPro: 0.932 ± 0.172
ProGln: 1.409 ± 0.162
ProArg: 1.699 ± 0.199
ProSer: 2.445 ± 0.249
ProThr: 2.486 ± 0.206
ProVal: 2.88 ± 0.255
ProTrp: 0.58 ± 0.104
ProTyr: 1.492 ± 0.201
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.528 ± 0.23
GlnCys: 0.497 ± 0.1
GlnAsp: 2.217 ± 0.19
GlnGlu: 2.61 ± 0.351
GlnPhe: 2.01 ± 0.187
GlnGly: 2.196 ± 0.218
GlnHis: 0.663 ± 0.134
GlnIle: 2.838 ± 0.244
GlnLys: 2.134 ± 0.218
GlnLeu: 3.232 ± 0.313
GlnMet: 1.16 ± 0.172
GlnAsn: 1.823 ± 0.178
GlnPro: 1.243 ± 0.171
GlnGln: 1.616 ± 0.229
GlnArg: 1.72 ± 0.199
GlnSer: 2.32 ± 0.24
GlnThr: 2.528 ± 0.246
GlnVal: 2.424 ± 0.243
GlnTrp: 0.559 ± 0.103
GlnTyr: 1.471 ± 0.146
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.735 ± 0.258
ArgCys: 0.725 ± 0.126
ArgAsp: 3.004 ± 0.263
ArgGlu: 3.191 ± 0.272
ArgPhe: 2.113 ± 0.21
ArgGly: 3.066 ± 0.306
ArgHis: 0.974 ± 0.133
ArgIle: 3.356 ± 0.263
ArgLys: 2.818 ± 0.278
ArgLeu: 4.703 ± 0.298
ArgMet: 1.305 ± 0.165
ArgAsn: 2.383 ± 0.21
ArgPro: 1.575 ± 0.226
ArgGln: 2.093 ± 0.23
ArgArg: 2.735 ± 0.259
ArgSer: 3.356 ± 0.254
ArgThr: 2.445 ± 0.226
ArgVal: 3.128 ± 0.227
ArgTrp: 0.787 ± 0.137
ArgTyr: 2.383 ± 0.228
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.501 ± 0.322
SerCys: 0.622 ± 0.126
SerAsp: 3.791 ± 0.262
SerGlu: 3.999 ± 0.304
SerPhe: 2.942 ± 0.202
SerGly: 5.014 ± 0.355
SerHis: 1.119 ± 0.156
SerIle: 4.102 ± 0.28
SerLys: 3.605 ± 0.246
SerLeu: 5.78 ± 0.393
SerMet: 1.761 ± 0.168
SerAsn: 4.061 ± 0.349
SerPro: 2.507 ± 0.296
SerGln: 2.963 ± 0.234
SerArg: 2.942 ± 0.243
SerSer: 4.475 ± 0.429
SerThr: 4.206 ± 0.359
SerVal: 4.372 ± 0.302
SerTrp: 0.849 ± 0.139
SerTyr: 2.528 ± 0.243
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.061 ± 0.315
ThrCys: 0.497 ± 0.095
ThrAsp: 3.543 ± 0.291
ThrGlu: 3.191 ± 0.305
ThrPhe: 2.465 ± 0.223
ThrGly: 4.019 ± 0.363
ThrHis: 0.87 ± 0.144
ThrIle: 3.957 ± 0.303
ThrLys: 3.273 ± 0.203
ThrLeu: 4.682 ± 0.335
ThrMet: 1.492 ± 0.191
ThrAsn: 3.211 ± 0.259
ThrPro: 3.294 ± 0.251
ThrGln: 2.072 ± 0.18
ThrArg: 2.859 ± 0.249
ThrSer: 4.185 ± 0.373
ThrThr: 4.123 ± 0.365
ThrVal: 4.703 ± 0.407
ThrTrp: 0.808 ± 0.142
ThrTyr: 2.196 ± 0.274
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.807 ± 0.283
ValCys: 0.746 ± 0.138
ValAsp: 5.49 ± 0.348
ValGlu: 4.786 ± 0.306
ValPhe: 2.693 ± 0.246
ValGly: 4.102 ± 0.304
ValHis: 1.347 ± 0.187
ValIle: 4.185 ± 0.265
ValLys: 4.807 ± 0.334
ValLeu: 5.2 ± 0.343
ValMet: 1.74 ± 0.141
ValAsn: 3.46 ± 0.287
ValPro: 2.714 ± 0.263
ValGln: 2.383 ± 0.228
ValArg: 3.439 ± 0.285
ValSer: 4.268 ± 0.323
ValThr: 4.351 ± 0.433
ValVal: 5.428 ± 0.374
ValTrp: 1.098 ± 0.152
ValTyr: 2.942 ± 0.23
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.098 ± 0.17
TrpCys: 0.331 ± 0.085
TrpAsp: 0.974 ± 0.153
TrpGlu: 1.347 ± 0.17
TrpPhe: 0.725 ± 0.114
TrpGly: 0.953 ± 0.119
TrpHis: 0.124 ± 0.049
TrpIle: 0.663 ± 0.103
TrpLys: 0.932 ± 0.14
TrpLeu: 1.305 ± 0.153
TrpMet: 0.394 ± 0.101
TrpAsn: 0.849 ± 0.128
TrpPro: 0.311 ± 0.084
TrpGln: 0.373 ± 0.102
TrpArg: 0.932 ± 0.118
TrpSer: 0.642 ± 0.106
TrpThr: 0.725 ± 0.126
TrpVal: 1.45 ± 0.169
TrpTrp: 0.228 ± 0.058
TrpTyr: 0.539 ± 0.106
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.486 ± 0.21
TyrCys: 0.725 ± 0.12
TyrAsp: 2.838 ± 0.267
TyrGlu: 2.486 ± 0.256
TyrPhe: 1.678 ± 0.196
TyrGly: 2.714 ± 0.24
TyrHis: 1.077 ± 0.18
TyrIle: 2.238 ± 0.222
TyrLys: 2.134 ± 0.222
TyrLeu: 3.211 ± 0.259
TyrMet: 0.912 ± 0.136
TyrAsn: 2.383 ± 0.193
TyrPro: 1.409 ± 0.138
TyrGln: 1.409 ± 0.197
TyrArg: 1.948 ± 0.224
TyrSer: 2.776 ± 0.23
TyrThr: 2.383 ± 0.345
TyrVal: 3.128 ± 0.236
TyrTrp: 0.539 ± 0.116
TyrTyr: 1.616 ± 0.177
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 201 proteins (48268 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
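One way such an error could be obtained is sketched below. The page only states that bootstrapping was done 100 times at the protein level; taking the sample standard deviation of the resampled estimates as the reported ± value, normalising by the number of pairs, and the names `pair_counts` and `bootstrap_error` are assumptions made for illustration.

```python
import random
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Std. dev. (per mille) of a dipeptide frequency over bootstrap replicates.

    Whole proteins are resampled with replacement, so each replicate has
    the same number of proteins as the original set. Requires n_boot >= 2.
    """
    rng = random.Random(seed)
    # Precompute (hits, number of pairs) once per protein.
    per_protein = [(pair_counts(s)[dipeptide], max(len(s) - 1, 0))
                   for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(n for _, n in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    mean = sum(estimates) / n_boot
    variance = sum((e - mean) ** 2 for e in estimates) / (n_boot - 1)
    return variance ** 0.5


# Hypothetical toy proteins, not taken from the phage proteome.
err = bootstrap_error(["MAAK", "AAL", "GGGG"], "AA")
```

Resampling whole proteins rather than individual residues keeps the dependence between neighbouring positions within a protein intact, which is presumably why the error is estimated at the protein level.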

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski