Amino acid dipeptide frequency for Klebsiella phage vB_Kpn_F48

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred in proteins with equal probability, each one would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
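The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used to build the table: it assumes that overlapping dipeptides are counted within each protein, that any residue outside the 20 standard ones is mapped to Xaa (written here as X), and that "expected" means the uniform 1/400 baseline. The function name and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted, and pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Uniform baseline: 400 equally likely dipeptides -> 0.25% = 2.5 per mille.
UNIFORM_EXPECTED = 1000.0 / 400

toy_proteins = ["MKAAL", "AAK"]  # hypothetical sequences, not from this proteome
freqs = dipeptide_permille(toy_proteins)
overrepresented = sorted(dp for dp, f in freqs.items() if f > UNIFORM_EXPECTED)
```

With real proteome data, comparing each value against `UNIFORM_EXPECTED` would give one possible way to decide which cells count as over- or underrepresented.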

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.791 ± 0.41
AlaCys: 0.456 ± 0.094
AlaAsp: 4.462 ± 0.334
AlaGlu: 5.658 ± 0.329
AlaPhe: 2.62 ± 0.207
AlaGly: 4.614 ± 0.369
AlaHis: 1.329 ± 0.163
AlaIle: 5.298 ± 0.307
AlaLys: 5.582 ± 0.388
AlaLeu: 5.867 ± 0.334
AlaMet: 1.595 ± 0.159
AlaAsn: 3.703 ± 0.26
AlaPro: 2.981 ± 0.25
AlaGln: 2.791 ± 0.234
AlaArg: 3.38 ± 0.268
AlaSer: 4.595 ± 0.314
AlaThr: 4.101 ± 0.4
AlaVal: 5.07 ± 0.347
AlaTrp: 1.139 ± 0.171
AlaTyr: 2.696 ± 0.244
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.93 ± 0.136
CysCys: 0.209 ± 0.069
CysAsp: 0.627 ± 0.104
CysGlu: 0.797 ± 0.136
CysPhe: 0.437 ± 0.091
CysGly: 0.703 ± 0.117
CysHis: 0.285 ± 0.074
CysIle: 0.816 ± 0.135
CysLys: 0.589 ± 0.091
CysLeu: 0.589 ± 0.101
CysMet: 0.437 ± 0.099
CysAsn: 0.532 ± 0.101
CysPro: 0.513 ± 0.097
CysGln: 0.38 ± 0.087
CysArg: 0.494 ± 0.086
CysSer: 0.835 ± 0.136
CysThr: 0.532 ± 0.103
CysVal: 0.57 ± 0.115
CysTrp: 0.114 ± 0.047
CysTyr: 0.494 ± 0.096
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.12 ± 0.309
AspCys: 0.703 ± 0.117
AspAsp: 4.044 ± 0.301
AspGlu: 5.089 ± 0.319
AspPhe: 2.905 ± 0.239
AspGly: 4.652 ± 0.273
AspHis: 1.025 ± 0.148
AspIle: 4.576 ± 0.298
AspLys: 4.234 ± 0.335
AspLeu: 4.861 ± 0.298
AspMet: 1.747 ± 0.2
AspAsn: 3.038 ± 0.252
AspPro: 2.355 ± 0.238
AspGln: 1.747 ± 0.16
AspArg: 2.355 ± 0.216
AspSer: 3.836 ± 0.213
AspThr: 3.266 ± 0.278
AspVal: 4.082 ± 0.239
AspTrp: 1.196 ± 0.155
AspTyr: 2.81 ± 0.312
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.494 ± 0.338
GluCys: 0.873 ± 0.13
GluAsp: 3.76 ± 0.268
GluGlu: 5.051 ± 0.359
GluPhe: 3.38 ± 0.251
GluGly: 4.139 ± 0.297
GluHis: 1.253 ± 0.173
GluIle: 5.431 ± 0.393
GluLys: 4.424 ± 0.319
GluLeu: 7.291 ± 0.485
GluMet: 2.241 ± 0.205
GluAsn: 3.114 ± 0.261
GluPro: 1.728 ± 0.184
GluGln: 2.734 ± 0.261
GluArg: 3.133 ± 0.254
GluSer: 3.608 ± 0.263
GluThr: 4.139 ± 0.237
GluVal: 5.165 ± 0.267
GluTrp: 0.987 ± 0.116
GluTyr: 3.494 ± 0.251
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.715 ± 0.289
PheCys: 0.342 ± 0.081
PheAsp: 2.81 ± 0.243
PheGlu: 3.893 ± 0.31
PhePhe: 1.405 ± 0.14
PheGly: 2.487 ± 0.2
PheHis: 0.608 ± 0.1
PheIle: 2.677 ± 0.239
PheLys: 3.931 ± 0.321
PheLeu: 2.753 ± 0.235
PheMet: 1.557 ± 0.148
PheAsn: 2.734 ± 0.217
PhePro: 1.101 ± 0.132
PheGln: 1.329 ± 0.155
PheArg: 2.146 ± 0.175
PheSer: 2.563 ± 0.229
PheThr: 2.791 ± 0.219
PheVal: 2.753 ± 0.217
PheTrp: 0.627 ± 0.119
PheTyr: 1.405 ± 0.198
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.627 ± 0.34
GlyCys: 0.627 ± 0.113
GlyAsp: 3.893 ± 0.296
GlyGlu: 3.931 ± 0.294
GlyPhe: 3.076 ± 0.237
GlyGly: 3.684 ± 0.457
GlyHis: 0.949 ± 0.14
GlyIle: 4.88 ± 0.274
GlyLys: 4.443 ± 0.299
GlyLeu: 5.108 ± 0.309
GlyMet: 1.766 ± 0.218
GlyAsn: 3.228 ± 0.406
GlyPro: 1.614 ± 0.189
GlyGln: 2.241 ± 0.203
GlyArg: 2.943 ± 0.235
GlySer: 4.481 ± 0.318
GlyThr: 4.348 ± 0.397
GlyVal: 4.196 ± 0.3
GlyTrp: 1.101 ± 0.137
GlyTyr: 2.848 ± 0.212
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.177 ± 0.155
HisCys: 0.304 ± 0.087
HisAsp: 1.215 ± 0.138
HisGlu: 1.196 ± 0.159
HisPhe: 0.911 ± 0.139
HisGly: 1.31 ± 0.169
HisHis: 0.551 ± 0.102
HisIle: 1.424 ± 0.155
HisLys: 1.424 ± 0.175
HisLeu: 1.234 ± 0.149
HisMet: 0.38 ± 0.093
HisAsn: 0.665 ± 0.104
HisPro: 0.703 ± 0.12
HisGln: 0.627 ± 0.103
HisArg: 0.684 ± 0.154
HisSer: 1.082 ± 0.14
HisThr: 1.025 ± 0.116
HisVal: 1.139 ± 0.125
HisTrp: 0.285 ± 0.069
HisTyr: 0.949 ± 0.135
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.842 ± 0.361
IleCys: 0.627 ± 0.087
IleAsp: 4.937 ± 0.284
IleGlu: 5.905 ± 0.406
IlePhe: 2.26 ± 0.192
IleGly: 3.532 ± 0.314
IleHis: 1.044 ± 0.131
IleIle: 4.234 ± 0.301
IleLys: 6.057 ± 0.412
IleLeu: 4.196 ± 0.307
IleMet: 1.804 ± 0.183
IleAsn: 4.177 ± 0.24
IlePro: 2.924 ± 0.24
IleGln: 2.62 ± 0.241
IleArg: 3.38 ± 0.229
IleSer: 4.709 ± 0.329
IleThr: 4.348 ± 0.276
IleVal: 4.633 ± 0.273
IleTrp: 0.608 ± 0.098
IleTyr: 2.449 ± 0.248
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.152 ± 0.449
LysCys: 0.797 ± 0.123
LysAsp: 4.5 ± 0.275
LysGlu: 5.26 ± 0.4
LysPhe: 3.266 ± 0.256
LysGly: 4.253 ± 0.279
LysHis: 1.728 ± 0.167
LysIle: 4.785 ± 0.259
LysLys: 4.367 ± 0.324
LysLeu: 6.019 ± 0.38
LysMet: 2.582 ± 0.224
LysAsn: 3.361 ± 0.221
LysPro: 2.355 ± 0.202
LysGln: 2.506 ± 0.24
LysArg: 3.665 ± 0.237
LysSer: 4.12 ± 0.282
LysThr: 4.006 ± 0.26
LysVal: 5.374 ± 0.304
LysTrp: 0.873 ± 0.11
LysTyr: 3.076 ± 0.246
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.133 ± 0.395
LeuCys: 0.854 ± 0.14
LeuAsp: 5.601 ± 0.35
LeuGlu: 5.393 ± 0.329
LeuPhe: 3.0 ± 0.243
LeuGly: 4.215 ± 0.277
LeuHis: 1.158 ± 0.16
LeuIle: 4.861 ± 0.329
LeuLys: 6.247 ± 0.305
LeuLeu: 4.443 ± 0.288
LeuMet: 2.317 ± 0.234
LeuAsn: 4.253 ± 0.33
LeuPro: 2.981 ± 0.21
LeuGln: 2.582 ± 0.24
LeuArg: 3.532 ± 0.25
LeuSer: 4.823 ± 0.253
LeuThr: 4.291 ± 0.335
LeuVal: 4.519 ± 0.237
LeuTrp: 0.779 ± 0.119
LeuTyr: 2.943 ± 0.244
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.696 ± 0.221
MetCys: 0.304 ± 0.079
MetAsp: 1.462 ± 0.156
MetGlu: 1.576 ± 0.179
MetPhe: 1.272 ± 0.146
MetGly: 1.557 ± 0.178
MetHis: 0.494 ± 0.121
MetIle: 2.222 ± 0.258
MetLys: 2.449 ± 0.245
MetLeu: 1.975 ± 0.215
MetMet: 0.741 ± 0.121
MetAsn: 1.785 ± 0.178
MetPro: 0.93 ± 0.111
MetGln: 1.044 ± 0.135
MetArg: 1.158 ± 0.141
MetSer: 1.861 ± 0.179
MetThr: 1.652 ± 0.153
MetVal: 1.671 ± 0.173
MetTrp: 0.228 ± 0.064
MetTyr: 1.196 ± 0.17
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.513 ± 0.234
AsnCys: 0.684 ± 0.11
AsnAsp: 2.772 ± 0.22
AsnGlu: 3.589 ± 0.287
AsnPhe: 2.639 ± 0.211
AsnGly: 4.101 ± 0.311
AsnHis: 1.063 ± 0.141
AsnIle: 3.665 ± 0.249
AsnLys: 3.703 ± 0.241
AsnLeu: 3.779 ± 0.349
AsnMet: 1.348 ± 0.162
AsnAsn: 2.544 ± 0.271
AsnPro: 2.146 ± 0.224
AsnGln: 1.88 ± 0.189
AsnArg: 2.089 ± 0.19
AsnSer: 3.152 ± 0.29
AsnThr: 3.019 ± 0.227
AsnVal: 3.171 ± 0.236
AsnTrp: 0.722 ± 0.123
AsnTyr: 1.842 ± 0.177
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.791 ± 0.222
ProCys: 0.342 ± 0.083
ProAsp: 2.146 ± 0.207
ProGlu: 3.209 ± 0.275
ProPhe: 1.462 ± 0.161
ProGly: 3.038 ± 0.285
ProHis: 0.589 ± 0.11
ProIle: 1.975 ± 0.204
ProLys: 2.525 ± 0.238
ProLeu: 2.108 ± 0.183
ProMet: 0.722 ± 0.122
ProAsn: 1.747 ± 0.171
ProPro: 0.949 ± 0.167
ProGln: 0.873 ± 0.139
ProArg: 1.481 ± 0.191
ProSer: 2.317 ± 0.206
ProThr: 1.88 ± 0.234
ProVal: 2.639 ± 0.232
ProTrp: 0.703 ± 0.121
ProTyr: 1.158 ± 0.15
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.943 ± 0.248
GlnCys: 0.342 ± 0.065
GlnAsp: 2.07 ± 0.247
GlnGlu: 2.26 ± 0.177
GlnPhe: 1.709 ± 0.191
GlnGly: 1.842 ± 0.175
GlnHis: 0.779 ± 0.13
GlnIle: 2.525 ± 0.255
GlnLys: 2.184 ± 0.238
GlnLeu: 2.943 ± 0.247
GlnMet: 1.082 ± 0.147
GlnAsn: 1.443 ± 0.171
GlnPro: 0.892 ± 0.142
GlnGln: 0.987 ± 0.139
GlnArg: 1.785 ± 0.205
GlnSer: 2.07 ± 0.18
GlnThr: 1.842 ± 0.176
GlnVal: 2.563 ± 0.211
GlnTrp: 0.741 ± 0.12
GlnTyr: 1.975 ± 0.188
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.152 ± 0.248
ArgCys: 0.627 ± 0.109
ArgAsp: 2.943 ± 0.211
ArgGlu: 3.152 ± 0.246
ArgPhe: 2.108 ± 0.238
ArgGly: 2.715 ± 0.177
ArgHis: 0.911 ± 0.135
ArgIle: 3.361 ± 0.247
ArgLys: 3.456 ± 0.24
ArgLeu: 3.57 ± 0.283
ArgMet: 1.367 ± 0.152
ArgAsn: 2.336 ± 0.199
ArgPro: 1.348 ± 0.177
ArgGln: 2.032 ± 0.191
ArgArg: 2.127 ± 0.228
ArgSer: 2.355 ± 0.2
ArgThr: 2.468 ± 0.247
ArgVal: 3.114 ± 0.227
ArgTrp: 0.873 ± 0.159
ArgTyr: 1.861 ± 0.212
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.576 ± 0.309
SerCys: 0.76 ± 0.109
SerAsp: 4.158 ± 0.273
SerGlu: 4.082 ± 0.277
SerPhe: 2.696 ± 0.218
SerGly: 4.747 ± 0.333
SerHis: 0.987 ± 0.139
SerIle: 4.329 ± 0.288
SerLys: 4.329 ± 0.282
SerLeu: 4.671 ± 0.314
SerMet: 1.557 ± 0.186
SerAsn: 3.0 ± 0.244
SerPro: 1.994 ± 0.217
SerGln: 1.956 ± 0.201
SerArg: 2.981 ± 0.232
SerSer: 4.709 ± 0.348
SerThr: 3.779 ± 0.297
SerVal: 3.836 ± 0.293
SerTrp: 0.892 ± 0.119
SerTyr: 2.506 ± 0.229
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.836 ± 0.313
ThrCys: 0.456 ± 0.09
ThrAsp: 3.133 ± 0.247
ThrGlu: 3.817 ± 0.263
ThrPhe: 2.639 ± 0.229
ThrGly: 4.196 ± 0.38
ThrHis: 1.367 ± 0.148
ThrIle: 3.987 ± 0.228
ThrLys: 3.513 ± 0.21
ThrLeu: 4.348 ± 0.329
ThrMet: 1.576 ± 0.163
ThrAsn: 2.962 ± 0.305
ThrPro: 2.355 ± 0.27
ThrGln: 1.975 ± 0.212
ThrArg: 2.696 ± 0.247
ThrSer: 3.779 ± 0.362
ThrThr: 3.38 ± 0.458
ThrVal: 4.576 ± 0.422
ThrTrp: 0.627 ± 0.097
ThrTyr: 2.449 ± 0.23
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.481 ± 0.272
ValCys: 0.684 ± 0.119
ValAsp: 4.5 ± 0.303
ValGlu: 5.525 ± 0.291
ValPhe: 2.639 ± 0.261
ValGly: 3.968 ± 0.3
ValHis: 0.968 ± 0.12
ValIle: 4.861 ± 0.291
ValLys: 5.488 ± 0.256
ValLeu: 4.994 ± 0.318
ValMet: 1.861 ± 0.155
ValAsn: 3.513 ± 0.244
ValPro: 2.411 ± 0.215
ValGln: 2.696 ± 0.223
ValArg: 3.323 ± 0.208
ValSer: 4.196 ± 0.279
ValThr: 3.608 ± 0.317
ValVal: 5.317 ± 0.336
ValTrp: 0.816 ± 0.119
ValTyr: 2.563 ± 0.214
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.949 ± 0.142
TrpCys: 0.19 ± 0.053
TrpAsp: 0.797 ± 0.104
TrpGlu: 0.816 ± 0.099
TrpPhe: 0.684 ± 0.138
TrpGly: 0.57 ± 0.102
TrpHis: 0.38 ± 0.088
TrpIle: 0.892 ± 0.152
TrpLys: 1.158 ± 0.167
TrpLeu: 1.196 ± 0.136
TrpMet: 0.494 ± 0.093
TrpAsn: 0.703 ± 0.117
TrpPro: 0.494 ± 0.091
TrpGln: 0.494 ± 0.091
TrpArg: 0.589 ± 0.092
TrpSer: 0.722 ± 0.111
TrpThr: 0.911 ± 0.135
TrpVal: 1.196 ± 0.166
TrpTrp: 0.19 ± 0.066
TrpTyr: 0.779 ± 0.117
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.81 ± 0.253
TyrCys: 0.703 ± 0.114
TyrAsp: 2.81 ± 0.251
TyrGlu: 2.241 ± 0.212
TyrPhe: 1.519 ± 0.178
TyrGly: 2.658 ± 0.244
TyrHis: 0.816 ± 0.13
TyrIle: 2.487 ± 0.172
TyrLys: 2.943 ± 0.237
TyrLeu: 3.0 ± 0.205
TyrMet: 1.082 ± 0.142
TyrAsn: 2.582 ± 0.229
TyrPro: 1.88 ± 0.2
TyrGln: 1.481 ± 0.184
TyrArg: 1.937 ± 0.22
TyrSer: 2.734 ± 0.22
TyrThr: 2.279 ± 0.198
TyrVal: 2.829 ± 0.291
TyrTrp: 0.703 ± 0.127
TyrTyr: 1.633 ± 0.216
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 283 proteins (52666 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
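A protein-level bootstrap of this kind can be sketched as follows. The sketch assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported ± value is the standard deviation across replicates; the page does not state the exact estimator, and the function names and fixed seed are hypothetical.

```python
import random
import statistics


def single_dipeptide_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs in all proteins."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level bootstrap replicates.

    Assumption: the error is the sample standard deviation of the replicates.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(single_dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)
```

Resampling at the protein level keeps each sequence intact, so the spread reflects variation between proteins rather than between individual residue pairs.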

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski