Amino acid dipepetide frequency for Klebsiella phage KP15

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.058AlaAla: 4.058 ± 0.373
0.457AlaCys: 0.457 ± 0.092
3.766AlaAsp: 3.766 ± 0.26
4.662AlaGlu: 4.662 ± 0.368
2.833AlaPhe: 2.833 ± 0.216
3.857AlaGly: 3.857 ± 0.305
1.115AlaHis: 1.115 ± 0.147
5.155AlaIle: 5.155 ± 0.318
5.137AlaLys: 5.137 ± 0.331
5.777AlaLeu: 5.777 ± 0.333
2.121AlaMet: 2.121 ± 0.234
3.455AlaAsn: 3.455 ± 0.253
1.974AlaPro: 1.974 ± 0.203
2.431AlaGln: 2.431 ± 0.232
3.272AlaArg: 3.272 ± 0.276
3.638AlaSer: 3.638 ± 0.272
4.205AlaThr: 4.205 ± 0.368
4.735AlaVal: 4.735 ± 0.284
0.75AlaTrp: 0.75 ± 0.123
3.144AlaTyr: 3.144 ± 0.261
0.0AlaXaa: 0.0 ± 0.0
Cys
0.841CysAla: 0.841 ± 0.127
0.128CysCys: 0.128 ± 0.054
0.676CysAsp: 0.676 ± 0.111
0.804CysGlu: 0.804 ± 0.124
0.603CysPhe: 0.603 ± 0.114
0.969CysGly: 0.969 ± 0.16
0.256CysHis: 0.256 ± 0.064
0.622CysIle: 0.622 ± 0.103
0.969CysLys: 0.969 ± 0.146
0.969CysLeu: 0.969 ± 0.125
0.366CysMet: 0.366 ± 0.085
0.603CysAsn: 0.603 ± 0.117
0.53CysPro: 0.53 ± 0.12
0.292CysGln: 0.292 ± 0.09
0.567CysArg: 0.567 ± 0.094
0.695CysSer: 0.695 ± 0.121
0.713CysThr: 0.713 ± 0.091
0.75CysVal: 0.75 ± 0.13
0.128CysTrp: 0.128 ± 0.043
0.42CysTyr: 0.42 ± 0.09
0.0CysXaa: 0.0 ± 0.0
Asp
4.241AspAla: 4.241 ± 0.294
0.695AspCys: 0.695 ± 0.095
4.003AspAsp: 4.003 ± 0.275
4.972AspGlu: 4.972 ± 0.265
3.071AspPhe: 3.071 ± 0.233
4.515AspGly: 4.515 ± 0.352
0.932AspHis: 0.932 ± 0.113
4.917AspIle: 4.917 ± 0.359
4.351AspLys: 4.351 ± 0.312
5.155AspLeu: 5.155 ± 0.363
1.865AspMet: 1.865 ± 0.202
3.199AspAsn: 3.199 ± 0.229
2.907AspPro: 2.907 ± 0.241
1.682AspGln: 1.682 ± 0.182
2.961AspArg: 2.961 ± 0.212
3.748AspSer: 3.748 ± 0.274
3.93AspThr: 3.93 ± 0.277
4.278AspVal: 4.278 ± 0.294
1.261AspTrp: 1.261 ± 0.152
3.327AspTyr: 3.327 ± 0.261
0.0AspXaa: 0.0 ± 0.0
Glu
4.972GluAla: 4.972 ± 0.339
1.042GluCys: 1.042 ± 0.144
3.656GluAsp: 3.656 ± 0.263
5.393GluGlu: 5.393 ± 0.382
2.907GluPhe: 2.907 ± 0.255
3.638GluGly: 3.638 ± 0.256
1.536GluHis: 1.536 ± 0.178
5.448GluIle: 5.448 ± 0.321
5.356GluLys: 5.356 ± 0.365
5.905GluLeu: 5.905 ± 0.323
2.175GluMet: 2.175 ± 0.227
3.345GluAsn: 3.345 ± 0.262
1.444GluPro: 1.444 ± 0.154
2.322GluGln: 2.322 ± 0.23
3.217GluArg: 3.217 ± 0.243
4.131GluSer: 4.131 ± 0.283
4.205GluThr: 4.205 ± 0.292
4.406GluVal: 4.406 ± 0.287
1.079GluTrp: 1.079 ± 0.139
2.925GluTyr: 2.925 ± 0.248
0.0GluXaa: 0.0 ± 0.0
Phe
2.23PheAla: 2.23 ± 0.2
0.512PheCys: 0.512 ± 0.08
3.693PheAsp: 3.693 ± 0.261
2.998PheGlu: 2.998 ± 0.283
1.353PhePhe: 1.353 ± 0.2
2.98PheGly: 2.98 ± 0.21
0.567PheHis: 0.567 ± 0.104
2.706PheIle: 2.706 ± 0.221
3.126PheLys: 3.126 ± 0.233
2.651PheLeu: 2.651 ± 0.226
1.225PheMet: 1.225 ± 0.144
2.322PheAsn: 2.322 ± 0.202
1.353PhePro: 1.353 ± 0.191
1.261PheGln: 1.261 ± 0.156
1.755PheArg: 1.755 ± 0.162
2.468PheSer: 2.468 ± 0.24
2.815PheThr: 2.815 ± 0.219
3.126PheVal: 3.126 ± 0.271
0.64PheTrp: 0.64 ± 0.084
1.536PheTyr: 1.536 ± 0.206
0.0PheXaa: 0.0 ± 0.0
Gly
4.04GlyAla: 4.04 ± 0.45
0.951GlyCys: 0.951 ± 0.129
4.424GlyAsp: 4.424 ± 0.373
3.967GlyGlu: 3.967 ± 0.278
2.998GlyPhe: 2.998 ± 0.29
4.077GlyGly: 4.077 ± 0.74
1.042GlyHis: 1.042 ± 0.141
3.912GlyIle: 3.912 ± 0.224
4.588GlyLys: 4.588 ± 0.286
4.515GlyLeu: 4.515 ± 0.269
1.773GlyMet: 1.773 ± 0.199
3.601GlyAsn: 3.601 ± 0.345
0.859GlyPro: 0.859 ± 0.129
1.609GlyGln: 1.609 ± 0.193
2.98GlyArg: 2.98 ± 0.243
4.662GlySer: 4.662 ± 0.324
3.766GlyThr: 3.766 ± 0.417
4.917GlyVal: 4.917 ± 0.25
0.877GlyTrp: 0.877 ± 0.135
3.254GlyTyr: 3.254 ± 0.208
0.0GlyXaa: 0.0 ± 0.0
His
1.243HisAla: 1.243 ± 0.14
0.292HisCys: 0.292 ± 0.072
1.097HisAsp: 1.097 ± 0.176
1.17HisGlu: 1.17 ± 0.173
0.786HisPhe: 0.786 ± 0.119
1.024HisGly: 1.024 ± 0.143
0.384HisHis: 0.384 ± 0.091
1.316HisIle: 1.316 ± 0.154
1.079HisLys: 1.079 ± 0.176
1.389HisLeu: 1.389 ± 0.174
0.402HisMet: 0.402 ± 0.086
0.914HisAsn: 0.914 ± 0.13
0.987HisPro: 0.987 ± 0.126
0.622HisGln: 0.622 ± 0.096
0.859HisArg: 0.859 ± 0.131
1.042HisSer: 1.042 ± 0.155
0.75HisThr: 0.75 ± 0.117
1.152HisVal: 1.152 ± 0.14
0.292HisTrp: 0.292 ± 0.08
0.786HisTyr: 0.786 ± 0.127
0.0HisXaa: 0.0 ± 0.0
Ile
4.899IleAla: 4.899 ± 0.345
0.64IleCys: 0.64 ± 0.095
5.411IleAsp: 5.411 ± 0.308
4.643IleGlu: 4.643 ± 0.259
2.249IlePhe: 2.249 ± 0.219
3.766IleGly: 3.766 ± 0.315
1.207IleHis: 1.207 ± 0.158
3.875IleIle: 3.875 ± 0.303
5.082IleLys: 5.082 ± 0.33
4.058IleLeu: 4.058 ± 0.237
2.212IleMet: 2.212 ± 0.202
3.857IleAsn: 3.857 ± 0.272
2.833IlePro: 2.833 ± 0.215
2.322IleGln: 2.322 ± 0.165
3.875IleArg: 3.875 ± 0.265
3.949IleSer: 3.949 ± 0.294
4.588IleThr: 4.588 ± 0.277
5.155IleVal: 5.155 ± 0.303
0.713IleTrp: 0.713 ± 0.11
2.815IleTyr: 2.815 ± 0.237
0.0IleXaa: 0.0 ± 0.0
Lys
5.722LysAla: 5.722 ± 0.345
0.695LysCys: 0.695 ± 0.136
4.15LysAsp: 4.15 ± 0.254
5.466LysGlu: 5.466 ± 0.356
3.053LysPhe: 3.053 ± 0.267
4.314LysGly: 4.314 ± 0.305
1.444LysHis: 1.444 ± 0.172
4.753LysIle: 4.753 ± 0.265
4.863LysLys: 4.863 ± 0.327
5.704LysLeu: 5.704 ± 0.346
2.358LysMet: 2.358 ± 0.208
4.15LysAsn: 4.15 ± 0.259
2.578LysPro: 2.578 ± 0.246
2.943LysGln: 2.943 ± 0.244
3.693LysArg: 3.693 ± 0.302
3.62LysSer: 3.62 ± 0.278
4.296LysThr: 4.296 ± 0.244
4.826LysVal: 4.826 ± 0.295
0.987LysTrp: 0.987 ± 0.131
3.309LysTyr: 3.309 ± 0.271
0.0LysXaa: 0.0 ± 0.0
Leu
4.991LeuAla: 4.991 ± 0.296
1.024LeuCys: 1.024 ± 0.146
5.557LeuAsp: 5.557 ± 0.355
4.899LeuGlu: 4.899 ± 0.3
3.29LeuPhe: 3.29 ± 0.269
4.022LeuGly: 4.022 ± 0.238
1.353LeuHis: 1.353 ± 0.158
4.534LeuIle: 4.534 ± 0.252
5.612LeuLys: 5.612 ± 0.311
4.753LeuLeu: 4.753 ± 0.313
2.632LeuMet: 2.632 ± 0.237
4.259LeuAsn: 4.259 ± 0.271
3.29LeuPro: 3.29 ± 0.243
2.376LeuGln: 2.376 ± 0.239
4.095LeuArg: 4.095 ± 0.214
5.356LeuSer: 5.356 ± 0.31
4.424LeuThr: 4.424 ± 0.313
4.46LeuVal: 4.46 ± 0.304
0.622LeuTrp: 0.622 ± 0.105
3.546LeuTyr: 3.546 ± 0.248
0.0LeuXaa: 0.0 ± 0.0
Met
1.883MetAla: 1.883 ± 0.19
0.292MetCys: 0.292 ± 0.075
1.901MetAsp: 1.901 ± 0.194
1.791MetGlu: 1.791 ± 0.21
1.097MetPhe: 1.097 ± 0.14
1.627MetGly: 1.627 ± 0.17
0.512MetHis: 0.512 ± 0.109
2.322MetIle: 2.322 ± 0.207
2.852MetLys: 2.852 ± 0.261
2.194MetLeu: 2.194 ± 0.226
0.987MetMet: 0.987 ± 0.137
2.175MetAsn: 2.175 ± 0.21
0.786MetPro: 0.786 ± 0.111
1.28MetGln: 1.28 ± 0.158
1.207MetArg: 1.207 ± 0.135
1.901MetSer: 1.901 ± 0.194
1.664MetThr: 1.664 ± 0.155
1.664MetVal: 1.664 ± 0.164
0.366MetTrp: 0.366 ± 0.079
1.17MetTyr: 1.17 ± 0.164
0.0MetXaa: 0.0 ± 0.0
Asn
4.04AsnAla: 4.04 ± 0.29
0.603AsnCys: 0.603 ± 0.096
3.272AsnAsp: 3.272 ± 0.304
3.784AsnGlu: 3.784 ± 0.218
2.121AsnPhe: 2.121 ± 0.214
4.351AsnGly: 4.351 ± 0.261
1.207AsnHis: 1.207 ± 0.187
3.418AsnIle: 3.418 ± 0.243
3.382AsnLys: 3.382 ± 0.245
4.15AsnLeu: 4.15 ± 0.257
1.462AsnMet: 1.462 ± 0.154
3.053AsnAsn: 3.053 ± 0.261
2.742AsnPro: 2.742 ± 0.237
1.865AsnGln: 1.865 ± 0.209
2.724AsnArg: 2.724 ± 0.224
3.546AsnSer: 3.546 ± 0.276
3.382AsnThr: 3.382 ± 0.283
3.894AsnVal: 3.894 ± 0.292
0.311AsnTrp: 0.311 ± 0.077
1.828AsnTyr: 1.828 ± 0.188
0.0AsnXaa: 0.0 ± 0.0
Pro
2.139ProAla: 2.139 ± 0.218
0.457ProCys: 0.457 ± 0.089
3.254ProAsp: 3.254 ± 0.244
2.779ProGlu: 2.779 ± 0.229
1.554ProPhe: 1.554 ± 0.185
2.212ProGly: 2.212 ± 0.279
0.567ProHis: 0.567 ± 0.097
1.993ProIle: 1.993 ± 0.187
2.34ProLys: 2.34 ± 0.241
2.578ProLeu: 2.578 ± 0.225
0.676ProMet: 0.676 ± 0.116
1.718ProAsn: 1.718 ± 0.17
1.133ProPro: 1.133 ± 0.153
0.969ProGln: 0.969 ± 0.099
1.499ProArg: 1.499 ± 0.196
2.175ProSer: 2.175 ± 0.181
2.267ProThr: 2.267 ± 0.205
2.98ProVal: 2.98 ± 0.209
0.457ProTrp: 0.457 ± 0.081
1.536ProTyr: 1.536 ± 0.174
0.0ProXaa: 0.0 ± 0.0
Gln
2.413GlnAla: 2.413 ± 0.225
0.42GlnCys: 0.42 ± 0.096
1.682GlnAsp: 1.682 ± 0.186
2.047GlnGlu: 2.047 ± 0.215
1.316GlnPhe: 1.316 ± 0.16
1.645GlnGly: 1.645 ± 0.191
0.494GlnHis: 0.494 ± 0.086
2.76GlnIle: 2.76 ± 0.25
2.578GlnLys: 2.578 ± 0.294
2.87GlnLeu: 2.87 ± 0.214
0.932GlnMet: 0.932 ± 0.132
1.59GlnAsn: 1.59 ± 0.173
0.896GlnPro: 0.896 ± 0.129
1.389GlnGln: 1.389 ± 0.151
1.865GlnArg: 1.865 ± 0.196
1.718GlnSer: 1.718 ± 0.194
1.919GlnThr: 1.919 ± 0.165
2.285GlnVal: 2.285 ± 0.193
0.402GlnTrp: 0.402 ± 0.084
1.664GlnTyr: 1.664 ± 0.147
0.0GlnXaa: 0.0 ± 0.0
Arg
2.742ArgAla: 2.742 ± 0.218
0.548ArgCys: 0.548 ± 0.104
3.016ArgAsp: 3.016 ± 0.269
3.51ArgGlu: 3.51 ± 0.305
1.938ArgPhe: 1.938 ± 0.203
3.108ArgGly: 3.108 ± 0.258
0.64ArgHis: 0.64 ± 0.105
3.802ArgIle: 3.802 ± 0.263
3.4ArgLys: 3.4 ± 0.288
3.839ArgLeu: 3.839 ± 0.238
1.316ArgMet: 1.316 ± 0.145
3.144ArgAsn: 3.144 ± 0.214
1.462ArgPro: 1.462 ± 0.146
1.609ArgGln: 1.609 ± 0.182
2.139ArgArg: 2.139 ± 0.187
2.961ArgSer: 2.961 ± 0.254
2.431ArgThr: 2.431 ± 0.221
3.437ArgVal: 3.437 ± 0.285
0.914ArgTrp: 0.914 ± 0.147
2.047ArgTyr: 2.047 ± 0.198
0.0ArgXaa: 0.0 ± 0.0
Ser
3.693SerAla: 3.693 ± 0.249
0.713SerCys: 0.713 ± 0.12
3.949SerAsp: 3.949 ± 0.228
3.693SerGlu: 3.693 ± 0.262
2.669SerPhe: 2.669 ± 0.201
4.899SerGly: 4.899 ± 0.454
1.133SerHis: 1.133 ± 0.144
4.186SerIle: 4.186 ± 0.301
4.168SerLys: 4.168 ± 0.246
4.899SerLeu: 4.899 ± 0.365
1.773SerMet: 1.773 ± 0.2
2.998SerAsn: 2.998 ± 0.229
2.486SerPro: 2.486 ± 0.199
2.029SerGln: 2.029 ± 0.194
2.742SerArg: 2.742 ± 0.197
3.802SerSer: 3.802 ± 0.328
3.29SerThr: 3.29 ± 0.304
4.588SerVal: 4.588 ± 0.26
0.859SerTrp: 0.859 ± 0.123
2.395SerTyr: 2.395 ± 0.246
0.0SerXaa: 0.0 ± 0.0
Thr
3.967ThrAla: 3.967 ± 0.342
0.622ThrCys: 0.622 ± 0.124
3.455ThrAsp: 3.455 ± 0.31
3.711ThrGlu: 3.711 ± 0.258
2.45ThrPhe: 2.45 ± 0.206
4.351ThrGly: 4.351 ± 0.398
1.152ThrHis: 1.152 ± 0.133
4.186ThrIle: 4.186 ± 0.238
3.967ThrLys: 3.967 ± 0.265
5.192ThrLeu: 5.192 ± 0.347
1.389ThrMet: 1.389 ± 0.151
3.217ThrAsn: 3.217 ± 0.289
2.706ThrPro: 2.706 ± 0.236
1.791ThrGln: 1.791 ± 0.205
2.779ThrArg: 2.779 ± 0.255
3.583ThrSer: 3.583 ± 0.341
3.236ThrThr: 3.236 ± 0.273
4.735ThrVal: 4.735 ± 0.404
0.713ThrTrp: 0.713 ± 0.114
2.376ThrTyr: 2.376 ± 0.224
0.0ThrXaa: 0.0 ± 0.0
Val
4.534ValAla: 4.534 ± 0.315
1.06ValCys: 1.06 ± 0.155
4.936ValAsp: 4.936 ± 0.27
5.1ValGlu: 5.1 ± 0.288
2.779ValPhe: 2.779 ± 0.233
4.04ValGly: 4.04 ± 0.273
1.005ValHis: 1.005 ± 0.131
4.643ValIle: 4.643 ± 0.319
5.795ValLys: 5.795 ± 0.371
4.643ValLeu: 4.643 ± 0.32
2.011ValMet: 2.011 ± 0.161
3.967ValAsn: 3.967 ± 0.318
2.34ValPro: 2.34 ± 0.216
2.066ValGln: 2.066 ± 0.177
3.108ValArg: 3.108 ± 0.241
4.662ValSer: 4.662 ± 0.286
4.223ValThr: 4.223 ± 0.358
4.442ValVal: 4.442 ± 0.302
1.024ValTrp: 1.024 ± 0.136
3.528ValTyr: 3.528 ± 0.282
0.0ValXaa: 0.0 ± 0.0
Trp
0.932TrpAla: 0.932 ± 0.136
0.201TrpCys: 0.201 ± 0.056
0.932TrpAsp: 0.932 ± 0.139
0.768TrpGlu: 0.768 ± 0.122
0.676TrpPhe: 0.676 ± 0.105
0.676TrpGly: 0.676 ± 0.113
0.201TrpHis: 0.201 ± 0.062
0.713TrpIle: 0.713 ± 0.117
1.188TrpLys: 1.188 ± 0.144
0.987TrpLeu: 0.987 ± 0.126
0.64TrpMet: 0.64 ± 0.119
0.676TrpAsn: 0.676 ± 0.092
0.201TrpPro: 0.201 ± 0.056
0.53TrpGln: 0.53 ± 0.097
0.695TrpArg: 0.695 ± 0.121
0.567TrpSer: 0.567 ± 0.098
0.695TrpThr: 0.695 ± 0.105
0.896TrpVal: 0.896 ± 0.132
0.128TrpTrp: 0.128 ± 0.054
0.713TrpTyr: 0.713 ± 0.111
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.961TyrAla: 2.961 ± 0.219
0.548TyrCys: 0.548 ± 0.094
3.181TyrAsp: 3.181 ± 0.218
2.943TyrGlu: 2.943 ± 0.237
1.572TyrPhe: 1.572 ± 0.175
2.742TyrGly: 2.742 ± 0.233
0.841TyrHis: 0.841 ± 0.124
2.888TyrIle: 2.888 ± 0.238
3.089TyrLys: 3.089 ± 0.223
2.87TyrLeu: 2.87 ± 0.197
1.334TyrMet: 1.334 ± 0.154
2.888TyrAsn: 2.888 ± 0.224
1.773TyrPro: 1.773 ± 0.198
1.499TyrGln: 1.499 ± 0.151
1.993TyrArg: 1.993 ± 0.202
2.742TyrSer: 2.742 ± 0.22
2.779TyrThr: 2.779 ± 0.265
3.181TyrVal: 3.181 ± 0.207
0.494TyrTrp: 0.494 ± 0.091
1.81TyrTyr: 1.81 ± 0.202
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 258 proteins (54704 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski