Amino acid dipeptide frequency for Klebsiella virus 0507KN21

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally likely, each dipeptide would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
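The counting described above can be sketched in Python. This is a minimal illustration, not the code used to build this page: the function names `dipeptide_per_mille` and `classify` are hypothetical, the toy sequences are made up, sequences are assumed to use one-letter codes, and normalizing by the total number of overlapping dipeptides is an assumption. The over/under threshold is assumed to be the uniform expectation of 0.25% (2.5‰).

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Hypothetical helper: counts overlapping pairs within each sequence
    and normalizes by the total number of pairs (an assumption).
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2: "ACD" gives "AC" and "CD".
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 400 equally likely dipeptides.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


toy = ["AAAC", "ACA"]  # made-up sequences, not from this proteome
freqs = dipeptide_per_mille(toy)
```

With the table values, `classify(5.862)` (AlaAla) falls above the uniform expectation and `classify(0.11)` (CysCys) falls below it.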

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
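As a worked example of this unit conversion, the short Python sketch below converts a per mille value to a fraction and to a percentage, using the AlaAla entry from the table (5.862‰). The function names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_ala = 5.862  # AlaAla frequency from the table, in per mille
fraction_text = f"{per_mille_to_fraction(ala_ala):.6f}"
percent_text = f"{per_mille_to_percent(ala_ala):.4f}"
```

Here the AlaAla value corresponds to about 0.59%, which can be compared directly with the 0.25% uniform expectation.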

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.862 ± 0.376
AlaCys: 0.615 ± 0.104
AlaAsp: 4.501 ± 0.327
AlaGlu: 4.347 ± 0.37
AlaPhe: 2.569 ± 0.198
AlaGly: 5.225 ± 0.494
AlaHis: 1.076 ± 0.154
AlaIle: 4.216 ± 0.314
AlaLys: 3.952 ± 0.264
AlaLeu: 5.533 ± 0.332
AlaMet: 1.735 ± 0.209
AlaAsn: 3.271 ± 0.253
AlaPro: 2.679 ± 0.257
AlaGln: 2.415 ± 0.198
AlaArg: 3.513 ± 0.267
AlaSer: 4.325 ± 0.303
AlaThr: 3.623 ± 0.301
AlaVal: 5.05 ± 0.384
AlaTrp: 0.878 ± 0.161
AlaTyr: 2.723 ± 0.277
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.637 ± 0.117
CysCys: 0.11 ± 0.051
CysAsp: 0.768 ± 0.122
CysGlu: 0.944 ± 0.12
CysPhe: 0.351 ± 0.077
CysGly: 0.768 ± 0.123
CysHis: 0.285 ± 0.091
CysIle: 0.812 ± 0.125
CysLys: 0.593 ± 0.123
CysLeu: 0.637 ± 0.124
CysMet: 0.395 ± 0.102
CysAsn: 0.593 ± 0.109
CysPro: 0.395 ± 0.08
CysGln: 0.307 ± 0.081
CysArg: 0.549 ± 0.112
CysSer: 0.725 ± 0.125
CysThr: 0.483 ± 0.106
CysVal: 0.878 ± 0.154
CysTrp: 0.263 ± 0.075
CysTyr: 0.351 ± 0.09
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.225 ± 0.335
AspCys: 0.659 ± 0.13
AspAsp: 4.413 ± 0.357
AspGlu: 3.711 ± 0.277
AspPhe: 3.14 ± 0.246
AspGly: 4.984 ± 0.351
AspHis: 1.251 ± 0.16
AspIle: 4.216 ± 0.271
AspLys: 3.908 ± 0.325
AspLeu: 5.489 ± 0.367
AspMet: 2.086 ± 0.177
AspAsn: 3.162 ± 0.245
AspPro: 3.118 ± 0.257
AspGln: 1.976 ± 0.179
AspArg: 2.437 ± 0.238
AspSer: 3.842 ± 0.337
AspThr: 3.403 ± 0.278
AspVal: 4.325 ± 0.307
AspTrp: 1.251 ± 0.197
AspTyr: 3.403 ± 0.316
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.237 ± 0.34
GluCys: 0.593 ± 0.122
GluAsp: 4.018 ± 0.317
GluGlu: 4.435 ± 0.366
GluPhe: 3.008 ± 0.256
GluGly: 4.303 ± 0.348
GluHis: 1.427 ± 0.179
GluIle: 4.523 ± 0.302
GluLys: 3.798 ± 0.367
GluLeu: 5.972 ± 0.325
GluMet: 1.976 ± 0.182
GluAsn: 2.613 ± 0.218
GluPro: 1.932 ± 0.181
GluGln: 2.569 ± 0.219
GluArg: 3.359 ± 0.295
GluSer: 3.271 ± 0.297
GluThr: 3.711 ± 0.271
GluVal: 4.567 ± 0.355
GluTrp: 0.9 ± 0.113
GluTyr: 3.03 ± 0.266
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.371 ± 0.241
PheCys: 0.439 ± 0.094
PheAsp: 3.162 ± 0.219
PheGlu: 3.14 ± 0.248
PhePhe: 1.822 ± 0.215
PheGly: 3.206 ± 0.264
PheHis: 0.812 ± 0.114
PheIle: 2.437 ± 0.234
PheLys: 2.591 ± 0.205
PheLeu: 2.986 ± 0.255
PheMet: 1.537 ± 0.212
PheAsn: 2.766 ± 0.256
PhePro: 1.449 ± 0.192
PheGln: 1.647 ± 0.182
PheArg: 1.932 ± 0.219
PheSer: 3.315 ± 0.252
PheThr: 2.744 ± 0.238
PheVal: 2.81 ± 0.234
PheTrp: 0.79 ± 0.148
PheTyr: 1.691 ± 0.181
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.864 ± 0.383
GlyCys: 0.944 ± 0.145
GlyAsp: 3.996 ± 0.263
GlyGlu: 4.479 ± 0.298
GlyPhe: 3.052 ± 0.276
GlyGly: 5.072 ± 0.473
GlyHis: 1.208 ± 0.168
GlyIle: 5.05 ± 0.342
GlyLys: 5.379 ± 0.374
GlyLeu: 5.182 ± 0.349
GlyMet: 1.844 ± 0.203
GlyAsn: 3.271 ± 0.243
GlyPro: 1.295 ± 0.176
GlyGln: 2.832 ± 0.224
GlyArg: 3.14 ± 0.293
GlySer: 4.216 ± 0.403
GlyThr: 4.435 ± 0.441
GlyVal: 4.852 ± 0.406
GlyTrp: 1.361 ± 0.173
GlyTyr: 2.481 ± 0.237
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.812 ± 0.155
HisCys: 0.198 ± 0.07
HisAsp: 1.098 ± 0.177
HisGlu: 0.79 ± 0.112
HisPhe: 1.054 ± 0.151
HisGly: 1.142 ± 0.141
HisHis: 0.373 ± 0.074
HisIle: 1.317 ± 0.184
HisLys: 1.251 ± 0.188
HisLeu: 1.756 ± 0.163
HisMet: 0.659 ± 0.128
HisAsn: 0.9 ± 0.146
HisPro: 0.9 ± 0.134
HisGln: 0.79 ± 0.131
HisArg: 0.834 ± 0.126
HisSer: 1.01 ± 0.159
HisThr: 1.032 ± 0.176
HisVal: 1.208 ± 0.184
HisTrp: 0.285 ± 0.087
HisTyr: 0.79 ± 0.13
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.952 ± 0.313
IleCys: 0.746 ± 0.109
IleAsp: 4.764 ± 0.319
IleGlu: 4.699 ± 0.328
IlePhe: 1.735 ± 0.217
IleGly: 3.732 ± 0.31
IleHis: 1.032 ± 0.148
IleIle: 3.842 ± 0.336
IleLys: 4.04 ± 0.287
IleLeu: 4.479 ± 0.345
IleMet: 1.427 ± 0.156
IleAsn: 3.315 ± 0.245
IlePro: 3.535 ± 0.276
IleGln: 2.942 ± 0.244
IleArg: 2.876 ± 0.232
IleSer: 3.908 ± 0.348
IleThr: 4.194 ± 0.346
IleVal: 3.952 ± 0.268
IleTrp: 0.812 ± 0.13
IleTyr: 2.349 ± 0.218
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.369 ± 0.366
LysCys: 0.527 ± 0.123
LysAsp: 3.732 ± 0.289
LysGlu: 4.216 ± 0.383
LysPhe: 3.008 ± 0.217
LysGly: 3.93 ± 0.289
LysHis: 1.032 ± 0.164
LysIle: 3.996 ± 0.3
LysLys: 3.93 ± 0.372
LysLeu: 4.567 ± 0.273
LysMet: 2.459 ± 0.226
LysAsn: 2.766 ± 0.266
LysPro: 2.305 ± 0.237
LysGln: 2.591 ± 0.248
LysArg: 2.701 ± 0.269
LysSer: 4.325 ± 0.258
LysThr: 4.04 ± 0.294
LysVal: 4.347 ± 0.339
LysTrp: 1.032 ± 0.172
LysTyr: 2.393 ± 0.225
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.687 ± 0.347
LeuCys: 0.615 ± 0.117
LeuAsp: 5.423 ± 0.383
LeuGlu: 5.204 ± 0.342
LeuPhe: 3.447 ± 0.241
LeuGly: 5.05 ± 0.299
LeuHis: 1.405 ± 0.187
LeuIle: 3.952 ± 0.263
LeuLys: 6.016 ± 0.406
LeuLeu: 5.73 ± 0.399
LeuMet: 1.932 ± 0.217
LeuAsn: 4.633 ± 0.269
LeuPro: 3.359 ± 0.306
LeuGln: 2.788 ± 0.22
LeuArg: 3.974 ± 0.271
LeuSer: 5.752 ± 0.402
LeuThr: 4.457 ± 0.295
LeuVal: 5.665 ± 0.31
LeuTrp: 0.812 ± 0.139
LeuTyr: 3.118 ± 0.257
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.349 ± 0.2
MetCys: 0.373 ± 0.094
MetAsp: 1.537 ± 0.182
MetGlu: 1.471 ± 0.167
MetPhe: 1.493 ± 0.21
MetGly: 1.559 ± 0.165
MetHis: 0.417 ± 0.083
MetIle: 1.537 ± 0.178
MetLys: 2.283 ± 0.239
MetLeu: 2.283 ± 0.211
MetMet: 0.944 ± 0.15
MetAsn: 1.735 ± 0.198
MetPro: 0.922 ± 0.136
MetGln: 0.988 ± 0.127
MetArg: 1.756 ± 0.215
MetSer: 1.976 ± 0.19
MetThr: 1.581 ± 0.254
MetVal: 1.888 ± 0.181
MetTrp: 0.351 ± 0.084
MetTyr: 1.01 ± 0.121
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.128 ± 0.339
AsnCys: 0.659 ± 0.132
AsnAsp: 3.03 ± 0.248
AsnGlu: 2.481 ± 0.256
AsnPhe: 2.261 ± 0.205
AsnGly: 3.908 ± 0.285
AsnHis: 1.032 ± 0.171
AsnIle: 3.315 ± 0.298
AsnLys: 3.14 ± 0.254
AsnLeu: 3.798 ± 0.315
AsnMet: 1.603 ± 0.166
AsnAsn: 3.491 ± 0.289
AsnPro: 2.371 ± 0.212
AsnGln: 1.756 ± 0.208
AsnArg: 2.701 ± 0.245
AsnSer: 3.228 ± 0.283
AsnThr: 3.359 ± 0.257
AsnVal: 3.711 ± 0.297
AsnTrp: 0.725 ± 0.128
AsnTyr: 1.778 ± 0.198
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.096 ± 0.243
ProCys: 0.527 ± 0.094
ProAsp: 2.832 ± 0.258
ProGlu: 3.271 ± 0.267
ProPhe: 1.866 ± 0.204
ProGly: 2.459 ± 0.205
ProHis: 0.768 ± 0.122
ProIle: 2.13 ± 0.201
ProLys: 2.283 ± 0.274
ProLeu: 3.052 ± 0.263
ProMet: 0.746 ± 0.136
ProAsn: 1.8 ± 0.229
ProPro: 1.23 ± 0.219
ProGln: 1.471 ± 0.175
ProArg: 1.8 ± 0.226
ProSer: 2.481 ± 0.266
ProThr: 2.547 ± 0.227
ProVal: 2.898 ± 0.25
ProTrp: 0.571 ± 0.102
ProTyr: 1.471 ± 0.191
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.437 ± 0.265
GlnCys: 0.417 ± 0.084
GlnAsp: 2.327 ± 0.23
GlnGlu: 2.459 ± 0.3
GlnPhe: 2.239 ± 0.22
GlnGly: 2.042 ± 0.213
GlnHis: 0.746 ± 0.127
GlnIle: 2.525 ± 0.284
GlnLys: 2.152 ± 0.203
GlnLeu: 3.206 ± 0.295
GlnMet: 1.339 ± 0.176
GlnAsn: 1.756 ± 0.185
GlnPro: 1.23 ± 0.165
GlnGln: 1.647 ± 0.208
GlnArg: 1.976 ± 0.171
GlnSer: 2.327 ± 0.274
GlnThr: 2.437 ± 0.207
GlnVal: 2.459 ± 0.251
GlnTrp: 0.615 ± 0.113
GlnTyr: 1.647 ± 0.193
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.788 ± 0.202
ArgCys: 0.725 ± 0.13
ArgAsp: 2.898 ± 0.257
ArgGlu: 3.118 ± 0.25
ArgPhe: 2.064 ± 0.215
ArgGly: 3.162 ± 0.291
ArgHis: 1.01 ± 0.141
ArgIle: 3.008 ± 0.232
ArgLys: 2.613 ± 0.238
ArgLeu: 4.786 ± 0.361
ArgMet: 1.427 ± 0.182
ArgAsn: 2.525 ± 0.241
ArgPro: 1.669 ± 0.216
ArgGln: 1.998 ± 0.231
ArgArg: 2.635 ± 0.227
ArgSer: 3.14 ± 0.276
ArgThr: 2.327 ± 0.21
ArgVal: 3.249 ± 0.254
ArgTrp: 0.681 ± 0.141
ArgTyr: 2.393 ± 0.212
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.82 ± 0.361
SerCys: 0.461 ± 0.108
SerAsp: 3.864 ± 0.246
SerGlu: 3.491 ± 0.261
SerPhe: 2.766 ± 0.275
SerGly: 5.16 ± 0.417
SerHis: 1.098 ± 0.13
SerIle: 4.589 ± 0.332
SerLys: 3.535 ± 0.306
SerLeu: 5.621 ± 0.447
SerMet: 1.515 ± 0.195
SerAsn: 3.645 ± 0.279
SerPro: 2.525 ± 0.203
SerGln: 2.766 ± 0.268
SerArg: 2.876 ± 0.264
SerSer: 4.479 ± 0.375
SerThr: 4.062 ± 0.366
SerVal: 4.391 ± 0.302
SerTrp: 0.768 ± 0.127
SerTyr: 2.701 ± 0.23
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.194 ± 0.357
ThrCys: 0.549 ± 0.109
ThrAsp: 3.711 ± 0.319
ThrGlu: 3.403 ± 0.267
ThrPhe: 2.437 ± 0.228
ThrGly: 4.457 ± 0.413
ThrHis: 0.768 ± 0.139
ThrIle: 3.974 ± 0.325
ThrLys: 3.074 ± 0.26
ThrLeu: 4.589 ± 0.304
ThrMet: 1.559 ± 0.189
ThrAsn: 3.184 ± 0.284
ThrPro: 3.162 ± 0.25
ThrGln: 2.218 ± 0.193
ThrArg: 2.788 ± 0.249
ThrSer: 3.996 ± 0.382
ThrThr: 3.974 ± 0.417
ThrVal: 4.94 ± 0.404
ThrTrp: 0.922 ± 0.14
ThrTyr: 2.108 ± 0.261
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.545 ± 0.271
ValCys: 0.659 ± 0.107
ValAsp: 5.709 ± 0.383
ValGlu: 4.962 ± 0.304
ValPhe: 2.81 ± 0.258
ValGly: 4.172 ± 0.278
ValHis: 1.317 ± 0.156
ValIle: 4.062 ± 0.293
ValLys: 4.545 ± 0.389
ValLeu: 4.764 ± 0.338
ValMet: 1.756 ± 0.159
ValAsn: 3.842 ± 0.313
ValPro: 3.206 ± 0.311
ValGln: 2.393 ± 0.251
ValArg: 3.557 ± 0.269
ValSer: 4.545 ± 0.27
ValThr: 4.457 ± 0.425
ValVal: 5.928 ± 0.463
ValTrp: 1.098 ± 0.158
ValTyr: 2.942 ± 0.225
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.23 ± 0.17
TrpCys: 0.307 ± 0.08
TrpAsp: 1.12 ± 0.173
TrpGlu: 1.361 ± 0.181
TrpPhe: 0.834 ± 0.112
TrpGly: 0.856 ± 0.112
TrpHis: 0.22 ± 0.067
TrpIle: 0.615 ± 0.124
TrpLys: 0.834 ± 0.142
TrpLeu: 1.317 ± 0.198
TrpMet: 0.461 ± 0.111
TrpAsn: 0.725 ± 0.098
TrpPro: 0.351 ± 0.101
TrpGln: 0.329 ± 0.093
TrpArg: 0.922 ± 0.151
TrpSer: 0.746 ± 0.116
TrpThr: 0.746 ± 0.122
TrpVal: 1.251 ± 0.126
TrpTrp: 0.154 ± 0.058
TrpTyr: 0.505 ± 0.108
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.503 ± 0.237
TyrCys: 0.725 ± 0.117
TyrAsp: 3.096 ± 0.258
TyrGlu: 2.261 ± 0.227
TyrPhe: 1.691 ± 0.178
TyrGly: 2.766 ± 0.262
TyrHis: 0.922 ± 0.138
TyrIle: 2.218 ± 0.267
TyrLys: 2.349 ± 0.226
TyrLeu: 3.271 ± 0.269
TyrMet: 1.032 ± 0.13
TyrAsn: 2.481 ± 0.216
TyrPro: 1.669 ± 0.194
TyrGln: 1.559 ± 0.201
TyrArg: 1.888 ± 0.195
TyrSer: 2.525 ± 0.215
TyrThr: 2.393 ± 0.359
TyrVal: 2.92 ± 0.251
TyrTrp: 0.615 ± 0.133
TyrTyr: 1.778 ± 0.212
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 154 proteins (45547 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
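A protein-level bootstrap of this kind can be sketched as follows. The exact procedure used by Proteome-pI is not described on this page, so this is only an illustration under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 resamples, and the reported error is taken to be the standard deviation of those estimates. The names `pair_per_mille` and `bootstrap_error`, the `seed` parameter, and the toy sequences are all hypothetical.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over the given sequences."""
    total = 0
    hits = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples.

    Assumption: the "±" value is the standard deviation of the bootstrap
    estimates; the source does not state which statistic is reported.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy_proteome = ["AAAC", "ACA", "CCAA", "ACAC"]  # made-up sequences
error = bootstrap_error(toy_proteome, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins.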

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski