Amino acid dipepetide frequency for Klebsiella phage K11

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.765AlaAla: 8.765 ± 0.964
0.599AlaCys: 0.599 ± 0.243
6.368AlaAsp: 6.368 ± 0.75
5.169AlaGlu: 5.169 ± 0.653
3.147AlaPhe: 3.147 ± 0.488
7.117AlaGly: 7.117 ± 1.036
1.648AlaHis: 1.648 ± 0.237
4.12AlaIle: 4.12 ± 0.514
6.518AlaLys: 6.518 ± 0.74
8.016AlaLeu: 8.016 ± 0.866
3.072AlaMet: 3.072 ± 0.61
4.27AlaAsn: 4.27 ± 0.502
2.922AlaPro: 2.922 ± 0.425
3.371AlaGln: 3.371 ± 0.573
4.645AlaArg: 4.645 ± 0.614
5.619AlaSer: 5.619 ± 0.57
4.27AlaThr: 4.27 ± 0.596
5.394AlaVal: 5.394 ± 0.629
1.274AlaTrp: 1.274 ± 0.315
3.147AlaTyr: 3.147 ± 0.581
0.0AlaXaa: 0.0 ± 0.0
Cys
0.899CysAla: 0.899 ± 0.316
0.075CysCys: 0.075 ± 0.078
0.524CysAsp: 0.524 ± 0.261
0.599CysGlu: 0.599 ± 0.227
0.45CysPhe: 0.45 ± 0.209
0.375CysGly: 0.375 ± 0.159
0.375CysHis: 0.375 ± 0.171
0.45CysIle: 0.45 ± 0.195
0.599CysLys: 0.599 ± 0.181
0.974CysLeu: 0.974 ± 0.304
0.075CysMet: 0.075 ± 0.091
0.15CysAsn: 0.15 ± 0.111
0.524CysPro: 0.524 ± 0.178
0.674CysGln: 0.674 ± 0.241
0.599CysArg: 0.599 ± 0.268
1.199CysSer: 1.199 ± 0.429
0.674CysThr: 0.674 ± 0.272
0.824CysVal: 0.824 ± 0.206
0.15CysTrp: 0.15 ± 0.111
0.45CysTyr: 0.45 ± 0.197
0.0CysXaa: 0.0 ± 0.0
Asp
5.469AspAla: 5.469 ± 0.608
0.375AspCys: 0.375 ± 0.203
4.27AspAsp: 4.27 ± 0.432
3.821AspGlu: 3.821 ± 0.483
2.772AspPhe: 2.772 ± 0.436
6.218AspGly: 6.218 ± 0.586
0.824AspHis: 0.824 ± 0.231
2.847AspIle: 2.847 ± 0.389
3.896AspLys: 3.896 ± 0.734
3.971AspLeu: 3.971 ± 0.565
1.873AspMet: 1.873 ± 0.423
1.798AspAsn: 1.798 ± 0.303
2.772AspPro: 2.772 ± 0.478
2.697AspGln: 2.697 ± 0.422
3.072AspArg: 3.072 ± 0.459
3.971AspSer: 3.971 ± 0.496
4.195AspThr: 4.195 ± 0.417
3.896AspVal: 3.896 ± 0.438
0.674AspTrp: 0.674 ± 0.271
2.397AspTyr: 2.397 ± 0.391
0.0AspXaa: 0.0 ± 0.0
Glu
7.267GluAla: 7.267 ± 0.81
0.824GluCys: 0.824 ± 0.295
3.746GluAsp: 3.746 ± 0.543
5.094GluGlu: 5.094 ± 0.706
2.697GluPhe: 2.697 ± 0.414
5.469GluGly: 5.469 ± 0.651
1.573GluHis: 1.573 ± 0.418
2.472GluIle: 2.472 ± 0.414
3.147GluLys: 3.147 ± 0.518
5.469GluLeu: 5.469 ± 0.866
1.349GluMet: 1.349 ± 0.449
2.248GluAsn: 2.248 ± 0.419
2.397GluPro: 2.397 ± 0.528
2.847GluGln: 2.847 ± 0.799
4.195GluArg: 4.195 ± 0.658
4.12GluSer: 4.12 ± 0.561
2.922GluThr: 2.922 ± 0.454
4.57GluVal: 4.57 ± 0.549
0.824GluTrp: 0.824 ± 0.24
2.772GluTyr: 2.772 ± 0.415
0.0GluXaa: 0.0 ± 0.0
Phe
2.772PheAla: 2.772 ± 0.456
0.225PheCys: 0.225 ± 0.161
2.922PheAsp: 2.922 ± 0.379
1.723PheGlu: 1.723 ± 0.341
0.824PhePhe: 0.824 ± 0.206
3.147PheGly: 3.147 ± 0.617
0.674PheHis: 0.674 ± 0.277
1.723PheIle: 1.723 ± 0.437
2.248PheLys: 2.248 ± 0.346
2.847PheLeu: 2.847 ± 0.502
0.899PheMet: 0.899 ± 0.185
2.248PheAsn: 2.248 ± 0.383
1.498PhePro: 1.498 ± 0.334
1.423PheGln: 1.423 ± 0.293
1.798PheArg: 1.798 ± 0.425
2.322PheSer: 2.322 ± 0.326
2.547PheThr: 2.547 ± 0.359
2.847PheVal: 2.847 ± 0.497
0.3PheTrp: 0.3 ± 0.133
1.049PheTyr: 1.049 ± 0.193
0.0PheXaa: 0.0 ± 0.0
Gly
7.417GlyAla: 7.417 ± 0.945
0.824GlyCys: 0.824 ± 0.29
5.319GlyAsp: 5.319 ± 0.521
5.469GlyGlu: 5.469 ± 0.635
2.697GlyPhe: 2.697 ± 0.318
6.368GlyGly: 6.368 ± 0.842
1.274GlyHis: 1.274 ± 0.328
4.645GlyIle: 4.645 ± 0.77
5.544GlyLys: 5.544 ± 0.763
6.818GlyLeu: 6.818 ± 0.687
1.948GlyMet: 1.948 ± 0.38
3.521GlyAsn: 3.521 ± 0.543
1.049GlyPro: 1.049 ± 0.332
2.847GlyGln: 2.847 ± 0.397
4.12GlyArg: 4.12 ± 0.459
6.443GlySer: 6.443 ± 0.914
4.645GlyThr: 4.645 ± 0.596
4.945GlyVal: 4.945 ± 0.621
1.798GlyTrp: 1.798 ± 0.404
3.446GlyTyr: 3.446 ± 0.472
0.0GlyXaa: 0.0 ± 0.0
His
1.049HisAla: 1.049 ± 0.279
0.375HisCys: 0.375 ± 0.153
0.599HisAsp: 0.599 ± 0.168
1.723HisGlu: 1.723 ± 0.375
0.974HisPhe: 0.974 ± 0.237
1.124HisGly: 1.124 ± 0.34
0.45HisHis: 0.45 ± 0.192
1.274HisIle: 1.274 ± 0.359
1.049HisLys: 1.049 ± 0.286
1.423HisLeu: 1.423 ± 0.368
0.674HisMet: 0.674 ± 0.169
0.3HisAsn: 0.3 ± 0.149
0.674HisPro: 0.674 ± 0.197
0.524HisGln: 0.524 ± 0.2
0.524HisArg: 0.524 ± 0.171
1.049HisSer: 1.049 ± 0.286
0.974HisThr: 0.974 ± 0.221
1.573HisVal: 1.573 ± 0.298
0.225HisTrp: 0.225 ± 0.129
0.899HisTyr: 0.899 ± 0.172
0.0HisXaa: 0.0 ± 0.0
Ile
4.27IleAla: 4.27 ± 0.409
0.749IleCys: 0.749 ± 0.178
3.072IleAsp: 3.072 ± 0.45
3.147IleGlu: 3.147 ± 0.432
1.049IlePhe: 1.049 ± 0.263
3.971IleGly: 3.971 ± 0.496
0.824IleHis: 0.824 ± 0.262
2.173IleIle: 2.173 ± 0.455
2.922IleLys: 2.922 ± 0.399
3.746IleLeu: 3.746 ± 0.444
0.899IleMet: 0.899 ± 0.297
2.098IleAsn: 2.098 ± 0.535
2.397IlePro: 2.397 ± 0.52
1.873IleGln: 1.873 ± 0.451
3.221IleArg: 3.221 ± 0.471
3.296IleSer: 3.296 ± 0.483
2.697IleThr: 2.697 ± 0.469
3.521IleVal: 3.521 ± 0.528
0.599IleTrp: 0.599 ± 0.193
1.573IleTyr: 1.573 ± 0.357
0.0IleXaa: 0.0 ± 0.0
Lys
7.642LysAla: 7.642 ± 1.044
0.674LysCys: 0.674 ± 0.236
3.371LysAsp: 3.371 ± 0.478
5.019LysGlu: 5.019 ± 0.691
2.098LysPhe: 2.098 ± 0.342
6.068LysGly: 6.068 ± 0.951
1.423LysHis: 1.423 ± 0.31
2.173LysIle: 2.173 ± 0.336
3.371LysLys: 3.371 ± 0.667
5.544LysLeu: 5.544 ± 0.612
1.723LysMet: 1.723 ± 0.371
2.472LysAsn: 2.472 ± 0.382
2.472LysPro: 2.472 ± 0.416
2.397LysGln: 2.397 ± 0.367
3.671LysArg: 3.671 ± 0.747
3.221LysSer: 3.221 ± 0.524
3.221LysThr: 3.221 ± 0.419
5.169LysVal: 5.169 ± 0.758
0.674LysTrp: 0.674 ± 0.209
1.648LysTyr: 1.648 ± 0.381
0.0LysXaa: 0.0 ± 0.0
Leu
7.941LeuAla: 7.941 ± 1.114
0.375LeuCys: 0.375 ± 0.158
4.645LeuAsp: 4.645 ± 0.508
6.293LeuGlu: 6.293 ± 0.863
2.922LeuPhe: 2.922 ± 0.404
5.394LeuGly: 5.394 ± 0.568
1.349LeuHis: 1.349 ± 0.324
4.046LeuIle: 4.046 ± 0.585
6.143LeuLys: 6.143 ± 0.544
6.443LeuLeu: 6.443 ± 0.809
2.248LeuMet: 2.248 ± 0.298
3.971LeuAsn: 3.971 ± 0.456
3.446LeuPro: 3.446 ± 0.446
3.221LeuGln: 3.221 ± 0.487
5.619LeuArg: 5.619 ± 0.644
5.094LeuSer: 5.094 ± 0.585
5.019LeuThr: 5.019 ± 0.572
5.094LeuVal: 5.094 ± 0.611
1.498LeuTrp: 1.498 ± 0.377
2.697LeuTyr: 2.697 ± 0.466
0.0LeuXaa: 0.0 ± 0.0
Met
3.371MetAla: 3.371 ± 0.476
0.225MetCys: 0.225 ± 0.141
1.873MetAsp: 1.873 ± 0.365
1.274MetGlu: 1.274 ± 0.284
0.749MetPhe: 0.749 ± 0.228
1.573MetGly: 1.573 ± 0.343
0.524MetHis: 0.524 ± 0.194
0.974MetIle: 0.974 ± 0.254
1.423MetLys: 1.423 ± 0.276
2.547MetLeu: 2.547 ± 0.418
0.599MetMet: 0.599 ± 0.182
0.974MetAsn: 0.974 ± 0.204
0.824MetPro: 0.824 ± 0.186
1.948MetGln: 1.948 ± 0.483
1.124MetArg: 1.124 ± 0.272
1.498MetSer: 1.498 ± 0.336
1.948MetThr: 1.948 ± 0.436
1.573MetVal: 1.573 ± 0.317
0.075MetTrp: 0.075 ± 0.08
0.599MetTyr: 0.599 ± 0.215
0.0MetXaa: 0.0 ± 0.0
Asn
3.371AsnAla: 3.371 ± 0.565
0.599AsnCys: 0.599 ± 0.219
1.948AsnAsp: 1.948 ± 0.304
2.248AsnGlu: 2.248 ± 0.403
1.423AsnPhe: 1.423 ± 0.276
4.57AsnGly: 4.57 ± 0.587
0.3AsnHis: 0.3 ± 0.13
2.997AsnIle: 2.997 ± 0.563
2.472AsnLys: 2.472 ± 0.415
3.296AsnLeu: 3.296 ± 0.575
1.049AsnMet: 1.049 ± 0.311
1.723AsnAsn: 1.723 ± 0.495
2.098AsnPro: 2.098 ± 0.328
1.274AsnGln: 1.274 ± 0.214
1.948AsnArg: 1.948 ± 0.591
2.922AsnSer: 2.922 ± 0.562
2.472AsnThr: 2.472 ± 0.478
2.847AsnVal: 2.847 ± 0.456
0.674AsnTrp: 0.674 ± 0.249
1.798AsnTyr: 1.798 ± 0.372
0.0AsnXaa: 0.0 ± 0.0
Pro
2.772ProAla: 2.772 ± 0.467
0.45ProCys: 0.45 ± 0.197
2.023ProAsp: 2.023 ± 0.302
3.746ProGlu: 3.746 ± 0.718
1.423ProPhe: 1.423 ± 0.251
2.697ProGly: 2.697 ± 0.446
0.375ProHis: 0.375 ± 0.135
0.974ProIle: 0.974 ± 0.298
2.472ProLys: 2.472 ± 0.453
2.922ProLeu: 2.922 ± 0.306
0.674ProMet: 0.674 ± 0.265
2.397ProAsn: 2.397 ± 0.402
1.049ProPro: 1.049 ± 0.302
1.573ProGln: 1.573 ± 0.272
2.023ProArg: 2.023 ± 0.337
2.098ProSer: 2.098 ± 0.26
1.948ProThr: 1.948 ± 0.413
2.847ProVal: 2.847 ± 0.42
0.899ProTrp: 0.899 ± 0.197
1.423ProTyr: 1.423 ± 0.335
0.0ProXaa: 0.0 ± 0.0
Gln
3.296GlnAla: 3.296 ± 0.531
0.225GlnCys: 0.225 ± 0.119
2.847GlnAsp: 2.847 ± 0.273
2.697GlnGlu: 2.697 ± 0.422
1.573GlnPhe: 1.573 ± 0.292
2.847GlnGly: 2.847 ± 0.391
0.524GlnHis: 0.524 ± 0.203
1.948GlnIle: 1.948 ± 0.367
3.147GlnLys: 3.147 ± 0.466
4.12GlnLeu: 4.12 ± 0.51
1.573GlnMet: 1.573 ± 0.498
1.199GlnAsn: 1.199 ± 0.263
1.648GlnPro: 1.648 ± 0.241
3.296GlnGln: 3.296 ± 0.584
2.173GlnArg: 2.173 ± 0.454
2.397GlnSer: 2.397 ± 0.36
1.948GlnThr: 1.948 ± 0.412
2.847GlnVal: 2.847 ± 0.337
0.749GlnTrp: 0.749 ± 0.238
1.349GlnTyr: 1.349 ± 0.433
0.0GlnXaa: 0.0 ± 0.0
Arg
4.795ArgAla: 4.795 ± 0.675
0.824ArgCys: 0.824 ± 0.273
3.446ArgAsp: 3.446 ± 0.516
4.046ArgGlu: 4.046 ± 0.446
2.098ArgPhe: 2.098 ± 0.395
4.046ArgGly: 4.046 ± 0.524
0.749ArgHis: 0.749 ± 0.203
2.922ArgIle: 2.922 ± 0.525
3.971ArgLys: 3.971 ± 0.679
5.169ArgLeu: 5.169 ± 0.663
1.274ArgMet: 1.274 ± 0.247
2.697ArgAsn: 2.697 ± 0.369
1.873ArgPro: 1.873 ± 0.374
3.072ArgGln: 3.072 ± 0.414
2.547ArgArg: 2.547 ± 0.317
3.221ArgSer: 3.221 ± 0.411
3.072ArgThr: 3.072 ± 0.538
3.446ArgVal: 3.446 ± 0.561
1.049ArgTrp: 1.049 ± 0.304
1.274ArgTyr: 1.274 ± 0.238
0.0ArgXaa: 0.0 ± 0.0
Ser
5.469SerAla: 5.469 ± 0.873
0.824SerCys: 0.824 ± 0.278
4.645SerAsp: 4.645 ± 0.615
3.521SerGlu: 3.521 ± 0.536
2.997SerPhe: 2.997 ± 0.401
5.769SerGly: 5.769 ± 0.847
1.648SerHis: 1.648 ± 0.32
2.997SerIle: 2.997 ± 0.449
3.821SerLys: 3.821 ± 0.485
5.169SerLeu: 5.169 ± 0.711
1.274SerMet: 1.274 ± 0.355
2.098SerAsn: 2.098 ± 0.638
1.948SerPro: 1.948 ± 0.363
2.922SerGln: 2.922 ± 0.392
3.371SerArg: 3.371 ± 0.503
3.821SerSer: 3.821 ± 0.571
4.345SerThr: 4.345 ± 0.666
4.42SerVal: 4.42 ± 0.566
0.749SerTrp: 0.749 ± 0.211
2.397SerTyr: 2.397 ± 0.501
0.0SerXaa: 0.0 ± 0.0
Thr
4.72ThrAla: 4.72 ± 0.751
1.199ThrCys: 1.199 ± 0.314
3.221ThrAsp: 3.221 ± 0.337
3.446ThrGlu: 3.446 ± 0.511
2.173ThrPhe: 2.173 ± 0.371
4.645ThrGly: 4.645 ± 0.52
0.974ThrHis: 0.974 ± 0.218
3.671ThrIle: 3.671 ± 0.505
4.42ThrLys: 4.42 ± 0.651
5.319ThrLeu: 5.319 ± 0.693
1.573ThrMet: 1.573 ± 0.327
1.723ThrAsn: 1.723 ± 0.407
3.147ThrPro: 3.147 ± 0.625
2.098ThrGln: 2.098 ± 0.356
2.772ThrArg: 2.772 ± 0.464
3.896ThrSer: 3.896 ± 0.73
2.922ThrThr: 2.922 ± 0.698
4.046ThrVal: 4.046 ± 0.704
0.674ThrTrp: 0.674 ± 0.209
1.423ThrTyr: 1.423 ± 0.309
0.0ThrXaa: 0.0 ± 0.0
Val
5.244ValAla: 5.244 ± 0.646
0.45ValCys: 0.45 ± 0.179
3.596ValAsp: 3.596 ± 0.566
3.896ValGlu: 3.896 ± 0.464
2.248ValPhe: 2.248 ± 0.515
5.694ValGly: 5.694 ± 0.512
0.974ValHis: 0.974 ± 0.328
3.446ValIle: 3.446 ± 0.531
4.195ValLys: 4.195 ± 0.52
6.143ValLeu: 6.143 ± 0.768
1.199ValMet: 1.199 ± 0.28
3.446ValAsn: 3.446 ± 0.641
2.397ValPro: 2.397 ± 0.413
2.173ValGln: 2.173 ± 0.324
5.019ValArg: 5.019 ± 0.676
5.319ValSer: 5.319 ± 0.891
5.394ValThr: 5.394 ± 0.688
5.244ValVal: 5.244 ± 0.712
0.599ValTrp: 0.599 ± 0.272
2.472ValTyr: 2.472 ± 0.425
0.0ValXaa: 0.0 ± 0.0
Trp
0.599TrpAla: 0.599 ± 0.212
0.375TrpCys: 0.375 ± 0.171
0.749TrpAsp: 0.749 ± 0.231
0.974TrpGlu: 0.974 ± 0.239
0.524TrpPhe: 0.524 ± 0.18
0.674TrpGly: 0.674 ± 0.217
0.375TrpHis: 0.375 ± 0.204
0.599TrpIle: 0.599 ± 0.273
1.199TrpLys: 1.199 ± 0.3
1.274TrpLeu: 1.274 ± 0.373
0.375TrpMet: 0.375 ± 0.156
0.974TrpAsn: 0.974 ± 0.227
0.45TrpPro: 0.45 ± 0.18
0.674TrpGln: 0.674 ± 0.196
0.899TrpArg: 0.899 ± 0.214
1.199TrpSer: 1.199 ± 0.358
0.749TrpThr: 0.749 ± 0.251
1.349TrpVal: 1.349 ± 0.348
0.15TrpTrp: 0.15 ± 0.099
0.075TrpTyr: 0.075 ± 0.07
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.322TyrAla: 2.322 ± 0.371
0.375TyrCys: 0.375 ± 0.171
2.697TyrAsp: 2.697 ± 0.431
2.098TyrGlu: 2.098 ± 0.38
1.199TyrPhe: 1.199 ± 0.216
3.371TyrGly: 3.371 ± 0.479
0.599TyrHis: 0.599 ± 0.234
1.648TyrIle: 1.648 ± 0.511
1.498TyrLys: 1.498 ± 0.346
2.248TyrLeu: 2.248 ± 0.417
1.274TyrMet: 1.274 ± 0.333
1.723TyrAsn: 1.723 ± 0.408
1.199TyrPro: 1.199 ± 0.304
1.498TyrGln: 1.498 ± 0.446
2.248TyrArg: 2.248 ± 0.345
1.498TyrSer: 1.498 ± 0.34
2.098TyrThr: 2.098 ± 0.459
2.697TyrVal: 2.697 ± 0.535
0.599TyrTrp: 0.599 ± 0.22
0.824TyrTyr: 0.824 ± 0.305
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (13349 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski