Amino acid dipeptide frequency for Klebsiella phage vB_KpnP_IME337

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
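As a minimal sketch of the calculation described above (not the Proteome-pI implementation), the following Python counts overlapping dipeptides in a set of protein sequences and reports each frequency in per mille, together with whether it lies above or below the uniform expectation of 1000/400 = 2.5‰. The function names, the toy sequences, and the choice to count dipeptides within each protein only (never across protein boundaries) are assumptions made for illustration; residues outside the 20 standard letters are mapped to X (Xaa).

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids
UNIFORM_PER_MILLE = 1000 / 400          # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: dipeptides are overlapping pairs counted within each protein;
    any residue outside the 20 standard letters is treated as 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def representation(freq_per_mille):
    """Classify a frequency relative to the uniform expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


# Toy input, not taken from the phage proteome.
toy = ["MAAALK", "MAAG"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1), representation(freqs[pair]))
```

Applied to the table values, `representation(14.253)` would return "over" for AlaAla and `representation(0.072)` would return "under" for HisHis.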

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
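The unit conversion can be illustrated with a short worked example in Python. It takes the AlaAla value from the table, converts it from per mille to a fraction and a percentage, and compares it with the uniform expectation of 1/400. The helper name and the "fold" ratio are introduced here for illustration only.

```python
UNIFORM_FRACTION = 1 / 400  # uniform expectation over the 400 standard dipeptides


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = 14.253  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = fraction * 100
fold = fraction / UNIFORM_FRACTION  # how many times the uniform expectation

print(f"fraction={fraction:.6f} percent={percent:.4f}% fold={fold:.2f}")
# prints: fraction=0.014253 percent=1.4253% fold=5.70
```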

For more information see sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.253 ± 1.347
AlaCys: 0.864 ± 0.247
AlaAsp: 6.119 ± 0.673
AlaGlu: 5.399 ± 0.587
AlaPhe: 3.023 ± 0.335
AlaGly: 7.918 ± 0.928
AlaHis: 1.512 ± 0.53
AlaIle: 5.039 ± 0.62
AlaLys: 5.543 ± 0.9
AlaLeu: 9.142 ± 0.659
AlaMet: 2.807 ± 0.393
AlaAsn: 3.239 ± 0.433
AlaPro: 3.743 ± 0.666
AlaGln: 5.039 ± 0.799
AlaArg: 5.975 ± 0.707
AlaSer: 5.255 ± 0.622
AlaThr: 4.967 ± 0.675
AlaVal: 6.766 ± 0.633
AlaTrp: 1.224 ± 0.28
AlaTyr: 4.175 ± 0.483
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.576 ± 0.256
CysCys: 0.36 ± 0.217
CysAsp: 0.36 ± 0.144
CysGlu: 0.432 ± 0.138
CysPhe: 0.288 ± 0.158
CysGly: 0.864 ± 0.273
CysHis: 0.288 ± 0.151
CysIle: 0.504 ± 0.229
CysLys: 0.432 ± 0.195
CysLeu: 0.936 ± 0.257
CysMet: 0.648 ± 0.209
CysAsn: 0.576 ± 0.207
CysPro: 0.504 ± 0.248
CysGln: 0.36 ± 0.183
CysArg: 0.864 ± 0.197
CysSer: 0.864 ± 0.219
CysThr: 0.864 ± 0.242
CysVal: 0.864 ± 0.253
CysTrp: 0.216 ± 0.134
CysTyr: 0.504 ± 0.159
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.91 ± 0.792
AspCys: 0.936 ± 0.284
AspAsp: 3.095 ± 0.427
AspGlu: 3.167 ± 0.431
AspPhe: 2.663 ± 0.401
AspGly: 4.319 ± 0.497
AspHis: 0.576 ± 0.196
AspIle: 3.095 ± 0.625
AspLys: 2.879 ± 0.575
AspLeu: 5.183 ± 0.483
AspMet: 2.232 ± 0.356
AspAsn: 3.167 ± 0.47
AspPro: 2.303 ± 0.387
AspGln: 1.584 ± 0.356
AspArg: 3.023 ± 0.51
AspSer: 4.823 ± 0.491
AspThr: 4.103 ± 0.512
AspVal: 3.815 ± 0.379
AspTrp: 1.152 ± 0.177
AspTyr: 2.807 ± 0.523
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.543 ± 0.813
GluCys: 0.432 ± 0.14
GluAsp: 2.951 ± 0.363
GluGlu: 4.319 ± 0.757
GluPhe: 2.303 ± 0.386
GluGly: 4.175 ± 0.734
GluHis: 2.375 ± 0.465
GluIle: 2.735 ± 0.434
GluLys: 2.088 ± 0.307
GluLeu: 4.967 ± 0.538
GluMet: 2.016 ± 0.298
GluAsn: 1.944 ± 0.359
GluPro: 1.584 ± 0.269
GluGln: 3.239 ± 0.59
GluArg: 3.455 ± 0.49
GluSer: 3.095 ± 0.529
GluThr: 2.663 ± 0.38
GluVal: 4.679 ± 0.535
GluTrp: 0.792 ± 0.197
GluTyr: 2.807 ± 0.416
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.735 ± 0.371
PheCys: 0.36 ± 0.169
PheAsp: 2.232 ± 0.373
PheGlu: 2.16 ± 0.388
PhePhe: 1.152 ± 0.226
PheGly: 2.088 ± 0.311
PheHis: 0.648 ± 0.179
PheIle: 1.08 ± 0.216
PheLys: 1.728 ± 0.388
PheLeu: 2.088 ± 0.37
PheMet: 0.504 ± 0.2
PheAsn: 1.512 ± 0.358
PhePro: 1.656 ± 0.33
PheGln: 1.224 ± 0.229
PheArg: 1.728 ± 0.351
PheSer: 1.872 ± 0.377
PheThr: 2.519 ± 0.484
PheVal: 1.656 ± 0.364
PheTrp: 0.36 ± 0.144
PheTyr: 1.656 ± 0.282
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.335 ± 0.587
GlyCys: 1.152 ± 0.334
GlyAsp: 5.111 ± 0.574
GlyGlu: 3.743 ± 0.469
GlyPhe: 2.519 ± 0.438
GlyGly: 4.823 ± 0.749
GlyHis: 1.152 ± 0.267
GlyIle: 4.391 ± 0.5
GlyLys: 4.319 ± 0.656
GlyLeu: 6.623 ± 0.526
GlyMet: 2.016 ± 0.427
GlyAsn: 3.959 ± 0.507
GlyPro: 1.872 ± 0.331
GlyGln: 3.167 ± 0.405
GlyArg: 4.607 ± 0.429
GlySer: 5.543 ± 0.577
GlyThr: 5.831 ± 0.87
GlyVal: 5.975 ± 0.859
GlyTrp: 0.936 ± 0.244
GlyTyr: 3.455 ± 0.565
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.368 ± 0.372
HisCys: 0.36 ± 0.189
HisAsp: 0.72 ± 0.171
HisGlu: 1.152 ± 0.309
HisPhe: 0.432 ± 0.127
HisGly: 2.303 ± 0.587
HisHis: 0.072 ± 0.07
HisIle: 0.864 ± 0.217
HisLys: 1.224 ± 0.355
HisLeu: 2.232 ± 0.353
HisMet: 0.504 ± 0.171
HisAsn: 0.576 ± 0.196
HisPro: 0.72 ± 0.309
HisGln: 0.648 ± 0.235
HisArg: 1.44 ± 0.303
HisSer: 0.792 ± 0.25
HisThr: 0.864 ± 0.258
HisVal: 1.152 ± 0.31
HisTrp: 0.288 ± 0.15
HisTyr: 0.72 ± 0.223
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.527 ± 0.548
IleCys: 0.36 ± 0.165
IleAsp: 3.095 ± 0.39
IleGlu: 2.663 ± 0.634
IlePhe: 0.72 ± 0.157
IleGly: 3.095 ± 0.44
IleHis: 0.648 ± 0.227
IleIle: 1.872 ± 0.298
IleLys: 3.239 ± 0.491
IleLeu: 4.031 ± 0.579
IleMet: 1.224 ± 0.22
IleAsn: 1.872 ± 0.303
IlePro: 2.16 ± 0.323
IleGln: 2.735 ± 0.39
IleArg: 2.735 ± 0.335
IleSer: 2.879 ± 0.417
IleThr: 2.735 ± 0.347
IleVal: 2.663 ± 0.372
IleTrp: 0.144 ± 0.126
IleTyr: 1.152 ± 0.202
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.759 ± 0.896
LysCys: 0.36 ± 0.156
LysAsp: 2.735 ± 0.413
LysGlu: 3.815 ± 0.424
LysPhe: 1.512 ± 0.373
LysGly: 3.455 ± 0.714
LysHis: 1.008 ± 0.291
LysIle: 1.44 ± 0.22
LysLys: 1.728 ± 0.397
LysLeu: 5.039 ± 0.632
LysMet: 1.152 ± 0.298
LysAsn: 1.296 ± 0.28
LysPro: 1.584 ± 0.327
LysGln: 3.599 ± 0.57
LysArg: 3.455 ± 0.508
LysSer: 3.671 ± 0.422
LysThr: 2.375 ± 0.404
LysVal: 3.599 ± 0.479
LysTrp: 1.008 ± 0.243
LysTyr: 1.44 ± 0.341
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.494 ± 0.737
LeuCys: 1.08 ± 0.28
LeuAsp: 6.91 ± 0.561
LeuGlu: 5.615 ± 0.688
LeuPhe: 2.951 ± 0.388
LeuGly: 6.335 ± 0.674
LeuHis: 1.872 ± 0.535
LeuIle: 3.887 ± 0.596
LeuLys: 3.527 ± 0.64
LeuLeu: 6.119 ± 0.578
LeuMet: 2.016 ± 0.336
LeuAsn: 3.527 ± 0.42
LeuPro: 2.879 ± 0.382
LeuGln: 3.959 ± 0.637
LeuArg: 5.831 ± 0.484
LeuSer: 5.471 ± 0.644
LeuThr: 4.607 ± 0.613
LeuVal: 6.191 ± 0.654
LeuTrp: 1.152 ± 0.302
LeuTyr: 3.527 ± 0.495
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.879 ± 0.573
MetCys: 0.144 ± 0.087
MetAsp: 1.872 ± 0.446
MetGlu: 1.296 ± 0.285
MetPhe: 0.864 ± 0.245
MetGly: 1.512 ± 0.292
MetHis: 0.792 ± 0.258
MetIle: 0.72 ± 0.235
MetLys: 0.792 ± 0.222
MetLeu: 3.167 ± 0.548
MetMet: 0.648 ± 0.25
MetAsn: 1.152 ± 0.3
MetPro: 1.08 ± 0.226
MetGln: 2.232 ± 0.417
MetArg: 1.8 ± 0.387
MetSer: 2.16 ± 0.429
MetThr: 1.008 ± 0.283
MetVal: 2.016 ± 0.325
MetTrp: 0.432 ± 0.169
MetTyr: 1.368 ± 0.432
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.455 ± 0.442
AsnCys: 0.576 ± 0.171
AsnAsp: 2.016 ± 0.404
AsnGlu: 1.368 ± 0.362
AsnPhe: 0.864 ± 0.223
AsnGly: 4.391 ± 0.505
AsnHis: 0.144 ± 0.108
AsnIle: 2.16 ± 0.313
AsnLys: 2.16 ± 0.297
AsnLeu: 3.239 ± 0.583
AsnMet: 1.584 ± 0.267
AsnAsn: 1.8 ± 0.326
AsnPro: 2.879 ± 0.441
AsnGln: 1.584 ± 0.336
AsnArg: 2.016 ± 0.356
AsnSer: 2.951 ± 0.476
AsnThr: 2.447 ± 0.345
AsnVal: 3.383 ± 0.304
AsnTrp: 0.72 ± 0.216
AsnTyr: 1.368 ± 0.397
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.247 ± 0.64
ProCys: 0.288 ± 0.137
ProAsp: 2.735 ± 0.405
ProGlu: 3.599 ± 0.417
ProPhe: 1.08 ± 0.232
ProGly: 2.375 ± 0.501
ProHis: 0.432 ± 0.217
ProIle: 2.016 ± 0.353
ProLys: 1.728 ± 0.372
ProLeu: 3.023 ± 0.43
ProMet: 0.936 ± 0.255
ProAsn: 1.224 ± 0.285
ProPro: 0.72 ± 0.223
ProGln: 1.44 ± 0.256
ProArg: 1.872 ± 0.344
ProSer: 2.735 ± 0.551
ProThr: 2.591 ± 0.332
ProVal: 2.303 ± 0.302
ProTrp: 0.576 ± 0.184
ProTyr: 1.224 ± 0.277
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.039 ± 0.899
GlnCys: 0.648 ± 0.247
GlnAsp: 2.951 ± 0.43
GlnGlu: 3.383 ± 0.448
GlnPhe: 1.368 ± 0.298
GlnGly: 2.951 ± 0.419
GlnHis: 1.368 ± 0.331
GlnIle: 0.936 ± 0.289
GlnLys: 3.023 ± 0.554
GlnLeu: 4.535 ± 0.574
GlnMet: 1.44 ± 0.256
GlnAsn: 2.16 ± 0.396
GlnPro: 1.872 ± 0.399
GlnGln: 2.879 ± 0.572
GlnArg: 2.879 ± 0.398
GlnSer: 2.519 ± 0.394
GlnThr: 1.656 ± 0.344
GlnVal: 2.447 ± 0.475
GlnTrp: 0.792 ± 0.318
GlnTyr: 2.375 ± 0.423
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.551 ± 0.987
ArgCys: 0.504 ± 0.209
ArgAsp: 3.311 ± 0.492
ArgGlu: 3.383 ± 0.324
ArgPhe: 2.016 ± 0.335
ArgGly: 4.319 ± 0.608
ArgHis: 1.008 ± 0.223
ArgIle: 3.167 ± 0.506
ArgLys: 2.951 ± 0.533
ArgLeu: 4.895 ± 0.521
ArgMet: 1.872 ± 0.344
ArgAsn: 2.735 ± 0.357
ArgPro: 1.584 ± 0.278
ArgGln: 2.591 ± 0.295
ArgArg: 4.607 ± 0.659
ArgSer: 2.879 ± 0.514
ArgThr: 3.455 ± 0.405
ArgVal: 3.959 ± 0.451
ArgTrp: 1.008 ± 0.271
ArgTyr: 2.447 ± 0.317
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.35 ± 0.902
SerCys: 0.864 ± 0.264
SerAsp: 4.031 ± 0.516
SerGlu: 2.735 ± 0.391
SerPhe: 1.872 ± 0.33
SerGly: 6.551 ± 0.876
SerHis: 0.648 ± 0.197
SerIle: 2.232 ± 0.448
SerLys: 3.671 ± 0.585
SerLeu: 4.751 ± 0.716
SerMet: 2.232 ± 0.364
SerAsn: 3.311 ± 0.526
SerPro: 2.519 ± 0.361
SerGln: 2.016 ± 0.385
SerArg: 2.807 ± 0.373
SerSer: 4.247 ± 0.65
SerThr: 4.751 ± 0.522
SerVal: 4.319 ± 0.339
SerTrp: 0.864 ± 0.215
SerTyr: 1.656 ± 0.446
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.903 ± 0.729
ThrCys: 0.504 ± 0.229
ThrAsp: 3.455 ± 0.451
ThrGlu: 2.519 ± 0.494
ThrPhe: 1.872 ± 0.429
ThrGly: 5.471 ± 0.753
ThrHis: 1.152 ± 0.321
ThrIle: 1.656 ± 0.376
ThrLys: 2.735 ± 0.418
ThrLeu: 5.111 ± 0.577
ThrMet: 1.44 ± 0.36
ThrAsn: 1.872 ± 0.371
ThrPro: 2.879 ± 0.278
ThrGln: 2.663 ± 0.364
ThrArg: 3.023 ± 0.421
ThrSer: 4.175 ± 0.528
ThrThr: 3.383 ± 0.439
ThrVal: 4.751 ± 0.531
ThrTrp: 1.08 ± 0.249
ThrTyr: 2.447 ± 0.336
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.975 ± 0.648
ValCys: 0.576 ± 0.219
ValAsp: 4.823 ± 0.518
ValGlu: 4.031 ± 0.636
ValPhe: 1.44 ± 0.274
ValGly: 6.623 ± 0.733
ValHis: 1.872 ± 0.343
ValIle: 3.383 ± 0.58
ValLys: 3.239 ± 0.495
ValLeu: 5.399 ± 0.628
ValMet: 1.728 ± 0.325
ValAsn: 2.879 ± 0.583
ValPro: 3.311 ± 0.488
ValGln: 3.311 ± 0.661
ValArg: 4.031 ± 0.411
ValSer: 4.751 ± 0.633
ValThr: 3.383 ± 0.502
ValVal: 5.759 ± 0.657
ValTrp: 0.936 ± 0.251
ValTyr: 2.519 ± 0.416
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.152 ± 0.28
TrpCys: 0.288 ± 0.143
TrpAsp: 0.936 ± 0.224
TrpGlu: 1.08 ± 0.272
TrpPhe: 0.648 ± 0.239
TrpGly: 0.792 ± 0.218
TrpHis: 0.36 ± 0.13
TrpIle: 0.504 ± 0.183
TrpLys: 0.504 ± 0.19
TrpLeu: 1.584 ± 0.221
TrpMet: 0.072 ± 0.072
TrpAsn: 0.72 ± 0.251
TrpPro: 0.432 ± 0.155
TrpGln: 0.576 ± 0.209
TrpArg: 0.72 ± 0.22
TrpSer: 0.936 ± 0.277
TrpThr: 1.008 ± 0.21
TrpVal: 1.512 ± 0.334
TrpTrp: 0.36 ± 0.163
TrpTyr: 0.72 ± 0.216
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.951 ± 0.505
TyrCys: 0.648 ± 0.243
TyrAsp: 2.375 ± 0.4
TyrGlu: 2.303 ± 0.46
TyrPhe: 1.368 ± 0.229
TyrGly: 3.167 ± 0.629
TyrHis: 0.504 ± 0.172
TyrIle: 1.872 ± 0.351
TyrLys: 2.519 ± 0.459
TyrLeu: 3.887 ± 0.412
TyrMet: 0.792 ± 0.255
TyrAsn: 1.584 ± 0.28
TyrPro: 0.864 ± 0.185
TyrGln: 2.447 ± 0.36
TyrArg: 2.375 ± 0.425
TyrSer: 2.807 ± 0.425
TyrThr: 2.879 ± 0.406
TyrVal: 2.16 ± 0.491
TyrTrp: 0.864 ± 0.273
TyrTyr: 1.872 ± 0.443
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (13893 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
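One way such an error could be estimated is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the spread across replicates is reported. This is an assumption about the general procedure, not the Proteome-pI code; the function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are all illustrative.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement and return (mean, std dev)."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    values = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))  # protein-level resampling
        values.append(pair_per_mille(sample, pair))
    return statistics.mean(values), statistics.stdev(values)


# Toy proteome, not the phage sequences.
toy = ["MAAALK", "MAAG", "MKLAAV", "MGGSAA"]
mean, err = bootstrap_error(toy, "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every replicate.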

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski