Amino acid dipepetide frequency for Klebsiella phage ST16-OXA48phi5.2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.759AlaAla: 10.759 ± 0.797
0.947AlaCys: 0.947 ± 0.231
7.105AlaAsp: 7.105 ± 0.795
6.022AlaGlu: 6.022 ± 0.888
2.842AlaPhe: 2.842 ± 0.392
8.526AlaGly: 8.526 ± 0.677
1.962AlaHis: 1.962 ± 0.555
6.496AlaIle: 6.496 ± 0.601
4.872AlaLys: 4.872 ± 0.579
8.459AlaLeu: 8.459 ± 0.96
3.18AlaMet: 3.18 ± 0.503
3.857AlaAsn: 3.857 ± 0.513
2.233AlaPro: 2.233 ± 0.392
3.857AlaGln: 3.857 ± 0.46
5.278AlaArg: 5.278 ± 0.679
5.752AlaSer: 5.752 ± 0.62
5.887AlaThr: 5.887 ± 0.657
5.887AlaVal: 5.887 ± 0.566
1.421AlaTrp: 1.421 ± 0.316
2.774AlaTyr: 2.774 ± 0.554
0.0AlaXaa: 0.0 ± 0.0
Cys
1.624CysAla: 1.624 ± 0.31
0.203CysCys: 0.203 ± 0.132
0.88CysAsp: 0.88 ± 0.251
0.541CysGlu: 0.541 ± 0.216
0.338CysPhe: 0.338 ± 0.13
0.88CysGly: 0.88 ± 0.218
0.271CysHis: 0.271 ± 0.137
0.406CysIle: 0.406 ± 0.163
0.677CysLys: 0.677 ± 0.193
1.421CysLeu: 1.421 ± 0.319
0.406CysMet: 0.406 ± 0.164
0.474CysAsn: 0.474 ± 0.191
0.677CysPro: 0.677 ± 0.187
0.338CysGln: 0.338 ± 0.18
1.015CysArg: 1.015 ± 0.233
0.744CysSer: 0.744 ± 0.289
0.541CysThr: 0.541 ± 0.166
0.474CysVal: 0.474 ± 0.288
0.406CysTrp: 0.406 ± 0.176
0.609CysTyr: 0.609 ± 0.192
0.0CysXaa: 0.0 ± 0.0
Asp
6.09AspAla: 6.09 ± 0.697
0.677AspCys: 0.677 ± 0.207
3.722AspAsp: 3.722 ± 0.529
4.06AspGlu: 4.06 ± 0.554
2.774AspPhe: 2.774 ± 0.37
4.534AspGly: 4.534 ± 0.546
0.677AspHis: 0.677 ± 0.251
3.383AspIle: 3.383 ± 0.376
3.045AspLys: 3.045 ± 0.422
3.586AspLeu: 3.586 ± 0.494
1.353AspMet: 1.353 ± 0.259
2.639AspAsn: 2.639 ± 0.414
2.368AspPro: 2.368 ± 0.482
1.759AspGln: 1.759 ± 0.329
3.248AspArg: 3.248 ± 0.455
3.654AspSer: 3.654 ± 0.408
2.842AspThr: 2.842 ± 0.459
4.128AspVal: 4.128 ± 0.45
1.353AspTrp: 1.353 ± 0.338
1.962AspTyr: 1.962 ± 0.32
0.0AspXaa: 0.0 ± 0.0
Glu
5.752GluAla: 5.752 ± 0.711
0.947GluCys: 0.947 ± 0.246
2.91GluAsp: 2.91 ± 0.557
3.248GluGlu: 3.248 ± 0.659
2.639GluPhe: 2.639 ± 0.396
4.534GluGly: 4.534 ± 0.661
0.947GluHis: 0.947 ± 0.288
3.519GluIle: 3.519 ± 0.482
2.504GluLys: 2.504 ± 0.459
5.21GluLeu: 5.21 ± 0.594
1.895GluMet: 1.895 ± 0.378
2.842GluAsn: 2.842 ± 0.509
2.436GluPro: 2.436 ± 0.39
2.639GluGln: 2.639 ± 0.411
4.195GluArg: 4.195 ± 0.595
3.519GluSer: 3.519 ± 0.496
2.977GluThr: 2.977 ± 0.475
3.248GluVal: 3.248 ± 0.476
0.947GluTrp: 0.947 ± 0.25
1.759GluTyr: 1.759 ± 0.301
0.0GluXaa: 0.0 ± 0.0
Phe
2.571PheAla: 2.571 ± 0.453
0.338PheCys: 0.338 ± 0.143
2.301PheAsp: 2.301 ± 0.388
1.962PheGlu: 1.962 ± 0.42
1.15PhePhe: 1.15 ± 0.259
2.436PheGly: 2.436 ± 0.445
0.271PheHis: 0.271 ± 0.121
2.436PheIle: 2.436 ± 0.45
1.692PheLys: 1.692 ± 0.34
2.436PheLeu: 2.436 ± 0.419
0.88PheMet: 0.88 ± 0.271
2.098PheAsn: 2.098 ± 0.474
1.353PhePro: 1.353 ± 0.271
1.286PheGln: 1.286 ± 0.259
2.03PheArg: 2.03 ± 0.34
1.962PheSer: 1.962 ± 0.315
2.707PheThr: 2.707 ± 0.365
2.165PheVal: 2.165 ± 0.456
0.947PheTrp: 0.947 ± 0.251
1.015PheTyr: 1.015 ± 0.254
0.0PheXaa: 0.0 ± 0.0
Gly
7.173GlyAla: 7.173 ± 0.711
1.083GlyCys: 1.083 ± 0.254
4.466GlyAsp: 4.466 ± 0.75
3.586GlyGlu: 3.586 ± 0.405
3.18GlyPhe: 3.18 ± 0.538
4.94GlyGly: 4.94 ± 0.615
0.947GlyHis: 0.947 ± 0.264
4.804GlyIle: 4.804 ± 0.536
5.143GlyLys: 5.143 ± 0.608
6.631GlyLeu: 6.631 ± 0.677
2.436GlyMet: 2.436 ± 0.5
3.586GlyAsn: 3.586 ± 0.424
2.098GlyPro: 2.098 ± 0.354
2.774GlyGln: 2.774 ± 0.424
4.331GlyArg: 4.331 ± 0.552
4.331GlySer: 4.331 ± 0.718
4.601GlyThr: 4.601 ± 0.742
5.143GlyVal: 5.143 ± 0.517
1.624GlyTrp: 1.624 ± 0.382
2.639GlyTyr: 2.639 ± 0.418
0.0GlyXaa: 0.0 ± 0.0
His
1.015HisAla: 1.015 ± 0.251
0.406HisCys: 0.406 ± 0.173
0.744HisAsp: 0.744 ± 0.272
0.88HisGlu: 0.88 ± 0.381
0.474HisPhe: 0.474 ± 0.169
1.624HisGly: 1.624 ± 0.324
0.338HisHis: 0.338 ± 0.136
0.677HisIle: 0.677 ± 0.208
0.541HisLys: 0.541 ± 0.172
1.286HisLeu: 1.286 ± 0.278
0.338HisMet: 0.338 ± 0.139
0.406HisAsn: 0.406 ± 0.168
0.947HisPro: 0.947 ± 0.285
0.744HisGln: 0.744 ± 0.214
0.947HisArg: 0.947 ± 0.266
1.218HisSer: 1.218 ± 0.386
0.947HisThr: 0.947 ± 0.258
0.609HisVal: 0.609 ± 0.203
0.338HisTrp: 0.338 ± 0.144
0.677HisTyr: 0.677 ± 0.182
0.0HisXaa: 0.0 ± 0.0
Ile
4.534IleAla: 4.534 ± 0.488
0.88IleCys: 0.88 ± 0.302
3.248IleAsp: 3.248 ± 0.368
4.06IleGlu: 4.06 ± 0.586
2.03IlePhe: 2.03 ± 0.326
5.346IleGly: 5.346 ± 0.576
0.88IleHis: 0.88 ± 0.242
2.368IleIle: 2.368 ± 0.428
2.301IleLys: 2.301 ± 0.38
3.045IleLeu: 3.045 ± 0.452
1.624IleMet: 1.624 ± 0.314
2.707IleAsn: 2.707 ± 0.47
2.842IlePro: 2.842 ± 0.563
2.504IleGln: 2.504 ± 0.515
3.586IleArg: 3.586 ± 0.472
4.534IleSer: 4.534 ± 0.47
4.06IleThr: 4.06 ± 0.544
3.519IleVal: 3.519 ± 0.506
0.744IleTrp: 0.744 ± 0.216
1.556IleTyr: 1.556 ± 0.286
0.0IleXaa: 0.0 ± 0.0
Lys
5.21LysAla: 5.21 ± 0.751
0.947LysCys: 0.947 ± 0.343
2.368LysAsp: 2.368 ± 0.398
2.639LysGlu: 2.639 ± 0.423
1.489LysPhe: 1.489 ± 0.289
2.707LysGly: 2.707 ± 0.429
0.677LysHis: 0.677 ± 0.213
2.504LysIle: 2.504 ± 0.63
2.707LysLys: 2.707 ± 0.466
2.91LysLeu: 2.91 ± 0.354
1.556LysMet: 1.556 ± 0.428
2.639LysAsn: 2.639 ± 0.456
2.504LysPro: 2.504 ± 0.408
2.436LysGln: 2.436 ± 0.354
4.331LysArg: 4.331 ± 0.618
2.842LysSer: 2.842 ± 0.439
3.113LysThr: 3.113 ± 0.486
3.18LysVal: 3.18 ± 0.525
1.286LysTrp: 1.286 ± 0.302
1.556LysTyr: 1.556 ± 0.326
0.0LysXaa: 0.0 ± 0.0
Leu
8.594LeuAla: 8.594 ± 0.847
1.15LeuCys: 1.15 ± 0.26
4.128LeuAsp: 4.128 ± 0.595
4.94LeuGlu: 4.94 ± 0.619
2.03LeuPhe: 2.03 ± 0.405
5.481LeuGly: 5.481 ± 0.662
0.947LeuHis: 0.947 ± 0.346
4.804LeuIle: 4.804 ± 0.716
4.466LeuLys: 4.466 ± 0.666
6.428LeuLeu: 6.428 ± 0.59
2.098LeuMet: 2.098 ± 0.456
3.316LeuAsn: 3.316 ± 0.502
3.789LeuPro: 3.789 ± 0.464
3.045LeuGln: 3.045 ± 0.445
5.007LeuArg: 5.007 ± 0.717
6.564LeuSer: 6.564 ± 0.635
4.872LeuThr: 4.872 ± 0.554
4.466LeuVal: 4.466 ± 0.538
1.218LeuTrp: 1.218 ± 0.294
2.436LeuTyr: 2.436 ± 0.454
0.0LeuXaa: 0.0 ± 0.0
Met
3.248MetAla: 3.248 ± 0.589
0.068MetCys: 0.068 ± 0.07
0.947MetAsp: 0.947 ± 0.24
1.489MetGlu: 1.489 ± 0.311
0.406MetPhe: 0.406 ± 0.157
1.218MetGly: 1.218 ± 0.275
0.677MetHis: 0.677 ± 0.201
1.218MetIle: 1.218 ± 0.261
1.556MetLys: 1.556 ± 0.399
2.098MetLeu: 2.098 ± 0.474
0.947MetMet: 0.947 ± 0.356
1.218MetAsn: 1.218 ± 0.246
1.962MetPro: 1.962 ± 0.328
1.15MetGln: 1.15 ± 0.238
2.098MetArg: 2.098 ± 0.416
2.098MetSer: 2.098 ± 0.366
2.165MetThr: 2.165 ± 0.356
2.165MetVal: 2.165 ± 0.45
0.203MetTrp: 0.203 ± 0.11
0.677MetTyr: 0.677 ± 0.221
0.0MetXaa: 0.0 ± 0.0
Asn
4.195AsnAla: 4.195 ± 0.406
0.271AsnCys: 0.271 ± 0.172
1.895AsnAsp: 1.895 ± 0.35
1.692AsnGlu: 1.692 ± 0.335
1.759AsnPhe: 1.759 ± 0.328
3.519AsnGly: 3.519 ± 0.554
0.541AsnHis: 0.541 ± 0.176
1.962AsnIle: 1.962 ± 0.357
1.692AsnLys: 1.692 ± 0.431
3.316AsnLeu: 3.316 ± 0.409
1.083AsnMet: 1.083 ± 0.287
1.962AsnAsn: 1.962 ± 0.465
2.774AsnPro: 2.774 ± 0.449
2.233AsnGln: 2.233 ± 0.345
2.368AsnArg: 2.368 ± 0.368
2.707AsnSer: 2.707 ± 0.425
3.316AsnThr: 3.316 ± 0.458
2.842AsnVal: 2.842 ± 0.446
0.744AsnTrp: 0.744 ± 0.25
1.489AsnTyr: 1.489 ± 0.287
0.0AsnXaa: 0.0 ± 0.0
Pro
4.06ProAla: 4.06 ± 0.511
0.338ProCys: 0.338 ± 0.154
3.654ProAsp: 3.654 ± 0.492
3.248ProGlu: 3.248 ± 0.549
1.624ProPhe: 1.624 ± 0.281
3.113ProGly: 3.113 ± 0.498
0.812ProHis: 0.812 ± 0.239
2.098ProIle: 2.098 ± 0.416
1.895ProLys: 1.895 ± 0.356
3.992ProLeu: 3.992 ± 0.597
0.744ProMet: 0.744 ± 0.243
1.556ProAsn: 1.556 ± 0.33
1.962ProPro: 1.962 ± 0.407
1.692ProGln: 1.692 ± 0.324
1.624ProArg: 1.624 ± 0.269
2.707ProSer: 2.707 ± 0.397
3.18ProThr: 3.18 ± 0.458
3.654ProVal: 3.654 ± 0.645
0.609ProTrp: 0.609 ± 0.23
1.624ProTyr: 1.624 ± 0.279
0.0ProXaa: 0.0 ± 0.0
Gln
4.263GlnAla: 4.263 ± 0.623
0.541GlnCys: 0.541 ± 0.187
1.489GlnAsp: 1.489 ± 0.342
1.962GlnGlu: 1.962 ± 0.367
1.421GlnPhe: 1.421 ± 0.26
1.962GlnGly: 1.962 ± 0.392
0.744GlnHis: 0.744 ± 0.204
2.436GlnIle: 2.436 ± 0.459
2.504GlnLys: 2.504 ± 0.389
3.789GlnLeu: 3.789 ± 0.52
1.083GlnMet: 1.083 ± 0.305
1.692GlnAsn: 1.692 ± 0.399
1.421GlnPro: 1.421 ± 0.299
1.556GlnGln: 1.556 ± 0.288
2.098GlnArg: 2.098 ± 0.465
3.925GlnSer: 3.925 ± 0.484
2.977GlnThr: 2.977 ± 0.408
2.504GlnVal: 2.504 ± 0.45
1.083GlnTrp: 1.083 ± 0.309
1.556GlnTyr: 1.556 ± 0.384
0.0GlnXaa: 0.0 ± 0.0
Arg
5.007ArgAla: 5.007 ± 0.686
0.88ArgCys: 0.88 ± 0.222
3.316ArgAsp: 3.316 ± 0.529
4.195ArgGlu: 4.195 ± 0.634
1.895ArgPhe: 1.895 ± 0.434
3.586ArgGly: 3.586 ± 0.495
1.286ArgHis: 1.286 ± 0.306
4.263ArgIle: 4.263 ± 0.647
4.398ArgLys: 4.398 ± 0.648
4.94ArgLeu: 4.94 ± 0.577
2.504ArgMet: 2.504 ± 0.385
2.301ArgAsn: 2.301 ± 0.453
2.774ArgPro: 2.774 ± 0.436
3.113ArgGln: 3.113 ± 0.476
4.737ArgArg: 4.737 ± 0.768
3.789ArgSer: 3.789 ± 0.519
2.774ArgThr: 2.774 ± 0.469
3.18ArgVal: 3.18 ± 0.396
1.218ArgTrp: 1.218 ± 0.288
1.421ArgTyr: 1.421 ± 0.316
0.0ArgXaa: 0.0 ± 0.0
Ser
6.834SerAla: 6.834 ± 0.628
0.677SerCys: 0.677 ± 0.239
3.992SerAsp: 3.992 ± 0.438
4.872SerGlu: 4.872 ± 0.64
2.233SerPhe: 2.233 ± 0.376
6.225SerGly: 6.225 ± 0.961
1.083SerHis: 1.083 ± 0.266
3.248SerIle: 3.248 ± 0.433
2.842SerLys: 2.842 ± 0.349
5.684SerLeu: 5.684 ± 0.643
1.556SerMet: 1.556 ± 0.349
2.368SerAsn: 2.368 ± 0.369
3.18SerPro: 3.18 ± 0.416
2.91SerGln: 2.91 ± 0.532
3.519SerArg: 3.519 ± 0.539
3.925SerSer: 3.925 ± 0.499
3.316SerThr: 3.316 ± 0.421
5.819SerVal: 5.819 ± 0.792
0.812SerTrp: 0.812 ± 0.219
1.556SerTyr: 1.556 ± 0.319
0.0SerXaa: 0.0 ± 0.0
Thr
6.225ThrAla: 6.225 ± 0.688
0.677ThrCys: 0.677 ± 0.236
3.722ThrAsp: 3.722 ± 0.538
3.113ThrGlu: 3.113 ± 0.434
2.301ThrPhe: 2.301 ± 0.394
6.834ThrGly: 6.834 ± 0.752
0.812ThrHis: 0.812 ± 0.24
3.654ThrIle: 3.654 ± 0.43
1.962ThrLys: 1.962 ± 0.338
5.346ThrLeu: 5.346 ± 0.632
0.947ThrMet: 0.947 ± 0.227
2.165ThrAsn: 2.165 ± 0.457
4.195ThrPro: 4.195 ± 0.562
2.233ThrGln: 2.233 ± 0.506
2.977ThrArg: 2.977 ± 0.441
3.654ThrSer: 3.654 ± 0.543
3.925ThrThr: 3.925 ± 0.495
5.887ThrVal: 5.887 ± 0.733
1.015ThrTrp: 1.015 ± 0.288
1.692ThrTyr: 1.692 ± 0.274
0.0ThrXaa: 0.0 ± 0.0
Val
6.699ValAla: 6.699 ± 0.691
1.015ValCys: 1.015 ± 0.258
4.195ValAsp: 4.195 ± 0.592
3.586ValGlu: 3.586 ± 0.432
1.759ValPhe: 1.759 ± 0.357
4.534ValGly: 4.534 ± 0.588
0.609ValHis: 0.609 ± 0.184
3.519ValIle: 3.519 ± 0.422
3.383ValLys: 3.383 ± 0.419
5.21ValLeu: 5.21 ± 0.688
1.556ValMet: 1.556 ± 0.268
2.842ValAsn: 2.842 ± 0.454
2.774ValPro: 2.774 ± 0.516
1.895ValGln: 1.895 ± 0.376
4.128ValArg: 4.128 ± 0.714
5.481ValSer: 5.481 ± 0.604
5.955ValThr: 5.955 ± 0.673
4.263ValVal: 4.263 ± 0.714
0.677ValTrp: 0.677 ± 0.255
2.233ValTyr: 2.233 ± 0.53
0.0ValXaa: 0.0 ± 0.0
Trp
1.624TrpAla: 1.624 ± 0.37
0.609TrpCys: 0.609 ± 0.196
1.015TrpAsp: 1.015 ± 0.243
1.353TrpGlu: 1.353 ± 0.286
0.88TrpPhe: 0.88 ± 0.207
0.744TrpGly: 0.744 ± 0.19
0.271TrpHis: 0.271 ± 0.133
0.947TrpIle: 0.947 ± 0.202
0.744TrpLys: 0.744 ± 0.264
1.962TrpLeu: 1.962 ± 0.295
0.744TrpMet: 0.744 ± 0.198
0.677TrpAsn: 0.677 ± 0.201
0.338TrpPro: 0.338 ± 0.155
0.812TrpGln: 0.812 ± 0.236
1.015TrpArg: 1.015 ± 0.249
1.421TrpSer: 1.421 ± 0.32
0.744TrpThr: 0.744 ± 0.211
1.083TrpVal: 1.083 ± 0.278
0.474TrpTrp: 0.474 ± 0.202
0.609TrpTyr: 0.609 ± 0.233
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.707TyrAla: 2.707 ± 0.386
0.271TyrCys: 0.271 ± 0.128
1.759TyrAsp: 1.759 ± 0.374
1.218TyrGlu: 1.218 ± 0.336
0.812TyrPhe: 0.812 ± 0.216
2.774TyrGly: 2.774 ± 0.363
0.406TyrHis: 0.406 ± 0.166
1.624TyrIle: 1.624 ± 0.289
0.677TyrLys: 0.677 ± 0.201
2.03TyrLeu: 2.03 ± 0.379
0.609TyrMet: 0.609 ± 0.22
1.015TyrAsn: 1.015 ± 0.25
1.556TyrPro: 1.556 ± 0.244
2.03TyrGln: 2.03 ± 0.46
3.045TyrArg: 3.045 ± 0.449
1.827TyrSer: 1.827 ± 0.373
2.436TyrThr: 2.436 ± 0.313
2.098TyrVal: 2.098 ± 0.387
0.947TyrTrp: 0.947 ± 0.228
1.015TyrTyr: 1.015 ± 0.226
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (14779 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski