Amino acid dipeptide frequency for Klebsiella phage KN3-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
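As a rough illustration of what such a calculation involves, here is a minimal Python sketch. It is not the database's own code: the function name `dipeptide_frequencies`, the use of one-letter residue codes, and the choice to normalise by the total number of overlapping pairs are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: sequences use one-letter codes, pairs overlap
    (ACD -> AC, CD) and are never counted across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Every position except the last starts one dipeptide.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumed normalisation: share of all observed pairs, scaled to per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    toy_proteome = ["MAAL", "MLAK"]  # hypothetical toy sequences
    for pair, value in sorted(dipeptide_frequencies(toy_proteome).items()):
        print(pair, round(value, 3))
```

Counting within each protein separately avoids creating artificial dipeptides from the last residue of one protein and the first residue of the next.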

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
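The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names `permille_to_fraction` and `classify` are hypothetical, and the threshold of 1/400 simply restates the 0.25% expectation given above; how the page itself picks the colours is not stated, so treat this as an assumption.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille (2.5).
EXPECTED_PERMILLE = 1000.0 / 400


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PERMILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"   # would be marked red
    if value < expected:
        return "under"  # would be marked blue
    return "expected"


if __name__ == "__main__":
    # Two values taken from the table: AlaAla and AlaCys.
    for name, value in [("AlaAla", 7.773), ("AlaCys", 0.893)]:
        print(name, permille_to_fraction(value), classify(value))
```

With 441 possibilities (including Xaa) the uniform expectation would instead be 1000/441, roughly 2.27‰, which can be passed as the `expected` argument.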

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.773 ± 1.302
AlaCys: 0.893 ± 0.355
AlaAsp: 5.718 ± 0.846
AlaGlu: 5.272 ± 0.749
AlaPhe: 3.663 ± 0.491
AlaGly: 7.327 ± 1.092
AlaHis: 1.698 ± 0.373
AlaIle: 3.663 ± 0.57
AlaLys: 5.361 ± 0.65
AlaLeu: 8.488 ± 1.108
AlaMet: 3.217 ± 0.618
AlaAsn: 4.378 ± 0.41
AlaPro: 2.77 ± 0.429
AlaGln: 3.574 ± 0.709
AlaArg: 4.914 ± 0.689
AlaSer: 4.736 ± 0.616
AlaThr: 3.931 ± 0.511
AlaVal: 5.986 ± 0.741
AlaTrp: 1.072 ± 0.391
AlaTyr: 2.591 ± 0.441
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.715 ± 0.216
CysCys: 0.268 ± 0.163
CysAsp: 0.715 ± 0.302
CysGlu: 0.715 ± 0.307
CysPhe: 0.268 ± 0.183
CysGly: 0.893 ± 0.299
CysHis: 0.357 ± 0.161
CysIle: 0.536 ± 0.174
CysLys: 0.447 ± 0.169
CysLeu: 0.983 ± 0.344
CysMet: 0.089 ± 0.097
CysAsn: 0.268 ± 0.148
CysPro: 0.625 ± 0.213
CysGln: 0.715 ± 0.264
CysArg: 0.983 ± 0.393
CysSer: 0.893 ± 0.328
CysThr: 0.357 ± 0.186
CysVal: 0.715 ± 0.213
CysTrp: 0.268 ± 0.172
CysTyr: 0.447 ± 0.262
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.718 ± 0.681
AspCys: 0.536 ± 0.238
AspAsp: 4.467 ± 0.507
AspGlu: 3.395 ± 0.479
AspPhe: 3.395 ± 0.665
AspGly: 6.254 ± 0.65
AspHis: 1.34 ± 0.264
AspIle: 2.77 ± 0.34
AspLys: 4.11 ± 0.733
AspLeu: 3.663 ± 0.588
AspMet: 1.787 ± 0.362
AspAsn: 2.234 ± 0.353
AspPro: 3.127 ± 0.529
AspGln: 2.68 ± 0.474
AspArg: 3.217 ± 0.539
AspSer: 3.485 ± 0.438
AspThr: 3.842 ± 0.557
AspVal: 5.093 ± 0.813
AspTrp: 0.536 ± 0.233
AspTyr: 1.876 ± 0.385
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.701 ± 1.01
GluCys: 0.536 ± 0.194
GluAsp: 4.199 ± 0.781
GluGlu: 5.629 ± 1.276
GluPhe: 2.502 ± 0.474
GluGly: 5.54 ± 0.847
GluHis: 1.698 ± 0.534
GluIle: 2.591 ± 0.424
GluLys: 2.859 ± 0.617
GluLeu: 6.254 ± 0.882
GluMet: 1.072 ± 0.332
GluAsn: 2.144 ± 0.321
GluPro: 2.055 ± 0.579
GluGln: 3.485 ± 0.757
GluArg: 3.485 ± 0.608
GluSer: 4.11 ± 0.588
GluThr: 2.949 ± 0.52
GluVal: 4.11 ± 0.662
GluTrp: 0.536 ± 0.227
GluTyr: 2.77 ± 0.332
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.038 ± 0.429
PheCys: 0.536 ± 0.225
PheAsp: 2.859 ± 0.648
PheGlu: 2.412 ± 0.521
PhePhe: 0.625 ± 0.23
PheGly: 3.038 ± 0.571
PheHis: 1.072 ± 0.313
PheIle: 1.608 ± 0.414
PheLys: 2.055 ± 0.318
PheLeu: 2.949 ± 0.462
PheMet: 1.162 ± 0.293
PheAsn: 1.787 ± 0.471
PhePro: 1.519 ± 0.368
PheGln: 1.43 ± 0.495
PheArg: 1.34 ± 0.405
PheSer: 2.144 ± 0.432
PheThr: 2.77 ± 0.695
PheVal: 2.144 ± 0.436
PheTrp: 0.268 ± 0.13
PheTyr: 0.893 ± 0.185
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.701 ± 1.155
GlyCys: 1.072 ± 0.309
GlyAsp: 5.272 ± 0.592
GlyGlu: 5.182 ± 0.692
GlyPhe: 2.949 ± 0.324
GlyGly: 5.897 ± 0.89
GlyHis: 1.787 ± 0.477
GlyIle: 5.361 ± 0.888
GlyLys: 5.45 ± 0.773
GlyLeu: 6.791 ± 0.846
GlyMet: 1.698 ± 0.515
GlyAsn: 3.217 ± 0.504
GlyPro: 1.34 ± 0.448
GlyGln: 3.306 ± 0.683
GlyArg: 4.557 ± 0.557
GlySer: 5.986 ± 0.818
GlyThr: 5.54 ± 0.854
GlyVal: 5.182 ± 0.745
GlyTrp: 1.251 ± 0.348
GlyTyr: 3.306 ± 0.553
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.608 ± 0.4
HisCys: 0.357 ± 0.149
HisAsp: 0.893 ± 0.295
HisGlu: 1.43 ± 0.339
HisPhe: 0.804 ± 0.283
HisGly: 1.608 ± 0.369
HisHis: 0.804 ± 0.245
HisIle: 0.536 ± 0.183
HisLys: 0.893 ± 0.235
HisLeu: 1.876 ± 0.498
HisMet: 0.536 ± 0.2
HisAsn: 0.536 ± 0.241
HisPro: 0.715 ± 0.208
HisGln: 0.893 ± 0.298
HisArg: 1.43 ± 0.622
HisSer: 1.072 ± 0.297
HisThr: 1.162 ± 0.288
HisVal: 2.234 ± 0.577
HisTrp: 0.268 ± 0.133
HisTyr: 1.072 ± 0.213
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.11 ± 0.551
IleCys: 0.536 ± 0.176
IleAsp: 3.217 ± 0.616
IleGlu: 3.038 ± 0.446
IlePhe: 0.625 ± 0.28
IleGly: 3.395 ± 0.637
IleHis: 0.893 ± 0.308
IleIle: 2.949 ± 0.695
IleLys: 2.949 ± 0.525
IleLeu: 3.574 ± 0.458
IleMet: 1.072 ± 0.335
IleAsn: 1.966 ± 0.577
IlePro: 2.412 ± 0.429
IleGln: 2.144 ± 0.474
IleArg: 3.217 ± 0.53
IleSer: 2.859 ± 0.395
IleThr: 2.323 ± 0.387
IleVal: 3.127 ± 0.427
IleTrp: 0.536 ± 0.248
IleTyr: 1.698 ± 0.348
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.701 ± 1.06
LysCys: 0.536 ± 0.233
LysAsp: 3.306 ± 0.645
LysGlu: 4.557 ± 0.685
LysPhe: 2.144 ± 0.383
LysGly: 5.808 ± 0.893
LysHis: 1.698 ± 0.309
LysIle: 2.055 ± 0.478
LysLys: 2.77 ± 0.633
LysLeu: 5.004 ± 0.834
LysMet: 1.43 ± 0.335
LysAsn: 2.591 ± 0.439
LysPro: 2.323 ± 0.553
LysGln: 1.698 ± 0.399
LysArg: 3.038 ± 0.668
LysSer: 2.412 ± 0.458
LysThr: 3.038 ± 0.36
LysVal: 4.378 ± 0.555
LysTrp: 0.536 ± 0.227
LysTyr: 1.43 ± 0.383
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.041 ± 1.087
LeuCys: 0.625 ± 0.286
LeuAsp: 4.646 ± 0.544
LeuGlu: 6.344 ± 0.999
LeuPhe: 2.859 ± 0.497
LeuGly: 5.986 ± 0.826
LeuHis: 1.251 ± 0.342
LeuIle: 3.306 ± 0.443
LeuLys: 6.254 ± 0.538
LeuLeu: 7.684 ± 1.302
LeuMet: 1.966 ± 0.325
LeuAsn: 3.753 ± 0.65
LeuPro: 4.11 ± 0.635
LeuGln: 4.021 ± 0.584
LeuArg: 5.182 ± 0.651
LeuSer: 5.54 ± 0.578
LeuThr: 5.629 ± 0.933
LeuVal: 6.701 ± 1.212
LeuTrp: 1.072 ± 0.36
LeuTyr: 1.876 ± 0.381
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.949 ± 0.452
MetCys: 0.357 ± 0.182
MetAsp: 1.966 ± 0.314
MetGlu: 1.251 ± 0.35
MetPhe: 0.715 ± 0.247
MetGly: 1.43 ± 0.346
MetHis: 0.715 ± 0.267
MetIle: 1.34 ± 0.318
MetLys: 1.072 ± 0.287
MetLeu: 2.234 ± 0.546
MetMet: 0.804 ± 0.316
MetAsn: 0.804 ± 0.196
MetPro: 1.162 ± 0.361
MetGln: 1.787 ± 0.465
MetArg: 0.983 ± 0.22
MetSer: 1.43 ± 0.288
MetThr: 1.966 ± 0.46
MetVal: 1.519 ± 0.394
MetTrp: 0.0 ± 0.0
MetTyr: 0.625 ± 0.202
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.306 ± 0.605
AsnCys: 0.804 ± 0.266
AsnAsp: 2.323 ± 0.396
AsnGlu: 1.966 ± 0.407
AsnPhe: 1.519 ± 0.403
AsnGly: 4.646 ± 0.808
AsnHis: 0.536 ± 0.228
AsnIle: 2.502 ± 0.479
AsnLys: 1.519 ± 0.305
AsnLeu: 3.753 ± 0.646
AsnMet: 0.625 ± 0.231
AsnAsn: 1.519 ± 0.346
AsnPro: 2.949 ± 0.426
AsnGln: 1.966 ± 0.5
AsnArg: 1.876 ± 0.512
AsnSer: 2.412 ± 0.408
AsnThr: 1.966 ± 0.312
AsnVal: 3.038 ± 0.46
AsnTrp: 0.715 ± 0.296
AsnTyr: 1.876 ± 0.455
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.949 ± 0.456
ProCys: 0.715 ± 0.279
ProAsp: 1.698 ± 0.32
ProGlu: 4.289 ± 0.754
ProPhe: 1.34 ± 0.236
ProGly: 3.217 ± 0.471
ProHis: 0.357 ± 0.139
ProIle: 1.162 ± 0.424
ProLys: 2.412 ± 0.509
ProLeu: 3.485 ± 0.612
ProMet: 0.625 ± 0.242
ProAsn: 1.876 ± 0.411
ProPro: 0.983 ± 0.359
ProGln: 1.519 ± 0.294
ProArg: 1.698 ± 0.342
ProSer: 1.966 ± 0.35
ProThr: 2.502 ± 0.493
ProVal: 3.127 ± 0.521
ProTrp: 0.804 ± 0.245
ProTyr: 2.055 ± 0.474
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.021 ± 0.664
GlnCys: 0.0 ± 0.0
GlnAsp: 2.412 ± 0.361
GlnGlu: 3.217 ± 0.387
GlnPhe: 1.787 ± 0.333
GlnGly: 2.949 ± 0.418
GlnHis: 0.625 ± 0.243
GlnIle: 1.34 ± 0.387
GlnLys: 2.77 ± 0.54
GlnLeu: 4.557 ± 0.635
GlnMet: 1.519 ± 0.545
GlnAsn: 1.519 ± 0.319
GlnPro: 1.876 ± 0.226
GlnGln: 3.663 ± 0.62
GlnArg: 2.77 ± 0.592
GlnSer: 2.859 ± 0.493
GlnThr: 1.787 ± 0.457
GlnVal: 3.127 ± 0.487
GlnTrp: 0.536 ± 0.228
GlnTyr: 1.608 ± 0.36
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.093 ± 0.705
ArgCys: 0.715 ± 0.25
ArgAsp: 4.11 ± 0.59
ArgGlu: 3.485 ± 0.541
ArgPhe: 1.608 ± 0.433
ArgGly: 4.11 ± 0.669
ArgHis: 1.072 ± 0.317
ArgIle: 2.68 ± 0.441
ArgLys: 3.127 ± 0.537
ArgLeu: 4.825 ± 0.744
ArgMet: 1.162 ± 0.288
ArgAsn: 2.234 ± 0.377
ArgPro: 1.698 ± 0.344
ArgGln: 3.127 ± 0.488
ArgArg: 3.127 ± 0.565
ArgSer: 3.485 ± 0.489
ArgThr: 3.306 ± 0.55
ArgVal: 4.199 ± 0.76
ArgTrp: 0.893 ± 0.309
ArgTyr: 1.519 ± 0.307
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.557 ± 0.702
SerCys: 0.715 ± 0.26
SerAsp: 4.378 ± 0.542
SerGlu: 3.127 ± 0.54
SerPhe: 3.038 ± 0.668
SerGly: 5.004 ± 0.836
SerHis: 1.608 ± 0.378
SerIle: 2.859 ± 0.538
SerLys: 3.306 ± 0.485
SerLeu: 4.289 ± 0.644
SerMet: 1.519 ± 0.44
SerAsn: 1.876 ± 0.543
SerPro: 2.323 ± 0.464
SerGln: 2.949 ± 0.492
SerArg: 3.574 ± 0.636
SerSer: 3.485 ± 0.953
SerThr: 3.663 ± 0.536
SerVal: 5.004 ± 0.748
SerTrp: 0.625 ± 0.173
SerTyr: 2.68 ± 0.575
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.182 ± 0.764
ThrCys: 0.804 ± 0.293
ThrAsp: 3.753 ± 0.515
ThrGlu: 3.663 ± 0.645
ThrPhe: 2.144 ± 0.482
ThrGly: 5.897 ± 0.662
ThrHis: 1.162 ± 0.22
ThrIle: 3.574 ± 0.54
ThrLys: 3.217 ± 0.522
ThrLeu: 5.093 ± 0.599
ThrMet: 1.519 ± 0.327
ThrAsn: 2.323 ± 0.57
ThrPro: 2.68 ± 0.4
ThrGln: 2.055 ± 0.404
ThrArg: 2.055 ± 0.395
ThrSer: 4.199 ± 1.004
ThrThr: 3.485 ± 0.935
ThrVal: 3.931 ± 0.824
ThrTrp: 0.536 ± 0.18
ThrTyr: 1.43 ± 0.264
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.004 ± 0.568
ValCys: 0.625 ± 0.233
ValAsp: 3.931 ± 0.581
ValGlu: 4.199 ± 0.666
ValPhe: 2.234 ± 0.708
ValGly: 5.004 ± 0.597
ValHis: 1.162 ± 0.366
ValIle: 4.021 ± 0.622
ValLys: 4.199 ± 0.59
ValLeu: 7.416 ± 1.258
ValMet: 1.43 ± 0.374
ValAsn: 4.557 ± 1.014
ValPro: 2.502 ± 0.465
ValGln: 2.055 ± 0.31
ValArg: 5.182 ± 0.823
ValSer: 4.914 ± 1.136
ValThr: 5.808 ± 0.971
ValVal: 5.361 ± 0.947
ValTrp: 0.983 ± 0.306
ValTyr: 2.234 ± 0.533
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.447 ± 0.165
TrpCys: 0.447 ± 0.205
TrpAsp: 0.625 ± 0.232
TrpGlu: 0.625 ± 0.192
TrpPhe: 0.268 ± 0.142
TrpGly: 0.536 ± 0.212
TrpHis: 0.357 ± 0.212
TrpIle: 0.625 ± 0.363
TrpLys: 1.072 ± 0.317
TrpLeu: 1.072 ± 0.372
TrpMet: 0.357 ± 0.167
TrpAsn: 0.625 ± 0.203
TrpPro: 0.357 ± 0.212
TrpGln: 0.536 ± 0.196
TrpArg: 0.715 ± 0.233
TrpSer: 0.804 ± 0.32
TrpThr: 0.893 ± 0.311
TrpVal: 1.519 ± 0.363
TrpTrp: 0.357 ± 0.148
TrpTyr: 0.179 ± 0.12
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.412 ± 0.581
TyrCys: 0.179 ± 0.126
TyrAsp: 3.038 ± 0.524
TyrGlu: 1.43 ± 0.336
TyrPhe: 1.162 ± 0.271
TyrGly: 3.306 ± 0.41
TyrHis: 0.536 ± 0.242
TyrIle: 1.162 ± 0.346
TyrLys: 1.698 ± 0.321
TyrLeu: 2.859 ± 0.558
TyrMet: 1.43 ± 0.294
TyrAsn: 1.608 ± 0.358
TyrPro: 1.251 ± 0.275
TyrGln: 1.34 ± 0.403
TyrArg: 2.234 ± 0.302
TyrSer: 1.787 ± 0.357
TyrThr: 1.787 ± 0.423
TyrVal: 2.323 ± 0.54
TyrTrp: 0.625 ± 0.245
TyrTyr: 0.893 ± 0.343
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (11193 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
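For readers unfamiliar with protein-level bootstrapping, the idea can be sketched as below: whole proteins are resampled with replacement and the statistic is recomputed for each replicate. This is only an illustration under stated assumptions, not the database's procedure: the function names, the fixed seed, the per-pair normalisation, and the use of the sample standard deviation as the reported error are all assumed here.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Per mille frequency of one dipeptide (assumed: share of all pairs)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    # Assumption: the error is the standard deviation of the replicates.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MAAL", "MLAK", "AAAA"]  # hypothetical toy sequences
    print(bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the result depends on which proteins happen to be in the proteome.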

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski