Amino acid dipepetide frequency for Klebsiella phage 1 LV-2017

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.617AlaAla: 11.617 ± 2.013
0.796AlaCys: 0.796 ± 0.357
6.206AlaAsp: 6.206 ± 0.956
5.57AlaGlu: 5.57 ± 0.697
4.297AlaPhe: 4.297 ± 1.044
7.798AlaGly: 7.798 ± 1.092
1.114AlaHis: 1.114 ± 0.553
4.297AlaIle: 4.297 ± 0.542
5.411AlaLys: 5.411 ± 1.039
10.344AlaLeu: 10.344 ± 2.018
3.342AlaMet: 3.342 ± 0.984
3.978AlaAsn: 3.978 ± 0.761
3.819AlaPro: 3.819 ± 0.854
4.774AlaGln: 4.774 ± 1.165
7.798AlaArg: 7.798 ± 1.086
6.525AlaSer: 6.525 ± 1.533
7.161AlaThr: 7.161 ± 1.241
5.888AlaVal: 5.888 ± 1.105
1.273AlaTrp: 1.273 ± 0.359
2.705AlaTyr: 2.705 ± 0.519
0.0AlaXaa: 0.0 ± 0.0
Cys
0.318CysAla: 0.318 ± 0.29
0.318CysCys: 0.318 ± 0.199
0.796CysAsp: 0.796 ± 0.355
0.159CysGlu: 0.159 ± 0.18
0.318CysPhe: 0.318 ± 0.23
0.477CysGly: 0.477 ± 0.274
0.0CysHis: 0.0 ± 0.0
0.318CysIle: 0.318 ± 0.231
0.796CysLys: 0.796 ± 0.337
0.637CysLeu: 0.637 ± 0.405
0.159CysMet: 0.159 ± 0.141
0.159CysAsn: 0.159 ± 0.165
0.159CysPro: 0.159 ± 0.145
0.318CysGln: 0.318 ± 0.235
0.955CysArg: 0.955 ± 0.383
0.477CysSer: 0.477 ± 0.238
0.637CysThr: 0.637 ± 0.368
0.955CysVal: 0.955 ± 0.422
0.477CysTrp: 0.477 ± 0.356
0.318CysTyr: 0.318 ± 0.245
0.0CysXaa: 0.0 ± 0.0
Asp
6.525AspAla: 6.525 ± 1.157
0.318AspCys: 0.318 ± 0.189
4.137AspAsp: 4.137 ± 1.075
5.251AspGlu: 5.251 ± 1.131
1.75AspPhe: 1.75 ± 0.487
5.729AspGly: 5.729 ± 1.192
0.477AspHis: 0.477 ± 0.243
3.978AspIle: 3.978 ± 0.576
1.91AspLys: 1.91 ± 0.52
4.615AspLeu: 4.615 ± 0.802
1.114AspMet: 1.114 ± 0.518
1.591AspAsn: 1.591 ± 0.394
3.501AspPro: 3.501 ± 1.005
2.705AspGln: 2.705 ± 0.61
1.91AspArg: 1.91 ± 0.492
2.228AspSer: 2.228 ± 0.435
2.387AspThr: 2.387 ± 0.555
3.66AspVal: 3.66 ± 1.131
0.955AspTrp: 0.955 ± 0.351
1.91AspTyr: 1.91 ± 0.462
0.0AspXaa: 0.0 ± 0.0
Glu
7.161GluAla: 7.161 ± 0.916
0.637GluCys: 0.637 ± 0.399
2.546GluAsp: 2.546 ± 0.518
5.092GluGlu: 5.092 ± 1.031
2.387GluPhe: 2.387 ± 0.553
3.183GluGly: 3.183 ± 0.848
1.114GluHis: 1.114 ± 0.341
3.183GluIle: 3.183 ± 0.731
3.978GluLys: 3.978 ± 0.454
7.957GluLeu: 7.957 ± 1.176
1.591GluMet: 1.591 ± 0.572
3.819GluAsn: 3.819 ± 0.525
3.342GluPro: 3.342 ± 0.951
3.183GluGln: 3.183 ± 0.817
4.297GluArg: 4.297 ± 0.563
4.137GluSer: 4.137 ± 1.024
3.183GluThr: 3.183 ± 0.616
4.615GluVal: 4.615 ± 0.87
1.114GluTrp: 1.114 ± 0.314
1.273GluTyr: 1.273 ± 0.511
0.0GluXaa: 0.0 ± 0.0
Phe
3.342PheAla: 3.342 ± 0.613
0.159PheCys: 0.159 ± 0.166
2.387PheAsp: 2.387 ± 0.85
1.75PheGlu: 1.75 ± 0.63
0.955PhePhe: 0.955 ± 0.402
3.501PheGly: 3.501 ± 0.804
0.796PheHis: 0.796 ± 0.39
1.75PheIle: 1.75 ± 0.584
1.114PheLys: 1.114 ± 0.35
2.705PheLeu: 2.705 ± 0.565
0.796PheMet: 0.796 ± 0.317
3.342PheAsn: 3.342 ± 0.777
0.796PhePro: 0.796 ± 0.439
0.637PheGln: 0.637 ± 0.287
3.183PheArg: 3.183 ± 0.875
2.546PheSer: 2.546 ± 0.756
2.387PheThr: 2.387 ± 0.481
2.069PheVal: 2.069 ± 0.678
0.477PheTrp: 0.477 ± 0.244
1.591PheTyr: 1.591 ± 0.458
0.0PheXaa: 0.0 ± 0.0
Gly
7.957GlyAla: 7.957 ± 2.067
0.318GlyCys: 0.318 ± 0.222
4.137GlyAsp: 4.137 ± 0.937
5.251GlyGlu: 5.251 ± 0.808
3.024GlyPhe: 3.024 ± 0.894
6.843GlyGly: 6.843 ± 1.181
1.432GlyHis: 1.432 ± 0.409
3.66GlyIle: 3.66 ± 0.709
4.137GlyLys: 4.137 ± 0.919
6.206GlyLeu: 6.206 ± 0.963
1.75GlyMet: 1.75 ± 0.47
2.864GlyAsn: 2.864 ± 0.581
1.432GlyPro: 1.432 ± 0.49
2.069GlyGln: 2.069 ± 0.471
4.297GlyArg: 4.297 ± 0.618
3.501GlySer: 3.501 ± 0.713
4.297GlyThr: 4.297 ± 1.024
5.57GlyVal: 5.57 ± 0.919
0.955GlyTrp: 0.955 ± 0.356
2.228GlyTyr: 2.228 ± 0.567
0.0GlyXaa: 0.0 ± 0.0
His
0.796HisAla: 0.796 ± 0.379
0.477HisCys: 0.477 ± 0.253
1.75HisAsp: 1.75 ± 0.651
1.114HisGlu: 1.114 ± 0.438
0.318HisPhe: 0.318 ± 0.217
1.273HisGly: 1.273 ± 0.518
0.796HisHis: 0.796 ± 0.456
0.955HisIle: 0.955 ± 0.432
0.477HisLys: 0.477 ± 0.26
1.91HisLeu: 1.91 ± 0.524
0.477HisMet: 0.477 ± 0.268
0.318HisAsn: 0.318 ± 0.167
0.955HisPro: 0.955 ± 0.395
0.796HisGln: 0.796 ± 0.487
0.955HisArg: 0.955 ± 0.405
0.955HisSer: 0.955 ± 0.427
0.637HisThr: 0.637 ± 0.273
0.796HisVal: 0.796 ± 0.392
0.318HisTrp: 0.318 ± 0.22
0.159HisTyr: 0.159 ± 0.166
0.0HisXaa: 0.0 ± 0.0
Ile
5.092IleAla: 5.092 ± 0.801
0.637IleCys: 0.637 ± 0.359
3.819IleAsp: 3.819 ± 0.755
2.546IleGlu: 2.546 ± 0.59
1.432IlePhe: 1.432 ± 0.388
2.546IleGly: 2.546 ± 0.564
0.637IleHis: 0.637 ± 0.368
2.069IleIle: 2.069 ± 0.544
2.705IleLys: 2.705 ± 0.554
2.864IleLeu: 2.864 ± 0.727
1.114IleMet: 1.114 ± 0.372
3.024IleAsn: 3.024 ± 0.684
2.387IlePro: 2.387 ± 0.776
1.432IleGln: 1.432 ± 0.786
3.342IleArg: 3.342 ± 0.803
5.092IleSer: 5.092 ± 0.749
3.66IleThr: 3.66 ± 0.651
1.75IleVal: 1.75 ± 0.668
0.955IleTrp: 0.955 ± 0.432
1.114IleTyr: 1.114 ± 0.482
0.0IleXaa: 0.0 ± 0.0
Lys
5.729LysAla: 5.729 ± 0.889
0.159LysCys: 0.159 ± 0.163
1.75LysAsp: 1.75 ± 0.463
3.978LysGlu: 3.978 ± 0.828
0.955LysPhe: 0.955 ± 0.27
2.228LysGly: 2.228 ± 0.536
1.432LysHis: 1.432 ± 0.42
2.069LysIle: 2.069 ± 0.829
4.297LysLys: 4.297 ± 0.969
3.819LysLeu: 3.819 ± 0.847
1.91LysMet: 1.91 ± 0.513
2.705LysAsn: 2.705 ± 0.659
3.978LysPro: 3.978 ± 0.864
2.864LysGln: 2.864 ± 0.57
3.342LysArg: 3.342 ± 0.871
3.66LysSer: 3.66 ± 0.863
4.297LysThr: 4.297 ± 1.193
3.342LysVal: 3.342 ± 0.56
0.796LysTrp: 0.796 ± 0.3
1.114LysTyr: 1.114 ± 0.285
0.0LysXaa: 0.0 ± 0.0
Leu
9.23LeuAla: 9.23 ± 0.979
1.114LeuCys: 1.114 ± 0.644
5.251LeuAsp: 5.251 ± 0.744
5.729LeuGlu: 5.729 ± 0.682
3.342LeuPhe: 3.342 ± 0.785
6.525LeuGly: 6.525 ± 1.177
1.432LeuHis: 1.432 ± 0.344
4.933LeuIle: 4.933 ± 1.016
5.888LeuLys: 5.888 ± 0.757
7.957LeuLeu: 7.957 ± 0.946
2.705LeuMet: 2.705 ± 0.672
4.297LeuAsn: 4.297 ± 0.658
2.069LeuPro: 2.069 ± 0.531
2.546LeuGln: 2.546 ± 0.752
6.047LeuArg: 6.047 ± 1.085
7.638LeuSer: 7.638 ± 1.354
4.615LeuThr: 4.615 ± 1.301
5.411LeuVal: 5.411 ± 1.025
0.637LeuTrp: 0.637 ± 0.38
2.864LeuTyr: 2.864 ± 0.895
0.0LeuXaa: 0.0 ± 0.0
Met
3.501MetAla: 3.501 ± 0.821
0.318MetCys: 0.318 ± 0.214
0.955MetAsp: 0.955 ± 0.336
0.796MetGlu: 0.796 ± 0.286
1.432MetPhe: 1.432 ± 0.485
1.91MetGly: 1.91 ± 0.481
0.0MetHis: 0.0 ± 0.0
1.432MetIle: 1.432 ± 0.41
2.546MetLys: 2.546 ± 0.449
1.75MetLeu: 1.75 ± 0.641
0.477MetMet: 0.477 ± 0.233
1.114MetAsn: 1.114 ± 0.354
1.273MetPro: 1.273 ± 0.495
1.432MetGln: 1.432 ± 0.541
1.91MetArg: 1.91 ± 0.427
2.546MetSer: 2.546 ± 0.435
1.273MetThr: 1.273 ± 0.458
1.114MetVal: 1.114 ± 0.447
0.796MetTrp: 0.796 ± 0.381
0.477MetTyr: 0.477 ± 0.214
0.0MetXaa: 0.0 ± 0.0
Asn
4.456AsnAla: 4.456 ± 1.076
0.159AsnCys: 0.159 ± 0.15
2.069AsnAsp: 2.069 ± 0.524
3.501AsnGlu: 3.501 ± 0.731
1.273AsnPhe: 1.273 ± 0.452
3.501AsnGly: 3.501 ± 0.583
0.318AsnHis: 0.318 ± 0.209
3.501AsnIle: 3.501 ± 0.663
3.501AsnLys: 3.501 ± 0.817
3.978AsnLeu: 3.978 ± 0.828
0.796AsnMet: 0.796 ± 0.263
2.228AsnAsn: 2.228 ± 0.766
2.864AsnPro: 2.864 ± 0.583
2.069AsnGln: 2.069 ± 0.767
2.069AsnArg: 2.069 ± 0.574
2.069AsnSer: 2.069 ± 0.614
1.91AsnThr: 1.91 ± 0.639
1.75AsnVal: 1.75 ± 0.48
0.955AsnTrp: 0.955 ± 0.372
0.637AsnTyr: 0.637 ± 0.337
0.0AsnXaa: 0.0 ± 0.0
Pro
4.774ProAla: 4.774 ± 1.061
0.0ProCys: 0.0 ± 0.0
3.183ProAsp: 3.183 ± 0.758
4.933ProGlu: 4.933 ± 1.336
1.432ProPhe: 1.432 ± 0.505
1.91ProGly: 1.91 ± 0.46
1.114ProHis: 1.114 ± 0.482
1.591ProIle: 1.591 ± 0.584
2.228ProLys: 2.228 ± 0.405
3.342ProLeu: 3.342 ± 0.652
0.955ProMet: 0.955 ± 0.453
1.432ProAsn: 1.432 ± 0.461
0.955ProPro: 0.955 ± 0.503
1.114ProGln: 1.114 ± 0.651
1.114ProArg: 1.114 ± 0.357
2.705ProSer: 2.705 ± 1.208
0.955ProThr: 0.955 ± 0.318
3.183ProVal: 3.183 ± 1.13
0.637ProTrp: 0.637 ± 0.315
1.273ProTyr: 1.273 ± 0.535
0.0ProXaa: 0.0 ± 0.0
Gln
5.888GlnAla: 5.888 ± 1.584
0.318GlnCys: 0.318 ± 0.213
1.591GlnAsp: 1.591 ± 0.723
2.864GlnGlu: 2.864 ± 0.938
1.591GlnPhe: 1.591 ± 0.379
2.705GlnGly: 2.705 ± 0.561
0.477GlnHis: 0.477 ± 0.228
1.432GlnIle: 1.432 ± 0.454
2.228GlnLys: 2.228 ± 0.5
3.342GlnLeu: 3.342 ± 0.671
1.432GlnMet: 1.432 ± 0.488
1.273GlnAsn: 1.273 ± 0.757
1.591GlnPro: 1.591 ± 0.517
1.75GlnGln: 1.75 ± 0.713
2.705GlnArg: 2.705 ± 0.654
2.228GlnSer: 2.228 ± 0.811
3.024GlnThr: 3.024 ± 0.629
2.387GlnVal: 2.387 ± 0.622
0.796GlnTrp: 0.796 ± 0.28
0.796GlnTyr: 0.796 ± 0.401
0.0GlnXaa: 0.0 ± 0.0
Arg
6.684ArgAla: 6.684 ± 0.898
0.159ArgCys: 0.159 ± 0.201
3.183ArgAsp: 3.183 ± 0.593
5.092ArgGlu: 5.092 ± 1.022
2.546ArgPhe: 2.546 ± 0.839
4.456ArgGly: 4.456 ± 0.807
2.069ArgHis: 2.069 ± 0.588
3.024ArgIle: 3.024 ± 0.776
2.069ArgLys: 2.069 ± 0.386
7.161ArgLeu: 7.161 ± 0.895
2.228ArgMet: 2.228 ± 0.619
2.069ArgAsn: 2.069 ± 0.621
1.432ArgPro: 1.432 ± 0.464
3.183ArgGln: 3.183 ± 1.138
5.092ArgArg: 5.092 ± 0.864
3.978ArgSer: 3.978 ± 1.006
2.546ArgThr: 2.546 ± 0.738
3.024ArgVal: 3.024 ± 0.675
1.75ArgTrp: 1.75 ± 0.484
1.91ArgTyr: 1.91 ± 0.495
0.0ArgXaa: 0.0 ± 0.0
Ser
7.002SerAla: 7.002 ± 1.079
1.273SerCys: 1.273 ± 0.472
3.66SerAsp: 3.66 ± 0.919
4.456SerGlu: 4.456 ± 0.457
2.705SerPhe: 2.705 ± 0.635
6.206SerGly: 6.206 ± 1.062
0.637SerHis: 0.637 ± 0.38
2.228SerIle: 2.228 ± 0.552
4.615SerLys: 4.615 ± 0.912
6.047SerLeu: 6.047 ± 1.442
1.75SerMet: 1.75 ± 0.533
3.501SerAsn: 3.501 ± 0.598
1.91SerPro: 1.91 ± 0.537
3.024SerGln: 3.024 ± 1.121
4.137SerArg: 4.137 ± 0.701
5.092SerSer: 5.092 ± 1.339
3.024SerThr: 3.024 ± 0.674
4.933SerVal: 4.933 ± 0.968
0.955SerTrp: 0.955 ± 0.587
0.796SerTyr: 0.796 ± 0.294
0.0SerXaa: 0.0 ± 0.0
Thr
5.888ThrAla: 5.888 ± 1.188
0.477ThrCys: 0.477 ± 0.282
3.024ThrAsp: 3.024 ± 0.529
3.183ThrGlu: 3.183 ± 0.825
1.91ThrPhe: 1.91 ± 0.445
5.092ThrGly: 5.092 ± 0.83
0.477ThrHis: 0.477 ± 0.26
3.66ThrIle: 3.66 ± 0.956
2.069ThrLys: 2.069 ± 0.451
5.251ThrLeu: 5.251 ± 0.65
1.591ThrMet: 1.591 ± 0.41
1.75ThrAsn: 1.75 ± 0.596
2.387ThrPro: 2.387 ± 0.635
3.66ThrGln: 3.66 ± 0.599
3.024ThrArg: 3.024 ± 0.518
4.137ThrSer: 4.137 ± 0.872
2.705ThrThr: 2.705 ± 0.535
4.297ThrVal: 4.297 ± 1.039
0.955ThrTrp: 0.955 ± 0.36
1.114ThrTyr: 1.114 ± 0.322
0.0ThrXaa: 0.0 ± 0.0
Val
4.615ValAla: 4.615 ± 1.113
0.477ValCys: 0.477 ± 0.255
3.978ValAsp: 3.978 ± 0.68
3.978ValGlu: 3.978 ± 0.742
2.228ValPhe: 2.228 ± 0.651
2.387ValGly: 2.387 ± 0.455
0.955ValHis: 0.955 ± 0.437
3.024ValIle: 3.024 ± 0.725
2.387ValLys: 2.387 ± 0.751
7.479ValLeu: 7.479 ± 1.217
1.591ValMet: 1.591 ± 0.569
2.546ValAsn: 2.546 ± 0.646
2.864ValPro: 2.864 ± 0.665
0.796ValGln: 0.796 ± 0.298
4.615ValArg: 4.615 ± 0.835
5.092ValSer: 5.092 ± 0.801
4.774ValThr: 4.774 ± 1.074
4.933ValVal: 4.933 ± 1.118
1.114ValTrp: 1.114 ± 0.441
3.342ValTyr: 3.342 ± 0.491
0.0ValXaa: 0.0 ± 0.0
Trp
1.432TrpAla: 1.432 ± 0.593
0.477TrpCys: 0.477 ± 0.26
0.955TrpAsp: 0.955 ± 0.441
0.796TrpGlu: 0.796 ± 0.403
0.796TrpPhe: 0.796 ± 0.38
1.591TrpGly: 1.591 ± 0.565
0.477TrpHis: 0.477 ± 0.28
0.477TrpIle: 0.477 ± 0.276
0.796TrpLys: 0.796 ± 0.359
1.75TrpLeu: 1.75 ± 0.559
0.637TrpMet: 0.637 ± 0.345
0.796TrpAsn: 0.796 ± 0.29
0.318TrpPro: 0.318 ± 0.196
0.637TrpGln: 0.637 ± 0.394
0.796TrpArg: 0.796 ± 0.329
0.796TrpSer: 0.796 ± 0.392
0.796TrpThr: 0.796 ± 0.32
1.432TrpVal: 1.432 ± 0.469
0.318TrpTrp: 0.318 ± 0.231
0.477TrpTyr: 0.477 ± 0.246
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.705TyrAla: 2.705 ± 0.355
0.159TyrCys: 0.159 ± 0.15
1.273TyrAsp: 1.273 ± 0.374
1.75TyrGlu: 1.75 ± 0.566
1.591TyrPhe: 1.591 ± 0.544
2.069TyrGly: 2.069 ± 0.476
0.477TyrHis: 0.477 ± 0.256
0.637TyrIle: 0.637 ± 0.322
1.273TyrLys: 1.273 ± 0.466
1.273TyrLeu: 1.273 ± 0.405
0.477TyrMet: 0.477 ± 0.275
0.796TyrAsn: 0.796 ± 0.41
0.796TyrPro: 0.796 ± 0.402
1.432TyrGln: 1.432 ± 0.502
2.069TyrArg: 2.069 ± 0.517
2.546TyrSer: 2.546 ± 0.486
2.387TyrThr: 2.387 ± 0.651
2.069TyrVal: 2.069 ± 0.499
0.318TyrTrp: 0.318 ± 0.246
0.796TyrTyr: 0.796 ± 0.307
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (6285 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski