Amino acid dipeptide frequency for Klebsiella phage YX3973

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 should be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
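The counting and the comparison against the uniform expectation can be sketched as follows. This is a minimal illustration, not the database's own code: the function names, the use of overlapping windows within each protein, and the pooling of every non-standard letter into X (Xaa) are assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (assumed pooling of all other letters)

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for a, b in zip(seq, seq[1:]):
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKAAL", "AAGK"]
freqs = dipeptide_permille(toy)
```

With the values from the table, `classify(11.518)` (AlaAla) gives `"over"` and `classify(0.149)` (CysCys) gives `"under"`. Multiplying a per mille value by 10⁻³ converts it to a fraction, for example 11.518‰ = 0.011518.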

For more information, see the sequence space article on Wikipedia.

First\Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 11.518 ± 2.065 | 0.892 ± 0.25 | 6.019 ± 0.739 | 5.722 ± 0.76 | 2.155 ± 0.449 | 7.431 ± 0.733 | 0.966 ± 0.259 | 6.093 ± 0.541 | 5.35 ± 0.581 | 7.505 ± 0.863 | 3.418 ± 0.521 | 4.161 ± 0.663 | 2.749 ± 0.466 | 3.344 ± 1.048 | 5.573 ± 0.948 | 7.208 ± 1.181 | 4.979 ± 0.87 | 6.465 ± 0.618 | 1.04 ± 0.309 | 2.304 ± 0.343 | 0.0 ± 0.0
Cys | 1.189 ± 0.29 | 0.149 ± 0.104 | 0.892 ± 0.292 | 0.892 ± 0.299 | 0.372 ± 0.155 | 1.486 ± 0.434 | 0.52 ± 0.206 | 0.594 ± 0.254 | 1.115 ± 0.294 | 0.594 ± 0.214 | 0.074 ± 0.069 | 0.892 ± 0.247 | 0.446 ± 0.202 | 0.446 ± 0.174 | 1.04 ± 0.286 | 0.223 ± 0.124 | 0.743 ± 0.234 | 0.817 ± 0.258 | 0.446 ± 0.167 | 0.372 ± 0.182 | 0.0 ± 0.0
Asp | 5.871 ± 0.687 | 0.743 ± 0.219 | 4.087 ± 0.673 | 5.053 ± 0.733 | 2.749 ± 0.421 | 5.796 ± 0.766 | 0.966 ± 0.311 | 4.31 ± 0.541 | 4.459 ± 0.633 | 3.864 ± 0.391 | 1.412 ± 0.348 | 1.932 ± 0.452 | 1.709 ± 0.378 | 1.486 ± 0.348 | 2.006 ± 0.373 | 2.675 ± 0.456 | 2.304 ± 0.448 | 4.087 ± 0.632 | 0.669 ± 0.22 | 2.601 ± 0.381 | 0.0 ± 0.0
Glu | 6.465 ± 0.709 | 1.486 ± 0.394 | 3.567 ± 0.59 | 4.979 ± 0.674 | 2.527 ± 0.5 | 3.195 ± 0.667 | 1.115 ± 0.349 | 4.236 ± 0.603 | 4.236 ± 0.661 | 5.722 ± 0.565 | 2.378 ± 0.405 | 2.972 ± 0.509 | 2.452 ± 0.43 | 3.493 ± 0.565 | 4.607 ± 0.623 | 3.938 ± 0.618 | 2.527 ± 0.493 | 3.716 ± 0.654 | 1.338 ± 0.301 | 2.452 ± 0.367 | 0.0 ± 0.0
Phe | 2.675 ± 0.418 | 0.817 ± 0.276 | 2.749 ± 0.372 | 2.304 ± 0.383 | 1.338 ± 0.291 | 3.938 ± 0.481 | 0.149 ± 0.103 | 1.709 ± 0.405 | 1.412 ± 0.336 | 1.263 ± 0.303 | 1.04 ± 0.294 | 1.783 ± 0.378 | 1.338 ± 0.321 | 0.52 ± 0.198 | 1.561 ± 0.342 | 2.675 ± 0.512 | 1.858 ± 0.384 | 2.304 ± 0.398 | 0.52 ± 0.177 | 0.892 ± 0.212 | 0.0 ± 0.0
Gly | 5.276 ± 0.742 | 0.817 ± 0.244 | 4.31 ± 0.579 | 4.384 ± 0.575 | 2.972 ± 0.483 | 5.573 ± 0.764 | 0.743 ± 0.233 | 4.013 ± 0.59 | 6.242 ± 0.855 | 4.682 ± 0.645 | 2.527 ± 0.417 | 3.493 ± 0.533 | 1.263 ± 0.404 | 2.898 ± 0.496 | 3.938 ± 0.527 | 5.35 ± 0.727 | 4.682 ± 1.0 | 6.242 ± 0.622 | 1.263 ± 0.413 | 3.79 ± 0.617 | 0.0 ± 0.0
His | 1.486 ± 0.416 | 0.446 ± 0.18 | 0.966 ± 0.283 | 0.743 ± 0.215 | 0.817 ± 0.27 | 1.932 ± 0.572 | 0.52 ± 0.235 | 0.817 ± 0.198 | 1.338 ± 0.344 | 1.263 ± 0.339 | 0.52 ± 0.202 | 0.892 ± 0.283 | 1.189 ± 0.33 | 0.594 ± 0.224 | 0.594 ± 0.214 | 0.966 ± 0.221 | 0.297 ± 0.123 | 1.04 ± 0.311 | 0.149 ± 0.108 | 0.446 ± 0.201 | 0.0 ± 0.0
Ile | 5.127 ± 0.558 | 0.669 ± 0.234 | 4.533 ± 0.753 | 4.756 ± 0.686 | 2.155 ± 0.328 | 3.864 ± 0.463 | 1.263 ± 0.319 | 4.979 ± 0.565 | 4.607 ± 0.735 | 3.121 ± 0.481 | 1.858 ± 0.435 | 3.27 ± 0.498 | 2.452 ± 0.474 | 2.006 ± 0.412 | 3.121 ± 0.399 | 3.864 ± 0.545 | 4.459 ± 0.619 | 3.344 ± 0.386 | 0.297 ± 0.143 | 1.932 ± 0.328 | 0.0 ± 0.0
Lys | 6.762 ± 0.687 | 1.04 ± 0.308 | 2.749 ± 0.499 | 4.087 ± 0.708 | 2.006 ± 0.403 | 3.641 ± 0.673 | 2.155 ± 0.551 | 3.716 ± 0.692 | 3.938 ± 0.813 | 5.053 ± 0.804 | 2.378 ± 0.491 | 3.195 ± 0.468 | 2.898 ± 0.43 | 2.229 ± 0.446 | 3.79 ± 0.865 | 4.384 ± 0.63 | 3.567 ± 0.59 | 3.493 ± 0.535 | 1.04 ± 0.288 | 2.972 ± 0.541 | 0.0 ± 0.0
Leu | 6.688 ± 0.803 | 1.189 ± 0.384 | 3.864 ± 0.514 | 4.607 ± 0.625 | 2.378 ± 0.515 | 4.236 ± 0.513 | 1.04 ± 0.324 | 4.682 ± 0.598 | 5.127 ± 0.688 | 4.979 ± 0.669 | 1.932 ± 0.302 | 3.716 ± 0.656 | 2.675 ± 0.504 | 2.898 ± 0.448 | 4.31 ± 0.507 | 5.127 ± 0.685 | 5.053 ± 0.563 | 3.641 ± 0.458 | 1.189 ± 0.263 | 2.749 ± 0.513 | 0.0 ± 0.0
Met | 3.864 ± 0.582 | 0.074 ± 0.07 | 1.486 ± 0.401 | 1.932 ± 0.367 | 0.892 ± 0.312 | 1.263 ± 0.376 | 0.669 ± 0.291 | 1.858 ± 0.363 | 2.749 ± 0.517 | 2.081 ± 0.392 | 1.338 ± 0.357 | 1.115 ± 0.26 | 1.635 ± 0.292 | 1.115 ± 0.28 | 1.561 ± 0.302 | 1.486 ± 0.288 | 1.783 ± 0.331 | 1.263 ± 0.316 | 0.297 ± 0.159 | 0.743 ± 0.192 | 0.0 ± 0.0
Asn | 4.459 ± 0.768 | 0.669 ± 0.238 | 2.675 ± 0.557 | 2.304 ± 0.342 | 0.966 ± 0.257 | 3.938 ± 0.589 | 0.817 ± 0.285 | 2.824 ± 0.476 | 3.493 ± 0.495 | 3.27 ± 0.494 | 0.743 ± 0.236 | 2.229 ± 0.485 | 2.378 ± 0.413 | 1.932 ± 0.427 | 2.898 ± 0.447 | 2.824 ± 0.461 | 2.378 ± 0.517 | 3.79 ± 0.564 | 0.594 ± 0.234 | 2.006 ± 0.316 | 0.0 ± 0.0
Pro | 3.047 ± 0.602 | 0.446 ± 0.197 | 2.675 ± 0.489 | 2.972 ± 0.506 | 1.04 ± 0.231 | 2.824 ± 0.634 | 0.817 ± 0.254 | 1.412 ± 0.357 | 1.709 ± 0.374 | 2.304 ± 0.404 | 1.04 ± 0.257 | 1.412 ± 0.251 | 0.817 ± 0.201 | 0.966 ± 0.383 | 1.115 ± 0.314 | 2.527 ± 0.465 | 2.452 ± 0.578 | 3.641 ± 0.664 | 0.594 ± 0.201 | 1.635 ± 0.305 | 0.0 ± 0.0
Gln | 3.27 ± 1.168 | 0.446 ± 0.258 | 1.04 ± 0.285 | 2.081 ± 0.44 | 1.263 ± 0.365 | 1.412 ± 0.479 | 0.669 ± 0.292 | 1.932 ± 0.305 | 2.824 ± 0.388 | 3.938 ± 0.628 | 1.263 ± 0.328 | 1.709 ± 0.346 | 1.635 ± 0.366 | 2.006 ± 0.47 | 2.749 ± 0.477 | 3.195 ± 0.577 | 1.858 ± 0.526 | 2.601 ± 0.424 | 0.594 ± 0.198 | 1.412 ± 0.36 | 0.0 ± 0.0
Arg | 4.607 ± 0.696 | 0.817 ± 0.274 | 3.047 ± 0.421 | 4.533 ± 0.603 | 1.04 ± 0.257 | 3.418 ± 0.559 | 0.892 ± 0.303 | 3.047 ± 0.456 | 2.972 ± 0.526 | 4.459 ± 0.726 | 1.635 ± 0.443 | 2.601 ± 0.425 | 1.709 ± 0.358 | 2.378 ± 0.488 | 3.344 ± 0.614 | 2.601 ± 0.478 | 2.601 ± 0.536 | 4.087 ± 0.523 | 0.594 ± 0.214 | 2.527 ± 0.353 | 0.0 ± 0.0
Ser | 5.796 ± 0.674 | 0.669 ± 0.26 | 3.864 ± 0.501 | 5.127 ± 0.545 | 2.527 ± 0.533 | 6.688 ± 0.859 | 1.263 ± 0.284 | 4.013 ± 0.529 | 4.236 ± 0.673 | 4.83 ± 0.767 | 1.858 ± 0.444 | 2.304 ± 0.439 | 1.709 ± 0.289 | 2.749 ± 0.44 | 2.675 ± 0.552 | 3.344 ± 0.457 | 3.79 ± 0.718 | 4.161 ± 0.775 | 1.338 ± 0.323 | 2.081 ± 0.438 | 0.0 ± 0.0
Thr | 5.35 ± 0.849 | 0.669 ± 0.233 | 3.27 ± 0.438 | 3.567 ± 0.506 | 1.263 ± 0.341 | 6.019 ± 0.807 | 0.594 ± 0.204 | 3.79 ± 0.507 | 2.675 ± 0.482 | 4.756 ± 0.758 | 1.189 ± 0.3 | 2.898 ± 0.533 | 2.675 ± 0.467 | 2.229 ± 0.487 | 2.601 ± 0.46 | 4.161 ± 0.761 | 3.418 ± 0.736 | 4.31 ± 0.825 | 0.594 ± 0.275 | 1.412 ± 0.411 | 0.0 ± 0.0
Val | 7.803 ± 0.847 | 0.669 ± 0.288 | 3.938 ± 0.602 | 4.533 ± 0.471 | 2.898 ± 0.597 | 4.533 ± 0.49 | 1.04 ± 0.255 | 4.459 ± 0.541 | 3.716 ± 0.584 | 4.161 ± 0.544 | 1.783 ± 0.42 | 4.013 ± 0.508 | 2.006 ± 0.456 | 2.081 ± 0.345 | 2.824 ± 0.438 | 4.905 ± 0.497 | 5.425 ± 0.87 | 4.905 ± 0.708 | 0.817 ± 0.291 | 1.412 ± 0.283 | 0.0 ± 0.0
Trp | 0.669 ± 0.262 | 0.149 ± 0.118 | 0.892 ± 0.176 | 0.669 ± 0.248 | 0.297 ± 0.131 | 0.372 ± 0.16 | 0.372 ± 0.149 | 0.966 ± 0.298 | 0.817 ± 0.237 | 1.338 ± 0.334 | 0.372 ± 0.188 | 0.817 ± 0.286 | 0.446 ± 0.184 | 1.04 ± 0.319 | 1.04 ± 0.285 | 1.115 ± 0.223 | 1.115 ± 0.298 | 1.115 ± 0.226 | 0.149 ± 0.097 | 0.223 ± 0.112 | 0.0 ± 0.0
Tyr | 2.972 ± 0.449 | 0.446 ± 0.18 | 2.155 ± 0.363 | 2.155 ± 0.442 | 1.263 ± 0.379 | 2.898 ± 0.494 | 0.446 ± 0.225 | 2.229 ± 0.394 | 2.081 ± 0.363 | 2.898 ± 0.489 | 0.372 ± 0.152 | 1.783 ± 0.389 | 1.263 ± 0.373 | 1.486 ± 0.313 | 1.561 ± 0.295 | 2.452 ± 0.447 | 2.155 ± 0.424 | 2.898 ± 0.383 | 0.446 ± 0.191 | 1.486 ± 0.329 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 64 proteins (13458 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
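A protein-level bootstrap of this kind can be sketched as below. It is an illustration under stated assumptions, not the database's implementation: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the sample standard deviation of the replicates is reported as the error. The function names, the fixed seed, and the choice of standard deviation as the error measure are all hypothetical.

```python
import random
import statistics


def dipeptide_freq_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs within proteins."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKAAL", "AAGK", "GGGG"]
err = bootstrap_error(toy, "AA")
```

A dipeptide that never occurs has frequency 0 in every replicate, which is consistent with the `0.0 ± 0.0` entries in the Xaa row and column.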

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski