Amino acid dipeptide frequency for Klebsiella phage 1 TK-2018

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
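As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a set of protein sequences and reports each pair in per mille. The function name `dipeptide_per_mille`, the one-letter sequence encoding, and the choice to count pairs within each protein separately are assumptions made for this example; the exact counting convention used by the database is not stated here.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_per_mille(["MKAAL", "AAG"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

In the toy input there are six pairs in total, two of which are AA, so AA accounts for one third of all pairs.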

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability, each would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
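The arithmetic behind the expected value can be checked with a few lines of Python. This is only a sketch: the `classify` helper is hypothetical and simply compares an observed value with the uniform expectation, whereas the exact rule the page uses for colouring cells is not stated. The three observed values are taken from the table.

```python
N_STANDARD = 20   # standard amino acids
N_WITH_XAA = 21   # standard amino acids plus Xaa

# Uniform expectation in per mille: 1000 / number of possible dipeptides.
expected_per_mille = 1000.0 / N_STANDARD ** 2            # 400 dipeptides
expected_per_mille_with_xaa = 1000.0 / N_WITH_XAA ** 2   # 441 dipeptides


def classify(observed, expected=expected_per_mille):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


examples = {"AlaAla": 16.054, "AspAsn": 2.955, "HisHis": 0.08}
labels = {pair: classify(value) for pair, value in examples.items()}
print(expected_per_mille, labels)
```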

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 16.054‰ corresponds to 0.016054.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 16.054 ± 1.707
AlaCys: 0.879 ± 0.275
AlaAsp: 5.99 ± 0.668
AlaGlu: 5.192 ± 0.64
AlaPhe: 3.674 ± 0.472
AlaGly: 8.147 ± 1.13
AlaHis: 1.837 ± 0.555
AlaIle: 3.914 ± 0.551
AlaLys: 5.272 ± 0.906
AlaLeu: 9.425 ± 0.84
AlaMet: 2.875 ± 0.417
AlaAsn: 3.355 ± 0.456
AlaPro: 4.633 ± 0.967
AlaGln: 5.272 ± 0.827
AlaArg: 5.751 ± 0.724
AlaSer: 5.99 ± 0.785
AlaThr: 5.671 ± 0.704
AlaVal: 7.109 ± 0.83
AlaTrp: 1.198 ± 0.34
AlaTyr: 4.153 ± 0.593
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.799 ± 0.295
CysCys: 0.479 ± 0.28
CysAsp: 0.559 ± 0.177
CysGlu: 0.479 ± 0.174
CysPhe: 0.319 ± 0.16
CysGly: 0.639 ± 0.191
CysHis: 0.319 ± 0.146
CysIle: 0.399 ± 0.166
CysLys: 0.479 ± 0.217
CysLeu: 0.719 ± 0.213
CysMet: 0.639 ± 0.216
CysAsn: 0.24 ± 0.139
CysPro: 0.479 ± 0.205
CysGln: 0.399 ± 0.157
CysArg: 0.719 ± 0.217
CysSer: 0.799 ± 0.326
CysThr: 0.958 ± 0.282
CysVal: 0.719 ± 0.218
CysTrp: 0.319 ± 0.155
CysTyr: 0.479 ± 0.17
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.508 ± 1.015
AspCys: 1.038 ± 0.332
AspAsp: 3.435 ± 0.456
AspGlu: 3.834 ± 0.551
AspPhe: 2.716 ± 0.373
AspGly: 4.792 ± 0.52
AspHis: 0.319 ± 0.154
AspIle: 3.115 ± 0.546
AspLys: 2.316 ± 0.43
AspLeu: 5.032 ± 0.565
AspMet: 2.636 ± 0.384
AspAsn: 2.955 ± 0.444
AspPro: 2.716 ± 0.437
AspGln: 1.677 ± 0.304
AspArg: 2.476 ± 0.634
AspSer: 5.032 ± 0.603
AspThr: 3.435 ± 0.516
AspVal: 3.834 ± 0.499
AspTrp: 0.879 ± 0.161
AspTyr: 2.556 ± 0.425
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.07 ± 0.877
GluCys: 0.319 ± 0.135
GluAsp: 3.275 ± 0.449
GluGlu: 3.994 ± 0.886
GluPhe: 2.476 ± 0.335
GluGly: 3.834 ± 0.659
GluHis: 2.077 ± 0.362
GluIle: 2.157 ± 0.393
GluLys: 2.236 ± 0.407
GluLeu: 5.431 ± 0.674
GluMet: 2.157 ± 0.356
GluAsn: 1.917 ± 0.392
GluPro: 1.597 ± 0.338
GluGln: 3.115 ± 0.624
GluArg: 4.073 ± 0.61
GluSer: 2.556 ± 0.506
GluThr: 2.636 ± 0.44
GluVal: 5.272 ± 0.558
GluTrp: 0.958 ± 0.296
GluTyr: 2.556 ± 0.438
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.716 ± 0.47
PheCys: 0.399 ± 0.242
PheAsp: 1.837 ± 0.378
PheGlu: 2.077 ± 0.469
PhePhe: 1.118 ± 0.233
PheGly: 2.157 ± 0.39
PheHis: 0.559 ± 0.251
PheIle: 1.358 ± 0.335
PheLys: 2.077 ± 0.456
PheLeu: 2.077 ± 0.445
PheMet: 0.559 ± 0.239
PheAsn: 1.917 ± 0.445
PhePro: 1.518 ± 0.289
PheGln: 1.438 ± 0.256
PheArg: 2.396 ± 0.458
PheSer: 1.997 ± 0.385
PheThr: 1.757 ± 0.485
PheVal: 1.757 ± 0.5
PheTrp: 0.559 ± 0.206
PheTyr: 1.597 ± 0.321
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.15 ± 0.646
GlyCys: 1.358 ± 0.393
GlyAsp: 4.952 ± 0.589
GlyGlu: 4.153 ± 0.429
GlyPhe: 2.636 ± 0.484
GlyGly: 4.393 ± 0.624
GlyHis: 1.358 ± 0.302
GlyIle: 4.073 ± 0.656
GlyLys: 4.393 ± 0.732
GlyLeu: 7.348 ± 0.681
GlyMet: 1.837 ± 0.399
GlyAsn: 3.594 ± 0.583
GlyPro: 1.518 ± 0.322
GlyGln: 3.435 ± 0.431
GlyArg: 4.633 ± 0.476
GlySer: 5.112 ± 0.602
GlyThr: 4.473 ± 0.709
GlyVal: 6.07 ± 0.735
GlyTrp: 0.719 ± 0.209
GlyTyr: 3.275 ± 0.612
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.677 ± 0.408
HisCys: 0.319 ± 0.144
HisAsp: 1.198 ± 0.249
HisGlu: 1.278 ± 0.331
HisPhe: 0.24 ± 0.106
HisGly: 2.077 ± 0.496
HisHis: 0.08 ± 0.074
HisIle: 0.879 ± 0.329
HisLys: 1.038 ± 0.326
HisLeu: 2.316 ± 0.485
HisMet: 0.479 ± 0.166
HisAsn: 0.879 ± 0.292
HisPro: 0.879 ± 0.323
HisGln: 0.319 ± 0.224
HisArg: 1.518 ± 0.348
HisSer: 0.799 ± 0.295
HisThr: 0.719 ± 0.242
HisVal: 0.639 ± 0.25
HisTrp: 0.319 ± 0.194
HisTyr: 0.559 ± 0.231
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.435 ± 0.505
IleCys: 0.24 ± 0.122
IleAsp: 2.875 ± 0.365
IleGlu: 2.236 ± 0.515
IlePhe: 0.639 ± 0.174
IleGly: 2.716 ± 0.431
IleHis: 0.799 ± 0.238
IleIle: 1.917 ± 0.351
IleLys: 2.955 ± 0.497
IleLeu: 4.473 ± 0.554
IleMet: 1.518 ± 0.28
IleAsn: 1.997 ± 0.392
IlePro: 1.997 ± 0.425
IleGln: 2.396 ± 0.456
IleArg: 2.796 ± 0.395
IleSer: 3.275 ± 0.421
IleThr: 2.236 ± 0.429
IleVal: 2.796 ± 0.524
IleTrp: 0.24 ± 0.163
IleTyr: 1.438 ± 0.354
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.99 ± 0.919
LysCys: 0.399 ± 0.159
LysAsp: 2.636 ± 0.427
LysGlu: 3.275 ± 0.57
LysPhe: 1.038 ± 0.354
LysGly: 2.955 ± 0.555
LysHis: 0.958 ± 0.267
LysIle: 1.278 ± 0.313
LysLys: 1.917 ± 0.459
LysLeu: 5.112 ± 0.726
LysMet: 1.118 ± 0.292
LysAsn: 1.358 ± 0.311
LysPro: 1.677 ± 0.387
LysGln: 3.674 ± 0.638
LysArg: 3.035 ± 0.447
LysSer: 3.115 ± 0.461
LysThr: 2.476 ± 0.404
LysVal: 3.834 ± 0.604
LysTrp: 0.879 ± 0.218
LysTyr: 1.917 ± 0.434
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.387 ± 0.991
LeuCys: 0.958 ± 0.325
LeuAsp: 6.949 ± 0.631
LeuGlu: 5.511 ± 0.625
LeuPhe: 2.875 ± 0.438
LeuGly: 6.949 ± 0.807
LeuHis: 1.438 ± 0.329
LeuIle: 4.313 ± 0.634
LeuLys: 2.955 ± 0.476
LeuLeu: 6.629 ± 0.74
LeuMet: 1.837 ± 0.379
LeuAsn: 3.834 ± 0.493
LeuPro: 3.035 ± 0.443
LeuGln: 4.233 ± 0.526
LeuArg: 5.831 ± 0.574
LeuSer: 4.792 ± 0.687
LeuThr: 5.431 ± 0.635
LeuVal: 5.99 ± 0.668
LeuTrp: 1.198 ± 0.311
LeuTyr: 3.195 ± 0.488
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.435 ± 0.609
MetCys: 0.24 ± 0.15
MetAsp: 2.077 ± 0.458
MetGlu: 1.278 ± 0.292
MetPhe: 0.719 ± 0.281
MetGly: 1.518 ± 0.271
MetHis: 0.958 ± 0.322
MetIle: 0.639 ± 0.214
MetLys: 1.198 ± 0.335
MetLeu: 3.355 ± 0.583
MetMet: 0.559 ± 0.246
MetAsn: 0.799 ± 0.268
MetPro: 1.038 ± 0.296
MetGln: 2.157 ± 0.462
MetArg: 2.236 ± 0.414
MetSer: 2.236 ± 0.446
MetThr: 0.879 ± 0.284
MetVal: 2.236 ± 0.416
MetTrp: 0.399 ± 0.154
MetTyr: 1.198 ± 0.31
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.594 ± 0.453
AsnCys: 0.319 ± 0.168
AsnAsp: 2.077 ± 0.376
AsnGlu: 1.438 ± 0.364
AsnPhe: 1.038 ± 0.304
AsnGly: 3.594 ± 0.507
AsnHis: 0.399 ± 0.136
AsnIle: 3.115 ± 0.522
AsnLys: 1.997 ± 0.295
AsnLeu: 2.875 ± 0.557
AsnMet: 1.518 ± 0.384
AsnAsn: 1.597 ± 0.37
AsnPro: 2.796 ± 0.406
AsnGln: 1.438 ± 0.361
AsnArg: 1.518 ± 0.384
AsnSer: 3.115 ± 0.505
AsnThr: 2.955 ± 0.491
AsnVal: 3.355 ± 0.38
AsnTrp: 0.639 ± 0.17
AsnTyr: 1.278 ± 0.329
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.233 ± 0.846
ProCys: 0.24 ± 0.124
ProAsp: 2.875 ± 0.517
ProGlu: 3.674 ± 0.556
ProPhe: 0.879 ± 0.182
ProGly: 3.115 ± 0.576
ProHis: 0.639 ± 0.25
ProIle: 1.597 ± 0.358
ProLys: 1.438 ± 0.354
ProLeu: 2.955 ± 0.529
ProMet: 1.038 ± 0.241
ProAsn: 1.597 ± 0.362
ProPro: 0.559 ± 0.2
ProGln: 1.438 ± 0.296
ProArg: 1.837 ± 0.33
ProSer: 2.157 ± 0.526
ProThr: 2.316 ± 0.35
ProVal: 3.035 ± 0.506
ProTrp: 0.639 ± 0.203
ProTyr: 1.597 ± 0.373
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.351 ± 0.722
GlnCys: 0.479 ± 0.192
GlnAsp: 2.716 ± 0.485
GlnGlu: 3.514 ± 0.573
GlnPhe: 1.518 ± 0.339
GlnGly: 2.955 ± 0.432
GlnHis: 1.198 ± 0.338
GlnIle: 0.958 ± 0.224
GlnLys: 2.636 ± 0.497
GlnLeu: 4.553 ± 0.634
GlnMet: 1.198 ± 0.217
GlnAsn: 1.997 ± 0.391
GlnPro: 1.677 ± 0.389
GlnGln: 3.115 ± 0.7
GlnArg: 2.636 ± 0.404
GlnSer: 3.594 ± 0.494
GlnThr: 1.757 ± 0.391
GlnVal: 3.275 ± 0.402
GlnTrp: 0.719 ± 0.226
GlnTyr: 1.997 ± 0.415
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.39 ± 1.036
ArgCys: 0.559 ± 0.246
ArgAsp: 3.195 ± 0.496
ArgGlu: 3.754 ± 0.528
ArgPhe: 2.396 ± 0.438
ArgGly: 3.914 ± 0.752
ArgHis: 1.038 ± 0.211
ArgIle: 3.115 ± 0.471
ArgLys: 3.035 ± 0.515
ArgLeu: 4.712 ± 0.534
ArgMet: 1.917 ± 0.407
ArgAsn: 2.716 ± 0.383
ArgPro: 1.837 ± 0.378
ArgGln: 2.396 ± 0.328
ArgArg: 4.313 ± 0.631
ArgSer: 2.636 ± 0.555
ArgThr: 3.275 ± 0.323
ArgVal: 4.393 ± 0.593
ArgTrp: 0.639 ± 0.235
ArgTyr: 2.157 ± 0.372
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.626 ± 0.833
SerCys: 0.559 ± 0.216
SerAsp: 4.153 ± 0.539
SerGlu: 2.716 ± 0.586
SerPhe: 1.837 ± 0.287
SerGly: 6.31 ± 0.734
SerHis: 0.479 ± 0.218
SerIle: 2.796 ± 0.645
SerLys: 4.073 ± 0.598
SerLeu: 4.393 ± 0.706
SerMet: 2.556 ± 0.334
SerAsn: 3.035 ± 0.665
SerPro: 2.716 ± 0.377
SerGln: 1.757 ± 0.361
SerArg: 2.875 ± 0.315
SerSer: 3.435 ± 0.599
SerThr: 4.393 ± 0.602
SerVal: 4.233 ± 0.521
SerTrp: 0.958 ± 0.253
SerTyr: 1.837 ± 0.41
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.671 ± 0.859
ThrCys: 0.479 ± 0.198
ThrAsp: 3.035 ± 0.471
ThrGlu: 2.716 ± 0.492
ThrPhe: 2.077 ± 0.513
ThrGly: 5.272 ± 0.719
ThrHis: 0.958 ± 0.295
ThrIle: 2.236 ± 0.398
ThrLys: 2.396 ± 0.383
ThrLeu: 5.112 ± 0.62
ThrMet: 1.358 ± 0.445
ThrAsn: 2.157 ± 0.485
ThrPro: 2.716 ± 0.29
ThrGln: 2.875 ± 0.489
ThrArg: 2.476 ± 0.486
ThrSer: 4.233 ± 0.682
ThrThr: 3.834 ± 0.537
ThrVal: 4.153 ± 0.618
ThrTrp: 0.958 ± 0.241
ThrTyr: 1.997 ± 0.439
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.55 ± 0.728
ValCys: 0.559 ± 0.218
ValAsp: 5.032 ± 0.701
ValGlu: 4.393 ± 0.68
ValPhe: 1.597 ± 0.308
ValGly: 6.47 ± 0.734
ValHis: 1.837 ± 0.486
ValIle: 2.636 ± 0.446
ValLys: 3.355 ± 0.537
ValLeu: 5.272 ± 0.858
ValMet: 2.157 ± 0.41
ValAsn: 2.396 ± 0.582
ValPro: 2.955 ± 0.503
ValGln: 4.233 ± 0.824
ValArg: 4.393 ± 0.545
ValSer: 5.272 ± 0.784
ValThr: 3.594 ± 0.706
ValVal: 5.591 ± 0.588
ValTrp: 0.639 ± 0.234
ValTyr: 2.716 ± 0.348
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.799 ± 0.206
TrpCys: 0.16 ± 0.098
TrpAsp: 0.559 ± 0.192
TrpGlu: 1.358 ± 0.346
TrpPhe: 0.879 ± 0.338
TrpGly: 0.879 ± 0.253
TrpHis: 0.559 ± 0.212
TrpIle: 0.319 ± 0.156
TrpLys: 0.719 ± 0.255
TrpLeu: 1.118 ± 0.246
TrpMet: 0.24 ± 0.192
TrpAsn: 0.799 ± 0.214
TrpPro: 0.24 ± 0.164
TrpGln: 0.559 ± 0.17
TrpArg: 0.799 ± 0.239
TrpSer: 0.719 ± 0.233
TrpThr: 0.958 ± 0.175
TrpVal: 1.198 ± 0.227
TrpTrp: 0.319 ± 0.179
TrpTyr: 0.719 ± 0.261
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.955 ± 0.551
TyrCys: 0.958 ± 0.271
TyrAsp: 2.875 ± 0.49
TyrGlu: 1.837 ± 0.483
TyrPhe: 1.198 ± 0.275
TyrGly: 2.796 ± 0.475
TyrHis: 0.399 ± 0.155
TyrIle: 2.157 ± 0.505
TyrLys: 2.157 ± 0.362
TyrLeu: 3.435 ± 0.419
TyrMet: 0.879 ± 0.28
TyrAsn: 1.518 ± 0.319
TyrPro: 1.278 ± 0.244
TyrGln: 1.917 ± 0.45
TyrArg: 2.077 ± 0.444
TyrSer: 2.955 ± 0.436
TyrThr: 2.955 ± 0.487
TyrVal: 2.157 ± 0.48
TyrTrp: 0.719 ± 0.238
TyrTyr: 1.038 ± 0.25
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (12521 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
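A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is a minimal sketch under stated assumptions, not the database's actual code: the helper names, the one-letter sequences, the fixed seed, and the use of the sample standard deviation as the error measure are all choices made for this example.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping pairs."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKAAL", "AAG", "GGG"]
print(pair_frequency(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.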

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski