Amino acid dipepetide frequency for Klebsiella phage F19

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.323AlaAla: 16.323 ± 1.473
0.529AlaCys: 0.529 ± 0.242
6.574AlaAsp: 6.574 ± 0.769
5.668AlaGlu: 5.668 ± 0.603
3.476AlaPhe: 3.476 ± 0.455
8.69AlaGly: 8.69 ± 1.039
1.436AlaHis: 1.436 ± 0.408
4.081AlaIle: 4.081 ± 0.612
5.063AlaLys: 5.063 ± 0.78
9.597AlaLeu: 9.597 ± 0.838
3.098AlaMet: 3.098 ± 0.35
2.872AlaAsn: 2.872 ± 0.52
4.307AlaPro: 4.307 ± 0.858
4.912AlaGln: 4.912 ± 0.812
5.97AlaArg: 5.97 ± 0.699
5.894AlaSer: 5.894 ± 0.723
5.214AlaThr: 5.214 ± 0.602
6.952AlaVal: 6.952 ± 0.884
1.209AlaTrp: 1.209 ± 0.372
4.383AlaTyr: 4.383 ± 0.533
0.0AlaXaa: 0.0 ± 0.0
Cys
0.982CysAla: 0.982 ± 0.275
0.378CysCys: 0.378 ± 0.233
0.453CysAsp: 0.453 ± 0.15
0.378CysGlu: 0.378 ± 0.139
0.302CysPhe: 0.302 ± 0.17
0.756CysGly: 0.756 ± 0.27
0.302CysHis: 0.302 ± 0.147
0.302CysIle: 0.302 ± 0.136
0.378CysLys: 0.378 ± 0.185
0.831CysLeu: 0.831 ± 0.264
0.756CysMet: 0.756 ± 0.224
0.605CysAsn: 0.605 ± 0.258
0.529CysPro: 0.529 ± 0.223
0.302CysGln: 0.302 ± 0.151
0.68CysArg: 0.68 ± 0.227
1.058CysSer: 1.058 ± 0.37
0.831CysThr: 0.831 ± 0.246
0.605CysVal: 0.605 ± 0.213
0.151CysTrp: 0.151 ± 0.108
0.605CysTyr: 0.605 ± 0.207
0.0CysXaa: 0.0 ± 0.0
Asp
7.632AspAla: 7.632 ± 0.894
1.134AspCys: 1.134 ± 0.295
3.325AspAsp: 3.325 ± 0.484
3.249AspGlu: 3.249 ± 0.601
2.267AspPhe: 2.267 ± 0.348
4.836AspGly: 4.836 ± 0.63
0.68AspHis: 0.68 ± 0.223
3.325AspIle: 3.325 ± 0.469
3.023AspLys: 3.023 ± 0.571
4.836AspLeu: 4.836 ± 0.569
2.947AspMet: 2.947 ± 0.488
2.796AspAsn: 2.796 ± 0.499
3.023AspPro: 3.023 ± 0.409
1.511AspGln: 1.511 ± 0.278
2.494AspArg: 2.494 ± 0.495
5.063AspSer: 5.063 ± 0.503
3.476AspThr: 3.476 ± 0.573
3.627AspVal: 3.627 ± 0.445
0.982AspTrp: 0.982 ± 0.199
1.965AspTyr: 1.965 ± 0.398
0.0AspXaa: 0.0 ± 0.0
Glu
5.592GluAla: 5.592 ± 0.717
0.453GluCys: 0.453 ± 0.183
2.796GluAsp: 2.796 ± 0.322
4.005GluGlu: 4.005 ± 0.814
2.418GluPhe: 2.418 ± 0.394
4.005GluGly: 4.005 ± 0.502
2.343GluHis: 2.343 ± 0.446
2.343GluIle: 2.343 ± 0.497
2.116GluLys: 2.116 ± 0.406
5.592GluLeu: 5.592 ± 0.594
1.965GluMet: 1.965 ± 0.324
1.587GluAsn: 1.587 ± 0.351
1.814GluPro: 1.814 ± 0.382
3.703GluGln: 3.703 ± 0.546
3.552GluArg: 3.552 ± 0.573
2.418GluSer: 2.418 ± 0.437
2.872GluThr: 2.872 ± 0.412
4.988GluVal: 4.988 ± 0.7
0.831GluTrp: 0.831 ± 0.223
2.72GluTyr: 2.72 ± 0.469
0.0GluXaa: 0.0 ± 0.0
Phe
2.72PheAla: 2.72 ± 0.501
0.378PheCys: 0.378 ± 0.213
2.267PheAsp: 2.267 ± 0.383
2.267PheGlu: 2.267 ± 0.389
1.285PhePhe: 1.285 ± 0.283
2.343PheGly: 2.343 ± 0.408
0.453PheHis: 0.453 ± 0.183
1.058PheIle: 1.058 ± 0.275
1.738PheLys: 1.738 ± 0.389
2.116PheLeu: 2.116 ± 0.367
0.529PheMet: 0.529 ± 0.221
1.738PheAsn: 1.738 ± 0.424
1.511PhePro: 1.511 ± 0.299
1.209PheGln: 1.209 ± 0.274
1.663PheArg: 1.663 ± 0.368
1.889PheSer: 1.889 ± 0.464
2.116PheThr: 2.116 ± 0.437
2.343PheVal: 2.343 ± 0.569
0.529PheTrp: 0.529 ± 0.187
1.436PheTyr: 1.436 ± 0.243
0.0PheXaa: 0.0 ± 0.0
Gly
6.121GlyAla: 6.121 ± 0.72
1.209GlyCys: 1.209 ± 0.367
4.232GlyAsp: 4.232 ± 0.542
3.476GlyGlu: 3.476 ± 0.404
2.72GlyPhe: 2.72 ± 0.436
4.912GlyGly: 4.912 ± 0.627
1.436GlyHis: 1.436 ± 0.347
4.61GlyIle: 4.61 ± 0.489
4.459GlyLys: 4.459 ± 0.617
6.877GlyLeu: 6.877 ± 0.695
1.889GlyMet: 1.889 ± 0.474
3.401GlyAsn: 3.401 ± 0.568
1.889GlyPro: 1.889 ± 0.249
3.098GlyGln: 3.098 ± 0.43
5.214GlyArg: 5.214 ± 0.534
4.912GlySer: 4.912 ± 0.642
5.441GlyThr: 5.441 ± 0.721
6.197GlyVal: 6.197 ± 0.744
0.831GlyTrp: 0.831 ± 0.242
3.098GlyTyr: 3.098 ± 0.598
0.0GlyXaa: 0.0 ± 0.0
His
1.511HisAla: 1.511 ± 0.405
0.302HisCys: 0.302 ± 0.133
1.285HisAsp: 1.285 ± 0.359
1.36HisGlu: 1.36 ± 0.321
0.378HisPhe: 0.378 ± 0.143
1.814HisGly: 1.814 ± 0.537
0.378HisHis: 0.378 ± 0.221
0.907HisIle: 0.907 ± 0.25
0.982HisLys: 0.982 ± 0.233
2.191HisLeu: 2.191 ± 0.438
0.453HisMet: 0.453 ± 0.185
0.907HisAsn: 0.907 ± 0.25
0.831HisPro: 0.831 ± 0.316
0.453HisGln: 0.453 ± 0.215
1.436HisArg: 1.436 ± 0.291
1.209HisSer: 1.209 ± 0.332
0.831HisThr: 0.831 ± 0.234
0.831HisVal: 0.831 ± 0.224
0.302HisTrp: 0.302 ± 0.159
0.831HisTyr: 0.831 ± 0.258
0.0HisXaa: 0.0 ± 0.0
Ile
3.401IleAla: 3.401 ± 0.461
0.529IleCys: 0.529 ± 0.2
2.872IleAsp: 2.872 ± 0.349
3.174IleGlu: 3.174 ± 0.538
0.756IlePhe: 0.756 ± 0.215
2.947IleGly: 2.947 ± 0.549
0.756IleHis: 0.756 ± 0.225
1.738IleIle: 1.738 ± 0.359
3.249IleLys: 3.249 ± 0.615
4.534IleLeu: 4.534 ± 0.508
1.285IleMet: 1.285 ± 0.298
1.965IleAsn: 1.965 ± 0.399
2.569IlePro: 2.569 ± 0.47
2.569IleGln: 2.569 ± 0.575
2.872IleArg: 2.872 ± 0.442
3.098IleSer: 3.098 ± 0.425
2.645IleThr: 2.645 ± 0.469
2.72IleVal: 2.72 ± 0.462
0.151IleTrp: 0.151 ± 0.099
1.209IleTyr: 1.209 ± 0.278
0.0IleXaa: 0.0 ± 0.0
Lys
6.197LysAla: 6.197 ± 0.871
0.302LysCys: 0.302 ± 0.142
3.325LysAsp: 3.325 ± 0.553
2.947LysGlu: 2.947 ± 0.555
1.209LysPhe: 1.209 ± 0.319
2.947LysGly: 2.947 ± 0.586
0.982LysHis: 0.982 ± 0.292
1.285LysIle: 1.285 ± 0.271
2.04LysLys: 2.04 ± 0.454
4.534LysLeu: 4.534 ± 0.606
1.209LysMet: 1.209 ± 0.271
1.663LysAsn: 1.663 ± 0.309
1.436LysPro: 1.436 ± 0.331
3.401LysGln: 3.401 ± 0.619
3.325LysArg: 3.325 ± 0.541
2.343LysSer: 2.343 ± 0.317
2.645LysThr: 2.645 ± 0.424
3.476LysVal: 3.476 ± 0.705
0.982LysTrp: 0.982 ± 0.233
1.587LysTyr: 1.587 ± 0.364
0.0LysXaa: 0.0 ± 0.0
Leu
8.01LeuAla: 8.01 ± 0.713
1.285LeuCys: 1.285 ± 0.316
6.877LeuAsp: 6.877 ± 0.783
5.063LeuGlu: 5.063 ± 0.655
2.72LeuPhe: 2.72 ± 0.388
6.952LeuGly: 6.952 ± 0.798
1.511LeuHis: 1.511 ± 0.32
4.232LeuIle: 4.232 ± 0.59
3.401LeuLys: 3.401 ± 0.804
6.65LeuLeu: 6.65 ± 0.555
2.116LeuMet: 2.116 ± 0.39
3.174LeuAsn: 3.174 ± 0.453
3.401LeuPro: 3.401 ± 0.463
4.383LeuGln: 4.383 ± 0.527
6.877LeuArg: 6.877 ± 0.717
5.139LeuSer: 5.139 ± 0.717
4.836LeuThr: 4.836 ± 0.598
5.97LeuVal: 5.97 ± 0.747
1.134LeuTrp: 1.134 ± 0.343
3.703LeuTyr: 3.703 ± 0.46
0.0LeuXaa: 0.0 ± 0.0
Met
3.401MetAla: 3.401 ± 0.556
0.227MetCys: 0.227 ± 0.137
1.889MetAsp: 1.889 ± 0.467
1.058MetGlu: 1.058 ± 0.251
0.831MetPhe: 0.831 ± 0.27
1.436MetGly: 1.436 ± 0.229
0.831MetHis: 0.831 ± 0.266
0.756MetIle: 0.756 ± 0.259
0.831MetLys: 0.831 ± 0.295
3.401MetLeu: 3.401 ± 0.45
0.756MetMet: 0.756 ± 0.318
0.68MetAsn: 0.68 ± 0.271
1.209MetPro: 1.209 ± 0.228
2.04MetGln: 2.04 ± 0.416
2.191MetArg: 2.191 ± 0.429
2.569MetSer: 2.569 ± 0.595
0.831MetThr: 0.831 ± 0.297
2.418MetVal: 2.418 ± 0.429
0.378MetTrp: 0.378 ± 0.131
0.982MetTyr: 0.982 ± 0.256
0.0MetXaa: 0.0 ± 0.0
Asn
3.401AsnAla: 3.401 ± 0.454
0.151AsnCys: 0.151 ± 0.097
2.191AsnAsp: 2.191 ± 0.401
1.134AsnGlu: 1.134 ± 0.25
0.831AsnPhe: 0.831 ± 0.23
3.627AsnGly: 3.627 ± 0.598
0.227AsnHis: 0.227 ± 0.132
2.872AsnIle: 2.872 ± 0.494
1.889AsnLys: 1.889 ± 0.403
3.249AsnLeu: 3.249 ± 0.627
1.058AsnMet: 1.058 ± 0.265
1.285AsnAsn: 1.285 ± 0.322
2.947AsnPro: 2.947 ± 0.404
1.209AsnGln: 1.209 ± 0.353
1.965AsnArg: 1.965 ± 0.478
3.174AsnSer: 3.174 ± 0.48
2.569AsnThr: 2.569 ± 0.416
3.401AsnVal: 3.401 ± 0.4
0.605AsnTrp: 0.605 ± 0.188
1.36AsnTyr: 1.36 ± 0.335
0.0AsnXaa: 0.0 ± 0.0
Pro
4.836ProAla: 4.836 ± 0.799
0.151ProCys: 0.151 ± 0.103
2.72ProAsp: 2.72 ± 0.496
3.249ProGlu: 3.249 ± 0.433
1.058ProPhe: 1.058 ± 0.254
2.947ProGly: 2.947 ± 0.491
0.529ProHis: 0.529 ± 0.183
1.965ProIle: 1.965 ± 0.336
1.663ProLys: 1.663 ± 0.385
2.947ProLeu: 2.947 ± 0.431
1.134ProMet: 1.134 ± 0.221
1.663ProAsn: 1.663 ± 0.363
0.605ProPro: 0.605 ± 0.198
1.587ProGln: 1.587 ± 0.304
1.587ProArg: 1.587 ± 0.352
2.418ProSer: 2.418 ± 0.47
2.418ProThr: 2.418 ± 0.383
3.023ProVal: 3.023 ± 0.334
0.756ProTrp: 0.756 ± 0.233
1.511ProTyr: 1.511 ± 0.343
0.0ProXaa: 0.0 ± 0.0
Gln
5.139GlnAla: 5.139 ± 0.704
0.529GlnCys: 0.529 ± 0.209
2.796GlnAsp: 2.796 ± 0.484
3.778GlnGlu: 3.778 ± 0.56
1.511GlnPhe: 1.511 ± 0.336
2.947GlnGly: 2.947 ± 0.399
1.436GlnHis: 1.436 ± 0.359
1.285GlnIle: 1.285 ± 0.286
2.116GlnLys: 2.116 ± 0.538
4.61GlnLeu: 4.61 ± 0.499
0.907GlnMet: 0.907 ± 0.196
2.191GlnAsn: 2.191 ± 0.384
1.36GlnPro: 1.36 ± 0.424
2.872GlnGln: 2.872 ± 0.798
2.418GlnArg: 2.418 ± 0.374
2.872GlnSer: 2.872 ± 0.625
1.965GlnThr: 1.965 ± 0.4
2.796GlnVal: 2.796 ± 0.443
0.68GlnTrp: 0.68 ± 0.231
2.116GlnTyr: 2.116 ± 0.404
0.0GlnXaa: 0.0 ± 0.0
Arg
6.423ArgAla: 6.423 ± 0.795
0.529ArgCys: 0.529 ± 0.21
3.098ArgAsp: 3.098 ± 0.414
3.778ArgGlu: 3.778 ± 0.468
2.04ArgPhe: 2.04 ± 0.356
4.534ArgGly: 4.534 ± 0.627
0.982ArgHis: 0.982 ± 0.237
3.401ArgIle: 3.401 ± 0.595
3.174ArgLys: 3.174 ± 0.615
5.214ArgLeu: 5.214 ± 0.513
1.889ArgMet: 1.889 ± 0.37
2.569ArgAsn: 2.569 ± 0.468
1.511ArgPro: 1.511 ± 0.349
2.645ArgGln: 2.645 ± 0.447
4.156ArgArg: 4.156 ± 0.692
2.645ArgSer: 2.645 ± 0.564
3.778ArgThr: 3.778 ± 0.524
3.778ArgVal: 3.778 ± 0.526
1.209ArgTrp: 1.209 ± 0.28
1.889ArgTyr: 1.889 ± 0.323
0.0ArgXaa: 0.0 ± 0.0
Ser
8.161SerAla: 8.161 ± 0.719
0.982SerCys: 0.982 ± 0.273
3.778SerAsp: 3.778 ± 0.466
3.476SerGlu: 3.476 ± 0.524
1.814SerPhe: 1.814 ± 0.232
5.743SerGly: 5.743 ± 0.744
0.831SerHis: 0.831 ± 0.29
2.569SerIle: 2.569 ± 0.596
3.703SerLys: 3.703 ± 0.594
4.988SerLeu: 4.988 ± 0.614
2.569SerMet: 2.569 ± 0.325
3.023SerAsn: 3.023 ± 0.589
2.191SerPro: 2.191 ± 0.364
1.814SerGln: 1.814 ± 0.361
2.796SerArg: 2.796 ± 0.437
4.005SerSer: 4.005 ± 0.762
3.778SerThr: 3.778 ± 0.633
4.685SerVal: 4.685 ± 0.454
0.982SerTrp: 0.982 ± 0.32
1.738SerTyr: 1.738 ± 0.461
0.0SerXaa: 0.0 ± 0.0
Thr
5.29ThrAla: 5.29 ± 0.873
0.68ThrCys: 0.68 ± 0.3
3.552ThrAsp: 3.552 ± 0.379
2.796ThrGlu: 2.796 ± 0.534
2.116ThrPhe: 2.116 ± 0.338
4.988ThrGly: 4.988 ± 0.573
1.209ThrHis: 1.209 ± 0.337
2.494ThrIle: 2.494 ± 0.453
2.645ThrLys: 2.645 ± 0.47
4.307ThrLeu: 4.307 ± 0.659
1.285ThrMet: 1.285 ± 0.385
2.343ThrAsn: 2.343 ± 0.457
2.72ThrPro: 2.72 ± 0.39
2.418ThrGln: 2.418 ± 0.463
2.72ThrArg: 2.72 ± 0.539
4.383ThrSer: 4.383 ± 0.553
2.267ThrThr: 2.267 ± 0.439
4.459ThrVal: 4.459 ± 0.731
0.907ThrTrp: 0.907 ± 0.196
2.116ThrTyr: 2.116 ± 0.455
0.0ThrXaa: 0.0 ± 0.0
Val
6.952ValAla: 6.952 ± 0.897
0.529ValCys: 0.529 ± 0.216
5.139ValAsp: 5.139 ± 0.725
4.156ValGlu: 4.156 ± 0.669
1.814ValPhe: 1.814 ± 0.412
6.348ValGly: 6.348 ± 0.682
1.814ValHis: 1.814 ± 0.338
2.872ValIle: 2.872 ± 0.451
2.72ValLys: 2.72 ± 0.578
5.97ValLeu: 5.97 ± 0.922
1.738ValMet: 1.738 ± 0.328
2.796ValAsn: 2.796 ± 0.602
3.098ValPro: 3.098 ± 0.566
3.703ValGln: 3.703 ± 0.703
3.778ValArg: 3.778 ± 0.463
4.912ValSer: 4.912 ± 0.683
3.778ValThr: 3.778 ± 0.66
5.894ValVal: 5.894 ± 0.597
0.756ValTrp: 0.756 ± 0.218
2.569ValTyr: 2.569 ± 0.44
0.0ValXaa: 0.0 ± 0.0
Trp
1.511TrpAla: 1.511 ± 0.309
0.227TrpCys: 0.227 ± 0.123
0.68TrpAsp: 0.68 ± 0.2
0.982TrpGlu: 0.982 ± 0.284
0.68TrpPhe: 0.68 ± 0.262
0.529TrpGly: 0.529 ± 0.235
0.378TrpHis: 0.378 ± 0.147
0.529TrpIle: 0.529 ± 0.198
0.529TrpLys: 0.529 ± 0.203
1.209TrpLeu: 1.209 ± 0.264
0.151TrpMet: 0.151 ± 0.151
0.756TrpAsn: 0.756 ± 0.216
0.605TrpPro: 0.605 ± 0.217
0.378TrpGln: 0.378 ± 0.154
1.209TrpArg: 1.209 ± 0.295
0.831TrpSer: 0.831 ± 0.266
0.756TrpThr: 0.756 ± 0.182
1.134TrpVal: 1.134 ± 0.315
0.453TrpTrp: 0.453 ± 0.198
0.831TrpTyr: 0.831 ± 0.29
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.645TyrAla: 2.645 ± 0.419
0.68TyrCys: 0.68 ± 0.248
2.343TyrAsp: 2.343 ± 0.372
2.116TyrGlu: 2.116 ± 0.52
1.36TyrPhe: 1.36 ± 0.312
2.72TyrGly: 2.72 ± 0.495
0.68TyrHis: 0.68 ± 0.23
2.418TyrIle: 2.418 ± 0.44
2.116TyrLys: 2.116 ± 0.387
3.778TyrLeu: 3.778 ± 0.505
0.907TyrMet: 0.907 ± 0.264
1.134TyrAsn: 1.134 ± 0.249
1.209TyrPro: 1.209 ± 0.262
2.116TyrGln: 2.116 ± 0.42
2.418TyrArg: 2.418 ± 0.566
2.796TyrSer: 2.796 ± 0.381
2.645TyrThr: 2.645 ± 0.443
2.04TyrVal: 2.04 ± 0.434
0.529TyrTrp: 0.529 ± 0.165
1.285TyrTyr: 1.285 ± 0.311
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (13234 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski