Amino acid dipepetide frequency for Klebsiella phage 48ST307

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.553AlaAla: 10.553 ± 1.466
1.094AlaCys: 1.094 ± 0.231
7.583AlaAsp: 7.583 ± 0.771
5.785AlaGlu: 5.785 ± 0.652
2.345AlaPhe: 2.345 ± 0.486
7.817AlaGly: 7.817 ± 1.152
0.782AlaHis: 0.782 ± 0.212
5.785AlaIle: 5.785 ± 0.619
6.645AlaLys: 6.645 ± 0.68
9.615AlaLeu: 9.615 ± 0.801
2.502AlaMet: 2.502 ± 0.453
5.238AlaAsn: 5.238 ± 1.165
2.658AlaPro: 2.658 ± 0.397
4.221AlaGln: 4.221 ± 0.668
5.316AlaArg: 5.316 ± 0.985
6.41AlaSer: 6.41 ± 0.868
6.801AlaThr: 6.801 ± 0.932
6.254AlaVal: 6.254 ± 0.992
1.876AlaTrp: 1.876 ± 0.353
2.502AlaTyr: 2.502 ± 0.466
0.0AlaXaa: 0.0 ± 0.0
Cys
0.469CysAla: 0.469 ± 0.257
0.156CysCys: 0.156 ± 0.109
0.625CysAsp: 0.625 ± 0.195
0.469CysGlu: 0.469 ± 0.15
0.156CysPhe: 0.156 ± 0.091
0.86CysGly: 0.86 ± 0.272
0.0CysHis: 0.0 ± 0.0
0.782CysIle: 0.782 ± 0.324
0.313CysLys: 0.313 ± 0.185
0.625CysLeu: 0.625 ± 0.281
0.078CysMet: 0.078 ± 0.092
0.391CysAsn: 0.391 ± 0.209
0.235CysPro: 0.235 ± 0.108
0.547CysGln: 0.547 ± 0.214
0.625CysArg: 0.625 ± 0.204
0.469CysSer: 0.469 ± 0.235
0.391CysThr: 0.391 ± 0.17
0.782CysVal: 0.782 ± 0.243
0.469CysTrp: 0.469 ± 0.191
0.391CysTyr: 0.391 ± 0.199
0.0CysXaa: 0.0 ± 0.0
Asp
6.801AspAla: 6.801 ± 0.844
0.391AspCys: 0.391 ± 0.244
3.283AspAsp: 3.283 ± 0.74
3.909AspGlu: 3.909 ± 0.903
1.876AspPhe: 1.876 ± 0.479
5.629AspGly: 5.629 ± 0.76
0.625AspHis: 0.625 ± 0.236
3.049AspIle: 3.049 ± 0.481
3.127AspLys: 3.127 ± 0.628
6.957AspLeu: 6.957 ± 0.842
1.016AspMet: 1.016 ± 0.268
2.736AspAsn: 2.736 ± 0.391
2.502AspPro: 2.502 ± 0.545
1.642AspGln: 1.642 ± 0.303
3.127AspArg: 3.127 ± 0.488
3.361AspSer: 3.361 ± 0.563
3.049AspThr: 3.049 ± 0.491
3.44AspVal: 3.44 ± 0.448
0.547AspTrp: 0.547 ± 0.192
2.189AspTyr: 2.189 ± 0.575
0.0AspXaa: 0.0 ± 0.0
Glu
5.081GluAla: 5.081 ± 0.726
0.391GluCys: 0.391 ± 0.133
2.345GluAsp: 2.345 ± 0.575
2.892GluGlu: 2.892 ± 0.616
2.267GluPhe: 2.267 ± 0.419
3.049GluGly: 3.049 ± 0.622
1.094GluHis: 1.094 ± 0.351
3.361GluIle: 3.361 ± 0.509
3.518GluLys: 3.518 ± 0.677
6.332GluLeu: 6.332 ± 0.963
1.485GluMet: 1.485 ± 0.304
2.736GluAsn: 2.736 ± 0.596
1.798GluPro: 1.798 ± 0.387
3.283GluGln: 3.283 ± 0.664
3.752GluArg: 3.752 ± 0.781
3.518GluSer: 3.518 ± 0.501
3.831GluThr: 3.831 ± 0.361
3.361GluVal: 3.361 ± 0.511
0.313GluTrp: 0.313 ± 0.16
1.642GluTyr: 1.642 ± 0.319
0.0GluXaa: 0.0 ± 0.0
Phe
2.58PheAla: 2.58 ± 0.499
0.391PheCys: 0.391 ± 0.206
3.361PheAsp: 3.361 ± 0.509
2.111PheGlu: 2.111 ± 0.332
1.173PhePhe: 1.173 ± 0.408
2.189PheGly: 2.189 ± 0.542
0.469PheHis: 0.469 ± 0.193
1.407PheIle: 1.407 ± 0.313
1.954PheLys: 1.954 ± 0.553
1.485PheLeu: 1.485 ± 0.333
0.625PheMet: 0.625 ± 0.287
2.189PheAsn: 2.189 ± 0.284
0.547PhePro: 0.547 ± 0.237
1.094PheGln: 1.094 ± 0.286
1.329PheArg: 1.329 ± 0.347
1.798PheSer: 1.798 ± 0.31
2.345PheThr: 2.345 ± 0.423
2.189PheVal: 2.189 ± 0.431
0.391PheTrp: 0.391 ± 0.163
1.251PheTyr: 1.251 ± 0.361
0.0PheXaa: 0.0 ± 0.0
Gly
6.879GlyAla: 6.879 ± 0.876
0.625GlyCys: 0.625 ± 0.193
4.769GlyAsp: 4.769 ± 0.501
4.3GlyGlu: 4.3 ± 0.7
2.189GlyPhe: 2.189 ± 0.584
6.332GlyGly: 6.332 ± 0.881
1.173GlyHis: 1.173 ± 0.26
4.143GlyIle: 4.143 ± 0.606
4.065GlyLys: 4.065 ± 0.593
6.488GlyLeu: 6.488 ± 0.909
2.502GlyMet: 2.502 ± 0.464
5.238GlyAsn: 5.238 ± 1.123
1.016GlyPro: 1.016 ± 0.427
2.971GlyGln: 2.971 ± 0.503
3.909GlyArg: 3.909 ± 0.613
5.081GlySer: 5.081 ± 0.773
5.55GlyThr: 5.55 ± 0.697
4.925GlyVal: 4.925 ± 0.66
1.407GlyTrp: 1.407 ± 0.383
2.111GlyTyr: 2.111 ± 0.296
0.0GlyXaa: 0.0 ± 0.0
His
1.329HisAla: 1.329 ± 0.408
0.235HisCys: 0.235 ± 0.147
0.86HisAsp: 0.86 ± 0.277
1.173HisGlu: 1.173 ± 0.404
0.313HisPhe: 0.313 ± 0.163
1.016HisGly: 1.016 ± 0.284
0.469HisHis: 0.469 ± 0.216
1.094HisIle: 1.094 ± 0.265
0.391HisLys: 0.391 ± 0.216
0.86HisLeu: 0.86 ± 0.36
0.313HisMet: 0.313 ± 0.16
0.469HisAsn: 0.469 ± 0.149
0.86HisPro: 0.86 ± 0.276
0.547HisGln: 0.547 ± 0.243
0.86HisArg: 0.86 ± 0.28
1.094HisSer: 1.094 ± 0.325
0.391HisThr: 0.391 ± 0.172
0.782HisVal: 0.782 ± 0.257
0.156HisTrp: 0.156 ± 0.124
1.016HisTyr: 1.016 ± 0.386
0.0HisXaa: 0.0 ± 0.0
Ile
3.674IleAla: 3.674 ± 0.39
0.469IleCys: 0.469 ± 0.244
3.752IleAsp: 3.752 ± 0.592
4.143IleGlu: 4.143 ± 0.599
0.938IlePhe: 0.938 ± 0.361
2.971IleGly: 2.971 ± 0.306
0.938IleHis: 0.938 ± 0.337
3.127IleIle: 3.127 ± 1.041
2.971IleLys: 2.971 ± 0.501
3.518IleLeu: 3.518 ± 0.507
1.72IleMet: 1.72 ± 0.464
2.892IleAsn: 2.892 ± 0.339
2.502IlePro: 2.502 ± 0.444
1.954IleGln: 1.954 ± 0.488
2.658IleArg: 2.658 ± 0.376
5.472IleSer: 5.472 ± 0.708
6.019IleThr: 6.019 ± 0.81
2.423IleVal: 2.423 ± 0.441
0.391IleTrp: 0.391 ± 0.174
1.173IleTyr: 1.173 ± 0.354
0.0IleXaa: 0.0 ± 0.0
Lys
5.629LysAla: 5.629 ± 0.526
0.469LysCys: 0.469 ± 0.2
2.658LysAsp: 2.658 ± 0.422
2.423LysGlu: 2.423 ± 0.642
1.407LysPhe: 1.407 ± 0.426
3.674LysGly: 3.674 ± 0.611
0.625LysHis: 0.625 ± 0.2
2.892LysIle: 2.892 ± 0.527
2.814LysLys: 2.814 ± 0.632
4.221LysLeu: 4.221 ± 0.801
1.173LysMet: 1.173 ± 0.486
2.814LysAsn: 2.814 ± 0.621
2.189LysPro: 2.189 ± 0.482
2.658LysGln: 2.658 ± 0.36
2.502LysArg: 2.502 ± 0.678
3.127LysSer: 3.127 ± 0.475
4.3LysThr: 4.3 ± 0.554
3.205LysVal: 3.205 ± 0.518
0.86LysTrp: 0.86 ± 0.202
1.798LysTyr: 1.798 ± 0.503
0.0LysXaa: 0.0 ± 0.0
Leu
9.459LeuAla: 9.459 ± 1.094
0.547LeuCys: 0.547 ± 0.205
4.534LeuAsp: 4.534 ± 0.545
5.081LeuGlu: 5.081 ± 0.576
2.423LeuPhe: 2.423 ± 0.487
4.534LeuGly: 4.534 ± 0.627
1.72LeuHis: 1.72 ± 0.534
2.892LeuIle: 2.892 ± 0.408
4.534LeuLys: 4.534 ± 0.686
6.645LeuLeu: 6.645 ± 1.392
1.72LeuMet: 1.72 ± 0.469
4.378LeuAsn: 4.378 ± 0.702
3.596LeuPro: 3.596 ± 0.711
4.3LeuGln: 4.3 ± 0.583
5.707LeuArg: 5.707 ± 0.72
6.879LeuSer: 6.879 ± 0.856
7.974LeuThr: 7.974 ± 1.4
4.378LeuVal: 4.378 ± 0.532
0.469LeuTrp: 0.469 ± 0.17
2.736LeuTyr: 2.736 ± 0.488
0.0LeuXaa: 0.0 ± 0.0
Met
3.127MetAla: 3.127 ± 0.647
0.156MetCys: 0.156 ± 0.125
0.625MetAsp: 0.625 ± 0.26
1.251MetGlu: 1.251 ± 0.262
0.625MetPhe: 0.625 ± 0.298
1.485MetGly: 1.485 ± 0.331
0.235MetHis: 0.235 ± 0.165
1.251MetIle: 1.251 ± 0.349
1.876MetLys: 1.876 ± 0.419
1.954MetLeu: 1.954 ± 0.498
0.938MetMet: 0.938 ± 0.2
1.173MetAsn: 1.173 ± 0.25
1.016MetPro: 1.016 ± 0.297
0.782MetGln: 0.782 ± 0.283
1.642MetArg: 1.642 ± 0.371
1.485MetSer: 1.485 ± 0.368
1.954MetThr: 1.954 ± 0.807
1.563MetVal: 1.563 ± 0.44
0.078MetTrp: 0.078 ± 0.091
0.938MetTyr: 0.938 ± 0.268
0.0MetXaa: 0.0 ± 0.0
Asn
5.941AsnAla: 5.941 ± 0.917
0.156AsnCys: 0.156 ± 0.111
2.58AsnAsp: 2.58 ± 0.426
1.954AsnGlu: 1.954 ± 0.455
2.189AsnPhe: 2.189 ± 0.363
4.065AsnGly: 4.065 ± 0.62
0.547AsnHis: 0.547 ± 0.202
2.502AsnIle: 2.502 ± 0.272
2.267AsnLys: 2.267 ± 0.444
3.596AsnLeu: 3.596 ± 0.694
1.407AsnMet: 1.407 ± 0.404
2.892AsnAsn: 2.892 ± 0.541
1.642AsnPro: 1.642 ± 0.314
2.814AsnGln: 2.814 ± 0.516
2.892AsnArg: 2.892 ± 0.311
4.847AsnSer: 4.847 ± 0.996
3.596AsnThr: 3.596 ± 0.768
3.127AsnVal: 3.127 ± 0.402
0.469AsnTrp: 0.469 ± 0.164
0.938AsnTyr: 0.938 ± 0.265
0.0AsnXaa: 0.0 ± 0.0
Pro
3.909ProAla: 3.909 ± 0.598
0.547ProCys: 0.547 ± 0.314
2.189ProAsp: 2.189 ± 0.435
2.502ProGlu: 2.502 ± 0.533
1.016ProPhe: 1.016 ± 0.265
3.518ProGly: 3.518 ± 0.519
1.251ProHis: 1.251 ± 0.378
1.251ProIle: 1.251 ± 0.419
1.563ProLys: 1.563 ± 0.413
2.345ProLeu: 2.345 ± 0.34
0.625ProMet: 0.625 ± 0.282
1.642ProAsn: 1.642 ± 0.39
1.173ProPro: 1.173 ± 0.469
1.251ProGln: 1.251 ± 0.24
1.563ProArg: 1.563 ± 0.41
2.658ProSer: 2.658 ± 0.323
1.798ProThr: 1.798 ± 0.39
3.049ProVal: 3.049 ± 0.466
0.782ProTrp: 0.782 ± 0.227
1.094ProTyr: 1.094 ± 0.258
0.0ProXaa: 0.0 ± 0.0
Gln
5.785GlnAla: 5.785 ± 1.217
0.235GlnCys: 0.235 ± 0.135
2.58GlnAsp: 2.58 ± 0.453
2.658GlnGlu: 2.658 ± 0.377
1.407GlnPhe: 1.407 ± 0.392
3.44GlnGly: 3.44 ± 0.591
0.625GlnHis: 0.625 ± 0.3
2.111GlnIle: 2.111 ± 0.429
1.72GlnLys: 1.72 ± 0.363
4.534GlnLeu: 4.534 ± 0.679
1.485GlnMet: 1.485 ± 0.332
1.642GlnAsn: 1.642 ± 0.35
1.876GlnPro: 1.876 ± 0.351
3.518GlnGln: 3.518 ± 0.502
2.658GlnArg: 2.658 ± 0.581
2.971GlnSer: 2.971 ± 0.643
2.814GlnThr: 2.814 ± 0.598
2.814GlnVal: 2.814 ± 0.558
1.485GlnTrp: 1.485 ± 0.289
1.876GlnTyr: 1.876 ± 0.501
0.0GlnXaa: 0.0 ± 0.0
Arg
4.378ArgAla: 4.378 ± 0.477
0.235ArgCys: 0.235 ± 0.159
3.127ArgAsp: 3.127 ± 0.664
2.892ArgGlu: 2.892 ± 0.657
1.954ArgPhe: 1.954 ± 0.404
4.3ArgGly: 4.3 ± 0.619
0.86ArgHis: 0.86 ± 0.347
4.3ArgIle: 4.3 ± 0.675
3.127ArgLys: 3.127 ± 0.611
4.378ArgLeu: 4.378 ± 0.578
1.251ArgMet: 1.251 ± 0.367
2.658ArgAsn: 2.658 ± 0.308
2.267ArgPro: 2.267 ± 0.36
2.971ArgGln: 2.971 ± 0.73
3.596ArgArg: 3.596 ± 0.968
3.127ArgSer: 3.127 ± 0.453
2.814ArgThr: 2.814 ± 0.452
3.909ArgVal: 3.909 ± 0.475
0.782ArgTrp: 0.782 ± 0.317
2.033ArgTyr: 2.033 ± 0.347
0.0ArgXaa: 0.0 ± 0.0
Ser
7.661SerAla: 7.661 ± 0.81
0.782SerCys: 0.782 ± 0.276
4.456SerAsp: 4.456 ± 0.747
3.049SerGlu: 3.049 ± 0.593
2.58SerPhe: 2.58 ± 0.377
7.036SerGly: 7.036 ± 0.856
0.782SerHis: 0.782 ± 0.197
4.534SerIle: 4.534 ± 0.466
2.423SerLys: 2.423 ± 0.467
7.583SerLeu: 7.583 ± 0.96
1.563SerMet: 1.563 ± 0.394
3.127SerAsn: 3.127 ± 0.764
2.736SerPro: 2.736 ± 0.387
4.534SerGln: 4.534 ± 1.008
3.44SerArg: 3.44 ± 0.576
5.785SerSer: 5.785 ± 0.86
3.987SerThr: 3.987 ± 0.827
4.769SerVal: 4.769 ± 0.42
1.485SerTrp: 1.485 ± 0.475
1.798SerTyr: 1.798 ± 0.316
0.0SerXaa: 0.0 ± 0.0
Thr
9.068ThrAla: 9.068 ± 1.25
0.625ThrCys: 0.625 ± 0.233
4.3ThrAsp: 4.3 ± 0.819
3.205ThrGlu: 3.205 ± 0.602
2.111ThrPhe: 2.111 ± 0.514
6.41ThrGly: 6.41 ± 0.853
0.469ThrHis: 0.469 ± 0.191
2.736ThrIle: 2.736 ± 0.373
2.423ThrLys: 2.423 ± 0.421
5.55ThrLeu: 5.55 ± 0.425
1.563ThrMet: 1.563 ± 0.34
3.205ThrAsn: 3.205 ± 0.848
2.267ThrPro: 2.267 ± 0.477
3.831ThrGln: 3.831 ± 0.882
3.44ThrArg: 3.44 ± 0.505
7.036ThrSer: 7.036 ± 1.268
4.847ThrThr: 4.847 ± 1.113
4.456ThrVal: 4.456 ± 0.922
0.782ThrTrp: 0.782 ± 0.221
1.485ThrTyr: 1.485 ± 0.351
0.0ThrXaa: 0.0 ± 0.0
Val
5.472ValAla: 5.472 ± 0.573
0.86ValCys: 0.86 ± 0.308
4.143ValAsp: 4.143 ± 0.361
3.361ValGlu: 3.361 ± 0.397
2.111ValPhe: 2.111 ± 0.317
4.3ValGly: 4.3 ± 0.764
0.782ValHis: 0.782 ± 0.313
4.065ValIle: 4.065 ± 0.455
3.752ValLys: 3.752 ± 0.349
3.987ValLeu: 3.987 ± 0.654
1.407ValMet: 1.407 ± 0.34
3.049ValAsn: 3.049 ± 0.796
2.658ValPro: 2.658 ± 0.488
2.423ValGln: 2.423 ± 0.482
2.814ValArg: 2.814 ± 0.462
5.316ValSer: 5.316 ± 1.096
4.378ValThr: 4.378 ± 1.019
4.3ValVal: 4.3 ± 0.616
1.173ValTrp: 1.173 ± 0.213
1.72ValTyr: 1.72 ± 0.343
0.0ValXaa: 0.0 ± 0.0
Trp
0.938TrpAla: 0.938 ± 0.248
0.156TrpCys: 0.156 ± 0.13
0.235TrpAsp: 0.235 ± 0.142
0.938TrpGlu: 0.938 ± 0.243
0.391TrpPhe: 0.391 ± 0.186
1.251TrpGly: 1.251 ± 0.4
0.313TrpHis: 0.313 ± 0.126
0.391TrpIle: 0.391 ± 0.167
0.625TrpLys: 0.625 ± 0.22
1.642TrpLeu: 1.642 ± 0.618
0.156TrpMet: 0.156 ± 0.121
0.704TrpAsn: 0.704 ± 0.192
0.547TrpPro: 0.547 ± 0.173
1.173TrpGln: 1.173 ± 0.259
1.016TrpArg: 1.016 ± 0.294
1.094TrpSer: 1.094 ± 0.348
1.485TrpThr: 1.485 ± 0.647
0.86TrpVal: 0.86 ± 0.392
0.235TrpTrp: 0.235 ± 0.127
0.235TrpTyr: 0.235 ± 0.126
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.049TyrAla: 3.049 ± 0.547
0.313TyrCys: 0.313 ± 0.19
1.251TyrAsp: 1.251 ± 0.266
1.954TyrGlu: 1.954 ± 0.326
1.407TyrPhe: 1.407 ± 0.414
1.876TyrGly: 1.876 ± 0.448
0.469TyrHis: 0.469 ± 0.262
2.033TyrIle: 2.033 ± 0.437
1.329TyrLys: 1.329 ± 0.391
2.189TyrLeu: 2.189 ± 0.42
0.547TyrMet: 0.547 ± 0.226
1.251TyrAsn: 1.251 ± 0.326
1.642TyrPro: 1.642 ± 0.416
1.72TyrGln: 1.72 ± 0.454
2.111TyrArg: 2.111 ± 0.372
2.58TyrSer: 2.58 ± 0.363
1.485TyrThr: 1.485 ± 0.316
1.485TyrVal: 1.485 ± 0.358
0.235TyrTrp: 0.235 ± 0.134
0.86TyrTyr: 0.86 ± 0.262
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (12793 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski