Amino acid dipepetide frequency for Klebsiella phage KP34

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.761AlaAla: 15.761 ± 1.275
1.017AlaCys: 1.017 ± 0.298
6.028AlaAsp: 6.028 ± 0.629
5.375AlaGlu: 5.375 ± 0.647
3.414AlaPhe: 3.414 ± 0.458
7.772AlaGly: 7.772 ± 0.986
1.453AlaHis: 1.453 ± 0.454
4.648AlaIle: 4.648 ± 0.58
5.157AlaLys: 5.157 ± 1.037
9.079AlaLeu: 9.079 ± 0.866
2.542AlaMet: 2.542 ± 0.38
2.978AlaAsn: 2.978 ± 0.387
4.285AlaPro: 4.285 ± 0.877
5.302AlaGln: 5.302 ± 0.907
5.738AlaArg: 5.738 ± 0.593
6.028AlaSer: 6.028 ± 0.909
5.593AlaThr: 5.593 ± 0.861
6.61AlaVal: 6.61 ± 0.792
1.235AlaTrp: 1.235 ± 0.341
3.995AlaTyr: 3.995 ± 0.56
0.0AlaXaa: 0.0 ± 0.0
Cys
1.017CysAla: 1.017 ± 0.318
0.363CysCys: 0.363 ± 0.222
0.508CysAsp: 0.508 ± 0.169
0.291CysGlu: 0.291 ± 0.139
0.291CysPhe: 0.291 ± 0.147
0.799CysGly: 0.799 ± 0.232
0.363CysHis: 0.363 ± 0.159
0.436CysIle: 0.436 ± 0.187
0.508CysLys: 0.508 ± 0.253
0.872CysLeu: 0.872 ± 0.228
0.654CysMet: 0.654 ± 0.217
0.436CysAsn: 0.436 ± 0.204
0.508CysPro: 0.508 ± 0.239
0.436CysGln: 0.436 ± 0.202
0.944CysArg: 0.944 ± 0.239
0.654CysSer: 0.654 ± 0.19
0.944CysThr: 0.944 ± 0.334
1.089CysVal: 1.089 ± 0.273
0.291CysTrp: 0.291 ± 0.173
0.508CysTyr: 0.508 ± 0.184
0.0CysXaa: 0.0 ± 0.0
Asp
7.118AspAla: 7.118 ± 0.777
0.872AspCys: 0.872 ± 0.288
3.268AspAsp: 3.268 ± 0.406
3.268AspGlu: 3.268 ± 0.512
2.324AspPhe: 2.324 ± 0.342
4.14AspGly: 4.14 ± 0.617
0.581AspHis: 0.581 ± 0.219
3.051AspIle: 3.051 ± 0.458
2.252AspLys: 2.252 ± 0.373
5.157AspLeu: 5.157 ± 0.571
2.615AspMet: 2.615 ± 0.393
2.833AspAsn: 2.833 ± 0.447
2.469AspPro: 2.469 ± 0.461
1.598AspGln: 1.598 ± 0.305
2.542AspArg: 2.542 ± 0.578
5.811AspSer: 5.811 ± 0.659
4.067AspThr: 4.067 ± 0.585
5.012AspVal: 5.012 ± 0.608
1.089AspTrp: 1.089 ± 0.2
2.542AspTyr: 2.542 ± 0.362
0.0AspXaa: 0.0 ± 0.0
Glu
5.811GluAla: 5.811 ± 0.646
0.508GluCys: 0.508 ± 0.151
2.833GluAsp: 2.833 ± 0.392
4.14GluGlu: 4.14 ± 0.786
2.397GluPhe: 2.397 ± 0.306
3.341GluGly: 3.341 ± 0.432
2.469GluHis: 2.469 ± 0.395
2.615GluIle: 2.615 ± 0.469
2.034GluLys: 2.034 ± 0.394
5.084GluLeu: 5.084 ± 0.536
2.252GluMet: 2.252 ± 0.385
1.816GluAsn: 1.816 ± 0.367
1.598GluPro: 1.598 ± 0.35
3.704GluGln: 3.704 ± 0.659
3.995GluArg: 3.995 ± 0.577
2.687GluSer: 2.687 ± 0.43
2.76GluThr: 2.76 ± 0.428
4.648GluVal: 4.648 ± 0.632
1.017GluTrp: 1.017 ± 0.274
2.469GluTyr: 2.469 ± 0.442
0.0GluXaa: 0.0 ± 0.0
Phe
2.615PheAla: 2.615 ± 0.402
0.508PheCys: 0.508 ± 0.273
2.179PheAsp: 2.179 ± 0.364
2.252PheGlu: 2.252 ± 0.484
1.162PhePhe: 1.162 ± 0.29
2.034PheGly: 2.034 ± 0.31
0.436PheHis: 0.436 ± 0.164
1.38PheIle: 1.38 ± 0.268
1.743PheLys: 1.743 ± 0.381
2.106PheLeu: 2.106 ± 0.433
0.508PheMet: 0.508 ± 0.216
1.525PheAsn: 1.525 ± 0.344
1.235PhePro: 1.235 ± 0.242
1.38PheGln: 1.38 ± 0.229
1.743PheArg: 1.743 ± 0.355
1.671PheSer: 1.671 ± 0.318
2.324PheThr: 2.324 ± 0.471
1.743PheVal: 1.743 ± 0.43
0.581PheTrp: 0.581 ± 0.185
1.307PheTyr: 1.307 ± 0.283
0.0PheXaa: 0.0 ± 0.0
Gly
5.738GlyAla: 5.738 ± 0.609
1.453GlyCys: 1.453 ± 0.356
4.939GlyAsp: 4.939 ± 0.494
3.85GlyGlu: 3.85 ± 0.435
2.542GlyPhe: 2.542 ± 0.496
4.14GlyGly: 4.14 ± 0.568
1.162GlyHis: 1.162 ± 0.283
4.431GlyIle: 4.431 ± 0.63
4.213GlyLys: 4.213 ± 0.57
6.319GlyLeu: 6.319 ± 0.67
1.816GlyMet: 1.816 ± 0.409
3.341GlyAsn: 3.341 ± 0.565
1.453GlyPro: 1.453 ± 0.279
2.76GlyGln: 2.76 ± 0.376
5.012GlyArg: 5.012 ± 0.453
5.665GlySer: 5.665 ± 0.723
4.648GlyThr: 4.648 ± 0.724
5.811GlyVal: 5.811 ± 0.699
0.654GlyTrp: 0.654 ± 0.211
3.051GlyTyr: 3.051 ± 0.544
0.0GlyXaa: 0.0 ± 0.0
His
1.598HisAla: 1.598 ± 0.474
0.363HisCys: 0.363 ± 0.169
1.089HisAsp: 1.089 ± 0.224
1.017HisGlu: 1.017 ± 0.295
0.436HisPhe: 0.436 ± 0.157
2.179HisGly: 2.179 ± 0.584
0.073HisHis: 0.073 ± 0.08
1.089HisIle: 1.089 ± 0.317
1.017HisLys: 1.017 ± 0.208
2.179HisLeu: 2.179 ± 0.393
0.436HisMet: 0.436 ± 0.165
0.581HisAsn: 0.581 ± 0.293
0.944HisPro: 0.944 ± 0.314
0.508HisGln: 0.508 ± 0.274
1.307HisArg: 1.307 ± 0.333
0.872HisSer: 0.872 ± 0.253
0.726HisThr: 0.726 ± 0.225
0.799HisVal: 0.799 ± 0.209
0.291HisTrp: 0.291 ± 0.144
0.726HisTyr: 0.726 ± 0.23
0.0HisXaa: 0.0 ± 0.0
Ile
3.268IleAla: 3.268 ± 0.581
0.436IleCys: 0.436 ± 0.195
3.559IleAsp: 3.559 ± 0.405
2.76IleGlu: 2.76 ± 0.526
0.654IlePhe: 0.654 ± 0.165
2.833IleGly: 2.833 ± 0.437
0.726IleHis: 0.726 ± 0.281
2.034IleIle: 2.034 ± 0.359
3.341IleLys: 3.341 ± 0.564
4.213IleLeu: 4.213 ± 0.59
1.162IleMet: 1.162 ± 0.266
1.888IleAsn: 1.888 ± 0.424
2.469IlePro: 2.469 ± 0.385
2.833IleGln: 2.833 ± 0.562
3.123IleArg: 3.123 ± 0.363
3.414IleSer: 3.414 ± 0.567
2.615IleThr: 2.615 ± 0.479
2.469IleVal: 2.469 ± 0.37
0.218IleTrp: 0.218 ± 0.132
1.307IleTyr: 1.307 ± 0.322
0.0IleXaa: 0.0 ± 0.0
Lys
5.956LysAla: 5.956 ± 1.002
0.363LysCys: 0.363 ± 0.163
2.469LysAsp: 2.469 ± 0.444
3.486LysGlu: 3.486 ± 0.497
1.017LysPhe: 1.017 ± 0.296
3.196LysGly: 3.196 ± 0.509
0.726LysHis: 0.726 ± 0.262
1.235LysIle: 1.235 ± 0.223
1.961LysLys: 1.961 ± 0.373
5.157LysLeu: 5.157 ± 0.708
1.235LysMet: 1.235 ± 0.295
1.453LysAsn: 1.453 ± 0.321
1.453LysPro: 1.453 ± 0.355
3.414LysGln: 3.414 ± 0.53
3.559LysArg: 3.559 ± 0.516
3.268LysSer: 3.268 ± 0.434
2.905LysThr: 2.905 ± 0.539
3.559LysVal: 3.559 ± 0.574
0.944LysTrp: 0.944 ± 0.219
1.525LysTyr: 1.525 ± 0.432
0.0LysXaa: 0.0 ± 0.0
Leu
8.207LeuAla: 8.207 ± 0.797
1.453LeuCys: 1.453 ± 0.33
7.118LeuAsp: 7.118 ± 0.642
5.23LeuGlu: 5.23 ± 0.461
2.833LeuPhe: 2.833 ± 0.371
6.755LeuGly: 6.755 ± 0.832
1.235LeuHis: 1.235 ± 0.263
4.431LeuIle: 4.431 ± 0.68
2.978LeuLys: 2.978 ± 0.488
6.464LeuLeu: 6.464 ± 0.628
2.252LeuMet: 2.252 ± 0.305
3.414LeuAsn: 3.414 ± 0.5
2.905LeuPro: 2.905 ± 0.483
4.431LeuGln: 4.431 ± 0.521
6.246LeuArg: 6.246 ± 0.501
5.811LeuSer: 5.811 ± 0.609
4.721LeuThr: 4.721 ± 0.607
6.392LeuVal: 6.392 ± 0.807
0.944LeuTrp: 0.944 ± 0.241
3.341LeuTyr: 3.341 ± 0.449
0.0LeuXaa: 0.0 ± 0.0
Met
3.341MetAla: 3.341 ± 0.59
0.145MetCys: 0.145 ± 0.104
2.034MetAsp: 2.034 ± 0.477
1.162MetGlu: 1.162 ± 0.241
0.799MetPhe: 0.799 ± 0.278
1.453MetGly: 1.453 ± 0.225
0.944MetHis: 0.944 ± 0.312
0.436MetIle: 0.436 ± 0.159
1.235MetLys: 1.235 ± 0.359
3.632MetLeu: 3.632 ± 0.529
0.508MetMet: 0.508 ± 0.191
1.089MetAsn: 1.089 ± 0.306
1.089MetPro: 1.089 ± 0.245
2.324MetGln: 2.324 ± 0.428
1.888MetArg: 1.888 ± 0.461
2.252MetSer: 2.252 ± 0.439
0.799MetThr: 0.799 ± 0.295
2.034MetVal: 2.034 ± 0.414
0.581MetTrp: 0.581 ± 0.188
1.017MetTyr: 1.017 ± 0.353
0.0MetXaa: 0.0 ± 0.0
Asn
3.414AsnAla: 3.414 ± 0.43
0.363AsnCys: 0.363 ± 0.147
2.397AsnAsp: 2.397 ± 0.405
1.525AsnGlu: 1.525 ± 0.446
0.944AsnPhe: 0.944 ± 0.285
3.341AsnGly: 3.341 ± 0.45
0.145AsnHis: 0.145 ± 0.099
2.542AsnIle: 2.542 ± 0.45
2.615AsnLys: 2.615 ± 0.397
2.905AsnLeu: 2.905 ± 0.477
1.38AsnMet: 1.38 ± 0.342
1.525AsnAsn: 1.525 ± 0.345
2.179AsnPro: 2.179 ± 0.399
1.38AsnGln: 1.38 ± 0.335
1.743AsnArg: 1.743 ± 0.409
2.978AsnSer: 2.978 ± 0.485
2.687AsnThr: 2.687 ± 0.444
3.196AsnVal: 3.196 ± 0.359
0.799AsnTrp: 0.799 ± 0.276
1.453AsnTyr: 1.453 ± 0.427
0.0AsnXaa: 0.0 ± 0.0
Pro
4.14ProAla: 4.14 ± 0.731
0.291ProCys: 0.291 ± 0.137
2.687ProAsp: 2.687 ± 0.426
3.486ProGlu: 3.486 ± 0.579
0.872ProPhe: 0.872 ± 0.216
2.833ProGly: 2.833 ± 0.55
0.291ProHis: 0.291 ± 0.178
2.034ProIle: 2.034 ± 0.359
1.671ProLys: 1.671 ± 0.451
2.615ProLeu: 2.615 ± 0.476
1.017ProMet: 1.017 ± 0.245
1.307ProAsn: 1.307 ± 0.337
0.872ProPro: 0.872 ± 0.259
1.525ProGln: 1.525 ± 0.286
1.888ProArg: 1.888 ± 0.354
2.397ProSer: 2.397 ± 0.559
2.469ProThr: 2.469 ± 0.406
2.615ProVal: 2.615 ± 0.37
0.726ProTrp: 0.726 ± 0.245
1.38ProTyr: 1.38 ± 0.347
0.0ProXaa: 0.0 ± 0.0
Gln
4.866GlnAla: 4.866 ± 0.725
0.436GlnCys: 0.436 ± 0.203
3.123GlnAsp: 3.123 ± 0.548
3.414GlnGlu: 3.414 ± 0.596
1.162GlnPhe: 1.162 ± 0.306
3.196GlnGly: 3.196 ± 0.534
1.671GlnHis: 1.671 ± 0.312
1.089GlnIle: 1.089 ± 0.322
2.397GlnLys: 2.397 ± 0.42
4.721GlnLeu: 4.721 ± 0.446
1.235GlnMet: 1.235 ± 0.234
2.397GlnAsn: 2.397 ± 0.359
1.888GlnPro: 1.888 ± 0.47
2.905GlnGln: 2.905 ± 0.553
3.123GlnArg: 3.123 ± 0.383
3.268GlnSer: 3.268 ± 0.525
1.307GlnThr: 1.307 ± 0.354
2.905GlnVal: 2.905 ± 0.56
0.799GlnTrp: 0.799 ± 0.304
2.252GlnTyr: 2.252 ± 0.486
0.0GlnXaa: 0.0 ± 0.0
Arg
7.118ArgAla: 7.118 ± 1.018
0.581ArgCys: 0.581 ± 0.245
3.414ArgAsp: 3.414 ± 0.461
3.632ArgGlu: 3.632 ± 0.442
2.324ArgPhe: 2.324 ± 0.329
4.358ArgGly: 4.358 ± 0.697
1.162ArgHis: 1.162 ± 0.272
3.777ArgIle: 3.777 ± 0.589
3.414ArgLys: 3.414 ± 0.607
4.866ArgLeu: 4.866 ± 0.488
1.816ArgMet: 1.816 ± 0.386
2.542ArgAsn: 2.542 ± 0.394
1.671ArgPro: 1.671 ± 0.312
2.76ArgGln: 2.76 ± 0.462
4.576ArgArg: 4.576 ± 0.712
2.687ArgSer: 2.687 ± 0.546
3.559ArgThr: 3.559 ± 0.347
3.414ArgVal: 3.414 ± 0.444
1.089ArgTrp: 1.089 ± 0.272
2.106ArgTyr: 2.106 ± 0.294
0.0ArgXaa: 0.0 ± 0.0
Ser
8.135SerAla: 8.135 ± 0.785
0.654SerCys: 0.654 ± 0.248
3.85SerAsp: 3.85 ± 0.597
3.268SerGlu: 3.268 ± 0.527
2.179SerPhe: 2.179 ± 0.416
5.956SerGly: 5.956 ± 0.605
0.799SerHis: 0.799 ± 0.298
2.469SerIle: 2.469 ± 0.568
3.995SerLys: 3.995 ± 0.555
4.866SerLeu: 4.866 ± 0.588
2.978SerMet: 2.978 ± 0.42
3.051SerAsn: 3.051 ± 0.605
2.397SerPro: 2.397 ± 0.38
1.743SerGln: 1.743 ± 0.32
3.486SerArg: 3.486 ± 0.435
3.85SerSer: 3.85 ± 0.887
4.576SerThr: 4.576 ± 0.591
4.503SerVal: 4.503 ± 0.685
0.944SerTrp: 0.944 ± 0.244
1.888SerTyr: 1.888 ± 0.393
0.0SerXaa: 0.0 ± 0.0
Thr
5.375ThrAla: 5.375 ± 0.774
0.508ThrCys: 0.508 ± 0.235
3.196ThrAsp: 3.196 ± 0.445
2.76ThrGlu: 2.76 ± 0.6
2.034ThrPhe: 2.034 ± 0.431
5.23ThrGly: 5.23 ± 0.638
1.017ThrHis: 1.017 ± 0.278
2.324ThrIle: 2.324 ± 0.438
2.542ThrLys: 2.542 ± 0.365
5.012ThrLeu: 5.012 ± 0.51
1.235ThrMet: 1.235 ± 0.367
2.034ThrAsn: 2.034 ± 0.467
2.833ThrPro: 2.833 ± 0.307
2.469ThrGln: 2.469 ± 0.447
2.905ThrArg: 2.905 ± 0.545
4.213ThrSer: 4.213 ± 0.589
3.196ThrThr: 3.196 ± 0.533
4.794ThrVal: 4.794 ± 0.797
1.017ThrTrp: 1.017 ± 0.222
2.106ThrTyr: 2.106 ± 0.452
0.0ThrXaa: 0.0 ± 0.0
Val
6.464ValAla: 6.464 ± 0.781
0.581ValCys: 0.581 ± 0.241
4.576ValAsp: 4.576 ± 0.563
4.14ValGlu: 4.14 ± 0.621
1.235ValPhe: 1.235 ± 0.337
5.956ValGly: 5.956 ± 0.702
1.961ValHis: 1.961 ± 0.346
2.76ValIle: 2.76 ± 0.53
3.051ValLys: 3.051 ± 0.542
6.246ValLeu: 6.246 ± 0.853
1.743ValMet: 1.743 ± 0.391
3.051ValAsn: 3.051 ± 0.532
3.268ValPro: 3.268 ± 0.582
3.85ValGln: 3.85 ± 0.701
3.995ValArg: 3.995 ± 0.504
5.084ValSer: 5.084 ± 0.79
3.123ValThr: 3.123 ± 0.744
6.174ValVal: 6.174 ± 0.571
0.581ValTrp: 0.581 ± 0.212
2.833ValTyr: 2.833 ± 0.493
0.0ValXaa: 0.0 ± 0.0
Trp
1.089TrpAla: 1.089 ± 0.292
0.291TrpCys: 0.291 ± 0.126
1.017TrpAsp: 1.017 ± 0.268
1.089TrpGlu: 1.089 ± 0.259
0.799TrpPhe: 0.799 ± 0.286
0.726TrpGly: 0.726 ± 0.235
0.291TrpHis: 0.291 ± 0.135
0.654TrpIle: 0.654 ± 0.246
0.654TrpLys: 0.654 ± 0.265
1.453TrpLeu: 1.453 ± 0.297
0.291TrpMet: 0.291 ± 0.198
0.872TrpAsn: 0.872 ± 0.244
0.291TrpPro: 0.291 ± 0.159
0.581TrpGln: 0.581 ± 0.194
0.872TrpArg: 0.872 ± 0.225
0.654TrpSer: 0.654 ± 0.211
1.017TrpThr: 1.017 ± 0.231
1.162TrpVal: 1.162 ± 0.259
0.291TrpTrp: 0.291 ± 0.143
0.799TrpTyr: 0.799 ± 0.262
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.978TyrAla: 2.978 ± 0.523
0.726TyrCys: 0.726 ± 0.284
2.324TyrAsp: 2.324 ± 0.455
2.034TyrGlu: 2.034 ± 0.543
1.017TyrPhe: 1.017 ± 0.271
2.76TyrGly: 2.76 ± 0.585
0.726TyrHis: 0.726 ± 0.238
1.888TyrIle: 1.888 ± 0.392
2.179TyrLys: 2.179 ± 0.372
4.213TyrLeu: 4.213 ± 0.564
1.017TyrMet: 1.017 ± 0.288
1.38TyrAsn: 1.38 ± 0.256
1.307TyrPro: 1.307 ± 0.264
2.252TyrGln: 2.252 ± 0.446
2.179TyrArg: 2.179 ± 0.579
2.324TyrSer: 2.324 ± 0.364
2.76TyrThr: 2.76 ± 0.399
1.743TyrVal: 1.743 ± 0.411
0.799TyrTrp: 0.799 ± 0.241
1.307TyrTyr: 1.307 ± 0.282
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (13769 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski