Amino acid dipepetide frequency for Klebsiella phage phiKO2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.609AlaAla: 9.609 ± 0.898
1.123AlaCys: 1.123 ± 0.399
6.926AlaAsp: 6.926 ± 0.958
5.99AlaGlu: 5.99 ± 0.645
1.747AlaPhe: 1.747 ± 0.302
6.926AlaGly: 6.926 ± 0.779
0.998AlaHis: 0.998 ± 0.233
6.177AlaIle: 6.177 ± 0.563
4.805AlaLys: 4.805 ± 0.782
9.11AlaLeu: 9.11 ± 0.65
1.872AlaMet: 1.872 ± 0.335
3.994AlaAsn: 3.994 ± 0.979
2.621AlaPro: 2.621 ± 0.398
5.179AlaGln: 5.179 ± 0.751
5.99AlaArg: 5.99 ± 0.501
6.365AlaSer: 6.365 ± 0.743
5.117AlaThr: 5.117 ± 0.58
6.552AlaVal: 6.552 ± 0.813
1.997AlaTrp: 1.997 ± 0.284
2.371AlaTyr: 2.371 ± 0.243
0.0AlaXaa: 0.0 ± 0.0
Cys
0.749CysAla: 0.749 ± 0.253
0.062CysCys: 0.062 ± 0.078
0.998CysAsp: 0.998 ± 0.234
0.499CysGlu: 0.499 ± 0.165
0.374CysPhe: 0.374 ± 0.186
0.811CysGly: 0.811 ± 0.235
0.062CysHis: 0.062 ± 0.081
0.624CysIle: 0.624 ± 0.21
0.562CysLys: 0.562 ± 0.229
0.874CysLeu: 0.874 ± 0.303
0.187CysMet: 0.187 ± 0.109
0.437CysAsn: 0.437 ± 0.183
0.499CysPro: 0.499 ± 0.231
0.562CysGln: 0.562 ± 0.236
0.936CysArg: 0.936 ± 0.335
0.499CysSer: 0.499 ± 0.215
0.562CysThr: 0.562 ± 0.244
0.624CysVal: 0.624 ± 0.218
0.374CysTrp: 0.374 ± 0.154
0.374CysTyr: 0.374 ± 0.162
0.0CysXaa: 0.0 ± 0.0
Asp
6.739AspAla: 6.739 ± 0.859
0.624AspCys: 0.624 ± 0.271
3.619AspAsp: 3.619 ± 0.59
3.994AspGlu: 3.994 ± 0.839
2.371AspPhe: 2.371 ± 0.325
5.491AspGly: 5.491 ± 0.59
0.624AspHis: 0.624 ± 0.168
3.307AspIle: 3.307 ± 0.399
3.245AspLys: 3.245 ± 0.536
6.053AspLeu: 6.053 ± 0.61
1.31AspMet: 1.31 ± 0.306
2.621AspAsn: 2.621 ± 0.351
2.246AspPro: 2.246 ± 0.499
1.747AspGln: 1.747 ± 0.349
2.683AspArg: 2.683 ± 0.359
3.37AspSer: 3.37 ± 0.615
2.434AspThr: 2.434 ± 0.381
2.87AspVal: 2.87 ± 0.55
0.874AspTrp: 0.874 ± 0.2
2.496AspTyr: 2.496 ± 0.653
0.0AspXaa: 0.0 ± 0.0
Glu
5.99GluAla: 5.99 ± 0.86
0.499GluCys: 0.499 ± 0.16
2.995GluAsp: 2.995 ± 0.617
4.368GluGlu: 4.368 ± 0.945
2.371GluPhe: 2.371 ± 0.413
3.744GluGly: 3.744 ± 0.396
1.31GluHis: 1.31 ± 0.413
3.557GluIle: 3.557 ± 0.603
4.181GluLys: 4.181 ± 0.571
7.737GluLeu: 7.737 ± 1.04
1.997GluMet: 1.997 ± 0.323
3.245GluAsn: 3.245 ± 0.426
1.81GluPro: 1.81 ± 0.43
2.434GluGln: 2.434 ± 0.373
4.306GluArg: 4.306 ± 0.541
4.056GluSer: 4.056 ± 0.613
3.557GluThr: 3.557 ± 0.379
3.182GluVal: 3.182 ± 0.403
0.686GluTrp: 0.686 ± 0.214
1.685GluTyr: 1.685 ± 0.29
0.0GluXaa: 0.0 ± 0.0
Phe
2.496PheAla: 2.496 ± 0.583
0.499PheCys: 0.499 ± 0.201
3.058PheAsp: 3.058 ± 0.403
2.621PheGlu: 2.621 ± 0.433
0.749PhePhe: 0.749 ± 0.271
2.246PheGly: 2.246 ± 0.465
0.562PheHis: 0.562 ± 0.238
1.435PheIle: 1.435 ± 0.451
1.248PheLys: 1.248 ± 0.39
1.622PheLeu: 1.622 ± 0.364
0.749PheMet: 0.749 ± 0.299
2.371PheAsn: 2.371 ± 0.459
0.936PhePro: 0.936 ± 0.296
0.749PheGln: 0.749 ± 0.159
1.56PheArg: 1.56 ± 0.347
1.685PheSer: 1.685 ± 0.406
2.309PheThr: 2.309 ± 0.402
1.997PheVal: 1.997 ± 0.316
0.499PheTrp: 0.499 ± 0.183
0.874PheTyr: 0.874 ± 0.323
0.0PheXaa: 0.0 ± 0.0
Gly
5.553GlyAla: 5.553 ± 0.735
0.749GlyCys: 0.749 ± 0.204
4.742GlyAsp: 4.742 ± 0.673
4.368GlyGlu: 4.368 ± 0.541
2.434GlyPhe: 2.434 ± 0.564
5.553GlyGly: 5.553 ± 0.67
1.31GlyHis: 1.31 ± 0.345
3.432GlyIle: 3.432 ± 0.466
4.617GlyLys: 4.617 ± 0.583
6.739GlyLeu: 6.739 ± 0.7
1.622GlyMet: 1.622 ± 0.423
4.43GlyAsn: 4.43 ± 1.027
1.373GlyPro: 1.373 ± 0.306
2.995GlyGln: 2.995 ± 0.621
3.619GlyArg: 3.619 ± 0.407
5.054GlySer: 5.054 ± 1.113
5.553GlyThr: 5.553 ± 0.883
4.617GlyVal: 4.617 ± 0.498
1.373GlyTrp: 1.373 ± 0.304
1.934GlyTyr: 1.934 ± 0.345
0.0GlyXaa: 0.0 ± 0.0
His
1.31HisAla: 1.31 ± 0.324
0.125HisCys: 0.125 ± 0.085
1.123HisAsp: 1.123 ± 0.289
1.186HisGlu: 1.186 ± 0.307
0.874HisPhe: 0.874 ± 0.211
1.123HisGly: 1.123 ± 0.296
0.624HisHis: 0.624 ± 0.212
0.998HisIle: 0.998 ± 0.32
0.25HisLys: 0.25 ± 0.109
1.123HisLeu: 1.123 ± 0.274
0.562HisMet: 0.562 ± 0.198
0.437HisAsn: 0.437 ± 0.192
0.998HisPro: 0.998 ± 0.308
0.437HisGln: 0.437 ± 0.173
1.186HisArg: 1.186 ± 0.382
1.31HisSer: 1.31 ± 0.344
0.811HisThr: 0.811 ± 0.283
0.499HisVal: 0.499 ± 0.169
0.25HisTrp: 0.25 ± 0.115
0.811HisTyr: 0.811 ± 0.271
0.0HisXaa: 0.0 ± 0.0
Ile
3.37IleAla: 3.37 ± 0.424
0.874IleCys: 0.874 ± 0.275
3.494IleAsp: 3.494 ± 0.385
3.994IleGlu: 3.994 ± 0.563
1.061IlePhe: 1.061 ± 0.326
3.931IleGly: 3.931 ± 0.529
1.186IleHis: 1.186 ± 0.312
3.432IleIle: 3.432 ± 0.653
2.309IleLys: 2.309 ± 0.42
3.058IleLeu: 3.058 ± 0.635
1.248IleMet: 1.248 ± 0.338
2.995IleAsn: 2.995 ± 0.386
2.746IlePro: 2.746 ± 0.393
2.87IleGln: 2.87 ± 0.429
3.245IleArg: 3.245 ± 0.583
4.992IleSer: 4.992 ± 0.577
5.741IleThr: 5.741 ± 1.072
2.434IleVal: 2.434 ± 0.408
0.749IleTrp: 0.749 ± 0.201
1.31IleTyr: 1.31 ± 0.315
0.0IleXaa: 0.0 ± 0.0
Lys
5.678LysAla: 5.678 ± 0.698
0.437LysCys: 0.437 ± 0.19
2.746LysAsp: 2.746 ± 0.325
2.933LysGlu: 2.933 ± 0.72
1.81LysPhe: 1.81 ± 0.361
2.746LysGly: 2.746 ± 0.515
0.499LysHis: 0.499 ± 0.204
2.808LysIle: 2.808 ± 0.382
3.994LysLys: 3.994 ± 0.48
4.867LysLeu: 4.867 ± 0.569
1.56LysMet: 1.56 ± 0.511
2.933LysAsn: 2.933 ± 0.476
2.496LysPro: 2.496 ± 0.617
2.558LysGln: 2.558 ± 0.421
3.494LysArg: 3.494 ± 0.601
3.058LysSer: 3.058 ± 0.55
3.682LysThr: 3.682 ± 0.468
2.87LysVal: 2.87 ± 0.447
0.936LysTrp: 0.936 ± 0.333
2.122LysTyr: 2.122 ± 0.399
0.0LysXaa: 0.0 ± 0.0
Leu
8.861LeuAla: 8.861 ± 1.305
1.061LeuCys: 1.061 ± 0.312
4.742LeuAsp: 4.742 ± 0.629
5.553LeuGlu: 5.553 ± 0.702
2.496LeuPhe: 2.496 ± 0.49
4.118LeuGly: 4.118 ± 0.579
0.998LeuHis: 0.998 ± 0.257
3.931LeuIle: 3.931 ± 0.489
5.366LeuLys: 5.366 ± 0.756
6.739LeuLeu: 6.739 ± 1.247
1.81LeuMet: 1.81 ± 0.427
4.555LeuAsn: 4.555 ± 0.453
3.994LeuPro: 3.994 ± 0.666
5.241LeuGln: 5.241 ± 0.64
6.302LeuArg: 6.302 ± 0.72
6.24LeuSer: 6.24 ± 0.692
7.051LeuThr: 7.051 ± 1.288
4.306LeuVal: 4.306 ± 0.82
0.562LeuTrp: 0.562 ± 0.164
2.558LeuTyr: 2.558 ± 0.336
0.0LeuXaa: 0.0 ± 0.0
Met
2.558MetAla: 2.558 ± 0.367
0.187MetCys: 0.187 ± 0.136
0.936MetAsp: 0.936 ± 0.294
1.685MetGlu: 1.685 ± 0.361
0.998MetPhe: 0.998 ± 0.332
1.373MetGly: 1.373 ± 0.35
0.25MetHis: 0.25 ± 0.137
1.31MetIle: 1.31 ± 0.261
1.81MetLys: 1.81 ± 0.494
2.309MetLeu: 2.309 ± 0.563
0.936MetMet: 0.936 ± 0.241
1.373MetAsn: 1.373 ± 0.269
1.248MetPro: 1.248 ± 0.31
0.936MetGln: 0.936 ± 0.219
1.435MetArg: 1.435 ± 0.353
1.747MetSer: 1.747 ± 0.467
1.498MetThr: 1.498 ± 0.393
1.81MetVal: 1.81 ± 0.387
0.125MetTrp: 0.125 ± 0.104
0.562MetTyr: 0.562 ± 0.152
0.0MetXaa: 0.0 ± 0.0
Asn
5.741AsnAla: 5.741 ± 1.236
0.374AsnCys: 0.374 ± 0.183
2.184AsnAsp: 2.184 ± 0.282
1.747AsnGlu: 1.747 ± 0.273
1.435AsnPhe: 1.435 ± 0.342
4.555AsnGly: 4.555 ± 1.244
0.811AsnHis: 0.811 ± 0.229
2.184AsnIle: 2.184 ± 0.332
2.808AsnLys: 2.808 ± 0.45
2.558AsnLeu: 2.558 ± 0.573
1.186AsnMet: 1.186 ± 0.336
1.872AsnAsn: 1.872 ± 0.457
2.309AsnPro: 2.309 ± 0.354
2.246AsnGln: 2.246 ± 0.482
2.434AsnArg: 2.434 ± 0.382
4.118AsnSer: 4.118 ± 0.783
3.744AsnThr: 3.744 ± 0.812
2.746AsnVal: 2.746 ± 0.504
0.374AsnTrp: 0.374 ± 0.149
1.248AsnTyr: 1.248 ± 0.237
0.0AsnXaa: 0.0 ± 0.0
Pro
4.617ProAla: 4.617 ± 0.53
0.499ProCys: 0.499 ± 0.237
2.558ProAsp: 2.558 ± 0.365
2.995ProGlu: 2.995 ± 0.373
1.123ProPhe: 1.123 ± 0.227
3.12ProGly: 3.12 ± 0.506
1.373ProHis: 1.373 ± 0.408
1.373ProIle: 1.373 ± 0.411
2.122ProLys: 2.122 ± 0.443
2.184ProLeu: 2.184 ± 0.494
0.874ProMet: 0.874 ± 0.272
1.435ProAsn: 1.435 ± 0.237
1.498ProPro: 1.498 ± 0.403
1.248ProGln: 1.248 ± 0.266
1.56ProArg: 1.56 ± 0.383
2.309ProSer: 2.309 ± 0.301
1.997ProThr: 1.997 ± 0.426
2.558ProVal: 2.558 ± 0.515
0.998ProTrp: 0.998 ± 0.23
0.874ProTyr: 0.874 ± 0.257
0.0ProXaa: 0.0 ± 0.0
Gln
4.555GlnAla: 4.555 ± 0.917
0.187GlnCys: 0.187 ± 0.12
1.435GlnAsp: 1.435 ± 0.295
3.37GlnGlu: 3.37 ± 0.437
1.498GlnPhe: 1.498 ± 0.299
3.744GlnGly: 3.744 ± 1.0
0.749GlnHis: 0.749 ± 0.291
2.496GlnIle: 2.496 ± 0.456
1.997GlnLys: 1.997 ± 0.253
4.68GlnLeu: 4.68 ± 0.613
1.435GlnMet: 1.435 ± 0.36
2.059GlnAsn: 2.059 ± 0.634
1.498GlnPro: 1.498 ± 0.347
2.746GlnGln: 2.746 ± 0.599
3.182GlnArg: 3.182 ± 0.462
3.869GlnSer: 3.869 ± 0.901
2.309GlnThr: 2.309 ± 0.475
2.246GlnVal: 2.246 ± 0.28
1.186GlnTrp: 1.186 ± 0.308
1.622GlnTyr: 1.622 ± 0.521
0.0GlnXaa: 0.0 ± 0.0
Arg
4.181ArgAla: 4.181 ± 0.471
0.874ArgCys: 0.874 ± 0.292
3.307ArgAsp: 3.307 ± 0.523
3.744ArgGlu: 3.744 ± 0.488
1.872ArgPhe: 1.872 ± 0.327
4.181ArgGly: 4.181 ± 0.541
1.435ArgHis: 1.435 ± 0.443
3.432ArgIle: 3.432 ± 0.437
3.058ArgLys: 3.058 ± 0.53
4.992ArgLeu: 4.992 ± 0.634
2.309ArgMet: 2.309 ± 0.396
2.496ArgAsn: 2.496 ± 0.367
2.059ArgPro: 2.059 ± 0.293
3.12ArgGln: 3.12 ± 0.657
3.557ArgArg: 3.557 ± 0.725
3.058ArgSer: 3.058 ± 0.349
3.619ArgThr: 3.619 ± 0.59
4.368ArgVal: 4.368 ± 0.738
0.749ArgTrp: 0.749 ± 0.254
2.122ArgTyr: 2.122 ± 0.402
0.0ArgXaa: 0.0 ± 0.0
Ser
7.675SerAla: 7.675 ± 0.985
0.499SerCys: 0.499 ± 0.117
4.43SerAsp: 4.43 ± 0.714
4.118SerGlu: 4.118 ± 0.741
2.122SerPhe: 2.122 ± 0.317
7.238SerGly: 7.238 ± 0.906
1.061SerHis: 1.061 ± 0.268
5.429SerIle: 5.429 ± 0.784
2.434SerLys: 2.434 ± 0.373
7.176SerLeu: 7.176 ± 1.145
1.31SerMet: 1.31 ± 0.346
1.934SerAsn: 1.934 ± 0.293
2.434SerPro: 2.434 ± 0.384
3.245SerGln: 3.245 ± 0.738
3.806SerArg: 3.806 ± 0.465
5.928SerSer: 5.928 ± 1.187
4.805SerThr: 4.805 ± 1.103
4.68SerVal: 4.68 ± 0.555
0.624SerTrp: 0.624 ± 0.197
1.685SerTyr: 1.685 ± 0.304
0.0SerXaa: 0.0 ± 0.0
Thr
7.425ThrAla: 7.425 ± 1.097
0.686ThrCys: 0.686 ± 0.21
4.118ThrAsp: 4.118 ± 0.567
3.931ThrGlu: 3.931 ± 0.572
1.56ThrPhe: 1.56 ± 0.328
4.68ThrGly: 4.68 ± 0.454
0.312ThrHis: 0.312 ± 0.116
3.557ThrIle: 3.557 ± 0.583
3.12ThrLys: 3.12 ± 0.405
5.99ThrLeu: 5.99 ± 1.0
1.373ThrMet: 1.373 ± 0.335
3.494ThrAsn: 3.494 ± 0.929
2.309ThrPro: 2.309 ± 0.348
2.87ThrGln: 2.87 ± 0.676
3.557ThrArg: 3.557 ± 0.364
5.865ThrSer: 5.865 ± 1.369
5.241ThrThr: 5.241 ± 1.321
4.617ThrVal: 4.617 ± 0.893
0.874ThrTrp: 0.874 ± 0.218
1.186ThrTyr: 1.186 ± 0.295
0.0ThrXaa: 0.0 ± 0.0
Val
4.867ValAla: 4.867 ± 0.606
0.499ValCys: 0.499 ± 0.224
3.744ValAsp: 3.744 ± 0.523
4.306ValGlu: 4.306 ± 0.452
1.685ValPhe: 1.685 ± 0.404
3.307ValGly: 3.307 ± 0.519
0.936ValHis: 0.936 ± 0.266
3.682ValIle: 3.682 ± 0.409
4.056ValLys: 4.056 ± 0.434
4.368ValLeu: 4.368 ± 0.729
1.685ValMet: 1.685 ± 0.308
2.87ValAsn: 2.87 ± 0.755
1.997ValPro: 1.997 ± 0.285
2.496ValGln: 2.496 ± 0.399
2.621ValArg: 2.621 ± 0.393
5.304ValSer: 5.304 ± 0.757
3.994ValThr: 3.994 ± 0.908
4.056ValVal: 4.056 ± 0.51
0.998ValTrp: 0.998 ± 0.174
2.122ValTyr: 2.122 ± 0.302
0.0ValXaa: 0.0 ± 0.0
Trp
0.874TrpAla: 0.874 ± 0.25
0.125TrpCys: 0.125 ± 0.107
0.25TrpAsp: 0.25 ± 0.099
0.624TrpGlu: 0.624 ± 0.178
0.374TrpPhe: 0.374 ± 0.122
0.998TrpGly: 0.998 ± 0.306
0.125TrpHis: 0.125 ± 0.074
0.499TrpIle: 0.499 ± 0.225
0.749TrpLys: 0.749 ± 0.229
2.122TrpLeu: 2.122 ± 0.446
0.25TrpMet: 0.25 ± 0.161
0.374TrpAsn: 0.374 ± 0.133
0.686TrpPro: 0.686 ± 0.216
1.435TrpGln: 1.435 ± 0.34
1.373TrpArg: 1.373 ± 0.281
1.435TrpSer: 1.435 ± 0.36
0.936TrpThr: 0.936 ± 0.337
0.998TrpVal: 0.998 ± 0.235
0.187TrpTrp: 0.187 ± 0.128
0.312TrpTyr: 0.312 ± 0.154
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.621TyrAla: 2.621 ± 0.473
0.686TyrCys: 0.686 ± 0.255
1.31TyrAsp: 1.31 ± 0.216
1.685TyrGlu: 1.685 ± 0.365
1.373TyrPhe: 1.373 ± 0.356
2.184TyrGly: 2.184 ± 0.509
0.811TyrHis: 0.811 ± 0.26
1.373TyrIle: 1.373 ± 0.26
1.498TyrLys: 1.498 ± 0.48
2.184TyrLeu: 2.184 ± 0.414
0.749TyrMet: 0.749 ± 0.295
0.749TyrAsn: 0.749 ± 0.224
1.56TyrPro: 1.56 ± 0.397
1.747TyrGln: 1.747 ± 0.358
1.685TyrArg: 1.685 ± 0.299
2.371TyrSer: 2.371 ± 0.349
1.81TyrThr: 1.81 ± 0.389
1.56TyrVal: 1.56 ± 0.311
0.312TyrTrp: 0.312 ± 0.134
0.874TyrTyr: 0.874 ± 0.248
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (16027 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski