Amino acid dipepetide frequency for Klebsiella phage 1513

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.941AlaAla: 9.941 ± 1.171
1.252AlaCys: 1.252 ± 0.283
5.965AlaAsp: 5.965 ± 0.651
6.112AlaGlu: 6.112 ± 0.796
3.166AlaPhe: 3.166 ± 0.569
6.627AlaGly: 6.627 ± 0.687
1.252AlaHis: 1.252 ± 0.318
5.376AlaIle: 5.376 ± 0.576
6.775AlaLys: 6.775 ± 0.777
7.216AlaLeu: 7.216 ± 0.774
3.019AlaMet: 3.019 ± 0.459
2.725AlaAsn: 2.725 ± 0.514
2.283AlaPro: 2.283 ± 0.377
2.946AlaGln: 2.946 ± 0.617
4.418AlaArg: 4.418 ± 0.61
5.228AlaSer: 5.228 ± 0.723
4.492AlaThr: 4.492 ± 0.633
6.038AlaVal: 6.038 ± 0.683
1.031AlaTrp: 1.031 ± 0.272
2.798AlaTyr: 2.798 ± 0.484
0.0AlaXaa: 0.0 ± 0.0
Cys
0.81CysAla: 0.81 ± 0.237
0.074CysCys: 0.074 ± 0.085
1.767CysAsp: 1.767 ± 0.317
1.178CysGlu: 1.178 ± 0.352
0.295CysPhe: 0.295 ± 0.151
1.178CysGly: 1.178 ± 0.348
0.295CysHis: 0.295 ± 0.127
1.178CysIle: 1.178 ± 0.276
0.81CysLys: 0.81 ± 0.296
0.736CysLeu: 0.736 ± 0.219
0.663CysMet: 0.663 ± 0.186
0.368CysAsn: 0.368 ± 0.159
0.736CysPro: 0.736 ± 0.249
0.147CysGln: 0.147 ± 0.142
1.178CysArg: 1.178 ± 0.279
0.589CysSer: 0.589 ± 0.228
0.736CysThr: 0.736 ± 0.234
0.81CysVal: 0.81 ± 0.27
0.368CysTrp: 0.368 ± 0.154
0.589CysTyr: 0.589 ± 0.194
0.0CysXaa: 0.0 ± 0.0
Asp
6.186AspAla: 6.186 ± 0.64
0.884AspCys: 0.884 ± 0.257
4.05AspAsp: 4.05 ± 0.571
4.713AspGlu: 4.713 ± 0.594
2.946AspPhe: 2.946 ± 0.462
6.406AspGly: 6.406 ± 0.753
1.178AspHis: 1.178 ± 0.252
4.639AspIle: 4.639 ± 0.553
4.345AspLys: 4.345 ± 0.614
3.682AspLeu: 3.682 ± 0.555
1.031AspMet: 1.031 ± 0.345
2.946AspAsn: 2.946 ± 0.518
2.135AspPro: 2.135 ± 0.296
1.325AspGln: 1.325 ± 0.27
2.798AspArg: 2.798 ± 0.484
3.756AspSer: 3.756 ± 0.486
2.872AspThr: 2.872 ± 0.409
3.535AspVal: 3.535 ± 0.517
1.252AspTrp: 1.252 ± 0.335
2.209AspTyr: 2.209 ± 0.432
0.0AspXaa: 0.0 ± 0.0
Glu
4.639GluAla: 4.639 ± 0.571
0.884GluCys: 0.884 ± 0.279
3.387GluAsp: 3.387 ± 0.509
3.976GluGlu: 3.976 ± 0.67
4.566GluPhe: 4.566 ± 0.604
3.314GluGly: 3.314 ± 0.403
0.884GluHis: 0.884 ± 0.231
3.608GluIle: 3.608 ± 0.497
4.197GluLys: 4.197 ± 0.503
4.271GluLeu: 4.271 ± 0.673
2.651GluMet: 2.651 ± 0.357
2.872GluAsn: 2.872 ± 0.393
2.062GluPro: 2.062 ± 0.418
3.682GluGln: 3.682 ± 0.586
3.314GluArg: 3.314 ± 0.461
4.345GluSer: 4.345 ± 0.529
2.872GluThr: 2.872 ± 0.427
4.86GluVal: 4.86 ± 0.88
1.178GluTrp: 1.178 ± 0.286
2.651GluTyr: 2.651 ± 0.474
0.0GluXaa: 0.0 ± 0.0
Phe
3.387PheAla: 3.387 ± 0.445
0.884PheCys: 0.884 ± 0.268
3.535PheAsp: 3.535 ± 0.546
2.43PheGlu: 2.43 ± 0.342
1.252PhePhe: 1.252 ± 0.282
4.271PheGly: 4.271 ± 0.632
0.589PheHis: 0.589 ± 0.173
3.387PheIle: 3.387 ± 0.612
2.283PheLys: 2.283 ± 0.421
2.356PheLeu: 2.356 ± 0.348
0.884PheMet: 0.884 ± 0.237
2.209PheAsn: 2.209 ± 0.419
1.325PhePro: 1.325 ± 0.282
1.325PheGln: 1.325 ± 0.353
1.988PheArg: 1.988 ± 0.353
2.577PheSer: 2.577 ± 0.583
2.872PheThr: 2.872 ± 0.384
2.135PheVal: 2.135 ± 0.365
0.884PheTrp: 0.884 ± 0.296
1.546PheTyr: 1.546 ± 0.298
0.0PheXaa: 0.0 ± 0.0
Gly
4.934GlyAla: 4.934 ± 0.68
1.178GlyCys: 1.178 ± 0.276
4.786GlyAsp: 4.786 ± 0.513
5.67GlyGlu: 5.67 ± 0.637
3.608GlyPhe: 3.608 ± 0.404
8.247GlyGly: 8.247 ± 1.035
1.178GlyHis: 1.178 ± 0.375
4.492GlyIle: 4.492 ± 0.497
5.449GlyLys: 5.449 ± 0.626
4.786GlyLeu: 4.786 ± 0.559
2.577GlyMet: 2.577 ± 0.439
4.418GlyAsn: 4.418 ± 0.449
1.546GlyPro: 1.546 ± 0.292
1.988GlyGln: 1.988 ± 0.392
4.05GlyArg: 4.05 ± 0.414
5.228GlySer: 5.228 ± 0.581
4.124GlyThr: 4.124 ± 0.867
5.817GlyVal: 5.817 ± 0.476
0.736GlyTrp: 0.736 ± 0.194
3.314GlyTyr: 3.314 ± 0.476
0.0GlyXaa: 0.0 ± 0.0
His
1.178HisAla: 1.178 ± 0.344
0.368HisCys: 0.368 ± 0.147
1.031HisAsp: 1.031 ± 0.277
2.209HisGlu: 2.209 ± 0.478
0.663HisPhe: 0.663 ± 0.211
1.694HisGly: 1.694 ± 0.36
0.515HisHis: 0.515 ± 0.215
0.663HisIle: 0.663 ± 0.248
1.105HisLys: 1.105 ± 0.303
1.546HisLeu: 1.546 ± 0.426
0.074HisMet: 0.074 ± 0.073
0.515HisAsn: 0.515 ± 0.216
0.736HisPro: 0.736 ± 0.346
0.736HisGln: 0.736 ± 0.245
0.884HisArg: 0.884 ± 0.254
0.957HisSer: 0.957 ± 0.305
0.884HisThr: 0.884 ± 0.238
1.767HisVal: 1.767 ± 0.361
0.442HisTrp: 0.442 ± 0.164
0.736HisTyr: 0.736 ± 0.279
0.0HisXaa: 0.0 ± 0.0
Ile
6.186IleAla: 6.186 ± 0.676
1.252IleCys: 1.252 ± 0.327
4.934IleAsp: 4.934 ± 0.596
3.756IleGlu: 3.756 ± 0.555
2.135IlePhe: 2.135 ± 0.399
4.05IleGly: 4.05 ± 0.481
1.473IleHis: 1.473 ± 0.321
3.608IleIle: 3.608 ± 0.458
4.197IleLys: 4.197 ± 0.553
3.461IleLeu: 3.461 ± 0.507
1.841IleMet: 1.841 ± 0.359
2.725IleAsn: 2.725 ± 0.447
2.283IlePro: 2.283 ± 0.421
2.135IleGln: 2.135 ± 0.353
3.019IleArg: 3.019 ± 0.469
3.314IleSer: 3.314 ± 0.448
4.418IleThr: 4.418 ± 0.554
4.345IleVal: 4.345 ± 0.484
1.105IleTrp: 1.105 ± 0.269
1.988IleTyr: 1.988 ± 0.382
0.0IleXaa: 0.0 ± 0.0
Lys
6.112LysAla: 6.112 ± 0.727
0.884LysCys: 0.884 ± 0.31
3.314LysAsp: 3.314 ± 0.58
4.124LysGlu: 4.124 ± 0.408
2.283LysPhe: 2.283 ± 0.334
4.271LysGly: 4.271 ± 0.541
1.325LysHis: 1.325 ± 0.333
4.271LysIle: 4.271 ± 0.534
3.387LysLys: 3.387 ± 0.573
5.155LysLeu: 5.155 ± 0.544
3.829LysMet: 3.829 ± 0.642
2.872LysAsn: 2.872 ± 0.497
3.24LysPro: 3.24 ± 0.502
2.946LysGln: 2.946 ± 0.455
4.418LysArg: 4.418 ± 0.658
2.946LysSer: 2.946 ± 0.396
3.903LysThr: 3.903 ± 0.599
4.786LysVal: 4.786 ± 0.492
1.105LysTrp: 1.105 ± 0.282
1.915LysTyr: 1.915 ± 0.336
0.0LysXaa: 0.0 ± 0.0
Leu
6.775LeuAla: 6.775 ± 0.728
0.81LeuCys: 0.81 ± 0.211
3.682LeuAsp: 3.682 ± 0.522
3.976LeuGlu: 3.976 ± 0.494
2.135LeuPhe: 2.135 ± 0.302
4.713LeuGly: 4.713 ± 0.647
1.252LeuHis: 1.252 ± 0.436
3.829LeuIle: 3.829 ± 0.426
4.786LeuLys: 4.786 ± 0.589
4.345LeuLeu: 4.345 ± 0.492
1.62LeuMet: 1.62 ± 0.333
3.093LeuAsn: 3.093 ± 0.545
2.651LeuPro: 2.651 ± 0.417
2.725LeuGln: 2.725 ± 0.466
3.829LeuArg: 3.829 ± 0.488
5.376LeuSer: 5.376 ± 0.682
4.271LeuThr: 4.271 ± 0.649
4.05LeuVal: 4.05 ± 0.519
0.442LeuTrp: 0.442 ± 0.158
2.577LeuTyr: 2.577 ± 0.362
0.0LeuXaa: 0.0 ± 0.0
Met
3.608MetAla: 3.608 ± 0.668
0.221MetCys: 0.221 ± 0.117
1.62MetAsp: 1.62 ± 0.247
1.473MetGlu: 1.473 ± 0.369
1.62MetPhe: 1.62 ± 0.354
1.399MetGly: 1.399 ± 0.359
0.736MetHis: 0.736 ± 0.232
2.062MetIle: 2.062 ± 0.387
2.283MetLys: 2.283 ± 0.482
2.356MetLeu: 2.356 ± 0.341
1.252MetMet: 1.252 ± 0.344
1.473MetAsn: 1.473 ± 0.298
0.957MetPro: 0.957 ± 0.278
1.62MetGln: 1.62 ± 0.352
2.283MetArg: 2.283 ± 0.436
1.473MetSer: 1.473 ± 0.295
1.841MetThr: 1.841 ± 0.41
1.988MetVal: 1.988 ± 0.355
0.368MetTrp: 0.368 ± 0.141
0.736MetTyr: 0.736 ± 0.252
0.0MetXaa: 0.0 ± 0.0
Asn
3.829AsnAla: 3.829 ± 0.793
0.515AsnCys: 0.515 ± 0.234
2.725AsnAsp: 2.725 ± 0.489
1.988AsnGlu: 1.988 ± 0.343
1.252AsnPhe: 1.252 ± 0.245
5.523AsnGly: 5.523 ± 0.792
1.178AsnHis: 1.178 ± 0.282
3.019AsnIle: 3.019 ± 0.532
2.725AsnLys: 2.725 ± 0.474
2.725AsnLeu: 2.725 ± 0.443
1.325AsnMet: 1.325 ± 0.384
2.872AsnAsn: 2.872 ± 0.507
1.62AsnPro: 1.62 ± 0.275
1.841AsnGln: 1.841 ± 0.399
2.43AsnArg: 2.43 ± 0.377
2.725AsnSer: 2.725 ± 0.382
1.841AsnThr: 1.841 ± 0.393
3.093AsnVal: 3.093 ± 0.472
0.368AsnTrp: 0.368 ± 0.135
1.325AsnTyr: 1.325 ± 0.388
0.0AsnXaa: 0.0 ± 0.0
Pro
1.988ProAla: 1.988 ± 0.454
0.368ProCys: 0.368 ± 0.168
3.019ProAsp: 3.019 ± 0.583
3.019ProGlu: 3.019 ± 0.495
1.694ProPhe: 1.694 ± 0.297
2.283ProGly: 2.283 ± 0.357
0.442ProHis: 0.442 ± 0.179
2.504ProIle: 2.504 ± 0.382
2.651ProLys: 2.651 ± 0.356
1.841ProLeu: 1.841 ± 0.341
0.81ProMet: 0.81 ± 0.266
1.325ProAsn: 1.325 ± 0.484
1.178ProPro: 1.178 ± 0.364
1.694ProGln: 1.694 ± 0.353
1.546ProArg: 1.546 ± 0.317
2.283ProSer: 2.283 ± 0.479
1.694ProThr: 1.694 ± 0.299
2.872ProVal: 2.872 ± 0.433
0.515ProTrp: 0.515 ± 0.213
1.178ProTyr: 1.178 ± 0.46
0.0ProXaa: 0.0 ± 0.0
Gln
3.608GlnAla: 3.608 ± 0.565
0.736GlnCys: 0.736 ± 0.22
1.767GlnAsp: 1.767 ± 0.389
1.915GlnGlu: 1.915 ± 0.403
1.031GlnPhe: 1.031 ± 0.221
1.841GlnGly: 1.841 ± 0.307
0.81GlnHis: 0.81 ± 0.238
3.019GlnIle: 3.019 ± 0.502
2.283GlnLys: 2.283 ± 0.475
2.651GlnLeu: 2.651 ± 0.471
1.694GlnMet: 1.694 ± 0.394
1.473GlnAsn: 1.473 ± 0.322
1.325GlnPro: 1.325 ± 0.347
2.135GlnGln: 2.135 ± 0.991
1.988GlnArg: 1.988 ± 0.401
2.062GlnSer: 2.062 ± 0.342
1.62GlnThr: 1.62 ± 0.344
2.946GlnVal: 2.946 ± 0.479
0.442GlnTrp: 0.442 ± 0.196
1.62GlnTyr: 1.62 ± 0.303
0.0GlnXaa: 0.0 ± 0.0
Arg
4.492ArgAla: 4.492 ± 0.547
0.884ArgCys: 0.884 ± 0.436
2.651ArgAsp: 2.651 ± 0.416
4.197ArgGlu: 4.197 ± 0.585
2.946ArgPhe: 2.946 ± 0.43
3.608ArgGly: 3.608 ± 0.497
1.178ArgHis: 1.178 ± 0.285
2.135ArgIle: 2.135 ± 0.292
4.197ArgLys: 4.197 ± 0.53
4.566ArgLeu: 4.566 ± 0.563
1.841ArgMet: 1.841 ± 0.4
2.283ArgAsn: 2.283 ± 0.367
1.841ArgPro: 1.841 ± 0.377
1.473ArgGln: 1.473 ± 0.364
3.903ArgArg: 3.903 ± 0.609
2.651ArgSer: 2.651 ± 0.556
1.767ArgThr: 1.767 ± 0.325
3.903ArgVal: 3.903 ± 0.614
0.589ArgTrp: 0.589 ± 0.199
1.841ArgTyr: 1.841 ± 0.369
0.0ArgXaa: 0.0 ± 0.0
Ser
5.744SerAla: 5.744 ± 0.883
0.663SerCys: 0.663 ± 0.199
4.271SerAsp: 4.271 ± 0.562
4.492SerGlu: 4.492 ± 0.532
2.798SerPhe: 2.798 ± 0.334
5.523SerGly: 5.523 ± 0.601
1.031SerHis: 1.031 ± 0.25
3.093SerIle: 3.093 ± 0.482
4.271SerLys: 4.271 ± 0.485
3.756SerLeu: 3.756 ± 0.675
0.957SerMet: 0.957 ± 0.294
2.43SerAsn: 2.43 ± 0.404
2.504SerPro: 2.504 ± 0.459
2.356SerGln: 2.356 ± 0.358
2.798SerArg: 2.798 ± 0.449
3.461SerSer: 3.461 ± 0.54
2.577SerThr: 2.577 ± 0.395
4.492SerVal: 4.492 ± 0.557
1.105SerTrp: 1.105 ± 0.259
2.135SerTyr: 2.135 ± 0.349
0.0SerXaa: 0.0 ± 0.0
Thr
3.903ThrAla: 3.903 ± 0.53
0.884ThrCys: 0.884 ± 0.255
3.24ThrAsp: 3.24 ± 0.542
2.43ThrGlu: 2.43 ± 0.378
2.725ThrPhe: 2.725 ± 0.432
5.007ThrGly: 5.007 ± 0.669
0.957ThrHis: 0.957 ± 0.271
4.197ThrIle: 4.197 ± 0.573
3.166ThrLys: 3.166 ± 0.546
3.682ThrLeu: 3.682 ± 0.567
1.399ThrMet: 1.399 ± 0.325
2.356ThrAsn: 2.356 ± 0.398
2.209ThrPro: 2.209 ± 0.53
1.62ThrGln: 1.62 ± 0.401
2.209ThrArg: 2.209 ± 0.369
3.314ThrSer: 3.314 ± 0.651
2.356ThrThr: 2.356 ± 0.405
4.345ThrVal: 4.345 ± 0.627
0.663ThrTrp: 0.663 ± 0.216
2.209ThrTyr: 2.209 ± 0.377
0.0ThrXaa: 0.0 ± 0.0
Val
6.848ValAla: 6.848 ± 0.765
0.81ValCys: 0.81 ± 0.267
3.976ValAsp: 3.976 ± 0.52
4.271ValGlu: 4.271 ± 0.646
2.577ValPhe: 2.577 ± 0.371
4.492ValGly: 4.492 ± 0.683
1.178ValHis: 1.178 ± 0.313
4.345ValIle: 4.345 ± 0.501
5.228ValLys: 5.228 ± 0.628
4.713ValLeu: 4.713 ± 0.49
1.988ValMet: 1.988 ± 0.353
4.418ValAsn: 4.418 ± 0.582
2.062ValPro: 2.062 ± 0.551
2.062ValGln: 2.062 ± 0.318
2.872ValArg: 2.872 ± 0.432
5.007ValSer: 5.007 ± 0.568
4.786ValThr: 4.786 ± 0.742
5.744ValVal: 5.744 ± 1.019
1.252ValTrp: 1.252 ± 0.271
1.915ValTyr: 1.915 ± 0.345
0.0ValXaa: 0.0 ± 0.0
Trp
1.767TrpAla: 1.767 ± 0.377
0.295TrpCys: 0.295 ± 0.156
0.81TrpAsp: 0.81 ± 0.262
0.736TrpGlu: 0.736 ± 0.315
0.663TrpPhe: 0.663 ± 0.223
0.884TrpGly: 0.884 ± 0.229
0.515TrpHis: 0.515 ± 0.232
0.589TrpIle: 0.589 ± 0.191
0.957TrpLys: 0.957 ± 0.25
1.399TrpLeu: 1.399 ± 0.304
0.442TrpMet: 0.442 ± 0.196
0.295TrpAsn: 0.295 ± 0.129
0.515TrpPro: 0.515 ± 0.17
0.736TrpGln: 0.736 ± 0.24
0.957TrpArg: 0.957 ± 0.274
0.81TrpSer: 0.81 ± 0.252
0.589TrpThr: 0.589 ± 0.2
1.031TrpVal: 1.031 ± 0.239
0.295TrpTrp: 0.295 ± 0.134
0.368TrpTyr: 0.368 ± 0.158
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.725TyrAla: 2.725 ± 0.549
0.663TyrCys: 0.663 ± 0.229
2.356TyrAsp: 2.356 ± 0.44
1.546TyrGlu: 1.546 ± 0.302
1.841TyrPhe: 1.841 ± 0.313
2.798TyrGly: 2.798 ± 0.38
0.736TyrHis: 0.736 ± 0.19
2.209TyrIle: 2.209 ± 0.383
1.988TyrLys: 1.988 ± 0.399
1.694TyrLeu: 1.694 ± 0.341
1.399TyrMet: 1.399 ± 0.257
1.252TyrAsn: 1.252 ± 0.302
1.694TyrPro: 1.694 ± 0.355
1.473TyrGln: 1.473 ± 0.281
2.062TyrArg: 2.062 ± 0.305
2.356TyrSer: 2.356 ± 0.39
2.356TyrThr: 2.356 ± 0.34
1.988TyrVal: 1.988 ± 0.383
0.515TyrTrp: 0.515 ± 0.217
1.252TyrTyr: 1.252 ± 0.275
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (13581 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski