Amino acid dipepetide frequency for Klebsiella phage KMI5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.491AlaAla: 15.491 ± 1.705
0.811AlaCys: 0.811 ± 0.273
6.492AlaAsp: 6.492 ± 0.677
5.533AlaGlu: 5.533 ± 0.715
2.803AlaPhe: 2.803 ± 0.36
8.262AlaGly: 8.262 ± 1.131
1.254AlaHis: 1.254 ± 0.308
4.721AlaIle: 4.721 ± 0.691
3.983AlaLys: 3.983 ± 0.736
8.483AlaLeu: 8.483 ± 0.801
2.951AlaMet: 2.951 ± 0.338
3.688AlaAsn: 3.688 ± 0.368
4.352AlaPro: 4.352 ± 0.958
4.942AlaGln: 4.942 ± 0.884
5.311AlaArg: 5.311 ± 0.585
5.385AlaSer: 5.385 ± 0.625
5.533AlaThr: 5.533 ± 0.726
7.008AlaVal: 7.008 ± 0.782
1.475AlaTrp: 1.475 ± 0.382
3.91AlaTyr: 3.91 ± 0.52
0.0AlaXaa: 0.0 ± 0.0
Cys
0.885CysAla: 0.885 ± 0.259
0.443CysCys: 0.443 ± 0.231
0.443CysAsp: 0.443 ± 0.165
0.664CysGlu: 0.664 ± 0.216
0.369CysPhe: 0.369 ± 0.155
0.959CysGly: 0.959 ± 0.306
0.516CysHis: 0.516 ± 0.182
0.369CysIle: 0.369 ± 0.159
0.59CysLys: 0.59 ± 0.218
0.738CysLeu: 0.738 ± 0.25
0.59CysMet: 0.59 ± 0.184
0.516CysAsn: 0.516 ± 0.207
0.59CysPro: 0.59 ± 0.244
0.369CysGln: 0.369 ± 0.179
0.738CysArg: 0.738 ± 0.221
1.328CysSer: 1.328 ± 0.331
1.18CysThr: 1.18 ± 0.301
1.033CysVal: 1.033 ± 0.301
0.369CysTrp: 0.369 ± 0.21
0.59CysTyr: 0.59 ± 0.183
0.0CysXaa: 0.0 ± 0.0
Asp
7.082AspAla: 7.082 ± 0.939
1.107AspCys: 1.107 ± 0.308
2.508AspAsp: 2.508 ± 0.413
3.32AspGlu: 3.32 ± 0.467
2.508AspPhe: 2.508 ± 0.366
4.721AspGly: 4.721 ± 0.595
0.664AspHis: 0.664 ± 0.174
3.688AspIle: 3.688 ± 0.646
2.508AspLys: 2.508 ± 0.451
5.164AspLeu: 5.164 ± 0.529
2.361AspMet: 2.361 ± 0.39
2.729AspAsn: 2.729 ± 0.434
2.508AspPro: 2.508 ± 0.398
1.475AspGln: 1.475 ± 0.284
2.729AspArg: 2.729 ± 0.497
5.164AspSer: 5.164 ± 0.628
3.762AspThr: 3.762 ± 0.633
3.91AspVal: 3.91 ± 0.442
1.328AspTrp: 1.328 ± 0.178
2.066AspTyr: 2.066 ± 0.373
0.0AspXaa: 0.0 ± 0.0
Glu
5.385GluAla: 5.385 ± 0.851
0.664GluCys: 0.664 ± 0.188
3.024GluAsp: 3.024 ± 0.41
3.688GluGlu: 3.688 ± 0.621
2.139GluPhe: 2.139 ± 0.376
3.615GluGly: 3.615 ± 0.455
2.287GluHis: 2.287 ± 0.445
2.139GluIle: 2.139 ± 0.346
1.77GluLys: 1.77 ± 0.412
5.09GluLeu: 5.09 ± 0.604
2.361GluMet: 2.361 ± 0.382
1.697GluAsn: 1.697 ± 0.358
2.287GluPro: 2.287 ± 0.465
3.393GluGln: 3.393 ± 0.614
3.615GluArg: 3.615 ± 0.598
3.246GluSer: 3.246 ± 0.573
3.024GluThr: 3.024 ± 0.405
4.279GluVal: 4.279 ± 0.561
0.738GluTrp: 0.738 ± 0.194
3.024GluTyr: 3.024 ± 0.41
0.0GluXaa: 0.0 ± 0.0
Phe
2.656PheAla: 2.656 ± 0.413
0.369PheCys: 0.369 ± 0.233
2.582PheAsp: 2.582 ± 0.41
1.844PheGlu: 1.844 ± 0.372
1.328PhePhe: 1.328 ± 0.3
2.066PheGly: 2.066 ± 0.299
0.516PheHis: 0.516 ± 0.163
1.107PheIle: 1.107 ± 0.242
2.213PheLys: 2.213 ± 0.389
1.697PheLeu: 1.697 ± 0.388
0.738PheMet: 0.738 ± 0.265
1.402PheAsn: 1.402 ± 0.36
1.107PhePro: 1.107 ± 0.243
1.18PheGln: 1.18 ± 0.231
1.697PheArg: 1.697 ± 0.365
2.066PheSer: 2.066 ± 0.366
1.992PheThr: 1.992 ± 0.4
2.066PheVal: 2.066 ± 0.391
0.443PheTrp: 0.443 ± 0.163
1.254PheTyr: 1.254 ± 0.282
0.0PheXaa: 0.0 ± 0.0
Gly
6.713GlyAla: 6.713 ± 0.618
1.844GlyCys: 1.844 ± 0.43
4.721GlyAsp: 4.721 ± 0.644
4.131GlyGlu: 4.131 ± 0.551
2.729GlyPhe: 2.729 ± 0.437
4.426GlyGly: 4.426 ± 0.638
1.18GlyHis: 1.18 ± 0.353
3.983GlyIle: 3.983 ± 0.512
3.762GlyLys: 3.762 ± 0.536
6.197GlyLeu: 6.197 ± 0.741
1.992GlyMet: 1.992 ± 0.535
3.541GlyAsn: 3.541 ± 0.535
1.844GlyPro: 1.844 ± 0.327
3.024GlyGln: 3.024 ± 0.444
5.164GlyArg: 5.164 ± 0.499
5.238GlySer: 5.238 ± 0.716
4.942GlyThr: 4.942 ± 0.668
5.975GlyVal: 5.975 ± 0.702
0.738GlyTrp: 0.738 ± 0.198
3.467GlyTyr: 3.467 ± 0.474
0.0GlyXaa: 0.0 ± 0.0
His
1.402HisAla: 1.402 ± 0.358
0.295HisCys: 0.295 ± 0.122
1.18HisAsp: 1.18 ± 0.292
1.402HisGlu: 1.402 ± 0.391
0.221HisPhe: 0.221 ± 0.108
2.066HisGly: 2.066 ± 0.49
0.221HisHis: 0.221 ± 0.122
1.18HisIle: 1.18 ± 0.266
0.885HisLys: 0.885 ± 0.2
2.361HisLeu: 2.361 ± 0.475
0.516HisMet: 0.516 ± 0.172
0.811HisAsn: 0.811 ± 0.243
0.811HisPro: 0.811 ± 0.354
0.664HisGln: 0.664 ± 0.217
1.402HisArg: 1.402 ± 0.307
0.59HisSer: 0.59 ± 0.226
0.516HisThr: 0.516 ± 0.199
0.59HisVal: 0.59 ± 0.184
0.148HisTrp: 0.148 ± 0.092
0.959HisTyr: 0.959 ± 0.292
0.0HisXaa: 0.0 ± 0.0
Ile
2.951IleAla: 2.951 ± 0.477
0.221IleCys: 0.221 ± 0.12
2.877IleAsp: 2.877 ± 0.427
2.656IleGlu: 2.656 ± 0.453
0.59IlePhe: 0.59 ± 0.176
2.803IleGly: 2.803 ± 0.429
0.959IleHis: 0.959 ± 0.283
2.361IleIle: 2.361 ± 0.401
3.024IleLys: 3.024 ± 0.597
4.352IleLeu: 4.352 ± 0.453
1.254IleMet: 1.254 ± 0.295
1.918IleAsn: 1.918 ± 0.297
2.139IlePro: 2.139 ± 0.394
3.615IleGln: 3.615 ± 0.479
2.803IleArg: 2.803 ± 0.378
3.32IleSer: 3.32 ± 0.535
2.951IleThr: 2.951 ± 0.455
2.508IleVal: 2.508 ± 0.37
0.295IleTrp: 0.295 ± 0.144
1.623IleTyr: 1.623 ± 0.31
0.0IleXaa: 0.0 ± 0.0
Lys
5.828LysAla: 5.828 ± 0.965
0.59LysCys: 0.59 ± 0.246
2.508LysAsp: 2.508 ± 0.334
3.024LysGlu: 3.024 ± 0.516
0.811LysPhe: 0.811 ± 0.246
3.024LysGly: 3.024 ± 0.523
0.811LysHis: 0.811 ± 0.24
1.254LysIle: 1.254 ± 0.244
1.697LysLys: 1.697 ± 0.489
4.205LysLeu: 4.205 ± 0.602
1.254LysMet: 1.254 ± 0.252
1.402LysAsn: 1.402 ± 0.297
1.549LysPro: 1.549 ± 0.342
3.836LysGln: 3.836 ± 0.583
2.877LysArg: 2.877 ± 0.398
2.656LysSer: 2.656 ± 0.342
2.951LysThr: 2.951 ± 0.379
3.688LysVal: 3.688 ± 0.484
0.738LysTrp: 0.738 ± 0.242
1.623LysTyr: 1.623 ± 0.417
0.0LysXaa: 0.0 ± 0.0
Leu
7.524LeuAla: 7.524 ± 0.754
1.107LeuCys: 1.107 ± 0.299
6.27LeuAsp: 6.27 ± 0.617
5.459LeuGlu: 5.459 ± 0.496
2.582LeuPhe: 2.582 ± 0.356
6.418LeuGly: 6.418 ± 0.7
1.697LeuHis: 1.697 ± 0.327
4.131LeuIle: 4.131 ± 0.809
3.098LeuLys: 3.098 ± 0.448
6.344LeuLeu: 6.344 ± 0.617
1.844LeuMet: 1.844 ± 0.287
3.762LeuAsn: 3.762 ± 0.481
3.615LeuPro: 3.615 ± 0.522
4.352LeuGln: 4.352 ± 0.494
6.492LeuArg: 6.492 ± 0.636
4.647LeuSer: 4.647 ± 0.537
5.09LeuThr: 5.09 ± 0.601
6.418LeuVal: 6.418 ± 0.772
1.328LeuTrp: 1.328 ± 0.334
3.246LeuTyr: 3.246 ± 0.453
0.0LeuXaa: 0.0 ± 0.0
Met
3.246MetAla: 3.246 ± 0.557
0.221MetCys: 0.221 ± 0.115
2.139MetAsp: 2.139 ± 0.449
1.107MetGlu: 1.107 ± 0.234
0.811MetPhe: 0.811 ± 0.296
1.549MetGly: 1.549 ± 0.248
0.885MetHis: 0.885 ± 0.285
1.107MetIle: 1.107 ± 0.27
1.328MetLys: 1.328 ± 0.356
3.393MetLeu: 3.393 ± 0.5
0.664MetMet: 0.664 ± 0.252
0.885MetAsn: 0.885 ± 0.296
1.18MetPro: 1.18 ± 0.217
1.623MetGln: 1.623 ± 0.298
2.066MetArg: 2.066 ± 0.391
2.287MetSer: 2.287 ± 0.462
1.254MetThr: 1.254 ± 0.313
2.066MetVal: 2.066 ± 0.389
0.59MetTrp: 0.59 ± 0.156
0.959MetTyr: 0.959 ± 0.319
0.0MetXaa: 0.0 ± 0.0
Asn
3.098AsnAla: 3.098 ± 0.455
0.516AsnCys: 0.516 ± 0.206
2.066AsnAsp: 2.066 ± 0.425
1.475AsnGlu: 1.475 ± 0.347
0.959AsnPhe: 0.959 ± 0.282
4.279AsnGly: 4.279 ± 0.538
0.148AsnHis: 0.148 ± 0.106
2.361AsnIle: 2.361 ± 0.391
1.844AsnLys: 1.844 ± 0.297
3.836AsnLeu: 3.836 ± 0.485
1.18AsnMet: 1.18 ± 0.276
1.992AsnAsn: 1.992 ± 0.419
2.287AsnPro: 2.287 ± 0.351
1.77AsnGln: 1.77 ± 0.359
2.729AsnArg: 2.729 ± 0.425
3.024AsnSer: 3.024 ± 0.536
2.582AsnThr: 2.582 ± 0.35
3.615AsnVal: 3.615 ± 0.455
0.443AsnTrp: 0.443 ± 0.19
1.475AsnTyr: 1.475 ± 0.4
0.0AsnXaa: 0.0 ± 0.0
Pro
4.721ProAla: 4.721 ± 0.979
0.221ProCys: 0.221 ± 0.121
2.508ProAsp: 2.508 ± 0.471
3.467ProGlu: 3.467 ± 0.388
0.959ProPhe: 0.959 ± 0.225
2.656ProGly: 2.656 ± 0.604
0.443ProHis: 0.443 ± 0.198
1.623ProIle: 1.623 ± 0.335
2.066ProLys: 2.066 ± 0.39
2.656ProLeu: 2.656 ± 0.448
1.328ProMet: 1.328 ± 0.29
1.623ProAsn: 1.623 ± 0.344
0.59ProPro: 0.59 ± 0.225
1.402ProGln: 1.402 ± 0.302
1.77ProArg: 1.77 ± 0.347
2.803ProSer: 2.803 ± 0.427
2.434ProThr: 2.434 ± 0.31
3.024ProVal: 3.024 ± 0.383
0.664ProTrp: 0.664 ± 0.222
1.328ProTyr: 1.328 ± 0.356
0.0ProXaa: 0.0 ± 0.0
Gln
5.459GlnAla: 5.459 ± 0.691
0.443GlnCys: 0.443 ± 0.191
2.508GlnAsp: 2.508 ± 0.506
3.836GlnGlu: 3.836 ± 0.631
1.475GlnPhe: 1.475 ± 0.26
3.467GlnGly: 3.467 ± 0.579
1.328GlnHis: 1.328 ± 0.403
1.107GlnIle: 1.107 ± 0.282
2.287GlnLys: 2.287 ± 0.422
5.164GlnLeu: 5.164 ± 0.563
1.328GlnMet: 1.328 ± 0.265
2.213GlnAsn: 2.213 ± 0.397
1.77GlnPro: 1.77 ± 0.421
2.951GlnGln: 2.951 ± 0.641
3.393GlnArg: 3.393 ± 0.411
3.024GlnSer: 3.024 ± 0.423
2.066GlnThr: 2.066 ± 0.438
2.729GlnVal: 2.729 ± 0.537
0.738GlnTrp: 0.738 ± 0.212
1.992GlnTyr: 1.992 ± 0.464
0.0GlnXaa: 0.0 ± 0.0
Arg
6.049ArgAla: 6.049 ± 0.697
1.033ArgCys: 1.033 ± 0.303
3.246ArgAsp: 3.246 ± 0.491
3.32ArgGlu: 3.32 ± 0.435
2.066ArgPhe: 2.066 ± 0.327
3.836ArgGly: 3.836 ± 0.657
0.811ArgHis: 0.811 ± 0.163
3.32ArgIle: 3.32 ± 0.551
3.172ArgLys: 3.172 ± 0.473
5.09ArgLeu: 5.09 ± 0.632
2.213ArgMet: 2.213 ± 0.353
2.508ArgAsn: 2.508 ± 0.411
1.475ArgPro: 1.475 ± 0.367
2.656ArgGln: 2.656 ± 0.445
4.426ArgArg: 4.426 ± 0.791
2.951ArgSer: 2.951 ± 0.528
3.983ArgThr: 3.983 ± 0.411
4.5ArgVal: 4.5 ± 0.539
0.959ArgTrp: 0.959 ± 0.222
2.361ArgTyr: 2.361 ± 0.37
0.0ArgXaa: 0.0 ± 0.0
Ser
7.819SerAla: 7.819 ± 0.748
0.885SerCys: 0.885 ± 0.269
3.836SerAsp: 3.836 ± 0.523
2.951SerGlu: 2.951 ± 0.468
2.066SerPhe: 2.066 ± 0.388
6.565SerGly: 6.565 ± 0.788
0.811SerHis: 0.811 ± 0.253
3.246SerIle: 3.246 ± 0.625
3.467SerLys: 3.467 ± 0.568
4.574SerLeu: 4.574 ± 0.627
2.582SerMet: 2.582 ± 0.312
2.803SerAsn: 2.803 ± 0.563
2.361SerPro: 2.361 ± 0.374
2.434SerGln: 2.434 ± 0.489
2.877SerArg: 2.877 ± 0.486
4.057SerSer: 4.057 ± 0.689
3.91SerThr: 3.91 ± 0.4
4.131SerVal: 4.131 ± 0.421
0.738SerTrp: 0.738 ± 0.212
1.697SerTyr: 1.697 ± 0.42
0.0SerXaa: 0.0 ± 0.0
Thr
5.901ThrAla: 5.901 ± 0.679
0.664ThrCys: 0.664 ± 0.243
3.393ThrAsp: 3.393 ± 0.471
2.582ThrGlu: 2.582 ± 0.537
2.139ThrPhe: 2.139 ± 0.356
5.164ThrGly: 5.164 ± 0.671
0.959ThrHis: 0.959 ± 0.295
2.508ThrIle: 2.508 ± 0.379
2.951ThrLys: 2.951 ± 0.513
4.5ThrLeu: 4.5 ± 0.44
1.77ThrMet: 1.77 ± 0.446
2.434ThrAsn: 2.434 ± 0.515
2.361ThrPro: 2.361 ± 0.31
3.024ThrGln: 3.024 ± 0.387
2.508ThrArg: 2.508 ± 0.44
4.5ThrSer: 4.5 ± 0.688
3.615ThrThr: 3.615 ± 0.464
4.721ThrVal: 4.721 ± 0.787
0.59ThrTrp: 0.59 ± 0.177
2.287ThrTyr: 2.287 ± 0.45
0.0ThrXaa: 0.0 ± 0.0
Val
6.639ValAla: 6.639 ± 0.729
0.59ValCys: 0.59 ± 0.243
5.385ValAsp: 5.385 ± 0.504
3.836ValGlu: 3.836 ± 0.525
1.918ValPhe: 1.918 ± 0.351
6.418ValGly: 6.418 ± 0.643
1.992ValHis: 1.992 ± 0.338
2.287ValIle: 2.287 ± 0.411
2.951ValLys: 2.951 ± 0.443
6.492ValLeu: 6.492 ± 0.942
1.402ValMet: 1.402 ± 0.29
2.656ValAsn: 2.656 ± 0.721
3.615ValPro: 3.615 ± 0.526
3.983ValGln: 3.983 ± 0.574
4.057ValArg: 4.057 ± 0.438
4.131ValSer: 4.131 ± 0.665
3.467ValThr: 3.467 ± 0.583
5.828ValVal: 5.828 ± 0.611
0.885ValTrp: 0.885 ± 0.24
2.434ValTyr: 2.434 ± 0.353
0.0ValXaa: 0.0 ± 0.0
Trp
0.811TrpAla: 0.811 ± 0.188
0.295TrpCys: 0.295 ± 0.123
0.811TrpAsp: 0.811 ± 0.255
1.254TrpGlu: 1.254 ± 0.281
0.811TrpPhe: 0.811 ± 0.261
0.664TrpGly: 0.664 ± 0.224
0.369TrpHis: 0.369 ± 0.136
0.59TrpIle: 0.59 ± 0.208
0.738TrpLys: 0.738 ± 0.244
1.549TrpLeu: 1.549 ± 0.244
0.148TrpMet: 0.148 ± 0.166
1.033TrpAsn: 1.033 ± 0.271
0.443TrpPro: 0.443 ± 0.212
0.516TrpGln: 0.516 ± 0.199
1.107TrpArg: 1.107 ± 0.25
0.516TrpSer: 0.516 ± 0.226
0.811TrpThr: 0.811 ± 0.184
0.959TrpVal: 0.959 ± 0.267
0.221TrpTrp: 0.221 ± 0.124
0.59TrpTyr: 0.59 ± 0.191
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.729TyrAla: 2.729 ± 0.413
0.959TyrCys: 0.959 ± 0.225
2.803TyrAsp: 2.803 ± 0.451
1.918TyrGlu: 1.918 ± 0.48
1.033TyrPhe: 1.033 ± 0.236
2.656TyrGly: 2.656 ± 0.48
0.369TyrHis: 0.369 ± 0.148
2.213TyrIle: 2.213 ± 0.425
1.992TyrLys: 1.992 ± 0.378
3.541TyrLeu: 3.541 ± 0.401
0.811TyrMet: 0.811 ± 0.262
2.066TyrAsn: 2.066 ± 0.377
1.402TyrPro: 1.402 ± 0.29
2.066TyrGln: 2.066 ± 0.336
2.213TyrArg: 2.213 ± 0.368
2.877TyrSer: 2.877 ± 0.395
2.508TyrThr: 2.508 ± 0.355
1.918TyrVal: 1.918 ± 0.413
0.811TyrTrp: 0.811 ± 0.27
1.402TyrTyr: 1.402 ± 0.311
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (13557 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski