Amino acid dipepetide frequency for Klebsiella phage ST101-KPC2phi6.3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.354AlaAla: 9.354 ± 1.201
0.928AlaCys: 0.928 ± 0.288
4.252AlaAsp: 4.252 ± 0.608
6.03AlaGlu: 6.03 ± 0.554
4.02AlaPhe: 4.02 ± 0.624
6.339AlaGly: 6.339 ± 0.75
1.855AlaHis: 1.855 ± 0.422
4.793AlaIle: 4.793 ± 0.476
4.329AlaLys: 4.329 ± 0.497
9.508AlaLeu: 9.508 ± 1.379
3.169AlaMet: 3.169 ± 0.546
3.479AlaAsn: 3.479 ± 0.493
2.938AlaPro: 2.938 ± 0.489
3.556AlaGln: 3.556 ± 0.811
6.957AlaArg: 6.957 ± 0.983
6.648AlaSer: 6.648 ± 0.802
4.947AlaThr: 4.947 ± 0.684
6.725AlaVal: 6.725 ± 0.701
1.082AlaTrp: 1.082 ± 0.269
2.551AlaTyr: 2.551 ± 0.403
0.0AlaXaa: 0.0 ± 0.0
Cys
0.464CysAla: 0.464 ± 0.235
0.232CysCys: 0.232 ± 0.148
0.773CysAsp: 0.773 ± 0.244
0.696CysGlu: 0.696 ± 0.225
0.541CysPhe: 0.541 ± 0.197
0.618CysGly: 0.618 ± 0.232
0.155CysHis: 0.155 ± 0.117
0.618CysIle: 0.618 ± 0.256
0.696CysLys: 0.696 ± 0.279
0.541CysLeu: 0.541 ± 0.22
0.077CysMet: 0.077 ± 0.12
0.541CysAsn: 0.541 ± 0.2
0.155CysPro: 0.155 ± 0.098
0.541CysGln: 0.541 ± 0.22
1.469CysArg: 1.469 ± 0.372
0.928CysSer: 0.928 ± 0.281
0.696CysThr: 0.696 ± 0.259
0.85CysVal: 0.85 ± 0.271
0.309CysTrp: 0.309 ± 0.181
0.309CysTyr: 0.309 ± 0.135
0.0CysXaa: 0.0 ± 0.0
Asp
5.334AspAla: 5.334 ± 0.62
0.541AspCys: 0.541 ± 0.201
4.947AspAsp: 4.947 ± 0.971
3.633AspGlu: 3.633 ± 0.581
1.778AspPhe: 1.778 ± 0.448
4.793AspGly: 4.793 ± 0.651
0.773AspHis: 0.773 ± 0.207
3.401AspIle: 3.401 ± 0.491
2.165AspLys: 2.165 ± 0.31
4.793AspLeu: 4.793 ± 0.489
1.16AspMet: 1.16 ± 0.318
1.855AspAsn: 1.855 ± 0.318
2.628AspPro: 2.628 ± 0.436
1.469AspGln: 1.469 ± 0.406
3.401AspArg: 3.401 ± 0.471
3.247AspSer: 3.247 ± 0.582
2.551AspThr: 2.551 ± 0.436
3.556AspVal: 3.556 ± 0.614
1.082AspTrp: 1.082 ± 0.223
2.242AspTyr: 2.242 ± 0.439
0.0AspXaa: 0.0 ± 0.0
Glu
6.571GluAla: 6.571 ± 0.736
0.618GluCys: 0.618 ± 0.23
3.092GluAsp: 3.092 ± 0.531
3.633GluGlu: 3.633 ± 0.517
1.701GluPhe: 1.701 ± 0.427
3.633GluGly: 3.633 ± 0.546
2.01GluHis: 2.01 ± 0.394
4.406GluIle: 4.406 ± 0.556
3.015GluLys: 3.015 ± 0.504
7.73GluLeu: 7.73 ± 0.917
1.469GluMet: 1.469 ± 0.251
2.551GluAsn: 2.551 ± 0.337
3.092GluPro: 3.092 ± 0.622
3.865GluGln: 3.865 ± 0.536
4.638GluArg: 4.638 ± 0.47
4.097GluSer: 4.097 ± 0.477
2.783GluThr: 2.783 ± 0.422
4.561GluVal: 4.561 ± 0.535
1.546GluTrp: 1.546 ± 0.307
1.855GluTyr: 1.855 ± 0.395
0.0GluXaa: 0.0 ± 0.0
Phe
2.474PheAla: 2.474 ± 0.382
0.696PheCys: 0.696 ± 0.237
2.319PheAsp: 2.319 ± 0.534
2.396PheGlu: 2.396 ± 0.431
1.16PhePhe: 1.16 ± 0.326
2.628PheGly: 2.628 ± 0.378
0.85PheHis: 0.85 ± 0.258
1.933PheIle: 1.933 ± 0.411
1.546PheLys: 1.546 ± 0.361
2.86PheLeu: 2.86 ± 0.387
1.005PheMet: 1.005 ± 0.314
1.933PheAsn: 1.933 ± 0.466
1.701PhePro: 1.701 ± 0.356
0.928PheGln: 0.928 ± 0.22
2.087PheArg: 2.087 ± 0.413
2.706PheSer: 2.706 ± 0.535
2.86PheThr: 2.86 ± 0.39
1.469PheVal: 1.469 ± 0.404
0.387PheTrp: 0.387 ± 0.167
0.85PheTyr: 0.85 ± 0.212
0.0PheXaa: 0.0 ± 0.0
Gly
6.571GlyAla: 6.571 ± 1.085
0.85GlyCys: 0.85 ± 0.307
2.938GlyAsp: 2.938 ± 0.451
3.942GlyGlu: 3.942 ± 0.525
2.474GlyPhe: 2.474 ± 0.495
5.334GlyGly: 5.334 ± 0.889
0.85GlyHis: 0.85 ± 0.274
4.484GlyIle: 4.484 ± 0.538
4.174GlyLys: 4.174 ± 0.632
6.648GlyLeu: 6.648 ± 0.778
1.701GlyMet: 1.701 ± 0.391
3.169GlyAsn: 3.169 ± 0.453
1.778GlyPro: 1.778 ± 0.493
3.247GlyGln: 3.247 ± 0.476
3.942GlyArg: 3.942 ± 0.53
4.329GlySer: 4.329 ± 0.642
3.788GlyThr: 3.788 ± 0.615
5.798GlyVal: 5.798 ± 0.716
1.005GlyTrp: 1.005 ± 0.253
2.551GlyTyr: 2.551 ± 0.422
0.0GlyXaa: 0.0 ± 0.0
His
1.314HisAla: 1.314 ± 0.371
0.387HisCys: 0.387 ± 0.171
1.082HisAsp: 1.082 ± 0.323
1.16HisGlu: 1.16 ± 0.325
0.618HisPhe: 0.618 ± 0.267
1.314HisGly: 1.314 ± 0.341
0.541HisHis: 0.541 ± 0.206
0.928HisIle: 0.928 ± 0.302
1.237HisLys: 1.237 ± 0.317
1.933HisLeu: 1.933 ± 0.38
0.618HisMet: 0.618 ± 0.208
0.618HisAsn: 0.618 ± 0.208
1.237HisPro: 1.237 ± 0.292
1.469HisGln: 1.469 ± 0.344
1.082HisArg: 1.082 ± 0.247
0.541HisSer: 0.541 ± 0.172
0.85HisThr: 0.85 ± 0.238
1.237HisVal: 1.237 ± 0.324
0.309HisTrp: 0.309 ± 0.169
0.618HisTyr: 0.618 ± 0.232
0.0HisXaa: 0.0 ± 0.0
Ile
4.716IleAla: 4.716 ± 0.524
0.696IleCys: 0.696 ± 0.246
4.02IleAsp: 4.02 ± 0.633
3.556IleGlu: 3.556 ± 0.472
1.237IlePhe: 1.237 ± 0.326
3.247IleGly: 3.247 ± 0.552
0.541IleHis: 0.541 ± 0.207
2.86IleIle: 2.86 ± 0.442
2.938IleLys: 2.938 ± 0.452
4.097IleLeu: 4.097 ± 0.629
1.16IleMet: 1.16 ± 0.32
2.783IleAsn: 2.783 ± 0.565
2.396IlePro: 2.396 ± 0.484
1.933IleGln: 1.933 ± 0.391
2.938IleArg: 2.938 ± 0.43
4.484IleSer: 4.484 ± 0.517
4.793IleThr: 4.793 ± 0.609
3.092IleVal: 3.092 ± 0.6
0.541IleTrp: 0.541 ± 0.221
1.546IleTyr: 1.546 ± 0.289
0.0IleXaa: 0.0 ± 0.0
Lys
5.179LysAla: 5.179 ± 0.626
0.077LysCys: 0.077 ± 0.074
2.396LysAsp: 2.396 ± 0.481
4.406LysGlu: 4.406 ± 0.769
1.855LysPhe: 1.855 ± 0.323
3.015LysGly: 3.015 ± 0.491
0.773LysHis: 0.773 ± 0.261
1.701LysIle: 1.701 ± 0.366
2.474LysLys: 2.474 ± 0.443
3.942LysLeu: 3.942 ± 0.634
1.314LysMet: 1.314 ± 0.372
2.165LysAsn: 2.165 ± 0.488
2.551LysPro: 2.551 ± 0.512
2.396LysGln: 2.396 ± 0.335
3.092LysArg: 3.092 ± 0.623
3.324LysSer: 3.324 ± 0.596
3.556LysThr: 3.556 ± 0.569
2.938LysVal: 2.938 ± 0.388
1.005LysTrp: 1.005 ± 0.269
1.005LysTyr: 1.005 ± 0.245
0.0LysXaa: 0.0 ± 0.0
Leu
9.895LeuAla: 9.895 ± 0.856
1.855LeuCys: 1.855 ± 0.393
4.484LeuAsp: 4.484 ± 0.535
6.262LeuGlu: 6.262 ± 0.587
3.092LeuPhe: 3.092 ± 0.499
5.257LeuGly: 5.257 ± 0.895
1.778LeuHis: 1.778 ± 0.224
4.638LeuIle: 4.638 ± 0.648
5.798LeuLys: 5.798 ± 0.746
8.813LeuLeu: 8.813 ± 0.962
3.015LeuMet: 3.015 ± 0.496
3.169LeuAsn: 3.169 ± 0.417
4.947LeuPro: 4.947 ± 0.605
2.706LeuGln: 2.706 ± 0.562
5.489LeuArg: 5.489 ± 0.812
7.344LeuSer: 7.344 ± 0.698
5.72LeuThr: 5.72 ± 0.73
6.03LeuVal: 6.03 ± 0.697
0.928LeuTrp: 0.928 ± 0.284
3.169LeuTyr: 3.169 ± 0.67
0.0LeuXaa: 0.0 ± 0.0
Met
2.86MetAla: 2.86 ± 0.517
0.232MetCys: 0.232 ± 0.118
1.082MetAsp: 1.082 ± 0.236
0.696MetGlu: 0.696 ± 0.235
1.005MetPhe: 1.005 ± 0.268
1.933MetGly: 1.933 ± 0.374
0.155MetHis: 0.155 ± 0.111
1.546MetIle: 1.546 ± 0.272
2.01MetLys: 2.01 ± 0.372
2.706MetLeu: 2.706 ± 0.533
0.309MetMet: 0.309 ± 0.141
1.469MetAsn: 1.469 ± 0.308
1.778MetPro: 1.778 ± 0.293
0.85MetGln: 0.85 ± 0.286
1.855MetArg: 1.855 ± 0.352
1.933MetSer: 1.933 ± 0.34
2.396MetThr: 2.396 ± 0.437
1.314MetVal: 1.314 ± 0.339
0.387MetTrp: 0.387 ± 0.17
0.309MetTyr: 0.309 ± 0.128
0.0MetXaa: 0.0 ± 0.0
Asn
3.865AsnAla: 3.865 ± 0.712
0.618AsnCys: 0.618 ± 0.192
1.701AsnAsp: 1.701 ± 0.3
2.474AsnGlu: 2.474 ± 0.377
1.005AsnPhe: 1.005 ± 0.214
2.706AsnGly: 2.706 ± 0.391
0.618AsnHis: 0.618 ± 0.168
2.706AsnIle: 2.706 ± 0.475
1.778AsnLys: 1.778 ± 0.376
3.479AsnLeu: 3.479 ± 0.508
1.237AsnMet: 1.237 ± 0.272
1.933AsnAsn: 1.933 ± 0.395
2.628AsnPro: 2.628 ± 0.486
1.237AsnGln: 1.237 ± 0.292
1.778AsnArg: 1.778 ± 0.285
2.319AsnSer: 2.319 ± 0.432
1.623AsnThr: 1.623 ± 0.378
2.242AsnVal: 2.242 ± 0.375
0.541AsnTrp: 0.541 ± 0.194
1.391AsnTyr: 1.391 ± 0.288
0.0AsnXaa: 0.0 ± 0.0
Pro
5.102ProAla: 5.102 ± 0.562
0.464ProCys: 0.464 ± 0.206
3.015ProAsp: 3.015 ± 0.585
4.406ProGlu: 4.406 ± 0.869
1.855ProPhe: 1.855 ± 0.381
3.479ProGly: 3.479 ± 0.565
0.85ProHis: 0.85 ± 0.285
2.087ProIle: 2.087 ± 0.561
1.314ProLys: 1.314 ± 0.242
4.484ProLeu: 4.484 ± 0.711
1.701ProMet: 1.701 ± 0.415
1.005ProAsn: 1.005 ± 0.302
2.396ProPro: 2.396 ± 0.494
1.469ProGln: 1.469 ± 0.355
2.783ProArg: 2.783 ± 0.496
2.86ProSer: 2.86 ± 0.458
1.701ProThr: 1.701 ± 0.356
3.942ProVal: 3.942 ± 0.772
0.387ProTrp: 0.387 ± 0.177
1.237ProTyr: 1.237 ± 0.322
0.0ProXaa: 0.0 ± 0.0
Gln
4.638GlnAla: 4.638 ± 0.769
0.541GlnCys: 0.541 ± 0.187
1.778GlnAsp: 1.778 ± 0.387
2.474GlnGlu: 2.474 ± 0.506
1.314GlnPhe: 1.314 ± 0.299
2.86GlnGly: 2.86 ± 0.431
0.928GlnHis: 0.928 ± 0.245
2.01GlnIle: 2.01 ± 0.402
2.242GlnLys: 2.242 ± 0.589
3.788GlnLeu: 3.788 ± 0.646
1.082GlnMet: 1.082 ± 0.257
1.082GlnAsn: 1.082 ± 0.348
1.778GlnPro: 1.778 ± 0.394
2.319GlnGln: 2.319 ± 0.434
3.247GlnArg: 3.247 ± 0.501
2.087GlnSer: 2.087 ± 0.445
2.628GlnThr: 2.628 ± 0.504
3.092GlnVal: 3.092 ± 0.664
0.541GlnTrp: 0.541 ± 0.166
0.85GlnTyr: 0.85 ± 0.289
0.0GlnXaa: 0.0 ± 0.0
Arg
4.87ArgAla: 4.87 ± 0.747
0.464ArgCys: 0.464 ± 0.23
3.015ArgAsp: 3.015 ± 0.431
5.102ArgGlu: 5.102 ± 0.531
3.092ArgPhe: 3.092 ± 0.553
4.02ArgGly: 4.02 ± 0.528
2.242ArgHis: 2.242 ± 0.517
3.633ArgIle: 3.633 ± 0.639
3.015ArgLys: 3.015 ± 0.475
6.88ArgLeu: 6.88 ± 0.71
2.319ArgMet: 2.319 ± 0.404
2.087ArgAsn: 2.087 ± 0.381
2.087ArgPro: 2.087 ± 0.4
2.938ArgGln: 2.938 ± 0.718
5.72ArgArg: 5.72 ± 0.641
2.551ArgSer: 2.551 ± 0.42
3.092ArgThr: 3.092 ± 0.55
4.02ArgVal: 4.02 ± 0.667
1.314ArgTrp: 1.314 ± 0.373
1.933ArgTyr: 1.933 ± 0.549
0.0ArgXaa: 0.0 ± 0.0
Ser
6.107SerAla: 6.107 ± 0.759
0.618SerCys: 0.618 ± 0.221
5.179SerAsp: 5.179 ± 0.816
4.406SerGlu: 4.406 ± 0.697
2.551SerPhe: 2.551 ± 0.371
6.725SerGly: 6.725 ± 0.916
1.16SerHis: 1.16 ± 0.293
2.396SerIle: 2.396 ± 0.379
3.169SerLys: 3.169 ± 0.493
6.494SerLeu: 6.494 ± 0.861
1.701SerMet: 1.701 ± 0.284
2.474SerAsn: 2.474 ± 0.467
2.86SerPro: 2.86 ± 0.411
3.015SerGln: 3.015 ± 0.618
4.02SerArg: 4.02 ± 0.635
4.329SerSer: 4.329 ± 0.971
2.628SerThr: 2.628 ± 0.624
3.401SerVal: 3.401 ± 0.421
1.855SerTrp: 1.855 ± 0.424
1.469SerTyr: 1.469 ± 0.388
0.0SerXaa: 0.0 ± 0.0
Thr
5.952ThrAla: 5.952 ± 0.773
0.464ThrCys: 0.464 ± 0.18
3.015ThrAsp: 3.015 ± 0.497
4.097ThrGlu: 4.097 ± 0.558
1.855ThrPhe: 1.855 ± 0.386
4.87ThrGly: 4.87 ± 0.779
1.16ThrHis: 1.16 ± 0.236
3.015ThrIle: 3.015 ± 0.531
2.319ThrLys: 2.319 ± 0.438
5.489ThrLeu: 5.489 ± 0.586
1.391ThrMet: 1.391 ± 0.291
1.778ThrAsn: 1.778 ± 0.331
3.015ThrPro: 3.015 ± 0.412
2.783ThrGln: 2.783 ± 0.49
3.324ThrArg: 3.324 ± 0.401
3.324ThrSer: 3.324 ± 0.643
3.479ThrThr: 3.479 ± 0.473
4.174ThrVal: 4.174 ± 0.555
1.16ThrTrp: 1.16 ± 0.341
1.237ThrTyr: 1.237 ± 0.463
0.0ThrXaa: 0.0 ± 0.0
Val
4.638ValAla: 4.638 ± 0.503
0.464ValCys: 0.464 ± 0.214
3.865ValAsp: 3.865 ± 0.513
4.406ValGlu: 4.406 ± 0.725
2.319ValPhe: 2.319 ± 0.434
4.02ValGly: 4.02 ± 0.564
1.16ValHis: 1.16 ± 0.333
4.484ValIle: 4.484 ± 0.595
3.015ValLys: 3.015 ± 0.569
5.72ValLeu: 5.72 ± 0.771
1.314ValMet: 1.314 ± 0.357
2.706ValAsn: 2.706 ± 0.436
3.942ValPro: 3.942 ± 0.489
2.242ValGln: 2.242 ± 0.413
3.788ValArg: 3.788 ± 0.563
5.566ValSer: 5.566 ± 0.544
4.716ValThr: 4.716 ± 0.881
4.406ValVal: 4.406 ± 0.728
0.928ValTrp: 0.928 ± 0.278
2.396ValTyr: 2.396 ± 0.386
0.0ValXaa: 0.0 ± 0.0
Trp
1.237TrpAla: 1.237 ± 0.334
0.387TrpCys: 0.387 ± 0.167
0.541TrpAsp: 0.541 ± 0.18
1.391TrpGlu: 1.391 ± 0.314
0.387TrpPhe: 0.387 ± 0.157
0.928TrpGly: 0.928 ± 0.307
0.387TrpHis: 0.387 ± 0.159
0.928TrpIle: 0.928 ± 0.219
0.696TrpLys: 0.696 ± 0.259
1.546TrpLeu: 1.546 ± 0.375
0.387TrpMet: 0.387 ± 0.156
0.309TrpAsn: 0.309 ± 0.139
0.696TrpPro: 0.696 ± 0.194
0.928TrpGln: 0.928 ± 0.25
1.314TrpArg: 1.314 ± 0.301
1.237TrpSer: 1.237 ± 0.354
0.928TrpThr: 0.928 ± 0.261
1.623TrpVal: 1.623 ± 0.354
0.387TrpTrp: 0.387 ± 0.174
0.309TrpTyr: 0.309 ± 0.146
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.319TyrAla: 2.319 ± 0.359
0.155TyrCys: 0.155 ± 0.128
1.855TyrAsp: 1.855 ± 0.438
1.701TyrGlu: 1.701 ± 0.386
0.928TyrPhe: 0.928 ± 0.231
1.855TyrGly: 1.855 ± 0.362
0.309TyrHis: 0.309 ± 0.141
1.005TyrIle: 1.005 ± 0.296
1.237TyrLys: 1.237 ± 0.256
2.938TyrLeu: 2.938 ± 0.521
0.618TyrMet: 0.618 ± 0.283
0.773TyrAsn: 0.773 ± 0.226
1.933TyrPro: 1.933 ± 0.396
1.391TyrGln: 1.391 ± 0.342
1.469TyrArg: 1.469 ± 0.353
2.706TyrSer: 2.706 ± 0.421
2.165TyrThr: 2.165 ± 0.438
1.623TyrVal: 1.623 ± 0.381
0.85TyrTrp: 0.85 ± 0.254
0.696TyrTyr: 0.696 ± 0.228
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (12937 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski