Amino acid dipepetide frequency for Klebsiella phage vB_KpnM_KpV52

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.099AlaAla: 8.099 ± 0.888
0.877AlaCys: 0.877 ± 0.313
5.197AlaAsp: 5.197 ± 0.635
5.265AlaGlu: 5.265 ± 0.656
2.497AlaPhe: 2.497 ± 0.416
7.154AlaGly: 7.154 ± 0.82
1.687AlaHis: 1.687 ± 0.377
4.792AlaIle: 4.792 ± 0.482
4.387AlaLys: 4.387 ± 0.643
6.344AlaLeu: 6.344 ± 0.714
2.565AlaMet: 2.565 ± 0.34
3.51AlaAsn: 3.51 ± 0.523
2.97AlaPro: 2.97 ± 0.507
4.522AlaGln: 4.522 ± 0.575
4.59AlaArg: 4.59 ± 0.7
5.602AlaSer: 5.602 ± 0.686
5.94AlaThr: 5.94 ± 0.677
5.602AlaVal: 5.602 ± 0.675
1.147AlaTrp: 1.147 ± 0.25
2.43AlaTyr: 2.43 ± 0.324
0.0AlaXaa: 0.0 ± 0.0
Cys
1.012CysAla: 1.012 ± 0.212
0.067CysCys: 0.067 ± 0.065
0.81CysAsp: 0.81 ± 0.239
1.215CysGlu: 1.215 ± 0.263
0.27CysPhe: 0.27 ± 0.166
1.08CysGly: 1.08 ± 0.299
0.472CysHis: 0.472 ± 0.221
0.877CysIle: 0.877 ± 0.277
0.742CysLys: 0.742 ± 0.31
1.012CysLeu: 1.012 ± 0.26
0.135CysMet: 0.135 ± 0.083
0.945CysAsn: 0.945 ± 0.277
1.485CysPro: 1.485 ± 0.564
0.27CysGln: 0.27 ± 0.11
0.945CysArg: 0.945 ± 0.253
1.08CysSer: 1.08 ± 0.254
1.012CysThr: 1.012 ± 0.249
1.08CysVal: 1.08 ± 0.365
0.135CysTrp: 0.135 ± 0.092
0.202CysTyr: 0.202 ± 0.1
0.0CysXaa: 0.0 ± 0.0
Asp
6.075AspAla: 6.075 ± 0.585
0.27AspCys: 0.27 ± 0.122
3.375AspAsp: 3.375 ± 0.445
4.252AspGlu: 4.252 ± 0.618
3.037AspPhe: 3.037 ± 0.543
6.479AspGly: 6.479 ± 0.698
0.81AspHis: 0.81 ± 0.225
3.847AspIle: 3.847 ± 0.557
2.902AspLys: 2.902 ± 0.459
4.05AspLeu: 4.05 ± 0.465
1.822AspMet: 1.822 ± 0.396
2.767AspAsn: 2.767 ± 0.485
2.227AspPro: 2.227 ± 0.353
1.282AspGln: 1.282 ± 0.25
3.24AspArg: 3.24 ± 0.479
3.577AspSer: 3.577 ± 0.444
3.78AspThr: 3.78 ± 0.45
4.59AspVal: 4.59 ± 0.551
1.08AspTrp: 1.08 ± 0.272
2.092AspTyr: 2.092 ± 0.326
0.0AspXaa: 0.0 ± 0.0
Glu
4.927GluAla: 4.927 ± 0.674
1.282GluCys: 1.282 ± 0.292
2.497GluAsp: 2.497 ± 0.451
3.442GluGlu: 3.442 ± 0.748
1.822GluPhe: 1.822 ± 0.362
3.172GluGly: 3.172 ± 0.414
1.08GluHis: 1.08 ± 0.245
3.712GluIle: 3.712 ± 0.505
3.51GluLys: 3.51 ± 0.524
5.94GluLeu: 5.94 ± 0.645
2.092GluMet: 2.092 ± 0.423
2.632GluAsn: 2.632 ± 0.484
1.485GluPro: 1.485 ± 0.402
3.442GluGln: 3.442 ± 0.486
3.037GluArg: 3.037 ± 0.458
2.902GluSer: 2.902 ± 0.404
2.902GluThr: 2.902 ± 0.415
3.712GluVal: 3.712 ± 0.477
1.485GluTrp: 1.485 ± 0.316
2.632GluTyr: 2.632 ± 0.382
0.0GluXaa: 0.0 ± 0.0
Phe
2.565PheAla: 2.565 ± 0.49
0.54PheCys: 0.54 ± 0.206
2.362PheAsp: 2.362 ± 0.477
2.025PheGlu: 2.025 ± 0.41
1.282PhePhe: 1.282 ± 0.264
2.362PheGly: 2.362 ± 0.325
0.405PheHis: 0.405 ± 0.166
2.16PheIle: 2.16 ± 0.501
2.362PheLys: 2.362 ± 0.349
1.957PheLeu: 1.957 ± 0.361
0.945PheMet: 0.945 ± 0.296
2.092PheAsn: 2.092 ± 0.487
1.485PhePro: 1.485 ± 0.333
0.945PheGln: 0.945 ± 0.209
1.89PheArg: 1.89 ± 0.363
1.755PheSer: 1.755 ± 0.38
2.565PheThr: 2.565 ± 0.412
2.025PheVal: 2.025 ± 0.309
0.337PheTrp: 0.337 ± 0.136
1.417PheTyr: 1.417 ± 0.312
0.0PheXaa: 0.0 ± 0.0
Gly
6.21GlyAla: 6.21 ± 0.904
1.147GlyCys: 1.147 ± 0.388
4.995GlyAsp: 4.995 ± 0.525
4.792GlyGlu: 4.792 ± 0.697
1.822GlyPhe: 1.822 ± 0.392
5.94GlyGly: 5.94 ± 1.218
1.485GlyHis: 1.485 ± 0.399
4.117GlyIle: 4.117 ± 0.53
4.657GlyLys: 4.657 ± 0.522
5.265GlyLeu: 5.265 ± 0.613
2.362GlyMet: 2.362 ± 0.367
2.767GlyAsn: 2.767 ± 0.443
2.025GlyPro: 2.025 ± 0.383
2.902GlyGln: 2.902 ± 0.376
3.645GlyArg: 3.645 ± 0.571
3.982GlySer: 3.982 ± 0.636
5.332GlyThr: 5.332 ± 0.961
6.749GlyVal: 6.749 ± 0.713
1.417GlyTrp: 1.417 ± 0.282
3.645GlyTyr: 3.645 ± 0.394
0.0GlyXaa: 0.0 ± 0.0
His
1.215HisAla: 1.215 ± 0.297
0.607HisCys: 0.607 ± 0.198
1.35HisAsp: 1.35 ± 0.332
0.405HisGlu: 0.405 ± 0.176
0.742HisPhe: 0.742 ± 0.223
2.295HisGly: 2.295 ± 0.5
0.742HisHis: 0.742 ± 0.214
1.687HisIle: 1.687 ± 0.347
0.81HisLys: 0.81 ± 0.23
1.282HisLeu: 1.282 ± 0.253
0.405HisMet: 0.405 ± 0.168
0.945HisAsn: 0.945 ± 0.288
0.54HisPro: 0.54 ± 0.176
0.607HisGln: 0.607 ± 0.186
1.215HisArg: 1.215 ± 0.278
1.08HisSer: 1.08 ± 0.228
0.675HisThr: 0.675 ± 0.183
1.485HisVal: 1.485 ± 0.33
0.337HisTrp: 0.337 ± 0.16
0.945HisTyr: 0.945 ± 0.256
0.0HisXaa: 0.0 ± 0.0
Ile
4.927IleAla: 4.927 ± 0.597
1.417IleCys: 1.417 ± 0.343
4.252IleAsp: 4.252 ± 0.538
3.847IleGlu: 3.847 ± 0.522
1.282IlePhe: 1.282 ± 0.34
3.307IleGly: 3.307 ± 0.414
1.08IleHis: 1.08 ± 0.324
2.835IleIle: 2.835 ± 0.523
3.712IleLys: 3.712 ± 0.548
3.172IleLeu: 3.172 ± 0.481
1.485IleMet: 1.485 ± 0.326
3.375IleAsn: 3.375 ± 0.571
2.632IlePro: 2.632 ± 0.433
1.957IleGln: 1.957 ± 0.395
4.252IleArg: 4.252 ± 0.525
2.767IleSer: 2.767 ± 0.465
4.455IleThr: 4.455 ± 0.547
4.725IleVal: 4.725 ± 0.641
0.472IleTrp: 0.472 ± 0.182
1.822IleTyr: 1.822 ± 0.312
0.0IleXaa: 0.0 ± 0.0
Lys
5.805LysAla: 5.805 ± 0.618
1.012LysCys: 1.012 ± 0.333
3.51LysAsp: 3.51 ± 0.572
2.632LysGlu: 2.632 ± 0.541
2.227LysPhe: 2.227 ± 0.323
2.902LysGly: 2.902 ± 0.482
1.147LysHis: 1.147 ± 0.393
2.835LysIle: 2.835 ± 0.506
2.767LysLys: 2.767 ± 0.436
4.185LysLeu: 4.185 ± 0.556
2.295LysMet: 2.295 ± 0.381
2.43LysAsn: 2.43 ± 0.502
2.632LysPro: 2.632 ± 0.353
2.835LysGln: 2.835 ± 0.656
4.522LysArg: 4.522 ± 0.505
3.172LysSer: 3.172 ± 0.448
2.835LysThr: 2.835 ± 0.386
3.51LysVal: 3.51 ± 0.494
1.012LysTrp: 1.012 ± 0.227
1.215LysTyr: 1.215 ± 0.294
0.0LysXaa: 0.0 ± 0.0
Leu
5.265LeuAla: 5.265 ± 0.58
1.35LeuCys: 1.35 ± 0.355
6.277LeuAsp: 6.277 ± 0.636
4.185LeuGlu: 4.185 ± 0.548
2.025LeuPhe: 2.025 ± 0.32
4.995LeuGly: 4.995 ± 0.456
1.417LeuHis: 1.417 ± 0.34
4.32LeuIle: 4.32 ± 0.49
3.78LeuLys: 3.78 ± 0.536
5.535LeuLeu: 5.535 ± 0.64
1.89LeuMet: 1.89 ± 0.347
4.522LeuAsn: 4.522 ± 0.56
2.16LeuPro: 2.16 ± 0.327
2.565LeuGln: 2.565 ± 0.422
4.32LeuArg: 4.32 ± 0.493
5.197LeuSer: 5.197 ± 0.495
4.995LeuThr: 4.995 ± 0.594
4.927LeuVal: 4.927 ± 0.447
1.012LeuTrp: 1.012 ± 0.259
2.227LeuTyr: 2.227 ± 0.35
0.0LeuXaa: 0.0 ± 0.0
Met
2.227MetAla: 2.227 ± 0.393
0.405MetCys: 0.405 ± 0.161
1.147MetAsp: 1.147 ± 0.308
1.485MetGlu: 1.485 ± 0.242
1.215MetPhe: 1.215 ± 0.305
1.62MetGly: 1.62 ± 0.304
0.337MetHis: 0.337 ± 0.155
1.485MetIle: 1.485 ± 0.286
1.08MetLys: 1.08 ± 0.308
2.295MetLeu: 2.295 ± 0.363
1.012MetMet: 1.012 ± 0.283
1.485MetAsn: 1.485 ± 0.381
0.877MetPro: 0.877 ± 0.227
0.742MetGln: 0.742 ± 0.185
2.295MetArg: 2.295 ± 0.506
2.43MetSer: 2.43 ± 0.409
3.24MetThr: 3.24 ± 0.358
1.62MetVal: 1.62 ± 0.302
0.202MetTrp: 0.202 ± 0.11
0.877MetTyr: 0.877 ± 0.211
0.0MetXaa: 0.0 ± 0.0
Asn
4.117AsnAla: 4.117 ± 0.535
0.742AsnCys: 0.742 ± 0.275
2.97AsnAsp: 2.97 ± 0.442
2.565AsnGlu: 2.565 ± 0.415
1.62AsnPhe: 1.62 ± 0.304
4.387AsnGly: 4.387 ± 0.601
1.35AsnHis: 1.35 ± 0.306
2.902AsnIle: 2.902 ± 0.488
2.295AsnLys: 2.295 ± 0.442
2.632AsnLeu: 2.632 ± 0.374
1.147AsnMet: 1.147 ± 0.285
3.51AsnAsn: 3.51 ± 0.669
3.037AsnPro: 3.037 ± 0.552
1.147AsnGln: 1.147 ± 0.273
2.295AsnArg: 2.295 ± 0.517
3.645AsnSer: 3.645 ± 0.612
3.105AsnThr: 3.105 ± 0.484
3.78AsnVal: 3.78 ± 0.449
0.81AsnTrp: 0.81 ± 0.21
1.822AsnTyr: 1.822 ± 0.293
0.0AsnXaa: 0.0 ± 0.0
Pro
3.915ProAla: 3.915 ± 0.524
0.337ProCys: 0.337 ± 0.145
2.497ProAsp: 2.497 ± 0.378
2.092ProGlu: 2.092 ± 0.338
1.147ProPhe: 1.147 ± 0.245
3.915ProGly: 3.915 ± 0.601
0.675ProHis: 0.675 ± 0.227
2.092ProIle: 2.092 ± 0.347
1.62ProLys: 1.62 ± 0.265
3.307ProLeu: 3.307 ± 0.415
0.675ProMet: 0.675 ± 0.173
2.227ProAsn: 2.227 ± 0.321
2.16ProPro: 2.16 ± 0.549
1.282ProGln: 1.282 ± 0.237
2.16ProArg: 2.16 ± 0.327
2.362ProSer: 2.362 ± 0.299
2.092ProThr: 2.092 ± 0.399
3.78ProVal: 3.78 ± 0.517
0.607ProTrp: 0.607 ± 0.173
1.282ProTyr: 1.282 ± 0.217
0.0ProXaa: 0.0 ± 0.0
Gln
2.632GlnAla: 2.632 ± 0.544
0.472GlnCys: 0.472 ± 0.165
1.62GlnAsp: 1.62 ± 0.294
2.497GlnGlu: 2.497 ± 0.383
1.89GlnPhe: 1.89 ± 0.334
2.16GlnGly: 2.16 ± 0.424
0.607GlnHis: 0.607 ± 0.135
2.025GlnIle: 2.025 ± 0.396
1.755GlnLys: 1.755 ± 0.438
4.252GlnLeu: 4.252 ± 0.566
1.215GlnMet: 1.215 ± 0.328
1.687GlnAsn: 1.687 ± 0.362
1.282GlnPro: 1.282 ± 0.423
2.362GlnGln: 2.362 ± 0.348
2.43GlnArg: 2.43 ± 0.449
2.43GlnSer: 2.43 ± 0.49
3.037GlnThr: 3.037 ± 0.418
2.835GlnVal: 2.835 ± 0.392
0.472GlnTrp: 0.472 ± 0.159
1.552GlnTyr: 1.552 ± 0.301
0.0GlnXaa: 0.0 ± 0.0
Arg
4.387ArgAla: 4.387 ± 0.582
0.81ArgCys: 0.81 ± 0.21
2.632ArgAsp: 2.632 ± 0.446
3.915ArgGlu: 3.915 ± 0.538
1.755ArgPhe: 1.755 ± 0.361
2.97ArgGly: 2.97 ± 0.496
1.147ArgHis: 1.147 ± 0.295
3.78ArgIle: 3.78 ± 0.485
3.577ArgLys: 3.577 ± 0.579
4.455ArgLeu: 4.455 ± 0.546
1.62ArgMet: 1.62 ± 0.327
2.767ArgAsn: 2.767 ± 0.44
2.43ArgPro: 2.43 ± 0.377
2.632ArgGln: 2.632 ± 0.511
2.025ArgArg: 2.025 ± 0.339
2.902ArgSer: 2.902 ± 0.384
2.497ArgThr: 2.497 ± 0.328
4.86ArgVal: 4.86 ± 0.676
1.215ArgTrp: 1.215 ± 0.259
2.16ArgTyr: 2.16 ± 0.428
0.0ArgXaa: 0.0 ± 0.0
Ser
4.927SerAla: 4.927 ± 0.711
0.472SerCys: 0.472 ± 0.19
3.847SerAsp: 3.847 ± 0.413
2.43SerGlu: 2.43 ± 0.405
1.957SerPhe: 1.957 ± 0.357
5.467SerGly: 5.467 ± 0.576
1.62SerHis: 1.62 ± 0.419
3.645SerIle: 3.645 ± 0.555
3.847SerLys: 3.847 ± 0.489
4.252SerLeu: 4.252 ± 0.463
1.755SerMet: 1.755 ± 0.329
3.712SerAsn: 3.712 ± 0.524
1.755SerPro: 1.755 ± 0.367
2.497SerGln: 2.497 ± 0.403
2.7SerArg: 2.7 ± 0.39
3.982SerSer: 3.982 ± 0.835
4.252SerThr: 4.252 ± 0.62
4.185SerVal: 4.185 ± 0.565
0.607SerTrp: 0.607 ± 0.197
2.7SerTyr: 2.7 ± 0.39
0.0SerXaa: 0.0 ± 0.0
Thr
5.805ThrAla: 5.805 ± 0.726
1.147ThrCys: 1.147 ± 0.277
3.982ThrAsp: 3.982 ± 0.503
3.24ThrGlu: 3.24 ± 0.444
2.362ThrPhe: 2.362 ± 0.4
6.412ThrGly: 6.412 ± 0.62
1.35ThrHis: 1.35 ± 0.448
4.522ThrIle: 4.522 ± 0.49
3.375ThrLys: 3.375 ± 0.519
5.062ThrLeu: 5.062 ± 0.544
1.552ThrMet: 1.552 ± 0.293
2.295ThrAsn: 2.295 ± 0.34
3.915ThrPro: 3.915 ± 0.501
2.092ThrGln: 2.092 ± 0.407
2.497ThrArg: 2.497 ± 0.421
4.05ThrSer: 4.05 ± 0.557
4.927ThrThr: 4.927 ± 0.643
6.479ThrVal: 6.479 ± 0.787
1.215ThrTrp: 1.215 ± 0.295
2.092ThrTyr: 2.092 ± 0.337
0.0ThrXaa: 0.0 ± 0.0
Val
7.154ValAla: 7.154 ± 0.91
1.08ValCys: 1.08 ± 0.241
4.792ValAsp: 4.792 ± 0.695
4.59ValGlu: 4.59 ± 0.626
2.632ValPhe: 2.632 ± 0.546
4.725ValGly: 4.725 ± 0.563
0.877ValHis: 0.877 ± 0.281
3.915ValIle: 3.915 ± 0.439
5.197ValLys: 5.197 ± 0.602
3.982ValLeu: 3.982 ± 0.497
2.16ValMet: 2.16 ± 0.351
4.185ValAsn: 4.185 ± 0.543
3.24ValPro: 3.24 ± 0.493
2.7ValGln: 2.7 ± 0.427
3.442ValArg: 3.442 ± 0.461
4.455ValSer: 4.455 ± 0.501
7.559ValThr: 7.559 ± 0.789
4.32ValVal: 4.32 ± 0.564
0.945ValTrp: 0.945 ± 0.233
2.632ValTyr: 2.632 ± 0.48
0.0ValXaa: 0.0 ± 0.0
Trp
0.945TrpAla: 0.945 ± 0.242
0.135TrpCys: 0.135 ± 0.099
0.81TrpAsp: 0.81 ± 0.194
0.81TrpGlu: 0.81 ± 0.209
0.675TrpPhe: 0.675 ± 0.216
0.81TrpGly: 0.81 ± 0.217
0.472TrpHis: 0.472 ± 0.157
0.81TrpIle: 0.81 ± 0.277
1.282TrpLys: 1.282 ± 0.285
1.35TrpLeu: 1.35 ± 0.298
0.135TrpMet: 0.135 ± 0.096
0.54TrpAsn: 0.54 ± 0.205
0.405TrpPro: 0.405 ± 0.156
0.877TrpGln: 0.877 ± 0.235
0.877TrpArg: 0.877 ± 0.225
1.012TrpSer: 1.012 ± 0.254
0.877TrpThr: 0.877 ± 0.256
1.687TrpVal: 1.687 ± 0.324
0.202TrpTrp: 0.202 ± 0.112
0.675TrpTyr: 0.675 ± 0.2
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.97TyrAla: 2.97 ± 0.438
0.675TyrCys: 0.675 ± 0.214
2.632TyrAsp: 2.632 ± 0.371
2.025TyrGlu: 2.025 ± 0.393
1.215TyrPhe: 1.215 ± 0.253
3.105TyrGly: 3.105 ± 0.385
0.607TyrHis: 0.607 ± 0.225
1.417TyrIle: 1.417 ± 0.29
1.822TyrLys: 1.822 ± 0.32
2.632TyrLeu: 2.632 ± 0.458
0.742TyrMet: 0.742 ± 0.233
1.485TyrAsn: 1.485 ± 0.352
1.485TyrPro: 1.485 ± 0.373
1.552TyrGln: 1.552 ± 0.28
2.227TyrArg: 2.227 ± 0.419
2.227TyrSer: 2.227 ± 0.385
2.295TyrThr: 2.295 ± 0.328
2.632TyrVal: 2.632 ± 0.374
0.675TyrTrp: 0.675 ± 0.226
1.08TyrTyr: 1.08 ± 0.318
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (14817 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski