Amino acid dipepetide frequency for Enterobacteria phage KhF3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.11AlaAla: 5.11 ± 0.663
0.506AlaCys: 0.506 ± 0.165
4.351AlaAsp: 4.351 ± 0.441
4.25AlaGlu: 4.25 ± 0.407
2.884AlaPhe: 2.884 ± 0.398
5.261AlaGly: 5.261 ± 0.476
1.669AlaHis: 1.669 ± 0.32
4.148AlaIle: 4.148 ± 0.373
5.312AlaLys: 5.312 ± 0.554
5.868AlaLeu: 5.868 ± 0.522
2.833AlaMet: 2.833 ± 0.42
3.997AlaAsn: 3.997 ± 0.55
1.872AlaPro: 1.872 ± 0.283
3.137AlaGln: 3.137 ± 0.449
2.58AlaArg: 2.58 ± 0.408
4.25AlaSer: 4.25 ± 0.597
4.553AlaThr: 4.553 ± 0.48
5.818AlaVal: 5.818 ± 0.524
0.961AlaTrp: 0.961 ± 0.17
3.035AlaTyr: 3.035 ± 0.341
0.0AlaXaa: 0.0 ± 0.0
Cys
0.506CysAla: 0.506 ± 0.223
0.202CysCys: 0.202 ± 0.117
0.354CysAsp: 0.354 ± 0.128
1.062CysGlu: 1.062 ± 0.23
0.658CysPhe: 0.658 ± 0.176
0.911CysGly: 0.911 ± 0.226
0.405CysHis: 0.405 ± 0.125
0.506CysIle: 0.506 ± 0.166
1.113CysLys: 1.113 ± 0.283
0.961CysLeu: 0.961 ± 0.223
0.405CysMet: 0.405 ± 0.133
0.607CysAsn: 0.607 ± 0.174
0.556CysPro: 0.556 ± 0.171
0.455CysGln: 0.455 ± 0.156
0.607CysArg: 0.607 ± 0.177
1.012CysSer: 1.012 ± 0.243
0.506CysThr: 0.506 ± 0.125
0.809CysVal: 0.809 ± 0.186
0.101CysTrp: 0.101 ± 0.069
0.455CysTyr: 0.455 ± 0.153
0.0CysXaa: 0.0 ± 0.0
Asp
4.553AspAla: 4.553 ± 0.372
0.556AspCys: 0.556 ± 0.183
3.44AspAsp: 3.44 ± 0.465
4.401AspGlu: 4.401 ± 0.546
3.137AspPhe: 3.137 ± 0.299
4.705AspGly: 4.705 ± 0.553
0.708AspHis: 0.708 ± 0.204
4.199AspIle: 4.199 ± 0.482
4.401AspLys: 4.401 ± 0.359
5.261AspLeu: 5.261 ± 0.367
1.568AspMet: 1.568 ± 0.296
3.997AspAsn: 3.997 ± 0.372
1.821AspPro: 1.821 ± 0.338
0.708AspGln: 0.708 ± 0.166
2.125AspArg: 2.125 ± 0.257
4.351AspSer: 4.351 ± 0.445
3.997AspThr: 3.997 ± 0.363
4.958AspVal: 4.958 ± 0.47
1.417AspTrp: 1.417 ± 0.304
2.985AspTyr: 2.985 ± 0.435
0.0AspXaa: 0.0 ± 0.0
Glu
5.717GluAla: 5.717 ± 0.7
0.556GluCys: 0.556 ± 0.164
4.502GluAsp: 4.502 ± 0.618
5.514GluGlu: 5.514 ± 0.662
2.934GluPhe: 2.934 ± 0.409
4.148GluGly: 4.148 ± 0.462
1.518GluHis: 1.518 ± 0.312
4.3GluIle: 4.3 ± 0.595
4.907GluLys: 4.907 ± 0.598
6.222GluLeu: 6.222 ± 0.503
2.58GluMet: 2.58 ± 0.313
3.288GluAsn: 3.288 ± 0.361
1.771GluPro: 1.771 ± 0.291
2.226GluGln: 2.226 ± 0.307
3.035GluArg: 3.035 ± 0.388
3.693GluSer: 3.693 ± 0.421
3.541GluThr: 3.541 ± 0.336
4.452GluVal: 4.452 ± 0.42
0.759GluTrp: 0.759 ± 0.209
2.428GluTyr: 2.428 ± 0.38
0.0GluXaa: 0.0 ± 0.0
Phe
3.137PheAla: 3.137 ± 0.526
0.455PheCys: 0.455 ± 0.162
3.137PheAsp: 3.137 ± 0.375
3.187PheGlu: 3.187 ± 0.397
1.062PhePhe: 1.062 ± 0.211
3.187PheGly: 3.187 ± 0.386
0.961PheHis: 0.961 ± 0.208
2.529PheIle: 2.529 ± 0.37
3.491PheLys: 3.491 ± 0.344
2.681PheLeu: 2.681 ± 0.381
1.012PheMet: 1.012 ± 0.233
2.631PheAsn: 2.631 ± 0.363
1.164PhePro: 1.164 ± 0.255
1.366PheGln: 1.366 ± 0.218
1.467PheArg: 1.467 ± 0.235
2.58PheSer: 2.58 ± 0.298
2.226PheThr: 2.226 ± 0.323
2.934PheVal: 2.934 ± 0.317
0.405PheTrp: 0.405 ± 0.133
1.771PheTyr: 1.771 ± 0.304
0.0PheXaa: 0.0 ± 0.0
Gly
4.654GlyAla: 4.654 ± 0.558
1.113GlyCys: 1.113 ± 0.253
3.44GlyAsp: 3.44 ± 0.438
4.047GlyGlu: 4.047 ± 0.523
3.238GlyPhe: 3.238 ± 0.409
4.553GlyGly: 4.553 ± 0.544
1.366GlyHis: 1.366 ± 0.217
4.755GlyIle: 4.755 ± 0.466
5.818GlyLys: 5.818 ± 0.524
4.907GlyLeu: 4.907 ± 0.481
0.911GlyMet: 0.911 ± 0.216
2.884GlyAsn: 2.884 ± 0.392
0.101GlyPro: 0.101 ± 0.072
2.327GlyGln: 2.327 ± 0.38
2.732GlyArg: 2.732 ± 0.29
4.502GlySer: 4.502 ± 0.509
4.199GlyThr: 4.199 ± 0.534
5.666GlyVal: 5.666 ± 0.446
1.012GlyTrp: 1.012 ± 0.216
2.782GlyTyr: 2.782 ± 0.363
0.0GlyXaa: 0.0 ± 0.0
His
1.062HisAla: 1.062 ± 0.18
0.253HisCys: 0.253 ± 0.105
1.164HisAsp: 1.164 ± 0.24
1.315HisGlu: 1.315 ± 0.26
0.809HisPhe: 0.809 ± 0.199
0.708HisGly: 0.708 ± 0.165
0.455HisHis: 0.455 ± 0.158
0.86HisIle: 0.86 ± 0.187
0.961HisLys: 0.961 ± 0.223
1.771HisLeu: 1.771 ± 0.332
0.708HisMet: 0.708 ± 0.158
1.315HisAsn: 1.315 ± 0.421
0.86HisPro: 0.86 ± 0.188
0.405HisGln: 0.405 ± 0.123
0.86HisArg: 0.86 ± 0.183
1.518HisSer: 1.518 ± 0.425
1.72HisThr: 1.72 ± 0.489
1.518HisVal: 1.518 ± 0.259
0.304HisTrp: 0.304 ± 0.176
0.911HisTyr: 0.911 ± 0.185
0.0HisXaa: 0.0 ± 0.0
Ile
4.098IleAla: 4.098 ± 0.388
1.164IleCys: 1.164 ± 0.274
4.502IleAsp: 4.502 ± 0.411
4.351IleGlu: 4.351 ± 0.479
2.58IlePhe: 2.58 ± 0.385
3.187IleGly: 3.187 ± 0.375
1.214IleHis: 1.214 ± 0.177
3.339IleIle: 3.339 ± 0.426
5.16IleLys: 5.16 ± 0.57
3.693IleLeu: 3.693 ± 0.505
1.568IleMet: 1.568 ± 0.274
3.238IleAsn: 3.238 ± 0.399
2.226IlePro: 2.226 ± 0.384
1.771IleGln: 1.771 ± 0.288
2.631IleArg: 2.631 ± 0.334
4.047IleSer: 4.047 ± 0.407
3.845IleThr: 3.845 ± 0.383
3.288IleVal: 3.288 ± 0.41
0.607IleTrp: 0.607 ± 0.155
2.833IleTyr: 2.833 ± 0.338
0.0IleXaa: 0.0 ± 0.0
Lys
5.767LysAla: 5.767 ± 0.611
0.86LysCys: 0.86 ± 0.216
4.755LysAsp: 4.755 ± 0.499
6.02LysGlu: 6.02 ± 0.606
2.074LysPhe: 2.074 ± 0.337
5.11LysGly: 5.11 ± 0.509
1.366LysHis: 1.366 ± 0.266
4.199LysIle: 4.199 ± 0.38
5.717LysLys: 5.717 ± 0.706
6.172LysLeu: 6.172 ± 0.586
2.479LysMet: 2.479 ± 0.392
3.794LysAsn: 3.794 ± 0.414
2.479LysPro: 2.479 ± 0.346
2.934LysGln: 2.934 ± 0.379
3.288LysArg: 3.288 ± 0.308
4.604LysSer: 4.604 ± 0.463
5.059LysThr: 5.059 ± 0.553
6.627LysVal: 6.627 ± 0.679
0.556LysTrp: 0.556 ± 0.171
2.732LysTyr: 2.732 ± 0.531
0.0LysXaa: 0.0 ± 0.0
Leu
6.172LeuAla: 6.172 ± 0.606
0.911LeuCys: 0.911 ± 0.222
5.666LeuAsp: 5.666 ± 0.572
6.273LeuGlu: 6.273 ± 0.541
2.479LeuPhe: 2.479 ± 0.222
3.946LeuGly: 3.946 ± 0.448
1.72LeuHis: 1.72 ± 0.327
3.44LeuIle: 3.44 ± 0.349
6.627LeuLys: 6.627 ± 0.574
6.222LeuLeu: 6.222 ± 0.517
2.631LeuMet: 2.631 ± 0.287
4.098LeuAsn: 4.098 ± 0.418
3.339LeuPro: 3.339 ± 0.311
3.137LeuGln: 3.137 ± 0.389
4.3LeuArg: 4.3 ± 0.463
5.464LeuSer: 5.464 ± 0.611
5.059LeuThr: 5.059 ± 0.458
4.047LeuVal: 4.047 ± 0.467
0.86LeuTrp: 0.86 ± 0.201
3.187LeuTyr: 3.187 ± 0.405
0.0LeuXaa: 0.0 ± 0.0
Met
2.074MetAla: 2.074 ± 0.343
0.304MetCys: 0.304 ± 0.132
1.214MetAsp: 1.214 ± 0.209
2.024MetGlu: 2.024 ± 0.31
1.568MetPhe: 1.568 ± 0.328
1.72MetGly: 1.72 ± 0.253
0.304MetHis: 0.304 ± 0.103
1.872MetIle: 1.872 ± 0.326
2.428MetLys: 2.428 ± 0.366
2.125MetLeu: 2.125 ± 0.337
0.759MetMet: 0.759 ± 0.231
1.518MetAsn: 1.518 ± 0.256
0.809MetPro: 0.809 ± 0.172
1.417MetGln: 1.417 ± 0.259
1.518MetArg: 1.518 ± 0.247
2.175MetSer: 2.175 ± 0.268
2.024MetThr: 2.024 ± 0.294
1.214MetVal: 1.214 ± 0.196
0.253MetTrp: 0.253 ± 0.103
0.809MetTyr: 0.809 ± 0.161
0.0MetXaa: 0.0 ± 0.0
Asn
4.199AsnAla: 4.199 ± 0.46
0.455AsnCys: 0.455 ± 0.144
2.934AsnAsp: 2.934 ± 0.379
2.428AsnGlu: 2.428 ± 0.372
2.681AsnPhe: 2.681 ± 0.31
4.502AsnGly: 4.502 ± 0.555
1.366AsnHis: 1.366 ± 0.377
3.744AsnIle: 3.744 ± 0.477
3.541AsnLys: 3.541 ± 0.519
4.958AsnLeu: 4.958 ± 0.647
1.012AsnMet: 1.012 ± 0.258
2.884AsnAsn: 2.884 ± 0.487
2.024AsnPro: 2.024 ± 0.287
1.669AsnGln: 1.669 ± 0.351
2.226AsnArg: 2.226 ± 0.309
3.642AsnSer: 3.642 ± 0.435
2.681AsnThr: 2.681 ± 0.496
3.541AsnVal: 3.541 ± 0.325
0.607AsnTrp: 0.607 ± 0.149
2.479AsnTyr: 2.479 ± 0.286
0.0AsnXaa: 0.0 ± 0.0
Pro
1.973ProAla: 1.973 ± 0.329
0.455ProCys: 0.455 ± 0.14
2.175ProAsp: 2.175 ± 0.343
3.035ProGlu: 3.035 ± 0.4
1.669ProPhe: 1.669 ± 0.31
0.304ProGly: 0.304 ± 0.155
0.607ProHis: 0.607 ± 0.228
1.265ProIle: 1.265 ± 0.199
2.529ProLys: 2.529 ± 0.351
2.175ProLeu: 2.175 ± 0.34
1.214ProMet: 1.214 ± 0.265
1.568ProAsn: 1.568 ± 0.295
1.062ProPro: 1.062 ± 0.265
1.164ProGln: 1.164 ± 0.226
1.012ProArg: 1.012 ± 0.233
2.479ProSer: 2.479 ± 0.443
2.277ProThr: 2.277 ± 0.304
2.277ProVal: 2.277 ± 0.333
0.152ProTrp: 0.152 ± 0.078
1.467ProTyr: 1.467 ± 0.269
0.0ProXaa: 0.0 ± 0.0
Gln
2.428GlnAla: 2.428 ± 0.43
0.455GlnCys: 0.455 ± 0.141
1.619GlnAsp: 1.619 ± 0.25
2.378GlnGlu: 2.378 ± 0.336
1.619GlnPhe: 1.619 ± 0.248
1.771GlnGly: 1.771 ± 0.293
0.556GlnHis: 0.556 ± 0.17
2.782GlnIle: 2.782 ± 0.352
2.631GlnLys: 2.631 ± 0.313
2.884GlnLeu: 2.884 ± 0.287
1.012GlnMet: 1.012 ± 0.285
1.973GlnAsn: 1.973 ± 0.342
1.265GlnPro: 1.265 ± 0.233
1.315GlnGln: 1.315 ± 0.233
1.821GlnArg: 1.821 ± 0.264
2.024GlnSer: 2.024 ± 0.358
1.922GlnThr: 1.922 ± 0.274
2.378GlnVal: 2.378 ± 0.339
0.455GlnTrp: 0.455 ± 0.146
1.973GlnTyr: 1.973 ± 0.256
0.0GlnXaa: 0.0 ± 0.0
Arg
2.378ArgAla: 2.378 ± 0.33
0.658ArgCys: 0.658 ± 0.198
3.288ArgAsp: 3.288 ± 0.475
2.327ArgGlu: 2.327 ± 0.368
1.518ArgPhe: 1.518 ± 0.312
2.631ArgGly: 2.631 ± 0.385
0.86ArgHis: 0.86 ± 0.203
2.782ArgIle: 2.782 ± 0.278
2.934ArgLys: 2.934 ± 0.393
3.693ArgLeu: 3.693 ± 0.452
1.366ArgMet: 1.366 ± 0.255
2.529ArgAsn: 2.529 ± 0.337
1.164ArgPro: 1.164 ± 0.198
1.619ArgGln: 1.619 ± 0.308
2.175ArgArg: 2.175 ± 0.344
1.973ArgSer: 1.973 ± 0.317
2.479ArgThr: 2.479 ± 0.376
3.086ArgVal: 3.086 ± 0.349
0.708ArgTrp: 0.708 ± 0.25
2.226ArgTyr: 2.226 ± 0.331
0.0ArgXaa: 0.0 ± 0.0
Ser
4.654SerAla: 4.654 ± 0.561
0.708SerCys: 0.708 ± 0.17
4.705SerAsp: 4.705 ± 0.502
4.604SerGlu: 4.604 ± 0.463
2.833SerPhe: 2.833 ± 0.346
3.895SerGly: 3.895 ± 0.482
1.113SerHis: 1.113 ± 0.229
3.491SerIle: 3.491 ± 0.515
4.401SerLys: 4.401 ± 0.411
5.919SerLeu: 5.919 ± 0.529
1.568SerMet: 1.568 ± 0.217
3.389SerAsn: 3.389 ± 0.565
1.72SerPro: 1.72 ± 0.271
2.631SerGln: 2.631 ± 0.39
2.782SerArg: 2.782 ± 0.303
4.604SerSer: 4.604 ± 0.569
2.884SerThr: 2.884 ± 0.431
5.464SerVal: 5.464 ± 0.43
1.012SerTrp: 1.012 ± 0.231
2.884SerTyr: 2.884 ± 0.294
0.0SerXaa: 0.0 ± 0.0
Thr
4.553ThrAla: 4.553 ± 0.433
0.506ThrCys: 0.506 ± 0.13
3.693ThrAsp: 3.693 ± 0.373
3.035ThrGlu: 3.035 ± 0.42
2.884ThrPhe: 2.884 ± 0.333
5.565ThrGly: 5.565 ± 0.535
1.568ThrHis: 1.568 ± 0.401
3.693ThrIle: 3.693 ± 0.481
4.502ThrLys: 4.502 ± 0.52
4.755ThrLeu: 4.755 ± 0.441
0.708ThrMet: 0.708 ± 0.169
2.681ThrAsn: 2.681 ± 0.514
2.175ThrPro: 2.175 ± 0.29
2.732ThrGln: 2.732 ± 0.394
2.378ThrArg: 2.378 ± 0.27
3.845ThrSer: 3.845 ± 0.447
3.693ThrThr: 3.693 ± 0.697
4.907ThrVal: 4.907 ± 0.59
0.708ThrTrp: 0.708 ± 0.191
2.833ThrTyr: 2.833 ± 0.319
0.0ThrXaa: 0.0 ± 0.0
Val
6.172ValAla: 6.172 ± 0.445
1.012ValCys: 1.012 ± 0.21
4.755ValAsp: 4.755 ± 0.449
4.25ValGlu: 4.25 ± 0.468
2.479ValPhe: 2.479 ± 0.324
5.059ValGly: 5.059 ± 0.616
1.062ValHis: 1.062 ± 0.21
4.098ValIle: 4.098 ± 0.541
5.868ValLys: 5.868 ± 0.515
4.401ValLeu: 4.401 ± 0.491
2.226ValMet: 2.226 ± 0.325
4.047ValAsn: 4.047 ± 0.443
2.428ValPro: 2.428 ± 0.378
2.175ValGln: 2.175 ± 0.332
3.035ValArg: 3.035 ± 0.374
5.008ValSer: 5.008 ± 0.441
4.604ValThr: 4.604 ± 0.438
4.3ValVal: 4.3 ± 0.514
0.759ValTrp: 0.759 ± 0.22
2.833ValTyr: 2.833 ± 0.332
0.0ValXaa: 0.0 ± 0.0
Trp
0.304TrpAla: 0.304 ± 0.149
0.152TrpCys: 0.152 ± 0.078
1.012TrpAsp: 1.012 ± 0.207
0.911TrpGlu: 0.911 ± 0.19
0.455TrpPhe: 0.455 ± 0.155
0.708TrpGly: 0.708 ± 0.155
0.051TrpHis: 0.051 ± 0.041
0.708TrpIle: 0.708 ± 0.204
0.86TrpLys: 0.86 ± 0.182
1.265TrpLeu: 1.265 ± 0.268
0.354TrpMet: 0.354 ± 0.145
0.86TrpAsn: 0.86 ± 0.22
0.253TrpPro: 0.253 ± 0.102
0.455TrpGln: 0.455 ± 0.131
0.506TrpArg: 0.506 ± 0.196
0.809TrpSer: 0.809 ± 0.18
0.759TrpThr: 0.759 ± 0.158
0.809TrpVal: 0.809 ± 0.187
0.253TrpTrp: 0.253 ± 0.112
0.759TrpTyr: 0.759 ± 0.167
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.884TyrAla: 2.884 ± 0.376
0.911TyrCys: 0.911 ± 0.197
2.58TyrAsp: 2.58 ± 0.388
2.732TyrGlu: 2.732 ± 0.316
1.821TyrPhe: 1.821 ± 0.242
3.137TyrGly: 3.137 ± 0.424
0.607TyrHis: 0.607 ± 0.188
2.631TyrIle: 2.631 ± 0.345
3.389TyrLys: 3.389 ± 0.316
3.642TyrLeu: 3.642 ± 0.419
1.164TyrMet: 1.164 ± 0.235
2.277TyrAsn: 2.277 ± 0.335
1.771TyrPro: 1.771 ± 0.288
1.619TyrGln: 1.619 ± 0.318
1.315TyrArg: 1.315 ± 0.205
2.681TyrSer: 2.681 ± 0.295
3.389TyrThr: 3.389 ± 0.463
2.479TyrVal: 2.479 ± 0.34
0.354TyrTrp: 0.354 ± 0.128
1.568TyrTyr: 1.568 ± 0.268
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (19768 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski