Amino acid dipepetide frequency for Klebsiella phage vB_KpnM_GF

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.466AlaAla: 5.466 ± 0.708
0.444AlaCys: 0.444 ± 0.14
4.533AlaAsp: 4.533 ± 0.42
4.489AlaGlu: 4.489 ± 0.535
2.444AlaPhe: 2.444 ± 0.351
4.933AlaGly: 4.933 ± 0.667
1.244AlaHis: 1.244 ± 0.23
4.978AlaIle: 4.978 ± 0.465
5.778AlaLys: 5.778 ± 0.476
6.577AlaLeu: 6.577 ± 0.611
1.467AlaMet: 1.467 ± 0.215
4.0AlaAsn: 4.0 ± 0.391
2.267AlaPro: 2.267 ± 0.267
3.067AlaGln: 3.067 ± 0.36
3.911AlaArg: 3.911 ± 0.377
4.266AlaSer: 4.266 ± 0.48
3.555AlaThr: 3.555 ± 0.544
3.555AlaVal: 3.555 ± 0.443
1.067AlaTrp: 1.067 ± 0.237
2.8AlaTyr: 2.8 ± 0.394
0.0AlaXaa: 0.0 ± 0.0
Cys
0.622CysAla: 0.622 ± 0.161
0.222CysCys: 0.222 ± 0.114
0.711CysAsp: 0.711 ± 0.177
0.711CysGlu: 0.711 ± 0.198
0.356CysPhe: 0.356 ± 0.112
0.8CysGly: 0.8 ± 0.228
0.311CysHis: 0.311 ± 0.117
0.4CysIle: 0.4 ± 0.135
0.756CysLys: 0.756 ± 0.186
0.8CysLeu: 0.8 ± 0.184
0.356CysMet: 0.356 ± 0.135
0.444CysAsn: 0.444 ± 0.172
0.533CysPro: 0.533 ± 0.175
0.533CysGln: 0.533 ± 0.143
0.711CysArg: 0.711 ± 0.176
0.578CysSer: 0.578 ± 0.148
0.533CysThr: 0.533 ± 0.149
0.889CysVal: 0.889 ± 0.217
0.178CysTrp: 0.178 ± 0.088
0.444CysTyr: 0.444 ± 0.149
0.0CysXaa: 0.0 ± 0.0
Asp
4.089AspAla: 4.089 ± 0.393
0.8AspCys: 0.8 ± 0.184
4.355AspAsp: 4.355 ± 0.478
5.066AspGlu: 5.066 ± 0.539
3.111AspPhe: 3.111 ± 0.394
5.244AspGly: 5.244 ± 0.409
0.978AspHis: 0.978 ± 0.228
5.822AspIle: 5.822 ± 0.469
4.933AspLys: 4.933 ± 0.522
5.555AspLeu: 5.555 ± 0.515
2.133AspMet: 2.133 ± 0.303
3.289AspAsn: 3.289 ± 0.388
2.178AspPro: 2.178 ± 0.316
1.333AspGln: 1.333 ± 0.274
2.622AspArg: 2.622 ± 0.332
4.578AspSer: 4.578 ± 0.342
2.622AspThr: 2.622 ± 0.385
3.289AspVal: 3.289 ± 0.358
0.978AspTrp: 0.978 ± 0.232
3.022AspTyr: 3.022 ± 0.452
0.0AspXaa: 0.0 ± 0.0
Glu
4.978GluAla: 4.978 ± 0.46
1.022GluCys: 1.022 ± 0.246
3.778GluAsp: 3.778 ± 0.445
4.355GluGlu: 4.355 ± 0.492
2.755GluPhe: 2.755 ± 0.284
4.755GluGly: 4.755 ± 0.445
1.378GluHis: 1.378 ± 0.213
5.378GluIle: 5.378 ± 0.412
4.266GluLys: 4.266 ± 0.551
7.289GluLeu: 7.289 ± 0.601
2.444GluMet: 2.444 ± 0.335
3.733GluAsn: 3.733 ± 0.529
2.044GluPro: 2.044 ± 0.342
2.0GluGln: 2.0 ± 0.258
2.4GluArg: 2.4 ± 0.34
4.133GluSer: 4.133 ± 0.454
3.822GluThr: 3.822 ± 0.385
4.8GluVal: 4.8 ± 0.474
1.111GluTrp: 1.111 ± 0.202
2.622GluTyr: 2.622 ± 0.396
0.0GluXaa: 0.0 ± 0.0
Phe
2.711PheAla: 2.711 ± 0.385
0.4PheCys: 0.4 ± 0.115
2.8PheAsp: 2.8 ± 0.378
3.422PheGlu: 3.422 ± 0.326
1.467PhePhe: 1.467 ± 0.199
3.111PheGly: 3.111 ± 0.387
0.444PheHis: 0.444 ± 0.133
2.4PheIle: 2.4 ± 0.313
3.333PheLys: 3.333 ± 0.447
2.444PheLeu: 2.444 ± 0.274
1.6PheMet: 1.6 ± 0.289
2.622PheAsn: 2.622 ± 0.304
1.022PhePro: 1.022 ± 0.209
1.067PheGln: 1.067 ± 0.18
1.6PheArg: 1.6 ± 0.225
3.022PheSer: 3.022 ± 0.383
2.444PheThr: 2.444 ± 0.367
3.067PheVal: 3.067 ± 0.299
0.711PheTrp: 0.711 ± 0.187
1.467PheTyr: 1.467 ± 0.231
0.0PheXaa: 0.0 ± 0.0
Gly
3.689GlyAla: 3.689 ± 0.388
0.533GlyCys: 0.533 ± 0.177
4.311GlyAsp: 4.311 ± 0.504
4.355GlyGlu: 4.355 ± 0.374
2.089GlyPhe: 2.089 ± 0.282
3.822GlyGly: 3.822 ± 0.441
1.067GlyHis: 1.067 ± 0.206
4.533GlyIle: 4.533 ± 0.396
6.044GlyLys: 6.044 ± 0.612
5.511GlyLeu: 5.511 ± 0.598
1.911GlyMet: 1.911 ± 0.297
3.333GlyAsn: 3.333 ± 0.512
1.511GlyPro: 1.511 ± 0.322
2.755GlyGln: 2.755 ± 0.379
3.022GlyArg: 3.022 ± 0.385
4.178GlySer: 4.178 ± 0.406
3.955GlyThr: 3.955 ± 0.36
4.266GlyVal: 4.266 ± 0.468
1.022GlyTrp: 1.022 ± 0.24
3.467GlyTyr: 3.467 ± 0.384
0.0GlyXaa: 0.0 ± 0.0
His
0.889HisAla: 0.889 ± 0.191
0.267HisCys: 0.267 ± 0.135
1.2HisAsp: 1.2 ± 0.198
1.289HisGlu: 1.289 ± 0.252
0.756HisPhe: 0.756 ± 0.184
1.022HisGly: 1.022 ± 0.22
0.4HisHis: 0.4 ± 0.128
1.511HisIle: 1.511 ± 0.238
1.511HisLys: 1.511 ± 0.301
2.0HisLeu: 2.0 ± 0.332
0.178HisMet: 0.178 ± 0.078
0.622HisAsn: 0.622 ± 0.174
1.289HisPro: 1.289 ± 0.246
0.489HisGln: 0.489 ± 0.137
0.756HisArg: 0.756 ± 0.21
0.978HisSer: 0.978 ± 0.196
0.8HisThr: 0.8 ± 0.178
1.289HisVal: 1.289 ± 0.253
0.356HisTrp: 0.356 ± 0.124
0.578HisTyr: 0.578 ± 0.153
0.0HisXaa: 0.0 ± 0.0
Ile
4.489IleAla: 4.489 ± 0.4
0.711IleCys: 0.711 ± 0.183
5.022IleAsp: 5.022 ± 0.501
5.022IleGlu: 5.022 ± 0.472
2.755IlePhe: 2.755 ± 0.307
3.778IleGly: 3.778 ± 0.33
1.422IleHis: 1.422 ± 0.265
4.978IleIle: 4.978 ± 0.621
6.622IleLys: 6.622 ± 0.669
4.444IleLeu: 4.444 ± 0.416
2.444IleMet: 2.444 ± 0.357
4.311IleAsn: 4.311 ± 0.471
2.844IlePro: 2.844 ± 0.322
2.578IleGln: 2.578 ± 0.331
4.133IleArg: 4.133 ± 0.45
3.955IleSer: 3.955 ± 0.4
4.533IleThr: 4.533 ± 0.475
4.4IleVal: 4.4 ± 0.452
0.489IleTrp: 0.489 ± 0.175
1.955IleTyr: 1.955 ± 0.244
0.0IleXaa: 0.0 ± 0.0
Lys
5.911LysAla: 5.911 ± 0.592
0.844LysCys: 0.844 ± 0.24
5.778LysAsp: 5.778 ± 0.546
5.555LysGlu: 5.555 ± 0.519
3.689LysPhe: 3.689 ± 0.474
4.889LysGly: 4.889 ± 0.445
1.867LysHis: 1.867 ± 0.304
4.978LysIle: 4.978 ± 0.475
4.578LysLys: 4.578 ± 0.509
5.866LysLeu: 5.866 ± 0.486
2.622LysMet: 2.622 ± 0.433
3.6LysAsn: 3.6 ± 0.343
2.889LysPro: 2.889 ± 0.404
2.711LysGln: 2.711 ± 0.342
3.2LysArg: 3.2 ± 0.439
4.622LysSer: 4.622 ± 0.435
4.4LysThr: 4.4 ± 0.523
4.578LysVal: 4.578 ± 0.691
1.111LysTrp: 1.111 ± 0.241
3.467LysTyr: 3.467 ± 0.357
0.0LysXaa: 0.0 ± 0.0
Leu
5.733LeuAla: 5.733 ± 0.517
0.889LeuCys: 0.889 ± 0.244
5.022LeuAsp: 5.022 ± 0.461
5.866LeuGlu: 5.866 ± 0.593
3.244LeuPhe: 3.244 ± 0.394
4.489LeuGly: 4.489 ± 0.402
1.422LeuHis: 1.422 ± 0.261
5.111LeuIle: 5.111 ± 0.442
6.355LeuLys: 6.355 ± 0.596
5.333LeuLeu: 5.333 ± 0.492
2.444LeuMet: 2.444 ± 0.32
4.311LeuAsn: 4.311 ± 0.43
3.644LeuPro: 3.644 ± 0.387
2.667LeuGln: 2.667 ± 0.41
3.911LeuArg: 3.911 ± 0.457
4.4LeuSer: 4.4 ± 0.393
4.622LeuThr: 4.622 ± 0.468
4.222LeuVal: 4.222 ± 0.372
0.756LeuTrp: 0.756 ± 0.19
3.2LeuTyr: 3.2 ± 0.467
0.0LeuXaa: 0.0 ± 0.0
Met
2.444MetAla: 2.444 ± 0.335
0.222MetCys: 0.222 ± 0.102
1.555MetAsp: 1.555 ± 0.254
1.689MetGlu: 1.689 ± 0.33
1.333MetPhe: 1.333 ± 0.21
1.467MetGly: 1.467 ± 0.27
0.756MetHis: 0.756 ± 0.196
1.867MetIle: 1.867 ± 0.303
3.022MetLys: 3.022 ± 0.446
2.489MetLeu: 2.489 ± 0.277
0.844MetMet: 0.844 ± 0.21
1.733MetAsn: 1.733 ± 0.259
0.889MetPro: 0.889 ± 0.174
1.2MetGln: 1.2 ± 0.268
1.2MetArg: 1.2 ± 0.233
2.0MetSer: 2.0 ± 0.266
1.644MetThr: 1.644 ± 0.275
1.467MetVal: 1.467 ± 0.23
0.222MetTrp: 0.222 ± 0.114
0.978MetTyr: 0.978 ± 0.191
0.0MetXaa: 0.0 ± 0.0
Asn
3.911AsnAla: 3.911 ± 0.431
0.8AsnCys: 0.8 ± 0.186
3.111AsnAsp: 3.111 ± 0.398
3.733AsnGlu: 3.733 ± 0.419
2.355AsnPhe: 2.355 ± 0.277
4.0AsnGly: 4.0 ± 0.42
1.111AsnHis: 1.111 ± 0.222
3.511AsnIle: 3.511 ± 0.406
3.778AsnLys: 3.778 ± 0.425
4.355AsnLeu: 4.355 ± 0.45
1.555AsnMet: 1.555 ± 0.273
2.8AsnAsn: 2.8 ± 0.505
2.133AsnPro: 2.133 ± 0.283
1.422AsnGln: 1.422 ± 0.231
2.889AsnArg: 2.889 ± 0.305
2.8AsnSer: 2.8 ± 0.348
2.622AsnThr: 2.622 ± 0.338
3.467AsnVal: 3.467 ± 0.313
0.711AsnTrp: 0.711 ± 0.171
2.4AsnTyr: 2.4 ± 0.309
0.0AsnXaa: 0.0 ± 0.0
Pro
2.889ProAla: 2.889 ± 0.353
0.578ProCys: 0.578 ± 0.164
2.4ProAsp: 2.4 ± 0.281
2.844ProGlu: 2.844 ± 0.416
1.244ProPhe: 1.244 ± 0.231
2.711ProGly: 2.711 ± 0.38
0.489ProHis: 0.489 ± 0.153
2.533ProIle: 2.533 ± 0.325
2.355ProLys: 2.355 ± 0.364
1.955ProLeu: 1.955 ± 0.282
0.978ProMet: 0.978 ± 0.22
1.644ProAsn: 1.644 ± 0.244
1.156ProPro: 1.156 ± 0.268
1.244ProGln: 1.244 ± 0.187
1.244ProArg: 1.244 ± 0.217
2.711ProSer: 2.711 ± 0.363
2.311ProThr: 2.311 ± 0.311
2.444ProVal: 2.444 ± 0.275
0.933ProTrp: 0.933 ± 0.204
1.022ProTyr: 1.022 ± 0.215
0.0ProXaa: 0.0 ± 0.0
Gln
2.755GlnAla: 2.755 ± 0.387
0.178GlnCys: 0.178 ± 0.081
2.089GlnAsp: 2.089 ± 0.277
2.178GlnGlu: 2.178 ± 0.272
1.333GlnPhe: 1.333 ± 0.286
2.4GlnGly: 2.4 ± 0.351
0.4GlnHis: 0.4 ± 0.132
2.355GlnIle: 2.355 ± 0.292
2.311GlnLys: 2.311 ± 0.284
3.511GlnLeu: 3.511 ± 0.387
1.244GlnMet: 1.244 ± 0.233
1.333GlnAsn: 1.333 ± 0.238
0.8GlnPro: 0.8 ± 0.187
1.156GlnGln: 1.156 ± 0.277
1.867GlnArg: 1.867 ± 0.279
1.689GlnSer: 1.689 ± 0.279
1.955GlnThr: 1.955 ± 0.324
2.178GlnVal: 2.178 ± 0.295
0.844GlnTrp: 0.844 ± 0.184
2.178GlnTyr: 2.178 ± 0.406
0.0GlnXaa: 0.0 ± 0.0
Arg
3.022ArgAla: 3.022 ± 0.423
0.8ArgCys: 0.8 ± 0.211
3.289ArgAsp: 3.289 ± 0.369
3.555ArgGlu: 3.555 ± 0.366
1.733ArgPhe: 1.733 ± 0.246
3.644ArgGly: 3.644 ± 0.426
0.978ArgHis: 0.978 ± 0.196
3.289ArgIle: 3.289 ± 0.315
3.644ArgLys: 3.644 ± 0.451
3.378ArgLeu: 3.378 ± 0.482
0.978ArgMet: 0.978 ± 0.187
2.444ArgAsn: 2.444 ± 0.354
1.511ArgPro: 1.511 ± 0.247
1.778ArgGln: 1.778 ± 0.299
2.0ArgArg: 2.0 ± 0.301
2.533ArgSer: 2.533 ± 0.351
2.355ArgThr: 2.355 ± 0.336
2.622ArgVal: 2.622 ± 0.342
0.8ArgTrp: 0.8 ± 0.205
1.911ArgTyr: 1.911 ± 0.282
0.0ArgXaa: 0.0 ± 0.0
Ser
4.311SerAla: 4.311 ± 0.395
0.667SerCys: 0.667 ± 0.17
3.778SerAsp: 3.778 ± 0.437
4.0SerGlu: 4.0 ± 0.42
2.755SerPhe: 2.755 ± 0.393
3.955SerGly: 3.955 ± 0.324
1.022SerHis: 1.022 ± 0.218
4.311SerIle: 4.311 ± 0.454
4.711SerLys: 4.711 ± 0.508
4.266SerLeu: 4.266 ± 0.384
1.244SerMet: 1.244 ± 0.233
3.067SerAsn: 3.067 ± 0.356
2.178SerPro: 2.178 ± 0.35
2.4SerGln: 2.4 ± 0.327
3.022SerArg: 3.022 ± 0.349
3.644SerSer: 3.644 ± 0.393
3.911SerThr: 3.911 ± 0.426
3.644SerVal: 3.644 ± 0.374
0.711SerTrp: 0.711 ± 0.204
2.4SerTyr: 2.4 ± 0.301
0.0SerXaa: 0.0 ± 0.0
Thr
4.266ThrAla: 4.266 ± 0.564
0.533ThrCys: 0.533 ± 0.18
2.755ThrAsp: 2.755 ± 0.376
2.933ThrGlu: 2.933 ± 0.397
2.578ThrPhe: 2.578 ± 0.317
3.822ThrGly: 3.822 ± 0.424
1.111ThrHis: 1.111 ± 0.201
4.489ThrIle: 4.489 ± 0.457
3.6ThrLys: 3.6 ± 0.377
4.4ThrLeu: 4.4 ± 0.546
1.422ThrMet: 1.422 ± 0.248
3.022ThrAsn: 3.022 ± 0.369
2.755ThrPro: 2.755 ± 0.391
1.911ThrGln: 1.911 ± 0.265
2.889ThrArg: 2.889 ± 0.353
2.711ThrSer: 2.711 ± 0.366
3.555ThrThr: 3.555 ± 0.422
4.222ThrVal: 4.222 ± 0.473
0.978ThrTrp: 0.978 ± 0.205
1.822ThrTyr: 1.822 ± 0.32
0.0ThrXaa: 0.0 ± 0.0
Val
4.4ValAla: 4.4 ± 0.416
0.533ValCys: 0.533 ± 0.143
5.066ValAsp: 5.066 ± 0.386
4.578ValGlu: 4.578 ± 0.45
3.022ValPhe: 3.022 ± 0.469
3.289ValGly: 3.289 ± 0.492
1.111ValHis: 1.111 ± 0.216
4.666ValIle: 4.666 ± 0.416
5.244ValLys: 5.244 ± 0.529
4.0ValLeu: 4.0 ± 0.404
1.911ValMet: 1.911 ± 0.238
3.733ValAsn: 3.733 ± 0.42
2.0ValPro: 2.0 ± 0.333
2.267ValGln: 2.267 ± 0.281
2.667ValArg: 2.667 ± 0.296
3.333ValSer: 3.333 ± 0.423
3.333ValThr: 3.333 ± 0.395
3.955ValVal: 3.955 ± 0.416
0.756ValTrp: 0.756 ± 0.203
2.622ValTyr: 2.622 ± 0.351
0.0ValXaa: 0.0 ± 0.0
Trp
0.978TrpAla: 0.978 ± 0.202
0.089TrpCys: 0.089 ± 0.06
1.067TrpAsp: 1.067 ± 0.189
0.889TrpGlu: 0.889 ± 0.173
0.533TrpPhe: 0.533 ± 0.143
0.622TrpGly: 0.622 ± 0.169
0.178TrpHis: 0.178 ± 0.078
0.756TrpIle: 0.756 ± 0.213
1.689TrpLys: 1.689 ± 0.303
1.022TrpLeu: 1.022 ± 0.223
0.356TrpMet: 0.356 ± 0.118
0.8TrpAsn: 0.8 ± 0.184
0.4TrpPro: 0.4 ± 0.125
0.4TrpGln: 0.4 ± 0.137
0.711TrpArg: 0.711 ± 0.161
0.978TrpSer: 0.978 ± 0.211
0.889TrpThr: 0.889 ± 0.201
1.022TrpVal: 1.022 ± 0.181
0.178TrpTrp: 0.178 ± 0.079
1.022TrpTyr: 1.022 ± 0.187
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.289TyrAla: 3.289 ± 0.414
0.356TyrCys: 0.356 ± 0.127
3.378TyrAsp: 3.378 ± 0.358
2.533TyrGlu: 2.533 ± 0.32
1.644TyrPhe: 1.644 ± 0.258
2.622TyrGly: 2.622 ± 0.364
0.489TyrHis: 0.489 ± 0.122
3.022TyrIle: 3.022 ± 0.383
2.622TyrLys: 2.622 ± 0.307
2.444TyrLeu: 2.444 ± 0.331
0.844TyrMet: 0.844 ± 0.189
2.622TyrAsn: 2.622 ± 0.323
1.644TyrPro: 1.644 ± 0.305
1.778TyrGln: 1.778 ± 0.279
1.511TyrArg: 1.511 ± 0.261
2.889TyrSer: 2.889 ± 0.312
1.911TyrThr: 1.911 ± 0.297
3.111TyrVal: 3.111 ± 0.445
0.667TyrTrp: 0.667 ± 0.159
1.422TyrTyr: 1.422 ± 0.304
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 95 proteins (22502 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski