Amino acid dipeptide frequency for Streptomyces phage Chymera

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs in the proteome.
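The counting itself can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function names are hypothetical, sequences are assumed to be in one-letter code, and normalizing by the total number of dipeptides is an assumption (the database may normalize differently).

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is folded into 'X' (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # zip(seq, seq[1:]) yields overlapping pairs and never spans two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    return counts


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies for all 441 dipeptides (zero if unseen)."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not real phage proteins.
freqs = dipeptide_frequencies(["MAAL", "AAG"])
print(freqs["AA"])
```

Pairs are counted within each protein only, so the last residue of one protein is never paired with the first residue of the next.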

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
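The uniform expectation and the over/under comparison can be written down as a short Python sketch. The function names are hypothetical, and the simple threshold rule is an assumption: the page does not state exactly how the red and blue markings are derived.

```python
def uniform_expectation_per_mille(n_residues=20):
    """Expected per-mille frequency if all n_residues**2 dipeptides were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed threshold rule)."""
    if observed_per_mille > expected_per_mille:
        return "overrepresented"
    if observed_per_mille < expected_per_mille:
        return "underrepresented"
    return "as expected"


expected = uniform_expectation_per_mille(20)  # 1000 / 400 = 2.5 per mille, i.e. 0.25%

# Two values taken from the table: AlaAla and CysCys.
print(classify(20.652, expected))
print(classify(0.0, expected))
```

With 21 residue types (including Xaa) the same function gives 1000/441, roughly 2.27‰.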

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
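As a small worked example of the unit conversion, the following Python sketch turns a tabulated per-mille value into a fraction and a percentage. The helper names are hypothetical; the same scaling applies to the ± error values.

```python
import math


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla from the table: 20.652 per mille with an error of 2.001 per mille.
fraction = per_mille_to_fraction(20.652)
error = per_mille_to_fraction(2.001)
print(math.isclose(fraction, 0.020652), math.isclose(error, 0.002001))
```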

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
20.652AlaAla: 20.652 ± 2.001
0.934AlaCys: 0.934 ± 0.359
8.13AlaAsp: 8.13 ± 0.86
9.065AlaGlu: 9.065 ± 1.106
1.962AlaPhe: 1.962 ± 0.436
10.466AlaGly: 10.466 ± 0.904
2.99AlaHis: 2.99 ± 0.775
5.046AlaIle: 5.046 ± 0.924
2.523AlaLys: 2.523 ± 0.531
13.55AlaLeu: 13.55 ± 1.186
2.243AlaMet: 2.243 ± 0.59
2.149AlaAsn: 2.149 ± 0.395
8.317AlaPro: 8.317 ± 0.982
4.859AlaGln: 4.859 ± 0.813
11.307AlaArg: 11.307 ± 1.169
5.607AlaSer: 5.607 ± 0.751
7.289AlaThr: 7.289 ± 1.053
8.784AlaVal: 8.784 ± 0.847
2.056AlaTrp: 2.056 ± 0.566
2.99AlaTyr: 2.99 ± 0.46
0.0AlaXaa: 0.0 ± 0.0
Cys
0.934CysAla: 0.934 ± 0.368
0.0CysCys: 0.0 ± 0.0
0.748CysAsp: 0.748 ± 0.261
1.215CysGlu: 1.215 ± 0.425
0.093CysPhe: 0.093 ± 0.081
1.682CysGly: 1.682 ± 0.663
0.467CysHis: 0.467 ± 0.256
0.093CysIle: 0.093 ± 0.089
0.28CysLys: 0.28 ± 0.15
1.121CysLeu: 1.121 ± 0.359
0.0CysMet: 0.0 ± 0.0
0.374CysAsn: 0.374 ± 0.165
1.495CysPro: 1.495 ± 0.67
0.28CysGln: 0.28 ± 0.147
0.654CysArg: 0.654 ± 0.296
0.187CysSer: 0.187 ± 0.133
0.748CysThr: 0.748 ± 0.238
0.187CysVal: 0.187 ± 0.146
0.187CysTrp: 0.187 ± 0.112
0.093CysTyr: 0.093 ± 0.088
0.0CysXaa: 0.0 ± 0.0
Asp
7.289AspAla: 7.289 ± 0.902
0.654AspCys: 0.654 ± 0.285
5.327AspAsp: 5.327 ± 0.998
4.859AspGlu: 4.859 ± 0.895
1.028AspPhe: 1.028 ± 0.282
7.569AspGly: 7.569 ± 0.935
1.121AspHis: 1.121 ± 0.329
2.71AspIle: 2.71 ± 0.473
1.776AspLys: 1.776 ± 0.292
5.887AspLeu: 5.887 ± 0.82
1.028AspMet: 1.028 ± 0.295
0.654AspAsn: 0.654 ± 0.266
3.458AspPro: 3.458 ± 0.408
2.149AspGln: 2.149 ± 0.483
4.299AspArg: 4.299 ± 0.628
3.084AspSer: 3.084 ± 0.772
4.018AspThr: 4.018 ± 0.428
4.766AspVal: 4.766 ± 0.648
0.934AspTrp: 0.934 ± 0.278
1.215AspTyr: 1.215 ± 0.301
0.0AspXaa: 0.0 ± 0.0
Glu
7.663GluAla: 7.663 ± 0.955
0.934GluCys: 0.934 ± 0.254
3.551GluAsp: 3.551 ± 0.632
4.205GluGlu: 4.205 ± 0.556
1.028GluPhe: 1.028 ± 0.243
4.579GluGly: 4.579 ± 0.589
2.617GluHis: 2.617 ± 0.612
2.149GluIle: 2.149 ± 0.502
0.934GluLys: 0.934 ± 0.232
5.981GluLeu: 5.981 ± 0.842
1.028GluMet: 1.028 ± 0.33
0.654GluAsn: 0.654 ± 0.247
2.71GluPro: 2.71 ± 0.456
2.43GluGln: 2.43 ± 0.502
6.168GluArg: 6.168 ± 0.854
2.71GluSer: 2.71 ± 0.427
3.645GluThr: 3.645 ± 0.532
4.205GluVal: 4.205 ± 0.553
0.841GluTrp: 0.841 ± 0.245
0.934GluTyr: 0.934 ± 0.26
0.0GluXaa: 0.0 ± 0.0
Phe
1.962PheAla: 1.962 ± 0.447
0.187PheCys: 0.187 ± 0.125
1.215PheAsp: 1.215 ± 0.336
1.215PheGlu: 1.215 ± 0.403
0.841PhePhe: 0.841 ± 0.302
2.243PheGly: 2.243 ± 0.47
0.561PheHis: 0.561 ± 0.316
0.934PheIle: 0.934 ± 0.305
0.374PheLys: 0.374 ± 0.154
1.682PheLeu: 1.682 ± 0.346
0.093PheMet: 0.093 ± 0.099
0.467PheAsn: 0.467 ± 0.262
1.121PhePro: 1.121 ± 0.273
0.187PheGln: 0.187 ± 0.12
1.682PheArg: 1.682 ± 0.397
1.589PheSer: 1.589 ± 0.337
1.402PheThr: 1.402 ± 0.357
1.869PheVal: 1.869 ± 0.432
0.187PheTrp: 0.187 ± 0.178
0.28PheTyr: 0.28 ± 0.14
0.0PheXaa: 0.0 ± 0.0
Gly
9.065GlyAla: 9.065 ± 0.862
1.028GlyCys: 1.028 ± 0.48
6.448GlyAsp: 6.448 ± 0.741
4.953GlyGlu: 4.953 ± 0.58
2.43GlyPhe: 2.43 ± 0.584
6.541GlyGly: 6.541 ± 0.715
2.336GlyHis: 2.336 ± 0.62
2.803GlyIle: 2.803 ± 0.664
2.617GlyLys: 2.617 ± 0.504
8.41GlyLeu: 8.41 ± 0.982
2.523GlyMet: 2.523 ± 0.576
1.776GlyAsn: 1.776 ± 0.392
4.766GlyPro: 4.766 ± 0.73
2.897GlyGln: 2.897 ± 0.428
7.756GlyArg: 7.756 ± 0.99
4.672GlySer: 4.672 ± 0.928
4.766GlyThr: 4.766 ± 0.862
6.915GlyVal: 6.915 ± 0.963
2.897GlyTrp: 2.897 ± 0.557
2.803GlyTyr: 2.803 ± 0.48
0.0GlyXaa: 0.0 ± 0.0
His
2.336HisAla: 2.336 ± 0.382
0.467HisCys: 0.467 ± 0.19
1.962HisAsp: 1.962 ± 0.359
0.841HisGlu: 0.841 ± 0.241
0.467HisPhe: 0.467 ± 0.217
2.056HisGly: 2.056 ± 0.487
0.841HisHis: 0.841 ± 0.272
0.654HisIle: 0.654 ± 0.185
0.374HisLys: 0.374 ± 0.145
2.43HisLeu: 2.43 ± 0.475
0.467HisMet: 0.467 ± 0.22
0.28HisAsn: 0.28 ± 0.139
2.99HisPro: 2.99 ± 0.616
1.121HisGln: 1.121 ± 0.302
2.243HisArg: 2.243 ± 0.435
1.028HisSer: 1.028 ± 0.368
1.495HisThr: 1.495 ± 0.356
1.682HisVal: 1.682 ± 0.44
0.374HisTrp: 0.374 ± 0.188
0.841HisTyr: 0.841 ± 0.287
0.0HisXaa: 0.0 ± 0.0
Ile
4.299IleAla: 4.299 ± 0.649
0.561IleCys: 0.561 ± 0.233
2.243IleAsp: 2.243 ± 0.474
2.99IleGlu: 2.99 ± 0.54
0.561IlePhe: 0.561 ± 0.244
3.084IleGly: 3.084 ± 0.451
0.841IleHis: 0.841 ± 0.257
0.467IleIle: 0.467 ± 0.168
1.308IleLys: 1.308 ± 0.362
2.897IleLeu: 2.897 ± 0.543
0.654IleMet: 0.654 ± 0.251
0.934IleAsn: 0.934 ± 0.302
2.336IlePro: 2.336 ± 0.486
0.561IleGln: 0.561 ± 0.186
3.645IleArg: 3.645 ± 0.717
1.308IleSer: 1.308 ± 0.35
2.243IleThr: 2.243 ± 0.611
2.149IleVal: 2.149 ± 0.534
0.561IleTrp: 0.561 ± 0.336
0.934IleTyr: 0.934 ± 0.327
0.0IleXaa: 0.0 ± 0.0
Lys
3.177LysAla: 3.177 ± 0.78
0.093LysCys: 0.093 ± 0.114
2.056LysAsp: 2.056 ± 0.509
1.215LysGlu: 1.215 ± 0.319
0.561LysPhe: 0.561 ± 0.214
2.243LysGly: 2.243 ± 0.482
0.187LysHis: 0.187 ± 0.114
1.028LysIle: 1.028 ± 0.292
1.308LysLys: 1.308 ± 0.298
2.243LysLeu: 2.243 ± 0.51
0.467LysMet: 0.467 ± 0.254
0.841LysAsn: 0.841 ± 0.289
0.934LysPro: 0.934 ± 0.234
0.374LysGln: 0.374 ± 0.138
2.43LysArg: 2.43 ± 0.505
1.308LysSer: 1.308 ± 0.404
1.495LysThr: 1.495 ± 0.415
1.402LysVal: 1.402 ± 0.344
0.561LysTrp: 0.561 ± 0.193
0.467LysTyr: 0.467 ± 0.204
0.0LysXaa: 0.0 ± 0.0
Leu
11.681LeuAla: 11.681 ± 1.133
1.028LeuCys: 1.028 ± 0.345
6.261LeuAsp: 6.261 ± 0.642
4.299LeuGlu: 4.299 ± 0.621
1.962LeuPhe: 1.962 ± 0.505
8.224LeuGly: 8.224 ± 1.23
1.776LeuHis: 1.776 ± 0.345
3.271LeuIle: 3.271 ± 0.709
1.962LeuLys: 1.962 ± 0.409
8.037LeuLeu: 8.037 ± 0.877
1.776LeuMet: 1.776 ± 0.331
2.056LeuAsn: 2.056 ± 0.399
5.233LeuPro: 5.233 ± 0.541
3.551LeuGln: 3.551 ± 0.584
8.13LeuArg: 8.13 ± 0.808
4.672LeuSer: 4.672 ± 0.771
8.037LeuThr: 8.037 ± 0.872
5.42LeuVal: 5.42 ± 0.657
0.934LeuTrp: 0.934 ± 0.295
2.336LeuTyr: 2.336 ± 0.422
0.0LeuXaa: 0.0 ± 0.0
Met
2.617MetAla: 2.617 ± 0.52
0.093MetCys: 0.093 ± 0.103
1.402MetAsp: 1.402 ± 0.47
1.308MetGlu: 1.308 ± 0.327
0.093MetPhe: 0.093 ± 0.088
0.748MetGly: 0.748 ± 0.251
0.561MetHis: 0.561 ± 0.248
0.654MetIle: 0.654 ± 0.221
0.28MetLys: 0.28 ± 0.154
1.402MetLeu: 1.402 ± 0.332
0.467MetMet: 0.467 ± 0.176
0.748MetAsn: 0.748 ± 0.202
1.495MetPro: 1.495 ± 0.383
0.374MetGln: 0.374 ± 0.159
2.056MetArg: 2.056 ± 0.355
1.495MetSer: 1.495 ± 0.241
2.243MetThr: 2.243 ± 0.346
1.962MetVal: 1.962 ± 0.398
0.374MetTrp: 0.374 ± 0.156
0.28MetTyr: 0.28 ± 0.162
0.0MetXaa: 0.0 ± 0.0
Asn
2.523AsnAla: 2.523 ± 0.441
0.28AsnCys: 0.28 ± 0.199
1.121AsnAsp: 1.121 ± 0.364
0.934AsnGlu: 0.934 ± 0.26
0.0AsnPhe: 0.0 ± 0.0
1.776AsnGly: 1.776 ± 0.392
0.654AsnHis: 0.654 ± 0.257
0.654AsnIle: 0.654 ± 0.222
0.561AsnLys: 0.561 ± 0.266
1.308AsnLeu: 1.308 ± 0.375
0.467AsnMet: 0.467 ± 0.236
0.187AsnAsn: 0.187 ± 0.114
1.589AsnPro: 1.589 ± 0.263
0.841AsnGln: 0.841 ± 0.243
1.308AsnArg: 1.308 ± 0.342
1.215AsnSer: 1.215 ± 0.321
0.561AsnThr: 0.561 ± 0.239
1.682AsnVal: 1.682 ± 0.378
0.28AsnTrp: 0.28 ± 0.148
0.654AsnTyr: 0.654 ± 0.243
0.0AsnXaa: 0.0 ± 0.0
Pro
9.158ProAla: 9.158 ± 0.905
0.748ProCys: 0.748 ± 0.331
3.738ProAsp: 3.738 ± 0.536
4.766ProGlu: 4.766 ± 0.648
1.215ProPhe: 1.215 ± 0.303
6.728ProGly: 6.728 ± 1.076
2.336ProHis: 2.336 ± 0.478
1.869ProIle: 1.869 ± 0.403
1.028ProLys: 1.028 ± 0.393
4.205ProLeu: 4.205 ± 0.701
1.776ProMet: 1.776 ± 0.425
1.121ProAsn: 1.121 ± 0.343
4.579ProPro: 4.579 ± 0.919
1.869ProGln: 1.869 ± 0.411
4.953ProArg: 4.953 ± 0.858
3.177ProSer: 3.177 ± 0.491
4.672ProThr: 4.672 ± 0.84
4.953ProVal: 4.953 ± 0.754
1.121ProTrp: 1.121 ± 0.292
1.121ProTyr: 1.121 ± 0.293
0.0ProXaa: 0.0 ± 0.0
Gln
4.486GlnAla: 4.486 ± 0.653
0.374GlnCys: 0.374 ± 0.19
1.589GlnAsp: 1.589 ± 0.335
2.523GlnGlu: 2.523 ± 0.501
0.934GlnPhe: 0.934 ± 0.281
2.523GlnGly: 2.523 ± 0.491
0.934GlnHis: 0.934 ± 0.275
0.841GlnIle: 0.841 ± 0.252
0.748GlnLys: 0.748 ± 0.304
3.925GlnLeu: 3.925 ± 0.563
0.561GlnMet: 0.561 ± 0.193
0.374GlnAsn: 0.374 ± 0.176
3.084GlnPro: 3.084 ± 0.5
1.308GlnGln: 1.308 ± 0.502
3.458GlnArg: 3.458 ± 0.556
1.215GlnSer: 1.215 ± 0.253
1.121GlnThr: 1.121 ± 0.256
2.43GlnVal: 2.43 ± 0.436
1.121GlnTrp: 1.121 ± 0.257
0.654GlnTyr: 0.654 ± 0.215
0.0GlnXaa: 0.0 ± 0.0
Arg
11.214ArgAla: 11.214 ± 1.072
0.934ArgCys: 0.934 ± 0.286
4.672ArgAsp: 4.672 ± 0.724
4.018ArgGlu: 4.018 ± 0.583
2.149ArgPhe: 2.149 ± 0.449
6.915ArgGly: 6.915 ± 0.773
1.962ArgHis: 1.962 ± 0.403
3.177ArgIle: 3.177 ± 0.555
2.43ArgLys: 2.43 ± 0.573
8.878ArgLeu: 8.878 ± 0.701
2.523ArgMet: 2.523 ± 0.521
1.776ArgAsn: 1.776 ± 0.38
6.074ArgPro: 6.074 ± 0.869
2.99ArgGln: 2.99 ± 0.379
7.569ArgArg: 7.569 ± 1.14
4.486ArgSer: 4.486 ± 0.699
5.607ArgThr: 5.607 ± 0.731
5.794ArgVal: 5.794 ± 0.758
2.336ArgTrp: 2.336 ± 0.428
2.523ArgTyr: 2.523 ± 0.394
0.0ArgXaa: 0.0 ± 0.0
Ser
6.822SerAla: 6.822 ± 0.851
0.374SerCys: 0.374 ± 0.235
2.99SerAsp: 2.99 ± 0.481
1.308SerGlu: 1.308 ± 0.341
1.402SerPhe: 1.402 ± 0.306
5.327SerGly: 5.327 ± 1.093
1.215SerHis: 1.215 ± 0.367
1.402SerIle: 1.402 ± 0.37
1.402SerLys: 1.402 ± 0.359
3.364SerLeu: 3.364 ± 0.478
1.215SerMet: 1.215 ± 0.358
0.841SerAsn: 0.841 ± 0.234
2.897SerPro: 2.897 ± 0.588
1.495SerGln: 1.495 ± 0.392
4.299SerArg: 4.299 ± 0.673
2.243SerSer: 2.243 ± 0.477
4.018SerThr: 4.018 ± 0.751
3.551SerVal: 3.551 ± 0.669
0.374SerTrp: 0.374 ± 0.168
1.402SerTyr: 1.402 ± 0.316
0.0SerXaa: 0.0 ± 0.0
Thr
9.345ThrAla: 9.345 ± 1.005
0.561ThrCys: 0.561 ± 0.226
4.018ThrAsp: 4.018 ± 0.579
3.458ThrGlu: 3.458 ± 0.522
1.589ThrPhe: 1.589 ± 0.393
6.915ThrGly: 6.915 ± 0.847
0.934ThrHis: 0.934 ± 0.248
2.71ThrIle: 2.71 ± 0.514
1.402ThrLys: 1.402 ± 0.311
6.168ThrLeu: 6.168 ± 1.015
0.841ThrMet: 0.841 ± 0.25
1.028ThrAsn: 1.028 ± 0.333
5.327ThrPro: 5.327 ± 0.706
1.776ThrGln: 1.776 ± 0.412
4.392ThrArg: 4.392 ± 0.649
2.71ThrSer: 2.71 ± 0.423
5.42ThrThr: 5.42 ± 0.951
4.766ThrVal: 4.766 ± 0.827
1.121ThrTrp: 1.121 ± 0.403
1.215ThrTyr: 1.215 ± 0.323
0.0ThrXaa: 0.0 ± 0.0
Val
10.093ValAla: 10.093 ± 1.059
0.841ValCys: 0.841 ± 0.288
3.925ValAsp: 3.925 ± 0.548
3.645ValGlu: 3.645 ± 0.61
1.402ValPhe: 1.402 ± 0.32
4.672ValGly: 4.672 ± 0.66
1.495ValHis: 1.495 ± 0.383
2.336ValIle: 2.336 ± 0.423
2.056ValLys: 2.056 ± 0.501
6.355ValLeu: 6.355 ± 0.794
1.589ValMet: 1.589 ± 0.32
1.028ValAsn: 1.028 ± 0.289
4.486ValPro: 4.486 ± 0.715
3.271ValGln: 3.271 ± 0.49
7.382ValArg: 7.382 ± 0.906
3.551ValSer: 3.551 ± 0.78
4.766ValThr: 4.766 ± 0.9
5.42ValVal: 5.42 ± 0.568
1.495ValTrp: 1.495 ± 0.296
0.934ValTyr: 0.934 ± 0.373
0.0ValXaa: 0.0 ± 0.0
Trp
2.149TrpAla: 2.149 ± 0.386
0.28TrpCys: 0.28 ± 0.162
0.934TrpAsp: 0.934 ± 0.313
1.215TrpGlu: 1.215 ± 0.349
0.093TrpPhe: 0.093 ± 0.088
1.589TrpGly: 1.589 ± 0.421
0.561TrpHis: 0.561 ± 0.198
1.028TrpIle: 1.028 ± 0.226
0.374TrpLys: 0.374 ± 0.15
1.776TrpLeu: 1.776 ± 0.44
0.374TrpMet: 0.374 ± 0.158
0.934TrpAsn: 0.934 ± 0.449
1.121TrpPro: 1.121 ± 0.282
0.934TrpGln: 0.934 ± 0.317
1.402TrpArg: 1.402 ± 0.309
1.028TrpSer: 1.028 ± 0.321
0.748TrpThr: 0.748 ± 0.292
1.495TrpVal: 1.495 ± 0.353
0.374TrpTrp: 0.374 ± 0.156
0.374TrpTyr: 0.374 ± 0.159
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.458TyrAla: 3.458 ± 0.614
0.467TyrCys: 0.467 ± 0.171
1.495TyrAsp: 1.495 ± 0.323
0.934TyrGlu: 0.934 ± 0.241
0.28TyrPhe: 0.28 ± 0.134
2.523TyrGly: 2.523 ± 0.424
0.561TyrHis: 0.561 ± 0.242
0.748TyrIle: 0.748 ± 0.255
0.654TyrLys: 0.654 ± 0.292
1.028TyrLeu: 1.028 ± 0.361
0.467TyrMet: 0.467 ± 0.192
0.374TyrAsn: 0.374 ± 0.194
1.121TyrPro: 1.121 ± 0.321
0.934TyrGln: 0.934 ± 0.272
2.897TyrArg: 2.897 ± 0.441
0.654TyrSer: 0.654 ± 0.291
1.402TyrThr: 1.402 ± 0.383
1.308TyrVal: 1.308 ± 0.342
0.654TyrTrp: 0.654 ± 0.16
0.654TyrTyr: 0.654 ± 0.237
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (10702 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
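A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is an illustration under assumptions, not the database's implementation: the function names are hypothetical, whole proteins are resampled with replacement, and the error is taken to be the standard deviation across replicates, which the note above does not confirm.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide (one-letter code) in a set of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Spread of the frequency over resampled proteomes (assumed: standard deviation)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input, not real phage proteins.
toy_proteome = ["MAAL", "AAG", "GGA", "MKLV"]
print(bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.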

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski