Amino acid dipepetide frequency for Streptomyces phage Olicious

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.347AlaAla: 11.347 ± 1.293
0.751AlaCys: 0.751 ± 0.226
5.185AlaAsp: 5.185 ± 0.77
6.312AlaGlu: 6.312 ± 0.781
3.006AlaPhe: 3.006 ± 0.424
9.844AlaGly: 9.844 ± 0.943
1.353AlaHis: 1.353 ± 0.324
3.382AlaIle: 3.382 ± 0.542
5.786AlaLys: 5.786 ± 0.776
7.44AlaLeu: 7.44 ± 0.742
2.104AlaMet: 2.104 ± 0.424
3.983AlaAsn: 3.983 ± 0.45
3.833AlaPro: 3.833 ± 0.733
5.336AlaGln: 5.336 ± 0.921
4.584AlaArg: 4.584 ± 0.67
6.087AlaSer: 6.087 ± 0.657
6.839AlaThr: 6.839 ± 0.834
7.289AlaVal: 7.289 ± 0.935
2.104AlaTrp: 2.104 ± 0.499
3.081AlaTyr: 3.081 ± 0.479
0.0AlaXaa: 0.0 ± 0.0
Cys
0.601CysAla: 0.601 ± 0.256
0.0CysCys: 0.0 ± 0.0
0.301CysAsp: 0.301 ± 0.181
0.451CysGlu: 0.451 ± 0.184
0.15CysPhe: 0.15 ± 0.111
0.451CysGly: 0.451 ± 0.253
0.075CysHis: 0.075 ± 0.069
0.451CysIle: 0.451 ± 0.197
0.526CysLys: 0.526 ± 0.215
0.827CysLeu: 0.827 ± 0.235
0.225CysMet: 0.225 ± 0.161
0.15CysAsn: 0.15 ± 0.103
0.526CysPro: 0.526 ± 0.226
0.301CysGln: 0.301 ± 0.186
0.376CysArg: 0.376 ± 0.161
0.676CysSer: 0.676 ± 0.247
0.526CysThr: 0.526 ± 0.307
0.676CysVal: 0.676 ± 0.244
0.225CysTrp: 0.225 ± 0.137
0.225CysTyr: 0.225 ± 0.125
0.0CysXaa: 0.0 ± 0.0
Asp
6.237AspAla: 6.237 ± 0.701
0.225AspCys: 0.225 ± 0.119
3.983AspAsp: 3.983 ± 0.606
3.908AspGlu: 3.908 ± 0.572
2.931AspPhe: 2.931 ± 0.376
5.185AspGly: 5.185 ± 0.71
1.052AspHis: 1.052 ± 0.265
3.607AspIle: 3.607 ± 0.574
2.78AspLys: 2.78 ± 0.539
4.434AspLeu: 4.434 ± 0.653
2.179AspMet: 2.179 ± 0.445
2.254AspAsn: 2.254 ± 0.505
4.659AspPro: 4.659 ± 0.601
2.33AspGln: 2.33 ± 0.529
3.307AspArg: 3.307 ± 0.541
3.307AspSer: 3.307 ± 0.492
2.931AspThr: 2.931 ± 0.492
4.734AspVal: 4.734 ± 0.683
1.052AspTrp: 1.052 ± 0.239
2.63AspTyr: 2.63 ± 0.51
0.0AspXaa: 0.0 ± 0.0
Glu
5.561GluAla: 5.561 ± 0.668
0.601GluCys: 0.601 ± 0.272
4.058GluAsp: 4.058 ± 0.624
4.283GluGlu: 4.283 ± 0.622
2.104GluPhe: 2.104 ± 0.416
4.659GluGly: 4.659 ± 0.579
1.278GluHis: 1.278 ± 0.375
3.006GluIle: 3.006 ± 0.45
3.382GluLys: 3.382 ± 0.612
5.11GluLeu: 5.11 ± 0.697
1.353GluMet: 1.353 ± 0.319
2.555GluAsn: 2.555 ± 0.484
1.503GluPro: 1.503 ± 0.367
2.705GluGln: 2.705 ± 0.44
2.856GluArg: 2.856 ± 0.387
2.705GluSer: 2.705 ± 0.37
2.705GluThr: 2.705 ± 0.42
5.035GluVal: 5.035 ± 0.797
0.977GluTrp: 0.977 ± 0.279
2.705GluTyr: 2.705 ± 0.431
0.0GluXaa: 0.0 ± 0.0
Phe
3.006PheAla: 3.006 ± 0.466
0.301PheCys: 0.301 ± 0.144
2.931PheAsp: 2.931 ± 0.487
2.63PheGlu: 2.63 ± 0.49
0.977PhePhe: 0.977 ± 0.193
3.532PheGly: 3.532 ± 0.526
0.827PheHis: 0.827 ± 0.242
0.977PheIle: 0.977 ± 0.256
1.428PheLys: 1.428 ± 0.346
1.728PheLeu: 1.728 ± 0.359
0.902PheMet: 0.902 ± 0.252
1.879PheAsn: 1.879 ± 0.297
1.278PhePro: 1.278 ± 0.414
2.029PheGln: 2.029 ± 0.375
1.728PheArg: 1.728 ± 0.385
2.179PheSer: 2.179 ± 0.388
2.33PheThr: 2.33 ± 0.434
2.179PheVal: 2.179 ± 0.426
0.451PheTrp: 0.451 ± 0.246
1.278PheTyr: 1.278 ± 0.359
0.0PheXaa: 0.0 ± 0.0
Gly
9.018GlyAla: 9.018 ± 0.909
0.751GlyCys: 0.751 ± 0.25
5.636GlyAsp: 5.636 ± 0.519
5.862GlyGlu: 5.862 ± 0.658
3.081GlyPhe: 3.081 ± 0.401
8.868GlyGly: 8.868 ± 0.994
1.428GlyHis: 1.428 ± 0.413
4.133GlyIle: 4.133 ± 0.668
5.411GlyLys: 5.411 ± 0.631
5.561GlyLeu: 5.561 ± 0.686
3.231GlyMet: 3.231 ± 0.498
3.231GlyAsn: 3.231 ± 0.553
2.63GlyPro: 2.63 ± 0.429
4.283GlyGln: 4.283 ± 0.67
4.133GlyArg: 4.133 ± 0.499
5.937GlySer: 5.937 ± 0.652
5.937GlyThr: 5.937 ± 0.681
5.636GlyVal: 5.636 ± 0.814
1.653GlyTrp: 1.653 ± 0.274
3.307GlyTyr: 3.307 ± 0.481
0.0GlyXaa: 0.0 ± 0.0
His
1.353HisAla: 1.353 ± 0.367
0.225HisCys: 0.225 ± 0.132
1.202HisAsp: 1.202 ± 0.324
1.127HisGlu: 1.127 ± 0.296
0.676HisPhe: 0.676 ± 0.203
1.804HisGly: 1.804 ± 0.376
0.376HisHis: 0.376 ± 0.162
1.202HisIle: 1.202 ± 0.309
0.827HisLys: 0.827 ± 0.263
1.052HisLeu: 1.052 ± 0.281
0.451HisMet: 0.451 ± 0.148
0.376HisAsn: 0.376 ± 0.18
0.376HisPro: 0.376 ± 0.2
0.451HisGln: 0.451 ± 0.18
0.827HisArg: 0.827 ± 0.24
1.127HisSer: 1.127 ± 0.292
0.751HisThr: 0.751 ± 0.292
0.751HisVal: 0.751 ± 0.213
0.301HisTrp: 0.301 ± 0.177
0.676HisTyr: 0.676 ± 0.228
0.0HisXaa: 0.0 ± 0.0
Ile
3.457IleAla: 3.457 ± 0.449
0.301IleCys: 0.301 ± 0.147
3.006IleAsp: 3.006 ± 0.448
3.382IleGlu: 3.382 ± 0.528
1.353IlePhe: 1.353 ± 0.308
2.856IleGly: 2.856 ± 0.449
0.977IleHis: 0.977 ± 0.293
1.728IleIle: 1.728 ± 0.339
3.757IleLys: 3.757 ± 0.567
2.931IleLeu: 2.931 ± 0.481
1.127IleMet: 1.127 ± 0.346
1.728IleAsn: 1.728 ± 0.425
1.804IlePro: 1.804 ± 0.352
1.278IleGln: 1.278 ± 0.349
2.254IleArg: 2.254 ± 0.489
2.33IleSer: 2.33 ± 0.529
3.081IleThr: 3.081 ± 0.489
3.081IleVal: 3.081 ± 0.427
0.376IleTrp: 0.376 ± 0.164
1.804IleTyr: 1.804 ± 0.362
0.0IleXaa: 0.0 ± 0.0
Lys
5.711LysAla: 5.711 ± 0.662
0.376LysCys: 0.376 ± 0.152
4.283LysAsp: 4.283 ± 0.701
2.555LysGlu: 2.555 ± 0.454
1.879LysPhe: 1.879 ± 0.343
3.757LysGly: 3.757 ± 0.62
1.428LysHis: 1.428 ± 0.348
1.954LysIle: 1.954 ± 0.452
2.931LysLys: 2.931 ± 0.611
4.359LysLeu: 4.359 ± 0.693
1.728LysMet: 1.728 ± 0.345
2.705LysAsn: 2.705 ± 0.536
2.555LysPro: 2.555 ± 0.519
2.555LysGln: 2.555 ± 0.456
2.931LysArg: 2.931 ± 0.502
3.006LysSer: 3.006 ± 0.517
4.058LysThr: 4.058 ± 0.457
4.434LysVal: 4.434 ± 0.846
1.127LysTrp: 1.127 ± 0.241
2.33LysTyr: 2.33 ± 0.58
0.0LysXaa: 0.0 ± 0.0
Leu
6.312LeuAla: 6.312 ± 0.741
0.15LeuCys: 0.15 ± 0.108
3.833LeuAsp: 3.833 ± 0.573
4.208LeuGlu: 4.208 ± 0.496
2.705LeuPhe: 2.705 ± 0.443
6.688LeuGly: 6.688 ± 0.602
0.676LeuHis: 0.676 ± 0.255
3.382LeuIle: 3.382 ± 0.611
4.058LeuLys: 4.058 ± 0.7
3.682LeuLeu: 3.682 ± 0.486
1.804LeuMet: 1.804 ± 0.309
2.254LeuAsn: 2.254 ± 0.454
3.081LeuPro: 3.081 ± 0.453
3.983LeuGln: 3.983 ± 0.55
4.434LeuArg: 4.434 ± 0.523
4.584LeuSer: 4.584 ± 0.447
4.659LeuThr: 4.659 ± 0.496
5.336LeuVal: 5.336 ± 0.66
0.977LeuTrp: 0.977 ± 0.356
1.804LeuTyr: 1.804 ± 0.367
0.0LeuXaa: 0.0 ± 0.0
Met
4.584MetAla: 4.584 ± 0.624
0.301MetCys: 0.301 ± 0.171
2.104MetAsp: 2.104 ± 0.455
1.653MetGlu: 1.653 ± 0.392
0.902MetPhe: 0.902 ± 0.236
2.705MetGly: 2.705 ± 0.511
0.15MetHis: 0.15 ± 0.101
1.127MetIle: 1.127 ± 0.283
1.428MetLys: 1.428 ± 0.391
1.353MetLeu: 1.353 ± 0.338
0.526MetMet: 0.526 ± 0.214
1.278MetAsn: 1.278 ± 0.371
1.127MetPro: 1.127 ± 0.286
1.127MetGln: 1.127 ± 0.309
1.879MetArg: 1.879 ± 0.436
2.405MetSer: 2.405 ± 0.387
1.578MetThr: 1.578 ± 0.386
1.503MetVal: 1.503 ± 0.344
0.225MetTrp: 0.225 ± 0.116
0.601MetTyr: 0.601 ± 0.182
0.0MetXaa: 0.0 ± 0.0
Asn
3.908AsnAla: 3.908 ± 0.455
0.376AsnCys: 0.376 ± 0.19
1.578AsnAsp: 1.578 ± 0.448
2.48AsnGlu: 2.48 ± 0.344
1.428AsnPhe: 1.428 ± 0.3
3.307AsnGly: 3.307 ± 0.538
0.601AsnHis: 0.601 ± 0.144
1.278AsnIle: 1.278 ± 0.335
2.029AsnLys: 2.029 ± 0.308
2.705AsnLeu: 2.705 ± 0.433
0.902AsnMet: 0.902 ± 0.291
1.578AsnAsn: 1.578 ± 0.343
2.856AsnPro: 2.856 ± 0.486
1.578AsnGln: 1.578 ± 0.353
1.804AsnArg: 1.804 ± 0.377
2.78AsnSer: 2.78 ± 0.577
2.705AsnThr: 2.705 ± 0.469
2.856AsnVal: 2.856 ± 0.568
0.977AsnTrp: 0.977 ± 0.317
1.653AsnTyr: 1.653 ± 0.307
0.0AsnXaa: 0.0 ± 0.0
Pro
4.434ProAla: 4.434 ± 0.71
0.526ProCys: 0.526 ± 0.259
2.705ProAsp: 2.705 ± 0.415
2.931ProGlu: 2.931 ± 0.466
1.879ProPhe: 1.879 ± 0.382
4.133ProGly: 4.133 ± 0.761
0.601ProHis: 0.601 ± 0.205
1.578ProIle: 1.578 ± 0.292
2.104ProLys: 2.104 ± 0.488
2.705ProLeu: 2.705 ± 0.378
1.127ProMet: 1.127 ± 0.262
1.954ProAsn: 1.954 ± 0.303
1.202ProPro: 1.202 ± 0.424
1.503ProGln: 1.503 ± 0.32
2.254ProArg: 2.254 ± 0.462
2.104ProSer: 2.104 ± 0.375
3.231ProThr: 3.231 ± 0.662
3.081ProVal: 3.081 ± 0.438
0.601ProTrp: 0.601 ± 0.253
1.503ProTyr: 1.503 ± 0.274
0.0ProXaa: 0.0 ± 0.0
Gln
5.862GlnAla: 5.862 ± 0.78
0.225GlnCys: 0.225 ± 0.186
3.231GlnAsp: 3.231 ± 0.531
2.029GlnGlu: 2.029 ± 0.436
1.428GlnPhe: 1.428 ± 0.295
3.382GlnGly: 3.382 ± 0.593
0.526GlnHis: 0.526 ± 0.186
2.179GlnIle: 2.179 ± 0.331
2.179GlnLys: 2.179 ± 0.39
3.833GlnLeu: 3.833 ± 0.599
1.578GlnMet: 1.578 ± 0.292
1.428GlnAsn: 1.428 ± 0.389
1.728GlnPro: 1.728 ± 0.423
2.33GlnGln: 2.33 ± 0.456
3.231GlnArg: 3.231 ± 0.613
3.607GlnSer: 3.607 ± 0.519
2.78GlnThr: 2.78 ± 0.375
2.931GlnVal: 2.931 ± 0.434
0.676GlnTrp: 0.676 ± 0.209
1.578GlnTyr: 1.578 ± 0.388
0.0GlnXaa: 0.0 ± 0.0
Arg
5.035ArgAla: 5.035 ± 0.633
0.601ArgCys: 0.601 ± 0.214
3.833ArgAsp: 3.833 ± 0.471
3.081ArgGlu: 3.081 ± 0.506
1.353ArgPhe: 1.353 ± 0.362
4.058ArgGly: 4.058 ± 0.517
0.601ArgHis: 0.601 ± 0.213
2.254ArgIle: 2.254 ± 0.352
2.78ArgLys: 2.78 ± 0.551
3.307ArgLeu: 3.307 ± 0.678
1.353ArgMet: 1.353 ± 0.231
1.879ArgAsn: 1.879 ± 0.422
1.202ArgPro: 1.202 ± 0.262
2.78ArgGln: 2.78 ± 0.499
2.33ArgArg: 2.33 ± 0.388
3.081ArgSer: 3.081 ± 0.485
3.156ArgThr: 3.156 ± 0.442
4.734ArgVal: 4.734 ± 0.672
0.601ArgTrp: 0.601 ± 0.176
1.428ArgTyr: 1.428 ± 0.323
0.0ArgXaa: 0.0 ± 0.0
Ser
6.463SerAla: 6.463 ± 0.716
0.451SerCys: 0.451 ± 0.191
4.283SerAsp: 4.283 ± 0.581
2.856SerGlu: 2.856 ± 0.518
2.104SerPhe: 2.104 ± 0.447
6.914SerGly: 6.914 ± 1.064
1.127SerHis: 1.127 ± 0.314
2.78SerIle: 2.78 ± 0.452
2.63SerLys: 2.63 ± 0.469
4.208SerLeu: 4.208 ± 0.471
2.179SerMet: 2.179 ± 0.462
2.48SerAsn: 2.48 ± 0.435
2.555SerPro: 2.555 ± 0.355
2.48SerGln: 2.48 ± 0.392
2.705SerArg: 2.705 ± 0.381
4.96SerSer: 4.96 ± 0.872
4.208SerThr: 4.208 ± 0.507
4.509SerVal: 4.509 ± 0.638
1.353SerTrp: 1.353 ± 0.279
2.33SerTyr: 2.33 ± 0.421
0.0SerXaa: 0.0 ± 0.0
Thr
5.937ThrAla: 5.937 ± 0.765
0.676ThrCys: 0.676 ± 0.233
3.457ThrAsp: 3.457 ± 0.526
3.231ThrGlu: 3.231 ± 0.412
2.555ThrPhe: 2.555 ± 0.39
7.515ThrGly: 7.515 ± 0.998
0.827ThrHis: 0.827 ± 0.354
3.081ThrIle: 3.081 ± 0.633
4.434ThrLys: 4.434 ± 0.757
4.96ThrLeu: 4.96 ± 0.63
1.954ThrMet: 1.954 ± 0.372
2.555ThrAsn: 2.555 ± 0.386
4.434ThrPro: 4.434 ± 0.589
3.231ThrGln: 3.231 ± 0.583
1.653ThrArg: 1.653 ± 0.335
3.156ThrSer: 3.156 ± 0.624
5.11ThrThr: 5.11 ± 0.797
4.509ThrVal: 4.509 ± 0.712
1.428ThrTrp: 1.428 ± 0.251
2.555ThrTyr: 2.555 ± 0.517
0.0ThrXaa: 0.0 ± 0.0
Val
6.012ValAla: 6.012 ± 0.872
0.526ValCys: 0.526 ± 0.228
4.885ValAsp: 4.885 ± 0.654
3.156ValGlu: 3.156 ± 0.579
2.179ValPhe: 2.179 ± 0.386
5.636ValGly: 5.636 ± 0.715
1.653ValHis: 1.653 ± 0.291
2.63ValIle: 2.63 ± 0.484
5.035ValLys: 5.035 ± 0.665
4.96ValLeu: 4.96 ± 0.462
2.555ValMet: 2.555 ± 0.459
3.006ValAsn: 3.006 ± 0.507
3.006ValPro: 3.006 ± 0.591
3.231ValGln: 3.231 ± 0.609
3.908ValArg: 3.908 ± 0.645
5.486ValSer: 5.486 ± 0.714
6.237ValThr: 6.237 ± 0.645
3.757ValVal: 3.757 ± 0.72
1.052ValTrp: 1.052 ± 0.279
2.555ValTyr: 2.555 ± 0.369
0.0ValXaa: 0.0 ± 0.0
Trp
1.353TrpAla: 1.353 ± 0.336
0.225TrpCys: 0.225 ± 0.124
1.127TrpAsp: 1.127 ± 0.283
0.977TrpGlu: 0.977 ± 0.244
0.751TrpPhe: 0.751 ± 0.243
1.127TrpGly: 1.127 ± 0.273
0.0TrpHis: 0.0 ± 0.0
0.526TrpIle: 0.526 ± 0.202
1.127TrpLys: 1.127 ± 0.33
0.751TrpLeu: 0.751 ± 0.247
0.225TrpMet: 0.225 ± 0.153
0.526TrpAsn: 0.526 ± 0.2
0.601TrpPro: 0.601 ± 0.25
0.902TrpGln: 0.902 ± 0.209
0.827TrpArg: 0.827 ± 0.246
1.728TrpSer: 1.728 ± 0.334
1.653TrpThr: 1.653 ± 0.355
1.653TrpVal: 1.653 ± 0.309
0.376TrpTrp: 0.376 ± 0.158
0.601TrpTyr: 0.601 ± 0.261
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.457TyrAla: 3.457 ± 0.504
0.225TyrCys: 0.225 ± 0.129
2.104TyrAsp: 2.104 ± 0.47
2.029TyrGlu: 2.029 ± 0.381
1.127TyrPhe: 1.127 ± 0.346
3.307TyrGly: 3.307 ± 0.523
0.376TyrHis: 0.376 ± 0.168
1.353TyrIle: 1.353 ± 0.311
2.029TyrLys: 2.029 ± 0.318
2.705TyrLeu: 2.705 ± 0.37
0.977TyrMet: 0.977 ± 0.273
1.578TyrAsn: 1.578 ± 0.357
1.202TyrPro: 1.202 ± 0.311
2.33TyrGln: 2.33 ± 0.418
1.428TyrArg: 1.428 ± 0.31
2.33TyrSer: 2.33 ± 0.42
2.78TyrThr: 2.78 ± 0.445
2.78TyrVal: 2.78 ± 0.449
0.526TyrTrp: 0.526 ± 0.217
1.353TyrTyr: 1.353 ± 0.253
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (13308 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski