Amino acid dipeptide frequency for Streptomyces phage HotFries

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent residues.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
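
As an illustration of the calculation described above, here is a minimal Python sketch that counts adjacent residue pairs and compares each frequency with the uniform expectation of 2.5‰. The function names `dipeptide_permille` and `classify`, the toy sequences, and the choice to normalize by the total number of dipeptides found within proteins are assumptions made for this example; the exact normalization used by the database is not stated on this page.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Any non-standard or unknown residue is mapped to 'X' (Xaa).
    Pairs are counted within each protein only, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_permille > expected:
        return "over"   # shown in red in the table
    if freq_permille < expected:
        return "under"  # shown in blue in the table
    return "expected"


# Hypothetical toy input, not taken from the HotFries proteome.
toy = ["MAAK", "AAXG"]
freqs = dipeptide_permille(toy)
for dp, f in sorted(freqs.items()):
    print(dp, round(f, 3), classify(f))
```

With real data, the input would be the list of protein sequences of the proteome; values from the table such as AlaAla (11.811‰) would be classified as "over" and AlaCys (0.22‰) as "under".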

All values are given in per mille (‰); multiply by 10⁻³ to convert them to fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.811AlaAla: 11.811 ± 1.815
0.22AlaCys: 0.22 ± 0.142
5.282AlaAsp: 5.282 ± 0.632
5.942AlaGlu: 5.942 ± 0.706
2.934AlaPhe: 2.934 ± 0.454
7.923AlaGly: 7.923 ± 1.463
1.1AlaHis: 1.1 ± 0.271
6.016AlaIle: 6.016 ± 1.027
7.263AlaLys: 7.263 ± 0.9
8.803AlaLeu: 8.803 ± 1.323
2.641AlaMet: 2.641 ± 0.415
3.301AlaAsn: 3.301 ± 0.42
2.934AlaPro: 2.934 ± 0.413
3.008AlaGln: 3.008 ± 0.497
4.915AlaArg: 4.915 ± 0.629
5.062AlaSer: 5.062 ± 0.575
6.016AlaThr: 6.016 ± 0.717
7.116AlaVal: 7.116 ± 0.9
1.467AlaTrp: 1.467 ± 0.37
3.155AlaTyr: 3.155 ± 0.454
0.0AlaXaa: 0.0 ± 0.0
Cys
0.22CysAla: 0.22 ± 0.172
0.0CysCys: 0.0 ± 0.0
0.293CysAsp: 0.293 ± 0.15
0.514CysGlu: 0.514 ± 0.197
0.293CysPhe: 0.293 ± 0.154
0.22CysGly: 0.22 ± 0.123
0.0CysHis: 0.0 ± 0.0
0.367CysIle: 0.367 ± 0.229
0.073CysLys: 0.073 ± 0.065
0.44CysLeu: 0.44 ± 0.218
0.147CysMet: 0.147 ± 0.105
0.293CysAsn: 0.293 ± 0.162
0.073CysPro: 0.073 ± 0.065
0.0CysGln: 0.0 ± 0.0
0.073CysArg: 0.073 ± 0.08
0.367CysSer: 0.367 ± 0.179
0.147CysThr: 0.147 ± 0.1
0.22CysVal: 0.22 ± 0.117
0.073CysTrp: 0.073 ± 0.065
0.147CysTyr: 0.147 ± 0.092
0.0CysXaa: 0.0 ± 0.0
Asp
6.603AspAla: 6.603 ± 0.59
0.147AspCys: 0.147 ± 0.09
4.475AspAsp: 4.475 ± 0.724
5.942AspGlu: 5.942 ± 0.991
3.668AspPhe: 3.668 ± 0.549
6.089AspGly: 6.089 ± 0.696
0.66AspHis: 0.66 ± 0.199
3.301AspIle: 3.301 ± 0.655
2.714AspLys: 2.714 ± 0.425
4.915AspLeu: 4.915 ± 0.642
1.541AspMet: 1.541 ± 0.315
2.861AspAsn: 2.861 ± 0.505
3.888AspPro: 3.888 ± 0.701
2.421AspGln: 2.421 ± 0.368
2.788AspArg: 2.788 ± 0.543
2.494AspSer: 2.494 ± 0.409
3.375AspThr: 3.375 ± 0.546
3.741AspVal: 3.741 ± 0.578
0.88AspTrp: 0.88 ± 0.278
1.614AspTyr: 1.614 ± 0.29
0.0AspXaa: 0.0 ± 0.0
Glu
6.456GluAla: 6.456 ± 0.871
0.367GluCys: 0.367 ± 0.17
3.595GluAsp: 3.595 ± 0.66
4.548GluGlu: 4.548 ± 0.589
2.788GluPhe: 2.788 ± 0.507
3.741GluGly: 3.741 ± 0.596
1.1GluHis: 1.1 ± 0.289
4.622GluIle: 4.622 ± 0.389
4.182GluLys: 4.182 ± 0.638
5.135GluLeu: 5.135 ± 0.727
1.687GluMet: 1.687 ± 0.291
2.128GluAsn: 2.128 ± 0.442
2.348GluPro: 2.348 ± 0.481
3.081GluGln: 3.081 ± 0.562
3.668GluArg: 3.668 ± 0.636
2.568GluSer: 2.568 ± 0.465
3.888GluThr: 3.888 ± 0.635
4.915GluVal: 4.915 ± 0.804
1.394GluTrp: 1.394 ± 0.304
2.054GluTyr: 2.054 ± 0.399
0.0GluXaa: 0.0 ± 0.0
Phe
3.008PheAla: 3.008 ± 0.405
0.293PheCys: 0.293 ± 0.163
3.301PheAsp: 3.301 ± 0.533
2.274PheGlu: 2.274 ± 0.515
1.467PhePhe: 1.467 ± 0.268
3.008PheGly: 3.008 ± 0.463
0.44PheHis: 0.44 ± 0.166
1.687PheIle: 1.687 ± 0.358
2.348PheLys: 2.348 ± 0.692
2.641PheLeu: 2.641 ± 0.432
1.174PheMet: 1.174 ± 0.27
1.761PheAsn: 1.761 ± 0.326
0.954PhePro: 0.954 ± 0.201
1.1PheGln: 1.1 ± 0.242
2.861PheArg: 2.861 ± 0.469
1.761PheSer: 1.761 ± 0.384
2.714PheThr: 2.714 ± 0.489
2.568PheVal: 2.568 ± 0.41
0.44PheTrp: 0.44 ± 0.223
1.321PheTyr: 1.321 ± 0.349
0.0PheXaa: 0.0 ± 0.0
Gly
7.483GlyAla: 7.483 ± 1.006
0.22GlyCys: 0.22 ± 0.129
4.915GlyAsp: 4.915 ± 0.545
4.989GlyGlu: 4.989 ± 0.657
3.008GlyPhe: 3.008 ± 0.48
5.502GlyGly: 5.502 ± 0.844
0.954GlyHis: 0.954 ± 0.265
4.769GlyIle: 4.769 ± 1.161
5.062GlyLys: 5.062 ± 0.854
6.529GlyLeu: 6.529 ± 1.428
2.128GlyMet: 2.128 ± 0.367
1.834GlyAsn: 1.834 ± 0.318
2.568GlyPro: 2.568 ± 0.522
1.394GlyGln: 1.394 ± 0.403
2.788GlyArg: 2.788 ± 0.334
4.548GlySer: 4.548 ± 0.851
6.383GlyThr: 6.383 ± 0.635
6.236GlyVal: 6.236 ± 0.578
1.761GlyTrp: 1.761 ± 0.359
2.861GlyTyr: 2.861 ± 0.584
0.0GlyXaa: 0.0 ± 0.0
His
0.807HisAla: 0.807 ± 0.35
0.073HisCys: 0.073 ± 0.069
0.807HisAsp: 0.807 ± 0.276
1.174HisGlu: 1.174 ± 0.269
0.66HisPhe: 0.66 ± 0.207
0.88HisGly: 0.88 ± 0.199
0.514HisHis: 0.514 ± 0.196
1.027HisIle: 1.027 ± 0.297
0.587HisLys: 0.587 ± 0.171
1.174HisLeu: 1.174 ± 0.299
0.734HisMet: 0.734 ± 0.186
1.1HisAsn: 1.1 ± 0.355
0.66HisPro: 0.66 ± 0.244
0.88HisGln: 0.88 ± 0.258
0.587HisArg: 0.587 ± 0.234
1.467HisSer: 1.467 ± 0.351
0.734HisThr: 0.734 ± 0.245
0.954HisVal: 0.954 ± 0.249
0.22HisTrp: 0.22 ± 0.179
0.954HisTyr: 0.954 ± 0.253
0.0HisXaa: 0.0 ± 0.0
Ile
5.282IleAla: 5.282 ± 0.736
0.22IleCys: 0.22 ± 0.117
4.769IleAsp: 4.769 ± 0.511
4.182IleGlu: 4.182 ± 0.699
1.687IlePhe: 1.687 ± 0.41
3.888IleGly: 3.888 ± 0.922
0.88IleHis: 0.88 ± 0.301
1.907IleIle: 1.907 ± 0.387
3.741IleLys: 3.741 ± 0.571
4.475IleLeu: 4.475 ± 0.722
0.954IleMet: 0.954 ± 0.254
1.981IleAsn: 1.981 ± 0.303
2.274IlePro: 2.274 ± 0.388
1.981IleGln: 1.981 ± 0.379
2.421IleArg: 2.421 ± 0.515
3.008IleSer: 3.008 ± 0.471
4.769IleThr: 4.769 ± 0.599
3.521IleVal: 3.521 ± 0.426
0.807IleTrp: 0.807 ± 0.3
1.541IleTyr: 1.541 ± 0.316
0.0IleXaa: 0.0 ± 0.0
Lys
6.309LysAla: 6.309 ± 0.692
0.0LysCys: 0.0 ± 0.0
3.448LysAsp: 3.448 ± 0.61
3.081LysGlu: 3.081 ± 0.613
1.761LysPhe: 1.761 ± 0.534
4.035LysGly: 4.035 ± 1.038
1.247LysHis: 1.247 ± 0.375
3.448LysIle: 3.448 ± 0.521
5.062LysLys: 5.062 ± 0.693
6.236LysLeu: 6.236 ± 0.778
1.687LysMet: 1.687 ± 0.429
2.494LysAsn: 2.494 ± 0.509
2.641LysPro: 2.641 ± 0.449
2.861LysGln: 2.861 ± 0.475
2.861LysArg: 2.861 ± 0.447
3.815LysSer: 3.815 ± 0.659
4.035LysThr: 4.035 ± 0.531
4.769LysVal: 4.769 ± 0.645
1.027LysTrp: 1.027 ± 0.294
2.128LysTyr: 2.128 ± 0.397
0.0LysXaa: 0.0 ± 0.0
Leu
8.803LeuAla: 8.803 ± 1.449
0.293LeuCys: 0.293 ± 0.148
6.016LeuAsp: 6.016 ± 0.577
4.182LeuGlu: 4.182 ± 0.658
2.568LeuPhe: 2.568 ± 0.436
6.383LeuGly: 6.383 ± 1.008
1.174LeuHis: 1.174 ± 0.291
3.741LeuIle: 3.741 ± 0.806
5.576LeuLys: 5.576 ± 0.895
4.989LeuLeu: 4.989 ± 0.778
1.761LeuMet: 1.761 ± 0.312
3.448LeuAsn: 3.448 ± 0.524
3.228LeuPro: 3.228 ± 0.369
2.421LeuGln: 2.421 ± 0.549
4.622LeuArg: 4.622 ± 0.653
6.529LeuSer: 6.529 ± 0.838
5.282LeuThr: 5.282 ± 0.73
6.162LeuVal: 6.162 ± 0.703
0.807LeuTrp: 0.807 ± 0.236
2.934LeuTyr: 2.934 ± 0.524
0.0LeuXaa: 0.0 ± 0.0
Met
1.981MetAla: 1.981 ± 0.466
0.293MetCys: 0.293 ± 0.155
1.761MetAsp: 1.761 ± 0.376
1.834MetGlu: 1.834 ± 0.415
0.807MetPhe: 0.807 ± 0.234
1.614MetGly: 1.614 ± 0.292
0.367MetHis: 0.367 ± 0.149
1.541MetIle: 1.541 ± 0.289
1.321MetLys: 1.321 ± 0.389
2.494MetLeu: 2.494 ± 0.438
0.66MetMet: 0.66 ± 0.257
1.321MetAsn: 1.321 ± 0.315
1.247MetPro: 1.247 ± 0.26
0.587MetGln: 0.587 ± 0.226
0.66MetArg: 0.66 ± 0.251
2.054MetSer: 2.054 ± 0.418
1.467MetThr: 1.467 ± 0.339
2.201MetVal: 2.201 ± 0.398
0.147MetTrp: 0.147 ± 0.111
0.954MetTyr: 0.954 ± 0.285
0.0MetXaa: 0.0 ± 0.0
Asn
3.962AsnAla: 3.962 ± 0.539
0.147AsnCys: 0.147 ± 0.117
1.907AsnAsp: 1.907 ± 0.355
2.128AsnGlu: 2.128 ± 0.36
1.247AsnPhe: 1.247 ± 0.283
3.228AsnGly: 3.228 ± 0.477
0.88AsnHis: 0.88 ± 0.292
1.834AsnIle: 1.834 ± 0.288
2.421AsnLys: 2.421 ± 0.372
2.714AsnLeu: 2.714 ± 0.445
0.88AsnMet: 0.88 ± 0.211
1.467AsnAsn: 1.467 ± 0.318
2.568AsnPro: 2.568 ± 0.483
1.321AsnGln: 1.321 ± 0.309
3.155AsnArg: 3.155 ± 0.536
2.861AsnSer: 2.861 ± 0.477
2.201AsnThr: 2.201 ± 0.329
2.714AsnVal: 2.714 ± 0.539
0.807AsnTrp: 0.807 ± 0.235
1.394AsnTyr: 1.394 ± 0.27
0.0AsnXaa: 0.0 ± 0.0
Pro
4.255ProAla: 4.255 ± 0.585
0.293ProCys: 0.293 ± 0.151
3.155ProAsp: 3.155 ± 0.649
2.861ProGlu: 2.861 ± 0.62
1.614ProPhe: 1.614 ± 0.278
2.934ProGly: 2.934 ± 0.402
0.367ProHis: 0.367 ± 0.188
2.201ProIle: 2.201 ± 0.435
2.714ProLys: 2.714 ± 0.385
2.788ProLeu: 2.788 ± 0.508
1.027ProMet: 1.027 ± 0.276
1.834ProAsn: 1.834 ± 0.331
1.687ProPro: 1.687 ± 0.419
0.88ProGln: 0.88 ± 0.261
1.907ProArg: 1.907 ± 0.447
2.421ProSer: 2.421 ± 0.478
3.741ProThr: 3.741 ± 0.459
3.008ProVal: 3.008 ± 0.526
0.44ProTrp: 0.44 ± 0.195
1.907ProTyr: 1.907 ± 0.412
0.0ProXaa: 0.0 ± 0.0
Gln
3.155GlnAla: 3.155 ± 0.64
0.22GlnCys: 0.22 ± 0.127
1.1GlnAsp: 1.1 ± 0.296
1.614GlnGlu: 1.614 ± 0.34
1.321GlnPhe: 1.321 ± 0.295
2.788GlnGly: 2.788 ± 0.444
1.1GlnHis: 1.1 ± 0.316
2.274GlnIle: 2.274 ± 0.345
1.247GlnLys: 1.247 ± 0.246
3.155GlnLeu: 3.155 ± 0.555
0.954GlnMet: 0.954 ± 0.216
1.321GlnAsn: 1.321 ± 0.276
1.394GlnPro: 1.394 ± 0.399
1.247GlnGln: 1.247 ± 0.23
2.201GlnArg: 2.201 ± 0.497
1.907GlnSer: 1.907 ± 0.435
2.054GlnThr: 2.054 ± 0.473
2.128GlnVal: 2.128 ± 0.383
0.514GlnTrp: 0.514 ± 0.224
1.467GlnTyr: 1.467 ± 0.278
0.0GlnXaa: 0.0 ± 0.0
Arg
3.815ArgAla: 3.815 ± 0.756
0.367ArgCys: 0.367 ± 0.18
3.228ArgAsp: 3.228 ± 0.523
3.301ArgGlu: 3.301 ± 0.454
1.761ArgPhe: 1.761 ± 0.43
2.494ArgGly: 2.494 ± 0.436
1.174ArgHis: 1.174 ± 0.292
2.348ArgIle: 2.348 ± 0.412
3.375ArgLys: 3.375 ± 0.375
4.769ArgLeu: 4.769 ± 0.869
1.761ArgMet: 1.761 ± 0.365
2.274ArgAsn: 2.274 ± 0.34
2.201ArgPro: 2.201 ± 0.449
2.861ArgGln: 2.861 ± 0.592
3.155ArgArg: 3.155 ± 0.68
2.714ArgSer: 2.714 ± 0.492
3.448ArgThr: 3.448 ± 0.648
3.815ArgVal: 3.815 ± 0.656
0.66ArgTrp: 0.66 ± 0.268
1.027ArgTyr: 1.027 ± 0.263
0.0ArgXaa: 0.0 ± 0.0
Ser
5.282SerAla: 5.282 ± 0.887
0.073SerCys: 0.073 ± 0.069
3.741SerAsp: 3.741 ± 0.484
3.008SerGlu: 3.008 ± 0.434
1.394SerPhe: 1.394 ± 0.262
5.502SerGly: 5.502 ± 0.761
0.954SerHis: 0.954 ± 0.279
2.568SerIle: 2.568 ± 0.353
3.301SerLys: 3.301 ± 0.474
4.915SerLeu: 4.915 ± 0.575
1.321SerMet: 1.321 ± 0.267
2.201SerAsn: 2.201 ± 0.363
2.861SerPro: 2.861 ± 0.427
1.907SerGln: 1.907 ± 0.391
2.641SerArg: 2.641 ± 0.468
3.521SerSer: 3.521 ± 0.65
4.989SerThr: 4.989 ± 0.703
5.062SerVal: 5.062 ± 0.618
1.174SerTrp: 1.174 ± 0.315
1.761SerTyr: 1.761 ± 0.369
0.0SerXaa: 0.0 ± 0.0
Thr
6.162ThrAla: 6.162 ± 0.774
0.147ThrCys: 0.147 ± 0.101
4.402ThrAsp: 4.402 ± 0.528
4.182ThrGlu: 4.182 ± 0.625
2.861ThrPhe: 2.861 ± 0.531
6.089ThrGly: 6.089 ± 0.73
1.027ThrHis: 1.027 ± 0.37
4.108ThrIle: 4.108 ± 0.695
4.108ThrLys: 4.108 ± 0.414
5.355ThrLeu: 5.355 ± 0.677
0.954ThrMet: 0.954 ± 0.255
2.348ThrAsn: 2.348 ± 0.354
3.815ThrPro: 3.815 ± 0.577
2.201ThrGln: 2.201 ± 0.459
3.155ThrArg: 3.155 ± 0.501
3.888ThrSer: 3.888 ± 0.729
6.016ThrThr: 6.016 ± 1.02
5.576ThrVal: 5.576 ± 0.636
1.1ThrTrp: 1.1 ± 0.275
3.081ThrTyr: 3.081 ± 0.516
0.0ThrXaa: 0.0 ± 0.0
Val
7.41ValAla: 7.41 ± 0.631
0.293ValCys: 0.293 ± 0.118
4.695ValAsp: 4.695 ± 0.694
5.355ValGlu: 5.355 ± 0.592
2.494ValPhe: 2.494 ± 0.385
5.942ValGly: 5.942 ± 0.677
1.027ValHis: 1.027 ± 0.401
4.255ValIle: 4.255 ± 0.758
4.255ValLys: 4.255 ± 0.466
6.162ValLeu: 6.162 ± 0.754
1.907ValMet: 1.907 ± 0.418
2.714ValAsn: 2.714 ± 0.466
3.008ValPro: 3.008 ± 0.473
1.614ValGln: 1.614 ± 0.46
4.328ValArg: 4.328 ± 0.652
3.741ValSer: 3.741 ± 0.397
5.576ValThr: 5.576 ± 0.698
4.769ValVal: 4.769 ± 0.66
1.467ValTrp: 1.467 ± 0.328
2.714ValTyr: 2.714 ± 0.552
0.0ValXaa: 0.0 ± 0.0
Trp
1.027TrpAla: 1.027 ± 0.266
0.073TrpCys: 0.073 ± 0.065
0.88TrpAsp: 0.88 ± 0.3
1.321TrpGlu: 1.321 ± 0.357
0.807TrpPhe: 0.807 ± 0.235
1.321TrpGly: 1.321 ± 0.388
0.293TrpHis: 0.293 ± 0.148
0.734TrpIle: 0.734 ± 0.259
0.734TrpLys: 0.734 ± 0.225
0.954TrpLeu: 0.954 ± 0.291
0.514TrpMet: 0.514 ± 0.217
1.321TrpAsn: 1.321 ± 0.383
0.44TrpPro: 0.44 ± 0.169
0.293TrpGln: 0.293 ± 0.125
0.367TrpArg: 0.367 ± 0.209
1.614TrpSer: 1.614 ± 0.401
1.394TrpThr: 1.394 ± 0.409
1.174TrpVal: 1.174 ± 0.353
0.293TrpTrp: 0.293 ± 0.123
0.293TrpTyr: 0.293 ± 0.139
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.081TyrAla: 3.081 ± 0.874
0.22TyrCys: 0.22 ± 0.145
2.348TyrAsp: 2.348 ± 0.475
1.907TyrGlu: 1.907 ± 0.498
1.981TyrPhe: 1.981 ± 0.412
2.348TyrGly: 2.348 ± 0.442
0.734TyrHis: 0.734 ± 0.278
1.541TyrIle: 1.541 ± 0.391
2.861TyrLys: 2.861 ± 0.492
2.274TyrLeu: 2.274 ± 0.411
0.807TyrMet: 0.807 ± 0.218
1.834TyrAsn: 1.834 ± 0.291
1.247TyrPro: 1.247 ± 0.361
1.1TyrGln: 1.1 ± 0.256
1.394TyrArg: 1.394 ± 0.3
1.834TyrSer: 1.834 ± 0.36
2.348TyrThr: 2.348 ± 0.354
3.081TyrVal: 3.081 ± 0.4
0.367TyrTrp: 0.367 ± 0.173
1.027TyrTyr: 1.027 ± 0.41
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (13632 amino acids)
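
The table cells above follow a regular pattern (displayed value, three-letter dipeptide name, mean ± error), so they can be read programmatically. The following is a small sketch of one way to parse such a line in Python; the regular expression and the function name `parse_cell` are assumptions made for this example and are not part of Proteome-pI.

```python
import re

# One cell looks like: "11.811AlaAla: 11.811 ± 1.815"
# (displayed value, dipeptide in three-letter codes, mean ± error, all in per mille)
CELL = re.compile(
    r"^(?P<shown>\d+(?:\.\d+)?)"
    r"(?P<pair>[A-Z][a-z]{2}[A-Z][a-z]{2}): "
    r"(?P<mean>\d+(?:\.\d+)?) ± (?P<err>\d+(?:\.\d+)?)$"
)


def parse_cell(line):
    """Return (first, second, mean_permille, error_permille), or None if the
    line is not a data cell (for example a row label such as 'Ala')."""
    m = CELL.match(line.strip())
    if m is None:
        return None
    pair = m.group("pair")
    return pair[:3], pair[3:], float(m.group("mean")), float(m.group("err"))


first, second, mean, err = parse_cell("11.811AlaAla: 11.811 ± 1.815")
fraction = mean * 1e-3  # per mille to fraction
print(first, second, mean, err, fraction)
```

Row labels and the header line do not match the pattern and yield `None`, so a whole table can be filtered by keeping only the non-`None` results.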

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
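
To make the idea of a protein-level bootstrap concrete, here is a hedged Python sketch: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the replicates is taken as the error. Reporting the sample standard deviation, the function names, the fixed seed, and the toy sequences are all assumptions for illustration; the page does not specify which statistic the database reports.

```python
import random
import statistics


def dipeptide_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide, pooled over the given proteins."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Each resample draws len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(replicates)


# Hypothetical toy proteome, not taken from the HotFries data.
toy = ["MAAK", "AAGA", "MKLV", "GAAA"]
print(dipeptide_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.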

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski