Amino acid dipepetide frequency for Streptomyces phage Caelum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.346AlaAla: 12.346 ± 1.15
0.52AlaCys: 0.52 ± 0.223
7.602AlaAsp: 7.602 ± 0.636
9.227AlaGlu: 9.227 ± 1.0
3.379AlaPhe: 3.379 ± 0.542
10.071AlaGly: 10.071 ± 0.952
1.754AlaHis: 1.754 ± 0.36
4.873AlaIle: 4.873 ± 0.457
6.368AlaLys: 6.368 ± 0.872
12.151AlaLeu: 12.151 ± 1.162
2.339AlaMet: 2.339 ± 0.355
3.639AlaAsn: 3.639 ± 0.552
4.938AlaPro: 4.938 ± 0.597
3.314AlaGln: 3.314 ± 0.421
7.927AlaArg: 7.927 ± 0.825
6.823AlaSer: 6.823 ± 0.837
6.888AlaThr: 6.888 ± 0.693
7.732AlaVal: 7.732 ± 0.586
1.689AlaTrp: 1.689 ± 0.336
3.314AlaTyr: 3.314 ± 0.475
0.0AlaXaa: 0.0 ± 0.0
Cys
0.845CysAla: 0.845 ± 0.266
0.065CysCys: 0.065 ± 0.058
0.65CysAsp: 0.65 ± 0.206
0.585CysGlu: 0.585 ± 0.253
0.13CysPhe: 0.13 ± 0.096
0.715CysGly: 0.715 ± 0.208
0.39CysHis: 0.39 ± 0.162
0.325CysIle: 0.325 ± 0.157
0.325CysLys: 0.325 ± 0.199
0.325CysLeu: 0.325 ± 0.155
0.13CysMet: 0.13 ± 0.098
0.13CysAsn: 0.13 ± 0.097
0.39CysPro: 0.39 ± 0.169
0.195CysGln: 0.195 ± 0.117
0.195CysArg: 0.195 ± 0.108
0.845CysSer: 0.845 ± 0.226
0.39CysThr: 0.39 ± 0.181
0.39CysVal: 0.39 ± 0.198
0.26CysTrp: 0.26 ± 0.131
0.455CysTyr: 0.455 ± 0.165
0.0CysXaa: 0.0 ± 0.0
Asp
7.147AspAla: 7.147 ± 0.675
0.52AspCys: 0.52 ± 0.218
4.483AspAsp: 4.483 ± 0.506
4.678AspGlu: 4.678 ± 0.796
1.624AspPhe: 1.624 ± 0.361
6.368AspGly: 6.368 ± 0.691
1.494AspHis: 1.494 ± 0.328
2.664AspIle: 2.664 ± 0.383
2.079AspLys: 2.079 ± 0.399
5.978AspLeu: 5.978 ± 0.554
1.754AspMet: 1.754 ± 0.326
1.494AspAsn: 1.494 ± 0.263
3.509AspPro: 3.509 ± 0.546
2.079AspGln: 2.079 ± 0.375
3.119AspArg: 3.119 ± 0.563
4.548AspSer: 4.548 ± 0.493
3.639AspThr: 3.639 ± 0.366
4.029AspVal: 4.029 ± 0.41
1.429AspTrp: 1.429 ± 0.288
1.235AspTyr: 1.235 ± 0.3
0.0AspXaa: 0.0 ± 0.0
Glu
8.447GluAla: 8.447 ± 1.025
0.78GluCys: 0.78 ± 0.304
3.509GluAsp: 3.509 ± 0.442
5.263GluGlu: 5.263 ± 0.862
1.494GluPhe: 1.494 ± 0.318
5.913GluGly: 5.913 ± 0.642
1.429GluHis: 1.429 ± 0.361
3.119GluIle: 3.119 ± 0.423
2.339GluLys: 2.339 ± 0.46
7.667GluLeu: 7.667 ± 0.943
1.3GluMet: 1.3 ± 0.246
1.235GluAsn: 1.235 ± 0.259
3.054GluPro: 3.054 ± 0.538
2.924GluGln: 2.924 ± 0.464
3.444GluArg: 3.444 ± 0.54
2.859GluSer: 2.859 ± 0.543
3.379GluThr: 3.379 ± 0.567
4.873GluVal: 4.873 ± 0.598
1.429GluTrp: 1.429 ± 0.304
2.144GluTyr: 2.144 ± 0.418
0.0GluXaa: 0.0 ± 0.0
Phe
3.769PheAla: 3.769 ± 0.518
0.26PheCys: 0.26 ± 0.123
2.079PheAsp: 2.079 ± 0.311
1.949PheGlu: 1.949 ± 0.339
0.78PhePhe: 0.78 ± 0.227
3.184PheGly: 3.184 ± 0.368
0.845PheHis: 0.845 ± 0.239
1.04PheIle: 1.04 ± 0.297
1.105PheLys: 1.105 ± 0.344
1.949PheLeu: 1.949 ± 0.295
0.585PheMet: 0.585 ± 0.23
1.235PheAsn: 1.235 ± 0.297
1.04PhePro: 1.04 ± 0.278
1.3PheGln: 1.3 ± 0.355
1.689PheArg: 1.689 ± 0.282
1.429PheSer: 1.429 ± 0.283
1.949PheThr: 1.949 ± 0.36
2.144PheVal: 2.144 ± 0.381
0.455PheTrp: 0.455 ± 0.161
0.52PheTyr: 0.52 ± 0.241
0.0PheXaa: 0.0 ± 0.0
Gly
8.187GlyAla: 8.187 ± 1.04
0.455GlyCys: 0.455 ± 0.183
4.873GlyAsp: 4.873 ± 0.605
4.548GlyGlu: 4.548 ± 0.471
3.119GlyPhe: 3.119 ± 0.503
5.718GlyGly: 5.718 ± 0.794
2.339GlyHis: 2.339 ± 0.369
3.444GlyIle: 3.444 ± 0.658
4.483GlyLys: 4.483 ± 0.628
7.927GlyLeu: 7.927 ± 0.721
1.429GlyMet: 1.429 ± 0.276
2.079GlyAsn: 2.079 ± 0.33
3.509GlyPro: 3.509 ± 0.462
2.859GlyGln: 2.859 ± 0.45
5.393GlyArg: 5.393 ± 0.714
5.783GlySer: 5.783 ± 0.843
5.848GlyThr: 5.848 ± 0.78
6.953GlyVal: 6.953 ± 0.538
2.274GlyTrp: 2.274 ± 0.369
2.404GlyTyr: 2.404 ± 0.417
0.0GlyXaa: 0.0 ± 0.0
His
2.014HisAla: 2.014 ± 0.371
0.195HisCys: 0.195 ± 0.105
1.365HisAsp: 1.365 ± 0.314
1.105HisGlu: 1.105 ± 0.354
1.04HisPhe: 1.04 ± 0.255
1.105HisGly: 1.105 ± 0.268
0.975HisHis: 0.975 ± 0.311
0.975HisIle: 0.975 ± 0.304
0.585HisLys: 0.585 ± 0.167
1.494HisLeu: 1.494 ± 0.396
0.65HisMet: 0.65 ± 0.169
0.585HisAsn: 0.585 ± 0.18
0.845HisPro: 0.845 ± 0.224
0.715HisGln: 0.715 ± 0.291
1.3HisArg: 1.3 ± 0.415
1.235HisSer: 1.235 ± 0.319
1.429HisThr: 1.429 ± 0.334
1.559HisVal: 1.559 ± 0.468
0.975HisTrp: 0.975 ± 0.265
0.585HisTyr: 0.585 ± 0.206
0.0HisXaa: 0.0 ± 0.0
Ile
5.068IleAla: 5.068 ± 0.713
0.195IleCys: 0.195 ± 0.124
3.509IleAsp: 3.509 ± 0.556
3.119IleGlu: 3.119 ± 0.48
1.429IlePhe: 1.429 ± 0.316
3.184IleGly: 3.184 ± 0.598
0.78IleHis: 0.78 ± 0.206
1.689IleIle: 1.689 ± 0.472
1.884IleLys: 1.884 ± 0.415
3.119IleLeu: 3.119 ± 0.515
0.455IleMet: 0.455 ± 0.246
1.105IleAsn: 1.105 ± 0.199
2.209IlePro: 2.209 ± 0.374
1.429IleGln: 1.429 ± 0.243
2.989IleArg: 2.989 ± 0.364
2.469IleSer: 2.469 ± 0.467
2.859IleThr: 2.859 ± 0.384
3.119IleVal: 3.119 ± 0.434
0.195IleTrp: 0.195 ± 0.095
0.91IleTyr: 0.91 ± 0.224
0.0IleXaa: 0.0 ± 0.0
Lys
6.108LysAla: 6.108 ± 0.843
0.715LysCys: 0.715 ± 0.246
2.404LysAsp: 2.404 ± 0.49
2.729LysGlu: 2.729 ± 0.403
1.17LysPhe: 1.17 ± 0.323
4.159LysGly: 4.159 ± 0.581
0.585LysHis: 0.585 ± 0.222
1.819LysIle: 1.819 ± 0.401
3.184LysLys: 3.184 ± 0.6
3.964LysLeu: 3.964 ± 0.565
0.845LysMet: 0.845 ± 0.217
0.585LysAsn: 0.585 ± 0.2
2.664LysPro: 2.664 ± 0.635
1.624LysGln: 1.624 ± 0.249
3.184LysArg: 3.184 ± 0.552
2.339LysSer: 2.339 ± 0.481
2.989LysThr: 2.989 ± 0.446
2.729LysVal: 2.729 ± 0.53
0.52LysTrp: 0.52 ± 0.192
2.144LysTyr: 2.144 ± 0.367
0.0LysXaa: 0.0 ± 0.0
Leu
11.696LeuAla: 11.696 ± 0.867
0.65LeuCys: 0.65 ± 0.233
5.393LeuAsp: 5.393 ± 0.652
3.964LeuGlu: 3.964 ± 0.472
1.949LeuPhe: 1.949 ± 0.316
7.667LeuGly: 7.667 ± 0.918
1.494LeuHis: 1.494 ± 0.354
3.834LeuIle: 3.834 ± 0.41
3.444LeuLys: 3.444 ± 0.678
5.718LeuLeu: 5.718 ± 0.684
2.274LeuMet: 2.274 ± 0.488
3.509LeuAsn: 3.509 ± 0.497
4.548LeuPro: 4.548 ± 0.556
2.794LeuGln: 2.794 ± 0.502
5.328LeuArg: 5.328 ± 0.791
5.718LeuSer: 5.718 ± 0.639
5.783LeuThr: 5.783 ± 0.496
6.238LeuVal: 6.238 ± 0.633
1.04LeuTrp: 1.04 ± 0.202
1.689LeuTyr: 1.689 ± 0.33
0.0LeuXaa: 0.0 ± 0.0
Met
3.899MetAla: 3.899 ± 0.466
0.13MetCys: 0.13 ± 0.08
1.429MetAsp: 1.429 ± 0.293
0.91MetGlu: 0.91 ± 0.235
0.455MetPhe: 0.455 ± 0.2
1.3MetGly: 1.3 ± 0.251
0.585MetHis: 0.585 ± 0.229
0.65MetIle: 0.65 ± 0.201
0.975MetLys: 0.975 ± 0.202
1.754MetLeu: 1.754 ± 0.377
0.26MetMet: 0.26 ± 0.123
0.455MetAsn: 0.455 ± 0.156
1.04MetPro: 1.04 ± 0.249
0.78MetGln: 0.78 ± 0.194
1.235MetArg: 1.235 ± 0.293
1.884MetSer: 1.884 ± 0.344
1.949MetThr: 1.949 ± 0.317
1.365MetVal: 1.365 ± 0.281
0.195MetTrp: 0.195 ± 0.118
0.455MetTyr: 0.455 ± 0.176
0.0MetXaa: 0.0 ± 0.0
Asn
3.054AsnAla: 3.054 ± 0.48
0.455AsnCys: 0.455 ± 0.183
1.494AsnAsp: 1.494 ± 0.321
1.559AsnGlu: 1.559 ± 0.329
1.17AsnPhe: 1.17 ± 0.327
2.729AsnGly: 2.729 ± 0.394
0.78AsnHis: 0.78 ± 0.224
1.3AsnIle: 1.3 ± 0.376
0.975AsnLys: 0.975 ± 0.244
2.079AsnLeu: 2.079 ± 0.459
0.455AsnMet: 0.455 ± 0.149
1.3AsnAsn: 1.3 ± 0.295
1.819AsnPro: 1.819 ± 0.361
0.91AsnGln: 0.91 ± 0.193
1.3AsnArg: 1.3 ± 0.234
1.429AsnSer: 1.429 ± 0.296
2.209AsnThr: 2.209 ± 0.431
1.754AsnVal: 1.754 ± 0.28
0.65AsnTrp: 0.65 ± 0.196
0.455AsnTyr: 0.455 ± 0.137
0.0AsnXaa: 0.0 ± 0.0
Pro
6.173ProAla: 6.173 ± 0.814
0.585ProCys: 0.585 ± 0.2
3.899ProAsp: 3.899 ± 0.495
3.054ProGlu: 3.054 ± 0.522
1.235ProPhe: 1.235 ± 0.325
4.418ProGly: 4.418 ± 0.473
0.52ProHis: 0.52 ± 0.183
1.559ProIle: 1.559 ± 0.355
2.729ProLys: 2.729 ± 0.528
3.054ProLeu: 3.054 ± 0.396
1.17ProMet: 1.17 ± 0.234
0.845ProAsn: 0.845 ± 0.22
1.949ProPro: 1.949 ± 0.343
1.365ProGln: 1.365 ± 0.237
2.209ProArg: 2.209 ± 0.374
3.314ProSer: 3.314 ± 0.506
3.899ProThr: 3.899 ± 0.553
3.899ProVal: 3.899 ± 0.475
0.78ProTrp: 0.78 ± 0.229
1.04ProTyr: 1.04 ± 0.252
0.0ProXaa: 0.0 ± 0.0
Gln
5.588GlnAla: 5.588 ± 0.754
0.195GlnCys: 0.195 ± 0.117
1.365GlnAsp: 1.365 ± 0.308
2.274GlnGlu: 2.274 ± 0.544
1.04GlnPhe: 1.04 ± 0.306
2.144GlnGly: 2.144 ± 0.25
0.325GlnHis: 0.325 ± 0.148
1.624GlnIle: 1.624 ± 0.295
1.624GlnLys: 1.624 ± 0.322
2.469GlnLeu: 2.469 ± 0.607
1.3GlnMet: 1.3 ± 0.333
0.975GlnAsn: 0.975 ± 0.277
0.78GlnPro: 0.78 ± 0.215
0.585GlnGln: 0.585 ± 0.237
2.014GlnArg: 2.014 ± 0.347
1.819GlnSer: 1.819 ± 0.259
2.079GlnThr: 2.079 ± 0.341
2.989GlnVal: 2.989 ± 0.382
0.585GlnTrp: 0.585 ± 0.188
0.39GlnTyr: 0.39 ± 0.135
0.0GlnXaa: 0.0 ± 0.0
Arg
6.043ArgAla: 6.043 ± 0.724
0.52ArgCys: 0.52 ± 0.211
4.483ArgAsp: 4.483 ± 0.63
3.704ArgGlu: 3.704 ± 0.554
2.469ArgPhe: 2.469 ± 0.425
4.418ArgGly: 4.418 ± 0.513
1.689ArgHis: 1.689 ± 0.471
2.274ArgIle: 2.274 ± 0.35
3.314ArgLys: 3.314 ± 0.455
5.328ArgLeu: 5.328 ± 0.734
1.819ArgMet: 1.819 ± 0.439
1.754ArgAsn: 1.754 ± 0.297
2.859ArgPro: 2.859 ± 0.454
2.209ArgGln: 2.209 ± 0.31
5.328ArgArg: 5.328 ± 0.811
3.509ArgSer: 3.509 ± 0.559
3.509ArgThr: 3.509 ± 0.636
4.678ArgVal: 4.678 ± 0.545
0.52ArgTrp: 0.52 ± 0.198
1.884ArgTyr: 1.884 ± 0.272
0.0ArgXaa: 0.0 ± 0.0
Ser
6.433SerAla: 6.433 ± 0.742
0.195SerCys: 0.195 ± 0.129
3.119SerAsp: 3.119 ± 0.463
4.159SerGlu: 4.159 ± 0.419
2.079SerPhe: 2.079 ± 0.371
5.978SerGly: 5.978 ± 0.536
1.3SerHis: 1.3 ± 0.445
2.859SerIle: 2.859 ± 0.435
2.989SerLys: 2.989 ± 0.442
5.588SerLeu: 5.588 ± 0.684
1.429SerMet: 1.429 ± 0.3
1.494SerAsn: 1.494 ± 0.343
2.534SerPro: 2.534 ± 0.371
2.014SerGln: 2.014 ± 0.386
3.769SerArg: 3.769 ± 0.704
4.353SerSer: 4.353 ± 0.59
3.964SerThr: 3.964 ± 0.46
4.548SerVal: 4.548 ± 0.653
0.975SerTrp: 0.975 ± 0.285
1.884SerTyr: 1.884 ± 0.323
0.0SerXaa: 0.0 ± 0.0
Thr
6.953ThrAla: 6.953 ± 0.726
0.325ThrCys: 0.325 ± 0.151
3.964ThrAsp: 3.964 ± 0.545
4.873ThrGlu: 4.873 ± 0.665
1.884ThrPhe: 1.884 ± 0.416
5.263ThrGly: 5.263 ± 0.714
0.975ThrHis: 0.975 ± 0.287
3.314ThrIle: 3.314 ± 0.522
2.664ThrLys: 2.664 ± 0.43
5.003ThrLeu: 5.003 ± 0.655
0.845ThrMet: 0.845 ± 0.227
1.559ThrAsn: 1.559 ± 0.403
4.029ThrPro: 4.029 ± 0.625
1.559ThrGln: 1.559 ± 0.38
3.639ThrArg: 3.639 ± 0.499
4.743ThrSer: 4.743 ± 0.609
4.613ThrThr: 4.613 ± 0.87
5.848ThrVal: 5.848 ± 0.739
1.3ThrTrp: 1.3 ± 0.274
3.379ThrTyr: 3.379 ± 0.536
0.0ThrXaa: 0.0 ± 0.0
Val
8.187ValAla: 8.187 ± 0.718
0.52ValCys: 0.52 ± 0.183
4.159ValAsp: 4.159 ± 0.433
5.718ValGlu: 5.718 ± 0.766
1.624ValPhe: 1.624 ± 0.292
5.783ValGly: 5.783 ± 0.753
1.559ValHis: 1.559 ± 0.333
3.249ValIle: 3.249 ± 0.442
3.444ValLys: 3.444 ± 0.421
5.588ValLeu: 5.588 ± 0.553
1.819ValMet: 1.819 ± 0.283
2.144ValAsn: 2.144 ± 0.339
3.574ValPro: 3.574 ± 0.501
2.729ValGln: 2.729 ± 0.46
5.068ValArg: 5.068 ± 0.632
3.574ValSer: 3.574 ± 0.478
6.498ValThr: 6.498 ± 0.725
4.873ValVal: 4.873 ± 0.427
1.559ValTrp: 1.559 ± 0.3
1.429ValTyr: 1.429 ± 0.347
0.0ValXaa: 0.0 ± 0.0
Trp
2.144TrpAla: 2.144 ± 0.345
0.26TrpCys: 0.26 ± 0.117
1.429TrpAsp: 1.429 ± 0.34
1.429TrpGlu: 1.429 ± 0.355
0.52TrpPhe: 0.52 ± 0.169
1.04TrpGly: 1.04 ± 0.285
0.39TrpHis: 0.39 ± 0.149
0.13TrpIle: 0.13 ± 0.082
0.975TrpLys: 0.975 ± 0.233
1.365TrpLeu: 1.365 ± 0.366
0.39TrpMet: 0.39 ± 0.134
0.715TrpAsn: 0.715 ± 0.231
0.975TrpPro: 0.975 ± 0.258
0.26TrpGln: 0.26 ± 0.145
1.365TrpArg: 1.365 ± 0.278
1.235TrpSer: 1.235 ± 0.355
1.235TrpThr: 1.235 ± 0.264
1.3TrpVal: 1.3 ± 0.323
0.13TrpTrp: 0.13 ± 0.095
0.26TrpTyr: 0.26 ± 0.127
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.794TyrAla: 2.794 ± 0.404
0.195TyrCys: 0.195 ± 0.121
2.664TyrAsp: 2.664 ± 0.44
1.754TyrGlu: 1.754 ± 0.392
0.715TyrPhe: 0.715 ± 0.182
2.469TyrGly: 2.469 ± 0.518
0.52TyrHis: 0.52 ± 0.215
0.975TyrIle: 0.975 ± 0.208
0.975TyrLys: 0.975 ± 0.26
2.209TyrLeu: 2.209 ± 0.391
0.39TyrMet: 0.39 ± 0.13
1.04TyrAsn: 1.04 ± 0.223
1.429TyrPro: 1.429 ± 0.341
0.52TyrGln: 0.52 ± 0.193
1.819TyrArg: 1.819 ± 0.358
1.819TyrSer: 1.819 ± 0.461
1.624TyrThr: 1.624 ± 0.301
2.079TyrVal: 2.079 ± 0.438
0.585TyrTrp: 0.585 ± 0.18
0.975TyrTyr: 0.975 ± 0.234
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (15391 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski