Amino acid dipepetide frequency for Streptomyces phage Amela

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.763AlaAla: 12.763 ± 1.155
0.798AlaCys: 0.798 ± 0.278
6.98AlaAsp: 6.98 ± 0.717
7.445AlaGlu: 7.445 ± 0.836
3.191AlaPhe: 3.191 ± 0.545
8.309AlaGly: 8.309 ± 0.837
1.662AlaHis: 1.662 ± 0.324
4.719AlaIle: 4.719 ± 0.662
4.786AlaLys: 4.786 ± 0.656
10.104AlaLeu: 10.104 ± 1.127
2.925AlaMet: 2.925 ± 0.377
2.925AlaAsn: 2.925 ± 0.399
4.52AlaPro: 4.52 ± 0.577
4.454AlaGln: 4.454 ± 0.54
6.847AlaArg: 6.847 ± 0.897
6.514AlaSer: 6.514 ± 0.584
6.647AlaThr: 6.647 ± 0.655
8.708AlaVal: 8.708 ± 0.919
2.459AlaTrp: 2.459 ± 0.372
3.058AlaTyr: 3.058 ± 0.54
0.0AlaXaa: 0.0 ± 0.0
Cys
0.465CysAla: 0.465 ± 0.209
0.066CysCys: 0.066 ± 0.073
0.399CysAsp: 0.399 ± 0.166
0.465CysGlu: 0.465 ± 0.187
0.066CysPhe: 0.066 ± 0.07
0.598CysGly: 0.598 ± 0.19
0.332CysHis: 0.332 ± 0.144
0.133CysIle: 0.133 ± 0.088
0.133CysLys: 0.133 ± 0.088
0.465CysLeu: 0.465 ± 0.182
0.066CysMet: 0.066 ± 0.054
0.199CysAsn: 0.199 ± 0.114
0.665CysPro: 0.665 ± 0.249
0.266CysGln: 0.266 ± 0.149
0.598CysArg: 0.598 ± 0.223
0.532CysSer: 0.532 ± 0.221
0.665CysThr: 0.665 ± 0.25
0.399CysVal: 0.399 ± 0.168
0.133CysTrp: 0.133 ± 0.086
0.332CysTyr: 0.332 ± 0.165
0.0CysXaa: 0.0 ± 0.0
Asp
6.847AspAla: 6.847 ± 0.674
0.598AspCys: 0.598 ± 0.187
3.789AspAsp: 3.789 ± 0.593
4.653AspGlu: 4.653 ± 0.748
2.393AspPhe: 2.393 ± 0.38
6.115AspGly: 6.115 ± 0.66
1.329AspHis: 1.329 ± 0.362
2.925AspIle: 2.925 ± 0.385
2.925AspLys: 2.925 ± 0.477
5.185AspLeu: 5.185 ± 0.5
1.728AspMet: 1.728 ± 0.301
1.662AspAsn: 1.662 ± 0.412
3.722AspPro: 3.722 ± 0.578
2.592AspGln: 2.592 ± 0.518
3.656AspArg: 3.656 ± 0.535
2.393AspSer: 2.393 ± 0.468
3.988AspThr: 3.988 ± 0.48
4.786AspVal: 4.786 ± 0.57
1.462AspTrp: 1.462 ± 0.265
1.662AspTyr: 1.662 ± 0.378
0.0AspXaa: 0.0 ± 0.0
Glu
7.91GluAla: 7.91 ± 0.908
0.598GluCys: 0.598 ± 0.163
5.185GluAsp: 5.185 ± 0.754
4.188GluGlu: 4.188 ± 0.759
1.662GluPhe: 1.662 ± 0.331
5.118GluGly: 5.118 ± 0.647
1.263GluHis: 1.263 ± 0.322
2.526GluIle: 2.526 ± 0.472
2.459GluLys: 2.459 ± 0.374
6.381GluLeu: 6.381 ± 0.794
1.595GluMet: 1.595 ± 0.385
1.994GluAsn: 1.994 ± 0.315
2.659GluPro: 2.659 ± 0.479
2.459GluGln: 2.459 ± 0.388
4.321GluArg: 4.321 ± 0.722
3.257GluSer: 3.257 ± 0.59
3.191GluThr: 3.191 ± 0.53
5.052GluVal: 5.052 ± 0.572
1.529GluTrp: 1.529 ± 0.309
1.795GluTyr: 1.795 ± 0.36
0.0GluXaa: 0.0 ± 0.0
Phe
3.324PheAla: 3.324 ± 0.467
0.199PheCys: 0.199 ± 0.127
2.393PheAsp: 2.393 ± 0.417
2.061PheGlu: 2.061 ± 0.411
0.931PhePhe: 0.931 ± 0.218
3.058PheGly: 3.058 ± 0.45
0.598PheHis: 0.598 ± 0.181
1.329PheIle: 1.329 ± 0.269
1.396PheLys: 1.396 ± 0.313
1.994PheLeu: 1.994 ± 0.331
0.798PheMet: 0.798 ± 0.286
0.931PheAsn: 0.931 ± 0.22
1.064PhePro: 1.064 ± 0.265
0.598PheGln: 0.598 ± 0.178
1.994PheArg: 1.994 ± 0.352
1.529PheSer: 1.529 ± 0.383
2.061PheThr: 2.061 ± 0.376
1.662PheVal: 1.662 ± 0.421
0.665PheTrp: 0.665 ± 0.213
1.263PheTyr: 1.263 ± 0.303
0.0PheXaa: 0.0 ± 0.0
Gly
7.777GlyAla: 7.777 ± 0.889
0.066GlyCys: 0.066 ± 0.071
6.448GlyAsp: 6.448 ± 0.953
4.985GlyGlu: 4.985 ± 0.576
3.058GlyPhe: 3.058 ± 0.528
7.179GlyGly: 7.179 ± 0.963
2.061GlyHis: 2.061 ± 0.4
5.318GlyIle: 5.318 ± 0.825
4.52GlyLys: 4.52 ± 0.721
7.245GlyLeu: 7.245 ± 0.856
1.994GlyMet: 1.994 ± 0.368
2.659GlyAsn: 2.659 ± 0.366
3.722GlyPro: 3.722 ± 0.846
2.459GlyGln: 2.459 ± 0.432
4.653GlyArg: 4.653 ± 0.59
5.517GlySer: 5.517 ± 1.026
6.115GlyThr: 6.115 ± 0.722
6.581GlyVal: 6.581 ± 0.585
2.061GlyTrp: 2.061 ± 0.443
2.393GlyTyr: 2.393 ± 0.444
0.0GlyXaa: 0.0 ± 0.0
His
1.861HisAla: 1.861 ± 0.36
0.266HisCys: 0.266 ± 0.13
0.864HisAsp: 0.864 ± 0.254
1.196HisGlu: 1.196 ± 0.287
1.13HisPhe: 1.13 ± 0.235
1.529HisGly: 1.529 ± 0.297
0.532HisHis: 0.532 ± 0.169
0.598HisIle: 0.598 ± 0.275
0.332HisLys: 0.332 ± 0.129
1.795HisLeu: 1.795 ± 0.374
0.199HisMet: 0.199 ± 0.102
0.465HisAsn: 0.465 ± 0.157
1.13HisPro: 1.13 ± 0.248
0.798HisGln: 0.798 ± 0.219
0.997HisArg: 0.997 ± 0.284
0.532HisSer: 0.532 ± 0.194
1.329HisThr: 1.329 ± 0.311
0.931HisVal: 0.931 ± 0.313
0.731HisTrp: 0.731 ± 0.268
1.13HisTyr: 1.13 ± 0.257
0.0HisXaa: 0.0 ± 0.0
Ile
5.251IleAla: 5.251 ± 0.617
0.199IleCys: 0.199 ± 0.103
2.327IleAsp: 2.327 ± 0.339
4.387IleGlu: 4.387 ± 0.498
0.864IlePhe: 0.864 ± 0.233
4.055IleGly: 4.055 ± 0.689
0.532IleHis: 0.532 ± 0.179
1.795IleIle: 1.795 ± 0.729
1.529IleLys: 1.529 ± 0.405
2.925IleLeu: 2.925 ± 0.488
0.598IleMet: 0.598 ± 0.185
1.462IleAsn: 1.462 ± 0.303
2.393IlePro: 2.393 ± 0.363
1.662IleGln: 1.662 ± 0.478
3.39IleArg: 3.39 ± 0.481
2.459IleSer: 2.459 ± 0.432
3.058IleThr: 3.058 ± 0.501
3.523IleVal: 3.523 ± 0.489
0.465IleTrp: 0.465 ± 0.188
1.728IleTyr: 1.728 ± 0.376
0.0IleXaa: 0.0 ± 0.0
Lys
6.115LysAla: 6.115 ± 0.934
0.066LysCys: 0.066 ± 0.068
2.725LysAsp: 2.725 ± 0.457
1.994LysGlu: 1.994 ± 0.336
0.931LysPhe: 0.931 ± 0.258
4.719LysGly: 4.719 ± 0.653
0.798LysHis: 0.798 ± 0.26
2.26LysIle: 2.26 ± 0.368
2.925LysLys: 2.925 ± 0.468
4.254LysLeu: 4.254 ± 0.57
0.731LysMet: 0.731 ± 0.208
1.196LysAsn: 1.196 ± 0.303
3.457LysPro: 3.457 ± 0.646
1.529LysGln: 1.529 ± 0.295
2.26LysArg: 2.26 ± 0.422
2.327LysSer: 2.327 ± 0.488
3.324LysThr: 3.324 ± 0.556
2.925LysVal: 2.925 ± 0.612
0.399LysTrp: 0.399 ± 0.168
1.064LysTyr: 1.064 ± 0.293
0.0LysXaa: 0.0 ± 0.0
Leu
10.303LeuAla: 10.303 ± 1.006
0.399LeuCys: 0.399 ± 0.153
5.916LeuAsp: 5.916 ± 0.648
4.055LeuGlu: 4.055 ± 0.587
1.728LeuPhe: 1.728 ± 0.404
6.714LeuGly: 6.714 ± 0.651
1.329LeuHis: 1.329 ± 0.323
3.855LeuIle: 3.855 ± 0.411
4.852LeuLys: 4.852 ± 0.691
5.916LeuLeu: 5.916 ± 0.729
1.728LeuMet: 1.728 ± 0.418
3.257LeuAsn: 3.257 ± 0.417
4.786LeuPro: 4.786 ± 0.567
2.127LeuGln: 2.127 ± 0.313
4.919LeuArg: 4.919 ± 0.704
5.982LeuSer: 5.982 ± 0.52
5.251LeuThr: 5.251 ± 0.652
5.783LeuVal: 5.783 ± 0.593
1.329LeuTrp: 1.329 ± 0.297
2.194LeuTyr: 2.194 ± 0.334
0.0LeuXaa: 0.0 ± 0.0
Met
3.124MetAla: 3.124 ± 0.478
0.133MetCys: 0.133 ± 0.089
0.997MetAsp: 0.997 ± 0.239
1.064MetGlu: 1.064 ± 0.24
0.665MetPhe: 0.665 ± 0.309
1.13MetGly: 1.13 ± 0.297
0.066MetHis: 0.066 ± 0.058
1.462MetIle: 1.462 ± 0.309
1.064MetLys: 1.064 ± 0.311
1.728MetLeu: 1.728 ± 0.267
0.399MetMet: 0.399 ± 0.158
0.798MetAsn: 0.798 ± 0.225
1.396MetPro: 1.396 ± 0.317
0.465MetGln: 0.465 ± 0.198
1.529MetArg: 1.529 ± 0.427
2.127MetSer: 2.127 ± 0.323
1.728MetThr: 1.728 ± 0.332
1.329MetVal: 1.329 ± 0.243
0.199MetTrp: 0.199 ± 0.112
0.266MetTyr: 0.266 ± 0.114
0.0MetXaa: 0.0 ± 0.0
Asn
3.257AsnAla: 3.257 ± 0.466
0.399AsnCys: 0.399 ± 0.2
1.994AsnAsp: 1.994 ± 0.363
1.662AsnGlu: 1.662 ± 0.36
1.13AsnPhe: 1.13 ± 0.297
3.39AsnGly: 3.39 ± 0.519
0.665AsnHis: 0.665 ± 0.196
0.931AsnIle: 0.931 ± 0.247
1.196AsnLys: 1.196 ± 0.307
2.792AsnLeu: 2.792 ± 0.48
0.266AsnMet: 0.266 ± 0.145
0.731AsnAsn: 0.731 ± 0.31
1.728AsnPro: 1.728 ± 0.292
1.13AsnGln: 1.13 ± 0.263
1.795AsnArg: 1.795 ± 0.293
1.595AsnSer: 1.595 ± 0.306
2.459AsnThr: 2.459 ± 0.379
1.662AsnVal: 1.662 ± 0.345
0.532AsnTrp: 0.532 ± 0.183
0.864AsnTyr: 0.864 ± 0.158
0.0AsnXaa: 0.0 ± 0.0
Pro
5.318ProAla: 5.318 ± 0.724
0.532ProCys: 0.532 ± 0.177
3.058ProAsp: 3.058 ± 0.345
4.121ProGlu: 4.121 ± 0.543
1.329ProPhe: 1.329 ± 0.331
4.719ProGly: 4.719 ± 0.471
0.731ProHis: 0.731 ± 0.189
2.26ProIle: 2.26 ± 0.601
2.127ProLys: 2.127 ± 0.447
3.324ProLeu: 3.324 ± 0.484
1.064ProMet: 1.064 ± 0.283
1.795ProAsn: 1.795 ± 0.424
1.861ProPro: 1.861 ± 0.404
1.263ProGln: 1.263 ± 0.412
2.393ProArg: 2.393 ± 0.394
3.589ProSer: 3.589 ± 0.534
4.254ProThr: 4.254 ± 0.759
3.789ProVal: 3.789 ± 0.445
0.864ProTrp: 0.864 ± 0.245
1.396ProTyr: 1.396 ± 0.367
0.0ProXaa: 0.0 ± 0.0
Gln
4.387GlnAla: 4.387 ± 0.62
0.199GlnCys: 0.199 ± 0.116
1.396GlnAsp: 1.396 ± 0.319
2.327GlnGlu: 2.327 ± 0.397
1.064GlnPhe: 1.064 ± 0.266
2.725GlnGly: 2.725 ± 0.691
0.598GlnHis: 0.598 ± 0.203
1.928GlnIle: 1.928 ± 0.337
1.462GlnLys: 1.462 ± 0.274
2.459GlnLeu: 2.459 ± 0.431
0.798GlnMet: 0.798 ± 0.269
1.196GlnAsn: 1.196 ± 0.303
1.263GlnPro: 1.263 ± 0.259
0.731GlnGln: 0.731 ± 0.231
2.725GlnArg: 2.725 ± 0.326
1.795GlnSer: 1.795 ± 0.312
1.928GlnThr: 1.928 ± 0.371
2.393GlnVal: 2.393 ± 0.397
0.665GlnTrp: 0.665 ± 0.182
0.997GlnTyr: 0.997 ± 0.288
0.0GlnXaa: 0.0 ± 0.0
Arg
5.584ArgAla: 5.584 ± 0.777
0.665ArgCys: 0.665 ± 0.199
4.121ArgAsp: 4.121 ± 0.741
3.988ArgGlu: 3.988 ± 0.636
2.459ArgPhe: 2.459 ± 0.473
4.321ArgGly: 4.321 ± 0.573
1.196ArgHis: 1.196 ± 0.28
2.327ArgIle: 2.327 ± 0.353
3.257ArgLys: 3.257 ± 0.456
5.717ArgLeu: 5.717 ± 0.666
1.861ArgMet: 1.861 ± 0.306
1.795ArgAsn: 1.795 ± 0.346
2.393ArgPro: 2.393 ± 0.489
2.194ArgGln: 2.194 ± 0.475
6.182ArgArg: 6.182 ± 1.04
3.656ArgSer: 3.656 ± 0.438
3.39ArgThr: 3.39 ± 0.417
4.254ArgVal: 4.254 ± 0.637
1.13ArgTrp: 1.13 ± 0.311
2.127ArgTyr: 2.127 ± 0.483
0.0ArgXaa: 0.0 ± 0.0
Ser
6.049SerAla: 6.049 ± 0.591
0.399SerCys: 0.399 ± 0.16
2.991SerAsp: 2.991 ± 0.481
4.321SerGlu: 4.321 ± 0.5
1.994SerPhe: 1.994 ± 0.358
5.783SerGly: 5.783 ± 0.875
0.731SerHis: 0.731 ± 0.234
2.526SerIle: 2.526 ± 0.624
2.194SerLys: 2.194 ± 0.406
5.384SerLeu: 5.384 ± 0.494
1.329SerMet: 1.329 ± 0.271
1.728SerAsn: 1.728 ± 0.312
2.858SerPro: 2.858 ± 0.485
2.127SerGln: 2.127 ± 0.331
3.523SerArg: 3.523 ± 0.546
3.988SerSer: 3.988 ± 0.905
4.719SerThr: 4.719 ± 0.787
4.254SerVal: 4.254 ± 0.515
1.329SerTrp: 1.329 ± 0.266
1.861SerTyr: 1.861 ± 0.368
0.0SerXaa: 0.0 ± 0.0
Thr
6.448ThrAla: 6.448 ± 0.734
0.598ThrCys: 0.598 ± 0.208
4.321ThrAsp: 4.321 ± 0.542
3.855ThrGlu: 3.855 ± 0.551
2.526ThrPhe: 2.526 ± 0.384
5.584ThrGly: 5.584 ± 0.988
1.462ThrHis: 1.462 ± 0.439
2.858ThrIle: 2.858 ± 0.326
2.925ThrLys: 2.925 ± 0.403
5.517ThrLeu: 5.517 ± 0.69
1.064ThrMet: 1.064 ± 0.288
1.728ThrAsn: 1.728 ± 0.403
4.52ThrPro: 4.52 ± 1.02
1.861ThrGln: 1.861 ± 0.384
3.523ThrArg: 3.523 ± 0.483
4.852ThrSer: 4.852 ± 0.881
5.118ThrThr: 5.118 ± 0.787
5.185ThrVal: 5.185 ± 0.565
1.196ThrTrp: 1.196 ± 0.297
2.659ThrTyr: 2.659 ± 0.367
0.0ThrXaa: 0.0 ± 0.0
Val
7.445ValAla: 7.445 ± 0.638
0.332ValCys: 0.332 ± 0.121
4.985ValAsp: 4.985 ± 0.606
4.719ValGlu: 4.719 ± 0.656
1.662ValPhe: 1.662 ± 0.344
6.913ValGly: 6.913 ± 0.766
1.662ValHis: 1.662 ± 0.319
3.457ValIle: 3.457 ± 0.495
4.055ValLys: 4.055 ± 0.562
5.318ValLeu: 5.318 ± 0.876
1.396ValMet: 1.396 ± 0.277
1.795ValAsn: 1.795 ± 0.353
3.922ValPro: 3.922 ± 0.484
2.459ValGln: 2.459 ± 0.375
3.722ValArg: 3.722 ± 0.528
4.055ValSer: 4.055 ± 0.448
5.982ValThr: 5.982 ± 0.563
5.85ValVal: 5.85 ± 0.719
0.798ValTrp: 0.798 ± 0.175
1.595ValTyr: 1.595 ± 0.379
0.0ValXaa: 0.0 ± 0.0
Trp
2.194TrpAla: 2.194 ± 0.447
0.266TrpCys: 0.266 ± 0.14
1.595TrpAsp: 1.595 ± 0.303
1.196TrpGlu: 1.196 ± 0.293
0.465TrpPhe: 0.465 ± 0.183
1.263TrpGly: 1.263 ± 0.312
0.399TrpHis: 0.399 ± 0.151
0.332TrpIle: 0.332 ± 0.144
0.997TrpLys: 0.997 ± 0.243
1.595TrpLeu: 1.595 ± 0.358
0.532TrpMet: 0.532 ± 0.158
0.864TrpAsn: 0.864 ± 0.23
0.399TrpPro: 0.399 ± 0.187
0.798TrpGln: 0.798 ± 0.208
1.263TrpArg: 1.263 ± 0.331
1.263TrpSer: 1.263 ± 0.334
1.329TrpThr: 1.329 ± 0.272
1.396TrpVal: 1.396 ± 0.31
0.199TrpTrp: 0.199 ± 0.123
0.399TrpTyr: 0.399 ± 0.159
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.457TyrAla: 3.457 ± 0.369
0.199TyrCys: 0.199 ± 0.115
1.928TyrAsp: 1.928 ± 0.389
2.459TyrGlu: 2.459 ± 0.464
0.598TyrPhe: 0.598 ± 0.207
3.589TyrGly: 3.589 ± 0.588
0.399TyrHis: 0.399 ± 0.171
1.064TyrIle: 1.064 ± 0.282
0.864TyrLys: 0.864 ± 0.282
2.327TyrLeu: 2.327 ± 0.448
0.532TyrMet: 0.532 ± 0.178
0.997TyrAsn: 0.997 ± 0.267
1.329TyrPro: 1.329 ± 0.31
1.064TyrGln: 1.064 ± 0.261
2.327TyrArg: 2.327 ± 0.422
2.127TyrSer: 2.127 ± 0.505
1.263TyrThr: 1.263 ± 0.273
1.662TyrVal: 1.662 ± 0.31
0.598TyrTrp: 0.598 ± 0.213
0.798TyrTyr: 0.798 ± 0.228
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (15045 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski