Amino acid dipeptide frequency for Mycobacterium phage Journey13

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
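As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. It is a minimal sketch, not the database's actual procedure: the function name `dipeptide_permille`, the use of one-letter codes, and normalising by the total number of dipeptides are all assumptions made here.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: sequences use one-letter codes and frequencies are normalised
    by the total number of dipeptides; the database may normalise differently.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: "MAAG" gives MA, AA, AG and "AAK" gives AA, AK (5 dipeptides in total).
freqs = dipeptide_permille(["MAAG", "AAK"])
print(freqs["AA"])  # 2 of 5 dipeptides
```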

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
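The arithmetic behind the expected value can be sketched as follows. The function names are hypothetical, and comparing against the uniform expectation is an assumption about how over- and underrepresentation is decided; the page does not state its exact rule.

```python
def uniform_expectation_permille(n_residue_types=20):
    """Expected frequency (per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / (n_residue_types ** 2)

def representation(observed_permille, expected_permille):
    """Label a dipeptide relative to the expectation (assumed rule, see lead-in)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"

expected = uniform_expectation_permille()  # 20 standard amino acids, 400 dipeptides
print(expected)                            # 2.5
print(representation(11.188, expected))    # AlaAla value from the table: over
print(representation(0.0, expected))       # CysCys value from the table: under
```

With Xaa included, `uniform_expectation_permille(21)` gives 1000/441, roughly 2.27‰.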

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
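For example, the unit conversion can be written as a short helper. The function names are illustrative only; the input value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

ala_ala = 11.188  # AlaAla, in per mille
print(f"{permille_to_fraction(ala_ala):.6f}")  # 0.011188
print(f"{permille_to_percent(ala_ala):.4f}%")  # 1.1188%
```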

For more information see sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 11.188 ± 0.943
AlaCys: 0.932 ± 0.231
AlaAsp: 4.795 ± 0.543
AlaGlu: 7.725 ± 0.709
AlaPhe: 4.196 ± 0.636
AlaGly: 8.125 ± 0.941
AlaHis: 1.598 ± 0.291
AlaIle: 4.662 ± 0.521
AlaLys: 4.861 ± 0.542
AlaLeu: 9.523 ± 0.96
AlaMet: 2.93 ± 0.493
AlaAsn: 3.596 ± 0.538
AlaPro: 5.061 ± 0.734
AlaGln: 3.53 ± 0.564
AlaArg: 6.06 ± 0.797
AlaSer: 4.129 ± 0.529
AlaThr: 4.795 ± 0.602
AlaVal: 7.326 ± 0.693
AlaTrp: 2.198 ± 0.443
AlaTyr: 2.464 ± 0.418
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.799 ± 0.215
CysCys: 0.0 ± 0.0
CysAsp: 0.666 ± 0.243
CysGlu: 0.533 ± 0.214
CysPhe: 0.333 ± 0.169
CysGly: 1.132 ± 0.295
CysHis: 0.2 ± 0.109
CysIle: 0.4 ± 0.199
CysLys: 0.599 ± 0.223
CysLeu: 0.599 ± 0.207
CysMet: 0.133 ± 0.088
CysAsn: 0.4 ± 0.15
CysPro: 0.333 ± 0.162
CysGln: 0.133 ± 0.09
CysArg: 0.866 ± 0.241
CysSer: 0.333 ± 0.148
CysThr: 0.466 ± 0.183
CysVal: 0.533 ± 0.167
CysTrp: 0.533 ± 0.24
CysTyr: 0.266 ± 0.127
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.593 ± 0.701
AspCys: 0.733 ± 0.209
AspAsp: 4.129 ± 0.544
AspGlu: 5.194 ± 0.675
AspPhe: 2.331 ± 0.42
AspGly: 5.527 ± 0.628
AspHis: 1.598 ± 0.377
AspIle: 3.396 ± 0.481
AspLys: 1.931 ± 0.352
AspLeu: 6.127 ± 0.743
AspMet: 1.465 ± 0.294
AspAsn: 1.532 ± 0.319
AspPro: 5.128 ± 0.636
AspGln: 1.465 ± 0.281
AspArg: 2.797 ± 0.417
AspSer: 2.93 ± 0.392
AspThr: 3.13 ± 0.545
AspVal: 4.329 ± 0.514
AspTrp: 1.199 ± 0.293
AspTyr: 2.864 ± 0.454
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.259 ± 0.817
GluCys: 0.266 ± 0.112
GluAsp: 4.861 ± 0.747
GluGlu: 4.995 ± 0.697
GluPhe: 3.463 ± 0.526
GluGly: 5.461 ± 0.58
GluHis: 1.132 ± 0.263
GluIle: 4.529 ± 0.537
GluLys: 1.931 ± 0.371
GluLeu: 7.326 ± 0.697
GluMet: 1.598 ± 0.333
GluAsn: 1.665 ± 0.316
GluPro: 2.797 ± 0.449
GluGln: 2.464 ± 0.339
GluArg: 5.061 ± 0.54
GluSer: 3.396 ± 0.471
GluThr: 4.196 ± 0.473
GluVal: 4.395 ± 0.498
GluTrp: 1.532 ± 0.328
GluTyr: 2.397 ± 0.382
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.396 ± 0.584
PheCys: 0.2 ± 0.121
PheAsp: 2.531 ± 0.433
PheGlu: 2.464 ± 0.355
PhePhe: 0.533 ± 0.205
PheGly: 2.464 ± 0.39
PheHis: 0.866 ± 0.253
PheIle: 1.465 ± 0.385
PheLys: 1.399 ± 0.317
PheLeu: 2.397 ± 0.448
PheMet: 0.666 ± 0.214
PheAsn: 1.598 ± 0.331
PhePro: 1.865 ± 0.451
PheGln: 1.332 ± 0.404
PheArg: 2.531 ± 0.439
PheSer: 2.064 ± 0.316
PheThr: 2.198 ± 0.37
PheVal: 2.597 ± 0.359
PheTrp: 0.466 ± 0.163
PheTyr: 0.799 ± 0.224
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.593 ± 0.879
GlyCys: 0.799 ± 0.251
GlyAsp: 5.661 ± 0.717
GlyGlu: 4.995 ± 0.585
GlyPhe: 2.597 ± 0.432
GlyGly: 9.457 ± 1.512
GlyHis: 1.532 ± 0.273
GlyIle: 4.329 ± 0.653
GlyLys: 3.729 ± 0.589
GlyLeu: 5.994 ± 0.766
GlyMet: 2.131 ± 0.378
GlyAsn: 3.53 ± 0.485
GlyPro: 4.196 ± 0.43
GlyGln: 2.864 ± 0.371
GlyArg: 3.863 ± 0.443
GlySer: 4.462 ± 0.632
GlyThr: 5.661 ± 0.752
GlyVal: 6.593 ± 0.647
GlyTrp: 1.665 ± 0.294
GlyTyr: 2.73 ± 0.44
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.332 ± 0.309
HisCys: 0.2 ± 0.106
HisAsp: 1.199 ± 0.281
HisGlu: 1.598 ± 0.279
HisPhe: 0.266 ± 0.138
HisGly: 1.199 ± 0.322
HisHis: 0.333 ± 0.151
HisIle: 1.598 ± 0.333
HisLys: 0.866 ± 0.267
HisLeu: 1.332 ± 0.272
HisMet: 0.333 ± 0.155
HisAsn: 0.466 ± 0.178
HisPro: 1.066 ± 0.21
HisGln: 0.866 ± 0.244
HisArg: 1.532 ± 0.36
HisSer: 0.866 ± 0.194
HisThr: 1.199 ± 0.262
HisVal: 1.199 ± 0.273
HisTrp: 0.2 ± 0.139
HisTyr: 0.733 ± 0.336
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.327 ± 0.679
IleCys: 0.4 ± 0.175
IleAsp: 3.596 ± 0.468
IleGlu: 5.128 ± 0.562
IlePhe: 1.532 ± 0.357
IleGly: 4.262 ± 0.784
IleHis: 0.999 ± 0.259
IleIle: 1.798 ± 0.34
IleLys: 2.597 ± 0.372
IleLeu: 3.596 ± 0.538
IleMet: 0.866 ± 0.227
IleAsn: 2.198 ± 0.367
IlePro: 4.262 ± 0.45
IleGln: 1.532 ± 0.307
IleArg: 3.596 ± 0.454
IleSer: 2.597 ± 0.417
IleThr: 3.263 ± 0.462
IleVal: 2.664 ± 0.431
IleTrp: 0.799 ± 0.241
IleTyr: 1.066 ± 0.276
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.728 ± 0.622
LysCys: 0.4 ± 0.17
LysAsp: 2.397 ± 0.453
LysGlu: 1.998 ± 0.326
LysPhe: 0.932 ± 0.202
LysGly: 2.997 ± 0.532
LysHis: 0.533 ± 0.201
LysIle: 2.464 ± 0.379
LysLys: 3.33 ± 0.623
LysLeu: 2.597 ± 0.434
LysMet: 1.199 ± 0.262
LysAsn: 1.532 ± 0.343
LysPro: 2.864 ± 0.604
LysGln: 1.532 ± 0.33
LysArg: 3.33 ± 0.623
LysSer: 1.598 ± 0.351
LysThr: 3.463 ± 0.424
LysVal: 3.996 ± 0.635
LysTrp: 0.666 ± 0.18
LysTyr: 1.066 ± 0.309
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.924 ± 0.746
LeuCys: 0.666 ± 0.22
LeuAsp: 4.595 ± 0.692
LeuGlu: 5.594 ± 0.697
LeuPhe: 2.331 ± 0.33
LeuGly: 7.192 ± 1.033
LeuHis: 1.465 ± 0.35
LeuIle: 5.061 ± 0.498
LeuLys: 2.597 ± 0.447
LeuLeu: 5.328 ± 0.647
LeuMet: 2.464 ± 0.484
LeuAsn: 2.531 ± 0.422
LeuPro: 4.728 ± 0.573
LeuGln: 2.864 ± 0.568
LeuArg: 6.06 ± 0.78
LeuSer: 5.261 ± 0.627
LeuThr: 4.928 ± 0.617
LeuVal: 4.462 ± 0.567
LeuTrp: 1.532 ± 0.259
LeuTyr: 1.731 ± 0.365
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.997 ± 0.488
MetCys: 0.266 ± 0.124
MetAsp: 1.665 ± 0.365
MetGlu: 1.598 ± 0.264
MetPhe: 0.533 ± 0.189
MetGly: 1.598 ± 0.349
MetHis: 0.133 ± 0.088
MetIle: 1.332 ± 0.377
MetLys: 1.399 ± 0.256
MetLeu: 1.598 ± 0.294
MetMet: 0.466 ± 0.194
MetAsn: 0.932 ± 0.235
MetPro: 1.665 ± 0.381
MetGln: 0.799 ± 0.21
MetArg: 1.265 ± 0.329
MetSer: 1.865 ± 0.371
MetThr: 2.597 ± 0.414
MetVal: 1.598 ± 0.406
MetTrp: 0.2 ± 0.108
MetTyr: 0.533 ± 0.176
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.73 ± 0.39
AsnCys: 0.466 ± 0.171
AsnAsp: 1.931 ± 0.419
AsnGlu: 1.931 ± 0.341
AsnPhe: 0.932 ± 0.279
AsnGly: 3.197 ± 0.426
AsnHis: 1.199 ± 0.304
AsnIle: 1.265 ± 0.296
AsnLys: 1.265 ± 0.308
AsnLeu: 2.597 ± 0.448
AsnMet: 1.199 ± 0.264
AsnAsn: 0.999 ± 0.273
AsnPro: 2.73 ± 0.385
AsnGln: 1.066 ± 0.228
AsnArg: 2.531 ± 0.596
AsnSer: 1.132 ± 0.284
AsnThr: 1.798 ± 0.501
AsnVal: 2.331 ± 0.319
AsnTrp: 0.799 ± 0.231
AsnTyr: 1.265 ± 0.248
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 5.194 ± 0.48
ProCys: 0.533 ± 0.202
ProAsp: 3.396 ± 0.446
ProGlu: 5.194 ± 0.572
ProPhe: 1.731 ± 0.43
ProGly: 5.128 ± 0.696
ProHis: 0.999 ± 0.244
ProIle: 2.93 ± 0.397
ProLys: 2.397 ± 0.416
ProLeu: 3.396 ± 0.407
ProMet: 1.132 ± 0.293
ProAsn: 2.597 ± 0.4
ProPro: 2.531 ± 0.5
ProGln: 2.198 ± 0.405
ProArg: 3.33 ± 0.446
ProSer: 2.797 ± 0.407
ProThr: 3.596 ± 0.458
ProVal: 3.996 ± 0.526
ProTrp: 0.999 ± 0.414
ProTyr: 1.665 ± 0.31
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.662 ± 0.665
GlnCys: 0.266 ± 0.114
GlnAsp: 2.064 ± 0.386
GlnGlu: 1.532 ± 0.32
GlnPhe: 1.265 ± 0.271
GlnGly: 2.73 ± 0.487
GlnHis: 0.799 ± 0.216
GlnIle: 2.397 ± 0.319
GlnLys: 1.798 ± 0.358
GlnLeu: 3.463 ± 0.498
GlnMet: 0.733 ± 0.2
GlnAsn: 0.599 ± 0.201
GlnPro: 1.399 ± 0.393
GlnGln: 1.865 ± 0.397
GlnArg: 2.531 ± 0.407
GlnSer: 1.066 ± 0.283
GlnThr: 2.331 ± 0.424
GlnVal: 1.998 ± 0.405
GlnTrp: 0.599 ± 0.27
GlnTyr: 0.866 ± 0.204
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 5.527 ± 0.714
ArgCys: 0.866 ± 0.259
ArgAsp: 4.462 ± 0.661
ArgGlu: 4.928 ± 0.631
ArgPhe: 2.997 ± 0.576
ArgGly: 4.861 ± 0.7
ArgHis: 1.399 ± 0.323
ArgIle: 4.062 ± 0.53
ArgLys: 3.263 ± 0.533
ArgLeu: 5.727 ± 0.623
ArgMet: 2.064 ± 0.386
ArgAsn: 1.931 ± 0.394
ArgPro: 2.198 ± 0.336
ArgGln: 2.198 ± 0.428
ArgArg: 5.328 ± 0.784
ArgSer: 3.13 ± 0.444
ArgThr: 2.464 ± 0.372
ArgVal: 4.728 ± 0.518
ArgTrp: 1.332 ± 0.266
ArgTyr: 2.531 ± 0.417
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.995 ± 0.572
SerCys: 0.4 ± 0.178
SerAsp: 3.396 ± 0.628
SerGlu: 3.596 ± 0.519
SerPhe: 2.198 ± 0.473
SerGly: 4.662 ± 0.648
SerHis: 0.666 ± 0.205
SerIle: 2.464 ± 0.408
SerLys: 2.331 ± 0.485
SerLeu: 3.663 ± 0.531
SerMet: 1.199 ± 0.21
SerAsn: 1.465 ± 0.378
SerPro: 3.33 ± 0.447
SerGln: 1.798 ± 0.391
SerArg: 3.996 ± 0.489
SerSer: 2.797 ± 0.619
SerThr: 2.73 ± 0.457
SerVal: 3.263 ± 0.43
SerTrp: 1.199 ± 0.295
SerTyr: 1.066 ± 0.261
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.994 ± 0.639
ThrCys: 0.666 ± 0.235
ThrAsp: 3.863 ± 0.586
ThrGlu: 3.13 ± 0.412
ThrPhe: 2.397 ± 0.419
ThrGly: 4.995 ± 0.71
ThrHis: 0.999 ± 0.256
ThrIle: 2.997 ± 0.452
ThrLys: 2.664 ± 0.473
ThrLeu: 4.529 ± 0.76
ThrMet: 1.931 ± 0.339
ThrAsn: 2.064 ± 0.391
ThrPro: 4.062 ± 0.493
ThrGln: 2.331 ± 0.43
ThrArg: 3.063 ± 0.502
ThrSer: 2.797 ± 0.514
ThrThr: 2.997 ± 0.487
ThrVal: 4.462 ± 0.526
ThrTrp: 0.799 ± 0.274
ThrTyr: 1.998 ± 0.293
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.46 ± 1.005
ValCys: 0.599 ± 0.222
ValAsp: 5.661 ± 0.714
ValGlu: 4.861 ± 0.534
ValPhe: 2.264 ± 0.476
ValGly: 4.462 ± 0.607
ValHis: 1.132 ± 0.255
ValIle: 3.596 ± 0.518
ValLys: 3.463 ± 0.442
ValLeu: 5.794 ± 0.635
ValMet: 1.332 ± 0.3
ValAsn: 2.198 ± 0.428
ValPro: 3.063 ± 0.449
ValGln: 2.131 ± 0.359
ValArg: 4.728 ± 0.585
ValSer: 4.529 ± 0.566
ValThr: 4.329 ± 0.694
ValVal: 5.394 ± 0.529
ValTrp: 0.932 ± 0.29
ValTyr: 2.397 ± 0.429
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.532 ± 0.444
TrpCys: 0.4 ± 0.194
TrpAsp: 1.066 ± 0.25
TrpGlu: 1.598 ± 0.291
TrpPhe: 0.4 ± 0.187
TrpGly: 1.132 ± 0.268
TrpHis: 0.533 ± 0.189
TrpIle: 1.132 ± 0.232
TrpLys: 0.4 ± 0.167
TrpLeu: 1.265 ± 0.26
TrpMet: 0.266 ± 0.131
TrpAsn: 0.666 ± 0.227
TrpPro: 0.932 ± 0.291
TrpGln: 0.999 ± 0.256
TrpArg: 1.265 ± 0.27
TrpSer: 1.465 ± 0.326
TrpThr: 1.199 ± 0.331
TrpVal: 1.265 ± 0.33
TrpTrp: 0.466 ± 0.182
TrpTyr: 0.733 ± 0.232
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.464 ± 0.394
TyrCys: 0.2 ± 0.126
TyrAsp: 2.464 ± 0.329
TyrGlu: 2.131 ± 0.391
TyrPhe: 0.599 ± 0.196
TyrGly: 2.797 ± 0.481
TyrHis: 0.266 ± 0.155
TyrIle: 1.199 ± 0.279
TyrLys: 0.799 ± 0.251
TyrLeu: 3.463 ± 0.432
TyrMet: 0.866 ± 0.276
TyrAsn: 0.866 ± 0.214
TyrPro: 1.532 ± 0.309
TyrGln: 0.999 ± 0.302
TyrArg: 2.264 ± 0.447
TyrSer: 2.064 ± 0.379
TyrThr: 1.399 ± 0.321
TyrVal: 2.198 ± 0.442
TyrTrp: 0.599 ± 0.294
TyrTyr: 1.066 ± 0.359
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 81 proteins (15017 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
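A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. This is only an illustration under stated assumptions. The function names, the one-letter codes, the normalisation by total dipeptide count, and the use of the standard deviation as the reported error are all assumptions, not the database's documented procedure.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs within proteins."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)

toy_proteome = ["MAAGK", "AAKLV", "MKKAA", "GGAVL"]  # made-up sequences
print(bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, which is presumably why the note specifies the protein level.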

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski