Amino acid dipepetide frequency for Corynebacterium phage Stickynote

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.301AlaAla: 10.301 ± 1.332
0.495AlaCys: 0.495 ± 0.149
5.844AlaAsp: 5.844 ± 0.774
7.478AlaGlu: 7.478 ± 0.656
2.674AlaPhe: 2.674 ± 0.405
6.587AlaGly: 6.587 ± 0.774
1.535AlaHis: 1.535 ± 0.386
5.002AlaIle: 5.002 ± 0.699
5.695AlaLys: 5.695 ± 0.618
7.775AlaLeu: 7.775 ± 1.045
2.229AlaMet: 2.229 ± 0.305
3.764AlaAsn: 3.764 ± 0.641
3.318AlaPro: 3.318 ± 0.378
3.219AlaGln: 3.219 ± 0.491
4.309AlaArg: 4.309 ± 0.561
6.141AlaSer: 6.141 ± 1.019
3.417AlaThr: 3.417 ± 0.559
5.299AlaVal: 5.299 ± 0.732
1.535AlaTrp: 1.535 ± 0.307
3.318AlaTyr: 3.318 ± 0.453
0.0AlaXaa: 0.0 ± 0.0
Cys
0.396CysAla: 0.396 ± 0.167
0.099CysCys: 0.099 ± 0.075
0.446CysAsp: 0.446 ± 0.168
0.495CysGlu: 0.495 ± 0.184
0.149CysPhe: 0.149 ± 0.084
0.545CysGly: 0.545 ± 0.153
0.149CysHis: 0.149 ± 0.079
0.198CysIle: 0.198 ± 0.089
0.198CysLys: 0.198 ± 0.102
0.594CysLeu: 0.594 ± 0.189
0.198CysMet: 0.198 ± 0.095
0.347CysAsn: 0.347 ± 0.1
0.297CysPro: 0.297 ± 0.147
0.297CysGln: 0.297 ± 0.135
0.297CysArg: 0.297 ± 0.126
0.743CysSer: 0.743 ± 0.245
0.099CysThr: 0.099 ± 0.066
0.545CysVal: 0.545 ± 0.181
0.198CysTrp: 0.198 ± 0.099
0.396CysTyr: 0.396 ± 0.152
0.0CysXaa: 0.0 ± 0.0
Asp
5.101AspAla: 5.101 ± 0.569
0.347AspCys: 0.347 ± 0.152
4.556AspAsp: 4.556 ± 0.539
5.448AspGlu: 5.448 ± 0.498
3.17AspPhe: 3.17 ± 0.401
5.596AspGly: 5.596 ± 0.503
1.486AspHis: 1.486 ± 0.201
3.368AspIle: 3.368 ± 0.46
2.922AspLys: 2.922 ± 0.316
6.389AspLeu: 6.389 ± 0.581
2.427AspMet: 2.427 ± 0.341
2.427AspAsn: 2.427 ± 0.301
4.21AspPro: 4.21 ± 0.458
2.179AspGln: 2.179 ± 0.289
3.912AspArg: 3.912 ± 0.41
3.912AspSer: 3.912 ± 0.345
3.615AspThr: 3.615 ± 0.487
3.516AspVal: 3.516 ± 0.532
1.04AspTrp: 1.04 ± 0.197
2.229AspTyr: 2.229 ± 0.317
0.0AspXaa: 0.0 ± 0.0
Glu
7.973GluAla: 7.973 ± 0.895
0.545GluCys: 0.545 ± 0.208
5.695GluAsp: 5.695 ± 0.671
6.438GluGlu: 6.438 ± 0.674
3.021GluPhe: 3.021 ± 0.425
5.943GluGly: 5.943 ± 0.609
0.941GluHis: 0.941 ± 0.238
5.794GluIle: 5.794 ± 0.65
3.615GluLys: 3.615 ± 0.463
6.092GluLeu: 6.092 ± 0.59
1.832GluMet: 1.832 ± 0.305
2.575GluAsn: 2.575 ± 0.448
2.674GluPro: 2.674 ± 0.384
2.328GluGln: 2.328 ± 0.359
4.705GluArg: 4.705 ± 0.573
3.912GluSer: 3.912 ± 0.443
4.556GluThr: 4.556 ± 0.472
5.992GluVal: 5.992 ± 0.715
1.337GluTrp: 1.337 ± 0.288
2.575GluTyr: 2.575 ± 0.37
0.0GluXaa: 0.0 ± 0.0
Phe
3.17PheAla: 3.17 ± 0.339
0.198PheCys: 0.198 ± 0.112
3.318PheAsp: 3.318 ± 0.453
3.269PheGlu: 3.269 ± 0.444
1.486PhePhe: 1.486 ± 0.222
3.021PheGly: 3.021 ± 0.454
0.693PheHis: 0.693 ± 0.182
1.634PheIle: 1.634 ± 0.259
1.684PheLys: 1.684 ± 0.307
2.625PheLeu: 2.625 ± 0.354
1.189PheMet: 1.189 ± 0.235
1.585PheAsn: 1.585 ± 0.275
1.238PhePro: 1.238 ± 0.277
1.387PheGln: 1.387 ± 0.203
2.476PheArg: 2.476 ± 0.322
3.269PheSer: 3.269 ± 0.494
2.328PheThr: 2.328 ± 0.34
1.832PheVal: 1.832 ± 0.315
0.297PheTrp: 0.297 ± 0.107
1.189PheTyr: 1.189 ± 0.229
0.0PheXaa: 0.0 ± 0.0
Gly
5.299GlyAla: 5.299 ± 0.947
0.396GlyCys: 0.396 ± 0.15
5.349GlyAsp: 5.349 ± 0.42
4.556GlyGlu: 4.556 ± 0.384
3.071GlyPhe: 3.071 ± 0.545
5.745GlyGly: 5.745 ± 1.042
1.238GlyHis: 1.238 ± 0.292
4.853GlyIle: 4.853 ± 0.542
3.912GlyLys: 3.912 ± 0.487
6.537GlyLeu: 6.537 ± 1.139
2.08GlyMet: 2.08 ± 0.315
3.368GlyAsn: 3.368 ± 0.42
2.724GlyPro: 2.724 ± 0.312
1.684GlyGln: 1.684 ± 0.252
4.655GlyArg: 4.655 ± 0.555
4.408GlySer: 4.408 ± 0.556
4.21GlyThr: 4.21 ± 0.497
4.804GlyVal: 4.804 ± 0.573
1.486GlyTrp: 1.486 ± 0.265
3.17GlyTyr: 3.17 ± 0.507
0.0GlyXaa: 0.0 ± 0.0
His
1.436HisAla: 1.436 ± 0.209
0.198HisCys: 0.198 ± 0.109
0.99HisAsp: 0.99 ± 0.252
0.99HisGlu: 0.99 ± 0.253
0.545HisPhe: 0.545 ± 0.162
1.189HisGly: 1.189 ± 0.282
0.644HisHis: 0.644 ± 0.2
1.189HisIle: 1.189 ± 0.248
0.99HisLys: 0.99 ± 0.199
1.09HisLeu: 1.09 ± 0.264
0.248HisMet: 0.248 ± 0.097
0.693HisAsn: 0.693 ± 0.171
1.436HisPro: 1.436 ± 0.269
0.495HisGln: 0.495 ± 0.165
1.189HisArg: 1.189 ± 0.3
0.941HisSer: 0.941 ± 0.291
1.238HisThr: 1.238 ± 0.377
1.238HisVal: 1.238 ± 0.273
0.149HisTrp: 0.149 ± 0.077
0.941HisTyr: 0.941 ± 0.196
0.0HisXaa: 0.0 ± 0.0
Ile
5.2IleAla: 5.2 ± 0.447
0.347IleCys: 0.347 ± 0.145
4.705IleAsp: 4.705 ± 0.495
4.457IleGlu: 4.457 ± 0.611
2.229IlePhe: 2.229 ± 0.354
3.665IleGly: 3.665 ± 0.659
1.288IleHis: 1.288 ± 0.282
2.823IleIle: 2.823 ± 0.376
3.071IleLys: 3.071 ± 0.409
4.061IleLeu: 4.061 ± 0.436
1.189IleMet: 1.189 ± 0.265
2.575IleAsn: 2.575 ± 0.273
2.625IlePro: 2.625 ± 0.286
1.684IleGln: 1.684 ± 0.349
2.575IleArg: 2.575 ± 0.378
4.408IleSer: 4.408 ± 0.408
2.971IleThr: 2.971 ± 0.439
4.259IleVal: 4.259 ± 0.416
0.941IleTrp: 0.941 ± 0.249
1.783IleTyr: 1.783 ± 0.385
0.0IleXaa: 0.0 ± 0.0
Lys
5.992LysAla: 5.992 ± 0.496
0.248LysCys: 0.248 ± 0.141
3.021LysAsp: 3.021 ± 0.368
4.507LysGlu: 4.507 ± 0.529
2.377LysPhe: 2.377 ± 0.373
3.269LysGly: 3.269 ± 0.317
1.189LysHis: 1.189 ± 0.29
3.368LysIle: 3.368 ± 0.35
3.021LysLys: 3.021 ± 0.495
2.872LysLeu: 2.872 ± 0.384
1.337LysMet: 1.337 ± 0.227
2.278LysAsn: 2.278 ± 0.304
1.436LysPro: 1.436 ± 0.228
1.733LysGln: 1.733 ± 0.405
3.071LysArg: 3.071 ± 0.383
2.872LysSer: 2.872 ± 0.381
3.714LysThr: 3.714 ± 0.395
4.309LysVal: 4.309 ± 0.449
1.189LysTrp: 1.189 ± 0.293
1.585LysTyr: 1.585 ± 0.386
0.0LysXaa: 0.0 ± 0.0
Leu
7.181LeuAla: 7.181 ± 0.753
0.693LeuCys: 0.693 ± 0.18
5.101LeuAsp: 5.101 ± 0.54
6.389LeuGlu: 6.389 ± 0.66
3.071LeuPhe: 3.071 ± 0.437
5.497LeuGly: 5.497 ± 0.724
0.941LeuHis: 0.941 ± 0.235
3.516LeuIle: 3.516 ± 0.434
4.309LeuLys: 4.309 ± 0.486
5.2LeuLeu: 5.2 ± 0.565
1.981LeuMet: 1.981 ± 0.428
3.665LeuAsn: 3.665 ± 0.335
3.962LeuPro: 3.962 ± 0.436
2.377LeuGln: 2.377 ± 0.344
5.992LeuArg: 5.992 ± 0.668
5.101LeuSer: 5.101 ± 0.504
5.893LeuThr: 5.893 ± 0.489
4.804LeuVal: 4.804 ± 0.476
1.04LeuTrp: 1.04 ± 0.224
2.328LeuTyr: 2.328 ± 0.352
0.0LeuXaa: 0.0 ± 0.0
Met
2.575MetAla: 2.575 ± 0.377
0.0MetCys: 0.0 ± 0.0
1.387MetAsp: 1.387 ± 0.289
2.179MetGlu: 2.179 ± 0.342
0.743MetPhe: 0.743 ± 0.228
1.634MetGly: 1.634 ± 0.431
0.198MetHis: 0.198 ± 0.085
1.684MetIle: 1.684 ± 0.303
1.09MetLys: 1.09 ± 0.248
2.031MetLeu: 2.031 ± 0.348
0.594MetMet: 0.594 ± 0.234
1.238MetAsn: 1.238 ± 0.203
1.337MetPro: 1.337 ± 0.323
1.139MetGln: 1.139 ± 0.258
1.337MetArg: 1.337 ± 0.303
2.031MetSer: 2.031 ± 0.331
1.882MetThr: 1.882 ± 0.247
1.387MetVal: 1.387 ± 0.27
0.495MetTrp: 0.495 ± 0.173
0.743MetTyr: 0.743 ± 0.209
0.0MetXaa: 0.0 ± 0.0
Asn
4.061AsnAla: 4.061 ± 0.476
0.347AsnCys: 0.347 ± 0.132
2.625AsnAsp: 2.625 ± 0.305
2.526AsnGlu: 2.526 ± 0.386
1.486AsnPhe: 1.486 ± 0.239
4.259AsnGly: 4.259 ± 0.354
0.743AsnHis: 0.743 ± 0.228
2.179AsnIle: 2.179 ± 0.387
2.229AsnLys: 2.229 ± 0.296
4.507AsnLeu: 4.507 ± 0.552
0.693AsnMet: 0.693 ± 0.218
2.922AsnAsn: 2.922 ± 0.393
2.773AsnPro: 2.773 ± 0.379
1.238AsnGln: 1.238 ± 0.29
2.229AsnArg: 2.229 ± 0.358
1.634AsnSer: 1.634 ± 0.337
2.229AsnThr: 2.229 ± 0.294
2.625AsnVal: 2.625 ± 0.307
0.743AsnTrp: 0.743 ± 0.185
1.436AsnTyr: 1.436 ± 0.314
0.0AsnXaa: 0.0 ± 0.0
Pro
3.318ProAla: 3.318 ± 0.384
0.347ProCys: 0.347 ± 0.158
3.12ProAsp: 3.12 ± 0.428
3.764ProGlu: 3.764 ± 0.483
1.832ProPhe: 1.832 ± 0.269
3.219ProGly: 3.219 ± 0.453
0.891ProHis: 0.891 ± 0.187
2.08ProIle: 2.08 ± 0.464
2.427ProLys: 2.427 ± 0.296
2.773ProLeu: 2.773 ± 0.45
1.436ProMet: 1.436 ± 0.271
1.882ProAsn: 1.882 ± 0.34
2.13ProPro: 2.13 ± 0.366
1.04ProGln: 1.04 ± 0.22
2.526ProArg: 2.526 ± 0.382
3.12ProSer: 3.12 ± 0.414
3.17ProThr: 3.17 ± 0.473
2.724ProVal: 2.724 ± 0.339
0.347ProTrp: 0.347 ± 0.139
1.733ProTyr: 1.733 ± 0.351
0.0ProXaa: 0.0 ± 0.0
Gln
3.417GlnAla: 3.417 ± 0.564
0.099GlnCys: 0.099 ± 0.078
1.486GlnAsp: 1.486 ± 0.274
2.229GlnGlu: 2.229 ± 0.373
0.891GlnPhe: 0.891 ± 0.209
1.585GlnGly: 1.585 ± 0.258
0.594GlnHis: 0.594 ± 0.179
2.031GlnIle: 2.031 ± 0.284
1.882GlnLys: 1.882 ± 0.356
2.229GlnLeu: 2.229 ± 0.361
0.941GlnMet: 0.941 ± 0.209
1.238GlnAsn: 1.238 ± 0.2
1.288GlnPro: 1.288 ± 0.279
0.891GlnGln: 0.891 ± 0.252
1.981GlnArg: 1.981 ± 0.367
1.981GlnSer: 1.981 ± 0.353
1.436GlnThr: 1.436 ± 0.242
1.931GlnVal: 1.931 ± 0.299
0.644GlnTrp: 0.644 ± 0.165
0.941GlnTyr: 0.941 ± 0.2
0.0GlnXaa: 0.0 ± 0.0
Arg
5.002ArgAla: 5.002 ± 0.571
0.446ArgCys: 0.446 ± 0.161
4.556ArgAsp: 4.556 ± 0.478
4.853ArgGlu: 4.853 ± 0.522
2.328ArgPhe: 2.328 ± 0.328
3.566ArgGly: 3.566 ± 0.465
1.238ArgHis: 1.238 ± 0.265
2.724ArgIle: 2.724 ± 0.33
4.507ArgLys: 4.507 ± 0.498
4.804ArgLeu: 4.804 ± 0.63
1.931ArgMet: 1.931 ± 0.356
2.674ArgAsn: 2.674 ± 0.38
1.832ArgPro: 1.832 ± 0.275
1.882ArgGln: 1.882 ± 0.342
4.21ArgArg: 4.21 ± 0.627
3.615ArgSer: 3.615 ± 0.413
2.377ArgThr: 2.377 ± 0.338
4.061ArgVal: 4.061 ± 0.465
1.189ArgTrp: 1.189 ± 0.346
2.031ArgTyr: 2.031 ± 0.264
0.0ArgXaa: 0.0 ± 0.0
Ser
5.299SerAla: 5.299 ± 0.747
0.495SerCys: 0.495 ± 0.27
4.457SerAsp: 4.457 ± 0.536
5.299SerGlu: 5.299 ± 0.437
2.179SerPhe: 2.179 ± 0.324
4.903SerGly: 4.903 ± 0.583
1.04SerHis: 1.04 ± 0.286
3.962SerIle: 3.962 ± 0.472
3.368SerLys: 3.368 ± 0.343
5.299SerLeu: 5.299 ± 0.577
1.288SerMet: 1.288 ± 0.285
2.674SerAsn: 2.674 ± 0.356
2.625SerPro: 2.625 ± 0.362
1.832SerGln: 1.832 ± 0.359
3.962SerArg: 3.962 ± 0.367
4.111SerSer: 4.111 ± 0.46
3.368SerThr: 3.368 ± 0.702
4.408SerVal: 4.408 ± 0.473
0.941SerTrp: 0.941 ± 0.182
2.278SerTyr: 2.278 ± 0.418
0.0SerXaa: 0.0 ± 0.0
Thr
4.952ThrAla: 4.952 ± 0.601
0.446ThrCys: 0.446 ± 0.15
2.674ThrAsp: 2.674 ± 0.428
3.269ThrGlu: 3.269 ± 0.391
2.773ThrPhe: 2.773 ± 0.483
4.457ThrGly: 4.457 ± 0.531
1.04ThrHis: 1.04 ± 0.325
3.516ThrIle: 3.516 ± 0.425
2.724ThrLys: 2.724 ± 0.513
4.21ThrLeu: 4.21 ± 0.48
1.486ThrMet: 1.486 ± 0.249
2.526ThrAsn: 2.526 ± 0.3
3.318ThrPro: 3.318 ± 0.341
1.238ThrGln: 1.238 ± 0.25
3.516ThrArg: 3.516 ± 0.417
4.111ThrSer: 4.111 ± 0.4
2.724ThrThr: 2.724 ± 0.631
3.368ThrVal: 3.368 ± 0.405
0.792ThrTrp: 0.792 ± 0.199
1.783ThrTyr: 1.783 ± 0.309
0.0ThrXaa: 0.0 ± 0.0
Val
5.349ValAla: 5.349 ± 0.468
0.396ValCys: 0.396 ± 0.146
4.655ValAsp: 4.655 ± 0.436
5.893ValGlu: 5.893 ± 0.655
2.526ValPhe: 2.526 ± 0.277
4.408ValGly: 4.408 ± 0.498
1.139ValHis: 1.139 ± 0.208
4.358ValIle: 4.358 ± 0.505
3.467ValLys: 3.467 ± 0.457
5.695ValLeu: 5.695 ± 0.676
1.733ValMet: 1.733 ± 0.235
2.971ValAsn: 2.971 ± 0.395
2.328ValPro: 2.328 ± 0.351
1.337ValGln: 1.337 ± 0.251
3.318ValArg: 3.318 ± 0.447
4.556ValSer: 4.556 ± 0.477
3.764ValThr: 3.764 ± 0.494
4.655ValVal: 4.655 ± 0.58
1.139ValTrp: 1.139 ± 0.267
2.08ValTyr: 2.08 ± 0.365
0.0ValXaa: 0.0 ± 0.0
Trp
1.337TrpAla: 1.337 ± 0.292
0.248TrpCys: 0.248 ± 0.109
1.387TrpAsp: 1.387 ± 0.247
1.733TrpGlu: 1.733 ± 0.323
0.396TrpPhe: 0.396 ± 0.146
1.337TrpGly: 1.337 ± 0.317
0.198TrpHis: 0.198 ± 0.131
0.842TrpIle: 0.842 ± 0.229
0.594TrpLys: 0.594 ± 0.18
1.337TrpLeu: 1.337 ± 0.306
0.446TrpMet: 0.446 ± 0.141
0.842TrpAsn: 0.842 ± 0.238
0.842TrpPro: 0.842 ± 0.184
0.693TrpGln: 0.693 ± 0.21
0.743TrpArg: 0.743 ± 0.172
0.693TrpSer: 0.693 ± 0.156
0.594TrpThr: 0.594 ± 0.16
1.535TrpVal: 1.535 ± 0.278
0.297TrpTrp: 0.297 ± 0.122
0.545TrpTyr: 0.545 ± 0.162
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.575TyrAla: 2.575 ± 0.316
0.297TyrCys: 0.297 ± 0.121
2.476TyrAsp: 2.476 ± 0.35
2.773TyrGlu: 2.773 ± 0.382
0.941TyrPhe: 0.941 ± 0.167
3.12TyrGly: 3.12 ± 0.407
0.644TyrHis: 0.644 ± 0.192
1.733TyrIle: 1.733 ± 0.351
1.733TyrLys: 1.733 ± 0.278
2.922TyrLeu: 2.922 ± 0.372
0.396TyrMet: 0.396 ± 0.155
1.238TyrAsn: 1.238 ± 0.234
1.486TyrPro: 1.486 ± 0.382
1.04TyrGln: 1.04 ± 0.227
2.823TyrArg: 2.823 ± 0.375
2.229TyrSer: 2.229 ± 0.357
1.337TyrThr: 1.337 ± 0.316
2.476TyrVal: 2.476 ± 0.482
0.842TyrTrp: 0.842 ± 0.17
1.634TyrTyr: 1.634 ± 0.299
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 90 proteins (20193 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski