Amino acid dipepetide frequency for Corynebacterium phage EmiRose

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.825AlaAla: 14.825 ± 1.616
0.505AlaCys: 0.505 ± 0.223
7.412AlaAsp: 7.412 ± 0.908
6.065AlaGlu: 6.065 ± 0.771
2.864AlaPhe: 2.864 ± 0.59
10.445AlaGly: 10.445 ± 1.141
1.685AlaHis: 1.685 ± 0.323
4.801AlaIle: 4.801 ± 0.918
5.559AlaLys: 5.559 ± 0.709
9.181AlaLeu: 9.181 ± 0.924
3.285AlaMet: 3.285 ± 0.476
3.875AlaAsn: 3.875 ± 0.724
7.075AlaPro: 7.075 ± 0.979
4.549AlaGln: 4.549 ± 0.737
8.17AlaArg: 8.17 ± 0.928
7.665AlaSer: 7.665 ± 1.0
5.728AlaThr: 5.728 ± 0.783
7.328AlaVal: 7.328 ± 1.055
2.022AlaTrp: 2.022 ± 0.347
3.538AlaTyr: 3.538 ± 0.511
0.0AlaXaa: 0.0 ± 0.0
Cys
0.59CysAla: 0.59 ± 0.247
0.0CysCys: 0.0 ± 0.0
0.337CysAsp: 0.337 ± 0.176
0.59CysGlu: 0.59 ± 0.224
0.084CysPhe: 0.084 ± 0.074
0.505CysGly: 0.505 ± 0.205
0.0CysHis: 0.0 ± 0.0
0.168CysIle: 0.168 ± 0.114
0.421CysLys: 0.421 ± 0.236
0.59CysLeu: 0.59 ± 0.222
0.0CysMet: 0.0 ± 0.0
0.421CysAsn: 0.421 ± 0.252
0.253CysPro: 0.253 ± 0.12
0.253CysGln: 0.253 ± 0.143
0.421CysArg: 0.421 ± 0.207
0.084CysSer: 0.084 ± 0.081
0.842CysThr: 0.842 ± 0.33
0.59CysVal: 0.59 ± 0.189
0.084CysTrp: 0.084 ± 0.1
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.307AspAla: 5.307 ± 0.723
0.505AspCys: 0.505 ± 0.189
5.222AspAsp: 5.222 ± 0.97
3.369AspGlu: 3.369 ± 0.591
1.853AspPhe: 1.853 ± 0.424
5.728AspGly: 5.728 ± 0.814
0.842AspHis: 0.842 ± 0.251
3.454AspIle: 3.454 ± 0.552
4.296AspLys: 4.296 ± 0.629
6.486AspLeu: 6.486 ± 0.742
2.611AspMet: 2.611 ± 0.453
2.78AspAsn: 2.78 ± 0.571
3.032AspPro: 3.032 ± 0.573
1.011AspGln: 1.011 ± 0.295
2.78AspArg: 2.78 ± 0.469
3.959AspSer: 3.959 ± 0.575
5.98AspThr: 5.98 ± 0.657
4.717AspVal: 4.717 ± 0.59
1.348AspTrp: 1.348 ± 0.46
2.022AspTyr: 2.022 ± 0.389
0.0AspXaa: 0.0 ± 0.0
Glu
6.907GluAla: 6.907 ± 0.997
0.337GluCys: 0.337 ± 0.159
2.358GluAsp: 2.358 ± 0.484
3.538GluGlu: 3.538 ± 0.67
1.095GluPhe: 1.095 ± 0.265
4.043GluGly: 4.043 ± 0.602
1.263GluHis: 1.263 ± 0.294
2.358GluIle: 2.358 ± 0.437
1.937GluLys: 1.937 ± 0.522
5.391GluLeu: 5.391 ± 0.697
0.674GluMet: 0.674 ± 0.257
1.769GluAsn: 1.769 ± 0.411
3.201GluPro: 3.201 ± 0.455
2.611GluGln: 2.611 ± 0.373
4.212GluArg: 4.212 ± 0.652
3.117GluSer: 3.117 ± 0.538
3.538GluThr: 3.538 ± 0.625
4.801GluVal: 4.801 ± 0.85
1.348GluTrp: 1.348 ± 0.378
1.516GluTyr: 1.516 ± 0.337
0.0GluXaa: 0.0 ± 0.0
Phe
2.695PheAla: 2.695 ± 0.543
0.084PheCys: 0.084 ± 0.079
1.853PheAsp: 1.853 ± 0.458
1.853PheGlu: 1.853 ± 0.469
0.505PhePhe: 0.505 ± 0.337
1.769PheGly: 1.769 ± 0.387
0.505PheHis: 0.505 ± 0.188
1.348PheIle: 1.348 ± 0.32
1.179PheLys: 1.179 ± 0.308
1.516PheLeu: 1.516 ± 0.46
0.674PheMet: 0.674 ± 0.235
0.337PheAsn: 0.337 ± 0.15
1.263PhePro: 1.263 ± 0.299
0.758PheGln: 0.758 ± 0.271
1.348PheArg: 1.348 ± 0.392
2.864PheSer: 2.864 ± 0.748
2.443PheThr: 2.443 ± 0.434
1.6PheVal: 1.6 ± 0.338
0.168PheTrp: 0.168 ± 0.118
1.011PheTyr: 1.011 ± 0.334
0.0PheXaa: 0.0 ± 0.0
Gly
9.181GlyAla: 9.181 ± 1.377
0.505GlyCys: 0.505 ± 0.229
5.896GlyAsp: 5.896 ± 0.656
4.717GlyGlu: 4.717 ± 0.609
1.937GlyPhe: 1.937 ± 0.35
5.896GlyGly: 5.896 ± 0.902
1.6GlyHis: 1.6 ± 0.387
4.127GlyIle: 4.127 ± 0.625
3.875GlyLys: 3.875 ± 0.561
6.739GlyLeu: 6.739 ± 0.802
2.106GlyMet: 2.106 ± 0.406
2.611GlyAsn: 2.611 ± 0.529
4.717GlyPro: 4.717 ± 1.263
2.864GlyGln: 2.864 ± 0.571
4.801GlyArg: 4.801 ± 0.698
3.79GlySer: 3.79 ± 0.681
6.402GlyThr: 6.402 ± 0.804
6.486GlyVal: 6.486 ± 0.821
1.263GlyTrp: 1.263 ± 0.37
2.19GlyTyr: 2.19 ± 0.339
0.0GlyXaa: 0.0 ± 0.0
His
1.937HisAla: 1.937 ± 0.386
0.084HisCys: 0.084 ± 0.083
0.842HisAsp: 0.842 ± 0.29
0.59HisGlu: 0.59 ± 0.222
0.168HisPhe: 0.168 ± 0.136
0.927HisGly: 0.927 ± 0.206
0.337HisHis: 0.337 ± 0.207
1.263HisIle: 1.263 ± 0.274
0.927HisLys: 0.927 ± 0.295
1.348HisLeu: 1.348 ± 0.309
0.0HisMet: 0.0 ± 0.0
0.758HisAsn: 0.758 ± 0.239
1.011HisPro: 1.011 ± 0.288
0.168HisGln: 0.168 ± 0.116
1.432HisArg: 1.432 ± 0.332
1.348HisSer: 1.348 ± 0.259
1.011HisThr: 1.011 ± 0.254
1.011HisVal: 1.011 ± 0.275
0.168HisTrp: 0.168 ± 0.113
0.505HisTyr: 0.505 ± 0.217
0.0HisXaa: 0.0 ± 0.0
Ile
4.97IleAla: 4.97 ± 0.608
0.253IleCys: 0.253 ± 0.146
4.296IleAsp: 4.296 ± 0.782
3.79IleGlu: 3.79 ± 0.67
1.179IlePhe: 1.179 ± 0.321
3.622IleGly: 3.622 ± 0.878
0.253IleHis: 0.253 ± 0.149
2.358IleIle: 2.358 ± 0.508
1.853IleLys: 1.853 ± 0.401
3.285IleLeu: 3.285 ± 0.708
0.927IleMet: 0.927 ± 0.24
1.432IleAsn: 1.432 ± 0.337
2.19IlePro: 2.19 ± 0.491
2.022IleGln: 2.022 ± 0.447
2.443IleArg: 2.443 ± 0.445
3.538IleSer: 3.538 ± 0.574
3.369IleThr: 3.369 ± 0.526
3.117IleVal: 3.117 ± 0.54
0.59IleTrp: 0.59 ± 0.283
0.927IleTyr: 0.927 ± 0.26
0.0IleXaa: 0.0 ± 0.0
Lys
6.486LysAla: 6.486 ± 1.06
0.253LysCys: 0.253 ± 0.138
3.622LysAsp: 3.622 ± 0.616
2.358LysGlu: 2.358 ± 0.493
1.516LysPhe: 1.516 ± 0.318
3.538LysGly: 3.538 ± 0.745
0.674LysHis: 0.674 ± 0.266
1.516LysIle: 1.516 ± 0.34
2.022LysLys: 2.022 ± 0.457
3.369LysLeu: 3.369 ± 0.475
1.263LysMet: 1.263 ± 0.436
1.6LysAsn: 1.6 ± 0.411
2.443LysPro: 2.443 ± 0.602
1.685LysGln: 1.685 ± 0.422
3.285LysArg: 3.285 ± 0.607
1.937LysSer: 1.937 ± 0.485
2.527LysThr: 2.527 ± 0.545
3.454LysVal: 3.454 ± 0.557
0.758LysTrp: 0.758 ± 0.257
1.095LysTyr: 1.095 ± 0.367
0.0LysXaa: 0.0 ± 0.0
Leu
10.276LeuAla: 10.276 ± 0.962
0.505LeuCys: 0.505 ± 0.201
7.328LeuAsp: 7.328 ± 0.621
4.801LeuGlu: 4.801 ± 0.63
1.432LeuPhe: 1.432 ± 0.37
6.654LeuGly: 6.654 ± 0.886
1.853LeuHis: 1.853 ± 0.304
3.875LeuIle: 3.875 ± 0.589
4.801LeuLys: 4.801 ± 0.748
6.739LeuLeu: 6.739 ± 0.926
0.927LeuMet: 0.927 ± 0.258
3.117LeuAsn: 3.117 ± 0.536
5.222LeuPro: 5.222 ± 0.845
2.106LeuGln: 2.106 ± 0.413
5.222LeuArg: 5.222 ± 0.754
5.98LeuSer: 5.98 ± 0.683
6.149LeuThr: 6.149 ± 0.85
6.823LeuVal: 6.823 ± 0.651
1.6LeuTrp: 1.6 ± 0.49
1.685LeuTyr: 1.685 ± 0.361
0.0LeuXaa: 0.0 ± 0.0
Met
3.706MetAla: 3.706 ± 0.758
0.084MetCys: 0.084 ± 0.092
0.842MetAsp: 0.842 ± 0.285
1.432MetGlu: 1.432 ± 0.312
0.758MetPhe: 0.758 ± 0.274
1.011MetGly: 1.011 ± 0.33
0.168MetHis: 0.168 ± 0.122
0.337MetIle: 0.337 ± 0.169
0.59MetLys: 0.59 ± 0.187
1.769MetLeu: 1.769 ± 0.377
0.084MetMet: 0.084 ± 0.11
0.421MetAsn: 0.421 ± 0.2
1.769MetPro: 1.769 ± 0.424
0.59MetGln: 0.59 ± 0.187
0.842MetArg: 0.842 ± 0.442
2.443MetSer: 2.443 ± 0.398
2.022MetThr: 2.022 ± 0.406
1.263MetVal: 1.263 ± 0.338
0.59MetTrp: 0.59 ± 0.384
0.927MetTyr: 0.927 ± 0.27
0.0MetXaa: 0.0 ± 0.0
Asn
4.885AsnAla: 4.885 ± 0.873
0.084AsnCys: 0.084 ± 0.097
2.274AsnAsp: 2.274 ± 0.48
1.011AsnGlu: 1.011 ± 0.278
0.674AsnPhe: 0.674 ± 0.222
2.864AsnGly: 2.864 ± 0.415
0.59AsnHis: 0.59 ± 0.231
1.685AsnIle: 1.685 ± 0.403
1.516AsnLys: 1.516 ± 0.426
2.695AsnLeu: 2.695 ± 0.429
0.505AsnMet: 0.505 ± 0.268
0.842AsnAsn: 0.842 ± 0.309
2.358AsnPro: 2.358 ± 0.459
1.263AsnGln: 1.263 ± 0.419
1.516AsnArg: 1.516 ± 0.338
2.274AsnSer: 2.274 ± 0.433
1.769AsnThr: 1.769 ± 0.419
1.432AsnVal: 1.432 ± 0.277
0.758AsnTrp: 0.758 ± 0.284
0.842AsnTyr: 0.842 ± 0.281
0.0AsnXaa: 0.0 ± 0.0
Pro
6.57ProAla: 6.57 ± 1.049
0.421ProCys: 0.421 ± 0.19
4.549ProAsp: 4.549 ± 0.601
3.622ProGlu: 3.622 ± 0.669
1.685ProPhe: 1.685 ± 0.476
6.57ProGly: 6.57 ± 0.805
0.758ProHis: 0.758 ± 0.259
2.19ProIle: 2.19 ± 0.462
2.948ProLys: 2.948 ± 0.495
3.959ProLeu: 3.959 ± 0.595
0.927ProMet: 0.927 ± 0.285
0.758ProAsn: 0.758 ± 0.258
2.106ProPro: 2.106 ± 0.47
1.937ProGln: 1.937 ± 0.54
3.032ProArg: 3.032 ± 0.45
3.285ProSer: 3.285 ± 0.479
3.032ProThr: 3.032 ± 0.468
4.801ProVal: 4.801 ± 0.581
1.011ProTrp: 1.011 ± 0.294
1.348ProTyr: 1.348 ± 0.372
0.0ProXaa: 0.0 ± 0.0
Gln
2.611GlnAla: 2.611 ± 0.563
0.084GlnCys: 0.084 ± 0.092
1.516GlnAsp: 1.516 ± 0.369
1.685GlnGlu: 1.685 ± 0.389
1.263GlnPhe: 1.263 ± 0.328
4.38GlnGly: 4.38 ± 0.997
1.095GlnHis: 1.095 ± 0.39
1.011GlnIle: 1.011 ± 0.336
1.095GlnLys: 1.095 ± 0.351
3.538GlnLeu: 3.538 ± 0.605
0.927GlnMet: 0.927 ± 0.345
1.095GlnAsn: 1.095 ± 0.376
1.011GlnPro: 1.011 ± 0.373
1.6GlnGln: 1.6 ± 0.364
2.611GlnArg: 2.611 ± 0.554
1.516GlnSer: 1.516 ± 0.336
1.6GlnThr: 1.6 ± 0.399
3.117GlnVal: 3.117 ± 0.554
0.927GlnTrp: 0.927 ± 0.289
0.421GlnTyr: 0.421 ± 0.187
0.0GlnXaa: 0.0 ± 0.0
Arg
8.507ArgAla: 8.507 ± 1.062
0.674ArgCys: 0.674 ± 0.261
2.78ArgAsp: 2.78 ± 0.5
3.538ArgGlu: 3.538 ± 0.806
2.022ArgPhe: 2.022 ± 0.418
5.054ArgGly: 5.054 ± 0.624
0.59ArgHis: 0.59 ± 0.173
2.78ArgIle: 2.78 ± 0.47
1.853ArgLys: 1.853 ± 0.523
6.065ArgLeu: 6.065 ± 0.966
1.263ArgMet: 1.263 ± 0.368
1.432ArgAsn: 1.432 ± 0.415
2.695ArgPro: 2.695 ± 0.447
2.022ArgGln: 2.022 ± 0.505
4.549ArgArg: 4.549 ± 0.871
3.285ArgSer: 3.285 ± 0.54
2.948ArgThr: 2.948 ± 0.493
4.801ArgVal: 4.801 ± 0.756
0.674ArgTrp: 0.674 ± 0.249
1.853ArgTyr: 1.853 ± 0.428
0.0ArgXaa: 0.0 ± 0.0
Ser
5.896SerAla: 5.896 ± 0.989
0.421SerCys: 0.421 ± 0.25
3.622SerAsp: 3.622 ± 0.548
3.706SerGlu: 3.706 ± 0.762
2.022SerPhe: 2.022 ± 0.383
4.38SerGly: 4.38 ± 0.715
1.179SerHis: 1.179 ± 0.316
3.875SerIle: 3.875 ± 0.571
2.864SerLys: 2.864 ± 0.591
7.834SerLeu: 7.834 ± 0.709
1.432SerMet: 1.432 ± 0.296
3.538SerAsn: 3.538 ± 0.747
3.285SerPro: 3.285 ± 0.501
2.022SerGln: 2.022 ± 0.536
2.864SerArg: 2.864 ± 0.514
4.127SerSer: 4.127 ± 0.665
4.296SerThr: 4.296 ± 0.755
4.464SerVal: 4.464 ± 0.72
0.59SerTrp: 0.59 ± 0.268
1.348SerTyr: 1.348 ± 0.363
0.0SerXaa: 0.0 ± 0.0
Thr
8.423ThrAla: 8.423 ± 1.042
0.421ThrCys: 0.421 ± 0.242
4.885ThrAsp: 4.885 ± 0.639
2.695ThrGlu: 2.695 ± 0.708
1.685ThrPhe: 1.685 ± 0.409
6.654ThrGly: 6.654 ± 0.995
0.842ThrHis: 0.842 ± 0.277
2.78ThrIle: 2.78 ± 0.412
2.527ThrLys: 2.527 ± 0.513
5.812ThrLeu: 5.812 ± 0.868
1.095ThrMet: 1.095 ± 0.336
1.937ThrAsn: 1.937 ± 0.539
5.559ThrPro: 5.559 ± 0.815
1.937ThrGln: 1.937 ± 0.526
4.549ThrArg: 4.549 ± 0.656
4.212ThrSer: 4.212 ± 0.62
4.464ThrThr: 4.464 ± 0.571
4.127ThrVal: 4.127 ± 0.72
1.685ThrTrp: 1.685 ± 0.367
2.022ThrTyr: 2.022 ± 0.483
0.0ThrXaa: 0.0 ± 0.0
Val
8.255ValAla: 8.255 ± 0.863
0.674ValCys: 0.674 ± 0.276
5.559ValAsp: 5.559 ± 0.631
4.043ValGlu: 4.043 ± 0.432
1.516ValPhe: 1.516 ± 0.41
4.885ValGly: 4.885 ± 0.932
1.263ValHis: 1.263 ± 0.297
3.79ValIle: 3.79 ± 0.633
3.454ValLys: 3.454 ± 0.667
6.149ValLeu: 6.149 ± 0.723
2.19ValMet: 2.19 ± 0.491
1.769ValAsn: 1.769 ± 0.438
3.369ValPro: 3.369 ± 0.545
2.274ValGln: 2.274 ± 0.602
2.443ValArg: 2.443 ± 0.517
4.885ValSer: 4.885 ± 0.622
6.402ValThr: 6.402 ± 0.87
7.581ValVal: 7.581 ± 1.075
1.179ValTrp: 1.179 ± 0.325
2.358ValTyr: 2.358 ± 0.436
0.0ValXaa: 0.0 ± 0.0
Trp
1.432TrpAla: 1.432 ± 0.433
0.084TrpCys: 0.084 ± 0.092
0.421TrpAsp: 0.421 ± 0.16
1.685TrpGlu: 1.685 ± 0.342
0.842TrpPhe: 0.842 ± 0.271
0.674TrpGly: 0.674 ± 0.21
0.253TrpHis: 0.253 ± 0.192
1.011TrpIle: 1.011 ± 0.285
0.674TrpLys: 0.674 ± 0.27
2.022TrpLeu: 2.022 ± 0.411
0.168TrpMet: 0.168 ± 0.12
0.842TrpAsn: 0.842 ± 0.303
0.927TrpPro: 0.927 ± 0.259
0.758TrpGln: 0.758 ± 0.29
0.927TrpArg: 0.927 ± 0.271
1.348TrpSer: 1.348 ± 0.306
1.432TrpThr: 1.432 ± 0.425
1.095TrpVal: 1.095 ± 0.306
0.505TrpTrp: 0.505 ± 0.263
0.421TrpTyr: 0.421 ± 0.176
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.117TyrAla: 3.117 ± 0.499
0.253TyrCys: 0.253 ± 0.157
1.685TyrAsp: 1.685 ± 0.487
0.927TyrGlu: 0.927 ± 0.243
0.674TyrPhe: 0.674 ± 0.198
2.106TyrGly: 2.106 ± 0.438
0.084TyrHis: 0.084 ± 0.096
1.685TyrIle: 1.685 ± 0.375
1.011TyrLys: 1.011 ± 0.326
2.611TyrLeu: 2.611 ± 0.518
0.59TyrMet: 0.59 ± 0.284
0.505TyrAsn: 0.505 ± 0.257
2.022TyrPro: 2.022 ± 0.372
0.674TyrGln: 0.674 ± 0.22
1.937TyrArg: 1.937 ± 0.394
2.106TyrSer: 2.106 ± 0.491
2.358TyrThr: 2.358 ± 0.466
1.432TyrVal: 1.432 ± 0.416
0.253TyrTrp: 0.253 ± 0.15
0.505TyrTyr: 0.505 ± 0.229
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (11873 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski