Amino acid dipepetide frequency for Microbacterium phage Elva

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.929AlaAla: 13.929 ± 1.062
0.227AlaCys: 0.227 ± 0.14
6.207AlaAsp: 6.207 ± 0.782
7.646AlaGlu: 7.646 ± 0.948
3.709AlaPhe: 3.709 ± 0.533
12.112AlaGly: 12.112 ± 0.953
1.665AlaHis: 1.665 ± 0.361
4.693AlaIle: 4.693 ± 0.694
5.526AlaLys: 5.526 ± 0.907
9.765AlaLeu: 9.765 ± 0.891
2.725AlaMet: 2.725 ± 0.548
3.331AlaAsn: 3.331 ± 0.509
5.375AlaPro: 5.375 ± 0.692
4.921AlaGln: 4.921 ± 0.621
6.964AlaArg: 6.964 ± 0.731
5.829AlaSer: 5.829 ± 0.741
6.056AlaThr: 6.056 ± 0.811
9.917AlaVal: 9.917 ± 0.928
2.801AlaTrp: 2.801 ± 0.46
1.893AlaTyr: 1.893 ± 0.459
0.0AlaXaa: 0.0 ± 0.0
Cys
0.227CysAla: 0.227 ± 0.141
0.0CysCys: 0.0 ± 0.0
0.53CysAsp: 0.53 ± 0.182
0.076CysGlu: 0.076 ± 0.075
0.151CysPhe: 0.151 ± 0.103
0.833CysGly: 0.833 ± 0.302
0.151CysHis: 0.151 ± 0.129
0.0CysIle: 0.0 ± 0.0
0.151CysLys: 0.151 ± 0.101
0.076CysLeu: 0.076 ± 0.087
0.076CysMet: 0.076 ± 0.081
0.227CysAsn: 0.227 ± 0.141
0.908CysPro: 0.908 ± 0.311
0.076CysGln: 0.076 ± 0.079
0.303CysArg: 0.303 ± 0.149
0.379CysSer: 0.379 ± 0.185
0.227CysThr: 0.227 ± 0.128
0.227CysVal: 0.227 ± 0.141
0.0CysTrp: 0.0 ± 0.0
0.151CysTyr: 0.151 ± 0.116
0.0CysXaa: 0.0 ± 0.0
Asp
5.602AspAla: 5.602 ± 0.819
0.076AspCys: 0.076 ± 0.074
3.407AspAsp: 3.407 ± 0.678
3.861AspGlu: 3.861 ± 0.622
2.195AspPhe: 2.195 ± 0.427
6.435AspGly: 6.435 ± 0.724
1.893AspHis: 1.893 ± 0.427
1.968AspIle: 1.968 ± 0.364
1.817AspLys: 1.817 ± 0.414
6.889AspLeu: 6.889 ± 0.785
1.665AspMet: 1.665 ± 0.373
1.741AspAsn: 1.741 ± 0.319
4.391AspPro: 4.391 ± 0.594
1.665AspGln: 1.665 ± 0.319
3.634AspArg: 3.634 ± 0.594
3.861AspSer: 3.861 ± 0.669
3.104AspThr: 3.104 ± 0.474
5.072AspVal: 5.072 ± 0.553
1.287AspTrp: 1.287 ± 0.302
2.347AspTyr: 2.347 ± 0.441
0.0AspXaa: 0.0 ± 0.0
Glu
7.949GluAla: 7.949 ± 0.798
0.379GluCys: 0.379 ± 0.238
4.391GluAsp: 4.391 ± 0.528
5.148GluGlu: 5.148 ± 0.727
2.347GluPhe: 2.347 ± 0.333
3.936GluGly: 3.936 ± 0.753
1.514GluHis: 1.514 ± 0.392
1.438GluIle: 1.438 ± 0.348
2.12GluLys: 2.12 ± 0.385
7.116GluLeu: 7.116 ± 0.704
1.817GluMet: 1.817 ± 0.359
1.665GluAsn: 1.665 ± 0.54
3.255GluPro: 3.255 ± 0.541
2.195GluGln: 2.195 ± 0.368
2.877GluArg: 2.877 ± 0.505
2.422GluSer: 2.422 ± 0.506
3.407GluThr: 3.407 ± 0.518
6.132GluVal: 6.132 ± 0.801
1.136GluTrp: 1.136 ± 0.364
1.665GluTyr: 1.665 ± 0.317
0.0GluXaa: 0.0 ± 0.0
Phe
3.179PheAla: 3.179 ± 0.475
0.076PheCys: 0.076 ± 0.079
2.044PheAsp: 2.044 ± 0.339
2.422PheGlu: 2.422 ± 0.449
0.757PhePhe: 0.757 ± 0.309
2.65PheGly: 2.65 ± 0.432
0.303PheHis: 0.303 ± 0.133
1.438PheIle: 1.438 ± 0.279
1.287PheLys: 1.287 ± 0.276
2.574PheLeu: 2.574 ± 0.552
0.908PheMet: 0.908 ± 0.259
1.287PheAsn: 1.287 ± 0.326
1.06PhePro: 1.06 ± 0.241
1.59PheGln: 1.59 ± 0.336
2.65PheArg: 2.65 ± 0.52
1.363PheSer: 1.363 ± 0.324
2.877PheThr: 2.877 ± 0.49
1.665PheVal: 1.665 ± 0.304
0.454PheTrp: 0.454 ± 0.246
0.53PheTyr: 0.53 ± 0.165
0.0PheXaa: 0.0 ± 0.0
Gly
8.933GlyAla: 8.933 ± 0.951
0.984GlyCys: 0.984 ± 0.35
6.586GlyAsp: 6.586 ± 0.62
4.618GlyGlu: 4.618 ± 0.548
2.347GlyPhe: 2.347 ± 0.428
6.283GlyGly: 6.283 ± 1.012
1.363GlyHis: 1.363 ± 0.362
3.104GlyIle: 3.104 ± 0.418
4.012GlyLys: 4.012 ± 0.54
6.359GlyLeu: 6.359 ± 0.651
2.271GlyMet: 2.271 ± 0.432
2.574GlyAsn: 2.574 ± 0.499
4.088GlyPro: 4.088 ± 0.747
4.088GlyGln: 4.088 ± 0.823
5.678GlyArg: 5.678 ± 0.772
5.072GlySer: 5.072 ± 0.717
6.056GlyThr: 6.056 ± 1.044
8.176GlyVal: 8.176 ± 1.054
1.665GlyTrp: 1.665 ± 0.281
2.952GlyTyr: 2.952 ± 0.354
0.0GlyXaa: 0.0 ± 0.0
His
1.438HisAla: 1.438 ± 0.337
0.151HisCys: 0.151 ± 0.107
0.833HisAsp: 0.833 ± 0.318
0.681HisGlu: 0.681 ± 0.271
0.681HisPhe: 0.681 ± 0.227
1.363HisGly: 1.363 ± 0.37
0.53HisHis: 0.53 ± 0.241
0.681HisIle: 0.681 ± 0.17
0.757HisLys: 0.757 ± 0.236
2.271HisLeu: 2.271 ± 0.383
0.53HisMet: 0.53 ± 0.188
0.454HisAsn: 0.454 ± 0.162
1.136HisPro: 1.136 ± 0.309
0.454HisGln: 0.454 ± 0.176
1.06HisArg: 1.06 ± 0.261
0.833HisSer: 0.833 ± 0.299
0.681HisThr: 0.681 ± 0.228
1.817HisVal: 1.817 ± 0.4
0.379HisTrp: 0.379 ± 0.164
0.984HisTyr: 0.984 ± 0.271
0.0HisXaa: 0.0 ± 0.0
Ile
4.693IleAla: 4.693 ± 0.451
0.0IleCys: 0.0 ± 0.0
3.028IleAsp: 3.028 ± 0.537
2.801IleGlu: 2.801 ± 0.435
0.681IlePhe: 0.681 ± 0.224
3.558IleGly: 3.558 ± 0.542
1.287IleHis: 1.287 ± 0.351
1.514IleIle: 1.514 ± 0.325
2.574IleLys: 2.574 ± 0.423
2.877IleLeu: 2.877 ± 0.459
0.303IleMet: 0.303 ± 0.161
1.287IleAsn: 1.287 ± 0.33
1.514IlePro: 1.514 ± 0.356
0.984IleGln: 0.984 ± 0.269
2.801IleArg: 2.801 ± 0.569
1.817IleSer: 1.817 ± 0.305
3.482IleThr: 3.482 ± 0.59
3.028IleVal: 3.028 ± 0.518
0.53IleTrp: 0.53 ± 0.194
0.908IleTyr: 0.908 ± 0.261
0.0IleXaa: 0.0 ± 0.0
Lys
5.526LysAla: 5.526 ± 0.941
0.454LysCys: 0.454 ± 0.196
2.725LysAsp: 2.725 ± 0.377
2.044LysGlu: 2.044 ± 0.433
1.136LysPhe: 1.136 ± 0.416
3.558LysGly: 3.558 ± 0.5
0.757LysHis: 0.757 ± 0.241
1.287LysIle: 1.287 ± 0.284
2.044LysLys: 2.044 ± 0.457
3.407LysLeu: 3.407 ± 0.519
1.211LysMet: 1.211 ± 0.275
1.363LysAsn: 1.363 ± 0.307
2.952LysPro: 2.952 ± 0.462
1.438LysGln: 1.438 ± 0.296
3.558LysArg: 3.558 ± 0.481
2.271LysSer: 2.271 ± 0.618
2.65LysThr: 2.65 ± 0.396
3.028LysVal: 3.028 ± 0.366
0.53LysTrp: 0.53 ± 0.185
0.379LysTyr: 0.379 ± 0.186
0.0LysXaa: 0.0 ± 0.0
Leu
9.992LeuAla: 9.992 ± 1.179
0.454LeuCys: 0.454 ± 0.187
5.526LeuAsp: 5.526 ± 0.65
5.072LeuGlu: 5.072 ± 0.537
2.195LeuPhe: 2.195 ± 0.42
7.797LeuGly: 7.797 ± 0.633
1.211LeuHis: 1.211 ± 0.26
3.936LeuIle: 3.936 ± 0.506
3.558LeuLys: 3.558 ± 0.523
6.207LeuLeu: 6.207 ± 0.706
2.12LeuMet: 2.12 ± 0.333
2.725LeuAsn: 2.725 ± 0.586
4.693LeuPro: 4.693 ± 0.591
3.331LeuGln: 3.331 ± 0.663
5.375LeuArg: 5.375 ± 0.706
4.618LeuSer: 4.618 ± 0.533
5.526LeuThr: 5.526 ± 0.806
6.51LeuVal: 6.51 ± 0.655
1.06LeuTrp: 1.06 ± 0.246
1.817LeuTyr: 1.817 ± 0.358
0.0LeuXaa: 0.0 ± 0.0
Met
3.331MetAla: 3.331 ± 0.446
0.076MetCys: 0.076 ± 0.074
1.287MetAsp: 1.287 ± 0.256
1.06MetGlu: 1.06 ± 0.303
0.303MetPhe: 0.303 ± 0.193
1.741MetGly: 1.741 ± 0.376
0.0MetHis: 0.0 ± 0.0
0.53MetIle: 0.53 ± 0.229
1.363MetLys: 1.363 ± 0.348
1.363MetLeu: 1.363 ± 0.302
0.454MetMet: 0.454 ± 0.176
1.438MetAsn: 1.438 ± 0.413
1.136MetPro: 1.136 ± 0.316
0.454MetGln: 0.454 ± 0.164
1.665MetArg: 1.665 ± 0.338
1.665MetSer: 1.665 ± 0.363
2.195MetThr: 2.195 ± 0.377
1.817MetVal: 1.817 ± 0.393
0.379MetTrp: 0.379 ± 0.189
0.379MetTyr: 0.379 ± 0.165
0.0MetXaa: 0.0 ± 0.0
Asn
2.877AsnAla: 2.877 ± 0.582
0.076AsnCys: 0.076 ± 0.069
1.665AsnAsp: 1.665 ± 0.316
1.741AsnGlu: 1.741 ± 0.371
0.757AsnPhe: 0.757 ± 0.221
3.179AsnGly: 3.179 ± 0.559
0.303AsnHis: 0.303 ± 0.152
1.136AsnIle: 1.136 ± 0.283
1.287AsnLys: 1.287 ± 0.351
2.347AsnLeu: 2.347 ± 0.42
0.606AsnMet: 0.606 ± 0.223
0.833AsnAsn: 0.833 ± 0.214
2.347AsnPro: 2.347 ± 0.368
1.136AsnGln: 1.136 ± 0.287
1.968AsnArg: 1.968 ± 0.459
2.574AsnSer: 2.574 ± 0.525
1.817AsnThr: 1.817 ± 0.506
1.817AsnVal: 1.817 ± 0.344
0.681AsnTrp: 0.681 ± 0.26
0.833AsnTyr: 0.833 ± 0.273
0.0AsnXaa: 0.0 ± 0.0
Pro
6.964ProAla: 6.964 ± 0.648
0.303ProCys: 0.303 ± 0.165
3.407ProAsp: 3.407 ± 0.516
5.072ProGlu: 5.072 ± 0.809
1.59ProPhe: 1.59 ± 0.386
5.299ProGly: 5.299 ± 0.608
1.136ProHis: 1.136 ± 0.356
2.271ProIle: 2.271 ± 0.474
2.65ProLys: 2.65 ± 0.467
3.936ProLeu: 3.936 ± 0.555
0.53ProMet: 0.53 ± 0.199
1.665ProAsn: 1.665 ± 0.302
2.195ProPro: 2.195 ± 0.406
1.514ProGln: 1.514 ± 0.381
2.12ProArg: 2.12 ± 0.381
3.028ProSer: 3.028 ± 0.511
3.482ProThr: 3.482 ± 0.584
3.407ProVal: 3.407 ± 0.575
1.363ProTrp: 1.363 ± 0.39
0.681ProTyr: 0.681 ± 0.242
0.0ProXaa: 0.0 ± 0.0
Gln
5.148GlnAla: 5.148 ± 0.61
0.076GlnCys: 0.076 ± 0.07
1.968GlnAsp: 1.968 ± 0.434
1.817GlnGlu: 1.817 ± 0.355
1.514GlnPhe: 1.514 ± 0.39
2.801GlnGly: 2.801 ± 0.387
0.454GlnHis: 0.454 ± 0.169
1.514GlnIle: 1.514 ± 0.378
1.741GlnLys: 1.741 ± 0.299
3.482GlnLeu: 3.482 ± 0.487
0.606GlnMet: 0.606 ± 0.208
0.908GlnAsn: 0.908 ± 0.235
1.438GlnPro: 1.438 ± 0.323
0.984GlnGln: 0.984 ± 0.188
1.968GlnArg: 1.968 ± 0.384
1.59GlnSer: 1.59 ± 0.327
1.893GlnThr: 1.893 ± 0.372
3.179GlnVal: 3.179 ± 0.435
0.757GlnTrp: 0.757 ± 0.26
0.908GlnTyr: 0.908 ± 0.288
0.0GlnXaa: 0.0 ± 0.0
Arg
7.04ArgAla: 7.04 ± 0.827
0.454ArgCys: 0.454 ± 0.236
3.331ArgAsp: 3.331 ± 0.436
4.315ArgGlu: 4.315 ± 0.738
2.271ArgPhe: 2.271 ± 0.406
4.921ArgGly: 4.921 ± 0.878
1.514ArgHis: 1.514 ± 0.351
2.347ArgIle: 2.347 ± 0.438
2.574ArgLys: 2.574 ± 0.515
6.056ArgLeu: 6.056 ± 0.721
1.363ArgMet: 1.363 ± 0.316
1.59ArgAsn: 1.59 ± 0.318
3.179ArgPro: 3.179 ± 0.496
2.195ArgGln: 2.195 ± 0.338
5.072ArgArg: 5.072 ± 0.81
3.255ArgSer: 3.255 ± 0.508
3.785ArgThr: 3.785 ± 0.489
5.602ArgVal: 5.602 ± 0.701
1.438ArgTrp: 1.438 ± 0.334
1.59ArgTyr: 1.59 ± 0.347
0.0ArgXaa: 0.0 ± 0.0
Ser
6.964SerAla: 6.964 ± 0.814
0.227SerCys: 0.227 ± 0.123
4.542SerAsp: 4.542 ± 0.548
2.347SerGlu: 2.347 ± 0.434
1.968SerPhe: 1.968 ± 0.378
5.753SerGly: 5.753 ± 0.757
0.681SerHis: 0.681 ± 0.23
2.952SerIle: 2.952 ± 0.467
1.968SerLys: 1.968 ± 0.362
4.088SerLeu: 4.088 ± 0.636
1.438SerMet: 1.438 ± 0.394
1.817SerAsn: 1.817 ± 0.419
2.12SerPro: 2.12 ± 0.363
1.438SerGln: 1.438 ± 0.339
3.558SerArg: 3.558 ± 0.472
2.65SerSer: 2.65 ± 0.449
3.936SerThr: 3.936 ± 0.531
3.634SerVal: 3.634 ± 0.536
1.287SerTrp: 1.287 ± 0.231
1.211SerTyr: 1.211 ± 0.332
0.0SerXaa: 0.0 ± 0.0
Thr
7.192ThrAla: 7.192 ± 0.918
0.379ThrCys: 0.379 ± 0.185
3.558ThrAsp: 3.558 ± 0.532
3.407ThrGlu: 3.407 ± 0.505
2.574ThrPhe: 2.574 ± 0.505
5.299ThrGly: 5.299 ± 0.669
0.833ThrHis: 0.833 ± 0.253
3.634ThrIle: 3.634 ± 0.624
2.65ThrLys: 2.65 ± 0.392
5.299ThrLeu: 5.299 ± 0.656
0.606ThrMet: 0.606 ± 0.226
1.136ThrAsn: 1.136 ± 0.35
3.936ThrPro: 3.936 ± 0.398
1.438ThrGln: 1.438 ± 0.4
3.936ThrArg: 3.936 ± 0.491
4.012ThrSer: 4.012 ± 0.601
5.375ThrThr: 5.375 ± 0.886
5.905ThrVal: 5.905 ± 0.941
1.665ThrTrp: 1.665 ± 0.287
2.271ThrTyr: 2.271 ± 0.412
0.0ThrXaa: 0.0 ± 0.0
Val
8.857ValAla: 8.857 ± 0.754
0.076ValCys: 0.076 ± 0.088
5.299ValAsp: 5.299 ± 0.733
5.98ValGlu: 5.98 ± 0.644
2.801ValPhe: 2.801 ± 0.53
5.829ValGly: 5.829 ± 0.932
1.59ValHis: 1.59 ± 0.379
3.785ValIle: 3.785 ± 0.472
2.952ValLys: 2.952 ± 0.585
6.359ValLeu: 6.359 ± 0.761
1.59ValMet: 1.59 ± 0.296
2.498ValAsn: 2.498 ± 0.48
4.618ValPro: 4.618 ± 0.693
2.877ValGln: 2.877 ± 0.503
5.526ValArg: 5.526 ± 0.617
4.088ValSer: 4.088 ± 0.521
5.375ValThr: 5.375 ± 0.725
7.797ValVal: 7.797 ± 0.752
2.195ValTrp: 2.195 ± 0.589
2.952ValTyr: 2.952 ± 0.446
0.0ValXaa: 0.0 ± 0.0
Trp
2.498TrpAla: 2.498 ± 0.491
0.0TrpCys: 0.0 ± 0.0
0.833TrpAsp: 0.833 ± 0.233
1.438TrpGlu: 1.438 ± 0.397
0.606TrpPhe: 0.606 ± 0.204
1.438TrpGly: 1.438 ± 0.275
0.151TrpHis: 0.151 ± 0.098
0.833TrpIle: 0.833 ± 0.281
0.757TrpLys: 0.757 ± 0.212
1.817TrpLeu: 1.817 ± 0.384
0.833TrpMet: 0.833 ± 0.219
0.833TrpAsn: 0.833 ± 0.348
0.454TrpPro: 0.454 ± 0.185
1.211TrpGln: 1.211 ± 0.414
0.908TrpArg: 0.908 ± 0.319
1.665TrpSer: 1.665 ± 0.332
1.211TrpThr: 1.211 ± 0.347
2.12TrpVal: 2.12 ± 0.385
0.151TrpTrp: 0.151 ± 0.105
0.53TrpTyr: 0.53 ± 0.185
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.952TyrAla: 2.952 ± 0.453
0.303TyrCys: 0.303 ± 0.133
1.741TyrAsp: 1.741 ± 0.351
1.438TyrGlu: 1.438 ± 0.33
0.53TyrPhe: 0.53 ± 0.207
2.422TyrGly: 2.422 ± 0.406
0.379TyrHis: 0.379 ± 0.148
0.606TyrIle: 0.606 ± 0.22
0.53TyrLys: 0.53 ± 0.191
1.438TyrLeu: 1.438 ± 0.345
0.908TyrMet: 0.908 ± 0.237
0.606TyrAsn: 0.606 ± 0.218
1.817TyrPro: 1.817 ± 0.414
0.757TyrGln: 0.757 ± 0.195
2.271TyrArg: 2.271 ± 0.473
1.59TyrSer: 1.59 ± 0.371
1.968TyrThr: 1.968 ± 0.309
2.195TyrVal: 2.195 ± 0.552
0.53TyrTrp: 0.53 ± 0.232
0.53TyrTyr: 0.53 ± 0.225
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (13211 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski