Amino acid dipepetide frequency for Escherichia phage Fraca

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.869AlaAla: 10.869 ± 1.666
1.14AlaCys: 1.14 ± 0.312
6.308AlaAsp: 6.308 ± 0.665
6.156AlaGlu: 6.156 ± 0.841
3.344AlaPhe: 3.344 ± 0.518
8.057AlaGly: 8.057 ± 0.799
1.368AlaHis: 1.368 ± 0.326
6.156AlaIle: 6.156 ± 0.633
7.449AlaLys: 7.449 ± 0.721
7.296AlaLeu: 7.296 ± 0.736
3.116AlaMet: 3.116 ± 0.447
3.876AlaAsn: 3.876 ± 0.712
2.432AlaPro: 2.432 ± 0.455
4.56AlaGln: 4.56 ± 0.815
4.56AlaArg: 4.56 ± 0.682
7.22AlaSer: 7.22 ± 1.205
4.788AlaThr: 4.788 ± 0.733
6.992AlaVal: 6.992 ± 0.661
1.672AlaTrp: 1.672 ± 0.341
2.356AlaTyr: 2.356 ± 0.412
0.0AlaXaa: 0.0 ± 0.0
Cys
0.836CysAla: 0.836 ± 0.292
0.076CysCys: 0.076 ± 0.081
0.76CysAsp: 0.76 ± 0.227
1.14CysGlu: 1.14 ± 0.268
0.228CysPhe: 0.228 ± 0.128
1.14CysGly: 1.14 ± 0.363
0.38CysHis: 0.38 ± 0.167
0.38CysIle: 0.38 ± 0.151
0.76CysLys: 0.76 ± 0.237
1.064CysLeu: 1.064 ± 0.336
0.76CysMet: 0.76 ± 0.244
0.304CysAsn: 0.304 ± 0.135
0.532CysPro: 0.532 ± 0.182
0.38CysGln: 0.38 ± 0.139
0.912CysArg: 0.912 ± 0.372
0.608CysSer: 0.608 ± 0.215
0.38CysThr: 0.38 ± 0.159
0.228CysVal: 0.228 ± 0.133
0.38CysTrp: 0.38 ± 0.23
0.228CysTyr: 0.228 ± 0.129
0.0CysXaa: 0.0 ± 0.0
Asp
5.472AspAla: 5.472 ± 0.505
0.836AspCys: 0.836 ± 0.292
3.952AspAsp: 3.952 ± 0.569
4.712AspGlu: 4.712 ± 0.616
1.292AspPhe: 1.292 ± 0.315
5.928AspGly: 5.928 ± 0.845
0.76AspHis: 0.76 ± 0.243
3.268AspIle: 3.268 ± 0.422
3.268AspLys: 3.268 ± 0.398
4.104AspLeu: 4.104 ± 0.518
1.444AspMet: 1.444 ± 0.354
1.9AspAsn: 1.9 ± 0.41
1.748AspPro: 1.748 ± 0.36
1.748AspGln: 1.748 ± 0.343
3.344AspArg: 3.344 ± 0.586
2.736AspSer: 2.736 ± 0.393
2.584AspThr: 2.584 ± 0.362
4.56AspVal: 4.56 ± 0.597
0.836AspTrp: 0.836 ± 0.232
1.748AspTyr: 1.748 ± 0.481
0.0AspXaa: 0.0 ± 0.0
Glu
6.916GluAla: 6.916 ± 0.953
0.76GluCys: 0.76 ± 0.265
2.432GluAsp: 2.432 ± 0.379
3.116GluGlu: 3.116 ± 0.562
2.508GluPhe: 2.508 ± 0.28
3.8GluGly: 3.8 ± 0.722
1.292GluHis: 1.292 ± 0.403
3.876GluIle: 3.876 ± 0.525
3.876GluLys: 3.876 ± 0.46
6.688GluLeu: 6.688 ± 0.87
1.748GluMet: 1.748 ± 0.355
3.572GluAsn: 3.572 ± 0.486
2.128GluPro: 2.128 ± 0.422
4.104GluGln: 4.104 ± 0.808
3.192GluArg: 3.192 ± 0.519
4.18GluSer: 4.18 ± 0.653
3.116GluThr: 3.116 ± 0.49
3.648GluVal: 3.648 ± 0.597
0.988GluTrp: 0.988 ± 0.278
2.28GluTyr: 2.28 ± 0.38
0.0GluXaa: 0.0 ± 0.0
Phe
3.268PheAla: 3.268 ± 0.445
0.228PheCys: 0.228 ± 0.127
1.976PheAsp: 1.976 ± 0.361
1.748PheGlu: 1.748 ± 0.38
0.988PhePhe: 0.988 ± 0.304
3.192PheGly: 3.192 ± 0.483
0.38PheHis: 0.38 ± 0.176
2.508PheIle: 2.508 ± 0.419
1.52PheLys: 1.52 ± 0.331
2.28PheLeu: 2.28 ± 0.349
1.064PheMet: 1.064 ± 0.274
1.9PheAsn: 1.9 ± 0.345
1.444PhePro: 1.444 ± 0.321
0.988PheGln: 0.988 ± 0.307
2.128PheArg: 2.128 ± 0.371
2.28PheSer: 2.28 ± 0.399
2.204PheThr: 2.204 ± 0.425
1.824PheVal: 1.824 ± 0.357
0.684PheTrp: 0.684 ± 0.201
0.608PheTyr: 0.608 ± 0.177
0.0PheXaa: 0.0 ± 0.0
Gly
7.601GlyAla: 7.601 ± 1.118
0.836GlyCys: 0.836 ± 0.228
4.028GlyAsp: 4.028 ± 0.641
3.42GlyGlu: 3.42 ± 0.582
2.508GlyPhe: 2.508 ± 0.412
6.612GlyGly: 6.612 ± 0.739
0.988GlyHis: 0.988 ± 0.352
4.788GlyIle: 4.788 ± 0.431
5.168GlyLys: 5.168 ± 0.631
6.308GlyLeu: 6.308 ± 0.585
1.9GlyMet: 1.9 ± 0.417
3.648GlyAsn: 3.648 ± 0.63
1.444GlyPro: 1.444 ± 0.323
3.192GlyGln: 3.192 ± 0.482
4.104GlyArg: 4.104 ± 0.504
4.256GlySer: 4.256 ± 0.633
6.536GlyThr: 6.536 ± 0.819
5.32GlyVal: 5.32 ± 0.613
1.368GlyTrp: 1.368 ± 0.291
2.66GlyTyr: 2.66 ± 0.43
0.0GlyXaa: 0.0 ± 0.0
His
1.368HisAla: 1.368 ± 0.339
0.38HisCys: 0.38 ± 0.158
0.836HisAsp: 0.836 ± 0.281
1.672HisGlu: 1.672 ± 0.363
0.912HisPhe: 0.912 ± 0.291
1.292HisGly: 1.292 ± 0.39
0.456HisHis: 0.456 ± 0.179
1.14HisIle: 1.14 ± 0.307
0.532HisLys: 0.532 ± 0.176
1.292HisLeu: 1.292 ± 0.451
0.304HisMet: 0.304 ± 0.14
0.988HisAsn: 0.988 ± 0.353
1.292HisPro: 1.292 ± 0.273
0.912HisGln: 0.912 ± 0.288
0.988HisArg: 0.988 ± 0.255
0.912HisSer: 0.912 ± 0.264
0.912HisThr: 0.912 ± 0.232
1.14HisVal: 1.14 ± 0.293
0.228HisTrp: 0.228 ± 0.13
0.532HisTyr: 0.532 ± 0.226
0.0HisXaa: 0.0 ± 0.0
Ile
4.56IleAla: 4.56 ± 0.542
0.912IleCys: 0.912 ± 0.346
4.104IleAsp: 4.104 ± 0.524
4.484IleGlu: 4.484 ± 0.524
2.204IlePhe: 2.204 ± 0.441
3.648IleGly: 3.648 ± 0.424
1.14IleHis: 1.14 ± 0.386
2.584IleIle: 2.584 ± 0.463
3.42IleLys: 3.42 ± 0.534
3.648IleLeu: 3.648 ± 0.528
1.14IleMet: 1.14 ± 0.384
2.66IleAsn: 2.66 ± 0.424
1.52IlePro: 1.52 ± 0.338
2.584IleGln: 2.584 ± 0.589
2.964IleArg: 2.964 ± 0.44
4.788IleSer: 4.788 ± 0.567
3.572IleThr: 3.572 ± 0.503
2.888IleVal: 2.888 ± 0.426
0.912IleTrp: 0.912 ± 0.26
2.28IleTyr: 2.28 ± 0.439
0.0IleXaa: 0.0 ± 0.0
Lys
5.7LysAla: 5.7 ± 0.748
0.684LysCys: 0.684 ± 0.253
3.192LysAsp: 3.192 ± 0.461
3.116LysGlu: 3.116 ± 0.526
2.052LysPhe: 2.052 ± 0.399
3.8LysGly: 3.8 ± 0.491
1.368LysHis: 1.368 ± 0.445
2.964LysIle: 2.964 ± 0.436
3.344LysLys: 3.344 ± 0.53
3.876LysLeu: 3.876 ± 0.542
1.976LysMet: 1.976 ± 0.36
2.432LysAsn: 2.432 ± 0.472
3.572LysPro: 3.572 ± 0.741
2.432LysGln: 2.432 ± 0.406
2.736LysArg: 2.736 ± 0.548
2.508LysSer: 2.508 ± 0.387
4.408LysThr: 4.408 ± 0.425
4.18LysVal: 4.18 ± 0.526
1.368LysTrp: 1.368 ± 0.307
1.596LysTyr: 1.596 ± 0.343
0.0LysXaa: 0.0 ± 0.0
Leu
8.285LeuAla: 8.285 ± 0.999
1.14LeuCys: 1.14 ± 0.405
4.636LeuAsp: 4.636 ± 0.548
4.864LeuGlu: 4.864 ± 0.648
2.128LeuPhe: 2.128 ± 0.371
3.8LeuGly: 3.8 ± 0.454
1.596LeuHis: 1.596 ± 0.309
3.42LeuIle: 3.42 ± 0.535
4.408LeuLys: 4.408 ± 0.553
5.776LeuLeu: 5.776 ± 0.928
1.672LeuMet: 1.672 ± 0.384
3.496LeuAsn: 3.496 ± 0.377
2.888LeuPro: 2.888 ± 0.386
2.888LeuGln: 2.888 ± 0.383
5.928LeuArg: 5.928 ± 0.695
6.156LeuSer: 6.156 ± 0.518
6.08LeuThr: 6.08 ± 0.712
4.56LeuVal: 4.56 ± 0.541
0.836LeuTrp: 0.836 ± 0.294
2.128LeuTyr: 2.128 ± 0.393
0.0LeuXaa: 0.0 ± 0.0
Met
2.584MetAla: 2.584 ± 0.367
0.076MetCys: 0.076 ± 0.074
1.52MetAsp: 1.52 ± 0.344
1.064MetGlu: 1.064 ± 0.289
0.608MetPhe: 0.608 ± 0.225
1.672MetGly: 1.672 ± 0.286
0.532MetHis: 0.532 ± 0.244
1.52MetIle: 1.52 ± 0.335
2.204MetLys: 2.204 ± 0.421
2.66MetLeu: 2.66 ± 0.459
0.38MetMet: 0.38 ± 0.179
1.216MetAsn: 1.216 ± 0.384
1.292MetPro: 1.292 ± 0.29
0.836MetGln: 0.836 ± 0.233
1.52MetArg: 1.52 ± 0.309
3.42MetSer: 3.42 ± 0.539
1.748MetThr: 1.748 ± 0.377
1.824MetVal: 1.824 ± 0.497
0.304MetTrp: 0.304 ± 0.148
0.304MetTyr: 0.304 ± 0.122
0.0MetXaa: 0.0 ± 0.0
Asn
5.016AsnAla: 5.016 ± 0.57
0.304AsnCys: 0.304 ± 0.195
2.052AsnAsp: 2.052 ± 0.344
2.888AsnGlu: 2.888 ± 0.483
1.52AsnPhe: 1.52 ± 0.38
3.952AsnGly: 3.952 ± 0.574
0.684AsnHis: 0.684 ± 0.195
2.584AsnIle: 2.584 ± 0.424
2.128AsnLys: 2.128 ± 0.353
3.116AsnLeu: 3.116 ± 0.531
1.444AsnMet: 1.444 ± 0.35
1.9AsnAsn: 1.9 ± 0.339
1.976AsnPro: 1.976 ± 0.391
1.596AsnGln: 1.596 ± 0.312
1.9AsnArg: 1.9 ± 0.392
3.42AsnSer: 3.42 ± 0.493
2.812AsnThr: 2.812 ± 0.552
2.128AsnVal: 2.128 ± 0.459
1.216AsnTrp: 1.216 ± 0.311
1.368AsnTyr: 1.368 ± 0.374
0.0AsnXaa: 0.0 ± 0.0
Pro
3.876ProAla: 3.876 ± 0.639
0.456ProCys: 0.456 ± 0.196
3.192ProAsp: 3.192 ± 0.424
2.66ProGlu: 2.66 ± 0.471
1.672ProPhe: 1.672 ± 0.356
3.192ProGly: 3.192 ± 0.384
0.76ProHis: 0.76 ± 0.236
1.748ProIle: 1.748 ± 0.373
1.444ProLys: 1.444 ± 0.315
2.356ProLeu: 2.356 ± 0.54
0.684ProMet: 0.684 ± 0.285
0.836ProAsn: 0.836 ± 0.264
1.672ProPro: 1.672 ± 0.353
1.9ProGln: 1.9 ± 0.354
1.824ProArg: 1.824 ± 0.414
2.204ProSer: 2.204 ± 0.379
2.356ProThr: 2.356 ± 0.477
2.736ProVal: 2.736 ± 0.355
0.532ProTrp: 0.532 ± 0.223
0.912ProTyr: 0.912 ± 0.23
0.0ProXaa: 0.0 ± 0.0
Gln
4.712GlnAla: 4.712 ± 0.883
0.38GlnCys: 0.38 ± 0.17
0.988GlnAsp: 0.988 ± 0.249
2.964GlnGlu: 2.964 ± 0.697
1.368GlnPhe: 1.368 ± 0.292
1.596GlnGly: 1.596 ± 0.333
0.988GlnHis: 0.988 ± 0.256
2.128GlnIle: 2.128 ± 0.497
2.888GlnLys: 2.888 ± 0.469
3.42GlnLeu: 3.42 ± 0.465
1.976GlnMet: 1.976 ± 0.337
1.52GlnAsn: 1.52 ± 0.347
1.976GlnPro: 1.976 ± 0.387
3.572GlnGln: 3.572 ± 0.906
2.736GlnArg: 2.736 ± 0.67
2.736GlnSer: 2.736 ± 0.456
2.812GlnThr: 2.812 ± 0.461
2.736GlnVal: 2.736 ± 0.42
0.532GlnTrp: 0.532 ± 0.227
1.064GlnTyr: 1.064 ± 0.309
0.0GlnXaa: 0.0 ± 0.0
Arg
4.332ArgAla: 4.332 ± 0.66
0.988ArgCys: 0.988 ± 0.293
3.344ArgAsp: 3.344 ± 0.533
3.952ArgGlu: 3.952 ± 0.643
1.9ArgPhe: 1.9 ± 0.307
3.8ArgGly: 3.8 ± 0.463
1.14ArgHis: 1.14 ± 0.298
3.8ArgIle: 3.8 ± 0.695
4.104ArgLys: 4.104 ± 0.584
4.864ArgLeu: 4.864 ± 0.667
1.216ArgMet: 1.216 ± 0.296
3.268ArgAsn: 3.268 ± 0.483
1.672ArgPro: 1.672 ± 0.286
2.052ArgGln: 2.052 ± 0.353
3.42ArgArg: 3.42 ± 0.638
2.66ArgSer: 2.66 ± 0.437
2.66ArgThr: 2.66 ± 0.432
4.256ArgVal: 4.256 ± 0.563
0.988ArgTrp: 0.988 ± 0.238
2.204ArgTyr: 2.204 ± 0.4
0.0ArgXaa: 0.0 ± 0.0
Ser
6.764SerAla: 6.764 ± 0.667
0.304SerCys: 0.304 ± 0.139
3.952SerAsp: 3.952 ± 0.492
3.876SerGlu: 3.876 ± 0.472
1.9SerPhe: 1.9 ± 0.388
6.84SerGly: 6.84 ± 0.63
0.988SerHis: 0.988 ± 0.247
3.192SerIle: 3.192 ± 0.538
2.964SerLys: 2.964 ± 0.515
5.548SerLeu: 5.548 ± 0.722
1.52SerMet: 1.52 ± 0.328
2.66SerAsn: 2.66 ± 0.471
2.812SerPro: 2.812 ± 0.359
2.888SerGln: 2.888 ± 0.567
3.648SerArg: 3.648 ± 0.689
5.396SerSer: 5.396 ± 0.813
2.508SerThr: 2.508 ± 0.537
5.472SerVal: 5.472 ± 0.806
1.14SerTrp: 1.14 ± 0.249
1.9SerTyr: 1.9 ± 0.382
0.0SerXaa: 0.0 ± 0.0
Thr
7.296ThrAla: 7.296 ± 0.847
0.608ThrCys: 0.608 ± 0.247
3.344ThrAsp: 3.344 ± 0.434
4.18ThrGlu: 4.18 ± 0.568
1.9ThrPhe: 1.9 ± 0.333
6.156ThrGly: 6.156 ± 0.832
1.064ThrHis: 1.064 ± 0.219
3.42ThrIle: 3.42 ± 0.569
2.128ThrLys: 2.128 ± 0.333
4.712ThrLeu: 4.712 ± 0.442
1.14ThrMet: 1.14 ± 0.281
2.28ThrAsn: 2.28 ± 0.389
2.128ThrPro: 2.128 ± 0.388
2.508ThrGln: 2.508 ± 0.616
3.344ThrArg: 3.344 ± 0.485
3.344ThrSer: 3.344 ± 0.597
3.724ThrThr: 3.724 ± 0.532
4.484ThrVal: 4.484 ± 0.507
1.292ThrTrp: 1.292 ± 0.448
1.824ThrTyr: 1.824 ± 0.371
0.0ThrXaa: 0.0 ± 0.0
Val
5.852ValAla: 5.852 ± 0.721
0.608ValCys: 0.608 ± 0.188
2.964ValAsp: 2.964 ± 0.429
5.32ValGlu: 5.32 ± 0.755
2.28ValPhe: 2.28 ± 0.304
4.484ValGly: 4.484 ± 0.816
0.988ValHis: 0.988 ± 0.227
4.408ValIle: 4.408 ± 0.564
4.104ValLys: 4.104 ± 0.573
4.788ValLeu: 4.788 ± 0.503
2.28ValMet: 2.28 ± 0.355
3.648ValAsn: 3.648 ± 0.462
2.812ValPro: 2.812 ± 0.521
2.128ValGln: 2.128 ± 0.283
3.8ValArg: 3.8 ± 0.561
4.332ValSer: 4.332 ± 0.68
4.94ValThr: 4.94 ± 0.711
4.864ValVal: 4.864 ± 0.628
1.216ValTrp: 1.216 ± 0.334
2.204ValTyr: 2.204 ± 0.452
0.0ValXaa: 0.0 ± 0.0
Trp
1.368TrpAla: 1.368 ± 0.246
0.608TrpCys: 0.608 ± 0.203
1.216TrpAsp: 1.216 ± 0.298
1.216TrpGlu: 1.216 ± 0.274
0.684TrpPhe: 0.684 ± 0.21
1.14TrpGly: 1.14 ± 0.332
0.532TrpHis: 0.532 ± 0.17
0.76TrpIle: 0.76 ± 0.233
0.76TrpLys: 0.76 ± 0.186
0.836TrpLeu: 0.836 ± 0.298
0.608TrpMet: 0.608 ± 0.197
0.684TrpAsn: 0.684 ± 0.275
0.38TrpPro: 0.38 ± 0.154
0.608TrpGln: 0.608 ± 0.193
0.912TrpArg: 0.912 ± 0.241
1.064TrpSer: 1.064 ± 0.284
1.14TrpThr: 1.14 ± 0.326
1.9TrpVal: 1.9 ± 0.357
0.532TrpTrp: 0.532 ± 0.229
0.608TrpTyr: 0.608 ± 0.191
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.04TyrAla: 3.04 ± 0.45
0.152TyrCys: 0.152 ± 0.108
1.444TyrAsp: 1.444 ± 0.257
1.672TyrGlu: 1.672 ± 0.295
1.216TyrPhe: 1.216 ± 0.326
2.584TyrGly: 2.584 ± 0.397
0.684TyrHis: 0.684 ± 0.244
1.368TyrIle: 1.368 ± 0.35
0.912TyrLys: 0.912 ± 0.289
2.052TyrLeu: 2.052 ± 0.455
0.836TyrMet: 0.836 ± 0.22
1.216TyrAsn: 1.216 ± 0.23
1.292TyrPro: 1.292 ± 0.315
1.216TyrGln: 1.216 ± 0.287
2.66TyrArg: 2.66 ± 0.492
2.052TyrSer: 2.052 ± 0.416
1.596TyrThr: 1.596 ± 0.435
2.28TyrVal: 2.28 ± 0.422
0.532TyrTrp: 0.532 ± 0.182
0.76TyrTyr: 0.76 ± 0.215
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (13158 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski