Amino acid dipepetide frequency for Enterococcus phage phiFL4A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.422AlaAla: 5.422 ± 1.173
0.085AlaCys: 0.085 ± 0.083
3.389AlaAsp: 3.389 ± 0.479
5.931AlaGlu: 5.931 ± 0.911
3.05AlaPhe: 3.05 ± 0.641
3.05AlaGly: 3.05 ± 0.547
1.101AlaHis: 1.101 ± 0.314
6.778AlaIle: 6.778 ± 1.071
6.015AlaLys: 6.015 ± 0.758
6.863AlaLeu: 6.863 ± 0.728
1.779AlaMet: 1.779 ± 0.43
2.881AlaAsn: 2.881 ± 0.54
1.44AlaPro: 1.44 ± 0.391
2.626AlaGln: 2.626 ± 0.678
2.372AlaArg: 2.372 ± 0.453
4.66AlaSer: 4.66 ± 0.637
5.338AlaThr: 5.338 ± 0.994
4.66AlaVal: 4.66 ± 0.947
0.763AlaTrp: 0.763 ± 0.256
2.711AlaTyr: 2.711 ± 0.498
0.0AlaXaa: 0.0 ± 0.0
Cys
0.169CysAla: 0.169 ± 0.11
0.0CysCys: 0.0 ± 0.0
0.339CysAsp: 0.339 ± 0.228
0.424CysGlu: 0.424 ± 0.183
0.339CysPhe: 0.339 ± 0.212
0.339CysGly: 0.339 ± 0.144
0.0CysHis: 0.0 ± 0.0
0.424CysIle: 0.424 ± 0.204
0.424CysLys: 0.424 ± 0.204
0.508CysLeu: 0.508 ± 0.21
0.169CysMet: 0.169 ± 0.124
0.0CysAsn: 0.0 ± 0.0
0.169CysPro: 0.169 ± 0.132
0.085CysGln: 0.085 ± 0.09
0.339CysArg: 0.339 ± 0.142
0.339CysSer: 0.339 ± 0.15
0.424CysThr: 0.424 ± 0.186
0.339CysVal: 0.339 ± 0.175
0.0CysTrp: 0.0 ± 0.0
0.339CysTyr: 0.339 ± 0.173
0.0CysXaa: 0.0 ± 0.0
Asp
3.643AspAla: 3.643 ± 0.768
0.424AspCys: 0.424 ± 0.211
3.389AspAsp: 3.389 ± 0.736
5.422AspGlu: 5.422 ± 0.849
3.05AspPhe: 3.05 ± 0.46
4.321AspGly: 4.321 ± 0.64
0.932AspHis: 0.932 ± 0.251
4.236AspIle: 4.236 ± 0.421
4.66AspLys: 4.66 ± 0.561
5.083AspLeu: 5.083 ± 0.649
0.932AspMet: 0.932 ± 0.307
2.881AspAsn: 2.881 ± 0.562
1.864AspPro: 1.864 ± 0.521
1.694AspGln: 1.694 ± 0.362
2.796AspArg: 2.796 ± 0.611
3.135AspSer: 3.135 ± 0.535
3.728AspThr: 3.728 ± 0.617
4.406AspVal: 4.406 ± 0.595
1.525AspTrp: 1.525 ± 0.383
2.965AspTyr: 2.965 ± 0.43
0.0AspXaa: 0.0 ± 0.0
Glu
6.693GluAla: 6.693 ± 0.821
0.339GluCys: 0.339 ± 0.186
4.829GluAsp: 4.829 ± 0.798
5.931GluGlu: 5.931 ± 1.084
3.558GluPhe: 3.558 ± 0.635
5.083GluGly: 5.083 ± 0.954
0.847GluHis: 0.847 ± 0.247
6.608GluIle: 6.608 ± 0.686
8.049GluLys: 8.049 ± 0.86
8.303GluLeu: 8.303 ± 1.134
2.457GluMet: 2.457 ± 0.487
4.66GluAsn: 4.66 ± 0.638
1.694GluPro: 1.694 ± 0.325
3.813GluGln: 3.813 ± 0.438
3.643GluArg: 3.643 ± 0.636
3.22GluSer: 3.22 ± 0.572
5.253GluThr: 5.253 ± 0.696
5.168GluVal: 5.168 ± 0.866
1.779GluTrp: 1.779 ± 0.417
2.881GluTyr: 2.881 ± 0.634
0.0GluXaa: 0.0 ± 0.0
Phe
2.033PheAla: 2.033 ± 0.392
0.254PheCys: 0.254 ± 0.134
2.796PheAsp: 2.796 ± 0.492
3.897PheGlu: 3.897 ± 0.664
1.61PhePhe: 1.61 ± 0.3
2.965PheGly: 2.965 ± 0.417
0.424PheHis: 0.424 ± 0.267
2.711PheIle: 2.711 ± 0.47
4.914PheLys: 4.914 ± 0.796
3.558PheLeu: 3.558 ± 0.497
0.339PheMet: 0.339 ± 0.176
2.542PheAsn: 2.542 ± 0.891
1.017PhePro: 1.017 ± 0.283
1.525PheGln: 1.525 ± 0.481
1.186PheArg: 1.186 ± 0.344
2.457PheSer: 2.457 ± 0.426
3.304PheThr: 3.304 ± 0.399
2.288PheVal: 2.288 ± 0.497
0.508PheTrp: 0.508 ± 0.297
2.033PheTyr: 2.033 ± 0.428
0.0PheXaa: 0.0 ± 0.0
Gly
3.897GlyAla: 3.897 ± 0.511
0.085GlyCys: 0.085 ± 0.096
3.558GlyAsp: 3.558 ± 0.587
4.067GlyGlu: 4.067 ± 0.601
2.203GlyPhe: 2.203 ± 0.526
2.881GlyGly: 2.881 ± 0.451
0.763GlyHis: 0.763 ± 0.279
4.829GlyIle: 4.829 ± 0.777
4.321GlyLys: 4.321 ± 0.513
4.999GlyLeu: 4.999 ± 0.645
1.864GlyMet: 1.864 ± 0.335
2.626GlyAsn: 2.626 ± 0.405
1.271GlyPro: 1.271 ± 0.332
2.033GlyGln: 2.033 ± 0.331
2.203GlyArg: 2.203 ± 0.455
3.389GlySer: 3.389 ± 0.507
4.575GlyThr: 4.575 ± 0.587
4.575GlyVal: 4.575 ± 0.762
0.847GlyTrp: 0.847 ± 0.295
3.135GlyTyr: 3.135 ± 0.643
0.0GlyXaa: 0.0 ± 0.0
His
1.017HisAla: 1.017 ± 0.268
0.085HisCys: 0.085 ± 0.08
0.763HisAsp: 0.763 ± 0.224
1.356HisGlu: 1.356 ± 0.435
0.424HisPhe: 0.424 ± 0.174
0.847HisGly: 0.847 ± 0.256
0.424HisHis: 0.424 ± 0.15
0.932HisIle: 0.932 ± 0.272
0.593HisLys: 0.593 ± 0.229
1.271HisLeu: 1.271 ± 0.334
0.169HisMet: 0.169 ± 0.137
0.339HisAsn: 0.339 ± 0.151
1.017HisPro: 1.017 ± 0.304
0.508HisGln: 0.508 ± 0.236
0.593HisArg: 0.593 ± 0.186
0.593HisSer: 0.593 ± 0.269
0.932HisThr: 0.932 ± 0.325
0.847HisVal: 0.847 ± 0.268
0.169HisTrp: 0.169 ± 0.097
0.508HisTyr: 0.508 ± 0.222
0.0HisXaa: 0.0 ± 0.0
Ile
4.406IleAla: 4.406 ± 0.72
0.678IleCys: 0.678 ± 0.25
5.507IleAsp: 5.507 ± 0.536
6.27IleGlu: 6.27 ± 0.773
3.22IlePhe: 3.22 ± 1.049
2.203IleGly: 2.203 ± 0.542
1.186IleHis: 1.186 ± 0.3
4.321IleIle: 4.321 ± 0.599
7.456IleLys: 7.456 ± 0.873
4.321IleLeu: 4.321 ± 0.578
1.271IleMet: 1.271 ± 0.289
3.22IleAsn: 3.22 ± 0.583
3.22IlePro: 3.22 ± 0.506
2.542IleGln: 2.542 ± 0.558
2.288IleArg: 2.288 ± 0.487
5.168IleSer: 5.168 ± 0.996
4.575IleThr: 4.575 ± 0.795
3.643IleVal: 3.643 ± 0.549
0.932IleTrp: 0.932 ± 0.34
3.135IleTyr: 3.135 ± 0.521
0.0IleXaa: 0.0 ± 0.0
Lys
7.371LysAla: 7.371 ± 0.743
0.254LysCys: 0.254 ± 0.209
4.66LysAsp: 4.66 ± 0.978
8.642LysGlu: 8.642 ± 1.332
3.05LysPhe: 3.05 ± 0.574
5.761LysGly: 5.761 ± 0.735
1.356LysHis: 1.356 ± 0.421
4.829LysIle: 4.829 ± 0.498
8.303LysLys: 8.303 ± 0.986
5.592LysLeu: 5.592 ± 0.662
2.457LysMet: 2.457 ± 0.454
6.27LysAsn: 6.27 ± 1.008
2.033LysPro: 2.033 ± 0.421
3.558LysGln: 3.558 ± 0.5
3.474LysArg: 3.474 ± 0.65
4.829LysSer: 4.829 ± 0.587
6.947LysThr: 6.947 ± 0.742
5.846LysVal: 5.846 ± 0.709
1.356LysTrp: 1.356 ± 0.336
2.796LysTyr: 2.796 ± 0.537
0.0LysXaa: 0.0 ± 0.0
Leu
6.015LeuAla: 6.015 ± 0.698
0.678LeuCys: 0.678 ± 0.237
5.422LeuAsp: 5.422 ± 0.539
8.218LeuGlu: 8.218 ± 0.961
3.558LeuPhe: 3.558 ± 0.63
5.422LeuGly: 5.422 ± 0.808
1.186LeuHis: 1.186 ± 0.433
5.083LeuIle: 5.083 ± 0.671
8.134LeuLys: 8.134 ± 0.828
5.677LeuLeu: 5.677 ± 0.869
1.271LeuMet: 1.271 ± 0.306
5.253LeuAsn: 5.253 ± 0.593
2.965LeuPro: 2.965 ± 0.558
4.067LeuGln: 4.067 ± 0.624
3.643LeuArg: 3.643 ± 0.622
5.083LeuSer: 5.083 ± 0.71
4.321LeuThr: 4.321 ± 0.661
5.083LeuVal: 5.083 ± 0.524
0.932LeuTrp: 0.932 ± 0.325
2.118LeuTyr: 2.118 ± 0.474
0.0LeuXaa: 0.0 ± 0.0
Met
1.864MetAla: 1.864 ± 0.369
0.085MetCys: 0.085 ± 0.074
1.271MetAsp: 1.271 ± 0.291
2.033MetGlu: 2.033 ± 0.519
0.678MetPhe: 0.678 ± 0.248
1.271MetGly: 1.271 ± 0.376
0.339MetHis: 0.339 ± 0.143
1.186MetIle: 1.186 ± 0.322
2.372MetLys: 2.372 ± 0.387
1.525MetLeu: 1.525 ± 0.392
0.424MetMet: 0.424 ± 0.186
1.61MetAsn: 1.61 ± 0.466
1.101MetPro: 1.101 ± 0.331
0.847MetGln: 0.847 ± 0.301
0.932MetArg: 0.932 ± 0.261
1.271MetSer: 1.271 ± 0.286
1.017MetThr: 1.017 ± 0.261
0.847MetVal: 0.847 ± 0.197
0.169MetTrp: 0.169 ± 0.108
0.932MetTyr: 0.932 ± 0.211
0.0MetXaa: 0.0 ± 0.0
Asn
4.236AsnAla: 4.236 ± 0.751
0.254AsnCys: 0.254 ± 0.142
2.372AsnAsp: 2.372 ± 0.451
4.151AsnGlu: 4.151 ± 0.782
2.203AsnPhe: 2.203 ± 0.565
3.135AsnGly: 3.135 ± 0.517
0.593AsnHis: 0.593 ± 0.209
3.813AsnIle: 3.813 ± 0.578
3.728AsnLys: 3.728 ± 0.696
4.49AsnLeu: 4.49 ± 0.468
1.017AsnMet: 1.017 ± 0.333
2.288AsnAsn: 2.288 ± 0.51
2.033AsnPro: 2.033 ± 0.388
1.101AsnGln: 1.101 ± 0.322
2.542AsnArg: 2.542 ± 0.534
4.067AsnSer: 4.067 ± 0.582
3.05AsnThr: 3.05 ± 1.006
4.829AsnVal: 4.829 ± 0.784
0.339AsnTrp: 0.339 ± 0.148
1.779AsnTyr: 1.779 ± 0.418
0.0AsnXaa: 0.0 ± 0.0
Pro
2.118ProAla: 2.118 ± 0.393
0.085ProCys: 0.085 ± 0.08
2.372ProAsp: 2.372 ± 0.487
3.389ProGlu: 3.389 ± 0.416
1.694ProPhe: 1.694 ± 0.402
1.101ProGly: 1.101 ± 0.345
0.424ProHis: 0.424 ± 0.203
1.525ProIle: 1.525 ± 0.345
2.033ProLys: 2.033 ± 0.356
2.457ProLeu: 2.457 ± 0.45
0.593ProMet: 0.593 ± 0.213
1.864ProAsn: 1.864 ± 0.426
0.763ProPro: 0.763 ± 0.325
0.847ProGln: 0.847 ± 0.28
1.949ProArg: 1.949 ± 0.358
1.44ProSer: 1.44 ± 0.316
2.118ProThr: 2.118 ± 0.405
2.796ProVal: 2.796 ± 0.522
0.339ProTrp: 0.339 ± 0.183
1.101ProTyr: 1.101 ± 0.282
0.0ProXaa: 0.0 ± 0.0
Gln
3.474GlnAla: 3.474 ± 0.551
0.085GlnCys: 0.085 ± 0.08
1.44GlnAsp: 1.44 ± 0.377
3.643GlnGlu: 3.643 ± 0.624
1.101GlnPhe: 1.101 ± 0.359
2.203GlnGly: 2.203 ± 0.553
0.424GlnHis: 0.424 ± 0.222
2.118GlnIle: 2.118 ± 0.422
4.49GlnLys: 4.49 ± 0.667
3.813GlnLeu: 3.813 ± 0.704
0.763GlnMet: 0.763 ± 0.229
1.525GlnAsn: 1.525 ± 0.292
1.101GlnPro: 1.101 ± 0.253
1.186GlnGln: 1.186 ± 0.295
1.44GlnArg: 1.44 ± 0.327
1.779GlnSer: 1.779 ± 0.411
2.118GlnThr: 2.118 ± 0.416
2.711GlnVal: 2.711 ± 0.503
0.508GlnTrp: 0.508 ± 0.17
2.288GlnTyr: 2.288 ± 0.398
0.0GlnXaa: 0.0 ± 0.0
Arg
1.779ArgAla: 1.779 ± 0.557
0.424ArgCys: 0.424 ± 0.17
1.779ArgAsp: 1.779 ± 0.527
4.067ArgGlu: 4.067 ± 0.64
1.61ArgPhe: 1.61 ± 0.335
2.796ArgGly: 2.796 ± 0.527
0.508ArgHis: 0.508 ± 0.201
3.135ArgIle: 3.135 ± 0.472
3.813ArgLys: 3.813 ± 0.637
3.813ArgLeu: 3.813 ± 0.67
1.271ArgMet: 1.271 ± 0.318
1.864ArgAsn: 1.864 ± 0.385
1.271ArgPro: 1.271 ± 0.294
1.271ArgGln: 1.271 ± 0.304
1.271ArgArg: 1.271 ± 0.379
0.763ArgSer: 0.763 ± 0.209
2.033ArgThr: 2.033 ± 0.499
1.779ArgVal: 1.779 ± 0.399
0.763ArgTrp: 0.763 ± 0.249
1.694ArgTyr: 1.694 ± 0.513
0.0ArgXaa: 0.0 ± 0.0
Ser
3.728SerAla: 3.728 ± 0.646
0.424SerCys: 0.424 ± 0.2
4.067SerAsp: 4.067 ± 0.632
3.558SerGlu: 3.558 ± 0.641
2.796SerPhe: 2.796 ± 0.441
3.389SerGly: 3.389 ± 0.47
0.847SerHis: 0.847 ± 0.33
4.914SerIle: 4.914 ± 0.965
4.575SerLys: 4.575 ± 0.468
5.761SerLeu: 5.761 ± 0.667
1.779SerMet: 1.779 ± 0.355
3.22SerAsn: 3.22 ± 0.522
1.525SerPro: 1.525 ± 0.369
1.949SerGln: 1.949 ± 0.411
1.101SerArg: 1.101 ± 0.289
4.66SerSer: 4.66 ± 0.757
3.643SerThr: 3.643 ± 0.461
2.711SerVal: 2.711 ± 0.418
0.847SerTrp: 0.847 ± 0.29
2.457SerTyr: 2.457 ± 0.528
0.0SerXaa: 0.0 ± 0.0
Thr
5.083ThrAla: 5.083 ± 0.693
0.169ThrCys: 0.169 ± 0.11
5.677ThrAsp: 5.677 ± 0.703
4.406ThrGlu: 4.406 ± 0.787
2.626ThrPhe: 2.626 ± 0.397
4.321ThrGly: 4.321 ± 0.516
0.678ThrHis: 0.678 ± 0.247
4.999ThrIle: 4.999 ± 0.989
5.253ThrLys: 5.253 ± 0.658
6.185ThrLeu: 6.185 ± 0.799
0.847ThrMet: 0.847 ± 0.345
3.135ThrAsn: 3.135 ± 0.609
2.033ThrPro: 2.033 ± 0.364
2.626ThrGln: 2.626 ± 0.421
1.949ThrArg: 1.949 ± 0.441
3.558ThrSer: 3.558 ± 0.564
4.236ThrThr: 4.236 ± 0.512
3.813ThrVal: 3.813 ± 0.81
0.678ThrTrp: 0.678 ± 0.303
2.288ThrTyr: 2.288 ± 0.429
0.0ThrXaa: 0.0 ± 0.0
Val
4.745ValAla: 4.745 ± 0.724
0.254ValCys: 0.254 ± 0.123
3.897ValAsp: 3.897 ± 0.599
5.083ValGlu: 5.083 ± 0.678
3.05ValPhe: 3.05 ± 0.395
3.813ValGly: 3.813 ± 0.543
0.593ValHis: 0.593 ± 0.245
4.151ValIle: 4.151 ± 0.703
6.185ValLys: 6.185 ± 0.809
4.406ValLeu: 4.406 ± 0.581
1.44ValMet: 1.44 ± 0.396
2.711ValAsn: 2.711 ± 0.458
2.711ValPro: 2.711 ± 0.527
2.542ValGln: 2.542 ± 0.46
1.864ValArg: 1.864 ± 0.43
4.321ValSer: 4.321 ± 0.672
3.813ValThr: 3.813 ± 0.538
4.999ValVal: 4.999 ± 0.803
1.017ValTrp: 1.017 ± 0.468
2.626ValTyr: 2.626 ± 0.593
0.0ValXaa: 0.0 ± 0.0
Trp
0.932TrpAla: 0.932 ± 0.234
0.169TrpCys: 0.169 ± 0.128
0.932TrpAsp: 0.932 ± 0.386
0.763TrpGlu: 0.763 ± 0.224
0.593TrpPhe: 0.593 ± 0.199
0.424TrpGly: 0.424 ± 0.189
0.339TrpHis: 0.339 ± 0.2
0.763TrpIle: 0.763 ± 0.278
1.101TrpLys: 1.101 ± 0.289
1.779TrpLeu: 1.779 ± 0.44
0.169TrpMet: 0.169 ± 0.121
0.847TrpAsn: 0.847 ± 0.448
0.593TrpPro: 0.593 ± 0.228
0.932TrpGln: 0.932 ± 0.21
0.763TrpArg: 0.763 ± 0.249
0.847TrpSer: 0.847 ± 0.215
0.593TrpThr: 0.593 ± 0.203
0.847TrpVal: 0.847 ± 0.2
0.0TrpTrp: 0.0 ± 0.0
0.508TrpTyr: 0.508 ± 0.175
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.288TyrAla: 2.288 ± 0.44
0.339TyrCys: 0.339 ± 0.205
2.626TyrAsp: 2.626 ± 0.397
3.558TyrGlu: 3.558 ± 0.433
1.949TyrPhe: 1.949 ± 0.362
2.796TyrGly: 2.796 ± 0.584
0.424TyrHis: 0.424 ± 0.197
2.203TyrIle: 2.203 ± 0.434
2.965TyrLys: 2.965 ± 0.379
3.728TyrLeu: 3.728 ± 0.489
0.678TyrMet: 0.678 ± 0.203
2.288TyrAsn: 2.288 ± 0.395
1.186TyrPro: 1.186 ± 0.515
2.457TyrGln: 2.457 ± 0.626
1.356TyrArg: 1.356 ± 0.329
2.372TyrSer: 2.372 ± 0.463
2.542TyrThr: 2.542 ± 0.51
2.033TyrVal: 2.033 ± 0.45
0.424TyrTrp: 0.424 ± 0.192
1.525TyrTyr: 1.525 ± 0.355
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (11804 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski