Amino acid dipepetide frequency for Enterococcus phage EFAP-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.633AlaAla: 0.633 ± 0.334
0.0AlaCys: 0.0 ± 0.0
3.321AlaAsp: 3.321 ± 0.878
3.637AlaGlu: 3.637 ± 0.599
3.163AlaPhe: 3.163 ± 0.902
3.004AlaGly: 3.004 ± 0.599
0.949AlaHis: 0.949 ± 0.48
6.009AlaIle: 6.009 ± 1.382
6.325AlaLys: 6.325 ± 1.331
6.958AlaLeu: 6.958 ± 1.116
3.004AlaMet: 3.004 ± 0.932
4.902AlaAsn: 4.902 ± 0.915
2.056AlaPro: 2.056 ± 0.517
2.372AlaGln: 2.372 ± 0.42
2.214AlaArg: 2.214 ± 0.398
4.902AlaSer: 4.902 ± 1.32
4.586AlaThr: 4.586 ± 0.912
4.744AlaVal: 4.744 ± 1.196
0.633AlaTrp: 0.633 ± 0.301
2.688AlaTyr: 2.688 ± 0.493
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.474CysAsp: 0.474 ± 0.279
0.949CysGlu: 0.949 ± 0.457
0.0CysPhe: 0.0 ± 0.0
0.316CysGly: 0.316 ± 0.247
0.158CysHis: 0.158 ± 0.203
0.158CysIle: 0.158 ± 0.13
0.474CysLys: 0.474 ± 0.313
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.316CysAsn: 0.316 ± 0.316
0.0CysPro: 0.0 ± 0.0
0.158CysGln: 0.158 ± 0.203
0.158CysArg: 0.158 ± 0.203
0.316CysSer: 0.316 ± 0.274
0.0CysThr: 0.0 ± 0.0
0.158CysVal: 0.158 ± 0.194
0.316CysTrp: 0.316 ± 0.198
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.111AspAla: 4.111 ± 0.764
0.158AspCys: 0.158 ± 0.177
2.372AspAsp: 2.372 ± 0.838
4.269AspGlu: 4.269 ± 0.839
3.637AspPhe: 3.637 ± 0.711
6.325AspGly: 6.325 ± 0.689
0.949AspHis: 0.949 ± 0.387
5.376AspIle: 5.376 ± 1.274
4.744AspLys: 4.744 ± 0.61
5.534AspLeu: 5.534 ± 0.901
2.372AspMet: 2.372 ± 0.648
3.479AspAsn: 3.479 ± 0.846
2.214AspPro: 2.214 ± 0.918
2.056AspGln: 2.056 ± 0.432
1.739AspArg: 1.739 ± 0.498
3.321AspSer: 3.321 ± 0.618
2.214AspThr: 2.214 ± 0.634
2.846AspVal: 2.846 ± 0.739
0.791AspTrp: 0.791 ± 0.297
2.688AspTyr: 2.688 ± 0.758
0.0AspXaa: 0.0 ± 0.0
Glu
5.693GluAla: 5.693 ± 1.18
0.474GluCys: 0.474 ± 0.293
3.479GluAsp: 3.479 ± 1.107
7.432GluGlu: 7.432 ± 3.231
2.372GluPhe: 2.372 ± 0.878
4.428GluGly: 4.428 ± 0.749
1.739GluHis: 1.739 ± 0.654
3.321GluIle: 3.321 ± 0.632
5.534GluLys: 5.534 ± 0.95
7.432GluLeu: 7.432 ± 1.069
2.372GluMet: 2.372 ± 0.692
3.321GluAsn: 3.321 ± 0.489
2.214GluPro: 2.214 ± 1.008
3.637GluGln: 3.637 ± 0.772
2.372GluArg: 2.372 ± 0.735
4.111GluSer: 4.111 ± 0.596
3.953GluThr: 3.953 ± 0.786
5.376GluVal: 5.376 ± 0.925
1.581GluTrp: 1.581 ± 0.473
2.372GluTyr: 2.372 ± 0.79
0.0GluXaa: 0.0 ± 0.0
Phe
1.739PheAla: 1.739 ± 0.341
0.0PheCys: 0.0 ± 0.0
3.321PheAsp: 3.321 ± 0.559
1.898PheGlu: 1.898 ± 0.409
1.898PhePhe: 1.898 ± 0.676
3.637PheGly: 3.637 ± 1.043
0.316PheHis: 0.316 ± 0.274
3.479PheIle: 3.479 ± 0.939
3.953PheLys: 3.953 ± 0.931
2.53PheLeu: 2.53 ± 0.574
0.949PheMet: 0.949 ± 0.511
3.163PheAsn: 3.163 ± 0.853
0.633PhePro: 0.633 ± 0.349
2.214PheGln: 2.214 ± 0.662
1.581PheArg: 1.581 ± 0.533
2.372PheSer: 2.372 ± 0.495
2.688PheThr: 2.688 ± 0.649
2.688PheVal: 2.688 ± 0.523
0.633PheTrp: 0.633 ± 0.3
0.949PheTyr: 0.949 ± 0.401
0.0PheXaa: 0.0 ± 0.0
Gly
6.009GlyAla: 6.009 ± 1.601
0.0GlyCys: 0.0 ± 0.0
4.269GlyAsp: 4.269 ± 0.638
3.953GlyGlu: 3.953 ± 0.865
4.428GlyPhe: 4.428 ± 0.73
6.483GlyGly: 6.483 ± 1.98
0.633GlyHis: 0.633 ± 0.366
6.799GlyIle: 6.799 ± 1.456
6.325GlyLys: 6.325 ± 1.173
7.432GlyLeu: 7.432 ± 1.842
1.581GlyMet: 1.581 ± 0.459
4.269GlyAsn: 4.269 ± 0.839
1.107GlyPro: 1.107 ± 0.462
2.056GlyGln: 2.056 ± 0.511
2.688GlyArg: 2.688 ± 0.542
4.269GlySer: 4.269 ± 0.884
4.428GlyThr: 4.428 ± 0.952
5.376GlyVal: 5.376 ± 0.988
0.949GlyTrp: 0.949 ± 0.31
2.214GlyTyr: 2.214 ± 0.663
0.0GlyXaa: 0.0 ± 0.0
His
0.474HisAla: 0.474 ± 0.289
0.158HisCys: 0.158 ± 0.203
0.633HisAsp: 0.633 ± 0.309
0.791HisGlu: 0.791 ± 0.466
1.107HisPhe: 1.107 ± 0.537
0.474HisGly: 0.474 ± 0.167
0.316HisHis: 0.316 ± 0.298
0.633HisIle: 0.633 ± 0.281
0.474HisLys: 0.474 ± 0.298
0.633HisLeu: 0.633 ± 0.246
0.158HisMet: 0.158 ± 0.193
0.474HisAsn: 0.474 ± 0.37
0.474HisPro: 0.474 ± 0.251
1.265HisGln: 1.265 ± 0.416
1.107HisArg: 1.107 ± 0.654
0.474HisSer: 0.474 ± 0.402
1.107HisThr: 1.107 ± 0.745
1.581HisVal: 1.581 ± 0.34
0.0HisTrp: 0.0 ± 0.0
0.316HisTyr: 0.316 ± 0.279
0.0HisXaa: 0.0 ± 0.0
Ile
4.269IleAla: 4.269 ± 0.809
0.474IleCys: 0.474 ± 0.28
4.586IleAsp: 4.586 ± 0.733
5.218IleGlu: 5.218 ± 1.423
2.214IlePhe: 2.214 ± 0.81
3.321IleGly: 3.321 ± 0.711
0.316IleHis: 0.316 ± 0.263
3.795IleIle: 3.795 ± 0.627
6.483IleLys: 6.483 ± 1.009
4.428IleLeu: 4.428 ± 0.748
1.581IleMet: 1.581 ± 0.638
4.744IleAsn: 4.744 ± 0.822
3.637IlePro: 3.637 ± 0.444
2.846IleGln: 2.846 ± 0.762
2.056IleArg: 2.056 ± 0.51
4.111IleSer: 4.111 ± 0.798
4.586IleThr: 4.586 ± 0.673
3.637IleVal: 3.637 ± 0.646
0.791IleTrp: 0.791 ± 0.36
1.265IleTyr: 1.265 ± 0.525
0.0IleXaa: 0.0 ± 0.0
Lys
6.799LysAla: 6.799 ± 1.382
0.316LysCys: 0.316 ± 0.258
5.376LysAsp: 5.376 ± 1.098
8.223LysGlu: 8.223 ± 2.01
2.846LysPhe: 2.846 ± 0.545
5.06LysGly: 5.06 ± 1.134
0.791LysHis: 0.791 ± 0.268
3.953LysIle: 3.953 ± 0.945
4.111LysLys: 4.111 ± 0.744
6.325LysLeu: 6.325 ± 1.137
2.372LysMet: 2.372 ± 0.596
4.902LysAsn: 4.902 ± 1.244
2.846LysPro: 2.846 ± 0.692
3.163LysGln: 3.163 ± 0.56
3.479LysArg: 3.479 ± 0.909
5.376LysSer: 5.376 ± 1.349
4.428LysThr: 4.428 ± 0.661
5.534LysVal: 5.534 ± 0.861
0.633LysTrp: 0.633 ± 0.259
2.688LysTyr: 2.688 ± 0.494
0.0LysXaa: 0.0 ± 0.0
Leu
4.269LeuAla: 4.269 ± 0.921
0.316LeuCys: 0.316 ± 0.318
7.274LeuAsp: 7.274 ± 0.829
7.116LeuGlu: 7.116 ± 1.341
2.056LeuPhe: 2.056 ± 0.633
7.59LeuGly: 7.59 ± 1.404
0.633LeuHis: 0.633 ± 0.345
3.795LeuIle: 3.795 ± 1.082
6.167LeuLys: 6.167 ± 0.956
5.693LeuLeu: 5.693 ± 0.906
1.581LeuMet: 1.581 ± 0.41
4.744LeuAsn: 4.744 ± 0.909
3.163LeuPro: 3.163 ± 0.649
4.111LeuGln: 4.111 ± 0.804
1.898LeuArg: 1.898 ± 0.6
6.958LeuSer: 6.958 ± 0.792
4.744LeuThr: 4.744 ± 1.032
4.902LeuVal: 4.902 ± 1.325
0.791LeuTrp: 0.791 ± 0.316
1.739LeuTyr: 1.739 ± 0.771
0.0LeuXaa: 0.0 ± 0.0
Met
1.265MetAla: 1.265 ± 0.453
0.316MetCys: 0.316 ± 0.229
1.739MetAsp: 1.739 ± 0.527
2.372MetGlu: 2.372 ± 0.882
1.107MetPhe: 1.107 ± 0.412
2.53MetGly: 2.53 ± 0.832
0.158MetHis: 0.158 ± 0.159
2.372MetIle: 2.372 ± 0.67
1.423MetLys: 1.423 ± 0.535
1.107MetLeu: 1.107 ± 0.388
0.158MetMet: 0.158 ± 0.117
2.688MetAsn: 2.688 ± 0.683
0.791MetPro: 0.791 ± 0.326
1.107MetGln: 1.107 ± 0.325
1.898MetArg: 1.898 ± 0.543
1.423MetSer: 1.423 ± 0.585
1.581MetThr: 1.581 ± 0.595
2.53MetVal: 2.53 ± 0.615
1.265MetTrp: 1.265 ± 0.482
1.107MetTyr: 1.107 ± 0.396
0.0MetXaa: 0.0 ± 0.0
Asn
5.06AsnAla: 5.06 ± 0.8
0.158AsnCys: 0.158 ± 0.13
2.688AsnAsp: 2.688 ± 0.692
3.637AsnGlu: 3.637 ± 0.7
2.214AsnPhe: 2.214 ± 0.651
6.167AsnGly: 6.167 ± 1.277
0.316AsnHis: 0.316 ± 0.189
2.688AsnIle: 2.688 ± 0.502
4.744AsnLys: 4.744 ± 0.777
4.111AsnLeu: 4.111 ± 0.731
1.581AsnMet: 1.581 ± 0.625
3.637AsnAsn: 3.637 ± 0.897
2.056AsnPro: 2.056 ± 0.573
1.739AsnGln: 1.739 ± 0.454
2.214AsnArg: 2.214 ± 0.65
3.321AsnSer: 3.321 ± 0.696
6.167AsnThr: 6.167 ± 0.941
4.269AsnVal: 4.269 ± 1.431
1.265AsnTrp: 1.265 ± 0.418
2.688AsnTyr: 2.688 ± 0.878
0.0AsnXaa: 0.0 ± 0.0
Pro
2.056ProAla: 2.056 ± 0.627
0.0ProCys: 0.0 ± 0.0
3.163ProAsp: 3.163 ± 0.662
3.321ProGlu: 3.321 ± 0.874
0.791ProPhe: 0.791 ± 0.328
0.316ProGly: 0.316 ± 0.142
0.158ProHis: 0.158 ± 0.13
1.898ProIle: 1.898 ± 0.444
3.321ProLys: 3.321 ± 1.183
1.898ProLeu: 1.898 ± 0.497
1.423ProMet: 1.423 ± 0.435
2.214ProAsn: 2.214 ± 0.538
0.316ProPro: 0.316 ± 0.233
1.107ProGln: 1.107 ± 0.495
1.265ProArg: 1.265 ± 0.461
1.898ProSer: 1.898 ± 0.461
2.53ProThr: 2.53 ± 0.885
2.372ProVal: 2.372 ± 0.637
0.316ProTrp: 0.316 ± 0.192
1.423ProTyr: 1.423 ± 0.678
0.0ProXaa: 0.0 ± 0.0
Gln
2.688GlnAla: 2.688 ± 0.566
0.316GlnCys: 0.316 ± 0.262
1.265GlnAsp: 1.265 ± 0.445
3.163GlnGlu: 3.163 ± 0.821
1.107GlnPhe: 1.107 ± 0.415
2.688GlnGly: 2.688 ± 0.644
0.474GlnHis: 0.474 ± 0.3
3.321GlnIle: 3.321 ± 0.594
2.372GlnLys: 2.372 ± 0.834
4.586GlnLeu: 4.586 ± 1.064
1.581GlnMet: 1.581 ± 0.535
1.265GlnAsn: 1.265 ± 0.464
1.107GlnPro: 1.107 ± 0.598
3.321GlnGln: 3.321 ± 0.756
1.423GlnArg: 1.423 ± 0.582
3.004GlnSer: 3.004 ± 0.631
2.688GlnThr: 2.688 ± 0.707
3.479GlnVal: 3.479 ± 0.838
0.633GlnTrp: 0.633 ± 0.322
2.056GlnTyr: 2.056 ± 0.449
0.0GlnXaa: 0.0 ± 0.0
Arg
3.163ArgAla: 3.163 ± 0.366
0.158ArgCys: 0.158 ± 0.203
1.739ArgAsp: 1.739 ± 0.411
1.265ArgGlu: 1.265 ± 0.324
2.056ArgPhe: 2.056 ± 0.339
2.214ArgGly: 2.214 ± 0.705
0.791ArgHis: 0.791 ± 0.517
2.214ArgIle: 2.214 ± 0.53
3.479ArgLys: 3.479 ± 0.983
2.846ArgLeu: 2.846 ± 1.142
0.633ArgMet: 0.633 ± 0.282
2.846ArgAsn: 2.846 ± 0.745
1.265ArgPro: 1.265 ± 0.598
1.739ArgGln: 1.739 ± 0.597
1.898ArgArg: 1.898 ± 0.394
1.898ArgSer: 1.898 ± 0.664
2.372ArgThr: 2.372 ± 0.717
2.372ArgVal: 2.372 ± 1.055
0.316ArgTrp: 0.316 ± 0.216
1.265ArgTyr: 1.265 ± 0.605
0.0ArgXaa: 0.0 ± 0.0
Ser
4.586SerAla: 4.586 ± 1.176
0.158SerCys: 0.158 ± 0.13
2.372SerAsp: 2.372 ± 0.49
2.372SerGlu: 2.372 ± 0.726
2.688SerPhe: 2.688 ± 0.51
6.325SerGly: 6.325 ± 1.222
1.898SerHis: 1.898 ± 0.514
4.111SerIle: 4.111 ± 0.736
4.902SerLys: 4.902 ± 1.159
5.06SerLeu: 5.06 ± 0.904
1.898SerMet: 1.898 ± 0.605
3.637SerAsn: 3.637 ± 0.898
1.581SerPro: 1.581 ± 0.693
3.004SerGln: 3.004 ± 0.71
1.423SerArg: 1.423 ± 0.658
3.953SerSer: 3.953 ± 1.233
5.218SerThr: 5.218 ± 1.309
3.637SerVal: 3.637 ± 0.974
1.107SerTrp: 1.107 ± 0.31
1.581SerTyr: 1.581 ± 0.828
0.0SerXaa: 0.0 ± 0.0
Thr
4.111ThrAla: 4.111 ± 0.656
0.0ThrCys: 0.0 ± 0.0
3.321ThrAsp: 3.321 ± 0.854
4.902ThrGlu: 4.902 ± 0.866
2.056ThrPhe: 2.056 ± 0.599
6.009ThrGly: 6.009 ± 0.91
1.265ThrHis: 1.265 ± 0.389
3.479ThrIle: 3.479 ± 0.857
5.534ThrLys: 5.534 ± 0.905
4.902ThrLeu: 4.902 ± 1.058
2.056ThrMet: 2.056 ± 0.614
2.214ThrAsn: 2.214 ± 0.629
3.004ThrPro: 3.004 ± 0.679
2.688ThrGln: 2.688 ± 0.922
2.846ThrArg: 2.846 ± 0.515
3.163ThrSer: 3.163 ± 0.882
5.693ThrThr: 5.693 ± 1.385
5.06ThrVal: 5.06 ± 0.761
0.316ThrTrp: 0.316 ± 0.142
2.056ThrTyr: 2.056 ± 0.926
0.0ThrXaa: 0.0 ± 0.0
Val
6.325ValAla: 6.325 ± 1.23
0.316ValCys: 0.316 ± 0.276
6.799ValAsp: 6.799 ± 0.872
3.953ValGlu: 3.953 ± 1.144
2.53ValPhe: 2.53 ± 0.516
5.218ValGly: 5.218 ± 0.878
0.474ValHis: 0.474 ± 0.267
3.637ValIle: 3.637 ± 0.628
5.534ValLys: 5.534 ± 1.407
4.902ValLeu: 4.902 ± 0.806
2.214ValMet: 2.214 ± 0.435
4.428ValAsn: 4.428 ± 1.145
2.056ValPro: 2.056 ± 0.457
2.688ValGln: 2.688 ± 0.782
3.004ValArg: 3.004 ± 0.827
5.06ValSer: 5.06 ± 0.793
3.637ValThr: 3.637 ± 0.896
4.902ValVal: 4.902 ± 1.014
0.791ValTrp: 0.791 ± 0.283
1.898ValTyr: 1.898 ± 0.628
0.0ValXaa: 0.0 ± 0.0
Trp
0.791TrpAla: 0.791 ± 0.424
0.316TrpCys: 0.316 ± 0.305
0.949TrpAsp: 0.949 ± 0.427
1.739TrpGlu: 1.739 ± 0.484
0.791TrpPhe: 0.791 ± 0.266
1.107TrpGly: 1.107 ± 0.499
0.158TrpHis: 0.158 ± 0.13
0.633TrpIle: 0.633 ± 0.423
0.316TrpLys: 0.316 ± 0.215
0.949TrpLeu: 0.949 ± 0.394
0.316TrpMet: 0.316 ± 0.235
0.633TrpAsn: 0.633 ± 0.467
0.0TrpPro: 0.0 ± 0.0
0.316TrpGln: 0.316 ± 0.242
0.633TrpArg: 0.633 ± 0.295
0.474TrpSer: 0.474 ± 0.204
0.316TrpThr: 0.316 ± 0.192
2.372TrpVal: 2.372 ± 0.585
0.316TrpTrp: 0.316 ± 0.142
0.316TrpTyr: 0.316 ± 0.238
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.056TyrAla: 2.056 ± 0.692
0.316TyrCys: 0.316 ± 0.316
2.056TyrAsp: 2.056 ± 0.788
3.004TyrGlu: 3.004 ± 0.899
1.739TyrPhe: 1.739 ± 0.566
2.056TyrGly: 2.056 ± 0.451
0.158TyrHis: 0.158 ± 0.117
2.688TyrIle: 2.688 ± 0.662
3.004TyrLys: 3.004 ± 0.651
2.214TyrLeu: 2.214 ± 0.529
0.949TyrMet: 0.949 ± 0.346
2.846TyrAsn: 2.846 ± 0.529
1.265TyrPro: 1.265 ± 0.539
0.949TyrGln: 0.949 ± 0.472
0.633TyrArg: 0.633 ± 0.248
1.107TyrSer: 1.107 ± 0.55
1.739TyrThr: 1.739 ± 0.716
2.53TyrVal: 2.53 ± 1.043
0.0TyrTrp: 0.0 ± 0.0
2.056TyrTyr: 2.056 ± 0.804
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (6325 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski