Amino acid dipepetide frequency for Cronobacter phage ES2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.674AlaAla: 11.674 ± 1.585
0.64AlaCys: 0.64 ± 0.325
5.437AlaAsp: 5.437 ± 0.932
5.437AlaGlu: 5.437 ± 1.155
2.559AlaPhe: 2.559 ± 0.807
7.037AlaGly: 7.037 ± 1.036
1.119AlaHis: 1.119 ± 0.391
5.757AlaIle: 5.757 ± 0.971
6.397AlaLys: 6.397 ± 0.782
9.595AlaLeu: 9.595 ± 1.042
3.518AlaMet: 3.518 ± 0.783
4.958AlaAsn: 4.958 ± 1.278
2.399AlaPro: 2.399 ± 0.771
3.998AlaGln: 3.998 ± 0.996
6.077AlaArg: 6.077 ± 1.026
5.917AlaSer: 5.917 ± 1.301
4.798AlaThr: 4.798 ± 0.73
5.437AlaVal: 5.437 ± 0.935
0.96AlaTrp: 0.96 ± 0.434
3.039AlaTyr: 3.039 ± 0.695
0.0AlaXaa: 0.0 ± 0.0
Cys
0.64CysAla: 0.64 ± 0.373
0.48CysCys: 0.48 ± 0.284
0.96CysAsp: 0.96 ± 0.452
0.8CysGlu: 0.8 ± 0.324
0.16CysPhe: 0.16 ± 0.158
1.119CysGly: 1.119 ± 0.542
0.16CysHis: 0.16 ± 0.191
0.8CysIle: 0.8 ± 0.329
1.119CysLys: 1.119 ± 0.674
0.96CysLeu: 0.96 ± 0.402
0.64CysMet: 0.64 ± 0.35
0.8CysAsn: 0.8 ± 0.366
0.64CysPro: 0.64 ± 0.366
0.48CysGln: 0.48 ± 0.39
0.48CysArg: 0.48 ± 0.274
0.96CysSer: 0.96 ± 0.461
0.16CysThr: 0.16 ± 0.138
0.48CysVal: 0.48 ± 0.276
0.48CysTrp: 0.48 ± 0.266
0.48CysTyr: 0.48 ± 0.271
0.0CysXaa: 0.0 ± 0.0
Asp
6.717AspAla: 6.717 ± 0.943
0.64AspCys: 0.64 ± 0.417
4.638AspAsp: 4.638 ± 0.749
3.678AspGlu: 3.678 ± 0.716
1.599AspPhe: 1.599 ± 0.52
4.638AspGly: 4.638 ± 0.934
1.119AspHis: 1.119 ± 0.486
3.198AspIle: 3.198 ± 0.857
3.358AspLys: 3.358 ± 0.853
4.158AspLeu: 4.158 ± 1.095
1.759AspMet: 1.759 ± 0.46
1.279AspAsn: 1.279 ± 0.354
2.399AspPro: 2.399 ± 0.56
2.399AspGln: 2.399 ± 0.547
3.838AspArg: 3.838 ± 0.687
3.198AspSer: 3.198 ± 0.53
2.559AspThr: 2.559 ± 0.598
4.318AspVal: 4.318 ± 0.838
0.32AspTrp: 0.32 ± 0.264
3.198AspTyr: 3.198 ± 0.685
0.0AspXaa: 0.0 ± 0.0
Glu
5.277GluAla: 5.277 ± 1.295
1.599GluCys: 1.599 ± 0.594
2.239GluAsp: 2.239 ± 0.757
4.478GluGlu: 4.478 ± 0.904
2.719GluPhe: 2.719 ± 0.678
3.518GluGly: 3.518 ± 0.721
1.119GluHis: 1.119 ± 0.361
3.838GluIle: 3.838 ± 0.681
4.478GluLys: 4.478 ± 0.986
5.118GluLeu: 5.118 ± 0.983
1.599GluMet: 1.599 ± 0.638
2.559GluAsn: 2.559 ± 0.595
2.079GluPro: 2.079 ± 0.761
4.638GluGln: 4.638 ± 1.106
4.158GluArg: 4.158 ± 0.973
4.318GluSer: 4.318 ± 0.664
3.518GluThr: 3.518 ± 0.911
4.958GluVal: 4.958 ± 1.14
0.64GluTrp: 0.64 ± 0.307
2.239GluTyr: 2.239 ± 0.537
0.0GluXaa: 0.0 ± 0.0
Phe
3.998PheAla: 3.998 ± 0.743
0.48PheCys: 0.48 ± 0.344
1.919PheAsp: 1.919 ± 0.537
1.599PheGlu: 1.599 ± 0.578
1.599PhePhe: 1.599 ± 0.517
1.599PheGly: 1.599 ± 0.57
1.279PheHis: 1.279 ± 0.451
1.439PheIle: 1.439 ± 0.473
1.919PheLys: 1.919 ± 0.522
1.759PheLeu: 1.759 ± 0.444
0.64PheMet: 0.64 ± 0.283
2.239PheAsn: 2.239 ± 0.649
0.8PhePro: 0.8 ± 0.373
0.64PheGln: 0.64 ± 0.294
2.079PheArg: 2.079 ± 0.584
2.399PheSer: 2.399 ± 0.685
3.838PheThr: 3.838 ± 0.868
1.279PheVal: 1.279 ± 0.436
1.119PheTrp: 1.119 ± 0.46
0.48PheTyr: 0.48 ± 0.286
0.0PheXaa: 0.0 ± 0.0
Gly
6.077GlyAla: 6.077 ± 0.903
1.279GlyCys: 1.279 ± 0.637
4.638GlyAsp: 4.638 ± 0.888
5.277GlyGlu: 5.277 ± 0.832
3.358GlyPhe: 3.358 ± 0.774
4.958GlyGly: 4.958 ± 1.032
2.079GlyHis: 2.079 ± 0.548
3.678GlyIle: 3.678 ± 0.653
4.958GlyLys: 4.958 ± 0.901
5.757GlyLeu: 5.757 ± 0.963
1.599GlyMet: 1.599 ± 0.507
3.039GlyAsn: 3.039 ± 0.606
1.119GlyPro: 1.119 ± 0.403
1.599GlyGln: 1.599 ± 0.541
5.277GlyArg: 5.277 ± 0.733
3.998GlySer: 3.998 ± 0.683
2.719GlyThr: 2.719 ± 0.928
5.437GlyVal: 5.437 ± 1.125
0.8GlyTrp: 0.8 ± 0.474
3.838GlyTyr: 3.838 ± 0.669
0.0GlyXaa: 0.0 ± 0.0
His
1.599HisAla: 1.599 ± 0.464
0.32HisCys: 0.32 ± 0.217
1.439HisAsp: 1.439 ± 0.359
0.96HisGlu: 0.96 ± 0.426
0.48HisPhe: 0.48 ± 0.305
1.439HisGly: 1.439 ± 0.404
0.64HisHis: 0.64 ± 0.311
1.279HisIle: 1.279 ± 0.419
0.96HisLys: 0.96 ± 0.475
1.599HisLeu: 1.599 ± 0.427
1.119HisMet: 1.119 ± 0.395
1.119HisAsn: 1.119 ± 0.401
1.439HisPro: 1.439 ± 0.393
0.32HisGln: 0.32 ± 0.226
1.759HisArg: 1.759 ± 0.524
0.48HisSer: 0.48 ± 0.269
0.96HisThr: 0.96 ± 0.406
0.64HisVal: 0.64 ± 0.276
0.16HisTrp: 0.16 ± 0.139
1.439HisTyr: 1.439 ± 0.457
0.0HisXaa: 0.0 ± 0.0
Ile
3.998IleAla: 3.998 ± 0.859
0.16IleCys: 0.16 ± 0.162
5.118IleAsp: 5.118 ± 0.894
5.277IleGlu: 5.277 ± 0.966
0.32IlePhe: 0.32 ± 0.188
3.998IleGly: 3.998 ± 0.782
0.96IleHis: 0.96 ± 0.307
2.879IleIle: 2.879 ± 0.76
3.518IleLys: 3.518 ± 0.686
3.838IleLeu: 3.838 ± 0.898
1.119IleMet: 1.119 ± 0.378
3.678IleAsn: 3.678 ± 0.633
2.399IlePro: 2.399 ± 0.524
1.919IleGln: 1.919 ± 0.51
3.998IleArg: 3.998 ± 0.78
4.798IleSer: 4.798 ± 1.381
3.198IleThr: 3.198 ± 0.607
3.678IleVal: 3.678 ± 0.773
0.64IleTrp: 0.64 ± 0.264
0.32IleTyr: 0.32 ± 0.197
0.0IleXaa: 0.0 ± 0.0
Lys
5.437LysAla: 5.437 ± 0.992
0.64LysCys: 0.64 ± 0.326
2.079LysAsp: 2.079 ± 0.556
3.518LysGlu: 3.518 ± 0.802
2.719LysPhe: 2.719 ± 0.507
4.158LysGly: 4.158 ± 0.952
1.599LysHis: 1.599 ± 0.475
2.559LysIle: 2.559 ± 0.502
2.719LysLys: 2.719 ± 0.607
5.757LysLeu: 5.757 ± 1.035
1.439LysMet: 1.439 ± 0.461
2.559LysAsn: 2.559 ± 0.811
2.079LysPro: 2.079 ± 0.444
2.399LysGln: 2.399 ± 0.769
4.318LysArg: 4.318 ± 0.924
3.838LysSer: 3.838 ± 0.833
2.719LysThr: 2.719 ± 0.667
3.998LysVal: 3.998 ± 0.634
0.8LysTrp: 0.8 ± 0.324
2.239LysTyr: 2.239 ± 0.555
0.0LysXaa: 0.0 ± 0.0
Leu
8.156LeuAla: 8.156 ± 1.462
0.96LeuCys: 0.96 ± 0.429
4.478LeuAsp: 4.478 ± 0.733
4.638LeuGlu: 4.638 ± 0.825
1.919LeuPhe: 1.919 ± 0.454
3.998LeuGly: 3.998 ± 0.824
1.119LeuHis: 1.119 ± 0.309
5.917LeuIle: 5.917 ± 1.033
4.798LeuLys: 4.798 ± 1.07
5.757LeuLeu: 5.757 ± 1.08
1.919LeuMet: 1.919 ± 0.462
4.158LeuAsn: 4.158 ± 0.982
3.518LeuPro: 3.518 ± 0.575
2.719LeuGln: 2.719 ± 0.61
5.757LeuArg: 5.757 ± 0.929
6.237LeuSer: 6.237 ± 0.866
5.597LeuThr: 5.597 ± 0.859
4.318LeuVal: 4.318 ± 0.764
0.8LeuTrp: 0.8 ± 0.4
2.559LeuTyr: 2.559 ± 0.761
0.0LeuXaa: 0.0 ± 0.0
Met
2.239MetAla: 2.239 ± 0.596
0.32MetCys: 0.32 ± 0.239
1.119MetAsp: 1.119 ± 0.388
0.8MetGlu: 0.8 ± 0.471
0.48MetPhe: 0.48 ± 0.26
2.239MetGly: 2.239 ± 0.723
0.48MetHis: 0.48 ± 0.272
1.599MetIle: 1.599 ± 0.721
1.599MetLys: 1.599 ± 0.417
2.399MetLeu: 2.399 ± 0.703
1.119MetMet: 1.119 ± 0.5
1.439MetAsn: 1.439 ± 0.524
0.8MetPro: 0.8 ± 0.352
1.759MetGln: 1.759 ± 0.454
0.8MetArg: 0.8 ± 0.391
2.239MetSer: 2.239 ± 0.599
2.559MetThr: 2.559 ± 0.652
1.759MetVal: 1.759 ± 0.43
0.64MetTrp: 0.64 ± 0.457
0.64MetTyr: 0.64 ± 0.276
0.0MetXaa: 0.0 ± 0.0
Asn
3.518AsnAla: 3.518 ± 0.585
1.119AsnCys: 1.119 ± 0.391
2.719AsnAsp: 2.719 ± 0.597
2.399AsnGlu: 2.399 ± 0.619
0.8AsnPhe: 0.8 ± 0.348
5.917AsnGly: 5.917 ± 0.783
1.119AsnHis: 1.119 ± 0.346
2.239AsnIle: 2.239 ± 0.562
2.719AsnLys: 2.719 ± 0.641
4.638AsnLeu: 4.638 ± 0.754
1.599AsnMet: 1.599 ± 0.607
1.439AsnAsn: 1.439 ± 0.438
1.599AsnPro: 1.599 ± 0.435
2.879AsnGln: 2.879 ± 0.605
2.399AsnArg: 2.399 ± 0.639
4.318AsnSer: 4.318 ± 0.914
1.759AsnThr: 1.759 ± 0.487
2.559AsnVal: 2.559 ± 0.717
0.32AsnTrp: 0.32 ± 0.204
1.119AsnTyr: 1.119 ± 0.386
0.0AsnXaa: 0.0 ± 0.0
Pro
3.678ProAla: 3.678 ± 0.898
0.16ProCys: 0.16 ± 0.164
3.678ProAsp: 3.678 ± 0.676
2.399ProGlu: 2.399 ± 0.585
1.119ProPhe: 1.119 ± 0.405
1.279ProGly: 1.279 ± 0.553
0.32ProHis: 0.32 ± 0.201
2.239ProIle: 2.239 ± 0.618
0.96ProLys: 0.96 ± 0.374
3.998ProLeu: 3.998 ± 0.913
0.8ProMet: 0.8 ± 0.382
1.439ProAsn: 1.439 ± 0.414
0.96ProPro: 0.96 ± 0.332
1.919ProGln: 1.919 ± 0.485
1.599ProArg: 1.599 ± 0.526
2.719ProSer: 2.719 ± 0.545
2.079ProThr: 2.079 ± 0.468
3.039ProVal: 3.039 ± 0.637
0.16ProTrp: 0.16 ± 0.17
0.96ProTyr: 0.96 ± 0.424
0.0ProXaa: 0.0 ± 0.0
Gln
4.638GlnAla: 4.638 ± 0.837
0.32GlnCys: 0.32 ± 0.24
1.279GlnAsp: 1.279 ± 0.459
3.678GlnGlu: 3.678 ± 0.528
1.279GlnPhe: 1.279 ± 0.541
3.838GlnGly: 3.838 ± 0.693
1.279GlnHis: 1.279 ± 0.368
3.518GlnIle: 3.518 ± 0.694
2.239GlnLys: 2.239 ± 0.634
3.039GlnLeu: 3.039 ± 0.703
1.439GlnMet: 1.439 ± 0.4
1.599GlnAsn: 1.599 ± 0.431
1.119GlnPro: 1.119 ± 0.365
1.919GlnGln: 1.919 ± 0.715
1.919GlnArg: 1.919 ± 0.494
3.198GlnSer: 3.198 ± 0.64
1.759GlnThr: 1.759 ± 0.506
2.559GlnVal: 2.559 ± 0.638
0.8GlnTrp: 0.8 ± 0.425
1.279GlnTyr: 1.279 ± 0.552
0.0GlnXaa: 0.0 ± 0.0
Arg
5.437ArgAla: 5.437 ± 0.638
1.279ArgCys: 1.279 ± 0.453
3.838ArgAsp: 3.838 ± 0.728
6.397ArgGlu: 6.397 ± 1.359
2.399ArgPhe: 2.399 ± 0.569
3.039ArgGly: 3.039 ± 0.749
0.8ArgHis: 0.8 ± 0.305
3.518ArgIle: 3.518 ± 0.707
4.158ArgLys: 4.158 ± 1.104
5.917ArgLeu: 5.917 ± 1.027
1.279ArgMet: 1.279 ± 0.411
3.039ArgAsn: 3.039 ± 0.797
1.599ArgPro: 1.599 ± 0.543
2.719ArgGln: 2.719 ± 0.607
5.437ArgArg: 5.437 ± 0.935
2.399ArgSer: 2.399 ± 0.637
2.399ArgThr: 2.399 ± 0.607
4.318ArgVal: 4.318 ± 0.87
1.439ArgTrp: 1.439 ± 0.398
1.279ArgTyr: 1.279 ± 0.503
0.0ArgXaa: 0.0 ± 0.0
Ser
5.597SerAla: 5.597 ± 0.732
0.48SerCys: 0.48 ± 0.256
4.798SerAsp: 4.798 ± 1.329
3.998SerGlu: 3.998 ± 0.835
2.399SerPhe: 2.399 ± 0.635
6.237SerGly: 6.237 ± 0.981
1.439SerHis: 1.439 ± 0.576
3.358SerIle: 3.358 ± 0.754
2.559SerLys: 2.559 ± 0.61
4.638SerLeu: 4.638 ± 0.884
1.759SerMet: 1.759 ± 0.413
3.358SerAsn: 3.358 ± 0.66
3.678SerPro: 3.678 ± 0.619
3.198SerGln: 3.198 ± 0.843
4.478SerArg: 4.478 ± 0.649
4.958SerSer: 4.958 ± 1.088
4.638SerThr: 4.638 ± 0.965
3.998SerVal: 3.998 ± 0.829
0.48SerTrp: 0.48 ± 0.263
2.079SerTyr: 2.079 ± 0.583
0.0SerXaa: 0.0 ± 0.0
Thr
7.676ThrAla: 7.676 ± 1.138
0.16ThrCys: 0.16 ± 0.155
2.879ThrAsp: 2.879 ± 0.74
4.478ThrGlu: 4.478 ± 0.884
2.239ThrPhe: 2.239 ± 0.504
5.118ThrGly: 5.118 ± 0.731
1.279ThrHis: 1.279 ± 0.466
2.239ThrIle: 2.239 ± 0.729
3.039ThrLys: 3.039 ± 0.739
3.358ThrLeu: 3.358 ± 0.612
1.119ThrMet: 1.119 ± 0.435
2.719ThrAsn: 2.719 ± 0.813
2.719ThrPro: 2.719 ± 0.549
1.439ThrGln: 1.439 ± 0.452
3.518ThrArg: 3.518 ± 0.644
3.998ThrSer: 3.998 ± 0.809
2.719ThrThr: 2.719 ± 0.453
3.518ThrVal: 3.518 ± 0.768
1.119ThrTrp: 1.119 ± 0.402
1.279ThrTyr: 1.279 ± 0.408
0.0ThrXaa: 0.0 ± 0.0
Val
6.557ValAla: 6.557 ± 1.559
0.8ValCys: 0.8 ± 0.347
3.678ValAsp: 3.678 ± 0.71
4.318ValGlu: 4.318 ± 0.746
2.559ValPhe: 2.559 ± 0.611
4.478ValGly: 4.478 ± 0.863
0.96ValHis: 0.96 ± 0.354
3.678ValIle: 3.678 ± 0.851
3.198ValLys: 3.198 ± 0.741
3.518ValLeu: 3.518 ± 0.815
1.759ValMet: 1.759 ± 0.449
3.998ValAsn: 3.998 ± 0.859
1.919ValPro: 1.919 ± 0.416
2.879ValGln: 2.879 ± 0.561
1.919ValArg: 1.919 ± 0.534
5.118ValSer: 5.118 ± 1.008
5.437ValThr: 5.437 ± 1.253
4.798ValVal: 4.798 ± 0.898
0.96ValTrp: 0.96 ± 0.352
1.759ValTyr: 1.759 ± 0.514
0.0ValXaa: 0.0 ± 0.0
Trp
1.599TrpAla: 1.599 ± 0.645
0.32TrpCys: 0.32 ± 0.219
0.64TrpAsp: 0.64 ± 0.332
0.16TrpGlu: 0.16 ± 0.152
0.48TrpPhe: 0.48 ± 0.244
0.16TrpGly: 0.16 ± 0.138
0.48TrpHis: 0.48 ± 0.259
0.48TrpIle: 0.48 ± 0.312
1.119TrpLys: 1.119 ± 0.392
1.119TrpLeu: 1.119 ± 0.496
0.16TrpMet: 0.16 ± 0.149
0.48TrpAsn: 0.48 ± 0.234
0.96TrpPro: 0.96 ± 0.476
0.64TrpGln: 0.64 ± 0.285
0.64TrpArg: 0.64 ± 0.287
0.8TrpSer: 0.8 ± 0.408
1.439TrpThr: 1.439 ± 0.442
1.279TrpVal: 1.279 ± 0.565
0.32TrpTrp: 0.32 ± 0.213
0.16TrpTyr: 0.16 ± 0.138
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.239TyrAla: 2.239 ± 0.537
0.64TyrCys: 0.64 ± 0.265
1.439TyrAsp: 1.439 ± 0.463
0.8TyrGlu: 0.8 ± 0.347
1.919TyrPhe: 1.919 ± 0.388
2.719TyrGly: 2.719 ± 0.655
0.8TyrHis: 0.8 ± 0.336
1.599TyrIle: 1.599 ± 0.509
1.759TyrLys: 1.759 ± 0.37
2.399TyrLeu: 2.399 ± 0.701
0.48TyrMet: 0.48 ± 0.239
1.279TyrAsn: 1.279 ± 0.377
1.279TyrPro: 1.279 ± 0.372
2.239TyrGln: 2.239 ± 0.533
2.239TyrArg: 2.239 ± 0.622
2.399TyrSer: 2.399 ± 0.711
1.759TyrThr: 1.759 ± 0.438
1.919TyrVal: 1.919 ± 0.541
0.48TyrTrp: 0.48 ± 0.296
0.96TyrTyr: 0.96 ± 0.337
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 30 proteins (6254 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski