Amino acid dipepetide frequency for Erysipelothrix phage SE-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.948AlaAla: 1.948 ± 0.748
0.292AlaCys: 0.292 ± 0.14
2.24AlaAsp: 2.24 ± 0.484
2.533AlaGlu: 2.533 ± 0.525
2.63AlaPhe: 2.63 ± 0.478
2.24AlaGly: 2.24 ± 0.795
1.559AlaHis: 1.559 ± 0.366
3.604AlaIle: 3.604 ± 0.94
4.286AlaLys: 4.286 ± 0.607
5.845AlaLeu: 5.845 ± 0.843
1.169AlaMet: 1.169 ± 0.335
2.825AlaAsn: 2.825 ± 0.528
1.656AlaPro: 1.656 ± 0.527
1.851AlaGln: 1.851 ± 0.324
1.364AlaArg: 1.364 ± 0.413
2.922AlaSer: 2.922 ± 0.743
2.143AlaThr: 2.143 ± 0.668
2.143AlaVal: 2.143 ± 0.608
0.779AlaTrp: 0.779 ± 0.38
2.046AlaTyr: 2.046 ± 0.434
0.0AlaXaa: 0.0 ± 0.0
Cys
0.682CysAla: 0.682 ± 0.256
0.195CysCys: 0.195 ± 0.133
0.584CysAsp: 0.584 ± 0.233
0.487CysGlu: 0.487 ± 0.249
0.39CysPhe: 0.39 ± 0.212
0.487CysGly: 0.487 ± 0.242
0.292CysHis: 0.292 ± 0.255
0.877CysIle: 0.877 ± 0.246
0.682CysLys: 0.682 ± 0.279
0.779CysLeu: 0.779 ± 0.233
0.097CysMet: 0.097 ± 0.08
0.584CysAsn: 0.584 ± 0.263
0.292CysPro: 0.292 ± 0.163
0.779CysGln: 0.779 ± 0.294
0.487CysArg: 0.487 ± 0.22
0.487CysSer: 0.487 ± 0.228
0.39CysThr: 0.39 ± 0.192
0.779CysVal: 0.779 ± 0.305
0.097CysTrp: 0.097 ± 0.09
0.292CysTyr: 0.292 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
2.63AspAla: 2.63 ± 0.555
0.39AspCys: 0.39 ± 0.214
4.286AspAsp: 4.286 ± 0.723
3.799AspGlu: 3.799 ± 0.53
2.922AspPhe: 2.922 ± 0.615
3.02AspGly: 3.02 ± 0.534
0.877AspHis: 0.877 ± 0.201
4.578AspIle: 4.578 ± 0.564
4.676AspLys: 4.676 ± 0.711
5.845AspLeu: 5.845 ± 0.723
0.682AspMet: 0.682 ± 0.21
2.922AspAsn: 2.922 ± 0.49
0.974AspPro: 0.974 ± 0.338
1.364AspGln: 1.364 ± 0.411
1.948AspArg: 1.948 ± 0.51
2.63AspSer: 2.63 ± 0.602
1.948AspThr: 1.948 ± 0.502
3.409AspVal: 3.409 ± 0.667
1.071AspTrp: 1.071 ± 0.36
4.481AspTyr: 4.481 ± 0.667
0.0AspXaa: 0.0 ± 0.0
Glu
3.799GluAla: 3.799 ± 0.862
0.487GluCys: 0.487 ± 0.202
3.02GluAsp: 3.02 ± 0.588
5.163GluGlu: 5.163 ± 0.776
3.312GluPhe: 3.312 ± 0.654
2.727GluGly: 2.727 ± 0.514
1.559GluHis: 1.559 ± 0.364
7.403GluIle: 7.403 ± 0.768
5.942GluLys: 5.942 ± 0.596
7.988GluLeu: 7.988 ± 1.115
1.753GluMet: 1.753 ± 0.453
4.189GluAsn: 4.189 ± 0.568
1.559GluPro: 1.559 ± 0.367
3.02GluGln: 3.02 ± 0.53
2.143GluArg: 2.143 ± 0.587
4.773GluSer: 4.773 ± 0.615
3.312GluThr: 3.312 ± 0.609
5.065GluVal: 5.065 ± 0.666
0.877GluTrp: 0.877 ± 0.236
3.507GluTyr: 3.507 ± 0.611
0.0GluXaa: 0.0 ± 0.0
Phe
2.338PheAla: 2.338 ± 0.354
0.584PheCys: 0.584 ± 0.3
2.922PheAsp: 2.922 ± 0.651
3.896PheGlu: 3.896 ± 0.558
1.461PhePhe: 1.461 ± 0.387
2.533PheGly: 2.533 ± 0.447
0.584PheHis: 0.584 ± 0.218
2.727PheIle: 2.727 ± 0.464
4.87PheLys: 4.87 ± 0.723
3.799PheLeu: 3.799 ± 0.633
1.169PheMet: 1.169 ± 0.367
3.214PheAsn: 3.214 ± 0.594
1.753PhePro: 1.753 ± 0.452
1.169PheGln: 1.169 ± 0.259
1.266PheArg: 1.266 ± 0.392
2.143PheSer: 2.143 ± 0.53
2.533PheThr: 2.533 ± 0.567
2.435PheVal: 2.435 ± 0.549
0.487PheTrp: 0.487 ± 0.218
2.338PheTyr: 2.338 ± 0.655
0.0PheXaa: 0.0 ± 0.0
Gly
2.435GlyAla: 2.435 ± 0.599
0.195GlyCys: 0.195 ± 0.141
2.435GlyAsp: 2.435 ± 0.434
2.24GlyGlu: 2.24 ± 0.43
2.435GlyPhe: 2.435 ± 0.394
2.24GlyGly: 2.24 ± 0.772
1.364GlyHis: 1.364 ± 0.39
5.163GlyIle: 5.163 ± 0.719
4.383GlyLys: 4.383 ± 0.562
4.676GlyLeu: 4.676 ± 0.667
1.851GlyMet: 1.851 ± 0.386
3.702GlyAsn: 3.702 ± 0.605
0.974GlyPro: 0.974 ± 0.256
1.266GlyGln: 1.266 ± 0.303
2.435GlyArg: 2.435 ± 0.553
2.727GlySer: 2.727 ± 0.546
2.338GlyThr: 2.338 ± 0.564
3.214GlyVal: 3.214 ± 0.886
0.39GlyTrp: 0.39 ± 0.18
2.24GlyTyr: 2.24 ± 0.388
0.0GlyXaa: 0.0 ± 0.0
His
1.169HisAla: 1.169 ± 0.336
0.097HisCys: 0.097 ± 0.099
1.364HisAsp: 1.364 ± 0.409
2.63HisGlu: 2.63 ± 0.436
0.877HisPhe: 0.877 ± 0.345
0.487HisGly: 0.487 ± 0.246
0.779HisHis: 0.779 ± 0.335
1.364HisIle: 1.364 ± 0.385
1.071HisLys: 1.071 ± 0.337
2.143HisLeu: 2.143 ± 0.578
0.195HisMet: 0.195 ± 0.152
1.169HisAsn: 1.169 ± 0.35
0.292HisPro: 0.292 ± 0.142
0.877HisGln: 0.877 ± 0.278
0.877HisArg: 0.877 ± 0.308
1.071HisSer: 1.071 ± 0.364
0.584HisThr: 0.584 ± 0.249
1.364HisVal: 1.364 ± 0.443
0.195HisTrp: 0.195 ± 0.127
0.779HisTyr: 0.779 ± 0.349
0.0HisXaa: 0.0 ± 0.0
Ile
3.994IleAla: 3.994 ± 0.663
1.364IleCys: 1.364 ± 0.349
5.552IleAsp: 5.552 ± 0.723
6.429IleGlu: 6.429 ± 0.526
2.922IlePhe: 2.922 ± 0.504
3.409IleGly: 3.409 ± 0.836
1.851IleHis: 1.851 ± 0.354
6.332IleIle: 6.332 ± 0.756
6.819IleLys: 6.819 ± 0.808
6.721IleLeu: 6.721 ± 1.039
2.435IleMet: 2.435 ± 0.478
5.455IleAsn: 5.455 ± 0.762
3.507IlePro: 3.507 ± 0.673
3.312IleGln: 3.312 ± 0.478
4.383IleArg: 4.383 ± 0.615
6.234IleSer: 6.234 ± 0.705
4.968IleThr: 4.968 ± 0.65
3.896IleVal: 3.896 ± 0.802
0.39IleTrp: 0.39 ± 0.234
3.214IleTyr: 3.214 ± 0.58
0.0IleXaa: 0.0 ± 0.0
Lys
3.896LysAla: 3.896 ± 0.608
0.974LysCys: 0.974 ± 0.299
3.994LysAsp: 3.994 ± 0.747
6.721LysGlu: 6.721 ± 0.926
2.533LysPhe: 2.533 ± 0.517
3.02LysGly: 3.02 ± 0.503
1.753LysHis: 1.753 ± 0.554
8.669LysIle: 8.669 ± 1.139
8.767LysLys: 8.767 ± 1.31
7.306LysLeu: 7.306 ± 1.054
1.851LysMet: 1.851 ± 0.426
7.988LysAsn: 7.988 ± 1.047
2.63LysPro: 2.63 ± 0.542
3.604LysGln: 3.604 ± 0.615
4.578LysArg: 4.578 ± 0.851
7.013LysSer: 7.013 ± 1.164
4.773LysThr: 4.773 ± 0.751
3.799LysVal: 3.799 ± 0.779
1.266LysTrp: 1.266 ± 0.391
3.896LysTyr: 3.896 ± 0.662
0.0LysXaa: 0.0 ± 0.0
Leu
4.091LeuAla: 4.091 ± 0.88
0.974LeuCys: 0.974 ± 0.272
4.578LeuAsp: 4.578 ± 0.584
9.059LeuGlu: 9.059 ± 1.346
4.091LeuPhe: 4.091 ± 0.634
5.163LeuGly: 5.163 ± 0.67
0.974LeuHis: 0.974 ± 0.287
7.111LeuIle: 7.111 ± 0.976
9.546LeuLys: 9.546 ± 1.081
7.5LeuLeu: 7.5 ± 0.78
2.338LeuMet: 2.338 ± 0.524
7.793LeuAsn: 7.793 ± 0.897
2.63LeuPro: 2.63 ± 0.345
2.338LeuGln: 2.338 ± 0.459
4.383LeuArg: 4.383 ± 0.661
6.819LeuSer: 6.819 ± 0.749
4.676LeuThr: 4.676 ± 0.577
4.578LeuVal: 4.578 ± 0.677
1.071LeuTrp: 1.071 ± 0.367
3.994LeuTyr: 3.994 ± 0.872
0.0LeuXaa: 0.0 ± 0.0
Met
1.071MetAla: 1.071 ± 0.255
0.39MetCys: 0.39 ± 0.207
0.584MetAsp: 0.584 ± 0.209
0.584MetGlu: 0.584 ± 0.177
0.974MetPhe: 0.974 ± 0.306
1.461MetGly: 1.461 ± 0.3
0.195MetHis: 0.195 ± 0.171
1.948MetIle: 1.948 ± 0.402
3.702MetLys: 3.702 ± 0.675
2.24MetLeu: 2.24 ± 0.443
0.39MetMet: 0.39 ± 0.186
2.435MetAsn: 2.435 ± 0.54
0.584MetPro: 0.584 ± 0.21
0.779MetGln: 0.779 ± 0.207
1.169MetArg: 1.169 ± 0.379
1.851MetSer: 1.851 ± 0.458
1.169MetThr: 1.169 ± 0.31
0.584MetVal: 0.584 ± 0.214
0.39MetTrp: 0.39 ± 0.25
0.974MetTyr: 0.974 ± 0.45
0.0MetXaa: 0.0 ± 0.0
Asn
3.117AsnAla: 3.117 ± 0.674
0.779AsnCys: 0.779 ± 0.347
4.091AsnAsp: 4.091 ± 0.649
5.455AsnGlu: 5.455 ± 0.62
2.143AsnPhe: 2.143 ± 0.433
4.87AsnGly: 4.87 ± 0.804
1.266AsnHis: 1.266 ± 0.433
5.65AsnIle: 5.65 ± 0.788
4.968AsnLys: 4.968 ± 0.91
6.137AsnLeu: 6.137 ± 1.144
1.169AsnMet: 1.169 ± 0.335
3.799AsnAsn: 3.799 ± 1.279
2.63AsnPro: 2.63 ± 0.479
2.727AsnGln: 2.727 ± 0.522
4.481AsnArg: 4.481 ± 0.814
4.578AsnSer: 4.578 ± 0.749
3.02AsnThr: 3.02 ± 0.463
4.968AsnVal: 4.968 ± 0.562
0.39AsnTrp: 0.39 ± 0.166
2.63AsnTyr: 2.63 ± 0.37
0.0AsnXaa: 0.0 ± 0.0
Pro
0.779ProAla: 0.779 ± 0.295
0.39ProCys: 0.39 ± 0.224
1.461ProAsp: 1.461 ± 0.37
1.461ProGlu: 1.461 ± 0.335
1.656ProPhe: 1.656 ± 0.501
1.071ProGly: 1.071 ± 0.321
0.487ProHis: 0.487 ± 0.181
2.24ProIle: 2.24 ± 0.374
2.435ProLys: 2.435 ± 0.645
3.409ProLeu: 3.409 ± 0.526
0.974ProMet: 0.974 ± 0.331
2.046ProAsn: 2.046 ± 0.373
0.097ProPro: 0.097 ± 0.099
1.071ProGln: 1.071 ± 0.365
0.974ProArg: 0.974 ± 0.303
1.851ProSer: 1.851 ± 0.361
1.948ProThr: 1.948 ± 0.476
1.461ProVal: 1.461 ± 0.358
0.292ProTrp: 0.292 ± 0.151
1.461ProTyr: 1.461 ± 0.369
0.0ProXaa: 0.0 ± 0.0
Gln
2.24GlnAla: 2.24 ± 0.583
0.39GlnCys: 0.39 ± 0.209
1.851GlnAsp: 1.851 ± 0.422
1.656GlnGlu: 1.656 ± 0.637
2.046GlnPhe: 2.046 ± 0.421
1.266GlnGly: 1.266 ± 0.31
0.682GlnHis: 0.682 ± 0.221
3.117GlnIle: 3.117 ± 0.494
3.994GlnLys: 3.994 ± 0.527
2.825GlnLeu: 2.825 ± 0.48
0.584GlnMet: 0.584 ± 0.216
2.143GlnAsn: 2.143 ± 0.459
0.584GlnPro: 0.584 ± 0.205
1.071GlnGln: 1.071 ± 0.342
1.461GlnArg: 1.461 ± 0.44
2.727GlnSer: 2.727 ± 0.498
1.948GlnThr: 1.948 ± 0.399
2.435GlnVal: 2.435 ± 0.525
0.487GlnTrp: 0.487 ± 0.205
1.559GlnTyr: 1.559 ± 0.389
0.0GlnXaa: 0.0 ± 0.0
Arg
1.948ArgAla: 1.948 ± 0.47
0.195ArgCys: 0.195 ± 0.125
2.24ArgAsp: 2.24 ± 0.535
2.435ArgGlu: 2.435 ± 0.546
2.435ArgPhe: 2.435 ± 0.757
1.266ArgGly: 1.266 ± 0.441
0.779ArgHis: 0.779 ± 0.243
4.286ArgIle: 4.286 ± 0.731
4.189ArgLys: 4.189 ± 0.65
4.286ArgLeu: 4.286 ± 0.726
1.364ArgMet: 1.364 ± 0.534
2.922ArgAsn: 2.922 ± 0.513
0.974ArgPro: 0.974 ± 0.327
0.974ArgGln: 0.974 ± 0.234
2.63ArgArg: 2.63 ± 0.506
3.02ArgSer: 3.02 ± 0.584
2.338ArgThr: 2.338 ± 0.464
2.922ArgVal: 2.922 ± 0.53
1.461ArgTrp: 1.461 ± 0.442
2.63ArgTyr: 2.63 ± 0.49
0.0ArgXaa: 0.0 ± 0.0
Ser
2.922SerAla: 2.922 ± 0.806
0.487SerCys: 0.487 ± 0.189
3.994SerAsp: 3.994 ± 0.464
5.26SerGlu: 5.26 ± 0.865
3.994SerPhe: 3.994 ± 0.527
3.896SerGly: 3.896 ± 0.693
1.169SerHis: 1.169 ± 0.467
4.87SerIle: 4.87 ± 0.764
6.624SerLys: 6.624 ± 0.976
4.481SerLeu: 4.481 ± 0.748
1.266SerMet: 1.266 ± 0.252
4.773SerAsn: 4.773 ± 0.592
0.974SerPro: 0.974 ± 0.272
2.727SerGln: 2.727 ± 0.58
2.922SerArg: 2.922 ± 0.522
5.552SerSer: 5.552 ± 0.926
3.409SerThr: 3.409 ± 0.479
6.624SerVal: 6.624 ± 0.886
0.682SerTrp: 0.682 ± 0.309
2.922SerTyr: 2.922 ± 0.529
0.0SerXaa: 0.0 ± 0.0
Thr
1.948ThrAla: 1.948 ± 0.614
0.292ThrCys: 0.292 ± 0.165
2.143ThrAsp: 2.143 ± 0.563
2.63ThrGlu: 2.63 ± 0.501
1.559ThrPhe: 1.559 ± 0.371
2.046ThrGly: 2.046 ± 0.534
0.974ThrHis: 0.974 ± 0.294
5.065ThrIle: 5.065 ± 0.674
4.189ThrLys: 4.189 ± 0.747
5.163ThrLeu: 5.163 ± 0.546
1.266ThrMet: 1.266 ± 0.347
3.702ThrAsn: 3.702 ± 0.615
1.461ThrPro: 1.461 ± 0.351
2.046ThrGln: 2.046 ± 0.521
2.338ThrArg: 2.338 ± 0.456
3.994ThrSer: 3.994 ± 0.662
3.214ThrThr: 3.214 ± 0.595
3.409ThrVal: 3.409 ± 0.845
0.584ThrTrp: 0.584 ± 0.228
2.825ThrTyr: 2.825 ± 0.494
0.0ThrXaa: 0.0 ± 0.0
Val
2.63ValAla: 2.63 ± 0.843
0.487ValCys: 0.487 ± 0.218
3.214ValAsp: 3.214 ± 0.463
5.163ValGlu: 5.163 ± 0.745
3.117ValPhe: 3.117 ± 0.71
3.604ValGly: 3.604 ± 0.68
1.071ValHis: 1.071 ± 0.292
4.286ValIle: 4.286 ± 0.616
4.481ValLys: 4.481 ± 1.006
5.65ValLeu: 5.65 ± 0.935
1.948ValMet: 1.948 ± 0.432
3.409ValAsn: 3.409 ± 0.643
1.948ValPro: 1.948 ± 0.441
2.338ValGln: 2.338 ± 0.47
2.727ValArg: 2.727 ± 0.654
4.091ValSer: 4.091 ± 0.589
3.117ValThr: 3.117 ± 0.607
3.507ValVal: 3.507 ± 0.582
0.584ValTrp: 0.584 ± 0.318
3.02ValTyr: 3.02 ± 0.521
0.0ValXaa: 0.0 ± 0.0
Trp
0.487TrpAla: 0.487 ± 0.172
0.195TrpCys: 0.195 ± 0.132
0.584TrpAsp: 0.584 ± 0.245
0.877TrpGlu: 0.877 ± 0.249
0.877TrpPhe: 0.877 ± 0.357
0.487TrpGly: 0.487 ± 0.255
0.195TrpHis: 0.195 ± 0.158
0.877TrpIle: 0.877 ± 0.247
0.584TrpLys: 0.584 ± 0.268
1.266TrpLeu: 1.266 ± 0.433
0.487TrpMet: 0.487 ± 0.21
0.974TrpAsn: 0.974 ± 0.354
0.0TrpPro: 0.0 ± 0.0
0.584TrpGln: 0.584 ± 0.2
0.487TrpArg: 0.487 ± 0.261
0.877TrpSer: 0.877 ± 0.278
0.584TrpThr: 0.584 ± 0.201
1.071TrpVal: 1.071 ± 0.239
0.0TrpTrp: 0.0 ± 0.0
0.584TrpTyr: 0.584 ± 0.195
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.948TyrAla: 1.948 ± 0.516
0.584TyrCys: 0.584 ± 0.249
3.312TyrAsp: 3.312 ± 0.497
3.117TyrGlu: 3.117 ± 0.52
1.851TyrPhe: 1.851 ± 0.386
3.409TyrGly: 3.409 ± 0.615
0.974TyrHis: 0.974 ± 0.343
3.02TyrIle: 3.02 ± 0.535
2.825TyrLys: 2.825 ± 0.625
5.65TyrLeu: 5.65 ± 0.69
0.487TyrMet: 0.487 ± 0.198
2.825TyrAsn: 2.825 ± 0.626
2.046TyrPro: 2.046 ± 0.58
1.169TyrGln: 1.169 ± 0.315
2.24TyrArg: 2.24 ± 0.449
4.189TyrSer: 4.189 ± 0.87
2.435TyrThr: 2.435 ± 0.504
2.727TyrVal: 2.727 ± 0.543
0.584TyrTrp: 0.584 ± 0.252
1.364TyrTyr: 1.364 ± 0.316
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 43 proteins (10267 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski