Amino acid dipepetide frequency for Streptococcus phage phiS10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.253AlaAla: 3.253 ± 0.508
0.093AlaCys: 0.093 ± 0.095
6.227AlaAsp: 6.227 ± 1.085
6.691AlaGlu: 6.691 ± 0.945
2.881AlaPhe: 2.881 ± 0.575
4.647AlaGly: 4.647 ± 0.836
0.372AlaHis: 0.372 ± 0.217
5.576AlaIle: 5.576 ± 0.685
7.714AlaLys: 7.714 ± 0.86
5.669AlaLeu: 5.669 ± 0.538
1.952AlaMet: 1.952 ± 0.591
5.204AlaAsn: 5.204 ± 0.712
2.045AlaPro: 2.045 ± 0.382
3.067AlaGln: 3.067 ± 0.548
2.695AlaArg: 2.695 ± 0.556
4.275AlaSer: 4.275 ± 0.852
4.833AlaThr: 4.833 ± 0.726
6.134AlaVal: 6.134 ± 0.853
0.651AlaTrp: 0.651 ± 0.266
2.138AlaTyr: 2.138 ± 0.389
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.465CysAsp: 0.465 ± 0.221
0.743CysGlu: 0.743 ± 0.254
0.186CysPhe: 0.186 ± 0.129
0.743CysGly: 0.743 ± 0.33
0.0CysHis: 0.0 ± 0.0
0.279CysIle: 0.279 ± 0.132
0.465CysLys: 0.465 ± 0.217
0.279CysLeu: 0.279 ± 0.173
0.093CysMet: 0.093 ± 0.095
0.186CysAsn: 0.186 ± 0.11
0.372CysPro: 0.372 ± 0.19
0.279CysGln: 0.279 ± 0.169
0.372CysArg: 0.372 ± 0.23
0.558CysSer: 0.558 ± 0.262
0.186CysThr: 0.186 ± 0.127
0.279CysVal: 0.279 ± 0.146
0.0CysTrp: 0.0 ± 0.0
0.186CysTyr: 0.186 ± 0.126
0.0CysXaa: 0.0 ± 0.0
Asp
4.275AspAla: 4.275 ± 0.512
0.558AspCys: 0.558 ± 0.263
3.81AspAsp: 3.81 ± 0.764
5.762AspGlu: 5.762 ± 0.807
2.788AspPhe: 2.788 ± 0.634
4.275AspGly: 4.275 ± 0.471
0.651AspHis: 0.651 ± 0.301
4.182AspIle: 4.182 ± 0.694
5.669AspLys: 5.669 ± 0.826
4.275AspLeu: 4.275 ± 0.647
1.766AspMet: 1.766 ± 0.408
4.368AspAsn: 4.368 ± 0.5
1.394AspPro: 1.394 ± 0.344
0.929AspGln: 0.929 ± 0.272
2.416AspArg: 2.416 ± 0.297
3.996AspSer: 3.996 ± 0.619
3.067AspThr: 3.067 ± 0.598
4.461AspVal: 4.461 ± 0.51
1.394AspTrp: 1.394 ± 0.252
2.509AspTyr: 2.509 ± 0.494
0.0AspXaa: 0.0 ± 0.0
Glu
6.599GluAla: 6.599 ± 0.891
0.558GluCys: 0.558 ± 0.239
4.275GluAsp: 4.275 ± 0.72
7.342GluGlu: 7.342 ± 1.223
2.788GluPhe: 2.788 ± 0.512
3.16GluGly: 3.16 ± 0.503
1.301GluHis: 1.301 ± 0.413
6.691GluIle: 6.691 ± 0.696
6.506GluLys: 6.506 ± 0.982
9.108GluLeu: 9.108 ± 1.102
2.509GluMet: 2.509 ± 0.567
5.297GluAsn: 5.297 ± 0.678
2.138GluPro: 2.138 ± 0.476
4.554GluGln: 4.554 ± 0.836
4.833GluArg: 4.833 ± 0.773
4.926GluSer: 4.926 ± 0.653
4.089GluThr: 4.089 ± 0.68
4.833GluVal: 4.833 ± 0.611
1.208GluTrp: 1.208 ± 0.329
2.695GluTyr: 2.695 ± 0.549
0.0GluXaa: 0.0 ± 0.0
Phe
2.695PheAla: 2.695 ± 0.574
0.093PheCys: 0.093 ± 0.094
3.903PheAsp: 3.903 ± 0.585
3.903PheGlu: 3.903 ± 0.65
1.859PhePhe: 1.859 ± 0.367
2.509PheGly: 2.509 ± 0.441
0.558PheHis: 0.558 ± 0.256
1.673PheIle: 1.673 ± 0.543
3.346PheLys: 3.346 ± 0.56
2.045PheLeu: 2.045 ± 0.425
0.929PheMet: 0.929 ± 0.258
2.881PheAsn: 2.881 ± 0.532
1.115PhePro: 1.115 ± 0.508
1.301PheGln: 1.301 ± 0.351
1.301PheArg: 1.301 ± 0.317
3.067PheSer: 3.067 ± 0.626
2.416PheThr: 2.416 ± 0.415
2.602PheVal: 2.602 ± 0.458
0.372PheTrp: 0.372 ± 0.207
1.58PheTyr: 1.58 ± 0.313
0.0PheXaa: 0.0 ± 0.0
Gly
4.275GlyAla: 4.275 ± 0.878
0.279GlyCys: 0.279 ± 0.141
3.16GlyAsp: 3.16 ± 0.52
4.089GlyGlu: 4.089 ± 0.533
2.695GlyPhe: 2.695 ± 0.557
4.182GlyGly: 4.182 ± 1.152
0.929GlyHis: 0.929 ± 0.261
4.647GlyIle: 4.647 ± 0.808
5.204GlyLys: 5.204 ± 0.736
5.019GlyLeu: 5.019 ± 0.645
1.952GlyMet: 1.952 ± 0.428
3.253GlyAsn: 3.253 ± 0.476
0.836GlyPro: 0.836 ± 0.286
2.788GlyGln: 2.788 ± 0.479
3.253GlyArg: 3.253 ± 0.435
3.532GlySer: 3.532 ± 0.57
2.788GlyThr: 2.788 ± 0.753
3.717GlyVal: 3.717 ± 0.489
1.301GlyTrp: 1.301 ± 0.352
2.788GlyTyr: 2.788 ± 0.483
0.0GlyXaa: 0.0 ± 0.0
His
0.836HisAla: 0.836 ± 0.311
0.186HisCys: 0.186 ± 0.11
1.115HisAsp: 1.115 ± 0.34
1.487HisGlu: 1.487 ± 0.408
1.022HisPhe: 1.022 ± 0.339
1.58HisGly: 1.58 ± 0.453
0.558HisHis: 0.558 ± 0.178
0.743HisIle: 0.743 ± 0.239
0.836HisLys: 0.836 ± 0.317
1.115HisLeu: 1.115 ± 0.281
0.093HisMet: 0.093 ± 0.082
0.836HisAsn: 0.836 ± 0.33
0.465HisPro: 0.465 ± 0.224
0.558HisGln: 0.558 ± 0.269
0.558HisArg: 0.558 ± 0.235
0.465HisSer: 0.465 ± 0.192
0.558HisThr: 0.558 ± 0.206
0.651HisVal: 0.651 ± 0.238
0.372HisTrp: 0.372 ± 0.155
0.372HisTyr: 0.372 ± 0.254
0.0HisXaa: 0.0 ± 0.0
Ile
5.297IleAla: 5.297 ± 0.796
0.279IleCys: 0.279 ± 0.138
5.019IleAsp: 5.019 ± 0.573
5.855IleGlu: 5.855 ± 0.828
2.602IlePhe: 2.602 ± 0.746
3.16IleGly: 3.16 ± 0.998
1.022IleHis: 1.022 ± 0.247
3.903IleIle: 3.903 ± 0.529
4.74IleLys: 4.74 ± 0.652
5.855IleLeu: 5.855 ± 0.523
1.208IleMet: 1.208 ± 0.369
4.275IleAsn: 4.275 ± 0.437
1.487IlePro: 1.487 ± 0.284
1.952IleGln: 1.952 ± 0.497
4.461IleArg: 4.461 ± 0.523
3.067IleSer: 3.067 ± 0.502
3.625IleThr: 3.625 ± 0.698
3.16IleVal: 3.16 ± 0.643
1.022IleTrp: 1.022 ± 0.294
2.695IleTyr: 2.695 ± 0.536
0.0IleXaa: 0.0 ± 0.0
Lys
6.97LysAla: 6.97 ± 0.823
0.651LysCys: 0.651 ± 0.195
4.74LysAsp: 4.74 ± 0.613
7.993LysGlu: 7.993 ± 1.894
2.23LysPhe: 2.23 ± 0.454
4.089LysGly: 4.089 ± 0.696
0.836LysHis: 0.836 ± 0.291
4.647LysIle: 4.647 ± 0.724
5.669LysLys: 5.669 ± 0.84
6.413LysLeu: 6.413 ± 0.798
2.045LysMet: 2.045 ± 0.471
4.275LysAsn: 4.275 ± 0.492
3.346LysPro: 3.346 ± 0.495
2.881LysGln: 2.881 ± 0.503
4.647LysArg: 4.647 ± 0.753
4.74LysSer: 4.74 ± 0.59
4.926LysThr: 4.926 ± 0.677
5.855LysVal: 5.855 ± 0.72
0.929LysTrp: 0.929 ± 0.35
2.974LysTyr: 2.974 ± 0.57
0.0LysXaa: 0.0 ± 0.0
Leu
6.784LeuAla: 6.784 ± 0.831
0.372LeuCys: 0.372 ± 0.27
5.855LeuAsp: 5.855 ± 0.725
7.063LeuGlu: 7.063 ± 0.847
3.16LeuPhe: 3.16 ± 0.611
5.669LeuGly: 5.669 ± 0.936
0.929LeuHis: 0.929 ± 0.228
5.297LeuIle: 5.297 ± 0.843
8.829LeuLys: 8.829 ± 0.769
6.599LeuLeu: 6.599 ± 1.048
1.58LeuMet: 1.58 ± 0.33
3.346LeuAsn: 3.346 ± 0.547
1.952LeuPro: 1.952 ± 0.758
3.532LeuGln: 3.532 ± 0.535
4.461LeuArg: 4.461 ± 0.676
5.39LeuSer: 5.39 ± 0.674
3.81LeuThr: 3.81 ± 0.695
4.554LeuVal: 4.554 ± 0.643
0.465LeuTrp: 0.465 ± 0.208
2.695LeuTyr: 2.695 ± 0.512
0.0LeuXaa: 0.0 ± 0.0
Met
1.859MetAla: 1.859 ± 0.45
0.093MetCys: 0.093 ± 0.092
1.115MetAsp: 1.115 ± 0.232
1.115MetGlu: 1.115 ± 0.41
1.022MetPhe: 1.022 ± 0.281
1.301MetGly: 1.301 ± 0.383
0.372MetHis: 0.372 ± 0.213
1.673MetIle: 1.673 ± 0.384
1.766MetLys: 1.766 ± 0.422
1.673MetLeu: 1.673 ± 0.432
0.372MetMet: 0.372 ± 0.145
1.115MetAsn: 1.115 ± 0.358
0.651MetPro: 0.651 ± 0.278
0.558MetGln: 0.558 ± 0.207
0.651MetArg: 0.651 ± 0.343
2.416MetSer: 2.416 ± 0.452
2.509MetThr: 2.509 ± 0.527
1.673MetVal: 1.673 ± 0.4
0.186MetTrp: 0.186 ± 0.121
0.743MetTyr: 0.743 ± 0.222
0.0MetXaa: 0.0 ± 0.0
Asn
4.74AsnAla: 4.74 ± 0.906
0.279AsnCys: 0.279 ± 0.173
2.602AsnAsp: 2.602 ± 0.564
3.903AsnGlu: 3.903 ± 0.869
2.045AsnPhe: 2.045 ± 0.398
4.089AsnGly: 4.089 ± 1.007
1.208AsnHis: 1.208 ± 0.362
4.368AsnIle: 4.368 ± 0.667
3.717AsnLys: 3.717 ± 0.644
5.762AsnLeu: 5.762 ± 0.886
1.301AsnMet: 1.301 ± 0.317
2.881AsnAsn: 2.881 ± 0.605
1.115AsnPro: 1.115 ± 0.379
2.23AsnGln: 2.23 ± 0.588
3.16AsnArg: 3.16 ± 0.532
2.416AsnSer: 2.416 ± 0.455
3.16AsnThr: 3.16 ± 0.6
3.625AsnVal: 3.625 ± 0.712
1.115AsnTrp: 1.115 ± 0.327
1.952AsnTyr: 1.952 ± 0.41
0.0AsnXaa: 0.0 ± 0.0
Pro
2.138ProAla: 2.138 ± 0.418
0.0ProCys: 0.0 ± 0.0
1.673ProAsp: 1.673 ± 0.383
2.323ProGlu: 2.323 ± 0.473
1.394ProPhe: 1.394 ± 0.398
1.022ProGly: 1.022 ± 0.299
0.279ProHis: 0.279 ± 0.15
1.673ProIle: 1.673 ± 0.446
1.952ProLys: 1.952 ± 0.48
1.394ProLeu: 1.394 ± 0.341
0.372ProMet: 0.372 ± 0.154
1.394ProAsn: 1.394 ± 0.364
0.465ProPro: 0.465 ± 0.182
1.022ProGln: 1.022 ± 0.227
1.673ProArg: 1.673 ± 0.534
1.859ProSer: 1.859 ± 0.559
1.394ProThr: 1.394 ± 0.307
1.487ProVal: 1.487 ± 0.338
0.651ProTrp: 0.651 ± 0.265
1.673ProTyr: 1.673 ± 0.429
0.0ProXaa: 0.0 ± 0.0
Gln
4.461GlnAla: 4.461 ± 0.805
0.186GlnCys: 0.186 ± 0.149
1.022GlnAsp: 1.022 ± 0.301
2.695GlnGlu: 2.695 ± 0.538
1.673GlnPhe: 1.673 ± 0.325
2.323GlnGly: 2.323 ± 0.384
0.558GlnHis: 0.558 ± 0.239
2.788GlnIle: 2.788 ± 0.541
3.067GlnLys: 3.067 ± 0.532
3.996GlnLeu: 3.996 ± 0.581
0.929GlnMet: 0.929 ± 0.269
2.602GlnAsn: 2.602 ± 0.415
1.022GlnPro: 1.022 ± 0.284
2.602GlnGln: 2.602 ± 0.396
1.673GlnArg: 1.673 ± 0.48
2.509GlnSer: 2.509 ± 0.816
2.695GlnThr: 2.695 ± 0.403
2.881GlnVal: 2.881 ± 0.421
0.372GlnTrp: 0.372 ± 0.185
1.487GlnTyr: 1.487 ± 0.286
0.0GlnXaa: 0.0 ± 0.0
Arg
3.16ArgAla: 3.16 ± 0.585
0.279ArgCys: 0.279 ± 0.166
2.974ArgAsp: 2.974 ± 0.473
4.368ArgGlu: 4.368 ± 0.953
2.881ArgPhe: 2.881 ± 0.477
2.23ArgGly: 2.23 ± 0.414
0.743ArgHis: 0.743 ± 0.22
3.346ArgIle: 3.346 ± 0.427
3.16ArgLys: 3.16 ± 0.502
4.368ArgLeu: 4.368 ± 0.634
1.208ArgMet: 1.208 ± 0.415
2.788ArgAsn: 2.788 ± 0.437
1.301ArgPro: 1.301 ± 0.385
2.602ArgGln: 2.602 ± 0.498
2.416ArgArg: 2.416 ± 0.655
2.138ArgSer: 2.138 ± 0.376
3.346ArgThr: 3.346 ± 0.586
2.602ArgVal: 2.602 ± 0.436
1.115ArgTrp: 1.115 ± 0.303
2.23ArgTyr: 2.23 ± 0.493
0.0ArgXaa: 0.0 ± 0.0
Ser
5.297SerAla: 5.297 ± 0.697
0.186SerCys: 0.186 ± 0.138
3.903SerAsp: 3.903 ± 0.703
4.926SerGlu: 4.926 ± 0.684
2.416SerPhe: 2.416 ± 0.542
4.554SerGly: 4.554 ± 0.656
1.487SerHis: 1.487 ± 0.462
3.903SerIle: 3.903 ± 0.463
5.019SerLys: 5.019 ± 0.969
4.926SerLeu: 4.926 ± 0.94
1.394SerMet: 1.394 ± 0.407
2.509SerAsn: 2.509 ± 0.546
0.929SerPro: 0.929 ± 0.253
2.509SerGln: 2.509 ± 0.413
2.881SerArg: 2.881 ± 0.658
2.974SerSer: 2.974 ± 0.481
3.253SerThr: 3.253 ± 0.607
2.416SerVal: 2.416 ± 0.457
1.022SerTrp: 1.022 ± 0.314
2.788SerTyr: 2.788 ± 0.573
0.0SerXaa: 0.0 ± 0.0
Thr
5.112ThrAla: 5.112 ± 1.061
0.465ThrCys: 0.465 ± 0.206
3.532ThrAsp: 3.532 ± 0.621
4.089ThrGlu: 4.089 ± 0.647
2.602ThrPhe: 2.602 ± 0.553
3.81ThrGly: 3.81 ± 0.593
0.651ThrHis: 0.651 ± 0.24
3.532ThrIle: 3.532 ± 0.457
4.182ThrLys: 4.182 ± 0.704
4.74ThrLeu: 4.74 ± 0.535
0.651ThrMet: 0.651 ± 0.252
2.138ThrAsn: 2.138 ± 0.481
1.022ThrPro: 1.022 ± 0.263
2.602ThrGln: 2.602 ± 0.584
2.509ThrArg: 2.509 ± 0.43
3.996ThrSer: 3.996 ± 0.536
3.346ThrThr: 3.346 ± 0.742
4.926ThrVal: 4.926 ± 0.981
0.651ThrTrp: 0.651 ± 0.32
2.23ThrTyr: 2.23 ± 0.397
0.0ThrXaa: 0.0 ± 0.0
Val
5.39ValAla: 5.39 ± 0.69
0.651ValCys: 0.651 ± 0.283
4.74ValAsp: 4.74 ± 0.573
6.134ValGlu: 6.134 ± 0.68
1.301ValPhe: 1.301 ± 0.38
4.368ValGly: 4.368 ± 0.758
1.394ValHis: 1.394 ± 0.406
2.602ValIle: 2.602 ± 0.429
4.275ValLys: 4.275 ± 0.745
4.461ValLeu: 4.461 ± 0.707
1.301ValMet: 1.301 ± 0.401
3.625ValAsn: 3.625 ± 0.521
2.323ValPro: 2.323 ± 0.414
2.788ValGln: 2.788 ± 0.399
2.695ValArg: 2.695 ± 0.521
4.182ValSer: 4.182 ± 0.48
3.717ValThr: 3.717 ± 0.481
4.461ValVal: 4.461 ± 0.942
0.558ValTrp: 0.558 ± 0.266
2.509ValTyr: 2.509 ± 0.499
0.0ValXaa: 0.0 ± 0.0
Trp
0.558TrpAla: 0.558 ± 0.193
0.093TrpCys: 0.093 ± 0.09
0.372TrpAsp: 0.372 ± 0.199
1.673TrpGlu: 1.673 ± 0.502
0.836TrpPhe: 0.836 ± 0.331
0.929TrpGly: 0.929 ± 0.262
0.093TrpHis: 0.093 ± 0.085
0.558TrpIle: 0.558 ± 0.215
1.673TrpLys: 1.673 ± 0.428
1.394TrpLeu: 1.394 ± 0.354
0.372TrpMet: 0.372 ± 0.176
1.022TrpAsn: 1.022 ± 0.477
0.186TrpPro: 0.186 ± 0.133
0.836TrpGln: 0.836 ± 0.233
0.372TrpArg: 0.372 ± 0.17
0.558TrpSer: 0.558 ± 0.204
0.929TrpThr: 0.929 ± 0.269
1.115TrpVal: 1.115 ± 0.388
0.186TrpTrp: 0.186 ± 0.128
0.279TrpTyr: 0.279 ± 0.158
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.416TyrAla: 2.416 ± 0.479
0.372TyrCys: 0.372 ± 0.231
2.045TyrAsp: 2.045 ± 0.401
3.253TyrGlu: 3.253 ± 0.659
1.766TyrPhe: 1.766 ± 0.479
2.045TyrGly: 2.045 ± 0.464
0.465TyrHis: 0.465 ± 0.21
2.602TyrIle: 2.602 ± 0.4
3.067TyrLys: 3.067 ± 0.598
3.253TyrLeu: 3.253 ± 0.458
0.743TyrMet: 0.743 ± 0.309
1.673TyrAsn: 1.673 ± 0.403
1.673TyrPro: 1.673 ± 0.472
1.952TyrGln: 1.952 ± 0.416
2.23TyrArg: 2.23 ± 0.496
2.23TyrSer: 2.23 ± 0.364
2.138TyrThr: 2.138 ± 0.327
2.138TyrVal: 2.138 ± 0.376
0.465TyrTrp: 0.465 ± 0.247
1.859TyrTyr: 1.859 ± 0.599
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (10761 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski