Amino acid dipepetide frequency for Streptococcus phage Javan347

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.376AlaAla: 2.376 ± 0.703
0.648AlaCys: 0.648 ± 0.236
4.859AlaAsp: 4.859 ± 0.762
5.723AlaGlu: 5.723 ± 0.654
1.728AlaPhe: 1.728 ± 0.364
2.915AlaGly: 2.915 ± 0.542
0.648AlaHis: 0.648 ± 0.2
5.939AlaIle: 5.939 ± 1.239
5.615AlaLys: 5.615 ± 0.903
7.019AlaLeu: 7.019 ± 1.058
2.268AlaMet: 2.268 ± 0.487
3.455AlaAsn: 3.455 ± 0.558
1.62AlaPro: 1.62 ± 0.389
2.484AlaGln: 2.484 ± 0.605
3.239AlaArg: 3.239 ± 0.589
3.563AlaSer: 3.563 ± 0.5
3.563AlaThr: 3.563 ± 0.7
4.211AlaVal: 4.211 ± 0.569
1.08AlaTrp: 1.08 ± 0.38
2.592AlaTyr: 2.592 ± 0.453
0.0AlaXaa: 0.0 ± 0.0
Cys
0.216CysAla: 0.216 ± 0.135
0.0CysCys: 0.0 ± 0.0
0.216CysAsp: 0.216 ± 0.16
0.54CysGlu: 0.54 ± 0.278
0.108CysPhe: 0.108 ± 0.119
0.108CysGly: 0.108 ± 0.085
0.324CysHis: 0.324 ± 0.181
0.432CysIle: 0.432 ± 0.234
0.216CysLys: 0.216 ± 0.12
0.972CysLeu: 0.972 ± 0.39
0.0CysMet: 0.0 ± 0.0
0.216CysAsn: 0.216 ± 0.138
0.0CysPro: 0.0 ± 0.0
0.216CysGln: 0.216 ± 0.17
0.108CysArg: 0.108 ± 0.112
0.324CysSer: 0.324 ± 0.252
0.216CysThr: 0.216 ± 0.168
0.324CysVal: 0.324 ± 0.189
0.108CysTrp: 0.108 ± 0.119
0.432CysTyr: 0.432 ± 0.182
0.0CysXaa: 0.0 ± 0.0
Asp
3.563AspAla: 3.563 ± 0.698
0.0AspCys: 0.0 ± 0.0
3.131AspAsp: 3.131 ± 0.635
4.319AspGlu: 4.319 ± 0.786
3.131AspPhe: 3.131 ± 0.612
5.183AspGly: 5.183 ± 0.954
0.756AspHis: 0.756 ± 0.307
5.075AspIle: 5.075 ± 0.958
4.751AspLys: 4.751 ± 0.794
4.751AspLeu: 4.751 ± 0.572
1.512AspMet: 1.512 ± 0.379
3.563AspAsn: 3.563 ± 0.512
1.944AspPro: 1.944 ± 0.508
0.648AspGln: 0.648 ± 0.215
2.592AspArg: 2.592 ± 0.487
3.455AspSer: 3.455 ± 0.494
3.023AspThr: 3.023 ± 0.55
3.887AspVal: 3.887 ± 0.456
1.188AspTrp: 1.188 ± 0.363
3.779AspTyr: 3.779 ± 0.667
0.0AspXaa: 0.0 ± 0.0
Glu
5.615GluAla: 5.615 ± 0.952
0.54GluCys: 0.54 ± 0.273
4.535GluAsp: 4.535 ± 0.612
8.206GluGlu: 8.206 ± 1.06
3.023GluPhe: 3.023 ± 0.399
2.699GluGly: 2.699 ± 0.474
0.864GluHis: 0.864 ± 0.308
5.939GluIle: 5.939 ± 0.676
8.098GluLys: 8.098 ± 1.296
9.502GluLeu: 9.502 ± 1.124
2.484GluMet: 2.484 ± 0.73
5.183GluAsn: 5.183 ± 0.94
1.836GluPro: 1.836 ± 0.422
3.131GluGln: 3.131 ± 0.81
3.023GluArg: 3.023 ± 0.519
3.563GluSer: 3.563 ± 0.495
4.211GluThr: 4.211 ± 0.589
5.291GluVal: 5.291 ± 0.931
1.404GluTrp: 1.404 ± 0.349
2.376GluTyr: 2.376 ± 0.506
0.0GluXaa: 0.0 ± 0.0
Phe
2.807PheAla: 2.807 ± 0.459
0.108PheCys: 0.108 ± 0.115
4.211PheAsp: 4.211 ± 0.846
2.915PheGlu: 2.915 ± 0.452
1.62PhePhe: 1.62 ± 0.431
2.268PheGly: 2.268 ± 0.421
0.54PheHis: 0.54 ± 0.269
2.915PheIle: 2.915 ± 0.538
3.131PheLys: 3.131 ± 0.659
2.592PheLeu: 2.592 ± 0.639
1.62PheMet: 1.62 ± 0.555
3.239PheAsn: 3.239 ± 0.476
0.648PhePro: 0.648 ± 0.298
1.296PheGln: 1.296 ± 0.447
1.08PheArg: 1.08 ± 0.283
2.699PheSer: 2.699 ± 0.426
2.807PheThr: 2.807 ± 0.413
2.268PheVal: 2.268 ± 0.564
0.756PheTrp: 0.756 ± 0.293
1.188PheTyr: 1.188 ± 0.363
0.0PheXaa: 0.0 ± 0.0
Gly
3.995GlyAla: 3.995 ± 0.953
0.108GlyCys: 0.108 ± 0.119
3.779GlyAsp: 3.779 ± 0.539
4.535GlyGlu: 4.535 ± 0.651
2.16GlyPhe: 2.16 ± 0.43
4.211GlyGly: 4.211 ± 0.682
1.08GlyHis: 1.08 ± 0.318
5.075GlyIle: 5.075 ± 0.715
6.155GlyLys: 6.155 ± 0.64
5.507GlyLeu: 5.507 ± 1.212
1.404GlyMet: 1.404 ± 0.429
2.484GlyAsn: 2.484 ± 0.419
0.216GlyPro: 0.216 ± 0.138
2.699GlyGln: 2.699 ± 0.607
2.16GlyArg: 2.16 ± 0.515
3.995GlySer: 3.995 ± 0.859
3.239GlyThr: 3.239 ± 0.522
3.239GlyVal: 3.239 ± 0.447
1.512GlyTrp: 1.512 ± 0.494
2.699GlyTyr: 2.699 ± 0.558
0.0GlyXaa: 0.0 ± 0.0
His
0.54HisAla: 0.54 ± 0.175
0.108HisCys: 0.108 ± 0.101
0.648HisAsp: 0.648 ± 0.243
0.972HisGlu: 0.972 ± 0.27
1.188HisPhe: 1.188 ± 0.33
0.432HisGly: 0.432 ± 0.213
0.108HisHis: 0.108 ± 0.134
1.512HisIle: 1.512 ± 0.364
0.756HisLys: 0.756 ± 0.297
0.972HisLeu: 0.972 ± 0.296
0.432HisMet: 0.432 ± 0.223
0.756HisAsn: 0.756 ± 0.205
0.216HisPro: 0.216 ± 0.153
0.216HisGln: 0.216 ± 0.162
0.216HisArg: 0.216 ± 0.138
0.756HisSer: 0.756 ± 0.323
1.188HisThr: 1.188 ± 0.377
1.188HisVal: 1.188 ± 0.305
0.0HisTrp: 0.0 ± 0.0
0.864HisTyr: 0.864 ± 0.357
0.0HisXaa: 0.0 ± 0.0
Ile
5.399IleAla: 5.399 ± 0.843
0.648IleCys: 0.648 ± 0.34
5.399IleAsp: 5.399 ± 0.887
5.939IleGlu: 5.939 ± 1.14
3.023IlePhe: 3.023 ± 0.735
4.211IleGly: 4.211 ± 0.752
0.972IleHis: 0.972 ± 0.285
4.103IleIle: 4.103 ± 0.835
5.399IleLys: 5.399 ± 0.818
5.507IleLeu: 5.507 ± 0.714
1.296IleMet: 1.296 ± 0.439
4.751IleAsn: 4.751 ± 0.756
2.699IlePro: 2.699 ± 0.496
3.563IleGln: 3.563 ± 0.556
2.484IleArg: 2.484 ± 0.458
4.967IleSer: 4.967 ± 0.648
3.455IleThr: 3.455 ± 0.547
2.592IleVal: 2.592 ± 0.498
0.972IleTrp: 0.972 ± 0.408
2.699IleTyr: 2.699 ± 0.614
0.0IleXaa: 0.0 ± 0.0
Lys
5.723LysAla: 5.723 ± 0.84
0.216LysCys: 0.216 ± 0.139
4.967LysAsp: 4.967 ± 1.039
9.07LysGlu: 9.07 ± 1.085
2.052LysPhe: 2.052 ± 0.413
4.319LysGly: 4.319 ± 0.763
0.972LysHis: 0.972 ± 0.285
5.183LysIle: 5.183 ± 0.739
6.695LysLys: 6.695 ± 0.887
6.803LysLeu: 6.803 ± 0.882
3.239LysMet: 3.239 ± 0.638
4.751LysAsn: 4.751 ± 0.822
2.16LysPro: 2.16 ± 0.552
3.995LysGln: 3.995 ± 0.48
4.319LysArg: 4.319 ± 0.798
5.507LysSer: 5.507 ± 0.553
5.723LysThr: 5.723 ± 0.691
5.075LysVal: 5.075 ± 0.591
1.08LysTrp: 1.08 ± 0.412
3.131LysTyr: 3.131 ± 0.697
0.0LysXaa: 0.0 ± 0.0
Leu
5.723LeuAla: 5.723 ± 0.813
0.432LeuCys: 0.432 ± 0.216
6.263LeuAsp: 6.263 ± 0.611
6.587LeuGlu: 6.587 ± 0.99
3.239LeuPhe: 3.239 ± 0.624
6.155LeuGly: 6.155 ± 0.937
1.512LeuHis: 1.512 ± 0.379
5.075LeuIle: 5.075 ± 0.59
8.962LeuLys: 8.962 ± 0.923
6.047LeuLeu: 6.047 ± 0.661
2.376LeuMet: 2.376 ± 0.632
5.291LeuAsn: 5.291 ± 0.815
3.131LeuPro: 3.131 ± 0.713
2.16LeuGln: 2.16 ± 0.585
3.455LeuArg: 3.455 ± 0.692
6.803LeuSer: 6.803 ± 0.923
5.183LeuThr: 5.183 ± 0.879
5.183LeuVal: 5.183 ± 0.723
1.08LeuTrp: 1.08 ± 0.351
2.699LeuTyr: 2.699 ± 0.506
0.0LeuXaa: 0.0 ± 0.0
Met
1.512MetAla: 1.512 ± 0.357
0.216MetCys: 0.216 ± 0.155
1.62MetAsp: 1.62 ± 0.445
2.16MetGlu: 2.16 ± 0.62
0.54MetPhe: 0.54 ± 0.286
0.972MetGly: 0.972 ± 0.326
0.0MetHis: 0.0 ± 0.0
1.728MetIle: 1.728 ± 0.456
3.023MetLys: 3.023 ± 0.568
1.62MetLeu: 1.62 ± 0.341
0.864MetMet: 0.864 ± 0.292
2.16MetAsn: 2.16 ± 0.592
0.972MetPro: 0.972 ± 0.438
0.648MetGln: 0.648 ± 0.231
0.756MetArg: 0.756 ± 0.341
1.944MetSer: 1.944 ± 0.611
2.052MetThr: 2.052 ± 0.484
1.728MetVal: 1.728 ± 0.353
0.108MetTrp: 0.108 ± 0.115
0.216MetTyr: 0.216 ± 0.163
0.0MetXaa: 0.0 ± 0.0
Asn
2.807AsnAla: 2.807 ± 0.557
0.108AsnCys: 0.108 ± 0.115
3.023AsnAsp: 3.023 ± 0.858
4.859AsnGlu: 4.859 ± 0.704
2.807AsnPhe: 2.807 ± 0.513
5.507AsnGly: 5.507 ± 0.944
1.08AsnHis: 1.08 ± 0.363
3.563AsnIle: 3.563 ± 0.558
4.103AsnLys: 4.103 ± 0.618
5.183AsnLeu: 5.183 ± 0.722
0.756AsnMet: 0.756 ± 0.317
3.131AsnAsn: 3.131 ± 0.58
3.131AsnPro: 3.131 ± 0.482
3.023AsnGln: 3.023 ± 0.718
2.807AsnArg: 2.807 ± 0.461
3.779AsnSer: 3.779 ± 0.662
2.807AsnThr: 2.807 ± 0.35
3.995AsnVal: 3.995 ± 0.741
0.756AsnTrp: 0.756 ± 0.25
1.836AsnTyr: 1.836 ± 0.434
0.0AsnXaa: 0.0 ± 0.0
Pro
2.052ProAla: 2.052 ± 0.42
0.432ProCys: 0.432 ± 0.212
1.944ProAsp: 1.944 ± 0.486
1.512ProGlu: 1.512 ± 0.431
1.08ProPhe: 1.08 ± 0.331
1.404ProGly: 1.404 ± 0.35
0.54ProHis: 0.54 ± 0.218
1.08ProIle: 1.08 ± 0.347
2.484ProLys: 2.484 ± 0.719
1.836ProLeu: 1.836 ± 0.431
0.432ProMet: 0.432 ± 0.222
1.08ProAsn: 1.08 ± 0.359
0.432ProPro: 0.432 ± 0.232
2.376ProGln: 2.376 ± 0.489
0.756ProArg: 0.756 ± 0.271
2.16ProSer: 2.16 ± 0.469
1.512ProThr: 1.512 ± 0.377
2.484ProVal: 2.484 ± 0.414
0.54ProTrp: 0.54 ± 0.256
1.08ProTyr: 1.08 ± 0.339
0.0ProXaa: 0.0 ± 0.0
Gln
4.535GlnAla: 4.535 ± 0.702
0.0GlnCys: 0.0 ± 0.0
1.512GlnAsp: 1.512 ± 0.362
3.671GlnGlu: 3.671 ± 0.702
1.728GlnPhe: 1.728 ± 0.38
1.944GlnGly: 1.944 ± 0.414
0.432GlnHis: 0.432 ± 0.214
2.807GlnIle: 2.807 ± 0.559
2.699GlnLys: 2.699 ± 0.559
3.995GlnLeu: 3.995 ± 0.641
1.296GlnMet: 1.296 ± 0.34
2.592GlnAsn: 2.592 ± 0.493
0.54GlnPro: 0.54 ± 0.221
1.728GlnGln: 1.728 ± 0.469
1.836GlnArg: 1.836 ± 0.535
2.807GlnSer: 2.807 ± 0.611
1.62GlnThr: 1.62 ± 0.43
2.484GlnVal: 2.484 ± 0.582
0.216GlnTrp: 0.216 ± 0.136
1.296GlnTyr: 1.296 ± 0.438
0.0GlnXaa: 0.0 ± 0.0
Arg
3.023ArgAla: 3.023 ± 0.417
0.324ArgCys: 0.324 ± 0.193
1.62ArgAsp: 1.62 ± 0.388
3.239ArgGlu: 3.239 ± 0.503
2.16ArgPhe: 2.16 ± 0.431
2.052ArgGly: 2.052 ± 0.606
0.54ArgHis: 0.54 ± 0.235
2.915ArgIle: 2.915 ± 0.543
3.563ArgLys: 3.563 ± 0.534
3.563ArgLeu: 3.563 ± 0.692
0.432ArgMet: 0.432 ± 0.266
2.699ArgAsn: 2.699 ± 0.516
1.62ArgPro: 1.62 ± 0.458
1.512ArgGln: 1.512 ± 0.312
2.16ArgArg: 2.16 ± 0.574
2.268ArgSer: 2.268 ± 0.364
2.052ArgThr: 2.052 ± 0.62
3.023ArgVal: 3.023 ± 0.529
0.972ArgTrp: 0.972 ± 0.256
2.268ArgTyr: 2.268 ± 0.57
0.0ArgXaa: 0.0 ± 0.0
Ser
4.751SerAla: 4.751 ± 0.858
0.324SerCys: 0.324 ± 0.162
3.347SerAsp: 3.347 ± 0.557
4.427SerGlu: 4.427 ± 0.564
3.347SerPhe: 3.347 ± 0.561
3.779SerGly: 3.779 ± 0.694
0.432SerHis: 0.432 ± 0.232
4.535SerIle: 4.535 ± 0.579
5.183SerLys: 5.183 ± 0.706
6.155SerLeu: 6.155 ± 0.852
1.08SerMet: 1.08 ± 0.419
4.319SerAsn: 4.319 ± 0.806
1.62SerPro: 1.62 ± 0.416
2.699SerGln: 2.699 ± 0.687
2.807SerArg: 2.807 ± 0.525
4.643SerSer: 4.643 ± 0.727
4.319SerThr: 4.319 ± 0.654
3.671SerVal: 3.671 ± 0.765
0.864SerTrp: 0.864 ± 0.3
2.484SerTyr: 2.484 ± 0.475
0.0SerXaa: 0.0 ± 0.0
Thr
4.319ThrAla: 4.319 ± 0.862
0.432ThrCys: 0.432 ± 0.222
2.915ThrAsp: 2.915 ± 0.697
2.484ThrGlu: 2.484 ± 0.458
2.376ThrPhe: 2.376 ± 0.496
4.535ThrGly: 4.535 ± 0.708
0.756ThrHis: 0.756 ± 0.307
5.075ThrIle: 5.075 ± 0.609
4.643ThrLys: 4.643 ± 0.474
4.751ThrLeu: 4.751 ± 0.713
0.864ThrMet: 0.864 ± 0.258
2.699ThrAsn: 2.699 ± 0.54
1.188ThrPro: 1.188 ± 0.475
2.484ThrGln: 2.484 ± 0.523
3.239ThrArg: 3.239 ± 0.612
3.671ThrSer: 3.671 ± 0.686
4.103ThrThr: 4.103 ± 0.658
4.427ThrVal: 4.427 ± 0.841
0.864ThrTrp: 0.864 ± 0.233
1.728ThrTyr: 1.728 ± 0.448
0.0ThrXaa: 0.0 ± 0.0
Val
3.887ValAla: 3.887 ± 0.782
0.324ValCys: 0.324 ± 0.166
3.455ValAsp: 3.455 ± 0.711
5.939ValGlu: 5.939 ± 0.707
2.484ValPhe: 2.484 ± 0.835
4.211ValGly: 4.211 ± 0.835
0.324ValHis: 0.324 ± 0.215
3.455ValIle: 3.455 ± 0.64
4.967ValLys: 4.967 ± 0.806
4.643ValLeu: 4.643 ± 0.788
1.404ValMet: 1.404 ± 0.306
3.779ValAsn: 3.779 ± 0.541
1.944ValPro: 1.944 ± 0.471
2.484ValGln: 2.484 ± 0.651
2.699ValArg: 2.699 ± 0.585
3.779ValSer: 3.779 ± 0.736
4.751ValThr: 4.751 ± 0.723
4.427ValVal: 4.427 ± 0.677
1.188ValTrp: 1.188 ± 0.346
1.728ValTyr: 1.728 ± 0.494
0.0ValXaa: 0.0 ± 0.0
Trp
0.756TrpAla: 0.756 ± 0.287
0.108TrpCys: 0.108 ± 0.11
0.216TrpAsp: 0.216 ± 0.126
0.972TrpGlu: 0.972 ± 0.233
0.972TrpPhe: 0.972 ± 0.3
0.756TrpGly: 0.756 ± 0.317
0.216TrpHis: 0.216 ± 0.185
1.404TrpIle: 1.404 ± 0.483
1.188TrpLys: 1.188 ± 0.388
1.836TrpLeu: 1.836 ± 0.412
0.324TrpMet: 0.324 ± 0.153
1.08TrpAsn: 1.08 ± 0.301
0.324TrpPro: 0.324 ± 0.193
0.648TrpGln: 0.648 ± 0.22
0.756TrpArg: 0.756 ± 0.232
1.512TrpSer: 1.512 ± 0.473
0.648TrpThr: 0.648 ± 0.245
0.756TrpVal: 0.756 ± 0.316
0.216TrpTrp: 0.216 ± 0.215
0.864TrpTyr: 0.864 ± 0.535
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.944TyrAla: 1.944 ± 0.418
0.0TyrCys: 0.0 ± 0.0
2.592TyrAsp: 2.592 ± 0.522
3.455TyrGlu: 3.455 ± 0.641
1.944TyrPhe: 1.944 ± 0.54
2.592TyrGly: 2.592 ± 0.588
0.972TyrHis: 0.972 ± 0.273
2.268TyrIle: 2.268 ± 0.447
3.347TyrLys: 3.347 ± 0.733
4.211TyrLeu: 4.211 ± 0.786
0.648TyrMet: 0.648 ± 0.284
1.836TyrAsn: 1.836 ± 0.447
0.972TyrPro: 0.972 ± 0.335
1.728TyrGln: 1.728 ± 0.512
1.512TyrArg: 1.512 ± 0.495
2.592TyrSer: 2.592 ± 0.408
1.08TyrThr: 1.08 ± 0.316
1.62TyrVal: 1.62 ± 0.375
0.648TyrTrp: 0.648 ± 0.236
2.592TyrTyr: 2.592 ± 1.044
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (9262 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski