Amino acid dipepetide frequency for Streptococcus phage Javan237

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.454AlaAla: 3.454 ± 0.804
0.0AlaCys: 0.0 ± 0.0
4.883AlaAsp: 4.883 ± 0.797
4.883AlaGlu: 4.883 ± 0.939
3.215AlaPhe: 3.215 ± 0.441
5.002AlaGly: 5.002 ± 0.922
0.715AlaHis: 0.715 ± 0.27
4.168AlaIle: 4.168 ± 0.533
7.026AlaLys: 7.026 ± 1.68
6.074AlaLeu: 6.074 ± 0.702
1.905AlaMet: 1.905 ± 0.463
5.597AlaAsn: 5.597 ± 1.082
1.667AlaPro: 1.667 ± 0.327
3.692AlaGln: 3.692 ± 0.686
2.382AlaArg: 2.382 ± 0.61
6.312AlaSer: 6.312 ± 1.105
5.478AlaThr: 5.478 ± 0.798
4.764AlaVal: 4.764 ± 0.737
1.786AlaTrp: 1.786 ± 0.407
3.454AlaTyr: 3.454 ± 0.707
0.0AlaXaa: 0.0 ± 0.0
Cys
0.238CysAla: 0.238 ± 0.171
0.119CysCys: 0.119 ± 0.148
0.357CysAsp: 0.357 ± 0.189
0.238CysGlu: 0.238 ± 0.163
0.119CysPhe: 0.119 ± 0.131
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.357CysLeu: 0.357 ± 0.24
0.119CysMet: 0.119 ± 0.129
0.238CysAsn: 0.238 ± 0.13
0.119CysPro: 0.119 ± 0.113
0.119CysGln: 0.119 ± 0.117
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.357CysThr: 0.357 ± 0.218
0.238CysVal: 0.238 ± 0.132
0.119CysTrp: 0.119 ± 0.121
0.119CysTyr: 0.119 ± 0.088
0.0CysXaa: 0.0 ± 0.0
Asp
2.382AspAla: 2.382 ± 0.581
0.0AspCys: 0.0 ± 0.0
4.883AspAsp: 4.883 ± 1.139
4.525AspGlu: 4.525 ± 0.874
3.692AspPhe: 3.692 ± 0.625
7.026AspGly: 7.026 ± 1.257
0.476AspHis: 0.476 ± 0.325
2.858AspIle: 2.858 ± 0.802
5.002AspLys: 5.002 ± 0.771
4.049AspLeu: 4.049 ± 0.885
0.834AspMet: 0.834 ± 0.288
4.645AspAsn: 4.645 ± 0.655
2.739AspPro: 2.739 ± 0.512
1.31AspGln: 1.31 ± 0.549
1.786AspArg: 1.786 ± 0.463
4.287AspSer: 4.287 ± 0.562
3.811AspThr: 3.811 ± 0.621
4.406AspVal: 4.406 ± 0.833
0.715AspTrp: 0.715 ± 0.269
3.93AspTyr: 3.93 ± 0.693
0.0AspXaa: 0.0 ± 0.0
Glu
5.24GluAla: 5.24 ± 1.021
0.476GluCys: 0.476 ± 0.273
3.811GluAsp: 3.811 ± 0.864
5.002GluGlu: 5.002 ± 1.288
2.144GluPhe: 2.144 ± 0.518
2.263GluGly: 2.263 ± 0.392
0.953GluHis: 0.953 ± 0.334
5.24GluIle: 5.24 ± 1.034
4.287GluLys: 4.287 ± 1.217
6.431GluLeu: 6.431 ± 0.939
1.191GluMet: 1.191 ± 0.398
3.573GluAsn: 3.573 ± 0.691
1.548GluPro: 1.548 ± 0.619
3.692GluGln: 3.692 ± 0.86
2.263GluArg: 2.263 ± 0.384
2.977GluSer: 2.977 ± 0.508
4.645GluThr: 4.645 ± 0.627
4.287GluVal: 4.287 ± 0.865
0.476GluTrp: 0.476 ± 0.259
2.382GluTyr: 2.382 ± 0.554
0.0GluXaa: 0.0 ± 0.0
Phe
3.215PheAla: 3.215 ± 0.633
0.238PheCys: 0.238 ± 0.141
3.096PheAsp: 3.096 ± 0.458
1.905PheGlu: 1.905 ± 0.562
1.786PhePhe: 1.786 ± 0.421
3.692PheGly: 3.692 ± 0.733
0.238PheHis: 0.238 ± 0.17
2.144PheIle: 2.144 ± 0.483
3.573PheLys: 3.573 ± 0.495
3.573PheLeu: 3.573 ± 0.902
0.715PheMet: 0.715 ± 0.319
2.382PheAsn: 2.382 ± 0.449
0.595PhePro: 0.595 ± 0.319
0.476PheGln: 0.476 ± 0.169
1.31PheArg: 1.31 ± 0.371
2.977PheSer: 2.977 ± 0.685
2.382PheThr: 2.382 ± 0.516
1.905PheVal: 1.905 ± 0.432
0.476PheTrp: 0.476 ± 0.207
1.905PheTyr: 1.905 ± 0.508
0.0PheXaa: 0.0 ± 0.0
Gly
3.692GlyAla: 3.692 ± 0.733
0.0GlyCys: 0.0 ± 0.0
3.93GlyAsp: 3.93 ± 0.491
3.692GlyGlu: 3.692 ± 0.651
2.62GlyPhe: 2.62 ± 0.443
5.121GlyGly: 5.121 ± 1.009
1.31GlyHis: 1.31 ± 0.332
5.955GlyIle: 5.955 ± 0.896
5.597GlyLys: 5.597 ± 0.714
7.264GlyLeu: 7.264 ± 0.611
2.025GlyMet: 2.025 ± 0.528
4.883GlyAsn: 4.883 ± 0.847
0.476GlyPro: 0.476 ± 0.175
2.025GlyGln: 2.025 ± 0.422
1.667GlyArg: 1.667 ± 0.606
5.359GlySer: 5.359 ± 1.161
5.359GlyThr: 5.359 ± 1.394
3.335GlyVal: 3.335 ± 0.588
1.429GlyTrp: 1.429 ± 0.335
2.858GlyTyr: 2.858 ± 0.562
0.0GlyXaa: 0.0 ± 0.0
His
0.715HisAla: 0.715 ± 0.325
0.0HisCys: 0.0 ± 0.0
0.953HisAsp: 0.953 ± 0.316
0.357HisGlu: 0.357 ± 0.2
1.191HisPhe: 1.191 ± 0.331
0.119HisGly: 0.119 ± 0.096
0.119HisHis: 0.119 ± 0.096
0.834HisIle: 0.834 ± 0.346
1.429HisLys: 1.429 ± 0.465
0.476HisLeu: 0.476 ± 0.235
1.072HisMet: 1.072 ± 0.299
0.834HisAsn: 0.834 ± 0.328
0.595HisPro: 0.595 ± 0.24
0.595HisGln: 0.595 ± 0.24
0.357HisArg: 0.357 ± 0.179
0.476HisSer: 0.476 ± 0.216
0.715HisThr: 0.715 ± 0.338
0.953HisVal: 0.953 ± 0.302
0.119HisTrp: 0.119 ± 0.088
0.357HisTyr: 0.357 ± 0.22
0.0HisXaa: 0.0 ± 0.0
Ile
5.716IleAla: 5.716 ± 1.024
0.119IleCys: 0.119 ± 0.117
4.525IleAsp: 4.525 ± 0.577
3.573IleGlu: 3.573 ± 0.814
2.739IlePhe: 2.739 ± 0.652
4.287IleGly: 4.287 ± 0.582
0.953IleHis: 0.953 ± 0.335
2.977IleIle: 2.977 ± 0.551
6.669IleLys: 6.669 ± 0.955
4.049IleLeu: 4.049 ± 0.655
1.786IleMet: 1.786 ± 0.537
2.977IleAsn: 2.977 ± 0.457
2.858IlePro: 2.858 ± 0.502
3.335IleGln: 3.335 ± 0.476
1.548IleArg: 1.548 ± 0.442
5.359IleSer: 5.359 ± 0.617
5.24IleThr: 5.24 ± 0.936
1.548IleVal: 1.548 ± 0.399
1.31IleTrp: 1.31 ± 0.314
2.144IleTyr: 2.144 ± 0.475
0.0IleXaa: 0.0 ± 0.0
Lys
6.193LysAla: 6.193 ± 0.978
0.357LysCys: 0.357 ± 0.195
5.716LysAsp: 5.716 ± 0.876
6.312LysGlu: 6.312 ± 0.887
3.573LysPhe: 3.573 ± 0.862
5.002LysGly: 5.002 ± 0.82
1.31LysHis: 1.31 ± 0.415
5.955LysIle: 5.955 ± 0.744
5.955LysLys: 5.955 ± 1.086
6.788LysLeu: 6.788 ± 0.714
2.739LysMet: 2.739 ± 0.813
5.478LysAsn: 5.478 ± 0.567
2.144LysPro: 2.144 ± 0.452
2.62LysGln: 2.62 ± 0.603
2.382LysArg: 2.382 ± 0.61
4.764LysSer: 4.764 ± 0.988
6.669LysThr: 6.669 ± 0.933
4.525LysVal: 4.525 ± 0.649
1.31LysTrp: 1.31 ± 0.371
2.382LysTyr: 2.382 ± 0.584
0.0LysXaa: 0.0 ± 0.0
Leu
8.336LeuAla: 8.336 ± 1.234
0.238LeuCys: 0.238 ± 0.195
4.883LeuAsp: 4.883 ± 0.832
6.193LeuGlu: 6.193 ± 1.17
1.548LeuPhe: 1.548 ± 0.442
4.168LeuGly: 4.168 ± 0.66
0.595LeuHis: 0.595 ± 0.26
5.359LeuIle: 5.359 ± 0.711
8.217LeuLys: 8.217 ± 1.034
4.406LeuLeu: 4.406 ± 0.767
2.144LeuMet: 2.144 ± 0.528
4.645LeuAsn: 4.645 ± 0.639
1.905LeuPro: 1.905 ± 0.33
2.62LeuGln: 2.62 ± 0.5
2.858LeuArg: 2.858 ± 0.57
6.431LeuSer: 6.431 ± 0.859
7.026LeuThr: 7.026 ± 1.001
4.049LeuVal: 4.049 ± 0.506
1.191LeuTrp: 1.191 ± 0.366
2.977LeuTyr: 2.977 ± 0.489
0.0LeuXaa: 0.0 ± 0.0
Met
1.905MetAla: 1.905 ± 0.315
0.0MetCys: 0.0 ± 0.0
1.191MetAsp: 1.191 ± 0.434
1.072MetGlu: 1.072 ± 0.455
0.715MetPhe: 0.715 ± 0.248
1.429MetGly: 1.429 ± 0.331
0.119MetHis: 0.119 ± 0.166
1.786MetIle: 1.786 ± 0.407
3.215MetLys: 3.215 ± 0.607
2.382MetLeu: 2.382 ± 0.451
0.476MetMet: 0.476 ± 0.183
1.072MetAsn: 1.072 ± 0.402
0.476MetPro: 0.476 ± 0.193
1.31MetGln: 1.31 ± 0.471
0.953MetArg: 0.953 ± 0.239
2.025MetSer: 2.025 ± 0.461
1.905MetThr: 1.905 ± 0.459
1.786MetVal: 1.786 ± 0.4
0.238MetTrp: 0.238 ± 0.177
0.834MetTyr: 0.834 ± 0.186
0.0MetXaa: 0.0 ± 0.0
Asn
5.359AsnAla: 5.359 ± 0.794
0.238AsnCys: 0.238 ± 0.12
3.573AsnAsp: 3.573 ± 0.685
2.977AsnGlu: 2.977 ± 0.593
1.429AsnPhe: 1.429 ± 0.415
6.431AsnGly: 6.431 ± 1.034
0.834AsnHis: 0.834 ± 0.223
3.811AsnIle: 3.811 ± 0.479
3.692AsnLys: 3.692 ± 0.795
4.645AsnLeu: 4.645 ± 0.646
2.025AsnMet: 2.025 ± 0.463
3.692AsnAsn: 3.692 ± 0.808
2.263AsnPro: 2.263 ± 0.454
3.215AsnGln: 3.215 ± 0.62
2.144AsnArg: 2.144 ± 0.449
5.121AsnSer: 5.121 ± 0.651
3.811AsnThr: 3.811 ± 0.662
2.739AsnVal: 2.739 ± 0.663
0.595AsnTrp: 0.595 ± 0.257
2.382AsnTyr: 2.382 ± 0.392
0.0AsnXaa: 0.0 ± 0.0
Pro
2.025ProAla: 2.025 ± 0.575
0.0ProCys: 0.0 ± 0.0
1.429ProAsp: 1.429 ± 0.45
2.025ProGlu: 2.025 ± 0.475
1.31ProPhe: 1.31 ± 0.317
1.072ProGly: 1.072 ± 0.307
0.595ProHis: 0.595 ± 0.211
1.905ProIle: 1.905 ± 0.384
2.263ProLys: 2.263 ± 0.669
2.382ProLeu: 2.382 ± 0.587
0.953ProMet: 0.953 ± 0.293
1.905ProAsn: 1.905 ± 0.402
0.476ProPro: 0.476 ± 0.175
1.072ProGln: 1.072 ± 0.366
0.595ProArg: 0.595 ± 0.177
2.501ProSer: 2.501 ± 0.601
2.62ProThr: 2.62 ± 0.453
1.31ProVal: 1.31 ± 0.273
0.357ProTrp: 0.357 ± 0.154
1.191ProTyr: 1.191 ± 0.361
0.0ProXaa: 0.0 ± 0.0
Gln
4.287GlnAla: 4.287 ± 0.957
0.0GlnCys: 0.0 ± 0.0
2.501GlnAsp: 2.501 ± 0.6
3.335GlnGlu: 3.335 ± 0.63
0.953GlnPhe: 0.953 ± 0.275
3.454GlnGly: 3.454 ± 0.56
0.357GlnHis: 0.357 ± 0.223
3.215GlnIle: 3.215 ± 0.648
3.215GlnLys: 3.215 ± 0.574
4.287GlnLeu: 4.287 ± 0.685
0.595GlnMet: 0.595 ± 0.328
2.501GlnAsn: 2.501 ± 0.714
0.715GlnPro: 0.715 ± 0.224
2.977GlnGln: 2.977 ± 0.596
1.191GlnArg: 1.191 ± 0.368
2.977GlnSer: 2.977 ± 0.662
2.382GlnThr: 2.382 ± 0.625
2.263GlnVal: 2.263 ± 0.439
0.357GlnTrp: 0.357 ± 0.265
1.667GlnTyr: 1.667 ± 0.449
0.0GlnXaa: 0.0 ± 0.0
Arg
2.144ArgAla: 2.144 ± 0.393
0.0ArgCys: 0.0 ± 0.0
1.191ArgAsp: 1.191 ± 0.409
2.382ArgGlu: 2.382 ± 0.512
1.548ArgPhe: 1.548 ± 0.401
1.31ArgGly: 1.31 ± 0.289
0.238ArgHis: 0.238 ± 0.192
1.905ArgIle: 1.905 ± 0.627
2.739ArgLys: 2.739 ± 0.534
2.501ArgLeu: 2.501 ± 0.633
0.595ArgMet: 0.595 ± 0.25
2.144ArgAsn: 2.144 ± 0.532
0.953ArgPro: 0.953 ± 0.379
2.144ArgGln: 2.144 ± 0.58
1.072ArgArg: 1.072 ± 0.518
1.667ArgSer: 1.667 ± 0.385
2.382ArgThr: 2.382 ± 0.65
2.025ArgVal: 2.025 ± 0.39
1.072ArgTrp: 1.072 ± 0.204
1.191ArgTyr: 1.191 ± 0.471
0.0ArgXaa: 0.0 ± 0.0
Ser
5.478SerAla: 5.478 ± 0.935
0.238SerCys: 0.238 ± 0.199
5.121SerAsp: 5.121 ± 0.786
5.716SerGlu: 5.716 ± 0.863
3.335SerPhe: 3.335 ± 0.771
5.716SerGly: 5.716 ± 1.014
0.953SerHis: 0.953 ± 0.296
4.049SerIle: 4.049 ± 0.754
4.764SerLys: 4.764 ± 1.089
5.478SerLeu: 5.478 ± 0.678
2.62SerMet: 2.62 ± 0.588
4.287SerAsn: 4.287 ± 0.407
2.263SerPro: 2.263 ± 0.376
3.215SerGln: 3.215 ± 0.896
2.025SerArg: 2.025 ± 0.603
6.55SerSer: 6.55 ± 1.596
5.359SerThr: 5.359 ± 0.883
4.049SerVal: 4.049 ± 0.678
0.953SerTrp: 0.953 ± 0.385
1.429SerTyr: 1.429 ± 0.455
0.0SerXaa: 0.0 ± 0.0
Thr
6.788ThrAla: 6.788 ± 1.066
0.119ThrCys: 0.119 ± 0.096
3.811ThrAsp: 3.811 ± 0.578
2.977ThrGlu: 2.977 ± 0.644
2.62ThrPhe: 2.62 ± 0.641
4.525ThrGly: 4.525 ± 0.796
1.31ThrHis: 1.31 ± 0.317
5.24ThrIle: 5.24 ± 1.061
6.55ThrLys: 6.55 ± 0.918
7.145ThrLeu: 7.145 ± 0.936
1.191ThrMet: 1.191 ± 0.433
3.454ThrAsn: 3.454 ± 0.478
2.263ThrPro: 2.263 ± 0.542
3.335ThrGln: 3.335 ± 0.527
3.096ThrArg: 3.096 ± 0.395
6.312ThrSer: 6.312 ± 0.725
4.525ThrThr: 4.525 ± 1.14
5.716ThrVal: 5.716 ± 1.078
1.429ThrTrp: 1.429 ± 0.322
2.739ThrTyr: 2.739 ± 0.635
0.0ThrXaa: 0.0 ± 0.0
Val
5.478ValAla: 5.478 ± 0.867
0.238ValCys: 0.238 ± 0.161
3.93ValAsp: 3.93 ± 0.807
3.096ValGlu: 3.096 ± 0.619
1.905ValPhe: 1.905 ± 0.523
3.93ValGly: 3.93 ± 0.933
0.357ValHis: 0.357 ± 0.168
3.692ValIle: 3.692 ± 0.605
4.049ValLys: 4.049 ± 0.595
4.168ValLeu: 4.168 ± 0.62
0.357ValMet: 0.357 ± 0.241
3.335ValAsn: 3.335 ± 0.655
2.025ValPro: 2.025 ± 0.469
2.382ValGln: 2.382 ± 0.594
1.786ValArg: 1.786 ± 0.462
3.573ValSer: 3.573 ± 0.559
6.55ValThr: 6.55 ± 0.977
4.049ValVal: 4.049 ± 0.786
0.595ValTrp: 0.595 ± 0.225
2.501ValTyr: 2.501 ± 0.462
0.0ValXaa: 0.0 ± 0.0
Trp
0.476TrpAla: 0.476 ± 0.17
0.119TrpCys: 0.119 ± 0.113
1.191TrpAsp: 1.191 ± 0.534
0.715TrpGlu: 0.715 ± 0.376
0.595TrpPhe: 0.595 ± 0.254
0.834TrpGly: 0.834 ± 0.237
0.238TrpHis: 0.238 ± 0.145
0.595TrpIle: 0.595 ± 0.223
1.429TrpLys: 1.429 ± 0.355
1.072TrpLeu: 1.072 ± 0.415
0.357TrpMet: 0.357 ± 0.199
1.667TrpAsn: 1.667 ± 0.494
0.0TrpPro: 0.0 ± 0.0
0.834TrpGln: 0.834 ± 0.254
0.595TrpArg: 0.595 ± 0.174
1.667TrpSer: 1.667 ± 0.448
0.834TrpThr: 0.834 ± 0.357
0.834TrpVal: 0.834 ± 0.372
0.476TrpTrp: 0.476 ± 0.255
0.715TrpTyr: 0.715 ± 0.273
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.335TyrAla: 3.335 ± 0.399
0.357TyrCys: 0.357 ± 0.261
2.382TyrAsp: 2.382 ± 0.565
1.905TyrGlu: 1.905 ± 0.43
1.548TyrPhe: 1.548 ± 0.373
3.215TyrGly: 3.215 ± 0.774
0.595TyrHis: 0.595 ± 0.243
2.025TyrIle: 2.025 ± 0.571
2.501TyrLys: 2.501 ± 0.495
2.263TyrLeu: 2.263 ± 0.448
1.072TyrMet: 1.072 ± 0.358
1.667TyrAsn: 1.667 ± 0.374
1.905TyrPro: 1.905 ± 0.521
2.263TyrGln: 2.263 ± 0.492
1.191TyrArg: 1.191 ± 0.328
2.382TyrSer: 2.382 ± 0.634
3.096TyrThr: 3.096 ± 0.883
3.215TyrVal: 3.215 ± 0.777
0.238TyrTrp: 0.238 ± 0.192
1.905TyrTyr: 1.905 ± 0.421
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 33 proteins (8398 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski