Amino acid dipeptide frequency for Streptococcus satellite phage Javan89

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteins.
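As a rough illustration of the calculation, the sketch below counts adjacent residue pairs in a list of protein sequences and reports them in per mille. The function name, the toy sequences, the use of one-letter codes (the table uses three-letter codes) and the normalisation by the total number of counted pairs are all assumptions made for this example; the database may normalise differently.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of adjacent residue pairs (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        # Pairs are counted within each protein, never across protein boundaries.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of counted pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the Javan89 proteome.
freqs = dipeptide_frequencies(["MKKLAE", "MKLE"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

Because the pairs are ordered, "KL" and "LK" are counted separately, which is why the table has distinct entries such as GluLeu and LeuGlu.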

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
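The comparison against the uniform expectation can be sketched as follows. With 400 standard dipeptides the expected value is 1/400 = 0.25%, which is 2.5 in the per-mille units of the table. The `classify` helper and the simple greater-than/less-than rule are assumptions for illustration; the exact criterion behind the red and blue marking (for instance, whether the error estimate is taken into account) is not stated in the text.

```python
# Uniform expectation for the 20 x 20 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (hypothetical rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Three values copied from the table.
examples = {"GluLeu": 11.738, "AlaPro": 0.286, "AlaAla": 1.718}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
```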

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
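A short worked conversion, using the GluLeu value from the table, may make the units concrete. The helper name is an assumption for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


glu_leu_fraction = per_mille_to_fraction(11.738)  # GluLeu entry from the table
glu_leu_percent = glu_leu_fraction * 100
print(f"{glu_leu_fraction:.6f} as a fraction, {glu_leu_percent:.4f} %")
```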

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.718AlaAla: 1.718 ± 0.977
1.431AlaCys: 1.431 ± 0.537
2.29AlaAsp: 2.29 ± 0.815
4.008AlaGlu: 4.008 ± 1.102
2.577AlaPhe: 2.577 ± 0.657
2.004AlaGly: 2.004 ± 0.899
0.573AlaHis: 0.573 ± 0.59
3.722AlaIle: 3.722 ± 1.213
5.153AlaLys: 5.153 ± 1.075
5.726AlaLeu: 5.726 ± 1.319
1.718AlaMet: 1.718 ± 0.764
3.435AlaAsn: 3.435 ± 1.351
0.286AlaPro: 0.286 ± 0.279
3.149AlaGln: 3.149 ± 0.759
3.149AlaArg: 3.149 ± 1.203
2.863AlaSer: 2.863 ± 0.739
4.581AlaThr: 4.581 ± 1.227
3.722AlaVal: 3.722 ± 1.089
1.145AlaTrp: 1.145 ± 0.544
2.29AlaTyr: 2.29 ± 0.635
0.0AlaXaa: 0.0 ± 0.0
Cys
0.286CysAla: 0.286 ± 0.285
0.286CysCys: 0.286 ± 0.308
0.286CysAsp: 0.286 ± 0.301
0.286CysGlu: 0.286 ± 0.228
0.286CysPhe: 0.286 ± 0.228
0.573CysGly: 0.573 ± 0.356
0.286CysHis: 0.286 ± 0.301
0.286CysIle: 0.286 ± 0.244
0.286CysLys: 0.286 ± 0.285
0.573CysLeu: 0.573 ± 0.396
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.286CysArg: 0.286 ± 0.263
0.573CysSer: 0.573 ± 0.419
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.859CysTyr: 0.859 ± 0.502
0.0CysXaa: 0.0 ± 0.0
Asp
1.145AspAla: 1.145 ± 0.547
0.286AspCys: 0.286 ± 0.285
4.581AspAsp: 4.581 ± 1.183
4.867AspGlu: 4.867 ± 1.293
4.008AspPhe: 4.008 ± 1.423
1.431AspGly: 1.431 ± 0.69
1.431AspHis: 1.431 ± 0.679
9.734AspIle: 9.734 ± 1.994
5.726AspLys: 5.726 ± 1.226
7.443AspLeu: 7.443 ± 1.357
2.29AspMet: 2.29 ± 0.762
0.859AspAsn: 0.859 ± 0.477
0.0AspPro: 0.0 ± 0.0
1.431AspGln: 1.431 ± 0.577
3.149AspArg: 3.149 ± 0.76
2.577AspSer: 2.577 ± 1.085
2.29AspThr: 2.29 ± 0.786
3.149AspVal: 3.149 ± 0.975
0.573AspTrp: 0.573 ± 0.334
4.581AspTyr: 4.581 ± 1.041
0.0AspXaa: 0.0 ± 0.0
Glu
5.726GluAla: 5.726 ± 1.438
0.573GluCys: 0.573 ± 0.569
4.581GluAsp: 4.581 ± 1.508
6.871GluGlu: 6.871 ± 2.197
2.004GluPhe: 2.004 ± 0.974
4.008GluGly: 4.008 ± 0.908
1.718GluHis: 1.718 ± 0.691
6.298GluIle: 6.298 ± 1.293
9.161GluLys: 9.161 ± 1.639
11.738GluLeu: 11.738 ± 1.795
4.581GluMet: 4.581 ± 0.729
2.863GluAsn: 2.863 ± 0.761
0.859GluPro: 0.859 ± 0.472
3.722GluGln: 3.722 ± 1.264
3.722GluArg: 3.722 ± 1.13
3.149GluSer: 3.149 ± 0.783
4.581GluThr: 4.581 ± 1.27
5.439GluVal: 5.439 ± 0.991
1.431GluTrp: 1.431 ± 0.568
3.149GluTyr: 3.149 ± 0.644
0.0GluXaa: 0.0 ± 0.0
Phe
2.29PheAla: 2.29 ± 0.771
0.286PheCys: 0.286 ± 0.308
3.722PheAsp: 3.722 ± 0.828
3.722PheGlu: 3.722 ± 0.814
2.29PhePhe: 2.29 ± 0.704
1.145PheGly: 1.145 ± 0.43
1.718PheHis: 1.718 ± 0.649
2.577PheIle: 2.577 ± 0.905
3.722PheLys: 3.722 ± 1.083
2.863PheLeu: 2.863 ± 0.846
0.859PheMet: 0.859 ± 0.476
2.577PheAsn: 2.577 ± 0.794
0.573PhePro: 0.573 ± 0.357
1.431PheGln: 1.431 ± 0.589
2.004PheArg: 2.004 ± 0.819
4.581PheSer: 4.581 ± 1.664
3.435PheThr: 3.435 ± 0.914
2.004PheVal: 2.004 ± 0.755
0.859PheTrp: 0.859 ± 0.473
1.431PheTyr: 1.431 ± 0.663
0.0PheXaa: 0.0 ± 0.0
Gly
2.29GlyAla: 2.29 ± 0.576
0.859GlyCys: 0.859 ± 0.476
2.577GlyAsp: 2.577 ± 0.712
3.435GlyGlu: 3.435 ± 0.809
1.431GlyPhe: 1.431 ± 0.709
2.004GlyGly: 2.004 ± 0.95
0.573GlyHis: 0.573 ± 0.384
3.722GlyIle: 3.722 ± 0.977
2.29GlyLys: 2.29 ± 0.946
4.008GlyLeu: 4.008 ± 0.886
1.145GlyMet: 1.145 ± 0.526
3.149GlyAsn: 3.149 ± 0.916
0.286GlyPro: 0.286 ± 0.301
2.004GlyGln: 2.004 ± 0.675
2.577GlyArg: 2.577 ± 1.413
1.431GlySer: 1.431 ± 0.556
2.577GlyThr: 2.577 ± 0.599
3.149GlyVal: 3.149 ± 1.056
0.573GlyTrp: 0.573 ± 0.409
4.008GlyTyr: 4.008 ± 1.058
0.0GlyXaa: 0.0 ± 0.0
His
0.859HisAla: 0.859 ± 0.585
0.0HisCys: 0.0 ± 0.0
0.573HisAsp: 0.573 ± 0.333
0.859HisGlu: 0.859 ± 0.468
0.286HisPhe: 0.286 ± 0.263
0.859HisGly: 0.859 ± 0.486
0.286HisHis: 0.286 ± 0.244
1.718HisIle: 1.718 ± 0.603
1.718HisLys: 1.718 ± 0.465
0.859HisLeu: 0.859 ± 0.447
0.859HisMet: 0.859 ± 0.458
1.431HisAsn: 1.431 ± 0.588
0.859HisPro: 0.859 ± 0.487
0.859HisGln: 0.859 ± 0.527
0.859HisArg: 0.859 ± 0.413
0.859HisSer: 0.859 ± 0.393
0.859HisThr: 0.859 ± 0.385
0.859HisVal: 0.859 ± 0.459
0.0HisTrp: 0.0 ± 0.0
0.859HisTyr: 0.859 ± 0.495
0.0HisXaa: 0.0 ± 0.0
Ile
5.153IleAla: 5.153 ± 1.176
0.573IleCys: 0.573 ± 0.443
6.871IleAsp: 6.871 ± 1.27
7.157IleGlu: 7.157 ± 1.407
3.435IlePhe: 3.435 ± 0.902
3.722IleGly: 3.722 ± 1.071
0.859IleHis: 0.859 ± 0.582
3.722IleIle: 3.722 ± 1.106
6.012IleLys: 6.012 ± 1.286
4.867IleLeu: 4.867 ± 0.825
0.859IleMet: 0.859 ± 0.573
3.435IleAsn: 3.435 ± 1.316
2.29IlePro: 2.29 ± 0.797
2.863IleGln: 2.863 ± 0.884
3.435IleArg: 3.435 ± 1.098
5.153IleSer: 5.153 ± 1.13
4.867IleThr: 4.867 ± 1.469
2.863IleVal: 2.863 ± 0.834
0.0IleTrp: 0.0 ± 0.0
2.29IleTyr: 2.29 ± 0.762
0.0IleXaa: 0.0 ± 0.0
Lys
6.585LysAla: 6.585 ± 1.039
0.0LysCys: 0.0 ± 0.0
4.581LysAsp: 4.581 ± 1.144
9.447LysGlu: 9.447 ± 1.87
4.294LysPhe: 4.294 ± 1.032
4.008LysGly: 4.008 ± 1.201
1.145LysHis: 1.145 ± 0.545
6.298LysIle: 6.298 ± 1.377
8.589LysLys: 8.589 ± 1.635
9.734LysLeu: 9.734 ± 1.699
2.29LysMet: 2.29 ± 0.778
4.867LysAsn: 4.867 ± 1.038
2.577LysPro: 2.577 ± 1.11
4.867LysGln: 4.867 ± 0.845
4.294LysArg: 4.294 ± 0.824
4.867LysSer: 4.867 ± 1.288
4.294LysThr: 4.294 ± 1.051
5.726LysVal: 5.726 ± 1.217
0.859LysTrp: 0.859 ± 0.442
3.435LysTyr: 3.435 ± 0.839
0.0LysXaa: 0.0 ± 0.0
Leu
5.153LeuAla: 5.153 ± 1.167
0.286LeuCys: 0.286 ± 0.244
7.443LeuAsp: 7.443 ± 1.277
10.879LeuGlu: 10.879 ± 1.922
3.149LeuPhe: 3.149 ± 0.854
4.581LeuGly: 4.581 ± 0.812
1.431LeuHis: 1.431 ± 0.495
5.153LeuIle: 5.153 ± 1.596
8.302LeuLys: 8.302 ± 1.812
8.589LeuLeu: 8.589 ± 1.695
2.577LeuMet: 2.577 ± 0.837
5.153LeuAsn: 5.153 ± 1.15
1.145LeuPro: 1.145 ± 0.515
3.435LeuGln: 3.435 ± 0.949
4.581LeuArg: 4.581 ± 1.177
10.593LeuSer: 10.593 ± 1.567
2.863LeuThr: 2.863 ± 0.838
4.294LeuVal: 4.294 ± 1.217
0.573LeuTrp: 0.573 ± 0.331
4.867LeuTyr: 4.867 ± 0.882
0.0LeuXaa: 0.0 ± 0.0
Met
2.004MetAla: 2.004 ± 0.77
0.0MetCys: 0.0 ± 0.0
0.573MetAsp: 0.573 ± 0.357
2.577MetGlu: 2.577 ± 1.03
0.859MetPhe: 0.859 ± 0.618
0.286MetGly: 0.286 ± 0.256
0.0MetHis: 0.0 ± 0.0
1.431MetIle: 1.431 ± 0.649
3.149MetLys: 3.149 ± 0.777
1.718MetLeu: 1.718 ± 0.677
0.286MetMet: 0.286 ± 0.277
2.004MetAsn: 2.004 ± 0.756
0.0MetPro: 0.0 ± 0.0
2.004MetGln: 2.004 ± 0.693
1.145MetArg: 1.145 ± 0.468
2.29MetSer: 2.29 ± 0.707
2.29MetThr: 2.29 ± 0.67
2.29MetVal: 2.29 ± 1.056
0.0MetTrp: 0.0 ± 0.0
0.859MetTyr: 0.859 ± 0.415
0.0MetXaa: 0.0 ± 0.0
Asn
4.294AsnAla: 4.294 ± 0.742
0.0AsnCys: 0.0 ± 0.0
3.149AsnAsp: 3.149 ± 1.12
1.718AsnGlu: 1.718 ± 1.098
2.29AsnPhe: 2.29 ± 0.705
3.435AsnGly: 3.435 ± 0.817
0.573AsnHis: 0.573 ± 0.338
4.867AsnIle: 4.867 ± 1.22
4.581AsnLys: 4.581 ± 1.28
2.29AsnLeu: 2.29 ± 0.717
1.145AsnMet: 1.145 ± 0.747
4.867AsnAsn: 4.867 ± 1.595
2.577AsnPro: 2.577 ± 0.717
4.008AsnGln: 4.008 ± 1.03
3.149AsnArg: 3.149 ± 0.699
3.149AsnSer: 3.149 ± 0.666
3.149AsnThr: 3.149 ± 1.01
2.29AsnVal: 2.29 ± 0.672
0.573AsnTrp: 0.573 ± 0.385
3.149AsnTyr: 3.149 ± 0.933
0.0AsnXaa: 0.0 ± 0.0
Pro
0.859ProAla: 0.859 ± 0.583
0.286ProCys: 0.286 ± 0.26
2.004ProAsp: 2.004 ± 0.601
2.577ProGlu: 2.577 ± 0.646
1.431ProPhe: 1.431 ± 0.537
0.573ProGly: 0.573 ± 0.349
0.0ProHis: 0.0 ± 0.0
1.145ProIle: 1.145 ± 0.537
3.722ProLys: 3.722 ± 1.089
1.145ProLeu: 1.145 ± 0.569
0.573ProMet: 0.573 ± 0.361
1.431ProAsn: 1.431 ± 0.59
0.859ProPro: 0.859 ± 0.52
0.573ProGln: 0.573 ± 0.381
0.573ProArg: 0.573 ± 0.36
1.431ProSer: 1.431 ± 0.533
1.718ProThr: 1.718 ± 0.603
1.718ProVal: 1.718 ± 0.734
0.0ProTrp: 0.0 ± 0.0
0.573ProTyr: 0.573 ± 0.409
0.0ProXaa: 0.0 ± 0.0
Gln
2.004GlnAla: 2.004 ± 0.717
0.0GlnCys: 0.0 ± 0.0
2.004GlnAsp: 2.004 ± 0.661
3.722GlnGlu: 3.722 ± 0.92
2.577GlnPhe: 2.577 ± 0.694
2.863GlnGly: 2.863 ± 1.106
0.573GlnHis: 0.573 ± 0.353
1.431GlnIle: 1.431 ± 0.566
4.008GlnLys: 4.008 ± 0.681
4.581GlnLeu: 4.581 ± 0.994
0.573GlnMet: 0.573 ± 0.403
3.149GlnAsn: 3.149 ± 0.867
2.004GlnPro: 2.004 ± 0.908
3.149GlnGln: 3.149 ± 1.075
1.145GlnArg: 1.145 ± 0.476
2.29GlnSer: 2.29 ± 0.793
3.435GlnThr: 3.435 ± 1.075
4.008GlnVal: 4.008 ± 1.18
0.0GlnTrp: 0.0 ± 0.0
1.718GlnTyr: 1.718 ± 0.603
0.0GlnXaa: 0.0 ± 0.0
Arg
2.577ArgAla: 2.577 ± 0.792
0.0ArgCys: 0.0 ± 0.0
3.722ArgAsp: 3.722 ± 0.982
4.008ArgGlu: 4.008 ± 1.056
1.718ArgPhe: 1.718 ± 0.665
0.573ArgGly: 0.573 ± 0.366
0.573ArgHis: 0.573 ± 0.319
3.149ArgIle: 3.149 ± 0.656
5.439ArgLys: 5.439 ± 0.944
5.439ArgLeu: 5.439 ± 1.103
0.573ArgMet: 0.573 ± 0.346
2.29ArgAsn: 2.29 ± 1.098
0.573ArgPro: 0.573 ± 0.489
3.722ArgGln: 3.722 ± 0.796
1.431ArgArg: 1.431 ± 0.532
2.863ArgSer: 2.863 ± 0.833
3.722ArgThr: 3.722 ± 1.459
1.718ArgVal: 1.718 ± 0.574
0.859ArgTrp: 0.859 ± 0.463
2.004ArgTyr: 2.004 ± 0.742
0.0ArgXaa: 0.0 ± 0.0
Ser
2.004SerAla: 2.004 ± 1.015
0.286SerCys: 0.286 ± 0.228
3.722SerAsp: 3.722 ± 0.94
6.298SerGlu: 6.298 ± 1.417
2.577SerPhe: 2.577 ± 0.64
3.149SerGly: 3.149 ± 1.224
0.573SerHis: 0.573 ± 0.46
3.435SerIle: 3.435 ± 0.894
7.157SerLys: 7.157 ± 1.44
5.726SerLeu: 5.726 ± 1.395
1.718SerMet: 1.718 ± 0.522
4.008SerAsn: 4.008 ± 0.933
2.29SerPro: 2.29 ± 0.546
2.29SerGln: 2.29 ± 0.769
2.863SerArg: 2.863 ± 0.973
3.149SerSer: 3.149 ± 1.012
2.29SerThr: 2.29 ± 0.845
2.863SerVal: 2.863 ± 1.036
1.431SerTrp: 1.431 ± 0.625
3.149SerTyr: 3.149 ± 0.704
0.0SerXaa: 0.0 ± 0.0
Thr
3.722ThrAla: 3.722 ± 0.962
0.0ThrCys: 0.0 ± 0.0
2.004ThrAsp: 2.004 ± 0.775
4.581ThrGlu: 4.581 ± 0.983
3.149ThrPhe: 3.149 ± 1.356
3.149ThrGly: 3.149 ± 0.698
1.718ThrHis: 1.718 ± 0.661
4.294ThrIle: 4.294 ± 1.307
4.581ThrLys: 4.581 ± 1.347
5.439ThrLeu: 5.439 ± 1.355
0.859ThrMet: 0.859 ± 0.478
2.29ThrAsn: 2.29 ± 1.042
3.435ThrPro: 3.435 ± 0.97
1.145ThrGln: 1.145 ± 0.735
1.431ThrArg: 1.431 ± 0.651
3.149ThrSer: 3.149 ± 0.824
4.581ThrThr: 4.581 ± 1.435
4.008ThrVal: 4.008 ± 1.105
0.0ThrTrp: 0.0 ± 0.0
4.294ThrTyr: 4.294 ± 0.819
0.0ThrXaa: 0.0 ± 0.0
Val
2.577ValAla: 2.577 ± 0.947
0.0ValCys: 0.0 ± 0.0
4.008ValAsp: 4.008 ± 1.004
5.153ValGlu: 5.153 ± 1.182
3.149ValPhe: 3.149 ± 1.029
2.577ValGly: 2.577 ± 0.849
0.573ValHis: 0.573 ± 0.41
4.867ValIle: 4.867 ± 0.889
4.008ValLys: 4.008 ± 0.971
6.012ValLeu: 6.012 ± 1.317
0.859ValMet: 0.859 ± 0.423
4.294ValAsn: 4.294 ± 0.971
2.29ValPro: 2.29 ± 0.893
0.573ValGln: 0.573 ± 0.411
2.577ValArg: 2.577 ± 0.866
4.581ValSer: 4.581 ± 0.955
3.722ValThr: 3.722 ± 1.955
3.149ValVal: 3.149 ± 1.001
0.286ValTrp: 0.286 ± 0.297
1.431ValTyr: 1.431 ± 0.639
0.0ValXaa: 0.0 ± 0.0
Trp
0.573TrpAla: 0.573 ± 0.331
0.0TrpCys: 0.0 ± 0.0
1.145TrpAsp: 1.145 ± 0.594
1.145TrpGlu: 1.145 ± 0.496
0.286TrpPhe: 0.286 ± 0.297
0.286TrpGly: 0.286 ± 0.244
0.0TrpHis: 0.0 ± 0.0
0.859TrpIle: 0.859 ± 0.518
0.859TrpLys: 0.859 ± 0.476
1.145TrpLeu: 1.145 ± 0.617
0.0TrpMet: 0.0 ± 0.0
0.286TrpAsn: 0.286 ± 0.301
0.0TrpPro: 0.0 ± 0.0
0.859TrpGln: 0.859 ± 0.425
0.286TrpArg: 0.286 ± 0.244
0.286TrpSer: 0.286 ± 0.263
0.573TrpThr: 0.573 ± 0.358
0.573TrpVal: 0.573 ± 0.429
0.286TrpTrp: 0.286 ± 0.285
0.286TrpTyr: 0.286 ± 0.285
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.008TyrAla: 4.008 ± 1.015
0.0TyrCys: 0.0 ± 0.0
1.718TyrAsp: 1.718 ± 0.748
2.863TyrGlu: 2.863 ± 0.893
2.29TyrPhe: 2.29 ± 1.021
2.863TyrGly: 2.863 ± 0.534
1.718TyrHis: 1.718 ± 0.549
1.718TyrIle: 1.718 ± 0.645
4.294TyrLys: 4.294 ± 1.069
5.153TyrLeu: 5.153 ± 0.722
1.718TyrMet: 1.718 ± 0.801
2.863TyrAsn: 2.863 ± 0.841
0.286TyrPro: 0.286 ± 0.323
3.149TyrGln: 3.149 ± 0.781
4.294TyrArg: 4.294 ± 0.967
1.145TyrSer: 1.145 ± 0.548
2.004TyrThr: 2.004 ± 0.811
2.863TyrVal: 2.863 ± 0.902
0.286TyrTrp: 0.286 ± 0.244
2.004TyrTyr: 2.004 ± 0.719
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 27 proteins (3494 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
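Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the 100 estimates is taken as the error. This is a minimal sketch under stated assumptions, not the database's actual procedure: the function names, the toy sequences, the one-letter codes, the fixed seed and the use of the sample standard deviation as the reported error are all choices made for this example.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide over a set of proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    # Assumption: the error is the sample standard deviation of the replicates.
    return statistics.stdev(estimates)


# Toy input, not taken from the Javan89 proteome.
proteins = ["MKKLAE", "MKLE", "MAEKLL", "MKK"]
err = bootstrap_error(proteins, "KL")
print(f"KL: {dipeptide_per_mille(proteins, 'KL'):.1f} ± {err:.1f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins happen to be in the set. A dipeptide that never occurs gets an error of 0.0, which matches the "0.0 ± 0.0" entries in the table.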

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski