Amino acid dipepetide frequency for Streptococcus satellite phage Javan734

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.63AlaAla: 1.63 ± 0.899
1.304AlaCys: 1.304 ± 0.764
2.934AlaAsp: 2.934 ± 0.817
5.867AlaGlu: 5.867 ± 2.088
3.911AlaPhe: 3.911 ± 0.958
1.956AlaGly: 1.956 ± 0.789
0.326AlaHis: 0.326 ± 0.269
3.585AlaIle: 3.585 ± 0.986
5.215AlaLys: 5.215 ± 0.931
3.911AlaLeu: 3.911 ± 0.861
2.608AlaMet: 2.608 ± 1.245
2.282AlaAsn: 2.282 ± 0.941
0.978AlaPro: 0.978 ± 0.473
2.608AlaGln: 2.608 ± 0.685
2.282AlaArg: 2.282 ± 0.595
3.259AlaSer: 3.259 ± 1.096
2.608AlaThr: 2.608 ± 0.959
3.911AlaVal: 3.911 ± 0.892
0.0AlaTrp: 0.0 ± 0.0
3.259AlaTyr: 3.259 ± 1.041
0.0AlaXaa: 0.0 ± 0.0
Cys
0.652CysAla: 0.652 ± 0.454
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.978CysGlu: 0.978 ± 0.644
0.652CysPhe: 0.652 ± 0.497
0.326CysGly: 0.326 ± 0.328
0.652CysHis: 0.652 ± 0.405
0.652CysIle: 0.652 ± 0.429
0.652CysLys: 0.652 ± 0.37
1.304CysLeu: 1.304 ± 0.538
0.652CysMet: 0.652 ± 0.431
0.652CysAsn: 0.652 ± 0.532
0.326CysPro: 0.326 ± 0.285
1.63CysGln: 1.63 ± 1.12
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.326CysThr: 0.326 ± 0.314
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.326CysTyr: 0.326 ± 0.316
0.0CysXaa: 0.0 ± 0.0
Asp
2.934AspAla: 2.934 ± 0.728
0.326AspCys: 0.326 ± 0.285
5.215AspAsp: 5.215 ± 1.172
3.911AspGlu: 3.911 ± 1.087
1.63AspPhe: 1.63 ± 0.526
1.956AspGly: 1.956 ± 0.665
0.326AspHis: 0.326 ± 0.279
7.823AspIle: 7.823 ± 1.329
4.889AspLys: 4.889 ± 0.969
7.497AspLeu: 7.497 ± 1.986
0.978AspMet: 0.978 ± 0.621
3.259AspAsn: 3.259 ± 0.93
0.652AspPro: 0.652 ± 0.367
1.304AspGln: 1.304 ± 0.485
2.608AspArg: 2.608 ± 0.782
1.304AspSer: 1.304 ± 0.518
2.282AspThr: 2.282 ± 1.03
1.956AspVal: 1.956 ± 0.607
0.326AspTrp: 0.326 ± 0.409
2.934AspTyr: 2.934 ± 1.453
0.0AspXaa: 0.0 ± 0.0
Glu
4.237GluAla: 4.237 ± 1.5
1.304GluCys: 1.304 ± 0.636
3.259GluAsp: 3.259 ± 1.364
3.911GluGlu: 3.911 ± 1.138
1.63GluPhe: 1.63 ± 0.704
1.63GluGly: 1.63 ± 0.528
1.304GluHis: 1.304 ± 0.777
5.541GluIle: 5.541 ± 0.98
8.801GluLys: 8.801 ± 2.175
11.082GluLeu: 11.082 ± 2.229
3.911GluMet: 3.911 ± 1.237
7.171GluAsn: 7.171 ± 1.79
2.608GluPro: 2.608 ± 1.02
3.585GluGln: 3.585 ± 1.037
3.585GluArg: 3.585 ± 1.088
4.237GluSer: 4.237 ± 1.375
2.608GluThr: 2.608 ± 0.673
4.563GluVal: 4.563 ± 1.619
0.326GluTrp: 0.326 ± 0.409
4.563GluTyr: 4.563 ± 1.206
0.0GluXaa: 0.0 ± 0.0
Phe
0.978PheAla: 0.978 ± 0.599
0.326PheCys: 0.326 ± 0.314
3.585PheAsp: 3.585 ± 1.088
3.259PheGlu: 3.259 ± 1.117
2.608PhePhe: 2.608 ± 1.141
1.956PheGly: 1.956 ± 0.832
1.304PheHis: 1.304 ± 0.483
3.585PheIle: 3.585 ± 1.223
3.259PheLys: 3.259 ± 1.126
2.608PheLeu: 2.608 ± 0.954
0.652PheMet: 0.652 ± 0.388
4.237PheAsn: 4.237 ± 1.046
1.304PhePro: 1.304 ± 0.73
0.652PheGln: 0.652 ± 0.404
1.304PheArg: 1.304 ± 0.702
3.911PheSer: 3.911 ± 1.126
1.63PheThr: 1.63 ± 0.637
2.934PheVal: 2.934 ± 1.196
0.326PheTrp: 0.326 ± 0.269
2.608PheTyr: 2.608 ± 0.696
0.0PheXaa: 0.0 ± 0.0
Gly
2.608GlyAla: 2.608 ± 1.041
0.326GlyCys: 0.326 ± 0.329
2.608GlyAsp: 2.608 ± 0.804
2.934GlyGlu: 2.934 ± 0.675
1.63GlyPhe: 1.63 ± 0.851
1.956GlyGly: 1.956 ± 0.656
0.652GlyHis: 0.652 ± 0.392
5.215GlyIle: 5.215 ± 1.308
4.237GlyLys: 4.237 ± 1.285
4.563GlyLeu: 4.563 ± 1.209
1.304GlyMet: 1.304 ± 0.534
1.63GlyAsn: 1.63 ± 0.765
0.0GlyPro: 0.0 ± 0.0
1.63GlyGln: 1.63 ± 0.764
2.282GlyArg: 2.282 ± 0.936
2.282GlySer: 2.282 ± 0.785
1.304GlyThr: 1.304 ± 0.599
4.563GlyVal: 4.563 ± 1.235
1.304GlyTrp: 1.304 ± 0.762
4.237GlyTyr: 4.237 ± 1.338
0.0GlyXaa: 0.0 ± 0.0
His
2.608HisAla: 2.608 ± 1.196
0.0HisCys: 0.0 ± 0.0
0.652HisAsp: 0.652 ± 0.424
2.282HisGlu: 2.282 ± 0.859
0.652HisPhe: 0.652 ± 0.434
1.304HisGly: 1.304 ± 0.513
0.326HisHis: 0.326 ± 0.329
0.326HisIle: 0.326 ± 0.329
0.978HisLys: 0.978 ± 0.423
1.63HisLeu: 1.63 ± 1.049
0.652HisMet: 0.652 ± 0.492
1.304HisAsn: 1.304 ± 0.669
1.304HisPro: 1.304 ± 0.595
0.0HisGln: 0.0 ± 0.0
1.304HisArg: 1.304 ± 0.495
0.652HisSer: 0.652 ± 0.703
1.304HisThr: 1.304 ± 0.665
0.326HisVal: 0.326 ± 0.269
0.0HisTrp: 0.0 ± 0.0
0.978HisTyr: 0.978 ± 0.745
0.0HisXaa: 0.0 ± 0.0
Ile
3.911IleAla: 3.911 ± 1.407
0.652IleCys: 0.652 ± 0.485
3.259IleAsp: 3.259 ± 1.103
6.519IleGlu: 6.519 ± 1.536
2.934IlePhe: 2.934 ± 0.944
3.911IleGly: 3.911 ± 0.977
1.304IleHis: 1.304 ± 0.483
4.889IleIle: 4.889 ± 1.305
9.452IleLys: 9.452 ± 1.39
7.171IleLeu: 7.171 ± 1.385
1.304IleMet: 1.304 ± 0.554
2.608IleAsn: 2.608 ± 0.791
2.608IlePro: 2.608 ± 0.716
4.237IleGln: 4.237 ± 1.207
3.585IleArg: 3.585 ± 1.06
3.585IleSer: 3.585 ± 0.937
4.237IleThr: 4.237 ± 0.831
4.237IleVal: 4.237 ± 0.991
0.652IleTrp: 0.652 ± 0.406
2.934IleTyr: 2.934 ± 0.818
0.0IleXaa: 0.0 ± 0.0
Lys
7.497LysAla: 7.497 ± 1.241
0.978LysCys: 0.978 ± 0.556
5.215LysAsp: 5.215 ± 1.048
8.149LysGlu: 8.149 ± 1.55
2.608LysPhe: 2.608 ± 0.967
5.867LysGly: 5.867 ± 1.352
1.956LysHis: 1.956 ± 0.721
6.193LysIle: 6.193 ± 1.192
8.149LysLys: 8.149 ± 2.054
7.823LysLeu: 7.823 ± 1.683
3.259LysMet: 3.259 ± 0.886
6.519LysAsn: 6.519 ± 1.258
2.934LysPro: 2.934 ± 1.053
3.585LysGln: 3.585 ± 1.094
4.563LysArg: 4.563 ± 1.126
6.519LysSer: 6.519 ± 1.405
5.867LysThr: 5.867 ± 1.566
3.585LysVal: 3.585 ± 0.856
2.282LysTrp: 2.282 ± 0.847
3.259LysTyr: 3.259 ± 0.898
0.0LysXaa: 0.0 ± 0.0
Leu
6.845LeuAla: 6.845 ± 0.909
0.326LeuCys: 0.326 ± 0.329
7.171LeuAsp: 7.171 ± 1.892
9.452LeuGlu: 9.452 ± 1.731
5.541LeuPhe: 5.541 ± 1.524
5.215LeuGly: 5.215 ± 1.012
1.956LeuHis: 1.956 ± 0.807
5.541LeuIle: 5.541 ± 1.442
9.452LeuLys: 9.452 ± 1.65
8.149LeuLeu: 8.149 ± 1.323
1.304LeuMet: 1.304 ± 0.555
7.497LeuAsn: 7.497 ± 1.655
3.911LeuPro: 3.911 ± 1.068
2.934LeuGln: 2.934 ± 0.921
4.563LeuArg: 4.563 ± 0.87
5.541LeuSer: 5.541 ± 1.141
8.149LeuThr: 8.149 ± 2.034
3.259LeuVal: 3.259 ± 0.794
0.978LeuTrp: 0.978 ± 0.458
4.563LeuTyr: 4.563 ± 1.148
0.0LeuXaa: 0.0 ± 0.0
Met
0.978MetAla: 0.978 ± 0.558
0.0MetCys: 0.0 ± 0.0
1.63MetAsp: 1.63 ± 0.899
2.934MetGlu: 2.934 ± 0.953
1.956MetPhe: 1.956 ± 0.631
1.63MetGly: 1.63 ± 0.609
0.326MetHis: 0.326 ± 0.311
0.652MetIle: 0.652 ± 0.424
1.63MetLys: 1.63 ± 0.572
1.304MetLeu: 1.304 ± 0.723
1.304MetMet: 1.304 ± 0.797
1.956MetAsn: 1.956 ± 0.745
1.304MetPro: 1.304 ± 0.677
0.652MetGln: 0.652 ± 0.504
2.608MetArg: 2.608 ± 1.091
1.63MetSer: 1.63 ± 0.785
3.585MetThr: 3.585 ± 1.31
0.326MetVal: 0.326 ± 0.29
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.282AsnAla: 2.282 ± 0.64
0.652AsnCys: 0.652 ± 0.458
3.259AsnAsp: 3.259 ± 0.991
3.259AsnGlu: 3.259 ± 0.921
1.63AsnPhe: 1.63 ± 0.78
5.215AsnGly: 5.215 ± 1.202
1.956AsnHis: 1.956 ± 1.003
2.282AsnIle: 2.282 ± 0.853
9.126AsnLys: 9.126 ± 1.791
4.889AsnLeu: 4.889 ± 1.001
2.934AsnMet: 2.934 ± 1.068
1.956AsnAsn: 1.956 ± 0.773
1.956AsnPro: 1.956 ± 0.577
2.608AsnGln: 2.608 ± 0.702
2.282AsnArg: 2.282 ± 0.768
4.563AsnSer: 4.563 ± 1.303
1.956AsnThr: 1.956 ± 0.847
2.608AsnVal: 2.608 ± 0.922
0.652AsnTrp: 0.652 ± 0.357
1.63AsnTyr: 1.63 ± 0.617
0.0AsnXaa: 0.0 ± 0.0
Pro
1.63ProAla: 1.63 ± 0.716
0.0ProCys: 0.0 ± 0.0
2.608ProAsp: 2.608 ± 0.791
3.911ProGlu: 3.911 ± 1.17
1.304ProPhe: 1.304 ± 0.425
0.326ProGly: 0.326 ± 0.311
0.978ProHis: 0.978 ± 0.699
0.978ProIle: 0.978 ± 0.536
2.282ProLys: 2.282 ± 1.047
3.259ProLeu: 3.259 ± 0.774
0.0ProMet: 0.0 ± 0.0
2.282ProAsn: 2.282 ± 0.742
1.63ProPro: 1.63 ± 0.792
0.978ProGln: 0.978 ± 0.543
1.956ProArg: 1.956 ± 0.644
2.282ProSer: 2.282 ± 1.082
1.63ProThr: 1.63 ± 0.815
1.63ProVal: 1.63 ± 0.618
0.0ProTrp: 0.0 ± 0.0
1.63ProTyr: 1.63 ± 0.639
0.0ProXaa: 0.0 ± 0.0
Gln
4.237GlnAla: 4.237 ± 1.588
0.652GlnCys: 0.652 ± 0.402
2.282GlnAsp: 2.282 ± 1.151
2.282GlnGlu: 2.282 ± 0.533
1.63GlnPhe: 1.63 ± 0.834
2.282GlnGly: 2.282 ± 0.749
0.978GlnHis: 0.978 ± 0.413
2.282GlnIle: 2.282 ± 0.744
2.934GlnLys: 2.934 ± 0.818
5.867GlnLeu: 5.867 ± 1.292
1.304GlnMet: 1.304 ± 0.659
1.304GlnAsn: 1.304 ± 0.492
0.652GlnPro: 0.652 ± 0.406
1.304GlnGln: 1.304 ± 0.656
1.63GlnArg: 1.63 ± 0.618
0.978GlnSer: 0.978 ± 0.422
2.282GlnThr: 2.282 ± 0.847
3.585GlnVal: 3.585 ± 1.023
0.326GlnTrp: 0.326 ± 0.285
0.978GlnTyr: 0.978 ± 0.671
0.0GlnXaa: 0.0 ± 0.0
Arg
1.956ArgAla: 1.956 ± 0.852
0.978ArgCys: 0.978 ± 0.587
2.282ArgAsp: 2.282 ± 0.799
5.541ArgGlu: 5.541 ± 1.347
2.282ArgPhe: 2.282 ± 0.94
0.652ArgGly: 0.652 ± 0.427
0.978ArgHis: 0.978 ± 0.521
3.911ArgIle: 3.911 ± 1.225
4.563ArgLys: 4.563 ± 1.418
6.193ArgLeu: 6.193 ± 1.546
0.652ArgMet: 0.652 ± 0.406
1.956ArgAsn: 1.956 ± 1.254
0.652ArgPro: 0.652 ± 0.539
3.911ArgGln: 3.911 ± 1.176
2.934ArgArg: 2.934 ± 0.812
1.956ArgSer: 1.956 ± 0.627
2.282ArgThr: 2.282 ± 0.661
2.934ArgVal: 2.934 ± 0.757
0.326ArgTrp: 0.326 ± 0.314
1.956ArgTyr: 1.956 ± 0.907
0.0ArgXaa: 0.0 ± 0.0
Ser
2.282SerAla: 2.282 ± 0.868
0.326SerCys: 0.326 ± 0.311
2.282SerAsp: 2.282 ± 0.781
3.585SerGlu: 3.585 ± 1.268
1.304SerPhe: 1.304 ± 0.605
1.63SerGly: 1.63 ± 0.733
0.652SerHis: 0.652 ± 0.458
4.563SerIle: 4.563 ± 0.906
6.845SerLys: 6.845 ± 1.556
6.845SerLeu: 6.845 ± 1.363
1.304SerMet: 1.304 ± 0.654
3.585SerAsn: 3.585 ± 0.953
2.282SerPro: 2.282 ± 0.919
1.956SerGln: 1.956 ± 0.822
3.585SerArg: 3.585 ± 0.799
3.585SerSer: 3.585 ± 0.879
2.608SerThr: 2.608 ± 0.758
3.259SerVal: 3.259 ± 1.085
0.326SerTrp: 0.326 ± 0.269
2.608SerTyr: 2.608 ± 0.684
0.0SerXaa: 0.0 ± 0.0
Thr
1.63ThrAla: 1.63 ± 0.663
0.0ThrCys: 0.0 ± 0.0
2.608ThrAsp: 2.608 ± 0.792
3.585ThrGlu: 3.585 ± 0.796
1.956ThrPhe: 1.956 ± 0.772
4.237ThrGly: 4.237 ± 0.971
1.304ThrHis: 1.304 ± 0.549
5.867ThrIle: 5.867 ± 1.031
3.585ThrLys: 3.585 ± 1.026
4.889ThrLeu: 4.889 ± 1.033
0.326ThrMet: 0.326 ± 0.285
0.652ThrAsn: 0.652 ± 0.472
3.259ThrPro: 3.259 ± 0.885
2.934ThrGln: 2.934 ± 1.346
3.585ThrArg: 3.585 ± 0.797
2.608ThrSer: 2.608 ± 0.54
4.237ThrThr: 4.237 ± 0.902
4.237ThrVal: 4.237 ± 1.673
0.326ThrTrp: 0.326 ± 0.331
2.934ThrTyr: 2.934 ± 1.524
0.0ThrXaa: 0.0 ± 0.0
Val
3.259ValAla: 3.259 ± 1.273
0.326ValCys: 0.326 ± 0.269
2.282ValAsp: 2.282 ± 0.973
4.563ValGlu: 4.563 ± 1.187
3.259ValPhe: 3.259 ± 1.036
3.259ValGly: 3.259 ± 1.331
0.0ValHis: 0.0 ± 0.0
4.889ValIle: 4.889 ± 1.145
3.911ValLys: 3.911 ± 0.88
4.563ValLeu: 4.563 ± 0.988
0.978ValMet: 0.978 ± 0.62
3.259ValAsn: 3.259 ± 0.849
1.63ValPro: 1.63 ± 0.805
1.63ValGln: 1.63 ± 0.721
2.282ValArg: 2.282 ± 0.906
3.259ValSer: 3.259 ± 1.077
3.585ValThr: 3.585 ± 0.925
2.282ValVal: 2.282 ± 1.167
0.978ValTrp: 0.978 ± 0.57
1.956ValTyr: 1.956 ± 1.092
0.0ValXaa: 0.0 ± 0.0
Trp
0.326TrpAla: 0.326 ± 0.285
0.326TrpCys: 0.326 ± 0.409
0.0TrpAsp: 0.0 ± 0.0
0.652TrpGlu: 0.652 ± 0.493
0.326TrpPhe: 0.326 ± 0.304
0.326TrpGly: 0.326 ± 0.34
0.0TrpHis: 0.0 ± 0.0
1.304TrpIle: 1.304 ± 0.634
1.63TrpLys: 1.63 ± 0.701
1.956TrpLeu: 1.956 ± 0.69
0.0TrpMet: 0.0 ± 0.0
0.652TrpAsn: 0.652 ± 0.454
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.652TrpSer: 0.652 ± 0.405
0.326TrpThr: 0.326 ± 0.279
0.326TrpVal: 0.326 ± 0.361
0.0TrpTrp: 0.0 ± 0.0
0.326TrpTyr: 0.326 ± 0.304
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.956TyrAla: 1.956 ± 0.689
0.978TyrCys: 0.978 ± 0.453
0.978TyrAsp: 0.978 ± 0.42
2.282TyrGlu: 2.282 ± 0.636
3.585TyrPhe: 3.585 ± 1.056
1.63TyrGly: 1.63 ± 0.711
1.304TyrHis: 1.304 ± 0.654
4.237TyrIle: 4.237 ± 1.036
5.215TyrLys: 5.215 ± 1.871
6.845TyrLeu: 6.845 ± 1.84
0.652TyrMet: 0.652 ± 0.397
2.934TyrAsn: 2.934 ± 0.682
1.304TyrPro: 1.304 ± 0.61
1.63TyrGln: 1.63 ± 0.624
1.956TyrArg: 1.956 ± 0.947
2.608TyrSer: 2.608 ± 0.8
1.956TyrThr: 1.956 ± 0.59
1.63TyrVal: 1.63 ± 0.793
0.0TyrTrp: 0.0 ± 0.0
0.652TyrTyr: 0.652 ± 0.406
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 22 proteins (3069 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski