Amino acid dipeptide frequency for Streptococcus satellite phage Javan236

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
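
The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_frequencies`, the use of one-letter codes (with `X` standing in for Xaa), and the choice to count pairs only within a protein and normalise by the total number of pairs are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins, alphabet=STANDARD, unknown="X"):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Hypothetical helper: the normalisation used here (total number of
    overlapping pairs) is an assumption, not taken from the source page.
    """
    symbols = alphabet + unknown
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to the Xaa symbol.
        cleaned = "".join(c if c in alphabet else unknown for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(symbols, repeat=2)
    }


# Toy input, not the Javan236 proteome.
toy = ["MKKLA", "AKKZ"]
freqs = dipeptide_frequencies(toy)
print(len(freqs))  # number of dipeptide classes, including Xaa
print(round(freqs["KK"], 3))
```

With 21 symbols the result always has 441 entries, and the values sum to 1000 per mille whenever at least one pair was counted.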

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
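
As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400 (0.25%, i.e. 2.5‰). The helper names are hypothetical, and using the uniform baseline as the threshold for "over" and "under" is an assumption; the exact rule behind the red and blue marking is not spelled out on this page.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 % for 400 dipeptides


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the assumed uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


glu_leu = 14.758  # GluLeu value from the table, in per mille
ala_cys = 0.447   # AlaCys value from the table, in per mille
print(per_mille_to_fraction(glu_leu), classify(glu_leu))
print(per_mille_to_fraction(ala_cys), classify(ala_cys))
```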

For more information, see the sequence space article on Wikipedia.

Frequency (‰) ± error for each dipeptide, grouped by first residue (second residue in the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa):
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.447 ± 0.621
AlaAsp: 3.578 ± 1.922
AlaGlu: 3.131 ± 1.314
AlaPhe: 3.131 ± 0.628
AlaGly: 4.472 ± 1.139
AlaHis: 0.447 ± 0.359
AlaIle: 8.05 ± 1.68
AlaLys: 3.131 ± 0.912
AlaLeu: 3.131 ± 1.481
AlaMet: 1.342 ± 1.083
AlaAsn: 2.236 ± 1.132
AlaPro: 1.789 ± 0.813
AlaGln: 0.894 ± 0.453
AlaArg: 4.919 ± 0.849
AlaSer: 4.025 ± 1.351
AlaThr: 5.814 ± 1.521
AlaVal: 1.342 ± 0.969
AlaTrp: 0.447 ± 0.377
AlaTyr: 5.367 ± 1.117
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.447 ± 0.569
CysGlu: 0.447 ± 0.419
CysPhe: 0.447 ± 0.452
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.894 ± 0.523
CysLeu: 1.342 ± 0.838
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.447 ± 0.377
CysGln: 0.447 ± 0.504
CysArg: 0.894 ± 0.664
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.447 ± 0.359
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.342 ± 0.702
AspCys: 0.0 ± 0.0
AspAsp: 5.367 ± 1.79
AspGlu: 6.261 ± 1.764
AspPhe: 2.683 ± 1.123
AspGly: 1.789 ± 0.78
AspHis: 0.894 ± 1.008
AspIle: 4.919 ± 1.475
AspLys: 5.367 ± 1.037
AspLeu: 6.708 ± 1.586
AspMet: 3.131 ± 1.237
AspAsn: 4.919 ± 1.278
AspPro: 1.342 ± 0.819
AspGln: 0.0 ± 0.0
AspArg: 1.342 ± 0.77
AspSer: 2.236 ± 1.071
AspThr: 3.131 ± 0.924
AspVal: 3.131 ± 0.93
AspTrp: 0.447 ± 0.359
AspTyr: 5.814 ± 0.786
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.578 ± 2.069
GluCys: 0.447 ± 0.569
GluAsp: 6.261 ± 2.26
GluGlu: 10.733 ± 3.999
GluPhe: 4.025 ± 1.149
GluGly: 1.789 ± 0.879
GluHis: 1.342 ± 0.737
GluIle: 3.578 ± 2.093
GluLys: 6.708 ± 1.855
GluLeu: 14.758 ± 4.943
GluMet: 0.894 ± 0.839
GluAsn: 2.236 ± 0.872
GluPro: 1.789 ± 0.774
GluGln: 4.472 ± 1.604
GluArg: 4.025 ± 1.171
GluSer: 2.236 ± 0.983
GluThr: 4.025 ± 1.15
GluVal: 8.05 ± 2.237
GluTrp: 0.894 ± 0.465
GluTyr: 3.131 ± 1.281
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.894 ± 0.652
PheCys: 0.894 ± 0.706
PheAsp: 2.236 ± 1.096
PheGlu: 2.236 ± 0.682
PhePhe: 1.789 ± 0.726
PheGly: 0.447 ± 0.377
PheHis: 0.894 ± 0.651
PheIle: 4.025 ± 0.9
PheLys: 4.025 ± 1.513
PheLeu: 4.919 ± 1.743
PheMet: 1.342 ± 0.795
PheAsn: 3.131 ± 0.838
PhePro: 0.447 ± 0.359
PheGln: 0.894 ± 0.579
PheArg: 0.894 ± 0.719
PheSer: 4.472 ± 1.093
PheThr: 0.894 ± 0.719
PheVal: 1.789 ± 0.596
PheTrp: 0.894 ± 0.542
PheTyr: 4.472 ± 1.413
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.342 ± 0.896
GlyCys: 0.447 ± 0.377
GlyAsp: 3.131 ± 1.264
GlyGlu: 3.131 ± 1.257
GlyPhe: 0.894 ± 0.626
GlyGly: 1.342 ± 0.69
GlyHis: 0.894 ± 0.548
GlyIle: 2.236 ± 0.782
GlyLys: 5.367 ± 1.62
GlyLeu: 4.025 ± 1.125
GlyMet: 0.894 ± 0.623
GlyAsn: 1.342 ± 0.763
GlyPro: 0.0 ± 0.0
GlyGln: 0.894 ± 0.464
GlyArg: 1.789 ± 0.822
GlySer: 1.342 ± 0.708
GlyThr: 3.131 ± 1.511
GlyVal: 4.472 ± 1.363
GlyTrp: 0.894 ± 0.495
GlyTyr: 3.131 ± 1.072
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.342 ± 1.132
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 2.236 ± 0.959
HisPhe: 0.447 ± 0.359
HisGly: 0.894 ± 0.651
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 1.342 ± 0.815
HisLeu: 1.789 ± 0.929
HisMet: 0.0 ± 0.0
HisAsn: 1.342 ± 1.121
HisPro: 0.447 ± 0.359
HisGln: 0.894 ± 0.719
HisArg: 0.447 ± 0.359
HisSer: 0.447 ± 0.377
HisThr: 0.447 ± 0.377
HisVal: 1.342 ± 0.822
HisTrp: 0.447 ± 0.452
HisTyr: 1.789 ± 0.735
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.708 ± 2.135
IleCys: 0.447 ± 0.452
IleAsp: 5.814 ± 1.814
IleGlu: 5.367 ± 1.763
IlePhe: 4.025 ± 1.395
IleGly: 2.236 ± 0.731
IleHis: 0.894 ± 0.453
IleIle: 4.919 ± 1.1
IleLys: 8.945 ± 1.965
IleLeu: 5.367 ± 1.417
IleMet: 2.683 ± 1.054
IleAsn: 8.945 ± 2.15
IlePro: 4.025 ± 1.349
IleGln: 3.578 ± 1.496
IleArg: 1.789 ± 0.837
IleSer: 1.789 ± 1.186
IleThr: 5.367 ± 1.337
IleVal: 1.342 ± 0.436
IleTrp: 0.894 ± 0.593
IleTyr: 2.236 ± 0.881
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.814 ± 1.928
LysCys: 0.0 ± 0.0
LysAsp: 4.919 ± 1.075
LysGlu: 10.733 ± 1.894
LysPhe: 1.342 ± 0.74
LysGly: 3.131 ± 1.106
LysHis: 0.894 ± 0.755
LysIle: 8.05 ± 2.045
LysLys: 10.733 ± 1.835
LysLeu: 7.156 ± 1.767
LysMet: 1.342 ± 0.777
LysAsn: 4.919 ± 1.819
LysPro: 2.683 ± 1.018
LysGln: 6.708 ± 1.755
LysArg: 4.919 ± 1.375
LysSer: 5.367 ± 1.164
LysThr: 6.261 ± 1.42
LysVal: 4.472 ± 1.3
LysTrp: 0.894 ± 0.523
LysTyr: 5.367 ± 1.271
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.945 ± 1.666
LeuCys: 0.0 ± 0.0
LeuAsp: 6.708 ± 1.34
LeuGlu: 9.392 ± 1.423
LeuPhe: 4.919 ± 1.119
LeuGly: 6.708 ± 1.511
LeuHis: 1.789 ± 0.65
LeuIle: 5.367 ± 1.716
LeuLys: 9.392 ± 1.992
LeuLeu: 6.261 ± 1.558
LeuMet: 1.789 ± 0.813
LeuAsn: 9.392 ± 2.298
LeuPro: 3.578 ± 0.864
LeuGln: 5.367 ± 2.189
LeuArg: 5.367 ± 2.378
LeuSer: 3.578 ± 2.121
LeuThr: 4.472 ± 0.963
LeuVal: 3.578 ± 0.94
LeuTrp: 0.894 ± 0.544
LeuTyr: 5.367 ± 1.538
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.789 ± 0.982
MetCys: 0.0 ± 0.0
MetAsp: 0.894 ± 0.695
MetGlu: 1.789 ± 1.068
MetPhe: 0.447 ± 0.359
MetGly: 0.447 ± 0.569
MetHis: 0.0 ± 0.0
MetIle: 1.342 ± 1.0
MetLys: 1.342 ± 0.907
MetLeu: 4.025 ± 1.501
MetMet: 1.342 ± 0.723
MetAsn: 0.894 ± 0.464
MetPro: 0.894 ± 0.862
MetGln: 1.342 ± 0.915
MetArg: 0.0 ± 0.0
MetSer: 1.342 ± 0.643
MetThr: 3.578 ± 0.85
MetVal: 0.447 ± 0.471
MetTrp: 0.0 ± 0.0
MetTyr: 0.894 ± 0.575
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.025 ± 0.784
AsnCys: 0.894 ± 0.465
AsnAsp: 2.683 ± 1.158
AsnGlu: 2.236 ± 0.898
AsnPhe: 1.342 ± 1.027
AsnGly: 4.919 ± 1.295
AsnHis: 1.342 ± 0.677
AsnIle: 4.919 ± 1.394
AsnLys: 6.708 ± 1.325
AsnLeu: 3.578 ± 1.181
AsnMet: 1.789 ± 0.832
AsnAsn: 2.236 ± 0.749
AsnPro: 3.578 ± 1.587
AsnGln: 2.683 ± 0.713
AsnArg: 4.919 ± 1.192
AsnSer: 4.025 ± 1.619
AsnThr: 4.025 ± 1.194
AsnVal: 1.789 ± 0.749
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.342 ± 0.982
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.683 ± 1.119
ProCys: 0.0 ± 0.0
ProAsp: 2.236 ± 1.082
ProGlu: 2.683 ± 0.93
ProPhe: 2.236 ± 0.729
ProGly: 0.0 ± 0.0
ProHis: 0.0 ± 0.0
ProIle: 4.472 ± 1.304
ProLys: 3.131 ± 1.028
ProLeu: 4.919 ± 1.463
ProMet: 0.447 ± 0.419
ProAsn: 1.789 ± 1.437
ProPro: 1.789 ± 0.917
ProGln: 0.447 ± 0.359
ProArg: 1.789 ± 0.739
ProSer: 0.0 ± 0.0
ProThr: 1.789 ± 0.824
ProVal: 0.894 ± 0.593
ProTrp: 0.0 ± 0.0
ProTyr: 0.894 ± 0.719
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.578 ± 1.039
GlnCys: 0.0 ± 0.0
GlnAsp: 2.236 ± 1.042
GlnGlu: 4.472 ± 1.857
GlnPhe: 1.789 ± 0.811
GlnGly: 1.789 ± 0.794
GlnHis: 0.447 ± 0.377
GlnIle: 4.472 ± 1.247
GlnLys: 3.578 ± 0.978
GlnLeu: 4.919 ± 0.89
GlnMet: 0.0 ± 0.0
GlnAsn: 0.894 ± 0.542
GlnPro: 1.789 ± 0.78
GlnGln: 1.789 ± 0.787
GlnArg: 0.0 ± 0.0
GlnSer: 3.131 ± 1.712
GlnThr: 3.131 ± 1.289
GlnVal: 3.131 ± 1.012
GlnTrp: 0.447 ± 0.621
GlnTyr: 2.683 ± 0.687
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.236 ± 1.063
ArgCys: 0.0 ± 0.0
ArgAsp: 3.131 ± 0.962
ArgGlu: 3.578 ± 1.207
ArgPhe: 2.236 ± 1.185
ArgGly: 2.236 ± 0.922
ArgHis: 1.789 ± 1.045
ArgIle: 3.578 ± 0.925
ArgLys: 3.578 ± 1.337
ArgLeu: 3.578 ± 0.851
ArgMet: 0.894 ± 0.532
ArgAsn: 0.894 ± 0.911
ArgPro: 0.894 ± 0.593
ArgGln: 2.236 ± 0.99
ArgArg: 2.683 ± 0.893
ArgSer: 0.894 ± 0.701
ArgThr: 4.025 ± 1.277
ArgVal: 1.342 ± 0.645
ArgTrp: 0.447 ± 0.662
ArgTyr: 2.236 ± 1.466
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.683 ± 1.33
SerCys: 0.894 ± 0.701
SerAsp: 4.025 ± 1.256
SerGlu: 2.236 ± 0.807
SerPhe: 2.683 ± 0.784
SerGly: 0.0 ± 0.0
SerHis: 0.0 ± 0.0
SerIle: 2.683 ± 0.923
SerLys: 3.131 ± 0.942
SerLeu: 5.367 ± 1.623
SerMet: 1.342 ± 0.654
SerAsn: 1.789 ± 0.728
SerPro: 1.342 ± 0.639
SerGln: 3.131 ± 1.476
SerArg: 1.342 ± 0.722
SerSer: 1.789 ± 0.899
SerThr: 2.236 ± 1.406
SerVal: 3.131 ± 0.856
SerTrp: 0.0 ± 0.0
SerTyr: 3.131 ± 1.442
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.025 ± 0.9
ThrCys: 0.447 ± 0.419
ThrAsp: 2.683 ± 1.002
ThrGlu: 5.814 ± 1.3
ThrPhe: 2.683 ± 1.409
ThrGly: 3.131 ± 1.376
ThrHis: 0.447 ± 0.377
ThrIle: 7.156 ± 1.629
ThrLys: 5.814 ± 1.945
ThrLeu: 6.261 ± 2.294
ThrMet: 0.0 ± 0.0
ThrAsn: 3.131 ± 1.747
ThrPro: 3.578 ± 2.024
ThrGln: 0.894 ± 0.701
ThrArg: 2.683 ± 0.828
ThrSer: 2.236 ± 0.693
ThrThr: 3.131 ± 0.852
ThrVal: 3.578 ± 1.393
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.683 ± 0.784
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.683 ± 1.041
ValCys: 0.0 ± 0.0
ValAsp: 1.342 ± 0.802
ValGlu: 4.919 ± 1.498
ValPhe: 2.683 ± 1.147
ValGly: 1.342 ± 1.097
ValHis: 0.894 ± 0.755
ValIle: 4.025 ± 1.276
ValLys: 4.472 ± 1.252
ValLeu: 6.261 ± 0.843
ValMet: 1.789 ± 1.308
ValAsn: 4.025 ± 1.776
ValPro: 1.342 ± 0.536
ValGln: 1.789 ± 0.828
ValArg: 0.894 ± 0.593
ValSer: 1.342 ± 0.766
ValThr: 2.683 ± 0.871
ValVal: 2.236 ± 0.885
ValTrp: 0.0 ± 0.0
ValTyr: 2.683 ± 1.092
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.447 ± 0.377
TrpCys: 0.0 ± 0.0
TrpAsp: 0.447 ± 0.377
TrpGlu: 0.447 ± 0.662
TrpPhe: 0.447 ± 0.448
TrpGly: 0.447 ± 0.504
TrpHis: 0.447 ± 0.471
TrpIle: 0.447 ± 0.452
TrpLys: 0.894 ± 0.719
TrpLeu: 0.894 ± 0.719
TrpMet: 0.0 ± 0.0
TrpAsn: 0.447 ± 0.359
TrpPro: 0.0 ± 0.0
TrpGln: 0.894 ± 0.55
TrpArg: 0.0 ± 0.0
TrpSer: 0.447 ± 0.377
TrpThr: 0.447 ± 0.621
TrpVal: 0.894 ± 0.698
TrpTrp: 0.447 ± 0.377
TrpTyr: 0.447 ± 0.359
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.578 ± 0.86
TyrCys: 0.894 ± 0.532
TyrAsp: 2.683 ± 1.224
TyrGlu: 3.131 ± 1.705
TyrPhe: 1.342 ± 0.755
TyrGly: 3.578 ± 1.235
TyrHis: 2.236 ± 0.895
TyrIle: 3.131 ± 0.704
TyrLys: 7.156 ± 1.387
TyrLeu: 7.603 ± 2.5
TyrMet: 1.342 ± 0.807
TyrAsn: 4.472 ± 1.059
TyrPro: 0.447 ± 0.359
TyrGln: 4.919 ± 1.669
TyrArg: 1.789 ± 1.109
TyrSer: 2.236 ± 0.878
TyrThr: 1.789 ± 0.697
TyrVal: 0.894 ± 0.755
TyrTrp: 0.447 ± 0.359
TyrTyr: 2.683 ± 1.038
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 14 proteins (2237 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
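
Bootstrapping at the protein level can be illustrated as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is only a sketch under stated assumptions. The function names are hypothetical, and reporting the sample standard deviation of the 100 resampled estimates is an assumption; the note above states only the number of resamples and the resampling unit.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples.

    Assumption: the error is the sample standard deviation of the
    resampled estimates; the source does not state which statistic is used.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not the Javan236 proteome.
toy = ["MKKLAEL", "AELKKT", "MTEELLK", "GSAKKEL"]
err = bootstrap_error(toy, "KK")
print(dipeptide_per_mille(toy, "KK"), err)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in a small proteome such as this one (14 proteins).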

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski