Amino acid dipeptide frequency for Winogradskyella epiphytica

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 should be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
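The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function names `dipeptide_permille` and `classify`, the use of overlapping two-residue windows within each protein, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

# Uniform expectation for 400 dipeptides: 1000 / 400 = 2.5 per mille (0.25%).
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} (hypothetical helper)."""
    counts = Counter()
    for seq in sequences:
        # Map every non-standard or unknown residue to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Assumption: overlapping windows of length 2, never spanning two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"    # marked in red in the table
    if freq_permille < EXPECTED_PERMILLE:
        return "under"   # marked in blue in the table
    return "expected"


toy = ["MKLLA", "MKXLL"]  # hypothetical sequences, not from this proteome
freqs = dipeptide_permille(toy)
print(classify(8.526))  # LeuLeu value from the table
print(classify(0.092))  # CysCys value from the table
```

With the table values, LeuLeu (8.526‰) falls above the 2.5‰ expectation and CysCys (0.092‰) falls below it.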

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
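As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a fraction and a percentage. The helper name `permille_to_fraction` is hypothetical, and the approximate count at the end assumes that dipeptides are counted as overlapping pairs within each protein, which this page does not state.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


leu_leu = permille_to_fraction(8.526)  # LeuLeu value from the table
print(f"fraction: {leu_leu:.6f}")
print(f"percent:  {leu_leu * 100:.4f} %")

# Assumption: a protein of length n contributes n - 1 overlapping dipeptides,
# so the proteome holds (total amino acids - number of proteins) dipeptides.
n_dipeptides = 994994 - 3007
print(f"approximate LeuLeu count: {leu_leu * n_dipeptides:.0f}")
```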

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.884AlaAla: 3.884 ± 0.086
0.571AlaCys: 0.571 ± 0.037
3.299AlaAsp: 3.299 ± 0.066
4.358AlaGlu: 4.358 ± 0.073
3.258AlaPhe: 3.258 ± 0.059
3.805AlaGly: 3.805 ± 0.079
1.15AlaHis: 1.15 ± 0.038
5.494AlaIle: 5.494 ± 0.083
4.793AlaLys: 4.793 ± 0.088
6.264AlaLeu: 6.264 ± 0.099
1.527AlaMet: 1.527 ± 0.05
3.468AlaAsn: 3.468 ± 0.066
1.725AlaPro: 1.725 ± 0.053
2.351AlaGln: 2.351 ± 0.055
1.895AlaArg: 1.895 ± 0.043
4.039AlaSer: 4.039 ± 0.068
3.696AlaThr: 3.696 ± 0.106
3.862AlaVal: 3.862 ± 0.067
0.561AlaTrp: 0.561 ± 0.023
2.468AlaTyr: 2.468 ± 0.053
0.0AlaXaa: 0.0 ± 0.0
Cys
0.443CysAla: 0.443 ± 0.024
0.092CysCys: 0.092 ± 0.009
0.46CysAsp: 0.46 ± 0.03
0.483CysGlu: 0.483 ± 0.02
0.39CysPhe: 0.39 ± 0.023
0.668CysGly: 0.668 ± 0.094
0.17CysHis: 0.17 ± 0.014
0.569CysIle: 0.569 ± 0.027
0.457CysLys: 0.457 ± 0.022
0.628CysLeu: 0.628 ± 0.026
0.138CysMet: 0.138 ± 0.013
0.408CysAsn: 0.408 ± 0.022
0.347CysPro: 0.347 ± 0.029
0.202CysGln: 0.202 ± 0.014
0.177CysArg: 0.177 ± 0.014
0.575CysSer: 0.575 ± 0.03
0.401CysThr: 0.401 ± 0.025
0.44CysVal: 0.44 ± 0.022
0.043CysTrp: 0.043 ± 0.007
0.287CysTyr: 0.287 ± 0.02
0.0CysXaa: 0.0 ± 0.0
Asp
3.785AspAla: 3.785 ± 0.07
0.431AspCys: 0.431 ± 0.024
3.379AspAsp: 3.379 ± 0.069
3.931AspGlu: 3.931 ± 0.063
3.587AspPhe: 3.587 ± 0.065
3.65AspGly: 3.65 ± 0.073
0.982AspHis: 0.982 ± 0.038
4.818AspIle: 4.818 ± 0.068
4.272AspLys: 4.272 ± 0.074
5.601AspLeu: 5.601 ± 0.081
1.208AspMet: 1.208 ± 0.037
3.303AspAsn: 3.303 ± 0.053
1.69AspPro: 1.69 ± 0.052
1.593AspGln: 1.593 ± 0.038
1.946AspArg: 1.946 ± 0.041
3.62AspSer: 3.62 ± 0.06
2.952AspThr: 2.952 ± 0.067
3.821AspVal: 3.821 ± 0.066
0.724AspTrp: 0.724 ± 0.026
3.081AspTyr: 3.081 ± 0.064
0.0AspXaa: 0.0 ± 0.0
Glu
4.72GluAla: 4.72 ± 0.089
0.39GluCys: 0.39 ± 0.037
4.088GluAsp: 4.088 ± 0.066
4.906GluGlu: 4.906 ± 0.101
3.297GluPhe: 3.297 ± 0.06
3.794GluGly: 3.794 ± 0.062
1.276GluHis: 1.276 ± 0.042
5.392GluIle: 5.392 ± 0.081
5.387GluLys: 5.387 ± 0.098
6.593GluLeu: 6.593 ± 0.097
1.528GluMet: 1.528 ± 0.042
4.453GluAsn: 4.453 ± 0.079
1.671GluPro: 1.671 ± 0.052
2.327GluGln: 2.327 ± 0.055
2.65GluArg: 2.65 ± 0.063
3.71GluSer: 3.71 ± 0.063
3.949GluThr: 3.949 ± 0.066
4.386GluVal: 4.386 ± 0.073
0.619GluTrp: 0.619 ± 0.026
2.384GluTyr: 2.384 ± 0.052
0.0GluXaa: 0.0 ± 0.0
Phe
2.859PheAla: 2.859 ± 0.057
0.392PheCys: 0.392 ± 0.021
3.41PheAsp: 3.41 ± 0.075
3.522PheGlu: 3.522 ± 0.069
2.48PhePhe: 2.48 ± 0.068
3.668PheGly: 3.668 ± 0.068
0.772PheHis: 0.772 ± 0.028
4.028PheIle: 4.028 ± 0.081
4.035PheLys: 4.035 ± 0.074
4.386PheLeu: 4.386 ± 0.085
1.093PheMet: 1.093 ± 0.039
3.513PheAsn: 3.513 ± 0.076
1.61PhePro: 1.61 ± 0.041
1.608PheGln: 1.608 ± 0.038
1.552PheArg: 1.552 ± 0.037
3.87PheSer: 3.87 ± 0.07
2.938PheThr: 2.938 ± 0.054
3.035PheVal: 3.035 ± 0.053
0.514PheTrp: 0.514 ± 0.025
2.122PheTyr: 2.122 ± 0.048
0.0PheXaa: 0.0 ± 0.0
Gly
3.827GlyAla: 3.827 ± 0.079
0.536GlyCys: 0.536 ± 0.032
3.526GlyAsp: 3.526 ± 0.076
3.689GlyGlu: 3.689 ± 0.069
3.528GlyPhe: 3.528 ± 0.067
4.234GlyGly: 4.234 ± 0.088
1.2GlyHis: 1.2 ± 0.033
5.099GlyIle: 5.099 ± 0.085
4.597GlyLys: 4.597 ± 0.087
5.749GlyLeu: 5.749 ± 0.092
1.521GlyMet: 1.521 ± 0.039
3.462GlyAsn: 3.462 ± 0.099
1.311GlyPro: 1.311 ± 0.037
1.857GlyGln: 1.857 ± 0.045
2.068GlyArg: 2.068 ± 0.052
3.891GlySer: 3.891 ± 0.077
3.924GlyThr: 3.924 ± 0.094
4.188GlyVal: 4.188 ± 0.073
0.667GlyTrp: 0.667 ± 0.031
2.69GlyTyr: 2.69 ± 0.058
0.0GlyXaa: 0.0 ± 0.0
His
0.936HisAla: 0.936 ± 0.031
0.188HisCys: 0.188 ± 0.014
0.924HisAsp: 0.924 ± 0.029
0.993HisGlu: 0.993 ± 0.035
1.092HisPhe: 1.092 ± 0.036
1.06HisGly: 1.06 ± 0.034
0.534HisHis: 0.534 ± 0.026
1.594HisIle: 1.594 ± 0.038
1.285HisLys: 1.285 ± 0.036
1.835HisLeu: 1.835 ± 0.043
0.359HisMet: 0.359 ± 0.019
1.031HisAsn: 1.031 ± 0.033
0.884HisPro: 0.884 ± 0.03
0.712HisGln: 0.712 ± 0.028
0.655HisArg: 0.655 ± 0.025
1.157HisSer: 1.157 ± 0.033
0.961HisThr: 0.961 ± 0.032
0.969HisVal: 0.969 ± 0.038
0.221HisTrp: 0.221 ± 0.016
0.925HisTyr: 0.925 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
5.551IleAla: 5.551 ± 0.083
0.65IleCys: 0.65 ± 0.027
5.178IleAsp: 5.178 ± 0.067
5.894IleGlu: 5.894 ± 0.087
3.467IlePhe: 3.467 ± 0.065
5.036IleGly: 5.036 ± 0.093
1.326IleHis: 1.326 ± 0.041
6.349IleIle: 6.349 ± 0.095
5.942IleLys: 5.942 ± 0.091
7.065IleLeu: 7.065 ± 0.107
1.48IleMet: 1.48 ± 0.04
4.855IleAsn: 4.855 ± 0.078
3.299IlePro: 3.299 ± 0.057
2.56IleGln: 2.56 ± 0.057
2.61IleArg: 2.61 ± 0.05
5.846IleSer: 5.846 ± 0.076
4.896IleThr: 4.896 ± 0.098
4.906IleVal: 4.906 ± 0.074
0.676IleTrp: 0.676 ± 0.025
2.88IleTyr: 2.88 ± 0.065
0.0IleXaa: 0.0 ± 0.0
Lys
5.146LysAla: 5.146 ± 0.093
0.334LysCys: 0.334 ± 0.021
4.721LysAsp: 4.721 ± 0.084
5.7LysGlu: 5.7 ± 0.106
3.067LysPhe: 3.067 ± 0.067
4.24LysGly: 4.24 ± 0.072
1.616LysHis: 1.616 ± 0.049
5.73LysIle: 5.73 ± 0.086
6.199LysLys: 6.199 ± 0.116
6.602LysLeu: 6.602 ± 0.094
1.912LysMet: 1.912 ± 0.049
4.863LysAsn: 4.863 ± 0.077
2.514LysPro: 2.514 ± 0.064
2.921LysGln: 2.921 ± 0.059
3.149LysArg: 3.149 ± 0.057
4.995LysSer: 4.995 ± 0.076
4.879LysThr: 4.879 ± 0.085
4.796LysVal: 4.796 ± 0.075
0.739LysTrp: 0.739 ± 0.027
2.888LysTyr: 2.888 ± 0.055
0.0LysXaa: 0.0 ± 0.0
Leu
5.605LeuAla: 5.605 ± 0.08
0.659LeuCys: 0.659 ± 0.028
5.329LeuAsp: 5.329 ± 0.085
6.224LeuGlu: 6.224 ± 0.103
4.766LeuPhe: 4.766 ± 0.095
5.612LeuGly: 5.612 ± 0.082
1.587LeuHis: 1.587 ± 0.042
7.323LeuIle: 7.323 ± 0.103
8.179LeuLys: 8.179 ± 0.09
8.526LeuLeu: 8.526 ± 0.126
2.155LeuMet: 2.155 ± 0.043
5.94LeuAsn: 5.94 ± 0.099
3.448LeuPro: 3.448 ± 0.06
3.037LeuGln: 3.037 ± 0.061
3.278LeuArg: 3.278 ± 0.064
6.673LeuSer: 6.673 ± 0.094
5.165LeuThr: 5.165 ± 0.088
5.478LeuVal: 5.478 ± 0.082
0.814LeuTrp: 0.814 ± 0.033
3.271LeuTyr: 3.271 ± 0.059
0.0LeuXaa: 0.0 ± 0.0
Met
1.674MetAla: 1.674 ± 0.04
0.133MetCys: 0.133 ± 0.01
1.185MetAsp: 1.185 ± 0.037
1.271MetGlu: 1.271 ± 0.044
0.897MetPhe: 0.897 ± 0.033
1.255MetGly: 1.255 ± 0.039
0.419MetHis: 0.419 ± 0.021
1.619MetIle: 1.619 ± 0.042
2.145MetLys: 2.145 ± 0.043
1.965MetLeu: 1.965 ± 0.049
0.611MetMet: 0.611 ± 0.03
1.27MetAsn: 1.27 ± 0.033
0.864MetPro: 0.864 ± 0.031
0.809MetGln: 0.809 ± 0.027
0.89MetArg: 0.89 ± 0.031
1.541MetSer: 1.541 ± 0.042
1.238MetThr: 1.238 ± 0.032
1.372MetVal: 1.372 ± 0.041
0.153MetTrp: 0.153 ± 0.014
0.723MetTyr: 0.723 ± 0.027
0.0MetXaa: 0.0 ± 0.0
Asn
3.744AsnAla: 3.744 ± 0.067
0.492AsnCys: 0.492 ± 0.028
3.454AsnAsp: 3.454 ± 0.075
3.79AsnGlu: 3.79 ± 0.063
3.07AsnPhe: 3.07 ± 0.07
3.842AsnGly: 3.842 ± 0.095
1.137AsnHis: 1.137 ± 0.034
4.758AsnIle: 4.758 ± 0.073
4.308AsnLys: 4.308 ± 0.07
5.558AsnLeu: 5.558 ± 0.094
1.283AsnMet: 1.283 ± 0.034
3.741AsnAsn: 3.741 ± 0.086
2.698AsnPro: 2.698 ± 0.058
2.24AsnGln: 2.24 ± 0.053
2.22AsnArg: 2.22 ± 0.048
4.226AsnSer: 4.226 ± 0.079
3.641AsnThr: 3.641 ± 0.086
3.518AsnVal: 3.518 ± 0.067
0.751AsnTrp: 0.751 ± 0.03
2.96AsnTyr: 2.96 ± 0.057
0.0AsnXaa: 0.0 ± 0.0
Pro
1.697ProAla: 1.697 ± 0.041
0.217ProCys: 0.217 ± 0.015
1.965ProAsp: 1.965 ± 0.053
2.811ProGlu: 2.811 ± 0.054
1.828ProPhe: 1.828 ± 0.046
1.733ProGly: 1.733 ± 0.052
0.635ProHis: 0.635 ± 0.028
2.804ProIle: 2.804 ± 0.058
2.596ProLys: 2.596 ± 0.061
2.949ProLeu: 2.949 ± 0.06
0.754ProMet: 0.754 ± 0.029
2.418ProAsn: 2.418 ± 0.057
0.736ProPro: 0.736 ± 0.029
1.072ProGln: 1.072 ± 0.031
0.918ProArg: 0.918 ± 0.031
2.15ProSer: 2.15 ± 0.049
2.016ProThr: 2.016 ± 0.054
2.053ProVal: 2.053 ± 0.048
0.283ProTrp: 0.283 ± 0.019
1.417ProTyr: 1.417 ± 0.041
0.0ProXaa: 0.0 ± 0.0
Gln
1.971GlnAla: 1.971 ± 0.051
0.192GlnCys: 0.192 ± 0.014
1.747GlnAsp: 1.747 ± 0.04
2.196GlnGlu: 2.196 ± 0.053
1.771GlnPhe: 1.771 ± 0.046
1.759GlnGly: 1.759 ± 0.043
0.613GlnHis: 0.613 ± 0.027
2.649GlnIle: 2.649 ± 0.051
2.588GlnLys: 2.588 ± 0.056
3.619GlnLeu: 3.619 ± 0.072
0.807GlnMet: 0.807 ± 0.028
2.12GlnAsn: 2.12 ± 0.05
1.169GlnPro: 1.169 ± 0.036
1.355GlnGln: 1.355 ± 0.043
1.259GlnArg: 1.259 ± 0.039
1.993GlnSer: 1.993 ± 0.044
1.908GlnThr: 1.908 ± 0.049
1.991GlnVal: 1.991 ± 0.05
0.334GlnTrp: 0.334 ± 0.019
1.239GlnTyr: 1.239 ± 0.036
0.0GlnXaa: 0.0 ± 0.0
Arg
2.147ArgAla: 2.147 ± 0.047
0.2ArgCys: 0.2 ± 0.014
1.847ArgAsp: 1.847 ± 0.048
2.189ArgGlu: 2.189 ± 0.047
1.885ArgPhe: 1.885 ± 0.049
1.857ArgGly: 1.857 ± 0.046
0.678ArgHis: 0.678 ± 0.028
2.903ArgIle: 2.903 ± 0.056
2.739ArgLys: 2.739 ± 0.06
3.446ArgLeu: 3.446 ± 0.054
0.84ArgMet: 0.84 ± 0.028
2.008ArgAsn: 2.008 ± 0.047
1.07ArgPro: 1.07 ± 0.035
1.171ArgGln: 1.171 ± 0.039
1.383ArgArg: 1.383 ± 0.043
1.943ArgSer: 1.943 ± 0.045
1.942ArgThr: 1.942 ± 0.046
2.188ArgVal: 2.188 ± 0.047
0.356ArgTrp: 0.356 ± 0.019
1.596ArgTyr: 1.596 ± 0.045
0.0ArgXaa: 0.0 ± 0.0
Ser
3.88SerAla: 3.88 ± 0.071
0.632SerCys: 0.632 ± 0.025
3.76SerAsp: 3.76 ± 0.051
4.452SerGlu: 4.452 ± 0.071
3.672SerPhe: 3.672 ± 0.062
4.475SerGly: 4.475 ± 0.084
1.142SerHis: 1.142 ± 0.04
5.463SerIle: 5.463 ± 0.075
5.267SerLys: 5.267 ± 0.087
6.054SerLeu: 6.054 ± 0.086
1.306SerMet: 1.306 ± 0.039
4.104SerAsn: 4.104 ± 0.079
2.078SerPro: 2.078 ± 0.052
2.16SerGln: 2.16 ± 0.048
2.151SerArg: 2.151 ± 0.054
4.45SerSer: 4.45 ± 0.089
3.871SerThr: 3.871 ± 0.072
4.138SerVal: 4.138 ± 0.077
0.633SerTrp: 0.633 ± 0.026
2.781SerTyr: 2.781 ± 0.06
0.0SerXaa: 0.0 ± 0.0
Thr
3.655ThrAla: 3.655 ± 0.099
0.394ThrCys: 0.394 ± 0.031
3.369ThrAsp: 3.369 ± 0.084
3.868ThrGlu: 3.868 ± 0.062
2.946ThrPhe: 2.946 ± 0.055
3.869ThrGly: 3.869 ± 0.084
1.061ThrHis: 1.061 ± 0.04
5.148ThrIle: 5.148 ± 0.093
3.937ThrLys: 3.937 ± 0.071
5.456ThrLeu: 5.456 ± 0.077
1.103ThrMet: 1.103 ± 0.035
3.459ThrAsn: 3.459 ± 0.077
2.269ThrPro: 2.269 ± 0.061
1.83ThrGln: 1.83 ± 0.043
1.689ThrArg: 1.689 ± 0.042
4.0ThrSer: 4.0 ± 0.077
3.701ThrThr: 3.701 ± 0.098
3.871ThrVal: 3.871 ± 0.104
0.546ThrTrp: 0.546 ± 0.033
2.643ThrTyr: 2.643 ± 0.063
0.0ThrXaa: 0.0 ± 0.0
Val
4.0ValAla: 4.0 ± 0.076
0.49ValCys: 0.49 ± 0.022
3.667ValAsp: 3.667 ± 0.063
4.311ValGlu: 4.311 ± 0.07
3.379ValPhe: 3.379 ± 0.06
3.867ValGly: 3.867 ± 0.069
0.957ValHis: 0.957 ± 0.032
5.067ValIle: 5.067 ± 0.082
4.36ValLys: 4.36 ± 0.079
5.963ValLeu: 5.963 ± 0.089
1.304ValMet: 1.304 ± 0.035
3.558ValAsn: 3.558 ± 0.062
2.061ValPro: 2.061 ± 0.049
1.732ValGln: 1.732 ± 0.045
1.959ValArg: 1.959 ± 0.044
4.478ValSer: 4.478 ± 0.079
3.718ValThr: 3.718 ± 0.102
4.242ValVal: 4.242 ± 0.075
0.553ValTrp: 0.553 ± 0.024
2.385ValTyr: 2.385 ± 0.051
0.0ValXaa: 0.0 ± 0.0
Trp
0.545TrpAla: 0.545 ± 0.025
0.075TrpCys: 0.075 ± 0.01
0.56TrpAsp: 0.56 ± 0.03
0.613TrpGlu: 0.613 ± 0.025
0.572TrpPhe: 0.572 ± 0.029
0.51TrpGly: 0.51 ± 0.022
0.207TrpHis: 0.207 ± 0.013
0.713TrpIle: 0.713 ± 0.028
0.728TrpLys: 0.728 ± 0.034
1.004TrpLeu: 1.004 ± 0.032
0.304TrpMet: 0.304 ± 0.017
0.664TrpAsn: 0.664 ± 0.025
0.197TrpPro: 0.197 ± 0.014
0.346TrpGln: 0.346 ± 0.019
0.386TrpArg: 0.386 ± 0.019
0.631TrpSer: 0.631 ± 0.028
0.589TrpThr: 0.589 ± 0.039
0.589TrpVal: 0.589 ± 0.026
0.116TrpTrp: 0.116 ± 0.011
0.436TrpTyr: 0.436 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.398TyrAla: 2.398 ± 0.045
0.311TyrCys: 0.311 ± 0.017
2.508TyrAsp: 2.508 ± 0.053
2.319TyrGlu: 2.319 ± 0.052
2.453TyrPhe: 2.453 ± 0.053
2.627TyrGly: 2.627 ± 0.059
0.853TyrHis: 0.853 ± 0.03
2.948TyrIle: 2.948 ± 0.053
3.196TyrLys: 3.196 ± 0.061
3.806TyrLeu: 3.806 ± 0.067
0.804TyrMet: 0.804 ± 0.03
2.792TyrAsn: 2.792 ± 0.062
1.411TyrPro: 1.411 ± 0.036
1.397TyrGln: 1.397 ± 0.039
1.552TyrArg: 1.552 ± 0.047
2.681TyrSer: 2.681 ± 0.05
2.433TyrThr: 2.433 ± 0.055
2.207TyrVal: 2.207 ± 0.053
0.481TyrTrp: 0.481 ± 0.027
1.945TyrTyr: 1.945 ± 0.047
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3007 proteins (994994 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
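A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's code: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread is reported as the standard deviation of the resampled estimates. The function names, the fixed seed, and the choice of standard deviation as the error measure are assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(sequences, dipeptide, n_boot=100, seed=0):
    """Mean and standard deviation of a dipeptide frequency (per mille)
    over n_boot resamples of whole proteins drawn with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]  # count once, reuse
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not individual dipeptides.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


proteins = ["MKLLA", "LLKLL", "MAAAK"]  # hypothetical toy sequences
mean, err = bootstrap_permille(proteins, "LL")
print(f"LL: {mean:.3f} ± {err:.3f} per mille")
```

Resampling at the protein level keeps each protein's dipeptides together, so the error estimate reflects variation between proteins rather than treating every dipeptide as an independent observation.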

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski