Amino acid dipeptide frequency for Neochlamydia sp. TUME1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
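As an illustration of how such a frequency can be computed, here is a minimal Python sketch. It assumes one-letter sequence codes and counts overlapping residue pairs within each protein (pairs never span two proteins); the function name `dipeptide_permille` and the toy sequences are hypothetical, and the exact counting procedure used for this table is not stated on this page.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (made-up sequences, not taken from the proteome)
toy = ["MKLLA", "ALLK"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The frequencies of all observed pairs sum to 1000‰, which is a convenient sanity check for any implementation.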

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
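The arithmetic behind the expected value can be sketched in a few lines of Python. The sketch assumes that "expected" means the uniform value of 1/400 and that a cell is classified simply by comparing it with that value; the precise colouring rule is not given here, and the helper `label` is hypothetical. The three example values are taken from the table below.

```python
STANDARD_AA = "ACDEFGHIKLMNPQRSTVWY"

# 20 x 20 ordered pairs; with Xaa as a 21st symbol it would be 21 x 21 = 441.
n_pairs = len(STANDARD_AA) ** 2
expected_permille = 1000.0 / n_pairs  # 2.5 per mille, i.e. 0.25%

def label(observed_permille, expected=expected_permille):
    """Classify an observed frequency against the uniform expectation."""
    if observed_permille > expected:
        return "over"   # assumed to correspond to red
    if observed_permille < expected:
        return "under"  # assumed to correspond to blue
    return "expected"

examples = {"LeuLeu": 12.044, "AlaAla": 5.675, "TrpCys": 0.132}
labels = {name: label(value) for name, value in examples.items()}
print(n_pairs, expected_permille, labels)
```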

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 5.675‰ = 0.005675).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.675 ± 0.116
AlaCys: 1.141 ± 0.044
AlaAsp: 2.676 ± 0.063
AlaGlu: 5.579 ± 0.113
AlaPhe: 3.356 ± 0.079
AlaGly: 3.853 ± 0.091
AlaHis: 1.63 ± 0.055
AlaIle: 5.801 ± 0.121
AlaLys: 5.088 ± 0.092
AlaLeu: 9.635 ± 0.17
AlaMet: 1.599 ± 0.051
AlaAsn: 2.743 ± 0.059
AlaPro: 2.204 ± 0.059
AlaGln: 3.072 ± 0.077
AlaArg: 3.069 ± 0.069
AlaSer: 4.778 ± 0.103
AlaThr: 3.305 ± 0.07
AlaVal: 3.773 ± 0.081
AlaTrp: 0.74 ± 0.034
AlaTyr: 2.463 ± 0.072
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.709 ± 0.032
CysCys: 0.231 ± 0.016
CysAsp: 0.42 ± 0.027
CysGlu: 0.648 ± 0.031
CysPhe: 0.632 ± 0.029
CysGly: 0.74 ± 0.034
CysHis: 0.372 ± 0.025
CysIle: 0.87 ± 0.035
CysLys: 0.958 ± 0.039
CysLeu: 1.67 ± 0.047
CysMet: 0.259 ± 0.02
CysAsn: 0.417 ± 0.024
CysPro: 0.602 ± 0.034
CysGln: 0.682 ± 0.032
CysArg: 0.608 ± 0.033
CysSer: 0.804 ± 0.038
CysThr: 0.526 ± 0.026
CysVal: 0.58 ± 0.028
CysTrp: 0.157 ± 0.015
CysTyr: 0.565 ± 0.035
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.365 ± 0.066
AspCys: 0.49 ± 0.03
AspAsp: 1.433 ± 0.053
AspGlu: 2.842 ± 0.075
AspPhe: 2.118 ± 0.058
AspGly: 1.941 ± 0.06
AspHis: 1.06 ± 0.042
AspIle: 3.046 ± 0.063
AspLys: 3.479 ± 0.084
AspLeu: 5.295 ± 0.093
AspMet: 0.82 ± 0.034
AspAsn: 1.87 ± 0.054
AspPro: 1.881 ± 0.061
AspGln: 1.775 ± 0.056
AspArg: 1.833 ± 0.045
AspSer: 2.385 ± 0.062
AspThr: 1.702 ± 0.054
AspVal: 1.998 ± 0.054
AspTrp: 0.574 ± 0.027
AspTyr: 1.893 ± 0.059
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.225 ± 0.096
GluCys: 0.632 ± 0.033
GluAsp: 2.888 ± 0.07
GluGlu: 6.42 ± 0.132
GluPhe: 2.253 ± 0.057
GluGly: 3.896 ± 0.103
GluHis: 1.397 ± 0.05
GluIle: 6.985 ± 0.165
GluLys: 6.877 ± 0.12
GluLeu: 7.057 ± 0.125
GluMet: 1.575 ± 0.053
GluAsn: 3.84 ± 0.081
GluPro: 1.637 ± 0.052
GluGln: 3.122 ± 0.082
GluArg: 3.054 ± 0.083
GluSer: 3.326 ± 0.079
GluThr: 2.773 ± 0.068
GluVal: 3.754 ± 0.083
GluTrp: 0.813 ± 0.036
GluTyr: 2.228 ± 0.079
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.762 ± 0.065
PheCys: 0.674 ± 0.027
PheAsp: 1.924 ± 0.055
PheGlu: 2.516 ± 0.062
PhePhe: 2.687 ± 0.078
PheGly: 2.514 ± 0.102
PheHis: 0.99 ± 0.039
PheIle: 3.271 ± 0.073
PheLys: 3.197 ± 0.077
PheLeu: 5.14 ± 0.113
PheMet: 0.955 ± 0.036
PheAsn: 2.006 ± 0.064
PhePro: 1.864 ± 0.057
PheGln: 1.692 ± 0.051
PheArg: 1.693 ± 0.047
PheSer: 3.773 ± 0.085
PheThr: 2.257 ± 0.062
PheVal: 1.947 ± 0.058
PheTrp: 0.46 ± 0.03
PheTyr: 1.807 ± 0.056
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.495 ± 0.099
GlyCys: 0.816 ± 0.032
GlyAsp: 2.161 ± 0.07
GlyGlu: 3.671 ± 0.107
GlyPhe: 2.493 ± 0.07
GlyGly: 3.175 ± 0.094
GlyHis: 1.423 ± 0.047
GlyIle: 4.164 ± 0.097
GlyLys: 4.51 ± 0.086
GlyLeu: 5.849 ± 0.104
GlyMet: 1.457 ± 0.054
GlyAsn: 2.659 ± 0.105
GlyPro: 1.48 ± 0.053
GlyGln: 3.581 ± 0.136
GlyArg: 2.531 ± 0.075
GlySer: 3.08 ± 0.068
GlyThr: 2.475 ± 0.07
GlyVal: 3.008 ± 0.072
GlyTrp: 0.749 ± 0.036
GlyTyr: 1.975 ± 0.057
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.757 ± 0.057
HisCys: 0.348 ± 0.023
HisAsp: 0.856 ± 0.038
HisGlu: 1.334 ± 0.046
HisPhe: 1.375 ± 0.041
HisGly: 1.156 ± 0.048
HisHis: 0.971 ± 0.035
HisIle: 1.48 ± 0.047
HisLys: 1.282 ± 0.045
HisLeu: 4.1 ± 0.094
HisMet: 0.416 ± 0.025
HisAsn: 0.94 ± 0.041
HisPro: 1.839 ± 0.073
HisGln: 1.359 ± 0.05
HisArg: 1.124 ± 0.039
HisSer: 1.545 ± 0.051
HisThr: 1.174 ± 0.047
HisVal: 1.172 ± 0.041
HisTrp: 0.3 ± 0.021
HisTyr: 1.129 ± 0.043
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.335 ± 0.103
IleCys: 1.061 ± 0.047
IleAsp: 3.683 ± 0.082
IleGlu: 5.2 ± 0.092
IlePhe: 3.381 ± 0.079
IleGly: 5.473 ± 0.17
IleHis: 1.867 ± 0.059
IleIle: 4.651 ± 0.08
IleLys: 5.362 ± 0.095
IleLeu: 7.12 ± 0.109
IleMet: 1.175 ± 0.039
IleAsn: 3.69 ± 0.082
IlePro: 3.279 ± 0.076
IleGln: 2.887 ± 0.063
IleArg: 3.008 ± 0.073
IleSer: 4.756 ± 0.093
IleThr: 3.22 ± 0.067
IleVal: 3.489 ± 0.073
IleTrp: 0.682 ± 0.031
IleTyr: 3.005 ± 0.115
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.159 ± 0.164
LysCys: 0.651 ± 0.035
LysAsp: 3.416 ± 0.081
LysGlu: 6.281 ± 0.119
LysPhe: 2.163 ± 0.06
LysGly: 4.131 ± 0.08
LysHis: 1.662 ± 0.052
LysIle: 6.02 ± 0.103
LysLys: 6.72 ± 0.127
LysLeu: 7.552 ± 0.135
LysMet: 1.736 ± 0.049
LysAsn: 4.151 ± 0.096
LysPro: 2.284 ± 0.055
LysGln: 3.434 ± 0.079
LysArg: 3.425 ± 0.078
LysSer: 3.945 ± 0.088
LysThr: 3.414 ± 0.075
LysVal: 3.964 ± 0.087
LysTrp: 0.739 ± 0.035
LysTyr: 2.098 ± 0.057
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.455 ± 0.16
LeuCys: 1.307 ± 0.045
LeuAsp: 5.356 ± 0.094
LeuGlu: 8.375 ± 0.125
LeuPhe: 5.288 ± 0.106
LeuGly: 6.62 ± 0.142
LeuHis: 2.665 ± 0.083
LeuIle: 7.601 ± 0.113
LeuLys: 9.647 ± 0.153
LeuLeu: 12.044 ± 0.205
LeuMet: 2.552 ± 0.067
LeuAsn: 6.327 ± 0.121
LeuPro: 7.085 ± 0.202
LeuGln: 5.976 ± 0.158
LeuArg: 4.728 ± 0.093
LeuSer: 10.143 ± 0.24
LeuThr: 7.059 ± 0.182
LeuVal: 5.058 ± 0.09
LeuTrp: 1.186 ± 0.051
LeuTyr: 3.958 ± 0.093
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.744 ± 0.052
MetCys: 0.195 ± 0.018
MetAsp: 0.906 ± 0.039
MetGlu: 1.273 ± 0.038
MetPhe: 0.647 ± 0.032
MetGly: 1.245 ± 0.045
MetHis: 0.696 ± 0.036
MetIle: 1.497 ± 0.053
MetLys: 1.463 ± 0.05
MetLeu: 2.386 ± 0.061
MetMet: 0.551 ± 0.026
MetAsn: 0.974 ± 0.035
MetPro: 1.052 ± 0.039
MetGln: 1.183 ± 0.049
MetArg: 1.107 ± 0.039
MetSer: 1.269 ± 0.046
MetThr: 1.094 ± 0.038
MetVal: 1.158 ± 0.043
MetTrp: 0.173 ± 0.016
MetTyr: 0.44 ± 0.023
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.625 ± 0.066
AsnCys: 0.539 ± 0.029
AsnAsp: 1.725 ± 0.055
AsnGlu: 2.77 ± 0.069
AsnPhe: 2.305 ± 0.061
AsnGly: 1.898 ± 0.068
AsnHis: 1.578 ± 0.077
AsnIle: 3.236 ± 0.071
AsnLys: 3.184 ± 0.085
AsnLeu: 6.475 ± 0.217
AsnMet: 0.838 ± 0.035
AsnAsn: 2.804 ± 0.131
AsnPro: 2.46 ± 0.06
AsnGln: 3.946 ± 0.206
AsnArg: 2.18 ± 0.056
AsnSer: 2.607 ± 0.068
AsnThr: 1.77 ± 0.059
AsnVal: 2.032 ± 0.058
AsnTrp: 0.477 ± 0.028
AsnTyr: 1.825 ± 0.055
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.641 ± 0.115
ProCys: 0.549 ± 0.03
ProAsp: 1.528 ± 0.053
ProGlu: 2.927 ± 0.069
ProPhe: 1.924 ± 0.06
ProGly: 1.998 ± 0.055
ProHis: 1.27 ± 0.046
ProIle: 2.786 ± 0.072
ProLys: 2.194 ± 0.061
ProLeu: 5.319 ± 0.112
ProMet: 0.768 ± 0.036
ProAsn: 1.492 ± 0.047
ProPro: 1.819 ± 0.07
ProGln: 1.91 ± 0.062
ProArg: 1.387 ± 0.044
ProSer: 3.6 ± 0.082
ProThr: 2.435 ± 0.071
ProVal: 1.982 ± 0.056
ProTrp: 0.478 ± 0.028
ProTyr: 1.55 ± 0.054
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.382 ± 0.099
GlnCys: 0.326 ± 0.021
GlnAsp: 1.742 ± 0.058
GlnGlu: 4.217 ± 0.094
GlnPhe: 1.445 ± 0.045
GlnGly: 3.052 ± 0.097
GlnHis: 1.057 ± 0.039
GlnIle: 3.274 ± 0.083
GlnLys: 3.308 ± 0.066
GlnLeu: 9.045 ± 0.452
GlnMet: 0.952 ± 0.036
GlnAsn: 2.545 ± 0.114
GlnPro: 1.431 ± 0.052
GlnGln: 2.442 ± 0.076
GlnArg: 2.047 ± 0.059
GlnSer: 2.283 ± 0.061
GlnThr: 2.136 ± 0.058
GlnVal: 2.545 ± 0.061
GlnTrp: 0.715 ± 0.035
GlnTyr: 1.226 ± 0.043
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.582 ± 0.062
ArgCys: 0.52 ± 0.029
ArgAsp: 1.773 ± 0.054
ArgGlu: 3.212 ± 0.074
ArgPhe: 1.893 ± 0.061
ArgGly: 2.219 ± 0.057
ArgHis: 1.272 ± 0.047
ArgIle: 2.996 ± 0.071
ArgLys: 3.092 ± 0.07
ArgLeu: 5.912 ± 0.121
ArgMet: 1.061 ± 0.04
ArgAsn: 1.757 ± 0.053
ArgPro: 1.437 ± 0.046
ArgGln: 2.327 ± 0.06
ArgArg: 2.154 ± 0.061
ArgSer: 2.331 ± 0.066
ArgThr: 1.756 ± 0.056
ArgVal: 2.167 ± 0.062
ArgTrp: 0.7 ± 0.031
ArgTyr: 1.616 ± 0.05
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.887 ± 0.084
SerCys: 0.974 ± 0.038
SerAsp: 2.303 ± 0.063
SerGlu: 3.525 ± 0.087
SerPhe: 3.377 ± 0.082
SerGly: 2.953 ± 0.08
SerHis: 1.796 ± 0.056
SerIle: 4.546 ± 0.089
SerLys: 4.333 ± 0.094
SerLeu: 9.552 ± 0.188
SerMet: 1.48 ± 0.047
SerAsn: 2.759 ± 0.079
SerPro: 3.157 ± 0.08
SerGln: 4.288 ± 0.136
SerArg: 2.614 ± 0.068
SerSer: 5.88 ± 0.146
SerThr: 3.172 ± 0.076
SerVal: 2.851 ± 0.069
SerTrp: 0.653 ± 0.027
SerTyr: 2.586 ± 0.068
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.461 ± 0.082
ThrCys: 0.571 ± 0.027
ThrAsp: 1.75 ± 0.052
ThrGlu: 2.648 ± 0.071
ThrPhe: 2.503 ± 0.065
ThrGly: 2.5 ± 0.069
ThrHis: 1.362 ± 0.047
ThrIle: 3.344 ± 0.076
ThrLys: 2.475 ± 0.061
ThrLeu: 6.608 ± 0.115
ThrMet: 0.777 ± 0.035
ThrAsn: 1.742 ± 0.056
ThrPro: 2.431 ± 0.065
ThrGln: 1.961 ± 0.063
ThrArg: 1.853 ± 0.058
ThrSer: 4.023 ± 0.102
ThrThr: 2.478 ± 0.079
ThrVal: 2.324 ± 0.068
ThrTrp: 0.449 ± 0.027
ThrTyr: 1.708 ± 0.057
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.927 ± 0.102
ValCys: 0.786 ± 0.032
ValAsp: 2.403 ± 0.066
ValGlu: 3.65 ± 0.079
ValPhe: 2.123 ± 0.059
ValGly: 2.996 ± 0.063
ValHis: 1.162 ± 0.043
ValIle: 3.704 ± 0.081
ValLys: 3.467 ± 0.083
ValLeu: 5.031 ± 0.096
ValMet: 1.087 ± 0.038
ValAsn: 2.328 ± 0.071
ValPro: 1.899 ± 0.062
ValGln: 1.936 ± 0.051
ValArg: 2.09 ± 0.068
ValSer: 3.076 ± 0.077
ValThr: 2.244 ± 0.062
ValVal: 2.836 ± 0.073
ValTrp: 0.46 ± 0.03
ValTyr: 1.52 ± 0.053
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.58 ± 0.033
TrpCys: 0.132 ± 0.013
TrpAsp: 0.451 ± 0.034
TrpGlu: 0.681 ± 0.032
TrpPhe: 0.403 ± 0.023
TrpGly: 0.595 ± 0.035
TrpHis: 0.358 ± 0.023
TrpIle: 0.944 ± 0.04
TrpLys: 1.052 ± 0.045
TrpLeu: 1.464 ± 0.056
TrpMet: 0.373 ± 0.026
TrpAsn: 0.494 ± 0.029
TrpPro: 0.299 ± 0.025
TrpGln: 0.638 ± 0.03
TrpArg: 0.598 ± 0.031
TrpSer: 0.514 ± 0.027
TrpThr: 0.369 ± 0.024
TrpVal: 0.583 ± 0.03
TrpTrp: 0.192 ± 0.019
TrpTyr: 0.34 ± 0.026
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.414 ± 0.068
TyrCys: 0.509 ± 0.03
TyrAsp: 1.406 ± 0.067
TyrGlu: 1.982 ± 0.054
TyrPhe: 1.766 ± 0.065
TyrGly: 1.796 ± 0.067
TyrHis: 1.072 ± 0.049
TyrIle: 2.105 ± 0.058
TyrLys: 2.26 ± 0.061
TyrLeu: 5.175 ± 0.11
TyrMet: 0.619 ± 0.033
TyrAsn: 1.735 ± 0.078
TyrPro: 1.415 ± 0.057
TyrGln: 1.958 ± 0.07
TyrArg: 1.674 ± 0.056
TyrSer: 2.568 ± 0.075
TyrThr: 1.611 ± 0.053
TyrVal: 1.504 ± 0.049
TyrTrp: 0.364 ± 0.023
TyrTyr: 1.365 ± 0.054
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2344 proteins (675547 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
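A protein-level bootstrap of this kind could look roughly like the Python sketch below. It resamples whole proteins with replacement, recomputes the per mille frequency of one dipeptide in each replicate, and reports the mean and the standard deviation across replicates. Treating the reported error as a standard deviation is an assumption, as are the function names, the one-letter sequence codes, and the toy sequences; only the number of replicates (100) and the resampling unit (proteins) come from the note above.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping residue pairs in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input (made-up sequences, not taken from the proteome)
toy = ["MKLLA", "ALLK", "LLLL", "MKTAY"]
mean, err = bootstrap_error(toy, "LL")
print(f"LL: {mean:.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual residue pairs keeps the pairs of one protein together, so the error reflects variation between proteins.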

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski