Amino acid dipeptide frequency for Thermotoga maritima (strain ATCC 43589 / DSM 3109 / JCM 10099 / NBRC 100826 / MSB8)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
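As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name dipeptide_frequencies and the toy sequences are illustrative assumptions, not the database's actual pipeline. Mapping every non-standard letter to Xaa ("X") is likewise an assumption of this sketch.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard amino acids (one-letter codes)

def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Pairs are counted with a sliding window of width 2 inside each protein,
    so no pair spans two different proteins. Any non-standard letter is
    mapped to 'X' (Xaa); this mapping is an assumption of the sketch.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the proteome)
freqs = dipeptide_frequencies(["MKKLLE", "AKKB"])
```

Because the window is ordered, "KL" and "LK" are counted as different dipeptides, which matches the table's separate entries for, e.g., AlaCys and CysAla.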

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
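The arithmetic behind this expectation can be sketched as follows. The helper names are hypothetical, and the page does not state the exact rule used for colouring, so comparing each value against the uniform expectation for 400 dipeptides is an assumption of this example.

```python
def uniform_expectation_permille(alphabet_size):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"   # would correspond to red in the table
    if observed_permille < expected_permille:
        return "under"  # would correspond to blue in the table
    return "as expected"

expected_20 = uniform_expectation_permille(20)  # 400 dipeptides
expected_21 = uniform_expectation_permille(21)  # 441 dipeptides, Xaa included

# Observed values copied from the table (per mille)
labels = {
    "GluGlu": classify(10.127, expected_20),
    "CysCys": classify(0.063, expected_20),
}
```

With 20 amino acids the expectation is 2.5‰. Including Xaa lowers it slightly, to 1000/441 ‰.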

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
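The unit conversion is simple but easy to get wrong by a factor of ten, so here is a small sketch. The function names are hypothetical. The AlaAla value is copied from the table.

```python
def permille_to_fraction(value_permille):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Per mille -> percent (divide by 10)."""
    return value_permille / 10.0

ala_ala_permille = 3.674                                   # AlaAla entry from the table
ala_ala_fraction = permille_to_fraction(ala_ala_permille)  # about 0.003674
ala_ala_percent = permille_to_percent(ala_ala_permille)    # about 0.3674 %
```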

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.674AlaAla: 3.674 ± 0.1
0.474AlaCys: 0.474 ± 0.036
2.517AlaAsp: 2.517 ± 0.063
4.151AlaGlu: 4.151 ± 0.091
3.053AlaPhe: 3.053 ± 0.082
4.374AlaGly: 4.374 ± 0.101
0.937AlaHis: 0.937 ± 0.037
4.278AlaIle: 4.278 ± 0.087
4.01AlaLys: 4.01 ± 0.085
6.872AlaLeu: 6.872 ± 0.123
1.543AlaMet: 1.543 ± 0.056
1.699AlaAsn: 1.699 ± 0.059
1.772AlaPro: 1.772 ± 0.052
1.498AlaGln: 1.498 ± 0.054
3.569AlaArg: 3.569 ± 0.081
3.293AlaSer: 3.293 ± 0.08
2.521AlaThr: 2.521 ± 0.073
5.611AlaVal: 5.611 ± 0.104
0.54AlaTrp: 0.54 ± 0.032
2.105AlaTyr: 2.105 ± 0.06
0.0AlaXaa: 0.0 ± 0.0
Cys
0.414CysAla: 0.414 ± 0.027
0.063CysCys: 0.063 ± 0.01
0.424CysAsp: 0.424 ± 0.03
0.604CysGlu: 0.604 ± 0.039
0.304CysPhe: 0.304 ± 0.023
0.842CysGly: 0.842 ± 0.045
0.168CysHis: 0.168 ± 0.018
0.336CysIle: 0.336 ± 0.024
0.381CysLys: 0.381 ± 0.029
0.475CysLeu: 0.475 ± 0.031
0.137CysMet: 0.137 ± 0.016
0.189CysAsn: 0.189 ± 0.017
0.498CysPro: 0.498 ± 0.03
0.154CysGln: 0.154 ± 0.017
0.389CysArg: 0.389 ± 0.025
0.434CysSer: 0.434 ± 0.026
0.302CysThr: 0.302 ± 0.024
0.621CysVal: 0.621 ± 0.035
0.089CysTrp: 0.089 ± 0.011
0.259CysTyr: 0.259 ± 0.024
0.0CysXaa: 0.0 ± 0.0
Asp
2.637AspAla: 2.637 ± 0.072
0.317AspCys: 0.317 ± 0.025
2.119AspAsp: 2.119 ± 0.077
4.581AspGlu: 4.581 ± 0.095
3.113AspPhe: 3.113 ± 0.081
3.492AspGly: 3.492 ± 0.085
0.848AspHis: 0.848 ± 0.04
3.535AspIle: 3.535 ± 0.081
2.224AspLys: 2.224 ± 0.071
5.884AspLeu: 5.884 ± 0.107
1.093AspMet: 1.093 ± 0.049
1.144AspAsn: 1.144 ± 0.046
2.936AspPro: 2.936 ± 0.071
0.925AspGln: 0.925 ± 0.044
2.769AspArg: 2.769 ± 0.069
2.33AspSer: 2.33 ± 0.066
2.076AspThr: 2.076 ± 0.064
4.834AspVal: 4.834 ± 0.095
0.669AspTrp: 0.669 ± 0.036
2.135AspTyr: 2.135 ± 0.057
0.0AspXaa: 0.0 ± 0.0
Glu
4.947GluAla: 4.947 ± 0.101
0.434GluCys: 0.434 ± 0.03
4.598GluAsp: 4.598 ± 0.095
10.127GluGlu: 10.127 ± 0.174
3.653GluPhe: 3.653 ± 0.083
5.58GluGly: 5.58 ± 0.117
1.182GluHis: 1.182 ± 0.052
7.953GluIle: 7.953 ± 0.134
9.635GluLys: 9.635 ± 0.147
7.527GluLeu: 7.527 ± 0.13
2.414GluMet: 2.414 ± 0.067
4.139GluAsn: 4.139 ± 0.097
2.236GluPro: 2.236 ± 0.068
1.457GluGln: 1.457 ± 0.05
5.522GluArg: 5.522 ± 0.116
3.62GluSer: 3.62 ± 0.073
3.797GluThr: 3.797 ± 0.08
6.503GluVal: 6.503 ± 0.1
0.909GluTrp: 0.909 ± 0.044
3.011GluTyr: 3.011 ± 0.08
0.0GluXaa: 0.0 ± 0.0
Phe
2.831PheAla: 2.831 ± 0.086
0.422PheCys: 0.422 ± 0.026
2.742PheAsp: 2.742 ± 0.064
4.3PheGlu: 4.3 ± 0.072
3.111PhePhe: 3.111 ± 0.09
3.521PheGly: 3.521 ± 0.088
0.913PheHis: 0.913 ± 0.044
3.059PheIle: 3.059 ± 0.075
2.859PheLys: 2.859 ± 0.061
6.417PheLeu: 6.417 ± 0.139
1.084PheMet: 1.084 ± 0.045
1.637PheAsn: 1.637 ± 0.055
2.177PhePro: 2.177 ± 0.066
1.124PheGln: 1.124 ± 0.041
2.395PheArg: 2.395 ± 0.067
4.077PheSer: 4.077 ± 0.105
2.201PheThr: 2.201 ± 0.063
4.516PheVal: 4.516 ± 0.098
0.676PheTrp: 0.676 ± 0.035
1.886PheTyr: 1.886 ± 0.059
0.0PheXaa: 0.0 ± 0.0
Gly
4.154GlyAla: 4.154 ± 0.107
0.642GlyCys: 0.642 ± 0.039
3.329GlyAsp: 3.329 ± 0.084
5.539GlyGlu: 5.539 ± 0.086
3.713GlyPhe: 3.713 ± 0.08
4.777GlyGly: 4.777 ± 0.114
1.098GlyHis: 1.098 ± 0.047
5.565GlyIle: 5.565 ± 0.092
6.115GlyLys: 6.115 ± 0.11
5.764GlyLeu: 5.764 ± 0.113
1.829GlyMet: 1.829 ± 0.06
2.539GlyAsn: 2.539 ± 0.079
1.81GlyPro: 1.81 ± 0.061
1.297GlyGln: 1.297 ± 0.052
3.607GlyArg: 3.607 ± 0.088
3.511GlySer: 3.511 ± 0.078
3.634GlyThr: 3.634 ± 0.089
6.426GlyVal: 6.426 ± 0.112
0.83GlyTrp: 0.83 ± 0.042
2.902GlyTyr: 2.902 ± 0.064
0.0GlyXaa: 0.0 ± 0.0
His
0.867HisAla: 0.867 ± 0.033
0.156HisCys: 0.156 ± 0.017
0.719HisAsp: 0.719 ± 0.04
1.064HisGlu: 1.064 ± 0.04
0.858HisPhe: 0.858 ± 0.038
1.326HisGly: 1.326 ± 0.054
0.338HisHis: 0.338 ± 0.028
1.048HisIle: 1.048 ± 0.044
0.628HisLys: 0.628 ± 0.03
1.769HisLeu: 1.769 ± 0.06
0.35HisMet: 0.35 ± 0.022
0.491HisAsn: 0.491 ± 0.028
1.155HisPro: 1.155 ± 0.039
0.371HisGln: 0.371 ± 0.025
1.004HisArg: 1.004 ± 0.039
0.875HisSer: 0.875 ± 0.039
0.762HisThr: 0.762 ± 0.037
1.287HisVal: 1.287 ± 0.051
0.175HisTrp: 0.175 ± 0.017
0.609HisTyr: 0.609 ± 0.038
0.0HisXaa: 0.0 ± 0.0
Ile
5.039IleAla: 5.039 ± 0.096
0.491IleCys: 0.491 ± 0.029
3.96IleAsp: 3.96 ± 0.094
6.316IleGlu: 6.316 ± 0.119
3.874IlePhe: 3.874 ± 0.11
4.689IleGly: 4.689 ± 0.098
1.184IleHis: 1.184 ± 0.044
4.118IleIle: 4.118 ± 0.101
4.346IleLys: 4.346 ± 0.081
7.931IleLeu: 7.931 ± 0.148
1.457IleMet: 1.457 ± 0.048
2.275IleAsn: 2.275 ± 0.058
3.588IlePro: 3.588 ± 0.078
1.278IleGln: 1.278 ± 0.047
3.442IleArg: 3.442 ± 0.071
4.633IleSer: 4.633 ± 0.098
3.583IleThr: 3.583 ± 0.079
6.618IleVal: 6.618 ± 0.123
0.697IleTrp: 0.697 ± 0.036
2.328IleTyr: 2.328 ± 0.064
0.0IleXaa: 0.0 ± 0.0
Lys
4.477LysAla: 4.477 ± 0.102
0.407LysCys: 0.407 ± 0.031
3.945LysAsp: 3.945 ± 0.084
8.018LysGlu: 8.018 ± 0.119
2.57LysPhe: 2.57 ± 0.074
4.724LysGly: 4.724 ± 0.087
1.187LysHis: 1.187 ± 0.043
6.373LysIle: 6.373 ± 0.093
7.193LysLys: 7.193 ± 0.124
6.733LysLeu: 6.733 ± 0.102
1.923LysMet: 1.923 ± 0.055
3.478LysAsn: 3.478 ± 0.08
2.632LysPro: 2.632 ± 0.062
1.328LysGln: 1.328 ± 0.057
4.786LysArg: 4.786 ± 0.102
3.106LysSer: 3.106 ± 0.077
3.689LysThr: 3.689 ± 0.089
6.136LysVal: 6.136 ± 0.109
0.757LysTrp: 0.757 ± 0.038
2.713LysTyr: 2.713 ± 0.072
0.0LysXaa: 0.0 ± 0.0
Leu
5.882LeuAla: 5.882 ± 0.109
0.64LeuCys: 0.64 ± 0.027
4.861LeuAsp: 4.861 ± 0.09
9.2LeuGlu: 9.2 ± 0.136
5.005LeuPhe: 5.005 ± 0.113
6.694LeuGly: 6.694 ± 0.121
1.378LeuHis: 1.378 ± 0.05
6.369LeuIle: 6.369 ± 0.131
9.42LeuLys: 9.42 ± 0.136
9.748LeuLeu: 9.748 ± 0.164
2.469LeuMet: 2.469 ± 0.07
3.869LeuAsn: 3.869 ± 0.085
4.175LeuPro: 4.175 ± 0.09
1.961LeuGln: 1.961 ± 0.057
5.561LeuArg: 5.561 ± 0.114
7.308LeuSer: 7.308 ± 0.139
4.24LeuThr: 4.24 ± 0.095
7.702LeuVal: 7.702 ± 0.139
0.997LeuTrp: 0.997 ± 0.044
2.907LeuTyr: 2.907 ± 0.074
0.0LeuXaa: 0.0 ± 0.0
Met
1.453MetAla: 1.453 ± 0.051
0.182MetCys: 0.182 ± 0.02
1.156MetAsp: 1.156 ± 0.046
2.052MetGlu: 2.052 ± 0.065
1.11MetPhe: 1.11 ± 0.044
1.795MetGly: 1.795 ± 0.058
0.244MetHis: 0.244 ± 0.021
1.946MetIle: 1.946 ± 0.063
2.829MetLys: 2.829 ± 0.073
1.714MetLeu: 1.714 ± 0.058
0.697MetMet: 0.697 ± 0.039
1.163MetAsn: 1.163 ± 0.038
0.8MetPro: 0.8 ± 0.038
0.293MetGln: 0.293 ± 0.023
1.585MetArg: 1.585 ± 0.051
1.241MetSer: 1.241 ± 0.052
0.968MetThr: 0.968 ± 0.042
1.881MetVal: 1.881 ± 0.052
0.211MetTrp: 0.211 ± 0.017
0.705MetTyr: 0.705 ± 0.036
0.0MetXaa: 0.0 ± 0.0
Asn
2.129AsnAla: 2.129 ± 0.062
0.251AsnCys: 0.251 ± 0.022
1.582AsnAsp: 1.582 ± 0.058
2.5AsnGlu: 2.5 ± 0.063
2.038AsnPhe: 2.038 ± 0.061
2.773AsnGly: 2.773 ± 0.085
0.604AsnHis: 0.604 ± 0.031
2.6AsnIle: 2.6 ± 0.083
1.606AsnLys: 1.606 ± 0.049
4.266AsnLeu: 4.266 ± 0.1
0.841AsnMet: 0.841 ± 0.036
0.98AsnAsn: 0.98 ± 0.048
2.294AsnPro: 2.294 ± 0.066
0.781AsnGln: 0.781 ± 0.034
1.815AsnArg: 1.815 ± 0.059
1.891AsnSer: 1.891 ± 0.064
1.817AsnThr: 1.817 ± 0.048
3.499AsnVal: 3.499 ± 0.073
0.501AsnTrp: 0.501 ± 0.038
1.33AsnTyr: 1.33 ± 0.05
0.0AsnXaa: 0.0 ± 0.0
Pro
2.195ProAla: 2.195 ± 0.059
0.273ProCys: 0.273 ± 0.02
2.644ProAsp: 2.644 ± 0.066
4.346ProGlu: 4.346 ± 0.101
2.241ProPhe: 2.241 ± 0.065
2.867ProGly: 2.867 ± 0.068
0.782ProHis: 0.782 ± 0.04
2.225ProIle: 2.225 ± 0.063
2.227ProLys: 2.227 ± 0.062
3.471ProLeu: 3.471 ± 0.077
0.726ProMet: 0.726 ± 0.038
1.392ProAsn: 1.392 ± 0.054
1.531ProPro: 1.531 ± 0.052
0.942ProGln: 0.942 ± 0.036
1.841ProArg: 1.841 ± 0.061
2.347ProSer: 2.347 ± 0.064
1.769ProThr: 1.769 ± 0.056
3.982ProVal: 3.982 ± 0.1
0.511ProTrp: 0.511 ± 0.034
1.62ProTyr: 1.62 ± 0.056
0.0ProXaa: 0.0 ± 0.0
Gln
1.271GlnAla: 1.271 ± 0.047
0.101GlnCys: 0.101 ± 0.015
0.815GlnAsp: 0.815 ± 0.039
1.81GlnGlu: 1.81 ± 0.064
0.842GlnPhe: 0.842 ± 0.041
1.196GlnGly: 1.196 ± 0.043
0.311GlnHis: 0.311 ± 0.021
1.838GlnIle: 1.838 ± 0.06
1.966GlnLys: 1.966 ± 0.068
1.798GlnLeu: 1.798 ± 0.056
0.606GlnMet: 0.606 ± 0.03
0.957GlnAsn: 0.957 ± 0.043
0.705GlnPro: 0.705 ± 0.031
0.522GlnGln: 0.522 ± 0.033
1.299GlnArg: 1.299 ± 0.05
0.837GlnSer: 0.837 ± 0.041
0.964GlnThr: 0.964 ± 0.044
1.477GlnVal: 1.477 ± 0.048
0.202GlnTrp: 0.202 ± 0.017
0.613GlnTyr: 0.613 ± 0.036
0.0GlnXaa: 0.0 ± 0.0
Arg
2.936ArgAla: 2.936 ± 0.077
0.434ArgCys: 0.434 ± 0.034
2.481ArgAsp: 2.481 ± 0.063
5.396ArgGlu: 5.396 ± 0.099
2.926ArgPhe: 2.926 ± 0.069
3.306ArgGly: 3.306 ± 0.075
0.714ArgHis: 0.714 ± 0.033
4.787ArgIle: 4.787 ± 0.093
5.184ArgLys: 5.184 ± 0.097
4.727ArgLeu: 4.727 ± 0.087
1.642ArgMet: 1.642 ± 0.056
2.256ArgAsn: 2.256 ± 0.061
1.606ArgPro: 1.606 ± 0.055
0.896ArgGln: 0.896 ± 0.041
3.387ArgArg: 3.387 ± 0.093
2.625ArgSer: 2.625 ± 0.071
2.292ArgThr: 2.292 ± 0.07
4.715ArgVal: 4.715 ± 0.083
0.619ArgTrp: 0.619 ± 0.035
2.426ArgTyr: 2.426 ± 0.075
0.0ArgXaa: 0.0 ± 0.0
Ser
3.246SerAla: 3.246 ± 0.084
0.376SerCys: 0.376 ± 0.029
2.63SerAsp: 2.63 ± 0.078
4.405SerGlu: 4.405 ± 0.095
3.674SerPhe: 3.674 ± 0.101
4.526SerGly: 4.526 ± 0.099
0.911SerHis: 0.911 ± 0.041
3.72SerIle: 3.72 ± 0.095
3.514SerLys: 3.514 ± 0.081
6.453SerLeu: 6.453 ± 0.12
1.352SerMet: 1.352 ± 0.048
1.688SerAsn: 1.688 ± 0.058
2.363SerPro: 2.363 ± 0.064
1.493SerGln: 1.493 ± 0.059
2.879SerArg: 2.879 ± 0.068
3.722SerSer: 3.722 ± 0.099
2.529SerThr: 2.529 ± 0.07
4.461SerVal: 4.461 ± 0.1
0.642SerTrp: 0.642 ± 0.035
1.925SerTyr: 1.925 ± 0.055
0.002SerXaa: 0.002 ± 0.002
Thr
2.944ThrAla: 2.944 ± 0.073
0.374ThrCys: 0.374 ± 0.028
2.037ThrAsp: 2.037 ± 0.065
2.974ThrGlu: 2.974 ± 0.075
2.351ThrPhe: 2.351 ± 0.07
4.012ThrGly: 4.012 ± 0.077
0.8ThrHis: 0.8 ± 0.036
3.416ThrIle: 3.416 ± 0.078
2.582ThrLys: 2.582 ± 0.067
4.883ThrLeu: 4.883 ± 0.097
0.933ThrMet: 0.933 ± 0.041
1.414ThrAsn: 1.414 ± 0.059
2.371ThrPro: 2.371 ± 0.061
0.853ThrGln: 0.853 ± 0.04
2.2ThrArg: 2.2 ± 0.06
2.515ThrSer: 2.515 ± 0.056
2.406ThrThr: 2.406 ± 0.072
4.149ThrVal: 4.149 ± 0.088
0.468ThrTrp: 0.468 ± 0.029
1.575ThrTyr: 1.575 ± 0.057
0.0ThrXaa: 0.0 ± 0.0
Val
4.873ValAla: 4.873 ± 0.102
0.691ValCys: 0.691 ± 0.032
4.557ValAsp: 4.557 ± 0.095
8.044ValGlu: 8.044 ± 0.139
5.051ValPhe: 5.051 ± 0.124
5.173ValGly: 5.173 ± 0.101
1.342ValHis: 1.342 ± 0.049
5.743ValIle: 5.743 ± 0.09
6.52ValLys: 6.52 ± 0.116
9.185ValLeu: 9.185 ± 0.14
1.894ValMet: 1.894 ± 0.059
2.867ValAsn: 2.867 ± 0.068
3.435ValPro: 3.435 ± 0.079
1.779ValGln: 1.779 ± 0.062
4.109ValArg: 4.109 ± 0.085
5.431ValSer: 5.431 ± 0.1
3.59ValThr: 3.59 ± 0.093
8.162ValVal: 8.162 ± 0.12
0.889ValTrp: 0.889 ± 0.043
2.763ValTyr: 2.763 ± 0.08
0.0ValXaa: 0.0 ± 0.0
Trp
0.571TrpAla: 0.571 ± 0.033
0.098TrpCys: 0.098 ± 0.014
0.619TrpAsp: 0.619 ± 0.031
0.837TrpGlu: 0.837 ± 0.041
0.582TrpPhe: 0.582 ± 0.035
0.702TrpGly: 0.702 ± 0.039
0.189TrpHis: 0.189 ± 0.019
0.822TrpIle: 0.822 ± 0.041
1.083TrpLys: 1.083 ± 0.049
0.921TrpLeu: 0.921 ± 0.044
0.374TrpMet: 0.374 ± 0.025
0.614TrpAsn: 0.614 ± 0.039
0.324TrpPro: 0.324 ± 0.026
0.288TrpGln: 0.288 ± 0.028
0.631TrpArg: 0.631 ± 0.037
0.568TrpSer: 0.568 ± 0.033
0.415TrpThr: 0.415 ± 0.027
0.666TrpVal: 0.666 ± 0.037
0.239TrpTrp: 0.239 ± 0.02
0.477TrpTyr: 0.477 ± 0.033
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.947TyrAla: 1.947 ± 0.06
0.259TyrCys: 0.259 ± 0.022
1.923TyrAsp: 1.923 ± 0.057
2.98TyrGlu: 2.98 ± 0.078
1.881TyrPhe: 1.881 ± 0.064
2.711TyrGly: 2.711 ± 0.065
0.722TyrHis: 0.722 ± 0.035
2.117TyrIle: 2.117 ± 0.06
1.851TyrLys: 1.851 ± 0.067
3.847TyrLeu: 3.847 ± 0.08
0.659TyrMet: 0.659 ± 0.036
1.304TyrAsn: 1.304 ± 0.052
1.525TyrPro: 1.525 ± 0.054
0.892TyrGln: 0.892 ± 0.041
2.563TyrArg: 2.563 ± 0.074
2.148TyrSer: 2.148 ± 0.057
1.702TyrThr: 1.702 ± 0.06
2.853TyrVal: 2.853 ± 0.065
0.4TyrTrp: 0.4 ± 0.034
1.51TyrTyr: 1.51 ± 0.055
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.002XaaArg: 0.002 ± 0.002
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1852 proteins (582801 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
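A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are all assumptions. The page does not specify how the ± values were derived beyond the note above.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Std dev of the per-mille frequency of `pair` over protein-level resamples."""
    rng = random.Random(seed)
    # Precompute (hits, total pairs) per protein so each resample is cheap.
    per_protein = [(pair_counts(s)[pair], max(len(s) - 1, 0)) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins (not individual residues) with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    return statistics.stdev(estimates) if len(estimates) > 1 else 0.0

# Toy input (hypothetical sequences, not taken from the proteome)
toy = ["MKKLLE", "AKKA", "GGGG"]
err = bootstrap_error(toy, "KK")
```

Resampling at the protein level keeps each protein's internal composition intact, so the error reflects variation between proteins rather than between individual residues.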

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski