Amino acid dipepetide frequency for Lactobacillus thailandensis DSM 22698 = JCM 13996

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.968AlaAla: 12.968 ± 0.25
0.469AlaCys: 0.469 ± 0.027
6.799AlaAsp: 6.799 ± 0.14
4.378AlaGlu: 4.378 ± 0.104
3.4AlaPhe: 3.4 ± 0.088
7.948AlaGly: 7.948 ± 0.172
2.578AlaHis: 2.578 ± 0.065
6.033AlaIle: 6.033 ± 0.122
4.916AlaLys: 4.916 ± 0.132
9.713AlaLeu: 9.713 ± 0.162
2.736AlaMet: 2.736 ± 0.072
3.9AlaAsn: 3.9 ± 0.086
3.487AlaPro: 3.487 ± 0.088
5.076AlaGln: 5.076 ± 0.12
4.865AlaArg: 4.865 ± 0.109
5.054AlaSer: 5.054 ± 0.108
7.78AlaThr: 7.78 ± 0.174
8.289AlaVal: 8.289 ± 0.153
1.021AlaTrp: 1.021 ± 0.046
2.81AlaTyr: 2.81 ± 0.067
0.002AlaXaa: 0.002 ± 0.002
Cys
0.42CysAla: 0.42 ± 0.029
0.058CysCys: 0.058 ± 0.01
0.212CysAsp: 0.212 ± 0.021
0.12CysGlu: 0.12 ± 0.015
0.169CysPhe: 0.169 ± 0.017
0.48CysGly: 0.48 ± 0.031
0.108CysHis: 0.108 ± 0.015
0.227CysIle: 0.227 ± 0.019
0.122CysLys: 0.122 ± 0.015
0.44CysLeu: 0.44 ± 0.031
0.098CysMet: 0.098 ± 0.013
0.144CysAsn: 0.144 ± 0.014
0.189CysPro: 0.189 ± 0.021
0.148CysGln: 0.148 ± 0.016
0.177CysArg: 0.177 ± 0.017
0.189CysSer: 0.189 ± 0.019
0.279CysThr: 0.279 ± 0.023
0.359CysVal: 0.359 ± 0.027
0.074CysTrp: 0.074 ± 0.012
0.15CysTyr: 0.15 ± 0.019
0.0CysXaa: 0.0 ± 0.0
Asp
6.497AspAla: 6.497 ± 0.141
0.212AspCys: 0.212 ± 0.02
4.265AspAsp: 4.265 ± 0.127
3.494AspGlu: 3.494 ± 0.091
2.652AspPhe: 2.652 ± 0.078
4.566AspGly: 4.566 ± 0.143
1.771AspHis: 1.771 ± 0.06
3.36AspIle: 3.36 ± 0.1
2.504AspLys: 2.504 ± 0.073
5.193AspLeu: 5.193 ± 0.108
1.659AspMet: 1.659 ± 0.057
2.304AspAsn: 2.304 ± 0.06
2.566AspPro: 2.566 ± 0.078
2.927AspGln: 2.927 ± 0.076
2.942AspArg: 2.942 ± 0.073
2.722AspSer: 2.722 ± 0.083
3.508AspThr: 3.508 ± 0.088
4.794AspVal: 4.794 ± 0.11
0.736AspTrp: 0.736 ± 0.042
2.44AspTyr: 2.44 ± 0.088
0.0AspXaa: 0.0 ± 0.0
Glu
4.196GluAla: 4.196 ± 0.098
0.16GluCys: 0.16 ± 0.018
2.473GluAsp: 2.473 ± 0.071
2.181GluGlu: 2.181 ± 0.072
1.926GluPhe: 1.926 ± 0.065
2.43GluGly: 2.43 ± 0.072
1.396GluHis: 1.396 ± 0.049
2.616GluIle: 2.616 ± 0.079
1.857GluLys: 1.857 ± 0.063
4.87GluLeu: 4.87 ± 0.101
1.364GluMet: 1.364 ± 0.055
1.57GluAsn: 1.57 ± 0.056
1.713GluPro: 1.713 ± 0.061
2.622GluGln: 2.622 ± 0.073
2.781GluArg: 2.781 ± 0.073
2.043GluSer: 2.043 ± 0.07
2.435GluThr: 2.435 ± 0.068
3.125GluVal: 3.125 ± 0.086
0.495GluTrp: 0.495 ± 0.032
1.618GluTyr: 1.618 ± 0.061
0.0GluXaa: 0.0 ± 0.0
Phe
3.99PheAla: 3.99 ± 0.081
0.215PheCys: 0.215 ± 0.019
2.727PheAsp: 2.727 ± 0.073
1.534PheGlu: 1.534 ± 0.052
1.532PhePhe: 1.532 ± 0.064
3.29PheGly: 3.29 ± 0.09
0.762PheHis: 0.762 ± 0.037
2.366PheIle: 2.366 ± 0.072
1.671PheLys: 1.671 ± 0.056
3.154PheLeu: 3.154 ± 0.094
1.087PheMet: 1.087 ± 0.046
1.842PheAsn: 1.842 ± 0.069
1.336PhePro: 1.336 ± 0.057
1.195PheGln: 1.195 ± 0.043
1.463PheArg: 1.463 ± 0.05
2.196PheSer: 2.196 ± 0.068
2.659PheThr: 2.659 ± 0.071
3.15PheVal: 3.15 ± 0.08
0.471PheTrp: 0.471 ± 0.034
1.303PheTyr: 1.303 ± 0.053
0.0PheXaa: 0.0 ± 0.0
Gly
6.755GlyAla: 6.755 ± 0.126
0.399GlyCys: 0.399 ± 0.026
4.208GlyAsp: 4.208 ± 0.106
3.106GlyGlu: 3.106 ± 0.09
3.02GlyPhe: 3.02 ± 0.064
5.188GlyGly: 5.188 ± 0.108
2.046GlyHis: 2.046 ± 0.064
5.059GlyIle: 5.059 ± 0.102
3.489GlyLys: 3.489 ± 0.091
7.295GlyLeu: 7.295 ± 0.122
2.196GlyMet: 2.196 ± 0.07
2.717GlyAsn: 2.717 ± 0.073
1.964GlyPro: 1.964 ± 0.066
3.431GlyGln: 3.431 ± 0.089
3.573GlyArg: 3.573 ± 0.085
4.064GlySer: 4.064 ± 0.091
5.269GlyThr: 5.269 ± 0.131
6.242GlyVal: 6.242 ± 0.11
0.882GlyTrp: 0.882 ± 0.041
2.781GlyTyr: 2.781 ± 0.076
0.0GlyXaa: 0.0 ± 0.0
His
2.361HisAla: 2.361 ± 0.078
0.098HisCys: 0.098 ± 0.014
1.72HisAsp: 1.72 ± 0.059
1.212HisGlu: 1.212 ± 0.043
1.097HisPhe: 1.097 ± 0.041
2.069HisGly: 2.069 ± 0.065
0.892HisHis: 0.892 ± 0.047
1.393HisIle: 1.393 ± 0.048
0.848HisLys: 0.848 ± 0.034
2.586HisLeu: 2.586 ± 0.075
0.628HisMet: 0.628 ± 0.035
1.052HisAsn: 1.052 ± 0.044
1.371HisPro: 1.371 ± 0.049
1.377HisGln: 1.377 ± 0.048
1.333HisArg: 1.333 ± 0.05
1.071HisSer: 1.071 ± 0.045
1.369HisThr: 1.369 ± 0.048
2.146HisVal: 2.146 ± 0.064
0.315HisTrp: 0.315 ± 0.022
1.035HisTyr: 1.035 ± 0.042
0.0HisXaa: 0.0 ± 0.0
Ile
5.952IleAla: 5.952 ± 0.127
0.327IleCys: 0.327 ± 0.025
3.835IleAsp: 3.835 ± 0.1
2.598IleGlu: 2.598 ± 0.075
2.196IlePhe: 2.196 ± 0.07
4.581IleGly: 4.581 ± 0.115
1.343IleHis: 1.343 ± 0.05
4.093IleIle: 4.093 ± 0.101
2.834IleLys: 2.834 ± 0.078
5.061IleLeu: 5.061 ± 0.115
1.716IleMet: 1.716 ± 0.059
2.839IleAsn: 2.839 ± 0.065
2.504IlePro: 2.504 ± 0.069
2.072IleGln: 2.072 ± 0.056
2.696IleArg: 2.696 ± 0.071
3.262IleSer: 3.262 ± 0.095
4.203IleThr: 4.203 ± 0.103
4.865IleVal: 4.865 ± 0.106
0.454IleTrp: 0.454 ± 0.025
1.541IleTyr: 1.541 ± 0.051
0.0IleXaa: 0.0 ± 0.0
Lys
4.088LysAla: 4.088 ± 0.108
0.086LysCys: 0.086 ± 0.013
2.392LysAsp: 2.392 ± 0.083
2.21LysGlu: 2.21 ± 0.074
1.427LysPhe: 1.427 ± 0.055
2.495LysGly: 2.495 ± 0.075
1.042LysHis: 1.042 ± 0.044
2.561LysIle: 2.561 ± 0.077
2.377LysLys: 2.377 ± 0.097
3.938LysLeu: 3.938 ± 0.108
1.474LysMet: 1.474 ± 0.049
1.756LysAsn: 1.756 ± 0.068
1.658LysPro: 1.658 ± 0.061
2.514LysGln: 2.514 ± 0.085
2.421LysArg: 2.421 ± 0.064
2.411LysSer: 2.411 ± 0.09
2.788LysThr: 2.788 ± 0.082
3.429LysVal: 3.429 ± 0.088
0.413LysTrp: 0.413 ± 0.029
1.568LysTyr: 1.568 ± 0.055
0.002LysXaa: 0.002 ± 0.002
Leu
10.906LeuAla: 10.906 ± 0.157
0.456LeuCys: 0.456 ± 0.032
5.62LeuAsp: 5.62 ± 0.114
3.481LeuGlu: 3.481 ± 0.08
3.475LeuPhe: 3.475 ± 0.079
7.14LeuGly: 7.14 ± 0.119
2.313LeuHis: 2.313 ± 0.061
5.532LeuIle: 5.532 ± 0.128
3.83LeuLys: 3.83 ± 0.107
9.052LeuLeu: 9.052 ± 0.202
2.581LeuMet: 2.581 ± 0.09
3.933LeuAsn: 3.933 ± 0.084
4.401LeuPro: 4.401 ± 0.103
3.917LeuGln: 3.917 ± 0.079
5.138LeuArg: 5.138 ± 0.116
5.51LeuSer: 5.51 ± 0.126
7.204LeuThr: 7.204 ± 0.126
7.639LeuVal: 7.639 ± 0.151
0.953LeuTrp: 0.953 ± 0.045
2.535LeuTyr: 2.535 ± 0.073
0.0LeuXaa: 0.0 ± 0.0
Met
2.951MetAla: 2.951 ± 0.078
0.11MetCys: 0.11 ± 0.013
1.537MetAsp: 1.537 ± 0.047
1.008MetGlu: 1.008 ± 0.043
0.923MetPhe: 0.923 ± 0.044
1.993MetGly: 1.993 ± 0.069
0.619MetHis: 0.619 ± 0.035
1.623MetIle: 1.623 ± 0.059
1.271MetLys: 1.271 ± 0.051
2.586MetLeu: 2.586 ± 0.081
0.874MetMet: 0.874 ± 0.038
1.185MetAsn: 1.185 ± 0.043
1.169MetPro: 1.169 ± 0.048
1.343MetGln: 1.343 ± 0.053
1.457MetArg: 1.457 ± 0.053
1.546MetSer: 1.546 ± 0.052
2.122MetThr: 2.122 ± 0.056
2.258MetVal: 2.258 ± 0.068
0.229MetTrp: 0.229 ± 0.02
0.688MetTyr: 0.688 ± 0.039
0.0MetXaa: 0.0 ± 0.0
Asn
3.788AsnAla: 3.788 ± 0.087
0.169AsnCys: 0.169 ± 0.017
2.383AsnAsp: 2.383 ± 0.077
1.873AsnGlu: 1.873 ± 0.064
1.563AsnPhe: 1.563 ± 0.053
3.324AsnGly: 3.324 ± 0.09
1.078AsnHis: 1.078 ± 0.051
2.205AsnIle: 2.205 ± 0.059
1.802AsnLys: 1.802 ± 0.061
3.481AsnLeu: 3.481 ± 0.08
1.109AsnMet: 1.109 ± 0.044
1.663AsnAsn: 1.663 ± 0.066
1.838AsnPro: 1.838 ± 0.056
1.84AsnGln: 1.84 ± 0.061
1.905AsnArg: 1.905 ± 0.069
2.039AsnSer: 2.039 ± 0.078
2.408AsnThr: 2.408 ± 0.075
3.193AsnVal: 3.193 ± 0.072
0.533AsnTrp: 0.533 ± 0.029
1.453AsnTyr: 1.453 ± 0.066
0.0AsnXaa: 0.0 ± 0.0
Pro
4.28ProAla: 4.28 ± 0.114
0.119ProCys: 0.119 ± 0.015
3.004ProAsp: 3.004 ± 0.087
2.487ProGlu: 2.487 ± 0.072
1.498ProPhe: 1.498 ± 0.059
2.827ProGly: 2.827 ± 0.072
1.025ProHis: 1.025 ± 0.041
2.112ProIle: 2.112 ± 0.062
1.522ProLys: 1.522 ± 0.05
3.776ProLeu: 3.776 ± 0.092
0.788ProMet: 0.788 ± 0.039
1.439ProAsn: 1.439 ± 0.038
0.836ProPro: 0.836 ± 0.049
1.959ProGln: 1.959 ± 0.067
1.752ProArg: 1.752 ± 0.052
1.997ProSer: 1.997 ± 0.06
2.755ProThr: 2.755 ± 0.085
3.451ProVal: 3.451 ± 0.08
0.368ProTrp: 0.368 ± 0.028
1.278ProTyr: 1.278 ± 0.055
0.0ProXaa: 0.0 ± 0.0
Gln
4.818GlnAla: 4.818 ± 0.127
0.129GlnCys: 0.129 ± 0.015
2.16GlnAsp: 2.16 ± 0.064
2.012GlnGlu: 2.012 ± 0.067
1.696GlnPhe: 1.696 ± 0.059
2.555GlnGly: 2.555 ± 0.07
1.472GlnHis: 1.472 ± 0.053
2.354GlnIle: 2.354 ± 0.069
1.747GlnLys: 1.747 ± 0.063
5.352GlnLeu: 5.352 ± 0.107
1.36GlnMet: 1.36 ± 0.046
1.555GlnAsn: 1.555 ± 0.054
2.069GlnPro: 2.069 ± 0.066
3.064GlnGln: 3.064 ± 0.102
2.923GlnArg: 2.923 ± 0.093
2.492GlnSer: 2.492 ± 0.08
2.963GlnThr: 2.963 ± 0.096
3.761GlnVal: 3.761 ± 0.096
0.543GlnTrp: 0.543 ± 0.033
1.702GlnTyr: 1.702 ± 0.064
0.002GlnXaa: 0.002 ± 0.002
Arg
4.633ArgAla: 4.633 ± 0.113
0.172ArgCys: 0.172 ± 0.017
2.978ArgAsp: 2.978 ± 0.082
2.437ArgGlu: 2.437 ± 0.079
1.988ArgPhe: 1.988 ± 0.063
3.183ArgGly: 3.183 ± 0.085
1.618ArgHis: 1.618 ± 0.059
2.985ArgIle: 2.985 ± 0.081
2.033ArgLys: 2.033 ± 0.064
5.145ArgLeu: 5.145 ± 0.103
1.482ArgMet: 1.482 ± 0.052
1.916ArgAsn: 1.916 ± 0.063
1.919ArgPro: 1.919 ± 0.06
2.712ArgGln: 2.712 ± 0.088
3.479ArgArg: 3.479 ± 0.097
2.34ArgSer: 2.34 ± 0.068
3.102ArgThr: 3.102 ± 0.086
3.938ArgVal: 3.938 ± 0.091
0.629ArgTrp: 0.629 ± 0.036
1.943ArgTyr: 1.943 ± 0.064
0.0ArgXaa: 0.0 ± 0.0
Ser
5.321SerAla: 5.321 ± 0.15
0.155SerCys: 0.155 ± 0.016
3.03SerAsp: 3.03 ± 0.089
2.205SerGlu: 2.205 ± 0.076
2.162SerPhe: 2.162 ± 0.067
4.617SerGly: 4.617 ± 0.099
1.248SerHis: 1.248 ± 0.045
2.982SerIle: 2.982 ± 0.086
2.469SerLys: 2.469 ± 0.09
5.028SerLeu: 5.028 ± 0.138
1.42SerMet: 1.42 ± 0.051
2.11SerAsn: 2.11 ± 0.068
1.721SerPro: 1.721 ± 0.057
2.261SerGln: 2.261 ± 0.079
2.566SerArg: 2.566 ± 0.069
3.806SerSer: 3.806 ± 0.203
3.704SerThr: 3.704 ± 0.101
4.155SerVal: 4.155 ± 0.086
0.707SerTrp: 0.707 ± 0.041
1.67SerTyr: 1.67 ± 0.064
0.0SerXaa: 0.0 ± 0.0
Thr
7.167ThrAla: 7.167 ± 0.166
0.217ThrCys: 0.217 ± 0.019
4.153ThrAsp: 4.153 ± 0.103
2.643ThrGlu: 2.643 ± 0.068
2.48ThrPhe: 2.48 ± 0.07
5.417ThrGly: 5.417 ± 0.132
1.51ThrHis: 1.51 ± 0.052
4.452ThrIle: 4.452 ± 0.107
3.159ThrLys: 3.159 ± 0.088
6.399ThrLeu: 6.399 ± 0.106
1.646ThrMet: 1.646 ± 0.052
2.717ThrAsn: 2.717 ± 0.08
3.247ThrPro: 3.247 ± 0.1
2.576ThrGln: 2.576 ± 0.085
2.703ThrArg: 2.703 ± 0.064
3.929ThrSer: 3.929 ± 0.122
5.467ThrThr: 5.467 ± 0.203
6.306ThrVal: 6.306 ± 0.172
0.738ThrTrp: 0.738 ± 0.041
2.196ThrTyr: 2.196 ± 0.075
0.002ThrXaa: 0.002 ± 0.002
Val
9.276ValAla: 9.276 ± 0.169
0.382ValCys: 0.382 ± 0.026
5.121ValAsp: 5.121 ± 0.117
3.286ValGlu: 3.286 ± 0.076
2.647ValPhe: 2.647 ± 0.07
6.217ValGly: 6.217 ± 0.125
1.838ValHis: 1.838 ± 0.063
4.851ValIle: 4.851 ± 0.102
3.254ValLys: 3.254 ± 0.104
7.725ValLeu: 7.725 ± 0.144
2.165ValMet: 2.165 ± 0.06
3.3ValAsn: 3.3 ± 0.085
3.486ValPro: 3.486 ± 0.083
3.221ValGln: 3.221 ± 0.088
3.89ValArg: 3.89 ± 0.082
4.359ValSer: 4.359 ± 0.103
6.299ValThr: 6.299 ± 0.152
7.112ValVal: 7.112 ± 0.145
0.791ValTrp: 0.791 ± 0.041
2.253ValTyr: 2.253 ± 0.065
0.002ValXaa: 0.002 ± 0.002
Trp
0.862TrpAla: 0.862 ± 0.041
0.057TrpCys: 0.057 ± 0.011
0.495TrpAsp: 0.495 ± 0.03
0.347TrpGlu: 0.347 ± 0.024
0.526TrpPhe: 0.526 ± 0.031
0.777TrpGly: 0.777 ± 0.04
0.404TrpHis: 0.404 ± 0.029
0.564TrpIle: 0.564 ± 0.035
0.239TrpLys: 0.239 ± 0.021
1.45TrpLeu: 1.45 ± 0.059
0.327TrpMet: 0.327 ± 0.025
0.418TrpAsn: 0.418 ± 0.032
0.414TrpPro: 0.414 ± 0.032
0.683TrpGln: 0.683 ± 0.038
0.717TrpArg: 0.717 ± 0.039
0.664TrpSer: 0.664 ± 0.036
0.604TrpThr: 0.604 ± 0.035
0.832TrpVal: 0.832 ± 0.04
0.253TrpTrp: 0.253 ± 0.024
0.408TrpTyr: 0.408 ± 0.03
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.996TyrAla: 2.996 ± 0.076
0.175TyrCys: 0.175 ± 0.019
2.0TyrAsp: 2.0 ± 0.066
1.259TyrGlu: 1.259 ± 0.051
1.567TyrPhe: 1.567 ± 0.05
2.586TyrGly: 2.586 ± 0.081
0.91TyrHis: 0.91 ± 0.043
1.604TyrIle: 1.604 ± 0.053
1.104TyrLys: 1.104 ± 0.046
3.371TyrLeu: 3.371 ± 0.078
0.781TyrMet: 0.781 ± 0.039
1.288TyrAsn: 1.288 ± 0.055
1.384TyrPro: 1.384 ± 0.044
1.788TyrGln: 1.788 ± 0.065
1.849TyrArg: 1.849 ± 0.065
1.642TyrSer: 1.642 ± 0.054
2.205TyrThr: 2.205 ± 0.085
2.428TyrVal: 2.428 ± 0.072
0.435TyrTrp: 0.435 ± 0.029
1.271TyrTyr: 1.271 ± 0.051
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.002XaaPhe: 0.002 ± 0.002
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.002XaaGln: 0.002 ± 0.002
0.002XaaArg: 0.002 ± 0.002
0.0XaaSer: 0.0 ± 0.0
0.003XaaThr: 0.003 ± 0.003
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1860 proteins (581517 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski