Amino acid dipeptide frequency for Lactobacillus reuteri (strain DSM 20016)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
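As an illustration of the calculation described above, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein and that any non-standard letter is mapped to Xaa (`X`); the page does not state either detail. The function name `dipeptide_frequencies` and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: non-standard letters are mapped to 'X' (Xaa), giving up to
    441 possible keys, and pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


toy_proteome = ["MALLKA", "MKLLA"]  # hypothetical sequences, for illustration only
freqs = dipeptide_frequencies(toy_proteome)
uniform_expectation = 1000.0 / 400  # 0.25% expressed in per mille
print(freqs["LL"], uniform_expectation)
```

On a real proteome the input would be the full list of protein sequences, for example the 1865 proteins on which the table below is based.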

All values are given in per mille (‰); to convert them to fractions, multiply by 10⁻³.
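Because the table is in per mille while the random expectation is quoted in percent, it can help to put both on the same scale: 0.25% equals 2.5‰. The sketch below converts table values to fractions and labels them relative to that value. It assumes the comparison is made against the uniform 2.5‰ expectation and ignores the reported error; the helper names are hypothetical.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 0.25% expressed in per mille, i.e. 2.5


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected_permille=UNIFORM_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if value_permille > expected_permille:
        return "over"
    if value_permille < expected_permille:
        return "under"
    return "as expected"


# Two values taken from the table: LeuLeu and CysCys
for name, value in [("LeuLeu", 8.665), ("CysCys", 0.061)]:
    print(name, permille_to_fraction(value), classify(value))
```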

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.528 ± 0.138
AlaCys: 0.413 ± 0.025
AlaAsp: 4.787 ± 0.083
AlaGlu: 4.238 ± 0.088
AlaPhe: 3.114 ± 0.093
AlaGly: 5.552 ± 0.116
AlaHis: 1.573 ± 0.057
AlaIle: 5.926 ± 0.124
AlaLys: 5.656 ± 0.107
AlaLeu: 7.168 ± 0.133
AlaMet: 2.134 ± 0.063
AlaAsn: 3.915 ± 0.102
AlaPro: 2.395 ± 0.066
AlaGln: 3.407 ± 0.089
AlaArg: 2.996 ± 0.078
AlaSer: 4.045 ± 0.109
AlaThr: 4.676 ± 0.098
AlaVal: 5.304 ± 0.105
AlaTrp: 0.692 ± 0.039
AlaTyr: 2.544 ± 0.075
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.395 ± 0.028
CysCys: 0.061 ± 0.012
CysAsp: 0.306 ± 0.024
CysGlu: 0.272 ± 0.023
CysPhe: 0.259 ± 0.023
CysGly: 0.601 ± 0.032
CysHis: 0.223 ± 0.023
CysIle: 0.399 ± 0.026
CysLys: 0.282 ± 0.025
CysLeu: 0.615 ± 0.038
CysMet: 0.12 ± 0.015
CysAsn: 0.234 ± 0.021
CysPro: 0.265 ± 0.026
CysGln: 0.232 ± 0.021
CysArg: 0.241 ± 0.02
CysSer: 0.365 ± 0.025
CysThr: 0.297 ± 0.023
CysVal: 0.363 ± 0.025
CysTrp: 0.075 ± 0.012
CysTyr: 0.254 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.051 ± 0.083
AspCys: 0.297 ± 0.023
AspAsp: 3.665 ± 0.1
AspGlu: 4.364 ± 0.097
AspPhe: 2.597 ± 0.081
AspGly: 3.881 ± 0.094
AspHis: 1.557 ± 0.056
AspIle: 3.777 ± 0.08
AspLys: 3.8 ± 0.09
AspLeu: 5.577 ± 0.11
AspMet: 1.498 ± 0.052
AspAsn: 3.109 ± 0.101
AspPro: 2.438 ± 0.06
AspGln: 2.946 ± 0.077
AspArg: 2.521 ± 0.066
AspSer: 2.925 ± 0.083
AspThr: 2.708 ± 0.071
AspVal: 3.777 ± 0.088
AspTrp: 0.776 ± 0.039
AspTyr: 2.619 ± 0.09
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.036 ± 0.105
GluCys: 0.25 ± 0.023
GluAsp: 3.132 ± 0.08
GluGlu: 4.028 ± 0.105
GluPhe: 2.111 ± 0.062
GluGly: 2.751 ± 0.071
GluHis: 1.359 ± 0.047
GluIle: 4.203 ± 0.104
GluLys: 4.589 ± 0.101
GluLeu: 6.026 ± 0.111
GluMet: 1.829 ± 0.072
GluAsn: 3.248 ± 0.081
GluPro: 1.695 ± 0.059
GluGln: 3.048 ± 0.09
GluArg: 2.83 ± 0.081
GluSer: 2.547 ± 0.073
GluThr: 2.871 ± 0.077
GluVal: 3.834 ± 0.095
GluTrp: 0.577 ± 0.034
GluTyr: 2.049 ± 0.057
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.311 ± 0.09
PheCys: 0.309 ± 0.024
PheAsp: 2.749 ± 0.067
PheGlu: 2.056 ± 0.07
PhePhe: 1.832 ± 0.079
PheGly: 3.034 ± 0.083
PheHis: 0.867 ± 0.04
PheIle: 3.212 ± 0.085
PheLys: 2.438 ± 0.074
PheLeu: 3.627 ± 0.093
PheMet: 1.176 ± 0.053
PheAsn: 2.363 ± 0.064
PhePro: 1.496 ± 0.053
PheGln: 1.225 ± 0.048
PheArg: 1.376 ± 0.058
PheSer: 2.599 ± 0.073
PheThr: 2.469 ± 0.075
PheVal: 2.785 ± 0.074
PheTrp: 0.538 ± 0.036
PheTyr: 1.593 ± 0.055
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.759 ± 0.103
GlyCys: 0.42 ± 0.029
GlyAsp: 3.536 ± 0.084
GlyGlu: 3.42 ± 0.086
GlyPhe: 2.908 ± 0.072
GlyGly: 4.165 ± 0.104
GlyHis: 1.627 ± 0.057
GlyIle: 5.654 ± 0.116
GlyLys: 4.857 ± 0.1
GlyLeu: 6.164 ± 0.119
GlyMet: 2.018 ± 0.062
GlyAsn: 3.025 ± 0.071
GlyPro: 1.602 ± 0.069
GlyGln: 2.81 ± 0.087
GlyArg: 2.755 ± 0.078
GlySer: 3.441 ± 0.09
GlyThr: 4.099 ± 0.093
GlyVal: 4.626 ± 0.103
GlyTrp: 0.838 ± 0.038
GlyTyr: 2.739 ± 0.067
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.359 ± 0.052
HisCys: 0.147 ± 0.015
HisAsp: 1.464 ± 0.051
HisGlu: 1.285 ± 0.048
HisPhe: 1.128 ± 0.046
HisGly: 1.702 ± 0.061
HisHis: 0.915 ± 0.048
HisIle: 1.348 ± 0.054
HisLys: 1.062 ± 0.045
HisLeu: 2.249 ± 0.069
HisMet: 0.535 ± 0.029
HisAsn: 1.121 ± 0.043
HisPro: 1.251 ± 0.049
HisGln: 1.403 ± 0.053
HisArg: 1.223 ± 0.048
HisSer: 1.166 ± 0.047
HisThr: 1.108 ± 0.045
HisVal: 1.437 ± 0.048
HisTrp: 0.297 ± 0.023
HisTyr: 1.101 ± 0.049
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.48 ± 0.12
IleCys: 0.602 ± 0.036
IleAsp: 4.63 ± 0.093
IleGlu: 3.761 ± 0.083
IlePhe: 2.982 ± 0.093
IleGly: 5.168 ± 0.121
IleHis: 1.523 ± 0.057
IleIle: 6.017 ± 0.141
IleLys: 4.973 ± 0.096
IleLeu: 6.511 ± 0.115
IleMet: 1.689 ± 0.055
IleAsn: 4.174 ± 0.102
IlePro: 2.953 ± 0.078
IleGln: 2.533 ± 0.075
IleArg: 2.644 ± 0.086
IleSer: 4.437 ± 0.103
IleThr: 4.517 ± 0.09
IleVal: 5.113 ± 0.109
IleTrp: 0.633 ± 0.035
IleTyr: 2.37 ± 0.068
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.687 ± 0.106
LysCys: 0.236 ± 0.022
LysAsp: 3.888 ± 0.095
LysGlu: 4.739 ± 0.101
LysPhe: 2.226 ± 0.069
LysGly: 3.332 ± 0.098
LysHis: 1.425 ± 0.052
LysIle: 4.834 ± 0.105
LysLys: 5.343 ± 0.113
LysLeu: 6.044 ± 0.102
LysMet: 2.533 ± 0.06
LysAsn: 3.677 ± 0.097
LysPro: 2.22 ± 0.069
LysGln: 3.588 ± 0.093
LysArg: 3.162 ± 0.077
LysSer: 3.153 ± 0.091
LysThr: 3.715 ± 0.082
LysVal: 4.487 ± 0.1
LysTrp: 0.703 ± 0.037
LysTyr: 2.787 ± 0.084
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.203 ± 0.153
LeuCys: 0.606 ± 0.037
LeuAsp: 5.458 ± 0.114
LeuGlu: 4.657 ± 0.083
LeuPhe: 3.893 ± 0.101
LeuGly: 6.412 ± 0.128
LeuHis: 2.008 ± 0.061
LeuIle: 6.922 ± 0.138
LeuLys: 6.198 ± 0.12
LeuLeu: 8.665 ± 0.16
LeuMet: 2.529 ± 0.07
LeuAsn: 5.02 ± 0.081
LeuPro: 4.087 ± 0.09
LeuGln: 3.979 ± 0.093
LeuArg: 4.031 ± 0.094
LeuSer: 5.928 ± 0.104
LeuThr: 6.275 ± 0.118
LeuVal: 6.516 ± 0.122
LeuTrp: 0.792 ± 0.041
LeuTyr: 2.862 ± 0.072
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.413 ± 0.069
MetCys: 0.172 ± 0.015
MetAsp: 1.518 ± 0.052
MetGlu: 1.427 ± 0.049
MetPhe: 1.006 ± 0.044
MetGly: 1.766 ± 0.051
MetHis: 0.52 ± 0.029
MetIle: 2.152 ± 0.069
MetLys: 2.049 ± 0.064
MetLeu: 2.385 ± 0.064
MetMet: 0.917 ± 0.046
MetAsn: 1.525 ± 0.05
MetPro: 1.101 ± 0.045
MetGln: 1.069 ± 0.043
MetArg: 1.198 ± 0.054
MetSer: 1.645 ± 0.056
MetThr: 1.934 ± 0.056
MetVal: 1.818 ± 0.059
MetTrp: 0.195 ± 0.018
MetTyr: 0.703 ± 0.035
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.568 ± 0.078
AsnCys: 0.352 ± 0.029
AsnAsp: 3.45 ± 0.093
AsnGlu: 3.216 ± 0.087
AsnPhe: 1.974 ± 0.064
AsnGly: 3.872 ± 0.092
AsnHis: 1.409 ± 0.049
AsnIle: 3.414 ± 0.08
AsnLys: 3.447 ± 0.08
AsnLeu: 4.378 ± 0.097
AsnMet: 1.31 ± 0.046
AsnAsn: 3.144 ± 0.092
AsnPro: 2.292 ± 0.058
AsnGln: 2.581 ± 0.085
AsnArg: 2.145 ± 0.067
AsnSer: 2.841 ± 0.075
AsnThr: 2.487 ± 0.078
AsnVal: 3.284 ± 0.082
AsnTrp: 0.695 ± 0.033
AsnTyr: 2.229 ± 0.06
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.034 ± 0.08
ProCys: 0.15 ± 0.017
ProAsp: 2.342 ± 0.069
ProGlu: 2.737 ± 0.071
ProPhe: 1.611 ± 0.051
ProGly: 2.242 ± 0.066
ProHis: 0.894 ± 0.039
ProIle: 2.571 ± 0.072
ProLys: 2.013 ± 0.064
ProLeu: 3.582 ± 0.088
ProMet: 0.822 ± 0.038
ProAsn: 1.895 ± 0.061
ProPro: 0.631 ± 0.037
ProGln: 1.934 ± 0.059
ProArg: 1.314 ± 0.05
ProSer: 2.02 ± 0.079
ProThr: 2.494 ± 0.079
ProVal: 2.955 ± 0.085
ProTrp: 0.397 ± 0.031
ProTyr: 1.343 ± 0.053
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.652 ± 0.104
GlnCys: 0.138 ± 0.017
GlnAsp: 2.059 ± 0.062
GlnGlu: 2.678 ± 0.091
GlnPhe: 1.757 ± 0.056
GlnGly: 2.335 ± 0.066
GlnHis: 1.082 ± 0.043
GlnIle: 3.332 ± 0.081
GlnLys: 3.009 ± 0.083
GlnLeu: 5.273 ± 0.119
GlnMet: 1.309 ± 0.054
GlnAsn: 2.159 ± 0.061
GlnPro: 1.75 ± 0.064
GlnGln: 3.123 ± 0.105
GlnArg: 2.517 ± 0.081
GlnSer: 2.39 ± 0.072
GlnThr: 2.66 ± 0.08
GlnVal: 3.155 ± 0.075
GlnTrp: 0.602 ± 0.041
GlnTyr: 1.793 ± 0.058
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.869 ± 0.075
ArgCys: 0.229 ± 0.019
ArgAsp: 2.379 ± 0.076
ArgGlu: 2.533 ± 0.067
ArgPhe: 1.863 ± 0.058
ArgGly: 2.451 ± 0.074
ArgHis: 1.239 ± 0.047
ArgIle: 2.95 ± 0.064
ArgLys: 2.973 ± 0.086
ArgLeu: 4.314 ± 0.093
ArgMet: 1.196 ± 0.048
ArgAsn: 2.018 ± 0.066
ArgPro: 1.534 ± 0.05
ArgGln: 2.671 ± 0.069
ArgArg: 2.488 ± 0.075
ArgSer: 2.05 ± 0.062
ArgThr: 2.024 ± 0.061
ArgVal: 2.857 ± 0.088
ArgTrp: 0.449 ± 0.027
ArgTyr: 1.802 ± 0.056
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.224 ± 0.109
SerCys: 0.295 ± 0.023
SerAsp: 2.967 ± 0.088
SerGlu: 2.792 ± 0.082
SerPhe: 2.673 ± 0.075
SerGly: 3.906 ± 0.098
SerHis: 1.173 ± 0.042
SerIle: 3.824 ± 0.088
SerLys: 3.396 ± 0.093
SerLeu: 5.733 ± 0.101
SerMet: 1.514 ± 0.049
SerAsn: 2.73 ± 0.072
SerPro: 1.981 ± 0.053
SerGln: 2.673 ± 0.082
SerArg: 2.302 ± 0.069
SerSer: 3.695 ± 0.136
SerThr: 3.241 ± 0.095
SerVal: 3.527 ± 0.083
SerTrp: 0.711 ± 0.042
SerTyr: 2.063 ± 0.067
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.657 ± 0.109
ThrCys: 0.32 ± 0.028
ThrAsp: 3.482 ± 0.084
ThrGlu: 2.832 ± 0.071
ThrPhe: 2.236 ± 0.068
ThrGly: 4.387 ± 0.09
ThrHis: 1.255 ± 0.047
ThrIle: 4.773 ± 0.101
ThrLys: 3.699 ± 0.08
ThrLeu: 5.164 ± 0.095
ThrMet: 1.448 ± 0.053
ThrAsn: 2.933 ± 0.071
ThrPro: 2.794 ± 0.075
ThrGln: 2.066 ± 0.059
ThrArg: 2.143 ± 0.057
ThrSer: 3.43 ± 0.086
ThrThr: 3.67 ± 0.101
ThrVal: 4.34 ± 0.092
ThrTrp: 0.563 ± 0.038
ThrTyr: 2.031 ± 0.079
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.674 ± 0.105
ValCys: 0.49 ± 0.028
ValAsp: 4.226 ± 0.097
ValGlu: 3.643 ± 0.088
ValPhe: 2.555 ± 0.073
ValGly: 4.61 ± 0.105
ValHis: 1.26 ± 0.053
ValIle: 5.461 ± 0.111
ValLys: 4.61 ± 0.098
ValLeu: 6.087 ± 0.128
ValMet: 1.825 ± 0.059
ValAsn: 3.581 ± 0.079
ValPro: 2.703 ± 0.074
ValGln: 2.533 ± 0.074
ValArg: 2.429 ± 0.073
ValSer: 4.045 ± 0.091
ValThr: 4.55 ± 0.1
ValVal: 4.891 ± 0.109
ValTrp: 0.617 ± 0.031
ValTyr: 2.206 ± 0.061
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.699 ± 0.036
TrpCys: 0.066 ± 0.011
TrpAsp: 0.524 ± 0.037
TrpGlu: 0.452 ± 0.029
TrpPhe: 0.522 ± 0.036
TrpGly: 0.729 ± 0.046
TrpHis: 0.3 ± 0.023
TrpIle: 0.772 ± 0.039
TrpLys: 0.554 ± 0.035
TrpLeu: 1.368 ± 0.052
TrpMet: 0.299 ± 0.022
TrpAsn: 0.495 ± 0.029
TrpPro: 0.316 ± 0.025
TrpGln: 0.665 ± 0.037
TrpArg: 0.556 ± 0.03
TrpSer: 0.595 ± 0.029
TrpThr: 0.563 ± 0.032
TrpVal: 0.645 ± 0.036
TrpTrp: 0.195 ± 0.023
TrpTyr: 0.467 ± 0.036
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.467 ± 0.076
TyrCys: 0.306 ± 0.023
TyrAsp: 2.322 ± 0.074
TyrGlu: 1.884 ± 0.049
TyrPhe: 1.809 ± 0.067
TyrGly: 2.567 ± 0.07
TyrHis: 1.092 ± 0.047
TyrIle: 2.17 ± 0.063
TyrLys: 1.831 ± 0.063
TyrLeu: 4.11 ± 0.103
TyrMet: 0.808 ± 0.037
TyrAsn: 1.721 ± 0.064
TyrPro: 1.475 ± 0.049
TyrGln: 2.286 ± 0.066
TyrArg: 2.015 ± 0.066
TyrSer: 2.102 ± 0.058
TyrThr: 1.868 ± 0.065
TyrVal: 2.281 ± 0.066
TyrTrp: 0.438 ± 0.032
TyrTyr: 1.618 ± 0.08
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1865 proteins (559402 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
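A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the error is taken as the standard deviation across replicates. The page does not specify which spread measure is used, and the function names and toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the reported error is a standard deviation; the source only
    says "bootstrapping (x100) at the protein level".
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)


toy_proteome = ["MALLKA", "MKLLA", "MAAGLL"]  # hypothetical sequences
print(dipeptide_permille(toy_proteome, "LL"), bootstrap_error(toy_proteome, "LL"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster within particular proteins contribute to the estimated spread.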

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski