Amino acid dipeptide frequency for Lactobacillus equigenerosi DSM 18793 = JCM 14505

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
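As a rough illustration of what such a frequency means, the sketch below counts overlapping adjacent residue pairs in a list of one-letter protein sequences and reports each of the 441 dipeptides in per mille. It is not the code used by the database: the function name `dipeptide_permille`, the mapping of every non-standard letter to `X`, and the choice to pool pairs over all proteins are assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = AMINO_ACIDS + "X"
    # Report all 21 x 21 = 441 dipeptides, including those never observed.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400

# Two toy sequences (hypothetical, not taken from the proteome).
freqs = dipeptide_permille(["MKLLA", "ALLKX"])
```

With real data, `proteins` would be the sequences read from the proteome's FASTA file; the toy input here only demonstrates the bookkeeping.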

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
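The unit conversion and the over/under-representation check can be sketched as follows. The threshold used here is the uniform expectation of 2.5‰; the page does not state the exact rule behind the colouring, so treat `classify` as an assumption rather than the database's method. The two sample values are taken from the table (LeuAla and CysCys).

```python
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille under the uniform assumption


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if value_permille > expected:
        return "over"    # would be shown in red
    if value_permille < expected:
        return "under"   # would be shown in blue
    return "expected"


# Values copied from the table below.
examples = {"LeuAla": 10.356, "CysCys": 0.041}
for name, value in examples.items():
    print(name, permille_to_fraction(value), classify(value))
```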

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.149 ± 0.168
AlaCys: 0.35 ± 0.031
AlaAsp: 5.377 ± 0.126
AlaGlu: 4.683 ± 0.114
AlaPhe: 3.008 ± 0.087
AlaGly: 6.544 ± 0.146
AlaHis: 1.595 ± 0.058
AlaIle: 5.748 ± 0.116
AlaLys: 5.74 ± 0.125
AlaLeu: 8.106 ± 0.142
AlaMet: 2.524 ± 0.08
AlaAsn: 4.188 ± 0.115
AlaPro: 2.807 ± 0.089
AlaGln: 4.644 ± 0.135
AlaArg: 3.051 ± 0.092
AlaSer: 4.337 ± 0.116
AlaThr: 6.65 ± 0.125
AlaVal: 6.397 ± 0.126
AlaTrp: 0.936 ± 0.053
AlaTyr: 2.619 ± 0.072
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.357 ± 0.028
CysCys: 0.041 ± 0.009
CysAsp: 0.225 ± 0.022
CysGlu: 0.212 ± 0.021
CysPhe: 0.236 ± 0.021
CysGly: 0.445 ± 0.035
CysHis: 0.158 ± 0.02
CysIle: 0.264 ± 0.025
CysLys: 0.138 ± 0.018
CysLeu: 0.542 ± 0.034
CysMet: 0.091 ± 0.015
CysAsn: 0.158 ± 0.019
CysPro: 0.21 ± 0.026
CysGln: 0.22 ± 0.021
CysArg: 0.182 ± 0.021
CysSer: 0.192 ± 0.02
CysThr: 0.214 ± 0.022
CysVal: 0.309 ± 0.027
CysTrp: 0.076 ± 0.013
CysTyr: 0.153 ± 0.018
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.27 ± 0.105
AspCys: 0.214 ± 0.022
AspAsp: 3.847 ± 0.106
AspGlu: 4.428 ± 0.102
AspPhe: 2.578 ± 0.079
AspGly: 3.672 ± 0.104
AspHis: 1.683 ± 0.066
AspIle: 2.876 ± 0.08
AspLys: 2.842 ± 0.102
AspLeu: 6.109 ± 0.128
AspMet: 1.366 ± 0.055
AspAsn: 2.58 ± 0.078
AspPro: 2.295 ± 0.077
AspGln: 3.989 ± 0.106
AspArg: 2.539 ± 0.074
AspSer: 2.403 ± 0.068
AspThr: 2.58 ± 0.085
AspVal: 3.963 ± 0.1
AspTrp: 0.795 ± 0.036
AspTyr: 2.498 ± 0.077
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.754 ± 0.133
GluCys: 0.212 ± 0.02
GluAsp: 2.472 ± 0.092
GluGlu: 2.755 ± 0.1
GluPhe: 2.159 ± 0.082
GluGly: 2.31 ± 0.08
GluHis: 1.374 ± 0.06
GluIle: 3.929 ± 0.106
GluLys: 2.738 ± 0.088
GluLeu: 6.522 ± 0.144
GluMet: 1.794 ± 0.064
GluAsn: 2.444 ± 0.083
GluPro: 1.731 ± 0.064
GluGln: 3.419 ± 0.106
GluArg: 2.688 ± 0.09
GluSer: 2.511 ± 0.086
GluThr: 3.168 ± 0.08
GluVal: 4.18 ± 0.115
GluTrp: 0.625 ± 0.04
GluTyr: 1.984 ± 0.072
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.229 ± 0.097
PheCys: 0.242 ± 0.021
PheAsp: 2.898 ± 0.073
PheGlu: 2.062 ± 0.076
PhePhe: 1.575 ± 0.077
PheGly: 3.244 ± 0.095
PheHis: 0.767 ± 0.042
PheIle: 2.574 ± 0.098
PheLys: 2.401 ± 0.083
PheLeu: 3.231 ± 0.108
PheMet: 1.132 ± 0.048
PheAsn: 2.144 ± 0.066
PhePro: 1.34 ± 0.053
PheGln: 1.4 ± 0.062
PheArg: 1.256 ± 0.059
PheSer: 2.165 ± 0.07
PheThr: 2.377 ± 0.074
PheVal: 2.861 ± 0.085
PheTrp: 0.478 ± 0.035
PheTyr: 1.411 ± 0.058
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.219 ± 0.119
GlyCys: 0.322 ± 0.026
GlyAsp: 3.603 ± 0.102
GlyGlu: 3.561 ± 0.093
GlyPhe: 2.783 ± 0.088
GlyGly: 4.525 ± 0.132
GlyHis: 1.623 ± 0.052
GlyIle: 5.031 ± 0.129
GlyLys: 4.121 ± 0.101
GlyLeu: 6.602 ± 0.125
GlyMet: 2.345 ± 0.07
GlyAsn: 3.002 ± 0.103
GlyPro: 1.627 ± 0.063
GlyGln: 3.488 ± 0.117
GlyArg: 2.803 ± 0.082
GlySer: 3.652 ± 0.093
GlyThr: 4.212 ± 0.089
GlyVal: 5.182 ± 0.115
GlyTrp: 0.957 ± 0.054
GlyTyr: 2.773 ± 0.074
GlyXaa: 0.002 ± 0.002
His
HisAla: 1.707 ± 0.062
HisCys: 0.093 ± 0.014
HisAsp: 1.407 ± 0.055
HisGlu: 1.333 ± 0.054
HisPhe: 0.979 ± 0.051
HisGly: 1.837 ± 0.077
HisHis: 0.927 ± 0.048
HisIle: 1.001 ± 0.045
HisLys: 0.821 ± 0.044
HisLeu: 2.28 ± 0.07
HisMet: 0.424 ± 0.029
HisAsn: 0.847 ± 0.044
HisPro: 1.307 ± 0.056
HisGln: 1.839 ± 0.062
HisArg: 1.128 ± 0.049
HisSer: 0.94 ± 0.052
HisThr: 1.117 ± 0.054
HisVal: 1.511 ± 0.056
HisTrp: 0.307 ± 0.025
HisTyr: 0.951 ± 0.048
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.632 ± 0.14
IleCys: 0.411 ± 0.031
IleAsp: 4.175 ± 0.104
IleGlu: 3.711 ± 0.091
IlePhe: 2.539 ± 0.095
IleGly: 4.674 ± 0.114
IleHis: 1.169 ± 0.054
IleIle: 4.493 ± 0.133
IleLys: 4.026 ± 0.1
IleLeu: 5.643 ± 0.125
IleMet: 1.761 ± 0.069
IleAsn: 3.687 ± 0.111
IlePro: 2.619 ± 0.082
IleGln: 2.71 ± 0.091
IleArg: 2.295 ± 0.073
IleSer: 3.808 ± 0.097
IleThr: 4.279 ± 0.085
IleVal: 4.782 ± 0.107
IleTrp: 0.663 ± 0.047
IleTyr: 2.114 ± 0.072
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.253 ± 0.109
LysCys: 0.164 ± 0.019
LysAsp: 2.768 ± 0.094
LysGlu: 3.16 ± 0.1
LysPhe: 2.003 ± 0.079
LysGly: 2.693 ± 0.088
LysHis: 1.271 ± 0.052
LysIle: 3.373 ± 0.094
LysLys: 2.961 ± 0.098
LysLeu: 5.705 ± 0.115
LysMet: 1.971 ± 0.064
LysAsn: 2.451 ± 0.095
LysPro: 2.101 ± 0.068
LysGln: 3.546 ± 0.098
LysArg: 2.861 ± 0.079
LysSer: 2.818 ± 0.088
LysThr: 3.451 ± 0.093
LysVal: 3.92 ± 0.111
LysTrp: 0.581 ± 0.032
LysTyr: 2.12 ± 0.071
LysXaa: 0.002 ± 0.003
Leu
LeuAla: 10.356 ± 0.176
LeuCys: 0.439 ± 0.032
LeuAsp: 5.379 ± 0.13
LeuGlu: 4.314 ± 0.119
LeuPhe: 3.469 ± 0.105
LeuGly: 7.034 ± 0.14
LeuHis: 1.824 ± 0.066
LeuIle: 7.112 ± 0.169
LeuLys: 5.383 ± 0.108
LeuLeu: 8.847 ± 0.206
LeuMet: 2.935 ± 0.074
LeuAsn: 4.674 ± 0.113
LeuPro: 4.385 ± 0.111
LeuGln: 4.508 ± 0.119
LeuArg: 3.76 ± 0.105
LeuSer: 5.917 ± 0.126
LeuThr: 7.683 ± 0.147
LeuVal: 7.555 ± 0.17
LeuTrp: 0.953 ± 0.047
LeuTyr: 2.833 ± 0.101
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.662 ± 0.08
MetCys: 0.121 ± 0.017
MetAsp: 1.495 ± 0.053
MetGlu: 1.437 ± 0.053
MetPhe: 0.951 ± 0.05
MetGly: 1.787 ± 0.062
MetHis: 0.486 ± 0.03
MetIle: 2.085 ± 0.063
MetLys: 1.709 ± 0.071
MetLeu: 2.678 ± 0.076
MetMet: 1.048 ± 0.054
MetAsn: 1.312 ± 0.052
MetPro: 1.102 ± 0.048
MetGln: 1.394 ± 0.056
MetArg: 1.119 ± 0.048
MetSer: 1.612 ± 0.062
MetThr: 2.057 ± 0.067
MetVal: 2.176 ± 0.073
MetTrp: 0.231 ± 0.023
MetTyr: 0.687 ± 0.038
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.272 ± 0.082
AsnCys: 0.238 ± 0.029
AsnAsp: 3.028 ± 0.085
AsnGlu: 2.751 ± 0.082
AsnPhe: 1.766 ± 0.062
AsnGly: 3.482 ± 0.089
AsnHis: 1.349 ± 0.052
AsnIle: 2.34 ± 0.084
AsnLys: 2.159 ± 0.087
AsnLeu: 4.478 ± 0.117
AsnMet: 1.167 ± 0.055
AsnAsn: 2.245 ± 0.105
AsnPro: 2.477 ± 0.08
AsnGln: 3.335 ± 0.095
AsnArg: 2.159 ± 0.072
AsnSer: 2.226 ± 0.087
AsnThr: 2.159 ± 0.075
AsnVal: 2.853 ± 0.069
AsnTrp: 0.73 ± 0.044
AsnTyr: 1.919 ± 0.07
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.004 ± 0.096
ProCys: 0.11 ± 0.015
ProAsp: 2.232 ± 0.073
ProGlu: 2.535 ± 0.081
ProPhe: 1.534 ± 0.06
ProGly: 2.606 ± 0.081
ProHis: 0.832 ± 0.036
ProIle: 2.42 ± 0.072
ProLys: 1.962 ± 0.076
ProLeu: 3.415 ± 0.099
ProMet: 0.912 ± 0.045
ProAsn: 1.854 ± 0.062
ProPro: 0.666 ± 0.038
ProGln: 1.867 ± 0.07
ProArg: 1.256 ± 0.056
ProSer: 2.127 ± 0.07
ProThr: 3.129 ± 0.091
ProVal: 3.564 ± 0.1
ProTrp: 0.441 ± 0.03
ProTyr: 1.249 ± 0.053
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.326 ± 0.194
GlnCys: 0.143 ± 0.018
GlnAsp: 2.042 ± 0.068
GlnGlu: 2.217 ± 0.068
GlnPhe: 1.973 ± 0.068
GlnGly: 2.911 ± 0.087
GlnHis: 1.279 ± 0.054
GlnIle: 3.581 ± 0.09
GlnLys: 2.163 ± 0.071
GlnLeu: 6.628 ± 0.178
GlnMet: 1.398 ± 0.054
GlnAsn: 1.805 ± 0.068
GlnPro: 2.392 ± 0.079
GlnGln: 3.791 ± 0.158
GlnArg: 2.803 ± 0.089
GlnSer: 2.678 ± 0.082
GlnThr: 3.899 ± 0.12
GlnVal: 4.657 ± 0.128
GlnTrp: 0.711 ± 0.045
GlnTyr: 1.861 ± 0.062
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.013 ± 0.078
ArgCys: 0.197 ± 0.021
ArgAsp: 2.34 ± 0.074
ArgGlu: 2.652 ± 0.095
ArgPhe: 1.75 ± 0.065
ArgGly: 2.58 ± 0.082
ArgHis: 1.176 ± 0.055
ArgIle: 2.604 ± 0.089
ArgLys: 2.124 ± 0.076
ArgLeu: 4.413 ± 0.11
ArgMet: 1.085 ± 0.043
ArgAsn: 1.72 ± 0.064
ArgPro: 1.642 ± 0.065
ArgGln: 2.872 ± 0.088
ArgArg: 2.315 ± 0.078
ArgSer: 1.9 ± 0.06
ArgThr: 2.127 ± 0.06
ArgVal: 2.842 ± 0.082
ArgTrp: 0.562 ± 0.04
ArgTyr: 1.718 ± 0.066
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.033 ± 0.12
SerCys: 0.205 ± 0.019
SerAsp: 2.894 ± 0.084
SerGlu: 2.742 ± 0.091
SerPhe: 2.425 ± 0.077
SerGly: 3.86 ± 0.104
SerHis: 1.266 ± 0.052
SerIle: 3.289 ± 0.102
SerLys: 2.589 ± 0.09
SerLeu: 5.671 ± 0.118
SerMet: 1.396 ± 0.059
SerAsn: 2.327 ± 0.073
SerPro: 1.822 ± 0.071
SerGln: 2.82 ± 0.097
SerArg: 2.176 ± 0.063
SerSer: 3.09 ± 0.113
SerThr: 2.976 ± 0.096
SerVal: 3.611 ± 0.086
SerTrp: 0.759 ± 0.04
SerTyr: 2.072 ± 0.063
SerXaa: 0.004 ± 0.003
Thr
ThrAla: 5.548 ± 0.134
ThrCys: 0.257 ± 0.024
ThrAsp: 3.773 ± 0.095
ThrGlu: 3.038 ± 0.078
ThrPhe: 2.366 ± 0.078
ThrGly: 4.904 ± 0.097
ThrHis: 1.279 ± 0.057
ThrIle: 4.577 ± 0.093
ThrLys: 3.84 ± 0.1
ThrLeu: 6.047 ± 0.128
ThrMet: 1.614 ± 0.056
ThrAsn: 3.183 ± 0.083
ThrPro: 3.421 ± 0.096
ThrGln: 2.814 ± 0.1
ThrArg: 2.206 ± 0.068
ThrSer: 3.395 ± 0.095
ThrThr: 4.958 ± 0.141
ThrVal: 4.841 ± 0.103
ThrTrp: 0.826 ± 0.048
ThrTyr: 2.148 ± 0.077
ThrXaa: 0.002 ± 0.002
Val
ValAla: 7.462 ± 0.138
ValCys: 0.365 ± 0.029
ValAsp: 4.752 ± 0.127
ValGlu: 3.84 ± 0.096
ValPhe: 2.423 ± 0.08
ValGly: 5.016 ± 0.106
ValHis: 1.312 ± 0.054
ValIle: 5.416 ± 0.121
ValLys: 4.707 ± 0.099
ValLeu: 6.503 ± 0.14
ValMet: 2.129 ± 0.073
ValAsn: 3.581 ± 0.094
ValPro: 2.898 ± 0.067
ValGln: 2.937 ± 0.089
ValArg: 2.649 ± 0.072
ValSer: 4.011 ± 0.099
ValThr: 5.446 ± 0.126
ValVal: 6.131 ± 0.12
ValTrp: 0.711 ± 0.041
ValTyr: 2.202 ± 0.067
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.754 ± 0.048
TrpCys: 0.073 ± 0.012
TrpAsp: 0.607 ± 0.04
TrpGlu: 0.508 ± 0.037
TrpPhe: 0.573 ± 0.037
TrpGly: 0.864 ± 0.049
TrpHis: 0.328 ± 0.03
TrpIle: 0.694 ± 0.041
TrpLys: 0.456 ± 0.033
TrpLeu: 1.757 ± 0.078
TrpMet: 0.348 ± 0.029
TrpAsn: 0.467 ± 0.031
TrpPro: 0.413 ± 0.03
TrpGln: 0.845 ± 0.047
TrpArg: 0.577 ± 0.045
TrpSer: 0.627 ± 0.034
TrpThr: 0.609 ± 0.038
TrpVal: 0.813 ± 0.041
TrpTrp: 0.285 ± 0.029
TrpTyr: 0.508 ± 0.047
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.453 ± 0.075
TyrCys: 0.223 ± 0.022
TyrAsp: 2.215 ± 0.071
TyrGlu: 1.828 ± 0.065
TyrPhe: 1.696 ± 0.062
TyrGly: 2.446 ± 0.075
TyrHis: 1.057 ± 0.053
TyrIle: 1.785 ± 0.072
TyrLys: 1.292 ± 0.059
TyrLeu: 4.182 ± 0.114
TyrMet: 0.711 ± 0.035
TyrAsn: 1.461 ± 0.068
TyrPro: 1.461 ± 0.058
TyrGln: 2.818 ± 0.09
TyrArg: 1.807 ± 0.062
TyrSer: 1.616 ± 0.058
TyrThr: 1.947 ± 0.067
TyrVal: 2.343 ± 0.068
TyrTrp: 0.48 ± 0.035
TyrTyr: 1.498 ± 0.058
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.002 ± 0.002
XaaCys: 0.002 ± 0.002
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.002 ± 0.003
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.002 ± 0.002
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.002 ± 0.002
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.24 ± 0.148
Statistics based on 1490 proteins (462731 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level
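One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The sketch below follows that reading. The function name `bootstrap_error`, the use of the sample standard deviation, and the fixed seed are assumptions for the example, not details confirmed by the source.

```python
import random
import statistics


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency
    by resampling whole proteins with replacement."""
    if not proteins:
        raise ValueError("need at least one protein sequence")
    rng = random.Random(seed)

    # Per protein: (occurrences of the dipeptide, number of adjacent pairs).
    stats = []
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        stats.append((pairs.count(dipeptide), len(pairs)))

    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(stats, k=len(stats))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)

    # Assumption: the reported error is the standard deviation across replicates.
    return statistics.stdev(estimates)
```

Resampling proteins rather than individual residue pairs keeps each protein's internal composition intact, so the estimate reflects variation between proteins.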

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski