Amino acid dipepetide frequency for Lactobacillus senioris DSM 24302 = JCM 17472

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.213AlaAla: 7.213 ± 0.158
0.284AlaCys: 0.284 ± 0.024
5.314AlaAsp: 5.314 ± 0.113
4.887AlaGlu: 4.887 ± 0.126
2.917AlaPhe: 2.917 ± 0.083
5.883AlaGly: 5.883 ± 0.12
1.295AlaHis: 1.295 ± 0.048
6.067AlaIle: 6.067 ± 0.124
5.593AlaLys: 5.593 ± 0.124
6.964AlaLeu: 6.964 ± 0.155
2.077AlaMet: 2.077 ± 0.065
4.422AlaAsn: 4.422 ± 0.106
2.278AlaPro: 2.278 ± 0.07
3.59AlaGln: 3.59 ± 0.095
2.62AlaArg: 2.62 ± 0.082
4.463AlaSer: 4.463 ± 0.101
5.112AlaThr: 5.112 ± 0.111
5.823AlaVal: 5.823 ± 0.13
0.678AlaTrp: 0.678 ± 0.037
2.399AlaTyr: 2.399 ± 0.075
0.0AlaXaa: 0.0 ± 0.0
Cys
0.232CysAla: 0.232 ± 0.022
0.035CysCys: 0.035 ± 0.01
0.223CysAsp: 0.223 ± 0.023
0.156CysGlu: 0.156 ± 0.017
0.197CysPhe: 0.197 ± 0.019
0.409CysGly: 0.409 ± 0.031
0.117CysHis: 0.117 ± 0.017
0.197CysIle: 0.197 ± 0.021
0.117CysLys: 0.117 ± 0.016
0.392CysLeu: 0.392 ± 0.029
0.08CysMet: 0.08 ± 0.015
0.154CysAsn: 0.154 ± 0.018
0.208CysPro: 0.208 ± 0.029
0.152CysGln: 0.152 ± 0.021
0.139CysArg: 0.139 ± 0.017
0.21CysSer: 0.21 ± 0.021
0.156CysThr: 0.156 ± 0.02
0.258CysVal: 0.258 ± 0.024
0.041CysTrp: 0.041 ± 0.009
0.121CysTyr: 0.121 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
4.155AspAla: 4.155 ± 0.105
0.186AspCys: 0.186 ± 0.022
3.655AspAsp: 3.655 ± 0.102
4.03AspGlu: 4.03 ± 0.109
2.689AspPhe: 2.689 ± 0.086
3.759AspGly: 3.759 ± 0.095
1.412AspHis: 1.412 ± 0.063
3.822AspIle: 3.822 ± 0.102
3.408AspLys: 3.408 ± 0.098
5.732AspLeu: 5.732 ± 0.123
1.498AspMet: 1.498 ± 0.053
3.068AspAsn: 3.068 ± 0.093
2.501AspPro: 2.501 ± 0.071
3.415AspGln: 3.415 ± 0.095
2.358AspArg: 2.358 ± 0.074
3.38AspSer: 3.38 ± 0.09
3.083AspThr: 3.083 ± 0.087
4.03AspVal: 4.03 ± 0.102
0.762AspTrp: 0.762 ± 0.04
2.462AspTyr: 2.462 ± 0.084
0.0AspXaa: 0.0 ± 0.0
Glu
4.461GluAla: 4.461 ± 0.11
0.154GluCys: 0.154 ± 0.018
3.114GluAsp: 3.114 ± 0.115
3.462GluGlu: 3.462 ± 0.097
2.319GluPhe: 2.319 ± 0.071
2.694GluGly: 2.694 ± 0.091
1.215GluHis: 1.215 ± 0.047
4.229GluIle: 4.229 ± 0.11
3.454GluLys: 3.454 ± 0.102
6.409GluLeu: 6.409 ± 0.157
1.854GluMet: 1.854 ± 0.064
2.982GluAsn: 2.982 ± 0.096
1.745GluPro: 1.745 ± 0.073
3.153GluGln: 3.153 ± 0.108
2.484GluArg: 2.484 ± 0.09
2.72GluSer: 2.72 ± 0.086
3.289GluThr: 3.289 ± 0.085
4.24GluVal: 4.24 ± 0.109
0.569GluTrp: 0.569 ± 0.03
1.815GluTyr: 1.815 ± 0.067
0.0GluXaa: 0.0 ± 0.0
Phe
3.133PheAla: 3.133 ± 0.101
0.206PheCys: 0.206 ± 0.02
2.847PheAsp: 2.847 ± 0.091
2.345PheGlu: 2.345 ± 0.077
1.931PhePhe: 1.931 ± 0.068
3.077PheGly: 3.077 ± 0.093
0.749PheHis: 0.749 ± 0.04
2.973PheIle: 2.973 ± 0.088
2.284PheLys: 2.284 ± 0.063
3.683PheLeu: 3.683 ± 0.095
1.128PheMet: 1.128 ± 0.057
2.354PheAsn: 2.354 ± 0.062
1.438PhePro: 1.438 ± 0.059
1.429PheGln: 1.429 ± 0.054
1.273PheArg: 1.273 ± 0.052
2.713PheSer: 2.713 ± 0.082
2.49PheThr: 2.49 ± 0.076
3.044PheVal: 3.044 ± 0.086
0.559PheTrp: 0.559 ± 0.041
1.641PheTyr: 1.641 ± 0.068
0.0PheXaa: 0.0 ± 0.0
Gly
4.865GlyAla: 4.865 ± 0.108
0.286GlyCys: 0.286 ± 0.023
3.503GlyAsp: 3.503 ± 0.108
3.402GlyGlu: 3.402 ± 0.084
2.878GlyPhe: 2.878 ± 0.09
4.532GlyGly: 4.532 ± 0.156
1.416GlyHis: 1.416 ± 0.051
5.76GlyIle: 5.76 ± 0.134
4.612GlyLys: 4.612 ± 0.118
6.344GlyLeu: 6.344 ± 0.135
2.035GlyMet: 2.035 ± 0.066
3.083GlyAsn: 3.083 ± 0.089
1.641GlyPro: 1.641 ± 0.063
3.04GlyGln: 3.04 ± 0.091
2.65GlyArg: 2.65 ± 0.087
4.062GlySer: 4.062 ± 0.099
4.175GlyThr: 4.175 ± 0.11
4.924GlyVal: 4.924 ± 0.101
0.751GlyTrp: 0.751 ± 0.043
2.75GlyTyr: 2.75 ± 0.064
0.002GlyXaa: 0.002 ± 0.002
His
1.119HisAla: 1.119 ± 0.05
0.091HisCys: 0.091 ± 0.014
1.135HisAsp: 1.135 ± 0.055
1.111HisGlu: 1.111 ± 0.052
0.94HisPhe: 0.94 ± 0.044
1.444HisGly: 1.444 ± 0.053
0.565HisHis: 0.565 ± 0.033
1.158HisIle: 1.158 ± 0.051
0.955HisLys: 0.955 ± 0.047
1.983HisLeu: 1.983 ± 0.086
0.446HisMet: 0.446 ± 0.031
0.899HisAsn: 0.899 ± 0.044
1.029HisPro: 1.029 ± 0.046
1.117HisGln: 1.117 ± 0.053
0.907HisArg: 0.907 ± 0.048
1.059HisSer: 1.059 ± 0.046
0.959HisThr: 0.959 ± 0.051
1.293HisVal: 1.293 ± 0.052
0.236HisTrp: 0.236 ± 0.021
0.927HisTyr: 0.927 ± 0.047
0.0HisXaa: 0.0 ± 0.0
Ile
6.208IleAla: 6.208 ± 0.137
0.429IleCys: 0.429 ± 0.03
4.554IleAsp: 4.554 ± 0.099
3.709IleGlu: 3.709 ± 0.094
2.906IlePhe: 2.906 ± 0.1
5.143IleGly: 5.143 ± 0.122
1.148IleHis: 1.148 ± 0.048
5.513IleIle: 5.513 ± 0.152
4.526IleLys: 4.526 ± 0.103
6.689IleLeu: 6.689 ± 0.153
1.832IleMet: 1.832 ± 0.057
4.064IleAsn: 4.064 ± 0.091
2.826IlePro: 2.826 ± 0.078
2.62IleGln: 2.62 ± 0.088
2.514IleArg: 2.514 ± 0.075
5.021IleSer: 5.021 ± 0.124
4.785IleThr: 4.785 ± 0.094
5.366IleVal: 5.366 ± 0.113
0.693IleTrp: 0.693 ± 0.042
2.163IleTyr: 2.163 ± 0.061
0.0IleXaa: 0.0 ± 0.0
Lys
4.839LysAla: 4.839 ± 0.118
0.11LysCys: 0.11 ± 0.017
3.345LysAsp: 3.345 ± 0.092
3.787LysGlu: 3.787 ± 0.112
2.014LysPhe: 2.014 ± 0.065
3.27LysGly: 3.27 ± 0.089
1.109LysHis: 1.109 ± 0.05
4.177LysIle: 4.177 ± 0.105
4.502LysLys: 4.502 ± 0.128
5.714LysLeu: 5.714 ± 0.103
2.232LysMet: 2.232 ± 0.07
3.436LysAsn: 3.436 ± 0.106
2.196LysPro: 2.196 ± 0.072
3.765LysGln: 3.765 ± 0.1
3.062LysArg: 3.062 ± 0.094
3.371LysSer: 3.371 ± 0.1
4.079LysThr: 4.079 ± 0.116
4.315LysVal: 4.315 ± 0.11
0.63LysTrp: 0.63 ± 0.038
2.453LysTyr: 2.453 ± 0.079
0.0LysXaa: 0.0 ± 0.0
Leu
8.897LeuAla: 8.897 ± 0.145
0.346LeuCys: 0.346 ± 0.032
5.413LeuAsp: 5.413 ± 0.12
4.781LeuGlu: 4.781 ± 0.114
3.822LeuPhe: 3.822 ± 0.107
6.708LeuGly: 6.708 ± 0.146
1.613LeuHis: 1.613 ± 0.056
7.109LeuIle: 7.109 ± 0.148
5.998LeuLys: 5.998 ± 0.12
8.666LeuLeu: 8.666 ± 0.206
2.466LeuMet: 2.466 ± 0.081
5.132LeuAsn: 5.132 ± 0.129
4.088LeuPro: 4.088 ± 0.091
3.837LeuGln: 3.837 ± 0.106
3.436LeuArg: 3.436 ± 0.104
6.117LeuSer: 6.117 ± 0.117
7.213LeuThr: 7.213 ± 0.142
6.851LeuVal: 6.851 ± 0.135
0.745LeuTrp: 0.745 ± 0.038
2.581LeuTyr: 2.581 ± 0.067
0.0LeuXaa: 0.0 ± 0.0
Met
2.603MetAla: 2.603 ± 0.081
0.078MetCys: 0.078 ± 0.014
1.533MetAsp: 1.533 ± 0.058
1.306MetGlu: 1.306 ± 0.06
0.953MetPhe: 0.953 ± 0.047
1.834MetGly: 1.834 ± 0.064
0.468MetHis: 0.468 ± 0.033
1.992MetIle: 1.992 ± 0.069
1.888MetLys: 1.888 ± 0.072
2.36MetLeu: 2.36 ± 0.075
0.769MetMet: 0.769 ± 0.046
1.514MetAsn: 1.514 ± 0.058
1.124MetPro: 1.124 ± 0.05
1.325MetGln: 1.325 ± 0.048
0.909MetArg: 0.909 ± 0.041
1.594MetSer: 1.594 ± 0.054
1.789MetThr: 1.789 ± 0.062
2.018MetVal: 2.018 ± 0.063
0.217MetTrp: 0.217 ± 0.021
0.673MetTyr: 0.673 ± 0.038
0.0MetXaa: 0.0 ± 0.0
Asn
3.677AsnAla: 3.677 ± 0.088
0.18AsnCys: 0.18 ± 0.024
3.27AsnAsp: 3.27 ± 0.082
3.042AsnGlu: 3.042 ± 0.087
2.215AsnPhe: 2.215 ± 0.071
3.874AsnGly: 3.874 ± 0.101
1.312AsnHis: 1.312 ± 0.055
3.239AsnIle: 3.239 ± 0.093
2.951AsnLys: 2.951 ± 0.086
4.809AsnLeu: 4.809 ± 0.111
1.334AsnMet: 1.334 ± 0.051
2.824AsnAsn: 2.824 ± 0.086
2.254AsnPro: 2.254 ± 0.079
3.246AsnGln: 3.246 ± 0.099
2.146AsnArg: 2.146 ± 0.074
3.127AsnSer: 3.127 ± 0.101
2.698AsnThr: 2.698 ± 0.08
3.588AsnVal: 3.588 ± 0.092
0.678AsnTrp: 0.678 ± 0.037
2.116AsnTyr: 2.116 ± 0.06
0.0AsnXaa: 0.0 ± 0.0
Pro
3.003ProAla: 3.003 ± 0.088
0.084ProCys: 0.084 ± 0.015
2.349ProAsp: 2.349 ± 0.081
2.932ProGlu: 2.932 ± 0.096
1.635ProPhe: 1.635 ± 0.059
2.154ProGly: 2.154 ± 0.07
0.643ProHis: 0.643 ± 0.038
2.817ProIle: 2.817 ± 0.083
2.323ProLys: 2.323 ± 0.073
3.335ProLeu: 3.335 ± 0.094
0.868ProMet: 0.868 ± 0.051
1.908ProAsn: 1.908 ± 0.061
0.548ProPro: 0.548 ± 0.035
1.585ProGln: 1.585 ± 0.058
1.126ProArg: 1.126 ± 0.05
1.892ProSer: 1.892 ± 0.068
2.486ProThr: 2.486 ± 0.083
2.973ProVal: 2.973 ± 0.087
0.318ProTrp: 0.318 ± 0.028
1.275ProTyr: 1.275 ± 0.052
0.0ProXaa: 0.0 ± 0.0
Gln
3.802GlnAla: 3.802 ± 0.089
0.084GlnCys: 0.084 ± 0.014
1.873GlnAsp: 1.873 ± 0.066
2.414GlnGlu: 2.414 ± 0.095
2.029GlnPhe: 2.029 ± 0.073
2.289GlnGly: 2.289 ± 0.064
0.942GlnHis: 0.942 ± 0.046
3.846GlnIle: 3.846 ± 0.096
3.014GlnLys: 3.014 ± 0.089
5.838GlnLeu: 5.838 ± 0.133
1.386GlnMet: 1.386 ± 0.057
2.423GlnAsn: 2.423 ± 0.079
1.836GlnPro: 1.836 ± 0.073
2.984GlnGln: 2.984 ± 0.141
2.308GlnArg: 2.308 ± 0.082
2.442GlnSer: 2.442 ± 0.076
3.077GlnThr: 3.077 ± 0.085
3.529GlnVal: 3.529 ± 0.09
0.431GlnTrp: 0.431 ± 0.033
1.676GlnTyr: 1.676 ± 0.062
0.0GlnXaa: 0.0 ± 0.0
Arg
2.536ArgAla: 2.536 ± 0.073
0.134ArgCys: 0.134 ± 0.018
2.083ArgAsp: 2.083 ± 0.077
2.163ArgGlu: 2.163 ± 0.082
1.86ArgPhe: 1.86 ± 0.065
2.406ArgGly: 2.406 ± 0.078
0.823ArgHis: 0.823 ± 0.04
2.739ArgIle: 2.739 ± 0.075
2.514ArgLys: 2.514 ± 0.076
3.93ArgLeu: 3.93 ± 0.1
0.961ArgMet: 0.961 ± 0.043
1.983ArgAsn: 1.983 ± 0.077
1.472ArgPro: 1.472 ± 0.056
2.261ArgGln: 2.261 ± 0.082
1.864ArgArg: 1.864 ± 0.067
2.103ArgSer: 2.103 ± 0.074
2.018ArgThr: 2.018 ± 0.066
2.678ArgVal: 2.678 ± 0.084
0.377ArgTrp: 0.377 ± 0.026
1.622ArgTyr: 1.622 ± 0.056
0.0ArgXaa: 0.0 ± 0.0
Ser
4.441SerAla: 4.441 ± 0.121
0.193SerCys: 0.193 ± 0.022
3.791SerAsp: 3.791 ± 0.102
3.495SerGlu: 3.495 ± 0.092
2.73SerPhe: 2.73 ± 0.08
4.484SerGly: 4.484 ± 0.12
1.232SerHis: 1.232 ± 0.053
4.017SerIle: 4.017 ± 0.104
3.642SerLys: 3.642 ± 0.115
5.682SerLeu: 5.682 ± 0.113
1.596SerMet: 1.596 ± 0.052
2.925SerAsn: 2.925 ± 0.08
1.845SerPro: 1.845 ± 0.065
2.802SerGln: 2.802 ± 0.09
2.252SerArg: 2.252 ± 0.071
3.778SerSer: 3.778 ± 0.142
3.538SerThr: 3.538 ± 0.109
4.088SerVal: 4.088 ± 0.1
0.695SerTrp: 0.695 ± 0.041
2.126SerTyr: 2.126 ± 0.071
0.0SerXaa: 0.0 ± 0.0
Thr
5.097ThrAla: 5.097 ± 0.107
0.178ThrCys: 0.178 ± 0.023
3.989ThrAsp: 3.989 ± 0.082
3.389ThrGlu: 3.389 ± 0.085
2.399ThrPhe: 2.399 ± 0.078
4.658ThrGly: 4.658 ± 0.122
1.126ThrHis: 1.126 ± 0.054
4.716ThrIle: 4.716 ± 0.099
4.157ThrLys: 4.157 ± 0.095
5.48ThrLeu: 5.48 ± 0.12
1.533ThrMet: 1.533 ± 0.05
3.454ThrAsn: 3.454 ± 0.095
2.765ThrPro: 2.765 ± 0.075
2.458ThrGln: 2.458 ± 0.088
1.856ThrArg: 1.856 ± 0.059
3.859ThrSer: 3.859 ± 0.099
4.14ThrThr: 4.14 ± 0.108
4.872ThrVal: 4.872 ± 0.109
0.591ThrTrp: 0.591 ± 0.037
1.962ThrTyr: 1.962 ± 0.073
0.0ThrXaa: 0.0 ± 0.0
Val
6.648ValAla: 6.648 ± 0.137
0.325ValCys: 0.325 ± 0.029
4.588ValAsp: 4.588 ± 0.106
3.794ValGlu: 3.794 ± 0.105
2.672ValPhe: 2.672 ± 0.081
5.047ValGly: 5.047 ± 0.099
1.089ValHis: 1.089 ± 0.053
5.651ValIle: 5.651 ± 0.119
4.439ValLys: 4.439 ± 0.111
6.431ValLeu: 6.431 ± 0.148
1.836ValMet: 1.836 ± 0.073
3.839ValAsn: 3.839 ± 0.105
2.756ValPro: 2.756 ± 0.088
2.497ValGln: 2.497 ± 0.081
2.49ValArg: 2.49 ± 0.078
4.692ValSer: 4.692 ± 0.096
5.229ValThr: 5.229 ± 0.114
5.851ValVal: 5.851 ± 0.12
0.676ValTrp: 0.676 ± 0.041
2.25ValTyr: 2.25 ± 0.08
0.0ValXaa: 0.0 ± 0.0
Trp
0.645TrpAla: 0.645 ± 0.039
0.045TrpCys: 0.045 ± 0.009
0.567TrpAsp: 0.567 ± 0.042
0.379TrpGlu: 0.379 ± 0.025
0.487TrpPhe: 0.487 ± 0.037
0.717TrpGly: 0.717 ± 0.039
0.301TrpHis: 0.301 ± 0.028
0.741TrpIle: 0.741 ± 0.044
0.409TrpLys: 0.409 ± 0.03
1.488TrpLeu: 1.488 ± 0.061
0.275TrpMet: 0.275 ± 0.024
0.5TrpAsn: 0.5 ± 0.03
0.351TrpPro: 0.351 ± 0.023
0.736TrpGln: 0.736 ± 0.042
0.468TrpArg: 0.468 ± 0.034
0.559TrpSer: 0.559 ± 0.037
0.45TrpThr: 0.45 ± 0.033
0.598TrpVal: 0.598 ± 0.042
0.162TrpTrp: 0.162 ± 0.017
0.385TrpTyr: 0.385 ± 0.033
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.302TyrAla: 2.302 ± 0.074
0.165TyrCys: 0.165 ± 0.023
2.258TyrAsp: 2.258 ± 0.077
1.776TyrGlu: 1.776 ± 0.074
1.706TyrPhe: 1.706 ± 0.059
2.332TyrGly: 2.332 ± 0.081
0.799TyrHis: 0.799 ± 0.041
1.944TyrIle: 1.944 ± 0.067
1.622TyrLys: 1.622 ± 0.075
3.891TyrLeu: 3.891 ± 0.101
0.736TyrMet: 0.736 ± 0.042
1.617TyrAsn: 1.617 ± 0.063
1.355TyrPro: 1.355 ± 0.057
2.213TyrGln: 2.213 ± 0.078
1.665TyrArg: 1.665 ± 0.064
2.189TyrSer: 2.189 ± 0.068
1.91TyrThr: 1.91 ± 0.07
2.462TyrVal: 2.462 ± 0.062
0.455TyrTrp: 0.455 ± 0.031
1.416TyrTyr: 1.416 ± 0.062
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.002XaaSer: 0.002 ± 0.002
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.004XaaXaa: 0.004 ± 0.005
Statistics based on 1539 proteins (461827 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski