Amino acid dipeptide frequency for Lactobacillus algidus DSM 15638

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
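The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function names `dipeptide_permille` and `classify` are hypothetical, one-letter residue codes are assumed as input, and counting overlapping pairs within each protein is an assumption about how dipeptides are tallied.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted, and pairs never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map every non-standard or unknown residue to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, n_residues=20):
    """Compare a frequency with the uniform expectation 1000 / n_residues**2."""
    expected = 1000.0 / n_residues ** 2  # 2.5 per mille for 20 residues
    if freq_permille > expected:
        return "over"   # would be marked red
    if freq_permille < expected:
        return "under"  # would be marked blue
    return "expected"


# Toy input, not real proteome data.
freqs = dipeptide_permille(["AAC", "ACD"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 1), classify(value))
```

With the table values, `classify(5.742)` (AlaAla) gives "over" and `classify(0.276)` (AlaCys) gives "under". Passing `n_residues=21` switches to the 441-dipeptide expectation that includes Xaa.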

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
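As a small worked example of the unit conversion, the snippet below turns a per mille value from the table into a fraction and a percentage, and relates it to the uniform expectation of 1/400. The helper names are hypothetical and chosen only for this illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_over_uniform(value_permille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation 1 / n_dipeptides."""
    return permille_to_fraction(value_permille) * n_dipeptides


leu_leu = 9.119  # LeuLeu frequency in per mille, taken from the table
print(permille_to_fraction(leu_leu))        # fraction of all dipeptides
print(permille_to_fraction(leu_leu) * 100)  # the same value in percent
print(fold_over_uniform(leu_leu))           # roughly 3.6 times the uniform expectation
```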

For more information, see the sequence space article on Wikipedia.

Entries are grouped by first residue. Each line gives the dipeptide (first residue, then second residue) followed by its frequency ± error in ‰. Second residues appear in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.742 ± 0.148
AlaCys: 0.276 ± 0.025
AlaAsp: 4.167 ± 0.103
AlaGlu: 4.379 ± 0.117
AlaPhe: 3.09 ± 0.098
AlaGly: 5.345 ± 0.108
AlaHis: 1.167 ± 0.052
AlaIle: 6.12 ± 0.115
AlaLys: 5.261 ± 0.133
AlaLeu: 6.712 ± 0.133
AlaMet: 1.96 ± 0.071
AlaAsn: 3.466 ± 0.089
AlaPro: 1.992 ± 0.071
AlaGln: 2.522 ± 0.077
AlaArg: 2.338 ± 0.073
AlaSer: 4.353 ± 0.099
AlaThr: 4.652 ± 0.117
AlaVal: 5.092 ± 0.095
AlaTrp: 0.566 ± 0.032
AlaTyr: 2.327 ± 0.083
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.237 ± 0.023
CysCys: 0.045 ± 0.011
CysAsp: 0.212 ± 0.019
CysGlu: 0.218 ± 0.021
CysPhe: 0.201 ± 0.02
CysGly: 0.385 ± 0.031
CysHis: 0.094 ± 0.017
CysIle: 0.25 ± 0.022
CysLys: 0.12 ± 0.018
CysLeu: 0.444 ± 0.033
CysMet: 0.09 ± 0.014
CysAsn: 0.124 ± 0.015
CysPro: 0.158 ± 0.02
CysGln: 0.169 ± 0.019
CysArg: 0.154 ± 0.018
CysSer: 0.231 ± 0.02
CysThr: 0.154 ± 0.021
CysVal: 0.259 ± 0.026
CysTrp: 0.03 ± 0.009
CysTyr: 0.171 ± 0.021
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.182 ± 0.117
AspCys: 0.197 ± 0.022
AspAsp: 3.588 ± 0.106
AspGlu: 4.259 ± 0.11
AspPhe: 2.883 ± 0.083
AspGly: 3.791 ± 0.129
AspHis: 1.019 ± 0.047
AspIle: 4.479 ± 0.107
AspLys: 4.184 ± 0.102
AspLeu: 5.471 ± 0.115
AspMet: 1.584 ± 0.054
AspAsn: 3.002 ± 0.083
AspPro: 1.855 ± 0.073
AspGln: 2.225 ± 0.077
AspArg: 2.052 ± 0.075
AspSer: 3.441 ± 0.098
AspThr: 3.193 ± 0.097
AspVal: 4.381 ± 0.108
AspTrp: 0.62 ± 0.04
AspTyr: 2.359 ± 0.079
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.364 ± 0.114
GluCys: 0.18 ± 0.02
GluAsp: 3.297 ± 0.084
GluGlu: 4.024 ± 0.121
GluPhe: 2.434 ± 0.077
GluGly: 3.058 ± 0.087
GluHis: 1.068 ± 0.047
GluIle: 5.016 ± 0.111
GluLys: 5.437 ± 0.128
GluLeu: 6.381 ± 0.146
GluMet: 1.932 ± 0.073
GluAsn: 3.661 ± 0.088
GluPro: 1.744 ± 0.071
GluGln: 2.573 ± 0.091
GluArg: 2.507 ± 0.093
GluSer: 3.768 ± 0.095
GluThr: 4.065 ± 0.103
GluVal: 4.242 ± 0.144
GluTrp: 0.592 ± 0.033
GluTyr: 1.75 ± 0.067
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.844 ± 0.085
PheCys: 0.194 ± 0.021
PheAsp: 3.022 ± 0.078
PheGlu: 2.772 ± 0.079
PhePhe: 2.261 ± 0.083
PheGly: 3.259 ± 0.098
PheHis: 0.684 ± 0.039
PheIle: 3.688 ± 0.109
PheLys: 2.819 ± 0.072
PheLeu: 3.938 ± 0.107
PheMet: 1.22 ± 0.055
PheAsn: 2.564 ± 0.08
PhePro: 1.436 ± 0.057
PheGln: 1.261 ± 0.054
PheArg: 1.216 ± 0.052
PheSer: 3.357 ± 0.104
PheThr: 2.494 ± 0.074
PheVal: 3.114 ± 0.095
PheTrp: 0.453 ± 0.034
PheTyr: 1.669 ± 0.061
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.565 ± 0.126
GlyCys: 0.306 ± 0.021
GlyAsp: 3.575 ± 0.118
GlyGlu: 3.582 ± 0.081
GlyPhe: 3.287 ± 0.093
GlyGly: 4.451 ± 0.125
GlyHis: 1.312 ± 0.058
GlyIle: 5.875 ± 0.14
GlyLys: 4.684 ± 0.138
GlyLeu: 6.665 ± 0.131
GlyMet: 2.066 ± 0.068
GlyAsn: 2.915 ± 0.094
GlyPro: 1.498 ± 0.061
GlyGln: 2.686 ± 0.085
GlyArg: 2.398 ± 0.075
GlySer: 4.421 ± 0.107
GlyThr: 4.424 ± 0.124
GlyVal: 4.791 ± 0.115
GlyTrp: 0.658 ± 0.04
GlyTyr: 2.699 ± 0.08
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.088 ± 0.044
HisCys: 0.085 ± 0.012
HisAsp: 0.983 ± 0.046
HisGlu: 1.066 ± 0.056
HisPhe: 0.964 ± 0.043
HisGly: 1.169 ± 0.058
HisHis: 0.511 ± 0.039
HisIle: 1.28 ± 0.051
HisLys: 0.932 ± 0.043
HisLeu: 1.904 ± 0.073
HisMet: 0.496 ± 0.03
HisAsn: 0.739 ± 0.041
HisPro: 0.827 ± 0.038
HisGln: 0.795 ± 0.044
HisArg: 0.669 ± 0.042
HisSer: 0.868 ± 0.048
HisThr: 0.827 ± 0.039
HisVal: 1.195 ± 0.048
HisTrp: 0.15 ± 0.021
HisTyr: 0.816 ± 0.045
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.958 ± 0.134
IleCys: 0.436 ± 0.032
IleAsp: 5.05 ± 0.115
IleGlu: 4.9 ± 0.117
IlePhe: 3.569 ± 0.121
IleGly: 5.86 ± 0.131
IleHis: 1.239 ± 0.051
IleIle: 6.296 ± 0.154
IleLys: 5.323 ± 0.095
IleLeu: 7.46 ± 0.16
IleMet: 2.122 ± 0.078
IleAsn: 4.246 ± 0.096
IlePro: 3.035 ± 0.076
IleGln: 2.977 ± 0.093
IleArg: 2.611 ± 0.082
IleSer: 5.633 ± 0.096
IleThr: 4.62 ± 0.095
IleVal: 5.723 ± 0.134
IleTrp: 0.618 ± 0.037
IleTyr: 2.413 ± 0.081
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.648 ± 0.12
LysCys: 0.16 ± 0.019
LysAsp: 4.026 ± 0.104
LysGlu: 4.932 ± 0.111
LysPhe: 2.214 ± 0.076
LysGly: 3.851 ± 0.102
LysHis: 1.068 ± 0.052
LysIle: 5.695 ± 0.121
LysLys: 6.473 ± 0.147
LysLeu: 6.065 ± 0.107
LysMet: 2.834 ± 0.073
LysAsn: 4.492 ± 0.117
LysPro: 2.105 ± 0.077
LysGln: 3.12 ± 0.094
LysArg: 2.992 ± 0.094
LysSer: 4.289 ± 0.091
LysThr: 4.657 ± 0.125
LysVal: 4.733 ± 0.152
LysTrp: 0.643 ± 0.038
LysTyr: 2.502 ± 0.082
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.407 ± 0.12
LeuCys: 0.34 ± 0.026
LeuAsp: 5.381 ± 0.109
LeuGlu: 5.3 ± 0.122
LeuPhe: 4.562 ± 0.119
LeuGly: 6.27 ± 0.119
LeuHis: 1.325 ± 0.057
LeuIle: 7.798 ± 0.16
LeuLys: 6.879 ± 0.141
LeuLeu: 9.119 ± 0.212
LeuMet: 2.733 ± 0.089
LeuAsn: 5.11 ± 0.104
LeuPro: 3.492 ± 0.088
LeuGln: 3.06 ± 0.087
LeuArg: 3.146 ± 0.084
LeuSer: 7.208 ± 0.128
LeuThr: 6.488 ± 0.124
LeuVal: 6.663 ± 0.128
LeuTrp: 0.667 ± 0.044
LeuTyr: 2.601 ± 0.071
LeuXaa: 0.002 ± 0.002

Met
MetAla: 2.421 ± 0.082
MetCys: 0.098 ± 0.015
MetAsp: 1.665 ± 0.066
MetGlu: 1.406 ± 0.053
MetPhe: 0.97 ± 0.041
MetGly: 2.071 ± 0.062
MetHis: 0.44 ± 0.032
MetIle: 2.319 ± 0.065
MetLys: 2.178 ± 0.077
MetLeu: 2.49 ± 0.078
MetMet: 0.97 ± 0.048
MetAsn: 1.56 ± 0.058
MetPro: 1.113 ± 0.044
MetGln: 1.06 ± 0.046
MetArg: 0.962 ± 0.041
MetSer: 1.96 ± 0.062
MetThr: 2.199 ± 0.065
MetVal: 2.052 ± 0.07
MetTrp: 0.177 ± 0.022
MetTyr: 0.645 ± 0.041
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.419 ± 0.104
AsnCys: 0.197 ± 0.023
AsnAsp: 3.178 ± 0.089
AsnGlu: 3.515 ± 0.1
AsnPhe: 2.105 ± 0.078
AsnGly: 3.62 ± 0.115
AsnHis: 1.096 ± 0.05
AsnIle: 3.592 ± 0.092
AsnLys: 3.759 ± 0.088
AsnLeu: 4.616 ± 0.105
AsnMet: 1.43 ± 0.05
AsnAsn: 2.693 ± 0.102
AsnPro: 2.173 ± 0.081
AsnGln: 2.52 ± 0.083
AsnArg: 2.118 ± 0.067
AsnSer: 2.793 ± 0.091
AsnThr: 2.552 ± 0.071
AsnVal: 3.571 ± 0.093
AsnTrp: 0.607 ± 0.035
AsnTyr: 1.846 ± 0.069
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.351 ± 0.081
ProCys: 0.094 ± 0.016
ProAsp: 2.056 ± 0.065
ProGlu: 2.821 ± 0.082
ProPhe: 1.584 ± 0.058
ProGly: 2.126 ± 0.065
ProHis: 0.536 ± 0.03
ProIle: 2.68 ± 0.089
ProLys: 2.389 ± 0.08
ProLeu: 2.9 ± 0.076
ProMet: 0.895 ± 0.047
ProAsn: 1.65 ± 0.065
ProPro: 0.551 ± 0.042
ProGln: 1.068 ± 0.051
ProArg: 0.94 ± 0.05
ProSer: 1.876 ± 0.065
ProThr: 2.229 ± 0.067
ProVal: 2.62 ± 0.079
ProTrp: 0.256 ± 0.02
ProTyr: 1.188 ± 0.044
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.763 ± 0.085
GlnCys: 0.079 ± 0.012
GlnAsp: 1.934 ± 0.064
GlnGlu: 2.355 ± 0.116
GlnPhe: 1.699 ± 0.059
GlnGly: 1.934 ± 0.072
GlnHis: 0.658 ± 0.04
GlnIle: 2.97 ± 0.093
GlnLys: 3.058 ± 0.088
GlnLeu: 4.505 ± 0.133
GlnMet: 1.177 ± 0.054
GlnAsn: 2.041 ± 0.065
GlnPro: 1.143 ± 0.047
GlnGln: 1.754 ± 0.072
GlnArg: 1.395 ± 0.058
GlnSer: 2.381 ± 0.08
GlnThr: 2.423 ± 0.093
GlnVal: 2.77 ± 0.089
GlnTrp: 0.291 ± 0.025
GlnTyr: 1.278 ± 0.055
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.267 ± 0.078
ArgCys: 0.126 ± 0.017
ArgAsp: 1.938 ± 0.075
ArgGlu: 2.5 ± 0.077
ArgPhe: 1.684 ± 0.066
ArgGly: 2.139 ± 0.071
ArgHis: 0.724 ± 0.037
ArgIle: 2.626 ± 0.075
ArgLys: 2.402 ± 0.076
ArgLeu: 3.53 ± 0.109
ArgMet: 1.101 ± 0.052
ArgAsn: 1.72 ± 0.07
ArgPro: 1.218 ± 0.063
ArgGln: 1.725 ± 0.065
ArgArg: 1.812 ± 0.067
ArgSer: 1.831 ± 0.063
ArgThr: 1.881 ± 0.07
ArgVal: 2.475 ± 0.085
ArgTrp: 0.271 ± 0.024
ArgTyr: 1.428 ± 0.058
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.383 ± 0.103
SerCys: 0.199 ± 0.024
SerAsp: 4.024 ± 0.105
SerGlu: 4.3 ± 0.093
SerPhe: 3.099 ± 0.085
SerGly: 4.958 ± 0.115
SerHis: 1.175 ± 0.055
SerIle: 5.118 ± 0.12
SerLys: 4.419 ± 0.114
SerLeu: 6.217 ± 0.134
SerMet: 1.652 ± 0.057
SerAsn: 3.067 ± 0.093
SerPro: 1.814 ± 0.059
SerGln: 2.291 ± 0.073
SerArg: 2.225 ± 0.073
SerSer: 4.368 ± 0.146
SerThr: 3.806 ± 0.105
SerVal: 4.618 ± 0.101
SerTrp: 0.603 ± 0.029
SerTyr: 2.378 ± 0.073
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.377 ± 0.103
ThrCys: 0.197 ± 0.024
ThrAsp: 3.941 ± 0.115
ThrGlu: 3.552 ± 0.095
ThrPhe: 2.778 ± 0.07
ThrGly: 4.492 ± 0.096
ThrHis: 1.167 ± 0.043
ThrIle: 5.437 ± 0.124
ThrLys: 4.15 ± 0.092
ThrLeu: 5.693 ± 0.13
ThrMet: 1.483 ± 0.054
ThrAsn: 3.199 ± 0.096
ThrPro: 2.492 ± 0.084
ThrGln: 2.101 ± 0.069
ThrArg: 1.746 ± 0.067
ThrSer: 3.891 ± 0.104
ThrThr: 4.159 ± 0.109
ThrVal: 4.789 ± 0.119
ThrTrp: 0.573 ± 0.042
ThrTyr: 2.167 ± 0.085
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.607 ± 0.113
ValCys: 0.316 ± 0.024
ValAsp: 4.208 ± 0.084
ValGlu: 4.005 ± 0.097
ValPhe: 2.697 ± 0.082
ValGly: 5.201 ± 0.113
ValHis: 1.088 ± 0.05
ValIle: 6.014 ± 0.144
ValLys: 4.595 ± 0.121
ValLeu: 6.458 ± 0.137
ValMet: 1.99 ± 0.067
ValAsn: 3.267 ± 0.089
ValPro: 2.646 ± 0.082
ValGln: 2.4 ± 0.117
ValArg: 2.272 ± 0.071
ValSer: 5.323 ± 0.127
ValThr: 5.159 ± 0.131
ValVal: 5.419 ± 0.114
ValTrp: 0.575 ± 0.035
ValTyr: 2.12 ± 0.068
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.56 ± 0.033
TrpCys: 0.036 ± 0.008
TrpAsp: 0.468 ± 0.038
TrpGlu: 0.385 ± 0.028
TrpPhe: 0.438 ± 0.033
TrpGly: 0.605 ± 0.034
TrpHis: 0.227 ± 0.022
TrpIle: 0.658 ± 0.039
TrpLys: 0.498 ± 0.037
TrpLeu: 1.124 ± 0.054
TrpMet: 0.235 ± 0.022
TrpAsn: 0.406 ± 0.027
TrpPro: 0.235 ± 0.025
TrpGln: 0.425 ± 0.031
TrpArg: 0.338 ± 0.029
TrpSer: 0.558 ± 0.04
TrpThr: 0.539 ± 0.034
TrpVal: 0.579 ± 0.037
TrpTrp: 0.098 ± 0.014
TrpTyr: 0.348 ± 0.027
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.344 ± 0.089
TyrCys: 0.184 ± 0.02
TyrAsp: 2.052 ± 0.063
TyrGlu: 1.821 ± 0.068
TyrPhe: 1.748 ± 0.062
TyrGly: 2.274 ± 0.076
TyrHis: 0.763 ± 0.04
TyrIle: 2.178 ± 0.074
TyrLys: 1.94 ± 0.067
TyrLeu: 3.857 ± 0.084
TyrMet: 0.784 ± 0.043
TyrAsn: 1.477 ± 0.057
TyrPro: 1.284 ± 0.057
TyrGln: 1.878 ± 0.072
TyrArg: 1.511 ± 0.066
TyrSer: 2.163 ± 0.076
TyrThr: 1.863 ± 0.085
TyrVal: 2.274 ± 0.083
TyrTrp: 0.312 ± 0.028
TyrTyr: 1.357 ± 0.062
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.002 ± 0.002
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 1505 proteins (467948 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
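A protein-level bootstrap of this kind can be sketched as follows. The exact procedure used by the database is not described here, so this is a minimal illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The names `pair_counts` and `bootstrap_error` are hypothetical, and sequences are assumed to use one-letter residue codes.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement,
    so residues from the same protein always stay together.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]  # counted once, reused
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, not real proteome data.
mean, err = bootstrap_error(["MKLLA", "MALLK", "MKKLA", "MLLLA"], "LL")
print(f"LL: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual dipeptides keeps the within-protein correlation of residues intact, which is presumably why the error is estimated at the protein level.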

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski