Amino acid dipepetide frequency for Lactobacillus collinoides DSM 20515 = JCM 1123

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.135AlaAla: 8.135 ± 0.11
0.433AlaCys: 0.433 ± 0.022
5.813AlaAsp: 5.813 ± 0.088
4.278AlaGlu: 4.278 ± 0.076
3.619AlaPhe: 3.619 ± 0.062
6.554AlaGly: 6.554 ± 0.084
1.79AlaHis: 1.79 ± 0.044
6.128AlaIle: 6.128 ± 0.083
5.539AlaLys: 5.539 ± 0.086
8.356AlaLeu: 8.356 ± 0.106
2.34AlaMet: 2.34 ± 0.052
3.826AlaAsn: 3.826 ± 0.071
2.546AlaPro: 2.546 ± 0.053
3.638AlaGln: 3.638 ± 0.077
2.822AlaArg: 2.822 ± 0.056
5.078AlaSer: 5.078 ± 0.085
5.843AlaThr: 5.843 ± 0.121
6.581AlaVal: 6.581 ± 0.096
0.738AlaTrp: 0.738 ± 0.027
2.864AlaTyr: 2.864 ± 0.057
0.0AlaXaa: 0.0 ± 0.0
Cys
0.381CysAla: 0.381 ± 0.021
0.054CysCys: 0.054 ± 0.008
0.273CysAsp: 0.273 ± 0.016
0.206CysGlu: 0.206 ± 0.015
0.251CysPhe: 0.251 ± 0.02
0.502CysGly: 0.502 ± 0.024
0.172CysHis: 0.172 ± 0.013
0.272CysIle: 0.272 ± 0.017
0.141CysLys: 0.141 ± 0.012
0.524CysLeu: 0.524 ± 0.025
0.092CysMet: 0.092 ± 0.01
0.171CysAsn: 0.171 ± 0.013
0.213CysPro: 0.213 ± 0.018
0.209CysGln: 0.209 ± 0.015
0.206CysArg: 0.206 ± 0.016
0.268CysSer: 0.268 ± 0.017
0.259CysThr: 0.259 ± 0.016
0.351CysVal: 0.351 ± 0.02
0.085CysTrp: 0.085 ± 0.01
0.217CysTyr: 0.217 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
5.063AspAla: 5.063 ± 0.081
0.291AspCys: 0.291 ± 0.02
4.162AspAsp: 4.162 ± 0.095
3.768AspGlu: 3.768 ± 0.077
2.616AspPhe: 2.616 ± 0.053
3.934AspGly: 3.934 ± 0.065
1.577AspHis: 1.577 ± 0.045
3.549AspIle: 3.549 ± 0.064
3.283AspLys: 3.283 ± 0.065
5.615AspLeu: 5.615 ± 0.067
1.518AspMet: 1.518 ± 0.038
2.67AspAsn: 2.67 ± 0.056
2.406AspPro: 2.406 ± 0.055
2.985AspGln: 2.985 ± 0.06
2.416AspArg: 2.416 ± 0.054
3.076AspSer: 3.076 ± 0.069
3.719AspThr: 3.719 ± 0.092
4.304AspVal: 4.304 ± 0.065
0.804AspTrp: 0.804 ± 0.027
2.48AspTyr: 2.48 ± 0.051
0.0AspXaa: 0.0 ± 0.0
Glu
4.336GluAla: 4.336 ± 0.079
0.174GluCys: 0.174 ± 0.014
2.707GluAsp: 2.707 ± 0.057
2.529GluGlu: 2.529 ± 0.058
1.716GluPhe: 1.716 ± 0.048
2.627GluGly: 2.627 ± 0.06
1.089GluHis: 1.089 ± 0.037
3.319GluIle: 3.319 ± 0.061
3.445GluLys: 3.445 ± 0.075
5.065GluLeu: 5.065 ± 0.087
1.521GluMet: 1.521 ± 0.046
2.547GluAsn: 2.547 ± 0.057
1.664GluPro: 1.664 ± 0.048
2.351GluGln: 2.351 ± 0.052
2.262GluArg: 2.262 ± 0.055
2.495GluSer: 2.495 ± 0.054
3.761GluThr: 3.761 ± 0.072
3.211GluVal: 3.211 ± 0.061
0.5GluTrp: 0.5 ± 0.025
1.379GluTyr: 1.379 ± 0.042
0.0GluXaa: 0.0 ± 0.0
Phe
3.414PheAla: 3.414 ± 0.07
0.291PheCys: 0.291 ± 0.02
2.914PheAsp: 2.914 ± 0.058
2.087PheGlu: 2.087 ± 0.046
1.836PhePhe: 1.836 ± 0.047
3.34PheGly: 3.34 ± 0.075
0.872PheHis: 0.872 ± 0.034
2.766PheIle: 2.766 ± 0.063
2.433PheLys: 2.433 ± 0.056
3.704PheLeu: 3.704 ± 0.079
1.052PheMet: 1.052 ± 0.033
2.097PheAsn: 2.097 ± 0.047
1.427PhePro: 1.427 ± 0.04
1.463PheGln: 1.463 ± 0.037
1.298PheArg: 1.298 ± 0.042
2.999PheSer: 2.999 ± 0.058
2.683PheThr: 2.683 ± 0.059
3.009PheVal: 3.009 ± 0.066
0.549PheTrp: 0.549 ± 0.026
1.639PheTyr: 1.639 ± 0.04
0.0PheXaa: 0.0 ± 0.0
Gly
5.374GlyAla: 5.374 ± 0.092
0.454GlyCys: 0.454 ± 0.023
3.814GlyAsp: 3.814 ± 0.067
3.232GlyGlu: 3.232 ± 0.064
3.21GlyPhe: 3.21 ± 0.064
4.673GlyGly: 4.673 ± 0.1
1.758GlyHis: 1.758 ± 0.049
5.609GlyIle: 5.609 ± 0.084
4.095GlyLys: 4.095 ± 0.077
6.91GlyLeu: 6.91 ± 0.099
1.947GlyMet: 1.947 ± 0.046
2.839GlyAsn: 2.839 ± 0.068
1.697GlyPro: 1.697 ± 0.045
2.904GlyGln: 2.904 ± 0.058
2.689GlyArg: 2.689 ± 0.05
4.065GlySer: 4.065 ± 0.073
4.831GlyThr: 4.831 ± 0.112
5.402GlyVal: 5.402 ± 0.065
0.797GlyTrp: 0.797 ± 0.03
2.892GlyTyr: 2.892 ± 0.05
0.0GlyXaa: 0.0 ± 0.0
His
1.704HisAla: 1.704 ± 0.045
0.12HisCys: 0.12 ± 0.012
1.467HisAsp: 1.467 ± 0.041
1.159HisGlu: 1.159 ± 0.035
1.147HisPhe: 1.147 ± 0.038
1.617HisGly: 1.617 ± 0.044
0.791HisHis: 0.791 ± 0.032
1.33HisIle: 1.33 ± 0.039
0.938HisLys: 0.938 ± 0.028
2.282HisLeu: 2.282 ± 0.051
0.592HisMet: 0.592 ± 0.023
0.952HisAsn: 0.952 ± 0.033
1.125HisPro: 1.125 ± 0.035
1.243HisGln: 1.243 ± 0.039
1.053HisArg: 1.053 ± 0.039
1.18HisSer: 1.18 ± 0.036
1.243HisThr: 1.243 ± 0.034
1.586HisVal: 1.586 ± 0.04
0.312HisTrp: 0.312 ± 0.018
1.028HisTyr: 1.028 ± 0.03
0.0HisXaa: 0.0 ± 0.0
Ile
6.115IleAla: 6.115 ± 0.089
0.431IleCys: 0.431 ± 0.024
4.236IleAsp: 4.236 ± 0.081
3.159IleGlu: 3.159 ± 0.064
2.66IlePhe: 2.66 ± 0.063
5.403IleGly: 5.403 ± 0.089
1.38IleHis: 1.38 ± 0.033
4.607IleIle: 4.607 ± 0.093
3.618IleLys: 3.618 ± 0.062
5.877IleLeu: 5.877 ± 0.094
1.714IleMet: 1.714 ± 0.045
3.264IleAsn: 3.264 ± 0.062
2.597IlePro: 2.597 ± 0.055
2.53IleGln: 2.53 ± 0.061
2.496IleArg: 2.496 ± 0.053
4.473IleSer: 4.473 ± 0.073
4.537IleThr: 4.537 ± 0.074
5.231IleVal: 5.231 ± 0.086
0.655IleTrp: 0.655 ± 0.03
2.021IleTyr: 2.021 ± 0.042
0.0IleXaa: 0.0 ± 0.0
Lys
5.026LysAla: 5.026 ± 0.075
0.151LysCys: 0.151 ± 0.012
3.073LysAsp: 3.073 ± 0.049
2.841LysGlu: 2.841 ± 0.065
1.71LysPhe: 1.71 ± 0.045
3.144LysGly: 3.144 ± 0.063
1.298LysHis: 1.298 ± 0.037
3.626LysIle: 3.626 ± 0.066
3.858LysLys: 3.858 ± 0.085
5.376LysLeu: 5.376 ± 0.091
1.953LysMet: 1.953 ± 0.05
2.851LysAsn: 2.851 ± 0.06
2.249LysPro: 2.249 ± 0.051
3.292LysGln: 3.292 ± 0.073
3.065LysArg: 3.065 ± 0.054
3.152LysSer: 3.152 ± 0.071
4.571LysThr: 4.571 ± 0.085
3.78LysVal: 3.78 ± 0.074
0.597LysTrp: 0.597 ± 0.026
1.962LysTyr: 1.962 ± 0.043
0.0LysXaa: 0.0 ± 0.0
Leu
9.09LeuAla: 9.09 ± 0.094
0.496LeuCys: 0.496 ± 0.022
5.252LeuAsp: 5.252 ± 0.093
3.781LeuGlu: 3.781 ± 0.075
4.094LeuPhe: 4.094 ± 0.081
6.854LeuGly: 6.854 ± 0.098
1.885LeuHis: 1.885 ± 0.047
6.866LeuIle: 6.866 ± 0.119
6.021LeuLys: 6.021 ± 0.094
9.1LeuLeu: 9.1 ± 0.141
2.703LeuMet: 2.703 ± 0.056
4.694LeuAsn: 4.694 ± 0.069
4.377LeuPro: 4.377 ± 0.081
3.515LeuGln: 3.515 ± 0.061
3.568LeuArg: 3.568 ± 0.072
6.606LeuSer: 6.606 ± 0.086
7.72LeuThr: 7.72 ± 0.106
6.877LeuVal: 6.877 ± 0.092
0.862LeuTrp: 0.862 ± 0.031
2.645LeuTyr: 2.645 ± 0.058
0.0LeuXaa: 0.0 ± 0.0
Met
2.674MetAla: 2.674 ± 0.054
0.106MetCys: 0.106 ± 0.011
1.292MetAsp: 1.292 ± 0.044
0.924MetGlu: 0.924 ± 0.033
0.916MetPhe: 0.916 ± 0.032
1.94MetGly: 1.94 ± 0.055
0.524MetHis: 0.524 ± 0.023
1.933MetIle: 1.933 ± 0.05
1.689MetLys: 1.689 ± 0.047
2.351MetLeu: 2.351 ± 0.049
0.887MetMet: 0.887 ± 0.032
1.253MetAsn: 1.253 ± 0.038
1.232MetPro: 1.232 ± 0.04
1.137MetGln: 1.137 ± 0.031
1.023MetArg: 1.023 ± 0.034
1.71MetSer: 1.71 ± 0.039
2.464MetThr: 2.464 ± 0.05
2.001MetVal: 2.001 ± 0.044
0.204MetTrp: 0.204 ± 0.014
0.643MetTyr: 0.643 ± 0.025
0.0MetXaa: 0.0 ± 0.0
Asn
3.765AsnAla: 3.765 ± 0.063
0.208AsnCys: 0.208 ± 0.015
2.873AsnAsp: 2.873 ± 0.05
2.471AsnGlu: 2.471 ± 0.055
1.765AsnPhe: 1.765 ± 0.048
3.529AsnGly: 3.529 ± 0.072
1.218AsnHis: 1.218 ± 0.035
2.611AsnIle: 2.611 ± 0.046
2.452AsnLys: 2.452 ± 0.058
4.069AsnLeu: 4.069 ± 0.053
1.173AsnMet: 1.173 ± 0.033
2.209AsnAsn: 2.209 ± 0.056
1.991AsnPro: 1.991 ± 0.04
2.479AsnGln: 2.479 ± 0.052
2.062AsnArg: 2.062 ± 0.054
2.461AsnSer: 2.461 ± 0.069
2.716AsnThr: 2.716 ± 0.065
3.307AsnVal: 3.307 ± 0.064
0.619AsnTrp: 0.619 ± 0.028
1.699AsnTyr: 1.699 ± 0.045
0.0AsnXaa: 0.0 ± 0.0
Pro
3.266ProAla: 3.266 ± 0.063
0.125ProCys: 0.125 ± 0.011
2.636ProAsp: 2.636 ± 0.057
2.794ProGlu: 2.794 ± 0.063
1.7ProPhe: 1.7 ± 0.041
2.334ProGly: 2.334 ± 0.06
0.732ProHis: 0.732 ± 0.028
2.525ProIle: 2.525 ± 0.041
2.215ProLys: 2.215 ± 0.047
3.419ProLeu: 3.419 ± 0.066
0.884ProMet: 0.884 ± 0.028
1.787ProAsn: 1.787 ± 0.044
0.547ProPro: 0.547 ± 0.024
1.52ProGln: 1.52 ± 0.044
1.078ProArg: 1.078 ± 0.033
2.105ProSer: 2.105 ± 0.05
2.654ProThr: 2.654 ± 0.058
3.044ProVal: 3.044 ± 0.061
0.394ProTrp: 0.394 ± 0.019
1.283ProTyr: 1.283 ± 0.04
0.002ProXaa: 0.002 ± 0.001
Gln
4.133GlnAla: 4.133 ± 0.068
0.136GlnCys: 0.136 ± 0.014
2.055GlnAsp: 2.055 ± 0.05
2.012GlnGlu: 2.012 ± 0.048
1.851GlnPhe: 1.851 ± 0.044
2.259GlnGly: 2.259 ± 0.048
1.142GlnHis: 1.142 ± 0.03
2.896GlnIle: 2.896 ± 0.062
2.362GlnLys: 2.362 ± 0.058
5.46GlnLeu: 5.46 ± 0.099
1.152GlnMet: 1.152 ± 0.033
1.899GlnAsn: 1.899 ± 0.05
1.758GlnPro: 1.758 ± 0.04
2.679GlnGln: 2.679 ± 0.077
2.238GlnArg: 2.238 ± 0.059
2.488GlnSer: 2.488 ± 0.05
3.15GlnThr: 3.15 ± 0.074
3.162GlnVal: 3.162 ± 0.061
0.48GlnTrp: 0.48 ± 0.023
1.532GlnTyr: 1.532 ± 0.042
0.0GlnXaa: 0.0 ± 0.0
Arg
2.833ArgAla: 2.833 ± 0.058
0.177ArgCys: 0.177 ± 0.013
2.342ArgAsp: 2.342 ± 0.054
2.23ArgGlu: 2.23 ± 0.058
1.919ArgPhe: 1.919 ± 0.048
2.261ArgGly: 2.261 ± 0.057
1.178ArgHis: 1.178 ± 0.032
2.489ArgIle: 2.489 ± 0.048
2.268ArgLys: 2.268 ± 0.05
4.338ArgLeu: 4.338 ± 0.078
1.099ArgMet: 1.099 ± 0.036
1.731ArgAsn: 1.731 ± 0.041
1.471ArgPro: 1.471 ± 0.042
2.23ArgGln: 2.23 ± 0.056
2.005ArgArg: 2.005 ± 0.055
2.015ArgSer: 2.015 ± 0.045
2.191ArgThr: 2.191 ± 0.05
2.88ArgVal: 2.88 ± 0.062
0.409ArgTrp: 0.409 ± 0.021
1.658ArgTyr: 1.658 ± 0.046
0.0ArgXaa: 0.0 ± 0.0
Ser
5.005SerAla: 5.005 ± 0.094
0.255SerCys: 0.255 ± 0.015
3.78SerAsp: 3.78 ± 0.08
2.976SerGlu: 2.976 ± 0.061
2.791SerPhe: 2.791 ± 0.062
4.823SerGly: 4.823 ± 0.083
1.267SerHis: 1.267 ± 0.038
3.687SerIle: 3.687 ± 0.067
3.373SerLys: 3.373 ± 0.077
5.788SerLeu: 5.788 ± 0.084
1.556SerMet: 1.556 ± 0.042
2.585SerAsn: 2.585 ± 0.071
1.921SerPro: 1.921 ± 0.044
2.808SerGln: 2.808 ± 0.058
2.419SerArg: 2.419 ± 0.055
4.305SerSer: 4.305 ± 0.135
4.116SerThr: 4.116 ± 0.109
4.274SerVal: 4.274 ± 0.065
0.668SerTrp: 0.668 ± 0.032
2.114SerTyr: 2.114 ± 0.057
0.0SerXaa: 0.0 ± 0.0
Thr
6.361ThrAla: 6.361 ± 0.12
0.231ThrCys: 0.231 ± 0.02
4.479ThrAsp: 4.479 ± 0.092
3.152ThrGlu: 3.152 ± 0.065
2.804ThrPhe: 2.804 ± 0.06
5.225ThrGly: 5.225 ± 0.095
1.439ThrHis: 1.439 ± 0.04
5.034ThrIle: 5.034 ± 0.091
3.801ThrLys: 3.801 ± 0.078
6.581ThrLeu: 6.581 ± 0.086
1.692ThrMet: 1.692 ± 0.04
3.009ThrAsn: 3.009 ± 0.085
3.155ThrPro: 3.155 ± 0.06
2.659ThrGln: 2.659 ± 0.053
2.239ThrArg: 2.239 ± 0.053
4.416ThrSer: 4.416 ± 0.106
5.369ThrThr: 5.369 ± 0.165
5.614ThrVal: 5.614 ± 0.109
0.697ThrTrp: 0.697 ± 0.029
2.434ThrTyr: 2.434 ± 0.07
0.0ThrXaa: 0.0 ± 0.0
Val
6.869ValAla: 6.869 ± 0.092
0.437ValCys: 0.437 ± 0.02
4.294ValAsp: 4.294 ± 0.076
3.094ValGlu: 3.094 ± 0.065
2.978ValPhe: 2.978 ± 0.062
5.082ValGly: 5.082 ± 0.083
1.435ValHis: 1.435 ± 0.039
5.297ValIle: 5.297 ± 0.082
4.151ValLys: 4.151 ± 0.064
6.941ValLeu: 6.941 ± 0.098
1.948ValMet: 1.948 ± 0.04
3.342ValAsn: 3.342 ± 0.066
2.963ValPro: 2.963 ± 0.048
2.481ValGln: 2.481 ± 0.051
2.558ValArg: 2.558 ± 0.063
5.069ValSer: 5.069 ± 0.071
5.877ValThr: 5.877 ± 0.118
5.644ValVal: 5.644 ± 0.095
0.705ValTrp: 0.705 ± 0.029
2.245ValTyr: 2.245 ± 0.049
0.0ValXaa: 0.0 ± 0.0
Trp
0.749TrpAla: 0.749 ± 0.033
0.065TrpCys: 0.065 ± 0.008
0.503TrpAsp: 0.503 ± 0.026
0.402TrpGlu: 0.402 ± 0.02
0.543TrpPhe: 0.543 ± 0.024
0.654TrpGly: 0.654 ± 0.024
0.361TrpHis: 0.361 ± 0.02
0.671TrpIle: 0.671 ± 0.029
0.363TrpLys: 0.363 ± 0.018
1.499TrpLeu: 1.499 ± 0.048
0.285TrpMet: 0.285 ± 0.016
0.443TrpAsn: 0.443 ± 0.023
0.324TrpPro: 0.324 ± 0.021
0.712TrpGln: 0.712 ± 0.028
0.591TrpArg: 0.591 ± 0.023
0.607TrpSer: 0.607 ± 0.029
0.586TrpThr: 0.586 ± 0.029
0.777TrpVal: 0.777 ± 0.033
0.167TrpTrp: 0.167 ± 0.014
0.388TrpTyr: 0.388 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.629TyrAla: 2.629 ± 0.056
0.211TyrCys: 0.211 ± 0.014
2.271TyrAsp: 2.271 ± 0.051
1.606TyrGlu: 1.606 ± 0.047
1.745TyrPhe: 1.745 ± 0.043
2.438TyrGly: 2.438 ± 0.047
0.957TyrHis: 0.957 ± 0.036
1.788TyrIle: 1.788 ± 0.042
1.527TyrLys: 1.527 ± 0.044
3.75TyrLeu: 3.75 ± 0.069
0.798TyrMet: 0.798 ± 0.03
1.507TyrAsn: 1.507 ± 0.037
1.382TyrPro: 1.382 ± 0.036
2.01TyrGln: 2.01 ± 0.049
1.635TyrArg: 1.635 ± 0.045
1.941TyrSer: 1.941 ± 0.048
2.102TyrThr: 2.102 ± 0.056
2.376TyrVal: 2.376 ± 0.056
0.449TyrTrp: 0.449 ± 0.021
1.437TyrTyr: 1.437 ± 0.042
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.001XaaLys: 0.001 ± 0.001
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.001XaaSer: 0.001 ± 0.001
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.03XaaXaa: 0.03 ± 0.019
Statistics based on 3157 proteins (958480 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski