Amino acid dipeptide frequency for Lacrimispora algidixylanolytica

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
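As a rough illustration of the counting described above, the following Python sketch enumerates the 400 (or 441) possible dipeptides and computes dipeptide frequencies in per mille for a list of sequences. The function name `dipeptide_frequencies`, the one-letter alphabet, the use of `X` for Xaa, and the toy sequences are all assumptions made for this example. The page does not state how pairs are counted, so overlapping pairs within each protein are assumed here.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: overlapping pairs are counted within each protein,
    and a pair never spans two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            # Map any non-standard letter to X (Xaa).
            pair = "".join(c if c in STANDARD else "X" for c in seq[i:i + 2])
            counts[pair] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


n_standard = len(list(product(STANDARD, repeat=2)))        # 20 * 20 = 400
n_with_xaa = len(list(product(STANDARD + "X", repeat=2)))  # 21 * 21 = 441
uniform_per_mille = 1000.0 / n_standard                    # 2.5 per mille = 0.25%

toy = ["MKLLA", "MKKU"]  # hypothetical sequences, not taken from the proteome
freqs = dipeptide_frequencies(toy)
```

With the toy input, seven pairs are counted in total, and the non-standard letter `U` is folded into `X`, so the last pair of the second sequence is recorded as `KX`.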

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
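The unit conversion and the comparison against the uniform expectation can be sketched in a few lines of Python. The four values below are copied from the table. The helper names `to_fraction` and `classify` are hypothetical. Treating exactly 2.5‰ (1/400) as the threshold for over- and underrepresentation is an assumption, since the page does not state the precise rule behind the red and blue marking.

```python
# A few values copied from the table, in per mille.
table_per_mille = {
    "LeuLeu": 9.191,
    "AlaThr": 3.12,
    "ProPro": 0.76,
    "TrpTrp": 0.146,
}

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3


def classify(per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "expected"


labels = {pair: classify(value) for pair, value in table_per_mille.items()}
```

Under this assumption LeuLeu and AlaThr come out as overrepresented, while ProPro and TrpTrp come out as underrepresented.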

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.955 ± 0.085
AlaCys: 0.986 ± 0.029
AlaAsp: 3.702 ± 0.062
AlaGlu: 4.244 ± 0.057
AlaPhe: 3.002 ± 0.049
AlaGly: 5.651 ± 0.076
AlaHis: 1.0 ± 0.031
AlaIle: 5.155 ± 0.069
AlaLys: 4.387 ± 0.063
AlaLeu: 6.505 ± 0.083
AlaMet: 2.23 ± 0.044
AlaAsn: 2.436 ± 0.043
AlaPro: 1.911 ± 0.046
AlaGln: 1.864 ± 0.039
AlaArg: 2.531 ± 0.052
AlaSer: 4.249 ± 0.069
AlaThr: 3.12 ± 0.061
AlaVal: 5.458 ± 0.07
AlaTrp: 0.618 ± 0.024
AlaTyr: 2.6 ± 0.048
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.838 ± 0.028
CysCys: 0.242 ± 0.015
CysAsp: 0.729 ± 0.026
CysGlu: 0.775 ± 0.027
CysPhe: 0.66 ± 0.027
CysGly: 1.238 ± 0.034
CysHis: 0.276 ± 0.015
CysIle: 1.054 ± 0.028
CysLys: 0.772 ± 0.028
CysLeu: 1.172 ± 0.033
CysMet: 0.414 ± 0.017
CysAsn: 0.648 ± 0.025
CysPro: 0.567 ± 0.034
CysGln: 0.404 ± 0.017
CysArg: 0.544 ± 0.021
CysSer: 0.901 ± 0.027
CysThr: 0.659 ± 0.024
CysVal: 0.872 ± 0.027
CysTrp: 0.113 ± 0.01
CysTyr: 0.525 ± 0.021
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.275 ± 0.051
AspCys: 0.756 ± 0.026
AspAsp: 2.597 ± 0.051
AspGlu: 4.015 ± 0.059
AspPhe: 2.594 ± 0.048
AspGly: 4.256 ± 0.075
AspHis: 1.065 ± 0.033
AspIle: 4.471 ± 0.066
AspLys: 3.651 ± 0.057
AspLeu: 4.898 ± 0.066
AspMet: 1.797 ± 0.035
AspAsn: 2.27 ± 0.048
AspPro: 1.864 ± 0.039
AspGln: 1.72 ± 0.035
AspArg: 2.358 ± 0.046
AspSer: 3.285 ± 0.051
AspThr: 2.848 ± 0.061
AspVal: 3.404 ± 0.051
AspTrp: 0.622 ± 0.024
AspTyr: 2.692 ± 0.049
AspXaa: 0.001 ± 0.001
Glu
GluAla: 5.377 ± 0.072
GluCys: 0.758 ± 0.026
GluAsp: 4.042 ± 0.061
GluGlu: 7.009 ± 0.091
GluPhe: 2.674 ± 0.051
GluGly: 4.53 ± 0.059
GluHis: 1.265 ± 0.034
GluIle: 5.728 ± 0.072
GluLys: 5.85 ± 0.068
GluLeu: 6.589 ± 0.082
GluMet: 2.224 ± 0.044
GluAsn: 3.702 ± 0.055
GluPro: 1.879 ± 0.038
GluGln: 2.501 ± 0.05
GluArg: 3.163 ± 0.059
GluSer: 3.754 ± 0.061
GluThr: 3.845 ± 0.067
GluVal: 4.452 ± 0.059
GluTrp: 0.615 ± 0.024
GluTyr: 2.86 ± 0.047
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.755 ± 0.048
PheCys: 0.645 ± 0.022
PheAsp: 2.611 ± 0.04
PheGlu: 2.651 ± 0.049
PhePhe: 2.008 ± 0.048
PheGly: 3.11 ± 0.05
PheHis: 0.979 ± 0.031
PheIle: 3.114 ± 0.061
PheLys: 2.34 ± 0.049
PheLeu: 4.405 ± 0.076
PheMet: 1.278 ± 0.033
PheAsn: 1.75 ± 0.038
PhePro: 1.445 ± 0.031
PheGln: 1.532 ± 0.033
PheArg: 1.572 ± 0.037
PheSer: 3.161 ± 0.053
PheThr: 2.508 ± 0.048
PheVal: 2.827 ± 0.052
PheTrp: 0.421 ± 0.022
PheTyr: 1.947 ± 0.039
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.868 ± 0.082
GlyCys: 1.164 ± 0.033
GlyAsp: 3.678 ± 0.072
GlyGlu: 4.736 ± 0.059
GlyPhe: 3.472 ± 0.054
GlyGly: 5.112 ± 0.087
GlyHis: 1.248 ± 0.035
GlyIle: 6.691 ± 0.081
GlyLys: 5.344 ± 0.072
GlyLeu: 6.231 ± 0.086
GlyMet: 2.383 ± 0.048
GlyAsn: 3.41 ± 0.064
GlyPro: 1.694 ± 0.104
GlyGln: 1.993 ± 0.044
GlyArg: 2.72 ± 0.052
GlySer: 4.514 ± 0.061
GlyThr: 4.232 ± 0.073
GlyVal: 4.98 ± 0.073
GlyTrp: 0.761 ± 0.03
GlyTyr: 3.267 ± 0.058
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.986 ± 0.028
HisCys: 0.28 ± 0.015
HisAsp: 0.862 ± 0.025
HisGlu: 1.094 ± 0.028
HisPhe: 0.815 ± 0.025
HisGly: 1.372 ± 0.035
HisHis: 0.429 ± 0.017
HisIle: 1.417 ± 0.033
HisLys: 1.024 ± 0.029
HisLeu: 1.685 ± 0.037
HisMet: 0.562 ± 0.024
HisAsn: 0.764 ± 0.024
HisPro: 0.763 ± 0.025
HisGln: 0.554 ± 0.019
HisArg: 0.727 ± 0.021
HisSer: 1.122 ± 0.029
HisThr: 0.957 ± 0.029
HisVal: 1.111 ± 0.03
HisTrp: 0.172 ± 0.011
HisTyr: 0.849 ± 0.022
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.315 ± 0.073
IleCys: 1.239 ± 0.033
IleAsp: 4.119 ± 0.056
IleGlu: 4.876 ± 0.077
IlePhe: 3.176 ± 0.062
IleGly: 5.439 ± 0.073
IleHis: 1.482 ± 0.038
IleIle: 5.803 ± 0.082
IleLys: 4.996 ± 0.067
IleLeu: 7.626 ± 0.089
IleMet: 2.171 ± 0.043
IleAsn: 3.556 ± 0.056
IlePro: 3.416 ± 0.06
IleGln: 2.431 ± 0.05
IleArg: 3.523 ± 0.06
IleSer: 5.647 ± 0.061
IleThr: 4.617 ± 0.056
IleVal: 4.757 ± 0.074
IleTrp: 0.681 ± 0.025
IleTyr: 2.94 ± 0.051
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.799 ± 0.069
LysCys: 0.549 ± 0.023
LysAsp: 4.337 ± 0.062
LysGlu: 7.264 ± 0.073
LysPhe: 1.935 ± 0.042
LysGly: 4.568 ± 0.059
LysHis: 0.969 ± 0.027
LysIle: 4.888 ± 0.069
LysLys: 6.112 ± 0.077
LysLeu: 5.591 ± 0.075
LysMet: 2.087 ± 0.038
LysAsn: 3.735 ± 0.055
LysPro: 2.066 ± 0.042
LysGln: 2.36 ± 0.043
LysArg: 3.038 ± 0.051
LysSer: 3.871 ± 0.056
LysThr: 3.906 ± 0.059
LysVal: 4.355 ± 0.065
LysTrp: 0.609 ± 0.025
LysTyr: 2.552 ± 0.046
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.167 ± 0.076
LeuCys: 1.376 ± 0.032
LeuAsp: 4.949 ± 0.061
LeuGlu: 6.293 ± 0.083
LeuPhe: 4.208 ± 0.075
LeuGly: 6.046 ± 0.08
LeuHis: 1.507 ± 0.03
LeuIle: 6.768 ± 0.087
LeuLys: 7.107 ± 0.08
LeuLeu: 9.191 ± 0.103
LeuMet: 2.743 ± 0.047
LeuAsn: 4.45 ± 0.058
LeuPro: 3.566 ± 0.052
LeuGln: 2.451 ± 0.044
LeuArg: 3.437 ± 0.058
LeuSer: 7.265 ± 0.071
LeuThr: 5.389 ± 0.061
LeuVal: 5.633 ± 0.066
LeuTrp: 0.836 ± 0.023
LeuTyr: 3.415 ± 0.054
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.392 ± 0.043
MetCys: 0.285 ± 0.014
MetAsp: 1.854 ± 0.034
MetGlu: 2.775 ± 0.046
MetPhe: 1.065 ± 0.031
MetGly: 2.169 ± 0.038
MetHis: 0.413 ± 0.017
MetIle: 2.396 ± 0.049
MetLys: 2.573 ± 0.044
MetLeu: 2.48 ± 0.04
MetMet: 0.891 ± 0.024
MetAsn: 1.627 ± 0.034
MetPro: 1.107 ± 0.034
MetGln: 0.863 ± 0.027
MetArg: 1.143 ± 0.033
MetSer: 1.839 ± 0.034
MetThr: 1.762 ± 0.043
MetVal: 2.093 ± 0.046
MetTrp: 0.222 ± 0.013
MetTyr: 0.836 ± 0.024
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.894 ± 0.048
AsnCys: 0.562 ± 0.022
AsnAsp: 2.26 ± 0.043
AsnGlu: 2.963 ± 0.053
AsnPhe: 1.736 ± 0.039
AsnGly: 3.652 ± 0.063
AsnHis: 0.998 ± 0.027
AsnIle: 3.56 ± 0.063
AsnLys: 2.976 ± 0.057
AsnLeu: 4.146 ± 0.063
AsnMet: 1.391 ± 0.031
AsnAsn: 2.141 ± 0.053
AsnPro: 2.132 ± 0.041
AsnGln: 1.942 ± 0.042
AsnArg: 2.045 ± 0.034
AsnSer: 2.789 ± 0.049
AsnThr: 2.467 ± 0.043
AsnVal: 2.888 ± 0.047
AsnTrp: 0.475 ± 0.022
AsnTyr: 2.019 ± 0.041
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.108 ± 0.045
ProCys: 0.448 ± 0.021
ProAsp: 2.218 ± 0.041
ProGlu: 3.022 ± 0.052
ProPhe: 1.678 ± 0.039
ProGly: 2.486 ± 0.058
ProHis: 0.63 ± 0.022
ProIle: 2.393 ± 0.048
ProLys: 1.927 ± 0.043
ProLeu: 2.964 ± 0.052
ProMet: 0.982 ± 0.027
ProAsn: 1.372 ± 0.036
ProPro: 0.76 ± 0.027
ProGln: 1.047 ± 0.027
ProArg: 0.988 ± 0.029
ProSer: 2.029 ± 0.047
ProThr: 1.784 ± 0.104
ProVal: 2.964 ± 0.049
ProTrp: 0.369 ± 0.02
ProTyr: 1.52 ± 0.032
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.224 ± 0.049
GlnCys: 0.318 ± 0.016
GlnAsp: 1.548 ± 0.038
GlnGlu: 2.501 ± 0.046
GlnPhe: 1.271 ± 0.033
GlnGly: 2.073 ± 0.041
GlnHis: 0.41 ± 0.016
GlnIle: 2.637 ± 0.051
GlnLys: 2.45 ± 0.041
GlnLeu: 2.772 ± 0.051
GlnMet: 1.19 ± 0.031
GlnAsn: 1.572 ± 0.04
GlnPro: 0.874 ± 0.027
GlnGln: 0.997 ± 0.031
GlnArg: 1.19 ± 0.031
GlnSer: 1.867 ± 0.04
GlnThr: 1.714 ± 0.046
GlnVal: 2.164 ± 0.043
GlnTrp: 0.358 ± 0.017
GlnTyr: 1.438 ± 0.034
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.242 ± 0.039
ArgCys: 0.513 ± 0.021
ArgAsp: 2.032 ± 0.039
ArgGlu: 3.202 ± 0.054
ArgPhe: 1.906 ± 0.037
ArgGly: 2.331 ± 0.043
ArgHis: 0.703 ± 0.024
ArgIle: 3.381 ± 0.053
ArgLys: 2.981 ± 0.053
ArgLeu: 3.932 ± 0.061
ArgMet: 1.345 ± 0.026
ArgAsn: 2.099 ± 0.043
ArgPro: 1.226 ± 0.033
ArgGln: 1.406 ± 0.034
ArgArg: 1.764 ± 0.043
ArgSer: 2.21 ± 0.043
ArgThr: 2.004 ± 0.037
ArgVal: 2.43 ± 0.041
ArgTrp: 0.385 ± 0.018
ArgTyr: 1.852 ± 0.039
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.046 ± 0.051
SerCys: 0.85 ± 0.029
SerAsp: 3.413 ± 0.051
SerGlu: 3.912 ± 0.053
SerPhe: 3.104 ± 0.044
SerGly: 5.564 ± 0.074
SerHis: 1.148 ± 0.027
SerIle: 5.016 ± 0.069
SerLys: 4.012 ± 0.055
SerLeu: 6.109 ± 0.072
SerMet: 2.09 ± 0.039
SerAsn: 2.784 ± 0.048
SerPro: 2.013 ± 0.042
SerGln: 2.272 ± 0.044
SerArg: 2.567 ± 0.046
SerSer: 4.259 ± 0.067
SerThr: 3.115 ± 0.052
SerVal: 4.432 ± 0.06
SerTrp: 0.675 ± 0.023
SerTyr: 2.761 ± 0.052
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.991 ± 0.057
ThrCys: 0.643 ± 0.024
ThrAsp: 2.922 ± 0.055
ThrGlu: 3.496 ± 0.052
ThrPhe: 2.318 ± 0.048
ThrGly: 5.037 ± 0.172
ThrHis: 0.882 ± 0.027
ThrIle: 4.39 ± 0.052
ThrLys: 3.332 ± 0.05
ThrLeu: 4.971 ± 0.058
ThrMet: 1.578 ± 0.036
ThrAsn: 2.263 ± 0.044
ThrPro: 2.174 ± 0.046
ThrGln: 1.547 ± 0.032
ThrArg: 1.863 ± 0.035
ThrSer: 3.383 ± 0.06
ThrThr: 3.015 ± 0.056
ThrVal: 4.188 ± 0.061
ThrTrp: 0.496 ± 0.02
ThrTyr: 2.257 ± 0.048
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.409 ± 0.068
ValCys: 1.032 ± 0.029
ValAsp: 3.511 ± 0.051
ValGlu: 4.336 ± 0.053
ValPhe: 3.11 ± 0.053
ValGly: 4.164 ± 0.063
ValHis: 1.041 ± 0.029
ValIle: 5.434 ± 0.08
ValLys: 4.548 ± 0.062
ValLeu: 6.52 ± 0.065
ValMet: 2.065 ± 0.046
ValAsn: 2.976 ± 0.052
ValPro: 2.491 ± 0.039
ValGln: 1.688 ± 0.032
ValArg: 2.528 ± 0.047
ValSer: 4.855 ± 0.052
ValThr: 4.205 ± 0.065
ValVal: 4.543 ± 0.071
ValTrp: 0.615 ± 0.024
ValTyr: 2.564 ± 0.039
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.539 ± 0.021
TrpCys: 0.145 ± 0.012
TrpAsp: 0.574 ± 0.023
TrpGlu: 0.673 ± 0.023
TrpPhe: 0.455 ± 0.02
TrpGly: 0.697 ± 0.026
TrpHis: 0.171 ± 0.012
TrpIle: 0.658 ± 0.023
TrpLys: 0.745 ± 0.025
TrpLeu: 0.931 ± 0.032
TrpMet: 0.266 ± 0.015
TrpAsn: 0.606 ± 0.022
TrpPro: 0.225 ± 0.014
TrpGln: 0.355 ± 0.018
TrpArg: 0.364 ± 0.017
TrpSer: 0.59 ± 0.024
TrpThr: 0.451 ± 0.017
TrpVal: 0.568 ± 0.02
TrpTrp: 0.146 ± 0.012
TrpTyr: 0.44 ± 0.022
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.422 ± 0.045
TyrCys: 0.61 ± 0.021
TyrAsp: 2.48 ± 0.061
TyrGlu: 2.979 ± 0.049
TyrPhe: 1.876 ± 0.041
TyrGly: 3.001 ± 0.048
TyrHis: 0.93 ± 0.03
TyrIle: 2.858 ± 0.048
TyrLys: 2.463 ± 0.045
TyrLeu: 4.081 ± 0.066
TyrMet: 1.098 ± 0.029
TyrAsn: 1.873 ± 0.042
TyrPro: 1.518 ± 0.039
TyrGln: 1.64 ± 0.034
TyrArg: 1.88 ± 0.038
TyrSer: 2.55 ± 0.047
TyrThr: 2.15 ± 0.043
TyrVal: 2.516 ± 0.045
TyrTrp: 0.41 ± 0.019
TyrTyr: 1.96 ± 0.042
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.001 ± 0.001
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4081 proteins (1335621 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
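One way to read "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation across resamples is taken as the error. This is only an assumed reading. The function names, the one-letter sequences, the fixed seed, and the use of the standard deviation are illustrative choices and are not confirmed by the page.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille,
    estimated by resampling whole proteins with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]  # count once, reuse below
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the proteome.
proteins = ["MKLLALLK", "MKKAGLL", "MALLWKLL", "MGGSLLA"]
mean, err = bootstrap_error(proteins, "LL")
```

Resampling proteins rather than individual dipeptides keeps each sequence intact, so any correlation between pairs within one protein is carried into the error estimate.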

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski