Amino acid dipeptide frequency for Lachnospiraceae bacterium XBD2001

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
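As a minimal sketch of how such a frequency could be computed, the Python snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the rule that any non-standard letter is mapped to X (Xaa) are illustrative assumptions, not the code used to build this table.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # one-letter codes of the 20 standard amino acids

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Pairs overlap and never span two proteins. Letters outside the 20
    standard amino acids are mapped to 'X' (Xaa); this is an assumption.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: the pairs are MA, AA, AL (first protein) and MA, AL (second).
freqs = dipeptide_frequencies(["MAAL", "MAL"])
```

Here `freqs` uses one-letter codes as keys (for example "AA" for AlaAla), whereas the table uses three-letter codes.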

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would have a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
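The uniform expectation described above can be checked in a few lines. The sketch below computes 1/400 in per mille and labels a few values taken from the table as over- or underrepresented. The function `classify` is hypothetical, and using exactly 2.5‰ (rather than, say, 1/441) as the colouring threshold is an assumption about how the page marks cells.

```python
N_STANDARD = 20
# 1 / (20 * 20) expressed in per mille: 2.5 per mille, i.e. 0.25%
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD ** 2

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"

# Values copied from the table (per mille).
examples = {"LeuLeu": 8.425, "AlaAla": 7.53, "TrpTrp": 0.124, "CysCys": 0.211}
labels = {name: classify(value) for name, value in examples.items()}
```

With these inputs, LeuLeu and AlaAla come out as overrepresented and TrpTrp and CysCys as underrepresented.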

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
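As a small worked example of this unit conversion, the sketch below turns the AlaAla value from the table into a fraction and a percentage. The helper name `per_mille_to_fraction` is an illustrative assumption.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

ala_ala_fraction = per_mille_to_fraction(7.53)  # AlaAla value from the table
ala_ala_percent = ala_ala_fraction * 100.0      # the same frequency in percent
```

So a table entry of 7.53 corresponds to a fraction of about 0.00753, or about 0.753%.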

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.53 ± 0.152
AlaCys: 1.154 ± 0.04
AlaAsp: 4.853 ± 0.085
AlaGlu: 5.635 ± 0.089
AlaPhe: 3.172 ± 0.062
AlaGly: 6.106 ± 0.109
AlaHis: 1.221 ± 0.039
AlaIle: 5.66 ± 0.078
AlaLys: 5.467 ± 0.092
AlaLeu: 7.237 ± 0.105
AlaMet: 2.766 ± 0.052
AlaAsn: 3.28 ± 0.082
AlaPro: 2.29 ± 0.071
AlaGln: 2.384 ± 0.061
AlaArg: 2.944 ± 0.071
AlaSer: 4.704 ± 0.091
AlaThr: 4.681 ± 0.115
AlaVal: 6.058 ± 0.099
AlaTrp: 0.652 ± 0.029
AlaTyr: 3.13 ± 0.07
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.896 ± 0.035
CysCys: 0.211 ± 0.016
CysAsp: 0.916 ± 0.036
CysGlu: 0.868 ± 0.037
CysPhe: 0.592 ± 0.026
CysGly: 1.474 ± 0.054
CysHis: 0.292 ± 0.018
CysIle: 0.973 ± 0.038
CysLys: 0.763 ± 0.035
CysLeu: 1.002 ± 0.038
CysMet: 0.391 ± 0.023
CysAsn: 0.579 ± 0.031
CysPro: 0.562 ± 0.031
CysGln: 0.342 ± 0.02
CysArg: 0.541 ± 0.029
CysSer: 0.772 ± 0.033
CysThr: 0.663 ± 0.034
CysVal: 1.011 ± 0.037
CysTrp: 0.103 ± 0.012
CysTyr: 0.546 ± 0.026
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.678 ± 0.09
AspCys: 0.801 ± 0.027
AspAsp: 3.319 ± 0.084
AspGlu: 5.149 ± 0.093
AspPhe: 2.796 ± 0.064
AspGly: 4.772 ± 0.102
AspHis: 1.143 ± 0.036
AspIle: 4.496 ± 0.081
AspLys: 3.155 ± 0.066
AspLeu: 5.153 ± 0.097
AspMet: 1.969 ± 0.051
AspAsn: 2.348 ± 0.063
AspPro: 1.887 ± 0.051
AspGln: 1.439 ± 0.048
AspArg: 2.401 ± 0.06
AspSer: 3.049 ± 0.062
AspThr: 3.235 ± 0.07
AspVal: 4.647 ± 0.071
AspTrp: 0.609 ± 0.029
AspTyr: 3.244 ± 0.079
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.628 ± 0.11
GluCys: 0.712 ± 0.033
GluAsp: 4.919 ± 0.089
GluGlu: 7.526 ± 0.127
GluPhe: 2.563 ± 0.065
GluGly: 4.558 ± 0.08
GluHis: 1.587 ± 0.048
GluIle: 5.241 ± 0.086
GluLys: 5.278 ± 0.1
GluLeu: 6.644 ± 0.098
GluMet: 2.508 ± 0.066
GluAsn: 3.656 ± 0.065
GluPro: 1.899 ± 0.06
GluGln: 2.778 ± 0.073
GluArg: 3.26 ± 0.08
GluSer: 3.373 ± 0.07
GluThr: 3.583 ± 0.081
GluVal: 5.186 ± 0.091
GluTrp: 0.607 ± 0.029
GluTyr: 3.016 ± 0.066
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.349 ± 0.069
PheCys: 0.633 ± 0.031
PheAsp: 2.699 ± 0.062
PheGlu: 2.564 ± 0.068
PhePhe: 2.047 ± 0.068
PheGly: 3.157 ± 0.063
PheHis: 0.903 ± 0.031
PheIle: 2.642 ± 0.071
PheLys: 1.678 ± 0.052
PheLeu: 3.952 ± 0.085
PheMet: 1.055 ± 0.043
PheAsn: 1.415 ± 0.046
PhePro: 1.352 ± 0.051
PheGln: 1.196 ± 0.044
PheArg: 1.718 ± 0.055
PheSer: 2.66 ± 0.056
PheThr: 2.412 ± 0.06
PheVal: 3.122 ± 0.075
PheTrp: 0.399 ± 0.027
PheTyr: 1.85 ± 0.052
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.263 ± 0.093
GlyCys: 1.166 ± 0.051
GlyAsp: 4.092 ± 0.082
GlyGlu: 4.807 ± 0.093
GlyPhe: 3.231 ± 0.066
GlyGly: 4.927 ± 0.115
GlyHis: 1.31 ± 0.053
GlyIle: 5.865 ± 0.095
GlyLys: 4.953 ± 0.077
GlyLeu: 5.794 ± 0.097
GlyMet: 2.486 ± 0.051
GlyAsn: 3.165 ± 0.071
GlyPro: 1.396 ± 0.036
GlyGln: 2.093 ± 0.058
GlyArg: 2.91 ± 0.07
GlySer: 3.993 ± 0.094
GlyThr: 4.216 ± 0.118
GlyVal: 5.541 ± 0.085
GlyTrp: 0.695 ± 0.037
GlyTyr: 3.231 ± 0.061
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.3 ± 0.038
HisCys: 0.315 ± 0.022
HisAsp: 0.982 ± 0.039
HisGlu: 1.232 ± 0.042
HisPhe: 0.878 ± 0.034
HisGly: 1.472 ± 0.048
HisHis: 0.461 ± 0.031
HisIle: 1.341 ± 0.037
HisLys: 0.996 ± 0.036
HisLeu: 1.579 ± 0.053
HisMet: 0.609 ± 0.028
HisAsn: 0.798 ± 0.032
HisPro: 1.049 ± 0.037
HisGln: 0.591 ± 0.026
HisArg: 0.917 ± 0.037
HisSer: 0.961 ± 0.037
HisThr: 0.958 ± 0.041
HisVal: 1.36 ± 0.047
HisTrp: 0.163 ± 0.015
HisTyr: 0.811 ± 0.036
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.209 ± 0.099
IleCys: 1.201 ± 0.04
IleAsp: 4.203 ± 0.079
IleGlu: 4.517 ± 0.078
IlePhe: 2.828 ± 0.07
IleGly: 5.096 ± 0.079
IleHis: 1.427 ± 0.044
IleIle: 4.719 ± 0.094
IleLys: 3.147 ± 0.067
IleLeu: 6.402 ± 0.12
IleMet: 1.858 ± 0.055
IleAsn: 2.841 ± 0.077
IlePro: 3.065 ± 0.066
IleGln: 2.206 ± 0.05
IleArg: 3.439 ± 0.077
IleSer: 4.66 ± 0.09
IleThr: 4.355 ± 0.105
IleVal: 4.889 ± 0.085
IleTrp: 0.58 ± 0.029
IleTyr: 2.711 ± 0.066
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.674 ± 0.083
LysCys: 0.539 ± 0.034
LysAsp: 3.887 ± 0.087
LysGlu: 5.708 ± 0.103
LysPhe: 1.791 ± 0.041
LysGly: 3.386 ± 0.077
LysHis: 1.144 ± 0.038
LysIle: 3.806 ± 0.074
LysLys: 5.047 ± 0.105
LysLeu: 4.67 ± 0.086
LysMet: 1.985 ± 0.054
LysAsn: 3.289 ± 0.061
LysPro: 1.85 ± 0.05
LysGln: 2.124 ± 0.052
LysArg: 2.874 ± 0.07
LysSer: 3.058 ± 0.063
LysThr: 3.523 ± 0.074
LysVal: 4.0 ± 0.088
LysTrp: 0.49 ± 0.027
LysTyr: 2.467 ± 0.052
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.141 ± 0.102
LeuCys: 1.356 ± 0.044
LeuAsp: 5.462 ± 0.09
LeuGlu: 6.124 ± 0.087
LeuPhe: 3.624 ± 0.089
LeuGly: 5.882 ± 0.096
LeuHis: 1.774 ± 0.051
LeuIle: 5.639 ± 0.098
LeuLys: 4.644 ± 0.083
LeuLeu: 8.425 ± 0.146
LeuMet: 2.61 ± 0.069
LeuAsn: 3.582 ± 0.078
LeuPro: 3.407 ± 0.074
LeuGln: 3.148 ± 0.072
LeuArg: 3.962 ± 0.085
LeuSer: 6.021 ± 0.091
LeuThr: 4.731 ± 0.089
LeuVal: 6.008 ± 0.096
LeuTrp: 0.776 ± 0.033
LeuTyr: 3.344 ± 0.068
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.609 ± 0.064
MetCys: 0.32 ± 0.022
MetAsp: 2.161 ± 0.06
MetGlu: 2.611 ± 0.061
MetPhe: 1.06 ± 0.038
MetGly: 2.203 ± 0.053
MetHis: 0.548 ± 0.025
MetIle: 2.191 ± 0.059
MetLys: 2.351 ± 0.052
MetLeu: 2.582 ± 0.067
MetMet: 1.051 ± 0.044
MetAsn: 1.689 ± 0.051
MetPro: 1.107 ± 0.038
MetGln: 1.059 ± 0.035
MetArg: 1.344 ± 0.043
MetSer: 1.766 ± 0.047
MetThr: 1.7 ± 0.046
MetVal: 2.24 ± 0.052
MetTrp: 0.194 ± 0.016
MetTyr: 0.912 ± 0.034
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.426 ± 0.072
AsnCys: 0.488 ± 0.026
AsnAsp: 2.371 ± 0.053
AsnGlu: 2.882 ± 0.061
AsnPhe: 1.624 ± 0.053
AsnGly: 3.51 ± 0.09
AsnHis: 0.929 ± 0.042
AsnIle: 3.152 ± 0.066
AsnLys: 2.281 ± 0.06
AsnLeu: 3.806 ± 0.075
AsnMet: 1.377 ± 0.044
AsnAsn: 1.98 ± 0.059
AsnPro: 2.01 ± 0.053
AsnGln: 1.489 ± 0.039
AsnArg: 2.233 ± 0.06
AsnSer: 2.211 ± 0.062
AsnThr: 2.405 ± 0.057
AsnVal: 3.067 ± 0.07
AsnTrp: 0.403 ± 0.024
AsnTyr: 1.981 ± 0.057
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.403 ± 0.069
ProCys: 0.419 ± 0.025
ProAsp: 2.083 ± 0.058
ProGlu: 3.244 ± 0.067
ProPhe: 1.45 ± 0.044
ProGly: 2.226 ± 0.055
ProHis: 0.517 ± 0.027
ProIle: 2.165 ± 0.059
ProLys: 2.014 ± 0.058
ProLeu: 2.733 ± 0.066
ProMet: 1.072 ± 0.042
ProAsn: 1.319 ± 0.042
ProPro: 0.626 ± 0.031
ProGln: 1.16 ± 0.041
ProArg: 1.067 ± 0.038
ProSer: 1.887 ± 0.049
ProThr: 1.908 ± 0.061
ProVal: 2.744 ± 0.065
ProTrp: 0.378 ± 0.024
ProTyr: 1.466 ± 0.047
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.539 ± 0.062
GlnCys: 0.354 ± 0.022
GlnAsp: 1.6 ± 0.042
GlnGlu: 2.486 ± 0.063
GlnPhe: 1.113 ± 0.045
GlnGly: 1.989 ± 0.051
GlnHis: 0.543 ± 0.024
GlnIle: 2.326 ± 0.059
GlnLys: 2.253 ± 0.057
GlnLeu: 2.947 ± 0.071
GlnMet: 1.238 ± 0.038
GlnAsn: 1.413 ± 0.045
GlnPro: 0.949 ± 0.037
GlnGln: 1.356 ± 0.051
GlnArg: 1.476 ± 0.046
GlnSer: 1.503 ± 0.048
GlnThr: 1.611 ± 0.054
GlnVal: 2.305 ± 0.057
GlnTrp: 0.342 ± 0.019
GlnTyr: 1.27 ± 0.041
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.982 ± 0.066
ArgCys: 0.529 ± 0.027
ArgAsp: 2.555 ± 0.054
ArgGlu: 3.534 ± 0.088
ArgPhe: 1.821 ± 0.055
ArgGly: 2.741 ± 0.062
ArgHis: 0.736 ± 0.035
ArgIle: 3.498 ± 0.073
ArgLys: 2.894 ± 0.077
ArgLeu: 3.745 ± 0.084
ArgMet: 1.649 ± 0.052
ArgAsn: 1.982 ± 0.051
ArgPro: 1.285 ± 0.042
ArgGln: 1.497 ± 0.051
ArgArg: 2.294 ± 0.068
ArgSer: 2.128 ± 0.052
ArgThr: 2.12 ± 0.05
ArgVal: 2.939 ± 0.066
ArgTrp: 0.428 ± 0.024
ArgTyr: 1.778 ± 0.053
ArgXaa: 0.001 ± 0.001
Ser
SerAla: 4.24 ± 0.096
SerCys: 0.712 ± 0.033
SerAsp: 3.604 ± 0.068
SerGlu: 3.851 ± 0.074
SerPhe: 2.61 ± 0.063
SerGly: 4.607 ± 0.088
SerHis: 1.036 ± 0.037
SerIle: 3.876 ± 0.076
SerLys: 3.514 ± 0.073
SerLeu: 4.878 ± 0.092
SerMet: 1.896 ± 0.059
SerAsn: 2.449 ± 0.066
SerPro: 1.507 ± 0.044
SerGln: 1.606 ± 0.048
SerArg: 2.348 ± 0.06
SerSer: 3.197 ± 0.085
SerThr: 2.905 ± 0.072
SerVal: 4.367 ± 0.078
SerTrp: 0.505 ± 0.029
SerTyr: 2.532 ± 0.069
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.495 ± 0.101
ThrCys: 0.689 ± 0.032
ThrAsp: 3.321 ± 0.079
ThrGlu: 3.882 ± 0.084
ThrPhe: 2.348 ± 0.064
ThrGly: 4.419 ± 0.11
ThrHis: 0.892 ± 0.036
ThrIle: 4.184 ± 0.09
ThrLys: 3.366 ± 0.072
ThrLeu: 4.809 ± 0.077
ThrMet: 1.591 ± 0.046
ThrAsn: 2.251 ± 0.062
ThrPro: 2.31 ± 0.066
ThrGln: 1.488 ± 0.044
ThrArg: 1.92 ± 0.05
ThrSer: 2.948 ± 0.079
ThrThr: 3.435 ± 0.103
ThrVal: 4.541 ± 0.135
ThrTrp: 0.49 ± 0.026
ThrTyr: 2.547 ± 0.089
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.17 ± 0.106
ValCys: 1.154 ± 0.047
ValAsp: 4.743 ± 0.087
ValGlu: 5.141 ± 0.09
ValPhe: 3.046 ± 0.071
ValGly: 4.992 ± 0.084
ValHis: 1.154 ± 0.044
ValIle: 5.289 ± 0.092
ValLys: 3.839 ± 0.085
ValLeu: 6.774 ± 0.105
ValMet: 2.055 ± 0.053
ValAsn: 3.062 ± 0.071
ValPro: 2.568 ± 0.051
ValGln: 1.779 ± 0.051
ValArg: 3.013 ± 0.063
ValSer: 4.587 ± 0.084
ValThr: 4.672 ± 0.136
ValVal: 6.025 ± 0.107
ValTrp: 0.579 ± 0.03
ValTyr: 2.913 ± 0.071
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.531 ± 0.027
TrpCys: 0.157 ± 0.013
TrpAsp: 0.566 ± 0.031
TrpGlu: 0.587 ± 0.03
TrpPhe: 0.41 ± 0.027
TrpGly: 0.601 ± 0.036
TrpHis: 0.145 ± 0.016
TrpIle: 0.613 ± 0.031
TrpLys: 0.629 ± 0.034
TrpLeu: 0.788 ± 0.032
TrpMet: 0.365 ± 0.024
TrpAsn: 0.601 ± 0.031
TrpPro: 0.274 ± 0.022
TrpGln: 0.328 ± 0.022
TrpArg: 0.332 ± 0.021
TrpSer: 0.468 ± 0.025
TrpThr: 0.44 ± 0.026
TrpVal: 0.567 ± 0.032
TrpTrp: 0.124 ± 0.015
TrpTyr: 0.363 ± 0.024
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.864 ± 0.062
TyrCys: 0.587 ± 0.03
TyrAsp: 2.956 ± 0.073
TyrGlu: 3.173 ± 0.061
TyrPhe: 1.799 ± 0.054
TyrGly: 3.015 ± 0.063
TyrHis: 1.011 ± 0.031
TyrIle: 2.674 ± 0.067
TyrLys: 2.044 ± 0.054
TyrLeu: 3.739 ± 0.063
TyrMet: 1.164 ± 0.044
TyrAsn: 1.927 ± 0.054
TyrPro: 1.487 ± 0.044
TyrGln: 1.55 ± 0.042
TyrArg: 2.142 ± 0.058
TyrSer: 2.296 ± 0.066
TyrThr: 2.374 ± 0.082
TyrVal: 2.939 ± 0.066
TyrTrp: 0.371 ± 0.026
TyrTyr: 2.044 ± 0.067
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.001
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.004 ± 0.004
Statistics based on 2215 proteins (756667 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
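A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The function names, the toy sequences, the fixed seed, and the choice of the sample standard deviation as the error measure are all assumptions; the source states only that bootstrapping (x100) was done at the protein level.

```python
import random
import statistics

def pair_freq(proteins, pair):
    """Frequency (per mille) of one dipeptide, e.g. 'AA', in a protein list."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_freq(sample, pair))
    return statistics.stdev(estimates)

toy = ["MAAL", "MKAAV", "MLLK", "MAALLA"]  # hypothetical sequences
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so dipeptides are never formed across protein boundaries.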

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski