Amino acid dipeptide frequency for Opitutaceae bacterium TSB47

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
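The calculation described above can be sketched as follows. This is only an illustration, not the database's actual pipeline: it assumes one-letter sequences, overlapping pairs counted within each protein (never across protein boundaries), and normalisation by the total number of dipeptides. The names `dipeptide_permille`, `classify` and `UNIFORM_PERMILLE` are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids
ALPHABET = STANDARD + "X"          # X (Xaa) = non-standard or unknown

# Uniform expectation from the text: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping pairs are counted inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

With the values from the table, `classify(17.343)` (AlaAla) gives `"over"` and `classify(0.128)` (CysCys) gives `"under"`.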

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 17.343‰ = 0.017343).

For more information, see the sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 17.343 ± 0.156 | 1.126 ± 0.031 | 6.714 ± 0.074 | 5.712 ± 0.082 | 4.317 ± 0.062 | 12.729 ± 0.207 | 2.26 ± 0.042 | 5.265 ± 0.061 | 3.685 ± 0.063 | 12.063 ± 0.131 | 2.224 ± 0.042 | 3.693 ± 0.085 | 6.228 ± 0.085 | 3.696 ± 0.05 | 8.558 ± 0.119 | 7.276 ± 0.096 | 7.068 ± 0.134 | 7.661 ± 0.072 | 1.945 ± 0.035 | 2.602 ± 0.04 | 0.0 ± 0.0
Cys | 1.102 ± 0.027 | 0.128 ± 0.008 | 0.528 ± 0.021 | 0.505 ± 0.019 | 0.37 ± 0.014 | 0.805 ± 0.024 | 0.24 ± 0.013 | 0.418 ± 0.017 | 0.23 ± 0.012 | 0.85 ± 0.023 | 0.163 ± 0.009 | 0.229 ± 0.011 | 0.443 ± 0.017 | 0.223 ± 0.01 | 0.626 ± 0.02 | 0.444 ± 0.016 | 0.363 ± 0.014 | 0.663 ± 0.019 | 0.165 ± 0.009 | 0.209 ± 0.01 | 0.0 ± 0.0
Asp | 6.878 ± 0.059 | 0.486 ± 0.016 | 2.848 ± 0.043 | 2.904 ± 0.053 | 2.396 ± 0.042 | 5.94 ± 0.082 | 1.072 ± 0.027 | 2.939 ± 0.043 | 1.596 ± 0.038 | 4.715 ± 0.043 | 0.905 ± 0.025 | 1.75 ± 0.038 | 2.804 ± 0.048 | 1.255 ± 0.025 | 3.143 ± 0.055 | 2.812 ± 0.043 | 3.299 ± 0.048 | 3.055 ± 0.04 | 1.041 ± 0.023 | 1.998 ± 0.034 | 0.0 ± 0.0
Glu | 5.214 ± 0.07 | 0.413 ± 0.016 | 1.969 ± 0.039 | 2.368 ± 0.048 | 2.013 ± 0.035 | 3.43 ± 0.051 | 1.109 ± 0.031 | 3.204 ± 0.051 | 2.716 ± 0.054 | 4.875 ± 0.074 | 1.106 ± 0.032 | 2.098 ± 0.035 | 2.304 ± 0.042 | 1.747 ± 0.037 | 3.577 ± 0.063 | 2.63 ± 0.037 | 3.249 ± 0.047 | 3.042 ± 0.051 | 0.847 ± 0.025 | 1.241 ± 0.025 | 0.0 ± 0.0
Phe | 4.614 ± 0.052 | 0.458 ± 0.016 | 2.617 ± 0.032 | 1.994 ± 0.036 | 1.698 ± 0.034 | 3.293 ± 0.043 | 0.855 ± 0.022 | 1.831 ± 0.036 | 1.214 ± 0.029 | 3.423 ± 0.055 | 0.734 ± 0.022 | 1.527 ± 0.032 | 1.768 ± 0.043 | 1.106 ± 0.025 | 2.359 ± 0.042 | 2.763 ± 0.036 | 2.639 ± 0.045 | 2.512 ± 0.042 | 0.62 ± 0.021 | 1.15 ± 0.026 | 0.0 ± 0.0
Gly | 11.131 ± 0.277 | 0.767 ± 0.022 | 4.793 ± 0.075 | 4.358 ± 0.054 | 3.452 ± 0.046 | 9.837 ± 0.301 | 1.729 ± 0.031 | 4.401 ± 0.056 | 3.459 ± 0.052 | 7.842 ± 0.074 | 1.806 ± 0.033 | 3.505 ± 0.112 | 2.973 ± 0.048 | 2.509 ± 0.036 | 6.07 ± 0.072 | 5.846 ± 0.246 | 7.048 ± 0.353 | 6.304 ± 0.085 | 1.63 ± 0.031 | 2.651 ± 0.06 | 0.0 ± 0.0
His | 2.597 ± 0.041 | 0.236 ± 0.011 | 1.202 ± 0.031 | 1.007 ± 0.028 | 0.948 ± 0.023 | 1.958 ± 0.034 | 0.587 ± 0.02 | 0.95 ± 0.023 | 0.513 ± 0.015 | 2.02 ± 0.042 | 0.342 ± 0.012 | 0.614 ± 0.019 | 1.429 ± 0.03 | 0.564 ± 0.017 | 1.419 ± 0.036 | 1.034 ± 0.021 | 1.192 ± 0.024 | 1.314 ± 0.028 | 0.383 ± 0.016 | 0.737 ± 0.019 | 0.0 ± 0.0
Ile | 5.943 ± 0.068 | 0.432 ± 0.015 | 3.116 ± 0.05 | 2.84 ± 0.046 | 1.656 ± 0.032 | 4.254 ± 0.06 | 1.004 ± 0.023 | 2.375 ± 0.039 | 1.746 ± 0.032 | 3.976 ± 0.051 | 0.763 ± 0.023 | 1.904 ± 0.034 | 2.507 ± 0.048 | 1.29 ± 0.029 | 3.143 ± 0.049 | 2.88 ± 0.056 | 3.654 ± 0.07 | 3.285 ± 0.044 | 0.607 ± 0.017 | 1.364 ± 0.027 | 0.0 ± 0.0
Lys | 3.343 ± 0.066 | 0.205 ± 0.011 | 1.642 ± 0.036 | 1.564 ± 0.037 | 1.237 ± 0.03 | 2.248 ± 0.04 | 0.75 ± 0.022 | 2.169 ± 0.041 | 1.957 ± 0.044 | 3.354 ± 0.059 | 0.783 ± 0.024 | 1.811 ± 0.036 | 2.085 ± 0.04 | 1.118 ± 0.025 | 2.083 ± 0.045 | 1.976 ± 0.032 | 2.731 ± 0.039 | 1.86 ± 0.037 | 0.533 ± 0.018 | 0.922 ± 0.024 | 0.0 ± 0.0
Leu | 13.307 ± 0.128 | 0.942 ± 0.027 | 5.672 ± 0.07 | 4.461 ± 0.07 | 3.588 ± 0.053 | 7.916 ± 0.103 | 2.118 ± 0.038 | 3.836 ± 0.054 | 3.283 ± 0.05 | 9.069 ± 0.101 | 1.646 ± 0.032 | 3.163 ± 0.06 | 5.474 ± 0.091 | 2.589 ± 0.039 | 7.159 ± 0.099 | 5.719 ± 0.067 | 5.888 ± 0.119 | 6.751 ± 0.07 | 1.373 ± 0.032 | 2.362 ± 0.035 | 0.0 ± 0.0
Met | 1.891 ± 0.036 | 0.15 ± 0.01 | 0.896 ± 0.021 | 0.963 ± 0.028 | 0.61 ± 0.018 | 1.279 ± 0.027 | 0.4 ± 0.014 | 0.902 ± 0.025 | 1.044 ± 0.026 | 1.928 ± 0.037 | 0.408 ± 0.017 | 0.78 ± 0.021 | 1.3 ± 0.031 | 0.654 ± 0.02 | 1.421 ± 0.029 | 1.234 ± 0.026 | 1.23 ± 0.027 | 1.054 ± 0.028 | 0.192 ± 0.01 | 0.292 ± 0.01 | 0.0 ± 0.0
Asn | 3.925 ± 0.082 | 0.243 ± 0.013 | 1.647 ± 0.036 | 1.399 ± 0.028 | 1.352 ± 0.034 | 3.389 ± 0.091 | 0.779 ± 0.022 | 2.011 ± 0.041 | 1.01 ± 0.026 | 3.378 ± 0.055 | 0.558 ± 0.016 | 1.624 ± 0.065 | 2.375 ± 0.044 | 1.023 ± 0.027 | 2.03 ± 0.032 | 2.09 ± 0.077 | 2.573 ± 0.081 | 2.109 ± 0.049 | 0.607 ± 0.021 | 1.244 ± 0.033 | 0.0 ± 0.0
Pro | 7.701 ± 0.126 | 0.397 ± 0.015 | 3.53 ± 0.06 | 3.313 ± 0.057 | 1.909 ± 0.038 | 5.224 ± 0.069 | 1.078 ± 0.025 | 1.617 ± 0.033 | 1.576 ± 0.037 | 4.423 ± 0.077 | 0.964 ± 0.024 | 1.398 ± 0.031 | 3.289 ± 0.07 | 1.403 ± 0.035 | 3.121 ± 0.064 | 3.083 ± 0.049 | 2.049 ± 0.037 | 4.321 ± 0.075 | 0.806 ± 0.022 | 1.148 ± 0.027 | 0.0 ± 0.0
Gln | 3.185 ± 0.044 | 0.213 ± 0.012 | 1.182 ± 0.024 | 1.179 ± 0.027 | 1.128 ± 0.025 | 2.027 ± 0.039 | 0.604 ± 0.017 | 1.706 ± 0.031 | 1.249 ± 0.032 | 2.984 ± 0.042 | 0.653 ± 0.018 | 1.194 ± 0.029 | 1.778 ± 0.039 | 1.107 ± 0.035 | 2.05 ± 0.039 | 1.857 ± 0.037 | 1.966 ± 0.032 | 1.832 ± 0.03 | 0.544 ± 0.023 | 0.778 ± 0.02 | 0.0 ± 0.0
Arg | 7.713 ± 0.095 | 0.539 ± 0.019 | 3.802 ± 0.057 | 4.21 ± 0.082 | 2.853 ± 0.05 | 4.73 ± 0.058 | 1.83 ± 0.048 | 3.51 ± 0.052 | 2.251 ± 0.046 | 7.373 ± 0.093 | 1.432 ± 0.03 | 2.065 ± 0.037 | 3.428 ± 0.061 | 2.284 ± 0.047 | 5.022 ± 0.093 | 3.007 ± 0.047 | 3.15 ± 0.039 | 4.807 ± 0.064 | 1.222 ± 0.03 | 1.812 ± 0.042 | 0.0 ± 0.0
Ser | 7.174 ± 0.108 | 0.424 ± 0.016 | 2.924 ± 0.038 | 2.372 ± 0.041 | 2.327 ± 0.032 | 7.528 ± 0.34 | 1.165 ± 0.024 | 2.859 ± 0.04 | 1.602 ± 0.032 | 5.796 ± 0.053 | 1.065 ± 0.025 | 1.852 ± 0.063 | 3.193 ± 0.05 | 1.547 ± 0.028 | 3.553 ± 0.052 | 3.59 ± 0.064 | 3.306 ± 0.066 | 4.027 ± 0.069 | 0.863 ± 0.023 | 1.613 ± 0.039 | 0.0 ± 0.0
Thr | 7.464 ± 0.142 | 0.41 ± 0.015 | 3.012 ± 0.045 | 2.321 ± 0.037 | 2.306 ± 0.047 | 7.676 ± 0.285 | 1.225 ± 0.024 | 3.18 ± 0.064 | 1.634 ± 0.033 | 7.514 ± 0.185 | 0.921 ± 0.025 | 1.948 ± 0.062 | 3.698 ± 0.063 | 1.625 ± 0.033 | 3.578 ± 0.048 | 3.23 ± 0.069 | 3.735 ± 0.089 | 4.918 ± 0.141 | 1.001 ± 0.026 | 1.628 ± 0.038 | 0.0 ± 0.0
Val | 7.57 ± 0.081 | 0.719 ± 0.021 | 3.169 ± 0.047 | 3.51 ± 0.05 | 3.113 ± 0.047 | 4.582 ± 0.097 | 1.169 ± 0.027 | 3.652 ± 0.045 | 2.057 ± 0.036 | 6.388 ± 0.061 | 1.268 ± 0.031 | 2.43 ± 0.057 | 3.37 ± 0.064 | 1.784 ± 0.032 | 4.737 ± 0.063 | 4.72 ± 0.102 | 5.124 ± 0.102 | 4.778 ± 0.055 | 1.025 ± 0.024 | 1.69 ± 0.033 | 0.0 ± 0.0
Trp | 1.292 ± 0.029 | 0.192 ± 0.011 | 0.863 ± 0.026 | 0.77 ± 0.025 | 0.715 ± 0.019 | 1.007 ± 0.025 | 0.448 ± 0.016 | 0.707 ± 0.022 | 0.633 ± 0.021 | 1.95 ± 0.037 | 0.421 ± 0.015 | 0.664 ± 0.017 | 0.707 ± 0.022 | 0.742 ± 0.021 | 1.47 ± 0.032 | 0.981 ± 0.025 | 1.034 ± 0.029 | 0.885 ± 0.022 | 0.348 ± 0.013 | 0.409 ± 0.014 | 0.0 ± 0.0
Tyr | 2.776 ± 0.034 | 0.222 ± 0.011 | 1.712 ± 0.04 | 1.399 ± 0.03 | 1.188 ± 0.024 | 2.262 ± 0.043 | 0.623 ± 0.019 | 1.207 ± 0.028 | 0.918 ± 0.025 | 2.396 ± 0.044 | 0.467 ± 0.015 | 1.116 ± 0.028 | 1.198 ± 0.035 | 0.952 ± 0.029 | 1.921 ± 0.037 | 1.608 ± 0.038 | 1.725 ± 0.039 | 1.676 ± 0.038 | 0.483 ± 0.019 | 1.038 ± 0.03 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 4993 proteins (2146511 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
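A protein-level bootstrap of this kind could look roughly like the sketch below. It rests on assumptions that the note does not confirm: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the reported error is taken to be the standard deviation across resamples. The function name `bootstrap_error` and its parameters are hypothetical.

```python
import random
import statistics


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: resampling is done over whole proteins, with replacement.
    """
    rng = random.Random(seed)
    # Pre-compute, per protein, (occurrences of the dipeptide, number of pairs).
    per_protein = []
    for seq in proteins:
        n_pairs = max(len(seq) - 1, 0)
        hits = sum(1 for i in range(n_pairs) if seq[i:i + 2] == dipeptide)
        per_protein.append((hits, n_pairs))

    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(n for _, n in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.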

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski