Amino acid dipepetide frequency for Eubacterium sp. AF36-5BH

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.326AlaAla: 4.326 ± 0.1
0.791AlaCys: 0.791 ± 0.041
3.625AlaAsp: 3.625 ± 0.076
3.887AlaGlu: 3.887 ± 0.079
2.667AlaPhe: 2.667 ± 0.075
4.516AlaGly: 4.516 ± 0.087
0.863AlaHis: 0.863 ± 0.031
5.316AlaIle: 5.316 ± 0.093
5.731AlaLys: 5.731 ± 0.096
5.241AlaLeu: 5.241 ± 0.106
1.94AlaMet: 1.94 ± 0.055
3.097AlaAsn: 3.097 ± 0.072
1.593AlaPro: 1.593 ± 0.063
1.629AlaGln: 1.629 ± 0.05
1.947AlaArg: 1.947 ± 0.059
3.261AlaSer: 3.261 ± 0.07
3.648AlaThr: 3.648 ± 0.082
4.843AlaVal: 4.843 ± 0.091
0.473AlaTrp: 0.473 ± 0.029
2.597AlaTyr: 2.597 ± 0.057
0.0AlaXaa: 0.0 ± 0.0
Cys
0.714CysAla: 0.714 ± 0.033
0.242CysCys: 0.242 ± 0.019
0.804CysAsp: 0.804 ± 0.032
0.794CysGlu: 0.794 ± 0.034
0.559CysPhe: 0.559 ± 0.03
1.252CysGly: 1.252 ± 0.053
0.22CysHis: 0.22 ± 0.017
1.205CysIle: 1.205 ± 0.041
1.153CysLys: 1.153 ± 0.044
0.895CysLeu: 0.895 ± 0.031
0.405CysMet: 0.405 ± 0.027
0.767CysAsn: 0.767 ± 0.037
0.503CysPro: 0.503 ± 0.028
0.316CysGln: 0.316 ± 0.021
0.454CysArg: 0.454 ± 0.023
0.895CysSer: 0.895 ± 0.041
0.741CysThr: 0.741 ± 0.036
0.923CysVal: 0.923 ± 0.037
0.104CysTrp: 0.104 ± 0.012
0.602CysTyr: 0.602 ± 0.029
0.0CysXaa: 0.0 ± 0.0
Asp
3.373AspAla: 3.373 ± 0.077
0.772AspCys: 0.772 ± 0.033
3.513AspAsp: 3.513 ± 0.115
4.89AspGlu: 4.89 ± 0.105
2.801AspPhe: 2.801 ± 0.062
4.305AspGly: 4.305 ± 0.133
0.63AspHis: 0.63 ± 0.028
5.141AspIle: 5.141 ± 0.087
5.217AspLys: 5.217 ± 0.105
4.353AspLeu: 4.353 ± 0.081
1.681AspMet: 1.681 ± 0.046
3.523AspAsn: 3.523 ± 0.083
1.213AspPro: 1.213 ± 0.042
1.024AspGln: 1.024 ± 0.038
1.887AspArg: 1.887 ± 0.059
3.47AspSer: 3.47 ± 0.102
3.177AspThr: 3.177 ± 0.088
3.928AspVal: 3.928 ± 0.079
0.51AspTrp: 0.51 ± 0.026
3.222AspTyr: 3.222 ± 0.072
0.0AspXaa: 0.0 ± 0.0
Glu
4.381GluAla: 4.381 ± 0.082
0.808GluCys: 0.808 ± 0.04
4.143GluAsp: 4.143 ± 0.088
6.24GluGlu: 6.24 ± 0.15
2.676GluPhe: 2.676 ± 0.062
3.722GluGly: 3.722 ± 0.075
1.215GluHis: 1.215 ± 0.047
6.33GluIle: 6.33 ± 0.122
7.675GluLys: 7.675 ± 0.136
5.759GluLeu: 5.759 ± 0.109
2.15GluMet: 2.15 ± 0.058
5.529GluAsn: 5.529 ± 0.112
1.556GluPro: 1.556 ± 0.045
2.455GluGln: 2.455 ± 0.068
2.443GluArg: 2.443 ± 0.068
3.358GluSer: 3.358 ± 0.066
3.788GluThr: 3.788 ± 0.088
4.38GluVal: 4.38 ± 0.079
0.505GluTrp: 0.505 ± 0.026
3.513GluTyr: 3.513 ± 0.065
0.0GluXaa: 0.0 ± 0.0
Phe
2.578PheAla: 2.578 ± 0.064
0.648PheCys: 0.648 ± 0.033
2.722PheAsp: 2.722 ± 0.068
2.717PheGlu: 2.717 ± 0.059
1.661PhePhe: 1.661 ± 0.066
2.792PheGly: 2.792 ± 0.073
0.579PheHis: 0.579 ± 0.027
3.106PheIle: 3.106 ± 0.077
3.157PheLys: 3.157 ± 0.066
3.077PheLeu: 3.077 ± 0.08
1.168PheMet: 1.168 ± 0.042
2.274PheAsn: 2.274 ± 0.053
1.115PhePro: 1.115 ± 0.041
1.013PheGln: 1.013 ± 0.036
1.256PheArg: 1.256 ± 0.039
2.824PheSer: 2.824 ± 0.064
2.42PheThr: 2.42 ± 0.061
2.913PheVal: 2.913 ± 0.07
0.351PheTrp: 0.351 ± 0.022
1.784PheTyr: 1.784 ± 0.059
0.0PheXaa: 0.0 ± 0.0
Gly
4.128GlyAla: 4.128 ± 0.081
1.071GlyCys: 1.071 ± 0.045
3.497GlyAsp: 3.497 ± 0.083
4.126GlyGlu: 4.126 ± 0.082
2.869GlyPhe: 2.869 ± 0.074
4.035GlyGly: 4.035 ± 0.102
1.127GlyHis: 1.127 ± 0.036
6.039GlyIle: 6.039 ± 0.089
6.631GlyLys: 6.631 ± 0.119
4.772GlyLeu: 4.772 ± 0.099
2.091GlyMet: 2.091 ± 0.058
3.789GlyAsn: 3.789 ± 0.084
1.046GlyPro: 1.046 ± 0.044
1.756GlyGln: 1.756 ± 0.051
2.404GlyArg: 2.404 ± 0.068
3.656GlySer: 3.656 ± 0.083
4.184GlyThr: 4.184 ± 0.089
4.68GlyVal: 4.68 ± 0.074
0.608GlyTrp: 0.608 ± 0.032
3.32GlyTyr: 3.32 ± 0.099
0.0GlyXaa: 0.0 ± 0.0
His
0.734HisAla: 0.734 ± 0.032
0.265HisCys: 0.265 ± 0.022
0.707HisAsp: 0.707 ± 0.031
0.816HisGlu: 0.816 ± 0.033
0.677HisPhe: 0.677 ± 0.033
1.017HisGly: 1.017 ± 0.037
0.337HisHis: 0.337 ± 0.031
1.303HisIle: 1.303 ± 0.041
1.138HisLys: 1.138 ± 0.037
1.083HisLeu: 1.083 ± 0.037
0.414HisMet: 0.414 ± 0.025
0.884HisAsn: 0.884 ± 0.035
0.606HisPro: 0.606 ± 0.03
0.411HisGln: 0.411 ± 0.025
0.584HisArg: 0.584 ± 0.031
0.873HisSer: 0.873 ± 0.03
0.791HisThr: 0.791 ± 0.031
0.942HisVal: 0.942 ± 0.037
0.141HisTrp: 0.141 ± 0.013
0.632HisTyr: 0.632 ± 0.03
0.0HisXaa: 0.0 ± 0.0
Ile
5.496IleAla: 5.496 ± 0.106
1.367IleCys: 1.367 ± 0.047
5.018IleAsp: 5.018 ± 0.078
5.847IleGlu: 5.847 ± 0.122
3.242IlePhe: 3.242 ± 0.078
5.315IleGly: 5.315 ± 0.1
1.134IleHis: 1.134 ± 0.039
7.087IleIle: 7.087 ± 0.142
7.357IleLys: 7.357 ± 0.114
6.693IleLeu: 6.693 ± 0.12
2.158IleMet: 2.158 ± 0.051
4.988IleAsn: 4.988 ± 0.101
2.883IlePro: 2.883 ± 0.066
2.069IleGln: 2.069 ± 0.049
2.821IleArg: 2.821 ± 0.075
5.65IleSer: 5.65 ± 0.093
4.887IleThr: 4.887 ± 0.084
6.011IleVal: 6.011 ± 0.102
0.594IleTrp: 0.594 ± 0.029
3.4IleTyr: 3.4 ± 0.072
0.0IleXaa: 0.0 ± 0.0
Lys
5.591LysAla: 5.591 ± 0.087
0.976LysCys: 0.976 ± 0.043
5.423LysAsp: 5.423 ± 0.116
7.962LysGlu: 7.962 ± 0.133
2.895LysPhe: 2.895 ± 0.066
5.18LysGly: 5.18 ± 0.085
1.257LysHis: 1.257 ± 0.039
7.614LysIle: 7.614 ± 0.106
9.939LysLys: 9.939 ± 0.179
6.769LysLeu: 6.769 ± 0.111
2.761LysMet: 2.761 ± 0.065
6.044LysAsn: 6.044 ± 0.115
2.158LysPro: 2.158 ± 0.06
2.733LysGln: 2.733 ± 0.079
3.06LysArg: 3.06 ± 0.087
4.877LysSer: 4.877 ± 0.093
5.074LysThr: 5.074 ± 0.091
6.432LysVal: 6.432 ± 0.118
0.864LysTrp: 0.864 ± 0.036
4.67LysTyr: 4.67 ± 0.1
0.0LysXaa: 0.0 ± 0.0
Leu
5.119LeuAla: 5.119 ± 0.093
1.118LeuCys: 1.118 ± 0.039
4.727LeuAsp: 4.727 ± 0.081
5.601LeuGlu: 5.601 ± 0.098
3.203LeuPhe: 3.203 ± 0.077
5.125LeuGly: 5.125 ± 0.088
1.116LeuHis: 1.116 ± 0.046
6.132LeuIle: 6.132 ± 0.139
7.202LeuLys: 7.202 ± 0.092
6.444LeuLeu: 6.444 ± 0.112
2.175LeuMet: 2.175 ± 0.054
4.333LeuAsn: 4.333 ± 0.077
2.659LeuPro: 2.659 ± 0.066
2.177LeuGln: 2.177 ± 0.055
2.615LeuArg: 2.615 ± 0.071
5.394LeuSer: 5.394 ± 0.088
4.307LeuThr: 4.307 ± 0.071
4.964LeuVal: 4.964 ± 0.09
0.55LeuTrp: 0.55 ± 0.029
3.285LeuTyr: 3.285 ± 0.07
0.001LeuXaa: 0.001 ± 0.001
Met
2.197MetAla: 2.197 ± 0.055
0.377MetCys: 0.377 ± 0.024
1.856MetAsp: 1.856 ± 0.048
2.243MetGlu: 2.243 ± 0.07
1.13MetPhe: 1.13 ± 0.035
1.795MetGly: 1.795 ± 0.052
0.355MetHis: 0.355 ± 0.022
2.089MetIle: 2.089 ± 0.056
2.598MetLys: 2.598 ± 0.053
2.318MetLeu: 2.318 ± 0.057
0.788MetMet: 0.788 ± 0.032
1.592MetAsn: 1.592 ± 0.039
0.991MetPro: 0.991 ± 0.035
0.739MetGln: 0.739 ± 0.032
0.959MetArg: 0.959 ± 0.032
1.818MetSer: 1.818 ± 0.042
1.459MetThr: 1.459 ± 0.045
1.865MetVal: 1.865 ± 0.055
0.239MetTrp: 0.239 ± 0.016
1.099MetTyr: 1.099 ± 0.035
0.0MetXaa: 0.0 ± 0.0
Asn
3.22AsnAla: 3.22 ± 0.079
0.822AsnCys: 0.822 ± 0.039
3.181AsnAsp: 3.181 ± 0.078
3.999AsnGlu: 3.999 ± 0.075
2.022AsnPhe: 2.022 ± 0.053
4.375AsnGly: 4.375 ± 0.104
0.877AsnHis: 0.877 ± 0.034
5.614AsnIle: 5.614 ± 0.107
5.497AsnLys: 5.497 ± 0.104
4.428AsnLeu: 4.428 ± 0.076
1.713AsnMet: 1.713 ± 0.047
3.995AsnAsn: 3.995 ± 0.102
1.972AsnPro: 1.972 ± 0.048
1.828AsnGln: 1.828 ± 0.062
2.059AsnArg: 2.059 ± 0.07
3.47AsnSer: 3.47 ± 0.083
3.158AsnThr: 3.158 ± 0.078
4.204AsnVal: 4.204 ± 0.081
0.5AsnTrp: 0.5 ± 0.028
2.703AsnTyr: 2.703 ± 0.067
0.0AsnXaa: 0.0 ± 0.0
Pro
1.641ProAla: 1.641 ± 0.055
0.391ProCys: 0.391 ± 0.025
1.789ProAsp: 1.789 ± 0.055
2.49ProGlu: 2.49 ± 0.062
1.262ProPhe: 1.262 ± 0.046
1.725ProGly: 1.725 ± 0.05
0.431ProHis: 0.431 ± 0.025
2.248ProIle: 2.248 ± 0.056
2.189ProLys: 2.189 ± 0.064
2.054ProLeu: 2.054 ± 0.052
0.77ProMet: 0.77 ± 0.03
1.454ProAsn: 1.454 ± 0.047
0.482ProPro: 0.482 ± 0.024
0.738ProGln: 0.738 ± 0.031
0.747ProArg: 0.747 ± 0.033
1.538ProSer: 1.538 ± 0.046
1.73ProThr: 1.73 ± 0.071
2.143ProVal: 2.143 ± 0.06
0.233ProTrp: 0.233 ± 0.019
1.283ProTyr: 1.283 ± 0.039
0.0ProXaa: 0.0 ± 0.0
Gln
1.692GlnAla: 1.692 ± 0.054
0.37GlnCys: 0.37 ± 0.023
1.236GlnAsp: 1.236 ± 0.037
1.886GlnGlu: 1.886 ± 0.056
1.093GlnPhe: 1.093 ± 0.035
1.752GlnGly: 1.752 ± 0.05
0.358GlnHis: 0.358 ± 0.024
2.413GlnIle: 2.413 ± 0.059
2.461GlnLys: 2.461 ± 0.066
2.491GlnLeu: 2.491 ± 0.062
0.934GlnMet: 0.934 ± 0.035
1.527GlnAsn: 1.527 ± 0.05
0.699GlnPro: 0.699 ± 0.036
0.889GlnGln: 0.889 ± 0.048
0.992GlnArg: 0.992 ± 0.038
1.415GlnSer: 1.415 ± 0.041
1.407GlnThr: 1.407 ± 0.055
1.793GlnVal: 1.793 ± 0.053
0.338GlnTrp: 0.338 ± 0.024
1.322GlnTyr: 1.322 ± 0.047
0.0GlnXaa: 0.0 ± 0.0
Arg
1.883ArgAla: 1.883 ± 0.061
0.433ArgCys: 0.433 ± 0.026
1.813ArgAsp: 1.813 ± 0.055
2.808ArgGlu: 2.808 ± 0.076
1.353ArgPhe: 1.353 ± 0.037
1.91ArgGly: 1.91 ± 0.063
0.547ArgHis: 0.547 ± 0.026
2.873ArgIle: 2.873 ± 0.063
3.364ArgLys: 3.364 ± 0.083
2.732ArgLeu: 2.732 ± 0.063
1.13ArgMet: 1.13 ± 0.046
2.066ArgAsn: 2.066 ± 0.058
0.906ArgPro: 0.906 ± 0.039
1.124ArgGln: 1.124 ± 0.039
1.456ArgArg: 1.456 ± 0.058
1.509ArgSer: 1.509 ± 0.045
1.76ArgThr: 1.76 ± 0.048
2.127ArgVal: 2.127 ± 0.058
0.298ArgTrp: 0.298 ± 0.02
1.578ArgTyr: 1.578 ± 0.046
0.0ArgXaa: 0.0 ± 0.0
Ser
3.39SerAla: 3.39 ± 0.079
0.632SerCys: 0.632 ± 0.031
3.645SerAsp: 3.645 ± 0.084
4.051SerGlu: 4.051 ± 0.072
2.561SerPhe: 2.561 ± 0.055
4.628SerGly: 4.628 ± 0.096
0.836SerHis: 0.836 ± 0.036
4.773SerIle: 4.773 ± 0.09
5.413SerLys: 5.413 ± 0.108
4.764SerLeu: 4.764 ± 0.081
1.546SerMet: 1.546 ± 0.047
3.376SerAsn: 3.376 ± 0.075
1.433SerPro: 1.433 ± 0.035
1.719SerGln: 1.719 ± 0.058
2.065SerArg: 2.065 ± 0.059
3.861SerSer: 3.861 ± 0.082
3.293SerThr: 3.293 ± 0.082
4.369SerVal: 4.369 ± 0.081
0.542SerTrp: 0.542 ± 0.029
2.687SerTyr: 2.687 ± 0.067
0.001SerXaa: 0.001 ± 0.001
Thr
3.591ThrAla: 3.591 ± 0.081
0.613ThrCys: 0.613 ± 0.032
3.344ThrAsp: 3.344 ± 0.096
3.653ThrGlu: 3.653 ± 0.084
2.385ThrPhe: 2.385 ± 0.065
4.249ThrGly: 4.249 ± 0.09
0.766ThrHis: 0.766 ± 0.029
4.773ThrIle: 4.773 ± 0.083
4.708ThrLys: 4.708 ± 0.096
4.551ThrLeu: 4.551 ± 0.079
1.433ThrMet: 1.433 ± 0.042
2.974ThrAsn: 2.974 ± 0.069
1.947ThrPro: 1.947 ± 0.056
1.359ThrGln: 1.359 ± 0.045
1.677ThrArg: 1.677 ± 0.047
3.512ThrSer: 3.512 ± 0.083
4.338ThrThr: 4.338 ± 0.197
4.849ThrVal: 4.849 ± 0.101
0.494ThrTrp: 0.494 ± 0.027
2.606ThrTyr: 2.606 ± 0.07
0.0ThrXaa: 0.0 ± 0.0
Val
4.836ValAla: 4.836 ± 0.085
1.083ValCys: 1.083 ± 0.036
4.227ValAsp: 4.227 ± 0.066
4.82ValGlu: 4.82 ± 0.082
2.757ValPhe: 2.757 ± 0.066
4.376ValGly: 4.376 ± 0.089
0.902ValHis: 0.902 ± 0.034
5.66ValIle: 5.66 ± 0.096
6.306ValLys: 6.306 ± 0.107
5.637ValLeu: 5.637 ± 0.102
1.882ValMet: 1.882 ± 0.052
3.76ValAsn: 3.76 ± 0.078
2.136ValPro: 2.136 ± 0.053
1.641ValGln: 1.641 ± 0.049
2.308ValArg: 2.308 ± 0.061
4.62ValSer: 4.62 ± 0.082
4.525ValThr: 4.525 ± 0.094
5.337ValVal: 5.337 ± 0.085
0.585ValTrp: 0.585 ± 0.034
2.934ValTyr: 2.934 ± 0.076
0.0ValXaa: 0.0 ± 0.0
Trp
0.486TrpAla: 0.486 ± 0.026
0.118TrpCys: 0.118 ± 0.012
0.501TrpAsp: 0.501 ± 0.027
0.528TrpGlu: 0.528 ± 0.03
0.375TrpPhe: 0.375 ± 0.023
0.618TrpGly: 0.618 ± 0.03
0.145TrpHis: 0.145 ± 0.014
0.669TrpIle: 0.669 ± 0.029
0.729TrpLys: 0.729 ± 0.031
0.692TrpLeu: 0.692 ± 0.033
0.196TrpMet: 0.196 ± 0.013
0.588TrpAsn: 0.588 ± 0.031
0.181TrpPro: 0.181 ± 0.016
0.268TrpGln: 0.268 ± 0.016
0.275TrpArg: 0.275 ± 0.02
0.607TrpSer: 0.607 ± 0.031
0.481TrpThr: 0.481 ± 0.029
0.484TrpVal: 0.484 ± 0.027
0.104TrpTrp: 0.104 ± 0.012
0.361TrpTyr: 0.361 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.614TyrAla: 2.614 ± 0.062
0.648TyrCys: 0.648 ± 0.03
2.858TyrAsp: 2.858 ± 0.087
3.268TyrGlu: 3.268 ± 0.077
1.963TyrPhe: 1.963 ± 0.052
3.142TyrGly: 3.142 ± 0.079
0.659TyrHis: 0.659 ± 0.033
3.63TyrIle: 3.63 ± 0.075
3.816TyrLys: 3.816 ± 0.081
3.494TyrLeu: 3.494 ± 0.081
1.178TyrMet: 1.178 ± 0.035
3.077TyrAsn: 3.077 ± 0.073
1.261TyrPro: 1.261 ± 0.039
1.18TyrGln: 1.18 ± 0.043
1.661TyrArg: 1.661 ± 0.042
2.98TyrSer: 2.98 ± 0.065
2.65TyrThr: 2.65 ± 0.066
3.148TyrVal: 3.148 ± 0.083
0.373TyrTrp: 0.373 ± 0.019
2.353TyrTyr: 2.353 ± 0.078
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.001
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.001XaaPhe: 0.001 ± 0.001
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.005XaaXaa: 0.005 ± 0.004
Statistics based on 2295 proteins (785904 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski