Amino acid dipepetide frequency for Eubacterium sp. CAG:161

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.698AlaAla: 4.698 ± 0.098
0.772AlaCys: 0.772 ± 0.039
3.909AlaAsp: 3.909 ± 0.082
3.872AlaGlu: 3.872 ± 0.077
2.656AlaPhe: 2.656 ± 0.067
5.012AlaGly: 5.012 ± 0.091
0.873AlaHis: 0.873 ± 0.038
5.348AlaIle: 5.348 ± 0.098
5.458AlaLys: 5.458 ± 0.103
5.451AlaLeu: 5.451 ± 0.11
2.073AlaMet: 2.073 ± 0.06
2.849AlaAsn: 2.849 ± 0.063
1.7AlaPro: 1.7 ± 0.059
1.698AlaGln: 1.698 ± 0.053
2.061AlaArg: 2.061 ± 0.046
3.409AlaSer: 3.409 ± 0.076
3.647AlaThr: 3.647 ± 0.088
5.185AlaVal: 5.185 ± 0.094
0.457AlaTrp: 0.457 ± 0.029
2.707AlaTyr: 2.707 ± 0.061
0.003AlaXaa: 0.003 ± 0.002
Cys
0.769CysAla: 0.769 ± 0.032
0.24CysCys: 0.24 ± 0.022
0.844CysAsp: 0.844 ± 0.033
0.761CysGlu: 0.761 ± 0.037
0.598CysPhe: 0.598 ± 0.028
1.348CysGly: 1.348 ± 0.058
0.259CysHis: 0.259 ± 0.019
1.259CysIle: 1.259 ± 0.045
0.963CysLys: 0.963 ± 0.04
0.957CysLeu: 0.957 ± 0.037
0.405CysMet: 0.405 ± 0.022
0.711CysAsn: 0.711 ± 0.035
0.512CysPro: 0.512 ± 0.033
0.309CysGln: 0.309 ± 0.02
0.453CysArg: 0.453 ± 0.024
0.879CysSer: 0.879 ± 0.039
0.639CysThr: 0.639 ± 0.029
0.998CysVal: 0.998 ± 0.037
0.094CysTrp: 0.094 ± 0.013
0.585CysTyr: 0.585 ± 0.027
0.001CysXaa: 0.001 ± 0.001
Asp
3.502AspAla: 3.502 ± 0.075
0.713AspCys: 0.713 ± 0.032
3.654AspAsp: 3.654 ± 0.08
4.898AspGlu: 4.898 ± 0.094
2.82AspPhe: 2.82 ± 0.063
4.483AspGly: 4.483 ± 0.093
0.618AspHis: 0.618 ± 0.034
5.515AspIle: 5.515 ± 0.094
5.278AspLys: 5.278 ± 0.092
4.202AspLeu: 4.202 ± 0.075
1.959AspMet: 1.959 ± 0.058
3.736AspAsn: 3.736 ± 0.091
1.232AspPro: 1.232 ± 0.049
0.825AspGln: 0.825 ± 0.035
2.07AspArg: 2.07 ± 0.059
3.453AspSer: 3.453 ± 0.083
3.376AspThr: 3.376 ± 0.091
4.183AspVal: 4.183 ± 0.083
0.512AspTrp: 0.512 ± 0.031
3.263AspTyr: 3.263 ± 0.069
0.001AspXaa: 0.001 ± 0.001
Glu
4.501GluAla: 4.501 ± 0.09
0.843GluCys: 0.843 ± 0.04
4.134GluAsp: 4.134 ± 0.097
5.856GluGlu: 5.856 ± 0.125
2.547GluPhe: 2.547 ± 0.063
3.897GluGly: 3.897 ± 0.081
1.072GluHis: 1.072 ± 0.043
6.129GluIle: 6.129 ± 0.111
7.378GluLys: 7.378 ± 0.126
5.619GluLeu: 5.619 ± 0.092
2.233GluMet: 2.233 ± 0.058
5.254GluAsn: 5.254 ± 0.097
1.553GluPro: 1.553 ± 0.054
1.988GluGln: 1.988 ± 0.063
2.447GluArg: 2.447 ± 0.061
3.252GluSer: 3.252 ± 0.07
3.555GluThr: 3.555 ± 0.08
4.381GluVal: 4.381 ± 0.083
0.535GluTrp: 0.535 ± 0.029
3.439GluTyr: 3.439 ± 0.082
0.001GluXaa: 0.001 ± 0.001
Phe
2.524PheAla: 2.524 ± 0.059
0.567PheCys: 0.567 ± 0.031
2.723PheAsp: 2.723 ± 0.059
2.717PheGlu: 2.717 ± 0.063
1.738PhePhe: 1.738 ± 0.059
2.945PheGly: 2.945 ± 0.08
0.604PheHis: 0.604 ± 0.031
3.143PheIle: 3.143 ± 0.083
2.983PheLys: 2.983 ± 0.06
2.967PheLeu: 2.967 ± 0.078
1.228PheMet: 1.228 ± 0.047
2.282PheAsn: 2.282 ± 0.057
1.105PhePro: 1.105 ± 0.04
0.897PheGln: 0.897 ± 0.039
1.316PheArg: 1.316 ± 0.042
2.979PheSer: 2.979 ± 0.071
2.296PheThr: 2.296 ± 0.062
3.01PheVal: 3.01 ± 0.068
0.333PheTrp: 0.333 ± 0.025
1.784PheTyr: 1.784 ± 0.045
0.0PheXaa: 0.0 ± 0.0
Gly
4.425GlyAla: 4.425 ± 0.097
1.104GlyCys: 1.104 ± 0.045
3.76GlyAsp: 3.76 ± 0.076
4.45GlyGlu: 4.45 ± 0.086
2.835GlyPhe: 2.835 ± 0.061
4.356GlyGly: 4.356 ± 0.086
1.149GlyHis: 1.149 ± 0.042
6.431GlyIle: 6.431 ± 0.109
6.574GlyLys: 6.574 ± 0.112
4.956GlyLeu: 4.956 ± 0.09
2.217GlyMet: 2.217 ± 0.058
3.785GlyAsn: 3.785 ± 0.078
1.091GlyPro: 1.091 ± 0.041
1.795GlyGln: 1.795 ± 0.058
2.502GlyArg: 2.502 ± 0.06
3.767GlySer: 3.767 ± 0.085
4.201GlyThr: 4.201 ± 0.104
4.874GlyVal: 4.874 ± 0.086
0.643GlyTrp: 0.643 ± 0.038
3.514GlyTyr: 3.514 ± 0.073
0.0GlyXaa: 0.0 ± 0.0
His
0.747HisAla: 0.747 ± 0.03
0.269HisCys: 0.269 ± 0.017
0.685HisAsp: 0.685 ± 0.033
0.862HisGlu: 0.862 ± 0.033
0.706HisPhe: 0.706 ± 0.033
1.058HisGly: 1.058 ± 0.042
0.331HisHis: 0.331 ± 0.033
1.282HisIle: 1.282 ± 0.048
1.044HisLys: 1.044 ± 0.044
1.058HisLeu: 1.058 ± 0.039
0.421HisMet: 0.421 ± 0.024
0.847HisAsn: 0.847 ± 0.033
0.585HisPro: 0.585 ± 0.035
0.405HisGln: 0.405 ± 0.023
0.596HisArg: 0.596 ± 0.032
0.855HisSer: 0.855 ± 0.037
0.804HisThr: 0.804 ± 0.033
0.891HisVal: 0.891 ± 0.04
0.152HisTrp: 0.152 ± 0.014
0.629HisTyr: 0.629 ± 0.031
0.001HisXaa: 0.001 ± 0.001
Ile
5.555IleAla: 5.555 ± 0.096
1.36IleCys: 1.36 ± 0.048
4.962IleAsp: 4.962 ± 0.095
5.47IleGlu: 5.47 ± 0.107
3.279IlePhe: 3.279 ± 0.097
5.276IleGly: 5.276 ± 0.082
1.108IleHis: 1.108 ± 0.045
7.141IleIle: 7.141 ± 0.152
7.242IleLys: 7.242 ± 0.113
6.593IleLeu: 6.593 ± 0.11
2.425IleMet: 2.425 ± 0.07
5.046IleAsn: 5.046 ± 0.089
2.904IlePro: 2.904 ± 0.065
1.91IleGln: 1.91 ± 0.049
2.893IleArg: 2.893 ± 0.073
5.823IleSer: 5.823 ± 0.107
4.994IleThr: 4.994 ± 0.082
5.958IleVal: 5.958 ± 0.103
0.598IleTrp: 0.598 ± 0.032
3.614IleTyr: 3.614 ± 0.091
0.0IleXaa: 0.0 ± 0.0
Lys
5.501LysAla: 5.501 ± 0.098
1.038LysCys: 1.038 ± 0.04
5.253LysAsp: 5.253 ± 0.09
7.313LysGlu: 7.313 ± 0.131
2.689LysPhe: 2.689 ± 0.062
5.156LysGly: 5.156 ± 0.1
1.119LysHis: 1.119 ± 0.04
7.209LysIle: 7.209 ± 0.11
9.092LysLys: 9.092 ± 0.177
6.312LysLeu: 6.312 ± 0.089
2.852LysMet: 2.852 ± 0.062
5.709LysAsn: 5.709 ± 0.089
2.22LysPro: 2.22 ± 0.057
2.272LysGln: 2.272 ± 0.058
3.111LysArg: 3.111 ± 0.07
4.786LysSer: 4.786 ± 0.1
4.683LysThr: 4.683 ± 0.075
6.22LysVal: 6.22 ± 0.114
0.811LysTrp: 0.811 ± 0.034
4.456LysTyr: 4.456 ± 0.076
0.0LysXaa: 0.0 ± 0.0
Leu
5.07LeuAla: 5.07 ± 0.078
1.231LeuCys: 1.231 ± 0.047
4.68LeuAsp: 4.68 ± 0.089
5.175LeuGlu: 5.175 ± 0.088
3.211LeuPhe: 3.211 ± 0.093
5.102LeuGly: 5.102 ± 0.095
1.127LeuHis: 1.127 ± 0.044
6.073LeuIle: 6.073 ± 0.119
6.857LeuLys: 6.857 ± 0.098
6.37LeuLeu: 6.37 ± 0.141
2.238LeuMet: 2.238 ± 0.059
4.241LeuAsn: 4.241 ± 0.087
2.648LeuPro: 2.648 ± 0.068
1.963LeuGln: 1.963 ± 0.056
2.681LeuArg: 2.681 ± 0.06
5.543LeuSer: 5.543 ± 0.102
4.196LeuThr: 4.196 ± 0.077
4.984LeuVal: 4.984 ± 0.085
0.631LeuTrp: 0.631 ± 0.033
3.277LeuTyr: 3.277 ± 0.069
0.001LeuXaa: 0.001 ± 0.002
Met
2.394MetAla: 2.394 ± 0.065
0.398MetCys: 0.398 ± 0.027
1.894MetAsp: 1.894 ± 0.048
2.325MetGlu: 2.325 ± 0.063
1.18MetPhe: 1.18 ± 0.04
1.982MetGly: 1.982 ± 0.059
0.385MetHis: 0.385 ± 0.024
2.271MetIle: 2.271 ± 0.066
2.613MetLys: 2.613 ± 0.057
2.386MetLeu: 2.386 ± 0.059
0.794MetMet: 0.794 ± 0.037
1.583MetAsn: 1.583 ± 0.043
1.012MetPro: 1.012 ± 0.031
0.8MetGln: 0.8 ± 0.037
1.077MetArg: 1.077 ± 0.043
1.924MetSer: 1.924 ± 0.057
1.6MetThr: 1.6 ± 0.053
2.012MetVal: 2.012 ± 0.056
0.223MetTrp: 0.223 ± 0.019
1.149MetTyr: 1.149 ± 0.043
0.0MetXaa: 0.0 ± 0.0
Asn
3.309AsnAla: 3.309 ± 0.071
0.747AsnCys: 0.747 ± 0.035
3.032AsnAsp: 3.032 ± 0.085
3.879AsnGlu: 3.879 ± 0.068
1.924AsnPhe: 1.924 ± 0.05
4.506AsnGly: 4.506 ± 0.108
0.843AsnHis: 0.843 ± 0.041
5.562AsnIle: 5.562 ± 0.086
4.847AsnLys: 4.847 ± 0.09
4.094AsnLeu: 4.094 ± 0.085
1.765AsnMet: 1.765 ± 0.051
3.711AsnAsn: 3.711 ± 0.086
2.091AsnPro: 2.091 ± 0.055
1.575AsnGln: 1.575 ± 0.044
2.196AsnArg: 2.196 ± 0.064
3.455AsnSer: 3.455 ± 0.083
3.128AsnThr: 3.128 ± 0.072
4.199AsnVal: 4.199 ± 0.088
0.53AsnTrp: 0.53 ± 0.031
2.563AsnTyr: 2.563 ± 0.069
0.001AsnXaa: 0.001 ± 0.001
Pro
1.737ProAla: 1.737 ± 0.053
0.37ProCys: 0.37 ± 0.022
2.088ProAsp: 2.088 ± 0.059
2.567ProGlu: 2.567 ± 0.069
1.221ProPhe: 1.221 ± 0.04
1.981ProGly: 1.981 ± 0.064
0.466ProHis: 0.466 ± 0.027
2.048ProIle: 2.048 ± 0.055
2.071ProLys: 2.071 ± 0.05
2.135ProLeu: 2.135 ± 0.063
0.715ProMet: 0.715 ± 0.027
1.246ProAsn: 1.246 ± 0.045
0.512ProPro: 0.512 ± 0.03
0.736ProGln: 0.736 ± 0.035
0.71ProArg: 0.71 ± 0.037
1.525ProSer: 1.525 ± 0.05
1.634ProThr: 1.634 ± 0.061
2.529ProVal: 2.529 ± 0.065
0.252ProTrp: 0.252 ± 0.02
1.397ProTyr: 1.397 ± 0.046
0.003ProXaa: 0.003 ± 0.002
Gln
1.598GlnAla: 1.598 ± 0.053
0.322GlnCys: 0.322 ± 0.018
1.253GlnAsp: 1.253 ± 0.045
1.668GlnGlu: 1.668 ± 0.043
1.047GlnPhe: 1.047 ± 0.037
1.607GlnGly: 1.607 ± 0.049
0.347GlnHis: 0.347 ± 0.024
2.276GlnIle: 2.276 ± 0.056
2.322GlnLys: 2.322 ± 0.054
2.203GlnLeu: 2.203 ± 0.053
0.841GlnMet: 0.841 ± 0.036
1.443GlnAsn: 1.443 ± 0.049
0.663GlnPro: 0.663 ± 0.034
0.894GlnGln: 0.894 ± 0.044
0.925GlnArg: 0.925 ± 0.04
1.357GlnSer: 1.357 ± 0.046
1.235GlnThr: 1.235 ± 0.038
1.756GlnVal: 1.756 ± 0.053
0.293GlnTrp: 0.293 ± 0.022
1.181GlnTyr: 1.181 ± 0.04
0.0GlnXaa: 0.0 ± 0.0
Arg
1.974ArgAla: 1.974 ± 0.05
0.444ArgCys: 0.444 ± 0.024
2.041ArgAsp: 2.041 ± 0.061
2.834ArgGlu: 2.834 ± 0.07
1.467ArgPhe: 1.467 ± 0.052
2.027ArgGly: 2.027 ± 0.068
0.586ArgHis: 0.586 ± 0.031
2.985ArgIle: 2.985 ± 0.067
3.415ArgLys: 3.415 ± 0.069
2.688ArgLeu: 2.688 ± 0.061
1.133ArgMet: 1.133 ± 0.039
2.164ArgAsn: 2.164 ± 0.066
0.916ArgPro: 0.916 ± 0.042
1.144ArgGln: 1.144 ± 0.04
1.524ArgArg: 1.524 ± 0.058
1.596ArgSer: 1.596 ± 0.049
1.754ArgThr: 1.754 ± 0.048
2.333ArgVal: 2.333 ± 0.064
0.254ArgTrp: 0.254 ± 0.018
1.683ArgTyr: 1.683 ± 0.051
0.0ArgXaa: 0.0 ± 0.0
Ser
3.732SerAla: 3.732 ± 0.076
0.682SerCys: 0.682 ± 0.032
4.101SerAsp: 4.101 ± 0.089
3.965SerGlu: 3.965 ± 0.079
2.604SerPhe: 2.604 ± 0.058
4.829SerGly: 4.829 ± 0.1
0.879SerHis: 0.879 ± 0.03
4.497SerIle: 4.497 ± 0.087
4.96SerLys: 4.96 ± 0.106
4.705SerLeu: 4.705 ± 0.091
1.636SerMet: 1.636 ± 0.046
3.211SerAsn: 3.211 ± 0.073
1.481SerPro: 1.481 ± 0.044
1.701SerGln: 1.701 ± 0.043
2.224SerArg: 2.224 ± 0.057
3.775SerSer: 3.775 ± 0.081
3.168SerThr: 3.168 ± 0.08
4.68SerVal: 4.68 ± 0.081
0.55SerTrp: 0.55 ± 0.031
2.82SerTyr: 2.82 ± 0.075
0.003SerXaa: 0.003 ± 0.002
Thr
3.653ThrAla: 3.653 ± 0.08
0.593ThrCys: 0.593 ± 0.031
3.437ThrAsp: 3.437 ± 0.089
3.535ThrGlu: 3.535 ± 0.069
2.262ThrPhe: 2.262 ± 0.059
4.567ThrGly: 4.567 ± 0.095
0.746ThrHis: 0.746 ± 0.034
4.686ThrIle: 4.686 ± 0.086
4.245ThrLys: 4.245 ± 0.079
4.46ThrLeu: 4.46 ± 0.077
1.479ThrMet: 1.479 ± 0.049
2.663ThrAsn: 2.663 ± 0.08
2.014ThrPro: 2.014 ± 0.049
1.296ThrGln: 1.296 ± 0.045
1.69ThrArg: 1.69 ± 0.044
3.294ThrSer: 3.294 ± 0.08
3.614ThrThr: 3.614 ± 0.109
4.819ThrVal: 4.819 ± 0.109
0.498ThrTrp: 0.498 ± 0.026
2.628ThrTyr: 2.628 ± 0.074
0.001ThrXaa: 0.001 ± 0.001
Val
4.984ValAla: 4.984 ± 0.094
1.092ValCys: 1.092 ± 0.04
4.223ValAsp: 4.223 ± 0.082
4.822ValGlu: 4.822 ± 0.081
2.994ValPhe: 2.994 ± 0.073
4.356ValGly: 4.356 ± 0.083
0.878ValHis: 0.878 ± 0.038
5.965ValIle: 5.965 ± 0.104
6.15ValLys: 6.15 ± 0.116
5.983ValLeu: 5.983 ± 0.104
2.074ValMet: 2.074 ± 0.062
3.86ValAsn: 3.86 ± 0.086
2.217ValPro: 2.217 ± 0.062
1.543ValGln: 1.543 ± 0.045
2.432ValArg: 2.432 ± 0.066
4.838ValSer: 4.838 ± 0.078
4.499ValThr: 4.499 ± 0.1
5.365ValVal: 5.365 ± 0.099
0.595ValTrp: 0.595 ± 0.031
3.171ValTyr: 3.171 ± 0.074
0.003ValXaa: 0.003 ± 0.002
Trp
0.489TrpAla: 0.489 ± 0.03
0.129TrpCys: 0.129 ± 0.015
0.516TrpAsp: 0.516 ± 0.028
0.506TrpGlu: 0.506 ± 0.031
0.391TrpPhe: 0.391 ± 0.025
0.646TrpGly: 0.646 ± 0.037
0.164TrpHis: 0.164 ± 0.016
0.668TrpIle: 0.668 ± 0.032
0.652TrpLys: 0.652 ± 0.033
0.661TrpLeu: 0.661 ± 0.028
0.262TrpMet: 0.262 ± 0.02
0.593TrpAsn: 0.593 ± 0.031
0.205TrpPro: 0.205 ± 0.019
0.299TrpGln: 0.299 ± 0.021
0.261TrpArg: 0.261 ± 0.019
0.584TrpSer: 0.584 ± 0.029
0.478TrpThr: 0.478 ± 0.031
0.494TrpVal: 0.494 ± 0.028
0.1TrpTrp: 0.1 ± 0.012
0.345TrpTyr: 0.345 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.674TyrAla: 2.674 ± 0.064
0.671TyrCys: 0.671 ± 0.028
3.105TyrAsp: 3.105 ± 0.072
3.139TyrGlu: 3.139 ± 0.075
1.973TyrPhe: 1.973 ± 0.046
3.297TyrGly: 3.297 ± 0.069
0.674TyrHis: 0.674 ± 0.036
3.66TyrIle: 3.66 ± 0.075
3.659TyrLys: 3.659 ± 0.071
3.496TyrLeu: 3.496 ± 0.079
1.252TyrMet: 1.252 ± 0.043
3.019TyrAsn: 3.019 ± 0.077
1.285TyrPro: 1.285 ± 0.048
1.178TyrGln: 1.178 ± 0.045
1.79TyrArg: 1.79 ± 0.056
3.105TyrSer: 3.105 ± 0.074
2.68TyrThr: 2.68 ± 0.077
3.171TyrVal: 3.171 ± 0.08
0.383TyrTrp: 0.383 ± 0.026
2.409TyrTyr: 2.409 ± 0.07
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.001
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.001
0.003XaaHis: 0.003 ± 0.002
0.003XaaIle: 0.003 ± 0.002
0.0XaaLys: 0.0 ± 0.0
0.003XaaLeu: 0.003 ± 0.002
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.001
0.0XaaPro: 0.0 ± 0.0
0.001XaaGln: 0.001 ± 0.001
0.001XaaArg: 0.001 ± 0.002
0.003XaaSer: 0.003 ± 0.002
0.001XaaThr: 0.001 ± 0.001
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.001
0.019XaaXaa: 0.019 ± 0.007
Statistics based on 2196 proteins (721333 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski