Amino acid dipeptide frequency for Prevotella sp. CAG:485

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
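
As a minimal sketch of how such frequencies could be computed (this is not the Proteome-pI implementation), the Python snippet below counts overlapping dipeptides within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice to map every non-standard letter to `X` (Xaa) are assumptions made for illustration.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids (one-letter codes)

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Dipeptides are overlapping pairs of adjacent residues and never span
    two proteins. Any non-standard letter is mapped to 'X' (assumption).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25 % or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

toy = ["MAAL", "MALA"]  # toy sequences, not real data
for pair, value in sorted(dipeptide_per_mille(toy).items()):
    label = "over" if value > EXPECTED_PER_MILLE else "under"
    print(f"{pair}: {value:.1f} per mille ({label}-represented)")
```

With only a handful of toy dipeptides every observed pair is far above 2.5‰; the comparison against the uniform expectation becomes meaningful only for a whole proteome.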

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
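
For example, the AlaAla entry of 8.896‰ corresponds to a fraction of 0.008896, i.e. about 0.89% of all dipeptides. The short Python sketch below performs this conversion and compares a value with the uniform expectation of 1/400; the helper names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def ratio_to_uniform(value, n_dipeptides=400):
    """Observed frequency divided by the uniform expectation 1/n_dipeptides."""
    return per_mille_to_fraction(value) * n_dipeptides

ala_ala = 8.896  # AlaAla value from the table, in per mille
print(f"fraction: {per_mille_to_fraction(ala_ala):.6f}")                 # 0.008896
print(f"percent:  {100 * per_mille_to_fraction(ala_ala):.4f}")           # 0.8896
print(f"ratio to uniform expectation: {ratio_to_uniform(ala_ala):.2f}")  # 3.56
```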

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.896 ± 0.153
AlaCys: 1.027 ± 0.04
AlaAsp: 6.21 ± 0.125
AlaGlu: 5.761 ± 0.109
AlaPhe: 3.412 ± 0.088
AlaGly: 6.445 ± 0.146
AlaHis: 1.367 ± 0.045
AlaIle: 4.897 ± 0.106
AlaLys: 4.398 ± 0.095
AlaLeu: 8.625 ± 0.12
AlaMet: 2.54 ± 0.064
AlaAsn: 3.644 ± 0.073
AlaPro: 3.69 ± 0.093
AlaGln: 3.286 ± 0.07
AlaArg: 4.277 ± 0.101
AlaSer: 4.96 ± 0.086
AlaThr: 4.954 ± 0.104
AlaVal: 6.001 ± 0.113
AlaTrp: 0.958 ± 0.036
AlaTyr: 2.818 ± 0.058
AlaXaa: 0.006 ± 0.003
Cys
CysAla: 1.007 ± 0.043
CysCys: 0.196 ± 0.017
CysAsp: 0.644 ± 0.031
CysGlu: 0.604 ± 0.028
CysPhe: 0.587 ± 0.031
CysGly: 1.203 ± 0.048
CysHis: 0.311 ± 0.021
CysIle: 0.708 ± 0.035
CysLys: 0.514 ± 0.03
CysLeu: 1.108 ± 0.044
CysMet: 0.348 ± 0.023
CysAsn: 0.584 ± 0.028
CysPro: 0.567 ± 0.031
CysGln: 0.354 ± 0.02
CysArg: 0.866 ± 0.035
CysSer: 0.901 ± 0.039
CysThr: 0.623 ± 0.032
CysVal: 0.892 ± 0.037
CysTrp: 0.129 ± 0.013
CysTyr: 0.594 ± 0.039
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.855 ± 0.091
AspCys: 0.762 ± 0.034
AspAsp: 2.827 ± 0.081
AspGlu: 3.294 ± 0.081
AspPhe: 2.835 ± 0.071
AspGly: 4.192 ± 0.103
AspHis: 0.849 ± 0.036
AspIle: 3.583 ± 0.078
AspLys: 3.145 ± 0.076
AspLeu: 4.628 ± 0.099
AspMet: 1.723 ± 0.046
AspAsn: 2.765 ± 0.067
AspPro: 2.3 ± 0.059
AspGln: 1.205 ± 0.042
AspArg: 2.837 ± 0.073
AspSer: 3.519 ± 0.084
AspThr: 3.162 ± 0.078
AspVal: 3.874 ± 0.094
AspTrp: 0.761 ± 0.029
AspTyr: 2.578 ± 0.068
AspXaa: 0.001 ± 0.002
Glu
GluAla: 5.728 ± 0.114
GluCys: 0.573 ± 0.026
GluAsp: 2.535 ± 0.063
GluGlu: 4.121 ± 0.115
GluPhe: 2.099 ± 0.063
GluGly: 4.034 ± 0.079
GluHis: 1.092 ± 0.038
GluIle: 3.744 ± 0.083
GluLys: 3.652 ± 0.083
GluLeu: 5.373 ± 0.101
GluMet: 1.813 ± 0.059
GluAsn: 2.85 ± 0.073
GluPro: 2.208 ± 0.056
GluGln: 2.296 ± 0.065
GluArg: 3.053 ± 0.084
GluSer: 2.92 ± 0.069
GluThr: 2.867 ± 0.076
GluVal: 3.877 ± 0.091
GluTrp: 0.696 ± 0.033
GluTyr: 2.27 ± 0.06
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.439 ± 0.076
PheCys: 0.624 ± 0.031
PheAsp: 2.585 ± 0.062
PheGlu: 1.969 ± 0.054
PhePhe: 1.768 ± 0.054
PheGly: 3.232 ± 0.072
PheHis: 0.784 ± 0.033
PheIle: 2.517 ± 0.069
PheLys: 1.9 ± 0.056
PheLeu: 3.427 ± 0.092
PheMet: 1.221 ± 0.045
PheAsn: 2.316 ± 0.061
PhePro: 1.571 ± 0.047
PheGln: 0.993 ± 0.033
PheArg: 2.3 ± 0.054
PheSer: 2.985 ± 0.074
PheThr: 2.703 ± 0.065
PheVal: 2.539 ± 0.076
PheTrp: 0.512 ± 0.027
PheTyr: 1.626 ± 0.052
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.779 ± 0.099
GlyCys: 1.066 ± 0.04
GlyAsp: 3.798 ± 0.079
GlyGlu: 3.995 ± 0.082
GlyPhe: 3.204 ± 0.073
GlyGly: 5.318 ± 0.134
GlyHis: 1.528 ± 0.052
GlyIle: 5.232 ± 0.119
GlyLys: 4.786 ± 0.094
GlyLeu: 6.164 ± 0.108
GlyMet: 2.162 ± 0.059
GlyAsn: 3.785 ± 0.104
GlyPro: 1.735 ± 0.057
GlyGln: 2.336 ± 0.07
GlyArg: 3.931 ± 0.078
GlySer: 4.72 ± 0.112
GlyThr: 4.616 ± 0.128
GlyVal: 5.321 ± 0.107
GlyTrp: 0.961 ± 0.037
GlyTyr: 3.139 ± 0.08
GlyXaa: 0.007 ± 0.003
His
HisAla: 1.436 ± 0.042
HisCys: 0.315 ± 0.022
HisAsp: 1.008 ± 0.041
HisGlu: 0.948 ± 0.038
HisPhe: 0.954 ± 0.04
HisGly: 1.371 ± 0.045
HisHis: 0.542 ± 0.038
HisIle: 1.308 ± 0.043
HisLys: 0.906 ± 0.037
HisLeu: 1.828 ± 0.054
HisMet: 0.289 ± 0.02
HisAsn: 0.998 ± 0.033
HisPro: 1.096 ± 0.045
HisGln: 0.587 ± 0.03
HisArg: 1.121 ± 0.05
HisSer: 1.2 ± 0.044
HisThr: 1.121 ± 0.039
HisVal: 0.957 ± 0.043
HisTrp: 0.255 ± 0.017
HisTyr: 0.791 ± 0.031
HisXaa: 0.001 ± 0.001
Ile
IleAla: 5.511 ± 0.099
IleCys: 0.872 ± 0.032
IleAsp: 4.141 ± 0.078
IleGlu: 3.655 ± 0.084
IlePhe: 2.584 ± 0.068
IleGly: 4.472 ± 0.102
IleHis: 1.021 ± 0.041
IleIle: 3.732 ± 0.099
IleLys: 3.047 ± 0.067
IleLeu: 5.019 ± 0.1
IleMet: 1.548 ± 0.052
IleAsn: 2.897 ± 0.07
IlePro: 2.858 ± 0.071
IleGln: 1.479 ± 0.05
IleArg: 2.93 ± 0.064
IleSer: 4.081 ± 0.08
IleThr: 4.183 ± 0.094
IleVal: 3.819 ± 0.093
IleTrp: 0.584 ± 0.031
IleTyr: 2.352 ± 0.062
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.21 ± 0.093
LysCys: 0.469 ± 0.028
LysAsp: 2.653 ± 0.062
LysGlu: 3.734 ± 0.079
LysPhe: 1.677 ± 0.049
LysGly: 4.062 ± 0.084
LysHis: 0.98 ± 0.038
LysIle: 3.438 ± 0.081
LysLys: 3.376 ± 0.091
LysLeu: 4.422 ± 0.087
LysMet: 1.823 ± 0.052
LysAsn: 2.421 ± 0.065
LysPro: 2.571 ± 0.072
LysGln: 1.749 ± 0.051
LysArg: 2.886 ± 0.072
LysSer: 2.785 ± 0.077
LysThr: 2.914 ± 0.071
LysVal: 3.975 ± 0.081
LysTrp: 0.624 ± 0.036
LysTyr: 2.025 ± 0.058
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.473 ± 0.12
LeuCys: 1.382 ± 0.052
LeuAsp: 4.805 ± 0.093
LeuGlu: 4.57 ± 0.089
LeuPhe: 3.483 ± 0.089
LeuGly: 6.282 ± 0.1
LeuHis: 1.975 ± 0.06
LeuIle: 4.927 ± 0.11
LeuLys: 5.123 ± 0.088
LeuLeu: 9.166 ± 0.16
LeuMet: 2.515 ± 0.06
LeuAsn: 4.213 ± 0.087
LeuPro: 4.865 ± 0.094
LeuGln: 3.69 ± 0.088
LeuArg: 5.835 ± 0.106
LeuSer: 6.245 ± 0.093
LeuThr: 6.098 ± 0.117
LeuVal: 5.174 ± 0.09
LeuTrp: 1.083 ± 0.042
LeuTyr: 3.195 ± 0.075
LeuXaa: 0.006 ± 0.003
Met
MetAla: 2.904 ± 0.073
MetCys: 0.26 ± 0.019
MetAsp: 1.286 ± 0.05
MetGlu: 1.7 ± 0.054
MetPhe: 0.883 ± 0.038
MetGly: 1.89 ± 0.047
MetHis: 0.468 ± 0.026
MetIle: 1.44 ± 0.05
MetLys: 1.883 ± 0.051
MetLeu: 2.863 ± 0.071
MetMet: 0.791 ± 0.034
MetAsn: 1.239 ± 0.045
MetPro: 1.509 ± 0.051
MetGln: 1.132 ± 0.045
MetArg: 1.866 ± 0.048
MetSer: 1.627 ± 0.049
MetThr: 1.738 ± 0.051
MetVal: 1.752 ± 0.052
MetTrp: 0.252 ± 0.019
MetTyr: 0.711 ± 0.031
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.015 ± 0.081
AsnCys: 0.731 ± 0.058
AsnAsp: 2.535 ± 0.073
AsnGlu: 2.263 ± 0.058
AsnPhe: 2.113 ± 0.066
AsnGly: 3.946 ± 0.108
AsnHis: 0.925 ± 0.041
AsnIle: 3.3 ± 0.065
AsnLys: 2.349 ± 0.058
AsnLeu: 4.213 ± 0.086
AsnMet: 1.295 ± 0.039
AsnAsn: 2.44 ± 0.066
AsnPro: 2.666 ± 0.06
AsnGln: 1.365 ± 0.046
AsnArg: 2.33 ± 0.054
AsnSer: 2.697 ± 0.074
AsnThr: 2.694 ± 0.078
AsnVal: 3.194 ± 0.069
AsnTrp: 0.591 ± 0.03
AsnTyr: 2.018 ± 0.061
AsnXaa: 0.001 ± 0.001
Pro
ProAla: 4.259 ± 0.095
ProCys: 0.374 ± 0.025
ProAsp: 3.194 ± 0.072
ProGlu: 3.732 ± 0.082
ProPhe: 1.797 ± 0.059
ProGly: 3.327 ± 0.078
ProHis: 0.762 ± 0.038
ProIle: 1.833 ± 0.06
ProLys: 2.106 ± 0.063
ProLeu: 3.92 ± 0.077
ProMet: 1.154 ± 0.043
ProAsn: 1.674 ± 0.058
ProPro: 1.18 ± 0.045
ProGln: 1.955 ± 0.055
ProArg: 1.882 ± 0.052
ProSer: 2.52 ± 0.061
ProThr: 2.21 ± 0.062
ProVal: 3.55 ± 0.077
ProTrp: 0.528 ± 0.032
ProTyr: 1.751 ± 0.058
ProXaa: 0.004 ± 0.002
Gln
GlnAla: 2.94 ± 0.071
GlnCys: 0.354 ± 0.025
GlnAsp: 1.346 ± 0.041
GlnGlu: 1.761 ± 0.053
GlnPhe: 1.211 ± 0.043
GlnGly: 2.476 ± 0.066
GlnHis: 0.701 ± 0.032
GlnIle: 2.136 ± 0.052
GlnLys: 1.961 ± 0.06
GlnLeu: 3.278 ± 0.089
GlnMet: 1.079 ± 0.041
GlnAsn: 1.536 ± 0.052
GlnPro: 1.696 ± 0.048
GlnGln: 1.834 ± 0.067
GlnArg: 2.162 ± 0.069
GlnSer: 1.788 ± 0.058
GlnThr: 1.988 ± 0.057
GlnVal: 1.918 ± 0.053
GlnTrp: 0.503 ± 0.032
GlnTyr: 1.331 ± 0.046
GlnXaa: 0.001 ± 0.001
Arg
ArgAla: 3.65 ± 0.075
ArgCys: 0.696 ± 0.032
ArgAsp: 2.512 ± 0.063
ArgGlu: 3.087 ± 0.071
ArgPhe: 2.471 ± 0.063
ArgGly: 3.311 ± 0.068
ArgHis: 1.428 ± 0.048
ArgIle: 3.66 ± 0.076
ArgLys: 3.178 ± 0.083
ArgLeu: 5.573 ± 0.114
ArgMet: 1.713 ± 0.049
ArgAsn: 2.752 ± 0.054
ArgPro: 2.158 ± 0.058
ArgGln: 2.437 ± 0.072
ArgArg: 4.006 ± 0.105
ArgSer: 2.814 ± 0.07
ArgThr: 2.635 ± 0.058
ArgVal: 3.379 ± 0.071
ArgTrp: 0.695 ± 0.031
ArgTyr: 2.379 ± 0.061
ArgXaa: 0.004 ± 0.002
Ser
SerAla: 5.448 ± 0.099
SerCys: 0.819 ± 0.036
SerAsp: 3.357 ± 0.074
SerGlu: 3.032 ± 0.074
SerPhe: 2.72 ± 0.066
SerGly: 5.032 ± 0.105
SerHis: 1.213 ± 0.049
SerIle: 3.465 ± 0.076
SerLys: 2.753 ± 0.068
SerLeu: 5.968 ± 0.101
SerMet: 1.651 ± 0.05
SerAsn: 2.771 ± 0.062
SerPro: 2.548 ± 0.066
SerGln: 1.918 ± 0.053
SerArg: 3.172 ± 0.066
SerSer: 3.71 ± 0.083
SerThr: 3.494 ± 0.093
SerVal: 4.364 ± 0.095
SerTrp: 0.752 ± 0.035
SerTyr: 2.464 ± 0.067
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 5.548 ± 0.11
ThrCys: 0.56 ± 0.027
ThrAsp: 3.691 ± 0.088
ThrGlu: 2.989 ± 0.062
ThrPhe: 2.477 ± 0.06
ThrGly: 4.856 ± 0.118
ThrHis: 1.036 ± 0.04
ThrIle: 3.425 ± 0.082
ThrLys: 2.142 ± 0.057
ThrLeu: 6.118 ± 0.087
ThrMet: 1.244 ± 0.039
ThrAsn: 2.365 ± 0.069
ThrPro: 3.412 ± 0.074
ThrGln: 1.761 ± 0.057
ThrArg: 2.574 ± 0.062
ThrSer: 3.245 ± 0.082
ThrThr: 3.427 ± 0.089
ThrVal: 4.703 ± 0.107
ThrTrp: 0.613 ± 0.03
ThrTyr: 2.162 ± 0.063
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.088 ± 0.122
ValCys: 0.947 ± 0.043
ValAsp: 3.703 ± 0.085
ValGlu: 4.147 ± 0.086
ValPhe: 2.533 ± 0.06
ValGly: 4.295 ± 0.107
ValHis: 1.017 ± 0.039
ValIle: 4.455 ± 0.102
ValLys: 4.061 ± 0.091
ValLeu: 5.705 ± 0.101
ValMet: 1.969 ± 0.062
ValAsn: 3.444 ± 0.078
ValPro: 2.939 ± 0.066
ValGln: 1.887 ± 0.051
ValArg: 3.5 ± 0.08
ValSer: 4.567 ± 0.093
ValThr: 3.858 ± 0.088
ValVal: 4.695 ± 0.089
ValTrp: 0.836 ± 0.032
ValTyr: 2.523 ± 0.057
ValXaa: 0.003 ± 0.002
Trp
TrpAla: 0.808 ± 0.039
TrpCys: 0.187 ± 0.019
TrpAsp: 0.614 ± 0.032
TrpGlu: 0.591 ± 0.034
TrpPhe: 0.489 ± 0.03
TrpGly: 0.958 ± 0.042
TrpHis: 0.268 ± 0.018
TrpIle: 0.69 ± 0.034
TrpLys: 0.646 ± 0.032
TrpLeu: 1.3 ± 0.052
TrpMet: 0.364 ± 0.026
TrpAsn: 0.603 ± 0.03
TrpPro: 0.43 ± 0.026
TrpGln: 0.594 ± 0.026
TrpArg: 0.722 ± 0.034
TrpSer: 0.69 ± 0.032
TrpThr: 0.699 ± 0.032
TrpVal: 0.709 ± 0.035
TrpTrp: 0.186 ± 0.017
TrpTyr: 0.413 ± 0.024
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.169 ± 0.075
TyrCys: 0.524 ± 0.027
TyrAsp: 2.25 ± 0.059
TyrGlu: 1.846 ± 0.057
TyrPhe: 1.715 ± 0.048
TyrGly: 2.873 ± 0.071
TyrHis: 0.767 ± 0.03
TyrIle: 2.296 ± 0.061
TyrLys: 1.869 ± 0.055
TyrLeu: 3.402 ± 0.075
TyrMet: 0.949 ± 0.038
TyrAsn: 2.385 ± 0.07
TyrPro: 1.709 ± 0.055
TyrGln: 1.144 ± 0.052
TyrArg: 2.287 ± 0.063
TyrSer: 2.676 ± 0.062
TyrThr: 2.358 ± 0.067
TyrVal: 2.48 ± 0.065
TyrTrp: 0.443 ± 0.025
TyrTyr: 1.808 ± 0.063
TyrXaa: 0.001 ± 0.001
Xaa
XaaAla: 0.007 ± 0.003
XaaCys: 0.001 ± 0.001
XaaAsp: 0.001 ± 0.001
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.006 ± 0.002
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.001 ± 0.001
XaaLeu: 0.003 ± 0.002
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.004 ± 0.002
XaaGln: 0.0 ± 0.0
XaaArg: 0.003 ± 0.002
XaaSer: 0.004 ± 0.003
XaaThr: 0.006 ± 0.003
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.001 ± 0.001
XaaXaa: 0.039 ± 0.01
Statistics based on 1956 proteins (695163 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
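
A protein-level bootstrap of this kind can be sketched in Python as follows. This is an illustration under stated assumptions, not the Proteome-pI code: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the standard deviation across replicates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_freq(proteins, pair):
    """Frequency (per mille) of one dipeptide, counted within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over n_boot protein-level resamples."""
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_freq(sample, pair))
    return statistics.stdev(replicates)

toy = ["MAALKV", "MALAGG", "MKKLAA", "MGAALL"]  # toy sequences, not real data
print(f"AA: {dipeptide_freq(toy, 'AA'):.1f} ± {bootstrap_error(toy, 'AA'):.1f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every replicate.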

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski