Amino acid dipeptide frequency for Bacteroides sp. CAG:598

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
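As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter toy sequences, and the choice to count overlapping windows are assumptions made for illustration; the page does not state the exact counting procedure used by Proteome-pI.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over adjacent residue pairs.

    Pairs are counted with overlapping windows inside each sequence and
    never span two different proteins (an assumption of this sketch).
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input in one-letter code, not the real proteome.
toy = ["MALLA", "LLK"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The frequencies returned for one input always sum to 1000‰, which is a quick sanity check when applying the same idea to a full proteome.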

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
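The arithmetic behind the expected value can be sketched as follows. The example values are taken from the table on this page; the helper `representation` and the use of exactly 2.5‰ as the threshold are assumptions for illustration, since the page does not state the precise rule used for the colouring.

```python
N_AMINO_ACIDS = 20

# Uniform expectation: 1 / (20 * 20) of all dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / N_AMINO_ACIDS ** 2


def representation(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify an observed frequency relative to the uniform expectation."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values (in per mille) copied from the table for this proteome.
examples = {"LeuLeu": 9.835, "AlaAla": 6.792, "AspAsp": 2.878, "TrpTrp": 0.203}
for name, value in examples.items():
    ratio = value / EXPECTED_PER_MILLE
    print(name, value, representation(value), round(ratio, 2))
```

Note that this uniform baseline ignores the fact that single amino acids themselves are not equally frequent, so a high value for a pair such as LeuLeu partly reflects how common Leu is overall.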

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.792 ± 0.1
AlaCys: 1.326 ± 0.041
AlaAsp: 5.022 ± 0.079
AlaGlu: 5.234 ± 0.08
AlaPhe: 3.367 ± 0.064
AlaGly: 5.582 ± 0.087
AlaHis: 1.373 ± 0.039
AlaIle: 4.418 ± 0.079
AlaLys: 3.628 ± 0.067
AlaLeu: 7.376 ± 0.102
AlaMet: 2.047 ± 0.046
AlaAsn: 3.022 ± 0.054
AlaPro: 2.545 ± 0.055
AlaGln: 3.0 ± 0.055
AlaArg: 3.874 ± 0.066
AlaSer: 4.532 ± 0.071
AlaThr: 4.104 ± 0.067
AlaVal: 5.431 ± 0.081
AlaTrp: 0.98 ± 0.033
AlaTyr: 3.167 ± 0.058
AlaXaa: 0.003 ± 0.002
Cys
CysAla: 0.907 ± 0.029
CysCys: 0.234 ± 0.015
CysAsp: 0.66 ± 0.024
CysGlu: 0.702 ± 0.03
CysPhe: 0.655 ± 0.026
CysGly: 1.138 ± 0.036
CysHis: 0.324 ± 0.019
CysIle: 0.965 ± 0.037
CysLys: 0.615 ± 0.026
CysLeu: 1.333 ± 0.037
CysMet: 0.323 ± 0.019
CysAsn: 0.526 ± 0.024
CysPro: 0.584 ± 0.026
CysGln: 0.415 ± 0.019
CysArg: 0.806 ± 0.029
CysSer: 0.743 ± 0.028
CysThr: 0.776 ± 0.033
CysVal: 0.82 ± 0.029
CysTrp: 0.185 ± 0.014
CysTyr: 0.577 ± 0.027
CysXaa: 0.001 ± 0.001
Asp
AspAla: 4.307 ± 0.067
AspCys: 0.678 ± 0.029
AspAsp: 2.878 ± 0.063
AspGlu: 4.12 ± 0.063
AspPhe: 2.987 ± 0.058
AspGly: 4.566 ± 0.075
AspHis: 0.905 ± 0.033
AspIle: 3.865 ± 0.058
AspLys: 3.465 ± 0.061
AspLeu: 4.907 ± 0.074
AspMet: 1.645 ± 0.04
AspAsn: 2.515 ± 0.052
AspPro: 1.96 ± 0.044
AspGln: 1.327 ± 0.038
AspArg: 2.945 ± 0.066
AspSer: 2.958 ± 0.062
AspThr: 2.953 ± 0.056
AspVal: 3.795 ± 0.06
AspTrp: 0.865 ± 0.033
AspTyr: 2.913 ± 0.059
AspXaa: 0.001 ± 0.001
Glu
GluAla: 5.648 ± 0.09
GluCys: 0.64 ± 0.027
GluAsp: 3.258 ± 0.061
GluGlu: 5.006 ± 0.094
GluPhe: 2.267 ± 0.05
GluGly: 4.611 ± 0.067
GluHis: 1.311 ± 0.04
GluIle: 3.995 ± 0.072
GluLys: 4.74 ± 0.078
GluLeu: 6.066 ± 0.085
GluMet: 2.068 ± 0.048
GluAsn: 3.207 ± 0.064
GluPro: 1.812 ± 0.046
GluGln: 2.839 ± 0.058
GluArg: 3.578 ± 0.067
GluSer: 2.843 ± 0.049
GluThr: 3.347 ± 0.063
GluVal: 4.533 ± 0.079
GluTrp: 0.818 ± 0.031
GluTyr: 2.531 ± 0.056
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.138 ± 0.065
PheCys: 0.687 ± 0.026
PheAsp: 2.745 ± 0.055
PheGlu: 2.31 ± 0.041
PhePhe: 2.151 ± 0.055
PheGly: 3.226 ± 0.065
PheHis: 0.963 ± 0.029
PheIle: 2.842 ± 0.064
PheLys: 2.04 ± 0.047
PheLeu: 4.078 ± 0.064
PheMet: 1.126 ± 0.035
PheAsn: 2.022 ± 0.047
PhePro: 1.667 ± 0.041
PheGln: 1.302 ± 0.037
PheArg: 2.337 ± 0.05
PheSer: 3.105 ± 0.058
PheThr: 2.631 ± 0.052
PheVal: 3.005 ± 0.064
PheTrp: 0.578 ± 0.026
PheTyr: 1.909 ± 0.049
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.618 ± 0.084
GlyCys: 1.027 ± 0.036
GlyAsp: 3.613 ± 0.07
GlyGlu: 4.352 ± 0.073
GlyPhe: 3.049 ± 0.051
GlyGly: 5.041 ± 0.088
GlyHis: 1.428 ± 0.04
GlyIle: 5.187 ± 0.077
GlyLys: 4.794 ± 0.063
GlyLeu: 6.17 ± 0.093
GlyMet: 2.406 ± 0.051
GlyAsn: 3.342 ± 0.065
GlyPro: 1.451 ± 0.044
GlyGln: 2.453 ± 0.054
GlyArg: 3.575 ± 0.064
GlySer: 4.02 ± 0.075
GlyThr: 4.419 ± 0.076
GlyVal: 5.028 ± 0.083
GlyTrp: 1.122 ± 0.041
GlyTyr: 3.318 ± 0.069
GlyXaa: 0.002 ± 0.001
His
HisAla: 1.375 ± 0.045
HisCys: 0.298 ± 0.018
HisAsp: 0.982 ± 0.031
HisGlu: 1.115 ± 0.033
HisPhe: 1.039 ± 0.032
HisGly: 1.322 ± 0.039
HisHis: 0.523 ± 0.023
HisIle: 1.391 ± 0.038
HisLys: 0.985 ± 0.033
HisLeu: 1.963 ± 0.046
HisMet: 0.4 ± 0.021
HisAsn: 0.878 ± 0.032
HisPro: 1.173 ± 0.032
HisGln: 0.672 ± 0.03
HisArg: 1.069 ± 0.032
HisSer: 1.056 ± 0.033
HisThr: 1.16 ± 0.033
HisVal: 1.274 ± 0.035
HisTrp: 0.276 ± 0.017
HisTyr: 0.994 ± 0.031
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.995 ± 0.08
IleCys: 0.934 ± 0.031
IleAsp: 3.982 ± 0.066
IleGlu: 3.938 ± 0.06
IlePhe: 2.544 ± 0.064
IleGly: 4.52 ± 0.088
IleHis: 1.276 ± 0.035
IleIle: 3.693 ± 0.078
IleLys: 3.217 ± 0.073
IleLeu: 5.549 ± 0.087
IleMet: 1.324 ± 0.04
IleAsn: 2.865 ± 0.059
IlePro: 2.875 ± 0.056
IleGln: 1.956 ± 0.044
IleArg: 3.597 ± 0.061
IleSer: 3.898 ± 0.071
IleThr: 3.513 ± 0.066
IleVal: 4.244 ± 0.082
IleTrp: 0.673 ± 0.03
IleTyr: 2.406 ± 0.054
IleXaa: 0.002 ± 0.001
Lys
LysAla: 4.753 ± 0.077
LysCys: 0.517 ± 0.027
LysAsp: 3.678 ± 0.059
LysGlu: 4.997 ± 0.094
LysPhe: 1.831 ± 0.049
LysGly: 4.035 ± 0.063
LysHis: 1.144 ± 0.034
LysIle: 3.281 ± 0.064
LysLys: 4.148 ± 0.083
LysLeu: 4.911 ± 0.081
LysMet: 1.817 ± 0.041
LysAsn: 2.905 ± 0.067
LysPro: 2.055 ± 0.048
LysGln: 2.442 ± 0.062
LysArg: 2.963 ± 0.057
LysSer: 2.706 ± 0.059
LysThr: 3.067 ± 0.049
LysVal: 3.853 ± 0.069
LysTrp: 0.647 ± 0.027
LysTyr: 2.629 ± 0.057
LysXaa: 0.002 ± 0.001
Leu
LeuAla: 7.254 ± 0.102
LeuCys: 1.535 ± 0.039
LeuAsp: 5.056 ± 0.072
LeuGlu: 5.324 ± 0.069
LeuPhe: 4.371 ± 0.08
LeuGly: 5.938 ± 0.092
LeuHis: 1.996 ± 0.049
LeuIle: 5.109 ± 0.086
LeuLys: 6.067 ± 0.083
LeuLeu: 9.835 ± 0.143
LeuMet: 2.755 ± 0.058
LeuAsn: 4.286 ± 0.069
LeuPro: 4.456 ± 0.073
LeuGln: 3.566 ± 0.063
LeuArg: 4.898 ± 0.078
LeuSer: 6.38 ± 0.095
LeuThr: 5.833 ± 0.079
LeuVal: 5.759 ± 0.09
LeuTrp: 1.17 ± 0.042
LeuTyr: 3.876 ± 0.065
LeuXaa: 0.008 ± 0.003
Met
MetAla: 2.26 ± 0.05
MetCys: 0.286 ± 0.02
MetAsp: 1.644 ± 0.038
MetGlu: 1.925 ± 0.047
MetPhe: 0.999 ± 0.034
MetGly: 1.969 ± 0.042
MetHis: 0.49 ± 0.023
MetIle: 1.457 ± 0.039
MetLys: 2.439 ± 0.051
MetLeu: 2.621 ± 0.063
MetMet: 0.824 ± 0.028
MetAsn: 1.508 ± 0.038
MetPro: 1.167 ± 0.038
MetGln: 1.157 ± 0.035
MetArg: 1.347 ± 0.036
MetSer: 1.445 ± 0.032
MetThr: 1.53 ± 0.043
MetVal: 1.597 ± 0.041
MetTrp: 0.263 ± 0.018
MetTyr: 0.853 ± 0.031
MetXaa: 0.001 ± 0.001
Asn
AsnAla: 3.388 ± 0.074
AsnCys: 0.476 ± 0.021
AsnAsp: 2.414 ± 0.052
AsnGlu: 2.799 ± 0.054
AsnPhe: 1.974 ± 0.044
AsnGly: 3.644 ± 0.062
AsnHis: 0.91 ± 0.027
AsnIle: 3.157 ± 0.065
AsnLys: 2.459 ± 0.048
AsnLeu: 4.373 ± 0.064
AsnMet: 1.181 ± 0.035
AsnAsn: 2.212 ± 0.057
AsnPro: 2.274 ± 0.054
AsnGln: 1.471 ± 0.042
AsnArg: 2.495 ± 0.056
AsnSer: 2.352 ± 0.056
AsnThr: 2.467 ± 0.055
AsnVal: 2.983 ± 0.054
AsnTrp: 0.635 ± 0.023
AsnTyr: 2.155 ± 0.048
AsnXaa: 0.001 ± 0.001
Pro
ProAla: 3.168 ± 0.061
ProCys: 0.415 ± 0.019
ProAsp: 2.711 ± 0.052
ProGlu: 3.274 ± 0.062
ProPhe: 1.81 ± 0.04
ProGly: 2.565 ± 0.051
ProHis: 0.771 ± 0.026
ProIle: 2.102 ± 0.045
ProLys: 1.779 ± 0.043
ProLeu: 3.585 ± 0.058
ProMet: 1.029 ± 0.033
ProAsn: 1.555 ± 0.045
ProPro: 0.844 ± 0.036
ProGln: 1.741 ± 0.046
ProArg: 1.535 ± 0.047
ProSer: 2.233 ± 0.051
ProThr: 2.054 ± 0.049
ProVal: 3.023 ± 0.06
ProTrp: 0.489 ± 0.023
ProTyr: 1.811 ± 0.046
ProXaa: 0.003 ± 0.002
Gln
GlnAla: 3.084 ± 0.061
GlnCys: 0.339 ± 0.018
GlnAsp: 1.602 ± 0.04
GlnGlu: 2.457 ± 0.05
GlnPhe: 1.336 ± 0.035
GlnGly: 2.32 ± 0.054
GlnHis: 0.718 ± 0.025
GlnIle: 2.232 ± 0.052
GlnLys: 2.265 ± 0.055
GlnLeu: 3.553 ± 0.072
GlnMet: 1.186 ± 0.036
GlnAsn: 1.646 ± 0.046
GlnPro: 1.5 ± 0.042
GlnGln: 1.725 ± 0.058
GlnArg: 1.85 ± 0.044
GlnSer: 1.927 ± 0.051
GlnThr: 2.353 ± 0.048
GlnVal: 2.431 ± 0.058
GlnTrp: 0.538 ± 0.022
GlnTyr: 1.388 ± 0.038
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.269 ± 0.065
ArgCys: 0.574 ± 0.024
ArgAsp: 2.467 ± 0.048
ArgGlu: 3.335 ± 0.059
ArgPhe: 2.455 ± 0.05
ArgGly: 2.782 ± 0.06
ArgHis: 1.104 ± 0.031
ArgIle: 3.63 ± 0.064
ArgLys: 3.338 ± 0.057
ArgLeu: 5.68 ± 0.083
ArgMet: 1.737 ± 0.044
ArgAsn: 2.468 ± 0.053
ArgPro: 2.025 ± 0.052
ArgGln: 2.306 ± 0.05
ArgArg: 3.189 ± 0.058
ArgSer: 2.574 ± 0.05
ArgThr: 2.846 ± 0.057
ArgVal: 3.071 ± 0.057
ArgTrp: 0.778 ± 0.027
ArgTyr: 2.437 ± 0.049
ArgXaa: 0.006 ± 0.003
Ser
SerAla: 4.216 ± 0.061
SerCys: 0.84 ± 0.033
SerAsp: 3.12 ± 0.059
SerGlu: 3.15 ± 0.056
SerPhe: 2.982 ± 0.051
SerGly: 4.291 ± 0.075
SerHis: 1.156 ± 0.038
SerIle: 3.838 ± 0.073
SerLys: 2.704 ± 0.055
SerLeu: 6.057 ± 0.093
SerMet: 1.457 ± 0.039
SerAsn: 2.397 ± 0.057
SerPro: 2.248 ± 0.046
SerGln: 1.869 ± 0.05
SerArg: 2.815 ± 0.055
SerSer: 3.499 ± 0.069
SerThr: 2.957 ± 0.057
SerVal: 4.029 ± 0.064
SerTrp: 0.751 ± 0.034
SerTyr: 2.63 ± 0.057
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 4.421 ± 0.072
ThrCys: 0.625 ± 0.024
ThrAsp: 3.687 ± 0.067
ThrGlu: 3.386 ± 0.068
ThrPhe: 2.648 ± 0.059
ThrGly: 4.456 ± 0.078
ThrHis: 1.059 ± 0.038
ThrIle: 3.462 ± 0.062
ThrLys: 2.431 ± 0.05
ThrLeu: 6.017 ± 0.087
ThrMet: 1.186 ± 0.032
ThrAsn: 2.291 ± 0.049
ThrPro: 2.865 ± 0.054
ThrGln: 1.868 ± 0.046
ThrArg: 2.489 ± 0.053
ThrSer: 3.151 ± 0.065
ThrThr: 3.294 ± 0.066
ThrVal: 4.214 ± 0.073
ThrTrp: 0.688 ± 0.026
ThrTyr: 2.429 ± 0.058
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.148 ± 0.078
ValCys: 1.116 ± 0.037
ValAsp: 3.792 ± 0.063
ValGlu: 4.4 ± 0.081
ValPhe: 2.838 ± 0.065
ValGly: 4.45 ± 0.078
ValHis: 1.279 ± 0.037
ValIle: 4.081 ± 0.066
ValLys: 4.071 ± 0.071
ValLeu: 6.086 ± 0.087
ValMet: 1.689 ± 0.045
ValAsn: 3.199 ± 0.065
ValPro: 2.748 ± 0.056
ValGln: 2.195 ± 0.049
ValArg: 3.486 ± 0.062
ValSer: 4.391 ± 0.065
ValThr: 3.932 ± 0.073
ValVal: 4.971 ± 0.085
ValTrp: 0.857 ± 0.034
ValTyr: 2.679 ± 0.054
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.836 ± 0.032
TrpCys: 0.191 ± 0.012
TrpAsp: 0.783 ± 0.028
TrpGlu: 0.844 ± 0.029
TrpPhe: 0.545 ± 0.027
TrpGly: 0.982 ± 0.033
TrpHis: 0.28 ± 0.017
TrpIle: 0.75 ± 0.034
TrpLys: 0.901 ± 0.035
TrpLeu: 1.278 ± 0.041
TrpMet: 0.513 ± 0.023
TrpAsn: 0.737 ± 0.028
TrpPro: 0.331 ± 0.018
TrpGln: 0.58 ± 0.024
TrpArg: 0.641 ± 0.027
TrpSer: 0.702 ± 0.027
TrpThr: 0.749 ± 0.029
TrpVal: 0.727 ± 0.029
TrpTrp: 0.203 ± 0.015
TrpTyr: 0.541 ± 0.027
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.242 ± 0.064
TyrCys: 0.55 ± 0.022
TyrAsp: 2.461 ± 0.051
TyrGlu: 2.446 ± 0.054
TyrPhe: 2.014 ± 0.047
TyrGly: 2.977 ± 0.06
TyrHis: 0.932 ± 0.031
TyrIle: 2.547 ± 0.051
TyrLys: 2.295 ± 0.046
TyrLeu: 4.033 ± 0.072
TyrMet: 1.063 ± 0.033
TyrAsn: 2.24 ± 0.055
TyrPro: 1.916 ± 0.051
TyrGln: 1.565 ± 0.044
TyrArg: 2.574 ± 0.061
TyrSer: 2.491 ± 0.051
TyrThr: 2.679 ± 0.055
TyrVal: 2.621 ± 0.056
TyrTrp: 0.598 ± 0.024
TyrTyr: 2.002 ± 0.048
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.003 ± 0.002
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.001
XaaGlu: 0.002 ± 0.002
XaaPhe: 0.001 ± 0.001
XaaGly: 0.001 ± 0.001
XaaHis: 0.002 ± 0.001
XaaIle: 0.001 ± 0.001
XaaLys: 0.002 ± 0.001
XaaLeu: 0.002 ± 0.002
XaaMet: 0.003 ± 0.002
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.001
XaaGln: 0.002 ± 0.001
XaaArg: 0.005 ± 0.003
XaaSer: 0.002 ± 0.001
XaaThr: 0.001 ± 0.001
XaaVal: 0.001 ± 0.001
XaaTrp: 0.001 ± 0.001
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.048 ± 0.011
Statistics based on 2776 proteins (985690 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
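A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function names, the toy sequences, and the use of the sample standard deviation as the reported error are assumptions for illustration; the page does not describe the exact procedure beyond the note above.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Proteins, not individual dipeptides, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy_proteome = ["MALLA", "LLK", "MKKLL", "AAL"]
print(bootstrap_error(toy_proteome, "LL"))
```

Resampling whole proteins keeps the dipeptides of one protein together, so the estimate reflects variation between proteins rather than treating every dipeptide as an independent observation.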

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski