Amino acid dipepetide frequency for Pedobacter rhizosphaerae

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.391AlaAla: 6.391 ± 0.095
0.653AlaCys: 0.653 ± 0.02
4.442AlaAsp: 4.442 ± 0.056
4.748AlaGlu: 4.748 ± 0.058
3.59AlaPhe: 3.59 ± 0.049
5.507AlaGly: 5.507 ± 0.083
1.221AlaHis: 1.221 ± 0.028
5.568AlaIle: 5.568 ± 0.065
5.222AlaLys: 5.222 ± 0.069
7.179AlaLeu: 7.179 ± 0.076
1.589AlaMet: 1.589 ± 0.03
4.19AlaAsn: 4.19 ± 0.066
2.326AlaPro: 2.326 ± 0.049
3.162AlaGln: 3.162 ± 0.042
2.387AlaArg: 2.387 ± 0.044
4.709AlaSer: 4.709 ± 0.061
4.42AlaThr: 4.42 ± 0.136
4.796AlaVal: 4.796 ± 0.061
0.773AlaTrp: 0.773 ± 0.023
3.047AlaTyr: 3.047 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.504CysAla: 0.504 ± 0.021
0.113CysCys: 0.113 ± 0.009
0.351CysAsp: 0.351 ± 0.017
0.367CysGlu: 0.367 ± 0.018
0.412CysPhe: 0.412 ± 0.014
0.571CysGly: 0.571 ± 0.022
0.182CysHis: 0.182 ± 0.012
0.576CysIle: 0.576 ± 0.02
0.524CysLys: 0.524 ± 0.017
0.726CysLeu: 0.726 ± 0.023
0.152CysMet: 0.152 ± 0.009
0.37CysAsn: 0.37 ± 0.016
0.288CysPro: 0.288 ± 0.013
0.201CysGln: 0.201 ± 0.012
0.286CysArg: 0.286 ± 0.013
0.491CysSer: 0.491 ± 0.019
0.391CysThr: 0.391 ± 0.018
0.408CysVal: 0.408 ± 0.017
0.072CysTrp: 0.072 ± 0.006
0.301CysTyr: 0.301 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
3.973AspAla: 3.973 ± 0.048
0.359AspCys: 0.359 ± 0.015
2.462AspAsp: 2.462 ± 0.051
3.294AspGlu: 3.294 ± 0.053
3.225AspPhe: 3.225 ± 0.05
3.854AspGly: 3.854 ± 0.058
1.022AspHis: 1.022 ± 0.026
3.839AspIle: 3.839 ± 0.054
3.784AspLys: 3.784 ± 0.054
5.184AspLeu: 5.184 ± 0.061
1.112AspMet: 1.112 ± 0.024
2.64AspAsn: 2.64 ± 0.049
2.074AspPro: 2.074 ± 0.036
2.108AspGln: 2.108 ± 0.033
2.166AspArg: 2.166 ± 0.038
2.611AspSer: 2.611 ± 0.044
2.392AspThr: 2.392 ± 0.044
3.337AspVal: 3.337 ± 0.052
0.812AspTrp: 0.812 ± 0.023
2.592AspTyr: 2.592 ± 0.045
0.0AspXaa: 0.0 ± 0.0
Glu
4.097GluAla: 4.097 ± 0.052
0.288GluCys: 0.288 ± 0.015
2.706GluAsp: 2.706 ± 0.041
3.744GluGlu: 3.744 ± 0.068
2.333GluPhe: 2.333 ± 0.039
3.351GluGly: 3.351 ± 0.054
1.068GluHis: 1.068 ± 0.028
4.59GluIle: 4.59 ± 0.061
4.905GluLys: 4.905 ± 0.064
5.441GluLeu: 5.441 ± 0.07
1.424GluMet: 1.424 ± 0.025
3.495GluAsn: 3.495 ± 0.048
1.528GluPro: 1.528 ± 0.032
2.429GluGln: 2.429 ± 0.045
2.457GluArg: 2.457 ± 0.046
2.932GluSer: 2.932 ± 0.044
2.859GluThr: 2.859 ± 0.047
3.804GluVal: 3.804 ± 0.058
0.648GluTrp: 0.648 ± 0.021
1.942GluTyr: 1.942 ± 0.035
0.0GluXaa: 0.0 ± 0.0
Phe
3.408PheAla: 3.408 ± 0.051
0.467PheCys: 0.467 ± 0.016
2.88PheAsp: 2.88 ± 0.043
2.811PheGlu: 2.811 ± 0.044
2.554PhePhe: 2.554 ± 0.051
3.512PheGly: 3.512 ± 0.048
0.763PheHis: 0.763 ± 0.021
3.554PheIle: 3.554 ± 0.057
3.55PheLys: 3.55 ± 0.05
4.44PheLeu: 4.44 ± 0.065
1.057PheMet: 1.057 ± 0.029
3.216PheAsn: 3.216 ± 0.053
1.712PhePro: 1.712 ± 0.035
1.473PheGln: 1.473 ± 0.033
1.787PheArg: 1.787 ± 0.035
4.008PheSer: 4.008 ± 0.058
3.082PheThr: 3.082 ± 0.058
2.768PheVal: 2.768 ± 0.038
0.601PheTrp: 0.601 ± 0.019
2.194PheTyr: 2.194 ± 0.044
0.0PheXaa: 0.0 ± 0.0
Gly
4.657GlyAla: 4.657 ± 0.068
0.561GlyCys: 0.561 ± 0.02
3.257GlyAsp: 3.257 ± 0.046
3.269GlyGlu: 3.269 ± 0.053
3.69GlyPhe: 3.69 ± 0.06
4.815GlyGly: 4.815 ± 0.065
1.104GlyHis: 1.104 ± 0.028
5.232GlyIle: 5.232 ± 0.065
5.603GlyLys: 5.603 ± 0.064
6.433GlyLeu: 6.433 ± 0.072
1.599GlyMet: 1.599 ± 0.034
4.054GlyAsn: 4.054 ± 0.059
1.528GlyPro: 1.528 ± 0.032
2.268GlyGln: 2.268 ± 0.041
2.517GlyArg: 2.517 ± 0.038
4.507GlySer: 4.507 ± 0.059
4.406GlyThr: 4.406 ± 0.115
4.333GlyVal: 4.333 ± 0.057
0.958GlyTrp: 0.958 ± 0.028
3.267GlyTyr: 3.267 ± 0.055
0.0GlyXaa: 0.0 ± 0.0
His
1.135HisAla: 1.135 ± 0.025
0.172HisCys: 0.172 ± 0.01
0.829HisAsp: 0.829 ± 0.023
0.902HisGlu: 0.902 ± 0.026
1.124HisPhe: 1.124 ± 0.027
1.077HisGly: 1.077 ± 0.028
0.524HisHis: 0.524 ± 0.018
1.304HisIle: 1.304 ± 0.032
0.992HisLys: 0.992 ± 0.024
1.821HisLeu: 1.821 ± 0.035
0.32HisMet: 0.32 ± 0.014
0.895HisAsn: 0.895 ± 0.026
0.934HisPro: 0.934 ± 0.026
0.841HisGln: 0.841 ± 0.024
0.696HisArg: 0.696 ± 0.022
0.996HisSer: 0.996 ± 0.026
0.945HisThr: 0.945 ± 0.025
0.916HisVal: 0.916 ± 0.024
0.224HisTrp: 0.224 ± 0.011
0.823HisTyr: 0.823 ± 0.022
0.0HisXaa: 0.0 ± 0.0
Ile
5.879IleAla: 5.879 ± 0.065
0.675IleCys: 0.675 ± 0.024
4.228IleAsp: 4.228 ± 0.059
4.152IleGlu: 4.152 ± 0.059
3.311IlePhe: 3.311 ± 0.064
4.762IleGly: 4.762 ± 0.075
1.209IleHis: 1.209 ± 0.029
5.06IleIle: 5.06 ± 0.067
5.309IleLys: 5.309 ± 0.072
6.466IleLeu: 6.466 ± 0.078
1.278IleMet: 1.278 ± 0.029
4.515IleAsn: 4.515 ± 0.052
3.094IlePro: 3.094 ± 0.048
2.424IleGln: 2.424 ± 0.037
2.79IleArg: 2.79 ± 0.043
5.486IleSer: 5.486 ± 0.063
4.575IleThr: 4.575 ± 0.07
4.161IleVal: 4.161 ± 0.055
0.782IleTrp: 0.782 ± 0.024
2.723IleTyr: 2.723 ± 0.043
0.0IleXaa: 0.0 ± 0.0
Lys
5.567LysAla: 5.567 ± 0.062
0.306LysCys: 0.306 ± 0.016
4.192LysAsp: 4.192 ± 0.056
4.498LysGlu: 4.498 ± 0.059
2.858LysPhe: 2.858 ± 0.043
4.799LysGly: 4.799 ± 0.058
1.307LysHis: 1.307 ± 0.025
5.35LysIle: 5.35 ± 0.066
5.728LysLys: 5.728 ± 0.075
6.618LysLeu: 6.618 ± 0.064
1.874LysMet: 1.874 ± 0.035
4.596LysAsn: 4.596 ± 0.059
2.826LysPro: 2.826 ± 0.047
2.822LysGln: 2.822 ± 0.043
2.762LysArg: 2.762 ± 0.046
4.285LysSer: 4.285 ± 0.053
4.541LysThr: 4.541 ± 0.057
4.614LysVal: 4.614 ± 0.054
0.871LysTrp: 0.871 ± 0.023
2.992LysTyr: 2.992 ± 0.043
0.0LysXaa: 0.0 ± 0.0
Leu
7.17LeuAla: 7.17 ± 0.069
0.722LeuCys: 0.722 ± 0.023
4.698LeuAsp: 4.698 ± 0.054
4.941LeuGlu: 4.941 ± 0.07
4.673LeuPhe: 4.673 ± 0.064
5.895LeuGly: 5.895 ± 0.071
1.579LeuHis: 1.579 ± 0.033
6.738LeuIle: 6.738 ± 0.083
7.714LeuLys: 7.714 ± 0.09
9.048LeuLeu: 9.048 ± 0.105
2.112LeuMet: 2.112 ± 0.04
6.053LeuAsn: 6.053 ± 0.07
3.984LeuPro: 3.984 ± 0.051
3.539LeuGln: 3.539 ± 0.047
3.606LeuArg: 3.606 ± 0.055
7.401LeuSer: 7.401 ± 0.069
5.59LeuThr: 5.59 ± 0.081
5.413LeuVal: 5.413 ± 0.06
0.952LeuTrp: 0.952 ± 0.029
3.255LeuTyr: 3.255 ± 0.049
0.0LeuXaa: 0.0 ± 0.0
Met
1.738MetAla: 1.738 ± 0.033
0.138MetCys: 0.138 ± 0.01
1.148MetAsp: 1.148 ± 0.026
1.278MetGlu: 1.278 ± 0.027
0.82MetPhe: 0.82 ± 0.023
1.458MetGly: 1.458 ± 0.032
0.381MetHis: 0.381 ± 0.015
1.415MetIle: 1.415 ± 0.032
1.972MetLys: 1.972 ± 0.035
2.037MetLeu: 2.037 ± 0.035
0.577MetMet: 0.577 ± 0.019
1.177MetAsn: 1.177 ± 0.028
0.973MetPro: 0.973 ± 0.026
0.921MetGln: 0.921 ± 0.023
0.891MetArg: 0.891 ± 0.023
1.265MetSer: 1.265 ± 0.026
0.975MetThr: 0.975 ± 0.024
1.405MetVal: 1.405 ± 0.029
0.192MetTrp: 0.192 ± 0.011
0.67MetTyr: 0.67 ± 0.022
0.0MetXaa: 0.0 ± 0.0
Asn
4.324AsnAla: 4.324 ± 0.056
0.392AsnCys: 0.392 ± 0.018
2.869AsnAsp: 2.869 ± 0.05
2.96AsnGlu: 2.96 ± 0.047
2.901AsnPhe: 2.901 ± 0.046
4.422AsnGly: 4.422 ± 0.085
1.084AsnHis: 1.084 ± 0.028
4.455AsnIle: 4.455 ± 0.056
3.864AsnLys: 3.864 ± 0.051
5.614AsnLeu: 5.614 ± 0.073
1.198AsnMet: 1.198 ± 0.028
3.471AsnAsn: 3.471 ± 0.057
2.984AsnPro: 2.984 ± 0.043
2.427AsnGln: 2.427 ± 0.041
2.347AsnArg: 2.347 ± 0.039
3.442AsnSer: 3.442 ± 0.05
3.536AsnThr: 3.536 ± 0.057
3.375AsnVal: 3.375 ± 0.055
0.815AsnTrp: 0.815 ± 0.026
2.771AsnTyr: 2.771 ± 0.047
0.0AsnXaa: 0.0 ± 0.0
Pro
3.141ProAla: 3.141 ± 0.073
0.219ProCys: 0.219 ± 0.011
2.33ProAsp: 2.33 ± 0.038
2.64ProGlu: 2.64 ± 0.044
1.889ProPhe: 1.889 ± 0.035
2.598ProGly: 2.598 ± 0.043
0.637ProHis: 0.637 ± 0.018
2.502ProIle: 2.502 ± 0.044
2.469ProLys: 2.469 ± 0.038
3.261ProLeu: 3.261 ± 0.046
0.743ProMet: 0.743 ± 0.024
2.166ProAsn: 2.166 ± 0.036
0.91ProPro: 0.91 ± 0.029
1.415ProGln: 1.415 ± 0.029
1.056ProArg: 1.056 ± 0.027
2.266ProSer: 2.266 ± 0.038
2.053ProThr: 2.053 ± 0.041
2.954ProVal: 2.954 ± 0.057
0.402ProTrp: 0.402 ± 0.017
1.467ProTyr: 1.467 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
2.699GlnAla: 2.699 ± 0.041
0.169GlnCys: 0.169 ± 0.011
1.767GlnAsp: 1.767 ± 0.037
2.126GlnGlu: 2.126 ± 0.036
1.723GlnPhe: 1.723 ± 0.029
2.098GlnGly: 2.098 ± 0.032
0.756GlnHis: 0.756 ± 0.024
2.712GlnIle: 2.712 ± 0.039
2.888GlnLys: 2.888 ± 0.044
3.806GlnLeu: 3.806 ± 0.052
0.838GlnMet: 0.838 ± 0.02
2.382GlnAsn: 2.382 ± 0.045
1.39GlnPro: 1.39 ± 0.029
1.976GlnGln: 1.976 ± 0.043
1.479GlnArg: 1.479 ± 0.031
2.24GlnSer: 2.24 ± 0.036
2.18GlnThr: 2.18 ± 0.04
2.392GlnVal: 2.392 ± 0.039
0.425GlnTrp: 0.425 ± 0.016
1.555GlnTyr: 1.555 ± 0.034
0.0GlnXaa: 0.0 ± 0.0
Arg
2.491ArgAla: 2.491 ± 0.041
0.227ArgCys: 0.227 ± 0.013
1.908ArgAsp: 1.908 ± 0.036
2.135ArgGlu: 2.135 ± 0.039
2.124ArgPhe: 2.124 ± 0.039
2.144ArgGly: 2.144 ± 0.04
0.585ArgHis: 0.585 ± 0.02
2.968ArgIle: 2.968 ± 0.048
2.893ArgLys: 2.893 ± 0.046
3.842ArgLeu: 3.842 ± 0.053
0.967ArgMet: 0.967 ± 0.027
2.299ArgAsn: 2.299 ± 0.042
1.241ArgPro: 1.241 ± 0.031
1.263ArgGln: 1.263 ± 0.033
1.474ArgArg: 1.474 ± 0.031
2.281ArgSer: 2.281 ± 0.045
2.01ArgThr: 2.01 ± 0.037
2.231ArgVal: 2.231 ± 0.038
0.495ArgTrp: 0.495 ± 0.018
1.863ArgTyr: 1.863 ± 0.035
0.0ArgXaa: 0.0 ± 0.0
Ser
5.18SerAla: 5.18 ± 0.065
0.548SerCys: 0.548 ± 0.017
3.098SerAsp: 3.098 ± 0.039
3.059SerGlu: 3.059 ± 0.051
3.728SerPhe: 3.728 ± 0.052
4.995SerGly: 4.995 ± 0.051
1.088SerHis: 1.088 ± 0.027
4.881SerIle: 4.881 ± 0.057
4.189SerLys: 4.189 ± 0.058
6.31SerLeu: 6.31 ± 0.068
1.252SerMet: 1.252 ± 0.028
3.517SerAsn: 3.517 ± 0.057
2.405SerPro: 2.405 ± 0.039
2.003SerGln: 2.003 ± 0.036
2.422SerArg: 2.422 ± 0.038
4.315SerSer: 4.315 ± 0.054
3.9SerThr: 3.9 ± 0.065
4.085SerVal: 4.085 ± 0.05
0.782SerTrp: 0.782 ± 0.022
2.824SerTyr: 2.824 ± 0.043
0.0SerXaa: 0.0 ± 0.0
Thr
5.071ThrAla: 5.071 ± 0.126
0.327ThrCys: 0.327 ± 0.014
3.395ThrAsp: 3.395 ± 0.062
3.138ThrGlu: 3.138 ± 0.049
2.855ThrPhe: 2.855 ± 0.056
4.713ThrGly: 4.713 ± 0.089
0.906ThrHis: 0.906 ± 0.026
4.29ThrIle: 4.29 ± 0.087
3.406ThrLys: 3.406 ± 0.052
5.736ThrLeu: 5.736 ± 0.077
0.941ThrMet: 0.941 ± 0.024
2.988ThrAsn: 2.988 ± 0.059
2.53ThrPro: 2.53 ± 0.063
2.027ThrGln: 2.027 ± 0.042
1.815ThrArg: 1.815 ± 0.038
3.578ThrSer: 3.578 ± 0.061
3.611ThrThr: 3.611 ± 0.097
3.933ThrVal: 3.933 ± 0.113
0.651ThrTrp: 0.651 ± 0.017
2.476ThrTyr: 2.476 ± 0.053
0.0ThrXaa: 0.0 ± 0.0
Val
4.633ValAla: 4.633 ± 0.069
0.511ValCys: 0.511 ± 0.021
3.382ValAsp: 3.382 ± 0.044
3.395ValGlu: 3.395 ± 0.05
3.114ValPhe: 3.114 ± 0.048
3.813ValGly: 3.813 ± 0.054
0.958ValHis: 0.958 ± 0.025
4.538ValIle: 4.538 ± 0.062
4.715ValLys: 4.715 ± 0.062
5.942ValLeu: 5.942 ± 0.067
1.349ValMet: 1.349 ± 0.031
3.818ValAsn: 3.818 ± 0.063
2.311ValPro: 2.311 ± 0.039
1.952ValGln: 1.952 ± 0.036
2.183ValArg: 2.183 ± 0.041
4.422ValSer: 4.422 ± 0.055
3.664ValThr: 3.664 ± 0.104
4.096ValVal: 4.096 ± 0.059
0.66ValTrp: 0.66 ± 0.022
2.468ValTyr: 2.468 ± 0.047
0.0ValXaa: 0.0 ± 0.0
Trp
0.811TrpAla: 0.811 ± 0.023
0.126TrpCys: 0.126 ± 0.008
0.655TrpAsp: 0.655 ± 0.021
0.663TrpGlu: 0.663 ± 0.018
0.595TrpPhe: 0.595 ± 0.021
0.853TrpGly: 0.853 ± 0.025
0.22TrpHis: 0.22 ± 0.013
0.764TrpIle: 0.764 ± 0.027
0.889TrpLys: 0.889 ± 0.023
1.132TrpLeu: 1.132 ± 0.027
0.345TrpMet: 0.345 ± 0.013
0.703TrpAsn: 0.703 ± 0.021
0.385TrpPro: 0.385 ± 0.016
0.486TrpGln: 0.486 ± 0.019
0.477TrpArg: 0.477 ± 0.017
0.684TrpSer: 0.684 ± 0.024
0.662TrpThr: 0.662 ± 0.021
0.696TrpVal: 0.696 ± 0.023
0.209TrpTrp: 0.209 ± 0.012
0.494TrpTyr: 0.494 ± 0.023
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.053TyrAla: 3.053 ± 0.049
0.314TyrCys: 0.314 ± 0.012
2.244TyrAsp: 2.244 ± 0.04
1.92TyrGlu: 1.92 ± 0.038
2.317TyrPhe: 2.317 ± 0.045
2.819TyrGly: 2.819 ± 0.043
0.878TyrHis: 0.878 ± 0.023
2.542TyrIle: 2.542 ± 0.036
2.791TyrLys: 2.791 ± 0.045
4.045TyrLeu: 4.045 ± 0.053
0.702TyrMet: 0.702 ± 0.022
2.707TyrAsn: 2.707 ± 0.053
1.694TyrPro: 1.694 ± 0.034
1.863TyrGln: 1.863 ± 0.034
1.843TyrArg: 1.843 ± 0.034
2.661TyrSer: 2.661 ± 0.038
2.596TyrThr: 2.596 ± 0.055
2.209TyrVal: 2.209 ± 0.038
0.525TyrTrp: 0.525 ± 0.018
1.879TyrTyr: 1.879 ± 0.041
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4844 proteins (1663111 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski