Amino acid dipeptide frequency for Sphingobacteriales bacterium

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
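The counting described above can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_permille`, the use of one-letter codes, and the rule that every non-standard letter is mapped to `X` (Xaa) are assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs inside one protein; pairs never span two proteins.
        for a, b in zip(seq, seq[1:]):
            # Assumption: any letter outside the 20 standard ones is treated as X (Xaa).
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (not real proteome data): six dipeptides in total.
freqs = dipeptide_permille(["MKKLA", "AAL"])
```

Running this over every protein of a proteome would give one value per cell of a 21 × 21 table, with the first residue as the row and the second as the column.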

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
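As a small worked example of the unit conversion and of the over/under-representation idea, the sketch below takes four values from the table and compares them with the uniform expectation of 1/400. The helper names `to_fraction` and `label` are hypothetical, and using the uniform expectation as the colour threshold is an assumption; the page does not state the exact rule behind the red and blue marking.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely

# Four values copied from the table (in per mille).
sample = {"AlaAla": 10.126, "LeuLeu": 11.592, "CysCys": 0.148, "TrpTrp": 0.228}


def to_fraction(permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return permille * 1e-3


def label(permille, expected=UNIFORM_PERMILLE):
    """Classify a value against the expectation (assumed threshold, see lead-in)."""
    if permille > expected:
        return "over"
    if permille < expected:
        return "under"
    return "as expected"


labels = {name: label(value) for name, value in sample.items()}
fractions = {name: to_fraction(value) for name, value in sample.items()}
```

With these inputs, AlaAla and LeuLeu fall above 2.5‰ and CysCys and TrpTrp fall below it.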

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is the frequency in ‰ ± estimated error.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 10.126 ± 0.14 | 0.876 ± 0.035 | 5.099 ± 0.079 | 4.723 ± 0.094 | 4.199 ± 0.077 | 7.347 ± 0.126 | 1.687 ± 0.048 | 5.276 ± 0.082 | 3.519 ± 0.083 | 9.679 ± 0.126 | 2.15 ± 0.052 | 3.379 ± 0.076 | 3.912 ± 0.085 | 4.398 ± 0.082 | 4.483 ± 0.079 | 5.582 ± 0.107 | 6.568 ± 0.179 | 6.44 ± 0.103 | 0.944 ± 0.032 | 3.145 ± 0.048 | 0.0 ± 0.0 |
| Cys | 0.765 ± 0.046 | 0.148 ± 0.013 | 0.431 ± 0.019 | 0.433 ± 0.019 | 0.393 ± 0.023 | 0.774 ± 0.03 | 0.192 ± 0.016 | 0.584 ± 0.027 | 0.329 ± 0.021 | 0.768 ± 0.027 | 0.218 ± 0.014 | 0.414 ± 0.026 | 0.463 ± 0.026 | 0.301 ± 0.019 | 0.488 ± 0.023 | 0.617 ± 0.034 | 0.661 ± 0.039 | 0.544 ± 0.025 | 0.112 ± 0.011 | 0.306 ± 0.02 | 0.0 ± 0.0 |
| Asp | 5.411 ± 0.081 | 0.474 ± 0.022 | 2.284 ± 0.052 | 2.845 ± 0.071 | 2.442 ± 0.06 | 3.987 ± 0.08 | 0.878 ± 0.032 | 2.642 ± 0.053 | 2.176 ± 0.056 | 4.891 ± 0.081 | 0.92 ± 0.034 | 1.767 ± 0.046 | 2.364 ± 0.063 | 1.652 ± 0.046 | 2.75 ± 0.065 | 3.323 ± 0.071 | 3.334 ± 0.072 | 3.481 ± 0.065 | 0.82 ± 0.029 | 2.183 ± 0.049 | 0.0 ± 0.0 |
| Glu | 5.969 ± 0.116 | 0.304 ± 0.017 | 2.61 ± 0.064 | 3.025 ± 0.076 | 1.632 ± 0.043 | 3.701 ± 0.074 | 1.033 ± 0.034 | 2.947 ± 0.072 | 2.993 ± 0.081 | 5.142 ± 0.095 | 1.254 ± 0.037 | 2.104 ± 0.056 | 1.869 ± 0.052 | 2.406 ± 0.061 | 3.054 ± 0.067 | 2.767 ± 0.063 | 3.427 ± 0.074 | 3.586 ± 0.076 | 0.691 ± 0.028 | 1.611 ± 0.043 | 0.0 ± 0.0 |
| Phe | 3.372 ± 0.069 | 0.44 ± 0.021 | 2.335 ± 0.05 | 2.355 ± 0.058 | 2.024 ± 0.051 | 3.197 ± 0.069 | 0.686 ± 0.028 | 2.302 ± 0.064 | 1.803 ± 0.052 | 3.911 ± 0.072 | 0.961 ± 0.032 | 2.003 ± 0.049 | 1.998 ± 0.043 | 1.444 ± 0.038 | 2.833 ± 0.056 | 3.234 ± 0.062 | 2.949 ± 0.066 | 2.659 ± 0.057 | 0.586 ± 0.025 | 1.676 ± 0.043 | 0.0 ± 0.0 |
| Gly | 6.385 ± 0.102 | 0.891 ± 0.056 | 3.491 ± 0.063 | 3.332 ± 0.074 | 3.46 ± 0.069 | 6.298 ± 0.167 | 1.327 ± 0.04 | 5.058 ± 0.08 | 3.96 ± 0.066 | 7.107 ± 0.1 | 2.006 ± 0.049 | 3.337 ± 0.086 | 2.147 ± 0.049 | 2.884 ± 0.054 | 3.714 ± 0.081 | 5.189 ± 0.11 | 6.356 ± 0.158 | 4.771 ± 0.084 | 1.126 ± 0.038 | 2.949 ± 0.068 | 0.0 ± 0.0 |
| His | 1.388 ± 0.045 | 0.213 ± 0.014 | 0.796 ± 0.031 | 0.853 ± 0.033 | 1.031 ± 0.034 | 1.217 ± 0.042 | 0.486 ± 0.027 | 1.074 ± 0.032 | 0.742 ± 0.032 | 2.087 ± 0.048 | 0.337 ± 0.019 | 0.749 ± 0.03 | 1.389 ± 0.038 | 0.892 ± 0.036 | 1.153 ± 0.036 | 1.144 ± 0.034 | 1.216 ± 0.038 | 0.963 ± 0.031 | 0.307 ± 0.018 | 0.81 ± 0.031 | 0.0 ± 0.0 |
| Ile | 5.371 ± 0.082 | 0.567 ± 0.025 | 3.255 ± 0.072 | 3.098 ± 0.074 | 2.009 ± 0.056 | 4.536 ± 0.086 | 1.169 ± 0.038 | 2.812 ± 0.076 | 2.249 ± 0.057 | 5.222 ± 0.084 | 0.847 ± 0.034 | 2.361 ± 0.06 | 2.96 ± 0.059 | 2.345 ± 0.052 | 3.707 ± 0.063 | 3.693 ± 0.065 | 3.559 ± 0.069 | 3.505 ± 0.073 | 0.632 ± 0.029 | 1.796 ± 0.049 | 0.0 ± 0.0 |
| Lys | 4.718 ± 0.09 | 0.229 ± 0.018 | 2.593 ± 0.061 | 2.766 ± 0.065 | 1.355 ± 0.045 | 3.435 ± 0.072 | 0.808 ± 0.031 | 2.644 ± 0.059 | 2.802 ± 0.074 | 4.027 ± 0.08 | 1.246 ± 0.047 | 2.003 ± 0.054 | 2.255 ± 0.053 | 1.813 ± 0.046 | 2.353 ± 0.05 | 2.609 ± 0.058 | 3.251 ± 0.064 | 2.81 ± 0.059 | 0.531 ± 0.026 | 1.463 ± 0.045 | 0.0 ± 0.0 |
| Leu | 8.787 ± 0.119 | 0.831 ± 0.032 | 4.71 ± 0.083 | 4.979 ± 0.105 | 4.279 ± 0.068 | 6.503 ± 0.097 | 2.101 ± 0.057 | 5.094 ± 0.097 | 4.813 ± 0.078 | 11.592 ± 0.178 | 2.192 ± 0.054 | 4.102 ± 0.067 | 5.508 ± 0.096 | 5.061 ± 0.093 | 5.767 ± 0.099 | 6.842 ± 0.101 | 6.093 ± 0.082 | 6.169 ± 0.09 | 1.123 ± 0.042 | 3.399 ± 0.068 | 0.0 ± 0.0 |
| Met | 2.276 ± 0.051 | 0.122 ± 0.012 | 1.191 ± 0.036 | 1.158 ± 0.038 | 0.601 ± 0.028 | 1.653 ± 0.047 | 0.481 ± 0.025 | 1.162 ± 0.038 | 1.307 ± 0.039 | 2.169 ± 0.046 | 0.636 ± 0.028 | 1.092 ± 0.037 | 1.219 ± 0.033 | 1.109 ± 0.034 | 1.345 ± 0.044 | 1.2 ± 0.036 | 1.327 ± 0.037 | 1.379 ± 0.033 | 0.173 ± 0.015 | 0.596 ± 0.028 | 0.0 ± 0.0 |
| Asn | 4.013 ± 0.08 | 0.437 ± 0.036 | 2.06 ± 0.054 | 1.933 ± 0.043 | 1.579 ± 0.046 | 3.896 ± 0.107 | 0.748 ± 0.029 | 2.36 ± 0.068 | 1.61 ± 0.042 | 4.017 ± 0.069 | 0.849 ± 0.028 | 2.014 ± 0.072 | 2.764 ± 0.072 | 1.651 ± 0.043 | 2.503 ± 0.061 | 2.426 ± 0.07 | 2.956 ± 0.093 | 2.586 ± 0.065 | 0.618 ± 0.028 | 1.65 ± 0.046 | 0.0 ± 0.0 |
| Pro | 4.997 ± 0.091 | 0.299 ± 0.017 | 3.103 ± 0.06 | 3.985 ± 0.076 | 2.109 ± 0.05 | 3.243 ± 0.066 | 0.78 ± 0.033 | 2.261 ± 0.05 | 1.914 ± 0.05 | 4.017 ± 0.062 | 1.035 ± 0.033 | 1.976 ± 0.059 | 1.576 ± 0.045 | 2.065 ± 0.051 | 1.653 ± 0.048 | 2.641 ± 0.067 | 2.849 ± 0.077 | 4.401 ± 0.09 | 0.471 ± 0.024 | 1.688 ± 0.042 | 0.0 ± 0.0 |
| Gln | 3.759 ± 0.067 | 0.27 ± 0.016 | 1.759 ± 0.045 | 2.123 ± 0.052 | 1.682 ± 0.045 | 2.334 ± 0.052 | 0.903 ± 0.032 | 2.291 ± 0.048 | 2.365 ± 0.062 | 4.727 ± 0.08 | 1.038 ± 0.035 | 1.857 ± 0.051 | 2.307 ± 0.052 | 2.733 ± 0.083 | 2.37 ± 0.055 | 2.586 ± 0.064 | 3.021 ± 0.063 | 2.969 ± 0.06 | 0.606 ± 0.03 | 1.713 ± 0.048 | 0.0 ± 0.0 |
| Arg | 3.652 ± 0.072 | 0.428 ± 0.024 | 2.445 ± 0.053 | 2.781 ± 0.071 | 2.811 ± 0.053 | 2.754 ± 0.061 | 1.083 ± 0.037 | 3.853 ± 0.057 | 3.245 ± 0.068 | 5.6 ± 0.087 | 1.737 ± 0.049 | 2.698 ± 0.055 | 2.278 ± 0.058 | 2.55 ± 0.065 | 2.887 ± 0.072 | 3.159 ± 0.059 | 3.387 ± 0.058 | 3.304 ± 0.057 | 0.794 ± 0.033 | 2.451 ± 0.054 | 0.0 ± 0.0 |
| Ser | 5.349 ± 0.093 | 0.706 ± 0.035 | 2.853 ± 0.057 | 2.724 ± 0.057 | 3.174 ± 0.058 | 7.076 ± 0.135 | 1.077 ± 0.035 | 3.503 ± 0.074 | 2.501 ± 0.055 | 6.04 ± 0.098 | 1.35 ± 0.042 | 2.731 ± 0.076 | 2.703 ± 0.07 | 2.248 ± 0.047 | 3.063 ± 0.055 | 3.891 ± 0.099 | 3.952 ± 0.094 | 4.305 ± 0.078 | 0.814 ± 0.033 | 2.436 ± 0.06 | 0.0 ± 0.0 |
| Thr | 7.04 ± 0.165 | 0.53 ± 0.032 | 3.81 ± 0.073 | 3.204 ± 0.069 | 2.739 ± 0.061 | 6.01 ± 0.13 | 1.175 ± 0.039 | 3.818 ± 0.074 | 2.207 ± 0.051 | 6.818 ± 0.087 | 1.071 ± 0.033 | 2.69 ± 0.088 | 3.833 ± 0.093 | 2.431 ± 0.05 | 2.881 ± 0.051 | 3.918 ± 0.107 | 4.862 ± 0.182 | 5.2 ± 0.143 | 0.688 ± 0.028 | 2.633 ± 0.09 | 0.0 ± 0.0 |
| Val | 6.329 ± 0.101 | 0.704 ± 0.029 | 3.21 ± 0.064 | 3.087 ± 0.063 | 3.039 ± 0.059 | 4.195 ± 0.074 | 1.18 ± 0.042 | 3.711 ± 0.078 | 2.945 ± 0.061 | 7.074 ± 0.096 | 1.355 ± 0.037 | 2.802 ± 0.069 | 3.444 ± 0.062 | 2.895 ± 0.05 | 3.799 ± 0.064 | 4.642 ± 0.088 | 4.409 ± 0.122 | 4.98 ± 0.09 | 0.799 ± 0.034 | 2.463 ± 0.062 | 0.0 ± 0.0 |
| Trp | 0.986 ± 0.034 | 0.113 ± 0.011 | 0.577 ± 0.026 | 0.663 ± 0.029 | 0.488 ± 0.024 | 0.867 ± 0.035 | 0.265 ± 0.017 | 0.749 ± 0.026 | 0.725 ± 0.028 | 1.308 ± 0.045 | 0.38 ± 0.02 | 0.69 ± 0.032 | 0.39 ± 0.022 | 0.671 ± 0.031 | 0.655 ± 0.025 | 0.735 ± 0.034 | 0.767 ± 0.031 | 0.872 ± 0.031 | 0.228 ± 0.016 | 0.482 ± 0.024 | 0.0 ± 0.0 |
| Tyr | 2.836 ± 0.05 | 0.359 ± 0.022 | 2.012 ± 0.053 | 2.058 ± 0.051 | 1.722 ± 0.047 | 2.764 ± 0.052 | 0.79 ± 0.028 | 1.555 ± 0.039 | 1.508 ± 0.039 | 3.748 ± 0.069 | 0.612 ± 0.026 | 1.9 ± 0.052 | 1.685 ± 0.049 | 1.865 ± 0.048 | 2.439 ± 0.056 | 2.296 ± 0.054 | 2.614 ± 0.078 | 2.14 ± 0.049 | 0.547 ± 0.025 | 1.602 ± 0.051 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 2778 proteins (974968 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
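One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the frequency for each resample, and report the spread of the results. The sketch below follows that reading. It is an assumption about the procedure, not the database's published code, and the names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Return (mean, standard deviation) of one dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so residues of one protein
    always stay together (assumed meaning of "at the protein level").
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (not real proteome data).
mean_kk, err_kk = bootstrap_error(["MKKLA", "AAL", "KKKK"], "KK")
```

Counting pairs once per protein and resampling the counts keeps each replicate cheap, which matters when the input is thousands of proteins rather than three toy sequences.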

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski