Amino acid dipeptide frequency for Microbacterium sp. XT11

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
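As an illustration of the calculation described above, the following Python sketch counts overlapping residue pairs within each protein and reports them in per mille, together with the uniform expectation. The function names, the toy sequences, and the choice to count pairs only within (not across) proteins are assumptions made for this example; they are not taken from the Proteome-pI code.

```python
from collections import Counter

# One-letter to three-letter codes; anything else is mapped to Xaa (assumption).
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        codes = [ONE_TO_THREE.get(res, "Xaa") for res in seq.upper()]
        # Pair every residue with its successor inside the same protein.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def uniform_expectation(n_residue_types=20):
    """Per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / n_residue_types ** 2


# Hypothetical toy sequences, used only to exercise the functions.
toy = ["MKAAL", "AAGL"]
freqs = dipeptide_frequencies(toy)
print(freqs["AlaAla"], uniform_expectation(), uniform_expectation(21))
```

With 20 residue types the uniform expectation is 2.5‰ (0.25%); with Xaa included as a 21st type it drops to 1000/441, roughly 2.27‰.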

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
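The unit conversion can be sketched in Python as follows, using two values from the table (AlaAla and CysCys). The helper names are hypothetical, and the classification threshold of 2.5‰ assumes that "expected" means the uniform 1/400 baseline; the page does not state the exact rule used for the red and blue marking.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


def classify(value_per_mille, expected_per_mille=2.5):
    """Label a dipeptide relative to an assumed uniform expectation."""
    if value_per_mille > expected_per_mille:
        return "over"
    if value_per_mille < expected_per_mille:
        return "under"
    return "as expected"


ala_ala = 21.419  # AlaAla, per mille, from the table
cys_cys = 0.035   # CysCys, per mille, from the table
print(per_mille_to_fraction(ala_ala), classify(ala_ala), classify(cys_cys))
```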

For more information see sequence space article on Wikipedia.

Dipeptide frequency ± error (‰), grouped by first residue; second residues follow the column order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 21.419 ± 0.28
AlaCys: 0.618 ± 0.033
AlaAsp: 8.953 ± 0.132
AlaGlu: 8.932 ± 0.145
AlaPhe: 3.962 ± 0.082
AlaGly: 12.074 ± 0.167
AlaHis: 2.529 ± 0.067
AlaIle: 5.886 ± 0.113
AlaLys: 2.672 ± 0.086
AlaLeu: 14.341 ± 0.175
AlaMet: 2.823 ± 0.068
AlaAsn: 2.263 ± 0.069
AlaPro: 6.688 ± 0.158
AlaGln: 4.018 ± 0.091
AlaArg: 9.37 ± 0.14
AlaSer: 7.612 ± 0.146
AlaThr: 7.217 ± 0.113
AlaVal: 11.881 ± 0.159
AlaTrp: 1.842 ± 0.05
AlaTyr: 2.459 ± 0.061
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.616 ± 0.034
CysCys: 0.035 ± 0.008
CysAsp: 0.299 ± 0.023
CysGlu: 0.225 ± 0.019
CysPhe: 0.159 ± 0.017
CysGly: 0.463 ± 0.033
CysHis: 0.108 ± 0.014
CysIle: 0.18 ± 0.016
CysLys: 0.067 ± 0.012
CysLeu: 0.398 ± 0.027
CysMet: 0.059 ± 0.012
CysAsn: 0.11 ± 0.014
CysPro: 0.237 ± 0.019
CysGln: 0.097 ± 0.012
CysArg: 0.299 ± 0.024
CysSer: 0.293 ± 0.021
CysThr: 0.306 ± 0.023
CysVal: 0.357 ± 0.025
CysTrp: 0.064 ± 0.011
CysTyr: 0.092 ± 0.012
CysXaa: 0.0 ± 0.0

Asp
AspAla: 10.238 ± 0.166
AspCys: 0.204 ± 0.019
AspAsp: 4.779 ± 0.093
AspGlu: 4.624 ± 0.096
AspPhe: 1.811 ± 0.058
AspGly: 6.679 ± 0.113
AspHis: 1.167 ± 0.047
AspIle: 2.479 ± 0.069
AspLys: 1.022 ± 0.049
AspLeu: 6.252 ± 0.106
AspMet: 0.871 ± 0.039
AspAsn: 0.978 ± 0.042
AspPro: 4.389 ± 0.094
AspGln: 1.398 ± 0.049
AspArg: 4.484 ± 0.089
AspSer: 2.516 ± 0.066
AspThr: 2.831 ± 0.065
AspVal: 5.586 ± 0.1
AspTrp: 0.938 ± 0.049
AspTyr: 1.279 ± 0.051
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.223 ± 0.119
GluCys: 0.209 ± 0.018
GluAsp: 2.784 ± 0.071
GluGlu: 3.207 ± 0.086
GluPhe: 1.809 ± 0.052
GluGly: 4.371 ± 0.096
GluHis: 1.492 ± 0.06
GluIle: 2.901 ± 0.079
GluLys: 1.592 ± 0.056
GluLeu: 6.698 ± 0.103
GluMet: 0.916 ± 0.037
GluAsn: 1.295 ± 0.045
GluPro: 2.895 ± 0.071
GluGln: 2.112 ± 0.068
GluArg: 5.64 ± 0.11
GluSer: 2.954 ± 0.07
GluThr: 3.065 ± 0.068
GluVal: 4.728 ± 0.088
GluTrp: 0.962 ± 0.044
GluTyr: 1.285 ± 0.049
GluXaa: 0.0 ± 0.0

Phe
PheAla: 4.475 ± 0.097
PheCys: 0.132 ± 0.015
PheAsp: 2.42 ± 0.064
PheGlu: 1.811 ± 0.054
PhePhe: 1.097 ± 0.054
PheGly: 3.459 ± 0.1
PheHis: 0.551 ± 0.03
PheIle: 1.116 ± 0.043
PheLys: 0.392 ± 0.03
PheLeu: 2.834 ± 0.087
PheMet: 0.444 ± 0.03
PheAsn: 0.632 ± 0.037
PhePro: 1.562 ± 0.054
PheGln: 0.828 ± 0.045
PheArg: 1.917 ± 0.057
PheSer: 1.736 ± 0.054
PheThr: 2.231 ± 0.066
PheVal: 2.83 ± 0.074
PheTrp: 0.492 ± 0.036
PheTyr: 0.645 ± 0.035
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 11.249 ± 0.17
GlyCys: 0.518 ± 0.032
GlyAsp: 5.287 ± 0.103
GlyGlu: 5.194 ± 0.097
GlyPhe: 3.244 ± 0.087
GlyGly: 7.51 ± 0.134
GlyHis: 1.796 ± 0.056
GlyIle: 4.94 ± 0.103
GlyLys: 2.027 ± 0.067
GlyLeu: 8.467 ± 0.123
GlyMet: 1.962 ± 0.058
GlyAsn: 1.519 ± 0.045
GlyPro: 3.731 ± 0.069
GlyGln: 2.325 ± 0.07
GlyArg: 6.242 ± 0.106
GlySer: 5.116 ± 0.105
GlyThr: 5.564 ± 0.115
GlyVal: 8.064 ± 0.118
GlyTrp: 1.65 ± 0.053
GlyTyr: 2.362 ± 0.066
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.626 ± 0.071
HisCys: 0.102 ± 0.013
HisAsp: 1.405 ± 0.053
HisGlu: 1.239 ± 0.047
HisPhe: 0.549 ± 0.032
HisGly: 2.034 ± 0.068
HisHis: 0.553 ± 0.031
HisIle: 0.733 ± 0.035
HisLys: 0.261 ± 0.019
HisLeu: 2.024 ± 0.056
HisMet: 0.352 ± 0.023
HisAsn: 0.339 ± 0.024
HisPro: 1.526 ± 0.049
HisGln: 0.489 ± 0.03
HisArg: 1.506 ± 0.054
HisSer: 0.925 ± 0.038
HisThr: 1.035 ± 0.044
HisVal: 1.632 ± 0.057
HisTrp: 0.29 ± 0.023
HisTyr: 0.389 ± 0.023
HisXaa: 0.0 ± 0.0

Ile
IleAla: 7.209 ± 0.093
IleCys: 0.245 ± 0.02
IleAsp: 3.648 ± 0.086
IleGlu: 2.881 ± 0.068
IlePhe: 1.145 ± 0.049
IleGly: 4.647 ± 0.093
IleHis: 0.656 ± 0.035
IleIle: 1.725 ± 0.059
IleLys: 0.701 ± 0.038
IleLeu: 3.677 ± 0.085
IleMet: 0.65 ± 0.03
IleAsn: 0.903 ± 0.041
IlePro: 2.47 ± 0.061
IleGln: 0.928 ± 0.041
IleArg: 2.963 ± 0.067
IleSer: 2.194 ± 0.059
IleThr: 2.892 ± 0.069
IleVal: 4.546 ± 0.095
IleTrp: 0.494 ± 0.028
IleTyr: 0.68 ± 0.036
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.473 ± 0.079
LysCys: 0.065 ± 0.009
LysAsp: 1.194 ± 0.053
LysGlu: 1.018 ± 0.043
LysPhe: 0.449 ± 0.029
LysGly: 1.57 ± 0.058
LysHis: 0.435 ± 0.026
LysIle: 0.887 ± 0.045
LysLys: 0.777 ± 0.04
LysLeu: 1.796 ± 0.067
LysMet: 0.4 ± 0.026
LysAsn: 0.521 ± 0.032
LysPro: 1.105 ± 0.047
LysGln: 0.683 ± 0.038
LysArg: 1.325 ± 0.044
LysSer: 1.073 ± 0.046
LysThr: 1.268 ± 0.05
LysVal: 1.588 ± 0.054
LysTrp: 0.228 ± 0.019
LysTyr: 0.416 ± 0.026
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 14.464 ± 0.192
LeuCys: 0.462 ± 0.028
LeuAsp: 6.925 ± 0.117
LeuGlu: 4.976 ± 0.099
LeuPhe: 2.979 ± 0.081
LeuGly: 8.941 ± 0.15
LeuHis: 1.927 ± 0.053
LeuIle: 4.379 ± 0.102
LeuLys: 1.677 ± 0.061
LeuLeu: 9.989 ± 0.168
LeuMet: 1.639 ± 0.058
LeuAsn: 1.613 ± 0.056
LeuPro: 5.429 ± 0.09
LeuGln: 2.502 ± 0.068
LeuArg: 7.957 ± 0.123
LeuSer: 5.976 ± 0.099
LeuThr: 6.265 ± 0.107
LeuVal: 8.895 ± 0.142
LeuTrp: 1.346 ± 0.05
LeuTyr: 1.648 ± 0.058
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.161 ± 0.06
MetCys: 0.086 ± 0.013
MetAsp: 0.806 ± 0.044
MetGlu: 0.624 ± 0.031
MetPhe: 0.557 ± 0.03
MetGly: 1.287 ± 0.057
MetHis: 0.366 ± 0.022
MetIle: 0.906 ± 0.042
MetLys: 0.455 ± 0.026
MetLeu: 2.054 ± 0.063
MetMet: 0.373 ± 0.028
MetAsn: 0.473 ± 0.028
MetPro: 1.323 ± 0.046
MetGln: 0.543 ± 0.029
MetArg: 1.479 ± 0.045
MetSer: 1.455 ± 0.047
MetThr: 1.807 ± 0.054
MetVal: 1.298 ± 0.049
MetTrp: 0.18 ± 0.018
MetTyr: 0.28 ± 0.023
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.37 ± 0.068
AsnCys: 0.086 ± 0.013
AsnAsp: 1.154 ± 0.043
AsnGlu: 0.984 ± 0.04
AsnPhe: 0.608 ± 0.031
AsnGly: 1.887 ± 0.063
AsnHis: 0.357 ± 0.023
AsnIle: 0.771 ± 0.04
AsnLys: 0.389 ± 0.03
AsnLeu: 1.847 ± 0.06
AsnMet: 0.33 ± 0.025
AsnAsn: 0.424 ± 0.033
AsnPro: 1.602 ± 0.053
AsnGln: 0.535 ± 0.029
AsnArg: 1.197 ± 0.047
AsnSer: 0.892 ± 0.035
AsnThr: 1.123 ± 0.043
AsnVal: 1.476 ± 0.048
AsnTrp: 0.282 ± 0.023
AsnTyr: 0.449 ± 0.029
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 7.269 ± 0.155
ProCys: 0.18 ± 0.014
ProAsp: 4.005 ± 0.092
ProGlu: 3.954 ± 0.081
ProPhe: 1.86 ± 0.05
ProGly: 4.981 ± 0.099
ProHis: 1.182 ± 0.046
ProIle: 1.97 ± 0.056
ProLys: 0.944 ± 0.04
ProLeu: 5.263 ± 0.094
ProMet: 0.844 ± 0.041
ProAsn: 0.952 ± 0.043
ProPro: 2.166 ± 0.073
ProGln: 1.591 ± 0.055
ProArg: 3.562 ± 0.082
ProSer: 3.054 ± 0.076
ProThr: 3.08 ± 0.072
ProVal: 4.921 ± 0.097
ProTrp: 0.887 ± 0.043
ProTyr: 1.057 ± 0.047
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.561 ± 0.094
GlnCys: 0.099 ± 0.013
GlnAsp: 1.233 ± 0.045
GlnGlu: 1.36 ± 0.049
GlnPhe: 0.863 ± 0.036
GlnGly: 2.18 ± 0.068
GlnHis: 0.64 ± 0.03
GlnIle: 1.357 ± 0.049
GlnLys: 0.642 ± 0.037
GlnLeu: 2.967 ± 0.075
GlnMet: 0.455 ± 0.027
GlnAsn: 0.694 ± 0.036
GlnPro: 1.419 ± 0.048
GlnGln: 1.078 ± 0.044
GlnArg: 2.37 ± 0.066
GlnSer: 1.416 ± 0.047
GlnThr: 1.562 ± 0.054
GlnVal: 2.199 ± 0.06
GlnTrp: 0.455 ± 0.025
GlnTyr: 0.616 ± 0.033
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 9.222 ± 0.155
ArgCys: 0.271 ± 0.02
ArgAsp: 4.58 ± 0.096
ArgGlu: 4.534 ± 0.095
ArgPhe: 2.483 ± 0.06
ArgGly: 5.483 ± 0.095
ArgHis: 1.629 ± 0.054
ArgIle: 3.836 ± 0.077
ArgLys: 1.32 ± 0.058
ArgLeu: 7.422 ± 0.102
ArgMet: 1.962 ± 0.064
ArgAsn: 1.213 ± 0.045
ArgPro: 3.561 ± 0.074
ArgGln: 2.016 ± 0.06
ArgArg: 7.007 ± 0.134
ArgSer: 4.096 ± 0.089
ArgThr: 4.395 ± 0.092
ArgVal: 6.239 ± 0.113
ArgTrp: 1.194 ± 0.043
ArgTyr: 1.495 ± 0.049
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 7.516 ± 0.148
SerCys: 0.256 ± 0.019
SerAsp: 3.285 ± 0.077
SerGlu: 2.701 ± 0.065
SerPhe: 1.889 ± 0.053
SerGly: 5.664 ± 0.104
SerHis: 1.01 ± 0.041
SerIle: 2.53 ± 0.069
SerLys: 0.998 ± 0.04
SerLeu: 5.128 ± 0.1
SerMet: 1.264 ± 0.04
SerAsn: 1.0 ± 0.043
SerPro: 2.954 ± 0.067
SerGln: 1.33 ± 0.043
SerArg: 3.792 ± 0.077
SerSer: 3.282 ± 0.085
SerThr: 3.618 ± 0.092
SerVal: 4.774 ± 0.103
SerTrp: 0.955 ± 0.051
SerTyr: 1.104 ± 0.042
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 8.048 ± 0.125
ThrCys: 0.245 ± 0.019
ThrAsp: 3.629 ± 0.075
ThrGlu: 3.011 ± 0.078
ThrPhe: 1.914 ± 0.06
ThrGly: 5.803 ± 0.094
ThrHis: 1.148 ± 0.046
ThrIle: 2.892 ± 0.075
ThrLys: 1.151 ± 0.048
ThrLeu: 5.903 ± 0.093
ThrMet: 1.033 ± 0.041
ThrAsn: 1.126 ± 0.056
ThrPro: 4.051 ± 0.09
ThrGln: 1.339 ± 0.053
ThrArg: 3.731 ± 0.075
ThrSer: 3.306 ± 0.088
ThrThr: 3.89 ± 0.087
ThrVal: 5.812 ± 0.123
ThrTrp: 0.881 ± 0.037
ThrTyr: 1.209 ± 0.048
ThrXaa: 0.0 ± 0.0

Val
ValAla: 11.365 ± 0.152
ValCys: 0.462 ± 0.026
ValAsp: 5.882 ± 0.084
ValGlu: 4.96 ± 0.102
ValPhe: 2.92 ± 0.084
ValGly: 6.683 ± 0.123
ValHis: 1.803 ± 0.055
ValIle: 4.315 ± 0.091
ValLys: 1.543 ± 0.055
ValLeu: 9.241 ± 0.162
ValMet: 1.455 ± 0.056
ValAsn: 1.788 ± 0.054
ValPro: 4.852 ± 0.091
ValGln: 2.304 ± 0.061
ValArg: 6.287 ± 0.113
ValSer: 5.201 ± 0.11
ValThr: 5.844 ± 0.112
ValVal: 8.784 ± 0.161
ValTrp: 1.188 ± 0.046
ValTyr: 1.483 ± 0.057
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.594 ± 0.059
TrpCys: 0.107 ± 0.013
TrpAsp: 0.86 ± 0.031
TrpGlu: 0.748 ± 0.037
TrpPhe: 0.6 ± 0.035
TrpGly: 1.134 ± 0.053
TrpHis: 0.358 ± 0.025
TrpIle: 0.745 ± 0.032
TrpLys: 0.298 ± 0.024
TrpLeu: 1.635 ± 0.053
TrpMet: 0.385 ± 0.026
TrpAsn: 0.486 ± 0.029
TrpPro: 0.734 ± 0.037
TrpGln: 0.535 ± 0.027
TrpArg: 1.213 ± 0.056
TrpSer: 0.876 ± 0.041
TrpThr: 0.893 ± 0.039
TrpVal: 1.121 ± 0.048
TrpTrp: 0.392 ± 0.026
TrpTyr: 0.32 ± 0.023
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.462 ± 0.066
TyrCys: 0.084 ± 0.011
TyrAsp: 1.397 ± 0.046
TyrGlu: 1.154 ± 0.042
TyrPhe: 0.705 ± 0.035
TyrGly: 1.917 ± 0.061
TyrHis: 0.301 ± 0.019
TyrIle: 0.705 ± 0.037
TyrLys: 0.341 ± 0.022
TyrLeu: 2.053 ± 0.062
TyrMet: 0.296 ± 0.024
TyrAsn: 0.433 ± 0.03
TyrPro: 1.029 ± 0.045
TyrGln: 0.514 ± 0.028
TyrArg: 1.613 ± 0.053
TyrSer: 1.076 ± 0.039
TyrThr: 1.237 ± 0.054
TyrVal: 1.631 ± 0.054
TyrTrp: 0.317 ± 0.02
TyrTyr: 0.473 ± 0.027
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 1927 proteins (627981 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
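A protein-level bootstrap of this kind might look like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. Using the standard deviation as the error, the function names, the fixed seed, and the toy sequences are all assumptions for illustration; the page does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per-mille frequency.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome, used only to exercise the function.
toy = ["MKAAL", "AAGL", "GGGAA"]
mean, err = bootstrap_error(toy, ("A", "A"))
print(mean, err)
```

Resampling whole proteins keeps the residues of each sequence together, so correlations within a protein are reflected in the estimated error.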

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski