Amino acid dipeptide frequency for Kordiimonadales bacterium JCM 17845

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
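As a rough illustration of what such a calculation might look like, the sketch below counts dipeptides over a list of protein sequences and reports them in per mille. It assumes overlapping adjacent pairs that never span two proteins, which this page does not state explicitly. The function name and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: overlapping adjacent pairs, counted within each protein only.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome, not taken from the real data set.
toy_proteome = ["MAAL", "MKAL"]
freqs = dipeptide_permille(toy_proteome)
```

Here "MAAL" contributes MA, AA and AL, and "MKAL" contributes MK, KA and AL, so AL accounts for two of the six counted pairs.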

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
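The comparison against the uniform expectation can be sketched as follows. The three observed values are taken from the table on this page. The `classify` helper and its labels are hypothetical and only mirror the red/blue marking described above; the exact rule used for the colouring is an assumption.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# 1 / 400 expressed in per mille: 2.5 per mille, i.e. 0.25 %.
uniform_permille = 1000.0 / (N_STANDARD ** 2)


def classify(freq_permille, expected=uniform_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # would be marked red
    if freq_permille < expected:
        return "under"  # would be marked blue
    return "as expected"


# Values in per mille copied from the table (rounded to three decimals there).
observed = {"AlaAla": 13.218, "CysCys": 0.121, "IlePro": 2.5}
labels = {pair: classify(value) for pair, value in observed.items()}
```

With these inputs AlaAla comes out as overrepresented and CysCys as underrepresented, while the rounded IlePro value sits exactly at the uniform expectation.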

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰), grouped by first residue; second residue in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.218 ± 0.153
AlaCys: 1.066 ± 0.038
AlaAsp: 7.177 ± 0.105
AlaGlu: 6.067 ± 0.11
AlaPhe: 4.421 ± 0.075
AlaGly: 8.939 ± 0.105
AlaHis: 2.541 ± 0.058
AlaIle: 6.133 ± 0.101
AlaLys: 4.307 ± 0.081
AlaLeu: 13.431 ± 0.164
AlaMet: 3.284 ± 0.066
AlaAsn: 2.858 ± 0.057
AlaPro: 5.162 ± 0.091
AlaGln: 4.934 ± 0.081
AlaArg: 7.827 ± 0.12
AlaSer: 6.104 ± 0.088
AlaThr: 4.815 ± 0.073
AlaVal: 7.696 ± 0.101
AlaTrp: 1.191 ± 0.041
AlaTyr: 2.483 ± 0.055
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.021 ± 0.038
CysCys: 0.121 ± 0.014
CysAsp: 0.591 ± 0.027
CysGlu: 0.38 ± 0.024
CysPhe: 0.357 ± 0.022
CysGly: 0.92 ± 0.036
CysHis: 0.246 ± 0.019
CysIle: 0.395 ± 0.023
CysLys: 0.247 ± 0.017
CysLeu: 0.833 ± 0.035
CysMet: 0.201 ± 0.016
CysAsn: 0.205 ± 0.017
CysPro: 0.495 ± 0.025
CysGln: 0.269 ± 0.017
CysArg: 0.601 ± 0.029
CysSer: 0.455 ± 0.021
CysThr: 0.382 ± 0.025
CysVal: 0.542 ± 0.025
CysTrp: 0.133 ± 0.013
CysTyr: 0.196 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.024 ± 0.106
AspCys: 0.5 ± 0.024
AspAsp: 3.796 ± 0.095
AspGlu: 3.629 ± 0.071
AspPhe: 2.639 ± 0.058
AspGly: 5.461 ± 0.096
AspHis: 1.654 ± 0.045
AspIle: 3.736 ± 0.064
AspLys: 2.129 ± 0.051
AspLeu: 7.095 ± 0.108
AspMet: 1.957 ± 0.052
AspAsn: 1.546 ± 0.058
AspPro: 3.381 ± 0.061
AspGln: 2.392 ± 0.057
AspArg: 4.702 ± 0.088
AspSer: 2.799 ± 0.063
AspThr: 2.656 ± 0.056
AspVal: 4.415 ± 0.083
AspTrp: 0.995 ± 0.04
AspTyr: 1.688 ± 0.045
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.066 ± 0.105
GluCys: 0.328 ± 0.021
GluAsp: 3.557 ± 0.069
GluGlu: 3.503 ± 0.077
GluPhe: 1.573 ± 0.048
GluGly: 4.308 ± 0.074
GluHis: 1.118 ± 0.038
GluIle: 3.295 ± 0.073
GluLys: 2.997 ± 0.067
GluLeu: 4.654 ± 0.084
GluMet: 1.574 ± 0.042
GluAsn: 2.023 ± 0.062
GluPro: 2.325 ± 0.067
GluGln: 2.44 ± 0.058
GluArg: 4.521 ± 0.071
GluSer: 2.821 ± 0.057
GluThr: 3.552 ± 0.065
GluVal: 3.258 ± 0.071
GluTrp: 0.554 ± 0.025
GluTyr: 0.84 ± 0.038
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.987 ± 0.075
PheCys: 0.407 ± 0.021
PheAsp: 2.923 ± 0.073
PheGlu: 2.299 ± 0.055
PhePhe: 1.654 ± 0.057
PheGly: 3.699 ± 0.075
PheHis: 0.781 ± 0.03
PheIle: 1.952 ± 0.053
PheLys: 1.241 ± 0.039
PheLeu: 3.854 ± 0.079
PheMet: 0.947 ± 0.034
PheAsn: 1.24 ± 0.043
PhePro: 1.607 ± 0.047
PheGln: 1.183 ± 0.038
PheArg: 2.045 ± 0.056
PheSer: 2.863 ± 0.064
PheThr: 2.088 ± 0.052
PheVal: 2.528 ± 0.062
PheTrp: 0.557 ± 0.029
PheTyr: 1.009 ± 0.038
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.389 ± 0.116
GlyCys: 0.827 ± 0.036
GlyAsp: 4.903 ± 0.083
GlyGlu: 4.398 ± 0.076
GlyPhe: 3.791 ± 0.065
GlyGly: 6.816 ± 0.119
GlyHis: 1.935 ± 0.062
GlyIle: 4.524 ± 0.076
GlyLys: 3.323 ± 0.062
GlyLeu: 8.962 ± 0.12
GlyMet: 2.15 ± 0.058
GlyAsn: 2.244 ± 0.057
GlyPro: 3.369 ± 0.058
GlyGln: 2.972 ± 0.063
GlyArg: 5.583 ± 0.085
GlySer: 4.234 ± 0.081
GlyThr: 4.302 ± 0.075
GlyVal: 5.591 ± 0.088
GlyTrp: 1.294 ± 0.047
GlyTyr: 2.152 ± 0.052
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.291 ± 0.055
HisCys: 0.262 ± 0.019
HisAsp: 1.308 ± 0.044
HisGlu: 1.182 ± 0.041
HisPhe: 0.866 ± 0.036
HisGly: 1.83 ± 0.057
HisHis: 0.673 ± 0.032
HisIle: 1.259 ± 0.038
HisLys: 0.825 ± 0.032
HisLeu: 2.391 ± 0.054
HisMet: 0.78 ± 0.03
HisAsn: 0.541 ± 0.027
HisPro: 1.342 ± 0.042
HisGln: 0.722 ± 0.03
HisArg: 1.499 ± 0.044
HisSer: 1.177 ± 0.039
HisThr: 0.728 ± 0.03
HisVal: 1.61 ± 0.046
HisTrp: 0.4 ± 0.024
HisTyr: 0.672 ± 0.032
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.57 ± 0.094
IleCys: 0.547 ± 0.025
IleAsp: 3.731 ± 0.073
IleGlu: 3.548 ± 0.065
IlePhe: 1.94 ± 0.05
IleGly: 4.71 ± 0.076
IleHis: 1.108 ± 0.039
IleIle: 2.606 ± 0.074
IleLys: 1.939 ± 0.05
IleLeu: 5.257 ± 0.086
IleMet: 1.204 ± 0.038
IleAsn: 1.705 ± 0.05
IlePro: 2.5 ± 0.054
IleGln: 1.311 ± 0.04
IleArg: 3.294 ± 0.069
IleSer: 3.648 ± 0.079
IleThr: 3.126 ± 0.069
IleVal: 3.677 ± 0.07
IleTrp: 0.673 ± 0.029
IleTyr: 1.188 ± 0.038
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.418 ± 0.087
LysCys: 0.184 ± 0.015
LysAsp: 2.401 ± 0.054
LysGlu: 1.934 ± 0.05
LysPhe: 0.931 ± 0.03
LysGly: 3.085 ± 0.067
LysHis: 0.723 ± 0.032
LysIle: 2.088 ± 0.048
LysLys: 1.78 ± 0.052
LysLeu: 3.18 ± 0.072
LysMet: 0.971 ± 0.037
LysAsn: 1.253 ± 0.041
LysPro: 2.178 ± 0.055
LysGln: 1.263 ± 0.039
LysArg: 2.983 ± 0.068
LysSer: 2.373 ± 0.052
LysThr: 2.471 ± 0.054
LysVal: 2.239 ± 0.056
LysTrp: 0.389 ± 0.02
LysTyr: 0.608 ± 0.027
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.648 ± 0.168
LeuCys: 0.971 ± 0.036
LeuAsp: 6.953 ± 0.1
LeuGlu: 6.117 ± 0.092
LeuPhe: 3.964 ± 0.079
LeuGly: 8.362 ± 0.105
LeuHis: 2.054 ± 0.051
LeuIle: 5.125 ± 0.078
LeuLys: 4.13 ± 0.072
LeuLeu: 9.594 ± 0.128
LeuMet: 2.854 ± 0.068
LeuAsn: 2.796 ± 0.06
LeuPro: 5.18 ± 0.083
LeuGln: 2.765 ± 0.06
LeuArg: 6.021 ± 0.096
LeuSer: 7.597 ± 0.092
LeuThr: 5.193 ± 0.068
LeuVal: 7.464 ± 0.098
LeuTrp: 1.314 ± 0.048
LeuTyr: 2.307 ± 0.052
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.683 ± 0.068
MetCys: 0.175 ± 0.014
MetAsp: 1.607 ± 0.046
MetGlu: 1.473 ± 0.045
MetPhe: 0.726 ± 0.032
MetGly: 2.361 ± 0.052
MetHis: 0.481 ± 0.023
MetIle: 1.475 ± 0.041
MetLys: 1.118 ± 0.04
MetLeu: 2.415 ± 0.046
MetMet: 0.843 ± 0.036
MetAsn: 0.722 ± 0.031
MetPro: 1.6 ± 0.047
MetGln: 0.863 ± 0.036
MetArg: 2.001 ± 0.05
MetSer: 1.64 ± 0.037
MetThr: 1.79 ± 0.052
MetVal: 1.941 ± 0.05
MetTrp: 0.265 ± 0.018
MetTyr: 0.274 ± 0.02
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.14 ± 0.066
AsnCys: 0.246 ± 0.017
AsnAsp: 1.712 ± 0.058
AsnGlu: 1.412 ± 0.044
AsnPhe: 1.106 ± 0.041
AsnGly: 2.532 ± 0.062
AsnHis: 0.59 ± 0.028
AsnIle: 1.619 ± 0.039
AsnLys: 0.893 ± 0.034
AsnLeu: 2.695 ± 0.057
AsnMet: 0.752 ± 0.033
AsnAsn: 0.833 ± 0.038
AsnPro: 1.858 ± 0.046
AsnGln: 0.845 ± 0.032
AsnArg: 2.035 ± 0.053
AsnSer: 1.595 ± 0.062
AsnThr: 1.341 ± 0.042
AsnVal: 1.795 ± 0.051
AsnTrp: 0.459 ± 0.024
AsnTyr: 0.616 ± 0.029
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.868 ± 0.076
ProCys: 0.424 ± 0.021
ProAsp: 4.081 ± 0.071
ProGlu: 3.126 ± 0.062
ProPhe: 2.14 ± 0.06
ProGly: 3.771 ± 0.069
ProHis: 1.191 ± 0.038
ProIle: 2.525 ± 0.054
ProLys: 2.081 ± 0.046
ProLeu: 4.844 ± 0.068
ProMet: 1.433 ± 0.043
ProAsn: 1.39 ± 0.042
ProPro: 2.257 ± 0.059
ProGln: 1.89 ± 0.055
ProArg: 2.446 ± 0.059
ProSer: 2.789 ± 0.059
ProThr: 2.204 ± 0.057
ProVal: 3.862 ± 0.066
ProTrp: 0.626 ± 0.023
ProTyr: 1.181 ± 0.036
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.24 ± 0.079
GlnCys: 0.253 ± 0.016
GlnAsp: 1.887 ± 0.046
GlnGlu: 1.892 ± 0.053
GlnPhe: 1.251 ± 0.043
GlnGly: 2.436 ± 0.051
GlnHis: 0.724 ± 0.03
GlnIle: 2.231 ± 0.054
GlnLys: 1.705 ± 0.042
GlnLeu: 3.03 ± 0.06
GlnMet: 1.123 ± 0.036
GlnAsn: 1.102 ± 0.033
GlnPro: 1.563 ± 0.047
GlnGln: 1.295 ± 0.037
GlnArg: 2.296 ± 0.058
GlnSer: 2.49 ± 0.053
GlnThr: 1.967 ± 0.048
GlnVal: 2.186 ± 0.058
GlnTrp: 0.47 ± 0.02
GlnTyr: 0.573 ± 0.024
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.177 ± 0.099
ArgCys: 0.532 ± 0.022
ArgAsp: 4.002 ± 0.073
ArgGlu: 3.594 ± 0.066
ArgPhe: 3.097 ± 0.062
ArgGly: 4.337 ± 0.073
ArgHis: 1.791 ± 0.056
ArgIle: 4.04 ± 0.07
ArgLys: 2.524 ± 0.058
ArgLeu: 8.116 ± 0.125
ArgMet: 1.893 ± 0.053
ArgAsn: 1.756 ± 0.047
ArgPro: 3.202 ± 0.062
ArgGln: 2.468 ± 0.056
ArgArg: 4.784 ± 0.091
ArgSer: 3.627 ± 0.07
ArgThr: 3.173 ± 0.061
ArgVal: 4.095 ± 0.073
ArgTrp: 0.929 ± 0.034
ArgTyr: 1.778 ± 0.044
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.13 ± 0.089
SerCys: 0.455 ± 0.025
SerAsp: 3.67 ± 0.076
SerGlu: 2.952 ± 0.063
SerPhe: 2.528 ± 0.06
SerGly: 5.482 ± 0.086
SerHis: 1.314 ± 0.036
SerIle: 2.962 ± 0.067
SerLys: 1.995 ± 0.052
SerLeu: 6.458 ± 0.094
SerMet: 1.616 ± 0.045
SerAsn: 1.643 ± 0.056
SerPro: 2.972 ± 0.065
SerGln: 2.034 ± 0.054
SerArg: 3.764 ± 0.069
SerSer: 3.214 ± 0.073
SerThr: 2.89 ± 0.077
SerVal: 4.07 ± 0.077
SerTrp: 0.705 ± 0.029
SerTyr: 1.455 ± 0.039
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.694 ± 0.095
ThrCys: 0.375 ± 0.021
ThrAsp: 3.164 ± 0.064
ThrGlu: 2.607 ± 0.048
ThrPhe: 1.701 ± 0.051
ThrGly: 4.799 ± 0.079
ThrHis: 1.115 ± 0.04
ThrIle: 2.923 ± 0.072
ThrLys: 1.649 ± 0.042
ThrLeu: 5.626 ± 0.094
ThrMet: 1.15 ± 0.035
ThrAsn: 1.4 ± 0.035
ThrPro: 3.113 ± 0.072
ThrGln: 1.609 ± 0.049
ThrArg: 3.002 ± 0.068
ThrSer: 2.703 ± 0.061
ThrThr: 2.255 ± 0.056
ThrVal: 3.889 ± 0.069
ThrTrp: 0.513 ± 0.024
ThrTyr: 1.055 ± 0.039
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.64 ± 0.119
ValCys: 0.539 ± 0.023
ValAsp: 4.54 ± 0.079
ValGlu: 4.208 ± 0.071
ValPhe: 2.735 ± 0.062
ValGly: 4.899 ± 0.078
ValHis: 1.385 ± 0.049
ValIle: 3.802 ± 0.07
ValLys: 2.367 ± 0.063
ValLeu: 7.067 ± 0.103
ValMet: 1.814 ± 0.051
ValAsn: 1.919 ± 0.046
ValPro: 3.302 ± 0.061
ValGln: 1.994 ± 0.048
ValArg: 4.652 ± 0.07
ValSer: 4.207 ± 0.086
ValThr: 3.836 ± 0.066
ValVal: 5.016 ± 0.077
ValTrp: 0.71 ± 0.025
ValTyr: 1.449 ± 0.045
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.18 ± 0.04
TrpCys: 0.15 ± 0.013
TrpAsp: 0.644 ± 0.028
TrpGlu: 0.578 ± 0.026
TrpPhe: 0.512 ± 0.03
TrpGly: 0.829 ± 0.029
TrpHis: 0.344 ± 0.019
TrpIle: 0.627 ± 0.029
TrpLys: 0.446 ± 0.026
TrpLeu: 1.553 ± 0.043
TrpMet: 0.459 ± 0.026
TrpAsn: 0.316 ± 0.02
TrpPro: 0.703 ± 0.029
TrpGln: 0.602 ± 0.025
TrpArg: 1.106 ± 0.039
TrpSer: 0.74 ± 0.032
TrpThr: 0.679 ± 0.032
TrpVal: 0.763 ± 0.034
TrpTrp: 0.22 ± 0.015
TrpTyr: 0.265 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.267 ± 0.053
TyrCys: 0.228 ± 0.016
TyrAsp: 1.546 ± 0.044
TyrGlu: 1.408 ± 0.04
TyrPhe: 0.932 ± 0.033
TyrGly: 2.18 ± 0.049
TyrHis: 0.589 ± 0.031
TyrIle: 0.962 ± 0.033
TyrLys: 0.756 ± 0.035
TyrLeu: 2.357 ± 0.048
TyrMet: 0.448 ± 0.025
TyrAsn: 0.584 ± 0.031
TyrPro: 1.021 ± 0.036
TyrGln: 0.865 ± 0.03
TyrArg: 1.754 ± 0.048
TyrSer: 1.234 ± 0.042
TyrThr: 0.893 ± 0.035
TyrVal: 1.445 ± 0.039
TyrTrp: 0.318 ± 0.02
TyrTyr: 0.613 ± 0.026
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2941 proteins (832409 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
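A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the spread of those estimates is reported. Using the sample standard deviation as the error, the fixed seed, the overlapping-pair counting, the function names and the toy sequences are all assumptions made for illustration; the page only states that bootstrapping (×100) at the protein level was used.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs, counted within each protein only.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from the real data set.
toy = ["MAAL", "MKAL", "MAAAK", "MLLA"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.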

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski