Amino acid dipeptide frequency for Desulfosporosinus fructosivorans

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
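As an illustration of what such a calculation might look like, here is a minimal Python sketch. It is not the database's actual code. It assumes sequences are given as one-letter strings, that pairs are counted with overlap inside each protein but never across protein boundaries, and that the result is normalised by the total number of dipeptides. The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each
    protein only, and frequencies are normalised by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input: "MKAAL" yields MK, KA, AA, AL; "AAG" yields AA, AG.
toy = ["MKAAL", "AAG"]
freqs = dipeptide_per_mille(toy)
```

With this toy input, "AA" accounts for two of the six counted pairs, and all frequencies sum to 1000‰.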

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
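The arithmetic behind the expected value and the unit conversion can be sketched in a few lines of Python. This is only an illustration: the page does not state the exact rule used for colouring, so the sketch assumes a plain comparison against the uniform expectation and ignores the error margins. The names `EXPECTED_PER_MILLE`, `per_mille_to_fraction` and `classify` are hypothetical; the three example values are taken from the table.

```python
# Uniform expectation over the 20 x 20 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Example values (per mille) copied from the table.
examples = {"LeuLeu": 10.304, "AlaAla": 6.277, "TrpTrp": 0.143}
labels = {pair: classify(value) for pair, value in examples.items()}
```

Under this assumed rule, LeuLeu and AlaAla count as overrepresented and TrpTrp as underrepresented.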

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Values are in ‰, given as frequency ± error.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 6.277 ± 0.078 | 0.978 ± 0.025 | 3.673 ± 0.06 | 5.081 ± 0.07 | 3.065 ± 0.051 | 6.054 ± 0.062 | 1.292 ± 0.029 | 5.82 ± 0.058 | 4.854 ± 0.067 | 8.53 ± 0.083 | 2.26 ± 0.039 | 2.82 ± 0.042 | 2.278 ± 0.043 | 3.011 ± 0.059 | 3.448 ± 0.046 | 4.317 ± 0.056 | 3.962 ± 0.058 | 5.91 ± 0.071 | 0.778 ± 0.022 | 2.283 ± 0.044 | 0.0 ± 0.0 |
| Cys | 0.807 ± 0.023 | 0.207 ± 0.013 | 0.576 ± 0.019 | 0.639 ± 0.022 | 0.428 ± 0.015 | 1.224 ± 0.029 | 0.287 ± 0.017 | 0.76 ± 0.02 | 0.591 ± 0.02 | 1.117 ± 0.026 | 0.286 ± 0.013 | 0.439 ± 0.015 | 0.685 ± 0.026 | 0.426 ± 0.016 | 0.563 ± 0.019 | 0.766 ± 0.025 | 0.643 ± 0.022 | 0.73 ± 0.023 | 0.122 ± 0.008 | 0.362 ± 0.015 | 0.0 ± 0.0 |
| Asp | 3.328 ± 0.05 | 0.569 ± 0.018 | 2.138 ± 0.037 | 3.483 ± 0.048 | 2.31 ± 0.041 | 3.303 ± 0.048 | 0.904 ± 0.021 | 4.064 ± 0.051 | 3.025 ± 0.045 | 5.48 ± 0.054 | 1.322 ± 0.027 | 1.853 ± 0.031 | 2.092 ± 0.04 | 1.831 ± 0.033 | 2.228 ± 0.04 | 2.767 ± 0.042 | 2.359 ± 0.037 | 3.557 ± 0.045 | 0.598 ± 0.018 | 2.029 ± 0.039 | 0.0 ± 0.0 |
| Glu | 5.15 ± 0.063 | 0.648 ± 0.021 | 3.28 ± 0.054 | 5.203 ± 0.06 | 2.506 ± 0.041 | 4.348 ± 0.052 | 1.224 ± 0.025 | 5.529 ± 0.067 | 4.912 ± 0.062 | 7.068 ± 0.074 | 2.001 ± 0.034 | 2.983 ± 0.039 | 1.809 ± 0.033 | 2.785 ± 0.042 | 3.558 ± 0.055 | 3.414 ± 0.05 | 3.391 ± 0.048 | 4.956 ± 0.068 | 0.699 ± 0.02 | 2.031 ± 0.034 | 0.0 ± 0.0 |
| Phe | 3.004 ± 0.044 | 0.537 ± 0.018 | 2.074 ± 0.041 | 2.455 ± 0.038 | 1.767 ± 0.038 | 3.284 ± 0.051 | 0.706 ± 0.023 | 3.043 ± 0.048 | 2.146 ± 0.03 | 4.238 ± 0.06 | 1.093 ± 0.027 | 1.732 ± 0.03 | 1.656 ± 0.033 | 1.35 ± 0.031 | 1.636 ± 0.033 | 2.999 ± 0.042 | 2.179 ± 0.038 | 2.866 ± 0.046 | 0.467 ± 0.021 | 1.457 ± 0.032 | 0.0 ± 0.0 |
| Gly | 5.501 ± 0.062 | 1.106 ± 0.029 | 3.263 ± 0.051 | 4.46 ± 0.054 | 3.326 ± 0.046 | 5.45 ± 0.07 | 1.343 ± 0.028 | 6.519 ± 0.072 | 5.089 ± 0.06 | 7.709 ± 0.083 | 2.351 ± 0.041 | 2.802 ± 0.042 | 1.94 ± 0.036 | 2.577 ± 0.041 | 3.114 ± 0.043 | 4.304 ± 0.057 | 4.301 ± 0.062 | 5.742 ± 0.064 | 0.839 ± 0.021 | 2.787 ± 0.045 | 0.0 ± 0.0 |
| His | 1.195 ± 0.03 | 0.276 ± 0.014 | 0.871 ± 0.022 | 1.13 ± 0.028 | 0.844 ± 0.022 | 1.322 ± 0.033 | 0.462 ± 0.019 | 1.216 ± 0.025 | 0.916 ± 0.028 | 1.945 ± 0.036 | 0.446 ± 0.016 | 0.714 ± 0.022 | 1.053 ± 0.024 | 0.693 ± 0.02 | 0.847 ± 0.026 | 1.095 ± 0.027 | 0.974 ± 0.026 | 1.145 ± 0.025 | 0.25 ± 0.012 | 0.692 ± 0.022 | 0.0 ± 0.0 |
| Ile | 6.116 ± 0.062 | 0.883 ± 0.028 | 3.894 ± 0.05 | 4.962 ± 0.066 | 3.055 ± 0.04 | 5.743 ± 0.058 | 1.382 ± 0.031 | 5.679 ± 0.086 | 4.399 ± 0.05 | 7.809 ± 0.076 | 1.999 ± 0.034 | 3.259 ± 0.045 | 3.618 ± 0.047 | 2.632 ± 0.042 | 3.348 ± 0.048 | 5.215 ± 0.065 | 4.235 ± 0.049 | 5.283 ± 0.059 | 0.645 ± 0.019 | 2.271 ± 0.039 | 0.0 ± 0.0 |
| Lys | 4.859 ± 0.054 | 0.544 ± 0.02 | 3.335 ± 0.051 | 4.898 ± 0.06 | 1.936 ± 0.038 | 4.192 ± 0.047 | 1.002 ± 0.025 | 4.642 ± 0.05 | 4.012 ± 0.051 | 5.652 ± 0.069 | 1.846 ± 0.034 | 2.769 ± 0.044 | 2.254 ± 0.04 | 2.227 ± 0.039 | 2.879 ± 0.047 | 3.368 ± 0.052 | 3.413 ± 0.044 | 4.711 ± 0.051 | 0.544 ± 0.019 | 2.009 ± 0.039 | 0.0 ± 0.0 |
| Leu | 8.822 ± 0.089 | 1.098 ± 0.027 | 5.265 ± 0.056 | 6.972 ± 0.077 | 4.08 ± 0.059 | 7.957 ± 0.085 | 1.704 ± 0.034 | 7.482 ± 0.072 | 6.542 ± 0.062 | 10.304 ± 0.109 | 2.71 ± 0.043 | 4.409 ± 0.049 | 4.304 ± 0.057 | 3.5 ± 0.048 | 4.722 ± 0.059 | 7.061 ± 0.072 | 5.977 ± 0.06 | 7.077 ± 0.069 | 0.983 ± 0.025 | 2.746 ± 0.043 | 0.0 ± 0.0 |
| Met | 2.438 ± 0.039 | 0.248 ± 0.011 | 1.539 ± 0.031 | 1.83 ± 0.039 | 0.96 ± 0.027 | 2.199 ± 0.042 | 0.462 ± 0.017 | 1.988 ± 0.033 | 1.837 ± 0.036 | 2.621 ± 0.038 | 0.775 ± 0.022 | 1.281 ± 0.03 | 1.08 ± 0.027 | 0.934 ± 0.029 | 1.258 ± 0.026 | 1.753 ± 0.034 | 1.581 ± 0.031 | 2.002 ± 0.036 | 0.196 ± 0.011 | 0.651 ± 0.02 | 0.0 ± 0.0 |
| Asn | 2.819 ± 0.042 | 0.543 ± 0.018 | 1.848 ± 0.032 | 2.396 ± 0.041 | 1.609 ± 0.028 | 2.833 ± 0.042 | 0.789 ± 0.021 | 3.293 ± 0.042 | 2.429 ± 0.04 | 4.318 ± 0.051 | 1.107 ± 0.026 | 1.834 ± 0.042 | 2.284 ± 0.038 | 1.602 ± 0.03 | 1.837 ± 0.035 | 2.577 ± 0.041 | 2.136 ± 0.039 | 2.934 ± 0.045 | 0.468 ± 0.017 | 1.572 ± 0.029 | 0.0 ± 0.0 |
| Pro | 2.689 ± 0.051 | 0.399 ± 0.016 | 2.108 ± 0.039 | 3.187 ± 0.044 | 1.685 ± 0.033 | 2.839 ± 0.05 | 0.721 ± 0.022 | 2.833 ± 0.04 | 2.115 ± 0.037 | 3.966 ± 0.053 | 0.95 ± 0.024 | 1.595 ± 0.03 | 1.251 ± 0.033 | 1.433 ± 0.031 | 1.454 ± 0.031 | 2.39 ± 0.044 | 2.125 ± 0.042 | 3.074 ± 0.041 | 0.458 ± 0.018 | 1.347 ± 0.026 | 0.0 ± 0.0 |
| Gln | 3.227 ± 0.07 | 0.358 ± 0.016 | 1.757 ± 0.033 | 2.821 ± 0.048 | 1.326 ± 0.03 | 2.774 ± 0.056 | 0.666 ± 0.021 | 2.715 ± 0.04 | 2.363 ± 0.036 | 3.462 ± 0.044 | 1.044 ± 0.022 | 1.454 ± 0.031 | 1.208 ± 0.031 | 1.436 ± 0.034 | 1.777 ± 0.033 | 2.015 ± 0.038 | 2.031 ± 0.037 | 2.757 ± 0.048 | 0.401 ± 0.017 | 1.095 ± 0.027 | 0.0 ± 0.0 |
| Arg | 3.095 ± 0.047 | 0.517 ± 0.018 | 2.209 ± 0.037 | 3.461 ± 0.046 | 1.914 ± 0.037 | 2.776 ± 0.045 | 0.852 ± 0.026 | 3.528 ± 0.049 | 2.929 ± 0.047 | 4.834 ± 0.056 | 1.316 ± 0.027 | 1.855 ± 0.041 | 1.557 ± 0.036 | 1.835 ± 0.035 | 2.291 ± 0.04 | 2.405 ± 0.037 | 2.273 ± 0.035 | 3.296 ± 0.041 | 0.54 ± 0.019 | 1.541 ± 0.03 | 0.0 ± 0.0 |
| Ser | 4.318 ± 0.056 | 0.705 ± 0.021 | 2.853 ± 0.048 | 3.885 ± 0.049 | 2.723 ± 0.049 | 5.048 ± 0.063 | 1.136 ± 0.026 | 4.645 ± 0.06 | 3.585 ± 0.055 | 6.586 ± 0.064 | 1.707 ± 0.031 | 2.415 ± 0.039 | 2.427 ± 0.039 | 2.255 ± 0.038 | 2.672 ± 0.038 | 4.076 ± 0.063 | 3.221 ± 0.048 | 4.367 ± 0.055 | 0.676 ± 0.022 | 1.981 ± 0.035 | 0.0 ± 0.0 |
| Thr | 4.189 ± 0.062 | 0.597 ± 0.022 | 2.644 ± 0.042 | 3.28 ± 0.052 | 2.173 ± 0.035 | 4.623 ± 0.054 | 1.037 ± 0.027 | 4.038 ± 0.05 | 2.911 ± 0.043 | 5.766 ± 0.064 | 1.384 ± 0.031 | 2.171 ± 0.036 | 2.556 ± 0.036 | 1.887 ± 0.035 | 2.134 ± 0.037 | 3.26 ± 0.052 | 3.17 ± 0.048 | 4.212 ± 0.054 | 0.564 ± 0.021 | 1.662 ± 0.037 | 0.0 ± 0.0 |
| Val | 5.741 ± 0.067 | 0.876 ± 0.024 | 3.742 ± 0.047 | 4.751 ± 0.055 | 3.055 ± 0.049 | 5.264 ± 0.066 | 1.233 ± 0.027 | 5.737 ± 0.056 | 4.116 ± 0.057 | 7.646 ± 0.066 | 2.031 ± 0.038 | 3.001 ± 0.042 | 2.758 ± 0.039 | 2.484 ± 0.04 | 3.161 ± 0.043 | 4.779 ± 0.06 | 4.1 ± 0.065 | 5.61 ± 0.069 | 0.72 ± 0.021 | 2.177 ± 0.04 | 0.0 ± 0.0 |
| Trp | 0.768 ± 0.024 | 0.117 ± 0.009 | 0.521 ± 0.019 | 0.669 ± 0.02 | 0.417 ± 0.015 | 0.823 ± 0.023 | 0.222 ± 0.011 | 0.701 ± 0.021 | 0.585 ± 0.022 | 1.191 ± 0.028 | 0.264 ± 0.012 | 0.496 ± 0.015 | 0.381 ± 0.015 | 0.476 ± 0.018 | 0.491 ± 0.018 | 0.645 ± 0.019 | 0.507 ± 0.018 | 0.748 ± 0.022 | 0.143 ± 0.01 | 0.302 ± 0.014 | 0.0 ± 0.0 |
| Tyr | 2.346 ± 0.038 | 0.452 ± 0.016 | 1.652 ± 0.033 | 1.934 ± 0.037 | 1.508 ± 0.031 | 2.468 ± 0.04 | 0.664 ± 0.019 | 2.193 ± 0.033 | 1.736 ± 0.036 | 3.471 ± 0.045 | 0.74 ± 0.024 | 1.346 ± 0.027 | 1.428 ± 0.029 | 1.313 ± 0.031 | 1.607 ± 0.035 | 2.075 ± 0.035 | 1.678 ± 0.036 | 2.008 ± 0.034 | 0.375 ± 0.018 | 1.213 ± 0.034 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 5570 proteins (1667623 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
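A protein-level bootstrap of this kind could look roughly like the following Python sketch. It is an assumption-laden illustration rather than the database's procedure: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is taken as the error. Whether the reported error is in fact a standard deviation is not stated on the page. The names `pair_per_mille` and `bootstrap_error` and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (not single residues).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy = ["MKAAL", "AAG", "GLLK", "MAAAK"]
error_aa = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the spread reflects variation between proteins.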

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski