Amino acid dipeptide frequency for Wallemia mellicola (strain ATCC MYA-4683 / CBS 633.66) (Wallemia sebi (CBS 633.66))

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
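As a rough illustration of what such a calculation might look like, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the choice of overlapping pairs, and the rule that pairs never span two proteins are assumptions made for this example; the page does not describe the exact counting procedure.

```python
from collections import Counter

# Three-letter codes for the 20 standard amino acids, keyed by one-letter code.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: assumes overlapping pairs counted per protein.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to Xaa.
        residues = [THREE_LETTER.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real Wallemia mellicola sequences.
freqs = dipeptide_frequencies(["MALLA", "MAXL"])
```

In the toy input, the unknown residue `X` is mapped to Xaa, which mirrors the 21st category used on this page.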

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red and underrepresented ones in blue in the table.
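The uniform expectation and the over/under labelling can be sketched as follows. This assumes the colouring compares each value directly against the uniform 1/400 expectation, which the text implies but does not spell out; the `classify` helper is hypothetical and the example values are copied from the table for this proteome.

```python
STANDARD_AMINO_ACIDS = 20

# Uniform expectation: every one of the 20 x 20 dipeptides equally likely.
expected_per_mille = 1000.0 / STANDARD_AMINO_ACIDS**2  # 2.5 per mille = 0.25%


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"   # would be shown in red
    if observed_per_mille < expected:
        return "under"  # would be shown in blue
    return "as expected"


# Values taken from the table for this proteome (per mille).
examples = {"LeuLeu": 8.773, "SerSer": 8.807, "CysCys": 0.189, "TrpTrp": 0.188}
labels = {pair: classify(value) for pair, value in examples.items()}
```

A uniform expectation ignores the fact that single amino acids are not equally common, so rare residues such as Cys and Trp will tend to look underrepresented under this rule.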

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
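For readers who want the unit conversion spelled out, here is a minimal sketch. The helper names are made up for this example, and the AlaAla value is the one listed in the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 5.788  # AlaAla entry from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
```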

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns: second residue. Values in ‰, given as frequency ± error.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
| Ala | 5.788 ± 0.069 | 0.77 ± 0.022 | 3.467 ± 0.043 | 3.994 ± 0.048 | 3.029 ± 0.038 | 4.26 ± 0.058 | 1.553 ± 0.029 | 4.604 ± 0.051 | 4.239 ± 0.054 | 7.086 ± 0.067 | 1.524 ± 0.028 | 3.123 ± 0.035 | 3.363 ± 0.054 | 3.018 ± 0.039 | 3.548 ± 0.048 | 5.831 ± 0.066 | 4.134 ± 0.051 | 4.108 ± 0.051 | 0.738 ± 0.019 | 2.13 ± 0.032 | 0.0 ± 0.0 |
| Cys | 0.71 ± 0.019 | 0.189 ± 0.01 | 0.595 ± 0.017 | 0.524 ± 0.018 | 0.435 ± 0.02 | 0.721 ± 0.024 | 0.259 ± 0.012 | 0.657 ± 0.019 | 0.562 ± 0.017 | 1.109 ± 0.024 | 0.217 ± 0.009 | 0.462 ± 0.018 | 0.484 ± 0.017 | 0.382 ± 0.013 | 0.539 ± 0.017 | 0.771 ± 0.019 | 0.605 ± 0.016 | 0.682 ± 0.017 | 0.149 ± 0.009 | 0.356 ± 0.013 | 0.0 ± 0.0 |
| Asp | 4.144 ± 0.047 | 0.57 ± 0.018 | 5.092 ± 0.07 | 5.236 ± 0.06 | 2.333 ± 0.031 | 3.489 ± 0.044 | 1.129 ± 0.023 | 4.082 ± 0.051 | 3.847 ± 0.049 | 5.503 ± 0.045 | 1.174 ± 0.023 | 3.301 ± 0.048 | 2.481 ± 0.035 | 2.122 ± 0.032 | 2.679 ± 0.037 | 4.453 ± 0.051 | 3.006 ± 0.036 | 3.98 ± 0.047 | 0.77 ± 0.02 | 2.048 ± 0.03 | 0.0 ± 0.0 |
| Glu | 4.531 ± 0.046 | 0.657 ± 0.018 | 4.575 ± 0.054 | 6.083 ± 0.093 | 2.27 ± 0.036 | 3.333 ± 0.042 | 1.424 ± 0.031 | 3.885 ± 0.049 | 4.684 ± 0.057 | 5.875 ± 0.055 | 1.456 ± 0.025 | 3.28 ± 0.039 | 2.234 ± 0.035 | 2.87 ± 0.046 | 3.838 ± 0.052 | 4.863 ± 0.052 | 3.218 ± 0.039 | 3.418 ± 0.041 | 0.761 ± 0.021 | 1.96 ± 0.032 | 0.0 ± 0.0 |
| Phe | 2.717 ± 0.042 | 0.477 ± 0.015 | 2.712 ± 0.039 | 2.48 ± 0.033 | 1.503 ± 0.032 | 2.757 ± 0.051 | 0.77 ± 0.018 | 2.419 ± 0.038 | 2.486 ± 0.037 | 3.173 ± 0.047 | 0.765 ± 0.018 | 2.295 ± 0.035 | 1.521 ± 0.027 | 1.211 ± 0.023 | 1.678 ± 0.028 | 3.099 ± 0.041 | 2.241 ± 0.036 | 2.4 ± 0.037 | 0.507 ± 0.015 | 1.264 ± 0.025 | 0.0 ± 0.0 |
| Gly | 3.776 ± 0.052 | 0.742 ± 0.02 | 3.135 ± 0.043 | 3.216 ± 0.046 | 2.477 ± 0.039 | 4.191 ± 0.068 | 1.385 ± 0.042 | 3.486 ± 0.049 | 3.515 ± 0.044 | 5.439 ± 0.054 | 1.302 ± 0.03 | 2.422 ± 0.035 | 2.056 ± 0.037 | 2.172 ± 0.037 | 2.896 ± 0.036 | 4.535 ± 0.055 | 3.111 ± 0.04 | 3.871 ± 0.049 | 0.854 ± 0.021 | 1.999 ± 0.032 | 0.0 ± 0.0 |
| His | 1.434 ± 0.027 | 0.262 ± 0.011 | 1.214 ± 0.022 | 1.233 ± 0.026 | 0.924 ± 0.02 | 1.254 ± 0.036 | 0.735 ± 0.024 | 1.389 ± 0.026 | 1.318 ± 0.025 | 2.38 ± 0.035 | 0.415 ± 0.014 | 1.086 ± 0.024 | 1.394 ± 0.026 | 1.02 ± 0.022 | 1.156 ± 0.025 | 2.012 ± 0.038 | 1.273 ± 0.026 | 1.251 ± 0.027 | 0.264 ± 0.011 | 0.753 ± 0.019 | 0.0 ± 0.0 |
| Ile | 4.39 ± 0.054 | 0.735 ± 0.019 | 4.136 ± 0.046 | 4.089 ± 0.045 | 2.142 ± 0.033 | 3.464 ± 0.04 | 1.386 ± 0.024 | 3.537 ± 0.047 | 3.776 ± 0.041 | 5.361 ± 0.054 | 1.101 ± 0.021 | 3.174 ± 0.036 | 3.341 ± 0.045 | 2.357 ± 0.035 | 2.922 ± 0.038 | 4.927 ± 0.05 | 3.254 ± 0.043 | 3.554 ± 0.044 | 0.711 ± 0.019 | 1.835 ± 0.03 | 0.0 ± 0.0 |
| Lys | 4.561 ± 0.051 | 0.602 ± 0.019 | 3.861 ± 0.044 | 4.731 ± 0.061 | 2.092 ± 0.038 | 3.098 ± 0.041 | 1.529 ± 0.025 | 3.26 ± 0.038 | 4.714 ± 0.076 | 6.054 ± 0.062 | 1.287 ± 0.028 | 2.878 ± 0.036 | 3.213 ± 0.053 | 2.903 ± 0.045 | 3.933 ± 0.051 | 5.393 ± 0.056 | 3.334 ± 0.04 | 3.545 ± 0.039 | 0.688 ± 0.02 | 1.977 ± 0.032 | 0.0 ± 0.0 |
| Leu | 6.804 ± 0.062 | 1.012 ± 0.023 | 5.89 ± 0.048 | 5.988 ± 0.062 | 3.541 ± 0.049 | 5.191 ± 0.059 | 2.172 ± 0.034 | 5.414 ± 0.061 | 5.868 ± 0.056 | 8.773 ± 0.069 | 1.759 ± 0.029 | 4.997 ± 0.055 | 4.896 ± 0.055 | 3.808 ± 0.044 | 5.006 ± 0.059 | 8.429 ± 0.079 | 5.133 ± 0.055 | 5.293 ± 0.059 | 0.941 ± 0.021 | 2.611 ± 0.04 | 0.0 ± 0.0 |
| Met | 1.436 ± 0.029 | 0.18 ± 0.01 | 1.138 ± 0.025 | 1.165 ± 0.025 | 0.757 ± 0.021 | 1.108 ± 0.025 | 0.417 ± 0.014 | 1.139 ± 0.026 | 1.29 ± 0.025 | 1.847 ± 0.029 | 0.529 ± 0.016 | 0.984 ± 0.023 | 1.0 ± 0.023 | 0.885 ± 0.021 | 1.158 ± 0.026 | 2.114 ± 0.032 | 1.273 ± 0.019 | 1.042 ± 0.022 | 0.173 ± 0.008 | 0.524 ± 0.017 | 0.0 ± 0.0 |
| Asn | 3.496 ± 0.041 | 0.466 ± 0.017 | 3.387 ± 0.046 | 3.385 ± 0.04 | 1.734 ± 0.032 | 2.806 ± 0.041 | 1.089 ± 0.02 | 3.04 ± 0.034 | 3.386 ± 0.04 | 4.613 ± 0.044 | 0.974 ± 0.022 | 3.036 ± 0.041 | 2.354 ± 0.03 | 2.116 ± 0.037 | 2.187 ± 0.024 | 4.008 ± 0.046 | 2.884 ± 0.035 | 3.159 ± 0.048 | 0.646 ± 0.019 | 1.66 ± 0.031 | 0.0 ± 0.0 |
| Pro | 3.451 ± 0.06 | 0.314 ± 0.015 | 2.474 ± 0.035 | 3.163 ± 0.038 | 1.915 ± 0.031 | 2.401 ± 0.04 | 1.094 ± 0.025 | 2.856 ± 0.041 | 2.769 ± 0.039 | 4.137 ± 0.047 | 0.817 ± 0.021 | 2.266 ± 0.037 | 3.793 ± 0.087 | 2.285 ± 0.04 | 2.234 ± 0.039 | 5.138 ± 0.06 | 3.429 ± 0.042 | 2.712 ± 0.044 | 0.462 ± 0.015 | 1.478 ± 0.027 | 0.0 ± 0.0 |
| Gln | 3.103 ± 0.043 | 0.346 ± 0.013 | 2.221 ± 0.034 | 2.564 ± 0.038 | 1.573 ± 0.031 | 1.945 ± 0.037 | 1.058 ± 0.025 | 2.349 ± 0.035 | 2.423 ± 0.039 | 4.21 ± 0.044 | 0.935 ± 0.024 | 1.922 ± 0.035 | 2.424 ± 0.045 | 3.044 ± 0.088 | 2.407 ± 0.037 | 4.026 ± 0.053 | 2.376 ± 0.034 | 2.329 ± 0.035 | 0.413 ± 0.014 | 1.125 ± 0.029 | 0.0 ± 0.0 |
| Arg | 3.429 ± 0.042 | 0.544 ± 0.019 | 2.931 ± 0.042 | 3.285 ± 0.056 | 1.918 ± 0.032 | 2.745 ± 0.043 | 1.197 ± 0.024 | 2.92 ± 0.041 | 3.735 ± 0.053 | 4.912 ± 0.056 | 1.194 ± 0.024 | 2.463 ± 0.036 | 2.291 ± 0.033 | 2.429 ± 0.035 | 3.761 ± 0.058 | 4.263 ± 0.052 | 2.718 ± 0.035 | 2.966 ± 0.041 | 0.616 ± 0.017 | 1.592 ± 0.03 | 0.0 ± 0.0 |
| Ser | 5.683 ± 0.063 | 0.775 ± 0.021 | 4.849 ± 0.051 | 4.582 ± 0.047 | 3.332 ± 0.044 | 4.521 ± 0.049 | 1.974 ± 0.031 | 5.354 ± 0.055 | 5.315 ± 0.052 | 7.974 ± 0.067 | 1.57 ± 0.026 | 4.701 ± 0.046 | 4.277 ± 0.065 | 3.828 ± 0.05 | 4.278 ± 0.048 | 8.807 ± 0.113 | 5.84 ± 0.059 | 4.621 ± 0.047 | 0.9 ± 0.022 | 2.422 ± 0.037 | 0.0 ± 0.0 |
| Thr | 3.888 ± 0.047 | 0.559 ± 0.018 | 2.808 ± 0.038 | 3.013 ± 0.033 | 2.363 ± 0.032 | 3.329 ± 0.048 | 1.287 ± 0.022 | 3.682 ± 0.043 | 3.323 ± 0.041 | 5.706 ± 0.059 | 1.019 ± 0.021 | 2.803 ± 0.038 | 3.723 ± 0.051 | 2.328 ± 0.033 | 2.774 ± 0.04 | 5.311 ± 0.059 | 3.743 ± 0.049 | 3.2 ± 0.041 | 0.569 ± 0.018 | 1.614 ± 0.029 | 0.0 ± 0.0 |
| Val | 4.124 ± 0.044 | 0.688 ± 0.018 | 4.21 ± 0.045 | 4.002 ± 0.052 | 2.329 ± 0.037 | 3.489 ± 0.042 | 1.322 ± 0.025 | 3.555 ± 0.045 | 3.768 ± 0.048 | 5.302 ± 0.054 | 1.219 ± 0.023 | 2.87 ± 0.038 | 2.625 ± 0.038 | 2.333 ± 0.038 | 2.76 ± 0.036 | 4.311 ± 0.052 | 2.98 ± 0.043 | 3.839 ± 0.054 | 0.674 ± 0.019 | 1.881 ± 0.031 | 0.0 ± 0.0 |
| Trp | 0.688 ± 0.017 | 0.157 ± 0.009 | 0.731 ± 0.024 | 0.626 ± 0.018 | 0.493 ± 0.015 | 0.622 ± 0.018 | 0.263 ± 0.011 | 0.704 ± 0.018 | 0.775 ± 0.02 | 1.105 ± 0.025 | 0.295 ± 0.012 | 0.612 ± 0.016 | 0.399 ± 0.015 | 0.433 ± 0.016 | 0.641 ± 0.017 | 1.017 ± 0.023 | 0.671 ± 0.021 | 0.598 ± 0.017 | 0.188 ± 0.01 | 0.371 ± 0.014 | 0.0 ± 0.0 |
| Tyr | 2.153 ± 0.033 | 0.364 ± 0.013 | 2.013 ± 0.033 | 1.855 ± 0.029 | 1.317 ± 0.024 | 1.854 ± 0.035 | 0.725 ± 0.017 | 1.862 ± 0.029 | 1.861 ± 0.032 | 2.97 ± 0.032 | 0.607 ± 0.017 | 1.752 ± 0.031 | 1.316 ± 0.025 | 1.251 ± 0.024 | 1.512 ± 0.029 | 2.294 ± 0.032 | 1.816 ± 0.029 | 1.712 ± 0.027 | 0.364 ± 0.014 | 1.05 ± 0.024 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 5262 proteins (2223108 amino acids)
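To get a feel for the scale behind these numbers, a per-mille value can be turned into an approximate dipeptide count. The sketch below assumes dipeptides are counted as overlapping pairs within each protein, so that a protein of length n contributes n - 1 pairs; the page does not confirm this, so the resulting counts are only indicative.

```python
N_PROTEINS = 5262          # from the statistics line on this page
N_AMINO_ACIDS = 2223108    # from the statistics line on this page

# Assumption: each protein of length n contributes n - 1 overlapping pairs.
total_dipeptides = N_AMINO_ACIDS - N_PROTEINS


def approximate_count(value_per_mille, total=total_dipeptides):
    """Approximate number of occurrences behind a per-mille frequency."""
    return value_per_mille * 1e-3 * total


leu_leu_count = approximate_count(8.773)  # LeuLeu entry from the table
```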

Note: The error was estimated by bootstrapping (×100) at the protein level.
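A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. The function names, the use of the standard deviation as the error measure, the fixed seed, and the toy sequences are all assumptions for illustration; the page states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of a one-letter dipeptide such as 'LL'."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within a single protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome; real input would be the full set of protein sequences.
toy = ["MALLKL", "MSSLLA", "MKTAYI", "MLLLSS", "MAGSTV"]
point = pair_frequency(toy, "LL")
error = bootstrap_error(toy, "LL")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.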

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski