Amino acid dipeptide frequency for Bacillus megaterium (strain ATCC 12872 / QMB1551)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
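As a rough illustration of how such a table could be produced, the sketch below counts overlapping residue pairs within each protein and compares the result with the uniform expectation of 1/400. The function names, the use of one-letter codes, the mapping of every non-standard character to X, and the simple over/under rule are assumptions made for this example; the page itself does not describe the exact procedure.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (assumed mapping)


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; pairs never span two proteins.
        for first, second in zip(seq, seq[1:]):
            first = first if first in AMINO_ACIDS else "X"
            second = second if second in AMINO_ACIDS else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(value_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["AAAA", "AC"]  # hypothetical toy input, not real proteome data
    freqs = dipeptide_per_mille(toy)
    print(freqs["AA"], classify(freqs["AA"]))
    print(freqs["CA"], classify(freqs["CA"]))
```

With real data, `sequences` would be the protein sequences of the proteome, for example read from a FASTA file.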

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
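The unit conversion can be sketched in a few lines. The helper names below are hypothetical, and the comparison against 2.5‰ assumes the uniform expectation of 1/400 for 400 equally likely dipeptides; the LeuLeu value is taken from the table.

```python
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille for 400 equally likely dipeptides


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_EXPECTATION


leu_leu = 10.581  # LeuLeu frequency from the table, in per mille
print("fraction:", per_mille_to_fraction(leu_leu))
print("percent:", per_mille_to_percent(leu_leu))
print("fold over uniform expectation:", fold_vs_uniform(leu_leu))
```

A fold value above 1 means the dipeptide is more frequent than the uniform expectation, and a value below 1 means it is less frequent.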

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.276 ± 0.069
AlaCys: 0.669 ± 0.022
AlaAsp: 3.329 ± 0.058
AlaGlu: 4.459 ± 0.068
AlaPhe: 3.525 ± 0.054
AlaGly: 5.059 ± 0.066
AlaHis: 1.45 ± 0.032
AlaIle: 5.625 ± 0.064
AlaLys: 4.912 ± 0.059
AlaLeu: 7.529 ± 0.073
AlaMet: 1.95 ± 0.038
AlaAsn: 2.762 ± 0.045
AlaPro: 2.15 ± 0.041
AlaGln: 2.475 ± 0.054
AlaArg: 2.475 ± 0.041
AlaSer: 4.559 ± 0.054
AlaThr: 3.616 ± 0.072
AlaVal: 5.791 ± 0.069
AlaTrp: 0.608 ± 0.021
AlaTyr: 2.463 ± 0.039
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.466 ± 0.017
CysCys: 0.122 ± 0.009
CysAsp: 0.356 ± 0.017
CysGlu: 0.473 ± 0.017
CysPhe: 0.38 ± 0.014
CysGly: 0.679 ± 0.023
CysHis: 0.199 ± 0.01
CysIle: 0.576 ± 0.018
CysLys: 0.411 ± 0.018
CysLeu: 0.782 ± 0.02
CysMet: 0.217 ± 0.012
CysAsn: 0.273 ± 0.015
CysPro: 0.326 ± 0.016
CysGln: 0.237 ± 0.014
CysArg: 0.277 ± 0.014
CysSer: 0.61 ± 0.019
CysThr: 0.425 ± 0.016
CysVal: 0.469 ± 0.019
CysTrp: 0.084 ± 0.007
CysTyr: 0.302 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.189 ± 0.047
AspCys: 0.356 ± 0.015
AspAsp: 2.163 ± 0.043
AspGlu: 4.021 ± 0.059
AspPhe: 2.214 ± 0.038
AspGly: 2.881 ± 0.048
AspHis: 1.124 ± 0.033
AspIle: 3.955 ± 0.056
AspLys: 3.033 ± 0.054
AspLeu: 4.665 ± 0.06
AspMet: 1.256 ± 0.031
AspAsn: 1.608 ± 0.035
AspPro: 1.716 ± 0.04
AspGln: 1.913 ± 0.038
AspArg: 1.923 ± 0.034
AspSer: 2.601 ± 0.046
AspThr: 2.323 ± 0.039
AspVal: 3.92 ± 0.048
AspTrp: 0.544 ± 0.02
AspTyr: 2.002 ± 0.037
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.401 ± 0.077
GluCys: 0.431 ± 0.02
GluAsp: 3.412 ± 0.054
GluGlu: 6.719 ± 0.097
GluPhe: 2.422 ± 0.041
GluGly: 4.013 ± 0.057
GluHis: 1.615 ± 0.031
GluIle: 4.831 ± 0.068
GluLys: 6.64 ± 0.076
GluLeu: 6.974 ± 0.083
GluMet: 2.077 ± 0.038
GluAsn: 3.307 ± 0.048
GluPro: 1.78 ± 0.034
GluGln: 3.498 ± 0.065
GluArg: 3.37 ± 0.048
GluSer: 3.328 ± 0.051
GluThr: 3.708 ± 0.062
GluVal: 4.985 ± 0.065
GluTrp: 0.753 ± 0.024
GluTyr: 2.169 ± 0.037
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.023 ± 0.053
PheCys: 0.366 ± 0.015
PheAsp: 2.194 ± 0.041
PheGlu: 2.663 ± 0.04
PhePhe: 2.514 ± 0.052
PheGly: 3.201 ± 0.044
PheHis: 1.025 ± 0.03
PheIle: 3.985 ± 0.065
PheLys: 2.49 ± 0.045
PheLeu: 4.779 ± 0.064
PheMet: 1.211 ± 0.028
PheAsn: 1.769 ± 0.035
PhePro: 1.661 ± 0.035
PheGln: 1.719 ± 0.035
PheArg: 1.405 ± 0.034
PheSer: 3.494 ± 0.054
PheThr: 2.647 ± 0.04
PheVal: 3.301 ± 0.053
PheTrp: 0.494 ± 0.019
PheTyr: 1.786 ± 0.036
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.941 ± 0.08
GlyCys: 0.642 ± 0.02
GlyAsp: 2.967 ± 0.043
GlyGlu: 4.146 ± 0.058
GlyPhe: 3.387 ± 0.054
GlyGly: 4.796 ± 0.083
GlyHis: 1.327 ± 0.031
GlyIle: 5.881 ± 0.083
GlyLys: 4.832 ± 0.059
GlyLeu: 6.392 ± 0.078
GlyMet: 2.07 ± 0.043
GlyAsn: 2.569 ± 0.046
GlyPro: 1.671 ± 0.036
GlyGln: 2.196 ± 0.034
GlyArg: 2.318 ± 0.043
GlySer: 3.976 ± 0.056
GlyThr: 3.902 ± 0.055
GlyVal: 5.276 ± 0.077
GlyTrp: 0.796 ± 0.025
GlyTyr: 2.766 ± 0.043
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.403 ± 0.03
HisCys: 0.195 ± 0.013
HisAsp: 1.054 ± 0.03
HisGlu: 1.541 ± 0.035
HisPhe: 1.082 ± 0.032
HisGly: 1.367 ± 0.03
HisHis: 0.783 ± 0.028
HisIle: 1.783 ± 0.036
HisLys: 1.226 ± 0.031
HisLeu: 2.308 ± 0.039
HisMet: 0.588 ± 0.021
HisAsn: 0.802 ± 0.024
HisPro: 1.193 ± 0.031
HisGln: 0.951 ± 0.027
HisArg: 0.904 ± 0.025
HisSer: 1.404 ± 0.032
HisThr: 1.254 ± 0.032
HisVal: 1.651 ± 0.036
HisTrp: 0.234 ± 0.012
HisTyr: 0.973 ± 0.029
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.814 ± 0.07
IleCys: 0.693 ± 0.022
IleAsp: 3.977 ± 0.047
IleGlu: 5.34 ± 0.07
IlePhe: 3.181 ± 0.063
IleGly: 5.954 ± 0.077
IleHis: 1.738 ± 0.034
IleIle: 5.552 ± 0.087
IleLys: 4.628 ± 0.058
IleLeu: 6.862 ± 0.083
IleMet: 1.783 ± 0.033
IleAsn: 3.075 ± 0.051
IlePro: 3.155 ± 0.042
IleGln: 3.002 ± 0.042
IleArg: 2.714 ± 0.044
IleSer: 5.155 ± 0.062
IleThr: 4.174 ± 0.064
IleVal: 5.652 ± 0.064
IleTrp: 0.642 ± 0.023
IleTyr: 2.416 ± 0.043
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.987 ± 0.063
LysCys: 0.322 ± 0.015
LysAsp: 3.834 ± 0.06
LysGlu: 7.277 ± 0.082
LysPhe: 1.893 ± 0.038
LysGly: 4.777 ± 0.059
LysHis: 1.519 ± 0.034
LysIle: 4.356 ± 0.059
LysLys: 6.605 ± 0.084
LysLeu: 5.72 ± 0.06
LysMet: 2.241 ± 0.041
LysAsn: 3.299 ± 0.054
LysPro: 2.244 ± 0.043
LysGln: 3.805 ± 0.059
LysArg: 3.468 ± 0.053
LysSer: 3.704 ± 0.049
LysThr: 3.76 ± 0.056
LysVal: 4.8 ± 0.065
LysTrp: 0.9 ± 0.028
LysTyr: 2.088 ± 0.039
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.335 ± 0.079
LeuCys: 0.836 ± 0.024
LeuAsp: 4.388 ± 0.057
LeuGlu: 5.998 ± 0.081
LeuPhe: 5.195 ± 0.068
LeuGly: 6.391 ± 0.078
LeuHis: 2.276 ± 0.041
LeuIle: 7.169 ± 0.089
LeuLys: 6.776 ± 0.07
LeuLeu: 10.581 ± 0.116
LeuMet: 2.501 ± 0.044
LeuAsn: 4.232 ± 0.05
LeuPro: 3.887 ± 0.055
LeuGln: 3.888 ± 0.059
LeuArg: 3.36 ± 0.055
LeuSer: 7.23 ± 0.078
LeuThr: 5.999 ± 0.068
LeuVal: 6.515 ± 0.082
LeuTrp: 0.845 ± 0.027
LeuTyr: 3.273 ± 0.049
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.867 ± 0.038
MetCys: 0.149 ± 0.01
MetAsp: 1.238 ± 0.027
MetGlu: 1.691 ± 0.037
MetPhe: 1.102 ± 0.029
MetGly: 1.669 ± 0.033
MetHis: 0.493 ± 0.022
MetIle: 2.151 ± 0.042
MetLys: 2.674 ± 0.041
MetLeu: 2.615 ± 0.038
MetMet: 0.921 ± 0.027
MetAsn: 1.65 ± 0.036
MetPro: 0.989 ± 0.031
MetGln: 0.935 ± 0.029
MetArg: 1.007 ± 0.025
MetSer: 1.872 ± 0.039
MetThr: 1.603 ± 0.032
MetVal: 1.712 ± 0.037
MetTrp: 0.231 ± 0.012
MetTyr: 0.816 ± 0.024
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.684 ± 0.048
AsnCys: 0.298 ± 0.014
AsnAsp: 2.06 ± 0.045
AsnGlu: 3.44 ± 0.054
AsnPhe: 1.462 ± 0.033
AsnGly: 3.015 ± 0.051
AsnHis: 1.075 ± 0.027
AsnIle: 3.171 ± 0.05
AsnLys: 2.98 ± 0.05
AsnLeu: 3.689 ± 0.047
AsnMet: 1.155 ± 0.029
AsnAsn: 1.832 ± 0.041
AsnPro: 1.947 ± 0.036
AsnGln: 2.006 ± 0.039
AsnArg: 1.916 ± 0.032
AsnSer: 2.338 ± 0.047
AsnThr: 2.087 ± 0.041
AsnVal: 3.196 ± 0.047
AsnTrp: 0.508 ± 0.019
AsnTyr: 1.48 ± 0.03
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.434 ± 0.041
ProCys: 0.203 ± 0.011
ProAsp: 1.749 ± 0.036
ProGlu: 2.513 ± 0.046
ProPhe: 2.075 ± 0.039
ProGly: 2.091 ± 0.042
ProHis: 0.88 ± 0.027
ProIle: 2.591 ± 0.039
ProLys: 2.202 ± 0.04
ProLeu: 3.668 ± 0.056
ProMet: 0.776 ± 0.03
ProAsn: 1.58 ± 0.035
ProPro: 0.993 ± 0.031
ProGln: 1.23 ± 0.032
ProArg: 1.077 ± 0.031
ProSer: 2.483 ± 0.045
ProThr: 1.965 ± 0.036
ProVal: 2.821 ± 0.046
ProTrp: 0.382 ± 0.017
ProTyr: 1.494 ± 0.033
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.067 ± 0.057
GlnCys: 0.229 ± 0.014
GlnAsp: 1.707 ± 0.031
GlnGlu: 2.967 ± 0.05
GlnPhe: 1.658 ± 0.037
GlnGly: 2.373 ± 0.045
GlnHis: 1.065 ± 0.028
GlnIle: 2.444 ± 0.043
GlnLys: 3.217 ± 0.049
GlnLeu: 4.262 ± 0.053
GlnMet: 1.117 ± 0.028
GlnAsn: 1.645 ± 0.035
GlnPro: 1.332 ± 0.03
GlnGln: 2.289 ± 0.058
GlnArg: 1.621 ± 0.032
GlnSer: 2.349 ± 0.037
GlnThr: 2.309 ± 0.045
GlnVal: 2.524 ± 0.041
GlnTrp: 0.409 ± 0.016
GlnTyr: 1.469 ± 0.031
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.304 ± 0.042
ArgCys: 0.249 ± 0.011
ArgAsp: 1.901 ± 0.038
ArgGlu: 2.928 ± 0.049
ArgPhe: 1.806 ± 0.037
ArgGly: 2.099 ± 0.041
ArgHis: 0.829 ± 0.025
ArgIle: 2.865 ± 0.045
ArgLys: 3.079 ± 0.043
ArgLeu: 3.733 ± 0.053
ArgMet: 1.19 ± 0.025
ArgAsn: 1.711 ± 0.033
ArgPro: 1.226 ± 0.031
ArgGln: 1.492 ± 0.035
ArgArg: 1.71 ± 0.049
ArgSer: 2.273 ± 0.047
ArgThr: 2.078 ± 0.044
ArgVal: 2.533 ± 0.044
ArgTrp: 0.364 ± 0.015
ArgTyr: 1.599 ± 0.032
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.185 ± 0.048
SerCys: 0.483 ± 0.017
SerAsp: 2.694 ± 0.049
SerGlu: 3.779 ± 0.047
SerPhe: 3.522 ± 0.047
SerGly: 4.289 ± 0.055
SerHis: 1.41 ± 0.033
SerIle: 5.106 ± 0.067
SerLys: 4.147 ± 0.056
SerLeu: 6.865 ± 0.071
SerMet: 1.851 ± 0.036
SerAsn: 2.539 ± 0.041
SerPro: 2.288 ± 0.041
SerGln: 2.324 ± 0.044
SerArg: 2.149 ± 0.04
SerSer: 4.729 ± 0.072
SerThr: 3.534 ± 0.053
SerVal: 4.471 ± 0.054
SerTrp: 0.652 ± 0.021
SerTyr: 2.525 ± 0.042
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.1 ± 0.054
ThrCys: 0.414 ± 0.019
ThrAsp: 2.523 ± 0.044
ThrGlu: 3.338 ± 0.055
ThrPhe: 2.79 ± 0.042
ThrGly: 4.196 ± 0.163
ThrHis: 1.182 ± 0.028
ThrIle: 4.377 ± 0.058
ThrLys: 3.604 ± 0.052
ThrLeu: 5.573 ± 0.065
ThrMet: 1.279 ± 0.027
ThrAsn: 2.486 ± 0.038
ThrPro: 2.323 ± 0.045
ThrGln: 1.57 ± 0.032
ThrArg: 1.786 ± 0.04
ThrSer: 3.604 ± 0.051
ThrThr: 2.916 ± 0.051
ThrVal: 4.409 ± 0.063
ThrTrp: 0.559 ± 0.02
ThrTyr: 2.188 ± 0.037
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.279 ± 0.066
ValCys: 0.624 ± 0.019
ValAsp: 3.514 ± 0.053
ValGlu: 4.856 ± 0.069
ValPhe: 3.173 ± 0.052
ValGly: 4.805 ± 0.066
ValHis: 1.564 ± 0.035
ValIle: 5.683 ± 0.067
ValLys: 5.09 ± 0.071
ValLeu: 7.134 ± 0.09
ValMet: 1.951 ± 0.04
ValAsn: 3.169 ± 0.045
ValPro: 2.728 ± 0.04
ValGln: 2.657 ± 0.041
ValArg: 2.54 ± 0.04
ValSer: 4.988 ± 0.055
ValThr: 4.36 ± 0.087
ValVal: 5.417 ± 0.075
ValTrp: 0.65 ± 0.02
ValTyr: 2.504 ± 0.04
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.627 ± 0.019
TrpCys: 0.078 ± 0.008
TrpAsp: 0.457 ± 0.018
TrpGlu: 0.614 ± 0.02
TrpPhe: 0.572 ± 0.021
TrpGly: 0.672 ± 0.023
TrpHis: 0.232 ± 0.014
TrpIle: 0.836 ± 0.026
TrpLys: 0.809 ± 0.027
TrpLeu: 1.143 ± 0.029
TrpMet: 0.34 ± 0.015
TrpAsn: 0.588 ± 0.022
TrpPro: 0.249 ± 0.014
TrpGln: 0.312 ± 0.013
TrpArg: 0.374 ± 0.018
TrpSer: 0.673 ± 0.019
TrpThr: 0.52 ± 0.018
TrpVal: 0.661 ± 0.024
TrpTrp: 0.153 ± 0.01
TrpTyr: 0.32 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.3 ± 0.039
TyrCys: 0.307 ± 0.014
TyrAsp: 1.889 ± 0.037
TyrGlu: 2.669 ± 0.05
TyrPhe: 1.774 ± 0.038
TyrGly: 2.527 ± 0.045
TyrHis: 0.909 ± 0.025
TyrIle: 2.571 ± 0.047
TyrLys: 2.244 ± 0.037
TyrLeu: 3.4 ± 0.057
TyrMet: 0.912 ± 0.022
TyrAsn: 1.445 ± 0.033
TyrPro: 1.348 ± 0.03
TyrGln: 1.554 ± 0.033
TyrArg: 1.575 ± 0.031
TyrSer: 2.173 ± 0.037
TyrThr: 2.036 ± 0.035
TyrVal: 2.581 ± 0.045
TyrTrp: 0.422 ± 0.015
TyrTyr: 1.498 ± 0.03
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5606 proteins (1495755 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
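One way to read "bootstrapping at the protein level" is that whole proteins, rather than individual dipeptides, are resampled with replacement, and the spread of the recomputed frequency is reported as the error. The sketch below follows that reading. The function names, the fixed seed, and the use of the standard deviation as the error measure are assumptions for illustration, since the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(sequences) proteins with replacement and
    recomputes the frequency on that resampled proteome.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]  # count once, reuse
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    # Hypothetical toy sequences in one-letter code, not real proteome data.
    toy = ["MALLKA", "MLLLAG", "MKKAEL", "MAAALL"]
    mean, err = bootstrap_error(toy, "LL")
    print(f"LL: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins keeps the dipeptides of one protein together, so the estimate reflects variation between proteins rather than treating every dipeptide as independent.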

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski