Amino acid dipepetide frequency for Mesobacillus campisalis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.717AlaAla: 7.717 ± 0.107
0.629AlaCys: 0.629 ± 0.023
3.661AlaAsp: 3.661 ± 0.049
5.739AlaGlu: 5.739 ± 0.061
3.526AlaPhe: 3.526 ± 0.061
6.83AlaGly: 6.83 ± 0.085
1.392AlaHis: 1.392 ± 0.029
5.805AlaIle: 5.805 ± 0.078
4.689AlaLys: 4.689 ± 0.061
7.707AlaLeu: 7.707 ± 0.083
2.255AlaMet: 2.255 ± 0.037
2.74AlaAsn: 2.74 ± 0.042
2.359AlaPro: 2.359 ± 0.048
2.276AlaGln: 2.276 ± 0.04
3.151AlaArg: 3.151 ± 0.05
4.586AlaSer: 4.586 ± 0.058
3.244AlaThr: 3.244 ± 0.049
6.177AlaVal: 6.177 ± 0.079
0.683AlaTrp: 0.683 ± 0.024
2.4AlaTyr: 2.4 ± 0.036
0.0AlaXaa: 0.0 ± 0.0
Cys
0.454CysAla: 0.454 ± 0.016
0.096CysCys: 0.096 ± 0.008
0.383CysAsp: 0.383 ± 0.017
0.473CysGlu: 0.473 ± 0.018
0.321CysPhe: 0.321 ± 0.013
0.709CysGly: 0.709 ± 0.024
0.236CysHis: 0.236 ± 0.017
0.489CysIle: 0.489 ± 0.02
0.323CysLys: 0.323 ± 0.015
0.681CysLeu: 0.681 ± 0.021
0.164CysMet: 0.164 ± 0.011
0.242CysAsn: 0.242 ± 0.014
0.368CysPro: 0.368 ± 0.02
0.216CysGln: 0.216 ± 0.014
0.331CysArg: 0.331 ± 0.015
0.514CysSer: 0.514 ± 0.021
0.39CysThr: 0.39 ± 0.016
0.407CysVal: 0.407 ± 0.019
0.08CysTrp: 0.08 ± 0.009
0.249CysTyr: 0.249 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
3.27AspAla: 3.27 ± 0.056
0.392AspCys: 0.392 ± 0.017
2.138AspAsp: 2.138 ± 0.046
4.108AspGlu: 4.108 ± 0.062
2.462AspPhe: 2.462 ± 0.047
3.56AspGly: 3.56 ± 0.056
1.142AspHis: 1.142 ± 0.027
3.791AspIle: 3.791 ± 0.056
2.857AspLys: 2.857 ± 0.051
4.954AspLeu: 4.954 ± 0.07
1.281AspMet: 1.281 ± 0.031
1.64AspAsn: 1.64 ± 0.04
2.091AspPro: 2.091 ± 0.036
1.794AspGln: 1.794 ± 0.033
2.368AspArg: 2.368 ± 0.041
2.707AspSer: 2.707 ± 0.046
2.223AspThr: 2.223 ± 0.043
3.508AspVal: 3.508 ± 0.055
0.638AspTrp: 0.638 ± 0.019
2.053AspTyr: 2.053 ± 0.042
0.0AspXaa: 0.0 ± 0.0
Glu
6.004GluAla: 6.004 ± 0.078
0.364GluCys: 0.364 ± 0.016
3.816GluAsp: 3.816 ± 0.054
7.27GluGlu: 7.27 ± 0.088
2.738GluPhe: 2.738 ± 0.048
4.897GluGly: 4.897 ± 0.063
1.566GluHis: 1.566 ± 0.035
5.519GluIle: 5.519 ± 0.07
6.43GluLys: 6.43 ± 0.073
7.278GluLeu: 7.278 ± 0.079
2.381GluMet: 2.381 ± 0.045
3.616GluAsn: 3.616 ± 0.055
2.14GluPro: 2.14 ± 0.036
3.084GluGln: 3.084 ± 0.054
3.547GluArg: 3.547 ± 0.051
3.472GluSer: 3.472 ± 0.049
3.802GluThr: 3.802 ± 0.054
5.074GluVal: 5.074 ± 0.067
0.823GluTrp: 0.823 ± 0.026
2.242GluTyr: 2.242 ± 0.043
0.0GluXaa: 0.0 ± 0.0
Phe
3.48PheAla: 3.48 ± 0.057
0.355PheCys: 0.355 ± 0.017
2.198PheAsp: 2.198 ± 0.041
2.851PheGlu: 2.851 ± 0.041
2.355PhePhe: 2.355 ± 0.05
3.596PheGly: 3.596 ± 0.051
1.025PheHis: 1.025 ± 0.032
3.685PheIle: 3.685 ± 0.063
2.374PheLys: 2.374 ± 0.04
4.722PheLeu: 4.722 ± 0.078
1.211PheMet: 1.211 ± 0.026
1.85PheAsn: 1.85 ± 0.04
1.745PhePro: 1.745 ± 0.036
1.497PheGln: 1.497 ± 0.03
1.689PheArg: 1.689 ± 0.035
3.208PheSer: 3.208 ± 0.056
2.503PheThr: 2.503 ± 0.044
3.002PheVal: 3.002 ± 0.055
0.502PheTrp: 0.502 ± 0.021
1.733PheTyr: 1.733 ± 0.038
0.0PheXaa: 0.0 ± 0.0
Gly
5.504GlyAla: 5.504 ± 0.064
0.68GlyCys: 0.68 ± 0.027
3.352GlyAsp: 3.352 ± 0.054
5.007GlyGlu: 5.007 ± 0.061
3.659GlyPhe: 3.659 ± 0.059
5.502GlyGly: 5.502 ± 0.08
1.509GlyHis: 1.509 ± 0.035
6.243GlyIle: 6.243 ± 0.085
5.454GlyLys: 5.454 ± 0.063
7.326GlyLeu: 7.326 ± 0.084
2.422GlyMet: 2.422 ± 0.04
2.995GlyAsn: 2.995 ± 0.048
2.053GlyPro: 2.053 ± 0.04
2.472GlyGln: 2.472 ± 0.046
3.197GlyArg: 3.197 ± 0.047
4.564GlySer: 4.564 ± 0.058
4.255GlyThr: 4.255 ± 0.061
5.35GlyVal: 5.35 ± 0.074
0.846GlyTrp: 0.846 ± 0.026
2.814GlyTyr: 2.814 ± 0.052
0.0GlyXaa: 0.0 ± 0.0
His
1.385HisAla: 1.385 ± 0.035
0.229HisCys: 0.229 ± 0.011
1.057HisAsp: 1.057 ± 0.027
1.401HisGlu: 1.401 ± 0.035
1.112HisPhe: 1.112 ± 0.03
1.595HisGly: 1.595 ± 0.034
0.658HisHis: 0.658 ± 0.024
1.433HisIle: 1.433 ± 0.034
0.986HisLys: 0.986 ± 0.024
2.133HisLeu: 2.133 ± 0.042
0.557HisMet: 0.557 ± 0.02
0.764HisAsn: 0.764 ± 0.023
1.227HisPro: 1.227 ± 0.031
0.844HisGln: 0.844 ± 0.024
0.909HisArg: 0.909 ± 0.026
1.283HisSer: 1.283 ± 0.036
0.994HisThr: 0.994 ± 0.028
1.341HisVal: 1.341 ± 0.028
0.227HisTrp: 0.227 ± 0.014
0.884HisTyr: 0.884 ± 0.025
0.0HisXaa: 0.0 ± 0.0
Ile
6.043IleAla: 6.043 ± 0.072
0.623IleCys: 0.623 ± 0.019
3.821IleAsp: 3.821 ± 0.056
5.292IleGlu: 5.292 ± 0.065
3.069IlePhe: 3.069 ± 0.062
5.977IleGly: 5.977 ± 0.079
1.61IleHis: 1.61 ± 0.037
5.262IleIle: 5.262 ± 0.071
4.134IleLys: 4.134 ± 0.061
7.106IleLeu: 7.106 ± 0.083
1.758IleMet: 1.758 ± 0.042
2.849IleAsn: 2.849 ± 0.046
3.378IlePro: 3.378 ± 0.053
2.733IleGln: 2.733 ± 0.048
3.062IleArg: 3.062 ± 0.052
4.782IleSer: 4.782 ± 0.061
3.848IleThr: 3.848 ± 0.055
5.149IleVal: 5.149 ± 0.066
0.653IleTrp: 0.653 ± 0.023
2.204IleTyr: 2.204 ± 0.044
0.0IleXaa: 0.0 ± 0.0
Lys
5.023LysAla: 5.023 ± 0.063
0.293LysCys: 0.293 ± 0.016
3.459LysAsp: 3.459 ± 0.051
6.288LysGlu: 6.288 ± 0.076
1.87LysPhe: 1.87 ± 0.038
4.595LysGly: 4.595 ± 0.056
1.245LysHis: 1.245 ± 0.031
4.279LysIle: 4.279 ± 0.06
5.236LysLys: 5.236 ± 0.061
5.72LysLeu: 5.72 ± 0.059
2.167LysMet: 2.167 ± 0.039
2.959LysAsn: 2.959 ± 0.048
2.241LysPro: 2.241 ± 0.037
2.725LysGln: 2.725 ± 0.048
3.078LysArg: 3.078 ± 0.046
3.231LysSer: 3.231 ± 0.049
3.347LysThr: 3.347 ± 0.049
4.589LysVal: 4.589 ± 0.067
0.782LysTrp: 0.782 ± 0.023
2.049LysTyr: 2.049 ± 0.039
0.0LysXaa: 0.0 ± 0.0
Leu
8.469LeuAla: 8.469 ± 0.081
0.647LeuCys: 0.647 ± 0.023
4.79LeuAsp: 4.79 ± 0.059
7.064LeuGlu: 7.064 ± 0.081
5.005LeuPhe: 5.005 ± 0.069
7.167LeuGly: 7.167 ± 0.089
1.943LeuHis: 1.943 ± 0.037
6.823LeuIle: 6.823 ± 0.086
6.526LeuLys: 6.526 ± 0.069
10.316LeuLeu: 10.316 ± 0.125
2.616LeuMet: 2.616 ± 0.05
4.076LeuAsn: 4.076 ± 0.055
4.201LeuPro: 4.201 ± 0.055
3.463LeuGln: 3.463 ± 0.055
3.886LeuArg: 3.886 ± 0.06
6.601LeuSer: 6.601 ± 0.078
5.389LeuThr: 5.389 ± 0.066
6.705LeuVal: 6.705 ± 0.073
0.87LeuTrp: 0.87 ± 0.028
3.066LeuTyr: 3.066 ± 0.05
0.0LeuXaa: 0.0 ± 0.0
Met
2.347MetAla: 2.347 ± 0.042
0.138MetCys: 0.138 ± 0.011
1.553MetAsp: 1.553 ± 0.038
2.288MetGlu: 2.288 ± 0.043
1.123MetPhe: 1.123 ± 0.028
2.111MetGly: 2.111 ± 0.045
0.463MetHis: 0.463 ± 0.017
2.037MetIle: 2.037 ± 0.041
2.303MetLys: 2.303 ± 0.041
2.676MetLeu: 2.676 ± 0.048
0.899MetMet: 0.899 ± 0.027
1.422MetAsn: 1.422 ± 0.034
1.078MetPro: 1.078 ± 0.029
0.847MetGln: 0.847 ± 0.027
1.045MetArg: 1.045 ± 0.027
1.612MetSer: 1.612 ± 0.035
1.595MetThr: 1.595 ± 0.033
1.942MetVal: 1.942 ± 0.042
0.187MetTrp: 0.187 ± 0.01
0.708MetTyr: 0.708 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
2.758AsnAla: 2.758 ± 0.044
0.299AsnCys: 0.299 ± 0.015
1.89AsnAsp: 1.89 ± 0.039
3.04AsnGlu: 3.04 ± 0.048
1.552AsnPhe: 1.552 ± 0.035
3.34AsnGly: 3.34 ± 0.057
0.914AsnHis: 0.914 ± 0.03
3.012AsnIle: 3.012 ± 0.055
2.504AsnLys: 2.504 ± 0.047
3.818AsnLeu: 3.818 ± 0.051
1.156AsnMet: 1.156 ± 0.029
1.633AsnAsn: 1.633 ± 0.047
2.229AsnPro: 2.229 ± 0.044
1.634AsnGln: 1.634 ± 0.034
1.959AsnArg: 1.959 ± 0.047
2.271AsnSer: 2.271 ± 0.041
1.905AsnThr: 1.905 ± 0.037
2.761AsnVal: 2.761 ± 0.049
0.519AsnTrp: 0.519 ± 0.017
1.386AsnTyr: 1.386 ± 0.037
0.0AsnXaa: 0.0 ± 0.0
Pro
3.056ProAla: 3.056 ± 0.049
0.219ProCys: 0.219 ± 0.013
2.229ProAsp: 2.229 ± 0.043
3.49ProGlu: 3.49 ± 0.056
2.041ProPhe: 2.041 ± 0.046
3.001ProGly: 3.001 ± 0.049
0.805ProHis: 0.805 ± 0.023
2.572ProIle: 2.572 ± 0.045
2.136ProLys: 2.136 ± 0.04
3.773ProLeu: 3.773 ± 0.058
0.896ProMet: 0.896 ± 0.024
1.503ProAsn: 1.503 ± 0.031
1.147ProPro: 1.147 ± 0.029
1.242ProGln: 1.242 ± 0.032
1.28ProArg: 1.28 ± 0.03
2.272ProSer: 2.272 ± 0.042
1.684ProThr: 1.684 ± 0.036
3.267ProVal: 3.267 ± 0.058
0.397ProTrp: 0.397 ± 0.016
1.41ProTyr: 1.41 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
2.848GlnAla: 2.848 ± 0.048
0.207GlnCys: 0.207 ± 0.012
1.647GlnAsp: 1.647 ± 0.037
2.913GlnGlu: 2.913 ± 0.048
1.58GlnPhe: 1.58 ± 0.037
2.335GlnGly: 2.335 ± 0.045
0.802GlnHis: 0.802 ± 0.024
2.318GlnIle: 2.318 ± 0.042
2.508GlnLys: 2.508 ± 0.044
3.893GlnLeu: 3.893 ± 0.066
1.097GlnMet: 1.097 ± 0.024
1.486GlnAsn: 1.486 ± 0.033
1.247GlnPro: 1.247 ± 0.03
1.698GlnGln: 1.698 ± 0.045
1.519GlnArg: 1.519 ± 0.032
2.004GlnSer: 2.004 ± 0.039
1.743GlnThr: 1.743 ± 0.036
2.303GlnVal: 2.303 ± 0.042
0.396GlnTrp: 0.396 ± 0.018
1.275GlnTyr: 1.275 ± 0.033
0.0GlnXaa: 0.0 ± 0.0
Arg
2.628ArgAla: 2.628 ± 0.043
0.252ArgCys: 0.252 ± 0.012
2.156ArgAsp: 2.156 ± 0.041
3.738ArgGlu: 3.738 ± 0.057
1.926ArgPhe: 1.926 ± 0.038
2.635ArgGly: 2.635 ± 0.043
0.908ArgHis: 0.908 ± 0.026
3.072ArgIle: 3.072 ± 0.048
3.372ArgLys: 3.372 ± 0.05
4.182ArgLeu: 4.182 ± 0.055
1.38ArgMet: 1.38 ± 0.029
1.94ArgAsn: 1.94 ± 0.043
1.536ArgPro: 1.536 ± 0.038
1.82ArgGln: 1.82 ± 0.038
2.092ArgArg: 2.092 ± 0.041
2.324ArgSer: 2.324 ± 0.044
2.102ArgThr: 2.102 ± 0.042
2.761ArgVal: 2.761 ± 0.053
0.448ArgTrp: 0.448 ± 0.018
1.521ArgTyr: 1.521 ± 0.03
0.0ArgXaa: 0.0 ± 0.0
Ser
4.245SerAla: 4.245 ± 0.061
0.434SerCys: 0.434 ± 0.019
2.689SerAsp: 2.689 ± 0.045
3.854SerGlu: 3.854 ± 0.052
3.292SerPhe: 3.292 ± 0.05
4.894SerGly: 4.894 ± 0.064
1.267SerHis: 1.267 ± 0.032
4.545SerIle: 4.545 ± 0.058
3.357SerLys: 3.357 ± 0.051
6.351SerLeu: 6.351 ± 0.068
1.739SerMet: 1.739 ± 0.032
2.116SerAsn: 2.116 ± 0.048
2.366SerPro: 2.366 ± 0.043
2.045SerGln: 2.045 ± 0.037
2.769SerArg: 2.769 ± 0.045
3.877SerSer: 3.877 ± 0.053
2.822SerThr: 2.822 ± 0.051
4.257SerVal: 4.257 ± 0.054
0.673SerTrp: 0.673 ± 0.018
2.053SerTyr: 2.053 ± 0.036
0.0SerXaa: 0.0 ± 0.0
Thr
4.221ThrAla: 4.221 ± 0.066
0.333ThrCys: 0.333 ± 0.016
2.412ThrAsp: 2.412 ± 0.038
3.263ThrGlu: 3.263 ± 0.05
2.384ThrPhe: 2.384 ± 0.047
4.503ThrGly: 4.503 ± 0.063
1.007ThrHis: 1.007 ± 0.029
3.88ThrIle: 3.88 ± 0.055
2.781ThrLys: 2.781 ± 0.048
5.051ThrLeu: 5.051 ± 0.059
1.272ThrMet: 1.272 ± 0.033
1.957ThrAsn: 1.957 ± 0.045
2.277ThrPro: 2.277 ± 0.038
1.34ThrGln: 1.34 ± 0.028
1.969ThrArg: 1.969 ± 0.038
3.022ThrSer: 3.022 ± 0.052
2.421ThrThr: 2.421 ± 0.05
4.039ThrVal: 4.039 ± 0.053
0.512ThrTrp: 0.512 ± 0.019
1.696ThrTyr: 1.696 ± 0.034
0.0ThrXaa: 0.0 ± 0.0
Val
5.255ValAla: 5.255 ± 0.065
0.58ValCys: 0.58 ± 0.02
3.453ValAsp: 3.453 ± 0.057
4.787ValGlu: 4.787 ± 0.061
3.286ValPhe: 3.286 ± 0.054
4.682ValGly: 4.682 ± 0.06
1.48ValHis: 1.48 ± 0.031
5.56ValIle: 5.56 ± 0.078
4.539ValLys: 4.539 ± 0.062
7.341ValLeu: 7.341 ± 0.079
1.942ValMet: 1.942 ± 0.034
2.935ValAsn: 2.935 ± 0.047
3.011ValPro: 3.011 ± 0.048
2.318ValGln: 2.318 ± 0.042
2.826ValArg: 2.826 ± 0.041
4.595ValSer: 4.595 ± 0.064
3.897ValThr: 3.897 ± 0.047
4.844ValVal: 4.844 ± 0.066
0.635ValTrp: 0.635 ± 0.022
2.279ValTyr: 2.279 ± 0.037
0.0ValXaa: 0.0 ± 0.0
Trp
0.704TrpAla: 0.704 ± 0.026
0.077TrpCys: 0.077 ± 0.008
0.496TrpAsp: 0.496 ± 0.022
0.768TrpGlu: 0.768 ± 0.024
0.51TrpPhe: 0.51 ± 0.017
0.737TrpGly: 0.737 ± 0.026
0.202TrpHis: 0.202 ± 0.011
0.767TrpIle: 0.767 ± 0.025
0.738TrpLys: 0.738 ± 0.022
1.119TrpLeu: 1.119 ± 0.029
0.366TrpMet: 0.366 ± 0.017
0.537TrpAsn: 0.537 ± 0.018
0.308TrpPro: 0.308 ± 0.017
0.397TrpGln: 0.397 ± 0.019
0.423TrpArg: 0.423 ± 0.015
0.588TrpSer: 0.588 ± 0.021
0.516TrpThr: 0.516 ± 0.021
0.665TrpVal: 0.665 ± 0.023
0.147TrpTrp: 0.147 ± 0.01
0.333TrpTyr: 0.333 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.158TyrAla: 2.158 ± 0.039
0.279TyrCys: 0.279 ± 0.014
1.774TyrAsp: 1.774 ± 0.037
2.428TyrGlu: 2.428 ± 0.044
1.769TyrPhe: 1.769 ± 0.037
2.536TyrGly: 2.536 ± 0.039
0.841TyrHis: 0.841 ± 0.024
2.261TyrIle: 2.261 ± 0.045
1.927TyrLys: 1.927 ± 0.041
3.474TyrLeu: 3.474 ± 0.052
0.815TyrMet: 0.815 ± 0.025
1.319TyrAsn: 1.319 ± 0.031
1.42TyrPro: 1.42 ± 0.033
1.376TyrGln: 1.376 ± 0.031
1.697TyrArg: 1.697 ± 0.032
2.131TyrSer: 2.131 ± 0.041
1.663TyrThr: 1.663 ± 0.037
2.106TyrVal: 2.106 ± 0.04
0.38TyrTrp: 0.38 ± 0.016
1.339TyrTyr: 1.339 ± 0.03
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4713 proteins (1386986 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski