Amino acid dipeptide frequency for Mesobacillus persicus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
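As a rough illustration of how such per mille values could be obtained, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs within each protein, that sequences use one-letter codes, and that any residue outside the 20 standard ones is mapped to X (Xaa). The function name `dipeptide_permille` and the constant `UNIFORM_PERMILLE` are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted, and a pair never spans
    two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything that is not a standard residue to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Slide a window of length 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Frequency of each dipeptide if all 400 standard pairs were equally likely:
# 1000 / 400 = 2.5 per mille, i.e. 0.25%.
UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2
```

With this convention, a table value such as 10.06 for LeuLeu corresponds to a fraction of 0.01006, about four times the uniform 2.5 per mille.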

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.647 ± 0.099
AlaCys: 0.596 ± 0.024
AlaAsp: 3.352 ± 0.049
AlaGlu: 5.057 ± 0.072
AlaPhe: 3.311 ± 0.064
AlaGly: 5.456 ± 0.082
AlaHis: 1.236 ± 0.034
AlaIle: 5.759 ± 0.086
AlaLys: 4.6 ± 0.065
AlaLeu: 7.022 ± 0.079
AlaMet: 1.988 ± 0.038
AlaAsn: 2.851 ± 0.046
AlaPro: 2.163 ± 0.05
AlaGln: 2.141 ± 0.044
AlaArg: 2.695 ± 0.06
AlaSer: 4.016 ± 0.055
AlaThr: 3.579 ± 0.063
AlaVal: 5.375 ± 0.073
AlaTrp: 0.641 ± 0.025
AlaTyr: 2.201 ± 0.04
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.426 ± 0.018
CysCys: 0.106 ± 0.01
CysAsp: 0.383 ± 0.019
CysGlu: 0.477 ± 0.02
CysPhe: 0.305 ± 0.017
CysGly: 0.71 ± 0.028
CysHis: 0.244 ± 0.017
CysIle: 0.477 ± 0.02
CysLys: 0.359 ± 0.018
CysLeu: 0.704 ± 0.024
CysMet: 0.17 ± 0.012
CysAsn: 0.279 ± 0.013
CysPro: 0.389 ± 0.019
CysGln: 0.24 ± 0.014
CysArg: 0.301 ± 0.013
CysSer: 0.491 ± 0.02
CysThr: 0.42 ± 0.019
CysVal: 0.398 ± 0.018
CysTrp: 0.065 ± 0.007
CysTyr: 0.247 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.138 ± 0.056
AspCys: 0.377 ± 0.018
AspAsp: 2.357 ± 0.046
AspGlu: 4.389 ± 0.074
AspPhe: 2.445 ± 0.051
AspGly: 3.443 ± 0.056
AspHis: 1.164 ± 0.032
AspIle: 3.826 ± 0.06
AspLys: 3.133 ± 0.049
AspLeu: 5.166 ± 0.061
AspMet: 1.267 ± 0.03
AspAsn: 1.723 ± 0.037
AspPro: 2.03 ± 0.043
AspGln: 1.991 ± 0.044
AspArg: 2.327 ± 0.046
AspSer: 2.749 ± 0.053
AspThr: 2.381 ± 0.042
AspVal: 3.621 ± 0.056
AspTrp: 0.625 ± 0.025
AspTyr: 2.13 ± 0.044
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.536 ± 0.079
GluCys: 0.409 ± 0.017
GluAsp: 3.965 ± 0.059
GluGlu: 7.486 ± 0.101
GluPhe: 2.836 ± 0.054
GluGly: 4.664 ± 0.06
GluHis: 1.487 ± 0.039
GluIle: 5.841 ± 0.072
GluLys: 6.476 ± 0.087
GluLeu: 7.676 ± 0.101
GluMet: 2.387 ± 0.043
GluAsn: 3.816 ± 0.059
GluPro: 2.02 ± 0.041
GluGln: 3.333 ± 0.056
GluArg: 3.605 ± 0.058
GluSer: 3.632 ± 0.059
GluThr: 4.112 ± 0.056
GluVal: 5.556 ± 0.069
GluTrp: 0.877 ± 0.03
GluTyr: 2.414 ± 0.039
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.152 ± 0.057
PheCys: 0.343 ± 0.017
PheAsp: 2.375 ± 0.043
PheGlu: 2.958 ± 0.05
PhePhe: 2.392 ± 0.053
PheGly: 3.464 ± 0.058
PheHis: 1.005 ± 0.031
PheIle: 3.896 ± 0.06
PheLys: 2.468 ± 0.049
PheLeu: 4.79 ± 0.072
PheMet: 1.178 ± 0.031
PheAsn: 1.959 ± 0.041
PhePro: 1.699 ± 0.044
PheGln: 1.521 ± 0.034
PheArg: 1.6 ± 0.042
PheSer: 3.196 ± 0.052
PheThr: 2.637 ± 0.048
PheVal: 3.003 ± 0.06
PheTrp: 0.477 ± 0.023
PheTyr: 1.745 ± 0.039
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.949 ± 0.074
GlyCys: 0.618 ± 0.025
GlyAsp: 3.285 ± 0.056
GlyGlu: 4.77 ± 0.073
GlyPhe: 3.604 ± 0.065
GlyGly: 5.101 ± 0.085
GlyHis: 1.415 ± 0.037
GlyIle: 5.864 ± 0.083
GlyLys: 5.143 ± 0.069
GlyLeu: 6.993 ± 0.086
GlyMet: 2.245 ± 0.042
GlyAsn: 2.817 ± 0.052
GlyPro: 1.948 ± 0.047
GlyGln: 2.339 ± 0.048
GlyArg: 2.883 ± 0.046
GlySer: 4.139 ± 0.065
GlyThr: 4.065 ± 0.052
GlyVal: 5.377 ± 0.064
GlyTrp: 0.796 ± 0.028
GlyTyr: 2.834 ± 0.057
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.224 ± 0.029
HisCys: 0.207 ± 0.015
HisAsp: 0.968 ± 0.03
HisGlu: 1.408 ± 0.035
HisPhe: 1.053 ± 0.03
HisGly: 1.498 ± 0.04
HisHis: 0.67 ± 0.028
HisIle: 1.399 ± 0.028
HisLys: 1.082 ± 0.03
HisLeu: 2.063 ± 0.047
HisMet: 0.507 ± 0.021
HisAsn: 0.768 ± 0.029
HisPro: 1.151 ± 0.032
HisGln: 0.884 ± 0.027
HisArg: 0.878 ± 0.029
HisSer: 1.273 ± 0.034
HisThr: 1.027 ± 0.032
HisVal: 1.307 ± 0.031
HisTrp: 0.231 ± 0.013
HisTyr: 0.862 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.77 ± 0.074
IleCys: 0.594 ± 0.02
IleAsp: 4.088 ± 0.07
IleGlu: 5.777 ± 0.075
IlePhe: 3.152 ± 0.059
IleGly: 6.064 ± 0.087
IleHis: 1.681 ± 0.032
IleIle: 5.703 ± 0.096
IleLys: 4.462 ± 0.062
IleLeu: 7.288 ± 0.086
IleMet: 1.807 ± 0.039
IleAsn: 3.16 ± 0.051
IlePro: 3.294 ± 0.053
IleGln: 2.746 ± 0.048
IleArg: 3.048 ± 0.052
IleSer: 4.948 ± 0.068
IleThr: 4.127 ± 0.063
IleVal: 5.526 ± 0.067
IleTrp: 0.642 ± 0.02
IleTyr: 2.388 ± 0.045
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.641 ± 0.071
LysCys: 0.349 ± 0.013
LysAsp: 3.694 ± 0.064
LysGlu: 6.652 ± 0.081
LysPhe: 1.945 ± 0.039
LysGly: 4.507 ± 0.066
LysHis: 1.235 ± 0.031
LysIle: 4.625 ± 0.06
LysLys: 5.606 ± 0.071
LysLeu: 5.846 ± 0.073
LysMet: 2.119 ± 0.037
LysAsn: 3.31 ± 0.055
LysPro: 2.186 ± 0.04
LysGln: 3.067 ± 0.055
LysArg: 3.279 ± 0.055
LysSer: 3.416 ± 0.049
LysThr: 3.633 ± 0.054
LysVal: 4.786 ± 0.069
LysTrp: 0.798 ± 0.025
LysTyr: 2.074 ± 0.045
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.651 ± 0.09
LeuCys: 0.656 ± 0.023
LeuAsp: 4.875 ± 0.068
LeuGlu: 7.113 ± 0.081
LeuPhe: 4.985 ± 0.075
LeuGly: 6.77 ± 0.088
LeuHis: 1.893 ± 0.038
LeuIle: 7.124 ± 0.104
LeuLys: 6.565 ± 0.082
LeuLeu: 10.06 ± 0.123
LeuMet: 2.546 ± 0.054
LeuAsn: 4.293 ± 0.062
LeuPro: 4.058 ± 0.061
LeuGln: 3.367 ± 0.056
LeuArg: 3.696 ± 0.064
LeuSer: 6.606 ± 0.074
LeuThr: 5.817 ± 0.071
LeuVal: 6.633 ± 0.073
LeuTrp: 0.85 ± 0.029
LeuTyr: 3.123 ± 0.056
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.078 ± 0.042
MetCys: 0.142 ± 0.011
MetAsp: 1.54 ± 0.036
MetGlu: 2.076 ± 0.049
MetPhe: 1.116 ± 0.033
MetGly: 1.877 ± 0.043
MetHis: 0.44 ± 0.019
MetIle: 2.138 ± 0.047
MetLys: 2.415 ± 0.041
MetLeu: 2.394 ± 0.046
MetMet: 0.853 ± 0.027
MetAsn: 1.547 ± 0.036
MetPro: 0.996 ± 0.036
MetGln: 0.817 ± 0.028
MetArg: 1.008 ± 0.03
MetSer: 1.669 ± 0.039
MetThr: 1.611 ± 0.035
MetVal: 1.974 ± 0.034
MetTrp: 0.18 ± 0.014
MetTyr: 0.692 ± 0.025
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.65 ± 0.047
AsnCys: 0.31 ± 0.016
AsnAsp: 2.144 ± 0.04
AsnGlu: 3.628 ± 0.058
AsnPhe: 1.665 ± 0.037
AsnGly: 3.371 ± 0.058
AsnHis: 1.131 ± 0.029
AsnIle: 3.146 ± 0.05
AsnLys: 2.723 ± 0.048
AsnLeu: 4.117 ± 0.057
AsnMet: 1.192 ± 0.032
AsnAsn: 1.871 ± 0.045
AsnPro: 2.217 ± 0.05
AsnGln: 2.097 ± 0.043
AsnArg: 2.186 ± 0.043
AsnSer: 2.311 ± 0.05
AsnThr: 2.092 ± 0.038
AsnVal: 2.87 ± 0.049
AsnTrp: 0.552 ± 0.021
AsnTyr: 1.454 ± 0.035
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.39 ± 0.048
ProCys: 0.228 ± 0.014
ProAsp: 2.047 ± 0.044
ProGlu: 3.324 ± 0.047
ProPhe: 2.026 ± 0.043
ProGly: 2.529 ± 0.057
ProHis: 0.76 ± 0.026
ProIle: 2.869 ± 0.047
ProLys: 2.216 ± 0.044
ProLeu: 3.566 ± 0.054
ProMet: 0.871 ± 0.026
ProAsn: 1.712 ± 0.04
ProPro: 1.065 ± 0.032
ProGln: 1.133 ± 0.035
ProArg: 1.158 ± 0.029
ProSer: 2.267 ± 0.041
ProThr: 1.992 ± 0.037
ProVal: 2.951 ± 0.048
ProTrp: 0.346 ± 0.017
ProTyr: 1.35 ± 0.033
ProXaa: 0.001 ± 0.001
Gln
GlnAla: 2.699 ± 0.051
GlnCys: 0.224 ± 0.015
GlnAsp: 1.604 ± 0.041
GlnGlu: 3.059 ± 0.053
GlnPhe: 1.626 ± 0.037
GlnGly: 2.22 ± 0.04
GlnHis: 0.734 ± 0.026
GlnIle: 2.544 ± 0.046
GlnLys: 2.568 ± 0.047
GlnLeu: 3.924 ± 0.064
GlnMet: 1.093 ± 0.031
GlnAsn: 1.604 ± 0.037
GlnPro: 1.25 ± 0.031
GlnGln: 1.742 ± 0.046
GlnArg: 1.554 ± 0.038
GlnSer: 2.068 ± 0.044
GlnThr: 2.008 ± 0.038
GlnVal: 2.473 ± 0.045
GlnTrp: 0.395 ± 0.018
GlnTyr: 1.292 ± 0.033
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.41 ± 0.045
ArgCys: 0.241 ± 0.015
ArgAsp: 2.073 ± 0.043
ArgGlu: 3.408 ± 0.053
ArgPhe: 2.006 ± 0.042
ArgGly: 2.495 ± 0.04
ArgHis: 0.782 ± 0.025
ArgIle: 3.004 ± 0.054
ArgLys: 3.349 ± 0.055
ArgLeu: 4.095 ± 0.054
ArgMet: 1.303 ± 0.03
ArgAsn: 1.963 ± 0.04
ArgPro: 1.409 ± 0.04
ArgGln: 1.54 ± 0.04
ArgArg: 1.84 ± 0.044
ArgSer: 2.242 ± 0.042
ArgThr: 2.078 ± 0.041
ArgVal: 2.814 ± 0.051
ArgTrp: 0.43 ± 0.022
ArgTyr: 1.592 ± 0.035
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.662 ± 0.052
SerCys: 0.449 ± 0.02
SerAsp: 2.778 ± 0.047
SerGlu: 4.215 ± 0.068
SerPhe: 3.214 ± 0.054
SerGly: 4.478 ± 0.062
SerHis: 1.141 ± 0.029
SerIle: 4.893 ± 0.058
SerLys: 3.742 ± 0.053
SerLeu: 6.173 ± 0.074
SerMet: 1.683 ± 0.034
SerAsn: 2.483 ± 0.045
SerPro: 2.119 ± 0.049
SerGln: 1.995 ± 0.039
SerArg: 2.375 ± 0.042
SerSer: 3.872 ± 0.071
SerThr: 3.092 ± 0.049
SerVal: 4.179 ± 0.052
SerTrp: 0.615 ± 0.023
SerTyr: 2.138 ± 0.044
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.895 ± 0.066
ThrCys: 0.373 ± 0.018
ThrAsp: 2.712 ± 0.046
ThrGlu: 3.822 ± 0.06
ThrPhe: 2.525 ± 0.04
ThrGly: 4.531 ± 0.054
ThrHis: 1.031 ± 0.029
ThrIle: 4.402 ± 0.066
ThrLys: 3.323 ± 0.047
ThrLeu: 5.206 ± 0.066
ThrMet: 1.329 ± 0.033
ThrAsn: 2.502 ± 0.047
ThrPro: 2.246 ± 0.039
ThrGln: 1.428 ± 0.031
ThrArg: 1.883 ± 0.042
ThrSer: 3.201 ± 0.054
ThrThr: 2.918 ± 0.058
ThrVal: 4.303 ± 0.058
ThrTrp: 0.503 ± 0.023
ThrTyr: 1.788 ± 0.039
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.063 ± 0.076
ValCys: 0.585 ± 0.024
ValAsp: 3.694 ± 0.055
ValGlu: 5.325 ± 0.07
ValPhe: 3.297 ± 0.054
ValGly: 4.879 ± 0.06
ValHis: 1.37 ± 0.033
ValIle: 5.692 ± 0.07
ValLys: 4.755 ± 0.06
ValLeu: 6.902 ± 0.077
ValMet: 1.859 ± 0.042
ValAsn: 3.144 ± 0.053
ValPro: 2.831 ± 0.048
ValGln: 2.368 ± 0.041
ValArg: 2.706 ± 0.04
ValSer: 4.493 ± 0.057
ValThr: 4.08 ± 0.063
ValVal: 5.234 ± 0.074
ValTrp: 0.593 ± 0.021
ValTyr: 2.216 ± 0.044
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.641 ± 0.024
TrpCys: 0.076 ± 0.008
TrpAsp: 0.528 ± 0.019
TrpGlu: 0.696 ± 0.025
TrpPhe: 0.512 ± 0.022
TrpGly: 0.711 ± 0.027
TrpHis: 0.191 ± 0.012
TrpIle: 0.761 ± 0.027
TrpLys: 0.748 ± 0.027
TrpLeu: 1.145 ± 0.039
TrpMet: 0.355 ± 0.016
TrpAsn: 0.516 ± 0.021
TrpPro: 0.262 ± 0.014
TrpGln: 0.363 ± 0.017
TrpArg: 0.372 ± 0.019
TrpSer: 0.628 ± 0.021
TrpThr: 0.494 ± 0.021
TrpVal: 0.68 ± 0.021
TrpTrp: 0.126 ± 0.01
TrpTyr: 0.329 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.065 ± 0.04
TyrCys: 0.309 ± 0.016
TyrAsp: 1.818 ± 0.042
TyrGlu: 2.49 ± 0.047
TyrPhe: 1.842 ± 0.04
TyrGly: 2.416 ± 0.052
TyrHis: 0.851 ± 0.028
TyrIle: 2.301 ± 0.042
TyrLys: 2.038 ± 0.045
TyrLeu: 3.552 ± 0.061
TyrMet: 0.812 ± 0.026
TyrAsn: 1.406 ± 0.036
TyrPro: 1.464 ± 0.039
TyrGln: 1.57 ± 0.042
TyrArg: 1.68 ± 0.041
TyrSer: 2.081 ± 0.034
TyrThr: 1.76 ± 0.034
TyrVal: 2.024 ± 0.04
TyrTrp: 0.392 ± 0.018
TyrTyr: 1.336 ± 0.041
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.001
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.002 ± 0.002
Statistics based on 4551 proteins (1269223 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
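A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the reported error is taken to be the standard deviation across resamples. The page does not say which statistic the ± value represents, and the names `pair_counts` and `bootstrap_dipeptide` are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each round draws len(proteins) whole proteins with
    replacement, so the protein (not the residue) is the resampling unit.
    """
    rng = random.Random(seed)
    # Count pairs once per protein; resampling then only re-weights proteins.
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within a protein together, so the spread reflects variation between proteins.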

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski