Amino acid dipeptide frequency for Candidatus Brocadia sapporoensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
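A minimal Python sketch of how such a dipeptide frequency could be computed. It assumes overlapping windows of two residues counted within each protein (pairs never span two proteins) and maps every non-standard letter to X (Xaa); the function name and the toy sequences are hypothetical, and the database's own pipeline may differ in detail.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # one-letter codes of the 20 standard amino acids


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2, counted separately for each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "MKKLA" gives MK, KK, KL, LA and "MKA" gives MK, KA (6 dipeptides in total).
freqs = dipeptide_frequencies(["MKKLA", "MKA"])
```

The result uses one-letter codes as keys (for example "MK" for MetLys), and the frequencies of all observed pairs sum to 1000‰.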

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
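The expected value and the over/under classification can be sketched in Python as follows. The threshold of 1/400 follows the text; whether the page colours cells against exactly this threshold is an assumption, and the `classify` helper is hypothetical. The two example values are taken from the table (LeuLeu and TrpTrp).

```python
N_STANDARD = 20

# Uniform expectation: each of the 20 * 20 = 400 dipeptides equally likely.
expected_permille = 1000.0 / N_STANDARD ** 2
# Same expectation if Xaa is counted as a 21st residue (441 dipeptides).
expected_permille_with_xaa = 1000.0 / 21 ** 2


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (hypothetical helper)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values in per mille, copied from the table.
examples = {"LeuLeu": 8.829, "TrpTrp": 0.142}
labels = {pair: classify(value) for pair, value in examples.items()}
```

Here `expected_permille` is 2.5, so LeuLeu is labelled "over" and TrpTrp "under".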

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
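As a small illustration of the unit conversion, the Python sketch below turns a per-mille value into a fraction and a percentage. The helper names are hypothetical; the example value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala_permille = 5.116  # AlaAla entry from the table, in per mille
ala_ala_fraction = permille_to_fraction(ala_ala_permille)
ala_ala_percent = permille_to_percent(ala_ala_permille)
```

For AlaAla this gives a fraction of about 0.005116, i.e. about 0.5116%.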

For more information, see the sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.116AlaAla: 5.116 ± 0.096
0.938AlaCys: 0.938 ± 0.036
3.51AlaAsp: 3.51 ± 0.066
4.608AlaGlu: 4.608 ± 0.087
3.143AlaPhe: 3.143 ± 0.066
5.35AlaGly: 5.35 ± 0.098
1.444AlaHis: 1.444 ± 0.047
6.011AlaIle: 6.011 ± 0.102
4.632AlaLys: 4.632 ± 0.086
6.818AlaLeu: 6.818 ± 0.092
1.859AlaMet: 1.859 ± 0.053
2.502AlaAsn: 2.502 ± 0.063
2.056AlaPro: 2.056 ± 0.053
2.276AlaGln: 2.276 ± 0.058
3.467AlaArg: 3.467 ± 0.075
4.147AlaSer: 4.147 ± 0.075
3.42AlaThr: 3.42 ± 0.071
5.011AlaVal: 5.011 ± 0.086
0.671AlaTrp: 0.671 ± 0.034
2.411AlaTyr: 2.411 ± 0.067
0.0AlaXaa: 0.0 ± 0.0
Cys
0.908CysAla: 0.908 ± 0.036
0.239CysCys: 0.239 ± 0.019
0.641CysAsp: 0.641 ± 0.027
0.722CysGlu: 0.722 ± 0.033
0.656CysPhe: 0.656 ± 0.028
1.173CysGly: 1.173 ± 0.045
0.544CysHis: 0.544 ± 0.047
1.005CysIle: 1.005 ± 0.041
0.825CysLys: 0.825 ± 0.039
1.264CysLeu: 1.264 ± 0.04
0.299CysMet: 0.299 ± 0.018
0.604CysAsn: 0.604 ± 0.029
0.693CysPro: 0.693 ± 0.035
0.379CysGln: 0.379 ± 0.023
0.696CysArg: 0.696 ± 0.03
0.811CysSer: 0.811 ± 0.031
0.623CysThr: 0.623 ± 0.028
0.864CysVal: 0.864 ± 0.033
0.124CysTrp: 0.124 ± 0.013
0.453CysTyr: 0.453 ± 0.024
0.0CysXaa: 0.0 ± 0.0
Asp
4.087AspAla: 4.087 ± 0.084
0.64AspCys: 0.64 ± 0.029
2.469AspAsp: 2.469 ± 0.075
3.679AspGlu: 3.679 ± 0.073
2.563AspPhe: 2.563 ± 0.053
3.447AspGly: 3.447 ± 0.079
0.931AspHis: 0.931 ± 0.036
4.837AspIle: 4.837 ± 0.084
3.635AspLys: 3.635 ± 0.075
4.717AspLeu: 4.717 ± 0.089
1.28AspMet: 1.28 ± 0.042
2.06AspAsn: 2.06 ± 0.058
2.036AspPro: 2.036 ± 0.056
1.296AspGln: 1.296 ± 0.043
2.444AspArg: 2.444 ± 0.062
2.45AspSer: 2.45 ± 0.061
3.076AspThr: 3.076 ± 0.062
3.841AspVal: 3.841 ± 0.062
0.633AspTrp: 0.633 ± 0.033
2.07AspTyr: 2.07 ± 0.058
0.0AspXaa: 0.0 ± 0.0
Glu
4.505GluAla: 4.505 ± 0.092
0.719GluCys: 0.719 ± 0.034
3.318GluAsp: 3.318 ± 0.074
5.052GluGlu: 5.052 ± 0.111
2.534GluPhe: 2.534 ± 0.061
4.235GluGly: 4.235 ± 0.076
1.317GluHis: 1.317 ± 0.045
6.334GluIle: 6.334 ± 0.106
6.37GluLys: 6.37 ± 0.103
5.711GluLeu: 5.711 ± 0.107
1.924GluMet: 1.924 ± 0.055
3.126GluAsn: 3.126 ± 0.07
1.827GluPro: 1.827 ± 0.051
2.157GluGln: 2.157 ± 0.062
3.828GluArg: 3.828 ± 0.07
3.566GluSer: 3.566 ± 0.073
3.708GluThr: 3.708 ± 0.066
4.24GluVal: 4.24 ± 0.07
0.687GluTrp: 0.687 ± 0.031
2.161GluTyr: 2.161 ± 0.05
0.0GluXaa: 0.0 ± 0.0
Phe
2.835PheAla: 2.835 ± 0.075
0.789PheCys: 0.789 ± 0.03
2.462PheAsp: 2.462 ± 0.053
2.538PheGlu: 2.538 ± 0.058
2.481PhePhe: 2.481 ± 0.076
3.112PheGly: 3.112 ± 0.067
1.094PheHis: 1.094 ± 0.034
3.394PheIle: 3.394 ± 0.079
2.567PheLys: 2.567 ± 0.069
4.572PheLeu: 4.572 ± 0.108
1.059PheMet: 1.059 ± 0.037
1.756PheAsn: 1.756 ± 0.05
1.852PhePro: 1.852 ± 0.055
1.5PheGln: 1.5 ± 0.045
2.069PheArg: 2.069 ± 0.063
3.231PheSer: 3.231 ± 0.07
2.481PheThr: 2.481 ± 0.067
3.147PheVal: 3.147 ± 0.079
0.488PheTrp: 0.488 ± 0.023
1.661PheTyr: 1.661 ± 0.049
0.0PheXaa: 0.0 ± 0.0
Gly
4.544GlyAla: 4.544 ± 0.094
1.123GlyCys: 1.123 ± 0.041
3.298GlyAsp: 3.298 ± 0.072
4.209GlyGlu: 4.209 ± 0.074
3.278GlyPhe: 3.278 ± 0.064
4.835GlyGly: 4.835 ± 0.116
1.395GlyHis: 1.395 ± 0.051
6.559GlyIle: 6.559 ± 0.092
5.585GlyLys: 5.585 ± 0.086
6.021GlyLeu: 6.021 ± 0.085
1.947GlyMet: 1.947 ± 0.057
3.163GlyAsn: 3.163 ± 0.09
1.63GlyPro: 1.63 ± 0.045
1.916GlyGln: 1.916 ± 0.059
3.452GlyArg: 3.452 ± 0.066
3.98GlySer: 3.98 ± 0.093
3.801GlyThr: 3.801 ± 0.088
4.786GlyVal: 4.786 ± 0.099
0.779GlyTrp: 0.779 ± 0.036
2.669GlyTyr: 2.669 ± 0.057
0.0GlyXaa: 0.0 ± 0.0
His
1.658HisAla: 1.658 ± 0.043
0.318HisCys: 0.318 ± 0.021
1.076HisAsp: 1.076 ± 0.037
1.394HisGlu: 1.394 ± 0.041
0.977HisPhe: 0.977 ± 0.037
1.599HisGly: 1.599 ± 0.045
0.597HisHis: 0.597 ± 0.03
1.82HisIle: 1.82 ± 0.056
1.284HisLys: 1.284 ± 0.041
2.025HisLeu: 2.025 ± 0.054
0.47HisMet: 0.47 ± 0.026
0.894HisAsn: 0.894 ± 0.039
1.146HisPro: 1.146 ± 0.04
0.675HisGln: 0.675 ± 0.026
1.062HisArg: 1.062 ± 0.033
1.179HisSer: 1.179 ± 0.043
1.2HisThr: 1.2 ± 0.038
1.356HisVal: 1.356 ± 0.042
0.241HisTrp: 0.241 ± 0.018
0.811HisTyr: 0.811 ± 0.039
0.0HisXaa: 0.0 ± 0.0
Ile
6.241IleAla: 6.241 ± 0.102
1.076IleCys: 1.076 ± 0.04
4.487IleAsp: 4.487 ± 0.073
5.246IleGlu: 5.246 ± 0.091
3.551IlePhe: 3.551 ± 0.082
5.231IleGly: 5.231 ± 0.089
1.842IleHis: 1.842 ± 0.046
6.059IleIle: 6.059 ± 0.102
5.391IleLys: 5.391 ± 0.094
7.548IleLeu: 7.548 ± 0.116
1.745IleMet: 1.745 ± 0.05
3.269IleAsn: 3.269 ± 0.066
3.966IlePro: 3.966 ± 0.083
2.674IleGln: 2.674 ± 0.059
4.052IleArg: 4.052 ± 0.08
5.294IleSer: 5.294 ± 0.091
4.623IleThr: 4.623 ± 0.08
5.407IleVal: 5.407 ± 0.09
0.664IleTrp: 0.664 ± 0.028
2.33IleTyr: 2.33 ± 0.065
0.0IleXaa: 0.0 ± 0.0
Lys
4.663LysAla: 4.663 ± 0.075
0.726LysCys: 0.726 ± 0.039
4.122LysAsp: 4.122 ± 0.085
6.167LysGlu: 6.167 ± 0.104
2.28LysPhe: 2.28 ± 0.055
4.747LysGly: 4.747 ± 0.088
1.333LysHis: 1.333 ± 0.042
6.225LysIle: 6.225 ± 0.095
6.827LysLys: 6.827 ± 0.123
5.811LysLeu: 5.811 ± 0.09
1.956LysMet: 1.956 ± 0.053
3.86LysAsn: 3.86 ± 0.083
2.564LysPro: 2.564 ± 0.057
2.335LysGln: 2.335 ± 0.062
3.933LysArg: 3.933 ± 0.072
3.775LysSer: 3.775 ± 0.071
4.195LysThr: 4.195 ± 0.076
4.436LysVal: 4.436 ± 0.073
0.563LysTrp: 0.563 ± 0.03
2.475LysTyr: 2.475 ± 0.063
0.0LysXaa: 0.0 ± 0.0
Leu
6.446LeuAla: 6.446 ± 0.105
1.373LeuCys: 1.373 ± 0.04
4.632LeuAsp: 4.632 ± 0.086
6.08LeuGlu: 6.08 ± 0.103
4.339LeuPhe: 4.339 ± 0.09
6.191LeuGly: 6.191 ± 0.111
1.924LeuHis: 1.924 ± 0.045
6.548LeuIle: 6.548 ± 0.101
7.122LeuLys: 7.122 ± 0.122
8.829LeuLeu: 8.829 ± 0.135
2.12LeuMet: 2.12 ± 0.053
3.757LeuAsn: 3.757 ± 0.079
4.119LeuPro: 4.119 ± 0.074
3.236LeuGln: 3.236 ± 0.069
5.022LeuArg: 5.022 ± 0.091
6.796LeuSer: 6.796 ± 0.098
4.883LeuThr: 4.883 ± 0.088
5.573LeuVal: 5.573 ± 0.087
0.985LeuTrp: 0.985 ± 0.04
2.857LeuTyr: 2.857 ± 0.073
0.0LeuXaa: 0.0 ± 0.0
Met
1.898MetAla: 1.898 ± 0.052
0.237MetCys: 0.237 ± 0.018
1.269MetAsp: 1.269 ± 0.036
1.663MetGlu: 1.663 ± 0.049
0.827MetPhe: 0.827 ± 0.032
1.795MetGly: 1.795 ± 0.051
0.52MetHis: 0.52 ± 0.029
1.797MetIle: 1.797 ± 0.049
2.179MetLys: 2.179 ± 0.053
2.173MetLeu: 2.173 ± 0.052
0.601MetMet: 0.601 ± 0.032
1.149MetAsn: 1.149 ± 0.037
1.071MetPro: 1.071 ± 0.035
0.864MetGln: 0.864 ± 0.03
1.227MetArg: 1.227 ± 0.041
1.508MetSer: 1.508 ± 0.044
1.274MetThr: 1.274 ± 0.044
1.801MetVal: 1.801 ± 0.045
0.188MetTrp: 0.188 ± 0.015
0.666MetTyr: 0.666 ± 0.034
0.0MetXaa: 0.0 ± 0.0
Asn
3.01AsnAla: 3.01 ± 0.071
0.511AsnCys: 0.511 ± 0.027
2.276AsnAsp: 2.276 ± 0.054
2.632AsnGlu: 2.632 ± 0.065
1.776AsnPhe: 1.776 ± 0.05
2.69AsnGly: 2.69 ± 0.061
0.91AsnHis: 0.91 ± 0.034
3.817AsnIle: 3.817 ± 0.077
2.881AsnLys: 2.881 ± 0.055
4.124AsnLeu: 4.124 ± 0.076
0.894AsnMet: 0.894 ± 0.034
2.03AsnAsn: 2.03 ± 0.066
2.35AsnPro: 2.35 ± 0.052
1.307AsnGln: 1.307 ± 0.046
2.063AsnArg: 2.063 ± 0.048
2.116AsnSer: 2.116 ± 0.054
2.475AsnThr: 2.475 ± 0.061
2.769AsnVal: 2.769 ± 0.051
0.457AsnTrp: 0.457 ± 0.026
1.588AsnTyr: 1.588 ± 0.049
0.0AsnXaa: 0.0 ± 0.0
Pro
2.793ProAla: 2.793 ± 0.065
0.534ProCys: 0.534 ± 0.028
2.515ProAsp: 2.515 ± 0.056
3.118ProGlu: 3.118 ± 0.068
1.993ProPhe: 1.993 ± 0.052
2.716ProGly: 2.716 ± 0.068
0.943ProHis: 0.943 ± 0.033
2.471ProIle: 2.471 ± 0.056
2.284ProLys: 2.284 ± 0.053
3.683ProLeu: 3.683 ± 0.072
0.83ProMet: 0.83 ± 0.036
1.388ProAsn: 1.388 ± 0.044
1.578ProPro: 1.578 ± 0.048
1.316ProGln: 1.316 ± 0.041
1.67ProArg: 1.67 ± 0.048
2.289ProSer: 2.289 ± 0.056
1.934ProThr: 1.934 ± 0.068
3.265ProVal: 3.265 ± 0.069
0.439ProTrp: 0.439 ± 0.027
1.431ProTyr: 1.431 ± 0.046
0.0ProXaa: 0.0 ± 0.0
Gln
2.18GlnAla: 2.18 ± 0.056
0.338GlnCys: 0.338 ± 0.022
1.481GlnAsp: 1.481 ± 0.043
2.367GlnGlu: 2.367 ± 0.062
1.319GlnPhe: 1.319 ± 0.04
2.021GlnGly: 2.021 ± 0.055
0.714GlnHis: 0.714 ± 0.029
2.598GlnIle: 2.598 ± 0.061
2.885GlnLys: 2.885 ± 0.065
2.599GlnLeu: 2.599 ± 0.068
0.865GlnMet: 0.865 ± 0.039
1.532GlnAsn: 1.532 ± 0.04
1.188GlnPro: 1.188 ± 0.04
1.248GlnGln: 1.248 ± 0.046
1.809GlnArg: 1.809 ± 0.054
1.725GlnSer: 1.725 ± 0.051
1.767GlnThr: 1.767 ± 0.045
1.973GlnVal: 1.973 ± 0.041
0.34GlnTrp: 0.34 ± 0.022
1.183GlnTyr: 1.183 ± 0.044
0.0GlnXaa: 0.0 ± 0.0
Arg
3.051ArgAla: 3.051 ± 0.068
0.753ArgCys: 0.753 ± 0.034
2.621ArgAsp: 2.621 ± 0.055
3.815ArgGlu: 3.815 ± 0.078
2.402ArgPhe: 2.402 ± 0.062
3.019ArgGly: 3.019 ± 0.079
1.218ArgHis: 1.218 ± 0.039
4.258ArgIle: 4.258 ± 0.079
3.674ArgLys: 3.674 ± 0.08
4.921ArgLeu: 4.921 ± 0.09
1.537ArgMet: 1.537 ± 0.043
2.363ArgAsn: 2.363 ± 0.054
1.661ArgPro: 1.661 ± 0.047
1.774ArgGln: 1.774 ± 0.045
2.781ArgArg: 2.781 ± 0.069
2.66ArgSer: 2.66 ± 0.061
2.459ArgThr: 2.459 ± 0.064
3.071ArgVal: 3.071 ± 0.067
0.586ArgTrp: 0.586 ± 0.028
2.004ArgTyr: 2.004 ± 0.064
0.0ArgXaa: 0.0 ± 0.0
Ser
4.016SerAla: 4.016 ± 0.077
0.766SerCys: 0.766 ± 0.032
2.982SerAsp: 2.982 ± 0.063
3.529SerGlu: 3.529 ± 0.075
3.041SerPhe: 3.041 ± 0.073
4.712SerGly: 4.712 ± 0.11
1.261SerHis: 1.261 ± 0.04
4.491SerIle: 4.491 ± 0.079
3.694SerLys: 3.694 ± 0.074
6.257SerLeu: 6.257 ± 0.1
1.552SerMet: 1.552 ± 0.048
2.281SerAsn: 2.281 ± 0.062
2.577SerPro: 2.577 ± 0.053
2.007SerGln: 2.007 ± 0.052
2.945SerArg: 2.945 ± 0.061
3.679SerSer: 3.679 ± 0.082
3.03SerThr: 3.03 ± 0.08
4.135SerVal: 4.135 ± 0.083
0.722SerTrp: 0.722 ± 0.037
1.998SerTyr: 1.998 ± 0.061
0.0SerXaa: 0.0 ± 0.0
Thr
3.937ThrAla: 3.937 ± 0.076
0.716ThrCys: 0.716 ± 0.032
2.804ThrAsp: 2.804 ± 0.066
3.38ThrGlu: 3.38 ± 0.061
2.388ThrPhe: 2.388 ± 0.066
4.367ThrGly: 4.367 ± 0.085
1.238ThrHis: 1.238 ± 0.046
4.376ThrIle: 4.376 ± 0.073
3.445ThrLys: 3.445 ± 0.079
5.106ThrLeu: 5.106 ± 0.082
1.261ThrMet: 1.261 ± 0.042
2.234ThrAsn: 2.234 ± 0.062
2.499ThrPro: 2.499 ± 0.071
1.611ThrGln: 1.611 ± 0.052
2.451ThrArg: 2.451 ± 0.054
3.122ThrSer: 3.122 ± 0.081
3.078ThrThr: 3.078 ± 0.076
3.887ThrVal: 3.887 ± 0.074
0.505ThrTrp: 0.505 ± 0.028
1.736ThrTyr: 1.736 ± 0.053
0.0ThrXaa: 0.0 ± 0.0
Val
4.653ValAla: 4.653 ± 0.084
1.076ValCys: 1.076 ± 0.039
3.524ValAsp: 3.524 ± 0.068
4.182ValGlu: 4.182 ± 0.077
3.346ValPhe: 3.346 ± 0.071
4.579ValGly: 4.579 ± 0.094
1.347ValHis: 1.347 ± 0.047
5.177ValIle: 5.177 ± 0.093
4.676ValLys: 4.676 ± 0.078
6.167ValLeu: 6.167 ± 0.094
1.585ValMet: 1.585 ± 0.052
2.764ValAsn: 2.764 ± 0.065
2.562ValPro: 2.562 ± 0.064
1.833ValGln: 1.833 ± 0.052
3.288ValArg: 3.288 ± 0.071
4.741ValSer: 4.741 ± 0.081
3.879ValThr: 3.879 ± 0.08
5.105ValVal: 5.105 ± 0.103
0.682ValTrp: 0.682 ± 0.027
2.161ValTyr: 2.161 ± 0.055
0.0ValXaa: 0.0 ± 0.0
Trp
0.628TrpAla: 0.628 ± 0.029
0.143TrpCys: 0.143 ± 0.013
0.615TrpAsp: 0.615 ± 0.03
0.717TrpGlu: 0.717 ± 0.031
0.48TrpPhe: 0.48 ± 0.028
0.645TrpGly: 0.645 ± 0.028
0.274TrpHis: 0.274 ± 0.018
0.805TrpIle: 0.805 ± 0.036
0.749TrpLys: 0.749 ± 0.033
1.015TrpLeu: 1.015 ± 0.045
0.286TrpMet: 0.286 ± 0.019
0.46TrpAsn: 0.46 ± 0.024
0.309TrpPro: 0.309 ± 0.02
0.453TrpGln: 0.453 ± 0.024
0.557TrpArg: 0.557 ± 0.028
0.563TrpSer: 0.563 ± 0.032
0.481TrpThr: 0.481 ± 0.027
0.62TrpVal: 0.62 ± 0.031
0.142TrpTrp: 0.142 ± 0.013
0.361TrpTyr: 0.361 ± 0.023
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.222TyrAla: 2.222 ± 0.064
0.505TyrCys: 0.505 ± 0.027
2.088TyrAsp: 2.088 ± 0.054
2.22TyrGlu: 2.22 ± 0.058
1.717TyrPhe: 1.717 ± 0.048
2.507TyrGly: 2.507 ± 0.069
0.937TyrHis: 0.937 ± 0.036
2.113TyrIle: 2.113 ± 0.045
2.287TyrLys: 2.287 ± 0.058
3.473TyrLeu: 3.473 ± 0.066
0.636TyrMet: 0.636 ± 0.033
1.485TyrAsn: 1.485 ± 0.047
1.502TyrPro: 1.502 ± 0.042
1.273TyrGln: 1.273 ± 0.043
1.814TyrArg: 1.814 ± 0.047
2.044TyrSer: 2.044 ± 0.063
1.755TyrThr: 1.755 ± 0.056
2.04TyrVal: 2.04 ± 0.061
0.408TyrTrp: 0.408 ± 0.028
1.421TyrTyr: 1.421 ± 0.056
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2485 proteins (783403 amino acids)
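
For a sense of scale, the per-mille values can be turned into approximate dipeptide counts. The Python sketch below assumes overlapping windows counted within each protein, so that a protein of length n contributes n - 1 dipeptides and the total is the number of amino acids minus the number of proteins; how the database actually counts is not stated on this page, so the result is only an estimate.

```python
n_proteins = 2485
n_amino_acids = 783403

# Assumption: each protein of length n >= 1 contributes n - 1 overlapping dipeptides.
n_dipeptides = n_amino_acids - n_proteins

ala_ala_permille = 5.116  # AlaAla frequency reported for this proteome, in per mille
ala_ala_count_estimate = ala_ala_permille * 1e-3 * n_dipeptides
```

Under this assumption the proteome contains 780918 dipeptides, of which roughly 3995 would be AlaAla.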

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
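
A minimal Python sketch of a protein-level bootstrap, under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the standard deviation of the replicates is taken as the error. The function names, the fixed seed, the use of the standard deviation, and the toy sequences are all assumptions for illustration; the database's exact procedure may differ.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide (one-letter codes) over all proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome: 20 dipeptides in total, 4 of them "AA".
toy = ["MKKLAAL", "MAAKL", "MKLLAA", "MAALKK"]
point_estimate = pair_permille(toy, "AA")
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual dipeptides keeps the pairs within a protein together, so the error reflects variation between proteins.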

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski