Amino acid dipeptide frequency for unidentified eubacterium SCB49

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
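As a rough illustration of what such a calculation could look like, the sketch below counts overlapping adjacent residue pairs within each protein and normalises by the total number of pairs. The function name, the one-letter input alphabet, and the rule of mapping every non-standard residue to X (Xaa) are assumptions made for this example; the page itself does not spell out the exact procedure used by Proteome-pI.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues is treated as 'X' (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs (residues i and i+1); pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the SCB49 proteome.
freqs = dipeptide_frequencies(["MKLLA", "MKX"])
```

Here `freqs["MK"]` is the share of Met-Lys pairs among all six pairs found in the two toy sequences, expressed in per mille.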

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
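The uniform baseline described above can be sketched in a few lines. The helper names are hypothetical, and treating the 0.25% (2.5‰) value as the exact colouring threshold is an assumption drawn from this paragraph rather than a documented rule. The four observed values are copied from the table on this page.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected frequency (per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide as over- or underrepresented relative to the uniform baseline."""
    expected = uniform_expectation_permille(alphabet_size)
    if observed_permille > expected:
        return "over"   # would be marked red
    if observed_permille < expected:
        return "under"  # would be marked blue
    return "as expected"


# Observed values (per mille) copied from the table on this page.
examples = {"LeuLeu": 8.858, "LysLys": 7.065, "CysCys": 0.096, "TrpTrp": 0.145}
labels = {pair: classify(value) for pair, value in examples.items()}
```

With 20 amino acids the baseline is 1000 / 400 = 2.5‰; passing `alphabet_size=21` gives the slightly lower baseline for the 441-dipeptide case that includes Xaa.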

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.02 ± 0.102
AlaCys: 0.577 ± 0.027
AlaAsp: 3.475 ± 0.079
AlaGlu: 4.03 ± 0.076
AlaPhe: 3.466 ± 0.061
AlaGly: 4.381 ± 0.093
AlaHis: 1.158 ± 0.038
AlaIle: 5.867 ± 0.087
AlaLys: 4.689 ± 0.083
AlaLeu: 6.543 ± 0.085
AlaMet: 1.74 ± 0.042
AlaAsn: 3.618 ± 0.079
AlaPro: 2.142 ± 0.057
AlaGln: 2.673 ± 0.057
AlaArg: 2.076 ± 0.051
AlaSer: 4.41 ± 0.085
AlaThr: 4.45 ± 0.109
AlaVal: 4.507 ± 0.075
AlaTrp: 0.629 ± 0.029
AlaTyr: 2.494 ± 0.055
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.49 ± 0.023
CysCys: 0.096 ± 0.01
CysAsp: 0.647 ± 0.08
CysGlu: 0.481 ± 0.027
CysPhe: 0.391 ± 0.021
CysGly: 0.656 ± 0.029
CysHis: 0.152 ± 0.014
CysIle: 0.587 ± 0.025
CysLys: 0.437 ± 0.023
CysLeu: 0.661 ± 0.028
CysMet: 0.117 ± 0.009
CysAsn: 0.44 ± 0.024
CysPro: 0.322 ± 0.022
CysGln: 0.226 ± 0.018
CysArg: 0.161 ± 0.012
CysSer: 0.523 ± 0.027
CysThr: 0.445 ± 0.024
CysVal: 0.479 ± 0.025
CysTrp: 0.049 ± 0.008
CysTyr: 0.294 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.216 ± 0.078
AspCys: 0.518 ± 0.053
AspAsp: 3.211 ± 0.074
AspGlu: 3.775 ± 0.084
AspPhe: 3.664 ± 0.074
AspGly: 4.223 ± 0.169
AspHis: 0.935 ± 0.031
AspIle: 4.554 ± 0.077
AspLys: 4.16 ± 0.077
AspLeu: 5.195 ± 0.091
AspMet: 1.082 ± 0.033
AspAsn: 3.262 ± 0.067
AspPro: 1.89 ± 0.086
AspGln: 1.645 ± 0.042
AspArg: 1.9 ± 0.047
AspSer: 3.246 ± 0.064
AspThr: 3.432 ± 0.065
AspVal: 4.036 ± 0.082
AspTrp: 0.722 ± 0.028
AspTyr: 2.835 ± 0.063
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.881 ± 0.09
GluCys: 0.37 ± 0.021
GluAsp: 3.864 ± 0.068
GluGlu: 4.902 ± 0.094
GluPhe: 2.721 ± 0.06
GluGly: 3.972 ± 0.079
GluHis: 1.129 ± 0.034
GluIle: 5.743 ± 0.085
GluLys: 5.698 ± 0.09
GluLeu: 5.835 ± 0.083
GluMet: 1.741 ± 0.045
GluAsn: 4.713 ± 0.08
GluPro: 1.488 ± 0.044
GluGln: 2.345 ± 0.048
GluArg: 2.415 ± 0.057
GluSer: 3.191 ± 0.06
GluThr: 4.146 ± 0.069
GluVal: 4.497 ± 0.084
GluTrp: 0.569 ± 0.032
GluTyr: 2.397 ± 0.062
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.934 ± 0.056
PheCys: 0.458 ± 0.022
PheAsp: 3.299 ± 0.068
PheGlu: 3.339 ± 0.063
PhePhe: 2.723 ± 0.071
PheGly: 3.474 ± 0.067
PheHis: 0.752 ± 0.028
PheIle: 3.905 ± 0.071
PheLys: 3.901 ± 0.075
PheLeu: 4.495 ± 0.098
PheMet: 1.057 ± 0.036
PheAsn: 3.271 ± 0.068
PhePro: 1.7 ± 0.044
PheGln: 1.381 ± 0.041
PheArg: 1.424 ± 0.037
PheSer: 3.921 ± 0.073
PheThr: 3.493 ± 0.075
PheVal: 3.09 ± 0.068
PheTrp: 0.55 ± 0.025
PheTyr: 2.078 ± 0.048
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.674 ± 0.116
GlyCys: 0.583 ± 0.032
GlyAsp: 3.933 ± 0.116
GlyGlu: 3.585 ± 0.065
GlyPhe: 3.538 ± 0.06
GlyGly: 4.759 ± 0.114
GlyHis: 1.084 ± 0.035
GlyIle: 5.113 ± 0.088
GlyLys: 4.421 ± 0.072
GlyLeu: 5.657 ± 0.092
GlyMet: 1.571 ± 0.049
GlyAsn: 3.668 ± 0.069
GlyPro: 1.437 ± 0.066
GlyGln: 1.877 ± 0.058
GlyArg: 2.095 ± 0.057
GlySer: 3.984 ± 0.069
GlyThr: 4.212 ± 0.089
GlyVal: 4.834 ± 0.087
GlyTrp: 0.711 ± 0.028
GlyTyr: 2.642 ± 0.055
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.915 ± 0.034
HisCys: 0.17 ± 0.014
HisAsp: 0.794 ± 0.027
HisGlu: 0.85 ± 0.036
HisPhe: 1.114 ± 0.037
HisGly: 0.962 ± 0.028
HisHis: 0.454 ± 0.024
HisIle: 1.37 ± 0.04
HisLys: 1.276 ± 0.038
HisLeu: 1.742 ± 0.046
HisMet: 0.289 ± 0.017
HisAsn: 0.98 ± 0.031
HisPro: 0.875 ± 0.031
HisGln: 0.638 ± 0.027
HisArg: 0.622 ± 0.024
HisSer: 0.954 ± 0.033
HisThr: 1.054 ± 0.037
HisVal: 0.898 ± 0.037
HisTrp: 0.214 ± 0.017
HisTyr: 0.809 ± 0.029
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.067 ± 0.092
IleCys: 0.616 ± 0.027
IleAsp: 5.188 ± 0.082
IleGlu: 5.495 ± 0.082
IlePhe: 3.514 ± 0.077
IleGly: 4.923 ± 0.083
IleHis: 1.268 ± 0.035
IleIle: 5.917 ± 0.103
IleLys: 5.826 ± 0.084
IleLeu: 6.84 ± 0.116
IleMet: 1.334 ± 0.043
IleAsn: 4.563 ± 0.077
IlePro: 3.099 ± 0.063
IleGln: 2.292 ± 0.051
IleArg: 2.277 ± 0.049
IleSer: 5.805 ± 0.076
IleThr: 5.294 ± 0.087
IleVal: 4.849 ± 0.08
IleTrp: 0.65 ± 0.027
IleTyr: 2.726 ± 0.06
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.885 ± 0.087
LysCys: 0.306 ± 0.02
LysAsp: 4.412 ± 0.081
LysGlu: 6.21 ± 0.088
LysPhe: 2.684 ± 0.06
LysGly: 4.395 ± 0.083
LysHis: 1.383 ± 0.039
LysIle: 5.928 ± 0.105
LysLys: 7.065 ± 0.11
LysLeu: 6.178 ± 0.101
LysMet: 2.217 ± 0.051
LysAsn: 5.127 ± 0.096
LysPro: 2.244 ± 0.058
LysGln: 2.775 ± 0.053
LysArg: 2.799 ± 0.057
LysSer: 4.266 ± 0.076
LysThr: 4.815 ± 0.082
LysVal: 4.534 ± 0.079
LysTrp: 0.698 ± 0.028
LysTyr: 2.882 ± 0.061
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.093 ± 0.092
LeuCys: 0.665 ± 0.033
LeuAsp: 5.607 ± 0.083
LeuGlu: 6.098 ± 0.083
LeuPhe: 4.91 ± 0.097
LeuGly: 5.787 ± 0.086
LeuHis: 1.54 ± 0.042
LeuIle: 6.712 ± 0.106
LeuLys: 7.502 ± 0.113
LeuLeu: 8.858 ± 0.139
LeuMet: 1.957 ± 0.044
LeuAsn: 5.363 ± 0.081
LeuPro: 3.32 ± 0.062
LeuGln: 3.183 ± 0.065
LeuArg: 3.1 ± 0.067
LeuSer: 6.476 ± 0.096
LeuThr: 5.28 ± 0.081
LeuVal: 5.503 ± 0.087
LeuTrp: 0.714 ± 0.029
LeuTyr: 3.062 ± 0.063
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.684 ± 0.047
MetCys: 0.149 ± 0.013
MetAsp: 1.16 ± 0.035
MetGlu: 1.327 ± 0.04
MetPhe: 0.8 ± 0.031
MetGly: 1.388 ± 0.041
MetHis: 0.39 ± 0.025
MetIle: 1.495 ± 0.048
MetLys: 2.083 ± 0.047
MetLeu: 2.042 ± 0.051
MetMet: 0.586 ± 0.027
MetAsn: 1.216 ± 0.032
MetPro: 0.845 ± 0.031
MetGln: 0.814 ± 0.03
MetArg: 0.812 ± 0.029
MetSer: 1.428 ± 0.032
MetThr: 1.215 ± 0.033
MetVal: 1.337 ± 0.043
MetTrp: 0.158 ± 0.013
MetTyr: 0.755 ± 0.029
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.898 ± 0.066
AsnCys: 0.477 ± 0.035
AsnAsp: 3.441 ± 0.071
AsnGlu: 3.912 ± 0.058
AsnPhe: 2.886 ± 0.056
AsnGly: 3.963 ± 0.094
AsnHis: 0.98 ± 0.033
AsnIle: 4.751 ± 0.079
AsnLys: 4.405 ± 0.075
AsnLeu: 5.105 ± 0.085
AsnMet: 1.123 ± 0.036
AsnAsn: 3.79 ± 0.087
AsnPro: 2.599 ± 0.051
AsnGln: 1.992 ± 0.051
AsnArg: 1.94 ± 0.047
AsnSer: 3.594 ± 0.076
AsnThr: 4.011 ± 0.085
AsnVal: 3.61 ± 0.07
AsnTrp: 0.677 ± 0.028
AsnTyr: 2.658 ± 0.062
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.013 ± 0.05
ProCys: 0.276 ± 0.046
ProAsp: 2.035 ± 0.057
ProGlu: 2.703 ± 0.056
ProPhe: 1.874 ± 0.051
ProGly: 1.885 ± 0.056
ProHis: 0.546 ± 0.027
ProIle: 2.585 ± 0.053
ProLys: 2.418 ± 0.059
ProLeu: 2.999 ± 0.061
ProMet: 0.644 ± 0.028
ProAsn: 2.122 ± 0.051
ProPro: 0.82 ± 0.039
ProGln: 1.122 ± 0.05
ProArg: 0.861 ± 0.03
ProSer: 2.239 ± 0.046
ProThr: 2.094 ± 0.048
ProVal: 2.375 ± 0.054
ProTrp: 0.3 ± 0.018
ProTyr: 1.341 ± 0.036
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.123 ± 0.048
GlnCys: 0.186 ± 0.013
GlnAsp: 1.9 ± 0.061
GlnGlu: 2.439 ± 0.058
GlnPhe: 1.636 ± 0.045
GlnGly: 1.921 ± 0.05
GlnHis: 0.601 ± 0.025
GlnIle: 2.498 ± 0.043
GlnLys: 2.64 ± 0.057
GlnLeu: 3.458 ± 0.06
GlnMet: 0.804 ± 0.028
GlnAsn: 2.004 ± 0.049
GlnPro: 0.994 ± 0.03
GlnGln: 1.363 ± 0.049
GlnArg: 1.207 ± 0.036
GlnSer: 1.85 ± 0.041
GlnThr: 1.944 ± 0.05
GlnVal: 2.018 ± 0.047
GlnTrp: 0.351 ± 0.018
GlnTyr: 1.295 ± 0.037
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.127 ± 0.048
ArgCys: 0.184 ± 0.014
ArgAsp: 1.82 ± 0.049
ArgGlu: 2.162 ± 0.055
ArgPhe: 1.822 ± 0.043
ArgGly: 1.917 ± 0.055
ArgHis: 0.557 ± 0.024
ArgIle: 2.576 ± 0.058
ArgLys: 2.686 ± 0.059
ArgLeu: 3.207 ± 0.058
ArgMet: 0.835 ± 0.026
ArgAsn: 1.942 ± 0.044
ArgPro: 1.045 ± 0.039
ArgGln: 1.054 ± 0.033
ArgArg: 1.246 ± 0.038
ArgSer: 1.837 ± 0.05
ArgThr: 1.745 ± 0.043
ArgVal: 2.128 ± 0.05
ArgTrp: 0.357 ± 0.02
ArgTyr: 1.44 ± 0.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.823 ± 0.061
SerCys: 0.649 ± 0.031
SerAsp: 3.377 ± 0.065
SerGlu: 4.537 ± 0.07
SerPhe: 3.893 ± 0.071
SerGly: 4.441 ± 0.07
SerHis: 1.03 ± 0.027
SerIle: 5.221 ± 0.085
SerLys: 4.593 ± 0.076
SerLeu: 6.185 ± 0.092
SerMet: 1.224 ± 0.039
SerAsn: 3.614 ± 0.072
SerPro: 2.051 ± 0.051
SerGln: 2.181 ± 0.048
SerArg: 2.022 ± 0.042
SerSer: 4.074 ± 0.075
SerThr: 3.561 ± 0.07
SerVal: 3.922 ± 0.063
SerTrp: 0.628 ± 0.027
SerTyr: 2.693 ± 0.059
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.38 ± 0.092
ThrCys: 0.41 ± 0.026
ThrAsp: 3.605 ± 0.1
ThrGlu: 3.985 ± 0.062
ThrPhe: 3.175 ± 0.064
ThrGly: 4.347 ± 0.123
ThrHis: 1.039 ± 0.037
ThrIle: 5.192 ± 0.089
ThrLys: 3.966 ± 0.08
ThrLeu: 5.996 ± 0.089
ThrMet: 1.079 ± 0.036
ThrAsn: 3.404 ± 0.076
ThrPro: 2.631 ± 0.055
ThrGln: 2.0 ± 0.05
ThrArg: 1.836 ± 0.05
ThrSer: 4.094 ± 0.075
ThrThr: 4.083 ± 0.097
ThrVal: 4.274 ± 0.093
ThrTrp: 0.568 ± 0.027
ThrTyr: 2.483 ± 0.058
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.721 ± 0.074
ValCys: 0.536 ± 0.024
ValAsp: 3.729 ± 0.082
ValGlu: 3.838 ± 0.069
ValPhe: 3.457 ± 0.069
ValGly: 3.857 ± 0.07
ValHis: 0.986 ± 0.036
ValIle: 5.182 ± 0.078
ValLys: 4.366 ± 0.079
ValLeu: 6.016 ± 0.092
ValMet: 1.325 ± 0.042
ValAsn: 3.57 ± 0.072
ValPro: 2.133 ± 0.052
ValGln: 1.883 ± 0.044
ValArg: 2.001 ± 0.051
ValSer: 4.679 ± 0.075
ValThr: 4.244 ± 0.106
ValVal: 4.612 ± 0.081
ValTrp: 0.608 ± 0.027
ValTyr: 2.527 ± 0.048
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.563 ± 0.028
TrpCys: 0.097 ± 0.01
TrpAsp: 0.622 ± 0.024
TrpGlu: 0.638 ± 0.023
TrpPhe: 0.571 ± 0.027
TrpGly: 0.557 ± 0.028
TrpHis: 0.203 ± 0.015
TrpIle: 0.709 ± 0.032
TrpLys: 0.681 ± 0.028
TrpLeu: 0.995 ± 0.036
TrpMet: 0.276 ± 0.02
TrpAsn: 0.607 ± 0.027
TrpPro: 0.229 ± 0.016
TrpGln: 0.365 ± 0.02
TrpArg: 0.379 ± 0.019
TrpSer: 0.582 ± 0.027
TrpThr: 0.509 ± 0.022
TrpVal: 0.579 ± 0.023
TrpTrp: 0.145 ± 0.014
TrpTyr: 0.425 ± 0.024
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.436 ± 0.054
TyrCys: 0.33 ± 0.018
TyrAsp: 2.381 ± 0.049
TyrGlu: 2.312 ± 0.057
TyrPhe: 2.406 ± 0.058
TyrGly: 2.57 ± 0.054
TyrHis: 0.795 ± 0.03
TyrIle: 2.639 ± 0.056
TyrLys: 2.99 ± 0.062
TyrLeu: 3.726 ± 0.078
TyrMet: 0.684 ± 0.026
TyrAsn: 2.535 ± 0.057
TyrPro: 1.386 ± 0.044
TyrGln: 1.426 ± 0.04
TyrArg: 1.515 ± 0.041
TyrSer: 2.565 ± 0.06
TyrThr: 2.518 ± 0.056
TyrVal: 2.19 ± 0.049
TyrTrp: 0.432 ± 0.023
TyrTyr: 1.83 ± 0.057
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2926 proteins (950377 amino acids)
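Because the table reports per mille values, the totals above allow a rough back-conversion to absolute counts. The sketch below assumes that a protein of length n contributes n - 1 adjacent pairs and that no pairs were excluded; neither assumption is confirmed by the page, and the helper names are hypothetical.

```python
N_PROTEINS = 2926    # from the statistics line on this page
N_RESIDUES = 950377  # from the statistics line on this page


def total_dipeptides(n_residues, n_proteins):
    """Number of adjacent pairs, assuming n - 1 pairs per protein of length n."""
    return n_residues - n_proteins


def approx_count(permille, n_pairs):
    """Convert a per mille frequency into an approximate absolute count."""
    return permille * 1e-3 * n_pairs


n_pairs = total_dipeptides(N_RESIDUES, N_PROTEINS)
leu_leu = approx_count(8.858, n_pairs)  # LeuLeu value copied from the table
```

Under these assumptions the proteome contains 947451 adjacent pairs, so the LeuLeu frequency of 8.858‰ corresponds to roughly 8,400 occurrences.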

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
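A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the statistic is recomputed for each replicate, and the spread across replicates serves as the error. The function names, the fixed seed, and the use of the sample standard deviation as the error measure are assumptions for illustration; the page does not state which spread measure was used.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over all adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the pair frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not taken from the SCB49 proteome.
toy = ["MKLLA", "MLLKK", "AKLLM", "MKKAL", "LLLLK"]
err = bootstrap_error(toy, "LL")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the adjacency structure within a protein is preserved in every replicate.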

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski