Amino acid dipepetide frequency for Eubacterium nodatum ATCC 33099

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.245AlaAla: 5.245 ± 0.154
0.887AlaCys: 0.887 ± 0.05
3.716AlaAsp: 3.716 ± 0.091
5.087AlaGlu: 5.087 ± 0.106
2.891AlaPhe: 2.891 ± 0.084
5.421AlaGly: 5.421 ± 0.125
1.049AlaHis: 1.049 ± 0.048
5.135AlaIle: 5.135 ± 0.121
5.744AlaLys: 5.744 ± 0.137
5.816AlaLeu: 5.816 ± 0.125
2.194AlaMet: 2.194 ± 0.067
2.604AlaAsn: 2.604 ± 0.069
1.727AlaPro: 1.727 ± 0.057
1.572AlaGln: 1.572 ± 0.062
2.58AlaArg: 2.58 ± 0.066
3.474AlaSer: 3.474 ± 0.081
3.042AlaThr: 3.042 ± 0.095
5.253AlaVal: 5.253 ± 0.111
0.434AlaTrp: 0.434 ± 0.03
2.274AlaTyr: 2.274 ± 0.078
0.0AlaXaa: 0.0 ± 0.0
Cys
0.776CysAla: 0.776 ± 0.04
0.212CysCys: 0.212 ± 0.02
0.755CysAsp: 0.755 ± 0.041
0.803CysGlu: 0.803 ± 0.04
0.493CysPhe: 0.493 ± 0.034
1.374CysGly: 1.374 ± 0.056
0.279CysHis: 0.279 ± 0.025
1.023CysIle: 1.023 ± 0.051
0.912CysLys: 0.912 ± 0.051
0.848CysLeu: 0.848 ± 0.044
0.327CysMet: 0.327 ± 0.025
0.561CysAsn: 0.561 ± 0.035
0.495CysPro: 0.495 ± 0.037
0.27CysGln: 0.27 ± 0.024
0.678CysArg: 0.678 ± 0.037
0.778CysSer: 0.778 ± 0.041
0.608CysThr: 0.608 ± 0.034
0.778CysVal: 0.778 ± 0.046
0.074CysTrp: 0.074 ± 0.013
0.462CysTyr: 0.462 ± 0.031
0.0CysXaa: 0.0 ± 0.0
Asp
3.415AspAla: 3.415 ± 0.098
0.706AspCys: 0.706 ± 0.039
3.02AspAsp: 3.02 ± 0.101
4.991AspGlu: 4.991 ± 0.123
3.049AspPhe: 3.049 ± 0.088
3.984AspGly: 3.984 ± 0.111
0.713AspHis: 0.713 ± 0.04
5.131AspIle: 5.131 ± 0.096
5.081AspLys: 5.081 ± 0.11
4.643AspLeu: 4.643 ± 0.113
1.882AspMet: 1.882 ± 0.06
2.743AspAsn: 2.743 ± 0.071
1.605AspPro: 1.605 ± 0.052
0.802AspGln: 0.802 ± 0.038
2.53AspArg: 2.53 ± 0.076
3.28AspSer: 3.28 ± 0.083
2.697AspThr: 2.697 ± 0.086
3.611AspVal: 3.611 ± 0.089
0.451AspTrp: 0.451 ± 0.031
2.556AspTyr: 2.556 ± 0.073
0.0AspXaa: 0.0 ± 0.0
Glu
4.874GluAla: 4.874 ± 0.117
0.641GluCys: 0.641 ± 0.04
4.294GluAsp: 4.294 ± 0.111
6.607GluGlu: 6.607 ± 0.153
2.789GluPhe: 2.789 ± 0.075
4.274GluGly: 4.274 ± 0.094
1.134GluHis: 1.134 ± 0.046
7.015GluIle: 7.015 ± 0.153
8.263GluLys: 8.263 ± 0.143
6.655GluLeu: 6.655 ± 0.131
2.257GluMet: 2.257 ± 0.062
4.878GluAsn: 4.878 ± 0.099
1.672GluPro: 1.672 ± 0.06
1.869GluGln: 1.869 ± 0.066
3.275GluArg: 3.275 ± 0.104
3.568GluSer: 3.568 ± 0.085
3.447GluThr: 3.447 ± 0.08
4.557GluVal: 4.557 ± 0.116
0.469GluTrp: 0.469 ± 0.031
3.109GluTyr: 3.109 ± 0.079
0.0GluXaa: 0.0 ± 0.0
Phe
3.055PheAla: 3.055 ± 0.089
0.606PheCys: 0.606 ± 0.036
2.693PheAsp: 2.693 ± 0.07
2.861PheGlu: 2.861 ± 0.076
1.963PhePhe: 1.963 ± 0.082
3.083PheGly: 3.083 ± 0.088
0.696PheHis: 0.696 ± 0.037
3.411PheIle: 3.411 ± 0.105
3.022PheLys: 3.022 ± 0.087
3.533PheLeu: 3.533 ± 0.117
1.225PheMet: 1.225 ± 0.05
2.048PheAsn: 2.048 ± 0.067
1.378PhePro: 1.378 ± 0.052
1.066PheGln: 1.066 ± 0.038
1.683PheArg: 1.683 ± 0.06
3.24PheSer: 3.24 ± 0.083
2.255PheThr: 2.255 ± 0.066
2.737PheVal: 2.737 ± 0.08
0.377PheTrp: 0.377 ± 0.027
1.657PheTyr: 1.657 ± 0.069
0.0PheXaa: 0.0 ± 0.0
Gly
4.605GlyAla: 4.605 ± 0.118
0.997GlyCys: 0.997 ± 0.059
3.609GlyAsp: 3.609 ± 0.081
5.0GlyGlu: 5.0 ± 0.087
3.177GlyPhe: 3.177 ± 0.088
5.077GlyGly: 5.077 ± 0.159
1.228GlyHis: 1.228 ± 0.058
6.969GlyIle: 6.969 ± 0.121
7.246GlyLys: 7.246 ± 0.156
5.504GlyLeu: 5.504 ± 0.117
2.224GlyMet: 2.224 ± 0.076
3.679GlyAsn: 3.679 ± 0.12
1.225GlyPro: 1.225 ± 0.055
1.707GlyGln: 1.707 ± 0.058
3.133GlyArg: 3.133 ± 0.086
4.049GlySer: 4.049 ± 0.099
4.021GlyThr: 4.021 ± 0.086
4.636GlyVal: 4.636 ± 0.105
0.578GlyTrp: 0.578 ± 0.03
3.169GlyTyr: 3.169 ± 0.072
0.0GlyXaa: 0.0 ± 0.0
His
0.866HisAla: 0.866 ± 0.041
0.229HisCys: 0.229 ± 0.02
0.857HisAsp: 0.857 ± 0.042
0.918HisGlu: 0.918 ± 0.041
0.665HisPhe: 0.665 ± 0.041
1.328HisGly: 1.328 ± 0.054
0.377HisHis: 0.377 ± 0.037
1.372HisIle: 1.372 ± 0.051
1.151HisLys: 1.151 ± 0.042
1.324HisLeu: 1.324 ± 0.056
0.478HisMet: 0.478 ± 0.031
0.805HisAsn: 0.805 ± 0.04
0.696HisPro: 0.696 ± 0.037
0.434HisGln: 0.434 ± 0.027
0.8HisArg: 0.8 ± 0.039
0.935HisSer: 0.935 ± 0.037
0.794HisThr: 0.794 ± 0.04
0.951HisVal: 0.951 ± 0.041
0.129HisTrp: 0.129 ± 0.016
0.643HisTyr: 0.643 ± 0.034
0.0HisXaa: 0.0 ± 0.0
Ile
5.903IleAla: 5.903 ± 0.116
1.311IleCys: 1.311 ± 0.05
4.88IleAsp: 4.88 ± 0.093
5.801IleGlu: 5.801 ± 0.111
3.615IlePhe: 3.615 ± 0.105
5.68IleGly: 5.68 ± 0.13
1.249IleHis: 1.249 ± 0.047
6.692IleIle: 6.692 ± 0.134
6.869IleLys: 6.869 ± 0.138
7.479IleLeu: 7.479 ± 0.163
2.27IleMet: 2.27 ± 0.078
3.605IleAsn: 3.605 ± 0.082
3.193IlePro: 3.193 ± 0.073
1.984IleGln: 1.984 ± 0.066
3.463IleArg: 3.463 ± 0.088
6.106IleSer: 6.106 ± 0.126
4.608IleThr: 4.608 ± 0.097
5.26IleVal: 5.26 ± 0.111
0.53IleTrp: 0.53 ± 0.036
2.929IleTyr: 2.929 ± 0.092
0.0IleXaa: 0.0 ± 0.0
Lys
5.7LysAla: 5.7 ± 0.12
0.733LysCys: 0.733 ± 0.04
5.273LysAsp: 5.273 ± 0.107
7.807LysGlu: 7.807 ± 0.151
2.855LysPhe: 2.855 ± 0.072
5.824LysGly: 5.824 ± 0.117
1.389LysHis: 1.389 ± 0.05
7.137LysIle: 7.137 ± 0.115
9.353LysLys: 9.353 ± 0.185
7.013LysLeu: 7.013 ± 0.143
2.588LysMet: 2.588 ± 0.073
5.388LysAsn: 5.388 ± 0.103
2.477LysPro: 2.477 ± 0.084
2.244LysGln: 2.244 ± 0.073
3.696LysArg: 3.696 ± 0.094
5.101LysSer: 5.101 ± 0.097
4.834LysThr: 4.834 ± 0.118
5.914LysVal: 5.914 ± 0.142
0.682LysTrp: 0.682 ± 0.041
3.851LysTyr: 3.851 ± 0.091
0.0LysXaa: 0.0 ± 0.0
Leu
5.639LeuAla: 5.639 ± 0.112
1.073LeuCys: 1.073 ± 0.047
4.795LeuAsp: 4.795 ± 0.092
6.156LeuGlu: 6.156 ± 0.133
3.563LeuPhe: 3.563 ± 0.112
5.668LeuGly: 5.668 ± 0.116
1.258LeuHis: 1.258 ± 0.045
6.538LeuIle: 6.538 ± 0.14
7.514LeuLys: 7.514 ± 0.141
7.543LeuLeu: 7.543 ± 0.173
2.656LeuMet: 2.656 ± 0.077
4.119LeuAsn: 4.119 ± 0.092
3.107LeuPro: 3.107 ± 0.077
2.061LeuGln: 2.061 ± 0.07
3.507LeuArg: 3.507 ± 0.091
6.525LeuSer: 6.525 ± 0.126
4.202LeuThr: 4.202 ± 0.089
5.059LeuVal: 5.059 ± 0.103
0.628LeuTrp: 0.628 ± 0.042
2.983LeuTyr: 2.983 ± 0.08
0.0LeuXaa: 0.0 ± 0.0
Met
2.416MetAla: 2.416 ± 0.076
0.312MetCys: 0.312 ± 0.022
1.738MetAsp: 1.738 ± 0.068
2.333MetGlu: 2.333 ± 0.079
1.082MetPhe: 1.082 ± 0.052
2.191MetGly: 2.191 ± 0.076
0.373MetHis: 0.373 ± 0.026
2.248MetIle: 2.248 ± 0.066
2.798MetLys: 2.798 ± 0.075
2.434MetLeu: 2.434 ± 0.071
0.901MetMet: 0.901 ± 0.052
1.479MetAsn: 1.479 ± 0.052
1.051MetPro: 1.051 ± 0.052
0.794MetGln: 0.794 ± 0.039
1.26MetArg: 1.26 ± 0.052
1.882MetSer: 1.882 ± 0.061
1.572MetThr: 1.572 ± 0.055
1.827MetVal: 1.827 ± 0.069
0.199MetTrp: 0.199 ± 0.018
0.911MetTyr: 0.911 ± 0.043
0.0MetXaa: 0.0 ± 0.0
Asn
3.024AsnAla: 3.024 ± 0.082
0.646AsnCys: 0.646 ± 0.039
2.484AsnAsp: 2.484 ± 0.084
3.11AsnGlu: 3.11 ± 0.084
2.172AsnPhe: 2.172 ± 0.074
3.598AsnGly: 3.598 ± 0.096
0.881AsnHis: 0.881 ± 0.039
4.603AsnIle: 4.603 ± 0.104
4.174AsnLys: 4.174 ± 0.099
4.501AsnLeu: 4.501 ± 0.1
1.468AsnMet: 1.468 ± 0.058
2.412AsnAsn: 2.412 ± 0.084
2.128AsnPro: 2.128 ± 0.059
1.313AsnGln: 1.313 ± 0.05
2.455AsnArg: 2.455 ± 0.073
2.977AsnSer: 2.977 ± 0.081
2.449AsnThr: 2.449 ± 0.069
3.011AsnVal: 3.011 ± 0.078
0.419AsnTrp: 0.419 ± 0.032
1.913AsnTyr: 1.913 ± 0.07
0.0AsnXaa: 0.0 ± 0.0
Pro
1.912ProAla: 1.912 ± 0.067
0.375ProCys: 0.375 ± 0.026
1.954ProAsp: 1.954 ± 0.06
2.922ProGlu: 2.922 ± 0.09
1.4ProPhe: 1.4 ± 0.051
2.357ProGly: 2.357 ± 0.074
0.506ProHis: 0.506 ± 0.028
2.338ProIle: 2.338 ± 0.068
2.558ProLys: 2.558 ± 0.088
2.407ProLeu: 2.407 ± 0.071
0.811ProMet: 0.811 ± 0.042
1.369ProAsn: 1.369 ± 0.056
0.611ProPro: 0.611 ± 0.038
0.899ProGln: 0.899 ± 0.04
0.892ProArg: 0.892 ± 0.039
1.747ProSer: 1.747 ± 0.064
1.461ProThr: 1.461 ± 0.047
2.497ProVal: 2.497 ± 0.065
0.266ProTrp: 0.266 ± 0.021
1.226ProTyr: 1.226 ± 0.054
0.0ProXaa: 0.0 ± 0.0
Gln
1.54GlnAla: 1.54 ± 0.058
0.26GlnCys: 0.26 ± 0.019
1.254GlnAsp: 1.254 ± 0.051
1.703GlnGlu: 1.703 ± 0.056
1.027GlnPhe: 1.027 ± 0.042
1.817GlnGly: 1.817 ± 0.06
0.351GlnHis: 0.351 ± 0.026
2.045GlnIle: 2.045 ± 0.066
2.3GlnLys: 2.3 ± 0.073
2.063GlnLeu: 2.063 ± 0.061
0.803GlnMet: 0.803 ± 0.043
1.444GlnAsn: 1.444 ± 0.056
0.549GlnPro: 0.549 ± 0.037
0.718GlnGln: 0.718 ± 0.039
1.239GlnArg: 1.239 ± 0.051
1.43GlnSer: 1.43 ± 0.059
1.121GlnThr: 1.121 ± 0.045
1.559GlnVal: 1.559 ± 0.053
0.223GlnTrp: 0.223 ± 0.019
0.896GlnTyr: 0.896 ± 0.048
0.0GlnXaa: 0.0 ± 0.0
Arg
2.525ArgAla: 2.525 ± 0.066
0.504ArgCys: 0.504 ± 0.029
2.42ArgAsp: 2.42 ± 0.074
3.637ArgGlu: 3.637 ± 0.103
1.749ArgPhe: 1.749 ± 0.057
2.715ArgGly: 2.715 ± 0.075
0.685ArgHis: 0.685 ± 0.036
3.589ArgIle: 3.589 ± 0.091
4.174ArgLys: 4.174 ± 0.109
3.781ArgLeu: 3.781 ± 0.095
1.295ArgMet: 1.295 ± 0.048
2.434ArgAsn: 2.434 ± 0.078
1.14ArgPro: 1.14 ± 0.051
1.282ArgGln: 1.282 ± 0.047
2.076ArgArg: 2.076 ± 0.063
2.017ArgSer: 2.017 ± 0.068
2.041ArgThr: 2.041 ± 0.061
2.606ArgVal: 2.606 ± 0.08
0.299ArgTrp: 0.299 ± 0.025
1.712ArgTyr: 1.712 ± 0.057
0.0ArgXaa: 0.0 ± 0.0
Ser
3.884SerAla: 3.884 ± 0.077
0.755SerCys: 0.755 ± 0.039
3.526SerAsp: 3.526 ± 0.077
4.477SerGlu: 4.477 ± 0.113
2.813SerPhe: 2.813 ± 0.077
5.238SerGly: 5.238 ± 0.106
1.053SerHis: 1.053 ± 0.043
4.845SerIle: 4.845 ± 0.108
5.164SerLys: 5.164 ± 0.102
5.166SerLeu: 5.166 ± 0.087
1.792SerMet: 1.792 ± 0.064
2.682SerAsn: 2.682 ± 0.083
1.801SerPro: 1.801 ± 0.061
1.579SerGln: 1.579 ± 0.046
2.769SerArg: 2.769 ± 0.076
3.583SerSer: 3.583 ± 0.094
2.745SerThr: 2.745 ± 0.079
4.252SerVal: 4.252 ± 0.089
0.488SerTrp: 0.488 ± 0.032
2.348SerTyr: 2.348 ± 0.071
0.0SerXaa: 0.0 ± 0.0
Thr
3.631ThrAla: 3.631 ± 0.094
0.617ThrCys: 0.617 ± 0.035
2.9ThrAsp: 2.9 ± 0.082
3.561ThrGlu: 3.561 ± 0.078
2.203ThrPhe: 2.203 ± 0.072
4.414ThrGly: 4.414 ± 0.094
0.851ThrHis: 0.851 ± 0.039
3.805ThrIle: 3.805 ± 0.095
3.616ThrLys: 3.616 ± 0.092
4.473ThrLeu: 4.473 ± 0.099
1.383ThrMet: 1.383 ± 0.046
2.017ThrAsn: 2.017 ± 0.076
2.106ThrPro: 2.106 ± 0.072
1.184ThrGln: 1.184 ± 0.048
1.716ThrArg: 1.716 ± 0.056
2.961ThrSer: 2.961 ± 0.068
2.541ThrThr: 2.541 ± 0.07
4.344ThrVal: 4.344 ± 0.113
0.419ThrTrp: 0.419 ± 0.036
1.819ThrTyr: 1.819 ± 0.054
0.0ThrXaa: 0.0 ± 0.0
Val
4.4ValAla: 4.4 ± 0.111
1.029ValCys: 1.029 ± 0.042
3.779ValAsp: 3.779 ± 0.091
4.61ValGlu: 4.61 ± 0.11
2.868ValPhe: 2.868 ± 0.079
4.165ValGly: 4.165 ± 0.11
0.931ValHis: 0.931 ± 0.044
5.82ValIle: 5.82 ± 0.113
5.859ValLys: 5.859 ± 0.146
5.537ValLeu: 5.537 ± 0.112
1.923ValMet: 1.923 ± 0.079
3.103ValAsn: 3.103 ± 0.082
2.233ValPro: 2.233 ± 0.072
1.422ValGln: 1.422 ± 0.05
2.735ValArg: 2.735 ± 0.076
4.538ValSer: 4.538 ± 0.102
3.635ValThr: 3.635 ± 0.098
4.3ValVal: 4.3 ± 0.107
0.465ValTrp: 0.465 ± 0.029
2.449ValTyr: 2.449 ± 0.061
0.0ValXaa: 0.0 ± 0.0
Trp
0.453TrpAla: 0.453 ± 0.029
0.089TrpCys: 0.089 ± 0.013
0.417TrpAsp: 0.417 ± 0.029
0.528TrpGlu: 0.528 ± 0.034
0.344TrpPhe: 0.344 ± 0.028
0.513TrpGly: 0.513 ± 0.036
0.129TrpHis: 0.129 ± 0.016
0.632TrpIle: 0.632 ± 0.041
0.755TrpLys: 0.755 ± 0.038
0.659TrpLeu: 0.659 ± 0.037
0.242TrpMet: 0.242 ± 0.02
0.469TrpAsn: 0.469 ± 0.03
0.146TrpPro: 0.146 ± 0.018
0.227TrpGln: 0.227 ± 0.022
0.323TrpArg: 0.323 ± 0.027
0.397TrpSer: 0.397 ± 0.024
0.403TrpThr: 0.403 ± 0.035
0.399TrpVal: 0.399 ± 0.029
0.072TrpTrp: 0.072 ± 0.011
0.336TrpTyr: 0.336 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.283TyrAla: 2.283 ± 0.076
0.523TyrCys: 0.523 ± 0.034
2.525TyrAsp: 2.525 ± 0.078
2.734TyrGlu: 2.734 ± 0.077
1.869TyrPhe: 1.869 ± 0.064
3.11TyrGly: 3.11 ± 0.076
0.621TyrHis: 0.621 ± 0.04
3.083TyrIle: 3.083 ± 0.087
3.304TyrLys: 3.304 ± 0.098
3.214TyrLeu: 3.214 ± 0.091
1.08TyrMet: 1.08 ± 0.049
1.899TyrAsn: 1.899 ± 0.056
1.171TyrPro: 1.171 ± 0.052
0.916TyrGln: 0.916 ± 0.043
1.871TyrArg: 1.871 ± 0.058
2.37TyrSer: 2.37 ± 0.063
2.089TyrThr: 2.089 ± 0.089
2.251TyrVal: 2.251 ± 0.058
0.331TyrTrp: 0.331 ± 0.025
1.587TyrTyr: 1.587 ± 0.066
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1690 proteins (541419 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski