Amino acid dipeptide frequency for Bifidobacteriaceae bacterium NR021

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
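The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names are hypothetical, sequences are assumed to be one-letter strings, pairs are assumed to be counted as overlapping windows within each protein, and any non-standard letter is assumed to map to X (Xaa). The table itself uses three-letter codes.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids

# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000 / len(STANDARD) ** 2  # 2.5 per mille = 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"   # would be marked red
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"  # would be marked blue
    return "expected"
```

With this baseline, the AlaAla value of 9.668 from the table would be classified as "over" and the CysCys value of 0.162 as "under".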

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
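As a small worked example of the unit conversion, the sketch below converts a per mille value into a fraction and into a percentage. The helper names are hypothetical; the value 9.668 is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_per_mille / 10


ala_ala = 9.668  # AlaAla frequency from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # roughly 0.009668
print(per_mille_to_percent(ala_ala))   # roughly 0.9668 (percent)
```

In the same units, the 0.25% expected for a uniformly random dipeptide corresponds to 2.5‰.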

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.668 ± 0.238
AlaCys: 1.111 ± 0.061
AlaAsp: 5.465 ± 0.133
AlaGlu: 5.445 ± 0.128
AlaPhe: 3.17 ± 0.082
AlaGly: 6.736 ± 0.131
AlaHis: 2.066 ± 0.07
AlaIle: 5.801 ± 0.139
AlaLys: 6.053 ± 0.16
AlaLeu: 9.288 ± 0.165
AlaMet: 2.58 ± 0.085
AlaAsn: 4.26 ± 0.111
AlaPro: 3.043 ± 0.106
AlaGln: 4.348 ± 0.144
AlaArg: 4.741 ± 0.12
AlaSer: 6.282 ± 0.128
AlaThr: 4.831 ± 0.103
AlaVal: 6.953 ± 0.163
AlaTrp: 1.152 ± 0.058
AlaTyr: 2.601 ± 0.083
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.324 ± 0.064
CysCys: 0.162 ± 0.025
CysAsp: 0.764 ± 0.052
CysGlu: 0.791 ± 0.043
CysPhe: 0.375 ± 0.031
CysGly: 1.039 ± 0.055
CysHis: 0.185 ± 0.023
CysIle: 0.611 ± 0.035
CysLys: 0.59 ± 0.038
CysLeu: 0.787 ± 0.046
CysMet: 0.238 ± 0.024
CysAsn: 0.465 ± 0.039
CysPro: 0.447 ± 0.027
CysGln: 0.264 ± 0.026
CysArg: 0.368 ± 0.026
CysSer: 0.692 ± 0.046
CysThr: 0.641 ± 0.043
CysVal: 0.949 ± 0.053
CysTrp: 0.111 ± 0.017
CysTyr: 0.301 ± 0.026
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.18 ± 0.187
AspCys: 0.694 ± 0.043
AspAsp: 4.174 ± 0.151
AspGlu: 4.695 ± 0.118
AspPhe: 2.603 ± 0.083
AspGly: 4.466 ± 0.111
AspHis: 1.013 ± 0.05
AspIle: 3.82 ± 0.098
AspLys: 3.098 ± 0.148
AspLeu: 5.037 ± 0.115
AspMet: 1.486 ± 0.049
AspAsn: 2.585 ± 0.108
AspPro: 2.568 ± 0.091
AspGln: 1.372 ± 0.06
AspArg: 2.353 ± 0.101
AspSer: 4.445 ± 0.115
AspThr: 3.02 ± 0.093
AspVal: 4.427 ± 0.107
AspTrp: 0.78 ± 0.047
AspTyr: 1.944 ± 0.072
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.549 ± 0.137
GluCys: 0.544 ± 0.036
GluAsp: 3.802 ± 0.101
GluGlu: 4.135 ± 0.124
GluPhe: 2.013 ± 0.08
GluGly: 3.536 ± 0.093
GluHis: 1.685 ± 0.066
GluIle: 3.57 ± 0.096
GluLys: 3.693 ± 0.12
GluLeu: 5.387 ± 0.137
GluMet: 1.356 ± 0.055
GluAsn: 3.517 ± 0.098
GluPro: 2.147 ± 0.082
GluGln: 2.476 ± 0.073
GluArg: 3.536 ± 0.102
GluSer: 3.987 ± 0.112
GluThr: 3.219 ± 0.109
GluVal: 4.061 ± 0.09
GluTrp: 0.544 ± 0.033
GluTyr: 1.969 ± 0.078
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.128 ± 0.106
PheCys: 0.414 ± 0.033
PheAsp: 2.522 ± 0.081
PheGlu: 1.997 ± 0.067
PhePhe: 1.219 ± 0.063
PheGly: 3.061 ± 0.106
PheHis: 0.666 ± 0.04
PheIle: 2.175 ± 0.098
PheLys: 1.717 ± 0.069
PheLeu: 2.712 ± 0.101
PheMet: 0.882 ± 0.05
PheAsn: 1.668 ± 0.07
PhePro: 1.215 ± 0.057
PheGln: 0.884 ± 0.041
PheArg: 1.458 ± 0.061
PheSer: 2.677 ± 0.085
PheThr: 2.277 ± 0.074
PheVal: 2.948 ± 0.107
PheTrp: 0.363 ± 0.03
PheTyr: 0.944 ± 0.049
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.748 ± 0.158
GlyCys: 0.736 ± 0.046
GlyAsp: 3.799 ± 0.109
GlyGlu: 4.107 ± 0.121
GlyPhe: 3.147 ± 0.095
GlyGly: 4.561 ± 0.16
GlyHis: 1.319 ± 0.059
GlyIle: 4.679 ± 0.119
GlyLys: 4.628 ± 0.118
GlyLeu: 5.905 ± 0.134
GlyMet: 1.974 ± 0.075
GlyAsn: 2.925 ± 0.1
GlyPro: 1.745 ± 0.079
GlyGln: 1.696 ± 0.06
GlyArg: 3.237 ± 0.104
GlySer: 4.917 ± 0.117
GlyThr: 3.843 ± 0.107
GlyVal: 5.493 ± 0.133
GlyTrp: 0.863 ± 0.044
GlyTyr: 2.367 ± 0.075
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.981 ± 0.083
HisCys: 0.243 ± 0.028
HisAsp: 1.467 ± 0.062
HisGlu: 1.238 ± 0.059
HisPhe: 0.609 ± 0.039
HisGly: 1.634 ± 0.069
HisHis: 0.53 ± 0.041
HisIle: 1.363 ± 0.059
HisLys: 1.032 ± 0.048
HisLeu: 1.536 ± 0.065
HisMet: 0.574 ± 0.04
HisAsn: 1.055 ± 0.053
HisPro: 1.081 ± 0.053
HisGln: 0.553 ± 0.034
HisArg: 1.12 ± 0.053
HisSer: 1.256 ± 0.054
HisThr: 1.277 ± 0.057
HisVal: 1.775 ± 0.074
HisTrp: 0.231 ± 0.025
HisTyr: 0.669 ± 0.039
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.828 ± 0.19
IleCys: 0.861 ± 0.056
IleAsp: 4.1 ± 0.109
IleGlu: 3.543 ± 0.122
IlePhe: 2.207 ± 0.088
IleGly: 4.35 ± 0.133
IleHis: 1.194 ± 0.062
IleIle: 3.89 ± 0.145
IleLys: 2.855 ± 0.099
IleLeu: 4.709 ± 0.131
IleMet: 1.509 ± 0.069
IleAsn: 2.761 ± 0.081
IlePro: 2.821 ± 0.082
IleGln: 1.587 ± 0.066
IleArg: 2.959 ± 0.092
IleSer: 4.741 ± 0.129
IleThr: 3.573 ± 0.095
IleVal: 5.273 ± 0.122
IleTrp: 0.565 ± 0.034
IleTyr: 1.377 ± 0.061
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.327 ± 0.17
LysCys: 0.386 ± 0.035
LysAsp: 3.739 ± 0.143
LysGlu: 3.383 ± 0.12
LysPhe: 1.604 ± 0.065
LysGly: 3.263 ± 0.105
LysHis: 1.395 ± 0.065
LysIle: 3.191 ± 0.1
LysLys: 3.797 ± 0.151
LysLeu: 5.021 ± 0.099
LysMet: 1.34 ± 0.059
LysAsn: 3.371 ± 0.122
LysPro: 2.835 ± 0.106
LysGln: 2.402 ± 0.089
LysArg: 3.084 ± 0.089
LysSer: 4.193 ± 0.117
LysThr: 3.406 ± 0.112
LysVal: 3.806 ± 0.116
LysTrp: 0.629 ± 0.035
LysTyr: 1.821 ± 0.074
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.497 ± 0.176
LeuCys: 1.113 ± 0.05
LeuAsp: 5.225 ± 0.117
LeuGlu: 4.813 ± 0.118
LeuPhe: 3.172 ± 0.107
LeuGly: 5.947 ± 0.151
LeuHis: 2.055 ± 0.083
LeuIle: 5.28 ± 0.146
LeuLys: 4.679 ± 0.144
LeuLeu: 7.821 ± 0.171
LeuMet: 2.115 ± 0.085
LeuAsn: 3.804 ± 0.095
LeuPro: 4.144 ± 0.105
LeuGln: 3.341 ± 0.082
LeuArg: 5.199 ± 0.111
LeuSer: 6.266 ± 0.136
LeuThr: 4.908 ± 0.124
LeuVal: 5.961 ± 0.128
LeuTrp: 0.96 ± 0.051
LeuTyr: 2.106 ± 0.072
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.221 ± 0.084
MetCys: 0.31 ± 0.031
MetAsp: 1.18 ± 0.06
MetGlu: 1.069 ± 0.052
MetPhe: 0.898 ± 0.043
MetGly: 1.37 ± 0.066
MetHis: 0.562 ± 0.035
MetIle: 1.481 ± 0.063
MetLys: 1.344 ± 0.055
MetLeu: 2.559 ± 0.082
MetMet: 0.678 ± 0.044
MetAsn: 1.199 ± 0.053
MetPro: 1.337 ± 0.059
MetGln: 1.078 ± 0.058
MetArg: 1.703 ± 0.065
MetSer: 1.897 ± 0.071
MetThr: 1.479 ± 0.058
MetVal: 1.717 ± 0.063
MetTrp: 0.25 ± 0.023
MetTyr: 0.641 ± 0.042
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.491 ± 0.139
AsnCys: 0.493 ± 0.034
AsnAsp: 2.603 ± 0.102
AsnGlu: 2.673 ± 0.091
AsnPhe: 1.328 ± 0.056
AsnGly: 3.288 ± 0.13
AsnHis: 0.891 ± 0.043
AsnIle: 3.121 ± 0.11
AsnLys: 3.022 ± 0.112
AsnLeu: 3.621 ± 0.095
AsnMet: 1.25 ± 0.057
AsnAsn: 3.177 ± 0.147
AsnPro: 2.492 ± 0.088
AsnGln: 1.61 ± 0.068
AsnArg: 2.092 ± 0.068
AsnSer: 3.57 ± 0.117
AsnThr: 2.751 ± 0.092
AsnVal: 3.233 ± 0.089
AsnTrp: 0.53 ± 0.036
AsnTyr: 1.381 ± 0.061
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.346 ± 0.103
ProCys: 0.361 ± 0.034
ProAsp: 2.437 ± 0.074
ProGlu: 3.04 ± 0.098
ProPhe: 1.502 ± 0.06
ProGly: 2.612 ± 0.098
ProHis: 0.828 ± 0.046
ProIle: 2.349 ± 0.067
ProLys: 2.376 ± 0.075
ProLeu: 3.344 ± 0.091
ProMet: 0.902 ± 0.046
ProAsn: 1.828 ± 0.068
ProPro: 0.93 ± 0.046
ProGln: 1.585 ± 0.077
ProArg: 1.689 ± 0.063
ProSer: 2.744 ± 0.087
ProThr: 2.406 ± 0.082
ProVal: 3.293 ± 0.081
ProTrp: 0.581 ± 0.036
ProTyr: 1.303 ± 0.06
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.318 ± 0.108
GlnCys: 0.342 ± 0.031
GlnAsp: 1.837 ± 0.071
GlnGlu: 2.11 ± 0.081
GlnPhe: 1.143 ± 0.058
GlnGly: 2.101 ± 0.069
GlnHis: 0.821 ± 0.045
GlnIle: 2.277 ± 0.079
GlnLys: 1.951 ± 0.072
GlnLeu: 3.135 ± 0.092
GlnMet: 0.826 ± 0.045
GlnAsn: 1.925 ± 0.085
GlnPro: 1.25 ± 0.066
GlnGln: 1.738 ± 0.072
GlnArg: 1.944 ± 0.08
GlnSer: 2.471 ± 0.092
GlnThr: 1.793 ± 0.071
GlnVal: 2.434 ± 0.08
GlnTrp: 0.511 ± 0.037
GlnTyr: 1.199 ± 0.059
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.41 ± 0.119
ArgCys: 0.444 ± 0.029
ArgAsp: 2.897 ± 0.12
ArgGlu: 3.702 ± 0.109
ArgPhe: 1.995 ± 0.056
ArgGly: 2.939 ± 0.1
ArgHis: 1.041 ± 0.053
ArgIle: 3.351 ± 0.093
ArgLys: 3.385 ± 0.091
ArgLeu: 4.436 ± 0.097
ArgMet: 1.506 ± 0.072
ArgAsn: 2.312 ± 0.077
ArgPro: 1.687 ± 0.064
ArgGln: 1.546 ± 0.058
ArgArg: 3.124 ± 0.133
ArgSer: 3.027 ± 0.088
ArgThr: 2.693 ± 0.086
ArgVal: 3.95 ± 0.096
ArgTrp: 0.636 ± 0.041
ArgTyr: 1.698 ± 0.06
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.641 ± 0.145
SerCys: 0.759 ± 0.049
SerAsp: 4.211 ± 0.124
SerGlu: 4.26 ± 0.118
SerPhe: 2.411 ± 0.09
SerGly: 5.095 ± 0.128
SerHis: 1.518 ± 0.066
SerIle: 4.216 ± 0.108
SerLys: 4.413 ± 0.113
SerLeu: 6.243 ± 0.15
SerMet: 1.784 ± 0.066
SerAsn: 3.325 ± 0.104
SerPro: 2.122 ± 0.072
SerGln: 2.772 ± 0.097
SerArg: 3.496 ± 0.108
SerSer: 5.634 ± 0.155
SerThr: 3.769 ± 0.117
SerVal: 5.26 ± 0.105
SerTrp: 0.868 ± 0.041
SerTyr: 2.073 ± 0.082
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.767 ± 0.125
ThrCys: 0.641 ± 0.04
ThrAsp: 3.089 ± 0.112
ThrGlu: 2.631 ± 0.082
ThrPhe: 2.094 ± 0.069
ThrGly: 3.987 ± 0.103
ThrHis: 1.296 ± 0.053
ThrIle: 3.494 ± 0.103
ThrLys: 2.98 ± 0.1
ThrLeu: 5.343 ± 0.115
ThrMet: 1.201 ± 0.055
ThrAsn: 2.457 ± 0.074
ThrPro: 2.691 ± 0.091
ThrGln: 2.233 ± 0.079
ThrArg: 2.619 ± 0.085
ThrSer: 3.836 ± 0.111
ThrThr: 3.383 ± 0.111
ThrVal: 4.63 ± 0.104
ThrTrp: 0.629 ± 0.037
ThrTyr: 1.499 ± 0.054
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.259 ± 0.136
ValCys: 0.9 ± 0.043
ValAsp: 4.862 ± 0.122
ValGlu: 4.646 ± 0.13
ValPhe: 2.737 ± 0.091
ValGly: 4.612 ± 0.125
ValHis: 1.381 ± 0.057
ValIle: 4.799 ± 0.13
ValLys: 4.093 ± 0.113
ValLeu: 7.037 ± 0.15
ValMet: 1.738 ± 0.065
ValAsn: 3.071 ± 0.087
ValPro: 3.422 ± 0.094
ValGln: 2.166 ± 0.079
ValArg: 3.788 ± 0.084
ValSer: 5.581 ± 0.126
ValThr: 4.241 ± 0.1
ValVal: 5.745 ± 0.139
ValTrp: 0.775 ± 0.051
ValTyr: 1.923 ± 0.067
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.914 ± 0.046
TrpCys: 0.185 ± 0.02
TrpAsp: 0.629 ± 0.037
TrpGlu: 0.541 ± 0.035
TrpPhe: 0.539 ± 0.036
TrpGly: 0.72 ± 0.042
TrpHis: 0.289 ± 0.027
TrpIle: 0.757 ± 0.043
TrpLys: 0.664 ± 0.04
TrpLeu: 1.201 ± 0.06
TrpMet: 0.389 ± 0.028
TrpAsn: 0.604 ± 0.046
TrpPro: 0.377 ± 0.032
TrpGln: 0.574 ± 0.041
TrpArg: 0.683 ± 0.05
TrpSer: 0.722 ± 0.039
TrpThr: 0.535 ± 0.035
TrpVal: 0.662 ± 0.041
TrpTrp: 0.213 ± 0.024
TrpTyr: 0.382 ± 0.031
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.999 ± 0.096
TyrCys: 0.403 ± 0.029
TyrAsp: 1.955 ± 0.078
TyrGlu: 1.904 ± 0.079
TyrPhe: 1.162 ± 0.051
TyrGly: 2.374 ± 0.084
TyrHis: 0.486 ± 0.033
TyrIle: 1.479 ± 0.061
TyrLys: 1.622 ± 0.077
TyrLeu: 2.284 ± 0.081
TyrMet: 0.648 ± 0.037
TyrAsn: 1.31 ± 0.066
TyrPro: 1.104 ± 0.053
TyrGln: 0.902 ± 0.044
TyrArg: 1.585 ± 0.068
TyrSer: 1.93 ± 0.067
TyrThr: 1.455 ± 0.061
TyrVal: 2.21 ± 0.083
TyrTrp: 0.386 ± 0.028
TyrTyr: 0.995 ± 0.054
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1369 proteins (432169 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
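A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: the function names are hypothetical, whole proteins are resampled with replacement, and the reported error is assumed to be the standard deviation of the per mille frequency across the 100 resamples (the source does not say which statistic is used).

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of
    `pair` over bootstrap resamples drawn at the protein level."""
    rng = random.Random(seed)
    # For each protein, precompute (occurrences of pair, total dipeptides).
    per_protein = []
    for seq in proteins:
        counts = dipeptide_counts(seq)
        per_protein.append((counts[pair], sum(counts.values())))
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    # Assumption: the "±" value is the standard deviation across resamples.
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within one protein together, so the spread reflects variation between proteins.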

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski