Amino acid dipepetide frequency for Oxalobacteraceae bacterium CAVE-383

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.138AlaAla: 18.138 ± 0.179
1.155AlaCys: 1.155 ± 0.033
6.856AlaAsp: 6.856 ± 0.079
6.265AlaGlu: 6.265 ± 0.083
4.12AlaPhe: 4.12 ± 0.062
10.656AlaGly: 10.656 ± 0.114
2.427AlaHis: 2.427 ± 0.051
6.478AlaIle: 6.478 ± 0.075
4.344AlaLys: 4.344 ± 0.087
13.769AlaLeu: 13.769 ± 0.145
3.584AlaMet: 3.584 ± 0.057
3.412AlaAsn: 3.412 ± 0.069
5.899AlaPro: 5.899 ± 0.093
5.292AlaGln: 5.292 ± 0.081
7.373AlaArg: 7.373 ± 0.086
6.559AlaSer: 6.559 ± 0.085
5.324AlaThr: 5.324 ± 0.08
8.144AlaVal: 8.144 ± 0.07
1.377AlaTrp: 1.377 ± 0.036
2.717AlaTyr: 2.717 ± 0.046
0.0AlaXaa: 0.0 ± 0.0
Cys
1.066CysAla: 1.066 ± 0.029
0.135CysCys: 0.135 ± 0.011
0.523CysAsp: 0.523 ± 0.021
0.462CysGlu: 0.462 ± 0.018
0.322CysPhe: 0.322 ± 0.017
0.971CysGly: 0.971 ± 0.031
0.256CysHis: 0.256 ± 0.015
0.477CysIle: 0.477 ± 0.02
0.252CysLys: 0.252 ± 0.014
0.841CysLeu: 0.841 ± 0.027
0.21CysMet: 0.21 ± 0.015
0.273CysAsn: 0.273 ± 0.014
0.408CysPro: 0.408 ± 0.019
0.228CysGln: 0.228 ± 0.014
0.533CysArg: 0.533 ± 0.021
0.472CysSer: 0.472 ± 0.018
0.435CysThr: 0.435 ± 0.021
0.658CysVal: 0.658 ± 0.023
0.108CysTrp: 0.108 ± 0.008
0.215CysTyr: 0.215 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
7.231AspAla: 7.231 ± 0.082
0.505AspCys: 0.505 ± 0.021
2.906AspAsp: 2.906 ± 0.054
2.946AspGlu: 2.946 ± 0.055
2.201AspPhe: 2.201 ± 0.043
4.652AspGly: 4.652 ± 0.082
1.074AspHis: 1.074 ± 0.036
3.411AspIle: 3.411 ± 0.055
2.057AspLys: 2.057 ± 0.046
5.431AspLeu: 5.431 ± 0.07
1.417AspMet: 1.417 ± 0.034
1.664AspAsn: 1.664 ± 0.055
2.806AspPro: 2.806 ± 0.047
1.815AspGln: 1.815 ± 0.042
3.215AspArg: 3.215 ± 0.052
2.62AspSer: 2.62 ± 0.047
2.646AspThr: 2.646 ± 0.072
3.871AspVal: 3.871 ± 0.061
0.878AspTrp: 0.878 ± 0.026
1.646AspTyr: 1.646 ± 0.037
0.0AspXaa: 0.0 ± 0.0
Glu
6.01GluAla: 6.01 ± 0.092
0.401GluCys: 0.401 ± 0.019
2.291GluAsp: 2.291 ± 0.043
2.669GluGlu: 2.669 ± 0.061
1.888GluPhe: 1.888 ± 0.039
3.294GluGly: 3.294 ± 0.059
1.214GluHis: 1.214 ± 0.033
3.393GluIle: 3.393 ± 0.065
2.562GluLys: 2.562 ± 0.055
5.222GluLeu: 5.222 ± 0.08
1.493GluMet: 1.493 ± 0.034
1.828GluAsn: 1.828 ± 0.036
1.968GluPro: 1.968 ± 0.042
2.697GluGln: 2.697 ± 0.055
3.7GluArg: 3.7 ± 0.059
2.779GluSer: 2.779 ± 0.047
2.807GluThr: 2.807 ± 0.052
3.34GluVal: 3.34 ± 0.057
0.709GluTrp: 0.709 ± 0.024
1.226GluTyr: 1.226 ± 0.034
0.0GluXaa: 0.0 ± 0.0
Phe
4.098PheAla: 4.098 ± 0.059
0.427PheCys: 0.427 ± 0.019
2.514PheAsp: 2.514 ± 0.046
2.042PheGlu: 2.042 ± 0.045
1.524PhePhe: 1.524 ± 0.039
3.401PheGly: 3.401 ± 0.063
0.769PheHis: 0.769 ± 0.025
1.977PheIle: 1.977 ± 0.047
1.362PheLys: 1.362 ± 0.032
3.267PheLeu: 3.267 ± 0.062
0.912PheMet: 0.912 ± 0.027
1.366PheAsn: 1.366 ± 0.035
1.623PhePro: 1.623 ± 0.036
1.089PheGln: 1.089 ± 0.027
1.913PheArg: 1.913 ± 0.038
2.641PheSer: 2.641 ± 0.049
1.919PheThr: 1.919 ± 0.04
2.619PheVal: 2.619 ± 0.055
0.485PheTrp: 0.485 ± 0.022
1.058PheTyr: 1.058 ± 0.031
0.0PheXaa: 0.0 ± 0.0
Gly
9.131GlyAla: 9.131 ± 0.123
0.752GlyCys: 0.752 ± 0.03
4.176GlyAsp: 4.176 ± 0.079
3.955GlyGlu: 3.955 ± 0.065
3.209GlyPhe: 3.209 ± 0.05
6.721GlyGly: 6.721 ± 0.134
1.743GlyHis: 1.743 ± 0.039
4.815GlyIle: 4.815 ± 0.069
4.014GlyLys: 4.014 ± 0.058
8.048GlyLeu: 8.048 ± 0.091
2.517GlyMet: 2.517 ± 0.046
2.809GlyAsn: 2.809 ± 0.118
2.689GlyPro: 2.689 ± 0.053
2.924GlyGln: 2.924 ± 0.058
4.844GlyArg: 4.844 ± 0.07
4.598GlySer: 4.598 ± 0.081
3.849GlyThr: 3.849 ± 0.1
5.883GlyVal: 5.883 ± 0.087
1.197GlyTrp: 1.197 ± 0.032
2.278GlyTyr: 2.278 ± 0.043
0.0GlyXaa: 0.0 ± 0.0
His
2.692HisAla: 2.692 ± 0.054
0.252HisCys: 0.252 ± 0.013
1.239HisAsp: 1.239 ± 0.035
1.053HisGlu: 1.053 ± 0.029
0.934HisPhe: 0.934 ± 0.025
1.967HisGly: 1.967 ± 0.044
0.603HisHis: 0.603 ± 0.023
1.182HisIle: 1.182 ± 0.03
0.629HisLys: 0.629 ± 0.028
2.126HisLeu: 2.126 ± 0.044
0.519HisMet: 0.519 ± 0.022
0.604HisAsn: 0.604 ± 0.024
1.415HisPro: 1.415 ± 0.04
0.743HisGln: 0.743 ± 0.023
1.325HisArg: 1.325 ± 0.037
1.085HisSer: 1.085 ± 0.03
0.974HisThr: 0.974 ± 0.031
1.439HisVal: 1.439 ± 0.034
0.343HisTrp: 0.343 ± 0.017
0.669HisTyr: 0.669 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
7.754IleAla: 7.754 ± 0.084
0.537IleCys: 0.537 ± 0.024
4.002IleAsp: 4.002 ± 0.054
3.501IleGlu: 3.501 ± 0.063
1.852IlePhe: 1.852 ± 0.045
5.199IleGly: 5.199 ± 0.084
1.019IleHis: 1.019 ± 0.029
2.373IleIle: 2.373 ± 0.05
2.22IleLys: 2.22 ± 0.046
4.673IleLeu: 4.673 ± 0.062
1.119IleMet: 1.119 ± 0.032
1.844IleAsn: 1.844 ± 0.046
2.466IlePro: 2.466 ± 0.043
1.507IleGln: 1.507 ± 0.037
3.014IleArg: 3.014 ± 0.045
3.317IleSer: 3.317 ± 0.054
2.797IleThr: 2.797 ± 0.062
4.32IleVal: 4.32 ± 0.053
0.638IleTrp: 0.638 ± 0.028
1.295IleTyr: 1.295 ± 0.035
0.0IleXaa: 0.0 ± 0.0
Lys
4.318LysAla: 4.318 ± 0.073
0.21LysCys: 0.21 ± 0.014
1.993LysAsp: 1.993 ± 0.037
1.889LysGlu: 1.889 ± 0.044
1.253LysPhe: 1.253 ± 0.035
2.581LysGly: 2.581 ± 0.052
0.752LysHis: 0.752 ± 0.028
2.47LysIle: 2.47 ± 0.049
2.027LysLys: 2.027 ± 0.053
4.171LysLeu: 4.171 ± 0.063
1.239LysMet: 1.239 ± 0.035
1.54LysAsn: 1.54 ± 0.041
2.252LysPro: 2.252 ± 0.055
1.675LysGln: 1.675 ± 0.037
2.437LysArg: 2.437 ± 0.051
2.264LysSer: 2.264 ± 0.048
2.349LysThr: 2.349 ± 0.048
2.679LysVal: 2.679 ± 0.061
0.478LysTrp: 0.478 ± 0.021
0.901LysTyr: 0.901 ± 0.026
0.0LysXaa: 0.0 ± 0.0
Leu
13.311LeuAla: 13.311 ± 0.133
1.028LeuCys: 1.028 ± 0.033
5.609LeuAsp: 5.609 ± 0.079
5.09LeuGlu: 5.09 ± 0.075
3.78LeuPhe: 3.78 ± 0.051
7.758LeuGly: 7.758 ± 0.099
2.382LeuHis: 2.382 ± 0.047
5.41LeuIle: 5.41 ± 0.064
4.096LeuLys: 4.096 ± 0.06
11.322LeuLeu: 11.322 ± 0.145
2.621LeuMet: 2.621 ± 0.048
3.347LeuAsn: 3.347 ± 0.063
5.835LeuPro: 5.835 ± 0.085
4.269LeuGln: 4.269 ± 0.072
6.807LeuArg: 6.807 ± 0.092
6.515LeuSer: 6.515 ± 0.088
5.657LeuThr: 5.657 ± 0.072
6.384LeuVal: 6.384 ± 0.081
1.051LeuTrp: 1.051 ± 0.032
2.214LeuTyr: 2.214 ± 0.044
0.0LeuXaa: 0.0 ± 0.0
Met
3.053MetAla: 3.053 ± 0.053
0.169MetCys: 0.169 ± 0.014
1.23MetAsp: 1.23 ± 0.03
1.217MetGlu: 1.217 ± 0.03
0.841MetPhe: 0.841 ± 0.026
1.722MetGly: 1.722 ± 0.039
0.712MetHis: 0.712 ± 0.024
1.34MetIle: 1.34 ± 0.036
1.181MetLys: 1.181 ± 0.035
3.074MetLeu: 3.074 ± 0.055
0.707MetMet: 0.707 ± 0.026
0.952MetAsn: 0.952 ± 0.029
1.689MetPro: 1.689 ± 0.041
1.353MetGln: 1.353 ± 0.031
1.88MetArg: 1.88 ± 0.042
1.713MetSer: 1.713 ± 0.04
1.634MetThr: 1.634 ± 0.034
1.693MetVal: 1.693 ± 0.037
0.18MetTrp: 0.18 ± 0.012
0.464MetTyr: 0.464 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
4.082AsnAla: 4.082 ± 0.067
0.296AsnCys: 0.296 ± 0.016
1.774AsnAsp: 1.774 ± 0.067
1.427AsnGlu: 1.427 ± 0.034
1.215AsnPhe: 1.215 ± 0.035
2.913AsnGly: 2.913 ± 0.072
0.588AsnHis: 0.588 ± 0.023
1.908AsnIle: 1.908 ± 0.056
1.159AsnLys: 1.159 ± 0.033
3.149AsnLeu: 3.149 ± 0.06
0.834AsnMet: 0.834 ± 0.027
1.183AsnAsn: 1.183 ± 0.048
2.01AsnPro: 2.01 ± 0.044
1.067AsnGln: 1.067 ± 0.031
1.879AsnArg: 1.879 ± 0.038
1.632AsnSer: 1.632 ± 0.055
1.733AsnThr: 1.733 ± 0.059
2.333AsnVal: 2.333 ± 0.057
0.445AsnTrp: 0.445 ± 0.021
0.838AsnTyr: 0.838 ± 0.028
0.0AsnXaa: 0.0 ± 0.0
Pro
6.936ProAla: 6.936 ± 0.11
0.327ProCys: 0.327 ± 0.016
3.206ProAsp: 3.206 ± 0.057
3.046ProGlu: 3.046 ± 0.055
1.774ProPhe: 1.774 ± 0.043
3.992ProGly: 3.992 ± 0.064
1.141ProHis: 1.141 ± 0.032
2.256ProIle: 2.256 ± 0.043
1.751ProLys: 1.751 ± 0.048
4.969ProLeu: 4.969 ± 0.07
1.19ProMet: 1.19 ± 0.03
1.451ProAsn: 1.451 ± 0.039
2.407ProPro: 2.407 ± 0.056
2.027ProGln: 2.027 ± 0.048
2.465ProArg: 2.465 ± 0.049
2.593ProSer: 2.593 ± 0.045
2.1ProThr: 2.1 ± 0.041
3.76ProVal: 3.76 ± 0.071
0.542ProTrp: 0.542 ± 0.021
1.216ProTyr: 1.216 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
5.136GlnAla: 5.136 ± 0.089
0.301GlnCys: 0.301 ± 0.017
1.785GlnAsp: 1.785 ± 0.035
1.752GlnGlu: 1.752 ± 0.041
1.368GlnPhe: 1.368 ± 0.035
2.646GlnGly: 2.646 ± 0.054
0.875GlnHis: 0.875 ± 0.028
2.336GlnIle: 2.336 ± 0.046
1.561GlnLys: 1.561 ± 0.038
4.211GlnLeu: 4.211 ± 0.068
1.133GlnMet: 1.133 ± 0.033
1.204GlnAsn: 1.204 ± 0.037
1.939GlnPro: 1.939 ± 0.04
2.047GlnGln: 2.047 ± 0.056
2.906GlnArg: 2.906 ± 0.053
2.245GlnSer: 2.245 ± 0.048
2.039GlnThr: 2.039 ± 0.038
2.501GlnVal: 2.501 ± 0.046
0.52GlnTrp: 0.52 ± 0.023
0.976GlnTyr: 0.976 ± 0.024
0.0GlnXaa: 0.0 ± 0.0
Arg
6.526ArgAla: 6.526 ± 0.089
0.489ArgCys: 0.489 ± 0.02
3.423ArgAsp: 3.423 ± 0.057
3.512ArgGlu: 3.512 ± 0.063
2.551ArgPhe: 2.551 ± 0.05
4.024ArgGly: 4.024 ± 0.058
1.72ArgHis: 1.72 ± 0.04
3.863ArgIle: 3.863 ± 0.067
2.52ArgLys: 2.52 ± 0.047
6.724ArgLeu: 6.724 ± 0.089
1.772ArgMet: 1.772 ± 0.035
2.22ArgAsn: 2.22 ± 0.045
2.746ArgPro: 2.746 ± 0.052
2.839ArgGln: 2.839 ± 0.054
4.29ArgArg: 4.29 ± 0.079
3.108ArgSer: 3.108 ± 0.051
2.83ArgThr: 2.83 ± 0.059
3.866ArgVal: 3.866 ± 0.057
0.881ArgTrp: 0.881 ± 0.028
1.875ArgTyr: 1.875 ± 0.04
0.0ArgXaa: 0.0 ± 0.0
Ser
6.76SerAla: 6.76 ± 0.084
0.482SerCys: 0.482 ± 0.022
3.101SerAsp: 3.101 ± 0.056
2.683SerGlu: 2.683 ± 0.042
2.086SerPhe: 2.086 ± 0.041
5.476SerGly: 5.476 ± 0.077
1.257SerHis: 1.257 ± 0.032
3.163SerIle: 3.163 ± 0.061
2.028SerLys: 2.028 ± 0.045
5.769SerLeu: 5.769 ± 0.067
1.546SerMet: 1.546 ± 0.039
1.845SerAsn: 1.845 ± 0.052
2.691SerPro: 2.691 ± 0.044
1.894SerGln: 1.894 ± 0.039
3.315SerArg: 3.315 ± 0.057
3.287SerSer: 3.287 ± 0.074
2.898SerThr: 2.898 ± 0.06
3.967SerVal: 3.967 ± 0.069
0.663SerTrp: 0.663 ± 0.022
1.468SerTyr: 1.468 ± 0.04
0.0SerXaa: 0.0 ± 0.0
Thr
5.912ThrAla: 5.912 ± 0.081
0.344ThrCys: 0.344 ± 0.019
2.531ThrAsp: 2.531 ± 0.058
2.406ThrGlu: 2.406 ± 0.053
1.745ThrPhe: 1.745 ± 0.037
4.442ThrGly: 4.442 ± 0.128
1.087ThrHis: 1.087 ± 0.034
2.831ThrIle: 2.831 ± 0.055
1.419ThrLys: 1.419 ± 0.036
6.025ThrLeu: 6.025 ± 0.101
1.179ThrMet: 1.179 ± 0.034
1.405ThrAsn: 1.405 ± 0.041
3.19ThrPro: 3.19 ± 0.053
1.798ThrGln: 1.798 ± 0.04
2.797ThrArg: 2.797 ± 0.05
2.647ThrSer: 2.647 ± 0.061
2.441ThrThr: 2.441 ± 0.048
4.146ThrVal: 4.146 ± 0.078
0.537ThrTrp: 0.537 ± 0.021
1.201ThrTyr: 1.201 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
8.127ValAla: 8.127 ± 0.089
0.646ValCys: 0.646 ± 0.023
3.927ValAsp: 3.927 ± 0.064
3.743ValGlu: 3.743 ± 0.062
2.667ValPhe: 2.667 ± 0.053
4.941ValGly: 4.941 ± 0.078
1.363ValHis: 1.363 ± 0.035
3.967ValIle: 3.967 ± 0.06
2.906ValLys: 2.906 ± 0.055
7.307ValLeu: 7.307 ± 0.089
1.89ValMet: 1.89 ± 0.039
2.277ValAsn: 2.277 ± 0.048
3.264ValPro: 3.264 ± 0.056
2.524ValGln: 2.524 ± 0.05
4.14ValArg: 4.14 ± 0.06
4.163ValSer: 4.163 ± 0.06
3.848ValThr: 3.848 ± 0.07
5.167ValVal: 5.167 ± 0.081
0.726ValTrp: 0.726 ± 0.023
1.568ValTyr: 1.568 ± 0.039
0.0ValXaa: 0.0 ± 0.0
Trp
0.945TrpAla: 0.945 ± 0.028
0.132TrpCys: 0.132 ± 0.01
0.544TrpAsp: 0.544 ± 0.019
0.586TrpGlu: 0.586 ± 0.022
0.533TrpPhe: 0.533 ± 0.02
0.696TrpGly: 0.696 ± 0.025
0.323TrpHis: 0.323 ± 0.016
0.728TrpIle: 0.728 ± 0.026
0.487TrpLys: 0.487 ± 0.018
1.676TrpLeu: 1.676 ± 0.043
0.387TrpMet: 0.387 ± 0.019
0.463TrpAsn: 0.463 ± 0.018
0.576TrpPro: 0.576 ± 0.022
0.658TrpGln: 0.658 ± 0.023
1.062TrpArg: 1.062 ± 0.034
0.717TrpSer: 0.717 ± 0.025
0.555TrpThr: 0.555 ± 0.022
0.727TrpVal: 0.727 ± 0.024
0.156TrpTrp: 0.156 ± 0.013
0.332TrpTyr: 0.332 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.663TyrAla: 2.663 ± 0.049
0.26TyrCys: 0.26 ± 0.015
1.364TyrAsp: 1.364 ± 0.033
1.246TyrGlu: 1.246 ± 0.036
1.14TyrPhe: 1.14 ± 0.031
2.102TyrGly: 2.102 ± 0.042
0.49TyrHis: 0.49 ± 0.02
1.067TyrIle: 1.067 ± 0.026
0.906TyrLys: 0.906 ± 0.031
2.714TyrLeu: 2.714 ± 0.053
0.536TyrMet: 0.536 ± 0.02
0.777TyrAsn: 0.777 ± 0.024
1.275TyrPro: 1.275 ± 0.032
1.034TyrGln: 1.034 ± 0.031
1.881TyrArg: 1.881 ± 0.041
1.423TyrSer: 1.423 ± 0.038
1.248TyrThr: 1.248 ± 0.043
1.663TyrVal: 1.663 ± 0.037
0.369TyrTrp: 0.369 ± 0.018
0.682TyrTyr: 0.682 ± 0.027
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3601 proteins (1228013 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski