Amino acid dipeptide frequency for Haemophilus parahaemolyticus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
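
The counting described above can be illustrated with a minimal Python sketch. It is not the code used to build this table: the function names (`dipeptide_per_mille`, `classify`), the use of overlapping windows within each protein, and the use of the uniform 2.5‰ value as the over/under threshold are assumptions made for illustration, and the two toy sequences are invented.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids; anything else becomes X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1 out of 400 dipeptides, expressed in per mille.
EXPECTED = 1000.0 / 400  # 2.5


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption: overlapping windows of length 2 are counted, and a window
    never spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq, expected=EXPECTED):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"   # would be shown in red
    if freq < expected:
        return "under"  # would be shown in blue
    return "expected"


# Invented toy input, only to show the call pattern.
toy = ["MAALK", "AAL"]
freqs = dipeptide_per_mille(toy)
for dp in sorted(freqs):
    print(dp, round(freqs[dp], 3), classify(freqs[dp]))
```

With values from the table, `classify(5.964)` (AlaAla) gives "over" and `classify(0.955)` (AlaCys) gives "under" under this uniform model.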

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; for example, 5.964‰ corresponds to 0.005964 (0.5964%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.964AlaAla: 5.964 ± 0.17
0.955AlaCys: 0.955 ± 0.038
4.336AlaAsp: 4.336 ± 0.107
7.209AlaGlu: 7.209 ± 0.397
3.575AlaPhe: 3.575 ± 0.087
5.566AlaGly: 5.566 ± 0.123
1.531AlaHis: 1.531 ± 0.055
6.169AlaIle: 6.169 ± 0.118
6.347AlaLys: 6.347 ± 0.111
9.192AlaLeu: 9.192 ± 0.139
2.401AlaMet: 2.401 ± 0.069
4.117AlaAsn: 4.117 ± 0.149
2.272AlaPro: 2.272 ± 0.064
3.838AlaGln: 3.838 ± 0.09
3.934AlaArg: 3.934 ± 0.396
4.404AlaSer: 4.404 ± 0.11
4.542AlaThr: 4.542 ± 0.133
6.231AlaVal: 6.231 ± 0.102
0.83AlaTrp: 0.83 ± 0.041
2.39AlaTyr: 2.39 ± 0.07
0.0AlaXaa: 0.0 ± 0.0
Cys
0.651CysAla: 0.651 ± 0.033
0.151CysCys: 0.151 ± 0.017
0.527CysAsp: 0.527 ± 0.031
0.681CysGlu: 0.681 ± 0.038
0.42CysPhe: 0.42 ± 0.025
0.821CysGly: 0.821 ± 0.041
0.284CysHis: 0.284 ± 0.024
0.547CysIle: 0.547 ± 0.031
0.494CysLys: 0.494 ± 0.029
0.939CysLeu: 0.939 ± 0.042
0.168CysMet: 0.168 ± 0.016
0.382CysAsn: 0.382 ± 0.025
0.406CysPro: 0.406 ± 0.026
0.402CysGln: 0.402 ± 0.025
0.441CysArg: 0.441 ± 0.028
0.579CysSer: 0.579 ± 0.033
0.447CysThr: 0.447 ± 0.026
0.662CysVal: 0.662 ± 0.037
0.133CysTrp: 0.133 ± 0.014
0.333CysTyr: 0.333 ± 0.022
0.0CysXaa: 0.0 ± 0.0
Asp
3.641AspAla: 3.641 ± 0.094
0.475AspCys: 0.475 ± 0.03
2.342AspAsp: 2.342 ± 0.071
3.835AspGlu: 3.835 ± 0.089
2.676AspPhe: 2.676 ± 0.07
3.161AspGly: 3.161 ± 0.12
0.922AspHis: 0.922 ± 0.04
3.415AspIle: 3.415 ± 0.078
3.355AspLys: 3.355 ± 0.083
5.085AspLeu: 5.085 ± 0.104
1.081AspMet: 1.081 ± 0.038
2.24AspAsn: 2.24 ± 0.068
1.945AspPro: 1.945 ± 0.059
1.651AspGln: 1.651 ± 0.055
2.1AspArg: 2.1 ± 0.056
2.318AspSer: 2.318 ± 0.061
2.163AspThr: 2.163 ± 0.065
3.482AspVal: 3.482 ± 0.08
0.758AspTrp: 0.758 ± 0.039
2.14AspTyr: 2.14 ± 0.059
0.0AspXaa: 0.0 ± 0.0
Glu
5.904GluAla: 5.904 ± 0.371
0.521GluCys: 0.521 ± 0.029
2.634GluAsp: 2.634 ± 0.076
5.193GluGlu: 5.193 ± 0.347
2.47GluPhe: 2.47 ± 0.066
3.785GluGly: 3.785 ± 0.086
1.462GluHis: 1.462 ± 0.048
4.96GluIle: 4.96 ± 0.096
5.733GluLys: 5.733 ± 0.111
6.73GluLeu: 6.73 ± 0.123
2.001GluMet: 2.001 ± 0.065
4.126GluAsn: 4.126 ± 0.095
1.927GluPro: 1.927 ± 0.056
4.3GluGln: 4.3 ± 0.109
3.344GluArg: 3.344 ± 0.077
3.056GluSer: 3.056 ± 0.068
3.407GluThr: 3.407 ± 0.067
4.388GluVal: 4.388 ± 0.101
0.766GluTrp: 0.766 ± 0.034
1.868GluTyr: 1.868 ± 0.06
0.0GluXaa: 0.0 ± 0.0
Phe
3.859PheAla: 3.859 ± 0.09
0.577PheCys: 0.577 ± 0.034
2.725PheAsp: 2.725 ± 0.075
2.831PheGlu: 2.831 ± 0.069
2.066PhePhe: 2.066 ± 0.07
3.392PheGly: 3.392 ± 0.078
0.832PheHis: 0.832 ± 0.039
3.145PheIle: 3.145 ± 0.082
2.361PheLys: 2.361 ± 0.06
4.051PheLeu: 4.051 ± 0.093
1.039PheMet: 1.039 ± 0.037
2.437PheAsn: 2.437 ± 0.066
1.513PhePro: 1.513 ± 0.048
1.355PheGln: 1.355 ± 0.049
1.638PheArg: 1.638 ± 0.053
3.448PheSer: 3.448 ± 0.092
2.483PheThr: 2.483 ± 0.058
2.865PheVal: 2.865 ± 0.079
0.539PheTrp: 0.539 ± 0.031
1.637PheTyr: 1.637 ± 0.062
0.0PheXaa: 0.0 ± 0.0
Gly
4.84GlyAla: 4.84 ± 0.114
0.766GlyCys: 0.766 ± 0.041
3.125GlyAsp: 3.125 ± 0.108
4.298GlyGlu: 4.298 ± 0.1
3.208GlyPhe: 3.208 ± 0.071
4.47GlyGly: 4.47 ± 0.103
1.223GlyHis: 1.223 ± 0.046
5.035GlyIle: 5.035 ± 0.1
5.273GlyLys: 5.273 ± 0.126
6.672GlyLeu: 6.672 ± 0.124
1.824GlyMet: 1.824 ± 0.056
2.961GlyAsn: 2.961 ± 0.121
0.989GlyPro: 0.989 ± 0.047
2.403GlyGln: 2.403 ± 0.063
2.847GlyArg: 2.847 ± 0.083
3.722GlySer: 3.722 ± 0.097
3.365GlyThr: 3.365 ± 0.105
5.159GlyVal: 5.159 ± 0.105
0.899GlyTrp: 0.899 ± 0.037
2.421GlyTyr: 2.421 ± 0.061
0.0GlyXaa: 0.0 ± 0.0
His
1.356HisAla: 1.356 ± 0.044
0.317HisCys: 0.317 ± 0.025
0.842HisAsp: 0.842 ± 0.034
0.981HisGlu: 0.981 ± 0.043
1.262HisPhe: 1.262 ± 0.053
1.287HisGly: 1.287 ± 0.05
0.62HisHis: 0.62 ± 0.039
1.432HisIle: 1.432 ± 0.049
1.074HisLys: 1.074 ± 0.038
2.27HisLeu: 2.27 ± 0.065
0.37HisMet: 0.37 ± 0.022
1.024HisAsn: 1.024 ± 0.036
1.004HisPro: 1.004 ± 0.041
1.148HisGln: 1.148 ± 0.042
0.96HisArg: 0.96 ± 0.042
1.323HisSer: 1.323 ± 0.048
0.994HisThr: 0.994 ± 0.04
0.914HisVal: 0.914 ± 0.037
0.325HisTrp: 0.325 ± 0.026
0.859HisTyr: 0.859 ± 0.037
0.0HisXaa: 0.0 ± 0.0
Ile
6.637IleAla: 6.637 ± 0.123
0.745IleCys: 0.745 ± 0.036
3.662IleAsp: 3.662 ± 0.095
4.898IleGlu: 4.898 ± 0.09
2.903IlePhe: 2.903 ± 0.085
4.912IleGly: 4.912 ± 0.1
1.274IleHis: 1.274 ± 0.046
4.186IleIle: 4.186 ± 0.099
3.982IleLys: 3.982 ± 0.097
6.175IleLeu: 6.175 ± 0.115
1.209IleMet: 1.209 ± 0.046
3.142IleAsn: 3.142 ± 0.082
2.682IlePro: 2.682 ± 0.082
2.714IleGln: 2.714 ± 0.087
2.935IleArg: 2.935 ± 0.067
4.649IleSer: 4.649 ± 0.104
3.952IleThr: 3.952 ± 0.076
4.309IleVal: 4.309 ± 0.076
0.651IleTrp: 0.651 ± 0.033
2.034IleTyr: 2.034 ± 0.066
0.0IleXaa: 0.0 ± 0.0
Lys
6.643LysAla: 6.643 ± 0.351
0.396LysCys: 0.396 ± 0.031
3.006LysAsp: 3.006 ± 0.105
4.463LysGlu: 4.463 ± 0.112
2.365LysPhe: 2.365 ± 0.075
3.984LysGly: 3.984 ± 0.091
1.183LysHis: 1.183 ± 0.042
4.231LysIle: 4.231 ± 0.086
4.077LysLys: 4.077 ± 0.081
6.374LysLeu: 6.374 ± 0.094
2.046LysMet: 2.046 ± 0.061
3.436LysAsn: 3.436 ± 0.098
2.592LysPro: 2.592 ± 0.07
3.33LysGln: 3.33 ± 0.078
3.091LysArg: 3.091 ± 0.091
3.434LysSer: 3.434 ± 0.078
3.665LysThr: 3.665 ± 0.084
4.388LysVal: 4.388 ± 0.104
0.713LysTrp: 0.713 ± 0.036
1.757LysTyr: 1.757 ± 0.061
0.0LysXaa: 0.0 ± 0.0
Leu
9.596LeuAla: 9.596 ± 0.145
0.935LeuCys: 0.935 ± 0.04
5.536LeuAsp: 5.536 ± 0.113
6.259LeuGlu: 6.259 ± 0.119
4.745LeuPhe: 4.745 ± 0.109
6.746LeuGly: 6.746 ± 0.119
1.97LeuHis: 1.97 ± 0.052
6.393LeuIle: 6.393 ± 0.123
6.31LeuLys: 6.31 ± 0.107
10.097LeuLeu: 10.097 ± 0.192
2.462LeuMet: 2.462 ± 0.072
5.332LeuAsn: 5.332 ± 0.096
4.495LeuPro: 4.495 ± 0.096
4.136LeuGln: 4.136 ± 0.093
4.304LeuArg: 4.304 ± 0.096
7.235LeuSer: 7.235 ± 0.12
5.967LeuThr: 5.967 ± 0.107
6.522LeuVal: 6.522 ± 0.106
1.007LeuTrp: 1.007 ± 0.042
2.615LeuTyr: 2.615 ± 0.073
0.0LeuXaa: 0.0 ± 0.0
Met
2.387MetAla: 2.387 ± 0.078
0.172MetCys: 0.172 ± 0.017
0.968MetAsp: 0.968 ± 0.04
1.236MetGlu: 1.236 ± 0.05
0.951MetPhe: 0.951 ± 0.043
1.61MetGly: 1.61 ± 0.055
0.426MetHis: 0.426 ± 0.027
1.51MetIle: 1.51 ± 0.047
1.763MetLys: 1.763 ± 0.056
2.623MetLeu: 2.623 ± 0.068
0.712MetMet: 0.712 ± 0.032
1.228MetAsn: 1.228 ± 0.043
1.095MetPro: 1.095 ± 0.039
1.315MetGln: 1.315 ± 0.049
1.007MetArg: 1.007 ± 0.046
1.637MetSer: 1.637 ± 0.052
1.428MetThr: 1.428 ± 0.05
1.525MetVal: 1.525 ± 0.047
0.228MetTrp: 0.228 ± 0.02
0.46MetTyr: 0.46 ± 0.032
0.0MetXaa: 0.0 ± 0.0
Asn
4.311AsnAla: 4.311 ± 0.138
0.386AsnCys: 0.386 ± 0.027
2.235AsnAsp: 2.235 ± 0.078
3.165AsnGlu: 3.165 ± 0.074
2.028AsnPhe: 2.028 ± 0.063
3.572AsnGly: 3.572 ± 0.159
1.074AsnHis: 1.074 ± 0.045
3.488AsnIle: 3.488 ± 0.082
2.777AsnLys: 2.777 ± 0.088
4.846AsnLeu: 4.846 ± 0.094
0.947AsnMet: 0.947 ± 0.04
2.369AsnAsn: 2.369 ± 0.115
2.342AsnPro: 2.342 ± 0.063
2.502AsnGln: 2.502 ± 0.069
2.251AsnArg: 2.251 ± 0.064
2.656AsnSer: 2.656 ± 0.063
2.446AsnThr: 2.446 ± 0.104
3.45AsnVal: 3.45 ± 0.135
0.633AsnTrp: 0.633 ± 0.035
1.658AsnTyr: 1.658 ± 0.053
0.0AsnXaa: 0.0 ± 0.0
Pro
2.69ProAla: 2.69 ± 0.065
0.293ProCys: 0.293 ± 0.02
1.704ProAsp: 1.704 ± 0.051
3.198ProGlu: 3.198 ± 0.077
1.826ProPhe: 1.826 ± 0.065
1.246ProGly: 1.246 ± 0.048
0.854ProHis: 0.854 ± 0.039
2.527ProIle: 2.527 ± 0.074
2.305ProLys: 2.305 ± 0.068
3.82ProLeu: 3.82 ± 0.095
0.967ProMet: 0.967 ± 0.046
2.161ProAsn: 2.161 ± 0.058
1.008ProPro: 1.008 ± 0.041
1.656ProGln: 1.656 ± 0.051
1.318ProArg: 1.318 ± 0.042
2.105ProSer: 2.105 ± 0.059
2.212ProThr: 2.212 ± 0.069
2.568ProVal: 2.568 ± 0.068
0.316ProTrp: 0.316 ± 0.026
1.297ProTyr: 1.297 ± 0.05
0.0ProXaa: 0.0 ± 0.0
Gln
4.926GlnAla: 4.926 ± 0.106
0.369GlnCys: 0.369 ± 0.027
1.943GlnAsp: 1.943 ± 0.062
2.629GlnGlu: 2.629 ± 0.07
2.105GlnPhe: 2.105 ± 0.06
2.828GlnGly: 2.828 ± 0.074
0.983GlnHis: 0.983 ± 0.044
3.189GlnIle: 3.189 ± 0.085
3.046GlnLys: 3.046 ± 0.073
4.971GlnLeu: 4.971 ± 0.104
1.052GlnMet: 1.052 ± 0.038
2.3GlnAsn: 2.3 ± 0.064
1.739GlnPro: 1.739 ± 0.064
2.903GlnGln: 2.903 ± 0.081
2.078GlnArg: 2.078 ± 0.066
2.297GlnSer: 2.297 ± 0.066
2.462GlnThr: 2.462 ± 0.06
2.931GlnVal: 2.931 ± 0.076
0.568GlnTrp: 0.568 ± 0.032
1.472GlnTyr: 1.472 ± 0.055
0.0GlnXaa: 0.0 ± 0.0
Arg
3.097ArgAla: 3.097 ± 0.084
0.414ArgCys: 0.414 ± 0.026
2.057ArgAsp: 2.057 ± 0.063
3.149ArgGlu: 3.149 ± 0.089
2.336ArgPhe: 2.336 ± 0.062
2.461ArgGly: 2.461 ± 0.073
0.976ArgHis: 0.976 ± 0.038
2.942ArgIle: 2.942 ± 0.082
3.327ArgLys: 3.327 ± 0.357
4.723ArgLeu: 4.723 ± 0.099
1.015ArgMet: 1.015 ± 0.041
2.05ArgAsn: 2.05 ± 0.062
1.561ArgPro: 1.561 ± 0.052
2.159ArgGln: 2.159 ± 0.06
2.026ArgArg: 2.026 ± 0.072
2.268ArgSer: 2.268 ± 0.063
2.11ArgThr: 2.11 ± 0.052
2.839ArgVal: 2.839 ± 0.071
0.524ArgTrp: 0.524 ± 0.026
1.77ArgTyr: 1.77 ± 0.054
0.0ArgXaa: 0.0 ± 0.0
Ser
4.76SerAla: 4.76 ± 0.106
0.479SerCys: 0.479 ± 0.028
2.839SerAsp: 2.839 ± 0.068
3.718SerGlu: 3.718 ± 0.08
2.576SerPhe: 2.576 ± 0.076
4.418SerGly: 4.418 ± 0.098
1.416SerHis: 1.416 ± 0.046
3.748SerIle: 3.748 ± 0.087
3.195SerLys: 3.195 ± 0.073
6.227SerLeu: 6.227 ± 0.114
1.265SerMet: 1.265 ± 0.042
2.576SerAsn: 2.576 ± 0.064
2.169SerPro: 2.169 ± 0.063
2.995SerGln: 2.995 ± 0.071
2.421SerArg: 2.421 ± 0.064
3.572SerSer: 3.572 ± 0.093
2.985SerThr: 2.985 ± 0.088
4.102SerVal: 4.102 ± 0.098
0.675SerTrp: 0.675 ± 0.036
1.864SerTyr: 1.864 ± 0.056
0.0SerXaa: 0.0 ± 0.0
Thr
4.91ThrAla: 4.91 ± 0.134
0.415ThrCys: 0.415 ± 0.025
2.581ThrAsp: 2.581 ± 0.077
3.564ThrGlu: 3.564 ± 0.082
2.47ThrPhe: 2.47 ± 0.066
3.944ThrGly: 3.944 ± 0.103
1.18ThrHis: 1.18 ± 0.046
3.498ThrIle: 3.498 ± 0.09
2.913ThrLys: 2.913 ± 0.079
6.002ThrLeu: 6.002 ± 0.099
1.071ThrMet: 1.071 ± 0.044
2.224ThrAsn: 2.224 ± 0.107
2.379ThrPro: 2.379 ± 0.071
2.6ThrGln: 2.6 ± 0.065
1.957ThrArg: 1.957 ± 0.062
2.852ThrSer: 2.852 ± 0.074
2.99ThrThr: 2.99 ± 0.128
3.609ThrVal: 3.609 ± 0.152
0.503ThrTrp: 0.503 ± 0.027
1.452ThrTyr: 1.452 ± 0.06
0.0ThrXaa: 0.0 ± 0.0
Val
6.231ValAla: 6.231 ± 0.155
0.609ValCys: 0.609 ± 0.034
3.613ValAsp: 3.613 ± 0.09
4.962ValGlu: 4.962 ± 0.094
2.57ValPhe: 2.57 ± 0.069
4.563ValGly: 4.563 ± 0.098
1.081ValHis: 1.081 ± 0.045
4.696ValIle: 4.696 ± 0.088
4.548ValLys: 4.548 ± 0.108
6.823ValLeu: 6.823 ± 0.116
1.778ValMet: 1.778 ± 0.058
3.214ValAsn: 3.214 ± 0.151
2.437ValPro: 2.437 ± 0.068
2.498ValGln: 2.498 ± 0.069
2.892ValArg: 2.892 ± 0.075
4.139ValSer: 4.139 ± 0.127
3.553ValThr: 3.553 ± 0.143
4.899ValVal: 4.899 ± 0.112
0.588ValTrp: 0.588 ± 0.034
1.796ValTyr: 1.796 ± 0.056
0.0ValXaa: 0.0 ± 0.0
Trp
0.92TrpAla: 0.92 ± 0.039
0.136TrpCys: 0.136 ± 0.013
0.534TrpAsp: 0.534 ± 0.03
0.625TrpGlu: 0.625 ± 0.029
0.555TrpPhe: 0.555 ± 0.03
0.673TrpGly: 0.673 ± 0.037
0.265TrpHis: 0.265 ± 0.021
0.773TrpIle: 0.773 ± 0.037
0.707TrpLys: 0.707 ± 0.036
1.568TrpLeu: 1.568 ± 0.056
0.24TrpMet: 0.24 ± 0.021
0.531TrpAsn: 0.531 ± 0.026
0.115TrpPro: 0.115 ± 0.014
0.928TrpGln: 0.928 ± 0.041
0.55TrpArg: 0.55 ± 0.031
0.468TrpSer: 0.468 ± 0.031
0.458TrpThr: 0.458 ± 0.028
0.768TrpVal: 0.768 ± 0.037
0.159TrpTrp: 0.159 ± 0.016
0.338TrpTyr: 0.338 ± 0.024
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.48TyrAla: 2.48 ± 0.072
0.356TyrCys: 0.356 ± 0.023
1.574TyrAsp: 1.574 ± 0.044
1.682TyrGlu: 1.682 ± 0.052
1.688TyrPhe: 1.688 ± 0.06
2.063TyrGly: 2.063 ± 0.066
0.806TyrHis: 0.806 ± 0.035
1.77TyrIle: 1.77 ± 0.054
1.587TyrLys: 1.587 ± 0.059
3.472TyrLeu: 3.472 ± 0.079
0.63TyrMet: 0.63 ± 0.03
1.287TyrAsn: 1.287 ± 0.046
1.401TyrPro: 1.401 ± 0.05
1.957TyrGln: 1.957 ± 0.062
1.709TyrArg: 1.709 ± 0.06
1.837TyrSer: 1.837 ± 0.056
1.486TyrThr: 1.486 ± 0.053
1.879TyrVal: 1.879 ± 0.06
0.495TyrTrp: 0.495 ± 0.03
1.064TyrTyr: 1.064 ± 0.046
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2159 proteins (623779 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
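
Protein-level bootstrapping can be sketched as follows, as an illustration of the general technique rather than the database's actual procedure. Whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed seed, and the choice of the sample standard deviation as the reported error are assumptions. The toy sequences are invented.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # Counter gives 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Invented toy proteome, only to show the call pattern.
toy_proteome = ["MAALKAA", "AALGG", "KKAAK", "MGGAL"]
mean_aa, err_aa = bootstrap_error(toy_proteome, "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f} per mille")
```

Resampling whole proteins keeps the dipeptides of each protein together, so the estimated error reflects variation between proteins rather than between individual positions.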

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski