Amino acid dipeptide frequency for Helicobacter sanguini

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
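As a rough illustration of the calculation described above, the following Python sketch counts dipeptides and compares them with the uniform expectation of 1000/400 = 2.5‰. It is not the Proteome-pI implementation: the function names, the toy sequences, the use of overlapping residue pairs within each protein, and the mapping of every non-standard letter to X (Xaa) are all assumptions made for this example.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein,
    and pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq_permille, expected_permille=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille by default)."""
    if freq_permille > expected_permille:
        return "over"
    if freq_permille < expected_permille:
        return "under"
    return "expected"


# Made-up toy sequences; 'B' is treated as Xaa.
toy_proteome = ["MKKLLA", "MKAB"]
freqs = dipeptide_permille(toy_proteome)
print(freqs["MK"], representation(freqs["MK"]))  # 250.0 over
```

With only eight dipeptides in the toy input, every observed pair is far above 2.5‰; on a full proteome the same comparison separates over- and underrepresented dipeptides in the sense used for the red and blue marking.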

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
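The unit conversion can be made explicit with a short Python sketch. The two helper names are hypothetical; the example values are the AlaAla and AspSer entries from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


# Values taken from the table: AlaAla and AspSer.
for name, value in [("AlaAla", 3.64), ("AspSer", 11.976)]:
    fraction = permille_to_fraction(value)
    percent = permille_to_percent(value)
    print(f"{name}: {value} per mille = {fraction:.6f} = {percent:.4f} %")
```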

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.64 ± 0.106
AlaCys: 0.862 ± 0.052
AlaAsp: 2.986 ± 0.067
AlaGlu: 2.776 ± 0.067
AlaPhe: 3.477 ± 0.068
AlaGly: 3.349 ± 0.088
AlaHis: 1.121 ± 0.035
AlaIle: 5.313 ± 0.098
AlaLys: 5.933 ± 0.093
AlaLeu: 7.399 ± 0.096
AlaMet: 1.607 ± 0.049
AlaAsn: 4.409 ± 0.08
AlaPro: 1.623 ± 0.055
AlaGln: 2.25 ± 0.056
AlaArg: 2.718 ± 0.078
AlaSer: 4.038 ± 0.095
AlaThr: 3.142 ± 0.063
AlaVal: 2.941 ± 0.071
AlaTrp: 0.478 ± 0.025
AlaTyr: 2.19 ± 0.061
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.73 ± 0.039
CysCys: 0.11 ± 0.011
CysAsp: 0.792 ± 0.041
CysGlu: 0.903 ± 0.038
CysPhe: 0.592 ± 0.03
CysGly: 0.927 ± 0.044
CysHis: 0.237 ± 0.018
CysIle: 0.887 ± 0.039
CysLys: 0.958 ± 0.038
CysLeu: 0.903 ± 0.036
CysMet: 0.225 ± 0.017
CysAsn: 0.562 ± 0.027
CysPro: 0.288 ± 0.02
CysGln: 0.2 ± 0.017
CysArg: 0.252 ± 0.018
CysSer: 0.613 ± 0.028
CysThr: 0.328 ± 0.019
CysVal: 1.117 ± 0.041
CysTrp: 0.051 ± 0.009
CysTyr: 0.47 ± 0.027
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.466 ± 0.063
AspCys: 0.692 ± 0.033
AspAsp: 2.11 ± 0.058
AspGlu: 2.767 ± 0.069
AspPhe: 4.418 ± 0.082
AspGly: 2.399 ± 0.065
AspHis: 0.237 ± 0.02
AspIle: 5.259 ± 0.097
AspLys: 4.661 ± 0.079
AspLeu: 4.858 ± 0.095
AspMet: 1.335 ± 0.038
AspAsn: 3.09 ± 0.077
AspPro: 0.841 ± 0.037
AspGln: 0.374 ± 0.026
AspArg: 1.458 ± 0.049
AspSer: 11.976 ± 0.232
AspThr: 2.373 ± 0.058
AspVal: 2.902 ± 0.064
AspTrp: 0.259 ± 0.018
AspTyr: 2.317 ± 0.054
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.199 ± 0.072
GluCys: 0.801 ± 0.034
GluAsp: 2.075 ± 0.05
GluGlu: 2.316 ± 0.067
GluPhe: 3.165 ± 0.073
GluGly: 1.94 ± 0.064
GluHis: 0.925 ± 0.035
GluIle: 6.554 ± 0.102
GluLys: 5.118 ± 0.11
GluLeu: 5.317 ± 0.097
GluMet: 1.655 ± 0.04
GluAsn: 5.071 ± 0.094
GluPro: 1.182 ± 0.042
GluGln: 1.999 ± 0.066
GluArg: 2.183 ± 0.06
GluSer: 9.543 ± 0.184
GluThr: 2.095 ± 0.05
GluVal: 2.989 ± 0.067
GluTrp: 0.548 ± 0.029
GluTyr: 2.5 ± 0.074
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.517 ± 0.083
PheCys: 0.854 ± 0.037
PheAsp: 2.987 ± 0.072
PheGlu: 3.004 ± 0.069
PhePhe: 2.982 ± 0.083
PheGly: 3.446 ± 0.075
PheHis: 0.747 ± 0.032
PheIle: 5.044 ± 0.104
PheLys: 4.311 ± 0.086
PheLeu: 5.397 ± 0.106
PheMet: 1.432 ± 0.042
PheAsn: 3.536 ± 0.085
PhePro: 1.161 ± 0.04
PheGln: 1.044 ± 0.037
PheArg: 1.723 ± 0.05
PheSer: 4.573 ± 0.09
PheThr: 2.19 ± 0.058
PheVal: 3.025 ± 0.07
PheTrp: 0.483 ± 0.026
PheTyr: 2.652 ± 0.071
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.766 ± 0.095
GlyCys: 0.569 ± 0.03
GlyAsp: 2.787 ± 0.069
GlyGlu: 3.372 ± 0.07
GlyPhe: 3.53 ± 0.078
GlyGly: 4.51 ± 0.132
GlyHis: 0.855 ± 0.033
GlyIle: 4.696 ± 0.093
GlyLys: 3.892 ± 0.086
GlyLeu: 5.076 ± 0.087
GlyMet: 1.316 ± 0.042
GlyAsn: 2.985 ± 0.086
GlyPro: 0.606 ± 0.031
GlyGln: 1.39 ± 0.05
GlyArg: 2.006 ± 0.063
GlySer: 3.522 ± 0.084
GlyThr: 1.906 ± 0.066
GlyVal: 4.283 ± 0.089
GlyTrp: 0.464 ± 0.024
GlyTyr: 2.6 ± 0.069
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.118 ± 0.042
HisCys: 0.194 ± 0.019
HisAsp: 0.743 ± 0.034
HisGlu: 0.804 ± 0.036
HisPhe: 1.217 ± 0.04
HisGly: 0.834 ± 0.042
HisHis: 0.39 ± 0.023
HisIle: 1.798 ± 0.043
HisLys: 1.446 ± 0.048
HisLeu: 1.603 ± 0.049
HisMet: 0.187 ± 0.016
HisAsn: 1.314 ± 0.047
HisPro: 0.644 ± 0.032
HisGln: 0.321 ± 0.018
HisArg: 0.569 ± 0.03
HisSer: 1.037 ± 0.036
HisThr: 0.979 ± 0.037
HisVal: 0.524 ± 0.028
HisTrp: 0.077 ± 0.011
HisTyr: 0.715 ± 0.032
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.194 ± 0.124
IleCys: 1.071 ± 0.039
IleAsp: 5.485 ± 0.096
IleGlu: 7.828 ± 0.137
IlePhe: 4.908 ± 0.106
IleGly: 4.08 ± 0.09
IleHis: 1.29 ± 0.043
IleIle: 6.663 ± 0.127
IleLys: 7.404 ± 0.118
IleLeu: 9.536 ± 0.14
IleMet: 1.841 ± 0.053
IleAsn: 6.028 ± 0.107
IlePro: 2.629 ± 0.061
IleGln: 2.144 ± 0.064
IleArg: 2.332 ± 0.057
IleSer: 5.961 ± 0.083
IleThr: 4.518 ± 0.095
IleVal: 4.154 ± 0.083
IleTrp: 0.474 ± 0.027
IleTyr: 3.452 ± 0.073
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.218 ± 0.095
LysCys: 0.644 ± 0.035
LysAsp: 7.215 ± 0.123
LysGlu: 6.97 ± 0.122
LysPhe: 3.149 ± 0.073
LysGly: 4.277 ± 0.08
LysHis: 1.451 ± 0.049
LysIle: 8.49 ± 0.135
LysLys: 7.934 ± 0.127
LysLeu: 6.836 ± 0.1
LysMet: 2.085 ± 0.051
LysAsn: 8.105 ± 0.14
LysPro: 2.334 ± 0.061
LysGln: 3.002 ± 0.073
LysArg: 2.814 ± 0.065
LysSer: 5.835 ± 0.102
LysThr: 3.913 ± 0.072
LysVal: 4.26 ± 0.074
LysTrp: 0.577 ± 0.029
LysTyr: 3.227 ± 0.074
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.336 ± 0.109
LeuCys: 1.305 ± 0.046
LeuAsp: 7.553 ± 0.123
LeuGlu: 7.467 ± 0.12
LeuPhe: 4.459 ± 0.106
LeuGly: 5.609 ± 0.097
LeuHis: 1.807 ± 0.055
LeuIle: 6.251 ± 0.117
LeuLys: 8.701 ± 0.125
LeuLeu: 8.237 ± 0.126
LeuMet: 1.898 ± 0.051
LeuAsn: 6.864 ± 0.127
LeuPro: 3.04 ± 0.072
LeuGln: 4.533 ± 0.091
LeuArg: 3.731 ± 0.068
LeuSer: 7.781 ± 0.126
LeuThr: 3.468 ± 0.063
LeuVal: 4.286 ± 0.08
LeuTrp: 0.67 ± 0.032
LeuTyr: 3.339 ± 0.072
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.382 ± 0.043
MetCys: 0.295 ± 0.019
MetAsp: 1.179 ± 0.041
MetGlu: 1.346 ± 0.04
MetPhe: 1.072 ± 0.043
MetGly: 1.324 ± 0.033
MetHis: 0.295 ± 0.017
MetIle: 1.67 ± 0.042
MetLys: 1.722 ± 0.048
MetLeu: 2.653 ± 0.06
MetMet: 0.463 ± 0.027
MetAsn: 1.145 ± 0.039
MetPro: 1.071 ± 0.04
MetGln: 1.62 ± 0.045
MetArg: 1.059 ± 0.038
MetSer: 1.471 ± 0.046
MetThr: 0.788 ± 0.034
MetVal: 0.968 ± 0.042
MetTrp: 0.168 ± 0.015
MetTyr: 0.677 ± 0.029
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.692 ± 0.098
AsnCys: 0.543 ± 0.028
AsnAsp: 3.582 ± 0.07
AsnGlu: 4.153 ± 0.1
AsnPhe: 3.865 ± 0.085
AsnGly: 3.366 ± 0.086
AsnHis: 1.19 ± 0.042
AsnIle: 7.884 ± 0.148
AsnLys: 5.582 ± 0.112
AsnLeu: 8.352 ± 0.149
AsnMet: 1.677 ± 0.045
AsnAsn: 4.723 ± 0.118
AsnPro: 2.493 ± 0.069
AsnGln: 1.611 ± 0.061
AsnArg: 1.915 ± 0.056
AsnSer: 4.256 ± 0.093
AsnThr: 4.008 ± 0.095
AsnVal: 4.134 ± 0.096
AsnTrp: 0.231 ± 0.02
AsnTyr: 2.504 ± 0.064
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.527 ± 0.057
ProCys: 0.333 ± 0.021
ProAsp: 1.083 ± 0.041
ProGlu: 1.103 ± 0.04
ProPhe: 1.748 ± 0.051
ProGly: 0.96 ± 0.041
ProHis: 0.734 ± 0.033
ProIle: 2.156 ± 0.056
ProLys: 2.508 ± 0.072
ProLeu: 3.338 ± 0.071
ProMet: 0.481 ± 0.03
ProAsn: 1.971 ± 0.05
ProPro: 1.118 ± 0.047
ProGln: 1.489 ± 0.052
ProArg: 0.958 ± 0.035
ProSer: 1.802 ± 0.056
ProThr: 1.922 ± 0.058
ProVal: 1.202 ± 0.042
ProTrp: 0.162 ± 0.013
ProTyr: 1.241 ± 0.045
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.688 ± 0.051
GlnCys: 0.198 ± 0.015
GlnAsp: 3.019 ± 0.075
GlnGlu: 2.229 ± 0.07
GlnPhe: 1.023 ± 0.04
GlnGly: 1.98 ± 0.064
GlnHis: 0.499 ± 0.024
GlnIle: 2.817 ± 0.067
GlnLys: 3.306 ± 0.074
GlnLeu: 1.844 ± 0.056
GlnMet: 0.689 ± 0.033
GlnAsn: 4.057 ± 0.086
GlnPro: 0.703 ± 0.037
GlnGln: 0.945 ± 0.05
GlnArg: 1.06 ± 0.041
GlnSer: 2.259 ± 0.06
GlnThr: 1.645 ± 0.053
GlnVal: 1.581 ± 0.048
GlnTrp: 0.16 ± 0.015
GlnTyr: 1.073 ± 0.046
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.32 ± 0.054
ArgCys: 0.213 ± 0.017
ArgAsp: 2.254 ± 0.061
ArgGlu: 2.606 ± 0.077
ArgPhe: 2.275 ± 0.057
ArgGly: 1.93 ± 0.057
ArgHis: 0.605 ± 0.031
ArgIle: 3.204 ± 0.063
ArgLys: 2.361 ± 0.063
ArgLeu: 3.634 ± 0.078
ArgMet: 0.739 ± 0.034
ArgAsn: 2.029 ± 0.054
ArgPro: 0.892 ± 0.038
ArgGln: 1.099 ± 0.044
ArgArg: 1.14 ± 0.042
ArgSer: 1.632 ± 0.049
ArgThr: 1.228 ± 0.042
ArgVal: 2.108 ± 0.052
ArgTrp: 0.211 ± 0.016
ArgTyr: 1.485 ± 0.049
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.256 ± 0.089
SerCys: 0.75 ± 0.032
SerAsp: 3.523 ± 0.069
SerGlu: 3.954 ± 0.091
SerPhe: 4.092 ± 0.072
SerGly: 4.579 ± 0.096
SerHis: 1.352 ± 0.043
SerIle: 7.713 ± 0.109
SerLys: 10.796 ± 0.183
SerLeu: 7.601 ± 0.125
SerMet: 1.811 ± 0.051
SerAsn: 6.7 ± 0.145
SerPro: 2.158 ± 0.052
SerGln: 2.771 ± 0.072
SerArg: 2.523 ± 0.064
SerSer: 5.325 ± 0.107
SerThr: 3.232 ± 0.08
SerVal: 4.179 ± 0.086
SerTrp: 0.52 ± 0.029
SerTyr: 2.936 ± 0.066
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.303 ± 0.065
ThrCys: 0.476 ± 0.027
ThrAsp: 1.815 ± 0.057
ThrGlu: 1.795 ± 0.048
ThrPhe: 2.734 ± 0.065
ThrGly: 1.987 ± 0.071
ThrHis: 1.328 ± 0.044
ThrIle: 3.422 ± 0.088
ThrLys: 3.869 ± 0.082
ThrLeu: 5.523 ± 0.102
ThrMet: 0.855 ± 0.036
ThrAsn: 3.314 ± 0.094
ThrPro: 2.254 ± 0.066
ThrGln: 3.143 ± 0.07
ThrArg: 1.753 ± 0.049
ThrSer: 2.964 ± 0.079
ThrThr: 2.556 ± 0.083
ThrVal: 0.502 ± 0.034
ThrTrp: 0.342 ± 0.025
ThrTyr: 1.772 ± 0.059
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.851 ± 0.088
ValCys: 0.705 ± 0.036
ValAsp: 2.956 ± 0.072
ValGlu: 3.01 ± 0.065
ValPhe: 2.623 ± 0.061
ValGly: 3.893 ± 0.082
ValHis: 0.581 ± 0.03
ValIle: 4.56 ± 0.071
ValLys: 3.713 ± 0.073
ValLeu: 4.864 ± 0.099
ValMet: 1.174 ± 0.038
ValAsn: 2.698 ± 0.064
ValPro: 1.463 ± 0.051
ValGln: 1.404 ± 0.045
ValArg: 2.085 ± 0.057
ValSer: 3.555 ± 0.072
ValThr: 2.147 ± 0.061
ValVal: 3.26 ± 0.078
ValTrp: 0.402 ± 0.023
ValTyr: 1.672 ± 0.047
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.429 ± 0.026
TrpCys: 0.112 ± 0.011
TrpAsp: 0.398 ± 0.033
TrpGlu: 0.397 ± 0.024
TrpPhe: 0.287 ± 0.026
TrpGly: 0.509 ± 0.033
TrpHis: 0.166 ± 0.014
TrpIle: 0.489 ± 0.025
TrpLys: 0.37 ± 0.023
TrpLeu: 0.811 ± 0.038
TrpMet: 0.119 ± 0.015
TrpAsn: 0.447 ± 0.028
TrpPro: 0.064 ± 0.01
TrpGln: 0.334 ± 0.019
TrpArg: 0.29 ± 0.02
TrpSer: 0.379 ± 0.025
TrpThr: 0.204 ± 0.018
TrpVal: 0.422 ± 0.025
TrpTrp: 0.085 ± 0.011
TrpTyr: 0.245 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.621 ± 0.065
TyrCys: 0.417 ± 0.023
TyrAsp: 2.247 ± 0.063
TyrGlu: 2.377 ± 0.066
TyrPhe: 2.492 ± 0.064
TyrGly: 2.231 ± 0.069
TyrHis: 0.701 ± 0.031
TyrIle: 3.124 ± 0.066
TyrLys: 3.752 ± 0.093
TyrLeu: 3.555 ± 0.073
TyrMet: 0.778 ± 0.034
TyrAsn: 2.714 ± 0.077
TyrPro: 1.272 ± 0.039
TyrGln: 1.1 ± 0.042
TyrArg: 1.397 ± 0.044
TyrSer: 2.25 ± 0.056
TyrThr: 2.066 ± 0.063
TyrVal: 1.78 ± 0.053
TyrTrp: 0.194 ± 0.016
TyrTyr: 1.803 ± 0.058
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2259 proteins (738759 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
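A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions, not the Proteome-pI code: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the sample standard deviation of the replicates is reported as the error. The function names, the fixed seed, and the choice of standard deviation as the error measure are all assumptions.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: each replicate resamples whole proteins with replacement,
    so residues of the same protein always stay together.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # a Counter returns 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the spread of the replicates is summarised by the sample standard deviation.
    return stdev(estimates)


# Made-up toy sequences, for illustration only.
toy_proteome = ["MKKLLA", "MKAKK", "AAAA", "MKLLK"]
print(bootstrap_error(toy_proteome, "MK"))
```

Resampling at the protein level rather than at the residue level keeps the correlation between dipeptides of the same protein intact, which is presumably why the error is estimated this way.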

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski