Amino acid dipeptide frequency for Helicobacter muridarum

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
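As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter sequence encoding, the toy sequences, and the normalization by the total number of dipeptides are all assumptions made for illustration; they are not taken from the Proteome-pI pipeline.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all sequences.

    Assumption: pairs are counted with overlap inside each protein and
    never span two proteins; frequencies are normalized by the total
    number of dipeptides observed.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input (one-letter codes), not real H. muridarum data.
toy = ["MKLLA", "AKLL"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The table below uses three-letter codes (e.g. LysLeu instead of KL), so a mapping step would be needed to reproduce its labels.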

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
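The arithmetic behind the expected value can be sketched as follows. The three observed values are copied from the table below; the `classify` helper and the use of exactly 2.5‰ as the cut-off are assumptions for illustration, since the page does not state the precise rule used for colouring.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille.
expected_permille = 1000.0 / (N_STANDARD ** 2)

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (hypothetical rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"

# Values in per mille, taken from the table on this page.
examples = {"AlaAla": 2.771, "CysCys": 0.157, "IleLeu": 9.518}
labels = {name: classify(value) for name, value in examples.items()}
print(expected_permille, labels)
```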

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
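A small sketch of this unit conversion, using the AlaAla value from the table; the helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

ala_ala = 2.771  # per mille, from the table
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
print(f"{fraction:.6f} {percent:.4f}%")
```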

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.771AlaAla: 2.771 ± 0.082
1.028AlaCys: 1.028 ± 0.049
2.544AlaAsp: 2.544 ± 0.065
2.405AlaGlu: 2.405 ± 0.069
3.03AlaPhe: 3.03 ± 0.083
3.395AlaGly: 3.395 ± 0.087
1.127AlaHis: 1.127 ± 0.045
6.023AlaIle: 6.023 ± 0.126
5.843AlaLys: 5.843 ± 0.12
7.16AlaLeu: 7.16 ± 0.124
1.74AlaMet: 1.74 ± 0.059
4.459AlaAsn: 4.459 ± 0.101
1.572AlaPro: 1.572 ± 0.053
2.542AlaGln: 2.542 ± 0.08
2.729AlaArg: 2.729 ± 0.068
4.456AlaSer: 4.456 ± 0.095
2.983AlaThr: 2.983 ± 0.071
2.705AlaVal: 2.705 ± 0.072
0.468AlaTrp: 0.468 ± 0.028
2.38AlaTyr: 2.38 ± 0.061
0.0AlaXaa: 0.0 ± 0.0
Cys
0.887CysAla: 0.887 ± 0.034
0.157CysCys: 0.157 ± 0.018
0.76CysAsp: 0.76 ± 0.038
0.765CysGlu: 0.765 ± 0.041
0.758CysPhe: 0.758 ± 0.038
0.881CysGly: 0.881 ± 0.042
0.262CysHis: 0.262 ± 0.021
1.344CysIle: 1.344 ± 0.053
0.92CysLys: 0.92 ± 0.04
1.33CysLeu: 1.33 ± 0.052
0.309CysMet: 0.309 ± 0.024
0.809CysAsn: 0.809 ± 0.039
0.334CysPro: 0.334 ± 0.023
0.292CysGln: 0.292 ± 0.022
0.305CysArg: 0.305 ± 0.026
0.804CysSer: 0.804 ± 0.041
0.38CysThr: 0.38 ± 0.026
0.882CysVal: 0.882 ± 0.042
0.074CysTrp: 0.074 ± 0.01
0.626CysTyr: 0.626 ± 0.035
0.0CysXaa: 0.0 ± 0.0
Asp
2.241AspAla: 2.241 ± 0.07
0.826AspCys: 0.826 ± 0.039
2.508AspAsp: 2.508 ± 0.085
3.219AspGlu: 3.219 ± 0.084
3.575AspPhe: 3.575 ± 0.08
2.614AspGly: 2.614 ± 0.07
0.437AspHis: 0.437 ± 0.028
6.543AspIle: 6.543 ± 0.12
4.676AspLys: 4.676 ± 0.096
4.729AspLeu: 4.729 ± 0.088
1.564AspMet: 1.564 ± 0.056
3.511AspAsn: 3.511 ± 0.079
1.071AspPro: 1.071 ± 0.047
0.598AspGln: 0.598 ± 0.033
1.826AspArg: 1.826 ± 0.051
6.469AspSer: 6.469 ± 0.12
2.487AspThr: 2.487 ± 0.065
2.822AspVal: 2.822 ± 0.08
0.383AspTrp: 0.383 ± 0.024
2.41AspTyr: 2.41 ± 0.07
0.0AspXaa: 0.0 ± 0.0
Glu
3.627GluAla: 3.627 ± 0.095
0.78GluCys: 0.78 ± 0.038
2.649GluAsp: 2.649 ± 0.088
3.434GluGlu: 3.434 ± 0.091
2.952GluPhe: 2.952 ± 0.068
2.418GluGly: 2.418 ± 0.063
1.115GluHis: 1.115 ± 0.042
6.081GluIle: 6.081 ± 0.1
4.29GluLys: 4.29 ± 0.118
5.968GluLeu: 5.968 ± 0.104
1.372GluMet: 1.372 ± 0.056
3.691GluAsn: 3.691 ± 0.094
1.211GluPro: 1.211 ± 0.051
2.178GluGln: 2.178 ± 0.069
2.189GluArg: 2.189 ± 0.061
5.277GluSer: 5.277 ± 0.11
2.132GluThr: 2.132 ± 0.091
3.417GluVal: 3.417 ± 0.087
0.473GluTrp: 0.473 ± 0.029
2.574GluTyr: 2.574 ± 0.06
0.0GluXaa: 0.0 ± 0.0
Phe
3.374PheAla: 3.374 ± 0.084
0.947PheCys: 0.947 ± 0.042
2.99PheAsp: 2.99 ± 0.067
2.638PheGlu: 2.638 ± 0.072
2.914PhePhe: 2.914 ± 0.104
3.432PheGly: 3.432 ± 0.081
0.887PheHis: 0.887 ± 0.038
4.842PheIle: 4.842 ± 0.111
3.071PheLys: 3.071 ± 0.076
5.105PheLeu: 5.105 ± 0.099
1.31PheMet: 1.31 ± 0.043
2.958PheAsn: 2.958 ± 0.082
1.261PhePro: 1.261 ± 0.044
1.363PheGln: 1.363 ± 0.07
1.603PheArg: 1.603 ± 0.051
4.26PheSer: 4.26 ± 0.086
2.063PheThr: 2.063 ± 0.051
3.114PheVal: 3.114 ± 0.09
0.446PheTrp: 0.446 ± 0.028
2.459PheTyr: 2.459 ± 0.071
0.0PheXaa: 0.0 ± 0.0
Gly
3.841GlyAla: 3.841 ± 0.104
0.623GlyCys: 0.623 ± 0.035
2.966GlyAsp: 2.966 ± 0.07
3.132GlyGlu: 3.132 ± 0.082
3.792GlyPhe: 3.792 ± 0.078
4.225GlyGly: 4.225 ± 0.118
0.963GlyHis: 0.963 ± 0.044
6.122GlyIle: 6.122 ± 0.112
4.068GlyLys: 4.068 ± 0.097
5.263GlyLeu: 5.263 ± 0.1
1.523GlyMet: 1.523 ± 0.05
3.063GlyAsn: 3.063 ± 0.076
0.702GlyPro: 0.702 ± 0.039
1.504GlyGln: 1.504 ± 0.055
2.15GlyArg: 2.15 ± 0.066
3.577GlySer: 3.577 ± 0.096
2.135GlyThr: 2.135 ± 0.06
3.85GlyVal: 3.85 ± 0.091
0.391GlyTrp: 0.391 ± 0.026
2.746GlyTyr: 2.746 ± 0.078
0.0GlyXaa: 0.0 ± 0.0
His
1.107HisAla: 1.107 ± 0.046
0.294HisCys: 0.294 ± 0.021
0.95HisAsp: 0.95 ± 0.038
0.887HisGlu: 0.887 ± 0.034
1.079HisPhe: 1.079 ± 0.05
1.072HisGly: 1.072 ± 0.039
0.389HisHis: 0.389 ± 0.027
2.258HisIle: 2.258 ± 0.061
1.605HisLys: 1.605 ± 0.052
1.713HisLeu: 1.713 ± 0.054
0.267HisMet: 0.267 ± 0.021
1.33HisAsn: 1.33 ± 0.044
0.528HisPro: 0.528 ± 0.032
0.54HisGln: 0.54 ± 0.028
0.747HisArg: 0.747 ± 0.038
1.478HisSer: 1.478 ± 0.05
1.003HisThr: 1.003 ± 0.036
0.678HisVal: 0.678 ± 0.035
0.157HisTrp: 0.157 ± 0.017
0.842HisTyr: 0.842 ± 0.035
0.0HisXaa: 0.0 ± 0.0
Ile
7.177IleAla: 7.177 ± 0.127
1.465IleCys: 1.465 ± 0.056
5.752IleAsp: 5.752 ± 0.098
5.956IleGlu: 5.956 ± 0.121
4.828IlePhe: 4.828 ± 0.113
5.69IleGly: 5.69 ± 0.12
1.675IleHis: 1.675 ± 0.052
8.327IleIle: 8.327 ± 0.153
6.934IleLys: 6.934 ± 0.11
9.518IleLeu: 9.518 ± 0.15
2.135IleMet: 2.135 ± 0.057
5.684IleAsn: 5.684 ± 0.098
3.415IlePro: 3.415 ± 0.077
3.209IleGln: 3.209 ± 0.072
3.013IleArg: 3.013 ± 0.07
7.397IleSer: 7.397 ± 0.116
4.56IleThr: 4.56 ± 0.085
5.04IleVal: 5.04 ± 0.09
0.683IleTrp: 0.683 ± 0.031
3.735IleTyr: 3.735 ± 0.071
0.0IleXaa: 0.0 ± 0.0
Lys
4.841LysAla: 4.841 ± 0.103
0.568LysCys: 0.568 ± 0.032
5.521LysAsp: 5.521 ± 0.116
6.378LysGlu: 6.378 ± 0.135
2.53LysPhe: 2.53 ± 0.071
3.545LysGly: 3.545 ± 0.082
1.638LysHis: 1.638 ± 0.053
7.286LysIle: 7.286 ± 0.121
5.709LysLys: 5.709 ± 0.12
6.684LysLeu: 6.684 ± 0.098
1.798LysMet: 1.798 ± 0.053
5.689LysAsn: 5.689 ± 0.108
2.534LysPro: 2.534 ± 0.066
3.819LysGln: 3.819 ± 0.1
2.894LysArg: 2.894 ± 0.061
5.563LysSer: 5.563 ± 0.1
3.624LysThr: 3.624 ± 0.078
3.454LysVal: 3.454 ± 0.091
0.513LysTrp: 0.513 ± 0.028
2.952LysTyr: 2.952 ± 0.075
0.0LysXaa: 0.0 ± 0.0
Leu
6.587LeuAla: 6.587 ± 0.114
1.591LeuCys: 1.591 ± 0.056
6.271LeuAsp: 6.271 ± 0.115
6.794LeuGlu: 6.794 ± 0.114
4.806LeuPhe: 4.806 ± 0.117
6.208LeuGly: 6.208 ± 0.107
2.236LeuHis: 2.236 ± 0.056
7.268LeuIle: 7.268 ± 0.106
7.465LeuLys: 7.465 ± 0.138
9.468LeuLeu: 9.468 ± 0.176
1.972LeuMet: 1.972 ± 0.054
5.948LeuAsn: 5.948 ± 0.107
3.302LeuPro: 3.302 ± 0.072
4.85LeuGln: 4.85 ± 0.092
4.161LeuArg: 4.161 ± 0.111
8.229LeuSer: 8.229 ± 0.105
3.666LeuThr: 3.666 ± 0.069
4.61LeuVal: 4.61 ± 0.087
0.703LeuTrp: 0.703 ± 0.038
3.858LeuTyr: 3.858 ± 0.092
0.0LeuXaa: 0.0 ± 0.0
Met
1.435MetAla: 1.435 ± 0.049
0.234MetCys: 0.234 ± 0.02
1.14MetAsp: 1.14 ± 0.041
1.135MetGlu: 1.135 ± 0.036
1.017MetPhe: 1.017 ± 0.043
1.459MetGly: 1.459 ± 0.059
0.341MetHis: 0.341 ± 0.021
2.037MetIle: 2.037 ± 0.053
1.611MetLys: 1.611 ± 0.046
2.927MetLeu: 2.927 ± 0.08
0.507MetMet: 0.507 ± 0.029
1.294MetAsn: 1.294 ± 0.044
1.245MetPro: 1.245 ± 0.045
1.793MetGln: 1.793 ± 0.053
1.212MetArg: 1.212 ± 0.042
1.693MetSer: 1.693 ± 0.054
0.801MetThr: 0.801 ± 0.033
1.046MetVal: 1.046 ± 0.038
0.143MetTrp: 0.143 ± 0.015
0.625MetTyr: 0.625 ± 0.036
0.0MetXaa: 0.0 ± 0.0
Asn
4.453AsnAla: 4.453 ± 0.092
0.487AsnCys: 0.487 ± 0.029
3.31AsnAsp: 3.31 ± 0.082
3.936AsnGlu: 3.936 ± 0.093
2.853AsnPhe: 2.853 ± 0.071
3.436AsnGly: 3.436 ± 0.091
1.239AsnHis: 1.239 ± 0.041
7.281AsnIle: 7.281 ± 0.139
5.222AsnLys: 5.222 ± 0.113
6.807AsnLeu: 6.807 ± 0.124
1.591AsnMet: 1.591 ± 0.046
4.66AsnAsn: 4.66 ± 0.12
2.688AsnPro: 2.688 ± 0.069
2.267AsnGln: 2.267 ± 0.078
2.018AsnArg: 2.018 ± 0.054
4.373AsnSer: 4.373 ± 0.101
3.344AsnThr: 3.344 ± 0.081
3.299AsnVal: 3.299 ± 0.079
0.341AsnTrp: 0.341 ± 0.023
2.248AsnTyr: 2.248 ± 0.065
0.0AsnXaa: 0.0 ± 0.0
Pro
1.391ProAla: 1.391 ± 0.05
0.314ProCys: 0.314 ± 0.021
1.261ProAsp: 1.261 ± 0.049
1.248ProGlu: 1.248 ± 0.048
1.622ProPhe: 1.622 ± 0.049
1.025ProGly: 1.025 ± 0.048
0.757ProHis: 0.757 ± 0.032
2.884ProIle: 2.884 ± 0.079
2.567ProLys: 2.567 ± 0.072
3.384ProLeu: 3.384 ± 0.079
0.616ProMet: 0.616 ± 0.03
2.399ProAsn: 2.399 ± 0.062
0.871ProPro: 0.871 ± 0.044
1.214ProGln: 1.214 ± 0.037
0.908ProArg: 0.908 ± 0.039
2.233ProSer: 2.233 ± 0.062
1.611ProThr: 1.611 ± 0.056
1.27ProVal: 1.27 ± 0.051
0.192ProTrp: 0.192 ± 0.019
1.457ProTyr: 1.457 ± 0.053
0.0ProXaa: 0.0 ± 0.0
Gln
2.479GlnAla: 2.479 ± 0.073
0.369GlnCys: 0.369 ± 0.026
2.69GlnAsp: 2.69 ± 0.075
2.847GlnGlu: 2.847 ± 0.08
1.137GlnPhe: 1.137 ± 0.044
1.879GlnGly: 1.879 ± 0.056
0.641GlnHis: 0.641 ± 0.031
3.36GlnIle: 3.36 ± 0.081
3.373GlnLys: 3.373 ± 0.078
2.272GlnLeu: 2.272 ± 0.091
0.802GlnMet: 0.802 ± 0.037
3.194GlnAsn: 3.194 ± 0.086
0.733GlnPro: 0.733 ± 0.032
1.418GlnGln: 1.418 ± 0.053
1.598GlnArg: 1.598 ± 0.055
3.239GlnSer: 3.239 ± 0.081
1.958GlnThr: 1.958 ± 0.059
1.814GlnVal: 1.814 ± 0.051
0.243GlnTrp: 0.243 ± 0.018
1.448GlnTyr: 1.448 ± 0.053
0.0GlnXaa: 0.0 ± 0.0
Arg
2.368ArgAla: 2.368 ± 0.06
0.319ArgCys: 0.319 ± 0.024
2.482ArgAsp: 2.482 ± 0.062
2.512ArgGlu: 2.512 ± 0.068
2.274ArgPhe: 2.274 ± 0.063
2.142ArgGly: 2.142 ± 0.055
0.744ArgHis: 0.744 ± 0.038
3.836ArgIle: 3.836 ± 0.086
2.391ArgLys: 2.391 ± 0.057
4.112ArgLeu: 4.112 ± 0.088
0.78ArgMet: 0.78 ± 0.035
2.258ArgAsn: 2.258 ± 0.062
0.78ArgPro: 0.78 ± 0.037
1.247ArgGln: 1.247 ± 0.046
1.264ArgArg: 1.264 ± 0.046
1.994ArgSer: 1.994 ± 0.057
1.364ArgThr: 1.364 ± 0.045
2.154ArgVal: 2.154 ± 0.061
0.199ArgTrp: 0.199 ± 0.019
1.707ArgTyr: 1.707 ± 0.053
0.0ArgXaa: 0.0 ± 0.0
Ser
3.995SerAla: 3.995 ± 0.088
0.873SerCys: 0.873 ± 0.045
3.425SerAsp: 3.425 ± 0.085
3.296SerGlu: 3.296 ± 0.068
4.379SerPhe: 4.379 ± 0.085
4.28SerGly: 4.28 ± 0.099
1.591SerHis: 1.591 ± 0.048
7.766SerIle: 7.766 ± 0.128
6.888SerLys: 6.888 ± 0.122
8.779SerLeu: 8.779 ± 0.142
2.11SerMet: 2.11 ± 0.061
5.982SerAsn: 5.982 ± 0.134
2.14SerPro: 2.14 ± 0.062
2.917SerGln: 2.917 ± 0.081
2.508SerArg: 2.508 ± 0.06
5.783SerSer: 5.783 ± 0.125
3.153SerThr: 3.153 ± 0.068
3.58SerVal: 3.58 ± 0.082
0.49SerTrp: 0.49 ± 0.024
3.401SerTyr: 3.401 ± 0.093
0.0SerXaa: 0.0 ± 0.0
Thr
2.079ThrAla: 2.079 ± 0.056
0.469ThrCys: 0.469 ± 0.032
1.799ThrAsp: 1.799 ± 0.06
1.688ThrGlu: 1.688 ± 0.053
2.066ThrPhe: 2.066 ± 0.064
2.292ThrGly: 2.292 ± 0.083
1.044ThrHis: 1.044 ± 0.042
3.814ThrIle: 3.814 ± 0.078
4.005ThrLys: 4.005 ± 0.089
4.915ThrLeu: 4.915 ± 0.107
1.041ThrMet: 1.041 ± 0.038
3.103ThrAsn: 3.103 ± 0.083
1.856ThrPro: 1.856 ± 0.05
2.454ThrGln: 2.454 ± 0.073
1.768ThrArg: 1.768 ± 0.053
3.522ThrSer: 3.522 ± 0.104
2.374ThrThr: 2.374 ± 0.07
1.408ThrVal: 1.408 ± 0.053
0.347ThrTrp: 0.347 ± 0.027
1.641ThrTyr: 1.641 ± 0.046
0.0ThrXaa: 0.0 ± 0.0
Val
3.66ValAla: 3.66 ± 0.093
0.871ValCys: 0.871 ± 0.04
2.815ValAsp: 2.815 ± 0.064
2.731ValGlu: 2.731 ± 0.072
2.895ValPhe: 2.895 ± 0.064
3.561ValGly: 3.561 ± 0.083
0.776ValHis: 0.776 ± 0.041
4.717ValIle: 4.717 ± 0.09
3.205ValLys: 3.205 ± 0.078
5.326ValLeu: 5.326 ± 0.097
1.225ValMet: 1.225 ± 0.053
2.683ValAsn: 2.683 ± 0.069
1.473ValPro: 1.473 ± 0.051
1.489ValGln: 1.489 ± 0.055
1.997ValArg: 1.997 ± 0.066
3.789ValSer: 3.789 ± 0.084
1.901ValThr: 1.901 ± 0.058
3.468ValVal: 3.468 ± 0.089
0.452ValTrp: 0.452 ± 0.029
1.754ValTyr: 1.754 ± 0.055
0.0ValXaa: 0.0 ± 0.0
Trp
0.408TrpAla: 0.408 ± 0.026
0.104TrpCys: 0.104 ± 0.013
0.396TrpAsp: 0.396 ± 0.024
0.391TrpGlu: 0.391 ± 0.027
0.309TrpPhe: 0.309 ± 0.025
0.526TrpGly: 0.526 ± 0.031
0.268TrpHis: 0.268 ± 0.022
0.647TrpIle: 0.647 ± 0.035
0.356TrpLys: 0.356 ± 0.024
0.925TrpLeu: 0.925 ± 0.041
0.129TrpMet: 0.129 ± 0.014
0.499TrpAsn: 0.499 ± 0.03
0.068TrpPro: 0.068 ± 0.009
0.342TrpGln: 0.342 ± 0.027
0.316TrpArg: 0.316 ± 0.02
0.411TrpSer: 0.411 ± 0.023
0.259TrpThr: 0.259 ± 0.02
0.404TrpVal: 0.404 ± 0.026
0.111TrpTrp: 0.111 ± 0.013
0.258TrpTyr: 0.258 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.641TyrAla: 2.641 ± 0.065
0.561TyrCys: 0.561 ± 0.027
2.281TyrAsp: 2.281 ± 0.067
2.435TyrGlu: 2.435 ± 0.057
2.219TyrPhe: 2.219 ± 0.071
2.555TyrGly: 2.555 ± 0.069
0.796TyrHis: 0.796 ± 0.036
3.759TyrIle: 3.759 ± 0.092
3.34TyrLys: 3.34 ± 0.083
3.61TyrLeu: 3.61 ± 0.082
0.901TyrMet: 0.901 ± 0.038
2.638TyrAsn: 2.638 ± 0.066
1.44TyrPro: 1.44 ± 0.05
1.457TyrGln: 1.457 ± 0.05
1.724TyrArg: 1.724 ± 0.067
2.789TyrSer: 2.789 ± 0.064
1.886TyrThr: 1.886 ± 0.058
1.773TyrVal: 1.773 ± 0.047
0.317TyrTrp: 0.317 ± 0.022
1.843TyrTyr: 1.843 ± 0.067
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1960 proteins (636876 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
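One way protein-level bootstrapping could look is sketched below: whole proteins are resampled with replacement 100 times, the frequency of one dipeptide is recomputed for each resample, and the spread of those estimates is reported. Taking the standard deviation as the error, the function names, the fixed seed, and the toy sequences are assumptions for illustration; the page only states that bootstrapping (×100) at the protein level was used.

```python
import random
import statistics

def pair_permille(sequences, pair):
    """Frequency of one dipeptide in per mille over the given sequences."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy proteome (one-letter codes), not real data.
toy = ["MKLLA", "AKLL", "GGKLG", "MALLK"]
err = bootstrap_error(toy, "KL")
print(f"KL: {pair_permille(toy, 'KL'):.1f} ± {err:.1f}")
```

Resampling at the protein level keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every resample.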

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski