Amino acid dipepetide frequency for Hydrogenophaga sp. H7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.9AlaAla: 16.9 ± 0.178
1.281AlaCys: 1.281 ± 0.032
6.467AlaAsp: 6.467 ± 0.061
6.535AlaGlu: 6.535 ± 0.076
4.094AlaPhe: 4.094 ± 0.061
10.59AlaGly: 10.59 ± 0.113
2.86AlaHis: 2.86 ± 0.05
4.862AlaIle: 4.862 ± 0.054
3.671AlaLys: 3.671 ± 0.064
15.225AlaLeu: 15.225 ± 0.136
3.654AlaMet: 3.654 ± 0.055
2.573AlaAsn: 2.573 ± 0.047
6.305AlaPro: 6.305 ± 0.087
5.933AlaGln: 5.933 ± 0.075
8.67AlaArg: 8.67 ± 0.098
6.43AlaSer: 6.43 ± 0.075
5.945AlaThr: 5.945 ± 0.063
9.075AlaVal: 9.075 ± 0.093
2.058AlaTrp: 2.058 ± 0.05
2.395AlaTyr: 2.395 ± 0.045
0.0AlaXaa: 0.0 ± 0.0
Cys
1.165CysAla: 1.165 ± 0.031
0.11CysCys: 0.11 ± 0.009
0.539CysAsp: 0.539 ± 0.019
0.517CysGlu: 0.517 ± 0.021
0.321CysPhe: 0.321 ± 0.015
1.001CysGly: 1.001 ± 0.029
0.287CysHis: 0.287 ± 0.016
0.399CysIle: 0.399 ± 0.016
0.238CysLys: 0.238 ± 0.014
0.873CysLeu: 0.873 ± 0.026
0.203CysMet: 0.203 ± 0.013
0.215CysAsn: 0.215 ± 0.013
0.54CysPro: 0.54 ± 0.024
0.28CysGln: 0.28 ± 0.014
0.553CysArg: 0.553 ± 0.019
0.512CysSer: 0.512 ± 0.021
0.499CysThr: 0.499 ± 0.019
0.701CysVal: 0.701 ± 0.026
0.128CysTrp: 0.128 ± 0.011
0.209CysTyr: 0.209 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
6.814AspAla: 6.814 ± 0.087
0.473AspCys: 0.473 ± 0.02
2.764AspAsp: 2.764 ± 0.048
3.313AspGlu: 3.313 ± 0.06
1.947AspPhe: 1.947 ± 0.038
4.715AspGly: 4.715 ± 0.067
1.279AspHis: 1.279 ± 0.031
2.336AspIle: 2.336 ± 0.048
1.709AspLys: 1.709 ± 0.038
5.56AspLeu: 5.56 ± 0.062
1.247AspMet: 1.247 ± 0.031
1.198AspAsn: 1.198 ± 0.032
3.143AspPro: 3.143 ± 0.051
1.789AspGln: 1.789 ± 0.038
3.405AspArg: 3.405 ± 0.052
2.126AspSer: 2.126 ± 0.038
2.63AspThr: 2.63 ± 0.051
3.858AspVal: 3.858 ± 0.061
1.047AspTrp: 1.047 ± 0.03
1.22AspTyr: 1.22 ± 0.03
0.0AspXaa: 0.0 ± 0.0
Glu
6.976GluAla: 6.976 ± 0.083
0.381GluCys: 0.381 ± 0.017
2.217GluAsp: 2.217 ± 0.04
2.434GluGlu: 2.434 ± 0.052
1.715GluPhe: 1.715 ± 0.035
3.951GluGly: 3.951 ± 0.06
1.457GluHis: 1.457 ± 0.037
2.426GluIle: 2.426 ± 0.049
1.988GluLys: 1.988 ± 0.046
5.93GluLeu: 5.93 ± 0.082
1.239GluMet: 1.239 ± 0.032
1.224GluAsn: 1.224 ± 0.03
2.679GluPro: 2.679 ± 0.048
2.744GluGln: 2.744 ± 0.054
4.655GluArg: 4.655 ± 0.07
2.417GluSer: 2.417 ± 0.042
2.51GluThr: 2.51 ± 0.046
4.226GluVal: 4.226 ± 0.06
0.746GluTrp: 0.746 ± 0.022
0.995GluTyr: 0.995 ± 0.031
0.0GluXaa: 0.0 ± 0.0
Phe
4.097PheAla: 4.097 ± 0.063
0.414PheCys: 0.414 ± 0.019
2.482PheAsp: 2.482 ± 0.044
2.115PheGlu: 2.115 ± 0.037
1.395PhePhe: 1.395 ± 0.035
3.249PheGly: 3.249 ± 0.059
0.791PheHis: 0.791 ± 0.026
1.403PheIle: 1.403 ± 0.039
1.184PheLys: 1.184 ± 0.028
3.074PheLeu: 3.074 ± 0.05
0.872PheMet: 0.872 ± 0.025
1.13PheAsn: 1.13 ± 0.034
1.528PhePro: 1.528 ± 0.035
1.147PheGln: 1.147 ± 0.028
1.943PheArg: 1.943 ± 0.038
2.105PheSer: 2.105 ± 0.044
1.895PheThr: 1.895 ± 0.039
2.852PheVal: 2.852 ± 0.05
0.551PheTrp: 0.551 ± 0.024
0.886PheTyr: 0.886 ± 0.027
0.0PheXaa: 0.0 ± 0.0
Gly
9.125GlyAla: 9.125 ± 0.093
0.919GlyCys: 0.919 ± 0.031
4.043GlyAsp: 4.043 ± 0.056
4.517GlyGlu: 4.517 ± 0.06
3.356GlyPhe: 3.356 ± 0.056
6.96GlyGly: 6.96 ± 0.101
2.197GlyHis: 2.197 ± 0.04
3.735GlyIle: 3.735 ± 0.057
3.151GlyLys: 3.151 ± 0.062
9.804GlyLeu: 9.804 ± 0.1
2.447GlyMet: 2.447 ± 0.044
2.061GlyAsn: 2.061 ± 0.048
3.362GlyPro: 3.362 ± 0.053
3.813GlyGln: 3.813 ± 0.057
5.459GlyArg: 5.459 ± 0.065
4.397GlySer: 4.397 ± 0.057
4.409GlyThr: 4.409 ± 0.079
6.808GlyVal: 6.808 ± 0.07
1.647GlyTrp: 1.647 ± 0.043
2.167GlyTyr: 2.167 ± 0.041
0.0GlyXaa: 0.0 ± 0.0
His
2.815HisAla: 2.815 ± 0.05
0.294HisCys: 0.294 ± 0.016
1.266HisAsp: 1.266 ± 0.035
1.218HisGlu: 1.218 ± 0.03
0.903HisPhe: 0.903 ± 0.027
2.355HisGly: 2.355 ± 0.044
0.783HisHis: 0.783 ± 0.026
1.064HisIle: 1.064 ± 0.029
0.588HisLys: 0.588 ± 0.02
2.494HisLeu: 2.494 ± 0.045
0.55HisMet: 0.55 ± 0.021
0.563HisAsn: 0.563 ± 0.025
1.7HisPro: 1.7 ± 0.036
0.847HisGln: 0.847 ± 0.028
1.702HisArg: 1.702 ± 0.041
1.109HisSer: 1.109 ± 0.033
1.22HisThr: 1.22 ± 0.031
1.582HisVal: 1.582 ± 0.037
0.502HisTrp: 0.502 ± 0.018
0.588HisTyr: 0.588 ± 0.02
0.0HisXaa: 0.0 ± 0.0
Ile
5.45IleAla: 5.45 ± 0.071
0.394IleCys: 0.394 ± 0.016
2.861IleAsp: 2.861 ± 0.051
2.995IleGlu: 2.995 ± 0.052
1.158IlePhe: 1.158 ± 0.03
4.126IleGly: 4.126 ± 0.064
0.886IleHis: 0.886 ± 0.022
1.303IleIle: 1.303 ± 0.036
1.448IleLys: 1.448 ± 0.038
2.997IleLeu: 2.997 ± 0.054
0.718IleMet: 0.718 ± 0.026
1.295IleAsn: 1.295 ± 0.033
2.006IlePro: 2.006 ± 0.045
1.323IleGln: 1.323 ± 0.033
2.514IleArg: 2.514 ± 0.046
2.139IleSer: 2.139 ± 0.039
2.352IleThr: 2.352 ± 0.048
3.335IleVal: 3.335 ± 0.059
0.502IleTrp: 0.502 ± 0.021
0.899IleTyr: 0.899 ± 0.026
0.0IleXaa: 0.0 ± 0.0
Lys
4.46LysAla: 4.46 ± 0.077
0.171LysCys: 0.171 ± 0.012
1.817LysAsp: 1.817 ± 0.044
1.516LysGlu: 1.516 ± 0.038
0.911LysPhe: 0.911 ± 0.027
2.636LysGly: 2.636 ± 0.05
0.667LysHis: 0.667 ± 0.026
1.347LysIle: 1.347 ± 0.036
1.42LysLys: 1.42 ± 0.047
3.367LysLeu: 3.367 ± 0.059
0.805LysMet: 0.805 ± 0.027
0.911LysAsn: 0.911 ± 0.032
2.021LysPro: 2.021 ± 0.043
1.192LysGln: 1.192 ± 0.036
2.026LysArg: 2.026 ± 0.043
1.602LysSer: 1.602 ± 0.034
1.999LysThr: 1.999 ± 0.042
2.623LysVal: 2.623 ± 0.056
0.341LysTrp: 0.341 ± 0.015
0.644LysTyr: 0.644 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
14.753LeuAla: 14.753 ± 0.136
1.083LeuCys: 1.083 ± 0.028
5.932LeuAsp: 5.932 ± 0.065
5.062LeuGlu: 5.062 ± 0.07
3.603LeuPhe: 3.603 ± 0.061
9.078LeuGly: 9.078 ± 0.088
2.484LeuHis: 2.484 ± 0.044
4.266LeuIle: 4.266 ± 0.068
3.746LeuLys: 3.746 ± 0.056
11.952LeuLeu: 11.952 ± 0.14
2.821LeuMet: 2.821 ± 0.049
2.72LeuAsn: 2.72 ± 0.045
6.441LeuPro: 6.441 ± 0.083
4.562LeuGln: 4.562 ± 0.071
7.868LeuArg: 7.868 ± 0.089
6.333LeuSer: 6.333 ± 0.066
5.462LeuThr: 5.462 ± 0.066
8.363LeuVal: 8.363 ± 0.085
1.569LeuTrp: 1.569 ± 0.04
1.921LeuTyr: 1.921 ± 0.043
0.0LeuXaa: 0.0 ± 0.0
Met
3.396MetAla: 3.396 ± 0.045
0.18MetCys: 0.18 ± 0.012
1.342MetAsp: 1.342 ± 0.034
1.139MetGlu: 1.139 ± 0.032
0.704MetPhe: 0.704 ± 0.022
2.136MetGly: 2.136 ± 0.041
0.553MetHis: 0.553 ± 0.02
0.915MetIle: 0.915 ± 0.029
1.116MetLys: 1.116 ± 0.027
2.633MetLeu: 2.633 ± 0.045
0.587MetMet: 0.587 ± 0.022
0.953MetAsn: 0.953 ± 0.028
1.468MetPro: 1.468 ± 0.039
1.049MetGln: 1.049 ± 0.027
1.674MetArg: 1.674 ± 0.032
1.695MetSer: 1.695 ± 0.038
1.513MetThr: 1.513 ± 0.036
1.972MetVal: 1.972 ± 0.04
0.243MetTrp: 0.243 ± 0.015
0.365MetTyr: 0.365 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
3.133AsnAla: 3.133 ± 0.05
0.253AsnCys: 0.253 ± 0.013
1.27AsnAsp: 1.27 ± 0.03
1.231AsnGlu: 1.231 ± 0.031
0.889AsnPhe: 0.889 ± 0.028
2.139AsnGly: 2.139 ± 0.043
0.567AsnHis: 0.567 ± 0.019
1.124AsnIle: 1.124 ± 0.031
0.773AsnLys: 0.773 ± 0.027
2.594AsnLeu: 2.594 ± 0.048
0.609AsnMet: 0.609 ± 0.021
0.679AsnAsn: 0.679 ± 0.026
1.864AsnPro: 1.864 ± 0.037
0.907AsnGln: 0.907 ± 0.027
1.617AsnArg: 1.617 ± 0.042
1.088AsnSer: 1.088 ± 0.028
1.374AsnThr: 1.374 ± 0.036
1.79AsnVal: 1.79 ± 0.039
0.394AsnTrp: 0.394 ± 0.016
0.564AsnTyr: 0.564 ± 0.022
0.0AsnXaa: 0.0 ± 0.0
Pro
7.306ProAla: 7.306 ± 0.093
0.386ProCys: 0.386 ± 0.017
3.333ProAsp: 3.333 ± 0.061
3.573ProGlu: 3.573 ± 0.053
1.868ProPhe: 1.868 ± 0.042
4.988ProGly: 4.988 ± 0.063
1.232ProHis: 1.232 ± 0.033
1.701ProIle: 1.701 ± 0.039
1.614ProLys: 1.614 ± 0.041
5.535ProLeu: 5.535 ± 0.069
1.408ProMet: 1.408 ± 0.03
1.225ProAsn: 1.225 ± 0.03
2.779ProPro: 2.779 ± 0.067
2.121ProGln: 2.121 ± 0.041
2.974ProArg: 2.974 ± 0.045
2.803ProSer: 2.803 ± 0.056
2.681ProThr: 2.681 ± 0.053
4.749ProVal: 4.749 ± 0.058
0.905ProTrp: 0.905 ± 0.03
1.036ProTyr: 1.036 ± 0.025
0.0ProXaa: 0.0 ± 0.0
Gln
5.762GlnAla: 5.762 ± 0.083
0.287GlnCys: 0.287 ± 0.013
1.757GlnAsp: 1.757 ± 0.038
1.563GlnGlu: 1.563 ± 0.032
1.167GlnPhe: 1.167 ± 0.031
3.287GlnGly: 3.287 ± 0.05
1.039GlnHis: 1.039 ± 0.032
1.704GlnIle: 1.704 ± 0.038
1.26GlnLys: 1.26 ± 0.033
4.287GlnLeu: 4.287 ± 0.061
1.062GlnMet: 1.062 ± 0.03
0.876GlnAsn: 0.876 ± 0.025
2.372GlnPro: 2.372 ± 0.041
1.988GlnGln: 1.988 ± 0.042
3.835GlnArg: 3.835 ± 0.062
1.999GlnSer: 1.999 ± 0.036
2.154GlnThr: 2.154 ± 0.039
3.207GlnVal: 3.207 ± 0.054
0.718GlnTrp: 0.718 ± 0.024
0.769GlnTyr: 0.769 ± 0.028
0.0GlnXaa: 0.0 ± 0.0
Arg
7.433ArgAla: 7.433 ± 0.093
0.62ArgCys: 0.62 ± 0.02
3.463ArgAsp: 3.463 ± 0.052
4.409ArgGlu: 4.409 ± 0.065
2.87ArgPhe: 2.87 ± 0.046
4.48ArgGly: 4.48 ± 0.061
1.994ArgHis: 1.994 ± 0.04
3.281ArgIle: 3.281 ± 0.052
2.058ArgLys: 2.058 ± 0.037
8.254ArgLeu: 8.254 ± 0.101
1.882ArgMet: 1.882 ± 0.036
1.604ArgAsn: 1.604 ± 0.034
3.42ArgPro: 3.42 ± 0.055
3.066ArgGln: 3.066 ± 0.048
5.023ArgArg: 5.023 ± 0.073
3.593ArgSer: 3.593 ± 0.052
3.123ArgThr: 3.123 ± 0.049
5.165ArgVal: 5.165 ± 0.064
1.318ArgTrp: 1.318 ± 0.036
1.712ArgTyr: 1.712 ± 0.036
0.0ArgXaa: 0.0 ± 0.0
Ser
6.345SerAla: 6.345 ± 0.075
0.438SerCys: 0.438 ± 0.019
2.598SerAsp: 2.598 ± 0.05
2.479SerGlu: 2.479 ± 0.045
2.046SerPhe: 2.046 ± 0.038
5.089SerGly: 5.089 ± 0.062
1.259SerHis: 1.259 ± 0.033
2.073SerIle: 2.073 ± 0.045
1.475SerLys: 1.475 ± 0.035
5.731SerLeu: 5.731 ± 0.07
1.374SerMet: 1.374 ± 0.03
1.228SerAsn: 1.228 ± 0.03
2.906SerPro: 2.906 ± 0.051
1.832SerGln: 1.832 ± 0.035
3.403SerArg: 3.403 ± 0.056
2.786SerSer: 2.786 ± 0.057
2.79SerThr: 2.79 ± 0.047
4.073SerVal: 4.073 ± 0.056
0.72SerTrp: 0.72 ± 0.025
1.13SerTyr: 1.13 ± 0.033
0.0SerXaa: 0.0 ± 0.0
Thr
6.015ThrAla: 6.015 ± 0.075
0.417ThrCys: 0.417 ± 0.019
2.504ThrAsp: 2.504 ± 0.049
2.535ThrGlu: 2.535 ± 0.049
1.717ThrPhe: 1.717 ± 0.037
4.841ThrGly: 4.841 ± 0.061
1.229ThrHis: 1.229 ± 0.029
1.975ThrIle: 1.975 ± 0.042
1.297ThrLys: 1.297 ± 0.036
6.299ThrLeu: 6.299 ± 0.075
1.164ThrMet: 1.164 ± 0.031
1.185ThrAsn: 1.185 ± 0.036
3.611ThrPro: 3.611 ± 0.056
1.867ThrGln: 1.867 ± 0.039
3.264ThrArg: 3.264 ± 0.053
2.565ThrSer: 2.565 ± 0.047
2.833ThrThr: 2.833 ± 0.046
4.35ThrVal: 4.35 ± 0.064
0.712ThrTrp: 0.712 ± 0.027
1.02ThrTyr: 1.02 ± 0.028
0.0ThrXaa: 0.0 ± 0.0
Val
9.659ValAla: 9.659 ± 0.089
0.766ValCys: 0.766 ± 0.023
4.088ValAsp: 4.088 ± 0.052
4.057ValGlu: 4.057 ± 0.061
2.974ValPhe: 2.974 ± 0.05
5.785ValGly: 5.785 ± 0.072
1.747ValHis: 1.747 ± 0.038
3.369ValIle: 3.369 ± 0.053
2.575ValLys: 2.575 ± 0.056
8.799ValLeu: 8.799 ± 0.097
2.065ValMet: 2.065 ± 0.042
2.182ValAsn: 2.182 ± 0.042
4.18ValPro: 4.18 ± 0.053
3.008ValGln: 3.008 ± 0.05
5.268ValArg: 5.268 ± 0.066
4.153ValSer: 4.153 ± 0.054
3.995ValThr: 3.995 ± 0.057
6.76ValVal: 6.76 ± 0.085
1.206ValTrp: 1.206 ± 0.032
1.577ValTyr: 1.577 ± 0.04
0.0ValXaa: 0.0 ± 0.0
Trp
1.576TrpAla: 1.576 ± 0.04
0.186TrpCys: 0.186 ± 0.012
0.668TrpAsp: 0.668 ± 0.021
0.564TrpGlu: 0.564 ± 0.02
0.617TrpPhe: 0.617 ± 0.024
1.081TrpGly: 1.081 ± 0.029
0.388TrpHis: 0.388 ± 0.018
0.659TrpIle: 0.659 ± 0.021
0.516TrpLys: 0.516 ± 0.02
2.312TrpLeu: 2.312 ± 0.05
0.491TrpMet: 0.491 ± 0.021
0.454TrpAsn: 0.454 ± 0.02
0.834TrpPro: 0.834 ± 0.027
0.793TrpGln: 0.793 ± 0.024
1.307TrpArg: 1.307 ± 0.032
0.862TrpSer: 0.862 ± 0.027
0.85TrpThr: 0.85 ± 0.026
1.241TrpVal: 1.241 ± 0.036
0.348TrpTrp: 0.348 ± 0.017
0.281TrpTyr: 0.281 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.343TyrAla: 2.343 ± 0.04
0.238TyrCys: 0.238 ± 0.015
1.164TyrAsp: 1.164 ± 0.032
1.137TyrGlu: 1.137 ± 0.03
0.855TyrPhe: 0.855 ± 0.027
1.886TyrGly: 1.886 ± 0.044
0.445TyrHis: 0.445 ± 0.019
0.758TyrIle: 0.758 ± 0.024
0.649TyrLys: 0.649 ± 0.023
2.298TyrLeu: 2.298 ± 0.046
0.434TyrMet: 0.434 ± 0.02
0.585TyrAsn: 0.585 ± 0.02
1.059TyrPro: 1.059 ± 0.027
0.832TyrGln: 0.832 ± 0.025
1.611TyrArg: 1.611 ± 0.036
1.059TyrSer: 1.059 ± 0.028
1.167TyrThr: 1.167 ± 0.033
1.483TyrVal: 1.483 ± 0.032
0.374TyrTrp: 0.374 ± 0.016
0.502TyrTyr: 0.502 ± 0.02
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4173 proteins (1351864 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski