Amino acid dipepetide frequency for Aquabacterium commune

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.766AlaAla: 16.766 ± 0.167
1.305AlaCys: 1.305 ± 0.036
6.772AlaAsp: 6.772 ± 0.072
6.718AlaGlu: 6.718 ± 0.082
3.909AlaPhe: 3.909 ± 0.056
9.67AlaGly: 9.67 ± 0.113
3.204AlaHis: 3.204 ± 0.063
4.872AlaIle: 4.872 ± 0.066
4.037AlaLys: 4.037 ± 0.082
14.994AlaLeu: 14.994 ± 0.142
3.54AlaMet: 3.54 ± 0.06
2.911AlaAsn: 2.911 ± 0.054
6.366AlaPro: 6.366 ± 0.087
6.803AlaGln: 6.803 ± 0.097
8.717AlaArg: 8.717 ± 0.1
7.557AlaSer: 7.557 ± 0.109
6.446AlaThr: 6.446 ± 0.084
8.894AlaVal: 8.894 ± 0.09
2.228AlaTrp: 2.228 ± 0.048
2.312AlaTyr: 2.312 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
1.177CysAla: 1.177 ± 0.033
0.134CysCys: 0.134 ± 0.011
0.52CysAsp: 0.52 ± 0.019
0.501CysGlu: 0.501 ± 0.02
0.325CysPhe: 0.325 ± 0.016
0.925CysGly: 0.925 ± 0.031
0.286CysHis: 0.286 ± 0.016
0.328CysIle: 0.328 ± 0.017
0.235CysLys: 0.235 ± 0.013
0.932CysLeu: 0.932 ± 0.028
0.204CysMet: 0.204 ± 0.014
0.214CysAsn: 0.214 ± 0.013
0.498CysPro: 0.498 ± 0.02
0.348CysGln: 0.348 ± 0.016
0.546CysArg: 0.546 ± 0.025
0.465CysSer: 0.465 ± 0.024
0.507CysThr: 0.507 ± 0.02
0.687CysVal: 0.687 ± 0.024
0.125CysTrp: 0.125 ± 0.01
0.18CysTyr: 0.18 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
7.86AspAla: 7.86 ± 0.09
0.468AspCys: 0.468 ± 0.019
2.789AspAsp: 2.789 ± 0.051
3.318AspGlu: 3.318 ± 0.052
2.018AspPhe: 2.018 ± 0.04
4.551AspGly: 4.551 ± 0.064
1.294AspHis: 1.294 ± 0.032
2.598AspIle: 2.598 ± 0.052
1.833AspLys: 1.833 ± 0.048
5.272AspLeu: 5.272 ± 0.074
1.154AspMet: 1.154 ± 0.029
1.234AspAsn: 1.234 ± 0.034
2.814AspPro: 2.814 ± 0.047
1.868AspGln: 1.868 ± 0.038
3.383AspArg: 3.383 ± 0.058
2.217AspSer: 2.217 ± 0.037
2.981AspThr: 2.981 ± 0.054
4.333AspVal: 4.333 ± 0.06
0.954AspTrp: 0.954 ± 0.029
1.097AspTyr: 1.097 ± 0.032
0.0AspXaa: 0.0 ± 0.0
Glu
7.48GluAla: 7.48 ± 0.082
0.337GluCys: 0.337 ± 0.016
2.31GluAsp: 2.31 ± 0.049
2.207GluGlu: 2.207 ± 0.053
1.572GluPhe: 1.572 ± 0.04
3.935GluGly: 3.935 ± 0.061
1.487GluHis: 1.487 ± 0.036
2.213GluIle: 2.213 ± 0.046
1.596GluLys: 1.596 ± 0.037
5.757GluLeu: 5.757 ± 0.076
1.169GluMet: 1.169 ± 0.031
1.004GluAsn: 1.004 ± 0.03
2.407GluPro: 2.407 ± 0.048
2.562GluGln: 2.562 ± 0.051
4.5GluArg: 4.5 ± 0.064
2.249GluSer: 2.249 ± 0.044
2.238GluThr: 2.238 ± 0.047
4.321GluVal: 4.321 ± 0.067
0.667GluTrp: 0.667 ± 0.025
0.81GluTyr: 0.81 ± 0.024
0.0GluXaa: 0.0 ± 0.0
Phe
3.717PheAla: 3.717 ± 0.062
0.348PheCys: 0.348 ± 0.016
2.345PheAsp: 2.345 ± 0.044
1.977PheGlu: 1.977 ± 0.04
1.227PhePhe: 1.227 ± 0.038
3.008PheGly: 3.008 ± 0.058
0.737PheHis: 0.737 ± 0.025
1.372PheIle: 1.372 ± 0.036
1.218PheLys: 1.218 ± 0.032
2.756PheLeu: 2.756 ± 0.052
0.853PheMet: 0.853 ± 0.026
1.127PheAsn: 1.127 ± 0.03
1.34PhePro: 1.34 ± 0.034
1.158PheGln: 1.158 ± 0.032
1.732PheArg: 1.732 ± 0.035
1.985PheSer: 1.985 ± 0.039
1.979PheThr: 1.979 ± 0.04
2.611PheVal: 2.611 ± 0.058
0.471PheTrp: 0.471 ± 0.02
0.781PheTyr: 0.781 ± 0.03
0.0PheXaa: 0.0 ± 0.0
Gly
8.829GlyAla: 8.829 ± 0.103
0.853GlyCys: 0.853 ± 0.029
4.031GlyAsp: 4.031 ± 0.064
4.279GlyGlu: 4.279 ± 0.057
2.991GlyPhe: 2.991 ± 0.052
6.598GlyGly: 6.598 ± 0.098
2.206GlyHis: 2.206 ± 0.048
3.441GlyIle: 3.441 ± 0.054
3.227GlyLys: 3.227 ± 0.067
9.321GlyLeu: 9.321 ± 0.1
2.253GlyMet: 2.253 ± 0.041
2.139GlyAsn: 2.139 ± 0.047
2.941GlyPro: 2.941 ± 0.054
4.051GlyGln: 4.051 ± 0.064
5.319GlyArg: 5.319 ± 0.072
4.435GlySer: 4.435 ± 0.08
4.357GlyThr: 4.357 ± 0.076
6.56GlyVal: 6.56 ± 0.071
1.507GlyTrp: 1.507 ± 0.035
1.886GlyTyr: 1.886 ± 0.042
0.0GlyXaa: 0.0 ± 0.0
His
3.272HisAla: 3.272 ± 0.059
0.263HisCys: 0.263 ± 0.015
1.312HisAsp: 1.312 ± 0.039
1.286HisGlu: 1.286 ± 0.037
0.932HisPhe: 0.932 ± 0.026
2.362HisGly: 2.362 ± 0.049
0.811HisHis: 0.811 ± 0.031
1.12HisIle: 1.12 ± 0.03
0.67HisLys: 0.67 ± 0.026
2.57HisLeu: 2.57 ± 0.045
0.535HisMet: 0.535 ± 0.023
0.528HisAsn: 0.528 ± 0.02
1.738HisPro: 1.738 ± 0.039
0.91HisGln: 0.91 ± 0.025
1.68HisArg: 1.68 ± 0.052
1.135HisSer: 1.135 ± 0.035
1.412HisThr: 1.412 ± 0.038
1.667HisVal: 1.667 ± 0.036
0.492HisTrp: 0.492 ± 0.02
0.541HisTyr: 0.541 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
5.094IleAla: 5.094 ± 0.063
0.336IleCys: 0.336 ± 0.018
3.129IleAsp: 3.129 ± 0.046
2.815IleGlu: 2.815 ± 0.046
1.095IlePhe: 1.095 ± 0.031
3.747IleGly: 3.747 ± 0.064
0.859IleHis: 0.859 ± 0.028
1.273IleIle: 1.273 ± 0.045
1.416IleLys: 1.416 ± 0.039
3.12IleLeu: 3.12 ± 0.05
0.69IleMet: 0.69 ± 0.022
1.313IleAsn: 1.313 ± 0.034
1.833IlePro: 1.833 ± 0.04
1.411IleGln: 1.411 ± 0.038
2.489IleArg: 2.489 ± 0.042
2.152IleSer: 2.152 ± 0.046
2.439IleThr: 2.439 ± 0.046
3.168IleVal: 3.168 ± 0.055
0.398IleTrp: 0.398 ± 0.019
0.851IleTyr: 0.851 ± 0.024
0.0IleXaa: 0.0 ± 0.0
Lys
4.376LysAla: 4.376 ± 0.084
0.183LysCys: 0.183 ± 0.013
1.65LysAsp: 1.65 ± 0.043
1.453LysGlu: 1.453 ± 0.043
0.875LysPhe: 0.875 ± 0.031
2.586LysGly: 2.586 ± 0.049
0.767LysHis: 0.767 ± 0.024
1.252LysIle: 1.252 ± 0.041
1.301LysLys: 1.301 ± 0.048
3.563LysLeu: 3.563 ± 0.062
0.764LysMet: 0.764 ± 0.027
0.83LysAsn: 0.83 ± 0.031
2.128LysPro: 2.128 ± 0.046
1.363LysGln: 1.363 ± 0.036
2.255LysArg: 2.255 ± 0.049
1.686LysSer: 1.686 ± 0.047
1.783LysThr: 1.783 ± 0.043
2.686LysVal: 2.686 ± 0.053
0.329LysTrp: 0.329 ± 0.015
0.59LysTyr: 0.59 ± 0.024
0.0LysXaa: 0.0 ± 0.0
Leu
14.203LeuAla: 14.203 ± 0.138
1.032LeuCys: 1.032 ± 0.031
6.238LeuAsp: 6.238 ± 0.089
4.546LeuGlu: 4.546 ± 0.071
3.128LeuPhe: 3.128 ± 0.062
8.872LeuGly: 8.872 ± 0.104
2.502LeuHis: 2.502 ± 0.051
4.128LeuIle: 4.128 ± 0.066
3.782LeuLys: 3.782 ± 0.06
10.933LeuLeu: 10.933 ± 0.141
2.661LeuMet: 2.661 ± 0.049
2.955LeuAsn: 2.955 ± 0.047
6.249LeuPro: 6.249 ± 0.087
4.86LeuGln: 4.86 ± 0.072
7.872LeuArg: 7.872 ± 0.104
7.289LeuSer: 7.289 ± 0.09
5.88LeuThr: 5.88 ± 0.068
7.615LeuVal: 7.615 ± 0.101
1.453LeuTrp: 1.453 ± 0.04
1.871LeuTyr: 1.871 ± 0.041
0.0LeuXaa: 0.0 ± 0.0
Met
3.333MetAla: 3.333 ± 0.059
0.168MetCys: 0.168 ± 0.013
1.201MetAsp: 1.201 ± 0.034
0.885MetGlu: 0.885 ± 0.03
0.625MetPhe: 0.625 ± 0.024
2.007MetGly: 2.007 ± 0.048
0.538MetHis: 0.538 ± 0.018
0.797MetIle: 0.797 ± 0.025
0.897MetLys: 0.897 ± 0.033
2.521MetLeu: 2.521 ± 0.048
0.568MetMet: 0.568 ± 0.021
0.77MetAsn: 0.77 ± 0.026
1.585MetPro: 1.585 ± 0.04
1.044MetGln: 1.044 ± 0.027
1.719MetArg: 1.719 ± 0.038
1.867MetSer: 1.867 ± 0.038
1.425MetThr: 1.425 ± 0.034
1.808MetVal: 1.808 ± 0.036
0.223MetTrp: 0.223 ± 0.014
0.317MetTyr: 0.317 ± 0.017
0.0MetXaa: 0.0 ± 0.0
Asn
3.147AsnAla: 3.147 ± 0.054
0.248AsnCys: 0.248 ± 0.013
1.308AsnAsp: 1.308 ± 0.034
1.23AsnGlu: 1.23 ± 0.029
0.886AsnPhe: 0.886 ± 0.028
2.219AsnGly: 2.219 ± 0.052
0.568AsnHis: 0.568 ± 0.02
1.059AsnIle: 1.059 ± 0.031
0.827AsnLys: 0.827 ± 0.028
2.663AsnLeu: 2.663 ± 0.047
0.579AsnMet: 0.579 ± 0.023
0.741AsnAsn: 0.741 ± 0.031
1.706AsnPro: 1.706 ± 0.037
0.981AsnGln: 0.981 ± 0.031
1.587AsnArg: 1.587 ± 0.036
1.143AsnSer: 1.143 ± 0.034
1.513AsnThr: 1.513 ± 0.043
1.946AsnVal: 1.946 ± 0.042
0.422AsnTrp: 0.422 ± 0.018
0.548AsnTyr: 0.548 ± 0.022
0.0AsnXaa: 0.0 ± 0.0
Pro
6.88ProAla: 6.88 ± 0.08
0.423ProCys: 0.423 ± 0.019
3.276ProAsp: 3.276 ± 0.056
3.354ProGlu: 3.354 ± 0.058
1.653ProPhe: 1.653 ± 0.039
4.448ProGly: 4.448 ± 0.069
1.354ProHis: 1.354 ± 0.035
1.796ProIle: 1.796 ± 0.034
1.622ProLys: 1.622 ± 0.042
5.306ProLeu: 5.306 ± 0.073
1.41ProMet: 1.41 ± 0.038
1.284ProAsn: 1.284 ± 0.03
2.76ProPro: 2.76 ± 0.064
2.408ProGln: 2.408 ± 0.041
3.069ProArg: 3.069 ± 0.056
3.146ProSer: 3.146 ± 0.059
2.898ProThr: 2.898 ± 0.053
4.294ProVal: 4.294 ± 0.064
0.874ProTrp: 0.874 ± 0.031
0.976ProTyr: 0.976 ± 0.027
0.0ProXaa: 0.0 ± 0.0
Gln
7.575GlnAla: 7.575 ± 0.099
0.31GlnCys: 0.31 ± 0.017
1.988GlnAsp: 1.988 ± 0.043
1.718GlnGlu: 1.718 ± 0.041
1.271GlnPhe: 1.271 ± 0.032
3.853GlnGly: 3.853 ± 0.065
1.226GlnHis: 1.226 ± 0.032
1.671GlnIle: 1.671 ± 0.037
1.083GlnLys: 1.083 ± 0.035
4.448GlnLeu: 4.448 ± 0.069
1.043GlnMet: 1.043 ± 0.034
0.811GlnAsn: 0.811 ± 0.026
2.731GlnPro: 2.731 ± 0.052
2.35GlnGln: 2.35 ± 0.055
3.917GlnArg: 3.917 ± 0.071
2.227GlnSer: 2.227 ± 0.046
2.336GlnThr: 2.336 ± 0.048
3.525GlnVal: 3.525 ± 0.055
0.714GlnTrp: 0.714 ± 0.026
0.719GlnTyr: 0.719 ± 0.024
0.0GlnXaa: 0.0 ± 0.0
Arg
7.58ArgAla: 7.58 ± 0.084
0.642ArgCys: 0.642 ± 0.022
3.667ArgAsp: 3.667 ± 0.06
4.073ArgGlu: 4.073 ± 0.064
2.687ArgPhe: 2.687 ± 0.046
4.335ArgGly: 4.335 ± 0.063
2.088ArgHis: 2.088 ± 0.05
3.316ArgIle: 3.316 ± 0.048
2.162ArgLys: 2.162 ± 0.041
8.193ArgLeu: 8.193 ± 0.09
1.844ArgMet: 1.844 ± 0.034
1.704ArgAsn: 1.704 ± 0.038
3.246ArgPro: 3.246 ± 0.05
3.559ArgGln: 3.559 ± 0.056
4.902ArgArg: 4.902 ± 0.069
3.548ArgSer: 3.548 ± 0.056
3.381ArgThr: 3.381 ± 0.053
5.099ArgVal: 5.099 ± 0.057
1.27ArgTrp: 1.27 ± 0.033
1.6ArgTyr: 1.6 ± 0.033
0.0ArgXaa: 0.0 ± 0.0
Ser
7.122SerAla: 7.122 ± 0.086
0.465SerCys: 0.465 ± 0.022
2.929SerAsp: 2.929 ± 0.051
2.69SerGlu: 2.69 ± 0.043
1.939SerPhe: 1.939 ± 0.038
5.262SerGly: 5.262 ± 0.077
1.347SerHis: 1.347 ± 0.034
2.137SerIle: 2.137 ± 0.044
1.532SerLys: 1.532 ± 0.039
6.063SerLeu: 6.063 ± 0.071
1.249SerMet: 1.249 ± 0.035
1.44SerAsn: 1.44 ± 0.038
3.132SerPro: 3.132 ± 0.055
2.297SerGln: 2.297 ± 0.051
3.575SerArg: 3.575 ± 0.049
3.24SerSer: 3.24 ± 0.059
3.17SerThr: 3.17 ± 0.065
4.245SerVal: 4.245 ± 0.062
0.784SerTrp: 0.784 ± 0.024
1.153SerTyr: 1.153 ± 0.037
0.0SerXaa: 0.0 ± 0.0
Thr
6.412ThrAla: 6.412 ± 0.078
0.428ThrCys: 0.428 ± 0.02
2.819ThrAsp: 2.819 ± 0.054
2.55ThrGlu: 2.55 ± 0.047
1.723ThrPhe: 1.723 ± 0.036
4.642ThrGly: 4.642 ± 0.07
1.26ThrHis: 1.26 ± 0.031
1.832ThrIle: 1.832 ± 0.047
1.292ThrLys: 1.292 ± 0.036
6.714ThrLeu: 6.714 ± 0.077
1.032ThrMet: 1.032 ± 0.032
1.209ThrAsn: 1.209 ± 0.037
3.835ThrPro: 3.835 ± 0.054
2.369ThrGln: 2.369 ± 0.054
3.337ThrArg: 3.337 ± 0.052
3.027ThrSer: 3.027 ± 0.064
2.964ThrThr: 2.964 ± 0.071
4.495ThrVal: 4.495 ± 0.071
0.759ThrTrp: 0.759 ± 0.025
1.076ThrTyr: 1.076 ± 0.037
0.0ThrXaa: 0.0 ± 0.0
Val
9.354ValAla: 9.354 ± 0.091
0.804ValCys: 0.804 ± 0.026
4.042ValAsp: 4.042 ± 0.057
3.726ValGlu: 3.726 ± 0.057
2.54ValPhe: 2.54 ± 0.05
5.554ValGly: 5.554 ± 0.069
1.865ValHis: 1.865 ± 0.041
3.181ValIle: 3.181 ± 0.051
2.646ValLys: 2.646 ± 0.051
8.464ValLeu: 8.464 ± 0.103
1.94ValMet: 1.94 ± 0.048
2.205ValAsn: 2.205 ± 0.047
4.163ValPro: 4.163 ± 0.06
3.347ValGln: 3.347 ± 0.064
5.435ValArg: 5.435 ± 0.057
4.591ValSer: 4.591 ± 0.067
4.259ValThr: 4.259 ± 0.064
6.372ValVal: 6.372 ± 0.084
1.146ValTrp: 1.146 ± 0.032
1.344ValTyr: 1.344 ± 0.033
0.0ValXaa: 0.0 ± 0.0
Trp
1.612TrpAla: 1.612 ± 0.039
0.194TrpCys: 0.194 ± 0.012
0.651TrpAsp: 0.651 ± 0.021
0.467TrpGlu: 0.467 ± 0.018
0.533TrpPhe: 0.533 ± 0.022
1.026TrpGly: 1.026 ± 0.031
0.454TrpHis: 0.454 ± 0.021
0.491TrpIle: 0.491 ± 0.018
0.413TrpLys: 0.413 ± 0.016
2.378TrpLeu: 2.378 ± 0.06
0.448TrpMet: 0.448 ± 0.019
0.319TrpAsn: 0.319 ± 0.015
0.814TrpPro: 0.814 ± 0.031
0.976TrpGln: 0.976 ± 0.029
1.44TrpArg: 1.44 ± 0.037
0.768TrpSer: 0.768 ± 0.024
0.697TrpThr: 0.697 ± 0.024
1.192TrpVal: 1.192 ± 0.032
0.344TrpTrp: 0.344 ± 0.019
0.275TrpTyr: 0.275 ± 0.013
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.232TyrAla: 2.232 ± 0.04
0.197TyrCys: 0.197 ± 0.012
1.061TyrAsp: 1.061 ± 0.041
1.027TyrGlu: 1.027 ± 0.029
0.812TyrPhe: 0.812 ± 0.025
1.625TyrGly: 1.625 ± 0.036
0.385TyrHis: 0.385 ± 0.016
0.751TyrIle: 0.751 ± 0.023
0.634TyrLys: 0.634 ± 0.024
2.102TyrLeu: 2.102 ± 0.042
0.358TyrMet: 0.358 ± 0.015
0.585TyrAsn: 0.585 ± 0.021
0.943TyrPro: 0.943 ± 0.026
0.848TyrGln: 0.848 ± 0.025
1.423TyrArg: 1.423 ± 0.036
1.04TyrSer: 1.04 ± 0.03
1.11TyrThr: 1.11 ± 0.031
1.461TyrVal: 1.461 ± 0.037
0.333TyrTrp: 0.333 ± 0.015
0.45TyrTyr: 0.45 ± 0.022
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3649 proteins (1271893 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski