Amino acid dipeptide frequency for Zymomonas sp.

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
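As a rough illustration of how such frequencies can be obtained, the sketch below counts all adjacent residue pairs in a set of sequences and reports them in per mille. It is a minimal sketch, not the database's actual pipeline: the function name `dipeptide_permille`, the use of overlapping windows, and the mapping of every non-standard letter to Xaa are assumptions, and the two input strings are toy sequences rather than real proteins.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the same order as the table header.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the 20 standard ones is treated as Xaa.
        codes = [ONE_TO_THREE.get(res, "Xaa") for res in seq.upper()]
        # Overlapping windows: residues i and i+1 form one dipeptide.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400

freqs = dipeptide_permille(["MAALKG", "AALLV"])  # toy sequences, not real proteins
```

The returned dictionary always has 441 entries, so dipeptides that never occur (including all Xaa pairs in a clean proteome) show up with a frequency of 0.0, as in the table.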

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
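The unit conversion and the comparison with the random expectation can be sketched as follows. The helper names are hypothetical, and the assumption that the over/under-representation threshold is exactly the uniform value of 2.5‰ is an interpretation of the text, not something the page states explicitly. The two sample values (AlaAla and CysCys) are taken from the table.

```python
# Uniform expectation: 1/400 of all dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5


def permille_to_fraction(value):
    """Convert a per-mille table value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PERMILLE):
    """Classify a per-mille value relative to the (assumed) uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


ala_ala = 14.402  # AlaAla, per mille, from the table
cys_cys = 0.098   # CysCys, per mille, from the table

ala_ala_fraction = permille_to_fraction(ala_ala)  # about 0.0144, i.e. 1.44%
ala_ala_status = representation(ala_ala)
cys_cys_status = representation(cys_cys)
```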

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.402 ± 0.346
AlaCys: 1.089 ± 0.077
AlaAsp: 6.701 ± 0.173
AlaGlu: 7.216 ± 0.18
AlaPhe: 4.364 ± 0.126
AlaGly: 9.171 ± 0.207
AlaHis: 2.177 ± 0.117
AlaIle: 6.208 ± 0.193
AlaLys: 3.915 ± 0.157
AlaLeu: 11.918 ± 0.247
AlaMet: 3.386 ± 0.123
AlaAsn: 3.173 ± 0.163
AlaPro: 4.715 ± 0.148
AlaGln: 3.755 ± 0.13
AlaArg: 7.741 ± 0.196
AlaSer: 6.852 ± 0.201
AlaThr: 5.945 ± 0.15
AlaVal: 8.243 ± 0.169
AlaTrp: 1.555 ± 0.089
AlaTyr: 2.413 ± 0.092
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.915 ± 0.069
CysCys: 0.098 ± 0.021
CysAsp: 0.613 ± 0.052
CysGlu: 0.551 ± 0.045
CysPhe: 0.324 ± 0.038
CysGly: 0.795 ± 0.065
CysHis: 0.218 ± 0.033
CysIle: 0.413 ± 0.046
CysLys: 0.187 ± 0.026
CysLeu: 0.813 ± 0.062
CysMet: 0.173 ± 0.028
CysAsn: 0.267 ± 0.034
CysPro: 0.422 ± 0.044
CysGln: 0.24 ± 0.032
CysArg: 0.658 ± 0.054
CysSer: 0.68 ± 0.057
CysThr: 0.44 ± 0.04
CysVal: 0.587 ± 0.051
CysTrp: 0.129 ± 0.022
CysTyr: 0.222 ± 0.035
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.341 ± 0.168
AspCys: 0.484 ± 0.052
AspAsp: 3.506 ± 0.146
AspGlu: 3.808 ± 0.15
AspPhe: 2.537 ± 0.093
AspGly: 5.208 ± 0.176
AspHis: 1.466 ± 0.096
AspIle: 3.097 ± 0.121
AspLys: 1.702 ± 0.088
AspLeu: 6.425 ± 0.174
AspMet: 1.444 ± 0.073
AspAsn: 1.475 ± 0.082
AspPro: 3.524 ± 0.134
AspGln: 1.933 ± 0.099
AspArg: 4.755 ± 0.16
AspSer: 2.302 ± 0.098
AspThr: 2.853 ± 0.132
AspVal: 4.364 ± 0.134
AspTrp: 1.053 ± 0.07
AspTyr: 1.569 ± 0.074
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.283 ± 0.195
GluCys: 0.427 ± 0.046
GluAsp: 2.999 ± 0.124
GluGlu: 3.182 ± 0.142
GluPhe: 1.986 ± 0.099
GluGly: 4.297 ± 0.15
GluHis: 1.235 ± 0.071
GluIle: 3.475 ± 0.13
GluLys: 2.208 ± 0.094
GluLeu: 5.581 ± 0.165
GluMet: 1.591 ± 0.088
GluAsn: 1.546 ± 0.077
GluPro: 2.666 ± 0.108
GluGln: 2.182 ± 0.1
GluArg: 5.363 ± 0.176
GluSer: 2.457 ± 0.104
GluThr: 3.51 ± 0.137
GluVal: 4.021 ± 0.129
GluTrp: 0.822 ± 0.062
GluTyr: 1.004 ± 0.066
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.337 ± 0.143
PheCys: 0.475 ± 0.046
PheAsp: 2.964 ± 0.104
PheGlu: 2.368 ± 0.096
PhePhe: 1.333 ± 0.082
PheGly: 3.701 ± 0.148
PheHis: 0.818 ± 0.061
PheIle: 1.813 ± 0.084
PheLys: 1.035 ± 0.06
PheLeu: 3.448 ± 0.137
PheMet: 0.76 ± 0.053
PheAsn: 1.16 ± 0.083
PhePro: 1.693 ± 0.075
PheGln: 0.951 ± 0.07
PheArg: 2.355 ± 0.108
PheSer: 2.4 ± 0.102
PheThr: 1.995 ± 0.095
PheVal: 2.839 ± 0.109
PheTrp: 0.533 ± 0.052
PheTyr: 1.133 ± 0.071
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.016 ± 0.207
GlyCys: 0.769 ± 0.064
GlyAsp: 4.452 ± 0.139
GlyGlu: 4.808 ± 0.158
GlyPhe: 3.541 ± 0.123
GlyGly: 6.39 ± 0.234
GlyHis: 1.8 ± 0.084
GlyIle: 4.235 ± 0.146
GlyLys: 3.062 ± 0.123
GlyLeu: 8.105 ± 0.24
GlyMet: 2.088 ± 0.108
GlyAsn: 2.235 ± 0.115
GlyPro: 3.022 ± 0.135
GlyGln: 2.693 ± 0.127
GlyArg: 5.843 ± 0.176
GlySer: 4.968 ± 0.152
GlyThr: 4.395 ± 0.143
GlyVal: 5.963 ± 0.163
GlyTrp: 1.293 ± 0.081
GlyTyr: 2.497 ± 0.098
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.208 ± 0.108
HisCys: 0.209 ± 0.032
HisAsp: 1.395 ± 0.078
HisGlu: 1.115 ± 0.065
HisPhe: 0.835 ± 0.065
HisGly: 1.937 ± 0.088
HisHis: 0.52 ± 0.052
HisIle: 0.88 ± 0.067
HisLys: 0.547 ± 0.054
HisLeu: 2.213 ± 0.087
HisMet: 0.475 ± 0.051
HisAsn: 0.515 ± 0.05
HisPro: 1.497 ± 0.083
HisGln: 0.715 ± 0.051
HisArg: 1.773 ± 0.085
HisSer: 1.013 ± 0.069
HisThr: 0.813 ± 0.061
HisVal: 1.609 ± 0.087
HisTrp: 0.324 ± 0.035
HisTyr: 0.635 ± 0.05
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.732 ± 0.184
IleCys: 0.533 ± 0.055
IleAsp: 3.741 ± 0.133
IleGlu: 3.875 ± 0.136
IlePhe: 1.746 ± 0.094
IleGly: 4.923 ± 0.166
IleHis: 0.929 ± 0.061
IleIle: 2.164 ± 0.105
IleLys: 1.626 ± 0.1
IleLeu: 4.43 ± 0.156
IleMet: 1.08 ± 0.067
IleAsn: 1.444 ± 0.089
IlePro: 2.293 ± 0.096
IleGln: 1.369 ± 0.081
IleArg: 3.359 ± 0.125
IleSer: 3.293 ± 0.141
IleThr: 2.777 ± 0.107
IleVal: 4.67 ± 0.152
IleTrp: 0.533 ± 0.05
IleTyr: 1.169 ± 0.073
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.119 ± 0.155
LysCys: 0.173 ± 0.029
LysAsp: 1.702 ± 0.075
LysGlu: 1.36 ± 0.082
LysPhe: 1.04 ± 0.063
LysGly: 2.355 ± 0.112
LysHis: 0.715 ± 0.059
LysIle: 1.897 ± 0.099
LysLys: 1.155 ± 0.083
LysLeu: 3.621 ± 0.14
LysMet: 0.933 ± 0.066
LysAsn: 0.871 ± 0.061
LysPro: 2.115 ± 0.123
LysGln: 1.133 ± 0.066
LysArg: 2.653 ± 0.109
LysSer: 1.817 ± 0.084
LysThr: 2.057 ± 0.11
LysVal: 2.271 ± 0.101
LysTrp: 0.333 ± 0.035
LysTyr: 0.644 ± 0.055
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.793 ± 0.234
LeuCys: 0.804 ± 0.057
LeuAsp: 6.203 ± 0.17
LeuGlu: 5.283 ± 0.166
LeuPhe: 3.546 ± 0.112
LeuGly: 7.545 ± 0.161
LeuHis: 1.937 ± 0.084
LeuIle: 5.106 ± 0.155
LeuLys: 3.63 ± 0.137
LeuLeu: 8.98 ± 0.254
LeuMet: 2.355 ± 0.101
LeuAsn: 2.706 ± 0.11
LeuPro: 4.932 ± 0.167
LeuGln: 3.142 ± 0.134
LeuArg: 7.594 ± 0.189
LeuSer: 6.861 ± 0.2
LeuThr: 5.848 ± 0.141
LeuVal: 7.443 ± 0.192
LeuTrp: 1.058 ± 0.085
LeuTyr: 2.351 ± 0.107
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.835 ± 0.121
MetCys: 0.178 ± 0.029
MetAsp: 1.204 ± 0.078
MetGlu: 1.253 ± 0.076
MetPhe: 0.711 ± 0.057
MetGly: 1.742 ± 0.086
MetHis: 0.48 ± 0.043
MetIle: 1.391 ± 0.075
MetLys: 1.133 ± 0.08
MetLeu: 2.453 ± 0.109
MetMet: 0.813 ± 0.069
MetAsn: 0.778 ± 0.062
MetPro: 1.391 ± 0.085
MetGln: 0.902 ± 0.062
MetArg: 1.968 ± 0.096
MetSer: 1.764 ± 0.107
MetThr: 1.973 ± 0.085
MetVal: 1.577 ± 0.08
MetTrp: 0.271 ± 0.037
MetTyr: 0.333 ± 0.039
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.142 ± 0.126
AsnCys: 0.191 ± 0.032
AsnAsp: 1.782 ± 0.102
AsnGlu: 1.502 ± 0.083
AsnPhe: 1.155 ± 0.076
AsnGly: 2.604 ± 0.128
AsnHis: 0.56 ± 0.057
AsnIle: 1.351 ± 0.088
AsnLys: 0.68 ± 0.053
AsnLeu: 2.515 ± 0.122
AsnMet: 0.591 ± 0.057
AsnAsn: 0.826 ± 0.066
AsnPro: 1.702 ± 0.083
AsnGln: 0.822 ± 0.07
AsnArg: 2.151 ± 0.101
AsnSer: 1.48 ± 0.069
AsnThr: 1.417 ± 0.085
AsnVal: 2.062 ± 0.107
AsnTrp: 0.44 ± 0.044
AsnTyr: 0.764 ± 0.052
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.528 ± 0.152
ProCys: 0.382 ± 0.046
ProAsp: 3.546 ± 0.142
ProGlu: 3.213 ± 0.142
ProPhe: 1.791 ± 0.09
ProGly: 4.004 ± 0.131
ProHis: 1.049 ± 0.069
ProIle: 2.4 ± 0.096
ProLys: 1.609 ± 0.087
ProLeu: 4.452 ± 0.14
ProMet: 1.106 ± 0.074
ProAsn: 1.431 ± 0.089
ProPro: 2.2 ± 0.116
ProGln: 1.617 ± 0.088
ProArg: 3.137 ± 0.132
ProSer: 3.008 ± 0.12
ProThr: 2.617 ± 0.103
ProVal: 3.853 ± 0.123
ProTrp: 0.618 ± 0.055
ProTyr: 1.151 ± 0.067
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.701 ± 0.139
GlnCys: 0.253 ± 0.031
GlnAsp: 1.711 ± 0.075
GlnGlu: 1.609 ± 0.1
GlnPhe: 1.08 ± 0.069
GlnGly: 2.328 ± 0.103
GlnHis: 0.724 ± 0.051
GlnIle: 2.128 ± 0.111
GlnLys: 1.022 ± 0.075
GlnLeu: 3.364 ± 0.138
GlnMet: 0.773 ± 0.062
GlnAsn: 0.929 ± 0.058
GlnPro: 1.755 ± 0.093
GlnGln: 1.386 ± 0.082
GlnArg: 2.737 ± 0.116
GlnSer: 1.937 ± 0.095
GlnThr: 1.902 ± 0.104
GlnVal: 2.382 ± 0.116
GlnTrp: 0.458 ± 0.043
GlnTyr: 0.658 ± 0.064
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.35 ± 0.183
ArgCys: 0.587 ± 0.046
ArgAsp: 4.488 ± 0.136
ArgGlu: 4.195 ± 0.123
ArgPhe: 3.159 ± 0.11
ArgGly: 4.937 ± 0.144
ArgHis: 1.937 ± 0.089
ArgIle: 4.208 ± 0.125
ArgLys: 2.462 ± 0.12
ArgLeu: 8.163 ± 0.21
ArgMet: 2.084 ± 0.102
ArgAsn: 1.982 ± 0.084
ArgPro: 3.355 ± 0.123
ArgGln: 2.817 ± 0.122
ArgArg: 5.817 ± 0.186
ArgSer: 4.675 ± 0.128
ArgThr: 3.848 ± 0.134
ArgVal: 4.63 ± 0.148
ArgTrp: 1.089 ± 0.071
ArgTyr: 2.026 ± 0.091
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.639 ± 0.174
SerCys: 0.52 ± 0.058
SerAsp: 3.586 ± 0.14
SerGlu: 3.275 ± 0.113
SerPhe: 2.488 ± 0.101
SerGly: 5.239 ± 0.175
SerHis: 1.133 ± 0.067
SerIle: 3.088 ± 0.114
SerLys: 2.04 ± 0.102
SerLeu: 5.666 ± 0.155
SerMet: 1.457 ± 0.08
SerAsn: 1.795 ± 0.088
SerPro: 2.919 ± 0.115
SerGln: 1.968 ± 0.1
SerArg: 4.19 ± 0.149
SerSer: 3.724 ± 0.178
SerThr: 3.186 ± 0.161
SerVal: 4.155 ± 0.139
SerTrp: 0.818 ± 0.065
SerTyr: 1.586 ± 0.091
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.265 ± 0.184
ThrCys: 0.453 ± 0.05
ThrAsp: 3.186 ± 0.122
ThrGlu: 2.666 ± 0.109
ThrPhe: 2.231 ± 0.106
ThrGly: 4.919 ± 0.176
ThrHis: 1.133 ± 0.072
ThrIle: 3.23 ± 0.112
ThrLys: 1.569 ± 0.093
ThrLeu: 5.71 ± 0.158
ThrMet: 1.293 ± 0.08
ThrAsn: 1.386 ± 0.085
ThrPro: 3.088 ± 0.109
ThrGln: 1.684 ± 0.075
ThrArg: 3.355 ± 0.102
ThrSer: 3.257 ± 0.129
ThrThr: 3.177 ± 0.141
ThrVal: 4.319 ± 0.139
ThrTrp: 0.711 ± 0.051
ThrTyr: 1.382 ± 0.084
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.54 ± 0.207
ValCys: 0.711 ± 0.064
ValAsp: 4.586 ± 0.133
ValGlu: 4.59 ± 0.167
ValPhe: 2.679 ± 0.125
ValGly: 5.043 ± 0.173
ValHis: 1.506 ± 0.093
ValIle: 3.893 ± 0.126
ValLys: 2.244 ± 0.109
ValLeu: 7.252 ± 0.179
ValMet: 1.884 ± 0.094
ValAsn: 2.057 ± 0.099
ValPro: 3.684 ± 0.127
ValGln: 2.075 ± 0.084
ValArg: 5.332 ± 0.152
ValSer: 4.541 ± 0.153
ValThr: 4.306 ± 0.14
ValVal: 5.701 ± 0.161
ValTrp: 0.929 ± 0.061
ValTyr: 1.537 ± 0.087
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.169 ± 0.074
TrpCys: 0.129 ± 0.022
TrpAsp: 0.778 ± 0.052
TrpGlu: 0.613 ± 0.048
TrpPhe: 0.609 ± 0.049
TrpGly: 0.782 ± 0.062
TrpHis: 0.329 ± 0.037
TrpIle: 0.649 ± 0.053
TrpLys: 0.533 ± 0.05
TrpLeu: 1.684 ± 0.104
TrpMet: 0.4 ± 0.037
TrpAsn: 0.493 ± 0.048
TrpPro: 0.573 ± 0.057
TrpGln: 0.609 ± 0.046
TrpArg: 1.129 ± 0.077
TrpSer: 1.013 ± 0.066
TrpThr: 0.831 ± 0.063
TrpVal: 0.773 ± 0.053
TrpTrp: 0.196 ± 0.029
TrpTyr: 0.227 ± 0.028
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.622 ± 0.105
TyrCys: 0.28 ± 0.041
TyrAsp: 1.702 ± 0.103
TyrGlu: 1.355 ± 0.07
TyrPhe: 0.955 ± 0.067
TyrGly: 2.253 ± 0.102
TyrHis: 0.564 ± 0.055
TyrIle: 1.062 ± 0.071
TyrLys: 0.609 ± 0.054
TyrLeu: 2.324 ± 0.116
TyrMet: 0.458 ± 0.047
TyrAsn: 0.667 ± 0.057
TyrPro: 1.169 ± 0.082
TyrGln: 0.844 ± 0.054
TyrArg: 1.897 ± 0.108
TyrSer: 1.431 ± 0.072
TyrThr: 1.124 ± 0.073
TyrVal: 1.631 ± 0.084
TyrTrp: 0.355 ± 0.042
TyrTyr: 0.707 ± 0.057
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1056 proteins (225047 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
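Protein-level bootstrapping can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is a minimal sketch under stated assumptions. The page only says "bootstrapping (×100) at the protein level"; that the ± value is the standard deviation of the resampled estimates, the function names, the fixed seed, and the one-letter toy sequences are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Mean and std. dev. of the per-mille frequency of `pair` over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy_proteins = ["MAALKG", "AALLV", "MKKAAG"]  # toy sequences, not real proteins
mean_aa, err_aa = bootstrap_error(toy_proteins, ("A", "A"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.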

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski