Amino acid dipepetide frequency for Enterobacter sp. EA-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.491AlaAla: 10.491 ± 0.344
1.442AlaCys: 1.442 ± 0.118
4.83AlaAsp: 4.83 ± 0.16
4.981AlaGlu: 4.981 ± 0.198
3.479AlaPhe: 3.479 ± 0.174
8.619AlaGly: 8.619 ± 0.259
2.023AlaHis: 2.023 ± 0.129
5.283AlaIle: 5.283 ± 0.207
3.63AlaLys: 3.63 ± 0.181
10.906AlaLeu: 10.906 ± 0.299
2.868AlaMet: 2.868 ± 0.149
2.845AlaAsn: 2.845 ± 0.16
3.66AlaPro: 3.66 ± 0.185
4.604AlaGln: 4.604 ± 0.203
5.766AlaArg: 5.766 ± 0.229
6.415AlaSer: 6.415 ± 0.195
5.359AlaThr: 5.359 ± 0.197
6.823AlaVal: 6.823 ± 0.277
0.808AlaTrp: 0.808 ± 0.078
1.894AlaTyr: 1.894 ± 0.129
0.0AlaXaa: 0.0 ± 0.0
Cys
1.494CysAla: 1.494 ± 0.106
0.543CysCys: 0.543 ± 0.071
0.717CysAsp: 0.717 ± 0.075
0.679CysGlu: 0.679 ± 0.066
0.649CysPhe: 0.649 ± 0.071
1.306CysGly: 1.306 ± 0.101
0.46CysHis: 0.46 ± 0.059
0.657CysIle: 0.657 ± 0.056
0.491CysLys: 0.491 ± 0.054
1.351CysLeu: 1.351 ± 0.101
0.664CysMet: 0.664 ± 0.067
0.438CysAsn: 0.438 ± 0.061
0.868CysPro: 0.868 ± 0.096
0.453CysGln: 0.453 ± 0.057
1.675CysArg: 1.675 ± 0.111
1.185CysSer: 1.185 ± 0.095
0.717CysThr: 0.717 ± 0.064
1.057CysVal: 1.057 ± 0.101
0.657CysTrp: 0.657 ± 0.08
0.475CysTyr: 0.475 ± 0.055
0.0CysXaa: 0.0 ± 0.0
Asp
5.291AspAla: 5.291 ± 0.202
0.657AspCys: 0.657 ± 0.079
2.921AspAsp: 2.921 ± 0.167
3.245AspGlu: 3.245 ± 0.173
2.219AspPhe: 2.219 ± 0.126
3.6AspGly: 3.6 ± 0.168
0.981AspHis: 0.981 ± 0.113
3.343AspIle: 3.343 ± 0.153
2.294AspLys: 2.294 ± 0.137
3.683AspLeu: 3.683 ± 0.176
1.321AspMet: 1.321 ± 0.097
2.091AspAsn: 2.091 ± 0.131
1.819AspPro: 1.819 ± 0.132
1.517AspGln: 1.517 ± 0.106
2.725AspArg: 2.725 ± 0.148
2.966AspSer: 2.966 ± 0.143
2.174AspThr: 2.174 ± 0.145
3.743AspVal: 3.743 ± 0.163
0.792AspTrp: 0.792 ± 0.075
1.555AspTyr: 1.555 ± 0.112
0.0AspXaa: 0.0 ± 0.0
Glu
4.287GluAla: 4.287 ± 0.182
0.543GluCys: 0.543 ± 0.068
2.242GluAsp: 2.242 ± 0.129
3.306GluGlu: 3.306 ± 0.186
1.691GluPhe: 1.691 ± 0.12
2.845GluGly: 2.845 ± 0.153
1.472GluHis: 1.472 ± 0.117
2.815GluIle: 2.815 ± 0.146
3.185GluLys: 3.185 ± 0.168
5.374GluLeu: 5.374 ± 0.194
1.608GluMet: 1.608 ± 0.098
2.174GluAsn: 2.174 ± 0.13
1.834GluPro: 1.834 ± 0.131
2.845GluGln: 2.845 ± 0.153
3.547GluArg: 3.547 ± 0.167
2.747GluSer: 2.747 ± 0.152
3.208GluThr: 3.208 ± 0.164
2.981GluVal: 2.981 ± 0.138
0.77GluTrp: 0.77 ± 0.082
1.215GluTyr: 1.215 ± 0.096
0.0GluXaa: 0.0 ± 0.0
Phe
3.676PheAla: 3.676 ± 0.18
0.664PheCys: 0.664 ± 0.067
2.008PheAsp: 2.008 ± 0.123
1.57PheGlu: 1.57 ± 0.095
1.638PhePhe: 1.638 ± 0.129
2.498PheGly: 2.498 ± 0.126
0.868PheHis: 0.868 ± 0.08
2.747PheIle: 2.747 ± 0.159
1.343PheLys: 1.343 ± 0.102
3.132PheLeu: 3.132 ± 0.158
1.102PheMet: 1.102 ± 0.086
1.502PheAsn: 1.502 ± 0.109
1.872PhePro: 1.872 ± 0.124
0.928PheGln: 0.928 ± 0.074
2.181PheArg: 2.181 ± 0.129
3.268PheSer: 3.268 ± 0.149
2.46PheThr: 2.46 ± 0.142
2.445PheVal: 2.445 ± 0.14
0.823PheTrp: 0.823 ± 0.076
1.155PheTyr: 1.155 ± 0.089
0.0PheXaa: 0.0 ± 0.0
Gly
6.332GlyAla: 6.332 ± 0.27
1.6GlyCys: 1.6 ± 0.111
3.593GlyAsp: 3.593 ± 0.153
4.151GlyGlu: 4.151 ± 0.17
2.838GlyPhe: 2.838 ± 0.131
5.6GlyGly: 5.6 ± 0.232
1.547GlyHis: 1.547 ± 0.118
4.362GlyIle: 4.362 ± 0.174
3.668GlyLys: 3.668 ± 0.165
6.438GlyLeu: 6.438 ± 0.238
2.46GlyMet: 2.46 ± 0.126
3.238GlyAsn: 3.238 ± 0.177
1.796GlyPro: 1.796 ± 0.103
2.521GlyGln: 2.521 ± 0.139
4.672GlyArg: 4.672 ± 0.17
4.128GlySer: 4.128 ± 0.205
3.645GlyThr: 3.645 ± 0.214
5.268GlyVal: 5.268 ± 0.201
1.268GlyTrp: 1.268 ± 0.107
2.513GlyTyr: 2.513 ± 0.126
0.0GlyXaa: 0.0 ± 0.0
His
2.196HisAla: 2.196 ± 0.129
0.468HisCys: 0.468 ± 0.064
1.283HisAsp: 1.283 ± 0.11
1.034HisGlu: 1.034 ± 0.087
1.079HisPhe: 1.079 ± 0.108
1.804HisGly: 1.804 ± 0.146
1.14HisHis: 1.14 ± 0.111
1.328HisIle: 1.328 ± 0.099
0.664HisLys: 0.664 ± 0.066
2.06HisLeu: 2.06 ± 0.115
0.762HisMet: 0.762 ± 0.079
0.702HisAsn: 0.702 ± 0.074
1.374HisPro: 1.374 ± 0.116
1.155HisGln: 1.155 ± 0.092
1.759HisArg: 1.759 ± 0.112
1.359HisSer: 1.359 ± 0.1
1.102HisThr: 1.102 ± 0.089
1.223HisVal: 1.223 ± 0.095
0.581HisTrp: 0.581 ± 0.063
1.004HisTyr: 1.004 ± 0.092
0.0HisXaa: 0.0 ± 0.0
Ile
5.706IleAla: 5.706 ± 0.216
1.034IleCys: 1.034 ± 0.09
2.86IleAsp: 2.86 ± 0.158
2.838IleGlu: 2.838 ± 0.153
2.174IlePhe: 2.174 ± 0.131
3.698IleGly: 3.698 ± 0.207
1.147IleHis: 1.147 ± 0.12
3.283IleIle: 3.283 ± 0.177
2.506IleLys: 2.506 ± 0.142
4.838IleLeu: 4.838 ± 0.222
1.177IleMet: 1.177 ± 0.102
2.853IleAsn: 2.853 ± 0.154
2.559IlePro: 2.559 ± 0.144
1.449IleGln: 1.449 ± 0.111
3.14IleArg: 3.14 ± 0.144
4.143IleSer: 4.143 ± 0.182
3.811IleThr: 3.811 ± 0.151
3.615IleVal: 3.615 ± 0.172
0.717IleTrp: 0.717 ± 0.082
1.502IleTyr: 1.502 ± 0.114
0.0IleXaa: 0.0 ± 0.0
Lys
4.098LysAla: 4.098 ± 0.176
0.528LysCys: 0.528 ± 0.062
1.864LysAsp: 1.864 ± 0.117
2.083LysGlu: 2.083 ± 0.122
1.275LysPhe: 1.275 ± 0.091
2.611LysGly: 2.611 ± 0.128
0.785LysHis: 0.785 ± 0.071
2.468LysIle: 2.468 ± 0.131
1.992LysLys: 1.992 ± 0.121
3.932LysLeu: 3.932 ± 0.179
1.313LysMet: 1.313 ± 0.111
1.894LysAsn: 1.894 ± 0.104
2.098LysPro: 2.098 ± 0.128
1.721LysGln: 1.721 ± 0.102
3.004LysArg: 3.004 ± 0.145
2.581LysSer: 2.581 ± 0.16
2.815LysThr: 2.815 ± 0.146
2.559LysVal: 2.559 ± 0.131
0.543LysTrp: 0.543 ± 0.056
1.275LysTyr: 1.275 ± 0.089
0.0LysXaa: 0.0 ± 0.0
Leu
9.781LeuAla: 9.781 ± 0.309
1.547LeuCys: 1.547 ± 0.11
4.536LeuAsp: 4.536 ± 0.179
4.808LeuGlu: 4.808 ± 0.202
4.521LeuPhe: 4.521 ± 0.185
6.098LeuGly: 6.098 ± 0.3
2.385LeuHis: 2.385 ± 0.131
5.6LeuIle: 5.6 ± 0.194
4.257LeuLys: 4.257 ± 0.189
11.698LeuLeu: 11.698 ± 0.377
2.921LeuMet: 2.921 ± 0.133
4.272LeuAsn: 4.272 ± 0.193
5.54LeuPro: 5.54 ± 0.22
3.826LeuGln: 3.826 ± 0.171
6.627LeuArg: 6.627 ± 0.224
7.63LeuSer: 7.63 ± 0.286
6.325LeuThr: 6.325 ± 0.206
5.676LeuVal: 5.676 ± 0.22
1.442LeuTrp: 1.442 ± 0.117
2.596LeuTyr: 2.596 ± 0.14
0.0LeuXaa: 0.0 ± 0.0
Met
3.057MetAla: 3.057 ± 0.144
0.37MetCys: 0.37 ± 0.053
1.125MetAsp: 1.125 ± 0.086
1.253MetGlu: 1.253 ± 0.096
0.928MetPhe: 0.928 ± 0.072
1.487MetGly: 1.487 ± 0.118
0.702MetHis: 0.702 ± 0.077
1.759MetIle: 1.759 ± 0.116
1.381MetLys: 1.381 ± 0.096
3.215MetLeu: 3.215 ± 0.161
0.936MetMet: 0.936 ± 0.078
1.283MetAsn: 1.283 ± 0.089
1.759MetPro: 1.759 ± 0.126
1.426MetGln: 1.426 ± 0.105
1.857MetArg: 1.857 ± 0.119
2.136MetSer: 2.136 ± 0.122
1.925MetThr: 1.925 ± 0.11
2.068MetVal: 2.068 ± 0.108
0.347MetTrp: 0.347 ± 0.05
0.543MetTyr: 0.543 ± 0.069
0.0MetXaa: 0.0 ± 0.0
Asn
3.789AsnAla: 3.789 ± 0.16
0.543AsnCys: 0.543 ± 0.06
2.136AsnAsp: 2.136 ± 0.123
1.683AsnGlu: 1.683 ± 0.105
1.23AsnPhe: 1.23 ± 0.093
3.298AsnGly: 3.298 ± 0.185
0.906AsnHis: 0.906 ± 0.071
2.143AsnIle: 2.143 ± 0.135
1.592AsnLys: 1.592 ± 0.117
3.102AsnLeu: 3.102 ± 0.16
1.177AsnMet: 1.177 ± 0.099
1.774AsnAsn: 1.774 ± 0.121
1.909AsnPro: 1.909 ± 0.132
1.494AsnGln: 1.494 ± 0.118
2.332AsnArg: 2.332 ± 0.129
2.34AsnSer: 2.34 ± 0.132
2.257AsnThr: 2.257 ± 0.142
2.415AsnVal: 2.415 ± 0.14
0.762AsnTrp: 0.762 ± 0.078
1.079AsnTyr: 1.079 ± 0.1
0.0AsnXaa: 0.0 ± 0.0
Pro
5.276ProAla: 5.276 ± 0.211
0.626ProCys: 0.626 ± 0.08
2.566ProAsp: 2.566 ± 0.151
2.913ProGlu: 2.913 ± 0.148
1.721ProPhe: 1.721 ± 0.133
3.208ProGly: 3.208 ± 0.148
1.14ProHis: 1.14 ± 0.088
1.774ProIle: 1.774 ± 0.131
1.313ProLys: 1.313 ± 0.099
5.042ProLeu: 5.042 ± 0.198
1.17ProMet: 1.17 ± 0.086
1.223ProAsn: 1.223 ± 0.1
2.377ProPro: 2.377 ± 0.164
2.136ProGln: 2.136 ± 0.128
2.808ProArg: 2.808 ± 0.131
2.513ProSer: 2.513 ± 0.139
2.272ProThr: 2.272 ± 0.141
4.234ProVal: 4.234 ± 0.189
0.634ProTrp: 0.634 ± 0.078
0.959ProTyr: 0.959 ± 0.089
0.0ProXaa: 0.0 ± 0.0
Gln
3.653GlnAla: 3.653 ± 0.189
0.566GlnCys: 0.566 ± 0.075
1.592GlnAsp: 1.592 ± 0.107
1.706GlnGlu: 1.706 ± 0.131
1.366GlnPhe: 1.366 ± 0.111
2.083GlnGly: 2.083 ± 0.133
1.223GlnHis: 1.223 ± 0.096
1.947GlnIle: 1.947 ± 0.131
1.706GlnLys: 1.706 ± 0.103
4.415GlnLeu: 4.415 ± 0.178
1.389GlnMet: 1.389 ± 0.105
1.577GlnAsn: 1.577 ± 0.112
2.332GlnPro: 2.332 ± 0.144
3.298GlnGln: 3.298 ± 0.237
3.426GlnArg: 3.426 ± 0.16
2.619GlnSer: 2.619 ± 0.163
2.445GlnThr: 2.445 ± 0.147
2.234GlnVal: 2.234 ± 0.12
0.694GlnTrp: 0.694 ± 0.068
1.291GlnTyr: 1.291 ± 0.096
0.0GlnXaa: 0.0 ± 0.0
Arg
4.8ArgAla: 4.8 ± 0.199
1.743ArgCys: 1.743 ± 0.1
3.125ArgAsp: 3.125 ± 0.172
3.706ArgGlu: 3.706 ± 0.166
2.445ArgPhe: 2.445 ± 0.16
4.023ArgGly: 4.023 ± 0.167
2.015ArgHis: 2.015 ± 0.137
3.434ArgIle: 3.434 ± 0.167
2.86ArgLys: 2.86 ± 0.165
6.642ArgLeu: 6.642 ± 0.227
2.355ArgMet: 2.355 ± 0.118
2.506ArgAsn: 2.506 ± 0.147
2.928ArgPro: 2.928 ± 0.163
3.464ArgGln: 3.464 ± 0.185
6.362ArgArg: 6.362 ± 0.268
3.947ArgSer: 3.947 ± 0.16
3.177ArgThr: 3.177 ± 0.149
4.106ArgVal: 4.106 ± 0.173
1.683ArgTrp: 1.683 ± 0.123
2.37ArgTyr: 2.37 ± 0.139
0.0ArgXaa: 0.0 ± 0.0
Ser
6.785SerAla: 6.785 ± 0.203
0.928SerCys: 0.928 ± 0.086
3.238SerAsp: 3.238 ± 0.165
3.276SerGlu: 3.276 ± 0.165
2.34SerPhe: 2.34 ± 0.142
6.385SerGly: 6.385 ± 0.27
1.494SerHis: 1.494 ± 0.11
3.087SerIle: 3.087 ± 0.166
2.211SerLys: 2.211 ± 0.136
7.019SerLeu: 7.019 ± 0.228
1.562SerMet: 1.562 ± 0.094
1.774SerAsn: 1.774 ± 0.118
3.049SerPro: 3.049 ± 0.173
2.362SerGln: 2.362 ± 0.125
4.423SerArg: 4.423 ± 0.167
4.476SerSer: 4.476 ± 0.231
3.698SerThr: 3.698 ± 0.167
4.732SerVal: 4.732 ± 0.192
0.891SerTrp: 0.891 ± 0.078
1.479SerTyr: 1.479 ± 0.114
0.0SerXaa: 0.0 ± 0.0
Thr
5.872ThrAla: 5.872 ± 0.236
0.709ThrCys: 0.709 ± 0.068
2.815ThrAsp: 2.815 ± 0.136
2.43ThrGlu: 2.43 ± 0.143
2.053ThrPhe: 2.053 ± 0.103
5.743ThrGly: 5.743 ± 0.218
1.351ThrHis: 1.351 ± 0.11
2.853ThrIle: 2.853 ± 0.13
1.547ThrLys: 1.547 ± 0.116
7.14ThrLeu: 7.14 ± 0.205
1.268ThrMet: 1.268 ± 0.097
1.691ThrAsn: 1.691 ± 0.133
3.396ThrPro: 3.396 ± 0.177
1.992ThrGln: 1.992 ± 0.127
3.411ThrArg: 3.411 ± 0.148
3.238ThrSer: 3.238 ± 0.165
3.502ThrThr: 3.502 ± 0.16
4.506ThrVal: 4.506 ± 0.188
0.34ThrTrp: 0.34 ± 0.052
1.064ThrTyr: 1.064 ± 0.098
0.0ThrXaa: 0.0 ± 0.0
Val
6.876ValAla: 6.876 ± 0.237
1.132ValCys: 1.132 ± 0.091
3.457ValAsp: 3.457 ± 0.164
3.276ValGlu: 3.276 ± 0.159
2.611ValPhe: 2.611 ± 0.133
4.03ValGly: 4.03 ± 0.185
1.14ValHis: 1.14 ± 0.083
4.143ValIle: 4.143 ± 0.167
3.14ValLys: 3.14 ± 0.15
7.253ValLeu: 7.253 ± 0.264
2.211ValMet: 2.211 ± 0.143
2.913ValAsn: 2.913 ± 0.146
2.679ValPro: 2.679 ± 0.119
2.196ValGln: 2.196 ± 0.135
3.721ValArg: 3.721 ± 0.165
4.71ValSer: 4.71 ± 0.198
4.272ValThr: 4.272 ± 0.194
5.291ValVal: 5.291 ± 0.231
1.019ValTrp: 1.019 ± 0.088
1.577ValTyr: 1.577 ± 0.091
0.0ValXaa: 0.0 ± 0.0
Trp
0.921TrpAla: 0.921 ± 0.086
0.332TrpCys: 0.332 ± 0.052
0.558TrpAsp: 0.558 ± 0.066
0.528TrpGlu: 0.528 ± 0.065
0.536TrpPhe: 0.536 ± 0.063
0.86TrpGly: 0.86 ± 0.081
0.581TrpHis: 0.581 ± 0.07
0.762TrpIle: 0.762 ± 0.073
0.687TrpLys: 0.687 ± 0.086
2.151TrpLeu: 2.151 ± 0.146
0.558TrpMet: 0.558 ± 0.063
0.385TrpAsn: 0.385 ± 0.05
0.694TrpPro: 0.694 ± 0.073
0.936TrpGln: 0.936 ± 0.081
1.985TrpArg: 1.985 ± 0.128
0.966TrpSer: 0.966 ± 0.09
0.574TrpThr: 0.574 ± 0.071
0.815TrpVal: 0.815 ± 0.08
0.302TrpTrp: 0.302 ± 0.055
0.475TrpTyr: 0.475 ± 0.062
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.347TyrAla: 2.347 ± 0.134
0.558TyrCys: 0.558 ± 0.068
1.472TyrAsp: 1.472 ± 0.111
1.223TyrGlu: 1.223 ± 0.117
1.087TyrPhe: 1.087 ± 0.099
1.842TyrGly: 1.842 ± 0.123
0.717TyrHis: 0.717 ± 0.079
1.26TyrIle: 1.26 ± 0.096
0.974TyrLys: 0.974 ± 0.085
2.694TyrLeu: 2.694 ± 0.142
0.725TyrMet: 0.725 ± 0.079
0.83TyrAsn: 0.83 ± 0.079
1.359TyrPro: 1.359 ± 0.106
1.238TyrGln: 1.238 ± 0.09
2.279TyrArg: 2.279 ± 0.141
1.842TyrSer: 1.842 ± 0.115
1.185TyrThr: 1.185 ± 0.095
1.857TyrVal: 1.857 ± 0.122
0.536TyrTrp: 0.536 ± 0.07
0.86TyrTyr: 0.86 ± 0.081
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1052 proteins (132499 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski