Amino acid dipeptide frequency for Erwinia phage Cronus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
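As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping pairs of adjacent residues in each protein and reports them in per mille. It is not the database's own code: the function name `dipeptide_frequencies` and the toy sequences are made up for the example, residues use one-letter codes (the table uses three-letter codes), and normalising by the total number of counted dipeptides is an assumption.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein;
        # pairs never span the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy "proteome" (not sequences from the phage).
toy_proteome = ["MKAAL", "MAAK"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

In this toy input "AA" occurs twice among seven dipeptides, so it is the most frequent pair; "LM" is never counted because it would cross a protein boundary.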

There are 400 possible dipeptides (441 if we consider Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
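The uniform expectation and the over/under-representation labels can be sketched as follows. This is a minimal sketch assuming a plain comparison against 1/400; the page's actual colouring rule may differ (for example, it might take the error into account). The function name `classify` is hypothetical, and the three values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids; Xaa is left out of the expectation

# 1 / 400 = 0.25 %, which is 2.5 per mille
expected_permille = 1000.0 / (N_STANDARD ** 2)

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"    # shown in red on the page
    if observed_permille < expected:
        return "under"   # shown in blue on the page
    return "as expected"

# Values (in per mille) copied from the table.
examples = {"AlaLeu": 6.364, "CysCys": 0.148, "AlaPhe": 2.431}
labels = {name: classify(value) for name, value in examples.items()}
print(expected_permille, labels)
```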

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
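A short sketch of the unit conversion, using the AlaAla value from the table; the helper name `permille_to_fraction` is made up for the example.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3

ala_ala_permille = 5.381                            # AlaAla entry from the table
fraction = permille_to_fraction(ala_ala_permille)   # about 0.005381
percent = fraction * 100                            # about 0.5381 %
print(fraction, percent)
```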

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.381 ± 0.446
AlaCys: 0.353 ± 0.08
AlaAsp: 4.75 ± 0.321
AlaGlu: 4.453 ± 0.324
AlaPhe: 2.431 ± 0.208
AlaGly: 4.769 ± 0.422
AlaHis: 1.058 ± 0.125
AlaIle: 4.546 ± 0.331
AlaLys: 4.898 ± 0.355
AlaLeu: 6.364 ± 0.438
AlaMet: 1.559 ± 0.168
AlaAsn: 3.303 ± 0.26
AlaPro: 2.375 ± 0.238
AlaGln: 2.616 ± 0.224
AlaArg: 3.062 ± 0.201
AlaSer: 4.806 ± 0.347
AlaThr: 3.952 ± 0.518
AlaVal: 4.88 ± 0.332
AlaTrp: 1.113 ± 0.131
AlaTyr: 2.338 ± 0.206
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.52 ± 0.11
CysCys: 0.148 ± 0.058
CysAsp: 0.464 ± 0.079
CysGlu: 0.631 ± 0.124
CysPhe: 0.538 ± 0.099
CysGly: 0.612 ± 0.105
CysHis: 0.278 ± 0.068
CysIle: 0.779 ± 0.094
CysLys: 0.705 ± 0.116
CysLeu: 0.649 ± 0.103
CysMet: 0.315 ± 0.07
CysAsn: 0.445 ± 0.09
CysPro: 0.501 ± 0.12
CysGln: 0.278 ± 0.069
CysArg: 0.482 ± 0.085
CysSer: 0.854 ± 0.115
CysThr: 0.668 ± 0.112
CysVal: 0.612 ± 0.104
CysTrp: 0.167 ± 0.056
CysTyr: 0.315 ± 0.075
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.323 ± 0.29
AspCys: 0.649 ± 0.109
AspAsp: 3.989 ± 0.286
AspGlu: 4.305 ± 0.268
AspPhe: 3.618 ± 0.251
AspGly: 4.564 ± 0.377
AspHis: 1.113 ± 0.138
AspIle: 4.806 ± 0.294
AspLys: 4.342 ± 0.272
AspLeu: 5.288 ± 0.328
AspMet: 1.577 ± 0.163
AspAsn: 2.895 ± 0.209
AspPro: 2.245 ± 0.253
AspGln: 1.93 ± 0.181
AspArg: 2.245 ± 0.24
AspSer: 4.416 ± 0.27
AspThr: 3.507 ± 0.311
AspVal: 4.824 ± 0.339
AspTrp: 1.187 ± 0.154
AspTyr: 3.21 ± 0.263
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.936 ± 0.311
GluCys: 0.761 ± 0.128
GluAsp: 4.453 ± 0.301
GluGlu: 4.472 ± 0.359
GluPhe: 3.414 ± 0.255
GluGly: 3.859 ± 0.23
GluHis: 1.076 ± 0.158
GluIle: 5.529 ± 0.341
GluLys: 4.509 ± 0.316
GluLeu: 6.698 ± 0.367
GluMet: 1.985 ± 0.202
GluAsn: 3.228 ± 0.251
GluPro: 1.911 ± 0.177
GluGln: 2.06 ± 0.208
GluArg: 2.839 ± 0.226
GluSer: 3.971 ± 0.289
GluThr: 3.544 ± 0.253
GluVal: 5.103 ± 0.289
GluTrp: 0.965 ± 0.131
GluTyr: 2.765 ± 0.245
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.802 ± 0.254
PheCys: 0.408 ± 0.092
PheAsp: 3.006 ± 0.244
PheGlu: 3.674 ± 0.278
PhePhe: 1.614 ± 0.196
PheGly: 2.672 ± 0.213
PheHis: 0.705 ± 0.103
PheIle: 3.34 ± 0.249
PheLys: 3.581 ± 0.294
PheLeu: 2.783 ± 0.199
PheMet: 1.262 ± 0.164
PheAsn: 3.006 ± 0.248
PhePro: 1.262 ± 0.164
PheGln: 1.317 ± 0.17
PheArg: 1.596 ± 0.181
PheSer: 3.507 ± 0.283
PheThr: 2.561 ± 0.199
PheVal: 3.062 ± 0.222
PheTrp: 0.724 ± 0.112
PheTyr: 1.633 ± 0.203
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.952 ± 0.333
GlyCys: 0.538 ± 0.097
GlyAsp: 4.119 ± 0.368
GlyGlu: 3.896 ± 0.262
GlyPhe: 2.672 ± 0.257
GlyGly: 3.395 ± 0.293
GlyHis: 1.095 ± 0.144
GlyIle: 4.138 ± 0.305
GlyLys: 4.175 ± 0.283
GlyLeu: 4.861 ± 0.237
GlyMet: 1.707 ± 0.223
GlyAsn: 3.266 ± 0.295
GlyPro: 1.447 ± 0.221
GlyGln: 2.171 ± 0.198
GlyArg: 2.561 ± 0.222
GlySer: 4.119 ± 0.286
GlyThr: 4.305 ± 0.418
GlyVal: 3.952 ± 0.295
GlyTrp: 1.113 ± 0.141
GlyTyr: 2.839 ± 0.239
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.983 ± 0.131
HisCys: 0.297 ± 0.086
HisAsp: 1.095 ± 0.16
HisGlu: 1.15 ± 0.14
HisPhe: 1.076 ± 0.148
HisGly: 1.076 ± 0.137
HisHis: 0.334 ± 0.089
HisIle: 1.095 ± 0.183
HisLys: 0.946 ± 0.134
HisLeu: 1.392 ± 0.166
HisMet: 0.39 ± 0.077
HisAsn: 0.816 ± 0.122
HisPro: 0.872 ± 0.112
HisGln: 0.52 ± 0.092
HisArg: 0.872 ± 0.132
HisSer: 1.113 ± 0.126
HisThr: 0.909 ± 0.134
HisVal: 1.243 ± 0.166
HisTrp: 0.26 ± 0.06
HisTyr: 0.705 ± 0.135
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.769 ± 0.296
IleCys: 0.742 ± 0.112
IleAsp: 5.158 ± 0.289
IleGlu: 5.381 ± 0.292
IlePhe: 2.635 ± 0.227
IleGly: 3.655 ± 0.25
IleHis: 0.946 ± 0.144
IleIle: 4.342 ± 0.307
IleLys: 5.863 ± 0.354
IleLeu: 4.472 ± 0.351
IleMet: 1.948 ± 0.215
IleAsn: 4.175 ± 0.301
IlePro: 2.412 ± 0.224
IleGln: 2.635 ± 0.231
IleArg: 3.395 ± 0.268
IleSer: 4.249 ± 0.281
IleThr: 4.379 ± 0.274
IleVal: 4.676 ± 0.289
IleTrp: 0.668 ± 0.137
IleTyr: 2.542 ± 0.233
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.585 ± 0.343
LysCys: 0.631 ± 0.105
LysAsp: 4.787 ± 0.398
LysGlu: 5.789 ± 0.372
LysPhe: 3.859 ± 0.31
LysGly: 3.785 ± 0.228
LysHis: 1.28 ± 0.162
LysIle: 5.418 ± 0.387
LysLys: 4.249 ± 0.388
LysLeu: 5.937 ± 0.404
LysMet: 2.598 ± 0.193
LysAsn: 3.989 ± 0.282
LysPro: 1.763 ± 0.181
LysGln: 2.319 ± 0.193
LysArg: 2.987 ± 0.302
LysSer: 4.824 ± 0.283
LysThr: 3.878 ± 0.254
LysVal: 4.676 ± 0.343
LysTrp: 1.095 ± 0.144
LysTyr: 3.358 ± 0.24
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.529 ± 0.388
LeuCys: 0.816 ± 0.148
LeuAsp: 4.824 ± 0.315
LeuGlu: 5.344 ± 0.331
LeuPhe: 3.507 ± 0.295
LeuGly: 4.175 ± 0.311
LeuHis: 1.15 ± 0.153
LeuIle: 5.177 ± 0.344
LeuLys: 6.049 ± 0.38
LeuLeu: 5.158 ± 0.326
LeuMet: 2.152 ± 0.216
LeuAsn: 4.973 ± 0.28
LeuPro: 3.024 ± 0.254
LeuGln: 2.987 ± 0.224
LeuArg: 3.21 ± 0.241
LeuSer: 5.603 ± 0.324
LeuThr: 5.14 ± 0.396
LeuVal: 4.973 ± 0.308
LeuTrp: 0.668 ± 0.11
LeuTyr: 3.006 ± 0.242
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.115 ± 0.208
MetCys: 0.39 ± 0.073
MetAsp: 1.095 ± 0.111
MetGlu: 1.336 ± 0.212
MetPhe: 1.15 ± 0.161
MetGly: 1.206 ± 0.144
MetHis: 0.334 ± 0.075
MetIle: 1.93 ± 0.189
MetLys: 2.82 ± 0.224
MetLeu: 2.227 ± 0.198
MetMet: 0.724 ± 0.111
MetAsn: 1.967 ± 0.164
MetPro: 0.798 ± 0.125
MetGln: 1.021 ± 0.139
MetArg: 1.076 ± 0.157
MetSer: 2.115 ± 0.199
MetThr: 1.8 ± 0.173
MetVal: 1.187 ± 0.148
MetTrp: 0.223 ± 0.069
MetTyr: 1.206 ± 0.152
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.284 ± 0.268
AsnCys: 0.464 ± 0.114
AsnAsp: 3.6 ± 0.257
AsnGlu: 3.859 ± 0.274
AsnPhe: 2.208 ± 0.214
AsnGly: 3.859 ± 0.243
AsnHis: 1.113 ± 0.119
AsnIle: 3.841 ± 0.263
AsnLys: 4.101 ± 0.257
AsnLeu: 3.674 ± 0.27
AsnMet: 1.336 ± 0.157
AsnAsn: 2.746 ± 0.283
AsnPro: 2.264 ± 0.292
AsnGln: 1.893 ± 0.15
AsnArg: 2.189 ± 0.213
AsnSer: 3.692 ± 0.274
AsnThr: 3.6 ± 0.313
AsnVal: 3.878 ± 0.289
AsnTrp: 0.649 ± 0.113
AsnTyr: 2.078 ± 0.191
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.319 ± 0.261
ProCys: 0.408 ± 0.088
ProAsp: 2.338 ± 0.23
ProGlu: 2.913 ± 0.238
ProPhe: 1.41 ± 0.157
ProGly: 2.171 ± 0.22
ProHis: 0.687 ± 0.116
ProIle: 1.818 ± 0.211
ProLys: 1.633 ± 0.173
ProLeu: 2.542 ± 0.206
ProMet: 0.761 ± 0.133
ProAsn: 1.967 ± 0.183
ProPro: 1.206 ± 0.177
ProGln: 1.002 ± 0.131
ProArg: 1.317 ± 0.16
ProSer: 2.245 ± 0.226
ProThr: 2.189 ± 0.21
ProVal: 2.839 ± 0.305
ProTrp: 0.575 ± 0.101
ProTyr: 1.317 ± 0.166
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.338 ± 0.235
GlnCys: 0.278 ± 0.076
GlnAsp: 1.726 ± 0.184
GlnGlu: 2.208 ± 0.243
GlnPhe: 1.577 ± 0.186
GlnGly: 2.394 ± 0.207
GlnHis: 0.631 ± 0.118
GlnIle: 2.505 ± 0.22
GlnLys: 2.097 ± 0.218
GlnLeu: 3.284 ± 0.221
GlnMet: 1.058 ± 0.146
GlnAsn: 1.392 ± 0.182
GlnPro: 1.058 ± 0.148
GlnGln: 1.225 ± 0.155
GlnArg: 1.818 ± 0.17
GlnSer: 2.115 ± 0.193
GlnThr: 2.264 ± 0.212
GlnVal: 2.227 ± 0.227
GlnTrp: 0.575 ± 0.129
GlnTyr: 1.744 ± 0.182
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.043 ± 0.259
ArgCys: 0.52 ± 0.106
ArgAsp: 2.895 ± 0.255
ArgGlu: 2.95 ± 0.257
ArgPhe: 1.948 ± 0.214
ArgGly: 2.264 ± 0.188
ArgHis: 0.687 ± 0.125
ArgIle: 3.154 ± 0.267
ArgLys: 3.637 ± 0.318
ArgLeu: 3.34 ± 0.238
ArgMet: 1.132 ± 0.125
ArgAsn: 1.948 ± 0.175
ArgPro: 1.076 ± 0.13
ArgGln: 1.54 ± 0.183
ArgArg: 2.078 ± 0.225
ArgSer: 2.728 ± 0.237
ArgThr: 2.486 ± 0.218
ArgVal: 3.21 ± 0.25
ArgTrp: 0.761 ± 0.135
ArgTyr: 1.726 ± 0.204
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.527 ± 0.308
SerCys: 0.557 ± 0.132
SerAsp: 4.416 ± 0.284
SerGlu: 4.008 ± 0.27
SerPhe: 2.449 ± 0.222
SerGly: 5.028 ± 0.354
SerHis: 1.095 ± 0.141
SerIle: 4.286 ± 0.28
SerLys: 5.418 ± 0.328
SerLeu: 5.27 ± 0.288
SerMet: 1.614 ± 0.21
SerAsn: 3.247 ± 0.277
SerPro: 2.431 ± 0.213
SerGln: 2.152 ± 0.219
SerArg: 3.247 ± 0.249
SerSer: 5.14 ± 0.37
SerThr: 4.713 ± 0.372
SerVal: 4.62 ± 0.275
SerTrp: 1.002 ± 0.142
SerTyr: 3.136 ± 0.23
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.435 ± 0.454
ThrCys: 0.557 ± 0.104
ThrAsp: 3.785 ± 0.371
ThrGlu: 3.507 ± 0.27
ThrPhe: 2.709 ± 0.27
ThrGly: 4.323 ± 0.345
ThrHis: 0.983 ± 0.147
ThrIle: 4.026 ± 0.231
ThrLys: 3.804 ± 0.281
ThrLeu: 4.769 ± 0.318
ThrMet: 1.076 ± 0.167
ThrAsn: 3.247 ± 0.308
ThrPro: 2.728 ± 0.311
ThrGln: 2.449 ± 0.209
ThrArg: 2.616 ± 0.231
ThrSer: 4.416 ± 0.406
ThrThr: 3.841 ± 0.435
ThrVal: 4.323 ± 0.329
ThrTrp: 0.983 ± 0.133
ThrTyr: 2.375 ± 0.207
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.564 ± 0.283
ValCys: 0.724 ± 0.102
ValAsp: 4.323 ± 0.262
ValGlu: 4.769 ± 0.33
ValPhe: 3.247 ± 0.274
ValGly: 3.47 ± 0.319
ValHis: 1.41 ± 0.163
ValIle: 4.527 ± 0.287
ValLys: 5.919 ± 0.385
ValLeu: 4.806 ± 0.304
ValMet: 1.763 ± 0.179
ValAsn: 4.379 ± 0.281
ValPro: 2.356 ± 0.201
ValGln: 2.338 ± 0.237
ValArg: 2.95 ± 0.238
ValSer: 4.657 ± 0.26
ValThr: 4.305 ± 0.377
ValVal: 4.546 ± 0.36
ValTrp: 1.002 ± 0.15
ValTyr: 2.839 ± 0.202
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.779 ± 0.123
TrpCys: 0.167 ± 0.054
TrpAsp: 0.816 ± 0.112
TrpGlu: 0.854 ± 0.121
TrpPhe: 0.761 ± 0.125
TrpGly: 0.631 ± 0.1
TrpHis: 0.353 ± 0.088
TrpIle: 1.058 ± 0.112
TrpLys: 1.392 ± 0.173
TrpLeu: 0.928 ± 0.14
TrpMet: 0.649 ± 0.113
TrpAsn: 0.946 ± 0.138
TrpPro: 0.427 ± 0.093
TrpGln: 0.464 ± 0.09
TrpArg: 0.501 ± 0.091
TrpSer: 0.779 ± 0.109
TrpThr: 0.891 ± 0.168
TrpVal: 1.169 ± 0.154
TrpTrp: 0.297 ± 0.084
TrpTyr: 0.779 ± 0.119
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.82 ± 0.24
TyrCys: 0.557 ± 0.095
TyrAsp: 3.395 ± 0.252
TyrGlu: 2.598 ± 0.214
TyrPhe: 1.688 ± 0.199
TyrGly: 2.412 ± 0.227
TyrHis: 0.816 ± 0.137
TyrIle: 2.728 ± 0.224
TyrLys: 2.672 ± 0.216
TyrLeu: 3.024 ± 0.265
TyrMet: 1.095 ± 0.149
TyrAsn: 2.394 ± 0.206
TyrPro: 1.503 ± 0.192
TyrGln: 1.596 ± 0.18
TyrArg: 2.078 ± 0.182
TyrSer: 3.006 ± 0.186
TyrThr: 2.022 ± 0.212
TyrVal: 2.895 ± 0.244
TyrTrp: 0.612 ± 0.091
TyrTyr: 1.54 ± 0.209
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 291 proteins (53896 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
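A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values serves as the error. This is a sketch under assumptions, not the database's code: the helper names, the toy sequences, the one-letter residue codes, and the use of the sample standard deviation as the reported error are all hypothetical choices.

```python
import random
from collections import Counter
from statistics import stdev

def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return stdev(estimates)

# Hypothetical toy sequences (not from the phage proteome).
toy_proteome = ["MKAAL", "MAAK", "GGG"]
print(pair_permille(toy_proteome, "AA"), bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.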

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski