Amino acid dipepetide frequency for Dickeya phage vB_DsoM_JA13

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.611AlaAla: 6.611 ± 0.449
0.661AlaCys: 0.661 ± 0.09
4.89AlaAsp: 4.89 ± 0.253
5.065AlaGlu: 5.065 ± 0.318
3.218AlaPhe: 3.218 ± 0.195
4.316AlaGly: 4.316 ± 0.298
1.559AlaHis: 1.559 ± 0.131
5.139AlaIle: 5.139 ± 0.217
5.975AlaLys: 5.975 ± 0.377
6.512AlaLeu: 6.512 ± 0.281
1.971AlaMet: 1.971 ± 0.15
3.717AlaAsn: 3.717 ± 0.276
3.106AlaPro: 3.106 ± 0.233
3.019AlaGln: 3.019 ± 0.206
4.166AlaArg: 4.166 ± 0.248
4.528AlaSer: 4.528 ± 0.267
4.354AlaThr: 4.354 ± 0.3
4.503AlaVal: 4.503 ± 0.254
0.624AlaTrp: 0.624 ± 0.087
2.532AlaTyr: 2.532 ± 0.192
0.0AlaXaa: 0.0 ± 0.0
Cys
0.761CysAla: 0.761 ± 0.102
0.137CysCys: 0.137 ± 0.044
0.699CysAsp: 0.699 ± 0.093
0.873CysGlu: 0.873 ± 0.127
0.499CysPhe: 0.499 ± 0.071
0.873CysGly: 0.873 ± 0.101
0.237CysHis: 0.237 ± 0.05
0.624CysIle: 0.624 ± 0.088
0.686CysLys: 0.686 ± 0.1
0.948CysLeu: 0.948 ± 0.1
0.324CysMet: 0.324 ± 0.068
0.349CysAsn: 0.349 ± 0.076
0.724CysPro: 0.724 ± 0.098
0.237CysGln: 0.237 ± 0.053
0.574CysArg: 0.574 ± 0.081
0.699CysSer: 0.699 ± 0.106
0.524CysThr: 0.524 ± 0.1
0.748CysVal: 0.748 ± 0.103
0.062CysTrp: 0.062 ± 0.03
0.362CysTyr: 0.362 ± 0.06
0.0CysXaa: 0.0 ± 0.0
Asp
4.815AspAla: 4.815 ± 0.287
0.561AspCys: 0.561 ± 0.102
5.202AspAsp: 5.202 ± 0.627
5.439AspGlu: 5.439 ± 0.602
3.256AspPhe: 3.256 ± 0.228
4.528AspGly: 4.528 ± 0.335
0.911AspHis: 0.911 ± 0.097
4.054AspIle: 4.054 ± 0.238
3.406AspLys: 3.406 ± 0.266
5.875AspLeu: 5.875 ± 0.262
1.509AspMet: 1.509 ± 0.149
2.545AspAsn: 2.545 ± 0.184
3.343AspPro: 3.343 ± 0.228
2.008AspGln: 2.008 ± 0.131
2.969AspArg: 2.969 ± 0.211
4.616AspSer: 4.616 ± 0.291
3.281AspThr: 3.281 ± 0.198
4.154AspVal: 4.154 ± 0.223
0.911AspTrp: 0.911 ± 0.085
2.433AspTyr: 2.433 ± 0.152
0.0AspXaa: 0.0 ± 0.0
Glu
5.015GluAla: 5.015 ± 0.27
0.536GluCys: 0.536 ± 0.091
5.439GluAsp: 5.439 ± 0.632
6.025GluGlu: 6.025 ± 0.78
2.956GluPhe: 2.956 ± 0.196
3.169GluGly: 3.169 ± 0.213
1.447GluHis: 1.447 ± 0.135
4.69GluIle: 4.69 ± 0.24
4.69GluLys: 4.69 ± 0.334
5.801GluLeu: 5.801 ± 0.294
1.934GluMet: 1.934 ± 0.169
3.63GluAsn: 3.63 ± 0.231
1.834GluPro: 1.834 ± 0.144
2.433GluGln: 2.433 ± 0.179
3.156GluArg: 3.156 ± 0.225
4.341GluSer: 4.341 ± 0.302
3.53GluThr: 3.53 ± 0.248
4.453GluVal: 4.453 ± 0.237
0.923GluTrp: 0.923 ± 0.112
2.395GluTyr: 2.395 ± 0.183
0.0GluXaa: 0.0 ± 0.0
Phe
3.131PheAla: 3.131 ± 0.192
0.611PheCys: 0.611 ± 0.077
3.58PheAsp: 3.58 ± 0.265
3.218PheGlu: 3.218 ± 0.237
1.572PhePhe: 1.572 ± 0.14
3.006PheGly: 3.006 ± 0.189
0.724PheHis: 0.724 ± 0.105
2.732PheIle: 2.732 ± 0.178
2.358PheLys: 2.358 ± 0.199
3.031PheLeu: 3.031 ± 0.21
0.948PheMet: 0.948 ± 0.103
2.308PheAsn: 2.308 ± 0.145
1.771PhePro: 1.771 ± 0.173
1.085PheGln: 1.085 ± 0.12
2.37PheArg: 2.37 ± 0.17
3.056PheSer: 3.056 ± 0.188
2.62PheThr: 2.62 ± 0.196
3.069PheVal: 3.069 ± 0.201
0.399PheTrp: 0.399 ± 0.069
1.484PheTyr: 1.484 ± 0.159
0.0PheXaa: 0.0 ± 0.0
Gly
4.366GlyAla: 4.366 ± 0.301
0.574GlyCys: 0.574 ± 0.091
3.593GlyAsp: 3.593 ± 0.256
3.742GlyGlu: 3.742 ± 0.254
2.657GlyPhe: 2.657 ± 0.184
3.805GlyGly: 3.805 ± 0.374
0.736GlyHis: 0.736 ± 0.099
3.618GlyIle: 3.618 ± 0.229
5.302GlyLys: 5.302 ± 0.329
4.092GlyLeu: 4.092 ± 0.203
1.759GlyMet: 1.759 ± 0.128
3.081GlyAsn: 3.081 ± 0.279
1.16GlyPro: 1.16 ± 0.282
1.697GlyGln: 1.697 ± 0.172
2.844GlyArg: 2.844 ± 0.196
4.74GlySer: 4.74 ± 0.277
4.291GlyThr: 4.291 ± 0.306
4.079GlyVal: 4.079 ± 0.239
0.786GlyTrp: 0.786 ± 0.091
2.507GlyTyr: 2.507 ± 0.197
0.0GlyXaa: 0.0 ± 0.0
His
1.385HisAla: 1.385 ± 0.118
0.387HisCys: 0.387 ± 0.078
0.836HisAsp: 0.836 ± 0.103
1.297HisGlu: 1.297 ± 0.111
0.948HisPhe: 0.948 ± 0.101
1.21HisGly: 1.21 ± 0.117
0.262HisHis: 0.262 ± 0.057
1.123HisIle: 1.123 ± 0.111
1.335HisLys: 1.335 ± 0.114
1.397HisLeu: 1.397 ± 0.144
0.487HisMet: 0.487 ± 0.086
0.898HisAsn: 0.898 ± 0.114
0.724HisPro: 0.724 ± 0.105
0.274HisGln: 0.274 ± 0.059
1.148HisArg: 1.148 ± 0.121
1.06HisSer: 1.06 ± 0.111
0.911HisThr: 0.911 ± 0.11
1.372HisVal: 1.372 ± 0.136
0.237HisTrp: 0.237 ± 0.057
0.873HisTyr: 0.873 ± 0.103
0.0HisXaa: 0.0 ± 0.0
Ile
4.89IleAla: 4.89 ± 0.245
0.773IleCys: 0.773 ± 0.09
4.004IleAsp: 4.004 ± 0.219
4.516IleGlu: 4.516 ± 0.255
2.133IlePhe: 2.133 ± 0.175
3.705IleGly: 3.705 ± 0.244
1.272IleHis: 1.272 ± 0.118
2.719IleIle: 2.719 ± 0.199
3.742IleLys: 3.742 ± 0.217
4.616IleLeu: 4.616 ± 0.258
1.372IleMet: 1.372 ± 0.125
2.931IleAsn: 2.931 ± 0.207
3.368IlePro: 3.368 ± 0.192
2.408IleGln: 2.408 ± 0.164
3.705IleArg: 3.705 ± 0.207
4.778IleSer: 4.778 ± 0.224
3.406IleThr: 3.406 ± 0.207
4.653IleVal: 4.653 ± 0.256
0.549IleTrp: 0.549 ± 0.088
1.859IleTyr: 1.859 ± 0.156
0.0IleXaa: 0.0 ± 0.0
Lys
5.127LysAla: 5.127 ± 0.385
0.449LysCys: 0.449 ± 0.072
3.717LysAsp: 3.717 ± 0.215
4.416LysGlu: 4.416 ± 0.245
2.919LysPhe: 2.919 ± 0.252
3.493LysGly: 3.493 ± 0.238
1.31LysHis: 1.31 ± 0.145
4.004LysIle: 4.004 ± 0.244
6.674LysLys: 6.674 ± 0.712
5.988LysLeu: 5.988 ± 0.294
1.784LysMet: 1.784 ± 0.157
3.119LysAsn: 3.119 ± 0.231
2.769LysPro: 2.769 ± 0.219
2.707LysGln: 2.707 ± 0.183
3.568LysArg: 3.568 ± 0.263
4.466LysSer: 4.466 ± 0.349
4.566LysThr: 4.566 ± 0.285
4.566LysVal: 4.566 ± 0.241
0.599LysTrp: 0.599 ± 0.092
2.52LysTyr: 2.52 ± 0.2
0.0LysXaa: 0.0 ± 0.0
Leu
6.424LeuAla: 6.424 ± 0.325
0.923LeuCys: 0.923 ± 0.099
5.139LeuAsp: 5.139 ± 0.235
5.551LeuGlu: 5.551 ± 0.293
3.468LeuPhe: 3.468 ± 0.228
4.129LeuGly: 4.129 ± 0.224
1.721LeuHis: 1.721 ± 0.135
4.678LeuIle: 4.678 ± 0.215
5.214LeuLys: 5.214 ± 0.341
6.424LeuLeu: 6.424 ± 0.314
2.158LeuMet: 2.158 ± 0.167
4.828LeuAsn: 4.828 ± 0.264
4.154LeuPro: 4.154 ± 0.252
2.395LeuGln: 2.395 ± 0.16
5.027LeuArg: 5.027 ± 0.269
6.998LeuSer: 6.998 ± 0.375
4.416LeuThr: 4.416 ± 0.242
5.289LeuVal: 5.289 ± 0.227
0.798LeuTrp: 0.798 ± 0.102
2.557LeuTyr: 2.557 ± 0.204
0.0LeuXaa: 0.0 ± 0.0
Met
1.884MetAla: 1.884 ± 0.165
0.399MetCys: 0.399 ± 0.084
1.422MetAsp: 1.422 ± 0.138
1.746MetGlu: 1.746 ± 0.156
1.135MetPhe: 1.135 ± 0.13
1.01MetGly: 1.01 ± 0.119
0.349MetHis: 0.349 ± 0.068
1.385MetIle: 1.385 ± 0.124
1.934MetLys: 1.934 ± 0.147
2.071MetLeu: 2.071 ± 0.159
0.649MetMet: 0.649 ± 0.09
0.973MetAsn: 0.973 ± 0.118
1.26MetPro: 1.26 ± 0.113
1.185MetGln: 1.185 ± 0.123
1.634MetArg: 1.634 ± 0.147
2.058MetSer: 2.058 ± 0.146
1.397MetThr: 1.397 ± 0.124
1.272MetVal: 1.272 ± 0.11
0.262MetTrp: 0.262 ± 0.06
0.861MetTyr: 0.861 ± 0.094
0.0MetXaa: 0.0 ± 0.0
Asn
4.092AsnAla: 4.092 ± 0.262
0.536AsnCys: 0.536 ± 0.085
2.67AsnAsp: 2.67 ± 0.157
2.707AsnGlu: 2.707 ± 0.171
2.358AsnPhe: 2.358 ± 0.198
3.48AsnGly: 3.48 ± 0.223
0.574AsnHis: 0.574 ± 0.075
3.381AsnIle: 3.381 ± 0.256
3.144AsnLys: 3.144 ± 0.213
4.017AsnLeu: 4.017 ± 0.209
1.173AsnMet: 1.173 ± 0.117
2.033AsnAsn: 2.033 ± 0.187
2.532AsnPro: 2.532 ± 0.2
1.709AsnGln: 1.709 ± 0.153
2.108AsnArg: 2.108 ± 0.162
3.281AsnSer: 3.281 ± 0.195
2.67AsnThr: 2.67 ± 0.195
3.58AsnVal: 3.58 ± 0.235
0.549AsnTrp: 0.549 ± 0.074
1.609AsnTyr: 1.609 ± 0.159
0.0AsnXaa: 0.0 ± 0.0
Pro
2.944ProAla: 2.944 ± 0.239
0.574ProCys: 0.574 ± 0.076
3.243ProAsp: 3.243 ± 0.179
3.418ProGlu: 3.418 ± 0.245
1.946ProPhe: 1.946 ± 0.16
1.958ProGly: 1.958 ± 0.193
0.699ProHis: 0.699 ± 0.093
2.694ProIle: 2.694 ± 0.161
3.48ProLys: 3.48 ± 0.242
3.206ProLeu: 3.206 ± 0.215
0.985ProMet: 0.985 ± 0.106
2.033ProAsn: 2.033 ± 0.161
1.21ProPro: 1.21 ± 0.141
1.173ProGln: 1.173 ± 0.123
2.008ProArg: 2.008 ± 0.208
2.844ProSer: 2.844 ± 0.228
2.744ProThr: 2.744 ± 0.221
3.318ProVal: 3.318 ± 0.175
0.312ProTrp: 0.312 ± 0.064
1.285ProTyr: 1.285 ± 0.148
0.0ProXaa: 0.0 ± 0.0
Gln
2.507GlnAla: 2.507 ± 0.165
0.249GlnCys: 0.249 ± 0.055
1.846GlnAsp: 1.846 ± 0.142
1.971GlnGlu: 1.971 ± 0.165
1.422GlnPhe: 1.422 ± 0.131
1.522GlnGly: 1.522 ± 0.12
0.674GlnHis: 0.674 ± 0.082
2.882GlnIle: 2.882 ± 0.179
2.345GlnLys: 2.345 ± 0.157
3.144GlnLeu: 3.144 ± 0.203
0.761GlnMet: 0.761 ± 0.102
1.821GlnAsn: 1.821 ± 0.171
1.247GlnPro: 1.247 ± 0.119
1.36GlnGln: 1.36 ± 0.169
1.871GlnArg: 1.871 ± 0.198
2.208GlnSer: 2.208 ± 0.178
2.108GlnThr: 2.108 ± 0.173
2.033GlnVal: 2.033 ± 0.187
0.474GlnTrp: 0.474 ± 0.087
1.484GlnTyr: 1.484 ± 0.144
0.0GlnXaa: 0.0 ± 0.0
Arg
3.892ArgAla: 3.892 ± 0.222
0.624ArgCys: 0.624 ± 0.086
3.243ArgAsp: 3.243 ± 0.219
3.131ArgGlu: 3.131 ± 0.189
2.333ArgPhe: 2.333 ± 0.194
2.981ArgGly: 2.981 ± 0.239
0.998ArgHis: 0.998 ± 0.123
3.767ArgIle: 3.767 ± 0.246
3.892ArgLys: 3.892 ± 0.282
4.653ArgLeu: 4.653 ± 0.272
1.622ArgMet: 1.622 ± 0.135
2.832ArgAsn: 2.832 ± 0.237
1.721ArgPro: 1.721 ± 0.16
1.896ArgGln: 1.896 ± 0.173
3.019ArgArg: 3.019 ± 0.204
3.406ArgSer: 3.406 ± 0.215
2.907ArgThr: 2.907 ± 0.19
3.742ArgVal: 3.742 ± 0.227
0.586ArgTrp: 0.586 ± 0.074
1.983ArgTyr: 1.983 ± 0.139
0.0ArgXaa: 0.0 ± 0.0
Ser
5.826SerAla: 5.826 ± 0.258
0.699SerCys: 0.699 ± 0.092
4.902SerAsp: 4.902 ± 0.261
4.89SerGlu: 4.89 ± 0.353
2.944SerPhe: 2.944 ± 0.212
4.952SerGly: 4.952 ± 0.247
1.272SerHis: 1.272 ± 0.124
4.179SerIle: 4.179 ± 0.259
4.241SerLys: 4.241 ± 0.281
5.938SerLeu: 5.938 ± 0.283
1.796SerMet: 1.796 ± 0.154
3.331SerAsn: 3.331 ± 0.208
2.894SerPro: 2.894 ± 0.229
2.171SerGln: 2.171 ± 0.144
3.056SerArg: 3.056 ± 0.204
4.616SerSer: 4.616 ± 0.356
4.004SerThr: 4.004 ± 0.24
5.04SerVal: 5.04 ± 0.247
0.611SerTrp: 0.611 ± 0.087
2.408SerTyr: 2.408 ± 0.207
0.0SerXaa: 0.0 ± 0.0
Thr
4.528ThrAla: 4.528 ± 0.311
0.611ThrCys: 0.611 ± 0.094
3.593ThrAsp: 3.593 ± 0.259
3.243ThrGlu: 3.243 ± 0.214
2.532ThrPhe: 2.532 ± 0.168
4.279ThrGly: 4.279 ± 0.373
1.21ThrHis: 1.21 ± 0.115
3.792ThrIle: 3.792 ± 0.247
3.618ThrLys: 3.618 ± 0.264
5.489ThrLeu: 5.489 ± 0.268
1.023ThrMet: 1.023 ± 0.117
2.62ThrAsn: 2.62 ± 0.201
3.019ThrPro: 3.019 ± 0.207
1.971ThrGln: 1.971 ± 0.171
2.607ThrArg: 2.607 ± 0.167
3.593ThrSer: 3.593 ± 0.257
2.857ThrThr: 2.857 ± 0.208
4.665ThrVal: 4.665 ± 0.275
0.674ThrTrp: 0.674 ± 0.096
1.909ThrTyr: 1.909 ± 0.164
0.0ThrXaa: 0.0 ± 0.0
Val
4.94ValAla: 4.94 ± 0.233
0.823ValCys: 0.823 ± 0.114
4.79ValAsp: 4.79 ± 0.261
4.69ValGlu: 4.69 ± 0.267
2.657ValPhe: 2.657 ± 0.188
3.992ValGly: 3.992 ± 0.262
1.073ValHis: 1.073 ± 0.114
3.53ValIle: 3.53 ± 0.187
4.142ValLys: 4.142 ± 0.242
5.564ValLeu: 5.564 ± 0.254
1.347ValMet: 1.347 ± 0.149
3.069ValAsn: 3.069 ± 0.218
3.43ValPro: 3.43 ± 0.224
2.482ValGln: 2.482 ± 0.192
4.528ValArg: 4.528 ± 0.216
5.189ValSer: 5.189 ± 0.27
4.279ValThr: 4.279 ± 0.247
5.713ValVal: 5.713 ± 0.327
0.911ValTrp: 0.911 ± 0.117
2.408ValTyr: 2.408 ± 0.164
0.0ValXaa: 0.0 ± 0.0
Trp
0.674TrpAla: 0.674 ± 0.091
0.187TrpCys: 0.187 ± 0.049
0.511TrpAsp: 0.511 ± 0.064
0.611TrpGlu: 0.611 ± 0.1
0.499TrpPhe: 0.499 ± 0.093
0.574TrpGly: 0.574 ± 0.095
0.399TrpHis: 0.399 ± 0.067
0.674TrpIle: 0.674 ± 0.087
0.624TrpLys: 0.624 ± 0.083
0.911TrpLeu: 0.911 ± 0.095
0.349TrpMet: 0.349 ± 0.073
0.474TrpAsn: 0.474 ± 0.082
0.437TrpPro: 0.437 ± 0.085
0.474TrpGln: 0.474 ± 0.075
0.736TrpArg: 0.736 ± 0.095
0.661TrpSer: 0.661 ± 0.095
0.761TrpThr: 0.761 ± 0.096
0.773TrpVal: 0.773 ± 0.105
0.05TrpTrp: 0.05 ± 0.024
0.324TrpTyr: 0.324 ± 0.06
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.657TyrAla: 2.657 ± 0.19
0.624TyrCys: 0.624 ± 0.092
2.495TyrAsp: 2.495 ± 0.164
1.846TyrGlu: 1.846 ± 0.152
1.622TyrPhe: 1.622 ± 0.141
2.445TyrGly: 2.445 ± 0.189
0.736TyrHis: 0.736 ± 0.093
1.734TyrIle: 1.734 ± 0.159
1.958TyrLys: 1.958 ± 0.171
2.62TyrLeu: 2.62 ± 0.215
0.923TyrMet: 0.923 ± 0.111
1.609TyrAsn: 1.609 ± 0.175
1.372TyrPro: 1.372 ± 0.111
1.247TyrGln: 1.247 ± 0.112
2.133TyrArg: 2.133 ± 0.165
2.632TyrSer: 2.632 ± 0.17
2.208TyrThr: 2.208 ± 0.214
2.545TyrVal: 2.545 ± 0.179
0.387TyrTrp: 0.387 ± 0.069
1.285TyrTyr: 1.285 ± 0.139
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 323 proteins (80165 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski