Amino acid dipepetide frequency for Pseudomonas phage EL

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.42AlaAla: 4.42 ± 0.35
0.524AlaCys: 0.524 ± 0.086
3.296AlaAsp: 3.296 ± 0.223
4.05AlaGlu: 4.05 ± 0.288
3.249AlaPhe: 3.249 ± 0.209
4.312AlaGly: 4.312 ± 0.379
1.232AlaHis: 1.232 ± 0.141
3.573AlaIle: 3.573 ± 0.25
4.281AlaLys: 4.281 ± 0.291
6.268AlaLeu: 6.268 ± 0.365
1.709AlaMet: 1.709 ± 0.145
3.296AlaAsn: 3.296 ± 0.249
1.925AlaPro: 1.925 ± 0.15
1.833AlaGln: 1.833 ± 0.165
3.034AlaArg: 3.034 ± 0.242
3.65AlaSer: 3.65 ± 0.238
3.773AlaThr: 3.773 ± 0.263
4.312AlaVal: 4.312 ± 0.237
0.878AlaTrp: 0.878 ± 0.12
2.71AlaTyr: 2.71 ± 0.211
0.0AlaXaa: 0.0 ± 0.0
Cys
0.416CysAla: 0.416 ± 0.075
0.108CysCys: 0.108 ± 0.037
0.462CysAsp: 0.462 ± 0.093
0.416CysGlu: 0.416 ± 0.081
0.416CysPhe: 0.416 ± 0.091
0.601CysGly: 0.601 ± 0.117
0.216CysHis: 0.216 ± 0.062
0.524CysIle: 0.524 ± 0.112
0.493CysLys: 0.493 ± 0.078
0.785CysLeu: 0.785 ± 0.112
0.277CysMet: 0.277 ± 0.065
0.385CysAsn: 0.385 ± 0.092
0.4CysPro: 0.4 ± 0.1
0.354CysGln: 0.354 ± 0.065
0.431CysArg: 0.431 ± 0.087
0.57CysSer: 0.57 ± 0.1
0.447CysThr: 0.447 ± 0.088
0.693CysVal: 0.693 ± 0.104
0.031CysTrp: 0.031 ± 0.029
0.416CysTyr: 0.416 ± 0.089
0.0CysXaa: 0.0 ± 0.0
Asp
3.573AspAla: 3.573 ± 0.252
0.554AspCys: 0.554 ± 0.098
2.988AspAsp: 2.988 ± 0.21
3.912AspGlu: 3.912 ± 0.224
2.664AspPhe: 2.664 ± 0.227
4.343AspGly: 4.343 ± 0.331
1.34AspHis: 1.34 ± 0.167
3.958AspIle: 3.958 ± 0.265
3.711AspLys: 3.711 ± 0.268
6.006AspLeu: 6.006 ± 0.3
1.109AspMet: 1.109 ± 0.128
3.111AspAsn: 3.111 ± 0.229
3.373AspPro: 3.373 ± 0.223
1.971AspGln: 1.971 ± 0.158
3.373AspArg: 3.373 ± 0.254
2.972AspSer: 2.972 ± 0.217
3.234AspThr: 3.234 ± 0.218
3.896AspVal: 3.896 ± 0.257
0.755AspTrp: 0.755 ± 0.104
2.864AspTyr: 2.864 ± 0.215
0.0AspXaa: 0.0 ± 0.0
Glu
4.743GluAla: 4.743 ± 0.325
0.662GluCys: 0.662 ± 0.121
4.389GluAsp: 4.389 ± 0.289
5.236GluGlu: 5.236 ± 0.425
3.018GluPhe: 3.018 ± 0.197
4.774GluGly: 4.774 ± 0.336
1.632GluHis: 1.632 ± 0.172
3.835GluIle: 3.835 ± 0.218
3.45GluLys: 3.45 ± 0.291
7.1GluLeu: 7.1 ± 0.358
1.786GluMet: 1.786 ± 0.149
3.804GluAsn: 3.804 ± 0.261
2.171GluPro: 2.171 ± 0.175
2.911GluGln: 2.911 ± 0.192
3.573GluArg: 3.573 ± 0.239
3.095GluSer: 3.095 ± 0.167
4.127GluThr: 4.127 ± 0.214
5.359GluVal: 5.359 ± 0.322
1.201GluTrp: 1.201 ± 0.147
2.556GluTyr: 2.556 ± 0.217
0.0GluXaa: 0.0 ± 0.0
Phe
2.279PheAla: 2.279 ± 0.215
0.493PheCys: 0.493 ± 0.078
2.911PheAsp: 2.911 ± 0.22
2.341PheGlu: 2.341 ± 0.24
1.863PhePhe: 1.863 ± 0.2
2.88PheGly: 2.88 ± 0.27
0.785PheHis: 0.785 ± 0.113
2.88PheIle: 2.88 ± 0.191
3.142PheLys: 3.142 ± 0.251
3.142PheLeu: 3.142 ± 0.212
0.924PheMet: 0.924 ± 0.115
3.219PheAsn: 3.219 ± 0.203
1.725PhePro: 1.725 ± 0.173
0.924PheGln: 0.924 ± 0.125
2.202PheArg: 2.202 ± 0.186
3.157PheSer: 3.157 ± 0.216
3.234PheThr: 3.234 ± 0.197
2.68PheVal: 2.68 ± 0.244
0.385PheTrp: 0.385 ± 0.074
1.91PheTyr: 1.91 ± 0.19
0.0PheXaa: 0.0 ± 0.0
Gly
3.388GlyAla: 3.388 ± 0.282
0.431GlyCys: 0.431 ± 0.077
4.019GlyAsp: 4.019 ± 0.289
5.251GlyGlu: 5.251 ± 0.362
2.757GlyPhe: 2.757 ± 0.263
5.421GlyGly: 5.421 ± 0.497
1.063GlyHis: 1.063 ± 0.124
4.112GlyIle: 4.112 ± 0.291
5.236GlyLys: 5.236 ± 0.358
6.422GlyLeu: 6.422 ± 0.341
2.002GlyMet: 2.002 ± 0.272
3.835GlyAsn: 3.835 ± 0.26
1.663GlyPro: 1.663 ± 0.183
2.187GlyGln: 2.187 ± 0.193
3.419GlyArg: 3.419 ± 0.241
3.912GlySer: 3.912 ± 0.344
3.927GlyThr: 3.927 ± 0.272
5.621GlyVal: 5.621 ± 0.294
1.14GlyTrp: 1.14 ± 0.129
3.157GlyTyr: 3.157 ± 0.249
0.0GlyXaa: 0.0 ± 0.0
His
0.909HisAla: 0.909 ± 0.117
0.154HisCys: 0.154 ± 0.045
0.955HisAsp: 0.955 ± 0.135
1.232HisGlu: 1.232 ± 0.137
1.001HisPhe: 1.001 ± 0.143
1.124HisGly: 1.124 ± 0.126
0.554HisHis: 0.554 ± 0.085
1.186HisIle: 1.186 ± 0.155
0.893HisLys: 0.893 ± 0.117
2.541HisLeu: 2.541 ± 0.211
0.385HisMet: 0.385 ± 0.084
0.77HisAsn: 0.77 ± 0.085
1.355HisPro: 1.355 ± 0.15
1.016HisGln: 1.016 ± 0.138
1.093HisArg: 1.093 ± 0.105
1.432HisSer: 1.432 ± 0.161
1.186HisThr: 1.186 ± 0.16
1.278HisVal: 1.278 ± 0.124
0.385HisTrp: 0.385 ± 0.072
1.047HisTyr: 1.047 ± 0.138
0.0HisXaa: 0.0 ± 0.0
Ile
3.819IleAla: 3.819 ± 0.277
0.416IleCys: 0.416 ± 0.082
4.035IleAsp: 4.035 ± 0.266
4.127IleGlu: 4.127 ± 0.244
2.141IlePhe: 2.141 ± 0.206
3.665IleGly: 3.665 ± 0.285
1.371IleHis: 1.371 ± 0.153
2.926IleIle: 2.926 ± 0.192
3.758IleLys: 3.758 ± 0.308
4.435IleLeu: 4.435 ± 0.251
0.955IleMet: 0.955 ± 0.129
3.788IleAsn: 3.788 ± 0.261
3.065IlePro: 3.065 ± 0.234
2.202IleGln: 2.202 ± 0.201
3.542IleArg: 3.542 ± 0.255
3.234IleSer: 3.234 ± 0.219
4.096IleThr: 4.096 ± 0.266
3.758IleVal: 3.758 ± 0.241
0.601IleTrp: 0.601 ± 0.092
1.879IleTyr: 1.879 ± 0.178
0.0IleXaa: 0.0 ± 0.0
Lys
5.128LysAla: 5.128 ± 0.34
0.216LysCys: 0.216 ± 0.061
4.42LysAsp: 4.42 ± 0.259
5.159LysGlu: 5.159 ± 0.305
1.786LysPhe: 1.786 ± 0.194
4.389LysGly: 4.389 ± 0.361
1.448LysHis: 1.448 ± 0.151
3.034LysIle: 3.034 ± 0.2
2.895LysLys: 2.895 ± 0.258
6.068LysLeu: 6.068 ± 0.28
1.247LysMet: 1.247 ± 0.165
2.541LysAsn: 2.541 ± 0.216
2.402LysPro: 2.402 ± 0.167
2.31LysGln: 2.31 ± 0.18
3.203LysArg: 3.203 ± 0.225
2.556LysSer: 2.556 ± 0.188
3.896LysThr: 3.896 ± 0.298
4.666LysVal: 4.666 ± 0.266
0.678LysTrp: 0.678 ± 0.104
1.802LysTyr: 1.802 ± 0.182
0.0LysXaa: 0.0 ± 0.0
Leu
5.729LeuAla: 5.729 ± 0.3
0.939LeuCys: 0.939 ± 0.126
5.867LeuAsp: 5.867 ± 0.303
6.745LeuGlu: 6.745 ± 0.3
4.05LeuPhe: 4.05 ± 0.243
5.975LeuGly: 5.975 ± 0.476
1.756LeuHis: 1.756 ± 0.183
5.344LeuIle: 5.344 ± 0.32
6.222LeuLys: 6.222 ± 0.328
7.654LeuLeu: 7.654 ± 0.368
2.279LeuMet: 2.279 ± 0.196
5.39LeuAsn: 5.39 ± 0.278
4.528LeuPro: 4.528 ± 0.32
2.988LeuGln: 2.988 ± 0.206
4.389LeuArg: 4.389 ± 0.227
6.391LeuSer: 6.391 ± 0.307
6.761LeuThr: 6.761 ± 0.282
6.514LeuVal: 6.514 ± 0.369
0.909LeuTrp: 0.909 ± 0.137
3.511LeuTyr: 3.511 ± 0.23
0.0LeuXaa: 0.0 ± 0.0
Met
1.848MetAla: 1.848 ± 0.158
0.216MetCys: 0.216 ± 0.048
1.494MetAsp: 1.494 ± 0.165
1.525MetGlu: 1.525 ± 0.144
0.77MetPhe: 0.77 ± 0.12
1.632MetGly: 1.632 ± 0.176
0.262MetHis: 0.262 ± 0.071
1.247MetIle: 1.247 ± 0.121
1.463MetLys: 1.463 ± 0.131
2.094MetLeu: 2.094 ± 0.155
0.724MetMet: 0.724 ± 0.132
1.463MetAsn: 1.463 ± 0.162
0.924MetPro: 0.924 ± 0.116
0.662MetGln: 0.662 ± 0.099
1.016MetArg: 1.016 ± 0.109
1.879MetSer: 1.879 ± 0.166
1.648MetThr: 1.648 ± 0.137
1.694MetVal: 1.694 ± 0.139
0.2MetTrp: 0.2 ± 0.055
0.616MetTyr: 0.616 ± 0.084
0.0MetXaa: 0.0 ± 0.0
Asn
4.004AsnAla: 4.004 ± 0.243
0.339AsnCys: 0.339 ± 0.081
3.049AsnAsp: 3.049 ± 0.205
2.911AsnGlu: 2.911 ± 0.222
2.017AsnPhe: 2.017 ± 0.147
4.173AsnGly: 4.173 ± 0.256
0.939AsnHis: 0.939 ± 0.125
2.572AsnIle: 2.572 ± 0.225
3.819AsnLys: 3.819 ± 0.226
4.913AsnLeu: 4.913 ± 0.274
1.109AsnMet: 1.109 ± 0.142
2.757AsnAsn: 2.757 ± 0.216
3.742AsnPro: 3.742 ± 0.248
2.341AsnGln: 2.341 ± 0.185
3.172AsnArg: 3.172 ± 0.199
2.757AsnSer: 2.757 ± 0.228
3.65AsnThr: 3.65 ± 0.254
3.065AsnVal: 3.065 ± 0.189
0.57AsnTrp: 0.57 ± 0.082
1.694AsnTyr: 1.694 ± 0.153
0.0AsnXaa: 0.0 ± 0.0
Pro
2.125ProAla: 2.125 ± 0.19
0.323ProCys: 0.323 ± 0.069
2.556ProAsp: 2.556 ± 0.195
4.112ProGlu: 4.112 ± 0.235
2.372ProPhe: 2.372 ± 0.205
3.373ProGly: 3.373 ± 0.302
0.862ProHis: 0.862 ± 0.111
2.587ProIle: 2.587 ± 0.184
2.279ProLys: 2.279 ± 0.208
3.819ProLeu: 3.819 ± 0.213
1.032ProMet: 1.032 ± 0.13
2.295ProAsn: 2.295 ± 0.226
1.817ProPro: 1.817 ± 0.201
1.756ProGln: 1.756 ± 0.149
1.925ProArg: 1.925 ± 0.163
3.08ProSer: 3.08 ± 0.22
2.818ProThr: 2.818 ± 0.238
3.496ProVal: 3.496 ± 0.279
0.57ProTrp: 0.57 ± 0.104
1.679ProTyr: 1.679 ± 0.187
0.0ProXaa: 0.0 ± 0.0
Gln
2.341GlnAla: 2.341 ± 0.209
0.246GlnCys: 0.246 ± 0.064
1.894GlnAsp: 1.894 ± 0.182
2.295GlnGlu: 2.295 ± 0.192
1.756GlnPhe: 1.756 ± 0.159
2.603GlnGly: 2.603 ± 0.225
1.016GlnHis: 1.016 ± 0.135
2.033GlnIle: 2.033 ± 0.186
1.432GlnLys: 1.432 ± 0.147
3.727GlnLeu: 3.727 ± 0.258
1.093GlnMet: 1.093 ± 0.14
1.371GlnAsn: 1.371 ± 0.178
1.694GlnPro: 1.694 ± 0.173
1.509GlnGln: 1.509 ± 0.213
2.064GlnArg: 2.064 ± 0.162
1.879GlnSer: 1.879 ± 0.184
1.894GlnThr: 1.894 ± 0.168
2.526GlnVal: 2.526 ± 0.21
0.539GlnTrp: 0.539 ± 0.081
1.694GlnTyr: 1.694 ± 0.151
0.0GlnXaa: 0.0 ± 0.0
Arg
2.926ArgAla: 2.926 ± 0.229
0.493ArgCys: 0.493 ± 0.083
2.803ArgAsp: 2.803 ± 0.195
3.388ArgGlu: 3.388 ± 0.26
2.356ArgPhe: 2.356 ± 0.201
3.188ArgGly: 3.188 ± 0.294
0.847ArgHis: 0.847 ± 0.116
3.065ArgIle: 3.065 ± 0.188
3.126ArgLys: 3.126 ± 0.245
5.251ArgLeu: 5.251 ± 0.296
1.232ArgMet: 1.232 ± 0.14
2.68ArgAsn: 2.68 ± 0.2
1.771ArgPro: 1.771 ± 0.179
2.079ArgGln: 2.079 ± 0.199
2.787ArgArg: 2.787 ± 0.224
3.665ArgSer: 3.665 ± 0.279
3.018ArgThr: 3.018 ± 0.202
4.297ArgVal: 4.297 ± 0.251
0.708ArgTrp: 0.708 ± 0.117
2.433ArgTyr: 2.433 ± 0.226
0.0ArgXaa: 0.0 ± 0.0
Ser
3.681SerAla: 3.681 ± 0.279
0.508SerCys: 0.508 ± 0.084
3.342SerAsp: 3.342 ± 0.212
3.896SerGlu: 3.896 ± 0.192
2.664SerPhe: 2.664 ± 0.201
4.189SerGly: 4.189 ± 0.277
1.109SerHis: 1.109 ± 0.117
3.773SerIle: 3.773 ± 0.253
3.249SerLys: 3.249 ± 0.236
5.775SerLeu: 5.775 ± 0.283
1.555SerMet: 1.555 ± 0.131
2.88SerAsn: 2.88 ± 0.229
2.787SerPro: 2.787 ± 0.256
2.264SerGln: 2.264 ± 0.186
3.126SerArg: 3.126 ± 0.223
3.742SerSer: 3.742 ± 0.301
3.265SerThr: 3.265 ± 0.217
4.666SerVal: 4.666 ± 0.272
0.724SerTrp: 0.724 ± 0.118
1.848SerTyr: 1.848 ± 0.172
0.0SerXaa: 0.0 ± 0.0
Thr
4.297ThrAla: 4.297 ± 0.31
0.493ThrCys: 0.493 ± 0.091
3.865ThrAsp: 3.865 ± 0.285
3.896ThrGlu: 3.896 ± 0.206
2.926ThrPhe: 2.926 ± 0.223
4.589ThrGly: 4.589 ± 0.281
1.494ThrHis: 1.494 ± 0.147
3.634ThrIle: 3.634 ± 0.208
2.911ThrLys: 2.911 ± 0.231
6.838ThrLeu: 6.838 ± 0.347
1.032ThrMet: 1.032 ± 0.111
2.988ThrAsn: 2.988 ± 0.24
3.819ThrPro: 3.819 ± 0.203
2.387ThrGln: 2.387 ± 0.208
3.111ThrArg: 3.111 ± 0.214
3.511ThrSer: 3.511 ± 0.233
4.374ThrThr: 4.374 ± 0.27
4.836ThrVal: 4.836 ± 0.277
0.647ThrTrp: 0.647 ± 0.114
2.064ThrTyr: 2.064 ± 0.171
0.0ThrXaa: 0.0 ± 0.0
Val
3.973ValAla: 3.973 ± 0.243
0.601ValCys: 0.601 ± 0.085
4.173ValAsp: 4.173 ± 0.258
5.652ValGlu: 5.652 ± 0.329
2.926ValPhe: 2.926 ± 0.211
4.62ValGly: 4.62 ± 0.22
1.386ValHis: 1.386 ± 0.174
4.943ValIle: 4.943 ± 0.286
4.697ValLys: 4.697 ± 0.359
5.975ValLeu: 5.975 ± 0.31
1.663ValMet: 1.663 ± 0.146
4.312ValAsn: 4.312 ± 0.261
3.373ValPro: 3.373 ± 0.197
1.987ValGln: 1.987 ± 0.135
3.619ValArg: 3.619 ± 0.206
4.42ValSer: 4.42 ± 0.275
5.128ValThr: 5.128 ± 0.321
5.482ValVal: 5.482 ± 0.346
0.878ValTrp: 0.878 ± 0.117
2.633ValTyr: 2.633 ± 0.213
0.0ValXaa: 0.0 ± 0.0
Trp
0.616TrpAla: 0.616 ± 0.102
0.293TrpCys: 0.293 ± 0.067
0.832TrpAsp: 0.832 ± 0.11
0.893TrpGlu: 0.893 ± 0.12
0.631TrpPhe: 0.631 ± 0.101
0.878TrpGly: 0.878 ± 0.134
0.185TrpHis: 0.185 ± 0.056
0.693TrpIle: 0.693 ± 0.118
0.662TrpLys: 0.662 ± 0.104
1.309TrpLeu: 1.309 ± 0.156
0.431TrpMet: 0.431 ± 0.085
0.739TrpAsn: 0.739 ± 0.107
0.323TrpPro: 0.323 ± 0.075
0.246TrpGln: 0.246 ± 0.072
0.508TrpArg: 0.508 ± 0.114
0.785TrpSer: 0.785 ± 0.115
0.616TrpThr: 0.616 ± 0.096
1.186TrpVal: 1.186 ± 0.158
0.092TrpTrp: 0.092 ± 0.037
0.585TrpTyr: 0.585 ± 0.094
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.11TyrAla: 2.11 ± 0.178
0.431TyrCys: 0.431 ± 0.107
2.356TyrAsp: 2.356 ± 0.188
2.387TyrGlu: 2.387 ± 0.188
1.709TyrPhe: 1.709 ± 0.155
2.233TyrGly: 2.233 ± 0.168
1.016TyrHis: 1.016 ± 0.14
1.894TyrIle: 1.894 ± 0.165
2.125TyrLys: 2.125 ± 0.185
3.819TyrLeu: 3.819 ± 0.263
0.816TyrMet: 0.816 ± 0.09
1.971TyrAsn: 1.971 ± 0.171
2.017TyrPro: 2.017 ± 0.205
1.679TyrGln: 1.679 ± 0.158
2.526TyrArg: 2.526 ± 0.19
2.356TyrSer: 2.356 ± 0.201
2.618TyrThr: 2.618 ± 0.227
2.418TyrVal: 2.418 ± 0.205
0.616TyrTrp: 0.616 ± 0.101
1.602TyrTyr: 1.602 ± 0.179
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 201 proteins (64935 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski