Amino acid dipepetide frequency for Vibrio phage helene 12B3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.549AlaAla: 3.549 ± 0.421
0.856AlaCys: 0.856 ± 0.137
3.247AlaAsp: 3.247 ± 0.312
4.48AlaGlu: 4.48 ± 0.355
2.014AlaPhe: 2.014 ± 0.262
2.895AlaGly: 2.895 ± 0.316
0.881AlaHis: 0.881 ± 0.137
2.819AlaIle: 2.819 ± 0.255
3.625AlaLys: 3.625 ± 0.34
3.977AlaLeu: 3.977 ± 0.37
1.561AlaMet: 1.561 ± 0.211
1.812AlaAsn: 1.812 ± 0.216
1.057AlaPro: 1.057 ± 0.182
1.913AlaGln: 1.913 ± 0.23
1.661AlaArg: 1.661 ± 0.229
3.197AlaSer: 3.197 ± 0.294
3.348AlaThr: 3.348 ± 0.354
3.121AlaVal: 3.121 ± 0.305
0.805AlaTrp: 0.805 ± 0.162
2.416AlaTyr: 2.416 ± 0.245
0.0AlaXaa: 0.0 ± 0.0
Cys
0.881CysAla: 0.881 ± 0.149
0.277CysCys: 0.277 ± 0.087
1.133CysAsp: 1.133 ± 0.196
1.309CysGlu: 1.309 ± 0.209
0.78CysPhe: 0.78 ± 0.141
1.233CysGly: 1.233 ± 0.236
0.428CysHis: 0.428 ± 0.115
1.359CysIle: 1.359 ± 0.191
1.561CysLys: 1.561 ± 0.231
1.158CysLeu: 1.158 ± 0.167
0.277CysMet: 0.277 ± 0.08
1.007CysAsn: 1.007 ± 0.175
0.982CysPro: 0.982 ± 0.185
0.554CysGln: 0.554 ± 0.107
0.73CysArg: 0.73 ± 0.127
1.535CysSer: 1.535 ± 0.342
1.233CysThr: 1.233 ± 0.174
0.856CysVal: 0.856 ± 0.133
0.227CysTrp: 0.227 ± 0.082
0.881CysTyr: 0.881 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
3.02AspAla: 3.02 ± 0.35
1.46AspCys: 1.46 ± 0.202
2.995AspAsp: 2.995 ± 0.335
4.908AspGlu: 4.908 ± 0.335
3.373AspPhe: 3.373 ± 0.295
4.657AspGly: 4.657 ± 0.343
1.41AspHis: 1.41 ± 0.191
4.304AspIle: 4.304 ± 0.324
5.487AspLys: 5.487 ± 0.374
6.066AspLeu: 6.066 ± 0.364
1.837AspMet: 1.837 ± 0.228
3.02AspAsn: 3.02 ± 0.326
2.291AspPro: 2.291 ± 0.225
1.611AspGln: 1.611 ± 0.218
2.869AspArg: 2.869 ± 0.274
3.977AspSer: 3.977 ± 0.284
4.329AspThr: 4.329 ± 0.372
3.851AspVal: 3.851 ± 0.301
1.41AspTrp: 1.41 ± 0.191
3.096AspTyr: 3.096 ± 0.304
0.0AspXaa: 0.0 ± 0.0
Glu
4.405GluAla: 4.405 ± 0.3
1.309GluCys: 1.309 ± 0.208
7.828GluAsp: 7.828 ± 0.495
7.249GluGlu: 7.249 ± 0.513
3.297GluPhe: 3.297 ± 0.239
5.336GluGly: 5.336 ± 0.286
1.636GluHis: 1.636 ± 0.19
4.355GluIle: 4.355 ± 0.331
5.261GluLys: 5.261 ± 0.387
7.526GluLeu: 7.526 ± 0.514
2.265GluMet: 2.265 ± 0.282
3.373GluAsn: 3.373 ± 0.219
1.284GluPro: 1.284 ± 0.201
3.096GluGln: 3.096 ± 0.26
3.096GluArg: 3.096 ± 0.32
4.329GluSer: 4.329 ± 0.323
3.901GluThr: 3.901 ± 0.287
6.57GluVal: 6.57 ± 0.451
1.535GluTrp: 1.535 ± 0.182
3.725GluTyr: 3.725 ± 0.259
0.0GluXaa: 0.0 ± 0.0
Phe
1.762PheAla: 1.762 ± 0.266
0.906PheCys: 0.906 ± 0.169
3.247PheAsp: 3.247 ± 0.306
2.844PheGlu: 2.844 ± 0.277
1.46PhePhe: 1.46 ± 0.2
2.416PheGly: 2.416 ± 0.234
0.73PheHis: 0.73 ± 0.15
2.819PheIle: 2.819 ± 0.312
3.801PheLys: 3.801 ± 0.324
3.7PheLeu: 3.7 ± 0.32
1.158PheMet: 1.158 ± 0.152
2.366PheAsn: 2.366 ± 0.259
1.485PhePro: 1.485 ± 0.158
1.158PheGln: 1.158 ± 0.168
1.535PheArg: 1.535 ± 0.275
3.398PheSer: 3.398 ± 0.278
3.272PheThr: 3.272 ± 0.29
2.014PheVal: 2.014 ± 0.213
0.453PheTrp: 0.453 ± 0.113
1.963PheTyr: 1.963 ± 0.215
0.0PheXaa: 0.0 ± 0.0
Gly
2.291GlyAla: 2.291 ± 0.239
1.712GlyCys: 1.712 ± 0.252
3.65GlyAsp: 3.65 ± 0.289
5.009GlyGlu: 5.009 ± 0.412
3.348GlyPhe: 3.348 ± 0.257
3.826GlyGly: 3.826 ± 0.424
0.906GlyHis: 0.906 ± 0.143
3.474GlyIle: 3.474 ± 0.349
6.167GlyLys: 6.167 ± 0.45
5.084GlyLeu: 5.084 ± 0.412
1.737GlyMet: 1.737 ± 0.199
3.776GlyAsn: 3.776 ± 0.338
0.201GlyPro: 0.201 ± 0.069
2.064GlyGln: 2.064 ± 0.248
2.769GlyArg: 2.769 ± 0.239
4.48GlySer: 4.48 ± 0.373
3.625GlyThr: 3.625 ± 0.368
4.833GlyVal: 4.833 ± 0.336
1.435GlyTrp: 1.435 ± 0.207
2.794GlyTyr: 2.794 ± 0.287
0.0GlyXaa: 0.0 ± 0.0
His
0.881HisAla: 0.881 ± 0.197
0.478HisCys: 0.478 ± 0.117
1.007HisAsp: 1.007 ± 0.189
1.636HisGlu: 1.636 ± 0.218
0.931HisPhe: 0.931 ± 0.141
1.233HisGly: 1.233 ± 0.189
0.453HisHis: 0.453 ± 0.109
1.485HisIle: 1.485 ± 0.223
1.51HisLys: 1.51 ± 0.198
1.913HisLeu: 1.913 ± 0.185
0.453HisMet: 0.453 ± 0.11
1.133HisAsn: 1.133 ± 0.184
1.007HisPro: 1.007 ± 0.171
0.503HisGln: 0.503 ± 0.106
0.68HisArg: 0.68 ± 0.127
1.284HisSer: 1.284 ± 0.225
1.133HisThr: 1.133 ± 0.173
1.032HisVal: 1.032 ± 0.123
0.403HisTrp: 0.403 ± 0.093
0.906HisTyr: 0.906 ± 0.168
0.0HisXaa: 0.0 ± 0.0
Ile
3.171IleAla: 3.171 ± 0.303
0.68IleCys: 0.68 ± 0.135
4.153IleAsp: 4.153 ± 0.329
4.455IleGlu: 4.455 ± 0.332
2.039IlePhe: 2.039 ± 0.236
3.423IleGly: 3.423 ± 0.352
1.158IleHis: 1.158 ± 0.165
3.096IleIle: 3.096 ± 0.297
5.135IleLys: 5.135 ± 0.336
4.959IleLeu: 4.959 ± 0.316
0.982IleMet: 0.982 ± 0.161
3.146IleAsn: 3.146 ± 0.301
2.492IlePro: 2.492 ± 0.265
1.888IleGln: 1.888 ± 0.201
2.693IleArg: 2.693 ± 0.246
3.474IleSer: 3.474 ± 0.245
4.178IleThr: 4.178 ± 0.327
3.222IleVal: 3.222 ± 0.314
0.831IleTrp: 0.831 ± 0.138
2.014IleTyr: 2.014 ± 0.246
0.0IleXaa: 0.0 ± 0.0
Lys
3.952LysAla: 3.952 ± 0.34
1.108LysCys: 1.108 ± 0.201
5.261LysAsp: 5.261 ± 0.38
6.167LysGlu: 6.167 ± 0.433
3.725LysPhe: 3.725 ± 0.311
5.714LysGly: 5.714 ± 0.493
1.963LysHis: 1.963 ± 0.238
4.304LysIle: 4.304 ± 0.341
5.059LysLys: 5.059 ± 0.394
6.922LysLeu: 6.922 ± 0.451
3.046LysMet: 3.046 ± 0.263
3.297LysAsn: 3.297 ± 0.281
2.366LysPro: 2.366 ± 0.227
3.474LysGln: 3.474 ± 0.324
3.524LysArg: 3.524 ± 0.381
4.782LysSer: 4.782 ± 0.402
4.43LysThr: 4.43 ± 0.336
6.62LysVal: 6.62 ± 0.421
1.284LysTrp: 1.284 ± 0.208
3.247LysTyr: 3.247 ± 0.275
0.0LysXaa: 0.0 ± 0.0
Leu
3.851LeuAla: 3.851 ± 0.359
1.51LeuCys: 1.51 ± 0.222
5.538LeuAsp: 5.538 ± 0.384
8.105LeuGlu: 8.105 ± 0.472
3.574LeuPhe: 3.574 ± 0.317
5.386LeuGly: 5.386 ± 0.477
2.039LeuHis: 2.039 ± 0.237
3.776LeuIle: 3.776 ± 0.295
7.148LeuLys: 7.148 ± 0.503
7.224LeuLeu: 7.224 ± 0.472
2.391LeuMet: 2.391 ± 0.258
4.229LeuAsn: 4.229 ± 0.332
2.693LeuPro: 2.693 ± 0.217
3.272LeuGln: 3.272 ± 0.264
3.977LeuArg: 3.977 ± 0.292
6.469LeuSer: 6.469 ± 0.45
5.588LeuThr: 5.588 ± 0.357
4.908LeuVal: 4.908 ± 0.36
1.208LeuTrp: 1.208 ± 0.168
2.92LeuTyr: 2.92 ± 0.303
0.0LeuXaa: 0.0 ± 0.0
Met
1.712MetAla: 1.712 ± 0.23
0.277MetCys: 0.277 ± 0.084
1.485MetAsp: 1.485 ± 0.23
2.265MetGlu: 2.265 ± 0.295
1.41MetPhe: 1.41 ± 0.197
1.233MetGly: 1.233 ± 0.183
0.277MetHis: 0.277 ± 0.093
1.712MetIle: 1.712 ± 0.231
2.819MetLys: 2.819 ± 0.271
2.718MetLeu: 2.718 ± 0.286
1.007MetMet: 1.007 ± 0.177
1.46MetAsn: 1.46 ± 0.214
0.805MetPro: 0.805 ± 0.129
0.831MetGln: 0.831 ± 0.155
0.73MetArg: 0.73 ± 0.132
1.988MetSer: 1.988 ± 0.216
1.561MetThr: 1.561 ± 0.18
1.51MetVal: 1.51 ± 0.211
0.302MetTrp: 0.302 ± 0.094
0.755MetTyr: 0.755 ± 0.152
0.0MetXaa: 0.0 ± 0.0
Asn
2.391AsnAla: 2.391 ± 0.249
0.78AsnCys: 0.78 ± 0.137
2.467AsnAsp: 2.467 ± 0.273
2.92AsnGlu: 2.92 ± 0.299
2.291AsnPhe: 2.291 ± 0.215
3.348AsnGly: 3.348 ± 0.3
0.805AsnHis: 0.805 ± 0.143
3.272AsnIle: 3.272 ± 0.259
4.153AsnLys: 4.153 ± 0.375
4.858AsnLeu: 4.858 ± 0.38
1.057AsnMet: 1.057 ± 0.183
2.618AsnAsn: 2.618 ± 0.263
2.316AsnPro: 2.316 ± 0.233
1.41AsnGln: 1.41 ± 0.165
2.114AsnArg: 2.114 ± 0.244
3.65AsnSer: 3.65 ± 0.284
3.373AsnThr: 3.373 ± 0.257
2.794AsnVal: 2.794 ± 0.296
0.654AsnTrp: 0.654 ± 0.141
2.014AsnTyr: 2.014 ± 0.209
0.0AsnXaa: 0.0 ± 0.0
Pro
1.334ProAla: 1.334 ± 0.211
0.529ProCys: 0.529 ± 0.145
2.442ProAsp: 2.442 ± 0.253
3.272ProGlu: 3.272 ± 0.326
1.259ProPhe: 1.259 ± 0.186
0.076ProGly: 0.076 ± 0.051
0.856ProHis: 0.856 ± 0.117
1.334ProIle: 1.334 ± 0.172
2.442ProLys: 2.442 ± 0.285
2.064ProLeu: 2.064 ± 0.257
0.428ProMet: 0.428 ± 0.114
1.259ProAsn: 1.259 ± 0.16
0.73ProPro: 0.73 ± 0.143
1.007ProGln: 1.007 ± 0.147
1.334ProArg: 1.334 ± 0.175
2.19ProSer: 2.19 ± 0.266
2.265ProThr: 2.265 ± 0.25
2.391ProVal: 2.391 ± 0.266
0.478ProTrp: 0.478 ± 0.108
1.586ProTyr: 1.586 ± 0.198
0.0ProXaa: 0.0 ± 0.0
Gln
2.039GlnAla: 2.039 ± 0.231
0.68GlnCys: 0.68 ± 0.133
2.416GlnAsp: 2.416 ± 0.24
3.046GlnGlu: 3.046 ± 0.276
1.309GlnPhe: 1.309 ± 0.165
2.316GlnGly: 2.316 ± 0.216
0.906GlnHis: 0.906 ± 0.142
1.938GlnIle: 1.938 ± 0.255
2.215GlnLys: 2.215 ± 0.236
3.348GlnLeu: 3.348 ± 0.282
1.007GlnMet: 1.007 ± 0.153
1.737GlnAsn: 1.737 ± 0.223
1.259GlnPro: 1.259 ± 0.147
1.586GlnGln: 1.586 ± 0.21
1.888GlnArg: 1.888 ± 0.245
1.661GlnSer: 1.661 ± 0.215
1.686GlnThr: 1.686 ± 0.169
2.215GlnVal: 2.215 ± 0.246
0.604GlnTrp: 0.604 ± 0.115
1.636GlnTyr: 1.636 ± 0.173
0.0GlnXaa: 0.0 ± 0.0
Arg
2.165ArgAla: 2.165 ± 0.254
1.158ArgCys: 1.158 ± 0.19
2.744ArgAsp: 2.744 ± 0.312
3.927ArgGlu: 3.927 ± 0.347
1.535ArgPhe: 1.535 ± 0.218
2.819ArgGly: 2.819 ± 0.294
0.73ArgHis: 0.73 ± 0.129
2.945ArgIle: 2.945 ± 0.261
3.549ArgLys: 3.549 ± 0.38
2.769ArgLeu: 2.769 ± 0.282
1.359ArgMet: 1.359 ± 0.193
2.014ArgAsn: 2.014 ± 0.237
1.158ArgPro: 1.158 ± 0.171
1.988ArgGln: 1.988 ± 0.224
1.737ArgArg: 1.737 ± 0.256
2.593ArgSer: 2.593 ± 0.258
1.737ArgThr: 1.737 ± 0.24
2.542ArgVal: 2.542 ± 0.237
0.931ArgTrp: 0.931 ± 0.161
1.561ArgTyr: 1.561 ± 0.186
0.0ArgXaa: 0.0 ± 0.0
Ser
3.222SerAla: 3.222 ± 0.394
1.208SerCys: 1.208 ± 0.198
4.304SerAsp: 4.304 ± 0.306
5.663SerGlu: 5.663 ± 0.325
2.819SerPhe: 2.819 ± 0.298
4.833SerGly: 4.833 ± 0.418
1.133SerHis: 1.133 ± 0.172
3.574SerIle: 3.574 ± 0.299
5.814SerLys: 5.814 ± 0.352
5.16SerLeu: 5.16 ± 0.32
1.636SerMet: 1.636 ± 0.188
3.524SerAsn: 3.524 ± 0.307
1.737SerPro: 1.737 ± 0.219
2.442SerGln: 2.442 ± 0.229
2.718SerArg: 2.718 ± 0.26
4.027SerSer: 4.027 ± 0.326
3.474SerThr: 3.474 ± 0.327
4.38SerVal: 4.38 ± 0.363
1.032SerTrp: 1.032 ± 0.18
3.171SerTyr: 3.171 ± 0.319
0.0SerXaa: 0.0 ± 0.0
Thr
2.869ThrAla: 2.869 ± 0.331
0.931ThrCys: 0.931 ± 0.172
3.373ThrAsp: 3.373 ± 0.329
4.002ThrGlu: 4.002 ± 0.338
2.693ThrPhe: 2.693 ± 0.283
4.506ThrGly: 4.506 ± 0.326
1.259ThrHis: 1.259 ± 0.183
4.027ThrIle: 4.027 ± 0.347
4.556ThrLys: 4.556 ± 0.384
5.613ThrLeu: 5.613 ± 0.389
1.359ThrMet: 1.359 ± 0.182
2.718ThrAsn: 2.718 ± 0.264
2.492ThrPro: 2.492 ± 0.27
2.19ThrGln: 2.19 ± 0.231
2.139ThrArg: 2.139 ± 0.235
3.876ThrSer: 3.876 ± 0.317
4.178ThrThr: 4.178 ± 0.383
4.631ThrVal: 4.631 ± 0.358
0.856ThrTrp: 0.856 ± 0.155
3.121ThrTyr: 3.121 ± 0.276
0.0ThrXaa: 0.0 ± 0.0
Val
3.171ValAla: 3.171 ± 0.392
1.359ValCys: 1.359 ± 0.189
4.782ValAsp: 4.782 ± 0.423
5.865ValGlu: 5.865 ± 0.421
2.492ValPhe: 2.492 ± 0.229
4.43ValGly: 4.43 ± 0.374
1.208ValHis: 1.208 ± 0.196
3.323ValIle: 3.323 ± 0.324
5.16ValLys: 5.16 ± 0.415
5.663ValLeu: 5.663 ± 0.374
2.039ValMet: 2.039 ± 0.261
3.574ValAsn: 3.574 ± 0.281
1.41ValPro: 1.41 ± 0.192
2.215ValGln: 2.215 ± 0.265
2.542ValArg: 2.542 ± 0.254
4.304ValSer: 4.304 ± 0.343
3.851ValThr: 3.851 ± 0.335
5.21ValVal: 5.21 ± 0.436
1.032ValTrp: 1.032 ± 0.168
2.995ValTyr: 2.995 ± 0.285
0.0ValXaa: 0.0 ± 0.0
Trp
0.654TrpAla: 0.654 ± 0.133
0.227TrpCys: 0.227 ± 0.088
1.158TrpAsp: 1.158 ± 0.161
1.762TrpGlu: 1.762 ± 0.221
0.579TrpPhe: 0.579 ± 0.155
0.73TrpGly: 0.73 ± 0.127
0.327TrpHis: 0.327 ± 0.08
0.956TrpIle: 0.956 ± 0.179
1.535TrpLys: 1.535 ± 0.237
1.636TrpLeu: 1.636 ± 0.191
0.503TrpMet: 0.503 ± 0.095
0.78TrpAsn: 0.78 ± 0.124
0.126TrpPro: 0.126 ± 0.054
0.478TrpGln: 0.478 ± 0.106
0.856TrpArg: 0.856 ± 0.172
0.931TrpSer: 0.931 ± 0.157
0.755TrpThr: 0.755 ± 0.158
1.309TrpVal: 1.309 ± 0.178
0.302TrpTrp: 0.302 ± 0.092
0.503TrpTyr: 0.503 ± 0.092
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.913TyrAla: 1.913 ± 0.195
0.956TyrCys: 0.956 ± 0.145
2.744TyrAsp: 2.744 ± 0.31
2.492TyrGlu: 2.492 ± 0.246
1.561TyrPhe: 1.561 ± 0.22
2.794TyrGly: 2.794 ± 0.313
0.956TyrHis: 0.956 ± 0.167
2.316TyrIle: 2.316 ± 0.208
3.247TyrLys: 3.247 ± 0.322
3.448TyrLeu: 3.448 ± 0.316
0.73TyrMet: 0.73 ± 0.138
2.442TyrAsn: 2.442 ± 0.275
1.384TyrPro: 1.384 ± 0.196
1.812TyrGln: 1.812 ± 0.204
2.366TyrArg: 2.366 ± 0.264
3.725TyrSer: 3.725 ± 0.323
3.474TyrThr: 3.474 ± 0.288
2.618TyrVal: 2.618 ± 0.242
0.327TyrTrp: 0.327 ± 0.086
1.888TyrTyr: 1.888 ± 0.244
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 263 proteins (39730 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski