Amino acid dipeptide frequency for Vibrio phage vB_VmeM-32

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
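The counting described above can be illustrated with a minimal Python sketch. It is not the database's own code: the function names `dipeptide_permille` and `label` are hypothetical, and normalizing by the total number of adjacent pairs is an assumption, since the exact normalization used for the table is not stated here. Residues outside the 20 standard one-letter codes are counted in the total but not reported.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table header (Xaa omitted).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 equally likely dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs inside one protein; a pair never
        # spans the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def label(value_permille, expected=EXPECTED_PERMILLE):
    """Classify a frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"   # would be marked red
    if value_permille < expected:
        return "under"  # would be marked blue
    return "expected"


# Toy input, not taken from the proteome.
freqs = dipeptide_permille(["MKKLA", "AKK"])
print(label(freqs["KK"]), label(freqs["WW"]))
```

With the toy input, the pair KK occurs in two of six adjacent positions and is therefore labeled as overrepresented, while WW never occurs and is labeled as underrepresented.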

All values are given in per mille (‰); to convert them to fractions, multiply by 10⁻³.
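Because the text mixes percent and per mille, a small conversion sketch may help when comparing table values with the 0.25% uniform expectation. The helper names below are hypothetical and only illustrate the arithmetic.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille / 1000.0


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1% equals 10 per mille)."""
    return value_permille / 10.0


ala_ala = 2.688  # AlaAla value from the table, in per mille
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```

On this scale the uniform expectation of 0.25% corresponds to 2.5‰, so the AlaAla value of 2.688‰ lies slightly above it.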

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.688 ± 0.288
AlaCys: 0.321 ± 0.082
AlaAsp: 2.536 ± 0.228
AlaGlu: 3.246 ± 0.327
AlaPhe: 1.995 ± 0.175
AlaGly: 2.908 ± 0.333
AlaHis: 0.744 ± 0.108
AlaIle: 4.379 ± 0.276
AlaLys: 3.635 ± 0.275
AlaLeu: 4.159 ± 0.323
AlaMet: 1.522 ± 0.181
AlaAsn: 3.702 ± 0.316
AlaPro: 1.031 ± 0.144
AlaGln: 2.012 ± 0.238
AlaArg: 2.451 ± 0.237
AlaSer: 3.398 ± 0.343
AlaThr: 3.55 ± 0.433
AlaVal: 2.908 ± 0.238
AlaTrp: 0.693 ± 0.116
AlaTyr: 2.147 ± 0.188
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.575 ± 0.097
CysCys: 0.186 ± 0.057
CysAsp: 1.014 ± 0.137
CysGlu: 0.964 ± 0.134
CysPhe: 0.44 ± 0.097
CysGly: 0.761 ± 0.117
CysHis: 0.321 ± 0.069
CysIle: 0.541 ± 0.081
CysLys: 0.642 ± 0.093
CysLeu: 0.828 ± 0.122
CysMet: 0.304 ± 0.069
CysAsn: 0.727 ± 0.122
CysPro: 0.304 ± 0.076
CysGln: 0.389 ± 0.076
CysArg: 0.575 ± 0.089
CysSer: 0.896 ± 0.131
CysThr: 0.44 ± 0.083
CysVal: 1.217 ± 0.163
CysTrp: 0.169 ± 0.048
CysTyr: 0.575 ± 0.095
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.364 ± 0.255
AspCys: 1.014 ± 0.143
AspAsp: 4.294 ± 0.323
AspGlu: 5.173 ± 0.355
AspPhe: 3.381 ± 0.247
AspGly: 4.801 ± 0.349
AspHis: 0.811 ± 0.125
AspIle: 5.596 ± 0.335
AspLys: 4.649 ± 0.328
AspLeu: 5.038 ± 0.291
AspMet: 1.386 ± 0.141
AspAsn: 4.142 ± 0.3
AspPro: 1.2 ± 0.151
AspGln: 1.15 ± 0.139
AspArg: 1.843 ± 0.173
AspSer: 4.768 ± 0.293
AspThr: 3.618 ± 0.228
AspVal: 5.934 ± 0.367
AspTrp: 0.947 ± 0.131
AspTyr: 3.567 ± 0.224
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.654 ± 0.253
GluCys: 0.828 ± 0.132
GluAsp: 2.181 ± 0.203
GluGlu: 2.857 ± 0.255
GluPhe: 3.584 ± 0.248
GluGly: 1.978 ± 0.186
GluHis: 1.657 ± 0.201
GluIle: 6.137 ± 0.368
GluLys: 4.835 ± 0.331
GluLeu: 6.999 ± 0.458
GluMet: 2.079 ± 0.227
GluAsn: 4.497 ± 0.302
GluPro: 1.758 ± 0.173
GluGln: 2.604 ± 0.248
GluArg: 3.145 ± 0.237
GluSer: 4.801 ± 0.356
GluThr: 4.277 ± 0.313
GluVal: 2.688 ± 0.244
GluTrp: 1.116 ± 0.113
GluTyr: 3.297 ± 0.249
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.316 ± 0.192
PheCys: 0.541 ± 0.096
PheAsp: 3.77 ± 0.283
PheGlu: 3.753 ± 0.301
PhePhe: 1.234 ± 0.17
PheGly: 2.857 ± 0.233
PheHis: 0.845 ± 0.132
PheIle: 3.009 ± 0.238
PheLys: 3.026 ± 0.23
PheLeu: 2.925 ± 0.237
PheMet: 1.251 ± 0.158
PheAsn: 3.212 ± 0.224
PhePro: 1.014 ± 0.126
PheGln: 1.234 ± 0.16
PheArg: 1.724 ± 0.168
PheSer: 3.246 ± 0.195
PheThr: 2.502 ± 0.198
PheVal: 4.041 ± 0.297
PheTrp: 0.423 ± 0.095
PheTyr: 1.623 ± 0.173
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.705 ± 0.249
GlyCys: 0.828 ± 0.109
GlyAsp: 4.294 ± 0.36
GlyGlu: 3.432 ± 0.247
GlyPhe: 2.434 ± 0.18
GlyGly: 3.145 ± 0.314
GlyHis: 0.896 ± 0.122
GlyIle: 4.024 ± 0.269
GlyLys: 3.821 ± 0.283
GlyLeu: 4.193 ± 0.351
GlyMet: 1.42 ± 0.197
GlyAsn: 4.26 ± 0.338
GlyPro: 1.048 ± 0.144
GlyGln: 1.64 ± 0.177
GlyArg: 2.232 ± 0.207
GlySer: 4.734 ± 0.374
GlyThr: 4.193 ± 0.333
GlyVal: 4.227 ± 0.278
GlyTrp: 0.558 ± 0.091
GlyTyr: 2.604 ± 0.291
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.133 ± 0.135
HisCys: 0.203 ± 0.057
HisAsp: 1.319 ± 0.158
HisGlu: 1.555 ± 0.169
HisPhe: 1.082 ± 0.118
HisGly: 1.555 ± 0.195
HisHis: 0.456 ± 0.094
HisIle: 1.2 ± 0.136
HisLys: 1.167 ± 0.157
HisLeu: 1.64 ± 0.207
HisMet: 0.338 ± 0.071
HisAsn: 1.336 ± 0.143
HisPro: 0.778 ± 0.125
HisGln: 0.761 ± 0.113
HisArg: 1.133 ± 0.216
HisSer: 1.268 ± 0.159
HisThr: 0.997 ± 0.142
HisVal: 1.42 ± 0.151
HisTrp: 0.186 ± 0.057
HisTyr: 1.116 ± 0.124
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.784 ± 0.351
IleCys: 0.845 ± 0.126
IleAsp: 6.762 ± 0.39
IleGlu: 6.526 ± 0.382
IlePhe: 2.857 ± 0.214
IleGly: 4.108 ± 0.255
IleHis: 1.877 ± 0.194
IleIle: 5.275 ± 0.302
IleLys: 5.68 ± 0.344
IleLeu: 6.12 ± 0.368
IleMet: 1.843 ± 0.179
IleAsn: 5.714 ± 0.399
IlePro: 2.265 ± 0.201
IleGln: 2.671 ± 0.248
IleArg: 3.956 ± 0.234
IleSer: 5.258 ± 0.286
IleThr: 4.074 ± 0.275
IleVal: 5.342 ± 0.313
IleTrp: 0.456 ± 0.086
IleTyr: 2.451 ± 0.191
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.111 ± 0.31
LysCys: 0.727 ± 0.115
LysAsp: 3.449 ± 0.202
LysGlu: 3.702 ± 0.266
LysPhe: 3.263 ± 0.233
LysGly: 2.891 ± 0.213
LysHis: 1.978 ± 0.212
LysIle: 5.664 ± 0.306
LysLys: 5.613 ± 0.353
LysLeu: 6.982 ± 0.435
LysMet: 2.198 ± 0.204
LysAsn: 4.751 ± 0.303
LysPro: 2.367 ± 0.192
LysGln: 3.06 ± 0.274
LysArg: 3.956 ± 0.279
LysSer: 6.036 ± 0.398
LysThr: 4.954 ± 0.301
LysVal: 3.99 ± 0.283
LysTrp: 0.828 ± 0.139
LysTyr: 2.975 ± 0.209
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.243 ± 0.298
LeuCys: 1.014 ± 0.127
LeuAsp: 5.461 ± 0.319
LeuGlu: 4.903 ± 0.343
LeuPhe: 3.128 ± 0.249
LeuGly: 3.567 ± 0.262
LeuHis: 1.741 ± 0.163
LeuIle: 6.391 ± 0.359
LeuLys: 5.934 ± 0.362
LeuLeu: 5.19 ± 0.348
LeuMet: 1.657 ± 0.154
LeuAsn: 5.883 ± 0.334
LeuPro: 3.111 ± 0.308
LeuGln: 2.401 ± 0.217
LeuArg: 3.584 ± 0.248
LeuSer: 6.678 ± 0.374
LeuThr: 4.987 ± 0.288
LeuVal: 4.784 ± 0.323
LeuTrp: 0.761 ± 0.103
LeuTyr: 3.195 ± 0.254
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.352 ± 0.164
MetCys: 0.423 ± 0.099
MetAsp: 0.964 ± 0.103
MetGlu: 1.116 ± 0.168
MetPhe: 1.268 ± 0.153
MetGly: 1.369 ± 0.164
MetHis: 0.338 ± 0.072
MetIle: 1.741 ± 0.17
MetLys: 2.282 ± 0.213
MetLeu: 1.927 ± 0.174
MetMet: 0.575 ± 0.108
MetAsn: 2.181 ± 0.192
MetPro: 0.592 ± 0.112
MetGln: 0.828 ± 0.113
MetArg: 1.031 ± 0.128
MetSer: 2.282 ± 0.226
MetThr: 1.336 ± 0.144
MetVal: 1.167 ± 0.144
MetTrp: 0.321 ± 0.067
MetTyr: 0.778 ± 0.111
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.652 ± 0.313
AsnCys: 0.879 ± 0.121
AsnAsp: 5.478 ± 0.411
AsnGlu: 4.615 ± 0.24
AsnPhe: 2.959 ± 0.274
AsnGly: 5.173 ± 0.412
AsnHis: 1.809 ± 0.195
AsnIle: 5.495 ± 0.373
AsnLys: 4.683 ± 0.309
AsnLeu: 4.801 ± 0.275
AsnMet: 1.691 ± 0.167
AsnAsn: 5.139 ± 0.37
AsnPro: 1.961 ± 0.22
AsnGln: 2.401 ± 0.241
AsnArg: 4.024 ± 0.276
AsnSer: 4.531 ± 0.303
AsnThr: 3.584 ± 0.286
AsnVal: 5.106 ± 0.344
AsnTrp: 0.828 ± 0.106
AsnTyr: 2.485 ± 0.221
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.116 ± 0.154
ProCys: 0.203 ± 0.056
ProAsp: 1.995 ± 0.169
ProGlu: 1.843 ± 0.174
ProPhe: 1.336 ± 0.141
ProGly: 0.626 ± 0.122
ProHis: 0.609 ± 0.095
ProIle: 2.232 ± 0.216
ProLys: 2.079 ± 0.206
ProLeu: 1.843 ± 0.174
ProMet: 0.423 ± 0.083
ProAsn: 2.536 ± 0.204
ProPro: 0.456 ± 0.102
ProGln: 0.964 ± 0.144
ProArg: 1.454 ± 0.237
ProSer: 1.893 ± 0.186
ProThr: 2.401 ± 0.211
ProVal: 1.944 ± 0.161
ProTrp: 0.254 ± 0.066
ProTyr: 1.217 ± 0.165
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.724 ± 0.209
GlnCys: 0.355 ± 0.079
GlnAsp: 1.369 ± 0.157
GlnGlu: 1.792 ± 0.182
GlnPhe: 1.995 ± 0.201
GlnGly: 1.183 ± 0.158
GlnHis: 0.71 ± 0.122
GlnIle: 2.874 ± 0.213
GlnLys: 2.604 ± 0.238
GlnLeu: 3.516 ± 0.299
GlnMet: 0.761 ± 0.093
GlnAsn: 2.536 ± 0.199
GlnPro: 0.913 ± 0.129
GlnGln: 1.538 ± 0.173
GlnArg: 1.488 ± 0.165
GlnSer: 2.418 ± 0.225
GlnThr: 2.451 ± 0.202
GlnVal: 1.217 ± 0.135
GlnTrp: 0.372 ± 0.077
GlnTyr: 1.488 ± 0.197
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.367 ± 0.228
ArgCys: 0.626 ± 0.116
ArgAsp: 3.922 ± 0.29
ArgGlu: 2.992 ± 0.23
ArgPhe: 2.113 ± 0.175
ArgGly: 2.942 ± 0.238
ArgHis: 0.659 ± 0.112
ArgIle: 4.057 ± 0.264
ArgLys: 3.888 ± 0.318
ArgLeu: 3.618 ± 0.236
ArgMet: 1.133 ± 0.16
ArgAsn: 2.908 ± 0.214
ArgPro: 1.065 ± 0.153
ArgGln: 1.217 ± 0.136
ArgArg: 1.724 ± 0.181
ArgSer: 2.553 ± 0.271
ArgThr: 2.468 ± 0.232
ArgVal: 3.111 ± 0.26
ArgTrp: 0.456 ± 0.098
ArgTyr: 2.198 ± 0.204
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.246 ± 0.297
SerCys: 0.778 ± 0.125
SerAsp: 5.021 ± 0.346
SerGlu: 4.615 ± 0.312
SerPhe: 3.584 ± 0.238
SerGly: 5.85 ± 0.488
SerHis: 1.522 ± 0.21
SerIle: 6.052 ± 0.357
SerLys: 5.123 ± 0.309
SerLeu: 5.258 ± 0.314
SerMet: 1.369 ± 0.153
SerAsn: 4.835 ± 0.311
SerPro: 1.86 ± 0.183
SerGln: 2.401 ± 0.186
SerArg: 3.06 ± 0.256
SerSer: 4.632 ± 0.276
SerThr: 4.379 ± 0.388
SerVal: 5.342 ± 0.304
SerTrp: 1.048 ± 0.216
SerTyr: 2.485 ± 0.201
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.009 ± 0.338
ThrCys: 0.507 ± 0.086
ThrAsp: 4.024 ± 0.324
ThrGlu: 3.905 ± 0.258
ThrPhe: 2.62 ± 0.22
ThrGly: 3.652 ± 0.299
ThrHis: 1.471 ± 0.166
ThrIle: 5.495 ± 0.344
ThrLys: 5.004 ± 0.311
ThrLeu: 4.987 ± 0.306
ThrMet: 1.302 ± 0.14
ThrAsn: 4.125 ± 0.349
ThrPro: 2.046 ± 0.259
ThrGln: 2.181 ± 0.258
ThrArg: 2.874 ± 0.22
ThrSer: 3.804 ± 0.325
ThrThr: 3.584 ± 0.292
ThrVal: 4.007 ± 0.315
ThrTrp: 0.811 ± 0.108
ThrTyr: 2.316 ± 0.206
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.5 ± 0.303
ValCys: 0.947 ± 0.131
ValAsp: 4.734 ± 0.318
ValGlu: 3.939 ± 0.319
ValPhe: 2.891 ± 0.23
ValGly: 4.21 ± 0.306
ValHis: 1.014 ± 0.137
ValIle: 5.004 ± 0.314
ValLys: 4.429 ± 0.268
ValLeu: 4.903 ± 0.261
ValMet: 1.437 ± 0.166
ValAsn: 5.004 ± 0.293
ValPro: 2.029 ± 0.174
ValGln: 2.029 ± 0.196
ValArg: 3.128 ± 0.244
ValSer: 5.021 ± 0.297
ValThr: 4.497 ± 0.311
ValVal: 4.074 ± 0.281
ValTrp: 0.862 ± 0.155
ValTyr: 2.485 ± 0.231
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.321 ± 0.078
TrpCys: 0.118 ± 0.044
TrpAsp: 0.71 ± 0.113
TrpGlu: 0.659 ± 0.108
TrpPhe: 0.659 ± 0.109
TrpGly: 0.558 ± 0.096
TrpHis: 0.321 ± 0.087
TrpIle: 1.065 ± 0.173
TrpLys: 0.879 ± 0.107
TrpLeu: 0.964 ± 0.135
TrpMet: 0.203 ± 0.054
TrpAsn: 0.693 ± 0.104
TrpPro: 0.186 ± 0.06
TrpGln: 0.44 ± 0.088
TrpArg: 0.592 ± 0.11
TrpSer: 0.997 ± 0.213
TrpThr: 0.778 ± 0.11
TrpVal: 0.811 ± 0.152
TrpTrp: 0.203 ± 0.062
TrpTyr: 0.575 ± 0.088
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.164 ± 0.264
TyrCys: 0.524 ± 0.095
TyrAsp: 3.381 ± 0.21
TyrGlu: 2.384 ± 0.208
TyrPhe: 1.724 ± 0.184
TyrGly: 2.553 ± 0.296
TyrHis: 0.862 ± 0.108
TyrIle: 2.637 ± 0.234
TyrLys: 2.519 ± 0.24
TyrLeu: 2.722 ± 0.255
TyrMet: 0.964 ± 0.137
TyrAsn: 3.111 ± 0.236
TyrPro: 1.437 ± 0.173
TyrGln: 1.386 ± 0.164
TyrArg: 2.046 ± 0.182
TyrSer: 3.263 ± 0.26
TyrThr: 2.654 ± 0.219
TyrVal: 2.806 ± 0.215
TyrTrp: 0.44 ± 0.102
TyrTyr: 2.063 ± 0.199
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 254 proteins (59151 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
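A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: the function names are hypothetical, the reported error is taken here to be the standard deviation across 100 resampled proteomes, and the frequency is normalized by the number of adjacent pairs. None of these details is confirmed by the page itself.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a dipeptide in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins, not single residues, are resampled with replacement,
    so each round uses a proteome of the original size.
    """
    if not proteins:
        raise ValueError("at least one protein is required")
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        total = sum(max(len(s) - 1, 0) for s in sample)
        hits = sum(count_pair(s, pair) for s in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not taken from the proteome.
mean, err = bootstrap_error(["AAAA", "CCCC", "ACAC"], "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps adjacent pairs intact, so the spread reflects how much a frequency depends on which proteins happen to be in the proteome.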

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski