Amino acid dipeptide frequency for Vibrio phage vB_ValS_X1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
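The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration: the function name `classify` and the simple above/below rule are assumptions, and the page does not state the exact criterion used for the red and blue colouring. The example values are taken from the table on this page.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
STANDARD_PAIRS = 20 * 20
EXPECTED_PER_MILLE = 1000.0 / STANDARD_PAIRS  # 2.5 per mille, i.e. 0.25%


def classify(freq_per_mille: float, expected: float = EXPECTED_PER_MILLE) -> str:
    """Label a dipeptide frequency relative to the uniform expectation.

    Hypothetical rule: strictly above is "over", strictly below is "under".
    """
    if freq_per_mille > expected:
        return "over"   # would correspond to red in the table
    if freq_per_mille < expected:
        return "under"  # would correspond to blue in the table
    return "expected"


# Values copied from the table on this page (per mille).
examples = {"AlaAla": 8.358, "LeuLeu": 8.781, "CysCys": 0.241, "TrpTrp": 0.151}
labels = {pair: classify(value) for pair, value in examples.items()}
```

Under this rule AlaAla and LeuLeu come out as overrepresented, while CysCys and TrpTrp come out as underrepresented.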

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
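As a rough illustration of how such per mille values could be obtained, the sketch below counts overlapping adjacent residue pairs in a list of protein sequences. It is not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter codes (the table uses three-letter codes), the mapping of every non-standard letter to X (Xaa), and the choice not to count pairs across protein boundaries are all assumptions.

```python
from collections import Counter
from typing import Dict, Iterable

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids


def dipeptide_per_mille(proteins: Iterable[str]) -> Dict[str, float]:
    """Return dipeptide frequencies in per mille for a set of sequences."""
    counts: Counter = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real phage proteins: U is non-standard and is mapped to X.
freqs = dipeptide_per_mille(["MKKLA", "MKUA"])
mk_fraction = freqs["MK"] * 1e-3  # per mille back to a plain fraction
```

In the toy input there are seven pairs in total and MK occurs twice, so its frequency is 2/7 of 1000‰.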

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.358 ± 1.13
AlaCys: 0.845 ± 0.146
AlaAsp: 4.255 ± 0.338
AlaGlu: 6.789 ± 0.455
AlaPhe: 2.595 ± 0.283
AlaGly: 4.225 ± 0.404
AlaHis: 1.298 ± 0.206
AlaIle: 4.436 ± 0.357
AlaLys: 5.673 ± 0.469
AlaLeu: 7.544 ± 0.445
AlaMet: 2.142 ± 0.296
AlaAsn: 3.893 ± 0.387
AlaPro: 2.505 ± 0.281
AlaGln: 2.625 ± 0.275
AlaArg: 3.41 ± 0.296
AlaSer: 5.25 ± 0.586
AlaThr: 4.194 ± 0.38
AlaVal: 4.737 ± 0.388
AlaTrp: 0.845 ± 0.166
AlaTyr: 2.957 ± 0.309
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.664 ± 0.147
CysCys: 0.241 ± 0.089
CysAsp: 0.785 ± 0.188
CysGlu: 0.996 ± 0.187
CysPhe: 0.543 ± 0.138
CysGly: 0.935 ± 0.215
CysHis: 0.211 ± 0.081
CysIle: 0.724 ± 0.177
CysLys: 0.875 ± 0.188
CysLeu: 0.815 ± 0.179
CysMet: 0.302 ± 0.119
CysAsn: 0.694 ± 0.138
CysPro: 0.573 ± 0.14
CysGln: 0.241 ± 0.092
CysArg: 0.453 ± 0.101
CysSer: 0.875 ± 0.171
CysThr: 0.785 ± 0.172
CysVal: 0.754 ± 0.2
CysTrp: 0.121 ± 0.057
CysTyr: 0.362 ± 0.112
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.436 ± 0.428
AspCys: 0.724 ± 0.15
AspAsp: 3.168 ± 0.335
AspGlu: 5.069 ± 0.415
AspPhe: 2.655 ± 0.284
AspGly: 4.406 ± 0.483
AspHis: 0.754 ± 0.147
AspIle: 4.013 ± 0.418
AspLys: 3.772 ± 0.324
AspLeu: 6.639 ± 0.412
AspMet: 1.961 ± 0.256
AspAsn: 2.957 ± 0.312
AspPro: 2.686 ± 0.316
AspGln: 1.479 ± 0.249
AspArg: 2.686 ± 0.261
AspSer: 4.225 ± 0.372
AspThr: 3.862 ± 0.374
AspVal: 4.375 ± 0.372
AspTrp: 1.539 ± 0.201
AspTyr: 2.716 ± 0.257
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.729 ± 0.598
GluCys: 0.664 ± 0.159
GluAsp: 5.19 ± 0.395
GluGlu: 5.944 ± 0.573
GluPhe: 3.108 ± 0.287
GluGly: 4.225 ± 0.352
GluHis: 1.479 ± 0.212
GluIle: 4.496 ± 0.424
GluLys: 5.432 ± 0.478
GluLeu: 7.906 ± 0.661
GluMet: 2.867 ± 0.363
GluAsn: 3.53 ± 0.318
GluPro: 2.142 ± 0.227
GluGln: 3.168 ± 0.373
GluArg: 3.681 ± 0.362
GluSer: 3.621 ± 0.302
GluThr: 3.621 ± 0.295
GluVal: 5.341 ± 0.4
GluTrp: 0.996 ± 0.194
GluTyr: 3.259 ± 0.311
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.142 ± 0.259
PheCys: 0.634 ± 0.14
PheAsp: 3.048 ± 0.294
PheGlu: 3.229 ± 0.316
PhePhe: 1.328 ± 0.227
PheGly: 2.716 ± 0.327
PheHis: 0.845 ± 0.157
PheIle: 1.69 ± 0.202
PheLys: 2.323 ± 0.213
PheLeu: 3.078 ± 0.302
PheMet: 1.147 ± 0.191
PheAsn: 1.811 ± 0.221
PhePro: 1.509 ± 0.166
PheGln: 1.147 ± 0.179
PheArg: 1.509 ± 0.22
PheSer: 2.776 ± 0.285
PheThr: 2.867 ± 0.333
PheVal: 1.811 ± 0.2
PheTrp: 0.513 ± 0.125
PheTyr: 1.569 ± 0.223
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.828 ± 0.416
GlyCys: 0.996 ± 0.199
GlyAsp: 4.134 ± 0.356
GlyGlu: 4.225 ± 0.318
GlyPhe: 2.867 ± 0.274
GlyGly: 3.832 ± 0.485
GlyHis: 1.328 ± 0.198
GlyIle: 4.436 ± 0.313
GlyLys: 3.862 ± 0.422
GlyLeu: 5.009 ± 0.352
GlyMet: 1.026 ± 0.174
GlyAsn: 2.957 ± 0.408
GlyPro: 1.086 ± 0.219
GlyGln: 2.142 ± 0.225
GlyArg: 3.229 ± 0.261
GlySer: 4.436 ± 0.47
GlyThr: 4.496 ± 0.562
GlyVal: 4.707 ± 0.446
GlyTrp: 0.815 ± 0.178
GlyTyr: 2.444 ± 0.264
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.177 ± 0.172
HisCys: 0.362 ± 0.125
HisAsp: 1.026 ± 0.2
HisGlu: 1.147 ± 0.192
HisPhe: 0.996 ± 0.174
HisGly: 1.207 ± 0.178
HisHis: 0.543 ± 0.146
HisIle: 1.328 ± 0.224
HisLys: 1.448 ± 0.216
HisLeu: 1.509 ± 0.208
HisMet: 0.664 ± 0.161
HisAsn: 0.905 ± 0.19
HisPro: 0.996 ± 0.172
HisGln: 0.483 ± 0.134
HisArg: 1.298 ± 0.217
HisSer: 0.966 ± 0.199
HisThr: 1.237 ± 0.19
HisVal: 1.177 ± 0.196
HisTrp: 0.241 ± 0.084
HisTyr: 0.935 ± 0.163
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.5 ± 0.296
IleCys: 0.634 ± 0.16
IleAsp: 4.134 ± 0.394
IleGlu: 4.013 ± 0.351
IlePhe: 1.69 ± 0.225
IleGly: 3.138 ± 0.328
IleHis: 1.147 ± 0.207
IleIle: 2.897 ± 0.341
IleLys: 3.561 ± 0.312
IleLeu: 4.375 ± 0.367
IleMet: 1.147 ± 0.137
IleAsn: 2.897 ± 0.264
IlePro: 2.595 ± 0.299
IleGln: 2.595 ± 0.309
IleArg: 3.078 ± 0.283
IleSer: 4.013 ± 0.396
IleThr: 3.893 ± 0.4
IleVal: 4.134 ± 0.346
IleTrp: 0.573 ± 0.12
IleTyr: 2.565 ± 0.295
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.518 ± 0.474
LysCys: 0.724 ± 0.19
LysAsp: 3.651 ± 0.344
LysGlu: 5.432 ± 0.528
LysPhe: 2.293 ± 0.295
LysGly: 3.53 ± 0.378
LysHis: 1.569 ± 0.19
LysIle: 3.138 ± 0.332
LysLys: 3.349 ± 0.435
LysLeu: 6.85 ± 0.584
LysMet: 2.233 ± 0.282
LysAsn: 2.716 ± 0.269
LysPro: 2.414 ± 0.397
LysGln: 2.082 ± 0.218
LysArg: 2.655 ± 0.368
LysSer: 3.44 ± 0.321
LysThr: 3.561 ± 0.318
LysVal: 4.828 ± 0.373
LysTrp: 0.785 ± 0.173
LysTyr: 3.168 ± 0.33
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.061 ± 0.546
LeuCys: 0.754 ± 0.133
LeuAsp: 6.397 ± 0.436
LeuGlu: 7.846 ± 0.547
LeuPhe: 2.565 ± 0.261
LeuGly: 5.975 ± 0.381
LeuHis: 2.052 ± 0.296
LeuIle: 4.194 ± 0.37
LeuLys: 5.643 ± 0.451
LeuLeu: 8.781 ± 0.684
LeuMet: 1.901 ± 0.292
LeuAsn: 3.983 ± 0.364
LeuPro: 3.742 ± 0.326
LeuGln: 3.44 ± 0.33
LeuArg: 5.009 ± 0.345
LeuSer: 5.914 ± 0.485
LeuThr: 5.552 ± 0.449
LeuVal: 6.126 ± 0.345
LeuTrp: 0.634 ± 0.146
LeuTyr: 3.078 ± 0.379
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.961 ± 0.26
MetCys: 0.211 ± 0.074
MetAsp: 1.388 ± 0.213
MetGlu: 1.811 ± 0.255
MetPhe: 0.935 ± 0.155
MetGly: 1.629 ± 0.202
MetHis: 0.664 ± 0.159
MetIle: 1.147 ± 0.215
MetLys: 1.75 ± 0.237
MetLeu: 2.414 ± 0.256
MetMet: 0.332 ± 0.093
MetAsn: 0.935 ± 0.203
MetPro: 0.996 ± 0.211
MetGln: 0.996 ± 0.178
MetArg: 1.388 ± 0.263
MetSer: 2.022 ± 0.206
MetThr: 1.448 ± 0.212
MetVal: 1.599 ± 0.225
MetTrp: 0.302 ± 0.094
MetTyr: 1.056 ± 0.173
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.315 ± 0.4
AsnCys: 0.392 ± 0.099
AsnAsp: 1.931 ± 0.273
AsnGlu: 2.776 ± 0.328
AsnPhe: 1.599 ± 0.193
AsnGly: 3.41 ± 0.468
AsnHis: 0.875 ± 0.199
AsnIle: 2.595 ± 0.298
AsnLys: 3.259 ± 0.344
AsnLeu: 3.561 ± 0.262
AsnMet: 1.298 ± 0.178
AsnAsn: 2.203 ± 0.276
AsnPro: 2.716 ± 0.29
AsnGln: 1.569 ± 0.23
AsnArg: 2.112 ± 0.229
AsnSer: 3.229 ± 0.337
AsnThr: 3.199 ± 0.318
AsnVal: 2.625 ± 0.248
AsnTrp: 0.694 ± 0.163
AsnTyr: 1.629 ± 0.211
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.052 ± 0.236
ProCys: 0.513 ± 0.164
ProAsp: 2.565 ± 0.348
ProGlu: 3.47 ± 0.27
ProPhe: 1.207 ± 0.181
ProGly: 2.323 ± 0.229
ProHis: 0.513 ± 0.133
ProIle: 2.565 ± 0.335
ProLys: 2.444 ± 0.311
ProLeu: 2.957 ± 0.292
ProMet: 0.604 ± 0.111
ProAsn: 1.75 ± 0.242
ProPro: 0.935 ± 0.25
ProGln: 1.267 ± 0.187
ProArg: 1.629 ± 0.243
ProSer: 2.142 ± 0.35
ProThr: 2.716 ± 0.307
ProVal: 2.927 ± 0.314
ProTrp: 0.392 ± 0.133
ProTyr: 1.358 ± 0.204
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.565 ± 0.245
GlnCys: 0.392 ± 0.11
GlnAsp: 2.173 ± 0.261
GlnGlu: 2.806 ± 0.385
GlnPhe: 1.328 ± 0.185
GlnGly: 2.474 ± 0.26
GlnHis: 0.845 ± 0.153
GlnIle: 1.841 ± 0.229
GlnLys: 2.384 ± 0.338
GlnLeu: 3.53 ± 0.361
GlnMet: 0.905 ± 0.143
GlnAsn: 1.388 ± 0.188
GlnPro: 0.996 ± 0.187
GlnGln: 1.328 ± 0.19
GlnArg: 1.539 ± 0.222
GlnSer: 1.629 ± 0.271
GlnThr: 1.901 ± 0.211
GlnVal: 2.384 ± 0.24
GlnTrp: 0.543 ± 0.14
GlnTyr: 1.418 ± 0.19
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.681 ± 0.344
ArgCys: 0.604 ± 0.123
ArgAsp: 2.836 ± 0.379
ArgGlu: 3.712 ± 0.344
ArgPhe: 2.052 ± 0.262
ArgGly: 3.018 ± 0.268
ArgHis: 0.815 ± 0.151
ArgIle: 2.927 ± 0.289
ArgLys: 3.681 ± 0.387
ArgLeu: 4.677 ± 0.447
ArgMet: 1.056 ± 0.18
ArgAsn: 2.233 ± 0.243
ArgPro: 1.539 ± 0.231
ArgGln: 1.629 ± 0.261
ArgArg: 2.414 ± 0.272
ArgSer: 2.535 ± 0.254
ArgThr: 2.655 ± 0.264
ArgVal: 3.561 ± 0.345
ArgTrp: 0.634 ± 0.137
ArgTyr: 1.599 ± 0.202
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.828 ± 0.534
SerCys: 0.754 ± 0.219
SerAsp: 4.225 ± 0.319
SerGlu: 4.798 ± 0.594
SerPhe: 2.655 ± 0.291
SerGly: 4.194 ± 0.403
SerHis: 0.785 ± 0.168
SerIle: 3.651 ± 0.358
SerLys: 4.074 ± 0.33
SerLeu: 5.25 ± 0.329
SerMet: 1.841 ± 0.262
SerAsn: 3.078 ± 0.308
SerPro: 2.203 ± 0.296
SerGln: 2.022 ± 0.219
SerArg: 3.018 ± 0.322
SerSer: 4.707 ± 0.609
SerThr: 4.164 ± 0.398
SerVal: 3.923 ± 0.35
SerTrp: 0.875 ± 0.161
SerTyr: 2.052 ± 0.313
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.647 ± 0.532
ThrCys: 0.875 ± 0.176
ThrAsp: 4.013 ± 0.38
ThrGlu: 4.345 ± 0.468
ThrPhe: 2.565 ± 0.227
ThrGly: 4.737 ± 0.489
ThrHis: 1.267 ± 0.241
ThrIle: 3.742 ± 0.395
ThrLys: 4.194 ± 0.327
ThrLeu: 5.039 ± 0.385
ThrMet: 0.875 ± 0.151
ThrAsn: 2.746 ± 0.299
ThrPro: 2.535 ± 0.266
ThrGln: 2.082 ± 0.242
ThrArg: 2.716 ± 0.285
ThrSer: 3.802 ± 0.463
ThrThr: 3.832 ± 0.421
ThrVal: 4.617 ± 0.375
ThrTrp: 0.694 ± 0.166
ThrTyr: 2.595 ± 0.284
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.552 ± 0.442
ValCys: 0.754 ± 0.166
ValAsp: 5.462 ± 0.405
ValGlu: 5.522 ± 0.396
ValPhe: 2.806 ± 0.296
ValGly: 3.742 ± 0.375
ValHis: 1.448 ± 0.205
ValIle: 3.893 ± 0.382
ValLys: 3.953 ± 0.334
ValLeu: 5.944 ± 0.428
ValMet: 1.207 ± 0.146
ValAsn: 2.836 ± 0.245
ValPro: 2.444 ± 0.289
ValGln: 1.992 ± 0.276
ValArg: 3.229 ± 0.276
ValSer: 4.436 ± 0.349
ValThr: 4.647 ± 0.35
ValVal: 5.432 ± 0.397
ValTrp: 0.815 ± 0.189
ValTyr: 2.686 ± 0.21
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.935 ± 0.197
TrpCys: 0.211 ± 0.077
TrpAsp: 0.996 ± 0.206
TrpGlu: 1.056 ± 0.189
TrpPhe: 0.513 ± 0.113
TrpGly: 0.815 ± 0.168
TrpHis: 0.211 ± 0.076
TrpIle: 0.634 ± 0.122
TrpLys: 0.634 ± 0.155
TrpLeu: 1.267 ± 0.212
TrpMet: 0.604 ± 0.147
TrpAsn: 0.875 ± 0.201
TrpPro: 0.151 ± 0.064
TrpGln: 0.272 ± 0.103
TrpArg: 0.664 ± 0.139
TrpSer: 0.634 ± 0.173
TrpThr: 0.785 ± 0.179
TrpVal: 0.905 ± 0.15
TrpTrp: 0.151 ± 0.071
TrpTyr: 0.422 ± 0.116
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.625 ± 0.281
TyrCys: 0.664 ± 0.16
TyrAsp: 2.746 ± 0.251
TyrGlu: 2.414 ± 0.27
TyrPhe: 1.388 ± 0.196
TyrGly: 2.323 ± 0.268
TyrHis: 0.996 ± 0.203
TyrIle: 2.233 ± 0.262
TyrLys: 2.625 ± 0.292
TyrLeu: 3.5 ± 0.347
TyrMet: 0.845 ± 0.163
TyrAsn: 1.599 ± 0.212
TyrPro: 1.569 ± 0.261
TyrGln: 1.811 ± 0.292
TyrArg: 2.173 ± 0.267
TyrSer: 2.444 ± 0.277
TyrThr: 2.505 ± 0.283
TyrVal: 2.836 ± 0.259
TyrTrp: 0.573 ± 0.123
TyrTyr: 1.66 ± 0.253
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 170 proteins (33141 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
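A protein-level bootstrap of this kind might look like the sketch below. It is an assumption-laden illustration rather than the database's procedure: the helper names `pair_per_mille` and `bootstrap_error`, the fixed seed, and the use of the sample standard deviation across replicates as the reported error are all hypothetical choices, since the page does not specify them.

```python
import random
import statistics
from collections import Counter
from typing import List, Sequence


def pair_per_mille(proteins: Sequence[str], pair: str) -> float:
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts: Counter = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins: Sequence[str], pair: str,
                    replicates: int = 100, seed: int = 0) -> float:
    """Resample whole proteins with replacement and return the spread.

    Assumption: the error is the standard deviation over the replicates.
    """
    rng = random.Random(seed)
    estimates: List[float] = []
    for _ in range(replicates):
        # Protein-level resampling: draw as many proteins as in the original set.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences for illustration only, not taken from the phage proteome.
toy_proteome = ["MKKLA", "MKLAA", "AAKKM"]
kk_error = bootstrap_error(toy_proteome, "KK")
```

Resampling whole proteins, rather than individual dipeptides, keeps pairs from the same protein together, so the estimate reflects variation between proteins.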

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski