Amino acid dipepetide frequency for Vibrio phage 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.992AlaAla: 7.992 ± 0.788
0.888AlaCys: 0.888 ± 0.193
4.44AlaAsp: 4.44 ± 0.405
6.418AlaGlu: 6.418 ± 0.512
2.987AlaPhe: 2.987 ± 0.341
5.207AlaGly: 5.207 ± 0.431
1.413AlaHis: 1.413 ± 0.28
5.651AlaIle: 5.651 ± 0.615
5.974AlaLys: 5.974 ± 0.522
8.154AlaLeu: 8.154 ± 0.695
2.301AlaMet: 2.301 ± 0.32
4.884AlaAsn: 4.884 ± 0.433
3.633AlaPro: 3.633 ± 0.417
3.835AlaGln: 3.835 ± 0.517
5.772AlaArg: 5.772 ± 0.502
4.4AlaSer: 4.4 ± 0.431
5.005AlaThr: 5.005 ± 0.516
6.055AlaVal: 6.055 ± 0.639
0.848AlaTrp: 0.848 ± 0.217
2.382AlaTyr: 2.382 ± 0.273
0.0AlaXaa: 0.0 ± 0.0
Cys
0.928CysAla: 0.928 ± 0.2
0.283CysCys: 0.283 ± 0.114
0.323CysAsp: 0.323 ± 0.134
0.888CysGlu: 0.888 ± 0.195
0.484CysPhe: 0.484 ± 0.137
1.049CysGly: 1.049 ± 0.224
0.242CysHis: 0.242 ± 0.091
0.727CysIle: 0.727 ± 0.203
1.009CysLys: 1.009 ± 0.216
1.413CysLeu: 1.413 ± 0.238
0.161CysMet: 0.161 ± 0.076
0.848CysAsn: 0.848 ± 0.223
0.565CysPro: 0.565 ± 0.176
0.404CysGln: 0.404 ± 0.124
0.727CysArg: 0.727 ± 0.186
0.525CysSer: 0.525 ± 0.14
0.807CysThr: 0.807 ± 0.195
0.646CysVal: 0.646 ± 0.174
0.161CysTrp: 0.161 ± 0.103
0.202CysTyr: 0.202 ± 0.098
0.0CysXaa: 0.0 ± 0.0
Asp
4.763AspAla: 4.763 ± 0.438
0.767AspCys: 0.767 ± 0.203
2.583AspAsp: 2.583 ± 0.312
3.915AspGlu: 3.915 ± 0.388
2.624AspPhe: 2.624 ± 0.302
3.552AspGly: 3.552 ± 0.409
1.534AspHis: 1.534 ± 0.283
2.947AspIle: 2.947 ± 0.319
4.198AspLys: 4.198 ± 0.493
6.781AspLeu: 6.781 ± 0.485
1.453AspMet: 1.453 ± 0.242
2.139AspAsn: 2.139 ± 0.286
3.754AspPro: 3.754 ± 0.427
3.027AspGln: 3.027 ± 0.401
3.633AspArg: 3.633 ± 0.355
3.027AspSer: 3.027 ± 0.443
3.027AspThr: 3.027 ± 0.36
4.036AspVal: 4.036 ± 0.371
0.565AspTrp: 0.565 ± 0.146
2.139AspTyr: 2.139 ± 0.318
0.0AspXaa: 0.0 ± 0.0
Glu
7.508GluAla: 7.508 ± 0.839
1.13GluCys: 1.13 ± 0.278
4.682GluAsp: 4.682 ± 0.442
3.915GluGlu: 3.915 ± 0.398
2.462GluPhe: 2.462 ± 0.331
4.359GluGly: 4.359 ± 0.575
1.574GluHis: 1.574 ± 0.238
4.803GluIle: 4.803 ± 0.402
3.754GluLys: 3.754 ± 0.434
7.79GluLeu: 7.79 ± 0.649
1.534GluMet: 1.534 ± 0.237
2.785GluAsn: 2.785 ± 0.302
1.736GluPro: 1.736 ± 0.27
3.471GluGln: 3.471 ± 0.409
3.512GluArg: 3.512 ± 0.378
3.794GluSer: 3.794 ± 0.409
5.046GluThr: 5.046 ± 0.416
4.521GluVal: 4.521 ± 0.281
0.484GluTrp: 0.484 ± 0.128
2.099GluTyr: 2.099 ± 0.274
0.0GluXaa: 0.0 ± 0.0
Phe
2.664PheAla: 2.664 ± 0.362
0.646PheCys: 0.646 ± 0.169
3.148PheAsp: 3.148 ± 0.39
2.301PheGlu: 2.301 ± 0.312
1.251PhePhe: 1.251 ± 0.234
2.301PheGly: 2.301 ± 0.347
0.484PheHis: 0.484 ± 0.113
2.22PheIle: 2.22 ± 0.304
2.583PheLys: 2.583 ± 0.303
3.027PheLeu: 3.027 ± 0.361
1.09PheMet: 1.09 ± 0.211
2.301PheAsn: 2.301 ± 0.359
0.767PhePro: 0.767 ± 0.167
1.13PheGln: 1.13 ± 0.221
1.574PheArg: 1.574 ± 0.228
2.543PheSer: 2.543 ± 0.34
2.139PheThr: 2.139 ± 0.243
2.664PheVal: 2.664 ± 0.397
0.04PheTrp: 0.04 ± 0.037
1.494PheTyr: 1.494 ± 0.219
0.0PheXaa: 0.0 ± 0.0
Gly
5.53GlyAla: 5.53 ± 0.486
0.807GlyCys: 0.807 ± 0.195
4.481GlyAsp: 4.481 ± 0.432
4.642GlyGlu: 4.642 ± 0.565
2.745GlyPhe: 2.745 ± 0.35
4.481GlyGly: 4.481 ± 0.418
0.807GlyHis: 0.807 ± 0.204
3.35GlyIle: 3.35 ± 0.406
3.835GlyLys: 3.835 ± 0.379
5.893GlyLeu: 5.893 ± 0.508
1.09GlyMet: 1.09 ± 0.236
3.471GlyAsn: 3.471 ± 0.419
0.0GlyPro: 0.0 ± 0.0
2.018GlyGln: 2.018 ± 0.278
3.068GlyArg: 3.068 ± 0.387
4.642GlySer: 4.642 ± 0.453
3.633GlyThr: 3.633 ± 0.473
4.602GlyVal: 4.602 ± 0.51
0.767GlyTrp: 0.767 ± 0.185
2.745GlyTyr: 2.745 ± 0.293
0.0GlyXaa: 0.0 ± 0.0
His
1.938HisAla: 1.938 ± 0.293
0.363HisCys: 0.363 ± 0.15
1.049HisAsp: 1.049 ± 0.26
1.251HisGlu: 1.251 ± 0.271
0.646HisPhe: 0.646 ± 0.148
1.453HisGly: 1.453 ± 0.256
0.363HisHis: 0.363 ± 0.113
0.848HisIle: 0.848 ± 0.171
1.332HisLys: 1.332 ± 0.246
2.099HisLeu: 2.099 ± 0.293
0.323HisMet: 0.323 ± 0.123
0.767HisAsn: 0.767 ± 0.183
0.928HisPro: 0.928 ± 0.232
0.404HisGln: 0.404 ± 0.119
1.453HisArg: 1.453 ± 0.278
0.928HisSer: 0.928 ± 0.22
1.292HisThr: 1.292 ± 0.236
1.332HisVal: 1.332 ± 0.226
0.081HisTrp: 0.081 ± 0.056
0.888HisTyr: 0.888 ± 0.203
0.0HisXaa: 0.0 ± 0.0
Ile
5.57IleAla: 5.57 ± 0.452
0.848IleCys: 0.848 ± 0.226
4.925IleAsp: 4.925 ± 0.553
5.247IleGlu: 5.247 ± 0.461
1.171IlePhe: 1.171 ± 0.288
3.148IleGly: 3.148 ± 0.408
0.928IleHis: 0.928 ± 0.182
2.624IleIle: 2.624 ± 0.374
4.965IleLys: 4.965 ± 0.473
3.956IleLeu: 3.956 ± 0.415
1.171IleMet: 1.171 ± 0.235
3.835IleAsn: 3.835 ± 0.399
3.027IlePro: 3.027 ± 0.329
2.18IleGln: 2.18 ± 0.374
3.189IleArg: 3.189 ± 0.356
3.391IleSer: 3.391 ± 0.428
3.229IleThr: 3.229 ± 0.366
3.552IleVal: 3.552 ± 0.398
0.283IleTrp: 0.283 ± 0.111
1.857IleTyr: 1.857 ± 0.327
0.0IleXaa: 0.0 ± 0.0
Lys
7.185LysAla: 7.185 ± 0.679
0.525LysCys: 0.525 ± 0.159
3.592LysAsp: 3.592 ± 0.369
4.4LysGlu: 4.4 ± 0.503
1.736LysPhe: 1.736 ± 0.293
3.431LysGly: 3.431 ± 0.308
2.018LysHis: 2.018 ± 0.364
3.108LysIle: 3.108 ± 0.341
4.238LysLys: 4.238 ± 0.598
6.014LysLeu: 6.014 ± 0.466
1.453LysMet: 1.453 ± 0.287
3.35LysAsn: 3.35 ± 0.371
4.036LysPro: 4.036 ± 0.497
3.512LysGln: 3.512 ± 0.444
3.915LysArg: 3.915 ± 0.505
3.471LysSer: 3.471 ± 0.4
4.198LysThr: 4.198 ± 0.395
3.633LysVal: 3.633 ± 0.419
0.605LysTrp: 0.605 ± 0.175
1.776LysTyr: 1.776 ± 0.274
0.0LysXaa: 0.0 ± 0.0
Leu
7.589LeuAla: 7.589 ± 0.722
1.09LeuCys: 1.09 ± 0.18
5.853LeuAsp: 5.853 ± 0.376
6.418LeuGlu: 6.418 ± 0.52
3.027LeuPhe: 3.027 ± 0.289
4.844LeuGly: 4.844 ± 0.454
2.139LeuHis: 2.139 ± 0.366
7.387LeuIle: 7.387 ± 0.506
5.732LeuLys: 5.732 ± 0.524
6.983LeuLeu: 6.983 ± 0.541
1.897LeuMet: 1.897 ± 0.233
5.853LeuAsn: 5.853 ± 0.427
3.915LeuPro: 3.915 ± 0.36
4.198LeuGln: 4.198 ± 0.366
4.44LeuArg: 4.44 ± 0.42
6.499LeuSer: 6.499 ± 0.498
5.57LeuThr: 5.57 ± 0.516
4.925LeuVal: 4.925 ± 0.47
0.484LeuTrp: 0.484 ± 0.178
2.341LeuTyr: 2.341 ± 0.28
0.0LeuXaa: 0.0 ± 0.0
Met
2.503MetAla: 2.503 ± 0.287
0.404MetCys: 0.404 ± 0.13
0.686MetAsp: 0.686 ± 0.159
0.767MetGlu: 0.767 ± 0.183
0.928MetPhe: 0.928 ± 0.175
1.13MetGly: 1.13 ± 0.181
0.646MetHis: 0.646 ± 0.178
1.615MetIle: 1.615 ± 0.212
1.574MetLys: 1.574 ± 0.25
1.655MetLeu: 1.655 ± 0.268
0.323MetMet: 0.323 ± 0.118
1.049MetAsn: 1.049 ± 0.148
0.767MetPro: 0.767 ± 0.16
1.211MetGln: 1.211 ± 0.238
1.332MetArg: 1.332 ± 0.199
1.13MetSer: 1.13 ± 0.237
1.211MetThr: 1.211 ± 0.236
1.292MetVal: 1.292 ± 0.206
0.202MetTrp: 0.202 ± 0.095
0.565MetTyr: 0.565 ± 0.139
0.0MetXaa: 0.0 ± 0.0
Asn
5.409AsnAla: 5.409 ± 0.538
0.605AsnCys: 0.605 ± 0.182
2.866AsnAsp: 2.866 ± 0.361
3.673AsnGlu: 3.673 ± 0.498
2.503AsnPhe: 2.503 ± 0.375
4.158AsnGly: 4.158 ± 0.429
0.848AsnHis: 0.848 ± 0.177
2.745AsnIle: 2.745 ± 0.361
3.189AsnLys: 3.189 ± 0.349
4.481AsnLeu: 4.481 ± 0.371
1.13AsnMet: 1.13 ± 0.193
2.785AsnAsn: 2.785 ± 0.296
2.583AsnPro: 2.583 ± 0.338
2.422AsnGln: 2.422 ± 0.366
2.987AsnArg: 2.987 ± 0.41
2.099AsnSer: 2.099 ± 0.304
3.148AsnThr: 3.148 ± 0.422
2.987AsnVal: 2.987 ± 0.344
0.363AsnTrp: 0.363 ± 0.12
1.978AsnTyr: 1.978 ± 0.3
0.0AsnXaa: 0.0 ± 0.0
Pro
2.018ProAla: 2.018 ± 0.312
0.484ProCys: 0.484 ± 0.171
2.664ProAsp: 2.664 ± 0.349
3.956ProGlu: 3.956 ± 0.394
1.171ProPhe: 1.171 ± 0.207
2.382ProGly: 2.382 ± 0.352
0.767ProHis: 0.767 ± 0.201
1.776ProIle: 1.776 ± 0.286
3.108ProLys: 3.108 ± 0.377
3.189ProLeu: 3.189 ± 0.329
0.363ProMet: 0.363 ± 0.115
1.453ProAsn: 1.453 ± 0.255
1.494ProPro: 1.494 ± 0.275
1.494ProGln: 1.494 ± 0.278
2.26ProArg: 2.26 ± 0.403
2.462ProSer: 2.462 ± 0.381
3.068ProThr: 3.068 ± 0.388
3.552ProVal: 3.552 ± 0.407
0.525ProTrp: 0.525 ± 0.144
1.251ProTyr: 1.251 ± 0.25
0.0ProXaa: 0.0 ± 0.0
Gln
3.915GlnAla: 3.915 ± 0.447
0.363GlnCys: 0.363 ± 0.123
1.978GlnAsp: 1.978 ± 0.228
2.422GlnGlu: 2.422 ± 0.415
1.534GlnPhe: 1.534 ± 0.237
3.229GlnGly: 3.229 ± 0.4
0.444GlnHis: 0.444 ± 0.141
2.947GlnIle: 2.947 ± 0.309
2.462GlnLys: 2.462 ± 0.368
5.005GlnLeu: 5.005 ± 0.481
0.807GlnMet: 0.807 ± 0.18
1.494GlnAsn: 1.494 ± 0.284
1.534GlnPro: 1.534 ± 0.258
2.099GlnGln: 2.099 ± 0.27
2.301GlnArg: 2.301 ± 0.502
2.139GlnSer: 2.139 ± 0.306
2.382GlnThr: 2.382 ± 0.324
2.826GlnVal: 2.826 ± 0.407
0.484GlnTrp: 0.484 ± 0.146
1.534GlnTyr: 1.534 ± 0.259
0.0GlnXaa: 0.0 ± 0.0
Arg
4.925ArgAla: 4.925 ± 0.509
0.283ArgCys: 0.283 ± 0.11
2.664ArgAsp: 2.664 ± 0.321
4.844ArgGlu: 4.844 ± 0.647
2.664ArgPhe: 2.664 ± 0.349
3.512ArgGly: 3.512 ± 0.357
1.211ArgHis: 1.211 ± 0.243
3.431ArgIle: 3.431 ± 0.374
3.391ArgLys: 3.391 ± 0.447
4.803ArgLeu: 4.803 ± 0.346
0.807ArgMet: 0.807 ± 0.178
2.745ArgAsn: 2.745 ± 0.294
2.018ArgPro: 2.018 ± 0.255
2.099ArgGln: 2.099 ± 0.357
3.391ArgArg: 3.391 ± 0.454
2.422ArgSer: 2.422 ± 0.279
3.31ArgThr: 3.31 ± 0.377
4.158ArgVal: 4.158 ± 0.375
0.363ArgTrp: 0.363 ± 0.122
1.695ArgTyr: 1.695 ± 0.292
0.0ArgXaa: 0.0 ± 0.0
Ser
4.279SerAla: 4.279 ± 0.613
0.605SerCys: 0.605 ± 0.156
3.31SerAsp: 3.31 ± 0.377
4.117SerGlu: 4.117 ± 0.392
2.341SerPhe: 2.341 ± 0.326
4.158SerGly: 4.158 ± 0.451
1.171SerHis: 1.171 ± 0.191
3.35SerIle: 3.35 ± 0.395
4.198SerLys: 4.198 ± 0.398
5.086SerLeu: 5.086 ± 0.467
1.171SerMet: 1.171 ± 0.188
3.148SerAsn: 3.148 ± 0.36
2.26SerPro: 2.26 ± 0.305
2.18SerGln: 2.18 ± 0.299
2.422SerArg: 2.422 ± 0.33
2.866SerSer: 2.866 ± 0.326
4.117SerThr: 4.117 ± 0.432
3.35SerVal: 3.35 ± 0.374
0.283SerTrp: 0.283 ± 0.101
1.695SerTyr: 1.695 ± 0.309
0.0SerXaa: 0.0 ± 0.0
Thr
5.288ThrAla: 5.288 ± 0.427
0.686ThrCys: 0.686 ± 0.155
3.714ThrAsp: 3.714 ± 0.377
3.794ThrGlu: 3.794 ± 0.46
2.664ThrPhe: 2.664 ± 0.365
5.126ThrGly: 5.126 ± 0.483
1.09ThrHis: 1.09 ± 0.226
3.35ThrIle: 3.35 ± 0.49
3.552ThrLys: 3.552 ± 0.389
6.216ThrLeu: 6.216 ± 0.572
1.695ThrMet: 1.695 ± 0.237
3.148ThrAsn: 3.148 ± 0.354
2.785ThrPro: 2.785 ± 0.419
2.785ThrGln: 2.785 ± 0.3
2.745ThrArg: 2.745 ± 0.276
2.866ThrSer: 2.866 ± 0.312
3.875ThrThr: 3.875 ± 0.522
4.561ThrVal: 4.561 ± 0.454
0.525ThrTrp: 0.525 ± 0.145
1.695ThrTyr: 1.695 ± 0.25
0.0ThrXaa: 0.0 ± 0.0
Val
4.642ValAla: 4.642 ± 0.455
1.049ValCys: 1.049 ± 0.212
4.117ValAsp: 4.117 ± 0.4
5.328ValGlu: 5.328 ± 0.492
2.099ValPhe: 2.099 ± 0.254
3.108ValGly: 3.108 ± 0.346
0.969ValHis: 0.969 ± 0.199
3.714ValIle: 3.714 ± 0.409
4.803ValLys: 4.803 ± 0.487
4.561ValLeu: 4.561 ± 0.423
1.332ValMet: 1.332 ± 0.238
4.319ValAsn: 4.319 ± 0.39
2.543ValPro: 2.543 ± 0.392
1.736ValGln: 1.736 ± 0.247
3.754ValArg: 3.754 ± 0.328
4.763ValSer: 4.763 ± 0.437
5.288ValThr: 5.288 ± 0.592
4.077ValVal: 4.077 ± 0.575
0.484ValTrp: 0.484 ± 0.131
2.22ValTyr: 2.22 ± 0.296
0.0ValXaa: 0.0 ± 0.0
Trp
0.686TrpAla: 0.686 ± 0.157
0.081TrpCys: 0.081 ± 0.061
0.444TrpAsp: 0.444 ± 0.142
0.484TrpGlu: 0.484 ± 0.144
0.121TrpPhe: 0.121 ± 0.077
0.565TrpGly: 0.565 ± 0.151
0.202TrpHis: 0.202 ± 0.116
0.605TrpIle: 0.605 ± 0.176
0.444TrpLys: 0.444 ± 0.161
1.211TrpLeu: 1.211 ± 0.222
0.283TrpMet: 0.283 ± 0.12
0.404TrpAsn: 0.404 ± 0.134
0.0TrpPro: 0.0 ± 0.0
0.404TrpGln: 0.404 ± 0.117
0.525TrpArg: 0.525 ± 0.135
0.363TrpSer: 0.363 ± 0.114
0.161TrpThr: 0.161 ± 0.081
0.605TrpVal: 0.605 ± 0.161
0.161TrpTrp: 0.161 ± 0.085
0.202TrpTyr: 0.202 ± 0.093
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.947TyrAla: 2.947 ± 0.374
0.484TyrCys: 0.484 ± 0.155
2.947TyrAsp: 2.947 ± 0.382
2.18TyrGlu: 2.18 ± 0.318
1.211TyrPhe: 1.211 ± 0.207
1.413TyrGly: 1.413 ± 0.29
0.888TyrHis: 0.888 ± 0.189
1.897TyrIle: 1.897 ± 0.225
1.857TyrLys: 1.857 ± 0.284
2.866TyrLeu: 2.866 ± 0.343
0.565TyrMet: 0.565 ± 0.186
2.18TyrAsn: 2.18 ± 0.354
1.09TyrPro: 1.09 ± 0.193
1.292TyrGln: 1.292 ± 0.282
1.736TyrArg: 1.736 ± 0.254
1.776TyrSer: 1.776 ± 0.298
1.655TyrThr: 1.655 ± 0.262
1.534TyrVal: 1.534 ± 0.232
0.242TyrTrp: 0.242 ± 0.096
0.848TyrTyr: 0.848 ± 0.19
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 125 proteins (24775 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski