Amino acid dipeptide frequency for Vibrio phage SIO-2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
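As a rough illustration of the idea, the sketch below counts overlapping residue pairs in a set of protein sequences and reports each pair in per mille. The function name `dipeptide_permille`, the one-letter input alphabet, and the choice of dividing by the total number of counted pairs are assumptions made for this example; the database may normalize differently.

```python
from collections import Counter

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein strings."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows: residues (0,1), (1,2), ...
        # Pairs are counted within a protein, never across two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input, not taken from the SIO-2 proteome.
toy = ["MKLA", "AKL"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Because the pairs are ordered, "AK" and "KA" are counted separately, which is why the table is not symmetric.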

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
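The uniform expectation and the per mille units described above can be sketched as follows. The helper names are hypothetical, and comparing each cell against a single uniform threshold is an assumption about how the red/blue marking is decided; the two sample values are taken from the table on this page.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def classify(value_permille, expected_permille):
    """Label a dipeptide relative to the expectation (assumed marking rule)."""
    if value_permille > expected_permille:
        return "over"    # would be marked red
    if value_permille < expected_permille:
        return "under"   # would be marked blue
    return "expected"

expected = uniform_expectation_permille()   # 20 amino acids -> 400 pairs
print(expected)                             # 2.5
print(classify(8.598, expected))            # AlaLeu
print(classify(0.121, expected))            # CysCys
```

With 21 symbols (including Xaa) the same function gives 1000/441, roughly 2.27‰.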

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error); rows give the first amino acid, columns the second.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
| Ala | 6.951 ± 0.711 | 0.522 ± 0.154 | 4.58 ± 0.428 | 5.705 ± 0.531 | 3.013 ± 0.395 | 3.536 ± 0.444 | 1.527 ± 0.294 | 5.464 ± 0.45 | 5.946 ± 0.471 | 8.598 ± 0.659 | 2.21 ± 0.25 | 5.344 ± 0.428 | 3.455 ± 0.364 | 2.812 ± 0.502 | 5.103 ± 0.473 | 4.741 ± 0.462 | 4.661 ± 0.603 | 4.741 ± 0.474 | 0.522 ± 0.146 | 2.933 ± 0.311 | 0.0 ± 0.0 |
| Cys | 0.643 ± 0.14 | 0.121 ± 0.092 | 0.482 ± 0.127 | 0.402 ± 0.136 | 0.603 ± 0.181 | 0.763 ± 0.17 | 0.362 ± 0.125 | 0.924 ± 0.185 | 0.844 ± 0.222 | 0.884 ± 0.207 | 0.08 ± 0.062 | 0.603 ± 0.159 | 0.683 ± 0.17 | 0.442 ± 0.121 | 0.522 ± 0.127 | 0.884 ± 0.198 | 0.683 ± 0.214 | 0.683 ± 0.172 | 0.121 ± 0.067 | 0.442 ± 0.146 | 0.0 ± 0.0 |
| Asp | 5.464 ± 0.508 | 0.603 ± 0.131 | 3.496 ± 0.439 | 4.942 ± 0.54 | 2.531 ± 0.276 | 4.179 ± 0.4 | 1.888 ± 0.315 | 3.013 ± 0.329 | 4.098 ± 0.526 | 7.594 ± 0.57 | 1.246 ± 0.205 | 2.25 ± 0.305 | 3.054 ± 0.356 | 2.853 ± 0.304 | 3.054 ± 0.329 | 4.058 ± 0.379 | 3.737 ± 0.431 | 3.576 ± 0.349 | 0.603 ± 0.153 | 2.29 ± 0.355 | 0.0 ± 0.0 |
| Glu | 6.147 ± 0.47 | 0.844 ± 0.177 | 4.54 ± 0.475 | 2.853 ± 0.327 | 2.612 ± 0.284 | 3.496 ± 0.415 | 1.969 ± 0.338 | 4.54 ± 0.467 | 2.812 ± 0.335 | 8.076 ± 0.685 | 1.326 ± 0.23 | 2.571 ± 0.288 | 1.888 ± 0.352 | 2.531 ± 0.366 | 3.576 ± 0.36 | 4.259 ± 0.406 | 3.817 ± 0.346 | 4.138 ± 0.421 | 0.643 ± 0.181 | 2.812 ± 0.328 | 0.0 ± 0.0 |
| Phe | 2.29 ± 0.22 | 0.522 ± 0.157 | 3.335 ± 0.397 | 2.29 ± 0.322 | 1.246 ± 0.182 | 2.25 ± 0.282 | 0.562 ± 0.194 | 2.451 ± 0.331 | 3.094 ± 0.345 | 2.933 ± 0.33 | 0.884 ± 0.138 | 1.848 ± 0.257 | 0.804 ± 0.19 | 0.723 ± 0.157 | 1.326 ± 0.231 | 3.254 ± 0.341 | 2.772 ± 0.354 | 2.129 ± 0.376 | 0.08 ± 0.069 | 1.205 ± 0.253 | 0.0 ± 0.0 |
| Gly | 4.942 ± 0.353 | 0.683 ± 0.182 | 4.942 ± 0.562 | 3.496 ± 0.376 | 2.933 ± 0.336 | 3.897 ± 0.436 | 0.924 ± 0.205 | 3.094 ± 0.4 | 3.576 ± 0.456 | 5.344 ± 0.541 | 1.165 ± 0.238 | 3.054 ± 0.364 | 0.0 ± 0.0 | 1.687 ± 0.215 | 3.094 ± 0.354 | 5.062 ± 0.495 | 2.973 ± 0.343 | 5.304 ± 0.436 | 0.603 ± 0.168 | 2.853 ± 0.353 | 0.0 ± 0.0 |
| His | 1.647 ± 0.245 | 0.04 ± 0.035 | 1.567 ± 0.29 | 1.045 ± 0.172 | 0.844 ± 0.175 | 1.808 ± 0.352 | 0.281 ± 0.114 | 1.286 ± 0.209 | 1.728 ± 0.338 | 2.491 ± 0.362 | 0.362 ± 0.118 | 1.004 ± 0.165 | 0.844 ± 0.183 | 0.402 ± 0.164 | 0.964 ± 0.201 | 0.844 ± 0.171 | 0.683 ± 0.163 | 1.246 ± 0.217 | 0.201 ± 0.078 | 0.763 ± 0.157 | 0.0 ± 0.0 |
| Ile | 5.705 ± 0.509 | 0.683 ± 0.176 | 4.138 ± 0.361 | 4.942 ± 0.448 | 1.487 ± 0.218 | 2.812 ± 0.374 | 0.964 ± 0.242 | 2.893 ± 0.362 | 5.384 ± 0.441 | 4.179 ± 0.484 | 1.205 ± 0.217 | 3.897 ± 0.386 | 2.371 ± 0.335 | 1.848 ± 0.299 | 3.254 ± 0.354 | 3.817 ± 0.385 | 3.455 ± 0.352 | 3.937 ± 0.402 | 0.281 ± 0.094 | 2.33 ± 0.283 | 0.0 ± 0.0 |
| Lys | 6.79 ± 0.569 | 0.562 ± 0.174 | 3.857 ± 0.393 | 4.701 ± 0.462 | 2.17 ± 0.283 | 4.862 ± 0.414 | 1.446 ± 0.262 | 3.094 ± 0.353 | 4.661 ± 0.64 | 6.187 ± 0.503 | 0.964 ± 0.209 | 2.33 ± 0.302 | 3.737 ± 0.456 | 2.772 ± 0.298 | 4.058 ± 0.439 | 3.375 ± 0.461 | 4.179 ± 0.407 | 5.223 ± 0.465 | 0.522 ± 0.147 | 2.491 ± 0.275 | 0.0 ± 0.0 |
| Leu | 7.232 ± 0.617 | 1.246 ± 0.238 | 6.549 ± 0.448 | 6.509 ± 0.476 | 2.21 ± 0.266 | 5.786 ± 0.383 | 2.129 ± 0.336 | 6.228 ± 0.509 | 6.268 ± 0.436 | 6.951 ± 0.637 | 2.371 ± 0.304 | 5.585 ± 0.401 | 3.254 ± 0.439 | 3.335 ± 0.292 | 5.143 ± 0.438 | 6.871 ± 0.516 | 7.433 ± 0.633 | 5.866 ± 0.497 | 0.884 ± 0.194 | 2.772 ± 0.375 | 0.0 ± 0.0 |
| Met | 2.21 ± 0.333 | 0.281 ± 0.102 | 0.643 ± 0.188 | 0.964 ± 0.178 | 1.004 ± 0.203 | 1.286 ± 0.21 | 0.442 ± 0.133 | 1.446 ± 0.215 | 0.964 ± 0.172 | 1.406 ± 0.233 | 0.201 ± 0.088 | 0.442 ± 0.118 | 0.763 ± 0.195 | 0.763 ± 0.165 | 1.366 ± 0.269 | 1.808 ± 0.225 | 2.089 ± 0.29 | 1.286 ± 0.225 | 0.201 ± 0.094 | 0.844 ± 0.205 | 0.0 ± 0.0 |
| Asn | 4.5 ± 0.464 | 0.683 ± 0.171 | 2.893 ± 0.333 | 2.571 ± 0.37 | 1.888 ± 0.321 | 3.897 ± 0.415 | 0.723 ± 0.183 | 2.692 ± 0.358 | 3.094 ± 0.337 | 5.183 ± 0.488 | 1.085 ± 0.212 | 2.29 ± 0.313 | 2.732 ± 0.337 | 2.21 ± 0.297 | 2.451 ± 0.217 | 2.893 ± 0.341 | 3.576 ± 0.307 | 3.696 ± 0.359 | 0.321 ± 0.106 | 1.768 ± 0.31 | 0.0 ± 0.0 |
| Pro | 1.567 ± 0.267 | 0.603 ± 0.161 | 3.174 ± 0.337 | 2.973 ± 0.339 | 1.246 ± 0.228 | 0.0 ± 0.0 | 0.924 ± 0.207 | 2.009 ± 0.292 | 3.054 ± 0.358 | 3.375 ± 0.352 | 1.045 ± 0.21 | 2.21 ± 0.31 | 1.125 ± 0.207 | 0.844 ± 0.178 | 1.487 ± 0.257 | 3.455 ± 0.398 | 2.531 ± 0.286 | 3.174 ± 0.354 | 0.321 ± 0.121 | 1.286 ± 0.242 | 0.0 ± 0.0 |
| Gln | 3.375 ± 0.45 | 0.281 ± 0.109 | 1.929 ± 0.26 | 2.129 ± 0.352 | 1.165 ± 0.169 | 3.134 ± 0.317 | 0.643 ± 0.148 | 2.29 ± 0.32 | 1.929 ± 0.292 | 3.335 ± 0.362 | 0.643 ± 0.151 | 1.085 ± 0.196 | 0.884 ± 0.196 | 0.804 ± 0.162 | 1.808 ± 0.344 | 2.371 ± 0.309 | 2.33 ± 0.257 | 2.571 ± 0.304 | 0.402 ± 0.114 | 0.964 ± 0.202 | 0.0 ± 0.0 |
| Arg | 5.464 ± 0.549 | 0.442 ± 0.134 | 3.013 ± 0.308 | 4.138 ± 0.475 | 1.768 ± 0.275 | 3.415 ± 0.419 | 0.924 ± 0.216 | 3.455 ± 0.365 | 3.937 ± 0.455 | 5.143 ± 0.47 | 0.763 ± 0.19 | 2.893 ± 0.355 | 2.21 ± 0.303 | 1.768 ± 0.344 | 3.375 ± 0.413 | 2.612 ± 0.288 | 2.25 ± 0.318 | 3.937 ± 0.409 | 0.362 ± 0.124 | 1.888 ± 0.343 | 0.0 ± 0.0 |
| Ser | 4.54 ± 0.467 | 0.723 ± 0.192 | 3.696 ± 0.389 | 3.857 ± 0.388 | 3.054 ± 0.402 | 4.621 ± 0.461 | 1.326 ± 0.243 | 4.138 ± 0.418 | 5.183 ± 0.474 | 6.549 ± 0.525 | 1.487 ± 0.245 | 4.179 ± 0.408 | 2.451 ± 0.394 | 2.21 ± 0.273 | 3.254 ± 0.327 | 4.781 ± 0.445 | 4.138 ± 0.393 | 4.58 ± 0.402 | 0.281 ± 0.103 | 2.29 ± 0.285 | 0.0 ± 0.0 |
| Thr | 4.741 ± 0.538 | 0.603 ± 0.143 | 3.696 ± 0.385 | 3.656 ± 0.476 | 2.371 ± 0.276 | 4.339 ± 0.422 | 0.844 ± 0.189 | 3.496 ± 0.367 | 4.138 ± 0.369 | 6.549 ± 0.567 | 1.246 ± 0.197 | 3.295 ± 0.371 | 2.451 ± 0.252 | 2.25 ± 0.265 | 3.455 ± 0.336 | 3.455 ± 0.445 | 4.179 ± 0.461 | 5.062 ± 0.43 | 0.281 ± 0.11 | 2.17 ± 0.306 | 0.0 ± 0.0 |
| Val | 4.379 ± 0.417 | 1.004 ± 0.235 | 4.42 ± 0.494 | 5.384 ± 0.492 | 2.491 ± 0.304 | 3.335 ± 0.323 | 1.246 ± 0.283 | 4.5 ± 0.352 | 4.942 ± 0.487 | 5.464 ± 0.469 | 1.165 ± 0.216 | 4.138 ± 0.357 | 2.371 ± 0.259 | 1.969 ± 0.277 | 3.857 ± 0.403 | 5.906 ± 0.445 | 4.821 ± 0.481 | 5.143 ± 0.601 | 0.482 ± 0.146 | 2.21 ± 0.284 | 0.0 ± 0.0 |
| Trp | 0.844 ± 0.145 | 0.121 ± 0.069 | 0.402 ± 0.122 | 0.321 ± 0.1 | 0.402 ± 0.154 | 0.522 ± 0.148 | 0.161 ± 0.078 | 0.643 ± 0.158 | 0.201 ± 0.075 | 0.763 ± 0.169 | 0.08 ± 0.054 | 0.402 ± 0.12 | 0.0 ± 0.0 | 0.281 ± 0.098 | 0.562 ± 0.169 | 0.562 ± 0.157 | 0.241 ± 0.097 | 0.482 ± 0.15 | 0.241 ± 0.093 | 0.362 ± 0.121 | 0.0 ± 0.0 |
| Tyr | 2.933 ± 0.342 | 0.603 ± 0.169 | 3.174 ± 0.406 | 2.571 ± 0.277 | 0.964 ± 0.199 | 1.687 ± 0.245 | 0.763 ± 0.131 | 1.929 ± 0.253 | 2.17 ± 0.269 | 3.616 ± 0.381 | 0.844 ± 0.185 | 1.728 ± 0.268 | 1.326 ± 0.261 | 1.567 ± 0.284 | 2.049 ± 0.297 | 2.33 ± 0.28 | 1.728 ± 0.253 | 2.451 ± 0.315 | 0.241 ± 0.093 | 1.728 ± 0.243 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 114 proteins (24890 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
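One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the frequency on each resampled set, and report the spread of those estimates. The sketch below follows that reading; using the sample standard deviation as the reported error, the fixed seed, and the helper names are all assumptions, since the page does not state the exact procedure.

```python
import random
import statistics
from collections import Counter

def pair_permille(sequences, pair):
    """Per mille frequency of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Spread of the estimate when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Resample proteins, not residues: each draw is a complete sequence.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is the sample standard deviation of the replicates.
    return statistics.stdev(estimates)

# Hypothetical toy proteome, not the SIO-2 data.
toy_proteome = ["MKLAKL", "AKLV", "MVVKA", "KLKL"]
print(bootstrap_error(toy_proteome, "KL"))
print(bootstrap_error(toy_proteome, "WW"))  # pair never observed -> 0.0
```

A dipeptide that never occurs has frequency 0 in every replicate, which matches the "0.0 ± 0.0" entries in the table.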

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski