Amino acid dipepetide frequency for Ochrobactrum phage POI1126

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.515AlaAla: 15.515 ± 1.318
0.705AlaCys: 0.705 ± 0.269
8.029AlaAsp: 8.029 ± 0.846
8.463AlaGlu: 8.463 ± 0.785
4.177AlaPhe: 4.177 ± 0.44
8.246AlaGly: 8.246 ± 1.155
1.627AlaHis: 1.627 ± 0.321
6.618AlaIle: 6.618 ± 0.857
6.727AlaLys: 6.727 ± 0.661
8.951AlaLeu: 8.951 ± 0.791
3.092AlaMet: 3.092 ± 0.58
4.231AlaAsn: 4.231 ± 0.555
5.316AlaPro: 5.316 ± 0.742
6.293AlaGln: 6.293 ± 0.723
8.842AlaArg: 8.842 ± 1.173
6.727AlaSer: 6.727 ± 0.702
4.937AlaThr: 4.937 ± 0.629
6.455AlaVal: 6.455 ± 0.619
1.899AlaTrp: 1.899 ± 0.3
2.17AlaTyr: 2.17 ± 0.337
0.0AlaXaa: 0.0 ± 0.0
Cys
0.814CysAla: 0.814 ± 0.271
0.0CysCys: 0.0 ± 0.0
0.434CysAsp: 0.434 ± 0.172
0.651CysGlu: 0.651 ± 0.255
0.217CysPhe: 0.217 ± 0.129
0.542CysGly: 0.542 ± 0.208
0.108CysHis: 0.108 ± 0.097
0.488CysIle: 0.488 ± 0.207
0.163CysLys: 0.163 ± 0.11
0.542CysLeu: 0.542 ± 0.228
0.054CysMet: 0.054 ± 0.06
0.38CysAsn: 0.38 ± 0.183
0.542CysPro: 0.542 ± 0.285
0.325CysGln: 0.325 ± 0.16
0.38CysArg: 0.38 ± 0.176
0.325CysSer: 0.325 ± 0.153
0.271CysThr: 0.271 ± 0.135
0.434CysVal: 0.434 ± 0.175
0.108CysTrp: 0.108 ± 0.101
0.271CysTyr: 0.271 ± 0.129
0.0CysXaa: 0.0 ± 0.0
Asp
8.788AspAla: 8.788 ± 0.8
0.163AspCys: 0.163 ± 0.121
5.045AspAsp: 5.045 ± 0.777
4.014AspGlu: 4.014 ± 0.587
2.387AspPhe: 2.387 ± 0.388
5.479AspGly: 5.479 ± 0.398
1.031AspHis: 1.031 ± 0.33
4.231AspIle: 4.231 ± 0.538
3.092AspLys: 3.092 ± 0.459
4.937AspLeu: 4.937 ± 0.501
1.573AspMet: 1.573 ± 0.302
2.767AspAsn: 2.767 ± 0.536
3.58AspPro: 3.58 ± 0.399
2.116AspGln: 2.116 ± 0.281
4.665AspArg: 4.665 ± 0.471
2.333AspSer: 2.333 ± 0.409
3.201AspThr: 3.201 ± 0.487
3.092AspVal: 3.092 ± 0.359
1.356AspTrp: 1.356 ± 0.229
2.224AspTyr: 2.224 ± 0.429
0.0AspXaa: 0.0 ± 0.0
Glu
8.3GluAla: 8.3 ± 0.756
0.597GluCys: 0.597 ± 0.252
3.418GluAsp: 3.418 ± 0.736
4.286GluGlu: 4.286 ± 0.619
1.356GluPhe: 1.356 ± 0.317
3.472GluGly: 3.472 ± 0.396
1.302GluHis: 1.302 ± 0.209
4.611GluIle: 4.611 ± 0.546
3.96GluLys: 3.96 ± 0.576
6.076GluLeu: 6.076 ± 0.784
1.899GluMet: 1.899 ± 0.305
2.278GluAsn: 2.278 ± 0.361
2.984GluPro: 2.984 ± 0.464
3.255GluGln: 3.255 ± 0.59
5.425GluArg: 5.425 ± 0.485
2.278GluSer: 2.278 ± 0.308
3.797GluThr: 3.797 ± 0.373
2.875GluVal: 2.875 ± 0.32
1.248GluTrp: 1.248 ± 0.287
2.061GluTyr: 2.061 ± 0.3
0.0GluXaa: 0.0 ± 0.0
Phe
4.014PheAla: 4.014 ± 0.493
0.271PheCys: 0.271 ± 0.139
2.658PheAsp: 2.658 ± 0.4
1.736PheGlu: 1.736 ± 0.391
1.193PhePhe: 1.193 ± 0.295
2.767PheGly: 2.767 ± 0.44
0.922PheHis: 0.922 ± 0.199
1.79PheIle: 1.79 ± 0.28
1.465PheLys: 1.465 ± 0.28
2.116PheLeu: 2.116 ± 0.379
0.922PheMet: 0.922 ± 0.187
1.193PheAsn: 1.193 ± 0.247
1.248PhePro: 1.248 ± 0.377
1.248PheGln: 1.248 ± 0.246
1.844PheArg: 1.844 ± 0.295
2.278PheSer: 2.278 ± 0.363
2.116PheThr: 2.116 ± 0.413
2.116PheVal: 2.116 ± 0.37
0.759PheTrp: 0.759 ± 0.239
0.922PheTyr: 0.922 ± 0.238
0.0PheXaa: 0.0 ± 0.0
Gly
7.595GlyAla: 7.595 ± 0.915
0.434GlyCys: 0.434 ± 0.2
4.937GlyAsp: 4.937 ± 0.559
4.882GlyGlu: 4.882 ± 0.434
2.658GlyPhe: 2.658 ± 0.263
5.75GlyGly: 5.75 ± 0.682
1.031GlyHis: 1.031 ± 0.228
3.201GlyIle: 3.201 ± 0.36
5.316GlyLys: 5.316 ± 0.675
5.099GlyLeu: 5.099 ± 0.695
2.495GlyMet: 2.495 ± 0.387
2.984GlyAsn: 2.984 ± 0.652
2.17GlyPro: 2.17 ± 0.344
2.604GlyGln: 2.604 ± 0.416
5.154GlyArg: 5.154 ± 0.503
3.96GlySer: 3.96 ± 0.667
4.557GlyThr: 4.557 ± 0.619
4.448GlyVal: 4.448 ± 0.443
1.356GlyTrp: 1.356 ± 0.411
2.116GlyTyr: 2.116 ± 0.389
0.0GlyXaa: 0.0 ± 0.0
His
2.387HisAla: 2.387 ± 0.444
0.217HisCys: 0.217 ± 0.12
0.814HisAsp: 0.814 ± 0.185
1.248HisGlu: 1.248 ± 0.315
0.868HisPhe: 0.868 ± 0.222
0.868HisGly: 0.868 ± 0.198
0.38HisHis: 0.38 ± 0.153
1.302HisIle: 1.302 ± 0.311
0.814HisLys: 0.814 ± 0.204
0.922HisLeu: 0.922 ± 0.271
0.434HisMet: 0.434 ± 0.167
0.597HisAsn: 0.597 ± 0.168
0.868HisPro: 0.868 ± 0.278
0.597HisGln: 0.597 ± 0.18
1.573HisArg: 1.573 ± 0.381
0.976HisSer: 0.976 ± 0.256
0.705HisThr: 0.705 ± 0.201
1.465HisVal: 1.465 ± 0.259
0.38HisTrp: 0.38 ± 0.185
0.542HisTyr: 0.542 ± 0.183
0.0HisXaa: 0.0 ± 0.0
Ile
6.184IleAla: 6.184 ± 0.851
0.434IleCys: 0.434 ± 0.201
4.177IleAsp: 4.177 ± 0.507
4.448IleGlu: 4.448 ± 0.53
1.465IlePhe: 1.465 ± 0.419
4.286IleGly: 4.286 ± 0.607
0.868IleHis: 0.868 ± 0.225
3.255IleIle: 3.255 ± 0.512
1.79IleLys: 1.79 ± 0.36
3.309IleLeu: 3.309 ± 0.407
0.651IleMet: 0.651 ± 0.234
2.17IleAsn: 2.17 ± 0.321
2.495IlePro: 2.495 ± 0.375
2.007IleGln: 2.007 ± 0.408
3.689IleArg: 3.689 ± 0.504
2.495IleSer: 2.495 ± 0.457
3.852IleThr: 3.852 ± 0.531
4.394IleVal: 4.394 ± 0.625
0.868IleTrp: 0.868 ± 0.197
1.519IleTyr: 1.519 ± 0.317
0.0IleXaa: 0.0 ± 0.0
Lys
7.378LysAla: 7.378 ± 0.917
0.271LysCys: 0.271 ± 0.136
2.821LysAsp: 2.821 ± 0.579
3.146LysGlu: 3.146 ± 0.479
1.031LysPhe: 1.031 ± 0.218
3.743LysGly: 3.743 ± 0.734
0.814LysHis: 0.814 ± 0.197
1.736LysIle: 1.736 ± 0.229
3.363LysLys: 3.363 ± 0.51
4.014LysLeu: 4.014 ± 0.5
0.976LysMet: 0.976 ± 0.267
2.278LysAsn: 2.278 ± 0.297
3.472LysPro: 3.472 ± 0.545
2.658LysGln: 2.658 ± 0.49
4.286LysArg: 4.286 ± 0.546
3.309LysSer: 3.309 ± 0.424
2.604LysThr: 2.604 ± 0.342
2.495LysVal: 2.495 ± 0.384
0.868LysTrp: 0.868 ± 0.223
1.302LysTyr: 1.302 ± 0.205
0.0LysXaa: 0.0 ± 0.0
Leu
9.819LeuAla: 9.819 ± 0.791
0.651LeuCys: 0.651 ± 0.242
4.882LeuAsp: 4.882 ± 0.575
4.937LeuGlu: 4.937 ± 0.523
1.79LeuPhe: 1.79 ± 0.374
6.076LeuGly: 6.076 ± 0.682
1.953LeuHis: 1.953 ± 0.428
3.906LeuIle: 3.906 ± 0.702
3.255LeuLys: 3.255 ± 0.558
5.371LeuLeu: 5.371 ± 0.837
1.519LeuMet: 1.519 ± 0.495
3.255LeuAsn: 3.255 ± 0.425
4.069LeuPro: 4.069 ± 0.485
2.55LeuGln: 2.55 ± 0.411
5.913LeuArg: 5.913 ± 0.614
4.882LeuSer: 4.882 ± 0.819
4.394LeuThr: 4.394 ± 0.471
3.201LeuVal: 3.201 ± 0.452
1.085LeuTrp: 1.085 ± 0.423
2.007LeuTyr: 2.007 ± 0.293
0.0LeuXaa: 0.0 ± 0.0
Met
1.844MetAla: 1.844 ± 0.273
0.163MetCys: 0.163 ± 0.098
1.139MetAsp: 1.139 ± 0.242
1.41MetGlu: 1.41 ± 0.32
0.705MetPhe: 0.705 ± 0.246
1.41MetGly: 1.41 ± 0.258
0.488MetHis: 0.488 ± 0.168
1.085MetIle: 1.085 ± 0.202
1.519MetLys: 1.519 ± 0.26
1.573MetLeu: 1.573 ± 0.321
0.651MetMet: 0.651 ± 0.189
1.031MetAsn: 1.031 ± 0.258
1.736MetPro: 1.736 ± 0.355
1.465MetGln: 1.465 ± 0.294
1.844MetArg: 1.844 ± 0.296
2.333MetSer: 2.333 ± 0.341
1.682MetThr: 1.682 ± 0.411
1.519MetVal: 1.519 ± 0.299
0.325MetTrp: 0.325 ± 0.132
0.271MetTyr: 0.271 ± 0.116
0.0MetXaa: 0.0 ± 0.0
Asn
4.774AsnAla: 4.774 ± 0.54
0.325AsnCys: 0.325 ± 0.145
1.844AsnAsp: 1.844 ± 0.292
2.278AsnGlu: 2.278 ± 0.419
1.736AsnPhe: 1.736 ± 0.318
3.743AsnGly: 3.743 ± 0.459
0.814AsnHis: 0.814 ± 0.218
1.953AsnIle: 1.953 ± 0.391
1.356AsnLys: 1.356 ± 0.249
2.929AsnLeu: 2.929 ± 0.423
0.814AsnMet: 0.814 ± 0.186
1.356AsnAsn: 1.356 ± 0.228
2.767AsnPro: 2.767 ± 0.466
0.976AsnGln: 0.976 ± 0.25
2.929AsnArg: 2.929 ± 0.409
2.007AsnSer: 2.007 ± 0.444
1.736AsnThr: 1.736 ± 0.224
2.333AsnVal: 2.333 ± 0.326
0.542AsnTrp: 0.542 ± 0.244
1.085AsnTyr: 1.085 ± 0.247
0.0AsnXaa: 0.0 ± 0.0
Pro
6.238ProAla: 6.238 ± 0.729
0.542ProCys: 0.542 ± 0.214
4.014ProAsp: 4.014 ± 0.533
4.014ProGlu: 4.014 ± 0.5
1.193ProPhe: 1.193 ± 0.227
3.689ProGly: 3.689 ± 0.427
1.031ProHis: 1.031 ± 0.296
2.658ProIle: 2.658 ± 0.311
2.116ProLys: 2.116 ± 0.406
3.146ProLeu: 3.146 ± 0.553
0.814ProMet: 0.814 ± 0.219
1.953ProAsn: 1.953 ± 0.32
2.278ProPro: 2.278 ± 0.55
1.953ProGln: 1.953 ± 0.485
2.712ProArg: 2.712 ± 0.382
3.363ProSer: 3.363 ± 0.44
2.984ProThr: 2.984 ± 0.415
3.743ProVal: 3.743 ± 0.419
0.434ProTrp: 0.434 ± 0.139
1.031ProTyr: 1.031 ± 0.23
0.0ProXaa: 0.0 ± 0.0
Gln
5.099GlnAla: 5.099 ± 0.759
0.271GlnCys: 0.271 ± 0.174
2.061GlnAsp: 2.061 ± 0.341
1.953GlnGlu: 1.953 ± 0.243
1.682GlnPhe: 1.682 ± 0.356
2.495GlnGly: 2.495 ± 0.288
1.031GlnHis: 1.031 ± 0.221
2.604GlnIle: 2.604 ± 0.342
2.767GlnLys: 2.767 ± 0.396
3.743GlnLeu: 3.743 ± 0.521
1.031GlnMet: 1.031 ± 0.255
0.976GlnAsn: 0.976 ± 0.241
2.007GlnPro: 2.007 ± 0.331
2.767GlnGln: 2.767 ± 0.474
3.526GlnArg: 3.526 ± 0.404
2.875GlnSer: 2.875 ± 0.522
2.116GlnThr: 2.116 ± 0.331
1.465GlnVal: 1.465 ± 0.274
0.705GlnTrp: 0.705 ± 0.164
0.705GlnTyr: 0.705 ± 0.209
0.0GlnXaa: 0.0 ± 0.0
Arg
8.517ArgAla: 8.517 ± 0.771
0.434ArgCys: 0.434 ± 0.214
4.557ArgAsp: 4.557 ± 0.503
4.286ArgGlu: 4.286 ± 0.538
2.929ArgPhe: 2.929 ± 0.483
3.743ArgGly: 3.743 ± 0.436
1.302ArgHis: 1.302 ± 0.307
3.743ArgIle: 3.743 ± 0.479
4.448ArgLys: 4.448 ± 0.55
7.161ArgLeu: 7.161 ± 0.922
1.573ArgMet: 1.573 ± 0.257
2.821ArgAsn: 2.821 ± 0.365
3.363ArgPro: 3.363 ± 0.464
3.472ArgGln: 3.472 ± 0.378
4.882ArgArg: 4.882 ± 0.732
3.96ArgSer: 3.96 ± 0.5
3.689ArgThr: 3.689 ± 0.46
4.177ArgVal: 4.177 ± 0.588
1.085ArgTrp: 1.085 ± 0.3
2.17ArgTyr: 2.17 ± 0.359
0.0ArgXaa: 0.0 ± 0.0
Ser
6.293SerAla: 6.293 ± 0.681
0.488SerCys: 0.488 ± 0.206
4.394SerAsp: 4.394 ± 0.572
3.906SerGlu: 3.906 ± 0.453
2.495SerPhe: 2.495 ± 0.366
4.665SerGly: 4.665 ± 0.618
0.922SerHis: 0.922 ± 0.264
3.418SerIle: 3.418 ± 0.379
2.495SerLys: 2.495 ± 0.342
4.177SerLeu: 4.177 ± 0.503
1.248SerMet: 1.248 ± 0.241
1.899SerAsn: 1.899 ± 0.313
3.038SerPro: 3.038 ± 0.422
2.007SerGln: 2.007 ± 0.358
2.767SerArg: 2.767 ± 0.398
2.495SerSer: 2.495 ± 0.446
2.604SerThr: 2.604 ± 0.363
3.418SerVal: 3.418 ± 0.449
0.651SerTrp: 0.651 ± 0.222
1.573SerTyr: 1.573 ± 0.212
0.0SerXaa: 0.0 ± 0.0
Thr
5.859ThrAla: 5.859 ± 0.748
0.434ThrCys: 0.434 ± 0.179
3.526ThrAsp: 3.526 ± 0.468
3.255ThrGlu: 3.255 ± 0.384
1.627ThrPhe: 1.627 ± 0.388
5.099ThrGly: 5.099 ± 0.549
0.651ThrHis: 0.651 ± 0.179
3.255ThrIle: 3.255 ± 0.404
2.929ThrLys: 2.929 ± 0.379
4.069ThrLeu: 4.069 ± 0.662
1.356ThrMet: 1.356 ± 0.273
2.116ThrAsn: 2.116 ± 0.269
3.363ThrPro: 3.363 ± 0.42
1.573ThrGln: 1.573 ± 0.268
3.418ThrArg: 3.418 ± 0.445
2.441ThrSer: 2.441 ± 0.378
2.441ThrThr: 2.441 ± 0.341
4.123ThrVal: 4.123 ± 0.563
0.651ThrTrp: 0.651 ± 0.175
1.573ThrTyr: 1.573 ± 0.314
0.0ThrXaa: 0.0 ± 0.0
Val
5.371ValAla: 5.371 ± 0.644
0.325ValCys: 0.325 ± 0.175
4.231ValAsp: 4.231 ± 0.416
3.852ValGlu: 3.852 ± 0.442
2.116ValPhe: 2.116 ± 0.383
3.635ValGly: 3.635 ± 0.484
0.922ValHis: 0.922 ± 0.217
2.712ValIle: 2.712 ± 0.375
3.146ValLys: 3.146 ± 0.504
4.069ValLeu: 4.069 ± 0.421
1.899ValMet: 1.899 ± 0.313
2.55ValAsn: 2.55 ± 0.424
2.767ValPro: 2.767 ± 0.474
2.224ValGln: 2.224 ± 0.394
4.557ValArg: 4.557 ± 0.483
4.014ValSer: 4.014 ± 0.523
3.906ValThr: 3.906 ± 0.41
3.146ValVal: 3.146 ± 0.578
1.139ValTrp: 1.139 ± 0.251
1.302ValTyr: 1.302 ± 0.261
0.0ValXaa: 0.0 ± 0.0
Trp
1.302TrpAla: 1.302 ± 0.257
0.163TrpCys: 0.163 ± 0.095
0.868TrpAsp: 0.868 ± 0.231
0.759TrpGlu: 0.759 ± 0.211
0.814TrpPhe: 0.814 ± 0.227
0.976TrpGly: 0.976 ± 0.173
0.434TrpHis: 0.434 ± 0.14
0.759TrpIle: 0.759 ± 0.204
0.922TrpLys: 0.922 ± 0.163
1.627TrpLeu: 1.627 ± 0.494
0.488TrpMet: 0.488 ± 0.204
0.705TrpAsn: 0.705 ± 0.212
0.759TrpPro: 0.759 ± 0.251
0.434TrpGln: 0.434 ± 0.157
1.41TrpArg: 1.41 ± 0.275
1.139TrpSer: 1.139 ± 0.261
1.031TrpThr: 1.031 ± 0.279
1.085TrpVal: 1.085 ± 0.253
0.325TrpTrp: 0.325 ± 0.159
0.434TrpTyr: 0.434 ± 0.15
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.929TyrAla: 2.929 ± 0.57
0.163TyrCys: 0.163 ± 0.099
2.224TyrAsp: 2.224 ± 0.351
1.953TyrGlu: 1.953 ± 0.374
1.139TyrPhe: 1.139 ± 0.287
2.061TyrGly: 2.061 ± 0.326
0.325TyrHis: 0.325 ± 0.167
1.031TyrIle: 1.031 ± 0.245
0.922TyrLys: 0.922 ± 0.306
1.682TyrLeu: 1.682 ± 0.343
0.651TyrMet: 0.651 ± 0.171
0.759TyrAsn: 0.759 ± 0.204
1.085TyrPro: 1.085 ± 0.295
1.356TyrGln: 1.356 ± 0.259
2.495TyrArg: 2.495 ± 0.397
0.759TyrSer: 0.759 ± 0.216
1.193TyrThr: 1.193 ± 0.268
1.899TyrVal: 1.899 ± 0.387
0.651TyrTrp: 0.651 ± 0.191
0.759TyrTyr: 0.759 ± 0.222
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (18435 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski