Amino acid dipeptide frequency for Bacteroides phage Barc2635

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
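As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping adjacent residue pairs within each protein and compares them with the uniform 1/400 expectation. The function names and the two example sequences are hypothetical, and dividing by the total number of dipeptides is an assumption; this page does not state which normalization Proteome-pI uses.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Pairs never span two proteins: zip stops at the end of each sequence.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    return counts


def dipeptide_frequencies(proteins):
    """Return pair -> fraction of all counted dipeptides (assumed normalization)."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from the phage proteome.
proteins = ["MKKLLE", "MKEL"]
freqs = dipeptide_frequencies(proteins)

# Uniform expectation if all 400 standard dipeptides were equally likely.
uniform = 1 / 400

for pair, freq in sorted(freqs.items()):
    label = "over" if freq > uniform else "under"
    print(f"{pair}: {freq * 1000:.3f} per mille ({label} the uniform expectation)")
```

With only a handful of residues every observed pair is far above 2.5‰, so the comparison becomes meaningful only for whole proteomes such as the 67 proteins summarized here.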

All values are presented in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
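The unit conversion and the comparison with the uniform expectation can be sketched as follows. The three values are copied from the table on this page; the helper names are hypothetical, and the simple greater-than/less-than rule against 2.5‰ is an assumption about how over- and underrepresentation is decided, since the page does not spell out its coloring rule.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000 / 400  # 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Three entries copied from the table (per mille).
table = {"AlaAla": 3.911, "CysCys": 0.279, "LeuSer": 8.661}

for pair, value in table.items():
    fraction = per_mille_to_fraction(value)
    print(f"{pair}: {value} per mille = {fraction:.6f} ({classify(value)})")
```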

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.911 ± 0.739
AlaCys: 0.908 ± 0.292
AlaAsp: 3.143 ± 0.344
AlaGlu: 4.261 ± 0.575
AlaPhe: 3.423 ± 0.528
AlaGly: 3.213 ± 0.652
AlaHis: 0.559 ± 0.209
AlaIle: 4.4 ± 0.644
AlaLys: 5.588 ± 0.652
AlaLeu: 4.959 ± 0.452
AlaMet: 1.467 ± 0.352
AlaAsn: 4.261 ± 0.684
AlaPro: 0.768 ± 0.294
AlaGln: 2.235 ± 0.578
AlaArg: 2.375 ± 0.638
AlaSer: 2.724 ± 0.382
AlaThr: 3.842 ± 0.74
AlaVal: 3.283 ± 0.493
AlaTrp: 0.768 ± 0.239
AlaTyr: 1.886 ± 0.38
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.768 ± 0.291
CysCys: 0.279 ± 0.155
CysAsp: 1.187 ± 0.287
CysGlu: 1.257 ± 0.437
CysPhe: 0.559 ± 0.214
CysGly: 1.327 ± 0.406
CysHis: 0.349 ± 0.198
CysIle: 0.978 ± 0.34
CysLys: 1.257 ± 0.312
CysLeu: 1.327 ± 0.385
CysMet: 0.21 ± 0.141
CysAsn: 0.629 ± 0.203
CysPro: 0.768 ± 0.3
CysGln: 0.279 ± 0.148
CysArg: 0.768 ± 0.213
CysSer: 0.698 ± 0.294
CysThr: 0.419 ± 0.162
CysVal: 1.187 ± 0.451
CysTrp: 0.07 ± 0.071
CysTyr: 0.838 ± 0.269
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.375 ± 0.498
AspCys: 0.908 ± 0.259
AspAsp: 2.654 ± 0.517
AspGlu: 4.331 ± 0.532
AspPhe: 2.235 ± 0.356
AspGly: 3.492 ± 0.561
AspHis: 0.419 ± 0.164
AspIle: 3.772 ± 0.537
AspLys: 5.239 ± 0.758
AspLeu: 6.286 ± 0.748
AspMet: 0.908 ± 0.308
AspAsn: 3.632 ± 0.541
AspPro: 1.816 ± 0.328
AspGln: 1.327 ± 0.292
AspArg: 2.584 ± 0.592
AspSer: 5.169 ± 0.621
AspThr: 3.213 ± 0.415
AspVal: 3.772 ± 0.625
AspTrp: 0.838 ± 0.245
AspTyr: 2.445 ± 0.383
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.658 ± 0.583
GluCys: 0.768 ± 0.293
GluAsp: 2.864 ± 0.394
GluGlu: 6.635 ± 0.984
GluPhe: 3.702 ± 0.486
GluGly: 4.191 ± 0.548
GluHis: 1.257 ± 0.272
GluIle: 5.937 ± 0.687
GluLys: 7.404 ± 1.023
GluLeu: 8.032 ± 1.052
GluMet: 2.305 ± 0.441
GluAsn: 4.331 ± 0.633
GluPro: 1.746 ± 0.358
GluGln: 3.073 ± 0.579
GluArg: 5.588 ± 1.129
GluSer: 4.261 ± 0.6
GluThr: 4.121 ± 0.633
GluVal: 4.68 ± 0.576
GluTrp: 0.629 ± 0.19
GluTyr: 3.911 ± 0.459
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.934 ± 0.595
PheCys: 0.768 ± 0.285
PheAsp: 3.213 ± 0.49
PheGlu: 3.842 ± 0.642
PhePhe: 1.257 ± 0.316
PheGly: 2.864 ± 0.432
PheHis: 0.908 ± 0.247
PheIle: 3.003 ± 0.411
PheLys: 3.213 ± 0.434
PheLeu: 2.864 ± 0.343
PheMet: 1.537 ± 0.349
PheAsn: 2.864 ± 0.503
PhePro: 1.397 ± 0.341
PheGln: 1.746 ± 0.383
PheArg: 1.956 ± 0.297
PheSer: 3.073 ± 0.427
PheThr: 2.165 ± 0.481
PheVal: 2.584 ± 0.49
PheTrp: 0.559 ± 0.217
PheTyr: 1.956 ± 0.413
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.934 ± 0.648
GlyCys: 1.187 ± 0.317
GlyAsp: 3.842 ± 0.425
GlyGlu: 4.959 ± 0.56
GlyPhe: 3.353 ± 0.503
GlyGly: 4.959 ± 0.844
GlyHis: 0.419 ± 0.168
GlyIle: 3.632 ± 0.423
GlyLys: 6.216 ± 0.763
GlyLeu: 3.423 ± 0.426
GlyMet: 1.187 ± 0.297
GlyAsn: 3.842 ± 0.632
GlyPro: 0.0 ± 0.0
GlyGln: 1.816 ± 0.447
GlyArg: 2.235 ± 0.408
GlySer: 4.68 ± 0.684
GlyThr: 2.794 ± 0.595
GlyVal: 3.911 ± 0.652
GlyTrp: 0.629 ± 0.196
GlyTyr: 3.003 ± 0.399
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.419 ± 0.164
HisCys: 0.419 ± 0.189
HisAsp: 1.048 ± 0.275
HisGlu: 1.327 ± 0.269
HisPhe: 0.768 ± 0.207
HisGly: 0.768 ± 0.218
HisHis: 0.0 ± 0.0
HisIle: 1.397 ± 0.379
HisLys: 0.698 ± 0.252
HisLeu: 0.908 ± 0.234
HisMet: 0.21 ± 0.116
HisAsn: 1.048 ± 0.297
HisPro: 0.629 ± 0.173
HisGln: 0.279 ± 0.143
HisArg: 0.279 ± 0.143
HisSer: 0.349 ± 0.154
HisThr: 0.629 ± 0.217
HisVal: 0.908 ± 0.317
HisTrp: 0.0 ± 0.0
HisTyr: 0.768 ± 0.2
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.051 ± 0.575
IleCys: 1.746 ± 0.431
IleAsp: 5.727 ± 0.659
IleGlu: 6.775 ± 0.685
IlePhe: 2.514 ± 0.4
IleGly: 3.353 ± 0.568
IleHis: 0.698 ± 0.276
IleIle: 3.423 ± 0.494
IleLys: 5.448 ± 0.952
IleLeu: 4.331 ± 0.485
IleMet: 2.026 ± 0.365
IleAsn: 3.981 ± 0.562
IlePro: 2.305 ± 0.398
IleGln: 2.724 ± 0.453
IleArg: 3.143 ± 0.469
IleSer: 6.566 ± 0.841
IleThr: 4.261 ± 0.668
IleVal: 3.492 ± 0.675
IleTrp: 0.698 ± 0.193
IleTyr: 2.584 ± 0.494
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.937 ± 0.643
LysCys: 1.118 ± 0.433
LysAsp: 4.051 ± 0.566
LysGlu: 8.451 ± 1.09
LysPhe: 3.423 ± 0.54
LysGly: 4.61 ± 0.595
LysHis: 1.327 ± 0.326
LysIle: 5.937 ± 0.637
LysLys: 7.474 ± 1.119
LysLeu: 7.474 ± 0.927
LysMet: 2.375 ± 0.555
LysAsn: 4.331 ± 0.426
LysPro: 2.445 ± 0.402
LysGln: 3.772 ± 0.667
LysArg: 4.889 ± 0.81
LysSer: 5.169 ± 0.66
LysThr: 5.239 ± 0.506
LysVal: 5.867 ± 0.721
LysTrp: 0.908 ± 0.25
LysTyr: 4.68 ± 0.581
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.191 ± 0.551
LeuCys: 1.187 ± 0.404
LeuAsp: 4.121 ± 0.657
LeuGlu: 4.61 ± 0.79
LeuPhe: 2.654 ± 0.493
LeuGly: 3.213 ± 0.449
LeuHis: 0.838 ± 0.25
LeuIle: 5.518 ± 0.643
LeuLys: 7.613 ± 0.906
LeuLeu: 6.216 ± 0.651
LeuMet: 1.606 ± 0.379
LeuAsn: 6.077 ± 0.547
LeuPro: 2.864 ± 0.45
LeuGln: 2.375 ± 0.324
LeuArg: 4.68 ± 0.725
LeuSer: 8.661 ± 0.815
LeuThr: 5.518 ± 0.722
LeuVal: 4.68 ± 0.451
LeuTrp: 0.559 ± 0.184
LeuTyr: 2.934 ± 0.366
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.187 ± 0.384
MetCys: 0.279 ± 0.151
MetAsp: 1.118 ± 0.281
MetGlu: 1.746 ± 0.422
MetPhe: 1.118 ± 0.276
MetGly: 1.257 ± 0.311
MetHis: 0.559 ± 0.201
MetIle: 1.467 ± 0.433
MetLys: 3.423 ± 0.776
MetLeu: 1.886 ± 0.347
MetMet: 0.489 ± 0.243
MetAsn: 2.165 ± 0.432
MetPro: 0.838 ± 0.212
MetGln: 1.187 ± 0.366
MetArg: 0.559 ± 0.183
MetSer: 1.048 ± 0.233
MetThr: 1.397 ± 0.389
MetVal: 0.768 ± 0.211
MetTrp: 0.279 ± 0.141
MetTyr: 1.397 ± 0.32
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.702 ± 0.759
AsnCys: 0.559 ± 0.251
AsnAsp: 3.073 ± 0.424
AsnGlu: 5.937 ± 0.723
AsnPhe: 2.514 ± 0.451
AsnGly: 4.4 ± 0.597
AsnHis: 0.978 ± 0.264
AsnIle: 4.191 ± 0.472
AsnLys: 4.61 ± 0.68
AsnLeu: 4.051 ± 0.425
AsnMet: 1.537 ± 0.492
AsnAsn: 2.724 ± 0.371
AsnPro: 1.956 ± 0.349
AsnGln: 1.537 ± 0.334
AsnArg: 2.305 ± 0.408
AsnSer: 3.073 ± 0.601
AsnThr: 3.353 ± 0.683
AsnVal: 4.261 ± 0.705
AsnTrp: 0.489 ± 0.171
AsnTyr: 2.235 ± 0.409
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.606 ± 0.348
ProCys: 0.14 ± 0.114
ProAsp: 2.026 ± 0.425
ProGlu: 2.934 ± 0.464
ProPhe: 1.257 ± 0.307
ProGly: 1.606 ± 0.374
ProHis: 0.21 ± 0.115
ProIle: 2.095 ± 0.459
ProLys: 2.165 ± 0.472
ProLeu: 2.165 ± 0.456
ProMet: 0.419 ± 0.171
ProAsn: 1.327 ± 0.366
ProPro: 0.698 ± 0.285
ProGln: 1.048 ± 0.293
ProArg: 0.768 ± 0.277
ProSer: 1.816 ± 0.392
ProThr: 1.816 ± 0.382
ProVal: 1.886 ± 0.365
ProTrp: 0.07 ± 0.071
ProTyr: 1.606 ± 0.305
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.095 ± 0.497
GlnCys: 0.279 ± 0.132
GlnAsp: 1.467 ± 0.394
GlnGlu: 1.886 ± 0.353
GlnPhe: 2.165 ± 0.322
GlnGly: 1.327 ± 0.309
GlnHis: 0.279 ± 0.14
GlnIle: 2.514 ± 0.388
GlnLys: 3.981 ± 0.762
GlnLeu: 3.702 ± 0.362
GlnMet: 1.257 ± 0.287
GlnAsn: 1.676 ± 0.276
GlnPro: 1.118 ± 0.326
GlnGln: 1.956 ± 0.456
GlnArg: 1.816 ± 0.352
GlnSer: 1.816 ± 0.43
GlnThr: 2.095 ± 0.548
GlnVal: 1.886 ± 0.324
GlnTrp: 0.349 ± 0.146
GlnTyr: 1.397 ± 0.3
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.514 ± 0.468
ArgCys: 0.629 ± 0.213
ArgAsp: 2.934 ± 0.588
ArgGlu: 3.842 ± 0.595
ArgPhe: 2.584 ± 0.509
ArgGly: 2.445 ± 0.418
ArgHis: 0.279 ± 0.114
ArgIle: 4.121 ± 0.536
ArgLys: 4.819 ± 0.748
ArgLeu: 3.981 ± 0.59
ArgMet: 1.048 ± 0.269
ArgAsn: 2.165 ± 0.407
ArgPro: 0.908 ± 0.222
ArgGln: 1.467 ± 0.27
ArgArg: 1.886 ± 0.402
ArgSer: 2.934 ± 0.407
ArgThr: 2.095 ± 0.339
ArgVal: 3.073 ± 0.372
ArgTrp: 0.419 ± 0.193
ArgTyr: 2.235 ± 0.381
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.121 ± 0.7
SerCys: 0.978 ± 0.335
SerAsp: 4.331 ± 0.574
SerGlu: 5.658 ± 0.997
SerPhe: 2.584 ± 0.407
SerGly: 4.75 ± 1.062
SerHis: 0.978 ± 0.251
SerIle: 5.378 ± 0.585
SerLys: 5.099 ± 0.471
SerLeu: 5.518 ± 0.603
SerMet: 1.676 ± 0.34
SerAsn: 3.353 ± 0.542
SerPro: 2.305 ± 0.534
SerGln: 1.676 ± 0.405
SerArg: 2.305 ± 0.387
SerSer: 2.934 ± 0.44
SerThr: 3.213 ± 0.59
SerVal: 5.239 ± 0.722
SerTrp: 0.489 ± 0.183
SerTyr: 3.492 ± 0.474
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.772 ± 1.007
ThrCys: 0.698 ± 0.216
ThrAsp: 3.772 ± 0.514
ThrGlu: 4.191 ± 0.677
ThrPhe: 2.305 ± 0.398
ThrGly: 4.889 ± 0.746
ThrHis: 1.118 ± 0.279
ThrIle: 3.911 ± 0.54
ThrLys: 4.75 ± 0.588
ThrLeu: 3.423 ± 0.472
ThrMet: 0.978 ± 0.188
ThrAsn: 2.305 ± 0.418
ThrPro: 2.095 ± 0.476
ThrGln: 2.654 ± 0.396
ThrArg: 2.445 ± 0.4
ThrSer: 3.283 ± 0.547
ThrThr: 2.934 ± 0.449
ThrVal: 4.54 ± 0.686
ThrTrp: 0.629 ± 0.222
ThrTyr: 2.165 ± 0.48
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.375 ± 0.433
ValCys: 1.257 ± 0.29
ValAsp: 4.331 ± 0.887
ValGlu: 4.75 ± 0.667
ValPhe: 3.073 ± 0.633
ValGly: 3.353 ± 0.595
ValHis: 0.768 ± 0.221
ValIle: 4.051 ± 0.65
ValLys: 6.775 ± 0.62
ValLeu: 4.889 ± 0.503
ValMet: 1.187 ± 0.27
ValAsn: 3.073 ± 0.466
ValPro: 1.956 ± 0.375
ValGln: 1.746 ± 0.351
ValArg: 3.003 ± 0.437
ValSer: 5.029 ± 0.679
ValThr: 4.121 ± 0.49
ValVal: 3.562 ± 0.603
ValTrp: 0.489 ± 0.168
ValTyr: 3.702 ± 0.575
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.629 ± 0.199
TrpCys: 0.14 ± 0.104
TrpAsp: 0.419 ± 0.14
TrpGlu: 0.978 ± 0.333
TrpPhe: 0.629 ± 0.212
TrpGly: 0.419 ± 0.176
TrpHis: 0.14 ± 0.088
TrpIle: 0.768 ± 0.237
TrpLys: 0.908 ± 0.23
TrpLeu: 0.768 ± 0.194
TrpMet: 0.279 ± 0.147
TrpAsn: 0.559 ± 0.188
TrpPro: 0.0 ± 0.0
TrpGln: 0.279 ± 0.119
TrpArg: 0.489 ± 0.249
TrpSer: 0.629 ± 0.224
TrpThr: 0.349 ± 0.152
TrpVal: 0.559 ± 0.221
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.419 ± 0.147
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.794 ± 0.417
TyrCys: 0.908 ± 0.355
TyrAsp: 2.095 ± 0.367
TyrGlu: 2.724 ± 0.376
TyrPhe: 2.584 ± 0.363
TyrGly: 2.584 ± 0.351
TyrHis: 0.768 ± 0.293
TyrIle: 3.423 ± 0.54
TyrLys: 2.794 ± 0.475
TyrLeu: 3.911 ± 0.553
TyrMet: 1.397 ± 0.445
TyrAsn: 3.143 ± 0.394
TyrPro: 1.048 ± 0.25
TyrGln: 1.886 ± 0.282
TyrArg: 2.305 ± 0.486
TyrSer: 2.165 ± 0.308
TyrThr: 3.283 ± 0.705
TyrVal: 3.353 ± 0.62
TyrTrp: 0.489 ± 0.155
TyrTyr: 2.165 ± 0.37
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (14318 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
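A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread across replicates is taken as the error. The page only states that bootstrapping was done 100 times at the protein level; the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are all assumptions of this sketch.

```python
import random
import statistics
from collections import Counter


def pair_frequencies(proteins):
    """Fraction of each adjacent residue pair among all pairs (assumed normalization)."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()} if total else {}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of one pair's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_frequencies(sample).get(pair, 0.0))
    return statistics.stdev(values)


# Made-up toy sequences, not taken from the phage proteome.
proteins = ["MKKLLE", "MKEL", "AAAA"]
error = bootstrap_error(proteins, "MK")
print(f"MK bootstrap error: {error * 1000:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.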

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski