Amino acid dipepetide frequency for Mycobacterium phage Beatrix

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.441AlaAla: 12.441 ± 1.087
0.801AlaCys: 0.801 ± 0.212
6.406AlaAsp: 6.406 ± 0.579
5.913AlaGlu: 5.913 ± 0.726
3.018AlaPhe: 3.018 ± 0.472
8.192AlaGly: 8.192 ± 0.968
1.478AlaHis: 1.478 ± 0.328
4.127AlaIle: 4.127 ± 0.559
4.373AlaLys: 4.373 ± 0.576
9.67AlaLeu: 9.67 ± 0.858
2.648AlaMet: 2.648 ± 0.435
2.34AlaAsn: 2.34 ± 0.33
4.558AlaPro: 4.558 ± 0.679
3.141AlaGln: 3.141 ± 0.489
6.221AlaArg: 6.221 ± 0.568
4.866AlaSer: 4.866 ± 0.476
5.974AlaThr: 5.974 ± 0.649
8.13AlaVal: 8.13 ± 0.721
1.909AlaTrp: 1.909 ± 0.376
3.388AlaTyr: 3.388 ± 0.355
0.0AlaXaa: 0.0 ± 0.0
Cys
0.862CysAla: 0.862 ± 0.188
0.0CysCys: 0.0 ± 0.0
0.493CysAsp: 0.493 ± 0.201
0.739CysGlu: 0.739 ± 0.184
0.123CysPhe: 0.123 ± 0.078
0.801CysGly: 0.801 ± 0.212
0.185CysHis: 0.185 ± 0.103
0.37CysIle: 0.37 ± 0.153
0.37CysLys: 0.37 ± 0.176
0.431CysLeu: 0.431 ± 0.209
0.062CysMet: 0.062 ± 0.063
0.493CysAsn: 0.493 ± 0.173
0.37CysPro: 0.37 ± 0.135
0.185CysGln: 0.185 ± 0.09
0.554CysArg: 0.554 ± 0.189
0.431CysSer: 0.431 ± 0.14
0.37CysThr: 0.37 ± 0.13
0.185CysVal: 0.185 ± 0.099
0.185CysTrp: 0.185 ± 0.108
0.062CysTyr: 0.062 ± 0.053
0.0CysXaa: 0.0 ± 0.0
Asp
5.851AspAla: 5.851 ± 0.549
0.554AspCys: 0.554 ± 0.183
4.989AspAsp: 4.989 ± 0.59
3.449AspGlu: 3.449 ± 0.466
2.217AspPhe: 2.217 ± 0.287
6.159AspGly: 6.159 ± 0.686
1.232AspHis: 1.232 ± 0.246
2.895AspIle: 2.895 ± 0.326
2.833AspLys: 2.833 ± 0.494
7.083AspLeu: 7.083 ± 0.777
0.924AspMet: 0.924 ± 0.176
1.971AspAsn: 1.971 ± 0.31
4.496AspPro: 4.496 ± 0.547
1.601AspGln: 1.601 ± 0.326
3.449AspArg: 3.449 ± 0.464
3.264AspSer: 3.264 ± 0.55
3.88AspThr: 3.88 ± 0.383
4.681AspVal: 4.681 ± 0.579
1.786AspTrp: 1.786 ± 0.312
1.909AspTyr: 1.909 ± 0.299
0.0AspXaa: 0.0 ± 0.0
Glu
6.159GluAla: 6.159 ± 0.745
0.185GluCys: 0.185 ± 0.124
4.804GluAsp: 4.804 ± 0.409
5.051GluGlu: 5.051 ± 0.569
2.34GluPhe: 2.34 ± 0.398
4.065GluGly: 4.065 ± 0.455
1.54GluHis: 1.54 ± 0.357
3.449GluIle: 3.449 ± 0.436
2.464GluLys: 2.464 ± 0.396
7.206GluLeu: 7.206 ± 0.61
1.478GluMet: 1.478 ± 0.276
1.848GluAsn: 1.848 ± 0.393
3.203GluPro: 3.203 ± 0.511
2.525GluGln: 2.525 ± 0.484
3.634GluArg: 3.634 ± 0.498
3.572GluSer: 3.572 ± 0.446
3.757GluThr: 3.757 ± 0.519
5.051GluVal: 5.051 ± 0.64
1.601GluTrp: 1.601 ± 0.307
2.402GluTyr: 2.402 ± 0.433
0.0GluXaa: 0.0 ± 0.0
Phe
2.34PheAla: 2.34 ± 0.298
0.37PheCys: 0.37 ± 0.146
2.833PheAsp: 2.833 ± 0.354
2.71PheGlu: 2.71 ± 0.339
0.554PhePhe: 0.554 ± 0.181
3.388PheGly: 3.388 ± 0.475
0.862PheHis: 0.862 ± 0.295
1.848PheIle: 1.848 ± 0.3
1.417PheLys: 1.417 ± 0.359
2.34PheLeu: 2.34 ± 0.421
0.678PheMet: 0.678 ± 0.235
1.478PheAsn: 1.478 ± 0.29
1.355PhePro: 1.355 ± 0.267
1.109PheGln: 1.109 ± 0.228
1.971PheArg: 1.971 ± 0.365
1.725PheSer: 1.725 ± 0.321
1.786PheThr: 1.786 ± 0.315
1.909PheVal: 1.909 ± 0.349
0.678PheTrp: 0.678 ± 0.224
0.924PheTyr: 0.924 ± 0.252
0.0PheXaa: 0.0 ± 0.0
Gly
6.898GlyAla: 6.898 ± 0.888
0.801GlyCys: 0.801 ± 0.225
5.235GlyAsp: 5.235 ± 0.577
5.297GlyGlu: 5.297 ± 0.55
3.203GlyPhe: 3.203 ± 0.47
9.239GlyGly: 9.239 ± 2.243
1.971GlyHis: 1.971 ± 0.321
4.496GlyIle: 4.496 ± 0.604
3.572GlyLys: 3.572 ± 0.477
7.453GlyLeu: 7.453 ± 0.9
1.786GlyMet: 1.786 ± 0.313
3.264GlyAsn: 3.264 ± 0.467
3.88GlyPro: 3.88 ± 0.54
2.587GlyGln: 2.587 ± 0.38
4.804GlyArg: 4.804 ± 0.54
6.159GlySer: 6.159 ± 0.921
5.666GlyThr: 5.666 ± 0.738
5.543GlyVal: 5.543 ± 0.588
2.34GlyTrp: 2.34 ± 0.403
2.956GlyTyr: 2.956 ± 0.463
0.0GlyXaa: 0.0 ± 0.0
His
1.725HisAla: 1.725 ± 0.376
0.123HisCys: 0.123 ± 0.086
0.985HisAsp: 0.985 ± 0.209
1.293HisGlu: 1.293 ± 0.276
0.739HisPhe: 0.739 ± 0.189
2.217HisGly: 2.217 ± 0.438
0.678HisHis: 0.678 ± 0.209
1.232HisIle: 1.232 ± 0.288
1.047HisLys: 1.047 ± 0.297
1.54HisLeu: 1.54 ± 0.277
0.123HisMet: 0.123 ± 0.087
0.308HisAsn: 0.308 ± 0.131
1.293HisPro: 1.293 ± 0.271
1.17HisGln: 1.17 ± 0.294
1.601HisArg: 1.601 ± 0.386
0.493HisSer: 0.493 ± 0.144
0.801HisThr: 0.801 ± 0.23
1.786HisVal: 1.786 ± 0.333
0.37HisTrp: 0.37 ± 0.14
0.678HisTyr: 0.678 ± 0.226
0.0HisXaa: 0.0 ± 0.0
Ile
6.406IleAla: 6.406 ± 0.768
0.37IleCys: 0.37 ± 0.141
3.88IleAsp: 3.88 ± 0.417
3.572IleGlu: 3.572 ± 0.476
0.739IlePhe: 0.739 ± 0.213
4.188IleGly: 4.188 ± 0.498
0.801IleHis: 0.801 ± 0.204
1.786IleIle: 1.786 ± 0.352
1.848IleLys: 1.848 ± 0.319
2.956IleLeu: 2.956 ± 0.327
0.924IleMet: 0.924 ± 0.197
1.663IleAsn: 1.663 ± 0.327
3.264IlePro: 3.264 ± 0.376
1.293IleGln: 1.293 ± 0.284
3.449IleArg: 3.449 ± 0.492
3.203IleSer: 3.203 ± 0.526
3.203IleThr: 3.203 ± 0.398
2.833IleVal: 2.833 ± 0.497
0.678IleTrp: 0.678 ± 0.188
1.417IleTyr: 1.417 ± 0.282
0.0IleXaa: 0.0 ± 0.0
Lys
3.141LysAla: 3.141 ± 0.479
0.308LysCys: 0.308 ± 0.121
2.587LysAsp: 2.587 ± 0.373
2.402LysGlu: 2.402 ± 0.417
1.54LysPhe: 1.54 ± 0.289
2.648LysGly: 2.648 ± 0.414
0.985LysHis: 0.985 ± 0.259
2.895LysIle: 2.895 ± 0.516
1.909LysLys: 1.909 ± 0.419
3.08LysLeu: 3.08 ± 0.415
0.862LysMet: 0.862 ± 0.206
1.663LysAsn: 1.663 ± 0.292
2.772LysPro: 2.772 ± 0.459
1.232LysGln: 1.232 ± 0.336
3.264LysArg: 3.264 ± 0.484
2.587LysSer: 2.587 ± 0.422
2.648LysThr: 2.648 ± 0.411
3.326LysVal: 3.326 ± 0.534
0.801LysTrp: 0.801 ± 0.228
0.862LysTyr: 0.862 ± 0.242
0.0LysXaa: 0.0 ± 0.0
Leu
9.547LeuAla: 9.547 ± 0.813
0.431LeuCys: 0.431 ± 0.123
6.159LeuAsp: 6.159 ± 0.589
5.666LeuGlu: 5.666 ± 0.641
2.34LeuPhe: 2.34 ± 0.359
7.453LeuGly: 7.453 ± 0.775
1.417LeuHis: 1.417 ± 0.347
4.188LeuIle: 4.188 ± 0.519
3.695LeuLys: 3.695 ± 0.456
5.297LeuLeu: 5.297 ± 0.584
1.848LeuMet: 1.848 ± 0.315
2.895LeuAsn: 2.895 ± 0.455
5.79LeuPro: 5.79 ± 0.543
2.402LeuGln: 2.402 ± 0.514
5.605LeuArg: 5.605 ± 0.547
6.036LeuSer: 6.036 ± 0.631
6.159LeuThr: 6.159 ± 0.485
4.558LeuVal: 4.558 ± 0.56
1.047LeuTrp: 1.047 ± 0.246
2.34LeuTyr: 2.34 ± 0.419
0.0LeuXaa: 0.0 ± 0.0
Met
2.648MetAla: 2.648 ± 0.382
0.062MetCys: 0.062 ± 0.056
1.047MetAsp: 1.047 ± 0.233
1.478MetGlu: 1.478 ± 0.317
0.739MetPhe: 0.739 ± 0.202
1.293MetGly: 1.293 ± 0.268
0.678MetHis: 0.678 ± 0.232
0.431MetIle: 0.431 ± 0.166
1.109MetLys: 1.109 ± 0.235
1.54MetLeu: 1.54 ± 0.275
0.062MetMet: 0.062 ± 0.062
1.54MetAsn: 1.54 ± 0.263
0.924MetPro: 0.924 ± 0.23
0.678MetGln: 0.678 ± 0.183
1.478MetArg: 1.478 ± 0.306
1.601MetSer: 1.601 ± 0.32
2.033MetThr: 2.033 ± 0.298
0.924MetVal: 0.924 ± 0.248
0.185MetTrp: 0.185 ± 0.082
0.37MetTyr: 0.37 ± 0.155
0.0MetXaa: 0.0 ± 0.0
Asn
3.388AsnAla: 3.388 ± 0.531
0.123AsnCys: 0.123 ± 0.086
2.094AsnAsp: 2.094 ± 0.361
1.663AsnGlu: 1.663 ± 0.4
1.109AsnPhe: 1.109 ± 0.318
3.388AsnGly: 3.388 ± 0.523
0.493AsnHis: 0.493 ± 0.149
1.601AsnIle: 1.601 ± 0.354
1.047AsnLys: 1.047 ± 0.282
2.833AsnLeu: 2.833 ± 0.443
0.678AsnMet: 0.678 ± 0.164
0.616AsnAsn: 0.616 ± 0.159
2.895AsnPro: 2.895 ± 0.373
1.109AsnGln: 1.109 ± 0.256
1.478AsnArg: 1.478 ± 0.318
1.786AsnSer: 1.786 ± 0.384
2.033AsnThr: 2.033 ± 0.366
2.34AsnVal: 2.34 ± 0.419
0.862AsnTrp: 0.862 ± 0.214
1.355AsnTyr: 1.355 ± 0.356
0.0AsnXaa: 0.0 ± 0.0
Pro
6.036ProAla: 6.036 ± 0.573
0.554ProCys: 0.554 ± 0.192
4.127ProAsp: 4.127 ± 0.559
4.003ProGlu: 4.003 ± 0.551
2.094ProPhe: 2.094 ± 0.386
4.804ProGly: 4.804 ± 0.597
0.985ProHis: 0.985 ± 0.236
2.217ProIle: 2.217 ± 0.39
1.971ProLys: 1.971 ± 0.327
4.619ProLeu: 4.619 ± 0.549
1.109ProMet: 1.109 ± 0.294
1.663ProAsn: 1.663 ± 0.299
2.833ProPro: 2.833 ± 0.376
1.848ProGln: 1.848 ± 0.322
2.217ProArg: 2.217 ± 0.422
3.572ProSer: 3.572 ± 0.46
3.88ProThr: 3.88 ± 0.545
4.311ProVal: 4.311 ± 0.567
0.862ProTrp: 0.862 ± 0.312
1.601ProTyr: 1.601 ± 0.311
0.0ProXaa: 0.0 ± 0.0
Gln
3.449GlnAla: 3.449 ± 0.838
0.062GlnCys: 0.062 ± 0.07
1.478GlnAsp: 1.478 ± 0.371
1.786GlnGlu: 1.786 ± 0.329
1.109GlnPhe: 1.109 ± 0.234
2.402GlnGly: 2.402 ± 0.331
0.554GlnHis: 0.554 ± 0.182
2.464GlnIle: 2.464 ± 0.423
1.17GlnLys: 1.17 ± 0.355
3.572GlnLeu: 3.572 ± 0.5
0.739GlnMet: 0.739 ± 0.245
0.678GlnAsn: 0.678 ± 0.165
2.156GlnPro: 2.156 ± 0.362
1.601GlnGln: 1.601 ± 0.336
1.909GlnArg: 1.909 ± 0.373
1.786GlnSer: 1.786 ± 0.309
1.663GlnThr: 1.663 ± 0.293
2.34GlnVal: 2.34 ± 0.35
0.801GlnTrp: 0.801 ± 0.17
0.554GlnTyr: 0.554 ± 0.163
0.0GlnXaa: 0.0 ± 0.0
Arg
6.159ArgAla: 6.159 ± 0.546
0.801ArgCys: 0.801 ± 0.197
2.648ArgAsp: 2.648 ± 0.431
4.866ArgGlu: 4.866 ± 0.626
1.909ArgPhe: 1.909 ± 0.41
5.112ArgGly: 5.112 ± 0.727
1.293ArgHis: 1.293 ± 0.299
3.018ArgIle: 3.018 ± 0.419
3.08ArgLys: 3.08 ± 0.55
5.297ArgLeu: 5.297 ± 0.721
2.156ArgMet: 2.156 ± 0.383
2.34ArgAsn: 2.34 ± 0.476
2.217ArgPro: 2.217 ± 0.351
1.909ArgGln: 1.909 ± 0.356
5.543ArgArg: 5.543 ± 0.628
3.819ArgSer: 3.819 ± 0.453
2.648ArgThr: 2.648 ± 0.526
5.112ArgVal: 5.112 ± 0.582
1.293ArgTrp: 1.293 ± 0.256
1.54ArgTyr: 1.54 ± 0.28
0.0ArgXaa: 0.0 ± 0.0
Ser
6.036SerAla: 6.036 ± 0.646
0.431SerCys: 0.431 ± 0.154
3.449SerAsp: 3.449 ± 0.383
3.511SerGlu: 3.511 ± 0.529
2.033SerPhe: 2.033 ± 0.36
6.898SerGly: 6.898 ± 0.995
1.293SerHis: 1.293 ± 0.291
2.402SerIle: 2.402 ± 0.394
2.156SerLys: 2.156 ± 0.377
5.112SerLeu: 5.112 ± 0.777
1.293SerMet: 1.293 ± 0.241
2.217SerAsn: 2.217 ± 0.425
2.772SerPro: 2.772 ± 0.445
1.725SerGln: 1.725 ± 0.262
2.956SerArg: 2.956 ± 0.444
3.264SerSer: 3.264 ± 0.639
3.264SerThr: 3.264 ± 0.506
4.188SerVal: 4.188 ± 0.593
1.293SerTrp: 1.293 ± 0.285
1.417SerTyr: 1.417 ± 0.313
0.0SerXaa: 0.0 ± 0.0
Thr
5.79ThrAla: 5.79 ± 0.661
0.37ThrCys: 0.37 ± 0.18
4.311ThrAsp: 4.311 ± 0.575
4.743ThrGlu: 4.743 ± 0.542
2.094ThrPhe: 2.094 ± 0.371
6.036ThrGly: 6.036 ± 0.504
1.17ThrHis: 1.17 ± 0.279
2.587ThrIle: 2.587 ± 0.498
2.648ThrLys: 2.648 ± 0.352
5.358ThrLeu: 5.358 ± 0.57
1.109ThrMet: 1.109 ± 0.216
1.848ThrAsn: 1.848 ± 0.381
3.942ThrPro: 3.942 ± 0.47
1.971ThrGln: 1.971 ± 0.371
3.511ThrArg: 3.511 ± 0.531
3.08ThrSer: 3.08 ± 0.41
4.804ThrThr: 4.804 ± 0.726
5.974ThrVal: 5.974 ± 0.667
1.109ThrTrp: 1.109 ± 0.278
1.848ThrTyr: 1.848 ± 0.403
0.0ThrXaa: 0.0 ± 0.0
Val
6.837ValAla: 6.837 ± 0.687
0.493ValCys: 0.493 ± 0.184
5.112ValAsp: 5.112 ± 0.602
4.804ValGlu: 4.804 ± 0.582
2.525ValPhe: 2.525 ± 0.377
4.25ValGly: 4.25 ± 0.639
1.54ValHis: 1.54 ± 0.27
3.572ValIle: 3.572 ± 0.453
3.264ValLys: 3.264 ± 0.483
5.358ValLeu: 5.358 ± 0.62
1.293ValMet: 1.293 ± 0.325
2.34ValAsn: 2.34 ± 0.334
4.558ValPro: 4.558 ± 0.47
2.217ValGln: 2.217 ± 0.357
5.297ValArg: 5.297 ± 0.649
4.311ValSer: 4.311 ± 0.553
5.913ValThr: 5.913 ± 0.573
4.989ValVal: 4.989 ± 0.722
1.109ValTrp: 1.109 ± 0.265
2.464ValTyr: 2.464 ± 0.387
0.0ValXaa: 0.0 ± 0.0
Trp
1.293TrpAla: 1.293 ± 0.294
0.185TrpCys: 0.185 ± 0.119
1.417TrpAsp: 1.417 ± 0.258
1.109TrpGlu: 1.109 ± 0.259
1.17TrpPhe: 1.17 ± 0.271
1.725TrpGly: 1.725 ± 0.326
0.493TrpHis: 0.493 ± 0.176
1.293TrpIle: 1.293 ± 0.295
0.246TrpLys: 0.246 ± 0.103
1.971TrpLeu: 1.971 ± 0.342
0.431TrpMet: 0.431 ± 0.175
0.493TrpAsn: 0.493 ± 0.15
0.739TrpPro: 0.739 ± 0.234
0.678TrpGln: 0.678 ± 0.185
1.355TrpArg: 1.355 ± 0.324
0.801TrpSer: 0.801 ± 0.225
1.663TrpThr: 1.663 ± 0.4
1.909TrpVal: 1.909 ± 0.299
0.554TrpTrp: 0.554 ± 0.211
0.308TrpTyr: 0.308 ± 0.137
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.587TyrAla: 2.587 ± 0.409
0.246TyrCys: 0.246 ± 0.172
1.17TyrAsp: 1.17 ± 0.283
2.279TyrGlu: 2.279 ± 0.341
0.678TyrPhe: 0.678 ± 0.169
2.895TyrGly: 2.895 ± 0.466
0.554TyrHis: 0.554 ± 0.194
1.601TyrIle: 1.601 ± 0.375
1.293TyrLys: 1.293 ± 0.227
2.156TyrLeu: 2.156 ± 0.4
0.554TyrMet: 0.554 ± 0.16
1.232TyrAsn: 1.232 ± 0.24
1.293TyrPro: 1.293 ± 0.284
1.293TyrGln: 1.293 ± 0.288
2.525TyrArg: 2.525 ± 0.369
1.355TyrSer: 1.355 ± 0.265
2.156TyrThr: 2.156 ± 0.386
2.156TyrVal: 2.156 ± 0.385
0.37TyrTrp: 0.37 ± 0.159
0.554TyrTyr: 0.554 ± 0.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 90 proteins (16237 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski