Amino acid dipeptide frequency for Mycobacterium phage Traft412

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteins.
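As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter codes, the toy sequences, and the choice to normalize by the total number of counted dipeptides are all assumptions made for illustration; the database may normalize differently.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille across all given proteins.

    Assumption: frequencies are normalized by the total number of dipeptides
    counted; this is an illustrative choice, not the database's documented one.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not taken from the proteome.
toy_proteins = ["MAAG", "MAG"]
freqs = dipeptide_frequencies(toy_proteins)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Because the order of residues matters, AlaGly and GlyAla are counted separately, which is why the table is not symmetric.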

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
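The uniform expectation and the per mille unit can be tied together in a short sketch. The names `classify` and `permille_to_fraction` are hypothetical, and the rule used here (a plain comparison against the uniform 2.5‰ expectation, ignoring the error estimate) is an assumption; the page does not state the exact criterion behind the red and blue marking.

```python
N_STANDARD = 20  # standard amino acids, giving 20 * 20 = 400 dipeptides

# Uniform expectation: 1/400 = 0.25 % = 2.5 per mille.
expected_permille = 1000.0 / (N_STANDARD ** 2)


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a simple greater/less comparison with no tolerance band.
    """
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"


# Two values copied from the table: AlaAla and CysCys.
print(expected_permille)           # 2.5
print(classify(12.468))            # over
print(classify(0.063))             # under
print(permille_to_fraction(12.468))
```

Under this rule AlaAla (12.468‰) is about five times more frequent than the uniform expectation, while CysCys (0.063‰) is far below it.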

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.468 ± 1.341
AlaCys: 0.633 ± 0.167
AlaAsp: 6.899 ± 0.683
AlaGlu: 6.013 ± 0.691
AlaPhe: 2.975 ± 0.482
AlaGly: 8.038 ± 0.9
AlaHis: 1.582 ± 0.36
AlaIle: 4.367 ± 0.577
AlaLys: 4.367 ± 0.607
AlaLeu: 9.177 ± 0.801
AlaMet: 2.532 ± 0.434
AlaAsn: 2.722 ± 0.411
AlaPro: 4.747 ± 0.755
AlaGln: 2.911 ± 0.42
AlaArg: 6.139 ± 0.576
AlaSer: 5.19 ± 0.603
AlaThr: 5.696 ± 0.708
AlaVal: 8.671 ± 0.882
AlaTrp: 1.962 ± 0.351
AlaTyr: 2.722 ± 0.462
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.823 ± 0.225
CysCys: 0.063 ± 0.064
CysAsp: 0.57 ± 0.189
CysGlu: 0.633 ± 0.207
CysPhe: 0.127 ± 0.092
CysGly: 0.506 ± 0.212
CysHis: 0.127 ± 0.093
CysIle: 0.443 ± 0.195
CysLys: 0.127 ± 0.084
CysLeu: 0.38 ± 0.196
CysMet: 0.063 ± 0.057
CysAsn: 0.253 ± 0.111
CysPro: 0.19 ± 0.101
CysGln: 0.19 ± 0.114
CysArg: 0.633 ± 0.181
CysSer: 0.316 ± 0.15
CysThr: 0.253 ± 0.118
CysVal: 0.19 ± 0.111
CysTrp: 0.316 ± 0.122
CysTyr: 0.253 ± 0.161
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.519 ± 0.614
AspCys: 0.443 ± 0.143
AspAsp: 4.684 ± 0.549
AspGlu: 3.481 ± 0.547
AspPhe: 2.595 ± 0.364
AspGly: 6.392 ± 0.574
AspHis: 1.139 ± 0.245
AspIle: 2.658 ± 0.454
AspLys: 2.722 ± 0.463
AspLeu: 6.772 ± 0.653
AspMet: 1.456 ± 0.278
AspAsn: 1.899 ± 0.362
AspPro: 5.127 ± 0.687
AspGln: 1.646 ± 0.327
AspArg: 3.544 ± 0.424
AspSer: 3.228 ± 0.474
AspThr: 4.114 ± 0.383
AspVal: 4.747 ± 0.591
AspTrp: 1.709 ± 0.339
AspTyr: 1.899 ± 0.364
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.139 ± 0.712
GluCys: 0.443 ± 0.227
GluAsp: 4.937 ± 0.585
GluGlu: 5.063 ± 0.621
GluPhe: 1.962 ± 0.376
GluGly: 3.987 ± 0.451
GluHis: 1.266 ± 0.3
GluIle: 3.291 ± 0.499
GluLys: 2.722 ± 0.455
GluLeu: 6.772 ± 0.607
GluMet: 1.582 ± 0.302
GluAsn: 1.582 ± 0.382
GluPro: 2.405 ± 0.385
GluGln: 2.722 ± 0.451
GluArg: 3.924 ± 0.551
GluSer: 3.354 ± 0.448
GluThr: 3.861 ± 0.499
GluVal: 5.38 ± 0.64
GluTrp: 1.582 ± 0.366
GluTyr: 2.468 ± 0.415
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.785 ± 0.502
PheCys: 0.316 ± 0.17
PheAsp: 2.785 ± 0.364
PheGlu: 2.278 ± 0.326
PhePhe: 0.633 ± 0.195
PheGly: 3.544 ± 0.495
PheHis: 0.633 ± 0.26
PheIle: 1.203 ± 0.266
PheLys: 1.392 ± 0.283
PheLeu: 2.405 ± 0.466
PheMet: 0.506 ± 0.159
PheAsn: 1.456 ± 0.338
PhePro: 1.709 ± 0.368
PheGln: 1.013 ± 0.263
PheArg: 1.962 ± 0.368
PheSer: 1.709 ± 0.298
PheThr: 2.278 ± 0.415
PheVal: 1.772 ± 0.394
PheTrp: 0.506 ± 0.141
PheTyr: 0.823 ± 0.213
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.785 ± 1.233
GlyCys: 0.696 ± 0.217
GlyAsp: 6.203 ± 0.538
GlyGlu: 4.62 ± 0.541
GlyPhe: 3.038 ± 0.45
GlyGly: 10.38 ± 2.903
GlyHis: 2.025 ± 0.378
GlyIle: 4.43 ± 0.782
GlyLys: 3.924 ± 0.534
GlyLeu: 7.911 ± 0.933
GlyMet: 1.962 ± 0.421
GlyAsn: 3.418 ± 0.402
GlyPro: 3.481 ± 0.575
GlyGln: 2.342 ± 0.355
GlyArg: 4.367 ± 0.585
GlySer: 6.203 ± 0.893
GlyThr: 5.19 ± 0.686
GlyVal: 5.127 ± 0.58
GlyTrp: 2.658 ± 0.385
GlyTyr: 2.722 ± 0.4
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.709 ± 0.392
HisCys: 0.19 ± 0.171
HisAsp: 1.266 ± 0.232
HisGlu: 1.392 ± 0.32
HisPhe: 0.759 ± 0.206
HisGly: 1.392 ± 0.335
HisHis: 0.633 ± 0.188
HisIle: 0.886 ± 0.208
HisLys: 1.203 ± 0.342
HisLeu: 1.203 ± 0.287
HisMet: 0.127 ± 0.087
HisAsn: 0.316 ± 0.141
HisPro: 1.519 ± 0.308
HisGln: 0.886 ± 0.227
HisArg: 1.456 ± 0.284
HisSer: 0.506 ± 0.179
HisThr: 1.392 ± 0.315
HisVal: 1.899 ± 0.346
HisTrp: 0.443 ± 0.153
HisTyr: 0.696 ± 0.221
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.949 ± 0.697
IleCys: 0.316 ± 0.132
IleAsp: 3.165 ± 0.415
IleGlu: 3.608 ± 0.504
IlePhe: 0.949 ± 0.234
IleGly: 4.051 ± 0.603
IleHis: 0.949 ± 0.241
IleIle: 1.772 ± 0.282
IleLys: 1.772 ± 0.329
IleLeu: 3.354 ± 0.44
IleMet: 0.949 ± 0.226
IleAsn: 1.646 ± 0.314
IlePro: 3.228 ± 0.346
IleGln: 1.139 ± 0.279
IleArg: 3.165 ± 0.459
IleSer: 3.481 ± 0.515
IleThr: 3.987 ± 0.524
IleVal: 2.911 ± 0.56
IleTrp: 0.696 ± 0.196
IleTyr: 1.392 ± 0.29
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.608 ± 0.638
LysCys: 0.253 ± 0.126
LysAsp: 2.722 ± 0.45
LysGlu: 2.342 ± 0.409
LysPhe: 1.519 ± 0.31
LysGly: 2.911 ± 0.355
LysHis: 1.519 ± 0.366
LysIle: 2.532 ± 0.456
LysLys: 2.215 ± 0.499
LysLeu: 3.544 ± 0.442
LysMet: 1.076 ± 0.256
LysAsn: 1.456 ± 0.262
LysPro: 2.468 ± 0.467
LysGln: 1.709 ± 0.418
LysArg: 3.038 ± 0.541
LysSer: 2.532 ± 0.443
LysThr: 2.595 ± 0.383
LysVal: 3.291 ± 0.439
LysTrp: 0.633 ± 0.223
LysTyr: 0.759 ± 0.234
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.987 ± 0.788
LeuCys: 0.443 ± 0.195
LeuAsp: 6.329 ± 0.624
LeuGlu: 5.886 ± 0.567
LeuPhe: 2.278 ± 0.362
LeuGly: 7.595 ± 0.611
LeuHis: 1.456 ± 0.32
LeuIle: 4.684 ± 0.449
LeuLys: 4.114 ± 0.59
LeuLeu: 5.886 ± 0.537
LeuMet: 1.646 ± 0.307
LeuAsn: 2.848 ± 0.408
LeuPro: 5.57 ± 0.59
LeuGln: 2.532 ± 0.466
LeuArg: 5.886 ± 0.575
LeuSer: 5.759 ± 0.52
LeuThr: 5.759 ± 0.495
LeuVal: 4.557 ± 0.657
LeuTrp: 1.329 ± 0.388
LeuTyr: 2.089 ± 0.433
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.785 ± 0.421
MetCys: 0.0 ± 0.0
MetAsp: 1.076 ± 0.276
MetGlu: 1.456 ± 0.329
MetPhe: 0.57 ± 0.168
MetGly: 1.392 ± 0.315
MetHis: 0.253 ± 0.12
MetIle: 0.57 ± 0.181
MetLys: 1.076 ± 0.238
MetLeu: 1.139 ± 0.232
MetMet: 0.063 ± 0.061
MetAsn: 1.013 ± 0.207
MetPro: 1.013 ± 0.233
MetGln: 0.57 ± 0.175
MetArg: 1.203 ± 0.288
MetSer: 1.835 ± 0.374
MetThr: 2.025 ± 0.307
MetVal: 1.139 ± 0.257
MetTrp: 0.253 ± 0.118
MetTyr: 0.57 ± 0.24
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.038 ± 0.485
AsnCys: 0.127 ± 0.092
AsnAsp: 2.089 ± 0.445
AsnGlu: 1.899 ± 0.383
AsnPhe: 0.886 ± 0.288
AsnGly: 3.418 ± 0.523
AsnHis: 0.633 ± 0.182
AsnIle: 1.646 ± 0.371
AsnLys: 0.823 ± 0.336
AsnLeu: 2.342 ± 0.333
AsnMet: 0.506 ± 0.176
AsnAsn: 0.886 ± 0.201
AsnPro: 2.785 ± 0.401
AsnGln: 0.886 ± 0.261
AsnArg: 1.582 ± 0.354
AsnSer: 1.962 ± 0.395
AsnThr: 2.025 ± 0.286
AsnVal: 2.595 ± 0.459
AsnTrp: 0.886 ± 0.225
AsnTyr: 1.266 ± 0.285
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.937 ± 0.503
ProCys: 0.316 ± 0.143
ProAsp: 4.304 ± 0.506
ProGlu: 4.114 ± 0.492
ProPhe: 2.342 ± 0.346
ProGly: 5.316 ± 0.598
ProHis: 0.823 ± 0.224
ProIle: 2.215 ± 0.445
ProLys: 2.405 ± 0.342
ProLeu: 4.304 ± 0.589
ProMet: 0.886 ± 0.222
ProAsn: 1.519 ± 0.357
ProPro: 2.595 ± 0.443
ProGln: 1.456 ± 0.38
ProArg: 2.848 ± 0.547
ProSer: 3.861 ± 0.435
ProThr: 3.797 ± 0.577
ProVal: 4.241 ± 0.511
ProTrp: 0.949 ± 0.317
ProTyr: 1.456 ± 0.359
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.101 ± 0.454
GlnCys: 0.063 ± 0.051
GlnAsp: 1.392 ± 0.346
GlnGlu: 1.519 ± 0.305
GlnPhe: 1.266 ± 0.264
GlnGly: 2.468 ± 0.348
GlnHis: 0.506 ± 0.15
GlnIle: 2.785 ± 0.535
GlnLys: 1.329 ± 0.336
GlnLeu: 3.734 ± 0.494
GlnMet: 0.949 ± 0.221
GlnAsn: 0.443 ± 0.154
GlnPro: 1.899 ± 0.354
GlnGln: 1.899 ± 0.393
GlnArg: 1.646 ± 0.316
GlnSer: 1.646 ± 0.31
GlnThr: 1.772 ± 0.342
GlnVal: 2.405 ± 0.285
GlnTrp: 0.57 ± 0.177
GlnTyr: 0.443 ± 0.169
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.57 ± 0.761
ArgCys: 0.696 ± 0.23
ArgAsp: 2.911 ± 0.383
ArgGlu: 4.494 ± 0.569
ArgPhe: 1.962 ± 0.359
ArgGly: 5.063 ± 0.584
ArgHis: 1.076 ± 0.264
ArgIle: 3.165 ± 0.518
ArgLys: 3.228 ± 0.52
ArgLeu: 5.633 ± 0.686
ArgMet: 1.899 ± 0.352
ArgAsn: 2.215 ± 0.477
ArgPro: 2.405 ± 0.387
ArgGln: 1.709 ± 0.341
ArgArg: 5.19 ± 0.677
ArgSer: 3.228 ± 0.527
ArgThr: 3.165 ± 0.486
ArgVal: 5.127 ± 0.57
ArgTrp: 1.139 ± 0.258
ArgTyr: 1.772 ± 0.306
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.962 ± 0.785
SerCys: 0.443 ± 0.172
SerAsp: 3.228 ± 0.391
SerGlu: 3.861 ± 0.538
SerPhe: 2.025 ± 0.43
SerGly: 5.886 ± 0.799
SerHis: 1.456 ± 0.31
SerIle: 2.405 ± 0.406
SerLys: 2.089 ± 0.362
SerLeu: 4.81 ± 0.53
SerMet: 1.266 ± 0.249
SerAsn: 2.532 ± 0.405
SerPro: 3.165 ± 0.503
SerGln: 2.152 ± 0.285
SerArg: 2.785 ± 0.362
SerSer: 3.671 ± 0.741
SerThr: 3.354 ± 0.471
SerVal: 3.924 ± 0.543
SerTrp: 1.392 ± 0.329
SerTyr: 1.519 ± 0.308
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.886 ± 0.651
ThrCys: 0.316 ± 0.136
ThrAsp: 4.43 ± 0.552
ThrGlu: 4.241 ± 0.514
ThrPhe: 2.152 ± 0.379
ThrGly: 6.962 ± 0.702
ThrHis: 1.329 ± 0.316
ThrIle: 2.468 ± 0.513
ThrLys: 2.595 ± 0.371
ThrLeu: 6.139 ± 0.723
ThrMet: 0.696 ± 0.202
ThrAsn: 1.582 ± 0.361
ThrPro: 4.241 ± 0.574
ThrGln: 1.772 ± 0.292
ThrArg: 3.608 ± 0.564
ThrSer: 3.418 ± 0.556
ThrThr: 3.797 ± 0.549
ThrVal: 5.316 ± 0.72
ThrTrp: 0.886 ± 0.247
ThrTyr: 2.342 ± 0.37
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.152 ± 0.766
ValCys: 0.316 ± 0.129
ValAsp: 5.19 ± 0.615
ValGlu: 5.127 ± 0.526
ValPhe: 2.215 ± 0.37
ValGly: 4.937 ± 0.759
ValHis: 1.392 ± 0.245
ValIle: 4.177 ± 0.473
ValLys: 2.975 ± 0.406
ValLeu: 5.443 ± 0.607
ValMet: 0.886 ± 0.301
ValAsn: 2.785 ± 0.333
ValPro: 4.241 ± 0.562
ValGln: 2.215 ± 0.467
ValArg: 4.873 ± 0.726
ValSer: 4.557 ± 0.479
ValThr: 5.57 ± 0.641
ValVal: 4.747 ± 0.673
ValTrp: 1.329 ± 0.287
ValTyr: 2.342 ± 0.384
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.519 ± 0.334
TrpCys: 0.127 ± 0.073
TrpAsp: 1.519 ± 0.318
TrpGlu: 1.139 ± 0.229
TrpPhe: 0.949 ± 0.233
TrpGly: 1.772 ± 0.303
TrpHis: 0.443 ± 0.176
TrpIle: 1.203 ± 0.247
TrpLys: 0.38 ± 0.191
TrpLeu: 1.772 ± 0.292
TrpMet: 0.443 ± 0.191
TrpAsn: 0.506 ± 0.154
TrpPro: 0.759 ± 0.258
TrpGln: 0.886 ± 0.216
TrpArg: 1.076 ± 0.27
TrpSer: 0.949 ± 0.234
TrpThr: 1.709 ± 0.369
TrpVal: 2.152 ± 0.388
TrpTrp: 0.57 ± 0.225
TrpTyr: 0.316 ± 0.112
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.089 ± 0.402
TyrCys: 0.253 ± 0.142
TyrAsp: 1.076 ± 0.264
TyrGlu: 2.278 ± 0.33
TyrPhe: 0.506 ± 0.157
TyrGly: 2.532 ± 0.428
TyrHis: 0.633 ± 0.216
TyrIle: 1.582 ± 0.365
TyrLys: 1.266 ± 0.333
TyrLeu: 2.911 ± 0.473
TyrMet: 0.443 ± 0.152
TyrAsn: 1.203 ± 0.312
TyrPro: 1.139 ± 0.257
TyrGln: 1.203 ± 0.282
TyrArg: 2.722 ± 0.419
TyrSer: 1.519 ± 0.299
TyrThr: 1.835 ± 0.359
TyrVal: 2.278 ± 0.376
TyrTrp: 0.38 ± 0.137
TyrTyr: 0.57 ± 0.213
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (15801 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
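One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is taken as the error. This is only an illustration under stated assumptions. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are hypothetical and are not confirmed by the page.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation across replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not taken from the proteome.
toy_proteins = ["MAAG", "MAGA", "MGGA", "MAAA"]
error = bootstrap_error(toy_proteins, "AA")
print(round(dipeptide_permille(toy_proteins, "AA"), 3), "±", round(error, 3))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.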

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski