Amino acid dipeptide frequency for Mycobacterium phage TipsytheTRex

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
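The calculation described above can be sketched as follows. This is a minimal illustration and not the code used to build the table: it assumes that overlapping pairs are counted within each protein, that any non-standard letter is mapped to X (Xaa), and that counts are normalized by the total number of pairs. The function names and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # one-letter codes of the 20 standard amino acids
UNIFORM_PERMILLE = 1000.0 / 400     # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MAAG", "AAL"]
freqs = dipeptide_permille(toy)
for dp, f in sorted(freqs.items()):
    print(dp, round(f, 3), classify(f))
```

With the table values, `classify(10.102)` labels AlaAla as overrepresented and `classify(0.801)` labels AlaCys as underrepresented relative to the uniform 2.5‰ baseline. The exact threshold used for the red and blue marking on the original page is not stated, so the uniform baseline here is an assumption.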

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
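As a small worked example of the unit conversion, the snippet below converts the AlaAla value from the table (10.102‰) to a fraction and a percentage, and compares it with the uniform expectation of 1/400. The helper name is hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala_permille = 10.102                    # AlaAla value from the table
fraction = permille_to_fraction(ala_ala_permille)
percent = fraction * 100.0
fold = fraction / (1.0 / 400)                # ratio to the uniform expectation

print(f"fraction={fraction:.6f} percent={percent:.4f} fold={fold:.2f}")
```

So AlaAla occurs roughly four times more often than it would if all 400 dipeptides were equally likely.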

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.102 ± 0.879
AlaCys: 0.801 ± 0.242
AlaAsp: 5.113 ± 0.652
AlaGlu: 7.638 ± 0.865
AlaPhe: 3.388 ± 0.494
AlaGly: 8.316 ± 1.0
AlaHis: 1.602 ± 0.335
AlaIle: 4.805 ± 0.547
AlaLys: 4.99 ± 0.555
AlaLeu: 8.685 ± 0.942
AlaMet: 2.649 ± 0.433
AlaAsn: 3.45 ± 0.592
AlaPro: 4.62 ± 0.57
AlaGln: 3.758 ± 0.533
AlaArg: 4.743 ± 0.553
AlaSer: 4.743 ± 0.489
AlaThr: 5.359 ± 0.615
AlaVal: 7.207 ± 0.708
AlaTrp: 1.91 ± 0.409
AlaTyr: 3.203 ± 0.594
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.616 ± 0.194
CysCys: 0.0 ± 0.0
CysAsp: 0.678 ± 0.198
CysGlu: 0.678 ± 0.242
CysPhe: 0.493 ± 0.182
CysGly: 0.678 ± 0.23
CysHis: 0.185 ± 0.111
CysIle: 0.246 ± 0.131
CysLys: 0.554 ± 0.145
CysLeu: 0.801 ± 0.172
CysMet: 0.246 ± 0.12
CysAsn: 0.308 ± 0.122
CysPro: 0.678 ± 0.211
CysGln: 0.246 ± 0.119
CysArg: 0.739 ± 0.231
CysSer: 0.554 ± 0.232
CysThr: 0.554 ± 0.181
CysVal: 0.554 ± 0.162
CysTrp: 0.493 ± 0.186
CysTyr: 0.308 ± 0.132
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.914 ± 0.624
AspCys: 0.616 ± 0.213
AspAsp: 3.45 ± 0.544
AspGlu: 5.174 ± 0.685
AspPhe: 2.156 ± 0.368
AspGly: 5.852 ± 0.657
AspHis: 1.417 ± 0.304
AspIle: 3.511 ± 0.467
AspLys: 2.464 ± 0.351
AspLeu: 5.113 ± 0.631
AspMet: 1.663 ± 0.305
AspAsn: 1.971 ± 0.42
AspPro: 4.62 ± 0.496
AspGln: 1.848 ± 0.366
AspArg: 3.142 ± 0.435
AspSer: 2.895 ± 0.505
AspThr: 2.895 ± 0.429
AspVal: 4.435 ± 0.456
AspTrp: 1.417 ± 0.282
AspTyr: 2.71 ± 0.334
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.269 ± 0.801
GluCys: 0.308 ± 0.131
GluAsp: 4.374 ± 0.646
GluGlu: 5.359 ± 0.721
GluPhe: 3.142 ± 0.468
GluGly: 5.544 ± 0.673
GluHis: 1.232 ± 0.31
GluIle: 4.189 ± 0.488
GluLys: 2.649 ± 0.383
GluLeu: 7.638 ± 0.6
GluMet: 2.094 ± 0.363
GluAsn: 2.341 ± 0.373
GluPro: 2.895 ± 0.489
GluGln: 1.602 ± 0.332
GluArg: 4.743 ± 0.621
GluSer: 2.71 ± 0.372
GluThr: 3.942 ± 0.513
GluVal: 4.866 ± 0.659
GluTrp: 1.355 ± 0.321
GluTyr: 2.341 ± 0.387
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.957 ± 0.504
PheCys: 0.308 ± 0.155
PheAsp: 2.587 ± 0.398
PheGlu: 2.71 ± 0.389
PhePhe: 0.739 ± 0.22
PheGly: 3.018 ± 0.421
PheHis: 0.554 ± 0.235
PheIle: 1.478 ± 0.32
PheLys: 1.478 ± 0.28
PheLeu: 1.91 ± 0.372
PheMet: 0.493 ± 0.182
PheAsn: 1.417 ± 0.269
PhePro: 1.848 ± 0.39
PheGln: 1.54 ± 0.362
PheArg: 2.279 ± 0.308
PheSer: 1.786 ± 0.349
PheThr: 2.402 ± 0.371
PheVal: 2.218 ± 0.361
PheTrp: 0.493 ± 0.197
PheTyr: 0.924 ± 0.232
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.207 ± 0.945
GlyCys: 0.801 ± 0.216
GlyAsp: 6.591 ± 0.802
GlyGlu: 4.99 ± 0.541
GlyPhe: 2.957 ± 0.46
GlyGly: 9.609 ± 1.628
GlyHis: 1.602 ± 0.31
GlyIle: 4.25 ± 0.547
GlyLys: 4.558 ± 0.52
GlyLeu: 6.53 ± 0.813
GlyMet: 2.218 ± 0.445
GlyAsn: 2.587 ± 0.391
GlyPro: 3.326 ± 0.374
GlyGln: 2.649 ± 0.364
GlyArg: 4.866 ± 0.545
GlySer: 3.881 ± 0.504
GlyThr: 5.298 ± 0.679
GlyVal: 6.406 ± 0.713
GlyTrp: 1.786 ± 0.336
GlyTyr: 2.341 ± 0.323
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.417 ± 0.273
HisCys: 0.431 ± 0.156
HisAsp: 1.478 ± 0.375
HisGlu: 1.355 ± 0.355
HisPhe: 0.37 ± 0.144
HisGly: 1.663 ± 0.329
HisHis: 0.678 ± 0.245
HisIle: 1.725 ± 0.336
HisLys: 1.294 ± 0.311
HisLeu: 1.725 ± 0.378
HisMet: 0.246 ± 0.133
HisAsn: 0.493 ± 0.203
HisPro: 1.294 ± 0.27
HisGln: 0.801 ± 0.223
HisArg: 1.478 ± 0.335
HisSer: 0.678 ± 0.231
HisThr: 0.862 ± 0.19
HisVal: 1.109 ± 0.297
HisTrp: 0.246 ± 0.157
HisTyr: 0.739 ± 0.312
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.79 ± 0.716
IleCys: 0.431 ± 0.146
IleAsp: 3.388 ± 0.422
IleGlu: 4.682 ± 0.565
IlePhe: 1.294 ± 0.295
IleGly: 3.573 ± 0.57
IleHis: 1.232 ± 0.264
IleIle: 1.663 ± 0.333
IleLys: 2.526 ± 0.393
IleLeu: 4.004 ± 0.413
IleMet: 0.554 ± 0.198
IleAsn: 1.91 ± 0.368
IlePro: 4.066 ± 0.431
IleGln: 1.725 ± 0.325
IleArg: 3.388 ± 0.451
IleSer: 2.71 ± 0.438
IleThr: 3.511 ± 0.366
IleVal: 2.587 ± 0.297
IleTrp: 0.616 ± 0.168
IleTyr: 1.047 ± 0.262
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.606 ± 0.667
LysCys: 0.246 ± 0.136
LysAsp: 2.834 ± 0.355
LysGlu: 2.156 ± 0.436
LysPhe: 1.355 ± 0.258
LysGly: 3.326 ± 0.513
LysHis: 0.986 ± 0.245
LysIle: 2.033 ± 0.328
LysLys: 3.203 ± 0.583
LysLeu: 3.511 ± 0.504
LysMet: 1.294 ± 0.303
LysAsn: 1.478 ± 0.276
LysPro: 3.758 ± 0.574
LysGln: 1.478 ± 0.311
LysArg: 3.696 ± 0.544
LysSer: 1.848 ± 0.355
LysThr: 3.018 ± 0.39
LysVal: 4.312 ± 0.534
LysTrp: 0.801 ± 0.225
LysTyr: 1.294 ± 0.292
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.117 ± 0.863
LeuCys: 0.739 ± 0.215
LeuAsp: 4.312 ± 0.434
LeuGlu: 5.482 ± 0.54
LeuPhe: 2.587 ± 0.353
LeuGly: 6.776 ± 1.01
LeuHis: 1.725 ± 0.383
LeuIle: 4.866 ± 0.544
LeuLys: 3.203 ± 0.414
LeuLeu: 5.852 ± 0.632
LeuMet: 2.094 ± 0.383
LeuAsn: 2.772 ± 0.442
LeuPro: 4.312 ± 0.414
LeuGln: 2.834 ± 0.537
LeuArg: 5.606 ± 0.69
LeuSer: 5.544 ± 0.597
LeuThr: 4.25 ± 0.574
LeuVal: 4.62 ± 0.642
LeuTrp: 1.663 ± 0.289
LeuTyr: 2.402 ± 0.384
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.772 ± 0.39
MetCys: 0.246 ± 0.122
MetAsp: 1.17 ± 0.216
MetGlu: 1.725 ± 0.369
MetPhe: 0.37 ± 0.166
MetGly: 1.54 ± 0.358
MetHis: 0.37 ± 0.152
MetIle: 1.355 ± 0.286
MetLys: 1.971 ± 0.318
MetLeu: 1.478 ± 0.333
MetMet: 0.616 ± 0.184
MetAsn: 0.554 ± 0.177
MetPro: 1.478 ± 0.352
MetGln: 1.17 ± 0.343
MetArg: 1.355 ± 0.251
MetSer: 2.156 ± 0.354
MetThr: 2.402 ± 0.351
MetVal: 1.355 ± 0.337
MetTrp: 0.123 ± 0.087
MetTyr: 0.431 ± 0.154
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.71 ± 0.39
AsnCys: 0.493 ± 0.164
AsnAsp: 1.725 ± 0.336
AsnGlu: 2.279 ± 0.453
AsnPhe: 0.862 ± 0.258
AsnGly: 3.388 ± 0.463
AsnHis: 0.739 ± 0.222
AsnIle: 1.294 ± 0.311
AsnLys: 0.862 ± 0.219
AsnLeu: 2.464 ± 0.356
AsnMet: 1.109 ± 0.279
AsnAsn: 0.862 ± 0.255
AsnPro: 2.341 ± 0.351
AsnGln: 1.17 ± 0.259
AsnArg: 2.71 ± 0.426
AsnSer: 1.109 ± 0.247
AsnThr: 2.464 ± 0.484
AsnVal: 2.402 ± 0.355
AsnTrp: 0.554 ± 0.177
AsnTyr: 1.047 ± 0.241
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.312 ± 0.452
ProCys: 0.493 ± 0.166
ProAsp: 4.004 ± 0.568
ProGlu: 4.25 ± 0.609
ProPhe: 1.848 ± 0.357
ProGly: 4.805 ± 0.607
ProHis: 1.047 ± 0.201
ProIle: 3.018 ± 0.314
ProLys: 2.279 ± 0.585
ProLeu: 3.388 ± 0.441
ProMet: 1.232 ± 0.241
ProAsn: 1.971 ± 0.34
ProPro: 2.033 ± 0.41
ProGln: 1.971 ± 0.358
ProArg: 3.573 ± 0.582
ProSer: 2.71 ± 0.494
ProThr: 3.942 ± 0.494
ProVal: 3.696 ± 0.463
ProTrp: 1.17 ± 0.348
ProTyr: 1.478 ± 0.295
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.189 ± 0.649
GlnCys: 0.308 ± 0.129
GlnAsp: 1.232 ± 0.295
GlnGlu: 1.848 ± 0.364
GlnPhe: 1.417 ± 0.28
GlnGly: 2.834 ± 0.428
GlnHis: 1.047 ± 0.283
GlnIle: 1.971 ± 0.347
GlnLys: 1.786 ± 0.33
GlnLeu: 3.634 ± 0.578
GlnMet: 0.986 ± 0.273
GlnAsn: 0.986 ± 0.228
GlnPro: 1.294 ± 0.303
GlnGln: 1.725 ± 0.327
GlnArg: 2.402 ± 0.359
GlnSer: 1.478 ± 0.301
GlnThr: 2.094 ± 0.267
GlnVal: 2.279 ± 0.341
GlnTrp: 0.801 ± 0.217
GlnTyr: 0.862 ± 0.209
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.298 ± 0.679
ArgCys: 1.109 ± 0.273
ArgAsp: 4.25 ± 0.505
ArgGlu: 5.236 ± 0.696
ArgPhe: 2.71 ± 0.507
ArgGly: 3.942 ± 0.554
ArgHis: 1.17 ± 0.249
ArgIle: 3.881 ± 0.492
ArgLys: 4.066 ± 0.53
ArgLeu: 5.359 ± 0.534
ArgMet: 1.786 ± 0.321
ArgAsn: 2.341 ± 0.417
ArgPro: 2.464 ± 0.413
ArgGln: 2.156 ± 0.356
ArgArg: 4.743 ± 0.585
ArgSer: 3.696 ± 0.57
ArgThr: 2.279 ± 0.299
ArgVal: 5.051 ± 0.479
ArgTrp: 1.232 ± 0.211
ArgTyr: 2.526 ± 0.465
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.236 ± 0.579
SerCys: 0.431 ± 0.177
SerAsp: 3.758 ± 0.444
SerGlu: 3.265 ± 0.568
SerPhe: 2.156 ± 0.395
SerGly: 4.805 ± 0.622
SerHis: 0.986 ± 0.229
SerIle: 2.279 ± 0.405
SerLys: 2.279 ± 0.42
SerLeu: 4.127 ± 0.569
SerMet: 1.047 ± 0.213
SerAsn: 1.047 ± 0.264
SerPro: 3.203 ± 0.4
SerGln: 1.91 ± 0.39
SerArg: 4.127 ± 0.527
SerSer: 2.649 ± 0.524
SerThr: 2.402 ± 0.404
SerVal: 3.573 ± 0.523
SerTrp: 1.478 ± 0.384
SerTyr: 1.047 ± 0.212
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.222 ± 0.56
ThrCys: 0.554 ± 0.167
ThrAsp: 3.881 ± 0.547
ThrGlu: 3.142 ± 0.44
ThrPhe: 2.033 ± 0.333
ThrGly: 5.174 ± 0.661
ThrHis: 1.109 ± 0.27
ThrIle: 2.464 ± 0.481
ThrLys: 3.388 ± 0.497
ThrLeu: 4.743 ± 0.519
ThrMet: 1.663 ± 0.29
ThrAsn: 1.848 ± 0.283
ThrPro: 3.573 ± 0.492
ThrGln: 1.971 ± 0.305
ThrArg: 3.265 ± 0.415
ThrSer: 3.08 ± 0.535
ThrThr: 3.326 ± 0.466
ThrVal: 5.113 ± 0.535
ThrTrp: 1.109 ± 0.283
ThrTyr: 1.663 ± 0.328
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.037 ± 0.662
ValCys: 0.801 ± 0.193
ValAsp: 4.99 ± 0.528
ValGlu: 5.174 ± 0.593
ValPhe: 1.91 ± 0.41
ValGly: 5.852 ± 0.516
ValHis: 1.417 ± 0.295
ValIle: 3.326 ± 0.423
ValLys: 3.018 ± 0.417
ValLeu: 5.298 ± 0.633
ValMet: 1.417 ± 0.324
ValAsn: 2.649 ± 0.508
ValPro: 3.203 ± 0.496
ValGln: 2.279 ± 0.31
ValArg: 4.682 ± 0.573
ValSer: 4.497 ± 0.596
ValThr: 4.805 ± 0.67
ValVal: 5.482 ± 0.581
ValTrp: 1.294 ± 0.255
ValTyr: 2.341 ± 0.39
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.848 ± 0.397
TrpCys: 0.308 ± 0.151
TrpAsp: 1.232 ± 0.291
TrpGlu: 1.355 ± 0.245
TrpPhe: 0.616 ± 0.2
TrpGly: 1.478 ± 0.301
TrpHis: 0.678 ± 0.243
TrpIle: 0.801 ± 0.207
TrpLys: 0.678 ± 0.203
TrpLeu: 1.232 ± 0.319
TrpMet: 0.37 ± 0.139
TrpAsn: 0.616 ± 0.204
TrpPro: 0.924 ± 0.264
TrpGln: 0.986 ± 0.22
TrpArg: 1.109 ± 0.256
TrpSer: 1.294 ± 0.312
TrpThr: 1.602 ± 0.367
TrpVal: 1.109 ± 0.248
TrpTrp: 0.493 ± 0.206
TrpTyr: 0.678 ± 0.221
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.464 ± 0.384
TyrCys: 0.185 ± 0.114
TyrAsp: 2.218 ± 0.298
TyrGlu: 2.094 ± 0.377
TyrPhe: 0.739 ± 0.2
TyrGly: 2.279 ± 0.405
TyrHis: 0.308 ± 0.115
TyrIle: 1.54 ± 0.256
TyrLys: 0.986 ± 0.28
TyrLeu: 3.265 ± 0.421
TyrMet: 0.801 ± 0.246
TyrAsn: 0.986 ± 0.252
TyrPro: 1.294 ± 0.238
TyrGln: 1.355 ± 0.285
TyrArg: 2.71 ± 0.417
TyrSer: 1.786 ± 0.296
TyrThr: 1.91 ± 0.314
TyrVal: 2.094 ± 0.423
TyrTrp: 0.37 ± 0.197
TyrTyr: 0.801 ± 0.224
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (16235 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
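A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions, not the procedure used for the table: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation over 100 resamples is reported as the error. The function names, the fixed seed and the toy sequences are hypothetical.

```python
import random
from statistics import stdev


def dipeptide_freq_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs within each protein."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over n_boot protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the same number of proteins.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq_permille(sample, dipeptide))
    return stdev(estimates)


# Toy input (hypothetical sequences, written with one-letter codes).
toy = ["MAAGL", "AALKV", "MKRAA", "GGLAV"]
print(dipeptide_freq_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.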

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski