Amino acid dipepetide frequency for Mycobacterium phage PP

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.261AlaAla: 11.261 ± 1.072
0.7AlaCys: 0.7 ± 0.194
6.362AlaAsp: 6.362 ± 0.732
7.126AlaGlu: 7.126 ± 0.876
2.99AlaPhe: 2.99 ± 0.406
7.635AlaGly: 7.635 ± 0.749
1.718AlaHis: 1.718 ± 0.361
4.453AlaIle: 4.453 ± 0.607
3.499AlaLys: 3.499 ± 0.452
8.334AlaLeu: 8.334 ± 0.785
2.799AlaMet: 2.799 ± 0.489
2.799AlaAsn: 2.799 ± 0.476
4.517AlaPro: 4.517 ± 0.758
3.054AlaGln: 3.054 ± 0.474
5.79AlaArg: 5.79 ± 0.637
4.772AlaSer: 4.772 ± 0.634
5.344AlaThr: 5.344 ± 0.729
7.507AlaVal: 7.507 ± 0.754
1.463AlaTrp: 1.463 ± 0.286
2.227AlaTyr: 2.227 ± 0.448
0.0AlaXaa: 0.0 ± 0.0
Cys
0.573CysAla: 0.573 ± 0.178
0.318CysCys: 0.318 ± 0.195
0.636CysAsp: 0.636 ± 0.197
0.318CysGlu: 0.318 ± 0.159
0.254CysPhe: 0.254 ± 0.164
0.827CysGly: 0.827 ± 0.253
0.127CysHis: 0.127 ± 0.086
0.445CysIle: 0.445 ± 0.17
0.382CysLys: 0.382 ± 0.174
0.509CysLeu: 0.509 ± 0.155
0.254CysMet: 0.254 ± 0.116
0.382CysAsn: 0.382 ± 0.157
0.382CysPro: 0.382 ± 0.174
0.254CysGln: 0.254 ± 0.118
1.018CysArg: 1.018 ± 0.262
0.445CysSer: 0.445 ± 0.154
0.318CysThr: 0.318 ± 0.167
0.636CysVal: 0.636 ± 0.169
0.318CysTrp: 0.318 ± 0.131
0.254CysTyr: 0.254 ± 0.163
0.0CysXaa: 0.0 ± 0.0
Asp
6.617AspAla: 6.617 ± 0.675
0.636AspCys: 0.636 ± 0.199
4.517AspAsp: 4.517 ± 0.601
4.962AspGlu: 4.962 ± 0.633
2.354AspPhe: 2.354 ± 0.407
5.79AspGly: 5.79 ± 0.623
1.654AspHis: 1.654 ± 0.367
2.545AspIle: 2.545 ± 0.395
2.163AspLys: 2.163 ± 0.365
6.299AspLeu: 6.299 ± 0.757
1.463AspMet: 1.463 ± 0.263
2.227AspAsn: 2.227 ± 0.37
4.453AspPro: 4.453 ± 0.556
2.545AspGln: 2.545 ± 0.376
4.39AspArg: 4.39 ± 0.459
3.245AspSer: 3.245 ± 0.545
3.563AspThr: 3.563 ± 0.431
5.471AspVal: 5.471 ± 0.576
2.227AspTrp: 2.227 ± 0.391
1.909AspTyr: 1.909 ± 0.313
0.0AspXaa: 0.0 ± 0.0
Glu
5.917GluAla: 5.917 ± 0.641
0.382GluCys: 0.382 ± 0.15
5.09GluAsp: 5.09 ± 0.721
4.263GluGlu: 4.263 ± 0.508
2.99GluPhe: 2.99 ± 0.521
6.235GluGly: 6.235 ± 1.047
1.463GluHis: 1.463 ± 0.299
3.308GluIle: 3.308 ± 0.398
2.672GluLys: 2.672 ± 0.375
6.426GluLeu: 6.426 ± 0.607
1.463GluMet: 1.463 ± 0.309
1.972GluAsn: 1.972 ± 0.437
2.608GluPro: 2.608 ± 0.497
2.736GluGln: 2.736 ± 0.458
4.899GluArg: 4.899 ± 0.642
3.117GluSer: 3.117 ± 0.497
4.581GluThr: 4.581 ± 0.509
5.98GluVal: 5.98 ± 0.646
1.209GluTrp: 1.209 ± 0.277
1.909GluTyr: 1.909 ± 0.299
0.0GluXaa: 0.0 ± 0.0
Phe
3.117PheAla: 3.117 ± 0.472
0.127PheCys: 0.127 ± 0.094
2.863PheAsp: 2.863 ± 0.378
2.608PheGlu: 2.608 ± 0.375
0.827PhePhe: 0.827 ± 0.231
3.245PheGly: 3.245 ± 0.459
0.445PheHis: 0.445 ± 0.161
1.4PheIle: 1.4 ± 0.247
0.891PheLys: 0.891 ± 0.273
2.227PheLeu: 2.227 ± 0.52
0.573PheMet: 0.573 ± 0.221
2.1PheAsn: 2.1 ± 0.368
1.781PhePro: 1.781 ± 0.309
0.954PheGln: 0.954 ± 0.243
2.418PheArg: 2.418 ± 0.292
2.163PheSer: 2.163 ± 0.412
2.036PheThr: 2.036 ± 0.354
1.972PheVal: 1.972 ± 0.363
0.318PheTrp: 0.318 ± 0.123
0.382PheTyr: 0.382 ± 0.152
0.0PheXaa: 0.0 ± 0.0
Gly
6.68GlyAla: 6.68 ± 0.723
0.763GlyCys: 0.763 ± 0.21
6.044GlyAsp: 6.044 ± 0.823
5.471GlyGlu: 5.471 ± 0.652
2.927GlyPhe: 2.927 ± 0.496
7.698GlyGly: 7.698 ± 0.685
2.29GlyHis: 2.29 ± 0.421
3.881GlyIle: 3.881 ± 0.68
3.626GlyLys: 3.626 ± 0.464
6.744GlyLeu: 6.744 ± 0.691
2.163GlyMet: 2.163 ± 0.341
2.672GlyAsn: 2.672 ± 0.501
5.726GlyPro: 5.726 ± 1.804
2.799GlyGln: 2.799 ± 0.489
5.408GlyArg: 5.408 ± 0.679
4.581GlySer: 4.581 ± 0.539
5.344GlyThr: 5.344 ± 0.521
6.362GlyVal: 6.362 ± 0.555
1.781GlyTrp: 1.781 ± 0.265
2.608GlyTyr: 2.608 ± 0.382
0.0GlyXaa: 0.0 ± 0.0
His
1.336HisAla: 1.336 ± 0.308
0.318HisCys: 0.318 ± 0.127
1.527HisAsp: 1.527 ± 0.29
1.527HisGlu: 1.527 ± 0.322
0.7HisPhe: 0.7 ± 0.217
2.1HisGly: 2.1 ± 0.44
0.636HisHis: 0.636 ± 0.2
1.018HisIle: 1.018 ± 0.282
1.018HisLys: 1.018 ± 0.26
1.527HisLeu: 1.527 ± 0.353
0.254HisMet: 0.254 ± 0.131
0.636HisAsn: 0.636 ± 0.23
1.336HisPro: 1.336 ± 0.29
0.954HisGln: 0.954 ± 0.223
2.418HisArg: 2.418 ± 0.419
0.954HisSer: 0.954 ± 0.276
1.209HisThr: 1.209 ± 0.318
1.718HisVal: 1.718 ± 0.374
0.382HisTrp: 0.382 ± 0.166
0.827HisTyr: 0.827 ± 0.204
0.0HisXaa: 0.0 ± 0.0
Ile
5.281IleAla: 5.281 ± 0.561
0.191IleCys: 0.191 ± 0.099
3.245IleAsp: 3.245 ± 0.449
3.436IleGlu: 3.436 ± 0.406
0.954IlePhe: 0.954 ± 0.253
3.436IleGly: 3.436 ± 0.584
1.527IleHis: 1.527 ± 0.357
2.036IleIle: 2.036 ± 0.364
1.4IleLys: 1.4 ± 0.353
3.436IleLeu: 3.436 ± 0.423
0.636IleMet: 0.636 ± 0.205
1.527IleAsn: 1.527 ± 0.34
3.563IlePro: 3.563 ± 0.475
2.1IleGln: 2.1 ± 0.463
3.563IleArg: 3.563 ± 0.513
2.29IleSer: 2.29 ± 0.325
2.863IleThr: 2.863 ± 0.432
3.054IleVal: 3.054 ± 0.517
0.827IleTrp: 0.827 ± 0.216
0.763IleTyr: 0.763 ± 0.205
0.0IleXaa: 0.0 ± 0.0
Lys
4.072LysAla: 4.072 ± 0.506
0.254LysCys: 0.254 ± 0.117
2.354LysAsp: 2.354 ± 0.428
1.4LysGlu: 1.4 ± 0.266
1.018LysPhe: 1.018 ± 0.266
3.181LysGly: 3.181 ± 0.595
0.891LysHis: 0.891 ± 0.236
2.672LysIle: 2.672 ± 0.641
1.718LysLys: 1.718 ± 0.349
4.135LysLeu: 4.135 ± 0.561
0.891LysMet: 0.891 ± 0.246
1.018LysAsn: 1.018 ± 0.307
2.29LysPro: 2.29 ± 0.435
1.463LysGln: 1.463 ± 0.31
2.481LysArg: 2.481 ± 0.395
1.972LysSer: 1.972 ± 0.399
2.036LysThr: 2.036 ± 0.382
3.308LysVal: 3.308 ± 0.548
0.509LysTrp: 0.509 ± 0.163
1.018LysTyr: 1.018 ± 0.223
0.0LysXaa: 0.0 ± 0.0
Leu
7.825LeuAla: 7.825 ± 0.843
0.509LeuCys: 0.509 ± 0.212
5.026LeuAsp: 5.026 ± 0.504
5.98LeuGlu: 5.98 ± 0.775
2.036LeuPhe: 2.036 ± 0.299
6.617LeuGly: 6.617 ± 0.697
1.909LeuHis: 1.909 ± 0.393
3.563LeuIle: 3.563 ± 0.474
3.436LeuLys: 3.436 ± 0.665
5.853LeuLeu: 5.853 ± 0.568
2.354LeuMet: 2.354 ± 0.34
2.736LeuAsn: 2.736 ± 0.42
4.263LeuPro: 4.263 ± 0.423
2.672LeuGln: 2.672 ± 0.444
6.299LeuArg: 6.299 ± 0.69
4.517LeuSer: 4.517 ± 0.398
4.581LeuThr: 4.581 ± 0.661
5.917LeuVal: 5.917 ± 0.518
1.527LeuTrp: 1.527 ± 0.348
2.481LeuTyr: 2.481 ± 0.353
0.0LeuXaa: 0.0 ± 0.0
Met
2.481MetAla: 2.481 ± 0.405
0.0MetCys: 0.0 ± 0.0
1.909MetAsp: 1.909 ± 0.31
1.209MetGlu: 1.209 ± 0.251
0.636MetPhe: 0.636 ± 0.17
1.909MetGly: 1.909 ± 0.341
0.382MetHis: 0.382 ± 0.152
0.954MetIle: 0.954 ± 0.268
0.891MetLys: 0.891 ± 0.211
1.4MetLeu: 1.4 ± 0.29
0.254MetMet: 0.254 ± 0.112
0.891MetAsn: 0.891 ± 0.193
0.954MetPro: 0.954 ± 0.293
0.827MetGln: 0.827 ± 0.203
1.463MetArg: 1.463 ± 0.266
2.227MetSer: 2.227 ± 0.35
1.972MetThr: 1.972 ± 0.287
1.845MetVal: 1.845 ± 0.39
0.191MetTrp: 0.191 ± 0.105
0.827MetTyr: 0.827 ± 0.212
0.0MetXaa: 0.0 ± 0.0
Asn
3.117AsnAla: 3.117 ± 0.508
0.254AsnCys: 0.254 ± 0.135
1.781AsnAsp: 1.781 ± 0.341
1.781AsnGlu: 1.781 ± 0.3
0.891AsnPhe: 0.891 ± 0.213
3.054AsnGly: 3.054 ± 0.425
0.891AsnHis: 0.891 ± 0.252
1.845AsnIle: 1.845 ± 0.339
0.7AsnLys: 0.7 ± 0.223
3.117AsnLeu: 3.117 ± 0.591
0.509AsnMet: 0.509 ± 0.17
0.636AsnAsn: 0.636 ± 0.194
2.863AsnPro: 2.863 ± 0.364
1.145AsnGln: 1.145 ± 0.242
2.29AsnArg: 2.29 ± 0.426
1.654AsnSer: 1.654 ± 0.291
1.336AsnThr: 1.336 ± 0.259
2.227AsnVal: 2.227 ± 0.351
0.382AsnTrp: 0.382 ± 0.149
1.082AsnTyr: 1.082 ± 0.225
0.0AsnXaa: 0.0 ± 0.0
Pro
4.772ProAla: 4.772 ± 0.627
0.636ProCys: 0.636 ± 0.243
4.263ProAsp: 4.263 ± 0.477
4.899ProGlu: 4.899 ± 0.892
2.736ProPhe: 2.736 ± 0.418
4.644ProGly: 4.644 ± 0.503
0.954ProHis: 0.954 ± 0.231
2.227ProIle: 2.227 ± 0.306
1.781ProLys: 1.781 ± 0.345
3.372ProLeu: 3.372 ± 0.447
1.209ProMet: 1.209 ± 0.289
1.718ProAsn: 1.718 ± 0.372
2.863ProPro: 2.863 ± 0.659
2.227ProGln: 2.227 ± 0.815
2.672ProArg: 2.672 ± 0.5
2.799ProSer: 2.799 ± 0.384
3.626ProThr: 3.626 ± 0.542
4.453ProVal: 4.453 ± 0.558
1.145ProTrp: 1.145 ± 0.343
1.4ProTyr: 1.4 ± 0.307
0.0ProXaa: 0.0 ± 0.0
Gln
3.881GlnAla: 3.881 ± 0.652
0.191GlnCys: 0.191 ± 0.114
1.781GlnAsp: 1.781 ± 0.356
2.418GlnGlu: 2.418 ± 0.428
1.018GlnPhe: 1.018 ± 0.204
3.563GlnGly: 3.563 ± 1.401
0.636GlnHis: 0.636 ± 0.216
2.29GlnIle: 2.29 ± 0.357
1.209GlnLys: 1.209 ± 0.257
3.372GlnLeu: 3.372 ± 0.786
0.636GlnMet: 0.636 ± 0.158
1.082GlnAsn: 1.082 ± 0.333
1.654GlnPro: 1.654 ± 0.377
2.036GlnGln: 2.036 ± 0.456
2.736GlnArg: 2.736 ± 0.583
1.272GlnSer: 1.272 ± 0.247
2.1GlnThr: 2.1 ± 0.383
2.799GlnVal: 2.799 ± 0.41
0.891GlnTrp: 0.891 ± 0.234
1.145GlnTyr: 1.145 ± 0.282
0.0GlnXaa: 0.0 ± 0.0
Arg
6.108ArgAla: 6.108 ± 0.651
1.018ArgCys: 1.018 ± 0.328
5.344ArgAsp: 5.344 ± 0.622
5.217ArgGlu: 5.217 ± 0.72
2.036ArgPhe: 2.036 ± 0.379
4.581ArgGly: 4.581 ± 0.579
1.845ArgHis: 1.845 ± 0.357
3.945ArgIle: 3.945 ± 0.468
3.499ArgLys: 3.499 ± 0.49
5.408ArgLeu: 5.408 ± 0.661
2.1ArgMet: 2.1 ± 0.407
1.972ArgAsn: 1.972 ± 0.285
2.863ArgPro: 2.863 ± 0.44
2.163ArgGln: 2.163 ± 0.379
6.171ArgArg: 6.171 ± 0.74
3.372ArgSer: 3.372 ± 0.458
2.736ArgThr: 2.736 ± 0.433
4.708ArgVal: 4.708 ± 0.447
1.781ArgTrp: 1.781 ± 0.331
2.481ArgTyr: 2.481 ± 0.399
0.0ArgXaa: 0.0 ± 0.0
Ser
5.408SerAla: 5.408 ± 0.862
0.509SerCys: 0.509 ± 0.193
4.199SerAsp: 4.199 ± 0.526
4.008SerGlu: 4.008 ± 0.461
2.036SerPhe: 2.036 ± 0.394
5.281SerGly: 5.281 ± 0.818
0.7SerHis: 0.7 ± 0.235
2.418SerIle: 2.418 ± 0.396
2.1SerLys: 2.1 ± 0.451
3.626SerLeu: 3.626 ± 0.468
1.463SerMet: 1.463 ± 0.373
1.209SerAsn: 1.209 ± 0.29
2.736SerPro: 2.736 ± 0.475
1.972SerGln: 1.972 ± 0.345
3.563SerArg: 3.563 ± 0.591
3.117SerSer: 3.117 ± 0.638
3.181SerThr: 3.181 ± 0.449
3.754SerVal: 3.754 ± 0.47
1.018SerTrp: 1.018 ± 0.211
1.781SerTyr: 1.781 ± 0.374
0.0SerXaa: 0.0 ± 0.0
Thr
4.962ThrAla: 4.962 ± 0.731
0.382ThrCys: 0.382 ± 0.161
3.245ThrAsp: 3.245 ± 0.536
4.263ThrGlu: 4.263 ± 0.465
1.845ThrPhe: 1.845 ± 0.331
6.235ThrGly: 6.235 ± 0.601
1.654ThrHis: 1.654 ± 0.351
1.972ThrIle: 1.972 ± 0.365
2.545ThrLys: 2.545 ± 0.345
4.326ThrLeu: 4.326 ± 0.576
0.954ThrMet: 0.954 ± 0.232
1.591ThrAsn: 1.591 ± 0.339
3.626ThrPro: 3.626 ± 0.429
2.418ThrGln: 2.418 ± 0.404
3.245ThrArg: 3.245 ± 0.426
3.626ThrSer: 3.626 ± 0.575
3.245ThrThr: 3.245 ± 0.442
5.79ThrVal: 5.79 ± 0.57
1.145ThrTrp: 1.145 ± 0.298
1.591ThrTyr: 1.591 ± 0.292
0.0ThrXaa: 0.0 ± 0.0
Val
6.235ValAla: 6.235 ± 0.615
0.827ValCys: 0.827 ± 0.25
5.026ValAsp: 5.026 ± 0.714
5.153ValGlu: 5.153 ± 0.569
3.117ValPhe: 3.117 ± 0.565
5.726ValGly: 5.726 ± 0.697
1.463ValHis: 1.463 ± 0.348
3.626ValIle: 3.626 ± 0.453
3.563ValLys: 3.563 ± 0.487
6.044ValLeu: 6.044 ± 0.523
1.781ValMet: 1.781 ± 0.428
2.863ValAsn: 2.863 ± 0.493
3.626ValPro: 3.626 ± 0.454
2.545ValGln: 2.545 ± 0.291
5.599ValArg: 5.599 ± 0.678
4.962ValSer: 4.962 ± 0.569
5.09ValThr: 5.09 ± 0.501
6.044ValVal: 6.044 ± 0.523
1.591ValTrp: 1.591 ± 0.349
2.354ValTyr: 2.354 ± 0.376
0.0ValXaa: 0.0 ± 0.0
Trp
2.163TrpAla: 2.163 ± 0.35
0.318TrpCys: 0.318 ± 0.159
1.4TrpAsp: 1.4 ± 0.263
1.527TrpGlu: 1.527 ± 0.346
0.382TrpPhe: 0.382 ± 0.142
1.463TrpGly: 1.463 ± 0.385
0.318TrpHis: 0.318 ± 0.171
0.573TrpIle: 0.573 ± 0.211
0.636TrpLys: 0.636 ± 0.185
1.591TrpLeu: 1.591 ± 0.364
0.827TrpMet: 0.827 ± 0.211
0.573TrpAsn: 0.573 ± 0.269
0.636TrpPro: 0.636 ± 0.201
0.954TrpGln: 0.954 ± 0.2
0.827TrpArg: 0.827 ± 0.221
1.272TrpSer: 1.272 ± 0.267
1.591TrpThr: 1.591 ± 0.301
1.463TrpVal: 1.463 ± 0.312
0.382TrpTrp: 0.382 ± 0.147
0.636TrpTyr: 0.636 ± 0.167
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.608TyrAla: 2.608 ± 0.412
0.318TyrCys: 0.318 ± 0.145
2.418TyrAsp: 2.418 ± 0.346
1.654TyrGlu: 1.654 ± 0.318
0.7TyrPhe: 0.7 ± 0.181
2.354TyrGly: 2.354 ± 0.397
0.891TyrHis: 0.891 ± 0.234
0.954TyrIle: 0.954 ± 0.21
0.954TyrLys: 0.954 ± 0.317
2.354TyrLeu: 2.354 ± 0.426
0.573TyrMet: 0.573 ± 0.263
0.954TyrAsn: 0.954 ± 0.27
1.654TyrPro: 1.654 ± 0.276
0.891TyrGln: 0.891 ± 0.208
2.1TyrArg: 2.1 ± 0.359
1.718TyrSer: 1.718 ± 0.323
1.909TyrThr: 1.909 ± 0.357
2.227TyrVal: 2.227 ± 0.374
0.445TyrTrp: 0.445 ± 0.173
1.018TyrTyr: 1.018 ± 0.25
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 80 proteins (15719 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski