Amino acid dipepetide frequency for Mycobacterium phage Adjutor

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.261AlaAla: 10.261 ± 0.791
0.729AlaCys: 0.729 ± 0.202
5.982AlaAsp: 5.982 ± 0.552
7.878AlaGlu: 7.878 ± 0.69
2.14AlaPhe: 2.14 ± 0.304
7.052AlaGly: 7.052 ± 0.722
1.508AlaHis: 1.508 ± 0.242
4.134AlaIle: 4.134 ± 0.556
5.544AlaLys: 5.544 ± 0.806
8.073AlaLeu: 8.073 ± 0.831
2.529AlaMet: 2.529 ± 0.337
3.501AlaAsn: 3.501 ± 0.405
3.988AlaPro: 3.988 ± 0.374
3.696AlaGln: 3.696 ± 0.492
4.571AlaArg: 4.571 ± 0.649
5.252AlaSer: 5.252 ± 0.485
4.814AlaThr: 4.814 ± 0.507
5.982AlaVal: 5.982 ± 0.607
1.556AlaTrp: 1.556 ± 0.242
2.918AlaTyr: 2.918 ± 0.339
0.0AlaXaa: 0.0 ± 0.0
Cys
1.021CysAla: 1.021 ± 0.278
0.146CysCys: 0.146 ± 0.081
0.535CysAsp: 0.535 ± 0.177
0.681CysGlu: 0.681 ± 0.205
0.34CysPhe: 0.34 ± 0.122
0.681CysGly: 0.681 ± 0.189
0.292CysHis: 0.292 ± 0.116
0.389CysIle: 0.389 ± 0.137
0.438CysLys: 0.438 ± 0.147
0.632CysLeu: 0.632 ± 0.2
0.195CysMet: 0.195 ± 0.134
0.535CysAsn: 0.535 ± 0.165
0.973CysPro: 0.973 ± 0.345
0.243CysGln: 0.243 ± 0.108
0.486CysArg: 0.486 ± 0.152
0.729CysSer: 0.729 ± 0.186
0.584CysThr: 0.584 ± 0.172
0.438CysVal: 0.438 ± 0.155
0.146CysTrp: 0.146 ± 0.079
0.389CysTyr: 0.389 ± 0.132
0.0CysXaa: 0.0 ± 0.0
Asp
5.69AspAla: 5.69 ± 0.427
0.34AspCys: 0.34 ± 0.139
6.565AspAsp: 6.565 ± 1.374
5.69AspGlu: 5.69 ± 1.199
2.286AspPhe: 2.286 ± 0.335
5.447AspGly: 5.447 ± 0.613
1.264AspHis: 1.264 ± 0.271
2.918AspIle: 2.918 ± 0.448
3.307AspLys: 3.307 ± 0.482
6.273AspLeu: 6.273 ± 0.622
1.362AspMet: 1.362 ± 0.214
2.14AspAsn: 2.14 ± 0.3
4.28AspPro: 4.28 ± 0.436
1.945AspGln: 1.945 ± 0.31
4.814AspArg: 4.814 ± 0.49
3.404AspSer: 3.404 ± 0.449
3.21AspThr: 3.21 ± 0.348
4.231AspVal: 4.231 ± 0.446
1.41AspTrp: 1.41 ± 0.278
2.237AspTyr: 2.237 ± 0.268
0.0AspXaa: 0.0 ± 0.0
Glu
6.322GluAla: 6.322 ± 0.585
0.535GluCys: 0.535 ± 0.201
6.322GluAsp: 6.322 ± 1.32
5.933GluGlu: 5.933 ± 0.601
2.723GluPhe: 2.723 ± 0.404
4.231GluGly: 4.231 ± 0.409
1.362GluHis: 1.362 ± 0.272
3.453GluIle: 3.453 ± 0.375
2.48GluLys: 2.48 ± 0.447
6.954GluLeu: 6.954 ± 0.56
1.702GluMet: 1.702 ± 0.311
2.091GluAsn: 2.091 ± 0.261
2.432GluPro: 2.432 ± 0.359
2.869GluGln: 2.869 ± 0.402
5.301GluArg: 5.301 ± 0.594
3.842GluSer: 3.842 ± 0.383
2.723GluThr: 2.723 ± 0.386
4.28GluVal: 4.28 ± 0.61
1.556GluTrp: 1.556 ± 0.297
1.799GluTyr: 1.799 ± 0.296
0.0GluXaa: 0.0 ± 0.0
Phe
2.383PheAla: 2.383 ± 0.429
0.389PheCys: 0.389 ± 0.128
2.286PheAsp: 2.286 ± 0.259
2.043PheGlu: 2.043 ± 0.301
0.778PhePhe: 0.778 ± 0.187
3.842PheGly: 3.842 ± 0.42
0.535PheHis: 0.535 ± 0.168
1.41PheIle: 1.41 ± 0.273
1.848PheLys: 1.848 ± 0.283
2.577PheLeu: 2.577 ± 0.422
0.632PheMet: 0.632 ± 0.177
1.41PheAsn: 1.41 ± 0.314
1.459PhePro: 1.459 ± 0.346
1.362PheGln: 1.362 ± 0.261
2.334PheArg: 2.334 ± 0.362
1.751PheSer: 1.751 ± 0.347
1.653PheThr: 1.653 ± 0.24
1.605PheVal: 1.605 ± 0.277
0.486PheTrp: 0.486 ± 0.149
1.119PheTyr: 1.119 ± 0.233
0.0PheXaa: 0.0 ± 0.0
Gly
6.371GlyAla: 6.371 ± 0.824
1.167GlyCys: 1.167 ± 0.254
5.058GlyAsp: 5.058 ± 0.495
5.058GlyGlu: 5.058 ± 0.481
2.529GlyPhe: 2.529 ± 0.364
5.982GlyGly: 5.982 ± 0.713
1.556GlyHis: 1.556 ± 0.282
4.474GlyIle: 4.474 ± 0.479
5.447GlyLys: 5.447 ± 0.574
6.273GlyLeu: 6.273 ± 0.653
2.334GlyMet: 2.334 ± 0.3
2.529GlyAsn: 2.529 ± 0.308
4.28GlyPro: 4.28 ± 0.526
2.821GlyGln: 2.821 ± 0.397
3.988GlyArg: 3.988 ± 0.414
5.982GlySer: 5.982 ± 0.813
6.711GlyThr: 6.711 ± 0.602
5.252GlyVal: 5.252 ± 0.513
1.702GlyTrp: 1.702 ± 0.282
3.112GlyTyr: 3.112 ± 0.392
0.0GlyXaa: 0.0 ± 0.0
His
1.653HisAla: 1.653 ± 0.318
0.243HisCys: 0.243 ± 0.108
1.021HisAsp: 1.021 ± 0.216
1.07HisGlu: 1.07 ± 0.217
0.924HisPhe: 0.924 ± 0.2
1.848HisGly: 1.848 ± 0.229
0.34HisHis: 0.34 ± 0.138
0.924HisIle: 0.924 ± 0.214
1.021HisLys: 1.021 ± 0.206
1.799HisLeu: 1.799 ± 0.331
0.486HisMet: 0.486 ± 0.146
0.973HisAsn: 0.973 ± 0.244
1.41HisPro: 1.41 ± 0.321
0.632HisGln: 0.632 ± 0.195
1.264HisArg: 1.264 ± 0.274
0.486HisSer: 0.486 ± 0.167
0.778HisThr: 0.778 ± 0.182
1.119HisVal: 1.119 ± 0.27
0.243HisTrp: 0.243 ± 0.124
0.827HisTyr: 0.827 ± 0.197
0.0HisXaa: 0.0 ± 0.0
Ile
4.523IleAla: 4.523 ± 0.554
0.486IleCys: 0.486 ± 0.152
3.21IleAsp: 3.21 ± 0.365
3.015IleGlu: 3.015 ± 0.406
1.362IlePhe: 1.362 ± 0.261
3.696IleGly: 3.696 ± 0.506
0.778IleHis: 0.778 ± 0.197
2.577IleIle: 2.577 ± 0.431
2.432IleLys: 2.432 ± 0.339
3.064IleLeu: 3.064 ± 0.384
1.167IleMet: 1.167 ± 0.258
2.869IleAsn: 2.869 ± 0.483
3.064IlePro: 3.064 ± 0.502
2.188IleGln: 2.188 ± 0.486
3.647IleArg: 3.647 ± 0.435
2.383IleSer: 2.383 ± 0.312
3.161IleThr: 3.161 ± 0.424
3.161IleVal: 3.161 ± 0.493
0.875IleTrp: 0.875 ± 0.217
1.07IleTyr: 1.07 ± 0.249
0.0IleXaa: 0.0 ± 0.0
Lys
5.982LysAla: 5.982 ± 0.842
0.438LysCys: 0.438 ± 0.147
3.599LysAsp: 3.599 ± 0.436
3.696LysGlu: 3.696 ± 0.525
1.459LysPhe: 1.459 ± 0.211
3.55LysGly: 3.55 ± 0.554
1.119LysHis: 1.119 ± 0.254
3.064LysIle: 3.064 ± 0.323
3.21LysLys: 3.21 ± 0.507
4.474LysLeu: 4.474 ± 0.478
0.973LysMet: 0.973 ± 0.197
1.994LysAsn: 1.994 ± 0.33
3.404LysPro: 3.404 ± 0.482
1.799LysGln: 1.799 ± 0.245
4.523LysArg: 4.523 ± 0.615
2.966LysSer: 2.966 ± 0.343
3.015LysThr: 3.015 ± 0.413
3.696LysVal: 3.696 ± 0.409
1.07LysTrp: 1.07 ± 0.197
1.313LysTyr: 1.313 ± 0.213
0.0LysXaa: 0.0 ± 0.0
Leu
7.295LeuAla: 7.295 ± 0.578
0.535LeuCys: 0.535 ± 0.223
5.009LeuAsp: 5.009 ± 0.592
5.058LeuGlu: 5.058 ± 0.672
2.334LeuPhe: 2.334 ± 0.346
5.982LeuGly: 5.982 ± 0.645
1.556LeuHis: 1.556 ± 0.255
2.869LeuIle: 2.869 ± 0.412
4.863LeuLys: 4.863 ± 0.509
5.738LeuLeu: 5.738 ± 0.513
1.799LeuMet: 1.799 ± 0.276
3.307LeuAsn: 3.307 ± 0.384
4.425LeuPro: 4.425 ± 0.498
2.577LeuGln: 2.577 ± 0.357
5.544LeuArg: 5.544 ± 0.586
5.252LeuSer: 5.252 ± 0.549
5.058LeuThr: 5.058 ± 0.493
4.377LeuVal: 4.377 ± 0.432
1.119LeuTrp: 1.119 ± 0.29
1.313LeuTyr: 1.313 ± 0.278
0.0LeuXaa: 0.0 ± 0.0
Met
2.577MetAla: 2.577 ± 0.314
0.243MetCys: 0.243 ± 0.119
1.264MetAsp: 1.264 ± 0.211
1.264MetGlu: 1.264 ± 0.18
0.632MetPhe: 0.632 ± 0.175
1.556MetGly: 1.556 ± 0.284
0.34MetHis: 0.34 ± 0.132
1.119MetIle: 1.119 ± 0.208
1.41MetLys: 1.41 ± 0.282
1.216MetLeu: 1.216 ± 0.306
0.535MetMet: 0.535 ± 0.202
1.021MetAsn: 1.021 ± 0.225
1.459MetPro: 1.459 ± 0.299
0.875MetGln: 0.875 ± 0.199
1.556MetArg: 1.556 ± 0.251
1.799MetSer: 1.799 ± 0.288
1.459MetThr: 1.459 ± 0.186
0.973MetVal: 0.973 ± 0.246
0.438MetTrp: 0.438 ± 0.123
1.021MetTyr: 1.021 ± 0.26
0.0MetXaa: 0.0 ± 0.0
Asn
3.988AsnAla: 3.988 ± 0.462
0.195AsnCys: 0.195 ± 0.092
2.577AsnAsp: 2.577 ± 0.398
2.383AsnGlu: 2.383 ± 0.364
1.313AsnPhe: 1.313 ± 0.311
3.745AsnGly: 3.745 ± 0.521
0.875AsnHis: 0.875 ± 0.228
1.848AsnIle: 1.848 ± 0.292
2.14AsnLys: 2.14 ± 0.35
2.286AsnLeu: 2.286 ± 0.272
0.778AsnMet: 0.778 ± 0.193
1.459AsnAsn: 1.459 ± 0.319
3.112AsnPro: 3.112 ± 0.594
1.313AsnGln: 1.313 ± 0.319
2.821AsnArg: 2.821 ± 0.336
2.091AsnSer: 2.091 ± 0.325
2.188AsnThr: 2.188 ± 0.262
2.043AsnVal: 2.043 ± 0.308
0.632AsnTrp: 0.632 ± 0.162
1.459AsnTyr: 1.459 ± 0.27
0.0AsnXaa: 0.0 ± 0.0
Pro
4.036ProAla: 4.036 ± 0.421
0.389ProCys: 0.389 ± 0.174
3.258ProAsp: 3.258 ± 0.468
4.134ProGlu: 4.134 ± 0.47
1.945ProPhe: 1.945 ± 0.265
5.301ProGly: 5.301 ± 0.604
1.313ProHis: 1.313 ± 0.216
2.237ProIle: 2.237 ± 0.271
3.356ProLys: 3.356 ± 0.551
3.501ProLeu: 3.501 ± 0.349
0.875ProMet: 0.875 ± 0.234
2.675ProAsn: 2.675 ± 0.375
2.334ProPro: 2.334 ± 0.38
1.653ProGln: 1.653 ± 0.275
2.383ProArg: 2.383 ± 0.382
3.501ProSer: 3.501 ± 0.379
3.258ProThr: 3.258 ± 0.369
3.161ProVal: 3.161 ± 0.401
1.167ProTrp: 1.167 ± 0.282
1.653ProTyr: 1.653 ± 0.286
0.0ProXaa: 0.0 ± 0.0
Gln
3.015GlnAla: 3.015 ± 0.409
0.243GlnCys: 0.243 ± 0.128
2.286GlnAsp: 2.286 ± 0.262
1.994GlnGlu: 1.994 ± 0.284
1.605GlnPhe: 1.605 ± 0.266
2.675GlnGly: 2.675 ± 0.415
0.584GlnHis: 0.584 ± 0.165
2.626GlnIle: 2.626 ± 0.364
2.091GlnLys: 2.091 ± 0.383
2.675GlnLeu: 2.675 ± 0.343
1.07GlnMet: 1.07 ± 0.19
1.799GlnAsn: 1.799 ± 0.301
1.459GlnPro: 1.459 ± 0.276
1.362GlnGln: 1.362 ± 0.289
2.772GlnArg: 2.772 ± 0.419
2.091GlnSer: 2.091 ± 0.327
1.799GlnThr: 1.799 ± 0.322
2.383GlnVal: 2.383 ± 0.361
0.535GlnTrp: 0.535 ± 0.16
0.924GlnTyr: 0.924 ± 0.209
0.0GlnXaa: 0.0 ± 0.0
Arg
5.884ArgAla: 5.884 ± 0.668
0.486ArgCys: 0.486 ± 0.166
4.036ArgAsp: 4.036 ± 0.481
4.231ArgGlu: 4.231 ± 0.473
1.897ArgPhe: 1.897 ± 0.275
5.204ArgGly: 5.204 ± 0.435
1.021ArgHis: 1.021 ± 0.222
3.501ArgIle: 3.501 ± 0.495
4.62ArgLys: 4.62 ± 0.664
4.036ArgLeu: 4.036 ± 0.454
1.897ArgMet: 1.897 ± 0.275
2.723ArgAsn: 2.723 ± 0.449
2.675ArgPro: 2.675 ± 0.429
2.626ArgGln: 2.626 ± 0.244
5.058ArgArg: 5.058 ± 0.589
3.696ArgSer: 3.696 ± 0.387
3.404ArgThr: 3.404 ± 0.395
5.204ArgVal: 5.204 ± 0.575
1.653ArgTrp: 1.653 ± 0.299
2.14ArgTyr: 2.14 ± 0.356
0.0ArgXaa: 0.0 ± 0.0
Ser
6.176SerAla: 6.176 ± 0.581
0.924SerCys: 0.924 ± 0.261
4.571SerAsp: 4.571 ± 0.482
3.112SerGlu: 3.112 ± 0.446
2.383SerPhe: 2.383 ± 0.434
6.517SerGly: 6.517 ± 0.552
1.07SerHis: 1.07 ± 0.226
2.577SerIle: 2.577 ± 0.395
2.432SerLys: 2.432 ± 0.268
3.89SerLeu: 3.89 ± 0.506
1.264SerMet: 1.264 ± 0.223
1.751SerAsn: 1.751 ± 0.23
2.432SerPro: 2.432 ± 0.382
2.188SerGln: 2.188 ± 0.334
3.501SerArg: 3.501 ± 0.363
3.939SerSer: 3.939 ± 0.515
4.28SerThr: 4.28 ± 0.5
2.626SerVal: 2.626 ± 0.361
1.653SerTrp: 1.653 ± 0.378
1.702SerTyr: 1.702 ± 0.262
0.0SerXaa: 0.0 ± 0.0
Thr
5.447ThrAla: 5.447 ± 0.652
0.875ThrCys: 0.875 ± 0.227
3.55ThrAsp: 3.55 ± 0.44
4.182ThrGlu: 4.182 ± 0.52
2.237ThrPhe: 2.237 ± 0.342
5.69ThrGly: 5.69 ± 0.634
1.216ThrHis: 1.216 ± 0.223
3.55ThrIle: 3.55 ± 0.496
2.821ThrLys: 2.821 ± 0.374
3.745ThrLeu: 3.745 ± 0.485
1.119ThrMet: 1.119 ± 0.18
2.286ThrAsn: 2.286 ± 0.362
3.404ThrPro: 3.404 ± 0.36
1.556ThrGln: 1.556 ± 0.303
2.821ThrArg: 2.821 ± 0.463
3.988ThrSer: 3.988 ± 0.511
3.647ThrThr: 3.647 ± 0.515
3.696ThrVal: 3.696 ± 0.47
1.459ThrTrp: 1.459 ± 0.252
1.751ThrTyr: 1.751 ± 0.222
0.0ThrXaa: 0.0 ± 0.0
Val
5.641ValAla: 5.641 ± 0.594
0.778ValCys: 0.778 ± 0.248
4.036ValAsp: 4.036 ± 0.456
4.523ValGlu: 4.523 ± 0.57
1.799ValPhe: 1.799 ± 0.291
5.69ValGly: 5.69 ± 0.623
1.216ValHis: 1.216 ± 0.314
2.675ValIle: 2.675 ± 0.298
3.599ValLys: 3.599 ± 0.488
5.009ValLeu: 5.009 ± 0.516
1.167ValMet: 1.167 ± 0.227
1.994ValAsn: 1.994 ± 0.352
3.356ValPro: 3.356 ± 0.391
2.043ValGln: 2.043 ± 0.312
3.89ValArg: 3.89 ± 0.43
2.869ValSer: 2.869 ± 0.37
3.842ValThr: 3.842 ± 0.446
4.377ValVal: 4.377 ± 0.566
1.119ValTrp: 1.119 ± 0.284
2.577ValTyr: 2.577 ± 0.291
0.0ValXaa: 0.0 ± 0.0
Trp
1.508TrpAla: 1.508 ± 0.314
0.243TrpCys: 0.243 ± 0.111
1.167TrpAsp: 1.167 ± 0.226
1.167TrpGlu: 1.167 ± 0.253
0.486TrpPhe: 0.486 ± 0.167
1.313TrpGly: 1.313 ± 0.332
0.486TrpHis: 0.486 ± 0.149
1.119TrpIle: 1.119 ± 0.218
1.021TrpLys: 1.021 ± 0.233
1.459TrpLeu: 1.459 ± 0.231
0.243TrpMet: 0.243 ± 0.124
0.827TrpAsn: 0.827 ± 0.211
0.778TrpPro: 0.778 ± 0.229
0.924TrpGln: 0.924 ± 0.213
1.751TrpArg: 1.751 ± 0.316
1.216TrpSer: 1.216 ± 0.224
1.751TrpThr: 1.751 ± 0.279
1.508TrpVal: 1.508 ± 0.246
0.146TrpTrp: 0.146 ± 0.078
0.438TrpTyr: 0.438 ± 0.161
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.334TyrAla: 2.334 ± 0.357
0.486TyrCys: 0.486 ± 0.15
2.48TyrAsp: 2.48 ± 0.409
1.799TyrGlu: 1.799 ± 0.348
0.729TyrPhe: 0.729 ± 0.175
2.626TyrGly: 2.626 ± 0.324
0.632TyrHis: 0.632 ± 0.193
1.264TyrIle: 1.264 ± 0.325
1.264TyrLys: 1.264 ± 0.234
2.334TyrLeu: 2.334 ± 0.361
0.584TyrMet: 0.584 ± 0.15
1.362TyrAsn: 1.362 ± 0.224
1.41TyrPro: 1.41 ± 0.308
1.313TyrGln: 1.313 ± 0.32
2.966TyrArg: 2.966 ± 0.43
1.799TyrSer: 1.799 ± 0.292
1.702TyrThr: 1.702 ± 0.336
2.091TyrVal: 2.091 ± 0.347
0.584TyrTrp: 0.584 ± 0.171
0.973TyrTyr: 0.973 ± 0.254
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (20564 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski