Amino acid dipepetide frequency for Microbacterium phage Dismas

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.979AlaAla: 13.979 ± 1.34
0.153AlaCys: 0.153 ± 0.127
6.951AlaAsp: 6.951 ± 0.741
7.257AlaGlu: 7.257 ± 0.697
3.59AlaPhe: 3.59 ± 0.556
12.528AlaGly: 12.528 ± 0.846
1.528AlaHis: 1.528 ± 0.35
4.66AlaIle: 4.66 ± 0.627
5.882AlaLys: 5.882 ± 0.787
11.458AlaLeu: 11.458 ± 0.921
2.215AlaMet: 2.215 ± 0.392
3.514AlaAsn: 3.514 ± 0.582
6.035AlaPro: 6.035 ± 0.53
4.889AlaGln: 4.889 ± 0.526
7.333AlaArg: 7.333 ± 0.786
5.424AlaSer: 5.424 ± 0.579
5.729AlaThr: 5.729 ± 0.79
8.326AlaVal: 8.326 ± 0.912
2.521AlaTrp: 2.521 ± 0.55
2.75AlaTyr: 2.75 ± 0.447
0.0AlaXaa: 0.0 ± 0.0
Cys
0.229CysAla: 0.229 ± 0.149
0.076CysCys: 0.076 ± 0.094
0.229CysAsp: 0.229 ± 0.135
0.076CysGlu: 0.076 ± 0.077
0.306CysPhe: 0.306 ± 0.174
0.687CysGly: 0.687 ± 0.254
0.0CysHis: 0.0 ± 0.0
0.076CysIle: 0.076 ± 0.075
0.382CysLys: 0.382 ± 0.155
0.076CysLeu: 0.076 ± 0.072
0.076CysMet: 0.076 ± 0.068
0.229CysAsn: 0.229 ± 0.126
0.611CysPro: 0.611 ± 0.229
0.076CysGln: 0.076 ± 0.061
0.306CysArg: 0.306 ± 0.142
0.306CysSer: 0.306 ± 0.123
0.153CysThr: 0.153 ± 0.109
0.229CysVal: 0.229 ± 0.147
0.076CysTrp: 0.076 ± 0.079
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.264AspAla: 6.264 ± 0.712
0.382AspCys: 0.382 ± 0.172
3.132AspAsp: 3.132 ± 0.573
3.437AspGlu: 3.437 ± 0.61
2.062AspPhe: 2.062 ± 0.379
7.028AspGly: 7.028 ± 0.849
1.375AspHis: 1.375 ± 0.302
1.91AspIle: 1.91 ± 0.329
1.833AspLys: 1.833 ± 0.477
6.569AspLeu: 6.569 ± 0.82
1.069AspMet: 1.069 ± 0.308
1.069AspAsn: 1.069 ± 0.298
4.125AspPro: 4.125 ± 0.514
1.91AspGln: 1.91 ± 0.323
3.208AspArg: 3.208 ± 0.481
3.437AspSer: 3.437 ± 0.463
3.361AspThr: 3.361 ± 0.511
5.118AspVal: 5.118 ± 0.623
1.604AspTrp: 1.604 ± 0.278
2.521AspTyr: 2.521 ± 0.421
0.0AspXaa: 0.0 ± 0.0
Glu
8.785GluAla: 8.785 ± 0.842
0.076GluCys: 0.076 ± 0.075
4.125GluAsp: 4.125 ± 0.581
5.042GluGlu: 5.042 ± 0.898
2.674GluPhe: 2.674 ± 0.333
3.972GluGly: 3.972 ± 0.5
1.375GluHis: 1.375 ± 0.315
0.993GluIle: 0.993 ± 0.299
1.986GluLys: 1.986 ± 0.344
7.41GluLeu: 7.41 ± 0.605
1.146GluMet: 1.146 ± 0.311
1.604GluAsn: 1.604 ± 0.415
3.208GluPro: 3.208 ± 0.395
2.062GluGln: 2.062 ± 0.365
3.056GluArg: 3.056 ± 0.517
2.444GluSer: 2.444 ± 0.531
3.437GluThr: 3.437 ± 0.409
5.424GluVal: 5.424 ± 0.68
1.451GluTrp: 1.451 ± 0.318
1.604GluTyr: 1.604 ± 0.326
0.0GluXaa: 0.0 ± 0.0
Phe
2.903PheAla: 2.903 ± 0.549
0.306PheCys: 0.306 ± 0.144
1.604PheAsp: 1.604 ± 0.373
2.292PheGlu: 2.292 ± 0.383
0.535PhePhe: 0.535 ± 0.21
2.521PheGly: 2.521 ± 0.447
0.229PheHis: 0.229 ± 0.129
1.299PheIle: 1.299 ± 0.359
1.528PheLys: 1.528 ± 0.282
2.444PheLeu: 2.444 ± 0.385
0.764PheMet: 0.764 ± 0.231
1.528PheAsn: 1.528 ± 0.286
1.146PhePro: 1.146 ± 0.304
1.451PheGln: 1.451 ± 0.302
3.285PheArg: 3.285 ± 0.461
2.062PheSer: 2.062 ± 0.331
2.597PheThr: 2.597 ± 0.362
2.75PheVal: 2.75 ± 0.416
0.687PheTrp: 0.687 ± 0.27
0.687PheTyr: 0.687 ± 0.162
0.0PheXaa: 0.0 ± 0.0
Gly
9.167GlyAla: 9.167 ± 0.947
0.535GlyCys: 0.535 ± 0.256
6.34GlyAsp: 6.34 ± 0.643
4.125GlyGlu: 4.125 ± 0.613
2.903GlyPhe: 2.903 ± 0.563
6.799GlyGly: 6.799 ± 0.816
1.681GlyHis: 1.681 ± 0.362
3.667GlyIle: 3.667 ± 0.566
3.896GlyLys: 3.896 ± 0.703
7.104GlyLeu: 7.104 ± 0.747
2.521GlyMet: 2.521 ± 0.47
2.75GlyAsn: 2.75 ± 0.76
3.667GlyPro: 3.667 ± 0.764
4.354GlyGln: 4.354 ± 0.756
4.965GlyArg: 4.965 ± 0.481
5.042GlySer: 5.042 ± 0.616
6.569GlyThr: 6.569 ± 0.954
7.792GlyVal: 7.792 ± 1.031
2.215GlyTrp: 2.215 ± 0.423
2.903GlyTyr: 2.903 ± 0.525
0.0GlyXaa: 0.0 ± 0.0
His
2.444HisAla: 2.444 ± 0.439
0.076HisCys: 0.076 ± 0.079
0.917HisAsp: 0.917 ± 0.233
1.069HisGlu: 1.069 ± 0.314
1.146HisPhe: 1.146 ± 0.305
1.375HisGly: 1.375 ± 0.41
0.458HisHis: 0.458 ± 0.168
0.382HisIle: 0.382 ± 0.167
0.764HisLys: 0.764 ± 0.216
2.521HisLeu: 2.521 ± 0.455
0.306HisMet: 0.306 ± 0.13
0.306HisAsn: 0.306 ± 0.136
1.222HisPro: 1.222 ± 0.288
0.382HisGln: 0.382 ± 0.144
0.687HisArg: 0.687 ± 0.167
0.535HisSer: 0.535 ± 0.286
0.917HisThr: 0.917 ± 0.277
1.451HisVal: 1.451 ± 0.355
0.153HisTrp: 0.153 ± 0.09
0.917HisTyr: 0.917 ± 0.238
0.0HisXaa: 0.0 ± 0.0
Ile
4.583IleAla: 4.583 ± 0.51
0.076IleCys: 0.076 ± 0.083
2.674IleAsp: 2.674 ± 0.368
2.903IleGlu: 2.903 ± 0.422
0.687IlePhe: 0.687 ± 0.233
3.285IleGly: 3.285 ± 0.5
0.993IleHis: 0.993 ± 0.312
1.757IleIle: 1.757 ± 0.312
2.139IleLys: 2.139 ± 0.437
3.056IleLeu: 3.056 ± 0.481
0.382IleMet: 0.382 ± 0.149
1.146IleAsn: 1.146 ± 0.283
1.375IlePro: 1.375 ± 0.386
0.993IleGln: 0.993 ± 0.356
2.597IleArg: 2.597 ± 0.492
2.139IleSer: 2.139 ± 0.398
2.444IleThr: 2.444 ± 0.612
3.056IleVal: 3.056 ± 0.482
0.687IleTrp: 0.687 ± 0.261
0.917IleTyr: 0.917 ± 0.263
0.0IleXaa: 0.0 ± 0.0
Lys
6.035LysAla: 6.035 ± 1.094
0.611LysCys: 0.611 ± 0.249
1.986LysAsp: 1.986 ± 0.441
2.139LysGlu: 2.139 ± 0.381
1.222LysPhe: 1.222 ± 0.342
3.437LysGly: 3.437 ± 0.469
0.535LysHis: 0.535 ± 0.231
1.146LysIle: 1.146 ± 0.332
1.451LysLys: 1.451 ± 0.387
3.896LysLeu: 3.896 ± 0.624
0.993LysMet: 0.993 ± 0.301
0.84LysAsn: 0.84 ± 0.212
2.75LysPro: 2.75 ± 0.43
1.757LysGln: 1.757 ± 0.36
3.514LysArg: 3.514 ± 0.382
2.139LysSer: 2.139 ± 0.47
2.826LysThr: 2.826 ± 0.448
3.056LysVal: 3.056 ± 0.404
0.306LysTrp: 0.306 ± 0.155
0.611LysTyr: 0.611 ± 0.199
0.0LysXaa: 0.0 ± 0.0
Leu
9.93LeuAla: 9.93 ± 0.996
0.306LeuCys: 0.306 ± 0.135
5.5LeuAsp: 5.5 ± 0.696
5.424LeuGlu: 5.424 ± 0.625
2.674LeuPhe: 2.674 ± 0.447
7.792LeuGly: 7.792 ± 0.708
1.757LeuHis: 1.757 ± 0.393
3.437LeuIle: 3.437 ± 0.593
3.667LeuLys: 3.667 ± 0.65
6.646LeuLeu: 6.646 ± 0.597
2.521LeuMet: 2.521 ± 0.4
3.208LeuAsn: 3.208 ± 0.486
4.354LeuPro: 4.354 ± 0.574
3.667LeuGln: 3.667 ± 0.786
5.653LeuArg: 5.653 ± 0.581
4.812LeuSer: 4.812 ± 0.532
5.5LeuThr: 5.5 ± 0.991
8.174LeuVal: 8.174 ± 0.647
1.451LeuTrp: 1.451 ± 0.316
1.757LeuTyr: 1.757 ± 0.314
0.0LeuXaa: 0.0 ± 0.0
Met
3.437MetAla: 3.437 ± 0.512
0.0MetCys: 0.0 ± 0.0
1.604MetAsp: 1.604 ± 0.366
1.222MetGlu: 1.222 ± 0.281
0.458MetPhe: 0.458 ± 0.154
1.069MetGly: 1.069 ± 0.286
0.076MetHis: 0.076 ± 0.071
0.611MetIle: 0.611 ± 0.214
1.222MetLys: 1.222 ± 0.308
1.681MetLeu: 1.681 ± 0.328
0.306MetMet: 0.306 ± 0.134
0.84MetAsn: 0.84 ± 0.22
0.764MetPro: 0.764 ± 0.186
0.535MetGln: 0.535 ± 0.186
1.146MetArg: 1.146 ± 0.295
1.604MetSer: 1.604 ± 0.335
1.91MetThr: 1.91 ± 0.372
1.146MetVal: 1.146 ± 0.274
0.458MetTrp: 0.458 ± 0.176
0.382MetTyr: 0.382 ± 0.155
0.0MetXaa: 0.0 ± 0.0
Asn
2.826AsnAla: 2.826 ± 0.428
0.229AsnCys: 0.229 ± 0.129
1.528AsnAsp: 1.528 ± 0.332
1.146AsnGlu: 1.146 ± 0.275
0.764AsnPhe: 0.764 ± 0.246
3.896AsnGly: 3.896 ± 0.811
0.687AsnHis: 0.687 ± 0.279
0.687AsnIle: 0.687 ± 0.218
0.993AsnLys: 0.993 ± 0.292
2.215AsnLeu: 2.215 ± 0.4
0.611AsnMet: 0.611 ± 0.251
0.84AsnAsn: 0.84 ± 0.228
1.986AsnPro: 1.986 ± 0.339
1.91AsnGln: 1.91 ± 0.373
2.215AsnArg: 2.215 ± 0.586
1.757AsnSer: 1.757 ± 0.323
1.451AsnThr: 1.451 ± 0.301
2.062AsnVal: 2.062 ± 0.467
0.687AsnTrp: 0.687 ± 0.237
0.84AsnTyr: 0.84 ± 0.239
0.0AsnXaa: 0.0 ± 0.0
Pro
7.868ProAla: 7.868 ± 0.649
0.306ProCys: 0.306 ± 0.161
3.437ProAsp: 3.437 ± 0.483
4.507ProGlu: 4.507 ± 0.557
1.833ProPhe: 1.833 ± 0.339
5.347ProGly: 5.347 ± 0.606
1.069ProHis: 1.069 ± 0.334
2.215ProIle: 2.215 ± 0.441
2.139ProLys: 2.139 ± 0.475
4.125ProLeu: 4.125 ± 0.657
0.917ProMet: 0.917 ± 0.237
1.681ProAsn: 1.681 ± 0.373
1.528ProPro: 1.528 ± 0.342
1.681ProGln: 1.681 ± 0.511
2.215ProArg: 2.215 ± 0.446
3.667ProSer: 3.667 ± 0.589
2.368ProThr: 2.368 ± 0.35
3.743ProVal: 3.743 ± 0.494
1.222ProTrp: 1.222 ± 0.289
0.382ProTyr: 0.382 ± 0.155
0.0ProXaa: 0.0 ± 0.0
Gln
4.049GlnAla: 4.049 ± 0.507
0.153GlnCys: 0.153 ± 0.188
2.215GlnAsp: 2.215 ± 0.414
2.292GlnGlu: 2.292 ± 0.473
1.299GlnPhe: 1.299 ± 0.408
2.521GlnGly: 2.521 ± 0.346
0.535GlnHis: 0.535 ± 0.239
0.764GlnIle: 0.764 ± 0.218
1.222GlnLys: 1.222 ± 0.24
3.667GlnLeu: 3.667 ± 0.546
0.535GlnMet: 0.535 ± 0.187
1.146GlnAsn: 1.146 ± 0.241
1.833GlnPro: 1.833 ± 0.392
1.299GlnGln: 1.299 ± 0.243
2.139GlnArg: 2.139 ± 0.349
1.681GlnSer: 1.681 ± 0.354
2.674GlnThr: 2.674 ± 0.541
4.201GlnVal: 4.201 ± 0.541
0.687GlnTrp: 0.687 ± 0.276
0.993GlnTyr: 0.993 ± 0.247
0.0GlnXaa: 0.0 ± 0.0
Arg
6.875ArgAla: 6.875 ± 0.828
0.229ArgCys: 0.229 ± 0.185
4.201ArgAsp: 4.201 ± 0.49
4.201ArgGlu: 4.201 ± 0.684
2.292ArgPhe: 2.292 ± 0.43
5.194ArgGly: 5.194 ± 0.664
0.917ArgHis: 0.917 ± 0.273
2.597ArgIle: 2.597 ± 0.503
2.368ArgLys: 2.368 ± 0.38
6.493ArgLeu: 6.493 ± 0.722
1.069ArgMet: 1.069 ± 0.261
1.681ArgAsn: 1.681 ± 0.336
3.132ArgPro: 3.132 ± 0.492
2.062ArgGln: 2.062 ± 0.363
5.576ArgArg: 5.576 ± 0.716
3.361ArgSer: 3.361 ± 0.483
4.049ArgThr: 4.049 ± 0.545
5.347ArgVal: 5.347 ± 0.753
1.222ArgTrp: 1.222 ± 0.324
1.299ArgTyr: 1.299 ± 0.315
0.0ArgXaa: 0.0 ± 0.0
Ser
6.111SerAla: 6.111 ± 0.63
0.229SerCys: 0.229 ± 0.102
3.437SerAsp: 3.437 ± 0.478
2.75SerGlu: 2.75 ± 0.437
2.139SerPhe: 2.139 ± 0.433
6.264SerGly: 6.264 ± 0.815
0.764SerHis: 0.764 ± 0.247
2.368SerIle: 2.368 ± 0.513
2.292SerLys: 2.292 ± 0.426
4.66SerLeu: 4.66 ± 0.681
1.451SerMet: 1.451 ± 0.312
1.757SerAsn: 1.757 ± 0.363
3.208SerPro: 3.208 ± 0.481
1.528SerGln: 1.528 ± 0.325
3.667SerArg: 3.667 ± 0.385
2.674SerSer: 2.674 ± 0.479
3.819SerThr: 3.819 ± 0.626
4.431SerVal: 4.431 ± 0.65
1.222SerTrp: 1.222 ± 0.29
0.917SerTyr: 0.917 ± 0.232
0.0SerXaa: 0.0 ± 0.0
Thr
7.181ThrAla: 7.181 ± 0.893
0.076ThrCys: 0.076 ± 0.078
3.437ThrAsp: 3.437 ± 0.499
3.285ThrGlu: 3.285 ± 0.496
2.292ThrPhe: 2.292 ± 0.513
5.424ThrGly: 5.424 ± 0.784
0.993ThrHis: 0.993 ± 0.331
3.972ThrIle: 3.972 ± 0.657
2.826ThrLys: 2.826 ± 0.424
4.736ThrLeu: 4.736 ± 0.647
0.611ThrMet: 0.611 ± 0.223
1.222ThrAsn: 1.222 ± 0.348
4.431ThrPro: 4.431 ± 0.511
0.764ThrGln: 0.764 ± 0.201
3.59ThrArg: 3.59 ± 0.544
4.125ThrSer: 4.125 ± 0.56
3.667ThrThr: 3.667 ± 0.609
6.187ThrVal: 6.187 ± 0.809
1.451ThrTrp: 1.451 ± 0.305
1.681ThrTyr: 1.681 ± 0.346
0.0ThrXaa: 0.0 ± 0.0
Val
9.854ValAla: 9.854 ± 0.815
0.153ValCys: 0.153 ± 0.111
5.042ValAsp: 5.042 ± 0.663
5.729ValGlu: 5.729 ± 0.595
2.75ValPhe: 2.75 ± 0.469
5.806ValGly: 5.806 ± 0.844
1.986ValHis: 1.986 ± 0.36
3.437ValIle: 3.437 ± 0.453
3.208ValLys: 3.208 ± 0.585
5.958ValLeu: 5.958 ± 0.74
1.375ValMet: 1.375 ± 0.269
2.444ValAsn: 2.444 ± 0.392
5.118ValPro: 5.118 ± 0.642
2.674ValGln: 2.674 ± 0.39
5.729ValArg: 5.729 ± 0.585
4.889ValSer: 4.889 ± 0.597
5.576ValThr: 5.576 ± 0.734
8.25ValVal: 8.25 ± 0.789
1.833ValTrp: 1.833 ± 0.417
2.368ValTyr: 2.368 ± 0.423
0.0ValXaa: 0.0 ± 0.0
Trp
2.139TrpAla: 2.139 ± 0.435
0.076TrpCys: 0.076 ± 0.071
0.84TrpAsp: 0.84 ± 0.314
1.222TrpGlu: 1.222 ± 0.381
0.382TrpPhe: 0.382 ± 0.205
1.681TrpGly: 1.681 ± 0.297
0.458TrpHis: 0.458 ± 0.154
0.993TrpIle: 0.993 ± 0.273
0.84TrpLys: 0.84 ± 0.194
2.215TrpLeu: 2.215 ± 0.388
0.687TrpMet: 0.687 ± 0.192
0.917TrpAsn: 0.917 ± 0.253
0.382TrpPro: 0.382 ± 0.17
1.222TrpGln: 1.222 ± 0.414
1.069TrpArg: 1.069 ± 0.384
1.833TrpSer: 1.833 ± 0.371
1.375TrpThr: 1.375 ± 0.421
1.604TrpVal: 1.604 ± 0.253
0.153TrpTrp: 0.153 ± 0.113
0.458TrpTyr: 0.458 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.139TyrAla: 2.139 ± 0.493
0.153TyrCys: 0.153 ± 0.112
2.215TyrAsp: 2.215 ± 0.325
1.757TyrGlu: 1.757 ± 0.305
0.611TyrPhe: 0.611 ± 0.175
2.368TyrGly: 2.368 ± 0.385
0.611TyrHis: 0.611 ± 0.191
0.917TyrIle: 0.917 ± 0.256
0.611TyrLys: 0.611 ± 0.203
1.299TyrLeu: 1.299 ± 0.308
0.84TyrMet: 0.84 ± 0.24
0.611TyrAsn: 0.611 ± 0.206
1.375TyrPro: 1.375 ± 0.325
0.687TyrGln: 0.687 ± 0.231
2.215TyrArg: 2.215 ± 0.429
1.757TyrSer: 1.757 ± 0.458
1.528TyrThr: 1.528 ± 0.364
1.833TyrVal: 1.833 ± 0.34
0.458TyrTrp: 0.458 ± 0.203
0.306TyrTyr: 0.306 ± 0.17
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (13092 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski