Amino acid dipeptide frequency for Bacillus phage pW4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
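As an illustration of how such frequencies can be obtained, here is a minimal sketch in Python. It is not the Proteome-pI implementation. The function name `dipeptide_permille`, the toy sequences, and the choices to count overlapping pairs within each protein and to map every non-standard letter to `X` (Xaa) are assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumptions of this sketch: dipeptides are overlapping pairs of
    adjacent residues, pairs never span two proteins, and any letter
    outside the 20 standard ones is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 21 x 21 = 441 combinations, including those never observed.
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(alphabet, repeat=2)}


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MKKLAG", "MKXA"]
freqs = dipeptide_permille(toy)

# Expectation if all 400 standard dipeptides were equally likely.
uniform_expectation = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%
```

Counting within each protein means a proteome of N residues in P proteins contributes N - P dipeptides in total.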

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
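The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and comparing each value directly with 2.5‰ is an assumption of this example; the exact rule used to colour the table is not stated on this page.

```python
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille under the uniform assumption


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PERMILLE):
    """Label a per mille frequency relative to the expected value."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Three values copied from the table on this page.
table = {"LysGlu": 8.335, "AlaAla": 5.422, "GlyPro": 0.045}
labels = {name: classify(value) for name, value in table.items()}
fractions = {name: permille_to_fraction(value) for name, value in table.items()}
```

For example, LysGlu at 8.335‰ corresponds to a fraction of about 0.0083 and lies above the 2.5‰ expectation, whereas GlyPro at 0.045‰ lies far below it.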

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.422 ± 0.739
AlaCys: 0.583 ± 0.165
AlaAsp: 3.092 ± 0.376
AlaGlu: 4.66 ± 0.539
AlaPhe: 2.241 ± 0.337
AlaGly: 4.795 ± 0.63
AlaHis: 0.538 ± 0.15
AlaIle: 4.078 ± 0.464
AlaLys: 5.557 ± 0.494
AlaLeu: 4.705 ± 0.364
AlaMet: 1.389 ± 0.325
AlaAsn: 3.361 ± 0.414
AlaPro: 1.255 ± 0.2
AlaGln: 2.599 ± 0.467
AlaArg: 1.882 ± 0.276
AlaSer: 3.585 ± 0.699
AlaThr: 3.63 ± 0.547
AlaVal: 3.988 ± 0.465
AlaTrp: 0.762 ± 0.183
AlaTyr: 2.823 ± 0.39
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.403 ± 0.149
CysCys: 0.045 ± 0.04
CysAsp: 0.807 ± 0.216
CysGlu: 0.538 ± 0.136
CysPhe: 0.493 ± 0.156
CysGly: 0.314 ± 0.109
CysHis: 0.179 ± 0.084
CysIle: 0.717 ± 0.172
CysLys: 0.672 ± 0.208
CysLeu: 0.583 ± 0.181
CysMet: 0.538 ± 0.162
CysAsn: 0.314 ± 0.102
CysPro: 0.403 ± 0.116
CysGln: 0.179 ± 0.08
CysArg: 0.224 ± 0.102
CysSer: 0.358 ± 0.115
CysThr: 0.358 ± 0.131
CysVal: 0.358 ± 0.136
CysTrp: 0.045 ± 0.042
CysTyr: 0.358 ± 0.143
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.271 ± 0.436
AspCys: 0.314 ± 0.133
AspAsp: 4.033 ± 0.462
AspGlu: 4.75 ± 0.564
AspPhe: 3.271 ± 0.407
AspGly: 3.899 ± 0.374
AspHis: 1.255 ± 0.24
AspIle: 4.75 ± 0.561
AspLys: 6.094 ± 0.565
AspLeu: 5.691 ± 0.543
AspMet: 1.927 ± 0.267
AspAsn: 3.988 ± 0.414
AspPro: 1.837 ± 0.265
AspGln: 1.524 ± 0.303
AspArg: 2.913 ± 0.533
AspSer: 3.361 ± 0.489
AspThr: 3.585 ± 0.406
AspVal: 3.764 ± 0.367
AspTrp: 0.941 ± 0.174
AspTyr: 3.361 ± 0.325
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.943 ± 0.432
GluCys: 0.717 ± 0.184
GluAsp: 3.63 ± 0.403
GluGlu: 6.587 ± 0.756
GluPhe: 3.585 ± 0.394
GluGly: 3.899 ± 0.408
GluHis: 1.21 ± 0.269
GluIle: 5.422 ± 0.6
GluLys: 6.991 ± 0.732
GluLeu: 6.677 ± 0.599
GluMet: 2.465 ± 0.327
GluAsn: 3.899 ± 0.486
GluPro: 1.882 ± 0.363
GluGln: 3.674 ± 0.467
GluArg: 3.45 ± 0.475
GluSer: 3.988 ± 0.386
GluThr: 4.391 ± 0.607
GluVal: 5.243 ± 0.463
GluTrp: 0.851 ± 0.195
GluTyr: 2.823 ± 0.339
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.226 ± 0.37
PheCys: 0.448 ± 0.172
PheAsp: 3.585 ± 0.407
PheGlu: 3.63 ± 0.377
PhePhe: 1.882 ± 0.363
PheGly: 2.823 ± 0.521
PheHis: 0.538 ± 0.158
PheIle: 2.733 ± 0.418
PheLys: 3.54 ± 0.426
PheLeu: 2.958 ± 0.393
PheMet: 1.21 ± 0.226
PheAsn: 2.823 ± 0.309
PhePro: 1.031 ± 0.19
PheGln: 1.255 ± 0.209
PheArg: 1.792 ± 0.371
PheSer: 2.554 ± 0.372
PheThr: 3.271 ± 0.342
PheVal: 3.002 ± 0.335
PheTrp: 0.627 ± 0.184
PheTyr: 1.703 ± 0.255
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.585 ± 0.574
GlyCys: 0.627 ± 0.196
GlyAsp: 2.733 ± 0.396
GlyGlu: 3.719 ± 0.373
GlyPhe: 2.913 ± 0.365
GlyGly: 5.198 ± 0.834
GlyHis: 1.344 ± 0.236
GlyIle: 4.078 ± 0.382
GlyLys: 5.781 ± 0.527
GlyLeu: 5.288 ± 0.459
GlyMet: 2.151 ± 0.41
GlyAsn: 4.123 ± 0.526
GlyPro: 0.045 ± 0.039
GlyGln: 3.45 ± 0.917
GlyArg: 2.913 ± 0.338
GlySer: 4.212 ± 0.546
GlyThr: 4.391 ± 0.724
GlyVal: 4.526 ± 0.518
GlyTrp: 0.762 ± 0.187
GlyTyr: 3.54 ± 0.35
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.672 ± 0.141
HisCys: 0.134 ± 0.084
HisAsp: 1.255 ± 0.275
HisGlu: 1.3 ± 0.266
HisPhe: 0.896 ± 0.212
HisGly: 1.12 ± 0.236
HisHis: 0.269 ± 0.118
HisIle: 1.12 ± 0.181
HisLys: 1.479 ± 0.317
HisLeu: 1.613 ± 0.314
HisMet: 0.672 ± 0.181
HisAsn: 1.389 ± 0.257
HisPro: 0.448 ± 0.152
HisGln: 0.627 ± 0.154
HisArg: 0.403 ± 0.124
HisSer: 0.762 ± 0.183
HisThr: 0.762 ± 0.17
HisVal: 0.851 ± 0.173
HisTrp: 0.224 ± 0.099
HisTyr: 0.583 ± 0.155
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.167 ± 0.454
IleCys: 0.672 ± 0.222
IleAsp: 6.274 ± 0.59
IleGlu: 6.049 ± 0.494
IlePhe: 2.196 ± 0.279
IleGly: 4.526 ± 0.381
IleHis: 1.3 ± 0.228
IleIle: 4.257 ± 0.541
IleLys: 7.17 ± 0.742
IleLeu: 3.316 ± 0.373
IleMet: 1.568 ± 0.242
IleAsn: 4.078 ± 0.406
IlePro: 1.927 ± 0.254
IleGln: 2.778 ± 0.383
IleArg: 2.33 ± 0.306
IleSer: 3.361 ± 0.395
IleThr: 4.436 ± 0.439
IleVal: 4.66 ± 0.466
IleTrp: 0.583 ± 0.196
IleTyr: 2.509 ± 0.377
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.467 ± 0.487
LysCys: 0.538 ± 0.165
LysAsp: 5.512 ± 0.506
LysGlu: 8.335 ± 0.75
LysPhe: 4.123 ± 0.482
LysGly: 5.243 ± 0.383
LysHis: 1.389 ± 0.251
LysIle: 6.498 ± 0.514
LysLys: 7.528 ± 0.742
LysLeu: 7.349 ± 0.594
LysMet: 3.406 ± 0.412
LysAsn: 4.66 ± 0.553
LysPro: 2.599 ± 0.383
LysGln: 2.958 ± 0.392
LysArg: 3.406 ± 0.392
LysSer: 4.391 ± 0.387
LysThr: 5.019 ± 0.52
LysVal: 6.766 ± 0.711
LysTrp: 1.075 ± 0.188
LysTyr: 4.167 ± 0.493
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.243 ± 0.45
LeuCys: 0.583 ± 0.163
LeuAsp: 5.377 ± 0.393
LeuGlu: 6.094 ± 0.608
LeuPhe: 2.554 ± 0.365
LeuGly: 4.929 ± 0.451
LeuHis: 1.389 ± 0.21
LeuIle: 4.929 ± 0.484
LeuLys: 7.842 ± 0.611
LeuLeu: 5.108 ± 0.544
LeuMet: 2.151 ± 0.3
LeuAsn: 4.929 ± 0.4
LeuPro: 2.733 ± 0.334
LeuGln: 2.823 ± 0.392
LeuArg: 3.316 ± 0.453
LeuSer: 3.63 ± 0.437
LeuThr: 4.616 ± 0.437
LeuVal: 4.302 ± 0.547
LeuTrp: 0.896 ± 0.192
LeuTyr: 2.509 ± 0.394
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.106 ± 0.292
MetCys: 0.09 ± 0.062
MetAsp: 1.524 ± 0.251
MetGlu: 1.568 ± 0.231
MetPhe: 1.658 ± 0.279
MetGly: 1.613 ± 0.387
MetHis: 0.358 ± 0.152
MetIle: 2.196 ± 0.324
MetLys: 3.137 ± 0.399
MetLeu: 2.061 ± 0.353
MetMet: 1.031 ± 0.236
MetAsn: 2.106 ± 0.37
MetPro: 0.672 ± 0.167
MetGln: 1.031 ± 0.274
MetArg: 1.21 ± 0.28
MetSer: 2.241 ± 0.411
MetThr: 1.613 ± 0.269
MetVal: 2.42 ± 0.286
MetTrp: 0.224 ± 0.127
MetTyr: 1.031 ± 0.25
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.226 ± 0.477
AsnCys: 0.314 ± 0.126
AsnAsp: 4.078 ± 0.427
AsnGlu: 4.257 ± 0.522
AsnPhe: 1.927 ± 0.286
AsnGly: 5.467 ± 0.813
AsnHis: 1.165 ± 0.206
AsnIle: 4.033 ± 0.368
AsnLys: 5.153 ± 0.543
AsnLeu: 5.467 ± 0.393
AsnMet: 1.703 ± 0.284
AsnAsn: 4.884 ± 0.514
AsnPro: 2.554 ± 0.501
AsnGln: 2.599 ± 0.463
AsnArg: 2.196 ± 0.372
AsnSer: 3.361 ± 0.651
AsnThr: 2.778 ± 0.329
AsnVal: 3.764 ± 0.496
AsnTrp: 0.717 ± 0.18
AsnTyr: 2.599 ± 0.413
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.748 ± 0.217
ProCys: 0.224 ± 0.095
ProAsp: 1.748 ± 0.358
ProGlu: 2.42 ± 0.333
ProPhe: 1.703 ± 0.441
ProGly: 0.045 ± 0.042
ProHis: 0.358 ± 0.103
ProIle: 1.972 ± 0.341
ProLys: 2.465 ± 0.396
ProLeu: 1.748 ± 0.292
ProMet: 1.12 ± 0.228
ProAsn: 2.151 ± 0.498
ProPro: 0.403 ± 0.167
ProGln: 1.3 ± 0.277
ProArg: 0.403 ± 0.119
ProSer: 1.658 ± 0.291
ProThr: 1.972 ± 0.318
ProVal: 2.106 ± 0.306
ProTrp: 0.224 ± 0.075
ProTyr: 1.12 ± 0.232
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.33 ± 0.337
GlnCys: 0.314 ± 0.107
GlnAsp: 1.3 ± 0.212
GlnGlu: 2.554 ± 0.367
GlnPhe: 1.344 ± 0.224
GlnGly: 3.137 ± 0.607
GlnHis: 0.583 ± 0.181
GlnIle: 2.241 ± 0.315
GlnLys: 2.733 ± 0.38
GlnLeu: 3.361 ± 0.351
GlnMet: 1.613 ± 0.245
GlnAsn: 2.644 ± 0.358
GlnPro: 1.434 ± 0.453
GlnGln: 3.002 ± 1.329
GlnArg: 1.568 ± 0.247
GlnSer: 1.613 ± 0.246
GlnThr: 2.241 ± 0.528
GlnVal: 2.241 ± 0.294
GlnTrp: 0.583 ± 0.137
GlnTyr: 1.524 ± 0.261
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.927 ± 0.337
ArgCys: 0.224 ± 0.106
ArgAsp: 2.644 ± 0.342
ArgGlu: 2.285 ± 0.415
ArgPhe: 2.016 ± 0.303
ArgGly: 2.509 ± 0.283
ArgHis: 0.807 ± 0.186
ArgIle: 2.823 ± 0.373
ArgLys: 3.63 ± 0.411
ArgLeu: 3.137 ± 0.394
ArgMet: 1.075 ± 0.223
ArgAsn: 1.927 ± 0.264
ArgPro: 1.12 ± 0.277
ArgGln: 1.21 ± 0.189
ArgArg: 1.613 ± 0.267
ArgSer: 1.837 ± 0.27
ArgThr: 2.33 ± 0.27
ArgVal: 2.241 ± 0.297
ArgTrp: 0.627 ± 0.159
ArgTyr: 2.016 ± 0.352
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.719 ± 0.544
SerCys: 0.672 ± 0.169
SerAsp: 3.316 ± 0.43
SerGlu: 3.271 ± 0.36
SerPhe: 3.316 ± 0.407
SerGly: 4.705 ± 0.538
SerHis: 0.851 ± 0.195
SerIle: 4.033 ± 0.483
SerLys: 4.616 ± 0.436
SerLeu: 4.391 ± 0.398
SerMet: 1.479 ± 0.296
SerAsn: 3.047 ± 0.494
SerPro: 1.3 ± 0.237
SerGln: 1.389 ± 0.339
SerArg: 1.434 ± 0.315
SerSer: 3.271 ± 0.47
SerThr: 2.868 ± 0.388
SerVal: 2.733 ± 0.326
SerTrp: 0.986 ± 0.236
SerTyr: 2.599 ± 0.299
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.033 ± 0.8
ThrCys: 0.314 ± 0.13
ThrAsp: 3.943 ± 0.447
ThrGlu: 3.809 ± 0.449
ThrPhe: 3.271 ± 0.415
ThrGly: 4.705 ± 0.399
ThrHis: 0.851 ± 0.174
ThrIle: 4.347 ± 0.488
ThrLys: 4.616 ± 0.403
ThrLeu: 4.302 ± 0.578
ThrMet: 1.568 ± 0.26
ThrAsn: 3.585 ± 0.5
ThrPro: 2.151 ± 0.355
ThrGln: 1.972 ± 0.337
ThrArg: 2.061 ± 0.24
ThrSer: 3.316 ± 0.607
ThrThr: 4.123 ± 0.581
ThrVal: 4.078 ± 0.59
ThrTrp: 0.448 ± 0.11
ThrTyr: 2.509 ± 0.323
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.495 ± 0.436
ValCys: 0.583 ± 0.158
ValAsp: 4.84 ± 0.433
ValGlu: 5.153 ± 0.539
ValPhe: 3.002 ± 0.36
ValGly: 3.182 ± 0.41
ValHis: 0.986 ± 0.223
ValIle: 4.212 ± 0.471
ValLys: 7.304 ± 0.629
ValLeu: 4.257 ± 0.49
ValMet: 1.389 ± 0.368
ValAsn: 4.123 ± 0.502
ValPro: 2.106 ± 0.241
ValGln: 2.241 ± 0.357
ValArg: 2.42 ± 0.322
ValSer: 3.092 ± 0.316
ValThr: 4.436 ± 0.423
ValVal: 4.616 ± 0.537
ValTrp: 0.807 ± 0.213
ValTyr: 2.823 ± 0.42
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.269 ± 0.098
TrpCys: 0.179 ± 0.086
TrpAsp: 1.031 ± 0.225
TrpGlu: 1.075 ± 0.197
TrpPhe: 0.627 ± 0.196
TrpGly: 0.583 ± 0.149
TrpHis: 0.314 ± 0.114
TrpIle: 1.075 ± 0.26
TrpLys: 0.717 ± 0.153
TrpLeu: 0.762 ± 0.16
TrpMet: 0.314 ± 0.129
TrpAsn: 0.941 ± 0.193
TrpPro: 0.045 ± 0.039
TrpGln: 0.358 ± 0.154
TrpArg: 0.583 ± 0.178
TrpSer: 0.493 ± 0.141
TrpThr: 0.896 ± 0.26
TrpVal: 0.896 ± 0.152
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.717 ± 0.187
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.689 ± 0.325
TyrCys: 0.314 ± 0.124
TyrAsp: 3.809 ± 0.402
TyrGlu: 3.271 ± 0.447
TyrPhe: 1.613 ± 0.33
TyrGly: 2.689 ± 0.378
TyrHis: 0.896 ± 0.179
TyrIle: 2.644 ± 0.355
TyrLys: 3.406 ± 0.404
TyrLeu: 3.226 ± 0.406
TyrMet: 1.075 ± 0.209
TyrAsn: 3.271 ± 0.385
TyrPro: 0.941 ± 0.219
TyrGln: 1.389 ± 0.239
TyrArg: 1.882 ± 0.292
TyrSer: 2.913 ± 0.411
TyrThr: 2.285 ± 0.312
TyrVal: 2.509 ± 0.32
TyrTrp: 0.493 ± 0.146
TyrTyr: 2.554 ± 0.473
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 106 proteins (22317 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
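A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the Proteome-pI code: the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation across replicates as the reported error are all choices made for this example.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins, not individual residues, are resampled with
    replacement, so each replicate has as many proteins as the input.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome, not the phage sequences.
toy = ["MKKLAG", "MKEELLKK", "MAGKKE", "MLKEAG"]
mean, err = bootstrap_error(toy, "KK")
```

Resampling whole proteins keeps the dipeptides of one protein together, so the spread reflects variation between proteins rather than between individual positions.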

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski