Amino acid dipepetide frequency for Bacillus phage BceA1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.58AlaAla: 5.58 ± 1.438
0.159AlaCys: 0.159 ± 0.113
3.587AlaAsp: 3.587 ± 0.549
5.421AlaGlu: 5.421 ± 0.616
2.232AlaPhe: 2.232 ± 0.366
3.587AlaGly: 3.587 ± 0.637
0.717AlaHis: 0.717 ± 0.248
4.385AlaIle: 4.385 ± 0.691
5.182AlaLys: 5.182 ± 0.69
5.261AlaLeu: 5.261 ± 0.759
1.913AlaMet: 1.913 ± 0.369
3.268AlaAsn: 3.268 ± 0.579
1.515AlaPro: 1.515 ± 0.263
2.232AlaGln: 2.232 ± 0.527
3.747AlaArg: 3.747 ± 0.482
3.268AlaSer: 3.268 ± 0.454
3.428AlaThr: 3.428 ± 0.581
3.747AlaVal: 3.747 ± 0.61
1.036AlaTrp: 1.036 ± 0.236
1.674AlaTyr: 1.674 ± 0.417
0.0AlaXaa: 0.0 ± 0.0
Cys
0.319CysAla: 0.319 ± 0.148
0.159CysCys: 0.159 ± 0.109
0.877CysAsp: 0.877 ± 0.322
0.717CysGlu: 0.717 ± 0.239
0.558CysPhe: 0.558 ± 0.196
0.558CysGly: 0.558 ± 0.202
0.319CysHis: 0.319 ± 0.22
0.239CysIle: 0.239 ± 0.141
0.399CysLys: 0.399 ± 0.17
0.319CysLeu: 0.319 ± 0.16
0.558CysMet: 0.558 ± 0.226
0.478CysAsn: 0.478 ± 0.231
0.159CysPro: 0.159 ± 0.129
0.239CysGln: 0.239 ± 0.127
0.797CysArg: 0.797 ± 0.243
0.399CysSer: 0.399 ± 0.159
0.478CysThr: 0.478 ± 0.171
0.319CysVal: 0.319 ± 0.163
0.0CysTrp: 0.0 ± 0.0
0.239CysTyr: 0.239 ± 0.159
0.0CysXaa: 0.0 ± 0.0
Asp
3.109AspAla: 3.109 ± 0.557
0.399AspCys: 0.399 ± 0.195
3.428AspAsp: 3.428 ± 0.547
5.501AspGlu: 5.501 ± 0.747
2.312AspPhe: 2.312 ± 0.455
5.261AspGly: 5.261 ± 0.615
1.036AspHis: 1.036 ± 0.247
4.544AspIle: 4.544 ± 0.491
4.225AspLys: 4.225 ± 0.55
4.305AspLeu: 4.305 ± 0.521
2.471AspMet: 2.471 ± 0.421
2.312AspAsn: 2.312 ± 0.429
1.834AspPro: 1.834 ± 0.276
1.594AspGln: 1.594 ± 0.431
2.232AspArg: 2.232 ± 0.359
2.232AspSer: 2.232 ± 0.356
3.428AspThr: 3.428 ± 0.499
3.747AspVal: 3.747 ± 0.533
1.276AspTrp: 1.276 ± 0.293
2.152AspTyr: 2.152 ± 0.405
0.0AspXaa: 0.0 ± 0.0
Glu
6.218GluAla: 6.218 ± 0.663
0.638GluCys: 0.638 ± 0.236
3.906GluAsp: 3.906 ± 0.476
7.095GluGlu: 7.095 ± 1.038
3.827GluPhe: 3.827 ± 0.653
4.385GluGly: 4.385 ± 0.512
1.834GluHis: 1.834 ± 0.43
6.298GluIle: 6.298 ± 0.835
8.211GluLys: 8.211 ± 0.762
7.095GluLeu: 7.095 ± 0.713
2.87GluMet: 2.87 ± 0.696
3.827GluAsn: 3.827 ± 0.436
1.993GluPro: 1.993 ± 0.513
3.587GluGln: 3.587 ± 0.65
4.305GluArg: 4.305 ± 0.554
4.624GluSer: 4.624 ± 0.627
4.145GluThr: 4.145 ± 0.587
5.022GluVal: 5.022 ± 0.587
1.674GluTrp: 1.674 ± 0.353
3.189GluTyr: 3.189 ± 0.572
0.0GluXaa: 0.0 ± 0.0
Phe
1.993PheAla: 1.993 ± 0.367
0.399PheCys: 0.399 ± 0.155
2.232PheAsp: 2.232 ± 0.456
3.109PheGlu: 3.109 ± 0.522
1.754PhePhe: 1.754 ± 0.438
4.066PheGly: 4.066 ± 0.56
0.558PheHis: 0.558 ± 0.206
3.906PheIle: 3.906 ± 0.492
4.464PheLys: 4.464 ± 0.548
3.029PheLeu: 3.029 ± 0.606
1.036PheMet: 1.036 ± 0.256
2.232PheAsn: 2.232 ± 0.374
0.877PhePro: 0.877 ± 0.281
1.594PheGln: 1.594 ± 0.35
1.913PheArg: 1.913 ± 0.389
2.392PheSer: 2.392 ± 0.44
2.551PheThr: 2.551 ± 0.469
2.551PheVal: 2.551 ± 0.424
0.638PheTrp: 0.638 ± 0.21
1.435PheTyr: 1.435 ± 0.306
0.0PheXaa: 0.0 ± 0.0
Gly
3.986GlyAla: 3.986 ± 0.89
0.08GlyCys: 0.08 ± 0.079
3.348GlyAsp: 3.348 ± 0.579
4.305GlyGlu: 4.305 ± 0.505
1.993GlyPhe: 1.993 ± 0.435
2.631GlyGly: 2.631 ± 0.458
0.717GlyHis: 0.717 ± 0.285
4.624GlyIle: 4.624 ± 0.744
6.457GlyLys: 6.457 ± 0.673
5.421GlyLeu: 5.421 ± 0.745
2.152GlyMet: 2.152 ± 0.346
2.392GlyAsn: 2.392 ± 0.362
0.797GlyPro: 0.797 ± 0.252
1.594GlyGln: 1.594 ± 0.335
2.79GlyArg: 2.79 ± 0.436
3.508GlySer: 3.508 ± 0.599
3.348GlyThr: 3.348 ± 0.44
4.624GlyVal: 4.624 ± 0.886
1.276GlyTrp: 1.276 ± 0.332
3.348GlyTyr: 3.348 ± 0.605
0.0GlyXaa: 0.0 ± 0.0
His
0.877HisAla: 0.877 ± 0.24
0.399HisCys: 0.399 ± 0.191
0.717HisAsp: 0.717 ± 0.191
1.036HisGlu: 1.036 ± 0.261
0.797HisPhe: 0.797 ± 0.276
1.036HisGly: 1.036 ± 0.273
0.239HisHis: 0.239 ± 0.146
0.717HisIle: 0.717 ± 0.192
1.913HisLys: 1.913 ± 0.408
1.674HisLeu: 1.674 ± 0.33
0.319HisMet: 0.319 ± 0.15
0.957HisAsn: 0.957 ± 0.331
0.0HisPro: 0.0 ± 0.0
0.877HisGln: 0.877 ± 0.294
0.478HisArg: 0.478 ± 0.213
1.594HisSer: 1.594 ± 0.41
1.435HisThr: 1.435 ± 0.377
1.355HisVal: 1.355 ± 0.261
0.159HisTrp: 0.159 ± 0.086
0.638HisTyr: 0.638 ± 0.19
0.0HisXaa: 0.0 ± 0.0
Ile
5.58IleAla: 5.58 ± 0.7
0.797IleCys: 0.797 ± 0.271
4.305IleAsp: 4.305 ± 0.573
6.298IleGlu: 6.298 ± 0.641
2.71IlePhe: 2.71 ± 0.441
3.428IleGly: 3.428 ± 0.498
1.276IleHis: 1.276 ± 0.368
5.022IleIle: 5.022 ± 0.67
7.015IleLys: 7.015 ± 0.693
4.544IleLeu: 4.544 ± 0.591
1.754IleMet: 1.754 ± 0.325
5.66IleAsn: 5.66 ± 0.86
2.392IlePro: 2.392 ± 0.434
3.348IleGln: 3.348 ± 0.584
2.79IleArg: 2.79 ± 0.406
4.544IleSer: 4.544 ± 0.531
4.145IleThr: 4.145 ± 0.531
4.066IleVal: 4.066 ± 0.84
1.036IleTrp: 1.036 ± 0.215
1.913IleTyr: 1.913 ± 0.298
0.0IleXaa: 0.0 ± 0.0
Lys
5.501LysAla: 5.501 ± 0.603
0.717LysCys: 0.717 ± 0.309
5.182LysAsp: 5.182 ± 0.727
10.364LysGlu: 10.364 ± 1.043
2.79LysPhe: 2.79 ± 0.405
4.225LysGly: 4.225 ± 0.728
1.594LysHis: 1.594 ± 0.261
6.218LysIle: 6.218 ± 0.628
8.371LysLys: 8.371 ± 1.023
7.175LysLeu: 7.175 ± 0.746
2.95LysMet: 2.95 ± 0.481
4.305LysAsn: 4.305 ± 0.562
3.029LysPro: 3.029 ± 0.403
5.501LysGln: 5.501 ± 0.608
4.943LysArg: 4.943 ± 0.726
4.943LysSer: 4.943 ± 0.777
5.58LysThr: 5.58 ± 0.746
6.537LysVal: 6.537 ± 0.697
1.036LysTrp: 1.036 ± 0.307
3.189LysTyr: 3.189 ± 0.565
0.0LysXaa: 0.0 ± 0.0
Leu
4.863LeuAla: 4.863 ± 0.685
0.957LeuCys: 0.957 ± 0.268
5.421LeuAsp: 5.421 ± 0.667
6.617LeuGlu: 6.617 ± 0.7
2.79LeuPhe: 2.79 ± 0.492
4.943LeuGly: 4.943 ± 0.743
1.754LeuHis: 1.754 ± 0.404
5.82LeuIle: 5.82 ± 0.654
7.972LeuLys: 7.972 ± 0.743
4.943LeuLeu: 4.943 ± 0.832
1.435LeuMet: 1.435 ± 0.334
4.544LeuAsn: 4.544 ± 0.549
2.471LeuPro: 2.471 ± 0.393
3.508LeuGln: 3.508 ± 0.459
3.029LeuArg: 3.029 ± 0.381
5.82LeuSer: 5.82 ± 0.598
3.827LeuThr: 3.827 ± 0.525
4.863LeuVal: 4.863 ± 0.58
0.717LeuTrp: 0.717 ± 0.234
2.71LeuTyr: 2.71 ± 0.466
0.0LeuXaa: 0.0 ± 0.0
Met
1.355MetAla: 1.355 ± 0.349
0.159MetCys: 0.159 ± 0.102
1.674MetAsp: 1.674 ± 0.504
2.232MetGlu: 2.232 ± 0.346
2.073MetPhe: 2.073 ± 0.458
1.515MetGly: 1.515 ± 0.319
0.319MetHis: 0.319 ± 0.199
1.913MetIle: 1.913 ± 0.382
4.544MetLys: 4.544 ± 0.622
2.312MetLeu: 2.312 ± 0.525
0.638MetMet: 0.638 ± 0.192
2.152MetAsn: 2.152 ± 0.407
0.797MetPro: 0.797 ± 0.254
1.036MetGln: 1.036 ± 0.259
1.196MetArg: 1.196 ± 0.32
1.594MetSer: 1.594 ± 0.383
1.993MetThr: 1.993 ± 0.343
1.435MetVal: 1.435 ± 0.31
0.319MetTrp: 0.319 ± 0.143
0.717MetTyr: 0.717 ± 0.277
0.0MetXaa: 0.0 ± 0.0
Asn
3.268AsnAla: 3.268 ± 0.644
0.319AsnCys: 0.319 ± 0.206
2.87AsnAsp: 2.87 ± 0.457
5.022AsnGlu: 5.022 ± 0.637
1.674AsnPhe: 1.674 ± 0.486
4.385AsnGly: 4.385 ± 0.702
1.036AsnHis: 1.036 ± 0.261
3.189AsnIle: 3.189 ± 0.455
4.783AsnLys: 4.783 ± 0.578
4.863AsnLeu: 4.863 ± 0.818
1.276AsnMet: 1.276 ± 0.33
2.392AsnAsn: 2.392 ± 0.47
1.834AsnPro: 1.834 ± 0.339
1.674AsnGln: 1.674 ± 0.299
2.551AsnArg: 2.551 ± 0.547
2.073AsnSer: 2.073 ± 0.359
3.189AsnThr: 3.189 ± 0.533
3.428AsnVal: 3.428 ± 0.424
0.717AsnTrp: 0.717 ± 0.217
1.913AsnTyr: 1.913 ± 0.391
0.0AsnXaa: 0.0 ± 0.0
Pro
1.594ProAla: 1.594 ± 0.318
0.239ProCys: 0.239 ± 0.131
1.515ProAsp: 1.515 ± 0.4
1.834ProGlu: 1.834 ± 0.479
1.355ProPhe: 1.355 ± 0.318
1.116ProGly: 1.116 ± 0.316
0.797ProHis: 0.797 ± 0.218
2.232ProIle: 2.232 ± 0.517
2.392ProLys: 2.392 ± 0.425
2.232ProLeu: 2.232 ± 0.429
0.797ProMet: 0.797 ± 0.224
1.674ProAsn: 1.674 ± 0.337
0.957ProPro: 0.957 ± 0.301
0.957ProGln: 0.957 ± 0.267
0.399ProArg: 0.399 ± 0.154
1.834ProSer: 1.834 ± 0.315
2.392ProThr: 2.392 ± 0.445
2.073ProVal: 2.073 ± 0.418
0.399ProTrp: 0.399 ± 0.151
1.594ProTyr: 1.594 ± 0.369
0.0ProXaa: 0.0 ± 0.0
Gln
2.71GlnAla: 2.71 ± 0.432
0.08GlnCys: 0.08 ± 0.077
1.515GlnAsp: 1.515 ± 0.347
3.827GlnGlu: 3.827 ± 0.556
1.913GlnPhe: 1.913 ± 0.455
2.551GlnGly: 2.551 ± 0.378
0.717GlnHis: 0.717 ± 0.453
2.95GlnIle: 2.95 ± 0.458
3.827GlnLys: 3.827 ± 0.428
3.189GlnLeu: 3.189 ± 0.529
1.515GlnMet: 1.515 ± 0.37
1.993GlnAsn: 1.993 ± 0.483
1.355GlnPro: 1.355 ± 0.36
2.471GlnGln: 2.471 ± 0.486
1.913GlnArg: 1.913 ± 0.3
2.073GlnSer: 2.073 ± 0.342
2.152GlnThr: 2.152 ± 0.476
1.913GlnVal: 1.913 ± 0.431
0.399GlnTrp: 0.399 ± 0.16
1.913GlnTyr: 1.913 ± 0.316
0.0GlnXaa: 0.0 ± 0.0
Arg
2.073ArgAla: 2.073 ± 0.468
0.558ArgCys: 0.558 ± 0.216
2.71ArgAsp: 2.71 ± 0.521
3.189ArgGlu: 3.189 ± 0.505
2.392ArgPhe: 2.392 ± 0.515
3.109ArgGly: 3.109 ± 0.433
1.036ArgHis: 1.036 ± 0.277
3.428ArgIle: 3.428 ± 0.592
4.385ArgLys: 4.385 ± 0.69
4.145ArgLeu: 4.145 ± 0.488
1.674ArgMet: 1.674 ± 0.369
2.152ArgAsn: 2.152 ± 0.439
1.036ArgPro: 1.036 ± 0.233
1.435ArgGln: 1.435 ± 0.3
1.674ArgArg: 1.674 ± 0.351
1.834ArgSer: 1.834 ± 0.309
2.073ArgThr: 2.073 ± 0.362
3.189ArgVal: 3.189 ± 0.578
0.478ArgTrp: 0.478 ± 0.174
1.834ArgTyr: 1.834 ± 0.443
0.0ArgXaa: 0.0 ± 0.0
Ser
2.95SerAla: 2.95 ± 0.606
0.399SerCys: 0.399 ± 0.167
4.066SerAsp: 4.066 ± 0.473
3.906SerGlu: 3.906 ± 0.487
3.428SerPhe: 3.428 ± 0.486
2.471SerGly: 2.471 ± 0.493
0.558SerHis: 0.558 ± 0.273
4.943SerIle: 4.943 ± 0.726
4.783SerLys: 4.783 ± 0.554
4.225SerLeu: 4.225 ± 0.499
1.834SerMet: 1.834 ± 0.326
3.587SerAsn: 3.587 ± 0.502
1.594SerPro: 1.594 ± 0.296
2.152SerGln: 2.152 ± 0.453
2.312SerArg: 2.312 ± 0.395
3.109SerSer: 3.109 ± 0.62
2.152SerThr: 2.152 ± 0.491
3.906SerVal: 3.906 ± 0.723
0.239SerTrp: 0.239 ± 0.141
2.073SerTyr: 2.073 ± 0.451
0.0SerXaa: 0.0 ± 0.0
Thr
3.268ThrAla: 3.268 ± 0.447
0.239ThrCys: 0.239 ± 0.129
2.87ThrAsp: 2.87 ± 0.573
4.145ThrGlu: 4.145 ± 0.611
2.95ThrPhe: 2.95 ± 0.336
3.348ThrGly: 3.348 ± 0.532
0.877ThrHis: 0.877 ± 0.317
4.624ThrIle: 4.624 ± 0.638
5.979ThrLys: 5.979 ± 0.728
5.102ThrLeu: 5.102 ± 0.765
1.515ThrMet: 1.515 ± 0.349
2.71ThrAsn: 2.71 ± 0.526
1.993ThrPro: 1.993 ± 0.433
2.073ThrGln: 2.073 ± 0.349
1.993ThrArg: 1.993 ± 0.379
2.073ThrSer: 2.073 ± 0.422
2.95ThrThr: 2.95 ± 0.535
4.385ThrVal: 4.385 ± 0.595
0.717ThrTrp: 0.717 ± 0.234
2.073ThrTyr: 2.073 ± 0.438
0.0ThrXaa: 0.0 ± 0.0
Val
4.066ValAla: 4.066 ± 0.582
0.558ValCys: 0.558 ± 0.199
4.066ValAsp: 4.066 ± 0.541
6.059ValGlu: 6.059 ± 0.766
2.551ValPhe: 2.551 ± 0.584
3.667ValGly: 3.667 ± 0.666
0.957ValHis: 0.957 ± 0.27
4.145ValIle: 4.145 ± 0.514
5.261ValLys: 5.261 ± 0.62
4.305ValLeu: 4.305 ± 0.546
1.674ValMet: 1.674 ± 0.297
3.268ValAsn: 3.268 ± 0.397
2.152ValPro: 2.152 ± 0.349
3.029ValGln: 3.029 ± 0.415
3.029ValArg: 3.029 ± 0.554
4.385ValSer: 4.385 ± 0.632
3.906ValThr: 3.906 ± 0.748
3.906ValVal: 3.906 ± 0.689
0.797ValTrp: 0.797 ± 0.257
2.79ValTyr: 2.79 ± 0.474
0.0ValXaa: 0.0 ± 0.0
Trp
0.797TrpAla: 0.797 ± 0.213
0.319TrpCys: 0.319 ± 0.146
1.276TrpAsp: 1.276 ± 0.325
1.515TrpGlu: 1.515 ± 0.335
0.638TrpPhe: 0.638 ± 0.228
0.877TrpGly: 0.877 ± 0.251
0.239TrpHis: 0.239 ± 0.173
0.797TrpIle: 0.797 ± 0.23
1.116TrpLys: 1.116 ± 0.218
1.196TrpLeu: 1.196 ± 0.272
0.319TrpMet: 0.319 ± 0.165
0.797TrpAsn: 0.797 ± 0.207
0.319TrpPro: 0.319 ± 0.169
0.717TrpGln: 0.717 ± 0.233
0.558TrpArg: 0.558 ± 0.218
0.558TrpSer: 0.558 ± 0.178
0.638TrpThr: 0.638 ± 0.185
0.638TrpVal: 0.638 ± 0.232
0.08TrpTrp: 0.08 ± 0.072
0.319TrpTyr: 0.319 ± 0.137
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.834TyrAla: 1.834 ± 0.385
0.558TyrCys: 0.558 ± 0.182
1.834TyrAsp: 1.834 ± 0.438
2.631TyrGlu: 2.631 ± 0.422
1.993TyrPhe: 1.993 ± 0.486
2.232TyrGly: 2.232 ± 0.411
0.638TyrHis: 0.638 ± 0.205
2.631TyrIle: 2.631 ± 0.48
2.87TyrLys: 2.87 ± 0.462
3.667TyrLeu: 3.667 ± 0.739
1.196TyrMet: 1.196 ± 0.274
1.754TyrAsn: 1.754 ± 0.371
1.276TyrPro: 1.276 ± 0.325
1.355TyrGln: 1.355 ± 0.299
1.754TyrArg: 1.754 ± 0.403
1.834TyrSer: 1.834 ± 0.371
2.073TyrThr: 2.073 ± 0.435
2.87TyrVal: 2.87 ± 0.516
0.717TyrTrp: 0.717 ± 0.202
1.913TyrTyr: 1.913 ± 0.493
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (12545 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski