Amino acid dipepetide frequency for Bacillus phage SerPounce

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.662AlaAla: 0.662 ± 0.254
0.927AlaCys: 0.927 ± 0.33
2.383AlaAsp: 2.383 ± 0.545
3.971AlaGlu: 3.971 ± 0.982
2.78AlaPhe: 2.78 ± 0.775
3.574AlaGly: 3.574 ± 0.81
0.0AlaHis: 0.0 ± 0.0
2.515AlaIle: 2.515 ± 0.551
3.839AlaLys: 3.839 ± 0.763
2.648AlaLeu: 2.648 ± 0.475
2.25AlaMet: 2.25 ± 0.543
2.515AlaAsn: 2.515 ± 0.451
1.191AlaPro: 1.191 ± 0.402
2.25AlaGln: 2.25 ± 0.545
1.853AlaArg: 1.853 ± 0.383
1.853AlaSer: 1.853 ± 0.508
2.648AlaThr: 2.648 ± 0.503
2.118AlaVal: 2.118 ± 0.573
0.662AlaTrp: 0.662 ± 0.297
1.986AlaTyr: 1.986 ± 0.426
0.0AlaXaa: 0.0 ± 0.0
Cys
0.53CysAla: 0.53 ± 0.258
0.0CysCys: 0.0 ± 0.0
1.721CysAsp: 1.721 ± 0.495
1.191CysGlu: 1.191 ± 0.316
0.397CysPhe: 0.397 ± 0.225
0.927CysGly: 0.927 ± 0.382
0.132CysHis: 0.132 ± 0.118
0.53CysIle: 0.53 ± 0.278
1.191CysLys: 1.191 ± 0.425
0.662CysLeu: 0.662 ± 0.273
0.397CysMet: 0.397 ± 0.216
0.132CysAsn: 0.132 ± 0.137
0.662CysPro: 0.662 ± 0.333
0.0CysGln: 0.0 ± 0.0
0.265CysArg: 0.265 ± 0.187
0.265CysSer: 0.265 ± 0.181
0.265CysThr: 0.265 ± 0.244
0.397CysVal: 0.397 ± 0.278
0.0CysTrp: 0.0 ± 0.0
0.662CysTyr: 0.662 ± 0.221
0.0CysXaa: 0.0 ± 0.0
Asp
2.78AspAla: 2.78 ± 0.471
1.456AspCys: 1.456 ± 0.571
3.971AspAsp: 3.971 ± 0.708
6.089AspGlu: 6.089 ± 0.885
3.971AspPhe: 3.971 ± 0.517
5.295AspGly: 5.295 ± 1.2
0.265AspHis: 0.265 ± 0.176
5.03AspIle: 5.03 ± 0.734
5.825AspLys: 5.825 ± 0.756
4.236AspLeu: 4.236 ± 0.696
2.118AspMet: 2.118 ± 0.512
3.574AspAsn: 3.574 ± 0.571
2.25AspPro: 2.25 ± 0.494
2.383AspGln: 2.383 ± 0.399
1.853AspArg: 1.853 ± 0.553
2.648AspSer: 2.648 ± 0.59
2.515AspThr: 2.515 ± 0.586
5.825AspVal: 5.825 ± 0.705
0.794AspTrp: 0.794 ± 0.298
2.515AspTyr: 2.515 ± 0.498
0.0AspXaa: 0.0 ± 0.0
Glu
3.442GluAla: 3.442 ± 0.804
0.53GluCys: 0.53 ± 0.235
4.104GluAsp: 4.104 ± 0.548
6.487GluGlu: 6.487 ± 1.233
3.971GluPhe: 3.971 ± 0.886
4.898GluGly: 4.898 ± 0.92
0.927GluHis: 0.927 ± 0.264
6.487GluIle: 6.487 ± 0.909
6.619GluLys: 6.619 ± 1.383
8.737GluLeu: 8.737 ± 1.319
3.177GluMet: 3.177 ± 0.799
5.295GluAsn: 5.295 ± 0.696
1.324GluPro: 1.324 ± 0.547
2.648GluGln: 2.648 ± 0.555
3.707GluArg: 3.707 ± 0.848
4.369GluSer: 4.369 ± 0.671
4.369GluThr: 4.369 ± 1.014
5.295GluVal: 5.295 ± 1.184
1.324GluTrp: 1.324 ± 0.366
4.898GluTyr: 4.898 ± 0.805
0.0GluXaa: 0.0 ± 0.0
Phe
1.853PheAla: 1.853 ± 0.401
0.397PheCys: 0.397 ± 0.22
3.574PheAsp: 3.574 ± 0.569
3.574PheGlu: 3.574 ± 0.507
1.324PhePhe: 1.324 ± 0.369
2.25PheGly: 2.25 ± 0.705
1.589PheHis: 1.589 ± 0.469
4.898PheIle: 4.898 ± 0.743
5.163PheLys: 5.163 ± 0.933
2.78PheLeu: 2.78 ± 0.586
1.721PheMet: 1.721 ± 0.467
3.177PheAsn: 3.177 ± 0.975
1.059PhePro: 1.059 ± 0.362
0.662PheGln: 0.662 ± 0.268
1.059PheArg: 1.059 ± 0.283
3.31PheSer: 3.31 ± 0.763
2.515PheThr: 2.515 ± 0.595
2.515PheVal: 2.515 ± 0.641
0.53PheTrp: 0.53 ± 0.226
2.383PheTyr: 2.383 ± 0.515
0.0PheXaa: 0.0 ± 0.0
Gly
3.045GlyAla: 3.045 ± 0.756
0.265GlyCys: 0.265 ± 0.174
2.912GlyAsp: 2.912 ± 0.765
4.766GlyGlu: 4.766 ± 0.68
3.707GlyPhe: 3.707 ± 0.638
3.839GlyGly: 3.839 ± 0.893
0.794GlyHis: 0.794 ± 0.321
5.295GlyIle: 5.295 ± 0.917
6.884GlyLys: 6.884 ± 1.25
4.633GlyLeu: 4.633 ± 0.814
2.648GlyMet: 2.648 ± 0.572
4.104GlyAsn: 4.104 ± 0.925
0.265GlyPro: 0.265 ± 0.152
2.118GlyGln: 2.118 ± 0.52
1.986GlyArg: 1.986 ± 0.423
3.839GlySer: 3.839 ± 0.84
5.56GlyThr: 5.56 ± 1.288
3.574GlyVal: 3.574 ± 0.717
0.53GlyTrp: 0.53 ± 0.248
3.045GlyTyr: 3.045 ± 0.721
0.0GlyXaa: 0.0 ± 0.0
His
0.662HisAla: 0.662 ± 0.275
0.132HisCys: 0.132 ± 0.119
1.059HisAsp: 1.059 ± 0.434
0.53HisGlu: 0.53 ± 0.302
0.927HisPhe: 0.927 ± 0.314
0.265HisGly: 0.265 ± 0.185
0.53HisHis: 0.53 ± 0.269
1.456HisIle: 1.456 ± 0.318
1.721HisLys: 1.721 ± 0.514
1.324HisLeu: 1.324 ± 0.444
0.132HisMet: 0.132 ± 0.131
0.794HisAsn: 0.794 ± 0.296
0.0HisPro: 0.0 ± 0.0
0.53HisGln: 0.53 ± 0.218
0.794HisArg: 0.794 ± 0.259
0.662HisSer: 0.662 ± 0.355
1.191HisThr: 1.191 ± 0.362
0.662HisVal: 0.662 ± 0.298
0.132HisTrp: 0.132 ± 0.121
1.456HisTyr: 1.456 ± 0.486
0.0HisXaa: 0.0 ± 0.0
Ile
3.045IleAla: 3.045 ± 0.533
0.662IleCys: 0.662 ± 0.271
6.354IleAsp: 6.354 ± 0.63
7.281IleGlu: 7.281 ± 1.356
2.383IlePhe: 2.383 ± 0.533
4.501IleGly: 4.501 ± 0.597
0.794IleHis: 0.794 ± 0.329
5.428IleIle: 5.428 ± 1.329
7.81IleLys: 7.81 ± 1.09
3.045IleLeu: 3.045 ± 0.645
2.648IleMet: 2.648 ± 0.607
5.56IleAsn: 5.56 ± 0.687
2.118IlePro: 2.118 ± 0.455
2.515IleGln: 2.515 ± 0.503
4.236IleArg: 4.236 ± 0.582
3.31IleSer: 3.31 ± 0.744
4.369IleThr: 4.369 ± 0.768
3.839IleVal: 3.839 ± 0.614
0.662IleTrp: 0.662 ± 0.293
2.912IleTyr: 2.912 ± 0.576
0.0IleXaa: 0.0 ± 0.0
Lys
4.369LysAla: 4.369 ± 0.77
0.662LysCys: 0.662 ± 0.269
6.751LysAsp: 6.751 ± 0.954
8.472LysGlu: 8.472 ± 1.395
3.707LysPhe: 3.707 ± 0.774
6.884LysGly: 6.884 ± 0.932
2.25LysHis: 2.25 ± 0.501
5.957LysIle: 5.957 ± 0.734
10.458LysLys: 10.458 ± 1.176
7.678LysLeu: 7.678 ± 1.079
4.104LysMet: 4.104 ± 0.738
5.163LysAsn: 5.163 ± 0.739
1.324LysPro: 1.324 ± 0.397
3.31LysGln: 3.31 ± 0.995
5.163LysArg: 5.163 ± 1.009
4.766LysSer: 4.766 ± 0.754
5.295LysThr: 5.295 ± 0.935
5.692LysVal: 5.692 ± 0.742
1.191LysTrp: 1.191 ± 0.474
4.369LysTyr: 4.369 ± 0.736
0.0LysXaa: 0.0 ± 0.0
Leu
2.383LeuAla: 2.383 ± 0.495
0.132LeuCys: 0.132 ± 0.154
5.163LeuAsp: 5.163 ± 0.684
6.354LeuGlu: 6.354 ± 1.102
2.118LeuPhe: 2.118 ± 0.534
3.839LeuGly: 3.839 ± 0.724
1.589LeuHis: 1.589 ± 0.375
4.898LeuIle: 4.898 ± 0.69
8.605LeuLys: 8.605 ± 1.425
4.766LeuLeu: 4.766 ± 0.675
2.118LeuMet: 2.118 ± 0.514
4.104LeuAsn: 4.104 ± 0.591
1.853LeuPro: 1.853 ± 0.373
3.442LeuGln: 3.442 ± 0.888
2.78LeuArg: 2.78 ± 0.549
3.839LeuSer: 3.839 ± 0.809
4.369LeuThr: 4.369 ± 0.805
4.236LeuVal: 4.236 ± 0.701
0.794LeuTrp: 0.794 ± 0.307
3.177LeuTyr: 3.177 ± 0.588
0.0LeuXaa: 0.0 ± 0.0
Met
0.794MetAla: 0.794 ± 0.307
0.53MetCys: 0.53 ± 0.228
1.986MetAsp: 1.986 ± 0.457
1.986MetGlu: 1.986 ± 0.622
2.118MetPhe: 2.118 ± 0.569
2.25MetGly: 2.25 ± 0.479
0.132MetHis: 0.132 ± 0.119
1.853MetIle: 1.853 ± 0.549
2.383MetLys: 2.383 ± 0.572
2.383MetLeu: 2.383 ± 0.563
1.324MetMet: 1.324 ± 0.61
3.839MetAsn: 3.839 ± 0.585
0.662MetPro: 0.662 ± 0.283
0.927MetGln: 0.927 ± 0.275
1.324MetArg: 1.324 ± 0.359
2.648MetSer: 2.648 ± 0.546
1.456MetThr: 1.456 ± 0.4
1.324MetVal: 1.324 ± 0.365
0.927MetTrp: 0.927 ± 0.317
1.853MetTyr: 1.853 ± 0.356
0.0MetXaa: 0.0 ± 0.0
Asn
2.25AsnAla: 2.25 ± 0.739
1.059AsnCys: 1.059 ± 0.417
4.633AsnAsp: 4.633 ± 0.642
7.016AsnGlu: 7.016 ± 0.617
2.78AsnPhe: 2.78 ± 0.6
4.501AsnGly: 4.501 ± 0.798
0.794AsnHis: 0.794 ± 0.342
4.104AsnIle: 4.104 ± 0.701
5.825AsnLys: 5.825 ± 0.978
4.501AsnLeu: 4.501 ± 0.682
1.191AsnMet: 1.191 ± 0.404
5.825AsnAsn: 5.825 ± 1.052
2.118AsnPro: 2.118 ± 0.581
3.177AsnGln: 3.177 ± 0.562
2.648AsnArg: 2.648 ± 0.753
3.707AsnSer: 3.707 ± 0.592
3.839AsnThr: 3.839 ± 0.959
5.163AsnVal: 5.163 ± 0.863
0.927AsnTrp: 0.927 ± 0.258
2.648AsnTyr: 2.648 ± 0.585
0.0AsnXaa: 0.0 ± 0.0
Pro
1.324ProAla: 1.324 ± 0.451
0.397ProCys: 0.397 ± 0.198
1.721ProAsp: 1.721 ± 0.477
1.721ProGlu: 1.721 ± 0.459
1.589ProPhe: 1.589 ± 0.441
0.53ProGly: 0.53 ± 0.229
0.265ProHis: 0.265 ± 0.177
1.456ProIle: 1.456 ± 0.416
1.853ProLys: 1.853 ± 0.481
1.059ProLeu: 1.059 ± 0.36
0.794ProMet: 0.794 ± 0.397
1.853ProAsn: 1.853 ± 0.474
0.53ProPro: 0.53 ± 0.2
0.397ProGln: 0.397 ± 0.181
0.662ProArg: 0.662 ± 0.306
1.456ProSer: 1.456 ± 0.457
1.986ProThr: 1.986 ± 0.438
1.456ProVal: 1.456 ± 0.357
0.397ProTrp: 0.397 ± 0.182
1.589ProTyr: 1.589 ± 0.467
0.0ProXaa: 0.0 ± 0.0
Gln
0.927GlnAla: 0.927 ± 0.32
0.265GlnCys: 0.265 ± 0.157
1.324GlnAsp: 1.324 ± 0.525
1.589GlnGlu: 1.589 ± 0.538
1.456GlnPhe: 1.456 ± 0.459
1.986GlnGly: 1.986 ± 0.567
0.265GlnHis: 0.265 ± 0.176
1.853GlnIle: 1.853 ± 0.589
3.31GlnLys: 3.31 ± 0.726
3.971GlnLeu: 3.971 ± 0.815
0.662GlnMet: 0.662 ± 0.273
3.177GlnAsn: 3.177 ± 0.525
0.794GlnPro: 0.794 ± 0.319
1.059GlnGln: 1.059 ± 0.395
1.721GlnArg: 1.721 ± 0.381
1.589GlnSer: 1.589 ± 0.42
2.118GlnThr: 2.118 ± 0.521
1.853GlnVal: 1.853 ± 0.423
0.265GlnTrp: 0.265 ± 0.171
2.912GlnTyr: 2.912 ± 0.584
0.0GlnXaa: 0.0 ± 0.0
Arg
3.442ArgAla: 3.442 ± 0.547
0.53ArgCys: 0.53 ± 0.21
2.383ArgAsp: 2.383 ± 0.611
3.31ArgGlu: 3.31 ± 0.703
3.045ArgPhe: 3.045 ± 0.565
2.383ArgGly: 2.383 ± 0.641
0.265ArgHis: 0.265 ± 0.163
2.515ArgIle: 2.515 ± 0.527
4.633ArgLys: 4.633 ± 0.765
2.118ArgLeu: 2.118 ± 0.569
1.191ArgMet: 1.191 ± 0.404
2.25ArgAsn: 2.25 ± 0.584
0.662ArgPro: 0.662 ± 0.303
1.456ArgGln: 1.456 ± 0.456
2.118ArgArg: 2.118 ± 0.4
1.589ArgSer: 1.589 ± 0.373
1.721ArgThr: 1.721 ± 0.555
2.25ArgVal: 2.25 ± 0.621
0.397ArgTrp: 0.397 ± 0.192
2.383ArgTyr: 2.383 ± 0.56
0.0ArgXaa: 0.0 ± 0.0
Ser
2.383SerAla: 2.383 ± 0.432
0.662SerCys: 0.662 ± 0.229
3.31SerAsp: 3.31 ± 0.631
3.707SerGlu: 3.707 ± 0.72
2.78SerPhe: 2.78 ± 0.698
2.78SerGly: 2.78 ± 0.634
1.059SerHis: 1.059 ± 0.296
4.236SerIle: 4.236 ± 0.808
4.766SerLys: 4.766 ± 0.694
3.707SerLeu: 3.707 ± 0.701
1.721SerMet: 1.721 ± 0.512
5.03SerAsn: 5.03 ± 0.797
0.662SerPro: 0.662 ± 0.223
1.853SerGln: 1.853 ± 0.544
2.25SerArg: 2.25 ± 0.487
2.78SerSer: 2.78 ± 0.634
2.383SerThr: 2.383 ± 0.582
3.442SerVal: 3.442 ± 0.627
0.397SerTrp: 0.397 ± 0.206
3.971SerTyr: 3.971 ± 0.688
0.0SerXaa: 0.0 ± 0.0
Thr
2.383ThrAla: 2.383 ± 0.54
0.397ThrCys: 0.397 ± 0.189
2.515ThrAsp: 2.515 ± 0.612
4.104ThrGlu: 4.104 ± 0.648
2.118ThrPhe: 2.118 ± 0.495
4.633ThrGly: 4.633 ± 1.169
1.191ThrHis: 1.191 ± 0.348
6.089ThrIle: 6.089 ± 0.625
5.163ThrLys: 5.163 ± 0.726
4.633ThrLeu: 4.633 ± 0.883
1.456ThrMet: 1.456 ± 0.378
3.442ThrAsn: 3.442 ± 0.75
1.721ThrPro: 1.721 ± 0.455
1.191ThrGln: 1.191 ± 0.369
2.118ThrArg: 2.118 ± 0.5
4.501ThrSer: 4.501 ± 0.891
3.839ThrThr: 3.839 ± 1.338
3.574ThrVal: 3.574 ± 0.715
0.397ThrTrp: 0.397 ± 0.169
1.456ThrTyr: 1.456 ± 0.389
0.0ThrXaa: 0.0 ± 0.0
Val
3.177ValAla: 3.177 ± 0.622
0.265ValCys: 0.265 ± 0.177
5.163ValAsp: 5.163 ± 0.901
5.56ValGlu: 5.56 ± 0.668
2.118ValPhe: 2.118 ± 0.564
4.633ValGly: 4.633 ± 0.614
1.059ValHis: 1.059 ± 0.396
4.501ValIle: 4.501 ± 0.82
5.957ValLys: 5.957 ± 0.746
3.442ValLeu: 3.442 ± 0.636
1.456ValMet: 1.456 ± 0.419
4.104ValAsn: 4.104 ± 0.699
2.25ValPro: 2.25 ± 0.665
1.456ValGln: 1.456 ± 0.452
2.25ValArg: 2.25 ± 0.503
3.839ValSer: 3.839 ± 0.559
3.31ValThr: 3.31 ± 0.705
3.707ValVal: 3.707 ± 0.79
0.927ValTrp: 0.927 ± 0.401
2.648ValTyr: 2.648 ± 0.754
0.0ValXaa: 0.0 ± 0.0
Trp
0.397TrpAla: 0.397 ± 0.21
0.132TrpCys: 0.132 ± 0.147
0.397TrpAsp: 0.397 ± 0.227
0.927TrpGlu: 0.927 ± 0.395
1.191TrpPhe: 1.191 ± 0.436
0.927TrpGly: 0.927 ± 0.332
0.53TrpHis: 0.53 ± 0.208
0.662TrpIle: 0.662 ± 0.25
1.324TrpLys: 1.324 ± 0.439
1.191TrpLeu: 1.191 ± 0.417
0.397TrpMet: 0.397 ± 0.206
1.191TrpAsn: 1.191 ± 0.313
0.0TrpPro: 0.0 ± 0.0
0.397TrpGln: 0.397 ± 0.202
0.53TrpArg: 0.53 ± 0.218
0.397TrpSer: 0.397 ± 0.18
0.265TrpThr: 0.265 ± 0.176
0.927TrpVal: 0.927 ± 0.265
0.265TrpTrp: 0.265 ± 0.261
0.397TrpTyr: 0.397 ± 0.233
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.78TyrAla: 2.78 ± 0.572
0.927TyrCys: 0.927 ± 0.316
3.971TyrAsp: 3.971 ± 0.595
3.971TyrGlu: 3.971 ± 0.63
1.853TyrPhe: 1.853 ± 0.34
2.912TyrGly: 2.912 ± 0.615
0.53TyrHis: 0.53 ± 0.278
3.707TyrIle: 3.707 ± 0.79
4.236TyrLys: 4.236 ± 0.721
3.045TyrLeu: 3.045 ± 0.699
1.191TyrMet: 1.191 ± 0.477
3.574TyrAsn: 3.574 ± 0.524
1.589TyrPro: 1.589 ± 0.388
1.324TyrGln: 1.324 ± 0.336
1.324TyrArg: 1.324 ± 0.325
2.515TyrSer: 2.515 ± 0.669
2.912TyrThr: 2.912 ± 0.485
3.971TyrVal: 3.971 ± 0.716
0.927TyrTrp: 0.927 ± 0.454
1.853TyrTyr: 1.853 ± 0.462
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (7555 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski