Amino acid dipepetide frequency for Bacillus phage vB_BhaS-171

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.159AlaAla: 4.159 ± 0.946
0.34AlaCys: 0.34 ± 0.182
3.48AlaAsp: 3.48 ± 0.66
4.838AlaGlu: 4.838 ± 0.57
3.226AlaPhe: 3.226 ± 0.788
3.565AlaGly: 3.565 ± 0.675
1.358AlaHis: 1.358 ± 0.369
3.989AlaIle: 3.989 ± 0.631
5.348AlaLys: 5.348 ± 0.757
6.112AlaLeu: 6.112 ± 0.924
1.613AlaMet: 1.613 ± 0.345
3.31AlaAsn: 3.31 ± 0.58
1.358AlaPro: 1.358 ± 0.324
2.122AlaGln: 2.122 ± 0.396
2.377AlaArg: 2.377 ± 0.397
4.244AlaSer: 4.244 ± 0.634
2.886AlaThr: 2.886 ± 0.483
3.226AlaVal: 3.226 ± 0.634
1.019AlaTrp: 1.019 ± 0.252
1.613AlaTyr: 1.613 ± 0.381
0.0AlaXaa: 0.0 ± 0.0
Cys
0.34CysAla: 0.34 ± 0.169
0.0CysCys: 0.0 ± 0.0
0.509CysAsp: 0.509 ± 0.204
0.764CysGlu: 0.764 ± 0.297
0.085CysPhe: 0.085 ± 0.078
0.934CysGly: 0.934 ± 0.355
0.085CysHis: 0.085 ± 0.078
0.255CysIle: 0.255 ± 0.124
0.424CysLys: 0.424 ± 0.159
0.594CysLeu: 0.594 ± 0.249
0.0CysMet: 0.0 ± 0.0
0.255CysAsn: 0.255 ± 0.156
0.424CysPro: 0.424 ± 0.178
0.594CysGln: 0.594 ± 0.222
0.424CysArg: 0.424 ± 0.231
0.424CysSer: 0.424 ± 0.199
0.255CysThr: 0.255 ± 0.136
0.34CysVal: 0.34 ± 0.172
0.0CysTrp: 0.0 ± 0.0
0.255CysTyr: 0.255 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
4.159AspAla: 4.159 ± 0.601
0.17AspCys: 0.17 ± 0.125
3.735AspAsp: 3.735 ± 0.533
4.329AspGlu: 4.329 ± 0.584
3.31AspPhe: 3.31 ± 0.575
5.857AspGly: 5.857 ± 0.753
1.103AspHis: 1.103 ± 0.325
3.735AspIle: 3.735 ± 0.511
4.753AspLys: 4.753 ± 0.592
4.838AspLeu: 4.838 ± 0.705
1.613AspMet: 1.613 ± 0.339
3.65AspAsn: 3.65 ± 0.573
1.952AspPro: 1.952 ± 0.331
1.188AspGln: 1.188 ± 0.284
2.377AspArg: 2.377 ± 0.543
3.48AspSer: 3.48 ± 0.619
3.226AspThr: 3.226 ± 0.472
3.48AspVal: 3.48 ± 0.516
0.679AspTrp: 0.679 ± 0.246
2.546AspTyr: 2.546 ± 0.39
0.0AspXaa: 0.0 ± 0.0
Glu
3.565GluAla: 3.565 ± 0.595
0.679GluCys: 0.679 ± 0.295
3.565GluAsp: 3.565 ± 0.617
7.639GluGlu: 7.639 ± 1.31
2.631GluPhe: 2.631 ± 0.419
5.687GluGly: 5.687 ± 0.629
1.188GluHis: 1.188 ± 0.299
7.215GluIle: 7.215 ± 0.741
7.385GluLys: 7.385 ± 0.859
7.3GluLeu: 7.3 ± 0.815
2.546GluMet: 2.546 ± 0.477
4.923GluAsn: 4.923 ± 0.682
1.528GluPro: 1.528 ± 0.467
3.141GluGln: 3.141 ± 0.55
4.244GluArg: 4.244 ± 0.568
3.48GluSer: 3.48 ± 0.593
4.074GluThr: 4.074 ± 0.616
6.281GluVal: 6.281 ± 0.738
1.273GluTrp: 1.273 ± 0.421
3.226GluTyr: 3.226 ± 0.523
0.0GluXaa: 0.0 ± 0.0
Phe
2.546PheAla: 2.546 ± 0.463
0.255PheCys: 0.255 ± 0.133
2.292PheAsp: 2.292 ± 0.492
3.565PheGlu: 3.565 ± 0.476
1.867PhePhe: 1.867 ± 0.451
2.801PheGly: 2.801 ± 0.579
0.594PheHis: 0.594 ± 0.188
3.82PheIle: 3.82 ± 0.707
3.905PheLys: 3.905 ± 0.507
3.82PheLeu: 3.82 ± 0.548
1.103PheMet: 1.103 ± 0.344
2.462PheAsn: 2.462 ± 0.42
1.783PhePro: 1.783 ± 0.431
1.698PheGln: 1.698 ± 0.45
1.698PheArg: 1.698 ± 0.398
2.631PheSer: 2.631 ± 0.363
2.886PheThr: 2.886 ± 0.476
1.443PheVal: 1.443 ± 0.379
0.509PheTrp: 0.509 ± 0.249
1.358PheTyr: 1.358 ± 0.42
0.0PheXaa: 0.0 ± 0.0
Gly
5.432GlyAla: 5.432 ± 0.938
0.509GlyCys: 0.509 ± 0.209
3.82GlyAsp: 3.82 ± 0.557
5.093GlyGlu: 5.093 ± 0.51
3.82GlyPhe: 3.82 ± 0.546
5.432GlyGly: 5.432 ± 0.889
1.613GlyHis: 1.613 ± 0.543
3.82GlyIle: 3.82 ± 0.653
5.432GlyLys: 5.432 ± 0.654
5.178GlyLeu: 5.178 ± 0.695
1.783GlyMet: 1.783 ± 0.402
3.565GlyAsn: 3.565 ± 0.674
0.594GlyPro: 0.594 ± 0.208
2.292GlyGln: 2.292 ± 0.492
1.952GlyArg: 1.952 ± 0.358
3.141GlySer: 3.141 ± 0.435
4.074GlyThr: 4.074 ± 0.818
5.093GlyVal: 5.093 ± 0.986
0.849GlyTrp: 0.849 ± 0.201
2.716GlyTyr: 2.716 ± 0.482
0.0GlyXaa: 0.0 ± 0.0
His
0.934HisAla: 0.934 ± 0.288
0.0HisCys: 0.0 ± 0.0
1.273HisAsp: 1.273 ± 0.353
1.528HisGlu: 1.528 ± 0.411
1.103HisPhe: 1.103 ± 0.299
1.698HisGly: 1.698 ± 0.639
0.509HisHis: 0.509 ± 0.287
1.188HisIle: 1.188 ± 0.358
1.273HisLys: 1.273 ± 0.379
1.358HisLeu: 1.358 ± 0.372
0.17HisMet: 0.17 ± 0.113
1.188HisAsn: 1.188 ± 0.325
0.849HisPro: 0.849 ± 0.26
0.424HisGln: 0.424 ± 0.193
0.594HisArg: 0.594 ± 0.195
1.613HisSer: 1.613 ± 0.36
1.103HisThr: 1.103 ± 0.393
1.188HisVal: 1.188 ± 0.282
0.17HisTrp: 0.17 ± 0.112
1.443HisTyr: 1.443 ± 0.455
0.0HisXaa: 0.0 ± 0.0
Ile
4.074IleAla: 4.074 ± 0.572
0.764IleCys: 0.764 ± 0.212
5.178IleAsp: 5.178 ± 0.652
5.942IleGlu: 5.942 ± 0.582
1.613IlePhe: 1.613 ± 0.273
4.669IleGly: 4.669 ± 0.703
1.783IleHis: 1.783 ± 0.383
4.329IleIle: 4.329 ± 0.599
4.923IleLys: 4.923 ± 0.651
4.923IleLeu: 4.923 ± 0.644
1.698IleMet: 1.698 ± 0.332
4.329IleAsn: 4.329 ± 0.553
3.141IlePro: 3.141 ± 0.422
3.141IleGln: 3.141 ± 0.527
3.48IleArg: 3.48 ± 0.595
4.923IleSer: 4.923 ± 0.666
3.565IleThr: 3.565 ± 0.555
4.414IleVal: 4.414 ± 0.472
1.019IleTrp: 1.019 ± 0.288
1.783IleTyr: 1.783 ± 0.357
0.0IleXaa: 0.0 ± 0.0
Lys
5.687LysAla: 5.687 ± 0.704
0.17LysCys: 0.17 ± 0.13
5.687LysAsp: 5.687 ± 0.601
8.913LysGlu: 8.913 ± 1.217
2.716LysPhe: 2.716 ± 0.498
5.263LysGly: 5.263 ± 0.741
1.783LysHis: 1.783 ± 0.46
4.414LysIle: 4.414 ± 0.685
8.318LysLys: 8.318 ± 1.351
7.639LysLeu: 7.639 ± 0.649
2.631LysMet: 2.631 ± 0.659
5.008LysAsn: 5.008 ± 0.745
2.462LysPro: 2.462 ± 0.444
3.141LysGln: 3.141 ± 0.616
4.329LysArg: 4.329 ± 0.765
4.074LysSer: 4.074 ± 0.529
4.669LysThr: 4.669 ± 0.612
6.281LysVal: 6.281 ± 0.671
1.443LysTrp: 1.443 ± 0.333
2.886LysTyr: 2.886 ± 0.516
0.0LysXaa: 0.0 ± 0.0
Leu
4.923LeuAla: 4.923 ± 0.512
0.764LeuCys: 0.764 ± 0.251
5.432LeuAsp: 5.432 ± 0.588
7.13LeuGlu: 7.13 ± 0.84
3.82LeuPhe: 3.82 ± 0.641
4.838LeuGly: 4.838 ± 0.567
1.613LeuHis: 1.613 ± 0.444
6.027LeuIle: 6.027 ± 0.762
7.639LeuLys: 7.639 ± 1.226
7.215LeuLeu: 7.215 ± 0.903
2.462LeuMet: 2.462 ± 0.415
4.753LeuAsn: 4.753 ± 0.705
2.207LeuPro: 2.207 ± 0.344
3.48LeuGln: 3.48 ± 0.473
4.159LeuArg: 4.159 ± 0.635
4.584LeuSer: 4.584 ± 0.623
5.093LeuThr: 5.093 ± 0.63
3.65LeuVal: 3.65 ± 0.617
0.849LeuTrp: 0.849 ± 0.266
2.631LeuTyr: 2.631 ± 0.411
0.0LeuXaa: 0.0 ± 0.0
Met
2.037MetAla: 2.037 ± 0.411
0.17MetCys: 0.17 ± 0.103
1.188MetAsp: 1.188 ± 0.422
2.122MetGlu: 2.122 ± 0.426
1.188MetPhe: 1.188 ± 0.288
1.528MetGly: 1.528 ± 0.356
0.255MetHis: 0.255 ± 0.152
1.698MetIle: 1.698 ± 0.398
2.971MetLys: 2.971 ± 0.486
1.867MetLeu: 1.867 ± 0.374
0.849MetMet: 0.849 ± 0.342
2.207MetAsn: 2.207 ± 0.464
0.764MetPro: 0.764 ± 0.276
1.019MetGln: 1.019 ± 0.344
1.613MetArg: 1.613 ± 0.383
1.698MetSer: 1.698 ± 0.36
1.613MetThr: 1.613 ± 0.287
1.273MetVal: 1.273 ± 0.44
0.255MetTrp: 0.255 ± 0.164
0.679MetTyr: 0.679 ± 0.21
0.0MetXaa: 0.0 ± 0.0
Asn
4.074AsnAla: 4.074 ± 0.682
0.594AsnCys: 0.594 ± 0.291
3.905AsnAsp: 3.905 ± 0.743
4.074AsnGlu: 4.074 ± 0.608
1.952AsnPhe: 1.952 ± 0.375
3.65AsnGly: 3.65 ± 0.686
1.019AsnHis: 1.019 ± 0.312
4.669AsnIle: 4.669 ± 0.765
4.923AsnLys: 4.923 ± 0.54
4.329AsnLeu: 4.329 ± 0.533
1.528AsnMet: 1.528 ± 0.394
1.867AsnAsn: 1.867 ± 0.656
2.037AsnPro: 2.037 ± 0.486
2.207AsnGln: 2.207 ± 0.488
2.801AsnArg: 2.801 ± 0.45
3.056AsnSer: 3.056 ± 0.501
2.971AsnThr: 2.971 ± 0.546
3.905AsnVal: 3.905 ± 0.465
0.34AsnTrp: 0.34 ± 0.141
2.122AsnTyr: 2.122 ± 0.503
0.0AsnXaa: 0.0 ± 0.0
Pro
1.103ProAla: 1.103 ± 0.275
0.34ProCys: 0.34 ± 0.158
1.273ProAsp: 1.273 ± 0.315
2.037ProGlu: 2.037 ± 0.37
1.443ProPhe: 1.443 ± 0.468
1.273ProGly: 1.273 ± 0.403
0.679ProHis: 0.679 ± 0.257
1.698ProIle: 1.698 ± 0.448
3.226ProLys: 3.226 ± 0.635
3.565ProLeu: 3.565 ± 0.551
0.849ProMet: 0.849 ± 0.265
1.698ProAsn: 1.698 ± 0.365
0.764ProPro: 0.764 ± 0.233
1.103ProGln: 1.103 ± 0.308
1.019ProArg: 1.019 ± 0.322
2.292ProSer: 2.292 ± 0.42
1.952ProThr: 1.952 ± 0.475
2.207ProVal: 2.207 ± 0.515
0.255ProTrp: 0.255 ± 0.18
1.103ProTyr: 1.103 ± 0.262
0.0ProXaa: 0.0 ± 0.0
Gln
2.716GlnAla: 2.716 ± 0.63
0.34GlnCys: 0.34 ± 0.153
1.613GlnAsp: 1.613 ± 0.373
2.631GlnGlu: 2.631 ± 0.535
1.867GlnPhe: 1.867 ± 0.337
1.613GlnGly: 1.613 ± 0.385
0.509GlnHis: 0.509 ± 0.184
2.546GlnIle: 2.546 ± 0.496
2.801GlnLys: 2.801 ± 0.464
3.31GlnLeu: 3.31 ± 0.472
0.934GlnMet: 0.934 ± 0.271
1.528GlnAsn: 1.528 ± 0.333
1.698GlnPro: 1.698 ± 0.327
2.801GlnGln: 2.801 ± 0.616
1.698GlnArg: 1.698 ± 0.384
2.801GlnSer: 2.801 ± 0.41
2.292GlnThr: 2.292 ± 0.444
2.377GlnVal: 2.377 ± 0.404
0.424GlnTrp: 0.424 ± 0.158
1.358GlnTyr: 1.358 ± 0.454
0.0GlnXaa: 0.0 ± 0.0
Arg
2.377ArgAla: 2.377 ± 0.509
0.424ArgCys: 0.424 ± 0.241
2.462ArgAsp: 2.462 ± 0.485
2.886ArgGlu: 2.886 ± 0.543
1.698ArgPhe: 1.698 ± 0.334
2.716ArgGly: 2.716 ± 0.53
0.849ArgHis: 0.849 ± 0.235
4.414ArgIle: 4.414 ± 0.809
3.48ArgLys: 3.48 ± 0.46
4.669ArgLeu: 4.669 ± 0.799
1.613ArgMet: 1.613 ± 0.305
2.886ArgAsn: 2.886 ± 0.61
1.019ArgPro: 1.019 ± 0.32
1.273ArgGln: 1.273 ± 0.267
2.207ArgArg: 2.207 ± 0.583
2.292ArgSer: 2.292 ± 0.413
2.207ArgThr: 2.207 ± 0.407
2.292ArgVal: 2.292 ± 0.394
0.34ArgTrp: 0.34 ± 0.168
2.292ArgTyr: 2.292 ± 0.452
0.0ArgXaa: 0.0 ± 0.0
Ser
2.886SerAla: 2.886 ± 0.654
0.255SerCys: 0.255 ± 0.228
3.31SerAsp: 3.31 ± 0.461
4.244SerGlu: 4.244 ± 0.673
3.31SerPhe: 3.31 ± 0.58
5.008SerGly: 5.008 ± 0.771
1.188SerHis: 1.188 ± 0.392
4.669SerIle: 4.669 ± 0.614
4.923SerLys: 4.923 ± 0.779
4.159SerLeu: 4.159 ± 0.615
1.613SerMet: 1.613 ± 0.437
2.971SerAsn: 2.971 ± 0.542
1.273SerPro: 1.273 ± 0.389
2.377SerGln: 2.377 ± 0.444
2.801SerArg: 2.801 ± 0.527
3.735SerSer: 3.735 ± 0.493
2.886SerThr: 2.886 ± 0.56
3.82SerVal: 3.82 ± 0.499
0.679SerTrp: 0.679 ± 0.231
2.716SerTyr: 2.716 ± 0.456
0.0SerXaa: 0.0 ± 0.0
Thr
3.65ThrAla: 3.65 ± 0.532
0.424ThrCys: 0.424 ± 0.165
3.395ThrAsp: 3.395 ± 0.442
3.735ThrGlu: 3.735 ± 0.517
2.292ThrPhe: 2.292 ± 0.413
3.48ThrGly: 3.48 ± 0.606
1.019ThrHis: 1.019 ± 0.306
3.565ThrIle: 3.565 ± 0.494
5.348ThrLys: 5.348 ± 0.656
5.008ThrLeu: 5.008 ± 0.711
0.934ThrMet: 0.934 ± 0.337
2.207ThrAsn: 2.207 ± 0.456
2.801ThrPro: 2.801 ± 0.523
2.292ThrGln: 2.292 ± 0.384
1.613ThrArg: 1.613 ± 0.395
3.82ThrSer: 3.82 ± 0.623
3.565ThrThr: 3.565 ± 0.691
4.669ThrVal: 4.669 ± 0.553
0.849ThrTrp: 0.849 ± 0.291
1.358ThrTyr: 1.358 ± 0.406
0.0ThrXaa: 0.0 ± 0.0
Val
3.65ValAla: 3.65 ± 0.579
0.255ValCys: 0.255 ± 0.148
4.414ValAsp: 4.414 ± 0.649
4.584ValGlu: 4.584 ± 0.551
2.716ValPhe: 2.716 ± 0.485
3.31ValGly: 3.31 ± 0.51
1.103ValHis: 1.103 ± 0.326
3.989ValIle: 3.989 ± 0.623
5.687ValLys: 5.687 ± 0.689
4.414ValLeu: 4.414 ± 0.524
1.698ValMet: 1.698 ± 0.415
4.159ValAsn: 4.159 ± 0.703
2.037ValPro: 2.037 ± 0.44
2.037ValGln: 2.037 ± 0.38
2.971ValArg: 2.971 ± 0.388
4.329ValSer: 4.329 ± 0.878
3.905ValThr: 3.905 ± 0.712
3.735ValVal: 3.735 ± 0.617
0.679ValTrp: 0.679 ± 0.215
2.546ValTyr: 2.546 ± 0.474
0.0ValXaa: 0.0 ± 0.0
Trp
0.34TrpAla: 0.34 ± 0.175
0.085TrpCys: 0.085 ± 0.074
1.188TrpAsp: 1.188 ± 0.345
1.528TrpGlu: 1.528 ± 0.363
0.424TrpPhe: 0.424 ± 0.187
1.103TrpGly: 1.103 ± 0.313
0.255TrpHis: 0.255 ± 0.125
0.764TrpIle: 0.764 ± 0.246
1.358TrpLys: 1.358 ± 0.359
0.679TrpLeu: 0.679 ± 0.279
0.34TrpMet: 0.34 ± 0.184
0.594TrpAsn: 0.594 ± 0.196
0.085TrpPro: 0.085 ± 0.082
0.509TrpGln: 0.509 ± 0.19
0.34TrpArg: 0.34 ± 0.16
0.594TrpSer: 0.594 ± 0.179
0.849TrpThr: 0.849 ± 0.298
0.509TrpVal: 0.509 ± 0.185
0.17TrpTrp: 0.17 ± 0.121
0.679TrpTyr: 0.679 ± 0.258
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.528TyrAla: 1.528 ± 0.307
0.424TyrCys: 0.424 ± 0.209
2.462TyrAsp: 2.462 ± 0.446
3.989TyrGlu: 3.989 ± 0.71
1.952TyrPhe: 1.952 ± 0.442
1.783TyrGly: 1.783 ± 0.335
1.103TyrHis: 1.103 ± 0.371
2.886TyrIle: 2.886 ± 0.481
3.395TyrLys: 3.395 ± 0.574
2.377TyrLeu: 2.377 ± 0.523
0.934TyrMet: 0.934 ± 0.319
2.462TyrAsn: 2.462 ± 0.509
1.103TyrPro: 1.103 ± 0.325
1.019TyrGln: 1.019 ± 0.274
1.698TyrArg: 1.698 ± 0.415
1.613TyrSer: 1.613 ± 0.379
1.952TyrThr: 1.952 ± 0.448
2.037TyrVal: 2.037 ± 0.423
0.594TyrTrp: 0.594 ± 0.191
1.867TyrTyr: 1.867 ± 0.55
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (11782 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski