Amino acid dipepetide frequency for Streptococcus phage Javan617

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.089AlaAla: 4.089 ± 1.634
0.389AlaCys: 0.389 ± 0.195
2.921AlaAsp: 2.921 ± 0.509
5.16AlaGlu: 5.16 ± 0.601
3.115AlaPhe: 3.115 ± 0.626
5.257AlaGly: 5.257 ± 1.551
0.779AlaHis: 0.779 ± 0.249
5.841AlaIle: 5.841 ± 1.26
5.257AlaLys: 5.257 ± 0.628
5.354AlaLeu: 5.354 ± 1.261
1.947AlaMet: 1.947 ± 0.586
4.381AlaAsn: 4.381 ± 0.761
2.531AlaPro: 2.531 ± 0.485
3.213AlaGln: 3.213 ± 0.975
2.434AlaArg: 2.434 ± 0.566
5.646AlaSer: 5.646 ± 0.86
4.673AlaThr: 4.673 ± 0.981
4.186AlaVal: 4.186 ± 0.944
0.389AlaTrp: 0.389 ± 0.229
2.142AlaTyr: 2.142 ± 0.418
0.0AlaXaa: 0.0 ± 0.0
Cys
0.292CysAla: 0.292 ± 0.15
0.0CysCys: 0.0 ± 0.0
0.292CysAsp: 0.292 ± 0.223
0.292CysGlu: 0.292 ± 0.144
0.195CysPhe: 0.195 ± 0.129
0.389CysGly: 0.389 ± 0.175
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.681CysLys: 0.681 ± 0.262
0.292CysLeu: 0.292 ± 0.156
0.097CysMet: 0.097 ± 0.11
0.195CysAsn: 0.195 ± 0.128
0.195CysPro: 0.195 ± 0.146
0.097CysGln: 0.097 ± 0.115
0.195CysArg: 0.195 ± 0.129
0.487CysSer: 0.487 ± 0.219
0.195CysThr: 0.195 ± 0.134
0.097CysVal: 0.097 ± 0.085
0.097CysTrp: 0.097 ± 0.111
0.195CysTyr: 0.195 ± 0.15
0.0CysXaa: 0.0 ± 0.0
Asp
3.407AspAla: 3.407 ± 0.776
0.389AspCys: 0.389 ± 0.163
4.089AspAsp: 4.089 ± 0.829
4.478AspGlu: 4.478 ± 0.886
3.407AspPhe: 3.407 ± 0.603
4.381AspGly: 4.381 ± 0.752
0.487AspHis: 0.487 ± 0.224
5.744AspIle: 5.744 ± 0.883
5.16AspLys: 5.16 ± 1.004
6.62AspLeu: 6.62 ± 1.08
1.363AspMet: 1.363 ± 0.327
4.381AspAsn: 4.381 ± 0.686
1.947AspPro: 1.947 ± 0.542
0.876AspGln: 0.876 ± 0.356
3.115AspArg: 3.115 ± 0.573
4.186AspSer: 4.186 ± 0.47
3.602AspThr: 3.602 ± 0.523
2.823AspVal: 2.823 ± 0.477
0.584AspTrp: 0.584 ± 0.203
3.31AspTyr: 3.31 ± 0.671
0.0AspXaa: 0.0 ± 0.0
Glu
3.797GluAla: 3.797 ± 0.649
0.292GluCys: 0.292 ± 0.14
4.868GluAsp: 4.868 ± 0.795
4.576GluGlu: 4.576 ± 0.958
2.823GluPhe: 2.823 ± 0.594
2.726GluGly: 2.726 ± 0.428
1.071GluHis: 1.071 ± 0.405
5.549GluIle: 5.549 ± 0.902
6.62GluLys: 6.62 ± 0.969
9.248GluLeu: 9.248 ± 1.163
2.531GluMet: 2.531 ± 0.652
3.894GluAsn: 3.894 ± 0.878
1.266GluPro: 1.266 ± 0.311
3.115GluGln: 3.115 ± 0.48
3.213GluArg: 3.213 ± 0.526
3.213GluSer: 3.213 ± 0.599
3.505GluThr: 3.505 ± 0.553
3.797GluVal: 3.797 ± 0.622
0.584GluTrp: 0.584 ± 0.296
2.921GluTyr: 2.921 ± 0.627
0.0GluXaa: 0.0 ± 0.0
Phe
2.921PheAla: 2.921 ± 0.511
0.292PheCys: 0.292 ± 0.191
5.16PheAsp: 5.16 ± 0.701
4.478PheGlu: 4.478 ± 0.726
1.752PhePhe: 1.752 ± 0.441
3.991PheGly: 3.991 ± 0.809
0.779PheHis: 0.779 ± 0.229
3.115PheIle: 3.115 ± 0.477
3.407PheLys: 3.407 ± 0.498
2.921PheLeu: 2.921 ± 0.599
0.779PheMet: 0.779 ± 0.23
2.531PheAsn: 2.531 ± 0.465
0.681PhePro: 0.681 ± 0.228
1.46PheGln: 1.46 ± 0.349
1.46PheArg: 1.46 ± 0.427
2.629PheSer: 2.629 ± 0.611
2.044PheThr: 2.044 ± 0.439
1.947PheVal: 1.947 ± 0.627
0.292PheTrp: 0.292 ± 0.14
1.752PheTyr: 1.752 ± 0.513
0.0PheXaa: 0.0 ± 0.0
Gly
4.673GlyAla: 4.673 ± 1.46
0.389GlyCys: 0.389 ± 0.249
2.531GlyAsp: 2.531 ± 0.509
3.213GlyGlu: 3.213 ± 0.633
2.629GlyPhe: 2.629 ± 0.598
3.115GlyGly: 3.115 ± 1.111
1.071GlyHis: 1.071 ± 0.276
5.354GlyIle: 5.354 ± 0.85
5.16GlyLys: 5.16 ± 0.61
6.133GlyLeu: 6.133 ± 0.88
1.752GlyMet: 1.752 ± 0.413
4.186GlyAsn: 4.186 ± 0.487
0.389GlyPro: 0.389 ± 0.177
2.434GlyGln: 2.434 ± 0.535
2.629GlyArg: 2.629 ± 0.419
3.602GlySer: 3.602 ± 0.852
4.868GlyThr: 4.868 ± 1.449
5.062GlyVal: 5.062 ± 0.828
0.487GlyTrp: 0.487 ± 0.206
3.018GlyTyr: 3.018 ± 0.458
0.0GlyXaa: 0.0 ± 0.0
His
0.779HisAla: 0.779 ± 0.297
0.097HisCys: 0.097 ± 0.098
1.363HisAsp: 1.363 ± 0.328
0.974HisGlu: 0.974 ± 0.348
0.584HisPhe: 0.584 ± 0.245
0.681HisGly: 0.681 ± 0.283
0.292HisHis: 0.292 ± 0.157
0.876HisIle: 0.876 ± 0.284
1.558HisLys: 1.558 ± 0.415
0.779HisLeu: 0.779 ± 0.212
0.389HisMet: 0.389 ± 0.172
0.487HisAsn: 0.487 ± 0.174
0.681HisPro: 0.681 ± 0.26
0.779HisGln: 0.779 ± 0.26
0.292HisArg: 0.292 ± 0.163
0.779HisSer: 0.779 ± 0.251
0.681HisThr: 0.681 ± 0.287
0.681HisVal: 0.681 ± 0.235
0.292HisTrp: 0.292 ± 0.196
0.779HisTyr: 0.779 ± 0.249
0.0HisXaa: 0.0 ± 0.0
Ile
6.231IleAla: 6.231 ± 1.242
0.195IleCys: 0.195 ± 0.126
7.399IleAsp: 7.399 ± 1.037
6.425IleGlu: 6.425 ± 1.038
3.115IlePhe: 3.115 ± 0.797
3.991IleGly: 3.991 ± 0.739
0.779IleHis: 0.779 ± 0.233
4.381IleIle: 4.381 ± 0.617
7.496IleLys: 7.496 ± 0.78
4.673IleLeu: 4.673 ± 0.882
2.044IleMet: 2.044 ± 0.376
4.965IleAsn: 4.965 ± 0.711
2.044IlePro: 2.044 ± 0.405
1.85IleGln: 1.85 ± 0.38
3.505IleArg: 3.505 ± 0.556
7.204IleSer: 7.204 ± 1.118
5.257IleThr: 5.257 ± 0.63
4.089IleVal: 4.089 ± 0.572
0.876IleTrp: 0.876 ± 0.295
2.336IleTyr: 2.336 ± 0.51
0.0IleXaa: 0.0 ± 0.0
Lys
5.452LysAla: 5.452 ± 0.728
0.195LysCys: 0.195 ± 0.139
4.673LysAsp: 4.673 ± 1.004
7.009LysGlu: 7.009 ± 1.033
3.213LysPhe: 3.213 ± 0.557
4.186LysGly: 4.186 ± 0.578
1.46LysHis: 1.46 ± 0.418
5.938LysIle: 5.938 ± 0.837
6.036LysLys: 6.036 ± 1.033
6.815LysLeu: 6.815 ± 0.901
2.239LysMet: 2.239 ± 0.437
4.283LysAsn: 4.283 ± 0.563
3.407LysPro: 3.407 ± 0.63
2.921LysGln: 2.921 ± 0.59
3.505LysArg: 3.505 ± 0.773
6.036LysSer: 6.036 ± 0.706
5.354LysThr: 5.354 ± 0.931
6.231LysVal: 6.231 ± 1.015
0.974LysTrp: 0.974 ± 0.363
3.505LysTyr: 3.505 ± 0.697
0.0LysXaa: 0.0 ± 0.0
Leu
7.107LeuAla: 7.107 ± 1.087
0.195LeuCys: 0.195 ± 0.158
5.062LeuAsp: 5.062 ± 0.696
5.452LeuGlu: 5.452 ± 0.866
3.213LeuPhe: 3.213 ± 0.463
6.523LeuGly: 6.523 ± 1.537
1.558LeuHis: 1.558 ± 0.327
6.231LeuIle: 6.231 ± 0.855
8.762LeuLys: 8.762 ± 1.035
5.257LeuLeu: 5.257 ± 0.879
1.168LeuMet: 1.168 ± 0.392
5.16LeuAsn: 5.16 ± 0.707
2.921LeuPro: 2.921 ± 0.601
2.629LeuGln: 2.629 ± 0.525
3.505LeuArg: 3.505 ± 0.522
6.523LeuSer: 6.523 ± 0.867
4.868LeuThr: 4.868 ± 0.659
3.699LeuVal: 3.699 ± 0.719
0.779LeuTrp: 0.779 ± 0.271
2.142LeuTyr: 2.142 ± 0.472
0.0LeuXaa: 0.0 ± 0.0
Met
2.239MetAla: 2.239 ± 0.547
0.097MetCys: 0.097 ± 0.089
1.168MetAsp: 1.168 ± 0.306
1.752MetGlu: 1.752 ± 0.43
0.584MetPhe: 0.584 ± 0.214
1.363MetGly: 1.363 ± 0.301
0.097MetHis: 0.097 ± 0.111
1.947MetIle: 1.947 ± 0.407
2.531MetLys: 2.531 ± 0.496
1.85MetLeu: 1.85 ± 0.5
0.681MetMet: 0.681 ± 0.264
1.558MetAsn: 1.558 ± 0.272
0.487MetPro: 0.487 ± 0.222
1.168MetGln: 1.168 ± 0.33
1.071MetArg: 1.071 ± 0.347
1.168MetSer: 1.168 ± 0.265
2.726MetThr: 2.726 ± 0.439
1.85MetVal: 1.85 ± 0.519
0.195MetTrp: 0.195 ± 0.141
1.168MetTyr: 1.168 ± 0.374
0.0MetXaa: 0.0 ± 0.0
Asn
3.505AsnAla: 3.505 ± 0.458
0.584AsnCys: 0.584 ± 0.198
3.602AsnAsp: 3.602 ± 0.633
3.31AsnGlu: 3.31 ± 0.713
2.726AsnPhe: 2.726 ± 0.476
3.991AsnGly: 3.991 ± 0.556
1.168AsnHis: 1.168 ± 0.288
4.283AsnIle: 4.283 ± 0.592
4.283AsnLys: 4.283 ± 0.982
4.381AsnLeu: 4.381 ± 0.626
1.266AsnMet: 1.266 ± 0.316
3.505AsnAsn: 3.505 ± 0.632
1.752AsnPro: 1.752 ± 0.511
2.434AsnGln: 2.434 ± 0.506
2.142AsnArg: 2.142 ± 0.387
4.186AsnSer: 4.186 ± 0.721
3.213AsnThr: 3.213 ± 0.499
2.823AsnVal: 2.823 ± 0.586
0.584AsnTrp: 0.584 ± 0.278
2.434AsnTyr: 2.434 ± 0.584
0.0AsnXaa: 0.0 ± 0.0
Pro
2.044ProAla: 2.044 ± 0.379
0.097ProCys: 0.097 ± 0.099
1.558ProAsp: 1.558 ± 0.427
1.752ProGlu: 1.752 ± 0.479
1.558ProPhe: 1.558 ± 0.366
1.752ProGly: 1.752 ± 0.379
0.292ProHis: 0.292 ± 0.168
2.921ProIle: 2.921 ± 0.405
2.336ProLys: 2.336 ± 0.55
1.752ProLeu: 1.752 ± 0.426
0.681ProMet: 0.681 ± 0.219
1.071ProAsn: 1.071 ± 0.342
0.487ProPro: 0.487 ± 0.199
1.363ProGln: 1.363 ± 0.278
1.558ProArg: 1.558 ± 0.513
1.655ProSer: 1.655 ± 0.36
2.239ProThr: 2.239 ± 0.412
1.85ProVal: 1.85 ± 0.448
0.292ProTrp: 0.292 ± 0.188
1.071ProTyr: 1.071 ± 0.294
0.0ProXaa: 0.0 ± 0.0
Gln
3.407GlnAla: 3.407 ± 0.925
0.292GlnCys: 0.292 ± 0.17
1.168GlnAsp: 1.168 ± 0.332
2.142GlnGlu: 2.142 ± 0.512
1.266GlnPhe: 1.266 ± 0.332
1.752GlnGly: 1.752 ± 0.468
0.097GlnHis: 0.097 ± 0.096
2.921GlnIle: 2.921 ± 0.449
3.407GlnLys: 3.407 ± 0.553
3.505GlnLeu: 3.505 ± 0.642
1.071GlnMet: 1.071 ± 0.292
1.655GlnAsn: 1.655 ± 0.447
0.681GlnPro: 0.681 ± 0.257
2.239GlnGln: 2.239 ± 0.668
2.044GlnArg: 2.044 ± 0.474
2.726GlnSer: 2.726 ± 0.481
2.336GlnThr: 2.336 ± 0.536
1.947GlnVal: 1.947 ± 0.51
0.195GlnTrp: 0.195 ± 0.144
1.46GlnTyr: 1.46 ± 0.451
0.0GlnXaa: 0.0 ± 0.0
Arg
1.85ArgAla: 1.85 ± 0.59
0.097ArgCys: 0.097 ± 0.088
3.018ArgAsp: 3.018 ± 0.477
2.142ArgGlu: 2.142 ± 0.434
2.239ArgPhe: 2.239 ± 0.534
2.629ArgGly: 2.629 ± 0.376
1.168ArgHis: 1.168 ± 0.334
3.31ArgIle: 3.31 ± 0.522
2.921ArgLys: 2.921 ± 0.692
3.602ArgLeu: 3.602 ± 0.727
1.363ArgMet: 1.363 ± 0.328
2.142ArgAsn: 2.142 ± 0.494
1.168ArgPro: 1.168 ± 0.329
1.655ArgGln: 1.655 ± 0.349
1.558ArgArg: 1.558 ± 0.448
3.018ArgSer: 3.018 ± 0.529
1.363ArgThr: 1.363 ± 0.413
2.336ArgVal: 2.336 ± 0.438
0.584ArgTrp: 0.584 ± 0.374
1.947ArgTyr: 1.947 ± 0.536
0.0ArgXaa: 0.0 ± 0.0
Ser
5.16SerAla: 5.16 ± 1.265
0.389SerCys: 0.389 ± 0.242
3.991SerAsp: 3.991 ± 0.66
5.646SerGlu: 5.646 ± 0.782
3.602SerPhe: 3.602 ± 0.751
5.16SerGly: 5.16 ± 0.767
0.974SerHis: 0.974 ± 0.375
5.16SerIle: 5.16 ± 0.635
5.938SerLys: 5.938 ± 0.881
5.646SerLeu: 5.646 ± 0.731
2.239SerMet: 2.239 ± 0.538
4.283SerAsn: 4.283 ± 0.447
2.239SerPro: 2.239 ± 0.429
3.213SerGln: 3.213 ± 0.515
2.726SerArg: 2.726 ± 0.475
4.381SerSer: 4.381 ± 1.053
2.921SerThr: 2.921 ± 0.693
3.602SerVal: 3.602 ± 0.657
0.876SerTrp: 0.876 ± 0.29
2.629SerTyr: 2.629 ± 0.476
0.0SerXaa: 0.0 ± 0.0
Thr
5.841ThrAla: 5.841 ± 1.637
0.0ThrCys: 0.0 ± 0.0
4.186ThrAsp: 4.186 ± 0.699
3.699ThrGlu: 3.699 ± 0.608
3.797ThrPhe: 3.797 ± 0.543
5.062ThrGly: 5.062 ± 1.004
0.389ThrHis: 0.389 ± 0.19
5.549ThrIle: 5.549 ± 0.652
3.991ThrLys: 3.991 ± 0.69
4.868ThrLeu: 4.868 ± 0.893
1.46ThrMet: 1.46 ± 0.378
1.85ThrAsn: 1.85 ± 0.413
2.044ThrPro: 2.044 ± 0.399
1.752ThrGln: 1.752 ± 0.44
2.142ThrArg: 2.142 ± 0.383
3.407ThrSer: 3.407 ± 0.588
3.797ThrThr: 3.797 ± 0.808
3.699ThrVal: 3.699 ± 0.547
0.779ThrTrp: 0.779 ± 0.302
2.823ThrTyr: 2.823 ± 0.628
0.0ThrXaa: 0.0 ± 0.0
Val
4.478ValAla: 4.478 ± 0.892
0.0ValCys: 0.0 ± 0.0
3.602ValAsp: 3.602 ± 0.524
4.186ValGlu: 4.186 ± 0.669
2.044ValPhe: 2.044 ± 0.479
2.921ValGly: 2.921 ± 0.642
0.389ValHis: 0.389 ± 0.162
5.938ValIle: 5.938 ± 0.768
4.186ValLys: 4.186 ± 0.474
3.213ValLeu: 3.213 ± 0.556
1.46ValMet: 1.46 ± 0.312
2.726ValAsn: 2.726 ± 0.62
1.85ValPro: 1.85 ± 0.357
1.071ValGln: 1.071 ± 0.331
1.947ValArg: 1.947 ± 0.383
5.841ValSer: 5.841 ± 0.979
4.965ValThr: 4.965 ± 0.654
3.991ValVal: 3.991 ± 0.587
0.292ValTrp: 0.292 ± 0.163
2.336ValTyr: 2.336 ± 0.557
0.0ValXaa: 0.0 ± 0.0
Trp
0.389TrpAla: 0.389 ± 0.182
0.0TrpCys: 0.0 ± 0.0
0.681TrpAsp: 0.681 ± 0.275
0.681TrpGlu: 0.681 ± 0.229
0.195TrpPhe: 0.195 ± 0.143
0.487TrpGly: 0.487 ± 0.248
0.195TrpHis: 0.195 ± 0.124
0.974TrpIle: 0.974 ± 0.359
1.071TrpLys: 1.071 ± 0.301
0.487TrpLeu: 0.487 ± 0.255
0.389TrpMet: 0.389 ± 0.191
0.195TrpAsn: 0.195 ± 0.144
0.292TrpPro: 0.292 ± 0.172
0.584TrpGln: 0.584 ± 0.272
0.195TrpArg: 0.195 ± 0.145
1.168TrpSer: 1.168 ± 0.338
0.487TrpThr: 0.487 ± 0.207
0.584TrpVal: 0.584 ± 0.201
0.097TrpTrp: 0.097 ± 0.081
0.487TrpTyr: 0.487 ± 0.2
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.044TyrAla: 2.044 ± 0.541
0.292TyrCys: 0.292 ± 0.201
3.018TyrAsp: 3.018 ± 0.61
2.239TyrGlu: 2.239 ± 0.585
2.336TyrPhe: 2.336 ± 0.725
2.336TyrGly: 2.336 ± 0.517
0.779TyrHis: 0.779 ± 0.362
2.823TyrIle: 2.823 ± 0.427
2.629TyrLys: 2.629 ± 0.474
5.354TyrLeu: 5.354 ± 0.924
0.681TyrMet: 0.681 ± 0.279
2.726TyrAsn: 2.726 ± 0.547
1.266TyrPro: 1.266 ± 0.395
1.46TyrGln: 1.46 ± 0.406
0.876TyrArg: 0.876 ± 0.365
2.921TyrSer: 2.921 ± 0.546
2.044TyrThr: 2.044 ± 0.582
2.239TyrVal: 2.239 ± 0.44
0.389TyrTrp: 0.389 ± 0.173
2.044TyrTyr: 2.044 ± 0.609
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (10273 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski