Amino acid dipeptide frequency for Streptococcus phage Javan577

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
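As an illustration of the idea, here is a minimal sketch of how dipeptide frequencies could be computed from a set of protein sequences. It assumes one-letter amino acid codes, counts overlapping pairs within each protein only, and normalizes by the total number of pairs; the function name and the toy sequences are hypothetical, and the database itself may normalize differently.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of counted dipeptides.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MKKLA", "AKK"]
freqs = dipeptide_per_mille(toy)
```

With this toy input there are six dipeptides in total, two of which are KK, so `freqs["KK"]` is one third of 1000.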

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
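The uniform expectation and the over/under labelling can be sketched as follows. This assumes the cut-off is simply the uniform expectation of 1/400; the function names are hypothetical and the page may apply a different colouring rule. The three example values are copied from the table.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


expected_20 = expected_per_mille()    # 1000 / 400 = 2.5 per mille
expected_21 = expected_per_mille(21)  # 1000 / 441, about 2.27 per mille

# Values in per mille, copied from the table.
examples = {"AlaAla": 3.802, "AlaCys": 0.185, "GluLeu": 7.326}
labels = {name: classify(value, expected_20) for name, value in examples.items()}
```

Under this assumption AlaAla and GluLeu come out as overrepresented and AlaCys as underrepresented.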

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
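A small sketch of the unit conversion, using the AlaAla value from the table as input; the helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


ala_ala = 3.802  # per mille, copied from the table
fraction = per_mille_to_fraction(ala_ala)  # roughly 0.0038
percent = per_mille_to_percent(ala_ala)    # roughly 0.38
```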

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.802AlaAla: 3.802 ± 0.965
0.185AlaCys: 0.185 ± 0.135
4.359AlaAsp: 4.359 ± 1.11
5.843AlaGlu: 5.843 ± 0.824
2.504AlaPhe: 2.504 ± 0.424
4.451AlaGly: 4.451 ± 0.989
0.649AlaHis: 0.649 ± 0.266
5.935AlaIle: 5.935 ± 1.25
5.564AlaLys: 5.564 ± 0.637
4.822AlaLeu: 4.822 ± 0.907
2.226AlaMet: 2.226 ± 0.449
4.544AlaAsn: 4.544 ± 0.461
1.762AlaPro: 1.762 ± 0.391
2.226AlaGln: 2.226 ± 0.58
2.875AlaArg: 2.875 ± 0.52
5.657AlaSer: 5.657 ± 0.894
5.101AlaThr: 5.101 ± 0.645
4.359AlaVal: 4.359 ± 0.639
0.371AlaTrp: 0.371 ± 0.172
2.689AlaTyr: 2.689 ± 0.616
0.0AlaXaa: 0.0 ± 0.0
Cys
0.371CysAla: 0.371 ± 0.183
0.093CysCys: 0.093 ± 0.11
0.185CysAsp: 0.185 ± 0.129
0.927CysGlu: 0.927 ± 0.288
0.278CysPhe: 0.278 ± 0.176
0.093CysGly: 0.093 ± 0.097
0.185CysHis: 0.185 ± 0.12
0.371CysIle: 0.371 ± 0.223
0.371CysLys: 0.371 ± 0.188
0.464CysLeu: 0.464 ± 0.224
0.0CysMet: 0.0 ± 0.0
0.185CysAsn: 0.185 ± 0.118
0.185CysPro: 0.185 ± 0.116
0.093CysGln: 0.093 ± 0.097
0.278CysArg: 0.278 ± 0.164
0.742CysSer: 0.742 ± 0.216
0.093CysThr: 0.093 ± 0.092
0.464CysVal: 0.464 ± 0.242
0.185CysTrp: 0.185 ± 0.138
0.093CysTyr: 0.093 ± 0.094
0.0CysXaa: 0.0 ± 0.0
Asp
3.06AspAla: 3.06 ± 0.714
0.649AspCys: 0.649 ± 0.276
4.637AspAsp: 4.637 ± 0.924
5.843AspGlu: 5.843 ± 0.909
3.246AspPhe: 3.246 ± 0.459
5.379AspGly: 5.379 ± 0.728
0.835AspHis: 0.835 ± 0.27
3.895AspIle: 3.895 ± 0.54
5.564AspLys: 5.564 ± 0.575
5.101AspLeu: 5.101 ± 0.794
1.113AspMet: 1.113 ± 0.308
3.617AspAsn: 3.617 ± 0.52
1.206AspPro: 1.206 ± 0.414
1.762AspGln: 1.762 ± 0.424
2.597AspArg: 2.597 ± 0.597
3.617AspSer: 3.617 ± 0.61
3.153AspThr: 3.153 ± 0.638
4.173AspVal: 4.173 ± 0.646
1.113AspTrp: 1.113 ± 0.333
2.04AspTyr: 2.04 ± 0.513
0.0AspXaa: 0.0 ± 0.0
Glu
6.399GluAla: 6.399 ± 1.015
0.278GluCys: 0.278 ± 0.226
3.802GluAsp: 3.802 ± 0.715
5.75GluGlu: 5.75 ± 1.125
3.802GluPhe: 3.802 ± 0.557
2.968GluGly: 2.968 ± 0.481
0.835GluHis: 0.835 ± 0.28
6.77GluIle: 6.77 ± 0.832
7.234GluLys: 7.234 ± 1.046
7.326GluLeu: 7.326 ± 0.779
2.504GluMet: 2.504 ± 0.637
3.524GluAsn: 3.524 ± 0.532
1.577GluPro: 1.577 ± 0.515
2.968GluGln: 2.968 ± 0.646
4.266GluArg: 4.266 ± 0.707
5.193GluSer: 5.193 ± 0.739
3.895GluThr: 3.895 ± 0.59
4.915GluVal: 4.915 ± 0.698
1.206GluTrp: 1.206 ± 0.268
3.06GluTyr: 3.06 ± 0.52
0.0GluXaa: 0.0 ± 0.0
Phe
2.875PheAla: 2.875 ± 0.546
0.093PheCys: 0.093 ± 0.097
2.597PheAsp: 2.597 ± 0.662
3.431PheGlu: 3.431 ± 0.493
1.206PhePhe: 1.206 ± 0.3
3.06PheGly: 3.06 ± 0.386
0.742PheHis: 0.742 ± 0.248
2.133PheIle: 2.133 ± 0.408
4.359PheLys: 4.359 ± 0.812
2.875PheLeu: 2.875 ± 0.508
0.649PheMet: 0.649 ± 0.246
3.06PheAsn: 3.06 ± 0.649
1.298PhePro: 1.298 ± 0.327
1.391PheGln: 1.391 ± 0.414
1.762PheArg: 1.762 ± 0.446
3.06PheSer: 3.06 ± 0.632
2.968PheThr: 2.968 ± 0.74
2.689PheVal: 2.689 ± 0.668
0.556PheTrp: 0.556 ± 0.302
1.577PheTyr: 1.577 ± 0.372
0.0PheXaa: 0.0 ± 0.0
Gly
4.08GlyAla: 4.08 ± 0.664
0.278GlyCys: 0.278 ± 0.161
3.71GlyAsp: 3.71 ± 0.513
3.71GlyGlu: 3.71 ± 0.663
3.153GlyPhe: 3.153 ± 0.657
4.451GlyGly: 4.451 ± 0.854
1.113GlyHis: 1.113 ± 0.418
4.544GlyIle: 4.544 ± 0.634
5.75GlyLys: 5.75 ± 0.585
6.213GlyLeu: 6.213 ± 0.973
2.875GlyMet: 2.875 ± 0.515
4.359GlyAsn: 4.359 ± 1.015
1.298GlyPro: 1.298 ± 0.428
2.411GlyGln: 2.411 ± 0.364
3.988GlyArg: 3.988 ± 0.605
3.988GlySer: 3.988 ± 1.327
3.617GlyThr: 3.617 ± 0.855
3.246GlyVal: 3.246 ± 0.576
1.113GlyTrp: 1.113 ± 0.436
2.782GlyTyr: 2.782 ± 0.591
0.0GlyXaa: 0.0 ± 0.0
His
0.927HisAla: 0.927 ± 0.362
0.093HisCys: 0.093 ± 0.086
1.855HisAsp: 1.855 ± 0.587
1.113HisGlu: 1.113 ± 0.339
0.556HisPhe: 0.556 ± 0.25
1.113HisGly: 1.113 ± 0.319
0.093HisHis: 0.093 ± 0.088
0.927HisIle: 0.927 ± 0.327
0.742HisLys: 0.742 ± 0.282
1.113HisLeu: 1.113 ± 0.309
0.185HisMet: 0.185 ± 0.135
0.464HisAsn: 0.464 ± 0.215
0.742HisPro: 0.742 ± 0.261
0.556HisGln: 0.556 ± 0.314
0.649HisArg: 0.649 ± 0.246
0.649HisSer: 0.649 ± 0.213
1.02HisThr: 1.02 ± 0.23
0.927HisVal: 0.927 ± 0.363
0.278HisTrp: 0.278 ± 0.138
1.02HisTyr: 1.02 ± 0.399
0.0HisXaa: 0.0 ± 0.0
Ile
4.266IleAla: 4.266 ± 0.71
0.556IleCys: 0.556 ± 0.296
3.802IleAsp: 3.802 ± 0.568
7.234IleGlu: 7.234 ± 0.839
2.226IlePhe: 2.226 ± 0.364
5.101IleGly: 5.101 ± 0.971
1.855IleHis: 1.855 ± 0.425
3.431IleIle: 3.431 ± 0.53
6.677IleLys: 6.677 ± 0.84
3.802IleLeu: 3.802 ± 0.717
1.577IleMet: 1.577 ± 0.383
4.359IleAsn: 4.359 ± 0.695
2.968IlePro: 2.968 ± 0.596
1.669IleGln: 1.669 ± 0.28
3.71IleArg: 3.71 ± 0.554
3.06IleSer: 3.06 ± 0.549
3.71IleThr: 3.71 ± 0.574
3.524IleVal: 3.524 ± 0.537
0.649IleTrp: 0.649 ± 0.208
2.968IleTyr: 2.968 ± 0.552
0.0IleXaa: 0.0 ± 0.0
Lys
7.048LysAla: 7.048 ± 0.891
0.185LysCys: 0.185 ± 0.126
4.822LysAsp: 4.822 ± 0.674
7.141LysGlu: 7.141 ± 0.951
2.875LysPhe: 2.875 ± 0.551
4.544LysGly: 4.544 ± 0.627
1.113LysHis: 1.113 ± 0.361
6.399LysIle: 6.399 ± 0.916
6.306LysLys: 6.306 ± 0.82
6.492LysLeu: 6.492 ± 0.938
1.577LysMet: 1.577 ± 0.407
4.544LysAsn: 4.544 ± 0.667
2.411LysPro: 2.411 ± 0.525
4.544LysGln: 4.544 ± 0.723
4.173LysArg: 4.173 ± 0.735
4.637LysSer: 4.637 ± 0.73
4.544LysThr: 4.544 ± 0.591
5.472LysVal: 5.472 ± 0.737
1.484LysTrp: 1.484 ± 0.416
2.133LysTyr: 2.133 ± 0.439
0.0LysXaa: 0.0 ± 0.0
Leu
7.048LeuAla: 7.048 ± 1.138
0.371LeuCys: 0.371 ± 0.16
5.472LeuAsp: 5.472 ± 0.635
6.955LeuGlu: 6.955 ± 1.042
2.689LeuPhe: 2.689 ± 0.542
4.451LeuGly: 4.451 ± 0.868
0.927LeuHis: 0.927 ± 0.329
3.895LeuIle: 3.895 ± 0.632
7.141LeuLys: 7.141 ± 0.905
5.935LeuLeu: 5.935 ± 0.794
1.669LeuMet: 1.669 ± 0.448
4.544LeuAsn: 4.544 ± 0.836
1.948LeuPro: 1.948 ± 0.425
1.855LeuGln: 1.855 ± 0.319
3.339LeuArg: 3.339 ± 0.614
5.843LeuSer: 5.843 ± 0.985
6.306LeuThr: 6.306 ± 0.639
4.637LeuVal: 4.637 ± 0.78
0.649LeuTrp: 0.649 ± 0.249
2.689LeuTyr: 2.689 ± 0.525
0.0LeuXaa: 0.0 ± 0.0
Met
3.246MetAla: 3.246 ± 0.658
0.185MetCys: 0.185 ± 0.142
1.669MetAsp: 1.669 ± 0.378
1.855MetGlu: 1.855 ± 0.362
1.02MetPhe: 1.02 ± 0.321
1.484MetGly: 1.484 ± 0.459
0.371MetHis: 0.371 ± 0.226
2.133MetIle: 2.133 ± 0.445
2.226MetLys: 2.226 ± 0.425
1.206MetLeu: 1.206 ± 0.372
0.556MetMet: 0.556 ± 0.226
1.669MetAsn: 1.669 ± 0.304
0.464MetPro: 0.464 ± 0.214
0.835MetGln: 0.835 ± 0.333
0.649MetArg: 0.649 ± 0.323
2.133MetSer: 2.133 ± 0.451
1.762MetThr: 1.762 ± 0.3
1.113MetVal: 1.113 ± 0.293
0.093MetTrp: 0.093 ± 0.096
1.206MetTyr: 1.206 ± 0.297
0.0MetXaa: 0.0 ± 0.0
Asn
3.524AsnAla: 3.524 ± 0.585
0.464AsnCys: 0.464 ± 0.179
2.597AsnAsp: 2.597 ± 0.643
4.08AsnGlu: 4.08 ± 0.654
2.968AsnPhe: 2.968 ± 0.446
6.121AsnGly: 6.121 ± 0.859
1.298AsnHis: 1.298 ± 0.358
3.617AsnIle: 3.617 ± 0.667
3.802AsnLys: 3.802 ± 0.481
3.524AsnLeu: 3.524 ± 0.545
1.391AsnMet: 1.391 ± 0.409
1.855AsnAsn: 1.855 ± 0.312
2.504AsnPro: 2.504 ± 0.524
3.06AsnGln: 3.06 ± 0.654
2.689AsnArg: 2.689 ± 0.429
3.802AsnSer: 3.802 ± 0.749
2.782AsnThr: 2.782 ± 0.528
3.524AsnVal: 3.524 ± 0.459
1.206AsnTrp: 1.206 ± 0.419
1.855AsnTyr: 1.855 ± 0.458
0.0AsnXaa: 0.0 ± 0.0
Pro
1.391ProAla: 1.391 ± 0.393
0.093ProCys: 0.093 ± 0.097
1.391ProAsp: 1.391 ± 0.364
1.855ProGlu: 1.855 ± 0.523
1.669ProPhe: 1.669 ± 0.356
0.742ProGly: 0.742 ± 0.217
0.464ProHis: 0.464 ± 0.231
1.577ProIle: 1.577 ± 0.423
2.689ProLys: 2.689 ± 0.662
2.133ProLeu: 2.133 ± 0.398
0.464ProMet: 0.464 ± 0.264
2.597ProAsn: 2.597 ± 0.528
0.556ProPro: 0.556 ± 0.237
0.927ProGln: 0.927 ± 0.266
1.206ProArg: 1.206 ± 0.458
1.855ProSer: 1.855 ± 0.482
2.226ProThr: 2.226 ± 0.499
1.669ProVal: 1.669 ± 0.364
0.278ProTrp: 0.278 ± 0.144
1.02ProTyr: 1.02 ± 0.295
0.0ProXaa: 0.0 ± 0.0
Gln
3.71GlnAla: 3.71 ± 0.664
0.093GlnCys: 0.093 ± 0.103
1.855GlnAsp: 1.855 ± 0.461
2.597GlnGlu: 2.597 ± 0.565
1.391GlnPhe: 1.391 ± 0.257
2.411GlnGly: 2.411 ± 0.644
0.185GlnHis: 0.185 ± 0.14
2.318GlnIle: 2.318 ± 0.427
3.153GlnLys: 3.153 ± 0.559
3.246GlnLeu: 3.246 ± 0.812
0.835GlnMet: 0.835 ± 0.245
1.762GlnAsn: 1.762 ± 0.511
0.927GlnPro: 0.927 ± 0.319
1.391GlnGln: 1.391 ± 0.339
2.226GlnArg: 2.226 ± 0.66
2.782GlnSer: 2.782 ± 0.543
1.762GlnThr: 1.762 ± 0.361
2.504GlnVal: 2.504 ± 0.537
0.556GlnTrp: 0.556 ± 0.193
1.762GlnTyr: 1.762 ± 0.378
0.0GlnXaa: 0.0 ± 0.0
Arg
2.318ArgAla: 2.318 ± 0.544
0.278ArgCys: 0.278 ± 0.165
2.968ArgAsp: 2.968 ± 0.607
3.802ArgGlu: 3.802 ± 0.544
2.133ArgPhe: 2.133 ± 0.473
2.689ArgGly: 2.689 ± 0.546
0.742ArgHis: 0.742 ± 0.28
2.689ArgIle: 2.689 ± 0.533
3.617ArgLys: 3.617 ± 0.742
3.431ArgLeu: 3.431 ± 0.589
1.948ArgMet: 1.948 ± 0.487
2.968ArgAsn: 2.968 ± 0.434
1.484ArgPro: 1.484 ± 0.56
2.226ArgGln: 2.226 ± 0.425
2.318ArgArg: 2.318 ± 0.47
2.133ArgSer: 2.133 ± 0.382
2.226ArgThr: 2.226 ± 0.392
2.968ArgVal: 2.968 ± 0.506
1.02ArgTrp: 1.02 ± 0.4
2.782ArgTyr: 2.782 ± 0.458
0.0ArgXaa: 0.0 ± 0.0
Ser
4.544SerAla: 4.544 ± 0.728
0.278SerCys: 0.278 ± 0.163
4.637SerAsp: 4.637 ± 0.683
4.637SerGlu: 4.637 ± 0.772
3.339SerPhe: 3.339 ± 0.513
4.637SerGly: 4.637 ± 1.103
1.113SerHis: 1.113 ± 0.368
3.246SerIle: 3.246 ± 0.559
4.173SerLys: 4.173 ± 0.836
5.564SerLeu: 5.564 ± 0.702
2.411SerMet: 2.411 ± 0.359
3.524SerAsn: 3.524 ± 0.54
1.948SerPro: 1.948 ± 0.473
2.689SerGln: 2.689 ± 0.529
1.577SerArg: 1.577 ± 0.42
4.73SerSer: 4.73 ± 0.761
4.08SerThr: 4.08 ± 0.753
3.71SerVal: 3.71 ± 0.677
0.835SerTrp: 0.835 ± 0.236
2.504SerTyr: 2.504 ± 0.587
0.0SerXaa: 0.0 ± 0.0
Thr
3.617ThrAla: 3.617 ± 0.555
0.464ThrCys: 0.464 ± 0.251
3.524ThrAsp: 3.524 ± 0.64
3.988ThrGlu: 3.988 ± 0.743
3.246ThrPhe: 3.246 ± 0.704
4.915ThrGly: 4.915 ± 0.845
0.742ThrHis: 0.742 ± 0.313
5.101ThrIle: 5.101 ± 1.026
4.637ThrLys: 4.637 ± 0.602
5.286ThrLeu: 5.286 ± 0.763
1.02ThrMet: 1.02 ± 0.343
3.06ThrAsn: 3.06 ± 0.741
1.02ThrPro: 1.02 ± 0.295
2.133ThrGln: 2.133 ± 0.772
2.318ThrArg: 2.318 ± 0.438
2.504ThrSer: 2.504 ± 0.455
3.71ThrThr: 3.71 ± 0.644
5.75ThrVal: 5.75 ± 0.776
0.742ThrTrp: 0.742 ± 0.204
1.855ThrTyr: 1.855 ± 0.471
0.0ThrXaa: 0.0 ± 0.0
Val
4.266ValAla: 4.266 ± 0.559
0.649ValCys: 0.649 ± 0.253
4.915ValAsp: 4.915 ± 0.569
4.359ValGlu: 4.359 ± 0.54
2.318ValPhe: 2.318 ± 0.439
4.915ValGly: 4.915 ± 0.628
0.371ValHis: 0.371 ± 0.152
5.101ValIle: 5.101 ± 0.784
4.08ValLys: 4.08 ± 0.695
4.544ValLeu: 4.544 ± 0.738
1.577ValMet: 1.577 ± 0.354
3.246ValAsn: 3.246 ± 0.462
1.669ValPro: 1.669 ± 0.388
2.133ValGln: 2.133 ± 0.41
2.689ValArg: 2.689 ± 0.654
4.544ValSer: 4.544 ± 0.538
4.08ValThr: 4.08 ± 0.625
4.544ValVal: 4.544 ± 0.798
0.649ValTrp: 0.649 ± 0.227
1.855ValTyr: 1.855 ± 0.366
0.0ValXaa: 0.0 ± 0.0
Trp
1.02TrpAla: 1.02 ± 0.359
0.0TrpCys: 0.0 ± 0.0
0.742TrpAsp: 0.742 ± 0.26
0.835TrpGlu: 0.835 ± 0.261
0.742TrpPhe: 0.742 ± 0.298
1.298TrpGly: 1.298 ± 0.298
0.278TrpHis: 0.278 ± 0.19
0.556TrpIle: 0.556 ± 0.278
1.02TrpLys: 1.02 ± 0.289
1.391TrpLeu: 1.391 ± 0.3
0.371TrpMet: 0.371 ± 0.159
0.649TrpAsn: 0.649 ± 0.245
0.0TrpPro: 0.0 ± 0.0
0.742TrpGln: 0.742 ± 0.257
0.835TrpArg: 0.835 ± 0.191
0.835TrpSer: 0.835 ± 0.284
0.835TrpThr: 0.835 ± 0.296
0.742TrpVal: 0.742 ± 0.34
0.185TrpTrp: 0.185 ± 0.135
0.371TrpTyr: 0.371 ± 0.228
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.411TyrAla: 2.411 ± 0.479
0.371TyrCys: 0.371 ± 0.17
3.06TyrAsp: 3.06 ± 0.599
2.318TyrGlu: 2.318 ± 0.398
1.02TyrPhe: 1.02 ± 0.311
1.855TyrGly: 1.855 ± 0.418
1.113TyrHis: 1.113 ± 0.372
2.504TyrIle: 2.504 ± 0.571
3.339TyrLys: 3.339 ± 0.617
3.988TyrLeu: 3.988 ± 0.568
0.835TyrMet: 0.835 ± 0.262
2.133TyrAsn: 2.133 ± 0.52
0.835TyrPro: 0.835 ± 0.317
1.855TyrGln: 1.855 ± 0.394
2.597TyrArg: 2.597 ± 0.424
2.411TyrSer: 2.411 ± 0.538
1.762TyrThr: 1.762 ± 0.411
1.577TyrVal: 1.577 ± 0.46
0.278TyrTrp: 0.278 ± 0.197
1.948TyrTyr: 1.948 ± 0.405
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (10784 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
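Protein-level bootstrapping could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported as the error. The function names, the fixed seed, the use of the population standard deviation, and the toy sequences are all assumptions for illustration; the exact procedure used by the database is not described on this page.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over resampled proteomes."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: report the standard deviation of the bootstrap estimates.
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MKKLA", "AKK", "GSTKKV"]
kk_error = bootstrap_error(toy, "KK")
```

Resampling whole proteins rather than individual residues keeps every dipeptide inside its original sequence context, which is presumably why the error is estimated at the protein level.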

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski