Amino acid dipepetide frequency for Streptococcus phage Javan48

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.089AlaAla: 3.089 ± 1.187
0.163AlaCys: 0.163 ± 0.125
3.576AlaAsp: 3.576 ± 0.586
4.227AlaGlu: 4.227 ± 0.523
2.845AlaPhe: 2.845 ± 0.548
4.064AlaGly: 4.064 ± 0.855
0.732AlaHis: 0.732 ± 0.215
5.446AlaIle: 5.446 ± 0.7
6.909AlaLys: 6.909 ± 0.933
4.714AlaLeu: 4.714 ± 0.9
1.3AlaMet: 1.3 ± 0.324
3.82AlaAsn: 3.82 ± 0.833
2.195AlaPro: 2.195 ± 0.501
2.113AlaGln: 2.113 ± 0.312
2.357AlaArg: 2.357 ± 0.429
4.227AlaSer: 4.227 ± 0.85
4.145AlaThr: 4.145 ± 0.765
3.901AlaVal: 3.901 ± 0.639
0.894AlaTrp: 0.894 ± 0.334
2.764AlaTyr: 2.764 ± 0.472
0.0AlaXaa: 0.0 ± 0.0
Cys
0.325CysAla: 0.325 ± 0.172
0.081CysCys: 0.081 ± 0.081
0.406CysAsp: 0.406 ± 0.225
0.244CysGlu: 0.244 ± 0.131
0.244CysPhe: 0.244 ± 0.16
0.081CysGly: 0.081 ± 0.085
0.163CysHis: 0.163 ± 0.119
0.081CysIle: 0.081 ± 0.08
0.732CysLys: 0.732 ± 0.272
0.569CysLeu: 0.569 ± 0.247
0.163CysMet: 0.163 ± 0.097
0.325CysAsn: 0.325 ± 0.151
0.081CysPro: 0.081 ± 0.081
0.081CysGln: 0.081 ± 0.065
0.244CysArg: 0.244 ± 0.162
0.163CysSer: 0.163 ± 0.111
0.163CysThr: 0.163 ± 0.104
0.488CysVal: 0.488 ± 0.199
0.081CysTrp: 0.081 ± 0.085
0.163CysTyr: 0.163 ± 0.13
0.0CysXaa: 0.0 ± 0.0
Asp
3.007AspAla: 3.007 ± 0.607
0.244AspCys: 0.244 ± 0.157
5.121AspAsp: 5.121 ± 0.8
4.958AspGlu: 4.958 ± 0.637
4.227AspPhe: 4.227 ± 0.516
4.47AspGly: 4.47 ± 0.881
0.732AspHis: 0.732 ± 0.218
4.633AspIle: 4.633 ± 0.784
6.096AspLys: 6.096 ± 0.738
5.039AspLeu: 5.039 ± 0.563
2.276AspMet: 2.276 ± 0.409
4.064AspAsn: 4.064 ± 0.619
1.951AspPro: 1.951 ± 0.412
1.463AspGln: 1.463 ± 0.345
2.113AspArg: 2.113 ± 0.381
4.877AspSer: 4.877 ± 0.544
3.251AspThr: 3.251 ± 0.438
3.658AspVal: 3.658 ± 0.638
0.732AspTrp: 0.732 ± 0.237
3.901AspTyr: 3.901 ± 0.663
0.0AspXaa: 0.0 ± 0.0
Glu
4.958GluAla: 4.958 ± 0.847
0.163GluCys: 0.163 ± 0.12
4.308GluAsp: 4.308 ± 0.566
6.34GluGlu: 6.34 ± 0.752
2.845GluPhe: 2.845 ± 0.512
2.438GluGly: 2.438 ± 0.528
1.219GluHis: 1.219 ± 0.415
5.771GluIle: 5.771 ± 0.695
5.69GluLys: 5.69 ± 0.942
7.559GluLeu: 7.559 ± 1.157
1.544GluMet: 1.544 ± 0.429
4.308GluAsn: 4.308 ± 0.501
1.869GluPro: 1.869 ± 0.431
3.17GluGln: 3.17 ± 0.643
2.926GluArg: 2.926 ± 0.739
3.658GluSer: 3.658 ± 0.527
4.552GluThr: 4.552 ± 0.727
5.608GluVal: 5.608 ± 0.604
1.138GluTrp: 1.138 ± 0.346
3.007GluTyr: 3.007 ± 0.542
0.0GluXaa: 0.0 ± 0.0
Phe
3.007PheAla: 3.007 ± 0.497
0.244PheCys: 0.244 ± 0.132
3.658PheAsp: 3.658 ± 0.719
3.333PheGlu: 3.333 ± 0.681
1.544PhePhe: 1.544 ± 0.483
1.951PheGly: 1.951 ± 0.396
0.325PheHis: 0.325 ± 0.172
3.414PheIle: 3.414 ± 0.487
3.983PheLys: 3.983 ± 0.524
3.17PheLeu: 3.17 ± 0.626
1.544PheMet: 1.544 ± 0.404
2.195PheAsn: 2.195 ± 0.446
0.488PhePro: 0.488 ± 0.238
0.894PheGln: 0.894 ± 0.31
1.788PheArg: 1.788 ± 0.399
3.82PheSer: 3.82 ± 0.61
2.601PheThr: 2.601 ± 0.401
3.007PheVal: 3.007 ± 0.519
0.244PheTrp: 0.244 ± 0.127
2.357PheTyr: 2.357 ± 0.43
0.0PheXaa: 0.0 ± 0.0
Gly
3.82GlyAla: 3.82 ± 0.82
0.244GlyCys: 0.244 ± 0.156
4.308GlyAsp: 4.308 ± 0.497
3.576GlyGlu: 3.576 ± 0.758
3.251GlyPhe: 3.251 ± 0.617
4.552GlyGly: 4.552 ± 0.86
0.65GlyHis: 0.65 ± 0.199
5.121GlyIle: 5.121 ± 0.612
5.283GlyLys: 5.283 ± 0.691
3.983GlyLeu: 3.983 ± 0.651
1.707GlyMet: 1.707 ± 0.426
3.089GlyAsn: 3.089 ± 0.529
1.057GlyPro: 1.057 ± 0.381
1.707GlyGln: 1.707 ± 0.41
2.195GlyArg: 2.195 ± 0.442
2.845GlySer: 2.845 ± 0.597
4.47GlyThr: 4.47 ± 0.813
3.82GlyVal: 3.82 ± 0.474
1.382GlyTrp: 1.382 ± 0.288
3.333GlyTyr: 3.333 ± 0.647
0.0GlyXaa: 0.0 ± 0.0
His
0.65HisAla: 0.65 ± 0.213
0.0HisCys: 0.0 ± 0.0
1.057HisAsp: 1.057 ± 0.323
0.65HisGlu: 0.65 ± 0.276
0.732HisPhe: 0.732 ± 0.276
0.244HisGly: 0.244 ± 0.144
0.325HisHis: 0.325 ± 0.222
0.732HisIle: 0.732 ± 0.321
1.3HisLys: 1.3 ± 0.369
0.975HisLeu: 0.975 ± 0.261
0.163HisMet: 0.163 ± 0.122
1.138HisAsn: 1.138 ± 0.327
0.244HisPro: 0.244 ± 0.134
0.975HisGln: 0.975 ± 0.204
0.406HisArg: 0.406 ± 0.172
0.732HisSer: 0.732 ± 0.327
0.813HisThr: 0.813 ± 0.322
1.057HisVal: 1.057 ± 0.384
0.0HisTrp: 0.0 ± 0.0
0.65HisTyr: 0.65 ± 0.208
0.0HisXaa: 0.0 ± 0.0
Ile
5.365IleAla: 5.365 ± 0.59
0.244IleCys: 0.244 ± 0.136
5.934IleAsp: 5.934 ± 0.616
6.421IleGlu: 6.421 ± 0.758
3.414IlePhe: 3.414 ± 0.814
4.47IleGly: 4.47 ± 0.636
0.569IleHis: 0.569 ± 0.239
3.983IleIle: 3.983 ± 0.663
5.039IleLys: 5.039 ± 0.647
4.389IleLeu: 4.389 ± 0.688
1.219IleMet: 1.219 ± 0.323
4.064IleAsn: 4.064 ± 0.544
3.333IlePro: 3.333 ± 0.51
2.438IleGln: 2.438 ± 0.42
2.764IleArg: 2.764 ± 0.468
4.633IleSer: 4.633 ± 0.785
3.333IleThr: 3.333 ± 0.497
5.121IleVal: 5.121 ± 0.686
0.732IleTrp: 0.732 ± 0.241
2.764IleTyr: 2.764 ± 0.361
0.0IleXaa: 0.0 ± 0.0
Lys
6.584LysAla: 6.584 ± 1.006
0.488LysCys: 0.488 ± 0.219
6.015LysAsp: 6.015 ± 0.774
7.234LysGlu: 7.234 ± 1.028
2.926LysPhe: 2.926 ± 0.512
5.527LysGly: 5.527 ± 0.741
0.813LysHis: 0.813 ± 0.322
5.527LysIle: 5.527 ± 0.744
9.022LysLys: 9.022 ± 0.961
8.86LysLeu: 8.86 ± 0.864
2.357LysMet: 2.357 ± 0.502
6.34LysAsn: 6.34 ± 0.633
2.682LysPro: 2.682 ± 0.557
3.901LysGln: 3.901 ± 0.588
4.064LysArg: 4.064 ± 0.667
4.552LysSer: 4.552 ± 0.582
6.096LysThr: 6.096 ± 0.583
5.446LysVal: 5.446 ± 0.678
0.894LysTrp: 0.894 ± 0.296
3.576LysTyr: 3.576 ± 0.526
0.0LysXaa: 0.0 ± 0.0
Leu
5.69LeuAla: 5.69 ± 0.612
0.488LeuCys: 0.488 ± 0.177
4.877LeuAsp: 4.877 ± 0.597
6.746LeuGlu: 6.746 ± 0.807
3.251LeuPhe: 3.251 ± 0.628
4.308LeuGly: 4.308 ± 0.835
1.057LeuHis: 1.057 ± 0.351
5.365LeuIle: 5.365 ± 0.621
8.535LeuLys: 8.535 ± 1.169
6.177LeuLeu: 6.177 ± 0.849
1.788LeuMet: 1.788 ± 0.401
5.202LeuAsn: 5.202 ± 0.63
2.52LeuPro: 2.52 ± 0.453
3.007LeuGln: 3.007 ± 0.481
2.845LeuArg: 2.845 ± 0.57
5.608LeuSer: 5.608 ± 0.692
5.283LeuThr: 5.283 ± 0.749
4.958LeuVal: 4.958 ± 0.642
1.138LeuTrp: 1.138 ± 0.254
2.926LeuTyr: 2.926 ± 0.474
0.0LeuXaa: 0.0 ± 0.0
Met
1.707MetAla: 1.707 ± 0.234
0.081MetCys: 0.081 ± 0.079
1.788MetAsp: 1.788 ± 0.444
1.219MetGlu: 1.219 ± 0.403
0.894MetPhe: 0.894 ± 0.242
0.732MetGly: 0.732 ± 0.276
0.244MetHis: 0.244 ± 0.112
1.219MetIle: 1.219 ± 0.333
1.544MetLys: 1.544 ± 0.393
2.113MetLeu: 2.113 ± 0.427
0.732MetMet: 0.732 ± 0.227
0.975MetAsn: 0.975 ± 0.305
0.325MetPro: 0.325 ± 0.153
1.219MetGln: 1.219 ± 0.236
0.569MetArg: 0.569 ± 0.174
2.113MetSer: 2.113 ± 0.354
2.438MetThr: 2.438 ± 0.459
1.544MetVal: 1.544 ± 0.322
0.244MetTrp: 0.244 ± 0.129
0.732MetTyr: 0.732 ± 0.245
0.0MetXaa: 0.0 ± 0.0
Asn
4.064AsnAla: 4.064 ± 0.854
0.325AsnCys: 0.325 ± 0.178
3.495AsnAsp: 3.495 ± 0.512
3.901AsnGlu: 3.901 ± 0.539
2.52AsnPhe: 2.52 ± 0.453
5.365AsnGly: 5.365 ± 0.702
0.813AsnHis: 0.813 ± 0.207
4.633AsnIle: 4.633 ± 0.478
4.958AsnLys: 4.958 ± 0.582
4.796AsnLeu: 4.796 ± 0.658
0.813AsnMet: 0.813 ± 0.289
3.251AsnAsn: 3.251 ± 0.449
2.601AsnPro: 2.601 ± 0.358
1.463AsnGln: 1.463 ± 0.448
1.951AsnArg: 1.951 ± 0.449
3.658AsnSer: 3.658 ± 0.678
2.682AsnThr: 2.682 ± 0.521
3.658AsnVal: 3.658 ± 0.561
1.382AsnTrp: 1.382 ± 0.358
2.764AsnTyr: 2.764 ± 0.5
0.0AsnXaa: 0.0 ± 0.0
Pro
0.975ProAla: 0.975 ± 0.262
0.0ProCys: 0.0 ± 0.0
1.869ProAsp: 1.869 ± 0.43
2.113ProGlu: 2.113 ± 0.419
1.626ProPhe: 1.626 ± 0.39
1.707ProGly: 1.707 ± 0.577
0.244ProHis: 0.244 ± 0.143
2.113ProIle: 2.113 ± 0.424
3.089ProLys: 3.089 ± 0.589
2.357ProLeu: 2.357 ± 0.51
0.325ProMet: 0.325 ± 0.144
1.544ProAsn: 1.544 ± 0.349
0.65ProPro: 0.65 ± 0.217
1.463ProGln: 1.463 ± 0.422
1.057ProArg: 1.057 ± 0.286
2.52ProSer: 2.52 ± 0.417
1.707ProThr: 1.707 ± 0.445
1.219ProVal: 1.219 ± 0.267
0.0ProTrp: 0.0 ± 0.0
1.219ProTyr: 1.219 ± 0.384
0.0ProXaa: 0.0 ± 0.0
Gln
3.17GlnAla: 3.17 ± 0.511
0.163GlnCys: 0.163 ± 0.107
1.788GlnAsp: 1.788 ± 0.324
2.438GlnGlu: 2.438 ± 0.459
1.219GlnPhe: 1.219 ± 0.353
1.626GlnGly: 1.626 ± 0.332
0.569GlnHis: 0.569 ± 0.171
2.926GlnIle: 2.926 ± 0.393
4.145GlnLys: 4.145 ± 0.527
3.739GlnLeu: 3.739 ± 0.703
0.975GlnMet: 0.975 ± 0.322
1.707GlnAsn: 1.707 ± 0.347
0.569GlnPro: 0.569 ± 0.251
1.869GlnGln: 1.869 ± 0.59
1.544GlnArg: 1.544 ± 0.368
3.414GlnSer: 3.414 ± 0.453
2.113GlnThr: 2.113 ± 0.473
1.869GlnVal: 1.869 ± 0.406
0.406GlnTrp: 0.406 ± 0.16
0.894GlnTyr: 0.894 ± 0.246
0.0GlnXaa: 0.0 ± 0.0
Arg
1.869ArgAla: 1.869 ± 0.323
0.081ArgCys: 0.081 ± 0.065
2.195ArgAsp: 2.195 ± 0.496
2.845ArgGlu: 2.845 ± 0.6
1.788ArgPhe: 1.788 ± 0.397
1.951ArgGly: 1.951 ± 0.389
0.732ArgHis: 0.732 ± 0.285
2.926ArgIle: 2.926 ± 0.454
3.82ArgLys: 3.82 ± 0.761
3.82ArgLeu: 3.82 ± 0.754
1.3ArgMet: 1.3 ± 0.304
2.357ArgAsn: 2.357 ± 0.532
1.463ArgPro: 1.463 ± 0.393
1.707ArgGln: 1.707 ± 0.31
1.544ArgArg: 1.544 ± 0.392
1.626ArgSer: 1.626 ± 0.362
2.113ArgThr: 2.113 ± 0.343
1.707ArgVal: 1.707 ± 0.395
0.406ArgTrp: 0.406 ± 0.211
1.788ArgTyr: 1.788 ± 0.371
0.0ArgXaa: 0.0 ± 0.0
Ser
3.82SerAla: 3.82 ± 0.758
0.244SerCys: 0.244 ± 0.144
5.527SerAsp: 5.527 ± 0.814
4.308SerGlu: 4.308 ± 0.547
3.089SerPhe: 3.089 ± 0.565
4.877SerGly: 4.877 ± 0.889
0.813SerHis: 0.813 ± 0.3
4.552SerIle: 4.552 ± 0.464
5.121SerLys: 5.121 ± 0.689
4.308SerLeu: 4.308 ± 0.634
1.463SerMet: 1.463 ± 0.331
4.145SerAsn: 4.145 ± 0.628
0.894SerPro: 0.894 ± 0.261
2.764SerGln: 2.764 ± 0.539
2.682SerArg: 2.682 ± 0.395
5.283SerSer: 5.283 ± 0.627
3.983SerThr: 3.983 ± 0.856
4.47SerVal: 4.47 ± 0.532
0.488SerTrp: 0.488 ± 0.147
2.845SerTyr: 2.845 ± 0.579
0.0SerXaa: 0.0 ± 0.0
Thr
3.82ThrAla: 3.82 ± 0.818
0.325ThrCys: 0.325 ± 0.157
3.007ThrAsp: 3.007 ± 0.472
4.47ThrGlu: 4.47 ± 0.567
3.333ThrPhe: 3.333 ± 0.511
3.658ThrGly: 3.658 ± 0.585
0.894ThrHis: 0.894 ± 0.282
4.389ThrIle: 4.389 ± 0.735
6.584ThrLys: 6.584 ± 0.702
4.958ThrLeu: 4.958 ± 0.61
0.732ThrMet: 0.732 ± 0.283
3.333ThrAsn: 3.333 ± 0.549
1.869ThrPro: 1.869 ± 0.291
2.113ThrGln: 2.113 ± 0.363
2.195ThrArg: 2.195 ± 0.402
4.064ThrSer: 4.064 ± 0.734
4.389ThrThr: 4.389 ± 0.561
4.958ThrVal: 4.958 ± 0.842
0.406ThrTrp: 0.406 ± 0.207
2.682ThrTyr: 2.682 ± 0.448
0.0ThrXaa: 0.0 ± 0.0
Val
3.576ValAla: 3.576 ± 0.476
0.325ValCys: 0.325 ± 0.147
4.47ValAsp: 4.47 ± 0.536
4.633ValGlu: 4.633 ± 0.591
1.463ValPhe: 1.463 ± 0.414
4.877ValGly: 4.877 ± 0.868
0.732ValHis: 0.732 ± 0.3
4.552ValIle: 4.552 ± 0.702
6.665ValLys: 6.665 ± 0.685
5.283ValLeu: 5.283 ± 0.814
0.813ValMet: 0.813 ± 0.248
3.739ValAsn: 3.739 ± 0.572
1.626ValPro: 1.626 ± 0.412
2.032ValGln: 2.032 ± 0.428
3.17ValArg: 3.17 ± 0.568
4.308ValSer: 4.308 ± 0.656
4.877ValThr: 4.877 ± 0.576
4.552ValVal: 4.552 ± 0.662
0.65ValTrp: 0.65 ± 0.238
1.463ValTyr: 1.463 ± 0.294
0.0ValXaa: 0.0 ± 0.0
Trp
0.732TrpAla: 0.732 ± 0.252
0.163TrpCys: 0.163 ± 0.119
0.488TrpAsp: 0.488 ± 0.199
0.488TrpGlu: 0.488 ± 0.221
0.569TrpPhe: 0.569 ± 0.233
1.057TrpGly: 1.057 ± 0.29
0.244TrpHis: 0.244 ± 0.148
1.057TrpIle: 1.057 ± 0.253
0.894TrpLys: 0.894 ± 0.262
1.138TrpLeu: 1.138 ± 0.338
0.081TrpMet: 0.081 ± 0.091
0.732TrpAsn: 0.732 ± 0.298
0.0TrpPro: 0.0 ± 0.0
0.894TrpGln: 0.894 ± 0.24
0.406TrpArg: 0.406 ± 0.162
0.813TrpSer: 0.813 ± 0.251
0.975TrpThr: 0.975 ± 0.377
0.325TrpVal: 0.325 ± 0.143
0.163TrpTrp: 0.163 ± 0.116
0.488TrpTyr: 0.488 ± 0.189
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.251TyrAla: 3.251 ± 0.471
0.813TyrCys: 0.813 ± 0.235
2.926TyrAsp: 2.926 ± 0.394
2.926TyrGlu: 2.926 ± 0.593
1.788TyrPhe: 1.788 ± 0.335
2.438TyrGly: 2.438 ± 0.418
0.975TyrHis: 0.975 ± 0.231
1.788TyrIle: 1.788 ± 0.382
3.576TyrLys: 3.576 ± 0.617
3.82TyrLeu: 3.82 ± 0.605
1.057TyrMet: 1.057 ± 0.287
2.764TyrAsn: 2.764 ± 0.483
1.382TyrPro: 1.382 ± 0.369
1.869TyrGln: 1.869 ± 0.321
1.219TyrArg: 1.219 ± 0.295
2.764TyrSer: 2.764 ± 0.454
2.195TyrThr: 2.195 ± 0.465
2.438TyrVal: 2.438 ± 0.396
0.325TyrTrp: 0.325 ± 0.165
1.869TyrTyr: 1.869 ± 0.461
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (12304 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski