Amino acid dipeptide frequency for Streptococcus satellite phage Javan455

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
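The counting described above can be sketched as follows. This is only an illustration, not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter codes, the mapping of every non-standard letter to `X`, and the choice to count overlapping pairs within each protein separately are all assumptions.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(sequences, alphabet=STANDARD, unknown="X"):
    """Return {dipeptide: frequency in per mille} for a list of protein strings."""
    symbols = alphabet + unknown
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to the unknown symbol.
        cleaned = "".join(c if c in alphabet else unknown for c in seq.upper())
        # Overlapping windows of length 2, counted within each protein only.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    pairs = [a + b for a, b in product(symbols, repeat=2)]
    if total == 0:
        return {pair: 0.0 for pair in pairs}
    return {pair: 1000.0 * counts[pair] / total for pair in pairs}

# Two toy sequences (hypothetical, not taken from this proteome).
freqs = dipeptide_per_mille(["MKKLLE", "AKLE"])
print(len(freqs))   # number of table cells, 21 x 21
print(freqs["KL"])  # per mille frequency of Lys-Leu in the toy data
```

The result has one entry per cell of the 21 x 21 table, and the entries sum to 1000‰ whenever at least one dipeptide was counted.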

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
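As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a plain fraction and compares it with the uniform expectation of 0.25% (2.5‰ for 400 dipeptides). The helper names are hypothetical, and the simple greater-than or less-than rule is an assumption; the exact thresholds used for the red and blue marking are not stated on this page.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value (as in the table) to a plain fraction."""
    return value * 1e-3

def uniform_expectation_per_mille(n_residues=20):
    """Per mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (n_residues ** 2)

def classify(value, n_residues=20):
    """Label a per mille value relative to the uniform expectation."""
    expected = uniform_expectation_per_mille(n_residues)
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"

# Values taken from the table: LeuGlu = 11.015, CysCys = 0.298.
print(per_mille_to_fraction(11.015))
print(classify(11.015), classify(0.298))
```

Passing `n_residues=21` gives the expectation for the 441-cell table that includes Xaa.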

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.893AlaAla: 0.893 ± 0.41
1.191AlaCys: 1.191 ± 0.569
2.382AlaAsp: 2.382 ± 0.859
4.466AlaGlu: 4.466 ± 1.136
4.168AlaPhe: 4.168 ± 0.883
3.275AlaGly: 3.275 ± 0.778
0.0AlaHis: 0.0 ± 0.0
6.252AlaIle: 6.252 ± 1.349
4.168AlaLys: 4.168 ± 1.137
4.466AlaLeu: 4.466 ± 1.369
1.489AlaMet: 1.489 ± 0.631
3.275AlaAsn: 3.275 ± 0.726
1.191AlaPro: 1.191 ± 0.601
1.786AlaGln: 1.786 ± 0.449
2.679AlaArg: 2.679 ± 0.628
4.763AlaSer: 4.763 ± 1.486
4.168AlaThr: 4.168 ± 0.537
2.977AlaVal: 2.977 ± 0.868
1.191AlaTrp: 1.191 ± 0.646
2.084AlaTyr: 2.084 ± 0.702
0.0AlaXaa: 0.0 ± 0.0
Cys
1.191CysAla: 1.191 ± 0.542
0.298CysCys: 0.298 ± 0.291
0.595CysAsp: 0.595 ± 0.373
0.298CysGlu: 0.298 ± 0.291
0.0CysPhe: 0.0 ± 0.0
0.298CysGly: 0.298 ± 0.291
0.298CysHis: 0.298 ± 0.291
0.0CysIle: 0.0 ± 0.0
0.298CysLys: 0.298 ± 0.279
1.191CysLeu: 1.191 ± 0.565
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.595CysPro: 0.595 ± 0.417
0.595CysGln: 0.595 ± 0.336
0.298CysArg: 0.298 ± 0.291
0.298CysSer: 0.298 ± 0.306
0.298CysThr: 0.298 ± 0.281
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.595CysTyr: 0.595 ± 0.368
0.0CysXaa: 0.0 ± 0.0
Asp
3.275AspAla: 3.275 ± 0.63
1.191AspCys: 1.191 ± 0.676
2.977AspAsp: 2.977 ± 0.835
4.168AspGlu: 4.168 ± 1.178
2.679AspPhe: 2.679 ± 0.928
2.679AspGly: 2.679 ± 0.743
1.191AspHis: 1.191 ± 0.621
8.634AspIle: 8.634 ± 1.385
5.359AspLys: 5.359 ± 1.114
4.763AspLeu: 4.763 ± 0.739
1.786AspMet: 1.786 ± 0.864
1.786AspAsn: 1.786 ± 0.67
1.489AspPro: 1.489 ± 0.581
1.786AspGln: 1.786 ± 0.765
2.382AspArg: 2.382 ± 0.677
4.168AspSer: 4.168 ± 1.247
4.168AspThr: 4.168 ± 1.246
0.595AspVal: 0.595 ± 0.379
0.595AspTrp: 0.595 ± 0.388
4.168AspTyr: 4.168 ± 1.0
0.0AspXaa: 0.0 ± 0.0
Glu
6.847GluAla: 6.847 ± 1.164
1.191GluCys: 1.191 ± 0.581
5.359GluAsp: 5.359 ± 1.586
7.443GluGlu: 7.443 ± 1.663
2.382GluPhe: 2.382 ± 0.943
1.786GluGly: 1.786 ± 0.79
2.084GluHis: 2.084 ± 0.611
5.656GluIle: 5.656 ± 1.259
5.359GluLys: 5.359 ± 0.733
8.634GluLeu: 8.634 ± 1.353
1.786GluMet: 1.786 ± 0.583
2.084GluAsn: 2.084 ± 0.692
1.786GluPro: 1.786 ± 0.694
3.572GluGln: 3.572 ± 1.203
3.572GluArg: 3.572 ± 1.047
0.893GluSer: 0.893 ± 0.544
4.466GluThr: 4.466 ± 1.179
3.275GluVal: 3.275 ± 0.854
0.595GluTrp: 0.595 ± 0.432
4.466GluTyr: 4.466 ± 1.024
0.0GluXaa: 0.0 ± 0.0
Phe
1.191PheAla: 1.191 ± 0.398
0.0PheCys: 0.0 ± 0.0
2.977PheAsp: 2.977 ± 0.681
2.679PheGlu: 2.679 ± 0.807
2.084PhePhe: 2.084 ± 0.663
1.786PheGly: 1.786 ± 0.772
1.786PheHis: 1.786 ± 0.509
2.382PheIle: 2.382 ± 0.594
5.359PheLys: 5.359 ± 1.193
3.572PheLeu: 3.572 ± 0.98
0.0PheMet: 0.0 ± 0.0
2.084PheAsn: 2.084 ± 0.789
1.191PhePro: 1.191 ± 0.744
0.893PheGln: 0.893 ± 0.385
1.489PheArg: 1.489 ± 0.63
2.382PheSer: 2.382 ± 0.636
3.87PheThr: 3.87 ± 0.54
2.382PheVal: 2.382 ± 0.636
0.298PheTrp: 0.298 ± 0.238
1.786PheTyr: 1.786 ± 0.656
0.0PheXaa: 0.0 ± 0.0
Gly
2.679GlyAla: 2.679 ± 1.059
0.0GlyCys: 0.0 ± 0.0
5.359GlyAsp: 5.359 ± 1.306
2.084GlyGlu: 2.084 ± 0.551
2.084GlyPhe: 2.084 ± 0.828
2.382GlyGly: 2.382 ± 0.803
0.893GlyHis: 0.893 ± 0.588
1.489GlyIle: 1.489 ± 0.859
4.466GlyLys: 4.466 ± 0.985
5.359GlyLeu: 5.359 ± 1.259
0.893GlyMet: 0.893 ± 0.527
2.084GlyAsn: 2.084 ± 0.656
0.595GlyPro: 0.595 ± 0.389
2.977GlyGln: 2.977 ± 1.434
2.977GlyArg: 2.977 ± 0.787
2.382GlySer: 2.382 ± 0.83
3.572GlyThr: 3.572 ± 0.876
3.572GlyVal: 3.572 ± 0.901
0.595GlyTrp: 0.595 ± 0.342
2.977GlyTyr: 2.977 ± 0.655
0.0GlyXaa: 0.0 ± 0.0
His
2.977HisAla: 2.977 ± 1.178
0.0HisCys: 0.0 ± 0.0
0.298HisAsp: 0.298 ± 0.281
0.298HisGlu: 0.298 ± 0.238
0.0HisPhe: 0.0 ± 0.0
1.191HisGly: 1.191 ± 0.489
0.298HisHis: 0.298 ± 0.238
1.191HisIle: 1.191 ± 0.543
2.382HisLys: 2.382 ± 0.774
2.382HisLeu: 2.382 ± 0.9
0.0HisMet: 0.0 ± 0.0
1.489HisAsn: 1.489 ± 0.735
0.595HisPro: 0.595 ± 0.343
0.893HisGln: 0.893 ± 0.535
0.595HisArg: 0.595 ± 0.392
0.595HisSer: 0.595 ± 0.525
1.191HisThr: 1.191 ± 0.428
0.298HisVal: 0.298 ± 0.238
0.595HisTrp: 0.595 ± 0.336
1.786HisTyr: 1.786 ± 0.769
0.0HisXaa: 0.0 ± 0.0
Ile
5.359IleAla: 5.359 ± 1.282
0.298IleCys: 0.298 ± 0.291
7.74IleAsp: 7.74 ± 1.341
4.763IleGlu: 4.763 ± 0.877
2.382IlePhe: 2.382 ± 0.75
2.679IleGly: 2.679 ± 0.864
1.489IleHis: 1.489 ± 0.943
4.466IleIle: 4.466 ± 1.166
8.931IleLys: 8.931 ± 1.303
4.168IleLeu: 4.168 ± 0.746
1.191IleMet: 1.191 ± 0.541
4.763IleAsn: 4.763 ± 1.028
3.275IlePro: 3.275 ± 1.07
2.382IleGln: 2.382 ± 0.82
3.275IleArg: 3.275 ± 0.796
4.466IleSer: 4.466 ± 1.234
5.359IleThr: 5.359 ± 1.146
1.489IleVal: 1.489 ± 0.447
0.0IleTrp: 0.0 ± 0.0
1.191IleTyr: 1.191 ± 0.634
0.0IleXaa: 0.0 ± 0.0
Lys
5.954LysAla: 5.954 ± 1.188
0.0LysCys: 0.0 ± 0.0
4.168LysAsp: 4.168 ± 0.946
9.824LysGlu: 9.824 ± 1.833
2.382LysPhe: 2.382 ± 0.625
5.656LysGly: 5.656 ± 1.524
2.382LysHis: 2.382 ± 0.574
4.168LysIle: 4.168 ± 0.981
7.145LysLys: 7.145 ± 1.802
7.443LysLeu: 7.443 ± 1.695
1.786LysMet: 1.786 ± 0.901
6.252LysAsn: 6.252 ± 1.118
4.763LysPro: 4.763 ± 1.281
4.168LysGln: 4.168 ± 0.97
5.061LysArg: 5.061 ± 0.987
3.87LysSer: 3.87 ± 1.237
5.954LysThr: 5.954 ± 1.353
5.359LysVal: 5.359 ± 0.922
0.298LysTrp: 0.298 ± 0.28
2.977LysTyr: 2.977 ± 0.82
0.0LysXaa: 0.0 ± 0.0
Leu
5.061LeuAla: 5.061 ± 1.215
0.298LeuCys: 0.298 ± 0.284
5.061LeuAsp: 5.061 ± 1.253
11.015LeuGlu: 11.015 ± 1.219
4.466LeuPhe: 4.466 ± 1.075
5.954LeuGly: 5.954 ± 1.328
1.786LeuHis: 1.786 ± 0.603
7.145LeuIle: 7.145 ± 1.21
9.824LeuLys: 9.824 ± 1.262
10.122LeuLeu: 10.122 ± 1.562
1.191LeuMet: 1.191 ± 0.482
5.954LeuAsn: 5.954 ± 1.629
5.061LeuPro: 5.061 ± 1.307
3.275LeuGln: 3.275 ± 0.716
2.084LeuArg: 2.084 ± 0.728
8.634LeuSer: 8.634 ± 1.8
4.466LeuThr: 4.466 ± 0.795
3.87LeuVal: 3.87 ± 1.12
0.595LeuTrp: 0.595 ± 0.364
5.656LeuTyr: 5.656 ± 0.999
0.0LeuXaa: 0.0 ± 0.0
Met
2.679MetAla: 2.679 ± 1.032
0.298MetCys: 0.298 ± 0.284
1.489MetAsp: 1.489 ± 0.536
1.191MetGlu: 1.191 ± 0.572
0.298MetPhe: 0.298 ± 0.306
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.298MetIle: 0.298 ± 0.285
2.679MetLys: 2.679 ± 0.628
1.786MetLeu: 1.786 ± 0.574
0.298MetMet: 0.298 ± 0.31
1.786MetAsn: 1.786 ± 0.587
0.298MetPro: 0.298 ± 0.279
0.298MetGln: 0.298 ± 0.31
1.786MetArg: 1.786 ± 0.691
0.893MetSer: 0.893 ± 0.48
4.168MetThr: 4.168 ± 0.982
0.298MetVal: 0.298 ± 0.291
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.275AsnAla: 3.275 ± 0.951
0.0AsnCys: 0.0 ± 0.0
1.786AsnAsp: 1.786 ± 0.852
2.679AsnGlu: 2.679 ± 0.934
1.191AsnPhe: 1.191 ± 0.692
3.572AsnGly: 3.572 ± 0.912
1.191AsnHis: 1.191 ± 0.481
2.977AsnIle: 2.977 ± 1.102
3.572AsnLys: 3.572 ± 0.644
6.55AsnLeu: 6.55 ± 1.151
1.191AsnMet: 1.191 ± 0.461
2.382AsnAsn: 2.382 ± 0.726
2.084AsnPro: 2.084 ± 0.599
3.275AsnGln: 3.275 ± 1.147
4.466AsnArg: 4.466 ± 0.969
2.977AsnSer: 2.977 ± 0.809
2.679AsnThr: 2.679 ± 0.884
2.679AsnVal: 2.679 ± 0.93
1.191AsnTrp: 1.191 ± 0.662
2.084AsnTyr: 2.084 ± 0.604
0.0AsnXaa: 0.0 ± 0.0
Pro
0.298ProAla: 0.298 ± 0.328
0.595ProCys: 0.595 ± 0.434
1.786ProAsp: 1.786 ± 0.717
4.763ProGlu: 4.763 ± 1.021
1.489ProPhe: 1.489 ± 0.615
0.298ProGly: 0.298 ± 0.291
0.0ProHis: 0.0 ± 0.0
1.489ProIle: 1.489 ± 0.768
4.763ProLys: 4.763 ± 1.145
2.679ProLeu: 2.679 ± 0.968
0.298ProMet: 0.298 ± 0.291
2.382ProAsn: 2.382 ± 0.732
0.893ProPro: 0.893 ± 0.468
1.489ProGln: 1.489 ± 0.538
2.382ProArg: 2.382 ± 0.853
1.489ProSer: 1.489 ± 0.491
3.572ProThr: 3.572 ± 0.695
2.382ProVal: 2.382 ± 0.794
0.0ProTrp: 0.0 ± 0.0
1.786ProTyr: 1.786 ± 0.961
0.0ProXaa: 0.0 ± 0.0
Gln
2.679GlnAla: 2.679 ± 0.758
0.298GlnCys: 0.298 ± 0.285
2.084GlnAsp: 2.084 ± 0.644
2.977GlnGlu: 2.977 ± 0.95
0.893GlnPhe: 0.893 ± 0.415
2.382GlnGly: 2.382 ± 0.898
0.893GlnHis: 0.893 ± 0.362
2.977GlnIle: 2.977 ± 0.656
4.466GlnLys: 4.466 ± 1.235
7.443GlnLeu: 7.443 ± 1.275
0.893GlnMet: 0.893 ± 0.836
1.489GlnAsn: 1.489 ± 0.553
2.084GlnPro: 2.084 ± 0.831
3.275GlnGln: 3.275 ± 1.112
3.275GlnArg: 3.275 ± 0.684
2.382GlnSer: 2.382 ± 0.833
1.191GlnThr: 1.191 ± 0.543
3.275GlnVal: 3.275 ± 0.968
0.298GlnTrp: 0.298 ± 0.284
1.489GlnTyr: 1.489 ± 0.489
0.0GlnXaa: 0.0 ± 0.0
Arg
1.489ArgAla: 1.489 ± 0.467
0.595ArgCys: 0.595 ± 0.364
3.275ArgAsp: 3.275 ± 0.851
2.382ArgGlu: 2.382 ± 0.887
2.382ArgPhe: 2.382 ± 0.726
2.382ArgGly: 2.382 ± 0.779
1.489ArgHis: 1.489 ± 0.673
2.679ArgIle: 2.679 ± 0.658
5.359ArgLys: 5.359 ± 1.311
5.359ArgLeu: 5.359 ± 1.082
0.893ArgMet: 0.893 ± 0.38
2.382ArgAsn: 2.382 ± 0.89
1.489ArgPro: 1.489 ± 0.511
3.572ArgGln: 3.572 ± 0.785
2.382ArgArg: 2.382 ± 0.728
2.679ArgSer: 2.679 ± 0.86
2.679ArgThr: 2.679 ± 0.7
3.87ArgVal: 3.87 ± 0.854
0.595ArgTrp: 0.595 ± 0.389
3.87ArgTyr: 3.87 ± 1.095
0.0ArgXaa: 0.0 ± 0.0
Ser
3.275SerAla: 3.275 ± 0.753
0.298SerCys: 0.298 ± 0.291
4.763SerAsp: 4.763 ± 0.954
3.275SerGlu: 3.275 ± 0.831
2.679SerPhe: 2.679 ± 0.826
1.786SerGly: 1.786 ± 0.686
0.595SerHis: 0.595 ± 0.364
5.061SerIle: 5.061 ± 0.872
5.061SerLys: 5.061 ± 0.984
7.443SerLeu: 7.443 ± 0.955
1.786SerMet: 1.786 ± 0.575
1.786SerAsn: 1.786 ± 0.6
0.893SerPro: 0.893 ± 0.371
2.679SerGln: 2.679 ± 0.597
1.786SerArg: 1.786 ± 0.534
1.489SerSer: 1.489 ± 0.672
2.977SerThr: 2.977 ± 1.119
3.275SerVal: 3.275 ± 1.096
0.595SerTrp: 0.595 ± 0.392
2.084SerTyr: 2.084 ± 0.848
0.0SerXaa: 0.0 ± 0.0
Thr
4.168ThrAla: 4.168 ± 1.038
0.0ThrCys: 0.0 ± 0.0
2.084ThrAsp: 2.084 ± 0.862
3.572ThrGlu: 3.572 ± 1.029
3.275ThrPhe: 3.275 ± 1.15
5.954ThrGly: 5.954 ± 0.987
0.893ThrHis: 0.893 ± 0.408
4.763ThrIle: 4.763 ± 0.96
2.679ThrLys: 2.679 ± 1.134
7.145ThrLeu: 7.145 ± 1.298
1.786ThrMet: 1.786 ± 0.65
1.786ThrAsn: 1.786 ± 0.681
3.87ThrPro: 3.87 ± 0.942
3.87ThrGln: 3.87 ± 0.867
3.572ThrArg: 3.572 ± 1.071
3.275ThrSer: 3.275 ± 0.788
2.679ThrThr: 2.679 ± 1.053
3.572ThrVal: 3.572 ± 1.056
0.893ThrTrp: 0.893 ± 0.492
4.763ThrTyr: 4.763 ± 1.296
0.0ThrXaa: 0.0 ± 0.0
Val
2.084ValAla: 2.084 ± 0.641
0.298ValCys: 0.298 ± 0.291
1.489ValAsp: 1.489 ± 0.663
1.786ValGlu: 1.786 ± 0.79
2.382ValPhe: 2.382 ± 0.72
2.679ValGly: 2.679 ± 0.697
0.0ValHis: 0.0 ± 0.0
6.252ValIle: 6.252 ± 1.283
3.572ValLys: 3.572 ± 0.676
5.359ValLeu: 5.359 ± 1.234
1.489ValMet: 1.489 ± 0.696
2.977ValAsn: 2.977 ± 0.932
0.893ValPro: 0.893 ± 0.489
2.382ValGln: 2.382 ± 0.928
2.679ValArg: 2.679 ± 0.821
3.275ValSer: 3.275 ± 0.85
3.87ValThr: 3.87 ± 1.338
2.382ValVal: 2.382 ± 0.786
0.595ValTrp: 0.595 ± 0.476
2.084ValTyr: 2.084 ± 0.507
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.893TrpAsp: 0.893 ± 0.522
1.191TrpGlu: 1.191 ± 0.612
0.0TrpPhe: 0.0 ± 0.0
0.595TrpGly: 0.595 ± 0.363
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.595TrpLys: 0.595 ± 0.476
2.084TrpLeu: 2.084 ± 0.733
0.0TrpMet: 0.0 ± 0.0
0.298TrpAsn: 0.298 ± 0.284
0.298TrpPro: 0.298 ± 0.238
0.595TrpGln: 0.595 ± 0.368
0.595TrpArg: 0.595 ± 0.389
0.595TrpSer: 0.595 ± 0.347
0.0TrpThr: 0.0 ± 0.0
1.191TrpVal: 1.191 ± 0.572
0.595TrpTrp: 0.595 ± 0.567
0.595TrpTyr: 0.595 ± 0.374
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.489TyrAla: 1.489 ± 0.444
0.298TyrCys: 0.298 ± 0.28
3.275TyrAsp: 3.275 ± 0.783
2.382TyrGlu: 2.382 ± 0.777
2.977TyrPhe: 2.977 ± 0.843
2.382TyrGly: 2.382 ± 0.76
1.786TyrHis: 1.786 ± 0.575
1.786TyrIle: 1.786 ± 0.653
3.572TyrLys: 3.572 ± 1.047
3.572TyrLeu: 3.572 ± 0.755
1.489TyrMet: 1.489 ± 0.832
4.763TyrAsn: 4.763 ± 0.921
1.489TyrPro: 1.489 ± 0.845
2.977TyrGln: 2.977 ± 1.029
4.466TyrArg: 4.466 ± 1.353
2.084TyrSer: 2.084 ± 0.668
3.275TyrThr: 3.275 ± 0.528
1.786TyrVal: 1.786 ± 0.738
0.595TyrTrp: 0.595 ± 0.581
4.466TyrTyr: 4.466 ± 1.067
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 21 proteins (3360 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
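A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed seed, and the use of the sample standard deviation as the "±" value are assumptions; the page does not say which spread measure is reported.

```python
import random
import statistics

def frequency_per_mille(sequences, pair):
    """Per mille frequency of one dipeptide over a list of protein strings."""
    hits = 0
    total = 0
    for seq in sequences:
        windows = max(len(seq) - 1, 0)
        total += windows
        # Count overlapping occurrences of the pair within this protein.
        hits += sum(1 for i in range(windows) if seq[i:i + 2] == pair)
    return 1000.0 * hits / total if total else 0.0

def bootstrap_sd(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(frequency_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome (hypothetical sequences, one-letter code).
toy = ["MKKLLE", "AKLE", "MLEKLE", "GGKLA"]
print(frequency_per_mille(toy, "LE"), bootstrap_sd(toy, "LE"))
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the estimate reflects variation between proteins.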

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski