Amino acid dipeptide frequency for Streptococcus satellite phage Javan315

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
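As a rough illustration of what such a calculation involves, the sketch below counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_permille`, the use of one-letter codes, and the choice to normalize by the number of counted pairs are assumptions made for this example; the exact normalization used by Proteome-pI is not stated on this page and may differ.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs.

    Assumption: pairs are counted within each protein only (they never span
    two proteins) and are normalized by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKKLE", "MKE"]
freqs = dipeptide_permille(toy)
```

Here the toy input contains six pairs in total, two of which are "MK", so `freqs["MK"]` is one third of 1000‰.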

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins (with all amino acids equally common), each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
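The uniform expectation described above can be checked with a few lines of arithmetic. This is a minimal sketch: the function names and the plain greater-than/less-than comparison (with no significance margin) are assumptions for illustration, and the exact rule used to color the table is not given on this page.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a value as over- or underrepresented relative to the uniform expectation."""
    expected = uniform_expectation_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values taken from the table: LysLys = 11.873, AlaAla = 0.383 (both in per mille).
lyslys = classify(11.873)
alaala = classify(0.383)
```

With 20 amino acids the expectation is 2.5‰, so LysLys is classified as "over" and AlaAla as "under". Passing `alphabet_size=21` gives the expectation for the 441-dipeptide case that includes Xaa.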

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 0.383 ± 0.355 | 0.0 ± 0.0 | 3.447 ± 1.194 | 5.745 ± 1.817 | 3.064 ± 1.006 | 2.298 ± 0.657 | 0.0 ± 0.0 | 4.979 ± 0.868 | 6.894 ± 1.511 | 4.979 ± 1.683 | 2.298 ± 0.922 | 0.383 ± 0.355 | 0.0 ± 0.0 | 1.532 ± 0.829 | 2.298 ± 0.963 | 4.979 ± 1.244 | 3.064 ± 0.722 | 1.532 ± 0.569 | 0.766 ± 0.479 | 1.532 ± 0.577 | 0.0 ± 0.0 |
| Cys | 0.383 ± 0.355 | 0.0 ± 0.0 | 0.383 ± 0.355 | 0.383 ± 0.304 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.383 ± 0.326 | 0.0 ± 0.0 | 0.383 ± 0.399 | 0.766 ± 0.416 | 0.383 ± 0.381 | 0.383 ± 0.416 | 0.383 ± 0.326 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.383 ± 0.304 | 0.0 ± 0.0 | 0.383 ± 0.318 | 0.0 ± 0.0 |
| Asp | 0.383 ± 0.426 | 0.0 ± 0.0 | 4.596 ± 1.418 | 4.979 ± 1.841 | 3.447 ± 1.25 | 2.681 ± 0.602 | 0.383 ± 0.304 | 5.362 ± 0.975 | 5.362 ± 1.182 | 9.575 ± 2.69 | 2.298 ± 0.693 | 3.064 ± 1.009 | 1.149 ± 0.638 | 1.915 ± 0.664 | 5.362 ± 1.533 | 4.213 ± 1.638 | 4.213 ± 1.5 | 2.681 ± 0.909 | 0.0 ± 0.0 | 3.83 ± 1.253 | 0.0 ± 0.0 |
| Glu | 3.83 ± 0.757 | 0.766 ± 0.416 | 3.064 ± 1.047 | 9.192 ± 2.17 | 3.447 ± 1.237 | 2.681 ± 0.766 | 1.149 ± 0.501 | 8.809 ± 1.417 | 6.894 ± 1.415 | 8.809 ± 1.669 | 2.681 ± 0.998 | 4.213 ± 0.788 | 0.383 ± 0.416 | 5.745 ± 1.774 | 4.979 ± 1.757 | 3.447 ± 1.048 | 5.362 ± 1.425 | 6.511 ± 1.517 | 0.383 ± 0.318 | 4.979 ± 0.78 | 0.0 ± 0.0 |
| Phe | 2.681 ± 0.99 | 0.0 ± 0.0 | 4.596 ± 1.108 | 3.447 ± 1.147 | 1.915 ± 0.919 | 2.681 ± 0.977 | 0.766 ± 0.489 | 4.213 ± 1.387 | 3.447 ± 1.623 | 5.745 ± 1.955 | 1.532 ± 0.916 | 0.0 ± 0.0 | 0.766 ± 0.417 | 0.766 ± 0.44 | 1.532 ± 0.671 | 3.83 ± 0.905 | 3.064 ± 1.067 | 2.681 ± 0.938 | 0.383 ± 0.304 | 1.149 ± 0.654 | 0.0 ± 0.0 |
| Gly | 3.447 ± 0.94 | 0.383 ± 0.326 | 3.83 ± 1.053 | 4.213 ± 1.393 | 2.681 ± 0.713 | 1.149 ± 0.713 | 1.532 ± 0.535 | 3.064 ± 0.995 | 3.83 ± 0.969 | 5.745 ± 1.363 | 1.532 ± 0.925 | 1.149 ± 0.751 | 0.0 ± 0.0 | 1.532 ± 0.782 | 2.298 ± 0.837 | 1.915 ± 0.974 | 1.532 ± 0.928 | 4.213 ± 1.121 | 2.298 ± 0.976 | 3.447 ± 1.046 | 0.0 ± 0.0 |
| His | 1.149 ± 0.979 | 0.0 ± 0.0 | 1.532 ± 0.617 | 1.532 ± 0.714 | 0.766 ± 0.583 | 0.766 ± 0.415 | 0.0 ± 0.0 | 0.383 ± 0.318 | 1.532 ± 0.908 | 1.915 ± 0.892 | 0.0 ± 0.0 | 0.766 ± 0.494 | 0.0 ± 0.0 | 0.766 ± 0.468 | 0.0 ± 0.0 | 1.149 ± 0.747 | 0.766 ± 0.652 | 0.383 ± 0.355 | 0.0 ± 0.0 | 2.298 ± 0.777 | 0.0 ± 0.0 |
| Ile | 5.745 ± 1.846 | 0.383 ± 0.416 | 2.681 ± 1.116 | 4.213 ± 1.064 | 1.915 ± 1.036 | 3.064 ± 1.053 | 0.766 ± 0.541 | 6.128 ± 1.36 | 8.426 ± 2.869 | 6.128 ± 1.086 | 1.532 ± 0.563 | 1.532 ± 0.562 | 3.447 ± 1.324 | 3.064 ± 0.984 | 2.681 ± 0.828 | 4.596 ± 1.447 | 3.064 ± 1.052 | 1.532 ± 0.611 | 0.383 ± 0.434 | 3.83 ± 1.05 | 0.0 ± 0.0 |
| Lys | 6.894 ± 1.789 | 0.0 ± 0.0 | 7.277 ± 1.525 | 8.426 ± 1.497 | 5.362 ± 1.424 | 6.894 ± 1.137 | 1.532 ± 0.706 | 4.213 ± 1.147 | 11.873 ± 1.915 | 7.66 ± 1.091 | 1.915 ± 0.867 | 6.128 ± 1.398 | 2.681 ± 0.684 | 3.064 ± 1.044 | 7.277 ± 1.25 | 6.894 ± 1.094 | 5.362 ± 1.402 | 4.979 ± 1.504 | 1.149 ± 0.58 | 3.064 ± 1.008 | 0.0 ± 0.0 |
| Leu | 6.128 ± 1.846 | 0.383 ± 0.381 | 9.192 ± 2.185 | 11.107 ± 1.659 | 4.979 ± 1.991 | 5.362 ± 1.342 | 1.532 ± 0.677 | 3.447 ± 0.84 | 11.49 ± 1.469 | 7.277 ± 1.565 | 1.915 ± 1.099 | 5.362 ± 1.426 | 3.064 ± 1.146 | 3.83 ± 1.698 | 3.064 ± 0.922 | 5.362 ± 1.053 | 6.894 ± 1.971 | 4.596 ± 1.135 | 0.383 ± 0.326 | 4.596 ± 1.352 | 0.0 ± 0.0 |
| Met | 1.915 ± 0.86 | 0.0 ± 0.0 | 2.681 ± 0.928 | 2.681 ± 1.191 | 1.149 ± 0.682 | 1.532 ± 1.066 | 0.383 ± 0.48 | 2.681 ± 1.244 | 2.298 ± 0.796 | 1.915 ± 0.825 | 1.915 ± 0.54 | 1.915 ± 0.674 | 0.0 ± 0.0 | 0.766 ± 0.417 | 1.532 ± 0.785 | 0.383 ± 0.355 | 2.298 ± 1.003 | 2.681 ± 1.442 | 0.0 ± 0.0 | 0.766 ± 0.417 | 0.0 ± 0.0 |
| Asn | 2.298 ± 0.865 | 0.383 ± 0.318 | 2.681 ± 0.74 | 3.447 ± 0.879 | 1.149 ± 0.526 | 3.447 ± 1.217 | 0.766 ± 0.494 | 1.532 ± 0.538 | 4.596 ± 1.677 | 6.894 ± 1.478 | 1.915 ± 0.615 | 1.915 ± 0.886 | 1.532 ± 0.543 | 3.064 ± 1.241 | 2.298 ± 0.845 | 2.298 ± 1.251 | 3.064 ± 0.796 | 1.149 ± 0.492 | 0.383 ± 0.304 | 3.064 ± 1.002 | 0.0 ± 0.0 |
| Pro | 0.766 ± 0.652 | 0.0 ± 0.0 | 2.298 ± 0.93 | 1.532 ± 0.859 | 1.532 ± 0.639 | 1.149 ± 0.609 | 0.0 ± 0.0 | 1.149 ± 0.621 | 2.681 ± 0.917 | 1.532 ± 0.821 | 1.149 ± 0.462 | 1.149 ± 0.488 | 1.915 ± 1.078 | 0.0 ± 0.0 | 1.532 ± 0.768 | 0.383 ± 0.426 | 1.532 ± 0.68 | 0.766 ± 0.541 | 0.0 ± 0.0 | 1.149 ± 0.659 | 0.0 ± 0.0 |
| Gln | 1.915 ± 0.897 | 0.0 ± 0.0 | 0.766 ± 0.494 | 4.979 ± 1.445 | 1.149 ± 0.66 | 1.149 ± 0.55 | 2.298 ± 1.126 | 3.064 ± 1.226 | 3.064 ± 1.272 | 4.213 ± 1.524 | 1.149 ± 0.567 | 3.447 ± 1.092 | 1.149 ± 0.665 | 3.447 ± 1.531 | 2.681 ± 0.914 | 2.298 ± 0.693 | 2.298 ± 0.853 | 3.064 ± 1.081 | 0.383 ± 0.326 | 1.532 ± 0.485 | 0.0 ± 0.0 |
| Arg | 2.298 ± 1.258 | 0.0 ± 0.0 | 3.064 ± 1.1 | 3.83 ± 1.327 | 2.298 ± 0.753 | 3.83 ± 0.967 | 0.766 ± 0.422 | 2.681 ± 0.762 | 3.447 ± 1.188 | 6.894 ± 1.586 | 2.298 ± 1.467 | 2.681 ± 0.855 | 0.766 ± 0.417 | 3.447 ± 1.065 | 1.915 ± 0.948 | 1.532 ± 0.62 | 1.532 ± 0.538 | 2.298 ± 1.022 | 0.0 ± 0.0 | 3.447 ± 1.211 | 0.0 ± 0.0 |
| Ser | 1.532 ± 0.823 | 1.149 ± 0.501 | 4.213 ± 1.647 | 3.447 ± 1.019 | 2.298 ± 0.783 | 1.915 ± 0.659 | 0.766 ± 0.541 | 4.979 ± 1.356 | 6.894 ± 1.521 | 4.213 ± 0.8 | 1.915 ± 1.148 | 1.915 ± 0.736 | 2.298 ± 0.687 | 2.298 ± 0.806 | 1.915 ± 0.822 | 3.064 ± 1.185 | 2.681 ± 1.044 | 2.681 ± 1.054 | 0.766 ± 0.621 | 2.681 ± 0.628 | 0.0 ± 0.0 |
| Thr | 1.149 ± 0.598 | 0.0 ± 0.0 | 4.213 ± 1.398 | 4.596 ± 1.125 | 2.298 ± 0.801 | 3.447 ± 1.423 | 1.532 ± 0.58 | 2.681 ± 0.719 | 6.511 ± 1.713 | 5.745 ± 1.822 | 1.149 ± 0.574 | 3.83 ± 1.032 | 1.149 ± 0.495 | 2.681 ± 0.837 | 1.532 ± 0.717 | 2.681 ± 0.971 | 1.915 ± 0.711 | 5.745 ± 1.773 | 0.383 ± 0.355 | 3.064 ± 1.163 | 0.0 ± 0.0 |
| Val | 4.979 ± 1.42 | 0.383 ± 0.355 | 3.447 ± 1.043 | 4.596 ± 0.982 | 2.681 ± 0.6 | 2.681 ± 1.149 | 0.383 ± 0.304 | 2.681 ± 1.34 | 4.596 ± 1.058 | 2.681 ± 1.019 | 1.149 ± 0.597 | 4.213 ± 1.22 | 0.766 ± 0.541 | 1.532 ± 0.746 | 2.298 ± 0.988 | 1.532 ± 0.543 | 4.596 ± 1.364 | 1.915 ± 0.56 | 0.766 ± 0.607 | 3.064 ± 0.765 | 0.0 ± 0.0 |
| Trp | 0.766 ± 0.505 | 0.383 ± 0.304 | 0.0 ± 0.0 | 1.532 ± 0.772 | 0.383 ± 0.355 | 0.766 ± 0.468 | 0.0 ± 0.0 | 0.766 ± 0.57 | 1.149 ± 0.705 | 0.766 ± 0.416 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.383 ± 0.318 | 0.766 ± 0.448 | 0.383 ± 0.326 | 1.149 ± 0.462 | 0.383 ± 0.416 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Tyr | 1.532 ± 0.628 | 0.383 ± 0.326 | 1.532 ± 0.534 | 3.447 ± 1.038 | 2.681 ± 1.397 | 2.298 ± 1.003 | 0.766 ± 0.463 | 2.298 ± 1.116 | 7.277 ± 1.81 | 6.511 ± 1.966 | 0.383 ± 0.304 | 3.83 ± 0.917 | 0.766 ± 0.468 | 4.213 ± 1.161 | 3.064 ± 0.748 | 3.064 ± 0.656 | 1.915 ± 0.822 | 0.766 ± 0.607 | 1.149 ± 0.635 | 3.064 ± 1.094 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 16 proteins (2612 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
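One way to picture protein-level bootstrapping is sketched below: whole proteins are resampled with replacement, the frequency of a dipeptide is recomputed for each resample, and the spread of those values serves as the error. This is only a sketch under stated assumptions: the helper names, the use of the sample standard deviation as the error measure, the fixed random seed, and the one-letter toy sequences are all hypothetical and are not taken from the Proteome-pI implementation.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each round draws len(proteins) whole proteins with
    replacement, so residues within a protein stay together.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKKLE", "MKE", "AKKA", "LLEK"]
err = bootstrap_error(toy, "KK")
```

Under this scheme a dipeptide that never occurs in any protein has a frequency of 0 in every resample and therefore an error of exactly 0, which matches the 0.0 ± 0.0 entries in the table.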

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski