Amino acid dipeptide frequency for Streptococcus satellite phage Javan760

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
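The counting described above can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the toy sequences, and the choice to normalize by the total number of dipeptides counted are all assumptions, since the page does not state how the database normalizes its values.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the column order used by the table; Xaa is the catch-all.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}
CODES = list(ONE_TO_THREE.values()) + ["Xaa"]


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        # Any letter outside the 20 standard residues is mapped to Xaa.
        residues = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(CODES, repeat=2)
    }


# Hypothetical toy sequences, not Javan760 proteins.
toy = ["MKELLK", "MELXK"]
freqs = dipeptide_per_mille(toy)
uniform = 1000.0 / 400  # 2.5 per mille for each of the 400 standard pairs
print(len(freqs), uniform)
print(freqs["GluLeu"], freqs["LeuXaa"])
```

With real proteome sequences in place of `toy`, each value can be compared against `uniform` to see which dipeptides are over- or underrepresented.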

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
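As a small worked example of the unit conversion, the sketch below turns a few table values into plain fractions and labels them against the uniform expectation of 2.5‰. The helper names are hypothetical, and using the uniform expectation as the threshold for red and blue is an assumption drawn from the text, not a documented rule of the database.

```python
# A few values copied from the table, in per mille.
per_mille = {"GluLeu": 10.647, "LeuGlu": 11.223, "AlaAla": 0.288, "GlyPro": 0.0}

UNIFORM_PER_MILLE = 1000.0 / 400  # 0.25% = 2.5 per mille


def to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def representation(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a value relative to the assumed uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


for name, value in per_mille.items():
    print(name, to_fraction(value), representation(value))
```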

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.288AlaAla: 0.288 ± 0.304
0.0AlaCys: 0.0 ± 0.0
3.165AlaAsp: 3.165 ± 0.788
4.317AlaGlu: 4.317 ± 1.122
3.165AlaPhe: 3.165 ± 1.247
2.014AlaGly: 2.014 ± 0.747
0.863AlaHis: 0.863 ± 0.593
4.892AlaIle: 4.892 ± 1.332
4.892AlaLys: 4.892 ± 0.768
5.18AlaLeu: 5.18 ± 0.837
1.727AlaMet: 1.727 ± 1.058
2.014AlaAsn: 2.014 ± 0.635
0.576AlaPro: 0.576 ± 0.423
1.151AlaGln: 1.151 ± 0.502
5.468AlaArg: 5.468 ± 1.119
1.727AlaSer: 1.727 ± 0.716
2.878AlaThr: 2.878 ± 0.956
4.317AlaVal: 4.317 ± 0.794
0.288AlaTrp: 0.288 ± 0.323
1.151AlaTyr: 1.151 ± 0.655
0.0AlaXaa: 0.0 ± 0.0
Cys
0.576CysAla: 0.576 ± 0.342
0.0CysCys: 0.0 ± 0.0
0.576CysAsp: 0.576 ± 0.415
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.576CysGly: 0.576 ± 0.332
0.576CysHis: 0.576 ± 0.324
0.863CysIle: 0.863 ± 0.623
0.288CysLys: 0.288 ± 0.248
1.151CysLeu: 1.151 ± 0.507
0.576CysMet: 0.576 ± 0.383
0.576CysAsn: 0.576 ± 0.378
0.576CysPro: 0.576 ± 0.334
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.151AspAla: 1.151 ± 0.5
0.576AspCys: 0.576 ± 0.381
2.878AspAsp: 2.878 ± 0.728
5.18AspGlu: 5.18 ± 1.303
4.892AspPhe: 4.892 ± 1.202
3.741AspGly: 3.741 ± 0.839
0.288AspHis: 0.288 ± 0.314
7.194AspIle: 7.194 ± 1.337
5.755AspLys: 5.755 ± 1.059
7.77AspLeu: 7.77 ± 1.619
2.302AspMet: 2.302 ± 0.8
3.165AspAsn: 3.165 ± 0.976
1.439AspPro: 1.439 ± 0.711
1.151AspGln: 1.151 ± 0.591
2.59AspArg: 2.59 ± 0.749
2.59AspSer: 2.59 ± 1.005
3.453AspThr: 3.453 ± 0.945
2.014AspVal: 2.014 ± 0.642
0.0AspTrp: 0.0 ± 0.0
4.029AspTyr: 4.029 ± 1.254
0.0AspXaa: 0.0 ± 0.0
Glu
5.755GluAla: 5.755 ± 1.227
0.863GluCys: 0.863 ± 0.449
4.892GluAsp: 4.892 ± 1.064
7.482GluGlu: 7.482 ± 1.713
2.014GluPhe: 2.014 ± 0.903
2.59GluGly: 2.59 ± 0.833
0.863GluHis: 0.863 ± 0.557
9.209GluIle: 9.209 ± 1.693
6.331GluLys: 6.331 ± 1.519
10.647GluLeu: 10.647 ± 1.486
2.014GluMet: 2.014 ± 0.663
5.468GluAsn: 5.468 ± 1.218
2.878GluPro: 2.878 ± 1.097
5.18GluGln: 5.18 ± 1.567
5.755GluArg: 5.755 ± 0.974
4.029GluSer: 4.029 ± 1.25
3.453GluThr: 3.453 ± 1.013
4.604GluVal: 4.604 ± 1.471
0.288GluTrp: 0.288 ± 0.268
4.029GluTyr: 4.029 ± 0.963
0.0GluXaa: 0.0 ± 0.0
Phe
0.576PheAla: 0.576 ± 0.409
0.288PheCys: 0.288 ± 0.248
2.59PheAsp: 2.59 ± 0.851
3.165PheGlu: 3.165 ± 1.191
1.151PhePhe: 1.151 ± 0.766
2.014PheGly: 2.014 ± 0.637
0.288PheHis: 0.288 ± 0.261
3.741PheIle: 3.741 ± 1.356
4.604PheLys: 4.604 ± 1.298
5.18PheLeu: 5.18 ± 1.049
0.576PheMet: 0.576 ± 0.395
1.727PheAsn: 1.727 ± 0.721
0.288PhePro: 0.288 ± 0.323
2.302PheGln: 2.302 ± 0.696
2.878PheArg: 2.878 ± 0.756
3.453PheSer: 3.453 ± 0.798
1.727PheThr: 1.727 ± 0.579
1.727PheVal: 1.727 ± 0.697
0.863PheTrp: 0.863 ± 0.592
2.014PheTyr: 2.014 ± 0.876
0.0PheXaa: 0.0 ± 0.0
Gly
1.439GlyAla: 1.439 ± 0.527
0.288GlyCys: 0.288 ± 0.261
3.165GlyAsp: 3.165 ± 1.078
2.014GlyGlu: 2.014 ± 0.642
2.302GlyPhe: 2.302 ± 0.64
2.302GlyGly: 2.302 ± 0.717
0.863GlyHis: 0.863 ± 0.383
2.59GlyIle: 2.59 ± 0.93
5.755GlyLys: 5.755 ± 0.835
5.18GlyLeu: 5.18 ± 1.37
1.439GlyMet: 1.439 ± 0.596
2.014GlyAsn: 2.014 ± 0.656
0.0GlyPro: 0.0 ± 0.0
2.302GlyGln: 2.302 ± 0.52
3.165GlyArg: 3.165 ± 0.764
2.302GlySer: 2.302 ± 0.593
3.165GlyThr: 3.165 ± 1.18
2.59GlyVal: 2.59 ± 0.973
0.863GlyTrp: 0.863 ± 0.804
2.302GlyTyr: 2.302 ± 0.789
0.0GlyXaa: 0.0 ± 0.0
His
1.151HisAla: 1.151 ± 0.81
0.0HisCys: 0.0 ± 0.0
0.863HisAsp: 0.863 ± 0.654
0.576HisGlu: 0.576 ± 0.536
0.576HisPhe: 0.576 ± 0.373
0.863HisGly: 0.863 ± 0.497
0.0HisHis: 0.0 ± 0.0
0.288HisIle: 0.288 ± 0.261
1.439HisLys: 1.439 ± 0.706
2.014HisLeu: 2.014 ± 0.658
0.0HisMet: 0.0 ± 0.0
0.863HisAsn: 0.863 ± 0.509
0.288HisPro: 0.288 ± 0.358
0.288HisGln: 0.288 ± 0.275
1.151HisArg: 1.151 ± 0.772
1.151HisSer: 1.151 ± 0.531
0.576HisThr: 0.576 ± 0.521
0.288HisVal: 0.288 ± 0.268
0.0HisTrp: 0.0 ± 0.0
0.288HisTyr: 0.288 ± 0.268
0.0HisXaa: 0.0 ± 0.0
Ile
3.165IleAla: 3.165 ± 1.132
0.576IleCys: 0.576 ± 0.369
7.482IleAsp: 7.482 ± 2.121
5.755IleGlu: 5.755 ± 1.644
2.878IlePhe: 2.878 ± 0.708
3.165IleGly: 3.165 ± 0.871
0.288IleHis: 0.288 ± 0.268
4.317IleIle: 4.317 ± 1.114
8.058IleLys: 8.058 ± 1.482
6.331IleLeu: 6.331 ± 1.216
0.576IleMet: 0.576 ± 0.446
2.59IleAsn: 2.59 ± 0.883
3.741IlePro: 3.741 ± 1.066
2.59IleGln: 2.59 ± 0.63
2.59IleArg: 2.59 ± 0.875
6.906IleSer: 6.906 ± 1.607
3.741IleThr: 3.741 ± 0.86
3.741IleVal: 3.741 ± 0.877
1.151IleTrp: 1.151 ± 0.729
2.59IleTyr: 2.59 ± 0.911
0.0IleXaa: 0.0 ± 0.0
Lys
7.77LysAla: 7.77 ± 1.803
0.0LysCys: 0.0 ± 0.0
3.741LysAsp: 3.741 ± 1.124
10.072LysGlu: 10.072 ± 1.35
3.165LysPhe: 3.165 ± 0.89
4.317LysGly: 4.317 ± 0.953
2.59LysHis: 2.59 ± 1.186
5.755LysIle: 5.755 ± 1.331
7.482LysLys: 7.482 ± 1.521
10.072LysLeu: 10.072 ± 1.8
2.302LysMet: 2.302 ± 0.752
5.755LysAsn: 5.755 ± 1.164
3.165LysPro: 3.165 ± 1.244
3.165LysGln: 3.165 ± 0.986
6.043LysArg: 6.043 ± 1.132
3.453LysSer: 3.453 ± 0.846
7.77LysThr: 7.77 ± 1.78
4.892LysVal: 4.892 ± 0.711
1.439LysTrp: 1.439 ± 0.561
3.165LysTyr: 3.165 ± 1.237
0.0LysXaa: 0.0 ± 0.0
Leu
5.468LeuAla: 5.468 ± 1.323
1.151LeuCys: 1.151 ± 0.756
10.36LeuAsp: 10.36 ± 1.651
11.223LeuGlu: 11.223 ± 1.841
3.741LeuPhe: 3.741 ± 1.522
4.317LeuGly: 4.317 ± 1.37
1.151LeuHis: 1.151 ± 0.503
5.18LeuIle: 5.18 ± 1.189
9.209LeuLys: 9.209 ± 1.8
10.647LeuLeu: 10.647 ± 1.197
2.878LeuMet: 2.878 ± 0.751
6.906LeuAsn: 6.906 ± 1.472
2.878LeuPro: 2.878 ± 1.024
3.741LeuGln: 3.741 ± 0.632
5.755LeuArg: 5.755 ± 1.361
5.755LeuSer: 5.755 ± 1.112
6.619LeuThr: 6.619 ± 1.311
3.741LeuVal: 3.741 ± 1.048
0.288LeuTrp: 0.288 ± 0.268
4.892LeuTyr: 4.892 ± 0.928
0.0LeuXaa: 0.0 ± 0.0
Met
2.59MetAla: 2.59 ± 0.873
0.288MetCys: 0.288 ± 0.248
1.727MetAsp: 1.727 ± 0.758
2.878MetGlu: 2.878 ± 1.057
0.863MetPhe: 0.863 ± 0.494
2.014MetGly: 2.014 ± 0.781
0.0MetHis: 0.0 ± 0.0
1.727MetIle: 1.727 ± 0.782
2.302MetLys: 2.302 ± 0.587
1.727MetLeu: 1.727 ± 0.755
0.576MetMet: 0.576 ± 0.457
2.302MetAsn: 2.302 ± 0.945
0.863MetPro: 0.863 ± 0.407
0.576MetGln: 0.576 ± 0.423
0.863MetArg: 0.863 ± 0.504
1.439MetSer: 1.439 ± 0.538
3.741MetThr: 3.741 ± 1.393
1.727MetVal: 1.727 ± 0.65
0.288MetTrp: 0.288 ± 0.268
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.59AsnAla: 2.59 ± 0.861
0.288AsnCys: 0.288 ± 0.282
3.741AsnAsp: 3.741 ± 1.348
3.165AsnGlu: 3.165 ± 0.712
1.439AsnPhe: 1.439 ± 0.576
3.165AsnGly: 3.165 ± 1.085
1.151AsnHis: 1.151 ± 0.564
2.878AsnIle: 2.878 ± 0.861
7.482AsnLys: 7.482 ± 1.427
4.029AsnLeu: 4.029 ± 0.726
1.727AsnMet: 1.727 ± 0.644
1.439AsnAsn: 1.439 ± 0.616
1.439AsnPro: 1.439 ± 0.555
2.302AsnGln: 2.302 ± 0.934
1.727AsnArg: 1.727 ± 0.747
3.165AsnSer: 3.165 ± 1.022
3.165AsnThr: 3.165 ± 1.013
2.878AsnVal: 2.878 ± 1.175
1.439AsnTrp: 1.439 ± 0.649
1.727AsnTyr: 1.727 ± 0.872
0.0AsnXaa: 0.0 ± 0.0
Pro
1.151ProAla: 1.151 ± 0.582
0.288ProCys: 0.288 ± 0.268
2.014ProAsp: 2.014 ± 0.734
2.302ProGlu: 2.302 ± 0.782
1.727ProPhe: 1.727 ± 0.833
0.288ProGly: 0.288 ± 0.324
0.288ProHis: 0.288 ± 0.287
2.302ProIle: 2.302 ± 0.569
2.878ProLys: 2.878 ± 0.794
1.151ProLeu: 1.151 ± 0.585
1.439ProMet: 1.439 ± 0.544
1.727ProAsn: 1.727 ± 0.787
1.151ProPro: 1.151 ± 0.721
0.863ProGln: 0.863 ± 0.4
2.878ProArg: 2.878 ± 1.012
2.014ProSer: 2.014 ± 0.708
0.863ProThr: 0.863 ± 0.474
2.302ProVal: 2.302 ± 0.823
0.0ProTrp: 0.0 ± 0.0
1.727ProTyr: 1.727 ± 0.62
0.0ProXaa: 0.0 ± 0.0
Gln
3.453GlnAla: 3.453 ± 1.143
0.0GlnCys: 0.0 ± 0.0
1.727GlnAsp: 1.727 ± 0.657
2.878GlnGlu: 2.878 ± 0.985
1.151GlnPhe: 1.151 ± 0.577
2.014GlnGly: 2.014 ± 0.891
0.0GlnHis: 0.0 ± 0.0
2.59GlnIle: 2.59 ± 0.807
3.453GlnLys: 3.453 ± 0.856
3.741GlnLeu: 3.741 ± 1.062
0.576GlnMet: 0.576 ± 0.462
1.151GlnAsn: 1.151 ± 0.604
1.727GlnPro: 1.727 ± 0.702
1.727GlnGln: 1.727 ± 0.711
2.302GlnArg: 2.302 ± 0.824
1.727GlnSer: 1.727 ± 0.63
1.727GlnThr: 1.727 ± 0.536
3.165GlnVal: 3.165 ± 0.694
0.576GlnTrp: 0.576 ± 0.324
0.576GlnTyr: 0.576 ± 0.423
0.0GlnXaa: 0.0 ± 0.0
Arg
2.302ArgAla: 2.302 ± 0.655
0.288ArgCys: 0.288 ± 0.235
1.727ArgAsp: 1.727 ± 0.625
6.906ArgGlu: 6.906 ± 1.795
3.165ArgPhe: 3.165 ± 1.094
2.59ArgGly: 2.59 ± 0.86
0.863ArgHis: 0.863 ± 0.36
4.029ArgIle: 4.029 ± 1.216
5.18ArgLys: 5.18 ± 1.059
6.331ArgLeu: 6.331 ± 1.275
2.59ArgMet: 2.59 ± 0.785
3.453ArgAsn: 3.453 ± 1.006
0.576ArgPro: 0.576 ± 0.363
1.727ArgGln: 1.727 ± 0.708
1.727ArgArg: 1.727 ± 0.635
4.029ArgSer: 4.029 ± 0.997
2.302ArgThr: 2.302 ± 0.744
3.741ArgVal: 3.741 ± 0.681
0.288ArgTrp: 0.288 ± 0.268
2.59ArgTyr: 2.59 ± 0.81
0.0ArgXaa: 0.0 ± 0.0
Ser
3.165SerAla: 3.165 ± 0.724
0.0SerCys: 0.0 ± 0.0
3.165SerAsp: 3.165 ± 0.736
6.043SerGlu: 6.043 ± 1.726
2.014SerPhe: 2.014 ± 0.853
2.878SerGly: 2.878 ± 0.654
0.863SerHis: 0.863 ± 0.481
4.604SerIle: 4.604 ± 1.484
3.741SerLys: 3.741 ± 0.943
6.331SerLeu: 6.331 ± 0.888
2.302SerMet: 2.302 ± 0.706
4.892SerAsn: 4.892 ± 1.399
2.014SerPro: 2.014 ± 0.771
1.727SerGln: 1.727 ± 0.515
2.014SerArg: 2.014 ± 0.919
3.165SerSer: 3.165 ± 0.94
2.59SerThr: 2.59 ± 1.127
2.014SerVal: 2.014 ± 0.831
0.288SerTrp: 0.288 ± 0.278
2.878SerTyr: 2.878 ± 0.833
0.0SerXaa: 0.0 ± 0.0
Thr
1.727ThrAla: 1.727 ± 0.78
0.576ThrCys: 0.576 ± 0.39
3.165ThrAsp: 3.165 ± 1.058
4.029ThrGlu: 4.029 ± 0.916
2.59ThrPhe: 2.59 ± 0.941
2.878ThrGly: 2.878 ± 0.726
0.863ThrHis: 0.863 ± 0.393
2.014ThrIle: 2.014 ± 0.789
6.043ThrLys: 6.043 ± 1.815
7.194ThrLeu: 7.194 ± 1.644
2.014ThrMet: 2.014 ± 0.714
0.288ThrAsn: 0.288 ± 0.278
2.014ThrPro: 2.014 ± 0.854
1.727ThrGln: 1.727 ± 0.695
1.727ThrArg: 1.727 ± 0.464
3.453ThrSer: 3.453 ± 0.878
2.878ThrThr: 2.878 ± 0.885
4.892ThrVal: 4.892 ± 1.504
0.576ThrTrp: 0.576 ± 0.376
4.317ThrTyr: 4.317 ± 1.61
0.0ThrXaa: 0.0 ± 0.0
Val
3.453ValAla: 3.453 ± 0.911
0.288ValCys: 0.288 ± 0.268
3.453ValAsp: 3.453 ± 1.037
6.043ValGlu: 6.043 ± 1.393
1.439ValPhe: 1.439 ± 0.576
2.59ValGly: 2.59 ± 0.983
0.0ValHis: 0.0 ± 0.0
4.604ValIle: 4.604 ± 1.25
4.317ValLys: 4.317 ± 1.059
5.468ValLeu: 5.468 ± 1.483
1.727ValMet: 1.727 ± 0.696
2.59ValAsn: 2.59 ± 0.878
2.014ValPro: 2.014 ± 0.765
1.151ValGln: 1.151 ± 0.559
2.59ValArg: 2.59 ± 1.091
3.453ValSer: 3.453 ± 0.815
3.165ValThr: 3.165 ± 0.932
2.878ValVal: 2.878 ± 0.918
0.863ValTrp: 0.863 ± 0.482
2.302ValTyr: 2.302 ± 0.622
0.0ValXaa: 0.0 ± 0.0
Trp
0.288TrpAla: 0.288 ± 0.261
0.0TrpCys: 0.0 ± 0.0
0.288TrpAsp: 0.288 ± 0.294
1.439TrpGlu: 1.439 ± 0.529
0.288TrpPhe: 0.288 ± 0.268
0.863TrpGly: 0.863 ± 0.482
0.288TrpHis: 0.288 ± 0.268
0.288TrpIle: 0.288 ± 0.282
0.576TrpLys: 0.576 ± 0.431
1.151TrpLeu: 1.151 ± 0.706
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.576TrpGln: 0.576 ± 0.4
1.151TrpArg: 1.151 ± 0.608
1.151TrpSer: 1.151 ± 0.486
0.288TrpThr: 0.288 ± 0.278
0.863TrpVal: 0.863 ± 0.537
0.0TrpTrp: 0.0 ± 0.0
0.288TrpTyr: 0.288 ± 0.268
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.014TyrAla: 2.014 ± 0.895
0.576TyrCys: 0.576 ± 0.379
1.439TyrAsp: 1.439 ± 0.577
2.878TyrGlu: 2.878 ± 1.025
2.59TyrPhe: 2.59 ± 1.271
0.863TyrGly: 0.863 ± 0.543
0.576TyrHis: 0.576 ± 0.385
3.165TyrIle: 3.165 ± 1.269
6.331TyrLys: 6.331 ± 1.89
4.604TyrLeu: 4.604 ± 1.191
0.863TyrMet: 0.863 ± 0.545
2.014TyrAsn: 2.014 ± 0.561
1.727TyrPro: 1.727 ± 0.879
2.014TyrGln: 2.014 ± 0.905
4.029TyrArg: 4.029 ± 0.852
1.727TyrSer: 1.727 ± 0.624
1.151TyrThr: 1.151 ± 0.655
2.014TyrVal: 2.014 ± 0.788
0.288TyrTrp: 0.288 ± 0.278
0.863TyrTyr: 0.863 ± 0.456
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 27 proteins (3476 amino acids)
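Each table cell above appears on its own line as the value, the dipeptide name, and then the value again with its error (for example `10.647GluLeu: 10.647 ± 1.486`), while lines holding only a three-letter code start a new row. A minimal sketch for reading such lines back into tuples, assuming exactly this layout; the names `ROW` and `parse_row` are hypothetical:

```python
import re

# Layout: value, first residue, second residue, repeated value, error.
ROW = re.compile(
    r"^(?P<value>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<mean>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)


def parse_row(line):
    """Return (first, second, value, error) or None for row labels such as 'Ala'."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    return (
        match["first"],
        match["second"],
        float(match["mean"]),
        float(match["error"]),
    )


lines = ["Glu", "10.647GluLeu: 10.647 ± 1.486", "0.0GluXaa: 0.0 ± 0.0"]
cells = [cell for cell in map(parse_row, lines) if cell is not None]
print(cells)
```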

Note: The error was estimated by bootstrapping (×100) at the protein level.
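Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled proteome, and the spread of those estimates is reported as the error. This is only an illustration under stated assumptions. The exact procedure used by Proteome-pI is not described here, and the function names, the fixed seed, the use of the population standard deviation, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein (one-letter codes)."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of a dipeptide's per mille frequency over resampled proteomes."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Hypothetical toy proteome, not the 27 Javan760 proteins.
toy = ["MKELLKEL", "MAELK", "MKKLLE", "MELLEL"]
print(bootstrap_error(toy, "EL"))
print(bootstrap_error(toy, "CC"))  # absent everywhere, so the spread is 0.0
```

A dipeptide that never occurs yields zero in every resample, which is consistent with the `0.0 ± 0.0` entries in the table.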

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski