Amino acid dipepetide frequency for Streptococcus satellite phage Javan58

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.848AlaAla: 0.848 ± 0.471
0.565AlaCys: 0.565 ± 0.426
3.673AlaAsp: 3.673 ± 0.69
4.804AlaGlu: 4.804 ± 1.167
2.826AlaPhe: 2.826 ± 0.962
1.695AlaGly: 1.695 ± 0.769
0.0AlaHis: 0.0 ± 0.0
6.782AlaIle: 6.782 ± 1.142
3.956AlaLys: 3.956 ± 0.87
5.086AlaLeu: 5.086 ± 0.869
1.413AlaMet: 1.413 ± 0.604
5.086AlaAsn: 5.086 ± 0.793
1.978AlaPro: 1.978 ± 0.93
3.391AlaGln: 3.391 ± 0.856
2.826AlaArg: 2.826 ± 0.976
3.673AlaSer: 3.673 ± 1.033
3.673AlaThr: 3.673 ± 0.958
3.673AlaVal: 3.673 ± 1.316
0.565AlaTrp: 0.565 ± 0.41
2.826AlaTyr: 2.826 ± 0.662
0.0AlaXaa: 0.0 ± 0.0
Cys
0.565CysAla: 0.565 ± 0.359
0.0CysCys: 0.0 ± 0.0
0.848CysAsp: 0.848 ± 0.431
0.283CysGlu: 0.283 ± 0.29
0.0CysPhe: 0.0 ± 0.0
1.413CysGly: 1.413 ± 0.669
0.565CysHis: 0.565 ± 0.372
0.565CysIle: 0.565 ± 0.329
0.0CysLys: 0.0 ± 0.0
1.413CysLeu: 1.413 ± 0.537
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.283CysPro: 0.283 ± 0.29
0.565CysGln: 0.565 ± 0.58
1.13CysArg: 1.13 ± 0.495
0.283CysSer: 0.283 ± 0.321
0.0CysThr: 0.0 ± 0.0
0.283CysVal: 0.283 ± 0.276
0.0CysTrp: 0.0 ± 0.0
0.283CysTyr: 0.283 ± 0.251
0.0CysXaa: 0.0 ± 0.0
Asp
0.848AspAla: 0.848 ± 0.424
1.695AspCys: 1.695 ± 0.862
3.391AspAsp: 3.391 ± 1.362
2.261AspGlu: 2.261 ± 0.875
1.695AspPhe: 1.695 ± 0.761
1.13AspGly: 1.13 ± 0.564
1.413AspHis: 1.413 ± 0.531
5.651AspIle: 5.651 ± 1.33
7.629AspLys: 7.629 ± 0.876
5.651AspLeu: 5.651 ± 1.058
1.978AspMet: 1.978 ± 1.078
3.956AspAsn: 3.956 ± 1.079
0.848AspPro: 0.848 ± 0.434
1.13AspGln: 1.13 ± 0.536
2.261AspArg: 2.261 ± 0.763
4.804AspSer: 4.804 ± 1.604
5.086AspThr: 5.086 ± 1.363
1.978AspVal: 1.978 ± 0.661
0.283AspTrp: 0.283 ± 0.323
5.934AspTyr: 5.934 ± 1.536
0.0AspXaa: 0.0 ± 0.0
Glu
4.521GluAla: 4.521 ± 1.174
1.13GluCys: 1.13 ± 0.647
4.238GluAsp: 4.238 ± 1.177
3.673GluGlu: 3.673 ± 1.015
3.391GluPhe: 3.391 ± 0.963
2.543GluGly: 2.543 ± 0.602
2.261GluHis: 2.261 ± 0.758
7.912GluIle: 7.912 ± 1.586
7.347GluLys: 7.347 ± 1.062
10.172GluLeu: 10.172 ± 1.809
1.413GluMet: 1.413 ± 0.855
2.543GluAsn: 2.543 ± 0.6
1.13GluPro: 1.13 ± 0.835
4.238GluGln: 4.238 ± 1.077
3.956GluArg: 3.956 ± 1.001
2.261GluSer: 2.261 ± 0.793
5.934GluThr: 5.934 ± 0.987
5.651GluVal: 5.651 ± 1.355
0.565GluTrp: 0.565 ± 0.376
2.543GluTyr: 2.543 ± 0.952
0.0GluXaa: 0.0 ± 0.0
Phe
1.695PheAla: 1.695 ± 0.665
0.0PheCys: 0.0 ± 0.0
2.826PheAsp: 2.826 ± 0.849
3.391PheGlu: 3.391 ± 0.796
1.413PhePhe: 1.413 ± 0.596
2.826PheGly: 2.826 ± 0.922
1.13PheHis: 1.13 ± 0.526
2.826PheIle: 2.826 ± 1.069
4.238PheLys: 4.238 ± 0.911
3.956PheLeu: 3.956 ± 0.979
0.283PheMet: 0.283 ± 0.283
5.086PheAsn: 5.086 ± 0.781
0.848PhePro: 0.848 ± 0.41
0.565PheGln: 0.565 ± 0.385
1.978PheArg: 1.978 ± 0.75
2.543PheSer: 2.543 ± 0.725
1.978PheThr: 1.978 ± 0.668
1.413PheVal: 1.413 ± 0.542
0.283PheTrp: 0.283 ± 0.271
1.13PheTyr: 1.13 ± 0.455
0.0PheXaa: 0.0 ± 0.0
Gly
1.695GlyAla: 1.695 ± 0.739
0.565GlyCys: 0.565 ± 0.343
2.543GlyAsp: 2.543 ± 0.738
4.521GlyGlu: 4.521 ± 1.283
2.826GlyPhe: 2.826 ± 0.624
3.673GlyGly: 3.673 ± 1.48
0.848GlyHis: 0.848 ± 0.385
3.673GlyIle: 3.673 ± 0.923
3.956GlyLys: 3.956 ± 1.183
5.651GlyLeu: 5.651 ± 1.641
0.848GlyMet: 0.848 ± 0.411
0.848GlyAsn: 0.848 ± 0.464
0.0GlyPro: 0.0 ± 0.0
1.413GlyGln: 1.413 ± 0.616
2.261GlyArg: 2.261 ± 0.528
1.695GlySer: 1.695 ± 0.822
3.391GlyThr: 3.391 ± 1.309
1.695GlyVal: 1.695 ± 0.664
0.565GlyTrp: 0.565 ± 0.552
3.673GlyTyr: 3.673 ± 1.482
0.0GlyXaa: 0.0 ± 0.0
His
1.978HisAla: 1.978 ± 0.835
0.283HisCys: 0.283 ± 0.276
0.283HisAsp: 0.283 ± 0.318
0.565HisGlu: 0.565 ± 0.37
0.848HisPhe: 0.848 ± 0.569
1.413HisGly: 1.413 ± 0.624
0.283HisHis: 0.283 ± 0.321
1.695HisIle: 1.695 ± 0.634
1.13HisLys: 1.13 ± 0.538
1.13HisLeu: 1.13 ± 0.473
0.0HisMet: 0.0 ± 0.0
0.848HisAsn: 0.848 ± 0.636
1.413HisPro: 1.413 ± 0.821
1.13HisGln: 1.13 ± 0.632
0.283HisArg: 0.283 ± 0.251
0.848HisSer: 0.848 ± 0.531
1.13HisThr: 1.13 ± 0.538
0.848HisVal: 0.848 ± 0.43
0.283HisTrp: 0.283 ± 0.29
3.108HisTyr: 3.108 ± 1.024
0.0HisXaa: 0.0 ± 0.0
Ile
5.086IleAla: 5.086 ± 1.359
0.565IleCys: 0.565 ± 0.372
5.086IleAsp: 5.086 ± 1.11
6.499IleGlu: 6.499 ± 1.622
2.543IlePhe: 2.543 ± 0.905
1.978IleGly: 1.978 ± 0.582
1.13IleHis: 1.13 ± 0.589
4.238IleIle: 4.238 ± 0.613
7.912IleLys: 7.912 ± 1.22
4.521IleLeu: 4.521 ± 0.975
1.13IleMet: 1.13 ± 0.576
2.826IleAsn: 2.826 ± 1.285
2.261IlePro: 2.261 ± 0.87
1.695IleGln: 1.695 ± 0.655
2.826IleArg: 2.826 ± 0.823
3.956IleSer: 3.956 ± 1.241
3.673IleThr: 3.673 ± 0.802
3.108IleVal: 3.108 ± 0.623
0.0IleTrp: 0.0 ± 0.0
4.521IleTyr: 4.521 ± 1.1
0.0IleXaa: 0.0 ± 0.0
Lys
8.76LysAla: 8.76 ± 1.144
0.283LysCys: 0.283 ± 0.318
4.521LysAsp: 4.521 ± 1.204
8.76LysGlu: 8.76 ± 1.216
2.543LysPhe: 2.543 ± 1.044
3.956LysGly: 3.956 ± 1.114
1.978LysHis: 1.978 ± 0.716
3.956LysIle: 3.956 ± 0.799
8.477LysLys: 8.477 ± 1.49
6.216LysLeu: 6.216 ± 1.505
2.261LysMet: 2.261 ± 0.809
4.521LysAsn: 4.521 ± 1.008
4.238LysPro: 4.238 ± 1.643
5.651LysGln: 5.651 ± 1.292
7.347LysArg: 7.347 ± 0.925
7.347LysSer: 7.347 ± 2.087
6.782LysThr: 6.782 ± 1.086
4.521LysVal: 4.521 ± 1.144
0.848LysTrp: 0.848 ± 0.454
3.391LysTyr: 3.391 ± 1.24
0.0LysXaa: 0.0 ± 0.0
Leu
7.347LeuAla: 7.347 ± 1.199
0.848LeuCys: 0.848 ± 0.439
7.064LeuAsp: 7.064 ± 1.156
8.76LeuGlu: 8.76 ± 2.409
3.108LeuPhe: 3.108 ± 0.871
5.086LeuGly: 5.086 ± 1.224
0.848LeuHis: 0.848 ± 0.411
6.782LeuIle: 6.782 ± 1.431
7.064LeuLys: 7.064 ± 1.101
7.064LeuLeu: 7.064 ± 1.401
1.978LeuMet: 1.978 ± 0.776
5.086LeuAsn: 5.086 ± 1.37
3.673LeuPro: 3.673 ± 1.312
2.826LeuGln: 2.826 ± 0.843
2.543LeuArg: 2.543 ± 0.675
5.369LeuSer: 5.369 ± 1.061
5.369LeuThr: 5.369 ± 0.84
3.108LeuVal: 3.108 ± 1.147
0.848LeuTrp: 0.848 ± 0.38
2.543LeuTyr: 2.543 ± 0.746
0.0LeuXaa: 0.0 ± 0.0
Met
1.695MetAla: 1.695 ± 0.697
0.0MetCys: 0.0 ± 0.0
1.413MetAsp: 1.413 ± 0.559
0.848MetGlu: 0.848 ± 0.431
0.283MetPhe: 0.283 ± 0.267
0.565MetGly: 0.565 ± 0.406
0.565MetHis: 0.565 ± 0.394
0.848MetIle: 0.848 ± 0.458
1.978MetLys: 1.978 ± 0.443
3.391MetLeu: 3.391 ± 1.304
0.283MetMet: 0.283 ± 0.266
2.261MetAsn: 2.261 ± 0.69
0.0MetPro: 0.0 ± 0.0
0.565MetGln: 0.565 ± 0.394
1.413MetArg: 1.413 ± 0.576
1.13MetSer: 1.13 ± 0.674
2.826MetThr: 2.826 ± 1.089
0.848MetVal: 0.848 ± 0.464
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.956AsnAla: 3.956 ± 1.025
0.283AsnCys: 0.283 ± 0.276
3.108AsnAsp: 3.108 ± 0.716
2.826AsnGlu: 2.826 ± 0.903
1.695AsnPhe: 1.695 ± 0.613
3.956AsnGly: 3.956 ± 1.036
1.695AsnHis: 1.695 ± 0.68
3.108AsnIle: 3.108 ± 0.738
5.369AsnLys: 5.369 ± 0.906
3.108AsnLeu: 3.108 ± 1.119
1.695AsnMet: 1.695 ± 1.041
2.261AsnAsn: 2.261 ± 0.657
1.978AsnPro: 1.978 ± 0.65
3.673AsnGln: 3.673 ± 1.193
2.261AsnArg: 2.261 ± 0.625
4.238AsnSer: 4.238 ± 0.975
3.108AsnThr: 3.108 ± 1.211
3.108AsnVal: 3.108 ± 0.99
0.565AsnTrp: 0.565 ± 0.336
1.978AsnTyr: 1.978 ± 0.755
0.0AsnXaa: 0.0 ± 0.0
Pro
1.413ProAla: 1.413 ± 0.797
0.283ProCys: 0.283 ± 0.321
0.848ProAsp: 0.848 ± 0.537
3.391ProGlu: 3.391 ± 1.278
1.413ProPhe: 1.413 ± 0.704
0.848ProGly: 0.848 ± 0.475
0.848ProHis: 0.848 ± 0.396
1.695ProIle: 1.695 ± 0.779
3.108ProLys: 3.108 ± 0.898
2.261ProLeu: 2.261 ± 0.599
1.695ProMet: 1.695 ± 0.621
2.826ProAsn: 2.826 ± 0.918
0.848ProPro: 0.848 ± 0.436
0.0ProGln: 0.0 ± 0.0
2.261ProArg: 2.261 ± 0.833
1.978ProSer: 1.978 ± 0.916
1.13ProThr: 1.13 ± 0.448
1.413ProVal: 1.413 ± 0.601
0.283ProTrp: 0.283 ± 0.276
1.695ProTyr: 1.695 ± 0.699
0.0ProXaa: 0.0 ± 0.0
Gln
3.956GlnAla: 3.956 ± 0.989
0.283GlnCys: 0.283 ± 0.285
2.261GlnAsp: 2.261 ± 0.758
5.369GlnGlu: 5.369 ± 1.33
0.848GlnPhe: 0.848 ± 0.408
2.826GlnGly: 2.826 ± 0.66
0.0GlnHis: 0.0 ± 0.0
1.695GlnIle: 1.695 ± 0.524
3.673GlnLys: 3.673 ± 1.037
5.369GlnLeu: 5.369 ± 0.773
0.283GlnMet: 0.283 ± 0.282
1.978GlnAsn: 1.978 ± 0.964
2.826GlnPro: 2.826 ± 0.963
1.978GlnGln: 1.978 ± 0.674
1.978GlnArg: 1.978 ± 0.783
2.826GlnSer: 2.826 ± 1.092
2.826GlnThr: 2.826 ± 0.713
2.826GlnVal: 2.826 ± 0.735
0.565GlnTrp: 0.565 ± 0.646
1.978GlnTyr: 1.978 ± 0.822
0.0GlnXaa: 0.0 ± 0.0
Arg
2.826ArgAla: 2.826 ± 1.117
0.565ArgCys: 0.565 ± 0.359
1.695ArgAsp: 1.695 ± 0.568
3.108ArgGlu: 3.108 ± 0.878
2.543ArgPhe: 2.543 ± 0.707
2.826ArgGly: 2.826 ± 0.968
0.565ArgHis: 0.565 ± 0.343
3.108ArgIle: 3.108 ± 0.914
6.216ArgLys: 6.216 ± 1.284
3.673ArgLeu: 3.673 ± 0.677
0.848ArgMet: 0.848 ± 0.537
3.391ArgAsn: 3.391 ± 1.167
1.13ArgPro: 1.13 ± 0.535
2.826ArgGln: 2.826 ± 0.844
2.543ArgArg: 2.543 ± 0.691
2.261ArgSer: 2.261 ± 1.005
2.826ArgThr: 2.826 ± 0.994
3.956ArgVal: 3.956 ± 0.848
0.848ArgTrp: 0.848 ± 0.459
2.261ArgTyr: 2.261 ± 0.747
0.0ArgXaa: 0.0 ± 0.0
Ser
3.391SerAla: 3.391 ± 1.368
0.283SerCys: 0.283 ± 0.318
5.934SerAsp: 5.934 ± 0.969
5.086SerGlu: 5.086 ± 1.272
3.391SerPhe: 3.391 ± 0.707
1.413SerGly: 1.413 ± 0.588
1.13SerHis: 1.13 ± 0.596
3.108SerIle: 3.108 ± 0.811
7.912SerLys: 7.912 ± 1.206
3.673SerLeu: 3.673 ± 1.062
0.848SerMet: 0.848 ± 0.441
1.695SerAsn: 1.695 ± 0.843
0.848SerPro: 0.848 ± 0.567
5.086SerGln: 5.086 ± 1.058
2.261SerArg: 2.261 ± 0.854
4.238SerSer: 4.238 ± 2.254
3.391SerThr: 3.391 ± 1.121
2.543SerVal: 2.543 ± 0.856
0.283SerTrp: 0.283 ± 0.251
5.086SerTyr: 5.086 ± 1.156
0.0SerXaa: 0.0 ± 0.0
Thr
4.238ThrAla: 4.238 ± 0.924
0.283ThrCys: 0.283 ± 0.29
4.238ThrAsp: 4.238 ± 0.973
3.673ThrGlu: 3.673 ± 0.976
2.826ThrPhe: 2.826 ± 1.269
4.521ThrGly: 4.521 ± 0.859
1.13ThrHis: 1.13 ± 0.648
3.108ThrIle: 3.108 ± 1.03
5.086ThrLys: 5.086 ± 1.269
5.086ThrLeu: 5.086 ± 1.372
1.13ThrMet: 1.13 ± 0.68
3.673ThrAsn: 3.673 ± 0.839
3.108ThrPro: 3.108 ± 1.139
4.804ThrGln: 4.804 ± 0.853
3.391ThrArg: 3.391 ± 0.806
4.521ThrSer: 4.521 ± 1.39
4.521ThrThr: 4.521 ± 1.923
3.956ThrVal: 3.956 ± 1.149
0.565ThrTrp: 0.565 ± 0.361
2.826ThrTyr: 2.826 ± 0.746
0.0ThrXaa: 0.0 ± 0.0
Val
2.826ValAla: 2.826 ± 0.93
0.283ValCys: 0.283 ± 0.256
1.978ValAsp: 1.978 ± 0.621
3.956ValGlu: 3.956 ± 1.091
3.391ValPhe: 3.391 ± 0.753
1.978ValGly: 1.978 ± 0.543
0.848ValHis: 0.848 ± 0.559
2.826ValIle: 2.826 ± 0.945
5.369ValLys: 5.369 ± 0.893
4.238ValLeu: 4.238 ± 1.033
1.13ValMet: 1.13 ± 0.562
1.695ValAsn: 1.695 ± 0.685
1.13ValPro: 1.13 ± 0.385
1.695ValGln: 1.695 ± 0.616
1.413ValArg: 1.413 ± 0.496
4.238ValSer: 4.238 ± 1.083
6.216ValThr: 6.216 ± 1.434
3.108ValVal: 3.108 ± 0.631
0.565ValTrp: 0.565 ± 0.552
1.13ValTyr: 1.13 ± 0.497
0.0ValXaa: 0.0 ± 0.0
Trp
0.283TrpAla: 0.283 ± 0.271
0.0TrpCys: 0.0 ± 0.0
0.848TrpAsp: 0.848 ± 0.567
0.848TrpGlu: 0.848 ± 0.469
0.283TrpPhe: 0.283 ± 0.276
0.283TrpGly: 0.283 ± 0.321
0.0TrpHis: 0.0 ± 0.0
0.283TrpIle: 0.283 ± 0.323
1.13TrpLys: 1.13 ± 0.464
1.13TrpLeu: 1.13 ± 0.539
0.0TrpMet: 0.0 ± 0.0
0.283TrpAsn: 0.283 ± 0.271
0.0TrpPro: 0.0 ± 0.0
0.565TrpGln: 0.565 ± 0.412
0.0TrpArg: 0.0 ± 0.0
1.13TrpSer: 1.13 ± 0.431
0.283TrpThr: 0.283 ± 0.276
0.848TrpVal: 0.848 ± 0.428
0.565TrpTrp: 0.565 ± 0.382
0.283TrpTyr: 0.283 ± 0.251
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.413TyrAla: 1.413 ± 0.664
0.283TyrCys: 0.283 ± 0.28
2.826TyrAsp: 2.826 ± 0.934
4.804TyrGlu: 4.804 ± 1.109
2.543TyrPhe: 2.543 ± 0.778
1.413TyrGly: 1.413 ± 0.564
2.261TyrHis: 2.261 ± 0.706
1.695TyrIle: 1.695 ± 0.802
4.804TyrLys: 4.804 ± 1.488
4.521TyrLeu: 4.521 ± 0.728
1.413TyrMet: 1.413 ± 0.765
3.391TyrAsn: 3.391 ± 0.843
1.695TyrPro: 1.695 ± 0.79
2.543TyrGln: 2.543 ± 0.75
5.086TyrArg: 5.086 ± 1.316
2.261TyrSer: 2.261 ± 0.516
2.543TyrThr: 2.543 ± 0.903
1.13TyrVal: 1.13 ± 0.575
0.565TyrTrp: 0.565 ± 0.58
3.673TyrTyr: 3.673 ± 1.188
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 21 proteins (3540 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski