Amino acid dipepetide frequency for Streptococcus satellite phage Javan323

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.563AlaAla: 0.563 ± 0.37
0.563AlaCys: 0.563 ± 0.408
0.844AlaAsp: 0.844 ± 0.451
3.376AlaGlu: 3.376 ± 0.74
1.406AlaPhe: 1.406 ± 0.778
1.969AlaGly: 1.969 ± 0.762
0.563AlaHis: 0.563 ± 0.344
2.532AlaIle: 2.532 ± 0.94
3.376AlaLys: 3.376 ± 0.706
3.938AlaLeu: 3.938 ± 1.096
0.563AlaMet: 0.563 ± 0.412
2.813AlaAsn: 2.813 ± 1.069
1.969AlaPro: 1.969 ± 0.601
1.688AlaGln: 1.688 ± 0.695
2.532AlaArg: 2.532 ± 0.81
4.782AlaSer: 4.782 ± 1.257
4.501AlaThr: 4.501 ± 1.168
5.063AlaVal: 5.063 ± 0.929
0.563AlaTrp: 0.563 ± 0.359
3.376AlaTyr: 3.376 ± 0.974
0.0AlaXaa: 0.0 ± 0.0
Cys
0.281CysAla: 0.281 ± 0.248
0.0CysCys: 0.0 ± 0.0
1.406CysAsp: 1.406 ± 0.573
0.563CysGlu: 0.563 ± 0.408
0.281CysPhe: 0.281 ± 0.311
1.406CysGly: 1.406 ± 0.598
0.0CysHis: 0.0 ± 0.0
0.563CysIle: 0.563 ± 0.346
0.281CysLys: 0.281 ± 0.251
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.281CysAsn: 0.281 ± 0.248
1.406CysPro: 1.406 ± 0.629
0.563CysGln: 0.563 ± 0.47
0.844CysArg: 0.844 ± 0.445
1.125CysSer: 1.125 ± 0.668
0.0CysThr: 0.0 ± 0.0
0.844CysVal: 0.844 ± 0.37
0.281CysTrp: 0.281 ± 0.248
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.406AspAla: 1.406 ± 0.531
0.0AspCys: 0.0 ± 0.0
1.969AspAsp: 1.969 ± 1.344
4.501AspGlu: 4.501 ± 1.279
2.25AspPhe: 2.25 ± 0.804
4.782AspGly: 4.782 ± 1.019
0.0AspHis: 0.0 ± 0.0
6.751AspIle: 6.751 ± 1.197
6.47AspLys: 6.47 ± 1.381
3.376AspLeu: 3.376 ± 1.157
1.125AspMet: 1.125 ± 0.576
3.376AspAsn: 3.376 ± 1.017
1.125AspPro: 1.125 ± 0.422
0.0AspGln: 0.0 ± 0.0
1.406AspArg: 1.406 ± 0.488
4.219AspSer: 4.219 ± 0.702
3.938AspThr: 3.938 ± 1.059
2.25AspVal: 2.25 ± 0.919
0.563AspTrp: 0.563 ± 0.392
3.376AspTyr: 3.376 ± 1.37
0.0AspXaa: 0.0 ± 0.0
Glu
6.188GluAla: 6.188 ± 1.287
0.0GluCys: 0.0 ± 0.0
5.063GluAsp: 5.063 ± 1.019
7.595GluGlu: 7.595 ± 1.22
3.657GluPhe: 3.657 ± 0.963
4.219GluGly: 4.219 ± 1.242
1.125GluHis: 1.125 ± 0.679
6.47GluIle: 6.47 ± 1.111
8.439GluLys: 8.439 ± 1.909
7.032GluLeu: 7.032 ± 1.323
1.969GluMet: 1.969 ± 0.832
3.938GluAsn: 3.938 ± 0.955
1.406GluPro: 1.406 ± 0.727
3.376GluGln: 3.376 ± 1.003
4.219GluArg: 4.219 ± 1.272
3.657GluSer: 3.657 ± 0.995
4.501GluThr: 4.501 ± 0.773
6.188GluVal: 6.188 ± 0.8
0.281GluTrp: 0.281 ± 0.264
3.094GluTyr: 3.094 ± 1.523
0.0GluXaa: 0.0 ± 0.0
Phe
1.969PheAla: 1.969 ± 0.922
0.0PheCys: 0.0 ± 0.0
2.813PheAsp: 2.813 ± 0.797
3.657PheGlu: 3.657 ± 0.905
1.125PhePhe: 1.125 ± 0.596
1.406PheGly: 1.406 ± 0.786
0.844PheHis: 0.844 ± 0.374
3.094PheIle: 3.094 ± 0.73
4.219PheLys: 4.219 ± 0.94
3.094PheLeu: 3.094 ± 1.209
0.281PheMet: 0.281 ± 0.252
3.094PheAsn: 3.094 ± 0.659
1.406PhePro: 1.406 ± 0.618
1.125PheGln: 1.125 ± 0.359
1.125PheArg: 1.125 ± 0.39
1.406PheSer: 1.406 ± 0.482
1.969PheThr: 1.969 ± 0.737
0.844PheVal: 0.844 ± 0.473
0.563PheTrp: 0.563 ± 0.543
1.969PheTyr: 1.969 ± 0.6
0.0PheXaa: 0.0 ± 0.0
Gly
2.25GlyAla: 2.25 ± 0.871
1.125GlyCys: 1.125 ± 0.493
3.376GlyAsp: 3.376 ± 1.468
3.938GlyGlu: 3.938 ± 0.826
2.813GlyPhe: 2.813 ± 0.81
3.094GlyGly: 3.094 ± 1.13
1.125GlyHis: 1.125 ± 0.514
5.626GlyIle: 5.626 ± 0.844
3.938GlyLys: 3.938 ± 1.319
5.063GlyLeu: 5.063 ± 1.418
0.0GlyMet: 0.0 ± 0.0
2.25GlyAsn: 2.25 ± 0.726
1.125GlyPro: 1.125 ± 0.529
1.969GlyGln: 1.969 ± 0.598
3.094GlyArg: 3.094 ± 0.673
3.657GlySer: 3.657 ± 1.226
3.938GlyThr: 3.938 ± 1.262
3.094GlyVal: 3.094 ± 0.733
0.281GlyTrp: 0.281 ± 0.271
2.25GlyTyr: 2.25 ± 0.656
0.0GlyXaa: 0.0 ± 0.0
His
0.844HisAla: 0.844 ± 0.825
0.563HisCys: 0.563 ± 0.479
0.563HisAsp: 0.563 ± 0.396
0.281HisGlu: 0.281 ± 0.248
0.563HisPhe: 0.563 ± 0.374
0.844HisGly: 0.844 ± 0.491
0.0HisHis: 0.0 ± 0.0
1.969HisIle: 1.969 ± 0.677
2.532HisLys: 2.532 ± 0.89
1.406HisLeu: 1.406 ± 0.613
0.0HisMet: 0.0 ± 0.0
1.406HisAsn: 1.406 ± 0.529
1.688HisPro: 1.688 ± 0.7
0.281HisGln: 0.281 ± 0.311
0.563HisArg: 0.563 ± 0.412
1.688HisSer: 1.688 ± 0.58
1.969HisThr: 1.969 ± 0.71
1.125HisVal: 1.125 ± 0.597
0.281HisTrp: 0.281 ± 0.275
0.563HisTyr: 0.563 ± 0.36
0.0HisXaa: 0.0 ± 0.0
Ile
4.501IleAla: 4.501 ± 0.781
1.125IleCys: 1.125 ± 0.64
6.751IleAsp: 6.751 ± 1.056
7.314IleGlu: 7.314 ± 0.966
1.969IlePhe: 1.969 ± 0.805
4.219IleGly: 4.219 ± 0.8
1.969IleHis: 1.969 ± 0.599
8.439IleIle: 8.439 ± 2.082
7.314IleLys: 7.314 ± 1.397
8.158IleLeu: 8.158 ± 1.322
1.969IleMet: 1.969 ± 0.71
5.345IleAsn: 5.345 ± 1.156
4.219IlePro: 4.219 ± 1.011
1.969IleGln: 1.969 ± 0.607
2.532IleArg: 2.532 ± 0.636
3.657IleSer: 3.657 ± 1.214
6.188IleThr: 6.188 ± 1.386
5.907IleVal: 5.907 ± 1.493
0.844IleTrp: 0.844 ± 0.452
3.094IleTyr: 3.094 ± 0.693
0.0IleXaa: 0.0 ± 0.0
Lys
6.751LysAla: 6.751 ± 1.843
0.563LysCys: 0.563 ± 0.37
4.219LysAsp: 4.219 ± 1.182
8.158LysGlu: 8.158 ± 1.404
2.25LysPhe: 2.25 ± 0.78
5.063LysGly: 5.063 ± 1.469
1.969LysHis: 1.969 ± 0.568
10.689LysIle: 10.689 ± 1.859
8.72LysLys: 8.72 ± 1.799
4.782LysLeu: 4.782 ± 1.126
1.125LysMet: 1.125 ± 0.652
4.501LysAsn: 4.501 ± 0.874
2.25LysPro: 2.25 ± 0.547
4.501LysGln: 4.501 ± 1.032
4.219LysArg: 4.219 ± 1.149
6.47LysSer: 6.47 ± 1.485
5.907LysThr: 5.907 ± 1.448
4.501LysVal: 4.501 ± 0.9
0.844LysTrp: 0.844 ± 0.4
3.094LysTyr: 3.094 ± 0.784
0.0LysXaa: 0.0 ± 0.0
Leu
6.188LeuAla: 6.188 ± 1.927
0.844LeuCys: 0.844 ± 0.549
4.219LeuAsp: 4.219 ± 1.048
6.188LeuGlu: 6.188 ± 1.606
3.376LeuPhe: 3.376 ± 0.571
5.626LeuGly: 5.626 ± 1.504
2.532LeuHis: 2.532 ± 0.928
9.001LeuIle: 9.001 ± 1.748
7.595LeuLys: 7.595 ± 0.888
11.533LeuLeu: 11.533 ± 1.495
1.969LeuMet: 1.969 ± 0.503
4.782LeuAsn: 4.782 ± 1.102
2.532LeuPro: 2.532 ± 0.866
2.532LeuGln: 2.532 ± 0.813
2.813LeuArg: 2.813 ± 0.644
6.188LeuSer: 6.188 ± 1.295
4.782LeuThr: 4.782 ± 1.055
1.406LeuVal: 1.406 ± 0.609
0.844LeuTrp: 0.844 ± 0.492
2.813LeuTyr: 2.813 ± 0.602
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.281MetCys: 0.281 ± 0.264
1.406MetAsp: 1.406 ± 0.818
2.25MetGlu: 2.25 ± 1.055
0.844MetPhe: 0.844 ± 0.792
1.125MetGly: 1.125 ± 0.399
0.0MetHis: 0.0 ± 0.0
1.406MetIle: 1.406 ± 0.571
1.406MetLys: 1.406 ± 0.666
1.125MetLeu: 1.125 ± 0.522
0.281MetMet: 0.281 ± 0.358
2.25MetAsn: 2.25 ± 0.816
0.281MetPro: 0.281 ± 0.275
0.563MetGln: 0.563 ± 0.315
0.0MetArg: 0.0 ± 0.0
0.281MetSer: 0.281 ± 0.252
1.688MetThr: 1.688 ± 0.504
0.563MetVal: 0.563 ± 0.346
0.0MetTrp: 0.0 ± 0.0
0.281MetTyr: 0.281 ± 0.296
0.0MetXaa: 0.0 ± 0.0
Asn
2.25AsnAla: 2.25 ± 0.838
0.844AsnCys: 0.844 ± 0.381
2.813AsnAsp: 2.813 ± 0.734
4.501AsnGlu: 4.501 ± 0.802
1.125AsnPhe: 1.125 ± 0.644
3.657AsnGly: 3.657 ± 1.625
0.844AsnHis: 0.844 ± 0.597
4.501AsnIle: 4.501 ± 0.897
7.314AsnLys: 7.314 ± 1.432
4.782AsnLeu: 4.782 ± 1.378
0.281AsnMet: 0.281 ± 0.251
4.501AsnAsn: 4.501 ± 1.388
0.844AsnPro: 0.844 ± 0.363
2.25AsnGln: 2.25 ± 1.073
1.406AsnArg: 1.406 ± 0.623
4.219AsnSer: 4.219 ± 0.812
3.938AsnThr: 3.938 ± 0.842
2.813AsnVal: 2.813 ± 1.192
0.844AsnTrp: 0.844 ± 0.355
1.406AsnTyr: 1.406 ± 0.575
0.0AsnXaa: 0.0 ± 0.0
Pro
0.563ProAla: 0.563 ± 0.356
0.281ProCys: 0.281 ± 0.264
1.125ProAsp: 1.125 ± 0.541
1.688ProGlu: 1.688 ± 0.74
1.125ProPhe: 1.125 ± 0.456
0.844ProGly: 0.844 ± 0.365
1.125ProHis: 1.125 ± 0.474
2.25ProIle: 2.25 ± 0.675
2.25ProLys: 2.25 ± 0.581
1.969ProLeu: 1.969 ± 0.575
0.563ProMet: 0.563 ± 0.339
2.25ProAsn: 2.25 ± 1.019
0.563ProPro: 0.563 ± 0.38
3.094ProGln: 3.094 ± 0.933
1.688ProArg: 1.688 ± 0.494
1.406ProSer: 1.406 ± 0.481
2.813ProThr: 2.813 ± 0.961
2.532ProVal: 2.532 ± 0.847
0.844ProTrp: 0.844 ± 0.472
1.688ProTyr: 1.688 ± 0.816
0.0ProXaa: 0.0 ± 0.0
Gln
1.406GlnAla: 1.406 ± 0.591
0.0GlnCys: 0.0 ± 0.0
2.532GlnAsp: 2.532 ± 1.18
3.094GlnGlu: 3.094 ± 0.862
1.969GlnPhe: 1.969 ± 0.564
1.688GlnGly: 1.688 ± 0.591
1.688GlnHis: 1.688 ± 0.804
2.25GlnIle: 2.25 ± 0.76
2.813GlnLys: 2.813 ± 0.703
2.25GlnLeu: 2.25 ± 0.829
0.844GlnMet: 0.844 ± 0.413
3.094GlnAsn: 3.094 ± 0.833
0.563GlnPro: 0.563 ± 0.339
0.281GlnGln: 0.281 ± 0.271
0.844GlnArg: 0.844 ± 0.616
3.938GlnSer: 3.938 ± 1.339
2.532GlnThr: 2.532 ± 0.657
2.25GlnVal: 2.25 ± 0.741
0.281GlnTrp: 0.281 ± 0.341
1.688GlnTyr: 1.688 ± 0.472
0.0GlnXaa: 0.0 ± 0.0
Arg
1.969ArgAla: 1.969 ± 0.546
0.281ArgCys: 0.281 ± 0.26
1.406ArgAsp: 1.406 ± 0.434
3.938ArgGlu: 3.938 ± 1.803
1.406ArgPhe: 1.406 ± 0.577
1.969ArgGly: 1.969 ± 0.814
0.844ArgHis: 0.844 ± 0.443
3.094ArgIle: 3.094 ± 0.784
2.813ArgLys: 2.813 ± 0.879
5.907ArgLeu: 5.907 ± 1.32
1.125ArgMet: 1.125 ± 0.406
2.25ArgAsn: 2.25 ± 0.749
0.844ArgPro: 0.844 ± 0.385
1.406ArgGln: 1.406 ± 0.786
1.688ArgArg: 1.688 ± 1.0
1.688ArgSer: 1.688 ± 0.64
2.532ArgThr: 2.532 ± 0.749
2.813ArgVal: 2.813 ± 1.156
0.563ArgTrp: 0.563 ± 0.459
0.281ArgTyr: 0.281 ± 0.248
0.0ArgXaa: 0.0 ± 0.0
Ser
2.813SerAla: 2.813 ± 1.034
0.563SerCys: 0.563 ± 0.368
3.938SerAsp: 3.938 ± 0.884
5.063SerGlu: 5.063 ± 0.811
3.657SerPhe: 3.657 ± 1.066
3.376SerGly: 3.376 ± 0.668
0.844SerHis: 0.844 ± 0.469
5.626SerIle: 5.626 ± 1.718
6.751SerLys: 6.751 ± 1.334
5.907SerLeu: 5.907 ± 1.654
1.406SerMet: 1.406 ± 0.716
3.376SerAsn: 3.376 ± 1.339
2.813SerPro: 2.813 ± 0.853
3.376SerGln: 3.376 ± 1.066
1.406SerArg: 1.406 ± 0.474
18.284SerSer: 18.284 ± 5.625
4.501SerThr: 4.501 ± 1.074
3.094SerVal: 3.094 ± 0.816
0.563SerTrp: 0.563 ± 0.368
2.532SerTyr: 2.532 ± 1.361
0.0SerXaa: 0.0 ± 0.0
Thr
4.501ThrAla: 4.501 ± 1.036
0.844ThrCys: 0.844 ± 0.448
2.813ThrAsp: 2.813 ± 0.818
5.345ThrGlu: 5.345 ± 1.052
1.688ThrPhe: 1.688 ± 1.069
4.219ThrGly: 4.219 ± 1.224
1.125ThrHis: 1.125 ± 0.568
5.063ThrIle: 5.063 ± 1.574
3.657ThrLys: 3.657 ± 1.161
6.188ThrLeu: 6.188 ± 1.523
0.844ThrMet: 0.844 ± 0.632
1.406ThrAsn: 1.406 ± 0.634
1.969ThrPro: 1.969 ± 0.869
2.813ThrGln: 2.813 ± 0.802
3.094ThrArg: 3.094 ± 0.857
5.063ThrSer: 5.063 ± 1.533
5.345ThrThr: 5.345 ± 1.766
5.626ThrVal: 5.626 ± 1.178
0.563ThrTrp: 0.563 ± 0.406
3.657ThrTyr: 3.657 ± 0.877
0.0ThrXaa: 0.0 ± 0.0
Val
1.406ValAla: 1.406 ± 0.553
1.688ValCys: 1.688 ± 0.949
1.969ValAsp: 1.969 ± 0.626
5.907ValGlu: 5.907 ± 1.269
2.532ValPhe: 2.532 ± 0.701
1.406ValGly: 1.406 ± 0.575
0.844ValHis: 0.844 ± 0.618
3.938ValIle: 3.938 ± 1.131
7.032ValLys: 7.032 ± 0.913
5.063ValLeu: 5.063 ± 1.289
1.125ValMet: 1.125 ± 0.591
2.813ValAsn: 2.813 ± 0.727
2.532ValPro: 2.532 ± 0.989
1.125ValGln: 1.125 ± 0.62
1.688ValArg: 1.688 ± 0.672
5.626ValSer: 5.626 ± 0.905
3.094ValThr: 3.094 ± 0.881
3.657ValVal: 3.657 ± 0.96
0.0ValTrp: 0.0 ± 0.0
5.063ValTyr: 5.063 ± 0.73
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.281TrpCys: 0.281 ± 0.296
1.125TrpAsp: 1.125 ± 0.65
0.844TrpGlu: 0.844 ± 0.492
0.0TrpPhe: 0.0 ± 0.0
0.563TrpGly: 0.563 ± 0.3
0.563TrpHis: 0.563 ± 0.459
0.844TrpIle: 0.844 ± 0.598
0.563TrpLys: 0.563 ± 0.322
0.844TrpLeu: 0.844 ± 0.434
0.0TrpMet: 0.0 ± 0.0
0.281TrpAsn: 0.281 ± 0.311
0.0TrpPro: 0.0 ± 0.0
0.844TrpGln: 0.844 ± 0.447
0.563TrpArg: 0.563 ± 0.339
0.844TrpSer: 0.844 ± 0.392
0.281TrpThr: 0.281 ± 0.296
1.125TrpVal: 1.125 ± 0.79
0.281TrpTrp: 0.281 ± 0.275
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.125TyrAla: 1.125 ± 0.662
0.281TyrCys: 0.281 ± 0.311
2.532TyrAsp: 2.532 ± 0.59
4.219TyrGlu: 4.219 ± 0.967
1.969TyrPhe: 1.969 ± 0.847
2.532TyrGly: 2.532 ± 0.939
1.125TyrHis: 1.125 ± 0.5
3.376TyrIle: 3.376 ± 1.353
3.094TyrLys: 3.094 ± 0.565
5.345TyrLeu: 5.345 ± 1.097
0.281TyrMet: 0.281 ± 0.26
1.125TyrAsn: 1.125 ± 0.685
1.125TyrPro: 1.125 ± 0.426
2.25TyrGln: 2.25 ± 0.711
3.094TyrArg: 3.094 ± 1.066
1.969TyrSer: 1.969 ± 0.879
1.406TyrThr: 1.406 ± 0.818
2.813TyrVal: 2.813 ± 0.955
0.563TyrTrp: 0.563 ± 0.365
1.688TyrTyr: 1.688 ± 0.605
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (3556 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski