Amino acid dipepetide frequency for Streptococcus satellite phage Javan620

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.346AlaCys: 0.346 ± 0.368
4.154AlaAsp: 4.154 ± 1.115
5.192AlaGlu: 5.192 ± 1.247
2.423AlaPhe: 2.423 ± 0.923
2.769AlaGly: 2.769 ± 0.84
0.346AlaHis: 0.346 ± 0.289
5.192AlaIle: 5.192 ± 1.602
3.808AlaLys: 3.808 ± 1.206
5.192AlaLeu: 5.192 ± 1.593
2.077AlaMet: 2.077 ± 0.843
3.115AlaAsn: 3.115 ± 1.221
2.423AlaPro: 2.423 ± 0.938
2.423AlaGln: 2.423 ± 1.254
1.038AlaArg: 1.038 ± 0.537
3.461AlaSer: 3.461 ± 1.038
1.731AlaThr: 1.731 ± 0.666
2.423AlaVal: 2.423 ± 0.873
0.346AlaTrp: 0.346 ± 0.343
2.769AlaTyr: 2.769 ± 0.77
0.0AlaXaa: 0.0 ± 0.0
Cys
0.692CysAla: 0.692 ± 0.703
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.692CysPhe: 0.692 ± 0.471
0.346CysGly: 0.346 ± 0.355
0.0CysHis: 0.0 ± 0.0
1.038CysIle: 1.038 ± 0.632
0.346CysLys: 0.346 ± 0.335
0.346CysLeu: 0.346 ± 0.351
0.346CysMet: 0.346 ± 0.368
0.692CysAsn: 0.692 ± 0.54
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.692CysArg: 0.692 ± 0.478
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.346CysVal: 0.346 ± 0.286
0.346CysTrp: 0.346 ± 0.377
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.038AspAla: 1.038 ± 0.534
0.692AspCys: 0.692 ± 0.55
5.538AspAsp: 5.538 ± 1.494
2.769AspGlu: 2.769 ± 0.901
3.808AspPhe: 3.808 ± 1.01
3.461AspGly: 3.461 ± 0.8
0.346AspHis: 0.346 ± 0.409
8.307AspIle: 8.307 ± 1.373
4.154AspLys: 4.154 ± 1.303
6.231AspLeu: 6.231 ± 1.393
2.077AspMet: 2.077 ± 0.748
4.5AspAsn: 4.5 ± 1.554
1.385AspPro: 1.385 ± 0.601
3.461AspGln: 3.461 ± 0.944
2.423AspArg: 2.423 ± 0.994
4.846AspSer: 4.846 ± 1.34
1.731AspThr: 1.731 ± 0.682
0.692AspVal: 0.692 ± 0.411
0.346AspTrp: 0.346 ± 0.372
5.538AspTyr: 5.538 ± 1.393
0.0AspXaa: 0.0 ± 0.0
Glu
4.154GluAla: 4.154 ± 1.158
0.346GluCys: 0.346 ± 0.351
4.846GluAsp: 4.846 ± 1.001
4.154GluGlu: 4.154 ± 0.984
4.154GluPhe: 4.154 ± 1.282
2.423GluGly: 2.423 ± 1.133
3.115GluHis: 3.115 ± 1.166
6.577GluIle: 6.577 ± 1.599
8.654GluLys: 8.654 ± 1.533
12.115GluLeu: 12.115 ± 1.493
2.077GluMet: 2.077 ± 0.84
5.884GluAsn: 5.884 ± 1.2
1.385GluPro: 1.385 ± 0.656
5.192GluGln: 5.192 ± 1.287
2.769GluArg: 2.769 ± 1.368
4.5GluSer: 4.5 ± 1.283
4.154GluThr: 4.154 ± 1.079
4.5GluVal: 4.5 ± 1.269
0.0GluTrp: 0.0 ± 0.0
4.154GluTyr: 4.154 ± 0.881
0.0GluXaa: 0.0 ± 0.0
Phe
0.692PheAla: 0.692 ± 0.456
0.0PheCys: 0.0 ± 0.0
2.423PheAsp: 2.423 ± 1.316
5.538PheGlu: 5.538 ± 1.561
3.115PhePhe: 3.115 ± 1.187
2.423PheGly: 2.423 ± 0.736
0.692PheHis: 0.692 ± 0.435
2.423PheIle: 2.423 ± 0.964
4.154PheLys: 4.154 ± 0.744
5.884PheLeu: 5.884 ± 1.204
1.038PheMet: 1.038 ± 0.609
2.423PheAsn: 2.423 ± 0.772
0.692PhePro: 0.692 ± 0.401
1.038PheGln: 1.038 ± 0.597
1.385PheArg: 1.385 ± 0.761
2.423PheSer: 2.423 ± 0.786
2.769PheThr: 2.769 ± 1.509
2.077PheVal: 2.077 ± 1.193
0.692PheTrp: 0.692 ± 0.443
1.038PheTyr: 1.038 ± 0.667
0.0PheXaa: 0.0 ± 0.0
Gly
2.077GlyAla: 2.077 ± 0.842
0.346GlyCys: 0.346 ± 0.343
1.731GlyAsp: 1.731 ± 0.707
1.731GlyGlu: 1.731 ± 0.521
2.077GlyPhe: 2.077 ± 0.811
2.423GlyGly: 2.423 ± 1.065
0.692GlyHis: 0.692 ± 0.445
4.154GlyIle: 4.154 ± 1.147
3.461GlyLys: 3.461 ± 1.146
3.115GlyLeu: 3.115 ± 0.978
0.692GlyMet: 0.692 ± 0.538
2.077GlyAsn: 2.077 ± 0.864
0.0GlyPro: 0.0 ± 0.0
1.385GlyGln: 1.385 ± 0.667
2.423GlyArg: 2.423 ± 0.996
2.769GlySer: 2.769 ± 1.039
1.038GlyThr: 1.038 ± 0.792
2.769GlyVal: 2.769 ± 0.938
0.0GlyTrp: 0.0 ± 0.0
2.769GlyTyr: 2.769 ± 1.533
0.0GlyXaa: 0.0 ± 0.0
His
1.731HisAla: 1.731 ± 0.845
0.0HisCys: 0.0 ± 0.0
1.385HisAsp: 1.385 ± 0.578
2.077HisGlu: 2.077 ± 0.703
1.385HisPhe: 1.385 ± 0.735
1.038HisGly: 1.038 ± 0.535
0.346HisHis: 0.346 ± 0.351
1.731HisIle: 1.731 ± 0.774
2.077HisLys: 2.077 ± 0.833
1.385HisLeu: 1.385 ± 0.542
0.0HisMet: 0.0 ± 0.0
0.346HisAsn: 0.346 ± 0.409
0.346HisPro: 0.346 ± 0.289
0.346HisGln: 0.346 ± 0.335
0.0HisArg: 0.0 ± 0.0
0.346HisSer: 0.346 ± 0.343
1.038HisThr: 1.038 ± 0.681
0.346HisVal: 0.346 ± 0.381
0.0HisTrp: 0.0 ± 0.0
0.346HisTyr: 0.346 ± 0.343
0.0HisXaa: 0.0 ± 0.0
Ile
5.884IleAla: 5.884 ± 2.385
0.692IleCys: 0.692 ± 0.711
6.923IleAsp: 6.923 ± 1.164
7.961IleGlu: 7.961 ± 1.767
0.346IlePhe: 0.346 ± 0.332
1.385IleGly: 1.385 ± 0.532
1.385IleHis: 1.385 ± 0.89
6.577IleIle: 6.577 ± 1.76
6.577IleLys: 6.577 ± 1.676
7.615IleLeu: 7.615 ± 2.139
1.731IleMet: 1.731 ± 0.611
7.615IleAsn: 7.615 ± 1.502
3.115IlePro: 3.115 ± 0.989
2.423IleGln: 2.423 ± 0.824
2.769IleArg: 2.769 ± 0.869
5.884IleSer: 5.884 ± 1.171
9.346IleThr: 9.346 ± 1.341
1.385IleVal: 1.385 ± 0.714
0.0IleTrp: 0.0 ± 0.0
4.5IleTyr: 4.5 ± 1.256
0.0IleXaa: 0.0 ± 0.0
Lys
6.577LysAla: 6.577 ± 1.623
0.346LysCys: 0.346 ± 0.335
5.538LysAsp: 5.538 ± 1.712
10.384LysGlu: 10.384 ± 1.541
2.077LysPhe: 2.077 ± 0.616
3.115LysGly: 3.115 ± 0.86
1.385LysHis: 1.385 ± 0.765
7.615LysIle: 7.615 ± 1.635
7.961LysLys: 7.961 ± 1.361
8.307LysLeu: 8.307 ± 1.736
2.077LysMet: 2.077 ± 0.882
4.154LysAsn: 4.154 ± 1.01
2.423LysPro: 2.423 ± 0.606
4.5LysGln: 4.5 ± 1.574
7.269LysArg: 7.269 ± 0.907
3.808LysSer: 3.808 ± 0.753
6.577LysThr: 6.577 ± 1.736
4.154LysVal: 4.154 ± 1.431
0.346LysTrp: 0.346 ± 0.338
3.115LysTyr: 3.115 ± 0.998
0.0LysXaa: 0.0 ± 0.0
Leu
6.577LeuAla: 6.577 ± 1.309
0.692LeuCys: 0.692 ± 0.456
6.577LeuAsp: 6.577 ± 1.003
9.0LeuGlu: 9.0 ± 2.117
2.423LeuPhe: 2.423 ± 0.82
4.5LeuGly: 4.5 ± 1.646
1.038LeuHis: 1.038 ± 0.493
6.923LeuIle: 6.923 ± 1.361
11.423LeuLys: 11.423 ± 1.797
10.73LeuLeu: 10.73 ± 1.366
2.077LeuMet: 2.077 ± 0.943
10.038LeuAsn: 10.038 ± 2.353
2.769LeuPro: 2.769 ± 0.728
2.769LeuGln: 2.769 ± 0.926
4.5LeuArg: 4.5 ± 1.214
7.961LeuSer: 7.961 ± 1.493
5.884LeuThr: 5.884 ± 1.076
3.115LeuVal: 3.115 ± 0.968
1.038LeuTrp: 1.038 ± 0.716
3.461LeuTyr: 3.461 ± 0.773
0.0LeuXaa: 0.0 ± 0.0
Met
1.385MetAla: 1.385 ± 0.753
0.0MetCys: 0.0 ± 0.0
2.077MetAsp: 2.077 ± 0.556
1.385MetGlu: 1.385 ± 0.605
0.692MetPhe: 0.692 ± 0.55
0.692MetGly: 0.692 ± 0.411
0.346MetHis: 0.346 ± 0.409
2.423MetIle: 2.423 ± 0.911
2.077MetLys: 2.077 ± 0.884
3.461MetLeu: 3.461 ± 1.186
0.0MetMet: 0.0 ± 0.0
2.423MetAsn: 2.423 ± 0.946
0.692MetPro: 0.692 ± 0.468
1.038MetGln: 1.038 ± 0.695
1.038MetArg: 1.038 ± 0.77
0.346MetSer: 0.346 ± 0.359
2.077MetThr: 2.077 ± 0.789
1.385MetVal: 1.385 ± 0.637
0.0MetTrp: 0.0 ± 0.0
1.385MetTyr: 1.385 ± 0.621
0.0MetXaa: 0.0 ± 0.0
Asn
4.846AsnAla: 4.846 ± 1.322
0.692AsnCys: 0.692 ± 0.52
4.5AsnAsp: 4.5 ± 1.338
7.269AsnGlu: 7.269 ± 1.868
3.808AsnPhe: 3.808 ± 0.91
1.731AsnGly: 1.731 ± 0.579
1.731AsnHis: 1.731 ± 0.903
5.538AsnIle: 5.538 ± 1.349
4.5AsnLys: 4.5 ± 1.283
6.231AsnLeu: 6.231 ± 1.415
1.731AsnMet: 1.731 ± 0.754
4.154AsnAsn: 4.154 ± 0.952
1.731AsnPro: 1.731 ± 0.633
1.731AsnGln: 1.731 ± 0.801
3.461AsnArg: 3.461 ± 0.856
6.231AsnSer: 6.231 ± 0.949
5.192AsnThr: 5.192 ± 1.301
1.731AsnVal: 1.731 ± 0.728
0.692AsnTrp: 0.692 ± 0.407
2.077AsnTyr: 2.077 ± 0.717
0.0AsnXaa: 0.0 ± 0.0
Pro
0.692ProAla: 0.692 ± 0.447
0.0ProCys: 0.0 ± 0.0
1.038ProAsp: 1.038 ± 0.723
3.115ProGlu: 3.115 ± 1.043
1.731ProPhe: 1.731 ± 0.697
0.346ProGly: 0.346 ± 0.338
0.0ProHis: 0.0 ± 0.0
1.038ProIle: 1.038 ± 0.468
2.077ProLys: 2.077 ± 0.813
3.808ProLeu: 3.808 ± 0.854
0.0ProMet: 0.0 ± 0.0
2.769ProAsn: 2.769 ± 0.915
0.346ProPro: 0.346 ± 0.289
0.692ProGln: 0.692 ± 0.492
1.731ProArg: 1.731 ± 0.893
1.731ProSer: 1.731 ± 0.565
1.385ProThr: 1.385 ± 0.568
0.346ProVal: 0.346 ± 0.355
0.0ProTrp: 0.0 ± 0.0
1.385ProTyr: 1.385 ± 0.636
0.0ProXaa: 0.0 ± 0.0
Gln
3.461GlnAla: 3.461 ± 0.876
0.0GlnCys: 0.0 ± 0.0
2.423GlnAsp: 2.423 ± 0.825
4.5GlnGlu: 4.5 ± 1.118
1.385GlnPhe: 1.385 ± 0.563
0.692GlnGly: 0.692 ± 0.512
0.0GlnHis: 0.0 ± 0.0
2.769GlnIle: 2.769 ± 0.613
3.808GlnLys: 3.808 ± 1.409
3.808GlnLeu: 3.808 ± 1.146
1.731GlnMet: 1.731 ± 0.648
2.423GlnAsn: 2.423 ± 0.809
0.346GlnPro: 0.346 ± 0.335
1.731GlnGln: 1.731 ± 0.759
4.154GlnArg: 4.154 ± 1.171
2.423GlnSer: 2.423 ± 0.558
2.077GlnThr: 2.077 ± 0.987
1.038GlnVal: 1.038 ± 0.53
0.692GlnTrp: 0.692 ± 0.538
1.731GlnTyr: 1.731 ± 0.755
0.0GlnXaa: 0.0 ± 0.0
Arg
2.769ArgAla: 2.769 ± 0.697
0.346ArgCys: 0.346 ± 0.351
2.077ArgAsp: 2.077 ± 0.778
4.5ArgGlu: 4.5 ± 1.024
1.038ArgPhe: 1.038 ± 0.648
1.038ArgGly: 1.038 ± 0.608
1.385ArgHis: 1.385 ± 0.739
3.808ArgIle: 3.808 ± 0.862
4.154ArgLys: 4.154 ± 1.238
6.923ArgLeu: 6.923 ± 1.476
2.423ArgMet: 2.423 ± 0.699
2.423ArgAsn: 2.423 ± 0.949
1.038ArgPro: 1.038 ± 0.604
1.385ArgGln: 1.385 ± 0.751
1.731ArgArg: 1.731 ± 0.708
2.077ArgSer: 2.077 ± 0.705
3.808ArgThr: 3.808 ± 1.244
2.077ArgVal: 2.077 ± 0.933
1.038ArgTrp: 1.038 ± 0.581
2.077ArgTyr: 2.077 ± 0.855
0.0ArgXaa: 0.0 ± 0.0
Ser
1.385SerAla: 1.385 ± 0.808
0.346SerCys: 0.346 ± 0.381
4.846SerAsp: 4.846 ± 1.468
3.808SerGlu: 3.808 ± 1.2
3.461SerPhe: 3.461 ± 1.358
3.461SerGly: 3.461 ± 0.903
1.038SerHis: 1.038 ± 0.519
4.846SerIle: 4.846 ± 1.791
6.577SerLys: 6.577 ± 1.255
4.154SerLeu: 4.154 ± 1.231
2.423SerMet: 2.423 ± 0.914
2.077SerAsn: 2.077 ± 0.864
2.077SerPro: 2.077 ± 1.031
4.154SerGln: 4.154 ± 1.529
3.461SerArg: 3.461 ± 0.862
2.423SerSer: 2.423 ± 0.804
1.731SerThr: 1.731 ± 1.086
2.769SerVal: 2.769 ± 1.111
1.038SerTrp: 1.038 ± 0.667
2.077SerTyr: 2.077 ± 1.702
0.0SerXaa: 0.0 ± 0.0
Thr
2.423ThrAla: 2.423 ± 1.03
0.0ThrCys: 0.0 ± 0.0
1.385ThrAsp: 1.385 ± 0.693
3.115ThrGlu: 3.115 ± 0.799
3.461ThrPhe: 3.461 ± 1.081
3.461ThrGly: 3.461 ± 0.98
1.038ThrHis: 1.038 ± 0.678
6.923ThrIle: 6.923 ± 1.442
6.231ThrLys: 6.231 ± 1.643
5.192ThrLeu: 5.192 ± 1.339
0.692ThrMet: 0.692 ± 0.448
5.538ThrAsn: 5.538 ± 1.422
0.692ThrPro: 0.692 ± 0.448
3.115ThrGln: 3.115 ± 1.012
2.077ThrArg: 2.077 ± 1.024
2.423ThrSer: 2.423 ± 0.994
3.808ThrThr: 3.808 ± 0.973
3.808ThrVal: 3.808 ± 0.877
0.692ThrTrp: 0.692 ± 0.466
3.461ThrTyr: 3.461 ± 1.528
0.0ThrXaa: 0.0 ± 0.0
Val
2.423ValAla: 2.423 ± 0.82
0.692ValCys: 0.692 ± 0.493
2.077ValAsp: 2.077 ± 0.887
2.769ValGlu: 2.769 ± 1.196
2.423ValPhe: 2.423 ± 0.905
1.731ValGly: 1.731 ± 0.854
0.346ValHis: 0.346 ± 0.289
3.808ValIle: 3.808 ± 1.32
3.808ValLys: 3.808 ± 0.932
1.385ValLeu: 1.385 ± 0.629
0.692ValMet: 0.692 ± 0.492
3.115ValAsn: 3.115 ± 0.787
1.038ValPro: 1.038 ± 0.475
0.692ValGln: 0.692 ± 0.447
2.423ValArg: 2.423 ± 0.98
1.038ValSer: 1.038 ± 0.529
3.115ValThr: 3.115 ± 0.874
1.385ValVal: 1.385 ± 0.725
0.0ValTrp: 0.0 ± 0.0
2.423ValTyr: 2.423 ± 0.901
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.038TrpAsp: 1.038 ± 0.532
2.077TrpGlu: 2.077 ± 0.829
0.346TrpPhe: 0.346 ± 0.409
0.0TrpGly: 0.0 ± 0.0
0.346TrpHis: 0.346 ± 0.381
0.692TrpIle: 0.692 ± 0.447
1.385TrpLys: 1.385 ± 0.554
0.692TrpLeu: 0.692 ± 0.473
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.346TrpGln: 0.346 ± 0.35
0.346TrpArg: 0.346 ± 0.377
0.346TrpSer: 0.346 ± 0.34
0.346TrpThr: 0.346 ± 0.381
0.0TrpVal: 0.0 ± 0.0
0.346TrpTrp: 0.346 ± 0.343
0.346TrpTyr: 0.346 ± 0.338
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.423TyrAla: 2.423 ± 0.983
0.346TyrCys: 0.346 ± 0.299
2.769TyrAsp: 2.769 ± 0.956
3.808TyrGlu: 3.808 ± 1.337
2.077TyrPhe: 2.077 ± 0.868
0.692TyrGly: 0.692 ± 0.415
0.692TyrHis: 0.692 ± 0.419
1.731TyrIle: 1.731 ± 0.737
4.846TyrLys: 4.846 ± 1.457
6.923TyrLeu: 6.923 ± 1.185
0.692TyrMet: 0.692 ± 0.462
3.808TyrAsn: 3.808 ± 0.921
1.731TyrPro: 1.731 ± 0.71
2.769TyrGln: 2.769 ± 0.929
2.423TyrArg: 2.423 ± 1.02
3.115TyrSer: 3.115 ± 0.833
1.731TyrThr: 1.731 ± 0.969
1.038TyrVal: 1.038 ± 0.624
1.038TyrTrp: 1.038 ± 0.818
1.731TyrTyr: 1.731 ± 0.764
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 19 proteins (2890 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski