Amino acid dipeptide frequency for Streptococcus satellite phage Javan741

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
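As a rough illustration of the counting described above, the following sketch tallies overlapping residue pairs and compares each frequency with the uniform expectation of 2.5‰. The function names, the one-letter alphabet, the toy sequences and the choice to normalise by the total number of counted pairs are all assumptions made for this example; the text does not state how Proteome-pI normalises its values.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code (the table itself uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs."""
    counts = Counter()
    for seq in proteins:
        # Pairs are counted within each protein, never across protein boundaries.
        for a, b in zip(seq, seq[1:]):
            counts[a + b] += 1
    # Assumption: normalise by the total number of counted pairs.
    total = sum(counts.values())
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(STANDARD, repeat=2)
    }


def label(freq, expected=EXPECTED):
    """Colour rule from the text: red if above expectation, blue if below."""
    if freq > expected:
        return "red"
    if freq < expected:
        return "blue"
    return "none"


# Hypothetical toy sequences, not taken from the Javan741 proteome.
toy = ["MKLEELKK", "MLEK"]
freqs = dipeptide_per_mille(toy)
print(freqs["LE"], label(freqs["LE"]), label(freqs["CC"]))
```

With real data the input would be the 26 protein sequences of the proteome; non-standard residues would need an extra Xaa symbol to reproduce the 441-cell layout.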

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
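The unit conversion can be checked with a few lines of Python. This is only a small sketch; the helper names are made up for the example, and the input is the LeuGlu value from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


leu_glu = 13.672  # LeuGlu frequency from the table, in per mille

print(per_mille_to_fraction(leu_glu))  # fraction of all dipeptides
print(per_mille_to_percent(leu_glu))   # the same value in percent

# The uniform expectation of 0.25% corresponds to 2.5 per mille.
uniform_per_mille = 0.25 * 10.0
print(uniform_per_mille)
```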

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.547 ± 0.359
AlaCys: 0.82 ± 0.576
AlaAsp: 4.375 ± 1.262
AlaGlu: 5.196 ± 1.311
AlaPhe: 3.008 ± 0.976
AlaGly: 2.188 ± 0.836
AlaHis: 0.273 ± 0.287
AlaIle: 3.281 ± 0.912
AlaLys: 4.922 ± 0.899
AlaLeu: 6.289 ± 1.129
AlaMet: 1.914 ± 0.892
AlaAsn: 3.281 ± 0.733
AlaPro: 0.547 ± 0.346
AlaGln: 1.641 ± 0.575
AlaArg: 5.742 ± 1.11
AlaSer: 2.188 ± 0.754
AlaThr: 3.828 ± 1.017
AlaVal: 4.102 ± 0.943
AlaTrp: 0.273 ± 0.337
AlaTyr: 1.914 ± 0.756
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.547 ± 0.298
CysCys: 0.0 ± 0.0
CysAsp: 0.547 ± 0.346
CysGlu: 0.0 ± 0.0
CysPhe: 0.273 ± 0.209
CysGly: 0.273 ± 0.315
CysHis: 0.547 ± 0.301
CysIle: 0.82 ± 0.551
CysLys: 0.547 ± 0.405
CysLeu: 1.367 ± 0.5
CysMet: 0.273 ± 0.279
CysAsn: 0.273 ± 0.229
CysPro: 0.273 ± 0.241
CysGln: 0.0 ± 0.0
CysArg: 0.273 ± 0.332
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.273 ± 0.276
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.641 ± 0.561
AspCys: 0.82 ± 0.343
AspAsp: 4.102 ± 1.011
AspGlu: 6.289 ± 1.688
AspPhe: 2.734 ± 0.735
AspGly: 2.734 ± 0.896
AspHis: 0.0 ± 0.0
AspIle: 6.016 ± 0.828
AspLys: 5.196 ± 0.946
AspLeu: 5.742 ± 0.868
AspMet: 2.461 ± 0.815
AspAsn: 3.281 ± 1.058
AspPro: 1.641 ± 0.581
AspGln: 1.367 ± 0.618
AspArg: 2.734 ± 1.019
AspSer: 2.461 ± 0.631
AspThr: 3.008 ± 0.958
AspVal: 1.641 ± 0.727
AspTrp: 0.0 ± 0.0
AspTyr: 3.008 ± 0.921
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.469 ± 1.563
GluCys: 0.547 ± 0.335
GluAsp: 3.281 ± 0.772
GluGlu: 7.93 ± 2.08
GluPhe: 2.734 ± 1.001
GluGly: 3.828 ± 0.96
GluHis: 1.914 ± 0.723
GluIle: 9.024 ± 1.554
GluLys: 9.297 ± 1.861
GluLeu: 10.938 ± 2.05
GluMet: 2.734 ± 0.794
GluAsn: 6.563 ± 1.322
GluPro: 1.367 ± 0.662
GluGln: 4.649 ± 1.121
GluArg: 5.469 ± 1.314
GluSer: 4.649 ± 1.25
GluThr: 4.102 ± 1.164
GluVal: 4.649 ± 1.166
GluTrp: 0.82 ± 0.35
GluTyr: 3.555 ± 0.782
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.641 ± 0.721
PheCys: 0.0 ± 0.0
PheAsp: 4.375 ± 1.0
PheGlu: 2.734 ± 0.965
PhePhe: 2.188 ± 1.03
PheGly: 1.914 ± 0.521
PheHis: 0.547 ± 0.377
PheIle: 3.555 ± 1.046
PheLys: 3.555 ± 0.785
PheLeu: 3.281 ± 0.97
PheMet: 1.914 ± 0.762
PheAsn: 1.641 ± 0.544
PhePro: 0.273 ± 0.209
PheGln: 3.008 ± 0.715
PheArg: 2.461 ± 0.707
PheSer: 2.734 ± 0.845
PheThr: 1.367 ± 0.504
PheVal: 1.641 ± 0.529
PheTrp: 0.547 ± 0.365
PheTyr: 2.461 ± 0.748
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.914 ± 0.727
GlyCys: 0.273 ± 0.241
GlyAsp: 3.008 ± 0.986
GlyGlu: 3.281 ± 0.892
GlyPhe: 1.914 ± 0.672
GlyGly: 2.734 ± 0.957
GlyHis: 1.094 ± 0.415
GlyIle: 4.649 ± 1.109
GlyLys: 4.375 ± 1.213
GlyLeu: 5.469 ± 1.23
GlyMet: 1.914 ± 0.639
GlyAsn: 2.734 ± 0.61
GlyPro: 0.273 ± 0.241
GlyGln: 1.914 ± 0.558
GlyArg: 3.008 ± 0.741
GlySer: 1.641 ± 0.591
GlyThr: 2.461 ± 0.661
GlyVal: 4.922 ± 1.18
GlyTrp: 1.094 ± 0.703
GlyTyr: 3.008 ± 0.998
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.82 ± 0.722
HisCys: 0.0 ± 0.0
HisAsp: 0.547 ± 0.379
HisGlu: 1.094 ± 0.582
HisPhe: 1.641 ± 0.671
HisGly: 0.547 ± 0.357
HisHis: 0.273 ± 0.276
HisIle: 1.094 ± 0.657
HisLys: 1.094 ± 0.54
HisLeu: 1.367 ± 0.616
HisMet: 0.0 ± 0.0
HisAsn: 1.641 ± 0.897
HisPro: 0.273 ± 0.225
HisGln: 0.273 ± 0.332
HisArg: 1.094 ± 0.576
HisSer: 1.094 ± 0.459
HisThr: 1.094 ± 0.563
HisVal: 0.547 ± 0.298
HisTrp: 0.0 ± 0.0
HisTyr: 0.82 ± 0.447
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.375 ± 1.05
IleCys: 0.273 ± 0.229
IleAsp: 4.375 ± 1.416
IleGlu: 6.836 ± 1.545
IlePhe: 4.649 ± 1.101
IleGly: 3.555 ± 1.039
IleHis: 0.547 ± 0.275
IleIle: 3.555 ± 0.869
IleLys: 9.297 ± 1.508
IleLeu: 3.828 ± 1.111
IleMet: 0.273 ± 0.312
IleAsn: 2.461 ± 1.028
IlePro: 2.734 ± 1.017
IleGln: 3.281 ± 0.848
IleArg: 2.188 ± 0.935
IleSer: 6.563 ± 1.882
IleThr: 2.461 ± 0.737
IleVal: 2.734 ± 0.662
IleTrp: 0.82 ± 0.525
IleTyr: 3.008 ± 0.734
IleXaa: 0.0 ± 0.0
Lys
LysAla: 9.024 ± 1.926
LysCys: 0.273 ± 0.209
LysAsp: 3.008 ± 0.923
LysGlu: 11.211 ± 1.135
LysPhe: 2.461 ± 0.931
LysGly: 4.102 ± 0.809
LysHis: 3.008 ± 0.968
LysIle: 4.922 ± 0.898
LysLys: 8.203 ± 1.384
LysLeu: 9.571 ± 1.396
LysMet: 3.008 ± 1.037
LysAsn: 7.383 ± 1.281
LysPro: 3.008 ± 1.107
LysGln: 3.555 ± 0.852
LysArg: 6.289 ± 1.453
LysSer: 5.196 ± 0.987
LysThr: 5.742 ± 1.331
LysVal: 4.375 ± 0.889
LysTrp: 0.82 ± 0.447
LysTyr: 3.008 ± 1.065
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.836 ± 1.341
LeuCys: 0.82 ± 0.589
LeuAsp: 9.297 ± 1.542
LeuGlu: 13.672 ± 2.695
LeuPhe: 4.102 ± 1.175
LeuGly: 4.375 ± 1.249
LeuHis: 0.547 ± 0.391
LeuIle: 4.102 ± 0.913
LeuLys: 7.93 ± 1.405
LeuLeu: 8.477 ± 1.148
LeuMet: 1.914 ± 0.657
LeuAsn: 5.742 ± 1.181
LeuPro: 3.828 ± 0.944
LeuGln: 3.828 ± 0.908
LeuArg: 3.555 ± 0.999
LeuSer: 5.196 ± 1.343
LeuThr: 6.563 ± 1.525
LeuVal: 2.461 ± 0.894
LeuTrp: 0.82 ± 0.445
LeuTyr: 4.375 ± 0.894
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.461 ± 0.882
MetCys: 0.0 ± 0.0
MetAsp: 1.914 ± 0.719
MetGlu: 3.281 ± 0.988
MetPhe: 0.82 ± 0.435
MetGly: 2.461 ± 0.635
MetHis: 0.0 ± 0.0
MetIle: 0.547 ± 0.37
MetLys: 2.461 ± 0.568
MetLeu: 1.367 ± 0.584
MetMet: 0.273 ± 0.269
MetAsn: 1.641 ± 0.716
MetPro: 1.367 ± 0.652
MetGln: 1.094 ± 0.743
MetArg: 1.094 ± 0.639
MetSer: 1.094 ± 0.534
MetThr: 2.734 ± 0.99
MetVal: 1.641 ± 0.661
MetTrp: 0.273 ± 0.229
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.008 ± 0.925
AsnCys: 0.0 ± 0.0
AsnAsp: 3.008 ± 1.022
AsnGlu: 2.734 ± 0.713
AsnPhe: 2.188 ± 0.802
AsnGly: 5.469 ± 1.12
AsnHis: 0.82 ± 0.501
AsnIle: 3.008 ± 0.944
AsnLys: 6.289 ± 1.283
AsnLeu: 4.649 ± 1.098
AsnMet: 1.641 ± 0.712
AsnAsn: 2.461 ± 0.707
AsnPro: 2.734 ± 0.717
AsnGln: 2.461 ± 0.977
AsnArg: 1.641 ± 0.611
AsnSer: 3.281 ± 0.992
AsnThr: 4.102 ± 1.148
AsnVal: 1.914 ± 0.884
AsnTrp: 0.273 ± 0.241
AsnTyr: 2.461 ± 1.047
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.641 ± 0.696
ProCys: 0.273 ± 0.229
ProAsp: 1.914 ± 0.729
ProGlu: 3.828 ± 0.921
ProPhe: 1.641 ± 0.7
ProGly: 0.273 ± 0.305
ProHis: 0.0 ± 0.0
ProIle: 1.914 ± 0.504
ProLys: 2.461 ± 0.866
ProLeu: 1.641 ± 0.508
ProMet: 0.273 ± 0.229
ProAsn: 1.641 ± 0.66
ProPro: 1.641 ± 0.638
ProGln: 0.547 ± 0.298
ProArg: 4.375 ± 0.908
ProSer: 1.094 ± 0.47
ProThr: 1.094 ± 0.491
ProVal: 1.641 ± 0.767
ProTrp: 0.0 ± 0.0
ProTyr: 1.094 ± 0.467
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.555 ± 0.915
GlnCys: 0.0 ± 0.0
GlnAsp: 1.914 ± 0.821
GlnGlu: 4.649 ± 1.183
GlnPhe: 0.82 ± 0.492
GlnGly: 2.734 ± 0.969
GlnHis: 0.82 ± 0.438
GlnIle: 1.641 ± 0.714
GlnLys: 4.649 ± 0.942
GlnLeu: 4.649 ± 1.13
GlnMet: 1.367 ± 0.76
GlnAsn: 0.82 ± 0.515
GlnPro: 0.82 ± 0.484
GlnGln: 1.914 ± 0.849
GlnArg: 1.641 ± 0.708
GlnSer: 1.641 ± 0.742
GlnThr: 2.461 ± 0.672
GlnVal: 3.008 ± 0.717
GlnTrp: 0.547 ± 0.301
GlnTyr: 0.547 ± 0.346
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.008 ± 0.774
ArgCys: 0.273 ± 0.315
ArgAsp: 2.461 ± 0.966
ArgGlu: 4.102 ± 1.023
ArgPhe: 3.828 ± 0.848
ArgGly: 3.008 ± 1.108
ArgHis: 1.641 ± 0.599
ArgIle: 4.375 ± 0.898
ArgLys: 5.196 ± 1.03
ArgLeu: 7.11 ± 1.383
ArgMet: 1.094 ± 0.538
ArgAsn: 2.734 ± 0.885
ArgPro: 0.82 ± 0.483
ArgGln: 3.281 ± 0.983
ArgArg: 2.461 ± 0.836
ArgSer: 1.914 ± 0.606
ArgThr: 3.281 ± 0.758
ArgVal: 1.641 ± 0.62
ArgTrp: 0.273 ± 0.229
ArgTyr: 3.008 ± 0.942
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.188 ± 0.77
SerCys: 0.547 ± 0.437
SerAsp: 1.914 ± 0.582
SerGlu: 4.649 ± 1.662
SerPhe: 1.914 ± 0.728
SerGly: 3.555 ± 0.858
SerHis: 0.82 ± 0.434
SerIle: 5.196 ± 1.185
SerLys: 6.016 ± 1.197
SerLeu: 4.375 ± 0.902
SerMet: 1.367 ± 0.606
SerAsn: 3.828 ± 1.423
SerPro: 1.641 ± 0.569
SerGln: 2.188 ± 0.775
SerArg: 2.461 ± 0.966
SerSer: 3.008 ± 1.019
SerThr: 1.914 ± 0.689
SerVal: 4.102 ± 0.902
SerTrp: 0.0 ± 0.0
SerTyr: 2.461 ± 0.905
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.188 ± 0.812
ThrCys: 0.273 ± 0.316
ThrAsp: 1.641 ± 0.63
ThrGlu: 3.008 ± 0.784
ThrPhe: 2.188 ± 0.523
ThrGly: 3.008 ± 0.909
ThrHis: 1.367 ± 0.46
ThrIle: 3.828 ± 1.255
ThrLys: 5.196 ± 1.131
ThrLeu: 6.016 ± 1.093
ThrMet: 2.188 ± 0.847
ThrAsn: 0.273 ± 0.332
ThrPro: 1.914 ± 0.671
ThrGln: 2.734 ± 0.846
ThrArg: 2.461 ± 0.797
ThrSer: 3.008 ± 0.838
ThrThr: 3.828 ± 0.874
ThrVal: 5.196 ± 1.475
ThrTrp: 0.547 ± 0.366
ThrTyr: 3.008 ± 1.291
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.008 ± 1.101
ValCys: 0.547 ± 0.39
ValAsp: 2.461 ± 0.791
ValGlu: 6.289 ± 1.519
ValPhe: 1.094 ± 0.547
ValGly: 2.734 ± 1.011
ValHis: 0.0 ± 0.0
ValIle: 2.461 ± 0.836
ValLys: 5.742 ± 1.26
ValLeu: 4.922 ± 1.133
ValMet: 1.367 ± 0.633
ValAsn: 3.281 ± 0.689
ValPro: 1.914 ± 0.622
ValGln: 1.094 ± 0.443
ValArg: 2.461 ± 0.902
ValSer: 4.649 ± 0.906
ValThr: 2.188 ± 0.661
ValVal: 4.102 ± 1.036
ValTrp: 0.547 ± 0.348
ValTyr: 1.641 ± 0.724
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.273 ± 0.241
TrpCys: 0.0 ± 0.0
TrpAsp: 0.547 ± 0.371
TrpGlu: 0.82 ± 0.484
TrpPhe: 0.273 ± 0.229
TrpGly: 0.82 ± 0.469
TrpHis: 0.273 ± 0.229
TrpIle: 0.547 ± 0.417
TrpLys: 0.273 ± 0.337
TrpLeu: 0.82 ± 0.373
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.273 ± 0.229
TrpArg: 0.547 ± 0.275
TrpSer: 0.547 ± 0.305
TrpThr: 0.273 ± 0.209
TrpVal: 0.82 ± 0.492
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.547 ± 0.275
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.188 ± 0.871
TyrCys: 0.547 ± 0.305
TyrAsp: 2.188 ± 0.62
TyrGlu: 1.914 ± 0.657
TyrPhe: 1.367 ± 0.64
TyrGly: 1.367 ± 0.649
TyrHis: 0.82 ± 0.402
TyrIle: 3.281 ± 1.138
TyrLys: 5.742 ± 1.32
TyrLeu: 6.563 ± 1.056
TyrMet: 0.273 ± 0.314
TyrAsn: 2.461 ± 0.462
TyrPro: 1.914 ± 0.801
TyrGln: 1.094 ± 0.506
TyrArg: 3.555 ± 0.748
TyrSer: 2.188 ± 0.701
TyrThr: 1.367 ± 0.637
TyrVal: 1.367 ± 0.558
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 26 proteins (3658 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
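One plausible reading of protein-level bootstrapping is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. Using the sample standard deviation as the error measure, the function names, the fixed seed and the toy sequences are assumptions for this example; the note above does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency (per mille) by
    resampling whole proteins with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the Javan741 proteome.
toy = ["MKLEELKK", "MLEK", "MKKLLE"]
print(bootstrap_error(toy, "LE"))
print(bootstrap_error(toy, "CC"))  # absent dipeptide: every replicate is 0
```

Resampling at the protein level rather than the residue level keeps each sequence intact, so the estimate reflects variation between proteins. A dipeptide that never occurs yields an error of zero, which matches the 0.0 ± 0.0 entries in the table.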

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski