Amino acid dipepetide frequency for Streptococcus satellite phage Javan120

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.112AlaAla: 1.112 ± 0.559
0.556AlaCys: 0.556 ± 0.33
3.335AlaAsp: 3.335 ± 1.071
5.003AlaGlu: 5.003 ± 1.257
3.613AlaPhe: 3.613 ± 0.93
2.779AlaGly: 2.779 ± 0.779
0.278AlaHis: 0.278 ± 0.228
6.392AlaIle: 6.392 ± 1.334
4.725AlaLys: 4.725 ± 1.164
5.003AlaLeu: 5.003 ± 1.4
2.223AlaMet: 2.223 ± 0.757
4.725AlaAsn: 4.725 ± 1.308
1.668AlaPro: 1.668 ± 0.61
3.057AlaGln: 3.057 ± 0.751
1.668AlaArg: 1.668 ± 0.558
4.169AlaSer: 4.169 ± 0.941
4.169AlaThr: 4.169 ± 0.91
2.501AlaVal: 2.501 ± 0.774
0.834AlaTrp: 0.834 ± 0.695
1.946AlaTyr: 1.946 ± 0.664
0.0AlaXaa: 0.0 ± 0.0
Cys
1.39CysAla: 1.39 ± 0.6
0.278CysCys: 0.278 ± 0.258
0.556CysAsp: 0.556 ± 0.336
0.278CysGlu: 0.278 ± 0.258
0.0CysPhe: 0.0 ± 0.0
0.278CysGly: 0.278 ± 0.258
0.278CysHis: 0.278 ± 0.232
0.556CysIle: 0.556 ± 0.407
0.278CysLys: 0.278 ± 0.3
0.834CysLeu: 0.834 ± 0.441
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.278CysPro: 0.278 ± 0.258
0.556CysGln: 0.556 ± 0.31
0.556CysArg: 0.556 ± 0.314
0.278CysSer: 0.278 ± 0.306
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.556CysTyr: 0.556 ± 0.458
0.0CysXaa: 0.0 ± 0.0
Asp
1.668AspAla: 1.668 ± 0.589
1.39AspCys: 1.39 ± 0.622
3.613AspAsp: 3.613 ± 0.941
3.335AspGlu: 3.335 ± 1.17
1.668AspPhe: 1.668 ± 0.45
1.39AspGly: 1.39 ± 0.575
0.556AspHis: 0.556 ± 0.294
7.504AspIle: 7.504 ± 0.93
4.447AspLys: 4.447 ± 0.857
6.115AspLeu: 6.115 ± 1.186
2.223AspMet: 2.223 ± 0.716
1.946AspAsn: 1.946 ± 0.663
0.834AspPro: 0.834 ± 0.379
1.39AspGln: 1.39 ± 0.653
3.891AspArg: 3.891 ± 0.846
1.946AspSer: 1.946 ± 0.598
4.169AspThr: 4.169 ± 1.189
1.112AspVal: 1.112 ± 0.526
0.278AspTrp: 0.278 ± 0.345
4.169AspTyr: 4.169 ± 1.18
0.0AspXaa: 0.0 ± 0.0
Glu
5.837GluAla: 5.837 ± 1.269
1.39GluCys: 1.39 ± 0.743
4.447GluAsp: 4.447 ± 1.138
5.281GluGlu: 5.281 ± 1.033
2.223GluPhe: 2.223 ± 0.747
2.223GluGly: 2.223 ± 0.783
2.779GluHis: 2.779 ± 0.693
6.115GluIle: 6.115 ± 1.396
6.115GluLys: 6.115 ± 0.938
10.839GluLeu: 10.839 ± 1.356
1.946GluMet: 1.946 ± 0.626
3.891GluAsn: 3.891 ± 1.073
1.668GluPro: 1.668 ± 0.513
5.559GluGln: 5.559 ± 1.357
5.281GluArg: 5.281 ± 1.126
1.112GluSer: 1.112 ± 0.481
4.169GluThr: 4.169 ± 1.343
3.335GluVal: 3.335 ± 0.72
1.112GluTrp: 1.112 ± 0.571
4.169GluTyr: 4.169 ± 0.992
0.0GluXaa: 0.0 ± 0.0
Phe
1.668PheAla: 1.668 ± 0.735
0.0PheCys: 0.0 ± 0.0
3.335PheAsp: 3.335 ± 0.625
3.335PheGlu: 3.335 ± 0.747
2.223PhePhe: 2.223 ± 0.922
2.779PheGly: 2.779 ± 0.696
1.668PheHis: 1.668 ± 0.494
3.891PheIle: 3.891 ± 1.072
4.725PheLys: 4.725 ± 0.803
3.057PheLeu: 3.057 ± 0.77
0.278PheMet: 0.278 ± 0.232
2.223PheAsn: 2.223 ± 0.684
0.834PhePro: 0.834 ± 0.404
0.556PheGln: 0.556 ± 0.349
2.779PheArg: 2.779 ± 0.834
3.057PheSer: 3.057 ± 0.706
2.779PheThr: 2.779 ± 0.622
1.668PheVal: 1.668 ± 0.561
0.278PheTrp: 0.278 ± 0.228
1.39PheTyr: 1.39 ± 0.485
0.0PheXaa: 0.0 ± 0.0
Gly
2.223GlyAla: 2.223 ± 1.045
0.278GlyCys: 0.278 ± 0.206
3.057GlyAsp: 3.057 ± 1.036
2.223GlyGlu: 2.223 ± 0.63
3.057GlyPhe: 3.057 ± 0.846
2.779GlyGly: 2.779 ± 0.996
0.556GlyHis: 0.556 ± 0.306
2.779GlyIle: 2.779 ± 0.871
3.613GlyLys: 3.613 ± 0.873
6.67GlyLeu: 6.67 ± 1.349
1.112GlyMet: 1.112 ± 0.428
1.112GlyAsn: 1.112 ± 0.466
0.0GlyPro: 0.0 ± 0.0
2.779GlyGln: 2.779 ± 1.062
3.335GlyArg: 3.335 ± 1.005
1.946GlySer: 1.946 ± 0.683
3.613GlyThr: 3.613 ± 0.895
3.891GlyVal: 3.891 ± 0.953
0.278GlyTrp: 0.278 ± 0.228
2.501GlyTyr: 2.501 ± 0.545
0.0GlyXaa: 0.0 ± 0.0
His
2.223HisAla: 2.223 ± 0.991
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.556HisGlu: 0.556 ± 0.314
0.556HisPhe: 0.556 ± 0.539
1.112HisGly: 1.112 ± 0.462
0.556HisHis: 0.556 ± 0.297
0.556HisIle: 0.556 ± 0.415
2.223HisLys: 2.223 ± 0.731
2.779HisLeu: 2.779 ± 0.815
0.556HisMet: 0.556 ± 0.356
1.946HisAsn: 1.946 ± 0.691
1.39HisPro: 1.39 ± 0.56
1.39HisGln: 1.39 ± 0.791
0.278HisArg: 0.278 ± 0.342
0.834HisSer: 0.834 ± 0.408
1.39HisThr: 1.39 ± 0.431
0.834HisVal: 0.834 ± 0.479
0.556HisTrp: 0.556 ± 0.31
1.39HisTyr: 1.39 ± 0.465
0.0HisXaa: 0.0 ± 0.0
Ile
6.67IleAla: 6.67 ± 1.149
0.278IleCys: 0.278 ± 0.232
6.67IleAsp: 6.67 ± 1.455
5.003IleGlu: 5.003 ± 1.385
2.223IlePhe: 2.223 ± 0.584
1.946IleGly: 1.946 ± 0.684
0.834IleHis: 0.834 ± 0.426
5.003IleIle: 5.003 ± 1.259
8.338IleLys: 8.338 ± 1.523
4.169IleLeu: 4.169 ± 0.876
1.668IleMet: 1.668 ± 0.646
4.169IleAsn: 4.169 ± 0.983
3.057IlePro: 3.057 ± 0.68
1.668IleGln: 1.668 ± 0.738
3.057IleArg: 3.057 ± 0.708
4.447IleSer: 4.447 ± 1.113
4.725IleThr: 4.725 ± 1.182
3.613IleVal: 3.613 ± 0.731
0.0IleTrp: 0.0 ± 0.0
3.613IleTyr: 3.613 ± 1.082
0.0IleXaa: 0.0 ± 0.0
Lys
7.504LysAla: 7.504 ± 1.239
0.0LysCys: 0.0 ± 0.0
5.003LysAsp: 5.003 ± 1.183
10.283LysGlu: 10.283 ± 1.571
1.946LysPhe: 1.946 ± 0.718
4.169LysGly: 4.169 ± 1.461
3.335LysHis: 3.335 ± 0.703
2.779LysIle: 2.779 ± 1.105
6.948LysLys: 6.948 ± 1.594
6.115LysLeu: 6.115 ± 1.276
2.501LysMet: 2.501 ± 0.95
4.447LysAsn: 4.447 ± 1.052
5.281LysPro: 5.281 ± 1.294
3.891LysGln: 3.891 ± 1.278
5.281LysArg: 5.281 ± 1.151
3.891LysSer: 3.891 ± 1.063
5.837LysThr: 5.837 ± 1.211
4.447LysVal: 4.447 ± 1.045
0.278LysTrp: 0.278 ± 0.244
1.946LysTyr: 1.946 ± 0.676
0.0LysXaa: 0.0 ± 0.0
Leu
7.504LeuAla: 7.504 ± 1.101
0.278LeuCys: 0.278 ± 0.258
4.169LeuAsp: 4.169 ± 1.108
11.673LeuGlu: 11.673 ± 1.502
3.613LeuPhe: 3.613 ± 0.786
6.67LeuGly: 6.67 ± 1.012
1.668LeuHis: 1.668 ± 0.589
7.782LeuIle: 7.782 ± 1.44
8.338LeuLys: 8.338 ± 1.288
10.839LeuLeu: 10.839 ± 1.528
1.946LeuMet: 1.946 ± 0.594
5.281LeuAsn: 5.281 ± 1.522
4.725LeuPro: 4.725 ± 1.188
3.891LeuGln: 3.891 ± 0.839
1.668LeuArg: 1.668 ± 0.551
6.392LeuSer: 6.392 ± 1.471
4.725LeuThr: 4.725 ± 0.718
5.003LeuVal: 5.003 ± 1.316
0.556LeuTrp: 0.556 ± 0.357
4.725LeuTyr: 4.725 ± 0.769
0.0LeuXaa: 0.0 ± 0.0
Met
3.335MetAla: 3.335 ± 1.122
0.0MetCys: 0.0 ± 0.0
1.946MetAsp: 1.946 ± 0.601
0.834MetGlu: 0.834 ± 0.396
0.834MetPhe: 0.834 ± 0.408
0.556MetGly: 0.556 ± 0.304
0.556MetHis: 0.556 ± 0.337
1.39MetIle: 1.39 ± 0.692
1.39MetLys: 1.39 ± 0.567
3.057MetLeu: 3.057 ± 1.035
0.278MetMet: 0.278 ± 0.318
2.779MetAsn: 2.779 ± 0.59
0.278MetPro: 0.278 ± 0.228
0.278MetGln: 0.278 ± 0.318
1.946MetArg: 1.946 ± 0.679
1.946MetSer: 1.946 ± 0.794
3.891MetThr: 3.891 ± 0.896
0.556MetVal: 0.556 ± 0.406
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.057AsnAla: 3.057 ± 0.652
0.0AsnCys: 0.0 ± 0.0
1.946AsnAsp: 1.946 ± 0.782
3.335AsnGlu: 3.335 ± 0.865
1.39AsnPhe: 1.39 ± 0.696
5.003AsnGly: 5.003 ± 0.855
1.39AsnHis: 1.39 ± 0.52
1.946AsnIle: 1.946 ± 0.938
5.003AsnLys: 5.003 ± 0.98
5.003AsnLeu: 5.003 ± 1.325
1.39AsnMet: 1.39 ± 0.524
2.779AsnAsn: 2.779 ± 0.951
2.501AsnPro: 2.501 ± 0.64
3.057AsnGln: 3.057 ± 1.058
5.281AsnArg: 5.281 ± 0.817
2.223AsnSer: 2.223 ± 0.772
3.891AsnThr: 3.891 ± 1.227
1.946AsnVal: 1.946 ± 0.599
0.556AsnTrp: 0.556 ± 0.385
2.223AsnTyr: 2.223 ± 0.611
0.0AsnXaa: 0.0 ± 0.0
Pro
1.39ProAla: 1.39 ± 0.48
0.278ProCys: 0.278 ± 0.306
1.946ProAsp: 1.946 ± 0.775
4.169ProGlu: 4.169 ± 0.972
2.501ProPhe: 2.501 ± 0.602
0.556ProGly: 0.556 ± 0.337
0.556ProHis: 0.556 ± 0.337
1.668ProIle: 1.668 ± 0.588
5.837ProLys: 5.837 ± 1.279
1.946ProLeu: 1.946 ± 0.727
0.278ProMet: 0.278 ± 0.258
1.946ProAsn: 1.946 ± 0.707
1.39ProPro: 1.39 ± 0.489
0.556ProGln: 0.556 ± 0.383
1.946ProArg: 1.946 ± 0.614
1.112ProSer: 1.112 ± 0.5
2.223ProThr: 2.223 ± 0.679
1.946ProVal: 1.946 ± 0.674
0.0ProTrp: 0.0 ± 0.0
1.39ProTyr: 1.39 ± 0.641
0.0ProXaa: 0.0 ± 0.0
Gln
2.501GlnAla: 2.501 ± 0.571
0.278GlnCys: 0.278 ± 0.296
3.057GlnAsp: 3.057 ± 0.742
3.613GlnGlu: 3.613 ± 0.853
0.834GlnPhe: 0.834 ± 0.48
2.223GlnGly: 2.223 ± 0.606
1.112GlnHis: 1.112 ± 0.6
2.779GlnIle: 2.779 ± 0.608
4.447GlnLys: 4.447 ± 1.092
7.504GlnLeu: 7.504 ± 1.037
1.112GlnMet: 1.112 ± 0.632
2.223GlnAsn: 2.223 ± 0.721
1.112GlnPro: 1.112 ± 0.613
2.779GlnGln: 2.779 ± 1.002
3.613GlnArg: 3.613 ± 0.82
1.946GlnSer: 1.946 ± 0.839
1.39GlnThr: 1.39 ± 0.636
2.501GlnVal: 2.501 ± 1.051
0.278GlnTrp: 0.278 ± 0.269
1.39GlnTyr: 1.39 ± 0.575
0.0GlnXaa: 0.0 ± 0.0
Arg
2.501ArgAla: 2.501 ± 0.798
1.112ArgCys: 1.112 ± 0.476
2.223ArgAsp: 2.223 ± 0.641
3.057ArgGlu: 3.057 ± 0.713
3.057ArgPhe: 3.057 ± 0.965
2.501ArgGly: 2.501 ± 0.975
0.556ArgHis: 0.556 ± 0.333
2.501ArgIle: 2.501 ± 0.877
4.725ArgLys: 4.725 ± 0.843
5.559ArgLeu: 5.559 ± 1.144
2.223ArgMet: 2.223 ± 0.737
2.501ArgAsn: 2.501 ± 0.936
1.112ArgPro: 1.112 ± 0.524
4.169ArgGln: 4.169 ± 0.975
3.891ArgArg: 3.891 ± 1.145
1.946ArgSer: 1.946 ± 0.855
3.891ArgThr: 3.891 ± 0.839
3.891ArgVal: 3.891 ± 1.039
0.556ArgTrp: 0.556 ± 0.431
3.335ArgTyr: 3.335 ± 1.094
0.0ArgXaa: 0.0 ± 0.0
Ser
1.668SerAla: 1.668 ± 0.779
0.278SerCys: 0.278 ± 0.258
3.891SerAsp: 3.891 ± 1.236
4.447SerGlu: 4.447 ± 0.975
3.335SerPhe: 3.335 ± 1.093
1.112SerGly: 1.112 ± 0.532
0.556SerHis: 0.556 ± 0.314
5.003SerIle: 5.003 ± 1.133
4.169SerLys: 4.169 ± 1.005
6.67SerLeu: 6.67 ± 1.09
0.556SerMet: 0.556 ± 0.357
2.501SerAsn: 2.501 ± 1.026
0.556SerPro: 0.556 ± 0.395
2.779SerGln: 2.779 ± 0.77
2.223SerArg: 2.223 ± 0.648
1.946SerSer: 1.946 ± 0.642
2.223SerThr: 2.223 ± 0.638
3.335SerVal: 3.335 ± 1.142
0.556SerTrp: 0.556 ± 0.429
2.779SerTyr: 2.779 ± 0.954
0.0SerXaa: 0.0 ± 0.0
Thr
3.613ThrAla: 3.613 ± 1.197
0.0ThrCys: 0.0 ± 0.0
1.946ThrAsp: 1.946 ± 0.623
5.559ThrGlu: 5.559 ± 1.19
4.169ThrPhe: 4.169 ± 1.157
4.725ThrGly: 4.725 ± 0.915
0.834ThrHis: 0.834 ± 0.503
5.281ThrIle: 5.281 ± 1.296
1.946ThrLys: 1.946 ± 0.624
5.281ThrLeu: 5.281 ± 1.277
2.501ThrMet: 2.501 ± 0.762
1.946ThrAsn: 1.946 ± 0.663
3.891ThrPro: 3.891 ± 1.061
2.779ThrGln: 2.779 ± 0.771
2.779ThrArg: 2.779 ± 0.63
3.335ThrSer: 3.335 ± 1.036
3.335ThrThr: 3.335 ± 0.863
3.335ThrVal: 3.335 ± 0.866
1.112ThrTrp: 1.112 ± 0.464
2.501ThrTyr: 2.501 ± 1.029
0.0ThrXaa: 0.0 ± 0.0
Val
3.057ValAla: 3.057 ± 0.69
0.278ValCys: 0.278 ± 0.3
0.556ValAsp: 0.556 ± 0.354
3.613ValGlu: 3.613 ± 1.187
2.501ValPhe: 2.501 ± 0.675
3.057ValGly: 3.057 ± 0.836
0.834ValHis: 0.834 ± 0.378
4.725ValIle: 4.725 ± 1.28
3.891ValLys: 3.891 ± 0.895
4.447ValLeu: 4.447 ± 0.946
1.112ValMet: 1.112 ± 0.477
3.057ValAsn: 3.057 ± 0.878
0.834ValPro: 0.834 ± 0.371
2.223ValGln: 2.223 ± 0.737
1.946ValArg: 1.946 ± 0.573
5.003ValSer: 5.003 ± 1.195
2.223ValThr: 2.223 ± 0.903
3.335ValVal: 3.335 ± 0.888
0.556ValTrp: 0.556 ± 0.456
2.223ValTyr: 2.223 ± 0.59
0.0ValXaa: 0.0 ± 0.0
Trp
0.278TrpAla: 0.278 ± 0.309
0.0TrpCys: 0.0 ± 0.0
0.556TrpAsp: 0.556 ± 0.381
0.834TrpGlu: 0.834 ± 0.465
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.278TrpIle: 0.278 ± 0.244
1.112TrpLys: 1.112 ± 0.498
1.946TrpLeu: 1.946 ± 0.654
0.0TrpMet: 0.0 ± 0.0
0.278TrpAsn: 0.278 ± 0.309
0.278TrpPro: 0.278 ± 0.228
0.278TrpGln: 0.278 ± 0.342
0.556TrpArg: 0.556 ± 0.426
0.834TrpSer: 0.834 ± 0.402
0.0TrpThr: 0.0 ± 0.0
0.834TrpVal: 0.834 ± 0.454
0.278TrpTrp: 0.278 ± 0.309
0.278TrpTyr: 0.278 ± 0.342
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.556TyrAla: 0.556 ± 0.294
0.278TyrCys: 0.278 ± 0.244
1.668TyrAsp: 1.668 ± 0.607
3.057TyrGlu: 3.057 ± 0.933
2.779TyrPhe: 2.779 ± 0.771
1.946TyrGly: 1.946 ± 0.708
2.223TyrHis: 2.223 ± 0.939
2.223TyrIle: 2.223 ± 0.713
3.613TyrLys: 3.613 ± 0.981
3.613TyrLeu: 3.613 ± 0.844
1.668TyrMet: 1.668 ± 0.676
4.169TyrAsn: 4.169 ± 0.849
1.946TyrPro: 1.946 ± 0.751
3.057TyrGln: 3.057 ± 0.972
3.335TyrArg: 3.335 ± 1.151
2.223TyrSer: 2.223 ± 0.62
2.223TyrThr: 2.223 ± 0.568
1.39TyrVal: 1.39 ± 0.566
0.556TyrTrp: 0.556 ± 0.517
2.779TyrTyr: 2.779 ± 0.876
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 25 proteins (3599 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski