Amino acid dipeptide frequency for Streptococcus satellite phage Javan544

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
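As a minimal sketch of how such a frequency could be computed, the Python snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input sequences, and the choice to normalise by the total number of counted pairs are illustrative assumptions, not the Proteome-pI implementation.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted with a sliding window of width 2 inside
    each protein (no pair spans two proteins) and normalised by the total
    number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every overlapping adjacent pair
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the Javan544 proteome
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
```

With the toy input, six pairs are counted in total and `KK` occurs twice, so `freqs["KK"]` is one third of 1000.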

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
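The uniform expectation and the over/under marking can be illustrated with a short sketch. The names `EXPECTED_PER_MILLE` and `classify` are hypothetical, and the plain greater-than/less-than comparison against the uniform expectation is an assumption about how the colouring is decided; the three sample values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids, ignoring Xaa

# Expected per-mille frequency if all 20 x 20 dipeptides were equally likely
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille, i.e. 0.25%


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


samples = {"GluLeu": 12.437, "AlaAla": 5.714, "AlaCys": 0.672}
labels = {pair: classify(value) for pair, value in samples.items()}
```

Here GluLeu and AlaAla come out as overrepresented and AlaCys as underrepresented.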

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.714 ± 1.035
AlaCys: 0.672 ± 0.608
AlaAsp: 3.697 ± 1.459
AlaGlu: 6.387 ± 1.419
AlaPhe: 1.345 ± 0.597
AlaGly: 6.05 ± 1.254
AlaHis: 1.008 ± 0.607
AlaIle: 6.723 ± 1.831
AlaLys: 6.387 ± 1.445
AlaLeu: 7.731 ± 1.45
AlaMet: 1.345 ± 0.732
AlaAsn: 2.017 ± 0.683
AlaPro: 1.345 ± 0.594
AlaGln: 4.034 ± 1.021
AlaArg: 2.017 ± 0.716
AlaSer: 4.034 ± 0.812
AlaThr: 2.689 ± 0.605
AlaVal: 3.025 ± 0.904
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.017 ± 0.76
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.345 ± 0.69
CysGlu: 1.008 ± 0.502
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.336 ± 0.307
CysIle: 0.336 ± 0.376
CysLys: 1.008 ± 0.624
CysLeu: 1.008 ± 0.633
CysMet: 0.336 ± 0.304
CysAsn: 0.672 ± 0.452
CysPro: 0.0 ± 0.0
CysGln: 0.672 ± 0.806
CysArg: 1.008 ± 0.403
CysSer: 0.672 ± 0.588
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.336 ± 0.273
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.706 ± 1.409
AspCys: 0.672 ± 0.547
AspAsp: 5.042 ± 1.722
AspGlu: 3.361 ± 1.144
AspPhe: 2.689 ± 0.852
AspGly: 3.025 ± 0.866
AspHis: 0.672 ± 0.498
AspIle: 3.697 ± 0.844
AspLys: 4.706 ± 1.229
AspLeu: 4.37 ± 1.216
AspMet: 1.681 ± 0.724
AspAsn: 2.353 ± 0.612
AspPro: 1.008 ± 0.377
AspGln: 1.345 ± 1.153
AspArg: 3.025 ± 1.054
AspSer: 2.689 ± 0.93
AspThr: 1.681 ± 0.736
AspVal: 4.706 ± 0.829
AspTrp: 0.336 ± 0.39
AspTyr: 2.353 ± 0.82
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.042 ± 1.703
GluCys: 1.008 ± 0.662
GluAsp: 4.034 ± 1.254
GluGlu: 8.403 ± 1.872
GluPhe: 3.697 ± 0.684
GluGly: 3.025 ± 0.942
GluHis: 0.336 ± 0.367
GluIle: 7.059 ± 1.786
GluLys: 9.076 ± 2.421
GluLeu: 12.437 ± 1.719
GluMet: 0.672 ± 0.388
GluAsn: 2.017 ± 0.793
GluPro: 3.025 ± 0.84
GluGln: 3.361 ± 1.177
GluArg: 2.689 ± 1.008
GluSer: 4.034 ± 1.096
GluThr: 5.378 ± 0.812
GluVal: 3.697 ± 1.176
GluTrp: 1.345 ± 0.443
GluTyr: 2.017 ± 0.68
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.689 ± 0.623
PheCys: 0.336 ± 0.294
PheAsp: 2.353 ± 1.018
PheGlu: 3.025 ± 0.828
PhePhe: 0.672 ± 0.547
PheGly: 1.345 ± 0.383
PheHis: 0.0 ± 0.0
PheIle: 2.689 ± 1.151
PheLys: 3.361 ± 1.538
PheLeu: 5.042 ± 1.291
PheMet: 0.336 ± 0.304
PheAsn: 1.345 ± 0.76
PhePro: 0.672 ± 0.388
PheGln: 1.681 ± 0.943
PheArg: 2.353 ± 0.68
PheSer: 3.025 ± 1.024
PheThr: 5.042 ± 1.432
PheVal: 4.034 ± 0.98
PheTrp: 0.336 ± 0.273
PheTyr: 2.017 ± 0.797
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.681 ± 0.46
GlyCys: 0.672 ± 0.435
GlyAsp: 1.681 ± 1.152
GlyGlu: 3.361 ± 0.795
GlyPhe: 3.697 ± 1.148
GlyGly: 3.697 ± 1.128
GlyHis: 1.008 ± 0.603
GlyIle: 3.697 ± 1.02
GlyLys: 6.05 ± 1.48
GlyLeu: 4.37 ± 1.312
GlyMet: 1.681 ± 0.582
GlyAsn: 2.353 ± 1.032
GlyPro: 0.0 ± 0.0
GlyGln: 3.697 ± 1.337
GlyArg: 3.025 ± 0.987
GlySer: 3.025 ± 1.167
GlyThr: 2.353 ± 0.616
GlyVal: 3.361 ± 0.668
GlyTrp: 1.008 ± 0.722
GlyTyr: 2.689 ± 0.755
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.017 ± 1.068
HisCys: 0.0 ± 0.0
HisAsp: 0.336 ± 0.294
HisGlu: 0.672 ± 0.528
HisPhe: 1.008 ± 0.581
HisGly: 1.008 ± 0.494
HisHis: 0.672 ± 0.405
HisIle: 0.672 ± 0.551
HisLys: 0.336 ± 0.307
HisLeu: 1.345 ± 0.785
HisMet: 0.336 ± 0.367
HisAsn: 0.672 ± 0.527
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 1.008 ± 0.411
HisSer: 1.345 ± 0.704
HisThr: 0.336 ± 0.307
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 2.017 ± 0.942
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.706 ± 1.38
IleCys: 1.345 ± 0.674
IleAsp: 3.697 ± 1.448
IleGlu: 4.706 ± 1.193
IlePhe: 2.017 ± 0.668
IleGly: 3.361 ± 0.877
IleHis: 1.345 ± 0.675
IleIle: 4.034 ± 1.161
IleLys: 10.084 ± 2.247
IleLeu: 4.706 ± 1.137
IleMet: 1.008 ± 0.743
IleAsn: 3.361 ± 1.183
IlePro: 2.689 ± 0.674
IleGln: 1.681 ± 0.655
IleArg: 4.37 ± 1.177
IleSer: 4.034 ± 1.376
IleThr: 4.034 ± 1.38
IleVal: 2.689 ± 0.93
IleTrp: 0.336 ± 0.361
IleTyr: 1.681 ± 0.687
IleXaa: 0.0 ± 0.0
Lys
LysAla: 10.756 ± 1.811
LysCys: 0.0 ± 0.0
LysAsp: 4.706 ± 1.018
LysGlu: 10.084 ± 1.917
LysPhe: 2.689 ± 1.1
LysGly: 5.378 ± 1.971
LysHis: 1.681 ± 0.832
LysIle: 5.714 ± 1.622
LysLys: 8.739 ± 2.108
LysLeu: 9.748 ± 2.217
LysMet: 4.034 ± 1.547
LysAsn: 5.042 ± 1.626
LysPro: 2.353 ± 0.748
LysGln: 3.361 ± 1.178
LysArg: 3.361 ± 0.822
LysSer: 4.37 ± 1.042
LysThr: 8.403 ± 2.231
LysVal: 4.706 ± 1.057
LysTrp: 0.672 ± 0.589
LysTyr: 1.008 ± 0.632
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.731 ± 1.674
LeuCys: 1.345 ± 1.123
LeuAsp: 6.05 ± 1.043
LeuGlu: 10.084 ± 2.367
LeuPhe: 2.689 ± 1.153
LeuGly: 6.723 ± 1.507
LeuHis: 1.008 ± 0.736
LeuIle: 5.042 ± 0.975
LeuLys: 9.412 ± 1.686
LeuLeu: 8.739 ± 1.314
LeuMet: 3.025 ± 1.345
LeuAsn: 5.042 ± 0.928
LeuPro: 4.034 ± 1.429
LeuGln: 5.378 ± 1.076
LeuArg: 4.37 ± 1.22
LeuSer: 6.05 ± 1.381
LeuThr: 7.395 ± 1.395
LeuVal: 4.034 ± 1.073
LeuTrp: 1.345 ± 0.604
LeuTyr: 3.025 ± 1.409
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.353 ± 1.106
MetCys: 0.0 ± 0.0
MetAsp: 1.345 ± 0.576
MetGlu: 2.017 ± 0.814
MetPhe: 0.336 ± 0.307
MetGly: 1.008 ± 0.503
MetHis: 0.0 ± 0.0
MetIle: 1.681 ± 0.751
MetLys: 3.025 ± 1.292
MetLeu: 1.681 ± 0.54
MetMet: 0.672 ± 0.588
MetAsn: 1.345 ± 0.651
MetPro: 1.345 ± 0.584
MetGln: 0.672 ± 0.404
MetArg: 0.336 ± 0.344
MetSer: 1.681 ± 0.906
MetThr: 3.697 ± 1.017
MetVal: 1.345 ± 0.78
MetTrp: 0.0 ± 0.0
MetTyr: 0.336 ± 0.404
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.681 ± 0.691
AsnCys: 0.0 ± 0.0
AsnAsp: 2.353 ± 0.919
AsnGlu: 2.689 ± 0.97
AsnPhe: 1.681 ± 0.656
AsnGly: 3.025 ± 0.758
AsnHis: 1.008 ± 0.487
AsnIle: 2.353 ± 0.723
AsnLys: 4.034 ± 1.082
AsnLeu: 7.059 ± 1.13
AsnMet: 1.008 ± 0.585
AsnAsn: 3.361 ± 1.123
AsnPro: 4.034 ± 0.979
AsnGln: 2.353 ± 0.917
AsnArg: 2.353 ± 0.84
AsnSer: 2.689 ± 0.753
AsnThr: 2.017 ± 0.631
AsnVal: 1.681 ± 0.554
AsnTrp: 0.672 ± 0.527
AsnTyr: 2.017 ± 0.688
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.017 ± 0.711
ProCys: 0.336 ± 0.376
ProAsp: 2.017 ± 0.881
ProGlu: 2.689 ± 0.83
ProPhe: 2.017 ± 0.753
ProGly: 1.008 ± 0.429
ProHis: 0.672 ± 0.43
ProIle: 1.681 ± 0.661
ProLys: 1.345 ± 0.71
ProLeu: 2.353 ± 0.881
ProMet: 0.672 ± 0.459
ProAsn: 1.681 ± 0.809
ProPro: 0.336 ± 0.364
ProGln: 2.017 ± 0.765
ProArg: 1.345 ± 0.674
ProSer: 2.689 ± 0.725
ProThr: 2.689 ± 0.972
ProVal: 1.681 ± 0.94
ProTrp: 0.0 ± 0.0
ProTyr: 0.336 ± 0.273
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.361 ± 1.594
GlnCys: 0.0 ± 0.0
GlnAsp: 3.025 ± 0.715
GlnGlu: 6.387 ± 1.221
GlnPhe: 2.353 ± 1.017
GlnGly: 2.017 ± 0.752
GlnHis: 0.672 ± 0.614
GlnIle: 3.697 ± 0.881
GlnLys: 3.025 ± 0.754
GlnLeu: 4.034 ± 1.34
GlnMet: 0.336 ± 0.273
GlnAsn: 2.353 ± 0.919
GlnPro: 1.345 ± 0.557
GlnGln: 2.689 ± 1.133
GlnArg: 2.353 ± 0.775
GlnSer: 2.017 ± 1.04
GlnThr: 3.361 ± 1.054
GlnVal: 2.017 ± 0.628
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.345 ± 0.709
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.353 ± 0.909
ArgCys: 0.336 ± 0.307
ArgAsp: 2.353 ± 0.799
ArgGlu: 4.034 ± 0.888
ArgPhe: 1.681 ± 0.652
ArgGly: 3.025 ± 1.204
ArgHis: 1.008 ± 0.655
ArgIle: 2.689 ± 0.658
ArgLys: 7.059 ± 1.726
ArgLeu: 4.706 ± 1.368
ArgMet: 1.345 ± 0.534
ArgAsn: 1.008 ± 0.561
ArgPro: 1.681 ± 0.773
ArgGln: 3.697 ± 1.056
ArgArg: 3.025 ± 0.84
ArgSer: 2.017 ± 0.991
ArgThr: 1.681 ± 0.601
ArgVal: 0.672 ± 0.441
ArgTrp: 0.672 ± 0.492
ArgTyr: 1.681 ± 0.612
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.353 ± 0.86
SerCys: 0.336 ± 0.36
SerAsp: 4.034 ± 1.362
SerGlu: 3.697 ± 1.338
SerPhe: 4.37 ± 1.185
SerGly: 2.689 ± 0.93
SerHis: 1.008 ± 0.617
SerIle: 4.37 ± 1.856
SerLys: 3.697 ± 1.058
SerLeu: 7.395 ± 1.101
SerMet: 2.017 ± 0.951
SerAsn: 5.042 ± 1.472
SerPro: 1.008 ± 0.6
SerGln: 2.689 ± 0.776
SerArg: 1.345 ± 0.848
SerSer: 2.689 ± 0.814
SerThr: 3.361 ± 0.73
SerVal: 2.353 ± 0.893
SerTrp: 0.336 ± 0.344
SerTyr: 4.37 ± 1.417
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.37 ± 1.194
ThrCys: 0.672 ± 0.486
ThrAsp: 2.689 ± 0.873
ThrGlu: 4.706 ± 1.344
ThrPhe: 3.697 ± 0.91
ThrGly: 3.697 ± 1.098
ThrHis: 1.345 ± 0.615
ThrIle: 4.034 ± 1.256
ThrLys: 4.034 ± 1.191
ThrLeu: 5.714 ± 1.697
ThrMet: 1.345 ± 0.812
ThrAsn: 2.689 ± 0.831
ThrPro: 3.025 ± 0.66
ThrGln: 2.353 ± 1.109
ThrArg: 3.361 ± 0.891
ThrSer: 3.025 ± 0.724
ThrThr: 2.689 ± 0.952
ThrVal: 5.042 ± 1.489
ThrTrp: 0.672 ± 0.608
ThrTyr: 2.689 ± 0.791
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.025 ± 0.91
ValCys: 0.336 ± 0.367
ValAsp: 1.008 ± 0.567
ValGlu: 2.353 ± 1.09
ValPhe: 3.025 ± 1.009
ValGly: 2.689 ± 0.586
ValHis: 0.0 ± 0.0
ValIle: 2.689 ± 0.891
ValLys: 5.042 ± 1.759
ValLeu: 3.025 ± 1.045
ValMet: 1.681 ± 0.699
ValAsn: 3.361 ± 1.284
ValPro: 1.008 ± 0.565
ValGln: 1.345 ± 0.455
ValArg: 3.697 ± 1.187
ValSer: 5.042 ± 0.939
ValThr: 3.361 ± 0.87
ValVal: 2.353 ± 0.866
ValTrp: 0.672 ± 0.547
ValTyr: 3.025 ± 1.089
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.008 ± 0.581
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.336 ± 0.36
TrpLys: 1.681 ± 0.742
TrpLeu: 2.017 ± 0.891
TrpMet: 0.336 ± 0.344
TrpAsn: 0.336 ± 0.278
TrpPro: 0.336 ± 0.403
TrpGln: 0.336 ± 0.39
TrpArg: 0.0 ± 0.0
TrpSer: 0.672 ± 0.459
TrpThr: 0.336 ± 0.39
TrpVal: 0.336 ± 0.273
TrpTrp: 0.336 ± 0.273
TrpTyr: 0.672 ± 0.481
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.681 ± 0.548
TyrCys: 0.672 ± 0.534
TyrAsp: 2.353 ± 0.913
TyrGlu: 1.681 ± 0.483
TyrPhe: 2.689 ± 1.053
TyrGly: 0.336 ± 0.294
TyrHis: 0.0 ± 0.0
TyrIle: 2.689 ± 0.928
TyrLys: 5.042 ± 1.656
TyrLeu: 5.042 ± 1.209
TyrMet: 0.672 ± 0.566
TyrAsn: 2.017 ± 0.764
TyrPro: 0.336 ± 0.294
TyrGln: 3.025 ± 0.616
TyrArg: 1.681 ± 0.899
TyrSer: 3.361 ± 0.655
TyrThr: 1.345 ± 0.549
TyrVal: 1.008 ± 0.69
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.353 ± 0.888
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 19 proteins (2976 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
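Protein-level bootstrapping can be sketched as follows: resample whole proteins with replacement, recompute the frequency for each resample, and summarise the spread across resamples. Everything here is an assumption for illustration (the function names, the use of the population standard deviation as the reported error, the fixed seed, and the toy sequences); the note above states only that bootstrapping was repeated 100 times at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within a single protein
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    # Assumption: the error is the standard deviation of the estimates
    return statistics.pstdev(estimates)


# Toy sequences, not taken from the Javan544 proteome
toy = ["MKKLAEEL", "AKKELLK", "MEELKK"]
err = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the pair counts inside a protein are never broken up.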

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski