Amino acid dipepetide frequency for Streptococcus satellite phage Javan561

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.798AlaCys: 0.798 ± 0.71
5.185AlaAsp: 5.185 ± 1.564
5.584AlaGlu: 5.584 ± 1.238
1.596AlaPhe: 1.596 ± 0.612
3.191AlaGly: 3.191 ± 0.931
0.798AlaHis: 0.798 ± 0.71
4.388AlaIle: 4.388 ± 1.296
4.787AlaLys: 4.787 ± 1.304
3.989AlaLeu: 3.989 ± 1.491
1.197AlaMet: 1.197 ± 0.681
3.191AlaAsn: 3.191 ± 1.101
1.994AlaPro: 1.994 ± 0.726
4.388AlaGln: 4.388 ± 1.685
2.792AlaArg: 2.792 ± 0.922
4.787AlaSer: 4.787 ± 1.239
1.994AlaThr: 1.994 ± 0.87
3.191AlaVal: 3.191 ± 0.73
0.798AlaTrp: 0.798 ± 0.627
2.393AlaTyr: 2.393 ± 0.735
0.0AlaXaa: 0.0 ± 0.0
Cys
0.399CysAla: 0.399 ± 0.396
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.399CysGlu: 0.399 ± 0.396
0.0CysPhe: 0.0 ± 0.0
0.399CysGly: 0.399 ± 0.382
0.0CysHis: 0.0 ± 0.0
1.596CysIle: 1.596 ± 0.751
0.0CysLys: 0.0 ± 0.0
0.399CysLeu: 0.399 ± 0.398
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.798CysGln: 0.798 ± 0.468
0.399CysArg: 0.399 ± 0.333
0.798CysSer: 0.798 ± 0.543
0.0CysThr: 0.0 ± 0.0
1.197CysVal: 1.197 ± 0.609
0.0CysTrp: 0.0 ± 0.0
0.399CysTyr: 0.399 ± 0.355
0.0CysXaa: 0.0 ± 0.0
Asp
0.798AspAla: 0.798 ± 0.462
0.798AspCys: 0.798 ± 0.417
3.191AspAsp: 3.191 ± 1.08
5.185AspGlu: 5.185 ± 1.257
3.191AspPhe: 3.191 ± 1.149
2.393AspGly: 2.393 ± 0.781
1.596AspHis: 1.596 ± 1.005
5.185AspIle: 5.185 ± 1.646
5.983AspLys: 5.983 ± 1.503
8.377AspLeu: 8.377 ± 1.455
0.399AspMet: 0.399 ± 0.447
3.989AspAsn: 3.989 ± 0.956
1.197AspPro: 1.197 ± 0.699
1.197AspGln: 1.197 ± 0.659
1.994AspArg: 1.994 ± 0.727
0.798AspSer: 0.798 ± 0.501
4.388AspThr: 4.388 ± 1.428
3.59AspVal: 3.59 ± 1.141
0.399AspTrp: 0.399 ± 0.333
4.787AspTyr: 4.787 ± 1.031
0.0AspXaa: 0.0 ± 0.0
Glu
4.388GluAla: 4.388 ± 1.675
1.596GluCys: 1.596 ± 0.767
5.584GluAsp: 5.584 ± 1.572
7.978GluGlu: 7.978 ± 2.039
3.59GluPhe: 3.59 ± 1.103
3.989GluGly: 3.989 ± 1.511
1.596GluHis: 1.596 ± 0.689
4.388GluIle: 4.388 ± 1.494
9.174GluLys: 9.174 ± 1.753
9.573GluLeu: 9.573 ± 2.507
1.596GluMet: 1.596 ± 0.997
4.787GluAsn: 4.787 ± 1.426
2.792GluPro: 2.792 ± 1.093
2.792GluGln: 2.792 ± 1.176
4.787GluArg: 4.787 ± 1.857
4.787GluSer: 4.787 ± 1.212
3.989GluThr: 3.989 ± 1.108
3.989GluVal: 3.989 ± 1.616
0.399GluTrp: 0.399 ± 0.355
1.994GluTyr: 1.994 ± 0.866
0.0GluXaa: 0.0 ± 0.0
Phe
2.393PheAla: 2.393 ± 0.662
0.0PheCys: 0.0 ± 0.0
2.792PheAsp: 2.792 ± 0.873
2.792PheGlu: 2.792 ± 1.246
1.994PhePhe: 1.994 ± 1.038
1.994PheGly: 1.994 ± 0.875
0.399PheHis: 0.399 ± 0.333
3.989PheIle: 3.989 ± 1.417
3.989PheLys: 3.989 ± 1.006
3.989PheLeu: 3.989 ± 0.917
0.399PheMet: 0.399 ± 0.583
3.191PheAsn: 3.191 ± 1.305
2.393PhePro: 2.393 ± 0.724
1.596PheGln: 1.596 ± 0.742
2.792PheArg: 2.792 ± 1.116
1.994PheSer: 1.994 ± 0.92
4.787PheThr: 4.787 ± 1.306
0.798PheVal: 0.798 ± 0.667
0.399PheTrp: 0.399 ± 0.333
3.191PheTyr: 3.191 ± 0.785
0.0PheXaa: 0.0 ± 0.0
Gly
2.792GlyAla: 2.792 ± 0.856
0.0GlyCys: 0.0 ± 0.0
3.191GlyAsp: 3.191 ± 1.471
4.388GlyGlu: 4.388 ± 1.681
3.59GlyPhe: 3.59 ± 1.047
1.197GlyGly: 1.197 ± 0.68
0.798GlyHis: 0.798 ± 0.543
4.787GlyIle: 4.787 ± 1.191
7.18GlyLys: 7.18 ± 1.664
1.994GlyLeu: 1.994 ± 0.778
1.596GlyMet: 1.596 ± 1.134
3.191GlyAsn: 3.191 ± 1.4
0.0GlyPro: 0.0 ± 0.0
0.399GlyGln: 0.399 ± 0.432
1.197GlyArg: 1.197 ± 0.736
1.596GlySer: 1.596 ± 0.731
2.393GlyThr: 2.393 ± 1.189
2.393GlyVal: 2.393 ± 1.243
2.792GlyTrp: 2.792 ± 1.072
3.989GlyTyr: 3.989 ± 1.226
0.0GlyXaa: 0.0 ± 0.0
His
1.596HisAla: 1.596 ± 0.797
0.798HisCys: 0.798 ± 0.468
1.197HisAsp: 1.197 ± 0.623
0.399HisGlu: 0.399 ± 0.355
1.197HisPhe: 1.197 ± 0.651
1.197HisGly: 1.197 ± 0.658
0.399HisHis: 0.399 ± 0.36
2.393HisIle: 2.393 ± 0.982
2.393HisLys: 2.393 ± 1.138
1.596HisLeu: 1.596 ± 0.765
0.0HisMet: 0.0 ± 0.0
1.596HisAsn: 1.596 ± 0.974
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.798HisArg: 0.798 ± 0.494
1.596HisSer: 1.596 ± 0.536
2.393HisThr: 2.393 ± 0.591
0.399HisVal: 0.399 ± 0.355
0.0HisTrp: 0.0 ± 0.0
2.792HisTyr: 2.792 ± 0.844
0.0HisXaa: 0.0 ± 0.0
Ile
4.787IleAla: 4.787 ± 1.234
0.0IleCys: 0.0 ± 0.0
2.792IleAsp: 2.792 ± 1.19
5.584IleGlu: 5.584 ± 1.06
1.994IlePhe: 1.994 ± 0.853
4.787IleGly: 4.787 ± 1.639
0.798IleHis: 0.798 ± 0.481
4.388IleIle: 4.388 ± 1.217
6.781IleLys: 6.781 ± 1.258
3.989IleLeu: 3.989 ± 1.104
0.798IleMet: 0.798 ± 0.546
3.989IleAsn: 3.989 ± 0.935
1.596IlePro: 1.596 ± 0.957
3.191IleGln: 3.191 ± 0.953
4.388IleArg: 4.388 ± 1.255
3.59IleSer: 3.59 ± 0.714
6.382IleThr: 6.382 ± 1.053
3.59IleVal: 3.59 ± 1.155
0.798IleTrp: 0.798 ± 0.542
3.989IleTyr: 3.989 ± 0.875
0.0IleXaa: 0.0 ± 0.0
Lys
5.185LysAla: 5.185 ± 1.357
0.399LysCys: 0.399 ± 0.396
5.983LysAsp: 5.983 ± 1.126
7.18LysGlu: 7.18 ± 2.542
1.596LysPhe: 1.596 ± 0.941
5.983LysGly: 5.983 ± 2.325
2.393LysHis: 2.393 ± 1.018
4.787LysIle: 4.787 ± 1.25
9.174LysLys: 9.174 ± 2.816
7.579LysLeu: 7.579 ± 2.001
3.191LysMet: 3.191 ± 0.902
3.59LysAsn: 3.59 ± 1.08
4.787LysPro: 4.787 ± 1.002
6.382LysGln: 6.382 ± 1.167
5.584LysArg: 5.584 ± 1.526
4.787LysSer: 4.787 ± 1.38
7.978LysThr: 7.978 ± 1.505
4.388LysVal: 4.388 ± 1.559
1.197LysTrp: 1.197 ± 0.631
2.792LysTyr: 2.792 ± 1.18
0.0LysXaa: 0.0 ± 0.0
Leu
5.983LeuAla: 5.983 ± 1.047
0.399LeuCys: 0.399 ± 0.396
11.568LeuAsp: 11.568 ± 1.695
12.764LeuGlu: 12.764 ± 2.859
4.787LeuPhe: 4.787 ± 0.97
3.191LeuGly: 3.191 ± 1.228
2.393LeuHis: 2.393 ± 0.963
3.989LeuIle: 3.989 ± 1.269
11.169LeuLys: 11.169 ± 2.302
11.966LeuLeu: 11.966 ± 1.531
2.792LeuMet: 2.792 ± 1.212
4.787LeuAsn: 4.787 ± 1.308
3.989LeuPro: 3.989 ± 1.696
5.185LeuGln: 5.185 ± 1.501
4.787LeuArg: 4.787 ± 1.713
1.596LeuSer: 1.596 ± 0.556
4.388LeuThr: 4.388 ± 1.441
3.59LeuVal: 3.59 ± 0.965
0.399LeuTrp: 0.399 ± 0.432
3.191LeuTyr: 3.191 ± 0.799
0.0LeuXaa: 0.0 ± 0.0
Met
3.191MetAla: 3.191 ± 1.103
0.399MetCys: 0.399 ± 0.355
0.798MetAsp: 0.798 ± 0.543
2.393MetGlu: 2.393 ± 1.161
0.798MetPhe: 0.798 ± 0.445
0.798MetGly: 0.798 ± 0.533
0.0MetHis: 0.0 ± 0.0
1.197MetIle: 1.197 ± 0.594
2.393MetLys: 2.393 ± 1.01
1.994MetLeu: 1.994 ± 1.068
0.798MetMet: 0.798 ± 0.547
1.197MetAsn: 1.197 ± 0.625
0.0MetPro: 0.0 ± 0.0
0.798MetGln: 0.798 ± 0.55
0.798MetArg: 0.798 ± 0.439
1.197MetSer: 1.197 ± 0.645
2.393MetThr: 2.393 ± 0.895
1.197MetVal: 1.197 ± 0.537
0.0MetTrp: 0.0 ± 0.0
0.798MetTyr: 0.798 ± 0.855
0.0MetXaa: 0.0 ± 0.0
Asn
4.388AsnAla: 4.388 ± 1.056
0.0AsnCys: 0.0 ± 0.0
2.792AsnAsp: 2.792 ± 0.964
3.989AsnGlu: 3.989 ± 1.439
3.59AsnPhe: 3.59 ± 0.84
3.59AsnGly: 3.59 ± 1.071
1.994AsnHis: 1.994 ± 1.234
2.792AsnIle: 2.792 ± 0.595
3.59AsnLys: 3.59 ± 1.115
5.185AsnLeu: 5.185 ± 1.535
0.798AsnMet: 0.798 ± 0.553
4.388AsnAsn: 4.388 ± 1.521
1.994AsnPro: 1.994 ± 0.559
1.994AsnGln: 1.994 ± 0.842
2.393AsnArg: 2.393 ± 1.048
1.197AsnSer: 1.197 ± 0.603
3.59AsnThr: 3.59 ± 0.979
1.197AsnVal: 1.197 ± 0.664
0.798AsnTrp: 0.798 ± 0.462
3.191AsnTyr: 3.191 ± 1.49
0.0AsnXaa: 0.0 ± 0.0
Pro
1.994ProAla: 1.994 ± 0.716
0.0ProCys: 0.0 ± 0.0
1.994ProAsp: 1.994 ± 0.67
0.399ProGlu: 0.399 ± 0.432
1.994ProPhe: 1.994 ± 0.911
0.0ProGly: 0.0 ± 0.0
1.994ProHis: 1.994 ± 1.285
2.393ProIle: 2.393 ± 0.667
2.393ProLys: 2.393 ± 0.887
2.792ProLeu: 2.792 ± 0.764
0.798ProMet: 0.798 ± 0.71
2.393ProAsn: 2.393 ± 1.015
0.399ProPro: 0.399 ± 0.333
1.596ProGln: 1.596 ± 0.585
1.994ProArg: 1.994 ± 0.663
0.798ProSer: 0.798 ± 0.667
1.596ProThr: 1.596 ± 0.509
1.596ProVal: 1.596 ± 0.968
0.0ProTrp: 0.0 ± 0.0
0.399ProTyr: 0.399 ± 0.355
0.0ProXaa: 0.0 ± 0.0
Gln
4.787GlnAla: 4.787 ± 1.661
0.0GlnCys: 0.0 ± 0.0
1.994GlnAsp: 1.994 ± 0.904
5.584GlnGlu: 5.584 ± 1.374
1.994GlnPhe: 1.994 ± 0.715
2.792GlnGly: 2.792 ± 0.923
1.994GlnHis: 1.994 ± 0.914
2.393GlnIle: 2.393 ± 0.74
2.792GlnLys: 2.792 ± 1.226
3.59GlnLeu: 3.59 ± 0.945
1.197GlnMet: 1.197 ± 0.706
1.596GlnAsn: 1.596 ± 0.837
0.399GlnPro: 0.399 ± 0.333
1.994GlnGln: 1.994 ± 1.155
1.596GlnArg: 1.596 ± 0.594
1.197GlnSer: 1.197 ± 0.676
3.59GlnThr: 3.59 ± 1.283
1.994GlnVal: 1.994 ± 0.995
0.798GlnTrp: 0.798 ± 0.717
3.191GlnTyr: 3.191 ± 1.097
0.0GlnXaa: 0.0 ± 0.0
Arg
3.59ArgAla: 3.59 ± 1.189
0.399ArgCys: 0.399 ± 0.355
1.994ArgAsp: 1.994 ± 0.713
3.59ArgGlu: 3.59 ± 1.197
2.393ArgPhe: 2.393 ± 0.914
3.989ArgGly: 3.989 ± 1.037
1.994ArgHis: 1.994 ± 0.8
2.393ArgIle: 2.393 ± 0.644
5.983ArgLys: 5.983 ± 1.205
4.787ArgLeu: 4.787 ± 1.454
0.399ArgMet: 0.399 ± 0.333
1.596ArgAsn: 1.596 ± 0.947
0.399ArgPro: 0.399 ± 0.333
3.989ArgGln: 3.989 ± 1.078
3.191ArgArg: 3.191 ± 1.062
1.197ArgSer: 1.197 ± 0.683
3.191ArgThr: 3.191 ± 1.015
3.191ArgVal: 3.191 ± 0.985
0.399ArgTrp: 0.399 ± 0.355
1.197ArgTyr: 1.197 ± 0.53
0.0ArgXaa: 0.0 ± 0.0
Ser
1.596SerAla: 1.596 ± 0.86
0.399SerCys: 0.399 ± 0.382
2.792SerAsp: 2.792 ± 0.913
3.191SerGlu: 3.191 ± 1.032
0.798SerPhe: 0.798 ± 0.519
0.798SerGly: 0.798 ± 0.55
0.0SerHis: 0.0 ± 0.0
6.382SerIle: 6.382 ± 1.097
3.191SerLys: 3.191 ± 1.002
8.377SerLeu: 8.377 ± 1.535
0.798SerMet: 0.798 ± 0.623
0.0SerAsn: 0.0 ± 0.0
0.798SerPro: 0.798 ± 0.427
2.393SerGln: 2.393 ± 0.89
1.994SerArg: 1.994 ± 0.891
1.994SerSer: 1.994 ± 0.573
1.994SerThr: 1.994 ± 0.975
1.994SerVal: 1.994 ± 0.605
0.399SerTrp: 0.399 ± 0.36
2.393SerTyr: 2.393 ± 0.762
0.0SerXaa: 0.0 ± 0.0
Thr
3.59ThrAla: 3.59 ± 1.215
0.0ThrCys: 0.0 ± 0.0
2.393ThrAsp: 2.393 ± 0.682
2.393ThrGlu: 2.393 ± 1.093
3.191ThrPhe: 3.191 ± 1.066
4.388ThrGly: 4.388 ± 1.083
1.596ThrHis: 1.596 ± 0.628
3.59ThrIle: 3.59 ± 1.149
6.382ThrLys: 6.382 ± 1.423
9.174ThrLeu: 9.174 ± 2.078
1.994ThrMet: 1.994 ± 0.786
3.59ThrAsn: 3.59 ± 1.451
1.994ThrPro: 1.994 ± 0.766
2.792ThrGln: 2.792 ± 0.748
3.191ThrArg: 3.191 ± 1.155
1.994ThrSer: 1.994 ± 0.546
3.191ThrThr: 3.191 ± 1.247
4.388ThrVal: 4.388 ± 1.656
0.399ThrTrp: 0.399 ± 0.409
1.596ThrTyr: 1.596 ± 0.752
0.0ThrXaa: 0.0 ± 0.0
Val
2.792ValAla: 2.792 ± 0.691
0.399ValCys: 0.399 ± 0.333
1.197ValAsp: 1.197 ± 0.765
4.787ValGlu: 4.787 ± 1.329
3.59ValPhe: 3.59 ± 1.105
3.191ValGly: 3.191 ± 0.946
0.0ValHis: 0.0 ± 0.0
2.393ValIle: 2.393 ± 1.315
3.191ValLys: 3.191 ± 1.08
4.787ValLeu: 4.787 ± 1.042
2.393ValMet: 2.393 ± 1.216
2.792ValAsn: 2.792 ± 0.661
1.596ValPro: 1.596 ± 0.628
1.994ValGln: 1.994 ± 0.685
3.59ValArg: 3.59 ± 1.264
3.191ValSer: 3.191 ± 1.165
1.994ValThr: 1.994 ± 0.546
1.197ValVal: 1.197 ± 0.68
0.399ValTrp: 0.399 ± 0.333
2.393ValTyr: 2.393 ± 0.964
0.0ValXaa: 0.0 ± 0.0
Trp
0.399TrpAla: 0.399 ± 0.409
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
3.191TrpGlu: 3.191 ± 0.741
0.798TrpPhe: 0.798 ± 0.417
0.0TrpGly: 0.0 ± 0.0
0.399TrpHis: 0.399 ± 0.36
0.399TrpIle: 0.399 ± 0.36
0.399TrpLys: 0.399 ± 0.355
1.596TrpLeu: 1.596 ± 0.603
0.0TrpMet: 0.0 ± 0.0
0.399TrpAsn: 0.399 ± 0.36
0.0TrpPro: 0.0 ± 0.0
0.399TrpGln: 0.399 ± 0.333
0.399TrpArg: 0.399 ± 0.409
0.0TrpSer: 0.0 ± 0.0
0.399TrpThr: 0.399 ± 0.409
1.596TrpVal: 1.596 ± 1.029
0.399TrpTrp: 0.399 ± 0.333
0.399TrpTyr: 0.399 ± 0.333
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.393TyrAla: 2.393 ± 1.051
0.399TyrCys: 0.399 ± 0.432
1.994TyrAsp: 1.994 ± 0.839
1.994TyrGlu: 1.994 ± 1.126
3.59TyrPhe: 3.59 ± 0.996
1.596TyrGly: 1.596 ± 0.622
1.596TyrHis: 1.596 ± 0.977
4.388TyrIle: 4.388 ± 1.333
3.989TyrLys: 3.989 ± 1.151
7.18TyrLeu: 7.18 ± 2.216
1.596TyrMet: 1.596 ± 0.636
3.191TyrAsn: 3.191 ± 1.173
1.197TyrPro: 1.197 ± 0.68
1.596TyrGln: 1.596 ± 0.834
1.197TyrArg: 1.197 ± 0.566
3.191TyrSer: 3.191 ± 0.754
1.197TyrThr: 1.197 ± 0.714
2.393TyrVal: 2.393 ± 0.684
0.399TyrTrp: 0.399 ± 0.36
1.197TyrTyr: 1.197 ± 0.683
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 16 proteins (2508 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski