Amino acid dipepetide frequency for Streptococcus satellite phage Javan279

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.56AlaAla: 2.56 ± 1.068
0.366AlaCys: 0.366 ± 0.311
3.658AlaAsp: 3.658 ± 1.098
4.389AlaGlu: 4.389 ± 1.245
3.292AlaPhe: 3.292 ± 0.795
4.755AlaGly: 4.755 ± 1.339
1.463AlaHis: 1.463 ± 0.753
4.755AlaIle: 4.755 ± 1.168
6.95AlaLys: 6.95 ± 1.395
9.144AlaLeu: 9.144 ± 1.797
2.195AlaMet: 2.195 ± 0.65
3.292AlaAsn: 3.292 ± 1.213
2.195AlaPro: 2.195 ± 0.709
2.56AlaGln: 2.56 ± 0.778
2.926AlaArg: 2.926 ± 0.964
2.926AlaSer: 2.926 ± 1.238
3.292AlaThr: 3.292 ± 0.757
2.926AlaVal: 2.926 ± 1.03
0.732AlaTrp: 0.732 ± 0.506
4.023AlaTyr: 4.023 ± 1.195
0.0AlaXaa: 0.0 ± 0.0
Cys
1.097CysAla: 1.097 ± 0.529
0.0CysCys: 0.0 ± 0.0
0.366CysAsp: 0.366 ± 0.376
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.732CysGly: 0.732 ± 0.499
0.366CysHis: 0.366 ± 0.36
0.366CysIle: 0.366 ± 0.341
0.732CysLys: 0.732 ± 0.436
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.366CysPro: 0.366 ± 0.307
0.366CysGln: 0.366 ± 0.345
0.0CysArg: 0.0 ± 0.0
0.366CysSer: 0.366 ± 0.307
0.366CysThr: 0.366 ± 0.401
0.366CysVal: 0.366 ± 0.307
0.0CysTrp: 0.0 ± 0.0
0.366CysTyr: 0.366 ± 0.376
0.0CysXaa: 0.0 ± 0.0
Asp
1.829AspAla: 1.829 ± 0.777
0.732AspCys: 0.732 ± 0.496
2.56AspAsp: 2.56 ± 1.4
2.926AspGlu: 2.926 ± 0.812
3.658AspPhe: 3.658 ± 1.422
1.829AspGly: 1.829 ± 0.871
0.732AspHis: 0.732 ± 0.486
6.95AspIle: 6.95 ± 1.52
5.852AspLys: 5.852 ± 1.399
6.95AspLeu: 6.95 ± 1.448
1.829AspMet: 1.829 ± 0.716
3.658AspAsn: 3.658 ± 1.342
0.366AspPro: 0.366 ± 0.368
0.732AspGln: 0.732 ± 0.429
0.732AspArg: 0.732 ± 0.534
3.292AspSer: 3.292 ± 1.117
1.829AspThr: 1.829 ± 0.86
1.097AspVal: 1.097 ± 0.698
0.732AspTrp: 0.732 ± 0.432
3.292AspTyr: 3.292 ± 1.55
0.0AspXaa: 0.0 ± 0.0
Glu
8.047GluAla: 8.047 ± 1.337
0.732GluCys: 0.732 ± 0.438
3.658GluAsp: 3.658 ± 1.32
6.218GluGlu: 6.218 ± 2.11
4.755GluPhe: 4.755 ± 1.606
2.195GluGly: 2.195 ± 0.96
0.366GluHis: 0.366 ± 0.401
4.389GluIle: 4.389 ± 1.205
9.144GluLys: 9.144 ± 1.463
7.315GluLeu: 7.315 ± 1.551
3.292GluMet: 3.292 ± 0.96
3.658GluAsn: 3.658 ± 1.157
1.097GluPro: 1.097 ± 0.716
6.218GluGln: 6.218 ± 1.627
4.023GluArg: 4.023 ± 2.17
2.926GluSer: 2.926 ± 0.859
3.292GluThr: 3.292 ± 0.629
3.658GluVal: 3.658 ± 1.034
1.463GluTrp: 1.463 ± 0.675
2.195GluTyr: 2.195 ± 0.839
0.0GluXaa: 0.0 ± 0.0
Phe
2.926PheAla: 2.926 ± 0.774
0.732PheCys: 0.732 ± 0.516
3.658PheAsp: 3.658 ± 1.085
3.292PheGlu: 3.292 ± 1.121
1.829PhePhe: 1.829 ± 0.71
2.195PheGly: 2.195 ± 0.594
0.732PheHis: 0.732 ± 0.431
4.023PheIle: 4.023 ± 1.168
3.658PheLys: 3.658 ± 1.004
3.658PheLeu: 3.658 ± 1.02
0.732PheMet: 0.732 ± 0.44
1.097PheAsn: 1.097 ± 0.61
1.097PhePro: 1.097 ± 0.571
1.463PheGln: 1.463 ± 0.71
2.195PheArg: 2.195 ± 0.714
1.829PheSer: 1.829 ± 0.683
1.829PheThr: 1.829 ± 0.564
1.829PheVal: 1.829 ± 0.613
0.732PheTrp: 0.732 ± 0.464
2.195PheTyr: 2.195 ± 1.13
0.0PheXaa: 0.0 ± 0.0
Gly
2.195GlyAla: 2.195 ± 0.744
0.0GlyCys: 0.0 ± 0.0
0.732GlyAsp: 0.732 ± 0.417
1.829GlyGlu: 1.829 ± 0.779
1.097GlyPhe: 1.097 ± 0.613
2.926GlyGly: 2.926 ± 1.041
0.366GlyHis: 0.366 ± 0.311
3.658GlyIle: 3.658 ± 1.067
4.755GlyLys: 4.755 ± 1.197
9.144GlyLeu: 9.144 ± 1.975
2.195GlyMet: 2.195 ± 1.077
1.829GlyAsn: 1.829 ± 0.834
0.0GlyPro: 0.0 ± 0.0
1.829GlyGln: 1.829 ± 0.845
2.195GlyArg: 2.195 ± 0.756
2.926GlySer: 2.926 ± 0.964
2.195GlyThr: 2.195 ± 0.636
1.463GlyVal: 1.463 ± 0.52
1.097GlyTrp: 1.097 ± 0.694
2.926GlyTyr: 2.926 ± 0.986
0.0GlyXaa: 0.0 ± 0.0
His
1.463HisAla: 1.463 ± 0.934
0.0HisCys: 0.0 ± 0.0
0.366HisAsp: 0.366 ± 0.335
0.732HisGlu: 0.732 ± 0.632
0.366HisPhe: 0.366 ± 0.473
1.097HisGly: 1.097 ± 0.73
1.097HisHis: 1.097 ± 1.056
2.195HisIle: 2.195 ± 1.118
0.732HisLys: 0.732 ± 0.517
2.56HisLeu: 2.56 ± 0.713
0.0HisMet: 0.0 ± 0.0
0.732HisAsn: 0.732 ± 0.433
0.366HisPro: 0.366 ± 0.36
0.0HisGln: 0.0 ± 0.0
0.732HisArg: 0.732 ± 0.67
0.732HisSer: 0.732 ± 0.402
0.732HisThr: 0.732 ± 0.466
0.366HisVal: 0.366 ± 0.307
0.0HisTrp: 0.0 ± 0.0
0.366HisTyr: 0.366 ± 0.36
0.0HisXaa: 0.0 ± 0.0
Ile
4.389IleAla: 4.389 ± 1.181
1.097IleCys: 1.097 ± 0.628
6.218IleAsp: 6.218 ± 1.438
5.852IleGlu: 5.852 ± 1.468
2.56IlePhe: 2.56 ± 0.893
1.829IleGly: 1.829 ± 0.813
1.097IleHis: 1.097 ± 0.694
5.486IleIle: 5.486 ± 1.111
5.486IleLys: 5.486 ± 1.321
8.778IleLeu: 8.778 ± 2.312
1.097IleMet: 1.097 ± 0.616
3.292IleAsn: 3.292 ± 1.059
2.926IlePro: 2.926 ± 0.953
3.658IleGln: 3.658 ± 0.811
5.121IleArg: 5.121 ± 1.285
4.023IleSer: 4.023 ± 1.064
4.389IleThr: 4.389 ± 1.111
2.195IleVal: 2.195 ± 0.929
1.097IleTrp: 1.097 ± 0.512
2.56IleTyr: 2.56 ± 0.678
0.0IleXaa: 0.0 ± 0.0
Lys
9.144LysAla: 9.144 ± 2.008
0.366LysCys: 0.366 ± 0.341
3.292LysAsp: 3.292 ± 1.369
9.876LysGlu: 9.876 ± 1.354
2.56LysPhe: 2.56 ± 0.855
2.195LysGly: 2.195 ± 0.818
1.829LysHis: 1.829 ± 1.094
6.218LysIle: 6.218 ± 1.473
10.973LysLys: 10.973 ± 2.118
8.047LysLeu: 8.047 ± 1.39
1.097LysMet: 1.097 ± 0.717
8.778LysAsn: 8.778 ± 1.431
2.926LysPro: 2.926 ± 0.965
4.389LysGln: 4.389 ± 1.013
4.023LysArg: 4.023 ± 0.942
4.755LysSer: 4.755 ± 0.828
7.315LysThr: 7.315 ± 1.434
4.389LysVal: 4.389 ± 0.936
0.732LysTrp: 0.732 ± 0.506
2.195LysTyr: 2.195 ± 1.018
0.0LysXaa: 0.0 ± 0.0
Leu
8.778LeuAla: 8.778 ± 1.406
0.366LeuCys: 0.366 ± 0.345
6.95LeuAsp: 6.95 ± 1.587
11.704LeuGlu: 11.704 ± 1.766
4.755LeuPhe: 4.755 ± 1.557
6.218LeuGly: 6.218 ± 1.716
0.732LeuHis: 0.732 ± 0.484
4.023LeuIle: 4.023 ± 1.196
12.436LeuLys: 12.436 ± 1.971
13.533LeuLeu: 13.533 ± 2.382
3.292LeuMet: 3.292 ± 0.779
6.95LeuAsn: 6.95 ± 1.447
2.926LeuPro: 2.926 ± 1.182
5.121LeuGln: 5.121 ± 1.08
2.926LeuArg: 2.926 ± 1.119
6.584LeuSer: 6.584 ± 1.117
8.047LeuThr: 8.047 ± 1.392
4.389LeuVal: 4.389 ± 1.129
1.097LeuTrp: 1.097 ± 0.773
4.389LeuTyr: 4.389 ± 1.237
0.0LeuXaa: 0.0 ± 0.0
Met
3.658MetAla: 3.658 ± 1.143
0.0MetCys: 0.0 ± 0.0
0.732MetAsp: 0.732 ± 0.5
1.829MetGlu: 1.829 ± 0.878
0.366MetPhe: 0.366 ± 0.312
0.0MetGly: 0.0 ± 0.0
0.366MetHis: 0.366 ± 0.312
1.829MetIle: 1.829 ± 0.764
2.195MetLys: 2.195 ± 1.074
2.56MetLeu: 2.56 ± 0.929
0.0MetMet: 0.0 ± 0.0
1.829MetAsn: 1.829 ± 0.92
1.097MetPro: 1.097 ± 0.675
0.0MetGln: 0.0 ± 0.0
1.829MetArg: 1.829 ± 0.444
0.732MetSer: 0.732 ± 0.549
2.56MetThr: 2.56 ± 0.844
1.463MetVal: 1.463 ± 0.729
0.366MetTrp: 0.366 ± 0.376
0.732MetTyr: 0.732 ± 0.493
0.0MetXaa: 0.0 ± 0.0
Asn
4.389AsnAla: 4.389 ± 1.418
0.0AsnCys: 0.0 ± 0.0
2.195AsnAsp: 2.195 ± 0.718
3.658AsnGlu: 3.658 ± 1.53
3.658AsnPhe: 3.658 ± 0.715
4.023AsnGly: 4.023 ± 1.104
0.0AsnHis: 0.0 ± 0.0
3.292AsnIle: 3.292 ± 0.699
4.023AsnLys: 4.023 ± 0.902
5.121AsnLeu: 5.121 ± 1.702
0.732AsnMet: 0.732 ± 0.445
6.218AsnAsn: 6.218 ± 1.884
1.463AsnPro: 1.463 ± 0.553
3.292AsnGln: 3.292 ± 0.878
4.389AsnArg: 4.389 ± 1.175
1.829AsnSer: 1.829 ± 0.642
2.56AsnThr: 2.56 ± 1.035
2.56AsnVal: 2.56 ± 0.983
0.732AsnTrp: 0.732 ± 0.461
2.195AsnTyr: 2.195 ± 1.055
0.0AsnXaa: 0.0 ± 0.0
Pro
1.463ProAla: 1.463 ± 0.624
0.0ProCys: 0.0 ± 0.0
1.829ProAsp: 1.829 ± 0.6
1.829ProGlu: 1.829 ± 0.961
0.732ProPhe: 0.732 ± 0.467
0.366ProGly: 0.366 ± 0.341
0.0ProHis: 0.0 ± 0.0
0.732ProIle: 0.732 ± 0.586
3.292ProLys: 3.292 ± 1.176
3.292ProLeu: 3.292 ± 1.253
0.0ProMet: 0.0 ± 0.0
0.366ProAsn: 0.366 ± 0.448
0.366ProPro: 0.366 ± 0.347
2.195ProGln: 2.195 ± 0.567
1.463ProArg: 1.463 ± 0.681
1.829ProSer: 1.829 ± 0.603
2.926ProThr: 2.926 ± 0.798
1.463ProVal: 1.463 ± 0.599
0.0ProTrp: 0.0 ± 0.0
1.097ProTyr: 1.097 ± 0.677
0.0ProXaa: 0.0 ± 0.0
Gln
3.292GlnAla: 3.292 ± 1.373
0.0GlnCys: 0.0 ± 0.0
2.926GlnAsp: 2.926 ± 0.947
4.389GlnGlu: 4.389 ± 0.854
0.732GlnPhe: 0.732 ± 0.449
3.292GlnGly: 3.292 ± 0.893
0.732GlnHis: 0.732 ± 0.495
4.023GlnIle: 4.023 ± 1.022
3.292GlnLys: 3.292 ± 1.011
6.584GlnLeu: 6.584 ± 1.232
0.732GlnMet: 0.732 ± 0.493
3.292GlnAsn: 3.292 ± 0.953
1.829GlnPro: 1.829 ± 1.096
3.292GlnGln: 3.292 ± 0.998
3.292GlnArg: 3.292 ± 0.681
2.195GlnSer: 2.195 ± 0.785
1.829GlnThr: 1.829 ± 0.648
3.292GlnVal: 3.292 ± 1.087
0.0GlnTrp: 0.0 ± 0.0
0.732GlnTyr: 0.732 ± 0.623
0.0GlnXaa: 0.0 ± 0.0
Arg
3.292ArgAla: 3.292 ± 1.1
0.366ArgCys: 0.366 ± 0.311
2.926ArgAsp: 2.926 ± 0.783
5.486ArgGlu: 5.486 ± 1.34
1.097ArgPhe: 1.097 ± 0.781
2.195ArgGly: 2.195 ± 1.058
0.732ArgHis: 0.732 ± 0.414
3.292ArgIle: 3.292 ± 0.867
2.56ArgLys: 2.56 ± 0.836
3.658ArgLeu: 3.658 ± 1.197
0.732ArgMet: 0.732 ± 0.485
2.926ArgAsn: 2.926 ± 1.168
1.097ArgPro: 1.097 ± 0.639
4.755ArgGln: 4.755 ± 1.455
1.829ArgArg: 1.829 ± 0.797
2.926ArgSer: 2.926 ± 1.213
1.463ArgThr: 1.463 ± 0.95
1.097ArgVal: 1.097 ± 0.643
0.732ArgTrp: 0.732 ± 0.515
3.658ArgTyr: 3.658 ± 0.783
0.0ArgXaa: 0.0 ± 0.0
Ser
1.829SerAla: 1.829 ± 0.762
0.0SerCys: 0.0 ± 0.0
3.658SerAsp: 3.658 ± 0.745
4.755SerGlu: 4.755 ± 1.054
0.732SerPhe: 0.732 ± 0.432
1.829SerGly: 1.829 ± 0.492
1.097SerHis: 1.097 ± 0.583
5.486SerIle: 5.486 ± 0.847
2.926SerLys: 2.926 ± 0.907
5.121SerLeu: 5.121 ± 1.016
1.463SerMet: 1.463 ± 0.663
2.195SerAsn: 2.195 ± 0.741
1.463SerPro: 1.463 ± 0.696
1.829SerGln: 1.829 ± 0.998
2.195SerArg: 2.195 ± 0.752
2.195SerSer: 2.195 ± 1.173
5.121SerThr: 5.121 ± 1.12
3.658SerVal: 3.658 ± 1.008
0.366SerTrp: 0.366 ± 0.334
2.56SerTyr: 2.56 ± 0.925
0.0SerXaa: 0.0 ± 0.0
Thr
2.926ThrAla: 2.926 ± 0.951
0.366ThrCys: 0.366 ± 0.341
4.023ThrAsp: 4.023 ± 1.173
4.023ThrGlu: 4.023 ± 1.216
2.56ThrPhe: 2.56 ± 0.713
4.389ThrGly: 4.389 ± 1.162
1.097ThrHis: 1.097 ± 0.677
5.486ThrIle: 5.486 ± 1.177
5.121ThrLys: 5.121 ± 1.09
6.95ThrLeu: 6.95 ± 1.501
2.56ThrMet: 2.56 ± 1.409
2.195ThrAsn: 2.195 ± 0.583
1.097ThrPro: 1.097 ± 0.475
2.926ThrGln: 2.926 ± 1.24
1.829ThrArg: 1.829 ± 0.939
1.829ThrSer: 1.829 ± 0.539
1.829ThrThr: 1.829 ± 0.987
4.755ThrVal: 4.755 ± 1.466
0.366ThrTrp: 0.366 ± 0.335
2.56ThrTyr: 2.56 ± 0.907
0.0ThrXaa: 0.0 ± 0.0
Val
1.829ValAla: 1.829 ± 0.665
0.366ValCys: 0.366 ± 0.307
1.463ValAsp: 1.463 ± 0.602
2.56ValGlu: 2.56 ± 1.048
2.926ValPhe: 2.926 ± 1.153
1.463ValGly: 1.463 ± 0.651
0.366ValHis: 0.366 ± 0.369
2.926ValIle: 2.926 ± 0.934
6.218ValLys: 6.218 ± 1.54
5.121ValLeu: 5.121 ± 1.421
0.366ValMet: 0.366 ± 0.401
2.195ValAsn: 2.195 ± 0.819
1.463ValPro: 1.463 ± 0.693
1.097ValGln: 1.097 ± 0.692
2.195ValArg: 2.195 ± 0.942
2.56ValSer: 2.56 ± 0.809
4.023ValThr: 4.023 ± 1.495
1.097ValVal: 1.097 ± 0.731
0.366ValTrp: 0.366 ± 0.312
3.658ValTyr: 3.658 ± 0.827
0.0ValXaa: 0.0 ± 0.0
Trp
1.097TrpAla: 1.097 ± 0.616
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.463TrpGlu: 1.463 ± 0.69
0.366TrpPhe: 0.366 ± 0.36
0.366TrpGly: 0.366 ± 0.369
0.0TrpHis: 0.0 ± 0.0
0.732TrpIle: 0.732 ± 0.461
0.366TrpLys: 0.366 ± 0.334
2.926TrpLeu: 2.926 ± 0.644
0.732TrpMet: 0.732 ± 0.468
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.732TrpGln: 0.732 ± 0.375
0.0TrpArg: 0.0 ± 0.0
0.732TrpSer: 0.732 ± 0.721
0.732TrpThr: 0.732 ± 0.375
0.732TrpVal: 0.732 ± 0.571
0.0TrpTrp: 0.0 ± 0.0
0.366TrpTyr: 0.366 ± 0.426
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.195TyrAla: 2.195 ± 0.853
0.366TyrCys: 0.366 ± 0.36
0.366TyrAsp: 0.366 ± 0.369
1.829TyrGlu: 1.829 ± 0.744
3.658TyrPhe: 3.658 ± 1.292
1.463TyrGly: 1.463 ± 0.503
1.463TyrHis: 1.463 ± 0.536
3.292TyrIle: 3.292 ± 0.816
4.755TyrLys: 4.755 ± 1.228
4.755TyrLeu: 4.755 ± 1.235
0.732TyrMet: 0.732 ± 0.475
1.829TyrAsn: 1.829 ± 0.598
1.097TyrPro: 1.097 ± 0.542
3.292TyrGln: 3.292 ± 1.139
2.926TyrArg: 2.926 ± 0.869
3.292TyrSer: 3.292 ± 1.125
2.56TyrThr: 2.56 ± 0.976
1.463TyrVal: 1.463 ± 0.616
0.732TyrTrp: 0.732 ± 0.476
1.097TyrTyr: 1.097 ± 0.533
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (2735 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski