Amino acid dipeptide frequency for Streptococcus satellite phage Javan429

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be expected at a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
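The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function names, the use of one-letter codes, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter codes for the 20 standard amino acids (Xaa is left out of this sketch).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a window never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def classify(freq_per_mille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences, not taken from the Javan429 proteome.
toy_proteins = ["MKELLKE", "MEKLE"]
freqs = dipeptide_per_mille(toy_proteins)
print(round(freqs["KE"], 1), classify(freqs["KE"]))
```

In this sketch "over" and "under" play the role of the red and blue marks; the page itself may apply a different threshold or take the error estimate into account.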

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.
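As a small worked example of the unit conversion, the sketch below turns two values from the table (GluLys at 11.49‰ and AlaAla at 0.741‰) into fractions and into a ratio against the uniform expectation of 1/400. The helper names are hypothetical.

```python
UNIFORM_EXPECTATION = 1.0 / 400  # 0.25 %, i.e. 2.5 per mille


def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value into a plain fraction."""
    return value_per_mille * 1e-3


def fold_vs_uniform(value_per_mille):
    """Ratio of the observed frequency to the uniform expectation."""
    return per_mille_to_fraction(value_per_mille) / UNIFORM_EXPECTATION


# Values copied from the table: GluLys and AlaAla.
for name, value in [("GluLys", 11.49), ("AlaAla", 0.741)]:
    print(name, per_mille_to_fraction(value), round(fold_vs_uniform(value), 2))
```

A ratio above 1 means the dipeptide is more frequent than the uniform expectation, a ratio below 1 less frequent.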

For more information, see the sequence space article on Wikipedia.

Entries are grouped by first residue; within each group the second residue follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry reads: dipeptide: frequency ± error (‰)
Ala
AlaAla: 0.741 ± 0.458
AlaCys: 0.0 ± 0.0
AlaAsp: 2.595 ± 0.884
AlaGlu: 5.189 ± 1.643
AlaPhe: 2.965 ± 1.01
AlaGly: 2.224 ± 0.613
AlaHis: 0.371 ± 0.422
AlaIle: 5.189 ± 1.204
AlaLys: 5.56 ± 1.063
AlaLeu: 5.189 ± 1.96
AlaMet: 1.483 ± 0.726
AlaAsn: 2.224 ± 0.816
AlaPro: 0.741 ± 0.462
AlaGln: 3.336 ± 0.942
AlaArg: 3.706 ± 1.183
AlaSer: 4.077 ± 1.033
AlaThr: 4.077 ± 1.13
AlaVal: 4.077 ± 0.769
AlaTrp: 0.371 ± 0.28
AlaTyr: 2.595 ± 0.793
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.741 ± 0.471
CysCys: 0.0 ± 0.0
CysAsp: 0.371 ± 0.341
CysGlu: 0.741 ± 0.399
CysPhe: 0.371 ± 0.389
CysGly: 0.0 ± 0.0
CysHis: 0.371 ± 0.373
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 1.112 ± 0.601
CysMet: 0.371 ± 0.384
CysAsn: 0.371 ± 0.391
CysPro: 0.371 ± 0.373
CysGln: 0.0 ± 0.0
CysArg: 0.371 ± 0.336
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.371 ± 0.28
CysTrp: 0.0 ± 0.0
CysTyr: 0.741 ± 0.384
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.483 ± 0.588
AspCys: 0.741 ± 0.533
AspAsp: 4.448 ± 1.413
AspGlu: 5.56 ± 1.308
AspPhe: 1.853 ± 0.629
AspGly: 2.965 ± 0.665
AspHis: 0.371 ± 0.28
AspIle: 6.672 ± 1.253
AspLys: 5.56 ± 1.44
AspLeu: 8.154 ± 2.473
AspMet: 1.853 ± 0.615
AspAsn: 3.706 ± 1.254
AspPro: 0.741 ± 0.56
AspGln: 1.483 ± 0.705
AspArg: 2.595 ± 0.953
AspSer: 1.853 ± 0.777
AspThr: 2.595 ± 0.934
AspVal: 2.224 ± 1.327
AspTrp: 0.371 ± 0.28
AspTyr: 4.818 ± 1.352
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.595 ± 0.89
GluCys: 1.112 ± 0.487
GluAsp: 4.448 ± 1.35
GluGlu: 6.301 ± 1.493
GluPhe: 2.595 ± 1.283
GluGly: 2.595 ± 0.918
GluHis: 1.483 ± 0.699
GluIle: 6.672 ± 0.859
GluLys: 11.49 ± 2.097
GluLeu: 8.895 ± 2.263
GluMet: 2.965 ± 1.004
GluAsn: 3.706 ± 1.486
GluPro: 2.224 ± 0.952
GluGln: 3.706 ± 0.849
GluArg: 4.448 ± 1.319
GluSer: 5.189 ± 1.359
GluThr: 5.56 ± 1.421
GluVal: 6.672 ± 1.789
GluTrp: 0.371 ± 0.342
GluTyr: 4.077 ± 1.338
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.853 ± 0.834
PheCys: 0.0 ± 0.0
PheAsp: 2.965 ± 1.055
PheGlu: 2.965 ± 1.194
PhePhe: 2.595 ± 0.901
PheGly: 2.965 ± 0.911
PheHis: 0.741 ± 0.496
PheIle: 2.595 ± 0.768
PheLys: 2.224 ± 0.786
PheLeu: 5.56 ± 1.25
PheMet: 0.0 ± 0.52
PheAsn: 1.112 ± 0.699
PhePro: 1.483 ± 0.552
PheGln: 0.741 ± 0.471
PheArg: 1.853 ± 0.594
PheSer: 3.336 ± 0.947
PheThr: 4.077 ± 1.415
PheVal: 2.595 ± 1.08
PheTrp: 0.371 ± 0.28
PheTyr: 1.853 ± 0.842
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.336 ± 1.326
GlyCys: 0.0 ± 0.0
GlyAsp: 3.336 ± 0.922
GlyGlu: 4.077 ± 1.332
GlyPhe: 2.224 ± 0.646
GlyGly: 1.112 ± 0.738
GlyHis: 1.483 ± 0.64
GlyIle: 4.077 ± 1.271
GlyLys: 2.965 ± 0.851
GlyLeu: 5.93 ± 1.559
GlyMet: 0.741 ± 0.497
GlyAsn: 2.224 ± 0.998
GlyPro: 0.0 ± 0.0
GlyGln: 1.483 ± 0.716
GlyArg: 3.336 ± 0.825
GlySer: 1.853 ± 0.709
GlyThr: 1.483 ± 0.501
GlyVal: 5.93 ± 1.342
GlyTrp: 1.853 ± 0.982
GlyTyr: 2.965 ± 0.586
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.741 ± 0.746
HisCys: 0.0 ± 0.0
HisAsp: 1.112 ± 0.538
HisGlu: 1.483 ± 0.695
HisPhe: 1.112 ± 0.604
HisGly: 2.224 ± 0.968
HisHis: 0.0 ± 0.0
HisIle: 0.741 ± 0.447
HisLys: 1.483 ± 0.81
HisLeu: 1.853 ± 1.086
HisMet: 0.371 ± 0.28
HisAsn: 1.112 ± 0.458
HisPro: 0.0 ± 0.0
HisGln: 0.371 ± 0.336
HisArg: 0.741 ± 0.507
HisSer: 0.741 ± 0.399
HisThr: 0.741 ± 0.746
HisVal: 0.741 ± 0.471
HisTrp: 0.0 ± 0.0
HisTyr: 1.483 ± 0.826
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.706 ± 1.225
IleCys: 0.371 ± 0.391
IleAsp: 2.595 ± 1.432
IleGlu: 5.56 ± 1.087
IlePhe: 3.336 ± 1.538
IleGly: 2.965 ± 1.264
IleHis: 0.371 ± 0.341
IleIle: 6.672 ± 1.024
IleLys: 6.672 ± 2.079
IleLeu: 6.672 ± 1.2
IleMet: 1.112 ± 0.56
IleAsn: 2.595 ± 0.524
IlePro: 3.706 ± 0.982
IleGln: 2.224 ± 0.686
IleArg: 3.706 ± 0.793
IleSer: 5.189 ± 1.612
IleThr: 3.336 ± 0.841
IleVal: 4.448 ± 0.789
IleTrp: 0.371 ± 0.422
IleTyr: 4.448 ± 1.254
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.413 ± 1.715
LysCys: 0.371 ± 0.389
LysAsp: 6.301 ± 1.603
LysGlu: 8.895 ± 1.836
LysPhe: 4.448 ± 1.042
LysGly: 5.93 ± 1.49
LysHis: 2.224 ± 1.154
LysIle: 7.042 ± 1.611
LysLys: 9.266 ± 1.899
LysLeu: 10.007 ± 1.359
LysMet: 1.853 ± 0.936
LysAsn: 7.413 ± 1.833
LysPro: 1.112 ± 0.524
LysGln: 4.077 ± 1.256
LysArg: 5.189 ± 0.963
LysSer: 5.189 ± 1.11
LysThr: 4.448 ± 1.69
LysVal: 3.336 ± 0.883
LysTrp: 1.112 ± 0.519
LysTyr: 2.965 ± 1.017
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.413 ± 1.812
LeuCys: 0.0 ± 0.0
LeuAsp: 7.413 ± 2.152
LeuGlu: 11.49 ± 1.62
LeuPhe: 4.077 ± 1.791
LeuGly: 4.448 ± 1.218
LeuHis: 1.483 ± 0.808
LeuIle: 4.818 ± 1.455
LeuLys: 10.378 ± 2.073
LeuLeu: 5.56 ± 1.356
LeuMet: 1.853 ± 1.007
LeuAsn: 7.042 ± 1.571
LeuPro: 4.448 ± 0.949
LeuGln: 2.965 ± 1.161
LeuArg: 4.448 ± 1.538
LeuSer: 4.077 ± 1.093
LeuThr: 7.784 ± 1.598
LeuVal: 3.706 ± 1.063
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.965 ± 1.018
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.112 ± 0.581
MetCys: 0.0 ± 0.0
MetAsp: 1.483 ± 0.564
MetGlu: 1.483 ± 0.79
MetPhe: 1.112 ± 0.663
MetGly: 1.112 ± 0.471
MetHis: 0.0 ± 0.0
MetIle: 1.853 ± 0.847
MetLys: 2.595 ± 0.673
MetLeu: 0.741 ± 0.43
MetMet: 0.741 ± 0.447
MetAsn: 1.853 ± 0.654
MetPro: 0.0 ± 0.0
MetGln: 1.112 ± 0.529
MetArg: 0.741 ± 0.531
MetSer: 1.483 ± 0.751
MetThr: 0.741 ± 0.623
MetVal: 2.224 ± 1.111
MetTrp: 0.0 ± 0.0
MetTyr: 1.112 ± 0.477
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.077 ± 1.285
AsnCys: 1.112 ± 0.412
AsnAsp: 3.336 ± 1.15
AsnGlu: 1.853 ± 0.633
AsnPhe: 1.112 ± 0.484
AsnGly: 5.189 ± 1.436
AsnHis: 2.224 ± 0.699
AsnIle: 2.224 ± 1.243
AsnLys: 5.93 ± 1.464
AsnLeu: 6.301 ± 1.554
AsnMet: 1.112 ± 0.688
AsnAsn: 0.371 ± 0.368
AsnPro: 2.224 ± 0.628
AsnGln: 3.706 ± 0.891
AsnArg: 2.224 ± 0.835
AsnSer: 2.595 ± 0.723
AsnThr: 3.706 ± 1.475
AsnVal: 1.112 ± 0.581
AsnTrp: 0.741 ± 0.399
AsnTyr: 2.595 ± 1.08
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.112 ± 0.821
ProCys: 0.0 ± 0.0
ProAsp: 2.965 ± 0.831
ProGlu: 3.706 ± 1.321
ProPhe: 1.483 ± 0.519
ProGly: 0.0 ± 0.0
ProHis: 0.0 ± 0.0
ProIle: 0.741 ± 0.525
ProLys: 1.853 ± 0.652
ProLeu: 1.483 ± 0.733
ProMet: 0.741 ± 0.386
ProAsn: 1.483 ± 0.785
ProPro: 2.595 ± 1.086
ProGln: 1.483 ± 0.57
ProArg: 1.483 ± 0.864
ProSer: 0.371 ± 0.33
ProThr: 1.483 ± 0.663
ProVal: 1.483 ± 0.749
ProTrp: 0.0 ± 0.0
ProTyr: 1.483 ± 0.655
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.336 ± 0.742
GlnCys: 0.0 ± 0.0
GlnAsp: 1.853 ± 0.905
GlnGlu: 4.818 ± 1.513
GlnPhe: 1.112 ± 0.581
GlnGly: 2.224 ± 0.86
GlnHis: 1.483 ± 1.031
GlnIle: 3.706 ± 1.313
GlnLys: 4.818 ± 1.448
GlnLeu: 4.077 ± 1.268
GlnMet: 0.371 ± 0.342
GlnAsn: 1.853 ± 0.782
GlnPro: 0.371 ± 0.422
GlnGln: 2.224 ± 0.902
GlnArg: 1.483 ± 0.668
GlnSer: 2.965 ± 0.935
GlnThr: 1.853 ± 0.621
GlnVal: 3.706 ± 1.072
GlnTrp: 0.741 ± 0.523
GlnTyr: 1.483 ± 0.574
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.448 ± 1.466
ArgCys: 0.0 ± 0.0
ArgAsp: 3.706 ± 0.74
ArgGlu: 4.818 ± 1.182
ArgPhe: 1.483 ± 0.623
ArgGly: 2.965 ± 1.221
ArgHis: 1.112 ± 0.412
ArgIle: 2.224 ± 0.848
ArgLys: 3.706 ± 1.064
ArgLeu: 4.818 ± 1.607
ArgMet: 1.853 ± 0.772
ArgAsn: 1.483 ± 0.621
ArgPro: 1.112 ± 0.581
ArgGln: 3.336 ± 0.941
ArgArg: 2.224 ± 0.898
ArgSer: 1.483 ± 0.889
ArgThr: 2.595 ± 0.578
ArgVal: 2.595 ± 0.764
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.965 ± 1.092
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.595 ± 0.992
SerCys: 1.112 ± 0.471
SerAsp: 2.224 ± 0.867
SerGlu: 5.189 ± 1.042
SerPhe: 2.595 ± 0.707
SerGly: 2.224 ± 0.561
SerHis: 0.741 ± 0.533
SerIle: 5.189 ± 1.649
SerLys: 5.189 ± 1.393
SerLeu: 4.818 ± 0.636
SerMet: 2.224 ± 0.699
SerAsn: 2.595 ± 0.663
SerPro: 1.483 ± 0.522
SerGln: 2.965 ± 0.89
SerArg: 2.595 ± 0.974
SerSer: 1.853 ± 1.076
SerThr: 1.853 ± 0.631
SerVal: 1.853 ± 0.72
SerTrp: 0.0 ± 0.0
SerTyr: 2.224 ± 0.843
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.965 ± 1.062
ThrCys: 0.371 ± 0.384
ThrAsp: 2.595 ± 1.22
ThrGlu: 2.965 ± 1.107
ThrPhe: 2.224 ± 0.742
ThrGly: 3.706 ± 1.496
ThrHis: 1.112 ± 0.519
ThrIle: 4.077 ± 1.583
ThrLys: 5.93 ± 2.085
ThrLeu: 4.077 ± 1.412
ThrMet: 0.741 ± 0.583
ThrAsn: 2.965 ± 1.315
ThrPro: 1.483 ± 0.566
ThrGln: 4.077 ± 1.146
ThrArg: 2.224 ± 0.735
ThrSer: 4.077 ± 1.296
ThrThr: 1.853 ± 0.721
ThrVal: 4.077 ± 1.809
ThrTrp: 0.371 ± 0.341
ThrTyr: 3.336 ± 2.141
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.189 ± 1.699
ValCys: 0.371 ± 0.341
ValAsp: 2.965 ± 1.105
ValGlu: 5.189 ± 1.601
ValPhe: 2.965 ± 0.81
ValGly: 1.853 ± 0.709
ValHis: 0.741 ± 0.386
ValIle: 2.965 ± 1.192
ValLys: 6.301 ± 1.159
ValLeu: 4.077 ± 0.865
ValMet: 0.741 ± 0.431
ValAsn: 5.93 ± 1.028
ValPro: 0.741 ± 0.5
ValGln: 1.483 ± 0.477
ValArg: 2.224 ± 0.91
ValSer: 2.224 ± 1.203
ValThr: 3.706 ± 0.786
ValVal: 5.189 ± 1.491
ValTrp: 0.371 ± 0.391
ValTyr: 2.595 ± 0.825
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.371 ± 0.373
TrpCys: 0.371 ± 0.28
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.112 ± 0.617
TrpPhe: 0.371 ± 0.341
TrpGly: 0.371 ± 0.28
TrpHis: 0.0 ± 0.0
TrpIle: 0.741 ± 0.674
TrpLys: 0.741 ± 0.56
TrpLeu: 0.741 ± 0.445
TrpMet: 0.0 ± 0.0
TrpAsn: 0.371 ± 0.341
TrpPro: 0.0 ± 0.0
TrpGln: 0.371 ± 0.366
TrpArg: 0.371 ± 0.28
TrpSer: 0.371 ± 0.373
TrpThr: 0.741 ± 0.462
TrpVal: 0.371 ± 0.342
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.483 ± 0.792
TyrCys: 0.371 ± 0.373
TyrAsp: 3.336 ± 0.889
TyrGlu: 3.706 ± 1.214
TyrPhe: 2.224 ± 1.128
TyrGly: 3.336 ± 1.17
TyrHis: 0.741 ± 0.386
TyrIle: 1.853 ± 0.747
TyrLys: 5.93 ± 1.836
TyrLeu: 6.301 ± 1.625
TyrMet: 0.0 ± 0.0
TyrAsn: 3.336 ± 0.708
TyrPro: 1.112 ± 0.579
TyrGln: 3.336 ± 1.103
TyrArg: 2.965 ± 1.122
TyrSer: 2.595 ± 0.849
TyrThr: 2.595 ± 0.83
TyrVal: 1.112 ± 0.838
TyrTrp: 0.371 ± 0.342
TyrTyr: 2.224 ± 0.841
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 16 proteins (2699 amino acids)
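With only 2699 residues, each table value corresponds to a handful of observed dipeptides. The tabulated values look like integer multiples of roughly 0.371‰, which would be consistent with a denominator of 2698 dipeptide positions (2699 minus 1). That denominator is an inference from the numbers, not something the source states, so the sketch below treats it as an assumption.

```python
TOTAL_RESIDUES = 2699                    # from the statistics line
ASSUMED_POSITIONS = TOTAL_RESIDUES - 1   # assumption: not stated in the source


def approx_count(value_per_mille, positions=ASSUMED_POSITIONS):
    """Estimate how many occurrences a per-mille value corresponds to."""
    return round(value_per_mille * positions / 1000.0)


# Values copied from the table.
for name, value in [("AlaHis", 0.371), ("AlaAla", 0.741), ("GluLys", 11.49)]:
    print(name, approx_count(value))
```

Under this assumption a value of 0.371‰ is a single occurrence, which is why many entries carry an error of similar size to the value itself.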

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
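A protein-level bootstrap can be sketched as follows: resample whole proteins with replacement, recompute the dipeptide frequency for each resample, and report the spread of those estimates. This is a generic illustration; the exact procedure used by Proteome-pI (for example, which spread statistic it reports) is not given here, and the function names, the seed, and the toy sequences are assumptions.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the Javan429 proteome.
toy_proteins = ["MKELLKE", "MEKLE", "MAAKEL"]
print(bootstrap_error(toy_proteins, "KE"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a single protein contribute a correspondingly larger error.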

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski