Amino acid dipeptide frequency for Streptococcus satellite phage Javan482

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
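As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides within each protein and compares every frequency with the uniform expectation of 2.5‰. The function names, the toy sequences, and the choice to normalise over the 400 standard dipeptides only are assumptions made for this example; the page does not state how the database itself normalises the counts.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
EXPECTED = 1000.0 / 400  # uniform expectation: 2.5 per mille (0.25%)


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of all 400 standard dipeptides.

    Overlapping pairs are counted within each protein; a pair never spans
    two proteins. Pairs with a non-standard letter are ignored (assumption).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    pairs = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]
    total = sum(counts[p] for p in pairs)
    if total == 0:
        return {p: 0.0 for p in pairs}
    return {p: 1000.0 * counts[p] / total for p in pairs}


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"
    if freq < EXPECTED:
        return "under"
    return "expected"


# Made-up toy sequences, not taken from the Javan482 proteome.
toy = ["MKKLLE", "KKE"]
freqs = dipeptide_frequencies(toy)
print(round(freqs["KK"], 3), classify(freqs["KK"]))
```

With real data, the list of toy sequences would be replaced by the protein sequences of the proteome, and `classify` would correspond to the red and blue marking of the table.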

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
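The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical; the LeuLeu value is the one listed in the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


leu_leu = 10.947  # LeuLeu frequency from the table, in per mille
print(per_mille_to_fraction(leu_leu))
print(per_mille_to_percent(leu_leu))
```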

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.966AlaAla: 0.966 ± 0.496
0.322AlaCys: 0.322 ± 0.336
2.254AlaAsp: 2.254 ± 0.763
5.151AlaGlu: 5.151 ± 1.55
2.898AlaPhe: 2.898 ± 0.557
2.254AlaGly: 2.254 ± 0.696
0.322AlaHis: 0.322 ± 0.243
3.863AlaIle: 3.863 ± 1.284
6.117AlaLys: 6.117 ± 0.902
3.863AlaLeu: 3.863 ± 1.04
0.966AlaMet: 0.966 ± 0.483
2.254AlaAsn: 2.254 ± 0.66
1.288AlaPro: 1.288 ± 0.562
1.61AlaGln: 1.61 ± 0.428
2.576AlaArg: 2.576 ± 0.655
5.151AlaSer: 5.151 ± 1.114
3.22AlaThr: 3.22 ± 0.799
3.542AlaVal: 3.542 ± 1.117
0.322AlaTrp: 0.322 ± 0.243
1.61AlaTyr: 1.61 ± 0.511
0.0AlaXaa: 0.0 ± 0.0
Cys
0.966CysAla: 0.966 ± 0.556
0.0CysCys: 0.0 ± 0.0
0.966CysAsp: 0.966 ± 0.548
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.322CysIle: 0.322 ± 0.33
0.644CysLys: 0.644 ± 0.431
0.644CysLeu: 0.644 ± 0.458
0.0CysMet: 0.0 ± 0.0
0.322CysAsn: 0.322 ± 0.284
0.0CysPro: 0.0 ± 0.0
0.322CysGln: 0.322 ± 0.243
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.322CysVal: 0.322 ± 0.307
0.0CysTrp: 0.0 ± 0.0
0.644CysTyr: 0.644 ± 0.513
0.0CysXaa: 0.0 ± 0.0
Asp
2.576AspAla: 2.576 ± 0.714
0.966AspCys: 0.966 ± 0.506
1.61AspAsp: 1.61 ± 0.692
5.151AspGlu: 5.151 ± 1.719
1.932AspPhe: 1.932 ± 0.7
3.22AspGly: 3.22 ± 1.153
0.644AspHis: 0.644 ± 0.334
7.083AspIle: 7.083 ± 1.397
6.439AspLys: 6.439 ± 1.077
7.405AspLeu: 7.405 ± 0.986
1.288AspMet: 1.288 ± 0.655
1.288AspAsn: 1.288 ± 0.651
0.966AspPro: 0.966 ± 0.429
1.932AspGln: 1.932 ± 1.023
1.61AspArg: 1.61 ± 0.554
2.254AspSer: 2.254 ± 0.835
4.185AspThr: 4.185 ± 0.943
1.288AspVal: 1.288 ± 0.745
0.644AspTrp: 0.644 ± 0.392
5.795AspTyr: 5.795 ± 1.458
0.0AspXaa: 0.0 ± 0.0
Glu
7.083GluAla: 7.083 ± 1.27
0.966GluCys: 0.966 ± 0.658
6.761GluAsp: 6.761 ± 1.406
8.693GluGlu: 8.693 ± 2.376
2.254GluPhe: 2.254 ± 1.074
3.863GluGly: 3.863 ± 1.096
0.966GluHis: 0.966 ± 0.428
6.439GluIle: 6.439 ± 1.844
9.337GluLys: 9.337 ± 1.734
8.693GluLeu: 8.693 ± 1.33
1.288GluMet: 1.288 ± 0.611
3.542GluAsn: 3.542 ± 1.133
1.288GluPro: 1.288 ± 0.542
3.863GluGln: 3.863 ± 1.012
6.439GluArg: 6.439 ± 1.501
0.966GluSer: 0.966 ± 0.628
4.507GluThr: 4.507 ± 1.11
3.863GluVal: 3.863 ± 1.154
0.644GluTrp: 0.644 ± 0.491
5.795GluTyr: 5.795 ± 1.281
0.0GluXaa: 0.0 ± 0.0
Phe
2.254PheAla: 2.254 ± 0.845
0.322PheCys: 0.322 ± 0.381
3.22PheAsp: 3.22 ± 0.888
3.22PheGlu: 3.22 ± 1.227
1.288PhePhe: 1.288 ± 0.631
1.61PheGly: 1.61 ± 0.867
0.966PheHis: 0.966 ± 0.369
2.254PheIle: 2.254 ± 0.709
4.507PheLys: 4.507 ± 1.033
3.22PheLeu: 3.22 ± 1.012
1.288PheMet: 1.288 ± 0.658
4.507PheAsn: 4.507 ± 1.039
0.322PhePro: 0.322 ± 0.243
0.966PheGln: 0.966 ± 0.472
1.288PheArg: 1.288 ± 0.461
3.22PheSer: 3.22 ± 0.966
2.898PheThr: 2.898 ± 0.638
1.61PheVal: 1.61 ± 0.608
0.322PheTrp: 0.322 ± 0.243
2.254PheTyr: 2.254 ± 0.667
0.0PheXaa: 0.0 ± 0.0
Gly
2.254GlyAla: 2.254 ± 0.994
0.0GlyCys: 0.0 ± 0.0
4.507GlyAsp: 4.507 ± 1.108
3.22GlyGlu: 3.22 ± 0.77
2.254GlyPhe: 2.254 ± 0.909
1.61GlyGly: 1.61 ± 0.494
0.644GlyHis: 0.644 ± 0.671
2.576GlyIle: 2.576 ± 1.014
3.22GlyLys: 3.22 ± 0.669
4.185GlyLeu: 4.185 ± 1.249
1.61GlyMet: 1.61 ± 0.593
1.932GlyAsn: 1.932 ± 0.988
0.322GlyPro: 0.322 ± 0.287
2.898GlyGln: 2.898 ± 1.191
3.542GlyArg: 3.542 ± 1.132
1.288GlySer: 1.288 ± 0.584
2.576GlyThr: 2.576 ± 0.708
3.542GlyVal: 3.542 ± 0.947
1.288GlyTrp: 1.288 ± 0.617
2.898GlyTyr: 2.898 ± 1.274
0.0GlyXaa: 0.0 ± 0.0
His
2.254HisAla: 2.254 ± 1.195
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.644HisGlu: 0.644 ± 0.414
0.0HisPhe: 0.0 ± 0.0
0.644HisGly: 0.644 ± 0.37
0.644HisHis: 0.644 ± 0.359
0.966HisIle: 0.966 ± 0.52
1.61HisLys: 1.61 ± 0.679
1.932HisLeu: 1.932 ± 0.878
0.0HisMet: 0.0 ± 0.0
0.966HisAsn: 0.966 ± 0.634
0.644HisPro: 0.644 ± 0.371
1.61HisGln: 1.61 ± 0.745
0.644HisArg: 0.644 ± 0.461
0.322HisSer: 0.322 ± 0.32
1.288HisThr: 1.288 ± 0.431
0.322HisVal: 0.322 ± 0.381
0.322HisTrp: 0.322 ± 0.243
1.61HisTyr: 1.61 ± 0.81
0.0HisXaa: 0.0 ± 0.0
Ile
4.507IleAla: 4.507 ± 1.755
0.644IleCys: 0.644 ± 0.412
5.151IleAsp: 5.151 ± 1.648
9.015IleGlu: 9.015 ± 1.706
2.576IlePhe: 2.576 ± 0.894
3.22IleGly: 3.22 ± 0.729
0.0IleHis: 0.0 ± 0.0
4.507IleIle: 4.507 ± 1.594
9.659IleLys: 9.659 ± 1.727
4.185IleLeu: 4.185 ± 0.915
0.644IleMet: 0.644 ± 0.329
4.507IleAsn: 4.507 ± 0.833
1.932IlePro: 1.932 ± 0.782
3.22IleGln: 3.22 ± 0.892
3.542IleArg: 3.542 ± 0.907
3.22IleSer: 3.22 ± 1.037
6.761IleThr: 6.761 ± 1.096
1.288IleVal: 1.288 ± 0.449
0.0IleTrp: 0.0 ± 0.0
3.863IleTyr: 3.863 ± 1.105
0.0IleXaa: 0.0 ± 0.0
Lys
5.473LysAla: 5.473 ± 1.122
0.0LysCys: 0.0 ± 0.0
3.863LysAsp: 3.863 ± 1.014
9.981LysGlu: 9.981 ± 1.815
3.22LysPhe: 3.22 ± 1.188
4.507LysGly: 4.507 ± 1.197
3.542LysHis: 3.542 ± 0.772
8.049LysIle: 8.049 ± 1.194
10.303LysLys: 10.303 ± 2.479
7.727LysLeu: 7.727 ± 1.898
1.61LysMet: 1.61 ± 0.906
4.829LysAsn: 4.829 ± 0.841
5.473LysPro: 5.473 ± 1.413
4.507LysGln: 4.507 ± 0.999
5.473LysArg: 5.473 ± 1.261
4.829LysSer: 4.829 ± 1.307
6.761LysThr: 6.761 ± 1.894
4.829LysVal: 4.829 ± 1.128
0.644LysTrp: 0.644 ± 0.431
2.254LysTyr: 2.254 ± 0.667
0.0LysXaa: 0.0 ± 0.0
Leu
3.863LeuAla: 3.863 ± 1.416
0.0LeuCys: 0.0 ± 0.0
6.439LeuAsp: 6.439 ± 1.745
9.981LeuGlu: 9.981 ± 1.543
3.22LeuPhe: 3.22 ± 0.7
5.151LeuGly: 5.151 ± 1.432
1.61LeuHis: 1.61 ± 0.741
6.439LeuIle: 6.439 ± 1.635
8.371LeuLys: 8.371 ± 1.105
10.947LeuLeu: 10.947 ± 2.803
1.61LeuMet: 1.61 ± 0.758
4.185LeuAsn: 4.185 ± 1.094
4.507LeuPro: 4.507 ± 1.394
4.829LeuGln: 4.829 ± 1.091
2.576LeuArg: 2.576 ± 0.799
7.083LeuSer: 7.083 ± 2.303
4.507LeuThr: 4.507 ± 0.997
4.829LeuVal: 4.829 ± 1.472
0.322LeuTrp: 0.322 ± 0.336
4.829LeuTyr: 4.829 ± 1.017
0.0LeuXaa: 0.0 ± 0.0
Met
1.932MetAla: 1.932 ± 0.919
0.322MetCys: 0.322 ± 0.381
0.966MetAsp: 0.966 ± 0.566
0.966MetGlu: 0.966 ± 0.588
0.322MetPhe: 0.322 ± 0.284
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.966MetIle: 0.966 ± 0.586
2.898MetLys: 2.898 ± 0.895
2.254MetLeu: 2.254 ± 0.618
0.0MetMet: 0.0 ± 0.0
3.22MetAsn: 3.22 ± 0.844
0.322MetPro: 0.322 ± 0.336
0.644MetGln: 0.644 ± 0.457
0.966MetArg: 0.966 ± 0.437
0.644MetSer: 0.644 ± 0.471
1.932MetThr: 1.932 ± 0.576
0.322MetVal: 0.322 ± 0.292
0.0MetTrp: 0.0 ± 0.0
0.322MetTyr: 0.322 ± 0.322
0.0MetXaa: 0.0 ± 0.0
Asn
3.542AsnAla: 3.542 ± 1.185
0.0AsnCys: 0.0 ± 0.0
2.254AsnAsp: 2.254 ± 0.82
5.473AsnGlu: 5.473 ± 1.08
1.61AsnPhe: 1.61 ± 0.619
2.898AsnGly: 2.898 ± 0.728
1.288AsnHis: 1.288 ± 0.591
3.542AsnIle: 3.542 ± 1.193
5.151AsnLys: 5.151 ± 1.222
6.117AsnLeu: 6.117 ± 1.224
1.932AsnMet: 1.932 ± 0.796
1.932AsnAsn: 1.932 ± 0.992
1.61AsnPro: 1.61 ± 0.599
3.863AsnGln: 3.863 ± 1.322
4.185AsnArg: 4.185 ± 0.921
2.576AsnSer: 2.576 ± 0.834
1.932AsnThr: 1.932 ± 0.733
2.576AsnVal: 2.576 ± 0.779
0.322AsnTrp: 0.322 ± 0.322
2.576AsnTyr: 2.576 ± 0.565
0.0AsnXaa: 0.0 ± 0.0
Pro
0.966ProAla: 0.966 ± 0.417
0.0ProCys: 0.0 ± 0.0
2.254ProAsp: 2.254 ± 0.814
3.22ProGlu: 3.22 ± 0.989
1.932ProPhe: 1.932 ± 0.747
0.0ProGly: 0.0 ± 0.0
0.322ProHis: 0.322 ± 0.315
1.288ProIle: 1.288 ± 0.633
3.542ProLys: 3.542 ± 1.2
2.898ProLeu: 2.898 ± 0.924
0.0ProMet: 0.0 ± 0.0
2.254ProAsn: 2.254 ± 0.745
1.288ProPro: 1.288 ± 0.492
0.966ProGln: 0.966 ± 0.487
2.576ProArg: 2.576 ± 0.891
1.61ProSer: 1.61 ± 0.693
1.932ProThr: 1.932 ± 0.597
1.288ProVal: 1.288 ± 0.659
0.0ProTrp: 0.0 ± 0.0
0.966ProTyr: 0.966 ± 0.568
0.0ProXaa: 0.0 ± 0.0
Gln
1.288GlnAla: 1.288 ± 0.632
0.322GlnCys: 0.322 ± 0.284
2.898GlnAsp: 2.898 ± 0.998
4.185GlnGlu: 4.185 ± 1.31
3.542GlnPhe: 3.542 ± 0.953
1.932GlnGly: 1.932 ± 0.555
0.966GlnHis: 0.966 ± 0.416
2.898GlnIle: 2.898 ± 0.962
5.473GlnLys: 5.473 ± 1.273
4.829GlnLeu: 4.829 ± 0.903
0.644GlnMet: 0.644 ± 0.421
3.22GlnAsn: 3.22 ± 0.705
0.644GlnPro: 0.644 ± 0.411
2.898GlnGln: 2.898 ± 0.854
3.22GlnArg: 3.22 ± 0.947
2.898GlnSer: 2.898 ± 0.789
0.966GlnThr: 0.966 ± 0.716
2.254GlnVal: 2.254 ± 0.925
0.322GlnTrp: 0.322 ± 0.381
1.932GlnTyr: 1.932 ± 0.934
0.0GlnXaa: 0.0 ± 0.0
Arg
2.576ArgAla: 2.576 ± 0.752
0.322ArgCys: 0.322 ± 0.336
1.932ArgAsp: 1.932 ± 0.67
3.22ArgGlu: 3.22 ± 1.076
2.254ArgPhe: 2.254 ± 0.866
3.863ArgGly: 3.863 ± 1.026
1.61ArgHis: 1.61 ± 0.731
1.932ArgIle: 1.932 ± 0.65
5.795ArgLys: 5.795 ± 1.558
5.151ArgLeu: 5.151 ± 0.806
1.288ArgMet: 1.288 ± 0.598
2.254ArgAsn: 2.254 ± 0.786
0.644ArgPro: 0.644 ± 0.33
2.576ArgGln: 2.576 ± 0.913
2.576ArgArg: 2.576 ± 0.932
2.898ArgSer: 2.898 ± 0.972
3.542ArgThr: 3.542 ± 0.822
4.507ArgVal: 4.507 ± 1.168
0.644ArgTrp: 0.644 ± 0.437
3.542ArgTyr: 3.542 ± 1.157
0.0ArgXaa: 0.0 ± 0.0
Ser
1.288SerAla: 1.288 ± 0.518
0.0SerCys: 0.0 ± 0.0
4.185SerAsp: 4.185 ± 0.947
4.507SerGlu: 4.507 ± 0.806
2.898SerPhe: 2.898 ± 0.901
0.966SerGly: 0.966 ± 0.531
0.322SerHis: 0.322 ± 0.336
6.761SerIle: 6.761 ± 1.233
2.898SerLys: 2.898 ± 0.606
5.795SerLeu: 5.795 ± 1.173
1.288SerMet: 1.288 ± 0.6
3.22SerAsn: 3.22 ± 0.963
1.932SerPro: 1.932 ± 0.82
2.254SerGln: 2.254 ± 0.713
1.288SerArg: 1.288 ± 0.528
2.254SerSer: 2.254 ± 0.647
2.254SerThr: 2.254 ± 1.049
4.829SerVal: 4.829 ± 1.303
0.644SerTrp: 0.644 ± 0.479
2.898SerTyr: 2.898 ± 0.688
0.0SerXaa: 0.0 ± 0.0
Thr
3.542ThrAla: 3.542 ± 1.352
0.322ThrCys: 0.322 ± 0.381
2.898ThrAsp: 2.898 ± 1.207
4.829ThrGlu: 4.829 ± 1.383
2.898ThrPhe: 2.898 ± 0.956
5.151ThrGly: 5.151 ± 1.082
0.322ThrHis: 0.322 ± 0.336
4.185ThrIle: 4.185 ± 1.454
1.932ThrLys: 1.932 ± 0.644
4.507ThrLeu: 4.507 ± 1.065
0.966ThrMet: 0.966 ± 0.41
2.254ThrAsn: 2.254 ± 0.931
2.576ThrPro: 2.576 ± 0.89
2.898ThrGln: 2.898 ± 1.182
3.22ThrArg: 3.22 ± 1.094
4.185ThrSer: 4.185 ± 1.042
4.829ThrThr: 4.829 ± 1.358
4.185ThrVal: 4.185 ± 1.084
1.61ThrTrp: 1.61 ± 0.84
3.22ThrTyr: 3.22 ± 1.695
0.0ThrXaa: 0.0 ± 0.0
Val
1.932ValAla: 1.932 ± 0.792
0.0ValCys: 0.0 ± 0.0
1.932ValAsp: 1.932 ± 0.747
0.966ValGlu: 0.966 ± 0.579
3.542ValPhe: 3.542 ± 1.04
2.576ValGly: 2.576 ± 0.695
0.644ValHis: 0.644 ± 0.454
5.151ValIle: 5.151 ± 1.222
4.829ValLys: 4.829 ± 1.241
5.473ValLeu: 5.473 ± 1.523
1.932ValMet: 1.932 ± 0.661
2.898ValAsn: 2.898 ± 1.08
0.966ValPro: 0.966 ± 0.493
1.61ValGln: 1.61 ± 0.734
2.898ValArg: 2.898 ± 0.886
3.863ValSer: 3.863 ± 0.882
4.185ValThr: 4.185 ± 1.42
1.932ValVal: 1.932 ± 0.759
0.966ValTrp: 0.966 ± 0.459
1.288ValTyr: 1.288 ± 0.376
0.0ValXaa: 0.0 ± 0.0
Trp
0.644TrpAla: 0.644 ± 0.415
0.0TrpCys: 0.0 ± 0.0
0.322TrpAsp: 0.322 ± 0.335
0.966TrpGlu: 0.966 ± 0.529
0.0TrpPhe: 0.0 ± 0.0
0.322TrpGly: 0.322 ± 0.336
0.322TrpHis: 0.322 ± 0.381
0.966TrpIle: 0.966 ± 0.572
0.644TrpLys: 0.644 ± 0.486
1.61TrpLeu: 1.61 ± 0.679
0.0TrpMet: 0.0 ± 0.0
0.322TrpAsn: 0.322 ± 0.381
0.322TrpPro: 0.322 ± 0.243
0.644TrpGln: 0.644 ± 0.483
0.0TrpArg: 0.0 ± 0.0
0.966TrpSer: 0.966 ± 0.565
0.0TrpThr: 0.0 ± 0.0
0.322TrpVal: 0.322 ± 0.287
0.644TrpTrp: 0.644 ± 0.479
0.966TrpTyr: 0.966 ± 0.525
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.644TyrAla: 0.644 ± 0.405
0.644TyrCys: 0.644 ± 0.388
3.863TyrAsp: 3.863 ± 0.856
3.863TyrGlu: 3.863 ± 0.958
2.898TyrPhe: 2.898 ± 0.686
2.576TyrGly: 2.576 ± 0.904
1.288TyrHis: 1.288 ± 0.612
2.254TyrIle: 2.254 ± 0.7
4.829TyrLys: 4.829 ± 1.063
3.863TyrLeu: 3.863 ± 0.988
0.322TyrMet: 0.322 ± 0.278
5.473TyrAsn: 5.473 ± 0.948
2.254TyrPro: 2.254 ± 0.93
3.22TyrGln: 3.22 ± 1.149
4.507TyrArg: 4.507 ± 1.37
2.254TyrSer: 2.254 ± 0.999
2.254TyrThr: 2.254 ± 0.574
1.932TyrVal: 1.932 ± 0.615
0.322TyrTrp: 0.322 ± 0.315
2.898TyrTyr: 2.898 ± 0.81
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 21 proteins (3107 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
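One way such an error estimate could be obtained is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting values is reported. This is a minimal sketch under stated assumptions, not the database's actual procedure; the function names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the error are all choices made for this example.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each round draws len(proteins) whole proteins with replacement, so the
    resampling happens at the protein level, not at the residue level.
    The standard deviation over rounds is used as the error (assumption).
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences, not taken from the Javan482 proteome.
toy = ["MKKLLE", "KKE", "AAKK", "LLEE"]
print(pair_frequency(toy, "KK"), bootstrap_error(toy, "KK"))
```

A dipeptide that never occurs gets a frequency of 0 in every resample and therefore an error of 0, which matches the "0.0 ± 0.0" entries in the table.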

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski