Amino acid dipeptide frequency for Streptococcus satellite phage Javan76

In addition to single amino acid frequencies, one can also calculate amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each one would occur at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
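The counting and the uniform baseline described above can be sketched in Python. This is only an illustration, not the Proteome-pI pipeline: the function name `dipeptide_frequencies`, the one-letter input format, the mapping of every non-standard letter to `X`, and the choice to count overlapping pairs within each protein are all assumptions, and the input sequences are toy strings rather than Javan76 proteins.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumptions: pairs are counted within each protein only, and any
    non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation in per mille: 1/400 without Xaa, 1/441 with Xaa.
EXPECTED_20 = 1000.0 / 20 ** 2
EXPECTED_21 = 1000.0 / 21 ** 2

# Toy input (hypothetical sequences, not real Javan76 proteins).
freqs = dipeptide_frequencies(["MKLLKE", "MLKB"])
over = sorted(p for p, f in freqs.items() if f > EXPECTED_20)
under = sorted(p for p, f in freqs.items() if f < EXPECTED_20)
```

With real proteomes, dipeptides in `over` would correspond to the red cells and those in `under` (plus any pair that never occurs) to the blue cells.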

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
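As a small worked example of this unit conversion, the snippet below turns one table entry (LeuLys, 13.064‰) into a fraction and a percentage and compares it with the 0.25% uniform expectation. The helper name `per_mille_to_fraction` is hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


leu_lys = per_mille_to_fraction(13.064)  # LeuLys entry from the table
uniform = per_mille_to_fraction(2.5)     # 0.25% uniform expectation
ratio = leu_lys / uniform                # fold difference vs. the uniform baseline

print(f"fraction={leu_lys:.6f}  percent={leu_lys * 100:.4f}%  ratio={ratio:.2f}")
```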

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.188 ± 0.864
AlaCys: 0.396 ± 0.326
AlaAsp: 4.355 ± 1.256
AlaGlu: 2.771 ± 1.28
AlaPhe: 2.375 ± 0.897
AlaGly: 3.959 ± 0.687
AlaHis: 0.792 ± 0.598
AlaIle: 4.355 ± 1.075
AlaLys: 4.751 ± 1.284
AlaLeu: 8.314 ± 1.501
AlaMet: 0.792 ± 0.54
AlaAsn: 1.979 ± 0.918
AlaPro: 2.375 ± 0.775
AlaGln: 3.167 ± 1.029
AlaArg: 2.375 ± 0.731
AlaSer: 4.751 ± 1.5
AlaThr: 3.563 ± 2.171
AlaVal: 1.979 ± 0.716
AlaTrp: 0.396 ± 0.406
AlaTyr: 2.375 ± 0.937
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.396 ± 0.41
CysCys: 0.0 ± 0.0
CysAsp: 0.396 ± 0.398
CysGlu: 0.396 ± 0.41
CysPhe: 0.396 ± 0.326
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.188 ± 0.75
CysLys: 0.0 ± 0.0
CysLeu: 0.396 ± 0.326
CysMet: 0.0 ± 0.0
CysAsn: 0.792 ± 0.652
CysPro: 0.396 ± 0.387
CysGln: 0.0 ± 0.0
CysArg: 0.396 ± 0.387
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.396 ± 0.5
CysTyr: 0.396 ± 0.326
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.188 ± 0.666
AspCys: 0.396 ± 0.41
AspAsp: 1.188 ± 0.584
AspGlu: 4.355 ± 1.534
AspPhe: 5.938 ± 1.109
AspGly: 1.188 ± 0.666
AspHis: 0.396 ± 0.325
AspIle: 4.355 ± 1.005
AspLys: 7.918 ± 1.791
AspLeu: 3.959 ± 1.253
AspMet: 1.979 ± 1.066
AspAsn: 4.355 ± 1.934
AspPro: 1.188 ± 0.566
AspGln: 0.792 ± 0.445
AspArg: 0.792 ± 0.445
AspSer: 1.584 ± 0.637
AspThr: 4.751 ± 1.811
AspVal: 3.563 ± 1.339
AspTrp: 0.396 ± 0.326
AspTyr: 3.959 ± 1.98
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.938 ± 1.547
GluCys: 0.396 ± 0.41
GluAsp: 3.563 ± 1.021
GluGlu: 4.751 ± 1.735
GluPhe: 1.979 ± 0.768
GluGly: 2.375 ± 1.367
GluHis: 1.584 ± 1.037
GluIle: 6.334 ± 1.26
GluLys: 10.689 ± 1.184
GluLeu: 10.689 ± 1.985
GluMet: 2.375 ± 0.923
GluAsn: 3.167 ± 1.019
GluPro: 1.979 ± 0.908
GluGln: 1.979 ± 1.007
GluArg: 3.959 ± 1.737
GluSer: 3.959 ± 1.369
GluThr: 2.771 ± 1.014
GluVal: 4.355 ± 1.151
GluTrp: 1.188 ± 0.677
GluTyr: 3.167 ± 1.266
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.188 ± 0.563
PheCys: 0.792 ± 0.548
PheAsp: 3.167 ± 1.514
PheGlu: 3.563 ± 1.862
PhePhe: 1.979 ± 1.141
PheGly: 2.771 ± 1.156
PheHis: 0.792 ± 0.529
PheIle: 3.167 ± 1.195
PheLys: 3.563 ± 1.578
PheLeu: 4.355 ± 1.834
PheMet: 0.396 ± 0.383
PheAsn: 2.771 ± 0.95
PhePro: 0.792 ± 0.531
PheGln: 1.188 ± 0.684
PheArg: 1.979 ± 0.854
PheSer: 2.771 ± 1.066
PheThr: 2.771 ± 0.964
PheVal: 1.979 ± 0.783
PheTrp: 0.0 ± 0.0
PheTyr: 1.979 ± 0.957
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.584 ± 0.685
GlyCys: 0.396 ± 0.387
GlyAsp: 2.771 ± 1.274
GlyGlu: 3.167 ± 1.067
GlyPhe: 1.584 ± 0.989
GlyGly: 1.188 ± 0.642
GlyHis: 1.584 ± 0.51
GlyIle: 3.959 ± 1.722
GlyLys: 4.751 ± 1.259
GlyLeu: 3.959 ± 1.669
GlyMet: 1.584 ± 1.001
GlyAsn: 1.584 ± 0.985
GlyPro: 0.0 ± 0.0
GlyGln: 1.584 ± 0.962
GlyArg: 3.167 ± 0.531
GlySer: 1.188 ± 0.438
GlyThr: 3.563 ± 1.075
GlyVal: 2.771 ± 1.264
GlyTrp: 0.792 ± 0.462
GlyTyr: 2.375 ± 0.864
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.771 ± 1.231
HisCys: 0.0 ± 0.0
HisAsp: 0.396 ± 0.326
HisGlu: 2.375 ± 1.037
HisPhe: 1.188 ± 0.565
HisGly: 0.792 ± 0.462
HisHis: 0.0 ± 0.0
HisIle: 0.396 ± 0.41
HisLys: 0.792 ± 0.545
HisLeu: 2.375 ± 1.114
HisMet: 0.0 ± 0.0
HisAsn: 1.188 ± 0.638
HisPro: 0.0 ± 0.0
HisGln: 0.792 ± 0.407
HisArg: 0.792 ± 0.82
HisSer: 0.792 ± 0.518
HisThr: 0.792 ± 0.494
HisVal: 1.584 ± 0.799
HisTrp: 0.396 ± 0.41
HisTyr: 0.792 ± 0.612
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.751 ± 0.98
IleCys: 0.0 ± 0.0
IleAsp: 2.771 ± 1.229
IleGlu: 5.938 ± 1.661
IlePhe: 1.979 ± 0.792
IleGly: 3.563 ± 1.19
IleHis: 0.396 ± 0.398
IleIle: 4.751 ± 1.826
IleLys: 8.709 ± 1.666
IleLeu: 8.709 ± 1.834
IleMet: 0.792 ± 0.452
IleAsn: 5.542 ± 1.584
IlePro: 3.167 ± 1.241
IleGln: 2.375 ± 0.949
IleArg: 1.979 ± 0.718
IleSer: 5.146 ± 1.255
IleThr: 3.959 ± 1.394
IleVal: 3.563 ± 1.18
IleTrp: 0.0 ± 0.0
IleTyr: 4.355 ± 1.219
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.73 ± 1.706
LysCys: 0.396 ± 0.326
LysAsp: 6.334 ± 1.629
LysGlu: 12.668 ± 2.224
LysPhe: 3.167 ± 0.656
LysGly: 4.355 ± 1.358
LysHis: 2.771 ± 1.288
LysIle: 7.918 ± 1.711
LysLys: 10.689 ± 1.577
LysLeu: 7.918 ± 1.665
LysMet: 1.979 ± 1.055
LysAsn: 6.334 ± 1.422
LysPro: 3.959 ± 1.1
LysGln: 1.979 ± 0.657
LysArg: 7.126 ± 1.495
LysSer: 5.542 ± 1.154
LysThr: 7.522 ± 0.855
LysVal: 4.751 ± 0.914
LysTrp: 1.188 ± 0.675
LysTyr: 3.167 ± 0.895
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.105 ± 2.696
LeuCys: 0.792 ± 0.652
LeuAsp: 7.522 ± 1.27
LeuGlu: 8.709 ± 1.519
LeuPhe: 3.959 ± 1.392
LeuGly: 5.146 ± 1.682
LeuHis: 2.375 ± 0.997
LeuIle: 6.334 ± 1.253
LeuLys: 13.064 ± 2.007
LeuLeu: 11.085 ± 1.582
LeuMet: 2.771 ± 0.755
LeuAsn: 4.355 ± 0.942
LeuPro: 4.751 ± 0.929
LeuGln: 4.751 ± 1.409
LeuArg: 3.563 ± 0.752
LeuSer: 7.918 ± 1.349
LeuThr: 5.542 ± 1.777
LeuVal: 2.375 ± 1.35
LeuTrp: 1.584 ± 1.043
LeuTyr: 4.355 ± 1.862
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.771 ± 0.671
MetCys: 0.0 ± 0.0
MetAsp: 1.979 ± 0.801
MetGlu: 1.979 ± 0.795
MetPhe: 0.396 ± 0.398
MetGly: 0.0 ± 0.0
MetHis: 0.396 ± 0.406
MetIle: 1.584 ± 0.58
MetLys: 3.167 ± 1.232
MetLeu: 1.584 ± 1.0
MetMet: 1.188 ± 0.865
MetAsn: 1.979 ± 0.808
MetPro: 0.0 ± 0.0
MetGln: 0.792 ± 0.493
MetArg: 0.396 ± 0.41
MetSer: 0.792 ± 0.451
MetThr: 1.584 ± 0.762
MetVal: 0.792 ± 0.54
MetTrp: 0.396 ± 0.41
MetTyr: 0.792 ± 0.645
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.375 ± 0.82
AsnCys: 0.792 ± 0.603
AsnAsp: 3.959 ± 0.876
AsnGlu: 3.563 ± 1.221
AsnPhe: 1.584 ± 0.897
AsnGly: 5.542 ± 1.612
AsnHis: 0.792 ± 0.536
AsnIle: 3.563 ± 0.929
AsnLys: 4.355 ± 1.035
AsnLeu: 5.542 ± 2.045
AsnMet: 1.584 ± 0.936
AsnAsn: 3.959 ± 1.238
AsnPro: 3.959 ± 1.059
AsnGln: 2.771 ± 0.95
AsnArg: 3.167 ± 1.201
AsnSer: 2.771 ± 1.585
AsnThr: 2.375 ± 0.932
AsnVal: 1.979 ± 0.876
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.792 ± 0.445
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.584 ± 0.741
ProCys: 0.0 ± 0.0
ProAsp: 0.0 ± 0.0
ProGlu: 2.375 ± 0.719
ProPhe: 1.979 ± 0.736
ProGly: 0.0 ± 0.0
ProHis: 0.396 ± 0.528
ProIle: 2.375 ± 1.044
ProLys: 3.959 ± 1.121
ProLeu: 3.563 ± 1.153
ProMet: 0.792 ± 0.452
ProAsn: 2.375 ± 1.212
ProPro: 1.188 ± 0.66
ProGln: 1.584 ± 0.965
ProArg: 1.584 ± 0.66
ProSer: 1.979 ± 0.626
ProThr: 1.979 ± 0.78
ProVal: 1.584 ± 0.54
ProTrp: 0.0 ± 0.0
ProTyr: 0.792 ± 0.456
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.584 ± 0.799
GlnCys: 0.0 ± 0.0
GlnAsp: 1.979 ± 0.972
GlnGlu: 3.959 ± 0.767
GlnPhe: 0.792 ± 0.456
GlnGly: 2.375 ± 0.908
GlnHis: 0.396 ± 0.326
GlnIle: 2.771 ± 0.732
GlnLys: 3.167 ± 1.184
GlnLeu: 6.334 ± 1.314
GlnMet: 0.792 ± 0.463
GlnAsn: 1.584 ± 0.481
GlnPro: 0.792 ± 0.455
GlnGln: 3.167 ± 0.761
GlnArg: 2.375 ± 0.98
GlnSer: 1.188 ± 0.686
GlnThr: 2.375 ± 0.669
GlnVal: 2.375 ± 1.088
GlnTrp: 0.396 ± 0.325
GlnTyr: 1.584 ± 0.719
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.355 ± 1.106
ArgCys: 0.0 ± 0.0
ArgAsp: 1.979 ± 0.881
ArgGlu: 4.355 ± 0.924
ArgPhe: 1.979 ± 0.838
ArgGly: 2.375 ± 0.958
ArgHis: 1.979 ± 0.837
ArgIle: 3.167 ± 1.131
ArgLys: 3.959 ± 1.124
ArgLeu: 4.355 ± 1.528
ArgMet: 0.396 ± 0.387
ArgAsn: 1.584 ± 0.93
ArgPro: 0.792 ± 0.455
ArgGln: 3.959 ± 1.155
ArgArg: 3.167 ± 1.236
ArgSer: 1.188 ± 0.438
ArgThr: 2.375 ± 0.698
ArgVal: 2.375 ± 1.145
ArgTrp: 0.396 ± 0.396
ArgTyr: 2.375 ± 1.122
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.584 ± 0.592
SerCys: 0.396 ± 0.398
SerAsp: 3.959 ± 0.766
SerGlu: 1.979 ± 0.708
SerPhe: 1.979 ± 1.049
SerGly: 1.979 ± 0.789
SerHis: 0.792 ± 0.436
SerIle: 4.355 ± 1.298
SerLys: 4.751 ± 1.151
SerLeu: 6.334 ± 1.435
SerMet: 1.188 ± 0.945
SerAsn: 3.959 ± 1.19
SerPro: 1.188 ± 0.564
SerGln: 1.979 ± 0.92
SerArg: 0.396 ± 0.326
SerSer: 2.771 ± 1.245
SerThr: 1.584 ± 0.623
SerVal: 5.542 ± 1.448
SerTrp: 1.188 ± 0.681
SerTyr: 3.959 ± 1.128
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.355 ± 1.474
ThrCys: 0.396 ± 0.326
ThrAsp: 3.563 ± 1.133
ThrGlu: 5.542 ± 1.151
ThrPhe: 2.771 ± 0.962
ThrGly: 2.375 ± 1.074
ThrHis: 0.792 ± 0.584
ThrIle: 3.563 ± 1.101
ThrLys: 4.751 ± 1.009
ThrLeu: 7.126 ± 1.685
ThrMet: 2.375 ± 0.985
ThrAsn: 1.979 ± 1.177
ThrPro: 1.584 ± 0.847
ThrGln: 3.167 ± 1.116
ThrArg: 3.167 ± 0.976
ThrSer: 1.584 ± 0.51
ThrThr: 3.563 ± 1.125
ThrVal: 5.146 ± 1.924
ThrTrp: 0.792 ± 0.535
ThrTyr: 1.979 ± 0.918
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.771 ± 0.757
ValCys: 0.396 ± 0.326
ValAsp: 2.375 ± 0.831
ValGlu: 0.792 ± 0.645
ValPhe: 1.979 ± 0.586
ValGly: 0.396 ± 0.398
ValHis: 1.188 ± 0.768
ValIle: 3.959 ± 1.461
ValLys: 8.709 ± 1.547
ValLeu: 4.355 ± 1.072
ValMet: 0.396 ± 0.406
ValAsn: 1.979 ± 0.611
ValPro: 1.979 ± 0.821
ValGln: 0.396 ± 0.406
ValArg: 3.959 ± 1.702
ValSer: 3.563 ± 0.753
ValThr: 5.146 ± 1.128
ValVal: 1.979 ± 1.069
ValTrp: 0.0 ± 0.0
ValTyr: 3.563 ± 1.031
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.396 ± 0.406
TrpGlu: 1.188 ± 0.716
TrpPhe: 0.396 ± 0.326
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.792 ± 0.771
TrpLys: 0.396 ± 0.528
TrpLeu: 2.771 ± 0.735
TrpMet: 0.0 ± 0.0
TrpAsn: 0.396 ± 0.528
TrpPro: 0.0 ± 0.0
TrpGln: 0.792 ± 0.652
TrpArg: 0.396 ± 0.326
TrpSer: 0.396 ± 0.387
TrpThr: 0.0 ± 0.0
TrpVal: 0.396 ± 0.326
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.188 ± 0.611
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.792 ± 0.531
TyrCys: 0.0 ± 0.0
TyrAsp: 1.584 ± 0.84
TyrGlu: 2.771 ± 0.753
TyrPhe: 3.563 ± 1.123
TyrGly: 3.563 ± 1.119
TyrHis: 0.792 ± 0.652
TyrIle: 3.563 ± 1.053
TyrLys: 4.355 ± 0.859
TyrLeu: 6.334 ± 0.974
TyrMet: 0.792 ± 0.518
TyrAsn: 3.563 ± 1.033
TyrPro: 0.0 ± 0.0
TyrGln: 2.771 ± 1.291
TyrArg: 2.375 ± 0.823
TyrSer: 1.979 ± 1.264
TyrThr: 4.355 ± 1.39
TyrVal: 0.792 ± 0.436
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.375 ± 1.119
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (2527 amino acids)
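Each entry in the listing above carries a label of the form `LeuLys: 13.064 ± 2.007`. To work with the values programmatically, a minimal Python sketch along the following lines could parse such lines into a lookup table. The regular expression and the function name `parse_entries` are assumptions based only on the text layout of this page, not on the format of the downloadable CSV file.

```python
import re

# Two three-letter residue codes, a colon, then "value ± error".
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([\d.]+) ± ([\d.]+)")


def parse_entries(lines):
    """Map (first, second) residue codes to (frequency, error), both in per mille."""
    table = {}
    for line in lines:
        match = ENTRY.search(line)  # row labels such as "Leu" simply do not match
        if match:
            first, second, freq, err = match.groups()
            table[(first, second)] = (float(freq), float(err))
    return table


# A few lines copied from the listing above.
sample = ["Leu", "LeuLys: 13.064 ± 2.007", "LysGlu: 12.668 ± 2.224", "GlyPro: 0.0 ± 0.0"]
table = parse_entries(sample)
top = max(table, key=lambda key: table[key][0])               # most frequent pair
absent = [key for key, (freq, _) in table.items() if freq == 0.0]
```

Because the pattern is applied with `search`, a line that still has the cell value fused in front of the label is parsed the same way.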

Note: The error has been estimated by bootstrapping (x100) at the protein level.
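Protein-level bootstrapping means that whole proteins, not individual residues, are resampled with replacement, and the statistic is recomputed on each resample. The sketch below illustrates the idea under several assumptions: the error is taken as the standard deviation across 100 resamples, pairs are counted within each protein, and the names `pair_frequency` and `bootstrap_error` as well as the toy sequences are hypothetical. The exact procedure used by the database is not described on this page.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(resample, pair))
    return statistics.pstdev(estimates)


# Toy sequences in one-letter code (not real Javan76 proteins).
toy = ["MKLLKE", "MLKLK", "AAAA", "LKLKLK"]
error_lk = bootstrap_error(toy, "LK")
error_ww = bootstrap_error(toy, "WW")  # pair never occurs, so every estimate is 0
```

Under this scheme a dipeptide that is absent from every protein has zero frequency in every resample, which is consistent with the `0.0 ± 0.0` entries in the table.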

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski