Amino acid dipepetide frequency for Streptococcus satellite phage Javan60

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.188AlaAla: 1.188 ± 0.787
0.396AlaCys: 0.396 ± 0.323
4.355AlaAsp: 4.355 ± 1.169
2.771AlaGlu: 2.771 ± 1.278
2.375AlaPhe: 2.375 ± 0.863
3.959AlaGly: 3.959 ± 0.72
0.792AlaHis: 0.792 ± 0.59
4.355AlaIle: 4.355 ± 1.066
4.751AlaLys: 4.751 ± 1.352
8.314AlaLeu: 8.314 ± 1.585
0.792AlaMet: 0.792 ± 0.481
1.979AlaAsn: 1.979 ± 0.921
2.375AlaPro: 2.375 ± 0.726
3.167AlaGln: 3.167 ± 1.136
2.375AlaArg: 2.375 ± 0.791
4.751AlaSer: 4.751 ± 1.391
3.563AlaThr: 3.563 ± 1.965
1.979AlaVal: 1.979 ± 0.692
0.396AlaTrp: 0.396 ± 0.378
2.375AlaTyr: 2.375 ± 0.928
0.0AlaXaa: 0.0 ± 0.0
Cys
0.396CysAla: 0.396 ± 0.384
0.0CysCys: 0.0 ± 0.0
0.396CysAsp: 0.396 ± 0.366
0.396CysGlu: 0.396 ± 0.384
0.396CysPhe: 0.396 ± 0.323
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.188CysIle: 1.188 ± 0.697
0.0CysLys: 0.0 ± 0.0
0.396CysLeu: 0.396 ± 0.323
0.0CysMet: 0.0 ± 0.0
0.792CysAsn: 0.792 ± 0.646
0.396CysPro: 0.396 ± 0.344
0.0CysGln: 0.0 ± 0.0
0.396CysArg: 0.396 ± 0.344
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.396CysTrp: 0.396 ± 0.428
0.396CysTyr: 0.396 ± 0.323
0.0CysXaa: 0.0 ± 0.0
Asp
1.188AspAla: 1.188 ± 0.785
0.396AspCys: 0.396 ± 0.384
1.188AspAsp: 1.188 ± 0.537
4.355AspGlu: 4.355 ± 1.487
5.938AspPhe: 5.938 ± 1.001
1.188AspGly: 1.188 ± 0.592
0.396AspHis: 0.396 ± 0.416
4.355AspIle: 4.355 ± 1.021
7.918AspLys: 7.918 ± 1.598
3.959AspLeu: 3.959 ± 1.302
1.979AspMet: 1.979 ± 1.12
4.355AspAsn: 4.355 ± 2.003
1.188AspPro: 1.188 ± 0.587
0.792AspGln: 0.792 ± 0.423
0.792AspArg: 0.792 ± 0.423
1.584AspSer: 1.584 ± 0.581
4.751AspThr: 4.751 ± 1.994
3.563AspVal: 3.563 ± 1.286
0.396AspTrp: 0.396 ± 0.323
3.959AspTyr: 3.959 ± 1.863
0.0AspXaa: 0.0 ± 0.0
Glu
5.938GluAla: 5.938 ± 1.764
0.396GluCys: 0.396 ± 0.384
3.563GluAsp: 3.563 ± 1.081
4.751GluGlu: 4.751 ± 1.602
1.979GluPhe: 1.979 ± 0.853
2.375GluGly: 2.375 ± 1.395
1.584GluHis: 1.584 ± 1.068
6.334GluIle: 6.334 ± 1.359
10.689GluLys: 10.689 ± 1.238
10.689GluLeu: 10.689 ± 1.909
2.375GluMet: 2.375 ± 0.872
3.167GluAsn: 3.167 ± 1.07
1.979GluPro: 1.979 ± 0.894
1.979GluGln: 1.979 ± 1.122
3.959GluArg: 3.959 ± 1.721
3.959GluSer: 3.959 ± 1.15
2.771GluThr: 2.771 ± 0.935
4.355GluVal: 4.355 ± 1.11
1.188GluTrp: 1.188 ± 0.669
3.167GluTyr: 3.167 ± 1.218
0.0GluXaa: 0.0 ± 0.0
Phe
1.188PheAla: 1.188 ± 0.56
0.792PheCys: 0.792 ± 0.48
3.167PheAsp: 3.167 ± 1.458
3.563PheGlu: 3.563 ± 1.633
1.979PhePhe: 1.979 ± 1.222
2.771PheGly: 2.771 ± 1.141
0.792PheHis: 0.792 ± 0.54
3.167PheIle: 3.167 ± 1.28
3.563PheLys: 3.563 ± 1.619
4.355PheLeu: 4.355 ± 1.854
0.396PheMet: 0.396 ± 0.372
2.771PheAsn: 2.771 ± 0.91
0.792PhePro: 0.792 ± 0.559
1.188PheGln: 1.188 ± 0.697
1.979PheArg: 1.979 ± 0.822
2.771PheSer: 2.771 ± 0.986
2.771PheThr: 2.771 ± 1.068
1.979PheVal: 1.979 ± 0.707
0.0PheTrp: 0.0 ± 0.0
1.979PheTyr: 1.979 ± 0.932
0.0PheXaa: 0.0 ± 0.0
Gly
1.584GlyAla: 1.584 ± 0.608
0.396GlyCys: 0.396 ± 0.344
2.771GlyAsp: 2.771 ± 1.268
3.167GlyGlu: 3.167 ± 1.062
1.584GlyPhe: 1.584 ± 0.859
1.188GlyGly: 1.188 ± 0.585
1.584GlyHis: 1.584 ± 0.512
3.959GlyIle: 3.959 ± 1.709
4.751GlyLys: 4.751 ± 1.224
3.959GlyLeu: 3.959 ± 1.656
1.584GlyMet: 1.584 ± 0.923
1.584GlyAsn: 1.584 ± 1.0
0.0GlyPro: 0.0 ± 0.0
1.584GlyGln: 1.584 ± 0.97
3.167GlyArg: 3.167 ± 0.554
1.188GlySer: 1.188 ± 0.412
3.563GlyThr: 3.563 ± 0.839
2.771GlyVal: 2.771 ± 1.171
0.792GlyTrp: 0.792 ± 0.439
2.375GlyTyr: 2.375 ± 0.87
0.0GlyXaa: 0.0 ± 0.0
His
2.771HisAla: 2.771 ± 1.155
0.0HisCys: 0.0 ± 0.0
0.396HisAsp: 0.396 ± 0.323
2.375HisGlu: 2.375 ± 1.157
1.188HisPhe: 1.188 ± 0.641
0.792HisGly: 0.792 ± 0.439
0.0HisHis: 0.0 ± 0.0
0.396HisIle: 0.396 ± 0.448
0.792HisLys: 0.792 ± 0.497
2.375HisLeu: 2.375 ± 1.098
0.0HisMet: 0.0 ± 0.0
1.188HisAsn: 1.188 ± 0.608
0.0HisPro: 0.0 ± 0.0
0.792HisGln: 0.792 ± 0.42
0.792HisArg: 0.792 ± 0.769
0.792HisSer: 0.792 ± 0.471
0.792HisThr: 0.792 ± 0.481
1.584HisVal: 1.584 ± 0.813
0.396HisTrp: 0.396 ± 0.384
0.792HisTyr: 0.792 ± 0.556
0.0HisXaa: 0.0 ± 0.0
Ile
4.751IleAla: 4.751 ± 1.024
0.0IleCys: 0.0 ± 0.0
2.771IleAsp: 2.771 ± 1.211
5.938IleGlu: 5.938 ± 1.52
1.979IlePhe: 1.979 ± 0.802
3.563IleGly: 3.563 ± 1.258
0.396IleHis: 0.396 ± 0.366
4.751IleIle: 4.751 ± 1.633
8.709IleLys: 8.709 ± 1.451
8.709IleLeu: 8.709 ± 1.657
0.792IleMet: 0.792 ± 0.521
5.542IleAsn: 5.542 ± 1.444
3.167IlePro: 3.167 ± 1.301
2.375IleGln: 2.375 ± 1.225
1.979IleArg: 1.979 ± 0.66
5.146IleSer: 5.146 ± 1.407
3.959IleThr: 3.959 ± 1.337
3.563IleVal: 3.563 ± 1.187
0.0IleTrp: 0.0 ± 0.0
4.355IleTyr: 4.355 ± 1.208
0.0IleXaa: 0.0 ± 0.0
Lys
6.73LysAla: 6.73 ± 1.68
0.396LysCys: 0.396 ± 0.323
6.334LysAsp: 6.334 ± 1.547
12.668LysGlu: 12.668 ± 2.021
3.167LysPhe: 3.167 ± 0.658
4.355LysGly: 4.355 ± 1.279
2.771LysHis: 2.771 ± 1.251
7.918LysIle: 7.918 ± 1.553
10.689LysLys: 10.689 ± 1.657
7.918LysLeu: 7.918 ± 1.437
1.979LysMet: 1.979 ± 1.021
6.334LysAsn: 6.334 ± 1.359
3.959LysPro: 3.959 ± 1.073
1.979LysGln: 1.979 ± 0.692
7.126LysArg: 7.126 ± 1.478
5.542LysSer: 5.542 ± 1.041
7.522LysThr: 7.522 ± 0.889
4.751LysVal: 4.751 ± 0.86
1.188LysTrp: 1.188 ± 0.685
3.167LysTyr: 3.167 ± 1.019
0.0LysXaa: 0.0 ± 0.0
Leu
9.105LeuAla: 9.105 ± 2.517
0.792LeuCys: 0.792 ± 0.646
7.522LeuAsp: 7.522 ± 1.246
8.709LeuGlu: 8.709 ± 1.672
3.959LeuPhe: 3.959 ± 1.4
5.146LeuGly: 5.146 ± 1.729
2.375LeuHis: 2.375 ± 0.98
6.334LeuIle: 6.334 ± 1.462
13.064LeuLys: 13.064 ± 2.128
11.085LeuLeu: 11.085 ± 1.567
2.771LeuMet: 2.771 ± 0.65
4.355LeuAsn: 4.355 ± 0.808
4.751LeuPro: 4.751 ± 0.918
4.751LeuGln: 4.751 ± 1.458
3.563LeuArg: 3.563 ± 0.885
7.918LeuSer: 7.918 ± 1.575
5.542LeuThr: 5.542 ± 1.773
2.375LeuVal: 2.375 ± 1.385
1.584LeuTrp: 1.584 ± 0.922
4.355LeuTyr: 4.355 ± 1.77
0.0LeuXaa: 0.0 ± 0.0
Met
2.771MetAla: 2.771 ± 0.738
0.0MetCys: 0.0 ± 0.0
1.979MetAsp: 1.979 ± 0.736
1.979MetGlu: 1.979 ± 0.859
0.396MetPhe: 0.396 ± 0.366
0.0MetGly: 0.0 ± 0.0
0.396MetHis: 0.396 ± 0.378
1.584MetIle: 1.584 ± 0.574
3.167MetLys: 3.167 ± 1.455
1.584MetLeu: 1.584 ± 0.958
1.188MetMet: 1.188 ± 0.8
1.979MetAsn: 1.979 ± 0.683
0.0MetPro: 0.0 ± 0.0
0.792MetGln: 0.792 ± 0.472
0.396MetArg: 0.396 ± 0.384
0.792MetSer: 0.792 ± 0.449
1.584MetThr: 1.584 ± 0.684
0.792MetVal: 0.792 ± 0.481
0.396MetTrp: 0.396 ± 0.448
0.792MetTyr: 0.792 ± 0.572
0.0MetXaa: 0.0 ± 0.0
Asn
2.375AsnAla: 2.375 ± 0.833
0.792AsnCys: 0.792 ± 0.54
3.959AsnAsp: 3.959 ± 0.857
3.563AsnGlu: 3.563 ± 1.09
1.584AsnPhe: 1.584 ± 0.962
5.542AsnGly: 5.542 ± 1.663
0.792AsnHis: 0.792 ± 0.532
3.563AsnIle: 3.563 ± 1.008
4.355AsnLys: 4.355 ± 1.064
5.542AsnLeu: 5.542 ± 2.135
1.584AsnMet: 1.584 ± 0.823
3.959AsnAsn: 3.959 ± 1.41
3.959AsnPro: 3.959 ± 0.941
2.771AsnGln: 2.771 ± 0.91
3.167AsnArg: 3.167 ± 1.195
2.771AsnSer: 2.771 ± 1.809
2.375AsnThr: 2.375 ± 0.896
1.979AsnVal: 1.979 ± 0.793
0.0AsnTrp: 0.0 ± 0.0
0.792AsnTyr: 0.792 ± 0.423
0.0AsnXaa: 0.0 ± 0.0
Pro
1.584ProAla: 1.584 ± 0.7
0.0ProCys: 0.0 ± 0.0
0.0ProAsp: 0.0 ± 0.0
2.375ProGlu: 2.375 ± 0.693
1.979ProPhe: 1.979 ± 0.744
0.0ProGly: 0.0 ± 0.0
0.396ProHis: 0.396 ± 0.477
2.375ProIle: 2.375 ± 1.084
3.959ProLys: 3.959 ± 1.15
3.563ProLeu: 3.563 ± 1.08
0.792ProMet: 0.792 ± 0.521
2.375ProAsn: 2.375 ± 1.284
1.188ProPro: 1.188 ± 0.689
1.584ProGln: 1.584 ± 0.962
1.584ProArg: 1.584 ± 0.672
1.979ProSer: 1.979 ± 0.608
1.979ProThr: 1.979 ± 0.776
1.584ProVal: 1.584 ± 0.558
0.0ProTrp: 0.0 ± 0.0
0.792ProTyr: 0.792 ± 0.451
0.0ProXaa: 0.0 ± 0.0
Gln
1.584GlnAla: 1.584 ± 0.764
0.0GlnCys: 0.0 ± 0.0
1.979GlnAsp: 1.979 ± 0.929
3.959GlnGlu: 3.959 ± 0.74
0.792GlnPhe: 0.792 ± 0.451
2.375GlnGly: 2.375 ± 0.855
0.396GlnHis: 0.396 ± 0.323
2.771GlnIle: 2.771 ± 0.754
3.167GlnLys: 3.167 ± 1.125
6.334GlnLeu: 6.334 ± 1.411
0.792GlnMet: 0.792 ± 0.483
1.584GlnAsn: 1.584 ± 0.441
0.792GlnPro: 0.792 ± 0.482
3.167GlnGln: 3.167 ± 0.747
2.375GlnArg: 2.375 ± 0.949
1.188GlnSer: 1.188 ± 0.68
2.375GlnThr: 2.375 ± 0.632
2.375GlnVal: 2.375 ± 0.94
0.396GlnTrp: 0.396 ± 0.416
1.584GlnTyr: 1.584 ± 0.632
0.0GlnXaa: 0.0 ± 0.0
Arg
4.355ArgAla: 4.355 ± 1.039
0.0ArgCys: 0.0 ± 0.0
1.979ArgAsp: 1.979 ± 0.887
4.355ArgGlu: 4.355 ± 0.969
1.979ArgPhe: 1.979 ± 0.908
2.375ArgGly: 2.375 ± 1.02
1.979ArgHis: 1.979 ± 0.806
3.167ArgIle: 3.167 ± 1.279
3.959ArgLys: 3.959 ± 1.035
4.355ArgLeu: 4.355 ± 1.551
0.396ArgMet: 0.396 ± 0.344
1.584ArgAsn: 1.584 ± 0.862
0.792ArgPro: 0.792 ± 0.482
3.959ArgGln: 3.959 ± 1.096
3.167ArgArg: 3.167 ± 1.088
1.188ArgSer: 1.188 ± 0.412
2.375ArgThr: 2.375 ± 0.758
2.375ArgVal: 2.375 ± 0.995
0.396ArgTrp: 0.396 ± 0.46
2.375ArgTyr: 2.375 ± 1.085
0.0ArgXaa: 0.0 ± 0.0
Ser
1.584SerAla: 1.584 ± 0.569
0.396SerCys: 0.396 ± 0.366
3.959SerAsp: 3.959 ± 0.849
1.979SerGlu: 1.979 ± 0.745
1.979SerPhe: 1.979 ± 1.012
1.979SerGly: 1.979 ± 0.838
0.792SerHis: 0.792 ± 0.459
4.355SerIle: 4.355 ± 1.205
4.751SerLys: 4.751 ± 0.957
6.334SerLeu: 6.334 ± 1.329
1.188SerMet: 1.188 ± 0.85
3.959SerAsn: 3.959 ± 1.101
1.188SerPro: 1.188 ± 0.581
1.979SerGln: 1.979 ± 0.849
0.396SerArg: 0.396 ± 0.323
2.771SerSer: 2.771 ± 1.249
1.584SerThr: 1.584 ± 0.628
5.542SerVal: 5.542 ± 1.294
1.188SerTrp: 1.188 ± 0.677
3.959SerTyr: 3.959 ± 1.358
0.0SerXaa: 0.0 ± 0.0
Thr
4.355ThrAla: 4.355 ± 1.575
0.396ThrCys: 0.396 ± 0.323
3.563ThrAsp: 3.563 ± 1.123
5.542ThrGlu: 5.542 ± 1.456
2.771ThrPhe: 2.771 ± 1.006
2.375ThrGly: 2.375 ± 1.045
0.792ThrHis: 0.792 ± 0.526
3.563ThrIle: 3.563 ± 1.381
4.751ThrLys: 4.751 ± 0.917
7.126ThrLeu: 7.126 ± 1.993
2.375ThrMet: 2.375 ± 1.017
1.979ThrAsn: 1.979 ± 1.16
1.584ThrPro: 1.584 ± 0.841
3.167ThrGln: 3.167 ± 1.256
3.167ThrArg: 3.167 ± 1.021
1.584ThrSer: 1.584 ± 0.512
3.563ThrThr: 3.563 ± 1.084
5.146ThrVal: 5.146 ± 1.77
0.792ThrTrp: 0.792 ± 0.494
1.979ThrTyr: 1.979 ± 0.938
0.0ThrXaa: 0.0 ± 0.0
Val
2.771ValAla: 2.771 ± 0.742
0.396ValCys: 0.396 ± 0.323
2.375ValAsp: 2.375 ± 0.882
0.792ValGlu: 0.792 ± 0.547
1.979ValPhe: 1.979 ± 0.54
0.396ValGly: 0.396 ± 0.366
1.188ValHis: 1.188 ± 0.7
3.959ValIle: 3.959 ± 1.448
8.709ValLys: 8.709 ± 1.453
4.355ValLeu: 4.355 ± 1.03
0.396ValMet: 0.396 ± 0.378
1.979ValAsn: 1.979 ± 0.529
1.979ValPro: 1.979 ± 0.724
0.396ValGln: 0.396 ± 0.378
3.959ValArg: 3.959 ± 1.569
3.563ValSer: 3.563 ± 0.757
5.146ValThr: 5.146 ± 1.178
1.979ValVal: 1.979 ± 0.931
0.0ValTrp: 0.0 ± 0.0
3.563ValTyr: 3.563 ± 0.915
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.396TrpAsp: 0.396 ± 0.378
1.188TrpGlu: 1.188 ± 0.716
0.396TrpPhe: 0.396 ± 0.323
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.792TrpIle: 0.792 ± 0.727
0.396TrpLys: 0.396 ± 0.477
2.771TrpLeu: 2.771 ± 0.802
0.0TrpMet: 0.0 ± 0.0
0.396TrpAsn: 0.396 ± 0.477
0.0TrpPro: 0.0 ± 0.0
0.792TrpGln: 0.792 ± 0.646
0.396TrpArg: 0.396 ± 0.323
0.396TrpSer: 0.396 ± 0.344
0.0TrpThr: 0.0 ± 0.0
0.396TrpVal: 0.396 ± 0.323
0.0TrpTrp: 0.0 ± 0.0
1.188TrpTyr: 1.188 ± 0.655
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.792TyrAla: 0.792 ± 0.559
0.0TyrCys: 0.0 ± 0.0
1.584TyrAsp: 1.584 ± 0.842
2.771TyrGlu: 2.771 ± 0.724
3.563TyrPhe: 3.563 ± 0.997
3.563TyrGly: 3.563 ± 1.064
0.792TyrHis: 0.792 ± 0.646
3.563TyrIle: 3.563 ± 1.155
4.355TyrLys: 4.355 ± 0.857
6.334TyrLeu: 6.334 ± 1.067
0.792TyrMet: 0.792 ± 0.471
3.563TyrAsn: 3.563 ± 1.127
0.0TyrPro: 0.0 ± 0.0
2.771TyrGln: 2.771 ± 1.155
2.375TyrArg: 2.375 ± 0.709
1.979TyrSer: 1.979 ± 1.159
4.355TyrThr: 4.355 ± 1.335
0.792TyrVal: 0.792 ± 0.459
0.0TyrTrp: 0.0 ± 0.0
2.375TyrTyr: 2.375 ± 1.099
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (2527 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski