Amino acid dipeptide frequency for Streptococcus satellite phage Javan651

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
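As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides in a set of protein sequences and compares each frequency with the uniform expectation of 1/400 (2.5‰). The function names, the toy sequences, and the choice to count pairs only within each protein (never across protein boundaries) are assumptions made for illustration; they are not taken from the Proteome-pI pipeline.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Any residue outside the 20 standard letters is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein: positions (0,1), (1,2), ...
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

def classify(freq_permille, n_letters=20):
    """Compare a frequency with the uniform expectation 1000 / n_letters**2."""
    expected = 1000.0 / n_letters ** 2  # 2.5 per mille for 400 dipeptides
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"

# Toy input, not real phage proteins.
toy = ["MKLLEEL", "MLLKE"]
freqs = dipeptide_frequencies(toy)
```

Passing `n_letters=21` to `classify` gives the expectation for the 441-dipeptide case that includes Xaa.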

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
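To make the unit conversion concrete, here is a small Python sketch that reads one table entry and converts both the value and its error from per mille to plain fractions. The helper names and the regular expression are hypothetical; the pattern only assumes that an entry contains text of the form "LeuLeu: 12.278 ± 1.313", as in the table.

```python
import re

# Matches entries such as "AlaAla: 0.847 ± 0.809" (three-letter residue codes).
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([0-9.]+) ± ([0-9.]+)")

def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10**-3)."""
    return value_permille * 1e-3

def parse_entry(line):
    """Return (first, second, frequency, error) with both numbers as fractions."""
    match = ENTRY.search(line)
    if match is None:
        raise ValueError(f"not a dipeptide entry: {line!r}")
    first, second, value, error = match.groups()
    return (first, second,
            permille_to_fraction(float(value)),
            permille_to_fraction(float(error)))

first, second, freq, err = parse_entry("LeuLeu: 12.278 ± 1.313")
```

Here `freq` is roughly 0.0123, that is, about 1.2% of all dipeptides.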

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.847 ± 0.809
AlaCys: 0.847 ± 0.519
AlaAsp: 2.54 ± 0.916
AlaGlu: 6.774 ± 2.205
AlaPhe: 1.693 ± 0.997
AlaGly: 2.54 ± 1.032
AlaHis: 0.423 ± 0.436
AlaIle: 4.234 ± 1.404
AlaLys: 4.234 ± 1.191
AlaLeu: 3.387 ± 1.157
AlaMet: 0.423 ± 0.423
AlaAsn: 6.351 ± 1.585
AlaPro: 1.27 ± 0.869
AlaGln: 2.117 ± 1.01
AlaArg: 2.117 ± 0.644
AlaSer: 2.54 ± 0.86
AlaThr: 4.657 ± 1.197
AlaVal: 2.54 ± 0.734
AlaTrp: 0.847 ± 0.428
AlaTyr: 4.234 ± 0.915
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.423 ± 0.436
CysCys: 0.0 ± 0.0
CysAsp: 0.423 ± 0.326
CysGlu: 0.0 ± 0.0
CysPhe: 0.847 ± 0.614
CysGly: 0.423 ± 0.338
CysHis: 0.0 ± 0.0
CysIle: 1.693 ± 0.721
CysLys: 0.423 ± 0.436
CysLeu: 0.423 ± 0.423
CysMet: 0.0 ± 0.0
CysAsn: 0.847 ± 0.366
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.423 ± 0.326
CysSer: 0.423 ± 0.436
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.693 ± 0.715
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.27 ± 0.733
AspCys: 0.423 ± 0.456
AspAsp: 2.117 ± 0.941
AspGlu: 5.08 ± 1.532
AspPhe: 4.234 ± 0.836
AspGly: 2.117 ± 0.919
AspHis: 0.0 ± 0.0
AspIle: 5.927 ± 1.711
AspLys: 5.08 ± 1.186
AspLeu: 5.08 ± 1.217
AspMet: 2.117 ± 0.93
AspAsn: 4.657 ± 0.821
AspPro: 1.27 ± 0.766
AspGln: 0.847 ± 0.428
AspArg: 2.117 ± 0.957
AspSer: 3.387 ± 0.772
AspThr: 3.81 ± 1.205
AspVal: 1.693 ± 0.572
AspTrp: 0.0 ± 0.0
AspTyr: 3.387 ± 1.58
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.774 ± 1.707
GluCys: 0.423 ± 0.404
GluAsp: 3.387 ± 1.314
GluGlu: 6.774 ± 1.62
GluPhe: 4.234 ± 1.126
GluGly: 1.693 ± 0.781
GluHis: 0.847 ± 0.48
GluIle: 5.504 ± 0.96
GluLys: 7.197 ± 2.658
GluLeu: 10.161 ± 1.716
GluMet: 0.847 ± 0.49
GluAsn: 4.657 ± 1.281
GluPro: 1.693 ± 0.585
GluGln: 4.234 ± 1.298
GluArg: 3.81 ± 1.235
GluSer: 3.387 ± 1.046
GluThr: 2.964 ± 1.055
GluVal: 4.234 ± 1.408
GluTrp: 0.847 ± 0.452
GluTyr: 1.693 ± 0.638
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.0 ± 0.0
PheAsp: 5.504 ± 1.093
PheGlu: 2.117 ± 1.199
PhePhe: 2.964 ± 1.228
PheGly: 2.964 ± 0.868
PheHis: 0.847 ± 0.418
PheIle: 2.964 ± 0.852
PheLys: 2.964 ± 1.108
PheLeu: 5.504 ± 1.148
PheMet: 1.27 ± 0.759
PheAsn: 3.387 ± 1.239
PhePro: 1.693 ± 0.748
PheGln: 1.693 ± 0.762
PheArg: 0.847 ± 0.513
PheSer: 2.54 ± 0.927
PheThr: 2.54 ± 0.848
PheVal: 1.27 ± 0.766
PheTrp: 0.847 ± 0.871
PheTyr: 1.693 ± 0.448
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.423 ± 0.326
GlyCys: 0.423 ± 0.326
GlyAsp: 2.964 ± 1.432
GlyGlu: 4.657 ± 0.632
GlyPhe: 2.54 ± 0.857
GlyGly: 2.117 ± 0.81
GlyHis: 0.423 ± 0.326
GlyIle: 4.234 ± 1.195
GlyLys: 5.08 ± 1.47
GlyLeu: 3.81 ± 1.437
GlyMet: 1.27 ± 0.732
GlyAsn: 3.81 ± 1.659
GlyPro: 0.0 ± 0.0
GlyGln: 0.423 ± 0.456
GlyArg: 4.234 ± 0.777
GlySer: 1.27 ± 0.766
GlyThr: 3.387 ± 1.56
GlyVal: 4.234 ± 1.003
GlyTrp: 0.847 ± 0.677
GlyTyr: 3.81 ± 1.293
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.847 ± 0.5
HisCys: 0.0 ± 0.0
HisAsp: 0.423 ± 0.412
HisGlu: 0.847 ± 0.452
HisPhe: 1.27 ± 0.731
HisGly: 2.117 ± 0.995
HisHis: 0.0 ± 0.0
HisIle: 1.27 ± 0.412
HisLys: 1.27 ± 0.715
HisLeu: 2.54 ± 0.798
HisMet: 0.0 ± 0.0
HisAsn: 1.27 ± 0.963
HisPro: 0.423 ± 0.452
HisGln: 0.0 ± 0.0
HisArg: 0.847 ± 0.507
HisSer: 0.847 ± 0.513
HisThr: 2.117 ± 0.844
HisVal: 1.27 ± 0.517
HisTrp: 0.0 ± 0.0
HisTyr: 1.27 ± 0.586
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.08 ± 1.808
IleCys: 0.423 ± 0.423
IleAsp: 4.234 ± 1.091
IleGlu: 3.81 ± 1.721
IlePhe: 2.964 ± 0.921
IleGly: 3.387 ± 0.915
IleHis: 1.27 ± 0.521
IleIle: 4.234 ± 1.177
IleLys: 5.927 ± 1.523
IleLeu: 6.351 ± 1.588
IleMet: 0.423 ± 0.317
IleAsn: 2.964 ± 1.034
IlePro: 3.81 ± 1.383
IleGln: 2.117 ± 0.795
IleArg: 4.234 ± 2.077
IleSer: 2.964 ± 1.145
IleThr: 5.504 ± 1.327
IleVal: 4.234 ± 1.214
IleTrp: 0.423 ± 0.436
IleTyr: 3.387 ± 0.977
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.504 ± 2.307
LysCys: 0.423 ± 0.404
LysAsp: 5.08 ± 1.251
LysGlu: 8.891 ± 2.463
LysPhe: 2.54 ± 0.676
LysGly: 3.81 ± 1.484
LysHis: 3.81 ± 0.624
LysIle: 6.351 ± 1.175
LysLys: 8.467 ± 2.17
LysLeu: 8.044 ± 1.807
LysMet: 2.964 ± 0.852
LysAsn: 5.927 ± 2.09
LysPro: 2.964 ± 1.414
LysGln: 5.08 ± 0.811
LysArg: 6.351 ± 1.341
LysSer: 5.08 ± 1.434
LysThr: 2.117 ± 0.663
LysVal: 4.657 ± 1.095
LysTrp: 0.847 ± 0.506
LysTyr: 1.693 ± 0.647
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.234 ± 1.142
LeuCys: 0.847 ± 0.677
LeuAsp: 8.044 ± 1.089
LeuGlu: 8.467 ± 3.126
LeuPhe: 2.54 ± 0.883
LeuGly: 5.927 ± 1.711
LeuHis: 2.54 ± 0.838
LeuIle: 3.81 ± 1.194
LeuLys: 8.467 ± 1.971
LeuLeu: 12.278 ± 1.313
LeuMet: 1.27 ± 0.634
LeuAsn: 5.927 ± 2.136
LeuPro: 4.234 ± 1.148
LeuGln: 5.504 ± 0.959
LeuArg: 4.234 ± 1.606
LeuSer: 7.621 ± 2.103
LeuThr: 5.927 ± 1.592
LeuVal: 2.964 ± 1.195
LeuTrp: 0.847 ± 0.48
LeuTyr: 4.234 ± 1.044
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.693 ± 1.502
MetCys: 0.0 ± 0.0
MetAsp: 0.847 ± 0.526
MetGlu: 1.693 ± 0.881
MetPhe: 0.0 ± 0.0
MetGly: 0.423 ± 0.412
MetHis: 0.423 ± 0.412
MetIle: 0.423 ± 0.452
MetLys: 2.117 ± 0.697
MetLeu: 2.117 ± 0.838
MetMet: 0.423 ± 0.326
MetAsn: 2.54 ± 0.898
MetPro: 0.847 ± 0.506
MetGln: 1.27 ± 0.508
MetArg: 1.27 ± 0.641
MetSer: 0.423 ± 0.338
MetThr: 2.964 ± 1.484
MetVal: 0.423 ± 0.338
MetTrp: 0.423 ± 0.456
MetTyr: 0.423 ± 0.436
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.657 ± 0.71
AsnCys: 0.847 ± 0.428
AsnAsp: 2.117 ± 0.44
AsnGlu: 5.08 ± 0.978
AsnPhe: 2.117 ± 0.933
AsnGly: 5.927 ± 1.584
AsnHis: 1.27 ± 0.586
AsnIle: 5.504 ± 1.501
AsnLys: 7.197 ± 0.848
AsnLeu: 4.657 ± 1.398
AsnMet: 1.693 ± 0.622
AsnAsn: 3.387 ± 1.155
AsnPro: 2.964 ± 1.103
AsnGln: 0.847 ± 0.628
AsnArg: 2.964 ± 0.738
AsnSer: 4.234 ± 1.313
AsnThr: 5.927 ± 2.053
AsnVal: 2.54 ± 1.314
AsnTrp: 0.847 ± 0.629
AsnTyr: 2.964 ± 1.265
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.27 ± 0.561
ProCys: 0.423 ± 0.436
ProAsp: 0.847 ± 0.578
ProGlu: 2.117 ± 1.038
ProPhe: 2.964 ± 0.887
ProGly: 0.423 ± 0.423
ProHis: 0.0 ± 0.0
ProIle: 2.54 ± 0.777
ProLys: 2.964 ± 1.638
ProLeu: 2.54 ± 0.64
ProMet: 0.847 ± 0.506
ProAsn: 4.234 ± 1.322
ProPro: 0.847 ± 0.677
ProGln: 1.693 ± 0.798
ProArg: 2.54 ± 0.757
ProSer: 2.964 ± 1.266
ProThr: 1.27 ± 0.561
ProVal: 2.117 ± 1.243
ProTrp: 0.0 ± 0.0
ProTyr: 1.693 ± 0.776
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.351 ± 2.317
GlnCys: 0.423 ± 0.436
GlnAsp: 1.27 ± 0.732
GlnGlu: 2.964 ± 0.982
GlnPhe: 3.387 ± 0.982
GlnGly: 1.27 ± 0.84
GlnHis: 1.693 ± 0.735
GlnIle: 2.117 ± 0.73
GlnLys: 2.964 ± 0.824
GlnLeu: 2.964 ± 0.961
GlnMet: 0.0 ± 0.399
GlnAsn: 0.0 ± 0.0
GlnPro: 0.847 ± 0.366
GlnGln: 1.693 ± 0.794
GlnArg: 1.27 ± 0.586
GlnSer: 2.117 ± 1.132
GlnThr: 2.54 ± 1.025
GlnVal: 2.54 ± 0.775
GlnTrp: 0.423 ± 0.338
GlnTyr: 1.27 ± 0.586
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.54 ± 0.934
ArgCys: 0.847 ± 0.871
ArgAsp: 3.81 ± 0.955
ArgGlu: 2.117 ± 1.051
ArgPhe: 2.117 ± 0.543
ArgGly: 2.54 ± 1.222
ArgHis: 0.423 ± 0.326
ArgIle: 2.54 ± 1.097
ArgLys: 4.657 ± 1.082
ArgLeu: 7.621 ± 1.486
ArgMet: 0.423 ± 0.436
ArgAsn: 6.774 ± 1.695
ArgPro: 2.54 ± 0.932
ArgGln: 1.693 ± 0.947
ArgArg: 1.27 ± 0.608
ArgSer: 2.117 ± 1.026
ArgThr: 2.117 ± 0.831
ArgVal: 1.27 ± 0.653
ArgTrp: 0.423 ± 0.412
ArgTyr: 2.54 ± 0.807
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.54 ± 0.84
SerCys: 0.423 ± 0.423
SerAsp: 3.81 ± 1.005
SerGlu: 1.27 ± 0.695
SerPhe: 2.964 ± 1.104
SerGly: 2.964 ± 1.083
SerHis: 1.693 ± 0.816
SerIle: 2.964 ± 0.74
SerLys: 6.351 ± 1.502
SerLeu: 5.08 ± 1.45
SerMet: 1.693 ± 0.764
SerAsn: 1.693 ± 0.842
SerPro: 1.27 ± 0.699
SerGln: 2.54 ± 1.068
SerArg: 3.387 ± 1.129
SerSer: 2.964 ± 0.846
SerThr: 3.387 ± 1.191
SerVal: 3.387 ± 0.717
SerTrp: 0.847 ± 0.653
SerTyr: 2.964 ± 0.634
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.81 ± 0.958
ThrCys: 0.0 ± 0.0
ThrAsp: 2.54 ± 0.843
ThrGlu: 4.234 ± 1.158
ThrPhe: 2.54 ± 0.747
ThrGly: 4.234 ± 1.055
ThrHis: 0.423 ± 0.326
ThrIle: 3.387 ± 1.284
ThrLys: 5.504 ± 2.333
ThrLeu: 5.927 ± 1.209
ThrMet: 2.117 ± 0.694
ThrAsn: 2.54 ± 1.367
ThrPro: 2.54 ± 0.97
ThrGln: 2.964 ± 1.055
ThrArg: 2.54 ± 0.754
ThrSer: 2.964 ± 0.78
ThrThr: 4.234 ± 0.727
ThrVal: 4.234 ± 1.258
ThrTrp: 0.423 ± 0.393
ThrTyr: 3.81 ± 1.087
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.08 ± 1.311
ValCys: 0.847 ± 0.366
ValAsp: 2.54 ± 0.834
ValGlu: 2.54 ± 1.134
ValPhe: 0.847 ± 0.507
ValGly: 2.54 ± 0.649
ValHis: 0.423 ± 0.326
ValIle: 3.81 ± 0.885
ValLys: 3.81 ± 0.948
ValLeu: 5.08 ± 0.839
ValMet: 1.27 ± 0.629
ValAsn: 3.387 ± 0.76
ValPro: 3.387 ± 1.133
ValGln: 0.847 ± 0.555
ValArg: 1.27 ± 0.551
ValSer: 2.964 ± 1.165
ValThr: 3.81 ± 1.513
ValVal: 4.234 ± 1.215
ValTrp: 0.0 ± 0.0
ValTyr: 2.54 ± 0.981
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.423 ± 0.326
TrpGlu: 1.27 ± 0.695
TrpPhe: 0.423 ± 0.338
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.847 ± 0.614
TrpLys: 0.423 ± 0.326
TrpLeu: 0.847 ± 0.677
TrpMet: 0.423 ± 0.456
TrpAsn: 0.423 ± 0.436
TrpPro: 0.0 ± 0.0
TrpGln: 0.423 ± 0.338
TrpArg: 0.423 ± 0.338
TrpSer: 0.847 ± 0.418
TrpThr: 0.0 ± 0.0
TrpVal: 1.693 ± 0.513
TrpTrp: 0.423 ± 0.326
TrpTyr: 0.423 ± 0.338
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.27 ± 0.959
TyrCys: 0.423 ± 0.393
TyrAsp: 1.693 ± 0.629
TyrGlu: 4.657 ± 1.013
TyrPhe: 0.423 ± 0.338
TyrGly: 2.54 ± 0.729
TyrHis: 2.117 ± 0.716
TyrIle: 3.387 ± 0.83
TyrLys: 5.504 ± 1.556
TyrLeu: 5.504 ± 1.238
TyrMet: 0.847 ± 0.905
TyrAsn: 2.964 ± 1.369
TyrPro: 1.693 ± 0.741
TyrGln: 2.54 ± 0.721
TyrArg: 4.234 ± 0.973
TyrSer: 2.54 ± 0.573
TyrThr: 1.693 ± 0.852
TyrVal: 1.693 ± 1.06
TyrTrp: 0.0 ± 0.0
TyrTyr: 4.234 ± 1.393
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (2363 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
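A bootstrap at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread over the replicates is taken as the error. This is a minimal Python sketch under those assumptions. The exact procedure used by Proteome-pI (for example, how pairs are counted or which spread statistic is reported) is not stated here, and the function names and toy sequences are hypothetical.

```python
import random
import statistics

def dipeptide_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs in all proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Resample whole proteins with replacement; return (estimate, std dev)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_permille(sample, dipeptide))
    estimate = dipeptide_permille(proteins, dipeptide)
    # Assumption: the error is the standard deviation across replicates.
    return estimate, statistics.pstdev(replicates)

# Toy input with one-letter codes, not real phage proteins.
toy = ["MKLLEEL", "MLLKE", "MKKAELL", "MAEEK"]
estimate, error = bootstrap_error(toy, "LL")
```

A dipeptide that occurs in none of the proteins gets an estimate and an error of exactly zero under this scheme, because no resample can contain it.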

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski