Amino acid dipepetide frequency for Streptococcus phage Javan486

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.737AlaAla: 4.737 ± 1.274
0.499AlaCys: 0.499 ± 0.201
4.321AlaAsp: 4.321 ± 0.601
5.319AlaGlu: 5.319 ± 0.766
1.828AlaPhe: 1.828 ± 0.395
3.906AlaGly: 3.906 ± 0.653
0.665AlaHis: 0.665 ± 0.192
5.402AlaIle: 5.402 ± 0.674
6.399AlaLys: 6.399 ± 0.78
6.981AlaLeu: 6.981 ± 0.717
1.995AlaMet: 1.995 ± 0.401
3.989AlaAsn: 3.989 ± 0.616
1.828AlaPro: 1.828 ± 0.373
3.407AlaGln: 3.407 ± 0.541
3.241AlaArg: 3.241 ± 0.51
4.986AlaSer: 4.986 ± 1.196
4.321AlaThr: 4.321 ± 0.9
4.488AlaVal: 4.488 ± 0.661
0.499AlaTrp: 0.499 ± 0.176
2.742AlaTyr: 2.742 ± 0.453
0.0AlaXaa: 0.0 ± 0.0
Cys
0.249CysAla: 0.249 ± 0.145
0.166CysCys: 0.166 ± 0.117
0.582CysAsp: 0.582 ± 0.19
0.831CysGlu: 0.831 ± 0.302
0.499CysPhe: 0.499 ± 0.187
0.499CysGly: 0.499 ± 0.179
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.499CysLys: 0.499 ± 0.179
1.163CysLeu: 1.163 ± 0.257
0.166CysMet: 0.166 ± 0.137
0.083CysAsn: 0.083 ± 0.08
0.083CysPro: 0.083 ± 0.079
0.249CysGln: 0.249 ± 0.156
0.332CysArg: 0.332 ± 0.159
0.332CysSer: 0.332 ± 0.157
0.083CysThr: 0.083 ± 0.078
0.166CysVal: 0.166 ± 0.103
0.083CysTrp: 0.083 ± 0.079
0.332CysTyr: 0.332 ± 0.155
0.0CysXaa: 0.0 ± 0.0
Asp
4.405AspAla: 4.405 ± 0.717
0.582AspCys: 0.582 ± 0.211
3.823AspAsp: 3.823 ± 0.781
4.82AspGlu: 4.82 ± 0.635
2.992AspPhe: 2.992 ± 0.416
6.565AspGly: 6.565 ± 0.771
0.914AspHis: 0.914 ± 0.224
3.74AspIle: 3.74 ± 0.597
5.817AspLys: 5.817 ± 0.725
7.396AspLeu: 7.396 ± 0.963
1.33AspMet: 1.33 ± 0.296
4.654AspAsn: 4.654 ± 0.551
1.662AspPro: 1.662 ± 0.392
1.579AspGln: 1.579 ± 0.356
2.161AspArg: 2.161 ± 0.486
3.407AspSer: 3.407 ± 0.554
3.407AspThr: 3.407 ± 0.521
4.571AspVal: 4.571 ± 0.698
0.499AspTrp: 0.499 ± 0.184
2.826AspTyr: 2.826 ± 0.577
0.0AspXaa: 0.0 ± 0.0
Glu
5.319GluAla: 5.319 ± 0.673
0.249GluCys: 0.249 ± 0.15
3.574GluAsp: 3.574 ± 0.724
5.651GluGlu: 5.651 ± 0.864
3.241GluPhe: 3.241 ± 0.491
2.576GluGly: 2.576 ± 0.51
1.08GluHis: 1.08 ± 0.349
6.067GluIle: 6.067 ± 0.766
5.651GluLys: 5.651 ± 0.639
8.394GluLeu: 8.394 ± 0.758
1.828GluMet: 1.828 ± 0.37
3.407GluAsn: 3.407 ± 0.577
2.327GluPro: 2.327 ± 0.564
2.909GluGln: 2.909 ± 0.474
3.657GluArg: 3.657 ± 0.672
3.989GluSer: 3.989 ± 0.466
4.654GluThr: 4.654 ± 0.627
4.571GluVal: 4.571 ± 0.765
0.416GluTrp: 0.416 ± 0.163
3.075GluTyr: 3.075 ± 0.548
0.0GluXaa: 0.0 ± 0.0
Phe
1.995PheAla: 1.995 ± 0.413
0.582PheCys: 0.582 ± 0.221
3.906PheAsp: 3.906 ± 0.552
3.657PheGlu: 3.657 ± 0.758
0.831PhePhe: 0.831 ± 0.279
2.41PheGly: 2.41 ± 0.576
0.416PheHis: 0.416 ± 0.185
2.244PheIle: 2.244 ± 0.478
3.324PheLys: 3.324 ± 0.49
2.493PheLeu: 2.493 ± 0.461
0.914PheMet: 0.914 ± 0.287
2.659PheAsn: 2.659 ± 0.491
0.249PhePro: 0.249 ± 0.145
0.499PheGln: 0.499 ± 0.162
2.078PheArg: 2.078 ± 0.517
2.41PheSer: 2.41 ± 0.411
2.161PheThr: 2.161 ± 0.375
2.41PheVal: 2.41 ± 0.355
0.249PheTrp: 0.249 ± 0.163
1.413PheTyr: 1.413 ± 0.334
0.0PheXaa: 0.0 ± 0.0
Gly
4.488GlyAla: 4.488 ± 0.682
0.499GlyCys: 0.499 ± 0.201
3.49GlyAsp: 3.49 ± 0.48
3.574GlyGlu: 3.574 ± 0.606
3.075GlyPhe: 3.075 ± 0.424
3.49GlyGly: 3.49 ± 0.477
1.745GlyHis: 1.745 ± 0.369
4.903GlyIle: 4.903 ± 0.753
5.152GlyLys: 5.152 ± 0.494
4.571GlyLeu: 4.571 ± 0.69
1.33GlyMet: 1.33 ± 0.316
2.826GlyAsn: 2.826 ± 0.493
0.831GlyPro: 0.831 ± 0.283
2.992GlyGln: 2.992 ± 0.613
2.327GlyArg: 2.327 ± 0.437
4.072GlySer: 4.072 ± 0.578
3.74GlyThr: 3.74 ± 0.671
6.067GlyVal: 6.067 ± 0.762
1.163GlyTrp: 1.163 ± 0.366
2.576GlyTyr: 2.576 ± 0.572
0.0GlyXaa: 0.0 ± 0.0
His
1.413HisAla: 1.413 ± 0.348
0.083HisCys: 0.083 ± 0.074
0.582HisAsp: 0.582 ± 0.242
1.08HisGlu: 1.08 ± 0.292
0.914HisPhe: 0.914 ± 0.244
1.08HisGly: 1.08 ± 0.322
0.083HisHis: 0.083 ± 0.096
0.831HisIle: 0.831 ± 0.23
1.08HisLys: 1.08 ± 0.326
1.911HisLeu: 1.911 ± 0.446
0.166HisMet: 0.166 ± 0.115
0.914HisAsn: 0.914 ± 0.238
0.665HisPro: 0.665 ± 0.231
0.416HisGln: 0.416 ± 0.183
0.665HisArg: 0.665 ± 0.22
1.163HisSer: 1.163 ± 0.326
1.163HisThr: 1.163 ± 0.291
0.748HisVal: 0.748 ± 0.311
0.332HisTrp: 0.332 ± 0.146
0.499HisTyr: 0.499 ± 0.212
0.0HisXaa: 0.0 ± 0.0
Ile
5.734IleAla: 5.734 ± 0.812
0.166IleCys: 0.166 ± 0.114
5.485IleAsp: 5.485 ± 0.696
5.402IleGlu: 5.402 ± 0.704
2.161IlePhe: 2.161 ± 0.471
3.075IleGly: 3.075 ± 0.419
0.748IleHis: 0.748 ± 0.288
3.657IleIle: 3.657 ± 0.53
8.144IleLys: 8.144 ± 0.701
4.488IleLeu: 4.488 ± 0.685
1.413IleMet: 1.413 ± 0.402
4.155IleAsn: 4.155 ± 0.606
2.078IlePro: 2.078 ± 0.365
2.078IleGln: 2.078 ± 0.38
2.161IleArg: 2.161 ± 0.394
4.986IleSer: 4.986 ± 0.703
4.654IleThr: 4.654 ± 0.548
3.989IleVal: 3.989 ± 0.691
0.582IleTrp: 0.582 ± 0.249
2.659IleTyr: 2.659 ± 0.42
0.0IleXaa: 0.0 ± 0.0
Lys
6.731LysAla: 6.731 ± 0.745
0.499LysCys: 0.499 ± 0.194
6.067LysAsp: 6.067 ± 0.691
6.815LysGlu: 6.815 ± 0.789
1.995LysPhe: 1.995 ± 0.447
4.654LysGly: 4.654 ± 0.639
1.413LysHis: 1.413 ± 0.335
6.399LysIle: 6.399 ± 0.833
7.729LysLys: 7.729 ± 0.584
6.981LysLeu: 6.981 ± 0.737
2.327LysMet: 2.327 ± 0.51
5.319LysAsn: 5.319 ± 0.612
2.576LysPro: 2.576 ± 0.667
3.74LysGln: 3.74 ± 0.706
4.072LysArg: 4.072 ± 0.602
4.238LysSer: 4.238 ± 0.482
4.986LysThr: 4.986 ± 0.785
5.485LysVal: 5.485 ± 0.601
0.914LysTrp: 0.914 ± 0.306
3.74LysTyr: 3.74 ± 0.529
0.0LysXaa: 0.0 ± 0.0
Leu
7.479LeuAla: 7.479 ± 0.955
0.332LeuCys: 0.332 ± 0.188
7.563LeuAsp: 7.563 ± 0.996
6.565LeuGlu: 6.565 ± 0.689
3.075LeuPhe: 3.075 ± 0.393
6.067LeuGly: 6.067 ± 0.719
0.997LeuHis: 0.997 ± 0.316
5.319LeuIle: 5.319 ± 0.797
8.892LeuLys: 8.892 ± 0.695
6.815LeuLeu: 6.815 ± 0.954
1.828LeuMet: 1.828 ± 0.443
5.069LeuAsn: 5.069 ± 0.457
1.911LeuPro: 1.911 ± 0.388
3.075LeuGln: 3.075 ± 0.443
3.906LeuArg: 3.906 ± 0.74
5.651LeuSer: 5.651 ± 0.542
4.82LeuThr: 4.82 ± 0.579
4.82LeuVal: 4.82 ± 0.676
0.831LeuTrp: 0.831 ± 0.199
2.659LeuTyr: 2.659 ± 0.538
0.0LeuXaa: 0.0 ± 0.0
Met
1.995MetAla: 1.995 ± 0.414
0.166MetCys: 0.166 ± 0.108
1.33MetAsp: 1.33 ± 0.396
1.413MetGlu: 1.413 ± 0.43
0.582MetPhe: 0.582 ± 0.261
1.496MetGly: 1.496 ± 0.405
0.332MetHis: 0.332 ± 0.154
1.662MetIle: 1.662 ± 0.328
1.33MetLys: 1.33 ± 0.372
1.579MetLeu: 1.579 ± 0.396
0.499MetMet: 0.499 ± 0.212
1.08MetAsn: 1.08 ± 0.279
0.997MetPro: 0.997 ± 0.267
1.163MetGln: 1.163 ± 0.402
2.078MetArg: 2.078 ± 0.327
2.078MetSer: 2.078 ± 0.388
2.41MetThr: 2.41 ± 0.414
0.914MetVal: 0.914 ± 0.273
0.249MetTrp: 0.249 ± 0.154
0.416MetTyr: 0.416 ± 0.172
0.0MetXaa: 0.0 ± 0.0
Asn
3.906AsnAla: 3.906 ± 0.751
0.416AsnCys: 0.416 ± 0.169
3.407AsnAsp: 3.407 ± 0.54
3.241AsnGlu: 3.241 ± 0.51
2.161AsnPhe: 2.161 ± 0.452
4.155AsnGly: 4.155 ± 0.645
1.247AsnHis: 1.247 ± 0.377
3.657AsnIle: 3.657 ± 0.617
4.654AsnLys: 4.654 ± 0.535
5.236AsnLeu: 5.236 ± 0.574
1.08AsnMet: 1.08 ± 0.255
3.407AsnAsn: 3.407 ± 0.529
2.078AsnPro: 2.078 ± 0.47
2.659AsnGln: 2.659 ± 0.545
2.742AsnArg: 2.742 ± 0.408
4.155AsnSer: 4.155 ± 0.762
2.576AsnThr: 2.576 ± 0.505
2.659AsnVal: 2.659 ± 0.433
0.914AsnTrp: 0.914 ± 0.321
1.579AsnTyr: 1.579 ± 0.311
0.0AsnXaa: 0.0 ± 0.0
Pro
1.247ProAla: 1.247 ± 0.322
0.249ProCys: 0.249 ± 0.125
1.911ProAsp: 1.911 ± 0.373
1.828ProGlu: 1.828 ± 0.352
1.496ProPhe: 1.496 ± 0.416
1.163ProGly: 1.163 ± 0.39
0.914ProHis: 0.914 ± 0.293
1.745ProIle: 1.745 ± 0.455
2.659ProLys: 2.659 ± 0.547
2.327ProLeu: 2.327 ± 0.515
0.499ProMet: 0.499 ± 0.176
1.08ProAsn: 1.08 ± 0.338
0.582ProPro: 0.582 ± 0.218
1.662ProGln: 1.662 ± 0.362
0.914ProArg: 0.914 ± 0.3
1.579ProSer: 1.579 ± 0.316
1.163ProThr: 1.163 ± 0.274
2.244ProVal: 2.244 ± 0.422
0.249ProTrp: 0.249 ± 0.131
1.33ProTyr: 1.33 ± 0.359
0.0ProXaa: 0.0 ± 0.0
Gln
2.493GlnAla: 2.493 ± 0.478
0.166GlnCys: 0.166 ± 0.109
1.911GlnAsp: 1.911 ± 0.394
3.241GlnGlu: 3.241 ± 0.549
1.247GlnPhe: 1.247 ± 0.285
2.742GlnGly: 2.742 ± 0.475
0.332GlnHis: 0.332 ± 0.156
3.075GlnIle: 3.075 ± 0.494
2.826GlnLys: 2.826 ± 0.551
3.906GlnLeu: 3.906 ± 0.517
1.413GlnMet: 1.413 ± 0.329
2.161GlnAsn: 2.161 ± 0.506
0.582GlnPro: 0.582 ± 0.277
1.662GlnGln: 1.662 ± 0.539
2.161GlnArg: 2.161 ± 0.361
3.324GlnSer: 3.324 ± 0.57
2.742GlnThr: 2.742 ± 0.538
1.579GlnVal: 1.579 ± 0.354
0.499GlnTrp: 0.499 ± 0.18
1.163GlnTyr: 1.163 ± 0.28
0.0GlnXaa: 0.0 ± 0.0
Arg
3.324ArgAla: 3.324 ± 0.602
0.332ArgCys: 0.332 ± 0.201
1.995ArgAsp: 1.995 ± 0.389
3.823ArgGlu: 3.823 ± 0.566
1.413ArgPhe: 1.413 ± 0.279
3.407ArgGly: 3.407 ± 0.47
0.831ArgHis: 0.831 ± 0.296
2.826ArgIle: 2.826 ± 0.413
3.158ArgLys: 3.158 ± 0.506
4.072ArgLeu: 4.072 ± 0.607
0.914ArgMet: 0.914 ± 0.183
2.576ArgAsn: 2.576 ± 0.381
0.831ArgPro: 0.831 ± 0.306
2.327ArgGln: 2.327 ± 0.427
1.911ArgArg: 1.911 ± 0.342
2.493ArgSer: 2.493 ± 0.442
2.493ArgThr: 2.493 ± 0.42
2.161ArgVal: 2.161 ± 0.442
0.831ArgTrp: 0.831 ± 0.248
1.911ArgTyr: 1.911 ± 0.585
0.0ArgXaa: 0.0 ± 0.0
Ser
4.321SerAla: 4.321 ± 0.978
0.332SerCys: 0.332 ± 0.153
4.571SerAsp: 4.571 ± 0.66
4.072SerGlu: 4.072 ± 0.551
2.576SerPhe: 2.576 ± 0.474
3.989SerGly: 3.989 ± 0.557
0.831SerHis: 0.831 ± 0.279
4.155SerIle: 4.155 ± 0.602
4.903SerLys: 4.903 ± 0.578
5.568SerLeu: 5.568 ± 0.75
1.828SerMet: 1.828 ± 0.376
3.906SerAsn: 3.906 ± 0.787
1.662SerPro: 1.662 ± 0.323
2.41SerGln: 2.41 ± 0.495
2.41SerArg: 2.41 ± 0.404
3.075SerSer: 3.075 ± 0.675
3.74SerThr: 3.74 ± 0.72
3.74SerVal: 3.74 ± 0.506
1.163SerTrp: 1.163 ± 0.322
2.659SerTyr: 2.659 ± 0.532
0.0SerXaa: 0.0 ± 0.0
Thr
4.986ThrAla: 4.986 ± 0.907
0.166ThrCys: 0.166 ± 0.142
4.321ThrAsp: 4.321 ± 0.631
3.49ThrGlu: 3.49 ± 0.588
2.41ThrPhe: 2.41 ± 0.468
4.986ThrGly: 4.986 ± 0.589
0.997ThrHis: 0.997 ± 0.256
4.238ThrIle: 4.238 ± 0.507
4.82ThrLys: 4.82 ± 0.645
4.571ThrLeu: 4.571 ± 0.541
1.496ThrMet: 1.496 ± 0.408
2.41ThrAsn: 2.41 ± 0.377
2.493ThrPro: 2.493 ± 0.379
2.078ThrGln: 2.078 ± 0.348
2.327ThrArg: 2.327 ± 0.483
3.574ThrSer: 3.574 ± 0.548
5.568ThrThr: 5.568 ± 0.764
4.488ThrVal: 4.488 ± 0.532
0.665ThrTrp: 0.665 ± 0.252
1.828ThrTyr: 1.828 ± 0.472
0.0ThrXaa: 0.0 ± 0.0
Val
3.906ValAla: 3.906 ± 0.513
0.166ValCys: 0.166 ± 0.126
4.571ValAsp: 4.571 ± 0.489
4.986ValGlu: 4.986 ± 0.748
2.244ValPhe: 2.244 ± 0.589
4.155ValGly: 4.155 ± 0.646
0.831ValHis: 0.831 ± 0.218
5.069ValIle: 5.069 ± 0.635
4.654ValLys: 4.654 ± 0.573
5.152ValLeu: 5.152 ± 0.691
1.995ValMet: 1.995 ± 0.336
3.823ValAsn: 3.823 ± 0.427
1.745ValPro: 1.745 ± 0.581
1.662ValGln: 1.662 ± 0.352
2.826ValArg: 2.826 ± 0.379
3.574ValSer: 3.574 ± 0.561
4.155ValThr: 4.155 ± 0.645
4.405ValVal: 4.405 ± 0.544
0.416ValTrp: 0.416 ± 0.181
2.161ValTyr: 2.161 ± 0.515
0.0ValXaa: 0.0 ± 0.0
Trp
0.582TrpAla: 0.582 ± 0.161
0.083TrpCys: 0.083 ± 0.08
0.748TrpAsp: 0.748 ± 0.272
0.914TrpGlu: 0.914 ± 0.299
0.748TrpPhe: 0.748 ± 0.243
0.499TrpGly: 0.499 ± 0.269
0.249TrpHis: 0.249 ± 0.147
0.665TrpIle: 0.665 ± 0.256
1.247TrpLys: 1.247 ± 0.322
0.748TrpLeu: 0.748 ± 0.303
0.166TrpMet: 0.166 ± 0.114
0.416TrpAsn: 0.416 ± 0.165
0.166TrpPro: 0.166 ± 0.116
0.748TrpGln: 0.748 ± 0.331
0.499TrpArg: 0.499 ± 0.217
0.665TrpSer: 0.665 ± 0.214
0.748TrpThr: 0.748 ± 0.296
0.665TrpVal: 0.665 ± 0.204
0.0TrpTrp: 0.0 ± 0.0
0.416TrpTyr: 0.416 ± 0.183
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.244TyrAla: 2.244 ± 0.361
0.665TyrCys: 0.665 ± 0.241
2.992TyrAsp: 2.992 ± 0.508
1.911TyrGlu: 1.911 ± 0.444
1.413TyrPhe: 1.413 ± 0.428
1.995TyrGly: 1.995 ± 0.375
1.163TyrHis: 1.163 ± 0.333
2.161TyrIle: 2.161 ± 0.347
3.74TyrLys: 3.74 ± 0.69
3.324TyrLeu: 3.324 ± 0.682
0.499TyrMet: 0.499 ± 0.227
1.995TyrAsn: 1.995 ± 0.407
1.496TyrPro: 1.496 ± 0.417
1.911TyrGln: 1.911 ± 0.333
1.163TyrArg: 1.163 ± 0.284
2.161TyrSer: 2.161 ± 0.503
2.327TyrThr: 2.327 ± 0.531
2.41TyrVal: 2.41 ± 0.485
0.416TyrTrp: 0.416 ± 0.173
1.828TyrTyr: 1.828 ± 0.466
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (12034 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski