Amino acid dipeptide frequency for Streptococcus phage Javan206

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be expected at a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
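The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names, the toy sequences, and the choice to normalise by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids; anything else becomes "X" (Xaa).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides, in per mille (0.25%).
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: frequencies are normalised by the total number of dipeptides
    counted; pairs are never formed across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            a = first if first in AMINO_ACIDS else "X"
            b = second if second in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation (hypothetical helper)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MKKLA", "MKAX"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), representation(value))
```

With the table values on this page, `representation(4.114)` labels AlaAla as "over" and `representation(0.179)` labels AlaCys as "under" relative to the 2.5 ‰ uniform expectation.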

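All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions (for example, 4.114 ‰ = 0.004114). On this scale, the uniform expectation of 0.25% corresponds to 2.5 ‰.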

For more information, see the article on sequence space on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is the frequency in ‰ ± its estimated error.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.114 ± 1.475 | 0.179 ± 0.119 | 5.187 ± 0.735 | 6.171 ± 0.8 | 3.13 ± 0.699 | 5.545 ± 0.937 | 0.715 ± 0.234 | 5.724 ± 1.029 | 5.635 ± 0.868 | 7.334 ± 1.256 | 2.415 ± 0.788 | 4.651 ± 0.648 | 2.057 ± 0.334 | 3.13 ± 0.868 | 2.236 ± 0.394 | 5.098 ± 0.972 | 4.382 ± 0.895 | 4.025 ± 0.927 | 0.626 ± 0.199 | 3.309 ± 0.735 | 0.0 ± 0.0
Cys | 0.447 ± 0.226 | 0.0 ± 0.0 | 0.179 ± 0.133 | 0.626 ± 0.247 | 0.179 ± 0.126 | 0.715 ± 0.286 | 0.179 ± 0.127 | 0.179 ± 0.156 | 0.358 ± 0.17 | 0.268 ± 0.161 | 0.089 ± 0.105 | 0.447 ± 0.242 | 0.0 ± 0.0 | 0.089 ± 0.084 | 0.179 ± 0.122 | 0.179 ± 0.154 | 0.268 ± 0.172 | 0.358 ± 0.167 | 0.089 ± 0.087 | 0.358 ± 0.177 | 0.0 ± 0.0
Asp | 3.846 ± 0.528 | 0.447 ± 0.288 | 5.366 ± 1.076 | 5.724 ± 1.096 | 3.667 ± 0.571 | 5.724 ± 0.702 | 0.447 ± 0.216 | 4.919 ± 0.953 | 4.472 ± 0.689 | 4.561 ± 0.739 | 1.699 ± 0.355 | 3.756 ± 0.538 | 1.163 ± 0.395 | 1.789 ± 0.399 | 2.325 ± 0.456 | 4.382 ± 0.713 | 4.83 ± 0.84 | 3.488 ± 0.642 | 0.894 ± 0.424 | 3.577 ± 0.64 | 0.0 ± 0.0
Glu | 4.382 ± 0.713 | 0.179 ± 0.126 | 3.935 ± 0.651 | 6.976 ± 1.435 | 3.13 ± 0.517 | 3.22 ± 0.547 | 0.805 ± 0.255 | 4.83 ± 0.942 | 4.651 ± 0.969 | 6.976 ± 1.062 | 2.504 ± 0.614 | 3.756 ± 0.566 | 1.789 ± 0.547 | 4.293 ± 0.667 | 3.488 ± 0.666 | 2.504 ± 0.564 | 3.22 ± 0.654 | 5.366 ± 0.935 | 1.073 ± 0.311 | 2.773 ± 0.623 | 0.0 ± 0.0
Phe | 2.773 ± 0.402 | 0.358 ± 0.178 | 3.488 ± 0.686 | 3.577 ± 0.656 | 1.342 ± 0.304 | 2.951 ± 0.614 | 0.358 ± 0.174 | 2.146 ± 0.399 | 5.187 ± 0.626 | 2.325 ± 0.517 | 1.073 ± 0.323 | 2.862 ± 0.46 | 1.073 ± 0.384 | 1.699 ± 0.497 | 1.163 ± 0.322 | 4.293 ± 0.693 | 2.146 ± 0.42 | 2.415 ± 0.336 | 0.358 ± 0.201 | 1.342 ± 0.345 | 0.0 ± 0.0
Gly | 5.098 ± 1.56 | 0.447 ± 0.226 | 3.309 ± 0.845 | 3.041 ± 0.491 | 2.594 ± 0.407 | 3.935 ± 0.663 | 0.805 ± 0.214 | 5.545 ± 1.137 | 5.366 ± 0.692 | 4.74 ± 0.571 | 1.61 ± 0.379 | 4.204 ± 1.19 | 0.715 ± 0.351 | 2.862 ± 0.629 | 3.756 ± 0.543 | 4.382 ± 0.86 | 4.74 ± 0.875 | 4.293 ± 0.729 | 0.537 ± 0.211 | 3.13 ± 0.755 | 0.0 ± 0.0
His | 0.805 ± 0.258 | 0.179 ± 0.121 | 0.894 ± 0.276 | 0.894 ± 0.26 | 0.626 ± 0.256 | 0.894 ± 0.267 | 0.179 ± 0.124 | 0.805 ± 0.322 | 0.715 ± 0.244 | 0.805 ± 0.218 | 0.447 ± 0.245 | 0.715 ± 0.254 | 0.358 ± 0.21 | 0.447 ± 0.178 | 0.268 ± 0.157 | 0.626 ± 0.243 | 0.715 ± 0.259 | 0.894 ± 0.302 | 0.179 ± 0.143 | 0.358 ± 0.187 | 0.0 ± 0.0
Ile | 5.456 ± 0.896 | 0.179 ± 0.114 | 5.098 ± 0.624 | 4.919 ± 0.903 | 2.594 ± 0.43 | 4.472 ± 1.115 | 0.715 ± 0.242 | 4.025 ± 0.734 | 6.708 ± 0.919 | 3.756 ± 0.58 | 1.073 ± 0.292 | 5.277 ± 0.673 | 2.146 ± 0.471 | 2.236 ± 0.401 | 2.683 ± 0.448 | 4.382 ± 0.745 | 4.293 ± 0.652 | 3.577 ± 0.691 | 0.447 ± 0.167 | 2.773 ± 0.544 | 0.0 ± 0.0
Lys | 7.334 ± 1.113 | 0.537 ± 0.197 | 5.813 ± 0.929 | 5.008 ± 1.108 | 2.504 ± 0.544 | 4.114 ± 0.909 | 1.61 ± 0.517 | 5.187 ± 0.872 | 6.618 ± 1.024 | 6.529 ± 1.18 | 2.236 ± 0.542 | 4.472 ± 0.721 | 2.325 ± 0.579 | 3.488 ± 0.574 | 4.025 ± 0.748 | 5.008 ± 0.703 | 5.277 ± 0.626 | 4.293 ± 0.498 | 0.894 ± 0.272 | 3.577 ± 0.686 | 0.0 ± 0.0
Leu | 6.082 ± 0.72 | 0.358 ± 0.15 | 5.635 ± 0.964 | 5.545 ± 0.883 | 3.13 ± 0.569 | 5.366 ± 0.724 | 0.626 ± 0.24 | 4.025 ± 0.649 | 7.423 ± 0.806 | 3.756 ± 0.767 | 1.789 ± 0.455 | 4.74 ± 0.59 | 3.13 ± 0.636 | 2.504 ± 0.494 | 2.325 ± 0.56 | 6.35 ± 0.836 | 5.187 ± 0.747 | 4.74 ± 0.698 | 0.537 ± 0.196 | 1.878 ± 0.446 | 0.0 ± 0.0
Met | 1.431 ± 0.542 | 0.268 ± 0.156 | 1.073 ± 0.29 | 1.163 ± 0.427 | 0.984 ± 0.299 | 1.342 ± 0.372 | 0.358 ± 0.227 | 1.52 ± 0.327 | 2.862 ± 0.503 | 1.878 ± 0.487 | 0.805 ± 0.222 | 1.073 ± 0.321 | 0.715 ± 0.201 | 1.61 ± 0.388 | 1.52 ± 0.363 | 2.236 ± 0.528 | 2.415 ± 0.532 | 1.342 ± 0.305 | 0.358 ± 0.155 | 1.252 ± 0.326 | 0.0 ± 0.0
Asn | 4.919 ± 0.72 | 0.179 ± 0.137 | 4.382 ± 0.701 | 5.008 ± 0.717 | 2.594 ± 0.761 | 4.651 ± 0.981 | 0.805 ± 0.226 | 3.399 ± 0.501 | 3.935 ± 0.713 | 4.293 ± 0.5 | 1.342 ± 0.434 | 3.22 ± 0.673 | 2.594 ± 0.487 | 3.13 ± 0.747 | 1.52 ± 0.348 | 3.309 ± 0.525 | 3.577 ± 0.686 | 4.114 ± 0.825 | 0.894 ± 0.375 | 1.789 ± 0.388 | 0.0 ± 0.0
Pro | 2.773 ± 0.514 | 0.179 ± 0.128 | 1.61 ± 0.557 | 2.773 ± 0.518 | 1.52 ± 0.281 | 0.984 ± 0.34 | 0.358 ± 0.154 | 1.699 ± 0.457 | 2.415 ± 0.553 | 1.878 ± 0.395 | 0.626 ± 0.232 | 0.894 ± 0.28 | 0.984 ± 0.289 | 1.431 ± 0.444 | 1.073 ± 0.391 | 1.61 ± 0.378 | 2.325 ± 0.468 | 2.146 ± 0.437 | 0.0 ± 0.0 | 1.073 ± 0.317 | 0.0 ± 0.0
Gln | 3.667 ± 0.663 | 0.179 ± 0.209 | 2.236 ± 0.415 | 2.773 ± 0.612 | 1.699 ± 0.311 | 3.041 ± 0.705 | 0.447 ± 0.169 | 3.488 ± 0.514 | 3.22 ± 0.802 | 4.114 ± 0.535 | 2.146 ± 0.514 | 2.862 ± 0.969 | 0.537 ± 0.275 | 2.146 ± 0.511 | 1.431 ± 0.369 | 2.594 ± 0.546 | 2.415 ± 0.512 | 2.504 ± 0.545 | 0.626 ± 0.271 | 1.163 ± 0.332 | 0.0 ± 0.0
Arg | 3.309 ± 0.653 | 0.089 ± 0.097 | 2.146 ± 0.449 | 2.146 ± 0.599 | 1.878 ± 0.405 | 2.057 ± 0.491 | 0.179 ± 0.132 | 2.236 ± 0.45 | 3.22 ± 0.501 | 3.667 ± 0.516 | 1.342 ± 0.382 | 2.236 ± 0.417 | 1.52 ± 0.375 | 1.252 ± 0.406 | 1.163 ± 0.405 | 2.146 ± 0.529 | 2.325 ± 0.386 | 2.146 ± 0.459 | 0.805 ± 0.302 | 1.878 ± 0.351 | 0.0 ± 0.0
Ser | 4.561 ± 1.08 | 0.358 ± 0.149 | 5.366 ± 0.837 | 4.651 ± 0.624 | 2.862 ± 0.647 | 5.187 ± 1.396 | 0.894 ± 0.249 | 5.187 ± 0.873 | 4.293 ± 0.703 | 3.846 ± 0.573 | 1.61 ± 0.41 | 4.919 ± 0.719 | 1.968 ± 0.442 | 3.041 ± 0.687 | 1.431 ± 0.335 | 3.488 ± 0.526 | 4.204 ± 0.747 | 4.293 ± 0.762 | 1.342 ± 0.315 | 2.236 ± 0.449 | 0.0 ± 0.0
Thr | 6.529 ± 0.947 | 0.179 ± 0.124 | 3.935 ± 0.682 | 2.773 ± 0.492 | 3.22 ± 0.701 | 3.846 ± 0.715 | 0.715 ± 0.208 | 4.204 ± 0.523 | 4.204 ± 0.74 | 5.992 ± 1.014 | 0.805 ± 0.327 | 2.773 ± 0.533 | 2.325 ± 0.448 | 3.041 ± 0.719 | 2.236 ± 0.49 | 3.577 ± 0.622 | 3.041 ± 0.778 | 6.082 ± 1.211 | 0.537 ± 0.264 | 2.862 ± 0.525 | 0.0 ± 0.0
Val | 4.919 ± 1.083 | 0.179 ± 0.111 | 4.025 ± 0.558 | 3.488 ± 0.747 | 2.683 ± 0.615 | 4.025 ± 1.006 | 0.447 ± 0.21 | 3.756 ± 0.623 | 4.919 ± 0.569 | 4.382 ± 0.604 | 1.968 ± 0.345 | 3.399 ± 0.529 | 1.252 ± 0.523 | 3.041 ± 0.591 | 2.236 ± 0.392 | 5.545 ± 0.709 | 4.204 ± 0.475 | 3.935 ± 0.551 | 1.073 ± 0.305 | 3.22 ± 0.546 | 0.0 ± 0.0
Trp | 0.626 ± 0.179 | 0.089 ± 0.078 | 0.626 ± 0.228 | 0.537 ± 0.252 | 0.447 ± 0.214 | 0.984 ± 0.296 | 0.358 ± 0.183 | 0.984 ± 0.343 | 1.52 ± 0.465 | 0.626 ± 0.228 | 0.089 ± 0.071 | 0.537 ± 0.186 | 0.0 ± 0.0 | 0.626 ± 0.222 | 0.537 ± 0.243 | 1.52 ± 0.418 | 0.715 ± 0.379 | 0.537 ± 0.279 | 0.179 ± 0.131 | 0.447 ± 0.218 | 0.0 ± 0.0
Tyr | 2.594 ± 0.553 | 0.626 ± 0.381 | 2.773 ± 0.584 | 2.057 ± 0.507 | 2.146 ± 0.509 | 2.057 ± 0.414 | 0.805 ± 0.271 | 2.951 ± 0.421 | 3.22 ± 0.466 | 3.399 ± 0.712 | 0.447 ± 0.191 | 2.862 ± 0.605 | 1.699 ± 0.338 | 1.431 ± 0.364 | 2.146 ± 0.44 | 2.504 ± 0.412 | 2.594 ± 0.453 | 2.146 ± 0.461 | 0.537 ± 0.189 | 2.057 ± 0.723 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 50 proteins (11182 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
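Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported. This is only a sketch under stated assumptions. The function names are hypothetical, and using the standard deviation of the replicates as the "error" is an assumption, since the page does not say which spread measure was used.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (e.g. "KK") in per mille over all proteins."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the dipeptide frequency under protein-level resampling.

    Each round draws len(proteins) whole proteins with replacement, so
    residues within one protein always stay together.
    Assumption: the error is the sample standard deviation of the replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not the 50 proteins of this proteome).
toy = ["MKKLA", "MKAX", "AAKK"]
print(pair_per_mille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling proteins rather than individual dipeptides keeps the correlation between pairs inside one protein, which is why a few large or unusual proteins can produce wide error bars.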

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski