Amino acid dipeptide frequency for Streptococcus phage IPP42

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
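
As a rough illustration of what such a calculation might look like, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, and the normalisation by the total number of counted dipeptides are assumptions made for this example; the database's exact procedure is not described on this page.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping, counted within each protein only
    (never across protein boundaries), and normalised by the total
    number of dipeptides counted.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, one-letter codes)
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
```

In the toy input, "KK" occurs twice among six counted dipeptides, so its frequency is one third of 1000‰.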

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
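
The expected value and the over/under comparison can be sketched as follows. This is only an illustration: the helper names are hypothetical, and it assumes a plain threshold comparison against the uniform expectation, whereas the page does not say whether the colouring uses 1/400 or 1/441, or whether the error estimate is taken into account.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(freq_permille, expected):
    """Label a dipeptide relative to the expected frequency (assumed rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"

expected_20 = uniform_expectation_permille(20)  # 400 dipeptides
expected_21 = uniform_expectation_permille(21)  # 441 dipeptides, with Xaa

# Two values taken from the table: GluLeu and CysCys
glu_leu = classify(9.119, expected_20)
cys_cys = classify(0.0, expected_20)
```

With these assumptions, GluLeu (9.119‰) falls in the "over" class and CysCys (0.0‰) in the "under" class.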

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
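
A minimal sketch of the unit conversion, with hypothetical helper names, in case the values need to be compared with fractions or percentages from other sources:

```python
def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0

def permille_to_percent(value):
    """Convert a per-mille value to a percentage."""
    return value / 10.0

# GluLys from the table: 8.116 per mille
fraction = permille_to_fraction(8.116)
percent = permille_to_percent(8.116)
```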

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.553 ± 0.888
AlaCys: 0.456 ± 0.239
AlaAsp: 4.377 ± 0.605
AlaGlu: 6.383 ± 1.007
AlaPhe: 1.733 ± 0.334
AlaGly: 4.195 ± 1.256
AlaHis: 0.456 ± 0.199
AlaIle: 4.924 ± 1.439
AlaLys: 5.38 ± 0.599
AlaLeu: 6.292 ± 0.927
AlaMet: 2.097 ± 0.429
AlaAsn: 3.648 ± 0.8
AlaPro: 1.459 ± 0.367
AlaGln: 2.918 ± 0.676
AlaArg: 2.918 ± 0.374
AlaSer: 4.56 ± 0.934
AlaThr: 4.924 ± 0.906
AlaVal: 3.83 ± 0.683
AlaTrp: 1.094 ± 0.333
AlaTyr: 2.645 ± 0.408
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.365 ± 0.149
CysCys: 0.0 ± 0.0
CysAsp: 0.274 ± 0.191
CysGlu: 0.638 ± 0.323
CysPhe: 0.182 ± 0.115
CysGly: 0.274 ± 0.132
CysHis: 0.365 ± 0.172
CysIle: 0.091 ± 0.105
CysLys: 0.547 ± 0.322
CysLeu: 0.73 ± 0.279
CysMet: 0.0 ± 0.0
CysAsn: 0.091 ± 0.107
CysPro: 0.091 ± 0.107
CysGln: 0.091 ± 0.107
CysArg: 0.365 ± 0.211
CysSer: 0.365 ± 0.247
CysThr: 0.182 ± 0.165
CysVal: 0.365 ± 0.256
CysTrp: 0.182 ± 0.123
CysTyr: 0.547 ± 0.22
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.009 ± 0.417
AspCys: 0.091 ± 0.115
AspAsp: 3.556 ± 0.888
AspGlu: 3.556 ± 0.988
AspPhe: 2.918 ± 0.484
AspGly: 4.468 ± 0.879
AspHis: 1.185 ± 0.404
AspIle: 4.651 ± 0.721
AspLys: 5.38 ± 1.04
AspLeu: 4.833 ± 0.623
AspMet: 1.003 ± 0.371
AspAsn: 3.283 ± 0.505
AspPro: 1.641 ± 0.426
AspGln: 1.368 ± 0.397
AspArg: 1.915 ± 0.418
AspSer: 3.283 ± 0.51
AspThr: 3.465 ± 0.585
AspVal: 4.195 ± 0.57
AspTrp: 1.277 ± 0.318
AspTyr: 3.465 ± 0.604
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.38 ± 1.336
GluCys: 0.638 ± 0.295
GluAsp: 3.556 ± 0.497
GluGlu: 6.748 ± 1.279
GluPhe: 3.192 ± 0.5
GluGly: 3.192 ± 0.656
GluHis: 0.547 ± 0.298
GluIle: 6.566 ± 0.65
GluLys: 8.116 ± 1.841
GluLeu: 9.119 ± 0.801
GluMet: 1.733 ± 0.558
GluAsn: 4.833 ± 0.95
GluPro: 2.097 ± 0.573
GluGln: 3.1 ± 0.88
GluArg: 3.192 ± 0.679
GluSer: 3.739 ± 0.585
GluThr: 4.377 ± 0.563
GluVal: 4.377 ± 0.574
GluTrp: 1.55 ± 0.45
GluTyr: 1.915 ± 0.376
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.097 ± 0.598
PheCys: 0.182 ± 0.15
PheAsp: 3.83 ± 0.851
PheGlu: 3.374 ± 0.486
PhePhe: 1.185 ± 0.533
PheGly: 2.097 ± 0.515
PheHis: 0.547 ± 0.281
PheIle: 2.28 ± 0.583
PheLys: 3.465 ± 0.845
PheLeu: 2.827 ± 0.834
PheMet: 1.733 ± 0.689
PheAsn: 3.374 ± 0.506
PhePro: 0.547 ± 0.192
PheGln: 1.185 ± 0.415
PheArg: 1.368 ± 0.374
PheSer: 2.736 ± 0.424
PheThr: 3.009 ± 0.425
PheVal: 1.733 ± 0.669
PheTrp: 0.547 ± 0.219
PheTyr: 1.641 ± 0.364
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.012 ± 0.941
GlyCys: 0.182 ± 0.115
GlyAsp: 2.28 ± 0.406
GlyGlu: 3.192 ± 0.52
GlyPhe: 3.009 ± 0.496
GlyGly: 3.739 ± 0.668
GlyHis: 0.73 ± 0.257
GlyIle: 4.377 ± 0.534
GlyLys: 4.833 ± 0.621
GlyLeu: 5.016 ± 0.805
GlyMet: 1.55 ± 0.44
GlyAsn: 3.739 ± 0.59
GlyPro: 0.73 ± 0.305
GlyGln: 3.192 ± 0.628
GlyArg: 3.192 ± 0.662
GlySer: 3.739 ± 0.784
GlyThr: 3.556 ± 0.494
GlyVal: 3.739 ± 0.432
GlyTrp: 1.277 ± 0.494
GlyTyr: 2.28 ± 0.398
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.73 ± 0.24
HisCys: 0.091 ± 0.115
HisAsp: 0.456 ± 0.231
HisGlu: 1.094 ± 0.325
HisPhe: 0.73 ± 0.222
HisGly: 1.003 ± 0.287
HisHis: 0.0 ± 0.0
HisIle: 1.733 ± 0.363
HisLys: 0.912 ± 0.288
HisLeu: 1.459 ± 0.455
HisMet: 0.274 ± 0.156
HisAsn: 0.73 ± 0.234
HisPro: 0.547 ± 0.216
HisGln: 0.547 ± 0.269
HisArg: 0.547 ± 0.28
HisSer: 1.185 ± 0.462
HisThr: 0.821 ± 0.378
HisVal: 0.821 ± 0.292
HisTrp: 0.0 ± 0.0
HisTyr: 1.185 ± 0.516
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.654 ± 0.819
IleCys: 0.73 ± 0.34
IleAsp: 4.833 ± 0.9
IleGlu: 6.201 ± 0.795
IlePhe: 2.645 ± 0.589
IleGly: 3.556 ± 0.61
IleHis: 0.821 ± 0.321
IleIle: 3.374 ± 0.667
IleLys: 5.654 ± 0.837
IleLeu: 4.742 ± 0.837
IleMet: 1.277 ± 0.374
IleAsn: 4.104 ± 0.687
IlePro: 1.824 ± 0.285
IleGln: 2.371 ± 0.428
IleArg: 3.648 ± 0.849
IleSer: 6.566 ± 1.318
IleThr: 3.556 ± 0.384
IleVal: 2.827 ± 0.606
IleTrp: 1.185 ± 0.328
IleTyr: 1.368 ± 0.501
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.745 ± 0.986
LysCys: 0.091 ± 0.106
LysAsp: 5.289 ± 0.694
LysGlu: 7.386 ± 1.454
LysPhe: 2.371 ± 0.456
LysGly: 4.195 ± 0.833
LysHis: 1.915 ± 0.37
LysIle: 5.289 ± 0.72
LysLys: 6.201 ± 1.301
LysLeu: 7.113 ± 0.976
LysMet: 2.462 ± 0.469
LysAsn: 4.56 ± 0.608
LysPro: 2.097 ± 0.506
LysGln: 4.012 ± 0.647
LysArg: 3.739 ± 0.718
LysSer: 6.383 ± 0.715
LysThr: 6.292 ± 0.665
LysVal: 5.107 ± 0.77
LysTrp: 1.003 ± 0.245
LysTyr: 3.83 ± 0.645
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.201 ± 0.792
LeuCys: 0.456 ± 0.201
LeuAsp: 6.019 ± 0.687
LeuGlu: 6.657 ± 0.72
LeuPhe: 3.283 ± 0.563
LeuGly: 5.471 ± 0.734
LeuHis: 1.277 ± 0.422
LeuIle: 5.016 ± 0.663
LeuLys: 7.478 ± 1.053
LeuLeu: 5.471 ± 1.02
LeuMet: 2.097 ± 0.679
LeuAsn: 6.201 ± 0.772
LeuPro: 2.918 ± 0.854
LeuGln: 2.097 ± 0.577
LeuArg: 4.286 ± 0.481
LeuSer: 8.025 ± 0.946
LeuThr: 5.107 ± 0.767
LeuVal: 3.374 ± 0.666
LeuTrp: 0.547 ± 0.356
LeuTyr: 2.462 ± 0.424
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.459 ± 0.433
MetCys: 0.365 ± 0.213
MetAsp: 0.821 ± 0.328
MetGlu: 2.28 ± 0.747
MetPhe: 0.274 ± 0.198
MetGly: 1.277 ± 0.304
MetHis: 0.091 ± 0.088
MetIle: 1.641 ± 0.46
MetLys: 2.189 ± 0.59
MetLeu: 2.006 ± 0.464
MetMet: 0.547 ± 0.241
MetAsn: 1.824 ± 0.6
MetPro: 0.912 ± 0.287
MetGln: 0.821 ± 0.323
MetArg: 0.821 ± 0.34
MetSer: 1.277 ± 0.534
MetThr: 1.641 ± 0.429
MetVal: 1.733 ± 0.42
MetTrp: 0.091 ± 0.097
MetTyr: 0.547 ± 0.215
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.921 ± 0.745
AsnCys: 0.091 ± 0.062
AsnAsp: 3.192 ± 0.692
AsnGlu: 4.56 ± 0.475
AsnPhe: 2.736 ± 0.445
AsnGly: 3.921 ± 0.775
AsnHis: 1.185 ± 0.37
AsnIle: 4.104 ± 0.703
AsnLys: 4.651 ± 0.667
AsnLeu: 5.289 ± 0.835
AsnMet: 0.821 ± 0.366
AsnAsn: 3.739 ± 0.606
AsnPro: 2.371 ± 0.523
AsnGln: 3.192 ± 0.779
AsnArg: 3.374 ± 0.63
AsnSer: 3.83 ± 0.96
AsnThr: 3.1 ± 0.452
AsnVal: 3.648 ± 0.637
AsnTrp: 1.003 ± 0.306
AsnTyr: 1.915 ± 0.476
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.189 ± 0.446
ProCys: 0.274 ± 0.191
ProAsp: 1.824 ± 0.363
ProGlu: 1.641 ± 0.485
ProPhe: 0.912 ± 0.385
ProGly: 0.73 ± 0.246
ProHis: 0.456 ± 0.198
ProIle: 1.277 ± 0.506
ProLys: 3.192 ± 0.578
ProLeu: 2.28 ± 0.597
ProMet: 0.456 ± 0.281
ProAsn: 1.094 ± 0.279
ProPro: 0.456 ± 0.209
ProGln: 1.915 ± 0.609
ProArg: 0.912 ± 0.45
ProSer: 1.55 ± 0.415
ProThr: 1.368 ± 0.42
ProVal: 1.824 ± 0.348
ProTrp: 0.456 ± 0.207
ProTyr: 0.73 ± 0.334
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.921 ± 0.761
GlnCys: 0.091 ± 0.107
GlnAsp: 1.55 ± 0.363
GlnGlu: 4.468 ± 0.763
GlnPhe: 1.368 ± 0.289
GlnGly: 1.733 ± 0.318
GlnHis: 0.274 ± 0.135
GlnIle: 2.371 ± 0.373
GlnLys: 3.465 ± 0.523
GlnLeu: 3.556 ± 0.521
GlnMet: 0.73 ± 0.243
GlnAsn: 2.645 ± 0.475
GlnPro: 1.094 ± 0.311
GlnGln: 1.277 ± 0.415
GlnArg: 2.462 ± 0.582
GlnSer: 3.009 ± 0.677
GlnThr: 2.645 ± 0.774
GlnVal: 2.827 ± 0.66
GlnTrp: 0.274 ± 0.189
GlnTyr: 0.73 ± 0.322
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.465 ± 0.525
ArgCys: 0.547 ± 0.19
ArgAsp: 1.915 ± 0.343
ArgGlu: 2.645 ± 0.615
ArgPhe: 1.641 ± 0.346
ArgGly: 1.55 ± 0.325
ArgHis: 1.003 ± 0.327
ArgIle: 3.374 ± 0.609
ArgLys: 3.83 ± 0.889
ArgLeu: 4.56 ± 0.76
ArgMet: 0.912 ± 0.256
ArgAsn: 2.827 ± 0.614
ArgPro: 1.459 ± 0.373
ArgGln: 2.827 ± 0.73
ArgArg: 2.189 ± 0.554
ArgSer: 2.553 ± 0.445
ArgThr: 3.921 ± 1.143
ArgVal: 2.553 ± 0.54
ArgTrp: 0.638 ± 0.189
ArgTyr: 2.28 ± 0.503
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.651 ± 0.842
SerCys: 0.365 ± 0.176
SerAsp: 4.012 ± 0.675
SerGlu: 4.742 ± 0.738
SerPhe: 2.827 ± 0.479
SerGly: 5.198 ± 1.037
SerHis: 0.821 ± 0.312
SerIle: 4.104 ± 0.432
SerLys: 5.38 ± 0.685
SerLeu: 5.289 ± 0.971
SerMet: 1.003 ± 0.403
SerAsn: 4.195 ± 0.756
SerPro: 1.641 ± 0.345
SerGln: 2.28 ± 0.536
SerArg: 3.648 ± 0.911
SerSer: 5.198 ± 1.241
SerThr: 5.107 ± 0.814
SerVal: 4.56 ± 1.285
SerTrp: 1.185 ± 0.28
SerTyr: 2.097 ± 0.326
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.198 ± 1.347
ThrCys: 0.365 ± 0.211
ThrAsp: 4.104 ± 0.75
ThrGlu: 3.374 ± 0.553
ThrPhe: 3.374 ± 0.651
ThrGly: 5.198 ± 0.906
ThrHis: 0.912 ± 0.286
ThrIle: 5.198 ± 0.57
ThrLys: 5.198 ± 0.474
ThrLeu: 4.833 ± 0.792
ThrMet: 0.821 ± 0.271
ThrAsn: 3.283 ± 0.524
ThrPro: 0.912 ± 0.487
ThrGln: 3.1 ± 0.793
ThrArg: 2.462 ± 0.577
ThrSer: 4.012 ± 0.7
ThrThr: 5.198 ± 0.967
ThrVal: 4.924 ± 0.548
ThrTrp: 1.003 ± 0.329
ThrTyr: 2.371 ± 0.53
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.009 ± 0.587
ValCys: 0.365 ± 0.234
ValAsp: 4.012 ± 0.525
ValGlu: 5.927 ± 0.655
ValPhe: 2.645 ± 0.815
ValGly: 4.104 ± 0.777
ValHis: 1.094 ± 0.325
ValIle: 2.827 ± 0.471
ValLys: 5.471 ± 0.876
ValLeu: 3.465 ± 0.671
ValMet: 1.277 ± 0.402
ValAsn: 3.465 ± 0.475
ValPro: 1.368 ± 0.362
ValGln: 1.824 ± 0.386
ValArg: 3.009 ± 0.606
ValSer: 3.283 ± 0.581
ValThr: 4.924 ± 0.855
ValVal: 4.377 ± 0.726
ValTrp: 1.003 ± 0.306
ValTyr: 2.006 ± 0.645
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.73 ± 0.243
TrpCys: 0.091 ± 0.106
TrpAsp: 0.365 ± 0.197
TrpGlu: 0.73 ± 0.228
TrpPhe: 0.821 ± 0.414
TrpGly: 1.185 ± 0.343
TrpHis: 0.274 ± 0.168
TrpIle: 1.459 ± 0.297
TrpLys: 0.638 ± 0.276
TrpLeu: 1.55 ± 0.472
TrpMet: 0.73 ± 0.242
TrpAsn: 1.277 ± 0.288
TrpPro: 0.091 ± 0.086
TrpGln: 0.547 ± 0.178
TrpArg: 0.547 ± 0.298
TrpSer: 1.094 ± 0.51
TrpThr: 1.094 ± 0.286
TrpVal: 1.003 ± 0.275
TrpTrp: 0.091 ± 0.062
TrpTyr: 0.821 ± 0.586
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.553 ± 0.451
TyrCys: 0.274 ± 0.162
TyrAsp: 2.28 ± 0.568
TyrGlu: 2.371 ± 0.466
TyrPhe: 1.915 ± 0.58
TyrGly: 1.277 ± 0.39
TyrHis: 0.912 ± 0.271
TyrIle: 2.28 ± 0.392
TyrLys: 3.192 ± 0.646
TyrLeu: 4.012 ± 0.84
TyrMet: 1.003 ± 0.38
TyrAsn: 1.733 ± 0.52
TyrPro: 1.277 ± 0.375
TyrGln: 1.824 ± 0.393
TyrArg: 2.097 ± 0.486
TyrSer: 1.915 ± 0.312
TyrThr: 1.55 ± 0.358
TyrVal: 1.641 ± 0.365
TyrTrp: 0.73 ± 0.271
TyrTyr: 1.915 ± 0.824
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 43 proteins (10967 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
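
For readers unfamiliar with the procedure, a protein-level bootstrap can be sketched roughly as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the fixed seed, the within-protein pair counting, and the use of the sample standard deviation as the reported error are assumptions for illustration; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide (overlapping, within proteins)."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the set size
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome (hypothetical sequences, one-letter codes)
toy = ["MKKLAEEL", "MAKKT", "MSELLK", "MKTAYI"]
err_kk = bootstrap_error(toy, "KK")
err_absent = bootstrap_error(toy, "WW")
```

A dipeptide that never occurs gives the same estimate in every resample, so its error is zero, which matches the 0.0 ± 0.0 entries in the table.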

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski