Amino acid dipepetide frequency for Clostridium phage phiCP26F

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.741AlaAla: 1.741 ± 1.095
0.249AlaCys: 0.249 ± 0.192
1.99AlaAsp: 1.99 ± 0.48
3.606AlaGlu: 3.606 ± 0.674
2.238AlaPhe: 2.238 ± 0.676
2.114AlaGly: 2.114 ± 0.974
0.746AlaHis: 0.746 ± 0.352
5.969AlaIle: 5.969 ± 0.849
3.855AlaLys: 3.855 ± 0.751
5.72AlaLeu: 5.72 ± 1.168
1.243AlaMet: 1.243 ± 0.408
4.601AlaAsn: 4.601 ± 0.757
0.746AlaPro: 0.746 ± 0.326
1.492AlaGln: 1.492 ± 0.361
2.984AlaArg: 2.984 ± 0.523
3.482AlaSer: 3.482 ± 0.914
3.482AlaThr: 3.482 ± 0.66
2.363AlaVal: 2.363 ± 0.515
0.373AlaTrp: 0.373 ± 0.193
2.238AlaTyr: 2.238 ± 0.576
0.0AlaXaa: 0.0 ± 0.0
Cys
0.373CysAla: 0.373 ± 0.19
0.373CysCys: 0.373 ± 0.212
1.368CysAsp: 1.368 ± 0.438
1.617CysGlu: 1.617 ± 0.413
0.746CysPhe: 0.746 ± 0.283
0.87CysGly: 0.87 ± 0.352
0.124CysHis: 0.124 ± 0.14
1.243CysIle: 1.243 ± 0.406
1.492CysLys: 1.492 ± 0.425
1.492CysLeu: 1.492 ± 0.487
0.373CysMet: 0.373 ± 0.193
0.622CysAsn: 0.622 ± 0.276
0.124CysPro: 0.124 ± 0.139
0.0CysGln: 0.0 ± 0.0
0.497CysArg: 0.497 ± 0.242
0.746CysSer: 0.746 ± 0.325
0.373CysThr: 0.373 ± 0.193
0.373CysVal: 0.373 ± 0.19
0.0CysTrp: 0.0 ± 0.0
0.373CysTyr: 0.373 ± 0.266
0.0CysXaa: 0.0 ± 0.0
Asp
3.606AspAla: 3.606 ± 0.707
0.995AspCys: 0.995 ± 0.453
1.617AspAsp: 1.617 ± 0.4
4.601AspGlu: 4.601 ± 0.657
3.855AspPhe: 3.855 ± 0.539
3.109AspGly: 3.109 ± 0.639
0.497AspHis: 0.497 ± 0.243
5.471AspIle: 5.471 ± 0.855
5.969AspLys: 5.969 ± 0.752
5.844AspLeu: 5.844 ± 0.723
2.363AspMet: 2.363 ± 0.546
3.357AspAsn: 3.357 ± 0.568
0.746AspPro: 0.746 ± 0.315
0.746AspGln: 0.746 ± 0.374
1.119AspArg: 1.119 ± 0.348
3.233AspSer: 3.233 ± 0.495
2.86AspThr: 2.86 ± 0.514
2.736AspVal: 2.736 ± 0.601
2.114AspTrp: 2.114 ± 0.643
3.233AspTyr: 3.233 ± 0.774
0.0AspXaa: 0.0 ± 0.0
Glu
3.482GluAla: 3.482 ± 0.651
1.492GluCys: 1.492 ± 0.424
5.72GluAsp: 5.72 ± 0.969
8.58GluGlu: 8.58 ± 1.499
3.73GluPhe: 3.73 ± 0.523
6.59GluGly: 6.59 ± 0.844
1.243GluHis: 1.243 ± 0.418
8.456GluIle: 8.456 ± 1.36
8.083GluLys: 8.083 ± 1.183
8.456GluLeu: 8.456 ± 1.072
3.357GluMet: 3.357 ± 0.548
4.476GluAsn: 4.476 ± 0.763
1.865GluPro: 1.865 ± 0.439
3.482GluGln: 3.482 ± 0.898
2.984GluArg: 2.984 ± 0.69
3.482GluSer: 3.482 ± 0.651
3.979GluThr: 3.979 ± 0.612
5.596GluVal: 5.596 ± 0.968
1.119GluTrp: 1.119 ± 0.348
3.233GluTyr: 3.233 ± 0.687
0.0GluXaa: 0.0 ± 0.0
Phe
2.238PheAla: 2.238 ± 0.428
0.373PheCys: 0.373 ± 0.19
2.238PheAsp: 2.238 ± 0.531
4.725PheGlu: 4.725 ± 0.713
0.87PhePhe: 0.87 ± 0.383
2.487PheGly: 2.487 ± 0.673
0.124PheHis: 0.124 ± 0.112
4.601PheIle: 4.601 ± 0.747
4.601PheLys: 4.601 ± 0.655
2.487PheLeu: 2.487 ± 0.464
1.741PheMet: 1.741 ± 0.443
4.103PheAsn: 4.103 ± 0.601
0.746PhePro: 0.746 ± 0.318
1.243PheGln: 1.243 ± 0.428
1.865PheArg: 1.865 ± 0.46
2.611PheSer: 2.611 ± 0.471
3.233PheThr: 3.233 ± 0.738
2.487PheVal: 2.487 ± 0.527
0.746PheTrp: 0.746 ± 0.281
1.865PheTyr: 1.865 ± 0.425
0.0PheXaa: 0.0 ± 0.0
Gly
3.233GlyAla: 3.233 ± 1.021
0.995GlyCys: 0.995 ± 0.313
2.487GlyAsp: 2.487 ± 0.604
4.601GlyGlu: 4.601 ± 0.821
2.86GlyPhe: 2.86 ± 0.725
3.482GlyGly: 3.482 ± 1.174
0.746GlyHis: 0.746 ± 0.338
5.347GlyIle: 5.347 ± 1.379
4.85GlyLys: 4.85 ± 0.705
4.974GlyLeu: 4.974 ± 0.93
1.368GlyMet: 1.368 ± 0.399
3.606GlyAsn: 3.606 ± 0.822
0.124GlyPro: 0.124 ± 0.113
2.114GlyGln: 2.114 ± 0.559
2.114GlyArg: 2.114 ± 0.563
2.984GlySer: 2.984 ± 0.528
3.357GlyThr: 3.357 ± 0.65
4.103GlyVal: 4.103 ± 0.682
0.497GlyTrp: 0.497 ± 0.215
2.984GlyTyr: 2.984 ± 0.54
0.0GlyXaa: 0.0 ± 0.0
His
0.497HisAla: 0.497 ± 0.248
0.124HisCys: 0.124 ± 0.109
1.119HisAsp: 1.119 ± 0.423
0.995HisGlu: 0.995 ± 0.347
0.87HisPhe: 0.87 ± 0.344
0.622HisGly: 0.622 ± 0.224
0.124HisHis: 0.124 ± 0.116
0.87HisIle: 0.87 ± 0.259
1.243HisLys: 1.243 ± 0.373
1.119HisLeu: 1.119 ± 0.362
0.124HisMet: 0.124 ± 0.14
0.373HisAsn: 0.373 ± 0.195
0.373HisPro: 0.373 ± 0.172
0.249HisGln: 0.249 ± 0.169
0.497HisArg: 0.497 ± 0.305
0.995HisSer: 0.995 ± 0.314
0.87HisThr: 0.87 ± 0.385
0.373HisVal: 0.373 ± 0.22
0.0HisTrp: 0.0 ± 0.0
0.746HisTyr: 0.746 ± 0.289
0.0HisXaa: 0.0 ± 0.0
Ile
4.974IleAla: 4.974 ± 0.874
1.243IleCys: 1.243 ± 0.433
6.963IleAsp: 6.963 ± 0.837
8.083IleGlu: 8.083 ± 1.229
2.238IlePhe: 2.238 ± 0.561
3.606IleGly: 3.606 ± 0.718
0.746IleHis: 0.746 ± 0.31
5.72IleIle: 5.72 ± 0.77
9.823IleLys: 9.823 ± 1.344
5.844IleLeu: 5.844 ± 0.76
2.363IleMet: 2.363 ± 0.52
5.72IleAsn: 5.72 ± 0.765
2.86IlePro: 2.86 ± 0.49
2.363IleGln: 2.363 ± 0.69
2.611IleArg: 2.611 ± 0.672
7.088IleSer: 7.088 ± 0.897
4.476IleThr: 4.476 ± 0.789
3.73IleVal: 3.73 ± 0.618
1.119IleTrp: 1.119 ± 0.323
3.233IleTyr: 3.233 ± 0.603
0.0IleXaa: 0.0 ± 0.0
Lys
6.466LysAla: 6.466 ± 0.792
1.119LysCys: 1.119 ± 0.472
3.73LysAsp: 3.73 ± 0.636
11.564LysGlu: 11.564 ± 2.03
4.228LysPhe: 4.228 ± 0.559
5.72LysGly: 5.72 ± 0.94
1.243LysHis: 1.243 ± 0.363
7.336LysIle: 7.336 ± 0.975
7.336LysLys: 7.336 ± 1.068
7.461LysLeu: 7.461 ± 0.933
2.611LysMet: 2.611 ± 0.566
5.347LysAsn: 5.347 ± 0.985
3.233LysPro: 3.233 ± 0.722
3.73LysGln: 3.73 ± 0.604
4.601LysArg: 4.601 ± 0.953
4.103LysSer: 4.103 ± 0.733
5.347LysThr: 5.347 ± 1.002
6.342LysVal: 6.342 ± 0.753
0.87LysTrp: 0.87 ± 0.244
3.606LysTyr: 3.606 ± 0.612
0.0LysXaa: 0.0 ± 0.0
Leu
5.969LeuAla: 5.969 ± 1.317
0.746LeuCys: 0.746 ± 0.396
4.974LeuAsp: 4.974 ± 0.677
7.834LeuGlu: 7.834 ± 0.906
3.109LeuPhe: 3.109 ± 0.641
5.098LeuGly: 5.098 ± 1.079
1.368LeuHis: 1.368 ± 0.345
5.72LeuIle: 5.72 ± 1.0
8.58LeuLys: 8.58 ± 0.959
5.471LeuLeu: 5.471 ± 1.005
2.736LeuMet: 2.736 ± 0.454
5.844LeuAsn: 5.844 ± 0.818
1.492LeuPro: 1.492 ± 0.387
3.73LeuGln: 3.73 ± 0.742
3.606LeuArg: 3.606 ± 0.617
4.85LeuSer: 4.85 ± 1.023
5.471LeuThr: 5.471 ± 1.162
4.601LeuVal: 4.601 ± 0.717
0.746LeuTrp: 0.746 ± 0.307
3.482LeuTyr: 3.482 ± 0.988
0.0LeuXaa: 0.0 ± 0.0
Met
1.741MetAla: 1.741 ± 0.641
0.497MetCys: 0.497 ± 0.233
1.368MetAsp: 1.368 ± 0.416
2.736MetGlu: 2.736 ± 0.713
1.119MetPhe: 1.119 ± 0.339
1.741MetGly: 1.741 ± 0.533
0.124MetHis: 0.124 ± 0.123
2.363MetIle: 2.363 ± 0.584
3.233MetLys: 3.233 ± 0.591
2.114MetLeu: 2.114 ± 0.533
0.746MetMet: 0.746 ± 0.237
2.238MetAsn: 2.238 ± 0.521
0.746MetPro: 0.746 ± 0.277
0.995MetGln: 0.995 ± 0.307
1.368MetArg: 1.368 ± 0.363
1.617MetSer: 1.617 ± 0.482
1.492MetThr: 1.492 ± 0.425
1.243MetVal: 1.243 ± 0.36
0.373MetTrp: 0.373 ± 0.194
0.746MetTyr: 0.746 ± 0.376
0.0MetXaa: 0.0 ± 0.0
Asn
2.86AsnAla: 2.86 ± 0.669
0.87AsnCys: 0.87 ± 0.316
4.228AsnAsp: 4.228 ± 0.77
3.855AsnGlu: 3.855 ± 0.591
3.606AsnPhe: 3.606 ± 0.522
4.476AsnGly: 4.476 ± 0.827
1.119AsnHis: 1.119 ± 0.394
5.72AsnIle: 5.72 ± 0.733
6.093AsnLys: 6.093 ± 0.773
4.85AsnLeu: 4.85 ± 0.953
1.492AsnMet: 1.492 ± 0.399
4.601AsnAsn: 4.601 ± 0.984
1.865AsnPro: 1.865 ± 0.418
1.741AsnGln: 1.741 ± 0.447
3.482AsnArg: 3.482 ± 0.585
2.984AsnSer: 2.984 ± 0.773
4.725AsnThr: 4.725 ± 0.715
3.109AsnVal: 3.109 ± 0.567
0.746AsnTrp: 0.746 ± 0.313
3.482AsnTyr: 3.482 ± 0.525
0.0AsnXaa: 0.0 ± 0.0
Pro
0.249ProAla: 0.249 ± 0.155
0.373ProCys: 0.373 ± 0.21
1.119ProAsp: 1.119 ± 0.475
1.492ProGlu: 1.492 ± 0.462
1.243ProPhe: 1.243 ± 0.487
0.124ProGly: 0.124 ± 0.116
0.497ProHis: 0.497 ± 0.225
2.238ProIle: 2.238 ± 0.486
2.238ProLys: 2.238 ± 0.475
3.233ProLeu: 3.233 ± 0.558
0.746ProMet: 0.746 ± 0.285
2.363ProAsn: 2.363 ± 0.646
0.124ProPro: 0.124 ± 0.116
1.243ProGln: 1.243 ± 0.356
0.373ProArg: 0.373 ± 0.214
0.995ProSer: 0.995 ± 0.353
1.368ProThr: 1.368 ± 0.414
1.119ProVal: 1.119 ± 0.408
0.0ProTrp: 0.0 ± 0.0
1.617ProTyr: 1.617 ± 0.447
0.0ProXaa: 0.0 ± 0.0
Gln
1.865GlnAla: 1.865 ± 0.659
0.124GlnCys: 0.124 ± 0.126
2.363GlnAsp: 2.363 ± 0.551
2.736GlnGlu: 2.736 ± 0.567
1.492GlnPhe: 1.492 ± 0.422
2.736GlnGly: 2.736 ± 0.583
0.373GlnHis: 0.373 ± 0.221
1.99GlnIle: 1.99 ± 0.63
2.487GlnLys: 2.487 ± 0.54
3.233GlnLeu: 3.233 ± 0.705
1.119GlnMet: 1.119 ± 0.34
1.99GlnAsn: 1.99 ± 0.51
0.746GlnPro: 0.746 ± 0.238
1.119GlnGln: 1.119 ± 0.361
1.243GlnArg: 1.243 ± 0.451
1.492GlnSer: 1.492 ± 0.447
1.617GlnThr: 1.617 ± 0.427
1.617GlnVal: 1.617 ± 0.382
0.249GlnTrp: 0.249 ± 0.145
1.865GlnTyr: 1.865 ± 0.531
0.0GlnXaa: 0.0 ± 0.0
Arg
1.243ArgAla: 1.243 ± 0.327
0.746ArgCys: 0.746 ± 0.267
2.487ArgAsp: 2.487 ± 0.543
3.979ArgGlu: 3.979 ± 0.815
2.736ArgPhe: 2.736 ± 0.644
2.736ArgGly: 2.736 ± 0.548
0.124ArgHis: 0.124 ± 0.139
3.109ArgIle: 3.109 ± 0.787
3.73ArgLys: 3.73 ± 0.848
3.357ArgLeu: 3.357 ± 0.634
1.492ArgMet: 1.492 ± 0.461
2.736ArgAsn: 2.736 ± 0.558
0.622ArgPro: 0.622 ± 0.24
1.119ArgGln: 1.119 ± 0.379
1.741ArgArg: 1.741 ± 0.417
1.492ArgSer: 1.492 ± 0.504
1.865ArgThr: 1.865 ± 0.484
1.99ArgVal: 1.99 ± 0.578
0.746ArgTrp: 0.746 ± 0.367
1.99ArgTyr: 1.99 ± 0.533
0.0ArgXaa: 0.0 ± 0.0
Ser
3.109SerAla: 3.109 ± 0.942
0.746SerCys: 0.746 ± 0.288
3.606SerAsp: 3.606 ± 0.658
4.601SerGlu: 4.601 ± 0.829
2.238SerPhe: 2.238 ± 0.612
3.233SerGly: 3.233 ± 1.098
0.622SerHis: 0.622 ± 0.298
5.471SerIle: 5.471 ± 0.761
5.844SerLys: 5.844 ± 1.123
4.476SerLeu: 4.476 ± 0.721
1.243SerMet: 1.243 ± 0.412
4.103SerAsn: 4.103 ± 0.548
0.995SerPro: 0.995 ± 0.311
1.865SerGln: 1.865 ± 0.469
2.611SerArg: 2.611 ± 0.563
2.363SerSer: 2.363 ± 0.452
2.611SerThr: 2.611 ± 0.656
2.114SerVal: 2.114 ± 0.516
0.746SerTrp: 0.746 ± 0.307
2.86SerTyr: 2.86 ± 0.557
0.0SerXaa: 0.0 ± 0.0
Thr
2.363ThrAla: 2.363 ± 0.544
0.497ThrCys: 0.497 ± 0.255
3.482ThrAsp: 3.482 ± 0.835
3.855ThrGlu: 3.855 ± 0.832
3.233ThrPhe: 3.233 ± 0.5
3.233ThrGly: 3.233 ± 0.602
1.492ThrHis: 1.492 ± 0.356
4.974ThrIle: 4.974 ± 0.92
3.979ThrLys: 3.979 ± 0.554
5.72ThrLeu: 5.72 ± 0.85
1.99ThrMet: 1.99 ± 0.422
3.855ThrAsn: 3.855 ± 0.625
1.99ThrPro: 1.99 ± 0.476
1.741ThrGln: 1.741 ± 0.43
1.617ThrArg: 1.617 ± 0.457
3.482ThrSer: 3.482 ± 0.661
3.482ThrThr: 3.482 ± 0.73
2.363ThrVal: 2.363 ± 0.495
0.497ThrTrp: 0.497 ± 0.204
1.617ThrTyr: 1.617 ± 0.51
0.0ThrXaa: 0.0 ± 0.0
Val
3.482ValAla: 3.482 ± 0.744
0.746ValCys: 0.746 ± 0.276
3.357ValAsp: 3.357 ± 0.554
3.73ValGlu: 3.73 ± 0.68
1.741ValPhe: 1.741 ± 0.365
2.984ValGly: 2.984 ± 0.843
0.373ValHis: 0.373 ± 0.199
4.103ValIle: 4.103 ± 0.825
6.963ValLys: 6.963 ± 0.912
3.482ValLeu: 3.482 ± 0.693
0.622ValMet: 0.622 ± 0.27
2.611ValAsn: 2.611 ± 0.57
1.617ValPro: 1.617 ± 0.427
1.617ValGln: 1.617 ± 0.374
2.363ValArg: 2.363 ± 0.558
3.482ValSer: 3.482 ± 0.486
2.363ValThr: 2.363 ± 0.599
2.114ValVal: 2.114 ± 0.645
0.622ValTrp: 0.622 ± 0.27
1.865ValTyr: 1.865 ± 0.441
0.0ValXaa: 0.0 ± 0.0
Trp
0.497TrpAla: 0.497 ± 0.309
0.124TrpCys: 0.124 ± 0.123
1.243TrpAsp: 1.243 ± 0.383
1.243TrpGlu: 1.243 ± 0.407
0.497TrpPhe: 0.497 ± 0.294
0.373TrpGly: 0.373 ± 0.175
0.249TrpHis: 0.249 ± 0.185
1.119TrpIle: 1.119 ± 0.557
0.249TrpLys: 0.249 ± 0.145
2.114TrpLeu: 2.114 ± 0.477
0.0TrpMet: 0.0 ± 0.0
0.622TrpAsn: 0.622 ± 0.259
0.0TrpPro: 0.0 ± 0.0
0.746TrpGln: 0.746 ± 0.374
0.373TrpArg: 0.373 ± 0.22
1.119TrpSer: 1.119 ± 0.429
0.497TrpThr: 0.497 ± 0.24
0.622TrpVal: 0.622 ± 0.285
0.0TrpTrp: 0.0 ± 0.0
0.622TrpTyr: 0.622 ± 0.323
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.243TyrAla: 1.243 ± 0.339
0.746TyrCys: 0.746 ± 0.307
2.611TyrAsp: 2.611 ± 0.49
4.974TyrGlu: 4.974 ± 0.824
2.736TyrPhe: 2.736 ± 0.625
1.492TyrGly: 1.492 ± 0.43
0.249TyrHis: 0.249 ± 0.195
2.984TyrIle: 2.984 ± 0.63
5.347TyrLys: 5.347 ± 0.826
4.103TyrLeu: 4.103 ± 0.633
0.746TyrMet: 0.746 ± 0.306
2.736TyrAsn: 2.736 ± 0.612
1.741TyrPro: 1.741 ± 0.471
1.119TyrGln: 1.119 ± 0.345
1.99TyrArg: 1.99 ± 0.552
2.86TyrSer: 2.86 ± 0.556
1.99TyrThr: 1.99 ± 0.541
1.243TyrVal: 1.243 ± 0.34
0.746TyrTrp: 0.746 ± 0.295
2.611TyrTyr: 2.611 ± 0.642
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (8043 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski