Amino acid dipepetide frequency for Cellulophaga phage phi48:2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.82AlaAla: 3.82 ± 1.292
0.273AlaCys: 0.273 ± 0.304
0.546AlaAsp: 0.546 ± 0.378
1.91AlaGlu: 1.91 ± 0.573
4.093AlaPhe: 4.093 ± 0.917
1.091AlaGly: 1.091 ± 0.548
0.546AlaHis: 0.546 ± 0.44
3.82AlaIle: 3.82 ± 0.87
4.911AlaLys: 4.911 ± 1.262
4.366AlaLeu: 4.366 ± 0.943
2.183AlaMet: 2.183 ± 0.678
1.637AlaAsn: 1.637 ± 0.666
1.637AlaPro: 1.637 ± 0.5
0.819AlaGln: 0.819 ± 0.433
0.819AlaArg: 0.819 ± 0.421
3.001AlaSer: 3.001 ± 1.361
3.274AlaThr: 3.274 ± 1.206
2.183AlaVal: 2.183 ± 0.787
0.0AlaTrp: 0.0 ± 0.0
1.091AlaTyr: 1.091 ± 0.469
0.0AlaXaa: 0.0 ± 0.0
Cys
0.546CysAla: 0.546 ± 0.405
0.0CysCys: 0.0 ± 0.0
0.546CysAsp: 0.546 ± 0.421
0.273CysGlu: 0.273 ± 0.334
1.091CysPhe: 1.091 ± 0.558
0.273CysGly: 0.273 ± 0.304
0.0CysHis: 0.0 ± 0.0
0.546CysIle: 0.546 ± 0.42
1.091CysLys: 1.091 ± 0.528
0.546CysLeu: 0.546 ± 0.583
0.546CysMet: 0.546 ± 0.463
0.819CysAsn: 0.819 ± 0.47
0.0CysPro: 0.0 ± 0.0
0.273CysGln: 0.273 ± 0.27
0.546CysArg: 0.546 ± 0.309
0.546CysSer: 0.546 ± 0.515
0.546CysThr: 0.546 ± 0.403
0.273CysVal: 0.273 ± 0.22
0.273CysTrp: 0.273 ± 0.273
1.091CysTyr: 1.091 ± 0.568
0.0CysXaa: 0.0 ± 0.0
Asp
2.456AspAla: 2.456 ± 0.594
0.819AspCys: 0.819 ± 0.574
3.274AspAsp: 3.274 ± 1.073
1.637AspGlu: 1.637 ± 0.552
5.184AspPhe: 5.184 ± 1.054
2.456AspGly: 2.456 ± 0.752
0.546AspHis: 0.546 ± 0.307
5.73AspIle: 5.73 ± 1.461
3.82AspLys: 3.82 ± 0.734
5.457AspLeu: 5.457 ± 0.944
1.637AspMet: 1.637 ± 0.544
3.274AspAsn: 3.274 ± 0.897
0.819AspPro: 0.819 ± 0.409
0.0AspGln: 0.0 ± 0.0
0.546AspArg: 0.546 ± 0.458
1.91AspSer: 1.91 ± 0.687
1.637AspThr: 1.637 ± 0.647
2.183AspVal: 2.183 ± 0.72
0.546AspTrp: 0.546 ± 0.402
4.093AspTyr: 4.093 ± 0.828
0.0AspXaa: 0.0 ± 0.0
Glu
2.729GluAla: 2.729 ± 1.228
0.0GluCys: 0.0 ± 0.0
2.456GluAsp: 2.456 ± 0.681
3.274GluGlu: 3.274 ± 0.901
2.456GluPhe: 2.456 ± 1.107
2.183GluGly: 2.183 ± 0.907
1.091GluHis: 1.091 ± 0.657
6.821GluIle: 6.821 ± 1.187
8.731GluLys: 8.731 ± 1.375
6.548GluLeu: 6.548 ± 1.208
1.091GluMet: 1.091 ± 0.514
6.276GluAsn: 6.276 ± 1.365
0.546GluPro: 0.546 ± 0.351
0.819GluGln: 0.819 ± 0.52
1.637GluArg: 1.637 ± 0.591
2.456GluSer: 2.456 ± 0.933
4.093GluThr: 4.093 ± 1.127
3.547GluVal: 3.547 ± 1.104
0.273GluTrp: 0.273 ± 0.302
2.183GluTyr: 2.183 ± 0.858
0.0GluXaa: 0.0 ± 0.0
Phe
1.91PheAla: 1.91 ± 0.658
0.546PheCys: 0.546 ± 0.421
3.82PheAsp: 3.82 ± 0.997
3.547PheGlu: 3.547 ± 0.999
3.547PhePhe: 3.547 ± 1.457
4.093PheGly: 4.093 ± 0.988
1.364PheHis: 1.364 ± 0.518
6.548PheIle: 6.548 ± 1.668
6.548PheLys: 6.548 ± 1.371
7.64PheLeu: 7.64 ± 1.211
1.091PheMet: 1.091 ± 0.622
5.73PheAsn: 5.73 ± 1.73
0.819PhePro: 0.819 ± 0.494
1.91PheGln: 1.91 ± 0.671
2.456PheArg: 2.456 ± 1.061
4.093PheSer: 4.093 ± 1.251
3.547PheThr: 3.547 ± 1.01
4.093PheVal: 4.093 ± 0.771
0.546PheTrp: 0.546 ± 0.418
3.274PheTyr: 3.274 ± 1.037
0.0PheXaa: 0.0 ± 0.0
Gly
1.637GlyAla: 1.637 ± 0.489
0.819GlyCys: 0.819 ± 0.518
2.183GlyAsp: 2.183 ± 0.726
2.456GlyGlu: 2.456 ± 0.671
4.093GlyPhe: 4.093 ± 1.172
4.911GlyGly: 4.911 ± 1.541
0.546GlyHis: 0.546 ± 0.358
4.366GlyIle: 4.366 ± 0.958
4.093GlyLys: 4.093 ± 1.151
5.73GlyLeu: 5.73 ± 1.327
1.637GlyMet: 1.637 ± 0.664
3.274GlyAsn: 3.274 ± 0.86
0.0GlyPro: 0.0 ± 0.0
2.456GlyGln: 2.456 ± 0.997
0.819GlyArg: 0.819 ± 0.389
3.001GlySer: 3.001 ± 0.866
1.91GlyThr: 1.91 ± 0.611
3.274GlyVal: 3.274 ± 1.022
1.364GlyTrp: 1.364 ± 0.719
2.729GlyTyr: 2.729 ± 1.052
0.0GlyXaa: 0.0 ± 0.0
His
0.819HisAla: 0.819 ± 0.416
0.273HisCys: 0.273 ± 0.258
0.0HisAsp: 0.0 ± 0.0
1.091HisGlu: 1.091 ± 0.473
1.364HisPhe: 1.364 ± 0.544
1.637HisGly: 1.637 ± 0.683
0.273HisHis: 0.273 ± 0.302
0.546HisIle: 0.546 ± 0.311
1.637HisLys: 1.637 ± 0.701
1.637HisLeu: 1.637 ± 0.746
0.819HisMet: 0.819 ± 0.522
0.819HisAsn: 0.819 ± 0.481
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
1.091HisArg: 1.091 ± 0.479
0.273HisSer: 0.273 ± 0.258
0.546HisThr: 0.546 ± 0.44
1.091HisVal: 1.091 ± 0.392
0.273HisTrp: 0.273 ± 0.273
2.183HisTyr: 2.183 ± 0.666
0.0HisXaa: 0.0 ± 0.0
Ile
2.183IleAla: 2.183 ± 0.984
1.364IleCys: 1.364 ± 0.547
4.366IleAsp: 4.366 ± 0.891
6.548IleGlu: 6.548 ± 1.723
7.094IlePhe: 7.094 ± 1.9
4.638IleGly: 4.638 ± 1.391
0.546IleHis: 0.546 ± 0.402
6.276IleIle: 6.276 ± 1.418
6.276IleLys: 6.276 ± 1.268
7.913IleLeu: 7.913 ± 1.647
2.456IleMet: 2.456 ± 0.735
6.821IleAsn: 6.821 ± 1.555
1.91IlePro: 1.91 ± 0.623
1.364IleGln: 1.364 ± 0.887
3.547IleArg: 3.547 ± 1.07
5.73IleSer: 5.73 ± 1.128
5.457IleThr: 5.457 ± 1.055
5.184IleVal: 5.184 ± 1.319
0.546IleTrp: 0.546 ± 0.387
3.274IleTyr: 3.274 ± 0.764
0.0IleXaa: 0.0 ± 0.0
Lys
4.093LysAla: 4.093 ± 0.966
0.819LysCys: 0.819 ± 0.494
4.366LysAsp: 4.366 ± 0.9
6.003LysGlu: 6.003 ± 1.614
3.001LysPhe: 3.001 ± 0.819
5.457LysGly: 5.457 ± 1.101
2.729LysHis: 2.729 ± 1.238
7.367LysIle: 7.367 ± 1.129
12.551LysLys: 12.551 ± 2.572
11.187LysLeu: 11.187 ± 1.7
3.001LysMet: 3.001 ± 0.847
8.458LysAsn: 8.458 ± 1.663
1.364LysPro: 1.364 ± 0.699
2.729LysGln: 2.729 ± 0.75
4.638LysArg: 4.638 ± 1.625
4.638LysSer: 4.638 ± 0.942
8.186LysThr: 8.186 ± 1.259
4.366LysVal: 4.366 ± 1.469
0.819LysTrp: 0.819 ± 0.497
3.82LysTyr: 3.82 ± 1.134
0.273LysXaa: 0.273 ± 0.273
Leu
2.456LeuAla: 2.456 ± 0.912
1.091LeuCys: 1.091 ± 0.7
7.367LeuAsp: 7.367 ± 1.251
4.911LeuGlu: 4.911 ± 0.892
5.457LeuPhe: 5.457 ± 1.637
5.184LeuGly: 5.184 ± 1.118
1.364LeuHis: 1.364 ± 0.712
9.004LeuIle: 9.004 ± 1.955
13.097LeuLys: 13.097 ± 1.95
8.731LeuLeu: 8.731 ± 1.359
1.364LeuMet: 1.364 ± 0.725
8.186LeuAsn: 8.186 ± 1.615
2.183LeuPro: 2.183 ± 0.952
3.82LeuGln: 3.82 ± 1.156
2.729LeuArg: 2.729 ± 0.899
7.094LeuSer: 7.094 ± 1.617
7.367LeuThr: 7.367 ± 1.542
4.638LeuVal: 4.638 ± 1.111
0.273LeuTrp: 0.273 ± 0.27
4.366LeuTyr: 4.366 ± 1.035
0.0LeuXaa: 0.0 ± 0.0
Met
0.819MetAla: 0.819 ± 0.467
0.819MetCys: 0.819 ± 0.536
1.637MetAsp: 1.637 ± 0.6
0.819MetGlu: 0.819 ± 0.522
1.091MetPhe: 1.091 ± 0.541
0.546MetGly: 0.546 ± 0.372
0.546MetHis: 0.546 ± 0.425
2.183MetIle: 2.183 ± 0.833
0.546MetLys: 0.546 ± 0.379
3.82MetLeu: 3.82 ± 1.42
1.091MetMet: 1.091 ± 0.557
1.091MetAsn: 1.091 ± 0.612
1.364MetPro: 1.364 ± 0.701
1.364MetGln: 1.364 ± 0.654
0.819MetArg: 0.819 ± 0.439
2.456MetSer: 2.456 ± 0.757
1.364MetThr: 1.364 ± 0.605
0.819MetVal: 0.819 ± 0.488
0.0MetTrp: 0.0 ± 0.0
0.819MetTyr: 0.819 ± 0.402
0.0MetXaa: 0.0 ± 0.0
Asn
3.001AsnAla: 3.001 ± 1.05
0.546AsnCys: 0.546 ± 0.373
4.093AsnAsp: 4.093 ± 1.152
4.638AsnGlu: 4.638 ± 0.898
5.457AsnPhe: 5.457 ± 1.415
3.274AsnGly: 3.274 ± 0.887
1.091AsnHis: 1.091 ± 0.644
3.82AsnIle: 3.82 ± 1.003
6.821AsnLys: 6.821 ± 1.468
5.457AsnLeu: 5.457 ± 1.275
1.637AsnMet: 1.637 ± 0.746
3.82AsnAsn: 3.82 ± 1.113
3.001AsnPro: 3.001 ± 0.786
1.91AsnGln: 1.91 ± 0.679
1.91AsnArg: 1.91 ± 0.676
6.821AsnSer: 6.821 ± 1.669
3.82AsnThr: 3.82 ± 1.032
4.366AsnVal: 4.366 ± 1.247
0.546AsnTrp: 0.546 ± 0.382
3.274AsnTyr: 3.274 ± 0.858
0.0AsnXaa: 0.0 ± 0.0
Pro
0.546ProAla: 0.546 ± 0.365
0.0ProCys: 0.0 ± 0.0
0.819ProAsp: 0.819 ± 0.486
1.364ProGlu: 1.364 ± 0.49
2.183ProPhe: 2.183 ± 0.802
0.273ProGly: 0.273 ± 0.27
0.0ProHis: 0.0 ± 0.0
1.91ProIle: 1.91 ± 0.78
2.456ProLys: 2.456 ± 0.708
2.183ProLeu: 2.183 ± 0.922
0.273ProMet: 0.273 ± 0.283
1.637ProAsn: 1.637 ± 0.577
0.273ProPro: 0.273 ± 0.305
0.273ProGln: 0.273 ± 0.294
0.0ProArg: 0.0 ± 0.0
1.91ProSer: 1.91 ± 0.691
2.456ProThr: 2.456 ± 1.078
2.456ProVal: 2.456 ± 1.164
0.0ProTrp: 0.0 ± 0.0
1.364ProTyr: 1.364 ± 0.68
0.0ProXaa: 0.0 ± 0.0
Gln
0.546GlnAla: 0.546 ± 0.309
0.546GlnCys: 0.546 ± 0.413
1.091GlnAsp: 1.091 ± 0.579
1.364GlnGlu: 1.364 ± 0.636
1.91GlnPhe: 1.91 ± 0.644
0.819GlnGly: 0.819 ± 0.417
1.091GlnHis: 1.091 ± 0.446
3.82GlnIle: 3.82 ± 0.965
4.366GlnLys: 4.366 ± 0.846
1.637GlnLeu: 1.637 ± 0.495
0.273GlnMet: 0.273 ± 0.256
2.183GlnAsn: 2.183 ± 0.738
0.0GlnPro: 0.0 ± 0.0
0.273GlnGln: 0.273 ± 0.259
1.364GlnArg: 1.364 ± 0.519
1.364GlnSer: 1.364 ± 0.531
1.637GlnThr: 1.637 ± 0.611
1.364GlnVal: 1.364 ± 0.594
0.0GlnTrp: 0.0 ± 0.0
1.091GlnTyr: 1.091 ± 0.511
0.0GlnXaa: 0.0 ± 0.0
Arg
2.456ArgAla: 2.456 ± 0.771
0.0ArgCys: 0.0 ± 0.0
2.183ArgAsp: 2.183 ± 0.913
2.729ArgGlu: 2.729 ± 0.688
2.183ArgPhe: 2.183 ± 0.756
1.364ArgGly: 1.364 ± 0.478
1.637ArgHis: 1.637 ± 0.607
3.274ArgIle: 3.274 ± 0.737
3.001ArgLys: 3.001 ± 0.903
2.183ArgLeu: 2.183 ± 0.729
1.091ArgMet: 1.091 ± 0.496
1.637ArgAsn: 1.637 ± 0.566
0.819ArgPro: 0.819 ± 0.462
1.091ArgGln: 1.091 ± 0.472
0.819ArgArg: 0.819 ± 0.526
2.729ArgSer: 2.729 ± 0.729
0.819ArgThr: 0.819 ± 0.565
2.729ArgVal: 2.729 ± 0.891
0.546ArgTrp: 0.546 ± 0.387
2.183ArgTyr: 2.183 ± 0.761
0.0ArgXaa: 0.0 ± 0.0
Ser
3.547SerAla: 3.547 ± 1.186
0.273SerCys: 0.273 ± 0.334
2.729SerAsp: 2.729 ± 0.736
5.457SerGlu: 5.457 ± 1.559
3.82SerPhe: 3.82 ± 1.091
3.001SerGly: 3.001 ± 1.157
0.546SerHis: 0.546 ± 0.345
3.001SerIle: 3.001 ± 0.561
6.276SerLys: 6.276 ± 1.226
6.548SerLeu: 6.548 ± 1.08
1.091SerMet: 1.091 ± 0.679
4.366SerAsn: 4.366 ± 1.095
2.183SerPro: 2.183 ± 0.737
1.91SerGln: 1.91 ± 0.554
3.82SerArg: 3.82 ± 0.689
6.003SerSer: 6.003 ± 1.084
3.82SerThr: 3.82 ± 0.67
4.911SerVal: 4.911 ± 2.064
0.273SerTrp: 0.273 ± 0.294
4.911SerTyr: 4.911 ± 1.374
0.0SerXaa: 0.0 ± 0.0
Thr
4.638ThrAla: 4.638 ± 1.884
0.546ThrCys: 0.546 ± 0.309
1.637ThrAsp: 1.637 ± 0.801
2.729ThrGlu: 2.729 ± 0.728
4.366ThrPhe: 4.366 ± 1.056
2.183ThrGly: 2.183 ± 0.816
0.819ThrHis: 0.819 ± 0.481
6.548ThrIle: 6.548 ± 1.115
5.184ThrLys: 5.184 ± 1.216
4.911ThrLeu: 4.911 ± 1.413
0.546ThrMet: 0.546 ± 0.364
3.001ThrAsn: 3.001 ± 0.853
2.456ThrPro: 2.456 ± 0.806
1.91ThrGln: 1.91 ± 0.606
3.274ThrArg: 3.274 ± 0.722
5.184ThrSer: 5.184 ± 1.384
3.001ThrThr: 3.001 ± 1.244
3.547ThrVal: 3.547 ± 1.113
0.273ThrTrp: 0.273 ± 0.273
3.274ThrTyr: 3.274 ± 0.846
0.0ThrXaa: 0.0 ± 0.0
Val
2.456ValAla: 2.456 ± 0.953
0.546ValCys: 0.546 ± 0.417
2.456ValAsp: 2.456 ± 0.704
3.547ValGlu: 3.547 ± 0.999
3.82ValPhe: 3.82 ± 0.912
4.366ValGly: 4.366 ± 1.062
0.273ValHis: 0.273 ± 0.25
3.547ValIle: 3.547 ± 0.718
5.184ValLys: 5.184 ± 0.849
7.367ValLeu: 7.367 ± 1.423
1.091ValMet: 1.091 ± 0.737
3.274ValAsn: 3.274 ± 0.813
1.91ValPro: 1.91 ± 0.712
1.637ValGln: 1.637 ± 0.56
2.456ValArg: 2.456 ± 0.953
4.366ValSer: 4.366 ± 0.768
4.366ValThr: 4.366 ± 0.873
6.276ValVal: 6.276 ± 1.611
0.273ValTrp: 0.273 ± 0.273
1.091ValTyr: 1.091 ± 0.514
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.273TrpAsp: 0.273 ± 0.273
1.091TrpGlu: 1.091 ± 0.499
0.273TrpPhe: 0.273 ± 0.22
0.546TrpGly: 0.546 ± 0.379
0.0TrpHis: 0.0 ± 0.0
0.273TrpIle: 0.273 ± 0.273
0.0TrpLys: 0.0 ± 0.0
1.637TrpLeu: 1.637 ± 0.77
0.0TrpMet: 0.0 ± 0.0
0.273TrpAsn: 0.273 ± 0.265
0.0TrpPro: 0.0 ± 0.0
0.273TrpGln: 0.273 ± 0.273
0.546TrpArg: 0.546 ± 0.402
1.091TrpSer: 1.091 ± 0.424
0.273TrpThr: 0.273 ± 0.273
0.546TrpVal: 0.546 ± 0.429
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.364TyrAla: 1.364 ± 0.621
0.546TyrCys: 0.546 ± 0.367
2.183TyrAsp: 2.183 ± 0.663
3.82TyrGlu: 3.82 ± 0.906
4.638TyrPhe: 4.638 ± 1.058
3.001TyrGly: 3.001 ± 1.155
1.364TyrHis: 1.364 ± 0.53
4.093TyrIle: 4.093 ± 1.356
3.274TyrLys: 3.274 ± 1.251
5.184TyrLeu: 5.184 ± 1.264
0.819TyrMet: 0.819 ± 0.512
2.729TyrAsn: 2.729 ± 1.088
1.091TyrPro: 1.091 ± 0.64
1.91TyrGln: 1.91 ± 0.638
1.637TyrArg: 1.637 ± 0.678
3.82TyrSer: 3.82 ± 1.187
1.91TyrThr: 1.91 ± 0.718
2.456TyrVal: 2.456 ± 0.96
0.273TyrTrp: 0.273 ± 0.27
1.91TyrTyr: 1.91 ± 0.694
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.273XaaLys: 0.273 ± 0.273
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 29 proteins (3666 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski