Amino acid dipeptide frequency for Clostridium phage phiCDKH01

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
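The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to be one-letter strings, and normalising by the total number of overlapping dipeptides is an assumption, since the page does not state how the counts are normalised.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: frequency = count / total number of dipeptides * 1000.
    """
    counts = Counter()
    for seq in proteins:
        # Map every letter outside the 20 standard ones to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_permille > expected:
        return "over"    # would be marked red
    if freq_permille < expected:
        return "under"   # would be marked blue
    return "expected"


if __name__ == "__main__":
    toy = ["MKKLEE", "MEEKX"]  # toy sequences, not from this proteome
    freqs = dipeptide_permille(toy)
    for dipeptide in ("EE", "KK", "WW"):
        print(dipeptide, round(freqs[dipeptide], 3), classify(freqs[dipeptide]))
```

With the values from the table, `classify(11.232)` (GluGlu) gives "over" and `classify(0.462)` (AlaCys) gives "under".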

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
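As a worked example of the unit conversion, the uniform expectation and the GluGlu value from the table can be written out as follows (plain arithmetic, nothing beyond the numbers already given on this page):

```latex
\[
\frac{1}{400} = 0.0025 = 0.25\,\% = 2.5\,\text{‰}
\]
\[
11.232\,\text{‰} = 11.232 \times 10^{-3} = 0.011232 \approx 1.12\,\%
\]
```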

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.462 ± 0.828
AlaCys: 0.462 ± 0.193
AlaAsp: 2.924 ± 0.539
AlaGlu: 3.462 ± 0.696
AlaPhe: 2.539 ± 0.782
AlaGly: 1.693 ± 0.378
AlaHis: 0.462 ± 0.22
AlaIle: 5.539 ± 0.926
AlaLys: 4.847 ± 0.975
AlaLeu: 4.693 ± 0.711
AlaMet: 2.462 ± 0.545
AlaAsn: 3.462 ± 0.481
AlaPro: 0.539 ± 0.229
AlaGln: 1.231 ± 0.33
AlaArg: 1.846 ± 0.391
AlaSer: 4.001 ± 0.624
AlaThr: 3.462 ± 0.934
AlaVal: 2.77 ± 0.665
AlaTrp: 0.462 ± 0.216
AlaTyr: 2.231 ± 0.506
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.462 ± 0.188
CysCys: 0.077 ± 0.075
CysAsp: 0.769 ± 0.245
CysGlu: 1.693 ± 0.761
CysPhe: 0.539 ± 0.183
CysGly: 0.077 ± 0.071
CysHis: 0.0 ± 0.0
CysIle: 0.692 ± 0.299
CysLys: 0.846 ± 0.251
CysLeu: 0.231 ± 0.142
CysMet: 0.154 ± 0.119
CysAsn: 0.615 ± 0.253
CysPro: 0.385 ± 0.184
CysGln: 0.0 ± 0.0
CysArg: 0.308 ± 0.149
CysSer: 0.385 ± 0.315
CysThr: 0.692 ± 0.241
CysVal: 0.385 ± 0.167
CysTrp: 0.077 ± 0.079
CysTyr: 0.539 ± 0.197
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.616 ± 0.702
AspCys: 0.308 ± 0.156
AspAsp: 2.847 ± 0.418
AspGlu: 4.693 ± 0.689
AspPhe: 2.539 ± 0.334
AspGly: 3.847 ± 0.539
AspHis: 0.154 ± 0.121
AspIle: 7.155 ± 0.991
AspLys: 5.924 ± 0.678
AspLeu: 5.77 ± 0.613
AspMet: 2.154 ± 0.519
AspAsn: 3.616 ± 0.579
AspPro: 0.539 ± 0.291
AspGln: 0.539 ± 0.173
AspArg: 1.923 ± 0.399
AspSer: 3.0 ± 0.449
AspThr: 3.693 ± 0.62
AspVal: 3.539 ± 0.483
AspTrp: 0.769 ± 0.256
AspTyr: 3.539 ± 0.645
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.693 ± 0.6
GluCys: 0.462 ± 0.201
GluAsp: 4.308 ± 0.532
GluGlu: 11.232 ± 2.011
GluPhe: 3.0 ± 0.579
GluGly: 3.616 ± 0.485
GluHis: 1.231 ± 0.245
GluIle: 9.309 ± 1.098
GluLys: 7.386 ± 0.842
GluLeu: 10.925 ± 1.471
GluMet: 2.539 ± 0.52
GluAsn: 7.232 ± 0.908
GluPro: 1.693 ± 0.331
GluGln: 3.924 ± 0.54
GluArg: 3.539 ± 0.369
GluSer: 4.847 ± 0.873
GluThr: 3.693 ± 1.221
GluVal: 5.847 ± 1.064
GluTrp: 0.923 ± 0.265
GluTyr: 3.0 ± 0.54
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.154 ± 0.309
PheCys: 0.385 ± 0.167
PheAsp: 2.308 ± 0.583
PheGlu: 3.231 ± 0.46
PhePhe: 1.077 ± 0.248
PheGly: 2.0 ± 0.445
PheHis: 0.539 ± 0.267
PheIle: 4.308 ± 0.909
PheLys: 3.693 ± 0.488
PheLeu: 2.308 ± 0.419
PheMet: 1.077 ± 0.27
PheAsn: 3.462 ± 0.604
PhePro: 0.769 ± 0.255
PheGln: 0.615 ± 0.177
PheArg: 1.0 ± 0.305
PheSer: 2.77 ± 0.478
PheThr: 2.539 ± 0.462
PheVal: 1.308 ± 0.272
PheTrp: 0.385 ± 0.227
PheTyr: 1.0 ± 0.296
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.77 ± 0.581
GlyCys: 0.615 ± 0.2
GlyAsp: 2.231 ± 0.328
GlyGlu: 4.078 ± 0.572
GlyPhe: 1.616 ± 0.344
GlyGly: 2.077 ± 0.376
GlyHis: 0.462 ± 0.171
GlyIle: 4.078 ± 0.436
GlyLys: 7.001 ± 0.943
GlyLeu: 3.77 ± 0.528
GlyMet: 1.77 ± 0.394
GlyAsn: 3.693 ± 0.706
GlyPro: 1.154 ± 0.297
GlyGln: 2.154 ± 0.361
GlyArg: 1.539 ± 0.3
GlySer: 3.847 ± 0.536
GlyThr: 3.077 ± 0.598
GlyVal: 2.308 ± 0.465
GlyTrp: 0.539 ± 0.198
GlyTyr: 2.616 ± 0.454
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.692 ± 0.214
HisCys: 0.0 ± 0.0
HisAsp: 0.539 ± 0.209
HisGlu: 1.077 ± 0.291
HisPhe: 0.385 ± 0.159
HisGly: 0.385 ± 0.226
HisHis: 0.231 ± 0.113
HisIle: 1.0 ± 0.252
HisLys: 0.846 ± 0.235
HisLeu: 0.692 ± 0.251
HisMet: 0.077 ± 0.073
HisAsn: 0.462 ± 0.151
HisPro: 0.539 ± 0.272
HisGln: 0.385 ± 0.146
HisArg: 0.231 ± 0.126
HisSer: 0.692 ± 0.264
HisThr: 1.0 ± 0.374
HisVal: 0.692 ± 0.221
HisTrp: 0.077 ± 0.079
HisTyr: 0.308 ± 0.165
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.77 ± 0.863
IleCys: 1.77 ± 0.761
IleAsp: 5.693 ± 0.56
IleGlu: 10.309 ± 1.022
IlePhe: 2.539 ± 0.515
IleGly: 4.616 ± 0.594
IleHis: 0.692 ± 0.203
IleIle: 6.309 ± 0.8
IleLys: 10.386 ± 0.967
IleLeu: 6.924 ± 0.703
IleMet: 0.923 ± 0.271
IleAsn: 6.386 ± 0.966
IlePro: 2.308 ± 0.488
IleGln: 1.616 ± 0.332
IleArg: 3.616 ± 0.856
IleSer: 6.232 ± 0.707
IleThr: 5.616 ± 0.543
IleVal: 5.309 ± 0.671
IleTrp: 0.539 ± 0.201
IleTyr: 2.924 ± 0.529
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.309 ± 0.779
LysCys: 0.539 ± 0.217
LysAsp: 7.232 ± 0.601
LysGlu: 10.54 ± 1.03
LysPhe: 3.693 ± 0.61
LysGly: 4.847 ± 0.664
LysHis: 0.923 ± 0.33
LysIle: 8.54 ± 0.755
LysLys: 11.54 ± 1.357
LysLeu: 7.54 ± 0.777
LysMet: 3.077 ± 0.428
LysAsn: 6.77 ± 0.797
LysPro: 2.0 ± 0.465
LysGln: 3.693 ± 0.623
LysArg: 2.77 ± 0.521
LysSer: 7.309 ± 1.147
LysThr: 5.385 ± 0.703
LysVal: 5.847 ± 0.726
LysTrp: 1.385 ± 0.359
LysTyr: 4.847 ± 0.55
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.847 ± 1.166
LeuCys: 0.846 ± 0.213
LeuAsp: 5.309 ± 0.483
LeuGlu: 6.693 ± 0.774
LeuPhe: 2.616 ± 0.585
LeuGly: 4.154 ± 0.804
LeuHis: 1.077 ± 0.243
LeuIle: 6.001 ± 0.999
LeuLys: 9.232 ± 0.644
LeuLeu: 5.77 ± 0.897
LeuMet: 1.693 ± 0.418
LeuAsn: 5.385 ± 0.611
LeuPro: 2.231 ± 0.308
LeuGln: 3.231 ± 0.487
LeuArg: 3.462 ± 0.48
LeuSer: 6.539 ± 0.709
LeuThr: 5.078 ± 0.685
LeuVal: 3.231 ± 0.489
LeuTrp: 0.692 ± 0.237
LeuTyr: 3.77 ± 0.674
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.0 ± 0.848
MetCys: 0.154 ± 0.116
MetAsp: 1.154 ± 0.291
MetGlu: 2.077 ± 0.334
MetPhe: 1.231 ± 0.289
MetGly: 1.154 ± 0.204
MetHis: 0.231 ± 0.143
MetIle: 1.154 ± 0.288
MetLys: 1.693 ± 0.358
MetLeu: 1.616 ± 0.337
MetMet: 0.539 ± 0.214
MetAsn: 2.077 ± 0.425
MetPro: 0.846 ± 0.252
MetGln: 1.0 ± 0.26
MetArg: 0.539 ± 0.213
MetSer: 2.231 ± 0.609
MetThr: 1.308 ± 0.339
MetVal: 0.923 ± 0.251
MetTrp: 0.231 ± 0.151
MetTyr: 1.154 ± 0.312
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.693 ± 0.472
AsnCys: 0.539 ± 0.161
AsnAsp: 3.0 ± 0.462
AsnGlu: 7.617 ± 1.682
AsnPhe: 2.539 ± 0.542
AsnGly: 4.77 ± 0.795
AsnHis: 0.385 ± 0.161
AsnIle: 7.078 ± 1.093
AsnLys: 9.309 ± 0.791
AsnLeu: 4.616 ± 0.758
AsnMet: 1.154 ± 0.283
AsnAsn: 5.924 ± 1.025
AsnPro: 1.846 ± 0.431
AsnGln: 2.693 ± 0.547
AsnArg: 2.539 ± 0.386
AsnSer: 4.924 ± 0.881
AsnThr: 3.539 ± 0.698
AsnVal: 2.539 ± 0.602
AsnTrp: 0.615 ± 0.256
AsnTyr: 3.077 ± 0.544
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.846 ± 0.216
ProCys: 0.462 ± 0.218
ProAsp: 1.539 ± 0.356
ProGlu: 1.308 ± 0.383
ProPhe: 1.154 ± 0.242
ProGly: 0.923 ± 0.302
ProHis: 0.385 ± 0.185
ProIle: 1.539 ± 0.421
ProLys: 2.308 ± 0.426
ProLeu: 1.231 ± 0.373
ProMet: 0.846 ± 0.263
ProAsn: 1.462 ± 0.364
ProPro: 0.154 ± 0.111
ProGln: 0.846 ± 0.202
ProArg: 1.077 ± 0.302
ProSer: 1.308 ± 0.295
ProThr: 1.308 ± 0.284
ProVal: 1.154 ± 0.316
ProTrp: 0.231 ± 0.163
ProTyr: 0.615 ± 0.203
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.846 ± 0.379
GlnCys: 0.231 ± 0.142
GlnAsp: 1.616 ± 0.297
GlnGlu: 2.539 ± 0.426
GlnPhe: 1.077 ± 0.227
GlnGly: 2.308 ± 0.416
GlnHis: 0.615 ± 0.178
GlnIle: 2.924 ± 0.541
GlnLys: 3.077 ± 0.789
GlnLeu: 2.616 ± 0.506
GlnMet: 0.692 ± 0.247
GlnAsn: 1.923 ± 0.354
GlnPro: 0.615 ± 0.194
GlnGln: 1.154 ± 0.271
GlnArg: 1.308 ± 0.252
GlnSer: 1.385 ± 0.335
GlnThr: 2.385 ± 0.396
GlnVal: 1.77 ± 0.367
GlnTrp: 0.154 ± 0.105
GlnTyr: 1.77 ± 0.414
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.308 ± 0.335
ArgCys: 0.385 ± 0.136
ArgAsp: 1.846 ± 0.351
ArgGlu: 4.001 ± 0.862
ArgPhe: 1.0 ± 0.236
ArgGly: 2.077 ± 0.45
ArgHis: 0.231 ± 0.13
ArgIle: 3.385 ± 0.607
ArgLys: 4.231 ± 1.194
ArgLeu: 3.616 ± 0.612
ArgMet: 0.846 ± 0.263
ArgAsn: 2.77 ± 0.578
ArgPro: 0.539 ± 0.351
ArgGln: 1.0 ± 0.329
ArgArg: 1.077 ± 0.31
ArgSer: 1.693 ± 0.439
ArgThr: 1.462 ± 0.296
ArgVal: 2.154 ± 0.372
ArgTrp: 0.462 ± 0.174
ArgTyr: 1.385 ± 0.255
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.462 ± 0.919
SerCys: 0.308 ± 0.178
SerAsp: 4.154 ± 0.643
SerGlu: 5.309 ± 1.433
SerPhe: 2.308 ± 0.398
SerGly: 3.77 ± 0.531
SerHis: 1.154 ± 0.304
SerIle: 5.616 ± 0.767
SerLys: 6.463 ± 0.718
SerLeu: 5.309 ± 0.713
SerMet: 2.154 ± 0.355
SerAsn: 4.462 ± 0.565
SerPro: 1.693 ± 0.356
SerGln: 2.0 ± 0.397
SerArg: 2.462 ± 0.412
SerSer: 4.616 ± 0.824
SerThr: 4.539 ± 0.775
SerVal: 3.539 ± 0.547
SerTrp: 0.846 ± 0.193
SerTyr: 2.462 ± 0.415
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.847 ± 0.893
ThrCys: 0.231 ± 0.112
ThrAsp: 4.078 ± 0.566
ThrGlu: 4.693 ± 0.418
ThrPhe: 2.077 ± 0.408
ThrGly: 4.078 ± 0.614
ThrHis: 0.846 ± 0.326
ThrIle: 5.847 ± 0.858
ThrLys: 4.924 ± 0.649
ThrLeu: 4.924 ± 0.697
ThrMet: 0.769 ± 0.234
ThrAsn: 3.847 ± 0.68
ThrPro: 0.846 ± 0.219
ThrGln: 2.0 ± 0.401
ThrArg: 2.693 ± 1.178
ThrSer: 4.308 ± 0.762
ThrThr: 4.539 ± 0.865
ThrVal: 2.539 ± 0.468
ThrTrp: 0.385 ± 0.178
ThrTyr: 2.77 ± 0.493
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.693 ± 0.496
ValCys: 0.615 ± 0.243
ValAsp: 4.154 ± 0.645
ValGlu: 4.154 ± 0.8
ValPhe: 1.846 ± 0.431
ValGly: 2.924 ± 0.552
ValHis: 0.462 ± 0.148
ValIle: 4.924 ± 0.558
ValLys: 4.539 ± 0.616
ValLeu: 4.385 ± 0.573
ValMet: 0.769 ± 0.277
ValAsn: 3.693 ± 0.343
ValPro: 0.692 ± 0.23
ValGln: 1.385 ± 0.333
ValArg: 1.77 ± 0.488
ValSer: 3.616 ± 0.401
ValThr: 3.231 ± 0.409
ValVal: 3.308 ± 0.399
ValTrp: 0.077 ± 0.063
ValTyr: 2.231 ± 0.493
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.231 ± 0.206
TrpCys: 0.077 ± 0.063
TrpAsp: 1.077 ± 0.44
TrpGlu: 0.462 ± 0.202
TrpPhe: 0.308 ± 0.18
TrpGly: 0.462 ± 0.154
TrpHis: 0.0 ± 0.0
TrpIle: 0.923 ± 0.307
TrpLys: 0.846 ± 0.293
TrpLeu: 1.077 ± 0.257
TrpMet: 0.077 ± 0.083
TrpAsn: 1.0 ± 0.485
TrpPro: 0.0 ± 0.0
TrpGln: 0.615 ± 0.214
TrpArg: 0.154 ± 0.113
TrpSer: 0.539 ± 0.264
TrpThr: 0.615 ± 0.173
TrpVal: 0.385 ± 0.153
TrpTrp: 0.539 ± 0.369
TrpTyr: 0.385 ± 0.175
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.846 ± 0.397
TyrCys: 0.462 ± 0.155
TyrAsp: 2.385 ± 0.42
TyrGlu: 3.231 ± 0.558
TyrPhe: 1.846 ± 0.556
TyrGly: 1.616 ± 0.362
TyrHis: 0.231 ± 0.147
TyrIle: 4.154 ± 0.796
TyrLys: 4.77 ± 0.766
TyrLeu: 3.847 ± 0.63
TyrMet: 0.462 ± 0.195
TyrAsn: 3.154 ± 0.57
TyrPro: 1.308 ± 0.323
TyrGln: 1.923 ± 0.36
TyrArg: 1.616 ± 0.343
TyrSer: 2.385 ± 0.478
TyrThr: 3.077 ± 0.78
TyrVal: 1.923 ± 0.445
TyrTrp: 0.385 ± 0.207
TyrTyr: 1.616 ± 0.403
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (12999 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
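Bootstrapping at the protein level means that whole proteins, not individual residues, are resampled with replacement. The sketch below illustrates one way to do this; it is not the database's implementation. The function names are hypothetical, and reporting the standard deviation of the resampled frequencies as the error is an assumption, since the note above does not say which statistic is used.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide's frequency.

    Each resample draws len(proteins) proteins with replacement and
    recomputes the frequency. Assumption: the error is the sample
    standard deviation over the resampled frequencies.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # a missing key counts as 0
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKLEE", "MEEKLL", "MKEEKK"]  # toy sequences, not from this proteome
    print(round(bootstrap_error(toy, "EE"), 3))
```

Resampling whole proteins keeps the dipeptides of one protein together, so the estimate reflects variation between proteins rather than between individual positions.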

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski