Amino acid dipepetide frequency for Pectobacterium phage Clickz_B3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.26AlaAla: 12.26 ± 1.474
0.881AlaCys: 0.881 ± 0.327
5.286AlaAsp: 5.286 ± 0.632
5.212AlaGlu: 5.212 ± 0.872
3.083AlaPhe: 3.083 ± 0.54
6.534AlaGly: 6.534 ± 0.669
2.349AlaHis: 2.349 ± 0.523
2.936AlaIle: 2.936 ± 0.478
4.698AlaLys: 4.698 ± 0.629
9.323AlaLeu: 9.323 ± 0.668
2.716AlaMet: 2.716 ± 0.408
3.744AlaAsn: 3.744 ± 0.624
3.744AlaPro: 3.744 ± 0.591
5.946AlaGln: 5.946 ± 0.848
4.698AlaArg: 4.698 ± 0.707
6.68AlaSer: 6.68 ± 0.776
4.845AlaThr: 4.845 ± 0.582
7.488AlaVal: 7.488 ± 0.777
1.321AlaTrp: 1.321 ± 0.358
3.303AlaTyr: 3.303 ± 0.487
0.0AlaXaa: 0.0 ± 0.0
Cys
0.587CysAla: 0.587 ± 0.197
0.22CysCys: 0.22 ± 0.129
0.808CysAsp: 0.808 ± 0.296
0.294CysGlu: 0.294 ± 0.139
0.22CysPhe: 0.22 ± 0.136
0.808CysGly: 0.808 ± 0.235
0.44CysHis: 0.44 ± 0.189
0.881CysIle: 0.881 ± 0.261
0.22CysLys: 0.22 ± 0.127
0.808CysLeu: 0.808 ± 0.221
0.661CysMet: 0.661 ± 0.207
0.734CysAsn: 0.734 ± 0.259
0.661CysPro: 0.661 ± 0.244
0.367CysGln: 0.367 ± 0.132
0.514CysArg: 0.514 ± 0.186
0.808CysSer: 0.808 ± 0.28
1.028CysThr: 1.028 ± 0.319
0.808CysVal: 0.808 ± 0.27
0.294CysTrp: 0.294 ± 0.16
0.514CysTyr: 0.514 ± 0.187
0.0CysXaa: 0.0 ± 0.0
Asp
6.974AspAla: 6.974 ± 0.629
0.661AspCys: 0.661 ± 0.257
3.964AspAsp: 3.964 ± 0.618
3.303AspGlu: 3.303 ± 0.491
2.055AspPhe: 2.055 ± 0.315
4.698AspGly: 4.698 ± 0.71
0.661AspHis: 0.661 ± 0.25
3.891AspIle: 3.891 ± 0.412
2.716AspLys: 2.716 ± 0.596
4.919AspLeu: 4.919 ± 0.549
2.496AspMet: 2.496 ± 0.4
2.569AspAsn: 2.569 ± 0.455
2.055AspPro: 2.055 ± 0.351
1.468AspGln: 1.468 ± 0.423
3.083AspArg: 3.083 ± 0.492
4.111AspSer: 4.111 ± 0.557
4.405AspThr: 4.405 ± 0.537
4.405AspVal: 4.405 ± 0.528
1.615AspTrp: 1.615 ± 0.314
1.982AspTyr: 1.982 ± 0.417
0.0AspXaa: 0.0 ± 0.0
Glu
5.065GluAla: 5.065 ± 0.714
0.661GluCys: 0.661 ± 0.284
3.744GluAsp: 3.744 ± 0.512
3.303GluGlu: 3.303 ± 0.657
2.716GluPhe: 2.716 ± 0.44
2.79GluGly: 2.79 ± 0.433
1.175GluHis: 1.175 ± 0.372
2.349GluIle: 2.349 ± 0.445
2.496GluLys: 2.496 ± 0.465
5.212GluLeu: 5.212 ± 0.552
1.468GluMet: 1.468 ± 0.343
1.909GluAsn: 1.909 ± 0.346
1.101GluPro: 1.101 ± 0.317
3.083GluGln: 3.083 ± 0.448
2.863GluArg: 2.863 ± 0.538
2.569GluSer: 2.569 ± 0.389
2.936GluThr: 2.936 ± 0.5
3.671GluVal: 3.671 ± 0.679
0.734GluTrp: 0.734 ± 0.238
2.349GluTyr: 2.349 ± 0.386
0.0GluXaa: 0.0 ± 0.0
Phe
2.863PheAla: 2.863 ± 0.355
0.147PheCys: 0.147 ± 0.093
2.79PheAsp: 2.79 ± 0.516
1.542PheGlu: 1.542 ± 0.275
0.954PhePhe: 0.954 ± 0.299
2.569PheGly: 2.569 ± 0.43
0.514PheHis: 0.514 ± 0.204
1.542PheIle: 1.542 ± 0.293
1.688PheLys: 1.688 ± 0.377
2.202PheLeu: 2.202 ± 0.428
0.661PheMet: 0.661 ± 0.206
1.762PheAsn: 1.762 ± 0.388
1.175PhePro: 1.175 ± 0.301
1.468PheGln: 1.468 ± 0.284
1.688PheArg: 1.688 ± 0.34
1.982PheSer: 1.982 ± 0.375
1.321PheThr: 1.321 ± 0.36
2.643PheVal: 2.643 ± 0.471
0.367PheTrp: 0.367 ± 0.132
0.808PheTyr: 0.808 ± 0.306
0.0PheXaa: 0.0 ± 0.0
Gly
7.268GlyAla: 7.268 ± 0.836
1.321GlyCys: 1.321 ± 0.472
4.478GlyAsp: 4.478 ± 0.783
3.083GlyGlu: 3.083 ± 0.527
2.716GlyPhe: 2.716 ± 0.347
5.873GlyGly: 5.873 ± 0.663
0.734GlyHis: 0.734 ± 0.242
5.139GlyIle: 5.139 ± 0.553
3.671GlyLys: 3.671 ± 0.465
6.607GlyLeu: 6.607 ± 0.557
1.982GlyMet: 1.982 ± 0.3
3.744GlyAsn: 3.744 ± 0.586
1.175GlyPro: 1.175 ± 0.355
2.276GlyGln: 2.276 ± 0.457
3.524GlyArg: 3.524 ± 0.415
5.212GlySer: 5.212 ± 0.646
7.414GlyThr: 7.414 ± 0.7
6.313GlyVal: 6.313 ± 0.652
0.954GlyTrp: 0.954 ± 0.319
3.671GlyTyr: 3.671 ± 0.671
0.0GlyXaa: 0.0 ± 0.0
His
1.762HisAla: 1.762 ± 0.401
0.514HisCys: 0.514 ± 0.165
1.321HisAsp: 1.321 ± 0.31
0.954HisGlu: 0.954 ± 0.317
0.294HisPhe: 0.294 ± 0.156
1.542HisGly: 1.542 ± 0.395
0.22HisHis: 0.22 ± 0.108
1.028HisIle: 1.028 ± 0.27
1.028HisLys: 1.028 ± 0.347
1.835HisLeu: 1.835 ± 0.361
0.514HisMet: 0.514 ± 0.21
0.881HisAsn: 0.881 ± 0.21
0.954HisPro: 0.954 ± 0.243
0.808HisGln: 0.808 ± 0.263
1.101HisArg: 1.101 ± 0.256
1.101HisSer: 1.101 ± 0.275
1.468HisThr: 1.468 ± 0.645
1.468HisVal: 1.468 ± 0.352
0.587HisTrp: 0.587 ± 0.182
0.661HisTyr: 0.661 ± 0.276
0.0HisXaa: 0.0 ± 0.0
Ile
3.303IleAla: 3.303 ± 0.526
0.734IleCys: 0.734 ± 0.258
3.671IleAsp: 3.671 ± 0.699
2.716IleGlu: 2.716 ± 0.434
0.808IlePhe: 0.808 ± 0.218
2.643IleGly: 2.643 ± 0.454
0.954IleHis: 0.954 ± 0.262
1.909IleIle: 1.909 ± 0.4
2.569IleLys: 2.569 ± 0.391
3.817IleLeu: 3.817 ± 0.586
1.101IleMet: 1.101 ± 0.277
2.643IleAsn: 2.643 ± 0.587
2.202IlePro: 2.202 ± 0.278
2.349IleGln: 2.349 ± 0.327
1.688IleArg: 1.688 ± 0.329
2.79IleSer: 2.79 ± 0.327
4.184IleThr: 4.184 ± 0.61
2.349IleVal: 2.349 ± 0.444
0.587IleTrp: 0.587 ± 0.22
1.248IleTyr: 1.248 ± 0.275
0.0IleXaa: 0.0 ± 0.0
Lys
5.065LysAla: 5.065 ± 0.936
0.294LysCys: 0.294 ± 0.14
3.083LysAsp: 3.083 ± 0.413
3.377LysGlu: 3.377 ± 0.583
0.808LysPhe: 0.808 ± 0.277
3.157LysGly: 3.157 ± 0.56
0.954LysHis: 0.954 ± 0.306
1.248LysIle: 1.248 ± 0.265
2.423LysLys: 2.423 ± 0.699
4.919LysLeu: 4.919 ± 0.63
1.028LysMet: 1.028 ± 0.317
1.468LysAsn: 1.468 ± 0.377
1.982LysPro: 1.982 ± 0.322
2.643LysGln: 2.643 ± 0.46
3.01LysArg: 3.01 ± 0.548
2.569LysSer: 2.569 ± 0.347
1.688LysThr: 1.688 ± 0.357
3.45LysVal: 3.45 ± 0.59
0.661LysTrp: 0.661 ± 0.208
2.202LysTyr: 2.202 ± 0.369
0.0LysXaa: 0.0 ± 0.0
Leu
7.708LeuAla: 7.708 ± 0.8
1.175LeuCys: 1.175 ± 0.328
5.139LeuAsp: 5.139 ± 0.655
4.845LeuGlu: 4.845 ± 0.576
2.496LeuPhe: 2.496 ± 0.45
6.166LeuGly: 6.166 ± 0.696
2.423LeuHis: 2.423 ± 0.415
3.23LeuIle: 3.23 ± 0.609
3.891LeuLys: 3.891 ± 0.523
7.414LeuLeu: 7.414 ± 0.783
2.276LeuMet: 2.276 ± 0.264
4.551LeuAsn: 4.551 ± 0.502
4.845LeuPro: 4.845 ± 0.578
3.597LeuGln: 3.597 ± 0.586
5.139LeuArg: 5.139 ± 0.818
6.534LeuSer: 6.534 ± 0.685
4.992LeuThr: 4.992 ± 0.633
6.827LeuVal: 6.827 ± 0.518
0.661LeuTrp: 0.661 ± 0.247
3.303LeuTyr: 3.303 ± 0.472
0.0LeuXaa: 0.0 ± 0.0
Met
2.643MetAla: 2.643 ± 0.446
0.22MetCys: 0.22 ± 0.143
1.101MetAsp: 1.101 ± 0.298
0.954MetGlu: 0.954 ± 0.251
1.175MetPhe: 1.175 ± 0.275
2.276MetGly: 2.276 ± 0.367
0.587MetHis: 0.587 ± 0.213
0.808MetIle: 0.808 ± 0.25
0.808MetLys: 0.808 ± 0.231
2.276MetLeu: 2.276 ± 0.485
0.587MetMet: 0.587 ± 0.192
1.028MetAsn: 1.028 ± 0.32
1.615MetPro: 1.615 ± 0.494
2.129MetGln: 2.129 ± 0.402
2.276MetArg: 2.276 ± 0.456
1.909MetSer: 1.909 ± 0.376
1.248MetThr: 1.248 ± 0.281
1.982MetVal: 1.982 ± 0.418
0.22MetTrp: 0.22 ± 0.114
1.542MetTyr: 1.542 ± 0.3
0.0MetXaa: 0.0 ± 0.0
Asn
3.597AsnAla: 3.597 ± 0.567
0.734AsnCys: 0.734 ± 0.25
1.615AsnAsp: 1.615 ± 0.344
2.055AsnGlu: 2.055 ± 0.39
1.321AsnPhe: 1.321 ± 0.32
3.597AsnGly: 3.597 ± 0.549
0.661AsnHis: 0.661 ± 0.252
1.542AsnIle: 1.542 ± 0.39
2.863AsnLys: 2.863 ± 0.408
4.551AsnLeu: 4.551 ± 0.752
1.395AsnMet: 1.395 ± 0.284
1.982AsnAsn: 1.982 ± 0.333
2.276AsnPro: 2.276 ± 0.42
2.055AsnGln: 2.055 ± 0.402
2.423AsnArg: 2.423 ± 0.508
2.936AsnSer: 2.936 ± 0.674
3.891AsnThr: 3.891 ± 0.483
3.083AsnVal: 3.083 ± 0.53
0.734AsnTrp: 0.734 ± 0.244
0.661AsnTyr: 0.661 ± 0.246
0.0AsnXaa: 0.0 ± 0.0
Pro
4.258ProAla: 4.258 ± 0.49
0.147ProCys: 0.147 ± 0.099
4.038ProAsp: 4.038 ± 0.547
3.157ProGlu: 3.157 ± 0.461
0.734ProPhe: 0.734 ± 0.244
2.936ProGly: 2.936 ± 0.343
0.514ProHis: 0.514 ± 0.166
1.615ProIle: 1.615 ± 0.425
1.395ProLys: 1.395 ± 0.375
2.496ProLeu: 2.496 ± 0.432
1.248ProMet: 1.248 ± 0.285
1.101ProAsn: 1.101 ± 0.28
1.542ProPro: 1.542 ± 0.399
1.468ProGln: 1.468 ± 0.365
1.762ProArg: 1.762 ± 0.31
2.569ProSer: 2.569 ± 0.442
2.936ProThr: 2.936 ± 0.566
3.964ProVal: 3.964 ± 0.47
0.808ProTrp: 0.808 ± 0.283
1.688ProTyr: 1.688 ± 0.257
0.0ProXaa: 0.0 ± 0.0
Gln
6.387GlnAla: 6.387 ± 0.742
0.367GlnCys: 0.367 ± 0.17
2.423GlnAsp: 2.423 ± 0.488
2.496GlnGlu: 2.496 ± 0.484
1.762GlnPhe: 1.762 ± 0.3
4.038GlnGly: 4.038 ± 0.575
1.321GlnHis: 1.321 ± 0.28
1.542GlnIle: 1.542 ± 0.408
1.982GlnLys: 1.982 ± 0.4
3.964GlnLeu: 3.964 ± 0.606
1.395GlnMet: 1.395 ± 0.31
2.276GlnAsn: 2.276 ± 0.502
1.248GlnPro: 1.248 ± 0.338
3.597GlnGln: 3.597 ± 0.842
2.716GlnArg: 2.716 ± 0.539
2.716GlnSer: 2.716 ± 0.519
1.835GlnThr: 1.835 ± 0.395
3.23GlnVal: 3.23 ± 0.493
0.44GlnTrp: 0.44 ± 0.158
2.863GlnTyr: 2.863 ± 0.428
0.0GlnXaa: 0.0 ± 0.0
Arg
4.038ArgAla: 4.038 ± 0.442
0.661ArgCys: 0.661 ± 0.245
3.744ArgAsp: 3.744 ± 0.549
3.524ArgGlu: 3.524 ± 0.465
1.248ArgPhe: 1.248 ± 0.325
4.111ArgGly: 4.111 ± 0.6
1.175ArgHis: 1.175 ± 0.279
3.377ArgIle: 3.377 ± 0.56
2.569ArgLys: 2.569 ± 0.372
3.891ArgLeu: 3.891 ± 0.531
1.762ArgMet: 1.762 ± 0.455
3.01ArgAsn: 3.01 ± 0.474
1.542ArgPro: 1.542 ± 0.345
2.202ArgGln: 2.202 ± 0.365
4.478ArgArg: 4.478 ± 0.543
3.45ArgSer: 3.45 ± 0.654
3.01ArgThr: 3.01 ± 0.449
4.405ArgVal: 4.405 ± 0.445
0.881ArgTrp: 0.881 ± 0.214
2.423ArgTyr: 2.423 ± 0.455
0.0ArgXaa: 0.0 ± 0.0
Ser
7.414SerAla: 7.414 ± 0.932
0.734SerCys: 0.734 ± 0.194
3.083SerAsp: 3.083 ± 0.475
2.276SerGlu: 2.276 ± 0.446
2.055SerPhe: 2.055 ± 0.428
6.166SerGly: 6.166 ± 0.834
1.028SerHis: 1.028 ± 0.351
3.524SerIle: 3.524 ± 0.674
3.597SerLys: 3.597 ± 0.472
5.506SerLeu: 5.506 ± 0.675
1.835SerMet: 1.835 ± 0.408
2.569SerAsn: 2.569 ± 0.56
2.423SerPro: 2.423 ± 0.424
2.276SerGln: 2.276 ± 0.38
2.716SerArg: 2.716 ± 0.508
3.891SerSer: 3.891 ± 0.625
4.772SerThr: 4.772 ± 0.735
5.726SerVal: 5.726 ± 0.59
0.808SerTrp: 0.808 ± 0.314
1.688SerTyr: 1.688 ± 0.312
0.0SerXaa: 0.0 ± 0.0
Thr
6.534ThrAla: 6.534 ± 0.809
0.587ThrCys: 0.587 ± 0.227
3.964ThrAsp: 3.964 ± 0.639
3.377ThrGlu: 3.377 ± 0.39
1.615ThrPhe: 1.615 ± 0.349
6.68ThrGly: 6.68 ± 0.864
1.615ThrHis: 1.615 ± 0.448
2.129ThrIle: 2.129 ± 0.312
2.716ThrLys: 2.716 ± 0.359
5.579ThrLeu: 5.579 ± 0.603
0.881ThrMet: 0.881 ± 0.256
2.643ThrAsn: 2.643 ± 0.479
4.038ThrPro: 4.038 ± 0.655
1.909ThrGln: 1.909 ± 0.427
3.303ThrArg: 3.303 ± 0.505
4.698ThrSer: 4.698 ± 0.548
4.405ThrThr: 4.405 ± 0.898
5.139ThrVal: 5.139 ± 0.807
0.44ThrTrp: 0.44 ± 0.192
2.349ThrTyr: 2.349 ± 0.394
0.0ThrXaa: 0.0 ± 0.0
Val
5.726ValAla: 5.726 ± 0.596
0.808ValCys: 0.808 ± 0.276
4.478ValAsp: 4.478 ± 0.549
2.863ValGlu: 2.863 ± 0.376
2.716ValPhe: 2.716 ± 0.335
6.02ValGly: 6.02 ± 0.589
1.909ValHis: 1.909 ± 0.403
3.157ValIle: 3.157 ± 0.455
3.083ValLys: 3.083 ± 0.698
7.268ValLeu: 7.268 ± 0.904
1.762ValMet: 1.762 ± 0.34
3.083ValAsn: 3.083 ± 0.577
3.597ValPro: 3.597 ± 0.555
6.093ValGln: 6.093 ± 0.727
4.698ValArg: 4.698 ± 0.587
4.258ValSer: 4.258 ± 0.512
4.919ValThr: 4.919 ± 0.58
4.551ValVal: 4.551 ± 0.626
0.808ValTrp: 0.808 ± 0.267
2.863ValTyr: 2.863 ± 0.428
0.0ValXaa: 0.0 ± 0.0
Trp
0.808TrpAla: 0.808 ± 0.269
0.147TrpCys: 0.147 ± 0.104
0.587TrpAsp: 0.587 ± 0.219
0.881TrpGlu: 0.881 ± 0.289
0.734TrpPhe: 0.734 ± 0.239
1.321TrpGly: 1.321 ± 0.367
0.073TrpHis: 0.073 ± 0.062
0.294TrpIle: 0.294 ± 0.187
0.22TrpLys: 0.22 ± 0.131
1.762TrpLeu: 1.762 ± 0.401
0.367TrpMet: 0.367 ± 0.155
0.881TrpAsn: 0.881 ± 0.244
0.367TrpPro: 0.367 ± 0.176
0.881TrpGln: 0.881 ± 0.246
0.661TrpArg: 0.661 ± 0.2
0.808TrpSer: 0.808 ± 0.242
0.514TrpThr: 0.514 ± 0.185
1.175TrpVal: 1.175 ± 0.283
0.294TrpTrp: 0.294 ± 0.171
1.101TrpTyr: 1.101 ± 0.308
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.79TyrAla: 2.79 ± 0.436
0.661TyrCys: 0.661 ± 0.245
2.716TyrAsp: 2.716 ± 0.332
1.982TyrGlu: 1.982 ± 0.393
1.175TyrPhe: 1.175 ± 0.303
3.23TyrGly: 3.23 ± 0.611
0.734TyrHis: 0.734 ± 0.207
2.129TyrIle: 2.129 ± 0.373
1.395TyrLys: 1.395 ± 0.471
2.863TyrLeu: 2.863 ± 0.418
1.175TyrMet: 1.175 ± 0.309
1.395TyrAsn: 1.395 ± 0.376
1.835TyrPro: 1.835 ± 0.333
1.982TyrGln: 1.982 ± 0.349
3.157TyrArg: 3.157 ± 0.437
2.496TyrSer: 2.496 ± 0.392
2.863TyrThr: 2.863 ± 0.51
1.982TyrVal: 1.982 ± 0.382
0.661TyrTrp: 0.661 ± 0.308
1.395TyrTyr: 1.395 ± 0.362
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (13623 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski