Amino acid dipepetide frequency for Pasteurella phage PMP-GADVASU-IND

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.44AlaAla: 15.44 ± 2.091
0.866AlaCys: 0.866 ± 0.274
5.483AlaAsp: 5.483 ± 0.742
9.091AlaGlu: 9.091 ± 1.035
2.381AlaPhe: 2.381 ± 0.491
8.225AlaGly: 8.225 ± 0.95
1.804AlaHis: 1.804 ± 0.465
4.834AlaIle: 4.834 ± 0.544
4.473AlaLys: 4.473 ± 0.42
8.802AlaLeu: 8.802 ± 0.725
3.175AlaMet: 3.175 ± 0.48
3.319AlaAsn: 3.319 ± 0.457
5.483AlaPro: 5.483 ± 1.016
4.906AlaGln: 4.906 ± 0.849
6.421AlaArg: 6.421 ± 0.793
5.556AlaSer: 5.556 ± 0.712
5.772AlaThr: 5.772 ± 0.965
6.999AlaVal: 6.999 ± 0.792
1.876AlaTrp: 1.876 ± 0.324
2.814AlaTyr: 2.814 ± 0.529
0.0AlaXaa: 0.0 ± 0.0
Cys
0.433CysAla: 0.433 ± 0.276
0.144CysCys: 0.144 ± 0.099
0.216CysAsp: 0.216 ± 0.139
0.361CysGlu: 0.361 ± 0.17
0.289CysPhe: 0.289 ± 0.145
0.649CysGly: 0.649 ± 0.24
0.216CysHis: 0.216 ± 0.129
0.216CysIle: 0.216 ± 0.136
0.505CysLys: 0.505 ± 0.192
0.361CysLeu: 0.361 ± 0.161
0.216CysMet: 0.216 ± 0.119
0.433CysAsn: 0.433 ± 0.233
0.433CysPro: 0.433 ± 0.185
0.505CysGln: 0.505 ± 0.166
0.577CysArg: 0.577 ± 0.243
0.433CysSer: 0.433 ± 0.174
0.577CysThr: 0.577 ± 0.206
0.505CysVal: 0.505 ± 0.197
0.144CysTrp: 0.144 ± 0.095
0.289CysTyr: 0.289 ± 0.214
0.0CysXaa: 0.0 ± 0.0
Asp
7.071AspAla: 7.071 ± 0.722
0.505AspCys: 0.505 ± 0.295
4.329AspAsp: 4.329 ± 0.625
4.834AspGlu: 4.834 ± 0.659
2.02AspPhe: 2.02 ± 0.377
6.421AspGly: 6.421 ± 0.877
1.227AspHis: 1.227 ± 0.299
3.03AspIle: 3.03 ± 0.422
3.175AspLys: 3.175 ± 0.464
5.267AspLeu: 5.267 ± 0.645
1.948AspMet: 1.948 ± 0.417
2.381AspAsn: 2.381 ± 0.374
2.886AspPro: 2.886 ± 0.583
1.443AspGln: 1.443 ± 0.278
3.03AspArg: 3.03 ± 0.46
4.834AspSer: 4.834 ± 0.573
3.247AspThr: 3.247 ± 0.455
4.906AspVal: 4.906 ± 0.495
1.443AspTrp: 1.443 ± 0.281
1.948AspTyr: 1.948 ± 0.301
0.0AspXaa: 0.0 ± 0.0
Glu
7.071GluAla: 7.071 ± 0.729
0.505GluCys: 0.505 ± 0.194
3.608GluAsp: 3.608 ± 0.525
3.824GluGlu: 3.824 ± 0.614
2.165GluPhe: 2.165 ± 0.414
3.175GluGly: 3.175 ± 0.625
1.804GluHis: 1.804 ± 0.311
3.247GluIle: 3.247 ± 0.412
3.968GluLys: 3.968 ± 0.683
7.431GluLeu: 7.431 ± 0.838
1.659GluMet: 1.659 ± 0.288
2.67GluAsn: 2.67 ± 0.442
3.752GluPro: 3.752 ± 0.568
2.958GluGln: 2.958 ± 0.423
3.319GluArg: 3.319 ± 0.5
5.483GluSer: 5.483 ± 0.769
4.04GluThr: 4.04 ± 0.602
4.762GluVal: 4.762 ± 0.532
0.794GluTrp: 0.794 ± 0.172
2.165GluTyr: 2.165 ± 0.417
0.0GluXaa: 0.0 ± 0.0
Phe
2.309PheAla: 2.309 ± 0.492
0.289PheCys: 0.289 ± 0.146
2.886PheAsp: 2.886 ± 0.411
1.732PheGlu: 1.732 ± 0.399
1.01PhePhe: 1.01 ± 0.263
2.814PheGly: 2.814 ± 0.422
0.433PheHis: 0.433 ± 0.124
1.659PheIle: 1.659 ± 0.337
1.804PheLys: 1.804 ± 0.418
2.165PheLeu: 2.165 ± 0.395
0.505PheMet: 0.505 ± 0.164
1.443PheAsn: 1.443 ± 0.331
1.515PhePro: 1.515 ± 0.313
1.227PheGln: 1.227 ± 0.281
1.515PheArg: 1.515 ± 0.33
2.02PheSer: 2.02 ± 0.341
2.092PheThr: 2.092 ± 0.397
1.876PheVal: 1.876 ± 0.508
0.577PheTrp: 0.577 ± 0.188
0.722PheTyr: 0.722 ± 0.213
0.0PheXaa: 0.0 ± 0.0
Gly
7.215GlyAla: 7.215 ± 1.386
0.433GlyCys: 0.433 ± 0.204
5.195GlyAsp: 5.195 ± 0.712
4.834GlyGlu: 4.834 ± 0.488
2.958GlyPhe: 2.958 ± 0.495
6.71GlyGly: 6.71 ± 1.236
2.092GlyHis: 2.092 ± 0.436
4.185GlyIle: 4.185 ± 0.65
4.257GlyLys: 4.257 ± 0.67
6.277GlyLeu: 6.277 ± 0.645
2.02GlyMet: 2.02 ± 0.326
1.876GlyAsn: 1.876 ± 0.378
2.02GlyPro: 2.02 ± 0.367
2.525GlyGln: 2.525 ± 0.353
5.411GlyArg: 5.411 ± 0.738
4.978GlySer: 4.978 ± 0.638
4.185GlyThr: 4.185 ± 0.508
5.339GlyVal: 5.339 ± 0.634
1.659GlyTrp: 1.659 ± 0.472
2.742GlyTyr: 2.742 ± 0.393
0.0GlyXaa: 0.0 ± 0.0
His
2.453HisAla: 2.453 ± 0.422
0.0HisCys: 0.0 ± 0.0
1.299HisAsp: 1.299 ± 0.238
1.299HisGlu: 1.299 ± 0.386
0.433HisPhe: 0.433 ± 0.207
1.948HisGly: 1.948 ± 0.402
0.289HisHis: 0.289 ± 0.168
1.082HisIle: 1.082 ± 0.309
1.082HisLys: 1.082 ± 0.337
2.381HisLeu: 2.381 ± 0.442
0.361HisMet: 0.361 ± 0.14
0.938HisAsn: 0.938 ± 0.237
1.515HisPro: 1.515 ± 0.429
0.866HisGln: 0.866 ± 0.314
1.154HisArg: 1.154 ± 0.274
1.227HisSer: 1.227 ± 0.225
1.082HisThr: 1.082 ± 0.296
1.227HisVal: 1.227 ± 0.253
0.433HisTrp: 0.433 ± 0.197
0.794HisTyr: 0.794 ± 0.25
0.0HisXaa: 0.0 ± 0.0
Ile
4.978IleAla: 4.978 ± 0.589
0.361IleCys: 0.361 ± 0.178
5.051IleAsp: 5.051 ± 0.485
4.906IleGlu: 4.906 ± 0.562
0.938IlePhe: 0.938 ± 0.258
3.824IleGly: 3.824 ± 0.842
1.587IleHis: 1.587 ± 0.354
2.165IleIle: 2.165 ± 0.425
2.886IleLys: 2.886 ± 0.451
1.948IleLeu: 1.948 ± 0.33
0.722IleMet: 0.722 ± 0.218
2.02IleAsn: 2.02 ± 0.315
2.525IlePro: 2.525 ± 0.505
1.948IleGln: 1.948 ± 0.36
2.958IleArg: 2.958 ± 0.348
2.886IleSer: 2.886 ± 0.467
2.958IleThr: 2.958 ± 0.384
2.742IleVal: 2.742 ± 0.383
0.722IleTrp: 0.722 ± 0.246
1.299IleTyr: 1.299 ± 0.301
0.0IleXaa: 0.0 ± 0.0
Lys
5.7LysAla: 5.7 ± 0.904
0.289LysCys: 0.289 ± 0.167
3.752LysAsp: 3.752 ± 0.725
2.381LysGlu: 2.381 ± 0.433
1.515LysPhe: 1.515 ± 0.312
3.247LysGly: 3.247 ± 0.401
1.587LysHis: 1.587 ± 0.358
2.381LysIle: 2.381 ± 0.459
2.381LysLys: 2.381 ± 0.49
3.968LysLeu: 3.968 ± 0.555
1.443LysMet: 1.443 ± 0.342
2.237LysAsn: 2.237 ± 0.327
3.463LysPro: 3.463 ± 0.633
1.876LysGln: 1.876 ± 0.335
2.958LysArg: 2.958 ± 0.53
3.535LysSer: 3.535 ± 0.578
2.814LysThr: 2.814 ± 0.472
3.102LysVal: 3.102 ± 0.377
0.794LysTrp: 0.794 ± 0.214
1.154LysTyr: 1.154 ± 0.284
0.0LysXaa: 0.0 ± 0.0
Leu
8.153LeuAla: 8.153 ± 0.874
0.722LeuCys: 0.722 ± 0.234
5.123LeuAsp: 5.123 ± 0.653
5.339LeuGlu: 5.339 ± 0.698
1.876LeuPhe: 1.876 ± 0.313
5.7LeuGly: 5.7 ± 0.681
1.299LeuHis: 1.299 ± 0.358
4.329LeuIle: 4.329 ± 0.508
4.545LeuLys: 4.545 ± 0.724
5.123LeuLeu: 5.123 ± 0.579
1.227LeuMet: 1.227 ± 0.244
3.247LeuAsn: 3.247 ± 0.439
3.968LeuPro: 3.968 ± 0.58
2.958LeuGln: 2.958 ± 0.399
5.051LeuArg: 5.051 ± 0.65
5.772LeuSer: 5.772 ± 0.568
3.896LeuThr: 3.896 ± 0.447
5.195LeuVal: 5.195 ± 0.505
1.299LeuTrp: 1.299 ± 0.316
2.453LeuTyr: 2.453 ± 0.502
0.0LeuXaa: 0.0 ± 0.0
Met
2.309MetAla: 2.309 ± 0.388
0.144MetCys: 0.144 ± 0.11
1.299MetAsp: 1.299 ± 0.295
2.165MetGlu: 2.165 ± 0.38
1.082MetPhe: 1.082 ± 0.246
1.299MetGly: 1.299 ± 0.362
0.361MetHis: 0.361 ± 0.165
0.938MetIle: 0.938 ± 0.253
1.371MetLys: 1.371 ± 0.361
1.587MetLeu: 1.587 ± 0.311
0.649MetMet: 0.649 ± 0.218
1.082MetAsn: 1.082 ± 0.318
1.371MetPro: 1.371 ± 0.299
0.866MetGln: 0.866 ± 0.238
1.299MetArg: 1.299 ± 0.219
1.732MetSer: 1.732 ± 0.328
1.804MetThr: 1.804 ± 0.328
1.299MetVal: 1.299 ± 0.277
0.144MetTrp: 0.144 ± 0.086
0.072MetTyr: 0.072 ± 0.073
0.0MetXaa: 0.0 ± 0.0
Asn
4.257AsnAla: 4.257 ± 0.505
0.216AsnCys: 0.216 ± 0.131
2.02AsnAsp: 2.02 ± 0.425
2.237AsnGlu: 2.237 ± 0.405
0.794AsnPhe: 0.794 ± 0.248
4.04AsnGly: 4.04 ± 0.633
1.371AsnHis: 1.371 ± 0.292
1.732AsnIle: 1.732 ± 0.338
1.804AsnLys: 1.804 ± 0.392
3.247AsnLeu: 3.247 ± 0.452
0.866AsnMet: 0.866 ± 0.223
1.948AsnAsn: 1.948 ± 0.4
1.948AsnPro: 1.948 ± 0.405
1.371AsnGln: 1.371 ± 0.32
2.525AsnArg: 2.525 ± 0.576
2.381AsnSer: 2.381 ± 0.408
2.165AsnThr: 2.165 ± 0.429
2.092AsnVal: 2.092 ± 0.428
0.577AsnTrp: 0.577 ± 0.226
0.577AsnTyr: 0.577 ± 0.213
0.0AsnXaa: 0.0 ± 0.0
Pro
5.772ProAla: 5.772 ± 1.045
0.144ProCys: 0.144 ± 0.098
3.608ProAsp: 3.608 ± 0.403
3.319ProGlu: 3.319 ± 0.735
1.082ProPhe: 1.082 ± 0.26
4.69ProGly: 4.69 ± 0.656
1.154ProHis: 1.154 ± 0.356
2.237ProIle: 2.237 ± 0.317
1.948ProLys: 1.948 ± 0.348
3.175ProLeu: 3.175 ± 0.423
1.082ProMet: 1.082 ± 0.262
1.948ProAsn: 1.948 ± 0.356
1.443ProPro: 1.443 ± 0.362
1.732ProGln: 1.732 ± 0.465
2.237ProArg: 2.237 ± 0.449
2.381ProSer: 2.381 ± 0.485
3.968ProThr: 3.968 ± 0.592
3.968ProVal: 3.968 ± 0.573
1.082ProTrp: 1.082 ± 0.29
1.299ProTyr: 1.299 ± 0.306
0.0ProXaa: 0.0 ± 0.0
Gln
4.04GlnAla: 4.04 ± 0.512
0.433GlnCys: 0.433 ± 0.159
2.381GlnAsp: 2.381 ± 0.358
2.525GlnGlu: 2.525 ± 0.457
1.154GlnPhe: 1.154 ± 0.267
2.958GlnGly: 2.958 ± 0.371
0.361GlnHis: 0.361 ± 0.147
1.587GlnIle: 1.587 ± 0.319
1.443GlnLys: 1.443 ± 0.358
2.453GlnLeu: 2.453 ± 0.479
0.938GlnMet: 0.938 ± 0.269
1.299GlnAsn: 1.299 ± 0.27
2.092GlnPro: 2.092 ± 0.392
1.659GlnGln: 1.659 ± 0.371
2.814GlnArg: 2.814 ± 0.556
2.742GlnSer: 2.742 ± 0.614
2.092GlnThr: 2.092 ± 0.355
2.814GlnVal: 2.814 ± 0.4
0.649GlnTrp: 0.649 ± 0.199
0.433GlnTyr: 0.433 ± 0.224
0.0GlnXaa: 0.0 ± 0.0
Arg
5.844ArgAla: 5.844 ± 0.998
0.505ArgCys: 0.505 ± 0.268
4.04ArgAsp: 4.04 ± 0.571
3.608ArgGlu: 3.608 ± 0.695
1.659ArgPhe: 1.659 ± 0.372
3.102ArgGly: 3.102 ± 0.481
1.371ArgHis: 1.371 ± 0.313
3.175ArgIle: 3.175 ± 0.538
3.824ArgLys: 3.824 ± 0.685
5.628ArgLeu: 5.628 ± 0.555
1.154ArgMet: 1.154 ± 0.321
2.309ArgAsn: 2.309 ± 0.434
2.814ArgPro: 2.814 ± 0.462
2.165ArgGln: 2.165 ± 0.381
4.257ArgArg: 4.257 ± 0.752
3.824ArgSer: 3.824 ± 0.497
3.319ArgThr: 3.319 ± 0.592
3.391ArgVal: 3.391 ± 0.473
1.515ArgTrp: 1.515 ± 0.314
2.309ArgTyr: 2.309 ± 0.482
0.0ArgXaa: 0.0 ± 0.0
Ser
7.431SerAla: 7.431 ± 0.854
0.361SerCys: 0.361 ± 0.177
4.113SerAsp: 4.113 ± 0.601
4.762SerGlu: 4.762 ± 0.696
2.814SerPhe: 2.814 ± 0.443
5.844SerGly: 5.844 ± 0.817
1.01SerHis: 1.01 ± 0.223
3.535SerIle: 3.535 ± 0.442
3.896SerLys: 3.896 ± 0.541
4.618SerLeu: 4.618 ± 0.554
2.02SerMet: 2.02 ± 0.371
2.742SerAsn: 2.742 ± 0.422
2.381SerPro: 2.381 ± 0.356
2.237SerGln: 2.237 ± 0.335
4.04SerArg: 4.04 ± 0.539
4.401SerSer: 4.401 ± 0.7
4.257SerThr: 4.257 ± 0.769
3.824SerVal: 3.824 ± 0.607
1.154SerTrp: 1.154 ± 0.242
1.659SerTyr: 1.659 ± 0.354
0.0SerXaa: 0.0 ± 0.0
Thr
6.421ThrAla: 6.421 ± 0.689
0.577ThrCys: 0.577 ± 0.24
3.968ThrAsp: 3.968 ± 0.556
2.958ThrGlu: 2.958 ± 0.377
1.876ThrPhe: 1.876 ± 0.281
4.473ThrGly: 4.473 ± 0.533
1.443ThrHis: 1.443 ± 0.333
3.535ThrIle: 3.535 ± 0.403
1.876ThrLys: 1.876 ± 0.305
4.473ThrLeu: 4.473 ± 0.619
1.01ThrMet: 1.01 ± 0.229
1.804ThrAsn: 1.804 ± 0.392
3.247ThrPro: 3.247 ± 0.476
1.443ThrGln: 1.443 ± 0.348
3.175ThrArg: 3.175 ± 0.547
4.257ThrSer: 4.257 ± 0.676
3.752ThrThr: 3.752 ± 0.815
5.123ThrVal: 5.123 ± 0.727
1.659ThrTrp: 1.659 ± 0.297
1.732ThrTyr: 1.732 ± 0.386
0.0ThrXaa: 0.0 ± 0.0
Val
6.349ValAla: 6.349 ± 0.765
0.216ValCys: 0.216 ± 0.117
4.69ValAsp: 4.69 ± 0.763
4.473ValGlu: 4.473 ± 0.757
2.742ValPhe: 2.742 ± 0.465
4.545ValGly: 4.545 ± 0.614
1.443ValHis: 1.443 ± 0.428
3.463ValIle: 3.463 ± 0.586
3.535ValLys: 3.535 ± 0.603
4.04ValLeu: 4.04 ± 0.496
1.227ValMet: 1.227 ± 0.235
2.67ValAsn: 2.67 ± 0.391
2.958ValPro: 2.958 ± 0.471
2.309ValGln: 2.309 ± 0.303
4.185ValArg: 4.185 ± 0.585
5.411ValSer: 5.411 ± 0.576
4.834ValThr: 4.834 ± 0.662
5.339ValVal: 5.339 ± 0.919
1.371ValTrp: 1.371 ± 0.278
1.154ValTyr: 1.154 ± 0.276
0.0ValXaa: 0.0 ± 0.0
Trp
1.515TrpAla: 1.515 ± 0.339
0.289TrpCys: 0.289 ± 0.139
1.515TrpAsp: 1.515 ± 0.412
0.866TrpGlu: 0.866 ± 0.239
0.938TrpPhe: 0.938 ± 0.208
1.01TrpGly: 1.01 ± 0.225
0.361TrpHis: 0.361 ± 0.159
1.01TrpIle: 1.01 ± 0.285
0.794TrpLys: 0.794 ± 0.222
1.876TrpLeu: 1.876 ± 0.303
0.433TrpMet: 0.433 ± 0.169
0.866TrpAsn: 0.866 ± 0.211
1.01TrpPro: 1.01 ± 0.265
1.082TrpGln: 1.082 ± 0.283
1.154TrpArg: 1.154 ± 0.265
1.587TrpSer: 1.587 ± 0.3
0.794TrpThr: 0.794 ± 0.233
0.938TrpVal: 0.938 ± 0.21
0.433TrpTrp: 0.433 ± 0.168
0.144TrpTyr: 0.144 ± 0.102
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.958TyrAla: 2.958 ± 0.402
0.505TyrCys: 0.505 ± 0.197
1.371TyrAsp: 1.371 ± 0.418
2.886TyrGlu: 2.886 ± 0.52
0.866TyrPhe: 0.866 ± 0.3
2.237TyrGly: 2.237 ± 0.388
0.649TyrHis: 0.649 ± 0.238
1.082TyrIle: 1.082 ± 0.331
1.01TyrLys: 1.01 ± 0.261
2.453TyrLeu: 2.453 ± 0.505
0.144TyrMet: 0.144 ± 0.111
0.938TyrAsn: 0.938 ± 0.25
1.443TyrPro: 1.443 ± 0.328
0.866TyrGln: 0.866 ± 0.249
1.732TyrArg: 1.732 ± 0.437
1.659TyrSer: 1.659 ± 0.341
1.154TyrThr: 1.154 ± 0.26
1.515TyrVal: 1.515 ± 0.352
0.289TyrTrp: 0.289 ± 0.14
0.577TyrTyr: 0.577 ± 0.223
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (13861 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski