Amino acid dipeptide frequency for Arthrobacter phage Elesar

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
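A dipeptide frequency can be obtained by sliding a window of two residues along each protein sequence. The snippet below is a minimal sketch, not the method used by the database: the function name `dipeptide_per_mille` and the toy sequences are hypothetical, and normalizing by the total number of dipeptides is an assumption (the values in the table may be normalized differently).

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein (never
    across protein boundaries) and normalized by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (one-letter codes): MAAL -> MA, AA, AL; AAG -> AA, AG
freqs = dipeptide_per_mille(["MAAL", "AAG"])
print(freqs["AA"])  # 2 of 5 dipeptides
```

The one-letter pairs used here (for example `AA`) correspond to the three-letter labels in the table (for example AlaAla).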

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
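The expected value under the uniform assumption is simple arithmetic and can be checked directly. The sketch below compares two values from the table against that baseline; the `classify` helper is hypothetical, and using exactly the uniform expectation as the red/blue threshold is an assumption about how the page colors its cells.

```python
N_STANDARD = 20  # standard amino acids

# Uniform expectation: 1 / 400 = 0.25 %, i.e. 2.5 per mille
expected_per_mille = 1000.0 / N_STANDARD ** 2

# With Xaa as a 21st symbol the baseline drops to 1 / 441 (about 2.27 per mille)
expected_with_xaa = 1000.0 / (N_STANDARD + 1) ** 2


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if observed_per_mille > expected:
        return "over"   # would be marked red
    if observed_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


print(classify(21.455))  # AlaAla value from the table
print(classify(0.073))   # CysCys value from the table
```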

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
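As a quick illustration of the unit conversion, the sketch below turns a per mille value from the table into a fraction and a percentage. The helper name `per_mille_to_fraction` is hypothetical.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala_fraction = per_mille_to_fraction(21.455)  # AlaAla value from the table
ala_ala_percent = ala_ala_fraction * 100.0        # roughly 2.15 %
print(ala_ala_fraction, ala_ala_percent)
```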

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 21.455 ± 2.104
AlaCys: 0.808 ± 0.261
AlaAsp: 8.964 ± 0.899
AlaGlu: 8.891 ± 1.096
AlaPhe: 2.939 ± 0.733
AlaGly: 11.683 ± 1.269
AlaHis: 2.057 ± 0.312
AlaIle: 4.702 ± 0.563
AlaLys: 5.805 ± 0.762
AlaLeu: 11.683 ± 0.855
AlaMet: 3.086 ± 0.473
AlaAsn: 3.527 ± 0.617
AlaPro: 5.584 ± 0.824
AlaGln: 5.143 ± 0.876
AlaArg: 8.964 ± 1.021
AlaSer: 5.731 ± 0.681
AlaThr: 7.715 ± 0.963
AlaVal: 9.772 ± 0.951
AlaTrp: 2.131 ± 0.379
AlaTyr: 1.396 ± 0.321
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.514 ± 0.203
CysCys: 0.073 ± 0.071
CysAsp: 0.441 ± 0.175
CysGlu: 0.588 ± 0.159
CysPhe: 0.0 ± 0.0
CysGly: 0.808 ± 0.245
CysHis: 0.147 ± 0.095
CysIle: 0.514 ± 0.185
CysLys: 0.073 ± 0.06
CysLeu: 0.661 ± 0.202
CysMet: 0.294 ± 0.135
CysAsn: 0.294 ± 0.121
CysPro: 0.441 ± 0.174
CysGln: 0.588 ± 0.234
CysArg: 0.661 ± 0.236
CysSer: 0.882 ± 0.285
CysThr: 0.588 ± 0.228
CysVal: 0.441 ± 0.209
CysTrp: 0.073 ± 0.079
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.127 ± 0.845
AspCys: 0.441 ± 0.185
AspAsp: 2.131 ± 0.359
AspGlu: 3.747 ± 0.796
AspPhe: 1.91 ± 0.401
AspGly: 6.172 ± 0.769
AspHis: 1.176 ± 0.362
AspIle: 2.939 ± 0.479
AspLys: 2.057 ± 0.454
AspLeu: 6.098 ± 0.597
AspMet: 0.955 ± 0.294
AspAsn: 1.837 ± 0.432
AspPro: 3.233 ± 0.455
AspGln: 1.029 ± 0.239
AspArg: 4.629 ± 0.589
AspSer: 2.425 ± 0.304
AspThr: 2.645 ± 0.514
AspVal: 4.409 ± 0.661
AspTrp: 1.176 ± 0.297
AspTyr: 1.47 ± 0.287
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.082 ± 1.235
GluCys: 0.22 ± 0.138
GluAsp: 4.702 ± 0.664
GluGlu: 4.482 ± 0.752
GluPhe: 1.543 ± 0.294
GluGly: 4.482 ± 0.701
GluHis: 1.396 ± 0.324
GluIle: 2.645 ± 0.46
GluLys: 2.351 ± 0.336
GluLeu: 5.143 ± 0.71
GluMet: 0.955 ± 0.216
GluAsn: 1.47 ± 0.384
GluPro: 2.572 ± 0.527
GluGln: 3.747 ± 0.659
GluArg: 3.38 ± 0.533
GluSer: 3.159 ± 0.513
GluThr: 3.086 ± 0.508
GluVal: 4.115 ± 0.687
GluTrp: 1.837 ± 0.466
GluTyr: 1.102 ± 0.295
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.453 ± 0.612
PheCys: 0.294 ± 0.14
PheAsp: 1.984 ± 0.341
PheGlu: 1.249 ± 0.332
PhePhe: 0.808 ± 0.213
PheGly: 3.306 ± 0.511
PheHis: 0.514 ± 0.174
PheIle: 1.176 ± 0.31
PheLys: 0.661 ± 0.276
PheLeu: 1.763 ± 0.257
PheMet: 0.588 ± 0.201
PheAsn: 0.882 ± 0.278
PhePro: 2.425 ± 0.368
PheGln: 0.882 ± 0.244
PheArg: 1.47 ± 0.34
PheSer: 1.249 ± 0.287
PheThr: 2.719 ± 0.667
PheVal: 1.984 ± 0.404
PheTrp: 0.147 ± 0.087
PheTyr: 0.367 ± 0.202
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.213 ± 1.201
GlyCys: 0.735 ± 0.241
GlyAsp: 3.821 ± 0.452
GlyGlu: 3.674 ± 0.508
GlyPhe: 3.453 ± 0.6
GlyGly: 8.082 ± 1.41
GlyHis: 1.176 ± 0.33
GlyIle: 3.821 ± 0.576
GlyLys: 3.233 ± 0.407
GlyLeu: 6.98 ± 0.794
GlyMet: 1.984 ± 0.365
GlyAsn: 3.233 ± 0.587
GlyPro: 3.894 ± 1.141
GlyGln: 2.572 ± 0.333
GlyArg: 5.364 ± 0.559
GlySer: 5.511 ± 0.678
GlyThr: 6.539 ± 0.885
GlyVal: 6.98 ± 0.692
GlyTrp: 2.204 ± 0.453
GlyTyr: 2.278 ± 0.512
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.396 ± 0.32
HisCys: 0.147 ± 0.103
HisAsp: 0.808 ± 0.258
HisGlu: 1.323 ± 0.306
HisPhe: 0.661 ± 0.194
HisGly: 1.323 ± 0.311
HisHis: 0.441 ± 0.151
HisIle: 1.176 ± 0.259
HisLys: 0.588 ± 0.21
HisLeu: 2.572 ± 0.438
HisMet: 0.367 ± 0.14
HisAsn: 0.294 ± 0.126
HisPro: 1.543 ± 0.308
HisGln: 0.882 ± 0.25
HisArg: 0.955 ± 0.278
HisSer: 1.176 ± 0.294
HisThr: 0.808 ± 0.257
HisVal: 1.47 ± 0.313
HisTrp: 0.441 ± 0.167
HisTyr: 0.367 ± 0.168
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.952 ± 0.664
IleCys: 0.294 ± 0.148
IleAsp: 3.6 ± 0.54
IleGlu: 2.572 ± 0.374
IlePhe: 1.543 ± 0.322
IleGly: 3.38 ± 0.503
IleHis: 0.882 ± 0.27
IleIle: 1.176 ± 0.244
IleLys: 1.543 ± 0.284
IleLeu: 1.837 ± 0.329
IleMet: 0.882 ± 0.281
IleAsn: 1.249 ± 0.239
IlePro: 2.057 ± 0.346
IleGln: 2.351 ± 0.392
IleArg: 3.306 ± 0.47
IleSer: 2.351 ± 0.368
IleThr: 4.702 ± 0.697
IleVal: 2.425 ± 0.366
IleTrp: 0.22 ± 0.123
IleTyr: 0.882 ± 0.308
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.319 ± 0.74
LysCys: 0.294 ± 0.134
LysAsp: 2.057 ± 0.415
LysGlu: 2.204 ± 0.488
LysPhe: 0.882 ± 0.23
LysGly: 2.204 ± 0.468
LysHis: 0.367 ± 0.133
LysIle: 0.955 ± 0.25
LysLys: 1.47 ± 0.374
LysLeu: 3.968 ± 0.632
LysMet: 0.588 ± 0.203
LysAsn: 0.735 ± 0.215
LysPro: 2.866 ± 0.36
LysGln: 1.029 ± 0.272
LysArg: 3.012 ± 0.455
LysSer: 1.616 ± 0.429
LysThr: 2.645 ± 0.459
LysVal: 2.939 ± 0.497
LysTrp: 0.367 ± 0.182
LysTyr: 0.955 ± 0.302
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.095 ± 0.742
LeuCys: 0.808 ± 0.252
LeuAsp: 4.409 ± 0.57
LeuGlu: 5.07 ± 0.559
LeuPhe: 1.543 ± 0.348
LeuGly: 6.907 ± 0.948
LeuHis: 1.249 ± 0.277
LeuIle: 4.409 ± 0.517
LeuLys: 3.012 ± 0.419
LeuLeu: 6.392 ± 0.975
LeuMet: 2.131 ± 0.4
LeuAsn: 2.498 ± 0.364
LeuPro: 5.511 ± 0.684
LeuGln: 2.792 ± 0.503
LeuArg: 5.364 ± 0.746
LeuSer: 4.335 ± 0.399
LeuThr: 5.878 ± 0.79
LeuVal: 6.172 ± 0.718
LeuTrp: 1.47 ± 0.269
LeuTyr: 1.543 ± 0.239
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.939 ± 0.569
MetCys: 0.294 ± 0.124
MetAsp: 0.735 ± 0.206
MetGlu: 0.808 ± 0.255
MetPhe: 0.588 ± 0.169
MetGly: 1.396 ± 0.341
MetHis: 0.588 ± 0.182
MetIle: 0.955 ± 0.308
MetLys: 0.735 ± 0.233
MetLeu: 1.47 ± 0.279
MetMet: 0.441 ± 0.164
MetAsn: 0.735 ± 0.193
MetPro: 1.47 ± 0.342
MetGln: 0.661 ± 0.175
MetArg: 1.029 ± 0.263
MetSer: 1.69 ± 0.382
MetThr: 1.69 ± 0.312
MetVal: 0.955 ± 0.23
MetTrp: 0.22 ± 0.108
MetTyr: 0.147 ± 0.106
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.747 ± 0.718
AsnCys: 0.0 ± 0.0
AsnAsp: 2.131 ± 0.321
AsnGlu: 1.837 ± 0.383
AsnPhe: 0.661 ± 0.251
AsnGly: 2.866 ± 0.434
AsnHis: 0.735 ± 0.234
AsnIle: 1.102 ± 0.303
AsnLys: 0.808 ± 0.269
AsnLeu: 2.131 ± 0.325
AsnMet: 0.294 ± 0.119
AsnAsn: 0.735 ± 0.293
AsnPro: 2.645 ± 0.409
AsnGln: 1.323 ± 0.278
AsnArg: 2.131 ± 0.372
AsnSer: 1.616 ± 0.465
AsnThr: 2.498 ± 0.399
AsnVal: 1.984 ± 0.489
AsnTrp: 0.955 ± 0.262
AsnTyr: 1.029 ± 0.394
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 9.037 ± 1.125
ProCys: 0.588 ± 0.19
ProAsp: 3.086 ± 0.454
ProGlu: 3.6 ± 0.638
ProPhe: 1.543 ± 0.361
ProGly: 4.996 ± 0.727
ProHis: 1.616 ± 0.353
ProIle: 2.498 ± 0.442
ProLys: 1.984 ± 0.545
ProLeu: 3.453 ± 0.443
ProMet: 0.955 ± 0.208
ProAsn: 1.323 ± 0.4
ProPro: 2.719 ± 0.718
ProGln: 1.984 ± 0.419
ProArg: 3.159 ± 0.573
ProSer: 3.38 ± 0.638
ProThr: 2.792 ± 0.349
ProVal: 4.702 ± 0.522
ProTrp: 0.441 ± 0.185
ProTyr: 0.735 ± 0.231
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.29 ± 0.791
GlnCys: 0.147 ± 0.103
GlnAsp: 2.057 ± 0.355
GlnGlu: 2.498 ± 0.463
GlnPhe: 1.029 ± 0.367
GlnGly: 2.351 ± 0.432
GlnHis: 0.882 ± 0.264
GlnIle: 2.057 ± 0.376
GlnLys: 1.616 ± 0.382
GlnLeu: 3.086 ± 0.596
GlnMet: 0.735 ± 0.224
GlnAsn: 1.102 ± 0.27
GlnPro: 2.204 ± 0.517
GlnGln: 2.278 ± 0.507
GlnArg: 1.984 ± 0.338
GlnSer: 2.131 ± 0.368
GlnThr: 2.645 ± 0.515
GlnVal: 2.204 ± 0.473
GlnTrp: 0.735 ± 0.199
GlnTyr: 0.955 ± 0.287
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.245 ± 0.74
ArgCys: 0.808 ± 0.275
ArgAsp: 4.262 ± 0.55
ArgGlu: 4.262 ± 0.732
ArgPhe: 1.323 ± 0.296
ArgGly: 3.894 ± 0.635
ArgHis: 1.543 ± 0.444
ArgIle: 2.866 ± 0.317
ArgLys: 2.792 ± 0.569
ArgLeu: 5.364 ± 0.585
ArgMet: 1.102 ± 0.307
ArgAsn: 2.719 ± 0.46
ArgPro: 3.821 ± 0.647
ArgGln: 2.572 ± 0.392
ArgArg: 4.996 ± 0.741
ArgSer: 3.821 ± 0.551
ArgThr: 4.115 ± 0.579
ArgVal: 4.996 ± 0.65
ArgTrp: 1.323 ± 0.32
ArgTyr: 1.47 ± 0.35
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.494 ± 0.806
SerCys: 0.367 ± 0.14
SerAsp: 3.453 ± 0.646
SerGlu: 2.939 ± 0.482
SerPhe: 1.102 ± 0.299
SerGly: 5.07 ± 0.545
SerHis: 0.588 ± 0.19
SerIle: 2.278 ± 0.392
SerLys: 2.204 ± 0.47
SerLeu: 4.409 ± 0.665
SerMet: 1.176 ± 0.23
SerAsn: 1.69 ± 0.393
SerPro: 2.939 ± 0.471
SerGln: 1.837 ± 0.307
SerArg: 2.057 ± 0.331
SerSer: 2.278 ± 0.494
SerThr: 4.409 ± 0.718
SerVal: 5.143 ± 0.535
SerTrp: 1.029 ± 0.331
SerTyr: 1.029 ± 0.274
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.184 ± 0.798
ThrCys: 0.441 ± 0.181
ThrAsp: 2.939 ± 0.434
ThrGlu: 4.262 ± 0.616
ThrPhe: 2.204 ± 0.458
ThrGly: 7.054 ± 1.015
ThrHis: 1.396 ± 0.332
ThrIle: 3.012 ± 0.577
ThrLys: 1.616 ± 0.269
ThrLeu: 5.364 ± 0.661
ThrMet: 1.102 ± 0.276
ThrAsn: 2.498 ± 0.68
ThrPro: 4.188 ± 0.596
ThrGln: 1.763 ± 0.311
ThrArg: 3.968 ± 0.549
ThrSer: 3.747 ± 0.589
ThrThr: 5.437 ± 0.748
ThrVal: 6.98 ± 1.16
ThrTrp: 1.323 ± 0.284
ThrTyr: 1.69 ± 0.35
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.376 ± 0.716
ValCys: 0.882 ± 0.276
ValAsp: 3.821 ± 0.649
ValGlu: 4.555 ± 0.509
ValPhe: 2.866 ± 0.572
ValGly: 6.098 ± 0.841
ValHis: 1.396 ± 0.388
ValIle: 3.453 ± 0.425
ValLys: 3.306 ± 0.646
ValLeu: 6.245 ± 0.593
ValMet: 1.029 ± 0.257
ValAsn: 3.086 ± 0.524
ValPro: 2.131 ± 0.432
ValGln: 3.233 ± 0.522
ValArg: 5.217 ± 0.546
ValSer: 4.188 ± 0.526
ValThr: 6.319 ± 0.928
ValVal: 5.437 ± 0.845
ValTrp: 1.323 ± 0.348
ValTyr: 2.425 ± 0.473
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.69 ± 0.339
TrpCys: 0.0 ± 0.0
TrpAsp: 0.808 ± 0.196
TrpGlu: 0.955 ± 0.235
TrpPhe: 0.882 ± 0.233
TrpGly: 1.323 ± 0.261
TrpHis: 0.367 ± 0.169
TrpIle: 0.882 ± 0.24
TrpLys: 0.441 ± 0.145
TrpLeu: 2.204 ± 0.502
TrpMet: 0.367 ± 0.174
TrpAsn: 0.588 ± 0.215
TrpPro: 1.102 ± 0.23
TrpGln: 0.735 ± 0.202
TrpArg: 1.396 ± 0.35
TrpSer: 1.102 ± 0.256
TrpThr: 1.396 ± 0.349
TrpVal: 1.176 ± 0.322
TrpTrp: 0.294 ± 0.186
TrpTyr: 0.441 ± 0.224
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.425 ± 0.528
TyrCys: 0.441 ± 0.191
TyrAsp: 1.396 ± 0.323
TyrGlu: 0.955 ± 0.269
TyrPhe: 0.441 ± 0.157
TyrGly: 2.351 ± 0.516
TyrHis: 0.22 ± 0.127
TyrIle: 0.588 ± 0.19
TyrLys: 1.176 ± 0.267
TyrLeu: 1.763 ± 0.341
TyrMet: 0.441 ± 0.173
TyrAsn: 0.735 ± 0.25
TyrPro: 1.249 ± 0.314
TyrGln: 0.588 ± 0.185
TyrArg: 1.249 ± 0.333
TyrSer: 1.102 ± 0.235
TyrThr: 1.543 ± 0.404
TyrVal: 1.102 ± 0.26
TyrTrp: 0.514 ± 0.218
TyrTyr: 1.029 ± 0.32
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (13611 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
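Protein-level bootstrapping means that whole proteins, not individual residues, are resampled with replacement before the statistic is recomputed. The following is a minimal sketch of that idea under stated assumptions: the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are all hypothetical choices, not the database's documented procedure.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_per_mille(proteins):
    """Per mille frequency of each overlapping residue pair (assumed normalization)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome has, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resample).get(pair, 0.0))
    return stdev(estimates)


toy_proteome = ["MAALG", "MKAAV", "MGLSA", "MAAAT"]  # hypothetical sequences
error_aa = bootstrap_error(toy_proteome, "AA")
print(error_aa)
```

Resampling whole proteins keeps residues from the same sequence together, so the estimated error reflects variation between proteins rather than between individual positions.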

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski