Amino acid dipepetide frequency for Erwinia amylovora phage Era103

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.758AlaAla: 8.758 ± 1.332
0.579AlaCys: 0.579 ± 0.212
5.718AlaAsp: 5.718 ± 0.662
6.08AlaGlu: 6.08 ± 0.677
3.112AlaPhe: 3.112 ± 0.491
8.613AlaGly: 8.613 ± 1.15
1.375AlaHis: 1.375 ± 0.291
4.922AlaIle: 4.922 ± 0.558
5.067AlaLys: 5.067 ± 0.729
6.514AlaLeu: 6.514 ± 0.942
3.836AlaMet: 3.836 ± 0.629
4.705AlaAsn: 4.705 ± 0.721
3.04AlaPro: 3.04 ± 0.581
4.053AlaGln: 4.053 ± 0.721
3.981AlaArg: 3.981 ± 0.582
7.093AlaSer: 7.093 ± 0.593
5.646AlaThr: 5.646 ± 0.952
6.225AlaVal: 6.225 ± 0.853
1.592AlaTrp: 1.592 ± 0.451
3.402AlaTyr: 3.402 ± 0.433
0.0AlaXaa: 0.0 ± 0.0
Cys
0.434CysAla: 0.434 ± 0.181
0.0CysCys: 0.0 ± 0.0
0.651CysAsp: 0.651 ± 0.225
0.507CysGlu: 0.507 ± 0.207
0.29CysPhe: 0.29 ± 0.132
0.579CysGly: 0.579 ± 0.212
0.145CysHis: 0.145 ± 0.089
0.362CysIle: 0.362 ± 0.136
0.651CysLys: 0.651 ± 0.245
0.579CysLeu: 0.579 ± 0.207
0.145CysMet: 0.145 ± 0.097
0.29CysAsn: 0.29 ± 0.182
0.579CysPro: 0.579 ± 0.213
0.145CysGln: 0.145 ± 0.103
0.507CysArg: 0.507 ± 0.182
0.507CysSer: 0.507 ± 0.233
0.217CysThr: 0.217 ± 0.13
0.651CysVal: 0.651 ± 0.242
0.072CysTrp: 0.072 ± 0.072
0.217CysTyr: 0.217 ± 0.127
0.0CysXaa: 0.0 ± 0.0
Asp
6.442AspAla: 6.442 ± 0.787
0.507AspCys: 0.507 ± 0.181
3.909AspAsp: 3.909 ± 0.626
3.547AspGlu: 3.547 ± 0.515
2.533AspPhe: 2.533 ± 0.408
5.573AspGly: 5.573 ± 0.726
0.796AspHis: 0.796 ± 0.21
3.691AspIle: 3.691 ± 0.505
3.764AspLys: 3.764 ± 0.474
4.705AspLeu: 4.705 ± 0.461
2.027AspMet: 2.027 ± 0.309
3.185AspAsn: 3.185 ± 0.425
2.461AspPro: 2.461 ± 0.534
1.448AspGln: 1.448 ± 0.368
2.533AspArg: 2.533 ± 0.582
2.968AspSer: 2.968 ± 0.487
4.27AspThr: 4.27 ± 0.532
4.56AspVal: 4.56 ± 0.508
0.796AspTrp: 0.796 ± 0.217
2.606AspTyr: 2.606 ± 0.485
0.0AspXaa: 0.0 ± 0.0
Glu
6.948GluAla: 6.948 ± 0.837
0.217GluCys: 0.217 ± 0.159
4.198GluAsp: 4.198 ± 0.551
4.27GluGlu: 4.27 ± 0.646
2.823GluPhe: 2.823 ± 0.416
5.501GluGly: 5.501 ± 0.545
1.448GluHis: 1.448 ± 0.271
3.185GluIle: 3.185 ± 0.551
2.75GluLys: 2.75 ± 0.432
5.284GluLeu: 5.284 ± 0.726
2.316GluMet: 2.316 ± 0.504
2.678GluAsn: 2.678 ± 0.575
1.737GluPro: 1.737 ± 0.392
3.619GluGln: 3.619 ± 0.576
3.112GluArg: 3.112 ± 0.533
2.968GluSer: 2.968 ± 0.463
2.678GluThr: 2.678 ± 0.46
4.415GluVal: 4.415 ± 0.509
1.448GluTrp: 1.448 ± 0.368
1.303GluTyr: 1.303 ± 0.298
0.0GluXaa: 0.0 ± 0.0
Phe
2.678PheAla: 2.678 ± 0.497
0.362PheCys: 0.362 ± 0.154
2.75PheAsp: 2.75 ± 0.578
1.809PheGlu: 1.809 ± 0.386
1.375PhePhe: 1.375 ± 0.342
2.389PheGly: 2.389 ± 0.445
0.796PheHis: 0.796 ± 0.244
1.158PheIle: 1.158 ± 0.282
2.171PheLys: 2.171 ± 0.38
2.75PheLeu: 2.75 ± 0.452
1.303PheMet: 1.303 ± 0.375
2.099PheAsn: 2.099 ± 0.329
1.448PhePro: 1.448 ± 0.331
1.303PheGln: 1.303 ± 0.28
2.099PheArg: 2.099 ± 0.264
1.665PheSer: 1.665 ± 0.388
2.027PheThr: 2.027 ± 0.469
2.244PheVal: 2.244 ± 0.446
0.217PheTrp: 0.217 ± 0.111
0.941PheTyr: 0.941 ± 0.24
0.0PheXaa: 0.0 ± 0.0
Gly
6.08GlyAla: 6.08 ± 0.952
0.796GlyCys: 0.796 ± 0.288
5.428GlyAsp: 5.428 ± 0.771
5.573GlyGlu: 5.573 ± 0.577
2.823GlyPhe: 2.823 ± 0.475
6.442GlyGly: 6.442 ± 0.925
2.099GlyHis: 2.099 ± 0.519
4.126GlyIle: 4.126 ± 0.627
7.021GlyLys: 7.021 ± 0.765
5.79GlyLeu: 5.79 ± 0.649
1.665GlyMet: 1.665 ± 0.358
3.257GlyAsn: 3.257 ± 0.527
2.316GlyPro: 2.316 ± 0.377
2.533GlyGln: 2.533 ± 0.347
4.343GlyArg: 4.343 ± 0.502
4.343GlySer: 4.343 ± 0.682
4.56GlyThr: 4.56 ± 0.69
4.56GlyVal: 4.56 ± 0.747
1.737GlyTrp: 1.737 ± 0.309
3.764GlyTyr: 3.764 ± 0.488
0.0GlyXaa: 0.0 ± 0.0
His
0.796HisAla: 0.796 ± 0.238
0.072HisCys: 0.072 ± 0.072
1.303HisAsp: 1.303 ± 0.29
1.303HisGlu: 1.303 ± 0.256
0.941HisPhe: 0.941 ± 0.266
1.375HisGly: 1.375 ± 0.262
0.651HisHis: 0.651 ± 0.185
1.375HisIle: 1.375 ± 0.403
0.941HisLys: 0.941 ± 0.283
2.678HisLeu: 2.678 ± 0.429
0.362HisMet: 0.362 ± 0.208
1.086HisAsn: 1.086 ± 0.273
0.941HisPro: 0.941 ± 0.256
0.869HisGln: 0.869 ± 0.245
1.086HisArg: 1.086 ± 0.364
0.651HisSer: 0.651 ± 0.251
1.086HisThr: 1.086 ± 0.346
1.737HisVal: 1.737 ± 0.386
0.29HisTrp: 0.29 ± 0.13
0.869HisTyr: 0.869 ± 0.278
0.0HisXaa: 0.0 ± 0.0
Ile
5.646IleAla: 5.646 ± 0.682
0.072IleCys: 0.072 ± 0.085
3.619IleAsp: 3.619 ± 0.539
2.678IleGlu: 2.678 ± 0.442
1.158IlePhe: 1.158 ± 0.341
3.474IleGly: 3.474 ± 0.521
1.448IleHis: 1.448 ± 0.318
2.244IleIle: 2.244 ± 0.58
2.533IleLys: 2.533 ± 0.445
3.329IleLeu: 3.329 ± 0.615
1.158IleMet: 1.158 ± 0.283
2.533IleAsn: 2.533 ± 0.424
1.665IlePro: 1.665 ± 0.299
2.823IleGln: 2.823 ± 0.517
4.126IleArg: 4.126 ± 0.465
3.329IleSer: 3.329 ± 0.33
2.895IleThr: 2.895 ± 0.37
3.474IleVal: 3.474 ± 0.471
0.579IleTrp: 0.579 ± 0.198
1.303IleTyr: 1.303 ± 0.379
0.0IleXaa: 0.0 ± 0.0
Lys
5.79LysAla: 5.79 ± 0.786
0.362LysCys: 0.362 ± 0.158
3.257LysAsp: 3.257 ± 0.618
3.547LysGlu: 3.547 ± 0.547
1.809LysPhe: 1.809 ± 0.357
4.849LysGly: 4.849 ± 0.674
1.158LysHis: 1.158 ± 0.453
2.533LysIle: 2.533 ± 0.431
2.75LysLys: 2.75 ± 0.758
5.79LysLeu: 5.79 ± 0.684
1.158LysMet: 1.158 ± 0.301
2.316LysAsn: 2.316 ± 0.417
2.533LysPro: 2.533 ± 0.386
3.112LysGln: 3.112 ± 0.549
2.968LysArg: 2.968 ± 0.401
3.112LysSer: 3.112 ± 0.473
3.04LysThr: 3.04 ± 0.406
5.428LysVal: 5.428 ± 0.708
1.013LysTrp: 1.013 ± 0.307
2.244LysTyr: 2.244 ± 0.442
0.0LysXaa: 0.0 ± 0.0
Leu
7.962LeuAla: 7.962 ± 0.79
1.158LeuCys: 1.158 ± 0.344
5.139LeuAsp: 5.139 ± 0.563
5.284LeuGlu: 5.284 ± 0.561
1.809LeuPhe: 1.809 ± 0.382
4.994LeuGly: 4.994 ± 0.651
2.244LeuHis: 2.244 ± 0.324
4.27LeuIle: 4.27 ± 0.642
3.909LeuLys: 3.909 ± 0.562
4.994LeuLeu: 4.994 ± 0.578
2.895LeuMet: 2.895 ± 0.41
4.27LeuAsn: 4.27 ± 0.545
3.619LeuPro: 3.619 ± 0.624
3.257LeuGln: 3.257 ± 0.462
4.415LeuArg: 4.415 ± 0.571
6.297LeuSer: 6.297 ± 0.73
4.777LeuThr: 4.777 ± 0.59
5.863LeuVal: 5.863 ± 0.705
0.362LeuTrp: 0.362 ± 0.183
2.533LeuTyr: 2.533 ± 0.408
0.0LeuXaa: 0.0 ± 0.0
Met
3.836MetAla: 3.836 ± 0.608
0.0MetCys: 0.0 ± 0.0
1.52MetAsp: 1.52 ± 0.287
1.809MetGlu: 1.809 ± 0.295
0.651MetPhe: 0.651 ± 0.239
2.316MetGly: 2.316 ± 0.569
0.145MetHis: 0.145 ± 0.115
1.158MetIle: 1.158 ± 0.26
1.737MetLys: 1.737 ± 0.474
3.185MetLeu: 3.185 ± 0.456
1.086MetMet: 1.086 ± 0.296
1.303MetAsn: 1.303 ± 0.295
1.375MetPro: 1.375 ± 0.285
1.52MetGln: 1.52 ± 0.337
2.027MetArg: 2.027 ± 0.508
1.665MetSer: 1.665 ± 0.313
2.099MetThr: 2.099 ± 0.456
1.809MetVal: 1.809 ± 0.311
0.217MetTrp: 0.217 ± 0.145
0.941MetTyr: 0.941 ± 0.253
0.0MetXaa: 0.0 ± 0.0
Asn
3.474AsnAla: 3.474 ± 0.525
0.29AsnCys: 0.29 ± 0.143
2.244AsnAsp: 2.244 ± 0.401
2.895AsnGlu: 2.895 ± 0.507
1.52AsnPhe: 1.52 ± 0.331
3.764AsnGly: 3.764 ± 0.598
0.724AsnHis: 0.724 ± 0.283
2.027AsnIle: 2.027 ± 0.346
3.185AsnLys: 3.185 ± 0.444
4.705AsnLeu: 4.705 ± 0.716
1.809AsnMet: 1.809 ± 0.478
2.461AsnAsn: 2.461 ± 0.409
2.75AsnPro: 2.75 ± 0.465
1.737AsnGln: 1.737 ± 0.278
2.895AsnArg: 2.895 ± 0.529
2.244AsnSer: 2.244 ± 0.385
3.402AsnThr: 3.402 ± 0.702
2.968AsnVal: 2.968 ± 0.484
0.434AsnTrp: 0.434 ± 0.201
1.665AsnTyr: 1.665 ± 0.353
0.0AsnXaa: 0.0 ± 0.0
Pro
3.547ProAla: 3.547 ± 0.667
0.362ProCys: 0.362 ± 0.155
2.606ProAsp: 2.606 ± 0.645
3.764ProGlu: 3.764 ± 0.439
1.448ProPhe: 1.448 ± 0.297
2.895ProGly: 2.895 ± 0.565
1.013ProHis: 1.013 ± 0.352
1.665ProIle: 1.665 ± 0.487
1.448ProLys: 1.448 ± 0.344
2.968ProLeu: 2.968 ± 0.553
0.362ProMet: 0.362 ± 0.146
1.592ProAsn: 1.592 ± 0.405
1.013ProPro: 1.013 ± 0.291
1.448ProGln: 1.448 ± 0.294
1.665ProArg: 1.665 ± 0.293
2.606ProSer: 2.606 ± 0.607
2.678ProThr: 2.678 ± 0.437
3.981ProVal: 3.981 ± 0.542
0.724ProTrp: 0.724 ± 0.19
1.375ProTyr: 1.375 ± 0.331
0.0ProXaa: 0.0 ± 0.0
Gln
6.152GlnAla: 6.152 ± 0.886
0.362GlnCys: 0.362 ± 0.201
2.533GlnAsp: 2.533 ± 0.465
3.329GlnGlu: 3.329 ± 0.379
1.448GlnPhe: 1.448 ± 0.337
3.112GlnGly: 3.112 ± 0.543
0.796GlnHis: 0.796 ± 0.269
2.027GlnIle: 2.027 ± 0.331
2.027GlnLys: 2.027 ± 0.41
2.968GlnLeu: 2.968 ± 0.512
1.737GlnMet: 1.737 ± 0.346
1.448GlnAsn: 1.448 ± 0.337
1.448GlnPro: 1.448 ± 0.396
2.606GlnGln: 2.606 ± 0.526
2.389GlnArg: 2.389 ± 0.449
2.606GlnSer: 2.606 ± 0.448
1.882GlnThr: 1.882 ± 0.372
2.027GlnVal: 2.027 ± 0.308
0.434GlnTrp: 0.434 ± 0.177
1.592GlnTyr: 1.592 ± 0.407
0.0GlnXaa: 0.0 ± 0.0
Arg
4.777ArgAla: 4.777 ± 0.653
0.507ArgCys: 0.507 ± 0.196
3.329ArgAsp: 3.329 ± 0.454
2.968ArgGlu: 2.968 ± 0.496
2.099ArgPhe: 2.099 ± 0.298
3.909ArgGly: 3.909 ± 0.611
1.23ArgHis: 1.23 ± 0.292
3.185ArgIle: 3.185 ± 0.526
3.619ArgLys: 3.619 ± 0.568
4.415ArgLeu: 4.415 ± 0.553
2.823ArgMet: 2.823 ± 0.416
2.75ArgAsn: 2.75 ± 0.431
1.954ArgPro: 1.954 ± 0.329
2.171ArgGln: 2.171 ± 0.411
3.04ArgArg: 3.04 ± 0.468
2.461ArgSer: 2.461 ± 0.448
2.606ArgThr: 2.606 ± 0.37
3.691ArgVal: 3.691 ± 0.493
1.086ArgTrp: 1.086 ± 0.238
1.737ArgTyr: 1.737 ± 0.32
0.0ArgXaa: 0.0 ± 0.0
Ser
4.994SerAla: 4.994 ± 0.581
0.362SerCys: 0.362 ± 0.164
3.04SerAsp: 3.04 ± 0.46
3.257SerGlu: 3.257 ± 0.475
2.606SerPhe: 2.606 ± 0.452
4.849SerGly: 4.849 ± 0.697
1.013SerHis: 1.013 ± 0.237
3.909SerIle: 3.909 ± 0.663
4.053SerLys: 4.053 ± 0.649
4.705SerLeu: 4.705 ± 0.733
1.448SerMet: 1.448 ± 0.344
2.968SerAsn: 2.968 ± 0.706
2.895SerPro: 2.895 ± 0.473
2.461SerGln: 2.461 ± 0.446
3.257SerArg: 3.257 ± 0.608
4.27SerSer: 4.27 ± 0.632
3.402SerThr: 3.402 ± 0.422
4.053SerVal: 4.053 ± 0.561
0.724SerTrp: 0.724 ± 0.223
2.244SerTyr: 2.244 ± 0.532
0.0SerXaa: 0.0 ± 0.0
Thr
5.935ThrAla: 5.935 ± 0.786
0.362ThrCys: 0.362 ± 0.131
3.402ThrAsp: 3.402 ± 0.455
3.909ThrGlu: 3.909 ± 0.583
1.665ThrPhe: 1.665 ± 0.371
4.705ThrGly: 4.705 ± 0.776
1.23ThrHis: 1.23 ± 0.3
2.823ThrIle: 2.823 ± 0.423
2.823ThrLys: 2.823 ± 0.35
5.573ThrLeu: 5.573 ± 0.658
1.23ThrMet: 1.23 ± 0.293
2.389ThrAsn: 2.389 ± 0.499
2.461ThrPro: 2.461 ± 0.555
2.823ThrGln: 2.823 ± 0.454
3.112ThrArg: 3.112 ± 0.498
3.329ThrSer: 3.329 ± 0.647
3.04ThrThr: 3.04 ± 0.475
4.343ThrVal: 4.343 ± 0.586
1.303ThrTrp: 1.303 ± 0.342
1.737ThrTyr: 1.737 ± 0.467
0.0ThrXaa: 0.0 ± 0.0
Val
6.369ValAla: 6.369 ± 0.927
0.434ValCys: 0.434 ± 0.188
4.56ValAsp: 4.56 ± 0.659
4.126ValGlu: 4.126 ± 0.627
2.099ValPhe: 2.099 ± 0.369
5.935ValGly: 5.935 ± 0.771
1.23ValHis: 1.23 ± 0.436
2.895ValIle: 2.895 ± 0.533
4.777ValLys: 4.777 ± 0.659
5.067ValLeu: 5.067 ± 0.656
1.954ValMet: 1.954 ± 0.307
2.895ValAsn: 2.895 ± 0.439
2.533ValPro: 2.533 ± 0.423
2.895ValGln: 2.895 ± 0.4
4.126ValArg: 4.126 ± 0.46
5.356ValSer: 5.356 ± 0.842
4.777ValThr: 4.777 ± 0.695
4.415ValVal: 4.415 ± 0.622
1.158ValTrp: 1.158 ± 0.309
2.099ValTyr: 2.099 ± 0.523
0.0ValXaa: 0.0 ± 0.0
Trp
0.941TrpAla: 0.941 ± 0.25
0.145TrpCys: 0.145 ± 0.09
0.941TrpAsp: 0.941 ± 0.357
0.724TrpGlu: 0.724 ± 0.207
0.579TrpPhe: 0.579 ± 0.25
0.651TrpGly: 0.651 ± 0.255
0.145TrpHis: 0.145 ± 0.106
0.507TrpIle: 0.507 ± 0.2
1.013TrpLys: 1.013 ± 0.304
1.303TrpLeu: 1.303 ± 0.29
0.217TrpMet: 0.217 ± 0.107
1.23TrpAsn: 1.23 ± 0.293
1.013TrpPro: 1.013 ± 0.274
0.724TrpGln: 0.724 ± 0.371
0.869TrpArg: 0.869 ± 0.301
1.158TrpSer: 1.158 ± 0.283
0.869TrpThr: 0.869 ± 0.252
1.23TrpVal: 1.23 ± 0.28
0.145TrpTrp: 0.145 ± 0.108
0.29TrpTyr: 0.29 ± 0.116
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.823TyrAla: 2.823 ± 0.325
0.579TyrCys: 0.579 ± 0.218
2.171TyrAsp: 2.171 ± 0.401
1.52TyrGlu: 1.52 ± 0.368
0.796TyrPhe: 0.796 ± 0.266
3.329TyrGly: 3.329 ± 0.368
0.724TyrHis: 0.724 ± 0.173
2.027TyrIle: 2.027 ± 0.31
2.533TyrLys: 2.533 ± 0.545
2.75TyrLeu: 2.75 ± 0.524
0.796TyrMet: 0.796 ± 0.266
1.737TyrAsn: 1.737 ± 0.379
1.375TyrPro: 1.375 ± 0.344
1.375TyrGln: 1.375 ± 0.227
1.809TyrArg: 1.809 ± 0.367
1.882TyrSer: 1.882 ± 0.338
2.244TyrThr: 2.244 ± 0.409
2.027TyrVal: 2.027 ± 0.486
0.362TyrTrp: 0.362 ± 0.204
0.796TyrTyr: 0.796 ± 0.307
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (13817 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski