Amino acid dipepetide frequency for Escherichia phage forsur

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.14AlaAla: 14.14 ± 2.232
1.376AlaCys: 1.376 ± 0.472
5.503AlaAsp: 5.503 ± 0.63
6.573AlaGlu: 6.573 ± 0.764
3.287AlaPhe: 3.287 ± 0.575
7.72AlaGly: 7.72 ± 0.987
1.682AlaHis: 1.682 ± 0.306
4.204AlaIle: 4.204 ± 0.492
5.198AlaLys: 5.198 ± 0.672
9.478AlaLeu: 9.478 ± 1.484
3.516AlaMet: 3.516 ± 0.579
4.051AlaAsn: 4.051 ± 0.573
4.051AlaPro: 4.051 ± 0.84
4.815AlaGln: 4.815 ± 0.8
5.656AlaArg: 5.656 ± 0.831
6.191AlaSer: 6.191 ± 0.815
5.503AlaThr: 5.503 ± 0.752
6.879AlaVal: 6.879 ± 0.814
1.376AlaTrp: 1.376 ± 0.29
4.51AlaTyr: 4.51 ± 0.636
0.0AlaXaa: 0.0 ± 0.0
Cys
1.299CysAla: 1.299 ± 0.715
0.535CysCys: 0.535 ± 0.417
0.535CysAsp: 0.535 ± 0.242
0.382CysGlu: 0.382 ± 0.177
0.306CysPhe: 0.306 ± 0.144
0.841CysGly: 0.841 ± 0.37
0.382CysHis: 0.382 ± 0.207
0.459CysIle: 0.459 ± 0.188
0.688CysLys: 0.688 ± 0.219
0.688CysLeu: 0.688 ± 0.213
0.382CysMet: 0.382 ± 0.18
0.382CysAsn: 0.382 ± 0.192
0.611CysPro: 0.611 ± 0.188
0.229CysGln: 0.229 ± 0.148
0.764CysArg: 0.764 ± 0.32
0.611CysSer: 0.611 ± 0.19
0.229CysThr: 0.229 ± 0.111
1.147CysVal: 1.147 ± 0.33
0.382CysTrp: 0.382 ± 0.207
0.459CysTyr: 0.459 ± 0.164
0.0CysXaa: 0.0 ± 0.0
Asp
6.497AspAla: 6.497 ± 0.665
0.306AspCys: 0.306 ± 0.176
4.663AspAsp: 4.663 ± 0.683
3.975AspGlu: 3.975 ± 0.543
1.758AspPhe: 1.758 ± 0.447
6.038AspGly: 6.038 ± 0.895
0.764AspHis: 0.764 ± 0.246
3.516AspIle: 3.516 ± 0.483
2.981AspLys: 2.981 ± 0.423
4.204AspLeu: 4.204 ± 0.475
1.299AspMet: 1.299 ± 0.364
2.905AspAsn: 2.905 ± 0.537
2.752AspPro: 2.752 ± 0.554
1.07AspGln: 1.07 ± 0.376
2.981AspArg: 2.981 ± 0.408
3.822AspSer: 3.822 ± 0.628
3.287AspThr: 3.287 ± 0.422
3.363AspVal: 3.363 ± 0.661
1.299AspTrp: 1.299 ± 0.363
2.599AspTyr: 2.599 ± 0.426
0.0AspXaa: 0.0 ± 0.0
Glu
6.65GluAla: 6.65 ± 0.951
0.841GluCys: 0.841 ± 0.308
3.669GluAsp: 3.669 ± 0.616
3.592GluGlu: 3.592 ± 0.621
2.752GluPhe: 2.752 ± 0.48
3.975GluGly: 3.975 ± 0.515
1.147GluHis: 1.147 ± 0.3
1.911GluIle: 1.911 ± 0.355
2.522GluLys: 2.522 ± 0.471
5.045GluLeu: 5.045 ± 0.696
1.911GluMet: 1.911 ± 0.382
2.293GluAsn: 2.293 ± 0.459
2.14GluPro: 2.14 ± 0.439
3.287GluGln: 3.287 ± 0.443
3.516GluArg: 3.516 ± 0.644
3.134GluSer: 3.134 ± 0.485
3.822GluThr: 3.822 ± 0.57
4.586GluVal: 4.586 ± 0.719
0.917GluTrp: 0.917 ± 0.24
2.522GluTyr: 2.522 ± 0.442
0.0GluXaa: 0.0 ± 0.0
Phe
2.369PheAla: 2.369 ± 0.522
0.688PheCys: 0.688 ± 0.22
2.905PheAsp: 2.905 ± 0.467
1.911PheGlu: 1.911 ± 0.418
0.994PhePhe: 0.994 ± 0.255
2.905PheGly: 2.905 ± 0.395
0.535PheHis: 0.535 ± 0.215
2.522PheIle: 2.522 ± 0.44
1.834PheLys: 1.834 ± 0.487
1.987PheLeu: 1.987 ± 0.345
1.223PheMet: 1.223 ± 0.242
1.987PheAsn: 1.987 ± 0.441
1.758PhePro: 1.758 ± 0.327
1.07PheGln: 1.07 ± 0.3
1.682PheArg: 1.682 ± 0.437
1.376PheSer: 1.376 ± 0.453
2.064PheThr: 2.064 ± 0.507
2.599PheVal: 2.599 ± 0.371
0.535PheTrp: 0.535 ± 0.166
0.841PheTyr: 0.841 ± 0.278
0.0PheXaa: 0.0 ± 0.0
Gly
7.796GlyAla: 7.796 ± 0.877
1.147GlyCys: 1.147 ± 0.285
4.586GlyAsp: 4.586 ± 0.583
4.968GlyGlu: 4.968 ± 0.71
1.758GlyPhe: 1.758 ± 0.333
6.573GlyGly: 6.573 ± 1.001
1.299GlyHis: 1.299 ± 0.433
4.51GlyIle: 4.51 ± 0.593
3.975GlyLys: 3.975 ± 0.695
6.421GlyLeu: 6.421 ± 0.678
2.064GlyMet: 2.064 ± 0.307
2.905GlyAsn: 2.905 ± 0.46
1.529GlyPro: 1.529 ± 0.257
3.822GlyGln: 3.822 ± 0.748
5.198GlyArg: 5.198 ± 0.534
4.586GlySer: 4.586 ± 0.769
5.58GlyThr: 5.58 ± 1.066
5.198GlyVal: 5.198 ± 0.672
1.07GlyTrp: 1.07 ± 0.238
3.21GlyTyr: 3.21 ± 0.365
0.0GlyXaa: 0.0 ± 0.0
His
2.14HisAla: 2.14 ± 0.4
0.229HisCys: 0.229 ± 0.126
1.147HisAsp: 1.147 ± 0.318
1.376HisGlu: 1.376 ± 0.331
0.611HisPhe: 0.611 ± 0.208
1.529HisGly: 1.529 ± 0.364
0.688HisHis: 0.688 ± 0.283
0.764HisIle: 0.764 ± 0.268
1.07HisLys: 1.07 ± 0.317
1.682HisLeu: 1.682 ± 0.461
0.917HisMet: 0.917 ± 0.29
0.688HisAsn: 0.688 ± 0.226
0.688HisPro: 0.688 ± 0.234
0.535HisGln: 0.535 ± 0.188
0.841HisArg: 0.841 ± 0.297
0.917HisSer: 0.917 ± 0.254
0.841HisThr: 0.841 ± 0.23
1.376HisVal: 1.376 ± 0.423
0.306HisTrp: 0.306 ± 0.158
0.688HisTyr: 0.688 ± 0.182
0.0HisXaa: 0.0 ± 0.0
Ile
3.745IleAla: 3.745 ± 0.469
0.459IleCys: 0.459 ± 0.214
3.363IleAsp: 3.363 ± 0.524
3.516IleGlu: 3.516 ± 0.431
1.223IlePhe: 1.223 ± 0.328
3.21IleGly: 3.21 ± 0.532
0.994IleHis: 0.994 ± 0.235
2.446IleIle: 2.446 ± 0.485
1.299IleLys: 1.299 ± 0.329
2.752IleLeu: 2.752 ± 0.45
1.758IleMet: 1.758 ± 0.311
2.446IleAsn: 2.446 ± 0.389
2.828IlePro: 2.828 ± 0.44
2.522IleGln: 2.522 ± 0.347
3.134IleArg: 3.134 ± 0.499
2.064IleSer: 2.064 ± 0.356
2.675IleThr: 2.675 ± 0.534
3.669IleVal: 3.669 ± 0.492
0.306IleTrp: 0.306 ± 0.158
1.529IleTyr: 1.529 ± 0.341
0.0IleXaa: 0.0 ± 0.0
Lys
5.733LysAla: 5.733 ± 0.902
0.535LysCys: 0.535 ± 0.213
2.981LysAsp: 2.981 ± 0.619
2.369LysGlu: 2.369 ± 0.41
1.834LysPhe: 1.834 ± 0.383
2.599LysGly: 2.599 ± 0.504
0.917LysHis: 0.917 ± 0.309
0.535LysIle: 0.535 ± 0.227
1.758LysLys: 1.758 ± 0.415
4.739LysLeu: 4.739 ± 0.777
0.994LysMet: 0.994 ± 0.228
0.917LysAsn: 0.917 ± 0.301
1.834LysPro: 1.834 ± 0.336
2.446LysGln: 2.446 ± 0.474
2.675LysArg: 2.675 ± 0.437
3.057LysSer: 3.057 ± 0.636
2.522LysThr: 2.522 ± 0.398
3.287LysVal: 3.287 ± 0.565
0.688LysTrp: 0.688 ± 0.238
1.147LysTyr: 1.147 ± 0.316
0.0LysXaa: 0.0 ± 0.0
Leu
9.096LeuAla: 9.096 ± 0.656
0.764LeuCys: 0.764 ± 0.302
5.274LeuAsp: 5.274 ± 0.645
5.274LeuGlu: 5.274 ± 0.581
3.822LeuPhe: 3.822 ± 0.585
5.656LeuGly: 5.656 ± 0.669
1.911LeuHis: 1.911 ± 0.398
2.522LeuIle: 2.522 ± 0.441
2.675LeuLys: 2.675 ± 0.488
6.879LeuLeu: 6.879 ± 0.72
2.064LeuMet: 2.064 ± 0.41
3.363LeuAsn: 3.363 ± 0.518
3.975LeuPro: 3.975 ± 0.785
4.28LeuGln: 4.28 ± 0.419
4.663LeuArg: 4.663 ± 0.659
5.35LeuSer: 5.35 ± 0.737
5.656LeuThr: 5.656 ± 0.782
5.121LeuVal: 5.121 ± 0.526
1.452LeuTrp: 1.452 ± 0.416
2.522LeuTyr: 2.522 ± 0.486
0.0LeuXaa: 0.0 ± 0.0
Met
3.745MetAla: 3.745 ± 0.486
0.076MetCys: 0.076 ± 0.087
1.147MetAsp: 1.147 ± 0.356
0.764MetGlu: 0.764 ± 0.198
0.611MetPhe: 0.611 ± 0.194
2.064MetGly: 2.064 ± 0.447
0.611MetHis: 0.611 ± 0.171
0.841MetIle: 0.841 ± 0.3
1.529MetLys: 1.529 ± 0.419
2.905MetLeu: 2.905 ± 0.452
1.376MetMet: 1.376 ± 0.49
1.299MetAsn: 1.299 ± 0.226
1.147MetPro: 1.147 ± 0.326
1.987MetGln: 1.987 ± 0.471
1.911MetArg: 1.911 ± 0.486
1.605MetSer: 1.605 ± 0.297
2.064MetThr: 2.064 ± 0.395
1.834MetVal: 1.834 ± 0.383
0.535MetTrp: 0.535 ± 0.279
1.07MetTyr: 1.07 ± 0.229
0.0MetXaa: 0.0 ± 0.0
Asn
3.669AsnAla: 3.669 ± 0.607
0.535AsnCys: 0.535 ± 0.216
1.758AsnAsp: 1.758 ± 0.387
2.446AsnGlu: 2.446 ± 0.332
1.529AsnPhe: 1.529 ± 0.323
3.363AsnGly: 3.363 ± 0.554
1.223AsnHis: 1.223 ± 0.308
2.828AsnIle: 2.828 ± 0.427
2.217AsnLys: 2.217 ± 0.312
3.21AsnLeu: 3.21 ± 0.498
1.452AsnMet: 1.452 ± 0.372
1.682AsnAsn: 1.682 ± 0.447
2.293AsnPro: 2.293 ± 0.651
1.605AsnGln: 1.605 ± 0.425
2.217AsnArg: 2.217 ± 0.326
2.599AsnSer: 2.599 ± 0.639
2.599AsnThr: 2.599 ± 0.574
3.745AsnVal: 3.745 ± 0.787
0.459AsnTrp: 0.459 ± 0.174
0.535AsnTyr: 0.535 ± 0.188
0.0AsnXaa: 0.0 ± 0.0
Pro
3.822ProAla: 3.822 ± 0.397
0.382ProCys: 0.382 ± 0.182
3.21ProAsp: 3.21 ± 0.485
3.363ProGlu: 3.363 ± 0.555
1.299ProPhe: 1.299 ± 0.33
3.134ProGly: 3.134 ± 0.417
0.764ProHis: 0.764 ± 0.263
1.911ProIle: 1.911 ± 0.326
1.223ProLys: 1.223 ± 0.335
2.446ProLeu: 2.446 ± 0.557
0.611ProMet: 0.611 ± 0.22
2.675ProAsn: 2.675 ± 0.654
1.605ProPro: 1.605 ± 0.532
2.599ProGln: 2.599 ± 1.211
1.911ProArg: 1.911 ± 0.432
2.599ProSer: 2.599 ± 0.459
2.981ProThr: 2.981 ± 0.527
3.134ProVal: 3.134 ± 0.586
0.688ProTrp: 0.688 ± 0.201
1.834ProTyr: 1.834 ± 0.424
0.0ProXaa: 0.0 ± 0.0
Gln
5.58GlnAla: 5.58 ± 0.94
0.076GlnCys: 0.076 ± 0.072
2.217GlnAsp: 2.217 ± 0.498
2.217GlnGlu: 2.217 ± 0.482
2.599GlnPhe: 2.599 ± 0.483
3.21GlnGly: 3.21 ± 0.614
0.841GlnHis: 0.841 ± 0.227
2.217GlnIle: 2.217 ± 0.458
1.682GlnLys: 1.682 ± 0.403
3.516GlnLeu: 3.516 ± 0.557
1.376GlnMet: 1.376 ± 0.282
1.758GlnAsn: 1.758 ± 0.463
2.369GlnPro: 2.369 ± 1.049
3.898GlnGln: 3.898 ± 1.473
3.975GlnArg: 3.975 ± 0.579
1.987GlnSer: 1.987 ± 0.388
2.828GlnThr: 2.828 ± 0.585
3.287GlnVal: 3.287 ± 0.536
0.611GlnTrp: 0.611 ± 0.187
1.834GlnTyr: 1.834 ± 0.333
0.0GlnXaa: 0.0 ± 0.0
Arg
5.886ArgAla: 5.886 ± 0.968
0.764ArgCys: 0.764 ± 0.237
3.592ArgAsp: 3.592 ± 0.561
3.975ArgGlu: 3.975 ± 0.713
1.834ArgPhe: 1.834 ± 0.39
4.815ArgGly: 4.815 ± 0.538
1.07ArgHis: 1.07 ± 0.306
3.057ArgIle: 3.057 ± 0.48
2.599ArgLys: 2.599 ± 0.486
4.663ArgLeu: 4.663 ± 0.536
1.834ArgMet: 1.834 ± 0.325
1.834ArgAsn: 1.834 ± 0.456
1.758ArgPro: 1.758 ± 0.496
2.369ArgGln: 2.369 ± 0.403
3.898ArgArg: 3.898 ± 0.844
2.675ArgSer: 2.675 ± 0.476
3.363ArgThr: 3.363 ± 0.556
4.357ArgVal: 4.357 ± 0.721
0.841ArgTrp: 0.841 ± 0.267
2.293ArgTyr: 2.293 ± 0.439
0.0ArgXaa: 0.0 ± 0.0
Ser
5.274SerAla: 5.274 ± 0.614
0.229SerCys: 0.229 ± 0.189
2.828SerAsp: 2.828 ± 0.46
2.752SerGlu: 2.752 ± 0.544
2.446SerPhe: 2.446 ± 0.481
5.656SerGly: 5.656 ± 0.517
0.764SerHis: 0.764 ± 0.24
3.057SerIle: 3.057 ± 0.33
2.446SerLys: 2.446 ± 0.448
4.586SerLeu: 4.586 ± 0.638
1.605SerMet: 1.605 ± 0.357
2.675SerAsn: 2.675 ± 0.516
3.363SerPro: 3.363 ± 0.537
2.752SerGln: 2.752 ± 0.54
2.905SerArg: 2.905 ± 0.518
2.599SerSer: 2.599 ± 0.649
3.134SerThr: 3.134 ± 0.847
4.127SerVal: 4.127 ± 0.62
0.841SerTrp: 0.841 ± 0.243
2.293SerTyr: 2.293 ± 0.488
0.0SerXaa: 0.0 ± 0.0
Thr
7.567ThrAla: 7.567 ± 0.81
0.535ThrCys: 0.535 ± 0.211
3.21ThrAsp: 3.21 ± 0.755
2.828ThrGlu: 2.828 ± 0.497
1.682ThrPhe: 1.682 ± 0.427
5.656ThrGly: 5.656 ± 0.86
0.841ThrHis: 0.841 ± 0.235
2.905ThrIle: 2.905 ± 0.435
2.905ThrLys: 2.905 ± 0.467
5.886ThrLeu: 5.886 ± 0.703
1.299ThrMet: 1.299 ± 0.334
2.599ThrAsn: 2.599 ± 0.6
2.599ThrPro: 2.599 ± 0.514
2.522ThrGln: 2.522 ± 0.512
2.752ThrArg: 2.752 ± 0.322
3.44ThrSer: 3.44 ± 0.667
4.051ThrThr: 4.051 ± 0.936
5.045ThrVal: 5.045 ± 0.988
0.535ThrTrp: 0.535 ± 0.211
2.522ThrTyr: 2.522 ± 0.562
0.0ThrXaa: 0.0 ± 0.0
Val
6.497ValAla: 6.497 ± 0.638
0.994ValCys: 0.994 ± 0.331
4.204ValAsp: 4.204 ± 0.522
5.121ValGlu: 5.121 ± 0.594
1.834ValPhe: 1.834 ± 0.431
5.198ValGly: 5.198 ± 0.469
1.452ValHis: 1.452 ± 0.351
3.669ValIle: 3.669 ± 0.565
3.363ValLys: 3.363 ± 0.621
5.503ValLeu: 5.503 ± 0.535
1.758ValMet: 1.758 ± 0.353
3.21ValAsn: 3.21 ± 0.54
2.981ValPro: 2.981 ± 0.508
3.745ValGln: 3.745 ± 0.579
3.822ValArg: 3.822 ± 0.821
4.051ValSer: 4.051 ± 0.578
5.427ValThr: 5.427 ± 0.802
5.427ValVal: 5.427 ± 0.697
0.994ValTrp: 0.994 ± 0.278
2.599ValTyr: 2.599 ± 0.442
0.0ValXaa: 0.0 ± 0.0
Trp
0.841TrpAla: 0.841 ± 0.261
0.459TrpCys: 0.459 ± 0.196
1.07TrpAsp: 1.07 ± 0.21
0.764TrpGlu: 0.764 ± 0.221
0.917TrpPhe: 0.917 ± 0.273
0.841TrpGly: 0.841 ± 0.323
0.229TrpHis: 0.229 ± 0.119
0.459TrpIle: 0.459 ± 0.152
0.153TrpLys: 0.153 ± 0.105
2.217TrpLeu: 2.217 ± 0.424
0.382TrpMet: 0.382 ± 0.187
0.382TrpAsn: 0.382 ± 0.146
0.535TrpPro: 0.535 ± 0.212
0.688TrpGln: 0.688 ± 0.185
0.688TrpArg: 0.688 ± 0.21
0.994TrpSer: 0.994 ± 0.245
0.688TrpThr: 0.688 ± 0.224
0.994TrpVal: 0.994 ± 0.269
0.611TrpTrp: 0.611 ± 0.284
0.611TrpTyr: 0.611 ± 0.237
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.363TyrAla: 3.363 ± 0.558
0.382TyrCys: 0.382 ± 0.196
2.217TyrAsp: 2.217 ± 0.444
2.064TyrGlu: 2.064 ± 0.489
0.688TyrPhe: 0.688 ± 0.232
3.21TyrGly: 3.21 ± 0.569
0.917TyrHis: 0.917 ± 0.263
1.605TyrIle: 1.605 ± 0.418
1.452TyrLys: 1.452 ± 0.364
3.516TyrLeu: 3.516 ± 0.606
1.299TyrMet: 1.299 ± 0.28
1.911TyrAsn: 1.911 ± 0.321
1.376TyrPro: 1.376 ± 0.244
1.834TyrGln: 1.834 ± 0.362
2.293TyrArg: 2.293 ± 0.548
2.675TyrSer: 2.675 ± 0.473
2.064TyrThr: 2.064 ± 0.631
2.675TyrVal: 2.675 ± 0.546
0.076TyrTrp: 0.076 ± 0.075
1.299TyrTyr: 1.299 ± 0.34
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (13084 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski