Amino acid dipepetide frequency for Escherichia virus P2_4A7b

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.207AlaAla: 10.207 ± 2.252
1.178AlaCys: 1.178 ± 0.482
5.496AlaAsp: 5.496 ± 0.853
5.104AlaGlu: 5.104 ± 0.882
3.73AlaPhe: 3.73 ± 0.669
7.95AlaGly: 7.95 ± 1.161
2.061AlaHis: 2.061 ± 0.395
4.22AlaIle: 4.22 ± 0.548
4.907AlaLys: 4.907 ± 0.628
9.029AlaLeu: 9.029 ± 1.349
2.355AlaMet: 2.355 ± 0.448
1.963AlaAsn: 1.963 ± 0.353
4.024AlaPro: 4.024 ± 0.583
3.631AlaGln: 3.631 ± 0.674
4.809AlaArg: 4.809 ± 0.932
7.852AlaSer: 7.852 ± 0.874
5.889AlaThr: 5.889 ± 0.968
6.772AlaVal: 6.772 ± 0.782
1.865AlaTrp: 1.865 ± 0.365
2.65AlaTyr: 2.65 ± 0.508
0.0AlaXaa: 0.0 ± 0.0
Cys
0.687CysAla: 0.687 ± 0.259
0.0CysCys: 0.0 ± 0.0
0.785CysAsp: 0.785 ± 0.239
0.393CysGlu: 0.393 ± 0.22
0.393CysPhe: 0.393 ± 0.207
0.589CysGly: 0.589 ± 0.248
0.098CysHis: 0.098 ± 0.102
0.491CysIle: 0.491 ± 0.176
0.393CysLys: 0.393 ± 0.175
0.785CysLeu: 0.785 ± 0.296
0.294CysMet: 0.294 ± 0.168
0.294CysAsn: 0.294 ± 0.16
0.393CysPro: 0.393 ± 0.187
0.785CysGln: 0.785 ± 0.366
0.785CysArg: 0.785 ± 0.252
0.491CysSer: 0.491 ± 0.238
0.981CysThr: 0.981 ± 0.257
0.491CysVal: 0.491 ± 0.205
0.098CysTrp: 0.098 ± 0.101
0.393CysTyr: 0.393 ± 0.191
0.0CysXaa: 0.0 ± 0.0
Asp
6.478AspAla: 6.478 ± 0.808
0.491AspCys: 0.491 ± 0.213
2.846AspAsp: 2.846 ± 0.577
3.73AspGlu: 3.73 ± 0.806
2.846AspPhe: 2.846 ± 0.684
4.809AspGly: 4.809 ± 0.643
0.393AspHis: 0.393 ± 0.2
4.417AspIle: 4.417 ± 0.729
2.257AspLys: 2.257 ± 0.464
4.515AspLeu: 4.515 ± 0.608
0.981AspMet: 0.981 ± 0.309
2.061AspAsn: 2.061 ± 0.456
1.57AspPro: 1.57 ± 0.476
1.472AspGln: 1.472 ± 0.461
2.159AspArg: 2.159 ± 0.457
2.355AspSer: 2.355 ± 0.41
3.337AspThr: 3.337 ± 0.475
3.533AspVal: 3.533 ± 0.601
1.178AspTrp: 1.178 ± 0.296
2.257AspTyr: 2.257 ± 0.503
0.0AspXaa: 0.0 ± 0.0
Glu
4.907GluAla: 4.907 ± 0.805
0.294GluCys: 0.294 ± 0.161
2.159GluAsp: 2.159 ± 0.453
3.337GluGlu: 3.337 ± 0.487
2.061GluPhe: 2.061 ± 0.515
2.257GluGly: 2.257 ± 0.447
1.276GluHis: 1.276 ± 0.343
3.141GluIle: 3.141 ± 0.769
4.122GluLys: 4.122 ± 0.577
7.852GluLeu: 7.852 ± 0.744
2.355GluMet: 2.355 ± 0.438
3.435GluAsn: 3.435 ± 0.708
2.65GluPro: 2.65 ± 0.591
2.355GluGln: 2.355 ± 0.444
4.907GluArg: 4.907 ± 0.955
4.024GluSer: 4.024 ± 0.714
3.533GluThr: 3.533 ± 0.546
3.435GluVal: 3.435 ± 0.716
1.668GluTrp: 1.668 ± 0.496
2.552GluTyr: 2.552 ± 0.556
0.0GluXaa: 0.0 ± 0.0
Phe
2.65PheAla: 2.65 ± 0.621
0.393PheCys: 0.393 ± 0.165
1.963PheAsp: 1.963 ± 0.414
2.061PheGlu: 2.061 ± 0.443
1.472PhePhe: 1.472 ± 0.381
1.668PheGly: 1.668 ± 0.469
0.785PheHis: 0.785 ± 0.26
2.061PheIle: 2.061 ± 0.646
2.257PheLys: 2.257 ± 0.408
3.435PheLeu: 3.435 ± 0.58
1.08PheMet: 1.08 ± 0.312
2.552PheAsn: 2.552 ± 0.514
1.08PhePro: 1.08 ± 0.314
1.178PheGln: 1.178 ± 0.296
1.963PheArg: 1.963 ± 0.414
2.355PheSer: 2.355 ± 0.485
3.141PheThr: 3.141 ± 0.464
1.472PheVal: 1.472 ± 0.441
0.883PheTrp: 0.883 ± 0.288
1.178PheTyr: 1.178 ± 0.348
0.0PheXaa: 0.0 ± 0.0
Gly
5.987GlyAla: 5.987 ± 0.889
0.589GlyCys: 0.589 ± 0.316
4.22GlyAsp: 4.22 ± 0.426
4.515GlyGlu: 4.515 ± 0.572
2.061GlyPhe: 2.061 ± 0.439
5.496GlyGly: 5.496 ± 0.855
0.687GlyHis: 0.687 ± 0.161
3.631GlyIle: 3.631 ± 0.587
5.692GlyLys: 5.692 ± 0.736
4.613GlyLeu: 4.613 ± 0.462
2.846GlyMet: 2.846 ± 0.676
2.159GlyAsn: 2.159 ± 0.752
0.294GlyPro: 0.294 ± 0.192
2.454GlyGln: 2.454 ± 0.442
4.711GlyArg: 4.711 ± 0.76
3.533GlySer: 3.533 ± 0.61
4.024GlyThr: 4.024 ± 0.846
5.987GlyVal: 5.987 ± 0.916
1.472GlyTrp: 1.472 ± 0.334
2.061GlyTyr: 2.061 ± 0.39
0.0GlyXaa: 0.0 ± 0.0
His
1.963HisAla: 1.963 ± 0.516
0.491HisCys: 0.491 ± 0.236
1.178HisAsp: 1.178 ± 0.379
1.178HisGlu: 1.178 ± 0.411
0.687HisPhe: 0.687 ± 0.299
1.374HisGly: 1.374 ± 0.383
0.687HisHis: 0.687 ± 0.226
1.276HisIle: 1.276 ± 0.352
0.491HisLys: 0.491 ± 0.202
1.668HisLeu: 1.668 ± 0.347
0.491HisMet: 0.491 ± 0.229
1.276HisAsn: 1.276 ± 0.4
0.981HisPro: 0.981 ± 0.294
0.883HisGln: 0.883 ± 0.274
0.981HisArg: 0.981 ± 0.299
0.981HisSer: 0.981 ± 0.346
1.276HisThr: 1.276 ± 0.543
0.883HisVal: 0.883 ± 0.299
0.393HisTrp: 0.393 ± 0.182
0.393HisTyr: 0.393 ± 0.2
0.0HisXaa: 0.0 ± 0.0
Ile
5.005IleAla: 5.005 ± 0.65
0.589IleCys: 0.589 ± 0.297
3.631IleAsp: 3.631 ± 0.662
3.435IleGlu: 3.435 ± 0.482
1.963IlePhe: 1.963 ± 0.643
3.926IleGly: 3.926 ± 0.793
0.589IleHis: 0.589 ± 0.296
3.631IleIle: 3.631 ± 0.609
2.65IleLys: 2.65 ± 0.643
3.337IleLeu: 3.337 ± 0.676
1.178IleMet: 1.178 ± 0.336
3.239IleAsn: 3.239 ± 0.435
2.748IlePro: 2.748 ± 0.584
1.472IleGln: 1.472 ± 0.411
4.318IleArg: 4.318 ± 0.594
3.828IleSer: 3.828 ± 0.722
4.417IleThr: 4.417 ± 0.597
3.631IleVal: 3.631 ± 0.605
0.589IleTrp: 0.589 ± 0.188
2.061IleTyr: 2.061 ± 0.425
0.0IleXaa: 0.0 ± 0.0
Lys
4.907LysAla: 4.907 ± 0.795
0.196LysCys: 0.196 ± 0.13
2.159LysAsp: 2.159 ± 0.505
3.239LysGlu: 3.239 ± 0.565
1.668LysPhe: 1.668 ± 0.291
2.846LysGly: 2.846 ± 0.408
1.57LysHis: 1.57 ± 0.424
2.355LysIle: 2.355 ± 0.449
4.515LysLys: 4.515 ± 0.646
6.478LysLeu: 6.478 ± 0.672
1.08LysMet: 1.08 ± 0.32
3.337LysAsn: 3.337 ± 0.725
3.141LysPro: 3.141 ± 0.618
2.159LysGln: 2.159 ± 0.461
3.926LysArg: 3.926 ± 0.653
2.846LysSer: 2.846 ± 0.451
5.104LysThr: 5.104 ± 0.84
3.435LysVal: 3.435 ± 0.579
0.883LysTrp: 0.883 ± 0.318
2.355LysTyr: 2.355 ± 0.519
0.0LysXaa: 0.0 ± 0.0
Leu
8.931LeuAla: 8.931 ± 0.917
0.687LeuCys: 0.687 ± 0.272
4.613LeuAsp: 4.613 ± 0.784
5.791LeuGlu: 5.791 ± 0.827
3.926LeuPhe: 3.926 ± 0.708
4.711LeuGly: 4.711 ± 0.693
1.865LeuHis: 1.865 ± 0.43
4.809LeuIle: 4.809 ± 0.79
6.183LeuLys: 6.183 ± 0.924
5.692LeuLeu: 5.692 ± 0.837
2.846LeuMet: 2.846 ± 0.501
4.907LeuAsn: 4.907 ± 0.518
3.926LeuPro: 3.926 ± 0.677
2.552LeuGln: 2.552 ± 0.538
4.907LeuArg: 4.907 ± 0.663
7.852LeuSer: 7.852 ± 1.01
6.772LeuThr: 6.772 ± 0.784
4.417LeuVal: 4.417 ± 0.548
0.687LeuTrp: 0.687 ± 0.233
2.748LeuTyr: 2.748 ± 0.538
0.0LeuXaa: 0.0 ± 0.0
Met
3.239MetAla: 3.239 ± 0.553
0.196MetCys: 0.196 ± 0.119
0.491MetAsp: 0.491 ± 0.252
1.767MetGlu: 1.767 ± 0.386
1.178MetPhe: 1.178 ± 0.373
0.785MetGly: 0.785 ± 0.257
0.589MetHis: 0.589 ± 0.235
1.374MetIle: 1.374 ± 0.377
1.472MetLys: 1.472 ± 0.363
2.748MetLeu: 2.748 ± 0.542
1.178MetMet: 1.178 ± 0.317
1.963MetAsn: 1.963 ± 0.494
1.276MetPro: 1.276 ± 0.329
0.883MetGln: 0.883 ± 0.311
2.159MetArg: 2.159 ± 0.452
1.963MetSer: 1.963 ± 0.324
2.552MetThr: 2.552 ± 0.51
1.472MetVal: 1.472 ± 0.374
0.294MetTrp: 0.294 ± 0.178
0.785MetTyr: 0.785 ± 0.257
0.0MetXaa: 0.0 ± 0.0
Asn
4.22AsnAla: 4.22 ± 0.758
0.491AsnCys: 0.491 ± 0.244
2.355AsnAsp: 2.355 ± 0.547
2.65AsnGlu: 2.65 ± 0.639
0.883AsnPhe: 0.883 ± 0.282
3.631AsnGly: 3.631 ± 0.765
0.981AsnHis: 0.981 ± 0.308
3.042AsnIle: 3.042 ± 0.592
3.042AsnLys: 3.042 ± 0.747
3.337AsnLeu: 3.337 ± 0.519
1.178AsnMet: 1.178 ± 0.347
1.865AsnAsn: 1.865 ± 0.425
2.159AsnPro: 2.159 ± 0.442
1.276AsnGln: 1.276 ± 0.351
2.748AsnArg: 2.748 ± 0.67
2.65AsnSer: 2.65 ± 0.43
1.963AsnThr: 1.963 ± 0.453
2.454AsnVal: 2.454 ± 0.374
0.098AsnTrp: 0.098 ± 0.092
1.276AsnTyr: 1.276 ± 0.393
0.0AsnXaa: 0.0 ± 0.0
Pro
4.318ProAla: 4.318 ± 0.675
0.0ProCys: 0.0 ± 0.0
3.631ProAsp: 3.631 ± 0.636
3.337ProGlu: 3.337 ± 0.572
0.981ProPhe: 0.981 ± 0.439
2.159ProGly: 2.159 ± 0.495
1.178ProHis: 1.178 ± 0.341
1.472ProIle: 1.472 ± 0.362
1.963ProLys: 1.963 ± 0.472
3.435ProLeu: 3.435 ± 0.487
0.589ProMet: 0.589 ± 0.217
0.883ProAsn: 0.883 ± 0.401
1.963ProPro: 1.963 ± 0.499
1.767ProGln: 1.767 ± 0.391
2.454ProArg: 2.454 ± 0.68
2.748ProSer: 2.748 ± 0.554
1.472ProThr: 1.472 ± 0.34
5.005ProVal: 5.005 ± 0.772
0.589ProTrp: 0.589 ± 0.214
0.883ProTyr: 0.883 ± 0.278
0.0ProXaa: 0.0 ± 0.0
Gln
3.631GlnAla: 3.631 ± 1.199
0.196GlnCys: 0.196 ± 0.154
2.355GlnAsp: 2.355 ± 0.524
2.355GlnGlu: 2.355 ± 0.526
0.589GlnPhe: 0.589 ± 0.231
1.767GlnGly: 1.767 ± 0.342
0.393GlnHis: 0.393 ± 0.197
2.061GlnIle: 2.061 ± 0.632
2.257GlnLys: 2.257 ± 0.405
3.533GlnLeu: 3.533 ± 0.65
0.981GlnMet: 0.981 ± 0.269
0.589GlnAsn: 0.589 ± 0.192
1.57GlnPro: 1.57 ± 0.415
1.963GlnGln: 1.963 ± 0.501
4.122GlnArg: 4.122 ± 0.693
2.944GlnSer: 2.944 ± 0.493
2.552GlnThr: 2.552 ± 0.652
1.668GlnVal: 1.668 ± 0.387
0.589GlnTrp: 0.589 ± 0.208
0.393GlnTyr: 0.393 ± 0.197
0.0GlnXaa: 0.0 ± 0.0
Arg
5.398ArgAla: 5.398 ± 0.565
1.08ArgCys: 1.08 ± 0.295
2.748ArgAsp: 2.748 ± 0.527
4.122ArgGlu: 4.122 ± 0.637
2.061ArgPhe: 2.061 ± 0.481
3.926ArgGly: 3.926 ± 0.792
1.767ArgHis: 1.767 ± 0.458
3.435ArgIle: 3.435 ± 0.654
3.239ArgLys: 3.239 ± 0.61
6.379ArgLeu: 6.379 ± 0.874
1.668ArgMet: 1.668 ± 0.461
2.65ArgAsn: 2.65 ± 0.507
1.963ArgPro: 1.963 ± 0.399
3.337ArgGln: 3.337 ± 0.776
4.515ArgArg: 4.515 ± 0.675
3.042ArgSer: 3.042 ± 0.535
3.141ArgThr: 3.141 ± 0.534
5.005ArgVal: 5.005 ± 0.939
0.981ArgTrp: 0.981 ± 0.264
2.846ArgTyr: 2.846 ± 0.542
0.0ArgXaa: 0.0 ± 0.0
Ser
6.085SerAla: 6.085 ± 0.919
0.491SerCys: 0.491 ± 0.21
4.122SerAsp: 4.122 ± 0.599
4.318SerGlu: 4.318 ± 0.641
2.159SerPhe: 2.159 ± 0.55
4.907SerGly: 4.907 ± 0.918
1.276SerHis: 1.276 ± 0.519
3.631SerIle: 3.631 ± 0.676
3.042SerLys: 3.042 ± 0.46
6.772SerLeu: 6.772 ± 0.94
1.865SerMet: 1.865 ± 0.419
2.65SerAsn: 2.65 ± 0.485
2.65SerPro: 2.65 ± 0.639
2.454SerGln: 2.454 ± 0.37
3.337SerArg: 3.337 ± 0.619
2.552SerSer: 2.552 ± 0.397
4.22SerThr: 4.22 ± 0.694
4.907SerVal: 4.907 ± 0.631
0.491SerTrp: 0.491 ± 0.169
1.767SerTyr: 1.767 ± 0.481
0.0SerXaa: 0.0 ± 0.0
Thr
6.478ThrAla: 6.478 ± 0.924
0.687ThrCys: 0.687 ± 0.306
3.73ThrAsp: 3.73 ± 0.548
3.042ThrGlu: 3.042 ± 0.524
2.846ThrPhe: 2.846 ± 0.644
7.066ThrGly: 7.066 ± 0.84
1.08ThrHis: 1.08 ± 0.296
3.435ThrIle: 3.435 ± 0.521
3.533ThrLys: 3.533 ± 0.817
6.085ThrLeu: 6.085 ± 0.989
2.159ThrMet: 2.159 ± 0.402
2.552ThrAsn: 2.552 ± 0.623
3.239ThrPro: 3.239 ± 0.428
1.57ThrGln: 1.57 ± 0.484
3.73ThrArg: 3.73 ± 0.522
3.828ThrSer: 3.828 ± 0.561
3.631ThrThr: 3.631 ± 0.635
5.398ThrVal: 5.398 ± 1.105
0.589ThrTrp: 0.589 ± 0.228
1.276ThrTyr: 1.276 ± 0.302
0.0ThrXaa: 0.0 ± 0.0
Val
6.379ValAla: 6.379 ± 0.846
1.276ValCys: 1.276 ± 0.33
3.337ValAsp: 3.337 ± 0.59
4.318ValGlu: 4.318 ± 0.656
2.355ValPhe: 2.355 ± 0.497
4.318ValGly: 4.318 ± 0.698
0.687ValHis: 0.687 ± 0.244
4.318ValIle: 4.318 ± 0.609
4.417ValLys: 4.417 ± 0.652
5.594ValLeu: 5.594 ± 0.773
1.963ValMet: 1.963 ± 0.343
2.65ValAsn: 2.65 ± 0.513
2.355ValPro: 2.355 ± 0.47
2.257ValGln: 2.257 ± 0.386
3.042ValArg: 3.042 ± 0.501
5.398ValSer: 5.398 ± 0.707
5.496ValThr: 5.496 ± 0.978
4.024ValVal: 4.024 ± 0.608
0.589ValTrp: 0.589 ± 0.267
1.374ValTyr: 1.374 ± 0.394
0.0ValXaa: 0.0 ± 0.0
Trp
1.178TrpAla: 1.178 ± 0.266
0.098TrpCys: 0.098 ± 0.123
0.981TrpAsp: 0.981 ± 0.276
1.08TrpGlu: 1.08 ± 0.247
0.393TrpPhe: 0.393 ± 0.187
0.589TrpGly: 0.589 ± 0.287
0.589TrpHis: 0.589 ± 0.24
0.687TrpIle: 0.687 ± 0.257
0.883TrpLys: 0.883 ± 0.348
1.57TrpLeu: 1.57 ± 0.465
0.294TrpMet: 0.294 ± 0.167
0.687TrpAsn: 0.687 ± 0.334
1.178TrpPro: 1.178 ± 0.323
0.393TrpGln: 0.393 ± 0.144
1.668TrpArg: 1.668 ± 0.429
0.687TrpSer: 0.687 ± 0.237
0.491TrpThr: 0.491 ± 0.209
0.687TrpVal: 0.687 ± 0.214
0.785TrpTrp: 0.785 ± 0.302
0.393TrpTyr: 0.393 ± 0.18
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.042TyrAla: 3.042 ± 0.592
0.196TyrCys: 0.196 ± 0.143
0.883TyrAsp: 0.883 ± 0.309
2.552TyrGlu: 2.552 ± 0.493
1.374TyrPhe: 1.374 ± 0.442
2.257TyrGly: 2.257 ± 0.513
0.981TyrHis: 0.981 ± 0.314
2.748TyrIle: 2.748 ± 0.571
0.981TyrLys: 0.981 ± 0.38
1.963TyrLeu: 1.963 ± 0.542
0.883TyrMet: 0.883 ± 0.287
0.589TyrAsn: 0.589 ± 0.194
1.668TyrPro: 1.668 ± 0.403
1.668TyrGln: 1.668 ± 0.404
2.061TyrArg: 2.061 ± 0.423
1.767TyrSer: 1.767 ± 0.443
1.963TyrThr: 1.963 ± 0.456
1.472TyrVal: 1.472 ± 0.456
0.589TyrTrp: 0.589 ± 0.207
1.178TyrTyr: 1.178 ± 0.361
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (10190 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski