Amino acid dipeptide frequency for Escherichia virus Lambda_2G7b

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally frequent, each dipeptide would have a frequency of 0.25% (2.5‰). Since this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue, for better visibility.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
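The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function names, the one-letter input format, and the choice to pool overlapping residue pairs over all proteins (never across protein boundaries) are assumptions. That normalization does appear consistent with the table, since one occurrence among 13695 − 66 = 13629 within-protein pairs gives about 0.073‰, the smallest non-zero entry.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard residues (one-letter codes)
EXPECTED_PERMILLE = 1000.0 / 400        # uniform expectation: 2.5 per mille (0.25%)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: sequences are one-letter strings; any non-standard letter
    is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def representation(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    toy = ["AAC", "ACU"]  # hypothetical toy proteome; U is non-standard
    for dp, freq in sorted(dipeptide_permille(toy).items()):
        print(dp, round(freq, 3), representation(freq))
```

With real data, `representation` would label AlaAla (13.656‰) as overrepresented and MetCys (0.073‰) as underrepresented relative to the 2.5‰ uniform expectation.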

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error (‰).

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 13.656 ± 2.818 | 1.022 ± 0.316 | 3.87 ± 0.669 | 6.718 ± 0.711 | 2.921 ± 0.44 | 7.229 ± 0.809 | 1.022 ± 0.272 | 5.477 ± 0.541 | 5.185 ± 1.168 | 7.814 ± 1.117 | 2.921 ± 0.419 | 3.286 ± 0.514 | 2.045 ± 0.403 | 4.601 ± 0.973 | 6.134 ± 0.963 | 8.033 ± 1.072 | 5.185 ± 0.869 | 6.134 ± 0.734 | 2.045 ± 0.48 | 3.213 ± 0.463 | 0.0 ± 0.0 |
| Cys | 0.949 ± 0.293 | 0.438 ± 0.224 | 0.438 ± 0.155 | 0.73 ± 0.241 | 0.146 ± 0.103 | 0.584 ± 0.231 | 0.438 ± 0.179 | 0.803 ± 0.218 | 0.292 ± 0.144 | 0.876 ± 0.262 | 0.219 ± 0.112 | 0.584 ± 0.173 | 0.438 ± 0.181 | 0.292 ± 0.145 | 1.314 ± 0.319 | 0.949 ± 0.305 | 0.657 ± 0.246 | 1.022 ± 0.222 | 0.365 ± 0.168 | 0.292 ± 0.126 | 0.0 ± 0.0 |
| Asp | 5.258 ± 0.666 | 0.584 ± 0.192 | 5.185 ± 0.519 | 4.016 ± 0.622 | 1.972 ± 0.297 | 5.988 ± 0.606 | 0.438 ± 0.171 | 4.089 ± 0.49 | 2.921 ± 0.443 | 4.089 ± 0.574 | 1.607 ± 0.372 | 2.556 ± 0.49 | 2.264 ± 0.691 | 1.607 ± 0.316 | 2.921 ± 0.479 | 3.797 ± 0.537 | 3.651 ± 0.504 | 3.797 ± 0.546 | 1.387 ± 0.41 | 1.972 ± 0.369 | 0.0 ± 0.0 |
| Glu | 5.988 ± 0.737 | 0.73 ± 0.282 | 2.848 ± 0.438 | 4.455 ± 0.598 | 2.191 ± 0.401 | 3.432 ± 0.541 | 1.68 ± 0.411 | 3.578 ± 0.389 | 3.578 ± 0.453 | 5.477 ± 0.796 | 1.534 ± 0.408 | 2.483 ± 0.439 | 2.191 ± 0.342 | 4.089 ± 0.687 | 3.651 ± 0.53 | 3.87 ± 0.475 | 4.381 ± 0.934 | 3.067 ± 0.573 | 1.095 ± 0.249 | 1.607 ± 0.337 | 0.0 ± 0.0 |
| Phe | 1.972 ± 0.367 | 0.657 ± 0.204 | 2.921 ± 0.431 | 2.118 ± 0.451 | 1.022 ± 0.277 | 2.994 ± 0.511 | 0.73 ± 0.207 | 1.314 ± 0.313 | 2.045 ± 0.415 | 1.899 ± 0.357 | 0.949 ± 0.269 | 1.168 ± 0.275 | 1.899 ± 0.352 | 0.511 ± 0.155 | 2.483 ± 0.466 | 3.286 ± 0.581 | 2.848 ± 0.424 | 2.41 ± 0.349 | 0.657 ± 0.209 | 0.803 ± 0.239 | 0.0 ± 0.0 |
| Gly | 5.915 ± 0.874 | 0.803 ± 0.235 | 4.966 ± 0.499 | 3.651 ± 0.611 | 2.556 ± 0.477 | 5.623 ± 0.744 | 0.949 ± 0.339 | 3.87 ± 0.533 | 4.601 ± 0.651 | 5.696 ± 0.606 | 2.629 ± 0.551 | 3.724 ± 0.515 | 1.168 ± 0.2 | 3.14 ± 0.5 | 3.651 ± 0.417 | 4.893 ± 0.64 | 4.235 ± 0.617 | 5.55 ± 0.625 | 1.826 ± 0.359 | 2.556 ± 0.508 | 0.0 ± 0.0 |
| His | 1.168 ± 0.271 | 0.365 ± 0.144 | 1.095 ± 0.268 | 0.73 ± 0.242 | 1.022 ± 0.321 | 1.534 ± 0.308 | 0.511 ± 0.199 | 1.095 ± 0.261 | 1.095 ± 0.269 | 1.607 ± 0.394 | 0.292 ± 0.138 | 1.168 ± 0.301 | 0.803 ± 0.279 | 0.584 ± 0.201 | 0.876 ± 0.235 | 0.657 ± 0.211 | 1.022 ± 0.238 | 1.095 ± 0.228 | 0.292 ± 0.155 | 1.095 ± 0.288 | 0.0 ± 0.0 |
| Ile | 4.308 ± 0.524 | 0.803 ± 0.26 | 3.213 ± 0.508 | 3.724 ± 0.552 | 1.46 ± 0.366 | 3.651 ± 0.523 | 0.73 ± 0.259 | 2.921 ± 0.538 | 3.067 ± 0.611 | 2.848 ± 0.485 | 0.949 ± 0.266 | 3.286 ± 0.546 | 2.045 ± 0.379 | 2.191 ± 0.407 | 3.14 ± 0.387 | 4.016 ± 0.51 | 4.674 ± 0.775 | 2.629 ± 0.383 | 0.73 ± 0.283 | 1.607 ± 0.376 | 0.0 ± 0.0 |
| Lys | 5.988 ± 0.847 | 0.438 ± 0.23 | 3.505 ± 0.594 | 3.286 ± 0.555 | 1.753 ± 0.303 | 3.797 ± 0.599 | 1.314 ± 0.29 | 2.41 ± 0.442 | 3.651 ± 0.581 | 3.87 ± 0.625 | 1.607 ± 0.333 | 2.994 ± 0.525 | 1.68 ± 0.358 | 2.556 ± 0.449 | 3.505 ± 0.574 | 3.724 ± 0.483 | 4.162 ± 0.713 | 2.848 ± 0.452 | 1.387 ± 0.286 | 2.264 ± 0.357 | 0.0 ± 0.0 |
| Leu | 7.449 ± 0.945 | 1.022 ± 0.292 | 4.016 ± 0.539 | 3.797 ± 0.572 | 2.264 ± 0.427 | 4.82 ± 0.656 | 1.241 ± 0.35 | 3.578 ± 0.472 | 4.82 ± 0.659 | 5.258 ± 0.655 | 2.337 ± 0.531 | 3.432 ± 0.441 | 3.651 ± 0.436 | 2.848 ± 0.414 | 4.82 ± 0.585 | 5.55 ± 0.748 | 6.645 ± 0.794 | 3.943 ± 0.608 | 1.68 ± 0.313 | 1.899 ± 0.368 | 0.0 ± 0.0 |
| Met | 3.067 ± 0.513 | 0.073 ± 0.064 | 1.68 ± 0.366 | 1.022 ± 0.319 | 1.46 ± 0.35 | 1.168 ± 0.217 | 0.365 ± 0.226 | 0.949 ± 0.248 | 1.899 ± 0.443 | 2.629 ± 0.412 | 0.657 ± 0.21 | 1.314 ± 0.32 | 1.826 ± 0.406 | 1.095 ± 0.357 | 1.972 ± 0.311 | 1.607 ± 0.325 | 3.067 ± 0.547 | 1.899 ± 0.349 | 0.219 ± 0.11 | 0.365 ± 0.208 | 0.0 ± 0.0 |
| Asn | 4.82 ± 0.693 | 0.584 ± 0.233 | 2.41 ± 0.429 | 2.702 ± 0.368 | 1.826 ± 0.427 | 3.432 ± 0.507 | 1.168 ± 0.276 | 2.702 ± 0.452 | 2.921 ± 0.509 | 1.972 ± 0.258 | 1.534 ± 0.299 | 2.41 ± 0.48 | 1.899 ± 0.281 | 0.876 ± 0.259 | 2.702 ± 0.567 | 2.702 ± 0.697 | 2.702 ± 0.37 | 2.118 ± 0.295 | 0.511 ± 0.172 | 1.314 ± 0.36 | 0.0 ± 0.0 |
| Pro | 3.286 ± 0.525 | 0.292 ± 0.151 | 3.432 ± 0.581 | 2.994 ± 0.546 | 1.095 ± 0.329 | 3.14 ± 0.483 | 0.803 ± 0.21 | 1.387 ± 0.296 | 1.68 ± 0.421 | 2.337 ± 0.451 | 0.584 ± 0.226 | 1.68 ± 0.392 | 1.314 ± 0.314 | 1.314 ± 0.306 | 1.314 ± 0.343 | 2.629 ± 0.394 | 1.972 ± 0.378 | 3.213 ± 0.453 | 0.803 ± 0.277 | 1.095 ± 0.357 | 0.0 ± 0.0 |
| Gln | 3.943 ± 0.867 | 0.438 ± 0.181 | 1.46 ± 0.317 | 3.213 ± 0.497 | 1.387 ± 0.365 | 2.264 ± 0.401 | 0.73 ± 0.268 | 2.702 ± 0.355 | 1.972 ± 0.371 | 3.651 ± 0.451 | 1.387 ± 0.334 | 1.972 ± 0.381 | 1.168 ± 0.346 | 2.483 ± 0.659 | 2.629 ± 0.437 | 3.213 ± 0.518 | 2.921 ± 0.544 | 2.848 ± 0.452 | 0.511 ± 0.187 | 1.241 ± 0.323 | 0.0 ± 0.0 |
| Arg | 4.601 ± 0.664 | 0.365 ± 0.161 | 3.505 ± 0.624 | 3.797 ± 0.683 | 2.41 ± 0.404 | 3.432 ± 0.59 | 1.68 ± 0.34 | 4.162 ± 0.593 | 3.578 ± 0.561 | 5.331 ± 0.716 | 2.337 ± 0.402 | 2.264 ± 0.402 | 1.534 ± 0.321 | 3.067 ± 0.559 | 5.112 ± 0.867 | 2.629 ± 0.398 | 3.14 ± 0.504 | 3.578 ± 0.65 | 1.314 ± 0.318 | 1.753 ± 0.34 | 0.0 ± 0.0 |
| Ser | 7.522 ± 0.931 | 0.73 ± 0.2 | 4.308 ± 0.602 | 5.258 ± 1.049 | 2.264 ± 0.447 | 6.937 ± 0.8 | 0.949 ± 0.264 | 2.629 ± 0.354 | 3.724 ± 0.533 | 4.528 ± 0.663 | 2.191 ± 0.374 | 2.264 ± 0.355 | 2.483 ± 0.464 | 3.797 ± 0.714 | 4.162 ± 0.641 | 3.505 ± 0.544 | 4.381 ± 0.57 | 4.893 ± 0.658 | 1.168 ± 0.334 | 1.826 ± 0.386 | 0.0 ± 0.0 |
| Thr | 7.887 ± 1.111 | 0.803 ± 0.233 | 4.308 ± 0.61 | 3.87 ± 0.542 | 2.921 ± 0.471 | 4.893 ± 0.602 | 1.534 ± 0.388 | 2.702 ± 0.428 | 3.505 ± 0.524 | 5.988 ± 0.619 | 1.168 ± 0.323 | 1.826 ± 0.568 | 3.87 ± 0.616 | 2.556 ± 0.398 | 2.848 ± 0.374 | 4.601 ± 0.705 | 4.016 ± 0.642 | 4.089 ± 0.867 | 1.387 ± 0.302 | 2.337 ± 0.456 | 0.0 ± 0.0 |
| Val | 6.134 ± 0.753 | 0.657 ± 0.246 | 3.797 ± 0.448 | 3.286 ± 0.53 | 2.264 ± 0.382 | 3.432 ± 0.589 | 0.876 ± 0.216 | 2.994 ± 0.509 | 3.87 ± 0.55 | 4.966 ± 0.627 | 2.045 ± 0.417 | 3.359 ± 0.429 | 2.045 ± 0.367 | 2.337 ± 0.467 | 3.067 ± 0.538 | 5.258 ± 0.718 | 4.455 ± 0.504 | 4.016 ± 0.517 | 0.876 ± 0.236 | 2.264 ± 0.396 | 0.0 ± 0.0 |
| Trp | 1.534 ± 0.287 | 0.438 ± 0.136 | 1.168 ± 0.284 | 0.584 ± 0.184 | 0.584 ± 0.197 | 1.314 ± 0.28 | 0.584 ± 0.201 | 1.022 ± 0.302 | 1.095 ± 0.263 | 2.045 ± 0.497 | 0.73 ± 0.183 | 0.584 ± 0.2 | 0.876 ± 0.25 | 0.73 ± 0.211 | 1.022 ± 0.267 | 1.022 ± 0.248 | 1.168 ± 0.295 | 1.168 ± 0.303 | 0.365 ± 0.19 | 0.949 ± 0.235 | 0.0 ± 0.0 |
| Tyr | 3.067 ± 0.525 | 0.584 ± 0.169 | 1.972 ± 0.297 | 1.972 ± 0.381 | 1.241 ± 0.345 | 2.337 ± 0.41 | 0.365 ± 0.165 | 1.607 ± 0.408 | 1.241 ± 0.286 | 2.191 ± 0.424 | 0.584 ± 0.251 | 0.949 ± 0.219 | 1.168 ± 0.312 | 1.46 ± 0.278 | 2.264 ± 0.449 | 3.578 ± 0.598 | 1.826 ± 0.453 | 1.68 ± 0.405 | 0.365 ± 0.153 | 1.241 ± 0.312 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 66 proteins (13695 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
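As a rough sketch of what protein-level bootstrapping could look like, the snippet below resamples whole proteins with replacement, recomputes the per mille frequency of one dipeptide in each resample, and reports the mean and standard deviation over the resamples. The page does not state which estimator is used, so taking the standard deviation as the error is an assumption, as are the function names, the fixed seed, and the one-letter input format.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of
    `dipeptide` over `n_rounds` resamples of whole proteins.

    Assumption: the error is the standard deviation across resamples.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome holds, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["AAAC", "ACCA", "CACA"]  # hypothetical toy proteome
    mean, err = bootstrap_error(toy, "AC")
    print(f"AC: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residue pairs keeps pairs from the same protein together, so the spread reflects variation between proteins. A dipeptide that never occurs yields 0.0 ± 0.0 under this scheme, as in the Xaa row and column.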

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski