Amino acid dipeptide frequency for Escherichia phage vB_EcoS_fFiEco02

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If the 400 standard dipeptides occurred at random with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
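The calculation described above can be sketched in Python. This is a minimal illustration, not the code used to build the table: the one-letter alphabet, the function names `dipeptide_per_mille` and `classify`, the normalisation by the total number of overlapping pairs, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (Xaa is left out here).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(AMINO_ACIDS, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def classify(freq):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"    # would be red in the table
    if freq < EXPECTED:
        return "under"   # would be blue in the table
    return "expected"


# Made-up sequences for illustration, not taken from the phage proteome.
toy = ["MKAAL", "AAGK"]
freqs = dipeptide_per_mille(toy)
print(freqs["AA"], classify(freqs["AA"]))
```

Applied to values from the table, `classify(10.812)` labels AlaAla as overrepresented and `classify(0.142)` labels CysCys as underrepresented, assuming the colouring uses the uniform 2.5‰ threshold.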

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
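As a short worked example of the unit conversion, the snippet below converts the AlaAla value from the table into a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper name `per_mille_to_fraction` is hypothetical and only used for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


ala_ala = 10.812                            # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)   # about 0.010812
percent = fraction * 100                    # about 1.0812 percent
ratio = ala_ala / 2.5                       # relative to the uniform 2.5 per mille

print(fraction, percent, ratio)
```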

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.812 ± 1.58
AlaCys: 1.067 ± 0.312
AlaAsp: 5.619 ± 0.72
AlaGlu: 7.113 ± 0.91
AlaPhe: 4.552 ± 0.56
AlaGly: 7.54 ± 0.652
AlaHis: 1.778 ± 0.377
AlaIle: 4.979 ± 0.6
AlaLys: 5.477 ± 0.636
AlaLeu: 7.753 ± 0.708
AlaMet: 2.276 ± 0.44
AlaAsn: 3.983 ± 0.448
AlaPro: 3.485 ± 0.444
AlaGln: 3.983 ± 0.681
AlaArg: 4.908 ± 0.559
AlaSer: 6.117 ± 0.894
AlaThr: 6.544 ± 0.962
AlaVal: 7.255 ± 0.771
AlaTrp: 1.494 ± 0.278
AlaTyr: 3.699 ± 0.383
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.996 ± 0.299
CysCys: 0.142 ± 0.106
CysAsp: 1.067 ± 0.261
CysGlu: 1.209 ± 0.328
CysPhe: 0.427 ± 0.173
CysGly: 1.138 ± 0.303
CysHis: 0.071 ± 0.061
CysIle: 0.427 ± 0.172
CysLys: 0.427 ± 0.175
CysLeu: 0.64 ± 0.188
CysMet: 0.285 ± 0.138
CysAsn: 0.498 ± 0.203
CysPro: 0.356 ± 0.18
CysGln: 0.213 ± 0.115
CysArg: 1.067 ± 0.292
CysSer: 0.782 ± 0.207
CysThr: 0.64 ± 0.204
CysVal: 0.711 ± 0.209
CysTrp: 0.285 ± 0.143
CysTyr: 0.64 ± 0.246
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.828 ± 0.741
AspCys: 0.711 ± 0.216
AspAsp: 4.41 ± 0.681
AspGlu: 4.766 ± 0.557
AspPhe: 2.703 ± 0.403
AspGly: 5.335 ± 0.684
AspHis: 0.782 ± 0.223
AspIle: 3.628 ± 0.524
AspLys: 3.201 ± 0.412
AspLeu: 4.41 ± 0.53
AspMet: 1.423 ± 0.253
AspAsn: 2.703 ± 0.371
AspPro: 1.92 ± 0.388
AspGln: 0.711 ± 0.189
AspArg: 2.916 ± 0.399
AspSer: 2.703 ± 0.462
AspThr: 3.912 ± 0.508
AspVal: 4.766 ± 0.455
AspTrp: 0.854 ± 0.226
AspTyr: 2.063 ± 0.403
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.184 ± 0.88
GluCys: 0.782 ± 0.266
GluAsp: 2.987 ± 0.451
GluGlu: 5.121 ± 0.859
GluPhe: 2.561 ± 0.581
GluGly: 4.766 ± 0.731
GluHis: 1.138 ± 0.298
GluIle: 3.414 ± 0.541
GluLys: 4.979 ± 0.7
GluLeu: 6.188 ± 0.659
GluMet: 2.134 ± 0.35
GluAsn: 2.347 ± 0.522
GluPro: 1.92 ± 0.473
GluGln: 3.272 ± 0.698
GluArg: 3.556 ± 0.612
GluSer: 2.916 ± 0.43
GluThr: 3.201 ± 0.48
GluVal: 5.264 ± 0.503
GluTrp: 1.067 ± 0.306
GluTyr: 2.774 ± 0.474
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.063 ± 0.438
PheCys: 0.569 ± 0.209
PheAsp: 2.916 ± 0.527
PheGlu: 2.916 ± 0.435
PhePhe: 0.782 ± 0.228
PheGly: 3.343 ± 0.475
PheHis: 0.569 ± 0.222
PheIle: 2.774 ± 0.514
PheLys: 2.418 ± 0.376
PheLeu: 2.205 ± 0.399
PheMet: 0.427 ± 0.17
PheAsn: 1.565 ± 0.274
PhePro: 1.565 ± 0.388
PheGln: 1.209 ± 0.273
PheArg: 1.707 ± 0.336
PheSer: 2.774 ± 0.505
PheThr: 3.272 ± 0.448
PheVal: 2.49 ± 0.385
PheTrp: 0.711 ± 0.239
PheTyr: 1.494 ± 0.282
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.682 ± 0.819
GlyCys: 1.351 ± 0.338
GlyAsp: 4.623 ± 0.646
GlyGlu: 5.477 ± 0.739
GlyPhe: 3.13 ± 0.471
GlyGly: 5.833 ± 0.912
GlyHis: 1.351 ± 0.37
GlyIle: 3.201 ± 0.518
GlyLys: 5.05 ± 0.752
GlyLeu: 4.766 ± 0.506
GlyMet: 2.063 ± 0.584
GlyAsn: 3.77 ± 0.772
GlyPro: 1.778 ± 0.315
GlyGln: 2.774 ± 0.563
GlyArg: 3.556 ± 0.552
GlySer: 4.339 ± 0.528
GlyThr: 5.121 ± 0.644
GlyVal: 6.615 ± 0.776
GlyTrp: 1.565 ± 0.319
GlyTyr: 2.916 ± 0.397
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.067 ± 0.227
HisCys: 0.285 ± 0.131
HisAsp: 1.067 ± 0.233
HisGlu: 0.854 ± 0.25
HisPhe: 1.067 ± 0.345
HisGly: 1.494 ± 0.447
HisHis: 0.782 ± 0.252
HisIle: 0.854 ± 0.208
HisLys: 1.28 ± 0.326
HisLeu: 1.067 ± 0.286
HisMet: 0.213 ± 0.128
HisAsn: 0.498 ± 0.182
HisPro: 1.209 ± 0.334
HisGln: 0.569 ± 0.184
HisArg: 1.138 ± 0.279
HisSer: 0.711 ± 0.205
HisThr: 0.711 ± 0.224
HisVal: 1.494 ± 0.372
HisTrp: 0.071 ± 0.073
HisTyr: 0.356 ± 0.171
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.619 ± 0.805
IleCys: 0.925 ± 0.2
IleAsp: 4.268 ± 0.615
IleGlu: 2.916 ± 0.301
IlePhe: 1.138 ± 0.286
IleGly: 4.197 ± 0.474
IleHis: 0.498 ± 0.187
IleIle: 2.632 ± 0.481
IleLys: 3.343 ± 0.558
IleLeu: 2.418 ± 0.385
IleMet: 1.067 ± 0.347
IleAsn: 2.49 ± 0.453
IlePro: 2.845 ± 0.371
IleGln: 1.565 ± 0.326
IleArg: 3.059 ± 0.428
IleSer: 3.77 ± 0.548
IleThr: 4.268 ± 0.555
IleVal: 4.125 ± 0.457
IleTrp: 1.138 ± 0.283
IleTyr: 1.423 ± 0.375
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.402 ± 0.886
LysCys: 0.64 ± 0.225
LysAsp: 3.556 ± 0.468
LysGlu: 4.054 ± 0.672
LysPhe: 2.205 ± 0.309
LysGly: 3.699 ± 0.426
LysHis: 1.565 ± 0.303
LysIle: 2.561 ± 0.457
LysLys: 2.916 ± 0.408
LysLeu: 4.623 ± 0.653
LysMet: 3.13 ± 0.593
LysAsn: 2.276 ± 0.396
LysPro: 2.49 ± 0.448
LysGln: 2.205 ± 0.395
LysArg: 3.912 ± 0.507
LysSer: 2.561 ± 0.419
LysThr: 3.628 ± 0.535
LysVal: 3.343 ± 0.685
LysTrp: 0.996 ± 0.264
LysTyr: 2.774 ± 0.339
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.322 ± 0.806
LeuCys: 0.925 ± 0.252
LeuAsp: 4.054 ± 0.487
LeuGlu: 5.264 ± 0.65
LeuPhe: 2.347 ± 0.539
LeuGly: 4.979 ± 0.504
LeuHis: 0.925 ± 0.282
LeuIle: 4.552 ± 0.441
LeuLys: 3.699 ± 0.486
LeuLeu: 5.619 ± 0.682
LeuMet: 1.92 ± 0.286
LeuAsn: 2.916 ± 0.366
LeuPro: 3.485 ± 0.499
LeuGln: 2.632 ± 0.437
LeuArg: 4.552 ± 0.69
LeuSer: 4.41 ± 0.511
LeuThr: 5.761 ± 0.788
LeuVal: 4.766 ± 0.457
LeuTrp: 0.782 ± 0.258
LeuTyr: 2.134 ± 0.355
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.774 ± 0.465
MetCys: 0.285 ± 0.118
MetAsp: 0.854 ± 0.286
MetGlu: 1.209 ± 0.34
MetPhe: 0.854 ± 0.273
MetGly: 1.778 ± 0.365
MetHis: 0.285 ± 0.145
MetIle: 1.778 ± 0.343
MetLys: 2.205 ± 0.457
MetLeu: 1.778 ± 0.287
MetMet: 0.213 ± 0.106
MetAsn: 0.64 ± 0.239
MetPro: 1.351 ± 0.395
MetGln: 0.854 ± 0.272
MetArg: 1.138 ± 0.262
MetSer: 1.849 ± 0.324
MetThr: 1.707 ± 0.437
MetVal: 2.205 ± 0.313
MetTrp: 0.356 ± 0.162
MetTyr: 0.498 ± 0.204
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.695 ± 0.774
AsnCys: 0.285 ± 0.145
AsnAsp: 2.845 ± 0.401
AsnGlu: 1.992 ± 0.37
AsnPhe: 1.067 ± 0.245
AsnGly: 3.77 ± 0.562
AsnHis: 0.711 ± 0.216
AsnIle: 1.992 ± 0.312
AsnLys: 2.49 ± 0.36
AsnLeu: 2.845 ± 0.377
AsnMet: 0.925 ± 0.286
AsnAsn: 2.134 ± 0.391
AsnPro: 1.92 ± 0.337
AsnGln: 0.996 ± 0.221
AsnArg: 1.92 ± 0.346
AsnSer: 2.561 ± 0.324
AsnThr: 2.632 ± 0.484
AsnVal: 3.485 ± 0.403
AsnTrp: 0.782 ± 0.216
AsnTyr: 1.423 ± 0.278
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.987 ± 0.434
ProCys: 0.498 ± 0.198
ProAsp: 2.987 ± 0.399
ProGlu: 3.414 ± 0.527
ProPhe: 1.707 ± 0.373
ProGly: 2.987 ± 0.495
ProHis: 0.427 ± 0.153
ProIle: 1.494 ± 0.309
ProLys: 1.707 ± 0.369
ProLeu: 3.13 ± 0.516
ProMet: 0.711 ± 0.274
ProAsn: 1.423 ± 0.435
ProPro: 1.067 ± 0.341
ProGln: 1.209 ± 0.22
ProArg: 1.992 ± 0.409
ProSer: 2.063 ± 0.357
ProThr: 2.205 ± 0.403
ProVal: 3.912 ± 0.484
ProTrp: 0.213 ± 0.124
ProTyr: 0.996 ± 0.287
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.845 ± 0.499
GlnCys: 0.356 ± 0.195
GlnAsp: 1.778 ± 0.308
GlnGlu: 1.992 ± 0.378
GlnPhe: 0.996 ± 0.243
GlnGly: 1.992 ± 0.32
GlnHis: 0.782 ± 0.242
GlnIle: 2.49 ± 0.475
GlnLys: 1.992 ± 0.416
GlnLeu: 3.13 ± 0.583
GlnMet: 0.854 ± 0.23
GlnAsn: 1.28 ± 0.383
GlnPro: 1.423 ± 0.335
GlnGln: 2.418 ± 0.754
GlnArg: 2.632 ± 0.44
GlnSer: 1.707 ± 0.31
GlnThr: 2.205 ± 0.451
GlnVal: 2.49 ± 0.451
GlnTrp: 0.782 ± 0.228
GlnTyr: 1.565 ± 0.347
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.335 ± 0.605
ArgCys: 0.711 ± 0.246
ArgAsp: 2.49 ± 0.348
ArgGlu: 4.125 ± 0.66
ArgPhe: 2.134 ± 0.311
ArgGly: 3.272 ± 0.419
ArgHis: 1.28 ± 0.277
ArgIle: 3.201 ± 0.521
ArgLys: 4.41 ± 0.557
ArgLeu: 4.054 ± 0.383
ArgMet: 1.494 ± 0.313
ArgAsn: 2.703 ± 0.402
ArgPro: 1.351 ± 0.289
ArgGln: 2.347 ± 0.423
ArgArg: 4.339 ± 0.732
ArgSer: 3.414 ± 0.581
ArgThr: 2.276 ± 0.407
ArgVal: 3.77 ± 0.47
ArgTrp: 0.569 ± 0.192
ArgTyr: 1.778 ± 0.331
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.761 ± 0.719
SerCys: 0.64 ± 0.178
SerAsp: 2.916 ± 0.444
SerGlu: 3.13 ± 0.507
SerPhe: 2.418 ± 0.387
SerGly: 6.33 ± 0.895
SerHis: 0.925 ± 0.264
SerIle: 2.774 ± 0.612
SerLys: 3.699 ± 0.568
SerLeu: 4.552 ± 0.595
SerMet: 1.351 ± 0.312
SerAsn: 2.987 ± 0.635
SerPro: 1.992 ± 0.333
SerGln: 1.636 ± 0.376
SerArg: 2.703 ± 0.402
SerSer: 3.059 ± 0.698
SerThr: 3.841 ± 0.683
SerVal: 4.695 ± 0.476
SerTrp: 0.427 ± 0.124
SerTyr: 1.92 ± 0.377
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.966 ± 0.995
ThrCys: 0.64 ± 0.214
ThrAsp: 4.766 ± 0.564
ThrGlu: 3.556 ± 0.483
ThrPhe: 2.774 ± 0.565
ThrGly: 5.619 ± 0.941
ThrHis: 0.711 ± 0.219
ThrIle: 3.841 ± 0.414
ThrLys: 3.841 ± 0.492
ThrLeu: 4.552 ± 0.663
ThrMet: 1.351 ± 0.354
ThrAsn: 1.778 ± 0.509
ThrPro: 3.699 ± 0.563
ThrGln: 2.134 ± 0.519
ThrArg: 2.49 ± 0.317
ThrSer: 3.485 ± 0.567
ThrThr: 3.841 ± 0.585
ThrVal: 4.41 ± 0.637
ThrTrp: 0.569 ± 0.219
ThrTyr: 2.418 ± 0.475
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.828 ± 0.566
ValCys: 0.498 ± 0.16
ValAsp: 4.125 ± 0.558
ValGlu: 5.406 ± 0.747
ValPhe: 2.347 ± 0.421
ValGly: 5.121 ± 0.459
ValHis: 0.996 ± 0.261
ValIle: 4.695 ± 0.604
ValLys: 3.628 ± 0.539
ValLeu: 6.188 ± 0.639
ValMet: 1.351 ± 0.276
ValAsn: 3.699 ± 0.651
ValPro: 1.92 ± 0.386
ValGln: 2.632 ± 0.447
ValArg: 4.268 ± 0.61
ValSer: 5.05 ± 0.691
ValThr: 6.046 ± 0.768
ValVal: 6.046 ± 0.861
ValTrp: 1.138 ± 0.302
ValTyr: 2.632 ± 0.497
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.209 ± 0.411
TrpCys: 0.356 ± 0.151
TrpAsp: 0.854 ± 0.187
TrpGlu: 0.711 ± 0.176
TrpPhe: 1.138 ± 0.335
TrpGly: 0.854 ± 0.243
TrpHis: 0.498 ± 0.211
TrpIle: 0.427 ± 0.171
TrpLys: 0.711 ± 0.174
TrpLeu: 1.636 ± 0.266
TrpMet: 0.498 ± 0.182
TrpAsn: 0.498 ± 0.195
TrpPro: 0.285 ± 0.162
TrpGln: 0.64 ± 0.226
TrpArg: 0.996 ± 0.313
TrpSer: 0.854 ± 0.273
TrpThr: 0.427 ± 0.165
TrpVal: 0.925 ± 0.311
TrpTrp: 0.213 ± 0.13
TrpTyr: 0.498 ± 0.195
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.059 ± 0.476
TyrCys: 0.356 ± 0.154
TyrAsp: 2.561 ± 0.701
TyrGlu: 2.418 ± 0.421
TyrPhe: 1.423 ± 0.367
TyrGly: 2.916 ± 0.438
TyrHis: 0.711 ± 0.211
TyrIle: 1.992 ± 0.335
TyrLys: 2.063 ± 0.328
TyrLeu: 2.49 ± 0.38
TyrMet: 0.854 ± 0.2
TyrAsn: 1.28 ± 0.254
TyrPro: 1.067 ± 0.387
TyrGln: 1.707 ± 0.3
TyrArg: 2.205 ± 0.5
TyrSer: 2.632 ± 0.389
TyrThr: 2.205 ± 0.374
TyrVal: 1.849 ± 0.357
TyrTrp: 0.213 ± 0.113
TyrTyr: 1.423 ± 0.37
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (14060 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
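Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the estimates serves as the error. This is an illustrative sketch only. The function names, the fixed seed, and the use of the sample standard deviation as the error measure are assumptions; the exact procedure behind the table is not described here.

```python
import random
from statistics import stdev


def dipeptide_freq(sequences, dipeptide):
    """Frequency (per mille) of one dipeptide across the given sequences."""
    hits = total = 0
    for seq in sequences:
        # Overlapping pairs inside one protein only.
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return stdev(estimates)


# Made-up sequences for illustration, not taken from the phage proteome.
toy = ["MKAAL", "AAGK", "MGLSAA", "MKT"]
print(dipeptide_freq(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, so the error reflects variation between proteins.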

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski