Amino acid dipepetide frequency for Escherichia phage phi G17

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.886AlaAla: 7.886 ± 1.319
0.73AlaCys: 0.73 ± 0.252
4.673AlaAsp: 4.673 ± 0.487
5.549AlaGlu: 5.549 ± 0.478
2.336AlaPhe: 2.336 ± 0.314
6.085AlaGly: 6.085 ± 0.635
1.022AlaHis: 1.022 ± 0.201
4.673AlaIle: 4.673 ± 0.484
5.89AlaLys: 5.89 ± 0.69
7.009AlaLeu: 7.009 ± 1.036
3.115AlaMet: 3.115 ± 0.626
5.062AlaAsn: 5.062 ± 0.572
2.531AlaPro: 2.531 ± 0.557
4.478AlaGln: 4.478 ± 1.006
3.213AlaArg: 3.213 ± 0.405
5.257AlaSer: 5.257 ± 0.595
5.841AlaThr: 5.841 ± 0.573
5.354AlaVal: 5.354 ± 0.551
0.925AlaTrp: 0.925 ± 0.189
3.797AlaTyr: 3.797 ± 0.377
0.0AlaXaa: 0.0 ± 0.0
Cys
0.438CysAla: 0.438 ± 0.159
0.0CysCys: 0.0 ± 0.0
0.389CysAsp: 0.389 ± 0.166
0.633CysGlu: 0.633 ± 0.27
0.389CysPhe: 0.389 ± 0.154
0.438CysGly: 0.438 ± 0.164
0.243CysHis: 0.243 ± 0.106
0.681CysIle: 0.681 ± 0.196
0.681CysLys: 0.681 ± 0.189
0.73CysLeu: 0.73 ± 0.207
0.389CysMet: 0.389 ± 0.158
0.438CysAsn: 0.438 ± 0.154
0.487CysPro: 0.487 ± 0.198
0.097CysGln: 0.097 ± 0.08
0.243CysArg: 0.243 ± 0.096
0.535CysSer: 0.535 ± 0.159
0.584CysThr: 0.584 ± 0.226
0.827CysVal: 0.827 ± 0.275
0.146CysTrp: 0.146 ± 0.09
0.243CysTyr: 0.243 ± 0.122
0.0CysXaa: 0.0 ± 0.0
Asp
4.819AspAla: 4.819 ± 0.593
0.827AspCys: 0.827 ± 0.257
3.845AspAsp: 3.845 ± 0.41
3.602AspGlu: 3.602 ± 0.445
2.58AspPhe: 2.58 ± 0.323
3.505AspGly: 3.505 ± 0.38
0.827AspHis: 0.827 ± 0.244
4.381AspIle: 4.381 ± 0.572
3.164AspLys: 3.164 ± 0.333
4.916AspLeu: 4.916 ± 0.525
1.606AspMet: 1.606 ± 0.303
2.142AspAsn: 2.142 ± 0.226
2.677AspPro: 2.677 ± 0.379
1.801AspGln: 1.801 ± 0.39
2.58AspArg: 2.58 ± 0.35
4.089AspSer: 4.089 ± 0.307
4.089AspThr: 4.089 ± 0.49
3.31AspVal: 3.31 ± 0.4
0.925AspTrp: 0.925 ± 0.199
2.385AspTyr: 2.385 ± 0.455
0.0AspXaa: 0.0 ± 0.0
Glu
7.155GluAla: 7.155 ± 0.984
0.389GluCys: 0.389 ± 0.141
4.332GluAsp: 4.332 ± 0.652
5.987GluGlu: 5.987 ± 0.971
2.629GluPhe: 2.629 ± 0.397
3.991GluGly: 3.991 ± 0.438
0.827GluHis: 0.827 ± 0.184
3.456GluIle: 3.456 ± 0.396
3.31GluLys: 3.31 ± 0.509
5.744GluLeu: 5.744 ± 0.509
2.19GluMet: 2.19 ± 0.297
2.969GluAsn: 2.969 ± 0.51
3.505GluPro: 3.505 ± 0.478
2.921GluGln: 2.921 ± 0.456
1.898GluArg: 1.898 ± 0.289
3.602GluSer: 3.602 ± 0.342
3.31GluThr: 3.31 ± 0.442
4.722GluVal: 4.722 ± 0.578
0.974GluTrp: 0.974 ± 0.193
2.726GluTyr: 2.726 ± 0.333
0.0GluXaa: 0.0 ± 0.0
Phe
2.677PheAla: 2.677 ± 0.384
0.341PheCys: 0.341 ± 0.141
2.288PheAsp: 2.288 ± 0.346
1.898PheGlu: 1.898 ± 0.295
1.12PhePhe: 1.12 ± 0.269
2.872PheGly: 2.872 ± 0.314
0.535PheHis: 0.535 ± 0.137
2.823PheIle: 2.823 ± 0.572
1.898PheLys: 1.898 ± 0.314
2.726PheLeu: 2.726 ± 0.404
1.655PheMet: 1.655 ± 0.297
2.531PheAsn: 2.531 ± 0.435
1.168PhePro: 1.168 ± 0.237
1.509PheGln: 1.509 ± 0.279
1.898PheArg: 1.898 ± 0.291
2.19PheSer: 2.19 ± 0.297
2.775PheThr: 2.775 ± 0.335
2.142PheVal: 2.142 ± 0.355
0.146PheTrp: 0.146 ± 0.084
1.412PheTyr: 1.412 ± 0.259
0.0PheXaa: 0.0 ± 0.0
Gly
5.062GlyAla: 5.062 ± 0.563
0.633GlyCys: 0.633 ± 0.203
3.164GlyAsp: 3.164 ± 0.384
3.991GlyGlu: 3.991 ± 0.361
2.969GlyPhe: 2.969 ± 0.376
3.894GlyGly: 3.894 ± 0.469
0.974GlyHis: 0.974 ± 0.239
3.845GlyIle: 3.845 ± 0.477
5.354GlyLys: 5.354 ± 0.51
5.792GlyLeu: 5.792 ± 0.606
2.434GlyMet: 2.434 ± 0.313
4.77GlyAsn: 4.77 ± 0.529
0.827GlyPro: 0.827 ± 0.207
2.434GlyGln: 2.434 ± 0.385
2.58GlyArg: 2.58 ± 0.417
4.819GlySer: 4.819 ± 0.432
4.527GlyThr: 4.527 ± 0.377
4.722GlyVal: 4.722 ± 0.592
1.071GlyTrp: 1.071 ± 0.273
2.726GlyTyr: 2.726 ± 0.358
0.0GlyXaa: 0.0 ± 0.0
His
1.217HisAla: 1.217 ± 0.219
0.097HisCys: 0.097 ± 0.08
0.73HisAsp: 0.73 ± 0.242
1.12HisGlu: 1.12 ± 0.248
0.584HisPhe: 0.584 ± 0.223
0.633HisGly: 0.633 ± 0.186
0.389HisHis: 0.389 ± 0.188
0.876HisIle: 0.876 ± 0.22
1.46HisLys: 1.46 ± 0.311
1.898HisLeu: 1.898 ± 0.331
0.243HisMet: 0.243 ± 0.095
0.876HisAsn: 0.876 ± 0.176
0.681HisPro: 0.681 ± 0.2
0.633HisGln: 0.633 ± 0.188
0.633HisArg: 0.633 ± 0.164
1.46HisSer: 1.46 ± 0.242
0.925HisThr: 0.925 ± 0.223
0.876HisVal: 0.876 ± 0.191
0.389HisTrp: 0.389 ± 0.137
0.925HisTyr: 0.925 ± 0.212
0.0HisXaa: 0.0 ± 0.0
Ile
4.332IleAla: 4.332 ± 0.444
0.438IleCys: 0.438 ± 0.181
3.748IleAsp: 3.748 ± 0.555
3.748IleGlu: 3.748 ± 0.372
1.655IlePhe: 1.655 ± 0.346
3.505IleGly: 3.505 ± 0.462
0.876IleHis: 0.876 ± 0.23
2.677IleIle: 2.677 ± 0.514
3.894IleLys: 3.894 ± 0.456
3.748IleLeu: 3.748 ± 0.43
1.217IleMet: 1.217 ± 0.208
3.067IleAsn: 3.067 ± 0.385
2.872IlePro: 2.872 ± 0.454
2.629IleGln: 2.629 ± 0.337
2.872IleArg: 2.872 ± 0.333
2.531IleSer: 2.531 ± 0.34
5.208IleThr: 5.208 ± 0.489
3.164IleVal: 3.164 ± 0.391
0.389IleTrp: 0.389 ± 0.148
2.434IleTyr: 2.434 ± 0.446
0.0IleXaa: 0.0 ± 0.0
Lys
6.717LysAla: 6.717 ± 0.706
0.243LysCys: 0.243 ± 0.111
3.261LysAsp: 3.261 ± 0.379
4.478LysGlu: 4.478 ± 0.494
2.434LysPhe: 2.434 ± 0.308
3.943LysGly: 3.943 ± 0.511
1.314LysHis: 1.314 ± 0.28
2.726LysIle: 2.726 ± 0.356
3.505LysLys: 3.505 ± 0.555
6.766LysLeu: 6.766 ± 0.631
1.996LysMet: 1.996 ± 0.321
3.164LysAsn: 3.164 ± 0.441
2.872LysPro: 2.872 ± 0.453
2.872LysGln: 2.872 ± 0.404
2.726LysArg: 2.726 ± 0.379
3.748LysSer: 3.748 ± 0.551
3.845LysThr: 3.845 ± 0.412
3.991LysVal: 3.991 ± 0.438
0.779LysTrp: 0.779 ± 0.207
1.85LysTyr: 1.85 ± 0.354
0.0LysXaa: 0.0 ± 0.0
Leu
7.253LeuAla: 7.253 ± 0.639
0.73LeuCys: 0.73 ± 0.23
4.624LeuAsp: 4.624 ± 0.493
4.722LeuGlu: 4.722 ± 0.531
3.067LeuPhe: 3.067 ± 0.355
6.133LeuGly: 6.133 ± 0.481
1.606LeuHis: 1.606 ± 0.347
4.381LeuIle: 4.381 ± 0.499
5.16LeuLys: 5.16 ± 0.388
5.841LeuLeu: 5.841 ± 0.672
2.823LeuMet: 2.823 ± 0.411
5.306LeuAsn: 5.306 ± 0.512
4.186LeuPro: 4.186 ± 0.456
3.553LeuGln: 3.553 ± 0.423
3.699LeuArg: 3.699 ± 0.454
5.306LeuSer: 5.306 ± 0.472
5.744LeuThr: 5.744 ± 0.469
5.5LeuVal: 5.5 ± 0.57
0.925LeuTrp: 0.925 ± 0.203
2.482LeuTyr: 2.482 ± 0.319
0.0LeuXaa: 0.0 ± 0.0
Met
2.775MetAla: 2.775 ± 0.48
0.146MetCys: 0.146 ± 0.092
1.363MetAsp: 1.363 ± 0.252
2.19MetGlu: 2.19 ± 0.339
0.633MetPhe: 0.633 ± 0.194
2.093MetGly: 2.093 ± 0.266
0.341MetHis: 0.341 ± 0.103
1.801MetIle: 1.801 ± 0.302
2.239MetLys: 2.239 ± 0.31
2.629MetLeu: 2.629 ± 0.362
0.73MetMet: 0.73 ± 0.203
2.288MetAsn: 2.288 ± 0.3
1.266MetPro: 1.266 ± 0.27
1.655MetGln: 1.655 ± 0.309
1.217MetArg: 1.217 ± 0.207
2.823MetSer: 2.823 ± 0.406
1.996MetThr: 1.996 ± 0.396
1.947MetVal: 1.947 ± 0.394
0.243MetTrp: 0.243 ± 0.093
0.73MetTyr: 0.73 ± 0.165
0.0MetXaa: 0.0 ± 0.0
Asn
4.137AsnAla: 4.137 ± 0.622
0.438AsnCys: 0.438 ± 0.167
2.921AsnAsp: 2.921 ± 0.416
3.261AsnGlu: 3.261 ± 0.504
1.85AsnPhe: 1.85 ± 0.283
3.699AsnGly: 3.699 ± 0.478
1.558AsnHis: 1.558 ± 0.32
2.677AsnIle: 2.677 ± 0.374
3.505AsnLys: 3.505 ± 0.398
4.576AsnLeu: 4.576 ± 0.505
1.606AsnMet: 1.606 ± 0.249
3.213AsnAsn: 3.213 ± 0.387
3.164AsnPro: 3.164 ± 0.439
3.553AsnGln: 3.553 ± 0.473
3.407AsnArg: 3.407 ± 0.268
2.823AsnSer: 2.823 ± 0.392
3.359AsnThr: 3.359 ± 0.558
3.115AsnVal: 3.115 ± 0.462
0.681AsnTrp: 0.681 ± 0.196
2.239AsnTyr: 2.239 ± 0.48
0.0AsnXaa: 0.0 ± 0.0
Pro
3.31ProAla: 3.31 ± 0.534
0.097ProCys: 0.097 ± 0.07
2.823ProAsp: 2.823 ± 0.461
3.943ProGlu: 3.943 ± 0.475
1.85ProPhe: 1.85 ± 0.301
2.434ProGly: 2.434 ± 0.375
0.389ProHis: 0.389 ± 0.159
2.239ProIle: 2.239 ± 0.355
1.801ProLys: 1.801 ± 0.32
3.31ProLeu: 3.31 ± 0.337
1.168ProMet: 1.168 ± 0.193
2.044ProAsn: 2.044 ± 0.429
0.876ProPro: 0.876 ± 0.29
1.363ProGln: 1.363 ± 0.274
1.12ProArg: 1.12 ± 0.327
2.726ProSer: 2.726 ± 0.426
3.261ProThr: 3.261 ± 0.348
3.943ProVal: 3.943 ± 0.456
0.438ProTrp: 0.438 ± 0.168
1.655ProTyr: 1.655 ± 0.271
0.0ProXaa: 0.0 ± 0.0
Gln
5.062GlnAla: 5.062 ± 0.905
0.243GlnCys: 0.243 ± 0.111
2.434GlnAsp: 2.434 ± 0.413
3.213GlnGlu: 3.213 ± 0.321
1.509GlnPhe: 1.509 ± 0.234
3.213GlnGly: 3.213 ± 0.508
0.438GlnHis: 0.438 ± 0.129
2.142GlnIle: 2.142 ± 0.274
3.018GlnLys: 3.018 ± 0.478
3.407GlnLeu: 3.407 ± 0.475
1.168GlnMet: 1.168 ± 0.225
1.996GlnAsn: 1.996 ± 0.323
1.022GlnPro: 1.022 ± 0.272
1.85GlnGln: 1.85 ± 0.376
1.85GlnArg: 1.85 ± 0.355
2.872GlnSer: 2.872 ± 0.475
2.629GlnThr: 2.629 ± 0.36
2.969GlnVal: 2.969 ± 0.356
0.535GlnTrp: 0.535 ± 0.161
1.898GlnTyr: 1.898 ± 0.312
0.0GlnXaa: 0.0 ± 0.0
Arg
3.31ArgAla: 3.31 ± 0.551
0.438ArgCys: 0.438 ± 0.15
2.19ArgAsp: 2.19 ± 0.26
2.921ArgGlu: 2.921 ± 0.464
1.947ArgPhe: 1.947 ± 0.309
2.482ArgGly: 2.482 ± 0.319
0.487ArgHis: 0.487 ± 0.161
2.629ArgIle: 2.629 ± 0.38
3.067ArgLys: 3.067 ± 0.302
4.283ArgLeu: 4.283 ± 0.54
1.46ArgMet: 1.46 ± 0.259
3.213ArgAsn: 3.213 ± 0.464
1.558ArgPro: 1.558 ± 0.287
1.947ArgGln: 1.947 ± 0.35
1.655ArgArg: 1.655 ± 0.276
2.58ArgSer: 2.58 ± 0.368
2.385ArgThr: 2.385 ± 0.293
2.531ArgVal: 2.531 ± 0.397
0.584ArgTrp: 0.584 ± 0.183
1.655ArgTyr: 1.655 ± 0.268
0.0ArgXaa: 0.0 ± 0.0
Ser
4.576SerAla: 4.576 ± 0.493
0.73SerCys: 0.73 ± 0.211
3.651SerAsp: 3.651 ± 0.381
4.283SerGlu: 4.283 ± 0.521
1.996SerPhe: 1.996 ± 0.345
4.673SerGly: 4.673 ± 0.542
1.022SerHis: 1.022 ± 0.209
3.894SerIle: 3.894 ± 0.379
3.943SerLys: 3.943 ± 0.327
5.646SerLeu: 5.646 ± 0.53
2.093SerMet: 2.093 ± 0.325
2.58SerAsn: 2.58 ± 0.455
2.531SerPro: 2.531 ± 0.448
2.288SerGln: 2.288 ± 0.444
3.067SerArg: 3.067 ± 0.416
3.943SerSer: 3.943 ± 0.603
4.381SerThr: 4.381 ± 0.552
3.894SerVal: 3.894 ± 0.493
0.827SerTrp: 0.827 ± 0.245
2.239SerTyr: 2.239 ± 0.352
0.0SerXaa: 0.0 ± 0.0
Thr
4.527ThrAla: 4.527 ± 0.612
0.438ThrCys: 0.438 ± 0.174
3.602ThrAsp: 3.602 ± 0.338
3.894ThrGlu: 3.894 ± 0.591
2.921ThrPhe: 2.921 ± 0.305
4.819ThrGly: 4.819 ± 0.404
1.266ThrHis: 1.266 ± 0.432
3.943ThrIle: 3.943 ± 0.515
4.235ThrLys: 4.235 ± 0.429
5.744ThrLeu: 5.744 ± 0.493
1.314ThrMet: 1.314 ± 0.276
3.797ThrAsn: 3.797 ± 0.545
3.894ThrPro: 3.894 ± 0.383
2.482ThrGln: 2.482 ± 0.313
2.677ThrArg: 2.677 ± 0.331
3.797ThrSer: 3.797 ± 0.52
3.407ThrThr: 3.407 ± 0.552
4.819ThrVal: 4.819 ± 0.498
0.925ThrTrp: 0.925 ± 0.256
2.336ThrTyr: 2.336 ± 0.443
0.0ThrXaa: 0.0 ± 0.0
Val
6.182ValAla: 6.182 ± 0.747
0.779ValCys: 0.779 ± 0.27
4.137ValAsp: 4.137 ± 0.461
3.797ValGlu: 3.797 ± 0.461
2.093ValPhe: 2.093 ± 0.344
4.868ValGly: 4.868 ± 0.549
1.217ValHis: 1.217 ± 0.22
2.775ValIle: 2.775 ± 0.478
3.602ValLys: 3.602 ± 0.402
4.77ValLeu: 4.77 ± 0.414
2.531ValMet: 2.531 ± 0.339
3.797ValAsn: 3.797 ± 0.467
2.629ValPro: 2.629 ± 0.4
3.31ValGln: 3.31 ± 0.484
3.651ValArg: 3.651 ± 0.409
4.43ValSer: 4.43 ± 0.487
4.186ValThr: 4.186 ± 0.529
5.695ValVal: 5.695 ± 0.728
0.584ValTrp: 0.584 ± 0.178
2.969ValTyr: 2.969 ± 0.458
0.0ValXaa: 0.0 ± 0.0
Trp
1.217TrpAla: 1.217 ± 0.281
0.243TrpCys: 0.243 ± 0.104
1.168TrpAsp: 1.168 ± 0.312
0.633TrpGlu: 0.633 ± 0.228
0.389TrpPhe: 0.389 ± 0.15
0.779TrpGly: 0.779 ± 0.162
0.292TrpHis: 0.292 ± 0.135
0.535TrpIle: 0.535 ± 0.16
0.827TrpLys: 0.827 ± 0.212
1.071TrpLeu: 1.071 ± 0.222
0.243TrpMet: 0.243 ± 0.116
0.389TrpAsn: 0.389 ± 0.184
0.292TrpPro: 0.292 ± 0.114
0.535TrpGln: 0.535 ± 0.137
0.584TrpArg: 0.584 ± 0.138
0.584TrpSer: 0.584 ± 0.156
0.341TrpThr: 0.341 ± 0.108
1.363TrpVal: 1.363 ± 0.246
0.049TrpTrp: 0.049 ± 0.046
0.438TrpTyr: 0.438 ± 0.169
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.775TyrAla: 2.775 ± 0.334
0.633TyrCys: 0.633 ± 0.231
2.726TyrAsp: 2.726 ± 0.307
2.775TyrGlu: 2.775 ± 0.332
1.655TyrPhe: 1.655 ± 0.304
2.531TyrGly: 2.531 ± 0.347
1.022TyrHis: 1.022 ± 0.245
1.898TyrIle: 1.898 ± 0.404
2.677TyrLys: 2.677 ± 0.503
2.434TyrLeu: 2.434 ± 0.333
1.071TyrMet: 1.071 ± 0.239
2.142TyrAsn: 2.142 ± 0.322
1.752TyrPro: 1.752 ± 0.424
1.558TyrGln: 1.558 ± 0.235
1.752TyrArg: 1.752 ± 0.358
2.142TyrSer: 2.142 ± 0.365
2.044TyrThr: 2.044 ± 0.35
3.115TyrVal: 3.115 ± 0.486
0.438TyrTrp: 0.438 ± 0.149
1.46TyrTyr: 1.46 ± 0.31
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (20545 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski