Amino acid dipepetide frequency for Escherichia phage Lambda_ev243

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.864AlaAla: 11.864 ± 1.569
0.884AlaCys: 0.884 ± 0.301
4.716AlaAsp: 4.716 ± 0.598
6.485AlaGlu: 6.485 ± 0.765
3.095AlaPhe: 3.095 ± 0.448
7.517AlaGly: 7.517 ± 0.945
1.326AlaHis: 1.326 ± 0.325
5.527AlaIle: 5.527 ± 0.656
4.569AlaLys: 4.569 ± 0.862
7.738AlaLeu: 7.738 ± 0.937
2.948AlaMet: 2.948 ± 0.518
2.874AlaAsn: 2.874 ± 0.463
2.284AlaPro: 2.284 ± 0.419
3.979AlaGln: 3.979 ± 0.752
6.264AlaArg: 6.264 ± 0.631
7.59AlaSer: 7.59 ± 0.798
4.937AlaThr: 4.937 ± 0.955
6.19AlaVal: 6.19 ± 0.886
1.99AlaTrp: 1.99 ± 0.396
2.727AlaTyr: 2.727 ± 0.42
0.0AlaXaa: 0.0 ± 0.0
Cys
0.958CysAla: 0.958 ± 0.3
0.516CysCys: 0.516 ± 0.241
0.516CysAsp: 0.516 ± 0.185
0.663CysGlu: 0.663 ± 0.22
0.295CysPhe: 0.295 ± 0.14
0.59CysGly: 0.59 ± 0.191
0.221CysHis: 0.221 ± 0.124
0.884CysIle: 0.884 ± 0.236
0.368CysLys: 0.368 ± 0.165
0.737CysLeu: 0.737 ± 0.213
0.295CysMet: 0.295 ± 0.136
0.442CysAsn: 0.442 ± 0.159
0.442CysPro: 0.442 ± 0.197
0.295CysGln: 0.295 ± 0.141
1.032CysArg: 1.032 ± 0.28
1.032CysSer: 1.032 ± 0.346
0.811CysThr: 0.811 ± 0.246
0.958CysVal: 0.958 ± 0.263
0.221CysTrp: 0.221 ± 0.106
0.442CysTyr: 0.442 ± 0.172
0.0CysXaa: 0.0 ± 0.0
Asp
5.158AspAla: 5.158 ± 0.722
0.516AspCys: 0.516 ± 0.158
4.643AspAsp: 4.643 ± 0.646
3.832AspGlu: 3.832 ± 0.667
2.063AspPhe: 2.063 ± 0.323
5.822AspGly: 5.822 ± 0.621
0.442AspHis: 0.442 ± 0.161
4.495AspIle: 4.495 ± 0.799
3.242AspLys: 3.242 ± 0.56
3.832AspLeu: 3.832 ± 0.609
1.695AspMet: 1.695 ± 0.339
2.284AspAsn: 2.284 ± 0.372
2.432AspPro: 2.432 ± 0.657
1.621AspGln: 1.621 ± 0.381
2.358AspArg: 2.358 ± 0.434
3.537AspSer: 3.537 ± 0.474
3.464AspThr: 3.464 ± 0.587
3.758AspVal: 3.758 ± 0.47
1.4AspTrp: 1.4 ± 0.404
1.916AspTyr: 1.916 ± 0.372
0.0AspXaa: 0.0 ± 0.0
Glu
5.822GluAla: 5.822 ± 0.614
0.811GluCys: 0.811 ± 0.227
2.8GluAsp: 2.8 ± 0.414
4.127GluGlu: 4.127 ± 0.604
2.137GluPhe: 2.137 ± 0.405
3.316GluGly: 3.316 ± 0.429
1.474GluHis: 1.474 ± 0.387
3.758GluIle: 3.758 ± 0.435
3.832GluLys: 3.832 ± 0.62
5.232GluLeu: 5.232 ± 0.702
1.916GluMet: 1.916 ± 0.403
2.432GluAsn: 2.432 ± 0.425
2.211GluPro: 2.211 ± 0.365
3.979GluGln: 3.979 ± 0.725
3.758GluArg: 3.758 ± 0.648
3.537GluSer: 3.537 ± 0.501
3.906GluThr: 3.906 ± 0.663
3.095GluVal: 3.095 ± 0.573
1.105GluTrp: 1.105 ± 0.351
1.916GluTyr: 1.916 ± 0.358
0.0GluXaa: 0.0 ± 0.0
Phe
1.769PheAla: 1.769 ± 0.384
0.663PheCys: 0.663 ± 0.202
2.948PheAsp: 2.948 ± 0.531
2.211PheGlu: 2.211 ± 0.396
1.253PhePhe: 1.253 ± 0.288
3.095PheGly: 3.095 ± 0.668
0.811PheHis: 0.811 ± 0.245
1.179PheIle: 1.179 ± 0.288
1.916PheLys: 1.916 ± 0.366
2.358PheLeu: 2.358 ± 0.442
1.105PheMet: 1.105 ± 0.28
1.4PheAsn: 1.4 ± 0.242
2.063PhePro: 2.063 ± 0.333
0.663PheGln: 0.663 ± 0.183
2.948PheArg: 2.948 ± 0.466
3.169PheSer: 3.169 ± 0.551
3.242PheThr: 3.242 ± 0.449
2.727PheVal: 2.727 ± 0.355
0.663PheTrp: 0.663 ± 0.186
0.958PheTyr: 0.958 ± 0.279
0.0PheXaa: 0.0 ± 0.0
Gly
5.969GlyAla: 5.969 ± 0.78
0.59GlyCys: 0.59 ± 0.203
5.232GlyAsp: 5.232 ± 0.505
3.979GlyGlu: 3.979 ± 0.55
3.021GlyPhe: 3.021 ± 0.551
5.38GlyGly: 5.38 ± 0.967
0.59GlyHis: 0.59 ± 0.192
3.685GlyIle: 3.685 ± 0.459
4.2GlyLys: 4.2 ± 0.636
5.969GlyLeu: 5.969 ± 0.613
2.874GlyMet: 2.874 ± 0.542
3.832GlyAsn: 3.832 ± 0.576
1.032GlyPro: 1.032 ± 0.22
3.611GlyGln: 3.611 ± 0.564
3.906GlyArg: 3.906 ± 0.435
4.274GlySer: 4.274 ± 0.602
4.348GlyThr: 4.348 ± 0.691
5.748GlyVal: 5.748 ± 0.573
1.621GlyTrp: 1.621 ± 0.319
2.506GlyTyr: 2.506 ± 0.427
0.0GlyXaa: 0.0 ± 0.0
His
1.105HisAla: 1.105 ± 0.282
0.295HisCys: 0.295 ± 0.137
1.105HisAsp: 1.105 ± 0.29
0.442HisGlu: 0.442 ± 0.186
0.958HisPhe: 0.958 ± 0.248
1.105HisGly: 1.105 ± 0.351
0.516HisHis: 0.516 ± 0.177
1.105HisIle: 1.105 ± 0.297
1.032HisLys: 1.032 ± 0.264
1.916HisLeu: 1.916 ± 0.449
0.221HisMet: 0.221 ± 0.113
1.105HisAsn: 1.105 ± 0.255
0.59HisPro: 0.59 ± 0.207
0.516HisGln: 0.516 ± 0.195
0.958HisArg: 0.958 ± 0.255
0.737HisSer: 0.737 ± 0.291
1.032HisThr: 1.032 ± 0.262
1.105HisVal: 1.105 ± 0.261
0.147HisTrp: 0.147 ± 0.096
1.105HisTyr: 1.105 ± 0.241
0.0HisXaa: 0.0 ± 0.0
Ile
4.864IleAla: 4.864 ± 0.692
1.179IleCys: 1.179 ± 0.325
2.579IleAsp: 2.579 ± 0.518
4.127IleGlu: 4.127 ± 0.646
1.474IlePhe: 1.474 ± 0.328
3.758IleGly: 3.758 ± 0.545
0.811IleHis: 0.811 ± 0.241
2.948IleIle: 2.948 ± 0.465
3.095IleLys: 3.095 ± 0.506
3.021IleLeu: 3.021 ± 0.466
0.811IleMet: 0.811 ± 0.252
3.611IleAsn: 3.611 ± 0.501
2.284IlePro: 2.284 ± 0.43
2.063IleGln: 2.063 ± 0.336
3.39IleArg: 3.39 ± 0.454
4.348IleSer: 4.348 ± 0.484
4.053IleThr: 4.053 ± 0.783
2.8IleVal: 2.8 ± 0.419
0.663IleTrp: 0.663 ± 0.283
1.621IleTyr: 1.621 ± 0.466
0.0IleXaa: 0.0 ± 0.0
Lys
5.674LysAla: 5.674 ± 0.841
0.368LysCys: 0.368 ± 0.184
3.095LysAsp: 3.095 ± 0.653
2.579LysGlu: 2.579 ± 0.399
1.474LysPhe: 1.474 ± 0.303
4.053LysGly: 4.053 ± 0.685
1.474LysHis: 1.474 ± 0.376
2.653LysIle: 2.653 ± 0.472
3.758LysLys: 3.758 ± 0.645
3.242LysLeu: 3.242 ± 0.583
1.548LysMet: 1.548 ± 0.368
2.727LysAsn: 2.727 ± 0.436
2.358LysPro: 2.358 ± 0.436
2.358LysGln: 2.358 ± 0.433
4.053LysArg: 4.053 ± 0.652
3.537LysSer: 3.537 ± 0.519
3.906LysThr: 3.906 ± 0.627
3.611LysVal: 3.611 ± 0.471
1.4LysTrp: 1.4 ± 0.304
1.548LysTyr: 1.548 ± 0.361
0.0LysXaa: 0.0 ± 0.0
Leu
8.475LeuAla: 8.475 ± 0.812
0.958LeuCys: 0.958 ± 0.244
3.979LeuAsp: 3.979 ± 0.464
3.832LeuGlu: 3.832 ± 0.542
2.432LeuPhe: 2.432 ± 0.427
4.716LeuGly: 4.716 ± 0.573
1.326LeuHis: 1.326 ± 0.366
3.242LeuIle: 3.242 ± 0.406
4.127LeuLys: 4.127 ± 0.5
5.969LeuLeu: 5.969 ± 0.696
2.137LeuMet: 2.137 ± 0.391
3.758LeuAsn: 3.758 ± 0.41
3.906LeuPro: 3.906 ± 0.487
2.727LeuGln: 2.727 ± 0.474
5.011LeuArg: 5.011 ± 0.508
5.601LeuSer: 5.601 ± 0.705
6.927LeuThr: 6.927 ± 0.693
3.169LeuVal: 3.169 ± 0.471
1.769LeuTrp: 1.769 ± 0.323
1.842LeuTyr: 1.842 ± 0.301
0.0LeuXaa: 0.0 ± 0.0
Met
2.948MetAla: 2.948 ± 0.595
0.147MetCys: 0.147 ± 0.111
1.695MetAsp: 1.695 ± 0.334
0.737MetGlu: 0.737 ± 0.244
1.4MetPhe: 1.4 ± 0.303
1.253MetGly: 1.253 ± 0.245
0.442MetHis: 0.442 ± 0.222
1.179MetIle: 1.179 ± 0.288
1.695MetLys: 1.695 ± 0.456
2.506MetLeu: 2.506 ± 0.409
0.368MetMet: 0.368 ± 0.151
1.032MetAsn: 1.032 ± 0.262
1.548MetPro: 1.548 ± 0.393
1.253MetGln: 1.253 ± 0.292
2.579MetArg: 2.579 ± 0.381
1.621MetSer: 1.621 ± 0.358
2.8MetThr: 2.8 ± 0.529
1.99MetVal: 1.99 ± 0.368
0.221MetTrp: 0.221 ± 0.112
0.516MetTyr: 0.516 ± 0.222
0.0MetXaa: 0.0 ± 0.0
Asn
4.643AsnAla: 4.643 ± 0.527
0.59AsnCys: 0.59 ± 0.256
2.063AsnAsp: 2.063 ± 0.441
3.095AsnGlu: 3.095 ± 0.405
1.916AsnPhe: 1.916 ± 0.422
3.464AsnGly: 3.464 ± 0.575
1.105AsnHis: 1.105 ± 0.301
2.358AsnIle: 2.358 ± 0.442
3.095AsnLys: 3.095 ± 0.507
2.432AsnLeu: 2.432 ± 0.31
1.326AsnMet: 1.326 ± 0.31
2.432AsnAsn: 2.432 ± 0.493
2.137AsnPro: 2.137 ± 0.428
0.958AsnGln: 0.958 ± 0.241
2.8AsnArg: 2.8 ± 0.563
2.432AsnSer: 2.432 ± 0.436
2.653AsnThr: 2.653 ± 0.416
2.137AsnVal: 2.137 ± 0.376
0.663AsnTrp: 0.663 ± 0.172
1.4AsnTyr: 1.4 ± 0.375
0.0AsnXaa: 0.0 ± 0.0
Pro
3.021ProAla: 3.021 ± 0.568
0.221ProCys: 0.221 ± 0.129
3.464ProAsp: 3.464 ± 0.507
2.8ProGlu: 2.8 ± 0.524
1.179ProPhe: 1.179 ± 0.32
3.39ProGly: 3.39 ± 0.546
0.737ProHis: 0.737 ± 0.23
1.695ProIle: 1.695 ± 0.36
1.621ProLys: 1.621 ± 0.347
2.948ProLeu: 2.948 ± 0.523
0.59ProMet: 0.59 ± 0.209
1.695ProAsn: 1.695 ± 0.365
1.179ProPro: 1.179 ± 0.353
1.474ProGln: 1.474 ± 0.242
1.4ProArg: 1.4 ± 0.352
2.727ProSer: 2.727 ± 0.571
2.063ProThr: 2.063 ± 0.407
3.242ProVal: 3.242 ± 0.434
0.811ProTrp: 0.811 ± 0.24
1.179ProTyr: 1.179 ± 0.348
0.0ProXaa: 0.0 ± 0.0
Gln
4.127GlnAla: 4.127 ± 0.951
0.442GlnCys: 0.442 ± 0.177
1.179GlnAsp: 1.179 ± 0.294
3.169GlnGlu: 3.169 ± 0.452
1.621GlnPhe: 1.621 ± 0.369
1.99GlnGly: 1.99 ± 0.388
0.442GlnHis: 0.442 ± 0.185
2.579GlnIle: 2.579 ± 0.357
2.063GlnLys: 2.063 ± 0.352
3.685GlnLeu: 3.685 ± 0.45
1.474GlnMet: 1.474 ± 0.306
2.284GlnAsn: 2.284 ± 0.46
1.179GlnPro: 1.179 ± 0.277
2.432GlnGln: 2.432 ± 0.492
2.432GlnArg: 2.432 ± 0.416
3.169GlnSer: 3.169 ± 0.46
2.653GlnThr: 2.653 ± 0.513
3.021GlnVal: 3.021 ± 0.464
0.737GlnTrp: 0.737 ± 0.213
1.253GlnTyr: 1.253 ± 0.264
0.0GlnXaa: 0.0 ± 0.0
Arg
4.348ArgAla: 4.348 ± 0.528
0.295ArgCys: 0.295 ± 0.152
3.464ArgAsp: 3.464 ± 0.646
4.127ArgGlu: 4.127 ± 0.686
2.358ArgPhe: 2.358 ± 0.356
3.611ArgGly: 3.611 ± 0.673
1.621ArgHis: 1.621 ± 0.367
4.2ArgIle: 4.2 ± 0.544
3.537ArgLys: 3.537 ± 0.6
5.085ArgLeu: 5.085 ± 0.623
2.284ArgMet: 2.284 ± 0.401
2.432ArgAsn: 2.432 ± 0.468
1.621ArgPro: 1.621 ± 0.305
3.39ArgGln: 3.39 ± 0.551
5.453ArgArg: 5.453 ± 0.879
3.537ArgSer: 3.537 ± 0.555
3.169ArgThr: 3.169 ± 0.474
3.611ArgVal: 3.611 ± 0.655
1.621ArgTrp: 1.621 ± 0.377
2.063ArgTyr: 2.063 ± 0.455
0.0ArgXaa: 0.0 ± 0.0
Ser
7.222SerAla: 7.222 ± 0.688
0.737SerCys: 0.737 ± 0.27
4.495SerAsp: 4.495 ± 0.571
4.79SerGlu: 4.79 ± 0.808
2.506SerPhe: 2.506 ± 0.447
7.001SerGly: 7.001 ± 0.686
0.884SerHis: 0.884 ± 0.228
2.874SerIle: 2.874 ± 0.368
3.169SerLys: 3.169 ± 0.456
4.348SerLeu: 4.348 ± 0.677
2.063SerMet: 2.063 ± 0.452
2.432SerAsn: 2.432 ± 0.359
2.284SerPro: 2.284 ± 0.406
3.095SerGln: 3.095 ± 0.465
4.127SerArg: 4.127 ± 0.576
3.464SerSer: 3.464 ± 0.467
3.979SerThr: 3.979 ± 0.571
5.38SerVal: 5.38 ± 0.672
1.179SerTrp: 1.179 ± 0.313
1.695SerTyr: 1.695 ± 0.366
0.0SerXaa: 0.0 ± 0.0
Thr
7.369ThrAla: 7.369 ± 0.924
0.737ThrCys: 0.737 ± 0.199
3.906ThrAsp: 3.906 ± 0.427
3.979ThrGlu: 3.979 ± 0.483
2.874ThrPhe: 2.874 ± 0.473
5.011ThrGly: 5.011 ± 0.706
1.179ThrHis: 1.179 ± 0.309
3.021ThrIle: 3.021 ± 0.479
3.832ThrLys: 3.832 ± 0.543
5.38ThrLeu: 5.38 ± 0.699
1.105ThrMet: 1.105 ± 0.313
1.621ThrAsn: 1.621 ± 0.329
4.053ThrPro: 4.053 ± 0.716
2.653ThrGln: 2.653 ± 0.562
3.095ThrArg: 3.095 ± 0.356
4.495ThrSer: 4.495 ± 0.649
3.758ThrThr: 3.758 ± 0.556
4.348ThrVal: 4.348 ± 0.996
1.4ThrTrp: 1.4 ± 0.258
2.284ThrTyr: 2.284 ± 0.526
0.0ThrXaa: 0.0 ± 0.0
Val
5.527ValAla: 5.527 ± 0.747
0.59ValCys: 0.59 ± 0.234
3.758ValAsp: 3.758 ± 0.496
3.464ValGlu: 3.464 ± 0.503
3.095ValPhe: 3.095 ± 0.559
3.685ValGly: 3.685 ± 0.514
0.737ValHis: 0.737 ± 0.218
3.242ValIle: 3.242 ± 0.481
3.979ValLys: 3.979 ± 0.615
5.158ValLeu: 5.158 ± 0.567
1.916ValMet: 1.916 ± 0.333
3.537ValAsn: 3.537 ± 0.49
1.99ValPro: 1.99 ± 0.439
2.653ValGln: 2.653 ± 0.591
3.021ValArg: 3.021 ± 0.537
4.79ValSer: 4.79 ± 0.858
5.38ValThr: 5.38 ± 0.685
4.79ValVal: 4.79 ± 0.572
1.032ValTrp: 1.032 ± 0.284
2.358ValTyr: 2.358 ± 0.484
0.0ValXaa: 0.0 ± 0.0
Trp
1.621TrpAla: 1.621 ± 0.263
0.442TrpCys: 0.442 ± 0.145
1.032TrpAsp: 1.032 ± 0.292
0.811TrpGlu: 0.811 ± 0.212
0.663TrpPhe: 0.663 ± 0.24
1.4TrpGly: 1.4 ± 0.248
0.59TrpHis: 0.59 ± 0.243
0.958TrpIle: 0.958 ± 0.316
1.105TrpLys: 1.105 ± 0.307
1.916TrpLeu: 1.916 ± 0.553
0.59TrpMet: 0.59 ± 0.186
0.737TrpAsn: 0.737 ± 0.201
0.884TrpPro: 0.884 ± 0.259
0.811TrpGln: 0.811 ± 0.223
0.958TrpArg: 0.958 ± 0.262
0.737TrpSer: 0.737 ± 0.241
1.4TrpThr: 1.4 ± 0.359
1.4TrpVal: 1.4 ± 0.391
0.295TrpTrp: 0.295 ± 0.195
0.884TrpTyr: 0.884 ± 0.269
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.021TyrAla: 3.021 ± 0.448
0.737TyrCys: 0.737 ± 0.237
1.842TyrAsp: 1.842 ± 0.255
1.916TyrGlu: 1.916 ± 0.473
1.326TyrPhe: 1.326 ± 0.322
2.284TyrGly: 2.284 ± 0.431
0.368TyrHis: 0.368 ± 0.161
1.916TyrIle: 1.916 ± 0.364
1.179TyrLys: 1.179 ± 0.283
2.432TyrLeu: 2.432 ± 0.447
0.59TyrMet: 0.59 ± 0.27
1.032TyrAsn: 1.032 ± 0.187
0.958TyrPro: 0.958 ± 0.295
1.4TyrGln: 1.4 ± 0.339
2.284TyrArg: 2.284 ± 0.399
3.39TyrSer: 3.39 ± 0.632
1.621TyrThr: 1.621 ± 0.279
1.621TyrVal: 1.621 ± 0.356
0.295TyrTrp: 0.295 ± 0.126
0.884TyrTyr: 0.884 ± 0.237
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (13571 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski