Amino acid dipepetide frequency for Escherichia phage Lambda_ev207

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.965AlaAla: 12.965 ± 2.713
0.645AlaCys: 0.645 ± 0.253
4.226AlaAsp: 4.226 ± 0.531
6.948AlaGlu: 6.948 ± 0.739
3.51AlaPhe: 3.51 ± 0.446
6.948AlaGly: 6.948 ± 0.69
1.433AlaHis: 1.433 ± 0.333
5.515AlaIle: 5.515 ± 0.571
4.799AlaLys: 4.799 ± 1.267
7.807AlaLeu: 7.807 ± 0.787
2.937AlaMet: 2.937 ± 0.425
3.438AlaAsn: 3.438 ± 0.475
2.507AlaPro: 2.507 ± 0.597
4.513AlaGln: 4.513 ± 0.782
6.447AlaArg: 6.447 ± 0.876
7.091AlaSer: 7.091 ± 0.989
5.444AlaThr: 5.444 ± 0.978
6.447AlaVal: 6.447 ± 0.958
1.862AlaTrp: 1.862 ± 0.437
2.793AlaTyr: 2.793 ± 0.414
0.0AlaXaa: 0.0 ± 0.0
Cys
1.146CysAla: 1.146 ± 0.344
0.501CysCys: 0.501 ± 0.219
0.86CysAsp: 0.86 ± 0.221
0.788CysGlu: 0.788 ± 0.277
0.143CysPhe: 0.143 ± 0.093
0.86CysGly: 0.86 ± 0.331
0.358CysHis: 0.358 ± 0.152
0.931CysIle: 0.931 ± 0.241
0.716CysLys: 0.716 ± 0.253
0.788CysLeu: 0.788 ± 0.227
0.358CysMet: 0.358 ± 0.18
0.645CysAsn: 0.645 ± 0.189
0.143CysPro: 0.143 ± 0.096
0.287CysGln: 0.287 ± 0.146
1.003CysArg: 1.003 ± 0.319
1.146CysSer: 1.146 ± 0.381
0.86CysThr: 0.86 ± 0.245
0.788CysVal: 0.788 ± 0.228
0.215CysTrp: 0.215 ± 0.126
0.358CysTyr: 0.358 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
5.945AspAla: 5.945 ± 0.687
0.645AspCys: 0.645 ± 0.18
4.584AspAsp: 4.584 ± 0.54
3.868AspGlu: 3.868 ± 0.635
1.647AspPhe: 1.647 ± 0.272
5.3AspGly: 5.3 ± 0.624
0.501AspHis: 0.501 ± 0.188
4.011AspIle: 4.011 ± 0.527
3.223AspLys: 3.223 ± 0.461
3.796AspLeu: 3.796 ± 0.529
1.934AspMet: 1.934 ± 0.362
1.862AspAsn: 1.862 ± 0.414
2.507AspPro: 2.507 ± 0.731
1.433AspGln: 1.433 ± 0.282
2.937AspArg: 2.937 ± 0.459
3.295AspSer: 3.295 ± 0.452
3.223AspThr: 3.223 ± 0.407
4.298AspVal: 4.298 ± 0.697
1.146AspTrp: 1.146 ± 0.348
2.006AspTyr: 2.006 ± 0.364
0.0AspXaa: 0.0 ± 0.0
Glu
5.372GluAla: 5.372 ± 0.749
1.003GluCys: 1.003 ± 0.42
2.937GluAsp: 2.937 ± 0.459
3.94GluGlu: 3.94 ± 0.546
2.22GluPhe: 2.22 ± 0.547
3.438GluGly: 3.438 ± 0.499
1.003GluHis: 1.003 ± 0.305
3.223GluIle: 3.223 ± 0.451
3.51GluLys: 3.51 ± 0.474
5.73GluLeu: 5.73 ± 0.711
1.433GluMet: 1.433 ± 0.312
2.292GluAsn: 2.292 ± 0.356
1.934GluPro: 1.934 ± 0.328
3.868GluGln: 3.868 ± 0.747
3.94GluArg: 3.94 ± 0.689
3.653GluSer: 3.653 ± 0.441
4.727GluThr: 4.727 ± 0.717
3.223GluVal: 3.223 ± 0.559
1.218GluTrp: 1.218 ± 0.31
1.934GluTyr: 1.934 ± 0.378
0.0GluXaa: 0.0 ± 0.0
Phe
2.077PheAla: 2.077 ± 0.446
0.788PheCys: 0.788 ± 0.225
2.722PheAsp: 2.722 ± 0.426
2.077PheGlu: 2.077 ± 0.383
1.433PhePhe: 1.433 ± 0.467
3.08PheGly: 3.08 ± 0.518
1.074PheHis: 1.074 ± 0.267
1.504PheIle: 1.504 ± 0.313
2.149PheLys: 2.149 ± 0.307
2.65PheLeu: 2.65 ± 0.579
0.788PheMet: 0.788 ± 0.233
1.146PheAsn: 1.146 ± 0.274
1.289PhePro: 1.289 ± 0.269
0.716PheGln: 0.716 ± 0.213
2.507PheArg: 2.507 ± 0.424
3.223PheSer: 3.223 ± 0.43
3.008PheThr: 3.008 ± 0.433
2.865PheVal: 2.865 ± 0.363
0.573PheTrp: 0.573 ± 0.179
0.716PheTyr: 0.716 ± 0.276
0.0PheXaa: 0.0 ± 0.0
Gly
6.017GlyAla: 6.017 ± 0.888
0.716GlyCys: 0.716 ± 0.207
4.871GlyAsp: 4.871 ± 0.469
3.51GlyGlu: 3.51 ± 0.558
2.65GlyPhe: 2.65 ± 0.41
5.3GlyGly: 5.3 ± 0.768
1.003GlyHis: 1.003 ± 0.289
4.083GlyIle: 4.083 ± 0.639
4.656GlyLys: 4.656 ± 0.526
6.017GlyLeu: 6.017 ± 0.789
3.008GlyMet: 3.008 ± 0.573
3.581GlyAsn: 3.581 ± 0.673
1.218GlyPro: 1.218 ± 0.23
3.295GlyGln: 3.295 ± 0.513
3.796GlyArg: 3.796 ± 0.344
4.369GlySer: 4.369 ± 0.549
3.581GlyThr: 3.581 ± 0.642
5.157GlyVal: 5.157 ± 0.486
1.504GlyTrp: 1.504 ± 0.288
2.435GlyTyr: 2.435 ± 0.407
0.0GlyXaa: 0.0 ± 0.0
His
1.218HisAla: 1.218 ± 0.297
0.358HisCys: 0.358 ± 0.149
0.86HisAsp: 0.86 ± 0.206
0.716HisGlu: 0.716 ± 0.231
0.86HisPhe: 0.86 ± 0.26
1.433HisGly: 1.433 ± 0.341
0.287HisHis: 0.287 ± 0.145
1.074HisIle: 1.074 ± 0.27
0.86HisLys: 0.86 ± 0.189
1.647HisLeu: 1.647 ± 0.368
0.501HisMet: 0.501 ± 0.204
1.146HisAsn: 1.146 ± 0.293
0.931HisPro: 0.931 ± 0.246
0.501HisGln: 0.501 ± 0.191
1.074HisArg: 1.074 ± 0.252
0.501HisSer: 0.501 ± 0.212
1.003HisThr: 1.003 ± 0.269
1.146HisVal: 1.146 ± 0.213
0.143HisTrp: 0.143 ± 0.099
1.218HisTyr: 1.218 ± 0.247
0.0HisXaa: 0.0 ± 0.0
Ile
5.587IleAla: 5.587 ± 0.721
0.931IleCys: 0.931 ± 0.215
3.438IleAsp: 3.438 ± 0.44
3.008IleGlu: 3.008 ± 0.571
1.146IlePhe: 1.146 ± 0.245
3.152IleGly: 3.152 ± 0.494
0.645IleHis: 0.645 ± 0.211
3.868IleIle: 3.868 ± 0.667
2.937IleLys: 2.937 ± 0.54
3.295IleLeu: 3.295 ± 0.506
0.86IleMet: 0.86 ± 0.232
2.793IleAsn: 2.793 ± 0.503
2.65IlePro: 2.65 ± 0.383
2.006IleGln: 2.006 ± 0.317
3.367IleArg: 3.367 ± 0.406
4.298IleSer: 4.298 ± 0.584
4.727IleThr: 4.727 ± 0.691
3.223IleVal: 3.223 ± 0.529
0.645IleTrp: 0.645 ± 0.276
1.289IleTyr: 1.289 ± 0.319
0.0IleXaa: 0.0 ± 0.0
Lys
5.515LysAla: 5.515 ± 0.852
0.645LysCys: 0.645 ± 0.298
2.937LysAsp: 2.937 ± 0.445
3.295LysGlu: 3.295 ± 0.548
1.433LysPhe: 1.433 ± 0.307
4.298LysGly: 4.298 ± 0.642
1.218LysHis: 1.218 ± 0.305
2.006LysIle: 2.006 ± 0.348
3.51LysLys: 3.51 ± 0.535
3.796LysLeu: 3.796 ± 0.527
1.647LysMet: 1.647 ± 0.426
2.65LysAsn: 2.65 ± 0.442
2.149LysPro: 2.149 ± 0.402
2.435LysGln: 2.435 ± 0.376
3.725LysArg: 3.725 ± 0.592
3.008LysSer: 3.008 ± 0.535
4.083LysThr: 4.083 ± 0.632
3.295LysVal: 3.295 ± 0.542
1.576LysTrp: 1.576 ± 0.342
2.149LysTyr: 2.149 ± 0.322
0.0LysXaa: 0.0 ± 0.0
Leu
8.166LeuAla: 8.166 ± 0.903
1.003LeuCys: 1.003 ± 0.264
4.369LeuAsp: 4.369 ± 0.508
3.796LeuGlu: 3.796 ± 0.447
2.793LeuPhe: 2.793 ± 0.498
4.942LeuGly: 4.942 ± 0.639
1.504LeuHis: 1.504 ± 0.386
4.083LeuIle: 4.083 ± 0.644
5.3LeuLys: 5.3 ± 0.721
6.876LeuLeu: 6.876 ± 0.971
1.862LeuMet: 1.862 ± 0.414
3.295LeuAsn: 3.295 ± 0.474
3.581LeuPro: 3.581 ± 0.416
3.725LeuGln: 3.725 ± 0.48
4.656LeuArg: 4.656 ± 0.632
6.16LeuSer: 6.16 ± 0.841
5.874LeuThr: 5.874 ± 0.637
4.369LeuVal: 4.369 ± 0.562
1.504LeuTrp: 1.504 ± 0.301
2.006LeuTyr: 2.006 ± 0.364
0.0LeuXaa: 0.0 ± 0.0
Met
2.865MetAla: 2.865 ± 0.475
0.215MetCys: 0.215 ± 0.12
1.504MetAsp: 1.504 ± 0.394
1.003MetGlu: 1.003 ± 0.327
1.361MetPhe: 1.361 ± 0.321
1.361MetGly: 1.361 ± 0.291
0.43MetHis: 0.43 ± 0.195
1.289MetIle: 1.289 ± 0.337
1.647MetLys: 1.647 ± 0.469
3.152MetLeu: 3.152 ± 0.424
0.645MetMet: 0.645 ± 0.214
1.218MetAsn: 1.218 ± 0.284
1.576MetPro: 1.576 ± 0.437
1.218MetGln: 1.218 ± 0.352
1.647MetArg: 1.647 ± 0.399
1.719MetSer: 1.719 ± 0.333
3.08MetThr: 3.08 ± 0.493
2.077MetVal: 2.077 ± 0.4
0.358MetTrp: 0.358 ± 0.143
0.501MetTyr: 0.501 ± 0.232
0.0MetXaa: 0.0 ± 0.0
Asn
4.369AsnAla: 4.369 ± 0.743
0.573AsnCys: 0.573 ± 0.185
2.364AsnAsp: 2.364 ± 0.366
2.435AsnGlu: 2.435 ± 0.427
1.647AsnPhe: 1.647 ± 0.402
3.152AsnGly: 3.152 ± 0.534
0.931AsnHis: 0.931 ± 0.235
2.435AsnIle: 2.435 ± 0.385
2.579AsnLys: 2.579 ± 0.503
2.149AsnLeu: 2.149 ± 0.363
1.433AsnMet: 1.433 ± 0.336
2.435AsnAsn: 2.435 ± 0.435
1.934AsnPro: 1.934 ± 0.307
0.931AsnGln: 0.931 ± 0.242
2.507AsnArg: 2.507 ± 0.546
2.722AsnSer: 2.722 ± 0.614
2.22AsnThr: 2.22 ± 0.388
2.077AsnVal: 2.077 ± 0.348
0.43AsnTrp: 0.43 ± 0.15
1.289AsnTyr: 1.289 ± 0.26
0.0AsnXaa: 0.0 ± 0.0
Pro
3.295ProAla: 3.295 ± 0.54
0.358ProCys: 0.358 ± 0.134
3.223ProAsp: 3.223 ± 0.543
2.722ProGlu: 2.722 ± 0.435
1.361ProPhe: 1.361 ± 0.333
2.865ProGly: 2.865 ± 0.43
0.716ProHis: 0.716 ± 0.243
1.576ProIle: 1.576 ± 0.346
1.576ProLys: 1.576 ± 0.417
2.435ProLeu: 2.435 ± 0.394
0.573ProMet: 0.573 ± 0.197
1.504ProAsn: 1.504 ± 0.329
1.289ProPro: 1.289 ± 0.364
1.361ProGln: 1.361 ± 0.246
1.504ProArg: 1.504 ± 0.332
2.65ProSer: 2.65 ± 0.438
2.22ProThr: 2.22 ± 0.378
3.581ProVal: 3.581 ± 0.491
0.716ProTrp: 0.716 ± 0.229
0.86ProTyr: 0.86 ± 0.234
0.0ProXaa: 0.0 ± 0.0
Gln
4.083GlnAla: 4.083 ± 0.758
0.788GlnCys: 0.788 ± 0.258
1.576GlnAsp: 1.576 ± 0.31
2.722GlnGlu: 2.722 ± 0.504
1.074GlnPhe: 1.074 ± 0.306
2.507GlnGly: 2.507 ± 0.368
0.86GlnHis: 0.86 ± 0.329
2.435GlnIle: 2.435 ± 0.372
2.149GlnLys: 2.149 ± 0.363
3.868GlnLeu: 3.868 ± 0.501
1.289GlnMet: 1.289 ± 0.338
1.791GlnAsn: 1.791 ± 0.41
1.289GlnPro: 1.289 ± 0.341
2.507GlnGln: 2.507 ± 0.611
3.152GlnArg: 3.152 ± 0.51
3.152GlnSer: 3.152 ± 0.51
2.865GlnThr: 2.865 ± 0.61
3.152GlnVal: 3.152 ± 0.46
0.501GlnTrp: 0.501 ± 0.184
1.433GlnTyr: 1.433 ± 0.302
0.0GlnXaa: 0.0 ± 0.0
Arg
4.656ArgAla: 4.656 ± 0.629
0.501ArgCys: 0.501 ± 0.214
3.581ArgAsp: 3.581 ± 0.606
4.298ArgGlu: 4.298 ± 0.652
2.579ArgPhe: 2.579 ± 0.407
3.51ArgGly: 3.51 ± 0.538
1.576ArgHis: 1.576 ± 0.327
4.154ArgIle: 4.154 ± 0.556
3.295ArgLys: 3.295 ± 0.547
5.802ArgLeu: 5.802 ± 0.599
2.364ArgMet: 2.364 ± 0.41
2.364ArgAsn: 2.364 ± 0.447
1.862ArgPro: 1.862 ± 0.34
3.295ArgGln: 3.295 ± 0.497
5.659ArgArg: 5.659 ± 1.007
2.22ArgSer: 2.22 ± 0.414
2.937ArgThr: 2.937 ± 0.524
3.152ArgVal: 3.152 ± 0.526
1.218ArgTrp: 1.218 ± 0.291
2.006ArgTyr: 2.006 ± 0.328
0.0ArgXaa: 0.0 ± 0.0
Ser
7.378SerAla: 7.378 ± 0.979
0.716SerCys: 0.716 ± 0.242
4.011SerAsp: 4.011 ± 0.518
4.942SerGlu: 4.942 ± 0.947
2.364SerPhe: 2.364 ± 0.374
6.876SerGly: 6.876 ± 0.884
1.074SerHis: 1.074 ± 0.327
2.865SerIle: 2.865 ± 0.454
2.722SerLys: 2.722 ± 0.529
4.369SerLeu: 4.369 ± 0.523
2.22SerMet: 2.22 ± 0.393
1.576SerAsn: 1.576 ± 0.301
2.292SerPro: 2.292 ± 0.375
3.725SerGln: 3.725 ± 0.681
4.513SerArg: 4.513 ± 0.62
3.438SerSer: 3.438 ± 0.491
3.725SerThr: 3.725 ± 0.537
4.942SerVal: 4.942 ± 0.604
0.86SerTrp: 0.86 ± 0.213
1.934SerTyr: 1.934 ± 0.363
0.0SerXaa: 0.0 ± 0.0
Thr
7.449ThrAla: 7.449 ± 0.971
0.788ThrCys: 0.788 ± 0.242
3.725ThrAsp: 3.725 ± 0.542
4.799ThrGlu: 4.799 ± 0.671
3.08ThrPhe: 3.08 ± 0.456
4.871ThrGly: 4.871 ± 0.613
1.146ThrHis: 1.146 ± 0.286
2.865ThrIle: 2.865 ± 0.516
2.865ThrLys: 2.865 ± 0.442
5.802ThrLeu: 5.802 ± 0.557
1.361ThrMet: 1.361 ± 0.375
1.719ThrAsn: 1.719 ± 0.631
3.152ThrPro: 3.152 ± 0.703
2.507ThrGln: 2.507 ± 0.414
2.65ThrArg: 2.65 ± 0.305
4.369ThrSer: 4.369 ± 0.579
3.438ThrThr: 3.438 ± 0.523
4.369ThrVal: 4.369 ± 0.859
1.218ThrTrp: 1.218 ± 0.272
2.722ThrTyr: 2.722 ± 0.504
0.0ThrXaa: 0.0 ± 0.0
Val
6.017ValAla: 6.017 ± 0.746
0.716ValCys: 0.716 ± 0.244
3.725ValAsp: 3.725 ± 0.401
3.223ValGlu: 3.223 ± 0.535
3.08ValPhe: 3.08 ± 0.423
3.438ValGly: 3.438 ± 0.555
0.573ValHis: 0.573 ± 0.162
3.223ValIle: 3.223 ± 0.438
4.513ValLys: 4.513 ± 0.599
5.372ValLeu: 5.372 ± 0.708
2.149ValMet: 2.149 ± 0.396
3.796ValAsn: 3.796 ± 0.442
2.292ValPro: 2.292 ± 0.403
2.364ValGln: 2.364 ± 0.495
2.65ValArg: 2.65 ± 0.438
5.659ValSer: 5.659 ± 0.754
5.444ValThr: 5.444 ± 0.751
4.513ValVal: 4.513 ± 0.49
1.146ValTrp: 1.146 ± 0.288
1.934ValTyr: 1.934 ± 0.384
0.0ValXaa: 0.0 ± 0.0
Trp
1.647TrpAla: 1.647 ± 0.291
0.43TrpCys: 0.43 ± 0.155
1.146TrpAsp: 1.146 ± 0.303
0.501TrpGlu: 0.501 ± 0.156
0.43TrpPhe: 0.43 ± 0.168
1.146TrpGly: 1.146 ± 0.305
0.43TrpHis: 0.43 ± 0.16
1.003TrpIle: 1.003 ± 0.3
1.074TrpLys: 1.074 ± 0.301
1.791TrpLeu: 1.791 ± 0.382
0.788TrpMet: 0.788 ± 0.229
0.645TrpAsn: 0.645 ± 0.182
0.645TrpPro: 0.645 ± 0.227
0.716TrpGln: 0.716 ± 0.244
1.074TrpArg: 1.074 ± 0.323
1.146TrpSer: 1.146 ± 0.24
1.003TrpThr: 1.003 ± 0.315
1.218TrpVal: 1.218 ± 0.368
0.287TrpTrp: 0.287 ± 0.155
0.645TrpTyr: 0.645 ± 0.198
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.507TyrAla: 2.507 ± 0.428
0.573TyrCys: 0.573 ± 0.18
1.504TyrAsp: 1.504 ± 0.292
2.006TyrGlu: 2.006 ± 0.43
1.576TyrPhe: 1.576 ± 0.313
2.292TyrGly: 2.292 ± 0.411
0.716TyrHis: 0.716 ± 0.277
1.719TyrIle: 1.719 ± 0.415
1.289TyrLys: 1.289 ± 0.306
2.722TyrLeu: 2.722 ± 0.451
0.645TyrMet: 0.645 ± 0.236
0.716TyrAsn: 0.716 ± 0.161
1.146TyrPro: 1.146 ± 0.288
1.791TyrGln: 1.791 ± 0.34
2.22TyrArg: 2.22 ± 0.45
2.722TyrSer: 2.722 ± 0.51
1.504TyrThr: 1.504 ± 0.292
1.934TyrVal: 1.934 ± 0.366
0.645TyrTrp: 0.645 ± 0.2
1.003TyrTyr: 1.003 ± 0.266
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (13962 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski