Amino acid dipepetide frequency for Stx2-converting phage Stx2a_WGPS2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.777AlaAla: 10.777 ± 1.511
1.225AlaCys: 1.225 ± 0.281
5.94AlaAsp: 5.94 ± 0.696
7.654AlaGlu: 7.654 ± 1.077
3.429AlaPhe: 3.429 ± 0.502
8.879AlaGly: 8.879 ± 0.928
1.531AlaHis: 1.531 ± 0.337
4.348AlaIle: 4.348 ± 0.508
3.429AlaLys: 3.429 ± 0.418
8.879AlaLeu: 8.879 ± 0.983
2.633AlaMet: 2.633 ± 0.398
3.0AlaAsn: 3.0 ± 0.393
3.245AlaPro: 3.245 ± 0.53
3.98AlaGln: 3.98 ± 0.61
5.695AlaArg: 5.695 ± 0.864
6.368AlaSer: 6.368 ± 0.757
4.654AlaThr: 4.654 ± 0.739
7.164AlaVal: 7.164 ± 0.923
1.776AlaTrp: 1.776 ± 0.317
2.082AlaTyr: 2.082 ± 0.368
0.0AlaXaa: 0.0 ± 0.0
Cys
0.98CysAla: 0.98 ± 0.24
0.367CysCys: 0.367 ± 0.161
0.735CysAsp: 0.735 ± 0.228
0.918CysGlu: 0.918 ± 0.261
0.306CysPhe: 0.306 ± 0.141
1.225CysGly: 1.225 ± 0.317
0.306CysHis: 0.306 ± 0.142
0.184CysIle: 0.184 ± 0.11
0.184CysLys: 0.184 ± 0.1
1.041CysLeu: 1.041 ± 0.254
0.367CysMet: 0.367 ± 0.163
0.245CysAsn: 0.245 ± 0.085
0.367CysPro: 0.367 ± 0.152
0.796CysGln: 0.796 ± 0.219
1.225CysArg: 1.225 ± 0.282
0.98CysSer: 0.98 ± 0.232
0.918CysThr: 0.918 ± 0.226
0.918CysVal: 0.918 ± 0.263
0.429CysTrp: 0.429 ± 0.178
0.367CysTyr: 0.367 ± 0.154
0.0CysXaa: 0.0 ± 0.0
Asp
5.389AspAla: 5.389 ± 0.722
0.551AspCys: 0.551 ± 0.159
3.858AspAsp: 3.858 ± 0.527
3.184AspGlu: 3.184 ± 0.344
1.837AspPhe: 1.837 ± 0.302
5.327AspGly: 5.327 ± 0.606
0.674AspHis: 0.674 ± 0.197
3.245AspIle: 3.245 ± 0.656
2.511AspLys: 2.511 ± 0.427
3.796AspLeu: 3.796 ± 0.533
1.286AspMet: 1.286 ± 0.269
3.062AspAsn: 3.062 ± 0.488
2.266AspPro: 2.266 ± 0.452
1.47AspGln: 1.47 ± 0.317
2.511AspArg: 2.511 ± 0.463
2.266AspSer: 2.266 ± 0.336
2.694AspThr: 2.694 ± 0.445
4.225AspVal: 4.225 ± 0.539
0.98AspTrp: 0.98 ± 0.278
1.531AspTyr: 1.531 ± 0.34
0.0AspXaa: 0.0 ± 0.0
Glu
6.674GluAla: 6.674 ± 0.787
0.98GluCys: 0.98 ± 0.267
3.123GluAsp: 3.123 ± 0.369
3.796GluGlu: 3.796 ± 0.645
2.327GluPhe: 2.327 ± 0.36
3.552GluGly: 3.552 ± 0.45
1.408GluHis: 1.408 ± 0.373
3.429GluIle: 3.429 ± 0.438
4.286GluLys: 4.286 ± 0.44
6.736GluLeu: 6.736 ± 0.852
2.082GluMet: 2.082 ± 0.3
3.245GluAsn: 3.245 ± 0.515
2.694GluPro: 2.694 ± 0.445
3.796GluGln: 3.796 ± 0.477
3.919GluArg: 3.919 ± 0.628
4.409GluSer: 4.409 ± 0.619
3.368GluThr: 3.368 ± 0.454
4.103GluVal: 4.103 ± 0.617
0.918GluTrp: 0.918 ± 0.242
1.776GluTyr: 1.776 ± 0.335
0.0GluXaa: 0.0 ± 0.0
Phe
2.388PheAla: 2.388 ± 0.41
0.612PheCys: 0.612 ± 0.209
1.959PheAsp: 1.959 ± 0.401
1.959PheGlu: 1.959 ± 0.364
1.286PhePhe: 1.286 ± 0.301
2.511PheGly: 2.511 ± 0.403
0.612PheHis: 0.612 ± 0.201
1.776PheIle: 1.776 ± 0.307
1.653PheLys: 1.653 ± 0.445
1.715PheLeu: 1.715 ± 0.35
1.041PheMet: 1.041 ± 0.256
1.776PheAsn: 1.776 ± 0.402
0.98PhePro: 0.98 ± 0.225
0.857PheGln: 0.857 ± 0.187
2.511PheArg: 2.511 ± 0.382
3.368PheSer: 3.368 ± 0.462
1.715PheThr: 1.715 ± 0.367
2.449PheVal: 2.449 ± 0.397
0.612PheTrp: 0.612 ± 0.202
1.163PheTyr: 1.163 ± 0.276
0.0PheXaa: 0.0 ± 0.0
Gly
6.123GlyAla: 6.123 ± 0.731
0.551GlyCys: 0.551 ± 0.187
4.164GlyAsp: 4.164 ± 0.593
5.144GlyGlu: 5.144 ± 0.766
3.0GlyPhe: 3.0 ± 0.44
5.695GlyGly: 5.695 ± 0.645
1.347GlyHis: 1.347 ± 0.243
3.674GlyIle: 3.674 ± 0.434
4.899GlyLys: 4.899 ± 0.576
5.205GlyLeu: 5.205 ± 0.674
2.694GlyMet: 2.694 ± 0.367
3.552GlyAsn: 3.552 ± 0.436
2.572GlyPro: 2.572 ± 1.26
2.878GlyGln: 2.878 ± 0.527
4.837GlyArg: 4.837 ± 0.565
4.837GlySer: 4.837 ± 0.468
3.552GlyThr: 3.552 ± 0.474
5.572GlyVal: 5.572 ± 0.471
1.408GlyTrp: 1.408 ± 0.234
1.776GlyTyr: 1.776 ± 0.344
0.0GlyXaa: 0.0 ± 0.0
His
1.837HisAla: 1.837 ± 0.342
0.306HisCys: 0.306 ± 0.131
1.286HisAsp: 1.286 ± 0.278
1.102HisGlu: 1.102 ± 0.254
0.674HisPhe: 0.674 ± 0.21
1.47HisGly: 1.47 ± 0.279
0.98HisHis: 0.98 ± 0.362
1.837HisIle: 1.837 ± 0.419
0.796HisLys: 0.796 ± 0.201
2.143HisLeu: 2.143 ± 0.363
0.184HisMet: 0.184 ± 0.11
0.49HisAsn: 0.49 ± 0.189
1.102HisPro: 1.102 ± 0.253
0.674HisGln: 0.674 ± 0.226
0.918HisArg: 0.918 ± 0.219
1.347HisSer: 1.347 ± 0.28
1.225HisThr: 1.225 ± 0.253
0.98HisVal: 0.98 ± 0.231
0.245HisTrp: 0.245 ± 0.129
1.102HisTyr: 1.102 ± 0.428
0.0HisXaa: 0.0 ± 0.0
Ile
4.164IleAla: 4.164 ± 0.489
0.612IleCys: 0.612 ± 0.165
3.552IleAsp: 3.552 ± 0.393
3.0IleGlu: 3.0 ± 0.611
1.102IlePhe: 1.102 ± 0.425
2.449IleGly: 2.449 ± 0.372
1.408IleHis: 1.408 ± 0.332
2.388IleIle: 2.388 ± 0.437
2.755IleLys: 2.755 ± 0.426
3.919IleLeu: 3.919 ± 0.676
1.041IleMet: 1.041 ± 0.271
2.755IleAsn: 2.755 ± 0.436
2.572IlePro: 2.572 ± 0.54
1.715IleGln: 1.715 ± 0.328
4.531IleArg: 4.531 ± 0.472
5.021IleSer: 5.021 ± 0.728
4.164IleThr: 4.164 ± 0.498
2.449IleVal: 2.449 ± 0.496
0.551IleTrp: 0.551 ± 0.172
1.163IleTyr: 1.163 ± 0.45
0.0IleXaa: 0.0 ± 0.0
Lys
4.286LysAla: 4.286 ± 0.469
0.796LysCys: 0.796 ± 0.218
1.898LysAsp: 1.898 ± 0.317
3.062LysGlu: 3.062 ± 0.499
0.98LysPhe: 0.98 ± 0.279
3.919LysGly: 3.919 ± 0.548
1.531LysHis: 1.531 ± 0.291
3.429LysIle: 3.429 ± 0.539
3.49LysLys: 3.49 ± 0.616
4.164LysLeu: 4.164 ± 0.531
1.347LysMet: 1.347 ± 0.28
2.511LysAsn: 2.511 ± 0.401
2.694LysPro: 2.694 ± 0.418
2.511LysGln: 2.511 ± 0.357
3.368LysArg: 3.368 ± 0.492
3.245LysSer: 3.245 ± 0.451
3.613LysThr: 3.613 ± 0.557
3.123LysVal: 3.123 ± 0.449
0.674LysTrp: 0.674 ± 0.226
1.347LysTyr: 1.347 ± 0.31
0.0LysXaa: 0.0 ± 0.0
Leu
9.124LeuAla: 9.124 ± 0.89
1.776LeuCys: 1.776 ± 0.434
3.674LeuAsp: 3.674 ± 0.431
4.96LeuGlu: 4.96 ± 0.63
2.755LeuPhe: 2.755 ± 0.412
4.837LeuGly: 4.837 ± 0.628
1.286LeuHis: 1.286 ± 0.265
4.409LeuIle: 4.409 ± 0.568
5.817LeuLys: 5.817 ± 0.588
6.919LeuLeu: 6.919 ± 0.729
2.266LeuMet: 2.266 ± 0.363
3.552LeuAsn: 3.552 ± 0.492
4.103LeuPro: 4.103 ± 0.554
3.674LeuGln: 3.674 ± 0.429
6.246LeuArg: 6.246 ± 0.657
6.429LeuSer: 6.429 ± 0.731
5.878LeuThr: 5.878 ± 0.689
5.389LeuVal: 5.389 ± 0.593
1.347LeuTrp: 1.347 ± 0.269
2.388LeuTyr: 2.388 ± 0.367
0.0LeuXaa: 0.0 ± 0.0
Met
2.266MetAla: 2.266 ± 0.362
0.061MetCys: 0.061 ± 0.066
0.49MetAsp: 0.49 ± 0.13
1.347MetGlu: 1.347 ± 0.289
0.674MetPhe: 0.674 ± 0.218
1.163MetGly: 1.163 ± 0.282
0.184MetHis: 0.184 ± 0.114
0.98MetIle: 0.98 ± 0.229
1.408MetLys: 1.408 ± 0.261
3.307MetLeu: 3.307 ± 0.519
0.551MetMet: 0.551 ± 0.181
1.408MetAsn: 1.408 ± 0.254
1.225MetPro: 1.225 ± 0.24
1.653MetGln: 1.653 ± 0.263
1.653MetArg: 1.653 ± 0.324
2.204MetSer: 2.204 ± 0.336
1.959MetThr: 1.959 ± 0.315
1.776MetVal: 1.776 ± 0.36
0.245MetTrp: 0.245 ± 0.119
0.367MetTyr: 0.367 ± 0.158
0.0MetXaa: 0.0 ± 0.0
Asn
4.776AsnAla: 4.776 ± 0.648
0.49AsnCys: 0.49 ± 0.185
1.837AsnAsp: 1.837 ± 0.295
3.245AsnGlu: 3.245 ± 0.467
0.674AsnPhe: 0.674 ± 0.246
3.858AsnGly: 3.858 ± 0.605
1.408AsnHis: 1.408 ± 0.381
3.123AsnIle: 3.123 ± 0.805
2.449AsnLys: 2.449 ± 0.651
2.755AsnLeu: 2.755 ± 0.437
0.918AsnMet: 0.918 ± 0.291
2.143AsnAsn: 2.143 ± 0.476
2.204AsnPro: 2.204 ± 0.392
1.837AsnGln: 1.837 ± 0.351
2.694AsnArg: 2.694 ± 0.426
2.449AsnSer: 2.449 ± 0.424
2.143AsnThr: 2.143 ± 0.444
1.837AsnVal: 1.837 ± 0.276
0.49AsnTrp: 0.49 ± 0.145
1.163AsnTyr: 1.163 ± 0.331
0.0AsnXaa: 0.0 ± 0.0
Pro
4.899ProAla: 4.899 ± 0.67
0.49ProCys: 0.49 ± 0.167
3.184ProAsp: 3.184 ± 0.47
5.205ProGlu: 5.205 ± 0.591
1.531ProPhe: 1.531 ± 0.356
3.368ProGly: 3.368 ± 0.463
1.041ProHis: 1.041 ± 0.236
0.674ProIle: 0.674 ± 0.228
1.776ProLys: 1.776 ± 0.431
3.368ProLeu: 3.368 ± 0.501
0.551ProMet: 0.551 ± 0.23
1.102ProAsn: 1.102 ± 0.259
2.327ProPro: 2.327 ± 0.557
1.531ProGln: 1.531 ± 0.393
1.959ProArg: 1.959 ± 0.57
2.817ProSer: 2.817 ± 0.332
1.898ProThr: 1.898 ± 0.305
4.592ProVal: 4.592 ± 0.686
0.551ProTrp: 0.551 ± 0.174
0.98ProTyr: 0.98 ± 0.251
0.0ProXaa: 0.0 ± 0.0
Gln
4.409GlnAla: 4.409 ± 0.654
0.98GlnCys: 0.98 ± 0.213
1.776GlnAsp: 1.776 ± 0.261
1.898GlnGlu: 1.898 ± 0.421
1.653GlnPhe: 1.653 ± 0.28
3.062GlnGly: 3.062 ± 0.56
1.347GlnHis: 1.347 ± 0.239
2.817GlnIle: 2.817 ± 0.465
2.633GlnLys: 2.633 ± 0.349
3.429GlnLeu: 3.429 ± 0.428
1.408GlnMet: 1.408 ± 0.268
1.347GlnAsn: 1.347 ± 0.443
1.959GlnPro: 1.959 ± 0.318
2.204GlnGln: 2.204 ± 0.493
2.388GlnArg: 2.388 ± 0.436
3.0GlnSer: 3.0 ± 0.486
2.266GlnThr: 2.266 ± 0.389
2.266GlnVal: 2.266 ± 0.404
0.612GlnTrp: 0.612 ± 0.176
1.715GlnTyr: 1.715 ± 0.304
0.0GlnXaa: 0.0 ± 0.0
Arg
4.592ArgAla: 4.592 ± 0.786
0.796ArgCys: 0.796 ± 0.244
4.531ArgAsp: 4.531 ± 0.751
5.756ArgGlu: 5.756 ± 0.631
2.572ArgPhe: 2.572 ± 0.471
4.041ArgGly: 4.041 ± 0.481
2.082ArgHis: 2.082 ± 0.383
2.694ArgIle: 2.694 ± 0.409
3.245ArgLys: 3.245 ± 0.446
6.491ArgLeu: 6.491 ± 0.691
1.592ArgMet: 1.592 ± 0.298
3.0ArgAsn: 3.0 ± 0.436
2.694ArgPro: 2.694 ± 0.462
2.939ArgGln: 2.939 ± 0.532
4.409ArgArg: 4.409 ± 0.586
2.449ArgSer: 2.449 ± 0.346
2.817ArgThr: 2.817 ± 0.323
3.98ArgVal: 3.98 ± 0.492
1.163ArgTrp: 1.163 ± 0.349
1.837ArgTyr: 1.837 ± 0.375
0.0ArgXaa: 0.0 ± 0.0
Ser
7.777SerAla: 7.777 ± 1.086
0.735SerCys: 0.735 ± 0.208
3.0SerAsp: 3.0 ± 0.438
4.837SerGlu: 4.837 ± 0.666
1.776SerPhe: 1.776 ± 0.3
6.246SerGly: 6.246 ± 0.71
0.918SerHis: 0.918 ± 0.265
3.184SerIle: 3.184 ± 0.436
2.449SerLys: 2.449 ± 0.456
6.491SerLeu: 6.491 ± 0.684
1.653SerMet: 1.653 ± 0.315
2.143SerAsn: 2.143 ± 0.448
2.511SerPro: 2.511 ± 0.449
3.919SerGln: 3.919 ± 0.488
3.858SerArg: 3.858 ± 0.54
4.348SerSer: 4.348 ± 0.504
3.429SerThr: 3.429 ± 0.411
5.327SerVal: 5.327 ± 0.679
0.735SerTrp: 0.735 ± 0.184
2.327SerTyr: 2.327 ± 0.524
0.0SerXaa: 0.0 ± 0.0
Thr
6.062ThrAla: 6.062 ± 0.804
0.367ThrCys: 0.367 ± 0.152
3.245ThrAsp: 3.245 ± 0.473
3.919ThrGlu: 3.919 ± 0.589
2.266ThrPhe: 2.266 ± 0.399
5.205ThrGly: 5.205 ± 0.795
1.041ThrHis: 1.041 ± 0.248
3.062ThrIle: 3.062 ± 0.466
2.143ThrLys: 2.143 ± 0.348
5.878ThrLeu: 5.878 ± 0.648
0.918ThrMet: 0.918 ± 0.297
1.592ThrAsn: 1.592 ± 0.272
2.939ThrPro: 2.939 ± 0.558
1.959ThrGln: 1.959 ± 0.251
2.449ThrArg: 2.449 ± 0.354
3.184ThrSer: 3.184 ± 0.531
3.429ThrThr: 3.429 ± 0.532
4.715ThrVal: 4.715 ± 0.614
0.49ThrTrp: 0.49 ± 0.186
1.715ThrTyr: 1.715 ± 0.346
0.0ThrXaa: 0.0 ± 0.0
Val
6.368ValAla: 6.368 ± 0.766
0.674ValCys: 0.674 ± 0.207
2.633ValAsp: 2.633 ± 0.37
3.98ValGlu: 3.98 ± 0.578
2.143ValPhe: 2.143 ± 0.384
3.98ValGly: 3.98 ± 0.579
0.674ValHis: 0.674 ± 0.214
3.98ValIle: 3.98 ± 0.68
3.98ValLys: 3.98 ± 0.529
6.613ValLeu: 6.613 ± 1.023
1.653ValMet: 1.653 ± 0.301
4.103ValAsn: 4.103 ± 0.532
3.613ValPro: 3.613 ± 0.5
2.266ValGln: 2.266 ± 0.45
4.837ValArg: 4.837 ± 0.62
5.695ValSer: 5.695 ± 0.649
4.103ValThr: 4.103 ± 0.594
4.776ValVal: 4.776 ± 0.665
0.98ValTrp: 0.98 ± 0.28
1.776ValTyr: 1.776 ± 0.338
0.0ValXaa: 0.0 ± 0.0
Trp
1.163TrpAla: 1.163 ± 0.294
0.184TrpCys: 0.184 ± 0.105
0.429TrpAsp: 0.429 ± 0.234
0.674TrpGlu: 0.674 ± 0.19
0.551TrpPhe: 0.551 ± 0.202
0.857TrpGly: 0.857 ± 0.203
0.429TrpHis: 0.429 ± 0.183
0.612TrpIle: 0.612 ± 0.182
0.674TrpLys: 0.674 ± 0.191
1.837TrpLeu: 1.837 ± 0.266
0.367TrpMet: 0.367 ± 0.124
0.49TrpAsn: 0.49 ± 0.183
0.612TrpPro: 0.612 ± 0.211
0.857TrpGln: 0.857 ± 0.18
1.592TrpArg: 1.592 ± 0.297
0.674TrpSer: 0.674 ± 0.17
0.674TrpThr: 0.674 ± 0.167
1.531TrpVal: 1.531 ± 0.341
0.306TrpTrp: 0.306 ± 0.165
0.429TrpTyr: 0.429 ± 0.135
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.633TyrAla: 2.633 ± 0.435
0.306TyrCys: 0.306 ± 0.134
1.347TyrAsp: 1.347 ± 0.326
0.98TyrGlu: 0.98 ± 0.264
1.408TyrPhe: 1.408 ± 0.321
1.837TyrGly: 1.837 ± 0.372
0.49TyrHis: 0.49 ± 0.135
1.47TyrIle: 1.47 ± 0.389
1.408TyrLys: 1.408 ± 0.383
2.143TyrLeu: 2.143 ± 0.3
0.306TyrMet: 0.306 ± 0.148
0.98TyrAsn: 0.98 ± 0.293
1.163TyrPro: 1.163 ± 0.266
1.653TyrGln: 1.653 ± 0.406
2.143TyrArg: 2.143 ± 0.382
2.572TyrSer: 2.572 ± 0.453
2.082TyrThr: 2.082 ± 0.318
1.653TyrVal: 1.653 ± 0.377
0.429TyrTrp: 0.429 ± 0.161
0.796TyrTyr: 0.796 ± 0.211
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (16332 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski