Amino acid dipepetide frequency for Stx converting phage vB_EcoS_P27

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.655AlaAla: 8.655 ± 0.769
1.034AlaCys: 1.034 ± 0.27
5.171AlaAsp: 5.171 ± 0.558
7.458AlaGlu: 7.458 ± 0.894
3.647AlaPhe: 3.647 ± 0.512
7.295AlaGly: 7.295 ± 1.112
1.851AlaHis: 1.851 ± 0.362
4.137AlaIle: 4.137 ± 0.526
4.845AlaLys: 4.845 ± 0.514
6.641AlaLeu: 6.641 ± 0.65
3.375AlaMet: 3.375 ± 0.449
2.776AlaAsn: 2.776 ± 0.371
3.048AlaPro: 3.048 ± 0.447
5.171AlaGln: 5.171 ± 0.821
5.934AlaArg: 5.934 ± 0.586
5.607AlaSer: 5.607 ± 0.526
5.553AlaThr: 5.553 ± 0.678
6.206AlaVal: 6.206 ± 0.615
1.905AlaTrp: 1.905 ± 0.316
2.395AlaTyr: 2.395 ± 0.347
0.0AlaXaa: 0.0 ± 0.0
Cys
1.306CysAla: 1.306 ± 0.306
0.49CysCys: 0.49 ± 0.196
0.544CysAsp: 0.544 ± 0.189
0.762CysGlu: 0.762 ± 0.257
0.49CysPhe: 0.49 ± 0.176
0.871CysGly: 0.871 ± 0.293
0.218CysHis: 0.218 ± 0.116
0.599CysIle: 0.599 ± 0.219
0.599CysLys: 0.599 ± 0.226
0.653CysLeu: 0.653 ± 0.224
0.163CysMet: 0.163 ± 0.094
0.272CysAsn: 0.272 ± 0.138
0.381CysPro: 0.381 ± 0.172
0.327CysGln: 0.327 ± 0.135
1.198CysArg: 1.198 ± 0.302
0.708CysSer: 0.708 ± 0.242
0.599CysThr: 0.599 ± 0.191
0.871CysVal: 0.871 ± 0.225
0.109CysTrp: 0.109 ± 0.072
0.599CysTyr: 0.599 ± 0.239
0.0CysXaa: 0.0 ± 0.0
Asp
5.988AspAla: 5.988 ± 0.707
0.708AspCys: 0.708 ± 0.237
3.593AspAsp: 3.593 ± 0.382
4.355AspGlu: 4.355 ± 0.467
1.47AspPhe: 1.47 ± 0.23
4.845AspGly: 4.845 ± 0.478
1.089AspHis: 1.089 ± 0.34
3.647AspIle: 3.647 ± 0.462
3.865AspLys: 3.865 ± 0.375
4.028AspLeu: 4.028 ± 0.489
1.851AspMet: 1.851 ± 0.288
2.994AspAsn: 2.994 ± 0.387
2.559AspPro: 2.559 ± 0.438
1.415AspGln: 1.415 ± 0.307
3.212AspArg: 3.212 ± 0.379
2.994AspSer: 2.994 ± 0.432
2.722AspThr: 2.722 ± 0.442
4.192AspVal: 4.192 ± 0.424
1.143AspTrp: 1.143 ± 0.313
1.579AspTyr: 1.579 ± 0.312
0.0AspXaa: 0.0 ± 0.0
Glu
6.696GluAla: 6.696 ± 0.662
0.708GluCys: 0.708 ± 0.212
2.069GluAsp: 2.069 ± 0.301
3.647GluGlu: 3.647 ± 0.381
2.504GluPhe: 2.504 ± 0.365
4.028GluGly: 4.028 ± 0.517
1.415GluHis: 1.415 ± 0.251
3.593GluIle: 3.593 ± 0.475
4.845GluLys: 4.845 ± 0.582
5.988GluLeu: 5.988 ± 0.611
2.123GluMet: 2.123 ± 0.368
3.321GluAsn: 3.321 ± 0.518
1.851GluPro: 1.851 ± 0.402
4.192GluGln: 4.192 ± 0.566
5.77GluArg: 5.77 ± 0.616
3.702GluSer: 3.702 ± 0.496
3.865GluThr: 3.865 ± 0.634
4.409GluVal: 4.409 ± 0.565
1.361GluTrp: 1.361 ± 0.267
2.232GluTyr: 2.232 ± 0.379
0.0GluXaa: 0.0 ± 0.0
Phe
2.885PheAla: 2.885 ± 0.471
0.381PheCys: 0.381 ± 0.173
1.688PheAsp: 1.688 ± 0.273
1.742PheGlu: 1.742 ± 0.369
0.98PhePhe: 0.98 ± 0.243
2.45PheGly: 2.45 ± 0.277
0.762PheHis: 0.762 ± 0.184
1.905PheIle: 1.905 ± 0.29
1.742PheLys: 1.742 ± 0.33
2.232PheLeu: 2.232 ± 0.321
0.871PheMet: 0.871 ± 0.213
1.47PheAsn: 1.47 ± 0.277
1.306PhePro: 1.306 ± 0.286
0.817PheGln: 0.817 ± 0.187
2.667PheArg: 2.667 ± 0.336
2.831PheSer: 2.831 ± 0.461
2.45PheThr: 2.45 ± 0.318
2.177PheVal: 2.177 ± 0.31
0.762PheTrp: 0.762 ± 0.222
1.143PheTyr: 1.143 ± 0.247
0.0PheXaa: 0.0 ± 0.0
Gly
5.716GlyAla: 5.716 ± 0.956
0.871GlyCys: 0.871 ± 0.324
4.899GlyAsp: 4.899 ± 0.807
6.26GlyGlu: 6.26 ± 1.362
2.667GlyPhe: 2.667 ± 0.355
4.627GlyGly: 4.627 ± 0.558
1.306GlyHis: 1.306 ± 0.263
4.355GlyIle: 4.355 ± 0.707
5.063GlyLys: 5.063 ± 0.887
5.063GlyLeu: 5.063 ± 0.434
2.45GlyMet: 2.45 ± 0.303
2.94GlyAsn: 2.94 ± 0.333
3.974GlyPro: 3.974 ± 2.139
2.504GlyGln: 2.504 ± 0.414
4.137GlyArg: 4.137 ± 0.585
3.538GlySer: 3.538 ± 0.514
3.157GlyThr: 3.157 ± 0.46
5.063GlyVal: 5.063 ± 0.444
1.143GlyTrp: 1.143 ± 0.238
2.504GlyTyr: 2.504 ± 0.372
0.0GlyXaa: 0.0 ± 0.0
His
1.905HisAla: 1.905 ± 0.294
0.109HisCys: 0.109 ± 0.073
1.198HisAsp: 1.198 ± 0.329
0.762HisGlu: 0.762 ± 0.225
0.762HisPhe: 0.762 ± 0.263
1.688HisGly: 1.688 ± 0.394
0.327HisHis: 0.327 ± 0.146
0.708HisIle: 0.708 ± 0.211
1.252HisLys: 1.252 ± 0.316
1.415HisLeu: 1.415 ± 0.368
0.381HisMet: 0.381 ± 0.168
1.089HisAsn: 1.089 ± 0.234
0.98HisPro: 0.98 ± 0.265
0.708HisGln: 0.708 ± 0.201
1.089HisArg: 1.089 ± 0.258
0.925HisSer: 0.925 ± 0.185
0.871HisThr: 0.871 ± 0.214
0.925HisVal: 0.925 ± 0.224
0.435HisTrp: 0.435 ± 0.21
0.925HisTyr: 0.925 ± 0.254
0.0HisXaa: 0.0 ± 0.0
Ile
4.518IleAla: 4.518 ± 0.539
1.034IleCys: 1.034 ± 0.304
3.702IleAsp: 3.702 ± 0.434
3.647IleGlu: 3.647 ± 0.674
1.089IlePhe: 1.089 ± 0.235
2.613IleGly: 2.613 ± 0.412
1.089IleHis: 1.089 ± 0.244
2.286IleIle: 2.286 ± 0.306
3.266IleLys: 3.266 ± 0.455
2.994IleLeu: 2.994 ± 0.55
1.034IleMet: 1.034 ± 0.189
3.103IleAsn: 3.103 ± 0.406
2.504IlePro: 2.504 ± 0.388
2.177IleGln: 2.177 ± 0.324
4.355IleArg: 4.355 ± 0.427
4.192IleSer: 4.192 ± 0.514
3.919IleThr: 3.919 ± 0.617
2.341IleVal: 2.341 ± 0.423
0.653IleTrp: 0.653 ± 0.212
1.796IleTyr: 1.796 ± 0.332
0.0IleXaa: 0.0 ± 0.0
Lys
5.934LysAla: 5.934 ± 0.65
0.544LysCys: 0.544 ± 0.193
3.212LysAsp: 3.212 ± 0.431
4.355LysGlu: 4.355 ± 0.56
1.415LysPhe: 1.415 ± 0.269
5.879LysGly: 5.879 ± 1.089
0.98LysHis: 0.98 ± 0.206
3.593LysIle: 3.593 ± 0.454
4.083LysLys: 4.083 ± 0.703
5.171LysLeu: 5.171 ± 0.449
1.905LysMet: 1.905 ± 0.318
3.593LysAsn: 3.593 ± 0.428
2.831LysPro: 2.831 ± 0.444
2.885LysGln: 2.885 ± 0.441
3.212LysArg: 3.212 ± 0.375
3.048LysSer: 3.048 ± 0.394
3.484LysThr: 3.484 ± 0.452
2.94LysVal: 2.94 ± 0.418
0.708LysTrp: 0.708 ± 0.21
1.851LysTyr: 1.851 ± 0.388
0.0LysXaa: 0.0 ± 0.0
Leu
8.601LeuAla: 8.601 ± 0.746
0.925LeuCys: 0.925 ± 0.287
3.974LeuAsp: 3.974 ± 0.513
3.919LeuGlu: 3.919 ± 0.46
2.667LeuPhe: 2.667 ± 0.432
4.246LeuGly: 4.246 ± 0.483
1.361LeuHis: 1.361 ± 0.266
3.484LeuIle: 3.484 ± 0.527
4.899LeuLys: 4.899 ± 0.552
5.825LeuLeu: 5.825 ± 0.599
2.123LeuMet: 2.123 ± 0.362
4.137LeuAsn: 4.137 ± 0.533
3.811LeuPro: 3.811 ± 0.506
2.885LeuGln: 2.885 ± 0.561
4.573LeuArg: 4.573 ± 0.508
5.389LeuSer: 5.389 ± 0.57
4.954LeuThr: 4.954 ± 0.431
4.845LeuVal: 4.845 ± 0.5
0.599LeuTrp: 0.599 ± 0.198
2.504LeuTyr: 2.504 ± 0.446
0.0LeuXaa: 0.0 ± 0.0
Met
2.994MetAla: 2.994 ± 0.352
0.163MetCys: 0.163 ± 0.092
1.579MetAsp: 1.579 ± 0.281
1.524MetGlu: 1.524 ± 0.304
0.653MetPhe: 0.653 ± 0.18
1.579MetGly: 1.579 ± 0.312
0.381MetHis: 0.381 ± 0.13
1.143MetIle: 1.143 ± 0.231
2.177MetLys: 2.177 ± 0.328
1.96MetLeu: 1.96 ± 0.295
0.762MetMet: 0.762 ± 0.205
1.47MetAsn: 1.47 ± 0.285
1.579MetPro: 1.579 ± 0.294
1.198MetGln: 1.198 ± 0.243
1.688MetArg: 1.688 ± 0.294
2.069MetSer: 2.069 ± 0.342
2.613MetThr: 2.613 ± 0.316
1.47MetVal: 1.47 ± 0.346
0.327MetTrp: 0.327 ± 0.118
0.49MetTyr: 0.49 ± 0.178
0.0MetXaa: 0.0 ± 0.0
Asn
4.627AsnAla: 4.627 ± 0.604
0.327AsnCys: 0.327 ± 0.132
2.286AsnAsp: 2.286 ± 0.436
3.212AsnGlu: 3.212 ± 0.393
1.796AsnPhe: 1.796 ± 0.414
3.865AsnGly: 3.865 ± 0.425
1.415AsnHis: 1.415 ± 0.351
3.048AsnIle: 3.048 ± 0.422
2.613AsnLys: 2.613 ± 0.324
3.375AsnLeu: 3.375 ± 0.522
1.524AsnMet: 1.524 ± 0.251
1.579AsnAsn: 1.579 ± 0.334
1.742AsnPro: 1.742 ± 0.265
1.96AsnGln: 1.96 ± 0.371
2.667AsnArg: 2.667 ± 0.468
2.613AsnSer: 2.613 ± 0.423
2.123AsnThr: 2.123 ± 0.275
2.286AsnVal: 2.286 ± 0.393
0.49AsnTrp: 0.49 ± 0.162
1.252AsnTyr: 1.252 ± 0.231
0.0AsnXaa: 0.0 ± 0.0
Pro
3.43ProAla: 3.43 ± 0.527
0.381ProCys: 0.381 ± 0.135
4.137ProAsp: 4.137 ± 0.505
5.171ProGlu: 5.171 ± 0.84
1.415ProPhe: 1.415 ± 0.256
2.994ProGly: 2.994 ± 0.491
0.381ProHis: 0.381 ± 0.147
1.198ProIle: 1.198 ± 0.311
2.831ProLys: 2.831 ± 0.661
2.395ProLeu: 2.395 ± 0.354
0.817ProMet: 0.817 ± 0.168
1.306ProAsn: 1.306 ± 0.266
1.252ProPro: 1.252 ± 0.246
2.014ProGln: 2.014 ± 0.509
2.232ProArg: 2.232 ± 0.399
2.341ProSer: 2.341 ± 0.381
1.905ProThr: 1.905 ± 0.308
4.573ProVal: 4.573 ± 0.479
0.762ProTrp: 0.762 ± 0.223
1.579ProTyr: 1.579 ± 0.301
0.0ProXaa: 0.0 ± 0.0
Gln
4.355GlnAla: 4.355 ± 0.585
0.925GlnCys: 0.925 ± 0.259
2.395GlnAsp: 2.395 ± 0.345
2.885GlnGlu: 2.885 ± 0.403
1.633GlnPhe: 1.633 ± 0.289
3.212GlnGly: 3.212 ± 0.712
0.708GlnHis: 0.708 ± 0.188
2.014GlnIle: 2.014 ± 0.426
2.885GlnLys: 2.885 ± 0.463
3.43GlnLeu: 3.43 ± 0.4
0.98GlnMet: 0.98 ± 0.264
2.014GlnAsn: 2.014 ± 0.344
1.633GlnPro: 1.633 ± 0.487
3.756GlnGln: 3.756 ± 0.795
2.776GlnArg: 2.776 ± 0.483
2.232GlnSer: 2.232 ± 0.28
2.232GlnThr: 2.232 ± 0.404
2.559GlnVal: 2.559 ± 0.453
0.817GlnTrp: 0.817 ± 0.234
1.524GlnTyr: 1.524 ± 0.271
0.0GlnXaa: 0.0 ± 0.0
Arg
4.355ArgAla: 4.355 ± 0.467
0.49ArgCys: 0.49 ± 0.176
4.192ArgAsp: 4.192 ± 0.639
5.28ArgGlu: 5.28 ± 0.649
2.123ArgPhe: 2.123 ± 0.322
4.355ArgGly: 4.355 ± 0.881
1.524ArgHis: 1.524 ± 0.274
3.919ArgIle: 3.919 ± 0.555
4.192ArgLys: 4.192 ± 0.521
5.117ArgLeu: 5.117 ± 0.421
1.742ArgMet: 1.742 ± 0.288
3.266ArgAsn: 3.266 ± 0.439
2.123ArgPro: 2.123 ± 0.408
2.994ArgGln: 2.994 ± 0.355
5.77ArgArg: 5.77 ± 0.639
4.246ArgSer: 4.246 ± 0.511
3.321ArgThr: 3.321 ± 0.6
4.028ArgVal: 4.028 ± 0.465
1.198ArgTrp: 1.198 ± 0.262
2.341ArgTyr: 2.341 ± 0.416
0.0ArgXaa: 0.0 ± 0.0
Ser
5.661SerAla: 5.661 ± 0.575
0.599SerCys: 0.599 ± 0.207
3.811SerAsp: 3.811 ± 0.446
4.192SerGlu: 4.192 ± 0.509
2.069SerPhe: 2.069 ± 0.328
5.553SerGly: 5.553 ± 0.574
0.871SerHis: 0.871 ± 0.205
2.831SerIle: 2.831 ± 0.448
3.157SerLys: 3.157 ± 0.573
5.879SerLeu: 5.879 ± 0.668
1.415SerMet: 1.415 ± 0.332
2.177SerAsn: 2.177 ± 0.304
3.212SerPro: 3.212 ± 0.417
2.831SerGln: 2.831 ± 0.374
3.702SerArg: 3.702 ± 0.477
2.94SerSer: 2.94 ± 0.529
3.593SerThr: 3.593 ± 0.468
3.702SerVal: 3.702 ± 0.437
0.817SerTrp: 0.817 ± 0.217
1.524SerTyr: 1.524 ± 0.278
0.0SerXaa: 0.0 ± 0.0
Thr
5.28ThrAla: 5.28 ± 0.41
0.544ThrCys: 0.544 ± 0.177
3.647ThrAsp: 3.647 ± 0.482
3.702ThrGlu: 3.702 ± 0.346
1.851ThrPhe: 1.851 ± 0.356
5.335ThrGly: 5.335 ± 0.832
0.98ThrHis: 0.98 ± 0.232
3.756ThrIle: 3.756 ± 0.444
2.994ThrLys: 2.994 ± 0.418
5.008ThrLeu: 5.008 ± 0.533
0.817ThrMet: 0.817 ± 0.159
2.177ThrAsn: 2.177 ± 0.381
3.375ThrPro: 3.375 ± 0.334
1.796ThrGln: 1.796 ± 0.374
2.885ThrArg: 2.885 ± 0.389
3.865ThrSer: 3.865 ± 0.428
2.994ThrThr: 2.994 ± 0.463
3.811ThrVal: 3.811 ± 0.55
0.708ThrTrp: 0.708 ± 0.18
1.198ThrTyr: 1.198 ± 0.294
0.0ThrXaa: 0.0 ± 0.0
Val
6.206ValAla: 6.206 ± 0.534
0.817ValCys: 0.817 ± 0.217
3.647ValAsp: 3.647 ± 0.345
3.538ValGlu: 3.538 ± 0.387
2.123ValPhe: 2.123 ± 0.324
3.321ValGly: 3.321 ± 0.554
0.653ValHis: 0.653 ± 0.188
3.538ValIle: 3.538 ± 0.471
3.865ValLys: 3.865 ± 0.447
5.444ValLeu: 5.444 ± 0.557
1.851ValMet: 1.851 ± 0.364
3.103ValAsn: 3.103 ± 0.36
2.776ValPro: 2.776 ± 0.391
2.395ValGln: 2.395 ± 0.377
4.627ValArg: 4.627 ± 0.992
4.518ValSer: 4.518 ± 0.468
4.028ValThr: 4.028 ± 0.375
4.464ValVal: 4.464 ± 0.438
0.925ValTrp: 0.925 ± 0.222
1.96ValTyr: 1.96 ± 0.302
0.0ValXaa: 0.0 ± 0.0
Trp
0.871TrpAla: 0.871 ± 0.272
0.272TrpCys: 0.272 ± 0.136
0.599TrpAsp: 0.599 ± 0.195
0.599TrpGlu: 0.599 ± 0.168
0.708TrpPhe: 0.708 ± 0.154
0.817TrpGly: 0.817 ± 0.217
0.381TrpHis: 0.381 ± 0.143
0.817TrpIle: 0.817 ± 0.209
1.089TrpLys: 1.089 ± 0.208
1.742TrpLeu: 1.742 ± 0.388
0.708TrpMet: 0.708 ± 0.204
0.653TrpAsn: 0.653 ± 0.184
0.599TrpPro: 0.599 ± 0.206
1.252TrpGln: 1.252 ± 0.199
1.633TrpArg: 1.633 ± 0.274
0.925TrpSer: 0.925 ± 0.217
0.544TrpThr: 0.544 ± 0.195
1.089TrpVal: 1.089 ± 0.223
0.544TrpTrp: 0.544 ± 0.199
0.327TrpTyr: 0.327 ± 0.124
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.722TyrAla: 2.722 ± 0.427
0.435TyrCys: 0.435 ± 0.184
1.905TyrAsp: 1.905 ± 0.357
1.306TyrGlu: 1.306 ± 0.281
1.252TyrPhe: 1.252 ± 0.276
2.667TyrGly: 2.667 ± 0.439
0.762TyrHis: 0.762 ± 0.241
1.742TyrIle: 1.742 ± 0.3
1.361TyrLys: 1.361 ± 0.313
1.361TyrLeu: 1.361 ± 0.267
0.871TyrMet: 0.871 ± 0.234
1.361TyrAsn: 1.361 ± 0.286
1.579TyrPro: 1.579 ± 0.276
1.688TyrGln: 1.688 ± 0.286
2.45TyrArg: 2.45 ± 0.446
1.851TyrSer: 1.851 ± 0.243
1.796TyrThr: 1.796 ± 0.43
1.851TyrVal: 1.851 ± 0.277
0.762TyrTrp: 0.762 ± 0.211
1.143TyrTyr: 1.143 ± 0.213
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 88 proteins (18371 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski