Amino acid dipepetide frequency for Stx2-converting phage Stx2a_F765

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.627AlaAla: 8.627 ± 0.881
1.024AlaCys: 1.024 ± 0.315
5.877AlaAsp: 5.877 ± 0.574
8.196AlaGlu: 8.196 ± 0.813
3.505AlaPhe: 3.505 ± 0.549
7.98AlaGly: 7.98 ± 1.16
1.348AlaHis: 1.348 ± 0.336
3.936AlaIle: 3.936 ± 0.633
4.853AlaLys: 4.853 ± 0.528
6.848AlaLeu: 6.848 ± 0.561
2.804AlaMet: 2.804 ± 0.456
2.642AlaAsn: 2.642 ± 0.405
3.882AlaPro: 3.882 ± 0.543
4.368AlaGln: 4.368 ± 0.837
5.823AlaArg: 5.823 ± 0.591
5.176AlaSer: 5.176 ± 0.43
5.716AlaThr: 5.716 ± 0.722
6.416AlaVal: 6.416 ± 0.646
1.618AlaTrp: 1.618 ± 0.306
2.912AlaTyr: 2.912 ± 0.376
0.0AlaXaa: 0.0 ± 0.0
Cys
1.186CysAla: 1.186 ± 0.271
0.324CysCys: 0.324 ± 0.147
0.755CysAsp: 0.755 ± 0.293
0.701CysGlu: 0.701 ± 0.239
0.485CysPhe: 0.485 ± 0.185
1.186CysGly: 1.186 ± 0.374
0.216CysHis: 0.216 ± 0.113
0.647CysIle: 0.647 ± 0.203
0.593CysLys: 0.593 ± 0.241
0.971CysLeu: 0.971 ± 0.243
0.162CysMet: 0.162 ± 0.108
0.377CysAsn: 0.377 ± 0.153
0.701CysPro: 0.701 ± 0.284
0.377CysGln: 0.377 ± 0.145
0.971CysArg: 0.971 ± 0.258
0.971CysSer: 0.971 ± 0.287
0.539CysThr: 0.539 ± 0.21
1.132CysVal: 1.132 ± 0.293
0.108CysTrp: 0.108 ± 0.1
0.324CysTyr: 0.324 ± 0.156
0.0CysXaa: 0.0 ± 0.0
Asp
5.823AspAla: 5.823 ± 0.722
0.701AspCys: 0.701 ± 0.224
3.289AspAsp: 3.289 ± 0.316
4.152AspGlu: 4.152 ± 0.478
1.779AspPhe: 1.779 ± 0.296
4.368AspGly: 4.368 ± 0.469
0.809AspHis: 0.809 ± 0.273
3.451AspIle: 3.451 ± 0.419
4.314AspLys: 4.314 ± 0.469
3.72AspLeu: 3.72 ± 0.496
1.672AspMet: 1.672 ± 0.295
2.642AspAsn: 2.642 ± 0.395
2.265AspPro: 2.265 ± 0.439
1.294AspGln: 1.294 ± 0.287
3.667AspArg: 3.667 ± 0.49
2.966AspSer: 2.966 ± 0.406
3.181AspThr: 3.181 ± 0.458
4.475AspVal: 4.475 ± 0.484
0.917AspTrp: 0.917 ± 0.267
1.564AspTyr: 1.564 ± 0.359
0.0AspXaa: 0.0 ± 0.0
Glu
6.093GluAla: 6.093 ± 0.556
1.024GluCys: 1.024 ± 0.327
2.372GluAsp: 2.372 ± 0.332
4.206GluGlu: 4.206 ± 0.512
2.912GluPhe: 2.912 ± 0.545
3.99GluGly: 3.99 ± 0.488
1.456GluHis: 1.456 ± 0.237
3.882GluIle: 3.882 ± 0.469
4.475GluLys: 4.475 ± 0.541
5.823GluLeu: 5.823 ± 0.672
2.426GluMet: 2.426 ± 0.323
2.966GluAsn: 2.966 ± 0.439
1.887GluPro: 1.887 ± 0.457
4.314GluGln: 4.314 ± 0.485
5.554GluArg: 5.554 ± 0.572
3.451GluSer: 3.451 ± 0.372
3.936GluThr: 3.936 ± 0.683
4.583GluVal: 4.583 ± 0.501
0.863GluTrp: 0.863 ± 0.219
1.995GluTyr: 1.995 ± 0.327
0.0GluXaa: 0.0 ± 0.0
Phe
2.804PheAla: 2.804 ± 0.349
0.809PheCys: 0.809 ± 0.26
2.211PheAsp: 2.211 ± 0.331
1.348PheGlu: 1.348 ± 0.307
1.186PhePhe: 1.186 ± 0.27
2.319PheGly: 2.319 ± 0.32
0.593PheHis: 0.593 ± 0.155
2.103PheIle: 2.103 ± 0.346
1.564PheLys: 1.564 ± 0.277
2.049PheLeu: 2.049 ± 0.296
1.348PheMet: 1.348 ± 0.289
1.725PheAsn: 1.725 ± 0.317
1.402PhePro: 1.402 ± 0.283
0.809PheGln: 0.809 ± 0.227
3.073PheArg: 3.073 ± 0.464
3.235PheSer: 3.235 ± 0.532
2.211PheThr: 2.211 ± 0.258
2.966PheVal: 2.966 ± 0.428
0.593PheTrp: 0.593 ± 0.208
0.917PheTyr: 0.917 ± 0.247
0.0PheXaa: 0.0 ± 0.0
Gly
6.363GlyAla: 6.363 ± 0.953
0.917GlyCys: 0.917 ± 0.284
4.637GlyAsp: 4.637 ± 0.778
5.931GlyGlu: 5.931 ± 1.372
3.02GlyPhe: 3.02 ± 0.514
5.338GlyGly: 5.338 ± 0.663
1.294GlyHis: 1.294 ± 0.355
4.314GlyIle: 4.314 ± 0.571
5.392GlyLys: 5.392 ± 0.875
5.176GlyLeu: 5.176 ± 0.427
1.887GlyMet: 1.887 ± 0.329
3.073GlyAsn: 3.073 ± 0.41
3.99GlyPro: 3.99 ± 2.034
2.912GlyGln: 2.912 ± 0.435
3.936GlyArg: 3.936 ± 0.474
3.99GlySer: 3.99 ± 0.457
3.72GlyThr: 3.72 ± 0.493
5.176GlyVal: 5.176 ± 0.544
0.863GlyTrp: 0.863 ± 0.166
2.319GlyTyr: 2.319 ± 0.338
0.0GlyXaa: 0.0 ± 0.0
His
2.049HisAla: 2.049 ± 0.222
0.162HisCys: 0.162 ± 0.098
0.863HisAsp: 0.863 ± 0.194
0.917HisGlu: 0.917 ± 0.235
0.863HisPhe: 0.863 ± 0.24
1.348HisGly: 1.348 ± 0.333
0.539HisHis: 0.539 ± 0.166
0.863HisIle: 0.863 ± 0.236
0.701HisLys: 0.701 ± 0.197
1.887HisLeu: 1.887 ± 0.481
0.431HisMet: 0.431 ± 0.165
0.647HisAsn: 0.647 ± 0.165
0.809HisPro: 0.809 ± 0.192
0.485HisGln: 0.485 ± 0.168
0.917HisArg: 0.917 ± 0.243
1.24HisSer: 1.24 ± 0.248
1.024HisThr: 1.024 ± 0.212
0.971HisVal: 0.971 ± 0.19
0.27HisTrp: 0.27 ± 0.163
0.701HisTyr: 0.701 ± 0.177
0.0HisXaa: 0.0 ± 0.0
Ile
4.799IleAla: 4.799 ± 0.475
0.863IleCys: 0.863 ± 0.267
3.613IleAsp: 3.613 ± 0.461
3.181IleGlu: 3.181 ± 0.507
1.294IlePhe: 1.294 ± 0.289
2.75IleGly: 2.75 ± 0.509
0.917IleHis: 0.917 ± 0.21
2.75IleIle: 2.75 ± 0.515
2.534IleLys: 2.534 ± 0.384
3.181IleLeu: 3.181 ± 0.635
1.132IleMet: 1.132 ± 0.216
3.02IleAsn: 3.02 ± 0.467
2.534IlePro: 2.534 ± 0.32
1.833IleGln: 1.833 ± 0.258
4.853IleArg: 4.853 ± 0.461
3.936IleSer: 3.936 ± 0.66
3.343IleThr: 3.343 ± 0.552
2.049IleVal: 2.049 ± 0.393
0.324IleTrp: 0.324 ± 0.146
1.186IleTyr: 1.186 ± 0.279
0.0IleXaa: 0.0 ± 0.0
Lys
6.039LysAla: 6.039 ± 0.568
0.647LysCys: 0.647 ± 0.233
3.127LysAsp: 3.127 ± 0.415
3.559LysGlu: 3.559 ± 0.521
1.402LysPhe: 1.402 ± 0.264
5.554LysGly: 5.554 ± 1.043
1.348LysHis: 1.348 ± 0.257
3.451LysIle: 3.451 ± 0.475
3.72LysLys: 3.72 ± 0.509
4.691LysLeu: 4.691 ± 0.468
1.618LysMet: 1.618 ± 0.296
2.804LysAsn: 2.804 ± 0.356
2.75LysPro: 2.75 ± 0.421
2.696LysGln: 2.696 ± 0.45
2.534LysArg: 2.534 ± 0.346
3.235LysSer: 3.235 ± 0.372
3.397LysThr: 3.397 ± 0.507
3.02LysVal: 3.02 ± 0.427
0.701LysTrp: 0.701 ± 0.233
1.779LysTyr: 1.779 ± 0.285
0.0LysXaa: 0.0 ± 0.0
Leu
8.196LeuAla: 8.196 ± 0.826
0.971LeuCys: 0.971 ± 0.296
3.936LeuAsp: 3.936 ± 0.519
3.99LeuGlu: 3.99 ± 0.536
2.642LeuPhe: 2.642 ± 0.422
4.745LeuGly: 4.745 ± 0.513
1.456LeuHis: 1.456 ± 0.24
3.289LeuIle: 3.289 ± 0.542
4.475LeuLys: 4.475 ± 0.505
6.147LeuLeu: 6.147 ± 0.587
2.049LeuMet: 2.049 ± 0.352
4.206LeuAsn: 4.206 ± 0.418
3.882LeuPro: 3.882 ± 0.491
3.451LeuGln: 3.451 ± 0.731
4.853LeuArg: 4.853 ± 0.655
5.338LeuSer: 5.338 ± 0.52
5.931LeuThr: 5.931 ± 0.668
4.961LeuVal: 4.961 ± 0.434
0.593LeuTrp: 0.593 ± 0.21
2.319LeuTyr: 2.319 ± 0.401
0.0LeuXaa: 0.0 ± 0.0
Met
3.073MetAla: 3.073 ± 0.373
0.108MetCys: 0.108 ± 0.086
1.24MetAsp: 1.24 ± 0.237
1.456MetGlu: 1.456 ± 0.258
0.701MetPhe: 0.701 ± 0.161
1.725MetGly: 1.725 ± 0.335
0.431MetHis: 0.431 ± 0.106
0.809MetIle: 0.809 ± 0.174
2.049MetLys: 2.049 ± 0.289
1.618MetLeu: 1.618 ± 0.224
0.809MetMet: 0.809 ± 0.229
1.941MetAsn: 1.941 ± 0.354
1.833MetPro: 1.833 ± 0.324
1.24MetGln: 1.24 ± 0.218
1.51MetArg: 1.51 ± 0.267
2.049MetSer: 2.049 ± 0.329
2.426MetThr: 2.426 ± 0.33
1.402MetVal: 1.402 ± 0.304
0.216MetTrp: 0.216 ± 0.097
0.701MetTyr: 0.701 ± 0.199
0.0MetXaa: 0.0 ± 0.0
Asn
4.745AsnAla: 4.745 ± 0.666
0.485AsnCys: 0.485 ± 0.187
2.103AsnAsp: 2.103 ± 0.28
2.696AsnGlu: 2.696 ± 0.332
1.186AsnPhe: 1.186 ± 0.323
3.613AsnGly: 3.613 ± 0.444
1.294AsnHis: 1.294 ± 0.319
2.696AsnIle: 2.696 ± 0.399
2.534AsnLys: 2.534 ± 0.384
3.289AsnLeu: 3.289 ± 0.44
1.186AsnMet: 1.186 ± 0.201
1.887AsnAsn: 1.887 ± 0.401
2.103AsnPro: 2.103 ± 0.316
1.564AsnGln: 1.564 ± 0.281
2.804AsnArg: 2.804 ± 0.433
2.265AsnSer: 2.265 ± 0.418
2.157AsnThr: 2.157 ± 0.381
2.534AsnVal: 2.534 ± 0.394
0.539AsnTrp: 0.539 ± 0.152
0.917AsnTyr: 0.917 ± 0.221
0.0AsnXaa: 0.0 ± 0.0
Pro
3.451ProAla: 3.451 ± 0.56
0.539ProCys: 0.539 ± 0.185
3.99ProAsp: 3.99 ± 0.49
5.284ProGlu: 5.284 ± 0.861
1.672ProPhe: 1.672 ± 0.28
3.451ProGly: 3.451 ± 0.518
0.377ProHis: 0.377 ± 0.177
0.917ProIle: 0.917 ± 0.327
2.588ProLys: 2.588 ± 0.673
3.289ProLeu: 3.289 ± 0.497
0.917ProMet: 0.917 ± 0.212
1.024ProAsn: 1.024 ± 0.234
1.51ProPro: 1.51 ± 0.312
1.725ProGln: 1.725 ± 0.549
2.534ProArg: 2.534 ± 0.403
2.696ProSer: 2.696 ± 0.398
2.157ProThr: 2.157 ± 0.365
4.907ProVal: 4.907 ± 0.586
0.647ProTrp: 0.647 ± 0.172
1.833ProTyr: 1.833 ± 0.314
0.0ProXaa: 0.0 ± 0.0
Gln
3.882GlnAla: 3.882 ± 0.578
0.863GlnCys: 0.863 ± 0.253
2.103GlnAsp: 2.103 ± 0.347
2.858GlnGlu: 2.858 ± 0.419
1.402GlnPhe: 1.402 ± 0.235
3.343GlnGly: 3.343 ± 0.677
0.917GlnHis: 0.917 ± 0.253
1.941GlnIle: 1.941 ± 0.422
2.966GlnLys: 2.966 ± 0.336
3.72GlnLeu: 3.72 ± 0.475
0.863GlnMet: 0.863 ± 0.251
1.672GlnAsn: 1.672 ± 0.329
1.833GlnPro: 1.833 ± 0.41
3.073GlnGln: 3.073 ± 0.638
2.588GlnArg: 2.588 ± 0.439
2.103GlnSer: 2.103 ± 0.371
1.833GlnThr: 1.833 ± 0.288
2.319GlnVal: 2.319 ± 0.474
0.701GlnTrp: 0.701 ± 0.168
1.456GlnTyr: 1.456 ± 0.273
0.0GlnXaa: 0.0 ± 0.0
Arg
4.475ArgAla: 4.475 ± 0.402
0.485ArgCys: 0.485 ± 0.194
3.667ArgAsp: 3.667 ± 0.631
5.392ArgGlu: 5.392 ± 0.76
3.235ArgPhe: 3.235 ± 0.456
5.068ArgGly: 5.068 ± 0.881
1.456ArgHis: 1.456 ± 0.285
3.235ArgIle: 3.235 ± 0.448
4.368ArgLys: 4.368 ± 0.426
5.068ArgLeu: 5.068 ± 0.431
1.51ArgMet: 1.51 ± 0.264
2.642ArgAsn: 2.642 ± 0.407
2.103ArgPro: 2.103 ± 0.364
2.912ArgGln: 2.912 ± 0.364
5.446ArgArg: 5.446 ± 0.603
4.152ArgSer: 4.152 ± 0.504
3.559ArgThr: 3.559 ± 0.538
4.314ArgVal: 4.314 ± 0.443
1.186ArgTrp: 1.186 ± 0.256
2.049ArgTyr: 2.049 ± 0.487
0.0ArgXaa: 0.0 ± 0.0
Ser
5.823SerAla: 5.823 ± 0.56
0.755SerCys: 0.755 ± 0.23
3.828SerAsp: 3.828 ± 0.402
4.26SerGlu: 4.26 ± 0.413
2.103SerPhe: 2.103 ± 0.316
5.284SerGly: 5.284 ± 0.551
0.971SerHis: 0.971 ± 0.251
2.372SerIle: 2.372 ± 0.351
2.48SerLys: 2.48 ± 0.482
5.716SerLeu: 5.716 ± 0.696
1.779SerMet: 1.779 ± 0.361
2.157SerAsn: 2.157 ± 0.311
3.397SerPro: 3.397 ± 0.445
2.804SerGln: 2.804 ± 0.362
4.314SerArg: 4.314 ± 0.483
2.642SerSer: 2.642 ± 0.532
3.613SerThr: 3.613 ± 0.497
4.206SerVal: 4.206 ± 0.581
0.971SerTrp: 0.971 ± 0.207
1.672SerTyr: 1.672 ± 0.408
0.0SerXaa: 0.0 ± 0.0
Thr
5.662ThrAla: 5.662 ± 0.472
0.324ThrCys: 0.324 ± 0.151
3.397ThrAsp: 3.397 ± 0.466
4.152ThrGlu: 4.152 ± 0.383
2.103ThrPhe: 2.103 ± 0.39
5.823ThrGly: 5.823 ± 0.761
0.971ThrHis: 0.971 ± 0.234
3.451ThrIle: 3.451 ± 0.403
2.588ThrLys: 2.588 ± 0.349
5.5ThrLeu: 5.5 ± 0.51
1.348ThrMet: 1.348 ± 0.276
1.456ThrAsn: 1.456 ± 0.264
4.152ThrPro: 4.152 ± 0.386
1.941ThrGln: 1.941 ± 0.391
2.426ThrArg: 2.426 ± 0.395
3.99ThrSer: 3.99 ± 0.509
4.206ThrThr: 4.206 ± 0.661
3.99ThrVal: 3.99 ± 0.57
1.024ThrTrp: 1.024 ± 0.261
1.294ThrTyr: 1.294 ± 0.289
0.0ThrXaa: 0.0 ± 0.0
Val
6.848ValAla: 6.848 ± 0.546
1.132ValCys: 1.132 ± 0.326
3.72ValAsp: 3.72 ± 0.37
3.559ValGlu: 3.559 ± 0.352
2.049ValPhe: 2.049 ± 0.337
3.289ValGly: 3.289 ± 0.443
0.647ValHis: 0.647 ± 0.195
3.451ValIle: 3.451 ± 0.504
3.936ValLys: 3.936 ± 0.41
5.5ValLeu: 5.5 ± 0.653
1.995ValMet: 1.995 ± 0.361
3.451ValAsn: 3.451 ± 0.383
2.588ValPro: 2.588 ± 0.374
2.211ValGln: 2.211 ± 0.445
5.068ValArg: 5.068 ± 0.989
5.122ValSer: 5.122 ± 0.623
4.583ValThr: 4.583 ± 0.477
4.152ValVal: 4.152 ± 0.496
1.078ValTrp: 1.078 ± 0.229
2.157ValTyr: 2.157 ± 0.362
0.0ValXaa: 0.0 ± 0.0
Trp
0.917TrpAla: 0.917 ± 0.278
0.216TrpCys: 0.216 ± 0.118
0.593TrpAsp: 0.593 ± 0.217
0.863TrpGlu: 0.863 ± 0.2
0.593TrpPhe: 0.593 ± 0.146
1.024TrpGly: 1.024 ± 0.282
0.324TrpHis: 0.324 ± 0.13
0.809TrpIle: 0.809 ± 0.189
0.917TrpLys: 0.917 ± 0.215
1.402TrpLeu: 1.402 ± 0.379
0.593TrpMet: 0.593 ± 0.18
0.593TrpAsn: 0.593 ± 0.177
0.593TrpPro: 0.593 ± 0.186
0.971TrpGln: 0.971 ± 0.214
1.078TrpArg: 1.078 ± 0.224
0.755TrpSer: 0.755 ± 0.174
0.485TrpThr: 0.485 ± 0.179
0.917TrpVal: 0.917 ± 0.22
0.431TrpTrp: 0.431 ± 0.173
0.324TrpTyr: 0.324 ± 0.106
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.696TyrAla: 2.696 ± 0.391
0.377TyrCys: 0.377 ± 0.146
1.725TyrAsp: 1.725 ± 0.333
1.51TyrGlu: 1.51 ± 0.348
1.132TyrPhe: 1.132 ± 0.211
2.103TyrGly: 2.103 ± 0.333
0.324TyrHis: 0.324 ± 0.138
1.725TyrIle: 1.725 ± 0.367
1.024TyrLys: 1.024 ± 0.203
1.833TyrLeu: 1.833 ± 0.355
0.755TyrMet: 0.755 ± 0.221
1.672TyrAsn: 1.672 ± 0.27
1.294TyrPro: 1.294 ± 0.271
1.456TyrGln: 1.456 ± 0.257
2.372TyrArg: 2.372 ± 0.419
1.833TyrSer: 1.833 ± 0.267
1.779TyrThr: 1.779 ± 0.439
2.049TyrVal: 2.049 ± 0.319
0.755TyrTrp: 0.755 ± 0.188
1.186TyrTyr: 1.186 ± 0.226
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (18547 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski