Amino acid dipeptide frequency for Stx converting phage vB_EcoS_ST2-8624

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
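As a rough illustration of what such a frequency is, the following Python sketch counts adjacent residue pairs and reports them in per mille. It is not the database's own code: the function name `dipeptide_permille`, the use of one-letter codes, the choice of overlapping pairs, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent pairs.

    Hypothetical helper for illustration; the counting rules are assumptions.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Adjacent, overlapping pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not sequences from the phage proteome.
toy_proteome = ["MAAK", "MAXA"]
freqs = dipeptide_permille(toy_proteome)
```

Here `freqs` maps each observed pair (for example `"MA"`) to its share of all counted pairs, scaled to per mille.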

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
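The uniform expectation can be worked out directly: 1 in 400 is 0.25%, or 2.5‰. The Python sketch below computes that baseline and labels a few values taken from the table as over- or underrepresented. The `classify` helper and the plain greater-than / less-than rule are assumptions; the page does not state the exact rule behind its red and blue marking.

```python
N_STANDARD = 20
# 400 ordered pairs, each expected at 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / N_STANDARD**2


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# A few frequencies (in per mille) copied from the table on this page.
sample = {"AlaAla": 8.907, "AlaLeu": 7.103, "CysMet": 0.109, "TrpTrp": 0.328}
labels = {pair: classify(value) for pair, value in sample.items()}
```

A comparison against the product of the two single amino acid frequencies would be a stricter baseline, but that is a different model from the uniform one described here.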

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.907 ± 0.789
AlaCys: 1.038 ± 0.299
AlaAsp: 5.027 ± 0.599
AlaGlu: 8.306 ± 0.79
AlaPhe: 3.606 ± 0.607
AlaGly: 7.814 ± 1.082
AlaHis: 1.475 ± 0.302
AlaIle: 4.863 ± 0.621
AlaLys: 5.136 ± 0.495
AlaLeu: 7.103 ± 0.641
AlaMet: 3.224 ± 0.459
AlaAsn: 3.115 ± 0.432
AlaPro: 3.115 ± 0.45
AlaGln: 5.3 ± 0.64
AlaArg: 5.628 ± 0.631
AlaSer: 6.175 ± 0.485
AlaThr: 5.628 ± 0.731
AlaVal: 6.011 ± 0.588
AlaTrp: 1.639 ± 0.277
AlaTyr: 2.459 ± 0.31
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.038 ± 0.246
CysCys: 0.382 ± 0.156
CysAsp: 0.601 ± 0.182
CysGlu: 0.71 ± 0.205
CysPhe: 0.382 ± 0.14
CysGly: 0.929 ± 0.284
CysHis: 0.328 ± 0.162
CysIle: 0.437 ± 0.192
CysLys: 0.437 ± 0.185
CysLeu: 0.765 ± 0.229
CysMet: 0.109 ± 0.096
CysAsn: 0.219 ± 0.108
CysPro: 0.437 ± 0.159
CysGln: 0.328 ± 0.131
CysArg: 0.984 ± 0.287
CysSer: 1.038 ± 0.309
CysThr: 0.656 ± 0.197
CysVal: 0.71 ± 0.198
CysTrp: 0.164 ± 0.098
CysTyr: 0.437 ± 0.195
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.12 ± 0.58
AspCys: 0.492 ± 0.174
AspAsp: 3.989 ± 0.413
AspGlu: 4.481 ± 0.475
AspPhe: 1.912 ± 0.348
AspGly: 5.136 ± 0.577
AspHis: 0.929 ± 0.339
AspIle: 3.497 ± 0.332
AspLys: 4.371 ± 0.393
AspLeu: 4.207 ± 0.455
AspMet: 1.803 ± 0.29
AspAsn: 2.732 ± 0.401
AspPro: 2.514 ± 0.389
AspGln: 1.639 ± 0.341
AspArg: 3.224 ± 0.403
AspSer: 3.442 ± 0.412
AspThr: 2.514 ± 0.334
AspVal: 3.88 ± 0.393
AspTrp: 1.093 ± 0.326
AspTyr: 1.366 ± 0.274
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.666 ± 0.529
GluCys: 1.038 ± 0.276
GluAsp: 2.295 ± 0.313
GluGlu: 4.207 ± 0.497
GluPhe: 2.295 ± 0.395
GluGly: 4.153 ± 0.603
GluHis: 1.257 ± 0.236
GluIle: 3.77 ± 0.399
GluLys: 4.645 ± 0.621
GluLeu: 6.229 ± 0.671
GluMet: 2.295 ± 0.319
GluAsn: 3.442 ± 0.452
GluPro: 1.749 ± 0.417
GluGln: 4.426 ± 0.522
GluArg: 5.628 ± 0.571
GluSer: 3.552 ± 0.454
GluThr: 4.043 ± 0.695
GluVal: 4.207 ± 0.521
GluTrp: 1.257 ± 0.209
GluTyr: 2.295 ± 0.299
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.005 ± 0.364
PheCys: 0.328 ± 0.126
PheAsp: 1.912 ± 0.305
PheGlu: 1.366 ± 0.324
PhePhe: 0.929 ± 0.269
PheGly: 2.35 ± 0.287
PheHis: 0.765 ± 0.21
PheIle: 1.967 ± 0.273
PheLys: 1.749 ± 0.379
PheLeu: 1.858 ± 0.309
PheMet: 0.929 ± 0.237
PheAsn: 1.147 ± 0.267
PhePro: 1.311 ± 0.29
PheGln: 1.038 ± 0.231
PheArg: 2.35 ± 0.375
PheSer: 2.732 ± 0.428
PheThr: 2.732 ± 0.343
PheVal: 2.404 ± 0.364
PheTrp: 0.82 ± 0.209
PheTyr: 0.82 ± 0.185
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.956 ± 0.946
GlyCys: 0.656 ± 0.219
GlyAsp: 5.136 ± 0.709
GlyGlu: 6.612 ± 1.58
GlyPhe: 2.35 ± 0.358
GlyGly: 5.082 ± 0.659
GlyHis: 1.202 ± 0.274
GlyIle: 3.825 ± 0.6
GlyLys: 4.699 ± 0.922
GlyLeu: 4.808 ± 0.476
GlyMet: 2.295 ± 0.342
GlyAsn: 3.115 ± 0.357
GlyPro: 4.262 ± 2.323
GlyGln: 2.568 ± 0.406
GlyArg: 4.043 ± 0.465
GlySer: 3.661 ± 0.448
GlyThr: 3.552 ± 0.48
GlyVal: 5.355 ± 0.502
GlyTrp: 1.257 ± 0.236
GlyTyr: 2.677 ± 0.407
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.421 ± 0.259
HisCys: 0.273 ± 0.12
HisAsp: 1.038 ± 0.201
HisGlu: 0.929 ± 0.258
HisPhe: 0.71 ± 0.219
HisGly: 1.639 ± 0.372
HisHis: 0.437 ± 0.158
HisIle: 0.601 ± 0.181
HisLys: 1.093 ± 0.316
HisLeu: 1.421 ± 0.373
HisMet: 0.382 ± 0.155
HisAsn: 1.038 ± 0.229
HisPro: 0.929 ± 0.245
HisGln: 0.71 ± 0.21
HisArg: 0.82 ± 0.248
HisSer: 0.71 ± 0.195
HisThr: 0.984 ± 0.25
HisVal: 0.874 ± 0.192
HisTrp: 0.382 ± 0.186
HisTyr: 0.82 ± 0.207
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.191 ± 0.453
IleCys: 0.984 ± 0.273
IleAsp: 3.825 ± 0.551
IleGlu: 3.716 ± 0.63
IlePhe: 0.984 ± 0.241
IleGly: 3.169 ± 0.511
IleHis: 0.874 ± 0.186
IleIle: 2.295 ± 0.363
IleLys: 3.442 ± 0.447
IleLeu: 3.224 ± 0.563
IleMet: 1.093 ± 0.215
IleAsn: 2.787 ± 0.414
IlePro: 2.295 ± 0.35
IleGln: 1.912 ± 0.242
IleArg: 4.262 ± 0.397
IleSer: 3.606 ± 0.484
IleThr: 3.88 ± 0.656
IleVal: 2.076 ± 0.367
IleTrp: 0.492 ± 0.209
IleTyr: 1.366 ± 0.317
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.393 ± 0.603
LysCys: 0.382 ± 0.166
LysAsp: 3.497 ± 0.443
LysGlu: 3.716 ± 0.467
LysPhe: 1.366 ± 0.276
LysGly: 5.464 ± 1.053
LysHis: 0.765 ± 0.207
LysIle: 3.442 ± 0.376
LysLys: 3.552 ± 0.504
LysLeu: 5.082 ± 0.573
LysMet: 2.022 ± 0.316
LysAsn: 3.606 ± 0.465
LysPro: 2.568 ± 0.413
LysGln: 3.06 ± 0.488
LysArg: 3.224 ± 0.41
LysSer: 3.005 ± 0.373
LysThr: 3.552 ± 0.491
LysVal: 3.224 ± 0.46
LysTrp: 0.546 ± 0.181
LysTyr: 1.53 ± 0.263
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.469 ± 0.745
LeuCys: 1.038 ± 0.303
LeuAsp: 3.934 ± 0.4
LeuGlu: 4.207 ± 0.565
LeuPhe: 2.677 ± 0.385
LeuGly: 4.262 ± 0.48
LeuHis: 1.421 ± 0.271
LeuIle: 3.77 ± 0.503
LeuLys: 4.59 ± 0.488
LeuLeu: 6.229 ± 0.56
LeuMet: 2.24 ± 0.359
LeuAsn: 3.88 ± 0.539
LeuPro: 3.934 ± 0.478
LeuGln: 3.224 ± 0.697
LeuArg: 5.246 ± 0.564
LeuSer: 4.645 ± 0.474
LeuThr: 4.808 ± 0.475
LeuVal: 4.863 ± 0.389
LeuTrp: 0.71 ± 0.271
LeuTyr: 2.186 ± 0.306
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.333 ± 0.409
MetCys: 0.109 ± 0.081
MetAsp: 1.585 ± 0.221
MetGlu: 1.585 ± 0.262
MetPhe: 0.71 ± 0.191
MetGly: 1.585 ± 0.339
MetHis: 0.328 ± 0.099
MetIle: 1.038 ± 0.218
MetLys: 2.076 ± 0.314
MetLeu: 2.076 ± 0.308
MetMet: 0.874 ± 0.274
MetAsn: 1.585 ± 0.319
MetPro: 1.858 ± 0.32
MetGln: 1.366 ± 0.276
MetArg: 1.639 ± 0.283
MetSer: 1.967 ± 0.282
MetThr: 2.623 ± 0.344
MetVal: 1.475 ± 0.359
MetTrp: 0.273 ± 0.128
MetTyr: 0.492 ± 0.205
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.808 ± 0.703
AsnCys: 0.273 ± 0.129
AsnAsp: 2.35 ± 0.374
AsnGlu: 3.388 ± 0.359
AsnPhe: 1.53 ± 0.321
AsnGly: 3.388 ± 0.431
AsnHis: 1.257 ± 0.288
AsnIle: 2.404 ± 0.407
AsnLys: 2.131 ± 0.336
AsnLeu: 3.333 ± 0.432
AsnMet: 1.421 ± 0.232
AsnAsn: 1.749 ± 0.329
AsnPro: 1.858 ± 0.257
AsnGln: 2.131 ± 0.373
AsnArg: 3.005 ± 0.459
AsnSer: 2.623 ± 0.342
AsnThr: 2.131 ± 0.283
AsnVal: 2.514 ± 0.356
AsnTrp: 0.382 ± 0.135
AsnTyr: 1.257 ± 0.271
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.442 ± 0.565
ProCys: 0.328 ± 0.141
ProAsp: 4.481 ± 0.442
ProGlu: 4.918 ± 0.833
ProPhe: 1.257 ± 0.253
ProGly: 2.951 ± 0.552
ProHis: 0.382 ± 0.131
ProIle: 1.257 ± 0.352
ProLys: 2.732 ± 0.691
ProLeu: 2.787 ± 0.353
ProMet: 0.984 ± 0.21
ProAsn: 1.311 ± 0.264
ProPro: 1.147 ± 0.251
ProGln: 2.568 ± 0.504
ProArg: 2.022 ± 0.382
ProSer: 2.35 ± 0.342
ProThr: 1.749 ± 0.286
ProVal: 4.317 ± 0.435
ProTrp: 0.765 ± 0.201
ProTyr: 1.585 ± 0.311
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.535 ± 0.588
GlnCys: 0.765 ± 0.252
GlnAsp: 2.514 ± 0.322
GlnGlu: 3.169 ± 0.459
GlnPhe: 1.639 ± 0.307
GlnGly: 3.333 ± 0.689
GlnHis: 0.874 ± 0.226
GlnIle: 2.131 ± 0.46
GlnLys: 3.169 ± 0.466
GlnLeu: 3.552 ± 0.398
GlnMet: 1.038 ± 0.236
GlnAsn: 1.967 ± 0.326
GlnPro: 1.694 ± 0.41
GlnGln: 4.153 ± 0.722
GlnArg: 3.497 ± 0.572
GlnSer: 2.514 ± 0.33
GlnThr: 2.24 ± 0.338
GlnVal: 2.459 ± 0.44
GlnTrp: 0.765 ± 0.208
GlnTyr: 1.53 ± 0.255
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.535 ± 0.441
ArgCys: 0.492 ± 0.173
ArgAsp: 4.317 ± 0.679
ArgGlu: 5.3 ± 0.644
ArgPhe: 2.131 ± 0.296
ArgGly: 4.808 ± 1.043
ArgHis: 1.53 ± 0.289
ArgIle: 3.716 ± 0.444
ArgLys: 4.317 ± 0.522
ArgLeu: 5.3 ± 0.449
ArgMet: 1.967 ± 0.282
ArgAsn: 3.279 ± 0.481
ArgPro: 2.076 ± 0.334
ArgGln: 3.169 ± 0.435
ArgArg: 5.573 ± 0.612
ArgSer: 3.552 ± 0.464
ArgThr: 3.606 ± 0.527
ArgVal: 3.825 ± 0.477
ArgTrp: 0.984 ± 0.214
ArgTyr: 2.022 ± 0.445
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.011 ± 0.533
SerCys: 0.656 ± 0.223
SerAsp: 3.497 ± 0.413
SerGlu: 4.262 ± 0.457
SerPhe: 1.639 ± 0.303
SerGly: 5.027 ± 0.563
SerHis: 0.71 ± 0.157
SerIle: 2.568 ± 0.338
SerLys: 2.951 ± 0.452
SerLeu: 5.191 ± 0.652
SerMet: 1.585 ± 0.36
SerAsn: 2.459 ± 0.313
SerPro: 3.497 ± 0.463
SerGln: 3.279 ± 0.387
SerArg: 3.77 ± 0.613
SerSer: 3.005 ± 0.512
SerThr: 3.279 ± 0.479
SerVal: 3.825 ± 0.417
SerTrp: 1.038 ± 0.262
SerTyr: 1.53 ± 0.248
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.573 ± 0.468
ThrCys: 0.328 ± 0.137
ThrAsp: 3.115 ± 0.415
ThrGlu: 3.497 ± 0.401
ThrPhe: 2.131 ± 0.44
ThrGly: 5.901 ± 0.84
ThrHis: 1.038 ± 0.265
ThrIle: 3.77 ± 0.461
ThrLys: 2.787 ± 0.392
ThrLeu: 4.808 ± 0.439
ThrMet: 0.656 ± 0.175
ThrAsn: 1.585 ± 0.211
ThrPro: 3.442 ± 0.356
ThrGln: 1.749 ± 0.305
ThrArg: 2.787 ± 0.375
ThrSer: 4.043 ± 0.59
ThrThr: 2.951 ± 0.385
ThrVal: 4.371 ± 0.545
ThrTrp: 0.874 ± 0.211
ThrTyr: 1.366 ± 0.306
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.448 ± 0.515
ValCys: 1.038 ± 0.264
ValAsp: 3.88 ± 0.479
ValGlu: 3.279 ± 0.37
ValPhe: 2.131 ± 0.308
ValGly: 3.115 ± 0.485
ValHis: 0.71 ± 0.187
ValIle: 3.497 ± 0.514
ValLys: 3.716 ± 0.396
ValLeu: 5.464 ± 0.635
ValMet: 1.858 ± 0.288
ValAsn: 3.06 ± 0.353
ValPro: 2.677 ± 0.388
ValGln: 2.35 ± 0.364
ValArg: 5.082 ± 0.934
ValSer: 4.645 ± 0.484
ValThr: 3.825 ± 0.456
ValVal: 3.989 ± 0.44
ValTrp: 0.656 ± 0.178
ValTyr: 1.858 ± 0.312
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.984 ± 0.253
TrpCys: 0.219 ± 0.124
TrpAsp: 0.492 ± 0.175
TrpGlu: 0.437 ± 0.131
TrpPhe: 0.71 ± 0.171
TrpGly: 0.546 ± 0.17
TrpHis: 0.328 ± 0.129
TrpIle: 0.874 ± 0.228
TrpLys: 1.147 ± 0.22
TrpLeu: 1.53 ± 0.363
TrpMet: 0.929 ± 0.222
TrpAsn: 0.546 ± 0.185
TrpPro: 0.601 ± 0.241
TrpGln: 1.147 ± 0.203
TrpArg: 1.366 ± 0.29
TrpSer: 0.984 ± 0.197
TrpThr: 0.437 ± 0.127
TrpVal: 1.038 ± 0.254
TrpTrp: 0.328 ± 0.157
TrpTyr: 0.382 ± 0.132
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.732 ± 0.389
TyrCys: 0.273 ± 0.116
TyrAsp: 1.858 ± 0.3
TyrGlu: 1.366 ± 0.303
TyrPhe: 1.475 ± 0.265
TyrGly: 2.459 ± 0.352
TyrHis: 0.601 ± 0.21
TyrIle: 1.749 ± 0.312
TyrLys: 1.257 ± 0.282
TyrLeu: 1.366 ± 0.287
TyrMet: 0.82 ± 0.22
TyrAsn: 1.311 ± 0.268
TyrPro: 1.366 ± 0.233
TyrGln: 1.421 ± 0.251
TyrArg: 2.35 ± 0.48
TyrSer: 1.53 ± 0.315
TyrThr: 1.585 ± 0.379
TyrVal: 1.858 ± 0.335
TyrTrp: 0.601 ± 0.168
TyrTyr: 1.202 ± 0.242
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 87 proteins (18302 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
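A protein-level bootstrap can be sketched as follows in Python: whole proteins are resampled with replacement, the frequency is recomputed for each resampled set, and the spread of those estimates is reported. This is a minimal sketch under stated assumptions, not the database's implementation: the helper names, the fixed seed, the use of the sample standard deviation as the error, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level resamples (assumed: stdev)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resampled, pair))
    return statistics.stdev(estimates)


# Toy one-letter sequences, not taken from the phage proteome.
toy = ["MAAK", "MKAA", "AAAA", "MKKK"]
err = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.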

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski