Amino acid dipepetide frequency for Salmonella phage 7-11

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.318AlaAla: 7.318 ± 0.98
0.635AlaCys: 0.635 ± 0.141
4.257AlaAsp: 4.257 ± 0.418
5.339AlaGlu: 5.339 ± 0.48
3.174AlaPhe: 3.174 ± 0.337
5.899AlaGly: 5.899 ± 0.716
1.606AlaHis: 1.606 ± 0.281
3.584AlaIle: 3.584 ± 0.343
5.302AlaLys: 5.302 ± 0.496
6.385AlaLeu: 6.385 ± 0.517
1.979AlaMet: 1.979 ± 0.266
3.846AlaAsn: 3.846 ± 0.417
3.248AlaPro: 3.248 ± 0.414
4.107AlaGln: 4.107 ± 0.633
4.854AlaArg: 4.854 ± 0.445
4.667AlaSer: 4.667 ± 0.489
4.854AlaThr: 4.854 ± 0.522
5.003AlaVal: 5.003 ± 0.43
0.709AlaTrp: 0.709 ± 0.176
2.427AlaTyr: 2.427 ± 0.387
0.0AlaXaa: 0.0 ± 0.0
Cys
0.56CysAla: 0.56 ± 0.152
0.075CysCys: 0.075 ± 0.05
1.232CysAsp: 1.232 ± 0.221
0.747CysGlu: 0.747 ± 0.163
0.597CysPhe: 0.597 ± 0.184
0.859CysGly: 0.859 ± 0.189
0.224CysHis: 0.224 ± 0.09
0.411CysIle: 0.411 ± 0.121
0.784CysLys: 0.784 ± 0.202
0.747CysLeu: 0.747 ± 0.154
0.336CysMet: 0.336 ± 0.116
0.821CysAsn: 0.821 ± 0.186
0.821CysPro: 0.821 ± 0.167
0.448CysGln: 0.448 ± 0.147
0.523CysArg: 0.523 ± 0.138
0.747CysSer: 0.747 ± 0.213
0.635CysThr: 0.635 ± 0.138
0.971CysVal: 0.971 ± 0.197
0.149CysTrp: 0.149 ± 0.07
0.485CysTyr: 0.485 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
5.526AspAla: 5.526 ± 0.437
0.635AspCys: 0.635 ± 0.157
4.257AspAsp: 4.257 ± 0.46
5.265AspGlu: 5.265 ± 0.631
2.838AspPhe: 2.838 ± 0.304
5.489AspGly: 5.489 ± 0.45
0.784AspHis: 0.784 ± 0.193
3.472AspIle: 3.472 ± 0.342
3.734AspLys: 3.734 ± 0.421
4.033AspLeu: 4.033 ± 0.361
1.904AspMet: 1.904 ± 0.275
3.398AspAsn: 3.398 ± 0.346
2.016AspPro: 2.016 ± 0.273
1.419AspGln: 1.419 ± 0.303
2.464AspArg: 2.464 ± 0.274
3.809AspSer: 3.809 ± 0.369
3.248AspThr: 3.248 ± 0.335
4.705AspVal: 4.705 ± 0.387
1.494AspTrp: 1.494 ± 0.248
1.904AspTyr: 1.904 ± 0.301
0.0AspXaa: 0.0 ± 0.0
Glu
6.721GluAla: 6.721 ± 0.582
0.859GluCys: 0.859 ± 0.179
4.443GluAsp: 4.443 ± 0.656
5.899GluGlu: 5.899 ± 0.628
3.323GluPhe: 3.323 ± 0.383
4.219GluGly: 4.219 ± 0.35
1.195GluHis: 1.195 ± 0.225
3.286GluIle: 3.286 ± 0.3
4.145GluLys: 4.145 ± 0.395
5.675GluLeu: 5.675 ± 0.478
2.24GluMet: 2.24 ± 0.315
2.875GluAsn: 2.875 ± 0.346
2.054GluPro: 2.054 ± 0.302
2.912GluGln: 2.912 ± 0.329
4.63GluArg: 4.63 ± 0.405
4.107GluSer: 4.107 ± 0.366
3.584GluThr: 3.584 ± 0.351
5.041GluVal: 5.041 ± 0.57
1.307GluTrp: 1.307 ± 0.192
3.062GluTyr: 3.062 ± 0.348
0.0GluXaa: 0.0 ± 0.0
Phe
2.763PheAla: 2.763 ± 0.307
0.672PheCys: 0.672 ± 0.165
3.472PheAsp: 3.472 ± 0.328
3.024PheGlu: 3.024 ± 0.414
1.531PhePhe: 1.531 ± 0.285
2.614PheGly: 2.614 ± 0.384
0.896PheHis: 0.896 ± 0.195
2.278PheIle: 2.278 ± 0.27
2.651PheLys: 2.651 ± 0.286
3.062PheLeu: 3.062 ± 0.472
1.157PheMet: 1.157 ± 0.236
2.651PheAsn: 2.651 ± 0.336
1.307PhePro: 1.307 ± 0.25
1.494PheGln: 1.494 ± 0.162
1.979PheArg: 1.979 ± 0.277
2.726PheSer: 2.726 ± 0.319
2.128PheThr: 2.128 ± 0.358
2.8PheVal: 2.8 ± 0.299
0.896PheTrp: 0.896 ± 0.18
1.344PheTyr: 1.344 ± 0.21
0.0PheXaa: 0.0 ± 0.0
Gly
5.302GlyAla: 5.302 ± 0.657
0.672GlyCys: 0.672 ± 0.148
4.219GlyAsp: 4.219 ± 0.423
4.891GlyGlu: 4.891 ± 0.377
2.352GlyPhe: 2.352 ± 0.356
5.489GlyGly: 5.489 ± 0.642
1.68GlyHis: 1.68 ± 0.281
3.472GlyIle: 3.472 ± 0.343
5.265GlyLys: 5.265 ± 0.441
5.227GlyLeu: 5.227 ± 0.408
1.942GlyMet: 1.942 ± 0.286
3.398GlyAsn: 3.398 ± 0.364
0.971GlyPro: 0.971 ± 0.218
2.539GlyGln: 2.539 ± 0.374
3.024GlyArg: 3.024 ± 0.279
5.563GlySer: 5.563 ± 0.494
3.809GlyThr: 3.809 ± 0.433
5.451GlyVal: 5.451 ± 0.483
1.307GlyTrp: 1.307 ± 0.233
3.062GlyTyr: 3.062 ± 0.316
0.0GlyXaa: 0.0 ± 0.0
His
1.083HisAla: 1.083 ± 0.193
0.336HisCys: 0.336 ± 0.122
1.12HisAsp: 1.12 ± 0.191
1.27HisGlu: 1.27 ± 0.253
1.008HisPhe: 1.008 ± 0.19
1.531HisGly: 1.531 ± 0.295
0.299HisHis: 0.299 ± 0.098
1.12HisIle: 1.12 ± 0.196
1.307HisLys: 1.307 ± 0.213
1.344HisLeu: 1.344 ± 0.221
0.672HisMet: 0.672 ± 0.152
0.933HisAsn: 0.933 ± 0.213
1.157HisPro: 1.157 ± 0.256
0.709HisGln: 0.709 ± 0.156
1.568HisArg: 1.568 ± 0.218
1.157HisSer: 1.157 ± 0.191
1.008HisThr: 1.008 ± 0.21
1.531HisVal: 1.531 ± 0.244
0.597HisTrp: 0.597 ± 0.139
0.859HisTyr: 0.859 ± 0.191
0.0HisXaa: 0.0 ± 0.0
Ile
4.07IleAla: 4.07 ± 0.38
0.672IleCys: 0.672 ± 0.158
3.809IleAsp: 3.809 ± 0.366
4.406IleGlu: 4.406 ± 0.444
1.568IlePhe: 1.568 ± 0.254
3.584IleGly: 3.584 ± 0.314
1.12IleHis: 1.12 ± 0.197
2.464IleIle: 2.464 ± 0.303
3.809IleLys: 3.809 ± 0.376
3.398IleLeu: 3.398 ± 0.346
1.568IleMet: 1.568 ± 0.208
2.427IleAsn: 2.427 ± 0.283
2.614IlePro: 2.614 ± 0.369
2.128IleGln: 2.128 ± 0.298
2.576IleArg: 2.576 ± 0.341
2.651IleSer: 2.651 ± 0.297
2.688IleThr: 2.688 ± 0.319
2.763IleVal: 2.763 ± 0.35
1.008IleTrp: 1.008 ± 0.181
1.867IleTyr: 1.867 ± 0.229
0.0IleXaa: 0.0 ± 0.0
Lys
6.833LysAla: 6.833 ± 0.763
0.859LysCys: 0.859 ± 0.216
4.07LysAsp: 4.07 ± 0.407
4.966LysGlu: 4.966 ± 0.421
2.091LysPhe: 2.091 ± 0.249
4.107LysGly: 4.107 ± 0.439
1.531LysHis: 1.531 ± 0.257
3.099LysIle: 3.099 ± 0.318
5.339LysLys: 5.339 ± 0.71
4.667LysLeu: 4.667 ± 0.416
1.979LysMet: 1.979 ± 0.303
3.024LysAsn: 3.024 ± 0.343
2.278LysPro: 2.278 ± 0.271
2.166LysGln: 2.166 ± 0.301
3.51LysArg: 3.51 ± 0.397
4.443LysSer: 4.443 ± 0.398
3.36LysThr: 3.36 ± 0.336
4.966LysVal: 4.966 ± 0.562
0.821LysTrp: 0.821 ± 0.148
2.128LysTyr: 2.128 ± 0.306
0.0LysXaa: 0.0 ± 0.0
Leu
4.742LeuAla: 4.742 ± 0.528
0.709LeuCys: 0.709 ± 0.157
4.257LeuAsp: 4.257 ± 0.374
5.153LeuGlu: 5.153 ± 0.527
2.8LeuPhe: 2.8 ± 0.386
3.883LeuGly: 3.883 ± 0.38
1.382LeuHis: 1.382 ± 0.201
4.145LeuIle: 4.145 ± 0.476
5.003LeuLys: 5.003 ± 0.457
5.078LeuLeu: 5.078 ± 0.508
2.502LeuMet: 2.502 ± 0.34
3.734LeuAsn: 3.734 ± 0.33
2.726LeuPro: 2.726 ± 0.292
3.622LeuGln: 3.622 ± 0.431
4.219LeuArg: 4.219 ± 0.354
4.667LeuSer: 4.667 ± 0.453
4.331LeuThr: 4.331 ± 0.369
4.779LeuVal: 4.779 ± 0.436
0.747LeuTrp: 0.747 ± 0.134
2.651LeuTyr: 2.651 ± 0.3
0.0LeuXaa: 0.0 ± 0.0
Met
2.614MetAla: 2.614 ± 0.324
0.336MetCys: 0.336 ± 0.097
1.456MetAsp: 1.456 ± 0.225
2.203MetGlu: 2.203 ± 0.251
0.859MetPhe: 0.859 ± 0.191
2.054MetGly: 2.054 ± 0.333
0.336MetHis: 0.336 ± 0.096
1.307MetIle: 1.307 ± 0.212
2.128MetLys: 2.128 ± 0.32
2.278MetLeu: 2.278 ± 0.302
0.821MetMet: 0.821 ± 0.166
1.718MetAsn: 1.718 ± 0.209
1.008MetPro: 1.008 ± 0.199
0.933MetGln: 0.933 ± 0.195
1.606MetArg: 1.606 ± 0.253
2.278MetSer: 2.278 ± 0.308
2.091MetThr: 2.091 ± 0.319
2.278MetVal: 2.278 ± 0.282
0.523MetTrp: 0.523 ± 0.139
0.709MetTyr: 0.709 ± 0.179
0.0MetXaa: 0.0 ± 0.0
Asn
3.734AsnAla: 3.734 ± 0.391
0.411AsnCys: 0.411 ± 0.136
2.539AsnAsp: 2.539 ± 0.283
3.136AsnGlu: 3.136 ± 0.306
2.726AsnPhe: 2.726 ± 0.361
3.51AsnGly: 3.51 ± 0.419
0.933AsnHis: 0.933 ± 0.184
2.95AsnIle: 2.95 ± 0.323
2.875AsnLys: 2.875 ± 0.318
4.033AsnLeu: 4.033 ± 0.398
1.531AsnMet: 1.531 ± 0.232
2.539AsnAsn: 2.539 ± 0.304
2.875AsnPro: 2.875 ± 0.336
1.942AsnGln: 1.942 ± 0.344
3.099AsnArg: 3.099 ± 0.277
2.651AsnSer: 2.651 ± 0.38
2.763AsnThr: 2.763 ± 0.407
2.95AsnVal: 2.95 ± 0.314
1.083AsnTrp: 1.083 ± 0.261
1.942AsnTyr: 1.942 ± 0.24
0.0AsnXaa: 0.0 ± 0.0
Pro
2.614ProAla: 2.614 ± 0.377
0.373ProCys: 0.373 ± 0.109
2.539ProAsp: 2.539 ± 0.283
3.435ProGlu: 3.435 ± 0.417
1.979ProPhe: 1.979 ± 0.344
1.904ProGly: 1.904 ± 0.306
0.896ProHis: 0.896 ± 0.168
1.68ProIle: 1.68 ± 0.212
2.614ProLys: 2.614 ± 0.295
2.315ProLeu: 2.315 ± 0.31
1.045ProMet: 1.045 ± 0.185
1.867ProAsn: 1.867 ± 0.253
0.784ProPro: 0.784 ± 0.164
1.867ProGln: 1.867 ± 0.356
1.568ProArg: 1.568 ± 0.228
1.942ProSer: 1.942 ± 0.236
2.576ProThr: 2.576 ± 0.374
2.8ProVal: 2.8 ± 0.303
0.635ProTrp: 0.635 ± 0.148
1.531ProTyr: 1.531 ± 0.22
0.0ProXaa: 0.0 ± 0.0
Gln
3.547GlnAla: 3.547 ± 0.461
0.485GlnCys: 0.485 ± 0.136
1.83GlnAsp: 1.83 ± 0.273
3.099GlnGlu: 3.099 ± 0.451
1.606GlnPhe: 1.606 ± 0.217
2.502GlnGly: 2.502 ± 0.403
1.083GlnHis: 1.083 ± 0.183
1.979GlnIle: 1.979 ± 0.287
2.016GlnLys: 2.016 ± 0.303
3.286GlnLeu: 3.286 ± 0.376
1.456GlnMet: 1.456 ± 0.253
1.867GlnAsn: 1.867 ± 0.247
1.382GlnPro: 1.382 ± 0.223
2.614GlnGln: 2.614 ± 0.506
2.166GlnArg: 2.166 ± 0.273
2.016GlnSer: 2.016 ± 0.242
2.576GlnThr: 2.576 ± 0.332
2.726GlnVal: 2.726 ± 0.304
0.859GlnTrp: 0.859 ± 0.196
1.643GlnTyr: 1.643 ± 0.307
0.0GlnXaa: 0.0 ± 0.0
Arg
3.958ArgAla: 3.958 ± 0.433
0.597ArgCys: 0.597 ± 0.169
3.211ArgAsp: 3.211 ± 0.288
3.883ArgGlu: 3.883 ± 0.43
2.427ArgPhe: 2.427 ± 0.263
3.211ArgGly: 3.211 ± 0.333
1.157ArgHis: 1.157 ± 0.199
2.8ArgIle: 2.8 ± 0.336
3.921ArgLys: 3.921 ± 0.486
3.958ArgLeu: 3.958 ± 0.438
1.979ArgMet: 1.979 ± 0.308
3.286ArgAsn: 3.286 ± 0.37
1.867ArgPro: 1.867 ± 0.252
2.24ArgGln: 2.24 ± 0.277
3.211ArgArg: 3.211 ± 0.453
2.651ArgSer: 2.651 ± 0.322
2.8ArgThr: 2.8 ± 0.355
3.734ArgVal: 3.734 ± 0.423
0.896ArgTrp: 0.896 ± 0.183
2.166ArgTyr: 2.166 ± 0.319
0.0ArgXaa: 0.0 ± 0.0
Ser
4.817SerAla: 4.817 ± 0.502
0.784SerCys: 0.784 ± 0.182
3.771SerAsp: 3.771 ± 0.422
3.622SerGlu: 3.622 ± 0.378
2.576SerPhe: 2.576 ± 0.356
5.638SerGly: 5.638 ± 0.503
1.232SerHis: 1.232 ± 0.224
3.062SerIle: 3.062 ± 0.395
3.136SerLys: 3.136 ± 0.375
3.883SerLeu: 3.883 ± 0.474
1.979SerMet: 1.979 ± 0.272
3.099SerAsn: 3.099 ± 0.306
2.352SerPro: 2.352 ± 0.31
2.502SerGln: 2.502 ± 0.309
3.659SerArg: 3.659 ± 0.419
3.883SerSer: 3.883 ± 0.662
3.472SerThr: 3.472 ± 0.349
4.593SerVal: 4.593 ± 0.447
1.12SerTrp: 1.12 ± 0.234
2.166SerTyr: 2.166 ± 0.248
0.0SerXaa: 0.0 ± 0.0
Thr
4.182ThrAla: 4.182 ± 0.537
0.784ThrCys: 0.784 ± 0.164
3.099ThrAsp: 3.099 ± 0.313
3.211ThrGlu: 3.211 ± 0.341
2.539ThrPhe: 2.539 ± 0.309
4.817ThrGly: 4.817 ± 0.503
1.27ThrHis: 1.27 ± 0.191
3.435ThrIle: 3.435 ± 0.327
3.921ThrLys: 3.921 ± 0.37
3.809ThrLeu: 3.809 ± 0.36
1.232ThrMet: 1.232 ± 0.204
2.464ThrAsn: 2.464 ± 0.29
2.95ThrPro: 2.95 ± 0.389
2.128ThrGln: 2.128 ± 0.313
2.502ThrArg: 2.502 ± 0.326
3.136ThrSer: 3.136 ± 0.381
3.472ThrThr: 3.472 ± 0.412
4.891ThrVal: 4.891 ± 0.525
0.933ThrTrp: 0.933 ± 0.221
2.091ThrTyr: 2.091 ± 0.258
0.0ThrXaa: 0.0 ± 0.0
Val
5.115ValAla: 5.115 ± 0.377
1.157ValCys: 1.157 ± 0.235
4.443ValAsp: 4.443 ± 0.389
4.331ValGlu: 4.331 ± 0.407
3.174ValPhe: 3.174 ± 0.335
4.555ValGly: 4.555 ± 0.346
1.344ValHis: 1.344 ± 0.22
4.033ValIle: 4.033 ± 0.375
5.19ValLys: 5.19 ± 0.463
4.593ValLeu: 4.593 ± 0.492
1.867ValMet: 1.867 ± 0.245
3.248ValAsn: 3.248 ± 0.403
2.651ValPro: 2.651 ± 0.327
2.614ValGln: 2.614 ± 0.275
3.584ValArg: 3.584 ± 0.349
5.078ValSer: 5.078 ± 0.529
4.891ValThr: 4.891 ± 0.483
5.937ValVal: 5.937 ± 0.591
1.27ValTrp: 1.27 ± 0.222
2.352ValTyr: 2.352 ± 0.254
0.0ValXaa: 0.0 ± 0.0
Trp
0.859TrpAla: 0.859 ± 0.187
0.523TrpCys: 0.523 ± 0.142
1.382TrpAsp: 1.382 ± 0.232
1.344TrpGlu: 1.344 ± 0.245
0.784TrpPhe: 0.784 ± 0.168
1.045TrpGly: 1.045 ± 0.16
0.523TrpHis: 0.523 ± 0.158
1.232TrpIle: 1.232 ± 0.184
0.709TrpLys: 0.709 ± 0.163
1.083TrpLeu: 1.083 ± 0.228
0.485TrpMet: 0.485 ± 0.137
0.971TrpAsn: 0.971 ± 0.189
0.411TrpPro: 0.411 ± 0.139
0.747TrpGln: 0.747 ± 0.147
0.933TrpArg: 0.933 ± 0.186
1.344TrpSer: 1.344 ± 0.277
0.896TrpThr: 0.896 ± 0.191
1.12TrpVal: 1.12 ± 0.213
0.373TrpTrp: 0.373 ± 0.128
0.523TrpTyr: 0.523 ± 0.159
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.838TyrAla: 2.838 ± 0.304
0.597TyrCys: 0.597 ± 0.127
2.987TyrAsp: 2.987 ± 0.339
2.352TyrGlu: 2.352 ± 0.265
1.494TyrPhe: 1.494 ± 0.274
2.726TyrGly: 2.726 ± 0.283
1.12TyrHis: 1.12 ± 0.222
1.382TyrIle: 1.382 ± 0.249
2.39TyrLys: 2.39 ± 0.281
2.278TyrLeu: 2.278 ± 0.302
0.821TyrMet: 0.821 ± 0.182
2.054TyrAsn: 2.054 ± 0.296
1.494TyrPro: 1.494 ± 0.291
1.606TyrGln: 1.606 ± 0.245
2.278TyrArg: 2.278 ± 0.268
1.867TyrSer: 1.867 ± 0.259
1.68TyrThr: 1.68 ± 0.316
2.39TyrVal: 2.39 ± 0.293
0.56TyrTrp: 0.56 ± 0.143
1.157TyrTyr: 1.157 ± 0.217
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 151 proteins (26783 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski