Amino acid dipeptide frequency for Pseudomonas phage PA10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random with equal probability, each of the 400 would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
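The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's own code: the function names `dipeptide_permille` and `classify` are hypothetical, and it assumes that pairs are counted as overlapping windows within each protein and normalised by the total number of pairs, which the page does not state.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation over the 400 standard dipeptides: 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one sequence of length >= 2")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation (red/blue in the table)."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


# Toy input: "MAAK" gives MA, AA, AK and "AAC" gives AA, AC, so AA is 2 of 5 pairs.
freqs = dipeptide_permille(["MAAK", "AAC"])
print(freqs["AA"], classify(freqs["AA"]))
```

The table uses three-letter codes (AlaAla rather than AA); the one-letter alphabet here is only for brevity.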

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
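As a small worked example of the unit conversion, the sketch below turns two values from the table (AlaAla and CysCys) into fractions and compares them with the uniform expectation of 2.5‰. The helper names are hypothetical and chosen only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_vs_uniform(value_permille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation (1000/400 = 2.5 per mille)."""
    return value_permille / (1000.0 / n_dipeptides)


ala_ala = 6.666  # AlaAla, per mille, from the table
cys_cys = 0.137  # CysCys, per mille, from the table

print(permille_to_fraction(ala_ala))  # fraction of all dipeptides
print(fold_vs_uniform(ala_ala))       # > 1: more frequent than the uniform expectation
print(fold_vs_uniform(cys_cys))       # < 1: less frequent than the uniform expectation
```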

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.666 ± 0.704
AlaCys: 1.004 ± 0.206
AlaAsp: 4.201 ± 0.448
AlaGlu: 5.89 ± 0.488
AlaPhe: 2.968 ± 0.353
AlaGly: 5.251 ± 0.481
AlaHis: 1.781 ± 0.265
AlaIle: 5.388 ± 0.429
AlaLys: 4.84 ± 0.536
AlaLeu: 6.529 ± 0.699
AlaMet: 2.785 ± 0.328
AlaAsn: 3.15 ± 0.37
AlaPro: 1.872 ± 0.285
AlaGln: 3.333 ± 0.47
AlaArg: 4.885 ± 0.449
AlaSer: 4.064 ± 0.466
AlaThr: 5.114 ± 0.537
AlaVal: 5.57 ± 0.49
AlaTrp: 1.461 ± 0.294
AlaTyr: 2.922 ± 0.361
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.959 ± 0.24
CysCys: 0.137 ± 0.086
CysAsp: 0.868 ± 0.223
CysGlu: 1.187 ± 0.26
CysPhe: 0.365 ± 0.123
CysGly: 1.004 ± 0.231
CysHis: 0.228 ± 0.102
CysIle: 0.822 ± 0.19
CysLys: 1.004 ± 0.224
CysLeu: 0.548 ± 0.153
CysMet: 0.457 ± 0.134
CysAsn: 1.05 ± 0.211
CysPro: 0.411 ± 0.173
CysGln: 0.457 ± 0.128
CysArg: 0.776 ± 0.208
CysSer: 0.685 ± 0.168
CysThr: 0.32 ± 0.123
CysVal: 0.776 ± 0.19
CysTrp: 0.32 ± 0.116
CysTyr: 0.548 ± 0.175
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.657 ± 0.445
AspCys: 1.141 ± 0.259
AspAsp: 3.561 ± 0.384
AspGlu: 3.835 ± 0.476
AspPhe: 3.059 ± 0.357
AspGly: 4.977 ± 0.493
AspHis: 1.278 ± 0.268
AspIle: 4.246 ± 0.481
AspLys: 3.47 ± 0.378
AspLeu: 5.296 ± 0.436
AspMet: 1.918 ± 0.317
AspAsn: 2.831 ± 0.398
AspPro: 3.013 ± 0.365
AspGln: 1.781 ± 0.297
AspArg: 3.607 ± 0.432
AspSer: 3.287 ± 0.451
AspThr: 3.242 ± 0.409
AspVal: 4.246 ± 0.494
AspTrp: 1.507 ± 0.289
AspTyr: 2.694 ± 0.368
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.168 ± 0.46
GluCys: 1.096 ± 0.234
GluAsp: 5.662 ± 0.446
GluGlu: 7.808 ± 0.681
GluPhe: 3.47 ± 0.393
GluGly: 5.205 ± 0.398
GluHis: 1.552 ± 0.269
GluIle: 3.744 ± 0.378
GluLys: 4.201 ± 0.476
GluLeu: 7.168 ± 0.689
GluMet: 2.055 ± 0.325
GluAsn: 2.694 ± 0.391
GluPro: 2.055 ± 0.328
GluGln: 3.059 ± 0.373
GluArg: 3.79 ± 0.503
GluSer: 3.47 ± 0.429
GluThr: 3.561 ± 0.416
GluVal: 5.753 ± 0.6
GluTrp: 1.918 ± 0.284
GluTyr: 3.333 ± 0.394
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.922 ± 0.367
PheCys: 0.411 ± 0.162
PheAsp: 3.013 ± 0.491
PheGlu: 3.835 ± 0.428
PhePhe: 1.735 ± 0.274
PheGly: 2.557 ± 0.364
PheHis: 0.868 ± 0.184
PheIle: 1.872 ± 0.311
PheLys: 3.607 ± 0.376
PheLeu: 2.785 ± 0.325
PheMet: 1.096 ± 0.242
PheAsn: 2.237 ± 0.369
PhePro: 1.552 ± 0.219
PheGln: 1.552 ± 0.243
PheArg: 2.237 ± 0.314
PheSer: 2.42 ± 0.339
PheThr: 2.831 ± 0.443
PheVal: 2.648 ± 0.332
PheTrp: 0.548 ± 0.127
PheTyr: 1.278 ± 0.215
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.429 ± 0.469
GlyCys: 1.141 ± 0.211
GlyAsp: 4.52 ± 0.418
GlyGlu: 5.068 ± 0.579
GlyPhe: 3.698 ± 0.375
GlyGly: 5.57 ± 0.741
GlyHis: 1.05 ± 0.236
GlyIle: 3.607 ± 0.434
GlyLys: 4.977 ± 0.449
GlyLeu: 5.342 ± 0.583
GlyMet: 1.781 ± 0.28
GlyAsn: 3.105 ± 0.412
GlyPro: 1.187 ± 0.259
GlyGln: 2.922 ± 0.364
GlyArg: 3.698 ± 0.416
GlySer: 4.566 ± 0.431
GlyThr: 4.018 ± 0.453
GlyVal: 5.616 ± 0.421
GlyTrp: 1.37 ± 0.211
GlyTyr: 3.059 ± 0.378
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.644 ± 0.286
HisCys: 0.228 ± 0.096
HisAsp: 1.187 ± 0.25
HisGlu: 0.731 ± 0.17
HisPhe: 0.913 ± 0.258
HisGly: 1.552 ± 0.255
HisHis: 0.365 ± 0.176
HisIle: 1.141 ± 0.187
HisLys: 1.233 ± 0.264
HisLeu: 1.644 ± 0.295
HisMet: 0.639 ± 0.167
HisAsn: 0.594 ± 0.15
HisPro: 1.096 ± 0.216
HisGln: 0.502 ± 0.142
HisArg: 0.959 ± 0.296
HisSer: 1.233 ± 0.225
HisThr: 1.05 ± 0.251
HisVal: 1.37 ± 0.235
HisTrp: 0.457 ± 0.156
HisTyr: 0.731 ± 0.138
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.292 ± 0.448
IleCys: 0.502 ± 0.131
IleAsp: 4.155 ± 0.447
IleGlu: 4.155 ± 0.452
IlePhe: 1.598 ± 0.241
IleGly: 3.424 ± 0.373
IleHis: 1.507 ± 0.296
IleIle: 2.511 ± 0.39
IleLys: 3.835 ± 0.433
IleLeu: 3.972 ± 0.374
IleMet: 1.507 ± 0.308
IleAsn: 2.603 ± 0.416
IlePro: 2.557 ± 0.365
IleGln: 2.237 ± 0.302
IleArg: 3.653 ± 0.424
IleSer: 3.379 ± 0.421
IleThr: 2.329 ± 0.373
IleVal: 3.47 ± 0.339
IleTrp: 0.639 ± 0.154
IleTyr: 1.735 ± 0.26
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.301 ± 0.697
LysCys: 0.457 ± 0.119
LysAsp: 4.611 ± 0.478
LysGlu: 6.027 ± 0.681
LysPhe: 1.963 ± 0.314
LysGly: 4.931 ± 0.463
LysHis: 0.959 ± 0.234
LysIle: 3.287 ± 0.399
LysLys: 4.201 ± 0.377
LysLeu: 5.159 ± 0.468
LysMet: 2.237 ± 0.265
LysAsn: 2.192 ± 0.324
LysPro: 1.963 ± 0.329
LysGln: 2.283 ± 0.332
LysArg: 3.379 ± 0.404
LysSer: 3.835 ± 0.489
LysThr: 3.105 ± 0.359
LysVal: 4.429 ± 0.436
LysTrp: 0.959 ± 0.231
LysTyr: 2.42 ± 0.348
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.525 ± 0.586
LeuCys: 0.776 ± 0.185
LeuAsp: 5.89 ± 0.473
LeuGlu: 5.844 ± 0.565
LeuPhe: 2.831 ± 0.378
LeuGly: 5.89 ± 0.662
LeuHis: 1.507 ± 0.264
LeuIle: 4.018 ± 0.466
LeuLys: 5.525 ± 0.548
LeuLeu: 4.885 ± 0.544
LeuMet: 2.466 ± 0.248
LeuAsn: 3.196 ± 0.45
LeuPro: 3.607 ± 0.353
LeuGln: 2.648 ± 0.272
LeuArg: 4.246 ± 0.401
LeuSer: 5.57 ± 0.568
LeuThr: 4.52 ± 0.538
LeuVal: 4.657 ± 0.51
LeuTrp: 1.233 ± 0.27
LeuTyr: 2.831 ± 0.377
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.698 ± 0.427
MetCys: 0.411 ± 0.141
MetAsp: 1.278 ± 0.222
MetGlu: 2.922 ± 0.335
MetPhe: 1.096 ± 0.189
MetGly: 2.1 ± 0.305
MetHis: 0.183 ± 0.085
MetIle: 1.415 ± 0.241
MetLys: 1.918 ± 0.262
MetLeu: 1.826 ± 0.252
MetMet: 1.278 ± 0.288
MetAsn: 1.324 ± 0.269
MetPro: 1.096 ± 0.217
MetGln: 1.735 ± 0.301
MetArg: 1.096 ± 0.21
MetSer: 2.009 ± 0.26
MetThr: 2.192 ± 0.278
MetVal: 1.187 ± 0.229
MetTrp: 0.365 ± 0.113
MetTyr: 1.096 ± 0.199
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.694 ± 0.366
AsnCys: 0.365 ± 0.113
AsnAsp: 2.237 ± 0.377
AsnGlu: 2.283 ± 0.265
AsnPhe: 1.963 ± 0.266
AsnGly: 3.287 ± 0.452
AsnHis: 1.004 ± 0.223
AsnIle: 2.694 ± 0.374
AsnLys: 2.648 ± 0.355
AsnLeu: 4.109 ± 0.627
AsnMet: 1.233 ± 0.238
AsnAsn: 1.963 ± 0.279
AsnPro: 2.192 ± 0.412
AsnGln: 1.415 ± 0.283
AsnArg: 1.918 ± 0.273
AsnSer: 2.694 ± 0.341
AsnThr: 2.283 ± 0.321
AsnVal: 3.561 ± 0.439
AsnTrp: 0.639 ± 0.189
AsnTyr: 1.278 ± 0.2
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.333 ± 0.39
ProCys: 0.365 ± 0.113
ProAsp: 2.192 ± 0.37
ProGlu: 3.835 ± 0.373
ProPhe: 1.598 ± 0.255
ProGly: 2.511 ± 0.337
ProHis: 0.959 ± 0.254
ProIle: 1.598 ± 0.357
ProLys: 2.146 ± 0.315
ProLeu: 2.374 ± 0.281
ProMet: 1.096 ± 0.213
ProAsn: 1.233 ± 0.272
ProPro: 1.096 ± 0.22
ProGln: 1.05 ± 0.212
ProArg: 1.278 ± 0.242
ProSer: 2.192 ± 0.317
ProThr: 2.374 ± 0.31
ProVal: 3.47 ± 0.373
ProTrp: 0.32 ± 0.123
ProTyr: 1.05 ± 0.247
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.561 ± 0.386
GlnCys: 0.548 ± 0.159
GlnAsp: 1.644 ± 0.23
GlnGlu: 2.511 ± 0.438
GlnPhe: 1.872 ± 0.284
GlnGly: 2.009 ± 0.425
GlnHis: 0.776 ± 0.193
GlnIle: 2.009 ± 0.329
GlnLys: 1.735 ± 0.284
GlnLeu: 3.242 ± 0.371
GlnMet: 1.461 ± 0.26
GlnAsn: 1.187 ± 0.206
GlnPro: 0.959 ± 0.257
GlnGln: 0.913 ± 0.167
GlnArg: 2.1 ± 0.262
GlnSer: 2.283 ± 0.352
GlnThr: 1.781 ± 0.302
GlnVal: 2.968 ± 0.314
GlnTrp: 0.548 ± 0.157
GlnTyr: 1.461 ± 0.246
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.292 ± 0.428
ArgCys: 0.685 ± 0.14
ArgAsp: 3.196 ± 0.441
ArgGlu: 4.201 ± 0.458
ArgPhe: 2.283 ± 0.39
ArgGly: 3.424 ± 0.406
ArgHis: 0.959 ± 0.21
ArgIle: 2.694 ± 0.351
ArgLys: 3.972 ± 0.412
ArgLeu: 4.611 ± 0.481
ArgMet: 1.415 ± 0.241
ArgAsn: 2.055 ± 0.285
ArgPro: 2.283 ± 0.275
ArgGln: 1.963 ± 0.271
ArgArg: 3.242 ± 0.387
ArgSer: 2.876 ± 0.369
ArgThr: 2.557 ± 0.34
ArgVal: 4.155 ± 0.383
ArgTrp: 1.324 ± 0.271
ArgTyr: 2.009 ± 0.328
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.064 ± 0.388
SerCys: 0.776 ± 0.192
SerAsp: 3.79 ± 0.415
SerGlu: 4.52 ± 0.432
SerPhe: 2.283 ± 0.323
SerGly: 3.972 ± 0.401
SerHis: 0.913 ± 0.202
SerIle: 2.876 ± 0.365
SerLys: 3.607 ± 0.372
SerLeu: 4.52 ± 0.42
SerMet: 1.415 ± 0.243
SerAsn: 2.694 ± 0.442
SerPro: 2.1 ± 0.29
SerGln: 1.689 ± 0.306
SerArg: 3.881 ± 0.49
SerSer: 3.013 ± 0.459
SerThr: 2.968 ± 0.302
SerVal: 5.388 ± 0.454
SerTrp: 0.685 ± 0.128
SerTyr: 2.146 ± 0.269
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.246 ± 0.491
ThrCys: 0.822 ± 0.209
ThrAsp: 2.831 ± 0.336
ThrGlu: 4.566 ± 0.444
ThrPhe: 2.739 ± 0.296
ThrGly: 4.474 ± 0.386
ThrHis: 1.096 ± 0.209
ThrIle: 3.059 ± 0.338
ThrLys: 2.876 ± 0.422
ThrLeu: 4.931 ± 0.511
ThrMet: 1.278 ± 0.251
ThrAsn: 2.237 ± 0.312
ThrPro: 2.192 ± 0.371
ThrGln: 1.644 ± 0.246
ThrArg: 2.557 ± 0.374
ThrSer: 2.831 ± 0.388
ThrThr: 2.648 ± 0.28
ThrVal: 4.611 ± 0.46
ThrTrp: 0.822 ± 0.189
ThrTyr: 2.009 ± 0.308
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.57 ± 0.545
ValCys: 0.959 ± 0.274
ValAsp: 4.611 ± 0.476
ValGlu: 5.753 ± 0.475
ValPhe: 3.287 ± 0.361
ValGly: 4.931 ± 0.45
ValHis: 1.461 ± 0.268
ValIle: 4.383 ± 0.467
ValLys: 5.114 ± 0.461
ValLeu: 4.84 ± 0.377
ValMet: 2.466 ± 0.337
ValAsn: 3.287 ± 0.411
ValPro: 2.968 ± 0.408
ValGln: 2.374 ± 0.322
ValArg: 3.744 ± 0.362
ValSer: 3.972 ± 0.444
ValThr: 4.338 ± 0.4
ValVal: 5.388 ± 0.67
ValTrp: 1.141 ± 0.248
ValTyr: 2.146 ± 0.35
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.05 ± 0.215
TrpCys: 0.365 ± 0.122
TrpAsp: 1.507 ± 0.254
TrpGlu: 1.644 ± 0.259
TrpPhe: 0.639 ± 0.178
TrpGly: 1.05 ± 0.234
TrpHis: 0.457 ± 0.127
TrpIle: 0.639 ± 0.182
TrpLys: 1.141 ± 0.275
TrpLeu: 1.415 ± 0.245
TrpMet: 0.731 ± 0.164
TrpAsn: 0.685 ± 0.182
TrpPro: 0.457 ± 0.149
TrpGln: 0.411 ± 0.134
TrpArg: 1.096 ± 0.219
TrpSer: 0.913 ± 0.267
TrpThr: 0.913 ± 0.227
TrpVal: 1.05 ± 0.269
TrpTrp: 0.228 ± 0.098
TrpTyr: 0.502 ± 0.137
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.603 ± 0.402
TyrCys: 0.731 ± 0.234
TyrAsp: 2.694 ± 0.302
TyrGlu: 2.146 ± 0.409
TyrPhe: 1.644 ± 0.281
TyrGly: 2.603 ± 0.367
TyrHis: 0.365 ± 0.122
TyrIle: 2.1 ± 0.322
TyrLys: 2.648 ± 0.373
TyrLeu: 2.466 ± 0.342
TyrMet: 0.868 ± 0.159
TyrAsn: 2.146 ± 0.27
TyrPro: 1.415 ± 0.254
TyrGln: 1.598 ± 0.246
TyrArg: 2.055 ± 0.297
TyrSer: 2.009 ± 0.322
TyrThr: 2.329 ± 0.374
TyrVal: 2.42 ± 0.338
TyrTrp: 0.411 ± 0.139
TyrTyr: 1.415 ± 0.25
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 127 proteins (21903 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
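A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions rather than the database's procedure: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the reported error is taken to be the standard deviation across resamples (the page does not say which statistic is used). The function name `bootstrap_permille` and its parameters are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Resampling is done at the protein level: each resample draws
    len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]  # count once, reuse per resample
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome in one-letter code; real input would be the 127 protein sequences.
toy = ["MAAKAA", "AACG", "MKKAA", "GGAAC"]
mean, err = bootstrap_permille(toy, "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.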

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski