Amino acid dipeptide frequency for Pseudomonas phage phiPsa374

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
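As an illustration of the counting described above, here is a minimal Python sketch. It assumes that dipeptides are overlapping pairs of adjacent residues within a protein, that frequencies are normalised by the total number of pairs, and that any non-standard letter is mapped to X (Xaa). The function names and the toy sequences are hypothetical, and the normalisation actually used for the table may differ.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400      # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for a, b in zip(seq, seq[1:]):
            # Assumption: anything outside the 20 standard letters becomes X.
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2)")
    alphabet = AMINO_ACIDS + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


def classify(freq_per_mille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "as expected"


# Toy input, not taken from the proteome.
toy = ["MAAAK", "MKAA"]
freqs = dipeptide_per_mille(toy)
print(len(freqs), classify(freqs["AA"]), classify(freqs["WW"]))
```

With the values from the table, `classify(8.497)` (AlaAla) gives "over" and `classify(0.035)` (CysTrp) gives "under" relative to the uniform expectation.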

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
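The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical; the AlaAla value is taken from the table, and the comparison value of 2.5‰ is the uniform expectation of 1/400.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


ala_ala = 8.497  # AlaAla frequency from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # the same value in percent
print(ala_ala / 2.5)                   # fold difference from the uniform 2.5 per mille
```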

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.497 ± 0.684
AlaCys: 1.209 ± 0.23
AlaAsp: 5.25 ± 0.405
AlaGlu: 5.457 ± 0.435
AlaPhe: 2.901 ± 0.276
AlaGly: 6.113 ± 0.513
AlaHis: 1.485 ± 0.243
AlaIle: 5.354 ± 0.448
AlaLys: 6.182 ± 0.522
AlaLeu: 8.22 ± 0.599
AlaMet: 2.176 ± 0.273
AlaAsn: 3.834 ± 0.448
AlaPro: 2.59 ± 0.34
AlaGln: 4.214 ± 0.386
AlaArg: 4.076 ± 0.393
AlaSer: 4.455 ± 0.479
AlaThr: 5.63 ± 0.609
AlaVal: 5.941 ± 0.549
AlaTrp: 1.312 ± 0.223
AlaTyr: 3.488 ± 0.329
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.725 ± 0.148
CysCys: 0.276 ± 0.108
CysAsp: 1.071 ± 0.194
CysGlu: 0.656 ± 0.164
CysPhe: 0.587 ± 0.156
CysGly: 0.76 ± 0.184
CysHis: 0.449 ± 0.13
CysIle: 0.414 ± 0.122
CysLys: 0.691 ± 0.164
CysLeu: 0.829 ± 0.174
CysMet: 0.587 ± 0.124
CysAsn: 0.898 ± 0.18
CysPro: 0.794 ± 0.207
CysGln: 0.38 ± 0.104
CysArg: 0.725 ± 0.162
CysSer: 0.518 ± 0.127
CysThr: 1.036 ± 0.196
CysVal: 0.691 ± 0.161
CysTrp: 0.035 ± 0.032
CysTyr: 0.691 ± 0.161
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.905 ± 0.43
AspCys: 1.002 ± 0.159
AspAsp: 3.108 ± 0.29
AspGlu: 3.73 ± 0.425
AspPhe: 3.074 ± 0.321
AspGly: 4.905 ± 0.425
AspHis: 1.174 ± 0.213
AspIle: 3.316 ± 0.348
AspLys: 4.214 ± 0.438
AspLeu: 4.974 ± 0.433
AspMet: 2.038 ± 0.295
AspAsn: 3.039 ± 0.359
AspPro: 2.003 ± 0.209
AspGln: 2.072 ± 0.273
AspArg: 3.074 ± 0.372
AspSer: 3.488 ± 0.379
AspThr: 3.454 ± 0.354
AspVal: 4.455 ± 0.399
AspTrp: 1.174 ± 0.207
AspTyr: 2.28 ± 0.296
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.252 ± 0.45
GluCys: 0.898 ± 0.185
GluAsp: 3.937 ± 0.462
GluGlu: 5.112 ± 0.531
GluPhe: 3.281 ± 0.321
GluGly: 3.972 ± 0.387
GluHis: 1.451 ± 0.233
GluIle: 4.11 ± 0.421
GluLys: 3.247 ± 0.441
GluLeu: 5.664 ± 0.442
GluMet: 2.487 ± 0.293
GluAsn: 2.452 ± 0.242
GluPro: 1.485 ± 0.274
GluGln: 2.867 ± 0.268
GluArg: 3.35 ± 0.385
GluSer: 3.488 ± 0.352
GluThr: 2.832 ± 0.294
GluVal: 4.628 ± 0.424
GluTrp: 1.071 ± 0.192
GluTyr: 2.349 ± 0.281
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.936 ± 0.362
PheCys: 0.311 ± 0.098
PheAsp: 2.59 ± 0.285
PheGlu: 3.178 ± 0.285
PhePhe: 1.554 ± 0.276
PheGly: 3.039 ± 0.342
PheHis: 0.656 ± 0.13
PheIle: 1.969 ± 0.248
PheLys: 1.934 ± 0.261
PheLeu: 2.729 ± 0.281
PheMet: 1.278 ± 0.196
PheAsn: 2.245 ± 0.286
PhePro: 1.658 ± 0.236
PheGln: 1.347 ± 0.209
PheArg: 1.623 ± 0.235
PheSer: 2.521 ± 0.27
PheThr: 2.59 ± 0.345
PheVal: 2.625 ± 0.324
PheTrp: 0.311 ± 0.148
PheTyr: 1.589 ± 0.23
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.354 ± 0.444
GlyCys: 1.14 ± 0.189
GlyAsp: 4.041 ± 0.423
GlyGlu: 4.352 ± 0.403
GlyPhe: 3.143 ± 0.314
GlyGly: 6.01 ± 0.671
GlyHis: 1.209 ± 0.234
GlyIle: 4.283 ± 0.38
GlyLys: 4.421 ± 0.5
GlyLeu: 5.837 ± 0.479
GlyMet: 2.072 ± 0.274
GlyAsn: 3.385 ± 0.449
GlyPro: 1.416 ± 0.262
GlyGln: 2.59 ± 0.358
GlyArg: 3.178 ± 0.293
GlySer: 4.974 ± 0.448
GlyThr: 4.248 ± 0.468
GlyVal: 5.975 ± 0.425
GlyTrp: 1.451 ± 0.207
GlyTyr: 3.73 ± 0.372
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.727 ± 0.214
HisCys: 0.207 ± 0.081
HisAsp: 1.036 ± 0.18
HisGlu: 1.105 ± 0.178
HisPhe: 1.14 ± 0.206
HisGly: 1.589 ± 0.224
HisHis: 0.484 ± 0.149
HisIle: 1.071 ± 0.197
HisLys: 1.174 ± 0.25
HisLeu: 1.105 ± 0.193
HisMet: 0.622 ± 0.134
HisAsn: 0.76 ± 0.151
HisPro: 0.656 ± 0.14
HisGln: 0.553 ± 0.144
HisArg: 0.725 ± 0.151
HisSer: 0.898 ± 0.172
HisThr: 1.485 ± 0.244
HisVal: 1.485 ± 0.249
HisTrp: 0.587 ± 0.136
HisTyr: 1.071 ± 0.191
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.628 ± 0.461
IleCys: 0.76 ± 0.172
IleAsp: 3.903 ± 0.365
IleGlu: 3.696 ± 0.408
IlePhe: 1.589 ± 0.228
IleGly: 4.11 ± 0.375
IleHis: 1.347 ± 0.198
IleIle: 2.867 ± 0.296
IleLys: 4.214 ± 0.391
IleLeu: 3.903 ± 0.361
IleMet: 1.658 ± 0.199
IleAsn: 3.212 ± 0.401
IlePro: 2.28 ± 0.262
IleGln: 2.245 ± 0.276
IleArg: 2.694 ± 0.296
IleSer: 3.143 ± 0.286
IleThr: 4.248 ± 0.388
IleVal: 4.283 ± 0.449
IleTrp: 0.656 ± 0.131
IleTyr: 1.727 ± 0.211
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.701 ± 0.542
LysCys: 0.414 ± 0.12
LysAsp: 3.488 ± 0.35
LysGlu: 3.972 ± 0.459
LysPhe: 1.9 ± 0.267
LysGly: 4.145 ± 0.447
LysHis: 1.036 ± 0.179
LysIle: 3.73 ± 0.312
LysLys: 3.316 ± 0.467
LysLeu: 5.112 ± 0.423
LysMet: 2.694 ± 0.328
LysAsn: 2.072 ± 0.277
LysPro: 2.452 ± 0.336
LysGln: 2.659 ± 0.311
LysArg: 2.729 ± 0.316
LysSer: 3.143 ± 0.346
LysThr: 3.108 ± 0.376
LysVal: 4.697 ± 0.392
LysTrp: 0.794 ± 0.179
LysTyr: 2.176 ± 0.326
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.633 ± 0.564
LeuCys: 0.76 ± 0.151
LeuAsp: 6.01 ± 0.436
LeuGlu: 5.768 ± 0.499
LeuPhe: 2.97 ± 0.352
LeuGly: 4.87 ± 0.431
LeuHis: 1.658 ± 0.261
LeuIle: 4.559 ± 0.377
LeuLys: 4.905 ± 0.462
LeuLeu: 5.423 ± 0.488
LeuMet: 2.521 ± 0.266
LeuAsn: 3.765 ± 0.389
LeuPro: 3.696 ± 0.352
LeuGln: 3.454 ± 0.33
LeuArg: 3.523 ± 0.286
LeuSer: 4.455 ± 0.378
LeuThr: 6.01 ± 0.529
LeuVal: 4.11 ± 0.356
LeuTrp: 1.071 ± 0.204
LeuTyr: 2.28 ± 0.271
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.868 ± 0.322
MetCys: 0.38 ± 0.116
MetAsp: 1.071 ± 0.188
MetGlu: 1.934 ± 0.295
MetPhe: 1.174 ± 0.235
MetGly: 1.554 ± 0.205
MetHis: 0.484 ± 0.166
MetIle: 1.934 ± 0.253
MetLys: 2.521 ± 0.358
MetLeu: 2.383 ± 0.302
MetMet: 1.071 ± 0.198
MetAsn: 1.52 ± 0.231
MetPro: 0.898 ± 0.207
MetGln: 1.209 ± 0.214
MetArg: 1.209 ± 0.204
MetSer: 1.692 ± 0.248
MetThr: 2.521 ± 0.244
MetVal: 1.9 ± 0.247
MetTrp: 0.414 ± 0.111
MetTyr: 0.898 ± 0.192
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.765 ± 0.413
AsnCys: 0.518 ± 0.142
AsnAsp: 2.625 ± 0.27
AsnGlu: 2.418 ± 0.254
AsnPhe: 1.036 ± 0.177
AsnGly: 4.421 ± 0.524
AsnHis: 1.002 ± 0.177
AsnIle: 2.936 ± 0.339
AsnLys: 2.798 ± 0.259
AsnLeu: 4.006 ± 0.514
AsnMet: 0.829 ± 0.165
AsnAsn: 2.349 ± 0.307
AsnPro: 2.659 ± 0.326
AsnGln: 1.52 ± 0.213
AsnArg: 2.349 ± 0.255
AsnSer: 2.349 ± 0.285
AsnThr: 3.108 ± 0.417
AsnVal: 3.419 ± 0.313
AsnTrp: 0.553 ± 0.134
AsnTyr: 1.727 ± 0.297
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.832 ± 0.322
ProCys: 0.518 ± 0.168
ProAsp: 2.452 ± 0.276
ProGlu: 2.832 ± 0.288
ProPhe: 1.312 ± 0.214
ProGly: 2.176 ± 0.289
ProHis: 0.863 ± 0.154
ProIle: 1.485 ± 0.254
ProLys: 2.038 ± 0.369
ProLeu: 2.763 ± 0.306
ProMet: 1.071 ± 0.212
ProAsn: 1.934 ± 0.249
ProPro: 0.863 ± 0.171
ProGln: 1.623 ± 0.262
ProArg: 1.589 ± 0.215
ProSer: 2.107 ± 0.258
ProThr: 2.729 ± 0.31
ProVal: 3.178 ± 0.358
ProTrp: 0.345 ± 0.127
ProTyr: 1.347 ± 0.201
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.386 ± 0.385
GlnCys: 0.414 ± 0.118
GlnAsp: 2.107 ± 0.295
GlnGlu: 2.349 ± 0.299
GlnPhe: 1.589 ± 0.219
GlnGly: 2.521 ± 0.293
GlnHis: 0.76 ± 0.131
GlnIle: 2.59 ± 0.309
GlnLys: 2.21 ± 0.31
GlnLeu: 3.488 ± 0.361
GlnMet: 1.658 ± 0.225
GlnAsn: 1.554 ± 0.228
GlnPro: 1.243 ± 0.183
GlnGln: 1.831 ± 0.263
GlnArg: 2.038 ± 0.283
GlnSer: 1.727 ± 0.25
GlnThr: 1.865 ± 0.239
GlnVal: 2.97 ± 0.377
GlnTrp: 0.725 ± 0.158
GlnTyr: 1.831 ± 0.249
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.454 ± 0.305
ArgCys: 0.484 ± 0.139
ArgAsp: 3.074 ± 0.365
ArgGlu: 3.247 ± 0.296
ArgPhe: 1.658 ± 0.206
ArgGly: 3.212 ± 0.339
ArgHis: 0.794 ± 0.19
ArgIle: 2.694 ± 0.319
ArgLys: 2.901 ± 0.359
ArgLeu: 4.248 ± 0.417
ArgMet: 1.416 ± 0.224
ArgAsn: 2.452 ± 0.275
ArgPro: 1.761 ± 0.241
ArgGln: 1.9 ± 0.196
ArgArg: 2.418 ± 0.315
ArgSer: 2.245 ± 0.284
ArgThr: 3.143 ± 0.297
ArgVal: 3.799 ± 0.356
ArgTrp: 0.656 ± 0.148
ArgTyr: 1.416 ± 0.233
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.801 ± 0.439
SerCys: 0.933 ± 0.221
SerAsp: 3.972 ± 0.32
SerGlu: 2.452 ± 0.281
SerPhe: 1.934 ± 0.253
SerGly: 5.215 ± 0.568
SerHis: 1.002 ± 0.176
SerIle: 2.936 ± 0.316
SerLys: 2.936 ± 0.291
SerLeu: 4.628 ± 0.44
SerMet: 1.278 ± 0.229
SerAsn: 2.729 ± 0.348
SerPro: 2.245 ± 0.293
SerGln: 2.003 ± 0.245
SerArg: 2.487 ± 0.294
SerSer: 3.868 ± 0.897
SerThr: 2.901 ± 0.382
SerVal: 3.73 ± 0.424
SerTrp: 0.829 ± 0.155
SerTyr: 2.521 ± 0.273
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.595 ± 0.572
ThrCys: 0.829 ± 0.155
ThrAsp: 3.834 ± 0.36
ThrGlu: 3.696 ± 0.336
ThrPhe: 2.763 ± 0.298
ThrGly: 5.526 ± 0.467
ThrHis: 1.036 ± 0.181
ThrIle: 4.697 ± 0.392
ThrLys: 2.763 ± 0.312
ThrLeu: 5.492 ± 0.488
ThrMet: 1.52 ± 0.194
ThrAsn: 2.383 ± 0.308
ThrPro: 3.074 ± 0.345
ThrGln: 2.659 ± 0.254
ThrArg: 2.798 ± 0.375
ThrSer: 3.627 ± 0.473
ThrThr: 5.388 ± 0.543
ThrVal: 4.974 ± 0.461
ThrTrp: 0.725 ± 0.154
ThrTyr: 1.934 ± 0.228
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.182 ± 0.509
ValCys: 1.14 ± 0.189
ValAsp: 4.352 ± 0.404
ValGlu: 5.388 ± 0.514
ValPhe: 3.108 ± 0.326
ValGly: 4.835 ± 0.45
ValHis: 1.243 ± 0.169
ValIle: 3.523 ± 0.316
ValLys: 4.697 ± 0.52
ValLeu: 4.732 ± 0.38
ValMet: 2.418 ± 0.272
ValAsn: 2.729 ± 0.375
ValPro: 2.659 ± 0.284
ValGln: 2.763 ± 0.289
ValArg: 3.557 ± 0.371
ValSer: 3.903 ± 0.384
ValThr: 5.077 ± 0.492
ValVal: 5.354 ± 0.542
ValTrp: 1.209 ± 0.172
ValTyr: 2.659 ± 0.269
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.243 ± 0.21
TrpCys: 0.069 ± 0.049
TrpAsp: 0.829 ± 0.146
TrpGlu: 1.243 ± 0.216
TrpPhe: 0.622 ± 0.132
TrpGly: 0.898 ± 0.166
TrpHis: 0.518 ± 0.125
TrpIle: 0.794 ± 0.155
TrpLys: 1.036 ± 0.192
TrpLeu: 1.485 ± 0.245
TrpMet: 0.207 ± 0.088
TrpAsn: 0.794 ± 0.165
TrpPro: 0.38 ± 0.112
TrpGln: 0.38 ± 0.111
TrpArg: 0.414 ± 0.127
TrpSer: 0.656 ± 0.125
TrpThr: 0.829 ± 0.183
TrpVal: 1.071 ± 0.191
TrpTrp: 0.276 ± 0.097
TrpTyr: 0.794 ± 0.165
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.901 ± 0.325
TyrCys: 0.553 ± 0.126
TyrAsp: 2.729 ± 0.353
TyrGlu: 2.349 ± 0.255
TyrPhe: 1.347 ± 0.216
TyrGly: 2.936 ± 0.345
TyrHis: 0.829 ± 0.179
TyrIle: 1.9 ± 0.259
TyrLys: 2.176 ± 0.3
TyrLeu: 2.418 ± 0.298
TyrMet: 1.002 ± 0.205
TyrAsn: 2.28 ± 0.277
TyrPro: 1.243 ± 0.185
TyrGln: 1.589 ± 0.272
TyrArg: 2.418 ± 0.256
TyrSer: 2.141 ± 0.297
TyrThr: 3.108 ± 0.332
TyrVal: 2.245 ± 0.305
TyrTrp: 0.345 ± 0.101
TyrTyr: 1.278 ± 0.213
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 173 proteins (28954 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
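One way such an error estimate could be obtained is sketched below in Python. The page states only that bootstrapping was done 100 times at the protein level, so the details here are assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the standard deviation of the resampled frequencies is reported. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation (per mille) of a dipeptide frequency over resamples."""
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input, not taken from the proteome.
toy_proteins = ["MAAAK", "MKAA", "MGGGAA", "MKKLAAV"]
print(bootstrap_error(toy_proteins, "AA"))
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so proteins with unusual composition contribute to the spread as whole units.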

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski