Amino acid dipeptide frequency for Serratia phage phiMAM1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
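
As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, and the decisions to count overlapping pairs and never pair residues across protein boundaries are assumptions made for this example; the page does not state how Proteome-pI computes its table.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Overlapping pairs are counted inside each protein only; any residue
    outside the 20 standard codes is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the phiMAM1 proteome)
freqs = dipeptide_permille(["MAAK", "MAB"])
print(freqs["MA"])  # 400.0 (2 of the 5 pairs are Met-Ala)
```

Here "B" is not one of the 20 standard codes, so the second sequence contributes the pairs MA and AX.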

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue, in the table.
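
The arithmetic behind the expected value can be sketched as follows. The helper names `expected_permille` and `classify` are hypothetical, and the page does not state the exact rule used for the red and blue marking; the sketch assumes a plain comparison with the uniform expectation.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(value_permille, alphabet_size=20):
    """Compare an observed frequency with the uniform expectation.

    Assumption: a plain greater/less comparison; the real colouring rule
    of the table is not documented on the page.
    """
    expected = expected_permille(alphabet_size)
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


print(expected_permille(20))            # 2.5   (1/400 = 0.25%)
print(round(expected_permille(21), 3))  # 2.268 (1/441, with Xaa included)
print(classify(6.688))                  # over  (AlaAla value from the table)
print(classify(0.208))                  # under (CysCys value from the table)
```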

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 6.688 ± 0.494 | 0.667 ± 0.131 | 4.875 ± 0.348 | 5.0 ± 0.364 | 2.667 ± 0.25 | 5.438 ± 0.412 | 1.625 ± 0.181 | 4.75 ± 0.28 | 4.563 ± 0.314 | 5.563 ± 0.346 | 2.167 ± 0.209 | 3.667 ± 0.302 | 2.75 ± 0.236 | 2.917 ± 0.248 | 3.521 ± 0.252 | 3.792 ± 0.303 | 4.375 ± 0.329 | 5.771 ± 0.378 | 1.229 ± 0.155 | 2.521 ± 0.269 | 0.0 ± 0.0
Cys | 0.625 ± 0.109 | 0.208 ± 0.059 | 0.833 ± 0.145 | 0.646 ± 0.111 | 0.229 ± 0.079 | 0.854 ± 0.148 | 0.292 ± 0.084 | 0.542 ± 0.119 | 0.417 ± 0.116 | 0.875 ± 0.134 | 0.354 ± 0.074 | 0.771 ± 0.12 | 0.604 ± 0.122 | 0.271 ± 0.084 | 0.813 ± 0.131 | 0.792 ± 0.126 | 0.792 ± 0.149 | 1.021 ± 0.163 | 0.083 ± 0.042 | 0.333 ± 0.085 | 0.0 ± 0.0
Asp | 4.646 ± 0.287 | 0.625 ± 0.108 | 3.667 ± 0.372 | 3.521 ± 0.273 | 2.854 ± 0.246 | 5.438 ± 0.316 | 1.458 ± 0.189 | 4.438 ± 0.291 | 3.479 ± 0.272 | 5.188 ± 0.304 | 2.063 ± 0.242 | 3.292 ± 0.236 | 2.542 ± 0.246 | 2.146 ± 0.216 | 2.646 ± 0.235 | 3.209 ± 0.249 | 3.875 ± 0.3 | 4.938 ± 0.316 | 0.792 ± 0.134 | 3.146 ± 0.313 | 0.0 ± 0.0
Glu | 5.396 ± 0.416 | 0.771 ± 0.162 | 3.959 ± 0.339 | 3.75 ± 0.326 | 3.167 ± 0.267 | 4.23 ± 0.263 | 1.583 ± 0.18 | 4.646 ± 0.366 | 3.959 ± 0.312 | 5.605 ± 0.35 | 2.354 ± 0.235 | 2.834 ± 0.256 | 1.896 ± 0.223 | 2.542 ± 0.243 | 3.084 ± 0.335 | 3.084 ± 0.224 | 3.042 ± 0.268 | 4.625 ± 0.285 | 1.042 ± 0.126 | 2.792 ± 0.234 | 0.0 ± 0.0
Phe | 2.396 ± 0.278 | 0.542 ± 0.118 | 2.979 ± 0.289 | 3.146 ± 0.277 | 1.563 ± 0.181 | 2.959 ± 0.301 | 0.667 ± 0.146 | 2.813 ± 0.232 | 2.417 ± 0.235 | 2.875 ± 0.212 | 1.292 ± 0.183 | 2.334 ± 0.233 | 1.542 ± 0.229 | 1.563 ± 0.169 | 2.313 ± 0.214 | 2.938 ± 0.27 | 2.896 ± 0.243 | 2.709 ± 0.224 | 0.667 ± 0.116 | 1.771 ± 0.207 | 0.0 ± 0.0
Gly | 4.438 ± 0.402 | 0.646 ± 0.113 | 5.271 ± 0.335 | 4.125 ± 0.357 | 3.021 ± 0.236 | 5.438 ± 0.508 | 0.896 ± 0.12 | 4.438 ± 0.337 | 4.813 ± 0.317 | 5.063 ± 0.271 | 2.042 ± 0.212 | 3.459 ± 0.375 | 1.375 ± 0.198 | 3.021 ± 0.252 | 3.25 ± 0.294 | 4.334 ± 0.385 | 4.48 ± 0.525 | 5.771 ± 0.394 | 1.167 ± 0.178 | 2.688 ± 0.308 | 0.0 ± 0.0
His | 1.188 ± 0.167 | 0.292 ± 0.109 | 1.021 ± 0.154 | 1.0 ± 0.142 | 1.104 ± 0.179 | 1.021 ± 0.14 | 0.479 ± 0.109 | 1.083 ± 0.189 | 1.042 ± 0.145 | 1.667 ± 0.185 | 0.625 ± 0.111 | 0.729 ± 0.118 | 0.792 ± 0.123 | 0.542 ± 0.113 | 1.146 ± 0.178 | 1.083 ± 0.156 | 1.0 ± 0.174 | 1.771 ± 0.191 | 0.271 ± 0.075 | 0.792 ± 0.116 | 0.0 ± 0.0
Ile | 3.979 ± 0.266 | 0.417 ± 0.085 | 4.48 ± 0.362 | 4.834 ± 0.314 | 1.896 ± 0.211 | 3.438 ± 0.294 | 1.271 ± 0.203 | 3.646 ± 0.252 | 4.271 ± 0.265 | 3.938 ± 0.33 | 1.583 ± 0.195 | 3.854 ± 0.294 | 2.854 ± 0.266 | 2.479 ± 0.214 | 3.146 ± 0.251 | 3.709 ± 0.317 | 4.042 ± 0.28 | 3.813 ± 0.295 | 0.75 ± 0.126 | 1.646 ± 0.176 | 0.0 ± 0.0
Lys | 4.625 ± 0.37 | 0.708 ± 0.132 | 3.646 ± 0.29 | 4.188 ± 0.305 | 2.75 ± 0.222 | 3.729 ± 0.293 | 1.188 ± 0.181 | 3.563 ± 0.282 | 4.021 ± 0.399 | 4.563 ± 0.366 | 2.063 ± 0.236 | 2.729 ± 0.268 | 2.771 ± 0.295 | 2.604 ± 0.224 | 2.979 ± 0.239 | 4.167 ± 0.276 | 3.854 ± 0.364 | 4.355 ± 0.319 | 0.917 ± 0.145 | 2.271 ± 0.24 | 0.0 ± 0.0
Leu | 6.021 ± 0.429 | 0.813 ± 0.103 | 4.98 ± 0.371 | 5.625 ± 0.399 | 3.063 ± 0.245 | 5.0 ± 0.285 | 1.333 ± 0.179 | 3.625 ± 0.227 | 6.042 ± 0.321 | 5.209 ± 0.332 | 1.979 ± 0.219 | 4.084 ± 0.233 | 3.25 ± 0.242 | 2.125 ± 0.22 | 4.063 ± 0.252 | 5.146 ± 0.368 | 4.584 ± 0.307 | 6.001 ± 0.431 | 0.75 ± 0.152 | 2.688 ± 0.211 | 0.0 ± 0.0
Met | 2.542 ± 0.262 | 0.375 ± 0.094 | 1.542 ± 0.178 | 1.771 ± 0.196 | 1.417 ± 0.176 | 1.688 ± 0.2 | 0.438 ± 0.091 | 1.292 ± 0.177 | 2.084 ± 0.212 | 2.417 ± 0.254 | 0.938 ± 0.13 | 1.521 ± 0.195 | 1.083 ± 0.16 | 0.958 ± 0.146 | 1.563 ± 0.177 | 2.334 ± 0.253 | 2.104 ± 0.216 | 2.584 ± 0.216 | 0.271 ± 0.072 | 0.813 ± 0.153 | 0.0 ± 0.0
Asn | 3.792 ± 0.319 | 0.833 ± 0.213 | 2.709 ± 0.219 | 2.479 ± 0.291 | 2.146 ± 0.251 | 4.459 ± 0.351 | 1.042 ± 0.162 | 3.292 ± 0.315 | 3.063 ± 0.254 | 3.646 ± 0.266 | 1.583 ± 0.172 | 2.625 ± 0.276 | 2.563 ± 0.266 | 1.729 ± 0.206 | 2.979 ± 0.261 | 3.209 ± 0.271 | 2.875 ± 0.237 | 3.834 ± 0.413 | 0.646 ± 0.13 | 1.813 ± 0.224 | 0.0 ± 0.0
Pro | 3.334 ± 0.281 | 0.375 ± 0.074 | 2.729 ± 0.24 | 3.271 ± 0.323 | 1.583 ± 0.202 | 2.667 ± 0.277 | 0.604 ± 0.145 | 1.958 ± 0.222 | 2.0 ± 0.185 | 2.771 ± 0.267 | 1.0 ± 0.159 | 1.75 ± 0.191 | 1.167 ± 0.202 | 1.5 ± 0.199 | 1.917 ± 0.212 | 2.709 ± 0.238 | 2.292 ± 0.254 | 3.25 ± 0.27 | 0.563 ± 0.114 | 1.313 ± 0.177 | 0.0 ± 0.0
Gln | 3.334 ± 0.311 | 0.396 ± 0.087 | 2.042 ± 0.22 | 2.271 ± 0.256 | 1.708 ± 0.197 | 2.125 ± 0.228 | 0.542 ± 0.099 | 2.646 ± 0.247 | 1.979 ± 0.21 | 3.188 ± 0.29 | 0.896 ± 0.162 | 1.708 ± 0.182 | 1.458 ± 0.194 | 1.667 ± 0.246 | 2.167 ± 0.221 | 2.063 ± 0.252 | 2.188 ± 0.194 | 2.875 ± 0.245 | 0.667 ± 0.11 | 2.0 ± 0.217 | 0.0 ± 0.0
Arg | 3.792 ± 0.27 | 0.771 ± 0.134 | 3.104 ± 0.28 | 3.354 ± 0.32 | 2.479 ± 0.259 | 3.5 ± 0.277 | 1.042 ± 0.149 | 3.021 ± 0.248 | 2.979 ± 0.264 | 4.5 ± 0.296 | 1.875 ± 0.183 | 2.5 ± 0.223 | 1.563 ± 0.197 | 2.104 ± 0.224 | 2.875 ± 0.291 | 2.813 ± 0.259 | 2.167 ± 0.175 | 3.5 ± 0.281 | 0.646 ± 0.122 | 2.313 ± 0.218 | 0.0 ± 0.0
Ser | 4.584 ± 0.275 | 0.833 ± 0.157 | 3.709 ± 0.284 | 3.896 ± 0.312 | 2.563 ± 0.265 | 4.521 ± 0.378 | 1.063 ± 0.172 | 3.75 ± 0.319 | 4.0 ± 0.276 | 4.5 ± 0.304 | 1.958 ± 0.203 | 3.167 ± 0.267 | 2.771 ± 0.247 | 2.188 ± 0.206 | 3.146 ± 0.246 | 3.709 ± 0.342 | 3.459 ± 0.3 | 4.313 ± 0.358 | 0.75 ± 0.134 | 2.146 ± 0.193 | 0.0 ± 0.0
Thr | 4.292 ± 0.373 | 0.604 ± 0.11 | 3.459 ± 0.269 | 3.0 ± 0.289 | 2.979 ± 0.292 | 4.646 ± 0.367 | 0.833 ± 0.113 | 3.875 ± 0.317 | 3.438 ± 0.3 | 4.875 ± 0.333 | 1.333 ± 0.185 | 3.229 ± 0.311 | 3.021 ± 0.292 | 2.125 ± 0.221 | 2.792 ± 0.256 | 3.334 ± 0.312 | 3.667 ± 0.367 | 4.896 ± 0.36 | 0.938 ± 0.146 | 1.979 ± 0.258 | 0.0 ± 0.0
Val | 5.271 ± 0.351 | 0.813 ± 0.133 | 5.605 ± 0.355 | 5.375 ± 0.364 | 3.042 ± 0.264 | 5.146 ± 0.347 | 1.229 ± 0.143 | 3.959 ± 0.287 | 4.355 ± 0.327 | 5.146 ± 0.331 | 2.0 ± 0.178 | 4.021 ± 0.318 | 2.834 ± 0.258 | 3.188 ± 0.259 | 3.229 ± 0.288 | 5.167 ± 0.399 | 4.917 ± 0.425 | 5.917 ± 0.383 | 1.146 ± 0.163 | 3.25 ± 0.251 | 0.0 ± 0.0
Trp | 0.875 ± 0.167 | 0.271 ± 0.101 | 1.0 ± 0.121 | 1.208 ± 0.14 | 0.583 ± 0.099 | 0.792 ± 0.126 | 0.292 ± 0.079 | 0.583 ± 0.121 | 0.667 ± 0.131 | 1.417 ± 0.185 | 0.5 ± 0.106 | 0.833 ± 0.142 | 0.5 ± 0.109 | 0.396 ± 0.09 | 0.875 ± 0.141 | 0.813 ± 0.123 | 0.625 ± 0.115 | 1.063 ± 0.146 | 0.188 ± 0.059 | 0.479 ± 0.093 | 0.0 ± 0.0
Tyr | 2.729 ± 0.248 | 0.396 ± 0.102 | 2.354 ± 0.245 | 1.917 ± 0.232 | 1.563 ± 0.154 | 2.646 ± 0.261 | 0.771 ± 0.147 | 2.125 ± 0.222 | 2.104 ± 0.245 | 3.375 ± 0.254 | 1.104 ± 0.134 | 2.209 ± 0.234 | 1.417 ± 0.156 | 1.854 ± 0.19 | 2.396 ± 0.235 | 2.625 ± 0.26 | 2.084 ± 0.237 | 2.604 ± 0.234 | 0.479 ± 0.088 | 1.604 ± 0.19 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 198 proteins (47997 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
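
A minimal sketch of what a protein-level bootstrap could look like is given below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The function names, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions; the page only states that bootstrapping with 100 replicates was done at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per
    mille over n_boot resamples of whole proteins (with replacement)."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not residues, so each protein stays intact
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not from the phiMAM1 proteome)
mean, err = bootstrap_error(["MAAK", "MKAA", "AAAA", "MKLV"], "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.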

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski