Amino acid dipepetide frequency for Streptomyces phage Bmoc

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.359AlaAla: 7.359 ± 1.009
0.805AlaCys: 0.805 ± 0.147
4.829AlaAsp: 4.829 ± 0.416
5.375AlaGlu: 5.375 ± 0.471
3.133AlaPhe: 3.133 ± 0.304
5.806AlaGly: 5.806 ± 0.604
1.523AlaHis: 1.523 ± 0.208
4.484AlaIle: 4.484 ± 0.489
5.174AlaLys: 5.174 ± 0.385
6.87AlaLeu: 6.87 ± 0.586
2.616AlaMet: 2.616 ± 0.276
3.363AlaAsn: 3.363 ± 0.414
2.932AlaPro: 2.932 ± 0.378
2.961AlaGln: 2.961 ± 0.405
4.829AlaArg: 4.829 ± 0.454
4.398AlaSer: 4.398 ± 0.59
5.117AlaThr: 5.117 ± 0.724
5.347AlaVal: 5.347 ± 0.423
1.437AlaTrp: 1.437 ± 0.198
2.961AlaTyr: 2.961 ± 0.307
0.0AlaXaa: 0.0 ± 0.0
Cys
0.46CysAla: 0.46 ± 0.126
0.144CysCys: 0.144 ± 0.068
0.575CysAsp: 0.575 ± 0.149
0.604CysGlu: 0.604 ± 0.127
0.287CysPhe: 0.287 ± 0.088
1.207CysGly: 1.207 ± 0.278
0.23CysHis: 0.23 ± 0.081
0.489CysIle: 0.489 ± 0.113
0.977CysLys: 0.977 ± 0.192
0.719CysLeu: 0.719 ± 0.168
0.316CysMet: 0.316 ± 0.101
0.661CysAsn: 0.661 ± 0.137
0.632CysPro: 0.632 ± 0.15
0.201CysGln: 0.201 ± 0.098
0.776CysArg: 0.776 ± 0.168
0.604CysSer: 0.604 ± 0.162
0.46CysThr: 0.46 ± 0.119
0.604CysVal: 0.604 ± 0.144
0.201CysTrp: 0.201 ± 0.076
0.431CysTyr: 0.431 ± 0.11
0.0CysXaa: 0.0 ± 0.0
Asp
5.576AspAla: 5.576 ± 0.407
0.546AspCys: 0.546 ± 0.142
4.312AspAsp: 4.312 ± 0.464
5.03AspGlu: 5.03 ± 0.4
3.133AspPhe: 3.133 ± 0.36
5.433AspGly: 5.433 ± 0.525
1.035AspHis: 1.035 ± 0.218
3.766AspIle: 3.766 ± 0.404
4.312AspLys: 4.312 ± 0.479
4.369AspLeu: 4.369 ± 0.333
1.955AspMet: 1.955 ± 0.265
3.133AspAsn: 3.133 ± 0.335
2.472AspPro: 2.472 ± 0.32
1.466AspGln: 1.466 ± 0.212
2.616AspArg: 2.616 ± 0.248
3.881AspSer: 3.881 ± 0.37
3.248AspThr: 3.248 ± 0.389
4.513AspVal: 4.513 ± 0.478
1.61AspTrp: 1.61 ± 0.227
2.817AspTyr: 2.817 ± 0.333
0.0AspXaa: 0.0 ± 0.0
Glu
5.864GluAla: 5.864 ± 0.58
0.747GluCys: 0.747 ± 0.159
4.714GluAsp: 4.714 ± 0.444
5.375GluGlu: 5.375 ± 0.534
2.961GluPhe: 2.961 ± 0.339
4.685GluGly: 4.685 ± 0.428
1.523GluHis: 1.523 ± 0.275
4.398GluIle: 4.398 ± 0.341
4.168GluLys: 4.168 ± 0.415
5.921GluLeu: 5.921 ± 0.489
2.3GluMet: 2.3 ± 0.266
3.392GluAsn: 3.392 ± 0.34
2.213GluPro: 2.213 ± 0.343
2.731GluGln: 2.731 ± 0.352
4.657GluArg: 4.657 ± 0.437
3.593GluSer: 3.593 ± 0.4
3.536GluThr: 3.536 ± 0.364
5.174GluVal: 5.174 ± 0.365
1.495GluTrp: 1.495 ± 0.247
2.587GluTyr: 2.587 ± 0.346
0.0GluXaa: 0.0 ± 0.0
Phe
2.271PheAla: 2.271 ± 0.271
0.316PheCys: 0.316 ± 0.099
3.478PheAsp: 3.478 ± 0.299
3.593PheGlu: 3.593 ± 0.338
1.322PhePhe: 1.322 ± 0.2
2.472PheGly: 2.472 ± 0.256
0.805PheHis: 0.805 ± 0.154
2.098PheIle: 2.098 ± 0.257
2.328PheLys: 2.328 ± 0.321
2.357PheLeu: 2.357 ± 0.309
0.949PheMet: 0.949 ± 0.16
1.897PheAsn: 1.897 ± 0.206
1.121PhePro: 1.121 ± 0.155
1.15PheGln: 1.15 ± 0.204
1.983PheArg: 1.983 ± 0.235
2.788PheSer: 2.788 ± 0.326
2.328PheThr: 2.328 ± 0.269
2.874PheVal: 2.874 ± 0.343
0.517PheTrp: 0.517 ± 0.129
1.351PheTyr: 1.351 ± 0.196
0.0PheXaa: 0.0 ± 0.0
Gly
5.145GlyAla: 5.145 ± 0.448
0.834GlyCys: 0.834 ± 0.15
4.57GlyAsp: 4.57 ± 0.414
4.57GlyGlu: 4.57 ± 0.407
2.817GlyPhe: 2.817 ± 0.265
5.03GlyGly: 5.03 ± 0.519
1.696GlyHis: 1.696 ± 0.226
4.254GlyIle: 4.254 ± 0.38
5.174GlyLys: 5.174 ± 0.458
5.548GlyLeu: 5.548 ± 0.503
2.472GlyMet: 2.472 ± 0.215
3.737GlyAsn: 3.737 ± 0.367
1.955GlyPro: 1.955 ± 0.292
2.357GlyGln: 2.357 ± 0.296
4.254GlyArg: 4.254 ± 0.312
4.427GlySer: 4.427 ± 0.436
5.404GlyThr: 5.404 ± 0.837
5.893GlyVal: 5.893 ± 0.424
1.236GlyTrp: 1.236 ± 0.193
3.133GlyTyr: 3.133 ± 0.283
0.0GlyXaa: 0.0 ± 0.0
His
1.265HisAla: 1.265 ± 0.191
0.345HisCys: 0.345 ± 0.111
1.265HisAsp: 1.265 ± 0.209
1.207HisGlu: 1.207 ± 0.195
0.747HisPhe: 0.747 ± 0.123
1.955HisGly: 1.955 ± 0.268
0.546HisHis: 0.546 ± 0.144
0.977HisIle: 0.977 ± 0.181
0.977HisLys: 0.977 ± 0.143
1.437HisLeu: 1.437 ± 0.211
0.46HisMet: 0.46 ± 0.115
0.69HisAsn: 0.69 ± 0.122
0.776HisPro: 0.776 ± 0.166
0.632HisGln: 0.632 ± 0.152
1.437HisArg: 1.437 ± 0.212
1.092HisSer: 1.092 ± 0.168
1.064HisThr: 1.064 ± 0.167
1.179HisVal: 1.179 ± 0.167
0.316HisTrp: 0.316 ± 0.096
0.69HisTyr: 0.69 ± 0.136
0.0HisXaa: 0.0 ± 0.0
Ile
4.34IleAla: 4.34 ± 0.437
0.661IleCys: 0.661 ± 0.157
4.053IleAsp: 4.053 ± 0.369
4.628IleGlu: 4.628 ± 0.422
1.552IlePhe: 1.552 ± 0.217
3.478IleGly: 3.478 ± 0.317
1.035IleHis: 1.035 ± 0.158
2.443IleIle: 2.443 ± 0.298
3.421IleLys: 3.421 ± 0.311
3.593IleLeu: 3.593 ± 0.384
0.977IleMet: 0.977 ± 0.173
2.07IleAsn: 2.07 ± 0.24
2.271IlePro: 2.271 ± 0.254
1.696IleGln: 1.696 ± 0.309
3.679IleArg: 3.679 ± 0.388
3.104IleSer: 3.104 ± 0.293
2.903IleThr: 2.903 ± 0.357
4.657IleVal: 4.657 ± 0.402
0.862IleTrp: 0.862 ± 0.169
1.638IleTyr: 1.638 ± 0.219
0.0IleXaa: 0.0 ± 0.0
Lys
5.576LysAla: 5.576 ± 0.471
0.862LysCys: 0.862 ± 0.189
3.737LysAsp: 3.737 ± 0.401
4.657LysGlu: 4.657 ± 0.405
1.84LysPhe: 1.84 ± 0.258
4.398LysGly: 4.398 ± 0.468
1.236LysHis: 1.236 ± 0.203
3.219LysIle: 3.219 ± 0.363
4.455LysLys: 4.455 ± 0.505
4.283LysLeu: 4.283 ± 0.344
2.501LysMet: 2.501 ± 0.258
3.679LysAsn: 3.679 ± 0.39
2.328LysPro: 2.328 ± 0.289
2.386LysGln: 2.386 ± 0.304
4.082LysArg: 4.082 ± 0.508
3.334LysSer: 3.334 ± 0.294
3.507LysThr: 3.507 ± 0.336
4.513LysVal: 4.513 ± 0.347
1.294LysTrp: 1.294 ± 0.2
2.587LysTyr: 2.587 ± 0.287
0.0LysXaa: 0.0 ± 0.0
Leu
6.985LeuAla: 6.985 ± 0.498
1.092LeuCys: 1.092 ± 0.239
5.059LeuAsp: 5.059 ± 0.406
5.835LeuGlu: 5.835 ± 0.496
2.53LeuPhe: 2.53 ± 0.26
4.685LeuGly: 4.685 ± 0.43
1.408LeuHis: 1.408 ± 0.199
3.967LeuIle: 3.967 ± 0.348
4.628LeuLys: 4.628 ± 0.374
4.743LeuLeu: 4.743 ± 0.419
1.811LeuMet: 1.811 ± 0.211
2.961LeuAsn: 2.961 ± 0.388
2.702LeuPro: 2.702 ± 0.238
1.61LeuGln: 1.61 ± 0.205
4.139LeuArg: 4.139 ± 0.359
5.174LeuSer: 5.174 ± 0.359
4.973LeuThr: 4.973 ± 0.438
4.714LeuVal: 4.714 ± 0.399
1.121LeuTrp: 1.121 ± 0.175
2.472LeuTyr: 2.472 ± 0.244
0.0LeuXaa: 0.0 ± 0.0
Met
2.932MetAla: 2.932 ± 0.281
0.287MetCys: 0.287 ± 0.091
1.581MetAsp: 1.581 ± 0.231
1.61MetGlu: 1.61 ± 0.245
0.805MetPhe: 0.805 ± 0.173
2.127MetGly: 2.127 ± 0.286
0.604MetHis: 0.604 ± 0.161
1.294MetIle: 1.294 ± 0.21
1.696MetLys: 1.696 ± 0.228
2.012MetLeu: 2.012 ± 0.222
0.431MetMet: 0.431 ± 0.118
1.092MetAsn: 1.092 ± 0.18
1.179MetPro: 1.179 ± 0.191
1.121MetGln: 1.121 ± 0.277
2.012MetArg: 2.012 ± 0.244
2.156MetSer: 2.156 ± 0.27
2.213MetThr: 2.213 ± 0.221
1.782MetVal: 1.782 ± 0.229
0.402MetTrp: 0.402 ± 0.107
0.977MetTyr: 0.977 ± 0.187
0.0MetXaa: 0.0 ± 0.0
Asn
3.852AsnAla: 3.852 ± 0.471
0.23AsnCys: 0.23 ± 0.087
2.759AsnAsp: 2.759 ± 0.317
2.961AsnGlu: 2.961 ± 0.322
1.84AsnPhe: 1.84 ± 0.251
3.881AsnGly: 3.881 ± 0.377
1.121AsnHis: 1.121 ± 0.202
2.587AsnIle: 2.587 ± 0.277
3.047AsnLys: 3.047 ± 0.351
3.191AsnLeu: 3.191 ± 0.313
1.064AsnMet: 1.064 ± 0.184
2.012AsnAsn: 2.012 ± 0.285
2.07AsnPro: 2.07 ± 0.248
1.322AsnGln: 1.322 ± 0.208
2.731AsnArg: 2.731 ± 0.281
2.242AsnSer: 2.242 ± 0.34
2.788AsnThr: 2.788 ± 0.408
3.162AsnVal: 3.162 ± 0.299
0.862AsnTrp: 0.862 ± 0.166
1.351AsnTyr: 1.351 ± 0.23
0.0AsnXaa: 0.0 ± 0.0
Pro
2.932ProAla: 2.932 ± 0.317
0.316ProCys: 0.316 ± 0.097
2.702ProAsp: 2.702 ± 0.362
2.989ProGlu: 2.989 ± 0.316
1.495ProPhe: 1.495 ± 0.21
3.047ProGly: 3.047 ± 0.315
0.69ProHis: 0.69 ± 0.116
1.84ProIle: 1.84 ± 0.22
2.041ProLys: 2.041 ± 0.296
2.357ProLeu: 2.357 ± 0.241
0.891ProMet: 0.891 ± 0.173
1.782ProAsn: 1.782 ± 0.232
1.179ProPro: 1.179 ± 0.283
0.92ProGln: 0.92 ± 0.156
2.041ProArg: 2.041 ± 0.281
2.271ProSer: 2.271 ± 0.371
2.472ProThr: 2.472 ± 0.432
3.449ProVal: 3.449 ± 0.325
0.517ProTrp: 0.517 ± 0.116
1.351ProTyr: 1.351 ± 0.199
0.0ProXaa: 0.0 ± 0.0
Gln
2.903GlnAla: 2.903 ± 0.459
0.374GlnCys: 0.374 ± 0.109
1.61GlnAsp: 1.61 ± 0.228
2.156GlnGlu: 2.156 ± 0.411
1.351GlnPhe: 1.351 ± 0.208
2.185GlnGly: 2.185 ± 0.236
0.46GlnHis: 0.46 ± 0.12
1.552GlnIle: 1.552 ± 0.189
2.185GlnLys: 2.185 ± 0.28
2.271GlnLeu: 2.271 ± 0.282
1.092GlnMet: 1.092 ± 0.216
1.322GlnAsn: 1.322 ± 0.208
1.121GlnPro: 1.121 ± 0.2
1.15GlnGln: 1.15 ± 0.287
1.897GlnArg: 1.897 ± 0.328
1.782GlnSer: 1.782 ± 0.191
1.868GlnThr: 1.868 ± 0.267
2.616GlnVal: 2.616 ± 0.304
0.575GlnTrp: 0.575 ± 0.125
1.15GlnTyr: 1.15 ± 0.179
0.0GlnXaa: 0.0 ± 0.0
Arg
4.973ArgAla: 4.973 ± 0.455
0.489ArgCys: 0.489 ± 0.136
3.277ArgAsp: 3.277 ± 0.386
3.909ArgGlu: 3.909 ± 0.374
2.587ArgPhe: 2.587 ± 0.337
3.823ArgGly: 3.823 ± 0.331
0.949ArgHis: 0.949 ± 0.199
3.076ArgIle: 3.076 ± 0.308
4.599ArgLys: 4.599 ± 0.512
4.082ArgLeu: 4.082 ± 0.382
1.983ArgMet: 1.983 ± 0.242
2.817ArgAsn: 2.817 ± 0.349
2.07ArgPro: 2.07 ± 0.231
1.84ArgGln: 1.84 ± 0.277
3.794ArgArg: 3.794 ± 0.476
3.162ArgSer: 3.162 ± 0.343
2.731ArgThr: 2.731 ± 0.359
4.168ArgVal: 4.168 ± 0.43
1.351ArgTrp: 1.351 ± 0.228
2.443ArgTyr: 2.443 ± 0.302
0.0ArgXaa: 0.0 ± 0.0
Ser
4.369SerAla: 4.369 ± 0.473
0.632SerCys: 0.632 ± 0.159
3.881SerAsp: 3.881 ± 0.361
3.938SerGlu: 3.938 ± 0.285
2.472SerPhe: 2.472 ± 0.268
5.921SerGly: 5.921 ± 0.626
0.977SerHis: 0.977 ± 0.166
2.903SerIle: 2.903 ± 0.256
3.852SerLys: 3.852 ± 0.407
5.002SerLeu: 5.002 ± 0.48
1.868SerMet: 1.868 ± 0.225
2.328SerAsn: 2.328 ± 0.346
2.07SerPro: 2.07 ± 0.286
1.725SerGln: 1.725 ± 0.229
2.961SerArg: 2.961 ± 0.277
3.507SerSer: 3.507 ± 0.497
3.392SerThr: 3.392 ± 0.586
4.484SerVal: 4.484 ± 0.509
1.408SerTrp: 1.408 ± 0.176
2.098SerTyr: 2.098 ± 0.331
0.0SerXaa: 0.0 ± 0.0
Thr
4.628ThrAla: 4.628 ± 0.635
0.776ThrCys: 0.776 ± 0.147
3.737ThrAsp: 3.737 ± 0.404
4.053ThrGlu: 4.053 ± 0.378
2.242ThrPhe: 2.242 ± 0.274
5.548ThrGly: 5.548 ± 0.747
1.064ThrHis: 1.064 ± 0.159
3.507ThrIle: 3.507 ± 0.373
3.191ThrLys: 3.191 ± 0.321
4.254ThrLeu: 4.254 ± 0.352
1.035ThrMet: 1.035 ± 0.202
2.501ThrAsn: 2.501 ± 0.383
3.277ThrPro: 3.277 ± 0.396
1.926ThrGln: 1.926 ± 0.282
2.846ThrArg: 2.846 ± 0.36
3.392ThrSer: 3.392 ± 0.362
3.823ThrThr: 3.823 ± 0.657
4.542ThrVal: 4.542 ± 0.543
1.236ThrTrp: 1.236 ± 0.196
2.415ThrTyr: 2.415 ± 0.259
0.0ThrXaa: 0.0 ± 0.0
Val
5.461ValAla: 5.461 ± 0.349
0.604ValCys: 0.604 ± 0.147
4.887ValAsp: 4.887 ± 0.515
5.002ValGlu: 5.002 ± 0.444
2.788ValPhe: 2.788 ± 0.281
4.34ValGly: 4.34 ± 0.408
0.977ValHis: 0.977 ± 0.197
3.967ValIle: 3.967 ± 0.387
5.002ValLys: 5.002 ± 0.357
4.685ValLeu: 4.685 ± 0.429
1.983ValMet: 1.983 ± 0.218
2.874ValAsn: 2.874 ± 0.299
2.989ValPro: 2.989 ± 0.366
2.587ValGln: 2.587 ± 0.299
4.398ValArg: 4.398 ± 0.354
5.375ValSer: 5.375 ± 0.448
4.312ValThr: 4.312 ± 0.599
5.117ValVal: 5.117 ± 0.385
1.523ValTrp: 1.523 ± 0.238
3.277ValTyr: 3.277 ± 0.343
0.0ValXaa: 0.0 ± 0.0
Trp
1.437TrpAla: 1.437 ± 0.214
0.172TrpCys: 0.172 ± 0.088
1.236TrpAsp: 1.236 ± 0.209
1.725TrpGlu: 1.725 ± 0.242
0.661TrpPhe: 0.661 ± 0.124
1.322TrpGly: 1.322 ± 0.193
0.517TrpHis: 0.517 ± 0.15
0.69TrpIle: 0.69 ± 0.134
1.236TrpLys: 1.236 ± 0.232
1.753TrpLeu: 1.753 ± 0.237
0.661TrpMet: 0.661 ± 0.162
1.092TrpAsn: 1.092 ± 0.184
0.402TrpPro: 0.402 ± 0.113
0.661TrpGln: 0.661 ± 0.139
0.805TrpArg: 0.805 ± 0.156
1.15TrpSer: 1.15 ± 0.19
1.437TrpThr: 1.437 ± 0.248
0.92TrpVal: 0.92 ± 0.149
0.489TrpTrp: 0.489 ± 0.127
0.719TrpTyr: 0.719 ± 0.119
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.989TyrAla: 2.989 ± 0.295
0.345TyrCys: 0.345 ± 0.094
2.961TyrAsp: 2.961 ± 0.349
2.817TyrGlu: 2.817 ± 0.323
1.437TyrPhe: 1.437 ± 0.243
3.162TyrGly: 3.162 ± 0.353
0.661TyrHis: 0.661 ± 0.149
1.552TyrIle: 1.552 ± 0.205
2.012TyrLys: 2.012 ± 0.255
3.162TyrLeu: 3.162 ± 0.31
0.92TyrMet: 0.92 ± 0.167
1.667TyrAsn: 1.667 ± 0.21
1.523TyrPro: 1.523 ± 0.243
1.179TyrGln: 1.179 ± 0.173
2.185TyrArg: 2.185 ± 0.255
2.357TyrSer: 2.357 ± 0.244
2.415TyrThr: 2.415 ± 0.245
2.443TyrVal: 2.443 ± 0.327
0.632TyrTrp: 0.632 ± 0.141
1.294TyrTyr: 1.294 ± 0.214
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 214 proteins (34790 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski