Amino acid dipeptide frequency for Streptomyces phage Jay2Jay

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
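As an illustration of what a dipeptide frequency is, here is a minimal sketch. It assumes that a dipeptide is any pair of adjacent residues, counted with overlap and never across protein boundaries, and that counts are normalised by the total number of dipeptides; the exact counting and normalisation used for this page are not stated here. The function name and the toy sequences (in one-letter code) are hypothetical.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille over a list of sequences.

    Assumption: adjacent residue pairs are counted with overlap,
    separately within each protein, and normalised by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the Jay2Jay proteome
freqs = dipeptide_frequencies(["MAAK", "AAG"])
```

In the toy input there are five dipeptides in total (MA, AA, AK, AA, AG), so AA accounts for two of five.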

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
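The uniform expectation and the over/under labelling can be sketched as follows. This is only an illustration: it assumes the threshold is the plain uniform value (1/400 of all dipeptides) and ignores the error estimates, which may differ from how the colours on the page are actually assigned. The function names are hypothetical.

```python
def expected_uniform_permille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"

expected = expected_uniform_permille()  # 20 standard amino acids
labels = {
    "AlaAla": classify(8.097, expected),  # value from the table
    "CysCys": classify(0.145, expected),  # value from the table
}
```

With 21 residue types (including Xaa), `expected_uniform_permille(21)` gives roughly 2.27‰ instead of 2.5‰.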

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
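The unit conversion is simple arithmetic; a small sketch with hypothetical helper names, using the AlaAla value from the table:

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per-mille value to percent."""
    return value_permille / 10.0

ala_ala_fraction = permille_to_fraction(8.097)  # about 0.008097
ala_ala_percent = permille_to_percent(8.097)    # about 0.8097 %
```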

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.097 ± 0.981
AlaCys: 0.781 ± 0.195
AlaAsp: 5.234 ± 0.434
AlaGlu: 5.986 ± 0.547
AlaPhe: 3.065 ± 0.293
AlaGly: 5.697 ± 0.547
AlaHis: 1.533 ± 0.197
AlaIle: 4.858 ± 0.418
AlaLys: 5.234 ± 0.42
AlaLeu: 6.911 ± 0.566
AlaMet: 2.487 ± 0.304
AlaAsn: 3.759 ± 0.429
AlaPro: 3.326 ± 0.435
AlaGln: 2.863 ± 0.345
AlaArg: 4.309 ± 0.388
AlaSer: 4.858 ± 0.533
AlaThr: 5.09 ± 0.611
AlaVal: 5.494 ± 0.31
AlaTrp: 1.562 ± 0.202
AlaTyr: 3.268 ± 0.343
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.434 ± 0.118
CysCys: 0.145 ± 0.078
CysAsp: 0.578 ± 0.145
CysGlu: 0.665 ± 0.161
CysPhe: 0.289 ± 0.111
CysGly: 0.925 ± 0.189
CysHis: 0.174 ± 0.076
CysIle: 0.665 ± 0.164
CysLys: 0.723 ± 0.146
CysLeu: 0.665 ± 0.171
CysMet: 0.318 ± 0.096
CysAsn: 0.492 ± 0.139
CysPro: 0.607 ± 0.157
CysGln: 0.376 ± 0.125
CysArg: 0.752 ± 0.167
CysSer: 0.723 ± 0.172
CysThr: 0.318 ± 0.104
CysVal: 0.578 ± 0.132
CysTrp: 0.145 ± 0.072
CysTyr: 0.347 ± 0.12
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.523 ± 0.414
AspCys: 0.434 ± 0.121
AspAsp: 4.193 ± 0.402
AspGlu: 5.437 ± 0.48
AspPhe: 3.065 ± 0.305
AspGly: 5.784 ± 0.448
AspHis: 0.896 ± 0.157
AspIle: 3.615 ± 0.313
AspLys: 3.875 ± 0.411
AspLeu: 4.482 ± 0.321
AspMet: 2.14 ± 0.257
AspAsn: 2.603 ± 0.243
AspPro: 2.313 ± 0.295
AspGln: 1.504 ± 0.198
AspArg: 3.065 ± 0.278
AspSer: 4.048 ± 0.344
AspThr: 3.644 ± 0.381
AspVal: 3.991 ± 0.385
AspTrp: 1.619 ± 0.205
AspTyr: 2.574 ± 0.32
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.102 ± 0.592
GluCys: 0.463 ± 0.125
GluAsp: 4.106 ± 0.352
GluGlu: 4.887 ± 0.518
GluPhe: 2.834 ± 0.248
GluGly: 4.251 ± 0.435
GluHis: 1.619 ± 0.304
GluIle: 4.048 ± 0.319
GluLys: 4.395 ± 0.466
GluLeu: 5.234 ± 0.551
GluMet: 1.966 ± 0.244
GluAsn: 3.297 ± 0.315
GluPro: 2.227 ± 0.332
GluGln: 2.718 ± 0.314
GluArg: 4.193 ± 0.381
GluSer: 3.21 ± 0.344
GluThr: 3.788 ± 0.386
GluVal: 5.263 ± 0.498
GluTrp: 1.272 ± 0.252
GluTyr: 2.458 ± 0.322
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.776 ± 0.316
PheCys: 0.434 ± 0.116
PheAsp: 3.326 ± 0.265
PheGlu: 3.412 ± 0.331
PhePhe: 1.648 ± 0.227
PheGly: 2.834 ± 0.301
PheHis: 0.868 ± 0.156
PheIle: 2.082 ± 0.244
PheLys: 2.082 ± 0.283
PheLeu: 2.198 ± 0.349
PheMet: 0.868 ± 0.152
PheAsn: 2.111 ± 0.206
PhePro: 0.925 ± 0.164
PheGln: 1.157 ± 0.209
PheArg: 1.764 ± 0.243
PheSer: 2.805 ± 0.313
PheThr: 2.632 ± 0.273
PheVal: 2.632 ± 0.34
PheTrp: 0.463 ± 0.14
PheTyr: 1.301 ± 0.181
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.205 ± 0.433
GlyCys: 0.81 ± 0.158
GlyAsp: 4.251 ± 0.359
GlyGlu: 4.309 ± 0.347
GlyPhe: 3.21 ± 0.289
GlyGly: 5.234 ± 0.543
GlyHis: 1.648 ± 0.222
GlyIle: 4.28 ± 0.432
GlyLys: 4.656 ± 0.409
GlyLeu: 5.639 ± 0.49
GlyMet: 2.458 ± 0.254
GlyAsn: 3.73 ± 0.295
GlyPro: 2.284 ± 0.298
GlyGln: 2.718 ± 0.314
GlyArg: 4.135 ± 0.344
GlySer: 4.309 ± 0.566
GlyThr: 5.263 ± 0.599
GlyVal: 6.333 ± 0.431
GlyTrp: 1.417 ± 0.23
GlyTyr: 3.152 ± 0.251
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.099 ± 0.192
HisCys: 0.347 ± 0.105
HisAsp: 1.301 ± 0.203
HisGlu: 0.868 ± 0.153
HisPhe: 0.636 ± 0.145
HisGly: 1.562 ± 0.197
HisHis: 0.578 ± 0.109
HisIle: 1.186 ± 0.194
HisLys: 1.07 ± 0.186
HisLeu: 1.417 ± 0.208
HisMet: 0.607 ± 0.134
HisAsn: 0.839 ± 0.166
HisPro: 0.752 ± 0.162
HisGln: 0.578 ± 0.161
HisArg: 1.272 ± 0.239
HisSer: 1.07 ± 0.172
HisThr: 0.896 ± 0.165
HisVal: 1.446 ± 0.236
HisTrp: 0.347 ± 0.102
HisTyr: 0.896 ± 0.15
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.032 ± 0.34
IleCys: 0.694 ± 0.134
IleAsp: 4.309 ± 0.359
IleGlu: 4.656 ± 0.369
IlePhe: 1.33 ± 0.19
IleGly: 3.528 ± 0.363
IleHis: 0.896 ± 0.194
IleIle: 2.053 ± 0.259
IleLys: 3.152 ± 0.339
IleLeu: 3.615 ± 0.443
IleMet: 1.157 ± 0.163
IleAsn: 2.198 ± 0.344
IlePro: 2.429 ± 0.264
IleGln: 2.198 ± 0.335
IleArg: 3.326 ± 0.335
IleSer: 3.065 ± 0.302
IleThr: 3.123 ± 0.281
IleVal: 4.511 ± 0.451
IleTrp: 0.752 ± 0.162
IleTyr: 1.822 ± 0.319
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.073 ± 0.511
LysCys: 0.636 ± 0.179
LysAsp: 3.673 ± 0.413
LysGlu: 3.586 ± 0.465
LysPhe: 2.198 ± 0.282
LysGly: 4.251 ± 0.346
LysHis: 1.012 ± 0.202
LysIle: 2.863 ± 0.318
LysLys: 3.991 ± 0.475
LysLeu: 4.193 ± 0.342
LysMet: 2.4 ± 0.254
LysAsn: 3.268 ± 0.323
LysPro: 2.4 ± 0.268
LysGln: 2.227 ± 0.254
LysArg: 3.875 ± 0.42
LysSer: 3.644 ± 0.37
LysThr: 3.962 ± 0.287
LysVal: 4.54 ± 0.328
LysTrp: 1.186 ± 0.175
LysTyr: 2.545 ± 0.24
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.316 ± 0.386
LeuCys: 0.665 ± 0.157
LeuAsp: 5.061 ± 0.383
LeuGlu: 5.09 ± 0.46
LeuPhe: 2.545 ± 0.335
LeuGly: 4.8 ± 0.381
LeuHis: 1.533 ± 0.204
LeuIle: 4.338 ± 0.368
LeuLys: 4.367 ± 0.467
LeuLeu: 4.453 ± 0.414
LeuMet: 2.227 ± 0.282
LeuAsn: 3.21 ± 0.375
LeuPro: 2.95 ± 0.315
LeuGln: 1.851 ± 0.283
LeuArg: 3.846 ± 0.297
LeuSer: 5.061 ± 0.337
LeuThr: 4.685 ± 0.376
LeuVal: 4.424 ± 0.438
LeuTrp: 1.475 ± 0.185
LeuTyr: 2.371 ± 0.267
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.921 ± 0.28
MetCys: 0.347 ± 0.108
MetAsp: 1.475 ± 0.23
MetGlu: 1.648 ± 0.248
MetPhe: 0.81 ± 0.179
MetGly: 2.313 ± 0.376
MetHis: 0.636 ± 0.144
MetIle: 1.157 ± 0.158
MetLys: 2.053 ± 0.217
MetLeu: 2.053 ± 0.263
MetMet: 0.607 ± 0.154
MetAsn: 1.099 ± 0.164
MetPro: 1.243 ± 0.212
MetGln: 1.099 ± 0.281
MetArg: 2.14 ± 0.243
MetSer: 2.429 ± 0.326
MetThr: 2.198 ± 0.247
MetVal: 1.619 ± 0.21
MetTrp: 0.289 ± 0.093
MetTyr: 0.896 ± 0.134
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.904 ± 0.414
AsnCys: 0.347 ± 0.098
AsnAsp: 3.152 ± 0.27
AsnGlu: 2.689 ± 0.337
AsnPhe: 1.677 ± 0.215
AsnGly: 4.569 ± 0.421
AsnHis: 0.896 ± 0.139
AsnIle: 2.632 ± 0.247
AsnLys: 2.95 ± 0.356
AsnLeu: 3.354 ± 0.337
AsnMet: 1.33 ± 0.198
AsnAsn: 2.371 ± 0.286
AsnPro: 1.88 ± 0.241
AsnGln: 1.157 ± 0.184
AsnArg: 2.313 ± 0.292
AsnSer: 2.256 ± 0.284
AsnThr: 2.805 ± 0.42
AsnVal: 3.036 ± 0.357
AsnTrp: 0.781 ± 0.153
AsnTyr: 1.619 ± 0.24
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.863 ± 0.282
ProCys: 0.289 ± 0.094
ProAsp: 2.747 ± 0.365
ProGlu: 2.747 ± 0.297
ProPhe: 1.735 ± 0.225
ProGly: 3.268 ± 0.328
ProHis: 0.636 ± 0.133
ProIle: 1.909 ± 0.199
ProLys: 2.429 ± 0.352
ProLeu: 2.284 ± 0.263
ProMet: 0.868 ± 0.155
ProAsn: 1.822 ± 0.249
ProPro: 1.272 ± 0.222
ProGln: 0.839 ± 0.154
ProArg: 1.966 ± 0.259
ProSer: 2.313 ± 0.355
ProThr: 2.747 ± 0.459
ProVal: 3.644 ± 0.307
ProTrp: 0.434 ± 0.111
ProTyr: 1.215 ± 0.227
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.007 ± 0.42
GlnCys: 0.463 ± 0.118
GlnAsp: 1.706 ± 0.203
GlnGlu: 2.256 ± 0.234
GlnPhe: 1.59 ± 0.243
GlnGly: 2.169 ± 0.252
GlnHis: 0.492 ± 0.12
GlnIle: 1.417 ± 0.202
GlnLys: 2.371 ± 0.306
GlnLeu: 2.718 ± 0.291
GlnMet: 1.33 ± 0.252
GlnAsn: 1.359 ± 0.213
GlnPro: 1.099 ± 0.202
GlnGln: 1.07 ± 0.35
GlnArg: 1.966 ± 0.284
GlnSer: 2.313 ± 0.206
GlnThr: 1.764 ± 0.242
GlnVal: 2.227 ± 0.233
GlnTrp: 0.405 ± 0.094
GlnTyr: 1.07 ± 0.155
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.569 ± 0.455
ArgCys: 0.463 ± 0.136
ArgAsp: 3.297 ± 0.362
ArgGlu: 3.846 ± 0.375
ArgPhe: 2.458 ± 0.27
ArgGly: 3.817 ± 0.336
ArgHis: 0.925 ± 0.19
ArgIle: 3.007 ± 0.29
ArgLys: 4.135 ± 0.471
ArgLeu: 4.338 ± 0.319
ArgMet: 1.677 ± 0.281
ArgAsn: 2.632 ± 0.229
ArgPro: 1.909 ± 0.237
ArgGln: 1.966 ± 0.28
ArgArg: 3.412 ± 0.43
ArgSer: 2.979 ± 0.257
ArgThr: 2.979 ± 0.344
ArgVal: 3.817 ± 0.35
ArgTrp: 1.215 ± 0.21
ArgTyr: 2.4 ± 0.27
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.685 ± 0.453
SerCys: 0.81 ± 0.162
SerAsp: 3.933 ± 0.368
SerGlu: 3.644 ± 0.313
SerPhe: 2.545 ± 0.278
SerGly: 5.784 ± 0.58
SerHis: 1.215 ± 0.199
SerIle: 3.586 ± 0.412
SerLys: 3.73 ± 0.38
SerLeu: 4.598 ± 0.364
SerMet: 1.88 ± 0.223
SerAsn: 2.429 ± 0.307
SerPro: 1.995 ± 0.235
SerGln: 1.966 ± 0.234
SerArg: 3.354 ± 0.33
SerSer: 3.615 ± 0.356
SerThr: 3.788 ± 0.683
SerVal: 4.367 ± 0.55
SerTrp: 1.446 ± 0.215
SerTyr: 1.937 ± 0.31
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.858 ± 0.495
ThrCys: 0.636 ± 0.174
ThrAsp: 4.077 ± 0.379
ThrGlu: 4.048 ± 0.393
ThrPhe: 2.227 ± 0.271
ThrGly: 5.263 ± 0.555
ThrHis: 0.925 ± 0.134
ThrIle: 3.615 ± 0.389
ThrLys: 3.499 ± 0.321
ThrLeu: 4.395 ± 0.366
ThrMet: 1.388 ± 0.213
ThrAsn: 2.863 ± 0.393
ThrPro: 3.152 ± 0.446
ThrGln: 1.909 ± 0.261
ThrArg: 3.036 ± 0.342
ThrSer: 3.701 ± 0.45
ThrThr: 3.933 ± 0.735
ThrVal: 4.742 ± 0.532
ThrTrp: 1.388 ± 0.222
ThrTyr: 2.053 ± 0.301
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.552 ± 0.422
ValCys: 0.694 ± 0.169
ValAsp: 4.714 ± 0.49
ValGlu: 4.511 ± 0.405
ValPhe: 2.574 ± 0.261
ValGly: 4.569 ± 0.359
ValHis: 1.186 ± 0.203
ValIle: 3.933 ± 0.313
ValLys: 4.829 ± 0.435
ValLeu: 4.771 ± 0.415
ValMet: 1.706 ± 0.251
ValAsn: 3.094 ± 0.387
ValPro: 3.21 ± 0.344
ValGln: 2.371 ± 0.258
ValArg: 4.222 ± 0.346
ValSer: 5.205 ± 0.4
ValThr: 4.338 ± 0.508
ValVal: 5.61 ± 0.454
ValTrp: 1.417 ± 0.223
ValTyr: 2.95 ± 0.34
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.504 ± 0.185
TrpCys: 0.174 ± 0.073
TrpAsp: 1.243 ± 0.211
TrpGlu: 1.504 ± 0.224
TrpPhe: 0.636 ± 0.126
TrpGly: 1.359 ± 0.217
TrpHis: 0.492 ± 0.14
TrpIle: 0.781 ± 0.143
TrpLys: 1.099 ± 0.177
TrpLeu: 1.706 ± 0.252
TrpMet: 0.665 ± 0.131
TrpAsn: 1.041 ± 0.178
TrpPro: 0.463 ± 0.125
TrpGln: 0.839 ± 0.136
TrpArg: 0.81 ± 0.152
TrpSer: 1.07 ± 0.162
TrpThr: 1.186 ± 0.19
TrpVal: 0.983 ± 0.195
TrpTrp: 0.521 ± 0.111
TrpTyr: 0.781 ± 0.139
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.921 ± 0.304
TyrCys: 0.289 ± 0.097
TyrAsp: 2.429 ± 0.298
TyrGlu: 2.545 ± 0.33
TyrPhe: 1.243 ± 0.194
TyrGly: 3.065 ± 0.316
TyrHis: 0.607 ± 0.113
TyrIle: 1.822 ± 0.212
TyrLys: 1.88 ± 0.227
TyrLeu: 3.239 ± 0.287
TyrMet: 0.925 ± 0.164
TyrAsn: 1.504 ± 0.204
TyrPro: 1.59 ± 0.234
TyrGln: 1.388 ± 0.183
TyrArg: 2.082 ± 0.237
TyrSer: 2.632 ± 0.273
TyrThr: 2.574 ± 0.269
TyrVal: 2.198 ± 0.267
TyrTrp: 0.723 ± 0.143
TyrTyr: 1.388 ± 0.19
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 209 proteins (34582 amino acids)
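For working with the table programmatically, the entries of the form `AlaAla: 8.097 ± 0.981` can be read with a small parser. This is a sketch only; the regular expression and the function name are assumptions made for illustration and are not part of Proteome-pI.

```python
import re

# Matches e.g. "AlaAla: 8.097 ± 0.981", also when other text precedes it
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([0-9.]+) ± ([0-9.]+)")

def parse_entries(lines):
    """Map (first, second) residue codes to (frequency, error) in per mille."""
    table = {}
    for line in lines:
        match = ENTRY.search(line)
        if match:
            first, second, freq, err = match.groups()
            table[(first, second)] = (float(freq), float(err))
    return table

# Row labels such as "Ala" do not match and are skipped
sample = ["Ala", "AlaAla: 8.097 ± 0.981", "AlaCys: 0.781 ± 0.195"]
table = parse_entries(sample)
```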

Note: The error was estimated by bootstrapping (×100) at the protein level.
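A protein-level bootstrap could look roughly like the sketch below. The exact procedure behind the reported errors is not described here, so this assumes that whole proteins are resampled with replacement 100 times and that the error is the standard deviation of the resulting frequencies. The function names, the fixed seed, and the toy sequences (one-letter code) are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the set size
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences, not the Jay2Jay proteome
toy = ["MAAKAA", "MGGK", "AAGAA", "MKKA"]
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.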

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski