Amino acid dipeptide frequency for Bacillus phage phi3Ts

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent amino acids occurs.
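As an illustration, dipeptide frequencies can be computed by counting overlapping pairs of adjacent residues in each protein. The following is a minimal sketch, not the database's own code: the function name `dipeptide_frequencies`, the toy sequences, the use of one-letter codes, and the choice of denominator (the total number of dipeptide positions) are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return overlapping dipeptide frequencies in per mille.

    `sequences` is a list of protein sequences in one-letter code.
    Assumption: frequencies are normalised by the total number of
    dipeptide positions; the source page does not state its denominator.
    """
    counts = Counter()
    for seq in sequences:
        # Count every pair of adjacent residues within one protein;
        # pairs never span two different proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MKKLA", "MKA"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(f"{pair}: {freqs[pair]:.3f}")
```

Here the pair `MK` occurs twice among six dipeptide positions, so it gets one third of the total. The table below uses three-letter codes (for example LysLys rather than KK), but the counting principle is the same.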

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all 20 standard amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
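The arithmetic behind the 0.25% figure, and a possible colouring rule, can be sketched as follows. This is a sketch under assumptions: the helper `classify` and the use of the uniform expectation as the red/blue threshold are illustrative choices, since the page does not spell out the exact rule. The three example values are taken from the table below.

```python
N_STANDARD = 20  # standard amino acids

# Uniform expectation for one of the 20 * 20 = 400 dipeptides.
expected_fraction = 1 / N_STANDARD ** 2       # 0.0025
expected_percent = expected_fraction * 100    # 0.25 %
expected_permille = expected_fraction * 1000  # 2.5 per mille


def classify(value_permille, expected=expected_permille):
    """Label a per-mille frequency relative to the uniform expectation.

    Assumption: 'red' means above and 'blue' means below the uniform
    expectation; the page may use a different threshold.
    """
    if value_permille > expected:
        return "red"
    if value_permille < expected:
        return "blue"
    return "neutral"


# Values (in per mille) copied from the table on this page.
examples = {"LysLys": 10.092, "TrpTrp": 0.082, "GlyPro": 0.109}
labels = {pair: classify(value) for pair, value in examples.items()}
print(labels)
```

Because the table is given in per mille, the 0.25% expectation corresponds to 2.5‰ when comparing against table values.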

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.563 ± 0.355
AlaCys: 0.544 ± 0.152
AlaAsp: 3.482 ± 0.352
AlaGlu: 3.101 ± 0.319
AlaPhe: 2.176 ± 0.289
AlaGly: 2.666 ± 0.307
AlaHis: 0.816 ± 0.141
AlaIle: 4.76 ± 0.364
AlaLys: 4.978 ± 0.511
AlaLeu: 4.135 ± 0.399
AlaMet: 1.523 ± 0.232
AlaAsn: 2.339 ± 0.3
AlaPro: 1.278 ± 0.185
AlaGln: 1.551 ± 0.311
AlaArg: 1.741 ± 0.208
AlaSer: 3.427 ± 0.413
AlaThr: 2.883 ± 0.272
AlaVal: 3.019 ± 0.293
AlaTrp: 0.49 ± 0.097
AlaTyr: 2.72 ± 0.305
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.354 ± 0.134
CysCys: 0.163 ± 0.061
CysAsp: 0.299 ± 0.09
CysGlu: 0.707 ± 0.133
CysPhe: 0.816 ± 0.159
CysGly: 0.843 ± 0.18
CysHis: 0.272 ± 0.097
CysIle: 0.626 ± 0.133
CysLys: 0.734 ± 0.164
CysLeu: 0.762 ± 0.154
CysMet: 0.136 ± 0.067
CysAsn: 0.789 ± 0.154
CysPro: 0.381 ± 0.124
CysGln: 0.245 ± 0.075
CysArg: 0.462 ± 0.13
CysSer: 0.68 ± 0.136
CysThr: 0.462 ± 0.114
CysVal: 0.544 ± 0.134
CysTrp: 0.19 ± 0.068
CysTyr: 0.326 ± 0.097
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.938 ± 0.305
AspCys: 0.653 ± 0.148
AspAsp: 3.781 ± 0.43
AspGlu: 5.304 ± 0.351
AspPhe: 3.237 ± 0.267
AspGly: 3.999 ± 0.377
AspHis: 0.979 ± 0.191
AspIle: 5.413 ± 0.345
AspLys: 5.767 ± 0.491
AspLeu: 5.631 ± 0.325
AspMet: 1.333 ± 0.174
AspAsn: 3.645 ± 0.313
AspPro: 1.986 ± 0.221
AspGln: 2.122 ± 0.192
AspArg: 1.823 ± 0.235
AspSer: 3.781 ± 0.262
AspThr: 2.53 ± 0.255
AspVal: 3.863 ± 0.363
AspTrp: 0.544 ± 0.119
AspTyr: 3.618 ± 0.274
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.108 ± 0.367
GluCys: 0.762 ± 0.145
GluAsp: 4.788 ± 0.327
GluGlu: 6.909 ± 0.435
GluPhe: 3.319 ± 0.308
GluGly: 3.89 ± 0.37
GluHis: 1.714 ± 0.267
GluIle: 7.018 ± 0.456
GluLys: 7.698 ± 0.525
GluLeu: 8.541 ± 0.463
GluMet: 2.448 ± 0.235
GluAsn: 4.38 ± 0.365
GluPro: 1.523 ± 0.213
GluGln: 3.373 ± 0.312
GluArg: 3.509 ± 0.295
GluSer: 3.808 ± 0.316
GluThr: 3.699 ± 0.325
GluVal: 5.549 ± 0.496
GluTrp: 1.197 ± 0.214
GluTyr: 3.781 ± 0.31
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.768 ± 0.201
PheCys: 0.408 ± 0.11
PheAsp: 3.101 ± 0.281
PheGlu: 3.808 ± 0.346
PhePhe: 2.067 ± 0.281
PheGly: 2.421 ± 0.24
PheHis: 0.952 ± 0.165
PheIle: 3.237 ± 0.282
PheLys: 4.543 ± 0.376
PheLeu: 2.992 ± 0.34
PheMet: 0.762 ± 0.17
PheAsn: 3.237 ± 0.289
PhePro: 1.251 ± 0.163
PheGln: 0.925 ± 0.155
PheArg: 1.741 ± 0.203
PheSer: 3.155 ± 0.284
PheThr: 2.122 ± 0.224
PheVal: 2.231 ± 0.326
PheTrp: 0.218 ± 0.082
PheTyr: 2.149 ± 0.251
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.394 ± 0.245
GlyCys: 0.571 ± 0.14
GlyAsp: 3.101 ± 0.284
GlyGlu: 4.271 ± 0.367
GlyPhe: 2.965 ± 0.233
GlyGly: 2.639 ± 0.257
GlyHis: 1.17 ± 0.197
GlyIle: 4.135 ± 0.376
GlyLys: 5.005 ± 0.347
GlyLeu: 4.543 ± 0.343
GlyMet: 1.251 ± 0.192
GlyAsn: 3.618 ± 0.341
GlyPro: 0.109 ± 0.055
GlyGln: 1.85 ± 0.214
GlyArg: 1.741 ± 0.208
GlySer: 3.645 ± 0.323
GlyThr: 3.291 ± 0.295
GlyVal: 3.4 ± 0.352
GlyTrp: 0.435 ± 0.103
GlyTyr: 2.775 ± 0.252
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.626 ± 0.11
HisCys: 0.272 ± 0.094
HisAsp: 0.952 ± 0.139
HisGlu: 1.659 ± 0.224
HisPhe: 0.68 ± 0.142
HisGly: 1.034 ± 0.165
HisHis: 0.435 ± 0.125
HisIle: 1.415 ± 0.249
HisLys: 1.959 ± 0.248
HisLeu: 1.496 ± 0.18
HisMet: 0.435 ± 0.162
HisAsn: 1.415 ± 0.212
HisPro: 0.734 ± 0.134
HisGln: 0.571 ± 0.137
HisArg: 0.925 ± 0.151
HisSer: 1.442 ± 0.226
HisThr: 1.115 ± 0.189
HisVal: 1.088 ± 0.153
HisTrp: 0.109 ± 0.045
HisTyr: 0.762 ± 0.149
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.917 ± 0.3
IleCys: 0.626 ± 0.13
IleAsp: 5.468 ± 0.346
IleGlu: 6.528 ± 0.427
IlePhe: 2.285 ± 0.257
IleGly: 4.108 ± 0.359
IleHis: 2.149 ± 0.267
IleIle: 4.216 ± 0.403
IleLys: 8.269 ± 0.533
IleLeu: 5.087 ± 0.407
IleMet: 1.251 ± 0.15
IleAsn: 5.903 ± 0.459
IlePro: 2.258 ± 0.217
IleGln: 2.421 ± 0.275
IleArg: 2.775 ± 0.29
IleSer: 5.413 ± 0.53
IleThr: 4.135 ± 0.37
IleVal: 4.271 ± 0.332
IleTrp: 0.952 ± 0.157
IleTyr: 2.176 ± 0.248
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.576 ± 0.381
LysCys: 1.115 ± 0.202
LysAsp: 5.549 ± 0.405
LysGlu: 8.949 ± 0.596
LysPhe: 3.482 ± 0.287
LysGly: 5.522 ± 0.447
LysHis: 1.741 ± 0.241
LysIle: 6.882 ± 0.449
LysLys: 10.092 ± 0.755
LysLeu: 8.46 ± 0.485
LysMet: 2.312 ± 0.289
LysAsn: 6.175 ± 0.422
LysPro: 2.285 ± 0.298
LysGln: 4.135 ± 0.402
LysArg: 4.516 ± 0.446
LysSer: 5.712 ± 0.461
LysThr: 5.087 ± 0.424
LysVal: 6.148 ± 0.414
LysTrp: 0.789 ± 0.149
LysTyr: 4.543 ± 0.401
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.216 ± 0.323
LeuCys: 0.789 ± 0.169
LeuAsp: 5.413 ± 0.375
LeuGlu: 6.175 ± 0.46
LeuPhe: 3.808 ± 0.381
LeuGly: 3.618 ± 0.318
LeuHis: 1.551 ± 0.175
LeuIle: 6.202 ± 0.413
LeuLys: 9.194 ± 0.572
LeuLeu: 7.317 ± 0.468
LeuMet: 2.122 ± 0.254
LeuAsn: 6.692 ± 0.48
LeuPro: 2.231 ± 0.256
LeuGln: 3.4 ± 0.391
LeuArg: 3.618 ± 0.37
LeuSer: 6.093 ± 0.417
LeuThr: 5.196 ± 0.404
LeuVal: 4.271 ± 0.351
LeuTrp: 0.789 ± 0.163
LeuTyr: 3.155 ± 0.403
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.415 ± 0.18
MetCys: 0.109 ± 0.062
MetAsp: 1.551 ± 0.176
MetGlu: 1.795 ± 0.203
MetPhe: 0.952 ± 0.193
MetGly: 1.197 ± 0.182
MetHis: 0.272 ± 0.083
MetIle: 1.632 ± 0.184
MetLys: 2.666 ± 0.252
MetLeu: 2.067 ± 0.271
MetMet: 0.544 ± 0.131
MetAsn: 1.605 ± 0.215
MetPro: 0.707 ± 0.125
MetGln: 0.843 ± 0.192
MetArg: 1.006 ± 0.156
MetSer: 1.768 ± 0.201
MetThr: 1.197 ± 0.177
MetVal: 0.952 ± 0.166
MetTrp: 0.272 ± 0.094
MetTyr: 0.626 ± 0.113
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.373 ± 0.398
AsnCys: 0.898 ± 0.174
AsnAsp: 3.645 ± 0.336
AsnGlu: 5.794 ± 0.389
AsnPhe: 2.639 ± 0.272
AsnGly: 3.591 ± 0.301
AsnHis: 1.034 ± 0.184
AsnIle: 4.624 ± 0.295
AsnLys: 7.181 ± 0.488
AsnLeu: 5.359 ± 0.354
AsnMet: 1.469 ± 0.166
AsnAsn: 4.162 ± 0.342
AsnPro: 1.823 ± 0.242
AsnGln: 2.04 ± 0.298
AsnArg: 2.666 ± 0.242
AsnSer: 4.189 ± 0.428
AsnThr: 3.101 ± 0.322
AsnVal: 3.509 ± 0.292
AsnTrp: 0.517 ± 0.117
AsnTyr: 2.503 ± 0.284
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.387 ± 0.187
ProCys: 0.163 ± 0.062
ProAsp: 2.258 ± 0.231
ProGlu: 2.013 ± 0.22
ProPhe: 1.061 ± 0.175
ProGly: 0.952 ± 0.168
ProHis: 0.952 ± 0.172
ProIle: 1.442 ± 0.192
ProLys: 2.611 ± 0.262
ProLeu: 2.475 ± 0.293
ProMet: 0.408 ± 0.109
ProAsn: 1.469 ± 0.178
ProPro: 0.598 ± 0.134
ProGln: 0.707 ± 0.115
ProArg: 0.843 ± 0.165
ProSer: 1.959 ± 0.214
ProThr: 1.496 ± 0.206
ProVal: 1.333 ± 0.21
ProTrp: 0.299 ± 0.095
ProTyr: 1.387 ± 0.197
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.258 ± 0.445
GlnCys: 0.272 ± 0.086
GlnAsp: 1.768 ± 0.221
GlnGlu: 3.101 ± 0.357
GlnPhe: 1.36 ± 0.175
GlnGly: 1.496 ± 0.257
GlnHis: 0.598 ± 0.151
GlnIle: 2.53 ± 0.287
GlnLys: 3.183 ± 0.417
GlnLeu: 3.754 ± 0.369
GlnMet: 1.17 ± 0.153
GlnAsn: 2.258 ± 0.31
GlnPro: 0.734 ± 0.136
GlnGln: 1.795 ± 0.459
GlnArg: 1.197 ± 0.155
GlnSer: 2.231 ± 0.223
GlnThr: 1.551 ± 0.215
GlnVal: 2.258 ± 0.249
GlnTrp: 0.462 ± 0.146
GlnTyr: 1.578 ± 0.201
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.632 ± 0.189
ArgCys: 0.245 ± 0.093
ArgAsp: 2.312 ± 0.231
ArgGlu: 3.264 ± 0.314
ArgPhe: 1.931 ± 0.21
ArgGly: 2.122 ± 0.307
ArgHis: 0.843 ± 0.169
ArgIle: 2.965 ± 0.294
ArgLys: 3.509 ± 0.286
ArgLeu: 3.455 ± 0.324
ArgMet: 1.115 ± 0.134
ArgAsn: 2.911 ± 0.301
ArgPro: 0.952 ± 0.17
ArgGln: 1.523 ± 0.206
ArgArg: 1.714 ± 0.248
ArgSer: 2.013 ± 0.278
ArgThr: 1.904 ± 0.231
ArgVal: 2.475 ± 0.272
ArgTrp: 0.435 ± 0.12
ArgTyr: 1.795 ± 0.214
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.155 ± 0.479
SerCys: 0.598 ± 0.128
SerAsp: 4.352 ± 0.367
SerGlu: 5.032 ± 0.357
SerPhe: 3.21 ± 0.286
SerGly: 3.373 ± 0.322
SerHis: 1.006 ± 0.165
SerIle: 4.978 ± 0.423
SerLys: 5.848 ± 0.425
SerLeu: 6.202 ± 0.397
SerMet: 1.469 ± 0.177
SerAsn: 4.108 ± 0.345
SerPro: 2.122 ± 0.248
SerGln: 2.04 ± 0.264
SerArg: 2.231 ± 0.231
SerSer: 5.277 ± 0.665
SerThr: 3.455 ± 0.418
SerVal: 3.944 ± 0.264
SerTrp: 0.68 ± 0.121
SerTyr: 2.339 ± 0.206
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.047 ± 0.405
ThrCys: 0.245 ± 0.073
ThrAsp: 2.938 ± 0.277
ThrGlu: 4.352 ± 0.314
ThrPhe: 1.931 ± 0.24
ThrGly: 3.509 ± 0.294
ThrHis: 0.68 ± 0.148
ThrIle: 3.835 ± 0.312
ThrLys: 4.543 ± 0.327
ThrLeu: 4.108 ± 0.298
ThrMet: 1.006 ± 0.144
ThrAsn: 2.611 ± 0.254
ThrPro: 1.904 ± 0.228
ThrGln: 2.095 ± 0.259
ThrArg: 1.85 ± 0.181
ThrSer: 3.563 ± 0.462
ThrThr: 2.938 ± 0.38
ThrVal: 4.298 ± 0.342
ThrTrp: 0.544 ± 0.102
ThrTyr: 2.883 ± 0.302
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.563 ± 0.293
ValCys: 0.68 ± 0.149
ValAsp: 3.89 ± 0.346
ValGlu: 4.652 ± 0.417
ValPhe: 2.503 ± 0.257
ValGly: 3.128 ± 0.324
ValHis: 1.197 ± 0.182
ValIle: 4.189 ± 0.344
ValLys: 5.903 ± 0.4
ValLeu: 4.733 ± 0.387
ValMet: 1.224 ± 0.199
ValAsn: 3.944 ± 0.32
ValPro: 1.659 ± 0.211
ValGln: 2.149 ± 0.298
ValArg: 2.312 ± 0.227
ValSer: 3.645 ± 0.3
ValThr: 3.645 ± 0.303
ValVal: 3.237 ± 0.351
ValTrp: 0.517 ± 0.1
ValTyr: 2.611 ± 0.288
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.408 ± 0.107
TrpCys: 0.218 ± 0.07
TrpAsp: 0.843 ± 0.137
TrpGlu: 0.925 ± 0.176
TrpPhe: 0.381 ± 0.095
TrpGly: 0.408 ± 0.098
TrpHis: 0.136 ± 0.057
TrpIle: 0.762 ± 0.143
TrpLys: 0.843 ± 0.157
TrpLeu: 0.843 ± 0.152
TrpMet: 0.163 ± 0.067
TrpAsn: 0.49 ± 0.133
TrpPro: 0.109 ± 0.056
TrpGln: 0.299 ± 0.091
TrpArg: 0.49 ± 0.126
TrpSer: 0.734 ± 0.127
TrpThr: 0.517 ± 0.103
TrpVal: 0.653 ± 0.123
TrpTrp: 0.082 ± 0.042
TrpTyr: 0.598 ± 0.132
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.496 ± 0.182
TyrCys: 0.462 ± 0.108
TyrAsp: 3.455 ± 0.302
TyrGlu: 3.672 ± 0.375
TyrPhe: 2.285 ± 0.255
TyrGly: 2.394 ± 0.264
TyrHis: 0.653 ± 0.148
TyrIle: 3.346 ± 0.281
TyrLys: 4.543 ± 0.352
TyrLeu: 3.89 ± 0.379
TyrMet: 1.006 ± 0.167
TyrAsn: 2.367 ± 0.304
TyrPro: 1.17 ± 0.177
TyrGln: 1.469 ± 0.183
TyrArg: 1.959 ± 0.242
TyrSer: 2.883 ± 0.291
TyrThr: 2.557 ± 0.242
TyrVal: 2.339 ± 0.282
TyrTrp: 0.381 ± 0.103
TyrTyr: 2.122 ± 0.275
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 185 proteins (36763 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
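Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is only a sketch under assumptions: the function names, the toy sequences, the use of the standard deviation as the reported error, and the normalisation by dipeptide positions are illustrative and not confirmed by the source.

```python
import random
from statistics import stdev


def dipeptide_permille(sequences, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    count = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the reported error is the standard deviation across
    bootstrap replicates; the page only states 'bootstrapping (x100)'.
    """
    rng = random.Random(seed)  # fixed seed for reproducibility
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, not single residues.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(resample, pair))
    return stdev(estimates)


# Toy input (hypothetical sequences in one-letter code).
toy = ["MKKLA", "MKA", "MKKKE", "MLAKK"]
value = dipeptide_permille(toy, "KK")
error = bootstrap_error(toy, "KK")
print(f"KK: {value:.3f} ± {error:.3f}")
```

Resampling at the protein level keeps the dipeptides of one protein together, so the error reflects variation between proteins rather than between individual positions.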

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski