Amino acid dipepetide frequency for Bacillus phage BSP38

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.292AlaAla: 5.292 ± 0.522
0.37AlaCys: 0.37 ± 0.1
4.029AlaAsp: 4.029 ± 0.293
4.922AlaGlu: 4.922 ± 0.33
2.309AlaPhe: 2.309 ± 0.226
4.116AlaGly: 4.116 ± 0.444
0.98AlaHis: 0.98 ± 0.149
4.029AlaIle: 4.029 ± 0.359
4.791AlaLys: 4.791 ± 0.349
6.055AlaLeu: 6.055 ± 0.428
1.655AlaMet: 1.655 ± 0.211
2.701AlaAsn: 2.701 ± 0.254
2.505AlaPro: 2.505 ± 0.266
2.483AlaGln: 2.483 ± 0.299
2.548AlaArg: 2.548 ± 0.252
3.245AlaSer: 3.245 ± 0.313
3.833AlaThr: 3.833 ± 0.353
4.225AlaVal: 4.225 ± 0.352
0.675AlaTrp: 0.675 ± 0.12
2.809AlaTyr: 2.809 ± 0.282
0.022AlaXaa: 0.022 ± 0.018
Cys
0.392CysAla: 0.392 ± 0.106
0.065CysCys: 0.065 ± 0.036
0.479CysAsp: 0.479 ± 0.097
0.588CysGlu: 0.588 ± 0.13
0.392CysPhe: 0.392 ± 0.09
0.501CysGly: 0.501 ± 0.128
0.109CysHis: 0.109 ± 0.054
0.632CysIle: 0.632 ± 0.124
0.653CysLys: 0.653 ± 0.128
0.457CysLeu: 0.457 ± 0.116
0.327CysMet: 0.327 ± 0.09
0.544CysAsn: 0.544 ± 0.133
0.414CysPro: 0.414 ± 0.096
0.218CysGln: 0.218 ± 0.072
0.196CysArg: 0.196 ± 0.075
0.501CysSer: 0.501 ± 0.117
0.392CysThr: 0.392 ± 0.097
0.37CysVal: 0.37 ± 0.096
0.044CysTrp: 0.044 ± 0.028
0.414CysTyr: 0.414 ± 0.101
0.0CysXaa: 0.0 ± 0.0
Asp
4.094AspAla: 4.094 ± 0.3
0.479AspCys: 0.479 ± 0.136
3.245AspAsp: 3.245 ± 0.314
4.987AspGlu: 4.987 ± 0.302
3.093AspPhe: 3.093 ± 0.296
4.269AspGly: 4.269 ± 0.404
0.958AspHis: 0.958 ± 0.139
4.835AspIle: 4.835 ± 0.302
4.77AspLys: 4.77 ± 0.351
5.924AspLeu: 5.924 ± 0.379
2.243AspMet: 2.243 ± 0.217
3.093AspAsn: 3.093 ± 0.241
2.025AspPro: 2.025 ± 0.258
2.134AspGln: 2.134 ± 0.23
3.093AspArg: 3.093 ± 0.236
3.92AspSer: 3.92 ± 0.32
4.182AspThr: 4.182 ± 0.303
4.443AspVal: 4.443 ± 0.419
0.871AspTrp: 0.871 ± 0.129
3.354AspTyr: 3.354 ± 0.247
0.0AspXaa: 0.0 ± 0.0
Glu
6.055GluAla: 6.055 ± 0.369
0.566GluCys: 0.566 ± 0.106
5.51GluAsp: 5.51 ± 0.318
10.389GluGlu: 10.389 ± 0.836
2.875GluPhe: 2.875 ± 0.231
5.554GluGly: 5.554 ± 0.368
1.633GluHis: 1.633 ± 0.202
5.096GluIle: 5.096 ± 0.327
5.859GluLys: 5.859 ± 0.469
7.928GluLeu: 7.928 ± 0.532
2.592GluMet: 2.592 ± 0.275
4.029GluAsn: 4.029 ± 0.33
2.091GluPro: 2.091 ± 0.249
2.94GluGln: 2.94 ± 0.312
3.049GluArg: 3.049 ± 0.268
3.79GluSer: 3.79 ± 0.284
3.267GluThr: 3.267 ± 0.262
5.88GluVal: 5.88 ± 0.405
1.045GluTrp: 1.045 ± 0.149
3.376GluTyr: 3.376 ± 0.284
0.0GluXaa: 0.0 ± 0.0
Phe
1.917PheAla: 1.917 ± 0.227
0.457PheCys: 0.457 ± 0.107
2.352PheAsp: 2.352 ± 0.213
2.809PheGlu: 2.809 ± 0.267
1.35PhePhe: 1.35 ± 0.181
2.352PheGly: 2.352 ± 0.237
0.74PheHis: 0.74 ± 0.139
2.548PheIle: 2.548 ± 0.223
2.657PheLys: 2.657 ± 0.247
3.071PheLeu: 3.071 ± 0.291
1.002PheMet: 1.002 ± 0.141
2.505PheAsn: 2.505 ± 0.24
1.394PhePro: 1.394 ± 0.163
1.394PheGln: 1.394 ± 0.208
1.59PheArg: 1.59 ± 0.161
2.918PheSer: 2.918 ± 0.24
2.57PheThr: 2.57 ± 0.285
2.505PheVal: 2.505 ± 0.23
0.392PheTrp: 0.392 ± 0.099
1.655PheTyr: 1.655 ± 0.213
0.0PheXaa: 0.0 ± 0.0
Gly
4.051GlyAla: 4.051 ± 0.541
0.719GlyCys: 0.719 ± 0.14
3.768GlyAsp: 3.768 ± 0.323
4.639GlyGlu: 4.639 ± 0.328
2.374GlyPhe: 2.374 ± 0.232
4.987GlyGly: 4.987 ± 0.664
1.067GlyHis: 1.067 ± 0.188
3.986GlyIle: 3.986 ± 0.366
4.704GlyLys: 4.704 ± 0.291
5.096GlyLeu: 5.096 ± 0.355
1.546GlyMet: 1.546 ± 0.181
3.18GlyAsn: 3.18 ± 0.285
0.74GlyPro: 0.74 ± 0.125
2.221GlyGln: 2.221 ± 0.259
2.592GlyArg: 2.592 ± 0.235
4.16GlySer: 4.16 ± 0.333
3.964GlyThr: 3.964 ± 0.35
5.183GlyVal: 5.183 ± 0.281
0.871GlyTrp: 0.871 ± 0.121
3.223GlyTyr: 3.223 ± 0.342
0.022GlyXaa: 0.022 ± 0.018
His
0.915HisAla: 0.915 ± 0.16
0.174HisCys: 0.174 ± 0.066
1.002HisAsp: 1.002 ± 0.16
1.111HisGlu: 1.111 ± 0.173
0.784HisPhe: 0.784 ± 0.139
1.111HisGly: 1.111 ± 0.164
0.305HisHis: 0.305 ± 0.084
1.394HisIle: 1.394 ± 0.2
1.35HisLys: 1.35 ± 0.186
1.699HisLeu: 1.699 ± 0.231
0.588HisMet: 0.588 ± 0.111
0.784HisAsn: 0.784 ± 0.129
0.588HisPro: 0.588 ± 0.135
0.544HisGln: 0.544 ± 0.1
0.893HisArg: 0.893 ± 0.139
1.111HisSer: 1.111 ± 0.178
0.893HisThr: 0.893 ± 0.132
1.154HisVal: 1.154 ± 0.171
0.24HisTrp: 0.24 ± 0.076
0.958HisTyr: 0.958 ± 0.142
0.0HisXaa: 0.0 ± 0.0
Ile
4.443IleAla: 4.443 ± 0.361
0.436IleCys: 0.436 ± 0.108
4.944IleAsp: 4.944 ± 0.389
5.379IleGlu: 5.379 ± 0.365
1.917IlePhe: 1.917 ± 0.237
3.485IleGly: 3.485 ± 0.277
1.198IleHis: 1.198 ± 0.167
3.637IleIle: 3.637 ± 0.296
5.074IleLys: 5.074 ± 0.339
4.16IleLeu: 4.16 ± 0.363
1.764IleMet: 1.764 ± 0.182
3.376IleAsn: 3.376 ± 0.281
2.2IlePro: 2.2 ± 0.207
2.265IleGln: 2.265 ± 0.214
3.114IleArg: 3.114 ± 0.23
4.007IleSer: 4.007 ± 0.251
4.465IleThr: 4.465 ± 0.43
3.986IleVal: 3.986 ± 0.332
0.457IleTrp: 0.457 ± 0.102
1.982IleTyr: 1.982 ± 0.231
0.0IleXaa: 0.0 ± 0.0
Lys
5.14LysAla: 5.14 ± 0.442
0.479LysCys: 0.479 ± 0.118
4.878LysAsp: 4.878 ± 0.351
7.187LysGlu: 7.187 ± 0.453
2.57LysPhe: 2.57 ± 0.231
5.227LysGly: 5.227 ± 0.322
1.416LysHis: 1.416 ± 0.175
3.811LysIle: 3.811 ± 0.312
6.599LysLys: 6.599 ± 0.526
5.663LysLeu: 5.663 ± 0.416
2.417LysMet: 2.417 ± 0.241
3.637LysAsn: 3.637 ± 0.255
2.134LysPro: 2.134 ± 0.235
2.548LysGln: 2.548 ± 0.225
3.833LysArg: 3.833 ± 0.346
4.748LysSer: 4.748 ± 0.459
3.702LysThr: 3.702 ± 0.294
5.488LysVal: 5.488 ± 0.362
0.697LysTrp: 0.697 ± 0.151
2.962LysTyr: 2.962 ± 0.264
0.0LysXaa: 0.0 ± 0.0
Leu
5.053LeuAla: 5.053 ± 0.337
0.784LeuCys: 0.784 ± 0.152
6.294LeuAsp: 6.294 ± 0.383
7.405LeuGlu: 7.405 ± 0.504
3.332LeuPhe: 3.332 ± 0.293
4.269LeuGly: 4.269 ± 0.306
1.568LeuHis: 1.568 ± 0.198
4.465LeuIle: 4.465 ± 0.358
6.381LeuLys: 6.381 ± 0.439
6.839LeuLeu: 6.839 ± 0.54
2.33LeuMet: 2.33 ± 0.205
4.639LeuAsn: 4.639 ± 0.297
2.962LeuPro: 2.962 ± 0.329
3.158LeuGln: 3.158 ± 0.315
4.051LeuArg: 4.051 ± 0.347
6.011LeuSer: 6.011 ± 0.356
5.336LeuThr: 5.336 ± 0.293
5.118LeuVal: 5.118 ± 0.409
0.762LeuTrp: 0.762 ± 0.129
3.637LeuTyr: 3.637 ± 0.276
0.0LeuXaa: 0.0 ± 0.0
Met
1.633MetAla: 1.633 ± 0.156
0.218MetCys: 0.218 ± 0.077
1.982MetAsp: 1.982 ± 0.204
2.221MetGlu: 2.221 ± 0.243
0.915MetPhe: 0.915 ± 0.139
1.241MetGly: 1.241 ± 0.15
0.305MetHis: 0.305 ± 0.076
1.437MetIle: 1.437 ± 0.231
2.483MetLys: 2.483 ± 0.231
2.134MetLeu: 2.134 ± 0.243
0.523MetMet: 0.523 ± 0.117
1.699MetAsn: 1.699 ± 0.208
0.849MetPro: 0.849 ± 0.155
1.045MetGln: 1.045 ± 0.158
0.98MetArg: 0.98 ± 0.149
2.417MetSer: 2.417 ± 0.216
2.091MetThr: 2.091 ± 0.209
1.329MetVal: 1.329 ± 0.166
0.305MetTrp: 0.305 ± 0.087
1.111MetTyr: 1.111 ± 0.166
0.0MetXaa: 0.0 ± 0.0
Asn
2.505AsnAla: 2.505 ± 0.242
0.392AsnCys: 0.392 ± 0.096
2.897AsnAsp: 2.897 ± 0.272
3.31AsnGlu: 3.31 ± 0.28
1.568AsnPhe: 1.568 ± 0.197
3.594AsnGly: 3.594 ± 0.323
1.111AsnHis: 1.111 ± 0.162
3.768AsnIle: 3.768 ± 0.334
3.746AsnLys: 3.746 ± 0.266
4.225AsnLeu: 4.225 ± 0.278
1.154AsnMet: 1.154 ± 0.153
3.005AsnAsn: 3.005 ± 0.339
2.526AsnPro: 2.526 ± 0.286
1.655AsnGln: 1.655 ± 0.231
2.722AsnArg: 2.722 ± 0.215
3.201AsnSer: 3.201 ± 0.297
3.659AsnThr: 3.659 ± 0.306
3.201AsnVal: 3.201 ± 0.28
0.479AsnTrp: 0.479 ± 0.1
1.938AsnTyr: 1.938 ± 0.218
0.0AsnXaa: 0.0 ± 0.0
Pro
2.33ProAla: 2.33 ± 0.252
0.196ProCys: 0.196 ± 0.068
2.766ProAsp: 2.766 ± 0.234
3.289ProGlu: 3.289 ± 0.412
1.154ProPhe: 1.154 ± 0.167
0.915ProGly: 0.915 ± 0.142
0.566ProHis: 0.566 ± 0.115
2.047ProIle: 2.047 ± 0.208
2.265ProLys: 2.265 ± 0.205
2.657ProLeu: 2.657 ± 0.254
0.806ProMet: 0.806 ± 0.139
1.808ProAsn: 1.808 ± 0.173
1.22ProPro: 1.22 ± 0.192
1.241ProGln: 1.241 ± 0.392
1.198ProArg: 1.198 ± 0.129
1.786ProSer: 1.786 ± 0.217
2.178ProThr: 2.178 ± 0.267
2.004ProVal: 2.004 ± 0.217
0.131ProTrp: 0.131 ± 0.057
1.699ProTyr: 1.699 ± 0.151
0.0ProXaa: 0.0 ± 0.0
Gln
2.439GlnAla: 2.439 ± 0.272
0.196GlnCys: 0.196 ± 0.057
2.2GlnAsp: 2.2 ± 0.212
3.441GlnGlu: 3.441 ± 0.312
1.437GlnPhe: 1.437 ± 0.2
2.156GlnGly: 2.156 ± 0.201
0.523GlnHis: 0.523 ± 0.121
1.808GlnIle: 1.808 ± 0.193
2.875GlnLys: 2.875 ± 0.288
3.049GlnLeu: 3.049 ± 0.216
1.285GlnMet: 1.285 ± 0.177
1.568GlnAsn: 1.568 ± 0.203
1.241GlnPro: 1.241 ± 0.289
2.025GlnGln: 2.025 ± 0.368
1.808GlnArg: 1.808 ± 0.161
2.526GlnSer: 2.526 ± 0.297
1.655GlnThr: 1.655 ± 0.173
2.352GlnVal: 2.352 ± 0.238
0.436GlnTrp: 0.436 ± 0.09
1.503GlnTyr: 1.503 ± 0.171
0.0GlnXaa: 0.0 ± 0.0
Arg
2.592ArgAla: 2.592 ± 0.272
0.24ArgCys: 0.24 ± 0.076
2.548ArgAsp: 2.548 ± 0.229
3.898ArgGlu: 3.898 ± 0.259
1.96ArgPhe: 1.96 ± 0.182
3.093ArgGly: 3.093 ± 0.283
0.784ArgHis: 0.784 ± 0.11
2.809ArgIle: 2.809 ± 0.253
3.441ArgLys: 3.441 ± 0.286
4.399ArgLeu: 4.399 ± 0.33
1.329ArgMet: 1.329 ± 0.168
1.721ArgAsn: 1.721 ± 0.235
1.067ArgPro: 1.067 ± 0.15
1.808ArgGln: 1.808 ± 0.195
1.895ArgArg: 1.895 ± 0.233
2.265ArgSer: 2.265 ± 0.214
2.875ArgThr: 2.875 ± 0.267
3.615ArgVal: 3.615 ± 0.259
0.457ArgTrp: 0.457 ± 0.085
2.025ArgTyr: 2.025 ± 0.196
0.0ArgXaa: 0.0 ± 0.0
Ser
3.833SerAla: 3.833 ± 0.323
0.523SerCys: 0.523 ± 0.151
3.986SerAsp: 3.986 ± 0.325
4.29SerGlu: 4.29 ± 0.317
3.071SerPhe: 3.071 ± 0.243
4.443SerGly: 4.443 ± 0.353
0.936SerHis: 0.936 ± 0.137
4.029SerIle: 4.029 ± 0.309
4.857SerLys: 4.857 ± 0.34
4.639SerLeu: 4.639 ± 0.266
1.721SerMet: 1.721 ± 0.165
2.635SerAsn: 2.635 ± 0.299
1.655SerPro: 1.655 ± 0.183
2.178SerGln: 2.178 ± 0.247
2.526SerArg: 2.526 ± 0.295
5.249SerSer: 5.249 ± 0.53
3.898SerThr: 3.898 ± 0.368
4.356SerVal: 4.356 ± 0.289
0.784SerTrp: 0.784 ± 0.139
2.875SerTyr: 2.875 ± 0.229
0.0SerXaa: 0.0 ± 0.0
Thr
3.724ThrAla: 3.724 ± 0.385
0.348ThrCys: 0.348 ± 0.08
4.247ThrAsp: 4.247 ± 0.39
4.247ThrGlu: 4.247 ± 0.267
2.2ThrPhe: 2.2 ± 0.264
4.334ThrGly: 4.334 ± 0.386
1.133ThrHis: 1.133 ± 0.159
3.811ThrIle: 3.811 ± 0.382
3.898ThrLys: 3.898 ± 0.303
5.967ThrLeu: 5.967 ± 0.395
1.067ThrMet: 1.067 ± 0.132
3.049ThrAsn: 3.049 ± 0.302
2.505ThrPro: 2.505 ± 0.268
2.33ThrGln: 2.33 ± 0.227
2.875ThrArg: 2.875 ± 0.262
3.114ThrSer: 3.114 ± 0.291
4.051ThrThr: 4.051 ± 0.413
5.183ThrVal: 5.183 ± 0.425
0.544ThrTrp: 0.544 ± 0.097
2.766ThrTyr: 2.766 ± 0.297
0.0ThrXaa: 0.0 ± 0.0
Val
4.116ValAla: 4.116 ± 0.263
0.544ValCys: 0.544 ± 0.118
5.118ValAsp: 5.118 ± 0.397
5.292ValGlu: 5.292 ± 0.401
2.809ValPhe: 2.809 ± 0.291
3.746ValGly: 3.746 ± 0.277
1.394ValHis: 1.394 ± 0.162
4.378ValIle: 4.378 ± 0.312
5.031ValLys: 5.031 ± 0.35
5.401ValLeu: 5.401 ± 0.365
1.372ValMet: 1.372 ± 0.189
3.681ValAsn: 3.681 ± 0.342
2.635ValPro: 2.635 ± 0.262
2.243ValGln: 2.243 ± 0.233
3.158ValArg: 3.158 ± 0.265
4.617ValSer: 4.617 ± 0.408
4.704ValThr: 4.704 ± 0.394
4.661ValVal: 4.661 ± 0.322
0.479ValTrp: 0.479 ± 0.098
3.572ValTyr: 3.572 ± 0.29
0.0ValXaa: 0.0 ± 0.0
Trp
0.479TrpAla: 0.479 ± 0.107
0.131TrpCys: 0.131 ± 0.049
0.74TrpAsp: 0.74 ± 0.135
1.045TrpGlu: 1.045 ± 0.134
0.523TrpPhe: 0.523 ± 0.121
0.784TrpGly: 0.784 ± 0.149
0.174TrpHis: 0.174 ± 0.061
0.632TrpIle: 0.632 ± 0.111
0.719TrpLys: 0.719 ± 0.105
0.871TrpLeu: 0.871 ± 0.124
0.109TrpMet: 0.109 ± 0.048
0.392TrpAsn: 0.392 ± 0.079
0.0TrpPro: 0.0 ± 0.0
0.436TrpGln: 0.436 ± 0.09
0.392TrpArg: 0.392 ± 0.093
0.588TrpSer: 0.588 ± 0.114
0.61TrpThr: 0.61 ± 0.097
0.936TrpVal: 0.936 ± 0.151
0.261TrpTrp: 0.261 ± 0.089
0.479TrpTyr: 0.479 ± 0.124
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.592TyrAla: 2.592 ± 0.204
0.37TyrCys: 0.37 ± 0.084
3.027TyrAsp: 3.027 ± 0.288
3.071TyrGlu: 3.071 ± 0.232
1.525TyrPhe: 1.525 ± 0.175
2.918TyrGly: 2.918 ± 0.261
0.697TyrHis: 0.697 ± 0.117
3.158TyrIle: 3.158 ± 0.259
2.984TyrLys: 2.984 ± 0.254
4.16TyrLeu: 4.16 ± 0.313
1.045TyrMet: 1.045 ± 0.138
2.613TyrAsn: 2.613 ± 0.239
1.503TyrPro: 1.503 ± 0.202
1.655TyrGln: 1.655 ± 0.178
2.309TyrArg: 2.309 ± 0.225
2.309TyrSer: 2.309 ± 0.248
3.071TyrThr: 3.071 ± 0.266
2.962TyrVal: 2.962 ± 0.314
0.37TyrTrp: 0.37 ± 0.082
1.808TyrTyr: 1.808 ± 0.222
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.022XaaPhe: 0.022 ± 0.018
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.022XaaMet: 0.022 ± 0.018
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 254 proteins (45917 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski