Amino acid dipeptide frequency for Bacillus phage BC01

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that are more frequent than expected are marked in red and underrepresented ones are marked in blue in the table.
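The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own code: the function names, the one-letter input alphabet, the use of overlapping residue pairs, and the normalization by the total number of pairs are all assumptions made here. The 2.5‰ threshold is simply the uniform expectation 1/400 mentioned in the text.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; anything else is treated as X (Xaa).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 dipeptides, expressed in per mille (0.25%).
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping adjacent pairs are counted within each protein
    and normalized by the total number of pairs observed.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > UNIFORM_EXPECTATION:
        return "over"
    if freq_per_mille < UNIFORM_EXPECTATION:
        return "under"
    return "expected"
```

With the table values, `classify(5.286)` (AlaAla) gives "over" and `classify(0.434)` (AlaCys) gives "under".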

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
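As a small illustration of the unit conversion, the sketch below reads one table entry containing text of the form `AlaAla: 5.286 ± 0.393` and returns the value and its error as plain fractions. The function name and the regular expression are assumptions made for this example; they are not part of the database.

```python
import re

# Matches e.g. "AlaAla: 5.286 ± 0.393" anywhere in a line.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([0-9.]+) ± ([0-9.]+)")


def parse_entry(line):
    """Return (first, second, fraction, error_fraction) for one table entry."""
    match = ENTRY.search(line)
    if match is None:
        raise ValueError(f"not a dipeptide entry: {line!r}")
    first, second, value, error = match.groups()
    # Per mille to fraction: multiply by 10^-3.
    return first, second, float(value) / 1000.0, float(error) / 1000.0
```

For example, the AlaAla value of 5.286‰ corresponds to a fraction of about 0.005286.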

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.286 ± 0.393
AlaCys: 0.434 ± 0.102
AlaAsp: 3.649 ± 0.274
AlaGlu: 4.379 ± 0.377
AlaPhe: 2.722 ± 0.216
AlaGly: 3.787 ± 0.478
AlaHis: 0.947 ± 0.139
AlaIle: 4.241 ± 0.346
AlaLys: 4.892 ± 0.327
AlaLeu: 5.444 ± 0.311
AlaMet: 1.894 ± 0.185
AlaAsn: 2.959 ± 0.33
AlaPro: 2.683 ± 0.488
AlaGln: 2.643 ± 0.262
AlaArg: 2.9 ± 0.278
AlaSer: 3.689 ± 0.336
AlaThr: 4.261 ± 0.407
AlaVal: 3.807 ± 0.276
AlaTrp: 0.986 ± 0.159
AlaTyr: 2.466 ± 0.198
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.473 ± 0.086
CysCys: 0.375 ± 0.095
CysAsp: 0.493 ± 0.089
CysGlu: 0.335 ± 0.084
CysPhe: 0.631 ± 0.115
CysGly: 0.769 ± 0.168
CysHis: 0.335 ± 0.078
CysIle: 0.572 ± 0.11
CysLys: 0.967 ± 0.17
CysLeu: 0.769 ± 0.128
CysMet: 0.395 ± 0.082
CysAsn: 0.73 ± 0.138
CysPro: 0.533 ± 0.132
CysGln: 0.158 ± 0.06
CysArg: 0.414 ± 0.091
CysSer: 0.848 ± 0.177
CysThr: 0.809 ± 0.136
CysVal: 0.671 ± 0.122
CysTrp: 0.099 ± 0.044
CysTyr: 0.572 ± 0.112
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.274 ± 0.242
AspCys: 0.533 ± 0.11
AspAsp: 2.702 ± 0.279
AspGlu: 4.34 ± 0.285
AspPhe: 2.466 ± 0.251
AspGly: 3.472 ± 0.302
AspHis: 0.927 ± 0.112
AspIle: 4.675 ± 0.298
AspLys: 5.03 ± 0.396
AspLeu: 4.537 ± 0.287
AspMet: 1.44 ± 0.188
AspAsn: 2.979 ± 0.264
AspPro: 1.795 ± 0.193
AspGln: 1.184 ± 0.169
AspArg: 2.702 ± 0.236
AspSer: 3.097 ± 0.266
AspThr: 2.781 ± 0.283
AspVal: 3.728 ± 0.304
AspTrp: 0.828 ± 0.112
AspTyr: 2.643 ± 0.224
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.689 ± 0.302
GluCys: 0.69 ± 0.117
GluAsp: 3.906 ± 0.289
GluGlu: 7.318 ± 0.738
GluPhe: 2.584 ± 0.252
GluGly: 4.655 ± 0.371
GluHis: 1.203 ± 0.154
GluIle: 4.261 ± 0.305
GluLys: 6.095 ± 0.46
GluLeu: 6.509 ± 0.42
GluMet: 2.17 ± 0.207
GluAsn: 3.136 ± 0.296
GluPro: 2.091 ± 0.367
GluGln: 2.821 ± 0.237
GluArg: 3.215 ± 0.285
GluSer: 2.998 ± 0.263
GluThr: 3.59 ± 0.263
GluVal: 5.089 ± 0.442
GluTrp: 0.986 ± 0.13
GluTyr: 3.136 ± 0.247
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.347 ± 0.214
PheCys: 0.631 ± 0.144
PheAsp: 2.643 ± 0.249
PheGlu: 2.268 ± 0.219
PhePhe: 1.775 ± 0.244
PheGly: 2.407 ± 0.183
PheHis: 0.848 ± 0.149
PheIle: 2.545 ± 0.213
PheLys: 2.821 ± 0.271
PheLeu: 3.827 ± 0.295
PheMet: 1.085 ± 0.134
PheAsn: 2.268 ± 0.217
PhePro: 1.361 ± 0.169
PheGln: 1.499 ± 0.197
PheArg: 1.716 ± 0.191
PheSer: 3.511 ± 0.299
PheThr: 3.077 ± 0.271
PheVal: 3.057 ± 0.298
PheTrp: 0.316 ± 0.091
PheTyr: 1.795 ± 0.193
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.044 ± 0.537
GlyCys: 0.828 ± 0.149
GlyAsp: 2.939 ± 0.248
GlyGlu: 3.629 ± 0.271
GlyPhe: 2.683 ± 0.244
GlyGly: 5.247 ± 0.871
GlyHis: 1.203 ± 0.175
GlyIle: 4.004 ± 0.332
GlyLys: 5.346 ± 0.387
GlyLeu: 4.892 ± 0.32
GlyMet: 1.874 ± 0.265
GlyAsn: 3.629 ± 0.352
GlyPro: 0.0 ± 0.0
GlyGln: 2.091 ± 0.209
GlyArg: 2.407 ± 0.209
GlySer: 4.418 ± 0.49
GlyThr: 5.01 ± 0.451
GlyVal: 4.596 ± 0.327
GlyTrp: 0.986 ± 0.148
GlyTyr: 2.86 ± 0.266
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.986 ± 0.148
HisCys: 0.296 ± 0.07
HisAsp: 0.967 ± 0.141
HisGlu: 1.105 ± 0.135
HisPhe: 0.828 ± 0.158
HisGly: 1.184 ± 0.167
HisHis: 0.473 ± 0.114
HisIle: 1.045 ± 0.141
HisLys: 1.144 ± 0.154
HisLeu: 1.854 ± 0.224
HisMet: 0.375 ± 0.09
HisAsn: 1.006 ± 0.142
HisPro: 0.888 ± 0.129
HisGln: 0.454 ± 0.079
HisArg: 1.184 ± 0.183
HisSer: 1.243 ± 0.148
HisThr: 1.085 ± 0.136
HisVal: 1.42 ± 0.18
HisTrp: 0.217 ± 0.077
HisTyr: 0.947 ± 0.147
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.202 ± 0.29
IleCys: 0.769 ± 0.118
IleAsp: 4.32 ± 0.407
IleGlu: 4.635 ± 0.339
IlePhe: 2.268 ± 0.233
IleGly: 3.827 ± 0.29
IleHis: 1.085 ± 0.142
IleIle: 4.103 ± 0.345
IleLys: 4.813 ± 0.306
IleLeu: 4.616 ± 0.374
IleMet: 1.756 ± 0.196
IleAsn: 3.59 ± 0.248
IlePro: 2.979 ± 0.259
IleGln: 2.288 ± 0.164
IleArg: 3.077 ± 0.267
IleSer: 4.872 ± 0.356
IleThr: 4.714 ± 0.37
IleVal: 4.458 ± 0.297
IleTrp: 0.513 ± 0.117
IleTyr: 2.328 ± 0.207
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.208 ± 0.376
LysCys: 0.69 ± 0.12
LysAsp: 4.458 ± 0.362
LysGlu: 6.825 ± 0.43
LysPhe: 2.88 ± 0.238
LysGly: 4.833 ± 0.391
LysHis: 1.598 ± 0.164
LysIle: 4.32 ± 0.282
LysLys: 6.135 ± 0.493
LysLeu: 6.253 ± 0.383
LysMet: 2.525 ± 0.237
LysAsn: 3.787 ± 0.29
LysPro: 2.702 ± 0.203
LysGln: 3.728 ± 0.33
LysArg: 3.846 ± 0.33
LysSer: 4.123 ± 0.323
LysThr: 3.334 ± 0.238
LysVal: 5.523 ± 0.454
LysTrp: 0.848 ± 0.133
LysTyr: 3.077 ± 0.296
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.72 ± 0.345
LeuCys: 0.73 ± 0.136
LeuAsp: 5.05 ± 0.277
LeuGlu: 6.371 ± 0.394
LeuPhe: 3.511 ± 0.261
LeuGly: 4.517 ± 0.3
LeuHis: 1.44 ± 0.182
LeuIle: 4.951 ± 0.362
LeuLys: 6.194 ± 0.411
LeuLeu: 6.746 ± 0.451
LeuMet: 2.288 ± 0.24
LeuAsn: 3.925 ± 0.284
LeuPro: 3.551 ± 0.258
LeuGln: 3.057 ± 0.284
LeuArg: 4.103 ± 0.327
LeuSer: 5.74 ± 0.358
LeuThr: 5.602 ± 0.322
LeuVal: 5.641 ± 0.416
LeuTrp: 0.789 ± 0.141
LeuTyr: 3.57 ± 0.242
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.992 ± 0.185
MetCys: 0.217 ± 0.066
MetAsp: 1.381 ± 0.179
MetGlu: 1.913 ± 0.195
MetPhe: 1.026 ± 0.138
MetGly: 1.341 ± 0.192
MetHis: 0.473 ± 0.098
MetIle: 1.953 ± 0.195
MetLys: 2.683 ± 0.219
MetLeu: 2.308 ± 0.245
MetMet: 0.473 ± 0.101
MetAsn: 1.637 ± 0.17
MetPro: 0.651 ± 0.126
MetGln: 0.848 ± 0.126
MetArg: 1.44 ± 0.185
MetSer: 2.13 ± 0.239
MetThr: 1.834 ± 0.201
MetVal: 1.657 ± 0.185
MetTrp: 0.335 ± 0.087
MetTyr: 1.184 ± 0.159
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.452 ± 0.268
AsnCys: 0.69 ± 0.152
AsnAsp: 2.288 ± 0.21
AsnGlu: 3.235 ± 0.224
AsnPhe: 1.795 ± 0.203
AsnGly: 4.497 ± 0.334
AsnHis: 1.085 ± 0.133
AsnIle: 3.768 ± 0.281
AsnLys: 3.846 ± 0.237
AsnLeu: 4.103 ± 0.314
AsnMet: 1.756 ± 0.201
AsnAsn: 2.9 ± 0.317
AsnPro: 2.762 ± 0.328
AsnGln: 1.736 ± 0.182
AsnArg: 2.821 ± 0.231
AsnSer: 2.88 ± 0.284
AsnThr: 3.432 ± 0.246
AsnVal: 3.57 ± 0.275
AsnTrp: 0.592 ± 0.099
AsnTyr: 2.387 ± 0.174
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.268 ± 0.286
ProCys: 0.473 ± 0.115
ProAsp: 1.795 ± 0.198
ProGlu: 2.19 ± 0.306
ProPhe: 1.479 ± 0.181
ProGly: 1.933 ± 0.271
ProHis: 0.809 ± 0.137
ProIle: 2.15 ± 0.216
ProLys: 2.505 ± 0.223
ProLeu: 2.86 ± 0.274
ProMet: 0.927 ± 0.135
ProAsn: 2.15 ± 0.239
ProPro: 1.302 ± 0.168
ProGln: 1.558 ± 0.221
ProArg: 1.401 ± 0.144
ProSer: 2.485 ± 0.222
ProThr: 2.801 ± 0.32
ProVal: 2.623 ± 0.304
ProTrp: 0.237 ± 0.074
ProTyr: 1.677 ± 0.236
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.663 ± 0.241
GlnCys: 0.256 ± 0.077
GlnAsp: 1.736 ± 0.176
GlnGlu: 3.294 ± 0.295
GlnPhe: 1.302 ± 0.181
GlnGly: 2.249 ± 0.247
GlnHis: 0.552 ± 0.099
GlnIle: 2.15 ± 0.193
GlnLys: 2.407 ± 0.224
GlnLeu: 2.762 ± 0.212
GlnMet: 1.184 ± 0.158
GlnAsn: 1.44 ± 0.159
GlnPro: 1.44 ± 0.229
GlnGln: 2.091 ± 0.251
GlnArg: 1.499 ± 0.19
GlnSer: 2.268 ± 0.218
GlnThr: 2.051 ± 0.22
GlnVal: 2.781 ± 0.256
GlnTrp: 0.493 ± 0.109
GlnTyr: 1.223 ± 0.155
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.407 ± 0.266
ArgCys: 0.552 ± 0.106
ArgAsp: 2.466 ± 0.236
ArgGlu: 3.077 ± 0.272
ArgPhe: 2.209 ± 0.187
ArgGly: 2.742 ± 0.23
ArgHis: 0.651 ± 0.116
ArgIle: 2.821 ± 0.247
ArgLys: 3.846 ± 0.311
ArgLeu: 4.517 ± 0.307
ArgMet: 1.637 ± 0.176
ArgAsn: 2.801 ± 0.205
ArgPro: 1.262 ± 0.165
ArgGln: 1.617 ± 0.136
ArgArg: 2.209 ± 0.224
ArgSer: 2.939 ± 0.291
ArgThr: 2.584 ± 0.231
ArgVal: 3.412 ± 0.243
ArgTrp: 0.454 ± 0.092
ArgTyr: 2.209 ± 0.223
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.629 ± 0.359
SerCys: 0.828 ± 0.176
SerAsp: 3.255 ± 0.268
SerGlu: 3.117 ± 0.273
SerPhe: 3.511 ± 0.297
SerGly: 4.162 ± 0.411
SerHis: 1.203 ± 0.144
SerIle: 5.03 ± 0.344
SerLys: 4.497 ± 0.332
SerLeu: 5.681 ± 0.372
SerMet: 1.598 ± 0.197
SerAsn: 3.57 ± 0.336
SerPro: 2.15 ± 0.215
SerGln: 1.539 ± 0.169
SerArg: 3.057 ± 0.292
SerSer: 6.115 ± 0.845
SerThr: 4.103 ± 0.357
SerVal: 4.123 ± 0.3
SerTrp: 1.026 ± 0.151
SerTyr: 2.604 ± 0.205
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.3 ± 0.492
ThrCys: 0.572 ± 0.12
ThrAsp: 3.373 ± 0.273
ThrGlu: 4.458 ± 0.277
ThrPhe: 3.097 ± 0.289
ThrGly: 4.32 ± 0.348
ThrHis: 1.026 ± 0.154
ThrIle: 4.557 ± 0.341
ThrLys: 4.3 ± 0.279
ThrLeu: 5.247 ± 0.351
ThrMet: 1.124 ± 0.126
ThrAsn: 3.393 ± 0.316
ThrPro: 3.294 ± 0.296
ThrGln: 1.913 ± 0.195
ThrArg: 2.643 ± 0.246
ThrSer: 3.807 ± 0.347
ThrThr: 4.024 ± 0.436
ThrVal: 5.109 ± 0.341
ThrTrp: 0.631 ± 0.127
ThrTyr: 2.9 ± 0.219
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.892 ± 0.307
ValCys: 0.651 ± 0.116
ValAsp: 4.004 ± 0.291
ValGlu: 4.754 ± 0.424
ValPhe: 2.998 ± 0.242
ValGly: 3.629 ± 0.297
ValHis: 1.519 ± 0.187
ValIle: 4.418 ± 0.287
ValLys: 4.912 ± 0.37
ValLeu: 5.602 ± 0.415
ValMet: 1.716 ± 0.199
ValAsn: 4.024 ± 0.267
ValPro: 2.722 ± 0.285
ValGln: 2.663 ± 0.183
ValArg: 3.353 ± 0.262
ValSer: 4.241 ± 0.294
ValThr: 5.188 ± 0.343
ValVal: 4.202 ± 0.311
ValTrp: 0.572 ± 0.1
ValTyr: 2.919 ± 0.273
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.572 ± 0.134
TrpCys: 0.276 ± 0.067
TrpAsp: 0.927 ± 0.179
TrpGlu: 0.769 ± 0.108
TrpPhe: 0.493 ± 0.109
TrpGly: 0.552 ± 0.1
TrpHis: 0.217 ± 0.068
TrpIle: 0.789 ± 0.109
TrpLys: 0.828 ± 0.12
TrpLeu: 1.085 ± 0.175
TrpMet: 0.178 ± 0.056
TrpAsn: 0.967 ± 0.14
TrpPro: 0.0 ± 0.0
TrpGln: 0.533 ± 0.102
TrpArg: 0.375 ± 0.09
TrpSer: 0.848 ± 0.147
TrpThr: 0.611 ± 0.119
TrpVal: 0.828 ± 0.142
TrpTrp: 0.178 ± 0.055
TrpTyr: 0.552 ± 0.103
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.663 ± 0.28
TyrCys: 0.454 ± 0.091
TyrAsp: 2.781 ± 0.236
TyrGlu: 2.13 ± 0.219
TyrPhe: 1.539 ± 0.177
TyrGly: 2.564 ± 0.242
TyrHis: 1.105 ± 0.153
TyrIle: 2.86 ± 0.232
TyrLys: 3.432 ± 0.282
TyrLeu: 3.886 ± 0.251
TyrMet: 1.065 ± 0.147
TyrAsn: 2.702 ± 0.23
TyrPro: 1.46 ± 0.174
TyrGln: 1.42 ± 0.187
TyrArg: 2.051 ± 0.196
TyrSer: 2.545 ± 0.212
TyrThr: 3.215 ± 0.258
TyrVal: 2.683 ± 0.237
TyrTrp: 0.493 ± 0.092
TyrTyr: 1.677 ± 0.188
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 239 proteins (50697 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
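A protein-level bootstrap of this kind can be sketched as follows. The page does not describe the procedure beyond the note above, so everything here is an assumption made for illustration: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The function names, the fixed seed, and the normalization by the number of residue pairs are hypothetical choices.

```python
import random
import statistics


def pair_count(seq, dipeptide):
    """Count overlapping occurrences of a two-letter dipeptide in seq."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == dipeptide)


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation (in per mille) of a dipeptide frequency
    over bootstrap replicates drawn at the protein level."""
    rng = random.Random(seed)
    # Precompute (hits, number of pairs) once per protein.
    per_protein = [(pair_count(s, dipeptide), max(len(s) - 1, 0)) for s in proteins]
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than single residue pairs keeps the pairs within one protein together, so the estimate reflects variation between proteins.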

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski