Amino acid dipepetide frequency for Bacillus phage PBC2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.02AlaAla: 0.02 ± 0.02
0.443AlaCys: 0.443 ± 0.091
3.685AlaAsp: 3.685 ± 0.28
4.51AlaGlu: 4.51 ± 0.304
2.175AlaPhe: 2.175 ± 0.22
3.222AlaGly: 3.222 ± 0.4
0.725AlaHis: 0.725 ± 0.106
4.289AlaIle: 4.289 ± 0.366
5.417AlaLys: 5.417 ± 0.503
4.571AlaLeu: 4.571 ± 0.312
1.993AlaMet: 1.993 ± 0.281
3.423AlaAsn: 3.423 ± 0.303
1.49AlaPro: 1.49 ± 0.157
2.618AlaGln: 2.618 ± 0.49
2.014AlaArg: 2.014 ± 0.234
3.685AlaSer: 3.685 ± 0.492
3.725AlaThr: 3.725 ± 0.391
3.987AlaVal: 3.987 ± 0.336
0.564AlaTrp: 0.564 ± 0.095
2.819AlaTyr: 2.819 ± 0.257
0.0AlaXaa: 0.0 ± 0.0
Cys
0.342CysAla: 0.342 ± 0.075
0.141CysCys: 0.141 ± 0.062
0.725CysAsp: 0.725 ± 0.15
0.685CysGlu: 0.685 ± 0.119
0.322CysPhe: 0.322 ± 0.085
0.725CysGly: 0.725 ± 0.147
0.242CysHis: 0.242 ± 0.084
0.584CysIle: 0.584 ± 0.112
0.946CysLys: 0.946 ± 0.158
0.342CysLeu: 0.342 ± 0.084
0.282CysMet: 0.282 ± 0.073
0.463CysAsn: 0.463 ± 0.094
0.302CysPro: 0.302 ± 0.087
0.221CysGln: 0.221 ± 0.061
0.362CysArg: 0.362 ± 0.087
0.342CysSer: 0.342 ± 0.089
0.221CysThr: 0.221 ± 0.082
0.785CysVal: 0.785 ± 0.143
0.101CysTrp: 0.101 ± 0.045
0.443CysTyr: 0.443 ± 0.102
0.0CysXaa: 0.0 ± 0.0
Asp
3.906AspAla: 3.906 ± 0.337
0.664AspCys: 0.664 ± 0.128
3.846AspAsp: 3.846 ± 0.317
5.658AspGlu: 5.658 ± 0.33
3.322AspPhe: 3.322 ± 0.254
5.517AspGly: 5.517 ± 0.37
0.644AspHis: 0.644 ± 0.105
5.276AspIle: 5.276 ± 0.357
6.182AspLys: 6.182 ± 0.333
5.034AspLeu: 5.034 ± 0.31
2.054AspMet: 2.054 ± 0.247
3.745AspAsn: 3.745 ± 0.265
1.087AspPro: 1.087 ± 0.151
0.866AspGln: 0.866 ± 0.144
2.235AspArg: 2.235 ± 0.234
3.242AspSer: 3.242 ± 0.242
3.524AspThr: 3.524 ± 0.257
4.813AspVal: 4.813 ± 0.35
0.866AspTrp: 0.866 ± 0.153
3.463AspTyr: 3.463 ± 0.279
0.0AspXaa: 0.0 ± 0.0
Glu
4.229GluAla: 4.229 ± 0.371
0.705GluCys: 0.705 ± 0.119
5.779GluAsp: 5.779 ± 0.434
7.45GluGlu: 7.45 ± 0.46
3.403GluPhe: 3.403 ± 0.256
4.007GluGly: 4.007 ± 0.296
1.832GluHis: 1.832 ± 0.233
6.605GluIle: 6.605 ± 0.358
6.887GluLys: 6.887 ± 0.399
7.37GluLeu: 7.37 ± 0.463
2.779GluMet: 2.779 ± 0.235
4.249GluAsn: 4.249 ± 0.322
1.51GluPro: 1.51 ± 0.202
3.343GluGln: 3.343 ± 0.296
3.161GluArg: 3.161 ± 0.278
3.866GluSer: 3.866 ± 0.298
4.027GluThr: 4.027 ± 0.323
5.477GluVal: 5.477 ± 0.327
1.188GluTrp: 1.188 ± 0.17
3.443GluTyr: 3.443 ± 0.363
0.0GluXaa: 0.0 ± 0.0
Phe
2.336PheAla: 2.336 ± 0.243
0.262PheCys: 0.262 ± 0.07
3.081PheAsp: 3.081 ± 0.249
3.987PheGlu: 3.987 ± 0.319
1.832PhePhe: 1.832 ± 0.249
2.98PheGly: 2.98 ± 0.231
0.725PheHis: 0.725 ± 0.128
2.759PheIle: 2.759 ± 0.244
3.806PheLys: 3.806 ± 0.234
2.96PheLeu: 2.96 ± 0.259
1.309PheMet: 1.309 ± 0.195
2.739PheAsn: 2.739 ± 0.249
0.805PhePro: 0.805 ± 0.138
1.027PheGln: 1.027 ± 0.133
1.812PheArg: 1.812 ± 0.22
2.316PheSer: 2.316 ± 0.245
3.101PheThr: 3.101 ± 0.259
2.316PheVal: 2.316 ± 0.232
0.664PheTrp: 0.664 ± 0.135
1.571PheTyr: 1.571 ± 0.203
0.0PheXaa: 0.0 ± 0.0
Gly
3.826GlyAla: 3.826 ± 0.376
0.644GlyCys: 0.644 ± 0.101
3.725GlyAsp: 3.725 ± 0.26
4.853GlyGlu: 4.853 ± 0.302
2.879GlyPhe: 2.879 ± 0.221
3.564GlyGly: 3.564 ± 0.335
1.289GlyHis: 1.289 ± 0.168
3.765GlyIle: 3.765 ± 0.293
6.242GlyLys: 6.242 ± 0.35
4.651GlyLeu: 4.651 ± 0.383
1.812GlyMet: 1.812 ± 0.195
3.947GlyAsn: 3.947 ± 0.36
0.0GlyPro: 0.0 ± 0.0
2.255GlyGln: 2.255 ± 0.295
2.658GlyArg: 2.658 ± 0.24
3.222GlySer: 3.222 ± 0.262
3.705GlyThr: 3.705 ± 0.377
5.517GlyVal: 5.517 ± 0.35
0.906GlyTrp: 0.906 ± 0.17
3.161GlyTyr: 3.161 ± 0.211
0.0GlyXaa: 0.0 ± 0.0
His
0.967HisAla: 0.967 ± 0.165
0.101HisCys: 0.101 ± 0.046
1.369HisAsp: 1.369 ± 0.179
1.289HisGlu: 1.289 ± 0.161
0.745HisPhe: 0.745 ± 0.128
1.269HisGly: 1.269 ± 0.131
0.403HisHis: 0.403 ± 0.1
1.389HisIle: 1.389 ± 0.174
1.55HisLys: 1.55 ± 0.184
1.772HisLeu: 1.772 ± 0.175
0.604HisMet: 0.604 ± 0.115
1.208HisAsn: 1.208 ± 0.141
0.463HisPro: 0.463 ± 0.1
0.362HisGln: 0.362 ± 0.101
0.604HisArg: 0.604 ± 0.11
1.248HisSer: 1.248 ± 0.173
0.926HisThr: 0.926 ± 0.147
1.269HisVal: 1.269 ± 0.213
0.141HisTrp: 0.141 ± 0.06
0.967HisTyr: 0.967 ± 0.14
0.0HisXaa: 0.0 ± 0.0
Ile
4.088IleAla: 4.088 ± 0.286
0.564IleCys: 0.564 ± 0.106
5.799IleAsp: 5.799 ± 0.403
5.92IleGlu: 5.92 ± 0.313
2.336IlePhe: 2.336 ± 0.23
4.229IleGly: 4.229 ± 0.254
1.389IleHis: 1.389 ± 0.189
4.752IleIle: 4.752 ± 0.435
6.504IleLys: 6.504 ± 0.422
5.014IleLeu: 5.014 ± 0.336
1.832IleMet: 1.832 ± 0.195
4.128IleAsn: 4.128 ± 0.277
1.873IlePro: 1.873 ± 0.211
2.054IleGln: 2.054 ± 0.205
2.678IleArg: 2.678 ± 0.214
4.349IleSer: 4.349 ± 0.305
4.168IleThr: 4.168 ± 0.296
5.054IleVal: 5.054 ± 0.361
0.624IleTrp: 0.624 ± 0.108
2.839IleTyr: 2.839 ± 0.236
0.0IleXaa: 0.0 ± 0.0
Lys
5.537LysAla: 5.537 ± 0.413
1.047LysCys: 1.047 ± 0.174
5.598LysAsp: 5.598 ± 0.336
8.638LysGlu: 8.638 ± 0.455
3.745LysPhe: 3.745 ± 0.252
5.578LysGly: 5.578 ± 0.302
1.792LysHis: 1.792 ± 0.193
6.121LysIle: 6.121 ± 0.412
8.417LysLys: 8.417 ± 0.678
7.108LysLeu: 7.108 ± 0.36
3.242LysMet: 3.242 ± 0.266
4.833LysAsn: 4.833 ± 0.28
2.034LysPro: 2.034 ± 0.194
3.423LysGln: 3.423 ± 0.25
3.645LysArg: 3.645 ± 0.293
4.994LysSer: 4.994 ± 0.346
4.43LysThr: 4.43 ± 0.31
6.182LysVal: 6.182 ± 0.353
1.027LysTrp: 1.027 ± 0.144
4.027LysTyr: 4.027 ± 0.23
0.0LysXaa: 0.0 ± 0.0
Leu
4.591LeuAla: 4.591 ± 0.351
0.584LeuCys: 0.584 ± 0.109
5.316LeuAsp: 5.316 ± 0.336
6.162LeuGlu: 6.162 ± 0.327
2.96LeuPhe: 2.96 ± 0.273
4.712LeuGly: 4.712 ± 0.369
1.712LeuHis: 1.712 ± 0.183
4.45LeuIle: 4.45 ± 0.284
6.725LeuLys: 6.725 ± 0.382
6.142LeuLeu: 6.142 ± 0.503
2.134LeuMet: 2.134 ± 0.257
4.45LeuAsn: 4.45 ± 0.341
2.175LeuPro: 2.175 ± 0.223
3.0LeuGln: 3.0 ± 0.282
3.383LeuArg: 3.383 ± 0.274
4.631LeuSer: 4.631 ± 0.295
4.37LeuThr: 4.37 ± 0.319
5.014LeuVal: 5.014 ± 0.343
1.128LeuTrp: 1.128 ± 0.183
3.504LeuTyr: 3.504 ± 0.334
0.0LeuXaa: 0.0 ± 0.0
Met
2.074MetAla: 2.074 ± 0.214
0.201MetCys: 0.201 ± 0.074
1.329MetAsp: 1.329 ± 0.165
2.155MetGlu: 2.155 ± 0.208
1.107MetPhe: 1.107 ± 0.127
1.591MetGly: 1.591 ± 0.172
0.544MetHis: 0.544 ± 0.096
2.155MetIle: 2.155 ± 0.181
3.202MetLys: 3.202 ± 0.261
2.114MetLeu: 2.114 ± 0.188
0.745MetMet: 0.745 ± 0.127
1.993MetAsn: 1.993 ± 0.18
0.705MetPro: 0.705 ± 0.146
1.228MetGln: 1.228 ± 0.141
1.47MetArg: 1.47 ± 0.162
2.658MetSer: 2.658 ± 0.367
1.913MetThr: 1.913 ± 0.179
1.41MetVal: 1.41 ± 0.166
0.322MetTrp: 0.322 ± 0.078
1.752MetTyr: 1.752 ± 0.207
0.0MetXaa: 0.0 ± 0.0
Asn
3.524AsnAla: 3.524 ± 0.453
0.383AsnCys: 0.383 ± 0.109
3.846AsnAsp: 3.846 ± 0.274
4.249AsnGlu: 4.249 ± 0.384
1.993AsnPhe: 1.993 ± 0.209
5.054AsnGly: 5.054 ± 0.402
1.148AsnHis: 1.148 ± 0.173
3.906AsnIle: 3.906 ± 0.255
4.893AsnLys: 4.893 ± 0.372
4.047AsnLeu: 4.047 ± 0.323
1.973AsnMet: 1.973 ± 0.176
3.363AsnAsn: 3.363 ± 0.287
1.812AsnPro: 1.812 ± 0.235
1.772AsnGln: 1.772 ± 0.182
2.457AsnArg: 2.457 ± 0.198
3.182AsnSer: 3.182 ± 0.303
3.443AsnThr: 3.443 ± 0.295
3.645AsnVal: 3.645 ± 0.291
0.584AsnTrp: 0.584 ± 0.117
2.819AsnTyr: 2.819 ± 0.236
0.0AsnXaa: 0.0 ± 0.0
Pro
1.309ProAla: 1.309 ± 0.18
0.221ProCys: 0.221 ± 0.074
1.329ProAsp: 1.329 ± 0.152
2.215ProGlu: 2.215 ± 0.202
1.047ProPhe: 1.047 ± 0.143
0.0ProGly: 0.0 ± 0.0
0.423ProHis: 0.423 ± 0.105
1.671ProIle: 1.671 ± 0.199
1.933ProLys: 1.933 ± 0.212
1.55ProLeu: 1.55 ± 0.185
0.644ProMet: 0.644 ± 0.116
1.329ProAsn: 1.329 ± 0.181
0.503ProPro: 0.503 ± 0.112
0.946ProGln: 0.946 ± 0.126
0.785ProArg: 0.785 ± 0.14
1.772ProSer: 1.772 ± 0.22
1.772ProThr: 1.772 ± 0.21
1.47ProVal: 1.47 ± 0.184
0.221ProTrp: 0.221 ± 0.082
1.128ProTyr: 1.128 ± 0.162
0.0ProXaa: 0.0 ± 0.0
Gln
2.316GlnAla: 2.316 ± 0.642
0.262GlnCys: 0.262 ± 0.069
1.651GlnAsp: 1.651 ± 0.195
3.041GlnGlu: 3.041 ± 0.288
1.269GlnPhe: 1.269 ± 0.158
1.772GlnGly: 1.772 ± 0.238
0.584GlnHis: 0.584 ± 0.112
2.537GlnIle: 2.537 ± 0.221
2.799GlnLys: 2.799 ± 0.312
3.161GlnLeu: 3.161 ± 0.267
1.087GlnMet: 1.087 ± 0.181
1.712GlnAsn: 1.712 ± 0.182
0.685GlnPro: 0.685 ± 0.128
1.248GlnGln: 1.248 ± 0.331
1.853GlnArg: 1.853 ± 0.159
1.812GlnSer: 1.812 ± 0.242
1.772GlnThr: 1.772 ± 0.203
1.812GlnVal: 1.812 ± 0.225
0.423GlnTrp: 0.423 ± 0.095
1.832GlnTyr: 1.832 ± 0.191
0.0GlnXaa: 0.0 ± 0.0
Arg
1.873ArgAla: 1.873 ± 0.201
0.443ArgCys: 0.443 ± 0.084
2.255ArgAsp: 2.255 ± 0.23
3.242ArgGlu: 3.242 ± 0.266
2.316ArgPhe: 2.316 ± 0.199
2.134ArgGly: 2.134 ± 0.185
0.805ArgHis: 0.805 ± 0.128
3.0ArgIle: 3.0 ± 0.233
3.786ArgLys: 3.786 ± 0.276
3.141ArgLeu: 3.141 ± 0.274
1.41ArgMet: 1.41 ± 0.167
2.678ArgAsn: 2.678 ± 0.22
0.906ArgPro: 0.906 ± 0.133
1.47ArgGln: 1.47 ± 0.17
1.631ArgArg: 1.631 ± 0.204
1.591ArgSer: 1.591 ± 0.147
2.034ArgThr: 2.034 ± 0.191
2.839ArgVal: 2.839 ± 0.201
0.685ArgTrp: 0.685 ± 0.115
1.732ArgTyr: 1.732 ± 0.164
0.0ArgXaa: 0.0 ± 0.0
Ser
3.524SerAla: 3.524 ± 0.531
0.544SerCys: 0.544 ± 0.127
3.806SerAsp: 3.806 ± 0.294
3.363SerGlu: 3.363 ± 0.267
2.859SerPhe: 2.859 ± 0.218
3.886SerGly: 3.886 ± 0.353
1.067SerHis: 1.067 ± 0.141
4.168SerIle: 4.168 ± 0.312
5.336SerLys: 5.336 ± 0.321
4.551SerLeu: 4.551 ± 0.33
1.691SerMet: 1.691 ± 0.211
3.121SerAsn: 3.121 ± 0.248
1.309SerPro: 1.309 ± 0.171
2.074SerGln: 2.074 ± 0.292
1.893SerArg: 1.893 ± 0.191
3.967SerSer: 3.967 ± 0.384
3.242SerThr: 3.242 ± 0.257
3.262SerVal: 3.262 ± 0.221
0.805SerTrp: 0.805 ± 0.144
2.718SerTyr: 2.718 ± 0.268
0.0SerXaa: 0.0 ± 0.0
Thr
3.886ThrAla: 3.886 ± 0.428
0.322ThrCys: 0.322 ± 0.076
3.222ThrAsp: 3.222 ± 0.31
3.645ThrGlu: 3.645 ± 0.266
3.262ThrPhe: 3.262 ± 0.279
4.168ThrGly: 4.168 ± 0.347
1.027ThrHis: 1.027 ± 0.152
4.37ThrIle: 4.37 ± 0.296
5.034ThrLys: 5.034 ± 0.315
4.45ThrLeu: 4.45 ± 0.338
1.51ThrMet: 1.51 ± 0.206
2.98ThrAsn: 2.98 ± 0.27
1.873ThrPro: 1.873 ± 0.247
1.691ThrGln: 1.691 ± 0.227
2.114ThrArg: 2.114 ± 0.221
2.819ThrSer: 2.819 ± 0.23
3.665ThrThr: 3.665 ± 0.371
3.987ThrVal: 3.987 ± 0.293
0.745ThrTrp: 0.745 ± 0.127
2.9ThrTyr: 2.9 ± 0.29
0.0ThrXaa: 0.0 ± 0.0
Val
4.249ValAla: 4.249 ± 0.253
0.503ValCys: 0.503 ± 0.106
4.752ValAsp: 4.752 ± 0.302
6.041ValGlu: 6.041 ± 0.324
2.839ValPhe: 2.839 ± 0.242
4.148ValGly: 4.148 ± 0.302
1.007ValHis: 1.007 ± 0.157
4.168ValIle: 4.168 ± 0.298
6.202ValLys: 6.202 ± 0.328
4.692ValLeu: 4.692 ± 0.351
1.893ValMet: 1.893 ± 0.206
4.229ValAsn: 4.229 ± 0.253
1.45ValPro: 1.45 ± 0.184
2.175ValGln: 2.175 ± 0.235
2.718ValArg: 2.718 ± 0.226
3.846ValSer: 3.846 ± 0.278
4.088ValThr: 4.088 ± 0.3
4.933ValVal: 4.933 ± 0.332
0.725ValTrp: 0.725 ± 0.129
3.0ValTyr: 3.0 ± 0.297
0.0ValXaa: 0.0 ± 0.0
Trp
0.483TrpAla: 0.483 ± 0.118
0.201TrpCys: 0.201 ± 0.057
1.007TrpAsp: 1.007 ± 0.162
0.826TrpGlu: 0.826 ± 0.154
0.342TrpPhe: 0.342 ± 0.084
0.604TrpGly: 0.604 ± 0.123
0.463TrpHis: 0.463 ± 0.098
0.725TrpIle: 0.725 ± 0.123
1.128TrpLys: 1.128 ± 0.178
0.987TrpLeu: 0.987 ± 0.135
0.483TrpMet: 0.483 ± 0.096
0.705TrpAsn: 0.705 ± 0.122
0.0TrpPro: 0.0 ± 0.0
0.362TrpGln: 0.362 ± 0.082
0.624TrpArg: 0.624 ± 0.115
1.027TrpSer: 1.027 ± 0.161
0.805TrpThr: 0.805 ± 0.15
0.705TrpVal: 0.705 ± 0.141
0.201TrpTrp: 0.201 ± 0.055
0.765TrpTyr: 0.765 ± 0.118
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.296TyrAla: 2.296 ± 0.189
0.342TyrCys: 0.342 ± 0.081
3.826TyrAsp: 3.826 ± 0.307
3.383TyrGlu: 3.383 ± 0.259
1.631TyrPhe: 1.631 ± 0.208
3.463TyrGly: 3.463 ± 0.278
0.826TyrHis: 0.826 ± 0.147
3.504TyrIle: 3.504 ± 0.298
4.43TyrLys: 4.43 ± 0.301
3.484TyrLeu: 3.484 ± 0.274
1.168TyrMet: 1.168 ± 0.175
2.799TyrAsn: 2.799 ± 0.219
1.289TyrPro: 1.289 ± 0.166
1.49TyrGln: 1.49 ± 0.165
1.893TyrArg: 1.893 ± 0.185
2.618TyrSer: 2.618 ± 0.243
2.739TyrThr: 2.739 ± 0.243
3.182TyrVal: 3.182 ± 0.291
0.524TyrTrp: 0.524 ± 0.103
1.913TyrTyr: 1.913 ± 0.223
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 251 proteins (49663 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski