Amino acid dipeptide frequency for Bacillus phage vB_BsuM-Goe10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
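
As an illustration of what a dipeptide frequency is, here is a minimal Python sketch. It rests on several assumptions that the page does not confirm: sequences are given as one-letter strings (the table uses three-letter codes), pairs are counted with overlap, pairs never span two proteins, and frequencies are normalised by the total number of counted pairs. The function name `dipeptide_frequencies` and the toy sequences are hypothetical.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of adjacent residue pairs.

    `proteins` is an iterable of one-letter amino acid strings. Pairs are
    counted with overlap and never across protein boundaries (an assumption).
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every adjacent (first, second) residue pair.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the phage proteome.
freqs = dipeptide_frequencies(["MKKLA", "MKA"])
```

With the toy input, six pairs are counted in total and "MK" occurs twice, so it accounts for one third of all pairs.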

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red and all under-represented ones in blue in the table.
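
The arithmetic behind the expected value can be sketched as follows. This assumes the comparison is made against a uniform expectation, as the paragraph above suggests; the exact rule the site uses for colouring is not stated, and the function names are hypothetical.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(value_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if value_permille > expected_permille:
        return "over"
    if value_permille < expected_permille:
        return "under"
    return "equal"

expected_20 = uniform_expectation_permille(20)  # 2.5 per mille, i.e. 0.25 %
expected_21 = uniform_expectation_permille(21)  # about 2.268 per mille (with Xaa)

# Values copied from the table (per mille).
table_values = {"GluGlu": 8.579, "AlaAla": 2.365, "TrpPro": 0.025}
labels = {pair: classify(v, expected_20) for pair, v in table_values.items()}
```

Under this assumption GluGlu is over-represented, while AlaAla and TrpPro fall below the 2.5‰ expectation.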

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
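
A short sketch of the unit conversion, using the LeuGlu value from the table; the helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (1 % = 10 per mille)."""
    return value_permille / 10.0

leu_glu = 7.975                            # LeuGlu, per mille, from the table
fraction = permille_to_fraction(leu_glu)   # about 0.007975
percent = permille_to_percent(leu_glu)     # about 0.7975 %
```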

For more information, see the sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 2.365 ± 0.282
AlaCys: 0.478 ± 0.092
AlaAsp: 3.321 ± 0.291
AlaGlu: 3.497 ± 0.353
AlaPhe: 2.289 ± 0.28
AlaGly: 3.824 ± 0.43
AlaHis: 0.855 ± 0.138
AlaIle: 3.648 ± 0.278
AlaLys: 4.68 ± 0.4
AlaLeu: 5.258 ± 0.393
AlaMet: 1.811 ± 0.206
AlaAsn: 2.868 ± 0.267
AlaPro: 1.862 ± 0.25
AlaGln: 1.988 ± 0.283
AlaArg: 2.516 ± 0.27
AlaSer: 3.019 ± 0.289
AlaThr: 2.793 ± 0.312
AlaVal: 3.95 ± 0.339
AlaTrp: 0.554 ± 0.102
AlaTyr: 2.44 ± 0.265
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.453 ± 0.097
CysCys: 0.101 ± 0.051
CysAsp: 0.352 ± 0.091
CysGlu: 0.981 ± 0.171
CysPhe: 0.226 ± 0.078
CysGly: 0.528 ± 0.129
CysHis: 0.226 ± 0.07
CysIle: 0.428 ± 0.123
CysLys: 0.931 ± 0.161
CysLeu: 0.629 ± 0.13
CysMet: 0.327 ± 0.079
CysAsn: 0.453 ± 0.106
CysPro: 0.327 ± 0.096
CysGln: 0.403 ± 0.076
CysArg: 0.428 ± 0.109
CysSer: 0.755 ± 0.128
CysThr: 0.654 ± 0.133
CysVal: 0.704 ± 0.143
CysTrp: 0.101 ± 0.051
CysTyr: 0.403 ± 0.103
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.22 ± 0.234
AspCys: 0.78 ± 0.126
AspAsp: 3.95 ± 0.407
AspGlu: 4.73 ± 0.395
AspPhe: 2.591 ± 0.237
AspGly: 4.68 ± 0.402
AspHis: 1.258 ± 0.174
AspIle: 4.906 ± 0.374
AspLys: 4.705 ± 0.37
AspLeu: 6.868 ± 0.465
AspMet: 1.912 ± 0.2
AspAsn: 3.849 ± 0.337
AspPro: 2.717 ± 0.278
AspGln: 2.139 ± 0.263
AspArg: 2.969 ± 0.29
AspSer: 3.95 ± 0.359
AspThr: 4.353 ± 0.307
AspVal: 4.554 ± 0.333
AspTrp: 0.881 ± 0.163
AspTyr: 2.843 ± 0.325
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.428 ± 0.368
GluCys: 0.604 ± 0.134
GluAsp: 7.246 ± 0.653
GluGlu: 8.579 ± 0.928
GluPhe: 2.717 ± 0.224
GluGly: 6.34 ± 0.456
GluHis: 1.308 ± 0.184
GluIle: 4.906 ± 0.305
GluLys: 6.265 ± 0.528
GluLeu: 6.265 ± 0.464
GluMet: 2.139 ± 0.2
GluAsn: 3.799 ± 0.339
GluPro: 2.189 ± 0.266
GluGln: 2.843 ± 0.246
GluArg: 3.271 ± 0.278
GluSer: 4.277 ± 0.35
GluThr: 3.145 ± 0.29
GluVal: 6.466 ± 0.488
GluTrp: 0.981 ± 0.156
GluTyr: 3.22 ± 0.3
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.635 ± 0.218
PheCys: 0.226 ± 0.079
PheAsp: 2.466 ± 0.262
PheGlu: 2.591 ± 0.25
PhePhe: 1.082 ± 0.173
PheGly: 2.39 ± 0.293
PheHis: 0.704 ± 0.129
PheIle: 2.164 ± 0.25
PheLys: 2.768 ± 0.298
PheLeu: 3.019 ± 0.284
PheMet: 1.208 ± 0.211
PheAsn: 1.887 ± 0.236
PhePro: 1.132 ± 0.156
PheGln: 0.906 ± 0.171
PheArg: 2.063 ± 0.218
PheSer: 2.742 ± 0.274
PheThr: 2.013 ± 0.271
PheVal: 2.264 ± 0.227
PheTrp: 0.377 ± 0.106
PheTyr: 1.359 ± 0.185
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.95 ± 0.486
GlyCys: 0.73 ± 0.141
GlyAsp: 4.554 ± 0.327
GlyGlu: 5.158 ± 0.332
GlyPhe: 2.591 ± 0.283
GlyGly: 4.856 ± 0.515
GlyHis: 0.855 ± 0.154
GlyIle: 4.73 ± 0.365
GlyLys: 4.604 ± 0.381
GlyLeu: 5.535 ± 0.486
GlyMet: 1.962 ± 0.203
GlyAsn: 3.422 ± 0.382
GlyPro: 1.107 ± 0.182
GlyGln: 1.837 ± 0.221
GlyArg: 2.944 ± 0.228
GlySer: 4.604 ± 0.417
GlyThr: 4.277 ± 0.346
GlyVal: 5.46 ± 0.408
GlyTrp: 0.805 ± 0.137
GlyTyr: 3.547 ± 0.312
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.704 ± 0.128
HisCys: 0.176 ± 0.066
HisAsp: 0.981 ± 0.157
HisGlu: 1.082 ± 0.165
HisPhe: 0.679 ± 0.128
HisGly: 1.333 ± 0.239
HisHis: 0.554 ± 0.115
HisIle: 1.51 ± 0.212
HisLys: 1.61 ± 0.224
HisLeu: 1.535 ± 0.176
HisMet: 0.629 ± 0.118
HisAsn: 0.855 ± 0.153
HisPro: 0.78 ± 0.119
HisGln: 0.428 ± 0.107
HisArg: 1.132 ± 0.152
HisSer: 1.157 ± 0.157
HisThr: 1.132 ± 0.163
HisVal: 1.308 ± 0.16
HisTrp: 0.327 ± 0.097
HisTyr: 0.956 ± 0.202
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.522 ± 0.233
IleCys: 0.554 ± 0.117
IleAsp: 4.629 ± 0.315
IleGlu: 5.761 ± 0.394
IlePhe: 1.661 ± 0.201
IleGly: 3.95 ± 0.318
IleHis: 1.51 ± 0.206
IleIle: 4.403 ± 0.437
IleLys: 5.938 ± 0.372
IleLeu: 4.654 ± 0.3
IleMet: 1.635 ± 0.185
IleAsn: 3.321 ± 0.267
IlePro: 2.516 ± 0.263
IleGln: 2.491 ± 0.292
IleArg: 2.768 ± 0.278
IleSer: 3.799 ± 0.347
IleThr: 4.227 ± 0.344
IleVal: 3.824 ± 0.286
IleTrp: 0.352 ± 0.091
IleTyr: 2.113 ± 0.278
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.831 ± 0.469
LysCys: 0.528 ± 0.122
LysAsp: 5.711 ± 0.374
LysGlu: 7.397 ± 0.663
LysPhe: 2.541 ± 0.307
LysGly: 5.46 ± 0.418
LysHis: 1.736 ± 0.257
LysIle: 4.0 ± 0.322
LysLys: 7.346 ± 0.676
LysLeu: 5.711 ± 0.356
LysMet: 2.264 ± 0.246
LysAsn: 3.522 ± 0.349
LysPro: 2.692 ± 0.256
LysGln: 2.617 ± 0.264
LysArg: 3.724 ± 0.367
LysSer: 4.705 ± 0.424
LysThr: 4.0 ± 0.254
LysVal: 6.466 ± 0.466
LysTrp: 0.981 ± 0.172
LysTyr: 3.774 ± 0.287
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.654 ± 0.331
LeuCys: 0.704 ± 0.122
LeuAsp: 6.39 ± 0.387
LeuGlu: 7.975 ± 0.524
LeuPhe: 3.095 ± 0.239
LeuGly: 5.057 ± 0.325
LeuHis: 1.459 ± 0.191
LeuIle: 4.202 ± 0.312
LeuLys: 5.963 ± 0.376
LeuLeu: 6.189 ± 0.46
LeuMet: 2.088 ± 0.237
LeuAsn: 3.371 ± 0.256
LeuPro: 2.843 ± 0.256
LeuGln: 2.944 ± 0.316
LeuArg: 3.925 ± 0.308
LeuSer: 6.139 ± 0.409
LeuThr: 4.428 ± 0.35
LeuVal: 5.158 ± 0.378
LeuTrp: 0.855 ± 0.153
LeuTyr: 3.22 ± 0.271
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.088 ± 0.271
MetCys: 0.226 ± 0.073
MetAsp: 1.937 ± 0.243
MetGlu: 2.34 ± 0.263
MetPhe: 0.73 ± 0.141
MetGly: 1.811 ± 0.242
MetHis: 0.352 ± 0.093
MetIle: 1.661 ± 0.208
MetLys: 2.315 ± 0.29
MetLeu: 1.686 ± 0.194
MetMet: 0.679 ± 0.104
MetAsn: 1.635 ± 0.207
MetPro: 0.855 ± 0.117
MetGln: 0.881 ± 0.162
MetArg: 1.359 ± 0.196
MetSer: 1.862 ± 0.208
MetThr: 1.635 ± 0.212
MetVal: 1.635 ± 0.197
MetTrp: 0.277 ± 0.081
MetTyr: 1.208 ± 0.152
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.516 ± 0.281
AsnCys: 0.679 ± 0.13
AsnAsp: 2.365 ± 0.271
AsnGlu: 2.944 ± 0.233
AsnPhe: 2.063 ± 0.251
AsnGly: 3.724 ± 0.349
AsnHis: 1.032 ± 0.193
AsnIle: 3.371 ± 0.319
AsnLys: 4.176 ± 0.347
AsnLeu: 3.698 ± 0.321
AsnMet: 1.384 ± 0.18
AsnAsn: 2.918 ± 0.282
AsnPro: 2.415 ± 0.271
AsnGln: 1.61 ± 0.219
AsnArg: 2.667 ± 0.273
AsnSer: 3.447 ± 0.255
AsnThr: 3.799 ± 0.576
AsnVal: 3.321 ± 0.326
AsnTrp: 0.428 ± 0.098
AsnTyr: 1.988 ± 0.232
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.384 ± 0.189
ProCys: 0.252 ± 0.074
ProAsp: 2.264 ± 0.238
ProGlu: 3.195 ± 0.289
ProPhe: 1.258 ± 0.207
ProGly: 1.686 ± 0.193
ProHis: 0.352 ± 0.095
ProIle: 2.189 ± 0.233
ProLys: 3.271 ± 0.338
ProLeu: 2.365 ± 0.212
ProMet: 0.679 ± 0.15
ProAsn: 1.61 ± 0.239
ProPro: 1.057 ± 0.197
ProGln: 1.032 ± 0.164
ProArg: 1.308 ± 0.17
ProSer: 2.516 ± 0.272
ProThr: 2.34 ± 0.209
ProVal: 2.642 ± 0.308
ProTrp: 0.302 ± 0.086
ProTyr: 1.711 ± 0.234
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.988 ± 0.225
GlnCys: 0.428 ± 0.098
GlnAsp: 2.189 ± 0.244
GlnGlu: 2.516 ± 0.29
GlnPhe: 1.006 ± 0.159
GlnGly: 2.063 ± 0.216
GlnHis: 0.78 ± 0.14
GlnIle: 2.415 ± 0.231
GlnLys: 2.44 ± 0.261
GlnLeu: 3.019 ± 0.326
GlnMet: 1.032 ± 0.148
GlnAsn: 1.51 ± 0.192
GlnPro: 0.881 ± 0.149
GlnGln: 1.258 ± 0.211
GlnArg: 1.459 ± 0.186
GlnSer: 2.214 ± 0.22
GlnThr: 1.711 ± 0.238
GlnVal: 2.541 ± 0.227
GlnTrp: 0.403 ± 0.109
GlnTyr: 1.912 ± 0.228
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.44 ± 0.219
ArgCys: 0.428 ± 0.104
ArgAsp: 2.793 ± 0.237
ArgGlu: 3.195 ± 0.29
ArgPhe: 1.585 ± 0.203
ArgGly: 2.918 ± 0.269
ArgHis: 0.604 ± 0.13
ArgIle: 2.843 ± 0.245
ArgLys: 4.302 ± 0.377
ArgLeu: 3.95 ± 0.329
ArgMet: 1.384 ± 0.213
ArgAsn: 2.617 ± 0.289
ArgPro: 1.333 ± 0.221
ArgGln: 1.761 ± 0.179
ArgArg: 2.541 ± 0.236
ArgSer: 2.591 ± 0.243
ArgThr: 2.39 ± 0.258
ArgVal: 3.9 ± 0.296
ArgTrp: 0.73 ± 0.142
ArgTyr: 1.811 ± 0.246
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.246 ± 0.308
SerCys: 0.352 ± 0.095
SerAsp: 4.126 ± 0.329
SerGlu: 4.478 ± 0.367
SerPhe: 2.415 ± 0.246
SerGly: 4.604 ± 0.336
SerHis: 1.157 ± 0.15
SerIle: 4.227 ± 0.328
SerLys: 5.183 ± 0.412
SerLeu: 5.585 ± 0.399
SerMet: 1.761 ± 0.243
SerAsn: 3.396 ± 0.291
SerPro: 2.038 ± 0.243
SerGln: 2.239 ± 0.286
SerArg: 3.22 ± 0.28
SerSer: 3.724 ± 0.393
SerThr: 4.101 ± 0.33
SerVal: 4.654 ± 0.344
SerTrp: 0.78 ± 0.15
SerTyr: 2.944 ± 0.244
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.151 ± 0.381
ThrCys: 0.453 ± 0.114
ThrAsp: 3.824 ± 0.37
ThrGlu: 4.629 ± 0.391
ThrPhe: 2.466 ± 0.26
ThrGly: 4.629 ± 0.393
ThrHis: 1.057 ± 0.194
ThrIle: 4.403 ± 0.358
ThrLys: 4.176 ± 0.31
ThrLeu: 4.378 ± 0.36
ThrMet: 1.057 ± 0.137
ThrAsn: 2.768 ± 0.314
ThrPro: 2.365 ± 0.229
ThrGln: 2.239 ± 0.269
ThrArg: 2.088 ± 0.269
ThrSer: 3.422 ± 0.349
ThrThr: 3.547 ± 0.329
ThrVal: 4.176 ± 0.456
ThrTrp: 0.78 ± 0.171
ThrTyr: 2.516 ± 0.258
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.126 ± 0.355
ValCys: 0.78 ± 0.154
ValAsp: 4.982 ± 0.308
ValGlu: 6.189 ± 0.443
ValPhe: 1.988 ± 0.188
ValGly: 4.176 ± 0.355
ValHis: 1.56 ± 0.179
ValIle: 4.202 ± 0.331
ValLys: 5.258 ± 0.324
ValLeu: 5.711 ± 0.332
ValMet: 1.761 ± 0.207
ValAsn: 3.346 ± 0.314
ValPro: 2.918 ± 0.288
ValGln: 2.541 ± 0.254
ValArg: 3.12 ± 0.367
ValSer: 5.787 ± 0.422
ValThr: 5.485 ± 0.444
ValVal: 5.988 ± 0.515
ValTrp: 0.528 ± 0.106
ValTyr: 3.019 ± 0.267
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.579 ± 0.115
TrpCys: 0.302 ± 0.111
TrpAsp: 0.83 ± 0.156
TrpGlu: 0.906 ± 0.15
TrpPhe: 0.403 ± 0.088
TrpGly: 0.906 ± 0.152
TrpHis: 0.302 ± 0.096
TrpIle: 0.704 ± 0.129
TrpLys: 0.805 ± 0.144
TrpLeu: 0.956 ± 0.16
TrpMet: 0.277 ± 0.085
TrpAsn: 0.755 ± 0.143
TrpPro: 0.025 ± 0.028
TrpGln: 0.226 ± 0.082
TrpArg: 0.453 ± 0.096
TrpSer: 0.73 ± 0.134
TrpThr: 0.277 ± 0.086
TrpVal: 1.107 ± 0.13
TrpTrp: 0.302 ± 0.098
TrpTyr: 0.377 ± 0.077
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.736 ± 0.184
TyrCys: 0.629 ± 0.126
TyrAsp: 2.944 ± 0.217
TyrGlu: 2.843 ± 0.239
TyrPhe: 1.61 ± 0.185
TyrGly: 2.617 ± 0.251
TyrHis: 1.208 ± 0.177
TyrIle: 2.717 ± 0.275
TyrLys: 3.271 ± 0.333
TyrLeu: 3.648 ± 0.269
TyrMet: 1.157 ± 0.17
TyrAsn: 2.642 ± 0.241
TyrPro: 1.359 ± 0.187
TyrGln: 1.384 ± 0.202
TyrArg: 2.164 ± 0.228
TyrSer: 2.793 ± 0.307
TyrThr: 2.818 ± 0.219
TyrVal: 3.346 ± 0.359
TyrTrp: 0.478 ± 0.108
TyrTyr: 1.635 ± 0.201
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 195 proteins (39748 amino acids)
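
For readers who want to work with the listed values programmatically, the following is a minimal parsing sketch. It assumes each table entry contains the pattern `AlaAla: 2.365 ± 0.282` (two three-letter residue codes, a value and an error); the regular expression and the function name `parse_entry` are illustrative choices, not part of Proteome-pI.

```python
import re

# Two three-letter residue codes, a colon, then "value ± error".
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)

def parse_entry(line):
    """Return (first, second, value, error) or None if the line is not an entry."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)

# Row headings such as "Ala" do not match and are skipped.
lines = ["Ala", "AlaAla: 2.365 ± 0.282", "AlaCys: 0.478 ± 0.092"]
parsed = [entry for entry in map(parse_entry, lines) if entry is not None]
```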

Note: The error was estimated by bootstrapping (x100) at the protein level.
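
Bootstrapping at the protein level can be sketched as below. This is an illustration under stated assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread is reported as a standard deviation (the page does not say which spread measure is used). The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide (one-letter codes) in a protein set."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the pair frequency over resampled proteomes."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences, not taken from the phage proteome.
toy = ["MKKLAEE", "MAEELK", "MKEEAL", "MLKKA"]
error = bootstrap_error(toy, "EE")
```

Resampling proteins rather than individual residues keeps each sequence intact, so pairs inside one protein stay together in every resample.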

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski