Amino acid dipeptide frequency for Staphylococcus phage phiSA_BS1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.
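The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function name `dipeptide_permille`, the one-letter alphabet, the use of overlapping pairs within each protein, and the choice of the total number of counted dipeptides as the denominator are all assumptions, and the database may normalise differently.

```python
from collections import Counter
from itertools import product

# 20 standard residues plus X (Xaa) for non-standard or unknown residues
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Hypothetical helper: counts overlapping pairs inside each protein and
    divides by the total number of pairs counted (an assumption).
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        # Overlapping pairs; a pair never spans two different proteins
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    pairs = (a + b for a, b in product(ALPHABET, repeat=2))
    if total == 0:
        return {p: 0.0 for p in pairs}
    return {p: 1000.0 * counts[p] / total for p in pairs}


if __name__ == "__main__":
    freqs = dipeptide_permille(["AAC"])
    print(freqs["AA"], freqs["AC"])  # the two pairs AA and AC share the total equally
    print(1000.0 / 400)              # uniform expectation over 400 dipeptides, in per mille
```

Values above 2.5‰ would then count as overrepresented relative to the uniform expectation, and values below it as underrepresented.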

For more information, see the sequence space article on Wikipedia.

Entries are grouped by the first residue; within each group the second residue runs in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa. Each entry gives the frequency ± error in ‰.

Ala
AlaAla: 0.452 ± 0.111
AlaCys: 0.151 ± 0.051
AlaAsp: 2.059 ± 0.221
AlaGlu: 3.29 ± 0.34
AlaPhe: 1.783 ± 0.208
AlaGly: 1.883 ± 0.27
AlaHis: 0.753 ± 0.15
AlaIle: 2.963 ± 0.261
AlaLys: 4.319 ± 0.313
AlaLeu: 3.164 ± 0.325
AlaMet: 1.08 ± 0.205
AlaAsn: 1.657 ± 0.218
AlaPro: 1.356 ± 0.272
AlaGln: 1.682 ± 0.233
AlaArg: 1.582 ± 0.199
AlaSer: 2.637 ± 0.266
AlaThr: 2.36 ± 0.244
AlaVal: 2.285 ± 0.225
AlaTrp: 0.402 ± 0.118
AlaTyr: 1.934 ± 0.178
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.176 ± 0.071
CysCys: 0.0 ± 0.0
CysAsp: 0.251 ± 0.079
CysGlu: 0.452 ± 0.11
CysPhe: 0.326 ± 0.083
CysGly: 0.753 ± 0.221
CysHis: 0.075 ± 0.039
CysIle: 0.377 ± 0.093
CysLys: 0.628 ± 0.158
CysLeu: 0.552 ± 0.106
CysMet: 0.151 ± 0.057
CysAsn: 0.301 ± 0.1
CysPro: 0.402 ± 0.1
CysGln: 0.201 ± 0.077
CysArg: 0.301 ± 0.104
CysSer: 0.527 ± 0.127
CysThr: 0.402 ± 0.108
CysVal: 0.201 ± 0.065
CysTrp: 0.075 ± 0.037
CysTyr: 0.452 ± 0.106
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.235 ± 0.268
AspCys: 0.251 ± 0.073
AspAsp: 3.892 ± 0.364
AspGlu: 5.474 ± 0.423
AspPhe: 3.465 ± 0.3
AspGly: 3.164 ± 0.257
AspHis: 0.678 ± 0.125
AspIle: 7.056 ± 0.481
AspLys: 7.207 ± 0.364
AspLeu: 5.951 ± 0.373
AspMet: 1.808 ± 0.209
AspAsn: 5.198 ± 0.343
AspPro: 1.783 ± 0.23
AspGln: 1.632 ± 0.183
AspArg: 2.838 ± 0.269
AspSer: 4.093 ± 0.35
AspThr: 4.118 ± 0.306
AspVal: 3.917 ± 0.303
AspTrp: 0.502 ± 0.11
AspTyr: 4.344 ± 0.332
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.591 ± 0.325
GluCys: 0.452 ± 0.116
GluAsp: 7.358 ± 0.54
GluGlu: 10.597 ± 0.745
GluPhe: 3.541 ± 0.285
GluGly: 5.047 ± 0.401
GluHis: 1.507 ± 0.194
GluIle: 6.102 ± 0.439
GluLys: 7.282 ± 0.626
GluLeu: 7.483 ± 0.505
GluMet: 2.461 ± 0.268
GluAsn: 5.976 ± 0.366
GluPro: 2.31 ± 0.364
GluGln: 4.721 ± 0.398
GluArg: 3.415 ± 0.304
GluSer: 4.872 ± 0.352
GluThr: 4.369 ± 0.308
GluVal: 5.65 ± 0.46
GluTrp: 0.829 ± 0.139
GluTyr: 4.997 ± 0.36
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.18 ± 0.175
PheCys: 0.326 ± 0.073
PheAsp: 3.164 ± 0.255
PheGlu: 3.29 ± 0.291
PhePhe: 1.306 ± 0.157
PheGly: 2.134 ± 0.284
PheHis: 0.628 ± 0.122
PheIle: 3.038 ± 0.337
PheLys: 3.465 ± 0.234
PheLeu: 3.189 ± 0.274
PheMet: 1.13 ± 0.148
PheAsn: 3.716 ± 0.296
PhePro: 0.829 ± 0.145
PheGln: 1.331 ± 0.164
PheArg: 1.456 ± 0.181
PheSer: 2.21 ± 0.25
PheThr: 2.787 ± 0.293
PheVal: 2.586 ± 0.272
PheTrp: 0.326 ± 0.08
PheTyr: 2.109 ± 0.22
PheXaa: 0.025 ± 0.024

Gly
GlyAla: 2.16 ± 0.353
GlyCys: 0.502 ± 0.129
GlyAsp: 3.716 ± 0.408
GlyGlu: 4.746 ± 0.346
GlyPhe: 2.461 ± 0.26
GlyGly: 3.867 ± 0.874
GlyHis: 1.155 ± 0.164
GlyIle: 4.269 ± 0.369
GlyLys: 5.198 ± 0.483
GlyLeu: 4.42 ± 0.371
GlyMet: 1.682 ± 0.32
GlyAsn: 4.319 ± 0.351
GlyPro: 0.025 ± 0.024
GlyGln: 2.185 ± 0.275
GlyArg: 1.858 ± 0.224
GlySer: 3.415 ± 0.327
GlyThr: 3.541 ± 0.389
GlyVal: 3.742 ± 0.355
GlyTrp: 0.753 ± 0.137
GlyTyr: 3.214 ± 0.258
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.703 ± 0.102
HisCys: 0.251 ± 0.082
HisAsp: 0.879 ± 0.144
HisGlu: 0.904 ± 0.209
HisPhe: 0.628 ± 0.119
HisGly: 1.004 ± 0.171
HisHis: 0.301 ± 0.076
HisIle: 1.18 ± 0.172
HisLys: 1.23 ± 0.189
HisLeu: 1.682 ± 0.183
HisMet: 0.301 ± 0.08
HisAsn: 1.105 ± 0.194
HisPro: 0.527 ± 0.112
HisGln: 0.653 ± 0.123
HisArg: 0.552 ± 0.101
HisSer: 0.904 ± 0.144
HisThr: 0.879 ± 0.132
HisVal: 0.929 ± 0.162
HisTrp: 0.151 ± 0.073
HisTyr: 0.829 ± 0.14
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.888 ± 0.247
IleCys: 0.402 ± 0.102
IleAsp: 6.002 ± 0.39
IleGlu: 6.981 ± 0.534
IlePhe: 2.134 ± 0.212
IleGly: 4.018 ± 0.338
IleHis: 1.18 ± 0.178
IleIle: 4.696 ± 0.424
IleLys: 7.081 ± 0.448
IleLeu: 5.524 ± 0.44
IleMet: 1.632 ± 0.256
IleAsn: 5.449 ± 0.354
IlePro: 2.461 ± 0.286
IleGln: 2.737 ± 0.23
IleArg: 2.963 ± 0.318
IleSer: 3.942 ± 0.29
IleThr: 4.47 ± 0.308
IleVal: 4.369 ± 0.369
IleTrp: 0.352 ± 0.084
IleTyr: 3.29 ± 0.336
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.817 ± 0.334
LysCys: 0.678 ± 0.181
LysAsp: 7.157 ± 0.347
LysGlu: 10.898 ± 0.813
LysPhe: 2.838 ± 0.25
LysGly: 5.449 ± 0.527
LysHis: 1.482 ± 0.188
LysIle: 5.499 ± 0.404
LysLys: 7.282 ± 0.482
LysLeu: 6.73 ± 0.352
LysMet: 2.185 ± 0.255
LysAsn: 5.072 ± 0.358
LysPro: 2.687 ± 0.277
LysGln: 3.942 ± 0.359
LysArg: 3.365 ± 0.33
LysSer: 4.972 ± 0.427
LysThr: 4.821 ± 0.346
LysVal: 6.052 ± 0.445
LysTrp: 0.427 ± 0.087
LysTyr: 5.223 ± 0.405
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 3.139 ± 0.341
LeuCys: 0.527 ± 0.12
LeuAsp: 6.504 ± 0.405
LeuGlu: 8.01 ± 0.543
LeuPhe: 2.687 ± 0.205
LeuGly: 4.821 ± 0.474
LeuHis: 1.03 ± 0.154
LeuIle: 5.047 ± 0.437
LeuLys: 7.106 ± 0.421
LeuLeu: 6.027 ± 0.494
LeuMet: 1.934 ± 0.198
LeuAsn: 6.202 ± 0.398
LeuPro: 2.26 ± 0.223
LeuGln: 2.838 ± 0.272
LeuArg: 3.516 ± 0.259
LeuSer: 5.449 ± 0.332
LeuThr: 4.57 ± 0.396
LeuVal: 3.516 ± 0.299
LeuTrp: 0.578 ± 0.112
LeuTyr: 3.566 ± 0.367
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.381 ± 0.219
MetCys: 0.1 ± 0.048
MetAsp: 1.482 ± 0.189
MetGlu: 1.934 ± 0.261
MetPhe: 0.879 ± 0.154
MetGly: 1.406 ± 0.355
MetHis: 0.176 ± 0.06
MetIle: 1.783 ± 0.263
MetLys: 2.486 ± 0.3
MetLeu: 1.959 ± 0.261
MetMet: 0.778 ± 0.297
MetAsn: 1.758 ± 0.217
MetPro: 0.402 ± 0.108
MetGln: 0.728 ± 0.114
MetArg: 1.155 ± 0.155
MetSer: 1.758 ± 0.234
MetThr: 1.406 ± 0.202
MetVal: 1.105 ± 0.196
MetTrp: 0.226 ± 0.08
MetTyr: 1.155 ± 0.164
MetXaa: 0.025 ± 0.025

Asn
AsnAla: 2.461 ± 0.252
AsnCys: 0.427 ± 0.097
AsnAsp: 3.968 ± 0.313
AsnGlu: 5.499 ± 0.373
AsnPhe: 3.064 ± 0.293
AsnGly: 4.42 ± 0.398
AsnHis: 1.281 ± 0.172
AsnIle: 5.399 ± 0.46
AsnLys: 7.483 ± 0.435
AsnLeu: 5.6 ± 0.336
AsnMet: 1.532 ± 0.189
AsnAsn: 5.198 ± 0.405
AsnPro: 2.461 ± 0.225
AsnGln: 2.26 ± 0.247
AsnArg: 2.712 ± 0.277
AsnSer: 3.942 ± 0.294
AsnThr: 4.269 ± 0.315
AsnVal: 3.917 ± 0.295
AsnTrp: 0.578 ± 0.112
AsnTyr: 3.239 ± 0.313
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.753 ± 0.164
ProCys: 0.151 ± 0.061
ProAsp: 1.155 ± 0.167
ProGlu: 2.536 ± 0.263
ProPhe: 1.482 ± 0.201
ProGly: 0.703 ± 0.138
ProHis: 0.502 ± 0.094
ProIle: 1.908 ± 0.249
ProLys: 2.461 ± 0.262
ProLeu: 2.009 ± 0.213
ProMet: 0.477 ± 0.116
ProAsn: 2.034 ± 0.193
ProPro: 0.402 ± 0.086
ProGln: 1.733 ± 0.326
ProArg: 1.03 ± 0.148
ProSer: 1.883 ± 0.269
ProThr: 2.185 ± 0.297
ProVal: 1.256 ± 0.179
ProTrp: 0.151 ± 0.045
ProTyr: 1.532 ± 0.231
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.612 ± 0.317
GlnCys: 0.176 ± 0.061
GlnAsp: 2.511 ± 0.258
GlnGlu: 3.817 ± 0.402
GlnPhe: 1.532 ± 0.183
GlnGly: 2.812 ± 0.321
GlnHis: 0.628 ± 0.126
GlnIle: 2.26 ± 0.224
GlnLys: 2.712 ± 0.29
GlnLeu: 2.863 ± 0.297
GlnMet: 0.854 ± 0.163
GlnAsn: 2.436 ± 0.22
GlnPro: 1.406 ± 0.397
GlnGln: 2.486 ± 0.696
GlnArg: 1.431 ± 0.208
GlnSer: 2.36 ± 0.266
GlnThr: 2.034 ± 0.206
GlnVal: 2.612 ± 0.307
GlnTrp: 0.326 ± 0.09
GlnTyr: 1.532 ± 0.198
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.532 ± 0.232
ArgCys: 0.276 ± 0.102
ArgAsp: 2.712 ± 0.267
ArgGlu: 3.742 ± 0.315
ArgPhe: 1.783 ± 0.199
ArgGly: 2.109 ± 0.239
ArgHis: 0.502 ± 0.101
ArgIle: 2.511 ± 0.259
ArgLys: 3.591 ± 0.378
ArgLeu: 3.516 ± 0.294
ArgMet: 1.13 ± 0.189
ArgAsn: 2.461 ± 0.232
ArgPro: 0.829 ± 0.14
ArgGln: 1.532 ± 0.151
ArgArg: 1.632 ± 0.197
ArgSer: 1.934 ± 0.198
ArgThr: 2.185 ± 0.297
ArgVal: 2.687 ± 0.236
ArgTrp: 0.452 ± 0.111
ArgTyr: 2.285 ± 0.26
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.335 ± 0.286
SerCys: 0.402 ± 0.139
SerAsp: 4.369 ± 0.328
SerGlu: 4.821 ± 0.292
SerPhe: 2.511 ± 0.213
SerGly: 3.892 ± 0.396
SerHis: 0.753 ± 0.119
SerIle: 4.796 ± 0.382
SerLys: 5.298 ± 0.407
SerLeu: 5.022 ± 0.344
SerMet: 1.23 ± 0.31
SerAsn: 4.394 ± 0.294
SerPro: 1.456 ± 0.16
SerGln: 2.034 ± 0.278
SerArg: 2.084 ± 0.23
SerSer: 4.219 ± 0.55
SerThr: 3.415 ± 0.288
SerVal: 3.013 ± 0.317
SerTrp: 0.452 ± 0.09
SerTyr: 3.49 ± 0.285
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.26 ± 0.239
ThrCys: 0.352 ± 0.09
ThrAsp: 3.49 ± 0.249
ThrGlu: 4.696 ± 0.373
ThrPhe: 2.687 ± 0.283
ThrGly: 3.917 ± 0.385
ThrHis: 1.055 ± 0.164
ThrIle: 4.846 ± 0.352
ThrLys: 4.997 ± 0.41
ThrLeu: 4.52 ± 0.331
ThrMet: 1.18 ± 0.219
ThrAsn: 3.968 ± 0.313
ThrPro: 1.959 ± 0.198
ThrGln: 2.285 ± 0.33
ThrArg: 2.662 ± 0.301
ThrSer: 3.239 ± 0.251
ThrThr: 3.691 ± 0.357
ThrVal: 3.968 ± 0.347
ThrTrp: 0.603 ± 0.131
ThrTyr: 2.838 ± 0.306
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.21 ± 0.2
ValCys: 0.678 ± 0.125
ValAsp: 4.57 ± 0.39
ValGlu: 5.725 ± 0.434
ValPhe: 2.511 ± 0.28
ValGly: 2.411 ± 0.188
ValHis: 0.954 ± 0.184
ValIle: 4.018 ± 0.313
ValLys: 5.273 ± 0.396
ValLeu: 4.52 ± 0.422
ValMet: 1.03 ± 0.151
ValAsn: 3.792 ± 0.296
ValPro: 1.682 ± 0.266
ValGln: 2.134 ± 0.298
ValArg: 2.787 ± 0.261
ValSer: 4.068 ± 0.308
ValThr: 3.767 ± 0.359
ValVal: 3.189 ± 0.325
ValTrp: 0.377 ± 0.088
ValTyr: 3.164 ± 0.338
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.276 ± 0.093
TrpCys: 0.025 ± 0.026
TrpAsp: 0.578 ± 0.117
TrpGlu: 0.954 ± 0.139
TrpPhe: 0.251 ± 0.075
TrpGly: 0.377 ± 0.085
TrpHis: 0.1 ± 0.054
TrpIle: 0.527 ± 0.11
TrpLys: 0.854 ± 0.139
TrpLeu: 0.477 ± 0.113
TrpMet: 0.151 ± 0.061
TrpAsn: 0.452 ± 0.106
TrpPro: 0.0 ± 0.0
TrpGln: 0.452 ± 0.104
TrpArg: 0.352 ± 0.099
TrpSer: 0.653 ± 0.149
TrpThr: 0.352 ± 0.091
TrpVal: 0.527 ± 0.106
TrpTrp: 0.176 ± 0.076
TrpTyr: 0.502 ± 0.108
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.632 ± 0.165
TyrCys: 0.477 ± 0.105
TyrAsp: 3.792 ± 0.326
TyrGlu: 4.194 ± 0.393
TyrPhe: 2.386 ± 0.282
TyrGly: 3.038 ± 0.255
TyrHis: 0.854 ± 0.13
TyrIle: 4.294 ± 0.357
TyrLys: 4.62 ± 0.364
TyrLeu: 4.118 ± 0.352
TyrMet: 1.331 ± 0.216
TyrAsn: 4.219 ± 0.359
TyrPro: 1.105 ± 0.18
TyrGln: 1.808 ± 0.224
TyrArg: 1.708 ± 0.194
TyrSer: 2.913 ± 0.29
TyrThr: 3.566 ± 0.299
TyrVal: 3.34 ± 0.331
TyrTrp: 0.301 ± 0.087
TyrTyr: 3.139 ± 0.31
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.025 ± 0.025
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.025 ± 0.024
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 200 proteins (39824 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
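A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: the helper names `pair_permille` and `bootstrap_error` are hypothetical, the error is taken to be the standard deviation across resamples, and each resample draws whole proteins with replacement until it has as many proteins as the original proteome.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the statistic over protein-level resamples.

    Assumption: the reported error is the sample standard deviation of
    the n_boot resampled estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["AAAA", "CCCC", "ACAC"]  # made-up sequences for illustration
    print(pair_permille(toy_proteome, "AA"), bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins produce a correspondingly larger error.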

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski