Amino acid dipepetide frequency for Staphylococcus phage SPbeta-like

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.425AlaAla: 3.425 ± 0.67
0.432AlaCys: 0.432 ± 0.1
2.751AlaAsp: 2.751 ± 0.274
3.317AlaGlu: 3.317 ± 0.378
2.158AlaPhe: 2.158 ± 0.247
2.724AlaGly: 2.724 ± 0.417
0.863AlaHis: 0.863 ± 0.138
3.965AlaIle: 3.965 ± 0.31
5.124AlaLys: 5.124 ± 0.658
4.693AlaLeu: 4.693 ± 0.512
1.241AlaMet: 1.241 ± 0.186
3.56AlaAsn: 3.56 ± 0.402
1.025AlaPro: 1.025 ± 0.166
1.888AlaGln: 1.888 ± 0.309
2.023AlaArg: 2.023 ± 0.233
3.641AlaSer: 3.641 ± 0.542
3.479AlaThr: 3.479 ± 0.451
2.643AlaVal: 2.643 ± 0.292
0.378AlaTrp: 0.378 ± 0.095
2.373AlaTyr: 2.373 ± 0.202
0.0AlaXaa: 0.0 ± 0.0
Cys
0.351CysAla: 0.351 ± 0.097
0.054CysCys: 0.054 ± 0.052
0.405CysAsp: 0.405 ± 0.134
0.539CysGlu: 0.539 ± 0.158
0.162CysPhe: 0.162 ± 0.066
0.378CysGly: 0.378 ± 0.102
0.405CysHis: 0.405 ± 0.107
0.593CysIle: 0.593 ± 0.138
0.647CysLys: 0.647 ± 0.156
0.539CysLeu: 0.539 ± 0.125
0.135CysMet: 0.135 ± 0.061
0.566CysAsn: 0.566 ± 0.121
0.108CysPro: 0.108 ± 0.055
0.351CysGln: 0.351 ± 0.101
0.108CysArg: 0.108 ± 0.057
0.297CysSer: 0.297 ± 0.087
0.405CysThr: 0.405 ± 0.097
0.27CysVal: 0.27 ± 0.1
0.027CysTrp: 0.027 ± 0.029
0.378CysTyr: 0.378 ± 0.119
0.0CysXaa: 0.0 ± 0.0
Asp
2.319AspAla: 2.319 ± 0.306
0.27AspCys: 0.27 ± 0.085
3.587AspAsp: 3.587 ± 0.332
5.961AspGlu: 5.961 ± 0.509
3.722AspPhe: 3.722 ± 0.396
3.803AspGly: 3.803 ± 0.511
1.025AspHis: 1.025 ± 0.17
5.799AspIle: 5.799 ± 0.416
5.961AspLys: 5.961 ± 0.471
6.311AspLeu: 6.311 ± 0.36
1.537AspMet: 1.537 ± 0.226
4.342AspAsn: 4.342 ± 0.38
0.809AspPro: 0.809 ± 0.144
2.4AspGln: 2.4 ± 0.219
2.185AspArg: 2.185 ± 0.246
3.075AspSer: 3.075 ± 0.303
3.641AspThr: 3.641 ± 0.395
3.938AspVal: 3.938 ± 0.28
0.539AspTrp: 0.539 ± 0.125
3.803AspTyr: 3.803 ± 0.367
0.0AspXaa: 0.0 ± 0.0
Glu
4.423GluAla: 4.423 ± 0.446
0.647GluCys: 0.647 ± 0.139
5.367GluAsp: 5.367 ± 0.505
8.793GluGlu: 8.793 ± 0.712
3.776GluPhe: 3.776 ± 0.362
3.263GluGly: 3.263 ± 0.369
1.429GluHis: 1.429 ± 0.217
6.905GluIle: 6.905 ± 0.562
6.824GluLys: 6.824 ± 0.415
8.523GluLeu: 8.523 ± 0.486
2.212GluMet: 2.212 ± 0.237
5.664GluAsn: 5.664 ± 0.406
1.322GluPro: 1.322 ± 0.226
3.344GluGln: 3.344 ± 0.357
3.938GluArg: 3.938 ± 0.271
3.371GluSer: 3.371 ± 0.269
4.72GluThr: 4.72 ± 0.347
4.801GluVal: 4.801 ± 0.401
0.89GluTrp: 0.89 ± 0.12
4.396GluTyr: 4.396 ± 0.421
0.0GluXaa: 0.0 ± 0.0
Phe
1.726PheAla: 1.726 ± 0.237
0.324PheCys: 0.324 ± 0.106
3.183PheAsp: 3.183 ± 0.292
3.533PheGlu: 3.533 ± 0.304
1.349PhePhe: 1.349 ± 0.172
2.4PheGly: 2.4 ± 0.23
0.917PheHis: 0.917 ± 0.14
3.075PheIle: 3.075 ± 0.351
4.396PheLys: 4.396 ± 0.309
2.724PheLeu: 2.724 ± 0.345
0.998PheMet: 0.998 ± 0.144
3.317PheAsn: 3.317 ± 0.271
0.809PhePro: 0.809 ± 0.169
1.079PheGln: 1.079 ± 0.144
1.214PheArg: 1.214 ± 0.184
3.237PheSer: 3.237 ± 0.317
2.751PheThr: 2.751 ± 0.287
2.05PheVal: 2.05 ± 0.275
0.351PheTrp: 0.351 ± 0.107
2.104PheTyr: 2.104 ± 0.239
0.0PheXaa: 0.0 ± 0.0
Gly
2.454GlyAla: 2.454 ± 0.374
0.324GlyCys: 0.324 ± 0.101
3.075GlyAsp: 3.075 ± 0.322
3.884GlyGlu: 3.884 ± 0.279
2.185GlyPhe: 2.185 ± 0.245
2.373GlyGly: 2.373 ± 0.332
1.079GlyHis: 1.079 ± 0.208
4.046GlyIle: 4.046 ± 0.329
5.044GlyLys: 5.044 ± 0.37
4.073GlyLeu: 4.073 ± 0.457
1.187GlyMet: 1.187 ± 0.187
2.913GlyAsn: 2.913 ± 0.259
0.054GlyPro: 0.054 ± 0.034
1.591GlyGln: 1.591 ± 0.28
1.888GlyArg: 1.888 ± 0.218
3.776GlySer: 3.776 ± 0.35
2.994GlyThr: 2.994 ± 0.308
3.452GlyVal: 3.452 ± 0.276
0.647GlyTrp: 0.647 ± 0.137
2.805GlyTyr: 2.805 ± 0.302
0.0GlyXaa: 0.0 ± 0.0
His
0.674HisAla: 0.674 ± 0.12
0.216HisCys: 0.216 ± 0.078
1.079HisAsp: 1.079 ± 0.19
1.402HisGlu: 1.402 ± 0.224
0.512HisPhe: 0.512 ± 0.129
1.025HisGly: 1.025 ± 0.191
0.459HisHis: 0.459 ± 0.11
1.645HisIle: 1.645 ± 0.25
1.834HisLys: 1.834 ± 0.249
1.483HisLeu: 1.483 ± 0.221
0.324HisMet: 0.324 ± 0.087
1.214HisAsn: 1.214 ± 0.196
0.351HisPro: 0.351 ± 0.075
0.485HisGln: 0.485 ± 0.092
0.674HisArg: 0.674 ± 0.149
1.268HisSer: 1.268 ± 0.191
1.241HisThr: 1.241 ± 0.215
0.971HisVal: 0.971 ± 0.142
0.135HisTrp: 0.135 ± 0.073
0.863HisTyr: 0.863 ± 0.167
0.0HisXaa: 0.0 ± 0.0
Ile
3.884IleAla: 3.884 ± 0.324
0.62IleCys: 0.62 ± 0.147
6.311IleAsp: 6.311 ± 0.457
7.687IleGlu: 7.687 ± 0.482
2.778IlePhe: 2.778 ± 0.326
3.614IleGly: 3.614 ± 0.369
1.429IleHis: 1.429 ± 0.232
5.421IleIle: 5.421 ± 0.495
7.417IleLys: 7.417 ± 0.449
5.071IleLeu: 5.071 ± 0.343
1.915IleMet: 1.915 ± 0.238
6.311IleAsn: 6.311 ± 0.411
1.726IlePro: 1.726 ± 0.247
2.4IleGln: 2.4 ± 0.244
2.508IleArg: 2.508 ± 0.265
5.286IleSer: 5.286 ± 0.433
4.693IleThr: 4.693 ± 0.364
4.207IleVal: 4.207 ± 0.41
0.432IleTrp: 0.432 ± 0.127
3.29IleTyr: 3.29 ± 0.37
0.0IleXaa: 0.0 ± 0.0
Lys
5.745LysAla: 5.745 ± 0.619
0.459LysCys: 0.459 ± 0.104
6.392LysAsp: 6.392 ± 0.479
9.332LysGlu: 9.332 ± 0.821
3.452LysPhe: 3.452 ± 0.337
4.747LysGly: 4.747 ± 0.381
1.564LysHis: 1.564 ± 0.269
6.338LysIle: 6.338 ± 0.429
7.983LysLys: 7.983 ± 0.6
7.552LysLeu: 7.552 ± 0.418
2.185LysMet: 2.185 ± 0.182
6.311LysAsn: 6.311 ± 0.424
2.589LysPro: 2.589 ± 0.294
4.019LysGln: 4.019 ± 0.48
2.994LysArg: 2.994 ± 0.379
5.637LysSer: 5.637 ± 0.514
5.421LysThr: 5.421 ± 0.422
6.5LysVal: 6.5 ± 0.39
0.647LysTrp: 0.647 ± 0.135
5.044LysTyr: 5.044 ± 0.382
0.0LysXaa: 0.0 ± 0.0
Leu
4.342LeuAla: 4.342 ± 0.393
0.593LeuCys: 0.593 ± 0.1
5.529LeuAsp: 5.529 ± 0.399
7.201LeuGlu: 7.201 ± 0.494
2.724LeuPhe: 2.724 ± 0.388
3.776LeuGly: 3.776 ± 0.314
1.537LeuHis: 1.537 ± 0.213
5.394LeuIle: 5.394 ± 0.496
8.442LeuLys: 8.442 ± 0.511
6.257LeuLeu: 6.257 ± 0.531
1.888LeuMet: 1.888 ± 0.225
7.012LeuAsn: 7.012 ± 0.385
2.266LeuPro: 2.266 ± 0.23
3.29LeuGln: 3.29 ± 0.389
3.587LeuArg: 3.587 ± 0.292
6.419LeuSer: 6.419 ± 0.421
5.637LeuThr: 5.637 ± 0.4
3.722LeuVal: 3.722 ± 0.303
0.512LeuTrp: 0.512 ± 0.131
3.803LeuTyr: 3.803 ± 0.277
0.0LeuXaa: 0.0 ± 0.0
Met
1.51MetAla: 1.51 ± 0.161
0.216MetCys: 0.216 ± 0.076
1.268MetAsp: 1.268 ± 0.166
2.158MetGlu: 2.158 ± 0.32
1.025MetPhe: 1.025 ± 0.169
0.539MetGly: 0.539 ± 0.14
0.378MetHis: 0.378 ± 0.086
1.834MetIle: 1.834 ± 0.21
3.129MetLys: 3.129 ± 0.313
1.537MetLeu: 1.537 ± 0.186
0.62MetMet: 0.62 ± 0.115
1.78MetAsn: 1.78 ± 0.263
0.566MetPro: 0.566 ± 0.137
0.809MetGln: 0.809 ± 0.128
1.052MetArg: 1.052 ± 0.189
1.672MetSer: 1.672 ± 0.183
1.429MetThr: 1.429 ± 0.197
0.998MetVal: 0.998 ± 0.152
0.243MetTrp: 0.243 ± 0.09
0.809MetTyr: 0.809 ± 0.152
0.0MetXaa: 0.0 ± 0.0
Asn
3.749AsnAla: 3.749 ± 0.388
0.216AsnCys: 0.216 ± 0.075
4.046AsnAsp: 4.046 ± 0.352
5.286AsnGlu: 5.286 ± 0.341
2.913AsnPhe: 2.913 ± 0.263
4.369AsnGly: 4.369 ± 0.292
1.16AsnHis: 1.16 ± 0.168
5.34AsnIle: 5.34 ± 0.393
7.255AsnLys: 7.255 ± 0.478
5.475AsnLeu: 5.475 ± 0.363
1.376AsnMet: 1.376 ± 0.198
5.232AsnAsn: 5.232 ± 0.411
1.51AsnPro: 1.51 ± 0.218
2.454AsnGln: 2.454 ± 0.277
2.913AsnArg: 2.913 ± 0.279
4.585AsnSer: 4.585 ± 0.416
3.911AsnThr: 3.911 ± 0.444
3.884AsnVal: 3.884 ± 0.303
0.512AsnTrp: 0.512 ± 0.103
3.317AsnTyr: 3.317 ± 0.31
0.0AsnXaa: 0.0 ± 0.0
Pro
0.917ProAla: 0.917 ± 0.158
0.189ProCys: 0.189 ± 0.073
1.295ProAsp: 1.295 ± 0.239
1.483ProGlu: 1.483 ± 0.214
0.863ProPhe: 0.863 ± 0.161
0.162ProGly: 0.162 ± 0.073
0.512ProHis: 0.512 ± 0.114
1.591ProIle: 1.591 ± 0.215
2.131ProLys: 2.131 ± 0.271
2.158ProLeu: 2.158 ± 0.254
0.459ProMet: 0.459 ± 0.097
1.726ProAsn: 1.726 ± 0.225
0.432ProPro: 0.432 ± 0.106
0.674ProGln: 0.674 ± 0.176
0.674ProArg: 0.674 ± 0.139
1.483ProSer: 1.483 ± 0.198
1.969ProThr: 1.969 ± 0.234
1.402ProVal: 1.402 ± 0.189
0.081ProTrp: 0.081 ± 0.054
1.268ProTyr: 1.268 ± 0.187
0.0ProXaa: 0.0 ± 0.0
Gln
2.427GlnAla: 2.427 ± 0.518
0.162GlnCys: 0.162 ± 0.059
1.996GlnAsp: 1.996 ± 0.215
2.913GlnGlu: 2.913 ± 0.299
2.05GlnPhe: 2.05 ± 0.285
1.349GlnGly: 1.349 ± 0.19
0.459GlnHis: 0.459 ± 0.09
3.183GlnIle: 3.183 ± 0.299
2.805GlnLys: 2.805 ± 0.421
3.884GlnLeu: 3.884 ± 0.459
0.998GlnMet: 0.998 ± 0.159
1.753GlnAsn: 1.753 ± 0.225
0.728GlnPro: 0.728 ± 0.141
1.429GlnGln: 1.429 ± 0.272
1.349GlnArg: 1.349 ± 0.198
1.996GlnSer: 1.996 ± 0.224
2.185GlnThr: 2.185 ± 0.329
2.023GlnVal: 2.023 ± 0.215
0.432GlnTrp: 0.432 ± 0.088
1.861GlnTyr: 1.861 ± 0.208
0.0GlnXaa: 0.0 ± 0.0
Arg
2.05ArgAla: 2.05 ± 0.278
0.324ArgCys: 0.324 ± 0.1
2.4ArgAsp: 2.4 ± 0.262
2.724ArgGlu: 2.724 ± 0.349
1.51ArgPhe: 1.51 ± 0.227
2.212ArgGly: 2.212 ± 0.218
0.485ArgHis: 0.485 ± 0.108
2.643ArgIle: 2.643 ± 0.257
3.668ArgLys: 3.668 ± 0.364
3.452ArgLeu: 3.452 ± 0.325
1.079ArgMet: 1.079 ± 0.175
2.427ArgAsn: 2.427 ± 0.272
0.89ArgPro: 0.89 ± 0.153
1.349ArgGln: 1.349 ± 0.331
2.158ArgArg: 2.158 ± 0.199
1.834ArgSer: 1.834 ± 0.205
2.239ArgThr: 2.239 ± 0.242
2.427ArgVal: 2.427 ± 0.265
0.297ArgTrp: 0.297 ± 0.089
2.077ArgTyr: 2.077 ± 0.221
0.0ArgXaa: 0.0 ± 0.0
Ser
3.102SerAla: 3.102 ± 0.462
0.485SerCys: 0.485 ± 0.111
4.261SerAsp: 4.261 ± 0.426
4.396SerGlu: 4.396 ± 0.393
2.913SerPhe: 2.913 ± 0.224
3.776SerGly: 3.776 ± 0.382
1.268SerHis: 1.268 ± 0.216
4.963SerIle: 4.963 ± 0.364
6.635SerLys: 6.635 ± 0.461
5.448SerLeu: 5.448 ± 0.386
1.456SerMet: 1.456 ± 0.177
3.722SerAsn: 3.722 ± 0.264
1.645SerPro: 1.645 ± 0.206
2.158SerGln: 2.158 ± 0.319
2.212SerArg: 2.212 ± 0.239
4.261SerSer: 4.261 ± 0.566
3.776SerThr: 3.776 ± 0.349
3.803SerVal: 3.803 ± 0.296
0.459SerTrp: 0.459 ± 0.103
2.616SerTyr: 2.616 ± 0.255
0.0SerXaa: 0.0 ± 0.0
Thr
2.967ThrAla: 2.967 ± 0.425
0.351ThrCys: 0.351 ± 0.101
4.18ThrAsp: 4.18 ± 0.399
4.801ThrGlu: 4.801 ± 0.408
2.967ThrPhe: 2.967 ± 0.257
3.533ThrGly: 3.533 ± 0.32
0.971ThrHis: 0.971 ± 0.174
4.558ThrIle: 4.558 ± 0.368
5.394ThrLys: 5.394 ± 0.435
5.394ThrLeu: 5.394 ± 0.435
1.376ThrMet: 1.376 ± 0.166
4.046ThrAsn: 4.046 ± 0.357
1.834ThrPro: 1.834 ± 0.294
2.104ThrGln: 2.104 ± 0.204
2.293ThrArg: 2.293 ± 0.23
3.749ThrSer: 3.749 ± 0.302
3.641ThrThr: 3.641 ± 0.396
3.371ThrVal: 3.371 ± 0.33
0.27ThrTrp: 0.27 ± 0.076
3.183ThrTyr: 3.183 ± 0.303
0.0ThrXaa: 0.0 ± 0.0
Val
3.29ValAla: 3.29 ± 0.288
0.189ValCys: 0.189 ± 0.063
4.207ValAsp: 4.207 ± 0.343
4.99ValGlu: 4.99 ± 0.498
2.212ValPhe: 2.212 ± 0.24
2.967ValGly: 2.967 ± 0.333
0.836ValHis: 0.836 ± 0.139
4.936ValIle: 4.936 ± 0.371
5.151ValLys: 5.151 ± 0.341
3.587ValLeu: 3.587 ± 0.389
1.133ValMet: 1.133 ± 0.184
3.722ValAsn: 3.722 ± 0.312
1.429ValPro: 1.429 ± 0.207
1.969ValGln: 1.969 ± 0.182
2.212ValArg: 2.212 ± 0.318
3.776ValSer: 3.776 ± 0.284
3.452ValThr: 3.452 ± 0.265
3.237ValVal: 3.237 ± 0.286
0.405ValTrp: 0.405 ± 0.079
2.481ValTyr: 2.481 ± 0.348
0.0ValXaa: 0.0 ± 0.0
Trp
0.297TrpAla: 0.297 ± 0.09
0.054TrpCys: 0.054 ± 0.038
0.405TrpAsp: 0.405 ± 0.109
0.674TrpGlu: 0.674 ± 0.104
0.405TrpPhe: 0.405 ± 0.1
0.189TrpGly: 0.189 ± 0.074
0.189TrpHis: 0.189 ± 0.068
0.539TrpIle: 0.539 ± 0.135
0.836TrpLys: 0.836 ± 0.156
0.701TrpLeu: 0.701 ± 0.129
0.162TrpMet: 0.162 ± 0.058
0.593TrpAsn: 0.593 ± 0.109
0.027TrpPro: 0.027 ± 0.026
0.162TrpGln: 0.162 ± 0.062
0.432TrpArg: 0.432 ± 0.094
0.62TrpSer: 0.62 ± 0.133
0.485TrpThr: 0.485 ± 0.15
0.405TrpVal: 0.405 ± 0.1
0.081TrpTrp: 0.081 ± 0.05
0.459TrpTyr: 0.459 ± 0.107
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.05TyrAla: 2.05 ± 0.207
0.593TyrCys: 0.593 ± 0.138
3.29TyrAsp: 3.29 ± 0.27
3.533TyrGlu: 3.533 ± 0.288
1.861TyrPhe: 1.861 ± 0.258
2.643TyrGly: 2.643 ± 0.312
0.89TyrHis: 0.89 ± 0.163
4.234TyrIle: 4.234 ± 0.358
4.261TyrLys: 4.261 ± 0.321
5.071TyrLeu: 5.071 ± 0.603
1.268TyrMet: 1.268 ± 0.167
3.29TyrAsn: 3.29 ± 0.44
1.349TyrPro: 1.349 ± 0.211
2.023TyrGln: 2.023 ± 0.228
1.834TyrArg: 1.834 ± 0.25
3.371TyrSer: 3.371 ± 0.31
2.913TyrThr: 2.913 ± 0.292
2.077TyrVal: 2.077 ± 0.244
0.378TyrTrp: 0.378 ± 0.105
2.94TyrTyr: 2.94 ± 0.257
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 154 proteins (37078 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski