Amino acid dipepetide frequency for Streptococcus phage phi-SgaBSJ27_rum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.82AlaAla: 2.82 ± 0.519
0.425AlaCys: 0.425 ± 0.104
3.396AlaAsp: 3.396 ± 0.29
3.912AlaGlu: 3.912 ± 0.356
3.093AlaPhe: 3.093 ± 0.304
3.639AlaGly: 3.639 ± 0.379
0.788AlaHis: 0.788 ± 0.144
5.186AlaIle: 5.186 ± 0.444
4.761AlaLys: 4.761 ± 0.353
4.973AlaLeu: 4.973 ± 0.461
1.274AlaMet: 1.274 ± 0.203
3.063AlaAsn: 3.063 ± 0.316
1.486AlaPro: 1.486 ± 0.186
2.456AlaGln: 2.456 ± 0.307
2.699AlaArg: 2.699 ± 0.301
3.305AlaSer: 3.305 ± 0.466
3.275AlaThr: 3.275 ± 0.359
3.002AlaVal: 3.002 ± 0.36
0.576AlaTrp: 0.576 ± 0.151
2.881AlaTyr: 2.881 ± 0.329
0.0AlaXaa: 0.0 ± 0.0
Cys
0.485CysAla: 0.485 ± 0.145
0.152CysCys: 0.152 ± 0.063
0.364CysAsp: 0.364 ± 0.097
0.516CysGlu: 0.516 ± 0.103
0.425CysPhe: 0.425 ± 0.141
0.758CysGly: 0.758 ± 0.176
0.212CysHis: 0.212 ± 0.094
0.455CysIle: 0.455 ± 0.126
0.485CysLys: 0.485 ± 0.152
0.788CysLeu: 0.788 ± 0.176
0.182CysMet: 0.182 ± 0.079
0.182CysAsn: 0.182 ± 0.063
0.243CysPro: 0.243 ± 0.085
0.607CysGln: 0.607 ± 0.168
0.546CysArg: 0.546 ± 0.173
0.455CysSer: 0.455 ± 0.127
0.121CysThr: 0.121 ± 0.054
0.576CysVal: 0.576 ± 0.153
0.0CysTrp: 0.0 ± 0.0
0.394CysTyr: 0.394 ± 0.124
0.0CysXaa: 0.0 ± 0.0
Asp
2.456AspAla: 2.456 ± 0.337
0.697AspCys: 0.697 ± 0.163
3.245AspAsp: 3.245 ± 0.404
5.974AspGlu: 5.974 ± 0.475
3.457AspPhe: 3.457 ± 0.376
4.67AspGly: 4.67 ± 0.429
0.788AspHis: 0.788 ± 0.16
4.67AspIle: 4.67 ± 0.394
4.913AspLys: 4.913 ± 0.393
5.398AspLeu: 5.398 ± 0.424
1.85AspMet: 1.85 ± 0.199
3.033AspAsn: 3.033 ± 0.383
1.547AspPro: 1.547 ± 0.222
1.82AspGln: 1.82 ± 0.264
2.365AspArg: 2.365 ± 0.285
3.73AspSer: 3.73 ± 0.379
3.245AspThr: 3.245 ± 0.297
3.73AspVal: 3.73 ± 0.336
0.788AspTrp: 0.788 ± 0.13
3.942AspTyr: 3.942 ± 0.389
0.0AspXaa: 0.0 ± 0.0
Glu
4.367GluAla: 4.367 ± 0.428
0.607GluCys: 0.607 ± 0.156
4.185GluAsp: 4.185 ± 0.394
5.883GluGlu: 5.883 ± 0.53
2.487GluPhe: 2.487 ± 0.347
3.942GluGly: 3.942 ± 0.317
1.183GluHis: 1.183 ± 0.218
4.973GluIle: 4.973 ± 0.498
6.732GluLys: 6.732 ± 0.505
8.339GluLeu: 8.339 ± 0.664
1.88GluMet: 1.88 ± 0.279
4.367GluAsn: 4.367 ± 0.376
1.395GluPro: 1.395 ± 0.238
3.457GluGln: 3.457 ± 0.351
3.154GluArg: 3.154 ± 0.294
3.457GluSer: 3.457 ± 0.309
4.761GluThr: 4.761 ± 0.458
4.67GluVal: 4.67 ± 0.397
0.667GluTrp: 0.667 ± 0.141
2.365GluTyr: 2.365 ± 0.303
0.0GluXaa: 0.0 ± 0.0
Phe
2.274PheAla: 2.274 ± 0.331
0.546PheCys: 0.546 ± 0.148
3.487PheAsp: 3.487 ± 0.36
3.123PheGlu: 3.123 ± 0.376
1.759PhePhe: 1.759 ± 0.249
2.456PheGly: 2.456 ± 0.242
0.788PheHis: 0.788 ± 0.147
2.456PheIle: 2.456 ± 0.372
3.275PheLys: 3.275 ± 0.362
4.367PheLeu: 4.367 ± 0.422
0.849PheMet: 0.849 ± 0.155
2.365PheAsn: 2.365 ± 0.295
1.092PhePro: 1.092 ± 0.212
1.729PheGln: 1.729 ± 0.258
2.001PheArg: 2.001 ± 0.233
3.033PheSer: 3.033 ± 0.29
2.305PheThr: 2.305 ± 0.271
2.729PheVal: 2.729 ± 0.352
0.394PheTrp: 0.394 ± 0.117
1.789PheTyr: 1.789 ± 0.186
0.0PheXaa: 0.0 ± 0.0
Gly
3.033GlyAla: 3.033 ± 0.311
0.334GlyCys: 0.334 ± 0.099
3.548GlyAsp: 3.548 ± 0.373
3.609GlyGlu: 3.609 ± 0.329
2.638GlyPhe: 2.638 ± 0.293
3.73GlyGly: 3.73 ± 0.38
1.365GlyHis: 1.365 ± 0.216
5.034GlyIle: 5.034 ± 0.535
5.095GlyLys: 5.095 ± 0.329
5.853GlyLeu: 5.853 ± 0.419
1.456GlyMet: 1.456 ± 0.192
3.457GlyAsn: 3.457 ± 0.342
0.546GlyPro: 0.546 ± 0.129
2.82GlyGln: 2.82 ± 0.383
2.608GlyArg: 2.608 ± 0.3
4.003GlySer: 4.003 ± 0.33
3.639GlyThr: 3.639 ± 0.363
4.215GlyVal: 4.215 ± 0.344
0.425GlyTrp: 0.425 ± 0.098
2.76GlyTyr: 2.76 ± 0.266
0.0GlyXaa: 0.0 ± 0.0
His
0.697HisAla: 0.697 ± 0.126
0.152HisCys: 0.152 ± 0.075
1.243HisAsp: 1.243 ± 0.202
1.061HisGlu: 1.061 ± 0.145
0.849HisPhe: 0.849 ± 0.182
1.304HisGly: 1.304 ± 0.185
0.637HisHis: 0.637 ± 0.127
1.152HisIle: 1.152 ± 0.171
1.183HisLys: 1.183 ± 0.216
1.971HisLeu: 1.971 ± 0.245
0.425HisMet: 0.425 ± 0.12
0.728HisAsn: 0.728 ± 0.124
0.697HisPro: 0.697 ± 0.172
1.152HisGln: 1.152 ± 0.222
0.94HisArg: 0.94 ± 0.191
1.031HisSer: 1.031 ± 0.197
0.94HisThr: 0.94 ± 0.155
0.879HisVal: 0.879 ± 0.154
0.121HisTrp: 0.121 ± 0.055
1.001HisTyr: 1.001 ± 0.16
0.0HisXaa: 0.0 ± 0.0
Ile
4.276IleAla: 4.276 ± 0.315
0.637IleCys: 0.637 ± 0.173
5.095IleAsp: 5.095 ± 0.362
5.004IleGlu: 5.004 ± 0.373
2.426IlePhe: 2.426 ± 0.362
3.851IleGly: 3.851 ± 0.365
1.061IleHis: 1.061 ± 0.201
4.306IleIle: 4.306 ± 0.414
4.7IleLys: 4.7 ± 0.412
7.096IleLeu: 7.096 ± 0.666
1.304IleMet: 1.304 ± 0.285
3.305IleAsn: 3.305 ± 0.313
2.487IlePro: 2.487 ± 0.192
2.669IleGln: 2.669 ± 0.233
3.336IleArg: 3.336 ± 0.418
5.701IleSer: 5.701 ± 0.571
3.942IleThr: 3.942 ± 0.369
4.336IleVal: 4.336 ± 0.418
0.788IleTrp: 0.788 ± 0.177
2.547IleTyr: 2.547 ± 0.315
0.0IleXaa: 0.0 ± 0.0
Lys
5.125LysAla: 5.125 ± 0.371
0.516LysCys: 0.516 ± 0.133
4.64LysAsp: 4.64 ± 0.41
5.701LysGlu: 5.701 ± 0.426
2.881LysPhe: 2.881 ± 0.282
4.336LysGly: 4.336 ± 0.397
1.577LysHis: 1.577 ± 0.221
4.973LysIle: 4.973 ± 0.412
5.549LysLys: 5.549 ± 0.465
6.611LysLeu: 6.611 ± 0.416
1.85LysMet: 1.85 ± 0.26
3.791LysAsn: 3.791 ± 0.344
1.88LysPro: 1.88 ± 0.221
3.609LysGln: 3.609 ± 0.312
3.791LysArg: 3.791 ± 0.374
4.882LysSer: 4.882 ± 0.335
4.549LysThr: 4.549 ± 0.404
4.943LysVal: 4.943 ± 0.362
1.061LysTrp: 1.061 ± 0.201
2.82LysTyr: 2.82 ± 0.334
0.0LysXaa: 0.0 ± 0.0
Leu
6.52LeuAla: 6.52 ± 0.419
0.637LeuCys: 0.637 ± 0.163
6.308LeuAsp: 6.308 ± 0.465
7.733LeuGlu: 7.733 ± 0.551
3.7LeuPhe: 3.7 ± 0.35
5.489LeuGly: 5.489 ± 0.398
1.729LeuHis: 1.729 ± 0.206
6.581LeuIle: 6.581 ± 0.576
6.368LeuLys: 6.368 ± 0.598
8.916LeuLeu: 8.916 ± 0.802
2.032LeuMet: 2.032 ± 0.252
5.064LeuAsn: 5.064 ± 0.386
2.972LeuPro: 2.972 ± 0.347
3.578LeuGln: 3.578 ± 0.295
3.154LeuArg: 3.154 ± 0.284
8.309LeuSer: 8.309 ± 0.481
6.702LeuThr: 6.702 ± 0.486
6.399LeuVal: 6.399 ± 0.509
0.546LeuTrp: 0.546 ± 0.124
3.457LeuTyr: 3.457 ± 0.386
0.0LeuXaa: 0.0 ± 0.0
Met
1.243MetAla: 1.243 ± 0.243
0.091MetCys: 0.091 ± 0.05
1.395MetAsp: 1.395 ± 0.22
1.82MetGlu: 1.82 ± 0.216
0.637MetPhe: 0.637 ± 0.157
1.274MetGly: 1.274 ± 0.207
0.152MetHis: 0.152 ± 0.062
1.607MetIle: 1.607 ± 0.26
2.001MetLys: 2.001 ± 0.214
1.607MetLeu: 1.607 ± 0.234
0.516MetMet: 0.516 ± 0.128
1.152MetAsn: 1.152 ± 0.219
0.607MetPro: 0.607 ± 0.156
0.728MetGln: 0.728 ± 0.157
1.061MetArg: 1.061 ± 0.163
1.82MetSer: 1.82 ± 0.259
1.971MetThr: 1.971 ± 0.238
1.547MetVal: 1.547 ± 0.245
0.121MetTrp: 0.121 ± 0.053
0.485MetTyr: 0.485 ± 0.119
0.0MetXaa: 0.0 ± 0.0
Asn
2.972AsnAla: 2.972 ± 0.444
0.243AsnCys: 0.243 ± 0.096
2.942AsnAsp: 2.942 ± 0.355
3.033AsnGlu: 3.033 ± 0.323
2.335AsnPhe: 2.335 ± 0.296
4.609AsnGly: 4.609 ± 0.434
1.456AsnHis: 1.456 ± 0.186
3.184AsnIle: 3.184 ± 0.323
3.275AsnLys: 3.275 ± 0.309
5.004AsnLeu: 5.004 ± 0.347
1.031AsnMet: 1.031 ± 0.19
2.578AsnAsn: 2.578 ± 0.307
2.396AsnPro: 2.396 ± 0.279
2.881AsnGln: 2.881 ± 0.335
2.305AsnArg: 2.305 ± 0.305
3.942AsnSer: 3.942 ± 0.35
2.669AsnThr: 2.669 ± 0.382
2.638AsnVal: 2.638 ± 0.306
0.667AsnTrp: 0.667 ± 0.16
2.032AsnTyr: 2.032 ± 0.264
0.0AsnXaa: 0.0 ± 0.0
Pro
1.365ProAla: 1.365 ± 0.191
0.394ProCys: 0.394 ± 0.12
2.092ProAsp: 2.092 ± 0.299
2.001ProGlu: 2.001 ± 0.355
1.334ProPhe: 1.334 ± 0.21
0.697ProGly: 0.697 ± 0.146
0.455ProHis: 0.455 ± 0.126
2.092ProIle: 2.092 ± 0.266
2.547ProLys: 2.547 ± 0.394
2.729ProLeu: 2.729 ± 0.295
0.546ProMet: 0.546 ± 0.139
1.729ProAsn: 1.729 ± 0.215
0.849ProPro: 0.849 ± 0.189
1.274ProGln: 1.274 ± 0.262
1.213ProArg: 1.213 ± 0.18
2.001ProSer: 2.001 ± 0.254
2.092ProThr: 2.092 ± 0.268
2.062ProVal: 2.062 ± 0.248
0.243ProTrp: 0.243 ± 0.089
1.122ProTyr: 1.122 ± 0.178
0.0ProXaa: 0.0 ± 0.0
Gln
3.7GlnAla: 3.7 ± 0.467
0.273GlnCys: 0.273 ± 0.116
2.426GlnAsp: 2.426 ± 0.271
3.609GlnGlu: 3.609 ± 0.321
1.577GlnPhe: 1.577 ± 0.248
1.941GlnGly: 1.941 ± 0.255
0.667GlnHis: 0.667 ± 0.14
2.79GlnIle: 2.79 ± 0.307
3.457GlnLys: 3.457 ± 0.358
4.579GlnLeu: 4.579 ± 0.343
1.001GlnMet: 1.001 ± 0.185
2.487GlnAsn: 2.487 ± 0.286
1.365GlnPro: 1.365 ± 0.233
1.82GlnGln: 1.82 ± 0.268
1.516GlnArg: 1.516 ± 0.258
2.426GlnSer: 2.426 ± 0.256
3.093GlnThr: 3.093 ± 0.422
3.791GlnVal: 3.791 ± 0.431
0.485GlnTrp: 0.485 ± 0.163
1.334GlnTyr: 1.334 ± 0.228
0.0GlnXaa: 0.0 ± 0.0
Arg
2.305ArgAla: 2.305 ± 0.26
0.425ArgCys: 0.425 ± 0.106
2.456ArgAsp: 2.456 ± 0.32
3.033ArgGlu: 3.033 ± 0.342
2.062ArgPhe: 2.062 ± 0.295
2.547ArgGly: 2.547 ± 0.273
0.849ArgHis: 0.849 ± 0.174
3.336ArgIle: 3.336 ± 0.379
3.245ArgLys: 3.245 ± 0.378
4.518ArgLeu: 4.518 ± 0.463
0.849ArgMet: 0.849 ± 0.163
2.244ArgAsn: 2.244 ± 0.278
1.183ArgPro: 1.183 ± 0.179
2.578ArgGln: 2.578 ± 0.248
2.274ArgArg: 2.274 ± 0.411
2.426ArgSer: 2.426 ± 0.254
2.76ArgThr: 2.76 ± 0.304
2.578ArgVal: 2.578 ± 0.262
0.546ArgTrp: 0.546 ± 0.152
1.971ArgTyr: 1.971 ± 0.243
0.0ArgXaa: 0.0 ± 0.0
Ser
3.184SerAla: 3.184 ± 0.428
0.394SerCys: 0.394 ± 0.116
5.034SerAsp: 5.034 ± 0.413
4.882SerGlu: 4.882 ± 0.372
3.305SerPhe: 3.305 ± 0.306
4.609SerGly: 4.609 ± 0.348
1.486SerHis: 1.486 ± 0.181
4.155SerIle: 4.155 ± 0.343
4.761SerLys: 4.761 ± 0.397
6.611SerLeu: 6.611 ± 0.438
1.213SerMet: 1.213 ± 0.168
3.457SerAsn: 3.457 ± 0.399
2.214SerPro: 2.214 ± 0.213
3.336SerGln: 3.336 ± 0.47
3.214SerArg: 3.214 ± 0.296
5.095SerSer: 5.095 ± 0.55
3.851SerThr: 3.851 ± 0.428
4.306SerVal: 4.306 ± 0.349
0.819SerTrp: 0.819 ± 0.142
2.942SerTyr: 2.942 ± 0.348
0.0SerXaa: 0.0 ± 0.0
Thr
3.518ThrAla: 3.518 ± 0.332
0.303ThrCys: 0.303 ± 0.095
3.184ThrAsp: 3.184 ± 0.356
3.821ThrGlu: 3.821 ± 0.378
2.608ThrPhe: 2.608 ± 0.332
4.246ThrGly: 4.246 ± 0.417
0.728ThrHis: 0.728 ± 0.128
4.761ThrIle: 4.761 ± 0.431
4.64ThrLys: 4.64 ± 0.442
5.822ThrLeu: 5.822 ± 0.393
1.183ThrMet: 1.183 ± 0.154
3.245ThrAsn: 3.245 ± 0.352
2.123ThrPro: 2.123 ± 0.397
2.001ThrGln: 2.001 ± 0.384
2.001ThrArg: 2.001 ± 0.264
4.609ThrSer: 4.609 ± 0.608
4.336ThrThr: 4.336 ± 0.529
5.61ThrVal: 5.61 ± 0.748
0.697ThrTrp: 0.697 ± 0.168
2.547ThrTyr: 2.547 ± 0.341
0.0ThrXaa: 0.0 ± 0.0
Val
3.882ValAla: 3.882 ± 0.377
0.485ValCys: 0.485 ± 0.165
4.033ValAsp: 4.033 ± 0.408
4.549ValGlu: 4.549 ± 0.362
2.82ValPhe: 2.82 ± 0.294
3.093ValGly: 3.093 ± 0.365
1.092ValHis: 1.092 ± 0.163
4.397ValIle: 4.397 ± 0.448
4.973ValLys: 4.973 ± 0.343
5.974ValLeu: 5.974 ± 0.368
1.274ValMet: 1.274 ± 0.204
3.154ValAsn: 3.154 ± 0.321
2.365ValPro: 2.365 ± 0.251
2.365ValGln: 2.365 ± 0.208
2.972ValArg: 2.972 ± 0.355
5.095ValSer: 5.095 ± 0.537
4.822ValThr: 4.822 ± 0.61
3.882ValVal: 3.882 ± 0.407
0.788ValTrp: 0.788 ± 0.155
2.79ValTyr: 2.79 ± 0.296
0.0ValXaa: 0.0 ± 0.0
Trp
0.576TrpAla: 0.576 ± 0.126
0.121TrpCys: 0.121 ± 0.062
0.637TrpAsp: 0.637 ± 0.127
0.819TrpGlu: 0.819 ± 0.173
0.607TrpPhe: 0.607 ± 0.142
0.607TrpGly: 0.607 ± 0.106
0.273TrpHis: 0.273 ± 0.092
0.697TrpIle: 0.697 ± 0.14
0.576TrpLys: 0.576 ± 0.149
0.788TrpLeu: 0.788 ± 0.188
0.182TrpMet: 0.182 ± 0.074
0.94TrpAsn: 0.94 ± 0.204
0.061TrpPro: 0.061 ± 0.039
0.607TrpGln: 0.607 ± 0.156
0.485TrpArg: 0.485 ± 0.126
0.697TrpSer: 0.697 ± 0.183
0.667TrpThr: 0.667 ± 0.158
0.667TrpVal: 0.667 ± 0.129
0.182TrpTrp: 0.182 ± 0.071
0.212TrpTyr: 0.212 ± 0.094
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.214TyrAla: 2.214 ± 0.215
0.516TyrCys: 0.516 ± 0.138
2.608TyrAsp: 2.608 ± 0.247
2.79TyrGlu: 2.79 ± 0.266
1.82TyrPhe: 1.82 ± 0.26
2.365TyrGly: 2.365 ± 0.236
0.91TyrHis: 0.91 ± 0.152
2.062TyrIle: 2.062 ± 0.275
2.547TyrLys: 2.547 ± 0.299
4.246TyrLeu: 4.246 ± 0.426
0.91TyrMet: 0.91 ± 0.168
1.85TyrAsn: 1.85 ± 0.215
1.334TyrPro: 1.334 ± 0.194
2.851TyrGln: 2.851 ± 0.322
2.547TyrArg: 2.547 ± 0.241
2.851TyrSer: 2.851 ± 0.293
2.244TyrThr: 2.244 ± 0.25
2.244TyrVal: 2.244 ± 0.288
0.485TyrTrp: 0.485 ± 0.112
1.729TyrTyr: 1.729 ± 0.304
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 111 proteins (32977 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski