Amino acid dipeptide frequency for Streptococcus phage phi-SsuZKB4_rum

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
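A minimal sketch of how such dipeptide frequencies could be counted is given here. It assumes overlapping residue pairs that never span two proteins, maps every non-standard letter to X (Xaa), and normalises by the total number of pairs; the page does not state these details, and the function name and toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping windows of length 2, counted within each protein only.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the phage proteome.
toy = ["MKLLEE", "MALLU"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The one-letter pairs used here correspond to the three-letter labels in the table, for example LL for LeuLeu.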

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
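The uniform expectation and the over/under comparison can be sketched as follows. This is only an illustration: the page does not say whether its colouring takes the error margin into account, so the plain comparison against 2.5‰ is an assumption, and `classify` is a hypothetical helper. The three example values are taken from the table.

```python
N_STANDARD = 20

# Uniform expectation in per mille: 1000 / 400 = 2.5 (that is, 0.25 %).
expected_permille = 1000.0 / N_STANDARD ** 2
# With Xaa as a 21st residue there are 441 pairs, about 2.27 per mille each.
expected_with_xaa = 1000.0 / (N_STANDARD + 1) ** 2


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"    # would be shown in red
    if observed_permille < expected:
        return "under"   # would be shown in blue
    return "as expected"


# Values copied from the table (per mille).
examples = {"LeuLeu": 9.135, "CysMet": 0.09, "AlaPhe": 2.524}
labels = {pair: classify(value) for pair, value in examples.items()}
print(labels)
```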

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
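As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the LeuLeu value is taken from the table.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def permille_to_percent(value):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value / 10.0


leu_leu = 9.135  # LeuLeu frequency from the table, in per mille
fraction = permille_to_fraction(leu_leu)
percent = permille_to_percent(leu_leu)
print(f"LeuLeu: {fraction:.6f} as a fraction, {percent:.4f} %")
```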

For more information, see the sequence space article on Wikipedia.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Ala | 3.185 ± 0.584 | 0.511 ± 0.126 | 3.906 ± 0.326 | 4.297 ± 0.365 | 2.524 ± 0.277 | 3.486 ± 0.405 | 0.811 ± 0.139 | 4.898 ± 0.388 | 4.748 ± 0.413 | 5.709 ± 0.484 | 1.472 ± 0.224 | 2.764 ± 0.323 | 1.292 ± 0.248 | 2.284 ± 0.383 | 2.554 ± 0.261 | 3.425 ± 0.457 | 3.035 ± 0.309 | 3.275 ± 0.441 | 0.601 ± 0.142 | 2.825 ± 0.333 | 0.0 ± 0.0 |
| Cys | 0.571 ± 0.143 | 0.21 ± 0.078 | 0.3 ± 0.101 | 0.481 ± 0.117 | 0.27 ± 0.091 | 0.691 ± 0.152 | 0.21 ± 0.075 | 0.571 ± 0.162 | 0.751 ± 0.185 | 0.931 ± 0.213 | 0.09 ± 0.052 | 0.3 ± 0.1 | 0.391 ± 0.109 | 0.631 ± 0.146 | 0.631 ± 0.147 | 0.421 ± 0.131 | 0.27 ± 0.081 | 0.691 ± 0.153 | 0.0 ± 0.0 | 0.331 ± 0.114 | 0.0 ± 0.0 |
| Asp | 2.314 ± 0.294 | 0.691 ± 0.153 | 3.365 ± 0.363 | 4.898 ± 0.414 | 3.756 ± 0.372 | 4.297 ± 0.391 | 0.992 ± 0.176 | 5.499 ± 0.455 | 4.627 ± 0.377 | 4.748 ± 0.391 | 1.983 ± 0.24 | 3.095 ± 0.322 | 1.352 ± 0.238 | 1.743 ± 0.212 | 2.825 ± 0.351 | 4.237 ± 0.431 | 2.855 ± 0.299 | 3.155 ± 0.326 | 0.571 ± 0.129 | 3.516 ± 0.32 | 0.0 ± 0.0 |
| Glu | 4.237 ± 0.423 | 0.571 ± 0.159 | 3.936 ± 0.341 | 6.49 ± 0.585 | 2.825 ± 0.371 | 3.966 ± 0.295 | 1.502 ± 0.256 | 5.258 ± 0.485 | 7.452 ± 0.515 | 8.624 ± 0.695 | 2.013 ± 0.282 | 4.808 ± 0.364 | 1.593 ± 0.229 | 4.087 ± 0.333 | 3.636 ± 0.39 | 3.365 ± 0.332 | 4.477 ± 0.491 | 4.718 ± 0.455 | 0.691 ± 0.139 | 2.284 ± 0.302 | 0.0 ± 0.0 |
| Phe | 2.103 ± 0.259 | 0.601 ± 0.129 | 3.305 ± 0.318 | 3.516 ± 0.349 | 1.983 ± 0.236 | 2.254 ± 0.242 | 0.931 ± 0.189 | 2.885 ± 0.353 | 2.915 ± 0.387 | 4.237 ± 0.484 | 0.992 ± 0.178 | 2.464 ± 0.315 | 0.751 ± 0.147 | 1.833 ± 0.257 | 1.923 ± 0.244 | 2.434 ± 0.262 | 2.043 ± 0.27 | 2.704 ± 0.357 | 0.481 ± 0.117 | 1.803 ± 0.249 | 0.0 ± 0.0 |
| Gly | 2.915 ± 0.347 | 0.571 ± 0.137 | 3.305 ± 0.345 | 3.335 ± 0.326 | 2.644 ± 0.275 | 3.395 ± 0.382 | 1.202 ± 0.199 | 5.439 ± 0.5 | 4.507 ± 0.388 | 5.319 ± 0.532 | 1.472 ± 0.252 | 3.125 ± 0.29 | 0.391 ± 0.117 | 2.825 ± 0.353 | 2.674 ± 0.304 | 3.636 ± 0.375 | 3.816 ± 0.395 | 3.486 ± 0.339 | 0.361 ± 0.093 | 2.975 ± 0.275 | 0.0 ± 0.0 |
| His | 0.721 ± 0.146 | 0.24 ± 0.094 | 1.202 ± 0.219 | 1.232 ± 0.202 | 1.202 ± 0.193 | 1.232 ± 0.172 | 0.481 ± 0.104 | 1.502 ± 0.193 | 0.871 ± 0.141 | 2.374 ± 0.268 | 0.331 ± 0.118 | 0.931 ± 0.151 | 0.811 ± 0.179 | 1.022 ± 0.215 | 0.721 ± 0.167 | 0.751 ± 0.125 | 0.901 ± 0.2 | 1.082 ± 0.169 | 0.15 ± 0.069 | 0.931 ± 0.154 | 0.0 ± 0.0 |
| Ile | 4.477 ± 0.278 | 0.811 ± 0.2 | 4.928 ± 0.313 | 5.168 ± 0.443 | 2.434 ± 0.381 | 4.447 ± 0.406 | 1.202 ± 0.25 | 4.447 ± 0.489 | 4.838 ± 0.504 | 6.731 ± 0.525 | 1.352 ± 0.223 | 3.456 ± 0.327 | 2.855 ± 0.28 | 3.125 ± 0.338 | 3.516 ± 0.428 | 6.581 ± 0.479 | 4.297 ± 0.483 | 4.417 ± 0.519 | 0.751 ± 0.166 | 2.825 ± 0.327 | 0.0 ± 0.0 |
| Lys | 5.108 ± 0.455 | 0.331 ± 0.101 | 4.718 ± 0.412 | 6.43 ± 0.418 | 2.734 ± 0.296 | 3.846 ± 0.338 | 1.562 ± 0.186 | 5.739 ± 0.46 | 4.988 ± 0.479 | 6.761 ± 0.502 | 1.803 ± 0.292 | 3.606 ± 0.281 | 1.923 ± 0.232 | 3.305 ± 0.353 | 4.267 ± 0.372 | 4.778 ± 0.355 | 3.936 ± 0.319 | 5.078 ± 0.383 | 0.871 ± 0.171 | 3.035 ± 0.311 | 0.0 ± 0.0 |
| Leu | 6.25 ± 0.483 | 1.022 ± 0.246 | 5.739 ± 0.322 | 8.053 ± 0.565 | 4.207 ± 0.44 | 5.108 ± 0.436 | 1.653 ± 0.209 | 6.55 ± 0.552 | 7.121 ± 0.578 | 9.135 ± 0.779 | 1.803 ± 0.187 | 4.778 ± 0.469 | 3.275 ± 0.339 | 3.906 ± 0.316 | 3.185 ± 0.329 | 8.594 ± 0.559 | 5.98 ± 0.431 | 5.919 ± 0.434 | 0.421 ± 0.108 | 4.056 ± 0.4 | 0.0 ± 0.0 |
| Met | 1.232 ± 0.193 | 0.09 ± 0.052 | 1.382 ± 0.237 | 1.833 ± 0.229 | 0.631 ± 0.147 | 1.442 ± 0.221 | 0.15 ± 0.066 | 1.502 ± 0.234 | 1.953 ± 0.231 | 1.803 ± 0.235 | 0.781 ± 0.161 | 1.082 ± 0.167 | 0.691 ± 0.15 | 0.631 ± 0.152 | 1.292 ± 0.174 | 1.923 ± 0.276 | 1.773 ± 0.197 | 1.262 ± 0.205 | 0.18 ± 0.069 | 0.541 ± 0.12 | 0.0 ± 0.0 |
| Asn | 3.185 ± 0.385 | 0.361 ± 0.106 | 2.614 ± 0.231 | 2.855 ± 0.303 | 2.434 ± 0.266 | 3.846 ± 0.454 | 1.352 ± 0.19 | 3.305 ± 0.363 | 3.696 ± 0.396 | 5.288 ± 0.482 | 0.871 ± 0.155 | 2.644 ± 0.371 | 1.863 ± 0.252 | 2.945 ± 0.343 | 2.855 ± 0.295 | 3.456 ± 0.362 | 2.374 ± 0.34 | 2.644 ± 0.32 | 0.781 ± 0.181 | 1.923 ± 0.254 | 0.0 ± 0.0 |
| Pro | 1.532 ± 0.186 | 0.24 ± 0.091 | 1.923 ± 0.311 | 2.584 ± 0.33 | 1.142 ± 0.168 | 0.691 ± 0.162 | 0.541 ± 0.113 | 1.863 ± 0.24 | 2.644 ± 0.387 | 2.554 ± 0.282 | 0.571 ± 0.119 | 1.502 ± 0.23 | 0.871 ± 0.18 | 1.022 ± 0.226 | 1.322 ± 0.179 | 1.893 ± 0.282 | 1.893 ± 0.21 | 2.163 ± 0.299 | 0.3 ± 0.098 | 1.382 ± 0.193 | 0.0 ± 0.0 |
| Gln | 3.456 ± 0.478 | 0.27 ± 0.104 | 2.194 ± 0.26 | 3.425 ± 0.306 | 1.593 ± 0.256 | 2.043 ± 0.284 | 0.541 ± 0.12 | 2.915 ± 0.332 | 3.365 ± 0.315 | 4.868 ± 0.448 | 1.352 ± 0.223 | 2.404 ± 0.292 | 1.412 ± 0.255 | 1.653 ± 0.229 | 2.073 ± 0.266 | 2.794 ± 0.305 | 2.764 ± 0.383 | 3.395 ± 0.48 | 0.601 ± 0.167 | 1.382 ± 0.231 | 0.0 ± 0.0 |
| Arg | 2.554 ± 0.29 | 0.361 ± 0.109 | 2.734 ± 0.328 | 3.636 ± 0.367 | 1.863 ± 0.25 | 2.254 ± 0.22 | 0.931 ± 0.173 | 3.876 ± 0.41 | 3.966 ± 0.43 | 4.387 ± 0.378 | 0.931 ± 0.218 | 2.314 ± 0.261 | 1.112 ± 0.222 | 2.494 ± 0.235 | 2.133 ± 0.372 | 2.584 ± 0.251 | 2.975 ± 0.318 | 2.524 ± 0.283 | 0.601 ± 0.132 | 2.194 ± 0.228 | 0.0 ± 0.0 |
| Ser | 3.546 ± 0.366 | 0.421 ± 0.123 | 4.627 ± 0.448 | 5.349 ± 0.5 | 3.185 ± 0.306 | 4.357 ± 0.354 | 1.502 ± 0.204 | 4.657 ± 0.389 | 4.688 ± 0.361 | 6.52 ± 0.44 | 1.202 ± 0.17 | 3.395 ± 0.367 | 2.224 ± 0.215 | 3.305 ± 0.403 | 3.035 ± 0.397 | 4.718 ± 0.496 | 3.756 ± 0.396 | 3.966 ± 0.374 | 0.751 ± 0.153 | 3.035 ± 0.354 | 0.0 ± 0.0 |
| Thr | 3.425 ± 0.385 | 0.21 ± 0.094 | 3.155 ± 0.396 | 4.237 ± 0.381 | 2.073 ± 0.32 | 4.237 ± 0.484 | 0.751 ± 0.133 | 4.327 ± 0.401 | 4.207 ± 0.343 | 5.499 ± 0.384 | 1.022 ± 0.15 | 2.764 ± 0.364 | 2.494 ± 0.396 | 2.073 ± 0.343 | 1.893 ± 0.29 | 4.147 ± 0.556 | 3.846 ± 0.475 | 5.138 ± 0.685 | 0.661 ± 0.163 | 2.734 ± 0.406 | 0.0 ± 0.0 |
| Val | 3.816 ± 0.363 | 0.571 ± 0.161 | 3.726 ± 0.391 | 4.718 ± 0.417 | 2.163 ± 0.251 | 3.095 ± 0.363 | 1.112 ± 0.176 | 4.357 ± 0.357 | 4.177 ± 0.349 | 6.16 ± 0.475 | 1.052 ± 0.181 | 2.945 ± 0.267 | 2.103 ± 0.23 | 2.284 ± 0.278 | 2.825 ± 0.319 | 4.988 ± 0.495 | 4.718 ± 0.762 | 3.576 ± 0.406 | 0.871 ± 0.163 | 2.494 ± 0.289 | 0.0 ± 0.0 |
| Trp | 0.601 ± 0.161 | 0.12 ± 0.055 | 0.601 ± 0.132 | 0.992 ± 0.175 | 0.631 ± 0.168 | 0.541 ± 0.114 | 0.21 ± 0.083 | 0.661 ± 0.147 | 0.481 ± 0.12 | 0.871 ± 0.187 | 0.21 ± 0.078 | 0.931 ± 0.203 | 0.06 ± 0.041 | 0.541 ± 0.126 | 0.511 ± 0.139 | 0.511 ± 0.152 | 0.691 ± 0.18 | 0.451 ± 0.111 | 0.18 ± 0.073 | 0.3 ± 0.089 | 0.0 ± 0.0 |
| Tyr | 2.374 ± 0.214 | 0.541 ± 0.129 | 2.855 ± 0.318 | 3.005 ± 0.266 | 1.863 ± 0.284 | 2.194 ± 0.278 | 1.052 ± 0.176 | 2.404 ± 0.307 | 2.825 ± 0.316 | 4.237 ± 0.382 | 0.841 ± 0.171 | 1.983 ± 0.259 | 1.172 ± 0.194 | 2.794 ± 0.288 | 2.554 ± 0.276 | 3.005 ± 0.295 | 2.494 ± 0.298 | 2.103 ± 0.257 | 0.27 ± 0.089 | 1.562 ± 0.255 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 113 proteins (33281 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
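One way a protein-level bootstrap of this kind could look is sketched below. The page only states that bootstrapping was done 100 times at the protein level; resampling whole proteins with replacement and reporting the sample standard deviation of the resampled frequencies are assumptions, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs, counted within each protein only.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over resampled proteomes."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome, not taken from the phage data.
toy = ["MKLLEE", "MALLK", "MEELLK", "MKKAL"]
err = bootstrap_error(toy, "LL")
print(f"LL: {pair_permille(toy, 'LL'):.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps the adjacency of residues within each protein intact, which is what a dipeptide count depends on.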

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski