Amino acid dipepetide frequency for Mycobacterium phage PhrostyMug

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.96AlaAla: 11.96 ± 1.127
0.897AlaCys: 0.897 ± 0.234
6.339AlaAsp: 6.339 ± 0.57
5.98AlaGlu: 5.98 ± 0.716
2.631AlaPhe: 2.631 ± 0.454
7.535AlaGly: 7.535 ± 0.734
1.375AlaHis: 1.375 ± 0.29
3.947AlaIle: 3.947 ± 0.567
4.246AlaLys: 4.246 ± 0.542
8.73AlaLeu: 8.73 ± 0.944
2.272AlaMet: 2.272 ± 0.393
2.213AlaAsn: 2.213 ± 0.4
4.903AlaPro: 4.903 ± 0.678
2.99AlaGln: 2.99 ± 0.483
6.518AlaArg: 6.518 ± 0.572
4.844AlaSer: 4.844 ± 0.491
5.8AlaThr: 5.8 ± 0.702
8.551AlaVal: 8.551 ± 0.849
1.615AlaTrp: 1.615 ± 0.344
2.631AlaTyr: 2.631 ± 0.319
0.0AlaXaa: 0.0 ± 0.0
Cys
0.897CysAla: 0.897 ± 0.23
0.179CysCys: 0.179 ± 0.118
0.478CysAsp: 0.478 ± 0.17
0.897CysGlu: 0.897 ± 0.248
0.179CysPhe: 0.179 ± 0.105
0.658CysGly: 0.658 ± 0.216
0.299CysHis: 0.299 ± 0.132
0.299CysIle: 0.299 ± 0.123
0.359CysLys: 0.359 ± 0.144
0.718CysLeu: 0.718 ± 0.294
0.179CysMet: 0.179 ± 0.111
0.239CysAsn: 0.239 ± 0.123
0.299CysPro: 0.299 ± 0.129
0.239CysGln: 0.239 ± 0.122
0.658CysArg: 0.658 ± 0.244
0.299CysSer: 0.299 ± 0.133
0.299CysThr: 0.299 ± 0.139
0.299CysVal: 0.299 ± 0.112
0.179CysTrp: 0.179 ± 0.105
0.179CysTyr: 0.179 ± 0.096
0.0CysXaa: 0.0 ± 0.0
Asp
6.279AspAla: 6.279 ± 0.718
0.658AspCys: 0.658 ± 0.191
4.425AspAsp: 4.425 ± 0.495
3.588AspGlu: 3.588 ± 0.407
2.452AspPhe: 2.452 ± 0.34
6.04AspGly: 6.04 ± 0.65
1.017AspHis: 1.017 ± 0.237
2.452AspIle: 2.452 ± 0.356
2.392AspLys: 2.392 ± 0.43
6.339AspLeu: 6.339 ± 0.73
1.375AspMet: 1.375 ± 0.258
1.794AspAsn: 1.794 ± 0.284
5.322AspPro: 5.322 ± 0.566
1.674AspGln: 1.674 ± 0.363
4.066AspArg: 4.066 ± 0.443
3.169AspSer: 3.169 ± 0.463
3.887AspThr: 3.887 ± 0.424
4.425AspVal: 4.425 ± 0.544
1.555AspTrp: 1.555 ± 0.306
2.213AspTyr: 2.213 ± 0.412
0.0AspXaa: 0.0 ± 0.0
Glu
5.561GluAla: 5.561 ± 0.676
0.299GluCys: 0.299 ± 0.158
4.844GluAsp: 4.844 ± 0.543
4.724GluGlu: 4.724 ± 0.589
2.093GluPhe: 2.093 ± 0.327
4.305GluGly: 4.305 ± 0.472
1.435GluHis: 1.435 ± 0.321
3.707GluIle: 3.707 ± 0.467
2.811GluLys: 2.811 ± 0.487
6.458GluLeu: 6.458 ± 0.554
1.615GluMet: 1.615 ± 0.296
1.375GluAsn: 1.375 ± 0.363
2.87GluPro: 2.87 ± 0.373
2.93GluGln: 2.93 ± 0.44
3.947GluArg: 3.947 ± 0.546
3.408GluSer: 3.408 ± 0.376
3.707GluThr: 3.707 ± 0.508
5.561GluVal: 5.561 ± 0.629
1.555GluTrp: 1.555 ± 0.315
2.452GluTyr: 2.452 ± 0.461
0.0GluXaa: 0.0 ± 0.0
Phe
2.332PheAla: 2.332 ± 0.311
0.299PheCys: 0.299 ± 0.146
2.691PheAsp: 2.691 ± 0.314
2.332PheGlu: 2.332 ± 0.332
0.658PhePhe: 0.658 ± 0.175
3.349PheGly: 3.349 ± 0.476
0.658PheHis: 0.658 ± 0.245
1.196PheIle: 1.196 ± 0.232
1.196PheLys: 1.196 ± 0.256
2.213PheLeu: 2.213 ± 0.435
0.538PheMet: 0.538 ± 0.204
1.256PheAsn: 1.256 ± 0.281
1.674PhePro: 1.674 ± 0.321
0.957PheGln: 0.957 ± 0.221
1.734PheArg: 1.734 ± 0.336
1.734PheSer: 1.734 ± 0.281
2.631PheThr: 2.631 ± 0.397
1.973PheVal: 1.973 ± 0.35
0.478PheTrp: 0.478 ± 0.139
0.957PheTyr: 0.957 ± 0.222
0.0PheXaa: 0.0 ± 0.0
Gly
7.056GlyAla: 7.056 ± 0.852
0.538GlyCys: 0.538 ± 0.16
5.741GlyAsp: 5.741 ± 0.537
4.545GlyGlu: 4.545 ± 0.462
2.93GlyPhe: 2.93 ± 0.506
8.312GlyGly: 8.312 ± 1.217
1.794GlyHis: 1.794 ± 0.335
4.545GlyIle: 4.545 ± 0.687
3.707GlyLys: 3.707 ± 0.517
7.594GlyLeu: 7.594 ± 0.611
2.272GlyMet: 2.272 ± 0.361
2.99GlyAsn: 2.99 ± 0.428
3.947GlyPro: 3.947 ± 0.564
2.452GlyGln: 2.452 ± 0.293
5.262GlyArg: 5.262 ± 0.607
5.501GlySer: 5.501 ± 0.585
5.083GlyThr: 5.083 ± 0.675
4.844GlyVal: 4.844 ± 0.52
2.87GlyTrp: 2.87 ± 0.409
2.631GlyTyr: 2.631 ± 0.395
0.0GlyXaa: 0.0 ± 0.0
His
1.495HisAla: 1.495 ± 0.293
0.12HisCys: 0.12 ± 0.128
1.017HisAsp: 1.017 ± 0.194
1.495HisGlu: 1.495 ± 0.35
0.658HisPhe: 0.658 ± 0.185
1.435HisGly: 1.435 ± 0.365
0.777HisHis: 0.777 ± 0.263
0.957HisIle: 0.957 ± 0.214
1.017HisLys: 1.017 ± 0.269
1.674HisLeu: 1.674 ± 0.372
0.06HisMet: 0.06 ± 0.058
0.239HisAsn: 0.239 ± 0.104
1.435HisPro: 1.435 ± 0.344
0.777HisGln: 0.777 ± 0.192
1.854HisArg: 1.854 ± 0.309
0.538HisSer: 0.538 ± 0.157
1.017HisThr: 1.017 ± 0.218
1.375HisVal: 1.375 ± 0.281
0.478HisTrp: 0.478 ± 0.147
0.598HisTyr: 0.598 ± 0.192
0.0HisXaa: 0.0 ± 0.0
Ile
5.98IleAla: 5.98 ± 0.603
0.299IleCys: 0.299 ± 0.127
3.648IleAsp: 3.648 ± 0.43
3.827IleGlu: 3.827 ± 0.485
0.777IlePhe: 0.777 ± 0.234
4.305IleGly: 4.305 ± 0.502
0.897IleHis: 0.897 ± 0.201
1.555IleIle: 1.555 ± 0.303
1.615IleLys: 1.615 ± 0.288
3.229IleLeu: 3.229 ± 0.403
0.658IleMet: 0.658 ± 0.186
1.973IleAsn: 1.973 ± 0.334
2.751IlePro: 2.751 ± 0.4
1.316IleGln: 1.316 ± 0.303
3.408IleArg: 3.408 ± 0.466
3.349IleSer: 3.349 ± 0.398
3.468IleThr: 3.468 ± 0.38
3.109IleVal: 3.109 ± 0.472
0.837IleTrp: 0.837 ± 0.206
1.495IleTyr: 1.495 ± 0.267
0.0IleXaa: 0.0 ± 0.0
Lys
3.947LysAla: 3.947 ± 0.491
0.299LysCys: 0.299 ± 0.142
2.691LysAsp: 2.691 ± 0.366
2.213LysGlu: 2.213 ± 0.321
1.316LysPhe: 1.316 ± 0.247
2.811LysGly: 2.811 ± 0.381
1.136LysHis: 1.136 ± 0.327
2.272LysIle: 2.272 ± 0.371
1.973LysLys: 1.973 ± 0.356
3.349LysLeu: 3.349 ± 0.407
1.196LysMet: 1.196 ± 0.24
1.435LysAsn: 1.435 ± 0.296
2.93LysPro: 2.93 ± 0.554
1.615LysGln: 1.615 ± 0.328
2.571LysArg: 2.571 ± 0.489
2.512LysSer: 2.512 ± 0.401
2.691LysThr: 2.691 ± 0.425
3.109LysVal: 3.109 ± 0.427
0.777LysTrp: 0.777 ± 0.214
1.017LysTyr: 1.017 ± 0.246
0.0LysXaa: 0.0 ± 0.0
Leu
9.269LeuAla: 9.269 ± 0.828
0.359LeuCys: 0.359 ± 0.124
6.219LeuAsp: 6.219 ± 0.546
5.741LeuGlu: 5.741 ± 0.583
2.153LeuPhe: 2.153 ± 0.378
6.817LeuGly: 6.817 ± 0.622
1.375LeuHis: 1.375 ± 0.312
4.305LeuIle: 4.305 ± 0.485
4.305LeuLys: 4.305 ± 0.472
5.8LeuLeu: 5.8 ± 0.555
1.615LeuMet: 1.615 ± 0.255
2.691LeuAsn: 2.691 ± 0.356
5.382LeuPro: 5.382 ± 0.598
2.811LeuGln: 2.811 ± 0.51
6.159LeuArg: 6.159 ± 0.567
5.621LeuSer: 5.621 ± 0.579
6.219LeuThr: 6.219 ± 0.505
4.784LeuVal: 4.784 ± 0.577
0.957LeuTrp: 0.957 ± 0.265
2.392LeuTyr: 2.392 ± 0.429
0.0LeuXaa: 0.0 ± 0.0
Met
2.452MetAla: 2.452 ± 0.358
0.06MetCys: 0.06 ± 0.052
0.957MetAsp: 0.957 ± 0.208
1.256MetGlu: 1.256 ± 0.282
0.598MetPhe: 0.598 ± 0.149
1.196MetGly: 1.196 ± 0.206
0.239MetHis: 0.239 ± 0.127
0.718MetIle: 0.718 ± 0.206
1.196MetLys: 1.196 ± 0.259
1.136MetLeu: 1.136 ± 0.271
0.239MetMet: 0.239 ± 0.133
1.136MetAsn: 1.136 ± 0.228
1.196MetPro: 1.196 ± 0.242
0.658MetGln: 0.658 ± 0.205
1.316MetArg: 1.316 ± 0.297
2.751MetSer: 2.751 ± 0.407
1.914MetThr: 1.914 ± 0.274
1.196MetVal: 1.196 ± 0.297
0.299MetTrp: 0.299 ± 0.11
0.359MetTyr: 0.359 ± 0.125
0.0MetXaa: 0.0 ± 0.0
Asn
2.93AsnAla: 2.93 ± 0.494
0.0AsnCys: 0.0 ± 0.0
1.734AsnAsp: 1.734 ± 0.367
1.854AsnGlu: 1.854 ± 0.297
0.957AsnPhe: 0.957 ± 0.223
3.468AsnGly: 3.468 ± 0.509
0.777AsnHis: 0.777 ± 0.206
1.674AsnIle: 1.674 ± 0.347
0.777AsnLys: 0.777 ± 0.199
2.631AsnLeu: 2.631 ± 0.367
0.478AsnMet: 0.478 ± 0.127
0.777AsnAsn: 0.777 ± 0.195
2.811AsnPro: 2.811 ± 0.357
1.017AsnGln: 1.017 ± 0.248
1.435AsnArg: 1.435 ± 0.264
1.794AsnSer: 1.794 ± 0.393
1.914AsnThr: 1.914 ± 0.294
2.571AsnVal: 2.571 ± 0.464
0.777AsnTrp: 0.777 ± 0.194
1.136AsnTyr: 1.136 ± 0.283
0.0AsnXaa: 0.0 ± 0.0
Pro
5.322ProAla: 5.322 ± 0.616
0.478ProCys: 0.478 ± 0.162
4.186ProAsp: 4.186 ± 0.469
3.947ProGlu: 3.947 ± 0.52
2.093ProPhe: 2.093 ± 0.332
5.023ProGly: 5.023 ± 0.569
0.837ProHis: 0.837 ± 0.224
2.392ProIle: 2.392 ± 0.37
2.332ProLys: 2.332 ± 0.326
4.545ProLeu: 4.545 ± 0.505
1.375ProMet: 1.375 ± 0.282
1.615ProAsn: 1.615 ± 0.306
3.588ProPro: 3.588 ± 0.49
1.495ProGln: 1.495 ± 0.299
3.349ProArg: 3.349 ± 0.707
4.066ProSer: 4.066 ± 0.487
4.246ProThr: 4.246 ± 0.503
3.947ProVal: 3.947 ± 0.471
0.957ProTrp: 0.957 ± 0.262
1.555ProTyr: 1.555 ± 0.307
0.0ProXaa: 0.0 ± 0.0
Gln
2.811GlnAla: 2.811 ± 0.417
0.06GlnCys: 0.06 ± 0.069
1.196GlnAsp: 1.196 ± 0.32
1.794GlnGlu: 1.794 ± 0.331
1.076GlnPhe: 1.076 ± 0.24
2.99GlnGly: 2.99 ± 0.38
0.658GlnHis: 0.658 ± 0.192
2.99GlnIle: 2.99 ± 0.513
1.076GlnLys: 1.076 ± 0.271
3.648GlnLeu: 3.648 ± 0.504
0.897GlnMet: 0.897 ± 0.229
0.478GlnAsn: 0.478 ± 0.142
1.794GlnPro: 1.794 ± 0.332
1.674GlnGln: 1.674 ± 0.329
2.033GlnArg: 2.033 ± 0.36
1.794GlnSer: 1.794 ± 0.344
1.734GlnThr: 1.734 ± 0.291
2.392GlnVal: 2.392 ± 0.325
0.777GlnTrp: 0.777 ± 0.218
0.718GlnTyr: 0.718 ± 0.178
0.0GlnXaa: 0.0 ± 0.0
Arg
5.382ArgAla: 5.382 ± 0.712
1.196ArgCys: 1.196 ± 0.298
3.468ArgAsp: 3.468 ± 0.424
4.963ArgGlu: 4.963 ± 0.586
2.213ArgPhe: 2.213 ± 0.422
4.784ArgGly: 4.784 ± 0.579
0.897ArgHis: 0.897 ± 0.247
3.588ArgIle: 3.588 ± 0.465
3.528ArgLys: 3.528 ± 0.528
6.159ArgLeu: 6.159 ± 0.705
1.854ArgMet: 1.854 ± 0.326
2.631ArgAsn: 2.631 ± 0.472
2.691ArgPro: 2.691 ± 0.412
2.033ArgGln: 2.033 ± 0.291
6.578ArgArg: 6.578 ± 0.906
4.365ArgSer: 4.365 ± 0.661
3.229ArgThr: 3.229 ± 0.555
5.262ArgVal: 5.262 ± 0.495
1.256ArgTrp: 1.256 ± 0.302
1.674ArgTyr: 1.674 ± 0.302
0.0ArgXaa: 0.0 ± 0.0
Ser
5.8SerAla: 5.8 ± 0.726
0.478SerCys: 0.478 ± 0.181
3.229SerAsp: 3.229 ± 0.438
4.186SerGlu: 4.186 ± 0.471
1.854SerPhe: 1.854 ± 0.345
5.92SerGly: 5.92 ± 0.538
1.495SerHis: 1.495 ± 0.291
2.751SerIle: 2.751 ± 0.433
2.213SerLys: 2.213 ± 0.327
4.844SerLeu: 4.844 ± 0.568
1.375SerMet: 1.375 ± 0.237
2.452SerAsn: 2.452 ± 0.406
3.349SerPro: 3.349 ± 0.513
1.973SerGln: 1.973 ± 0.319
3.109SerArg: 3.109 ± 0.451
3.229SerSer: 3.229 ± 0.535
3.408SerThr: 3.408 ± 0.438
4.425SerVal: 4.425 ± 0.441
1.375SerTrp: 1.375 ± 0.292
1.435SerTyr: 1.435 ± 0.342
0.0SerXaa: 0.0 ± 0.0
Thr
5.8ThrAla: 5.8 ± 0.655
0.538ThrCys: 0.538 ± 0.209
4.425ThrAsp: 4.425 ± 0.588
4.186ThrGlu: 4.186 ± 0.471
2.392ThrPhe: 2.392 ± 0.389
6.398ThrGly: 6.398 ± 0.586
0.957ThrHis: 0.957 ± 0.249
2.87ThrIle: 2.87 ± 0.542
2.87ThrLys: 2.87 ± 0.345
6.219ThrLeu: 6.219 ± 0.62
0.837ThrMet: 0.837 ± 0.208
1.914ThrAsn: 1.914 ± 0.367
3.947ThrPro: 3.947 ± 0.458
1.674ThrGln: 1.674 ± 0.291
3.887ThrArg: 3.887 ± 0.506
3.707ThrSer: 3.707 ± 0.58
4.485ThrThr: 4.485 ± 0.552
5.322ThrVal: 5.322 ± 0.6
1.076ThrTrp: 1.076 ± 0.252
1.794ThrTyr: 1.794 ± 0.334
0.0ThrXaa: 0.0 ± 0.0
Val
6.757ValAla: 6.757 ± 0.684
0.658ValCys: 0.658 ± 0.21
5.382ValAsp: 5.382 ± 0.541
4.784ValGlu: 4.784 ± 0.447
2.213ValPhe: 2.213 ± 0.331
5.143ValGly: 5.143 ± 0.769
1.316ValHis: 1.316 ± 0.229
3.707ValIle: 3.707 ± 0.481
2.631ValLys: 2.631 ± 0.34
5.262ValLeu: 5.262 ± 0.565
1.076ValMet: 1.076 ± 0.262
2.512ValAsn: 2.512 ± 0.35
4.186ValPro: 4.186 ± 0.418
2.272ValGln: 2.272 ± 0.36
5.621ValArg: 5.621 ± 0.706
3.887ValSer: 3.887 ± 0.445
5.86ValThr: 5.86 ± 0.562
5.86ValVal: 5.86 ± 0.698
1.375ValTrp: 1.375 ± 0.344
2.332ValTyr: 2.332 ± 0.379
0.0ValXaa: 0.0 ± 0.0
Trp
1.615TrpAla: 1.615 ± 0.32
0.299TrpCys: 0.299 ± 0.123
1.316TrpAsp: 1.316 ± 0.281
1.017TrpGlu: 1.017 ± 0.207
0.957TrpPhe: 0.957 ± 0.212
1.794TrpGly: 1.794 ± 0.329
0.478TrpHis: 0.478 ± 0.188
1.017TrpIle: 1.017 ± 0.204
0.359TrpLys: 0.359 ± 0.163
1.854TrpLeu: 1.854 ± 0.32
0.359TrpMet: 0.359 ± 0.145
0.598TrpAsn: 0.598 ± 0.165
0.957TrpPro: 0.957 ± 0.247
0.957TrpGln: 0.957 ± 0.249
1.316TrpArg: 1.316 ± 0.291
0.957TrpSer: 0.957 ± 0.202
1.734TrpThr: 1.734 ± 0.361
1.734TrpVal: 1.734 ± 0.326
0.538TrpTrp: 0.538 ± 0.187
0.239TrpTyr: 0.239 ± 0.106
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.392TyrAla: 2.392 ± 0.395
0.12TyrCys: 0.12 ± 0.086
1.136TyrAsp: 1.136 ± 0.251
2.332TyrGlu: 2.332 ± 0.303
0.658TyrPhe: 0.658 ± 0.185
2.332TyrGly: 2.332 ± 0.354
0.598TyrHis: 0.598 ± 0.194
1.555TyrIle: 1.555 ± 0.318
1.256TyrLys: 1.256 ± 0.263
2.571TyrLeu: 2.571 ± 0.344
0.538TyrMet: 0.538 ± 0.179
1.196TyrAsn: 1.196 ± 0.299
1.316TyrPro: 1.316 ± 0.284
1.076TyrGln: 1.076 ± 0.202
2.93TyrArg: 2.93 ± 0.424
1.375TyrSer: 1.375 ± 0.271
1.973TyrThr: 1.973 ± 0.385
2.093TyrVal: 2.093 ± 0.343
0.359TyrTrp: 0.359 ± 0.145
0.598TyrTyr: 0.598 ± 0.197
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 97 proteins (16724 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski