Amino acid dipeptide frequency for Mycobacterium phage Pomar16

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
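As a rough illustration of how such a frequency could be computed, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_permille`, the use of one-letter codes, and the choice to normalize by the total number of pairs are assumptions made for this example; the source does not describe the exact procedure used to build the table.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs overlap (positions 0-1, 1-2, ...) and never span
    two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequences (hypothetical, not taken from the Pomar16 proteome).
freqs = dipeptide_permille(["MAAK", "AAG"])
```

Here the two toy sequences yield five pairs in total (MA, AA, AK, AA, AG), so "AA" accounts for two of the five.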

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
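The expectation described above can be sketched in a few lines: with 20 amino acids there are 20 × 20 = 400 ordered pairs, so a uniform distribution gives 1000/400 = 2.5‰ per dipeptide. The helper names `expected_permille` and `classify` are hypothetical, and how the page treats a value exactly equal to the expectation is not stated in the source, so the "expected" label below is an assumption.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation in per mille for one ordered pair."""
    return 1000.0 / alphabet_size ** 2

def classify(value_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation."""
    expected = expected_permille(alphabet_size)
    if value_permille > expected:
        return "over"      # would be marked red
    if value_permille < expected:
        return "under"     # would be marked blue
    return "expected"      # tie handling is an assumption

# Three values copied from the table (in per mille).
sample = {"AlaAla": 10.978, "CysCys": 0.122, "GluPro": 2.5}
labels = {name: classify(value) for name, value in sample.items()}
```

Passing `alphabet_size=21` gives the expectation for the 441-pair case that includes Xaa, roughly 2.27‰.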

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
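As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a plain fraction and into a percentage. The function names are illustrative only.

```python
def permille_to_fraction(value):
    """Per mille -> fraction (multiply by 10^-3)."""
    return value * 1e-3

def permille_to_percent(value):
    """Per mille -> percent (divide by 10)."""
    return value / 10.0

ala_ala = 10.978  # AlaAla value from the table, in per mille
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
```

The same scaling applies to the ± error reported next to each value.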

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.978 ± 1.081
AlaCys: 0.61 ± 0.201
AlaAsp: 6.343 ± 0.72
AlaGlu: 7.867 ± 0.877
AlaPhe: 4.452 ± 0.517
AlaGly: 7.623 ± 0.754
AlaHis: 1.403 ± 0.315
AlaIle: 4.574 ± 0.624
AlaLys: 4.391 ± 0.56
AlaLeu: 8.477 ± 0.893
AlaMet: 3.415 ± 0.403
AlaAsn: 3.476 ± 0.432
AlaPro: 4.879 ± 0.642
AlaGln: 3.964 ± 0.503
AlaArg: 5.855 ± 0.578
AlaSer: 3.903 ± 0.507
AlaThr: 4.696 ± 0.595
AlaVal: 7.44 ± 0.665
AlaTrp: 2.074 ± 0.412
AlaTyr: 2.744 ± 0.456
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.671 ± 0.185
CysCys: 0.122 ± 0.131
CysAsp: 0.61 ± 0.187
CysGlu: 0.427 ± 0.143
CysPhe: 0.305 ± 0.136
CysGly: 0.671 ± 0.211
CysHis: 0.183 ± 0.1
CysIle: 0.366 ± 0.174
CysLys: 0.488 ± 0.147
CysLeu: 0.793 ± 0.221
CysMet: 0.122 ± 0.097
CysAsn: 0.305 ± 0.104
CysPro: 0.61 ± 0.218
CysGln: 0.183 ± 0.112
CysArg: 0.732 ± 0.187
CysSer: 0.61 ± 0.201
CysThr: 0.488 ± 0.189
CysVal: 0.549 ± 0.161
CysTrp: 0.366 ± 0.171
CysTyr: 0.488 ± 0.164
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.16 ± 0.618
AspCys: 0.854 ± 0.249
AspAsp: 3.659 ± 0.483
AspGlu: 4.452 ± 0.732
AspPhe: 2.805 ± 0.403
AspGly: 6.465 ± 0.618
AspHis: 1.83 ± 0.393
AspIle: 3.171 ± 0.368
AspLys: 2.439 ± 0.376
AspLeu: 4.696 ± 0.639
AspMet: 1.281 ± 0.228
AspAsn: 2.013 ± 0.387
AspPro: 5.062 ± 0.566
AspGln: 1.952 ± 0.359
AspArg: 3.354 ± 0.463
AspSer: 2.744 ± 0.405
AspThr: 3.354 ± 0.424
AspVal: 4.33 ± 0.497
AspTrp: 1.281 ± 0.251
AspTyr: 2.378 ± 0.313
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.684 ± 0.833
GluCys: 0.183 ± 0.106
GluAsp: 4.879 ± 0.731
GluGlu: 5.184 ± 0.624
GluPhe: 2.988 ± 0.328
GluGly: 5.123 ± 0.634
GluHis: 1.525 ± 0.321
GluIle: 3.659 ± 0.515
GluLys: 2.378 ± 0.364
GluLeu: 6.952 ± 0.679
GluMet: 2.378 ± 0.376
GluAsn: 1.83 ± 0.336
GluPro: 2.5 ± 0.366
GluGln: 2.196 ± 0.3
GluArg: 4.574 ± 0.539
GluSer: 2.927 ± 0.487
GluThr: 3.72 ± 0.363
GluVal: 4.696 ± 0.61
GluTrp: 1.464 ± 0.27
GluTyr: 2.317 ± 0.33
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.988 ± 0.448
PheCys: 0.427 ± 0.147
PheAsp: 2.5 ± 0.457
PheGlu: 2.561 ± 0.381
PhePhe: 0.793 ± 0.2
PheGly: 3.232 ± 0.451
PheHis: 0.61 ± 0.197
PheIle: 1.342 ± 0.266
PheLys: 1.647 ± 0.333
PheLeu: 2.683 ± 0.42
PheMet: 0.549 ± 0.166
PheAsn: 1.098 ± 0.249
PhePro: 1.952 ± 0.354
PheGln: 1.159 ± 0.376
PheArg: 2.561 ± 0.374
PheSer: 2.561 ± 0.492
PheThr: 2.683 ± 0.328
PheVal: 2.439 ± 0.352
PheTrp: 0.61 ± 0.197
PheTyr: 0.915 ± 0.197
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.892 ± 0.886
GlyCys: 0.854 ± 0.229
GlyAsp: 5.794 ± 0.618
GlyGlu: 4.391 ± 0.593
GlyPhe: 4.025 ± 0.507
GlyGly: 8.416 ± 1.011
GlyHis: 2.561 ± 0.434
GlyIle: 4.696 ± 0.653
GlyLys: 3.964 ± 0.535
GlyLeu: 6.77 ± 0.672
GlyMet: 1.525 ± 0.264
GlyAsn: 2.744 ± 0.372
GlyPro: 3.049 ± 0.398
GlyGln: 3.842 ± 0.514
GlyArg: 4.269 ± 0.492
GlySer: 3.72 ± 0.533
GlyThr: 4.574 ± 0.678
GlyVal: 6.221 ± 0.661
GlyTrp: 1.586 ± 0.301
GlyTyr: 2.439 ± 0.384
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.525 ± 0.371
HisCys: 0.305 ± 0.118
HisAsp: 1.647 ± 0.39
HisGlu: 1.342 ± 0.314
HisPhe: 0.732 ± 0.174
HisGly: 2.074 ± 0.46
HisHis: 0.549 ± 0.184
HisIle: 1.464 ± 0.328
HisLys: 0.976 ± 0.26
HisLeu: 1.647 ± 0.28
HisMet: 0.366 ± 0.155
HisAsn: 0.488 ± 0.189
HisPro: 1.22 ± 0.202
HisGln: 0.915 ± 0.219
HisArg: 1.952 ± 0.418
HisSer: 0.976 ± 0.221
HisThr: 1.342 ± 0.267
HisVal: 1.525 ± 0.306
HisTrp: 0.305 ± 0.147
HisTyr: 0.61 ± 0.216
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.855 ± 0.511
IleCys: 0.427 ± 0.145
IleAsp: 3.537 ± 0.429
IleGlu: 4.269 ± 0.471
IlePhe: 1.22 ± 0.269
IleGly: 4.147 ± 0.594
IleHis: 1.159 ± 0.241
IleIle: 1.708 ± 0.322
IleLys: 2.439 ± 0.348
IleLeu: 4.269 ± 0.575
IleMet: 0.549 ± 0.191
IleAsn: 1.83 ± 0.304
IlePro: 3.537 ± 0.506
IleGln: 1.586 ± 0.279
IleArg: 3.415 ± 0.378
IleSer: 2.561 ± 0.418
IleThr: 3.171 ± 0.319
IleVal: 3.049 ± 0.411
IleTrp: 0.671 ± 0.179
IleTyr: 0.915 ± 0.213
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.001 ± 0.758
LysCys: 0.366 ± 0.167
LysAsp: 2.439 ± 0.294
LysGlu: 2.866 ± 0.438
LysPhe: 0.854 ± 0.245
LysGly: 3.232 ± 0.425
LysHis: 0.732 ± 0.218
LysIle: 2.135 ± 0.353
LysLys: 3.476 ± 0.584
LysLeu: 3.659 ± 0.456
LysMet: 1.22 ± 0.267
LysAsn: 2.074 ± 0.397
LysPro: 3.415 ± 0.573
LysGln: 1.647 ± 0.35
LysArg: 2.805 ± 0.435
LysSer: 1.891 ± 0.338
LysThr: 2.805 ± 0.483
LysVal: 3.598 ± 0.497
LysTrp: 0.854 ± 0.255
LysTyr: 1.281 ± 0.246
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.599 ± 0.689
LeuCys: 0.671 ± 0.181
LeuAsp: 4.757 ± 0.538
LeuGlu: 5.306 ± 0.65
LeuPhe: 2.378 ± 0.315
LeuGly: 7.135 ± 0.945
LeuHis: 2.561 ± 0.534
LeuIle: 4.452 ± 0.438
LeuLys: 3.232 ± 0.414
LeuLeu: 6.221 ± 0.618
LeuMet: 2.683 ± 0.407
LeuAsn: 2.257 ± 0.429
LeuPro: 4.269 ± 0.48
LeuGln: 2.622 ± 0.641
LeuArg: 5.489 ± 0.674
LeuSer: 5.306 ± 0.724
LeuThr: 4.818 ± 0.585
LeuVal: 4.879 ± 0.489
LeuTrp: 1.586 ± 0.235
LeuTyr: 2.317 ± 0.433
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.5 ± 0.4
MetCys: 0.244 ± 0.117
MetAsp: 0.915 ± 0.214
MetGlu: 0.915 ± 0.227
MetPhe: 0.366 ± 0.166
MetGly: 1.769 ± 0.399
MetHis: 0.488 ± 0.137
MetIle: 1.281 ± 0.264
MetLys: 1.891 ± 0.328
MetLeu: 1.647 ± 0.314
MetMet: 0.427 ± 0.151
MetAsn: 0.854 ± 0.243
MetPro: 1.22 ± 0.296
MetGln: 0.915 ± 0.205
MetArg: 1.403 ± 0.317
MetSer: 2.257 ± 0.355
MetThr: 2.257 ± 0.285
MetVal: 1.22 ± 0.29
MetTrp: 0.305 ± 0.137
MetTyr: 0.732 ± 0.241
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.659 ± 0.524
AsnCys: 0.366 ± 0.139
AsnAsp: 1.83 ± 0.345
AsnGlu: 1.708 ± 0.331
AsnPhe: 0.976 ± 0.282
AsnGly: 3.232 ± 0.459
AsnHis: 0.732 ± 0.223
AsnIle: 1.769 ± 0.325
AsnLys: 0.915 ± 0.211
AsnLeu: 3.232 ± 0.364
AsnMet: 0.793 ± 0.214
AsnAsn: 0.488 ± 0.157
AsnPro: 2.744 ± 0.452
AsnGln: 0.793 ± 0.246
AsnArg: 2.317 ± 0.407
AsnSer: 0.915 ± 0.236
AsnThr: 1.769 ± 0.389
AsnVal: 2.744 ± 0.41
AsnTrp: 0.549 ± 0.18
AsnTyr: 1.037 ± 0.243
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.001 ± 0.55
ProCys: 0.427 ± 0.16
ProAsp: 4.147 ± 0.42
ProGlu: 3.659 ± 0.569
ProPhe: 1.891 ± 0.391
ProGly: 4.574 ± 0.619
ProHis: 1.281 ± 0.272
ProIle: 2.683 ± 0.428
ProLys: 2.805 ± 0.575
ProLeu: 3.476 ± 0.433
ProMet: 1.22 ± 0.332
ProAsn: 2.317 ± 0.374
ProPro: 1.952 ± 0.421
ProGln: 1.281 ± 0.301
ProArg: 3.537 ± 0.51
ProSer: 2.5 ± 0.392
ProThr: 3.659 ± 0.454
ProVal: 4.025 ± 0.497
ProTrp: 1.464 ± 0.418
ProTyr: 1.525 ± 0.311
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.147 ± 0.623
GlnCys: 0.122 ± 0.073
GlnAsp: 1.159 ± 0.289
GlnGlu: 1.83 ± 0.307
GlnPhe: 1.159 ± 0.246
GlnGly: 2.378 ± 0.349
GlnHis: 0.732 ± 0.192
GlnIle: 3.049 ± 0.398
GlnLys: 1.83 ± 0.359
GlnLeu: 3.415 ± 0.631
GlnMet: 0.488 ± 0.17
GlnAsn: 1.037 ± 0.242
GlnPro: 1.586 ± 0.373
GlnGln: 1.586 ± 0.394
GlnArg: 2.866 ± 0.417
GlnSer: 1.403 ± 0.255
GlnThr: 2.013 ± 0.426
GlnVal: 2.988 ± 0.337
GlnTrp: 0.671 ± 0.204
GlnTyr: 1.159 ± 0.259
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.367 ± 0.724
ArgCys: 1.098 ± 0.287
ArgAsp: 4.879 ± 0.661
ArgGlu: 5.794 ± 0.709
ArgPhe: 2.5 ± 0.475
ArgGly: 3.476 ± 0.499
ArgHis: 1.281 ± 0.292
ArgIle: 3.171 ± 0.333
ArgLys: 3.232 ± 0.464
ArgLeu: 5.977 ± 0.632
ArgMet: 1.647 ± 0.332
ArgAsn: 2.135 ± 0.331
ArgPro: 3.11 ± 0.489
ArgGln: 1.647 ± 0.351
ArgArg: 5.794 ± 0.657
ArgSer: 3.354 ± 0.44
ArgThr: 2.561 ± 0.372
ArgVal: 4.757 ± 0.49
ArgTrp: 1.464 ± 0.329
ArgTyr: 2.561 ± 0.471
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.879 ± 0.585
SerCys: 0.366 ± 0.147
SerAsp: 3.232 ± 0.458
SerGlu: 3.049 ± 0.437
SerPhe: 2.013 ± 0.387
SerGly: 4.574 ± 0.508
SerHis: 0.671 ± 0.172
SerIle: 2.013 ± 0.444
SerLys: 2.257 ± 0.421
SerLeu: 3.598 ± 0.515
SerMet: 1.22 ± 0.314
SerAsn: 1.22 ± 0.227
SerPro: 2.744 ± 0.393
SerGln: 2.5 ± 0.34
SerArg: 3.903 ± 0.504
SerSer: 2.988 ± 0.549
SerThr: 3.049 ± 0.373
SerVal: 2.927 ± 0.336
SerTrp: 1.647 ± 0.288
SerTyr: 1.098 ± 0.268
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.733 ± 0.486
ThrCys: 0.488 ± 0.148
ThrAsp: 3.232 ± 0.49
ThrGlu: 3.232 ± 0.464
ThrPhe: 1.891 ± 0.334
ThrGly: 5.184 ± 0.566
ThrHis: 1.098 ± 0.291
ThrIle: 2.988 ± 0.53
ThrLys: 3.781 ± 0.53
ThrLeu: 4.025 ± 0.55
ThrMet: 1.647 ± 0.298
ThrAsn: 1.769 ± 0.362
ThrPro: 4.025 ± 0.588
ThrGln: 2.074 ± 0.334
ThrArg: 3.293 ± 0.447
ThrSer: 2.439 ± 0.444
ThrThr: 3.049 ± 0.506
ThrVal: 5.184 ± 0.559
ThrTrp: 0.915 ± 0.25
ThrTyr: 1.769 ± 0.288
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.831 ± 0.725
ValCys: 0.61 ± 0.175
ValAsp: 5.367 ± 0.567
ValGlu: 6.343 ± 0.575
ValPhe: 2.317 ± 0.388
ValGly: 5.245 ± 0.48
ValHis: 1.708 ± 0.326
ValIle: 3.11 ± 0.433
ValLys: 3.049 ± 0.372
ValLeu: 5.55 ± 0.589
ValMet: 0.854 ± 0.252
ValAsn: 2.744 ± 0.523
ValPro: 3.232 ± 0.39
ValGln: 2.439 ± 0.491
ValArg: 4.513 ± 0.485
ValSer: 3.842 ± 0.484
ValThr: 4.757 ± 0.5
ValVal: 6.038 ± 0.501
ValTrp: 1.464 ± 0.312
ValTyr: 1.891 ± 0.301
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.891 ± 0.378
TrpCys: 0.366 ± 0.16
TrpAsp: 1.098 ± 0.254
TrpGlu: 1.708 ± 0.277
TrpPhe: 0.732 ± 0.203
TrpGly: 1.342 ± 0.273
TrpHis: 0.549 ± 0.198
TrpIle: 1.098 ± 0.181
TrpLys: 0.549 ± 0.169
TrpLeu: 1.403 ± 0.24
TrpMet: 0.488 ± 0.18
TrpAsn: 0.854 ± 0.213
TrpPro: 0.915 ± 0.301
TrpGln: 1.281 ± 0.229
TrpArg: 0.915 ± 0.242
TrpSer: 1.342 ± 0.243
TrpThr: 1.525 ± 0.316
TrpVal: 1.098 ± 0.233
TrpTrp: 0.427 ± 0.16
TrpTyr: 0.671 ± 0.268
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.805 ± 0.37
TyrCys: 0.183 ± 0.112
TyrAsp: 2.257 ± 0.336
TyrGlu: 2.257 ± 0.407
TyrPhe: 0.915 ± 0.229
TyrGly: 2.439 ± 0.322
TyrHis: 0.305 ± 0.139
TyrIle: 1.586 ± 0.249
TyrLys: 0.61 ± 0.219
TyrLeu: 3.049 ± 0.421
TyrMet: 0.671 ± 0.215
TyrAsn: 1.037 ± 0.28
TyrPro: 1.403 ± 0.264
TyrGln: 1.037 ± 0.218
TyrArg: 2.257 ± 0.463
TyrSer: 1.769 ± 0.268
TyrThr: 1.525 ± 0.336
TyrVal: 2.196 ± 0.423
TyrTrp: 0.549 ± 0.207
TyrTyr: 0.732 ± 0.228
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (16398 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
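One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the ± value are all assumptions; the source states only that bootstrapping (×100) at the protein level was used.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the estimate over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    return statistics.stdev(estimates)

# Toy sequences (hypothetical, not taken from the Pomar16 proteome).
proteins = ["MAAKAA", "MGGAAL", "MKLVAA", "MAAGG"]
point = pair_permille(proteins, "AA")
error = bootstrap_error(proteins, "AA")
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, so the error reflects variation between proteins.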

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski