Amino acid dipepetide frequency for Mycobacterium phage DRBy19

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.347AlaAla: 14.347 ± 1.742
1.079AlaCys: 1.079 ± 0.311
7.228AlaAsp: 7.228 ± 0.556
7.012AlaGlu: 7.012 ± 0.574
3.074AlaPhe: 3.074 ± 0.443
9.439AlaGly: 9.439 ± 1.283
2.319AlaHis: 2.319 ± 0.395
4.531AlaIle: 4.531 ± 0.551
4.423AlaLys: 4.423 ± 0.509
7.821AlaLeu: 7.821 ± 0.82
2.104AlaMet: 2.104 ± 0.323
2.805AlaAsn: 2.805 ± 0.405
4.908AlaPro: 4.908 ± 0.526
3.344AlaGln: 3.344 ± 0.37
8.36AlaArg: 8.36 ± 0.748
5.232AlaSer: 5.232 ± 0.656
5.879AlaThr: 5.879 ± 0.53
6.257AlaVal: 6.257 ± 0.531
2.535AlaTrp: 2.535 ± 0.418
2.157AlaTyr: 2.157 ± 0.337
0.0AlaXaa: 0.0 ± 0.0
Cys
0.917CysAla: 0.917 ± 0.301
0.054CysCys: 0.054 ± 0.061
1.348CysAsp: 1.348 ± 0.321
0.539CysGlu: 0.539 ± 0.188
0.27CysPhe: 0.27 ± 0.114
1.348CysGly: 1.348 ± 0.317
0.27CysHis: 0.27 ± 0.145
0.216CysIle: 0.216 ± 0.131
0.539CysLys: 0.539 ± 0.17
0.917CysLeu: 0.917 ± 0.28
0.108CysMet: 0.108 ± 0.07
0.431CysAsn: 0.431 ± 0.15
1.079CysPro: 1.079 ± 0.294
0.485CysGln: 0.485 ± 0.156
0.647CysArg: 0.647 ± 0.215
0.701CysSer: 0.701 ± 0.224
0.647CysThr: 0.647 ± 0.213
0.809CysVal: 0.809 ± 0.189
0.216CysTrp: 0.216 ± 0.105
0.216CysTyr: 0.216 ± 0.102
0.0CysXaa: 0.0 ± 0.0
Asp
7.012AspAla: 7.012 ± 0.684
0.863AspCys: 0.863 ± 0.203
4.854AspAsp: 4.854 ± 0.555
3.668AspGlu: 3.668 ± 0.534
1.78AspPhe: 1.78 ± 0.271
7.174AspGly: 7.174 ± 0.646
1.672AspHis: 1.672 ± 0.327
2.427AspIle: 2.427 ± 0.378
2.05AspLys: 2.05 ± 0.322
5.663AspLeu: 5.663 ± 0.562
1.187AspMet: 1.187 ± 0.242
1.834AspAsn: 1.834 ± 0.426
4.693AspPro: 4.693 ± 0.615
2.643AspGln: 2.643 ± 0.339
5.502AspArg: 5.502 ± 0.657
3.506AspSer: 3.506 ± 0.416
4.261AspThr: 4.261 ± 0.484
4.045AspVal: 4.045 ± 0.482
1.618AspTrp: 1.618 ± 0.312
1.672AspTyr: 1.672 ± 0.286
0.0AspXaa: 0.0 ± 0.0
Glu
6.526GluAla: 6.526 ± 0.666
0.917GluCys: 0.917 ± 0.22
3.668GluAsp: 3.668 ± 0.403
3.074GluGlu: 3.074 ± 0.565
2.211GluPhe: 2.211 ± 0.379
3.236GluGly: 3.236 ± 0.372
1.51GluHis: 1.51 ± 0.366
1.996GluIle: 1.996 ± 0.357
2.589GluLys: 2.589 ± 0.374
5.933GluLeu: 5.933 ± 0.715
1.78GluMet: 1.78 ± 0.263
1.888GluAsn: 1.888 ± 0.267
2.967GluPro: 2.967 ± 0.446
3.02GluGln: 3.02 ± 0.365
4.746GluArg: 4.746 ± 0.632
2.967GluSer: 2.967 ± 0.473
3.991GluThr: 3.991 ± 0.576
3.883GluVal: 3.883 ± 0.432
1.51GluTrp: 1.51 ± 0.291
1.51GluTyr: 1.51 ± 0.276
0.0GluXaa: 0.0 ± 0.0
Phe
2.805PheAla: 2.805 ± 0.403
0.162PheCys: 0.162 ± 0.095
2.535PheAsp: 2.535 ± 0.451
1.618PheGlu: 1.618 ± 0.288
0.917PhePhe: 0.917 ± 0.243
2.805PheGly: 2.805 ± 0.67
0.431PheHis: 0.431 ± 0.145
1.402PheIle: 1.402 ± 0.319
1.025PheLys: 1.025 ± 0.229
1.888PheLeu: 1.888 ± 0.3
0.809PheMet: 0.809 ± 0.197
1.241PheAsn: 1.241 ± 0.337
1.564PhePro: 1.564 ± 0.28
1.133PheGln: 1.133 ± 0.325
1.672PheArg: 1.672 ± 0.271
1.672PheSer: 1.672 ± 0.416
2.211PheThr: 2.211 ± 0.397
2.211PheVal: 2.211 ± 0.284
0.593PheTrp: 0.593 ± 0.16
1.079PheTyr: 1.079 ± 0.293
0.0PheXaa: 0.0 ± 0.0
Gly
9.169GlyAla: 9.169 ± 1.317
1.187GlyCys: 1.187 ± 0.267
6.095GlyAsp: 6.095 ± 0.589
4.854GlyGlu: 4.854 ± 0.485
2.643GlyPhe: 2.643 ± 0.466
10.787GlyGly: 10.787 ± 2.499
1.834GlyHis: 1.834 ± 0.258
4.045GlyIle: 4.045 ± 0.552
2.373GlyLys: 2.373 ± 0.283
5.771GlyLeu: 5.771 ± 0.617
2.697GlyMet: 2.697 ± 0.506
2.805GlyAsn: 2.805 ± 0.394
3.776GlyPro: 3.776 ± 0.594
1.942GlyGln: 1.942 ± 0.539
5.394GlyArg: 5.394 ± 0.568
6.203GlySer: 6.203 ± 1.041
6.688GlyThr: 6.688 ± 0.756
5.502GlyVal: 5.502 ± 0.634
2.265GlyTrp: 2.265 ± 0.34
2.104GlyTyr: 2.104 ± 0.392
0.0GlyXaa: 0.0 ± 0.0
His
1.834HisAla: 1.834 ± 0.313
0.378HisCys: 0.378 ± 0.171
1.025HisAsp: 1.025 ± 0.216
1.025HisGlu: 1.025 ± 0.24
0.378HisPhe: 0.378 ± 0.139
1.834HisGly: 1.834 ± 0.355
0.863HisHis: 0.863 ± 0.233
1.672HisIle: 1.672 ± 0.356
0.755HisLys: 0.755 ± 0.203
1.348HisLeu: 1.348 ± 0.284
0.378HisMet: 0.378 ± 0.127
0.917HisAsn: 0.917 ± 0.202
1.51HisPro: 1.51 ± 0.21
0.917HisGln: 0.917 ± 0.269
1.942HisArg: 1.942 ± 0.381
0.863HisSer: 0.863 ± 0.187
1.402HisThr: 1.402 ± 0.315
1.187HisVal: 1.187 ± 0.349
0.431HisTrp: 0.431 ± 0.137
0.917HisTyr: 0.917 ± 0.161
0.0HisXaa: 0.0 ± 0.0
Ile
5.286IleAla: 5.286 ± 0.585
0.539IleCys: 0.539 ± 0.197
3.937IleAsp: 3.937 ± 0.538
3.776IleGlu: 3.776 ± 0.41
0.593IlePhe: 0.593 ± 0.178
3.506IleGly: 3.506 ± 0.471
1.187IleHis: 1.187 ± 0.263
1.294IleIle: 1.294 ± 0.266
1.133IleLys: 1.133 ± 0.237
2.05IleLeu: 2.05 ± 0.356
0.431IleMet: 0.431 ± 0.154
1.888IleAsn: 1.888 ± 0.249
2.859IlePro: 2.859 ± 0.344
1.618IleGln: 1.618 ± 0.286
2.104IleArg: 2.104 ± 0.335
2.373IleSer: 2.373 ± 0.5
3.614IleThr: 3.614 ± 0.398
3.182IleVal: 3.182 ± 0.344
0.809IleTrp: 0.809 ± 0.209
0.755IleTyr: 0.755 ± 0.195
0.0IleXaa: 0.0 ± 0.0
Lys
4.639LysAla: 4.639 ± 0.61
0.431LysCys: 0.431 ± 0.142
2.05LysAsp: 2.05 ± 0.27
1.241LysGlu: 1.241 ± 0.233
1.456LysPhe: 1.456 ± 0.25
2.751LysGly: 2.751 ± 0.386
0.971LysHis: 0.971 ± 0.217
0.863LysIle: 0.863 ± 0.211
1.187LysLys: 1.187 ± 0.31
2.535LysLeu: 2.535 ± 0.459
0.431LysMet: 0.431 ± 0.138
0.971LysAsn: 0.971 ± 0.223
2.319LysPro: 2.319 ± 0.395
1.888LysGln: 1.888 ± 0.315
2.373LysArg: 2.373 ± 0.323
1.888LysSer: 1.888 ± 0.251
2.104LysThr: 2.104 ± 0.371
2.805LysVal: 2.805 ± 0.51
0.863LysTrp: 0.863 ± 0.264
0.917LysTyr: 0.917 ± 0.244
0.0LysXaa: 0.0 ± 0.0
Leu
7.282LeuAla: 7.282 ± 0.763
0.863LeuCys: 0.863 ± 0.218
4.746LeuAsp: 4.746 ± 0.528
4.315LeuGlu: 4.315 ± 0.591
1.942LeuPhe: 1.942 ± 0.289
4.962LeuGly: 4.962 ± 0.533
0.809LeuHis: 0.809 ± 0.213
3.182LeuIle: 3.182 ± 0.428
2.319LeuLys: 2.319 ± 0.371
5.286LeuLeu: 5.286 ± 0.508
1.888LeuMet: 1.888 ± 0.327
2.751LeuAsn: 2.751 ± 0.316
5.016LeuPro: 5.016 ± 0.65
2.589LeuGln: 2.589 ± 0.44
5.448LeuArg: 5.448 ± 0.559
5.448LeuSer: 5.448 ± 0.543
5.448LeuThr: 5.448 ± 0.579
5.07LeuVal: 5.07 ± 0.63
1.025LeuTrp: 1.025 ± 0.266
2.319LeuTyr: 2.319 ± 0.37
0.0LeuXaa: 0.0 ± 0.0
Met
2.427MetAla: 2.427 ± 0.345
0.216MetCys: 0.216 ± 0.159
1.564MetAsp: 1.564 ± 0.268
0.863MetGlu: 0.863 ± 0.177
0.701MetPhe: 0.701 ± 0.2
1.51MetGly: 1.51 ± 0.231
0.216MetHis: 0.216 ± 0.123
0.863MetIle: 0.863 ± 0.235
0.755MetLys: 0.755 ± 0.204
1.564MetLeu: 1.564 ± 0.272
0.647MetMet: 0.647 ± 0.244
0.971MetAsn: 0.971 ± 0.214
1.133MetPro: 1.133 ± 0.212
0.378MetGln: 0.378 ± 0.138
1.672MetArg: 1.672 ± 0.319
3.02MetSer: 3.02 ± 0.428
2.05MetThr: 2.05 ± 0.26
1.402MetVal: 1.402 ± 0.299
0.324MetTrp: 0.324 ± 0.128
0.324MetTyr: 0.324 ± 0.133
0.0MetXaa: 0.0 ± 0.0
Asn
3.29AsnAla: 3.29 ± 0.322
0.27AsnCys: 0.27 ± 0.136
1.456AsnAsp: 1.456 ± 0.216
1.726AsnGlu: 1.726 ± 0.317
0.809AsnPhe: 0.809 ± 0.274
3.883AsnGly: 3.883 ± 0.561
1.079AsnHis: 1.079 ± 0.225
1.672AsnIle: 1.672 ± 0.446
0.971AsnLys: 0.971 ± 0.224
2.373AsnLeu: 2.373 ± 0.276
0.863AsnMet: 0.863 ± 0.222
1.78AsnAsn: 1.78 ± 0.364
2.751AsnPro: 2.751 ± 0.346
1.079AsnGln: 1.079 ± 0.276
2.05AsnArg: 2.05 ± 0.375
1.834AsnSer: 1.834 ± 0.307
2.157AsnThr: 2.157 ± 0.294
2.157AsnVal: 2.157 ± 0.355
0.647AsnTrp: 0.647 ± 0.193
0.809AsnTyr: 0.809 ± 0.185
0.0AsnXaa: 0.0 ± 0.0
Pro
4.962ProAla: 4.962 ± 0.528
0.809ProCys: 0.809 ± 0.237
3.991ProAsp: 3.991 ± 0.502
4.207ProGlu: 4.207 ± 0.515
1.726ProPhe: 1.726 ± 0.312
6.257ProGly: 6.257 ± 0.678
1.564ProHis: 1.564 ± 0.286
1.726ProIle: 1.726 ± 0.271
2.589ProLys: 2.589 ± 0.395
4.153ProLeu: 4.153 ± 0.505
1.348ProMet: 1.348 ± 0.328
2.211ProAsn: 2.211 ± 0.273
3.506ProPro: 3.506 ± 0.445
1.942ProGln: 1.942 ± 0.367
3.452ProArg: 3.452 ± 0.549
2.913ProSer: 2.913 ± 0.377
3.668ProThr: 3.668 ± 0.449
4.746ProVal: 4.746 ± 0.43
1.402ProTrp: 1.402 ± 0.254
1.294ProTyr: 1.294 ± 0.227
0.0ProXaa: 0.0 ± 0.0
Gln
4.369GlnAla: 4.369 ± 0.577
0.27GlnCys: 0.27 ± 0.137
1.564GlnAsp: 1.564 ± 0.278
1.618GlnGlu: 1.618 ± 0.311
1.187GlnPhe: 1.187 ± 0.222
2.319GlnGly: 2.319 ± 0.46
0.809GlnHis: 0.809 ± 0.22
1.834GlnIle: 1.834 ± 0.325
1.402GlnLys: 1.402 ± 0.229
2.859GlnLeu: 2.859 ± 0.412
0.485GlnMet: 0.485 ± 0.17
0.809GlnAsn: 0.809 ± 0.21
2.643GlnPro: 2.643 ± 0.441
1.133GlnGln: 1.133 ± 0.328
2.589GlnArg: 2.589 ± 0.361
1.942GlnSer: 1.942 ± 0.329
1.834GlnThr: 1.834 ± 0.349
2.805GlnVal: 2.805 ± 0.366
0.701GlnTrp: 0.701 ± 0.234
0.917GlnTyr: 0.917 ± 0.274
0.0GlnXaa: 0.0 ± 0.0
Arg
6.257ArgAla: 6.257 ± 0.605
1.025ArgCys: 1.025 ± 0.321
5.07ArgAsp: 5.07 ± 0.577
4.962ArgGlu: 4.962 ± 0.565
2.643ArgPhe: 2.643 ± 0.418
4.369ArgGly: 4.369 ± 0.487
1.187ArgHis: 1.187 ± 0.254
3.668ArgIle: 3.668 ± 0.504
2.643ArgLys: 2.643 ± 0.449
5.394ArgLeu: 5.394 ± 0.612
2.319ArgMet: 2.319 ± 0.336
2.05ArgAsn: 2.05 ± 0.336
3.883ArgPro: 3.883 ± 0.507
1.888ArgGln: 1.888 ± 0.351
5.987ArgArg: 5.987 ± 0.723
3.83ArgSer: 3.83 ± 0.471
3.668ArgThr: 3.668 ± 0.52
5.286ArgVal: 5.286 ± 0.561
1.672ArgTrp: 1.672 ± 0.305
2.265ArgTyr: 2.265 ± 0.357
0.0ArgXaa: 0.0 ± 0.0
Ser
5.232SerAla: 5.232 ± 0.637
0.485SerCys: 0.485 ± 0.198
4.477SerAsp: 4.477 ± 0.504
3.128SerGlu: 3.128 ± 0.366
1.996SerPhe: 1.996 ± 0.458
6.85SerGly: 6.85 ± 1.159
1.294SerHis: 1.294 ± 0.243
2.859SerIle: 2.859 ± 0.411
2.319SerLys: 2.319 ± 0.36
3.668SerLeu: 3.668 ± 0.41
1.51SerMet: 1.51 ± 0.234
2.373SerAsn: 2.373 ± 0.408
3.074SerPro: 3.074 ± 0.366
1.456SerGln: 1.456 ± 0.216
3.29SerArg: 3.29 ± 0.356
3.83SerSer: 3.83 ± 0.683
3.56SerThr: 3.56 ± 0.416
5.394SerVal: 5.394 ± 0.745
1.348SerTrp: 1.348 ± 0.259
1.133SerTyr: 1.133 ± 0.181
0.0SerXaa: 0.0 ± 0.0
Thr
6.742ThrAla: 6.742 ± 0.612
0.539ThrCys: 0.539 ± 0.214
4.477ThrAsp: 4.477 ± 0.591
3.937ThrGlu: 3.937 ± 0.527
1.564ThrPhe: 1.564 ± 0.306
6.526ThrGly: 6.526 ± 0.745
1.51ThrHis: 1.51 ± 0.304
3.668ThrIle: 3.668 ± 0.432
1.834ThrLys: 1.834 ± 0.268
4.099ThrLeu: 4.099 ± 0.451
1.294ThrMet: 1.294 ± 0.267
2.104ThrAsn: 2.104 ± 0.297
4.8ThrPro: 4.8 ± 0.679
1.888ThrGln: 1.888 ± 0.308
3.83ThrArg: 3.83 ± 0.433
3.883ThrSer: 3.883 ± 0.517
4.746ThrThr: 4.746 ± 0.572
6.041ThrVal: 6.041 ± 0.652
1.348ThrTrp: 1.348 ± 0.324
1.888ThrTyr: 1.888 ± 0.39
0.0ThrXaa: 0.0 ± 0.0
Val
7.659ValAla: 7.659 ± 0.576
1.079ValCys: 1.079 ± 0.22
4.8ValAsp: 4.8 ± 0.529
5.07ValGlu: 5.07 ± 0.589
2.373ValPhe: 2.373 ± 0.446
5.502ValGly: 5.502 ± 0.616
1.133ValHis: 1.133 ± 0.215
2.859ValIle: 2.859 ± 0.37
2.535ValLys: 2.535 ± 0.393
5.556ValLeu: 5.556 ± 0.626
1.241ValMet: 1.241 ± 0.205
2.373ValAsn: 2.373 ± 0.289
3.668ValPro: 3.668 ± 0.414
3.02ValGln: 3.02 ± 0.34
4.261ValArg: 4.261 ± 0.6
4.908ValSer: 4.908 ± 0.686
5.448ValThr: 5.448 ± 0.589
6.365ValVal: 6.365 ± 0.684
1.726ValTrp: 1.726 ± 0.356
1.618ValTyr: 1.618 ± 0.295
0.0ValXaa: 0.0 ± 0.0
Trp
1.78TrpAla: 1.78 ± 0.271
0.27TrpCys: 0.27 ± 0.121
1.402TrpAsp: 1.402 ± 0.259
1.348TrpGlu: 1.348 ± 0.356
0.755TrpPhe: 0.755 ± 0.211
1.025TrpGly: 1.025 ± 0.225
0.593TrpHis: 0.593 ± 0.194
1.079TrpIle: 1.079 ± 0.233
0.647TrpLys: 0.647 ± 0.17
1.996TrpLeu: 1.996 ± 0.34
0.863TrpMet: 0.863 ± 0.201
0.701TrpAsn: 0.701 ± 0.226
0.809TrpPro: 0.809 ± 0.208
0.971TrpGln: 0.971 ± 0.248
2.589TrpArg: 2.589 ± 0.415
1.079TrpSer: 1.079 ± 0.247
1.456TrpThr: 1.456 ± 0.26
1.618TrpVal: 1.618 ± 0.421
0.971TrpTrp: 0.971 ± 0.208
0.485TrpTyr: 0.485 ± 0.151
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.319TyrAla: 2.319 ± 0.311
0.324TyrCys: 0.324 ± 0.21
1.834TyrAsp: 1.834 ± 0.366
1.942TyrGlu: 1.942 ± 0.284
0.809TyrPhe: 0.809 ± 0.209
2.05TyrGly: 2.05 ± 0.381
0.324TyrHis: 0.324 ± 0.106
1.079TyrIle: 1.079 ± 0.223
0.593TyrLys: 0.593 ± 0.195
1.834TyrLeu: 1.834 ± 0.379
0.162TyrMet: 0.162 ± 0.077
0.755TyrAsn: 0.755 ± 0.19
1.51TyrPro: 1.51 ± 0.256
0.755TyrGln: 0.755 ± 0.235
2.157TyrArg: 2.157 ± 0.341
1.187TyrSer: 1.187 ± 0.267
1.888TyrThr: 1.888 ± 0.28
2.427TyrVal: 2.427 ± 0.346
0.431TyrTrp: 0.431 ± 0.137
0.647TyrTyr: 0.647 ± 0.185
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 104 proteins (18541 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski