Amino acid dipepetide frequency for Mycobacterium phage DillTech15

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.894AlaAla: 12.894 ± 1.621
0.816AlaCys: 0.816 ± 0.222
7.399AlaAsp: 7.399 ± 0.617
6.91AlaGlu: 6.91 ± 0.789
2.829AlaPhe: 2.829 ± 0.416
10.283AlaGly: 10.283 ± 1.255
2.394AlaHis: 2.394 ± 0.432
4.298AlaIle: 4.298 ± 0.554
3.7AlaLys: 3.7 ± 0.463
8.107AlaLeu: 8.107 ± 0.887
2.503AlaMet: 2.503 ± 0.414
2.666AlaAsn: 2.666 ± 0.349
5.005AlaPro: 5.005 ± 0.689
3.319AlaGln: 3.319 ± 0.551
7.835AlaArg: 7.835 ± 0.698
5.767AlaSer: 5.767 ± 0.601
6.039AlaThr: 6.039 ± 0.591
6.529AlaVal: 6.529 ± 0.675
2.503AlaTrp: 2.503 ± 0.338
2.503AlaTyr: 2.503 ± 0.366
0.0AlaXaa: 0.0 ± 0.0
Cys
1.034CysAla: 1.034 ± 0.272
0.054CysCys: 0.054 ± 0.048
1.415CysAsp: 1.415 ± 0.345
0.979CysGlu: 0.979 ± 0.229
0.218CysPhe: 0.218 ± 0.098
1.523CysGly: 1.523 ± 0.353
0.381CysHis: 0.381 ± 0.139
0.218CysIle: 0.218 ± 0.124
0.381CysLys: 0.381 ± 0.141
1.088CysLeu: 1.088 ± 0.347
0.272CysMet: 0.272 ± 0.154
0.272CysAsn: 0.272 ± 0.124
1.469CysPro: 1.469 ± 0.375
0.381CysGln: 0.381 ± 0.139
0.598CysArg: 0.598 ± 0.195
0.49CysSer: 0.49 ± 0.186
0.707CysThr: 0.707 ± 0.204
0.653CysVal: 0.653 ± 0.205
0.326CysTrp: 0.326 ± 0.125
0.218CysTyr: 0.218 ± 0.115
0.0CysXaa: 0.0 ± 0.0
Asp
7.345AspAla: 7.345 ± 0.593
0.816AspCys: 0.816 ± 0.198
4.353AspAsp: 4.353 ± 0.607
3.482AspGlu: 3.482 ± 0.512
1.687AspPhe: 1.687 ± 0.284
6.801AspGly: 6.801 ± 0.624
1.034AspHis: 1.034 ± 0.212
2.176AspIle: 2.176 ± 0.374
1.741AspLys: 1.741 ± 0.305
6.202AspLeu: 6.202 ± 0.538
1.143AspMet: 1.143 ± 0.3
1.578AspAsn: 1.578 ± 0.329
4.951AspPro: 4.951 ± 0.694
2.394AspGln: 2.394 ± 0.333
5.277AspArg: 5.277 ± 0.598
3.428AspSer: 3.428 ± 0.56
4.353AspThr: 4.353 ± 0.502
4.298AspVal: 4.298 ± 0.541
1.36AspTrp: 1.36 ± 0.264
1.904AspTyr: 1.904 ± 0.293
0.0AspXaa: 0.0 ± 0.0
Glu
5.985GluAla: 5.985 ± 0.66
0.762GluCys: 0.762 ± 0.274
2.448GluAsp: 2.448 ± 0.409
2.884GluGlu: 2.884 ± 0.53
1.795GluPhe: 1.795 ± 0.294
3.156GluGly: 3.156 ± 0.44
1.795GluHis: 1.795 ± 0.386
2.067GluIle: 2.067 ± 0.345
1.959GluLys: 1.959 ± 0.35
5.658GluLeu: 5.658 ± 0.778
1.578GluMet: 1.578 ± 0.335
1.741GluAsn: 1.741 ± 0.27
2.829GluPro: 2.829 ± 0.436
2.829GluGln: 2.829 ± 0.372
4.788GluArg: 4.788 ± 0.593
2.829GluSer: 2.829 ± 0.457
4.625GluThr: 4.625 ± 0.709
4.081GluVal: 4.081 ± 0.544
1.469GluTrp: 1.469 ± 0.235
1.904GluTyr: 1.904 ± 0.355
0.0GluXaa: 0.0 ± 0.0
Phe
3.156PheAla: 3.156 ± 0.526
0.163PheCys: 0.163 ± 0.096
2.394PheAsp: 2.394 ± 0.395
1.578PheGlu: 1.578 ± 0.307
0.979PhePhe: 0.979 ± 0.263
3.101PheGly: 3.101 ± 0.62
0.381PheHis: 0.381 ± 0.15
1.306PheIle: 1.306 ± 0.346
1.034PheLys: 1.034 ± 0.233
1.687PheLeu: 1.687 ± 0.276
0.871PheMet: 0.871 ± 0.237
1.469PheAsn: 1.469 ± 0.361
1.469PhePro: 1.469 ± 0.282
1.088PheGln: 1.088 ± 0.31
1.632PheArg: 1.632 ± 0.263
1.36PheSer: 1.36 ± 0.271
2.339PheThr: 2.339 ± 0.356
1.85PheVal: 1.85 ± 0.285
0.544PheTrp: 0.544 ± 0.139
0.816PheTyr: 0.816 ± 0.269
0.0PheXaa: 0.0 ± 0.0
Gly
8.923GlyAla: 8.923 ± 1.102
0.979GlyCys: 0.979 ± 0.215
6.529GlyAsp: 6.529 ± 0.609
3.917GlyGlu: 3.917 ± 0.535
2.666GlyPhe: 2.666 ± 0.461
9.902GlyGly: 9.902 ± 2.591
2.176GlyHis: 2.176 ± 0.288
4.298GlyIle: 4.298 ± 0.601
2.612GlyLys: 2.612 ± 0.379
5.495GlyLeu: 5.495 ± 0.586
2.122GlyMet: 2.122 ± 0.434
3.591GlyAsn: 3.591 ± 0.45
4.407GlyPro: 4.407 ± 0.504
2.394GlyGln: 2.394 ± 0.564
5.332GlyArg: 5.332 ± 0.654
6.094GlySer: 6.094 ± 0.948
6.474GlyThr: 6.474 ± 0.661
5.658GlyVal: 5.658 ± 0.661
2.72GlyTrp: 2.72 ± 0.365
1.904GlyTyr: 1.904 ± 0.382
0.0GlyXaa: 0.0 ± 0.0
His
1.904HisAla: 1.904 ± 0.403
0.326HisCys: 0.326 ± 0.138
0.871HisAsp: 0.871 ± 0.208
1.251HisGlu: 1.251 ± 0.25
0.598HisPhe: 0.598 ± 0.178
1.36HisGly: 1.36 ± 0.249
1.143HisHis: 1.143 ± 0.279
1.469HisIle: 1.469 ± 0.325
0.925HisLys: 0.925 ± 0.231
1.523HisLeu: 1.523 ± 0.301
0.544HisMet: 0.544 ± 0.155
0.871HisAsn: 0.871 ± 0.201
1.578HisPro: 1.578 ± 0.262
0.653HisGln: 0.653 ± 0.17
2.176HisArg: 2.176 ± 0.441
0.979HisSer: 0.979 ± 0.214
1.741HisThr: 1.741 ± 0.374
1.306HisVal: 1.306 ± 0.293
0.544HisTrp: 0.544 ± 0.152
0.871HisTyr: 0.871 ± 0.281
0.0HisXaa: 0.0 ± 0.0
Ile
5.277IleAla: 5.277 ± 0.581
0.707IleCys: 0.707 ± 0.228
3.7IleAsp: 3.7 ± 0.526
3.645IleGlu: 3.645 ± 0.368
0.762IlePhe: 0.762 ± 0.251
3.754IleGly: 3.754 ± 0.444
1.469IleHis: 1.469 ± 0.338
1.578IleIle: 1.578 ± 0.323
1.034IleLys: 1.034 ± 0.245
2.122IleLeu: 2.122 ± 0.431
0.163IleMet: 0.163 ± 0.08
1.795IleAsn: 1.795 ± 0.275
2.72IlePro: 2.72 ± 0.381
1.306IleGln: 1.306 ± 0.268
2.394IleArg: 2.394 ± 0.383
2.394IleSer: 2.394 ± 0.442
3.754IleThr: 3.754 ± 0.43
2.938IleVal: 2.938 ± 0.422
0.925IleTrp: 0.925 ± 0.265
0.816IleTyr: 0.816 ± 0.27
0.0IleXaa: 0.0 ± 0.0
Lys
3.591LysAla: 3.591 ± 0.529
0.49LysCys: 0.49 ± 0.182
1.904LysAsp: 1.904 ± 0.312
1.36LysGlu: 1.36 ± 0.297
1.415LysPhe: 1.415 ± 0.227
2.448LysGly: 2.448 ± 0.362
0.871LysHis: 0.871 ± 0.241
1.088LysIle: 1.088 ± 0.281
1.469LysLys: 1.469 ± 0.364
2.448LysLeu: 2.448 ± 0.461
0.653LysMet: 0.653 ± 0.177
0.707LysAsn: 0.707 ± 0.17
2.448LysPro: 2.448 ± 0.374
1.632LysGln: 1.632 ± 0.246
2.884LysArg: 2.884 ± 0.407
2.231LysSer: 2.231 ± 0.324
2.122LysThr: 2.122 ± 0.336
2.612LysVal: 2.612 ± 0.517
0.871LysTrp: 0.871 ± 0.202
0.653LysTyr: 0.653 ± 0.179
0.0LysXaa: 0.0 ± 0.0
Leu
7.508LeuAla: 7.508 ± 0.798
0.816LeuCys: 0.816 ± 0.245
4.461LeuAsp: 4.461 ± 0.477
3.7LeuGlu: 3.7 ± 0.495
2.448LeuPhe: 2.448 ± 0.244
5.223LeuGly: 5.223 ± 0.608
0.979LeuHis: 0.979 ± 0.242
3.156LeuIle: 3.156 ± 0.44
2.285LeuLys: 2.285 ± 0.36
5.114LeuLeu: 5.114 ± 0.557
1.578LeuMet: 1.578 ± 0.306
2.775LeuAsn: 2.775 ± 0.38
5.55LeuPro: 5.55 ± 0.726
2.775LeuGln: 2.775 ± 0.517
4.842LeuArg: 4.842 ± 0.565
5.876LeuSer: 5.876 ± 0.604
5.005LeuThr: 5.005 ± 0.534
4.788LeuVal: 4.788 ± 0.534
1.36LeuTrp: 1.36 ± 0.259
2.231LeuTyr: 2.231 ± 0.417
0.0LeuXaa: 0.0 ± 0.0
Met
2.448MetAla: 2.448 ± 0.368
0.109MetCys: 0.109 ± 0.074
1.415MetAsp: 1.415 ± 0.269
0.979MetGlu: 0.979 ± 0.203
0.435MetPhe: 0.435 ± 0.167
1.741MetGly: 1.741 ± 0.287
0.163MetHis: 0.163 ± 0.105
0.871MetIle: 0.871 ± 0.213
0.762MetLys: 0.762 ± 0.238
1.687MetLeu: 1.687 ± 0.263
0.653MetMet: 0.653 ± 0.221
0.816MetAsn: 0.816 ± 0.204
1.687MetPro: 1.687 ± 0.33
0.598MetGln: 0.598 ± 0.148
1.251MetArg: 1.251 ± 0.237
3.047MetSer: 3.047 ± 0.411
1.85MetThr: 1.85 ± 0.301
1.415MetVal: 1.415 ± 0.298
0.381MetTrp: 0.381 ± 0.146
0.435MetTyr: 0.435 ± 0.182
0.0MetXaa: 0.0 ± 0.0
Asn
3.156AsnAla: 3.156 ± 0.35
0.272AsnCys: 0.272 ± 0.141
1.85AsnAsp: 1.85 ± 0.289
1.741AsnGlu: 1.741 ± 0.36
0.979AsnPhe: 0.979 ± 0.295
4.189AsnGly: 4.189 ± 0.682
0.925AsnHis: 0.925 ± 0.19
1.578AsnIle: 1.578 ± 0.581
1.034AsnLys: 1.034 ± 0.283
2.122AsnLeu: 2.122 ± 0.335
0.871AsnMet: 0.871 ± 0.183
1.632AsnAsn: 1.632 ± 0.322
2.557AsnPro: 2.557 ± 0.379
0.925AsnGln: 0.925 ± 0.326
2.176AsnArg: 2.176 ± 0.364
1.632AsnSer: 1.632 ± 0.327
2.285AsnThr: 2.285 ± 0.249
2.285AsnVal: 2.285 ± 0.354
0.816AsnTrp: 0.816 ± 0.203
0.49AsnTyr: 0.49 ± 0.152
0.0AsnXaa: 0.0 ± 0.0
Pro
5.604ProAla: 5.604 ± 0.634
0.762ProCys: 0.762 ± 0.182
4.57ProAsp: 4.57 ± 0.6
4.026ProGlu: 4.026 ± 0.403
1.741ProPhe: 1.741 ± 0.323
6.311ProGly: 6.311 ± 0.753
1.415ProHis: 1.415 ± 0.308
2.122ProIle: 2.122 ± 0.334
2.557ProLys: 2.557 ± 0.458
4.407ProLeu: 4.407 ± 0.518
1.741ProMet: 1.741 ± 0.351
2.448ProAsn: 2.448 ± 0.354
3.536ProPro: 3.536 ± 0.681
2.339ProGln: 2.339 ± 0.401
3.101ProArg: 3.101 ± 0.519
3.863ProSer: 3.863 ± 0.455
3.482ProThr: 3.482 ± 0.428
4.679ProVal: 4.679 ± 0.549
1.197ProTrp: 1.197 ± 0.244
1.523ProTyr: 1.523 ± 0.286
0.0ProXaa: 0.0 ± 0.0
Gln
4.407GlnAla: 4.407 ± 0.657
0.544GlnCys: 0.544 ± 0.226
1.415GlnAsp: 1.415 ± 0.236
1.306GlnGlu: 1.306 ± 0.267
1.197GlnPhe: 1.197 ± 0.259
2.775GlnGly: 2.775 ± 0.442
1.034GlnHis: 1.034 ± 0.281
1.795GlnIle: 1.795 ± 0.332
1.687GlnLys: 1.687 ± 0.288
2.72GlnLeu: 2.72 ± 0.457
0.707GlnMet: 0.707 ± 0.205
0.707GlnAsn: 0.707 ± 0.222
3.047GlnPro: 3.047 ± 0.396
0.871GlnGln: 0.871 ± 0.231
2.557GlnArg: 2.557 ± 0.348
2.176GlnSer: 2.176 ± 0.339
1.523GlnThr: 1.523 ± 0.407
2.448GlnVal: 2.448 ± 0.336
0.598GlnTrp: 0.598 ± 0.153
0.871GlnTyr: 0.871 ± 0.251
0.0GlnXaa: 0.0 ± 0.0
Arg
6.692ArgAla: 6.692 ± 0.63
1.251ArgCys: 1.251 ± 0.297
4.788ArgAsp: 4.788 ± 0.644
4.788ArgGlu: 4.788 ± 0.548
1.959ArgPhe: 1.959 ± 0.332
4.516ArgGly: 4.516 ± 0.473
1.687ArgHis: 1.687 ± 0.381
4.298ArgIle: 4.298 ± 0.594
2.503ArgLys: 2.503 ± 0.359
4.788ArgLeu: 4.788 ± 0.671
2.72ArgMet: 2.72 ± 0.381
2.285ArgAsn: 2.285 ± 0.34
3.482ArgPro: 3.482 ± 0.432
1.959ArgGln: 1.959 ± 0.406
5.441ArgArg: 5.441 ± 0.802
4.026ArgSer: 4.026 ± 0.354
3.319ArgThr: 3.319 ± 0.481
4.842ArgVal: 4.842 ± 0.579
2.394ArgTrp: 2.394 ± 0.387
2.122ArgTyr: 2.122 ± 0.341
0.0ArgXaa: 0.0 ± 0.0
Ser
5.876SerAla: 5.876 ± 0.865
0.871SerCys: 0.871 ± 0.245
3.754SerAsp: 3.754 ± 0.493
2.992SerGlu: 2.992 ± 0.448
1.959SerPhe: 1.959 ± 0.402
6.746SerGly: 6.746 ± 0.817
0.871SerHis: 0.871 ± 0.201
2.612SerIle: 2.612 ± 0.458
2.339SerLys: 2.339 ± 0.372
4.244SerLeu: 4.244 ± 0.495
1.197SerMet: 1.197 ± 0.256
2.122SerAsn: 2.122 ± 0.431
3.373SerPro: 3.373 ± 0.312
1.959SerGln: 1.959 ± 0.282
3.972SerArg: 3.972 ± 0.472
2.992SerSer: 2.992 ± 0.456
4.353SerThr: 4.353 ± 0.456
4.516SerVal: 4.516 ± 0.539
1.469SerTrp: 1.469 ± 0.268
1.523SerTyr: 1.523 ± 0.23
0.0SerXaa: 0.0 ± 0.0
Thr
6.039ThrAla: 6.039 ± 0.682
0.816ThrCys: 0.816 ± 0.261
4.244ThrAsp: 4.244 ± 0.617
4.135ThrGlu: 4.135 ± 0.486
1.959ThrPhe: 1.959 ± 0.33
6.148ThrGly: 6.148 ± 0.58
1.469ThrHis: 1.469 ± 0.287
3.591ThrIle: 3.591 ± 0.455
1.959ThrLys: 1.959 ± 0.283
4.081ThrLeu: 4.081 ± 0.463
1.251ThrMet: 1.251 ± 0.237
2.557ThrAsn: 2.557 ± 0.41
4.407ThrPro: 4.407 ± 0.523
2.013ThrGln: 2.013 ± 0.319
4.353ThrArg: 4.353 ± 0.467
3.972ThrSer: 3.972 ± 0.44
4.679ThrThr: 4.679 ± 0.561
6.094ThrVal: 6.094 ± 0.599
1.306ThrTrp: 1.306 ± 0.297
1.795ThrTyr: 1.795 ± 0.329
0.0ThrXaa: 0.0 ± 0.0
Val
7.617ValAla: 7.617 ± 0.679
1.415ValCys: 1.415 ± 0.255
5.441ValAsp: 5.441 ± 0.541
4.679ValGlu: 4.679 ± 0.574
2.285ValPhe: 2.285 ± 0.398
5.277ValGly: 5.277 ± 0.648
1.36ValHis: 1.36 ± 0.309
2.666ValIle: 2.666 ± 0.453
2.503ValLys: 2.503 ± 0.434
5.332ValLeu: 5.332 ± 0.612
1.034ValMet: 1.034 ± 0.241
1.904ValAsn: 1.904 ± 0.312
3.972ValPro: 3.972 ± 0.405
2.884ValGln: 2.884 ± 0.416
4.461ValArg: 4.461 ± 0.617
4.353ValSer: 4.353 ± 0.492
5.169ValThr: 5.169 ± 0.478
6.692ValVal: 6.692 ± 0.78
1.795ValTrp: 1.795 ± 0.351
1.306ValTyr: 1.306 ± 0.255
0.0ValXaa: 0.0 ± 0.0
Trp
2.231TrpAla: 2.231 ± 0.297
0.218TrpCys: 0.218 ± 0.122
1.523TrpAsp: 1.523 ± 0.299
1.143TrpGlu: 1.143 ± 0.285
0.598TrpPhe: 0.598 ± 0.176
0.979TrpGly: 0.979 ± 0.212
0.816TrpHis: 0.816 ± 0.227
0.979TrpIle: 0.979 ± 0.223
0.762TrpLys: 0.762 ± 0.214
1.85TrpLeu: 1.85 ± 0.331
0.871TrpMet: 0.871 ± 0.232
0.653TrpAsn: 0.653 ± 0.22
1.36TrpPro: 1.36 ± 0.393
1.143TrpGln: 1.143 ± 0.283
2.557TrpArg: 2.557 ± 0.466
1.251TrpSer: 1.251 ± 0.24
1.632TrpThr: 1.632 ± 0.318
2.176TrpVal: 2.176 ± 0.422
0.816TrpTrp: 0.816 ± 0.171
0.544TrpTyr: 0.544 ± 0.2
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.612TyrAla: 2.612 ± 0.383
0.49TyrCys: 0.49 ± 0.185
1.85TyrAsp: 1.85 ± 0.38
1.687TyrGlu: 1.687 ± 0.282
0.707TyrPhe: 0.707 ± 0.224
1.959TyrGly: 1.959 ± 0.381
0.272TyrHis: 0.272 ± 0.111
1.088TyrIle: 1.088 ± 0.271
0.653TyrLys: 0.653 ± 0.207
1.795TyrLeu: 1.795 ± 0.3
0.218TyrMet: 0.218 ± 0.105
0.925TyrAsn: 0.925 ± 0.232
1.578TyrPro: 1.578 ± 0.263
0.925TyrGln: 0.925 ± 0.223
2.231TyrArg: 2.231 ± 0.355
0.979TyrSer: 0.979 ± 0.238
1.469TyrThr: 1.469 ± 0.384
2.285TyrVal: 2.285 ± 0.301
0.707TyrTrp: 0.707 ± 0.213
0.653TyrTyr: 0.653 ± 0.184
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 111 proteins (18381 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski