Amino acid dipepetide frequency for Mycobacterium phage Luchador

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.557AlaAla: 11.557 ± 1.379
0.426AlaCys: 0.426 ± 0.156
5.231AlaAsp: 5.231 ± 0.534
5.9AlaGlu: 5.9 ± 0.773
3.163AlaPhe: 3.163 ± 0.476
7.603AlaGly: 7.603 ± 1.06
1.582AlaHis: 1.582 ± 0.376
4.805AlaIle: 4.805 ± 0.497
4.866AlaLys: 4.866 ± 0.585
8.212AlaLeu: 8.212 ± 1.025
2.798AlaMet: 2.798 ± 0.361
3.345AlaAsn: 3.345 ± 0.47
4.562AlaPro: 4.562 ± 0.621
3.65AlaGln: 3.65 ± 0.55
5.535AlaArg: 5.535 ± 0.664
4.927AlaSer: 4.927 ± 0.633
5.17AlaThr: 5.17 ± 0.771
7.603AlaVal: 7.603 ± 0.618
2.251AlaTrp: 2.251 ± 0.41
2.494AlaTyr: 2.494 ± 0.418
0.0AlaXaa: 0.0 ± 0.0
Cys
0.426CysAla: 0.426 ± 0.149
0.0CysCys: 0.0 ± 0.0
0.669CysAsp: 0.669 ± 0.166
0.669CysGlu: 0.669 ± 0.207
0.365CysPhe: 0.365 ± 0.16
0.912CysGly: 0.912 ± 0.223
0.182CysHis: 0.182 ± 0.131
0.547CysIle: 0.547 ± 0.169
0.304CysLys: 0.304 ± 0.134
0.608CysLeu: 0.608 ± 0.164
0.243CysMet: 0.243 ± 0.124
0.487CysAsn: 0.487 ± 0.158
0.426CysPro: 0.426 ± 0.212
0.122CysGln: 0.122 ± 0.088
0.547CysArg: 0.547 ± 0.185
0.365CysSer: 0.365 ± 0.173
0.487CysThr: 0.487 ± 0.17
0.487CysVal: 0.487 ± 0.17
0.243CysTrp: 0.243 ± 0.126
0.304CysTyr: 0.304 ± 0.147
0.0CysXaa: 0.0 ± 0.0
Asp
6.448AspAla: 6.448 ± 0.701
0.487AspCys: 0.487 ± 0.182
3.893AspAsp: 3.893 ± 0.586
4.075AspGlu: 4.075 ± 0.574
2.92AspPhe: 2.92 ± 0.433
6.265AspGly: 6.265 ± 0.633
1.764AspHis: 1.764 ± 0.327
2.981AspIle: 2.981 ± 0.395
2.494AspLys: 2.494 ± 0.422
5.718AspLeu: 5.718 ± 0.663
1.277AspMet: 1.277 ± 0.306
1.703AspAsn: 1.703 ± 0.304
4.44AspPro: 4.44 ± 0.581
2.068AspGln: 2.068 ± 0.357
3.285AspArg: 3.285 ± 0.476
2.92AspSer: 2.92 ± 0.467
3.163AspThr: 3.163 ± 0.401
4.805AspVal: 4.805 ± 0.468
1.521AspTrp: 1.521 ± 0.311
2.372AspTyr: 2.372 ± 0.33
0.0AspXaa: 0.0 ± 0.0
Glu
6.509GluAla: 6.509 ± 0.813
0.243GluCys: 0.243 ± 0.109
3.65GluAsp: 3.65 ± 0.481
5.596GluGlu: 5.596 ± 0.74
2.676GluPhe: 2.676 ± 0.319
4.988GluGly: 4.988 ± 0.547
1.946GluHis: 1.946 ± 0.351
3.285GluIle: 3.285 ± 0.477
3.406GluLys: 3.406 ± 0.488
6.387GluLeu: 6.387 ± 0.722
2.068GluMet: 2.068 ± 0.339
2.311GluAsn: 2.311 ± 0.415
3.467GluPro: 3.467 ± 0.408
1.886GluGln: 1.886 ± 0.276
4.38GluArg: 4.38 ± 0.557
2.859GluSer: 2.859 ± 0.418
3.771GluThr: 3.771 ± 0.441
4.866GluVal: 4.866 ± 0.438
1.399GluTrp: 1.399 ± 0.294
2.129GluTyr: 2.129 ± 0.343
0.0GluXaa: 0.0 ± 0.0
Phe
2.859PheAla: 2.859 ± 0.471
0.365PheCys: 0.365 ± 0.159
2.798PheAsp: 2.798 ± 0.421
2.616PheGlu: 2.616 ± 0.357
0.73PhePhe: 0.73 ± 0.222
3.467PheGly: 3.467 ± 0.316
0.73PheHis: 0.73 ± 0.23
1.521PheIle: 1.521 ± 0.306
1.46PheLys: 1.46 ± 0.326
2.311PheLeu: 2.311 ± 0.433
0.547PheMet: 0.547 ± 0.159
1.703PheAsn: 1.703 ± 0.351
1.946PhePro: 1.946 ± 0.317
1.095PheGln: 1.095 ± 0.221
2.494PheArg: 2.494 ± 0.352
2.494PheSer: 2.494 ± 0.564
2.372PheThr: 2.372 ± 0.417
2.372PheVal: 2.372 ± 0.382
0.426PheTrp: 0.426 ± 0.175
1.034PheTyr: 1.034 ± 0.269
0.0PheXaa: 0.0 ± 0.0
Gly
6.995GlyAla: 6.995 ± 0.88
0.487GlyCys: 0.487 ± 0.181
5.292GlyAsp: 5.292 ± 0.607
5.414GlyGlu: 5.414 ± 0.737
2.981GlyPhe: 2.981 ± 0.388
9.732GlyGly: 9.732 ± 1.715
1.825GlyHis: 1.825 ± 0.421
4.258GlyIle: 4.258 ± 0.771
4.197GlyLys: 4.197 ± 0.527
7.056GlyLeu: 7.056 ± 0.838
2.251GlyMet: 2.251 ± 0.328
3.102GlyAsn: 3.102 ± 0.467
3.589GlyPro: 3.589 ± 0.478
2.494GlyGln: 2.494 ± 0.419
4.136GlyArg: 4.136 ± 0.592
4.805GlySer: 4.805 ± 0.57
5.353GlyThr: 5.353 ± 0.642
6.144GlyVal: 6.144 ± 0.584
2.068GlyTrp: 2.068 ± 0.312
2.433GlyTyr: 2.433 ± 0.365
0.0GlyXaa: 0.0 ± 0.0
His
1.642HisAla: 1.642 ± 0.339
0.243HisCys: 0.243 ± 0.106
1.521HisAsp: 1.521 ± 0.358
1.156HisGlu: 1.156 ± 0.224
0.608HisPhe: 0.608 ± 0.196
2.007HisGly: 2.007 ± 0.494
0.608HisHis: 0.608 ± 0.201
1.034HisIle: 1.034 ± 0.212
0.791HisLys: 0.791 ± 0.219
1.217HisLeu: 1.217 ± 0.273
0.243HisMet: 0.243 ± 0.11
0.547HisAsn: 0.547 ± 0.178
0.73HisPro: 0.73 ± 0.158
0.912HisGln: 0.912 ± 0.237
1.642HisArg: 1.642 ± 0.296
0.852HisSer: 0.852 ± 0.218
1.338HisThr: 1.338 ± 0.254
1.217HisVal: 1.217 ± 0.261
0.426HisTrp: 0.426 ± 0.175
0.608HisTyr: 0.608 ± 0.221
0.0HisXaa: 0.0 ± 0.0
Ile
4.988IleAla: 4.988 ± 0.59
0.487IleCys: 0.487 ± 0.153
4.745IleAsp: 4.745 ± 0.479
4.319IleGlu: 4.319 ± 0.481
1.46IlePhe: 1.46 ± 0.341
4.623IleGly: 4.623 ± 0.506
0.547IleHis: 0.547 ± 0.187
2.19IleIle: 2.19 ± 0.334
2.555IleLys: 2.555 ± 0.386
2.981IleLeu: 2.981 ± 0.446
0.608IleMet: 0.608 ± 0.213
1.825IleAsn: 1.825 ± 0.399
3.65IlePro: 3.65 ± 0.537
1.642IleGln: 1.642 ± 0.316
3.285IleArg: 3.285 ± 0.438
2.676IleSer: 2.676 ± 0.417
3.589IleThr: 3.589 ± 0.448
2.676IleVal: 2.676 ± 0.363
1.095IleTrp: 1.095 ± 0.293
0.973IleTyr: 0.973 ± 0.201
0.0IleXaa: 0.0 ± 0.0
Lys
4.684LysAla: 4.684 ± 0.537
0.182LysCys: 0.182 ± 0.105
2.129LysAsp: 2.129 ± 0.475
3.528LysGlu: 3.528 ± 0.548
1.703LysPhe: 1.703 ± 0.293
3.832LysGly: 3.832 ± 0.602
0.608LysHis: 0.608 ± 0.165
2.981LysIle: 2.981 ± 0.404
2.129LysLys: 2.129 ± 0.437
4.623LysLeu: 4.623 ± 0.423
1.217LysMet: 1.217 ± 0.333
1.521LysAsn: 1.521 ± 0.304
3.163LysPro: 3.163 ± 0.583
1.46LysGln: 1.46 ± 0.268
3.832LysArg: 3.832 ± 0.476
2.372LysSer: 2.372 ± 0.485
2.494LysThr: 2.494 ± 0.46
3.832LysVal: 3.832 ± 0.45
1.156LysTrp: 1.156 ± 0.263
1.095LysTyr: 1.095 ± 0.176
0.0LysXaa: 0.0 ± 0.0
Leu
7.117LeuAla: 7.117 ± 0.592
0.608LeuCys: 0.608 ± 0.226
5.535LeuAsp: 5.535 ± 0.674
5.353LeuGlu: 5.353 ± 0.643
2.737LeuPhe: 2.737 ± 0.385
5.231LeuGly: 5.231 ± 0.599
1.886LeuHis: 1.886 ± 0.339
3.65LeuIle: 3.65 ± 0.495
4.319LeuLys: 4.319 ± 0.585
5.657LeuLeu: 5.657 ± 0.488
2.007LeuMet: 2.007 ± 0.322
2.859LeuAsn: 2.859 ± 0.462
4.805LeuPro: 4.805 ± 0.541
2.251LeuGln: 2.251 ± 0.431
5.231LeuArg: 5.231 ± 0.68
4.684LeuSer: 4.684 ± 0.422
5.414LeuThr: 5.414 ± 0.604
5.292LeuVal: 5.292 ± 0.577
1.764LeuTrp: 1.764 ± 0.257
2.129LeuTyr: 2.129 ± 0.388
0.0LeuXaa: 0.0 ± 0.0
Met
2.798MetAla: 2.798 ± 0.46
0.243MetCys: 0.243 ± 0.114
0.791MetAsp: 0.791 ± 0.253
1.338MetGlu: 1.338 ± 0.259
0.669MetPhe: 0.669 ± 0.185
1.825MetGly: 1.825 ± 0.325
0.365MetHis: 0.365 ± 0.144
1.095MetIle: 1.095 ± 0.3
1.156MetLys: 1.156 ± 0.259
1.399MetLeu: 1.399 ± 0.287
0.608MetMet: 0.608 ± 0.191
0.669MetAsn: 0.669 ± 0.19
1.521MetPro: 1.521 ± 0.279
0.791MetGln: 0.791 ± 0.244
1.399MetArg: 1.399 ± 0.252
2.981MetSer: 2.981 ± 0.401
2.616MetThr: 2.616 ± 0.377
1.034MetVal: 1.034 ± 0.263
0.426MetTrp: 0.426 ± 0.142
0.365MetTyr: 0.365 ± 0.149
0.0MetXaa: 0.0 ± 0.0
Asn
3.832AsnAla: 3.832 ± 0.473
0.426AsnCys: 0.426 ± 0.157
1.521AsnAsp: 1.521 ± 0.291
1.946AsnGlu: 1.946 ± 0.372
1.156AsnPhe: 1.156 ± 0.26
3.771AsnGly: 3.771 ± 0.523
0.73AsnHis: 0.73 ± 0.184
1.886AsnIle: 1.886 ± 0.331
1.46AsnLys: 1.46 ± 0.266
3.041AsnLeu: 3.041 ± 0.362
1.034AsnMet: 1.034 ± 0.206
0.973AsnAsn: 0.973 ± 0.226
2.494AsnPro: 2.494 ± 0.409
1.521AsnGln: 1.521 ± 0.317
2.311AsnArg: 2.311 ± 0.363
1.582AsnSer: 1.582 ± 0.275
1.338AsnThr: 1.338 ± 0.274
2.311AsnVal: 2.311 ± 0.284
0.73AsnTrp: 0.73 ± 0.248
1.156AsnTyr: 1.156 ± 0.236
0.0AsnXaa: 0.0 ± 0.0
Pro
4.988ProAla: 4.988 ± 0.681
0.365ProCys: 0.365 ± 0.144
4.258ProAsp: 4.258 ± 0.472
4.075ProGlu: 4.075 ± 0.52
2.068ProPhe: 2.068 ± 0.311
4.319ProGly: 4.319 ± 0.566
1.095ProHis: 1.095 ± 0.263
2.311ProIle: 2.311 ± 0.313
2.311ProLys: 2.311 ± 0.403
3.467ProLeu: 3.467 ± 0.459
1.338ProMet: 1.338 ± 0.283
2.372ProAsn: 2.372 ± 0.375
2.19ProPro: 2.19 ± 0.421
1.582ProGln: 1.582 ± 0.336
2.981ProArg: 2.981 ± 0.476
3.163ProSer: 3.163 ± 0.468
3.771ProThr: 3.771 ± 0.584
4.197ProVal: 4.197 ± 0.588
0.973ProTrp: 0.973 ± 0.302
1.399ProTyr: 1.399 ± 0.28
0.0ProXaa: 0.0 ± 0.0
Gln
3.832GlnAla: 3.832 ± 0.548
0.182GlnCys: 0.182 ± 0.112
2.068GlnAsp: 2.068 ± 0.307
1.338GlnGlu: 1.338 ± 0.304
1.825GlnPhe: 1.825 ± 0.338
2.555GlnGly: 2.555 ± 0.429
0.669GlnHis: 0.669 ± 0.186
2.798GlnIle: 2.798 ± 0.406
2.007GlnLys: 2.007 ± 0.226
2.494GlnLeu: 2.494 ± 0.384
0.669GlnMet: 0.669 ± 0.19
0.608GlnAsn: 0.608 ± 0.222
1.277GlnPro: 1.277 ± 0.321
1.217GlnGln: 1.217 ± 0.362
1.825GlnArg: 1.825 ± 0.316
1.095GlnSer: 1.095 ± 0.256
1.946GlnThr: 1.946 ± 0.32
3.102GlnVal: 3.102 ± 0.46
1.277GlnTrp: 1.277 ± 0.302
1.277GlnTyr: 1.277 ± 0.291
0.0GlnXaa: 0.0 ± 0.0
Arg
5.414ArgAla: 5.414 ± 0.747
1.095ArgCys: 1.095 ± 0.39
3.893ArgAsp: 3.893 ± 0.432
4.562ArgGlu: 4.562 ± 0.557
2.251ArgPhe: 2.251 ± 0.4
3.71ArgGly: 3.71 ± 0.526
1.277ArgHis: 1.277 ± 0.307
3.528ArgIle: 3.528 ± 0.516
2.981ArgLys: 2.981 ± 0.413
5.535ArgLeu: 5.535 ± 0.587
1.582ArgMet: 1.582 ± 0.283
1.825ArgAsn: 1.825 ± 0.275
2.433ArgPro: 2.433 ± 0.365
2.555ArgGln: 2.555 ± 0.45
5.779ArgArg: 5.779 ± 0.777
2.616ArgSer: 2.616 ± 0.392
3.589ArgThr: 3.589 ± 0.377
4.805ArgVal: 4.805 ± 0.584
1.034ArgTrp: 1.034 ± 0.231
2.433ArgTyr: 2.433 ± 0.478
0.0ArgXaa: 0.0 ± 0.0
Ser
4.623SerAla: 4.623 ± 0.582
0.487SerCys: 0.487 ± 0.206
3.589SerAsp: 3.589 ± 0.434
4.136SerGlu: 4.136 ± 0.4
1.764SerPhe: 1.764 ± 0.365
4.684SerGly: 4.684 ± 0.686
0.669SerHis: 0.669 ± 0.208
2.676SerIle: 2.676 ± 0.459
2.981SerLys: 2.981 ± 0.395
4.319SerLeu: 4.319 ± 0.642
1.095SerMet: 1.095 ± 0.278
1.703SerAsn: 1.703 ± 0.318
3.041SerPro: 3.041 ± 0.465
1.703SerGln: 1.703 ± 0.332
3.71SerArg: 3.71 ± 0.501
3.102SerSer: 3.102 ± 0.551
3.102SerThr: 3.102 ± 0.422
3.65SerVal: 3.65 ± 0.421
1.095SerTrp: 1.095 ± 0.205
1.642SerTyr: 1.642 ± 0.304
0.0SerXaa: 0.0 ± 0.0
Thr
6.326ThrAla: 6.326 ± 0.619
0.608ThrCys: 0.608 ± 0.193
4.015ThrAsp: 4.015 ± 0.47
3.163ThrGlu: 3.163 ± 0.398
1.764ThrPhe: 1.764 ± 0.269
6.752ThrGly: 6.752 ± 0.676
1.095ThrHis: 1.095 ± 0.24
3.041ThrIle: 3.041 ± 0.39
2.798ThrLys: 2.798 ± 0.452
4.319ThrLeu: 4.319 ± 0.528
1.582ThrMet: 1.582 ± 0.241
2.311ThrAsn: 2.311 ± 0.391
3.832ThrPro: 3.832 ± 0.525
2.007ThrGln: 2.007 ± 0.39
3.406ThrArg: 3.406 ± 0.56
2.494ThrSer: 2.494 ± 0.437
3.65ThrThr: 3.65 ± 0.426
4.866ThrVal: 4.866 ± 0.602
1.034ThrTrp: 1.034 ± 0.237
1.886ThrTyr: 1.886 ± 0.312
0.0ThrXaa: 0.0 ± 0.0
Val
5.353ValAla: 5.353 ± 0.673
0.973ValCys: 0.973 ± 0.231
5.657ValAsp: 5.657 ± 0.728
4.805ValGlu: 4.805 ± 0.54
2.737ValPhe: 2.737 ± 0.343
4.258ValGly: 4.258 ± 0.519
1.095ValHis: 1.095 ± 0.239
3.954ValIle: 3.954 ± 0.592
4.319ValLys: 4.319 ± 0.371
5.231ValLeu: 5.231 ± 0.613
1.703ValMet: 1.703 ± 0.337
3.65ValAsn: 3.65 ± 0.566
2.859ValPro: 2.859 ± 0.468
2.433ValGln: 2.433 ± 0.399
4.075ValArg: 4.075 ± 0.411
4.745ValSer: 4.745 ± 0.578
4.805ValThr: 4.805 ± 0.569
4.988ValVal: 4.988 ± 0.617
1.582ValTrp: 1.582 ± 0.306
2.068ValTyr: 2.068 ± 0.385
0.0ValXaa: 0.0 ± 0.0
Trp
2.372TrpAla: 2.372 ± 0.43
0.365TrpCys: 0.365 ± 0.148
1.521TrpAsp: 1.521 ± 0.313
1.582TrpGlu: 1.582 ± 0.315
0.912TrpPhe: 0.912 ± 0.249
1.886TrpGly: 1.886 ± 0.407
0.304TrpHis: 0.304 ± 0.126
1.338TrpIle: 1.338 ± 0.243
0.973TrpLys: 0.973 ± 0.263
1.338TrpLeu: 1.338 ± 0.228
0.304TrpMet: 0.304 ± 0.124
0.973TrpAsn: 0.973 ± 0.313
0.912TrpPro: 0.912 ± 0.237
1.46TrpGln: 1.46 ± 0.293
1.034TrpArg: 1.034 ± 0.235
1.521TrpSer: 1.521 ± 0.332
0.973TrpThr: 0.973 ± 0.222
1.156TrpVal: 1.156 ± 0.266
0.487TrpTrp: 0.487 ± 0.177
0.426TrpTyr: 0.426 ± 0.175
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.92TyrAla: 2.92 ± 0.415
0.122TyrCys: 0.122 ± 0.073
1.825TyrAsp: 1.825 ± 0.325
2.251TyrGlu: 2.251 ± 0.336
0.852TyrPhe: 0.852 ± 0.217
2.19TyrGly: 2.19 ± 0.358
0.365TyrHis: 0.365 ± 0.138
1.034TyrIle: 1.034 ± 0.245
1.034TyrLys: 1.034 ± 0.235
2.616TyrLeu: 2.616 ± 0.39
0.852TyrMet: 0.852 ± 0.225
0.73TyrAsn: 0.73 ± 0.174
1.886TyrPro: 1.886 ± 0.342
1.217TyrGln: 1.217 ± 0.182
2.007TyrArg: 2.007 ± 0.429
1.582TyrSer: 1.582 ± 0.291
2.007TyrThr: 2.007 ± 0.305
1.886TyrVal: 1.886 ± 0.348
0.852TyrTrp: 0.852 ± 0.253
0.669TyrTyr: 0.669 ± 0.205
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 96 proteins (16441 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski