Amino acid dipeptide frequency for Mycobacterium phage Lucyedi

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be expected at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
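The counting described above can be sketched in Python. This is a minimal illustration under stated assumptions, not necessarily how the values on this page were produced: dipeptides are taken as overlapping adjacent pairs within each protein, any letter outside the 20 standard one-letter codes is mapped to X (standing in for Xaa), and the function name `dipeptide_permille` and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard amino acids, one-letter codes

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumed conventions (not confirmed by the source): pairs are
    overlapping, never span two proteins, and every non-standard
    letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation from the text: 1/400 of all dipeptides = 0.25 % = 2.5 per mille.
EXPECTED_UNIFORM_PERMILLE = 1000.0 / 400

toy = ["MAAGL", "MAG"]  # hypothetical sequences, for illustration only
freqs = dipeptide_permille(toy)
# Dipeptides above the uniform expectation (the ones the page would mark in red).
over = sorted(p for p, f in freqs.items() if f > EXPECTED_UNIFORM_PERMILLE)
```

With only six dipeptides in the toy input every observed pair is far above 2.5‰; the comparison becomes meaningful only on a full proteome.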

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
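As a small worked example of the unit conversion, the Python below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the input 11.134 is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0

ala_ala_fraction = permille_to_fraction(11.134)  # about 0.011134
ala_ala_percent = permille_to_percent(11.134)    # about 1.1134 %
```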

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.134 ± 1.22
AlaCys: 0.73 ± 0.225
AlaAsp: 5.537 ± 0.686
AlaGlu: 6.997 ± 0.74
AlaPhe: 3.285 ± 0.453
AlaGly: 7.605 ± 0.716
AlaHis: 1.46 ± 0.286
AlaIle: 4.807 ± 0.557
AlaLys: 5.658 ± 0.797
AlaLeu: 8.335 ± 0.719
AlaMet: 2.616 ± 0.381
AlaAsn: 2.799 ± 0.397
AlaPro: 5.232 ± 0.669
AlaGln: 3.042 ± 0.465
AlaArg: 5.719 ± 0.546
AlaSer: 4.685 ± 0.551
AlaThr: 5.05 ± 0.504
AlaVal: 6.51 ± 0.7
AlaTrp: 1.886 ± 0.361
AlaTyr: 2.555 ± 0.524
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.608 ± 0.162
CysCys: 0.0 ± 0.0
CysAsp: 0.852 ± 0.218
CysGlu: 0.73 ± 0.21
CysPhe: 0.183 ± 0.101
CysGly: 0.791 ± 0.197
CysHis: 0.304 ± 0.118
CysIle: 0.243 ± 0.112
CysLys: 0.183 ± 0.116
CysLeu: 0.608 ± 0.212
CysMet: 0.183 ± 0.111
CysAsn: 0.426 ± 0.185
CysPro: 0.608 ± 0.223
CysGln: 0.304 ± 0.141
CysArg: 0.608 ± 0.223
CysSer: 0.791 ± 0.24
CysThr: 0.426 ± 0.18
CysVal: 0.73 ± 0.185
CysTrp: 0.365 ± 0.157
CysTyr: 0.304 ± 0.123
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.597 ± 0.675
AspCys: 0.791 ± 0.239
AspAsp: 3.651 ± 0.494
AspGlu: 4.076 ± 0.542
AspPhe: 2.373 ± 0.365
AspGly: 5.719 ± 0.613
AspHis: 1.582 ± 0.313
AspIle: 3.346 ± 0.353
AspLys: 2.92 ± 0.412
AspLeu: 6.267 ± 0.708
AspMet: 1.582 ± 0.326
AspAsn: 1.704 ± 0.302
AspPro: 4.989 ± 0.506
AspGln: 1.521 ± 0.337
AspArg: 3.164 ± 0.355
AspSer: 3.346 ± 0.476
AspThr: 4.016 ± 0.461
AspVal: 5.172 ± 0.48
AspTrp: 1.399 ± 0.311
AspTyr: 2.616 ± 0.39
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.153 ± 0.878
GluCys: 0.183 ± 0.111
GluAsp: 5.05 ± 0.657
GluGlu: 4.381 ± 0.64
GluPhe: 2.069 ± 0.362
GluGly: 5.293 ± 0.54
GluHis: 1.825 ± 0.385
GluIle: 4.076 ± 0.505
GluLys: 3.042 ± 0.56
GluLeu: 6.449 ± 0.766
GluMet: 1.704 ± 0.328
GluAsn: 2.495 ± 0.36
GluPro: 2.92 ± 0.421
GluGln: 2.008 ± 0.332
GluArg: 4.381 ± 0.547
GluSer: 3.103 ± 0.496
GluThr: 3.529 ± 0.481
GluVal: 4.563 ± 0.546
GluTrp: 1.521 ± 0.355
GluTyr: 1.825 ± 0.333
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.92 ± 0.364
PheCys: 0.426 ± 0.183
PheAsp: 2.434 ± 0.439
PheGlu: 2.008 ± 0.369
PhePhe: 0.852 ± 0.281
PheGly: 3.042 ± 0.348
PheHis: 0.608 ± 0.244
PheIle: 1.339 ± 0.246
PheLys: 1.278 ± 0.297
PheLeu: 2.069 ± 0.3
PheMet: 0.852 ± 0.224
PheAsn: 1.46 ± 0.309
PhePro: 2.312 ± 0.404
PheGln: 1.399 ± 0.314
PheArg: 2.069 ± 0.358
PheSer: 2.373 ± 0.361
PheThr: 2.677 ± 0.332
PheVal: 2.19 ± 0.349
PheTrp: 0.548 ± 0.164
PheTyr: 0.608 ± 0.193
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.449 ± 0.821
GlyCys: 0.791 ± 0.207
GlyAsp: 5.232 ± 0.608
GlyGlu: 4.867 ± 0.596
GlyPhe: 3.164 ± 0.423
GlyGly: 9.248 ± 2.149
GlyHis: 1.886 ± 0.331
GlyIle: 3.711 ± 0.408
GlyLys: 4.32 ± 0.396
GlyLeu: 6.875 ± 0.874
GlyMet: 1.582 ± 0.277
GlyAsn: 2.86 ± 0.521
GlyPro: 3.407 ± 0.413
GlyGln: 2.86 ± 0.465
GlyArg: 3.894 ± 0.383
GlySer: 5.05 ± 0.725
GlyThr: 4.928 ± 0.65
GlyVal: 5.902 ± 0.56
GlyTrp: 1.825 ± 0.292
GlyTyr: 2.373 ± 0.353
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.886 ± 0.335
HisCys: 0.243 ± 0.109
HisAsp: 1.339 ± 0.294
HisGlu: 1.764 ± 0.369
HisPhe: 0.608 ± 0.211
HisGly: 1.521 ± 0.314
HisHis: 0.487 ± 0.161
HisIle: 1.156 ± 0.265
HisLys: 1.034 ± 0.256
HisLeu: 1.217 ± 0.297
HisMet: 0.061 ± 0.054
HisAsn: 0.608 ± 0.208
HisPro: 1.704 ± 0.373
HisGln: 0.791 ± 0.192
HisArg: 1.643 ± 0.271
HisSer: 0.973 ± 0.233
HisThr: 1.156 ± 0.222
HisVal: 1.034 ± 0.286
HisTrp: 0.243 ± 0.122
HisTyr: 0.487 ± 0.252
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.841 ± 0.621
IleCys: 0.365 ± 0.144
IleAsp: 4.381 ± 0.491
IleGlu: 4.624 ± 0.504
IlePhe: 1.46 ± 0.25
IleGly: 3.529 ± 0.442
IleHis: 1.217 ± 0.299
IleIle: 1.947 ± 0.361
IleLys: 2.738 ± 0.418
IleLeu: 3.346 ± 0.411
IleMet: 0.548 ± 0.193
IleAsn: 2.19 ± 0.303
IlePro: 3.285 ± 0.433
IleGln: 1.095 ± 0.273
IleArg: 3.833 ± 0.441
IleSer: 2.92 ± 0.371
IleThr: 2.92 ± 0.468
IleVal: 2.555 ± 0.449
IleTrp: 0.669 ± 0.204
IleTyr: 1.095 ± 0.27
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.807 ± 0.639
LysCys: 0.243 ± 0.122
LysAsp: 3.164 ± 0.506
LysGlu: 2.738 ± 0.468
LysPhe: 1.278 ± 0.232
LysGly: 3.468 ± 0.454
LysHis: 0.608 ± 0.203
LysIle: 2.738 ± 0.472
LysLys: 3.711 ± 0.55
LysLeu: 3.651 ± 0.474
LysMet: 1.339 ± 0.306
LysAsn: 1.095 ± 0.229
LysPro: 2.86 ± 0.591
LysGln: 1.764 ± 0.409
LysArg: 3.164 ± 0.446
LysSer: 2.555 ± 0.396
LysThr: 3.346 ± 0.389
LysVal: 4.685 ± 0.566
LysTrp: 0.852 ± 0.399
LysTyr: 1.278 ± 0.346
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.97 ± 0.658
LeuCys: 0.73 ± 0.188
LeuAsp: 4.441 ± 0.56
LeuGlu: 5.902 ± 0.79
LeuPhe: 2.677 ± 0.328
LeuGly: 5.476 ± 0.658
LeuHis: 1.764 ± 0.416
LeuIle: 4.076 ± 0.533
LeuLys: 3.833 ± 0.489
LeuLeu: 4.928 ± 0.62
LeuMet: 2.129 ± 0.429
LeuAsn: 2.373 ± 0.391
LeuPro: 4.624 ± 0.509
LeuGln: 2.555 ± 0.46
LeuArg: 5.963 ± 0.624
LeuSer: 5.658 ± 0.522
LeuThr: 4.198 ± 0.457
LeuVal: 5.537 ± 0.555
LeuTrp: 1.886 ± 0.344
LeuTyr: 2.312 ± 0.364
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.312 ± 0.393
MetCys: 0.122 ± 0.092
MetAsp: 0.973 ± 0.224
MetGlu: 1.278 ± 0.273
MetPhe: 0.426 ± 0.131
MetGly: 1.217 ± 0.366
MetHis: 0.426 ± 0.144
MetIle: 1.278 ± 0.315
MetLys: 1.399 ± 0.269
MetLeu: 1.704 ± 0.331
MetMet: 0.426 ± 0.17
MetAsn: 0.852 ± 0.232
MetPro: 1.217 ± 0.308
MetGln: 0.73 ± 0.24
MetArg: 1.521 ± 0.287
MetSer: 2.19 ± 0.345
MetThr: 2.799 ± 0.381
MetVal: 1.399 ± 0.308
MetTrp: 0.304 ± 0.117
MetTyr: 0.548 ± 0.205
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.042 ± 0.485
AsnCys: 0.365 ± 0.166
AsnAsp: 1.886 ± 0.292
AsnGlu: 2.616 ± 0.396
AsnPhe: 1.217 ± 0.312
AsnGly: 3.407 ± 0.409
AsnHis: 0.791 ± 0.18
AsnIle: 1.582 ± 0.3
AsnLys: 1.46 ± 0.285
AsnLeu: 2.555 ± 0.416
AsnMet: 0.73 ± 0.208
AsnAsn: 0.913 ± 0.276
AsnPro: 2.008 ± 0.427
AsnGln: 1.034 ± 0.218
AsnArg: 1.947 ± 0.305
AsnSer: 2.312 ± 0.382
AsnThr: 1.947 ± 0.364
AsnVal: 2.129 ± 0.407
AsnTrp: 0.791 ± 0.217
AsnTyr: 0.669 ± 0.201
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.928 ± 0.627
ProCys: 0.548 ± 0.205
ProAsp: 4.32 ± 0.463
ProGlu: 4.867 ± 0.5
ProPhe: 2.129 ± 0.36
ProGly: 5.354 ± 0.747
ProHis: 0.973 ± 0.279
ProIle: 2.799 ± 0.426
ProLys: 2.129 ± 0.409
ProLeu: 3.164 ± 0.427
ProMet: 0.852 ± 0.248
ProAsn: 2.86 ± 0.419
ProPro: 2.069 ± 0.372
ProGln: 2.129 ± 0.371
ProArg: 3.59 ± 0.578
ProSer: 2.251 ± 0.413
ProThr: 4.137 ± 0.574
ProVal: 4.016 ± 0.452
ProTrp: 0.852 ± 0.315
ProTyr: 1.521 ± 0.254
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.198 ± 0.698
GlnCys: 0.183 ± 0.105
GlnAsp: 1.521 ± 0.356
GlnGlu: 1.582 ± 0.305
GlnPhe: 1.156 ± 0.239
GlnGly: 2.738 ± 0.36
GlnHis: 0.608 ± 0.153
GlnIle: 2.373 ± 0.429
GlnLys: 1.339 ± 0.261
GlnLeu: 2.555 ± 0.408
GlnMet: 0.852 ± 0.233
GlnAsn: 0.73 ± 0.199
GlnPro: 1.704 ± 0.33
GlnGln: 1.704 ± 0.475
GlnArg: 2.434 ± 0.341
GlnSer: 1.886 ± 0.441
GlnThr: 1.582 ± 0.315
GlnVal: 3.164 ± 0.377
GlnTrp: 0.73 ± 0.244
GlnTyr: 0.852 ± 0.22
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.415 ± 0.525
ArgCys: 0.913 ± 0.28
ArgAsp: 3.772 ± 0.679
ArgGlu: 4.441 ± 0.445
ArgPhe: 2.495 ± 0.45
ArgGly: 3.894 ± 0.525
ArgHis: 1.46 ± 0.326
ArgIle: 3.468 ± 0.435
ArgLys: 3.346 ± 0.494
ArgLeu: 5.78 ± 0.662
ArgMet: 2.434 ± 0.407
ArgAsn: 1.886 ± 0.386
ArgPro: 2.555 ± 0.402
ArgGln: 1.886 ± 0.314
ArgArg: 5.841 ± 0.89
ArgSer: 3.59 ± 0.535
ArgThr: 2.86 ± 0.417
ArgVal: 4.259 ± 0.568
ArgTrp: 1.46 ± 0.3
ArgTyr: 1.886 ± 0.365
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.293 ± 0.568
SerCys: 0.608 ± 0.187
SerAsp: 4.685 ± 0.629
SerGlu: 4.016 ± 0.552
SerPhe: 2.19 ± 0.344
SerGly: 5.476 ± 0.676
SerHis: 0.913 ± 0.251
SerIle: 2.738 ± 0.384
SerLys: 2.799 ± 0.468
SerLeu: 4.624 ± 0.653
SerMet: 1.278 ± 0.301
SerAsn: 1.582 ± 0.375
SerPro: 3.529 ± 0.434
SerGln: 2.312 ± 0.282
SerArg: 4.32 ± 0.499
SerSer: 3.833 ± 0.536
SerThr: 2.86 ± 0.398
SerVal: 4.076 ± 0.501
SerTrp: 1.278 ± 0.316
SerTyr: 1.095 ± 0.264
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.111 ± 0.417
ThrCys: 0.608 ± 0.204
ThrAsp: 3.407 ± 0.551
ThrGlu: 3.59 ± 0.437
ThrPhe: 2.251 ± 0.36
ThrGly: 5.05 ± 0.577
ThrHis: 1.156 ± 0.273
ThrIle: 2.738 ± 0.362
ThrLys: 3.225 ± 0.448
ThrLeu: 4.867 ± 0.465
ThrMet: 1.217 ± 0.323
ThrAsn: 1.704 ± 0.253
ThrPro: 4.198 ± 0.466
ThrGln: 2.373 ± 0.364
ThrArg: 2.86 ± 0.409
ThrSer: 3.833 ± 0.504
ThrThr: 2.92 ± 0.439
ThrVal: 5.172 ± 0.647
ThrTrp: 0.973 ± 0.242
ThrTyr: 1.704 ± 0.325
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.571 ± 0.578
ValCys: 0.852 ± 0.243
ValAsp: 5.719 ± 0.494
ValGlu: 4.867 ± 0.557
ValPhe: 2.312 ± 0.457
ValGly: 4.989 ± 0.582
ValHis: 0.852 ± 0.225
ValIle: 3.955 ± 0.536
ValLys: 3.346 ± 0.399
ValLeu: 5.658 ± 0.554
ValMet: 1.825 ± 0.336
ValAsn: 3.285 ± 0.509
ValPro: 4.076 ± 0.494
ValGln: 2.008 ± 0.366
ValArg: 3.955 ± 0.547
ValSer: 4.807 ± 0.576
ValThr: 4.076 ± 0.518
ValVal: 5.719 ± 0.725
ValTrp: 1.399 ± 0.283
ValTyr: 1.825 ± 0.333
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.399 ± 0.321
TrpCys: 0.304 ± 0.148
TrpAsp: 1.278 ± 0.307
TrpGlu: 1.339 ± 0.267
TrpPhe: 0.487 ± 0.17
TrpGly: 1.156 ± 0.289
TrpHis: 0.669 ± 0.204
TrpIle: 1.095 ± 0.275
TrpLys: 0.913 ± 0.206
TrpLeu: 1.399 ± 0.332
TrpMet: 0.365 ± 0.15
TrpAsn: 0.852 ± 0.218
TrpPro: 0.913 ± 0.231
TrpGln: 1.156 ± 0.233
TrpArg: 0.791 ± 0.176
TrpSer: 1.704 ± 0.307
TrpThr: 1.339 ± 0.296
TrpVal: 1.46 ± 0.241
TrpTrp: 0.73 ± 0.255
TrpTyr: 0.548 ± 0.198
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.069 ± 0.328
TyrCys: 0.243 ± 0.137
TyrAsp: 2.19 ± 0.313
TyrGlu: 1.886 ± 0.393
TyrPhe: 0.852 ± 0.239
TyrGly: 2.251 ± 0.332
TyrHis: 0.365 ± 0.137
TyrIle: 1.095 ± 0.23
TyrLys: 0.669 ± 0.226
TyrLeu: 2.981 ± 0.479
TyrMet: 0.487 ± 0.195
TyrAsn: 0.73 ± 0.203
TyrPro: 1.278 ± 0.286
TyrGln: 1.217 ± 0.282
TyrArg: 1.947 ± 0.422
TyrSer: 1.643 ± 0.319
TyrThr: 2.19 ± 0.391
TyrVal: 1.825 ± 0.342
TyrTrp: 0.183 ± 0.107
TyrTyr: 1.034 ± 0.31
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 94 proteins (16437 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
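Protein-level bootstrapping can be sketched as follows in Python. This is an illustration under assumptions, not the pipeline behind this page: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the spread is summarized as a standard deviation. The source does not state which spread measure it reports, and the names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the per mille frequency of `pair` over resampled proteomes.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the single dipeptide.
    Returning the standard deviation is an assumption of this sketch.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)

# Hypothetical toy proteome, for illustration only.
err = bootstrap_error(["MAAGL", "MAG", "GGLK"], "MA")
```

Resampling whole proteins keeps the dipeptides of one protein together, so the estimate reflects variation between proteins rather than treating every dipeptide as an independent draw.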

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski