Amino acid dipeptide frequency for Mycobacterium phage Lokk

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
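As an illustration of the idea, the following minimal sketch counts overlapping adjacent residue pairs in a set of protein sequences and reports them in per mille. It is not the database's own code: the function name `dipeptide_frequencies`, the toy sequences, the use of one-letter codes, and the choice to normalize by the total number of counted pairs are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Adjacent residue pairs are counted with overlap, and pairs never
    span two different proteins. Normalizing by the number of counted
    pairs is an assumption; the database may use another denominator.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every pair of neighbouring residues
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not taken from the phage
toy_proteome = ["MAAG", "MAG"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(pair, value)
```

The table on this page uses three-letter codes (for example AlaGly rather than AG); mapping between the two notations is left out to keep the sketch short.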

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
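The unit conversion and the uniform baseline can be checked with a few lines of arithmetic. The sketch below is only an illustration: the helper names are hypothetical, and treating the uniform value of 1/400 as the exact cut-off between over- and underrepresented dipeptides is an assumption about how the page assigns its colours. The three example values are taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# Uniform baseline: every dipeptide equally likely
uniform_expected = 1000.0 / (20 ** 2)           # 2.5 per mille (0.25%)
uniform_expected_with_xaa = 1000.0 / (21 ** 2)  # about 2.27 per mille


def label(value, expected=uniform_expected):
    """Classify a per mille value against the baseline (assumed cut-off)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values copied from the table, in per mille
examples = {"AlaAla": 11.8, "CysCys": 0.315, "LysLys": 2.587}
labels = {pair: label(value) for pair, value in examples.items()}
for pair, value in examples.items():
    print(pair, per_mille_to_fraction(value), labels[pair])
```

Note that a uniform baseline ignores the fact that single amino acids are themselves unequally frequent, so a dipeptide of two common residues will tend to appear overrepresented under this comparison.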

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.8 ± 1.052
AlaCys: 0.568 ± 0.204
AlaAsp: 5.174 ± 0.499
AlaGlu: 7.951 ± 0.812
AlaPhe: 3.344 ± 0.546
AlaGly: 8.014 ± 0.857
AlaHis: 1.577 ± 0.279
AlaIle: 4.101 ± 0.49
AlaLys: 4.859 ± 0.527
AlaLeu: 8.897 ± 1.027
AlaMet: 2.272 ± 0.343
AlaAsn: 3.344 ± 0.541
AlaPro: 4.606 ± 0.653
AlaGln: 3.723 ± 0.713
AlaArg: 5.994 ± 0.672
AlaSer: 4.859 ± 0.526
AlaThr: 4.922 ± 0.469
AlaVal: 7.761 ± 0.806
AlaTrp: 1.577 ± 0.262
AlaTyr: 2.903 ± 0.481
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.82 ± 0.233
CysCys: 0.315 ± 0.154
CysAsp: 0.946 ± 0.269
CysGlu: 0.379 ± 0.126
CysPhe: 0.189 ± 0.114
CysGly: 0.883 ± 0.225
CysHis: 0.189 ± 0.142
CysIle: 0.189 ± 0.152
CysLys: 0.568 ± 0.18
CysLeu: 0.757 ± 0.219
CysMet: 0.126 ± 0.089
CysAsn: 0.315 ± 0.111
CysPro: 0.757 ± 0.299
CysGln: 0.063 ± 0.068
CysArg: 0.442 ± 0.168
CysSer: 0.315 ± 0.14
CysThr: 0.82 ± 0.243
CysVal: 0.568 ± 0.169
CysTrp: 0.379 ± 0.157
CysTyr: 0.252 ± 0.131
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.878 ± 0.646
AspCys: 0.568 ± 0.177
AspAsp: 4.228 ± 0.67
AspGlu: 4.165 ± 0.679
AspPhe: 2.524 ± 0.389
AspGly: 5.805 ± 0.662
AspHis: 1.262 ± 0.362
AspIle: 3.344 ± 0.452
AspLys: 2.461 ± 0.444
AspLeu: 5.931 ± 0.693
AspMet: 1.704 ± 0.325
AspAsn: 0.883 ± 0.243
AspPro: 5.237 ± 0.595
AspGln: 2.524 ± 0.398
AspArg: 2.839 ± 0.485
AspSer: 2.966 ± 0.421
AspThr: 3.218 ± 0.444
AspVal: 4.101 ± 0.529
AspTrp: 1.388 ± 0.259
AspTyr: 2.335 ± 0.282
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.247 ± 0.833
GluCys: 0.252 ± 0.113
GluAsp: 4.165 ± 0.623
GluGlu: 4.291 ± 0.707
GluPhe: 2.966 ± 0.417
GluGly: 5.49 ± 0.682
GluHis: 1.388 ± 0.337
GluIle: 3.66 ± 0.565
GluLys: 2.461 ± 0.296
GluLeu: 6.247 ± 0.804
GluMet: 2.335 ± 0.339
GluAsn: 2.019 ± 0.329
GluPro: 2.587 ± 0.407
GluGln: 2.713 ± 0.292
GluArg: 4.291 ± 0.481
GluSer: 2.713 ± 0.402
GluThr: 4.101 ± 0.436
GluVal: 5.363 ± 0.551
GluTrp: 1.325 ± 0.282
GluTyr: 1.704 ± 0.383
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.597 ± 0.464
PheCys: 0.442 ± 0.191
PheAsp: 2.461 ± 0.437
PheGlu: 2.587 ± 0.342
PhePhe: 0.757 ± 0.227
PheGly: 3.029 ± 0.438
PheHis: 0.883 ± 0.285
PheIle: 1.577 ± 0.355
PheLys: 1.01 ± 0.284
PheLeu: 2.65 ± 0.538
PheMet: 0.505 ± 0.163
PheAsn: 1.893 ± 0.341
PhePro: 1.577 ± 0.323
PheGln: 1.325 ± 0.251
PheArg: 2.082 ± 0.439
PheSer: 1.956 ± 0.32
PheThr: 2.145 ± 0.336
PheVal: 2.524 ± 0.464
PheTrp: 0.379 ± 0.173
PheTyr: 0.694 ± 0.201
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.941 ± 0.909
GlyCys: 0.694 ± 0.243
GlyAsp: 6.247 ± 0.719
GlyGlu: 4.291 ± 0.625
GlyPhe: 3.407 ± 0.578
GlyGly: 6.941 ± 1.421
GlyHis: 1.514 ± 0.313
GlyIle: 4.669 ± 0.657
GlyLys: 4.543 ± 0.554
GlyLeu: 6.689 ± 0.934
GlyMet: 1.893 ± 0.281
GlyAsn: 3.66 ± 0.599
GlyPro: 4.291 ± 1.175
GlyGln: 3.407 ± 0.493
GlyArg: 4.038 ± 0.619
GlySer: 4.48 ± 0.489
GlyThr: 5.49 ± 0.714
GlyVal: 6.436 ± 0.738
GlyTrp: 1.325 ± 0.266
GlyTyr: 2.335 ± 0.37
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.514 ± 0.302
HisCys: 0.252 ± 0.125
HisAsp: 1.262 ± 0.254
HisGlu: 1.514 ± 0.316
HisPhe: 0.315 ± 0.15
HisGly: 1.767 ± 0.325
HisHis: 0.379 ± 0.147
HisIle: 1.325 ± 0.249
HisLys: 0.82 ± 0.243
HisLeu: 1.893 ± 0.415
HisMet: 0.315 ± 0.134
HisAsn: 0.631 ± 0.202
HisPro: 0.946 ± 0.232
HisGln: 0.631 ± 0.208
HisArg: 1.325 ± 0.307
HisSer: 1.199 ± 0.329
HisThr: 0.757 ± 0.208
HisVal: 1.073 ± 0.251
HisTrp: 0.568 ± 0.203
HisTyr: 0.568 ± 0.213
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.048 ± 0.513
IleCys: 0.505 ± 0.179
IleAsp: 4.48 ± 0.558
IleGlu: 4.922 ± 0.636
IlePhe: 1.451 ± 0.267
IleGly: 4.228 ± 0.638
IleHis: 1.01 ± 0.264
IleIle: 2.587 ± 0.392
IleLys: 2.903 ± 0.515
IleLeu: 3.155 ± 0.437
IleMet: 0.379 ± 0.174
IleAsn: 2.272 ± 0.355
IlePro: 3.344 ± 0.405
IleGln: 1.956 ± 0.504
IleArg: 3.155 ± 0.441
IleSer: 2.335 ± 0.498
IleThr: 3.344 ± 0.436
IleVal: 2.587 ± 0.388
IleTrp: 0.82 ± 0.225
IleTyr: 1.073 ± 0.244
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.922 ± 0.703
LysCys: 0.315 ± 0.148
LysAsp: 2.398 ± 0.415
LysGlu: 2.587 ± 0.367
LysPhe: 1.073 ± 0.299
LysGly: 3.534 ± 0.683
LysHis: 0.757 ± 0.225
LysIle: 1.956 ± 0.368
LysLys: 2.587 ± 0.444
LysLeu: 3.66 ± 0.499
LysMet: 1.073 ± 0.264
LysAsn: 1.514 ± 0.323
LysPro: 3.155 ± 0.642
LysGln: 1.577 ± 0.336
LysArg: 3.597 ± 0.459
LysSer: 2.839 ± 0.409
LysThr: 3.155 ± 0.408
LysVal: 3.975 ± 0.456
LysTrp: 0.757 ± 0.233
LysTyr: 1.262 ± 0.312
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.834 ± 0.806
LeuCys: 0.631 ± 0.18
LeuAsp: 5.237 ± 0.554
LeuGlu: 5.111 ± 0.641
LeuPhe: 2.966 ± 0.353
LeuGly: 5.931 ± 0.85
LeuHis: 1.767 ± 0.321
LeuIle: 4.669 ± 0.607
LeuLys: 3.66 ± 0.431
LeuLeu: 6.184 ± 0.58
LeuMet: 2.019 ± 0.368
LeuAsn: 2.335 ± 0.438
LeuPro: 4.543 ± 0.524
LeuGln: 2.65 ± 0.46
LeuArg: 6.058 ± 0.741
LeuSer: 5.868 ± 0.664
LeuThr: 4.859 ± 0.577
LeuVal: 4.543 ± 0.59
LeuTrp: 1.262 ± 0.285
LeuTyr: 2.145 ± 0.367
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.145 ± 0.284
MetCys: 0.063 ± 0.068
MetAsp: 1.199 ± 0.267
MetGlu: 0.946 ± 0.217
MetPhe: 0.694 ± 0.219
MetGly: 1.451 ± 0.303
MetHis: 0.442 ± 0.165
MetIle: 1.262 ± 0.251
MetLys: 1.073 ± 0.24
MetLeu: 1.704 ± 0.321
MetMet: 0.757 ± 0.216
MetAsn: 0.946 ± 0.242
MetPro: 1.388 ± 0.292
MetGln: 0.883 ± 0.208
MetArg: 1.514 ± 0.306
MetSer: 2.208 ± 0.349
MetThr: 2.208 ± 0.324
MetVal: 1.136 ± 0.245
MetTrp: 0.189 ± 0.116
MetTyr: 0.82 ± 0.195
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.597 ± 0.546
AsnCys: 0.505 ± 0.187
AsnAsp: 1.767 ± 0.286
AsnGlu: 1.451 ± 0.282
AsnPhe: 0.883 ± 0.314
AsnGly: 3.912 ± 0.501
AsnHis: 0.946 ± 0.186
AsnIle: 1.641 ± 0.319
AsnLys: 1.199 ± 0.293
AsnLeu: 2.903 ± 0.41
AsnMet: 0.694 ± 0.194
AsnAsn: 0.694 ± 0.2
AsnPro: 2.082 ± 0.437
AsnGln: 1.325 ± 0.274
AsnArg: 1.704 ± 0.36
AsnSer: 1.262 ± 0.271
AsnThr: 1.577 ± 0.345
AsnVal: 2.65 ± 0.441
AsnTrp: 0.946 ± 0.251
AsnTyr: 0.82 ± 0.207
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.805 ± 0.711
ProCys: 0.315 ± 0.159
ProAsp: 3.849 ± 0.428
ProGlu: 4.669 ± 0.546
ProPhe: 1.956 ± 0.431
ProGly: 4.859 ± 0.559
ProHis: 1.01 ± 0.273
ProIle: 3.029 ± 0.367
ProLys: 2.776 ± 0.61
ProLeu: 3.155 ± 0.467
ProMet: 1.01 ± 0.281
ProAsn: 1.641 ± 0.306
ProPro: 2.587 ± 0.534
ProGln: 2.335 ± 0.65
ProArg: 3.597 ± 0.611
ProSer: 3.155 ± 0.398
ProThr: 3.597 ± 0.556
ProVal: 3.975 ± 0.461
ProTrp: 1.136 ± 0.374
ProTyr: 1.262 ± 0.282
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.669 ± 0.734
GlnCys: 0.126 ± 0.095
GlnAsp: 1.388 ± 0.268
GlnGlu: 2.019 ± 0.34
GlnPhe: 1.325 ± 0.263
GlnGly: 4.606 ± 1.573
GlnHis: 0.883 ± 0.195
GlnIle: 2.776 ± 0.463
GlnLys: 1.641 ± 0.357
GlnLeu: 2.966 ± 0.658
GlnMet: 1.136 ± 0.219
GlnAsn: 1.01 ± 0.217
GlnPro: 1.514 ± 0.313
GlnGln: 1.956 ± 0.358
GlnArg: 2.335 ± 0.376
GlnSer: 1.767 ± 0.387
GlnThr: 2.461 ± 0.39
GlnVal: 2.461 ± 0.469
GlnTrp: 0.757 ± 0.231
GlnTyr: 0.883 ± 0.226
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.417 ± 0.564
ArgCys: 0.883 ± 0.275
ArgAsp: 3.597 ± 0.622
ArgGlu: 5.3 ± 0.822
ArgPhe: 2.272 ± 0.317
ArgGly: 4.228 ± 0.517
ArgHis: 1.325 ± 0.239
ArgIle: 3.66 ± 0.481
ArgLys: 3.597 ± 0.586
ArgLeu: 6.184 ± 0.744
ArgMet: 2.019 ± 0.312
ArgAsn: 2.461 ± 0.503
ArgPro: 2.713 ± 0.471
ArgGln: 1.83 ± 0.337
ArgArg: 5.048 ± 0.66
ArgSer: 2.524 ± 0.433
ArgThr: 2.903 ± 0.46
ArgVal: 4.354 ± 0.48
ArgTrp: 1.388 ± 0.282
ArgTyr: 1.704 ± 0.319
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.291 ± 0.483
SerCys: 0.631 ± 0.194
SerAsp: 3.786 ± 0.435
SerGlu: 3.47 ± 0.397
SerPhe: 2.019 ± 0.472
SerGly: 4.543 ± 0.617
SerHis: 0.883 ± 0.198
SerIle: 2.587 ± 0.411
SerLys: 2.398 ± 0.508
SerLeu: 4.48 ± 0.622
SerMet: 1.262 ± 0.271
SerAsn: 1.262 ± 0.247
SerPro: 3.912 ± 0.475
SerGln: 2.208 ± 0.363
SerArg: 3.66 ± 0.512
SerSer: 3.092 ± 0.516
SerThr: 3.786 ± 0.453
SerVal: 3.849 ± 0.563
SerTrp: 1.073 ± 0.274
SerTyr: 1.704 ± 0.297
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.247 ± 0.579
ThrCys: 0.82 ± 0.245
ThrAsp: 3.534 ± 0.669
ThrGlu: 3.281 ± 0.5
ThrPhe: 2.145 ± 0.361
ThrGly: 5.363 ± 0.652
ThrHis: 0.757 ± 0.221
ThrIle: 2.713 ± 0.399
ThrLys: 2.966 ± 0.416
ThrLeu: 5.427 ± 0.603
ThrMet: 1.199 ± 0.253
ThrAsn: 1.577 ± 0.341
ThrPro: 4.038 ± 0.496
ThrGln: 2.587 ± 0.399
ThrArg: 2.966 ± 0.467
ThrSer: 3.723 ± 0.446
ThrThr: 3.344 ± 0.443
ThrVal: 4.922 ± 0.7
ThrTrp: 1.073 ± 0.296
ThrTyr: 2.335 ± 0.348
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.373 ± 0.777
ValCys: 0.883 ± 0.242
ValAsp: 5.616 ± 0.652
ValGlu: 4.606 ± 0.528
ValPhe: 2.461 ± 0.407
ValGly: 5.3 ± 0.696
ValHis: 1.136 ± 0.236
ValIle: 3.534 ± 0.482
ValLys: 3.218 ± 0.359
ValLeu: 4.922 ± 0.627
ValMet: 1.073 ± 0.265
ValAsn: 2.713 ± 0.483
ValPro: 3.723 ± 0.551
ValGln: 2.65 ± 0.461
ValArg: 4.606 ± 0.623
ValSer: 4.669 ± 0.642
ValThr: 5.174 ± 0.726
ValVal: 5.3 ± 0.557
ValTrp: 1.199 ± 0.302
ValTyr: 1.893 ± 0.468
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.83 ± 0.414
TrpCys: 0.315 ± 0.14
TrpAsp: 0.946 ± 0.231
TrpGlu: 1.01 ± 0.241
TrpPhe: 0.631 ± 0.208
TrpGly: 1.136 ± 0.27
TrpHis: 0.442 ± 0.193
TrpIle: 1.073 ± 0.264
TrpLys: 0.946 ± 0.246
TrpLeu: 0.946 ± 0.239
TrpMet: 0.505 ± 0.186
TrpAsn: 0.631 ± 0.196
TrpPro: 1.073 ± 0.293
TrpGln: 1.262 ± 0.281
TrpArg: 1.262 ± 0.257
TrpSer: 1.388 ± 0.296
TrpThr: 1.451 ± 0.368
TrpVal: 1.073 ± 0.219
TrpTrp: 0.379 ± 0.178
TrpTyr: 0.315 ± 0.141
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.335 ± 0.394
TyrCys: 0.189 ± 0.115
TyrAsp: 2.082 ± 0.387
TyrGlu: 1.767 ± 0.307
TyrPhe: 0.694 ± 0.176
TyrGly: 2.272 ± 0.402
TyrHis: 0.505 ± 0.17
TyrIle: 1.199 ± 0.256
TyrLys: 1.01 ± 0.256
TyrLeu: 2.524 ± 0.41
TyrMet: 0.694 ± 0.202
TyrAsn: 0.757 ± 0.21
TyrPro: 1.641 ± 0.354
TyrGln: 1.073 ± 0.241
TyrArg: 1.893 ± 0.344
TyrSer: 1.577 ± 0.287
TyrThr: 1.704 ± 0.301
TyrVal: 2.398 ± 0.41
TyrTrp: 0.694 ± 0.243
TyrTyr: 0.883 ± 0.232
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 89 proteins (15849 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
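One way to read this note is that whole proteins are resampled with replacement 100 times and the spread of the recomputed frequency is reported. The sketch below follows that reading under stated assumptions: the function names are hypothetical, the error is taken to be the sample standard deviation across resamples, and frequencies are normalized by the number of counted pairs. The page does not confirm any of these details.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Proteins, not individual residues, are resampled with replacement.
    Using the standard deviation of the resampled frequencies as the
    error is an assumption made for this sketch.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not taken from the phage
toy_proteome = ["MAAG", "MAG", "MKKA"]
print(bootstrap_error(toy_proteome, "AA"))
```

Resampling at the protein level keeps the pairs from one protein together, so the estimate reflects variation between proteins rather than treating every pair as independent.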

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski