Amino acid dipepetide frequency for Mycobacterium phage Juice456

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.509AlaAla: 14.509 ± 1.864
1.339AlaCys: 1.339 ± 0.306
6.92AlaAsp: 6.92 ± 0.714
7.645AlaGlu: 7.645 ± 0.82
2.958AlaPhe: 2.958 ± 0.43
9.71AlaGly: 9.71 ± 1.334
2.623AlaHis: 2.623 ± 0.451
4.297AlaIle: 4.297 ± 0.598
4.297AlaLys: 4.297 ± 0.507
8.259AlaLeu: 8.259 ± 0.748
2.734AlaMet: 2.734 ± 0.396
2.79AlaAsn: 2.79 ± 0.506
4.911AlaPro: 4.911 ± 0.662
3.739AlaGln: 3.739 ± 0.482
7.478AlaArg: 7.478 ± 0.757
5.301AlaSer: 5.301 ± 0.688
6.306AlaThr: 6.306 ± 0.64
6.752AlaVal: 6.752 ± 0.578
2.958AlaTrp: 2.958 ± 0.439
2.4AlaTyr: 2.4 ± 0.33
0.0AlaXaa: 0.0 ± 0.0
Cys
0.893CysAla: 0.893 ± 0.259
0.112CysCys: 0.112 ± 0.086
0.949CysAsp: 0.949 ± 0.281
0.67CysGlu: 0.67 ± 0.208
0.167CysPhe: 0.167 ± 0.085
1.953CysGly: 1.953 ± 0.411
0.112CysHis: 0.112 ± 0.085
0.335CysIle: 0.335 ± 0.134
0.446CysLys: 0.446 ± 0.188
1.228CysLeu: 1.228 ± 0.31
0.223CysMet: 0.223 ± 0.135
0.502CysAsn: 0.502 ± 0.163
1.228CysPro: 1.228 ± 0.26
0.446CysGln: 0.446 ± 0.194
0.949CysArg: 0.949 ± 0.339
1.228CysSer: 1.228 ± 0.344
0.725CysThr: 0.725 ± 0.224
0.614CysVal: 0.614 ± 0.161
0.223CysTrp: 0.223 ± 0.112
0.112CysTyr: 0.112 ± 0.07
0.0CysXaa: 0.0 ± 0.0
Asp
6.808AspAla: 6.808 ± 0.611
1.004AspCys: 1.004 ± 0.193
4.52AspAsp: 4.52 ± 0.563
3.516AspGlu: 3.516 ± 0.501
1.507AspPhe: 1.507 ± 0.25
6.529AspGly: 6.529 ± 0.573
1.283AspHis: 1.283 ± 0.248
2.567AspIle: 2.567 ± 0.39
1.562AspLys: 1.562 ± 0.316
6.529AspLeu: 6.529 ± 0.546
1.116AspMet: 1.116 ± 0.223
1.953AspAsn: 1.953 ± 0.344
5.246AspPro: 5.246 ± 0.611
2.4AspGln: 2.4 ± 0.301
4.967AspArg: 4.967 ± 0.649
3.292AspSer: 3.292 ± 0.511
3.683AspThr: 3.683 ± 0.482
4.52AspVal: 4.52 ± 0.575
1.507AspTrp: 1.507 ± 0.26
1.842AspTyr: 1.842 ± 0.362
0.0AspXaa: 0.0 ± 0.0
Glu
6.194GluAla: 6.194 ± 0.681
1.116GluCys: 1.116 ± 0.259
3.125GluAsp: 3.125 ± 0.374
2.958GluGlu: 2.958 ± 0.497
2.455GluPhe: 2.455 ± 0.327
2.79GluGly: 2.79 ± 0.418
1.562GluHis: 1.562 ± 0.307
3.013GluIle: 3.013 ± 0.418
2.009GluLys: 2.009 ± 0.416
4.241GluLeu: 4.241 ± 0.605
1.73GluMet: 1.73 ± 0.293
1.73GluAsn: 1.73 ± 0.239
2.679GluPro: 2.679 ± 0.381
2.958GluGln: 2.958 ± 0.488
4.967GluArg: 4.967 ± 0.489
3.571GluSer: 3.571 ± 0.51
4.074GluThr: 4.074 ± 0.576
3.85GluVal: 3.85 ± 0.566
1.172GluTrp: 1.172 ± 0.242
1.507GluTyr: 1.507 ± 0.299
0.0GluXaa: 0.0 ± 0.0
Phe
3.292PheAla: 3.292 ± 0.442
0.391PheCys: 0.391 ± 0.146
2.4PheAsp: 2.4 ± 0.292
1.507PheGlu: 1.507 ± 0.31
1.004PhePhe: 1.004 ± 0.271
3.292PheGly: 3.292 ± 0.596
0.279PheHis: 0.279 ± 0.119
1.395PheIle: 1.395 ± 0.335
1.283PheLys: 1.283 ± 0.275
2.176PheLeu: 2.176 ± 0.353
0.781PheMet: 0.781 ± 0.242
1.116PheAsn: 1.116 ± 0.387
1.562PhePro: 1.562 ± 0.335
0.893PheGln: 0.893 ± 0.291
1.562PheArg: 1.562 ± 0.276
1.339PheSer: 1.339 ± 0.277
2.176PheThr: 2.176 ± 0.339
1.674PheVal: 1.674 ± 0.258
0.502PheTrp: 0.502 ± 0.149
0.837PheTyr: 0.837 ± 0.231
0.0PheXaa: 0.0 ± 0.0
Gly
10.212GlyAla: 10.212 ± 1.108
0.949GlyCys: 0.949 ± 0.234
6.585GlyAsp: 6.585 ± 0.678
4.52GlyGlu: 4.52 ± 0.585
2.79GlyPhe: 2.79 ± 0.383
11.496GlyGly: 11.496 ± 2.367
1.786GlyHis: 1.786 ± 0.323
4.297GlyIle: 4.297 ± 0.618
2.623GlyLys: 2.623 ± 0.36
5.636GlyLeu: 5.636 ± 0.517
2.623GlyMet: 2.623 ± 0.507
2.902GlyAsn: 2.902 ± 0.379
4.241GlyPro: 4.241 ± 0.589
2.121GlyGln: 2.121 ± 0.529
4.799GlyArg: 4.799 ± 0.572
5.469GlySer: 5.469 ± 0.678
6.194GlyThr: 6.194 ± 0.616
6.306GlyVal: 6.306 ± 0.802
2.902GlyTrp: 2.902 ± 0.356
2.232GlyTyr: 2.232 ± 0.351
0.0GlyXaa: 0.0 ± 0.0
His
1.842HisAla: 1.842 ± 0.317
0.335HisCys: 0.335 ± 0.157
1.116HisAsp: 1.116 ± 0.296
1.06HisGlu: 1.06 ± 0.228
0.446HisPhe: 0.446 ± 0.132
2.176HisGly: 2.176 ± 0.294
0.837HisHis: 0.837 ± 0.224
1.172HisIle: 1.172 ± 0.303
0.837HisLys: 0.837 ± 0.213
1.507HisLeu: 1.507 ± 0.276
0.391HisMet: 0.391 ± 0.137
0.781HisAsn: 0.781 ± 0.236
1.395HisPro: 1.395 ± 0.297
0.558HisGln: 0.558 ± 0.154
2.288HisArg: 2.288 ± 0.388
0.837HisSer: 0.837 ± 0.215
1.395HisThr: 1.395 ± 0.315
1.283HisVal: 1.283 ± 0.313
0.614HisTrp: 0.614 ± 0.204
0.837HisTyr: 0.837 ± 0.183
0.0HisXaa: 0.0 ± 0.0
Ile
5.636IleAla: 5.636 ± 0.534
0.614IleCys: 0.614 ± 0.238
3.85IleAsp: 3.85 ± 0.483
3.181IleGlu: 3.181 ± 0.401
0.893IlePhe: 0.893 ± 0.29
4.018IleGly: 4.018 ± 0.571
1.339IleHis: 1.339 ± 0.298
1.562IleIle: 1.562 ± 0.292
1.451IleLys: 1.451 ± 0.303
2.567IleLeu: 2.567 ± 0.423
0.279IleMet: 0.279 ± 0.111
1.953IleAsn: 1.953 ± 0.334
2.846IlePro: 2.846 ± 0.38
1.283IleGln: 1.283 ± 0.246
2.902IleArg: 2.902 ± 0.428
1.897IleSer: 1.897 ± 0.361
3.627IleThr: 3.627 ± 0.463
3.013IleVal: 3.013 ± 0.403
0.949IleTrp: 0.949 ± 0.232
0.67IleTyr: 0.67 ± 0.182
0.0IleXaa: 0.0 ± 0.0
Lys
3.627LysAla: 3.627 ± 0.433
0.391LysCys: 0.391 ± 0.145
1.618LysAsp: 1.618 ± 0.27
1.395LysGlu: 1.395 ± 0.26
1.004LysPhe: 1.004 ± 0.22
2.79LysGly: 2.79 ± 0.341
1.06LysHis: 1.06 ± 0.266
0.949LysIle: 0.949 ± 0.276
1.618LysLys: 1.618 ± 0.359
2.679LysLeu: 2.679 ± 0.427
0.67LysMet: 0.67 ± 0.164
1.06LysAsn: 1.06 ± 0.251
2.511LysPro: 2.511 ± 0.414
1.618LysGln: 1.618 ± 0.224
2.511LysArg: 2.511 ± 0.441
1.897LysSer: 1.897 ± 0.298
2.288LysThr: 2.288 ± 0.407
2.734LysVal: 2.734 ± 0.374
0.725LysTrp: 0.725 ± 0.202
0.949LysTyr: 0.949 ± 0.271
0.0LysXaa: 0.0 ± 0.0
Leu
8.259LeuAla: 8.259 ± 0.914
0.781LeuCys: 0.781 ± 0.208
5.301LeuAsp: 5.301 ± 0.688
3.404LeuGlu: 3.404 ± 0.543
1.897LeuPhe: 1.897 ± 0.3
5.469LeuGly: 5.469 ± 0.396
0.893LeuHis: 0.893 ± 0.227
3.627LeuIle: 3.627 ± 0.45
2.121LeuLys: 2.121 ± 0.377
4.241LeuLeu: 4.241 ± 0.513
1.507LeuMet: 1.507 ± 0.379
2.288LeuAsn: 2.288 ± 0.396
4.632LeuPro: 4.632 ± 0.561
2.902LeuGln: 2.902 ± 0.428
5.134LeuArg: 5.134 ± 0.569
5.357LeuSer: 5.357 ± 0.465
5.525LeuThr: 5.525 ± 0.552
5.859LeuVal: 5.859 ± 0.644
1.339LeuTrp: 1.339 ± 0.322
1.897LeuTyr: 1.897 ± 0.384
0.0LeuXaa: 0.0 ± 0.0
Met
1.842MetAla: 1.842 ± 0.308
0.335MetCys: 0.335 ± 0.134
1.06MetAsp: 1.06 ± 0.246
0.893MetGlu: 0.893 ± 0.188
0.781MetPhe: 0.781 ± 0.169
2.065MetGly: 2.065 ± 0.276
0.112MetHis: 0.112 ± 0.075
0.781MetIle: 0.781 ± 0.207
0.949MetLys: 0.949 ± 0.228
1.562MetLeu: 1.562 ± 0.214
0.614MetMet: 0.614 ± 0.229
1.004MetAsn: 1.004 ± 0.206
1.283MetPro: 1.283 ± 0.26
0.446MetGln: 0.446 ± 0.133
1.395MetArg: 1.395 ± 0.281
2.679MetSer: 2.679 ± 0.397
1.786MetThr: 1.786 ± 0.26
1.451MetVal: 1.451 ± 0.314
0.391MetTrp: 0.391 ± 0.144
0.167MetTyr: 0.167 ± 0.09
0.0MetXaa: 0.0 ± 0.0
Asn
3.516AsnAla: 3.516 ± 0.413
0.112AsnCys: 0.112 ± 0.084
1.73AsnAsp: 1.73 ± 0.309
1.953AsnGlu: 1.953 ± 0.374
0.837AsnPhe: 0.837 ± 0.285
3.906AsnGly: 3.906 ± 0.469
0.837AsnHis: 0.837 ± 0.196
1.562AsnIle: 1.562 ± 0.468
1.004AsnLys: 1.004 ± 0.224
2.176AsnLeu: 2.176 ± 0.322
0.67AsnMet: 0.67 ± 0.172
1.618AsnAsn: 1.618 ± 0.356
2.4AsnPro: 2.4 ± 0.358
1.283AsnGln: 1.283 ± 0.344
1.73AsnArg: 1.73 ± 0.356
1.507AsnSer: 1.507 ± 0.273
2.121AsnThr: 2.121 ± 0.329
1.786AsnVal: 1.786 ± 0.363
0.725AsnTrp: 0.725 ± 0.165
0.67AsnTyr: 0.67 ± 0.19
0.0AsnXaa: 0.0 ± 0.0
Pro
5.58ProAla: 5.58 ± 0.622
0.725ProCys: 0.725 ± 0.214
4.52ProAsp: 4.52 ± 0.567
4.576ProGlu: 4.576 ± 0.555
1.953ProPhe: 1.953 ± 0.38
6.808ProGly: 6.808 ± 0.716
1.339ProHis: 1.339 ± 0.28
1.786ProIle: 1.786 ± 0.246
1.786ProLys: 1.786 ± 0.271
4.129ProLeu: 4.129 ± 0.552
0.949ProMet: 0.949 ± 0.284
2.009ProAsn: 2.009 ± 0.324
3.906ProPro: 3.906 ± 0.565
2.455ProGln: 2.455 ± 0.342
3.237ProArg: 3.237 ± 0.553
3.013ProSer: 3.013 ± 0.411
2.958ProThr: 2.958 ± 0.429
4.688ProVal: 4.688 ± 0.546
0.949ProTrp: 0.949 ± 0.205
2.009ProTyr: 2.009 ± 0.352
0.0ProXaa: 0.0 ± 0.0
Gln
5.19GlnAla: 5.19 ± 0.546
0.391GlnCys: 0.391 ± 0.187
1.451GlnAsp: 1.451 ± 0.285
1.897GlnGlu: 1.897 ± 0.359
1.116GlnPhe: 1.116 ± 0.222
2.176GlnGly: 2.176 ± 0.383
0.893GlnHis: 0.893 ± 0.205
1.897GlnIle: 1.897 ± 0.312
1.339GlnLys: 1.339 ± 0.238
3.404GlnLeu: 3.404 ± 0.476
0.558GlnMet: 0.558 ± 0.178
1.172GlnAsn: 1.172 ± 0.232
2.344GlnPro: 2.344 ± 0.449
1.618GlnGln: 1.618 ± 0.426
2.009GlnArg: 2.009 ± 0.318
2.958GlnSer: 2.958 ± 0.323
1.451GlnThr: 1.451 ± 0.331
2.4GlnVal: 2.4 ± 0.388
0.837GlnTrp: 0.837 ± 0.198
1.004GlnTyr: 1.004 ± 0.276
0.0GlnXaa: 0.0 ± 0.0
Arg
5.469ArgAla: 5.469 ± 0.582
1.228ArgCys: 1.228 ± 0.31
4.799ArgAsp: 4.799 ± 0.642
5.022ArgGlu: 5.022 ± 0.758
2.455ArgPhe: 2.455 ± 0.356
4.576ArgGly: 4.576 ± 0.495
1.507ArgHis: 1.507 ± 0.426
3.739ArgIle: 3.739 ± 0.459
2.232ArgLys: 2.232 ± 0.333
4.632ArgLeu: 4.632 ± 0.55
2.4ArgMet: 2.4 ± 0.428
2.009ArgAsn: 2.009 ± 0.403
3.46ArgPro: 3.46 ± 0.39
2.846ArgGln: 2.846 ± 0.454
5.804ArgArg: 5.804 ± 0.764
3.795ArgSer: 3.795 ± 0.395
3.571ArgThr: 3.571 ± 0.437
4.855ArgVal: 4.855 ± 0.625
1.73ArgTrp: 1.73 ± 0.379
2.121ArgTyr: 2.121 ± 0.301
0.0ArgXaa: 0.0 ± 0.0
Ser
6.25SerAla: 6.25 ± 0.811
0.725SerCys: 0.725 ± 0.226
4.074SerAsp: 4.074 ± 0.507
3.46SerGlu: 3.46 ± 0.447
2.288SerPhe: 2.288 ± 0.426
5.971SerGly: 5.971 ± 0.768
1.395SerHis: 1.395 ± 0.221
2.344SerIle: 2.344 ± 0.403
2.121SerLys: 2.121 ± 0.369
3.571SerLeu: 3.571 ± 0.422
1.283SerMet: 1.283 ± 0.267
1.618SerAsn: 1.618 ± 0.348
3.46SerPro: 3.46 ± 0.427
2.065SerGln: 2.065 ± 0.356
3.85SerArg: 3.85 ± 0.442
4.129SerSer: 4.129 ± 0.56
3.571SerThr: 3.571 ± 0.385
4.688SerVal: 4.688 ± 0.53
1.562SerTrp: 1.562 ± 0.258
1.339SerTyr: 1.339 ± 0.243
0.0SerXaa: 0.0 ± 0.0
Thr
6.194ThrAla: 6.194 ± 0.756
0.614ThrCys: 0.614 ± 0.216
3.85ThrAsp: 3.85 ± 0.577
3.85ThrGlu: 3.85 ± 0.443
1.73ThrPhe: 1.73 ± 0.362
5.804ThrGly: 5.804 ± 0.636
1.451ThrHis: 1.451 ± 0.291
3.683ThrIle: 3.683 ± 0.485
2.344ThrLys: 2.344 ± 0.386
4.743ThrLeu: 4.743 ± 0.513
0.949ThrMet: 0.949 ± 0.209
2.176ThrAsn: 2.176 ± 0.367
4.464ThrPro: 4.464 ± 0.428
2.009ThrGln: 2.009 ± 0.314
4.185ThrArg: 4.185 ± 0.449
3.683ThrSer: 3.683 ± 0.408
4.911ThrThr: 4.911 ± 0.727
4.967ThrVal: 4.967 ± 0.656
1.228ThrTrp: 1.228 ± 0.263
1.953ThrTyr: 1.953 ± 0.256
0.0ThrXaa: 0.0 ± 0.0
Val
7.924ValAla: 7.924 ± 0.687
1.339ValCys: 1.339 ± 0.306
5.301ValAsp: 5.301 ± 0.708
3.627ValGlu: 3.627 ± 0.515
1.73ValPhe: 1.73 ± 0.361
6.027ValGly: 6.027 ± 0.684
1.395ValHis: 1.395 ± 0.347
3.181ValIle: 3.181 ± 0.454
2.455ValLys: 2.455 ± 0.402
5.469ValLeu: 5.469 ± 0.682
1.116ValMet: 1.116 ± 0.204
2.232ValAsn: 2.232 ± 0.363
4.074ValPro: 4.074 ± 0.428
2.4ValGln: 2.4 ± 0.318
4.52ValArg: 4.52 ± 0.56
5.19ValSer: 5.19 ± 0.533
4.855ValThr: 4.855 ± 0.405
5.748ValVal: 5.748 ± 0.644
1.73ValTrp: 1.73 ± 0.329
1.228ValTyr: 1.228 ± 0.314
0.0ValXaa: 0.0 ± 0.0
Trp
1.953TrpAla: 1.953 ± 0.275
0.167TrpCys: 0.167 ± 0.091
1.562TrpAsp: 1.562 ± 0.33
1.06TrpGlu: 1.06 ± 0.275
0.725TrpPhe: 0.725 ± 0.188
1.116TrpGly: 1.116 ± 0.257
0.502TrpHis: 0.502 ± 0.159
1.228TrpIle: 1.228 ± 0.255
0.837TrpLys: 0.837 ± 0.189
1.674TrpLeu: 1.674 ± 0.341
0.781TrpMet: 0.781 ± 0.242
0.614TrpAsn: 0.614 ± 0.181
1.339TrpPro: 1.339 ± 0.33
1.228TrpGln: 1.228 ± 0.279
2.009TrpArg: 2.009 ± 0.356
1.618TrpSer: 1.618 ± 0.341
1.73TrpThr: 1.73 ± 0.385
1.897TrpVal: 1.897 ± 0.407
0.893TrpTrp: 0.893 ± 0.249
0.558TrpTyr: 0.558 ± 0.161
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.734TyrAla: 2.734 ± 0.403
0.502TyrCys: 0.502 ± 0.187
1.618TyrAsp: 1.618 ± 0.353
1.507TyrGlu: 1.507 ± 0.309
0.725TyrPhe: 0.725 ± 0.193
2.009TyrGly: 2.009 ± 0.353
0.502TyrHis: 0.502 ± 0.136
1.116TyrIle: 1.116 ± 0.298
0.781TyrLys: 0.781 ± 0.266
2.009TyrLeu: 2.009 ± 0.377
0.167TyrMet: 0.167 ± 0.089
0.614TyrAsn: 0.614 ± 0.217
1.395TyrPro: 1.395 ± 0.269
0.837TyrGln: 0.837 ± 0.21
1.953TyrArg: 1.953 ± 0.343
1.004TyrSer: 1.004 ± 0.253
1.842TyrThr: 1.842 ± 0.354
2.455TyrVal: 2.455 ± 0.317
0.558TyrTrp: 0.558 ± 0.159
0.614TyrTyr: 0.614 ± 0.165
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 102 proteins (17921 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski