Amino acid dipeptide frequency for Mycobacterium phage Zeuska

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
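The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: sequences use one-letter codes (the table uses three-letter codes), dipeptides are counted as overlapping pairs within each protein, and the function names `dipeptide_per_mille` and `label` as well as the toy sequences are hypothetical. It is not necessarily how the values in the table were produced.

```python
from collections import Counter

# Uniform baseline from the text: 1 / 400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumption: dipeptides are overlapping residue pairs counted inside
    each protein, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def label(freq_per_mille, baseline=UNIFORM_PER_MILLE):
    """Classify a frequency against the uniform baseline."""
    if freq_per_mille > baseline:
        return "over"
    if freq_per_mille < baseline:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the proteome.
toy = ["MAAGL", "AAL"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), label(value))
```

With this baseline, the AlaAla value from the table (11.742‰) would be classified as over-represented and CysCys (0.061‰) as under-represented.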

For more information, see the sequence space article on Wikipedia.

Second residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.742 ± 1.105
AlaCys: 0.791 ± 0.252
AlaAsp: 6.327 ± 0.603
AlaGlu: 6.327 ± 0.671
AlaPhe: 2.799 ± 0.504
AlaGly: 7.97 ± 0.832
AlaHis: 1.582 ± 0.359
AlaIle: 4.259 ± 0.565
AlaLys: 4.745 ± 0.523
AlaLeu: 8.578 ± 0.84
AlaMet: 2.494 ± 0.367
AlaAsn: 2.738 ± 0.391
AlaPro: 4.989 ± 0.519
AlaGln: 2.92 ± 0.487
AlaArg: 6.51 ± 0.579
AlaSer: 4.989 ± 0.455
AlaThr: 5.962 ± 0.606
AlaVal: 8.335 ± 0.653
AlaTrp: 1.886 ± 0.337
AlaTyr: 2.799 ± 0.381
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.73 ± 0.239
CysCys: 0.061 ± 0.068
CysAsp: 0.365 ± 0.174
CysGlu: 0.608 ± 0.193
CysPhe: 0.122 ± 0.072
CysGly: 0.608 ± 0.24
CysHis: 0.304 ± 0.13
CysIle: 0.304 ± 0.15
CysLys: 0.304 ± 0.162
CysLeu: 0.608 ± 0.225
CysMet: 0.122 ± 0.077
CysAsn: 0.243 ± 0.119
CysPro: 0.304 ± 0.14
CysGln: 0.243 ± 0.119
CysArg: 0.608 ± 0.204
CysSer: 0.426 ± 0.15
CysThr: 0.304 ± 0.221
CysVal: 0.304 ± 0.145
CysTrp: 0.304 ± 0.142
CysTyr: 0.243 ± 0.122
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.084 ± 0.613
AspCys: 0.608 ± 0.201
AspAsp: 4.502 ± 0.498
AspGlu: 4.076 ± 0.482
AspPhe: 2.312 ± 0.33
AspGly: 5.84 ± 0.68
AspHis: 1.278 ± 0.253
AspIle: 2.555 ± 0.402
AspLys: 2.677 ± 0.432
AspLeu: 6.388 ± 0.716
AspMet: 1.278 ± 0.216
AspAsn: 1.947 ± 0.322
AspPro: 4.806 ± 0.558
AspGln: 1.521 ± 0.313
AspArg: 3.833 ± 0.401
AspSer: 3.468 ± 0.493
AspThr: 3.772 ± 0.366
AspVal: 4.076 ± 0.455
AspTrp: 1.582 ± 0.317
AspTyr: 1.947 ± 0.32
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.449 ± 0.815
GluCys: 0.183 ± 0.114
GluAsp: 5.171 ± 0.471
GluGlu: 5.293 ± 0.628
GluPhe: 2.008 ± 0.348
GluGly: 4.38 ± 0.521
GluHis: 1.399 ± 0.313
GluIle: 3.772 ± 0.45
GluLys: 2.494 ± 0.373
GluLeu: 7.179 ± 0.702
GluMet: 1.582 ± 0.318
GluAsn: 1.582 ± 0.353
GluPro: 2.494 ± 0.393
GluGln: 2.799 ± 0.384
GluArg: 4.32 ± 0.58
GluSer: 3.65 ± 0.479
GluThr: 3.65 ± 0.472
GluVal: 5.415 ± 0.578
GluTrp: 1.399 ± 0.3
GluTyr: 2.494 ± 0.497
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.434 ± 0.398
PheCys: 0.365 ± 0.162
PheAsp: 2.555 ± 0.307
PheGlu: 2.251 ± 0.393
PhePhe: 0.548 ± 0.166
PheGly: 3.346 ± 0.476
PheHis: 0.548 ± 0.212
PheIle: 1.278 ± 0.275
PheLys: 1.338 ± 0.276
PheLeu: 2.434 ± 0.418
PheMet: 0.73 ± 0.199
PheAsn: 1.338 ± 0.302
PhePro: 1.643 ± 0.289
PheGln: 0.791 ± 0.174
PheArg: 1.703 ± 0.394
PheSer: 1.643 ± 0.255
PheThr: 2.373 ± 0.367
PheVal: 1.764 ± 0.358
PheTrp: 0.487 ± 0.163
PheTyr: 0.973 ± 0.236
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.031 ± 1.354
GlyCys: 0.487 ± 0.166
GlyAsp: 5.84 ± 0.429
GlyGlu: 4.38 ± 0.524
GlyPhe: 3.042 ± 0.558
GlyGly: 10.038 ± 2.812
GlyHis: 2.312 ± 0.459
GlyIle: 4.563 ± 0.796
GlyLys: 4.137 ± 0.55
GlyLeu: 7.361 ± 0.754
GlyMet: 2.008 ± 0.342
GlyAsn: 3.346 ± 0.447
GlyPro: 3.285 ± 0.519
GlyGln: 2.251 ± 0.34
GlyArg: 5.05 ± 0.564
GlySer: 5.962 ± 0.775
GlyThr: 5.05 ± 0.691
GlyVal: 5.354 ± 0.466
GlyTrp: 3.103 ± 0.449
GlyTyr: 2.555 ± 0.397
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.825 ± 0.381
HisCys: 0.183 ± 0.149
HisAsp: 1.217 ± 0.261
HisGlu: 1.582 ± 0.277
HisPhe: 0.73 ± 0.196
HisGly: 1.643 ± 0.312
HisHis: 0.608 ± 0.181
HisIle: 0.852 ± 0.192
HisLys: 0.913 ± 0.312
HisLeu: 1.643 ± 0.327
HisMet: 0.183 ± 0.091
HisAsn: 0.183 ± 0.106
HisPro: 1.399 ± 0.281
HisGln: 0.913 ± 0.203
HisArg: 1.582 ± 0.3
HisSer: 0.669 ± 0.192
HisThr: 1.034 ± 0.268
HisVal: 1.886 ± 0.319
HisTrp: 0.487 ± 0.146
HisTyr: 0.669 ± 0.21
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.327 ± 0.691
IleCys: 0.365 ± 0.149
IleAsp: 3.772 ± 0.398
IleGlu: 3.833 ± 0.468
IlePhe: 0.791 ± 0.208
IleGly: 3.711 ± 0.484
IleHis: 0.791 ± 0.222
IleIle: 1.825 ± 0.287
IleLys: 1.825 ± 0.367
IleLeu: 3.346 ± 0.384
IleMet: 0.73 ± 0.167
IleAsn: 1.886 ± 0.301
IlePro: 3.103 ± 0.365
IleGln: 1.338 ± 0.339
IleArg: 3.224 ± 0.406
IleSer: 3.468 ± 0.414
IleThr: 3.224 ± 0.432
IleVal: 2.799 ± 0.49
IleTrp: 0.73 ± 0.177
IleTyr: 1.703 ± 0.269
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.076 ± 0.573
LysCys: 0.183 ± 0.113
LysAsp: 2.251 ± 0.443
LysGlu: 2.555 ± 0.434
LysPhe: 1.46 ± 0.313
LysGly: 2.738 ± 0.371
LysHis: 1.338 ± 0.352
LysIle: 2.251 ± 0.403
LysLys: 1.947 ± 0.389
LysLeu: 3.285 ± 0.505
LysMet: 1.095 ± 0.268
LysAsn: 1.278 ± 0.237
LysPro: 2.92 ± 0.527
LysGln: 1.643 ± 0.417
LysArg: 2.799 ± 0.412
LysSer: 2.373 ± 0.437
LysThr: 2.799 ± 0.427
LysVal: 3.042 ± 0.418
LysTrp: 0.73 ± 0.209
LysTyr: 1.034 ± 0.282
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.943 ± 0.733
LeuCys: 0.365 ± 0.126
LeuAsp: 6.388 ± 0.525
LeuGlu: 5.597 ± 0.615
LeuPhe: 2.312 ± 0.399
LeuGly: 7.544 ± 0.764
LeuHis: 1.278 ± 0.289
LeuIle: 4.38 ± 0.478
LeuLys: 4.38 ± 0.65
LeuLeu: 5.78 ± 0.527
LeuMet: 1.582 ± 0.27
LeuAsn: 2.859 ± 0.414
LeuPro: 5.354 ± 0.559
LeuGln: 2.677 ± 0.497
LeuArg: 5.597 ± 0.514
LeuSer: 5.901 ± 0.556
LeuThr: 5.84 ± 0.437
LeuVal: 4.745 ± 0.661
LeuTrp: 1.156 ± 0.268
LeuTyr: 2.373 ± 0.393
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.616 ± 0.368
MetCys: 0.0 ± 0.0
MetAsp: 1.034 ± 0.264
MetGlu: 1.521 ± 0.28
MetPhe: 0.608 ± 0.171
MetGly: 1.278 ± 0.255
MetHis: 0.365 ± 0.148
MetIle: 0.548 ± 0.208
MetLys: 0.973 ± 0.257
MetLeu: 1.217 ± 0.291
MetMet: 0.122 ± 0.081
MetAsn: 0.973 ± 0.199
MetPro: 1.034 ± 0.218
MetGln: 0.608 ± 0.165
MetArg: 1.217 ± 0.268
MetSer: 2.555 ± 0.41
MetThr: 1.643 ± 0.301
MetVal: 1.156 ± 0.249
MetTrp: 0.365 ± 0.118
MetTyr: 0.426 ± 0.168
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.407 ± 0.431
AsnCys: 0.061 ± 0.064
AsnAsp: 1.825 ± 0.378
AsnGlu: 1.886 ± 0.363
AsnPhe: 0.852 ± 0.239
AsnGly: 3.894 ± 0.548
AsnHis: 0.669 ± 0.208
AsnIle: 1.582 ± 0.333
AsnLys: 0.669 ± 0.205
AsnLeu: 2.434 ± 0.307
AsnMet: 0.548 ± 0.177
AsnAsn: 0.791 ± 0.176
AsnPro: 2.859 ± 0.399
AsnGln: 0.973 ± 0.203
AsnArg: 1.521 ± 0.345
AsnSer: 1.825 ± 0.478
AsnThr: 1.947 ± 0.351
AsnVal: 2.799 ± 0.467
AsnTrp: 1.034 ± 0.183
AsnTyr: 1.095 ± 0.267
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.806 ± 0.645
ProCys: 0.304 ± 0.135
ProAsp: 4.076 ± 0.423
ProGlu: 3.954 ± 0.555
ProPhe: 2.069 ± 0.39
ProGly: 5.171 ± 0.586
ProHis: 0.73 ± 0.187
ProIle: 2.312 ± 0.357
ProLys: 2.312 ± 0.278
ProLeu: 4.137 ± 0.448
ProMet: 0.973 ± 0.274
ProAsn: 1.764 ± 0.268
ProPro: 3.103 ± 0.482
ProGln: 1.278 ± 0.303
ProArg: 2.677 ± 0.499
ProSer: 3.954 ± 0.443
ProThr: 4.015 ± 0.546
ProVal: 3.772 ± 0.495
ProTrp: 0.669 ± 0.26
ProTyr: 1.703 ± 0.361
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.677 ± 0.408
GlnCys: 0.365 ± 0.152
GlnAsp: 1.217 ± 0.347
GlnGlu: 2.008 ± 0.317
GlnPhe: 1.095 ± 0.254
GlnGly: 2.19 ± 0.287
GlnHis: 0.608 ± 0.19
GlnIle: 3.103 ± 0.542
GlnLys: 0.913 ± 0.231
GlnLeu: 3.772 ± 0.472
GlnMet: 0.852 ± 0.248
GlnAsn: 0.608 ± 0.179
GlnPro: 1.825 ± 0.316
GlnGln: 1.703 ± 0.379
GlnArg: 1.886 ± 0.318
GlnSer: 1.643 ± 0.259
GlnThr: 1.703 ± 0.317
GlnVal: 2.129 ± 0.28
GlnTrp: 0.913 ± 0.206
GlnTyr: 0.487 ± 0.138
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.84 ± 0.669
ArgCys: 0.791 ± 0.289
ArgAsp: 3.103 ± 0.363
ArgGlu: 5.05 ± 0.699
ArgPhe: 1.947 ± 0.384
ArgGly: 4.867 ± 0.666
ArgHis: 1.095 ± 0.278
ArgIle: 3.468 ± 0.478
ArgLys: 3.589 ± 0.606
ArgLeu: 5.962 ± 0.733
ArgMet: 1.703 ± 0.326
ArgAsn: 2.19 ± 0.374
ArgPro: 2.616 ± 0.427
ArgGln: 2.008 ± 0.341
ArgArg: 5.658 ± 0.771
ArgSer: 4.015 ± 0.485
ArgThr: 3.042 ± 0.509
ArgVal: 5.05 ± 0.526
ArgTrp: 1.521 ± 0.321
ArgTyr: 1.582 ± 0.313
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.266 ± 0.717
SerCys: 0.487 ± 0.171
SerAsp: 3.346 ± 0.41
SerGlu: 4.198 ± 0.513
SerPhe: 1.947 ± 0.408
SerGly: 6.875 ± 0.703
SerHis: 1.703 ± 0.309
SerIle: 2.434 ± 0.42
SerLys: 1.947 ± 0.297
SerLeu: 5.05 ± 0.538
SerMet: 1.278 ± 0.247
SerAsn: 2.555 ± 0.421
SerPro: 2.859 ± 0.424
SerGln: 2.069 ± 0.296
SerArg: 3.894 ± 0.475
SerSer: 3.042 ± 0.578
SerThr: 3.407 ± 0.422
SerVal: 4.198 ± 0.423
SerTrp: 1.217 ± 0.29
SerTyr: 1.582 ± 0.356
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.536 ± 0.596
ThrCys: 0.365 ± 0.199
ThrAsp: 4.076 ± 0.547
ThrGlu: 3.954 ± 0.478
ThrPhe: 2.373 ± 0.382
ThrGly: 6.996 ± 0.681
ThrHis: 1.034 ± 0.281
ThrIle: 2.859 ± 0.561
ThrLys: 2.251 ± 0.374
ThrLeu: 5.962 ± 0.687
ThrMet: 0.913 ± 0.2
ThrAsn: 1.947 ± 0.388
ThrPro: 3.407 ± 0.497
ThrGln: 1.764 ± 0.325
ThrArg: 3.346 ± 0.529
ThrSer: 3.529 ± 0.585
ThrThr: 4.076 ± 0.435
ThrVal: 5.354 ± 0.628
ThrTrp: 1.034 ± 0.245
ThrTyr: 1.703 ± 0.346
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.996 ± 0.618
ValCys: 0.669 ± 0.226
ValAsp: 4.989 ± 0.519
ValGlu: 5.05 ± 0.569
ValPhe: 2.312 ± 0.338
ValGly: 4.745 ± 0.707
ValHis: 1.521 ± 0.282
ValIle: 3.772 ± 0.392
ValLys: 2.799 ± 0.422
ValLeu: 5.475 ± 0.576
ValMet: 1.095 ± 0.288
ValAsn: 2.373 ± 0.371
ValPro: 3.772 ± 0.456
ValGln: 2.069 ± 0.374
ValArg: 5.293 ± 0.694
ValSer: 4.259 ± 0.461
ValThr: 5.293 ± 0.614
ValVal: 5.11 ± 0.667
ValTrp: 1.46 ± 0.327
ValTyr: 2.312 ± 0.338
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.582 ± 0.282
TrpCys: 0.304 ± 0.125
TrpAsp: 1.399 ± 0.265
TrpGlu: 1.278 ± 0.218
TrpPhe: 0.73 ± 0.205
TrpGly: 1.825 ± 0.326
TrpHis: 0.426 ± 0.172
TrpIle: 1.156 ± 0.242
TrpLys: 0.304 ± 0.227
TrpLeu: 2.008 ± 0.339
TrpMet: 0.487 ± 0.227
TrpAsn: 0.669 ± 0.228
TrpPro: 0.73 ± 0.227
TrpGln: 0.852 ± 0.232
TrpArg: 1.521 ± 0.296
TrpSer: 1.278 ± 0.254
TrpThr: 1.46 ± 0.354
TrpVal: 2.19 ± 0.35
TrpTrp: 0.669 ± 0.22
TrpTyr: 0.243 ± 0.111
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.373 ± 0.371
TyrCys: 0.183 ± 0.103
TyrAsp: 1.095 ± 0.303
TyrGlu: 2.312 ± 0.352
TyrPhe: 0.487 ± 0.15
TyrGly: 2.494 ± 0.4
TyrHis: 0.608 ± 0.192
TyrIle: 1.521 ± 0.354
TyrLys: 1.278 ± 0.28
TyrLeu: 2.434 ± 0.401
TyrMet: 0.426 ± 0.142
TyrAsn: 1.399 ± 0.322
TyrPro: 1.278 ± 0.278
TyrGln: 1.278 ± 0.259
TyrArg: 2.92 ± 0.433
TyrSer: 1.703 ± 0.329
TyrThr: 1.886 ± 0.37
TyrVal: 1.886 ± 0.353
TyrTrp: 0.365 ± 0.124
TyrTyr: 0.608 ± 0.192
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (16438 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
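As a rough illustration of what protein-level bootstrapping means, the sketch below resamples whole proteins with replacement, recomputes one dipeptide frequency for each replicate, and reports the spread of those estimates. The function name `bootstrap_error`, the fixed seed, the toy sequences, and the use of the sample standard deviation as the error measure are all assumptions made for this example; the source only states that 100 bootstrap rounds were run at the protein level.

```python
import random
from collections import Counter
from statistics import stdev


def pair_frequency_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples.

    Each round draws len(proteins) whole proteins with replacement,
    so residues within a protein are never shuffled or split.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency_per_mille(sample, pair))
    return stdev(estimates)


# Hypothetical toy input, not taken from the proteome.
toy = ["MAAGL", "AAL", "GLGL"]
print(bootstrap_error(toy, "AA"))
print(bootstrap_error(toy, "XX"))  # a pair that never occurs has zero spread
```

A dipeptide that is absent from every protein gets a frequency of zero in every replicate, which matches the 0.0 ± 0.0 entries in the Xaa rows.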

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski