Amino acid dipeptide frequency for Mycobacteriophage ElTiger69

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
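As a rough illustration of how such a frequency could be computed, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, the one-letter residue codes and the choice to normalise by the total number of dipeptides are all assumptions made for this example; the normalisation used for the table on this page may differ.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    `proteins` is an iterable of one-letter amino acid strings (hypothetical input).
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


toy = ["MAAG", "AAC"]  # made-up sequences, not taken from the proteome
freqs = dipeptide_permille(toy)
print(freqs["AA"])
```

Counting per protein, rather than on one concatenated string, avoids creating artificial dipeptides at the boundary between two proteins.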

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
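The comparison against the uniform expectation can be sketched in a few lines. The three observed values below are taken from the table on this page; the function `classify` and the plain greater-than/less-than rule are assumptions for illustration, since the page does not state the exact rule used for colouring (for example, whether the error estimate is taken into account).

```python
# Uniform expectation for 20 x 20 dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / (20 * 20)  # 2.5 per mille, i.e. 0.25%


def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (hypothetical rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Observed frequencies (per mille) copied from the table.
observed = {"AlaAla": 9.865, "CysCys": 0.062, "AspPhe": 2.56}
labels = {pair: classify(value) for pair, value in observed.items()}
print(labels)
```

Under this simple rule AlaAla and AspPhe count as overrepresented and CysCys as underrepresented.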

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.865 ± 0.967
AlaCys: 0.375 ± 0.145
AlaAsp: 5.432 ± 0.701
AlaGlu: 6.431 ± 0.8
AlaPhe: 3.184 ± 0.431
AlaGly: 6.369 ± 0.543
AlaHis: 1.748 ± 0.304
AlaIle: 4.62 ± 0.544
AlaLys: 4.183 ± 0.579
AlaLeu: 7.742 ± 0.711
AlaMet: 2.248 ± 0.408
AlaAsn: 3.247 ± 0.475
AlaPro: 4.246 ± 0.511
AlaGln: 3.059 ± 0.456
AlaArg: 4.933 ± 0.691
AlaSer: 5.619 ± 0.624
AlaThr: 5.182 ± 0.581
AlaVal: 6.931 ± 0.797
AlaTrp: 1.436 ± 0.28
AlaTyr: 1.623 ± 0.296
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.562 ± 0.226
CysCys: 0.062 ± 0.056
CysAsp: 0.5 ± 0.168
CysGlu: 0.687 ± 0.231
CysPhe: 0.312 ± 0.133
CysGly: 0.687 ± 0.182
CysHis: 0.25 ± 0.117
CysIle: 0.5 ± 0.172
CysLys: 0.5 ± 0.185
CysLeu: 0.749 ± 0.231
CysMet: 0.062 ± 0.051
CysAsn: 0.437 ± 0.163
CysPro: 0.375 ± 0.183
CysGln: 0.187 ± 0.102
CysArg: 0.937 ± 0.245
CysSer: 0.624 ± 0.202
CysThr: 0.375 ± 0.159
CysVal: 0.624 ± 0.211
CysTrp: 0.187 ± 0.1
CysTyr: 0.187 ± 0.105
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.618 ± 0.733
AspCys: 0.874 ± 0.236
AspAsp: 3.309 ± 0.514
AspGlu: 4.246 ± 0.467
AspPhe: 2.56 ± 0.477
AspGly: 6.119 ± 0.676
AspHis: 1.686 ± 0.34
AspIle: 3.309 ± 0.396
AspLys: 2.872 ± 0.414
AspLeu: 6.119 ± 0.595
AspMet: 1.499 ± 0.268
AspAsn: 1.998 ± 0.384
AspPro: 4.995 ± 0.668
AspGln: 3.372 ± 0.447
AspArg: 3.309 ± 0.484
AspSer: 3.184 ± 0.481
AspThr: 3.184 ± 0.402
AspVal: 5.432 ± 0.565
AspTrp: 1.686 ± 0.31
AspTyr: 3.621 ± 0.496
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.306 ± 0.679
GluCys: 0.5 ± 0.176
GluAsp: 5.245 ± 0.562
GluGlu: 5.37 ± 0.675
GluPhe: 2.373 ± 0.348
GluGly: 5.744 ± 0.581
GluHis: 1.623 ± 0.355
GluIle: 4.121 ± 0.57
GluLys: 3.059 ± 0.4
GluLeu: 7.43 ± 0.605
GluMet: 2.248 ± 0.414
GluAsn: 2.498 ± 0.396
GluPro: 2.685 ± 0.517
GluGln: 2.747 ± 0.327
GluArg: 4.995 ± 0.634
GluSer: 3.122 ± 0.505
GluThr: 3.434 ± 0.505
GluVal: 5.682 ± 0.624
GluTrp: 1.249 ± 0.279
GluTyr: 2.373 ± 0.39
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.685 ± 0.404
PheCys: 0.25 ± 0.121
PheAsp: 3.309 ± 0.437
PheGlu: 2.185 ± 0.334
PhePhe: 0.812 ± 0.233
PheGly: 2.622 ± 0.368
PheHis: 0.874 ± 0.279
PheIle: 1.124 ± 0.272
PheLys: 1.561 ± 0.3
PheLeu: 2.747 ± 0.49
PheMet: 0.5 ± 0.183
PheAsn: 1.436 ± 0.309
PhePro: 1.499 ± 0.367
PheGln: 1.249 ± 0.271
PheArg: 1.873 ± 0.299
PheSer: 1.936 ± 0.328
PheThr: 1.873 ± 0.341
PheVal: 2.435 ± 0.355
PheTrp: 0.812 ± 0.259
PheTyr: 1.311 ± 0.268
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.557 ± 0.764
GlyCys: 0.812 ± 0.271
GlyAsp: 5.682 ± 0.78
GlyGlu: 5.182 ± 0.589
GlyPhe: 2.123 ± 0.374
GlyGly: 9.178 ± 2.185
GlyHis: 1.748 ± 0.332
GlyIle: 4.745 ± 0.68
GlyLys: 4.371 ± 0.588
GlyLeu: 6.618 ± 0.756
GlyMet: 2.123 ± 0.332
GlyAsn: 3.434 ± 0.44
GlyPro: 3.559 ± 0.504
GlyGln: 2.622 ± 0.564
GlyArg: 4.558 ± 0.449
GlySer: 4.62 ± 0.612
GlyThr: 4.371 ± 0.548
GlyVal: 5.432 ± 0.514
GlyTrp: 1.998 ± 0.346
GlyTyr: 2.622 ± 0.403
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.436 ± 0.292
HisCys: 0.125 ± 0.085
HisAsp: 1.436 ± 0.267
HisGlu: 1.436 ± 0.27
HisPhe: 0.812 ± 0.239
HisGly: 1.686 ± 0.285
HisHis: 0.937 ± 0.269
HisIle: 1.374 ± 0.281
HisLys: 0.5 ± 0.222
HisLeu: 1.499 ± 0.287
HisMet: 0.187 ± 0.108
HisAsn: 0.687 ± 0.191
HisPro: 1.124 ± 0.261
HisGln: 0.624 ± 0.186
HisArg: 1.811 ± 0.406
HisSer: 1.061 ± 0.265
HisThr: 0.999 ± 0.244
HisVal: 1.561 ± 0.312
HisTrp: 0.437 ± 0.205
HisTyr: 0.687 ± 0.223
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.807 ± 0.659
IleCys: 0.437 ± 0.161
IleAsp: 4.308 ± 0.543
IleGlu: 3.934 ± 0.523
IlePhe: 1.061 ± 0.376
IleGly: 3.621 ± 0.563
IleHis: 1.061 ± 0.295
IleIle: 2.56 ± 0.371
IleLys: 2.06 ± 0.372
IleLeu: 3.372 ± 0.47
IleMet: 0.687 ± 0.205
IleAsn: 2.622 ± 0.455
IlePro: 3.621 ± 0.507
IleGln: 1.998 ± 0.299
IleArg: 3.996 ± 0.418
IleSer: 3.122 ± 0.464
IleThr: 3.434 ± 0.376
IleVal: 3.434 ± 0.451
IleTrp: 0.437 ± 0.153
IleTyr: 1.436 ± 0.303
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.87 ± 0.657
LysCys: 0.312 ± 0.134
LysAsp: 2.935 ± 0.474
LysGlu: 2.435 ± 0.436
LysPhe: 1.499 ± 0.336
LysGly: 3.184 ± 0.538
LysHis: 1.061 ± 0.219
LysIle: 2.31 ± 0.353
LysLys: 3.122 ± 0.558
LysLeu: 3.621 ± 0.428
LysMet: 1.249 ± 0.275
LysAsn: 1.061 ± 0.287
LysPro: 2.622 ± 0.445
LysGln: 1.124 ± 0.263
LysArg: 3.059 ± 0.48
LysSer: 2.56 ± 0.437
LysThr: 3.184 ± 0.445
LysVal: 4.371 ± 0.506
LysTrp: 0.874 ± 0.231
LysTyr: 1.186 ± 0.247
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.493 ± 0.977
LeuCys: 0.812 ± 0.236
LeuAsp: 6.806 ± 0.733
LeuGlu: 5.994 ± 0.606
LeuPhe: 2.06 ± 0.36
LeuGly: 5.744 ± 0.709
LeuHis: 0.812 ± 0.296
LeuIle: 3.934 ± 0.561
LeuLys: 4.433 ± 0.658
LeuLeu: 6.244 ± 0.698
LeuMet: 2.248 ± 0.331
LeuAsn: 3.497 ± 0.542
LeuPro: 3.746 ± 0.497
LeuGln: 2.123 ± 0.418
LeuArg: 5.619 ± 0.525
LeuSer: 5.807 ± 0.724
LeuThr: 5.557 ± 0.762
LeuVal: 6.431 ± 0.664
LeuTrp: 1.249 ± 0.283
LeuTyr: 2.31 ± 0.38
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.936 ± 0.437
MetCys: 0.25 ± 0.121
MetAsp: 1.623 ± 0.433
MetGlu: 1.374 ± 0.302
MetPhe: 0.874 ± 0.233
MetGly: 1.436 ± 0.292
MetHis: 0.312 ± 0.176
MetIle: 0.562 ± 0.212
MetLys: 1.186 ± 0.252
MetLeu: 1.623 ± 0.467
MetMet: 0.5 ± 0.184
MetAsn: 0.874 ± 0.227
MetPro: 1.061 ± 0.231
MetGln: 0.375 ± 0.168
MetArg: 1.623 ± 0.347
MetSer: 2.498 ± 0.413
MetThr: 2.81 ± 0.392
MetVal: 0.874 ± 0.236
MetTrp: 0.687 ± 0.201
MetTyr: 1.124 ± 0.316
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.747 ± 0.383
AsnCys: 0.25 ± 0.111
AsnAsp: 2.123 ± 0.382
AsnGlu: 1.873 ± 0.327
AsnPhe: 1.686 ± 0.356
AsnGly: 4.808 ± 0.71
AsnHis: 0.812 ± 0.222
AsnIle: 1.436 ± 0.469
AsnLys: 1.311 ± 0.254
AsnLeu: 2.935 ± 0.275
AsnMet: 0.5 ± 0.174
AsnAsn: 0.749 ± 0.191
AsnPro: 2.685 ± 0.408
AsnGln: 0.999 ± 0.277
AsnArg: 2.123 ± 0.39
AsnSer: 1.623 ± 0.316
AsnThr: 2.248 ± 0.424
AsnVal: 2.56 ± 0.317
AsnTrp: 0.562 ± 0.187
AsnTyr: 0.812 ± 0.198
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.996 ± 0.562
ProCys: 0.312 ± 0.153
ProAsp: 3.621 ± 0.456
ProGlu: 4.433 ± 0.534
ProPhe: 1.623 ± 0.435
ProGly: 4.808 ± 0.748
ProHis: 0.812 ± 0.221
ProIle: 2.872 ± 0.44
ProLys: 2.06 ± 0.347
ProLeu: 3.184 ± 0.43
ProMet: 1.186 ± 0.311
ProAsn: 1.811 ± 0.37
ProPro: 2.935 ± 0.691
ProGln: 1.748 ± 0.311
ProArg: 2.123 ± 0.399
ProSer: 3.059 ± 0.368
ProThr: 3.621 ± 0.44
ProVal: 4.308 ± 0.54
ProTrp: 1.124 ± 0.414
ProTyr: 1.561 ± 0.282
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.247 ± 0.402
GlnCys: 0.25 ± 0.108
GlnAsp: 2.06 ± 0.381
GlnGlu: 2.622 ± 0.411
GlnPhe: 1.436 ± 0.27
GlnGly: 1.998 ± 0.342
GlnHis: 0.749 ± 0.194
GlnIle: 2.56 ± 0.375
GlnLys: 1.936 ± 0.32
GlnLeu: 3.934 ± 0.705
GlnMet: 0.937 ± 0.22
GlnAsn: 0.749 ± 0.216
GlnPro: 1.186 ± 0.364
GlnGln: 1.061 ± 0.288
GlnArg: 1.311 ± 0.392
GlnSer: 1.561 ± 0.314
GlnThr: 1.686 ± 0.317
GlnVal: 2.685 ± 0.327
GlnTrp: 0.687 ± 0.223
GlnTyr: 1.124 ± 0.27
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.745 ± 0.603
ArgCys: 0.749 ± 0.267
ArgAsp: 3.434 ± 0.403
ArgGlu: 5.307 ± 0.725
ArgPhe: 2.747 ± 0.464
ArgGly: 3.996 ± 0.555
ArgHis: 1.499 ± 0.381
ArgIle: 3.684 ± 0.463
ArgLys: 2.935 ± 0.464
ArgLeu: 6.181 ± 0.5
ArgMet: 1.811 ± 0.314
ArgAsn: 2.123 ± 0.476
ArgPro: 2.56 ± 0.364
ArgGln: 2.685 ± 0.505
ArgArg: 4.808 ± 0.733
ArgSer: 2.435 ± 0.501
ArgThr: 3.059 ± 0.356
ArgVal: 3.372 ± 0.472
ArgTrp: 1.374 ± 0.317
ArgTyr: 1.686 ± 0.327
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.433 ± 0.543
SerCys: 0.562 ± 0.203
SerAsp: 4.308 ± 0.489
SerGlu: 4.745 ± 0.601
SerPhe: 2.685 ± 0.343
SerGly: 4.558 ± 0.586
SerHis: 0.937 ± 0.221
SerIle: 3.309 ± 0.368
SerLys: 3.059 ± 0.452
SerLeu: 3.684 ± 0.472
SerMet: 1.499 ± 0.275
SerAsn: 1.873 ± 0.361
SerPro: 2.373 ± 0.451
SerGln: 2.185 ± 0.324
SerArg: 4.183 ± 0.562
SerSer: 3.184 ± 0.436
SerThr: 2.685 ± 0.44
SerVal: 3.059 ± 0.45
SerTrp: 1.311 ± 0.298
SerTyr: 1.873 ± 0.294
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.182 ± 0.433
ThrCys: 0.5 ± 0.179
ThrAsp: 3.934 ± 0.45
ThrGlu: 4.995 ± 0.493
ThrPhe: 2.123 ± 0.339
ThrGly: 4.87 ± 0.764
ThrHis: 1.249 ± 0.292
ThrIle: 3.184 ± 0.45
ThrLys: 2.685 ± 0.442
ThrLeu: 5.245 ± 0.675
ThrMet: 1.561 ± 0.369
ThrAsn: 1.623 ± 0.361
ThrPro: 4.496 ± 0.706
ThrGln: 1.561 ± 0.324
ThrArg: 3.059 ± 0.404
ThrSer: 2.872 ± 0.439
ThrThr: 3.934 ± 0.539
ThrVal: 3.621 ± 0.51
ThrTrp: 0.937 ± 0.247
ThrTyr: 1.186 ± 0.247
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.431 ± 0.6
ValCys: 0.937 ± 0.265
ValAsp: 5.619 ± 0.675
ValGlu: 5.682 ± 0.498
ValPhe: 1.998 ± 0.419
ValGly: 5.495 ± 0.551
ValHis: 1.374 ± 0.349
ValIle: 3.934 ± 0.5
ValLys: 3.247 ± 0.474
ValLeu: 5.432 ± 0.721
ValMet: 1.311 ± 0.281
ValAsn: 2.435 ± 0.454
ValPro: 2.997 ± 0.483
ValGln: 2.248 ± 0.441
ValArg: 3.621 ± 0.416
ValSer: 4.933 ± 0.489
ValThr: 4.683 ± 0.484
ValVal: 5.744 ± 0.597
ValTrp: 1.748 ± 0.32
ValTyr: 2.373 ± 0.413
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.748 ± 0.315
TrpCys: 0.187 ± 0.112
TrpAsp: 1.811 ± 0.364
TrpGlu: 1.249 ± 0.27
TrpPhe: 0.5 ± 0.193
TrpGly: 1.686 ± 0.358
TrpHis: 0.437 ± 0.179
TrpIle: 1.311 ± 0.3
TrpLys: 0.562 ± 0.177
TrpLeu: 1.623 ± 0.331
TrpMet: 0.687 ± 0.218
TrpAsn: 0.999 ± 0.292
TrpPro: 0.874 ± 0.216
TrpGln: 0.812 ± 0.24
TrpArg: 0.812 ± 0.221
TrpSer: 1.686 ± 0.327
TrpThr: 1.124 ± 0.298
TrpVal: 1.061 ± 0.254
TrpTrp: 0.5 ± 0.19
TrpTyr: 0.375 ± 0.139
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.81 ± 0.38
TyrCys: 0.25 ± 0.118
TyrAsp: 2.498 ± 0.472
TyrGlu: 2.622 ± 0.575
TyrPhe: 0.624 ± 0.192
TyrGly: 2.747 ± 0.388
TyrHis: 0.437 ± 0.136
TyrIle: 1.561 ± 0.294
TyrLys: 0.999 ± 0.231
TyrLeu: 2.935 ± 0.378
TyrMet: 0.5 ± 0.174
TyrAsn: 0.687 ± 0.227
TyrPro: 1.561 ± 0.285
TyrGln: 0.999 ± 0.234
TyrArg: 2.435 ± 0.432
TyrSer: 1.061 ± 0.288
TyrThr: 1.374 ± 0.267
TyrVal: 2.622 ± 0.366
TyrTrp: 0.749 ± 0.234
TyrTyr: 0.812 ± 0.231
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 92 proteins (16017 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
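One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the dipeptide frequency for each resampled set, and take the spread of the 100 estimates as the error. The sketch below follows that reading. The function names, the toy sequences, the one-letter codes and the use of the sample standard deviation as the reported error are assumptions; the page does not specify which statistic the ± values represent.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the estimate over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: report the sample standard deviation of the replicates.
    return statistics.stdev(estimates)


toy = ["MAAG", "AAC", "MKV"]  # made-up sequences, not from the proteome
err = bootstrap_error(toy, "AA")
print(err)
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins produce a correspondingly larger error.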

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski