Amino acid dipeptide frequency for Mycobacterium phage Turj99

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
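As a rough illustration of how such a frequency can be computed, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs within each protein (never across protein boundaries) and that counts are normalized by the total number of dipeptides; the page does not state the exact counting or normalization used by the database, so both are assumptions. The function name `dipeptide_per_mille` is hypothetical, and sequences use one-letter codes rather than the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein
    and normalized by the total number of pairs over all proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "MAAG" gives MA, AA, AG and "AAL" gives AA, AL (5 pairs in total).
freqs = dipeptide_per_mille(["MAAG", "AAL"])
print(freqs["AA"])  # 2 of 5 pairs, i.e. 400.0 per mille
```

Counting per protein keeps the last residue of one protein from being paired with the first residue of the next.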

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
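The uniform expectation and the per mille units described above can be checked with a few lines of Python. This is only a sketch: the helper names (`expected_per_mille`, `classify`, `per_mille_to_fraction`) are hypothetical, and the assumption that the red/blue marking uses the uniform expectation as its threshold is inferred from the text, not confirmed by the page.

```python
def expected_per_mille(alphabet_size=20):
    """Frequency of each dipeptide, in per mille, if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed threshold)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


print(expected_per_mille(20))   # 2.5 per mille, i.e. 0.25%
print(classify(12.955))         # AlaAla value from the table
print(classify(0.629))          # AlaCys value from the table
```

With 21 symbols (including Xaa) the expectation drops to 1000/441, roughly 2.27‰, so the choice of alphabet size slightly shifts which borderline values count as overrepresented.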

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.955 ± 1.376
AlaCys: 0.629 ± 0.175
AlaAsp: 6.666 ± 0.63
AlaGlu: 6.792 ± 0.829
AlaPhe: 3.144 ± 0.458
AlaGly: 7.672 ± 0.769
AlaHis: 1.572 ± 0.361
AlaIle: 4.339 ± 0.66
AlaLys: 4.088 ± 0.5
AlaLeu: 9.119 ± 0.867
AlaMet: 2.39 ± 0.43
AlaAsn: 2.641 ± 0.434
AlaPro: 5.094 ± 0.719
AlaGln: 2.641 ± 0.473
AlaArg: 6.478 ± 0.742
AlaSer: 5.157 ± 0.707
AlaThr: 6.1 ± 0.658
AlaVal: 8.553 ± 0.915
AlaTrp: 1.887 ± 0.32
AlaTyr: 2.83 ± 0.372
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.755 ± 0.234
CysCys: 0.0 ± 0.0
CysAsp: 0.629 ± 0.18
CysGlu: 0.629 ± 0.196
CysPhe: 0.189 ± 0.108
CysGly: 0.44 ± 0.173
CysHis: 0.126 ± 0.096
CysIle: 0.44 ± 0.198
CysLys: 0.314 ± 0.18
CysLeu: 0.566 ± 0.198
CysMet: 0.063 ± 0.066
CysAsn: 0.189 ± 0.098
CysPro: 0.377 ± 0.165
CysGln: 0.189 ± 0.096
CysArg: 0.629 ± 0.202
CysSer: 0.377 ± 0.14
CysThr: 0.314 ± 0.13
CysVal: 0.44 ± 0.154
CysTrp: 0.252 ± 0.119
CysTyr: 0.063 ± 0.058
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.855 ± 0.637
AspCys: 0.566 ± 0.195
AspAsp: 4.339 ± 0.458
AspGlu: 3.962 ± 0.466
AspPhe: 2.453 ± 0.344
AspGly: 6.226 ± 0.607
AspHis: 1.006 ± 0.254
AspIle: 2.893 ± 0.42
AspLys: 2.201 ± 0.411
AspLeu: 6.981 ± 0.737
AspMet: 1.195 ± 0.223
AspAsn: 1.509 ± 0.267
AspPro: 4.78 ± 0.63
AspGln: 1.446 ± 0.346
AspArg: 3.899 ± 0.367
AspSer: 3.019 ± 0.393
AspThr: 3.899 ± 0.374
AspVal: 3.773 ± 0.493
AspTrp: 1.824 ± 0.335
AspTyr: 2.012 ± 0.346
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.849 ± 0.809
GluCys: 0.503 ± 0.244
GluAsp: 4.402 ± 0.543
GluGlu: 4.905 ± 0.615
GluPhe: 2.201 ± 0.301
GluGly: 3.899 ± 0.425
GluHis: 1.384 ± 0.294
GluIle: 3.522 ± 0.482
GluLys: 2.83 ± 0.412
GluLeu: 7.044 ± 0.595
GluMet: 1.509 ± 0.251
GluAsn: 1.698 ± 0.368
GluPro: 2.578 ± 0.472
GluGln: 2.956 ± 0.453
GluArg: 3.648 ± 0.494
GluSer: 3.333 ± 0.378
GluThr: 3.773 ± 0.459
GluVal: 5.723 ± 0.595
GluTrp: 1.698 ± 0.381
GluTyr: 2.704 ± 0.466
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.39 ± 0.312
PheCys: 0.314 ± 0.167
PheAsp: 3.144 ± 0.34
PheGlu: 1.824 ± 0.348
PhePhe: 0.692 ± 0.193
PheGly: 3.648 ± 0.475
PheHis: 0.566 ± 0.209
PheIle: 1.509 ± 0.295
PheLys: 1.321 ± 0.294
PheLeu: 2.704 ± 0.465
PheMet: 0.692 ± 0.225
PheAsn: 0.88 ± 0.227
PhePro: 1.572 ± 0.29
PheGln: 0.88 ± 0.256
PheArg: 2.138 ± 0.372
PheSer: 2.075 ± 0.447
PheThr: 2.012 ± 0.381
PheVal: 2.264 ± 0.397
PheTrp: 0.566 ± 0.158
PheTyr: 0.943 ± 0.235
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.295 ± 1.158
GlyCys: 0.88 ± 0.23
GlyAsp: 5.22 ± 0.45
GlyGlu: 4.968 ± 0.582
GlyPhe: 2.83 ± 0.459
GlyGly: 8.553 ± 1.472
GlyHis: 1.761 ± 0.344
GlyIle: 4.528 ± 0.625
GlyLys: 3.648 ± 0.468
GlyLeu: 7.861 ± 0.873
GlyMet: 1.887 ± 0.331
GlyAsn: 3.333 ± 0.452
GlyPro: 3.899 ± 0.68
GlyGln: 2.578 ± 0.343
GlyArg: 4.591 ± 0.46
GlySer: 5.597 ± 0.607
GlyThr: 4.717 ± 0.635
GlyVal: 5.031 ± 0.536
GlyTrp: 2.39 ± 0.378
GlyTyr: 2.893 ± 0.401
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.761 ± 0.342
HisCys: 0.126 ± 0.12
HisAsp: 1.069 ± 0.231
HisGlu: 1.446 ± 0.286
HisPhe: 0.629 ± 0.193
HisGly: 1.572 ± 0.348
HisHis: 0.692 ± 0.228
HisIle: 0.818 ± 0.23
HisLys: 1.006 ± 0.268
HisLeu: 1.761 ± 0.458
HisMet: 0.063 ± 0.062
HisAsn: 0.377 ± 0.156
HisPro: 1.195 ± 0.295
HisGln: 1.069 ± 0.247
HisArg: 1.446 ± 0.306
HisSer: 0.566 ± 0.172
HisThr: 1.069 ± 0.248
HisVal: 1.761 ± 0.374
HisTrp: 0.566 ± 0.166
HisTyr: 0.629 ± 0.208
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.352 ± 0.799
IleCys: 0.314 ± 0.142
IleAsp: 3.27 ± 0.357
IleGlu: 3.962 ± 0.491
IlePhe: 0.88 ± 0.252
IleGly: 3.71 ± 0.551
IleHis: 0.818 ± 0.29
IleIle: 1.446 ± 0.312
IleLys: 1.95 ± 0.324
IleLeu: 3.522 ± 0.463
IleMet: 0.755 ± 0.186
IleAsn: 1.824 ± 0.33
IlePro: 3.333 ± 0.365
IleGln: 1.446 ± 0.324
IleArg: 3.899 ± 0.488
IleSer: 3.585 ± 0.559
IleThr: 3.396 ± 0.412
IleVal: 2.704 ± 0.487
IleTrp: 0.88 ± 0.186
IleTyr: 1.635 ± 0.288
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.836 ± 0.556
LysCys: 0.252 ± 0.123
LysAsp: 2.327 ± 0.403
LysGlu: 2.012 ± 0.311
LysPhe: 1.572 ± 0.298
LysGly: 2.327 ± 0.301
LysHis: 1.258 ± 0.291
LysIle: 2.39 ± 0.419
LysLys: 2.012 ± 0.405
LysLeu: 3.207 ± 0.406
LysMet: 1.069 ± 0.204
LysAsn: 1.635 ± 0.268
LysPro: 2.578 ± 0.344
LysGln: 1.509 ± 0.303
LysArg: 2.83 ± 0.405
LysSer: 2.39 ± 0.515
LysThr: 2.39 ± 0.374
LysVal: 3.27 ± 0.456
LysTrp: 0.818 ± 0.204
LysTyr: 0.88 ± 0.336
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.182 ± 0.934
LeuCys: 0.314 ± 0.133
LeuAsp: 6.163 ± 0.633
LeuGlu: 5.597 ± 0.566
LeuPhe: 2.012 ± 0.354
LeuGly: 7.421 ± 0.845
LeuHis: 1.446 ± 0.311
LeuIle: 5.22 ± 0.572
LeuLys: 3.899 ± 0.397
LeuLeu: 5.408 ± 0.56
LeuMet: 1.635 ± 0.271
LeuAsn: 2.893 ± 0.451
LeuPro: 5.786 ± 0.598
LeuGln: 2.327 ± 0.383
LeuArg: 5.912 ± 0.541
LeuSer: 5.534 ± 0.503
LeuThr: 6.478 ± 0.509
LeuVal: 5.094 ± 0.678
LeuTrp: 1.069 ± 0.29
LeuTyr: 2.453 ± 0.424
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.704 ± 0.341
MetCys: 0.0 ± 0.0
MetAsp: 1.321 ± 0.238
MetGlu: 1.258 ± 0.258
MetPhe: 0.629 ± 0.235
MetGly: 1.572 ± 0.312
MetHis: 0.189 ± 0.098
MetIle: 0.503 ± 0.172
MetLys: 0.943 ± 0.23
MetLeu: 1.384 ± 0.3
MetMet: 0.126 ± 0.086
MetAsn: 1.069 ± 0.193
MetPro: 1.195 ± 0.251
MetGln: 0.566 ± 0.169
MetArg: 1.132 ± 0.28
MetSer: 2.138 ± 0.416
MetThr: 2.264 ± 0.302
MetVal: 1.069 ± 0.275
MetTrp: 0.314 ± 0.144
MetTyr: 0.503 ± 0.19
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.893 ± 0.476
AsnCys: 0.063 ± 0.068
AsnAsp: 2.327 ± 0.452
AsnGlu: 1.95 ± 0.346
AsnPhe: 1.006 ± 0.27
AsnGly: 3.27 ± 0.459
AsnHis: 0.755 ± 0.228
AsnIle: 1.446 ± 0.287
AsnLys: 0.566 ± 0.166
AsnLeu: 2.075 ± 0.314
AsnMet: 0.503 ± 0.16
AsnAsn: 0.818 ± 0.224
AsnPro: 2.516 ± 0.438
AsnGln: 1.132 ± 0.278
AsnArg: 1.698 ± 0.394
AsnSer: 1.761 ± 0.418
AsnThr: 1.887 ± 0.321
AsnVal: 2.39 ± 0.419
AsnTrp: 0.692 ± 0.182
AsnTyr: 1.195 ± 0.308
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.408 ± 0.675
ProCys: 0.44 ± 0.169
ProAsp: 4.276 ± 0.493
ProGlu: 4.591 ± 0.647
ProPhe: 2.075 ± 0.32
ProGly: 5.157 ± 0.562
ProHis: 0.88 ± 0.214
ProIle: 2.39 ± 0.446
ProLys: 2.138 ± 0.276
ProLeu: 4.402 ± 0.555
ProMet: 1.069 ± 0.258
ProAsn: 1.572 ± 0.284
ProPro: 3.082 ± 0.455
ProGln: 1.635 ± 0.298
ProArg: 2.767 ± 0.477
ProSer: 3.836 ± 0.455
ProThr: 4.276 ± 0.587
ProVal: 3.71 ± 0.425
ProTrp: 0.818 ± 0.327
ProTyr: 1.321 ± 0.338
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.207 ± 0.42
GlnCys: 0.063 ± 0.056
GlnAsp: 1.195 ± 0.33
GlnGlu: 1.635 ± 0.264
GlnPhe: 1.006 ± 0.219
GlnGly: 2.453 ± 0.405
GlnHis: 0.566 ± 0.148
GlnIle: 2.641 ± 0.528
GlnLys: 0.943 ± 0.201
GlnLeu: 3.836 ± 0.496
GlnMet: 1.132 ± 0.28
GlnAsn: 0.44 ± 0.14
GlnPro: 2.012 ± 0.387
GlnGln: 1.824 ± 0.36
GlnArg: 1.761 ± 0.376
GlnSer: 2.012 ± 0.26
GlnThr: 1.698 ± 0.278
GlnVal: 2.704 ± 0.359
GlnTrp: 0.629 ± 0.18
GlnTyr: 0.692 ± 0.195
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.163 ± 0.807
ArgCys: 0.755 ± 0.196
ArgAsp: 2.83 ± 0.341
ArgGlu: 5.031 ± 0.73
ArgPhe: 2.516 ± 0.549
ArgGly: 5.094 ± 0.605
ArgHis: 1.258 ± 0.323
ArgIle: 3.144 ± 0.501
ArgLys: 3.144 ± 0.399
ArgLeu: 5.66 ± 0.547
ArgMet: 2.138 ± 0.398
ArgAsn: 1.95 ± 0.399
ArgPro: 2.264 ± 0.414
ArgGln: 1.95 ± 0.368
ArgArg: 5.471 ± 0.821
ArgSer: 3.773 ± 0.497
ArgThr: 2.83 ± 0.438
ArgVal: 4.968 ± 0.521
ArgTrp: 1.195 ± 0.272
ArgTyr: 1.572 ± 0.282
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.415 ± 0.752
SerCys: 0.377 ± 0.161
SerAsp: 3.144 ± 0.42
SerGlu: 3.836 ± 0.494
SerPhe: 2.012 ± 0.417
SerGly: 6.289 ± 0.619
SerHis: 1.572 ± 0.273
SerIle: 2.893 ± 0.453
SerLys: 2.516 ± 0.4
SerLeu: 5.408 ± 0.522
SerMet: 1.446 ± 0.277
SerAsn: 2.578 ± 0.389
SerPro: 2.956 ± 0.365
SerGln: 2.012 ± 0.317
SerArg: 2.83 ± 0.403
SerSer: 3.648 ± 0.601
SerThr: 3.144 ± 0.482
SerVal: 3.899 ± 0.46
SerTrp: 1.384 ± 0.321
SerTyr: 1.321 ± 0.306
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.415 ± 0.722
ThrCys: 0.377 ± 0.149
ThrAsp: 4.465 ± 0.642
ThrGlu: 3.962 ± 0.48
ThrPhe: 2.264 ± 0.42
ThrGly: 6.478 ± 0.661
ThrHis: 1.069 ± 0.256
ThrIle: 3.207 ± 0.528
ThrLys: 2.578 ± 0.387
ThrLeu: 5.408 ± 0.658
ThrMet: 0.88 ± 0.214
ThrAsn: 1.446 ± 0.338
ThrPro: 4.214 ± 0.539
ThrGln: 2.138 ± 0.371
ThrArg: 3.333 ± 0.524
ThrSer: 3.522 ± 0.52
ThrThr: 4.276 ± 0.582
ThrVal: 5.283 ± 0.64
ThrTrp: 1.195 ± 0.266
ThrTyr: 2.012 ± 0.362
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.666 ± 0.881
ValCys: 0.503 ± 0.211
ValAsp: 5.346 ± 0.567
ValGlu: 4.654 ± 0.525
ValPhe: 2.516 ± 0.357
ValGly: 4.591 ± 0.608
ValHis: 1.509 ± 0.275
ValIle: 3.836 ± 0.49
ValLys: 2.767 ± 0.402
ValLeu: 4.968 ± 0.561
ValMet: 1.321 ± 0.229
ValAsn: 2.516 ± 0.362
ValPro: 4.276 ± 0.518
ValGln: 2.075 ± 0.366
ValArg: 4.968 ± 0.739
ValSer: 4.717 ± 0.503
ValThr: 6.1 ± 0.681
ValVal: 4.842 ± 0.647
ValTrp: 1.132 ± 0.26
ValTyr: 2.453 ± 0.376
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.761 ± 0.31
TrpCys: 0.189 ± 0.096
TrpAsp: 1.384 ± 0.276
TrpGlu: 0.755 ± 0.198
TrpPhe: 0.88 ± 0.245
TrpGly: 1.824 ± 0.305
TrpHis: 0.377 ± 0.147
TrpIle: 0.88 ± 0.184
TrpLys: 0.44 ± 0.206
TrpLeu: 1.95 ± 0.321
TrpMet: 0.377 ± 0.177
TrpAsn: 0.44 ± 0.139
TrpPro: 0.818 ± 0.253
TrpGln: 1.006 ± 0.278
TrpArg: 1.321 ± 0.329
TrpSer: 1.006 ± 0.26
TrpThr: 1.824 ± 0.412
TrpVal: 2.264 ± 0.316
TrpTrp: 0.503 ± 0.222
TrpTyr: 0.189 ± 0.093
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.95 ± 0.369
TyrCys: 0.314 ± 0.14
TyrAsp: 1.321 ± 0.297
TyrGlu: 2.39 ± 0.334
TyrPhe: 0.629 ± 0.149
TyrGly: 2.264 ± 0.359
TyrHis: 0.818 ± 0.225
TyrIle: 1.635 ± 0.349
TyrLys: 1.258 ± 0.236
TyrLeu: 2.578 ± 0.38
TyrMet: 0.566 ± 0.164
TyrAsn: 1.132 ± 0.296
TyrPro: 1.321 ± 0.295
TyrGln: 1.069 ± 0.257
TyrArg: 2.956 ± 0.443
TyrSer: 1.635 ± 0.252
TyrThr: 2.075 ± 0.331
TyrVal: 2.012 ± 0.295
TyrTrp: 0.377 ± 0.154
TyrTyr: 0.692 ± 0.244
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (15902 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
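For readers unfamiliar with the technique, the following Python sketch shows one way a protein-level bootstrap with 100 replicates could be set up. It is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the sample standard deviation of the replicates is reported as the error. The function names, the fixed seed, and the use of the standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Dipeptide frequencies in per mille (overlapping pairs, per protein)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation over n_boot
    replicates, each built by drawing len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy proteome in one-letter codes; real input would be the 86 proteins.
toy_proteome = ["MAAG", "AAL", "MKLV"]
error_aa = bootstrap_error(toy_proteome, "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is what "at the protein level" suggests.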

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski