Amino acid dipeptide frequency for Gordonia phage McGonagall

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
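As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping pairs of adjacent residues within each protein and expresses them in per mille. The function name `dipeptide_per_mille`, the one-letter input sequences and the normalisation by the total number of counted pairs are assumptions made for this example; the page does not state exactly how the database normalises its values.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs inside one protein;
        # a pair never spans the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not taken from the phage proteome.
toy = ["MAAV", "MAV"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

In the toy input there are five pairs in total (MA, AA, AV from the first sequence; MA, AV from the second), so the frequencies sum to 1000‰.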

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
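The arithmetic behind the expected value can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and the simple "above or below the uniform expectation" rule is an assumption: the page does not say which baseline (400 or 441 dipeptides) or which threshold its red/blue colouring actually uses.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed, expected):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


expected_20 = expected_per_mille(20)  # 400 dipeptides -> 2.5 per mille (0.25%)
expected_21 = expected_per_mille(21)  # 441 dipeptides -> about 2.268 per mille

# Two values taken from the table on this page (in per mille).
print(classify(25.684, expected_20))  # AlaAla
print(classify(0.0, expected_20))     # CysCys
```

With the 20-letter baseline, AlaAla (25.684‰) is classified as overrepresented and CysCys (0.0‰) as underrepresented.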

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
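For readers who want to convert the tabulated values, a minimal sketch of the unit conversion is given below. The function names are made up for this example; the value 25.684 is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 25.684  # AlaAla, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
print(fraction, percent)
```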

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue; second residue in the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 25.684 ± 2.508
AlaCys: 1.478 ± 0.411
AlaAsp: 9.424 ± 1.258
AlaGlu: 6.098 ± 1.126
AlaPhe: 3.695 ± 0.843
AlaGly: 10.717 ± 1.739
AlaHis: 2.956 ± 0.877
AlaIle: 7.206 ± 2.08
AlaLys: 3.88 ± 0.759
AlaLeu: 10.163 ± 1.01
AlaMet: 2.956 ± 0.863
AlaAsn: 3.695 ± 0.786
AlaPro: 7.576 ± 1.12
AlaGln: 3.141 ± 0.487
AlaArg: 7.761 ± 1.831
AlaSer: 7.021 ± 1.283
AlaThr: 12.565 ± 1.339
AlaVal: 13.858 ± 3.018
AlaTrp: 2.402 ± 0.37
AlaTyr: 3.511 ± 0.642
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.924 ± 0.353
CysCys: 0.0 ± 0.0
CysAsp: 0.37 ± 0.265
CysGlu: 0.185 ± 0.202
CysPhe: 0.37 ± 0.248
CysGly: 0.924 ± 0.451
CysHis: 0.37 ± 0.262
CysIle: 0.185 ± 0.155
CysLys: 0.185 ± 0.155
CysLeu: 1.109 ± 0.39
CysMet: 0.185 ± 0.163
CysAsn: 0.37 ± 0.251
CysPro: 0.739 ± 0.432
CysGln: 0.37 ± 0.28
CysArg: 1.293 ± 0.534
CysSer: 0.739 ± 0.406
CysThr: 0.554 ± 0.505
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.945 ± 1.363
AspCys: 0.185 ± 0.191
AspAsp: 4.619 ± 0.989
AspGlu: 3.141 ± 0.836
AspPhe: 1.663 ± 0.464
AspGly: 4.989 ± 0.797
AspHis: 1.478 ± 0.488
AspIle: 2.033 ± 0.635
AspLys: 2.402 ± 0.617
AspLeu: 4.619 ± 1.109
AspMet: 0.924 ± 0.394
AspAsn: 1.293 ± 0.348
AspPro: 4.619 ± 1.142
AspGln: 2.402 ± 0.676
AspArg: 5.543 ± 1.243
AspSer: 4.435 ± 1.176
AspThr: 3.326 ± 0.689
AspVal: 4.804 ± 1.141
AspTrp: 1.109 ± 0.433
AspTyr: 1.109 ± 0.436
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.511 ± 0.738
GluCys: 0.554 ± 0.348
GluAsp: 0.924 ± 0.439
GluGlu: 0.554 ± 0.376
GluPhe: 1.293 ± 0.62
GluGly: 2.217 ± 0.505
GluHis: 1.848 ± 0.519
GluIle: 1.293 ± 0.503
GluLys: 0.554 ± 0.378
GluLeu: 7.206 ± 1.35
GluMet: 1.109 ± 0.583
GluAsn: 0.924 ± 0.389
GluPro: 2.402 ± 0.629
GluGln: 2.033 ± 0.636
GluArg: 2.956 ± 0.742
GluSer: 3.141 ± 0.787
GluThr: 2.217 ± 0.883
GluVal: 2.033 ± 0.849
GluTrp: 0.37 ± 0.248
GluTyr: 0.554 ± 0.35
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.587 ± 0.658
PheCys: 0.185 ± 0.155
PheAsp: 2.956 ± 0.605
PheGlu: 1.293 ± 0.421
PhePhe: 0.739 ± 0.289
PheGly: 2.772 ± 0.661
PheHis: 0.924 ± 0.445
PheIle: 1.293 ± 0.444
PheLys: 1.109 ± 0.486
PheLeu: 1.478 ± 0.522
PheMet: 0.554 ± 0.448
PheAsn: 0.739 ± 0.5
PhePro: 0.554 ± 0.301
PheGln: 0.37 ± 0.248
PheArg: 0.554 ± 0.254
PheSer: 1.293 ± 0.46
PheThr: 2.033 ± 0.375
PheVal: 2.217 ± 0.782
PheTrp: 0.739 ± 0.474
PheTyr: 0.37 ± 0.259
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.793 ± 1.827
GlyCys: 0.739 ± 0.39
GlyAsp: 5.728 ± 1.256
GlyGlu: 4.619 ± 0.864
GlyPhe: 1.848 ± 0.621
GlyGly: 9.608 ± 4.154
GlyHis: 2.033 ± 0.631
GlyIle: 2.956 ± 1.103
GlyLys: 1.478 ± 0.453
GlyLeu: 5.913 ± 0.765
GlyMet: 1.109 ± 0.507
GlyAsn: 1.478 ± 0.565
GlyPro: 5.358 ± 1.113
GlyGln: 2.772 ± 0.553
GlyArg: 7.021 ± 0.973
GlySer: 6.467 ± 1.208
GlyThr: 5.174 ± 1.574
GlyVal: 7.021 ± 1.024
GlyTrp: 1.663 ± 0.46
GlyTyr: 1.293 ± 0.399
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.217 ± 0.616
HisCys: 0.185 ± 0.202
HisAsp: 1.663 ± 0.489
HisGlu: 0.739 ± 0.334
HisPhe: 0.185 ± 0.202
HisGly: 1.663 ± 0.583
HisHis: 0.554 ± 0.358
HisIle: 0.37 ± 0.241
HisLys: 0.37 ± 0.24
HisLeu: 2.956 ± 0.75
HisMet: 0.554 ± 0.266
HisAsn: 0.554 ± 0.329
HisPro: 2.217 ± 0.638
HisGln: 0.739 ± 0.369
HisArg: 2.033 ± 0.533
HisSer: 0.185 ± 0.222
HisThr: 2.033 ± 0.778
HisVal: 1.478 ± 0.587
HisTrp: 0.554 ± 0.257
HisTyr: 0.37 ± 0.285
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.282 ± 1.164
IleCys: 0.185 ± 0.191
IleAsp: 3.511 ± 0.583
IleGlu: 0.37 ± 0.256
IlePhe: 0.37 ± 0.234
IleGly: 3.326 ± 0.903
IleHis: 0.924 ± 0.342
IleIle: 1.848 ± 0.67
IleLys: 1.109 ± 0.5
IleLeu: 4.065 ± 1.126
IleMet: 0.924 ± 0.357
IleAsn: 0.739 ± 0.51
IlePro: 3.141 ± 0.672
IleGln: 1.663 ± 1.076
IleArg: 1.848 ± 0.513
IleSer: 2.587 ± 0.723
IleThr: 3.695 ± 0.965
IleVal: 4.065 ± 0.793
IleTrp: 0.924 ± 0.34
IleTyr: 0.37 ± 0.248
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.511 ± 0.667
LysCys: 0.554 ± 0.359
LysAsp: 0.924 ± 0.374
LysGlu: 0.37 ± 0.387
LysPhe: 0.924 ± 0.408
LysGly: 0.924 ± 0.523
LysHis: 0.554 ± 0.268
LysIle: 0.739 ± 0.316
LysLys: 0.924 ± 0.41
LysLeu: 2.772 ± 0.62
LysMet: 0.554 ± 0.309
LysAsn: 0.739 ± 0.413
LysPro: 2.772 ± 0.794
LysGln: 0.739 ± 0.293
LysArg: 1.663 ± 0.554
LysSer: 1.293 ± 0.436
LysThr: 1.478 ± 0.39
LysVal: 1.293 ± 0.694
LysTrp: 0.554 ± 0.281
LysTyr: 0.185 ± 0.191
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.304 ± 1.816
LeuCys: 1.663 ± 0.518
LeuAsp: 5.728 ± 0.984
LeuGlu: 4.619 ± 1.091
LeuPhe: 1.109 ± 0.362
LeuGly: 7.576 ± 0.949
LeuHis: 2.402 ± 0.708
LeuIle: 5.358 ± 1.11
LeuLys: 1.293 ± 0.475
LeuLeu: 6.652 ± 0.881
LeuMet: 1.663 ± 0.598
LeuAsn: 2.217 ± 0.612
LeuPro: 4.619 ± 1.13
LeuGln: 2.033 ± 0.445
LeuArg: 5.543 ± 1.263
LeuSer: 4.989 ± 1.098
LeuThr: 5.913 ± 1.064
LeuVal: 7.206 ± 0.987
LeuTrp: 1.848 ± 0.696
LeuTyr: 1.293 ± 0.604
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.956 ± 0.693
MetCys: 0.185 ± 0.201
MetAsp: 0.924 ± 0.414
MetGlu: 0.37 ± 0.267
MetPhe: 0.554 ± 0.284
MetGly: 0.554 ± 0.3
MetHis: 0.37 ± 0.233
MetIle: 0.739 ± 0.268
MetLys: 0.554 ± 0.282
MetLeu: 2.772 ± 0.674
MetMet: 1.293 ± 0.487
MetAsn: 0.554 ± 0.336
MetPro: 0.739 ± 0.339
MetGln: 0.554 ± 0.264
MetArg: 0.37 ± 0.302
MetSer: 3.141 ± 0.638
MetThr: 1.848 ± 0.595
MetVal: 1.109 ± 0.467
MetTrp: 0.37 ± 0.239
MetTyr: 0.185 ± 0.158
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.25 ± 1.018
AsnCys: 0.0 ± 0.0
AsnAsp: 1.478 ± 0.663
AsnGlu: 0.554 ± 0.336
AsnPhe: 0.185 ± 0.15
AsnGly: 2.772 ± 1.011
AsnHis: 0.554 ± 0.36
AsnIle: 0.739 ± 0.297
AsnLys: 0.554 ± 0.276
AsnLeu: 2.772 ± 0.62
AsnMet: 0.37 ± 0.3
AsnAsn: 0.185 ± 0.188
AsnPro: 1.663 ± 0.487
AsnGln: 1.109 ± 0.504
AsnArg: 2.033 ± 0.649
AsnSer: 1.848 ± 0.92
AsnThr: 1.848 ± 0.752
AsnVal: 2.033 ± 0.692
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.185 ± 0.188
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 14.597 ± 2.963
ProCys: 0.554 ± 0.34
ProAsp: 4.619 ± 0.901
ProGlu: 1.109 ± 0.592
ProPhe: 1.293 ± 0.464
ProGly: 5.358 ± 1.243
ProHis: 1.293 ± 0.451
ProIle: 2.402 ± 0.658
ProLys: 1.109 ± 0.315
ProLeu: 4.619 ± 1.24
ProMet: 1.109 ± 0.491
ProAsn: 0.924 ± 0.546
ProPro: 4.804 ± 0.801
ProGln: 1.293 ± 0.366
ProArg: 4.435 ± 0.997
ProSer: 4.435 ± 0.773
ProThr: 4.804 ± 0.999
ProVal: 3.141 ± 0.628
ProTrp: 0.924 ± 0.405
ProTyr: 2.033 ± 0.611
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.587 ± 0.704
GlnCys: 0.0 ± 0.0
GlnAsp: 0.739 ± 0.386
GlnGlu: 1.109 ± 0.42
GlnPhe: 1.478 ± 0.497
GlnGly: 1.848 ± 0.656
GlnHis: 0.185 ± 0.202
GlnIle: 0.924 ± 0.352
GlnLys: 1.109 ± 0.39
GlnLeu: 5.174 ± 0.894
GlnMet: 0.739 ± 0.381
GlnAsn: 0.924 ± 0.36
GlnPro: 1.848 ± 0.516
GlnGln: 0.924 ± 0.338
GlnArg: 1.848 ± 0.803
GlnSer: 1.848 ± 0.497
GlnThr: 2.956 ± 0.539
GlnVal: 2.772 ± 0.614
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.37 ± 0.299
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.608 ± 1.696
ArgCys: 0.554 ± 0.316
ArgAsp: 4.435 ± 0.853
ArgGlu: 2.402 ± 0.592
ArgPhe: 2.033 ± 0.6
ArgGly: 5.913 ± 1.074
ArgHis: 1.293 ± 0.495
ArgIle: 2.033 ± 0.567
ArgLys: 1.293 ± 0.442
ArgLeu: 5.358 ± 1.038
ArgMet: 1.293 ± 0.447
ArgAsn: 2.033 ± 0.561
ArgPro: 4.435 ± 1.109
ArgGln: 2.033 ± 0.824
ArgArg: 4.25 ± 1.465
ArgSer: 3.88 ± 0.787
ArgThr: 4.065 ± 0.84
ArgVal: 4.804 ± 1.105
ArgTrp: 0.739 ± 0.358
ArgTyr: 1.478 ± 0.593
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.315 ± 1.465
SerCys: 0.554 ± 0.325
SerAsp: 4.435 ± 1.109
SerGlu: 2.772 ± 0.58
SerPhe: 1.848 ± 0.552
SerGly: 5.543 ± 1.013
SerHis: 0.554 ± 0.339
SerIle: 1.663 ± 0.494
SerLys: 1.478 ± 0.509
SerLeu: 5.174 ± 0.834
SerMet: 1.478 ± 0.38
SerAsn: 1.293 ± 0.662
SerPro: 3.88 ± 0.717
SerGln: 2.587 ± 0.966
SerArg: 3.141 ± 0.653
SerSer: 5.543 ± 0.994
SerThr: 4.435 ± 0.888
SerVal: 4.989 ± 1.243
SerTrp: 1.478 ± 0.547
SerTyr: 1.663 ± 0.531
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.424 ± 1.151
ThrCys: 0.37 ± 0.218
ThrAsp: 4.435 ± 1.122
ThrGlu: 2.772 ± 0.701
ThrPhe: 2.587 ± 0.601
ThrGly: 8.5 ± 1.146
ThrHis: 1.478 ± 0.473
ThrIle: 4.619 ± 1.131
ThrLys: 0.924 ± 0.543
ThrLeu: 6.837 ± 1.878
ThrMet: 0.924 ± 0.313
ThrAsn: 1.663 ± 0.496
ThrPro: 7.021 ± 1.003
ThrGln: 1.663 ± 0.577
ThrArg: 3.141 ± 0.728
ThrSer: 2.402 ± 0.615
ThrThr: 8.684 ± 1.291
ThrVal: 7.576 ± 1.214
ThrTrp: 1.109 ± 0.427
ThrTyr: 1.293 ± 0.421
ThrXaa: 0.0 ± 0.0
Val
ValAla: 14.782 ± 2.282
ValCys: 0.739 ± 0.311
ValAsp: 3.511 ± 0.698
ValGlu: 3.141 ± 0.846
ValPhe: 1.663 ± 0.346
ValGly: 6.098 ± 1.481
ValHis: 1.109 ± 0.635
ValIle: 4.804 ± 1.282
ValLys: 2.587 ± 0.837
ValLeu: 4.065 ± 0.792
ValMet: 1.109 ± 0.385
ValAsn: 2.402 ± 0.69
ValPro: 4.619 ± 0.73
ValGln: 1.293 ± 0.44
ValArg: 5.174 ± 0.807
ValSer: 4.989 ± 1.004
ValThr: 6.837 ± 1.356
ValVal: 9.608 ± 1.247
ValTrp: 2.772 ± 0.912
ValTyr: 1.109 ± 0.547
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.848 ± 0.474
TrpCys: 0.185 ± 0.158
TrpAsp: 0.37 ± 0.316
TrpGlu: 0.554 ± 0.282
TrpPhe: 0.554 ± 0.358
TrpGly: 0.554 ± 0.281
TrpHis: 0.554 ± 0.328
TrpIle: 0.185 ± 0.15
TrpLys: 0.37 ± 0.206
TrpLeu: 2.217 ± 0.73
TrpMet: 0.554 ± 0.225
TrpAsn: 1.848 ± 0.944
TrpPro: 0.739 ± 0.279
TrpGln: 0.739 ± 0.289
TrpArg: 1.848 ± 0.52
TrpSer: 1.848 ± 0.515
TrpThr: 1.109 ± 0.4
TrpVal: 0.924 ± 0.367
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.739 ± 0.365
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.663 ± 0.442
TyrCys: 0.0 ± 0.0
TyrAsp: 1.848 ± 0.74
TyrGlu: 0.924 ± 0.49
TyrPhe: 0.37 ± 0.211
TyrGly: 2.217 ± 0.645
TyrHis: 0.185 ± 0.166
TyrIle: 0.554 ± 0.236
TyrLys: 0.185 ± 0.204
TyrLeu: 1.293 ± 0.407
TyrMet: 0.554 ± 0.276
TyrAsn: 0.554 ± 0.261
TyrPro: 1.109 ± 0.43
TyrGln: 0.924 ± 0.354
TyrArg: 1.478 ± 0.472
TyrSer: 0.739 ± 0.341
TyrThr: 1.848 ± 0.603
TyrVal: 1.478 ± 0.614
TyrTrp: 0.185 ± 0.158
TyrTyr: 0.185 ± 0.158
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 27 proteins (5413 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
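A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. This is only an illustration under stated assumptions: the function names, the toy sequences, the fixed random seed and the use of the sample standard deviation as the error measure are not taken from the page, which does not say which statistic is reported after the ± sign.

```python
import random
from collections import Counter
from statistics import stdev


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency over resamples drawn at the protein level."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return stdev(estimates)  # assumed error measure


# Hypothetical toy sequences (one-letter codes), not the phage proteome.
toy = ["MAAV", "MAV", "MKAAL"]
err_aa = bootstrap_error(toy, "AA")
print(round(pair_per_mille(toy, "AA"), 3), "±", round(err_aa, 3))
```

A dipeptide that never occurs in any protein has a frequency of 0 in every resample, so its estimated error is also 0, which is consistent with the "0.0 ± 0.0" entries in the table.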

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski