Amino acid dipeptide frequency for Streptomyces phage Heather

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random with equal probability, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
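The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by the database: the function names, the one-letter alphabet with "X" standing in for Xaa, the toy sequences, and the choice to normalise by the total number of dipeptides counted are all hypothetical.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes; "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Dipeptides are counted with a sliding window of width 2 inside each
    protein, so no pair spans two different proteins. Any residue outside
    the 20 standard letters is mapped to "X" (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    pairs = [a + b for a, b in product(ALPHABET, repeat=2)]
    if total == 0:
        return {pair: 0.0 for pair in pairs}
    return {pair: 1000.0 * counts[pair] / total for pair in pairs}


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input for illustration only, not the phage proteome.
toy = ["MAAAV", "MAVK"]
freqs = dipeptide_per_mille(toy)
print(freqs["AA"], classify(freqs["AA"]))
```

With this labelling, a table value such as 13.4‰ would fall in the "over" (red) group and 0.082‰ in the "under" (blue) group.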

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 13.4‰ corresponds to 0.0134 (1.34%).

For more information, see the sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.4 ± 1.149
AlaCys: 0.98 ± 0.298
AlaAsp: 9.56 ± 0.807
AlaGlu: 7.435 ± 1.172
AlaPhe: 3.023 ± 0.51
AlaGly: 8.579 ± 0.942
AlaHis: 1.389 ± 0.323
AlaIle: 3.105 ± 0.687
AlaLys: 5.066 ± 0.834
AlaLeu: 10.703 ± 0.895
AlaMet: 3.105 ± 0.492
AlaAsn: 3.595 ± 0.606
AlaPro: 6.536 ± 0.795
AlaGln: 4.33 ± 0.589
AlaArg: 8.988 ± 0.958
AlaSer: 6.782 ± 0.855
AlaThr: 8.089 ± 0.953
AlaVal: 10.785 ± 0.978
AlaTrp: 2.206 ± 0.385
AlaTyr: 2.533 ± 0.486
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.144 ± 0.471
CysCys: 0.082 ± 0.076
CysAsp: 0.817 ± 0.304
CysGlu: 0.654 ± 0.224
CysPhe: 0.082 ± 0.081
CysGly: 1.062 ± 0.445
CysHis: 0.245 ± 0.147
CysIle: 0.49 ± 0.188
CysLys: 0.654 ± 0.256
CysLeu: 0.327 ± 0.196
CysMet: 0.082 ± 0.08
CysAsn: 0.49 ± 0.225
CysPro: 0.817 ± 0.39
CysGln: 0.245 ± 0.141
CysArg: 0.654 ± 0.261
CysSer: 0.409 ± 0.161
CysThr: 0.327 ± 0.149
CysVal: 0.49 ± 0.211
CysTrp: 0.082 ± 0.08
CysTyr: 0.082 ± 0.081
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.661 ± 0.884
AspCys: 0.0 ± 0.0
AspAsp: 4.167 ± 0.719
AspGlu: 5.229 ± 0.851
AspPhe: 2.86 ± 0.435
AspGly: 6.373 ± 0.782
AspHis: 0.899 ± 0.312
AspIle: 1.716 ± 0.306
AspLys: 2.369 ± 0.551
AspLeu: 6.373 ± 0.642
AspMet: 1.389 ± 0.335
AspAsn: 1.552 ± 0.359
AspPro: 4.085 ± 0.609
AspGln: 0.98 ± 0.217
AspArg: 4.085 ± 0.637
AspSer: 2.86 ± 0.488
AspThr: 3.677 ± 0.577
AspVal: 5.147 ± 0.714
AspTrp: 0.817 ± 0.281
AspTyr: 1.798 ± 0.426
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.945 ± 0.87
GluCys: 0.245 ± 0.169
GluAsp: 3.84 ± 0.693
GluGlu: 2.451 ± 0.464
GluPhe: 2.696 ± 0.5
GluGly: 3.758 ± 0.724
GluHis: 1.062 ± 0.337
GluIle: 2.941 ± 0.572
GluLys: 2.533 ± 0.739
GluLeu: 5.719 ± 0.732
GluMet: 0.735 ± 0.26
GluAsn: 1.471 ± 0.361
GluPro: 3.84 ± 0.831
GluGln: 1.062 ± 0.268
GluArg: 5.393 ± 0.786
GluSer: 3.023 ± 0.552
GluThr: 4.167 ± 0.648
GluVal: 3.677 ± 0.476
GluTrp: 1.879 ± 0.329
GluTyr: 1.389 ± 0.366
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.595 ± 0.501
PheCys: 0.245 ± 0.149
PheAsp: 1.961 ± 0.452
PheGlu: 1.144 ± 0.274
PhePhe: 0.654 ± 0.237
PheGly: 3.187 ± 0.57
PheHis: 0.082 ± 0.081
PheIle: 1.062 ± 0.265
PheLys: 1.961 ± 0.399
PheLeu: 2.288 ± 0.433
PheMet: 0.98 ± 0.253
PheAsn: 1.798 ± 0.377
PhePro: 2.043 ± 0.433
PheGln: 0.49 ± 0.202
PheArg: 1.634 ± 0.413
PheSer: 2.043 ± 0.462
PheThr: 2.778 ± 0.427
PheVal: 1.879 ± 0.48
PheTrp: 0.163 ± 0.12
PheTyr: 0.899 ± 0.234
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.213 ± 1.493
GlyCys: 0.735 ± 0.242
GlyAsp: 4.657 ± 0.668
GlyGlu: 4.576 ± 0.646
GlyPhe: 3.35 ± 0.453
GlyGly: 6.618 ± 0.855
GlyHis: 1.471 ± 0.282
GlyIle: 3.105 ± 0.532
GlyLys: 4.739 ± 0.501
GlyLeu: 8.171 ± 1.169
GlyMet: 1.471 ± 0.387
GlyAsn: 2.86 ± 0.54
GlyPro: 4.085 ± 0.601
GlyGln: 2.86 ± 0.429
GlyArg: 4.33 ± 0.559
GlySer: 5.147 ± 0.708
GlyThr: 4.984 ± 0.766
GlyVal: 7.027 ± 0.877
GlyTrp: 1.552 ± 0.359
GlyTyr: 2.778 ± 0.725
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.798 ± 0.369
HisCys: 0.327 ± 0.17
HisAsp: 1.062 ± 0.25
HisGlu: 1.798 ± 0.442
HisPhe: 0.49 ± 0.213
HisGly: 1.226 ± 0.35
HisHis: 0.163 ± 0.116
HisIle: 0.817 ± 0.277
HisLys: 1.144 ± 0.353
HisLeu: 0.899 ± 0.259
HisMet: 0.409 ± 0.211
HisAsn: 0.327 ± 0.166
HisPro: 1.062 ± 0.338
HisGln: 0.163 ± 0.107
HisArg: 1.226 ± 0.311
HisSer: 1.389 ± 0.357
HisThr: 0.735 ± 0.23
HisVal: 0.98 ± 0.333
HisTrp: 0.49 ± 0.21
HisTyr: 0.245 ± 0.13
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.004 ± 0.686
IleCys: 0.327 ± 0.184
IleAsp: 2.86 ± 0.439
IleGlu: 2.288 ± 0.489
IlePhe: 0.735 ± 0.237
IleGly: 4.004 ± 0.69
IleHis: 0.735 ± 0.31
IleIle: 0.817 ± 0.356
IleLys: 1.798 ± 0.386
IleLeu: 2.124 ± 0.404
IleMet: 0.899 ± 0.28
IleAsn: 1.144 ± 0.384
IlePro: 1.879 ± 0.318
IleGln: 1.307 ± 0.321
IleArg: 2.533 ± 0.49
IleSer: 1.471 ± 0.323
IleThr: 2.941 ± 0.479
IleVal: 3.268 ± 0.545
IleTrp: 0.409 ± 0.169
IleTyr: 0.899 ± 0.238
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.21 ± 1.048
LysCys: 0.327 ± 0.173
LysAsp: 2.533 ± 0.416
LysGlu: 2.86 ± 0.451
LysPhe: 0.817 ± 0.315
LysGly: 3.595 ± 0.552
LysHis: 0.98 ± 0.284
LysIle: 1.716 ± 0.432
LysLys: 2.043 ± 0.56
LysLeu: 3.922 ± 0.69
LysMet: 0.327 ± 0.19
LysAsn: 1.389 ± 0.359
LysPro: 3.268 ± 0.514
LysGln: 1.144 ± 0.35
LysArg: 3.84 ± 0.728
LysSer: 3.105 ± 0.616
LysThr: 2.86 ± 0.389
LysVal: 3.268 ± 0.535
LysTrp: 0.735 ± 0.233
LysTyr: 1.389 ± 0.333
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.416 ± 1.061
LeuCys: 0.817 ± 0.23
LeuAsp: 6.046 ± 0.762
LeuGlu: 3.758 ± 0.608
LeuPhe: 2.288 ± 0.363
LeuGly: 5.393 ± 0.652
LeuHis: 1.307 ± 0.38
LeuIle: 3.677 ± 0.667
LeuLys: 3.432 ± 0.613
LeuLeu: 5.638 ± 0.721
LeuMet: 1.307 ± 0.31
LeuAsn: 2.369 ± 0.423
LeuPro: 5.311 ± 0.661
LeuGln: 1.634 ± 0.36
LeuArg: 6.128 ± 0.801
LeuSer: 6.291 ± 0.89
LeuThr: 7.027 ± 0.93
LeuVal: 6.373 ± 0.763
LeuTrp: 0.735 ± 0.222
LeuTyr: 1.961 ± 0.446
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.268 ± 0.561
MetCys: 0.327 ± 0.18
MetAsp: 0.899 ± 0.275
MetGlu: 0.327 ± 0.156
MetPhe: 0.409 ± 0.164
MetGly: 1.961 ± 0.396
MetHis: 0.409 ± 0.185
MetIle: 0.654 ± 0.216
MetLys: 0.899 ± 0.397
MetLeu: 1.634 ± 0.36
MetMet: 0.327 ± 0.192
MetAsn: 0.817 ± 0.258
MetPro: 0.98 ± 0.27
MetGln: 0.572 ± 0.206
MetArg: 1.716 ± 0.451
MetSer: 1.389 ± 0.316
MetThr: 2.206 ± 0.362
MetVal: 1.226 ± 0.34
MetTrp: 0.245 ± 0.142
MetTyr: 0.327 ± 0.17
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.494 ± 0.63
AsnCys: 0.082 ± 0.08
AsnAsp: 2.451 ± 0.469
AsnGlu: 2.124 ± 0.353
AsnPhe: 0.817 ± 0.21
AsnGly: 4.494 ± 0.501
AsnHis: 0.572 ± 0.184
AsnIle: 1.226 ± 0.32
AsnLys: 1.471 ± 0.378
AsnLeu: 2.533 ± 0.388
AsnMet: 0.817 ± 0.3
AsnAsn: 0.49 ± 0.188
AsnPro: 2.696 ± 0.473
AsnGln: 0.409 ± 0.252
AsnArg: 1.062 ± 0.275
AsnSer: 0.98 ± 0.267
AsnThr: 1.226 ± 0.3
AsnVal: 2.043 ± 0.356
AsnTrp: 0.327 ± 0.168
AsnTyr: 0.735 ± 0.209
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.517 ± 1.019
ProCys: 0.654 ± 0.293
ProAsp: 3.84 ± 0.597
ProGlu: 4.657 ± 0.829
ProPhe: 1.716 ± 0.363
ProGly: 5.638 ± 0.919
ProHis: 1.144 ± 0.339
ProIle: 1.961 ± 0.45
ProLys: 3.268 ± 0.632
ProLeu: 2.533 ± 0.555
ProMet: 0.899 ± 0.233
ProAsn: 2.206 ± 0.425
ProPro: 1.552 ± 0.441
ProGln: 1.144 ± 0.328
ProArg: 2.369 ± 0.452
ProSer: 3.432 ± 0.446
ProThr: 4.004 ± 0.591
ProVal: 5.311 ± 0.893
ProTrp: 0.899 ± 0.32
ProTyr: 1.389 ± 0.297
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.187 ± 0.545
GlnCys: 0.654 ± 0.268
GlnAsp: 1.389 ± 0.278
GlnGlu: 1.144 ± 0.299
GlnPhe: 1.062 ± 0.243
GlnGly: 2.124 ± 0.4
GlnHis: 0.163 ± 0.107
GlnIle: 0.735 ± 0.229
GlnLys: 0.98 ± 0.314
GlnLeu: 1.716 ± 0.394
GlnMet: 0.817 ± 0.308
GlnAsn: 0.817 ± 0.254
GlnPro: 1.471 ± 0.339
GlnGln: 0.49 ± 0.24
GlnArg: 2.86 ± 0.503
GlnSer: 1.307 ± 0.275
GlnThr: 2.043 ± 0.457
GlnVal: 1.961 ± 0.319
GlnTrp: 0.817 ± 0.259
GlnTyr: 0.572 ± 0.22
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.19 ± 1.018
ArgCys: 0.49 ± 0.237
ArgAsp: 4.167 ± 0.638
ArgGlu: 4.902 ± 0.735
ArgPhe: 2.778 ± 0.457
ArgGly: 5.147 ± 0.673
ArgHis: 1.307 ± 0.435
ArgIle: 2.533 ± 0.432
ArgLys: 3.758 ± 0.58
ArgLeu: 5.719 ± 0.732
ArgMet: 2.124 ± 0.472
ArgAsn: 1.552 ± 0.312
ArgPro: 2.288 ± 0.511
ArgGln: 2.206 ± 0.422
ArgArg: 4.004 ± 0.642
ArgSer: 3.758 ± 0.648
ArgThr: 4.657 ± 0.542
ArgVal: 5.229 ± 0.724
ArgTrp: 1.389 ± 0.389
ArgTyr: 2.124 ± 0.446
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.435 ± 0.699
SerCys: 0.735 ± 0.215
SerAsp: 3.595 ± 0.742
SerGlu: 3.105 ± 0.544
SerPhe: 1.226 ± 0.317
SerGly: 4.902 ± 0.693
SerHis: 0.817 ± 0.316
SerIle: 3.023 ± 0.427
SerLys: 2.533 ± 0.599
SerLeu: 4.984 ± 0.521
SerMet: 0.654 ± 0.248
SerAsn: 2.206 ± 0.442
SerPro: 3.187 ± 0.642
SerGln: 1.716 ± 0.315
SerArg: 3.35 ± 0.524
SerSer: 2.615 ± 0.517
SerThr: 4.085 ± 0.602
SerVal: 5.147 ± 0.647
SerTrp: 1.552 ± 0.38
SerTyr: 1.226 ± 0.302
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.497 ± 0.816
ThrCys: 0.654 ± 0.267
ThrAsp: 3.758 ± 0.531
ThrGlu: 3.922 ± 0.651
ThrPhe: 2.696 ± 0.477
ThrGly: 7.19 ± 0.799
ThrHis: 1.307 ± 0.381
ThrIle: 2.206 ± 0.413
ThrLys: 3.105 ± 0.497
ThrLeu: 5.883 ± 0.678
ThrMet: 1.062 ± 0.33
ThrAsn: 1.961 ± 0.38
ThrPro: 3.84 ± 0.611
ThrGln: 1.961 ± 0.424
ThrArg: 3.268 ± 0.657
ThrSer: 4.739 ± 0.711
ThrThr: 4.085 ± 0.821
ThrVal: 6.373 ± 0.783
ThrTrp: 1.307 ± 0.344
ThrTyr: 2.369 ± 0.499
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.396 ± 0.866
ValCys: 0.899 ± 0.275
ValAsp: 5.147 ± 0.62
ValGlu: 3.105 ± 0.356
ValPhe: 2.124 ± 0.41
ValGly: 5.311 ± 0.67
ValHis: 1.798 ± 0.381
ValIle: 3.758 ± 0.568
ValLys: 2.778 ± 0.602
ValLeu: 5.474 ± 0.713
ValMet: 2.124 ± 0.449
ValAsn: 2.86 ± 0.507
ValPro: 5.311 ± 0.764
ValGln: 1.961 ± 0.328
ValArg: 5.719 ± 0.661
ValSer: 4.739 ± 0.557
ValThr: 7.517 ± 0.746
ValVal: 5.147 ± 0.737
ValTrp: 1.389 ± 0.372
ValTyr: 2.369 ± 0.395
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.307 ± 0.302
TrpCys: 0.409 ± 0.215
TrpAsp: 0.817 ± 0.301
TrpGlu: 1.471 ± 0.298
TrpPhe: 0.899 ± 0.354
TrpGly: 1.307 ± 0.299
TrpHis: 0.49 ± 0.207
TrpIle: 0.49 ± 0.19
TrpLys: 0.735 ± 0.344
TrpLeu: 1.634 ± 0.315
TrpMet: 0.082 ± 0.094
TrpAsn: 0.49 ± 0.207
TrpPro: 0.409 ± 0.197
TrpGln: 1.144 ± 0.253
TrpArg: 1.062 ± 0.299
TrpSer: 0.98 ± 0.243
TrpThr: 1.389 ± 0.296
TrpVal: 1.552 ± 0.46
TrpTrp: 0.245 ± 0.123
TrpTyr: 0.409 ± 0.149
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.513 ± 0.596
TyrCys: 0.245 ± 0.156
TyrAsp: 1.471 ± 0.295
TyrGlu: 1.961 ± 0.505
TyrPhe: 0.163 ± 0.111
TyrGly: 2.86 ± 0.534
TyrHis: 0.409 ± 0.176
TyrIle: 0.409 ± 0.198
TyrLys: 1.062 ± 0.287
TyrLeu: 1.634 ± 0.298
TyrMet: 0.899 ± 0.312
TyrAsn: 0.654 ± 0.242
TyrPro: 1.471 ± 0.428
TyrGln: 0.49 ± 0.204
TyrArg: 3.105 ± 0.752
TyrSer: 1.634 ± 0.388
TyrThr: 1.389 ± 0.319
TyrVal: 2.043 ± 0.286
TyrTrp: 0.082 ± 0.071
TyrTyr: 0.245 ± 0.133
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (12240 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
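A protein-level bootstrap of this kind might look like the following sketch. It assumes that whole proteins are resampled with replacement and that the reported error is the standard deviation of the frequency across resamples; the database does not state either detail here, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(estimates)


# Toy input for illustration only, not the phage proteome.
toy = ["MAAAV", "MAVK", "MKKAA"]
print(dipeptide_freq(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each dipeptide inside its original sequence context, which is presumably why the error is estimated at the protein level.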

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski