Amino acid dipeptide frequency for Streptomyces phage LibertyBell

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
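The counting described above can be sketched in Python as follows. This is a minimal illustration, not the code used by Proteome-pI: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to use one-letter codes, and the denominator is assumed to be the total number of adjacent pairs counted within proteins (the source does not state the exact normalisation).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: any residue outside `alphabet` is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in alphabet else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    symbols = alphabet + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(symbols, repeat=2)
    }


def classify(freq_per_mille, n_residues=20):
    """Compare a frequency with the uniform expectation of 1000 / n_residues**2."""
    expected = 1000.0 / n_residues ** 2  # 2.5 per mille for 20 residues
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"
```

For example, `classify(11.569)` (the AlaAla value in the table) returns "over", while `classify(0.308)` (AlaCys) returns "under". The comparison here uses only the uniform 2.5‰ baseline mentioned in the text.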

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.569AlaAla: 11.569 ± 2.008
0.308AlaCys: 0.308 ± 0.176
4.923AlaAsp: 4.923 ± 0.611
6.277AlaGlu: 6.277 ± 0.747
2.954AlaPhe: 2.954 ± 0.357
7.815AlaGly: 7.815 ± 1.172
0.861AlaHis: 0.861 ± 0.183
5.169AlaIle: 5.169 ± 0.987
6.523AlaLys: 6.523 ± 0.689
6.769AlaLeu: 6.769 ± 1.337
3.877AlaMet: 3.877 ± 0.518
2.646AlaAsn: 2.646 ± 0.351
3.446AlaPro: 3.446 ± 0.544
3.692AlaGln: 3.692 ± 0.522
5.354AlaArg: 5.354 ± 0.569
4.123AlaSer: 4.123 ± 0.574
5.415AlaThr: 5.415 ± 0.585
6.03AlaVal: 6.03 ± 0.782
1.108AlaTrp: 1.108 ± 0.255
2.338AlaTyr: 2.338 ± 0.413
0.0AlaXaa: 0.0 ± 0.0
Cys
0.431CysAla: 0.431 ± 0.217
0.123CysCys: 0.123 ± 0.086
0.185CysAsp: 0.185 ± 0.099
0.308CysGlu: 0.308 ± 0.14
0.246CysPhe: 0.246 ± 0.132
0.185CysGly: 0.185 ± 0.113
0.123CysHis: 0.123 ± 0.087
0.246CysIle: 0.246 ± 0.149
0.123CysLys: 0.123 ± 0.162
0.615CysLeu: 0.615 ± 0.228
0.0CysMet: 0.0 ± 0.0
0.246CysAsn: 0.246 ± 0.118
0.123CysPro: 0.123 ± 0.094
0.123CysGln: 0.123 ± 0.091
0.369CysArg: 0.369 ± 0.178
0.185CysSer: 0.185 ± 0.104
0.369CysThr: 0.369 ± 0.156
0.308CysVal: 0.308 ± 0.178
0.185CysTrp: 0.185 ± 0.106
0.123CysTyr: 0.123 ± 0.087
0.0CysXaa: 0.0 ± 0.0
Asp
5.415AspAla: 5.415 ± 0.498
0.246AspCys: 0.246 ± 0.136
3.2AspAsp: 3.2 ± 0.473
4.554AspGlu: 4.554 ± 0.645
2.461AspPhe: 2.461 ± 0.436
6.153AspGly: 6.153 ± 0.537
1.169AspHis: 1.169 ± 0.234
3.754AspIle: 3.754 ± 0.671
3.138AspLys: 3.138 ± 0.526
4.677AspLeu: 4.677 ± 0.469
1.785AspMet: 1.785 ± 0.288
2.277AspAsn: 2.277 ± 0.53
3.015AspPro: 3.015 ± 0.519
2.461AspGln: 2.461 ± 0.372
3.261AspArg: 3.261 ± 0.577
2.523AspSer: 2.523 ± 0.44
3.077AspThr: 3.077 ± 0.425
3.692AspVal: 3.692 ± 0.548
1.169AspTrp: 1.169 ± 0.264
2.154AspTyr: 2.154 ± 0.305
0.0AspXaa: 0.0 ± 0.0
Glu
5.23GluAla: 5.23 ± 0.603
0.185GluCys: 0.185 ± 0.112
3.446GluAsp: 3.446 ± 0.57
4.984GluGlu: 4.984 ± 0.858
3.631GluPhe: 3.631 ± 0.529
4.923GluGly: 4.923 ± 0.676
1.477GluHis: 1.477 ± 0.337
4.307GluIle: 4.307 ± 0.49
3.754GluLys: 3.754 ± 0.577
5.846GluLeu: 5.846 ± 0.626
2.4GluMet: 2.4 ± 0.371
2.461GluAsn: 2.461 ± 0.45
2.584GluPro: 2.584 ± 0.482
2.461GluGln: 2.461 ± 0.397
4.0GluArg: 4.0 ± 0.689
2.523GluSer: 2.523 ± 0.373
4.677GluThr: 4.677 ± 0.658
4.43GluVal: 4.43 ± 0.767
0.923GluTrp: 0.923 ± 0.279
2.215GluTyr: 2.215 ± 0.431
0.0GluXaa: 0.0 ± 0.0
Phe
3.015PheAla: 3.015 ± 0.506
0.369PheCys: 0.369 ± 0.209
3.077PheAsp: 3.077 ± 0.469
2.954PheGlu: 2.954 ± 0.482
1.415PhePhe: 1.415 ± 0.285
3.077PheGly: 3.077 ± 0.468
0.369PheHis: 0.369 ± 0.166
1.354PheIle: 1.354 ± 0.307
2.708PheLys: 2.708 ± 0.507
2.154PheLeu: 2.154 ± 0.367
1.108PheMet: 1.108 ± 0.331
1.723PheAsn: 1.723 ± 0.416
1.354PhePro: 1.354 ± 0.257
1.538PheGln: 1.538 ± 0.278
2.154PheArg: 2.154 ± 0.327
2.215PheSer: 2.215 ± 0.517
1.846PheThr: 1.846 ± 0.473
3.2PheVal: 3.2 ± 0.486
0.985PheTrp: 0.985 ± 0.244
1.538PheTyr: 1.538 ± 0.345
0.0PheXaa: 0.0 ± 0.0
Gly
7.261GlyAla: 7.261 ± 0.866
0.308GlyCys: 0.308 ± 0.159
5.415GlyAsp: 5.415 ± 0.567
4.554GlyGlu: 4.554 ± 0.492
2.708GlyPhe: 2.708 ± 0.479
4.738GlyGly: 4.738 ± 0.6
1.046GlyHis: 1.046 ± 0.313
4.984GlyIle: 4.984 ± 0.889
4.369GlyLys: 4.369 ± 0.738
6.092GlyLeu: 6.092 ± 1.289
2.4GlyMet: 2.4 ± 0.529
3.323GlyAsn: 3.323 ± 0.486
1.908GlyPro: 1.908 ± 0.363
2.031GlyGln: 2.031 ± 0.316
4.0GlyArg: 4.0 ± 0.505
4.923GlySer: 4.923 ± 0.819
5.969GlyThr: 5.969 ± 0.641
6.338GlyVal: 6.338 ± 0.858
1.046GlyTrp: 1.046 ± 0.289
3.446GlyTyr: 3.446 ± 0.614
0.0GlyXaa: 0.0 ± 0.0
His
0.615HisAla: 0.615 ± 0.193
0.123HisCys: 0.123 ± 0.121
1.108HisAsp: 1.108 ± 0.289
1.231HisGlu: 1.231 ± 0.284
0.861HisPhe: 0.861 ± 0.238
1.477HisGly: 1.477 ± 0.346
0.554HisHis: 0.554 ± 0.157
1.415HisIle: 1.415 ± 0.286
0.615HisLys: 0.615 ± 0.221
1.538HisLeu: 1.538 ± 0.278
0.492HisMet: 0.492 ± 0.144
0.554HisAsn: 0.554 ± 0.193
0.923HisPro: 0.923 ± 0.292
0.615HisGln: 0.615 ± 0.18
0.861HisArg: 0.861 ± 0.291
1.108HisSer: 1.108 ± 0.289
0.677HisThr: 0.677 ± 0.202
1.231HisVal: 1.231 ± 0.312
0.185HisTrp: 0.185 ± 0.112
1.538HisTyr: 1.538 ± 0.31
0.0HisXaa: 0.0 ± 0.0
Ile
4.554IleAla: 4.554 ± 0.79
0.308IleCys: 0.308 ± 0.166
4.492IleAsp: 4.492 ± 0.411
4.677IleGlu: 4.677 ± 0.869
1.846IlePhe: 1.846 ± 0.425
4.369IleGly: 4.369 ± 0.972
1.108IleHis: 1.108 ± 0.267
3.077IleIle: 3.077 ± 0.669
4.43IleLys: 4.43 ± 0.542
4.677IleLeu: 4.677 ± 0.488
1.046IleMet: 1.046 ± 0.299
1.969IleAsn: 1.969 ± 0.311
2.215IlePro: 2.215 ± 0.367
2.215IleGln: 2.215 ± 0.379
2.769IleArg: 2.769 ± 0.491
3.261IleSer: 3.261 ± 0.497
4.246IleThr: 4.246 ± 0.621
3.569IleVal: 3.569 ± 0.395
1.108IleTrp: 1.108 ± 0.408
1.785IleTyr: 1.785 ± 0.305
0.0IleXaa: 0.0 ± 0.0
Lys
5.23LysAla: 5.23 ± 0.641
0.062LysCys: 0.062 ± 0.06
2.892LysAsp: 2.892 ± 0.605
3.446LysGlu: 3.446 ± 0.514
2.461LysPhe: 2.461 ± 0.518
4.246LysGly: 4.246 ± 0.817
0.985LysHis: 0.985 ± 0.227
4.0LysIle: 4.0 ± 0.486
5.23LysLys: 5.23 ± 0.745
5.046LysLeu: 5.046 ± 0.786
1.661LysMet: 1.661 ± 0.37
3.2LysAsn: 3.2 ± 0.498
2.277LysPro: 2.277 ± 0.406
2.215LysGln: 2.215 ± 0.397
3.261LysArg: 3.261 ± 0.655
4.43LysSer: 4.43 ± 0.627
4.861LysThr: 4.861 ± 0.623
4.677LysVal: 4.677 ± 0.491
0.985LysTrp: 0.985 ± 0.306
1.969LysTyr: 1.969 ± 0.377
0.0LysXaa: 0.0 ± 0.0
Leu
8.369LeuAla: 8.369 ± 1.428
0.308LeuCys: 0.308 ± 0.149
5.169LeuAsp: 5.169 ± 0.582
3.815LeuGlu: 3.815 ± 0.575
2.584LeuPhe: 2.584 ± 0.459
6.03LeuGly: 6.03 ± 1.009
1.415LeuHis: 1.415 ± 0.353
4.615LeuIle: 4.615 ± 0.612
6.277LeuLys: 6.277 ± 0.741
4.738LeuLeu: 4.738 ± 0.704
2.584LeuMet: 2.584 ± 0.423
3.323LeuAsn: 3.323 ± 0.363
2.277LeuPro: 2.277 ± 0.371
2.523LeuGln: 2.523 ± 0.42
4.615LeuArg: 4.615 ± 0.644
5.846LeuSer: 5.846 ± 0.63
6.277LeuThr: 6.277 ± 0.769
5.354LeuVal: 5.354 ± 0.596
0.677LeuTrp: 0.677 ± 0.197
2.215LeuTyr: 2.215 ± 0.402
0.0LeuXaa: 0.0 ± 0.0
Met
2.4MetAla: 2.4 ± 0.373
0.123MetCys: 0.123 ± 0.088
1.723MetAsp: 1.723 ± 0.272
1.785MetGlu: 1.785 ± 0.398
1.108MetPhe: 1.108 ± 0.354
1.354MetGly: 1.354 ± 0.276
0.308MetHis: 0.308 ± 0.128
1.908MetIle: 1.908 ± 0.314
2.092MetLys: 2.092 ± 0.342
2.769MetLeu: 2.769 ± 0.485
0.677MetMet: 0.677 ± 0.147
0.861MetAsn: 0.861 ± 0.156
1.354MetPro: 1.354 ± 0.33
0.923MetGln: 0.923 ± 0.192
1.477MetArg: 1.477 ± 0.285
2.092MetSer: 2.092 ± 0.324
2.708MetThr: 2.708 ± 0.463
1.908MetVal: 1.908 ± 0.444
0.185MetTrp: 0.185 ± 0.105
0.492MetTyr: 0.492 ± 0.144
0.0MetXaa: 0.0 ± 0.0
Asn
3.507AsnAla: 3.507 ± 0.566
0.185AsnCys: 0.185 ± 0.1
2.584AsnAsp: 2.584 ± 0.337
2.4AsnGlu: 2.4 ± 0.405
1.477AsnPhe: 1.477 ± 0.269
3.507AsnGly: 3.507 ± 0.547
0.8AsnHis: 0.8 ± 0.27
1.908AsnIle: 1.908 ± 0.344
2.031AsnLys: 2.031 ± 0.412
3.261AsnLeu: 3.261 ± 0.51
0.738AsnMet: 0.738 ± 0.225
1.6AsnAsn: 1.6 ± 0.336
2.461AsnPro: 2.461 ± 0.421
1.231AsnGln: 1.231 ± 0.261
3.631AsnArg: 3.631 ± 0.728
2.954AsnSer: 2.954 ± 0.451
2.461AsnThr: 2.461 ± 0.416
2.892AsnVal: 2.892 ± 0.384
0.431AsnTrp: 0.431 ± 0.167
1.169AsnTyr: 1.169 ± 0.219
0.0AsnXaa: 0.0 ± 0.0
Pro
2.584ProAla: 2.584 ± 0.346
0.369ProCys: 0.369 ± 0.188
2.769ProAsp: 2.769 ± 0.579
3.631ProGlu: 3.631 ± 0.603
1.477ProPhe: 1.477 ± 0.311
2.523ProGly: 2.523 ± 0.4
0.738ProHis: 0.738 ± 0.314
2.092ProIle: 2.092 ± 0.323
2.154ProLys: 2.154 ± 0.363
2.769ProLeu: 2.769 ± 0.511
1.292ProMet: 1.292 ± 0.312
1.6ProAsn: 1.6 ± 0.41
1.846ProPro: 1.846 ± 0.386
1.169ProGln: 1.169 ± 0.231
2.154ProArg: 2.154 ± 0.492
2.646ProSer: 2.646 ± 0.411
3.569ProThr: 3.569 ± 0.565
3.138ProVal: 3.138 ± 0.404
0.677ProTrp: 0.677 ± 0.269
1.046ProTyr: 1.046 ± 0.248
0.0ProXaa: 0.0 ± 0.0
Gln
3.323GlnAla: 3.323 ± 0.489
0.246GlnCys: 0.246 ± 0.175
1.846GlnAsp: 1.846 ± 0.345
1.723GlnGlu: 1.723 ± 0.391
1.046GlnPhe: 1.046 ± 0.274
2.584GlnGly: 2.584 ± 0.514
0.615GlnHis: 0.615 ± 0.201
2.584GlnIle: 2.584 ± 0.379
2.154GlnLys: 2.154 ± 0.412
3.261GlnLeu: 3.261 ± 0.546
1.169GlnMet: 1.169 ± 0.266
1.292GlnAsn: 1.292 ± 0.255
1.231GlnPro: 1.231 ± 0.261
1.538GlnGln: 1.538 ± 0.271
1.661GlnArg: 1.661 ± 0.304
1.846GlnSer: 1.846 ± 0.304
1.785GlnThr: 1.785 ± 0.403
2.708GlnVal: 2.708 ± 0.445
0.246GlnTrp: 0.246 ± 0.133
1.169GlnTyr: 1.169 ± 0.223
0.0GlnXaa: 0.0 ± 0.0
Arg
4.0ArgAla: 4.0 ± 0.587
0.369ArgCys: 0.369 ± 0.172
3.138ArgAsp: 3.138 ± 0.431
4.0ArgGlu: 4.0 ± 0.475
1.846ArgPhe: 1.846 ± 0.315
3.2ArgGly: 3.2 ± 0.493
1.415ArgHis: 1.415 ± 0.3
3.138ArgIle: 3.138 ± 0.467
3.323ArgLys: 3.323 ± 0.441
5.169ArgLeu: 5.169 ± 0.847
1.846ArgMet: 1.846 ± 0.33
3.015ArgAsn: 3.015 ± 0.418
2.4ArgPro: 2.4 ± 0.443
1.846ArgGln: 1.846 ± 0.407
5.107ArgArg: 5.107 ± 0.85
3.815ArgSer: 3.815 ± 0.63
3.015ArgThr: 3.015 ± 0.59
3.446ArgVal: 3.446 ± 0.521
0.985ArgTrp: 0.985 ± 0.267
2.154ArgTyr: 2.154 ± 0.471
0.0ArgXaa: 0.0 ± 0.0
Ser
5.354SerAla: 5.354 ± 0.69
0.123SerCys: 0.123 ± 0.097
3.323SerAsp: 3.323 ± 0.463
3.754SerGlu: 3.754 ± 0.393
2.031SerPhe: 2.031 ± 0.339
6.153SerGly: 6.153 ± 0.711
1.231SerHis: 1.231 ± 0.213
3.323SerIle: 3.323 ± 0.464
2.831SerLys: 2.831 ± 0.442
5.661SerLeu: 5.661 ± 0.555
1.169SerMet: 1.169 ± 0.238
2.523SerAsn: 2.523 ± 0.376
3.384SerPro: 3.384 ± 0.459
1.354SerGln: 1.354 ± 0.247
2.461SerArg: 2.461 ± 0.37
3.261SerSer: 3.261 ± 0.596
3.015SerThr: 3.015 ± 0.582
4.677SerVal: 4.677 ± 0.691
0.861SerTrp: 0.861 ± 0.259
1.969SerTyr: 1.969 ± 0.404
0.0SerXaa: 0.0 ± 0.0
Thr
7.076ThrAla: 7.076 ± 0.975
0.062ThrCys: 0.062 ± 0.07
3.938ThrAsp: 3.938 ± 0.605
4.184ThrGlu: 4.184 ± 0.541
3.077ThrPhe: 3.077 ± 0.494
5.107ThrGly: 5.107 ± 0.488
1.292ThrHis: 1.292 ± 0.262
3.754ThrIle: 3.754 ± 0.562
3.692ThrLys: 3.692 ± 0.459
4.984ThrLeu: 4.984 ± 0.81
0.985ThrMet: 0.985 ± 0.201
2.277ThrAsn: 2.277 ± 0.354
3.2ThrPro: 3.2 ± 0.411
2.461ThrGln: 2.461 ± 0.519
3.631ThrArg: 3.631 ± 0.429
3.446ThrSer: 3.446 ± 0.639
4.8ThrThr: 4.8 ± 0.932
5.354ThrVal: 5.354 ± 0.726
0.861ThrTrp: 0.861 ± 0.203
1.846ThrTyr: 1.846 ± 0.438
0.0ThrXaa: 0.0 ± 0.0
Val
7.261ValAla: 7.261 ± 0.555
0.369ValCys: 0.369 ± 0.163
3.507ValAsp: 3.507 ± 0.387
5.538ValGlu: 5.538 ± 0.606
2.646ValPhe: 2.646 ± 0.451
5.6ValGly: 5.6 ± 0.444
1.354ValHis: 1.354 ± 0.314
3.692ValIle: 3.692 ± 0.482
4.984ValLys: 4.984 ± 0.606
5.723ValLeu: 5.723 ± 0.781
2.031ValMet: 2.031 ± 0.291
3.138ValAsn: 3.138 ± 0.429
2.769ValPro: 2.769 ± 0.47
1.785ValGln: 1.785 ± 0.407
4.061ValArg: 4.061 ± 0.558
4.307ValSer: 4.307 ± 0.46
4.307ValThr: 4.307 ± 0.622
4.369ValVal: 4.369 ± 0.489
0.861ValTrp: 0.861 ± 0.249
3.077ValTyr: 3.077 ± 0.639
0.0ValXaa: 0.0 ± 0.0
Trp
1.046TrpAla: 1.046 ± 0.265
0.062TrpCys: 0.062 ± 0.064
1.046TrpAsp: 1.046 ± 0.312
1.108TrpGlu: 1.108 ± 0.263
0.554TrpPhe: 0.554 ± 0.226
1.477TrpGly: 1.477 ± 0.382
0.123TrpHis: 0.123 ± 0.08
0.677TrpIle: 0.677 ± 0.184
0.554TrpLys: 0.554 ± 0.211
1.292TrpLeu: 1.292 ± 0.266
0.308TrpMet: 0.308 ± 0.139
1.292TrpAsn: 1.292 ± 0.318
0.185TrpPro: 0.185 ± 0.124
0.308TrpGln: 0.308 ± 0.147
0.554TrpArg: 0.554 ± 0.219
0.861TrpSer: 0.861 ± 0.361
0.861TrpThr: 0.861 ± 0.274
1.169TrpVal: 1.169 ± 0.295
0.185TrpTrp: 0.185 ± 0.088
0.431TrpTyr: 0.431 ± 0.147
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.015TyrAla: 3.015 ± 0.563
0.369TyrCys: 0.369 ± 0.193
2.4TyrAsp: 2.4 ± 0.516
1.538TyrGlu: 1.538 ± 0.306
1.723TyrPhe: 1.723 ± 0.373
2.338TyrGly: 2.338 ± 0.445
0.738TyrHis: 0.738 ± 0.354
1.723TyrIle: 1.723 ± 0.488
2.154TyrLys: 2.154 ± 0.377
1.969TyrLeu: 1.969 ± 0.389
0.615TyrMet: 0.615 ± 0.232
1.908TyrAsn: 1.908 ± 0.348
1.292TyrPro: 1.292 ± 0.306
1.538TyrGln: 1.538 ± 0.276
1.846TyrArg: 1.846 ± 0.353
2.092TyrSer: 2.092 ± 0.357
2.092TyrThr: 2.092 ± 0.333
2.892TyrVal: 2.892 ± 0.459
0.431TyrTrp: 0.431 ± 0.16
1.354TyrTyr: 1.354 ± 0.309
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (16252 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
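A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions rather than the database's actual procedure: the function names are hypothetical, whole proteins are resampled with replacement, and the reported error is assumed to be the standard deviation across replicates (the source says only that bootstrapping was done 100 times at the protein level).

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate resamples whole proteins with replacement, so the
    resampling unit is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

A dipeptide that never occurs in any protein yields zero in every replicate, which is consistent with the "0.0 ± 0.0" entries in the Xaa rows and columns above.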

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski