Amino acid dipepetide frequency for Streptomyces phage Paedore

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.971AlaAla: 12.971 ± 1.129
0.763AlaCys: 0.763 ± 0.2
6.804AlaAsp: 6.804 ± 0.629
8.711AlaGlu: 8.711 ± 0.81
3.116AlaPhe: 3.116 ± 0.466
9.919AlaGly: 9.919 ± 0.897
2.48AlaHis: 2.48 ± 0.439
4.705AlaIle: 4.705 ± 0.633
5.468AlaLys: 5.468 ± 0.846
10.809AlaLeu: 10.809 ± 1.224
3.497AlaMet: 3.497 ± 0.437
2.543AlaAsn: 2.543 ± 0.375
4.515AlaPro: 4.515 ± 0.581
3.434AlaGln: 3.434 ± 0.431
7.249AlaArg: 7.249 ± 0.686
5.659AlaSer: 5.659 ± 0.635
6.358AlaThr: 6.358 ± 0.68
8.584AlaVal: 8.584 ± 0.71
2.671AlaTrp: 2.671 ± 0.36
3.688AlaTyr: 3.688 ± 0.516
0.0AlaXaa: 0.0 ± 0.0
Cys
0.699CysAla: 0.699 ± 0.212
0.064CysCys: 0.064 ± 0.064
0.572CysAsp: 0.572 ± 0.2
0.445CysGlu: 0.445 ± 0.155
0.318CysPhe: 0.318 ± 0.178
0.699CysGly: 0.699 ± 0.217
0.254CysHis: 0.254 ± 0.132
0.127CysIle: 0.127 ± 0.079
0.509CysLys: 0.509 ± 0.25
0.572CysLeu: 0.572 ± 0.202
0.0CysMet: 0.0 ± 0.0
0.064CysAsn: 0.064 ± 0.06
0.572CysPro: 0.572 ± 0.209
0.254CysGln: 0.254 ± 0.156
0.509CysArg: 0.509 ± 0.16
0.572CysSer: 0.572 ± 0.203
0.509CysThr: 0.509 ± 0.231
0.445CysVal: 0.445 ± 0.163
0.191CysTrp: 0.191 ± 0.104
0.191CysTyr: 0.191 ± 0.119
0.0CysXaa: 0.0 ± 0.0
Asp
6.295AspAla: 6.295 ± 0.743
0.827AspCys: 0.827 ± 0.244
4.197AspAsp: 4.197 ± 0.7
5.15AspGlu: 5.15 ± 0.669
2.098AspPhe: 2.098 ± 0.334
6.613AspGly: 6.613 ± 0.705
1.462AspHis: 1.462 ± 0.325
2.861AspIle: 2.861 ± 0.388
2.162AspLys: 2.162 ± 0.328
4.896AspLeu: 4.896 ± 0.587
1.526AspMet: 1.526 ± 0.29
1.081AspAsn: 1.081 ± 0.197
3.942AspPro: 3.942 ± 0.474
2.035AspGln: 2.035 ± 0.317
4.006AspArg: 4.006 ± 0.516
3.561AspSer: 3.561 ± 0.481
3.306AspThr: 3.306 ± 0.498
4.197AspVal: 4.197 ± 0.416
1.653AspTrp: 1.653 ± 0.303
1.717AspTyr: 1.717 ± 0.321
0.0AspXaa: 0.0 ± 0.0
Glu
9.665GluAla: 9.665 ± 0.906
0.699GluCys: 0.699 ± 0.249
4.451GluAsp: 4.451 ± 0.571
5.341GluGlu: 5.341 ± 0.87
1.971GluPhe: 1.971 ± 0.298
6.931GluGly: 6.931 ± 0.635
1.717GluHis: 1.717 ± 0.363
3.752GluIle: 3.752 ± 0.559
1.908GluLys: 1.908 ± 0.322
6.74GluLeu: 6.74 ± 0.753
1.908GluMet: 1.908 ± 0.338
1.399GluAsn: 1.399 ± 0.327
3.306GluPro: 3.306 ± 0.502
2.988GluGln: 2.988 ± 0.375
5.087GluArg: 5.087 ± 0.623
2.798GluSer: 2.798 ± 0.471
3.497GluThr: 3.497 ± 0.552
4.642GluVal: 4.642 ± 0.659
1.272GluTrp: 1.272 ± 0.304
1.844GluTyr: 1.844 ± 0.391
0.0GluXaa: 0.0 ± 0.0
Phe
3.561PheAla: 3.561 ± 0.441
0.254PheCys: 0.254 ± 0.124
2.543PheAsp: 2.543 ± 0.382
2.098PheGlu: 2.098 ± 0.344
0.763PhePhe: 0.763 ± 0.228
3.688PheGly: 3.688 ± 0.511
0.445PheHis: 0.445 ± 0.136
1.335PheIle: 1.335 ± 0.319
1.145PheLys: 1.145 ± 0.29
2.035PheLeu: 2.035 ± 0.374
0.445PheMet: 0.445 ± 0.175
0.954PheAsn: 0.954 ± 0.209
1.017PhePro: 1.017 ± 0.262
1.272PheGln: 1.272 ± 0.346
1.908PheArg: 1.908 ± 0.426
1.399PheSer: 1.399 ± 0.272
2.098PheThr: 2.098 ± 0.471
2.035PheVal: 2.035 ± 0.293
0.636PheTrp: 0.636 ± 0.174
0.89PheTyr: 0.89 ± 0.239
0.0PheXaa: 0.0 ± 0.0
Gly
7.948GlyAla: 7.948 ± 0.845
0.191GlyCys: 0.191 ± 0.1
5.087GlyAsp: 5.087 ± 0.701
6.74GlyGlu: 6.74 ± 0.614
2.734GlyPhe: 2.734 ± 0.437
7.312GlyGly: 7.312 ± 0.954
2.289GlyHis: 2.289 ± 0.424
3.688GlyIle: 3.688 ± 0.685
5.214GlyLys: 5.214 ± 0.597
7.185GlyLeu: 7.185 ± 0.616
1.78GlyMet: 1.78 ± 0.263
2.543GlyAsn: 2.543 ± 0.434
4.324GlyPro: 4.324 ± 0.563
3.179GlyGln: 3.179 ± 0.364
4.769GlyArg: 4.769 ± 0.552
5.278GlySer: 5.278 ± 0.738
5.723GlyThr: 5.723 ± 1.06
6.168GlyVal: 6.168 ± 0.611
2.353GlyTrp: 2.353 ± 0.42
2.607GlyTyr: 2.607 ± 0.457
0.0GlyXaa: 0.0 ± 0.0
His
2.607HisAla: 2.607 ± 0.539
0.254HisCys: 0.254 ± 0.126
1.462HisAsp: 1.462 ± 0.308
1.462HisGlu: 1.462 ± 0.378
0.954HisPhe: 0.954 ± 0.208
2.098HisGly: 2.098 ± 0.385
0.827HisHis: 0.827 ± 0.206
0.89HisIle: 0.89 ± 0.243
0.509HisLys: 0.509 ± 0.156
1.971HisLeu: 1.971 ± 0.363
0.191HisMet: 0.191 ± 0.108
0.509HisAsn: 0.509 ± 0.157
1.272HisPro: 1.272 ± 0.281
0.699HisGln: 0.699 ± 0.212
1.399HisArg: 1.399 ± 0.294
1.462HisSer: 1.462 ± 0.483
1.017HisThr: 1.017 ± 0.232
1.017HisVal: 1.017 ± 0.27
0.382HisTrp: 0.382 ± 0.164
0.699HisTyr: 0.699 ± 0.196
0.0HisXaa: 0.0 ± 0.0
Ile
3.752IleAla: 3.752 ± 0.435
0.318IleCys: 0.318 ± 0.137
4.069IleAsp: 4.069 ± 0.546
4.197IleGlu: 4.197 ± 0.45
1.59IlePhe: 1.59 ± 0.366
3.052IleGly: 3.052 ± 0.521
0.763IleHis: 0.763 ± 0.169
1.844IleIle: 1.844 ± 0.365
2.035IleLys: 2.035 ± 0.358
2.734IleLeu: 2.734 ± 0.667
0.572IleMet: 0.572 ± 0.2
1.208IleAsn: 1.208 ± 0.303
1.59IlePro: 1.59 ± 0.308
1.272IleGln: 1.272 ± 0.243
3.179IleArg: 3.179 ± 0.381
2.225IleSer: 2.225 ± 0.409
3.243IleThr: 3.243 ± 0.428
3.179IleVal: 3.179 ± 0.393
0.318IleTrp: 0.318 ± 0.174
1.78IleTyr: 1.78 ± 0.291
0.0IleXaa: 0.0 ± 0.0
Lys
5.023LysAla: 5.023 ± 0.609
0.127LysCys: 0.127 ± 0.104
2.543LysAsp: 2.543 ± 0.584
2.671LysGlu: 2.671 ± 0.373
0.954LysPhe: 0.954 ± 0.205
3.752LysGly: 3.752 ± 0.468
0.827LysHis: 0.827 ± 0.258
1.78LysIle: 1.78 ± 0.285
3.243LysLys: 3.243 ± 0.62
3.942LysLeu: 3.942 ± 0.539
0.699LysMet: 0.699 ± 0.225
0.954LysAsn: 0.954 ± 0.231
2.48LysPro: 2.48 ± 0.444
1.653LysGln: 1.653 ± 0.379
3.879LysArg: 3.879 ± 0.695
2.162LysSer: 2.162 ± 0.377
2.289LysThr: 2.289 ± 0.388
2.671LysVal: 2.671 ± 0.411
0.254LysTrp: 0.254 ± 0.126
1.59LysTyr: 1.59 ± 0.303
0.0LysXaa: 0.0 ± 0.0
Leu
11.827LeuAla: 11.827 ± 1.236
0.699LeuCys: 0.699 ± 0.18
5.214LeuAsp: 5.214 ± 0.486
4.832LeuGlu: 4.832 ± 0.624
2.162LeuPhe: 2.162 ± 0.424
7.249LeuGly: 7.249 ± 0.887
1.145LeuHis: 1.145 ± 0.277
3.434LeuIle: 3.434 ± 0.365
2.734LeuLys: 2.734 ± 0.399
5.977LeuLeu: 5.977 ± 0.761
1.78LeuMet: 1.78 ± 0.358
2.734LeuAsn: 2.734 ± 0.385
4.197LeuPro: 4.197 ± 0.451
2.671LeuGln: 2.671 ± 0.42
5.341LeuArg: 5.341 ± 0.744
5.532LeuSer: 5.532 ± 0.626
5.214LeuThr: 5.214 ± 0.506
5.977LeuVal: 5.977 ± 0.771
1.081LeuTrp: 1.081 ± 0.217
1.908LeuTyr: 1.908 ± 0.318
0.0LeuXaa: 0.0 ± 0.0
Met
3.752MetAla: 3.752 ± 0.451
0.0MetCys: 0.0 ± 0.0
1.145MetAsp: 1.145 ± 0.29
1.145MetGlu: 1.145 ± 0.31
0.509MetPhe: 0.509 ± 0.213
0.954MetGly: 0.954 ± 0.284
0.763MetHis: 0.763 ± 0.194
0.954MetIle: 0.954 ± 0.204
0.763MetLys: 0.763 ± 0.178
1.971MetLeu: 1.971 ± 0.342
0.382MetMet: 0.382 ± 0.193
0.763MetAsn: 0.763 ± 0.208
1.272MetPro: 1.272 ± 0.269
0.827MetGln: 0.827 ± 0.233
1.526MetArg: 1.526 ± 0.328
1.908MetSer: 1.908 ± 0.295
2.162MetThr: 2.162 ± 0.45
1.462MetVal: 1.462 ± 0.267
0.254MetTrp: 0.254 ± 0.13
0.127MetTyr: 0.127 ± 0.083
0.0MetXaa: 0.0 ± 0.0
Asn
2.861AsnAla: 2.861 ± 0.444
0.445AsnCys: 0.445 ± 0.166
1.717AsnAsp: 1.717 ± 0.293
2.162AsnGlu: 2.162 ± 0.377
0.89AsnPhe: 0.89 ± 0.227
2.671AsnGly: 2.671 ± 0.456
0.636AsnHis: 0.636 ± 0.213
1.017AsnIle: 1.017 ± 0.197
0.763AsnLys: 0.763 ± 0.229
2.162AsnLeu: 2.162 ± 0.401
0.445AsnMet: 0.445 ± 0.166
0.699AsnAsn: 0.699 ± 0.208
1.653AsnPro: 1.653 ± 0.256
1.081AsnGln: 1.081 ± 0.209
1.272AsnArg: 1.272 ± 0.24
1.526AsnSer: 1.526 ± 0.254
1.59AsnThr: 1.59 ± 0.376
1.653AsnVal: 1.653 ± 0.291
0.572AsnTrp: 0.572 ± 0.182
0.636AsnTyr: 0.636 ± 0.167
0.0AsnXaa: 0.0 ± 0.0
Pro
5.341ProAla: 5.341 ± 0.689
0.445ProCys: 0.445 ± 0.146
2.925ProAsp: 2.925 ± 0.473
3.434ProGlu: 3.434 ± 0.404
1.081ProPhe: 1.081 ± 0.303
4.578ProGly: 4.578 ± 0.55
0.572ProHis: 0.572 ± 0.201
1.78ProIle: 1.78 ± 0.474
2.543ProLys: 2.543 ± 0.456
3.624ProLeu: 3.624 ± 0.485
1.335ProMet: 1.335 ± 0.302
1.526ProAsn: 1.526 ± 0.369
1.653ProPro: 1.653 ± 0.313
1.526ProGln: 1.526 ± 0.285
2.353ProArg: 2.353 ± 0.362
2.861ProSer: 2.861 ± 0.507
3.752ProThr: 3.752 ± 0.555
3.942ProVal: 3.942 ± 0.417
0.954ProTrp: 0.954 ± 0.261
0.954ProTyr: 0.954 ± 0.203
0.0ProXaa: 0.0 ± 0.0
Gln
4.006GlnAla: 4.006 ± 0.495
0.382GlnCys: 0.382 ± 0.162
1.335GlnAsp: 1.335 ± 0.312
2.225GlnGlu: 2.225 ± 0.469
1.208GlnPhe: 1.208 ± 0.257
2.035GlnGly: 2.035 ± 0.437
0.572GlnHis: 0.572 ± 0.167
1.653GlnIle: 1.653 ± 0.339
1.844GlnLys: 1.844 ± 0.324
2.734GlnLeu: 2.734 ± 0.458
1.017GlnMet: 1.017 ± 0.251
1.081GlnAsn: 1.081 ± 0.257
0.89GlnPro: 0.89 ± 0.196
0.699GlnGln: 0.699 ± 0.231
2.48GlnArg: 2.48 ± 0.457
1.971GlnSer: 1.971 ± 0.386
2.162GlnThr: 2.162 ± 0.389
2.734GlnVal: 2.734 ± 0.402
0.445GlnTrp: 0.445 ± 0.155
0.89GlnTyr: 0.89 ± 0.317
0.0GlnXaa: 0.0 ± 0.0
Arg
6.422ArgAla: 6.422 ± 0.594
0.636ArgCys: 0.636 ± 0.227
4.387ArgAsp: 4.387 ± 0.536
4.769ArgGlu: 4.769 ± 0.635
2.607ArgPhe: 2.607 ± 0.271
4.069ArgGly: 4.069 ± 0.515
1.78ArgHis: 1.78 ± 0.337
2.353ArgIle: 2.353 ± 0.349
2.988ArgLys: 2.988 ± 0.423
5.723ArgLeu: 5.723 ± 0.626
1.653ArgMet: 1.653 ± 0.334
1.145ArgAsn: 1.145 ± 0.243
2.988ArgPro: 2.988 ± 0.473
2.543ArgGln: 2.543 ± 0.444
5.786ArgArg: 5.786 ± 0.905
4.324ArgSer: 4.324 ± 0.591
4.324ArgThr: 4.324 ± 0.49
4.832ArgVal: 4.832 ± 0.591
0.89ArgTrp: 0.89 ± 0.256
2.289ArgTyr: 2.289 ± 0.337
0.0ArgXaa: 0.0 ± 0.0
Ser
6.422SerAla: 6.422 ± 0.727
0.254SerCys: 0.254 ± 0.134
3.434SerAsp: 3.434 ± 0.558
3.434SerGlu: 3.434 ± 0.52
2.162SerPhe: 2.162 ± 0.418
5.532SerGly: 5.532 ± 0.701
1.717SerHis: 1.717 ± 0.485
2.734SerIle: 2.734 ± 0.506
2.353SerLys: 2.353 ± 0.411
4.642SerLeu: 4.642 ± 0.621
1.399SerMet: 1.399 ± 0.363
2.035SerAsn: 2.035 ± 0.408
2.607SerPro: 2.607 ± 0.439
1.208SerGln: 1.208 ± 0.258
3.815SerArg: 3.815 ± 0.61
4.006SerSer: 4.006 ± 0.575
4.578SerThr: 4.578 ± 0.531
3.752SerVal: 3.752 ± 0.472
0.827SerTrp: 0.827 ± 0.195
1.78SerTyr: 1.78 ± 0.348
0.0SerXaa: 0.0 ± 0.0
Thr
7.122ThrAla: 7.122 ± 0.653
0.445ThrCys: 0.445 ± 0.186
3.942ThrAsp: 3.942 ± 0.496
4.26ThrGlu: 4.26 ± 0.676
2.225ThrPhe: 2.225 ± 0.444
5.85ThrGly: 5.85 ± 0.819
0.89ThrHis: 0.89 ± 0.204
3.434ThrIle: 3.434 ± 0.491
2.035ThrLys: 2.035 ± 0.316
5.023ThrLeu: 5.023 ± 0.596
0.763ThrMet: 0.763 ± 0.208
1.335ThrAsn: 1.335 ± 0.32
3.688ThrPro: 3.688 ± 0.58
1.272ThrGln: 1.272 ± 0.311
3.752ThrArg: 3.752 ± 0.61
4.896ThrSer: 4.896 ± 0.573
4.387ThrThr: 4.387 ± 0.794
5.595ThrVal: 5.595 ± 0.56
1.145ThrTrp: 1.145 ± 0.348
2.289ThrTyr: 2.289 ± 0.318
0.0ThrXaa: 0.0 ± 0.0
Val
8.52ValAla: 8.52 ± 0.689
0.318ValCys: 0.318 ± 0.16
4.324ValAsp: 4.324 ± 0.599
5.468ValGlu: 5.468 ± 0.639
1.653ValPhe: 1.653 ± 0.335
6.168ValGly: 6.168 ± 0.621
1.59ValHis: 1.59 ± 0.238
2.988ValIle: 2.988 ± 0.361
3.243ValLys: 3.243 ± 0.442
5.278ValLeu: 5.278 ± 0.473
1.908ValMet: 1.908 ± 0.329
2.162ValAsn: 2.162 ± 0.438
3.37ValPro: 3.37 ± 0.483
2.798ValGln: 2.798 ± 0.366
4.578ValArg: 4.578 ± 0.551
3.37ValSer: 3.37 ± 0.496
5.341ValThr: 5.341 ± 0.555
5.85ValVal: 5.85 ± 0.585
1.145ValTrp: 1.145 ± 0.236
1.335ValTyr: 1.335 ± 0.294
0.0ValXaa: 0.0 ± 0.0
Trp
2.225TrpAla: 2.225 ± 0.349
0.254TrpCys: 0.254 ± 0.12
1.081TrpAsp: 1.081 ± 0.323
1.399TrpGlu: 1.399 ± 0.29
0.509TrpPhe: 0.509 ± 0.198
1.335TrpGly: 1.335 ± 0.263
0.382TrpHis: 0.382 ± 0.137
0.318TrpIle: 0.318 ± 0.161
1.081TrpLys: 1.081 ± 0.25
1.272TrpLeu: 1.272 ± 0.264
0.636TrpMet: 0.636 ± 0.189
0.954TrpAsn: 0.954 ± 0.265
0.636TrpPro: 0.636 ± 0.174
0.254TrpGln: 0.254 ± 0.148
1.653TrpArg: 1.653 ± 0.357
1.208TrpSer: 1.208 ± 0.331
1.208TrpThr: 1.208 ± 0.306
0.89TrpVal: 0.89 ± 0.222
0.191TrpTrp: 0.191 ± 0.136
0.318TrpTyr: 0.318 ± 0.13
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.925TyrAla: 2.925 ± 0.353
0.191TyrCys: 0.191 ± 0.11
2.353TyrAsp: 2.353 ± 0.504
1.971TyrGlu: 1.971 ± 0.35
1.017TyrPhe: 1.017 ± 0.286
2.607TyrGly: 2.607 ± 0.464
0.572TyrHis: 0.572 ± 0.246
1.272TyrIle: 1.272 ± 0.235
0.954TyrLys: 0.954 ± 0.25
2.416TyrLeu: 2.416 ± 0.533
0.636TyrMet: 0.636 ± 0.201
0.827TyrAsn: 0.827 ± 0.214
1.208TyrPro: 1.208 ± 0.245
0.699TyrGln: 0.699 ± 0.215
1.971TyrArg: 1.971 ± 0.371
1.971TyrSer: 1.971 ± 0.457
1.59TyrThr: 1.59 ± 0.276
1.78TyrVal: 1.78 ± 0.325
0.572TyrTrp: 0.572 ± 0.18
0.636TyrTyr: 0.636 ± 0.227
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (15728 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski