Amino acid dipepetide frequency for Streptomyces phage AbbeyMikolon

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.068AlaAla: 12.068 ± 1.238
0.473AlaCys: 0.473 ± 0.181
7.888AlaAsp: 7.888 ± 0.774
7.336AlaGlu: 7.336 ± 0.827
2.366AlaPhe: 2.366 ± 0.373
8.913AlaGly: 8.913 ± 0.923
1.893AlaHis: 1.893 ± 0.436
4.023AlaIle: 4.023 ± 0.54
6.547AlaLys: 6.547 ± 1.092
12.936AlaLeu: 12.936 ± 1.119
3.313AlaMet: 3.313 ± 0.508
3.392AlaAsn: 3.392 ± 0.428
4.969AlaPro: 4.969 ± 0.583
3.392AlaGln: 3.392 ± 0.483
7.099AlaArg: 7.099 ± 0.661
6.862AlaSer: 6.862 ± 0.765
7.099AlaThr: 7.099 ± 1.153
9.702AlaVal: 9.702 ± 0.886
2.603AlaTrp: 2.603 ± 0.426
2.918AlaTyr: 2.918 ± 0.487
0.0AlaXaa: 0.0 ± 0.0
Cys
0.789CysAla: 0.789 ± 0.223
0.237CysCys: 0.237 ± 0.158
0.789CysAsp: 0.789 ± 0.263
0.237CysGlu: 0.237 ± 0.123
0.158CysPhe: 0.158 ± 0.124
0.947CysGly: 0.947 ± 0.282
0.237CysHis: 0.237 ± 0.176
0.158CysIle: 0.158 ± 0.104
0.394CysLys: 0.394 ± 0.16
0.789CysLeu: 0.789 ± 0.291
0.158CysMet: 0.158 ± 0.095
0.079CysAsn: 0.079 ± 0.09
0.789CysPro: 0.789 ± 0.273
0.789CysGln: 0.789 ± 0.233
0.473CysArg: 0.473 ± 0.165
0.158CysSer: 0.158 ± 0.103
0.394CysThr: 0.394 ± 0.158
0.158CysVal: 0.158 ± 0.104
0.079CysTrp: 0.079 ± 0.083
0.158CysTyr: 0.158 ± 0.118
0.0CysXaa: 0.0 ± 0.0
Asp
7.336AspAla: 7.336 ± 0.736
0.789AspCys: 0.789 ± 0.25
3.076AspAsp: 3.076 ± 0.533
2.84AspGlu: 2.84 ± 0.622
1.183AspPhe: 1.183 ± 0.278
6.941AspGly: 6.941 ± 0.74
1.025AspHis: 1.025 ± 0.26
2.051AspIle: 2.051 ± 0.413
1.42AspLys: 1.42 ± 0.302
6.31AspLeu: 6.31 ± 0.835
1.262AspMet: 1.262 ± 0.289
1.893AspAsn: 1.893 ± 0.287
4.338AspPro: 4.338 ± 0.634
2.366AspGln: 2.366 ± 0.305
4.654AspArg: 4.654 ± 0.697
2.366AspSer: 2.366 ± 0.551
3.865AspThr: 3.865 ± 0.425
5.6AspVal: 5.6 ± 0.639
1.262AspTrp: 1.262 ± 0.267
1.656AspTyr: 1.656 ± 0.311
0.0AspXaa: 0.0 ± 0.0
Glu
8.992GluAla: 8.992 ± 0.935
0.631GluCys: 0.631 ± 0.212
4.338GluAsp: 4.338 ± 0.847
3.865GluGlu: 3.865 ± 0.613
2.84GluPhe: 2.84 ± 0.498
6.231GluGly: 6.231 ± 0.891
0.868GluHis: 0.868 ± 0.221
2.445GluIle: 2.445 ± 0.446
1.025GluLys: 1.025 ± 0.307
4.811GluLeu: 4.811 ± 0.752
1.972GluMet: 1.972 ± 0.35
1.262GluAsn: 1.262 ± 0.332
2.13GluPro: 2.13 ± 0.337
0.552GluGln: 0.552 ± 0.176
5.521GluArg: 5.521 ± 0.75
3.155GluSer: 3.155 ± 0.568
5.285GluThr: 5.285 ± 0.747
5.127GluVal: 5.127 ± 0.684
0.868GluTrp: 0.868 ± 0.238
1.656GluTyr: 1.656 ± 0.311
0.0GluXaa: 0.0 ± 0.0
Phe
2.682PheAla: 2.682 ± 0.521
0.158PheCys: 0.158 ± 0.102
1.499PheAsp: 1.499 ± 0.318
1.341PheGlu: 1.341 ± 0.315
0.316PhePhe: 0.316 ± 0.219
2.918PheGly: 2.918 ± 0.491
0.158PheHis: 0.158 ± 0.099
1.025PheIle: 1.025 ± 0.288
1.578PheLys: 1.578 ± 0.34
1.735PheLeu: 1.735 ± 0.381
1.104PheMet: 1.104 ± 0.269
0.868PheAsn: 0.868 ± 0.39
1.893PhePro: 1.893 ± 0.46
1.025PheGln: 1.025 ± 0.288
3.313PheArg: 3.313 ± 0.411
1.104PheSer: 1.104 ± 0.308
1.972PheThr: 1.972 ± 0.384
1.735PheVal: 1.735 ± 0.358
0.158PheTrp: 0.158 ± 0.115
1.262PheTyr: 1.262 ± 0.42
0.0PheXaa: 0.0 ± 0.0
Gly
10.017GlyAla: 10.017 ± 0.768
0.473GlyCys: 0.473 ± 0.186
5.127GlyAsp: 5.127 ± 0.579
5.995GlyGlu: 5.995 ± 0.715
2.682GlyPhe: 2.682 ± 0.59
7.099GlyGly: 7.099 ± 1.053
1.104GlyHis: 1.104 ± 0.229
3.234GlyIle: 3.234 ± 0.708
4.496GlyLys: 4.496 ± 0.787
7.73GlyLeu: 7.73 ± 0.809
2.051GlyMet: 2.051 ± 0.331
1.814GlyAsn: 1.814 ± 0.345
3.944GlyPro: 3.944 ± 0.663
3.234GlyGln: 3.234 ± 0.365
6.31GlyArg: 6.31 ± 0.489
5.521GlySer: 5.521 ± 0.692
6.231GlyThr: 6.231 ± 0.714
7.572GlyVal: 7.572 ± 0.722
2.209GlyTrp: 2.209 ± 0.463
2.761GlyTyr: 2.761 ± 0.506
0.0GlyXaa: 0.0 ± 0.0
His
2.051HisAla: 2.051 ± 0.379
0.079HisCys: 0.079 ± 0.086
0.789HisAsp: 0.789 ± 0.246
0.947HisGlu: 0.947 ± 0.236
0.473HisPhe: 0.473 ± 0.172
1.578HisGly: 1.578 ± 0.384
0.552HisHis: 0.552 ± 0.226
1.104HisIle: 1.104 ± 0.272
0.552HisLys: 0.552 ± 0.201
1.104HisLeu: 1.104 ± 0.357
0.394HisMet: 0.394 ± 0.24
0.71HisAsn: 0.71 ± 0.221
0.789HisPro: 0.789 ± 0.25
0.473HisGln: 0.473 ± 0.165
0.789HisArg: 0.789 ± 0.299
0.552HisSer: 0.552 ± 0.197
1.499HisThr: 1.499 ± 0.301
1.578HisVal: 1.578 ± 0.351
0.316HisTrp: 0.316 ± 0.16
0.158HisTyr: 0.158 ± 0.105
0.0HisXaa: 0.0 ± 0.0
Ile
3.786IleAla: 3.786 ± 0.66
0.158IleCys: 0.158 ± 0.098
2.13IleAsp: 2.13 ± 0.505
0.868IleGlu: 0.868 ± 0.245
1.025IlePhe: 1.025 ± 0.333
2.761IleGly: 2.761 ± 0.52
1.183IleHis: 1.183 ± 0.332
0.789IleIle: 0.789 ± 0.201
1.578IleLys: 1.578 ± 0.304
2.84IleLeu: 2.84 ± 0.494
0.552IleMet: 0.552 ± 0.188
0.631IleAsn: 0.631 ± 0.209
1.735IlePro: 1.735 ± 0.438
0.947IleGln: 0.947 ± 0.285
4.654IleArg: 4.654 ± 0.674
1.499IleSer: 1.499 ± 0.372
3.313IleThr: 3.313 ± 0.529
2.761IleVal: 2.761 ± 0.545
0.394IleTrp: 0.394 ± 0.138
0.552IleTyr: 0.552 ± 0.207
0.0IleXaa: 0.0 ± 0.0
Lys
6.31LysAla: 6.31 ± 1.024
0.237LysCys: 0.237 ± 0.119
2.918LysAsp: 2.918 ± 0.67
3.628LysGlu: 3.628 ± 0.576
1.183LysPhe: 1.183 ± 0.342
4.496LysGly: 4.496 ± 0.713
0.71LysHis: 0.71 ± 0.263
1.025LysIle: 1.025 ± 0.362
1.499LysLys: 1.499 ± 0.367
3.549LysLeu: 3.549 ± 0.591
1.104LysMet: 1.104 ± 0.351
1.025LysAsn: 1.025 ± 0.21
2.84LysPro: 2.84 ± 0.473
0.631LysGln: 0.631 ± 0.254
2.84LysArg: 2.84 ± 0.451
2.209LysSer: 2.209 ± 0.412
2.603LysThr: 2.603 ± 0.427
3.471LysVal: 3.471 ± 0.646
0.71LysTrp: 0.71 ± 0.226
1.104LysTyr: 1.104 ± 0.258
0.0LysXaa: 0.0 ± 0.0
Leu
10.727LeuAla: 10.727 ± 1.106
1.104LeuCys: 1.104 ± 0.263
4.496LeuAsp: 4.496 ± 0.698
6.626LeuGlu: 6.626 ± 0.887
1.814LeuPhe: 1.814 ± 0.353
6.705LeuGly: 6.705 ± 0.709
1.578LeuHis: 1.578 ± 0.353
3.076LeuIle: 3.076 ± 0.494
3.392LeuLys: 3.392 ± 0.613
5.995LeuLeu: 5.995 ± 0.601
1.656LeuMet: 1.656 ± 0.529
1.578LeuAsn: 1.578 ± 0.321
4.417LeuPro: 4.417 ± 0.619
3.155LeuGln: 3.155 ± 0.447
6.468LeuArg: 6.468 ± 0.757
4.496LeuSer: 4.496 ± 0.514
6.31LeuThr: 6.31 ± 0.725
7.336LeuVal: 7.336 ± 0.721
1.972LeuTrp: 1.972 ± 0.383
2.051LeuTyr: 2.051 ± 0.448
0.0LeuXaa: 0.0 ± 0.0
Met
4.102MetAla: 4.102 ± 0.532
0.316MetCys: 0.316 ± 0.152
1.893MetAsp: 1.893 ± 0.428
1.578MetGlu: 1.578 ± 0.338
0.552MetPhe: 0.552 ± 0.188
1.656MetGly: 1.656 ± 0.365
0.237MetHis: 0.237 ± 0.125
0.552MetIle: 0.552 ± 0.205
1.104MetLys: 1.104 ± 0.313
1.341MetLeu: 1.341 ± 0.385
0.079MetMet: 0.079 ± 0.073
0.631MetAsn: 0.631 ± 0.228
0.868MetPro: 0.868 ± 0.265
0.394MetGln: 0.394 ± 0.159
1.656MetArg: 1.656 ± 0.343
1.42MetSer: 1.42 ± 0.292
1.893MetThr: 1.893 ± 0.451
2.603MetVal: 2.603 ± 0.386
0.394MetTrp: 0.394 ± 0.179
0.079MetTyr: 0.079 ± 0.076
0.0MetXaa: 0.0 ± 0.0
Asn
3.865AsnAla: 3.865 ± 0.378
0.0AsnCys: 0.0 ± 0.0
1.183AsnAsp: 1.183 ± 0.285
1.025AsnGlu: 1.025 ± 0.246
0.316AsnPhe: 0.316 ± 0.182
3.707AsnGly: 3.707 ± 0.671
0.394AsnHis: 0.394 ± 0.194
0.789AsnIle: 0.789 ± 0.226
1.499AsnLys: 1.499 ± 0.433
2.209AsnLeu: 2.209 ± 0.453
0.158AsnMet: 0.158 ± 0.11
0.394AsnAsn: 0.394 ± 0.15
3.234AsnPro: 3.234 ± 0.511
0.71AsnGln: 0.71 ± 0.171
1.499AsnArg: 1.499 ± 0.388
0.868AsnSer: 0.868 ± 0.265
1.42AsnThr: 1.42 ± 0.354
2.13AsnVal: 2.13 ± 0.33
0.394AsnTrp: 0.394 ± 0.186
0.552AsnTyr: 0.552 ± 0.182
0.0AsnXaa: 0.0 ± 0.0
Pro
5.285ProAla: 5.285 ± 0.568
0.473ProCys: 0.473 ± 0.224
3.865ProAsp: 3.865 ± 0.576
4.18ProGlu: 4.18 ± 0.53
2.13ProPhe: 2.13 ± 0.356
5.206ProGly: 5.206 ± 0.649
0.789ProHis: 0.789 ± 0.336
1.104ProIle: 1.104 ± 0.28
2.761ProLys: 2.761 ± 0.6
4.102ProLeu: 4.102 ± 0.688
1.42ProMet: 1.42 ± 0.305
1.893ProAsn: 1.893 ± 0.549
2.209ProPro: 2.209 ± 0.474
1.341ProGln: 1.341 ± 0.313
2.682ProArg: 2.682 ± 0.516
2.84ProSer: 2.84 ± 0.458
2.84ProThr: 2.84 ± 0.362
4.969ProVal: 4.969 ± 0.441
0.71ProTrp: 0.71 ± 0.201
1.578ProTyr: 1.578 ± 0.32
0.0ProXaa: 0.0 ± 0.0
Gln
3.865GlnAla: 3.865 ± 0.542
0.552GlnCys: 0.552 ± 0.186
1.42GlnAsp: 1.42 ± 0.379
2.524GlnGlu: 2.524 ± 0.39
0.71GlnPhe: 0.71 ± 0.231
3.471GlnGly: 3.471 ± 0.609
0.394GlnHis: 0.394 ± 0.152
0.473GlnIle: 0.473 ± 0.225
0.631GlnLys: 0.631 ± 0.264
1.735GlnLeu: 1.735 ± 0.328
0.789GlnMet: 0.789 ± 0.281
0.789GlnAsn: 0.789 ± 0.33
0.868GlnPro: 0.868 ± 0.284
0.237GlnGln: 0.237 ± 0.125
3.313GlnArg: 3.313 ± 0.537
0.947GlnSer: 0.947 ± 0.237
2.524GlnThr: 2.524 ± 0.459
1.814GlnVal: 1.814 ± 0.415
0.789GlnTrp: 0.789 ± 0.252
1.025GlnTyr: 1.025 ± 0.284
0.0GlnXaa: 0.0 ± 0.0
Arg
7.178ArgAla: 7.178 ± 0.867
0.473ArgCys: 0.473 ± 0.204
4.338ArgAsp: 4.338 ± 0.625
4.733ArgGlu: 4.733 ± 0.824
1.578ArgPhe: 1.578 ± 0.375
6.152ArgGly: 6.152 ± 0.677
1.341ArgHis: 1.341 ± 0.4
3.786ArgIle: 3.786 ± 0.69
3.944ArgLys: 3.944 ± 0.542
7.099ArgLeu: 7.099 ± 0.872
2.051ArgMet: 2.051 ± 0.401
1.814ArgAsn: 1.814 ± 0.399
3.944ArgPro: 3.944 ± 0.622
2.524ArgGln: 2.524 ± 0.478
7.099ArgArg: 7.099 ± 1.066
3.234ArgSer: 3.234 ± 0.55
4.417ArgThr: 4.417 ± 0.456
4.811ArgVal: 4.811 ± 0.853
1.893ArgTrp: 1.893 ± 0.436
2.13ArgTyr: 2.13 ± 0.331
0.0ArgXaa: 0.0 ± 0.0
Ser
5.679SerAla: 5.679 ± 0.651
0.237SerCys: 0.237 ± 0.133
2.84SerAsp: 2.84 ± 0.503
3.234SerGlu: 3.234 ± 0.578
1.656SerPhe: 1.656 ± 0.343
5.916SerGly: 5.916 ± 0.794
0.947SerHis: 0.947 ± 0.228
2.287SerIle: 2.287 ± 0.527
3.313SerLys: 3.313 ± 0.488
5.364SerLeu: 5.364 ± 0.649
1.499SerMet: 1.499 ± 0.369
1.499SerAsn: 1.499 ± 0.312
1.814SerPro: 1.814 ± 0.399
0.947SerGln: 0.947 ± 0.328
3.155SerArg: 3.155 ± 0.402
2.366SerSer: 2.366 ± 0.518
3.707SerThr: 3.707 ± 0.688
3.313SerVal: 3.313 ± 0.524
1.262SerTrp: 1.262 ± 0.437
1.262SerTyr: 1.262 ± 0.285
0.0SerXaa: 0.0 ± 0.0
Thr
7.099ThrAla: 7.099 ± 0.699
0.394ThrCys: 0.394 ± 0.165
5.521ThrAsp: 5.521 ± 0.662
4.259ThrGlu: 4.259 ± 0.427
2.997ThrPhe: 2.997 ± 0.554
5.916ThrGly: 5.916 ± 0.676
1.183ThrHis: 1.183 ± 0.314
1.893ThrIle: 1.893 ± 0.406
2.524ThrLys: 2.524 ± 0.434
5.521ThrLeu: 5.521 ± 0.654
0.789ThrMet: 0.789 ± 0.312
1.893ThrAsn: 1.893 ± 0.376
4.102ThrPro: 4.102 ± 0.667
1.972ThrGln: 1.972 ± 0.376
4.102ThrArg: 4.102 ± 0.627
3.865ThrSer: 3.865 ± 0.554
4.023ThrThr: 4.023 ± 0.667
6.074ThrVal: 6.074 ± 0.839
1.42ThrTrp: 1.42 ± 0.355
3.234ThrTyr: 3.234 ± 0.566
0.0ThrXaa: 0.0 ± 0.0
Val
8.834ValAla: 8.834 ± 0.799
0.473ValCys: 0.473 ± 0.189
5.206ValAsp: 5.206 ± 0.562
5.679ValGlu: 5.679 ± 0.644
2.366ValPhe: 2.366 ± 0.378
4.417ValGly: 4.417 ± 0.635
1.341ValHis: 1.341 ± 0.322
3.313ValIle: 3.313 ± 0.532
3.865ValLys: 3.865 ± 0.555
5.521ValLeu: 5.521 ± 0.79
2.13ValMet: 2.13 ± 0.343
2.918ValAsn: 2.918 ± 0.564
4.89ValPro: 4.89 ± 0.463
2.997ValGln: 2.997 ± 0.54
5.206ValArg: 5.206 ± 0.629
5.679ValSer: 5.679 ± 0.746
6.389ValThr: 6.389 ± 0.867
4.575ValVal: 4.575 ± 0.617
1.341ValTrp: 1.341 ± 0.327
2.603ValTyr: 2.603 ± 0.515
0.0ValXaa: 0.0 ± 0.0
Trp
1.735TrpAla: 1.735 ± 0.311
0.394TrpCys: 0.394 ± 0.164
1.341TrpAsp: 1.341 ± 0.294
1.578TrpGlu: 1.578 ± 0.281
0.71TrpPhe: 0.71 ± 0.214
1.893TrpGly: 1.893 ± 0.364
0.552TrpHis: 0.552 ± 0.178
0.237TrpIle: 0.237 ± 0.136
0.947TrpLys: 0.947 ± 0.36
1.972TrpLeu: 1.972 ± 0.55
0.394TrpMet: 0.394 ± 0.151
0.552TrpAsn: 0.552 ± 0.232
0.552TrpPro: 0.552 ± 0.219
0.473TrpGln: 0.473 ± 0.166
1.893TrpArg: 1.893 ± 0.345
1.341TrpSer: 1.341 ± 0.286
0.789TrpThr: 0.789 ± 0.21
1.262TrpVal: 1.262 ± 0.359
0.394TrpTrp: 0.394 ± 0.17
0.316TrpTyr: 0.316 ± 0.169
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.471TyrAla: 3.471 ± 0.601
0.316TyrCys: 0.316 ± 0.156
1.578TyrAsp: 1.578 ± 0.314
1.104TyrGlu: 1.104 ± 0.301
0.789TyrPhe: 0.789 ± 0.272
2.682TyrGly: 2.682 ± 0.499
0.0TyrHis: 0.0 ± 0.0
0.789TyrIle: 0.789 ± 0.26
1.025TyrLys: 1.025 ± 0.291
2.287TyrLeu: 2.287 ± 0.403
0.394TyrMet: 0.394 ± 0.153
0.71TyrAsn: 0.71 ± 0.227
2.051TyrPro: 2.051 ± 0.481
0.71TyrGln: 0.71 ± 0.225
1.814TyrArg: 1.814 ± 0.314
1.814TyrSer: 1.814 ± 0.528
2.209TyrThr: 2.209 ± 0.371
3.155TyrVal: 3.155 ± 0.691
0.158TyrTrp: 0.158 ± 0.097
0.631TyrTyr: 0.631 ± 0.207
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (12679 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski