Amino acid dipepetide frequency for Synechococcus phage S-H1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.747AlaAla: 13.747 ± 1.232
1.718AlaCys: 1.718 ± 0.457
7.303AlaAsp: 7.303 ± 1.052
7.904AlaGlu: 7.904 ± 0.928
4.124AlaPhe: 4.124 ± 0.652
9.537AlaGly: 9.537 ± 0.87
1.117AlaHis: 1.117 ± 0.279
5.842AlaIle: 5.842 ± 0.681
4.897AlaLys: 4.897 ± 0.573
7.99AlaLeu: 7.99 ± 0.849
2.492AlaMet: 2.492 ± 0.428
3.78AlaAsn: 3.78 ± 0.703
3.952AlaPro: 3.952 ± 0.719
3.265AlaGln: 3.265 ± 0.714
5.757AlaArg: 5.757 ± 0.782
7.733AlaSer: 7.733 ± 0.861
7.389AlaThr: 7.389 ± 0.763
7.217AlaVal: 7.217 ± 0.72
0.945AlaTrp: 0.945 ± 0.391
2.492AlaTyr: 2.492 ± 0.464
0.0AlaXaa: 0.0 ± 0.0
Cys
1.117CysAla: 1.117 ± 0.285
0.773CysCys: 0.773 ± 0.297
1.289CysAsp: 1.289 ± 0.352
0.773CysGlu: 0.773 ± 0.266
0.344CysPhe: 0.344 ± 0.161
1.804CysGly: 1.804 ± 0.411
0.344CysHis: 0.344 ± 0.141
1.289CysIle: 1.289 ± 0.361
0.687CysLys: 0.687 ± 0.239
0.773CysLeu: 0.773 ± 0.242
0.172CysMet: 0.172 ± 0.122
0.601CysAsn: 0.601 ± 0.246
1.031CysPro: 1.031 ± 0.261
0.859CysGln: 0.859 ± 0.295
1.289CysArg: 1.289 ± 0.347
0.773CysSer: 0.773 ± 0.311
0.258CysThr: 0.258 ± 0.163
0.601CysVal: 0.601 ± 0.202
0.258CysTrp: 0.258 ± 0.136
0.516CysTyr: 0.516 ± 0.217
0.0CysXaa: 0.0 ± 0.0
Asp
7.647AspAla: 7.647 ± 0.768
0.945AspCys: 0.945 ± 0.298
3.437AspAsp: 3.437 ± 0.632
3.609AspGlu: 3.609 ± 0.47
2.578AspPhe: 2.578 ± 0.629
5.928AspGly: 5.928 ± 0.902
1.547AspHis: 1.547 ± 0.375
3.265AspIle: 3.265 ± 0.533
1.89AspLys: 1.89 ± 0.431
7.303AspLeu: 7.303 ± 0.709
0.859AspMet: 0.859 ± 0.24
2.062AspAsn: 2.062 ± 0.336
2.663AspPro: 2.663 ± 0.4
2.32AspGln: 2.32 ± 0.384
4.124AspArg: 4.124 ± 0.637
2.578AspSer: 2.578 ± 0.409
2.921AspThr: 2.921 ± 0.526
2.921AspVal: 2.921 ± 0.696
0.687AspTrp: 0.687 ± 0.27
1.89AspTyr: 1.89 ± 0.385
0.0AspXaa: 0.0 ± 0.0
Glu
5.757GluAla: 5.757 ± 0.661
1.031GluCys: 1.031 ± 0.285
2.578GluAsp: 2.578 ± 0.462
2.921GluGlu: 2.921 ± 0.502
1.117GluPhe: 1.117 ± 0.303
3.523GluGly: 3.523 ± 0.522
1.289GluHis: 1.289 ± 0.357
3.78GluIle: 3.78 ± 0.621
1.89GluLys: 1.89 ± 0.393
7.389GluLeu: 7.389 ± 1.074
1.117GluMet: 1.117 ± 0.289
1.375GluAsn: 1.375 ± 0.306
2.578GluPro: 2.578 ± 0.475
4.21GluGln: 4.21 ± 0.702
4.983GluArg: 4.983 ± 0.688
3.351GluSer: 3.351 ± 0.429
3.351GluThr: 3.351 ± 0.434
3.78GluVal: 3.78 ± 0.65
1.203GluTrp: 1.203 ± 0.333
1.117GluTyr: 1.117 ± 0.321
0.0GluXaa: 0.0 ± 0.0
Phe
2.663PheAla: 2.663 ± 0.412
0.258PheCys: 0.258 ± 0.152
3.093PheAsp: 3.093 ± 0.514
2.406PheGlu: 2.406 ± 0.508
0.516PhePhe: 0.516 ± 0.232
3.265PheGly: 3.265 ± 0.71
0.43PheHis: 0.43 ± 0.19
1.632PheIle: 1.632 ± 0.337
1.117PheLys: 1.117 ± 0.275
1.89PheLeu: 1.89 ± 0.354
0.601PheMet: 0.601 ± 0.222
1.117PheAsn: 1.117 ± 0.254
1.547PhePro: 1.547 ± 0.344
0.859PheGln: 0.859 ± 0.416
2.062PheArg: 2.062 ± 0.383
2.062PheSer: 2.062 ± 0.465
2.492PheThr: 2.492 ± 0.545
2.062PheVal: 2.062 ± 0.394
0.773PheTrp: 0.773 ± 0.229
0.945PheTyr: 0.945 ± 0.261
0.0PheXaa: 0.0 ± 0.0
Gly
8.678GlyAla: 8.678 ± 0.884
0.687GlyCys: 0.687 ± 0.23
5.155GlyAsp: 5.155 ± 0.646
4.554GlyGlu: 4.554 ± 0.611
3.609GlyPhe: 3.609 ± 0.502
6.788GlyGly: 6.788 ± 1.399
1.461GlyHis: 1.461 ± 0.357
3.78GlyIle: 3.78 ± 0.584
4.811GlyLys: 4.811 ± 0.657
5.585GlyLeu: 5.585 ± 0.738
1.117GlyMet: 1.117 ± 0.256
2.406GlyAsn: 2.406 ± 0.384
1.89GlyPro: 1.89 ± 0.394
3.093GlyGln: 3.093 ± 0.531
4.725GlyArg: 4.725 ± 0.574
7.045GlySer: 7.045 ± 1.015
4.983GlyThr: 4.983 ± 1.067
5.241GlyVal: 5.241 ± 0.644
1.031GlyTrp: 1.031 ± 0.297
2.32GlyTyr: 2.32 ± 0.368
0.0GlyXaa: 0.0 ± 0.0
His
1.976HisAla: 1.976 ± 0.389
0.516HisCys: 0.516 ± 0.214
0.687HisAsp: 0.687 ± 0.232
0.687HisGlu: 0.687 ± 0.231
0.687HisPhe: 0.687 ± 0.312
1.203HisGly: 1.203 ± 0.311
0.172HisHis: 0.172 ± 0.115
0.773HisIle: 0.773 ± 0.231
0.258HisLys: 0.258 ± 0.15
1.718HisLeu: 1.718 ± 0.416
0.516HisMet: 0.516 ± 0.226
0.859HisAsn: 0.859 ± 0.262
1.632HisPro: 1.632 ± 0.415
1.031HisGln: 1.031 ± 0.311
1.804HisArg: 1.804 ± 0.405
0.773HisSer: 0.773 ± 0.253
0.773HisThr: 0.773 ± 0.233
1.117HisVal: 1.117 ± 0.277
0.773HisTrp: 0.773 ± 0.246
1.117HisTyr: 1.117 ± 0.319
0.0HisXaa: 0.0 ± 0.0
Ile
5.842IleAla: 5.842 ± 0.828
0.773IleCys: 0.773 ± 0.266
4.382IleAsp: 4.382 ± 0.576
3.866IleGlu: 3.866 ± 0.567
1.117IlePhe: 1.117 ± 0.274
4.897IleGly: 4.897 ± 0.807
1.203IleHis: 1.203 ± 0.293
2.406IleIle: 2.406 ± 0.467
2.578IleLys: 2.578 ± 0.36
3.351IleLeu: 3.351 ± 0.542
0.859IleMet: 0.859 ± 0.303
1.976IleAsn: 1.976 ± 0.415
2.663IlePro: 2.663 ± 0.454
2.234IleGln: 2.234 ± 0.548
3.523IleArg: 3.523 ± 0.661
2.835IleSer: 2.835 ± 0.525
4.554IleThr: 4.554 ± 0.61
2.32IleVal: 2.32 ± 0.417
0.859IleTrp: 0.859 ± 0.307
1.203IleTyr: 1.203 ± 0.31
0.0IleXaa: 0.0 ± 0.0
Lys
4.21LysAla: 4.21 ± 0.692
0.516LysCys: 0.516 ± 0.28
2.234LysAsp: 2.234 ± 0.441
2.749LysGlu: 2.749 ± 0.515
1.031LysPhe: 1.031 ± 0.377
2.749LysGly: 2.749 ± 0.365
0.773LysHis: 0.773 ± 0.238
1.804LysIle: 1.804 ± 0.431
1.203LysLys: 1.203 ± 0.308
4.296LysLeu: 4.296 ± 0.57
0.945LysMet: 0.945 ± 0.249
1.031LysAsn: 1.031 ± 0.318
2.663LysPro: 2.663 ± 0.503
1.804LysGln: 1.804 ± 0.335
3.952LysArg: 3.952 ± 0.654
1.976LysSer: 1.976 ± 0.361
1.718LysThr: 1.718 ± 0.365
2.406LysVal: 2.406 ± 0.407
0.859LysTrp: 0.859 ± 0.258
0.859LysTyr: 0.859 ± 0.264
0.0LysXaa: 0.0 ± 0.0
Leu
9.451LeuAla: 9.451 ± 0.93
1.461LeuCys: 1.461 ± 0.463
4.983LeuAsp: 4.983 ± 0.565
5.327LeuGlu: 5.327 ± 0.951
2.835LeuPhe: 2.835 ± 0.397
6.1LeuGly: 6.1 ± 0.792
1.89LeuHis: 1.89 ± 0.419
3.694LeuIle: 3.694 ± 0.644
3.351LeuLys: 3.351 ± 0.44
6.616LeuLeu: 6.616 ± 0.558
1.461LeuMet: 1.461 ± 0.399
2.921LeuAsn: 2.921 ± 0.498
4.897LeuPro: 4.897 ± 0.719
5.413LeuGln: 5.413 ± 0.993
6.444LeuArg: 6.444 ± 0.589
5.069LeuSer: 5.069 ± 0.677
6.702LeuThr: 6.702 ± 0.775
4.038LeuVal: 4.038 ± 0.549
0.859LeuTrp: 0.859 ± 0.337
1.804LeuTyr: 1.804 ± 0.377
0.0LeuXaa: 0.0 ± 0.0
Met
2.921MetAla: 2.921 ± 0.473
0.172MetCys: 0.172 ± 0.108
1.031MetAsp: 1.031 ± 0.278
0.601MetGlu: 0.601 ± 0.247
0.258MetPhe: 0.258 ± 0.135
1.375MetGly: 1.375 ± 0.362
0.516MetHis: 0.516 ± 0.263
1.117MetIle: 1.117 ± 0.332
0.773MetLys: 0.773 ± 0.24
1.289MetLeu: 1.289 ± 0.327
0.258MetMet: 0.258 ± 0.158
0.859MetAsn: 0.859 ± 0.292
0.773MetPro: 0.773 ± 0.284
1.289MetGln: 1.289 ± 0.298
1.117MetArg: 1.117 ± 0.313
1.117MetSer: 1.117 ± 0.269
2.406MetThr: 2.406 ± 0.537
1.203MetVal: 1.203 ± 0.293
0.172MetTrp: 0.172 ± 0.12
0.344MetTyr: 0.344 ± 0.193
0.0MetXaa: 0.0 ± 0.0
Asn
3.78AsnAla: 3.78 ± 0.496
0.601AsnCys: 0.601 ± 0.274
2.663AsnAsp: 2.663 ± 0.442
2.32AsnGlu: 2.32 ± 0.401
0.687AsnPhe: 0.687 ± 0.242
3.179AsnGly: 3.179 ± 0.497
0.516AsnHis: 0.516 ± 0.216
1.375AsnIle: 1.375 ± 0.286
0.773AsnLys: 0.773 ± 0.185
3.093AsnLeu: 3.093 ± 0.485
0.859AsnMet: 0.859 ± 0.254
1.031AsnAsn: 1.031 ± 0.425
2.234AsnPro: 2.234 ± 0.527
2.32AsnGln: 2.32 ± 0.423
1.375AsnArg: 1.375 ± 0.277
1.547AsnSer: 1.547 ± 0.336
2.835AsnThr: 2.835 ± 0.458
2.234AsnVal: 2.234 ± 0.531
0.687AsnTrp: 0.687 ± 0.223
0.687AsnTyr: 0.687 ± 0.322
0.0AsnXaa: 0.0 ± 0.0
Pro
4.21ProAla: 4.21 ± 0.595
0.344ProCys: 0.344 ± 0.151
3.694ProAsp: 3.694 ± 0.669
2.749ProGlu: 2.749 ± 0.45
1.89ProPhe: 1.89 ± 0.509
3.437ProGly: 3.437 ± 0.599
1.031ProHis: 1.031 ± 0.318
2.062ProIle: 2.062 ± 0.401
1.804ProLys: 1.804 ± 0.389
2.921ProLeu: 2.921 ± 0.647
0.687ProMet: 0.687 ± 0.289
2.148ProAsn: 2.148 ± 0.433
3.179ProPro: 3.179 ± 0.614
2.663ProGln: 2.663 ± 0.519
1.89ProArg: 1.89 ± 0.442
4.124ProSer: 4.124 ± 0.64
3.351ProThr: 3.351 ± 0.56
3.179ProVal: 3.179 ± 0.485
0.516ProTrp: 0.516 ± 0.203
1.461ProTyr: 1.461 ± 0.345
0.0ProXaa: 0.0 ± 0.0
Gln
5.842GlnAla: 5.842 ± 0.834
0.859GlnCys: 0.859 ± 0.243
2.406GlnAsp: 2.406 ± 0.48
1.547GlnGlu: 1.547 ± 0.3
1.031GlnPhe: 1.031 ± 0.284
2.32GlnGly: 2.32 ± 0.469
1.289GlnHis: 1.289 ± 0.344
2.921GlnIle: 2.921 ± 0.626
1.375GlnLys: 1.375 ± 0.344
5.069GlnLeu: 5.069 ± 0.667
1.031GlnMet: 1.031 ± 0.271
1.718GlnAsn: 1.718 ± 0.477
1.89GlnPro: 1.89 ± 0.369
3.179GlnGln: 3.179 ± 0.744
3.351GlnArg: 3.351 ± 0.639
3.351GlnSer: 3.351 ± 0.583
1.804GlnThr: 1.804 ± 0.378
3.351GlnVal: 3.351 ± 0.429
1.031GlnTrp: 1.031 ± 0.308
1.031GlnTyr: 1.031 ± 0.336
0.0GlnXaa: 0.0 ± 0.0
Arg
6.014ArgAla: 6.014 ± 0.639
1.375ArgCys: 1.375 ± 0.446
3.437ArgAsp: 3.437 ± 0.628
2.921ArgGlu: 2.921 ± 0.513
2.663ArgPhe: 2.663 ± 0.438
3.093ArgGly: 3.093 ± 0.436
1.289ArgHis: 1.289 ± 0.33
4.382ArgIle: 4.382 ± 0.67
3.609ArgLys: 3.609 ± 0.713
6.444ArgLeu: 6.444 ± 0.722
1.976ArgMet: 1.976 ± 0.347
1.632ArgAsn: 1.632 ± 0.387
2.234ArgPro: 2.234 ± 0.403
2.921ArgGln: 2.921 ± 0.51
5.155ArgArg: 5.155 ± 0.704
4.64ArgSer: 4.64 ± 0.606
3.437ArgThr: 3.437 ± 0.423
3.351ArgVal: 3.351 ± 0.556
1.375ArgTrp: 1.375 ± 0.347
2.234ArgTyr: 2.234 ± 0.496
0.0ArgXaa: 0.0 ± 0.0
Ser
6.788SerAla: 6.788 ± 0.667
1.203SerCys: 1.203 ± 0.364
3.437SerAsp: 3.437 ± 0.62
3.351SerGlu: 3.351 ± 0.511
2.062SerPhe: 2.062 ± 0.449
6.616SerGly: 6.616 ± 0.812
1.031SerHis: 1.031 ± 0.298
4.124SerIle: 4.124 ± 0.581
2.663SerLys: 2.663 ± 0.468
5.499SerLeu: 5.499 ± 0.629
1.804SerMet: 1.804 ± 0.348
3.007SerAsn: 3.007 ± 0.683
2.492SerPro: 2.492 ± 0.504
2.578SerGln: 2.578 ± 0.406
3.437SerArg: 3.437 ± 0.519
4.983SerSer: 4.983 ± 0.725
3.609SerThr: 3.609 ± 0.515
4.21SerVal: 4.21 ± 0.711
1.031SerTrp: 1.031 ± 0.336
1.976SerTyr: 1.976 ± 0.327
0.0SerXaa: 0.0 ± 0.0
Thr
7.045ThrAla: 7.045 ± 0.864
0.344ThrCys: 0.344 ± 0.14
4.038ThrAsp: 4.038 ± 0.586
2.921ThrGlu: 2.921 ± 0.412
2.062ThrPhe: 2.062 ± 0.59
5.069ThrGly: 5.069 ± 0.993
1.031ThrHis: 1.031 ± 0.31
4.725ThrIle: 4.725 ± 0.624
2.062ThrLys: 2.062 ± 0.378
4.983ThrLeu: 4.983 ± 0.59
0.945ThrMet: 0.945 ± 0.313
1.718ThrAsn: 1.718 ± 0.422
4.725ThrPro: 4.725 ± 0.603
2.835ThrGln: 2.835 ± 0.438
2.578ThrArg: 2.578 ± 0.441
4.897ThrSer: 4.897 ± 0.722
4.554ThrThr: 4.554 ± 0.882
4.811ThrVal: 4.811 ± 0.557
0.601ThrTrp: 0.601 ± 0.212
2.406ThrTyr: 2.406 ± 0.503
0.0ThrXaa: 0.0 ± 0.0
Val
7.475ValAla: 7.475 ± 0.868
0.945ValCys: 0.945 ± 0.277
3.351ValAsp: 3.351 ± 0.613
4.296ValGlu: 4.296 ± 0.683
2.062ValPhe: 2.062 ± 0.275
4.897ValGly: 4.897 ± 0.883
1.031ValHis: 1.031 ± 0.314
2.32ValIle: 2.32 ± 0.47
2.406ValLys: 2.406 ± 0.51
5.069ValLeu: 5.069 ± 0.589
1.031ValMet: 1.031 ± 0.285
2.663ValAsn: 2.663 ± 0.498
2.148ValPro: 2.148 ± 0.425
1.89ValGln: 1.89 ± 0.406
3.351ValArg: 3.351 ± 0.505
3.609ValSer: 3.609 ± 0.521
5.327ValThr: 5.327 ± 0.745
5.155ValVal: 5.155 ± 0.73
0.601ValTrp: 0.601 ± 0.21
2.234ValTyr: 2.234 ± 0.402
0.0ValXaa: 0.0 ± 0.0
Trp
1.289TrpAla: 1.289 ± 0.346
0.687TrpCys: 0.687 ± 0.258
0.516TrpAsp: 0.516 ± 0.198
0.945TrpGlu: 0.945 ± 0.468
0.344TrpPhe: 0.344 ± 0.167
0.773TrpGly: 0.773 ± 0.227
0.601TrpHis: 0.601 ± 0.223
0.945TrpIle: 0.945 ± 0.26
0.687TrpLys: 0.687 ± 0.274
1.461TrpLeu: 1.461 ± 0.319
0.43TrpMet: 0.43 ± 0.165
0.687TrpAsn: 0.687 ± 0.22
1.031TrpPro: 1.031 ± 0.414
0.344TrpGln: 0.344 ± 0.162
1.117TrpArg: 1.117 ± 0.316
1.375TrpSer: 1.375 ± 0.35
0.516TrpThr: 0.516 ± 0.219
0.773TrpVal: 0.773 ± 0.206
0.344TrpTrp: 0.344 ± 0.16
0.344TrpTyr: 0.344 ± 0.177
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.749TyrAla: 2.749 ± 0.561
0.344TyrCys: 0.344 ± 0.149
1.461TyrAsp: 1.461 ± 0.417
1.89TyrGlu: 1.89 ± 0.421
0.773TyrPhe: 0.773 ± 0.201
2.406TyrGly: 2.406 ± 0.458
0.516TyrHis: 0.516 ± 0.215
1.203TyrIle: 1.203 ± 0.306
0.945TyrLys: 0.945 ± 0.297
2.578TyrLeu: 2.578 ± 0.435
0.344TyrMet: 0.344 ± 0.176
1.203TyrAsn: 1.203 ± 0.319
1.117TyrPro: 1.117 ± 0.292
1.289TyrGln: 1.289 ± 0.291
2.148TyrArg: 2.148 ± 0.449
2.148TyrSer: 2.148 ± 0.378
1.375TyrThr: 1.375 ± 0.45
1.89TyrVal: 1.89 ± 0.458
0.601TyrTrp: 0.601 ± 0.23
0.773TyrTyr: 0.773 ± 0.247
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (11640 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski