Amino acid dipepetide frequency for Freshwater phage uvFW-CGR-AMD-COM-C493

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.886AlaAla: 10.886 ± 1.529
0.186AlaCys: 0.186 ± 0.129
3.256AlaAsp: 3.256 ± 0.663
6.606AlaGlu: 6.606 ± 0.736
2.977AlaPhe: 2.977 ± 0.476
8.467AlaGly: 8.467 ± 1.771
0.837AlaHis: 0.837 ± 0.322
5.024AlaIle: 5.024 ± 0.522
6.513AlaLys: 6.513 ± 1.211
6.792AlaLeu: 6.792 ± 0.679
3.07AlaMet: 3.07 ± 0.491
3.536AlaAsn: 3.536 ± 0.79
4.466AlaPro: 4.466 ± 0.771
4.559AlaGln: 4.559 ± 0.677
5.955AlaArg: 5.955 ± 0.725
5.862AlaSer: 5.862 ± 0.708
7.722AlaThr: 7.722 ± 1.176
5.582AlaVal: 5.582 ± 0.638
1.303AlaTrp: 1.303 ± 0.326
3.443AlaTyr: 3.443 ± 0.71
0.0AlaXaa: 0.0 ± 0.0
Cys
0.093CysAla: 0.093 ± 0.076
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.279CysGlu: 0.279 ± 0.163
0.093CysPhe: 0.093 ± 0.09
0.372CysGly: 0.372 ± 0.217
0.0CysHis: 0.0 ± 0.0
0.372CysIle: 0.372 ± 0.213
0.279CysLys: 0.279 ± 0.216
0.837CysLeu: 0.837 ± 0.35
0.0CysMet: 0.0 ± 0.0
0.186CysAsn: 0.186 ± 0.139
0.372CysPro: 0.372 ± 0.17
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.279CysSer: 0.279 ± 0.156
0.372CysThr: 0.372 ± 0.191
0.279CysVal: 0.279 ± 0.167
0.0CysTrp: 0.0 ± 0.0
0.186CysTyr: 0.186 ± 0.125
0.0CysXaa: 0.0 ± 0.0
Asp
4.559AspAla: 4.559 ± 0.606
0.465AspCys: 0.465 ± 0.214
2.512AspAsp: 2.512 ± 0.537
4.187AspGlu: 4.187 ± 0.498
1.861AspPhe: 1.861 ± 0.35
4.094AspGly: 4.094 ± 0.513
1.303AspHis: 1.303 ± 0.424
3.443AspIle: 3.443 ± 0.59
2.698AspLys: 2.698 ± 0.656
5.117AspLeu: 5.117 ± 0.725
1.116AspMet: 1.116 ± 0.328
1.396AspAsn: 1.396 ± 0.42
4.001AspPro: 4.001 ± 0.481
2.233AspGln: 2.233 ± 0.403
3.349AspArg: 3.349 ± 0.597
3.443AspSer: 3.443 ± 0.594
2.977AspThr: 2.977 ± 0.421
3.07AspVal: 3.07 ± 0.598
0.186AspTrp: 0.186 ± 0.116
1.768AspTyr: 1.768 ± 0.378
0.0AspXaa: 0.0 ± 0.0
Glu
5.862GluAla: 5.862 ± 0.622
0.0GluCys: 0.0 ± 0.0
2.605GluAsp: 2.605 ± 0.559
5.303GluGlu: 5.303 ± 1.062
2.419GluPhe: 2.419 ± 0.355
3.349GluGly: 3.349 ± 0.616
0.744GluHis: 0.744 ± 0.215
3.722GluIle: 3.722 ± 0.612
3.536GluLys: 3.536 ± 0.521
5.582GluLeu: 5.582 ± 0.743
1.768GluMet: 1.768 ± 0.485
2.14GluAsn: 2.14 ± 0.335
2.605GluPro: 2.605 ± 1.198
4.652GluGln: 4.652 ± 0.638
3.722GluArg: 3.722 ± 0.632
3.722GluSer: 3.722 ± 0.618
3.163GluThr: 3.163 ± 0.685
4.838GluVal: 4.838 ± 0.695
1.396GluTrp: 1.396 ± 0.364
1.489GluTyr: 1.489 ± 0.315
0.0GluXaa: 0.0 ± 0.0
Phe
2.698PheAla: 2.698 ± 0.504
0.0PheCys: 0.0 ± 0.0
2.698PheAsp: 2.698 ± 0.408
1.861PheGlu: 1.861 ± 0.328
0.93PhePhe: 0.93 ± 0.434
2.326PheGly: 2.326 ± 0.476
0.558PheHis: 0.558 ± 0.21
2.047PheIle: 2.047 ± 0.353
1.768PheLys: 1.768 ± 0.367
1.768PheLeu: 1.768 ± 0.284
0.465PheMet: 0.465 ± 0.172
1.303PheAsn: 1.303 ± 0.375
1.396PhePro: 1.396 ± 0.333
1.116PheGln: 1.116 ± 0.226
2.047PheArg: 2.047 ± 0.47
2.047PheSer: 2.047 ± 0.519
2.233PheThr: 2.233 ± 0.427
2.419PheVal: 2.419 ± 0.442
0.279PheTrp: 0.279 ± 0.128
1.116PheTyr: 1.116 ± 0.329
0.0PheXaa: 0.0 ± 0.0
Gly
7.257GlyAla: 7.257 ± 1.38
0.465GlyCys: 0.465 ± 0.256
5.769GlyAsp: 5.769 ± 0.82
4.931GlyGlu: 4.931 ± 0.659
2.698GlyPhe: 2.698 ± 0.665
12.095GlyGly: 12.095 ± 4.234
0.837GlyHis: 0.837 ± 0.314
3.07GlyIle: 3.07 ± 0.563
4.745GlyLys: 4.745 ± 0.723
6.234GlyLeu: 6.234 ± 0.71
2.698GlyMet: 2.698 ± 0.476
3.443GlyAsn: 3.443 ± 0.813
3.163GlyPro: 3.163 ± 0.614
2.326GlyGln: 2.326 ± 0.443
4.466GlyArg: 4.466 ± 0.567
6.42GlySer: 6.42 ± 1.413
6.048GlyThr: 6.048 ± 0.984
5.862GlyVal: 5.862 ± 0.81
0.465GlyTrp: 0.465 ± 0.278
3.443GlyTyr: 3.443 ± 0.713
0.0GlyXaa: 0.0 ± 0.0
His
1.023HisAla: 1.023 ± 0.392
0.093HisCys: 0.093 ± 0.091
0.465HisAsp: 0.465 ± 0.207
0.558HisGlu: 0.558 ± 0.23
0.186HisPhe: 0.186 ± 0.116
0.837HisGly: 0.837 ± 0.349
0.093HisHis: 0.093 ± 0.091
0.744HisIle: 0.744 ± 0.327
0.372HisLys: 0.372 ± 0.216
1.396HisLeu: 1.396 ± 0.333
0.279HisMet: 0.279 ± 0.134
0.744HisAsn: 0.744 ± 0.292
0.837HisPro: 0.837 ± 0.22
0.093HisGln: 0.093 ± 0.09
0.465HisArg: 0.465 ± 0.226
0.558HisSer: 0.558 ± 0.214
1.116HisThr: 1.116 ± 0.3
0.465HisVal: 0.465 ± 0.155
0.465HisTrp: 0.465 ± 0.23
0.372HisTyr: 0.372 ± 0.156
0.0HisXaa: 0.0 ± 0.0
Ile
4.745IleAla: 4.745 ± 0.576
0.186IleCys: 0.186 ± 0.123
2.698IleAsp: 2.698 ± 0.496
2.698IleGlu: 2.698 ± 0.449
2.047IlePhe: 2.047 ± 0.309
3.722IleGly: 3.722 ± 0.635
0.465IleHis: 0.465 ± 0.208
1.582IleIle: 1.582 ± 0.341
3.163IleLys: 3.163 ± 0.41
2.698IleLeu: 2.698 ± 0.496
0.744IleMet: 0.744 ± 0.26
3.443IleAsn: 3.443 ± 0.56
2.326IlePro: 2.326 ± 0.483
2.233IleGln: 2.233 ± 0.559
2.326IleArg: 2.326 ± 0.316
3.722IleSer: 3.722 ± 0.515
4.094IleThr: 4.094 ± 0.631
3.443IleVal: 3.443 ± 0.625
0.744IleTrp: 0.744 ± 0.266
1.023IleTyr: 1.023 ± 0.243
0.0IleXaa: 0.0 ± 0.0
Lys
6.606LysAla: 6.606 ± 1.392
0.186LysCys: 0.186 ± 0.119
3.629LysAsp: 3.629 ± 0.974
4.745LysGlu: 4.745 ± 0.622
2.14LysPhe: 2.14 ± 0.652
4.838LysGly: 4.838 ± 0.857
0.837LysHis: 0.837 ± 0.291
1.675LysIle: 1.675 ± 0.342
6.141LysLys: 6.141 ± 1.777
3.815LysLeu: 3.815 ± 0.562
1.954LysMet: 1.954 ± 0.599
3.349LysAsn: 3.349 ± 0.576
3.07LysPro: 3.07 ± 0.711
3.349LysGln: 3.349 ± 0.642
3.163LysArg: 3.163 ± 0.562
3.815LysSer: 3.815 ± 0.632
3.908LysThr: 3.908 ± 0.775
4.001LysVal: 4.001 ± 0.531
0.93LysTrp: 0.93 ± 0.36
1.861LysTyr: 1.861 ± 0.421
0.0LysXaa: 0.0 ± 0.0
Leu
7.908LeuAla: 7.908 ± 0.686
0.558LeuCys: 0.558 ± 0.229
4.931LeuAsp: 4.931 ± 0.725
4.745LeuGlu: 4.745 ± 0.75
2.233LeuPhe: 2.233 ± 0.407
6.141LeuGly: 6.141 ± 0.738
0.93LeuHis: 0.93 ± 0.291
4.187LeuIle: 4.187 ± 0.607
3.163LeuLys: 3.163 ± 0.749
5.675LeuLeu: 5.675 ± 0.92
1.582LeuMet: 1.582 ± 0.338
2.977LeuAsn: 2.977 ± 0.511
3.07LeuPro: 3.07 ± 0.606
2.512LeuGln: 2.512 ± 0.316
5.396LeuArg: 5.396 ± 0.85
7.071LeuSer: 7.071 ± 1.234
4.373LeuThr: 4.373 ± 0.773
5.582LeuVal: 5.582 ± 0.726
0.558LeuTrp: 0.558 ± 0.269
2.233LeuTyr: 2.233 ± 0.478
0.0LeuXaa: 0.0 ± 0.0
Met
2.791MetAla: 2.791 ± 0.728
0.0MetCys: 0.0 ± 0.0
0.744MetAsp: 0.744 ± 0.284
1.396MetGlu: 1.396 ± 0.362
0.465MetPhe: 0.465 ± 0.159
1.675MetGly: 1.675 ± 0.413
0.186MetHis: 0.186 ± 0.157
0.93MetIle: 0.93 ± 0.332
1.023MetLys: 1.023 ± 0.432
1.675MetLeu: 1.675 ± 0.379
0.372MetMet: 0.372 ± 0.194
1.489MetAsn: 1.489 ± 0.325
2.047MetPro: 2.047 ± 0.504
1.396MetGln: 1.396 ± 0.442
1.675MetArg: 1.675 ± 0.392
1.303MetSer: 1.303 ± 0.378
1.768MetThr: 1.768 ± 0.431
1.489MetVal: 1.489 ± 0.358
0.279MetTrp: 0.279 ± 0.149
0.651MetTyr: 0.651 ± 0.25
0.0MetXaa: 0.0 ± 0.0
Asn
4.094AsnAla: 4.094 ± 0.557
0.465AsnCys: 0.465 ± 0.218
2.14AsnAsp: 2.14 ± 0.388
2.233AsnGlu: 2.233 ± 0.444
1.116AsnPhe: 1.116 ± 0.335
4.373AsnGly: 4.373 ± 0.692
0.465AsnHis: 0.465 ± 0.261
2.14AsnIle: 2.14 ± 0.412
3.443AsnLys: 3.443 ± 0.68
3.07AsnLeu: 3.07 ± 0.391
0.651AsnMet: 0.651 ± 0.269
1.116AsnAsn: 1.116 ± 0.298
2.791AsnPro: 2.791 ± 0.552
1.489AsnGln: 1.489 ± 0.399
2.791AsnArg: 2.791 ± 0.463
1.954AsnSer: 1.954 ± 0.438
2.884AsnThr: 2.884 ± 0.467
2.233AsnVal: 2.233 ± 0.401
1.023AsnTrp: 1.023 ± 0.26
1.21AsnTyr: 1.21 ± 0.309
0.0AsnXaa: 0.0 ± 0.0
Pro
4.373ProAla: 4.373 ± 0.839
0.0ProCys: 0.0 ± 0.0
2.605ProAsp: 2.605 ± 0.651
5.024ProGlu: 5.024 ± 1.153
1.582ProPhe: 1.582 ± 0.307
4.745ProGly: 4.745 ± 0.673
0.558ProHis: 0.558 ± 0.212
1.489ProIle: 1.489 ± 0.394
2.419ProLys: 2.419 ± 0.648
3.256ProLeu: 3.256 ± 0.522
0.372ProMet: 0.372 ± 0.137
2.698ProAsn: 2.698 ± 0.513
2.047ProPro: 2.047 ± 0.729
2.326ProGln: 2.326 ± 0.393
2.698ProArg: 2.698 ± 0.558
2.884ProSer: 2.884 ± 0.584
4.652ProThr: 4.652 ± 0.639
3.908ProVal: 3.908 ± 0.579
0.558ProTrp: 0.558 ± 0.188
1.582ProTyr: 1.582 ± 0.319
0.0ProXaa: 0.0 ± 0.0
Gln
6.048GlnAla: 6.048 ± 0.832
0.093GlnCys: 0.093 ± 0.095
2.047GlnAsp: 2.047 ± 0.428
2.791GlnGlu: 2.791 ± 0.667
1.396GlnPhe: 1.396 ± 0.315
3.163GlnGly: 3.163 ± 0.537
0.558GlnHis: 0.558 ± 0.222
2.884GlnIle: 2.884 ± 0.535
2.14GlnLys: 2.14 ± 0.493
4.094GlnLeu: 4.094 ± 0.854
0.837GlnMet: 0.837 ± 0.269
1.489GlnAsn: 1.489 ± 0.317
2.419GlnPro: 2.419 ± 0.529
2.14GlnGln: 2.14 ± 0.39
2.605GlnArg: 2.605 ± 0.473
2.419GlnSer: 2.419 ± 0.648
2.233GlnThr: 2.233 ± 0.526
2.884GlnVal: 2.884 ± 0.46
0.558GlnTrp: 0.558 ± 0.201
1.023GlnTyr: 1.023 ± 0.349
0.0GlnXaa: 0.0 ± 0.0
Arg
5.024ArgAla: 5.024 ± 0.566
0.093ArgCys: 0.093 ± 0.101
2.326ArgAsp: 2.326 ± 0.415
2.884ArgGlu: 2.884 ± 0.492
1.768ArgPhe: 1.768 ± 0.333
3.815ArgGly: 3.815 ± 0.488
0.186ArgHis: 0.186 ± 0.124
3.536ArgIle: 3.536 ± 0.426
4.28ArgLys: 4.28 ± 0.697
5.675ArgLeu: 5.675 ± 0.926
1.861ArgMet: 1.861 ± 0.353
2.512ArgAsn: 2.512 ± 0.36
2.698ArgPro: 2.698 ± 0.67
2.605ArgGln: 2.605 ± 0.432
4.931ArgArg: 4.931 ± 0.733
3.256ArgSer: 3.256 ± 0.49
3.443ArgThr: 3.443 ± 0.824
4.094ArgVal: 4.094 ± 0.539
0.651ArgTrp: 0.651 ± 0.302
2.233ArgTyr: 2.233 ± 0.465
0.0ArgXaa: 0.0 ± 0.0
Ser
6.699SerAla: 6.699 ± 0.676
0.0SerCys: 0.0 ± 0.0
3.443SerAsp: 3.443 ± 0.68
2.698SerGlu: 2.698 ± 0.53
1.954SerPhe: 1.954 ± 0.515
5.582SerGly: 5.582 ± 1.15
0.837SerHis: 0.837 ± 0.235
3.256SerIle: 3.256 ± 0.519
4.28SerLys: 4.28 ± 0.606
5.862SerLeu: 5.862 ± 1.043
1.303SerMet: 1.303 ± 0.291
2.233SerAsn: 2.233 ± 0.524
3.256SerPro: 3.256 ± 0.529
3.07SerGln: 3.07 ± 0.378
2.512SerArg: 2.512 ± 0.529
5.489SerSer: 5.489 ± 1.27
6.327SerThr: 6.327 ± 1.357
3.349SerVal: 3.349 ± 0.595
1.21SerTrp: 1.21 ± 0.345
2.512SerTyr: 2.512 ± 0.428
0.0SerXaa: 0.0 ± 0.0
Thr
6.513ThrAla: 6.513 ± 1.232
0.093ThrCys: 0.093 ± 0.087
3.629ThrAsp: 3.629 ± 0.721
3.07ThrGlu: 3.07 ± 0.836
2.326ThrPhe: 2.326 ± 0.411
7.908ThrGly: 7.908 ± 1.252
0.744ThrHis: 0.744 ± 0.299
3.07ThrIle: 3.07 ± 0.516
4.652ThrLys: 4.652 ± 1.193
5.675ThrLeu: 5.675 ± 0.674
1.954ThrMet: 1.954 ± 0.399
3.256ThrAsn: 3.256 ± 0.574
4.094ThrPro: 4.094 ± 0.875
2.698ThrGln: 2.698 ± 0.496
3.629ThrArg: 3.629 ± 0.525
5.024ThrSer: 5.024 ± 1.177
9.862ThrThr: 9.862 ± 4.232
3.815ThrVal: 3.815 ± 0.569
0.837ThrTrp: 0.837 ± 0.312
1.675ThrTyr: 1.675 ± 0.407
0.0ThrXaa: 0.0 ± 0.0
Val
6.234ValAla: 6.234 ± 0.7
0.744ValCys: 0.744 ± 0.313
4.559ValAsp: 4.559 ± 0.561
4.466ValGlu: 4.466 ± 0.774
1.582ValPhe: 1.582 ± 0.308
4.652ValGly: 4.652 ± 0.566
0.465ValHis: 0.465 ± 0.224
2.605ValIle: 2.605 ± 0.359
5.582ValLys: 5.582 ± 0.903
4.28ValLeu: 4.28 ± 0.665
1.675ValMet: 1.675 ± 0.489
2.791ValAsn: 2.791 ± 0.594
3.536ValPro: 3.536 ± 0.618
3.07ValGln: 3.07 ± 0.578
3.443ValArg: 3.443 ± 0.523
4.28ValSer: 4.28 ± 0.553
4.187ValThr: 4.187 ± 0.919
4.559ValVal: 4.559 ± 0.848
0.465ValTrp: 0.465 ± 0.17
1.954ValTyr: 1.954 ± 0.541
0.0ValXaa: 0.0 ± 0.0
Trp
1.303TrpAla: 1.303 ± 0.36
0.186TrpCys: 0.186 ± 0.134
0.93TrpAsp: 0.93 ± 0.242
0.465TrpGlu: 0.465 ± 0.191
0.093TrpPhe: 0.093 ± 0.091
0.558TrpGly: 0.558 ± 0.279
0.186TrpHis: 0.186 ± 0.148
0.558TrpIle: 0.558 ± 0.233
1.023TrpLys: 1.023 ± 0.288
0.93TrpLeu: 0.93 ± 0.239
0.093TrpMet: 0.093 ± 0.092
0.558TrpAsn: 0.558 ± 0.234
0.372TrpPro: 0.372 ± 0.176
1.116TrpGln: 1.116 ± 0.323
1.21TrpArg: 1.21 ± 0.329
0.558TrpSer: 0.558 ± 0.257
0.93TrpThr: 0.93 ± 0.333
0.744TrpVal: 0.744 ± 0.343
0.093TrpTrp: 0.093 ± 0.091
0.558TrpTyr: 0.558 ± 0.199
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.326TyrAla: 2.326 ± 0.416
0.186TyrCys: 0.186 ± 0.137
2.791TyrAsp: 2.791 ± 0.601
1.675TyrGlu: 1.675 ± 0.36
1.21TyrPhe: 1.21 ± 0.297
3.722TyrGly: 3.722 ± 0.601
0.372TyrHis: 0.372 ± 0.172
1.768TyrIle: 1.768 ± 0.35
2.884TyrLys: 2.884 ± 0.644
1.396TyrLeu: 1.396 ± 0.43
0.465TyrMet: 0.465 ± 0.223
1.023TyrAsn: 1.023 ± 0.379
1.21TyrPro: 1.21 ± 0.399
0.837TyrGln: 0.837 ± 0.3
1.489TyrArg: 1.489 ± 0.31
1.768TyrSer: 1.768 ± 0.364
2.233TyrThr: 2.233 ± 0.376
2.512TyrVal: 2.512 ± 0.303
0.465TyrTrp: 0.465 ± 0.214
1.116TyrTyr: 1.116 ± 0.285
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (10749 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski