Amino acid dipepetide frequency for Streptococcus phage Javan197

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.98AlaAla: 5.98 ± 1.176
0.098AlaCys: 0.098 ± 0.087
5.098AlaAsp: 5.098 ± 0.764
5.49AlaGlu: 5.49 ± 0.907
2.549AlaPhe: 2.549 ± 0.536
5.294AlaGly: 5.294 ± 1.266
0.784AlaHis: 0.784 ± 0.306
7.156AlaIle: 7.156 ± 1.116
6.274AlaLys: 6.274 ± 0.775
5.882AlaLeu: 5.882 ± 0.694
1.47AlaMet: 1.47 ± 0.379
4.215AlaAsn: 4.215 ± 0.62
1.078AlaPro: 1.078 ± 0.302
2.745AlaGln: 2.745 ± 0.558
2.647AlaArg: 2.647 ± 0.547
4.901AlaSer: 4.901 ± 0.731
4.019AlaThr: 4.019 ± 0.785
4.117AlaVal: 4.117 ± 0.761
0.882AlaTrp: 0.882 ± 0.299
2.353AlaTyr: 2.353 ± 0.38
0.0AlaXaa: 0.0 ± 0.0
Cys
0.294CysAla: 0.294 ± 0.141
0.294CysCys: 0.294 ± 0.155
0.686CysAsp: 0.686 ± 0.227
0.392CysGlu: 0.392 ± 0.176
0.294CysPhe: 0.294 ± 0.174
0.686CysGly: 0.686 ± 0.315
0.294CysHis: 0.294 ± 0.151
0.294CysIle: 0.294 ± 0.157
0.588CysLys: 0.588 ± 0.253
1.078CysLeu: 1.078 ± 0.306
0.0CysMet: 0.0 ± 0.0
0.196CysAsn: 0.196 ± 0.129
0.196CysPro: 0.196 ± 0.134
0.392CysGln: 0.392 ± 0.179
0.196CysArg: 0.196 ± 0.179
0.588CysSer: 0.588 ± 0.239
0.0CysThr: 0.0 ± 0.0
0.196CysVal: 0.196 ± 0.164
0.0CysTrp: 0.0 ± 0.0
0.098CysTyr: 0.098 ± 0.087
0.0CysXaa: 0.0 ± 0.0
Asp
4.019AspAla: 4.019 ± 0.687
0.686AspCys: 0.686 ± 0.237
4.705AspAsp: 4.705 ± 0.582
4.117AspGlu: 4.117 ± 0.586
3.529AspPhe: 3.529 ± 0.464
4.509AspGly: 4.509 ± 0.601
0.784AspHis: 0.784 ± 0.262
4.607AspIle: 4.607 ± 0.674
5.784AspLys: 5.784 ± 0.737
5.686AspLeu: 5.686 ± 0.846
0.98AspMet: 0.98 ± 0.311
4.607AspAsn: 4.607 ± 0.68
1.078AspPro: 1.078 ± 0.255
1.372AspGln: 1.372 ± 0.339
2.549AspArg: 2.549 ± 0.408
3.725AspSer: 3.725 ± 0.555
3.235AspThr: 3.235 ± 0.481
4.411AspVal: 4.411 ± 0.556
1.372AspTrp: 1.372 ± 0.41
2.941AspTyr: 2.941 ± 0.562
0.0AspXaa: 0.0 ± 0.0
Glu
5.49GluAla: 5.49 ± 0.972
0.294GluCys: 0.294 ± 0.17
2.255GluAsp: 2.255 ± 0.558
4.215GluGlu: 4.215 ± 0.813
2.647GluPhe: 2.647 ± 0.514
2.451GluGly: 2.451 ± 0.477
1.274GluHis: 1.274 ± 0.357
5.882GluIle: 5.882 ± 0.875
5.686GluLys: 5.686 ± 0.744
8.038GluLeu: 8.038 ± 1.077
1.47GluMet: 1.47 ± 0.408
3.137GluAsn: 3.137 ± 0.596
1.667GluPro: 1.667 ± 0.329
2.647GluGln: 2.647 ± 0.528
2.941GluArg: 2.941 ± 0.541
4.411GluSer: 4.411 ± 0.611
4.117GluThr: 4.117 ± 0.691
4.607GluVal: 4.607 ± 0.836
0.686GluTrp: 0.686 ± 0.239
3.039GluTyr: 3.039 ± 0.555
0.0GluXaa: 0.0 ± 0.0
Phe
3.627PheAla: 3.627 ± 0.714
0.196PheCys: 0.196 ± 0.138
2.941PheAsp: 2.941 ± 0.465
3.137PheGlu: 3.137 ± 0.58
1.176PhePhe: 1.176 ± 0.296
2.255PheGly: 2.255 ± 0.394
0.49PheHis: 0.49 ± 0.295
2.157PheIle: 2.157 ± 0.547
2.451PheLys: 2.451 ± 0.437
2.843PheLeu: 2.843 ± 0.616
0.882PheMet: 0.882 ± 0.292
3.333PheAsn: 3.333 ± 0.618
0.588PhePro: 0.588 ± 0.249
1.176PheGln: 1.176 ± 0.289
2.549PheArg: 2.549 ± 0.397
2.353PheSer: 2.353 ± 0.396
1.765PheThr: 1.765 ± 0.443
2.353PheVal: 2.353 ± 0.495
0.49PheTrp: 0.49 ± 0.194
0.882PheTyr: 0.882 ± 0.291
0.0PheXaa: 0.0 ± 0.0
Gly
3.921GlyAla: 3.921 ± 0.932
0.294GlyCys: 0.294 ± 0.188
4.215GlyAsp: 4.215 ± 0.63
3.823GlyGlu: 3.823 ± 0.683
2.451GlyPhe: 2.451 ± 0.602
4.215GlyGly: 4.215 ± 0.689
1.274GlyHis: 1.274 ± 0.415
5.196GlyIle: 5.196 ± 0.815
5.686GlyLys: 5.686 ± 0.834
6.764GlyLeu: 6.764 ± 1.072
1.863GlyMet: 1.863 ± 0.442
2.647GlyAsn: 2.647 ± 0.463
1.765GlyPro: 1.765 ± 1.458
3.333GlyGln: 3.333 ± 0.605
2.451GlyArg: 2.451 ± 0.393
3.431GlySer: 3.431 ± 0.636
3.627GlyThr: 3.627 ± 0.668
3.823GlyVal: 3.823 ± 0.657
0.49GlyTrp: 0.49 ± 0.243
3.725GlyTyr: 3.725 ± 0.579
0.0GlyXaa: 0.0 ± 0.0
His
0.784HisAla: 0.784 ± 0.306
0.196HisCys: 0.196 ± 0.137
1.078HisAsp: 1.078 ± 0.308
1.078HisGlu: 1.078 ± 0.396
0.588HisPhe: 0.588 ± 0.24
0.98HisGly: 0.98 ± 0.292
0.196HisHis: 0.196 ± 0.159
1.176HisIle: 1.176 ± 0.332
1.078HisLys: 1.078 ± 0.363
1.176HisLeu: 1.176 ± 0.283
0.196HisMet: 0.196 ± 0.134
0.882HisAsn: 0.882 ± 0.334
0.588HisPro: 0.588 ± 0.308
0.49HisGln: 0.49 ± 0.204
0.882HisArg: 0.882 ± 0.289
1.274HisSer: 1.274 ± 0.494
0.882HisThr: 0.882 ± 0.298
1.078HisVal: 1.078 ± 0.345
0.196HisTrp: 0.196 ± 0.126
1.078HisTyr: 1.078 ± 0.353
0.0HisXaa: 0.0 ± 0.0
Ile
5.686IleAla: 5.686 ± 0.893
0.588IleCys: 0.588 ± 0.214
5.686IleAsp: 5.686 ± 0.886
5.588IleGlu: 5.588 ± 0.722
2.353IlePhe: 2.353 ± 0.388
5.0IleGly: 5.0 ± 0.815
1.176IleHis: 1.176 ± 0.323
4.411IleIle: 4.411 ± 0.74
7.646IleLys: 7.646 ± 1.133
5.49IleLeu: 5.49 ± 0.785
2.255IleMet: 2.255 ± 0.523
4.509IleAsn: 4.509 ± 0.772
2.451IlePro: 2.451 ± 0.681
2.353IleGln: 2.353 ± 0.534
2.843IleArg: 2.843 ± 0.504
6.078IleSer: 6.078 ± 1.0
5.686IleThr: 5.686 ± 0.664
2.843IleVal: 2.843 ± 0.639
0.49IleTrp: 0.49 ± 0.228
1.863IleTyr: 1.863 ± 0.488
0.0IleXaa: 0.0 ± 0.0
Lys
5.294LysAla: 5.294 ± 0.72
0.49LysCys: 0.49 ± 0.2
4.215LysAsp: 4.215 ± 0.557
5.686LysGlu: 5.686 ± 0.806
2.059LysPhe: 2.059 ± 0.361
5.294LysGly: 5.294 ± 0.705
1.176LysHis: 1.176 ± 0.423
7.254LysIle: 7.254 ± 0.996
6.862LysLys: 6.862 ± 1.02
7.156LysLeu: 7.156 ± 0.913
2.745LysMet: 2.745 ± 0.584
5.098LysAsn: 5.098 ± 0.729
1.765LysPro: 1.765 ± 0.446
5.49LysGln: 5.49 ± 0.591
3.823LysArg: 3.823 ± 0.629
7.156LysSer: 7.156 ± 0.941
4.901LysThr: 4.901 ± 0.701
6.176LysVal: 6.176 ± 0.764
0.686LysTrp: 0.686 ± 0.227
3.039LysTyr: 3.039 ± 0.628
0.0LysXaa: 0.0 ± 0.0
Leu
6.568LeuAla: 6.568 ± 0.762
0.196LeuCys: 0.196 ± 0.122
5.98LeuAsp: 5.98 ± 0.659
5.784LeuGlu: 5.784 ± 0.848
3.431LeuPhe: 3.431 ± 0.547
5.196LeuGly: 5.196 ± 0.774
1.274LeuHis: 1.274 ± 0.396
5.98LeuIle: 5.98 ± 0.759
9.901LeuLys: 9.901 ± 1.092
7.352LeuLeu: 7.352 ± 0.997
2.157LeuMet: 2.157 ± 0.428
4.607LeuAsn: 4.607 ± 0.61
3.431LeuPro: 3.431 ± 0.56
3.333LeuGln: 3.333 ± 0.571
2.941LeuArg: 2.941 ± 0.559
6.47LeuSer: 6.47 ± 1.032
5.392LeuThr: 5.392 ± 0.834
5.0LeuVal: 5.0 ± 0.742
0.784LeuTrp: 0.784 ± 0.312
1.961LeuTyr: 1.961 ± 0.349
0.0LeuXaa: 0.0 ± 0.0
Met
3.039MetAla: 3.039 ± 0.676
0.196MetCys: 0.196 ± 0.126
1.667MetAsp: 1.667 ± 0.386
1.47MetGlu: 1.47 ± 0.431
0.784MetPhe: 0.784 ± 0.29
1.274MetGly: 1.274 ± 0.509
0.294MetHis: 0.294 ± 0.187
1.667MetIle: 1.667 ± 0.345
1.372MetLys: 1.372 ± 0.48
1.47MetLeu: 1.47 ± 0.406
0.392MetMet: 0.392 ± 0.212
1.078MetAsn: 1.078 ± 0.328
0.588MetPro: 0.588 ± 0.257
0.686MetGln: 0.686 ± 0.254
1.078MetArg: 1.078 ± 0.324
2.255MetSer: 2.255 ± 0.385
2.647MetThr: 2.647 ± 0.534
1.078MetVal: 1.078 ± 0.341
0.098MetTrp: 0.098 ± 0.079
0.686MetTyr: 0.686 ± 0.205
0.0MetXaa: 0.0 ± 0.0
Asn
3.823AsnAla: 3.823 ± 0.701
0.294AsnCys: 0.294 ± 0.142
2.941AsnAsp: 2.941 ± 0.453
3.627AsnGlu: 3.627 ± 0.728
2.157AsnPhe: 2.157 ± 0.453
5.784AsnGly: 5.784 ± 0.587
1.372AsnHis: 1.372 ± 0.419
4.411AsnIle: 4.411 ± 0.696
3.627AsnLys: 3.627 ± 0.612
5.098AsnLeu: 5.098 ± 0.643
1.274AsnMet: 1.274 ± 0.326
2.843AsnAsn: 2.843 ± 0.696
1.961AsnPro: 1.961 ± 0.482
2.745AsnGln: 2.745 ± 0.622
1.667AsnArg: 1.667 ± 0.446
2.745AsnSer: 2.745 ± 0.594
2.745AsnThr: 2.745 ± 0.534
3.039AsnVal: 3.039 ± 0.464
0.588AsnTrp: 0.588 ± 0.234
1.961AsnTyr: 1.961 ± 0.494
0.0AsnXaa: 0.0 ± 0.0
Pro
2.353ProAla: 2.353 ± 0.557
0.0ProCys: 0.0 ± 0.0
1.765ProAsp: 1.765 ± 0.421
1.667ProGlu: 1.667 ± 0.377
0.882ProPhe: 0.882 ± 0.278
1.372ProGly: 1.372 ± 0.82
0.49ProHis: 0.49 ± 0.255
1.667ProIle: 1.667 ± 0.337
3.235ProLys: 3.235 ± 0.652
2.157ProLeu: 2.157 ± 0.397
0.686ProMet: 0.686 ± 0.205
1.078ProAsn: 1.078 ± 0.328
1.078ProPro: 1.078 ± 0.685
1.372ProGln: 1.372 ± 0.535
0.784ProArg: 0.784 ± 0.219
1.863ProSer: 1.863 ± 0.394
1.568ProThr: 1.568 ± 0.375
2.353ProVal: 2.353 ± 0.554
0.0ProTrp: 0.0 ± 0.0
0.392ProTyr: 0.392 ± 0.175
0.0ProXaa: 0.0 ± 0.0
Gln
2.255GlnAla: 2.255 ± 0.515
0.294GlnCys: 0.294 ± 0.157
2.745GlnAsp: 2.745 ± 0.513
2.843GlnGlu: 2.843 ± 0.523
2.157GlnPhe: 2.157 ± 0.551
2.255GlnGly: 2.255 ± 0.668
0.686GlnHis: 0.686 ± 0.205
4.019GlnIle: 4.019 ± 0.721
4.509GlnLys: 4.509 ± 0.875
3.725GlnLeu: 3.725 ± 0.707
0.98GlnMet: 0.98 ± 0.315
2.255GlnAsn: 2.255 ± 0.566
1.47GlnPro: 1.47 ± 0.379
2.157GlnGln: 2.157 ± 0.325
1.47GlnArg: 1.47 ± 0.35
3.333GlnSer: 3.333 ± 0.57
1.667GlnThr: 1.667 ± 0.478
1.667GlnVal: 1.667 ± 0.354
0.49GlnTrp: 0.49 ± 0.227
1.568GlnTyr: 1.568 ± 0.346
0.0GlnXaa: 0.0 ± 0.0
Arg
2.353ArgAla: 2.353 ± 0.552
0.392ArgCys: 0.392 ± 0.225
2.353ArgAsp: 2.353 ± 0.493
2.647ArgGlu: 2.647 ± 0.437
1.47ArgPhe: 1.47 ± 0.447
1.961ArgGly: 1.961 ± 0.43
0.784ArgHis: 0.784 ± 0.298
3.137ArgIle: 3.137 ± 0.665
3.921ArgLys: 3.921 ± 0.769
4.411ArgLeu: 4.411 ± 0.656
0.588ArgMet: 0.588 ± 0.198
2.451ArgAsn: 2.451 ± 0.566
1.372ArgPro: 1.372 ± 0.416
1.765ArgGln: 1.765 ± 0.342
1.568ArgArg: 1.568 ± 0.332
2.843ArgSer: 2.843 ± 0.59
2.843ArgThr: 2.843 ± 0.487
2.157ArgVal: 2.157 ± 0.38
0.196ArgTrp: 0.196 ± 0.133
1.372ArgTyr: 1.372 ± 0.351
0.0ArgXaa: 0.0 ± 0.0
Ser
5.196SerAla: 5.196 ± 1.124
0.588SerCys: 0.588 ± 0.239
5.196SerAsp: 5.196 ± 0.802
4.705SerGlu: 4.705 ± 0.66
3.431SerPhe: 3.431 ± 0.652
4.607SerGly: 4.607 ± 0.719
0.98SerHis: 0.98 ± 0.303
3.921SerIle: 3.921 ± 0.75
5.686SerLys: 5.686 ± 0.759
5.196SerLeu: 5.196 ± 0.908
2.255SerMet: 2.255 ± 0.331
3.627SerAsn: 3.627 ± 0.608
1.863SerPro: 1.863 ± 0.437
2.843SerGln: 2.843 ± 0.521
3.039SerArg: 3.039 ± 0.543
4.313SerSer: 4.313 ± 0.547
3.431SerThr: 3.431 ± 0.619
4.019SerVal: 4.019 ± 0.543
0.98SerTrp: 0.98 ± 0.236
3.333SerTyr: 3.333 ± 0.588
0.0SerXaa: 0.0 ± 0.0
Thr
4.705ThrAla: 4.705 ± 0.77
0.686ThrCys: 0.686 ± 0.251
3.431ThrAsp: 3.431 ± 0.556
3.431ThrGlu: 3.431 ± 0.441
1.765ThrPhe: 1.765 ± 0.373
5.196ThrGly: 5.196 ± 0.945
1.274ThrHis: 1.274 ± 0.38
5.882ThrIle: 5.882 ± 0.864
4.117ThrLys: 4.117 ± 0.688
5.0ThrLeu: 5.0 ± 0.882
0.49ThrMet: 0.49 ± 0.19
3.333ThrAsn: 3.333 ± 0.578
1.568ThrPro: 1.568 ± 0.376
2.941ThrGln: 2.941 ± 0.571
1.863ThrArg: 1.863 ± 0.495
3.823ThrSer: 3.823 ± 0.627
3.627ThrThr: 3.627 ± 0.694
4.019ThrVal: 4.019 ± 0.741
0.392ThrTrp: 0.392 ± 0.155
0.98ThrTyr: 0.98 ± 0.243
0.0ThrXaa: 0.0 ± 0.0
Val
4.411ValAla: 4.411 ± 0.764
0.294ValCys: 0.294 ± 0.202
4.607ValAsp: 4.607 ± 0.771
4.411ValGlu: 4.411 ± 0.663
2.157ValPhe: 2.157 ± 0.434
3.529ValGly: 3.529 ± 0.579
0.686ValHis: 0.686 ± 0.277
3.529ValIle: 3.529 ± 0.719
4.705ValLys: 4.705 ± 0.673
4.705ValLeu: 4.705 ± 0.708
1.372ValMet: 1.372 ± 0.32
2.549ValAsn: 2.549 ± 0.399
0.98ValPro: 0.98 ± 0.293
2.843ValGln: 2.843 ± 0.428
2.941ValArg: 2.941 ± 0.546
4.411ValSer: 4.411 ± 0.602
4.313ValThr: 4.313 ± 0.79
4.705ValVal: 4.705 ± 0.754
0.49ValTrp: 0.49 ± 0.195
2.255ValTyr: 2.255 ± 0.355
0.0ValXaa: 0.0 ± 0.0
Trp
0.49TrpAla: 0.49 ± 0.236
0.392TrpCys: 0.392 ± 0.173
0.0TrpAsp: 0.0 ± 0.0
0.784TrpGlu: 0.784 ± 0.294
0.196TrpPhe: 0.196 ± 0.145
1.078TrpGly: 1.078 ± 0.375
0.196TrpHis: 0.196 ± 0.164
0.392TrpIle: 0.392 ± 0.215
0.588TrpLys: 0.588 ± 0.226
1.176TrpLeu: 1.176 ± 0.39
0.588TrpMet: 0.588 ± 0.216
0.686TrpAsn: 0.686 ± 0.215
0.294TrpPro: 0.294 ± 0.144
0.196TrpGln: 0.196 ± 0.126
0.588TrpArg: 0.588 ± 0.265
0.882TrpSer: 0.882 ± 0.257
0.196TrpThr: 0.196 ± 0.114
0.392TrpVal: 0.392 ± 0.19
0.098TrpTrp: 0.098 ± 0.096
0.392TrpTyr: 0.392 ± 0.188
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.745TyrAla: 2.745 ± 0.664
0.294TyrCys: 0.294 ± 0.185
3.137TyrAsp: 3.137 ± 0.595
2.059TyrGlu: 2.059 ± 0.365
1.568TyrPhe: 1.568 ± 0.499
2.059TyrGly: 2.059 ± 0.381
0.392TyrHis: 0.392 ± 0.188
2.255TyrIle: 2.255 ± 0.41
2.745TyrLys: 2.745 ± 0.521
3.333TyrLeu: 3.333 ± 0.573
1.078TyrMet: 1.078 ± 0.385
1.568TyrAsn: 1.568 ± 0.395
0.98TyrPro: 0.98 ± 0.309
1.667TyrGln: 1.667 ± 0.382
1.765TyrArg: 1.765 ± 0.436
2.353TyrSer: 2.353 ± 0.462
1.863TyrThr: 1.863 ± 0.522
1.863TyrVal: 1.863 ± 0.446
0.196TyrTrp: 0.196 ± 0.12
1.372TyrTyr: 1.372 ± 0.413
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (10202 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski