Amino acid dipeptide frequency for Streptococcus phage Javan566

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
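As a minimal sketch of how such a frequency could be computed: the function below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name, the use of one-letter codes, and the choice to normalize by the total number of counted pairs are assumptions made for illustration; the exact procedure used to build the table is not stated here.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs overlap (residues i and i+1) and never span two
    proteins; frequencies are normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every adjacent residue pair in order.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real phage proteins): 7 pairs in total, "KK" occurs twice.
freqs = dipeptide_permille(["MKKLA", "AKKA"])
print(round(freqs["KK"], 3))
```

The table uses three-letter codes (for example LysLys rather than KK); mapping between the two notations is a simple lookup and is left out to keep the sketch short.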

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
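The expected value under this uniform model, and the over/under comparison, can be sketched as follows. The function names are hypothetical, and comparing each value directly against the uniform baseline (without taking the reported error into account) is an assumption; the page does not state the exact rule behind the red and blue marking.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation in per mille: 1000 / (number of possible dipeptides)."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a value as 'over', 'under' or 'expected' relative to the uniform model."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"


print(expected_permille())               # 2.5 (20 amino acids, 400 dipeptides)
print(round(expected_permille(21), 3))   # 2.268 (with Xaa, 441 dipeptides)
print(classify(8.592))                   # LeuLys from the table: over
print(classify(0.366))                   # ProPro from the table: under
```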

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
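For illustration, the unit conversion amounts to the following (the helper names are hypothetical):

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 2.834 per mille, i.e. roughly 0.0028 as a fraction
# or 0.28 percent.
print(permille_to_fraction(2.834), permille_to_percent(2.834))
```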

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue. Second residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 2.834 ± 0.687
AlaCys: 0.64 ± 0.226
AlaAsp: 5.484 ± 0.799
AlaGlu: 6.216 ± 0.865
AlaPhe: 2.285 ± 0.526
AlaGly: 5.302 ± 1.164
AlaHis: 1.188 ± 0.342
AlaIle: 6.307 ± 1.045
AlaLys: 6.947 ± 0.704
AlaLeu: 5.85 ± 0.935
AlaMet: 1.371 ± 0.41
AlaAsn: 5.027 ± 0.614
AlaPro: 0.64 ± 0.207
AlaGln: 2.468 ± 0.55
AlaArg: 3.291 ± 0.528
AlaSer: 5.484 ± 0.912
AlaThr: 4.388 ± 0.821
AlaVal: 5.759 ± 0.472
AlaTrp: 0.548 ± 0.286
AlaTyr: 1.737 ± 0.402
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.274 ± 0.165
CysCys: 0.183 ± 0.124
CysAsp: 0.731 ± 0.21
CysGlu: 0.366 ± 0.191
CysPhe: 0.366 ± 0.179
CysGly: 0.64 ± 0.282
CysHis: 0.0 ± 0.0
CysIle: 0.183 ± 0.117
CysLys: 0.274 ± 0.162
CysLeu: 0.548 ± 0.273
CysMet: 0.091 ± 0.094
CysAsn: 0.091 ± 0.1
CysPro: 0.274 ± 0.133
CysGln: 0.183 ± 0.111
CysArg: 0.457 ± 0.262
CysSer: 0.366 ± 0.185
CysThr: 0.091 ± 0.095
CysVal: 0.091 ± 0.091
CysTrp: 0.0 ± 0.0
CysTyr: 0.366 ± 0.192
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.199 ± 0.511
AspCys: 0.64 ± 0.288
AspAsp: 3.748 ± 0.736
AspGlu: 4.753 ± 0.711
AspPhe: 3.382 ± 0.787
AspGly: 5.027 ± 0.709
AspHis: 0.823 ± 0.257
AspIle: 5.21 ± 0.59
AspLys: 4.662 ± 0.701
AspLeu: 7.13 ± 0.799
AspMet: 2.194 ± 0.348
AspAsn: 3.748 ± 0.682
AspPro: 1.737 ± 0.487
AspGln: 1.554 ± 0.353
AspArg: 2.468 ± 0.487
AspSer: 3.382 ± 0.465
AspThr: 3.108 ± 0.451
AspVal: 4.845 ± 0.786
AspTrp: 1.005 ± 0.235
AspTyr: 2.102 ± 0.529
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.033 ± 0.832
GluCys: 0.183 ± 0.147
GluAsp: 3.016 ± 0.53
GluGlu: 5.302 ± 0.974
GluPhe: 2.925 ± 0.386
GluGly: 3.108 ± 0.497
GluHis: 1.463 ± 0.4
GluIle: 5.027 ± 0.698
GluLys: 5.85 ± 1.213
GluLeu: 7.038 ± 1.081
GluMet: 1.92 ± 0.409
GluAsn: 3.748 ± 0.483
GluPro: 2.011 ± 0.575
GluGln: 3.839 ± 0.642
GluArg: 4.022 ± 0.62
GluSer: 3.108 ± 0.53
GluThr: 4.388 ± 0.572
GluVal: 4.753 ± 0.558
GluTrp: 0.731 ± 0.265
GluTyr: 1.92 ± 0.553
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.291 ± 0.669
PheCys: 0.091 ± 0.09
PheAsp: 3.199 ± 0.617
PheGlu: 4.845 ± 0.607
PhePhe: 2.102 ± 0.443
PheGly: 3.016 ± 0.477
PheHis: 0.548 ± 0.199
PheIle: 1.92 ± 0.428
PheLys: 2.742 ± 0.539
PheLeu: 2.834 ± 0.555
PheMet: 1.28 ± 0.426
PheAsn: 2.285 ± 0.487
PhePro: 0.548 ± 0.225
PheGln: 1.097 ± 0.564
PheArg: 1.005 ± 0.296
PheSer: 2.102 ± 0.491
PheThr: 2.011 ± 0.441
PheVal: 2.011 ± 0.432
PheTrp: 0.366 ± 0.174
PheTyr: 2.011 ± 0.451
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.296 ± 1.282
GlyCys: 0.183 ± 0.128
GlyAsp: 3.291 ± 0.545
GlyGlu: 3.382 ± 0.484
GlyPhe: 2.925 ± 0.762
GlyGly: 4.479 ± 0.833
GlyHis: 1.005 ± 0.28
GlyIle: 4.296 ± 0.705
GlyLys: 5.667 ± 0.698
GlyLeu: 5.941 ± 1.081
GlyMet: 2.011 ± 0.408
GlyAsn: 3.382 ± 0.651
GlyPro: 1.005 ± 0.314
GlyGln: 3.473 ± 0.389
GlyArg: 3.199 ± 0.413
GlySer: 2.834 ± 0.82
GlyThr: 3.656 ± 0.532
GlyVal: 4.113 ± 0.832
GlyTrp: 1.005 ± 0.294
GlyTyr: 3.016 ± 0.55
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.188 ± 0.305
HisCys: 0.183 ± 0.146
HisAsp: 0.731 ± 0.311
HisGlu: 1.005 ± 0.271
HisPhe: 0.731 ± 0.276
HisGly: 1.28 ± 0.335
HisHis: 0.366 ± 0.158
HisIle: 0.731 ± 0.282
HisLys: 1.005 ± 0.383
HisLeu: 1.371 ± 0.402
HisMet: 0.183 ± 0.139
HisAsn: 0.731 ± 0.236
HisPro: 1.005 ± 0.331
HisGln: 0.457 ± 0.189
HisArg: 0.548 ± 0.22
HisSer: 0.731 ± 0.271
HisThr: 0.64 ± 0.245
HisVal: 0.823 ± 0.245
HisTrp: 0.091 ± 0.076
HisTyr: 0.64 ± 0.204
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.57 ± 0.573
IleCys: 0.457 ± 0.243
IleAsp: 5.484 ± 0.634
IleGlu: 5.576 ± 0.712
IlePhe: 2.011 ± 0.466
IleGly: 4.205 ± 0.65
IleHis: 0.548 ± 0.212
IleIle: 3.565 ± 0.569
IleLys: 6.764 ± 0.681
IleLeu: 3.839 ± 0.483
IleMet: 1.371 ± 0.606
IleAsn: 3.565 ± 0.807
IlePro: 2.102 ± 0.461
IleGln: 2.651 ± 0.475
IleArg: 2.285 ± 0.536
IleSer: 4.662 ± 0.796
IleThr: 4.296 ± 0.716
IleVal: 4.113 ± 0.586
IleTrp: 0.457 ± 0.18
IleTyr: 2.468 ± 0.486
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.673 ± 0.835
LysCys: 0.091 ± 0.094
LysAsp: 4.205 ± 0.626
LysGlu: 4.753 ± 0.927
LysPhe: 2.651 ± 0.498
LysGly: 4.296 ± 0.623
LysHis: 1.097 ± 0.404
LysIle: 4.753 ± 0.694
LysLys: 6.764 ± 1.066
LysLeu: 5.85 ± 0.74
LysMet: 2.011 ± 0.43
LysAsn: 5.484 ± 0.765
LysPro: 2.742 ± 0.461
LysGln: 4.113 ± 0.619
LysArg: 4.845 ± 0.541
LysSer: 5.85 ± 0.632
LysThr: 6.399 ± 0.935
LysVal: 6.307 ± 0.659
LysTrp: 1.005 ± 0.293
LysTyr: 2.651 ± 0.489
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.41 ± 0.901
LeuCys: 0.548 ± 0.189
LeuAsp: 5.484 ± 0.652
LeuGlu: 6.033 ± 0.913
LeuPhe: 3.382 ± 0.568
LeuGly: 5.21 ± 0.927
LeuHis: 1.097 ± 0.335
LeuIle: 4.113 ± 0.553
LeuLys: 8.592 ± 0.831
LeuLeu: 6.856 ± 0.697
LeuMet: 1.097 ± 0.281
LeuAsn: 4.845 ± 0.907
LeuPro: 3.016 ± 0.657
LeuGln: 3.199 ± 0.501
LeuArg: 3.199 ± 0.615
LeuSer: 5.85 ± 0.941
LeuThr: 5.576 ± 0.707
LeuVal: 4.936 ± 0.567
LeuTrp: 0.914 ± 0.267
LeuTyr: 0.731 ± 0.286
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.737 ± 0.422
MetCys: 0.183 ± 0.117
MetAsp: 1.28 ± 0.327
MetGlu: 1.92 ± 0.497
MetPhe: 0.731 ± 0.201
MetGly: 1.097 ± 0.566
MetHis: 0.274 ± 0.136
MetIle: 1.28 ± 0.387
MetLys: 1.645 ± 0.416
MetLeu: 2.011 ± 0.479
MetMet: 0.274 ± 0.197
MetAsn: 1.463 ± 0.403
MetPro: 1.097 ± 0.331
MetGln: 0.823 ± 0.25
MetArg: 0.823 ± 0.345
MetSer: 1.92 ± 0.521
MetThr: 2.285 ± 0.511
MetVal: 2.102 ± 0.378
MetTrp: 0.366 ± 0.163
MetTyr: 0.914 ± 0.333
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.108 ± 0.423
AsnCys: 0.366 ± 0.161
AsnAsp: 3.382 ± 0.609
AsnGlu: 2.651 ± 0.42
AsnPhe: 3.016 ± 0.386
AsnGly: 4.296 ± 0.72
AsnHis: 0.64 ± 0.195
AsnIle: 2.925 ± 0.628
AsnLys: 3.656 ± 0.605
AsnLeu: 6.124 ± 0.921
AsnMet: 1.005 ± 0.267
AsnAsn: 2.834 ± 0.61
AsnPro: 2.925 ± 0.676
AsnGln: 3.199 ± 0.403
AsnArg: 3.016 ± 0.561
AsnSer: 3.565 ± 0.634
AsnThr: 2.742 ± 0.499
AsnVal: 3.108 ± 0.474
AsnTrp: 1.188 ± 0.373
AsnTyr: 3.016 ± 0.61
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.92 ± 0.457
ProCys: 0.183 ± 0.118
ProAsp: 2.377 ± 0.555
ProGlu: 1.645 ± 0.37
ProPhe: 1.463 ± 0.446
ProGly: 1.463 ± 0.342
ProHis: 0.457 ± 0.225
ProIle: 2.377 ± 0.567
ProLys: 2.742 ± 0.562
ProLeu: 2.285 ± 0.363
ProMet: 1.188 ± 0.277
ProAsn: 1.645 ± 0.489
ProPro: 0.366 ± 0.158
ProGln: 0.823 ± 0.28
ProArg: 1.188 ± 0.316
ProSer: 2.468 ± 0.481
ProThr: 1.463 ± 0.34
ProVal: 2.377 ± 0.568
ProTrp: 0.274 ± 0.155
ProTyr: 0.914 ± 0.286
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.022 ± 0.747
GlnCys: 0.183 ± 0.142
GlnAsp: 2.377 ± 0.434
GlnGlu: 2.834 ± 0.514
GlnPhe: 1.463 ± 0.33
GlnGly: 1.463 ± 0.42
GlnHis: 0.64 ± 0.268
GlnIle: 2.285 ± 0.429
GlnLys: 3.565 ± 0.655
GlnLeu: 3.565 ± 0.6
GlnMet: 0.366 ± 0.196
GlnAsn: 1.92 ± 0.462
GlnPro: 1.097 ± 0.328
GlnGln: 1.188 ± 0.379
GlnArg: 1.463 ± 0.445
GlnSer: 3.108 ± 0.636
GlnThr: 3.291 ± 0.919
GlnVal: 3.199 ± 0.668
GlnTrp: 0.548 ± 0.197
GlnTyr: 1.463 ± 0.347
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.468 ± 0.465
ArgCys: 0.366 ± 0.225
ArgAsp: 2.742 ± 0.5
ArgGlu: 2.742 ± 0.566
ArgPhe: 1.645 ± 0.302
ArgGly: 1.28 ± 0.369
ArgHis: 1.097 ± 0.322
ArgIle: 3.839 ± 0.672
ArgLys: 3.748 ± 0.771
ArgLeu: 4.57 ± 0.7
ArgMet: 1.28 ± 0.351
ArgAsn: 3.565 ± 0.568
ArgPro: 1.371 ± 0.406
ArgGln: 1.463 ± 0.351
ArgArg: 2.285 ± 0.388
ArgSer: 2.285 ± 0.421
ArgThr: 3.016 ± 0.563
ArgVal: 2.285 ± 0.43
ArgTrp: 0.64 ± 0.265
ArgTyr: 2.011 ± 0.45
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.393 ± 1.503
SerCys: 0.366 ± 0.147
SerAsp: 4.753 ± 0.601
SerGlu: 4.57 ± 0.575
SerPhe: 2.468 ± 0.449
SerGly: 4.662 ± 0.812
SerHis: 0.823 ± 0.206
SerIle: 4.296 ± 0.631
SerLys: 4.662 ± 0.764
SerLeu: 5.119 ± 0.911
SerMet: 2.011 ± 0.37
SerAsn: 3.473 ± 0.634
SerPro: 1.554 ± 0.347
SerGln: 2.011 ± 0.606
SerArg: 2.925 ± 0.661
SerSer: 4.296 ± 0.777
SerThr: 3.291 ± 0.616
SerVal: 5.393 ± 0.927
SerTrp: 1.005 ± 0.314
SerTyr: 2.011 ± 0.307
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.85 ± 1.355
ThrCys: 0.091 ± 0.09
ThrAsp: 2.834 ± 0.514
ThrGlu: 4.662 ± 0.661
ThrPhe: 1.92 ± 0.318
ThrGly: 4.388 ± 0.586
ThrHis: 0.731 ± 0.257
ThrIle: 4.845 ± 0.648
ThrLys: 5.393 ± 0.675
ThrLeu: 4.479 ± 0.613
ThrMet: 1.097 ± 0.398
ThrAsn: 2.925 ± 0.538
ThrPro: 1.188 ± 0.257
ThrGln: 2.651 ± 0.715
ThrArg: 2.468 ± 0.436
ThrSer: 3.839 ± 0.473
ThrThr: 3.931 ± 0.475
ThrVal: 5.393 ± 0.797
ThrTrp: 0.457 ± 0.178
ThrTyr: 2.559 ± 0.5
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.119 ± 0.684
ValCys: 0.457 ± 0.212
ValAsp: 6.033 ± 0.859
ValGlu: 4.662 ± 0.817
ValPhe: 1.92 ± 0.357
ValGly: 4.662 ± 0.717
ValHis: 1.005 ± 0.27
ValIle: 4.57 ± 0.769
ValLys: 4.845 ± 0.674
ValLeu: 4.662 ± 0.9
ValMet: 1.463 ± 0.333
ValAsn: 3.931 ± 0.565
ValPro: 2.925 ± 0.495
ValGln: 2.285 ± 0.577
ValArg: 2.559 ± 0.448
ValSer: 6.216 ± 0.84
ValThr: 4.296 ± 0.71
ValVal: 5.393 ± 1.015
ValTrp: 0.64 ± 0.192
ValTyr: 2.468 ± 0.414
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.731 ± 0.26
TrpCys: 0.091 ± 0.095
TrpAsp: 0.731 ± 0.266
TrpGlu: 1.005 ± 0.305
TrpPhe: 0.731 ± 0.24
TrpGly: 1.097 ± 0.344
TrpHis: 0.183 ± 0.118
TrpIle: 0.731 ± 0.287
TrpLys: 1.371 ± 0.34
TrpLeu: 0.457 ± 0.216
TrpMet: 0.183 ± 0.12
TrpAsn: 0.64 ± 0.242
TrpPro: 0.183 ± 0.138
TrpGln: 0.731 ± 0.292
TrpArg: 0.548 ± 0.191
TrpSer: 0.731 ± 0.238
TrpThr: 0.548 ± 0.225
TrpVal: 0.548 ± 0.194
TrpTrp: 0.091 ± 0.089
TrpTyr: 0.457 ± 0.183
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.651 ± 0.481
TyrCys: 0.091 ± 0.09
TyrAsp: 2.834 ± 0.529
TyrGlu: 1.463 ± 0.33
TyrPhe: 1.188 ± 0.34
TyrGly: 2.194 ± 0.44
TyrHis: 0.457 ± 0.176
TyrIle: 1.828 ± 0.458
TyrLys: 1.645 ± 0.332
TyrLeu: 2.559 ± 0.451
TyrMet: 1.737 ± 0.45
TyrAsn: 1.463 ± 0.381
TyrPro: 1.828 ± 0.524
TyrGln: 1.828 ± 0.432
TyrArg: 2.102 ± 0.534
TyrSer: 2.285 ± 0.446
TyrThr: 2.285 ± 0.358
TyrVal: 2.559 ± 0.423
TyrTrp: 0.457 ± 0.182
TyrTyr: 0.548 ± 0.189
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 57 proteins (10941 amino acids)
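
Each table entry above ends with a pattern of the form FirstSecond: value ± error, using three-letter residue codes. A small, hypothetical parser for such entries might look like the sketch below; the function name and the regular expression are illustrative assumptions, not part of Proteome-pI.

```python
import re

# Two three-letter residue codes, a colon, then "value ± error".
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)


def parse_entry(line):
    """Return (first, second, value, error) or None if the line is not an entry."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


print(parse_entry("LeuLys: 8.592 ± 0.831"))
print(parse_entry("Leu"))  # a row label, not an entry
```

Because the pattern is searched for rather than anchored at the start of the line, any text preceding the residue codes on the same line is ignored.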

Note: The error has been estimated by bootstrapping (×100) at the protein level.
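
A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. This is only an illustration under stated assumptions: the function names, the use of the standard deviation as the error measure, and the normalization by the number of counted pairs are not confirmed by the page.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_dipeptide(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each round draws len(proteins) proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(counts.values()) for counts in sample)
        hits = sum(counts[pair] for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Toy sequences in one-letter code (not taken from the phage proteome).
toy = ["MKKLAKK", "AKLA", "MLKKV", "KKKK"]
avg, err = bootstrap_dipeptide(toy, "KK")
print(f"KK: {avg:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than single residues keeps each sequence intact, so dipeptides that cluster in a few proteins yield a correspondingly larger error estimate.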

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski