Amino acid dipepetide frequency for Streptococcus phage Javan539

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.231AlaAla: 7.231 ± 2.481
0.265AlaCys: 0.265 ± 0.172
4.321AlaAsp: 4.321 ± 0.563
5.026AlaGlu: 5.026 ± 0.772
2.205AlaPhe: 2.205 ± 0.819
6.349AlaGly: 6.349 ± 1.322
1.058AlaHis: 1.058 ± 0.333
6.878AlaIle: 6.878 ± 1.862
6.79AlaLys: 6.79 ± 0.77
6.878AlaLeu: 6.878 ± 1.28
2.734AlaMet: 2.734 ± 0.761
3.88AlaAsn: 3.88 ± 0.61
4.497AlaPro: 4.497 ± 0.676
4.145AlaGln: 4.145 ± 0.982
3.88AlaArg: 3.88 ± 0.534
4.938AlaSer: 4.938 ± 0.972
5.467AlaThr: 5.467 ± 1.218
5.115AlaVal: 5.115 ± 1.332
0.97AlaTrp: 0.97 ± 0.253
1.94AlaTyr: 1.94 ± 0.43
0.0AlaXaa: 0.0 ± 0.0
Cys
0.265CysAla: 0.265 ± 0.137
0.0CysCys: 0.0 ± 0.0
0.265CysAsp: 0.265 ± 0.158
0.353CysGlu: 0.353 ± 0.177
0.088CysPhe: 0.088 ± 0.069
0.441CysGly: 0.441 ± 0.176
0.176CysHis: 0.176 ± 0.101
0.265CysIle: 0.265 ± 0.177
0.441CysLys: 0.441 ± 0.196
0.176CysLeu: 0.176 ± 0.12
0.088CysMet: 0.088 ± 0.101
0.441CysAsn: 0.441 ± 0.202
0.088CysPro: 0.088 ± 0.074
0.088CysGln: 0.088 ± 0.09
0.265CysArg: 0.265 ± 0.147
0.176CysSer: 0.176 ± 0.128
0.088CysThr: 0.088 ± 0.092
0.265CysVal: 0.265 ± 0.143
0.0CysTrp: 0.0 ± 0.0
0.265CysTyr: 0.265 ± 0.159
0.0CysXaa: 0.0 ± 0.0
Asp
4.321AspAla: 4.321 ± 0.593
0.441AspCys: 0.441 ± 0.222
3.88AspAsp: 3.88 ± 0.946
4.321AspGlu: 4.321 ± 0.638
3.527AspPhe: 3.527 ± 0.514
3.88AspGly: 3.88 ± 0.733
0.705AspHis: 0.705 ± 0.262
3.968AspIle: 3.968 ± 0.724
5.467AspLys: 5.467 ± 0.832
5.203AspLeu: 5.203 ± 0.896
1.764AspMet: 1.764 ± 0.308
2.646AspAsn: 2.646 ± 0.49
1.675AspPro: 1.675 ± 0.328
2.116AspGln: 2.116 ± 0.522
2.822AspArg: 2.822 ± 0.66
3.88AspSer: 3.88 ± 0.536
4.056AspThr: 4.056 ± 0.604
3.527AspVal: 3.527 ± 0.529
0.705AspTrp: 0.705 ± 0.239
3.439AspTyr: 3.439 ± 0.633
0.0AspXaa: 0.0 ± 0.0
Glu
3.616GluAla: 3.616 ± 0.536
0.265GluCys: 0.265 ± 0.148
4.586GluAsp: 4.586 ± 0.72
5.556GluGlu: 5.556 ± 1.277
3.263GluPhe: 3.263 ± 0.671
4.497GluGly: 4.497 ± 0.507
0.441GluHis: 0.441 ± 0.185
4.85GluIle: 4.85 ± 0.896
4.056GluLys: 4.056 ± 0.712
7.584GluLeu: 7.584 ± 1.169
2.116GluMet: 2.116 ± 0.508
3.616GluAsn: 3.616 ± 0.674
2.028GluPro: 2.028 ± 0.637
3.439GluGln: 3.439 ± 0.593
3.968GluArg: 3.968 ± 0.902
2.734GluSer: 2.734 ± 0.481
3.792GluThr: 3.792 ± 0.782
4.497GluVal: 4.497 ± 0.698
1.499GluTrp: 1.499 ± 0.416
2.293GluTyr: 2.293 ± 0.552
0.0GluXaa: 0.0 ± 0.0
Phe
2.381PheAla: 2.381 ± 0.381
0.088PheCys: 0.088 ± 0.096
3.704PheAsp: 3.704 ± 0.591
3.704PheGlu: 3.704 ± 0.726
1.146PhePhe: 1.146 ± 0.255
3.616PheGly: 3.616 ± 0.538
0.529PheHis: 0.529 ± 0.287
1.764PheIle: 1.764 ± 0.483
3.086PheLys: 3.086 ± 0.616
1.94PheLeu: 1.94 ± 0.447
1.235PheMet: 1.235 ± 0.326
2.734PheAsn: 2.734 ± 0.379
0.794PhePro: 0.794 ± 0.237
1.411PheGln: 1.411 ± 0.371
1.146PheArg: 1.146 ± 0.315
2.734PheSer: 2.734 ± 0.972
2.293PheThr: 2.293 ± 0.375
1.675PheVal: 1.675 ± 0.429
0.441PheTrp: 0.441 ± 0.219
1.235PheTyr: 1.235 ± 0.522
0.0PheXaa: 0.0 ± 0.0
Gly
4.762GlyAla: 4.762 ± 1.516
0.529GlyCys: 0.529 ± 0.21
3.616GlyAsp: 3.616 ± 0.593
3.88GlyGlu: 3.88 ± 0.573
2.998GlyPhe: 2.998 ± 0.412
4.762GlyGly: 4.762 ± 0.579
1.235GlyHis: 1.235 ± 0.308
6.526GlyIle: 6.526 ± 1.476
5.644GlyLys: 5.644 ± 0.756
6.173GlyLeu: 6.173 ± 1.231
2.116GlyMet: 2.116 ± 0.651
3.527GlyAsn: 3.527 ± 0.472
0.529GlyPro: 0.529 ± 0.199
2.469GlyGln: 2.469 ± 0.474
2.381GlyArg: 2.381 ± 0.407
3.88GlySer: 3.88 ± 0.654
5.203GlyThr: 5.203 ± 1.238
3.88GlyVal: 3.88 ± 0.567
1.235GlyTrp: 1.235 ± 0.387
3.086GlyTyr: 3.086 ± 0.639
0.0GlyXaa: 0.0 ± 0.0
His
0.794HisAla: 0.794 ± 0.265
0.0HisCys: 0.0 ± 0.0
0.97HisAsp: 0.97 ± 0.393
0.617HisGlu: 0.617 ± 0.21
0.882HisPhe: 0.882 ± 0.259
0.529HisGly: 0.529 ± 0.191
0.353HisHis: 0.353 ± 0.176
0.97HisIle: 0.97 ± 0.296
0.97HisLys: 0.97 ± 0.319
0.705HisLeu: 0.705 ± 0.181
0.088HisMet: 0.088 ± 0.082
0.882HisAsn: 0.882 ± 0.395
1.146HisPro: 1.146 ± 0.31
0.794HisGln: 0.794 ± 0.292
0.529HisArg: 0.529 ± 0.163
1.146HisSer: 1.146 ± 0.376
0.882HisThr: 0.882 ± 0.3
0.353HisVal: 0.353 ± 0.194
0.088HisTrp: 0.088 ± 0.1
0.176HisTyr: 0.176 ± 0.116
0.0HisXaa: 0.0 ± 0.0
Ile
6.79IleAla: 6.79 ± 0.887
0.176IleCys: 0.176 ± 0.11
5.026IleAsp: 5.026 ± 0.603
4.674IleGlu: 4.674 ± 0.691
1.499IlePhe: 1.499 ± 0.386
5.467IleGly: 5.467 ± 1.003
0.882IleHis: 0.882 ± 0.326
3.88IleIle: 3.88 ± 0.498
5.82IleLys: 5.82 ± 0.734
3.439IleLeu: 3.439 ± 0.575
1.058IleMet: 1.058 ± 0.213
4.056IleAsn: 4.056 ± 0.706
2.557IlePro: 2.557 ± 0.588
2.205IleGln: 2.205 ± 0.401
2.381IleArg: 2.381 ± 0.421
5.026IleSer: 5.026 ± 1.583
4.586IleThr: 4.586 ± 0.788
3.527IleVal: 3.527 ± 0.727
0.353IleTrp: 0.353 ± 0.204
2.293IleTyr: 2.293 ± 0.398
0.0IleXaa: 0.0 ± 0.0
Lys
6.173LysAla: 6.173 ± 0.503
0.176LysCys: 0.176 ± 0.122
4.145LysAsp: 4.145 ± 0.65
6.437LysGlu: 6.437 ± 1.041
2.646LysPhe: 2.646 ± 0.56
4.233LysGly: 4.233 ± 0.611
0.97LysHis: 0.97 ± 0.332
3.88LysIle: 3.88 ± 0.476
4.938LysLys: 4.938 ± 0.952
5.732LysLeu: 5.732 ± 0.849
1.852LysMet: 1.852 ± 0.305
3.704LysAsn: 3.704 ± 0.678
2.998LysPro: 2.998 ± 0.613
2.91LysGln: 2.91 ± 0.62
3.704LysArg: 3.704 ± 0.596
4.321LysSer: 4.321 ± 0.6
6.526LysThr: 6.526 ± 0.817
4.938LysVal: 4.938 ± 0.678
0.882LysTrp: 0.882 ± 0.313
2.205LysTyr: 2.205 ± 0.547
0.0LysXaa: 0.0 ± 0.0
Leu
8.554LeuAla: 8.554 ± 1.009
0.353LeuCys: 0.353 ± 0.194
5.996LeuAsp: 5.996 ± 0.737
5.026LeuGlu: 5.026 ± 0.985
2.469LeuPhe: 2.469 ± 0.484
5.556LeuGly: 5.556 ± 1.112
0.882LeuHis: 0.882 ± 0.335
3.616LeuIle: 3.616 ± 0.551
7.143LeuLys: 7.143 ± 0.796
3.88LeuLeu: 3.88 ± 0.804
2.646LeuMet: 2.646 ± 0.689
4.586LeuAsn: 4.586 ± 0.577
3.792LeuPro: 3.792 ± 0.767
3.616LeuGln: 3.616 ± 0.58
2.205LeuArg: 2.205 ± 0.426
4.233LeuSer: 4.233 ± 0.716
5.644LeuThr: 5.644 ± 0.647
5.291LeuVal: 5.291 ± 0.532
0.529LeuTrp: 0.529 ± 0.188
2.293LeuTyr: 2.293 ± 0.486
0.0LeuXaa: 0.0 ± 0.0
Met
3.175MetAla: 3.175 ± 1.079
0.176MetCys: 0.176 ± 0.12
1.852MetAsp: 1.852 ± 0.406
1.235MetGlu: 1.235 ± 0.288
0.794MetPhe: 0.794 ± 0.283
1.764MetGly: 1.764 ± 0.355
0.176MetHis: 0.176 ± 0.12
1.587MetIle: 1.587 ± 0.399
1.323MetLys: 1.323 ± 0.244
1.94MetLeu: 1.94 ± 0.484
1.146MetMet: 1.146 ± 0.689
1.235MetAsn: 1.235 ± 0.325
0.705MetPro: 0.705 ± 0.199
1.587MetGln: 1.587 ± 0.472
1.146MetArg: 1.146 ± 0.373
1.764MetSer: 1.764 ± 0.482
2.91MetThr: 2.91 ± 0.65
2.381MetVal: 2.381 ± 0.718
0.529MetTrp: 0.529 ± 0.192
0.441MetTyr: 0.441 ± 0.232
0.0MetXaa: 0.0 ± 0.0
Asn
3.792AsnAla: 3.792 ± 0.483
0.353AsnCys: 0.353 ± 0.204
3.616AsnAsp: 3.616 ± 0.67
2.91AsnGlu: 2.91 ± 0.561
1.499AsnPhe: 1.499 ± 0.398
4.497AsnGly: 4.497 ± 0.742
0.705AsnHis: 0.705 ± 0.253
2.91AsnIle: 2.91 ± 0.6
2.381AsnLys: 2.381 ± 0.495
3.968AsnLeu: 3.968 ± 0.47
1.587AsnMet: 1.587 ± 0.251
2.91AsnAsn: 2.91 ± 0.502
2.469AsnPro: 2.469 ± 0.524
2.734AsnGln: 2.734 ± 0.405
2.205AsnArg: 2.205 ± 0.568
3.175AsnSer: 3.175 ± 0.503
2.822AsnThr: 2.822 ± 0.424
3.792AsnVal: 3.792 ± 0.594
1.058AsnTrp: 1.058 ± 0.27
2.557AsnTyr: 2.557 ± 0.589
0.0AsnXaa: 0.0 ± 0.0
Pro
1.94ProAla: 1.94 ± 0.734
0.0ProCys: 0.0 ± 0.0
2.469ProAsp: 2.469 ± 0.591
2.822ProGlu: 2.822 ± 0.809
1.499ProPhe: 1.499 ± 0.43
1.587ProGly: 1.587 ± 0.378
0.353ProHis: 0.353 ± 0.18
2.469ProIle: 2.469 ± 0.474
2.646ProLys: 2.646 ± 0.448
2.557ProLeu: 2.557 ± 0.497
0.617ProMet: 0.617 ± 0.234
2.116ProAsn: 2.116 ± 0.584
1.411ProPro: 1.411 ± 0.3
1.675ProGln: 1.675 ± 0.542
0.882ProArg: 0.882 ± 0.288
2.822ProSer: 2.822 ± 0.532
2.469ProThr: 2.469 ± 0.539
3.351ProVal: 3.351 ± 0.734
0.088ProTrp: 0.088 ± 0.082
1.675ProTyr: 1.675 ± 0.476
0.0ProXaa: 0.0 ± 0.0
Gln
5.291GlnAla: 5.291 ± 1.107
0.353GlnCys: 0.353 ± 0.177
1.764GlnAsp: 1.764 ± 0.366
2.557GlnGlu: 2.557 ± 0.57
1.94GlnPhe: 1.94 ± 0.402
2.822GlnGly: 2.822 ± 0.628
0.353GlnHis: 0.353 ± 0.221
3.792GlnIle: 3.792 ± 0.699
3.439GlnLys: 3.439 ± 0.798
4.056GlnLeu: 4.056 ± 0.759
1.675GlnMet: 1.675 ± 0.442
1.94GlnAsn: 1.94 ± 0.488
1.411GlnPro: 1.411 ± 0.446
3.351GlnGln: 3.351 ± 0.742
2.205GlnArg: 2.205 ± 0.529
2.734GlnSer: 2.734 ± 0.511
2.116GlnThr: 2.116 ± 0.474
2.91GlnVal: 2.91 ± 0.577
0.441GlnTrp: 0.441 ± 0.158
0.97GlnTyr: 0.97 ± 0.334
0.0GlnXaa: 0.0 ± 0.0
Arg
3.175ArgAla: 3.175 ± 0.654
0.088ArgCys: 0.088 ± 0.097
2.646ArgAsp: 2.646 ± 0.386
3.263ArgGlu: 3.263 ± 0.61
1.411ArgPhe: 1.411 ± 0.372
2.734ArgGly: 2.734 ± 0.756
0.705ArgHis: 0.705 ± 0.21
1.764ArgIle: 1.764 ± 0.408
2.91ArgLys: 2.91 ± 0.802
4.145ArgLeu: 4.145 ± 0.935
1.146ArgMet: 1.146 ± 0.249
1.852ArgAsn: 1.852 ± 0.392
0.97ArgPro: 0.97 ± 0.322
1.235ArgGln: 1.235 ± 0.425
1.235ArgArg: 1.235 ± 0.323
1.587ArgSer: 1.587 ± 0.405
2.469ArgThr: 2.469 ± 0.606
2.91ArgVal: 2.91 ± 0.687
0.441ArgTrp: 0.441 ± 0.187
1.94ArgTyr: 1.94 ± 0.611
0.0ArgXaa: 0.0 ± 0.0
Ser
6.173SerAla: 6.173 ± 2.333
0.176SerCys: 0.176 ± 0.127
2.469SerAsp: 2.469 ± 0.479
4.056SerGlu: 4.056 ± 0.696
2.646SerPhe: 2.646 ± 0.456
5.026SerGly: 5.026 ± 0.866
0.705SerHis: 0.705 ± 0.192
4.233SerIle: 4.233 ± 0.687
3.968SerLys: 3.968 ± 0.528
5.908SerLeu: 5.908 ± 0.833
1.852SerMet: 1.852 ± 0.576
3.086SerAsn: 3.086 ± 0.522
1.587SerPro: 1.587 ± 0.283
3.704SerGln: 3.704 ± 0.92
1.764SerArg: 1.764 ± 0.603
4.497SerSer: 4.497 ± 1.249
3.792SerThr: 3.792 ± 0.952
3.527SerVal: 3.527 ± 0.473
0.705SerTrp: 0.705 ± 0.213
1.764SerTyr: 1.764 ± 0.498
0.0SerXaa: 0.0 ± 0.0
Thr
7.319ThrAla: 7.319 ± 1.558
0.088ThrCys: 0.088 ± 0.092
3.616ThrAsp: 3.616 ± 0.655
4.586ThrGlu: 4.586 ± 0.564
2.116ThrPhe: 2.116 ± 0.497
3.88ThrGly: 3.88 ± 0.575
1.058ThrHis: 1.058 ± 0.391
4.586ThrIle: 4.586 ± 0.935
3.351ThrLys: 3.351 ± 0.618
5.82ThrLeu: 5.82 ± 0.691
2.028ThrMet: 2.028 ± 0.692
3.086ThrAsn: 3.086 ± 0.626
3.792ThrPro: 3.792 ± 0.742
3.175ThrGln: 3.175 ± 0.625
2.205ThrArg: 2.205 ± 0.581
3.968ThrSer: 3.968 ± 1.181
5.115ThrThr: 5.115 ± 0.819
5.203ThrVal: 5.203 ± 0.65
0.617ThrTrp: 0.617 ± 0.175
2.557ThrTyr: 2.557 ± 0.413
0.0ThrXaa: 0.0 ± 0.0
Val
6.173ValAla: 6.173 ± 0.85
0.441ValCys: 0.441 ± 0.237
3.527ValAsp: 3.527 ± 0.525
4.586ValGlu: 4.586 ± 0.793
2.381ValPhe: 2.381 ± 0.469
4.674ValGly: 4.674 ± 0.755
0.617ValHis: 0.617 ± 0.194
4.85ValIle: 4.85 ± 0.67
5.556ValLys: 5.556 ± 0.681
4.233ValLeu: 4.233 ± 0.614
1.235ValMet: 1.235 ± 0.33
3.263ValAsn: 3.263 ± 0.414
1.852ValPro: 1.852 ± 0.388
2.205ValGln: 2.205 ± 0.571
1.94ValArg: 1.94 ± 0.358
5.026ValSer: 5.026 ± 0.757
4.938ValThr: 4.938 ± 0.661
6.173ValVal: 6.173 ± 0.778
0.705ValTrp: 0.705 ± 0.293
1.94ValTyr: 1.94 ± 0.487
0.0ValXaa: 0.0 ± 0.0
Trp
0.441TrpAla: 0.441 ± 0.194
0.0TrpCys: 0.0 ± 0.0
1.146TrpAsp: 1.146 ± 0.383
0.705TrpGlu: 0.705 ± 0.233
0.794TrpPhe: 0.794 ± 0.245
0.529TrpGly: 0.529 ± 0.202
0.265TrpHis: 0.265 ± 0.118
0.617TrpIle: 0.617 ± 0.184
1.146TrpLys: 1.146 ± 0.314
0.794TrpLeu: 0.794 ± 0.235
0.265TrpMet: 0.265 ± 0.184
0.441TrpAsn: 0.441 ± 0.131
0.176TrpPro: 0.176 ± 0.127
0.617TrpGln: 0.617 ± 0.187
0.265TrpArg: 0.265 ± 0.132
0.794TrpSer: 0.794 ± 0.236
1.323TrpThr: 1.323 ± 0.37
0.794TrpVal: 0.794 ± 0.352
0.353TrpTrp: 0.353 ± 0.178
0.794TrpTyr: 0.794 ± 0.273
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.205TyrAla: 2.205 ± 0.408
0.265TyrCys: 0.265 ± 0.157
2.734TyrAsp: 2.734 ± 0.628
2.381TyrGlu: 2.381 ± 0.499
1.94TyrPhe: 1.94 ± 0.529
1.587TyrGly: 1.587 ± 0.479
0.705TyrHis: 0.705 ± 0.313
2.293TyrIle: 2.293 ± 0.314
2.293TyrLys: 2.293 ± 0.481
3.175TyrLeu: 3.175 ± 0.827
0.529TyrMet: 0.529 ± 0.177
1.94TyrAsn: 1.94 ± 0.505
0.97TyrPro: 0.97 ± 0.345
2.646TyrGln: 2.646 ± 0.711
1.499TyrArg: 1.499 ± 0.51
2.205TyrSer: 2.205 ± 0.528
1.675TyrThr: 1.675 ± 0.455
2.205TyrVal: 2.205 ± 0.551
0.617TyrTrp: 0.617 ± 0.237
1.587TyrTyr: 1.587 ± 0.437
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 41 proteins (11341 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski