Amino acid dipeptide frequency for Streptococcus phage Javan385

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
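As an illustration, the counting could be sketched as follows. This is a minimal sketch, not the database's own code: the function name and the toy sequences are hypothetical, one-letter codes are used instead of the three-letter codes in the table, and the normalisation by total residue count is an assumption (the tabulated values appear consistent with it, since 0.079‰ of 12609 residues is about one occurrence).

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: counts are divided by the total number of residues.
    """
    counts = Counter()
    total_residues = 0
    for seq in proteins:
        total_residues += len(seq)
        # Overlapping adjacent pairs inside one protein;
        # a pair never spans the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return {pair: 1000 * n / total_residues for pair, n in counts.items()}


# Hypothetical toy proteome, only for demonstration.
toy_proteome = ["MKKLA", "MAKK"]
freqs = dipeptide_per_mille(toy_proteome)
print(sorted(freqs))
```

Here the pair KK occurs twice among 9 residues, so its frequency is 2000/9 per mille.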

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
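The arithmetic behind the uniform expectation can be checked with a short sketch. The `label` helper and its simple comparison against the uniform baseline are assumptions for illustration; the page does not state the exact rule used for the red and blue marking.

```python
N_STANDARD = 20

n_dipeptides = N_STANDARD ** 2           # 400 ordered pairs
expected_percent = 100 / n_dipeptides    # 0.25 %
expected_per_mille = 1000 / n_dipeptides # 2.5 per mille


def label(value_per_mille, baseline=expected_per_mille):
    """Classify a frequency against the uniform baseline (hypothetical rule)."""
    if value_per_mille > baseline:
        return "over"
    if value_per_mille < baseline:
        return "under"
    return "expected"


print(label(8.487))  # GluLeu value from the table
print(label(0.635))  # ProPro value from the table
```

With 21 symbols (including Xaa) the same calculation gives 441 pairs.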

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
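A small sketch of the unit conversion, using the AlaAla value from the table. The helper names are hypothetical, and turning a frequency back into an approximate count assumes that the values were normalised by the 12609 amino acids reported in the statistics line, which the page does not state explicitly.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value / 1000


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10


TOTAL_RESIDUES = 12609  # from the statistics line of this page

ala_ala = 1.745  # AlaAla frequency in per mille
fraction = per_mille_to_fraction(ala_ala)
approx_count = round(fraction * TOTAL_RESIDUES)  # assumed normalisation

print(fraction, per_mille_to_percent(ala_ala), approx_count)
```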

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.745 ± 0.54
AlaCys: 0.238 ± 0.14
AlaAsp: 3.966 ± 0.558
AlaGlu: 4.759 ± 0.559
AlaPhe: 2.697 ± 0.465
AlaGly: 4.521 ± 1.042
AlaHis: 0.714 ± 0.287
AlaIle: 5.473 ± 0.686
AlaLys: 6.821 ± 0.892
AlaLeu: 5.711 ± 0.881
AlaMet: 2.221 ± 0.5
AlaAsn: 3.49 ± 0.463
AlaPro: 1.348 ± 0.425
AlaGln: 2.697 ± 0.526
AlaArg: 2.141 ± 0.43
AlaSer: 4.997 ± 1.081
AlaThr: 4.442 ± 0.81
AlaVal: 4.521 ± 0.781
AlaTrp: 0.872 ± 0.32
AlaTyr: 2.935 ± 0.54
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.079 ± 0.072
CysCys: 0.0 ± 0.0
CysAsp: 0.238 ± 0.145
CysGlu: 0.476 ± 0.215
CysPhe: 0.317 ± 0.188
CysGly: 0.317 ± 0.155
CysHis: 0.079 ± 0.081
CysIle: 0.476 ± 0.232
CysLys: 0.555 ± 0.231
CysLeu: 0.476 ± 0.225
CysMet: 0.0 ± 0.0
CysAsn: 0.079 ± 0.068
CysPro: 0.317 ± 0.156
CysGln: 0.079 ± 0.084
CysArg: 0.238 ± 0.153
CysSer: 0.159 ± 0.116
CysThr: 0.317 ± 0.163
CysVal: 0.159 ± 0.089
CysTrp: 0.159 ± 0.103
CysTyr: 0.238 ± 0.118
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.935 ± 0.607
AspCys: 0.317 ± 0.214
AspAsp: 4.521 ± 0.676
AspGlu: 4.918 ± 0.875
AspPhe: 4.442 ± 0.496
AspGly: 5.076 ± 0.619
AspHis: 0.555 ± 0.223
AspIle: 5.711 ± 0.892
AspLys: 4.759 ± 0.642
AspLeu: 4.997 ± 0.726
AspMet: 1.666 ± 0.296
AspAsn: 2.617 ± 0.472
AspPro: 2.697 ± 0.331
AspGln: 1.428 ± 0.359
AspArg: 2.459 ± 0.437
AspSer: 3.886 ± 0.468
AspThr: 3.648 ± 0.521
AspVal: 4.521 ± 0.685
AspTrp: 1.11 ± 0.258
AspTyr: 3.331 ± 0.477
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.838 ± 0.748
GluCys: 0.397 ± 0.167
GluAsp: 3.807 ± 0.657
GluGlu: 4.68 ± 0.785
GluPhe: 1.983 ± 0.441
GluGly: 2.538 ± 0.483
GluHis: 0.872 ± 0.287
GluIle: 6.107 ± 0.799
GluLys: 6.107 ± 0.839
GluLeu: 8.487 ± 0.916
GluMet: 2.141 ± 0.395
GluAsn: 4.204 ± 0.519
GluPro: 1.983 ± 0.454
GluGln: 2.697 ± 0.528
GluArg: 2.538 ± 0.433
GluSer: 3.093 ± 0.434
GluThr: 4.362 ± 0.56
GluVal: 4.362 ± 0.575
GluTrp: 0.872 ± 0.295
GluTyr: 2.776 ± 0.578
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.855 ± 0.493
PheCys: 0.238 ± 0.134
PheAsp: 4.045 ± 0.533
PheGlu: 2.776 ± 0.432
PhePhe: 1.19 ± 0.251
PheGly: 3.331 ± 0.488
PheHis: 0.555 ± 0.259
PheIle: 3.014 ± 0.661
PheLys: 4.045 ± 0.524
PheLeu: 3.252 ± 0.582
PheMet: 0.714 ± 0.272
PheAsn: 3.014 ± 0.466
PhePro: 0.635 ± 0.158
PheGln: 0.793 ± 0.233
PheArg: 1.666 ± 0.271
PheSer: 2.697 ± 0.405
PheThr: 2.221 ± 0.524
PheVal: 2.538 ± 0.427
PheTrp: 0.555 ± 0.183
PheTyr: 1.745 ± 0.464
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.807 ± 0.727
GlyCys: 0.238 ± 0.183
GlyAsp: 3.49 ± 0.567
GlyGlu: 3.886 ± 0.461
GlyPhe: 3.014 ± 0.564
GlyGly: 5.235 ± 0.915
GlyHis: 1.19 ± 0.411
GlyIle: 4.442 ± 0.756
GlyLys: 6.028 ± 0.554
GlyLeu: 6.345 ± 0.735
GlyMet: 2.3 ± 0.418
GlyAsn: 3.728 ± 0.737
GlyPro: 0.555 ± 0.185
GlyGln: 2.935 ± 0.646
GlyArg: 2.617 ± 0.514
GlySer: 4.283 ± 0.83
GlyThr: 4.204 ± 0.643
GlyVal: 3.49 ± 0.624
GlyTrp: 1.269 ± 0.277
GlyTyr: 2.221 ± 0.397
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.952 ± 0.277
HisCys: 0.159 ± 0.109
HisAsp: 1.11 ± 0.374
HisGlu: 1.11 ± 0.263
HisPhe: 0.397 ± 0.192
HisGly: 0.872 ± 0.235
HisHis: 0.317 ± 0.161
HisIle: 0.952 ± 0.254
HisLys: 1.11 ± 0.268
HisLeu: 1.348 ± 0.33
HisMet: 0.476 ± 0.202
HisAsn: 0.952 ± 0.341
HisPro: 0.635 ± 0.195
HisGln: 0.397 ± 0.175
HisArg: 0.317 ± 0.168
HisSer: 0.872 ± 0.327
HisThr: 0.952 ± 0.284
HisVal: 0.872 ± 0.334
HisTrp: 0.159 ± 0.141
HisTyr: 0.397 ± 0.177
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.997 ± 0.676
IleCys: 0.476 ± 0.181
IleAsp: 6.424 ± 0.755
IleGlu: 5.155 ± 0.716
IlePhe: 2.3 ± 0.56
IleGly: 4.045 ± 0.601
IleHis: 1.269 ± 0.28
IleIle: 4.442 ± 0.553
IleLys: 7.852 ± 0.774
IleLeu: 4.6 ± 0.59
IleMet: 1.666 ± 0.337
IleAsn: 4.521 ± 0.53
IlePro: 3.173 ± 0.526
IleGln: 2.617 ± 0.329
IleArg: 2.697 ± 0.564
IleSer: 4.6 ± 0.818
IleThr: 4.204 ± 0.602
IleVal: 3.966 ± 0.709
IleTrp: 0.635 ± 0.194
IleTyr: 1.983 ± 0.491
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.266 ± 0.789
LysCys: 0.238 ± 0.129
LysAsp: 4.918 ± 0.724
LysGlu: 6.583 ± 0.739
LysPhe: 3.728 ± 0.79
LysGly: 3.728 ± 0.515
LysHis: 0.872 ± 0.273
LysIle: 6.821 ± 0.605
LysLys: 6.583 ± 0.852
LysLeu: 5.711 ± 0.65
LysMet: 1.983 ± 0.411
LysAsn: 4.997 ± 0.691
LysPro: 2.855 ± 0.435
LysGln: 4.283 ± 0.538
LysArg: 3.331 ± 0.534
LysSer: 5.552 ± 0.731
LysThr: 5.79 ± 0.711
LysVal: 5.393 ± 0.917
LysTrp: 1.586 ± 0.302
LysTyr: 3.411 ± 0.486
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.107 ± 0.907
LeuCys: 0.397 ± 0.181
LeuAsp: 5.631 ± 0.822
LeuGlu: 6.345 ± 1.177
LeuPhe: 3.014 ± 0.538
LeuGly: 4.6 ± 0.622
LeuHis: 1.031 ± 0.307
LeuIle: 4.838 ± 0.429
LeuLys: 7.376 ± 1.108
LeuLeu: 6.98 ± 0.958
LeuMet: 2.141 ± 0.394
LeuAsn: 6.662 ± 0.59
LeuPro: 2.459 ± 0.476
LeuGln: 3.252 ± 0.54
LeuArg: 2.617 ± 0.44
LeuSer: 6.345 ± 0.777
LeuThr: 7.297 ± 1.055
LeuVal: 4.918 ± 0.572
LeuTrp: 0.952 ± 0.337
LeuTyr: 1.824 ± 0.405
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.062 ± 0.43
MetCys: 0.159 ± 0.127
MetAsp: 0.872 ± 0.312
MetGlu: 1.666 ± 0.375
MetPhe: 1.11 ± 0.273
MetGly: 1.428 ± 0.284
MetHis: 0.238 ± 0.155
MetIle: 1.348 ± 0.333
MetLys: 2.538 ± 0.463
MetLeu: 2.3 ± 0.456
MetMet: 0.476 ± 0.284
MetAsn: 1.745 ± 0.457
MetPro: 1.269 ± 0.256
MetGln: 0.635 ± 0.245
MetArg: 0.555 ± 0.212
MetSer: 2.459 ± 0.416
MetThr: 2.062 ± 0.451
MetVal: 1.666 ± 0.351
MetTrp: 0.317 ± 0.168
MetTyr: 0.397 ± 0.21
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.521 ± 0.874
AsnCys: 0.079 ± 0.082
AsnAsp: 4.045 ± 0.571
AsnGlu: 2.697 ± 0.445
AsnPhe: 2.697 ± 0.488
AsnGly: 6.028 ± 1.043
AsnHis: 1.428 ± 0.362
AsnIle: 4.838 ± 0.705
AsnLys: 4.442 ± 0.531
AsnLeu: 4.283 ± 0.549
AsnMet: 1.11 ± 0.287
AsnAsn: 4.442 ± 0.78
AsnPro: 2.221 ± 0.438
AsnGln: 2.3 ± 0.336
AsnArg: 1.428 ± 0.295
AsnSer: 4.6 ± 0.54
AsnThr: 2.855 ± 0.563
AsnVal: 3.648 ± 0.537
AsnTrp: 1.031 ± 0.372
AsnTyr: 2.3 ± 0.537
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.904 ± 0.381
ProCys: 0.079 ± 0.072
ProAsp: 1.666 ± 0.433
ProGlu: 2.617 ± 0.501
ProPhe: 1.031 ± 0.266
ProGly: 1.19 ± 0.312
ProHis: 0.397 ± 0.157
ProIle: 1.666 ± 0.316
ProLys: 2.538 ± 0.526
ProLeu: 2.459 ± 0.414
ProMet: 0.555 ± 0.247
ProAsn: 2.379 ± 0.408
ProPro: 0.635 ± 0.192
ProGln: 1.983 ± 0.388
ProArg: 0.714 ± 0.276
ProSer: 2.221 ± 0.515
ProThr: 2.697 ± 0.48
ProVal: 2.141 ± 0.427
ProTrp: 0.317 ± 0.127
ProTyr: 1.11 ± 0.317
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.648 ± 0.544
GlnCys: 0.159 ± 0.127
GlnAsp: 1.983 ± 0.485
GlnGlu: 2.776 ± 0.537
GlnPhe: 2.221 ± 0.419
GlnGly: 2.935 ± 0.378
GlnHis: 0.317 ± 0.153
GlnIle: 2.617 ± 0.473
GlnLys: 2.379 ± 0.372
GlnLeu: 3.411 ± 0.524
GlnMet: 1.428 ± 0.387
GlnAsn: 2.3 ± 0.446
GlnPro: 1.031 ± 0.204
GlnGln: 1.983 ± 0.478
GlnArg: 1.031 ± 0.242
GlnSer: 2.379 ± 0.481
GlnThr: 2.221 ± 0.334
GlnVal: 1.983 ± 0.412
GlnTrp: 0.476 ± 0.191
GlnTyr: 1.348 ± 0.268
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.062 ± 0.391
ArgCys: 0.079 ± 0.072
ArgAsp: 2.221 ± 0.49
ArgGlu: 2.776 ± 0.564
ArgPhe: 1.428 ± 0.351
ArgGly: 1.745 ± 0.385
ArgHis: 0.714 ± 0.281
ArgIle: 2.3 ± 0.455
ArgLys: 2.379 ± 0.5
ArgLeu: 3.49 ± 0.599
ArgMet: 0.952 ± 0.308
ArgAsn: 2.141 ± 0.393
ArgPro: 1.031 ± 0.309
ArgGln: 1.666 ± 0.269
ArgArg: 1.031 ± 0.279
ArgSer: 1.586 ± 0.34
ArgThr: 1.666 ± 0.37
ArgVal: 1.824 ± 0.464
ArgTrp: 0.555 ± 0.208
ArgTyr: 1.666 ± 0.269
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.314 ± 0.69
SerCys: 0.159 ± 0.12
SerAsp: 4.759 ± 0.645
SerGlu: 4.759 ± 0.481
SerPhe: 2.459 ± 0.451
SerGly: 5.473 ± 0.732
SerHis: 1.031 ± 0.37
SerIle: 4.521 ± 0.69
SerLys: 5.552 ± 0.846
SerLeu: 5.393 ± 0.784
SerMet: 1.824 ± 0.35
SerAsn: 4.6 ± 0.679
SerPro: 1.586 ± 0.269
SerGln: 2.062 ± 0.35
SerArg: 2.062 ± 0.345
SerSer: 4.68 ± 0.566
SerThr: 4.997 ± 1.109
SerVal: 3.49 ± 0.528
SerTrp: 0.714 ± 0.214
SerTyr: 2.459 ± 0.447
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.918 ± 0.743
ThrCys: 0.397 ± 0.169
ThrAsp: 3.331 ± 0.582
ThrGlu: 3.252 ± 0.511
ThrPhe: 3.331 ± 0.523
ThrGly: 4.997 ± 0.717
ThrHis: 1.031 ± 0.285
ThrIle: 5.314 ± 0.904
ThrLys: 5.393 ± 0.664
ThrLeu: 5.155 ± 1.082
ThrMet: 1.428 ± 0.354
ThrAsn: 3.569 ± 0.691
ThrPro: 1.269 ± 0.326
ThrGln: 2.221 ± 0.448
ThrArg: 1.904 ± 0.44
ThrSer: 4.997 ± 0.885
ThrThr: 3.093 ± 0.586
ThrVal: 4.283 ± 0.727
ThrTrp: 1.11 ± 0.323
ThrTyr: 3.252 ± 0.603
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.6 ± 0.648
ValCys: 0.317 ± 0.175
ValAsp: 4.6 ± 0.653
ValGlu: 4.6 ± 0.703
ValPhe: 2.379 ± 0.399
ValGly: 4.283 ± 0.814
ValHis: 0.872 ± 0.333
ValIle: 3.569 ± 0.494
ValLys: 3.886 ± 0.732
ValLeu: 4.6 ± 0.571
ValMet: 1.19 ± 0.301
ValAsn: 3.411 ± 0.471
ValPro: 2.3 ± 0.464
ValGln: 2.221 ± 0.432
ValArg: 1.824 ± 0.542
ValSer: 4.68 ± 0.7
ValThr: 3.648 ± 0.637
ValVal: 3.093 ± 0.564
ValTrp: 0.555 ± 0.246
ValTyr: 2.538 ± 0.464
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.317 ± 0.175
TrpCys: 0.079 ± 0.075
TrpAsp: 1.031 ± 0.335
TrpGlu: 0.952 ± 0.299
TrpPhe: 0.476 ± 0.191
TrpGly: 0.635 ± 0.204
TrpHis: 0.317 ± 0.162
TrpIle: 0.714 ± 0.196
TrpLys: 1.269 ± 0.341
TrpLeu: 2.062 ± 0.362
TrpMet: 0.079 ± 0.068
TrpAsn: 0.635 ± 0.234
TrpPro: 0.476 ± 0.241
TrpGln: 0.555 ± 0.236
TrpArg: 0.714 ± 0.263
TrpSer: 1.031 ± 0.33
TrpThr: 1.11 ± 0.316
TrpVal: 0.793 ± 0.209
TrpTrp: 0.238 ± 0.164
TrpTyr: 0.317 ± 0.182
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.379 ± 0.383
TyrCys: 0.555 ± 0.225
TyrAsp: 2.935 ± 0.615
TyrGlu: 2.062 ± 0.413
TyrPhe: 1.666 ± 0.303
TyrGly: 2.3 ± 0.464
TyrHis: 0.555 ± 0.217
TyrIle: 2.459 ± 0.454
TyrLys: 3.014 ± 0.476
TyrLeu: 3.728 ± 0.492
TyrMet: 0.872 ± 0.297
TyrAsn: 1.507 ± 0.358
TyrPro: 1.824 ± 0.45
TyrGln: 1.904 ± 0.387
TyrArg: 1.428 ± 0.423
TyrSer: 2.776 ± 0.537
TyrThr: 2.459 ± 0.667
TyrVal: 1.507 ± 0.339
TyrTrp: 0.317 ± 0.143
TyrTyr: 2.062 ± 0.445
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (12609 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
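One way a protein-level bootstrap could work is sketched below. The exact procedure used for this page is not described, so everything here is an assumption: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The function names, the seed and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille (assumed: divided by residue count)."""
    total = sum(len(seq) for seq in proteins)
    hits = sum(seq[i:i + 2] == pair
               for seq in proteins
               for i in range(len(seq) - 1))
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes), only for demonstration.
toy_proteome = ["MKKLA", "AAAA", "MAKK"]
error = bootstrap_error(toy_proteome, "KK")
print(f"KK: {pair_per_mille(toy_proteome, 'KK'):.1f} ± {error:.1f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins.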

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski