Amino acid dipepetide frequency for Streptococcus phage Javan254

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.974AlaAla: 6.974 ± 1.825
0.441AlaCys: 0.441 ± 0.198
4.59AlaAsp: 4.59 ± 1.267
4.944AlaGlu: 4.944 ± 0.953
3.178AlaPhe: 3.178 ± 0.955
6.621AlaGly: 6.621 ± 1.402
0.706AlaHis: 0.706 ± 0.276
5.915AlaIle: 5.915 ± 1.038
5.208AlaLys: 5.208 ± 0.763
8.121AlaLeu: 8.121 ± 1.373
2.648AlaMet: 2.648 ± 0.699
3.708AlaAsn: 3.708 ± 0.643
2.03AlaPro: 2.03 ± 0.351
2.737AlaGln: 2.737 ± 0.664
2.295AlaArg: 2.295 ± 0.515
4.061AlaSer: 4.061 ± 1.021
4.414AlaThr: 4.414 ± 0.593
5.915AlaVal: 5.915 ± 1.465
0.618AlaTrp: 0.618 ± 0.252
2.913AlaTyr: 2.913 ± 0.737
0.0AlaXaa: 0.0 ± 0.0
Cys
0.265CysAla: 0.265 ± 0.164
0.088CysCys: 0.088 ± 0.093
0.353CysAsp: 0.353 ± 0.168
0.441CysGlu: 0.441 ± 0.195
0.353CysPhe: 0.353 ± 0.185
0.265CysGly: 0.265 ± 0.146
0.177CysHis: 0.177 ± 0.135
0.0CysIle: 0.0 ± 0.0
0.353CysLys: 0.353 ± 0.246
0.177CysLeu: 0.177 ± 0.125
0.088CysMet: 0.088 ± 0.089
0.177CysAsn: 0.177 ± 0.134
0.265CysPro: 0.265 ± 0.221
0.177CysGln: 0.177 ± 0.109
0.0CysArg: 0.0 ± 0.0
0.265CysSer: 0.265 ± 0.151
0.177CysThr: 0.177 ± 0.104
0.441CysVal: 0.441 ± 0.158
0.088CysTrp: 0.088 ± 0.098
0.265CysTyr: 0.265 ± 0.12
0.0CysXaa: 0.0 ± 0.0
Asp
4.326AspAla: 4.326 ± 0.607
0.088AspCys: 0.088 ± 0.09
4.59AspAsp: 4.59 ± 0.855
5.208AspGlu: 5.208 ± 0.982
3.884AspPhe: 3.884 ± 0.715
6.268AspGly: 6.268 ± 1.178
0.353AspHis: 0.353 ± 0.17
4.149AspIle: 4.149 ± 0.676
4.59AspLys: 4.59 ± 0.665
3.884AspLeu: 3.884 ± 0.54
2.207AspMet: 2.207 ± 0.529
4.061AspAsn: 4.061 ± 0.674
1.236AspPro: 1.236 ± 0.328
1.412AspGln: 1.412 ± 0.257
2.295AspArg: 2.295 ± 0.429
3.884AspSer: 3.884 ± 0.54
3.178AspThr: 3.178 ± 0.538
3.796AspVal: 3.796 ± 0.624
0.441AspTrp: 0.441 ± 0.242
3.884AspTyr: 3.884 ± 0.708
0.0AspXaa: 0.0 ± 0.0
Glu
4.061GluAla: 4.061 ± 0.62
0.441GluCys: 0.441 ± 0.228
3.266GluAsp: 3.266 ± 0.682
3.708GluGlu: 3.708 ± 0.807
2.737GluPhe: 2.737 ± 0.61
2.383GluGly: 2.383 ± 0.452
0.53GluHis: 0.53 ± 0.194
5.12GluIle: 5.12 ± 1.014
5.032GluLys: 5.032 ± 0.734
6.709GluLeu: 6.709 ± 1.074
1.766GluMet: 1.766 ± 0.497
4.502GluAsn: 4.502 ± 0.683
1.677GluPro: 1.677 ± 0.611
3.796GluGln: 3.796 ± 0.675
4.414GluArg: 4.414 ± 0.776
2.56GluSer: 2.56 ± 0.488
3.884GluThr: 3.884 ± 0.615
4.149GluVal: 4.149 ± 0.649
0.706GluTrp: 0.706 ± 0.201
2.648GluTyr: 2.648 ± 0.547
0.0GluXaa: 0.0 ± 0.0
Phe
2.383PheAla: 2.383 ± 0.492
0.353PheCys: 0.353 ± 0.181
3.884PheAsp: 3.884 ± 0.635
3.355PheGlu: 3.355 ± 0.874
1.148PhePhe: 1.148 ± 0.348
3.178PheGly: 3.178 ± 0.697
1.059PheHis: 1.059 ± 0.283
2.825PheIle: 2.825 ± 0.484
3.178PheLys: 3.178 ± 0.547
2.207PheLeu: 2.207 ± 0.431
1.236PheMet: 1.236 ± 0.312
2.383PheAsn: 2.383 ± 0.438
0.53PhePro: 0.53 ± 0.16
1.589PheGln: 1.589 ± 0.513
1.677PheArg: 1.677 ± 0.382
2.913PheSer: 2.913 ± 0.529
3.09PheThr: 3.09 ± 0.647
2.648PheVal: 2.648 ± 0.543
0.265PheTrp: 0.265 ± 0.138
1.589PheTyr: 1.589 ± 0.452
0.0PheXaa: 0.0 ± 0.0
Gly
5.385GlyAla: 5.385 ± 1.615
0.177GlyCys: 0.177 ± 0.125
3.708GlyAsp: 3.708 ± 0.838
3.972GlyGlu: 3.972 ± 0.574
3.355GlyPhe: 3.355 ± 0.534
4.767GlyGly: 4.767 ± 0.654
1.148GlyHis: 1.148 ± 0.321
5.826GlyIle: 5.826 ± 0.771
5.65GlyLys: 5.65 ± 0.66
6.356GlyLeu: 6.356 ± 1.329
2.119GlyMet: 2.119 ± 0.626
3.355GlyAsn: 3.355 ± 0.66
0.353GlyPro: 0.353 ± 0.171
3.708GlyGln: 3.708 ± 0.676
3.266GlyArg: 3.266 ± 0.804
4.855GlySer: 4.855 ± 1.401
5.297GlyThr: 5.297 ± 0.834
5.738GlyVal: 5.738 ± 0.881
0.883GlyTrp: 0.883 ± 0.294
2.648GlyTyr: 2.648 ± 0.65
0.0GlyXaa: 0.0 ± 0.0
His
0.971HisAla: 0.971 ± 0.369
0.088HisCys: 0.088 ± 0.075
0.618HisAsp: 0.618 ± 0.263
0.706HisGlu: 0.706 ± 0.232
0.706HisPhe: 0.706 ± 0.282
0.883HisGly: 0.883 ± 0.25
0.088HisHis: 0.088 ± 0.073
1.412HisIle: 1.412 ± 0.309
0.441HisLys: 0.441 ± 0.289
0.794HisLeu: 0.794 ± 0.317
0.177HisMet: 0.177 ± 0.108
0.883HisAsn: 0.883 ± 0.278
0.618HisPro: 0.618 ± 0.222
0.794HisGln: 0.794 ± 0.281
1.059HisArg: 1.059 ± 0.32
0.883HisSer: 0.883 ± 0.369
0.794HisThr: 0.794 ± 0.299
0.441HisVal: 0.441 ± 0.164
0.265HisTrp: 0.265 ± 0.178
0.353HisTyr: 0.353 ± 0.161
0.0HisXaa: 0.0 ± 0.0
Ile
5.473IleAla: 5.473 ± 0.869
0.177IleCys: 0.177 ± 0.149
5.12IleAsp: 5.12 ± 0.679
5.65IleGlu: 5.65 ± 0.747
2.03IlePhe: 2.03 ± 0.664
4.767IleGly: 4.767 ± 0.545
1.324IleHis: 1.324 ± 0.392
3.619IleIle: 3.619 ± 0.593
6.179IleLys: 6.179 ± 0.733
3.531IleLeu: 3.531 ± 0.649
0.883IleMet: 0.883 ± 0.288
4.59IleAsn: 4.59 ± 0.71
3.09IlePro: 3.09 ± 0.774
3.531IleGln: 3.531 ± 0.557
4.237IleArg: 4.237 ± 0.681
4.061IleSer: 4.061 ± 0.439
4.414IleThr: 4.414 ± 0.658
4.061IleVal: 4.061 ± 0.695
0.618IleTrp: 0.618 ± 0.244
1.766IleTyr: 1.766 ± 0.433
0.0IleXaa: 0.0 ± 0.0
Lys
5.738LysAla: 5.738 ± 0.816
0.265LysCys: 0.265 ± 0.128
4.326LysAsp: 4.326 ± 0.687
5.032LysGlu: 5.032 ± 0.816
3.001LysPhe: 3.001 ± 0.531
5.12LysGly: 5.12 ± 0.921
0.971LysHis: 0.971 ± 0.291
6.356LysIle: 6.356 ± 0.808
7.062LysLys: 7.062 ± 0.956
5.12LysLeu: 5.12 ± 0.778
2.56LysMet: 2.56 ± 0.491
3.708LysAsn: 3.708 ± 0.402
2.648LysPro: 2.648 ± 0.593
2.56LysGln: 2.56 ± 0.585
3.619LysArg: 3.619 ± 0.615
4.679LysSer: 4.679 ± 0.568
4.237LysThr: 4.237 ± 0.752
4.502LysVal: 4.502 ± 0.676
0.971LysTrp: 0.971 ± 0.237
3.531LysTyr: 3.531 ± 0.614
0.0LysXaa: 0.0 ± 0.0
Leu
5.65LeuAla: 5.65 ± 0.971
0.353LeuCys: 0.353 ± 0.173
5.561LeuAsp: 5.561 ± 0.7
5.826LeuGlu: 5.826 ± 0.882
2.207LeuPhe: 2.207 ± 0.394
6.356LeuGly: 6.356 ± 1.03
0.971LeuHis: 0.971 ± 0.336
3.884LeuIle: 3.884 ± 0.536
7.415LeuLys: 7.415 ± 0.98
3.884LeuLeu: 3.884 ± 0.726
1.677LeuMet: 1.677 ± 0.404
5.032LeuAsn: 5.032 ± 0.6
2.825LeuPro: 2.825 ± 0.403
3.796LeuGln: 3.796 ± 0.541
1.942LeuArg: 1.942 ± 0.405
6.179LeuSer: 6.179 ± 0.715
4.679LeuThr: 4.679 ± 0.461
4.855LeuVal: 4.855 ± 0.755
0.441LeuTrp: 0.441 ± 0.245
2.825LeuTyr: 2.825 ± 0.574
0.0LeuXaa: 0.0 ± 0.0
Met
2.913MetAla: 2.913 ± 0.795
0.265MetCys: 0.265 ± 0.163
1.412MetAsp: 1.412 ± 0.307
1.324MetGlu: 1.324 ± 0.329
0.53MetPhe: 0.53 ± 0.266
1.501MetGly: 1.501 ± 0.288
0.088MetHis: 0.088 ± 0.075
1.766MetIle: 1.766 ± 0.369
2.383MetLys: 2.383 ± 0.464
1.854MetLeu: 1.854 ± 0.327
0.883MetMet: 0.883 ± 0.41
1.148MetAsn: 1.148 ± 0.313
1.059MetPro: 1.059 ± 0.332
1.324MetGln: 1.324 ± 0.399
1.589MetArg: 1.589 ± 0.431
1.324MetSer: 1.324 ± 0.364
1.766MetThr: 1.766 ± 0.44
1.324MetVal: 1.324 ± 0.258
0.441MetTrp: 0.441 ± 0.178
0.618MetTyr: 0.618 ± 0.248
0.0MetXaa: 0.0 ± 0.0
Asn
4.59AsnAla: 4.59 ± 0.543
0.177AsnCys: 0.177 ± 0.11
2.913AsnAsp: 2.913 ± 0.572
3.178AsnGlu: 3.178 ± 0.672
2.295AsnPhe: 2.295 ± 0.567
4.502AsnGly: 4.502 ± 0.791
0.706AsnHis: 0.706 ± 0.262
3.884AsnIle: 3.884 ± 0.673
3.619AsnLys: 3.619 ± 0.525
4.061AsnLeu: 4.061 ± 0.747
1.059AsnMet: 1.059 ± 0.332
3.619AsnAsn: 3.619 ± 0.629
2.825AsnPro: 2.825 ± 0.523
2.295AsnGln: 2.295 ± 0.553
1.942AsnArg: 1.942 ± 0.374
3.972AsnSer: 3.972 ± 0.576
2.913AsnThr: 2.913 ± 0.478
3.443AsnVal: 3.443 ± 0.595
0.794AsnTrp: 0.794 ± 0.28
1.324AsnTyr: 1.324 ± 0.363
0.0AsnXaa: 0.0 ± 0.0
Pro
2.03ProAla: 2.03 ± 0.343
0.353ProCys: 0.353 ± 0.212
2.207ProAsp: 2.207 ± 0.531
2.295ProGlu: 2.295 ± 0.434
1.148ProPhe: 1.148 ± 0.375
1.412ProGly: 1.412 ± 0.375
0.177ProHis: 0.177 ± 0.132
1.589ProIle: 1.589 ± 0.461
1.854ProLys: 1.854 ± 0.517
2.295ProLeu: 2.295 ± 0.439
0.794ProMet: 0.794 ± 0.224
1.236ProAsn: 1.236 ± 0.311
0.971ProPro: 0.971 ± 0.266
1.324ProGln: 1.324 ± 0.323
1.148ProArg: 1.148 ± 0.377
2.295ProSer: 2.295 ± 0.451
2.207ProThr: 2.207 ± 0.414
2.03ProVal: 2.03 ± 0.468
0.177ProTrp: 0.177 ± 0.136
2.207ProTyr: 2.207 ± 0.4
0.0ProXaa: 0.0 ± 0.0
Gln
3.972GlnAla: 3.972 ± 0.627
0.177GlnCys: 0.177 ± 0.145
2.03GlnAsp: 2.03 ± 0.424
2.383GlnGlu: 2.383 ± 0.584
1.148GlnPhe: 1.148 ± 0.244
4.149GlnGly: 4.149 ± 0.892
0.353GlnHis: 0.353 ± 0.182
2.207GlnIle: 2.207 ± 0.358
2.737GlnLys: 2.737 ± 0.588
3.708GlnLeu: 3.708 ± 0.418
0.971GlnMet: 0.971 ± 0.336
2.207GlnAsn: 2.207 ± 0.325
1.324GlnPro: 1.324 ± 0.347
2.913GlnGln: 2.913 ± 0.706
1.854GlnArg: 1.854 ± 0.491
3.619GlnSer: 3.619 ± 0.688
2.472GlnThr: 2.472 ± 0.502
2.207GlnVal: 2.207 ± 0.332
0.53GlnTrp: 0.53 ± 0.205
2.295GlnTyr: 2.295 ± 0.431
0.0GlnXaa: 0.0 ± 0.0
Arg
3.178ArgAla: 3.178 ± 0.685
0.088ArgCys: 0.088 ± 0.092
2.383ArgAsp: 2.383 ± 0.319
2.03ArgGlu: 2.03 ± 0.426
1.766ArgPhe: 1.766 ± 0.359
3.001ArgGly: 3.001 ± 0.905
0.794ArgHis: 0.794 ± 0.261
2.295ArgIle: 2.295 ± 0.559
3.619ArgLys: 3.619 ± 0.605
4.414ArgLeu: 4.414 ± 0.825
1.324ArgMet: 1.324 ± 0.364
2.648ArgAsn: 2.648 ± 0.434
1.148ArgPro: 1.148 ± 0.307
1.942ArgGln: 1.942 ± 0.383
1.766ArgArg: 1.766 ± 0.403
2.207ArgSer: 2.207 ± 0.37
2.472ArgThr: 2.472 ± 0.473
2.648ArgVal: 2.648 ± 0.49
1.059ArgTrp: 1.059 ± 0.354
2.207ArgTyr: 2.207 ± 0.437
0.0ArgXaa: 0.0 ± 0.0
Ser
5.826SerAla: 5.826 ± 1.658
0.177SerCys: 0.177 ± 0.114
4.679SerAsp: 4.679 ± 0.549
3.531SerGlu: 3.531 ± 0.586
3.09SerPhe: 3.09 ± 0.524
4.855SerGly: 4.855 ± 0.989
0.883SerHis: 0.883 ± 0.275
3.972SerIle: 3.972 ± 0.557
4.061SerLys: 4.061 ± 0.508
4.767SerLeu: 4.767 ± 0.525
1.766SerMet: 1.766 ± 0.403
2.648SerAsn: 2.648 ± 0.692
2.03SerPro: 2.03 ± 0.343
2.295SerGln: 2.295 ± 0.489
1.854SerArg: 1.854 ± 0.367
4.326SerSer: 4.326 ± 0.868
5.12SerThr: 5.12 ± 0.697
4.237SerVal: 4.237 ± 0.669
0.971SerTrp: 0.971 ± 0.307
2.56SerTyr: 2.56 ± 0.531
0.0SerXaa: 0.0 ± 0.0
Thr
4.149ThrAla: 4.149 ± 0.899
0.265ThrCys: 0.265 ± 0.138
3.884ThrAsp: 3.884 ± 0.825
3.443ThrGlu: 3.443 ± 0.581
2.913ThrPhe: 2.913 ± 0.41
4.855ThrGly: 4.855 ± 0.69
1.059ThrHis: 1.059 ± 0.302
6.003ThrIle: 6.003 ± 0.756
4.061ThrLys: 4.061 ± 0.678
5.561ThrLeu: 5.561 ± 0.663
1.412ThrMet: 1.412 ± 0.311
2.648ThrAsn: 2.648 ± 0.55
2.913ThrPro: 2.913 ± 0.625
2.56ThrGln: 2.56 ± 0.55
2.03ThrArg: 2.03 ± 0.504
3.619ThrSer: 3.619 ± 0.649
4.237ThrThr: 4.237 ± 0.673
5.826ThrVal: 5.826 ± 0.738
0.794ThrTrp: 0.794 ± 0.248
2.648ThrTyr: 2.648 ± 0.509
0.0ThrXaa: 0.0 ± 0.0
Val
6.621ValAla: 6.621 ± 0.939
0.088ValCys: 0.088 ± 0.078
5.208ValAsp: 5.208 ± 0.705
4.237ValGlu: 4.237 ± 0.546
3.266ValPhe: 3.266 ± 0.502
5.032ValGly: 5.032 ± 1.193
0.618ValHis: 0.618 ± 0.224
4.061ValIle: 4.061 ± 0.527
3.972ValLys: 3.972 ± 0.572
4.59ValLeu: 4.59 ± 0.677
1.059ValMet: 1.059 ± 0.365
2.825ValAsn: 2.825 ± 0.538
1.059ValPro: 1.059 ± 0.302
1.677ValGln: 1.677 ± 0.308
2.383ValArg: 2.383 ± 0.424
4.237ValSer: 4.237 ± 0.56
6.268ValThr: 6.268 ± 0.7
3.884ValVal: 3.884 ± 0.734
0.706ValTrp: 0.706 ± 0.273
2.825ValTyr: 2.825 ± 0.567
0.0ValXaa: 0.0 ± 0.0
Trp
0.706TrpAla: 0.706 ± 0.207
0.0TrpCys: 0.0 ± 0.0
0.618TrpAsp: 0.618 ± 0.211
0.883TrpGlu: 0.883 ± 0.255
0.618TrpPhe: 0.618 ± 0.227
0.971TrpGly: 0.971 ± 0.403
0.265TrpHis: 0.265 ± 0.137
0.794TrpIle: 0.794 ± 0.2
0.618TrpLys: 0.618 ± 0.2
1.059TrpLeu: 1.059 ± 0.349
0.088TrpMet: 0.088 ± 0.094
0.265TrpAsn: 0.265 ± 0.12
0.0TrpPro: 0.0 ± 0.0
0.794TrpGln: 0.794 ± 0.294
0.794TrpArg: 0.794 ± 0.236
1.059TrpSer: 1.059 ± 0.405
0.441TrpThr: 0.441 ± 0.223
0.706TrpVal: 0.706 ± 0.215
0.177TrpTrp: 0.177 ± 0.168
0.53TrpTyr: 0.53 ± 0.241
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.825TyrAla: 2.825 ± 0.642
0.353TyrCys: 0.353 ± 0.175
2.383TyrAsp: 2.383 ± 0.594
1.942TyrGlu: 1.942 ± 0.427
2.119TyrPhe: 2.119 ± 0.508
1.766TyrGly: 1.766 ± 0.365
0.706TyrHis: 0.706 ± 0.302
3.443TyrIle: 3.443 ± 0.529
3.972TyrLys: 3.972 ± 0.585
3.266TyrLeu: 3.266 ± 0.607
0.706TyrMet: 0.706 ± 0.209
2.472TyrAsn: 2.472 ± 0.418
1.148TyrPro: 1.148 ± 0.259
2.03TyrGln: 2.03 ± 0.341
2.825TyrArg: 2.825 ± 0.458
2.648TyrSer: 2.648 ± 0.453
2.825TyrThr: 2.825 ± 0.543
1.766TyrVal: 1.766 ± 0.335
0.441TyrTrp: 0.441 ± 0.178
1.412TyrTyr: 1.412 ± 0.352
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (11329 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski