Amino acid dipeptide frequency for Streptococcus phage Javan190

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred in proteins at random, with all amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
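The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_permille`, the use of one-letter codes, the counting of overlapping pairs within each protein, and the mapping of every non-standard letter to `X` (Xaa) are all hypothetical choices, not the documented Proteome-pI procedure. The toy sequences are made up.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids
UNIFORM_PERMILLE = 1000.0 / 400         # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumptions (not taken from the source): overlapping pairs are
    counted, pairs never span two proteins, and any letter outside
    the 20 standard ones is treated as X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Made-up toy "proteome" with one unknown residue (B is mapped to X).
toy = ["MKKLLA", "MKBA"]
freqs = dipeptide_permille(toy)
for dp, value in sorted(freqs.items()):
    label = "over" if value > UNIFORM_PERMILLE else "under"
    print(dp, round(value, 1), label)
```

In a proteome this small every observed dipeptide is far above 2.5‰; the over/under labels only become informative for real proteomes with thousands of residues.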

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
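As a small worked example of the unit conversion, the following Python sketch converts between per mille, fractions and percent. The helper names are hypothetical; the two input values are the LeuLys entry from the table and the 0.25% random expectation from the text.

```python
def permille_to_fraction(value_permille):
    """Per mille to a plain fraction: multiply by 10^-3."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille to percent: divide by 10."""
    return value_permille / 10.0


def percent_to_permille(value_percent):
    """Percent to per mille: multiply by 10."""
    return value_percent * 10.0


leu_lys = 10.017  # LeuLys value from the table, in per mille
print(permille_to_fraction(leu_lys))  # about 0.010017
print(permille_to_percent(leu_lys))   # about 1.0017 (percent)
print(percent_to_permille(0.25))      # random expectation in per mille
```

On this scale the random expectation of 0.25% corresponds to 2.5‰, so LeuLys occurs roughly four times more often than it would under uniform random pairing.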

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency (‰) ± error, grouped by first residue. Within each group the second residue follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.876 ± 1.166
AlaCys: 0.096 ± 0.085
AlaAsp: 5.49 ± 0.758
AlaGlu: 5.394 ± 1.01
AlaPhe: 2.408 ± 0.494
AlaGly: 5.105 ± 1.294
AlaHis: 0.771 ± 0.324
AlaIle: 7.513 ± 1.087
AlaLys: 5.972 ± 0.954
AlaLeu: 5.876 ± 0.593
AlaMet: 1.445 ± 0.407
AlaAsn: 4.431 ± 0.56
AlaPro: 1.06 ± 0.304
AlaGln: 2.697 ± 0.507
AlaArg: 2.793 ± 0.622
AlaSer: 5.009 ± 0.817
AlaThr: 3.468 ± 0.686
AlaVal: 4.142 ± 0.713
AlaTrp: 0.867 ± 0.289
AlaTyr: 2.408 ± 0.31
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.289 ± 0.152
CysCys: 0.289 ± 0.155
CysAsp: 0.578 ± 0.2
CysGlu: 0.385 ± 0.159
CysPhe: 0.289 ± 0.164
CysGly: 0.674 ± 0.312
CysHis: 0.385 ± 0.166
CysIle: 0.289 ± 0.154
CysLys: 0.674 ± 0.278
CysLeu: 0.963 ± 0.255
CysMet: 0.0 ± 0.0
CysAsn: 0.289 ± 0.179
CysPro: 0.193 ± 0.142
CysGln: 0.289 ± 0.153
CysArg: 0.193 ± 0.184
CysSer: 0.578 ± 0.288
CysThr: 0.0 ± 0.0
CysVal: 0.193 ± 0.186
CysTrp: 0.0 ± 0.0
CysTyr: 0.096 ± 0.085
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.334 ± 0.626
AspCys: 0.674 ± 0.257
AspAsp: 4.527 ± 0.553
AspGlu: 3.757 ± 0.526
AspPhe: 3.66 ± 0.473
AspGly: 4.334 ± 0.544
AspHis: 0.867 ± 0.228
AspIle: 4.623 ± 0.605
AspLys: 5.779 ± 0.748
AspLeu: 5.779 ± 0.915
AspMet: 1.06 ± 0.241
AspAsn: 4.816 ± 0.656
AspPro: 1.06 ± 0.271
AspGln: 1.445 ± 0.378
AspArg: 2.504 ± 0.388
AspSer: 3.757 ± 0.567
AspThr: 3.371 ± 0.479
AspVal: 4.334 ± 0.558
AspTrp: 1.348 ± 0.415
AspTyr: 2.89 ± 0.487
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.683 ± 0.741
GluCys: 0.289 ± 0.164
GluAsp: 2.119 ± 0.492
GluGlu: 4.238 ± 0.776
GluPhe: 2.697 ± 0.519
GluGly: 2.215 ± 0.581
GluHis: 1.252 ± 0.409
GluIle: 5.49 ± 0.817
GluLys: 5.972 ± 0.85
GluLeu: 7.898 ± 1.068
GluMet: 1.252 ± 0.348
GluAsn: 3.371 ± 0.664
GluPro: 1.734 ± 0.374
GluGln: 2.697 ± 0.497
GluArg: 2.697 ± 0.498
GluSer: 4.431 ± 0.75
GluThr: 4.045 ± 0.725
GluVal: 4.912 ± 0.877
GluTrp: 0.963 ± 0.302
GluTyr: 2.986 ± 0.503
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.564 ± 0.668
PheCys: 0.193 ± 0.147
PheAsp: 2.986 ± 0.549
PheGlu: 3.179 ± 0.442
PhePhe: 1.06 ± 0.225
PheGly: 2.312 ± 0.407
PheHis: 0.482 ± 0.277
PheIle: 2.119 ± 0.506
PheLys: 2.504 ± 0.475
PheLeu: 2.697 ± 0.477
PheMet: 0.771 ± 0.31
PheAsn: 3.371 ± 0.683
PhePro: 0.674 ± 0.28
PheGln: 1.252 ± 0.336
PheArg: 2.504 ± 0.436
PheSer: 2.408 ± 0.453
PheThr: 1.637 ± 0.392
PheVal: 2.312 ± 0.436
PheTrp: 0.482 ± 0.192
PheTyr: 0.771 ± 0.255
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.757 ± 0.972
GlyCys: 0.289 ± 0.176
GlyAsp: 4.623 ± 0.743
GlyGlu: 4.045 ± 0.707
GlyPhe: 2.601 ± 0.649
GlyGly: 4.238 ± 0.712
GlyHis: 1.348 ± 0.435
GlyIle: 5.394 ± 0.806
GlyLys: 5.876 ± 0.702
GlyLeu: 7.031 ± 0.937
GlyMet: 1.926 ± 0.429
GlyAsn: 2.408 ± 0.429
GlyPro: 2.215 ± 1.538
GlyGln: 2.986 ± 0.59
GlyArg: 2.504 ± 0.371
GlySer: 3.275 ± 0.595
GlyThr: 3.468 ± 0.622
GlyVal: 3.66 ± 0.605
GlyTrp: 0.578 ± 0.259
GlyTyr: 3.564 ± 0.592
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.867 ± 0.292
HisCys: 0.193 ± 0.167
HisAsp: 1.156 ± 0.331
HisGlu: 1.06 ± 0.34
HisPhe: 0.578 ± 0.281
HisGly: 0.963 ± 0.291
HisHis: 0.193 ± 0.124
HisIle: 1.348 ± 0.349
HisLys: 1.156 ± 0.388
HisLeu: 1.252 ± 0.343
HisMet: 0.193 ± 0.139
HisAsn: 0.867 ± 0.326
HisPro: 0.578 ± 0.285
HisGln: 0.482 ± 0.181
HisArg: 0.867 ± 0.278
HisSer: 1.252 ± 0.353
HisThr: 0.771 ± 0.261
HisVal: 0.963 ± 0.355
HisTrp: 0.193 ± 0.136
HisTyr: 1.06 ± 0.348
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.298 ± 0.818
IleCys: 0.578 ± 0.215
IleAsp: 5.49 ± 0.783
IleGlu: 5.683 ± 0.76
IlePhe: 2.504 ± 0.452
IleGly: 5.394 ± 0.821
IleHis: 1.06 ± 0.265
IleIle: 4.142 ± 0.576
IleLys: 7.802 ± 0.904
IleLeu: 5.683 ± 0.686
IleMet: 2.312 ± 0.578
IleAsn: 4.431 ± 0.782
IlePro: 2.408 ± 0.574
IleGln: 2.119 ± 0.489
IleArg: 2.504 ± 0.394
IleSer: 5.683 ± 0.942
IleThr: 5.972 ± 0.741
IleVal: 2.793 ± 0.522
IleTrp: 0.482 ± 0.228
IleTyr: 1.926 ± 0.468
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.201 ± 0.653
LysCys: 0.578 ± 0.267
LysAsp: 4.431 ± 0.669
LysGlu: 5.779 ± 0.729
LysPhe: 1.926 ± 0.371
LysGly: 5.394 ± 0.615
LysHis: 1.252 ± 0.378
LysIle: 7.224 ± 1.064
LysLys: 6.839 ± 1.019
LysLeu: 7.031 ± 0.854
LysMet: 2.89 ± 0.577
LysAsn: 5.009 ± 0.59
LysPro: 2.023 ± 0.436
LysGln: 5.394 ± 0.545
LysArg: 4.142 ± 0.621
LysSer: 7.417 ± 0.762
LysThr: 4.72 ± 0.707
LysVal: 6.357 ± 0.797
LysTrp: 0.674 ± 0.244
LysTyr: 2.986 ± 0.532
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.935 ± 0.781
LeuCys: 0.096 ± 0.078
LeuAsp: 5.779 ± 0.584
LeuGlu: 5.683 ± 0.818
LeuPhe: 3.371 ± 0.516
LeuGly: 4.912 ± 0.756
LeuHis: 1.252 ± 0.325
LeuIle: 5.876 ± 0.722
LeuLys: 10.017 ± 1.145
LeuLeu: 7.513 ± 0.967
LeuMet: 1.734 ± 0.373
LeuAsn: 4.238 ± 0.555
LeuPro: 3.468 ± 0.544
LeuGln: 3.564 ± 0.481
LeuArg: 3.179 ± 0.617
LeuSer: 6.357 ± 0.979
LeuThr: 5.49 ± 0.75
LeuVal: 5.201 ± 0.688
LeuTrp: 0.771 ± 0.299
LeuTyr: 2.119 ± 0.368
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.986 ± 0.735
MetCys: 0.193 ± 0.119
MetAsp: 1.734 ± 0.363
MetGlu: 1.541 ± 0.45
MetPhe: 0.771 ± 0.314
MetGly: 1.252 ± 0.513
MetHis: 0.289 ± 0.237
MetIle: 1.541 ± 0.258
MetLys: 1.348 ± 0.506
MetLeu: 1.252 ± 0.355
MetMet: 0.289 ± 0.157
MetAsn: 1.06 ± 0.281
MetPro: 0.674 ± 0.225
MetGln: 0.674 ± 0.227
MetArg: 0.963 ± 0.296
MetSer: 2.023 ± 0.326
MetThr: 2.697 ± 0.611
MetVal: 1.156 ± 0.351
MetTrp: 0.096 ± 0.083
MetTyr: 0.674 ± 0.21
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.66 ± 0.569
AsnCys: 0.289 ± 0.161
AsnAsp: 2.89 ± 0.446
AsnGlu: 3.468 ± 0.739
AsnPhe: 2.215 ± 0.408
AsnGly: 5.587 ± 0.63
AsnHis: 1.445 ± 0.396
AsnIle: 4.238 ± 0.717
AsnLys: 3.371 ± 0.532
AsnLeu: 5.298 ± 0.556
AsnMet: 1.156 ± 0.355
AsnAsn: 2.793 ± 0.682
AsnPro: 2.215 ± 0.491
AsnGln: 2.601 ± 0.506
AsnArg: 1.83 ± 0.429
AsnSer: 2.793 ± 0.502
AsnThr: 2.89 ± 0.49
AsnVal: 2.986 ± 0.495
AsnTrp: 0.674 ± 0.252
AsnTyr: 1.926 ± 0.469
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.312 ± 0.606
ProCys: 0.0 ± 0.0
ProAsp: 1.926 ± 0.448
ProGlu: 1.734 ± 0.38
ProPhe: 0.963 ± 0.321
ProGly: 1.348 ± 0.659
ProHis: 0.482 ± 0.239
ProIle: 1.637 ± 0.382
ProLys: 3.66 ± 0.707
ProLeu: 2.023 ± 0.425
ProMet: 0.578 ± 0.208
ProAsn: 1.156 ± 0.345
ProPro: 0.963 ± 0.522
ProGln: 1.637 ± 0.629
ProArg: 0.867 ± 0.331
ProSer: 1.83 ± 0.387
ProThr: 1.734 ± 0.452
ProVal: 2.408 ± 0.609
ProTrp: 0.0 ± 0.0
ProTyr: 0.385 ± 0.203
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.312 ± 0.404
GlnCys: 0.385 ± 0.181
GlnAsp: 2.601 ± 0.461
GlnGlu: 2.986 ± 0.626
GlnPhe: 2.119 ± 0.522
GlnGly: 2.601 ± 0.883
GlnHis: 0.674 ± 0.227
GlnIle: 3.853 ± 0.67
GlnLys: 4.527 ± 0.836
GlnLeu: 3.564 ± 0.655
GlnMet: 0.963 ± 0.282
GlnAsn: 2.215 ± 0.498
GlnPro: 1.445 ± 0.385
GlnGln: 2.408 ± 0.43
GlnArg: 1.252 ± 0.334
GlnSer: 2.986 ± 0.669
GlnThr: 1.734 ± 0.488
GlnVal: 1.83 ± 0.352
GlnTrp: 0.578 ± 0.204
GlnTyr: 1.637 ± 0.456
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.312 ± 0.457
ArgCys: 0.385 ± 0.25
ArgAsp: 2.408 ± 0.46
ArgGlu: 2.793 ± 0.554
ArgPhe: 1.445 ± 0.371
ArgGly: 2.215 ± 0.526
ArgHis: 0.771 ± 0.314
ArgIle: 2.89 ± 0.565
ArgLys: 4.045 ± 0.672
ArgLeu: 4.334 ± 0.584
ArgMet: 0.771 ± 0.234
ArgAsn: 2.408 ± 0.468
ArgPro: 1.348 ± 0.369
ArgGln: 1.445 ± 0.288
ArgArg: 1.734 ± 0.403
ArgSer: 2.89 ± 0.645
ArgThr: 2.793 ± 0.47
ArgVal: 2.119 ± 0.424
ArgTrp: 0.193 ± 0.143
ArgTyr: 1.348 ± 0.358
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.105 ± 1.221
SerCys: 0.578 ± 0.265
SerAsp: 5.009 ± 0.74
SerGlu: 4.816 ± 0.621
SerPhe: 3.082 ± 0.593
SerGly: 4.912 ± 0.764
SerHis: 0.963 ± 0.287
SerIle: 4.238 ± 0.645
SerLys: 5.587 ± 0.785
SerLeu: 4.912 ± 1.057
SerMet: 2.119 ± 0.311
SerAsn: 3.468 ± 0.617
SerPro: 1.83 ± 0.442
SerGln: 3.082 ± 0.516
SerArg: 2.986 ± 0.523
SerSer: 4.238 ± 0.652
SerThr: 3.468 ± 0.661
SerVal: 3.853 ± 0.544
SerTrp: 0.867 ± 0.235
SerTyr: 3.275 ± 0.731
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.623 ± 0.736
ThrCys: 0.674 ± 0.269
ThrAsp: 3.275 ± 0.541
ThrGlu: 3.275 ± 0.387
ThrPhe: 1.637 ± 0.389
ThrGly: 5.587 ± 1.149
ThrHis: 1.252 ± 0.345
ThrIle: 5.587 ± 0.868
ThrLys: 4.142 ± 0.567
ThrLeu: 5.201 ± 0.758
ThrMet: 0.578 ± 0.218
ThrAsn: 3.179 ± 0.515
ThrPro: 1.541 ± 0.36
ThrGln: 3.082 ± 0.522
ThrArg: 1.83 ± 0.469
ThrSer: 3.757 ± 0.476
ThrThr: 3.66 ± 0.689
ThrVal: 3.949 ± 0.573
ThrTrp: 0.385 ± 0.196
ThrTyr: 1.06 ± 0.282
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.431 ± 0.753
ValCys: 0.289 ± 0.198
ValAsp: 4.72 ± 0.736
ValGlu: 4.431 ± 0.674
ValPhe: 2.215 ± 0.432
ValGly: 3.564 ± 0.559
ValHis: 0.674 ± 0.218
ValIle: 3.275 ± 0.688
ValLys: 4.912 ± 0.648
ValLeu: 5.009 ± 0.819
ValMet: 1.348 ± 0.243
ValAsn: 2.601 ± 0.415
ValPro: 0.963 ± 0.361
ValGln: 2.793 ± 0.438
ValArg: 2.89 ± 0.634
ValSer: 4.623 ± 0.619
ValThr: 4.238 ± 0.715
ValVal: 5.009 ± 0.657
ValTrp: 0.385 ± 0.228
ValTyr: 2.215 ± 0.37
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.674 ± 0.24
TrpCys: 0.385 ± 0.188
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.771 ± 0.251
TrpPhe: 0.193 ± 0.159
TrpGly: 1.156 ± 0.411
TrpHis: 0.193 ± 0.186
TrpIle: 0.482 ± 0.225
TrpLys: 0.482 ± 0.213
TrpLeu: 1.252 ± 0.299
TrpMet: 0.578 ± 0.277
TrpAsn: 0.674 ± 0.195
TrpPro: 0.289 ± 0.156
TrpGln: 0.193 ± 0.136
TrpArg: 0.674 ± 0.26
TrpSer: 0.771 ± 0.241
TrpThr: 0.193 ± 0.132
TrpVal: 0.385 ± 0.186
TrpTrp: 0.096 ± 0.108
TrpTyr: 0.385 ± 0.175
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.601 ± 0.685
TyrCys: 0.193 ± 0.132
TyrAsp: 3.179 ± 0.71
TyrGlu: 1.926 ± 0.342
TyrPhe: 1.541 ± 0.48
TyrGly: 2.023 ± 0.384
TyrHis: 0.385 ± 0.179
TyrIle: 2.504 ± 0.405
TyrLys: 2.697 ± 0.441
TyrLeu: 2.986 ± 0.457
TyrMet: 1.156 ± 0.399
TyrAsn: 1.541 ± 0.347
TyrPro: 0.963 ± 0.296
TyrGln: 1.83 ± 0.316
TyrArg: 1.734 ± 0.423
TyrSer: 2.312 ± 0.484
TyrThr: 1.926 ± 0.386
TyrVal: 2.023 ± 0.463
TyrTrp: 0.193 ± 0.148
TyrTyr: 1.252 ± 0.473
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 54 proteins (10383 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
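One way a protein-level bootstrap of this kind could work is sketched below in Python. It is an assumption-laden illustration, not the Proteome-pI implementation: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation of the resampled frequencies is reported as the error. The function names, the choice of standard deviation as the error measure, the fixed seed and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Spread (in per mille) of one dipeptide frequency under resampling.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


toy = ["MKKLLA", "MKAA", "LLKK"]  # made-up sequences
print(bootstrap_error(toy, "MK"))
print(bootstrap_error(toy, "XX"))  # absent dipeptide, so the spread is 0.0
```

A dipeptide that never occurs has the same frequency (zero) in every resample, which is consistent with the "0.0 ± 0.0" entries in the table.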

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski