Amino acid dipeptide frequency for Streptococcus phage Javan638

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
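As an illustration, a minimal sketch of how such a frequency could be computed is given below. It is not the code used to build this table: the function name `dipeptide_per_mille`, the toy sequences, and the choice to normalise by the total number of overlapping pairs are assumptions made for the example, and the database's own normalisation may differ.

```python
from collections import Counter

# The 20 standard residues in one-letter code; anything else is treated as X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping adjacent pairs; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_per_mille(["MKKLA", "AKKB"])
print(sorted(freqs.items()))
```

The toy input yields seven overlapping pairs in total, two of which are KK; the unknown residue B is counted as X.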

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins and all 20 amino acids were equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.
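The arithmetic behind the expected value can be sketched as follows. The helper names `uniform_expectation_per_mille` and `classify` are hypothetical, and the simple greater-than or less-than rule is an assumption: the exact threshold used for the red and blue colouring is not stated here. The three observed values are taken from the table.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille()          # 20 amino acids, 400 dipeptides
expected_with_xaa = uniform_expectation_per_mille(21)  # 441 dipeptides with Xaa

observed = {"LeuLeu": 7.927, "CysCys": 0.094, "AlaAsn": 3.586}
labels = {name: classify(value, expected) for name, value in observed.items()}
print(expected, round(expected_with_xaa, 3), labels)
```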

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
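For example, the unit conversion can be written as below. This is only a small sketch with hypothetical helper names; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 4.813  # AlaAla frequency in per mille, from the table
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
print(fraction, percent)
```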

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.813 ± 0.801
AlaCys: 0.661 ± 0.225
AlaAsp: 3.963 ± 0.685
AlaGlu: 3.492 ± 0.631
AlaPhe: 2.831 ± 0.433
AlaGly: 4.718 ± 0.659
AlaHis: 0.566 ± 0.253
AlaIle: 5.19 ± 0.782
AlaLys: 5.19 ± 0.653
AlaLeu: 5.568 ± 0.952
AlaMet: 1.982 ± 0.371
AlaAsn: 3.586 ± 0.611
AlaPro: 1.415 ± 0.338
AlaGln: 2.831 ± 0.563
AlaArg: 3.586 ± 0.585
AlaSer: 4.435 ± 0.537
AlaThr: 6.511 ± 0.928
AlaVal: 4.907 ± 0.646
AlaTrp: 0.849 ± 0.376
AlaTyr: 3.68 ± 0.465
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.283 ± 0.24
CysCys: 0.094 ± 0.095
CysAsp: 0.472 ± 0.186
CysGlu: 0.472 ± 0.174
CysPhe: 0.283 ± 0.205
CysGly: 0.755 ± 0.248
CysHis: 0.189 ± 0.134
CysIle: 0.094 ± 0.105
CysLys: 0.283 ± 0.13
CysLeu: 1.038 ± 0.361
CysMet: 0.283 ± 0.151
CysAsn: 0.189 ± 0.129
CysPro: 0.472 ± 0.222
CysGln: 0.755 ± 0.274
CysArg: 0.755 ± 0.299
CysSer: 0.755 ± 0.258
CysThr: 0.283 ± 0.217
CysVal: 0.472 ± 0.184
CysTrp: 0.0 ± 0.0
CysTyr: 0.849 ± 0.279
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.397 ± 0.593
AspCys: 0.377 ± 0.196
AspAsp: 3.869 ± 0.791
AspGlu: 5.662 ± 0.673
AspPhe: 3.02 ± 0.45
AspGly: 4.435 ± 0.666
AspHis: 1.415 ± 0.359
AspIle: 3.775 ± 0.561
AspLys: 4.435 ± 0.523
AspLeu: 6.417 ± 0.967
AspMet: 1.887 ± 0.43
AspAsn: 2.454 ± 0.646
AspPro: 1.793 ± 0.522
AspGln: 1.793 ± 0.425
AspArg: 1.793 ± 0.457
AspSer: 3.114 ± 0.539
AspThr: 2.642 ± 0.5
AspVal: 3.68 ± 0.588
AspTrp: 0.944 ± 0.34
AspTyr: 3.114 ± 0.566
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.001 ± 0.692
GluCys: 0.661 ± 0.251
GluAsp: 4.341 ± 0.762
GluGlu: 5.945 ± 0.908
GluPhe: 1.982 ± 0.465
GluGly: 6.039 ± 0.576
GluHis: 1.132 ± 0.235
GluIle: 3.208 ± 0.532
GluLys: 6.134 ± 0.637
GluLeu: 7.361 ± 0.682
GluMet: 2.076 ± 0.505
GluAsn: 3.397 ± 0.502
GluPro: 1.699 ± 0.342
GluGln: 3.303 ± 0.514
GluArg: 2.831 ± 0.518
GluSer: 3.397 ± 0.479
GluThr: 4.624 ± 0.558
GluVal: 5.001 ± 0.601
GluTrp: 0.377 ± 0.169
GluTyr: 2.265 ± 0.431
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.076 ± 0.487
PheCys: 0.566 ± 0.226
PheAsp: 2.548 ± 0.551
PheGlu: 2.737 ± 0.471
PhePhe: 1.699 ± 0.517
PheGly: 2.454 ± 0.371
PheHis: 0.755 ± 0.236
PheIle: 2.359 ± 0.441
PheLys: 3.492 ± 0.612
PheLeu: 3.114 ± 0.438
PheMet: 1.415 ± 0.366
PheAsn: 1.982 ± 0.33
PhePro: 0.472 ± 0.255
PheGln: 1.415 ± 0.44
PheArg: 1.604 ± 0.266
PheSer: 3.208 ± 0.382
PheThr: 1.699 ± 0.38
PheVal: 1.699 ± 0.351
PheTrp: 0.849 ± 0.247
PheTyr: 1.887 ± 0.393
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.586 ± 0.666
GlyCys: 0.472 ± 0.188
GlyAsp: 4.624 ± 0.741
GlyGlu: 4.246 ± 0.478
GlyPhe: 2.454 ± 0.377
GlyGly: 4.53 ± 0.902
GlyHis: 1.793 ± 0.402
GlyIle: 4.813 ± 0.73
GlyLys: 5.473 ± 0.881
GlyLeu: 5.568 ± 0.793
GlyMet: 2.925 ± 0.54
GlyAsn: 3.303 ± 0.673
GlyPro: 0.944 ± 0.264
GlyGln: 3.492 ± 0.486
GlyArg: 3.68 ± 0.711
GlySer: 4.624 ± 0.863
GlyThr: 4.718 ± 0.636
GlyVal: 4.718 ± 0.645
GlyTrp: 0.755 ± 0.243
GlyTyr: 2.925 ± 0.567
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.132 ± 0.282
HisCys: 0.094 ± 0.102
HisAsp: 1.51 ± 0.349
HisGlu: 0.944 ± 0.304
HisPhe: 0.944 ± 0.275
HisGly: 1.132 ± 0.237
HisHis: 0.755 ± 0.199
HisIle: 0.944 ± 0.313
HisLys: 0.849 ± 0.293
HisLeu: 2.17 ± 0.408
HisMet: 0.472 ± 0.186
HisAsn: 1.415 ± 0.345
HisPro: 1.51 ± 0.383
HisGln: 0.944 ± 0.275
HisArg: 1.321 ± 0.357
HisSer: 0.944 ± 0.238
HisThr: 1.038 ± 0.343
HisVal: 1.038 ± 0.321
HisTrp: 0.094 ± 0.085
HisTyr: 1.038 ± 0.373
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.53 ± 0.431
IleCys: 0.755 ± 0.226
IleAsp: 5.001 ± 0.631
IleGlu: 3.775 ± 0.56
IlePhe: 1.132 ± 0.303
IleGly: 4.058 ± 0.58
IleHis: 1.038 ± 0.232
IleIle: 3.775 ± 0.732
IleLys: 4.53 ± 0.827
IleLeu: 4.53 ± 0.664
IleMet: 1.227 ± 0.364
IleAsn: 2.265 ± 0.413
IlePro: 1.699 ± 0.448
IleGln: 1.887 ± 0.424
IleArg: 1.793 ± 0.468
IleSer: 4.058 ± 0.751
IleThr: 5.19 ± 0.784
IleVal: 3.963 ± 0.751
IleTrp: 1.038 ± 0.336
IleTyr: 2.265 ± 0.401
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.832 ± 0.685
LysCys: 0.283 ± 0.153
LysAsp: 3.586 ± 0.698
LysGlu: 5.096 ± 0.557
LysPhe: 2.17 ± 0.322
LysGly: 5.379 ± 0.584
LysHis: 2.454 ± 0.506
LysIle: 3.68 ± 0.703
LysLys: 4.435 ± 0.707
LysLeu: 6.228 ± 0.783
LysMet: 1.51 ± 0.341
LysAsn: 2.548 ± 0.504
LysPro: 1.982 ± 0.486
LysGln: 3.775 ± 0.721
LysArg: 4.435 ± 0.67
LysSer: 3.869 ± 0.661
LysThr: 4.624 ± 0.584
LysVal: 4.53 ± 0.583
LysTrp: 1.132 ± 0.315
LysTyr: 1.887 ± 0.404
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.738 ± 0.813
LeuCys: 0.377 ± 0.202
LeuAsp: 5.756 ± 0.615
LeuGlu: 7.832 ± 0.811
LeuPhe: 2.831 ± 0.487
LeuGly: 5.285 ± 0.618
LeuHis: 1.321 ± 0.336
LeuIle: 4.152 ± 0.634
LeuLys: 7.644 ± 0.862
LeuLeu: 7.927 ± 0.862
LeuMet: 1.982 ± 0.405
LeuAsn: 4.152 ± 0.63
LeuPro: 3.114 ± 0.569
LeuGln: 3.586 ± 0.59
LeuArg: 4.058 ± 0.722
LeuSer: 7.927 ± 1.022
LeuThr: 5.379 ± 0.665
LeuVal: 6.039 ± 0.704
LeuTrp: 1.038 ± 0.321
LeuTyr: 3.586 ± 0.549
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.454 ± 0.441
MetCys: 0.189 ± 0.128
MetAsp: 1.51 ± 0.424
MetGlu: 1.604 ± 0.534
MetPhe: 0.661 ± 0.215
MetGly: 2.454 ± 0.532
MetHis: 0.094 ± 0.098
MetIle: 0.944 ± 0.29
MetLys: 2.265 ± 0.407
MetLeu: 1.604 ± 0.499
MetMet: 0.755 ± 0.292
MetAsn: 0.661 ± 0.254
MetPro: 0.566 ± 0.22
MetGln: 0.566 ± 0.225
MetArg: 0.849 ± 0.233
MetSer: 2.454 ± 0.511
MetThr: 2.831 ± 0.584
MetVal: 1.415 ± 0.362
MetTrp: 0.189 ± 0.126
MetTyr: 0.283 ± 0.224
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.492 ± 0.628
AsnCys: 0.283 ± 0.196
AsnAsp: 2.17 ± 0.382
AsnGlu: 3.02 ± 0.74
AsnPhe: 1.699 ± 0.471
AsnGly: 4.718 ± 0.868
AsnHis: 1.51 ± 0.353
AsnIle: 2.454 ± 0.453
AsnLys: 2.925 ± 0.435
AsnLeu: 4.152 ± 0.672
AsnMet: 0.944 ± 0.275
AsnAsn: 2.17 ± 0.503
AsnPro: 1.887 ± 0.409
AsnGln: 2.454 ± 0.456
AsnArg: 1.604 ± 0.398
AsnSer: 3.397 ± 0.5
AsnThr: 3.02 ± 0.671
AsnVal: 1.887 ± 0.449
AsnTrp: 0.944 ± 0.26
AsnTyr: 1.227 ± 0.342
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.038 ± 0.257
ProCys: 0.283 ± 0.136
ProAsp: 1.793 ± 0.394
ProGlu: 1.887 ± 0.431
ProPhe: 1.132 ± 0.324
ProGly: 1.793 ± 0.533
ProHis: 0.377 ± 0.164
ProIle: 1.604 ± 0.391
ProLys: 2.737 ± 0.52
ProLeu: 2.454 ± 0.501
ProMet: 0.189 ± 0.126
ProAsn: 1.604 ± 0.432
ProPro: 0.849 ± 0.315
ProGln: 1.132 ± 0.29
ProArg: 1.132 ± 0.311
ProSer: 2.642 ± 0.516
ProThr: 2.737 ± 0.558
ProVal: 1.793 ± 0.353
ProTrp: 0.377 ± 0.167
ProTyr: 1.699 ± 0.44
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.869 ± 0.816
GlnCys: 0.189 ± 0.13
GlnAsp: 1.51 ± 0.4
GlnGlu: 2.454 ± 0.34
GlnPhe: 2.642 ± 0.586
GlnGly: 2.642 ± 0.574
GlnHis: 0.944 ± 0.235
GlnIle: 2.737 ± 0.422
GlnLys: 3.208 ± 0.498
GlnLeu: 4.624 ± 0.651
GlnMet: 0.849 ± 0.268
GlnAsn: 2.359 ± 0.577
GlnPro: 1.132 ± 0.474
GlnGln: 1.887 ± 0.619
GlnArg: 1.699 ± 0.37
GlnSer: 2.642 ± 0.506
GlnThr: 2.737 ± 0.526
GlnVal: 3.208 ± 0.645
GlnTrp: 0.661 ± 0.298
GlnTyr: 1.038 ± 0.309
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.548 ± 0.442
ArgCys: 0.849 ± 0.288
ArgAsp: 2.17 ± 0.497
ArgGlu: 2.831 ± 0.449
ArgPhe: 1.793 ± 0.437
ArgGly: 2.548 ± 0.438
ArgHis: 0.849 ± 0.31
ArgIle: 2.359 ± 0.476
ArgLys: 3.303 ± 0.707
ArgLeu: 4.624 ± 0.685
ArgMet: 0.566 ± 0.281
ArgAsn: 2.548 ± 0.465
ArgPro: 1.699 ± 0.427
ArgGln: 2.925 ± 0.385
ArgArg: 2.076 ± 0.513
ArgSer: 1.982 ± 0.342
ArgThr: 2.548 ± 0.391
ArgVal: 3.208 ± 0.585
ArgTrp: 0.755 ± 0.268
ArgTyr: 1.699 ± 0.347
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.718 ± 0.771
SerCys: 0.377 ± 0.257
SerAsp: 3.869 ± 0.701
SerGlu: 4.907 ± 0.727
SerPhe: 3.114 ± 0.517
SerGly: 4.813 ± 0.671
SerHis: 2.17 ± 0.328
SerIle: 4.246 ± 0.675
SerLys: 4.152 ± 0.424
SerLeu: 5.756 ± 0.825
SerMet: 1.604 ± 0.403
SerAsn: 2.737 ± 0.545
SerPro: 1.982 ± 0.392
SerGln: 1.982 ± 0.363
SerArg: 2.359 ± 0.502
SerSer: 4.341 ± 1.015
SerThr: 3.68 ± 0.572
SerVal: 5.568 ± 0.681
SerTrp: 1.793 ± 0.371
SerTyr: 2.265 ± 0.423
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.435 ± 0.772
ThrCys: 0.566 ± 0.209
ThrAsp: 2.925 ± 0.452
ThrGlu: 4.907 ± 0.671
ThrPhe: 3.397 ± 0.749
ThrGly: 4.246 ± 0.559
ThrHis: 0.755 ± 0.2
ThrIle: 4.813 ± 0.907
ThrLys: 4.435 ± 0.662
ThrLeu: 6.511 ± 0.766
ThrMet: 1.51 ± 0.383
ThrAsn: 3.775 ± 0.516
ThrPro: 2.642 ± 0.712
ThrGln: 2.265 ± 0.557
ThrArg: 1.982 ± 0.421
ThrSer: 4.341 ± 0.632
ThrThr: 5.285 ± 0.708
ThrVal: 5.473 ± 0.68
ThrTrp: 1.227 ± 0.334
ThrTyr: 2.642 ± 0.393
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.586 ± 0.571
ValCys: 0.566 ± 0.202
ValAsp: 4.435 ± 0.655
ValGlu: 4.435 ± 0.734
ValPhe: 2.265 ± 0.475
ValGly: 4.058 ± 0.528
ValHis: 1.038 ± 0.299
ValIle: 4.624 ± 0.571
ValLys: 3.492 ± 0.465
ValLeu: 7.455 ± 0.713
ValMet: 1.321 ± 0.318
ValAsn: 2.076 ± 0.3
ValPro: 1.982 ± 0.451
ValGln: 2.642 ± 0.411
ValArg: 2.831 ± 0.479
ValSer: 5.379 ± 0.698
ValThr: 5.568 ± 0.834
ValVal: 3.775 ± 0.721
ValTrp: 1.415 ± 0.526
ValTyr: 2.454 ± 0.521
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.849 ± 0.271
TrpCys: 0.189 ± 0.144
TrpAsp: 0.661 ± 0.168
TrpGlu: 1.699 ± 0.379
TrpPhe: 0.661 ± 0.286
TrpGly: 0.472 ± 0.18
TrpHis: 0.283 ± 0.165
TrpIle: 0.755 ± 0.27
TrpLys: 1.038 ± 0.318
TrpLeu: 1.51 ± 0.3
TrpMet: 0.377 ± 0.189
TrpAsn: 1.321 ± 0.512
TrpPro: 0.0 ± 0.0
TrpGln: 0.944 ± 0.247
TrpArg: 0.944 ± 0.277
TrpSer: 1.132 ± 0.375
TrpThr: 0.849 ± 0.268
TrpVal: 0.944 ± 0.434
TrpTrp: 0.283 ± 0.153
TrpTyr: 0.189 ± 0.119
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.397 ± 0.539
TyrCys: 0.944 ± 0.308
TyrAsp: 3.303 ± 0.687
TyrGlu: 2.831 ± 0.363
TyrPhe: 1.321 ± 0.377
TyrGly: 3.114 ± 0.469
TyrHis: 0.849 ± 0.212
TyrIle: 1.982 ± 0.483
TyrLys: 1.415 ± 0.468
TyrLeu: 3.208 ± 0.685
TyrMet: 0.377 ± 0.177
TyrAsn: 1.415 ± 0.345
TyrPro: 1.415 ± 0.319
TyrGln: 2.454 ± 0.475
TyrArg: 2.548 ± 0.418
TyrSer: 1.793 ± 0.366
TyrThr: 2.17 ± 0.443
TyrVal: 2.076 ± 0.38
TyrTrp: 0.377 ± 0.154
TyrTyr: 1.51 ± 0.463
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (10598 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
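A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the sample standard deviation across replicates is taken as the error. The function names, the toy sequences, and the fixed random seed are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical one-letter sequences, not from this phage).
toy = ["MKKLAKK", "MALLS", "MKTAYKK", "MSSLL"]
point = pair_per_mille(toy, "KK")
error = bootstrap_error(toy, "KK")
print(f"KK: {point:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.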

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski