Amino acid dipepetide frequency for Streptococcus phage Javan94

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.956AlaAla: 3.956 ± 0.792
0.198AlaCys: 0.198 ± 0.131
3.956AlaAsp: 3.956 ± 0.573
6.231AlaGlu: 6.231 ± 0.675
1.879AlaPhe: 1.879 ± 0.562
4.648AlaGly: 4.648 ± 1.071
1.187AlaHis: 1.187 ± 0.444
7.022AlaIle: 7.022 ± 1.028
7.319AlaLys: 7.319 ± 1.052
5.044AlaLeu: 5.044 ± 0.794
2.275AlaMet: 2.275 ± 0.604
3.462AlaAsn: 3.462 ± 0.581
1.385AlaPro: 1.385 ± 0.32
2.374AlaGln: 2.374 ± 0.537
1.582AlaArg: 1.582 ± 0.373
4.846AlaSer: 4.846 ± 0.849
3.857AlaThr: 3.857 ± 0.734
4.451AlaVal: 4.451 ± 0.584
0.791AlaTrp: 0.791 ± 0.263
2.571AlaTyr: 2.571 ± 0.453
0.0AlaXaa: 0.0 ± 0.0
Cys
0.198CysAla: 0.198 ± 0.125
0.297CysCys: 0.297 ± 0.175
0.396CysAsp: 0.396 ± 0.232
0.791CysGlu: 0.791 ± 0.268
0.495CysPhe: 0.495 ± 0.18
0.692CysGly: 0.692 ± 0.249
0.198CysHis: 0.198 ± 0.138
0.396CysIle: 0.396 ± 0.253
0.396CysLys: 0.396 ± 0.161
0.89CysLeu: 0.89 ± 0.291
0.297CysMet: 0.297 ± 0.186
0.297CysAsn: 0.297 ± 0.156
0.0CysPro: 0.0 ± 0.0
0.396CysGln: 0.396 ± 0.17
0.099CysArg: 0.099 ± 0.103
0.593CysSer: 0.593 ± 0.235
0.099CysThr: 0.099 ± 0.105
0.297CysVal: 0.297 ± 0.225
0.0CysTrp: 0.0 ± 0.0
0.198CysTyr: 0.198 ± 0.207
0.0CysXaa: 0.0 ± 0.0
Asp
3.857AspAla: 3.857 ± 0.547
1.187AspCys: 1.187 ± 0.311
3.956AspAsp: 3.956 ± 0.549
5.242AspGlu: 5.242 ± 0.936
3.659AspPhe: 3.659 ± 0.562
5.242AspGly: 5.242 ± 0.782
0.198AspHis: 0.198 ± 0.138
4.648AspIle: 4.648 ± 0.561
5.637AspLys: 5.637 ± 0.928
6.132AspLeu: 6.132 ± 0.863
1.187AspMet: 1.187 ± 0.289
3.857AspAsn: 3.857 ± 0.61
1.681AspPro: 1.681 ± 0.435
1.088AspGln: 1.088 ± 0.3
2.275AspArg: 2.275 ± 0.454
3.066AspSer: 3.066 ± 0.565
4.352AspThr: 4.352 ± 0.715
4.945AspVal: 4.945 ± 0.751
1.187AspTrp: 1.187 ± 0.38
3.066AspTyr: 3.066 ± 0.712
0.0AspXaa: 0.0 ± 0.0
Glu
6.725GluAla: 6.725 ± 0.793
0.297GluCys: 0.297 ± 0.18
3.956GluAsp: 3.956 ± 0.599
5.835GluGlu: 5.835 ± 1.011
3.066GluPhe: 3.066 ± 0.604
3.264GluGly: 3.264 ± 0.503
1.484GluHis: 1.484 ± 0.378
5.637GluIle: 5.637 ± 0.759
5.341GluLys: 5.341 ± 0.835
7.912GluLeu: 7.912 ± 1.031
2.473GluMet: 2.473 ± 0.49
3.857GluAsn: 3.857 ± 0.574
2.176GluPro: 2.176 ± 0.532
3.462GluGln: 3.462 ± 0.704
3.066GluArg: 3.066 ± 0.674
3.758GluSer: 3.758 ± 0.545
3.659GluThr: 3.659 ± 0.645
4.648GluVal: 4.648 ± 0.975
1.187GluTrp: 1.187 ± 0.4
2.473GluTyr: 2.473 ± 0.49
0.0GluXaa: 0.0 ± 0.0
Phe
3.758PheAla: 3.758 ± 0.558
0.0PheCys: 0.0 ± 0.0
3.956PheAsp: 3.956 ± 0.556
4.648PheGlu: 4.648 ± 0.587
1.385PhePhe: 1.385 ± 0.321
2.473PheGly: 2.473 ± 0.457
0.495PheHis: 0.495 ± 0.172
1.78PheIle: 1.78 ± 0.466
3.56PheLys: 3.56 ± 0.569
2.571PheLeu: 2.571 ± 0.562
1.681PheMet: 1.681 ± 0.479
2.473PheAsn: 2.473 ± 0.568
0.692PhePro: 0.692 ± 0.355
1.187PheGln: 1.187 ± 0.352
1.681PheArg: 1.681 ± 0.372
2.571PheSer: 2.571 ± 0.528
2.473PheThr: 2.473 ± 0.436
2.473PheVal: 2.473 ± 0.413
0.396PheTrp: 0.396 ± 0.187
0.89PheTyr: 0.89 ± 0.293
0.0PheXaa: 0.0 ± 0.0
Gly
3.56GlyAla: 3.56 ± 1.107
0.099GlyCys: 0.099 ± 0.102
4.352GlyAsp: 4.352 ± 0.619
3.956GlyGlu: 3.956 ± 0.956
3.659GlyPhe: 3.659 ± 0.772
4.352GlyGly: 4.352 ± 0.838
1.187GlyHis: 1.187 ± 0.258
5.539GlyIle: 5.539 ± 0.85
6.923GlyLys: 6.923 ± 0.886
5.637GlyLeu: 5.637 ± 0.918
2.275GlyMet: 2.275 ± 0.515
3.462GlyAsn: 3.462 ± 0.638
2.868GlyPro: 2.868 ± 2.168
3.264GlyGln: 3.264 ± 0.665
2.275GlyArg: 2.275 ± 0.512
4.154GlySer: 4.154 ± 0.754
4.055GlyThr: 4.055 ± 0.76
3.56GlyVal: 3.56 ± 0.573
0.593GlyTrp: 0.593 ± 0.28
3.659GlyTyr: 3.659 ± 0.477
0.0GlyXaa: 0.0 ± 0.0
His
0.692HisAla: 0.692 ± 0.275
0.495HisCys: 0.495 ± 0.226
0.89HisAsp: 0.89 ± 0.28
1.286HisGlu: 1.286 ± 0.336
0.692HisPhe: 0.692 ± 0.298
1.286HisGly: 1.286 ± 0.29
0.198HisHis: 0.198 ± 0.133
0.89HisIle: 0.89 ± 0.304
0.791HisLys: 0.791 ± 0.243
1.286HisLeu: 1.286 ± 0.385
0.198HisMet: 0.198 ± 0.139
0.791HisAsn: 0.791 ± 0.238
0.495HisPro: 0.495 ± 0.196
1.187HisGln: 1.187 ± 0.393
0.396HisArg: 0.396 ± 0.195
0.692HisSer: 0.692 ± 0.229
0.495HisThr: 0.495 ± 0.194
0.495HisVal: 0.495 ± 0.26
0.297HisTrp: 0.297 ± 0.179
0.495HisTyr: 0.495 ± 0.199
0.0HisXaa: 0.0 ± 0.0
Ile
4.055IleAla: 4.055 ± 0.718
0.593IleCys: 0.593 ± 0.257
6.231IleAsp: 6.231 ± 0.884
5.934IleGlu: 5.934 ± 0.753
2.967IlePhe: 2.967 ± 0.498
2.769IleGly: 2.769 ± 0.693
0.791IleHis: 0.791 ± 0.251
3.758IleIle: 3.758 ± 0.526
6.528IleLys: 6.528 ± 0.982
5.539IleLeu: 5.539 ± 0.775
1.582IleMet: 1.582 ± 0.505
4.352IleAsn: 4.352 ± 0.879
2.374IlePro: 2.374 ± 0.494
1.78IleGln: 1.78 ± 0.391
3.363IleArg: 3.363 ± 0.551
5.835IleSer: 5.835 ± 1.051
4.154IleThr: 4.154 ± 0.704
3.56IleVal: 3.56 ± 0.531
0.396IleTrp: 0.396 ± 0.186
2.275IleTyr: 2.275 ± 0.692
0.0IleXaa: 0.0 ± 0.0
Lys
6.033LysAla: 6.033 ± 0.907
0.396LysCys: 0.396 ± 0.182
5.242LysAsp: 5.242 ± 0.832
7.121LysGlu: 7.121 ± 1.14
2.374LysPhe: 2.374 ± 0.418
5.242LysGly: 5.242 ± 0.865
1.286LysHis: 1.286 ± 0.323
6.231LysIle: 6.231 ± 0.974
6.923LysLys: 6.923 ± 1.142
7.418LysLeu: 7.418 ± 1.004
2.275LysMet: 2.275 ± 0.507
3.956LysAsn: 3.956 ± 0.692
2.868LysPro: 2.868 ± 0.522
4.154LysGln: 4.154 ± 0.624
4.055LysArg: 4.055 ± 0.633
5.044LysSer: 5.044 ± 0.707
5.341LysThr: 5.341 ± 0.697
6.132LysVal: 6.132 ± 0.66
0.989LysTrp: 0.989 ± 0.36
2.473LysTyr: 2.473 ± 0.495
0.0LysXaa: 0.0 ± 0.0
Leu
6.132LeuAla: 6.132 ± 0.761
0.396LeuCys: 0.396 ± 0.188
5.736LeuAsp: 5.736 ± 0.542
6.033LeuGlu: 6.033 ± 0.835
3.857LeuPhe: 3.857 ± 0.606
5.341LeuGly: 5.341 ± 0.997
1.385LeuHis: 1.385 ± 0.365
5.44LeuIle: 5.44 ± 0.955
9.099LeuLys: 9.099 ± 1.017
5.934LeuLeu: 5.934 ± 0.702
1.78LeuMet: 1.78 ± 0.306
4.55LeuAsn: 4.55 ± 0.762
2.374LeuPro: 2.374 ± 0.405
2.868LeuGln: 2.868 ± 0.614
3.462LeuArg: 3.462 ± 0.584
6.824LeuSer: 6.824 ± 0.988
5.143LeuThr: 5.143 ± 0.748
4.451LeuVal: 4.451 ± 0.833
0.791LeuTrp: 0.791 ± 0.271
2.967LeuTyr: 2.967 ± 0.543
0.0LeuXaa: 0.0 ± 0.0
Met
3.066MetAla: 3.066 ± 0.697
0.099MetCys: 0.099 ± 0.103
1.978MetAsp: 1.978 ± 0.459
1.681MetGlu: 1.681 ± 0.358
0.692MetPhe: 0.692 ± 0.27
1.879MetGly: 1.879 ± 0.398
0.593MetHis: 0.593 ± 0.249
1.484MetIle: 1.484 ± 0.391
1.582MetLys: 1.582 ± 0.378
1.582MetLeu: 1.582 ± 0.416
0.495MetMet: 0.495 ± 0.198
1.187MetAsn: 1.187 ± 0.347
0.692MetPro: 0.692 ± 0.293
1.484MetGln: 1.484 ± 0.34
0.396MetArg: 0.396 ± 0.188
1.879MetSer: 1.879 ± 0.365
2.473MetThr: 2.473 ± 0.542
1.681MetVal: 1.681 ± 0.408
0.495MetTrp: 0.495 ± 0.213
1.088MetTyr: 1.088 ± 0.386
0.0MetXaa: 0.0 ± 0.0
Asn
3.363AsnAla: 3.363 ± 0.593
0.495AsnCys: 0.495 ± 0.214
2.67AsnAsp: 2.67 ± 0.673
3.264AsnGlu: 3.264 ± 0.498
1.879AsnPhe: 1.879 ± 0.351
4.747AsnGly: 4.747 ± 0.61
0.692AsnHis: 0.692 ± 0.323
2.868AsnIle: 2.868 ± 0.461
4.352AsnLys: 4.352 ± 0.609
5.637AsnLeu: 5.637 ± 0.884
1.582AsnMet: 1.582 ± 0.351
2.374AsnAsn: 2.374 ± 0.456
2.077AsnPro: 2.077 ± 0.533
2.571AsnGln: 2.571 ± 0.454
2.077AsnArg: 2.077 ± 0.479
2.967AsnSer: 2.967 ± 0.622
2.473AsnThr: 2.473 ± 0.395
3.264AsnVal: 3.264 ± 0.511
0.593AsnTrp: 0.593 ± 0.224
1.879AsnTyr: 1.879 ± 0.454
0.0AsnXaa: 0.0 ± 0.0
Pro
1.978ProAla: 1.978 ± 0.628
0.099ProCys: 0.099 ± 0.103
2.374ProAsp: 2.374 ± 0.541
1.286ProGlu: 1.286 ± 0.424
1.088ProPhe: 1.088 ± 0.29
1.187ProGly: 1.187 ± 0.647
0.297ProHis: 0.297 ± 0.176
1.879ProIle: 1.879 ± 0.403
3.165ProLys: 3.165 ± 0.754
1.978ProLeu: 1.978 ± 0.406
0.495ProMet: 0.495 ± 0.161
1.484ProAsn: 1.484 ± 0.424
1.187ProPro: 1.187 ± 0.51
1.681ProGln: 1.681 ± 0.59
1.385ProArg: 1.385 ± 0.522
1.978ProSer: 1.978 ± 0.537
2.473ProThr: 2.473 ± 0.608
1.484ProVal: 1.484 ± 0.364
0.297ProTrp: 0.297 ± 0.166
1.088ProTyr: 1.088 ± 0.358
0.0ProXaa: 0.0 ± 0.0
Gln
2.473GlnAla: 2.473 ± 0.566
0.396GlnCys: 0.396 ± 0.194
1.978GlnAsp: 1.978 ± 0.419
3.066GlnGlu: 3.066 ± 0.75
2.176GlnPhe: 2.176 ± 0.542
3.264GlnGly: 3.264 ± 1.256
0.593GlnHis: 0.593 ± 0.298
2.868GlnIle: 2.868 ± 0.787
2.967GlnLys: 2.967 ± 0.602
4.253GlnLeu: 4.253 ± 0.702
0.989GlnMet: 0.989 ± 0.266
2.473GlnAsn: 2.473 ± 0.531
0.989GlnPro: 0.989 ± 0.339
1.582GlnGln: 1.582 ± 0.438
2.077GlnArg: 2.077 ± 0.52
2.967GlnSer: 2.967 ± 0.518
2.473GlnThr: 2.473 ± 0.453
1.582GlnVal: 1.582 ± 0.337
0.692GlnTrp: 0.692 ± 0.285
0.89GlnTyr: 0.89 ± 0.33
0.0GlnXaa: 0.0 ± 0.0
Arg
2.769ArgAla: 2.769 ± 0.654
0.198ArgCys: 0.198 ± 0.149
3.066ArgAsp: 3.066 ± 0.631
2.176ArgGlu: 2.176 ± 0.634
0.791ArgPhe: 0.791 ± 0.256
3.165ArgGly: 3.165 ± 0.857
0.396ArgHis: 0.396 ± 0.216
2.176ArgIle: 2.176 ± 0.431
2.769ArgLys: 2.769 ± 0.582
3.956ArgLeu: 3.956 ± 0.547
0.89ArgMet: 0.89 ± 0.32
2.275ArgAsn: 2.275 ± 0.49
1.385ArgPro: 1.385 ± 0.36
1.78ArgGln: 1.78 ± 0.503
1.582ArgArg: 1.582 ± 0.495
1.582ArgSer: 1.582 ± 0.535
1.78ArgThr: 1.78 ± 0.447
2.473ArgVal: 2.473 ± 0.418
0.989ArgTrp: 0.989 ± 0.332
2.176ArgTyr: 2.176 ± 0.464
0.0ArgXaa: 0.0 ± 0.0
Ser
4.55SerAla: 4.55 ± 1.252
0.593SerCys: 0.593 ± 0.274
4.055SerAsp: 4.055 ± 0.571
4.154SerGlu: 4.154 ± 0.693
3.264SerPhe: 3.264 ± 0.678
6.033SerGly: 6.033 ± 0.765
0.791SerHis: 0.791 ± 0.325
4.451SerIle: 4.451 ± 0.86
4.747SerLys: 4.747 ± 0.706
5.242SerLeu: 5.242 ± 1.034
1.582SerMet: 1.582 ± 0.328
3.857SerAsn: 3.857 ± 0.692
1.088SerPro: 1.088 ± 0.286
3.066SerGln: 3.066 ± 0.429
2.176SerArg: 2.176 ± 0.498
4.154SerSer: 4.154 ± 0.897
3.165SerThr: 3.165 ± 0.531
3.56SerVal: 3.56 ± 0.459
0.593SerTrp: 0.593 ± 0.282
2.868SerTyr: 2.868 ± 0.614
0.0SerXaa: 0.0 ± 0.0
Thr
4.648ThrAla: 4.648 ± 0.754
0.198ThrCys: 0.198 ± 0.126
2.571ThrAsp: 2.571 ± 0.317
4.055ThrGlu: 4.055 ± 0.653
2.473ThrPhe: 2.473 ± 0.437
6.231ThrGly: 6.231 ± 1.252
0.791ThrHis: 0.791 ± 0.31
4.945ThrIle: 4.945 ± 0.591
5.143ThrLys: 5.143 ± 0.781
5.143ThrLeu: 5.143 ± 0.672
1.582ThrMet: 1.582 ± 0.338
2.275ThrAsn: 2.275 ± 0.416
1.78ThrPro: 1.78 ± 0.483
2.077ThrGln: 2.077 ± 0.499
1.187ThrArg: 1.187 ± 0.282
3.758ThrSer: 3.758 ± 0.777
3.066ThrThr: 3.066 ± 0.564
4.846ThrVal: 4.846 ± 0.559
0.396ThrTrp: 0.396 ± 0.182
1.978ThrTyr: 1.978 ± 0.644
0.0ThrXaa: 0.0 ± 0.0
Val
3.363ValAla: 3.363 ± 0.58
0.396ValCys: 0.396 ± 0.184
5.341ValAsp: 5.341 ± 0.806
4.648ValGlu: 4.648 ± 0.62
2.275ValPhe: 2.275 ± 0.395
4.154ValGly: 4.154 ± 0.724
0.791ValHis: 0.791 ± 0.232
3.857ValIle: 3.857 ± 0.789
4.154ValLys: 4.154 ± 0.518
4.945ValLeu: 4.945 ± 0.645
1.385ValMet: 1.385 ± 0.352
2.571ValAsn: 2.571 ± 0.562
1.78ValPro: 1.78 ± 0.457
2.275ValGln: 2.275 ± 0.434
2.67ValArg: 2.67 ± 0.482
4.352ValSer: 4.352 ± 0.542
4.846ValThr: 4.846 ± 0.63
3.956ValVal: 3.956 ± 0.582
0.495ValTrp: 0.495 ± 0.223
2.473ValTyr: 2.473 ± 0.607
0.0ValXaa: 0.0 ± 0.0
Trp
0.791TrpAla: 0.791 ± 0.355
0.099TrpCys: 0.099 ± 0.107
0.396TrpAsp: 0.396 ± 0.187
0.692TrpGlu: 0.692 ± 0.262
0.495TrpPhe: 0.495 ± 0.203
1.286TrpGly: 1.286 ± 0.403
0.198TrpHis: 0.198 ± 0.208
0.593TrpIle: 0.593 ± 0.24
0.692TrpLys: 0.692 ± 0.33
0.692TrpLeu: 0.692 ± 0.261
0.297TrpMet: 0.297 ± 0.196
0.692TrpAsn: 0.692 ± 0.26
0.099TrpPro: 0.099 ± 0.103
0.495TrpGln: 0.495 ± 0.221
0.989TrpArg: 0.989 ± 0.336
1.187TrpSer: 1.187 ± 0.293
0.593TrpThr: 0.593 ± 0.231
0.495TrpVal: 0.495 ± 0.188
0.0TrpTrp: 0.0 ± 0.0
0.791TrpTyr: 0.791 ± 0.254
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.066TyrAla: 3.066 ± 0.487
0.495TyrCys: 0.495 ± 0.253
3.363TyrAsp: 3.363 ± 0.723
2.176TyrGlu: 2.176 ± 0.509
1.978TyrPhe: 1.978 ± 0.498
2.176TyrGly: 2.176 ± 0.419
0.495TyrHis: 0.495 ± 0.35
2.374TyrIle: 2.374 ± 0.387
3.462TyrLys: 3.462 ± 0.748
2.769TyrLeu: 2.769 ± 0.454
1.088TyrMet: 1.088 ± 0.314
1.582TyrAsn: 1.582 ± 0.371
0.89TyrPro: 0.89 ± 0.252
1.978TyrGln: 1.978 ± 0.357
1.681TyrArg: 1.681 ± 0.372
1.78TyrSer: 1.78 ± 0.424
2.176TyrThr: 2.176 ± 0.508
2.374TyrVal: 2.374 ± 0.344
0.396TyrTrp: 0.396 ± 0.194
1.582TyrTyr: 1.582 ± 0.411
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (10112 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski