Amino acid dipepetide frequency for Bourbon virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.173AlaAla: 4.173 ± 0.471
1.192AlaCys: 1.192 ± 0.542
1.192AlaAsp: 1.192 ± 0.474
3.875AlaGlu: 3.875 ± 1.015
2.683AlaPhe: 2.683 ± 1.129
2.086AlaGly: 2.086 ± 0.68
1.49AlaHis: 1.49 ± 0.383
2.086AlaIle: 2.086 ± 0.321
4.471AlaLys: 4.471 ± 0.607
4.769AlaLeu: 4.769 ± 1.117
2.683AlaMet: 2.683 ± 0.752
2.683AlaAsn: 2.683 ± 0.799
2.981AlaPro: 2.981 ± 0.532
0.894AlaGln: 0.894 ± 0.578
2.683AlaArg: 2.683 ± 1.535
4.173AlaSer: 4.173 ± 1.08
2.981AlaThr: 2.981 ± 0.885
3.577AlaVal: 3.577 ± 1.038
0.596AlaTrp: 0.596 ± 0.327
1.49AlaTyr: 1.49 ± 0.704
0.0AlaXaa: 0.0 ± 0.0
Cys
0.298CysAla: 0.298 ± 0.264
0.298CysCys: 0.298 ± 0.283
0.596CysAsp: 0.596 ± 0.409
0.298CysGlu: 0.298 ± 0.289
1.788CysPhe: 1.788 ± 0.892
1.49CysGly: 1.49 ± 0.76
0.894CysHis: 0.894 ± 0.467
1.49CysIle: 1.49 ± 0.512
1.788CysLys: 1.788 ± 0.684
2.981CysLeu: 2.981 ± 0.729
0.0CysMet: 0.0 ± 0.0
1.192CysAsn: 1.192 ± 0.601
0.894CysPro: 0.894 ± 0.431
0.0CysGln: 0.0 ± 0.0
0.596CysArg: 0.596 ± 0.339
1.788CysSer: 1.788 ± 0.866
1.192CysThr: 1.192 ± 0.619
1.49CysVal: 1.49 ± 0.829
0.0CysTrp: 0.0 ± 0.0
1.49CysTyr: 1.49 ± 1.065
0.0CysXaa: 0.0 ± 0.0
Asp
3.279AspAla: 3.279 ± 0.991
0.298AspCys: 0.298 ± 0.289
2.086AspAsp: 2.086 ± 0.612
4.173AspGlu: 4.173 ± 0.746
1.192AspPhe: 1.192 ± 0.409
3.279AspGly: 3.279 ± 1.464
1.49AspHis: 1.49 ± 0.453
1.49AspIle: 1.49 ± 0.427
2.385AspLys: 2.385 ± 1.036
4.769AspLeu: 4.769 ± 1.037
1.788AspMet: 1.788 ± 0.78
2.385AspAsn: 2.385 ± 0.536
2.683AspPro: 2.683 ± 0.641
2.683AspGln: 2.683 ± 0.626
2.385AspArg: 2.385 ± 0.692
2.385AspSer: 2.385 ± 0.684
2.683AspThr: 2.683 ± 0.604
2.385AspVal: 2.385 ± 0.518
1.192AspTrp: 1.192 ± 0.331
1.788AspTyr: 1.788 ± 0.418
0.0AspXaa: 0.0 ± 0.0
Glu
3.279GluAla: 3.279 ± 0.886
2.385GluCys: 2.385 ± 0.947
4.471GluAsp: 4.471 ± 0.832
6.557GluGlu: 6.557 ± 1.066
2.683GluPhe: 2.683 ± 1.122
4.471GluGly: 4.471 ± 0.868
0.894GluHis: 0.894 ± 0.552
2.683GluIle: 2.683 ± 0.486
7.75GluLys: 7.75 ± 0.658
7.75GluLeu: 7.75 ± 1.5
2.981GluMet: 2.981 ± 0.69
1.49GluAsn: 1.49 ± 0.502
2.981GluPro: 2.981 ± 0.623
2.086GluGln: 2.086 ± 0.758
3.577GluArg: 3.577 ± 0.866
3.577GluSer: 3.577 ± 0.566
4.471GluThr: 4.471 ± 0.997
5.961GluVal: 5.961 ± 0.558
2.086GluTrp: 2.086 ± 1.403
2.385GluTyr: 2.385 ± 0.724
0.0GluXaa: 0.0 ± 0.0
Phe
1.49PheAla: 1.49 ± 0.575
0.596PheCys: 0.596 ± 0.337
2.385PheAsp: 2.385 ± 0.593
2.385PheGlu: 2.385 ± 0.773
2.385PhePhe: 2.385 ± 0.881
1.788PheGly: 1.788 ± 0.639
1.192PheHis: 1.192 ± 0.5
2.385PheIle: 2.385 ± 1.352
4.471PheLys: 4.471 ± 0.731
4.173PheLeu: 4.173 ± 1.327
1.49PheMet: 1.49 ± 0.383
1.788PheAsn: 1.788 ± 0.882
1.192PhePro: 1.192 ± 0.474
0.894PheGln: 0.894 ± 0.504
1.49PheArg: 1.49 ± 0.502
2.981PheSer: 2.981 ± 1.101
1.49PheThr: 1.49 ± 0.669
3.279PheVal: 3.279 ± 0.76
0.596PheTrp: 0.596 ± 0.343
0.596PheTyr: 0.596 ± 0.307
0.0PheXaa: 0.0 ± 0.0
Gly
2.683GlyAla: 2.683 ± 0.699
0.596GlyCys: 0.596 ± 0.442
2.683GlyAsp: 2.683 ± 0.864
4.173GlyGlu: 4.173 ± 1.39
2.683GlyPhe: 2.683 ± 0.637
2.683GlyGly: 2.683 ± 0.762
1.49GlyHis: 1.49 ± 0.636
2.981GlyIle: 2.981 ± 0.269
4.173GlyLys: 4.173 ± 1.527
4.471GlyLeu: 4.471 ± 1.602
1.788GlyMet: 1.788 ± 0.85
2.086GlyAsn: 2.086 ± 0.528
3.577GlyPro: 3.577 ± 0.89
1.49GlyGln: 1.49 ± 0.593
3.279GlyArg: 3.279 ± 0.781
2.683GlySer: 2.683 ± 0.699
5.067GlyThr: 5.067 ± 1.319
3.577GlyVal: 3.577 ± 0.866
0.596GlyTrp: 0.596 ± 0.529
0.596GlyTyr: 0.596 ± 0.343
0.0GlyXaa: 0.0 ± 0.0
His
1.788HisAla: 1.788 ± 0.708
0.894HisCys: 0.894 ± 0.347
0.596HisAsp: 0.596 ± 0.339
0.894HisGlu: 0.894 ± 0.624
2.683HisPhe: 2.683 ± 0.515
0.0HisGly: 0.0 ± 0.0
1.49HisHis: 1.49 ± 0.363
0.894HisIle: 0.894 ± 0.624
1.788HisLys: 1.788 ± 0.555
2.683HisLeu: 2.683 ± 0.683
0.298HisMet: 0.298 ± 0.283
0.894HisAsn: 0.894 ± 0.511
0.298HisPro: 0.298 ± 0.264
0.894HisGln: 0.894 ± 0.548
1.788HisArg: 1.788 ± 0.811
2.385HisSer: 2.385 ± 0.617
1.788HisThr: 1.788 ± 0.632
1.192HisVal: 1.192 ± 0.426
0.894HisTrp: 0.894 ± 0.504
0.298HisTyr: 0.298 ± 0.289
0.0HisXaa: 0.0 ± 0.0
Ile
3.279IleAla: 3.279 ± 0.508
2.086IleCys: 2.086 ± 0.828
2.683IleAsp: 2.683 ± 0.925
3.577IleGlu: 3.577 ± 0.825
0.596IlePhe: 0.596 ± 0.309
4.471IleGly: 4.471 ± 0.44
2.683IleHis: 2.683 ± 0.358
2.683IleIle: 2.683 ± 0.828
3.577IleLys: 3.577 ± 0.602
5.365IleLeu: 5.365 ± 0.919
1.49IleMet: 1.49 ± 0.478
1.49IleAsn: 1.49 ± 0.46
2.086IlePro: 2.086 ± 0.921
1.788IleGln: 1.788 ± 0.605
3.577IleArg: 3.577 ± 0.695
5.365IleSer: 5.365 ± 1.018
2.981IleThr: 2.981 ± 1.479
2.683IleVal: 2.683 ± 1.034
0.894IleTrp: 0.894 ± 0.422
0.894IleTyr: 0.894 ± 0.293
0.0IleXaa: 0.0 ± 0.0
Lys
3.279LysAla: 3.279 ± 1.732
1.788LysCys: 1.788 ± 0.776
3.279LysAsp: 3.279 ± 0.796
3.577LysGlu: 3.577 ± 0.879
2.981LysPhe: 2.981 ± 0.838
3.875LysGly: 3.875 ± 1.222
1.192LysHis: 1.192 ± 0.5
5.067LysIle: 5.067 ± 1.553
5.365LysLys: 5.365 ± 1.632
5.961LysLeu: 5.961 ± 1.143
2.683LysMet: 2.683 ± 0.924
2.981LysAsn: 2.981 ± 0.968
2.683LysPro: 2.683 ± 0.901
3.279LysGln: 3.279 ± 0.904
5.663LysArg: 5.663 ± 1.234
3.577LysSer: 3.577 ± 0.763
8.346LysThr: 8.346 ± 0.469
4.471LysVal: 4.471 ± 0.643
0.596LysTrp: 0.596 ± 0.279
4.471LysTyr: 4.471 ± 1.12
0.0LysXaa: 0.0 ± 0.0
Leu
5.365LeuAla: 5.365 ± 0.914
2.981LeuCys: 2.981 ± 0.665
5.067LeuAsp: 5.067 ± 0.753
10.73LeuGlu: 10.73 ± 1.132
2.086LeuPhe: 2.086 ± 0.321
5.365LeuGly: 5.365 ± 1.358
2.086LeuHis: 2.086 ± 0.737
5.067LeuIle: 5.067 ± 0.758
4.769LeuLys: 4.769 ± 1.336
9.538LeuLeu: 9.538 ± 1.632
2.385LeuMet: 2.385 ± 0.712
4.769LeuAsn: 4.769 ± 1.632
5.663LeuPro: 5.663 ± 1.094
2.683LeuGln: 2.683 ± 0.604
3.875LeuArg: 3.875 ± 1.338
8.346LeuSer: 8.346 ± 1.623
3.279LeuThr: 3.279 ± 0.931
8.942LeuVal: 8.942 ± 1.575
2.086LeuTrp: 2.086 ± 0.377
2.981LeuTyr: 2.981 ± 0.729
0.0LeuXaa: 0.0 ± 0.0
Met
2.086MetAla: 2.086 ± 0.806
0.596MetCys: 0.596 ± 0.466
0.596MetAsp: 0.596 ± 0.325
4.173MetGlu: 4.173 ± 1.107
1.192MetPhe: 1.192 ± 0.668
1.192MetGly: 1.192 ± 0.933
1.192MetHis: 1.192 ± 0.456
1.49MetIle: 1.49 ± 0.52
2.385MetLys: 2.385 ± 0.508
1.192MetLeu: 1.192 ± 0.46
0.894MetMet: 0.894 ± 0.524
2.086MetAsn: 2.086 ± 0.998
0.894MetPro: 0.894 ± 0.357
0.894MetGln: 0.894 ± 0.359
1.788MetArg: 1.788 ± 0.462
2.086MetSer: 2.086 ± 0.765
1.192MetThr: 1.192 ± 0.331
1.788MetVal: 1.788 ± 0.772
1.192MetTrp: 1.192 ± 0.507
0.298MetTyr: 0.298 ± 0.386
0.0MetXaa: 0.0 ± 0.0
Asn
2.086AsnAla: 2.086 ± 0.675
0.298AsnCys: 0.298 ± 0.264
1.49AsnAsp: 1.49 ± 0.66
4.173AsnGlu: 4.173 ± 1.595
2.385AsnPhe: 2.385 ± 0.84
2.086AsnGly: 2.086 ± 1.28
0.894AsnHis: 0.894 ± 0.384
2.385AsnIle: 2.385 ± 0.312
2.981AsnLys: 2.981 ± 0.91
3.577AsnLeu: 3.577 ± 0.785
0.596AsnMet: 0.596 ± 0.309
2.086AsnAsn: 2.086 ± 1.28
3.577AsnPro: 3.577 ± 0.625
2.086AsnGln: 2.086 ± 0.624
2.385AsnArg: 2.385 ± 0.695
2.385AsnSer: 2.385 ± 0.684
1.192AsnThr: 1.192 ± 0.617
2.981AsnVal: 2.981 ± 0.441
0.298AsnTrp: 0.298 ± 0.235
1.192AsnTyr: 1.192 ± 0.518
0.0AsnXaa: 0.0 ± 0.0
Pro
2.086ProAla: 2.086 ± 0.646
0.894ProCys: 0.894 ± 0.523
2.086ProAsp: 2.086 ± 0.61
3.577ProGlu: 3.577 ± 0.696
2.086ProPhe: 2.086 ± 0.513
1.788ProGly: 1.788 ± 0.752
1.788ProHis: 1.788 ± 0.497
2.086ProIle: 2.086 ± 0.813
4.173ProLys: 4.173 ± 1.229
5.067ProLeu: 5.067 ± 1.774
0.894ProMet: 0.894 ± 0.293
2.981ProAsn: 2.981 ± 1.087
1.192ProPro: 1.192 ± 0.418
2.086ProGln: 2.086 ± 0.378
1.788ProArg: 1.788 ± 0.772
4.173ProSer: 4.173 ± 1.305
2.981ProThr: 2.981 ± 0.885
4.173ProVal: 4.173 ± 0.872
0.0ProTrp: 0.0 ± 0.0
1.788ProTyr: 1.788 ± 0.761
0.0ProXaa: 0.0 ± 0.0
Gln
2.086GlnAla: 2.086 ± 0.689
0.0GlnCys: 0.0 ± 0.0
2.086GlnAsp: 2.086 ± 0.777
4.173GlnGlu: 4.173 ± 0.82
0.894GlnPhe: 0.894 ± 0.504
1.788GlnGly: 1.788 ± 0.638
0.298GlnHis: 0.298 ± 0.235
2.385GlnIle: 2.385 ± 0.951
1.192GlnLys: 1.192 ± 0.654
3.875GlnLeu: 3.875 ± 1.29
2.086GlnMet: 2.086 ± 1.07
1.49GlnAsn: 1.49 ± 0.653
2.086GlnPro: 2.086 ± 0.372
2.385GlnGln: 2.385 ± 0.858
2.683GlnArg: 2.683 ± 1.261
2.385GlnSer: 2.385 ± 0.66
0.894GlnThr: 0.894 ± 0.347
2.683GlnVal: 2.683 ± 1.527
0.298GlnTrp: 0.298 ± 0.235
1.49GlnTyr: 1.49 ± 0.825
0.0GlnXaa: 0.0 ± 0.0
Arg
3.875ArgAla: 3.875 ± 1.123
0.596ArgCys: 0.596 ± 0.325
2.385ArgAsp: 2.385 ± 1.005
2.981ArgGlu: 2.981 ± 0.841
2.683ArgPhe: 2.683 ± 1.034
2.683ArgGly: 2.683 ± 0.766
0.894ArgHis: 0.894 ± 0.261
4.471ArgIle: 4.471 ± 1.223
2.385ArgLys: 2.385 ± 1.178
5.663ArgLeu: 5.663 ± 0.895
1.192ArgMet: 1.192 ± 0.616
2.086ArgAsn: 2.086 ± 0.877
3.875ArgPro: 3.875 ± 1.388
1.788ArgGln: 1.788 ± 0.632
2.683ArgArg: 2.683 ± 0.435
3.875ArgSer: 3.875 ± 1.184
3.279ArgThr: 3.279 ± 1.156
2.981ArgVal: 2.981 ± 0.816
0.894ArgTrp: 0.894 ± 0.334
1.49ArgTyr: 1.49 ± 0.478
0.0ArgXaa: 0.0 ± 0.0
Ser
2.981SerAla: 2.981 ± 1.246
0.894SerCys: 0.894 ± 0.261
3.875SerAsp: 3.875 ± 0.583
4.769SerGlu: 4.769 ± 1.152
1.788SerPhe: 1.788 ± 0.516
4.471SerGly: 4.471 ± 1.359
1.192SerHis: 1.192 ± 0.559
3.279SerIle: 3.279 ± 1.249
6.557SerLys: 6.557 ± 1.85
7.75SerLeu: 7.75 ± 0.993
2.385SerMet: 2.385 ± 0.575
2.683SerAsn: 2.683 ± 0.732
4.173SerPro: 4.173 ± 1.044
4.471SerGln: 4.471 ± 1.173
1.49SerArg: 1.49 ± 0.823
7.154SerSer: 7.154 ± 1.596
3.875SerThr: 3.875 ± 0.92
5.067SerVal: 5.067 ± 1.284
1.788SerTrp: 1.788 ± 0.586
2.683SerTyr: 2.683 ± 0.59
0.0SerXaa: 0.0 ± 0.0
Thr
1.192ThrAla: 1.192 ± 0.939
0.894ThrCys: 0.894 ± 0.452
3.279ThrAsp: 3.279 ± 1.485
2.981ThrGlu: 2.981 ± 0.495
2.683ThrPhe: 2.683 ± 0.578
4.471ThrGly: 4.471 ± 0.97
1.192ThrHis: 1.192 ± 0.258
5.067ThrIle: 5.067 ± 0.924
5.365ThrLys: 5.365 ± 1.342
5.663ThrLeu: 5.663 ± 0.649
0.894ThrMet: 0.894 ± 0.357
1.49ThrAsn: 1.49 ± 0.363
2.683ThrPro: 2.683 ± 0.722
1.788ThrGln: 1.788 ± 0.549
5.067ThrArg: 5.067 ± 0.823
4.173ThrSer: 4.173 ± 1.562
3.577ThrThr: 3.577 ± 0.666
5.365ThrVal: 5.365 ± 1.422
1.788ThrTrp: 1.788 ± 0.654
2.086ThrTyr: 2.086 ± 0.868
0.0ThrXaa: 0.0 ± 0.0
Val
5.961ValAla: 5.961 ± 1.183
1.788ValCys: 1.788 ± 0.695
3.875ValAsp: 3.875 ± 0.981
4.769ValGlu: 4.769 ± 1.074
3.279ValPhe: 3.279 ± 0.723
3.577ValGly: 3.577 ± 0.989
0.894ValHis: 0.894 ± 0.319
3.875ValIle: 3.875 ± 0.663
4.471ValLys: 4.471 ± 0.885
8.346ValLeu: 8.346 ± 1.597
0.894ValMet: 0.894 ± 0.293
1.192ValAsn: 1.192 ± 0.258
2.385ValPro: 2.385 ± 0.732
2.683ValGln: 2.683 ± 0.475
3.279ValArg: 3.279 ± 0.957
4.769ValSer: 4.769 ± 1.42
5.365ValThr: 5.365 ± 1.016
4.471ValVal: 4.471 ± 0.792
0.0ValTrp: 0.0 ± 0.0
3.577ValTyr: 3.577 ± 1.17
0.0ValXaa: 0.0 ± 0.0
Trp
0.596TrpAla: 0.596 ± 0.337
0.298TrpCys: 0.298 ± 0.283
0.0TrpAsp: 0.0 ± 0.0
0.894TrpGlu: 0.894 ± 0.504
0.0TrpPhe: 0.0 ± 0.0
1.192TrpGly: 1.192 ± 0.476
0.0TrpHis: 0.0 ± 0.0
0.596TrpIle: 0.596 ± 0.567
2.385TrpLys: 2.385 ± 0.301
2.385TrpLeu: 2.385 ± 0.744
0.596TrpMet: 0.596 ± 0.293
0.596TrpAsn: 0.596 ± 0.327
0.0TrpPro: 0.0 ± 0.0
0.596TrpGln: 0.596 ± 0.395
1.192TrpArg: 1.192 ± 0.483
1.788TrpSer: 1.788 ± 0.294
1.49TrpThr: 1.49 ± 0.642
1.49TrpVal: 1.49 ± 0.483
0.0TrpTrp: 0.0 ± 0.0
0.298TrpTyr: 0.298 ± 0.289
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.894TyrAla: 0.894 ± 0.319
1.192TyrCys: 1.192 ± 0.451
2.086TyrAsp: 2.086 ± 0.665
1.788TyrGlu: 1.788 ± 0.452
0.298TyrPhe: 0.298 ± 0.233
0.596TyrGly: 0.596 ± 0.293
1.192TyrHis: 1.192 ± 0.277
1.788TyrIle: 1.788 ± 0.854
2.683TyrLys: 2.683 ± 0.465
2.086TyrLeu: 2.086 ± 0.549
0.894TyrMet: 0.894 ± 0.445
2.683TyrAsn: 2.683 ± 0.83
1.788TyrPro: 1.788 ± 0.294
1.788TyrGln: 1.788 ± 0.77
1.49TyrArg: 1.49 ± 0.642
3.279TyrSer: 3.279 ± 1.004
3.577TyrThr: 3.577 ± 0.829
0.894TyrVal: 0.894 ± 0.422
0.596TyrTrp: 0.596 ± 0.325
0.596TyrTyr: 0.596 ± 0.47
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3356 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski