Amino acid dipeptide frequency for West Caucasian bat virus (WCBV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and all underrepresented ones in blue, in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
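As an illustration of how such values can be obtained, the following is a minimal sketch that counts overlapping dipeptides within each protein sequence and reports them in per mille. It is not the Proteome-pI implementation: the function names `dipeptide_permille` and `representation` are hypothetical, sequences are assumed to use one-letter codes (the table uses three-letter codes), and the exact denominator used by the database is an assumption.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

# Expected value if all 400 standard dipeptides were equally likely (assumption
# taken from the text above): 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs: positions (0,1), (1,2), ... within one protein.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq_permille):
    """Classify a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"
```

With this classification, the AlaAla value of 1.547‰ would count as underrepresented and the SerLeu value of 10.83‰ as overrepresented.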

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.547 ± 0.89
AlaCys: 0.774 ± 0.748
AlaAsp: 2.321 ± 1.726
AlaGlu: 2.579 ± 0.799
AlaPhe: 1.289 ± 0.481
AlaGly: 2.837 ± 0.666
AlaHis: 2.321 ± 0.915
AlaIle: 3.868 ± 1.07
AlaLys: 1.805 ± 0.527
AlaLeu: 5.157 ± 1.056
AlaMet: 0.516 ± 0.297
AlaAsn: 1.289 ± 0.643
AlaPro: 1.289 ± 1.084
AlaGln: 1.805 ± 0.796
AlaArg: 2.837 ± 0.759
AlaSer: 2.321 ± 0.583
AlaThr: 1.805 ± 0.634
AlaVal: 2.321 ± 0.972
AlaTrp: 0.258 ± 0.155
AlaTyr: 2.063 ± 0.898
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.516 ± 0.747
CysCys: 0.516 ± 0.24
CysAsp: 0.258 ± 0.155
CysGlu: 0.0 ± 0.0
CysPhe: 0.516 ± 0.311
CysGly: 1.031 ± 0.48
CysHis: 0.258 ± 0.354
CysIle: 0.774 ± 0.332
CysLys: 1.031 ± 0.507
CysLeu: 2.321 ± 0.683
CysMet: 0.516 ± 0.561
CysAsn: 1.031 ± 0.378
CysPro: 1.805 ± 0.473
CysGln: 1.031 ± 0.418
CysArg: 1.031 ± 0.648
CysSer: 2.321 ± 0.453
CysThr: 0.774 ± 0.756
CysVal: 0.516 ± 0.306
CysTrp: 0.258 ± 0.155
CysTyr: 0.774 ± 0.403
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.063 ± 0.65
AspCys: 0.516 ± 0.518
AspAsp: 4.642 ± 3.045
AspGlu: 5.931 ± 3.272
AspPhe: 3.61 ± 0.841
AspGly: 1.805 ± 0.643
AspHis: 1.289 ± 0.643
AspIle: 4.384 ± 1.668
AspLys: 2.579 ± 0.612
AspLeu: 8.767 ± 1.544
AspMet: 1.289 ± 0.423
AspAsn: 2.321 ± 0.228
AspPro: 3.868 ± 0.683
AspGln: 2.063 ± 0.892
AspArg: 2.321 ± 1.041
AspSer: 3.352 ± 0.585
AspThr: 2.837 ± 0.59
AspVal: 2.063 ± 0.65
AspTrp: 1.031 ± 0.325
AspTyr: 1.289 ± 0.777
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.837 ± 0.828
GluCys: 0.258 ± 0.296
GluAsp: 4.899 ± 2.009
GluGlu: 5.415 ± 1.311
GluPhe: 2.063 ± 0.553
GluGly: 5.415 ± 1.855
GluHis: 1.805 ± 1.493
GluIle: 4.642 ± 0.904
GluLys: 3.094 ± 1.308
GluLeu: 3.868 ± 0.894
GluMet: 3.094 ± 0.53
GluAsn: 2.321 ± 0.627
GluPro: 3.61 ± 1.219
GluGln: 1.289 ± 0.631
GluArg: 3.352 ± 0.298
GluSer: 7.22 ± 0.908
GluThr: 3.352 ± 1.441
GluVal: 4.384 ± 0.589
GluTrp: 0.258 ± 0.155
GluTyr: 2.579 ± 1.791
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.774 ± 0.403
PheCys: 0.516 ± 0.306
PheAsp: 2.063 ± 0.757
PheGlu: 2.321 ± 1.482
PhePhe: 2.321 ± 1.151
PheGly: 1.031 ± 0.361
PheHis: 1.547 ± 0.369
PheIle: 1.805 ± 0.474
PheLys: 3.094 ± 1.019
PheLeu: 5.157 ± 0.906
PheMet: 0.258 ± 0.155
PheAsn: 1.805 ± 0.542
PhePro: 3.352 ± 0.745
PheGln: 1.805 ± 0.573
PheArg: 3.868 ± 1.938
PheSer: 3.352 ± 1.073
PheThr: 1.547 ± 0.82
PheVal: 2.321 ± 0.617
PheTrp: 0.258 ± 0.155
PheTyr: 0.516 ± 0.297
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.063 ± 0.257
GlyCys: 0.774 ± 0.403
GlyAsp: 3.868 ± 1.322
GlyGlu: 3.352 ± 1.869
GlyPhe: 1.031 ± 0.325
GlyGly: 4.642 ± 0.608
GlyHis: 0.516 ± 0.297
GlyIle: 3.61 ± 1.028
GlyLys: 2.837 ± 0.816
GlyLeu: 7.478 ± 0.682
GlyMet: 0.774 ± 0.332
GlyAsn: 2.321 ± 0.649
GlyPro: 1.805 ± 0.555
GlyGln: 2.321 ± 0.509
GlyArg: 3.352 ± 0.585
GlySer: 3.61 ± 0.809
GlyThr: 2.837 ± 1.328
GlyVal: 2.579 ± 0.951
GlyTrp: 0.774 ± 0.466
GlyTyr: 3.094 ± 0.31
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.805 ± 0.408
HisCys: 0.258 ± 0.296
HisAsp: 1.289 ± 0.749
HisGlu: 1.805 ± 0.479
HisPhe: 1.031 ± 1.036
HisGly: 1.031 ± 0.622
HisHis: 0.258 ± 0.373
HisIle: 2.579 ± 0.65
HisLys: 2.321 ± 1.408
HisLeu: 3.094 ± 1.105
HisMet: 0.258 ± 0.155
HisAsn: 0.0 ± 0.0
HisPro: 2.063 ± 0.538
HisGln: 1.031 ± 0.58
HisArg: 1.031 ± 0.48
HisSer: 1.031 ± 0.361
HisThr: 0.516 ± 0.531
HisVal: 1.031 ± 0.325
HisTrp: 0.774 ± 0.466
HisTyr: 1.547 ± 0.649
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.321 ± 1.647
IleCys: 1.289 ± 0.377
IleAsp: 3.094 ± 0.739
IleGlu: 3.094 ± 0.574
IlePhe: 1.805 ± 0.408
IleGly: 2.837 ± 0.814
IleHis: 2.063 ± 0.547
IleIle: 5.673 ± 1.135
IleLys: 3.352 ± 0.627
IleLeu: 7.994 ± 2.784
IleMet: 1.031 ± 0.593
IleAsn: 2.579 ± 0.622
IlePro: 2.579 ± 0.922
IleGln: 1.289 ± 0.481
IleArg: 5.931 ± 1.435
IleSer: 9.283 ± 1.631
IleThr: 3.352 ± 0.585
IleVal: 3.61 ± 1.351
IleTrp: 1.031 ± 0.378
IleTyr: 3.094 ± 0.97
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.805 ± 0.725
LysCys: 0.258 ± 0.296
LysAsp: 3.094 ± 1.123
LysGlu: 3.352 ± 2.021
LysPhe: 0.774 ± 0.486
LysGly: 3.352 ± 1.198
LysHis: 1.289 ± 0.417
LysIle: 5.931 ± 0.831
LysLys: 5.673 ± 1.701
LysLeu: 7.478 ± 1.797
LysMet: 1.805 ± 0.533
LysAsn: 1.289 ± 0.481
LysPro: 3.352 ± 0.891
LysGln: 0.774 ± 0.419
LysArg: 2.321 ± 0.228
LysSer: 6.704 ± 1.442
LysThr: 3.094 ± 0.934
LysVal: 3.352 ± 0.875
LysTrp: 0.516 ± 0.311
LysTyr: 2.321 ± 0.543
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.126 ± 0.87
LeuCys: 2.063 ± 0.455
LeuAsp: 5.931 ± 0.78
LeuGlu: 6.447 ± 1.121
LeuPhe: 5.415 ± 1.668
LeuGly: 4.384 ± 0.831
LeuHis: 1.547 ± 0.338
LeuIle: 7.22 ± 1.493
LeuLys: 7.478 ± 0.858
LeuLeu: 7.736 ± 2.719
LeuMet: 5.931 ± 1.582
LeuAsn: 3.61 ± 0.543
LeuPro: 2.837 ± 0.888
LeuGln: 3.352 ± 0.747
LeuArg: 7.994 ± 1.483
LeuSer: 9.025 ± 1.47
LeuThr: 4.384 ± 2.328
LeuVal: 7.22 ± 0.892
LeuTrp: 1.805 ± 0.812
LeuTyr: 4.126 ± 0.971
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.805 ± 0.389
MetCys: 0.774 ± 0.292
MetAsp: 1.289 ± 0.311
MetGlu: 1.031 ± 0.325
MetPhe: 1.289 ± 0.568
MetGly: 0.516 ± 0.297
MetHis: 0.774 ± 0.486
MetIle: 1.805 ± 0.812
MetLys: 1.031 ± 0.418
MetLeu: 2.837 ± 0.829
MetMet: 0.258 ± 0.354
MetAsn: 1.805 ± 1.324
MetPro: 0.516 ± 0.518
MetGln: 0.774 ± 0.624
MetArg: 2.063 ± 0.892
MetSer: 4.384 ± 0.769
MetThr: 1.547 ± 0.379
MetVal: 1.031 ± 0.48
MetTrp: 0.258 ± 0.155
MetTyr: 0.258 ± 0.155
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.837 ± 0.929
AsnCys: 0.516 ± 0.311
AsnAsp: 1.289 ± 0.508
AsnGlu: 1.547 ± 1.101
AsnPhe: 2.837 ± 1.533
AsnGly: 1.289 ± 0.92
AsnHis: 1.547 ± 0.488
AsnIle: 4.126 ± 1.89
AsnLys: 1.547 ± 0.584
AsnLeu: 4.384 ± 0.46
AsnMet: 0.774 ± 0.763
AsnAsn: 1.547 ± 0.562
AsnPro: 3.352 ± 0.438
AsnGln: 1.547 ± 0.489
AsnArg: 3.094 ± 0.654
AsnSer: 3.094 ± 1.065
AsnThr: 1.547 ± 0.71
AsnVal: 1.547 ± 0.418
AsnTrp: 1.289 ± 0.833
AsnTyr: 1.031 ± 0.361
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.031 ± 0.361
ProCys: 0.516 ± 0.311
ProAsp: 2.579 ± 0.588
ProGlu: 5.931 ± 0.51
ProPhe: 1.031 ± 0.778
ProGly: 1.547 ± 0.562
ProHis: 1.289 ± 0.915
ProIle: 1.289 ± 0.508
ProLys: 2.321 ± 0.568
ProLeu: 6.189 ± 1.289
ProMet: 0.774 ± 0.276
ProAsn: 3.094 ± 1.414
ProPro: 3.352 ± 1.361
ProGln: 1.031 ± 0.622
ProArg: 2.579 ± 0.795
ProSer: 5.415 ± 0.646
ProThr: 2.321 ± 0.228
ProVal: 3.868 ± 0.797
ProTrp: 0.258 ± 0.296
ProTyr: 1.289 ± 0.549
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.547 ± 1.101
GlnCys: 0.774 ± 0.332
GlnAsp: 2.579 ± 0.79
GlnGlu: 2.063 ± 0.469
GlnPhe: 0.774 ± 0.466
GlnGly: 2.321 ± 0.438
GlnHis: 0.516 ± 0.311
GlnIle: 3.352 ± 0.924
GlnLys: 3.352 ± 0.839
GlnLeu: 2.063 ± 0.478
GlnMet: 0.774 ± 0.301
GlnAsn: 1.031 ± 0.361
GlnPro: 0.516 ± 0.311
GlnGln: 0.516 ± 0.306
GlnArg: 1.031 ± 0.622
GlnSer: 2.063 ± 0.476
GlnThr: 1.289 ± 0.311
GlnVal: 2.837 ± 0.814
GlnTrp: 0.516 ± 0.306
GlnTyr: 0.774 ± 0.292
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.352 ± 0.715
ArgCys: 1.805 ± 0.389
ArgAsp: 3.868 ± 0.763
ArgGlu: 3.61 ± 0.437
ArgPhe: 2.837 ± 0.598
ArgGly: 3.868 ± 0.839
ArgHis: 1.547 ± 0.448
ArgIle: 2.321 ± 0.828
ArgLys: 2.063 ± 0.779
ArgLeu: 3.61 ± 0.947
ArgMet: 2.321 ± 0.589
ArgAsn: 2.579 ± 0.588
ArgPro: 2.063 ± 0.565
ArgGln: 1.547 ± 0.649
ArgArg: 4.126 ± 0.874
ArgSer: 8.252 ± 0.77
ArgThr: 3.868 ± 1.174
ArgVal: 4.899 ± 1.286
ArgTrp: 1.031 ± 0.622
ArgTyr: 1.805 ± 0.745
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.868 ± 0.148
SerCys: 1.547 ± 0.933
SerAsp: 7.22 ± 0.922
SerGlu: 8.51 ± 1.843
SerPhe: 4.384 ± 1.099
SerGly: 5.673 ± 1.249
SerHis: 2.063 ± 1.024
SerIle: 3.352 ± 0.849
SerLys: 7.736 ± 2.898
SerLeu: 10.83 ± 3.132
SerMet: 1.289 ± 0.508
SerAsn: 4.126 ± 0.697
SerPro: 5.931 ± 0.456
SerGln: 3.868 ± 0.383
SerArg: 6.447 ± 1.155
SerSer: 7.736 ± 1.204
SerThr: 4.384 ± 1.654
SerVal: 6.189 ± 0.751
SerTrp: 2.321 ± 0.743
SerTyr: 4.642 ± 1.734
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.321 ± 1.643
ThrCys: 1.805 ± 1.301
ThrAsp: 1.547 ± 0.369
ThrGlu: 2.063 ± 1.024
ThrPhe: 1.031 ± 0.739
ThrGly: 2.837 ± 0.411
ThrHis: 1.805 ± 0.474
ThrIle: 3.094 ± 0.92
ThrLys: 2.063 ± 0.461
ThrLeu: 4.899 ± 0.841
ThrMet: 1.289 ± 0.777
ThrAsn: 2.063 ± 0.508
ThrPro: 1.031 ± 0.378
ThrGln: 2.063 ± 0.681
ThrArg: 2.837 ± 1.028
ThrSer: 5.157 ± 1.017
ThrThr: 3.61 ± 1.002
ThrVal: 3.352 ± 1.024
ThrTrp: 2.063 ± 0.478
ThrTyr: 2.579 ± 0.703
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.352 ± 0.72
ValCys: 1.031 ± 0.739
ValAsp: 3.352 ± 0.565
ValGlu: 3.094 ± 0.961
ValPhe: 3.61 ± 0.662
ValGly: 4.899 ± 2.08
ValHis: 1.031 ± 0.622
ValIle: 4.642 ± 0.256
ValLys: 1.289 ± 0.819
ValLeu: 3.61 ± 0.905
ValMet: 1.031 ± 0.455
ValAsn: 2.837 ± 0.755
ValPro: 3.352 ± 0.595
ValGln: 2.063 ± 0.91
ValArg: 2.579 ± 0.612
ValSer: 6.704 ± 1.21
ValThr: 4.384 ± 1.03
ValVal: 3.352 ± 0.511
ValTrp: 0.258 ± 0.155
ValTyr: 3.61 ± 1.138
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.516 ± 0.311
TrpCys: 0.516 ± 0.306
TrpAsp: 0.516 ± 0.306
TrpGlu: 1.031 ± 0.378
TrpPhe: 0.258 ± 0.155
TrpGly: 1.031 ± 0.622
TrpHis: 0.516 ± 0.311
TrpIle: 1.289 ± 0.474
TrpLys: 0.774 ± 0.276
TrpLeu: 1.031 ± 0.362
TrpMet: 0.774 ± 0.516
TrpAsn: 1.289 ± 0.475
TrpPro: 0.258 ± 0.155
TrpGln: 0.0 ± 0.0
TrpArg: 0.774 ± 0.466
TrpSer: 2.837 ± 0.858
TrpThr: 0.516 ± 0.24
TrpVal: 0.774 ± 0.411
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.258 ± 0.155
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.516 ± 0.311
TyrCys: 0.774 ± 0.516
TyrAsp: 2.321 ± 0.543
TyrGlu: 2.579 ± 0.955
TyrPhe: 1.805 ± 0.716
TyrGly: 1.289 ± 0.568
TyrHis: 0.774 ± 0.466
TyrIle: 1.547 ± 0.551
TyrLys: 3.094 ± 1.049
TyrLeu: 4.126 ± 0.488
TyrMet: 1.031 ± 0.361
TyrAsn: 2.063 ± 0.892
TyrPro: 0.774 ± 0.403
TyrGln: 0.516 ± 0.311
TyrArg: 1.805 ± 0.857
TyrSer: 8.252 ± 1.511
TyrThr: 1.547 ± 1.395
TyrVal: 2.837 ± 1.161
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.516 ± 0.311
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3879 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
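A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. This is only an illustration under assumptions: the function names `pair_counts` and `bootstrap_error` are hypothetical, one-letter sequence codes are assumed, and whether Proteome-pI reports a standard deviation or another measure of spread is not stated in the source.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide's frequency.

    Proteins, not individual residues, are the resampling unit.
    The population standard deviation is used here as an assumption.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)
```

With only 6 proteins in this proteome, each resample is a small draw, which may help explain why some of the reported errors are large relative to the values themselves.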

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski