Amino acid dipepetide frequency for Vesicular stomatitis New Jersey virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.535AlaAla: 2.535 ± 1.565
0.563AlaCys: 0.563 ± 0.32
3.099AlaAsp: 3.099 ± 1.415
2.254AlaGlu: 2.254 ± 0.831
1.127AlaPhe: 1.127 ± 0.463
1.972AlaGly: 1.972 ± 0.59
1.408AlaHis: 1.408 ± 0.595
2.254AlaIle: 2.254 ± 0.59
1.408AlaLys: 1.408 ± 0.777
5.352AlaLeu: 5.352 ± 1.389
0.563AlaMet: 0.563 ± 0.32
2.254AlaAsn: 2.254 ± 0.604
1.408AlaPro: 1.408 ± 1.554
1.69AlaGln: 1.69 ± 0.658
2.817AlaArg: 2.817 ± 0.844
2.535AlaSer: 2.535 ± 1.023
3.662AlaThr: 3.662 ± 1.137
3.662AlaVal: 3.662 ± 1.039
0.563AlaTrp: 0.563 ± 0.32
1.408AlaTyr: 1.408 ± 0.794
0.0AlaXaa: 0.0 ± 0.0
Cys
0.845CysAla: 0.845 ± 0.83
0.282CysCys: 0.282 ± 0.16
0.563CysAsp: 0.563 ± 0.728
0.845CysGlu: 0.845 ± 0.559
0.845CysPhe: 0.845 ± 0.428
1.408CysGly: 1.408 ± 0.521
0.282CysHis: 0.282 ± 0.364
0.845CysIle: 0.845 ± 0.48
1.972CysLys: 1.972 ± 0.702
1.69CysLeu: 1.69 ± 0.455
0.0CysMet: 0.0 ± 0.0
0.845CysAsn: 0.845 ± 0.48
1.127CysPro: 1.127 ± 0.476
1.972CysGln: 1.972 ± 0.267
1.127CysArg: 1.127 ± 0.403
1.127CysSer: 1.127 ± 0.476
0.563CysThr: 0.563 ± 0.32
0.845CysVal: 0.845 ± 0.48
0.563CysTrp: 0.563 ± 0.32
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.69AspAla: 1.69 ± 0.455
1.127AspCys: 1.127 ± 0.628
5.634AspAsp: 5.634 ± 1.535
3.099AspGlu: 3.099 ± 1.208
3.099AspPhe: 3.099 ± 0.878
3.38AspGly: 3.38 ± 1.634
0.282AspHis: 0.282 ± 0.16
3.944AspIle: 3.944 ± 0.779
4.507AspLys: 4.507 ± 1.727
9.014AspLeu: 9.014 ± 1.229
2.254AspMet: 2.254 ± 0.748
2.535AspAsn: 2.535 ± 0.955
3.944AspPro: 3.944 ± 1.131
2.535AspGln: 2.535 ± 1.012
0.845AspArg: 0.845 ± 0.83
5.07AspSer: 5.07 ± 0.929
1.69AspThr: 1.69 ± 0.633
3.38AspVal: 3.38 ± 0.585
1.972AspTrp: 1.972 ± 0.685
3.099AspTyr: 3.099 ± 0.503
0.0AspXaa: 0.0 ± 0.0
Glu
2.535GluAla: 2.535 ± 1.927
0.282GluCys: 0.282 ± 0.55
3.944GluAsp: 3.944 ± 2.27
5.352GluGlu: 5.352 ± 0.965
1.69GluPhe: 1.69 ± 0.692
2.817GluGly: 2.817 ± 0.868
2.254GluHis: 2.254 ± 0.59
3.662GluIle: 3.662 ± 0.682
4.507GluLys: 4.507 ± 1.12
6.479GluLeu: 6.479 ± 1.241
0.845GluMet: 0.845 ± 0.315
3.099GluAsn: 3.099 ± 1.013
1.69GluPro: 1.69 ± 0.631
1.127GluGln: 1.127 ± 0.463
2.817GluArg: 2.817 ± 0.744
6.197GluSer: 6.197 ± 2.813
2.254GluThr: 2.254 ± 0.596
3.099GluVal: 3.099 ± 0.391
0.563GluTrp: 0.563 ± 0.301
3.944GluTyr: 3.944 ± 0.656
0.0GluXaa: 0.0 ± 0.0
Phe
1.408PheAla: 1.408 ± 0.777
0.563PheCys: 0.563 ± 0.301
1.408PheAsp: 1.408 ± 1.22
1.69PheGlu: 1.69 ± 0.435
2.254PhePhe: 2.254 ± 0.802
3.38PheGly: 3.38 ± 1.027
1.408PheHis: 1.408 ± 0.792
2.254PheIle: 2.254 ± 0.626
4.225PheLys: 4.225 ± 1.232
5.352PheLeu: 5.352 ± 2.322
1.408PheMet: 1.408 ± 0.408
1.69PheAsn: 1.69 ± 1.124
2.535PhePro: 2.535 ± 1.539
1.408PheGln: 1.408 ± 0.521
3.099PheArg: 3.099 ± 1.314
2.817PheSer: 2.817 ± 0.874
2.254PheThr: 2.254 ± 1.642
1.408PheVal: 1.408 ± 0.521
0.845PheTrp: 0.845 ± 0.399
1.127PheTyr: 1.127 ± 0.462
0.0PheXaa: 0.0 ± 0.0
Gly
0.845GlyAla: 0.845 ± 1.023
1.127GlyCys: 1.127 ± 0.64
3.662GlyAsp: 3.662 ± 1.055
3.662GlyGlu: 3.662 ± 1.765
2.817GlyPhe: 2.817 ± 1.19
2.817GlyGly: 2.817 ± 0.352
1.127GlyHis: 1.127 ± 0.463
3.662GlyIle: 3.662 ± 0.941
3.662GlyLys: 3.662 ± 1.557
8.732GlyLeu: 8.732 ± 2.036
1.69GlyMet: 1.69 ± 0.855
1.972GlyAsn: 1.972 ± 0.829
3.099GlyPro: 3.099 ± 1.216
1.972GlyGln: 1.972 ± 0.85
3.662GlyArg: 3.662 ± 0.849
4.507GlySer: 4.507 ± 1.845
5.07GlyThr: 5.07 ± 1.175
3.662GlyVal: 3.662 ± 1.323
1.69GlyTrp: 1.69 ± 0.592
1.127GlyTyr: 1.127 ± 1.008
0.0GlyXaa: 0.0 ± 0.0
His
1.127HisAla: 1.127 ± 0.4
0.282HisCys: 0.282 ± 0.16
0.845HisAsp: 0.845 ± 0.48
1.127HisGlu: 1.127 ± 0.403
1.69HisPhe: 1.69 ± 0.592
1.127HisGly: 1.127 ± 0.4
0.0HisHis: 0.0 ± 0.0
3.38HisIle: 3.38 ± 1.094
2.535HisLys: 2.535 ± 0.914
1.972HisLeu: 1.972 ± 0.567
0.282HisMet: 0.282 ± 0.45
0.845HisAsn: 0.845 ± 0.648
1.69HisPro: 1.69 ± 0.455
1.69HisGln: 1.69 ± 0.658
1.408HisArg: 1.408 ± 0.59
1.69HisSer: 1.69 ± 1.022
0.563HisThr: 0.563 ± 0.301
0.845HisVal: 0.845 ± 0.428
0.845HisTrp: 0.845 ± 0.315
1.408HisTyr: 1.408 ± 0.521
0.0HisXaa: 0.0 ± 0.0
Ile
2.535IleAla: 2.535 ± 1.137
1.408IleCys: 1.408 ± 0.595
4.789IleAsp: 4.789 ± 0.865
5.07IleGlu: 5.07 ± 0.698
1.408IlePhe: 1.408 ± 0.387
6.479IleGly: 6.479 ± 1.184
0.845IleHis: 0.845 ± 0.315
5.07IleIle: 5.07 ± 1.484
5.352IleLys: 5.352 ± 1.235
4.507IleLeu: 4.507 ± 1.055
1.972IleMet: 1.972 ± 0.865
2.535IleAsn: 2.535 ± 0.782
6.197IlePro: 6.197 ± 2.039
3.944IleGln: 3.944 ± 1.204
5.352IleArg: 5.352 ± 1.991
4.789IleSer: 4.789 ± 1.331
3.662IleThr: 3.662 ± 1.784
2.817IleVal: 2.817 ± 1.666
1.127IleTrp: 1.127 ± 0.463
3.099IleTyr: 3.099 ± 0.487
0.0IleXaa: 0.0 ± 0.0
Lys
1.972LysAla: 1.972 ± 0.707
0.282LysCys: 0.282 ± 0.16
5.352LysAsp: 5.352 ± 0.411
4.789LysGlu: 4.789 ± 2.002
2.817LysPhe: 2.817 ± 0.897
5.352LysGly: 5.352 ± 0.698
1.408LysHis: 1.408 ± 0.486
6.479LysIle: 6.479 ± 0.504
6.479LysLys: 6.479 ± 1.398
3.944LysLeu: 3.944 ± 0.655
3.38LysMet: 3.38 ± 0.584
3.38LysAsn: 3.38 ± 1.486
2.817LysPro: 2.817 ± 0.792
1.127LysGln: 1.127 ± 0.4
3.662LysArg: 3.662 ± 0.313
5.07LysSer: 5.07 ± 1.272
4.225LysThr: 4.225 ± 0.499
3.099LysVal: 3.099 ± 1.464
2.254LysTrp: 2.254 ± 0.257
3.38LysTyr: 3.38 ± 1.593
0.0LysXaa: 0.0 ± 0.0
Leu
3.662LeuAla: 3.662 ± 0.395
1.408LeuCys: 1.408 ± 0.387
4.507LeuAsp: 4.507 ± 0.796
4.789LeuGlu: 4.789 ± 1.184
3.944LeuPhe: 3.944 ± 1.273
5.352LeuGly: 5.352 ± 0.978
3.38LeuHis: 3.38 ± 0.972
10.141LeuIle: 10.141 ± 2.441
7.887LeuLys: 7.887 ± 0.666
6.479LeuLeu: 6.479 ± 2.61
2.535LeuMet: 2.535 ± 1.027
3.944LeuAsn: 3.944 ± 1.28
3.38LeuPro: 3.38 ± 1.328
1.69LeuGln: 1.69 ± 1.173
5.915LeuArg: 5.915 ± 1.213
9.296LeuSer: 9.296 ± 1.173
6.479LeuThr: 6.479 ± 0.934
3.944LeuVal: 3.944 ± 1.056
1.408LeuTrp: 1.408 ± 1.125
3.944LeuTyr: 3.944 ± 1.28
0.0LeuXaa: 0.0 ± 0.0
Met
1.972MetAla: 1.972 ± 0.528
1.408MetCys: 1.408 ± 0.521
1.408MetAsp: 1.408 ± 0.69
2.535MetGlu: 2.535 ± 0.813
1.408MetPhe: 1.408 ± 0.806
1.69MetGly: 1.69 ± 1.118
1.408MetHis: 1.408 ± 0.8
1.408MetIle: 1.408 ± 0.567
0.845MetLys: 0.845 ± 0.463
2.254MetLeu: 2.254 ± 0.448
1.127MetMet: 1.127 ± 0.467
1.127MetAsn: 1.127 ± 0.717
1.408MetPro: 1.408 ± 0.656
0.845MetGln: 0.845 ± 0.48
0.845MetArg: 0.845 ± 0.48
3.662MetSer: 3.662 ± 1.748
1.972MetThr: 1.972 ± 0.602
0.845MetVal: 0.845 ± 0.777
0.0MetTrp: 0.0 ± 0.0
0.282MetTyr: 0.282 ± 0.45
0.0MetXaa: 0.0 ± 0.0
Asn
2.254AsnAla: 2.254 ± 0.665
0.282AsnCys: 0.282 ± 0.55
2.254AsnAsp: 2.254 ± 0.628
1.408AsnGlu: 1.408 ± 0.858
0.845AsnPhe: 0.845 ± 0.83
3.099AsnGly: 3.099 ± 0.597
0.845AsnHis: 0.845 ± 0.48
3.099AsnIle: 3.099 ± 1.086
2.817AsnLys: 2.817 ± 1.288
3.38AsnLeu: 3.38 ± 1.364
0.563AsnMet: 0.563 ± 0.293
1.972AsnAsn: 1.972 ± 0.829
2.254AsnPro: 2.254 ± 1.209
1.972AsnGln: 1.972 ± 0.567
1.408AsnArg: 1.408 ± 0.567
4.507AsnSer: 4.507 ± 0.685
2.535AsnThr: 2.535 ± 0.782
1.972AsnVal: 1.972 ± 0.724
1.69AsnTrp: 1.69 ± 0.959
1.69AsnTyr: 1.69 ± 0.631
0.0AsnXaa: 0.0 ± 0.0
Pro
3.099ProAla: 3.099 ± 0.972
0.282ProCys: 0.282 ± 0.16
3.662ProAsp: 3.662 ± 1.355
1.972ProGlu: 1.972 ± 1.128
3.38ProPhe: 3.38 ± 1.571
2.535ProGly: 2.535 ± 1.176
3.099ProHis: 3.099 ± 1.631
4.507ProIle: 4.507 ± 1.141
3.944ProLys: 3.944 ± 1.018
3.662ProLeu: 3.662 ± 1.387
0.845ProMet: 0.845 ± 0.878
1.408ProAsn: 1.408 ± 0.387
3.38ProPro: 3.38 ± 1.019
1.408ProGln: 1.408 ± 0.595
1.127ProArg: 1.127 ± 0.4
4.225ProSer: 4.225 ± 1.167
3.38ProThr: 3.38 ± 0.458
4.507ProVal: 4.507 ± 1.644
0.282ProTrp: 0.282 ± 0.16
1.408ProTyr: 1.408 ± 0.928
0.0ProXaa: 0.0 ± 0.0
Gln
1.972GlnAla: 1.972 ± 0.744
1.408GlnCys: 1.408 ± 0.387
1.127GlnAsp: 1.127 ± 0.965
1.972GlnGlu: 1.972 ± 0.501
1.127GlnPhe: 1.127 ± 0.463
1.69GlnGly: 1.69 ± 0.358
1.127GlnHis: 1.127 ± 0.4
2.817GlnIle: 2.817 ± 0.711
2.254GlnLys: 2.254 ± 0.678
3.099GlnLeu: 3.099 ± 0.33
1.127GlnMet: 1.127 ± 0.463
1.972GlnAsn: 1.972 ± 0.523
1.972GlnPro: 1.972 ± 1.251
0.563GlnGln: 0.563 ± 0.394
0.563GlnArg: 0.563 ± 0.32
2.817GlnSer: 2.817 ± 0.316
1.69GlnThr: 1.69 ± 0.633
1.69GlnVal: 1.69 ± 0.658
0.845GlnTrp: 0.845 ± 0.942
1.127GlnTyr: 1.127 ± 0.64
0.0GlnXaa: 0.0 ± 0.0
Arg
4.225ArgAla: 4.225 ± 1.424
1.408ArgCys: 1.408 ± 0.408
1.972ArgAsp: 1.972 ± 0.528
3.099ArgGlu: 3.099 ± 1.013
2.254ArgPhe: 2.254 ± 0.734
3.099ArgGly: 3.099 ± 1.759
0.563ArgHis: 0.563 ± 0.32
2.535ArgIle: 2.535 ± 0.794
1.69ArgLys: 1.69 ± 0.712
4.789ArgLeu: 4.789 ± 1.44
1.69ArgMet: 1.69 ± 0.703
1.972ArgAsn: 1.972 ± 0.829
2.254ArgPro: 2.254 ± 0.596
1.972ArgGln: 1.972 ± 0.528
1.972ArgArg: 1.972 ± 0.267
5.352ArgSer: 5.352 ± 0.567
2.254ArgThr: 2.254 ± 0.973
3.38ArgVal: 3.38 ± 1.033
0.845ArgTrp: 0.845 ± 0.315
1.408ArgTyr: 1.408 ± 0.445
0.0ArgXaa: 0.0 ± 0.0
Ser
3.944SerAla: 3.944 ± 0.188
1.69SerCys: 1.69 ± 0.658
7.042SerAsp: 7.042 ± 1.443
4.507SerGlu: 4.507 ± 1.869
2.254SerPhe: 2.254 ± 0.748
2.254SerGly: 2.254 ± 0.727
2.254SerHis: 2.254 ± 0.981
5.352SerIle: 5.352 ± 0.642
5.07SerLys: 5.07 ± 1.216
9.296SerLeu: 9.296 ± 1.018
2.817SerMet: 2.817 ± 1.263
3.38SerAsn: 3.38 ± 1.233
5.634SerPro: 5.634 ± 1.263
1.972SerGln: 1.972 ± 1.105
3.662SerArg: 3.662 ± 1.736
9.859SerSer: 9.859 ± 3.506
4.789SerThr: 4.789 ± 0.674
3.944SerVal: 3.944 ± 0.625
1.69SerTrp: 1.69 ± 0.959
2.817SerTyr: 2.817 ± 0.956
0.0SerXaa: 0.0 ± 0.0
Thr
1.69ThrAla: 1.69 ± 0.484
1.972ThrCys: 1.972 ± 0.528
4.225ThrAsp: 4.225 ± 1.945
2.535ThrGlu: 2.535 ± 0.85
3.38ThrPhe: 3.38 ± 0.698
4.507ThrGly: 4.507 ± 1.697
1.69ThrHis: 1.69 ± 0.631
2.817ThrIle: 2.817 ± 0.929
3.38ThrLys: 3.38 ± 0.458
3.944ThrLeu: 3.944 ± 0.955
2.535ThrMet: 2.535 ± 0.749
1.69ThrAsn: 1.69 ± 0.692
2.535ThrPro: 2.535 ± 1.008
1.408ThrGln: 1.408 ± 0.437
2.254ThrArg: 2.254 ± 0.727
4.507ThrSer: 4.507 ± 0.908
3.662ThrThr: 3.662 ± 0.652
2.817ThrVal: 2.817 ± 1.39
1.69ThrTrp: 1.69 ± 0.361
2.817ThrTyr: 2.817 ± 0.904
0.0ThrXaa: 0.0 ± 0.0
Val
2.535ValAla: 2.535 ± 1.663
1.69ValCys: 1.69 ± 0.91
3.099ValAsp: 3.099 ± 1.239
4.507ValGlu: 4.507 ± 2.237
1.408ValPhe: 1.408 ± 0.595
3.099ValGly: 3.099 ± 1.004
0.563ValHis: 0.563 ± 0.301
4.507ValIle: 4.507 ± 1.598
3.38ValLys: 3.38 ± 2.021
4.789ValLeu: 4.789 ± 0.767
1.127ValMet: 1.127 ± 0.965
1.69ValAsn: 1.69 ± 0.692
2.535ValPro: 2.535 ± 0.434
1.69ValGln: 1.69 ± 0.682
2.817ValArg: 2.817 ± 0.747
3.662ValSer: 3.662 ± 1.689
3.099ValThr: 3.099 ± 0.91
1.127ValVal: 1.127 ± 0.403
0.282ValTrp: 0.282 ± 0.471
1.69ValTyr: 1.69 ± 0.692
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.69TrpAsp: 1.69 ± 0.646
2.817TrpGlu: 2.817 ± 0.316
2.535TrpPhe: 2.535 ± 1.186
1.408TrpGly: 1.408 ± 0.387
0.282TrpHis: 0.282 ± 0.16
1.408TrpIle: 1.408 ± 0.521
1.408TrpLys: 1.408 ± 0.521
1.127TrpLeu: 1.127 ± 0.788
0.282TrpMet: 0.282 ± 0.364
0.845TrpAsn: 0.845 ± 0.463
0.282TrpPro: 0.282 ± 0.16
0.282TrpGln: 0.282 ± 0.16
1.408TrpArg: 1.408 ± 0.437
1.408TrpSer: 1.408 ± 0.8
0.845TrpThr: 0.845 ± 0.463
1.127TrpVal: 1.127 ± 0.888
0.0TrpTrp: 0.0 ± 0.0
0.563TrpTyr: 0.563 ± 0.584
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.69TyrAla: 1.69 ± 0.809
0.282TyrCys: 0.282 ± 0.364
2.817TyrAsp: 2.817 ± 1.33
1.127TyrGlu: 1.127 ± 0.49
2.254TyrPhe: 2.254 ± 0.586
3.099TyrGly: 3.099 ± 0.858
1.127TyrHis: 1.127 ± 0.64
2.817TyrIle: 2.817 ± 0.64
3.944TyrLys: 3.944 ± 1.151
4.225TyrLeu: 4.225 ± 0.965
1.127TyrMet: 1.127 ± 0.888
1.69TyrAsn: 1.69 ± 0.455
1.972TyrPro: 1.972 ± 0.85
1.69TyrGln: 1.69 ± 0.712
2.254TyrArg: 2.254 ± 0.665
1.127TyrSer: 1.127 ± 0.467
1.408TyrThr: 1.408 ± 0.794
1.127TyrVal: 1.127 ± 0.788
0.563TyrTrp: 0.563 ± 0.483
1.972TyrTyr: 1.972 ± 1.128
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3551 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski