Amino acid dipepetide frequency for Vesicular stomatitis virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.535AlaAla: 2.535 ± 1.325
0.563AlaCys: 0.563 ± 0.315
3.099AlaAsp: 3.099 ± 1.16
2.254AlaGlu: 2.254 ± 0.801
1.127AlaPhe: 1.127 ± 0.416
1.972AlaGly: 1.972 ± 0.534
1.127AlaHis: 1.127 ± 0.376
2.254AlaIle: 2.254 ± 0.519
1.408AlaLys: 1.408 ± 1.029
5.352AlaLeu: 5.352 ± 1.172
0.563AlaMet: 0.563 ± 0.315
2.254AlaAsn: 2.254 ± 0.64
1.408AlaPro: 1.408 ± 1.459
1.408AlaGln: 1.408 ± 0.488
2.535AlaArg: 2.535 ± 0.761
2.535AlaSer: 2.535 ± 0.937
3.662AlaThr: 3.662 ± 0.906
3.662AlaVal: 3.662 ± 0.932
0.563AlaTrp: 0.563 ± 0.315
1.408AlaTyr: 1.408 ± 0.637
0.0AlaXaa: 0.0 ± 0.0
Cys
0.845CysAla: 0.845 ± 0.705
0.282CysCys: 0.282 ± 0.157
0.563CysAsp: 0.563 ± 0.765
0.845CysGlu: 0.845 ± 0.497
0.845CysPhe: 0.845 ± 0.372
1.408CysGly: 1.408 ± 0.488
0.282CysHis: 0.282 ± 0.382
0.845CysIle: 0.845 ± 0.472
2.254CysLys: 2.254 ± 0.485
1.69CysLeu: 1.69 ± 0.362
0.0CysMet: 0.0 ± 0.0
0.845CysAsn: 0.845 ± 0.472
1.127CysPro: 1.127 ± 0.399
1.69CysGln: 1.69 ± 0.33
1.408CysArg: 1.408 ± 0.347
1.127CysSer: 1.127 ± 0.399
0.563CysThr: 0.563 ± 0.315
0.845CysVal: 0.845 ± 0.472
0.563CysTrp: 0.563 ± 0.315
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.69AspAla: 1.69 ± 0.362
1.127AspCys: 1.127 ± 0.623
5.352AspAsp: 5.352 ± 1.134
2.817AspGlu: 2.817 ± 1.061
3.099AspPhe: 3.099 ± 0.766
3.38AspGly: 3.38 ± 1.315
0.282AspHis: 0.282 ± 0.157
3.944AspIle: 3.944 ± 0.636
4.507AspLys: 4.507 ± 1.689
9.014AspLeu: 9.014 ± 0.918
2.254AspMet: 2.254 ± 0.717
2.254AspAsn: 2.254 ± 0.515
3.944AspPro: 3.944 ± 0.836
2.535AspGln: 2.535 ± 0.739
1.127AspArg: 1.127 ± 0.876
5.07AspSer: 5.07 ± 0.799
1.69AspThr: 1.69 ± 0.591
3.38AspVal: 3.38 ± 0.568
1.972AspTrp: 1.972 ± 0.661
3.38AspTyr: 3.38 ± 0.376
0.0AspXaa: 0.0 ± 0.0
Glu
2.254GluAla: 2.254 ± 1.771
0.282GluCys: 0.282 ± 0.469
3.944GluAsp: 3.944 ± 2.209
5.634GluGlu: 5.634 ± 1.364
1.408GluPhe: 1.408 ± 0.528
2.817GluGly: 2.817 ± 0.892
2.254GluHis: 2.254 ± 0.519
3.38GluIle: 3.38 ± 0.675
4.507GluLys: 4.507 ± 1.029
6.761GluLeu: 6.761 ± 1.199
0.845GluMet: 0.845 ± 0.305
2.817GluAsn: 2.817 ± 1.08
1.69GluPro: 1.69 ± 0.611
1.127GluGln: 1.127 ± 0.416
2.817GluArg: 2.817 ± 0.68
5.915GluSer: 5.915 ± 1.762
2.535GluThr: 2.535 ± 0.673
3.099GluVal: 3.099 ± 0.335
0.563GluTrp: 0.563 ± 0.308
3.662GluTyr: 3.662 ± 0.71
0.0GluXaa: 0.0 ± 0.0
Phe
1.408PheAla: 1.408 ± 0.652
0.563PheCys: 0.563 ± 0.308
1.127PheAsp: 1.127 ± 1.089
1.408PheGlu: 1.408 ± 0.528
2.254PhePhe: 2.254 ± 0.756
3.38PheGly: 3.38 ± 0.86
1.127PheHis: 1.127 ± 0.661
2.254PheIle: 2.254 ± 0.549
4.225PheLys: 4.225 ± 1.151
5.352PheLeu: 5.352 ± 1.951
1.408PheMet: 1.408 ± 0.403
1.69PheAsn: 1.69 ± 0.91
2.817PhePro: 2.817 ± 1.581
1.408PheGln: 1.408 ± 0.488
3.38PheArg: 3.38 ± 1.352
2.535PheSer: 2.535 ± 0.551
2.254PheThr: 2.254 ± 1.711
1.408PheVal: 1.408 ± 0.488
0.845PheTrp: 0.845 ± 0.341
1.127PheTyr: 1.127 ± 0.402
0.0PheXaa: 0.0 ± 0.0
Gly
0.845GlyAla: 0.845 ± 0.855
1.127GlyCys: 1.127 ± 0.629
3.662GlyAsp: 3.662 ± 0.967
3.662GlyGlu: 3.662 ± 1.821
2.535GlyPhe: 2.535 ± 0.916
2.817GlyGly: 2.817 ± 0.31
0.845GlyHis: 0.845 ± 0.341
3.099GlyIle: 3.099 ± 0.478
3.38GlyLys: 3.38 ± 1.182
8.732GlyLeu: 8.732 ± 1.79
1.972GlyMet: 1.972 ± 0.689
2.535GlyAsn: 2.535 ± 0.787
3.38GlyPro: 3.38 ± 1.056
1.972GlyGln: 1.972 ± 0.744
3.944GlyArg: 3.944 ± 0.65
5.07GlySer: 5.07 ± 1.745
5.07GlyThr: 5.07 ± 1.233
3.944GlyVal: 3.944 ± 1.726
1.69GlyTrp: 1.69 ± 0.527
1.127GlyTyr: 1.127 ± 1.055
0.0GlyXaa: 0.0 ± 0.0
His
0.845HisAla: 0.845 ± 0.305
0.563HisCys: 0.563 ± 0.315
0.845HisAsp: 0.845 ± 0.472
1.127HisGlu: 1.127 ± 0.313
1.69HisPhe: 1.69 ± 0.527
1.127HisGly: 1.127 ± 0.376
0.282HisHis: 0.282 ± 0.157
3.662HisIle: 3.662 ± 0.987
2.535HisLys: 2.535 ± 0.825
1.972HisLeu: 1.972 ± 0.503
0.282HisMet: 0.282 ± 0.389
0.845HisAsn: 0.845 ± 0.677
1.408HisPro: 1.408 ± 0.488
1.69HisGln: 1.69 ± 0.621
1.408HisArg: 1.408 ± 0.566
1.69HisSer: 1.69 ± 1.031
0.845HisThr: 0.845 ± 0.305
0.845HisVal: 0.845 ± 0.472
0.845HisTrp: 0.845 ± 0.305
0.845HisTyr: 0.845 ± 0.305
0.0HisXaa: 0.0 ± 0.0
Ile
2.254IleAla: 2.254 ± 0.982
1.127IleCys: 1.127 ± 0.617
4.789IleAsp: 4.789 ± 1.003
5.07IleGlu: 5.07 ± 0.654
1.408IlePhe: 1.408 ± 0.281
6.197IleGly: 6.197 ± 1.255
0.845IleHis: 0.845 ± 0.305
4.789IleIle: 4.789 ± 1.173
5.352IleLys: 5.352 ± 1.605
4.507IleLeu: 4.507 ± 1.085
1.69IleMet: 1.69 ± 0.734
2.817IleAsn: 2.817 ± 0.685
5.915IlePro: 5.915 ± 1.817
3.38IleGln: 3.38 ± 1.07
5.07IleArg: 5.07 ± 2.082
4.507IleSer: 4.507 ± 1.177
3.38IleThr: 3.38 ± 1.85
2.817IleVal: 2.817 ± 1.505
1.408IleTrp: 1.408 ± 0.4
3.099IleTyr: 3.099 ± 0.478
0.0IleXaa: 0.0 ± 0.0
Lys
1.972LysAla: 1.972 ± 0.594
0.282LysCys: 0.282 ± 0.157
5.07LysAsp: 5.07 ± 0.535
4.225LysGlu: 4.225 ± 2.05
2.535LysPhe: 2.535 ± 0.767
5.915LysGly: 5.915 ± 0.663
1.408LysHis: 1.408 ± 0.482
6.197LysIle: 6.197 ± 0.556
6.197LysLys: 6.197 ± 1.218
3.944LysLeu: 3.944 ± 0.622
3.662LysMet: 3.662 ± 0.673
3.662LysAsn: 3.662 ± 1.19
2.817LysPro: 2.817 ± 0.632
1.127LysGln: 1.127 ± 0.376
3.944LysArg: 3.944 ± 0.65
5.634LysSer: 5.634 ± 1.683
4.225LysThr: 4.225 ± 0.495
3.38LysVal: 3.38 ± 1.718
2.254LysTrp: 2.254 ± 0.253
3.38LysTyr: 3.38 ± 1.476
0.0LysXaa: 0.0 ± 0.0
Leu
3.662LeuAla: 3.662 ± 0.402
1.408LeuCys: 1.408 ± 0.281
4.789LeuAsp: 4.789 ± 0.603
4.789LeuGlu: 4.789 ± 1.118
3.944LeuPhe: 3.944 ± 0.99
5.352LeuGly: 5.352 ± 0.929
3.38LeuHis: 3.38 ± 0.863
9.577LeuIle: 9.577 ± 2.042
8.169LeuLys: 8.169 ± 0.927
6.479LeuLeu: 6.479 ± 2.401
2.817LeuMet: 2.817 ± 1.055
4.225LeuAsn: 4.225 ± 1.297
3.38LeuPro: 3.38 ± 1.236
1.69LeuGln: 1.69 ± 1.017
5.634LeuArg: 5.634 ± 1.493
9.296LeuSer: 9.296 ± 1.391
6.197LeuThr: 6.197 ± 0.777
4.225LeuVal: 4.225 ± 0.994
1.408LeuTrp: 1.408 ± 1.141
3.944LeuTyr: 3.944 ± 1.192
0.0LeuXaa: 0.0 ± 0.0
Met
1.972MetAla: 1.972 ± 0.8
1.408MetCys: 1.408 ± 0.488
1.408MetAsp: 1.408 ± 0.516
2.535MetGlu: 2.535 ± 0.803
1.408MetPhe: 1.408 ± 0.698
1.69MetGly: 1.69 ± 0.994
1.408MetHis: 1.408 ± 0.787
1.408MetIle: 1.408 ± 0.528
1.408MetLys: 1.408 ± 1.248
2.254MetLeu: 2.254 ± 0.376
1.127MetMet: 1.127 ± 0.482
1.127MetAsn: 1.127 ± 0.535
1.408MetPro: 1.408 ± 0.611
1.127MetGln: 1.127 ± 0.428
1.127MetArg: 1.127 ± 0.376
3.662MetSer: 3.662 ± 1.741
2.254MetThr: 2.254 ± 0.624
0.845MetVal: 0.845 ± 0.597
0.0MetTrp: 0.0 ± 0.0
0.282MetTyr: 0.282 ± 0.389
0.0MetXaa: 0.0 ± 0.0
Asn
2.254AsnAla: 2.254 ± 0.692
0.282AsnCys: 0.282 ± 0.469
2.817AsnAsp: 2.817 ± 0.937
1.69AsnGlu: 1.69 ± 0.647
0.845AsnPhe: 0.845 ± 0.705
3.38AsnGly: 3.38 ± 0.422
1.408AsnHis: 1.408 ± 0.787
2.817AsnIle: 2.817 ± 0.948
2.817AsnLys: 2.817 ± 1.281
3.38AsnLeu: 3.38 ± 1.276
0.563AsnMet: 0.563 ± 0.29
1.69AsnAsn: 1.69 ± 0.659
2.254AsnPro: 2.254 ± 1.181
1.972AsnGln: 1.972 ± 0.503
1.408AsnArg: 1.408 ± 0.528
3.944AsnSer: 3.944 ± 0.342
2.535AsnThr: 2.535 ± 0.705
1.972AsnVal: 1.972 ± 0.633
1.69AsnTrp: 1.69 ± 0.944
1.69AsnTyr: 1.69 ± 0.611
0.0AsnXaa: 0.0 ± 0.0
Pro
3.099ProAla: 3.099 ± 0.939
0.282ProCys: 0.282 ± 0.157
3.944ProAsp: 3.944 ± 1.627
1.972ProGlu: 1.972 ± 0.873
3.38ProPhe: 3.38 ± 1.404
2.817ProGly: 2.817 ± 1.172
3.099ProHis: 3.099 ± 1.757
4.507ProIle: 4.507 ± 0.927
3.662ProLys: 3.662 ± 0.709
3.662ProLeu: 3.662 ± 1.367
0.845ProMet: 0.845 ± 0.726
0.845ProAsn: 0.845 ± 0.409
3.662ProPro: 3.662 ± 0.906
1.408ProGln: 1.408 ± 0.593
1.127ProArg: 1.127 ± 0.376
3.944ProSer: 3.944 ± 1.173
3.38ProThr: 3.38 ± 0.376
4.507ProVal: 4.507 ± 1.575
0.282ProTrp: 0.282 ± 0.157
1.408ProTyr: 1.408 ± 0.831
0.0ProXaa: 0.0 ± 0.0
Gln
1.972GlnAla: 1.972 ± 0.612
1.408GlnCys: 1.408 ± 0.281
1.127GlnAsp: 1.127 ± 0.795
1.972GlnGlu: 1.972 ± 0.54
1.127GlnPhe: 1.127 ± 0.416
1.69GlnGly: 1.69 ± 0.356
1.127GlnHis: 1.127 ± 0.376
2.535GlnIle: 2.535 ± 0.787
3.099GlnLys: 3.099 ± 0.312
3.099GlnLeu: 3.099 ± 0.303
1.127GlnMet: 1.127 ± 0.416
1.69GlnAsn: 1.69 ± 0.621
1.972GlnPro: 1.972 ± 1.111
0.563GlnGln: 0.563 ± 0.33
0.563GlnArg: 0.563 ± 0.315
2.817GlnSer: 2.817 ± 0.25
1.69GlnThr: 1.69 ± 0.591
1.69GlnVal: 1.69 ± 0.621
0.845GlnTrp: 0.845 ± 0.825
1.127GlnTyr: 1.127 ± 0.629
0.0GlnXaa: 0.0 ± 0.0
Arg
4.225ArgAla: 4.225 ± 1.358
1.408ArgCys: 1.408 ± 0.403
1.972ArgAsp: 1.972 ± 0.434
3.099ArgGlu: 3.099 ± 0.994
2.535ArgPhe: 2.535 ± 0.715
3.099ArgGly: 3.099 ± 1.731
0.563ArgHis: 0.563 ± 0.315
2.535ArgIle: 2.535 ± 0.795
1.408ArgLys: 1.408 ± 0.428
4.789ArgLeu: 4.789 ± 1.289
1.972ArgMet: 1.972 ± 0.502
1.972ArgAsn: 1.972 ± 0.8
1.972ArgPro: 1.972 ± 0.54
2.254ArgGln: 2.254 ± 0.485
1.69ArgArg: 1.69 ± 0.362
5.352ArgSer: 5.352 ± 0.82
2.254ArgThr: 2.254 ± 0.946
3.099ArgVal: 3.099 ± 0.916
0.845ArgTrp: 0.845 ± 0.305
1.69ArgTyr: 1.69 ± 0.457
0.0ArgXaa: 0.0 ± 0.0
Ser
3.944SerAla: 3.944 ± 0.162
1.69SerCys: 1.69 ± 0.621
6.761SerAsp: 6.761 ± 0.928
4.507SerGlu: 4.507 ± 1.421
1.972SerPhe: 1.972 ± 0.594
2.535SerGly: 2.535 ± 0.489
1.972SerHis: 1.972 ± 0.821
5.352SerIle: 5.352 ± 0.653
5.07SerLys: 5.07 ± 0.744
9.577SerLeu: 9.577 ± 1.165
3.099SerMet: 3.099 ± 0.964
3.662SerAsn: 3.662 ± 1.193
5.352SerPro: 5.352 ± 0.691
1.972SerGln: 1.972 ± 0.956
3.944SerArg: 3.944 ± 1.543
9.859SerSer: 9.859 ± 2.624
5.07SerThr: 5.07 ± 0.966
3.662SerVal: 3.662 ± 0.542
1.69SerTrp: 1.69 ± 0.944
2.817SerTyr: 2.817 ± 0.862
0.0SerXaa: 0.0 ± 0.0
Thr
1.69ThrAla: 1.69 ± 0.457
1.972ThrCys: 1.972 ± 0.434
3.944ThrAsp: 3.944 ± 1.4
2.535ThrGlu: 2.535 ± 0.795
3.662ThrPhe: 3.662 ± 0.545
4.225ThrGly: 4.225 ± 1.891
1.972ThrHis: 1.972 ± 0.895
2.535ThrIle: 2.535 ± 0.984
3.38ThrLys: 3.38 ± 0.68
3.944ThrLeu: 3.944 ± 0.942
2.535ThrMet: 2.535 ± 0.679
1.69ThrAsn: 1.69 ± 0.659
2.535ThrPro: 2.535 ± 1.446
1.972ThrGln: 1.972 ± 0.438
2.254ThrArg: 2.254 ± 0.692
4.507ThrSer: 4.507 ± 0.841
3.662ThrThr: 3.662 ± 0.731
3.099ThrVal: 3.099 ± 1.82
1.69ThrTrp: 1.69 ± 0.326
2.535ThrTyr: 2.535 ± 0.939
0.0ThrXaa: 0.0 ± 0.0
Val
2.535ValAla: 2.535 ± 1.368
1.972ValCys: 1.972 ± 0.681
3.099ValAsp: 3.099 ± 1.13
4.225ValGlu: 4.225 ± 2.096
1.408ValPhe: 1.408 ± 0.593
3.099ValGly: 3.099 ± 0.593
0.563ValHis: 0.563 ± 0.308
4.789ValIle: 4.789 ± 1.632
3.38ValLys: 3.38 ± 1.561
4.789ValLeu: 4.789 ± 0.722
1.127ValMet: 1.127 ± 0.795
1.972ValAsn: 1.972 ± 0.8
2.535ValPro: 2.535 ± 0.415
1.972ValGln: 1.972 ± 0.796
2.817ValArg: 2.817 ± 0.752
4.225ValSer: 4.225 ± 1.638
2.817ValThr: 2.817 ± 0.919
1.408ValVal: 1.408 ± 0.281
0.0ValTrp: 0.0 ± 0.0
1.69ValTyr: 1.69 ± 0.659
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.69TrpAsp: 1.69 ± 0.493
2.817TrpGlu: 2.817 ± 0.25
2.535TrpPhe: 2.535 ± 1.2
1.408TrpGly: 1.408 ± 0.528
0.282TrpHis: 0.282 ± 0.157
1.408TrpIle: 1.408 ± 0.488
1.408TrpLys: 1.408 ± 0.488
1.127TrpLeu: 1.127 ± 0.661
0.563TrpMet: 0.563 ± 0.62
0.845TrpAsn: 0.845 ± 0.382
0.282TrpPro: 0.282 ± 0.157
0.282TrpGln: 0.282 ± 0.157
0.845TrpArg: 0.845 ± 0.305
1.69TrpSer: 1.69 ± 0.621
0.845TrpThr: 0.845 ± 0.382
1.127TrpVal: 1.127 ± 0.778
0.0TrpTrp: 0.0 ± 0.0
0.563TrpTyr: 0.563 ± 0.534
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.69TyrAla: 1.69 ± 0.705
0.282TyrCys: 0.282 ± 0.382
2.817TyrAsp: 2.817 ± 1.067
1.127TyrGlu: 1.127 ± 0.452
2.254TyrPhe: 2.254 ± 0.485
3.099TyrGly: 3.099 ± 0.76
1.127TyrHis: 1.127 ± 0.629
2.535TyrIle: 2.535 ± 0.639
3.662TyrLys: 3.662 ± 1.006
3.944TyrLeu: 3.944 ± 0.633
1.127TyrMet: 1.127 ± 0.778
1.972TyrAsn: 1.972 ± 0.534
1.972TyrPro: 1.972 ± 0.744
1.69TyrGln: 1.69 ± 0.647
1.972TyrArg: 1.972 ± 0.69
1.127TyrSer: 1.127 ± 0.482
1.408TyrThr: 1.408 ± 0.637
1.408TyrVal: 1.408 ± 0.652
0.563TyrTrp: 0.563 ± 0.397
1.972TyrTyr: 1.972 ± 0.873
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3551 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski