Amino acid dipepetide frequency for Rabies virus (strain HEP-Flury) (RABV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.971AlaAla: 1.971 ± 1.319
1.095AlaCys: 1.095 ± 0.561
2.847AlaAsp: 2.847 ± 1.359
4.38AlaGlu: 4.38 ± 1.398
0.657AlaPhe: 0.657 ± 0.408
2.628AlaGly: 2.628 ± 0.833
3.504AlaHis: 3.504 ± 1.589
2.847AlaIle: 2.847 ± 0.813
1.971AlaLys: 1.971 ± 0.495
5.256AlaLeu: 5.256 ± 0.512
0.657AlaMet: 0.657 ± 0.301
2.19AlaAsn: 2.19 ± 0.436
2.19AlaPro: 2.19 ± 0.931
2.628AlaGln: 2.628 ± 0.616
2.628AlaArg: 2.628 ± 1.021
3.285AlaSer: 3.285 ± 0.265
2.19AlaThr: 2.19 ± 0.436
1.533AlaVal: 1.533 ± 0.841
0.219AlaTrp: 0.219 ± 0.136
1.095AlaTyr: 1.095 ± 0.667
0.0AlaXaa: 0.0 ± 0.0
Cys
0.438CysAla: 0.438 ± 0.585
0.438CysCys: 0.438 ± 0.222
0.438CysAsp: 0.438 ± 0.222
0.0CysGlu: 0.0 ± 0.0
0.438CysPhe: 0.438 ± 0.222
0.876CysGly: 0.876 ± 0.443
0.438CysHis: 0.438 ± 0.402
1.095CysIle: 1.095 ± 0.488
0.438CysLys: 0.438 ± 0.402
1.752CysLeu: 1.752 ± 0.693
0.876CysMet: 0.876 ± 0.655
0.438CysAsn: 0.438 ± 0.272
1.095CysPro: 1.095 ± 0.441
0.657CysGln: 0.657 ± 0.304
0.657CysArg: 0.657 ± 0.625
2.628CysSer: 2.628 ± 0.636
0.876CysThr: 0.876 ± 0.443
1.314CysVal: 1.314 ± 0.647
0.219CysTrp: 0.219 ± 0.136
0.657CysTyr: 0.657 ± 0.316
0.0CysXaa: 0.0 ± 0.0
Asp
1.752AspAla: 1.752 ± 0.617
0.219AspCys: 0.219 ± 0.292
7.665AspAsp: 7.665 ± 1.969
2.847AspGlu: 2.847 ± 1.572
3.504AspPhe: 3.504 ± 0.736
3.285AspGly: 3.285 ± 0.641
0.219AspHis: 0.219 ± 0.136
2.847AspIle: 2.847 ± 0.478
4.818AspLys: 4.818 ± 1.125
9.417AspLeu: 9.417 ± 1.278
1.314AspMet: 1.314 ± 0.678
3.285AspAsn: 3.285 ± 1.138
4.161AspPro: 4.161 ± 0.316
2.19AspGln: 2.19 ± 1.148
2.19AspArg: 2.19 ± 0.436
2.628AspSer: 2.628 ± 0.915
1.752AspThr: 1.752 ± 0.798
1.752AspVal: 1.752 ± 0.762
0.657AspTrp: 0.657 ± 0.316
2.847AspTyr: 2.847 ± 1.056
0.0AspXaa: 0.0 ± 0.0
Glu
5.256GluAla: 5.256 ± 1.991
0.438GluCys: 0.438 ± 0.222
8.322GluAsp: 8.322 ± 2.99
5.037GluGlu: 5.037 ± 1.007
1.533GluPhe: 1.533 ± 0.572
5.256GluGly: 5.256 ± 0.694
1.095GluHis: 1.095 ± 0.912
5.256GluIle: 5.256 ± 1.088
2.847GluLys: 2.847 ± 0.869
4.599GluLeu: 4.599 ± 0.571
3.504GluMet: 3.504 ± 1.258
0.876GluAsn: 0.876 ± 0.346
1.752GluPro: 1.752 ± 0.502
1.314GluGln: 1.314 ± 0.795
3.285GluArg: 3.285 ± 0.24
7.008GluSer: 7.008 ± 1.186
3.285GluThr: 3.285 ± 1.365
2.847GluVal: 2.847 ± 0.668
1.533GluTrp: 1.533 ± 0.565
0.876GluTyr: 0.876 ± 0.553
0.0GluXaa: 0.0 ± 0.0
Phe
1.095PheAla: 1.095 ± 0.319
0.219PheCys: 0.219 ± 0.265
1.533PheAsp: 1.533 ± 0.763
2.847PheGlu: 2.847 ± 1.38
2.628PhePhe: 2.628 ± 1.294
1.314PheGly: 1.314 ± 0.424
2.19PheHis: 2.19 ± 0.56
1.314PheIle: 1.314 ± 0.632
2.19PheLys: 2.19 ± 0.701
5.475PheLeu: 5.475 ± 0.622
0.219PheMet: 0.219 ± 0.136
1.752PheAsn: 1.752 ± 0.512
4.599PhePro: 4.599 ± 0.966
4.38PheGln: 4.38 ± 1.73
3.285PheArg: 3.285 ± 0.823
4.38PheSer: 4.38 ± 0.776
1.095PheThr: 1.095 ± 0.441
2.19PheVal: 2.19 ± 0.733
0.219PheTrp: 0.219 ± 0.136
0.657PheTyr: 0.657 ± 0.408
0.0PheXaa: 0.0 ± 0.0
Gly
1.971GlyAla: 1.971 ± 0.661
1.095GlyCys: 1.095 ± 0.441
3.066GlyAsp: 3.066 ± 0.858
7.884GlyGlu: 7.884 ± 3.339
1.752GlyPhe: 1.752 ± 0.867
3.942GlyGly: 3.942 ± 1.211
0.876GlyHis: 0.876 ± 0.279
2.628GlyIle: 2.628 ± 0.483
3.723GlyLys: 3.723 ± 1.118
6.351GlyLeu: 6.351 ± 1.586
1.971GlyMet: 1.971 ± 0.887
2.628GlyAsn: 2.628 ± 0.792
2.847GlyPro: 2.847 ± 0.463
1.314GlyGln: 1.314 ± 0.784
4.599GlyArg: 4.599 ± 0.815
2.847GlySer: 2.847 ± 0.478
3.504GlyThr: 3.504 ± 1.271
3.723GlyVal: 3.723 ± 1.586
1.314GlyTrp: 1.314 ± 0.647
1.971GlyTyr: 1.971 ± 0.475
0.0GlyXaa: 0.0 ± 0.0
His
0.876HisAla: 0.876 ± 0.403
0.0HisCys: 0.0 ± 0.0
2.19HisAsp: 2.19 ± 0.637
0.657HisGlu: 0.657 ± 0.316
1.314HisPhe: 1.314 ± 0.505
0.657HisGly: 0.657 ± 0.408
0.438HisHis: 0.438 ± 0.375
1.752HisIle: 1.752 ± 0.618
1.095HisLys: 1.095 ± 0.511
3.285HisLeu: 3.285 ± 0.953
0.219HisMet: 0.219 ± 0.263
0.219HisAsn: 0.219 ± 0.263
1.314HisPro: 1.314 ± 0.514
1.533HisGln: 1.533 ± 0.683
0.657HisArg: 0.657 ± 0.304
1.752HisSer: 1.752 ± 0.439
0.219HisThr: 0.219 ± 0.292
1.314HisVal: 1.314 ± 0.424
0.876HisTrp: 0.876 ± 0.346
0.657HisTyr: 0.657 ± 0.257
0.0HisXaa: 0.0 ± 0.0
Ile
3.723IleAla: 3.723 ± 1.569
0.657IleCys: 0.657 ± 0.408
2.409IleAsp: 2.409 ± 1.039
3.066IleGlu: 3.066 ± 0.656
3.504IlePhe: 3.504 ± 0.826
1.314IleGly: 1.314 ± 0.635
1.533IleHis: 1.533 ± 0.712
4.599IleIle: 4.599 ± 0.631
2.628IleLys: 2.628 ± 0.978
6.57IleLeu: 6.57 ± 1.649
2.628IleMet: 2.628 ± 0.292
1.971IleAsn: 1.971 ± 0.455
3.285IlePro: 3.285 ± 0.811
0.876IleGln: 0.876 ± 0.399
2.847IleArg: 2.847 ± 1.058
5.694IleSer: 5.694 ± 1.269
3.723IleThr: 3.723 ± 1.397
4.161IleVal: 4.161 ± 1.217
2.19IleTrp: 2.19 ± 0.61
2.19IleTyr: 2.19 ± 0.911
0.0IleXaa: 0.0 ± 0.0
Lys
2.628LysAla: 2.628 ± 0.507
0.0LysCys: 0.0 ± 0.0
2.847LysAsp: 2.847 ± 1.059
3.723LysGlu: 3.723 ± 1.18
3.066LysPhe: 3.066 ± 1.231
2.19LysGly: 2.19 ± 0.683
0.438LysHis: 0.438 ± 0.222
6.132LysIle: 6.132 ± 2.003
4.818LysLys: 4.818 ± 1.921
5.256LysLeu: 5.256 ± 0.696
0.876LysMet: 0.876 ± 0.279
1.971LysAsn: 1.971 ± 0.922
1.533LysPro: 1.533 ± 0.572
0.876LysGln: 0.876 ± 0.279
3.504LysArg: 3.504 ± 1.091
5.256LysSer: 5.256 ± 0.709
2.847LysThr: 2.847 ± 0.858
3.723LysVal: 3.723 ± 1.051
0.876LysTrp: 0.876 ± 0.443
2.628LysTyr: 2.628 ± 1.342
0.0LysXaa: 0.0 ± 0.0
Leu
6.789LeuAla: 6.789 ± 0.788
1.971LeuCys: 1.971 ± 0.661
7.665LeuAsp: 7.665 ± 1.544
7.227LeuGlu: 7.227 ± 1.279
3.942LeuPhe: 3.942 ± 1.011
6.789LeuGly: 6.789 ± 0.855
1.533LeuHis: 1.533 ± 0.901
5.913LeuIle: 5.913 ± 1.661
7.008LeuLys: 7.008 ± 0.256
9.417LeuLeu: 9.417 ± 1.37
4.161LeuMet: 4.161 ± 1.492
2.19LeuAsn: 2.19 ± 1.025
2.628LeuPro: 2.628 ± 0.583
2.19LeuGln: 2.19 ± 0.484
7.665LeuArg: 7.665 ± 1.838
11.608LeuSer: 11.608 ± 1.149
3.285LeuThr: 3.285 ± 1.326
7.008LeuVal: 7.008 ± 1.052
1.095LeuTrp: 1.095 ± 0.54
4.818LeuTyr: 4.818 ± 0.911
0.0LeuXaa: 0.0 ± 0.0
Met
0.876MetAla: 0.876 ± 0.434
0.438MetCys: 0.438 ± 0.272
0.657MetAsp: 0.657 ± 0.301
0.876MetGlu: 0.876 ± 0.817
1.095MetPhe: 1.095 ± 0.574
0.438MetGly: 0.438 ± 0.272
0.219MetHis: 0.219 ± 0.263
1.533MetIle: 1.533 ± 0.549
0.219MetLys: 0.219 ± 0.136
1.971MetLeu: 1.971 ± 0.973
0.438MetMet: 0.438 ± 0.272
3.504MetAsn: 3.504 ± 1.788
0.219MetPro: 0.219 ± 0.263
1.971MetGln: 1.971 ± 0.861
2.847MetArg: 2.847 ± 1.205
3.723MetSer: 3.723 ± 1.573
1.752MetThr: 1.752 ± 0.512
2.628MetVal: 2.628 ± 1.22
0.0MetTrp: 0.0 ± 0.0
0.219MetTyr: 0.219 ± 0.136
0.0MetXaa: 0.0 ± 0.0
Asn
1.314AsnAla: 1.314 ± 0.817
1.314AsnCys: 1.314 ± 0.446
0.876AsnAsp: 0.876 ± 0.545
1.533AsnGlu: 1.533 ± 0.708
3.504AsnPhe: 3.504 ± 1.693
2.847AsnGly: 2.847 ± 1.142
1.095AsnHis: 1.095 ± 0.314
3.285AsnIle: 3.285 ± 1.329
1.095AsnLys: 1.095 ± 0.511
6.57AsnLeu: 6.57 ± 1.874
0.657AsnMet: 0.657 ± 0.304
0.657AsnAsn: 0.657 ± 0.301
3.066AsnPro: 3.066 ± 0.618
0.876AsnGln: 0.876 ± 0.376
3.504AsnArg: 3.504 ± 1.167
4.161AsnSer: 4.161 ± 0.777
0.657AsnThr: 0.657 ± 0.316
2.847AsnVal: 2.847 ± 1.331
1.095AsnTrp: 1.095 ± 0.575
1.095AsnTyr: 1.095 ± 0.512
0.0AsnXaa: 0.0 ± 0.0
Pro
1.095ProAla: 1.095 ± 0.488
0.438ProCys: 0.438 ± 0.272
3.285ProAsp: 3.285 ± 0.908
3.723ProGlu: 3.723 ± 0.666
0.219ProPhe: 0.219 ± 0.292
3.723ProGly: 3.723 ± 1.408
1.095ProHis: 1.095 ± 0.511
2.628ProIle: 2.628 ± 0.711
1.314ProLys: 1.314 ± 0.582
5.475ProLeu: 5.475 ± 1.042
0.438ProMet: 0.438 ± 0.272
3.723ProAsn: 3.723 ± 1.336
4.38ProPro: 4.38 ± 1.333
0.657ProGln: 0.657 ± 0.257
1.533ProArg: 1.533 ± 0.71
7.665ProSer: 7.665 ± 1.528
1.752ProThr: 1.752 ± 0.63
1.533ProVal: 1.533 ± 0.953
0.219ProTrp: 0.219 ± 0.263
1.752ProTyr: 1.752 ± 0.512
0.0ProXaa: 0.0 ± 0.0
Gln
0.876GlnAla: 0.876 ± 0.448
0.219GlnCys: 0.219 ± 0.265
1.752GlnAsp: 1.752 ± 0.507
1.533GlnGlu: 1.533 ± 0.572
1.533GlnPhe: 1.533 ± 0.454
1.314GlnGly: 1.314 ± 0.438
0.876GlnHis: 0.876 ± 0.346
3.723GlnIle: 3.723 ± 1.07
1.314GlnLys: 1.314 ± 0.635
3.504GlnLeu: 3.504 ± 1.215
2.409GlnMet: 2.409 ± 1.367
0.657GlnAsn: 0.657 ± 0.316
1.533GlnPro: 1.533 ± 0.565
0.657GlnGln: 0.657 ± 0.257
3.066GlnArg: 3.066 ± 0.679
3.066GlnSer: 3.066 ± 0.612
3.942GlnThr: 3.942 ± 1.187
3.723GlnVal: 3.723 ± 0.356
0.438GlnTrp: 0.438 ± 0.251
0.219GlnTyr: 0.219 ± 0.292
0.0GlnXaa: 0.0 ± 0.0
Arg
2.847ArgAla: 2.847 ± 0.839
1.971ArgCys: 1.971 ± 0.512
2.19ArgAsp: 2.19 ± 0.825
5.913ArgGlu: 5.913 ± 1.202
3.504ArgPhe: 3.504 ± 0.671
3.285ArgGly: 3.285 ± 1.349
1.314ArgHis: 1.314 ± 0.635
2.409ArgIle: 2.409 ± 0.867
1.971ArgLys: 1.971 ± 0.599
4.818ArgLeu: 4.818 ± 0.513
1.314ArgMet: 1.314 ± 0.458
2.628ArgAsn: 2.628 ± 0.723
2.409ArgPro: 2.409 ± 0.814
2.847ArgGln: 2.847 ± 1.009
2.19ArgArg: 2.19 ± 0.737
6.57ArgSer: 6.57 ± 1.723
2.628ArgThr: 2.628 ± 0.929
5.694ArgVal: 5.694 ± 1.012
0.876ArgTrp: 0.876 ± 0.545
2.628ArgTyr: 2.628 ± 0.292
0.0ArgXaa: 0.0 ± 0.0
Ser
4.38SerAla: 4.38 ± 0.619
2.409SerCys: 2.409 ± 0.55
4.38SerAsp: 4.38 ± 1.336
5.256SerGlu: 5.256 ± 1.598
4.599SerPhe: 4.599 ± 0.545
8.541SerGly: 8.541 ± 1.551
1.533SerHis: 1.533 ± 0.511
4.38SerIle: 4.38 ± 1.264
9.198SerLys: 9.198 ± 2.897
9.855SerLeu: 9.855 ± 1.864
1.095SerMet: 1.095 ± 0.522
2.847SerAsn: 2.847 ± 0.5
3.504SerPro: 3.504 ± 0.903
5.256SerGln: 5.256 ± 1.675
6.132SerArg: 6.132 ± 1.362
7.884SerSer: 7.884 ± 0.383
4.599SerThr: 4.599 ± 0.631
6.132SerVal: 6.132 ± 0.951
1.752SerTrp: 1.752 ± 0.841
4.599SerTyr: 4.599 ± 1.071
0.0SerXaa: 0.0 ± 0.0
Thr
1.752ThrAla: 1.752 ± 1.089
1.314ThrCys: 1.314 ± 0.671
1.971ThrAsp: 1.971 ± 0.678
0.876ThrGlu: 0.876 ± 0.325
0.876ThrPhe: 0.876 ± 0.279
3.942ThrGly: 3.942 ± 0.455
0.876ThrHis: 0.876 ± 0.279
1.752ThrIle: 1.752 ± 0.617
1.314ThrLys: 1.314 ± 0.671
4.599ThrLeu: 4.599 ± 1.918
1.533ThrMet: 1.533 ± 0.953
2.628ThrAsn: 2.628 ± 0.796
1.314ThrPro: 1.314 ± 0.404
3.066ThrGln: 3.066 ± 0.622
4.161ThrArg: 4.161 ± 0.633
4.599ThrSer: 4.599 ± 0.916
4.161ThrThr: 4.161 ± 1.441
3.942ThrVal: 3.942 ± 1.283
1.533ThrTrp: 1.533 ± 0.523
2.409ThrTyr: 2.409 ± 0.715
0.0ThrXaa: 0.0 ± 0.0
Val
4.818ValAla: 4.818 ± 1.751
0.876ValCys: 0.876 ± 0.314
3.066ValAsp: 3.066 ± 0.587
5.913ValGlu: 5.913 ± 2.043
4.161ValPhe: 4.161 ± 1.297
5.256ValGly: 5.256 ± 1.341
1.095ValHis: 1.095 ± 0.46
2.847ValIle: 2.847 ± 0.924
3.066ValLys: 3.066 ± 1.061
4.599ValLeu: 4.599 ± 0.493
0.0ValMet: 0.0 ± 0.0
4.38ValAsn: 4.38 ± 0.629
3.504ValPro: 3.504 ± 0.536
2.409ValGln: 2.409 ± 0.414
2.847ValArg: 2.847 ± 1.116
6.351ValSer: 6.351 ± 0.828
3.504ValThr: 3.504 ± 0.414
2.409ValVal: 2.409 ± 0.609
0.219ValTrp: 0.219 ± 0.136
1.314ValTyr: 1.314 ± 0.458
0.0ValXaa: 0.0 ± 0.0
Trp
0.876TrpAla: 0.876 ± 0.314
0.438TrpCys: 0.438 ± 0.402
0.219TrpAsp: 0.219 ± 0.265
0.876TrpGlu: 0.876 ± 0.346
0.219TrpPhe: 0.219 ± 0.136
1.095TrpGly: 1.095 ± 0.459
0.438TrpHis: 0.438 ± 0.272
1.095TrpIle: 1.095 ± 0.681
0.438TrpLys: 0.438 ± 0.222
1.533TrpLeu: 1.533 ± 0.549
0.219TrpMet: 0.219 ± 0.263
0.876TrpAsn: 0.876 ± 0.346
0.438TrpPro: 0.438 ± 0.272
0.0TrpGln: 0.0 ± 0.0
0.438TrpArg: 0.438 ± 0.272
3.504TrpSer: 3.504 ± 1.291
0.438TrpThr: 0.438 ± 0.272
1.971TrpVal: 1.971 ± 0.707
0.0TrpTrp: 0.0 ± 0.0
0.219TrpTyr: 0.219 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.438TyrAla: 0.438 ± 0.272
0.438TyrCys: 0.438 ± 0.222
1.971TyrAsp: 1.971 ± 0.754
1.752TyrGlu: 1.752 ± 0.762
1.533TyrPhe: 1.533 ± 0.841
1.314TyrGly: 1.314 ± 0.632
0.219TyrHis: 0.219 ± 0.263
1.533TyrIle: 1.533 ± 0.71
3.723TyrLys: 3.723 ± 0.811
4.599TyrLeu: 4.599 ± 1.228
0.876TyrMet: 0.876 ± 0.279
2.628TyrAsn: 2.628 ± 0.426
0.876TyrPro: 0.876 ± 0.279
0.657TyrGln: 0.657 ± 0.408
1.971TyrArg: 1.971 ± 0.455
3.723TyrSer: 3.723 ± 1.38
2.409TyrThr: 2.409 ± 1.254
2.19TyrVal: 2.19 ± 0.952
0.0TyrTrp: 0.0 ± 0.0
0.438TyrTyr: 0.438 ± 0.272
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (4567 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski