Amino acid dipepetide frequency for Rabies virus (strain Pasteur vaccins / PV) (RABV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.056AlaAla: 3.056 ± 1.393
0.873AlaCys: 0.873 ± 0.48
1.965AlaAsp: 1.965 ± 1.085
4.584AlaGlu: 4.584 ± 1.549
0.873AlaPhe: 0.873 ± 0.546
1.528AlaGly: 1.528 ± 0.526
2.62AlaHis: 2.62 ± 0.72
2.62AlaIle: 2.62 ± 0.689
3.056AlaLys: 3.056 ± 1.295
6.767AlaLeu: 6.767 ± 0.407
0.655AlaMet: 0.655 ± 0.285
2.183AlaAsn: 2.183 ± 0.584
1.528AlaPro: 1.528 ± 0.811
2.62AlaGln: 2.62 ± 0.644
5.021AlaArg: 5.021 ± 1.135
3.274AlaSer: 3.274 ± 0.394
1.965AlaThr: 1.965 ± 0.658
3.056AlaVal: 3.056 ± 0.711
0.218AlaTrp: 0.218 ± 0.137
1.091AlaTyr: 1.091 ± 0.631
0.0AlaXaa: 0.0 ± 0.0
Cys
0.655CysAla: 0.655 ± 0.465
0.218CysCys: 0.218 ± 0.137
0.437CysAsp: 0.437 ± 0.232
0.0CysGlu: 0.0 ± 0.0
0.437CysPhe: 0.437 ± 0.232
0.873CysGly: 0.873 ± 0.464
0.437CysHis: 0.437 ± 0.392
1.091CysIle: 1.091 ± 0.503
0.218CysLys: 0.218 ± 0.24
1.746CysLeu: 1.746 ± 0.796
1.091CysMet: 1.091 ± 0.886
0.437CysAsn: 0.437 ± 0.273
1.091CysPro: 1.091 ± 0.447
0.437CysGln: 0.437 ± 0.273
0.437CysArg: 0.437 ± 0.392
2.62CysSer: 2.62 ± 0.7
0.873CysThr: 0.873 ± 0.464
1.528CysVal: 1.528 ± 0.687
0.437CysTrp: 0.437 ± 0.232
0.655CysTyr: 0.655 ± 0.246
0.0CysXaa: 0.0 ± 0.0
Asp
2.183AspAla: 2.183 ± 0.775
0.218AspCys: 0.218 ± 0.265
6.985AspAsp: 6.985 ± 2.784
3.493AspGlu: 3.493 ± 1.181
3.493AspPhe: 3.493 ± 0.826
3.493AspGly: 3.493 ± 0.886
0.218AspHis: 0.218 ± 0.137
2.62AspIle: 2.62 ± 0.431
3.056AspLys: 3.056 ± 0.526
8.95AspLeu: 8.95 ± 1.271
0.873AspMet: 0.873 ± 0.552
4.148AspAsn: 4.148 ± 0.895
4.366AspPro: 4.366 ± 0.305
2.183AspGln: 2.183 ± 1.096
1.31AspArg: 1.31 ± 0.409
2.183AspSer: 2.183 ± 0.811
1.31AspThr: 1.31 ± 0.651
1.965AspVal: 1.965 ± 0.64
0.655AspTrp: 0.655 ± 0.246
2.62AspTyr: 2.62 ± 1.013
0.0AspXaa: 0.0 ± 0.0
Glu
4.366GluAla: 4.366 ± 1.56
0.437GluCys: 0.437 ± 0.232
7.859GluAsp: 7.859 ± 3.329
6.112GluGlu: 6.112 ± 1.372
1.528GluPhe: 1.528 ± 0.477
5.021GluGly: 5.021 ± 0.769
0.873GluHis: 0.873 ± 0.83
5.239GluIle: 5.239 ± 1.265
2.183GluLys: 2.183 ± 1.096
4.802GluLeu: 4.802 ± 0.496
2.401GluMet: 2.401 ± 0.523
1.091GluAsn: 1.091 ± 0.516
1.965GluPro: 1.965 ± 0.385
1.31GluGln: 1.31 ± 0.964
3.274GluArg: 3.274 ± 0.247
7.422GluSer: 7.422 ± 1.277
3.711GluThr: 3.711 ± 1.325
2.62GluVal: 2.62 ± 0.843
1.528GluTrp: 1.528 ± 0.687
1.091GluTyr: 1.091 ± 0.677
0.0GluXaa: 0.0 ± 0.0
Phe
0.873PheAla: 0.873 ± 0.334
0.437PheCys: 0.437 ± 0.256
1.528PheAsp: 1.528 ± 0.779
2.838PheGlu: 2.838 ± 1.424
2.62PhePhe: 2.62 ± 1.042
1.528PheGly: 1.528 ± 0.515
1.528PheHis: 1.528 ± 0.699
1.31PheIle: 1.31 ± 0.492
2.183PheLys: 2.183 ± 0.777
4.366PheLeu: 4.366 ± 0.466
0.218PheMet: 0.218 ± 0.137
1.746PheAsn: 1.746 ± 0.575
4.802PhePro: 4.802 ± 0.884
4.366PheGln: 4.366 ± 1.903
3.493PheArg: 3.493 ± 1.035
4.148PheSer: 4.148 ± 0.58
1.091PheThr: 1.091 ± 0.447
2.401PheVal: 2.401 ± 0.961
0.218PheTrp: 0.218 ± 0.137
0.655PheTyr: 0.655 ± 0.41
0.0PheXaa: 0.0 ± 0.0
Gly
1.965GlyAla: 1.965 ± 0.789
1.091GlyCys: 1.091 ± 0.447
2.401GlyAsp: 2.401 ± 0.893
6.112GlyGlu: 6.112 ± 2.762
1.746GlyPhe: 1.746 ± 0.872
4.366GlyGly: 4.366 ± 1.283
0.655GlyHis: 0.655 ± 0.246
2.401GlyIle: 2.401 ± 0.637
5.457GlyLys: 5.457 ± 1.877
6.549GlyLeu: 6.549 ± 1.821
1.091GlyMet: 1.091 ± 0.356
3.056GlyAsn: 3.056 ± 0.886
2.838GlyPro: 2.838 ± 0.472
1.528GlyGln: 1.528 ± 0.725
3.929GlyArg: 3.929 ± 0.795
3.274GlySer: 3.274 ± 0.394
2.838GlyThr: 2.838 ± 1.058
4.584GlyVal: 4.584 ± 1.41
1.31GlyTrp: 1.31 ± 0.771
1.965GlyTyr: 1.965 ± 0.587
0.0GlyXaa: 0.0 ± 0.0
His
1.091HisAla: 1.091 ± 0.447
0.0HisCys: 0.0 ± 0.0
1.091HisAsp: 1.091 ± 0.514
0.873HisGlu: 0.873 ± 0.269
1.31HisPhe: 1.31 ± 0.43
0.437HisGly: 0.437 ± 0.273
0.655HisHis: 0.655 ± 0.285
1.746HisIle: 1.746 ± 0.62
0.873HisLys: 0.873 ± 0.433
3.274HisLeu: 3.274 ± 0.845
0.218HisMet: 0.218 ± 0.24
0.437HisAsn: 0.437 ± 0.48
1.528HisPro: 1.528 ± 0.688
1.528HisGln: 1.528 ± 0.852
0.873HisArg: 0.873 ± 0.412
1.31HisSer: 1.31 ± 0.409
0.218HisThr: 0.218 ± 0.265
1.091HisVal: 1.091 ± 0.318
0.873HisTrp: 0.873 ± 0.398
0.655HisTyr: 0.655 ± 0.296
0.0HisXaa: 0.0 ± 0.0
Ile
3.493IleAla: 3.493 ± 1.728
0.655IleCys: 0.655 ± 0.41
2.401IleAsp: 2.401 ± 1.304
2.401IleGlu: 2.401 ± 0.653
2.401IlePhe: 2.401 ± 0.658
1.746IleGly: 1.746 ± 0.909
1.746IleHis: 1.746 ± 0.519
4.802IleIle: 4.802 ± 0.645
2.401IleLys: 2.401 ± 0.748
5.894IleLeu: 5.894 ± 1.466
2.401IleMet: 2.401 ± 0.485
1.965IleAsn: 1.965 ± 0.476
2.183IlePro: 2.183 ± 0.892
0.655IleGln: 0.655 ± 0.41
4.584IleArg: 4.584 ± 0.668
5.021IleSer: 5.021 ± 0.814
3.493IleThr: 3.493 ± 1.456
4.802IleVal: 4.802 ± 1.473
2.183IleTrp: 2.183 ± 0.598
1.965IleTyr: 1.965 ± 0.869
0.0IleXaa: 0.0 ± 0.0
Lys
2.183LysAla: 2.183 ± 0.853
0.0LysCys: 0.0 ± 0.0
2.62LysAsp: 2.62 ± 1.167
3.711LysGlu: 3.711 ± 1.369
3.056LysPhe: 3.056 ± 1.4
1.965LysGly: 1.965 ± 0.698
0.437LysHis: 0.437 ± 0.232
5.021LysIle: 5.021 ± 1.651
5.021LysLys: 5.021 ± 2.129
5.239LysLeu: 5.239 ± 0.977
2.183LysMet: 2.183 ± 0.744
2.183LysAsn: 2.183 ± 1.014
2.183LysPro: 2.183 ± 0.723
0.873LysGln: 0.873 ± 0.296
2.838LysArg: 2.838 ± 0.881
6.549LysSer: 6.549 ± 1.129
3.056LysThr: 3.056 ± 0.909
4.802LysVal: 4.802 ± 1.091
0.655LysTrp: 0.655 ± 0.296
2.838LysTyr: 2.838 ± 1.35
0.0LysXaa: 0.0 ± 0.0
Leu
6.549LeuAla: 6.549 ± 0.858
1.746LeuCys: 1.746 ± 0.796
6.767LeuAsp: 6.767 ± 0.996
6.112LeuGlu: 6.112 ± 1.55
3.929LeuPhe: 3.929 ± 1.003
6.112LeuGly: 6.112 ± 0.751
1.965LeuHis: 1.965 ± 0.633
5.457LeuIle: 5.457 ± 2.035
7.859LeuLys: 7.859 ± 1.244
10.478LeuLeu: 10.478 ± 0.891
3.711LeuMet: 3.711 ± 1.483
3.274LeuAsn: 3.274 ± 0.686
3.711LeuPro: 3.711 ± 0.519
2.183LeuGln: 2.183 ± 0.486
7.422LeuArg: 7.422 ± 1.852
10.478LeuSer: 10.478 ± 1.637
4.148LeuThr: 4.148 ± 1.29
7.422LeuVal: 7.422 ± 1.365
1.091LeuTrp: 1.091 ± 0.557
4.802LeuTyr: 4.802 ± 0.751
0.0LeuXaa: 0.0 ± 0.0
Met
3.056MetAla: 3.056 ± 1.556
0.655MetCys: 0.655 ± 0.246
1.746MetAsp: 1.746 ± 0.829
0.873MetGlu: 0.873 ± 0.766
0.873MetPhe: 0.873 ± 0.433
0.873MetGly: 0.873 ± 0.699
0.218MetHis: 0.218 ± 0.24
0.873MetIle: 0.873 ± 0.546
0.218MetLys: 0.218 ± 0.137
1.965MetLeu: 1.965 ± 1.036
0.218MetMet: 0.218 ± 0.137
2.401MetAsn: 2.401 ± 1.186
0.218MetPro: 0.218 ± 0.24
1.965MetGln: 1.965 ± 0.874
0.873MetArg: 0.873 ± 0.546
3.274MetSer: 3.274 ± 0.888
1.965MetThr: 1.965 ± 0.67
1.091MetVal: 1.091 ± 0.433
0.0MetTrp: 0.0 ± 0.0
0.218MetTyr: 0.218 ± 0.137
0.0MetXaa: 0.0 ± 0.0
Asn
1.31AsnAla: 1.31 ± 0.492
1.091AsnCys: 1.091 ± 0.433
1.31AsnAsp: 1.31 ± 0.82
1.746AsnGlu: 1.746 ± 0.797
3.493AsnPhe: 3.493 ± 1.891
1.965AsnGly: 1.965 ± 1.1
1.091AsnHis: 1.091 ± 0.356
3.274AsnIle: 3.274 ± 1.32
2.183AsnLys: 2.183 ± 0.853
5.021AsnLeu: 5.021 ± 0.454
0.655AsnMet: 0.655 ± 0.315
0.655AsnAsn: 0.655 ± 0.285
3.929AsnPro: 3.929 ± 0.89
0.873AsnGln: 0.873 ± 0.406
4.366AsnArg: 4.366 ± 0.729
4.148AsnSer: 4.148 ± 0.365
0.655AsnThr: 0.655 ± 0.315
3.274AsnVal: 3.274 ± 1.307
1.31AsnTrp: 1.31 ± 0.71
1.31AsnTyr: 1.31 ± 0.571
0.0AsnXaa: 0.0 ± 0.0
Pro
2.183ProAla: 2.183 ± 0.584
0.437ProCys: 0.437 ± 0.217
2.838ProAsp: 2.838 ± 1.098
3.711ProGlu: 3.711 ± 0.862
0.218ProPhe: 0.218 ± 0.265
4.584ProGly: 4.584 ± 2.043
1.091ProHis: 1.091 ± 0.446
2.62ProIle: 2.62 ± 0.843
1.091ProLys: 1.091 ± 0.683
5.894ProLeu: 5.894 ± 1.252
0.218ProMet: 0.218 ± 0.137
3.711ProAsn: 3.711 ± 1.135
4.148ProPro: 4.148 ± 1.218
0.873ProGln: 0.873 ± 0.464
1.746ProArg: 1.746 ± 0.796
7.204ProSer: 7.204 ± 1.488
3.056ProThr: 3.056 ± 0.292
1.965ProVal: 1.965 ± 0.937
0.218ProTrp: 0.218 ± 0.24
1.746ProTyr: 1.746 ± 0.575
0.0ProXaa: 0.0 ± 0.0
Gln
1.091GlnAla: 1.091 ± 0.431
0.437GlnCys: 0.437 ± 0.256
1.746GlnAsp: 1.746 ± 0.621
1.528GlnGlu: 1.528 ± 0.477
0.873GlnPhe: 0.873 ± 0.334
1.091GlnGly: 1.091 ± 0.348
1.091GlnHis: 1.091 ± 0.514
5.239GlnIle: 5.239 ± 2.157
1.091GlnLys: 1.091 ± 0.528
3.711GlnLeu: 3.711 ± 1.237
0.873GlnMet: 0.873 ± 0.686
0.437GlnAsn: 0.437 ± 0.273
0.218GlnPro: 0.218 ± 0.137
0.437GlnGln: 0.437 ± 0.232
3.274GlnArg: 3.274 ± 0.739
4.148GlnSer: 4.148 ± 1.11
4.584GlnThr: 4.584 ± 2.186
2.401GlnVal: 2.401 ± 1.016
0.437GlnTrp: 0.437 ± 0.256
0.218GlnTyr: 0.218 ± 0.265
0.0GlnXaa: 0.0 ± 0.0
Arg
2.838ArgAla: 2.838 ± 1.001
1.746ArgCys: 1.746 ± 0.641
3.711ArgAsp: 3.711 ± 0.528
6.33ArgGlu: 6.33 ± 1.256
3.493ArgPhe: 3.493 ± 0.617
3.493ArgGly: 3.493 ± 1.409
1.091ArgHis: 1.091 ± 0.528
2.183ArgIle: 2.183 ± 0.635
2.183ArgLys: 2.183 ± 0.664
5.676ArgLeu: 5.676 ± 0.439
2.401ArgMet: 2.401 ± 0.591
2.183ArgAsn: 2.183 ± 0.736
1.746ArgPro: 1.746 ± 0.641
2.183ArgGln: 2.183 ± 0.571
2.62ArgArg: 2.62 ± 0.804
5.894ArgSer: 5.894 ± 1.297
2.62ArgThr: 2.62 ± 0.863
4.148ArgVal: 4.148 ± 1.597
0.873ArgTrp: 0.873 ± 0.546
2.62ArgTyr: 2.62 ± 0.399
0.0ArgXaa: 0.0 ± 0.0
Ser
4.366SerAla: 4.366 ± 0.443
2.838SerCys: 2.838 ± 0.382
3.711SerAsp: 3.711 ± 1.383
5.239SerGlu: 5.239 ± 1.35
5.021SerPhe: 5.021 ± 0.685
7.859SerGly: 7.859 ± 1.855
1.746SerHis: 1.746 ± 0.641
3.274SerIle: 3.274 ± 1.844
9.605SerLys: 9.605 ± 3.103
9.168SerLeu: 9.168 ± 1.48
1.31SerMet: 1.31 ± 0.58
3.493SerAsn: 3.493 ± 0.362
4.148SerPro: 4.148 ± 0.608
5.021SerGln: 5.021 ± 1.985
5.676SerArg: 5.676 ± 1.119
9.168SerSer: 9.168 ± 0.798
4.584SerThr: 4.584 ± 0.392
5.239SerVal: 5.239 ± 0.647
1.965SerTrp: 1.965 ± 1.036
4.366SerTyr: 4.366 ± 1.422
0.0SerXaa: 0.0 ± 0.0
Thr
2.838ThrAla: 2.838 ± 1.154
1.31ThrCys: 1.31 ± 0.665
1.746ThrAsp: 1.746 ± 0.668
0.873ThrGlu: 0.873 ± 0.373
0.873ThrPhe: 0.873 ± 0.269
4.366ThrGly: 4.366 ± 0.377
0.873ThrHis: 0.873 ± 0.334
1.746ThrIle: 1.746 ± 0.669
1.31ThrLys: 1.31 ± 0.665
4.584ThrLeu: 4.584 ± 1.907
1.528ThrMet: 1.528 ± 0.956
2.838ThrAsn: 2.838 ± 0.888
3.711ThrPro: 3.711 ± 1.323
3.056ThrGln: 3.056 ± 0.561
3.711ThrArg: 3.711 ± 0.675
3.711ThrSer: 3.711 ± 0.389
4.148ThrThr: 4.148 ± 1.403
3.711ThrVal: 3.711 ± 1.255
1.528ThrTrp: 1.528 ± 0.62
2.401ThrTyr: 2.401 ± 0.72
0.0ThrXaa: 0.0 ± 0.0
Val
4.148ValAla: 4.148 ± 1.226
0.873ValCys: 0.873 ± 0.296
3.056ValAsp: 3.056 ± 0.595
5.894ValGlu: 5.894 ± 2.12
4.366ValPhe: 4.366 ± 1.554
5.457ValGly: 5.457 ± 1.265
1.091ValHis: 1.091 ± 0.514
3.056ValIle: 3.056 ± 0.886
3.056ValLys: 3.056 ± 1.024
5.021ValLeu: 5.021 ± 0.622
0.218ValMet: 0.218 ± 0.128
3.929ValAsn: 3.929 ± 0.492
3.929ValPro: 3.929 ± 0.675
2.183ValGln: 2.183 ± 0.571
2.183ValArg: 2.183 ± 0.591
6.33ValSer: 6.33 ± 1.877
3.711ValThr: 3.711 ± 0.331
2.401ValVal: 2.401 ± 0.74
0.218ValTrp: 0.218 ± 0.137
1.528ValTyr: 1.528 ± 0.535
0.0ValXaa: 0.0 ± 0.0
Trp
0.873TrpAla: 0.873 ± 0.296
0.437TrpCys: 0.437 ± 0.392
0.437TrpAsp: 0.437 ± 0.256
0.655TrpGlu: 0.655 ± 0.296
0.218TrpPhe: 0.218 ± 0.137
1.091TrpGly: 1.091 ± 0.516
0.437TrpHis: 0.437 ± 0.273
1.091TrpIle: 1.091 ± 0.683
0.873TrpLys: 0.873 ± 0.398
1.528TrpLeu: 1.528 ± 0.63
0.218TrpMet: 0.218 ± 0.24
0.873TrpAsn: 0.873 ± 0.398
0.437TrpPro: 0.437 ± 0.273
0.0TrpGln: 0.0 ± 0.0
0.437TrpArg: 0.437 ± 0.232
3.493TrpSer: 3.493 ± 1.48
0.437TrpThr: 0.437 ± 0.217
1.965TrpVal: 1.965 ± 0.699
0.0TrpTrp: 0.0 ± 0.0
0.218TrpTyr: 0.218 ± 0.137
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.437TyrAla: 0.437 ± 0.273
0.437TyrCys: 0.437 ± 0.232
1.965TyrAsp: 1.965 ± 0.767
1.746TyrGlu: 1.746 ± 0.625
1.528TyrPhe: 1.528 ± 0.653
1.31TyrGly: 1.31 ± 0.571
0.218TyrHis: 0.218 ± 0.24
1.528TyrIle: 1.528 ± 0.771
3.493TyrLys: 3.493 ± 0.943
4.584TyrLeu: 4.584 ± 1.099
0.873TyrMet: 0.873 ± 0.269
2.62TyrAsn: 2.62 ± 0.449
0.873TyrPro: 0.873 ± 0.269
0.655TyrGln: 0.655 ± 0.41
1.965TyrArg: 1.965 ± 0.478
4.148TyrSer: 4.148 ± 1.384
2.401TyrThr: 2.401 ± 1.124
2.183TyrVal: 2.183 ± 0.945
0.0TyrTrp: 0.0 ± 0.0
0.437TyrTyr: 0.437 ± 0.273
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (4582 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski