Amino acid dipeptide frequency for European bat lyssavirus 1 (strain Bat/Germany/RV9/1968) (EBLV1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
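The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's actual implementation: the function names, the one-letter alphabet, the choice to count dipeptides within each protein only, and the use of the uniform 2.5‰ value as the over/under threshold are all assumptions made for this example.

```python
from collections import Counter

# Assumption: one-letter codes for the 20 standard residues; anything else becomes X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs, counted inside each protein (never across two proteins).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()} if total else {}


# Uniform expectation for 400 dipeptides: 1000 / 400 = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def representation(value_per_mille):
    """Classify a frequency relative to the uniform expectation (hypothetical helper)."""
    if value_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if value_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


# Toy input, not real EBLV1 sequences.
toy = ["ACAC", "AAB"]
freqs = dipeptide_per_mille(toy)
```

With values from the table, `representation(3.174)` (AlaAla) gives "over" and `representation(1.465)` (AlaCys) gives "under".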

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
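As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage. The helper names are hypothetical and exist only for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# SerSer is listed as 10.01 per mille in the table.
ser_ser_fraction = per_mille_to_fraction(10.01)
ser_ser_percent = per_mille_to_percent(10.01)
```

So SerSer, at 10.01‰, corresponds to about 1% of all dipeptides, roughly four times the uniform expectation of 0.25%.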

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Rows (first residue) are listed below; each entry is the frequency in ‰ ± error.

Ala
AlaAla: 3.174 ± 1.107
AlaCys: 1.465 ± 0.788
AlaAsp: 3.174 ± 1.362
AlaGlu: 3.906 ± 1.83
AlaPhe: 1.221 ± 0.522
AlaGly: 1.709 ± 0.536
AlaHis: 1.709 ± 1.07
AlaIle: 2.93 ± 1.176
AlaLys: 2.441 ± 0.533
AlaLeu: 5.371 ± 1.371
AlaMet: 0.732 ± 0.29
AlaAsn: 1.221 ± 0.538
AlaPro: 3.662 ± 1.634
AlaGln: 2.686 ± 0.574
AlaArg: 3.174 ± 0.749
AlaSer: 4.639 ± 1.943
AlaThr: 2.441 ± 0.755
AlaVal: 3.418 ± 1.21
AlaTrp: 0.488 ± 0.298
AlaTyr: 2.686 ± 1.29
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.732 ± 0.727
CysCys: 0.732 ± 0.524
CysAsp: 0.732 ± 0.3
CysGlu: 0.0 ± 0.0
CysPhe: 0.244 ± 0.149
CysGly: 1.221 ± 0.535
CysHis: 0.488 ± 0.485
CysIle: 0.977 ± 0.737
CysLys: 0.732 ± 0.689
CysLeu: 2.93 ± 1.029
CysMet: 0.732 ± 0.687
CysAsn: 0.244 ± 0.149
CysPro: 1.221 ± 0.535
CysGln: 0.732 ± 0.39
CysArg: 0.244 ± 0.289
CysSer: 2.197 ± 0.471
CysThr: 0.732 ± 0.524
CysVal: 0.0 ± 0.0
CysTrp: 0.244 ± 0.149
CysTyr: 0.732 ± 0.29
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.174 ± 1.131
AspCys: 0.244 ± 0.321
AspAsp: 6.592 ± 2.963
AspGlu: 3.906 ± 1.666
AspPhe: 3.174 ± 0.547
AspGly: 4.883 ± 1.822
AspHis: 0.244 ± 0.149
AspIle: 3.906 ± 1.365
AspLys: 3.662 ± 0.279
AspLeu: 7.324 ± 1.547
AspMet: 1.221 ± 0.38
AspAsn: 1.953 ± 1.19
AspPro: 3.662 ± 1.109
AspGln: 2.441 ± 0.927
AspArg: 2.197 ± 1.044
AspSer: 4.15 ± 0.886
AspThr: 0.977 ± 0.319
AspVal: 3.418 ± 0.675
AspTrp: 1.465 ± 0.57
AspTyr: 2.93 ± 0.624
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.395 ± 0.84
GluCys: 0.488 ± 0.254
GluAsp: 6.592 ± 3.628
GluGlu: 4.395 ± 1.512
GluPhe: 1.953 ± 0.569
GluGly: 4.639 ± 0.444
GluHis: 0.977 ± 1.03
GluIle: 3.906 ± 0.608
GluLys: 3.418 ± 0.929
GluLeu: 3.662 ± 0.993
GluMet: 2.441 ± 1.052
GluAsn: 1.953 ± 0.393
GluPro: 1.221 ± 0.492
GluGln: 0.977 ± 0.796
GluArg: 1.221 ± 0.342
GluSer: 6.836 ± 1.199
GluThr: 2.197 ± 1.255
GluVal: 3.174 ± 0.327
GluTrp: 1.221 ± 0.771
GluTyr: 0.977 ± 0.78
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.221 ± 0.565
PheCys: 0.244 ± 0.404
PheAsp: 1.221 ± 0.744
PheGlu: 2.441 ± 1.791
PhePhe: 4.395 ± 1.123
PheGly: 1.221 ± 0.342
PheHis: 1.709 ± 0.53
PheIle: 1.221 ± 0.538
PheLys: 3.418 ± 0.477
PheLeu: 4.15 ± 0.492
PheMet: 0.244 ± 0.149
PheAsn: 2.197 ± 0.789
PhePro: 4.15 ± 0.826
PheGln: 1.953 ± 0.633
PheArg: 3.662 ± 0.815
PheSer: 4.883 ± 1.452
PheThr: 1.221 ± 0.535
PheVal: 2.441 ± 0.875
PheTrp: 0.244 ± 0.149
PheTyr: 0.977 ± 0.418
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.686 ± 0.539
GlyCys: 1.221 ± 0.535
GlyAsp: 3.906 ± 0.395
GlyGlu: 1.953 ± 0.923
GlyPhe: 2.441 ± 1.161
GlyGly: 3.906 ± 1.164
GlyHis: 0.732 ± 0.29
GlyIle: 4.15 ± 1.895
GlyLys: 3.418 ± 1.077
GlyLeu: 7.812 ± 1.273
GlyMet: 2.441 ± 1.514
GlyAsn: 2.197 ± 0.64
GlyPro: 3.418 ± 0.996
GlyGln: 2.197 ± 1.156
GlyArg: 2.686 ± 0.674
GlySer: 3.662 ± 0.85
GlyThr: 3.174 ± 1.236
GlyVal: 2.441 ± 0.597
GlyTrp: 0.244 ± 0.149
GlyTyr: 2.441 ± 0.529
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.977 ± 0.376
HisCys: 0.0 ± 0.0
HisAsp: 0.977 ± 0.508
HisGlu: 0.732 ± 0.29
HisPhe: 1.221 ± 0.803
HisGly: 0.732 ± 0.446
HisHis: 0.244 ± 0.321
HisIle: 1.709 ± 0.655
HisLys: 0.732 ± 0.572
HisLeu: 2.686 ± 0.571
HisMet: 0.0 ± 0.0
HisAsn: 0.488 ± 0.451
HisPro: 1.709 ± 0.69
HisGln: 1.221 ± 0.676
HisArg: 0.732 ± 0.29
HisSer: 2.197 ± 0.441
HisThr: 0.488 ± 0.451
HisVal: 1.465 ± 0.622
HisTrp: 0.977 ± 0.399
HisTyr: 0.977 ± 0.508
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.686 ± 1.691
IleCys: 1.709 ± 0.53
IleAsp: 3.906 ± 1.275
IleGlu: 2.93 ± 0.942
IlePhe: 3.662 ± 0.711
IleGly: 2.686 ± 0.865
IleHis: 1.709 ± 0.798
IleIle: 4.15 ± 1.289
IleLys: 2.686 ± 1.066
IleLeu: 6.348 ± 1.701
IleMet: 0.732 ± 0.29
IleAsn: 3.174 ± 0.661
IlePro: 4.395 ± 1.408
IleGln: 0.977 ± 0.373
IleArg: 3.662 ± 1.007
IleSer: 5.127 ± 1.916
IleThr: 4.883 ± 1.254
IleVal: 4.395 ± 1.783
IleTrp: 1.709 ± 0.431
IleTyr: 1.953 ± 0.938
IleXaa: 0.0 ± 0.0

Lys
LysAla: 1.465 ± 0.599
LysCys: 0.977 ± 0.399
LysAsp: 2.686 ± 0.596
LysGlu: 2.686 ± 0.892
LysPhe: 2.441 ± 0.475
LysGly: 2.686 ± 0.94
LysHis: 0.732 ± 0.3
LysIle: 5.371 ± 1.073
LysLys: 4.15 ± 0.438
LysLeu: 6.592 ± 1.665
LysMet: 2.686 ± 1.109
LysAsn: 1.465 ± 1.158
LysPro: 2.441 ± 0.529
LysGln: 2.197 ± 1.018
LysArg: 6.104 ± 1.685
LysSer: 4.395 ± 0.268
LysThr: 4.639 ± 0.448
LysVal: 5.371 ± 0.44
LysTrp: 0.244 ± 0.149
LysTyr: 1.221 ± 0.676
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.348 ± 0.92
LeuCys: 1.953 ± 0.585
LeuAsp: 7.324 ± 1.724
LeuGlu: 5.615 ± 1.295
LeuPhe: 2.93 ± 1.48
LeuGly: 5.615 ± 0.875
LeuHis: 0.977 ± 0.399
LeuIle: 7.324 ± 2.43
LeuLys: 5.615 ± 0.947
LeuLeu: 8.545 ± 2.714
LeuMet: 4.395 ± 0.713
LeuAsn: 4.15 ± 0.745
LeuPro: 3.418 ± 0.803
LeuGln: 3.906 ± 1.617
LeuArg: 8.545 ± 2.89
LeuSer: 9.033 ± 2.445
LeuThr: 3.418 ± 1.253
LeuVal: 7.568 ± 0.491
LeuTrp: 2.441 ± 1.299
LeuTyr: 3.906 ± 1.164
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.709 ± 1.07
MetCys: 0.488 ± 0.268
MetAsp: 1.221 ± 0.535
MetGlu: 1.221 ± 0.85
MetPhe: 1.221 ± 0.565
MetGly: 0.244 ± 0.321
MetHis: 0.0 ± 0.0
MetIle: 1.709 ± 0.576
MetLys: 2.197 ± 1.357
MetLeu: 2.686 ± 0.572
MetMet: 0.488 ± 0.268
MetAsn: 2.686 ± 1.853
MetPro: 0.0 ± 0.0
MetGln: 1.953 ± 1.574
MetArg: 1.709 ± 0.709
MetSer: 3.662 ± 1.327
MetThr: 2.686 ± 0.533
MetVal: 1.221 ± 0.492
MetTrp: 0.0 ± 0.0
MetTyr: 0.244 ± 0.149
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.465 ± 0.803
AsnCys: 0.732 ± 0.3
AsnAsp: 2.441 ± 0.738
AsnGlu: 0.244 ± 0.149
AsnPhe: 3.418 ± 1.24
AsnGly: 1.953 ± 0.741
AsnHis: 1.953 ± 0.708
AsnIle: 2.686 ± 1.334
AsnLys: 3.174 ± 0.807
AsnLeu: 5.127 ± 0.538
AsnMet: 1.953 ± 1.524
AsnAsn: 0.732 ± 0.3
AsnPro: 3.174 ± 0.803
AsnGln: 0.732 ± 0.29
AsnArg: 2.686 ± 0.74
AsnSer: 5.127 ± 0.94
AsnThr: 1.465 ± 0.534
AsnVal: 0.977 ± 0.319
AsnTrp: 1.221 ± 0.697
AsnTyr: 1.221 ± 0.492
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.709 ± 0.765
ProCys: 0.244 ± 0.289
ProAsp: 3.174 ± 1.046
ProGlu: 4.639 ± 0.988
ProPhe: 0.732 ± 0.361
ProGly: 3.174 ± 1.576
ProHis: 0.977 ± 0.634
ProIle: 2.686 ± 0.66
ProLys: 1.953 ± 0.798
ProLeu: 5.859 ± 1.128
ProMet: 0.244 ± 0.149
ProAsn: 2.197 ± 1.111
ProPro: 2.93 ± 1.192
ProGln: 0.977 ± 0.595
ProArg: 1.465 ± 0.421
ProSer: 8.057 ± 1.151
ProThr: 2.441 ± 0.775
ProVal: 3.418 ± 1.047
ProTrp: 0.244 ± 0.289
ProTyr: 1.465 ± 0.421
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.662 ± 2.266
GlnCys: 0.244 ± 0.404
GlnAsp: 1.953 ± 0.666
GlnGlu: 3.174 ± 1.492
GlnPhe: 1.221 ± 0.744
GlnGly: 2.441 ± 0.932
GlnHis: 0.977 ± 0.399
GlnIle: 3.906 ± 1.916
GlnLys: 1.953 ± 0.604
GlnLeu: 2.686 ± 0.397
GlnMet: 0.732 ± 0.39
GlnAsn: 0.732 ± 0.29
GlnPro: 0.244 ± 0.149
GlnGln: 1.221 ± 0.663
GlnArg: 1.221 ± 0.522
GlnSer: 3.174 ± 0.327
GlnThr: 1.953 ± 0.901
GlnVal: 1.221 ± 0.744
GlnTrp: 0.488 ± 0.368
GlnTyr: 0.488 ± 0.268
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.441 ± 0.96
ArgCys: 1.221 ± 0.616
ArgAsp: 4.15 ± 0.492
ArgGlu: 4.639 ± 0.62
ArgPhe: 2.686 ± 0.436
ArgGly: 2.686 ± 1.373
ArgHis: 1.465 ± 0.474
ArgIle: 2.686 ± 0.559
ArgLys: 3.174 ± 0.626
ArgLeu: 5.127 ± 0.724
ArgMet: 2.686 ± 0.885
ArgAsn: 2.197 ± 1.086
ArgPro: 2.197 ± 0.644
ArgGln: 2.197 ± 0.703
ArgArg: 2.197 ± 1.224
ArgSer: 5.371 ± 1.176
ArgThr: 3.174 ± 0.331
ArgVal: 3.174 ± 1.339
ArgTrp: 1.221 ± 0.744
ArgTyr: 2.686 ± 0.353
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.104 ± 0.949
SerCys: 1.953 ± 0.458
SerAsp: 4.639 ± 1.486
SerGlu: 5.615 ± 2.067
SerPhe: 4.15 ± 0.574
SerGly: 6.836 ± 1.393
SerHis: 1.953 ± 0.585
SerIle: 5.615 ± 1.585
SerLys: 6.348 ± 1.266
SerLeu: 8.545 ± 2.616
SerMet: 1.465 ± 0.592
SerAsn: 4.395 ± 1.43
SerPro: 4.395 ± 1.152
SerGln: 2.93 ± 0.879
SerArg: 6.836 ± 2.491
SerSer: 10.01 ± 2.167
SerThr: 4.639 ± 0.739
SerVal: 4.639 ± 0.782
SerTrp: 1.953 ± 0.798
SerTyr: 4.883 ± 1.669
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.197 ± 1.569
ThrCys: 0.732 ± 0.3
ThrAsp: 1.465 ± 0.656
ThrGlu: 2.686 ± 0.353
ThrPhe: 0.977 ± 0.319
ThrGly: 3.906 ± 0.802
ThrHis: 1.221 ± 0.492
ThrIle: 2.441 ± 0.983
ThrLys: 1.709 ± 0.756
ThrLeu: 6.592 ± 0.368
ThrMet: 1.709 ± 1.041
ThrAsn: 3.174 ± 0.405
ThrPro: 1.465 ± 0.531
ThrGln: 1.953 ± 0.696
ThrArg: 4.639 ± 0.934
ThrSer: 2.441 ± 0.529
ThrThr: 4.395 ± 1.67
ThrVal: 2.93 ± 0.979
ThrTrp: 1.221 ± 0.564
ThrTyr: 2.441 ± 0.927
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.174 ± 1.05
ValCys: 0.488 ± 0.508
ValAsp: 2.686 ± 0.483
ValGlu: 3.418 ± 0.843
ValPhe: 3.174 ± 1.12
ValGly: 4.395 ± 1.358
ValHis: 1.465 ± 0.51
ValIle: 2.93 ± 0.581
ValLys: 4.395 ± 1.509
ValLeu: 4.15 ± 1.443
ValMet: 0.244 ± 0.149
ValAsn: 4.395 ± 1.631
ValPro: 3.418 ± 0.673
ValGln: 1.709 ± 1.041
ValArg: 2.686 ± 1.109
ValSer: 6.836 ± 0.696
ValThr: 2.93 ± 0.295
ValVal: 2.686 ± 0.539
ValTrp: 0.244 ± 0.149
ValTyr: 2.197 ± 0.783
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 2.441 ± 1.394
TrpCys: 0.488 ± 0.485
TrpAsp: 0.488 ± 0.368
TrpGlu: 0.732 ± 0.3
TrpPhe: 0.244 ± 0.149
TrpGly: 0.977 ± 0.595
TrpHis: 0.488 ± 0.298
TrpIle: 1.465 ± 0.656
TrpLys: 0.977 ± 0.807
TrpLeu: 1.709 ± 0.576
TrpMet: 0.0 ± 0.0
TrpAsn: 1.465 ± 0.488
TrpPro: 0.244 ± 0.149
TrpGln: 0.0 ± 0.0
TrpArg: 0.732 ± 0.446
TrpSer: 1.465 ± 0.331
TrpThr: 0.732 ± 0.29
TrpVal: 0.977 ± 0.376
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.244 ± 0.149
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 0.488 ± 0.298
TyrCys: 0.244 ± 0.149
TyrAsp: 2.197 ± 0.789
TyrGlu: 1.709 ± 0.808
TyrPhe: 1.709 ± 0.536
TyrGly: 1.221 ± 0.492
TyrHis: 0.244 ± 0.149
TyrIle: 1.709 ± 0.796
TyrLys: 4.395 ± 1.054
TyrLeu: 5.127 ± 1.057
TyrMet: 1.953 ± 0.926
TyrAsn: 1.465 ± 0.656
TyrPro: 1.221 ± 0.342
TyrGln: 0.732 ± 0.446
TyrArg: 0.977 ± 0.376
TyrSer: 4.639 ± 1.013
TyrThr: 1.709 ± 1.076
TyrVal: 2.686 ± 1.387
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.488 ± 0.298
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 7 proteins (4097 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
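One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequencies are recomputed for each resample, and the spread across resamples is reported as the error. This is a hedged illustration only. The function names, the fixed seed, the use of the sample standard deviation, and the within-protein counting are assumptions; the page does not state the exact procedure used by the database.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Dipeptide frequencies in per mille, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()} if total else {}


def bootstrap_error(proteins, n_resamples=100, seed=0):
    """Estimate a per-dipeptide error by resampling whole proteins with replacement."""
    rng = random.Random(seed)
    keys = sorted(dipeptide_per_mille(proteins))
    replicates = {dp: [] for dp in keys}
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resampled = rng.choices(proteins, k=len(proteins))
        freqs = dipeptide_per_mille(resampled)
        for dp in keys:
            # A dipeptide absent from this resample contributes a frequency of 0.
            replicates[dp].append(freqs.get(dp, 0.0))
    # Assumption: the reported error is the standard deviation across resamples.
    return {dp: statistics.stdev(values) for dp, values in replicates.items()}


# Toy sequences, not the real EBLV1 proteome.
toy = ["ACDA", "AAAA", "CDCD"]
errors = bootstrap_error(toy, n_resamples=100, seed=1)
```

With only 7 proteins in this proteome, each resample can differ considerably from the original set, which may explain why some errors in the table are large relative to the values.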

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski