Amino acid dipeptide frequency for Rabies lyssavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
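The comparison against the random expectation can be sketched as follows. This is only an illustration: it assumes the threshold is the uniform value of 1/400 (2.5‰) mentioned above, and the function name `classify` is hypothetical. The page does not state the exact rule used for the red and blue marking.

```python
# Uniform expectation: 20 x 20 = 400 dipeptides, each 1/400 of all dipeptides.
EXPECTED_PER_MILLE = 1000 / 400  # 2.5 per mille, i.e. 0.25%


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation.

    Assumption: a plain comparison with 2.5 per mille; the actual rule
    behind the colouring is not given in the text.
    """
    if freq_per_mille > expected:
        return "over"   # described as marked in red
    if freq_per_mille < expected:
        return "under"  # described as marked in blue
    return "expected"


# Three values copied from the table on this page.
examples = {"LeuSer": 11.111, "AlaAla": 3.056, "TrpTrp": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
```

Under this assumption LeuSer and AlaAla count as overrepresented, while TrpTrp counts as underrepresented.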

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
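A minimal sketch of how such per mille values could be computed from protein sequences is given below. It assumes that overlapping residue pairs are counted within each protein (never across protein boundaries), that any non-standard letter is mapped to X (Xaa), and that counts are normalised by the total number of dipeptides. None of these details is specified on this page, and the function names and toy sequences are hypothetical. One-letter codes are used here instead of the three-letter codes of the table.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids


def dipeptide_counts(sequences):
    """Count overlapping dipeptides within each sequence."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    return counts


def per_mille(counts):
    """Convert raw counts to frequencies in per mille."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


# Toy input, not real Rabies lyssavirus proteins.
toy = ["MKKLA", "AAB"]
counts = dipeptide_counts(toy)
freqs = per_mille(counts)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Multiplying any of the resulting values by 10⁻³ gives the corresponding fraction of all dipeptides.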

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.056 ± 1.867
AlaCys: 1.389 ± 0.937
AlaAsp: 2.5 ± 1.694
AlaGlu: 3.333 ± 1.033
AlaPhe: 0.833 ± 0.459
AlaGly: 2.5 ± 1.059
AlaHis: 1.944 ± 1.107
AlaIle: 3.611 ± 0.689
AlaLys: 1.111 ± 0.415
AlaLeu: 6.111 ± 0.48
AlaMet: 0.556 ± 0.351
AlaAsn: 1.944 ± 0.571
AlaPro: 1.944 ± 1.524
AlaGln: 2.222 ± 0.992
AlaArg: 3.611 ± 0.954
AlaSer: 3.333 ± 0.574
AlaThr: 0.833 ± 0.481
AlaVal: 1.944 ± 1.032
AlaTrp: 0.278 ± 0.153
AlaTyr: 1.389 ± 0.965
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.556 ± 0.822
CysCys: 0.833 ± 0.68
CysAsp: 0.556 ± 0.312
CysGlu: 0.0 ± 0.0
CysPhe: 0.556 ± 0.312
CysGly: 1.111 ± 0.624
CysHis: 0.556 ± 0.702
CysIle: 1.111 ± 0.982
CysLys: 0.278 ± 0.382
CysLeu: 2.222 ± 0.752
CysMet: 0.278 ± 0.373
CysAsn: 0.556 ± 0.306
CysPro: 1.111 ± 0.33
CysGln: 0.833 ± 0.477
CysArg: 0.833 ± 0.987
CysSer: 3.611 ± 0.702
CysThr: 1.111 ± 0.624
CysVal: 0.556 ± 0.382
CysTrp: 0.278 ± 0.153
CysTyr: 0.833 ± 0.352
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.5 ± 0.77
AspCys: 0.278 ± 0.411
AspAsp: 6.389 ± 2.928
AspGlu: 3.611 ± 2.063
AspPhe: 3.611 ± 1.085
AspGly: 3.056 ± 0.859
AspHis: 0.278 ± 0.153
AspIle: 2.778 ± 0.745
AspLys: 3.611 ± 0.664
AspLeu: 9.444 ± 1.925
AspMet: 1.111 ± 0.733
AspAsn: 2.778 ± 1.295
AspPro: 4.444 ± 0.585
AspGln: 3.056 ± 0.962
AspArg: 1.389 ± 0.516
AspSer: 2.778 ± 1.029
AspThr: 1.944 ± 0.976
AspVal: 1.944 ± 0.567
AspTrp: 0.833 ± 0.352
AspTyr: 3.333 ± 0.837
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.056 ± 1.132
GluCys: 0.556 ± 0.312
GluAsp: 5.556 ± 1.88
GluGlu: 4.722 ± 1.577
GluPhe: 1.944 ± 0.74
GluGly: 4.444 ± 0.757
GluHis: 1.111 ± 1.271
GluIle: 4.722 ± 1.228
GluLys: 2.778 ± 1.201
GluLeu: 4.722 ± 1.005
GluMet: 2.5 ± 0.631
GluAsn: 1.111 ± 0.376
GluPro: 2.222 ± 0.764
GluGln: 0.556 ± 0.57
GluArg: 2.778 ± 0.526
GluSer: 6.389 ± 1.757
GluThr: 3.056 ± 1.764
GluVal: 3.056 ± 0.761
GluTrp: 0.833 ± 0.41
GluTyr: 1.667 ± 1.238
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.111 ± 0.415
PheCys: 0.278 ± 0.549
PheAsp: 1.944 ± 0.801
PheGlu: 2.5 ± 1.831
PhePhe: 3.333 ± 1.619
PheGly: 1.667 ± 0.539
PheHis: 2.222 ± 0.543
PheIle: 1.667 ± 0.705
PheLys: 3.056 ± 0.718
PheLeu: 4.722 ± 0.874
PheMet: 0.278 ± 0.153
PheAsn: 2.222 ± 0.49
PhePro: 3.333 ± 0.96
PheGln: 2.5 ± 1.192
PheArg: 3.611 ± 0.79
PheSer: 5.0 ± 0.802
PheThr: 1.389 ± 0.621
PheVal: 3.056 ± 1.465
PheTrp: 0.278 ± 0.153
PheTyr: 0.833 ± 0.459
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.222 ± 0.652
GlyCys: 1.389 ± 0.621
GlyAsp: 3.611 ± 0.966
GlyGlu: 3.611 ± 1.819
GlyPhe: 2.5 ± 1.252
GlyGly: 5.0 ± 0.988
GlyHis: 1.111 ± 0.33
GlyIle: 1.944 ± 0.542
GlyLys: 4.444 ± 2.211
GlyLeu: 8.333 ± 0.93
GlyMet: 1.111 ± 0.929
GlyAsn: 3.056 ± 0.885
GlyPro: 2.778 ± 0.614
GlyGln: 1.389 ± 1.095
GlyArg: 3.611 ± 0.578
GlySer: 2.778 ± 0.745
GlyThr: 4.444 ± 1.458
GlyVal: 2.778 ± 1.348
GlyTrp: 0.556 ± 0.382
GlyTyr: 2.5 ± 0.506
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.111 ± 0.415
HisCys: 0.0 ± 0.0
HisAsp: 1.667 ± 0.617
HisGlu: 0.833 ± 0.352
HisPhe: 1.667 ± 0.606
HisGly: 0.556 ± 0.306
HisHis: 0.556 ± 0.545
HisIle: 2.222 ± 0.928
HisLys: 1.667 ± 0.851
HisLeu: 3.333 ± 0.995
HisMet: 0.278 ± 0.382
HisAsn: 0.556 ± 0.763
HisPro: 1.944 ± 0.672
HisGln: 1.111 ± 0.532
HisArg: 0.833 ± 0.477
HisSer: 2.222 ± 0.816
HisThr: 0.278 ± 0.411
HisVal: 1.667 ± 0.539
HisTrp: 1.111 ± 0.376
HisTyr: 0.833 ± 0.31
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.222 ± 1.191
IleCys: 0.833 ± 0.459
IleAsp: 3.333 ± 1.48
IleGlu: 2.778 ± 0.678
IlePhe: 3.333 ± 0.944
IleGly: 2.778 ± 1.168
IleHis: 2.5 ± 0.608
IleIle: 4.167 ± 0.77
IleLys: 3.611 ± 0.831
IleLeu: 6.667 ± 1.88
IleMet: 1.944 ± 0.422
IleAsn: 2.222 ± 0.972
IlePro: 3.056 ± 1.474
IleGln: 1.667 ± 0.71
IleArg: 3.333 ± 0.798
IleSer: 5.833 ± 1.069
IleThr: 4.722 ± 1.46
IleVal: 3.333 ± 1.078
IleTrp: 1.667 ± 0.552
IleTyr: 2.5 ± 0.743
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.944 ± 0.601
LysCys: 0.556 ± 0.312
LysAsp: 3.889 ± 1.025
LysGlu: 2.778 ± 1.096
LysPhe: 1.944 ± 0.723
LysGly: 2.778 ± 0.837
LysHis: 0.556 ± 0.312
LysIle: 3.889 ± 1.339
LysLys: 3.611 ± 0.934
LysLeu: 5.278 ± 1.126
LysMet: 1.944 ± 0.263
LysAsn: 1.667 ± 1.11
LysPro: 3.056 ± 0.714
LysGln: 0.833 ± 0.424
LysArg: 3.611 ± 0.745
LysSer: 5.556 ± 1.05
LysThr: 3.611 ± 1.233
LysVal: 5.0 ± 0.753
LysTrp: 0.833 ± 0.31
LysTyr: 1.667 ± 0.748
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.111 ± 1.124
LeuCys: 1.667 ± 0.611
LeuAsp: 5.278 ± 1.231
LeuGlu: 6.667 ± 1.717
LeuPhe: 3.889 ± 1.501
LeuGly: 6.389 ± 1.081
LeuHis: 1.667 ± 1.361
LeuIle: 7.222 ± 1.718
LeuLys: 7.222 ± 0.59
LeuLeu: 9.444 ± 1.859
LeuMet: 5.0 ± 1.567
LeuAsn: 3.889 ± 1.135
LeuPro: 3.333 ± 0.787
LeuGln: 2.5 ± 1.158
LeuArg: 8.056 ± 2.049
LeuSer: 11.111 ± 2.041
LeuThr: 5.0 ± 1.0
LeuVal: 7.5 ± 1.357
LeuTrp: 1.667 ± 0.954
LeuTyr: 5.0 ± 1.202
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.389 ± 0.758
MetCys: 0.833 ± 0.352
MetAsp: 1.111 ± 0.532
MetGlu: 1.111 ± 1.195
MetPhe: 1.111 ± 0.702
MetGly: 0.278 ± 0.411
MetHis: 0.278 ± 0.382
MetIle: 2.5 ± 0.743
MetLys: 0.278 ± 0.153
MetLeu: 2.222 ± 0.89
MetMet: 0.833 ± 0.352
MetAsn: 1.944 ± 1.873
MetPro: 0.556 ± 0.763
MetGln: 1.111 ± 0.532
MetArg: 1.667 ± 0.864
MetSer: 5.278 ± 1.872
MetThr: 2.222 ± 0.49
MetVal: 0.833 ± 0.41
MetTrp: 0.0 ± 0.0
MetTyr: 0.278 ± 0.153
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.667 ± 1.053
AsnCys: 1.389 ± 0.584
AsnAsp: 1.111 ± 0.612
AsnGlu: 1.111 ± 0.819
AsnPhe: 2.5 ± 1.482
AsnGly: 2.5 ± 1.769
AsnHis: 1.389 ± 0.575
AsnIle: 3.611 ± 1.369
AsnLys: 1.944 ± 0.601
AsnLeu: 4.444 ± 0.58
AsnMet: 1.111 ± 0.982
AsnAsn: 0.833 ± 0.424
AsnPro: 3.056 ± 0.845
AsnGln: 0.833 ± 0.481
AsnArg: 3.611 ± 1.354
AsnSer: 4.444 ± 0.693
AsnThr: 1.111 ± 0.415
AsnVal: 1.389 ± 0.778
AsnTrp: 1.667 ± 1.151
AsnTyr: 1.389 ± 0.516
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.389 ± 0.43
ProCys: 0.278 ± 0.153
ProAsp: 5.0 ± 1.93
ProGlu: 4.167 ± 0.996
ProPhe: 0.556 ± 0.351
ProGly: 2.5 ± 1.059
ProHis: 1.389 ± 0.678
ProIle: 2.778 ± 0.745
ProLys: 1.389 ± 0.765
ProLeu: 6.111 ± 1.513
ProMet: 0.556 ± 0.312
ProAsn: 2.222 ± 0.883
ProPro: 2.778 ± 1.439
ProGln: 1.111 ± 0.376
ProArg: 2.222 ± 0.752
ProSer: 6.667 ± 1.996
ProThr: 1.944 ± 0.772
ProVal: 2.222 ± 0.929
ProTrp: 0.278 ± 0.382
ProTyr: 2.222 ± 0.49
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.833 ± 0.481
GlnCys: 0.278 ± 0.549
GlnAsp: 1.111 ± 0.488
GlnGlu: 1.667 ± 0.779
GlnPhe: 1.111 ± 0.415
GlnGly: 1.667 ± 0.606
GlnHis: 1.389 ± 0.603
GlnIle: 3.056 ± 0.751
GlnLys: 1.111 ± 0.612
GlnLeu: 2.5 ± 1.152
GlnMet: 1.111 ± 0.763
GlnAsn: 0.833 ± 0.352
GlnPro: 0.278 ± 0.153
GlnGln: 0.278 ± 0.153
GlnArg: 3.056 ± 1.111
GlnSer: 2.5 ± 0.811
GlnThr: 2.778 ± 1.113
GlnVal: 2.778 ± 0.681
GlnTrp: 0.556 ± 0.491
GlnTyr: 0.278 ± 0.411
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.889 ± 0.855
ArgCys: 1.111 ± 0.407
ArgAsp: 2.222 ± 0.733
ArgGlu: 4.444 ± 1.216
ArgPhe: 2.778 ± 0.814
ArgGly: 4.167 ± 1.274
ArgHis: 1.667 ± 0.684
ArgIle: 2.778 ± 1.3
ArgLys: 3.333 ± 1.078
ArgLeu: 5.278 ± 0.683
ArgMet: 1.944 ± 0.536
ArgAsn: 2.5 ± 0.864
ArgPro: 2.222 ± 0.302
ArgGln: 1.944 ± 0.668
ArgArg: 2.778 ± 0.995
ArgSer: 4.444 ± 1.478
ArgThr: 3.889 ± 0.729
ArgVal: 5.556 ± 1.288
ArgTrp: 1.111 ± 0.612
ArgTyr: 1.944 ± 0.263
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.722 ± 1.532
SerCys: 2.222 ± 0.889
SerAsp: 5.278 ± 1.068
SerGlu: 5.833 ± 1.74
SerPhe: 4.444 ± 0.757
SerGly: 7.222 ± 2.102
SerHis: 2.222 ± 0.49
SerIle: 4.444 ± 1.383
SerLys: 6.944 ± 2.441
SerLeu: 10.833 ± 1.717
SerMet: 1.389 ± 0.679
SerAsn: 2.5 ± 0.924
SerPro: 4.722 ± 0.831
SerGln: 2.5 ± 1.192
SerArg: 7.5 ± 1.695
SerSer: 8.889 ± 0.739
SerThr: 4.722 ± 0.843
SerVal: 5.833 ± 0.815
SerTrp: 2.222 ± 0.89
SerTyr: 3.611 ± 0.861
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.778 ± 1.296
ThrCys: 1.944 ± 1.346
ThrAsp: 2.222 ± 0.83
ThrGlu: 1.111 ± 0.525
ThrPhe: 1.111 ± 0.33
ThrGly: 3.611 ± 0.79
ThrHis: 1.389 ± 0.514
ThrIle: 2.778 ± 1.546
ThrLys: 1.667 ± 0.977
ThrLeu: 5.278 ± 1.757
ThrMet: 1.944 ± 1.071
ThrAsn: 2.778 ± 1.625
ThrPro: 1.944 ± 0.733
ThrGln: 2.222 ± 0.757
ThrArg: 3.889 ± 1.199
ThrSer: 3.333 ± 0.628
ThrThr: 3.056 ± 1.067
ThrVal: 4.167 ± 1.971
ThrTrp: 1.944 ± 0.664
ThrTyr: 2.778 ± 0.681
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.222 ± 0.814
ValCys: 1.111 ± 0.548
ValAsp: 3.333 ± 0.585
ValGlu: 4.167 ± 1.218
ValPhe: 5.556 ± 1.228
ValGly: 4.167 ± 1.508
ValHis: 1.389 ± 0.603
ValIle: 3.889 ± 0.703
ValLys: 2.778 ± 1.67
ValLeu: 5.278 ± 0.831
ValMet: 0.278 ± 0.412
ValAsn: 4.167 ± 0.622
ValPro: 3.333 ± 0.839
ValGln: 1.667 ± 0.72
ValArg: 1.389 ± 0.43
ValSer: 6.667 ± 1.234
ValThr: 3.333 ± 0.785
ValVal: 3.056 ± 0.897
ValTrp: 0.278 ± 0.153
ValTyr: 1.944 ± 0.578
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.111 ± 0.548
TrpCys: 0.556 ± 0.702
TrpAsp: 0.556 ± 0.491
TrpGlu: 1.111 ± 0.376
TrpPhe: 0.278 ± 0.153
TrpGly: 1.389 ± 0.484
TrpHis: 0.556 ± 0.306
TrpIle: 1.111 ± 0.612
TrpLys: 0.833 ± 0.31
TrpLeu: 1.944 ± 0.549
TrpMet: 0.278 ± 0.382
TrpAsn: 1.111 ± 0.376
TrpPro: 0.556 ± 0.306
TrpGln: 0.0 ± 0.0
TrpArg: 0.278 ± 0.153
TrpSer: 2.222 ± 0.819
TrpThr: 0.556 ± 0.351
TrpVal: 1.667 ± 0.552
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.278 ± 0.153
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.556 ± 0.306
TyrCys: 0.278 ± 0.153
TyrAsp: 2.5 ± 0.677
TyrGlu: 2.222 ± 0.934
TyrPhe: 1.944 ± 1.032
TyrGly: 1.389 ± 0.516
TyrHis: 0.278 ± 0.382
TyrIle: 2.222 ± 0.89
TyrLys: 3.333 ± 1.248
TyrLeu: 4.444 ± 1.536
TyrMet: 1.111 ± 0.33
TyrAsn: 2.222 ± 0.662
TyrPro: 1.111 ± 0.33
TyrGln: 0.833 ± 0.459
TyrArg: 1.944 ± 0.571
TyrSer: 5.278 ± 1.615
TyrThr: 1.944 ± 1.479
TyrVal: 1.667 ± 1.011
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.556 ± 0.306
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3601 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
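Protein-level bootstrapping of this kind can be sketched as follows. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the sample standard deviation of those values is reported as the error. Whether the ± values on this page are standard deviations is not stated, and the function names, the fixed seed and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, pair))
    # Assumption: the error is taken as the sample standard deviation.
    return statistics.stdev(estimates)


# Toy proteome, not real Rabies lyssavirus proteins.
toy_proteome = ["AAAA", "KKKK", "AKAK"]
error = bootstrap_error(toy_proteome, "AA")
print(round(error, 3))
```

With only five proteins in the proteome, each resample can differ considerably from the original set, which is consistent with the relatively large errors next to many of the values above.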

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski