Amino acid dipeptide frequency for Elderberry carlavirus C

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
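As a minimal sketch of how such frequencies can be computed, the Python snippet below counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_per_mille`, the use of one-letter codes, the mapping of any non-standard letter to `X` (Xaa), and the choice to count pairs only within each protein are assumptions made for illustration; the exact procedure used by Proteome-pI is not described on this page.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Hypothetical helper: pairs are counted within each protein only, and any
    letter outside the 20 standard ones is treated as 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # zip(cleaned, cleaned[1:]) yields every overlapping pair of neighbours
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation over the 400 standard dipeptides, in per mille
expected_uniform = 1000.0 / 400

# Toy input, not the real proteome
freqs = dipeptide_per_mille(["MKLLA", "AAKB"])
```

With the toy input, seven pairs are counted in total, and the non-standard letter `B` ends up in the `KX` pair.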

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
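The unit conversion and the comparison against the uniform expectation can be illustrated with a short Python sketch. The three values are taken from the table on this page; the helper names and the rule of comparing each value directly with 2.5‰ (ignoring the error estimate) are assumptions, since the page does not state the exact threshold used for colouring.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Values copied from the table on this page
examples = {"LeuLeu": 10.65, "AlaAla": 6.745, "TrpTrp": 0.0}
for name, value in examples.items():
    print(name, per_mille_to_fraction(value), classify(value))
```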

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.745 ± 2.265
AlaCys: 1.42 ± 0.917
AlaAsp: 5.325 ± 2.417
AlaGlu: 4.615 ± 1.838
AlaPhe: 3.195 ± 1.497
AlaGly: 5.68 ± 1.172
AlaHis: 1.775 ± 0.662
AlaIle: 7.1 ± 3.65
AlaLys: 6.745 ± 2.837
AlaLeu: 9.585 ± 1.923
AlaMet: 2.485 ± 1.018
AlaAsn: 4.26 ± 0.832
AlaPro: 3.195 ± 0.789
AlaGln: 2.84 ± 1.218
AlaArg: 4.615 ± 2.141
AlaSer: 3.55 ± 1.938
AlaThr: 4.615 ± 0.782
AlaVal: 5.68 ± 2.75
AlaTrp: 1.065 ± 0.582
AlaTyr: 1.42 ± 0.917
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 3.55 ± 1.386
CysCys: 0.71 ± 1.036
CysAsp: 0.355 ± 1.039
CysGlu: 0.71 ± 0.388
CysPhe: 2.485 ± 0.856
CysGly: 1.775 ± 0.969
CysHis: 0.355 ± 0.194
CysIle: 1.775 ± 0.969
CysLys: 1.42 ± 0.775
CysLeu: 1.775 ± 0.766
CysMet: 0.355 ± 0.194
CysAsn: 0.355 ± 1.142
CysPro: 0.355 ± 0.811
CysGln: 0.71 ± 0.388
CysArg: 2.13 ± 0.985
CysSer: 2.485 ± 1.005
CysThr: 3.195 ± 1.349
CysVal: 1.42 ± 1.038
CysTrp: 0.355 ± 1.039
CysTyr: 0.71 ± 0.727
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.13 ± 0.793
AspCys: 1.42 ± 0.775
AspAsp: 1.065 ± 0.687
AspGlu: 6.035 ± 1.5
AspPhe: 1.775 ± 0.662
AspGly: 4.97 ± 0.727
AspHis: 0.71 ± 0.388
AspIle: 2.13 ± 0.612
AspLys: 1.42 ± 0.569
AspLeu: 4.97 ± 1.583
AspMet: 1.065 ± 0.603
AspAsn: 1.42 ± 0.701
AspPro: 2.485 ± 1.276
AspGln: 1.42 ± 0.701
AspArg: 2.485 ± 0.898
AspSer: 1.775 ± 1.081
AspThr: 0.71 ± 0.388
AspVal: 2.485 ± 1.001
AspTrp: 1.42 ± 1.049
AspTyr: 2.84 ± 0.89
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.1 ± 2.629
GluCys: 0.71 ± 0.388
GluAsp: 1.065 ± 0.582
GluGlu: 8.165 ± 2.08
GluPhe: 1.775 ± 0.969
GluGly: 4.615 ± 1.227
GluHis: 3.55 ± 2.161
GluIle: 4.615 ± 1.261
GluLys: 4.26 ± 2.326
GluLeu: 6.035 ± 2.179
GluMet: 2.13 ± 1.065
GluAsn: 2.13 ± 0.985
GluPro: 3.195 ± 1.456
GluGln: 1.775 ± 0.969
GluArg: 4.97 ± 1.785
GluSer: 3.905 ± 0.782
GluThr: 2.13 ± 0.793
GluVal: 6.39 ± 1.583
GluTrp: 0.0 ± 0.0
GluTyr: 2.13 ± 0.87
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.55 ± 1.463
PheCys: 0.355 ± 0.194
PheAsp: 2.13 ± 1.106
PheGlu: 6.39 ± 2.621
PhePhe: 0.355 ± 0.655
PheGly: 3.55 ± 1.19
PheHis: 0.71 ± 0.727
PheIle: 1.065 ± 0.687
PheLys: 1.42 ± 0.701
PheLeu: 5.68 ± 3.101
PheMet: 0.355 ± 0.194
PheAsn: 1.065 ± 0.533
PhePro: 1.065 ± 0.533
PheGln: 1.065 ± 0.582
PheArg: 2.13 ± 1.453
PheSer: 2.84 ± 1.111
PheThr: 2.84 ± 1.15
PheVal: 2.84 ± 2.392
PheTrp: 0.355 ± 0.194
PheTyr: 0.355 ± 0.194
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.68 ± 1.715
GlyCys: 1.42 ± 0.891
GlyAsp: 3.55 ± 1.458
GlyGlu: 3.55 ± 0.779
GlyPhe: 1.775 ± 0.655
GlyGly: 3.55 ± 2.086
GlyHis: 0.355 ± 1.087
GlyIle: 2.84 ± 1.205
GlyLys: 4.26 ± 1.225
GlyLeu: 5.325 ± 2.51
GlyMet: 0.355 ± 0.194
GlyAsn: 1.775 ± 0.969
GlyPro: 3.905 ± 0.713
GlyGln: 1.065 ± 0.582
GlyArg: 4.615 ± 1.47
GlySer: 5.68 ± 2.475
GlyThr: 3.195 ± 1.254
GlyVal: 7.455 ± 1.573
GlyTrp: 0.71 ± 0.388
GlyTyr: 2.485 ± 1.001
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.84 ± 0.862
HisCys: 0.71 ± 0.727
HisAsp: 1.775 ± 0.903
HisGlu: 1.42 ± 0.775
HisPhe: 1.42 ± 0.746
HisGly: 2.13 ± 1.743
HisHis: 0.355 ± 0.194
HisIle: 1.775 ± 0.984
HisLys: 2.485 ± 0.945
HisLeu: 3.905 ± 1.112
HisMet: 0.71 ± 1.228
HisAsn: 1.775 ± 1.081
HisPro: 0.71 ± 0.565
HisGln: 0.355 ± 0.194
HisArg: 1.065 ± 0.687
HisSer: 3.905 ± 1.34
HisThr: 0.0 ± 0.0
HisVal: 1.42 ± 1.093
HisTrp: 0.0 ± 0.0
HisTyr: 1.42 ± 0.917
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.26 ± 3.955
IleCys: 2.84 ± 0.994
IleAsp: 4.26 ± 0.898
IleGlu: 2.84 ± 1.111
IlePhe: 2.84 ± 0.947
IleGly: 2.485 ± 1.401
IleHis: 0.71 ± 0.388
IleIle: 1.065 ± 1.174
IleLys: 2.84 ± 1.15
IleLeu: 4.615 ± 1.312
IleMet: 0.355 ± 0.194
IleAsn: 1.42 ± 0.891
IlePro: 1.42 ± 1.091
IleGln: 1.42 ± 1.13
IleArg: 2.84 ± 0.906
IleSer: 5.325 ± 1.481
IleThr: 4.97 ± 1.729
IleVal: 4.97 ± 0.771
IleTrp: 0.0 ± 0.0
IleTyr: 1.42 ± 0.775
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.905 ± 1.172
LysCys: 0.71 ± 0.953
LysAsp: 2.485 ± 1.286
LysGlu: 4.26 ± 2.326
LysPhe: 2.485 ± 1.357
LysGly: 2.13 ± 1.163
LysHis: 1.065 ± 0.582
LysIle: 3.195 ± 0.937
LysLys: 6.035 ± 1.793
LysLeu: 6.745 ± 1.394
LysMet: 0.355 ± 0.194
LysAsn: 4.26 ± 1.39
LysPro: 3.195 ± 1.284
LysGln: 0.71 ± 1.036
LysArg: 3.55 ± 1.139
LysSer: 3.905 ± 1.628
LysThr: 4.26 ± 1.069
LysVal: 2.84 ± 0.947
LysTrp: 1.065 ± 0.533
LysTyr: 1.42 ± 0.775
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.68 ± 2.832
LeuCys: 3.905 ± 1.513
LeuAsp: 4.26 ± 1.372
LeuGlu: 8.165 ± 2.134
LeuPhe: 3.195 ± 1.745
LeuGly: 7.455 ± 1.324
LeuHis: 4.97 ± 1.729
LeuIle: 4.97 ± 1.146
LeuLys: 8.165 ± 2.062
LeuLeu: 10.65 ± 1.738
LeuMet: 1.775 ± 0.662
LeuAsn: 4.26 ± 1.33
LeuPro: 5.325 ± 1.706
LeuGln: 3.55 ± 3.112
LeuArg: 3.55 ± 0.932
LeuSer: 6.39 ± 2.384
LeuThr: 6.39 ± 1.12
LeuVal: 6.035 ± 0.884
LeuTrp: 0.71 ± 0.953
LeuTyr: 3.195 ± 2.615
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.84 ± 1.111
MetCys: 0.71 ± 0.388
MetAsp: 0.355 ± 0.194
MetGlu: 1.775 ± 0.766
MetPhe: 0.355 ± 0.655
MetGly: 1.42 ± 1.091
MetHis: 0.71 ± 0.388
MetIle: 1.065 ± 0.533
MetLys: 0.355 ± 0.194
MetLeu: 2.485 ± 1.085
MetMet: 0.355 ± 0.194
MetAsn: 0.0 ± 0.0
MetPro: 1.065 ± 0.959
MetGln: 1.065 ± 0.921
MetArg: 2.13 ± 0.793
MetSer: 0.0 ± 0.0
MetThr: 0.71 ± 1.311
MetVal: 1.065 ± 0.533
MetTrp: 0.0 ± 0.0
MetTyr: 0.71 ± 0.953
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.485 ± 1.03
AsnCys: 2.13 ± 1.163
AsnAsp: 2.485 ± 1.357
AsnGlu: 1.42 ± 0.569
AsnPhe: 1.065 ± 0.582
AsnGly: 1.42 ± 0.569
AsnHis: 0.71 ± 1.036
AsnIle: 1.775 ± 0.969
AsnLys: 2.13 ± 1.106
AsnLeu: 4.615 ± 1.585
AsnMet: 2.13 ± 0.836
AsnAsn: 0.71 ± 0.565
AsnPro: 0.71 ± 1.311
AsnGln: 0.71 ± 0.989
AsnArg: 2.84 ± 1.551
AsnSer: 1.775 ± 1.858
AsnThr: 2.485 ± 0.63
AsnVal: 2.13 ± 1.034
AsnTrp: 0.355 ± 0.194
AsnTyr: 2.485 ± 0.945
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.26 ± 1.963
ProCys: 1.42 ± 0.775
ProAsp: 2.84 ± 1.129
ProGlu: 3.195 ± 1.2
ProPhe: 1.065 ± 0.582
ProGly: 1.775 ± 0.942
ProHis: 2.485 ± 2.058
ProIle: 1.065 ± 0.533
ProLys: 1.775 ± 0.662
ProLeu: 4.26 ± 3.255
ProMet: 0.355 ± 0.194
ProAsn: 1.065 ± 0.872
ProPro: 4.615 ± 3.607
ProGln: 2.13 ± 0.953
ProArg: 2.13 ± 1.106
ProSer: 2.485 ± 0.63
ProThr: 3.905 ± 2.766
ProVal: 2.485 ± 1.612
ProTrp: 0.355 ± 0.194
ProTyr: 3.195 ± 1.333
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.13 ± 1.695
GlnCys: 0.0 ± 0.0
GlnAsp: 0.355 ± 0.194
GlnGlu: 0.355 ± 0.194
GlnPhe: 1.775 ± 0.969
GlnGly: 0.355 ± 0.194
GlnHis: 1.065 ± 0.959
GlnIle: 1.065 ± 1.665
GlnLys: 1.065 ± 0.582
GlnLeu: 3.55 ± 1.479
GlnMet: 0.71 ± 1.272
GlnAsn: 1.42 ± 1.226
GlnPro: 2.485 ± 1.501
GlnGln: 0.0 ± 0.0
GlnArg: 1.065 ± 0.959
GlnSer: 2.485 ± 1.011
GlnThr: 2.485 ± 1.03
GlnVal: 1.065 ± 0.582
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.42 ± 0.775
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.035 ± 1.385
ArgCys: 1.775 ± 1.844
ArgAsp: 1.42 ± 1.13
ArgGlu: 2.485 ± 1.001
ArgPhe: 3.55 ± 1.19
ArgGly: 3.905 ± 0.711
ArgHis: 3.55 ± 2.203
ArgIle: 2.485 ± 0.63
ArgLys: 2.485 ± 1.276
ArgLeu: 5.68 ± 1.801
ArgMet: 1.775 ± 0.969
ArgAsn: 2.485 ± 0.945
ArgPro: 1.775 ± 1.558
ArgGln: 1.42 ± 1.86
ArgArg: 4.615 ± 2.863
ArgSer: 3.55 ± 1.463
ArgThr: 3.905 ± 1.086
ArgVal: 2.84 ± 2.283
ArgTrp: 1.42 ± 0.888
ArgTyr: 2.84 ± 1.137
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.745 ± 2.382
SerCys: 1.42 ± 0.775
SerAsp: 4.615 ± 0.988
SerGlu: 4.26 ± 1.842
SerPhe: 3.55 ± 1.463
SerGly: 4.26 ± 1.278
SerHis: 2.485 ± 1.357
SerIle: 2.13 ± 1.453
SerLys: 3.55 ± 1.611
SerLeu: 4.615 ± 0.779
SerMet: 0.0 ± 0.0
SerAsn: 1.775 ± 0.655
SerPro: 1.775 ± 0.969
SerGln: 1.42 ± 0.917
SerArg: 5.325 ± 1.002
SerSer: 6.745 ± 1.795
SerThr: 3.905 ± 1.078
SerVal: 6.39 ± 2.968
SerTrp: 0.355 ± 0.194
SerTyr: 2.84 ± 2.181
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.97 ± 2.041
ThrCys: 1.42 ± 0.888
ThrAsp: 2.13 ± 0.793
ThrGlu: 2.84 ± 1.303
ThrPhe: 4.615 ± 1.865
ThrGly: 4.615 ± 2.792
ThrHis: 1.42 ± 0.746
ThrIle: 4.615 ± 1.92
ThrLys: 3.195 ± 2.071
ThrLeu: 5.325 ± 1.504
ThrMet: 1.775 ± 0.918
ThrAsn: 2.13 ± 1.106
ThrPro: 4.26 ± 1.167
ThrGln: 0.71 ± 0.388
ThrArg: 2.84 ± 0.994
ThrSer: 4.26 ± 1.519
ThrThr: 1.775 ± 1.987
ThrVal: 1.775 ± 0.969
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.13 ± 0.87
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.39 ± 1.573
ValCys: 2.84 ± 2.153
ValAsp: 2.84 ± 0.703
ValGlu: 3.55 ± 0.956
ValPhe: 1.775 ± 1.401
ValGly: 5.325 ± 2.363
ValHis: 2.485 ± 1.085
ValIle: 4.615 ± 1.836
ValLys: 2.84 ± 0.947
ValLeu: 7.1 ± 4.22
ValMet: 1.065 ± 0.582
ValAsn: 2.13 ± 1.163
ValPro: 3.55 ± 3.223
ValGln: 1.42 ± 0.775
ValArg: 2.84 ± 0.703
ValSer: 5.325 ± 2.363
ValThr: 3.905 ± 2.273
ValVal: 3.195 ± 1.344
ValTrp: 0.71 ± 1.369
ValTyr: 1.775 ± 0.915
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.71 ± 1.036
TrpCys: 0.355 ± 0.194
TrpAsp: 0.355 ± 0.194
TrpGlu: 0.355 ± 1.039
TrpPhe: 0.0 ± 0.0
TrpGly: 0.71 ± 0.565
TrpHis: 0.355 ± 0.194
TrpIle: 1.065 ± 0.582
TrpLys: 0.355 ± 1.039
TrpLeu: 1.065 ± 0.582
TrpMet: 0.0 ± 0.0
TrpAsn: 0.355 ± 0.655
TrpPro: 0.355 ± 0.194
TrpGln: 0.71 ± 0.565
TrpArg: 0.71 ± 1.797
TrpSer: 0.355 ± 0.194
TrpThr: 0.0 ± 0.0
TrpVal: 0.71 ± 0.388
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.71 ± 0.388
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.905 ± 2.076
TyrCys: 0.71 ± 0.727
TyrAsp: 1.42 ± 0.701
TyrGlu: 4.26 ± 1.372
TyrPhe: 1.065 ± 0.582
TyrGly: 1.42 ± 0.917
TyrHis: 0.355 ± 1.039
TyrIle: 2.485 ± 1.083
TyrLys: 1.775 ± 0.915
TyrLeu: 4.615 ± 1.312
TyrMet: 0.71 ± 0.565
TyrAsn: 1.775 ± 0.969
TyrPro: 1.42 ± 0.775
TyrGln: 0.355 ± 0.194
TyrArg: 3.55 ± 1.495
TyrSer: 1.42 ± 0.775
TyrThr: 1.42 ± 2.073
TyrVal: 2.485 ± 1.286
TyrTrp: 0.355 ± 0.194
TyrTyr: 1.065 ± 0.533
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2818 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
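A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The function names, the fixed random seed, one-letter residue codes, and the use of the sample standard deviation are assumptions; the page states only that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (e.g. 'LL') in per mille over all proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples.

    Hypothetical helper: each replicate draws len(proteins) proteins with
    replacement, so whole proteins (not residues) are the resampling unit.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not the real proteome
toy_proteins = ["MKLLA", "AAKLL", "GLLSV", "MSTAV"]
error = bootstrap_error(toy_proteins, "LL")
```

With only six proteins in this proteome, replicates can differ substantially from one another, which may explain why some errors in the table are as large as the values themselves.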

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski