Amino acid dipepetide frequency for Cereal yellow dwarf virus RPV

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.825AlaAla: 10.825 ± 3.127
0.515AlaCys: 0.515 ± 0.392
3.093AlaAsp: 3.093 ± 0.712
3.093AlaGlu: 3.093 ± 0.995
4.639AlaPhe: 4.639 ± 2.068
6.186AlaGly: 6.186 ± 1.209
1.546AlaHis: 1.546 ± 0.985
4.639AlaIle: 4.639 ± 1.371
2.577AlaLys: 2.577 ± 0.434
5.155AlaLeu: 5.155 ± 1.573
0.515AlaMet: 0.515 ± 0.417
3.093AlaAsn: 3.093 ± 1.816
5.67AlaPro: 5.67 ± 1.885
3.608AlaGln: 3.608 ± 1.181
3.093AlaArg: 3.093 ± 2.503
5.67AlaSer: 5.67 ± 1.1
3.608AlaThr: 3.608 ± 1.446
5.67AlaVal: 5.67 ± 1.099
1.546AlaTrp: 1.546 ± 0.336
2.062AlaTyr: 2.062 ± 1.146
0.0AlaXaa: 0.0 ± 0.0
Cys
1.031CysAla: 1.031 ± 0.834
1.031CysCys: 1.031 ± 0.415
1.031CysAsp: 1.031 ± 0.834
0.0CysGlu: 0.0 ± 0.0
1.031CysPhe: 1.031 ± 1.308
2.062CysGly: 2.062 ± 0.675
0.515CysHis: 0.515 ± 0.536
1.546CysIle: 1.546 ± 0.691
1.031CysLys: 1.031 ± 0.415
2.577CysLeu: 2.577 ± 0.681
0.0CysMet: 0.0 ± 0.0
1.031CysAsn: 1.031 ± 0.834
1.546CysPro: 1.546 ± 1.176
1.031CysGln: 1.031 ± 0.784
0.515CysArg: 0.515 ± 0.392
2.062CysSer: 2.062 ± 1.12
1.031CysThr: 1.031 ± 0.415
0.515CysVal: 0.515 ± 0.654
0.0CysTrp: 0.0 ± 0.0
0.515CysTyr: 0.515 ± 0.417
0.0CysXaa: 0.0 ± 0.0
Asp
3.093AspAla: 3.093 ± 1.211
1.546AspCys: 1.546 ± 1.252
2.062AspAsp: 2.062 ± 1.061
1.546AspGlu: 1.546 ± 0.336
1.546AspPhe: 1.546 ± 0.523
3.608AspGly: 3.608 ± 0.808
0.0AspHis: 0.0 ± 0.0
4.124AspIle: 4.124 ± 1.13
0.0AspLys: 0.0 ± 0.0
4.124AspLeu: 4.124 ± 1.105
1.546AspMet: 1.546 ± 0.336
2.062AspAsn: 2.062 ± 0.793
5.155AspPro: 5.155 ± 2.739
2.062AspGln: 2.062 ± 0.548
4.639AspArg: 4.639 ± 1.291
4.124AspSer: 4.124 ± 1.436
1.546AspThr: 1.546 ± 0.336
3.093AspVal: 3.093 ± 1.211
1.031AspTrp: 1.031 ± 0.834
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.608GluAla: 3.608 ± 0.481
1.546GluCys: 1.546 ± 0.523
4.639GluAsp: 4.639 ± 1.447
3.093GluGlu: 3.093 ± 0.188
1.546GluPhe: 1.546 ± 0.336
3.608GluGly: 3.608 ± 0.823
1.031GluHis: 1.031 ± 0.415
3.093GluIle: 3.093 ± 0.712
3.093GluLys: 3.093 ± 1.235
5.155GluLeu: 5.155 ± 2.949
0.515GluMet: 0.515 ± 0.536
4.124GluAsn: 4.124 ± 1.103
1.546GluPro: 1.546 ± 0.981
2.577GluGln: 2.577 ± 1.574
1.031GluArg: 1.031 ± 0.53
7.216GluSer: 7.216 ± 1.791
2.577GluThr: 2.577 ± 1.279
3.608GluVal: 3.608 ± 1.717
1.546GluTrp: 1.546 ± 0.79
0.515GluTyr: 0.515 ± 0.654
0.0GluXaa: 0.0 ± 0.0
Phe
2.062PheAla: 2.062 ± 0.83
1.546PheCys: 1.546 ± 0.523
3.608PheAsp: 3.608 ± 1.184
2.577PheGlu: 2.577 ± 1.108
2.577PhePhe: 2.577 ± 0.441
2.577PheGly: 2.577 ± 0.797
1.031PheHis: 1.031 ± 0.524
1.031PheIle: 1.031 ± 0.832
4.124PheLys: 4.124 ± 0.474
3.093PheLeu: 3.093 ± 1.707
0.515PheMet: 0.515 ± 0.417
2.062PheAsn: 2.062 ± 1.083
2.062PhePro: 2.062 ± 1.286
1.546PheGln: 1.546 ± 0.709
2.062PheArg: 2.062 ± 0.83
2.062PheSer: 2.062 ± 0.661
2.062PheThr: 2.062 ± 1.89
2.062PheVal: 2.062 ± 0.793
0.515PheTrp: 0.515 ± 0.654
2.062PheTyr: 2.062 ± 0.83
0.0PheXaa: 0.0 ± 0.0
Gly
4.124GlyAla: 4.124 ± 0.788
1.546GlyCys: 1.546 ± 0.691
1.031GlyAsp: 1.031 ± 0.646
3.608GlyGlu: 3.608 ± 1.054
3.608GlyPhe: 3.608 ± 1.796
3.608GlyGly: 3.608 ± 0.76
4.639GlyHis: 4.639 ± 1.257
3.608GlyIle: 3.608 ± 0.808
4.639GlyLys: 4.639 ± 2.074
6.186GlyLeu: 6.186 ± 1.115
0.515GlyMet: 0.515 ± 0.654
2.062GlyAsn: 2.062 ± 0.83
4.639GlyPro: 4.639 ± 0.848
1.031GlyGln: 1.031 ± 0.662
4.124GlyArg: 4.124 ± 0.809
5.67GlySer: 5.67 ± 2.699
3.093GlyThr: 3.093 ± 0.758
4.639GlyVal: 4.639 ± 0.859
2.062GlyTrp: 2.062 ± 0.548
3.608GlyTyr: 3.608 ± 0.429
0.0GlyXaa: 0.0 ± 0.0
His
1.031HisAla: 1.031 ± 0.832
0.515HisCys: 0.515 ± 0.417
1.031HisAsp: 1.031 ± 0.415
1.031HisGlu: 1.031 ± 0.834
1.546HisPhe: 1.546 ± 0.755
1.031HisGly: 1.031 ± 0.834
0.515HisHis: 0.515 ± 0.654
0.515HisIle: 0.515 ± 0.392
1.031HisLys: 1.031 ± 0.415
1.031HisLeu: 1.031 ± 0.646
0.0HisMet: 0.0 ± 0.0
2.062HisAsn: 2.062 ± 0.675
3.093HisPro: 3.093 ± 0.758
1.031HisGln: 1.031 ± 0.834
1.031HisArg: 1.031 ± 0.834
0.515HisSer: 0.515 ± 0.392
1.031HisThr: 1.031 ± 0.832
1.031HisVal: 1.031 ± 1.072
0.0HisTrp: 0.0 ± 0.0
0.515HisTyr: 0.515 ± 0.654
0.0HisXaa: 0.0 ± 0.0
Ile
4.639IleAla: 4.639 ± 0.155
1.031IleCys: 1.031 ± 0.784
4.639IleAsp: 4.639 ± 1.247
2.577IleGlu: 2.577 ± 1.443
1.546IlePhe: 1.546 ± 0.691
1.546IleGly: 1.546 ± 0.985
0.515IleHis: 0.515 ± 0.417
2.577IleIle: 2.577 ± 1.484
1.546IleLys: 1.546 ± 0.336
2.577IleLeu: 2.577 ± 1.496
2.062IleMet: 2.062 ± 1.045
2.062IleAsn: 2.062 ± 0.503
5.155IlePro: 5.155 ± 1.324
1.031IleGln: 1.031 ± 0.662
1.546IleArg: 1.546 ± 0.523
3.093IleSer: 3.093 ± 1.243
4.639IleThr: 4.639 ± 1.257
2.577IleVal: 2.577 ± 1.093
0.515IleTrp: 0.515 ± 0.392
1.031IleTyr: 1.031 ± 0.415
0.0IleXaa: 0.0 ± 0.0
Lys
6.186LysAla: 6.186 ± 1.989
0.0LysCys: 0.0 ± 0.0
2.577LysAsp: 2.577 ± 0.776
3.608LysGlu: 3.608 ± 0.807
1.546LysPhe: 1.546 ± 1.252
1.546LysGly: 1.546 ± 0.755
0.0LysHis: 0.0 ± 0.0
2.577LysIle: 2.577 ± 0.681
2.062LysLys: 2.062 ± 1.568
3.093LysLeu: 3.093 ± 1.469
0.515LysMet: 0.515 ± 0.417
1.031LysAsn: 1.031 ± 0.784
1.031LysPro: 1.031 ± 0.415
2.062LysGln: 2.062 ± 1.669
5.155LysArg: 5.155 ± 0.398
7.732LysSer: 7.732 ± 2.509
5.155LysThr: 5.155 ± 1.063
1.546LysVal: 1.546 ± 0.691
0.515LysTrp: 0.515 ± 0.536
1.546LysTyr: 1.546 ± 0.981
0.0LysXaa: 0.0 ± 0.0
Leu
5.155LeuAla: 5.155 ± 1.909
2.577LeuCys: 2.577 ± 1.522
3.608LeuAsp: 3.608 ± 0.801
3.608LeuGlu: 3.608 ± 1.924
4.639LeuPhe: 4.639 ± 1.716
5.155LeuGly: 5.155 ± 0.814
2.062LeuHis: 2.062 ± 1.109
4.639LeuIle: 4.639 ± 1.258
2.577LeuLys: 2.577 ± 1.071
9.278LeuLeu: 9.278 ± 3.345
1.031LeuMet: 1.031 ± 0.784
2.062LeuAsn: 2.062 ± 0.675
6.701LeuPro: 6.701 ± 1.183
2.577LeuGln: 2.577 ± 1.574
5.67LeuArg: 5.67 ± 2.125
7.216LeuSer: 7.216 ± 1.603
6.186LeuThr: 6.186 ± 1.66
7.732LeuVal: 7.732 ± 1.623
1.546LeuTrp: 1.546 ± 0.691
1.031LeuTyr: 1.031 ± 0.415
0.0LeuXaa: 0.0 ± 0.0
Met
1.031MetAla: 1.031 ± 0.415
0.0MetCys: 0.0 ± 0.0
1.031MetAsp: 1.031 ± 0.415
2.062MetGlu: 2.062 ± 1.078
1.031MetPhe: 1.031 ± 0.832
0.515MetGly: 0.515 ± 0.417
0.0MetHis: 0.0 ± 0.0
0.515MetIle: 0.515 ± 0.417
0.515MetLys: 0.515 ± 0.392
1.546MetLeu: 1.546 ± 1.252
0.515MetMet: 0.515 ± 0.392
1.031MetAsn: 1.031 ± 0.784
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.515MetArg: 0.515 ± 0.392
2.577MetSer: 2.577 ± 0.994
0.0MetThr: 0.0 ± 0.0
1.031MetVal: 1.031 ± 0.662
0.0MetTrp: 0.0 ± 0.0
0.515MetTyr: 0.515 ± 0.392
0.0MetXaa: 0.0 ± 0.0
Asn
1.546AsnAla: 1.546 ± 0.691
1.546AsnCys: 1.546 ± 0.734
0.0AsnAsp: 0.0 ± 0.0
3.608AsnGlu: 3.608 ± 1.054
2.577AsnPhe: 2.577 ± 0.776
1.031AsnGly: 1.031 ± 0.784
0.515AsnHis: 0.515 ± 0.417
2.577AsnIle: 2.577 ± 0.836
4.124AsnLys: 4.124 ± 0.885
3.093AsnLeu: 3.093 ± 0.883
0.515AsnMet: 0.515 ± 0.357
1.031AsnAsn: 1.031 ± 0.524
1.031AsnPro: 1.031 ± 0.646
2.577AsnGln: 2.577 ± 1.093
2.062AsnArg: 2.062 ± 0.83
6.701AsnSer: 6.701 ± 1.583
3.608AsnThr: 3.608 ± 0.426
2.062AsnVal: 2.062 ± 0.677
2.062AsnTrp: 2.062 ± 0.83
2.062AsnTyr: 2.062 ± 0.503
0.0AsnXaa: 0.0 ± 0.0
Pro
4.639ProAla: 4.639 ± 2.053
1.031ProCys: 1.031 ± 0.662
4.639ProAsp: 4.639 ± 2.779
4.124ProGlu: 4.124 ± 1.094
0.515ProPhe: 0.515 ± 0.392
6.186ProGly: 6.186 ± 0.705
0.515ProHis: 0.515 ± 0.417
3.608ProIle: 3.608 ± 1.051
3.093ProLys: 3.093 ± 0.939
4.639ProLeu: 4.639 ± 1.661
1.031ProMet: 1.031 ± 0.53
3.093ProAsn: 3.093 ± 0.702
5.67ProPro: 5.67 ± 3.033
2.577ProGln: 2.577 ± 1.071
4.639ProArg: 4.639 ± 1.578
7.732ProSer: 7.732 ± 2.053
4.124ProThr: 4.124 ± 0.788
4.124ProVal: 4.124 ± 0.95
0.515ProTrp: 0.515 ± 0.536
0.515ProTyr: 0.515 ± 0.654
0.0ProXaa: 0.0 ± 0.0
Gln
3.093GlnAla: 3.093 ± 0.814
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
3.093GlnGlu: 3.093 ± 0.916
1.031GlnPhe: 1.031 ± 0.662
1.546GlnGly: 1.546 ± 0.734
1.546GlnHis: 1.546 ± 0.734
0.515GlnIle: 0.515 ± 0.536
3.093GlnLys: 3.093 ± 0.312
3.608GlnLeu: 3.608 ± 1.759
0.515GlnMet: 0.515 ± 0.518
4.639GlnAsn: 4.639 ± 0.71
2.062GlnPro: 2.062 ± 1.176
1.031GlnGln: 1.031 ± 0.662
2.577GlnArg: 2.577 ± 0.797
1.546GlnSer: 1.546 ± 0.709
0.0GlnThr: 0.0 ± 0.0
4.124GlnVal: 4.124 ± 1.075
1.546GlnTrp: 1.546 ± 0.755
1.031GlnTyr: 1.031 ± 0.415
0.0GlnXaa: 0.0 ± 0.0
Arg
4.639ArgAla: 4.639 ± 1.939
1.031ArgCys: 1.031 ± 0.834
1.546ArgAsp: 1.546 ± 0.981
2.577ArgGlu: 2.577 ± 2.086
2.062ArgPhe: 2.062 ± 1.1
5.155ArgGly: 5.155 ± 0.918
2.577ArgHis: 2.577 ± 2.086
2.577ArgIle: 2.577 ± 0.434
2.577ArgLys: 2.577 ± 0.865
6.186ArgLeu: 6.186 ± 1.966
0.515ArgMet: 0.515 ± 0.808
2.062ArgAsn: 2.062 ± 1.12
2.577ArgPro: 2.577 ± 0.827
3.093ArgGln: 3.093 ± 0.916
3.093ArgArg: 3.093 ± 1.931
4.124ArgSer: 4.124 ± 1.623
2.062ArgThr: 2.062 ± 1.045
3.608ArgVal: 3.608 ± 1.652
2.062ArgTrp: 2.062 ± 1.109
3.608ArgTyr: 3.608 ± 2.429
0.0ArgXaa: 0.0 ± 0.0
Ser
7.732SerAla: 7.732 ± 1.028
1.546SerCys: 1.546 ± 0.733
4.639SerAsp: 4.639 ± 0.783
4.639SerGlu: 4.639 ± 2.312
3.093SerPhe: 3.093 ± 1.707
9.278SerGly: 9.278 ± 0.688
0.515SerHis: 0.515 ± 0.654
3.608SerIle: 3.608 ± 2.196
7.732SerLys: 7.732 ± 1.547
8.763SerLeu: 8.763 ± 1.757
0.0SerMet: 0.0 ± 0.0
4.124SerAsn: 4.124 ± 0.788
8.247SerPro: 8.247 ± 1.805
3.093SerGln: 3.093 ± 2.353
3.608SerArg: 3.608 ± 0.894
9.794SerSer: 9.794 ± 2.32
7.216SerThr: 7.216 ± 1.016
4.639SerVal: 4.639 ± 0.749
3.608SerTrp: 3.608 ± 0.481
4.124SerTyr: 4.124 ± 1.45
0.0SerXaa: 0.0 ± 0.0
Thr
4.639ThrAla: 4.639 ± 1.876
1.031ThrCys: 1.031 ± 0.662
4.124ThrAsp: 4.124 ± 2.454
3.608ThrGlu: 3.608 ± 1.19
2.062ThrPhe: 2.062 ± 0.83
2.577ThrGly: 2.577 ± 1.496
1.031ThrHis: 1.031 ± 0.834
2.577ThrIle: 2.577 ± 1.484
1.546ThrLys: 1.546 ± 0.336
4.639ThrLeu: 4.639 ± 2.185
2.577ThrMet: 2.577 ± 0.681
3.093ThrAsn: 3.093 ± 0.673
4.639ThrPro: 4.639 ± 1.185
1.031ThrGln: 1.031 ± 0.662
5.155ThrArg: 5.155 ± 2.207
8.763ThrSer: 8.763 ± 2.184
5.155ThrThr: 5.155 ± 1.551
2.577ThrVal: 2.577 ± 0.776
0.515ThrTrp: 0.515 ± 0.417
0.515ThrTyr: 0.515 ± 0.392
0.0ThrXaa: 0.0 ± 0.0
Val
5.67ValAla: 5.67 ± 2.267
1.546ValCys: 1.546 ± 0.691
1.546ValAsp: 1.546 ± 0.985
4.124ValGlu: 4.124 ± 1.105
2.062ValPhe: 2.062 ± 0.83
7.732ValGly: 7.732 ± 0.898
0.515ValHis: 0.515 ± 0.392
2.062ValIle: 2.062 ± 1.045
2.062ValLys: 2.062 ± 0.548
4.639ValLeu: 4.639 ± 1.291
0.515ValMet: 0.515 ± 0.417
1.031ValAsn: 1.031 ± 0.53
5.155ValPro: 5.155 ± 0.482
2.577ValGln: 2.577 ± 0.914
3.608ValArg: 3.608 ± 0.823
5.67ValSer: 5.67 ± 1.527
5.67ValThr: 5.67 ± 1.797
1.546ValVal: 1.546 ± 0.523
1.031ValTrp: 1.031 ± 0.415
0.515ValTyr: 0.515 ± 0.536
0.0ValXaa: 0.0 ± 0.0
Trp
1.031TrpAla: 1.031 ± 0.53
0.0TrpCys: 0.0 ± 0.0
1.031TrpAsp: 1.031 ± 0.646
2.577TrpGlu: 2.577 ± 0.681
0.515TrpPhe: 0.515 ± 0.536
1.546TrpGly: 1.546 ± 0.523
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.031TrpLys: 1.031 ± 0.524
3.608TrpLeu: 3.608 ± 1.518
0.515TrpMet: 0.515 ± 0.417
0.515TrpAsn: 0.515 ± 0.536
0.515TrpPro: 0.515 ± 0.417
1.031TrpGln: 1.031 ± 0.415
1.546TrpArg: 1.546 ± 0.523
3.093TrpSer: 3.093 ± 0.673
1.546TrpThr: 1.546 ± 0.336
1.031TrpVal: 1.031 ± 0.415
0.0TrpTrp: 0.0 ± 0.0
0.515TrpTyr: 0.515 ± 0.536
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.031TyrAla: 1.031 ± 0.524
0.515TyrCys: 0.515 ± 0.417
1.031TyrAsp: 1.031 ± 0.524
0.515TyrGlu: 0.515 ± 0.654
2.062TyrPhe: 2.062 ± 1.325
1.546TyrGly: 1.546 ± 0.985
0.515TyrHis: 0.515 ± 0.417
0.515TyrIle: 0.515 ± 0.654
1.031TyrLys: 1.031 ± 0.415
2.062TyrLeu: 2.062 ± 0.661
0.515TyrMet: 0.515 ± 0.417
2.577TyrAsn: 2.577 ± 1.246
0.515TyrPro: 0.515 ± 0.417
1.031TyrGln: 1.031 ± 0.524
2.062TyrArg: 2.062 ± 1.201
4.639TyrSer: 4.639 ± 1.257
1.546TyrThr: 1.546 ± 1.231
1.546TyrVal: 1.546 ± 0.336
1.031TyrTrp: 1.031 ± 0.53
1.031TyrTyr: 1.031 ± 0.646
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1941 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski