Amino acid dipepetide frequency for Cymbidium mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.174AlaAla: 9.174 ± 7.388
0.966AlaCys: 0.966 ± 0.518
5.794AlaAsp: 5.794 ± 2.542
3.38AlaGlu: 3.38 ± 1.41
2.897AlaPhe: 2.897 ± 1.067
3.863AlaGly: 3.863 ± 0.742
2.414AlaHis: 2.414 ± 0.884
7.726AlaIle: 7.726 ± 6.259
4.829AlaLys: 4.829 ± 1.767
9.174AlaLeu: 9.174 ± 2.101
2.414AlaMet: 2.414 ± 1.295
4.346AlaAsn: 4.346 ± 1.616
4.829AlaPro: 4.829 ± 1.715
1.449AlaGln: 1.449 ± 0.685
3.863AlaArg: 3.863 ± 1.521
4.346AlaSer: 4.346 ± 1.335
6.277AlaThr: 6.277 ± 2.776
3.863AlaVal: 3.863 ± 1.686
0.0AlaTrp: 0.0 ± 0.0
6.277AlaTyr: 6.277 ± 1.163
0.0AlaXaa: 0.0 ± 0.0
Cys
0.966CysAla: 0.966 ± 0.717
0.483CysCys: 0.483 ± 1.029
0.483CysAsp: 0.483 ± 0.259
1.449CysGlu: 1.449 ± 0.777
0.0CysPhe: 0.0 ± 0.0
0.966CysGly: 0.966 ± 0.518
0.0CysHis: 0.0 ± 0.0
0.483CysIle: 0.483 ± 0.927
0.483CysLys: 0.483 ± 0.259
2.897CysLeu: 2.897 ± 0.91
0.483CysMet: 0.483 ± 0.259
0.483CysAsn: 0.483 ± 1.029
4.346CysPro: 4.346 ± 2.204
0.966CysGln: 0.966 ± 0.802
0.483CysArg: 0.483 ± 0.259
2.414CysSer: 2.414 ± 0.894
0.483CysThr: 0.483 ± 1.415
0.966CysVal: 0.966 ± 0.802
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.829AspAla: 4.829 ± 1.977
0.483AspCys: 0.483 ± 0.259
2.414AspAsp: 2.414 ± 1.295
3.38AspGlu: 3.38 ± 1.51
2.897AspPhe: 2.897 ± 1.067
2.414AspGly: 2.414 ± 1.502
0.966AspHis: 0.966 ± 0.907
1.449AspIle: 1.449 ± 0.777
1.931AspLys: 1.931 ± 1.036
7.243AspLeu: 7.243 ± 1.443
0.966AspMet: 0.966 ± 0.518
1.931AspAsn: 1.931 ± 0.869
6.76AspPro: 6.76 ± 2.832
1.449AspGln: 1.449 ± 0.777
2.897AspArg: 2.897 ± 1.067
1.931AspSer: 1.931 ± 0.784
3.863AspThr: 3.863 ± 1.066
4.346AspVal: 4.346 ± 1.616
2.414AspTrp: 2.414 ± 0.894
1.931AspTyr: 1.931 ± 2.015
0.0AspXaa: 0.0 ± 0.0
Glu
3.38GluAla: 3.38 ± 1.813
0.966GluCys: 0.966 ± 0.518
3.38GluAsp: 3.38 ± 1.276
3.38GluGlu: 3.38 ± 0.658
0.966GluPhe: 0.966 ± 0.518
1.449GluGly: 1.449 ± 0.777
0.966GluHis: 0.966 ± 0.802
3.38GluIle: 3.38 ± 1.276
3.38GluLys: 3.38 ± 1.813
2.414GluLeu: 2.414 ± 0.894
0.966GluMet: 0.966 ± 0.649
2.897GluAsn: 2.897 ± 1.554
3.863GluPro: 3.863 ± 1.628
1.931GluGln: 1.931 ± 1.036
2.897GluArg: 2.897 ± 0.6
1.449GluSer: 1.449 ± 0.777
2.897GluThr: 2.897 ± 1.554
3.863GluVal: 3.863 ± 0.801
0.483GluTrp: 0.483 ± 0.833
2.897GluTyr: 2.897 ± 1.498
0.0GluXaa: 0.0 ± 0.0
Phe
4.829PheAla: 4.829 ± 2.227
2.414PheCys: 2.414 ± 1.981
3.38PheAsp: 3.38 ± 1.41
2.897PheGlu: 2.897 ± 1.058
0.483PhePhe: 0.483 ± 0.833
0.966PheGly: 0.966 ± 0.802
1.931PheHis: 1.931 ± 1.604
1.931PheIle: 1.931 ± 1.036
1.449PheLys: 1.449 ± 0.777
2.414PheLeu: 2.414 ± 1.295
0.483PheMet: 0.483 ± 0.259
3.38PheAsn: 3.38 ± 1.813
2.897PhePro: 2.897 ± 2.286
2.414PheGln: 2.414 ± 0.648
0.483PheArg: 0.483 ± 0.259
2.414PheSer: 2.414 ± 0.894
2.414PheThr: 2.414 ± 0.894
2.897PheVal: 2.897 ± 2.187
0.483PheTrp: 0.483 ± 0.259
0.483PheTyr: 0.483 ± 0.927
0.0PheXaa: 0.0 ± 0.0
Gly
4.346GlyAla: 4.346 ± 1.423
1.449GlyCys: 1.449 ± 0.777
4.829GlyAsp: 4.829 ± 1.305
1.931GlyGlu: 1.931 ± 0.747
2.897GlyPhe: 2.897 ± 1.271
1.931GlyGly: 1.931 ± 1.094
0.483GlyHis: 0.483 ± 0.833
1.931GlyIle: 1.931 ± 1.094
4.829GlyLys: 4.829 ± 1.208
3.38GlyLeu: 3.38 ± 3.251
0.966GlyMet: 0.966 ± 0.496
1.931GlyAsn: 1.931 ± 1.146
1.931GlyPro: 1.931 ± 1.41
1.449GlyGln: 1.449 ± 1.257
0.483GlyArg: 0.483 ± 0.259
1.931GlySer: 1.931 ± 0.784
3.863GlyThr: 3.863 ± 1.562
2.414GlyVal: 2.414 ± 1.295
0.483GlyTrp: 0.483 ± 0.259
0.966GlyTyr: 0.966 ± 0.518
0.0GlyXaa: 0.0 ± 0.0
His
1.449HisAla: 1.449 ± 0.85
0.483HisCys: 0.483 ± 0.259
0.483HisAsp: 0.483 ± 0.259
1.931HisGlu: 1.931 ± 1.036
1.931HisPhe: 1.931 ± 1.036
2.414HisGly: 2.414 ± 0.894
2.897HisHis: 2.897 ± 2.702
3.38HisIle: 3.38 ± 2.383
1.449HisLys: 1.449 ± 0.777
3.38HisLeu: 3.38 ± 1.288
0.0HisMet: 0.0 ± 0.0
1.449HisAsn: 1.449 ± 1.572
2.897HisPro: 2.897 ± 1.843
0.966HisGln: 0.966 ± 0.518
2.897HisArg: 2.897 ± 1.69
2.414HisSer: 2.414 ± 2.145
2.897HisThr: 2.897 ± 1.106
0.966HisVal: 0.966 ± 0.518
0.483HisTrp: 0.483 ± 0.259
0.966HisTyr: 0.966 ± 1.854
0.0HisXaa: 0.0 ± 0.0
Ile
4.346IleAla: 4.346 ± 1.858
1.449IleCys: 1.449 ± 0.85
1.931IleAsp: 1.931 ± 1.415
2.897IleGlu: 2.897 ± 1.554
6.277IlePhe: 6.277 ± 1.995
2.414IleGly: 2.414 ± 1.883
3.38IleHis: 3.38 ± 2.669
2.414IleIle: 2.414 ± 1.213
4.346IleLys: 4.346 ± 1.616
4.829IleLeu: 4.829 ± 3.614
0.966IleMet: 0.966 ± 0.518
4.346IleAsn: 4.346 ± 1.712
4.829IlePro: 4.829 ± 1.941
3.38IleGln: 3.38 ± 1.276
1.931IleArg: 1.931 ± 0.869
2.897IleSer: 2.897 ± 1.106
5.794IleThr: 5.794 ± 4.704
1.931IleVal: 1.931 ± 1.094
0.0IleTrp: 0.0 ± 0.0
0.966IleTyr: 0.966 ± 0.802
0.0IleXaa: 0.0 ± 0.0
Lys
6.277LysAla: 6.277 ± 1.511
0.0LysCys: 0.0 ± 0.0
3.863LysAsp: 3.863 ± 1.492
0.966LysGlu: 0.966 ± 0.518
0.966LysPhe: 0.966 ± 0.802
2.414LysGly: 2.414 ± 0.884
0.483LysHis: 0.483 ± 0.259
3.863LysIle: 3.863 ± 2.072
2.897LysLys: 2.897 ± 1.554
8.209LysLeu: 8.209 ± 2.924
1.449LysMet: 1.449 ± 1.068
1.449LysAsn: 1.449 ± 0.685
3.863LysPro: 3.863 ± 1.501
2.414LysGln: 2.414 ± 0.96
2.414LysArg: 2.414 ± 1.295
5.311LysSer: 5.311 ± 1.625
2.897LysThr: 2.897 ± 1.554
3.38LysVal: 3.38 ± 1.211
0.966LysTrp: 0.966 ± 0.907
1.931LysTyr: 1.931 ± 1.971
0.0LysXaa: 0.0 ± 0.0
Leu
9.174LeuAla: 9.174 ± 5.109
0.483LeuCys: 0.483 ± 1.029
6.277LeuAsp: 6.277 ± 2.997
4.346LeuGlu: 4.346 ± 1.171
4.829LeuPhe: 4.829 ± 1.588
6.277LeuGly: 6.277 ± 2.1
2.897LeuHis: 2.897 ± 1.058
5.311LeuIle: 5.311 ± 2.222
5.311LeuLys: 5.311 ± 1.692
7.726LeuLeu: 7.726 ± 2.606
0.483LeuMet: 0.483 ± 0.833
2.414LeuAsn: 2.414 ± 1.281
9.174LeuPro: 9.174 ± 2.061
2.414LeuGln: 2.414 ± 1.53
7.243LeuArg: 7.243 ± 2.682
7.726LeuSer: 7.726 ± 2.656
6.277LeuThr: 6.277 ± 1.71
4.829LeuVal: 4.829 ± 1.677
2.414LeuTrp: 2.414 ± 0.884
4.346LeuTyr: 4.346 ± 1.259
0.0LeuXaa: 0.0 ± 0.0
Met
1.449MetAla: 1.449 ± 0.777
0.483MetCys: 0.483 ± 0.259
0.483MetAsp: 0.483 ± 0.259
1.449MetGlu: 1.449 ± 0.749
0.483MetPhe: 0.483 ± 0.259
0.483MetGly: 0.483 ± 0.833
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.966MetLys: 0.966 ± 0.518
2.897MetLeu: 2.897 ± 1.229
0.0MetMet: 0.0 ± 0.0
1.449MetAsn: 1.449 ± 0.777
1.931MetPro: 1.931 ± 0.869
1.449MetGln: 1.449 ± 0.777
0.966MetArg: 0.966 ± 0.518
0.0MetSer: 0.0 ± 0.0
1.449MetThr: 1.449 ± 0.777
0.483MetVal: 0.483 ± 0.259
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.897AsnAla: 2.897 ± 1.067
1.449AsnCys: 1.449 ± 1.212
3.38AsnAsp: 3.38 ± 0.658
0.966AsnGlu: 0.966 ± 0.518
0.966AsnPhe: 0.966 ± 0.802
0.966AsnGly: 0.966 ± 0.717
1.449AsnHis: 1.449 ± 0.777
4.829AsnIle: 4.829 ± 2.084
4.829AsnLys: 4.829 ± 1.381
4.829AsnLeu: 4.829 ± 2.109
0.483AsnMet: 0.483 ± 0.259
1.931AsnAsn: 1.931 ± 2.073
1.449AsnPro: 1.449 ± 0.777
0.966AsnGln: 0.966 ± 0.518
1.931AsnArg: 1.931 ± 0.783
1.931AsnSer: 1.931 ± 1.036
2.897AsnThr: 2.897 ± 1.009
1.449AsnVal: 1.449 ± 0.685
0.483AsnTrp: 0.483 ± 0.833
3.38AsnTyr: 3.38 ± 0.92
0.0AsnXaa: 0.0 ± 0.0
Pro
6.76ProAla: 6.76 ± 4.315
1.449ProCys: 1.449 ± 0.85
5.794ProAsp: 5.794 ± 1.927
4.829ProGlu: 4.829 ± 1.463
2.414ProPhe: 2.414 ± 1.281
4.346ProGly: 4.346 ± 2.053
2.414ProHis: 2.414 ± 2.511
2.897ProIle: 2.897 ± 0.91
5.794ProLys: 5.794 ± 0.987
6.76ProLeu: 6.76 ± 3.967
1.449ProMet: 1.449 ± 0.796
0.966ProAsn: 0.966 ± 0.518
7.243ProPro: 7.243 ± 3.855
0.483ProGln: 0.483 ± 0.259
1.449ProArg: 1.449 ± 0.972
4.829ProSer: 4.829 ± 1.677
6.277ProThr: 6.277 ± 3.98
4.829ProVal: 4.829 ± 1.208
1.449ProTrp: 1.449 ± 1.212
1.449ProTyr: 1.449 ± 0.777
0.0ProXaa: 0.0 ± 0.0
Gln
0.966GlnAla: 0.966 ± 0.518
0.966GlnCys: 0.966 ± 1.854
1.449GlnAsp: 1.449 ± 0.777
1.449GlnGlu: 1.449 ± 0.685
1.449GlnPhe: 1.449 ± 0.685
1.931GlnGly: 1.931 ± 0.783
2.414GlnHis: 2.414 ± 1.295
2.897GlnIle: 2.897 ± 1.009
0.966GlnLys: 0.966 ± 0.518
6.277GlnLeu: 6.277 ± 1.921
0.0GlnMet: 0.0 ± 0.0
1.449GlnAsn: 1.449 ± 0.685
2.414GlnPro: 2.414 ± 1.53
1.931GlnGln: 1.931 ± 1.604
1.449GlnArg: 1.449 ± 1.533
1.449GlnSer: 1.449 ± 0.777
2.897GlnThr: 2.897 ± 1.554
2.414GlnVal: 2.414 ± 1.53
0.966GlnTrp: 0.966 ± 0.518
0.483GlnTyr: 0.483 ± 1.029
0.0GlnXaa: 0.0 ± 0.0
Arg
4.829ArgAla: 4.829 ± 2.087
0.966ArgCys: 0.966 ± 0.802
2.414ArgAsp: 2.414 ± 1.295
3.863ArgGlu: 3.863 ± 1.501
1.931ArgPhe: 1.931 ± 1.507
1.449ArgGly: 1.449 ± 1.212
0.966ArgHis: 0.966 ± 0.518
2.414ArgIle: 2.414 ± 1.281
0.483ArgLys: 0.483 ± 0.259
2.897ArgLeu: 2.897 ± 1.058
1.449ArgMet: 1.449 ± 0.777
1.931ArgAsn: 1.931 ± 1.185
1.449ArgPro: 1.449 ± 0.972
4.346ArgGln: 4.346 ± 1.047
2.897ArgArg: 2.897 ± 1.683
2.414ArgSer: 2.414 ± 1.53
2.414ArgThr: 2.414 ± 1.53
0.966ArgVal: 0.966 ± 0.518
0.0ArgTrp: 0.0 ± 0.0
3.863ArgTyr: 3.863 ± 0.999
0.0ArgXaa: 0.0 ± 0.0
Ser
4.829SerAla: 4.829 ± 1.793
1.449SerCys: 1.449 ± 0.749
1.931SerAsp: 1.931 ± 0.783
3.38SerGlu: 3.38 ± 1.813
3.863SerPhe: 3.863 ± 1.47
2.897SerGly: 2.897 ± 2.397
2.897SerHis: 2.897 ± 1.69
4.829SerIle: 4.829 ± 1.259
3.38SerLys: 3.38 ± 1.036
5.311SerLeu: 5.311 ± 4.46
0.483SerMet: 0.483 ± 0.259
1.449SerAsn: 1.449 ± 0.749
2.414SerPro: 2.414 ± 1.575
2.414SerGln: 2.414 ± 0.894
3.38SerArg: 3.38 ± 1.51
6.76SerSer: 6.76 ± 3.516
6.277SerThr: 6.277 ± 1.511
2.414SerVal: 2.414 ± 0.894
0.0SerTrp: 0.0 ± 0.0
3.863SerTyr: 3.863 ± 1.398
0.0SerXaa: 0.0 ± 0.0
Thr
5.794ThrAla: 5.794 ± 1.698
0.483ThrCys: 0.483 ± 0.259
2.414ThrAsp: 2.414 ± 0.648
2.414ThrGlu: 2.414 ± 0.884
3.863ThrPhe: 3.863 ± 0.999
2.897ThrGly: 2.897 ± 1.554
5.311ThrHis: 5.311 ± 1.503
4.829ThrIle: 4.829 ± 3.956
2.897ThrLys: 2.897 ± 1.683
7.726ThrLeu: 7.726 ± 1.601
1.449ThrMet: 1.449 ± 0.777
4.346ThrAsn: 4.346 ± 2.295
4.829ThrPro: 4.829 ± 2.195
1.931ThrGln: 1.931 ± 0.747
2.414ThrArg: 2.414 ± 1.3
6.76ThrSer: 6.76 ± 1.016
2.414ThrThr: 2.414 ± 3.325
5.311ThrVal: 5.311 ± 0.817
0.0ThrTrp: 0.0 ± 0.0
1.931ThrTyr: 1.931 ± 1.146
0.0ThrXaa: 0.0 ± 0.0
Val
4.346ValAla: 4.346 ± 2.343
0.966ValCys: 0.966 ± 0.518
3.38ValAsp: 3.38 ± 2.303
2.414ValGlu: 2.414 ± 1.295
0.966ValPhe: 0.966 ± 1.758
2.414ValGly: 2.414 ± 1.434
0.483ValHis: 0.483 ± 0.927
2.897ValIle: 2.897 ± 1.106
2.897ValLys: 2.897 ± 1.554
5.794ValLeu: 5.794 ± 1.701
1.449ValMet: 1.449 ± 0.623
2.897ValAsn: 2.897 ± 1.067
4.346ValPro: 4.346 ± 1.435
1.449ValGln: 1.449 ± 0.777
2.897ValArg: 2.897 ± 0.6
2.897ValSer: 2.897 ± 1.69
4.346ValThr: 4.346 ± 2.055
3.38ValVal: 3.38 ± 1.705
0.483ValTrp: 0.483 ± 0.833
1.931ValTyr: 1.931 ± 0.783
0.0ValXaa: 0.0 ± 0.0
Trp
0.966TrpAla: 0.966 ± 0.717
0.0TrpCys: 0.0 ± 0.0
0.483TrpAsp: 0.483 ± 0.259
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.483TrpIle: 0.483 ± 0.259
0.483TrpLys: 0.483 ± 0.259
2.897TrpLeu: 2.897 ± 1.106
0.0TrpMet: 0.0 ± 0.0
0.966TrpAsn: 0.966 ± 0.717
0.0TrpPro: 0.0 ± 0.0
0.966TrpGln: 0.966 ± 0.717
0.483TrpArg: 0.483 ± 1.415
0.483TrpSer: 0.483 ± 0.259
1.449TrpThr: 1.449 ± 0.777
1.449TrpVal: 1.449 ± 0.972
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
6.277TyrAla: 6.277 ± 1.7
1.449TyrCys: 1.449 ± 1.257
1.931TyrAsp: 1.931 ± 1.036
0.483TyrGlu: 0.483 ± 0.259
0.966TyrPhe: 0.966 ± 0.518
1.931TyrGly: 1.931 ± 0.784
3.38TyrHis: 3.38 ± 2.165
3.38TyrIle: 3.38 ± 1.386
2.414TyrLys: 2.414 ± 1.987
2.897TyrLeu: 2.897 ± 1.058
0.0TyrMet: 0.0 ± 0.0
1.449TyrAsn: 1.449 ± 1.714
2.897TyrPro: 2.897 ± 2.515
1.449TyrGln: 1.449 ± 0.749
0.483TyrArg: 0.483 ± 0.259
3.863TyrSer: 3.863 ± 2.491
1.931TyrThr: 1.931 ± 1.036
0.483TyrVal: 0.483 ± 0.259
0.0TyrTrp: 0.0 ± 0.0
0.966TyrTyr: 0.966 ± 1.186
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2072 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski