Amino acid dipeptide frequency for Cacao swollen shoot Ghana R virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
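As a rough illustration of what a dipeptide frequency is, the following Python sketch counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, and the choice not to count pairs across protein boundaries are assumptions made for this example; the page does not state the exact counting convention used by the database.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    `proteins` is an iterable of sequences in one-letter code.
    Hypothetical helper: the counting convention is an assumption.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 over each sequence; pairs never
        # span two different proteins in this sketch.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Express each count as a share of all observed dipeptides, in per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome described here.
    for pair, freq in sorted(dipeptide_frequencies(["MKKLA", "MKA"]).items()):
        print(pair, round(freq, 3))
```

Sequences shorter than two residues contribute nothing, and an empty input yields an empty dictionary rather than a division by zero.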

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
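The arithmetic behind the expected value can be sketched in a few lines of Python. The helper names `expected_per_mille` and `classify` are hypothetical, and the sketch assumes that the colouring simply compares each value with the uniform baseline and ignores the error estimate; the page does not spell out the exact rule.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille.

    20 residues give 400 dipeptides; 21 (with Xaa) give 441.
    """
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, alphabet_size=20):
    """Label a frequency relative to the uniform baseline (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"   # would be shown in red
    if freq_per_mille < expected:
        return "under"  # would be shown in blue
    return "expected"


if __name__ == "__main__":
    print(expected_per_mille(20))   # 400 possible dipeptides
    print(expected_per_mille(21))   # 441 possible dipeptides
    print(classify(13.028))         # GluGlu value from the table
    print(classify(0.449))          # AlaCys value from the table
```

With the 21-letter alphabet the baseline drops slightly, from 2.5‰ to roughly 2.27‰, so values between the two baselines change label depending on which alphabet is assumed.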

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.043 ± 2.486
AlaCys: 0.449 ± 0.231
AlaAsp: 1.797 ± 0.925
AlaGlu: 6.739 ± 3.226
AlaPhe: 3.594 ± 2.564
AlaGly: 3.594 ± 1.459
AlaHis: 0.449 ± 0.231
AlaIle: 3.594 ± 1.265
AlaLys: 4.043 ± 2.061
AlaLeu: 3.145 ± 2.739
AlaMet: 2.246 ± 1.02
AlaAsn: 1.348 ± 3.037
AlaPro: 1.348 ± 0.694
AlaGln: 1.797 ± 1.282
AlaArg: 3.145 ± 1.619
AlaSer: 2.695 ± 1.038
AlaThr: 4.492 ± 1.514
AlaVal: 3.145 ± 1.619
AlaTrp: 1.348 ± 0.694
AlaTyr: 3.594 ± 2.752
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.449 ± 0.231
CysCys: 0.449 ± 0.231
CysAsp: 0.449 ± 1.401
CysGlu: 0.898 ± 1.253
CysPhe: 1.797 ± 0.925
CysGly: 0.898 ± 0.462
CysHis: 0.449 ± 0.231
CysIle: 0.898 ± 1.517
CysLys: 2.246 ± 1.156
CysLeu: 0.898 ± 1.253
CysMet: 0.449 ± 0.231
CysAsn: 1.797 ± 1.054
CysPro: 0.449 ± 0.231
CysGln: 1.348 ± 0.694
CysArg: 0.898 ± 0.462
CysSer: 1.348 ± 0.694
CysThr: 0.0 ± 0.0
CysVal: 0.449 ± 0.231
CysTrp: 0.0 ± 0.0
CysTyr: 0.898 ± 0.462
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.145 ± 1.105
AspCys: 0.898 ± 1.253
AspAsp: 5.391 ± 1.882
AspGlu: 4.942 ± 1.693
AspPhe: 3.145 ± 1.189
AspGly: 2.695 ± 1.387
AspHis: 0.898 ± 0.462
AspIle: 2.246 ± 1.156
AspLys: 1.348 ± 1.385
AspLeu: 4.043 ± 1.854
AspMet: 0.449 ± 0.231
AspAsn: 5.391 ± 1.162
AspPro: 2.246 ± 1.214
AspGln: 4.043 ± 2.081
AspArg: 4.043 ± 2.061
AspSer: 0.449 ± 0.231
AspThr: 2.246 ± 1.156
AspVal: 1.348 ± 1.135
AspTrp: 0.449 ± 0.231
AspTyr: 2.695 ± 1.387
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.289 ± 3.56
GluCys: 0.449 ± 0.231
GluAsp: 5.391 ± 1.882
GluGlu: 13.028 ± 5.51
GluPhe: 1.797 ± 0.925
GluGly: 6.739 ± 1.977
GluHis: 1.797 ± 1.224
GluIle: 5.391 ± 1.025
GluLys: 8.535 ± 5.073
GluLeu: 3.594 ± 3.575
GluMet: 0.898 ± 0.462
GluAsn: 3.145 ± 2.177
GluPro: 2.695 ± 1.156
GluGln: 4.492 ± 1.11
GluArg: 5.391 ± 4.69
GluSer: 3.594 ± 1.265
GluThr: 3.145 ± 1.619
GluVal: 5.391 ± 1.025
GluTrp: 0.898 ± 0.462
GluTyr: 2.246 ± 1.02
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.797 ± 0.925
PheCys: 1.797 ± 0.925
PheAsp: 1.797 ± 0.925
PheGlu: 1.797 ± 1.224
PhePhe: 0.449 ± 0.231
PheGly: 0.898 ± 0.462
PheHis: 1.348 ± 0.694
PheIle: 4.043 ± 2.081
PheLys: 1.348 ± 1.385
PheLeu: 3.594 ± 1.268
PheMet: 0.0 ± 0.0
PheAsn: 2.246 ± 1.747
PhePro: 1.348 ± 0.694
PheGln: 0.898 ± 0.462
PheArg: 3.145 ± 2.919
PheSer: 1.797 ± 0.925
PheThr: 1.348 ± 0.694
PheVal: 1.348 ± 0.694
PheTrp: 0.449 ± 0.231
PheTyr: 0.898 ± 1.446
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.145 ± 1.619
GlyCys: 1.348 ± 0.694
GlyAsp: 2.246 ± 1.02
GlyGlu: 8.086 ± 2.705
GlyPhe: 1.797 ± 1.282
GlyGly: 2.246 ± 1.02
GlyHis: 0.449 ± 0.231
GlyIle: 4.492 ± 1.514
GlyLys: 4.492 ± 2.312
GlyLeu: 3.594 ± 2.564
GlyMet: 2.246 ± 1.534
GlyAsn: 3.145 ± 3.258
GlyPro: 1.797 ± 1.282
GlyGln: 1.797 ± 0.925
GlyArg: 6.739 ± 2.38
GlySer: 2.695 ± 1.742
GlyThr: 0.898 ± 0.462
GlyVal: 4.043 ± 2.081
GlyTrp: 1.348 ± 1.385
GlyTyr: 3.145 ± 1.619
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.797 ± 1.054
HisCys: 1.348 ± 0.694
HisAsp: 0.898 ± 1.517
HisGlu: 1.348 ± 1.135
HisPhe: 1.348 ± 0.694
HisGly: 0.898 ± 0.462
HisHis: 0.449 ± 0.231
HisIle: 1.348 ± 0.694
HisLys: 0.449 ± 0.231
HisLeu: 1.348 ± 3.037
HisMet: 0.449 ± 0.231
HisAsn: 0.449 ± 1.671
HisPro: 0.449 ± 0.231
HisGln: 2.246 ± 1.168
HisArg: 1.797 ± 0.925
HisSer: 0.449 ± 0.231
HisThr: 0.0 ± 0.0
HisVal: 1.348 ± 0.694
HisTrp: 0.449 ± 0.231
HisTyr: 1.797 ± 0.925
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.594 ± 1.265
IleCys: 0.898 ± 0.462
IleAsp: 4.942 ± 2.544
IleGlu: 5.84 ± 1.059
IlePhe: 2.695 ± 1.387
IleGly: 4.043 ± 1.355
IleHis: 1.797 ± 0.925
IleIle: 5.84 ± 2.119
IleLys: 4.043 ± 1.352
IleLeu: 5.84 ± 1.059
IleMet: 0.0 ± 0.0
IleAsn: 1.797 ± 0.925
IlePro: 4.942 ± 1.693
IleGln: 7.188 ± 1.869
IleArg: 4.043 ± 1.352
IleSer: 5.84 ± 2.211
IleThr: 4.043 ± 1.172
IleVal: 3.594 ± 1.265
IleTrp: 0.0 ± 0.0
IleTyr: 0.898 ± 0.462
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.043 ± 1.355
LysCys: 1.797 ± 0.925
LysAsp: 4.043 ± 7.136
LysGlu: 7.188 ± 3.137
LysPhe: 3.145 ± 1.105
LysGly: 3.145 ± 2.659
LysHis: 0.449 ± 1.596
LysIle: 2.695 ± 1.387
LysLys: 6.289 ± 3.037
LysLeu: 6.289 ± 5.07
LysMet: 2.246 ± 0.959
LysAsn: 3.594 ± 1.265
LysPro: 4.492 ± 1.494
LysGln: 4.492 ± 1.105
LysArg: 3.145 ± 1.413
LysSer: 4.043 ± 2.486
LysThr: 3.594 ± 1.357
LysVal: 3.594 ± 4.76
LysTrp: 0.898 ± 0.462
LysTyr: 0.898 ± 0.462
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.145 ± 1.413
LeuCys: 1.797 ± 2.507
LeuAsp: 2.695 ± 2.639
LeuGlu: 5.84 ± 2.494
LeuPhe: 0.449 ± 0.231
LeuGly: 4.492 ± 1.514
LeuHis: 1.348 ± 1.385
LeuIle: 4.942 ± 4.257
LeuLys: 8.086 ± 3.91
LeuLeu: 6.739 ± 5.684
LeuMet: 0.898 ± 1.517
LeuAsn: 3.594 ± 4.076
LeuPro: 4.043 ± 2.061
LeuGln: 3.594 ± 1.278
LeuArg: 4.043 ± 1.355
LeuSer: 6.739 ± 5.658
LeuThr: 4.942 ± 1.673
LeuVal: 4.942 ± 2.386
LeuTrp: 0.898 ± 0.462
LeuTyr: 3.145 ± 1.207
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.348 ± 1.135
MetCys: 0.449 ± 0.231
MetAsp: 0.449 ± 0.231
MetGlu: 2.695 ± 1.038
MetPhe: 0.898 ± 1.253
MetGly: 0.898 ± 1.517
MetHis: 0.898 ± 0.462
MetIle: 0.449 ± 0.231
MetLys: 2.695 ± 1.387
MetLeu: 0.898 ± 0.462
MetMet: 1.797 ± 0.925
MetAsn: 1.348 ± 0.694
MetPro: 0.449 ± 0.231
MetGln: 1.797 ± 0.925
MetArg: 2.246 ± 1.156
MetSer: 1.348 ± 2.288
MetThr: 0.898 ± 0.462
MetVal: 1.348 ± 0.694
MetTrp: 0.449 ± 0.231
MetTyr: 0.898 ± 0.462
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.695 ± 1.742
AsnCys: 0.449 ± 0.231
AsnAsp: 4.492 ± 2.312
AsnGlu: 0.898 ± 0.462
AsnPhe: 1.797 ± 0.925
AsnGly: 0.449 ± 0.231
AsnHis: 1.348 ± 0.694
AsnIle: 2.246 ± 1.168
AsnLys: 1.348 ± 0.694
AsnLeu: 8.535 ± 9.289
AsnMet: 1.348 ± 0.694
AsnAsn: 2.246 ± 2.759
AsnPro: 2.695 ± 1.188
AsnGln: 2.246 ± 1.747
AsnArg: 1.797 ± 1.224
AsnSer: 3.145 ± 1.59
AsnThr: 2.695 ± 1.156
AsnVal: 1.348 ± 0.694
AsnTrp: 1.348 ± 1.32
AsnTyr: 2.246 ± 1.747
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.145 ± 1.619
ProCys: 0.0 ± 0.0
ProAsp: 3.594 ± 1.85
ProGlu: 2.246 ± 2.897
ProPhe: 1.348 ± 0.694
ProGly: 3.594 ± 1.213
ProHis: 0.449 ± 0.231
ProIle: 4.043 ± 2.081
ProLys: 2.246 ± 3.444
ProLeu: 3.594 ± 2.107
ProMet: 1.348 ± 0.694
ProAsn: 0.898 ± 0.462
ProPro: 2.695 ± 1.387
ProGln: 3.145 ± 1.619
ProArg: 1.797 ± 0.925
ProSer: 4.043 ± 2.081
ProThr: 2.695 ± 1.742
ProVal: 1.348 ± 0.694
ProTrp: 0.898 ± 0.462
ProTyr: 2.246 ± 1.214
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.145 ± 1.413
GlnCys: 1.348 ± 0.694
GlnAsp: 2.695 ± 1.387
GlnGlu: 4.942 ± 2.386
GlnPhe: 0.898 ± 2.33
GlnGly: 4.492 ± 2.312
GlnHis: 2.695 ± 1.742
GlnIle: 5.391 ± 1.847
GlnLys: 2.246 ± 1.156
GlnLeu: 5.391 ± 4.083
GlnMet: 0.449 ± 0.231
GlnAsn: 3.594 ± 2.88
GlnPro: 4.492 ± 1.514
GlnGln: 7.188 ± 3.7
GlnArg: 2.695 ± 1.387
GlnSer: 2.695 ± 1.387
GlnThr: 2.695 ± 1.038
GlnVal: 2.695 ± 1.572
GlnTrp: 0.449 ± 0.231
GlnTyr: 2.695 ± 1.387
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.246 ± 2.38
ArgCys: 0.449 ± 0.231
ArgAsp: 2.695 ± 1.156
ArgGlu: 4.942 ± 1.043
ArgPhe: 1.797 ± 1.224
ArgGly: 2.695 ± 1.387
ArgHis: 0.898 ± 0.462
ArgIle: 8.086 ± 1.815
ArgLys: 2.246 ± 1.214
ArgLeu: 4.492 ± 1.514
ArgMet: 2.695 ± 1.387
ArgAsn: 3.594 ± 1.268
ArgPro: 2.246 ± 1.156
ArgGln: 1.797 ± 1.054
ArgArg: 6.289 ± 2.285
ArgSer: 7.637 ± 1.775
ArgThr: 3.594 ± 1.213
ArgVal: 2.695 ± 2.639
ArgTrp: 1.348 ± 0.694
ArgTyr: 1.797 ± 1.054
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.246 ± 1.214
SerCys: 0.898 ± 1.517
SerAsp: 2.246 ± 1.156
SerGlu: 4.942 ± 2.386
SerPhe: 2.246 ± 1.156
SerGly: 5.391 ± 2.377
SerHis: 1.797 ± 1.054
SerIle: 4.492 ± 1.11
SerLys: 6.739 ± 4.714
SerLeu: 5.391 ± 2.311
SerMet: 1.797 ± 0.925
SerAsn: 1.348 ± 0.694
SerPro: 1.348 ± 1.135
SerGln: 4.942 ± 2.391
SerArg: 4.492 ± 1.514
SerSer: 6.739 ± 1.437
SerThr: 4.942 ± 2.544
SerVal: 3.594 ± 1.85
SerTrp: 0.449 ± 0.231
SerTyr: 0.898 ± 1.517
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.695 ± 1.038
ThrCys: 1.797 ± 0.925
ThrAsp: 2.246 ± 1.156
ThrGlu: 2.246 ± 1.168
ThrPhe: 1.348 ± 0.694
ThrGly: 5.84 ± 1.993
ThrHis: 1.348 ± 0.694
ThrIle: 5.84 ± 2.081
ThrLys: 4.942 ± 4.035
ThrLeu: 1.348 ± 0.694
ThrMet: 1.797 ± 0.925
ThrAsn: 1.348 ± 1.32
ThrPro: 1.797 ± 0.925
ThrGln: 1.797 ± 1.224
ThrArg: 3.145 ± 1.619
ThrSer: 4.942 ± 2.544
ThrThr: 1.797 ± 0.925
ThrVal: 1.797 ± 1.933
ThrTrp: 0.898 ± 0.462
ThrTyr: 1.348 ± 0.694
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.942 ± 1.693
ValCys: 0.449 ± 1.401
ValAsp: 2.695 ± 2.771
ValGlu: 2.695 ± 3.176
ValPhe: 1.348 ± 0.694
ValGly: 4.492 ± 1.514
ValHis: 0.449 ± 0.231
ValIle: 3.145 ± 1.207
ValLys: 2.246 ± 1.91
ValLeu: 1.348 ± 1.385
ValMet: 1.348 ± 0.694
ValAsn: 0.898 ± 0.462
ValPro: 4.492 ± 1.494
ValGln: 4.492 ± 2.336
ValArg: 3.145 ± 1.189
ValSer: 2.695 ± 2.639
ValThr: 4.492 ± 2.336
ValVal: 1.797 ± 1.054
ValTrp: 0.0 ± 0.0
ValTyr: 1.797 ± 0.925
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.449 ± 0.231
TrpGlu: 1.797 ± 0.925
TrpPhe: 0.0 ± 0.0
TrpGly: 1.797 ± 1.054
TrpHis: 0.0 ± 0.0
TrpIle: 0.449 ± 0.231
TrpLys: 0.898 ± 0.462
TrpLeu: 2.695 ± 1.188
TrpMet: 0.449 ± 0.231
TrpAsn: 0.449 ± 0.231
TrpPro: 0.0 ± 0.0
TrpGln: 1.348 ± 0.694
TrpArg: 0.898 ± 0.462
TrpSer: 0.898 ± 0.462
TrpThr: 0.898 ± 0.462
TrpVal: 1.348 ± 1.32
TrpTrp: 0.449 ± 0.231
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.695 ± 1.188
TyrCys: 0.0 ± 0.0
TyrAsp: 0.898 ± 0.462
TyrGlu: 1.797 ± 1.224
TyrPhe: 0.449 ± 0.231
TyrGly: 0.449 ± 0.231
TyrHis: 1.348 ± 1.32
TyrIle: 2.246 ± 1.156
TyrLys: 4.043 ± 1.355
TyrLeu: 3.594 ± 1.85
TyrMet: 0.898 ± 1.03
TyrAsn: 3.145 ± 1.619
TyrPro: 1.348 ± 0.694
TyrGln: 2.246 ± 1.91
TyrArg: 1.348 ± 0.694
TyrSer: 3.145 ± 1.619
TyrThr: 0.898 ± 0.462
TyrVal: 1.797 ± 2.09
TyrTrp: 1.797 ± 1.054
TyrTyr: 0.898 ± 0.462
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2227 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
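A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is only an illustration under stated assumptions. The function name `bootstrap_dipeptide_error`, the fixed seed, the overlapping-pair counting, and the use of the population standard deviation as the error measure are all choices made for the example and are not confirmed by the page.

```python
import random
from collections import Counter
from statistics import pstdev


def bootstrap_dipeptide_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the spread of one dipeptide frequency (in per mille).

    Hypothetical helper: resamples whole proteins, not single residues.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[pair] / total if total else 0.0)
    # Assumed error measure: standard deviation over the resamples.
    return pstdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome described here.
    toy = ["MKKLAEE", "MEEKLA", "MAAKEE", "MKEEL"]
    print(round(bootstrap_dipeptide_error(toy, "EE"), 3))
```

With only four proteins, as in this proteome, individual resamples can differ a lot in composition, which may be why some errors in the table exceed the frequency itself.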

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski