Amino acid dipeptide frequency for Sidastrum golden leaf spot virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
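As a minimal sketch of how such frequencies can be computed, the Python snippet below counts overlapping residue pairs within each protein and reports them in per mille, then compares each value with the uniform expectation of 2.5‰. The function name `dipeptide_per_mille`, the one-letter sequences, and the overlapping-window counting are assumptions made for illustration; the page does not state how Proteome-pI counts dipeptides.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation for the 400 standard dipeptides, in per mille.
EXPECTED_UNIFORM = 1000.0 / 400

# Made-up toy sequences (one-letter codes), not the viral proteome.
toy_proteins = ["MSSTA", "MKSS"]
freqs = dipeptide_per_mille(toy_proteins)

for pair, value in sorted(freqs.items()):
    label = "over" if value > EXPECTED_UNIFORM else "under"
    print(pair, round(value, 3), label)
```

With only a handful of residues every observed pair exceeds 2.5‰, so the comparison becomes meaningful only for larger proteomes.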

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
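The unit conversion can be illustrated with a short Python sketch. The helper name `per_mille_to_fraction` is hypothetical; the SerSer value of 16.043‰ is taken from the table on this page, and the fold value is relative to the uniform expectation of 2.5‰.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ser_ser = 16.043  # SerSer frequency from the table, in per mille
fraction = per_mille_to_fraction(ser_ser)
percent = fraction * 100.0
fold_over_uniform = ser_ser / 2.5  # relative to 1/400 = 2.5 per mille

print(fraction, percent, fold_over_uniform)
```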

For more information, see the sequence space article on Wikipedia.

Each entry: dipeptide: frequency ± error (‰), grouped by first residue
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 6.684 ± 4.342
AlaCys: 0.0 ± 0.0
AlaAsp: 4.011 ± 2.605
AlaGlu: 1.337 ± 1.724
AlaPhe: 1.337 ± 1.724
AlaGly: 2.674 ± 2.361
AlaHis: 2.674 ± 1.572
AlaIle: 4.011 ± 2.605
AlaLys: 1.337 ± 0.868
AlaLeu: 4.011 ± 1.866
AlaMet: 1.337 ± 1.027
AlaAsn: 4.011 ± 2.605
AlaPro: 4.011 ± 2.605
AlaGln: 0.0 ± 0.0
AlaArg: 2.674 ± 1.737
AlaSer: 5.348 ± 2.721
AlaThr: 1.337 ± 0.868
AlaVal: 2.674 ± 3.447
AlaTrp: 0.0 ± 0.0
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 2.674 ± 0.816
CysPhe: 1.337 ± 0.868
CysGly: 1.337 ± 0.868
CysHis: 0.0 ± 0.0
CysIle: 4.011 ± 2.431
CysLys: 1.337 ± 1.181
CysLeu: 0.0 ± 0.0
CysMet: 1.337 ± 1.181
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 4.011 ± 1.834
CysThr: 0.0 ± 0.0
CysVal: 1.337 ± 1.181
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.337 ± 0.868
AspCys: 1.337 ± 0.868
AspAsp: 4.011 ± 2.605
AspGlu: 4.011 ± 1.202
AspPhe: 0.0 ± 0.0
AspGly: 2.674 ± 1.737
AspHis: 1.337 ± 1.724
AspIle: 2.674 ± 1.745
AspLys: 0.0 ± 0.0
AspLeu: 1.337 ± 0.868
AspMet: 0.0 ± 0.0
AspAsn: 0.0 ± 0.0
AspPro: 1.337 ± 1.181
AspGln: 0.0 ± 0.0
AspArg: 2.674 ± 1.745
AspSer: 5.348 ± 1.631
AspThr: 2.674 ± 0.816
AspVal: 2.674 ± 2.361
AspTrp: 0.0 ± 0.0
AspTyr: 1.337 ± 0.868
AspXaa: 0.0 ± 0.0

Glu
GluAla: 1.337 ± 1.724
GluCys: 0.0 ± 0.0
GluAsp: 4.011 ± 3.183
GluGlu: 1.337 ± 0.868
GluPhe: 0.0 ± 0.0
GluGly: 4.011 ± 1.202
GluHis: 0.0 ± 0.0
GluIle: 5.348 ± 0.767
GluLys: 1.337 ± 0.868
GluLeu: 1.337 ± 1.724
GluMet: 1.337 ± 0.868
GluAsn: 4.011 ± 1.031
GluPro: 1.337 ± 0.868
GluGln: 1.337 ± 1.181
GluArg: 1.337 ± 0.868
GluSer: 4.011 ± 2.605
GluThr: 1.337 ± 0.868
GluVal: 0.0 ± 0.0
GluTrp: 2.674 ± 0.816
GluTyr: 2.674 ± 1.737
GluXaa: 0.0 ± 0.0

Phe
PheAla: 0.0 ± 0.0
PheCys: 1.337 ± 1.181
PheAsp: 0.0 ± 0.0
PheGlu: 0.0 ± 0.0
PhePhe: 0.0 ± 0.0
PheGly: 2.674 ± 0.816
PheHis: 0.0 ± 0.0
PheIle: 2.674 ± 1.572
PheLys: 5.348 ± 2.721
PheLeu: 2.674 ± 1.572
PheMet: 0.0 ± 0.0
PheAsn: 2.674 ± 1.745
PhePro: 1.337 ± 0.868
PheGln: 5.348 ± 2.449
PheArg: 2.674 ± 1.572
PheSer: 2.674 ± 0.816
PheThr: 5.348 ± 2.449
PheVal: 0.0 ± 0.0
PheTrp: 1.337 ± 1.181
PheTyr: 1.337 ± 1.181
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.337 ± 0.868
GlyCys: 1.337 ± 1.181
GlyAsp: 1.337 ± 0.868
GlyGlu: 1.337 ± 1.724
GlyPhe: 4.011 ± 1.202
GlyGly: 4.011 ± 1.834
GlyHis: 1.337 ± 1.181
GlyIle: 1.337 ± 1.181
GlyLys: 6.684 ± 2.581
GlyLeu: 2.674 ± 1.572
GlyMet: 0.0 ± 0.0
GlyAsn: 1.337 ± 0.868
GlyPro: 5.348 ± 1.631
GlyGln: 1.337 ± 0.868
GlyArg: 5.348 ± 1.932
GlySer: 5.348 ± 1.932
GlyThr: 5.348 ± 3.49
GlyVal: 2.674 ± 1.572
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0

His
HisAla: 5.348 ± 2.721
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 1.337 ± 0.868
HisGly: 1.337 ± 1.181
HisHis: 0.0 ± 0.0
HisIle: 4.011 ± 1.866
HisLys: 1.337 ± 1.724
HisLeu: 0.0 ± 0.0
HisMet: 0.0 ± 0.0
HisAsn: 4.011 ± 1.866
HisPro: 0.0 ± 0.0
HisGln: 2.674 ± 1.572
HisArg: 5.348 ± 0.767
HisSer: 1.337 ± 1.724
HisThr: 1.337 ± 1.181
HisVal: 4.011 ± 1.031
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.674 ± 1.737
IleCys: 2.674 ± 2.361
IleAsp: 0.0 ± 0.0
IleGlu: 1.337 ± 0.868
IlePhe: 1.337 ± 0.868
IleGly: 0.0 ± 0.0
IleHis: 0.0 ± 0.0
IleIle: 2.674 ± 1.737
IleLys: 6.684 ± 1.274
IleLeu: 10.695 ± 1.534
IleMet: 2.674 ± 1.812
IleAsn: 0.0 ± 0.0
IlePro: 4.011 ± 1.202
IleGln: 2.674 ± 1.572
IleArg: 6.684 ± 2.571
IleSer: 9.358 ± 3.361
IleThr: 8.021 ± 2.062
IleVal: 2.674 ± 0.816
IleTrp: 2.674 ± 1.745
IleTyr: 6.684 ± 2.571
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.348 ± 0.767
LysCys: 0.0 ± 0.0
LysAsp: 0.0 ± 0.0
LysGlu: 2.674 ± 1.737
LysPhe: 2.674 ± 1.572
LysGly: 1.337 ± 0.868
LysHis: 1.337 ± 0.868
LysIle: 5.348 ± 1.563
LysLys: 2.674 ± 1.737
LysLeu: 4.011 ± 1.834
LysMet: 5.348 ± 0.767
LysAsn: 4.011 ± 1.202
LysPro: 4.011 ± 1.202
LysGln: 0.0 ± 0.0
LysArg: 5.348 ± 1.631
LysSer: 4.011 ± 1.202
LysThr: 2.674 ± 1.737
LysVal: 5.348 ± 4.723
LysTrp: 0.0 ± 0.0
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 2.674 ± 1.737
LeuCys: 0.0 ± 0.0
LeuAsp: 2.674 ± 0.816
LeuGlu: 1.337 ± 0.868
LeuPhe: 5.348 ± 0.767
LeuGly: 5.348 ± 0.767
LeuHis: 2.674 ± 1.572
LeuIle: 5.348 ± 0.767
LeuLys: 6.684 ± 1.861
LeuLeu: 6.684 ± 1.861
LeuMet: 4.011 ± 2.431
LeuAsn: 4.011 ± 1.866
LeuPro: 1.337 ± 0.868
LeuGln: 1.337 ± 1.181
LeuArg: 8.021 ± 2.062
LeuSer: 4.011 ± 2.605
LeuThr: 2.674 ± 1.572
LeuVal: 6.684 ± 4.138
LeuTrp: 0.0 ± 0.0
LeuTyr: 4.011 ± 1.866
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.337 ± 1.181
MetCys: 0.0 ± 0.0
MetAsp: 1.337 ± 1.181
MetGlu: 1.337 ± 0.868
MetPhe: 1.337 ± 1.181
MetGly: 1.337 ± 0.868
MetHis: 2.674 ± 3.447
MetIle: 2.674 ± 2.361
MetLys: 1.337 ± 0.868
MetLeu: 1.337 ± 0.868
MetMet: 1.337 ± 0.868
MetAsn: 0.0 ± 0.0
MetPro: 8.021 ± 5.309
MetGln: 2.674 ± 0.816
MetArg: 2.674 ± 3.447
MetSer: 8.021 ± 3.211
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 4.011 ± 1.031
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.674 ± 1.737
AsnCys: 0.0 ± 0.0
AsnAsp: 1.337 ± 1.181
AsnGlu: 0.0 ± 0.0
AsnPhe: 2.674 ± 1.745
AsnGly: 4.011 ± 1.031
AsnHis: 4.011 ± 1.031
AsnIle: 9.358 ± 4.433
AsnLys: 1.337 ± 1.181
AsnLeu: 4.011 ± 1.866
AsnMet: 1.337 ± 1.181
AsnAsn: 0.0 ± 0.0
AsnPro: 1.337 ± 1.724
AsnGln: 0.0 ± 0.0
AsnArg: 2.674 ± 1.745
AsnSer: 2.674 ± 0.816
AsnThr: 4.011 ± 1.202
AsnVal: 0.0 ± 0.0
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.337 ± 0.868
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.011 ± 1.202
ProCys: 2.674 ± 2.361
ProAsp: 2.674 ± 0.816
ProGlu: 2.674 ± 1.737
ProPhe: 1.337 ± 0.868
ProGly: 1.337 ± 0.868
ProHis: 0.0 ± 0.0
ProIle: 0.0 ± 0.0
ProLys: 4.011 ± 1.834
ProLeu: 5.348 ± 0.767
ProMet: 2.674 ± 0.816
ProAsn: 2.674 ± 0.816
ProPro: 2.674 ± 1.737
ProGln: 2.674 ± 1.737
ProArg: 6.684 ± 4.138
ProSer: 5.348 ± 1.631
ProThr: 0.0 ± 0.0
ProVal: 0.0 ± 0.0
ProTrp: 4.011 ± 1.834
ProTyr: 1.337 ± 0.868
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.674 ± 1.572
GlnCys: 1.337 ± 0.868
GlnAsp: 1.337 ± 0.868
GlnGlu: 1.337 ± 0.868
GlnPhe: 1.337 ± 0.868
GlnGly: 0.0 ± 0.0
GlnHis: 0.0 ± 0.0
GlnIle: 2.674 ± 1.572
GlnLys: 1.337 ± 0.868
GlnLeu: 5.348 ± 1.631
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 0.0 ± 0.0
GlnGln: 1.337 ± 0.868
GlnArg: 2.674 ± 0.816
GlnSer: 2.674 ± 0.816
GlnThr: 5.348 ± 3.474
GlnVal: 1.337 ± 1.724
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.674 ± 3.447
ArgCys: 1.337 ± 1.724
ArgAsp: 2.674 ± 2.361
ArgGlu: 0.0 ± 0.0
ArgPhe: 6.684 ± 4.939
ArgGly: 8.021 ± 2.447
ArgHis: 2.674 ± 1.572
ArgIle: 4.011 ± 1.834
ArgLys: 5.348 ± 1.631
ArgLeu: 5.348 ± 3.474
ArgMet: 4.011 ± 2.431
ArgAsn: 5.348 ± 0.767
ArgPro: 5.348 ± 2.975
ArgGln: 1.337 ± 0.868
ArgArg: 10.695 ± 1.914
ArgSer: 6.684 ± 4.138
ArgThr: 8.021 ± 1.78
ArgVal: 6.684 ± 2.382
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.011 ± 2.605
SerCys: 4.011 ± 1.202
SerAsp: 2.674 ± 0.816
SerGlu: 6.684 ± 2.732
SerPhe: 1.337 ± 0.868
SerGly: 5.348 ± 3.49
SerHis: 6.684 ± 1.274
SerIle: 8.021 ± 1.78
SerLys: 5.348 ± 1.932
SerLeu: 5.348 ± 1.563
SerMet: 6.684 ± 0.696
SerAsn: 2.674 ± 0.816
SerPro: 5.348 ± 1.631
SerGln: 2.674 ± 1.737
SerArg: 5.348 ± 1.563
SerSer: 16.043 ± 5.985
SerThr: 9.358 ± 3.098
SerVal: 4.011 ± 3.542
SerTrp: 4.011 ± 1.031
SerTyr: 2.674 ± 0.816
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.348 ± 2.449
ThrCys: 1.337 ± 1.181
ThrAsp: 5.348 ± 0.767
ThrGlu: 4.011 ± 2.605
ThrPhe: 0.0 ± 0.0
ThrGly: 4.011 ± 1.866
ThrHis: 1.337 ± 1.181
ThrIle: 1.337 ± 1.181
ThrLys: 2.674 ± 1.737
ThrLeu: 6.684 ± 1.861
ThrMet: 2.674 ± 1.745
ThrAsn: 4.011 ± 1.031
ThrPro: 4.011 ± 1.202
ThrGln: 2.674 ± 1.737
ThrArg: 6.684 ± 2.571
ThrSer: 13.369 ± 3.65
ThrThr: 6.684 ± 4.437
ThrVal: 0.0 ± 0.0
ThrTrp: 1.337 ± 0.868
ThrTyr: 1.337 ± 1.724
ThrXaa: 0.0 ± 0.0

Val
ValAla: 0.0 ± 0.0
ValCys: 0.0 ± 0.0
ValAsp: 0.0 ± 0.0
ValGlu: 2.674 ± 1.572
ValPhe: 1.337 ± 0.868
ValGly: 2.674 ± 0.816
ValHis: 1.337 ± 1.181
ValIle: 5.348 ± 2.975
ValLys: 1.337 ± 1.181
ValLeu: 4.011 ± 2.431
ValMet: 2.674 ± 0.816
ValAsn: 4.011 ± 3.542
ValPro: 2.674 ± 0.816
ValGln: 1.337 ± 1.181
ValArg: 5.348 ± 0.767
ValSer: 2.674 ± 1.745
ValThr: 1.337 ± 1.181
ValVal: 4.011 ± 1.834
ValTrp: 1.337 ± 1.724
ValTyr: 4.011 ± 3.262
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.337 ± 0.868
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.337 ± 1.724
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 1.337 ± 1.181
TrpMet: 1.337 ± 1.181
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.337 ± 1.181
TrpSer: 2.674 ± 1.745
TrpThr: 6.684 ± 1.274
TrpVal: 1.337 ± 1.181
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 0.0 ± 0.0
TyrAsp: 1.337 ± 1.181
TyrGlu: 2.674 ± 0.816
TyrPhe: 4.011 ± 1.031
TyrGly: 0.0 ± 0.0
TyrHis: 4.011 ± 1.866
TyrIle: 2.674 ± 1.745
TyrLys: 0.0 ± 0.0
TyrLeu: 2.674 ± 0.816
TyrMet: 1.337 ± 1.279
TyrAsn: 0.0 ± 0.0
TyrPro: 0.0 ± 0.0
TyrGln: 1.337 ± 0.868
TyrArg: 2.674 ± 2.361
TyrSer: 2.674 ± 3.447
TyrThr: 2.674 ± 1.737
TyrVal: 2.674 ± 1.737
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 3 proteins (749 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
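One plausible reading of protein-level bootstrapping is sketched below in Python: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is taken as the error. The exact procedure used by Proteome-pI is not described on this page, so the function names, the use of the sample standard deviation, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences (one-letter codes), not the viral proteome.
toy_proteins = ["MSSTA", "MKSS", "MAAK"]
print(bootstrap_error(toy_proteins, "SS"))
```

With only three proteins, as in this proteome, each replicate is drawn from very few distinct units, which is consistent with the errors in the table often being comparable in size to the frequencies themselves.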

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski