Amino acid dipeptide frequency for Discula destructiva virus 2

Apart from single amino acid frequencies, one can also calculate the amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
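
As a rough illustration of the idea, the sketch below counts overlapping adjacent residue pairs in a list of protein sequences and converts the counts to per mille values. It assumes one-letter amino acid codes, folds any non-standard character into X (Xaa), and normalizes by the total number of pairs; the function names are hypothetical and the exact normalization used by the database is not stated on this page.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is folded into "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs never span two proteins: each sequence is scanned on its own.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts

def dipeptide_per_mille(proteins):
    """Return frequencies in per mille for all 441 possible ordered pairs."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(alphabet, repeat=2)}
```

For example, `dipeptide_per_mille(["AAC", "AC"])` sees three pairs in total (AA, AC, AC), so AC accounts for two thirds of them.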

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
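
The uniform expectation and the over/under labelling can be sketched as follows. This is only an illustration: the helper names are hypothetical, and whether the page colors cells against the 400-pair or the 441-pair expectation is an assumption left open here.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for one ordered pair, in per mille."""
    return 1000.0 / (n_residue_types ** 2)

def classify(observed_per_mille, expected):
    """Label a value relative to the uniform expectation."""
    if observed_per_mille > expected:
        return "over"    # shown in red in the table
    if observed_per_mille < expected:
        return "under"   # shown in blue in the table
    return "as expected"
```

With 20 residue types the expectation is 2.5‰, so a value such as SerSer at 12.245‰ would be labelled "over" and AlaCys at 1.02‰ "under".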

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.163AlaAla: 8.163 ± 6.437
1.02AlaCys: 1.02 ± 0.805
2.041AlaAsp: 2.041 ± 0.145
6.122AlaGlu: 6.122 ± 0.435
8.163AlaPhe: 8.163 ± 0.58
4.082AlaGly: 4.082 ± 0.29
1.02AlaHis: 1.02 ± 0.805
3.061AlaIle: 3.061 ± 0.95
5.102AlaLys: 5.102 ± 1.834
5.102AlaLeu: 5.102 ± 1.095
4.082AlaMet: 4.082 ± 0.377
1.02AlaAsn: 1.02 ± 0.805
6.122AlaPro: 6.122 ± 1.899
1.02AlaGln: 1.02 ± 0.66
4.082AlaArg: 4.082 ± 1.754
4.082AlaSer: 4.082 ± 3.219
3.061AlaThr: 3.061 ± 2.414
4.082AlaVal: 4.082 ± 3.219
2.041AlaTrp: 2.041 ± 0.145
1.02AlaTyr: 1.02 ± 0.66
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.02CysCys: 1.02 ± 0.66
0.0CysAsp: 0.0 ± 0.0
2.041CysGlu: 2.041 ± 1.319
1.02CysPhe: 1.02 ± 0.805
1.02CysGly: 1.02 ± 0.66
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
3.061CysLeu: 3.061 ± 0.515
0.0CysMet: 0.0 ± 0.0
1.02CysAsn: 1.02 ± 0.66
0.0CysPro: 0.0 ± 0.0
1.02CysGln: 1.02 ± 0.66
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
2.041CysVal: 2.041 ± 1.319
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.082AspAla: 4.082 ± 0.29
2.041AspCys: 2.041 ± 1.319
5.102AspAsp: 5.102 ± 1.834
2.041AspGlu: 2.041 ± 0.145
4.082AspPhe: 4.082 ± 1.754
5.102AspGly: 5.102 ± 1.095
0.0AspHis: 0.0 ± 0.0
3.061AspIle: 3.061 ± 0.515
1.02AspLys: 1.02 ± 0.66
2.041AspLeu: 2.041 ± 1.319
0.0AspMet: 0.0 ± 0.0
1.02AspAsn: 1.02 ± 0.805
7.143AspPro: 7.143 ± 3.154
3.061AspGln: 3.061 ± 0.515
4.082AspArg: 4.082 ± 0.29
4.082AspSer: 4.082 ± 1.174
3.061AspThr: 3.061 ± 0.515
3.061AspVal: 3.061 ± 1.979
2.041AspTrp: 2.041 ± 0.145
4.082AspTyr: 4.082 ± 1.174
0.0AspXaa: 0.0 ± 0.0
Glu
2.041GluAla: 2.041 ± 0.145
0.0GluCys: 0.0 ± 0.0
6.122GluAsp: 6.122 ± 1.03
3.061GluGlu: 3.061 ± 1.979
5.102GluPhe: 5.102 ± 1.095
7.143GluGly: 7.143 ± 3.154
1.02GluHis: 1.02 ± 0.805
4.082GluIle: 4.082 ± 1.174
3.061GluLys: 3.061 ± 0.515
5.102GluLeu: 5.102 ± 2.559
1.02GluMet: 1.02 ± 0.66
1.02GluAsn: 1.02 ± 0.805
4.082GluPro: 4.082 ± 0.29
2.041GluGln: 2.041 ± 1.609
4.082GluArg: 4.082 ± 1.754
2.041GluSer: 2.041 ± 0.145
3.061GluThr: 3.061 ± 1.979
5.102GluVal: 5.102 ± 0.37
2.041GluTrp: 2.041 ± 1.319
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.041PheAla: 2.041 ± 1.609
1.02PheCys: 1.02 ± 0.66
7.143PheAsp: 7.143 ± 1.689
4.082PheGlu: 4.082 ± 0.29
1.02PhePhe: 1.02 ± 0.805
2.041PheGly: 2.041 ± 0.145
0.0PheHis: 0.0 ± 0.0
1.02PheIle: 1.02 ± 0.66
5.102PheLys: 5.102 ± 1.095
5.102PheLeu: 5.102 ± 2.559
2.041PheMet: 2.041 ± 1.319
1.02PheAsn: 1.02 ± 0.805
3.061PhePro: 3.061 ± 0.95
1.02PheGln: 1.02 ± 0.805
2.041PheArg: 2.041 ± 1.319
7.143PheSer: 7.143 ± 1.239
5.102PheThr: 5.102 ± 1.095
2.041PheVal: 2.041 ± 1.609
1.02PheTrp: 1.02 ± 0.66
2.041PheTyr: 2.041 ± 1.319
0.0PheXaa: 0.0 ± 0.0
Gly
2.041GlyAla: 2.041 ± 0.145
0.0GlyCys: 0.0 ± 0.0
4.082GlyAsp: 4.082 ± 0.29
6.122GlyGlu: 6.122 ± 1.899
6.122GlyPhe: 6.122 ± 0.435
5.102GlyGly: 5.102 ± 1.834
3.061GlyHis: 3.061 ± 0.95
3.061GlyIle: 3.061 ± 0.95
9.184GlyLys: 9.184 ± 0.08
2.041GlyLeu: 2.041 ± 0.145
3.061GlyMet: 3.061 ± 2.332
0.0GlyAsn: 0.0 ± 0.0
1.02GlyPro: 1.02 ± 0.805
3.061GlyGln: 3.061 ± 0.515
4.082GlyArg: 4.082 ± 2.639
7.143GlySer: 7.143 ± 1.239
4.082GlyThr: 4.082 ± 0.29
3.061GlyVal: 3.061 ± 0.95
2.041GlyTrp: 2.041 ± 1.319
1.02GlyTyr: 1.02 ± 0.66
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.02HisAsp: 1.02 ± 0.66
1.02HisGlu: 1.02 ± 0.66
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
2.041HisLeu: 2.041 ± 0.145
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
3.061HisArg: 3.061 ± 0.515
3.061HisSer: 3.061 ± 0.95
1.02HisThr: 1.02 ± 0.805
1.02HisVal: 1.02 ± 0.805
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.061IleAla: 3.061 ± 0.515
0.0IleCys: 0.0 ± 0.0
3.061IleAsp: 3.061 ± 1.979
0.0IleGlu: 0.0 ± 0.0
1.02IlePhe: 1.02 ± 0.66
1.02IleGly: 1.02 ± 0.805
0.0IleHis: 0.0 ± 0.0
2.041IleIle: 2.041 ± 1.319
1.02IleLys: 1.02 ± 0.805
3.061IleLeu: 3.061 ± 0.515
0.0IleMet: 0.0 ± 0.0
2.041IleAsn: 2.041 ± 1.319
3.061IlePro: 3.061 ± 0.95
0.0IleGln: 0.0 ± 0.0
3.061IleArg: 3.061 ± 1.979
2.041IleSer: 2.041 ± 1.609
1.02IleThr: 1.02 ± 0.805
2.041IleVal: 2.041 ± 0.145
2.041IleTrp: 2.041 ± 1.319
4.082IleTyr: 4.082 ± 2.639
0.0IleXaa: 0.0 ± 0.0
Lys
7.143LysAla: 7.143 ± 0.225
1.02LysCys: 1.02 ± 0.66
3.061LysAsp: 3.061 ± 0.515
1.02LysGlu: 1.02 ± 0.805
2.041LysPhe: 2.041 ± 1.609
2.041LysGly: 2.041 ± 0.145
0.0LysHis: 0.0 ± 0.0
1.02LysIle: 1.02 ± 0.66
3.061LysLys: 3.061 ± 1.979
4.082LysLeu: 4.082 ± 1.174
1.02LysMet: 1.02 ± 0.66
1.02LysAsn: 1.02 ± 0.66
4.082LysPro: 4.082 ± 0.29
3.061LysGln: 3.061 ± 1.979
3.061LysArg: 3.061 ± 0.515
6.122LysSer: 6.122 ± 0.435
6.122LysThr: 6.122 ± 1.03
2.041LysVal: 2.041 ± 1.319
2.041LysTrp: 2.041 ± 0.145
4.082LysTyr: 4.082 ± 1.174
0.0LysXaa: 0.0 ± 0.0
Leu
7.143LeuAla: 7.143 ± 0.225
1.02LeuCys: 1.02 ± 0.66
3.061LeuAsp: 3.061 ± 0.515
7.143LeuGlu: 7.143 ± 2.704
3.061LeuPhe: 3.061 ± 0.95
7.143LeuGly: 7.143 ± 1.689
3.061LeuHis: 3.061 ± 1.979
3.061LeuIle: 3.061 ± 1.979
1.02LeuLys: 1.02 ± 0.66
2.041LeuLeu: 2.041 ± 0.145
2.041LeuMet: 2.041 ± 1.319
3.061LeuAsn: 3.061 ± 0.95
5.102LeuPro: 5.102 ± 1.095
0.0LeuGln: 0.0 ± 0.0
6.122LeuArg: 6.122 ± 1.03
9.184LeuSer: 9.184 ± 4.313
0.0LeuThr: 0.0 ± 0.0
5.102LeuVal: 5.102 ± 2.559
2.041LeuTrp: 2.041 ± 0.145
4.082LeuTyr: 4.082 ± 1.174
0.0LeuXaa: 0.0 ± 0.0
Met
3.061MetAla: 3.061 ± 2.414
0.0MetCys: 0.0 ± 0.0
2.041MetAsp: 2.041 ± 0.145
2.041MetGlu: 2.041 ± 1.319
2.041MetPhe: 2.041 ± 0.145
2.041MetGly: 2.041 ± 0.145
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.02MetLys: 1.02 ± 0.66
2.041MetLeu: 2.041 ± 1.319
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.02MetPro: 1.02 ± 0.66
0.0MetGln: 0.0 ± 0.0
2.041MetArg: 2.041 ± 1.319
3.061MetSer: 3.061 ± 0.515
0.0MetThr: 0.0 ± 0.0
2.041MetVal: 2.041 ± 1.319
1.02MetTrp: 1.02 ± 0.66
2.041MetTyr: 2.041 ± 1.319
0.0MetXaa: 0.0 ± 0.0
Asn
3.061AsnAla: 3.061 ± 2.414
0.0AsnCys: 0.0 ± 0.0
3.061AsnAsp: 3.061 ± 0.95
0.0AsnGlu: 0.0 ± 0.0
1.02AsnPhe: 1.02 ± 0.66
2.041AsnGly: 2.041 ± 0.145
0.0AsnHis: 0.0 ± 0.0
3.061AsnIle: 3.061 ± 0.515
1.02AsnLys: 1.02 ± 0.66
1.02AsnLeu: 1.02 ± 0.66
1.02AsnMet: 1.02 ± 0.805
2.041AsnAsn: 2.041 ± 1.319
0.0AsnPro: 0.0 ± 0.0
0.0AsnGln: 0.0 ± 0.0
1.02AsnArg: 1.02 ± 0.805
4.082AsnSer: 4.082 ± 0.29
3.061AsnThr: 3.061 ± 1.979
1.02AsnVal: 1.02 ± 0.66
2.041AsnTrp: 2.041 ± 1.609
1.02AsnTyr: 1.02 ± 0.66
0.0AsnXaa: 0.0 ± 0.0
Pro
5.102ProAla: 5.102 ± 0.37
2.041ProCys: 2.041 ± 1.319
2.041ProAsp: 2.041 ± 0.145
7.143ProGlu: 7.143 ± 1.689
0.0ProPhe: 0.0 ± 0.0
7.143ProGly: 7.143 ± 2.704
0.0ProHis: 0.0 ± 0.0
1.02ProIle: 1.02 ± 0.66
3.061ProLys: 3.061 ± 0.515
4.082ProLeu: 4.082 ± 0.29
2.041ProMet: 2.041 ± 0.145
3.061ProAsn: 3.061 ± 0.515
3.061ProPro: 3.061 ± 0.95
3.061ProGln: 3.061 ± 2.414
1.02ProArg: 1.02 ± 0.66
4.082ProSer: 4.082 ± 1.754
5.102ProThr: 5.102 ± 0.37
4.082ProVal: 4.082 ± 0.29
2.041ProTrp: 2.041 ± 0.145
2.041ProTyr: 2.041 ± 0.145
0.0ProXaa: 0.0 ± 0.0
Gln
2.041GlnAla: 2.041 ± 1.609
0.0GlnCys: 0.0 ± 0.0
1.02GlnAsp: 1.02 ± 0.66
1.02GlnGlu: 1.02 ± 0.66
5.102GlnPhe: 5.102 ± 0.37
1.02GlnGly: 1.02 ± 0.805
0.0GlnHis: 0.0 ± 0.0
3.061GlnIle: 3.061 ± 0.515
3.061GlnLys: 3.061 ± 1.979
1.02GlnLeu: 1.02 ± 0.805
2.041GlnMet: 2.041 ± 1.609
1.02GlnAsn: 1.02 ± 0.66
2.041GlnPro: 2.041 ± 1.609
3.061GlnGln: 3.061 ± 0.95
3.061GlnArg: 3.061 ± 0.515
2.041GlnSer: 2.041 ± 0.145
3.061GlnThr: 3.061 ± 0.95
1.02GlnVal: 1.02 ± 0.805
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.061ArgAla: 3.061 ± 0.95
1.02ArgCys: 1.02 ± 0.66
5.102ArgAsp: 5.102 ± 0.37
5.102ArgGlu: 5.102 ± 1.834
6.122ArgPhe: 6.122 ± 0.435
3.061ArgGly: 3.061 ± 1.979
0.0ArgHis: 0.0 ± 0.0
0.0ArgIle: 0.0 ± 0.0
3.061ArgLys: 3.061 ± 0.95
10.204ArgLeu: 10.204 ± 3.653
2.041ArgMet: 2.041 ± 1.319
0.0ArgAsn: 0.0 ± 0.0
3.061ArgPro: 3.061 ± 1.979
3.061ArgGln: 3.061 ± 0.95
6.122ArgArg: 6.122 ± 0.435
7.143ArgSer: 7.143 ± 3.154
2.041ArgThr: 2.041 ± 0.145
3.061ArgVal: 3.061 ± 0.95
2.041ArgTrp: 2.041 ± 1.319
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
7.143SerAla: 7.143 ± 1.239
1.02SerCys: 1.02 ± 0.805
5.102SerAsp: 5.102 ± 1.095
4.082SerGlu: 4.082 ± 0.29
3.061SerPhe: 3.061 ± 0.95
9.184SerGly: 9.184 ± 2.849
1.02SerHis: 1.02 ± 0.805
2.041SerIle: 2.041 ± 0.145
6.122SerLys: 6.122 ± 0.435
5.102SerLeu: 5.102 ± 0.37
2.041SerMet: 2.041 ± 0.145
6.122SerAsn: 6.122 ± 0.435
2.041SerPro: 2.041 ± 0.145
6.122SerGln: 6.122 ± 1.899
3.061SerArg: 3.061 ± 0.95
12.245SerSer: 12.245 ± 2.334
7.143SerThr: 7.143 ± 0.225
9.184SerVal: 9.184 ± 1.384
1.02SerTrp: 1.02 ± 0.66
2.041SerTyr: 2.041 ± 0.145
0.0SerXaa: 0.0 ± 0.0
Thr
5.102ThrAla: 5.102 ± 2.559
0.0ThrCys: 0.0 ± 0.0
2.041ThrAsp: 2.041 ± 1.319
5.102ThrGlu: 5.102 ± 1.095
2.041ThrPhe: 2.041 ± 1.319
3.061ThrGly: 3.061 ± 0.95
0.0ThrHis: 0.0 ± 0.0
1.02ThrIle: 1.02 ± 0.805
4.082ThrLys: 4.082 ± 1.174
5.102ThrLeu: 5.102 ± 0.37
0.0ThrMet: 0.0 ± 0.0
1.02ThrAsn: 1.02 ± 0.66
5.102ThrPro: 5.102 ± 1.095
2.041ThrGln: 2.041 ± 1.319
6.122ThrArg: 6.122 ± 1.03
4.082ThrSer: 4.082 ± 1.174
6.122ThrThr: 6.122 ± 0.435
4.082ThrVal: 4.082 ± 1.754
0.0ThrTrp: 0.0 ± 0.0
2.041ThrTyr: 2.041 ± 0.145
0.0ThrXaa: 0.0 ± 0.0
Val
5.102ValAla: 5.102 ± 2.559
0.0ValCys: 0.0 ± 0.0
3.061ValAsp: 3.061 ± 0.515
2.041ValGlu: 2.041 ± 0.145
1.02ValPhe: 1.02 ± 0.805
4.082ValGly: 4.082 ± 0.29
2.041ValHis: 2.041 ± 0.145
1.02ValIle: 1.02 ± 0.66
3.061ValLys: 3.061 ± 0.515
6.122ValLeu: 6.122 ± 1.03
1.02ValMet: 1.02 ± 0.66
3.061ValAsn: 3.061 ± 0.515
7.143ValPro: 7.143 ± 0.225
2.041ValGln: 2.041 ± 1.609
3.061ValArg: 3.061 ± 2.414
5.102ValSer: 5.102 ± 2.559
2.041ValThr: 2.041 ± 1.609
7.143ValVal: 7.143 ± 4.168
3.061ValTrp: 3.061 ± 0.515
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
2.041TrpAla: 2.041 ± 1.609
1.02TrpCys: 1.02 ± 0.66
2.041TrpAsp: 2.041 ± 1.319
0.0TrpGlu: 0.0 ± 0.0
2.041TrpPhe: 2.041 ± 1.319
1.02TrpGly: 1.02 ± 0.66
1.02TrpHis: 1.02 ± 0.66
2.041TrpIle: 2.041 ± 1.319
2.041TrpLys: 2.041 ± 0.145
5.102TrpLeu: 5.102 ± 1.834
0.0TrpMet: 0.0 ± 0.0
1.02TrpAsn: 1.02 ± 0.805
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
2.041TrpArg: 2.041 ± 0.145
2.041TrpSer: 2.041 ± 1.609
2.041TrpThr: 2.041 ± 1.319
0.0TrpVal: 0.0 ± 0.0
1.02TrpTrp: 1.02 ± 0.66
1.02TrpTyr: 1.02 ± 0.66
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.061TyrAla: 3.061 ± 0.515
0.0TyrCys: 0.0 ± 0.0
0.0TyrAsp: 0.0 ± 0.0
2.041TyrGlu: 2.041 ± 1.319
1.02TyrPhe: 1.02 ± 0.66
3.061TyrGly: 3.061 ± 1.979
0.0TyrHis: 0.0 ± 0.0
0.0TyrIle: 0.0 ± 0.0
2.041TyrLys: 2.041 ± 1.319
1.02TyrLeu: 1.02 ± 0.66
1.02TyrMet: 1.02 ± 0.66
1.02TyrAsn: 1.02 ± 0.805
4.082TyrPro: 4.082 ± 1.174
1.02TyrGln: 1.02 ± 0.66
4.082TyrArg: 4.082 ± 1.174
6.122TyrSer: 6.122 ± 1.03
1.02TyrThr: 1.02 ± 0.66
0.0TyrVal: 0.0 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
1.02TyrTyr: 1.02 ± 0.66
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (981 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
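
A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the statistic is recomputed for each resampled set, and the spread of the results is reported. This is a minimal sketch under assumptions: the page does not say which spread measure is used (the standard deviation is assumed here), and the function names and the example statistic are hypothetical.

```python
import random
import statistics

def bootstrap_error(proteins, statistic, n_replicates=100, seed=0):
    """Standard deviation of `statistic` over resampled protein sets."""
    rng = random.Random(seed)
    values = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the set size fixed.
        sample = [rng.choice(proteins) for _ in proteins]
        values.append(statistic(sample))
    return statistics.pstdev(values)

def alaala_per_mille(proteins):
    """Example statistic: frequency of the pair 'AA' in per mille."""
    pairs = [seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)]
    return 1000.0 * pairs.count("AA") / len(pairs) if pairs else 0.0
```

With only two proteins, as in this proteome, each replicate can contain either protein twice or both once, so the estimated errors can be large relative to the values themselves.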

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski