Amino acid dipeptide frequency for Gigaspora margarita giardia-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
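The calculation described above can be sketched in Python. This is only an illustration under stated assumptions, not the code used to build this page: the function names `dipeptide_permille` and `classify` are hypothetical, dipeptides are assumed to be counted as overlapping residue pairs within each protein, and any letter outside the 20 standard amino acids is mapped to Xaa.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything that is not a standard residue becomes Xaa.
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


def classify(freq_permille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"
```

For example, `classify(dipeptide_permille(["ACDA", "AC"])["AlaCys"])` labels AlaCys as overrepresented in that toy input, because two of its four residue pairs are Ala followed by Cys.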

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.695 ± 0.402
AlaAsp: 2.78 ± 1.607
AlaGlu: 0.0 ± 0.0
AlaPhe: 1.39 ± 0.803
AlaGly: 1.39 ± 0.581
AlaHis: 0.0 ± 0.0
AlaIle: 2.78 ± 1.607
AlaLys: 1.39 ± 0.803
AlaLeu: 2.085 ± 0.179
AlaMet: 0.695 ± 0.533
AlaAsn: 2.085 ± 0.179
AlaPro: 2.78 ± 2.546
AlaGln: 1.39 ± 0.803
AlaArg: 1.39 ± 0.803
AlaSer: 4.17 ± 1.025
AlaThr: 2.78 ± 0.222
AlaVal: 1.39 ± 0.581
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.085 ± 1.564
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 2.085 ± 1.205
CysGlu: 0.0 ± 0.0
CysPhe: 0.695 ± 0.402
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 2.78 ± 0.222
CysLys: 4.17 ± 2.41
CysLeu: 3.475 ± 0.624
CysMet: 0.0 ± 0.0
CysAsn: 0.695 ± 0.983
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 2.085 ± 1.205
CysThr: 0.695 ± 0.402
CysVal: 1.39 ± 0.803
CysTrp: 0.0 ± 0.0
CysTyr: 2.085 ± 0.179
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.78 ± 1.162
AspCys: 1.39 ± 0.803
AspAsp: 9.034 ± 3.837
AspGlu: 4.864 ± 1.342
AspPhe: 1.39 ± 0.803
AspGly: 3.475 ± 2.008
AspHis: 1.39 ± 0.581
AspIle: 4.864 ± 1.342
AspLys: 4.17 ± 2.41
AspLeu: 6.949 ± 2.632
AspMet: 0.695 ± 0.983
AspAsn: 4.17 ± 1.025
AspPro: 2.085 ± 0.179
AspGln: 6.949 ± 1.248
AspArg: 2.085 ± 1.205
AspSer: 3.475 ± 0.624
AspThr: 4.17 ± 3.127
AspVal: 4.17 ± 1.743
AspTrp: 1.39 ± 0.581
AspTyr: 9.034 ± 2.453
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.39 ± 0.803
GluCys: 0.695 ± 0.983
GluAsp: 4.864 ± 1.427
GluGlu: 0.695 ± 0.402
GluPhe: 2.78 ± 1.607
GluGly: 2.085 ± 1.205
GluHis: 1.39 ± 0.581
GluIle: 2.085 ± 1.205
GluLys: 6.949 ± 1.248
GluLeu: 5.559 ± 0.444
GluMet: 2.78 ± 0.222
GluAsn: 4.17 ± 1.743
GluPro: 1.39 ± 0.581
GluGln: 2.085 ± 1.205
GluArg: 5.559 ± 0.444
GluSer: 5.559 ± 0.94
GluThr: 0.0 ± 0.0
GluVal: 1.39 ± 0.803
GluTrp: 0.0 ± 0.0
GluTyr: 4.17 ± 1.025
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.695 ± 0.983
PheCys: 4.17 ± 2.41
PheAsp: 6.254 ± 0.846
PheGlu: 1.39 ± 0.803
PhePhe: 4.17 ± 1.025
PheGly: 2.085 ± 0.179
PheHis: 0.695 ± 0.983
PheIle: 2.78 ± 1.162
PheLys: 2.085 ± 0.179
PheLeu: 5.559 ± 3.213
PheMet: 2.085 ± 1.564
PheAsn: 4.864 ± 0.043
PhePro: 0.0 ± 0.0
PheGln: 0.695 ± 0.402
PheArg: 4.17 ± 0.359
PheSer: 6.254 ± 0.846
PheThr: 2.085 ± 0.179
PheVal: 0.695 ± 0.402
PheTrp: 2.78 ± 1.607
PheTyr: 3.475 ± 0.624
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.0 ± 0.0
GlyCys: 0.0 ± 0.0
GlyAsp: 3.475 ± 0.76
GlyGlu: 2.085 ± 0.179
GlyPhe: 3.475 ± 2.008
GlyGly: 0.695 ± 0.402
GlyHis: 0.695 ± 0.983
GlyIle: 2.78 ± 0.222
GlyLys: 6.254 ± 0.538
GlyLeu: 2.78 ± 0.222
GlyMet: 0.0 ± 0.0
GlyAsn: 1.39 ± 0.581
GlyPro: 0.695 ± 0.402
GlyGln: 1.39 ± 1.965
GlyArg: 2.085 ± 1.205
GlySer: 2.78 ± 0.222
GlyThr: 4.17 ± 4.512
GlyVal: 4.17 ± 0.359
GlyTrp: 0.0 ± 0.0
GlyTyr: 4.17 ± 0.359
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.085 ± 0.179
HisCys: 0.695 ± 0.402
HisAsp: 1.39 ± 0.803
HisGlu: 0.0 ± 0.0
HisPhe: 2.085 ± 1.564
HisGly: 0.695 ± 0.983
HisHis: 0.0 ± 0.0
HisIle: 0.695 ± 0.402
HisLys: 0.695 ± 0.402
HisLeu: 1.39 ± 0.803
HisMet: 0.695 ± 0.983
HisAsn: 0.695 ± 0.983
HisPro: 0.695 ± 0.983
HisGln: 0.695 ± 0.983
HisArg: 1.39 ± 0.581
HisSer: 1.39 ± 0.581
HisThr: 0.0 ± 0.0
HisVal: 5.559 ± 3.708
HisTrp: 0.0 ± 0.0
HisTyr: 0.695 ± 0.983
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.085 ± 1.205
IleCys: 0.695 ± 0.402
IleAsp: 4.17 ± 0.359
IleGlu: 8.339 ± 0.667
IlePhe: 6.949 ± 2.632
IleGly: 4.17 ± 3.127
IleHis: 2.78 ± 0.222
IleIle: 3.475 ± 0.624
IleLys: 6.254 ± 0.846
IleLeu: 4.17 ± 2.41
IleMet: 1.39 ± 0.803
IleAsn: 3.475 ± 0.76
IlePro: 4.17 ± 1.743
IleGln: 0.0 ± 0.0
IleArg: 5.559 ± 1.829
IleSer: 6.254 ± 3.615
IleThr: 1.39 ± 0.803
IleVal: 3.475 ± 2.008
IleTrp: 0.0 ± 0.0
IleTyr: 2.085 ± 1.564
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.695 ± 0.402
LysCys: 2.78 ± 1.607
LysAsp: 5.559 ± 1.829
LysGlu: 5.559 ± 1.829
LysPhe: 5.559 ± 0.444
LysGly: 3.475 ± 2.008
LysHis: 2.78 ± 1.162
LysIle: 6.254 ± 3.615
LysLys: 8.339 ± 3.435
LysLeu: 6.949 ± 2.632
LysMet: 3.475 ± 0.624
LysAsn: 2.085 ± 0.179
LysPro: 0.695 ± 0.983
LysGln: 1.39 ± 0.803
LysArg: 4.17 ± 1.025
LysSer: 3.475 ± 0.624
LysThr: 2.085 ± 1.205
LysVal: 3.475 ± 0.76
LysTrp: 3.475 ± 2.145
LysTyr: 4.864 ± 0.043
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.17 ± 0.359
LeuCys: 0.0 ± 0.0
LeuAsp: 10.424 ± 4.64
LeuGlu: 7.644 ± 1.649
LeuPhe: 4.17 ± 2.41
LeuGly: 3.475 ± 0.76
LeuHis: 4.864 ± 0.043
LeuIle: 5.559 ± 1.829
LeuLys: 4.864 ± 1.427
LeuLeu: 4.17 ± 0.359
LeuMet: 5.559 ± 1.829
LeuAsn: 4.17 ± 0.359
LeuPro: 2.085 ± 1.205
LeuGln: 3.475 ± 0.76
LeuArg: 2.78 ± 1.607
LeuSer: 6.254 ± 0.538
LeuThr: 2.78 ± 1.162
LeuVal: 2.085 ± 0.179
LeuTrp: 0.695 ± 0.983
LeuTyr: 9.034 ± 1.068
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.39 ± 0.581
MetCys: 1.39 ± 0.581
MetAsp: 0.0 ± 0.0
MetGlu: 2.78 ± 1.607
MetPhe: 0.695 ± 0.402
MetGly: 3.475 ± 2.145
MetHis: 0.0 ± 0.0
MetIle: 2.085 ± 0.179
MetLys: 4.17 ± 1.025
MetLeu: 2.78 ± 1.607
MetMet: 0.695 ± 0.983
MetAsn: 2.085 ± 1.205
MetPro: 0.695 ± 0.402
MetGln: 0.0 ± 0.0
MetArg: 1.39 ± 0.803
MetSer: 4.864 ± 1.342
MetThr: 2.78 ± 2.546
MetVal: 2.085 ± 0.179
MetTrp: 0.0 ± 0.0
MetTyr: 0.695 ± 0.402
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.864 ± 1.342
AsnCys: 0.695 ± 0.402
AsnAsp: 1.39 ± 1.965
AsnGlu: 4.864 ± 1.427
AsnPhe: 4.864 ± 0.043
AsnGly: 2.78 ± 0.222
AsnHis: 2.78 ± 2.546
AsnIle: 6.949 ± 0.137
AsnLys: 5.559 ± 0.444
AsnLeu: 4.864 ± 1.427
AsnMet: 2.78 ± 0.222
AsnAsn: 6.254 ± 0.846
AsnPro: 2.78 ± 1.162
AsnGln: 0.695 ± 0.983
AsnArg: 2.78 ± 2.546
AsnSer: 3.475 ± 0.624
AsnThr: 1.39 ± 0.581
AsnVal: 1.39 ± 0.581
AsnTrp: 1.39 ± 0.581
AsnTyr: 4.17 ± 0.359
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.695 ± 0.402
ProCys: 0.0 ± 0.0
ProAsp: 3.475 ± 2.145
ProGlu: 1.39 ± 0.803
ProPhe: 2.78 ± 1.162
ProGly: 0.0 ± 0.0
ProHis: 0.695 ± 0.402
ProIle: 1.39 ± 0.581
ProLys: 2.085 ± 1.205
ProLeu: 2.78 ± 0.222
ProMet: 0.695 ± 0.983
ProAsn: 1.39 ± 0.581
ProPro: 0.695 ± 0.983
ProGln: 2.78 ± 1.162
ProArg: 1.39 ± 0.581
ProSer: 0.695 ± 0.402
ProThr: 0.0 ± 0.0
ProVal: 3.475 ± 2.145
ProTrp: 0.695 ± 0.983
ProTyr: 1.39 ± 0.581
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.78 ± 0.222
GlnCys: 0.0 ± 0.0
GlnAsp: 2.085 ± 0.179
GlnGlu: 1.39 ± 0.803
GlnPhe: 0.0 ± 0.0
GlnGly: 2.78 ± 0.222
GlnHis: 0.695 ± 0.983
GlnIle: 0.695 ± 0.983
GlnLys: 1.39 ± 0.803
GlnLeu: 0.695 ± 0.983
GlnMet: 0.695 ± 0.402
GlnAsn: 0.695 ± 0.983
GlnPro: 0.695 ± 0.402
GlnGln: 1.39 ± 0.581
GlnArg: 2.78 ± 2.546
GlnSer: 2.78 ± 0.222
GlnThr: 1.39 ± 0.581
GlnVal: 0.695 ± 0.402
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.475 ± 0.76
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.695 ± 0.402
ArgCys: 1.39 ± 0.581
ArgAsp: 0.695 ± 0.402
ArgGlu: 4.864 ± 1.342
ArgPhe: 2.78 ± 0.222
ArgGly: 1.39 ± 0.803
ArgHis: 0.695 ± 0.402
ArgIle: 4.17 ± 2.41
ArgLys: 2.78 ± 0.222
ArgLeu: 5.559 ± 2.324
ArgMet: 0.695 ± 0.402
ArgAsn: 4.864 ± 2.811
ArgPro: 0.695 ± 0.983
ArgGln: 0.0 ± 0.0
ArgArg: 3.475 ± 0.624
ArgSer: 5.559 ± 0.444
ArgThr: 2.085 ± 0.179
ArgVal: 1.39 ± 0.581
ArgTrp: 0.695 ± 0.983
ArgTyr: 4.864 ± 1.342
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.39 ± 0.581
SerCys: 2.085 ± 0.179
SerAsp: 7.644 ± 1.649
SerGlu: 3.475 ± 2.008
SerPhe: 6.254 ± 1.923
SerGly: 5.559 ± 0.94
SerHis: 0.0 ± 0.0
SerIle: 7.644 ± 1.649
SerLys: 2.78 ± 0.222
SerLeu: 8.339 ± 2.051
SerMet: 2.78 ± 1.816
SerAsn: 5.559 ± 0.94
SerPro: 2.085 ± 0.179
SerGln: 1.39 ± 0.581
SerArg: 1.39 ± 0.581
SerSer: 1.39 ± 0.803
SerThr: 2.78 ± 0.222
SerVal: 4.17 ± 1.743
SerTrp: 0.0 ± 0.0
SerTyr: 6.949 ± 1.248
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.085 ± 1.205
ThrCys: 0.695 ± 0.402
ThrAsp: 0.0 ± 0.0
ThrGlu: 1.39 ± 0.581
ThrPhe: 2.78 ± 0.222
ThrGly: 1.39 ± 1.965
ThrHis: 0.695 ± 0.983
ThrIle: 0.695 ± 0.983
ThrLys: 2.085 ± 0.179
ThrLeu: 6.949 ± 1.248
ThrMet: 2.085 ± 1.564
ThrAsn: 2.78 ± 2.546
ThrPro: 0.695 ± 0.983
ThrGln: 1.39 ± 1.965
ThrArg: 2.78 ± 1.162
ThrSer: 4.17 ± 0.359
ThrThr: 3.475 ± 3.529
ThrVal: 2.78 ± 1.162
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.085 ± 0.179
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.39 ± 0.803
ValCys: 2.085 ± 1.205
ValAsp: 2.085 ± 1.564
ValGlu: 2.085 ± 1.205
ValPhe: 1.39 ± 0.581
ValGly: 0.695 ± 0.983
ValHis: 0.695 ± 0.983
ValIle: 5.559 ± 0.94
ValLys: 4.17 ± 0.359
ValLeu: 3.475 ± 2.145
ValMet: 2.085 ± 0.179
ValAsn: 7.644 ± 1.119
ValPro: 2.085 ± 1.564
ValGln: 1.39 ± 0.581
ValArg: 1.39 ± 0.803
ValSer: 4.17 ± 1.743
ValThr: 0.695 ± 0.983
ValVal: 4.17 ± 3.127
ValTrp: 0.695 ± 0.402
ValTyr: 2.78 ± 2.546
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.39 ± 0.803
TrpCys: 0.0 ± 0.0
TrpAsp: 0.695 ± 0.983
TrpGlu: 0.695 ± 0.983
TrpPhe: 0.695 ± 0.983
TrpGly: 0.0 ± 0.0
TrpHis: 0.695 ± 0.983
TrpIle: 0.695 ± 0.402
TrpLys: 0.0 ± 0.0
TrpLeu: 0.695 ± 0.402
TrpMet: 0.695 ± 0.402
TrpAsn: 0.0 ± 0.0
TrpPro: 0.695 ± 0.402
TrpGln: 0.0 ± 0.0
TrpArg: 0.695 ± 0.402
TrpSer: 0.0 ± 0.0
TrpThr: 2.78 ± 1.162
TrpVal: 2.085 ± 1.564
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.695 ± 0.402
TyrCys: 0.695 ± 0.402
TyrAsp: 10.424 ± 2.281
TyrGlu: 2.085 ± 1.564
TyrPhe: 2.085 ± 0.179
TyrGly: 3.475 ± 0.76
TyrHis: 0.0 ± 0.0
TyrIle: 6.949 ± 2.632
TyrLys: 6.949 ± 1.248
TyrLeu: 9.034 ± 1.7
TyrMet: 2.78 ± 0.222
TyrAsn: 7.644 ± 1.119
TyrPro: 2.78 ± 1.607
TyrGln: 0.0 ± 0.0
TyrArg: 2.085 ± 1.564
TyrSer: 5.559 ± 0.94
TyrThr: 3.475 ± 0.624
TyrVal: 0.695 ± 0.983
TyrTrp: 0.695 ± 0.402
TyrTyr: 4.864 ± 0.043
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1440 amino acids)
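
With so small a sample, each table value corresponds to only a handful of observed dipeptides. The sketch below converts a per mille value back to an approximate count. The helper name `permille_to_count` is hypothetical, and since the exact number of dipeptide positions used for this page is not stated, the residue total (1440) is taken as a rough stand-in, so the results are approximations only.

```python
def permille_to_count(value_permille, n_pairs):
    """Approximate number of observed dipeptides behind a per mille value."""
    # per mille -> fraction (multiply by 10^-3), then scale by the pair total.
    return round(value_permille * 1e-3 * n_pairs)


# Assumption: about 1440 dipeptide positions, approximated by the residue count.
APPROX_PAIRS = 1440

approx_counts = {
    name: permille_to_count(value, APPROX_PAIRS)
    for name, value in [("AlaCys", 0.695), ("AlaSer", 4.17), ("LeuAsp", 10.424)]
}
```

Under this approximation the smallest non-zero entry in the table (0.695‰) corresponds to a single observed dipeptide, which is why many of the error estimates are as large as the values themselves.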

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
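
A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the procedure actually used for this database: the function names are hypothetical, sequences use one-letter codes, whole proteins are resampled with replacement, and the population standard deviation over replicates is assumed as the error measure.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences):
    """Frequencies (per mille) of overlapping residue pairs, one-letter codes."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if not total:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_sd(sequences, pair, n_replicates=100, seed=0):
    """Spread of one dipeptide frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        values.append(pair_permille(sample).get(pair, 0.0))
    # Assumption: population standard deviation of the replicate values.
    return statistics.pstdev(values)
```

With only 2 proteins there are few distinct resamples, so error estimates obtained this way are necessarily coarse.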

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski