Amino acid dipepetide frequency for Geminiviridae sp.

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.618AlaAla: 12.618 ± 4.778
0.0AlaCys: 0.0 ± 0.0
3.155AlaAsp: 3.155 ± 1.816
0.0AlaGlu: 0.0 ± 0.0
1.577AlaPhe: 1.577 ± 1.113
4.732AlaGly: 4.732 ± 1.894
4.732AlaHis: 4.732 ± 1.894
7.886AlaIle: 7.886 ± 3.23
3.155AlaLys: 3.155 ± 2.232
7.886AlaLeu: 7.886 ± 2.762
1.577AlaMet: 1.577 ± 1.116
4.732AlaAsn: 4.732 ± 2.24
3.155AlaPro: 3.155 ± 1.817
3.155AlaGln: 3.155 ± 2.227
9.464AlaArg: 9.464 ± 5.104
7.886AlaSer: 7.886 ± 1.291
6.309AlaThr: 6.309 ± 2.907
3.155AlaVal: 3.155 ± 1.989
0.0AlaTrp: 0.0 ± 0.0
1.577AlaTyr: 1.577 ± 1.113
0.0AlaXaa: 0.0 ± 0.0
Cys
3.155CysAla: 3.155 ± 1.817
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
3.155CysGlu: 3.155 ± 2.227
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.577CysLys: 1.577 ± 1.116
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
1.577CysGln: 1.577 ± 1.113
0.0CysArg: 0.0 ± 0.0
3.155CysSer: 3.155 ± 2.232
3.155CysThr: 3.155 ± 1.989
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.577CysTyr: 1.577 ± 1.113
0.0CysXaa: 0.0 ± 0.0
Asp
3.155AspAla: 3.155 ± 2.227
0.0AspCys: 0.0 ± 0.0
1.577AspAsp: 1.577 ± 1.116
1.577AspGlu: 1.577 ± 1.113
0.0AspPhe: 0.0 ± 0.0
3.155AspGly: 3.155 ± 1.081
1.577AspHis: 1.577 ± 1.116
4.732AspIle: 4.732 ± 2.345
3.155AspLys: 3.155 ± 1.081
4.732AspLeu: 4.732 ± 2.526
0.0AspMet: 0.0 ± 0.0
0.0AspAsn: 0.0 ± 0.0
0.0AspPro: 0.0 ± 0.0
0.0AspGln: 0.0 ± 0.0
4.732AspArg: 4.732 ± 1.5
3.155AspSer: 3.155 ± 1.961
4.732AspThr: 4.732 ± 1.894
3.155AspVal: 3.155 ± 1.081
0.0AspTrp: 0.0 ± 0.0
7.886AspTyr: 7.886 ± 2.762
0.0AspXaa: 0.0 ± 0.0
Glu
3.155GluAla: 3.155 ± 2.227
1.577GluCys: 1.577 ± 1.113
6.309GluAsp: 6.309 ± 2.915
0.0GluGlu: 0.0 ± 0.0
3.155GluPhe: 3.155 ± 1.081
1.577GluGly: 1.577 ± 1.891
1.577GluHis: 1.577 ± 1.116
1.577GluIle: 1.577 ± 1.113
4.732GluLys: 4.732 ± 3.348
1.577GluLeu: 1.577 ± 1.116
0.0GluMet: 0.0 ± 1.112
1.577GluAsn: 1.577 ± 1.116
1.577GluPro: 1.577 ± 1.116
0.0GluGln: 0.0 ± 0.0
9.464GluArg: 9.464 ± 4.396
0.0GluSer: 0.0 ± 0.0
1.577GluThr: 1.577 ± 1.113
0.0GluVal: 0.0 ± 0.0
1.577GluTrp: 1.577 ± 1.116
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.577PheAla: 1.577 ± 1.113
0.0PheCys: 0.0 ± 0.0
1.577PheAsp: 1.577 ± 1.113
0.0PheGlu: 0.0 ± 0.0
0.0PhePhe: 0.0 ± 0.0
0.0PheGly: 0.0 ± 0.0
1.577PheHis: 1.577 ± 1.116
4.732PheIle: 4.732 ± 1.89
1.577PheLys: 1.577 ± 1.116
3.155PheLeu: 3.155 ± 2.732
0.0PheMet: 0.0 ± 0.0
1.577PheAsn: 1.577 ± 1.113
3.155PhePro: 3.155 ± 1.081
3.155PheGln: 3.155 ± 1.081
4.732PheArg: 4.732 ± 2.526
1.577PheSer: 1.577 ± 1.116
0.0PheThr: 0.0 ± 0.0
0.0PheVal: 0.0 ± 0.0
0.0PheTrp: 0.0 ± 0.0
1.577PheTyr: 1.577 ± 1.113
0.0PheXaa: 0.0 ± 0.0
Gly
1.577GlyAla: 1.577 ± 2.016
0.0GlyCys: 0.0 ± 0.0
1.577GlyAsp: 1.577 ± 1.113
3.155GlyGlu: 3.155 ± 1.816
3.155GlyPhe: 3.155 ± 1.816
4.732GlyGly: 4.732 ± 3.537
0.0GlyHis: 0.0 ± 0.0
6.309GlyIle: 6.309 ± 3.979
1.577GlyLys: 1.577 ± 1.116
4.732GlyLeu: 4.732 ± 2.526
0.0GlyMet: 0.0 ± 0.0
0.0GlyAsn: 0.0 ± 0.0
3.155GlyPro: 3.155 ± 2.227
3.155GlyGln: 3.155 ± 2.227
6.309GlyArg: 6.309 ± 3.994
1.577GlySer: 1.577 ± 1.113
9.464GlyThr: 9.464 ± 1.895
3.155GlyVal: 3.155 ± 2.227
0.0GlyTrp: 0.0 ± 0.0
3.155GlyTyr: 3.155 ± 1.081
0.0GlyXaa: 0.0 ± 0.0
His
3.155HisAla: 3.155 ± 1.081
0.0HisCys: 0.0 ± 0.0
1.577HisAsp: 1.577 ± 1.113
6.309HisGlu: 6.309 ± 4.464
1.577HisPhe: 1.577 ± 1.116
0.0HisGly: 0.0 ± 0.0
4.732HisHis: 4.732 ± 1.89
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
4.732HisLeu: 4.732 ± 1.5
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.577HisPro: 1.577 ± 1.891
0.0HisGln: 0.0 ± 0.0
1.577HisArg: 1.577 ± 1.113
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.732IleAla: 4.732 ± 1.371
0.0IleCys: 0.0 ± 0.0
4.732IleAsp: 4.732 ± 1.89
0.0IleGlu: 0.0 ± 0.0
0.0IlePhe: 0.0 ± 0.0
3.155IleGly: 3.155 ± 1.081
0.0IleHis: 0.0 ± 0.0
0.0IleIle: 0.0 ± 0.0
3.155IleLys: 3.155 ± 2.227
3.155IleLeu: 3.155 ± 1.989
4.732IleMet: 4.732 ± 2.39
4.732IleAsn: 4.732 ± 3.687
6.309IlePro: 6.309 ± 2.907
3.155IleGln: 3.155 ± 1.081
6.309IleArg: 6.309 ± 1.668
3.155IleSer: 3.155 ± 1.989
3.155IleThr: 3.155 ± 1.817
4.732IleVal: 4.732 ± 2.277
0.0IleTrp: 0.0 ± 0.0
3.155IleTyr: 3.155 ± 1.961
0.0IleXaa: 0.0 ± 0.0
Lys
3.155LysAla: 3.155 ± 1.081
0.0LysCys: 0.0 ± 0.0
1.577LysAsp: 1.577 ± 1.116
4.732LysGlu: 4.732 ± 3.348
1.577LysPhe: 1.577 ± 1.113
3.155LysGly: 3.155 ± 1.081
1.577LysHis: 1.577 ± 1.113
4.732LysIle: 4.732 ± 2.57
1.577LysLys: 1.577 ± 1.116
1.577LysLeu: 1.577 ± 1.113
1.577LysMet: 1.577 ± 1.113
1.577LysAsn: 1.577 ± 1.116
1.577LysPro: 1.577 ± 1.116
1.577LysGln: 1.577 ± 1.116
4.732LysArg: 4.732 ± 1.89
1.577LysSer: 1.577 ± 1.116
3.155LysThr: 3.155 ± 1.081
3.155LysVal: 3.155 ± 1.081
1.577LysTrp: 1.577 ± 1.116
3.155LysTyr: 3.155 ± 2.232
0.0LysXaa: 0.0 ± 0.0
Leu
9.464LeuAla: 9.464 ± 2.304
4.732LeuCys: 4.732 ± 1.894
9.464LeuAsp: 9.464 ± 2.335
7.886LeuGlu: 7.886 ± 1.425
0.0LeuPhe: 0.0 ± 0.0
6.309LeuGly: 6.309 ± 1.668
0.0LeuHis: 0.0 ± 0.0
4.732LeuIle: 4.732 ± 3.847
3.155LeuLys: 3.155 ± 2.232
9.464LeuLeu: 9.464 ± 5.104
0.0LeuMet: 0.0 ± 0.0
6.309LeuAsn: 6.309 ± 1.509
3.155LeuPro: 3.155 ± 2.232
4.732LeuGln: 4.732 ± 1.894
14.196LeuArg: 14.196 ± 9.673
6.309LeuSer: 6.309 ± 3.274
6.309LeuThr: 6.309 ± 2.162
6.309LeuVal: 6.309 ± 1.509
0.0LeuTrp: 0.0 ± 0.0
3.155LeuTyr: 3.155 ± 2.732
0.0LeuXaa: 0.0 ± 0.0
Met
1.577MetAla: 1.577 ± 1.113
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
1.577MetPhe: 1.577 ± 1.116
0.0MetGly: 0.0 ± 0.0
1.577MetHis: 1.577 ± 1.113
0.0MetIle: 0.0 ± 0.0
1.577MetLys: 1.577 ± 1.116
1.577MetLeu: 1.577 ± 2.016
0.0MetMet: 0.0 ± 0.0
1.577MetAsn: 1.577 ± 2.016
3.155MetPro: 3.155 ± 3.782
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
1.577MetSer: 1.577 ± 1.113
1.577MetThr: 1.577 ± 1.116
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.577AsnAla: 1.577 ± 2.016
0.0AsnCys: 0.0 ± 0.0
3.155AsnAsp: 3.155 ± 1.961
4.732AsnGlu: 4.732 ± 2.518
1.577AsnPhe: 1.577 ± 2.016
1.577AsnGly: 1.577 ± 1.113
0.0AsnHis: 0.0 ± 0.0
0.0AsnIle: 0.0 ± 0.0
4.732AsnLys: 4.732 ± 3.34
6.309AsnLeu: 6.309 ± 4.464
0.0AsnMet: 0.0 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
1.577AsnPro: 1.577 ± 2.016
0.0AsnGln: 0.0 ± 0.0
0.0AsnArg: 0.0 ± 0.0
3.155AsnSer: 3.155 ± 2.227
1.577AsnThr: 1.577 ± 1.891
4.732AsnVal: 4.732 ± 3.34
0.0AsnTrp: 0.0 ± 0.0
4.732AsnTyr: 4.732 ± 1.5
0.0AsnXaa: 0.0 ± 0.0
Pro
6.309ProAla: 6.309 ± 1.922
0.0ProCys: 0.0 ± 0.0
3.155ProAsp: 3.155 ± 2.232
1.577ProGlu: 1.577 ± 1.113
1.577ProPhe: 1.577 ± 2.016
3.155ProGly: 3.155 ± 1.816
1.577ProHis: 1.577 ± 1.116
1.577ProIle: 1.577 ± 2.016
3.155ProLys: 3.155 ± 1.081
9.464ProLeu: 9.464 ± 3.513
0.0ProMet: 0.0 ± 0.0
3.155ProAsn: 3.155 ± 1.961
4.732ProPro: 4.732 ± 1.5
4.732ProGln: 4.732 ± 1.89
3.155ProArg: 3.155 ± 1.816
3.155ProSer: 3.155 ± 1.817
1.577ProThr: 1.577 ± 1.113
1.577ProVal: 1.577 ± 2.016
1.577ProTrp: 1.577 ± 2.016
1.577ProTyr: 1.577 ± 1.891
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
1.577GlnCys: 1.577 ± 1.113
0.0GlnAsp: 0.0 ± 0.0
0.0GlnGlu: 0.0 ± 0.0
0.0GlnPhe: 0.0 ± 0.0
4.732GlnGly: 4.732 ± 3.34
0.0GlnHis: 0.0 ± 0.0
3.155GlnIle: 3.155 ± 1.081
0.0GlnLys: 0.0 ± 0.0
6.309GlnLeu: 6.309 ± 2.915
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
3.155GlnPro: 3.155 ± 1.081
3.155GlnGln: 3.155 ± 1.081
3.155GlnArg: 3.155 ± 1.081
3.155GlnSer: 3.155 ± 1.081
3.155GlnThr: 3.155 ± 1.081
3.155GlnVal: 3.155 ± 1.081
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
11.041ArgAla: 11.041 ± 4.971
6.309ArgCys: 6.309 ± 3.274
1.577ArgAsp: 1.577 ± 1.891
3.155ArgGlu: 3.155 ± 1.961
1.577ArgPhe: 1.577 ± 1.113
4.732ArgGly: 4.732 ± 2.277
0.0ArgHis: 0.0 ± 0.0
6.309ArgIle: 6.309 ± 3.238
4.732ArgLys: 4.732 ± 3.34
9.464ArgLeu: 9.464 ± 7.265
0.0ArgMet: 0.0 ± 0.0
4.732ArgAsn: 4.732 ± 1.89
1.577ArgPro: 1.577 ± 1.113
1.577ArgGln: 1.577 ± 1.116
11.041ArgArg: 11.041 ± 6.056
7.886ArgSer: 7.886 ± 5.514
9.464ArgThr: 9.464 ± 4.907
6.309ArgVal: 6.309 ± 3.634
4.732ArgTrp: 4.732 ± 5.673
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
7.886SerAla: 7.886 ± 3.292
3.155SerCys: 3.155 ± 1.081
1.577SerAsp: 1.577 ± 1.113
1.577SerGlu: 1.577 ± 1.891
3.155SerPhe: 3.155 ± 1.081
3.155SerGly: 3.155 ± 1.989
1.577SerHis: 1.577 ± 1.891
1.577SerIle: 1.577 ± 1.113
1.577SerLys: 1.577 ± 1.116
6.309SerLeu: 6.309 ± 1.509
1.577SerMet: 1.577 ± 2.016
4.732SerAsn: 4.732 ± 1.371
6.309SerPro: 6.309 ± 3.634
0.0SerGln: 0.0 ± 0.0
4.732SerArg: 4.732 ± 2.518
3.155SerSer: 3.155 ± 1.081
1.577SerThr: 1.577 ± 1.113
4.732SerVal: 4.732 ± 1.89
0.0SerTrp: 0.0 ± 0.0
3.155SerTyr: 3.155 ± 1.961
0.0SerXaa: 0.0 ± 0.0
Thr
4.732ThrAla: 4.732 ± 3.34
0.0ThrCys: 0.0 ± 0.0
3.155ThrAsp: 3.155 ± 1.081
1.577ThrGlu: 1.577 ± 1.116
1.577ThrPhe: 1.577 ± 1.116
4.732ThrGly: 4.732 ± 1.89
0.0ThrHis: 0.0 ± 0.0
3.155ThrIle: 3.155 ± 1.816
3.155ThrLys: 3.155 ± 2.232
15.773ThrLeu: 15.773 ± 6.038
0.0ThrMet: 0.0 ± 0.0
3.155ThrAsn: 3.155 ± 1.081
3.155ThrPro: 3.155 ± 2.232
0.0ThrGln: 0.0 ± 0.0
1.577ThrArg: 1.577 ± 1.113
3.155ThrSer: 3.155 ± 1.817
6.309ThrThr: 6.309 ± 1.771
6.309ThrVal: 6.309 ± 1.716
0.0ThrTrp: 0.0 ± 0.0
4.732ThrTyr: 4.732 ± 3.34
0.0ThrXaa: 0.0 ± 0.0
Val
4.732ValAla: 4.732 ± 2.518
0.0ValCys: 0.0 ± 0.0
0.0ValAsp: 0.0 ± 0.0
0.0ValGlu: 0.0 ± 0.0
0.0ValPhe: 0.0 ± 0.0
4.732ValGly: 4.732 ± 3.673
3.155ValHis: 3.155 ± 1.081
4.732ValIle: 4.732 ± 3.34
3.155ValLys: 3.155 ± 1.961
7.886ValLeu: 7.886 ± 1.425
1.577ValMet: 1.577 ± 1.113
1.577ValAsn: 1.577 ± 1.113
3.155ValPro: 3.155 ± 1.817
3.155ValGln: 3.155 ± 1.081
7.886ValArg: 7.886 ± 3.924
4.732ValSer: 4.732 ± 1.371
0.0ValThr: 0.0 ± 0.0
4.732ValVal: 4.732 ± 3.673
1.577ValTrp: 1.577 ± 1.113
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.577TrpAla: 1.577 ± 1.116
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
4.732TrpPhe: 4.732 ± 1.89
3.155TrpGly: 3.155 ± 2.732
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
1.577TrpMet: 1.577 ± 1.491
0.0TrpAsn: 0.0 ± 0.0
3.155TrpPro: 3.155 ± 3.782
1.577TrpGln: 1.577 ± 1.116
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.577TyrAla: 1.577 ± 1.116
0.0TyrCys: 0.0 ± 0.0
3.155TyrAsp: 3.155 ± 1.816
3.155TyrGlu: 3.155 ± 2.227
3.155TyrPhe: 3.155 ± 1.961
0.0TyrGly: 0.0 ± 0.0
1.577TyrHis: 1.577 ± 1.891
3.155TyrIle: 3.155 ± 1.961
1.577TyrLys: 1.577 ± 1.116
1.577TyrLeu: 1.577 ± 1.116
1.577TyrMet: 1.577 ± 0.993
0.0TyrAsn: 0.0 ± 0.0
3.155TyrPro: 3.155 ± 1.816
0.0TyrGln: 0.0 ± 0.0
3.155TyrArg: 3.155 ± 2.227
3.155TyrSer: 3.155 ± 1.081
3.155TyrThr: 3.155 ± 1.081
1.577TyrVal: 1.577 ± 2.016
4.732TyrTrp: 4.732 ± 1.5
1.577TyrTyr: 1.577 ± 1.113
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (635 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski