Amino acid dipepetide frequency for Odonata-associated circular virus-10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.559AlaAla: 3.559 ± 3.432
3.559AlaCys: 3.559 ± 2.15
3.559AlaAsp: 3.559 ± 0.641
3.559AlaGlu: 3.559 ± 2.15
3.559AlaPhe: 3.559 ± 0.641
5.338AlaGly: 5.338 ± 0.435
0.0AlaHis: 0.0 ± 0.0
7.117AlaIle: 7.117 ± 1.281
0.0AlaLys: 0.0 ± 0.0
1.779AlaLeu: 1.779 ± 1.075
0.0AlaMet: 0.0 ± 0.0
1.779AlaAsn: 1.779 ± 1.075
1.779AlaPro: 1.779 ± 1.075
3.559AlaGln: 3.559 ± 0.641
8.897AlaArg: 8.897 ± 2.585
7.117AlaSer: 7.117 ± 6.864
1.779AlaThr: 1.779 ± 1.075
10.676AlaVal: 10.676 ± 10.295
0.0AlaTrp: 0.0 ± 0.0
3.559AlaTyr: 3.559 ± 2.15
0.0AlaXaa: 0.0 ± 0.0
Cys
3.559CysAla: 3.559 ± 0.641
0.0CysCys: 0.0 ± 0.0
1.779CysAsp: 1.779 ± 1.075
1.779CysGlu: 1.779 ± 1.075
3.559CysPhe: 3.559 ± 2.15
0.0CysGly: 0.0 ± 0.0
1.779CysHis: 1.779 ± 1.075
3.559CysIle: 3.559 ± 2.15
1.779CysLys: 1.779 ± 1.075
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.779CysAsn: 1.779 ± 1.075
1.779CysPro: 1.779 ± 1.075
1.779CysGln: 1.779 ± 1.075
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.559AspAla: 3.559 ± 0.641
1.779AspCys: 1.779 ± 1.075
1.779AspAsp: 1.779 ± 1.075
3.559AspGlu: 3.559 ± 2.15
0.0AspPhe: 0.0 ± 0.0
7.117AspGly: 7.117 ± 1.281
0.0AspHis: 0.0 ± 0.0
3.559AspIle: 3.559 ± 2.15
5.338AspLys: 5.338 ± 0.435
8.897AspLeu: 8.897 ± 2.585
1.779AspMet: 1.779 ± 1.075
3.559AspAsn: 3.559 ± 3.432
1.779AspPro: 1.779 ± 1.716
3.559AspGln: 3.559 ± 3.432
3.559AspArg: 3.559 ± 3.432
3.559AspSer: 3.559 ± 0.641
0.0AspThr: 0.0 ± 0.0
1.779AspVal: 1.779 ± 1.075
0.0AspTrp: 0.0 ± 0.0
3.559AspTyr: 3.559 ± 2.15
0.0AspXaa: 0.0 ± 0.0
Glu
5.338GluAla: 5.338 ± 0.435
0.0GluCys: 0.0 ± 0.0
3.559GluAsp: 3.559 ± 2.15
5.338GluGlu: 5.338 ± 3.226
0.0GluPhe: 0.0 ± 0.0
3.559GluGly: 3.559 ± 2.15
1.779GluHis: 1.779 ± 1.075
1.779GluIle: 1.779 ± 1.716
3.559GluLys: 3.559 ± 2.15
3.559GluLeu: 3.559 ± 0.641
0.0GluMet: 0.0 ± 0.0
1.779GluAsn: 1.779 ± 1.716
0.0GluPro: 0.0 ± 0.0
1.779GluGln: 1.779 ± 1.075
3.559GluArg: 3.559 ± 2.15
1.779GluSer: 1.779 ± 1.075
5.338GluThr: 5.338 ± 0.435
1.779GluVal: 1.779 ± 1.075
0.0GluTrp: 0.0 ± 0.0
1.779GluTyr: 1.779 ± 1.075
0.0GluXaa: 0.0 ± 0.0
Phe
3.559PheAla: 3.559 ± 2.15
0.0PheCys: 0.0 ± 0.0
3.559PheAsp: 3.559 ± 0.641
0.0PheGlu: 0.0 ± 0.0
1.779PhePhe: 1.779 ± 1.075
1.779PheGly: 1.779 ± 1.075
0.0PheHis: 0.0 ± 0.0
3.559PheIle: 3.559 ± 2.15
1.779PheLys: 1.779 ± 1.075
1.779PheLeu: 1.779 ± 1.075
1.779PheMet: 1.779 ± 1.075
3.559PheAsn: 3.559 ± 0.641
0.0PhePro: 0.0 ± 0.0
0.0PheGln: 0.0 ± 0.0
0.0PheArg: 0.0 ± 0.0
0.0PheSer: 0.0 ± 0.0
1.779PheThr: 1.779 ± 1.075
3.559PheVal: 3.559 ± 2.15
0.0PheTrp: 0.0 ± 0.0
1.779PheTyr: 1.779 ± 1.716
0.0PheXaa: 0.0 ± 0.0
Gly
3.559GlyAla: 3.559 ± 3.432
0.0GlyCys: 0.0 ± 0.0
1.779GlyAsp: 1.779 ± 1.716
5.338GlyGlu: 5.338 ± 3.226
1.779GlyPhe: 1.779 ± 1.716
1.779GlyGly: 1.779 ± 1.716
0.0GlyHis: 0.0 ± 0.0
7.117GlyIle: 7.117 ± 1.51
7.117GlyLys: 7.117 ± 1.51
1.779GlyLeu: 1.779 ± 1.716
3.559GlyMet: 3.559 ± 0.641
3.559GlyAsn: 3.559 ± 0.641
0.0GlyPro: 0.0 ± 0.0
7.117GlyGln: 7.117 ± 4.072
1.779GlyArg: 1.779 ± 1.075
3.559GlySer: 3.559 ± 0.641
7.117GlyThr: 7.117 ± 1.51
5.338GlyVal: 5.338 ± 3.226
0.0GlyTrp: 0.0 ± 0.0
1.779GlyTyr: 1.779 ± 1.075
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.779HisPhe: 1.779 ± 1.075
1.779HisGly: 1.779 ± 1.075
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
1.779HisLeu: 1.779 ± 1.075
1.779HisMet: 1.779 ± 1.075
3.559HisAsn: 3.559 ± 0.641
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
1.779HisArg: 1.779 ± 1.075
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
1.779HisVal: 1.779 ± 1.075
1.779HisTrp: 1.779 ± 1.075
1.779HisTyr: 1.779 ± 1.716
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
1.779IleCys: 1.779 ± 1.075
7.117IleAsp: 7.117 ± 4.301
0.0IleGlu: 0.0 ± 0.0
1.779IlePhe: 1.779 ± 1.716
7.117IleGly: 7.117 ± 1.281
1.779IleHis: 1.779 ± 1.075
5.338IleIle: 5.338 ± 0.435
7.117IleLys: 7.117 ± 1.281
7.117IleLeu: 7.117 ± 1.51
1.779IleMet: 1.779 ± 1.716
3.559IleAsn: 3.559 ± 3.432
1.779IlePro: 1.779 ± 1.716
7.117IleGln: 7.117 ± 1.51
1.779IleArg: 1.779 ± 1.716
1.779IleSer: 1.779 ± 1.716
1.779IleThr: 1.779 ± 1.075
8.897IleVal: 8.897 ± 2.585
5.338IleTrp: 5.338 ± 3.226
5.338IleTyr: 5.338 ± 0.435
0.0IleXaa: 0.0 ± 0.0
Lys
5.338LysAla: 5.338 ± 3.226
0.0LysCys: 0.0 ± 0.0
3.559LysAsp: 3.559 ± 0.641
3.559LysGlu: 3.559 ± 0.641
1.779LysPhe: 1.779 ± 1.075
5.338LysGly: 5.338 ± 3.226
3.559LysHis: 3.559 ± 0.641
5.338LysIle: 5.338 ± 0.435
12.456LysLys: 12.456 ± 4.735
8.897LysLeu: 8.897 ± 2.997
0.0LysMet: 0.0 ± 0.0
3.559LysAsn: 3.559 ± 0.641
3.559LysPro: 3.559 ± 2.15
3.559LysGln: 3.559 ± 0.641
3.559LysArg: 3.559 ± 2.15
8.897LysSer: 8.897 ± 2.997
3.559LysThr: 3.559 ± 2.15
1.779LysVal: 1.779 ± 1.716
1.779LysTrp: 1.779 ± 1.075
7.117LysTyr: 7.117 ± 1.51
0.0LysXaa: 0.0 ± 0.0
Leu
3.559LeuAla: 3.559 ± 2.15
1.779LeuCys: 1.779 ± 1.075
5.338LeuAsp: 5.338 ± 0.435
0.0LeuGlu: 0.0 ± 0.0
1.779LeuPhe: 1.779 ± 1.075
3.559LeuGly: 3.559 ± 0.641
0.0LeuHis: 0.0 ± 0.0
3.559LeuIle: 3.559 ± 2.15
1.779LeuLys: 1.779 ± 1.075
3.559LeuLeu: 3.559 ± 2.15
3.559LeuMet: 3.559 ± 0.641
3.559LeuAsn: 3.559 ± 0.641
5.338LeuPro: 5.338 ± 3.226
3.559LeuGln: 3.559 ± 2.15
1.779LeuArg: 1.779 ± 1.075
1.779LeuSer: 1.779 ± 1.716
5.338LeuThr: 5.338 ± 2.357
5.338LeuVal: 5.338 ± 2.357
0.0LeuTrp: 0.0 ± 0.0
8.897LeuTyr: 8.897 ± 5.788
0.0LeuXaa: 0.0 ± 0.0
Met
3.559MetAla: 3.559 ± 0.641
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
3.559MetGlu: 3.559 ± 0.641
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
5.338MetIle: 5.338 ± 5.148
5.338MetLys: 5.338 ± 0.435
1.779MetLeu: 1.779 ± 1.075
0.0MetMet: 0.0 ± 0.0
1.779MetAsn: 1.779 ± 1.716
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
1.779MetArg: 1.779 ± 1.075
1.779MetSer: 1.779 ± 1.075
1.779MetThr: 1.779 ± 1.075
0.0MetVal: 0.0 ± 0.0
3.559MetTrp: 3.559 ± 3.432
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.779AsnAla: 1.779 ± 1.716
0.0AsnCys: 0.0 ± 0.0
0.0AsnAsp: 0.0 ± 0.0
3.559AsnGlu: 3.559 ± 0.641
1.779AsnPhe: 1.779 ± 1.075
3.559AsnGly: 3.559 ± 0.641
1.779AsnHis: 1.779 ± 1.075
3.559AsnIle: 3.559 ± 0.641
1.779AsnLys: 1.779 ± 1.716
3.559AsnLeu: 3.559 ± 3.432
1.779AsnMet: 1.779 ± 1.075
8.897AsnAsn: 8.897 ± 5.788
3.559AsnPro: 3.559 ± 3.432
1.779AsnGln: 1.779 ± 1.075
5.338AsnArg: 5.338 ± 2.357
1.779AsnSer: 1.779 ± 1.075
5.338AsnThr: 5.338 ± 5.148
3.559AsnVal: 3.559 ± 0.641
3.559AsnTrp: 3.559 ± 0.641
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
0.0ProAla: 0.0 ± 0.0
0.0ProCys: 0.0 ± 0.0
1.779ProAsp: 1.779 ± 1.075
0.0ProGlu: 0.0 ± 0.0
0.0ProPhe: 0.0 ± 0.0
0.0ProGly: 0.0 ± 0.0
5.338ProHis: 5.338 ± 3.226
3.559ProIle: 3.559 ± 2.15
3.559ProLys: 3.559 ± 0.641
1.779ProLeu: 1.779 ± 1.075
1.779ProMet: 1.779 ± 1.716
3.559ProAsn: 3.559 ± 0.641
1.779ProPro: 1.779 ± 1.075
0.0ProGln: 0.0 ± 0.0
5.338ProArg: 5.338 ± 2.357
3.559ProSer: 3.559 ± 0.641
0.0ProThr: 0.0 ± 0.0
0.0ProVal: 0.0 ± 0.0
3.559ProTrp: 3.559 ± 0.641
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.559GlnAla: 3.559 ± 2.15
1.779GlnCys: 1.779 ± 1.075
7.117GlnAsp: 7.117 ± 4.072
5.338GlnGlu: 5.338 ± 3.226
1.779GlnPhe: 1.779 ± 1.075
5.338GlnGly: 5.338 ± 0.435
0.0GlnHis: 0.0 ± 0.0
0.0GlnIle: 0.0 ± 0.0
8.897GlnLys: 8.897 ± 2.585
3.559GlnLeu: 3.559 ± 3.432
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
0.0GlnPro: 0.0 ± 0.0
1.779GlnGln: 1.779 ± 1.075
1.779GlnArg: 1.779 ± 1.716
1.779GlnSer: 1.779 ± 1.075
0.0GlnThr: 0.0 ± 0.0
1.779GlnVal: 1.779 ± 1.075
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.559ArgAla: 3.559 ± 3.432
1.779ArgCys: 1.779 ± 1.075
5.338ArgAsp: 5.338 ± 0.435
0.0ArgGlu: 0.0 ± 0.0
1.779ArgPhe: 1.779 ± 1.075
3.559ArgGly: 3.559 ± 3.432
0.0ArgHis: 0.0 ± 0.0
5.338ArgIle: 5.338 ± 2.357
1.779ArgLys: 1.779 ± 1.075
3.559ArgLeu: 3.559 ± 0.641
1.779ArgMet: 1.779 ± 2.635
1.779ArgAsn: 1.779 ± 1.716
7.117ArgPro: 7.117 ± 1.51
1.779ArgGln: 1.779 ± 1.075
1.779ArgArg: 1.779 ± 1.716
3.559ArgSer: 3.559 ± 0.641
1.779ArgThr: 1.779 ± 1.075
5.338ArgVal: 5.338 ± 3.226
0.0ArgTrp: 0.0 ± 0.0
1.779ArgTyr: 1.779 ± 1.075
0.0ArgXaa: 0.0 ± 0.0
Ser
10.676SerAla: 10.676 ± 1.922
1.779SerCys: 1.779 ± 1.716
5.338SerAsp: 5.338 ± 0.435
3.559SerGlu: 3.559 ± 3.432
1.779SerPhe: 1.779 ± 1.075
3.559SerGly: 3.559 ± 3.432
0.0SerHis: 0.0 ± 0.0
1.779SerIle: 1.779 ± 1.075
7.117SerLys: 7.117 ± 1.51
3.559SerLeu: 3.559 ± 2.15
1.779SerMet: 1.779 ± 1.716
1.779SerAsn: 1.779 ± 1.716
0.0SerPro: 0.0 ± 0.0
0.0SerGln: 0.0 ± 0.0
3.559SerArg: 3.559 ± 0.641
5.338SerSer: 5.338 ± 2.357
5.338SerThr: 5.338 ± 0.435
0.0SerVal: 0.0 ± 0.0
1.779SerTrp: 1.779 ± 1.716
1.779SerTyr: 1.779 ± 1.075
0.0SerXaa: 0.0 ± 0.0
Thr
7.117ThrAla: 7.117 ± 1.51
1.779ThrCys: 1.779 ± 1.075
1.779ThrAsp: 1.779 ± 1.716
3.559ThrGlu: 3.559 ± 2.15
1.779ThrPhe: 1.779 ± 1.075
0.0ThrGly: 0.0 ± 0.0
0.0ThrHis: 0.0 ± 0.0
1.779ThrIle: 1.779 ± 1.075
7.117ThrLys: 7.117 ± 1.281
1.779ThrLeu: 1.779 ± 1.716
0.0ThrMet: 0.0 ± 0.0
1.779ThrAsn: 1.779 ± 1.075
1.779ThrPro: 1.779 ± 1.075
1.779ThrGln: 1.779 ± 1.075
0.0ThrArg: 0.0 ± 0.0
1.779ThrSer: 1.779 ± 1.075
1.779ThrThr: 1.779 ± 1.075
3.559ThrVal: 3.559 ± 3.432
0.0ThrTrp: 0.0 ± 0.0
3.559ThrTyr: 3.559 ± 0.641
0.0ThrXaa: 0.0 ± 0.0
Val
5.338ValAla: 5.338 ± 5.148
5.338ValCys: 5.338 ± 3.226
0.0ValAsp: 0.0 ± 0.0
0.0ValGlu: 0.0 ± 0.0
3.559ValPhe: 3.559 ± 2.15
5.338ValGly: 5.338 ± 3.226
0.0ValHis: 0.0 ± 0.0
7.117ValIle: 7.117 ± 4.072
3.559ValLys: 3.559 ± 0.641
1.779ValLeu: 1.779 ± 1.075
3.559ValMet: 3.559 ± 0.641
3.559ValAsn: 3.559 ± 0.641
1.779ValPro: 1.779 ± 1.716
1.779ValGln: 1.779 ± 1.075
5.338ValArg: 5.338 ± 2.357
5.338ValSer: 5.338 ± 2.357
0.0ValThr: 0.0 ± 0.0
5.338ValVal: 5.338 ± 0.435
0.0ValTrp: 0.0 ± 0.0
3.559ValTyr: 3.559 ± 2.15
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.779TrpAsp: 1.779 ± 1.716
1.779TrpGlu: 1.779 ± 1.075
0.0TrpPhe: 0.0 ± 0.0
3.559TrpGly: 3.559 ± 0.641
0.0TrpHis: 0.0 ± 0.0
1.779TrpIle: 1.779 ± 1.075
3.559TrpLys: 3.559 ± 3.432
0.0TrpLeu: 0.0 ± 0.0
1.779TrpMet: 1.779 ± 1.716
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
3.559TrpGln: 3.559 ± 2.15
0.0TrpArg: 0.0 ± 0.0
3.559TrpSer: 3.559 ± 2.15
0.0TrpThr: 0.0 ± 0.0
1.779TrpVal: 1.779 ± 1.716
1.779TrpTrp: 1.779 ± 1.716
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.559TyrAla: 3.559 ± 0.641
1.779TyrCys: 1.779 ± 1.075
3.559TyrAsp: 3.559 ± 0.641
1.779TyrGlu: 1.779 ± 1.075
0.0TyrPhe: 0.0 ± 0.0
1.779TyrGly: 1.779 ± 1.716
1.779TyrHis: 1.779 ± 1.716
7.117TyrIle: 7.117 ± 1.51
3.559TyrLys: 3.559 ± 2.15
3.559TyrLeu: 3.559 ± 2.15
1.779TyrMet: 1.779 ± 0.782
3.559TyrAsn: 3.559 ± 0.641
3.559TyrPro: 3.559 ± 0.641
0.0TyrGln: 0.0 ± 0.0
3.559TyrArg: 3.559 ± 0.641
3.559TyrSer: 3.559 ± 2.15
0.0TyrThr: 0.0 ± 0.0
0.0TyrVal: 0.0 ± 0.0
1.779TyrTrp: 1.779 ± 1.716
1.779TyrTyr: 1.779 ± 1.716
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (563 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski