Amino acid dipeptide frequency for Odonata-associated circular virus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
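A minimal sketch of how such frequencies could be computed is shown below. It assumes overlapping windows of two residues, pairs that never span two proteins, and non-standard residues mapped to X (Xaa); the function name and the toy sequences are illustrative only, and the database's actual procedure may differ.

```python
from collections import Counter

# The 20 standard residues (one-letter codes); anything else becomes "X" (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Normalise: upper-case and replace non-standard residues with X.
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2, counted within each protein only.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences (hypothetical, not the real proteome).
freqs = dipeptide_frequencies(["MKAAG", "AAB"])
print(round(freqs["AA"], 3))  # 333.333 (2 of 6 pairs)
```

The one-letter pairs map directly onto the three-letter labels used on this page (for example, "AA" corresponds to AlaAla).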

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
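The arithmetic behind the expected value can be sketched as follows. The uniform expectation is simply one over the number of possible dipeptides; the `classify` helper and its use of that uniform value as the colouring threshold are assumptions made for illustration, not a description of how the page itself assigns colours.

```python
N_STANDARD = 20   # standard amino acids
N_WITH_XAA = 21   # standard amino acids plus Xaa


def expected_per_mille(alphabet_size):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected=expected_per_mille(N_STANDARD)):
    """Label a value relative to the (assumed) uniform expectation."""
    if observed_per_mille > expected:
        return "over"    # would correspond to red
    if observed_per_mille < expected:
        return "under"   # would correspond to blue
    return "expected"


print(expected_per_mille(N_STANDARD))  # 2.5
print(classify(12.103))                # over  (GlyLeu value from the table)
print(classify(1.513))                 # under
```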

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
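As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the input is the GlyLeu value from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


glyleu = 12.103  # GlyLeu, in per mille
print(round(per_mille_to_fraction(glyleu), 6))  # 0.012103
print(round(per_mille_to_percent(glyleu), 4))   # 1.2103
```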

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell shows the frequency ± error in ‰.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.026 ± 1.678 | 0.0 ± 0.0 | 1.513 ± 0.839 | 4.539 ± 1.301 | 3.026 ± 0.231 | 6.051 ± 0.462 | 4.539 ± 0.608 | 4.539 ± 3.21 | 3.026 ± 0.231 | 6.051 ± 0.462 | 1.513 ± 1.07 | 3.026 ± 0.231 | 3.026 ± 0.231 | 3.026 ± 1.678 | 3.026 ± 1.678 | 4.539 ± 2.517 | 1.513 ± 1.07 | 4.539 ± 0.608 | 1.513 ± 1.07 | 1.513 ± 1.07 | 0.0 ± 0.0
Cys | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.513 ± 1.07 | 1.513 ± 1.07 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.026 ± 0.231 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.513 ± 1.07 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.513 ± 0.839 | 1.513 ± 1.07 | 1.513 ± 1.07 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 9.077 ± 3.125 | 1.513 ± 1.07 | 0.0 ± 0.0 | 1.513 ± 1.07 | 0.0 ± 0.0 | 4.539 ± 0.608 | 0.0 ± 0.0 | 4.539 ± 1.301 | 3.026 ± 0.231 | 4.539 ± 2.517 | 1.513 ± 1.07 | 1.513 ± 0.839 | 3.026 ± 1.678 | 0.0 ± 0.0 | 7.564 ± 3.441 | 6.051 ± 1.447 | 6.051 ± 1.447 | 3.026 ± 2.14 | 0.0 ± 0.0 | 4.539 ± 3.21 | 0.0 ± 0.0
Glu | 1.513 ± 1.07 | 0.0 ± 0.0 | 4.539 ± 1.301 | 3.026 ± 2.14 | 1.513 ± 0.839 | 1.513 ± 1.07 | 3.026 ± 2.14 | 3.026 ± 0.231 | 4.539 ± 3.21 | 1.513 ± 1.07 | 0.0 ± 0.0 | 1.513 ± 0.839 | 3.026 ± 2.14 | 4.539 ± 0.608 | 3.026 ± 0.231 | 4.539 ± 2.517 | 4.539 ± 3.21 | 4.539 ± 0.608 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Phe | 1.513 ± 0.839 | 1.513 ± 1.07 | 4.539 ± 0.608 | 1.513 ± 1.07 | 4.539 ± 2.517 | 3.026 ± 2.14 | 0.0 ± 0.0 | 1.513 ± 1.07 | 1.513 ± 1.07 | 1.513 ± 0.839 | 3.026 ± 0.231 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.539 ± 0.608 | 3.026 ± 1.678 | 1.513 ± 0.839 | 1.513 ± 0.839 | 0.0 ± 0.0 | 1.513 ± 1.07 | 1.513 ± 0.839 | 0.0 ± 0.0
Gly | 7.564 ± 0.377 | 3.026 ± 0.231 | 3.026 ± 0.231 | 7.564 ± 0.377 | 1.513 ± 0.839 | 1.513 ± 0.839 | 3.026 ± 2.14 | 3.026 ± 0.231 | 4.539 ± 1.301 | 12.103 ± 4.803 | 0.0 ± 0.0 | 7.564 ± 1.532 | 4.539 ± 0.608 | 0.0 ± 0.0 | 6.051 ± 0.462 | 3.026 ± 0.231 | 6.051 ± 3.356 | 1.513 ± 0.839 | 1.513 ± 0.839 | 3.026 ± 2.14 | 0.0 ± 0.0
His | 1.513 ± 1.07 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.026 ± 2.14 | 6.051 ± 1.447 | 1.513 ± 1.07 | 0.0 ± 0.0 | 3.026 ± 2.14 | 1.513 ± 1.07 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.026 ± 1.678 | 0.0 ± 0.0 | 1.513 ± 0.839 | 0.0 ± 0.0 | 1.513 ± 0.839 | 0.0 ± 0.0 | 3.026 ± 2.14 | 1.513 ± 0.839 | 0.0 ± 0.0
Ile | 1.513 ± 1.07 | 1.513 ± 1.07 | 3.026 ± 1.678 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.026 ± 0.231 | 1.513 ± 0.839 | 3.026 ± 0.231 | 1.513 ± 1.07 | 3.026 ± 0.231 | 3.026 ± 2.14 | 3.026 ± 1.678 | 1.513 ± 1.07 | 0.0 ± 0.0 | 4.539 ± 3.21 | 3.026 ± 0.231 | 0.0 ± 0.0 | 3.026 ± 2.14 | 1.513 ± 1.07 | 1.513 ± 1.07 | 0.0 ± 0.0
Lys | 6.051 ± 0.462 | 0.0 ± 0.0 | 3.026 ± 2.14 | 1.513 ± 1.07 | 1.513 ± 0.839 | 6.051 ± 0.462 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.513 ± 1.07 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.026 ± 2.14 | 3.026 ± 0.231 | 4.539 ± 1.301 | 1.513 ± 1.07 | 0.0 ± 0.0 | 1.513 ± 1.07 | 3.026 ± 0.231 | 4.539 ± 3.21 | 1.513 ± 0.839 | 0.0 ± 0.0
Leu | 1.513 ± 1.07 | 0.0 ± 0.0 | 7.564 ± 1.532 | 3.026 ± 2.14 | 1.513 ± 0.839 | 6.051 ± 1.447 | 3.026 ± 1.678 | 0.0 ± 0.0 | 1.513 ± 0.839 | 4.539 ± 0.608 | 3.026 ± 0.231 | 7.564 ± 4.195 | 9.077 ± 3.125 | 3.026 ± 1.678 | 3.026 ± 1.678 | 4.539 ± 1.301 | 9.077 ± 1.216 | 4.539 ± 0.608 | 3.026 ± 0.231 | 4.539 ± 2.517 | 0.0 ± 0.0
Met | 3.026 ± 0.231 | 0.0 ± 0.0 | 1.513 ± 0.839 | 1.513 ± 1.07 | 0.0 ± 0.0 | 3.026 ± 1.678 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.026 ± 2.14 | 0.0 ± 0.0 | 0.0 ± 0.699 | 0.0 ± 0.0 | 1.513 ± 1.07 | 1.513 ± 1.07 | 1.513 ± 1.07 | 0.0 ± 0.0 | 6.051 ± 0.462 | 3.026 ± 0.231 | 0.0 ± 0.0 | 1.513 ± 1.07 | 0.0 ± 0.0
Asn | 1.513 ± 1.07 | 0.0 ± 0.0 | 4.539 ± 1.301 | 1.513 ± 1.07 | 1.513 ± 1.07 | 3.026 ± 2.14 | 1.513 ± 0.839 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.539 ± 0.608 | 0.0 ± 0.0 | 1.513 ± 0.839 | 7.564 ± 0.377 | 3.026 ± 1.678 | 0.0 ± 0.0 | 4.539 ± 0.608 | 3.026 ± 0.231 | 0.0 ± 0.0 | 1.513 ± 1.07 | 1.513 ± 0.839 | 0.0 ± 0.0
Pro | 4.539 ± 2.517 | 0.0 ± 0.0 | 3.026 ± 2.14 | 1.513 ± 1.07 | 1.513 ± 0.839 | 1.513 ± 1.07 | 0.0 ± 0.0 | 4.539 ± 0.608 | 1.513 ± 1.07 | 3.026 ± 1.678 | 1.513 ± 0.839 | 1.513 ± 1.07 | 3.026 ± 1.678 | 4.539 ± 2.517 | 7.564 ± 2.286 | 7.564 ± 2.286 | 4.539 ± 0.608 | 1.513 ± 0.839 | 1.513 ± 1.07 | 4.539 ± 0.608 | 0.0 ± 0.0
Gln | 3.026 ± 1.678 | 0.0 ± 0.0 | 1.513 ± 0.839 | 3.026 ± 0.231 | 6.051 ± 1.447 | 1.513 ± 0.839 | 1.513 ± 1.07 | 1.513 ± 1.07 | 1.513 ± 0.839 | 3.026 ± 2.14 | 1.513 ± 1.07 | 3.026 ± 2.14 | 1.513 ± 0.839 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.513 ± 0.839 | 4.539 ± 2.517 | 6.051 ± 3.356 | 0.0 ± 0.0 | 3.026 ± 1.678 | 0.0 ± 0.0
Arg | 6.051 ± 1.447 | 1.513 ± 1.07 | 3.026 ± 0.231 | 3.026 ± 0.231 | 3.026 ± 0.231 | 3.026 ± 1.678 | 1.513 ± 1.07 | 3.026 ± 0.231 | 6.051 ± 0.462 | 9.077 ± 1.216 | 1.513 ± 0.839 | 4.539 ± 1.301 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.513 ± 1.07 | 6.051 ± 0.462 | 1.513 ± 0.839 | 4.539 ± 0.608 | 1.513 ± 1.07 | 4.539 ± 1.301 | 0.0 ± 0.0
Ser | 1.513 ± 0.839 | 0.0 ± 0.0 | 4.539 ± 2.517 | 1.513 ± 1.07 | 3.026 ± 0.231 | 10.59 ± 3.964 | 1.513 ± 1.07 | 1.513 ± 1.07 | 1.513 ± 1.07 | 9.077 ± 5.034 | 4.539 ± 0.95 | 1.513 ± 1.07 | 3.026 ± 1.678 | 6.051 ± 1.447 | 4.539 ± 1.301 | 3.026 ± 1.678 | 3.026 ± 1.678 | 4.539 ± 2.517 | 3.026 ± 0.231 | 6.051 ± 3.356 | 0.0 ± 0.0
Thr | 7.564 ± 5.35 | 0.0 ± 0.0 | 6.051 ± 3.356 | 3.026 ± 0.231 | 3.026 ± 0.231 | 3.026 ± 1.678 | 0.0 ± 0.0 | 3.026 ± 0.231 | 0.0 ± 0.0 | 3.026 ± 0.231 | 3.026 ± 1.678 | 1.513 ± 1.07 | 6.051 ± 3.356 | 3.026 ± 1.678 | 3.026 ± 0.231 | 12.103 ± 2.894 | 4.539 ± 2.517 | 1.513 ± 0.839 | 1.513 ± 1.07 | 1.513 ± 0.839 | 0.0 ± 0.0
Val | 0.0 ± 0.0 | 1.513 ± 0.839 | 1.513 ± 1.07 | 6.051 ± 0.462 | 1.513 ± 1.07 | 9.077 ± 0.693 | 0.0 ± 0.0 | 4.539 ± 1.301 | 1.513 ± 0.839 | 4.539 ± 0.608 | 1.513 ± 1.07 | 0.0 ± 0.0 | 3.026 ± 1.678 | 0.0 ± 0.0 | 4.539 ± 0.608 | 6.051 ± 3.356 | 3.026 ± 1.678 | 3.026 ± 2.14 | 0.0 ± 0.0 | 1.513 ± 0.839 | 0.0 ± 0.0
Trp | 3.026 ± 2.14 | 0.0 ± 0.0 | 6.051 ± 4.28 | 1.513 ± 1.07 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.026 ± 0.231 | 0.0 ± 0.0 | 1.513 ± 1.07 | 1.513 ± 0.839 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.026 ± 2.14 | 3.026 ± 0.231 | 3.026 ± 2.14 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.513 ± 1.07 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 0.0 ± 0.0 | 1.513 ± 1.07 | 3.026 ± 1.678 | 4.539 ± 2.517 | 0.0 ± 0.0 | 6.051 ± 0.462 | 1.513 ± 0.839 | 1.513 ± 1.07 | 1.513 ± 0.839 | 4.539 ± 0.608 | 1.513 ± 1.07 | 0.0 ± 0.0 | 1.513 ± 1.07 | 3.026 ± 0.231 | 6.051 ± 1.447 | 1.513 ± 0.839 | 3.026 ± 0.231 | 3.026 ± 0.231 | 0.0 ± 0.0 | 4.539 ± 0.608 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 2 proteins (662 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
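One way protein-level bootstrapping could work is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the fixed seed, and the use of the standard deviation as the error measure are assumptions; the database's exact procedure is not described here.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Std. dev. (per mille) of one dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy sequences (hypothetical, not the real proteome).
print(bootstrap_error(["AAAA", "AAAA"], "AA"))  # 0.0, identical proteins give no spread
```

With only two proteins, as in this proteome, each resample can take just a few distinct compositions, which may explain why the same error values recur across many cells.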

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski