Amino acid dipepetide frequency for Odonata-associated circular virus-8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.31AlaAla: 4.31 ± 2.638
1.437AlaCys: 1.437 ± 0.879
1.437AlaAsp: 1.437 ± 1.06
2.874AlaGlu: 2.874 ± 1.758
5.747AlaPhe: 5.747 ± 0.362
2.874AlaGly: 2.874 ± 1.758
0.0AlaHis: 0.0 ± 0.0
1.437AlaIle: 1.437 ± 0.879
5.747AlaLys: 5.747 ± 1.577
1.437AlaLeu: 1.437 ± 0.879
0.0AlaMet: 0.0 ± 0.0
1.437AlaAsn: 1.437 ± 0.879
1.437AlaPro: 1.437 ± 0.879
2.874AlaGln: 2.874 ± 0.181
4.31AlaArg: 4.31 ± 0.698
0.0AlaSer: 0.0 ± 0.0
5.747AlaThr: 5.747 ± 0.362
5.747AlaVal: 5.747 ± 1.577
1.437AlaTrp: 1.437 ± 1.06
4.31AlaTyr: 4.31 ± 1.242
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
2.874CysAsp: 2.874 ± 0.181
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
2.874CysAsn: 2.874 ± 2.121
1.437CysPro: 1.437 ± 0.879
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
1.437CysVal: 1.437 ± 0.879
0.0CysTrp: 0.0 ± 0.0
4.31CysTyr: 4.31 ± 0.698
0.0CysXaa: 0.0 ± 0.0
Asp
7.184AspAla: 7.184 ± 0.517
1.437AspCys: 1.437 ± 1.06
4.31AspAsp: 4.31 ± 3.181
2.874AspGlu: 2.874 ± 2.121
4.31AspPhe: 4.31 ± 1.242
8.621AspGly: 8.621 ± 2.483
2.874AspHis: 2.874 ± 0.181
1.437AspIle: 1.437 ± 1.06
2.874AspLys: 2.874 ± 2.121
2.874AspLeu: 2.874 ± 0.181
1.437AspMet: 1.437 ± 1.06
0.0AspAsn: 0.0 ± 0.0
4.31AspPro: 4.31 ± 3.181
1.437AspGln: 1.437 ± 0.879
1.437AspArg: 1.437 ± 1.06
0.0AspSer: 0.0 ± 0.0
4.31AspThr: 4.31 ± 3.181
4.31AspVal: 4.31 ± 1.242
0.0AspTrp: 0.0 ± 0.0
2.874AspTyr: 2.874 ± 2.121
0.0AspXaa: 0.0 ± 0.0
Glu
4.31GluAla: 4.31 ± 0.698
1.437GluCys: 1.437 ± 0.879
4.31GluAsp: 4.31 ± 3.181
1.437GluGlu: 1.437 ± 1.06
0.0GluPhe: 0.0 ± 0.0
5.747GluGly: 5.747 ± 2.302
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
4.31GluLys: 4.31 ± 3.181
1.437GluLeu: 1.437 ± 1.06
2.874GluMet: 2.874 ± 2.121
2.874GluAsn: 2.874 ± 0.181
4.31GluPro: 4.31 ± 3.181
1.437GluGln: 1.437 ± 0.879
0.0GluArg: 0.0 ± 0.0
4.31GluSer: 4.31 ± 0.698
0.0GluThr: 0.0 ± 0.0
2.874GluVal: 2.874 ± 0.181
0.0GluTrp: 0.0 ± 0.0
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
5.747PheAsp: 5.747 ± 4.242
1.437PheGlu: 1.437 ± 1.06
2.874PhePhe: 2.874 ± 2.121
2.874PheGly: 2.874 ± 0.181
0.0PheHis: 0.0 ± 0.0
2.874PheIle: 2.874 ± 0.181
7.184PheLys: 7.184 ± 1.423
1.437PheLeu: 1.437 ± 1.06
1.437PheMet: 1.437 ± 0.879
1.437PheAsn: 1.437 ± 1.06
2.874PhePro: 2.874 ± 1.758
0.0PheGln: 0.0 ± 0.0
4.31PheArg: 4.31 ± 1.242
4.31PheSer: 4.31 ± 0.698
4.31PheThr: 4.31 ± 0.698
2.874PheVal: 2.874 ± 0.181
1.437PheTrp: 1.437 ± 1.06
1.437PheTyr: 1.437 ± 1.06
0.0PheXaa: 0.0 ± 0.0
Gly
1.437GlyAla: 1.437 ± 0.879
0.0GlyCys: 0.0 ± 0.0
4.31GlyAsp: 4.31 ± 3.181
2.874GlyGlu: 2.874 ± 0.181
2.874GlyPhe: 2.874 ± 0.181
1.437GlyGly: 1.437 ± 0.879
1.437GlyHis: 1.437 ± 0.879
1.437GlyIle: 1.437 ± 1.06
11.494GlyLys: 11.494 ± 2.664
8.621GlyLeu: 8.621 ± 5.275
0.0GlyMet: 0.0 ± 0.0
4.31GlyAsn: 4.31 ± 0.698
0.0GlyPro: 0.0 ± 0.0
2.874GlyGln: 2.874 ± 0.181
4.31GlyArg: 4.31 ± 0.698
2.874GlySer: 2.874 ± 1.758
12.931GlyThr: 12.931 ± 4.034
0.0GlyVal: 0.0 ± 0.0
0.0GlyTrp: 0.0 ± 0.0
2.874GlyTyr: 2.874 ± 0.181
0.0GlyXaa: 0.0 ± 0.0
His
2.874HisAla: 2.874 ± 1.758
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
2.874HisGly: 2.874 ± 2.121
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
2.874HisLys: 2.874 ± 0.181
2.874HisLeu: 2.874 ± 0.181
1.437HisMet: 1.437 ± 1.06
2.874HisAsn: 2.874 ± 2.121
1.437HisPro: 1.437 ± 0.879
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.437HisSer: 1.437 ± 0.879
0.0HisThr: 0.0 ± 0.0
1.437HisVal: 1.437 ± 1.06
0.0HisTrp: 0.0 ± 0.0
1.437HisTyr: 1.437 ± 0.879
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
1.437IleCys: 1.437 ± 1.06
0.0IleAsp: 0.0 ± 0.0
2.874IleGlu: 2.874 ± 0.181
2.874IlePhe: 2.874 ± 0.181
1.437IleGly: 1.437 ± 1.06
2.874IleHis: 2.874 ± 0.181
5.747IleIle: 5.747 ± 2.302
2.874IleLys: 2.874 ± 0.181
1.437IleLeu: 1.437 ± 0.879
1.437IleMet: 1.437 ± 0.879
2.874IleAsn: 2.874 ± 0.181
4.31IlePro: 4.31 ± 1.242
0.0IleGln: 0.0 ± 0.0
4.31IleArg: 4.31 ± 2.638
4.31IleSer: 4.31 ± 2.638
1.437IleThr: 1.437 ± 0.879
2.874IleVal: 2.874 ± 0.181
1.437IleTrp: 1.437 ± 1.06
2.874IleTyr: 2.874 ± 0.181
0.0IleXaa: 0.0 ± 0.0
Lys
2.874LysAla: 2.874 ± 0.181
4.31LysCys: 4.31 ± 1.242
5.747LysAsp: 5.747 ± 0.362
2.874LysGlu: 2.874 ± 2.121
4.31LysPhe: 4.31 ± 0.698
2.874LysGly: 2.874 ± 0.181
0.0LysHis: 0.0 ± 0.0
1.437LysIle: 1.437 ± 0.879
11.494LysLys: 11.494 ± 4.604
7.184LysLeu: 7.184 ± 1.423
4.31LysMet: 4.31 ± 1.242
2.874LysAsn: 2.874 ± 2.121
0.0LysPro: 0.0 ± 0.0
7.184LysGln: 7.184 ± 1.423
8.621LysArg: 8.621 ± 1.396
7.184LysSer: 7.184 ± 0.517
4.31LysThr: 4.31 ± 0.698
2.874LysVal: 2.874 ± 1.758
0.0LysTrp: 0.0 ± 0.0
5.747LysTyr: 5.747 ± 2.302
0.0LysXaa: 0.0 ± 0.0
Leu
7.184LeuAla: 7.184 ± 2.457
1.437LeuCys: 1.437 ± 0.879
2.874LeuAsp: 2.874 ± 1.758
4.31LeuGlu: 4.31 ± 0.698
0.0LeuPhe: 0.0 ± 0.0
5.747LeuGly: 5.747 ± 0.362
2.874LeuHis: 2.874 ± 0.181
4.31LeuIle: 4.31 ± 1.242
5.747LeuLys: 5.747 ± 1.577
1.437LeuLeu: 1.437 ± 0.879
2.874LeuMet: 2.874 ± 0.181
1.437LeuAsn: 1.437 ± 1.06
4.31LeuPro: 4.31 ± 1.242
1.437LeuGln: 1.437 ± 0.879
0.0LeuArg: 0.0 ± 0.0
1.437LeuSer: 1.437 ± 0.879
5.747LeuThr: 5.747 ± 1.577
2.874LeuVal: 2.874 ± 0.181
2.874LeuTrp: 2.874 ± 1.758
0.0LeuTyr: 0.0 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.874MetAla: 2.874 ± 0.181
0.0MetCys: 0.0 ± 0.0
1.437MetAsp: 1.437 ± 1.06
1.437MetGlu: 1.437 ± 1.06
0.0MetPhe: 0.0 ± 0.0
1.437MetGly: 1.437 ± 1.06
0.0MetHis: 0.0 ± 0.0
2.874MetIle: 2.874 ± 2.121
4.31MetLys: 4.31 ± 1.242
1.437MetLeu: 1.437 ± 1.06
0.0MetMet: 0.0 ± 0.0
1.437MetAsn: 1.437 ± 1.06
0.0MetPro: 0.0 ± 0.0
1.437MetGln: 1.437 ± 1.06
2.874MetArg: 2.874 ± 1.758
4.31MetSer: 4.31 ± 2.638
2.874MetThr: 2.874 ± 0.181
1.437MetVal: 1.437 ± 0.879
0.0MetTrp: 0.0 ± 0.0
1.437MetTyr: 1.437 ± 0.879
0.0MetXaa: 0.0 ± 0.0
Asn
1.437AsnAla: 1.437 ± 0.879
1.437AsnCys: 1.437 ± 0.879
2.874AsnAsp: 2.874 ± 0.181
2.874AsnGlu: 2.874 ± 2.121
7.184AsnPhe: 7.184 ± 5.302
2.874AsnGly: 2.874 ± 1.758
2.874AsnHis: 2.874 ± 2.121
4.31AsnIle: 4.31 ± 0.698
1.437AsnLys: 1.437 ± 0.879
4.31AsnLeu: 4.31 ± 1.242
0.0AsnMet: 0.0 ± 0.0
2.874AsnAsn: 2.874 ± 1.758
4.31AsnPro: 4.31 ± 1.242
1.437AsnGln: 1.437 ± 0.879
2.874AsnArg: 2.874 ± 0.181
2.874AsnSer: 2.874 ± 1.758
2.874AsnThr: 2.874 ± 0.181
2.874AsnVal: 2.874 ± 0.181
1.437AsnTrp: 1.437 ± 0.879
1.437AsnTyr: 1.437 ± 0.879
0.0AsnXaa: 0.0 ± 0.0
Pro
4.31ProAla: 4.31 ± 0.698
0.0ProCys: 0.0 ± 0.0
2.874ProAsp: 2.874 ± 2.121
2.874ProGlu: 2.874 ± 2.121
2.874ProPhe: 2.874 ± 2.121
2.874ProGly: 2.874 ± 1.758
1.437ProHis: 1.437 ± 1.06
0.0ProIle: 0.0 ± 0.0
4.31ProLys: 4.31 ± 1.242
1.437ProLeu: 1.437 ± 0.879
1.437ProMet: 1.437 ± 0.685
2.874ProAsn: 2.874 ± 2.121
0.0ProPro: 0.0 ± 0.0
2.874ProGln: 2.874 ± 0.181
4.31ProArg: 4.31 ± 0.698
4.31ProSer: 4.31 ± 0.698
2.874ProThr: 2.874 ± 0.181
5.747ProVal: 5.747 ± 1.577
0.0ProTrp: 0.0 ± 0.0
1.437ProTyr: 1.437 ± 1.06
0.0ProXaa: 0.0 ± 0.0
Gln
1.437GlnAla: 1.437 ± 1.06
0.0GlnCys: 0.0 ± 0.0
1.437GlnAsp: 1.437 ± 1.06
2.874GlnGlu: 2.874 ± 2.121
0.0GlnPhe: 0.0 ± 0.0
2.874GlnGly: 2.874 ± 1.758
0.0GlnHis: 0.0 ± 0.0
4.31GlnIle: 4.31 ± 2.638
1.437GlnLys: 1.437 ± 0.879
1.437GlnLeu: 1.437 ± 1.06
0.0GlnMet: 0.0 ± 0.0
2.874GlnAsn: 2.874 ± 1.758
1.437GlnPro: 1.437 ± 0.879
0.0GlnGln: 0.0 ± 0.0
1.437GlnArg: 1.437 ± 1.06
2.874GlnSer: 2.874 ± 1.758
2.874GlnThr: 2.874 ± 0.181
1.437GlnVal: 1.437 ± 0.879
1.437GlnTrp: 1.437 ± 1.06
1.437GlnTyr: 1.437 ± 0.879
0.0GlnXaa: 0.0 ± 0.0
Arg
2.874ArgAla: 2.874 ± 0.181
0.0ArgCys: 0.0 ± 0.0
2.874ArgAsp: 2.874 ± 2.121
0.0ArgGlu: 0.0 ± 0.0
4.31ArgPhe: 4.31 ± 0.698
0.0ArgGly: 0.0 ± 0.0
1.437ArgHis: 1.437 ± 0.879
2.874ArgIle: 2.874 ± 0.181
5.747ArgLys: 5.747 ± 1.577
0.0ArgLeu: 0.0 ± 0.0
2.874ArgMet: 2.874 ± 1.338
2.874ArgAsn: 2.874 ± 1.758
1.437ArgPro: 1.437 ± 0.879
2.874ArgGln: 2.874 ± 0.181
2.874ArgArg: 2.874 ± 0.181
8.621ArgSer: 8.621 ± 3.336
8.621ArgThr: 8.621 ± 1.396
4.31ArgVal: 4.31 ± 1.242
2.874ArgTrp: 2.874 ± 0.181
1.437ArgTyr: 1.437 ± 1.06
0.0ArgXaa: 0.0 ± 0.0
Ser
1.437SerAla: 1.437 ± 0.879
0.0SerCys: 0.0 ± 0.0
2.874SerAsp: 2.874 ± 0.181
1.437SerGlu: 1.437 ± 0.879
4.31SerPhe: 4.31 ± 0.698
8.621SerGly: 8.621 ± 5.275
0.0SerHis: 0.0 ± 0.0
2.874SerIle: 2.874 ± 1.758
2.874SerLys: 2.874 ± 0.181
2.874SerLeu: 2.874 ± 1.758
4.31SerMet: 4.31 ± 2.638
7.184SerAsn: 7.184 ± 2.457
1.437SerPro: 1.437 ± 1.06
0.0SerGln: 0.0 ± 0.0
5.747SerArg: 5.747 ± 0.362
4.31SerSer: 4.31 ± 2.638
7.184SerThr: 7.184 ± 4.396
2.874SerVal: 2.874 ± 1.758
1.437SerTrp: 1.437 ± 0.879
2.874SerTyr: 2.874 ± 1.758
0.0SerXaa: 0.0 ± 0.0
Thr
2.874ThrAla: 2.874 ± 1.758
0.0ThrCys: 0.0 ± 0.0
5.747ThrAsp: 5.747 ± 4.242
1.437ThrGlu: 1.437 ± 1.06
2.874ThrPhe: 2.874 ± 0.181
10.057ThrGly: 10.057 ± 2.275
0.0ThrHis: 0.0 ± 0.0
7.184ThrIle: 7.184 ± 2.457
5.747ThrLys: 5.747 ± 0.362
8.621ThrLeu: 8.621 ± 3.336
1.437ThrMet: 1.437 ± 1.06
2.874ThrAsn: 2.874 ± 1.758
5.747ThrPro: 5.747 ± 0.362
1.437ThrGln: 1.437 ± 0.879
5.747ThrArg: 5.747 ± 1.577
5.747ThrSer: 5.747 ± 3.517
8.621ThrThr: 8.621 ± 0.543
5.747ThrVal: 5.747 ± 0.362
2.874ThrTrp: 2.874 ± 0.181
4.31ThrTyr: 4.31 ± 0.698
0.0ThrXaa: 0.0 ± 0.0
Val
2.874ValAla: 2.874 ± 0.181
0.0ValCys: 0.0 ± 0.0
4.31ValAsp: 4.31 ± 0.698
4.31ValGlu: 4.31 ± 1.242
2.874ValPhe: 2.874 ± 2.121
2.874ValGly: 2.874 ± 1.758
2.874ValHis: 2.874 ± 1.758
2.874ValIle: 2.874 ± 0.181
2.874ValLys: 2.874 ± 0.181
4.31ValLeu: 4.31 ± 0.698
2.874ValMet: 2.874 ± 2.121
0.0ValAsn: 0.0 ± 0.0
4.31ValPro: 4.31 ± 2.638
2.874ValGln: 2.874 ± 1.758
1.437ValArg: 1.437 ± 1.06
4.31ValSer: 4.31 ± 0.698
7.184ValThr: 7.184 ± 2.457
4.31ValVal: 4.31 ± 0.698
1.437ValTrp: 1.437 ± 0.879
1.437ValTyr: 1.437 ± 1.06
0.0ValXaa: 0.0 ± 0.0
Trp
1.437TrpAla: 1.437 ± 1.06
0.0TrpCys: 0.0 ± 0.0
1.437TrpAsp: 1.437 ± 1.06
1.437TrpGlu: 1.437 ± 0.879
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
1.437TrpHis: 1.437 ± 1.06
1.437TrpIle: 1.437 ± 1.06
1.437TrpLys: 1.437 ± 1.06
1.437TrpLeu: 1.437 ± 0.879
0.0TrpMet: 0.0 ± 0.0
1.437TrpAsn: 1.437 ± 1.06
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
4.31TrpArg: 4.31 ± 2.638
0.0TrpSer: 0.0 ± 0.0
1.437TrpThr: 1.437 ± 0.879
1.437TrpVal: 1.437 ± 0.879
0.0TrpTrp: 0.0 ± 0.0
1.437TrpTyr: 1.437 ± 1.06
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.874TyrAla: 2.874 ± 0.181
0.0TyrCys: 0.0 ± 0.0
1.437TyrAsp: 1.437 ± 0.879
1.437TyrGlu: 1.437 ± 1.06
1.437TyrPhe: 1.437 ± 0.879
0.0TyrGly: 0.0 ± 0.0
1.437TyrHis: 1.437 ± 1.06
0.0TyrIle: 0.0 ± 0.0
1.437TyrLys: 1.437 ± 0.879
4.31TyrLeu: 4.31 ± 0.698
1.437TyrMet: 1.437 ± 1.06
7.184TyrAsn: 7.184 ± 0.517
5.747TyrPro: 5.747 ± 2.302
1.437TyrGln: 1.437 ± 0.879
1.437TyrArg: 1.437 ± 1.06
1.437TyrSer: 1.437 ± 0.879
5.747TyrThr: 5.747 ± 2.302
2.874TyrVal: 2.874 ± 0.181
1.437TyrTrp: 1.437 ± 1.06
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (697 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski