Amino acid dipeptide frequency for Lake Sarah-associated circular virus-51

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
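As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping adjacent residue pairs and expresses each count in per mille of all pairs. It is not the database's own code: the function name `dipeptide_permille`, the use of one-letter codes, the mapping of non-standard residues to X (Xaa), and the choice not to count pairs across protein boundaries are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: pairs are MK, KK, KL and AK, KL, so KL is 2 of 5 pairs.
freqs = dipeptide_permille(["MKKL", "AKL"])
print(freqs["KL"])  # 400.0
```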

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.

All values are presented in per mille (‰); multiply them by 10⁻³ to obtain fractions.
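The expected value and the over/under labelling described above can be sketched as follows. The function names are hypothetical, and the rule of comparing each value directly against the uniform expectation (with no tolerance band) is an assumption; the page only states that overrepresented dipeptides are shown in red and underrepresented ones in blue.

```python
def expected_permille(n_letters=20):
    """Uniform expectation in per mille: 1000 / n_letters**2 (2.5 for 20 letters)."""
    return 1000.0 / (n_letters ** 2)

def classify(value_permille, n_letters=20):
    """Label a table value relative to the uniform expectation."""
    expected = expected_permille(n_letters)
    if value_permille > expected:
        return "over"    # shown in red on the page
    if value_permille < expected:
        return "under"   # shown in blue on the page
    return "expected"

def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10**-3)."""
    return value_permille * 1e-3

print(expected_permille())   # 2.5
print(classify(11.236))      # over  (e.g. AlaSer)
print(classify(1.605))       # under (e.g. AlaAla)
```

With Xaa counted as a 21st letter, `expected_permille(21)` gives 1000/441 instead.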

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.605 ± 1.038
AlaCys: 0.0 ± 0.0
AlaAsp: 3.21 ± 2.076
AlaGlu: 1.605 ± 1.327
AlaPhe: 3.21 ± 2.653
AlaGly: 4.815 ± 1.615
AlaHis: 0.0 ± 0.0
AlaIle: 3.21 ± 0.289
AlaLys: 0.0 ± 0.0
AlaLeu: 1.605 ± 1.327
AlaMet: 0.0 ± 0.0
AlaAsn: 3.21 ± 2.653
AlaPro: 1.605 ± 1.327
AlaGln: 3.21 ± 2.076
AlaArg: 6.421 ± 4.153
AlaSer: 11.236 ± 0.173
AlaThr: 3.21 ± 2.076
AlaVal: 1.605 ± 1.327
AlaTrp: 0.0 ± 0.0
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 1.605 ± 1.038
CysHis: 1.605 ± 1.038
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 1.605 ± 1.038
CysMet: 0.0 ± 0.0
CysAsn: 4.815 ± 3.115
CysPro: 1.605 ± 1.327
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 1.605 ± 1.327
CysThr: 1.605 ± 1.038
CysVal: 0.0 ± 0.0
CysTrp: 1.605 ± 1.327
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.605 ± 1.038
AspCys: 0.0 ± 0.0
AspAsp: 6.421 ± 1.788
AspGlu: 3.21 ± 2.076
AspPhe: 3.21 ± 2.076
AspGly: 8.026 ± 2.826
AspHis: 1.605 ± 1.327
AspIle: 3.21 ± 2.076
AspLys: 4.815 ± 0.75
AspLeu: 1.605 ± 1.038
AspMet: 0.0 ± 0.0
AspAsn: 6.421 ± 5.307
AspPro: 4.815 ± 0.75
AspGln: 0.0 ± 0.0
AspArg: 1.605 ± 1.327
AspSer: 3.21 ± 0.289
AspThr: 8.026 ± 0.461
AspVal: 3.21 ± 2.653
AspTrp: 3.21 ± 0.289
AspTyr: 3.21 ± 2.076
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.605 ± 1.038
GluCys: 0.0 ± 0.0
GluAsp: 1.605 ± 1.038
GluGlu: 3.21 ± 2.076
GluPhe: 1.605 ± 1.038
GluGly: 3.21 ± 2.653
GluHis: 0.0 ± 0.0
GluIle: 1.605 ± 1.038
GluLys: 0.0 ± 0.0
GluLeu: 4.815 ± 3.115
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 0.0 ± 0.0
GluGln: 3.21 ± 0.289
GluArg: 0.0 ± 0.0
GluSer: 1.605 ± 1.327
GluThr: 1.605 ± 1.038
GluVal: 3.21 ± 0.289
GluTrp: 4.815 ± 3.115
GluTyr: 11.236 ± 7.268
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.21 ± 2.076
PheCys: 0.0 ± 0.0
PheAsp: 4.815 ± 3.98
PheGlu: 0.0 ± 0.0
PhePhe: 0.0 ± 0.0
PheGly: 3.21 ± 0.289
PheHis: 0.0 ± 0.0
PheIle: 1.605 ± 1.038
PheLys: 3.21 ± 2.653
PheLeu: 1.605 ± 1.038
PheMet: 0.0 ± 0.0
PheAsn: 6.421 ± 1.788
PhePro: 0.0 ± 0.0
PheGln: 0.0 ± 0.0
PheArg: 8.026 ± 2.826
PheSer: 1.605 ± 1.038
PheThr: 3.21 ± 2.076
PheVal: 3.21 ± 2.653
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.21 ± 2.076
GlyCys: 1.605 ± 1.038
GlyAsp: 4.815 ± 0.75
GlyGlu: 0.0 ± 0.0
GlyPhe: 0.0 ± 0.0
GlyGly: 8.026 ± 0.461
GlyHis: 1.605 ± 1.038
GlyIle: 4.815 ± 1.615
GlyLys: 3.21 ± 2.076
GlyLeu: 1.605 ± 1.038
GlyMet: 1.605 ± 0.983
GlyAsn: 6.421 ± 5.307
GlyPro: 3.21 ± 2.076
GlyGln: 6.421 ± 2.942
GlyArg: 9.631 ± 0.866
GlySer: 3.21 ± 2.653
GlyThr: 3.21 ± 2.653
GlyVal: 4.815 ± 1.615
GlyTrp: 1.605 ± 1.038
GlyTyr: 4.815 ± 1.615
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.605 ± 1.038
HisCys: 0.0 ± 0.0
HisAsp: 1.605 ± 1.038
HisGlu: 1.605 ± 1.038
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 1.605 ± 1.038
HisIle: 1.605 ± 1.038
HisLys: 1.605 ± 1.038
HisLeu: 3.21 ± 2.076
HisMet: 0.0 ± 0.0
HisAsn: 1.605 ± 1.327
HisPro: 3.21 ± 0.289
HisGln: 1.605 ± 1.038
HisArg: 0.0 ± 0.0
HisSer: 1.605 ± 1.038
HisThr: 3.21 ± 0.289
HisVal: 1.605 ± 1.327
HisTrp: 0.0 ± 0.0
HisTyr: 1.605 ± 1.038
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.605 ± 1.327
IleCys: 1.605 ± 1.038
IleAsp: 6.421 ± 0.577
IleGlu: 0.0 ± 0.0
IlePhe: 0.0 ± 0.0
IleGly: 4.815 ± 0.75
IleHis: 0.0 ± 0.0
IleIle: 1.605 ± 1.038
IleLys: 4.815 ± 1.615
IleLeu: 3.21 ± 2.076
IleMet: 1.605 ± 1.038
IleAsn: 1.605 ± 1.038
IlePro: 3.21 ± 2.076
IleGln: 0.0 ± 0.0
IleArg: 3.21 ± 2.653
IleSer: 4.815 ± 1.615
IleThr: 8.026 ± 0.461
IleVal: 3.21 ± 0.289
IleTrp: 1.605 ± 1.327
IleTyr: 1.605 ± 1.038
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.0 ± 0.0
LysCys: 0.0 ± 0.0
LysAsp: 6.421 ± 0.577
LysGlu: 1.605 ± 1.038
LysPhe: 8.026 ± 4.269
LysGly: 6.421 ± 2.942
LysHis: 0.0 ± 0.0
LysIle: 0.0 ± 0.0
LysLys: 1.605 ± 1.327
LysLeu: 8.026 ± 0.461
LysMet: 1.605 ± 1.038
LysAsn: 0.0 ± 0.0
LysPro: 4.815 ± 3.115
LysGln: 0.0 ± 0.0
LysArg: 6.421 ± 2.942
LysSer: 1.605 ± 1.038
LysThr: 1.605 ± 1.038
LysVal: 3.21 ± 2.653
LysTrp: 0.0 ± 0.0
LysTyr: 3.21 ± 2.076
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.815 ± 3.98
LeuCys: 1.605 ± 1.038
LeuAsp: 0.0 ± 0.0
LeuGlu: 4.815 ± 3.115
LeuPhe: 4.815 ± 0.75
LeuGly: 1.605 ± 1.038
LeuHis: 6.421 ± 1.788
LeuIle: 6.421 ± 0.577
LeuLys: 4.815 ± 0.75
LeuLeu: 3.21 ± 0.289
LeuMet: 1.605 ± 1.327
LeuAsn: 4.815 ± 3.115
LeuPro: 3.21 ± 0.289
LeuGln: 3.21 ± 2.076
LeuArg: 3.21 ± 2.653
LeuSer: 0.0 ± 0.0
LeuThr: 3.21 ± 0.289
LeuVal: 4.815 ± 1.615
LeuTrp: 3.21 ± 2.076
LeuTyr: 6.421 ± 2.942
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 0.0 ± 0.0
MetAsp: 3.21 ± 2.076
MetGlu: 1.605 ± 1.038
MetPhe: 0.0 ± 0.0
MetGly: 1.605 ± 1.327
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.605 ± 1.327
MetLeu: 3.21 ± 2.653
MetMet: 0.0 ± 0.0
MetAsn: 3.21 ± 0.289
MetPro: 1.605 ± 1.327
MetGln: 1.605 ± 1.038
MetArg: 0.0 ± 0.0
MetSer: 0.0 ± 0.0
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.815 ± 0.75
AsnCys: 0.0 ± 0.0
AsnAsp: 4.815 ± 0.75
AsnGlu: 3.21 ± 2.653
AsnPhe: 0.0 ± 0.0
AsnGly: 1.605 ± 1.038
AsnHis: 0.0 ± 0.0
AsnIle: 4.815 ± 1.615
AsnLys: 1.605 ± 1.038
AsnLeu: 8.026 ± 4.269
AsnMet: 1.605 ± 1.038
AsnAsn: 0.0 ± 0.0
AsnPro: 1.605 ± 1.327
AsnGln: 1.605 ± 1.038
AsnArg: 0.0 ± 0.0
AsnSer: 3.21 ± 0.289
AsnThr: 3.21 ± 0.289
AsnVal: 4.815 ± 3.115
AsnTrp: 1.605 ± 1.327
AsnTyr: 3.21 ± 2.653
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.0 ± 0.0
ProCys: 0.0 ± 0.0
ProAsp: 1.605 ± 1.327
ProGlu: 4.815 ± 3.115
ProPhe: 0.0 ± 0.0
ProGly: 0.0 ± 0.0
ProHis: 1.605 ± 1.038
ProIle: 6.421 ± 0.577
ProLys: 1.605 ± 1.038
ProLeu: 1.605 ± 1.038
ProMet: 4.815 ± 3.36
ProAsn: 1.605 ± 1.327
ProPro: 1.605 ± 1.327
ProGln: 0.0 ± 0.0
ProArg: 8.026 ± 2.826
ProSer: 6.421 ± 4.153
ProThr: 0.0 ± 0.0
ProVal: 3.21 ± 2.076
ProTrp: 1.605 ± 1.327
ProTyr: 3.21 ± 0.289
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.0 ± 0.0
GlnCys: 3.21 ± 2.076
GlnAsp: 1.605 ± 1.038
GlnGlu: 3.21 ± 0.289
GlnPhe: 0.0 ± 0.0
GlnGly: 1.605 ± 1.327
GlnHis: 0.0 ± 0.0
GlnIle: 1.605 ± 1.038
GlnLys: 4.815 ± 0.75
GlnLeu: 1.605 ± 1.327
GlnMet: 0.0 ± 0.0
GlnAsn: 1.605 ± 1.038
GlnPro: 1.605 ± 1.038
GlnGln: 0.0 ± 0.0
GlnArg: 6.421 ± 2.942
GlnSer: 1.605 ± 1.327
GlnThr: 0.0 ± 0.0
GlnVal: 0.0 ± 0.0
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.21 ± 2.653
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.21 ± 0.289
ArgCys: 3.21 ± 0.289
ArgAsp: 6.421 ± 4.153
ArgGlu: 3.21 ± 2.076
ArgPhe: 0.0 ± 0.0
ArgGly: 11.236 ± 2.192
ArgHis: 3.21 ± 2.076
ArgIle: 8.026 ± 0.461
ArgLys: 4.815 ± 3.98
ArgLeu: 4.815 ± 0.75
ArgMet: 0.0 ± 0.0
ArgAsn: 3.21 ± 2.076
ArgPro: 1.605 ± 1.038
ArgGln: 1.605 ± 1.327
ArgArg: 9.631 ± 3.231
ArgSer: 6.421 ± 2.942
ArgThr: 6.421 ± 0.577
ArgVal: 4.815 ± 1.615
ArgTrp: 0.0 ± 0.0
ArgTyr: 6.421 ± 2.942
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.421 ± 2.942
SerCys: 0.0 ± 0.0
SerAsp: 4.815 ± 0.75
SerGlu: 0.0 ± 0.0
SerPhe: 1.605 ± 1.327
SerGly: 1.605 ± 1.327
SerHis: 1.605 ± 1.327
SerIle: 3.21 ± 0.289
SerLys: 8.026 ± 4.269
SerLeu: 4.815 ± 1.615
SerMet: 0.0 ± 0.0
SerAsn: 1.605 ± 1.327
SerPro: 1.605 ± 1.038
SerGln: 3.21 ± 2.653
SerArg: 9.631 ± 6.229
SerSer: 3.21 ± 2.076
SerThr: 3.21 ± 0.289
SerVal: 1.605 ± 1.327
SerTrp: 3.21 ± 2.076
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.815 ± 0.75
ThrCys: 1.605 ± 1.327
ThrAsp: 1.605 ± 1.327
ThrGlu: 4.815 ± 0.75
ThrPhe: 8.026 ± 2.826
ThrGly: 1.605 ± 1.038
ThrHis: 3.21 ± 0.289
ThrIle: 0.0 ± 0.0
ThrLys: 3.21 ± 2.076
ThrLeu: 1.605 ± 1.038
ThrMet: 0.0 ± 0.0
ThrAsn: 1.605 ± 1.327
ThrPro: 3.21 ± 0.289
ThrGln: 1.605 ± 1.038
ThrArg: 1.605 ± 1.327
ThrSer: 1.605 ± 1.327
ThrThr: 3.21 ± 2.076
ThrVal: 8.026 ± 0.461
ThrTrp: 3.21 ± 0.289
ThrTyr: 4.815 ± 3.115
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.815 ± 1.615
ValCys: 0.0 ± 0.0
ValAsp: 3.21 ± 2.653
ValGlu: 3.21 ± 2.076
ValPhe: 0.0 ± 0.0
ValGly: 6.421 ± 2.942
ValHis: 1.605 ± 1.038
ValIle: 0.0 ± 0.0
ValLys: 3.21 ± 0.289
ValLeu: 4.815 ± 1.615
ValMet: 0.0 ± 0.0
ValAsn: 0.0 ± 0.0
ValPro: 8.026 ± 2.826
ValGln: 1.605 ± 1.327
ValArg: 8.026 ± 1.904
ValSer: 3.21 ± 2.653
ValThr: 3.21 ± 0.289
ValVal: 0.0 ± 0.0
ValTrp: 0.0 ± 0.0
ValTyr: 4.815 ± 3.98
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.21 ± 2.076
TrpCys: 1.605 ± 1.327
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.605 ± 1.038
TrpPhe: 1.605 ± 1.038
TrpGly: 1.605 ± 1.038
TrpHis: 0.0 ± 0.0
TrpIle: 1.605 ± 1.327
TrpLys: 1.605 ± 1.038
TrpLeu: 4.815 ± 0.75
TrpMet: 3.21 ± 0.289
TrpAsn: 1.605 ± 1.038
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.605 ± 1.327
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 1.605 ± 1.327
TrpTrp: 0.0 ± 0.0
TrpTyr: 3.21 ± 0.289
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.21 ± 2.653
TyrCys: 1.605 ± 1.038
TyrAsp: 4.815 ± 0.75
TyrGlu: 1.605 ± 1.038
TyrPhe: 8.026 ± 2.826
TyrGly: 6.421 ± 0.577
TyrHis: 3.21 ± 2.076
TyrIle: 3.21 ± 0.289
TyrLys: 0.0 ± 0.0
TyrLeu: 6.421 ± 0.577
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 1.605 ± 1.327
TyrGln: 3.21 ± 2.653
TyrArg: 4.815 ± 1.615
TyrSer: 3.21 ± 0.289
TyrThr: 3.21 ± 2.076
TyrVal: 3.21 ± 0.289
TyrTrp: 3.21 ± 0.289
TyrTyr: 1.605 ± 1.327
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (624 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
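One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the replicate values is reported. This is only an illustration under stated assumptions. The function names, the fixed random seed, and the use of the sample standard deviation as the error are choices made for the example, not details confirmed by the page.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement; return the spread of the estimate."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Each replicate draws as many proteins as the original set contains.
        sample = [rng.choice(proteins) for _ in proteins]
        values.append(pair_permille(sample, pair))
    # Assumption: the error is the sample standard deviation of the replicates.
    return statistics.stdev(values)

# Hypothetical toy sequences, not the virus proteome.
toy = ["AKAKGG", "GGGGAK"]
print(pair_permille(toy, "AK"), "±", bootstrap_error(toy, "AK"))
```

A dipeptide that never occurs has frequency 0 in every resample, which is consistent with the `0.0 ± 0.0` entries in the table.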

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski