Amino acid dipeptide frequency for Longjawed orbweaver circular virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent amino acids.
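The counting step can be sketched as follows. This is a minimal illustration, not the database's own pipeline: the function name `dipeptide_frequencies` is hypothetical, sequences are assumed to be given in one-letter code (the table uses three-letter names), and dipeptides are assumed to be counted as overlapping pairs that never span two proteins.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein separately,
        # so no pair is formed across a protein boundary.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale fractions to per mille so they are comparable with the table.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not the virus proteome: 6 dipeptides in total, "KK" occurs twice.
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
```

With this toy input, `freqs["KK"]` is two sixths of 1000‰, and all frequencies sum to 1000‰.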

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
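The expectation above is simple arithmetic, sketched here under stated assumptions. The names `expected_per_mille` and `classify` are hypothetical, and the plain comparison against the uniform expectation is an assumption: the page does not state the exact rule behind the red and blue marking, nor whether the error is taken into account.

```python
N_STANDARD = 20  # standard amino acids; pass 21 to include Xaa


def expected_per_mille(alphabet_size=N_STANDARD):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, alphabet_size=N_STANDARD):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


print(expected_per_mille())    # 400 possible dipeptides
print(expected_per_mille(21))  # 441 possible dipeptides
print(classify(14.901))        # ArgArg value from the table
print(classify(1.656))         # AlaGly value from the table
```

With 20 amino acids the expectation is 2.5‰; with Xaa included it drops to roughly 2.27‰.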

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the "Sequence space" article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error (‰).

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.967 ± 1.084 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.656 ± 1.186 | 1.656 ± 1.289 | 3.311 ± 2.373 | 4.967 ± 1.084 | 3.311 ± 0.102 | 0.0 ± 0.0 | 1.656 ± 1.186 | 1.656 ± 1.186 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.967 ± 1.084 | 4.967 ± 3.559 | 0.0 ± 0.0 | 0.0 ± 0.0 | 6.623 ± 4.746 | 0.0 ± 0.0
Cys | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.311 ± 0.102 | 1.656 ± 1.186 | 0.0 ± 0.0 | 1.656 ± 1.289 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.656 ± 1.289 | 3.311 ± 0.102 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.656 ± 1.289 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 3.311 ± 0.102 | 1.656 ± 1.186 | 3.311 ± 0.102 | 6.623 ± 0.205 | 0.0 ± 0.0 | 3.311 ± 0.102 | 3.311 ± 2.578 | 9.934 ± 0.307 | 3.311 ± 0.102 | 3.311 ± 0.102 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.656 ± 1.186 | 4.967 ± 1.391 | 1.656 ± 1.289 | 3.311 ± 2.373 | 1.656 ± 1.289 | 0.0 ± 0.0 | 1.656 ± 1.289 | 0.0 ± 0.0
Glu | 1.656 ± 1.186 | 3.311 ± 2.578 | 4.967 ± 1.391 | 0.0 ± 0.0 | 4.967 ± 1.391 | 1.656 ± 1.186 | 1.656 ± 1.186 | 3.311 ± 0.102 | 3.311 ± 2.578 | 1.656 ± 1.289 | 0.0 ± 0.0 | 4.967 ± 1.084 | 3.311 ± 2.373 | 0.0 ± 0.0 | 1.656 ± 1.289 | 3.311 ± 0.102 | 1.656 ± 1.289 | 3.311 ± 0.102 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Phe | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.656 ± 1.289 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.311 ± 2.373 | 1.656 ± 1.186 | 1.656 ± 1.289 | 3.311 ± 0.102 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.967 ± 1.391 | 4.967 ± 1.084 | 3.311 ± 0.102 | 1.656 ± 1.186 | 1.656 ± 1.289 | 3.311 ± 0.102 | 3.311 ± 2.578 | 0.0 ± 0.0 | 3.311 ± 0.102 | 0.0 ± 0.0
Gly | 3.311 ± 2.373 | 1.656 ± 1.186 | 0.0 ± 0.0 | 3.311 ± 0.102 | 3.311 ± 2.578 | 3.311 ± 0.102 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.967 ± 1.084 | 1.656 ± 1.289 | 1.656 ± 1.186 | 3.311 ± 2.373 | 8.278 ± 1.494 | 3.311 ± 0.102 | 0.0 ± 0.0 | 4.967 ± 1.084 | 3.311 ± 0.102 | 4.967 ± 1.391 | 1.656 ± 1.289 | 3.311 ± 0.102 | 0.0 ± 0.0
His | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.656 ± 1.186 | 1.656 ± 1.289 | 0.0 ± 0.0 | 4.967 ± 1.391 | 0.0 ± 0.0 | 1.656 ± 1.186 | 0.0 ± 0.0 | 3.311 ± 2.578 | 0.0 ± 0.0 | 1.656 ± 1.289 | 1.656 ± 1.289 | 0.0 ± 0.0 | 3.311 ± 2.373 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.656 ± 1.289 | 0.0 ± 0.0 | 0.0 ± 0.0
Ile | 1.656 ± 1.186 | 1.656 ± 1.186 | 6.623 ± 4.746 | 3.311 ± 2.578 | 1.656 ± 1.289 | 3.311 ± 0.102 | 1.656 ± 1.186 | 1.656 ± 1.186 | 11.589 ± 3.354 | 4.967 ± 3.867 | 0.0 ± 0.0 | 6.623 ± 2.68 | 1.656 ± 1.186 | 1.656 ± 1.289 | 3.311 ± 0.102 | 3.311 ± 0.102 | 6.623 ± 0.205 | 3.311 ± 0.102 | 4.967 ± 3.559 | 3.311 ± 0.102 | 0.0 ± 0.0
Lys | 4.967 ± 3.559 | 0.0 ± 0.0 | 6.623 ± 5.156 | 1.656 ± 1.186 | 1.656 ± 1.186 | 4.967 ± 1.391 | 1.656 ± 1.186 | 4.967 ± 1.391 | 8.278 ± 0.982 | 4.967 ± 1.084 | 3.311 ± 2.005 | 4.967 ± 1.391 | 4.967 ± 1.391 | 4.967 ± 1.084 | 6.623 ± 2.27 | 6.623 ± 2.27 | 8.278 ± 3.457 | 6.623 ± 0.205 | 4.967 ± 1.084 | 9.934 ± 4.643 | 0.0 ± 0.0
Leu | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.967 ± 1.391 | 3.311 ± 2.578 | 1.656 ± 1.289 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 9.934 ± 2.783 | 8.278 ± 0.982 | 0.0 ± 0.0 | 3.311 ± 0.102 | 1.656 ± 1.186 | 3.311 ± 2.578 | 3.311 ± 0.102 | 8.278 ± 1.494 | 4.967 ± 1.391 | 3.311 ± 0.102 | 1.656 ± 1.289 | 3.311 ± 0.102 | 0.0 ± 0.0
Met | 1.656 ± 1.186 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.656 ± 1.289 | 1.656 ± 1.186 | 3.311 ± 0.102 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.656 ± 1.289 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 0.0 ± 0.0 | 1.656 ± 1.289 | 8.278 ± 3.969 | 4.967 ± 1.084 | 6.623 ± 0.205 | 1.656 ± 1.186 | 0.0 ± 0.0 | 4.967 ± 1.084 | 1.656 ± 1.186 | 3.311 ± 2.578 | 0.0 ± 0.0 | 6.623 ± 2.68 | 1.656 ± 1.289 | 3.311 ± 2.578 | 4.967 ± 3.559 | 6.623 ± 0.205 | 3.311 ± 0.102 | 0.0 ± 0.0 | 1.656 ± 1.289 | 6.623 ± 0.205 | 0.0 ± 0.0
Pro | 6.623 ± 2.27 | 0.0 ± 0.0 | 1.656 ± 1.186 | 0.0 ± 0.0 | 1.656 ± 1.186 | 4.967 ± 1.391 | 1.656 ± 1.289 | 3.311 ± 0.102 | 0.0 ± 0.0 | 1.656 ± 1.289 | 0.0 ± 0.0 | 4.967 ± 1.084 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.967 ± 1.084 | 1.656 ± 1.186 | 3.311 ± 0.102 | 3.311 ± 0.102 | 1.656 ± 1.289 | 1.656 ± 1.289 | 0.0 ± 0.0
Gln | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.656 ± 1.186 | 0.0 ± 0.0 | 3.311 ± 0.102 | 0.0 ± 0.0 | 6.623 ± 2.68 | 4.967 ± 1.084 | 0.0 ± 0.0 | 1.656 ± 1.186 | 3.311 ± 2.578 | 1.656 ± 1.289 | 1.656 ± 1.289 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.311 ± 2.578 | 0.0 ± 0.0 | 1.656 ± 1.289 | 0.0 ± 0.0
Arg | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.311 ± 0.102 | 0.0 ± 0.0 | 1.656 ± 1.186 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.967 ± 3.559 | 6.623 ± 2.27 | 3.311 ± 2.578 | 0.0 ± 0.0 | 1.656 ± 1.186 | 1.656 ± 1.186 | 1.656 ± 1.186 | 14.901 ± 8.203 | 1.656 ± 1.289 | 6.623 ± 4.746 | 1.656 ± 1.289 | 0.0 ± 0.0 | 9.934 ± 0.307 | 0.0 ± 0.0
Ser | 4.967 ± 1.084 | 1.656 ± 1.289 | 1.656 ± 1.186 | 8.278 ± 1.494 | 1.656 ± 1.186 | 1.656 ± 1.186 | 1.656 ± 1.186 | 6.623 ± 2.27 | 9.934 ± 0.307 | 6.623 ± 0.205 | 0.0 ± 0.804 | 3.311 ± 2.578 | 0.0 ± 0.0 | 3.311 ± 2.578 | 1.656 ± 1.289 | 4.967 ± 1.391 | 3.311 ± 0.102 | 6.623 ± 0.205 | 6.623 ± 0.205 | 0.0 ± 0.0 | 0.0 ± 0.0
Thr | 1.656 ± 1.186 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.656 ± 1.186 | 3.311 ± 2.373 | 11.589 ± 1.596 | 0.0 ± 0.0 | 9.934 ± 0.307 | 6.623 ± 2.27 | 4.967 ± 1.391 | 0.0 ± 0.0 | 6.623 ± 0.205 | 1.656 ± 1.186 | 0.0 ± 0.0 | 1.656 ± 1.186 | 8.278 ± 1.494 | 1.656 ± 1.186 | 0.0 ± 0.0 | 1.656 ± 1.186 | 6.623 ± 2.27 | 0.0 ± 0.0
Val | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.656 ± 1.186 | 1.656 ± 1.289 | 1.656 ± 1.289 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.311 ± 0.102 | 6.623 ± 2.68 | 6.623 ± 0.205 | 1.656 ± 1.289 | 1.656 ± 1.289 | 4.967 ± 1.084 | 0.0 ± 0.0 | 0.0 ± 0.0 | 6.623 ± 0.205 | 1.656 ± 1.289 | 3.311 ± 2.578 | 1.656 ± 1.186 | 3.311 ± 2.578 | 0.0 ± 0.0
Trp | 1.656 ± 1.289 | 0.0 ± 0.0 | 3.311 ± 0.102 | 3.311 ± 0.102 | 4.967 ± 1.084 | 0.0 ± 0.0 | 1.656 ± 1.289 | 1.656 ± 1.289 | 3.311 ± 2.373 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.311 ± 2.373 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.967 ± 1.391 | 0.0 ± 0.0 | 4.967 ± 1.084 | 1.656 ± 1.289 | 0.0 ± 0.0
Tyr | 3.311 ± 2.373 | 1.656 ± 1.289 | 3.311 ± 0.102 | 3.311 ± 0.102 | 3.311 ± 2.578 | 6.623 ± 0.205 | 3.311 ± 2.578 | 3.311 ± 0.102 | 4.967 ± 3.559 | 1.656 ± 1.186 | 0.0 ± 0.0 | 3.311 ± 0.102 | 3.311 ± 2.578 | 1.656 ± 1.289 | 4.967 ± 3.559 | 6.623 ± 2.27 | 6.623 ± 0.205 | 1.656 ± 1.186 | 0.0 ± 0.0 | 1.656 ± 1.289 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 2 proteins (605 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
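A protein-level bootstrap of this kind might look like the sketch below. It is an illustration under assumptions, not the database's actual procedure: the function names are hypothetical, sequences are in one-letter code, whole proteins are resampled with replacement, and the error is taken as the standard deviation of the resampled frequencies (the page does not say which spread measure is used).

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level resamples (assumed: std dev)."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Toy input, not the virus proteome.
error = bootstrap_error(["MKKLA", "AKKAKK"], "KK")
```

With only two proteins, as in this proteome, each resample can contain only one of three distinct protein combinations, so error estimates from such a procedure are necessarily coarse.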

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski