Amino acid dipepetide frequency for Odonata-associated circular virus-13

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.626AlaAla: 1.626 ± 1.444
6.504AlaCys: 6.504 ± 0.605
0.0AlaAsp: 0.0 ± 0.0
9.756AlaGlu: 9.756 ± 4.264
1.626AlaPhe: 1.626 ± 1.142
4.878AlaGly: 4.878 ± 0.839
1.626AlaHis: 1.626 ± 1.142
1.626AlaIle: 1.626 ± 1.142
1.626AlaLys: 1.626 ± 1.142
4.878AlaLeu: 4.878 ± 3.425
3.252AlaMet: 3.252 ± 1.141
0.0AlaAsn: 0.0 ± 0.0
3.252AlaPro: 3.252 ± 2.283
0.0AlaGln: 0.0 ± 0.0
4.878AlaArg: 4.878 ± 1.747
0.0AlaSer: 0.0 ± 0.0
3.252AlaThr: 3.252 ± 2.889
4.878AlaVal: 4.878 ± 0.839
1.626AlaTrp: 1.626 ± 1.444
1.626AlaTyr: 1.626 ± 1.444
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
3.252CysAsp: 3.252 ± 0.303
1.626CysGlu: 1.626 ± 1.444
1.626CysPhe: 1.626 ± 1.142
1.626CysGly: 1.626 ± 1.142
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
4.878CysLys: 4.878 ± 3.425
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
4.878CysPro: 4.878 ± 1.747
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.626CysSer: 1.626 ± 1.444
0.0CysThr: 0.0 ± 0.0
1.626CysVal: 1.626 ± 1.142
0.0CysTrp: 0.0 ± 0.0
1.626CysTyr: 1.626 ± 1.142
0.0CysXaa: 0.0 ± 0.0
Asp
0.0AspAla: 0.0 ± 0.0
1.626AspCys: 1.626 ± 1.444
1.626AspAsp: 1.626 ± 1.444
1.626AspGlu: 1.626 ± 1.142
0.0AspPhe: 0.0 ± 0.0
8.13AspGly: 8.13 ± 3.122
0.0AspHis: 0.0 ± 0.0
3.252AspIle: 3.252 ± 0.303
3.252AspLys: 3.252 ± 0.303
4.878AspLeu: 4.878 ± 1.747
4.878AspMet: 4.878 ± 0.839
4.878AspAsn: 4.878 ± 4.333
6.504AspPro: 6.504 ± 0.605
1.626AspGln: 1.626 ± 1.142
3.252AspArg: 3.252 ± 2.283
3.252AspSer: 3.252 ± 0.303
1.626AspThr: 1.626 ± 1.444
6.504AspVal: 6.504 ± 1.98
0.0AspTrp: 0.0 ± 0.0
3.252AspTyr: 3.252 ± 2.283
0.0AspXaa: 0.0 ± 0.0
Glu
8.13GluAla: 8.13 ± 0.536
1.626GluCys: 1.626 ± 1.142
3.252GluAsp: 3.252 ± 2.283
3.252GluGlu: 3.252 ± 2.283
6.504GluPhe: 6.504 ± 1.98
1.626GluGly: 1.626 ± 1.142
0.0GluHis: 0.0 ± 0.0
4.878GluIle: 4.878 ± 3.425
3.252GluLys: 3.252 ± 2.889
0.0GluLeu: 0.0 ± 0.0
0.0GluMet: 0.0 ± 0.0
1.626GluAsn: 1.626 ± 1.142
1.626GluPro: 1.626 ± 1.142
1.626GluGln: 1.626 ± 1.142
1.626GluArg: 1.626 ± 1.444
4.878GluSer: 4.878 ± 3.425
1.626GluThr: 1.626 ± 1.142
0.0GluVal: 0.0 ± 0.0
0.0GluTrp: 0.0 ± 0.0
1.626GluTyr: 1.626 ± 1.142
0.0GluXaa: 0.0 ± 0.0
Phe
3.252PheAla: 3.252 ± 2.283
0.0PheCys: 0.0 ± 0.0
3.252PheAsp: 3.252 ± 2.283
1.626PheGlu: 1.626 ± 1.444
1.626PhePhe: 1.626 ± 1.142
3.252PheGly: 3.252 ± 2.283
0.0PheHis: 0.0 ± 0.0
3.252PheIle: 3.252 ± 0.303
6.504PheLys: 6.504 ± 0.605
1.626PheLeu: 1.626 ± 1.142
1.626PheMet: 1.626 ± 1.142
8.13PheAsn: 8.13 ± 2.05
3.252PhePro: 3.252 ± 2.889
4.878PheGln: 4.878 ± 1.747
4.878PheArg: 4.878 ± 1.747
3.252PheSer: 3.252 ± 2.283
4.878PheThr: 4.878 ± 0.839
3.252PheVal: 3.252 ± 0.303
0.0PheTrp: 0.0 ± 0.0
3.252PheTyr: 3.252 ± 2.889
0.0PheXaa: 0.0 ± 0.0
Gly
0.0GlyAla: 0.0 ± 0.0
0.0GlyCys: 0.0 ± 0.0
4.878GlyAsp: 4.878 ± 0.839
3.252GlyGlu: 3.252 ± 2.283
3.252GlyPhe: 3.252 ± 0.303
3.252GlyGly: 3.252 ± 2.283
0.0GlyHis: 0.0 ± 0.0
3.252GlyIle: 3.252 ± 0.303
6.504GlyLys: 6.504 ± 3.191
1.626GlyLeu: 1.626 ± 1.142
0.0GlyMet: 0.0 ± 0.0
0.0GlyAsn: 0.0 ± 0.0
4.878GlyPro: 4.878 ± 3.425
0.0GlyGln: 0.0 ± 0.0
6.504GlyArg: 6.504 ± 0.605
11.382GlySer: 11.382 ± 2.819
6.504GlyThr: 6.504 ± 4.566
3.252GlyVal: 3.252 ± 0.303
0.0GlyTrp: 0.0 ± 0.0
3.252GlyTyr: 3.252 ± 0.303
0.0GlyXaa: 0.0 ± 0.0
His
1.626HisAla: 1.626 ± 1.142
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
3.252HisLeu: 3.252 ± 2.283
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
3.252HisVal: 3.252 ± 2.283
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
0.0IleCys: 0.0 ± 0.0
1.626IleAsp: 1.626 ± 1.444
0.0IleGlu: 0.0 ± 0.0
3.252IlePhe: 3.252 ± 0.303
1.626IleGly: 1.626 ± 1.444
0.0IleHis: 0.0 ± 0.0
0.0IleIle: 0.0 ± 0.0
8.13IleLys: 8.13 ± 3.122
1.626IleLeu: 1.626 ± 1.444
0.0IleMet: 0.0 ± 0.0
1.626IleAsn: 1.626 ± 1.444
3.252IlePro: 3.252 ± 0.303
3.252IleGln: 3.252 ± 2.283
3.252IleArg: 3.252 ± 2.283
0.0IleSer: 0.0 ± 0.0
1.626IleThr: 1.626 ± 1.444
1.626IleVal: 1.626 ± 1.142
3.252IleTrp: 3.252 ± 2.283
3.252IleTyr: 3.252 ± 2.283
0.0IleXaa: 0.0 ± 0.0
Lys
6.504LysAla: 6.504 ± 0.605
0.0LysCys: 0.0 ± 0.0
4.878LysAsp: 4.878 ± 3.425
3.252LysGlu: 3.252 ± 2.283
4.878LysPhe: 4.878 ± 4.333
6.504LysGly: 6.504 ± 1.98
0.0LysHis: 0.0 ± 0.0
1.626LysIle: 1.626 ± 1.142
14.634LysLys: 14.634 ± 0.069
6.504LysLeu: 6.504 ± 1.98
4.878LysMet: 4.878 ± 1.747
6.504LysAsn: 6.504 ± 1.98
0.0LysPro: 0.0 ± 0.0
1.626LysGln: 1.626 ± 1.444
6.504LysArg: 6.504 ± 0.605
3.252LysSer: 3.252 ± 0.303
3.252LysThr: 3.252 ± 0.303
3.252LysVal: 3.252 ± 2.889
1.626LysTrp: 1.626 ± 1.142
6.504LysTyr: 6.504 ± 1.98
0.0LysXaa: 0.0 ± 0.0
Leu
1.626LeuAla: 1.626 ± 1.444
0.0LeuCys: 0.0 ± 0.0
4.878LeuAsp: 4.878 ± 4.333
4.878LeuGlu: 4.878 ± 3.425
4.878LeuPhe: 4.878 ± 4.333
1.626LeuGly: 1.626 ± 1.444
0.0LeuHis: 0.0 ± 0.0
1.626LeuIle: 1.626 ± 1.142
1.626LeuLys: 1.626 ± 1.142
0.0LeuLeu: 0.0 ± 0.0
3.252LeuMet: 3.252 ± 1.013
4.878LeuAsn: 4.878 ± 1.747
1.626LeuPro: 1.626 ± 1.444
6.504LeuGln: 6.504 ± 4.566
8.13LeuArg: 8.13 ± 2.05
4.878LeuSer: 4.878 ± 0.839
3.252LeuThr: 3.252 ± 0.303
1.626LeuVal: 1.626 ± 1.444
1.626LeuTrp: 1.626 ± 1.444
4.878LeuTyr: 4.878 ± 3.425
0.0LeuXaa: 0.0 ± 0.0
Met
4.878MetAla: 4.878 ± 1.747
1.626MetCys: 1.626 ± 1.142
4.878MetAsp: 4.878 ± 1.747
1.626MetGlu: 1.626 ± 1.444
1.626MetPhe: 1.626 ± 1.444
4.878MetGly: 4.878 ± 0.839
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
4.878MetLeu: 4.878 ± 0.839
0.0MetMet: 0.0 ± 0.0
1.626MetAsn: 1.626 ± 1.444
0.0MetPro: 0.0 ± 0.0
1.626MetGln: 1.626 ± 1.142
1.626MetArg: 1.626 ± 1.444
1.626MetSer: 1.626 ± 1.444
0.0MetThr: 0.0 ± 0.0
3.252MetVal: 3.252 ± 2.283
0.0MetTrp: 0.0 ± 0.0
1.626MetTyr: 1.626 ± 1.444
0.0MetXaa: 0.0 ± 0.0
Asn
6.504AsnAla: 6.504 ± 0.605
0.0AsnCys: 0.0 ± 0.0
4.878AsnAsp: 4.878 ± 1.747
3.252AsnGlu: 3.252 ± 2.889
1.626AsnPhe: 1.626 ± 1.444
4.878AsnGly: 4.878 ± 1.747
1.626AsnHis: 1.626 ± 1.142
1.626AsnIle: 1.626 ± 1.444
1.626AsnLys: 1.626 ± 1.142
0.0AsnLeu: 0.0 ± 0.0
0.0AsnMet: 0.0 ± 0.0
3.252AsnAsn: 3.252 ± 2.889
1.626AsnPro: 1.626 ± 1.444
1.626AsnGln: 1.626 ± 1.142
3.252AsnArg: 3.252 ± 0.303
6.504AsnSer: 6.504 ± 5.777
0.0AsnThr: 0.0 ± 0.0
1.626AsnVal: 1.626 ± 1.142
1.626AsnTrp: 1.626 ± 1.444
1.626AsnTyr: 1.626 ± 1.142
0.0AsnXaa: 0.0 ± 0.0
Pro
0.0ProAla: 0.0 ± 0.0
0.0ProCys: 0.0 ± 0.0
6.504ProAsp: 6.504 ± 0.605
1.626ProGlu: 1.626 ± 1.142
3.252ProPhe: 3.252 ± 0.303
1.626ProGly: 1.626 ± 1.142
0.0ProHis: 0.0 ± 0.0
0.0ProIle: 0.0 ± 0.0
8.13ProLys: 8.13 ± 3.122
1.626ProLeu: 1.626 ± 1.444
0.0ProMet: 0.0 ± 0.0
1.626ProAsn: 1.626 ± 1.444
1.626ProPro: 1.626 ± 1.142
1.626ProGln: 1.626 ± 1.444
4.878ProArg: 4.878 ± 3.425
6.504ProSer: 6.504 ± 1.98
6.504ProThr: 6.504 ± 3.191
3.252ProVal: 3.252 ± 2.283
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.252GlnAla: 3.252 ± 2.283
0.0GlnCys: 0.0 ± 0.0
1.626GlnAsp: 1.626 ± 1.444
6.504GlnGlu: 6.504 ± 4.566
0.0GlnPhe: 0.0 ± 0.0
1.626GlnGly: 1.626 ± 1.142
1.626GlnHis: 1.626 ± 1.142
0.0GlnIle: 0.0 ± 0.0
1.626GlnLys: 1.626 ± 1.444
6.504GlnLeu: 6.504 ± 3.191
0.0GlnMet: 0.0 ± 0.0
1.626GlnAsn: 1.626 ± 1.444
1.626GlnPro: 1.626 ± 1.142
0.0GlnGln: 0.0 ± 0.0
0.0GlnArg: 0.0 ± 0.0
0.0GlnSer: 0.0 ± 0.0
0.0GlnThr: 0.0 ± 0.0
1.626GlnVal: 1.626 ± 1.142
0.0GlnTrp: 0.0 ± 0.0
3.252GlnTyr: 3.252 ± 2.283
0.0GlnXaa: 0.0 ± 0.0
Arg
4.878ArgAla: 4.878 ± 0.839
1.626ArgCys: 1.626 ± 1.444
0.0ArgAsp: 0.0 ± 0.0
1.626ArgGlu: 1.626 ± 1.142
6.504ArgPhe: 6.504 ± 1.98
3.252ArgGly: 3.252 ± 2.889
0.0ArgHis: 0.0 ± 0.0
3.252ArgIle: 3.252 ± 2.889
8.13ArgLys: 8.13 ± 0.536
3.252ArgLeu: 3.252 ± 2.889
3.252ArgMet: 3.252 ± 2.889
1.626ArgAsn: 1.626 ± 1.444
3.252ArgPro: 3.252 ± 2.283
1.626ArgGln: 1.626 ± 1.142
8.13ArgArg: 8.13 ± 4.636
4.878ArgSer: 4.878 ± 1.747
4.878ArgThr: 4.878 ± 1.747
0.0ArgVal: 0.0 ± 0.0
0.0ArgTrp: 0.0 ± 0.0
9.756ArgTyr: 9.756 ± 3.494
0.0ArgXaa: 0.0 ± 0.0
Ser
1.626SerAla: 1.626 ± 1.142
1.626SerCys: 1.626 ± 1.142
6.504SerAsp: 6.504 ± 1.98
0.0SerGlu: 0.0 ± 0.0
6.504SerPhe: 6.504 ± 0.605
8.13SerGly: 8.13 ± 3.122
0.0SerHis: 0.0 ± 0.0
6.504SerIle: 6.504 ± 1.98
1.626SerLys: 1.626 ± 1.142
6.504SerLeu: 6.504 ± 5.777
4.878SerMet: 4.878 ± 1.747
6.504SerAsn: 6.504 ± 1.98
0.0SerPro: 0.0 ± 0.0
3.252SerGln: 3.252 ± 0.303
3.252SerArg: 3.252 ± 2.889
14.634SerSer: 14.634 ± 2.655
3.252SerThr: 3.252 ± 0.303
6.504SerVal: 6.504 ± 5.777
0.0SerTrp: 0.0 ± 0.0
1.626SerTyr: 1.626 ± 1.142
0.0SerXaa: 0.0 ± 0.0
Thr
8.13ThrAla: 8.13 ± 3.122
1.626ThrCys: 1.626 ± 1.142
0.0ThrAsp: 0.0 ± 0.0
0.0ThrGlu: 0.0 ± 0.0
3.252ThrPhe: 3.252 ± 2.283
1.626ThrGly: 1.626 ± 1.142
1.626ThrHis: 1.626 ± 1.142
0.0ThrIle: 0.0 ± 0.0
3.252ThrLys: 3.252 ± 2.889
3.252ThrLeu: 3.252 ± 0.303
1.626ThrMet: 1.626 ± 1.142
0.0ThrAsn: 0.0 ± 0.0
3.252ThrPro: 3.252 ± 0.303
0.0ThrGln: 0.0 ± 0.0
1.626ThrArg: 1.626 ± 1.444
4.878ThrSer: 4.878 ± 0.839
1.626ThrThr: 1.626 ± 1.444
3.252ThrVal: 3.252 ± 2.889
1.626ThrTrp: 1.626 ± 1.444
8.13ThrTyr: 8.13 ± 7.222
0.0ThrXaa: 0.0 ± 0.0
Val
1.626ValAla: 1.626 ± 1.142
0.0ValCys: 0.0 ± 0.0
4.878ValAsp: 4.878 ± 0.839
1.626ValGlu: 1.626 ± 1.142
4.878ValPhe: 4.878 ± 0.839
1.626ValGly: 1.626 ± 1.444
1.626ValHis: 1.626 ± 1.142
1.626ValIle: 1.626 ± 1.142
3.252ValLys: 3.252 ± 2.889
3.252ValLeu: 3.252 ± 2.283
3.252ValMet: 3.252 ± 0.303
1.626ValAsn: 1.626 ± 1.444
4.878ValPro: 4.878 ± 0.839
0.0ValGln: 0.0 ± 0.0
0.0ValArg: 0.0 ± 0.0
6.504ValSer: 6.504 ± 3.191
4.878ValThr: 4.878 ± 0.839
6.504ValVal: 6.504 ± 1.98
0.0ValTrp: 0.0 ± 0.0
4.878ValTyr: 4.878 ± 1.747
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
3.252TrpCys: 3.252 ± 0.303
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
1.626TrpPhe: 1.626 ± 1.142
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.626TrpLeu: 1.626 ± 1.142
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
3.252TrpArg: 3.252 ± 2.889
1.626TrpSer: 1.626 ± 1.444
0.0TrpThr: 0.0 ± 0.0
1.626TrpVal: 1.626 ± 1.444
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.878TyrAla: 4.878 ± 1.747
3.252TyrCys: 3.252 ± 2.283
3.252TyrAsp: 3.252 ± 2.283
1.626TyrGlu: 1.626 ± 1.142
4.878TyrPhe: 4.878 ± 0.839
1.626TyrGly: 1.626 ± 1.142
0.0TyrHis: 0.0 ± 0.0
4.878TyrIle: 4.878 ± 0.839
8.13TyrLys: 8.13 ± 3.122
6.504TyrLeu: 6.504 ± 3.191
3.252TyrMet: 3.252 ± 0.303
1.626TyrAsn: 1.626 ± 1.444
3.252TyrPro: 3.252 ± 2.283
1.626TyrGln: 1.626 ± 1.444
4.878TyrArg: 4.878 ± 4.333
3.252TyrSer: 3.252 ± 0.303
1.626TyrThr: 1.626 ± 1.444
0.0TyrVal: 0.0 ± 0.0
1.626TyrTrp: 1.626 ± 1.444
1.626TyrTyr: 1.626 ± 1.444
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (616 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski