Amino acid dipepetide frequency for Odonata-associated circular virus-15

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.634AlaAla: 7.634 ± 3.144
0.0AlaCys: 0.0 ± 0.0
0.0AlaAsp: 0.0 ± 0.0
3.053AlaGlu: 3.053 ± 2.165
1.527AlaPhe: 1.527 ± 1.187
4.58AlaGly: 4.58 ± 3.248
0.0AlaHis: 0.0 ± 0.0
7.634AlaIle: 7.634 ± 3.144
7.634AlaLys: 7.634 ± 0.875
7.634AlaLeu: 7.634 ± 3.664
0.0AlaMet: 0.0 ± 0.0
3.053AlaAsn: 3.053 ± 0.104
3.053AlaPro: 3.053 ± 2.165
3.053AlaGln: 3.053 ± 0.104
7.634AlaArg: 7.634 ± 3.144
9.16AlaSer: 9.16 ± 0.312
0.0AlaThr: 0.0 ± 0.0
3.053AlaVal: 3.053 ± 0.104
1.527AlaTrp: 1.527 ± 1.083
1.527AlaTyr: 1.527 ± 1.083
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.527CysArg: 1.527 ± 1.187
1.527CysSer: 1.527 ± 1.187
3.053CysThr: 3.053 ± 2.165
1.527CysVal: 1.527 ± 1.083
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.107AspAla: 6.107 ± 2.061
0.0AspCys: 0.0 ± 0.0
1.527AspAsp: 1.527 ± 1.187
1.527AspGlu: 1.527 ± 1.187
0.0AspPhe: 0.0 ± 0.0
1.527AspGly: 1.527 ± 1.187
0.0AspHis: 0.0 ± 0.0
4.58AspIle: 4.58 ± 0.979
1.527AspLys: 1.527 ± 1.083
6.107AspLeu: 6.107 ± 2.477
4.58AspMet: 4.58 ± 1.291
1.527AspAsn: 1.527 ± 1.187
1.527AspPro: 1.527 ± 1.083
1.527AspGln: 1.527 ± 1.187
1.527AspArg: 1.527 ± 1.187
1.527AspSer: 1.527 ± 1.187
4.58AspThr: 4.58 ± 1.291
3.053AspVal: 3.053 ± 2.165
0.0AspTrp: 0.0 ± 0.0
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
1.527GluAla: 1.527 ± 1.187
0.0GluCys: 0.0 ± 0.0
4.58GluAsp: 4.58 ± 3.56
6.107GluGlu: 6.107 ± 4.746
3.053GluPhe: 3.053 ± 2.373
1.527GluGly: 1.527 ± 1.083
0.0GluHis: 0.0 ± 0.0
3.053GluIle: 3.053 ± 0.104
1.527GluLys: 1.527 ± 1.187
1.527GluLeu: 1.527 ± 1.187
0.0GluMet: 0.0 ± 0.0
3.053GluAsn: 3.053 ± 0.104
3.053GluPro: 3.053 ± 2.373
4.58GluGln: 4.58 ± 3.248
1.527GluArg: 1.527 ± 1.187
0.0GluSer: 0.0 ± 0.0
1.527GluThr: 1.527 ± 1.083
0.0GluVal: 0.0 ± 0.0
1.527GluTrp: 1.527 ± 1.083
1.527GluTyr: 1.527 ± 1.187
0.0GluXaa: 0.0 ± 0.0
Phe
1.527PheAla: 1.527 ± 1.083
0.0PheCys: 0.0 ± 0.0
4.58PheAsp: 4.58 ± 1.291
1.527PheGlu: 1.527 ± 1.187
3.053PhePhe: 3.053 ± 2.373
1.527PheGly: 1.527 ± 1.083
4.58PheHis: 4.58 ± 1.291
1.527PheIle: 1.527 ± 1.083
0.0PheLys: 0.0 ± 0.0
6.107PheLeu: 6.107 ± 4.746
3.053PheMet: 3.053 ± 0.104
0.0PheAsn: 0.0 ± 0.0
0.0PhePro: 0.0 ± 0.0
3.053PheGln: 3.053 ± 0.104
7.634PheArg: 7.634 ± 1.394
1.527PheSer: 1.527 ± 1.083
4.58PheThr: 4.58 ± 1.291
3.053PheVal: 3.053 ± 0.104
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.053GlyAla: 3.053 ± 2.165
0.0GlyCys: 0.0 ± 0.0
1.527GlyAsp: 1.527 ± 1.083
1.527GlyGlu: 1.527 ± 1.187
3.053GlyPhe: 3.053 ± 0.104
10.687GlyGly: 10.687 ± 7.579
0.0GlyHis: 0.0 ± 0.0
3.053GlyIle: 3.053 ± 0.104
1.527GlyLys: 1.527 ± 1.187
3.053GlyLeu: 3.053 ± 2.165
1.527GlyMet: 1.527 ± 1.083
6.107GlyAsn: 6.107 ± 2.061
1.527GlyPro: 1.527 ± 1.187
4.58GlyGln: 4.58 ± 0.979
6.107GlyArg: 6.107 ± 2.061
6.107GlySer: 6.107 ± 4.331
4.58GlyThr: 4.58 ± 0.979
4.58GlyVal: 4.58 ± 3.248
1.527GlyTrp: 1.527 ± 1.083
4.58GlyTyr: 4.58 ± 0.979
0.0GlyXaa: 0.0 ± 0.0
His
1.527HisAla: 1.527 ± 1.187
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.527HisGlu: 1.527 ± 1.187
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.527HisIle: 1.527 ± 1.187
1.527HisLys: 1.527 ± 1.187
4.58HisLeu: 4.58 ± 3.56
0.0HisMet: 0.0 ± 0.0
1.527HisAsn: 1.527 ± 1.083
4.58HisPro: 4.58 ± 3.56
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.527HisTyr: 1.527 ± 1.083
0.0HisXaa: 0.0 ± 0.0
Ile
3.053IleAla: 3.053 ± 2.165
1.527IleCys: 1.527 ± 1.187
1.527IleAsp: 1.527 ± 1.083
3.053IleGlu: 3.053 ± 2.373
7.634IlePhe: 7.634 ± 3.664
10.687IleGly: 10.687 ± 1.498
1.527IleHis: 1.527 ± 1.187
3.053IleIle: 3.053 ± 2.373
3.053IleLys: 3.053 ± 2.165
3.053IleLeu: 3.053 ± 0.104
0.0IleMet: 0.0 ± 0.0
1.527IleAsn: 1.527 ± 1.187
4.58IlePro: 4.58 ± 1.291
7.634IleGln: 7.634 ± 0.875
1.527IleArg: 1.527 ± 1.187
3.053IleSer: 3.053 ± 2.373
6.107IleThr: 6.107 ± 2.061
3.053IleVal: 3.053 ± 0.104
0.0IleTrp: 0.0 ± 0.0
3.053IleTyr: 3.053 ± 2.165
0.0IleXaa: 0.0 ± 0.0
Lys
3.053LysAla: 3.053 ± 2.165
0.0LysCys: 0.0 ± 0.0
1.527LysAsp: 1.527 ± 1.187
1.527LysGlu: 1.527 ± 1.187
1.527LysPhe: 1.527 ± 1.083
3.053LysGly: 3.053 ± 2.373
0.0LysHis: 0.0 ± 0.0
4.58LysIle: 4.58 ± 3.56
1.527LysLys: 1.527 ± 1.187
1.527LysLeu: 1.527 ± 1.083
1.527LysMet: 1.527 ± 1.083
4.58LysAsn: 4.58 ± 3.56
1.527LysPro: 1.527 ± 1.187
3.053LysGln: 3.053 ± 0.104
7.634LysArg: 7.634 ± 0.875
4.58LysSer: 4.58 ± 0.979
7.634LysThr: 7.634 ± 0.875
1.527LysVal: 1.527 ± 1.187
0.0LysTrp: 0.0 ± 0.0
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
3.053LeuAla: 3.053 ± 0.104
0.0LeuCys: 0.0 ± 0.0
6.107LeuAsp: 6.107 ± 0.208
1.527LeuGlu: 1.527 ± 1.187
1.527LeuPhe: 1.527 ± 1.187
3.053LeuGly: 3.053 ± 2.165
1.527LeuHis: 1.527 ± 1.187
3.053LeuIle: 3.053 ± 0.104
7.634LeuLys: 7.634 ± 3.664
0.0LeuLeu: 0.0 ± 0.0
0.0LeuMet: 0.0 ± 0.0
3.053LeuAsn: 3.053 ± 0.104
4.58LeuPro: 4.58 ± 3.56
4.58LeuGln: 4.58 ± 1.291
4.58LeuArg: 4.58 ± 0.979
4.58LeuSer: 4.58 ± 1.291
9.16LeuThr: 9.16 ± 2.581
6.107LeuVal: 6.107 ± 2.477
1.527LeuTrp: 1.527 ± 1.187
3.053LeuTyr: 3.053 ± 0.104
0.0LeuXaa: 0.0 ± 0.0
Met
1.527MetAla: 1.527 ± 1.083
0.0MetCys: 0.0 ± 0.0
1.527MetAsp: 1.527 ± 1.083
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.527MetGly: 1.527 ± 1.083
0.0MetHis: 0.0 ± 0.0
1.527MetIle: 1.527 ± 1.083
1.527MetLys: 1.527 ± 1.083
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
1.527MetAsn: 1.527 ± 1.083
3.053MetPro: 3.053 ± 2.165
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
4.58MetSer: 4.58 ± 1.291
1.527MetThr: 1.527 ± 1.187
1.527MetVal: 1.527 ± 1.083
0.0MetTrp: 0.0 ± 0.0
4.58MetTyr: 4.58 ± 0.979
0.0MetXaa: 0.0 ± 0.0
Asn
6.107AsnAla: 6.107 ± 2.061
0.0AsnCys: 0.0 ± 0.0
1.527AsnAsp: 1.527 ± 1.187
1.527AsnGlu: 1.527 ± 1.083
3.053AsnPhe: 3.053 ± 2.373
9.16AsnGly: 9.16 ± 1.957
0.0AsnHis: 0.0 ± 0.0
6.107AsnIle: 6.107 ± 2.477
1.527AsnLys: 1.527 ± 1.187
9.16AsnLeu: 9.16 ± 2.581
3.053AsnMet: 3.053 ± 2.165
0.0AsnAsn: 0.0 ± 0.0
1.527AsnPro: 1.527 ± 1.083
3.053AsnGln: 3.053 ± 0.104
0.0AsnArg: 0.0 ± 0.0
3.053AsnSer: 3.053 ± 0.104
4.58AsnThr: 4.58 ± 0.979
1.527AsnVal: 1.527 ± 1.187
0.0AsnTrp: 0.0 ± 0.0
6.107AsnTyr: 6.107 ± 0.208
0.0AsnXaa: 0.0 ± 0.0
Pro
6.107ProAla: 6.107 ± 0.208
0.0ProCys: 0.0 ± 0.0
0.0ProAsp: 0.0 ± 0.0
3.053ProGlu: 3.053 ± 2.373
3.053ProPhe: 3.053 ± 0.104
4.58ProGly: 4.58 ± 3.248
0.0ProHis: 0.0 ± 0.0
0.0ProIle: 0.0 ± 0.0
1.527ProLys: 1.527 ± 1.187
3.053ProLeu: 3.053 ± 0.104
0.0ProMet: 0.0 ± 0.0
3.053ProAsn: 3.053 ± 2.373
1.527ProPro: 1.527 ± 1.187
3.053ProGln: 3.053 ± 2.373
3.053ProArg: 3.053 ± 0.104
3.053ProSer: 3.053 ± 2.373
6.107ProThr: 6.107 ± 2.061
1.527ProVal: 1.527 ± 1.083
1.527ProTrp: 1.527 ± 1.083
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
4.58GlnAla: 4.58 ± 3.56
1.527GlnCys: 1.527 ± 1.187
1.527GlnAsp: 1.527 ± 1.187
1.527GlnGlu: 1.527 ± 1.187
1.527GlnPhe: 1.527 ± 1.083
3.053GlnGly: 3.053 ± 2.165
1.527GlnHis: 1.527 ± 1.187
4.58GlnIle: 4.58 ± 1.291
0.0GlnLys: 0.0 ± 0.0
1.527GlnLeu: 1.527 ± 1.187
1.527GlnMet: 1.527 ± 0.702
6.107GlnAsn: 6.107 ± 2.061
1.527GlnPro: 1.527 ± 1.083
4.58GlnGln: 4.58 ± 0.979
3.053GlnArg: 3.053 ± 2.165
0.0GlnSer: 0.0 ± 0.0
1.527GlnThr: 1.527 ± 1.083
4.58GlnVal: 4.58 ± 0.979
0.0GlnTrp: 0.0 ± 0.0
3.053GlnTyr: 3.053 ± 2.165
0.0GlnXaa: 0.0 ± 0.0
Arg
6.107ArgAla: 6.107 ± 4.331
0.0ArgCys: 0.0 ± 0.0
3.053ArgAsp: 3.053 ± 2.165
3.053ArgGlu: 3.053 ± 0.104
4.58ArgPhe: 4.58 ± 3.248
1.527ArgGly: 1.527 ± 1.083
3.053ArgHis: 3.053 ± 2.373
3.053ArgIle: 3.053 ± 2.373
9.16ArgLys: 9.16 ± 1.957
3.053ArgLeu: 3.053 ± 0.104
0.0ArgMet: 0.0 ± 0.0
3.053ArgAsn: 3.053 ± 0.104
0.0ArgPro: 0.0 ± 0.0
0.0ArgGln: 0.0 ± 0.0
10.687ArgArg: 10.687 ± 5.309
1.527ArgSer: 1.527 ± 1.083
3.053ArgThr: 3.053 ± 2.373
6.107ArgVal: 6.107 ± 0.208
0.0ArgTrp: 0.0 ± 0.0
7.634ArgTyr: 7.634 ± 0.875
0.0ArgXaa: 0.0 ± 0.0
Ser
4.58SerAla: 4.58 ± 0.979
1.527SerCys: 1.527 ± 1.083
3.053SerAsp: 3.053 ± 2.165
3.053SerGlu: 3.053 ± 2.165
6.107SerPhe: 6.107 ± 2.477
6.107SerGly: 6.107 ± 2.061
0.0SerHis: 0.0 ± 0.0
4.58SerIle: 4.58 ± 1.291
3.053SerLys: 3.053 ± 2.373
4.58SerLeu: 4.58 ± 1.291
1.527SerMet: 1.527 ± 1.083
7.634SerAsn: 7.634 ± 1.394
1.527SerPro: 1.527 ± 1.083
0.0SerGln: 0.0 ± 0.0
3.053SerArg: 3.053 ± 0.104
7.634SerSer: 7.634 ± 0.875
9.16SerThr: 9.16 ± 2.581
1.527SerVal: 1.527 ± 1.187
0.0SerTrp: 0.0 ± 0.0
1.527SerTyr: 1.527 ± 1.083
0.0SerXaa: 0.0 ± 0.0
Thr
4.58ThrAla: 4.58 ± 0.979
3.053ThrCys: 3.053 ± 2.165
6.107ThrAsp: 6.107 ± 0.208
1.527ThrGlu: 1.527 ± 1.083
1.527ThrPhe: 1.527 ± 1.187
1.527ThrGly: 1.527 ± 1.083
0.0ThrHis: 0.0 ± 0.0
7.634ThrIle: 7.634 ± 1.394
4.58ThrLys: 4.58 ± 0.979
4.58ThrLeu: 4.58 ± 1.291
3.053ThrMet: 3.053 ± 2.165
7.634ThrAsn: 7.634 ± 3.144
4.58ThrPro: 4.58 ± 0.979
1.527ThrGln: 1.527 ± 1.083
1.527ThrArg: 1.527 ± 1.083
9.16ThrSer: 9.16 ± 2.581
6.107ThrThr: 6.107 ± 0.208
6.107ThrVal: 6.107 ± 2.477
3.053ThrTrp: 3.053 ± 0.104
6.107ThrTyr: 6.107 ± 2.477
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
0.0ValCys: 0.0 ± 0.0
1.527ValAsp: 1.527 ± 1.187
1.527ValGlu: 1.527 ± 1.187
3.053ValPhe: 3.053 ± 2.165
1.527ValGly: 1.527 ± 1.083
1.527ValHis: 1.527 ± 1.187
6.107ValIle: 6.107 ± 2.061
3.053ValLys: 3.053 ± 2.373
4.58ValLeu: 4.58 ± 0.979
1.527ValMet: 1.527 ± 0.742
4.58ValAsn: 4.58 ± 3.56
3.053ValPro: 3.053 ± 0.104
1.527ValGln: 1.527 ± 1.083
4.58ValArg: 4.58 ± 0.979
1.527ValSer: 1.527 ± 1.187
9.16ValThr: 9.16 ± 1.957
1.527ValVal: 1.527 ± 1.187
0.0ValTrp: 0.0 ± 0.0
3.053ValTyr: 3.053 ± 0.104
0.0ValXaa: 0.0 ± 0.0
Trp
1.527TrpAla: 1.527 ± 1.187
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.527TrpGlu: 1.527 ± 1.083
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.527TrpIle: 1.527 ± 1.083
0.0TrpLys: 0.0 ± 0.0
1.527TrpLeu: 1.527 ± 1.083
0.0TrpMet: 0.0 ± 0.0
1.527TrpAsn: 1.527 ± 1.083
0.0TrpPro: 0.0 ± 0.0
1.527TrpGln: 1.527 ± 1.083
1.527TrpArg: 1.527 ± 1.083
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.527TrpVal: 1.527 ± 1.187
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
6.107TyrAla: 6.107 ± 0.208
0.0TyrCys: 0.0 ± 0.0
3.053TyrAsp: 3.053 ± 2.373
1.527TyrGlu: 1.527 ± 1.187
3.053TyrPhe: 3.053 ± 0.104
3.053TyrGly: 3.053 ± 2.165
4.58TyrHis: 4.58 ± 1.291
1.527TyrIle: 1.527 ± 1.187
0.0TyrLys: 0.0 ± 0.0
3.053TyrLeu: 3.053 ± 0.104
1.527TyrMet: 1.527 ± 1.083
1.527TyrAsn: 1.527 ± 1.083
3.053TyrPro: 3.053 ± 0.104
1.527TyrGln: 1.527 ± 1.187
1.527TyrArg: 1.527 ± 1.083
7.634TyrSer: 7.634 ± 5.413
1.527TyrThr: 1.527 ± 1.083
1.527TyrVal: 1.527 ± 1.083
1.527TyrTrp: 1.527 ± 1.083
3.053TyrTyr: 3.053 ± 0.104
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (656 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski