Amino acid dipeptide frequency for Odonata associated gemycircularvirus-2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). Because this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue, for better visibility.
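The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names, the toy sequences, and the convention of counting overlapping adjacent pairs within each protein (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the column order used by the table; Xaa is the catch-all.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def to_three(residue):
    """Map a one-letter code to its three-letter code, or Xaa if non-standard."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_permille(proteins):
    """Return the frequency (in per mille) of each of the 441 dipeptides.

    Assumption: dipeptides are overlapping adjacent pairs within a protein;
    pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[to_three(first) + to_three(second)] += 1
    total = sum(counts.values())
    codes = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(codes, repeat=2)
    }


# Toy sequences (hypothetical, not the real proteins of this proteome).
freqs = dipeptide_permille(["MAAGK", "MAGX"])
# Expected value if all 400 standard dipeptides were equally likely.
uniform = 1000.0 / 400
# Dipeptides above the uniform expectation (the ones a table would highlight).
over = sorted(name for name, value in freqs.items() if value > uniform)
```

Comparing each value with `uniform` gives the over/under-representation split that the red and blue markings convey; the exact threshold used for the colouring on the page is not stated, so the 2.5‰ cut-off here is only the uniform expectation from the text.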

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.063 ± 0.089
AlaCys: 1.531 ± 1.077
AlaAsp: 3.063 ± 2.154
AlaGlu: 6.126 ± 2.064
AlaPhe: 1.531 ± 1.077
AlaGly: 9.188 ± 0.268
AlaHis: 1.531 ± 1.077
AlaIle: 3.063 ± 0.089
AlaLys: 6.126 ± 0.179
AlaLeu: 3.063 ± 2.154
AlaMet: 1.531 ± 1.166
AlaAsn: 1.531 ± 1.077
AlaPro: 4.594 ± 3.23
AlaGln: 1.531 ± 1.166
AlaArg: 6.126 ± 0.179
AlaSer: 6.126 ± 0.179
AlaThr: 9.188 ± 2.511
AlaVal: 3.063 ± 2.154
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.531 ± 1.166
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.531 ± 1.077
CysCys: 0.0 ± 0.0
CysAsp: 1.531 ± 1.077
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.531 ± 1.077
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.531 ± 1.077
CysPro: 1.531 ± 1.166
CysGln: 1.531 ± 1.077
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 1.531 ± 1.166
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.126 ± 2.064
AspCys: 1.531 ± 1.077
AspAsp: 7.657 ± 0.898
AspGlu: 1.531 ± 1.166
AspPhe: 3.063 ± 0.089
AspGly: 7.657 ± 3.141
AspHis: 0.0 ± 0.0
AspIle: 4.594 ± 0.988
AspLys: 0.0 ± 0.0
AspLeu: 1.531 ± 1.077
AspMet: 1.531 ± 1.077
AspAsn: 3.063 ± 0.089
AspPro: 3.063 ± 0.089
AspGln: 1.531 ± 1.077
AspArg: 1.531 ± 1.166
AspSer: 0.0 ± 0.0
AspThr: 6.126 ± 2.422
AspVal: 4.594 ± 0.988
AspTrp: 7.657 ± 3.141
AspTyr: 3.063 ± 2.154
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.063 ± 0.089
GluCys: 1.531 ± 1.077
GluAsp: 0.0 ± 0.0
GluGlu: 1.531 ± 1.077
GluPhe: 3.063 ± 2.154
GluGly: 0.0 ± 0.0
GluHis: 0.0 ± 0.0
GluIle: 0.0 ± 0.0
GluLys: 1.531 ± 1.166
GluLeu: 3.063 ± 2.154
GluMet: 0.0 ± 0.747
GluAsn: 3.063 ± 0.089
GluPro: 1.531 ± 1.166
GluGln: 1.531 ± 1.077
GluArg: 4.594 ± 0.988
GluSer: 0.0 ± 0.0
GluThr: 0.0 ± 0.0
GluVal: 1.531 ± 1.166
GluTrp: 3.063 ± 2.154
GluTyr: 1.531 ± 1.077
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 1.531 ± 1.166
PheAsp: 6.126 ± 4.307
PheGlu: 0.0 ± 0.0
PhePhe: 1.531 ± 1.077
PheGly: 4.594 ± 3.23
PheHis: 3.063 ± 2.154
PheIle: 1.531 ± 1.077
PheLys: 1.531 ± 1.166
PheLeu: 3.063 ± 2.154
PheMet: 0.0 ± 0.0
PheAsn: 6.126 ± 2.422
PhePro: 0.0 ± 0.0
PheGln: 1.531 ± 1.166
PheArg: 4.594 ± 0.988
PheSer: 9.188 ± 2.511
PheThr: 0.0 ± 0.0
PheVal: 4.594 ± 0.988
PheTrp: 1.531 ± 1.077
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.188 ± 2.511
GlyCys: 1.531 ± 1.077
GlyAsp: 4.594 ± 1.255
GlyGlu: 3.063 ± 2.154
GlyPhe: 1.531 ± 1.077
GlyGly: 4.594 ± 3.23
GlyHis: 4.594 ± 0.988
GlyIle: 0.0 ± 0.0
GlyLys: 4.594 ± 0.988
GlyLeu: 1.531 ± 1.077
GlyMet: 7.657 ± 1.345
GlyAsn: 1.531 ± 1.077
GlyPro: 6.126 ± 0.179
GlyGln: 0.0 ± 0.0
GlyArg: 6.126 ± 4.307
GlySer: 7.657 ± 3.141
GlyThr: 4.594 ± 0.988
GlyVal: 3.063 ± 0.089
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.063 ± 0.089
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.531 ± 1.077
HisCys: 0.0 ± 0.0
HisAsp: 1.531 ± 1.077
HisGlu: 1.531 ± 1.166
HisPhe: 4.594 ± 0.988
HisGly: 3.063 ± 2.154
HisHis: 3.063 ± 2.154
HisIle: 1.531 ± 1.077
HisLys: 0.0 ± 0.0
HisLeu: 3.063 ± 2.154
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.531 ± 1.077
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 4.594 ± 0.988
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.0 ± 0.0
IleCys: 0.0 ± 0.0
IleAsp: 0.0 ± 0.0
IleGlu: 1.531 ± 1.077
IlePhe: 4.594 ± 0.988
IleGly: 3.063 ± 0.089
IleHis: 0.0 ± 0.0
IleIle: 4.594 ± 0.988
IleLys: 1.531 ± 1.166
IleLeu: 3.063 ± 0.089
IleMet: 0.0 ± 0.0
IleAsn: 3.063 ± 2.332
IlePro: 0.0 ± 0.0
IleGln: 3.063 ± 0.089
IleArg: 0.0 ± 0.0
IleSer: 3.063 ± 0.089
IleThr: 4.594 ± 1.255
IleVal: 6.126 ± 0.179
IleTrp: 1.531 ± 1.077
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.531 ± 1.077
LysCys: 0.0 ± 0.0
LysAsp: 4.594 ± 3.23
LysGlu: 1.531 ± 1.166
LysPhe: 0.0 ± 0.0
LysGly: 4.594 ± 1.255
LysHis: 0.0 ± 0.0
LysIle: 0.0 ± 0.0
LysLys: 6.126 ± 4.665
LysLeu: 6.126 ± 2.064
LysMet: 1.531 ± 0.78
LysAsn: 3.063 ± 0.089
LysPro: 1.531 ± 1.166
LysGln: 1.531 ± 1.166
LysArg: 4.594 ± 3.498
LysSer: 3.063 ± 0.089
LysThr: 6.126 ± 2.064
LysVal: 1.531 ± 1.166
LysTrp: 1.531 ± 1.077
LysTyr: 1.531 ± 1.166
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.126 ± 4.307
LeuCys: 1.531 ± 1.077
LeuAsp: 4.594 ± 3.23
LeuGlu: 1.531 ± 1.077
LeuPhe: 3.063 ± 0.089
LeuGly: 3.063 ± 2.154
LeuHis: 4.594 ± 3.23
LeuIle: 1.531 ± 1.077
LeuLys: 1.531 ± 1.077
LeuLeu: 1.531 ± 1.077
LeuMet: 4.594 ± 3.23
LeuAsn: 1.531 ± 1.166
LeuPro: 3.063 ± 2.154
LeuGln: 0.0 ± 0.0
LeuArg: 3.063 ± 2.332
LeuSer: 6.126 ± 2.064
LeuThr: 3.063 ± 0.089
LeuVal: 4.594 ± 1.255
LeuTrp: 6.126 ± 2.422
LeuTyr: 3.063 ± 2.332
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.063 ± 0.089
MetCys: 0.0 ± 0.0
MetAsp: 1.531 ± 1.077
MetGlu: 0.0 ± 0.0
MetPhe: 3.063 ± 0.089
MetGly: 1.531 ± 1.166
MetHis: 1.531 ± 1.077
MetIle: 0.0 ± 0.0
MetLys: 3.063 ± 0.089
MetLeu: 4.594 ± 0.988
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.531 ± 1.166
MetGln: 1.531 ± 1.077
MetArg: 1.531 ± 1.077
MetSer: 7.657 ± 5.831
MetThr: 0.0 ± 0.0
MetVal: 1.531 ± 1.166
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.531 ± 1.166
AsnCys: 1.531 ± 1.166
AsnAsp: 0.0 ± 0.0
AsnGlu: 0.0 ± 0.0
AsnPhe: 3.063 ± 0.089
AsnGly: 1.531 ± 1.166
AsnHis: 1.531 ± 1.077
AsnIle: 1.531 ± 1.077
AsnLys: 1.531 ± 1.077
AsnLeu: 6.126 ± 2.422
AsnMet: 1.531 ± 1.166
AsnAsn: 0.0 ± 0.0
AsnPro: 1.531 ± 1.077
AsnGln: 1.531 ± 1.166
AsnArg: 1.531 ± 1.077
AsnSer: 1.531 ± 1.166
AsnThr: 4.594 ± 3.498
AsnVal: 3.063 ± 2.154
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.063 ± 0.089
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.531 ± 1.166
ProCys: 0.0 ± 0.0
ProAsp: 0.0 ± 0.0
ProGlu: 1.531 ± 1.077
ProPhe: 3.063 ± 0.089
ProGly: 9.188 ± 4.218
ProHis: 1.531 ± 1.166
ProIle: 4.594 ± 1.255
ProLys: 0.0 ± 0.0
ProLeu: 1.531 ± 1.077
ProMet: 1.531 ± 1.166
ProAsn: 1.531 ± 1.077
ProPro: 6.126 ± 2.422
ProGln: 0.0 ± 0.0
ProArg: 3.063 ± 2.154
ProSer: 3.063 ± 2.332
ProThr: 6.126 ± 2.422
ProVal: 1.531 ± 1.077
ProTrp: 1.531 ± 1.166
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.063 ± 2.154
GlnCys: 0.0 ± 0.0
GlnAsp: 4.594 ± 1.255
GlnGlu: 0.0 ± 0.0
GlnPhe: 3.063 ± 2.154
GlnGly: 1.531 ± 1.166
GlnHis: 0.0 ± 0.0
GlnIle: 1.531 ± 1.077
GlnLys: 1.531 ± 1.166
GlnLeu: 3.063 ± 2.154
GlnMet: 0.0 ± 0.0
GlnAsn: 1.531 ± 1.166
GlnPro: 1.531 ± 1.166
GlnGln: 0.0 ± 0.0
GlnArg: 0.0 ± 0.0
GlnSer: 4.594 ± 1.255
GlnThr: 6.126 ± 4.665
GlnVal: 0.0 ± 0.0
GlnTrp: 1.531 ± 1.077
GlnTyr: 1.531 ± 1.077
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.531 ± 1.077
ArgCys: 0.0 ± 0.0
ArgAsp: 1.531 ± 1.077
ArgGlu: 1.531 ± 1.077
ArgPhe: 0.0 ± 0.0
ArgGly: 3.063 ± 0.089
ArgHis: 0.0 ± 0.0
ArgIle: 6.126 ± 4.665
ArgLys: 7.657 ± 0.898
ArgLeu: 6.126 ± 2.064
ArgMet: 1.531 ± 1.166
ArgAsn: 0.0 ± 0.0
ArgPro: 6.126 ± 2.064
ArgGln: 1.531 ± 1.166
ArgArg: 12.251 ± 7.086
ArgSer: 12.251 ± 4.843
ArgThr: 3.063 ± 2.332
ArgVal: 1.531 ± 1.077
ArgTrp: 1.531 ± 1.166
ArgTyr: 3.063 ± 2.154
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.531 ± 1.166
SerCys: 0.0 ± 0.0
SerAsp: 7.657 ± 5.831
SerGlu: 4.594 ± 0.988
SerPhe: 1.531 ± 1.166
SerGly: 6.126 ± 2.064
SerHis: 1.531 ± 1.077
SerIle: 3.063 ± 2.332
SerLys: 4.594 ± 3.498
SerLeu: 7.657 ± 1.345
SerMet: 0.0 ± 0.0
SerAsn: 4.594 ± 3.498
SerPro: 1.531 ± 1.166
SerGln: 9.188 ± 1.975
SerArg: 7.657 ± 3.588
SerSer: 9.188 ± 4.754
SerThr: 12.251 ± 4.843
SerVal: 6.126 ± 2.422
SerTrp: 0.0 ± 0.0
SerTyr: 3.063 ± 0.089
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.188 ± 6.997
ThrCys: 0.0 ± 0.0
ThrAsp: 4.594 ± 0.988
ThrGlu: 3.063 ± 0.089
ThrPhe: 4.594 ± 1.255
ThrGly: 1.531 ± 1.077
ThrHis: 1.531 ± 1.077
ThrIle: 1.531 ± 1.166
ThrLys: 1.531 ± 1.166
ThrLeu: 3.063 ± 2.332
ThrMet: 0.0 ± 0.0
ThrAsn: 1.531 ± 1.077
ThrPro: 3.063 ± 0.089
ThrGln: 6.126 ± 2.422
ThrArg: 7.657 ± 3.588
ThrSer: 12.251 ± 9.329
ThrThr: 4.594 ± 1.255
ThrVal: 4.594 ± 3.498
ThrTrp: 3.063 ± 2.154
ThrTyr: 1.531 ± 1.077
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.188 ± 1.975
ValCys: 0.0 ± 0.0
ValAsp: 9.188 ± 1.975
ValGlu: 1.531 ± 1.077
ValPhe: 6.126 ± 2.064
ValGly: 3.063 ± 2.332
ValHis: 0.0 ± 0.0
ValIle: 1.531 ± 1.077
ValLys: 0.0 ± 0.0
ValLeu: 1.531 ± 1.077
ValMet: 6.126 ± 2.422
ValAsn: 3.063 ± 0.089
ValPro: 1.531 ± 1.166
ValGln: 1.531 ± 1.166
ValArg: 3.063 ± 2.154
ValSer: 0.0 ± 0.0
ValThr: 3.063 ± 2.332
ValVal: 0.0 ± 0.0
ValTrp: 1.531 ± 1.077
ValTyr: 4.594 ± 3.498
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 6.126 ± 4.307
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.531 ± 1.077
TrpPhe: 3.063 ± 0.089
TrpGly: 3.063 ± 2.154
TrpHis: 1.531 ± 1.166
TrpIle: 0.0 ± 0.0
TrpLys: 4.594 ± 3.23
TrpLeu: 4.594 ± 0.988
TrpMet: 3.063 ± 0.089
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.531 ± 1.077
TrpArg: 1.531 ± 1.166
TrpSer: 1.531 ± 1.166
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.531 ± 1.166
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.594 ± 3.23
TyrCys: 0.0 ± 0.0
TyrAsp: 3.063 ± 0.089
TyrGlu: 0.0 ± 0.0
TyrPhe: 0.0 ± 0.0
TyrGly: 4.594 ± 1.255
TyrHis: 0.0 ± 0.0
TyrIle: 1.531 ± 1.166
TyrLys: 3.063 ± 0.089
TyrLeu: 0.0 ± 0.0
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 1.531 ± 1.166
TyrGln: 0.0 ± 0.0
TyrArg: 1.531 ± 1.166
TyrSer: 4.594 ± 1.255
TyrThr: 0.0 ± 0.0
TyrVal: 4.594 ± 0.988
TyrTrp: 3.063 ± 0.089
TyrTyr: 1.531 ± 1.166
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (654 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
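The page does not spell out the bootstrap procedure, so the following is only a generic sketch of protein-level bootstrapping: whole proteins are resampled with replacement, the pair frequencies are recomputed for each resample, and the spread over 100 rounds is taken as the error. The function names, the use of the population standard deviation, the one-letter pair keys, and the toy sequences are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins):
    """Per-mille frequency of each adjacent one-letter residue pair."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if not total:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, rounds=100, seed=0):
    """Estimate the spread of each pair frequency by resampling whole proteins.

    Each round draws len(proteins) proteins with replacement and recomputes
    the frequencies; the error is the standard deviation over all rounds.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    pairs = list(pair_permille(proteins))
    samples = {pair: [] for pair in pairs}
    for _ in range(rounds):
        resampled = rng.choices(proteins, k=len(proteins))
        freqs = pair_permille(resampled)
        for pair in pairs:
            samples[pair].append(freqs.get(pair, 0.0))
    return {pair: statistics.pstdev(values) for pair, values in samples.items()}


# Toy input (hypothetical sequences, not the two proteins behind the table).
errors = bootstrap_error(["MAAGKAG", "MSTRRSS"])
```

With only two proteins, each resample is one of three compositions (the first protein twice, the second twice, or one of each), which is why errors estimated this way can be large relative to the values themselves.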

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski