Amino acid dipepetide frequency for Hubei odonate virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.393AlaAla: 5.393 ± 0.0
1.348AlaCys: 1.348 ± 0.0
4.382AlaAsp: 4.382 ± 0.0
1.348AlaGlu: 1.348 ± 0.0
2.696AlaPhe: 2.696 ± 0.0
1.685AlaGly: 1.685 ± 0.0
1.685AlaHis: 1.685 ± 0.0
2.022AlaIle: 2.022 ± 0.0
2.696AlaLys: 2.696 ± 0.0
5.056AlaLeu: 5.056 ± 0.0
1.348AlaMet: 1.348 ± 0.0
2.022AlaAsn: 2.022 ± 0.0
2.022AlaPro: 2.022 ± 0.0
3.37AlaGln: 3.37 ± 0.0
2.696AlaArg: 2.696 ± 0.0
5.73AlaSer: 5.73 ± 0.0
1.685AlaThr: 1.685 ± 0.0
2.022AlaVal: 2.022 ± 0.0
1.011AlaTrp: 1.011 ± 0.0
3.37AlaTyr: 3.37 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.674CysAla: 0.674 ± 0.0
1.348CysCys: 1.348 ± 0.0
0.337CysAsp: 0.337 ± 0.0
2.359CysGlu: 2.359 ± 0.0
0.337CysPhe: 0.337 ± 0.0
2.022CysGly: 2.022 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.674CysIle: 0.674 ± 0.0
1.011CysLys: 1.011 ± 0.0
3.033CysLeu: 3.033 ± 0.0
1.011CysMet: 1.011 ± 0.0
1.011CysAsn: 1.011 ± 0.0
1.011CysPro: 1.011 ± 0.0
1.348CysGln: 1.348 ± 0.0
1.685CysArg: 1.685 ± 0.0
1.685CysSer: 1.685 ± 0.0
0.674CysThr: 0.674 ± 0.0
0.674CysVal: 0.674 ± 0.0
0.0CysTrp: 0.0 ± 0.0
2.359CysTyr: 2.359 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.37AspAla: 3.37 ± 0.0
1.348AspCys: 1.348 ± 0.0
3.707AspAsp: 3.707 ± 0.0
3.37AspGlu: 3.37 ± 0.0
2.022AspPhe: 2.022 ± 0.0
2.359AspGly: 2.359 ± 0.0
1.348AspHis: 1.348 ± 0.0
3.707AspIle: 3.707 ± 0.0
3.707AspLys: 3.707 ± 0.0
4.044AspLeu: 4.044 ± 0.0
1.348AspMet: 1.348 ± 0.0
2.696AspAsn: 2.696 ± 0.0
2.359AspPro: 2.359 ± 0.0
0.337AspGln: 0.337 ± 0.0
1.685AspArg: 1.685 ± 0.0
5.393AspSer: 5.393 ± 0.0
2.359AspThr: 2.359 ± 0.0
4.044AspVal: 4.044 ± 0.0
1.011AspTrp: 1.011 ± 0.0
1.685AspTyr: 1.685 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.022GluAla: 2.022 ± 0.0
1.348GluCys: 1.348 ± 0.0
3.707GluAsp: 3.707 ± 0.0
4.382GluGlu: 4.382 ± 0.0
3.37GluPhe: 3.37 ± 0.0
2.359GluGly: 2.359 ± 0.0
1.348GluHis: 1.348 ± 0.0
6.067GluIle: 6.067 ± 0.0
4.719GluLys: 4.719 ± 0.0
4.044GluLeu: 4.044 ± 0.0
1.348GluMet: 1.348 ± 0.0
2.359GluAsn: 2.359 ± 0.0
5.056GluPro: 5.056 ± 0.0
1.685GluGln: 1.685 ± 0.0
1.685GluArg: 1.685 ± 0.0
4.719GluSer: 4.719 ± 0.0
3.37GluThr: 3.37 ± 0.0
3.707GluVal: 3.707 ± 0.0
1.348GluTrp: 1.348 ± 0.0
2.696GluTyr: 2.696 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.696PheAla: 2.696 ± 0.0
0.337PheCys: 0.337 ± 0.0
2.022PheAsp: 2.022 ± 0.0
5.393PheGlu: 5.393 ± 0.0
1.685PhePhe: 1.685 ± 0.0
2.359PheGly: 2.359 ± 0.0
0.674PheHis: 0.674 ± 0.0
3.37PheIle: 3.37 ± 0.0
4.719PheLys: 4.719 ± 0.0
3.707PheLeu: 3.707 ± 0.0
0.674PheMet: 0.674 ± 0.0
1.348PheAsn: 1.348 ± 0.0
1.011PhePro: 1.011 ± 0.0
3.033PheGln: 3.033 ± 0.0
2.696PheArg: 2.696 ± 0.0
2.696PheSer: 2.696 ± 0.0
3.033PheThr: 3.033 ± 0.0
3.37PheVal: 3.37 ± 0.0
1.011PheTrp: 1.011 ± 0.0
2.359PheTyr: 2.359 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
1.685GlyAla: 1.685 ± 0.0
1.348GlyCys: 1.348 ± 0.0
4.044GlyAsp: 4.044 ± 0.0
4.044GlyGlu: 4.044 ± 0.0
2.022GlyPhe: 2.022 ± 0.0
1.348GlyGly: 1.348 ± 0.0
0.337GlyHis: 0.337 ± 0.0
5.056GlyIle: 5.056 ± 0.0
3.033GlyLys: 3.033 ± 0.0
4.382GlyLeu: 4.382 ± 0.0
1.685GlyMet: 1.685 ± 0.0
1.348GlyAsn: 1.348 ± 0.0
2.022GlyPro: 2.022 ± 0.0
1.685GlyGln: 1.685 ± 0.0
1.685GlyArg: 1.685 ± 0.0
4.382GlySer: 4.382 ± 0.0
2.696GlyThr: 2.696 ± 0.0
2.696GlyVal: 2.696 ± 0.0
0.337GlyTrp: 0.337 ± 0.0
2.696GlyTyr: 2.696 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.674HisAla: 0.674 ± 0.0
2.022HisCys: 2.022 ± 0.0
1.348HisAsp: 1.348 ± 0.0
0.337HisGlu: 0.337 ± 0.0
0.0HisPhe: 0.0 ± 0.0
1.011HisGly: 1.011 ± 0.0
0.674HisHis: 0.674 ± 0.0
2.022HisIle: 2.022 ± 0.0
2.022HisLys: 2.022 ± 0.0
1.685HisLeu: 1.685 ± 0.0
0.0HisMet: 0.0 ± 0.0
1.685HisAsn: 1.685 ± 0.0
1.348HisPro: 1.348 ± 0.0
1.011HisGln: 1.011 ± 0.0
1.685HisArg: 1.685 ± 0.0
1.011HisSer: 1.011 ± 0.0
0.674HisThr: 0.674 ± 0.0
2.696HisVal: 2.696 ± 0.0
0.674HisTrp: 0.674 ± 0.0
1.011HisTyr: 1.011 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.393IleAla: 5.393 ± 0.0
0.674IleCys: 0.674 ± 0.0
3.707IleAsp: 3.707 ± 0.0
1.348IleGlu: 1.348 ± 0.0
4.382IlePhe: 4.382 ± 0.0
4.044IleGly: 4.044 ± 0.0
1.011IleHis: 1.011 ± 0.0
2.696IleIle: 2.696 ± 0.0
5.393IleLys: 5.393 ± 0.0
6.741IleLeu: 6.741 ± 0.0
2.359IleMet: 2.359 ± 0.0
3.033IleAsn: 3.033 ± 0.0
6.404IlePro: 6.404 ± 0.0
3.033IleGln: 3.033 ± 0.0
4.719IleArg: 4.719 ± 0.0
4.382IleSer: 4.382 ± 0.0
5.056IleThr: 5.056 ± 0.0
4.382IleVal: 4.382 ± 0.0
0.674IleTrp: 0.674 ± 0.0
3.707IleTyr: 3.707 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.696LysAla: 2.696 ± 0.0
1.685LysCys: 1.685 ± 0.0
3.37LysAsp: 3.37 ± 0.0
5.056LysGlu: 5.056 ± 0.0
5.056LysPhe: 5.056 ± 0.0
4.044LysGly: 4.044 ± 0.0
2.022LysHis: 2.022 ± 0.0
6.404LysIle: 6.404 ± 0.0
5.73LysLys: 5.73 ± 0.0
5.73LysLeu: 5.73 ± 0.0
1.011LysMet: 1.011 ± 0.0
2.696LysAsn: 2.696 ± 0.0
2.022LysPro: 2.022 ± 0.0
1.685LysGln: 1.685 ± 0.0
2.696LysArg: 2.696 ± 0.0
6.067LysSer: 6.067 ± 0.0
3.707LysThr: 3.707 ± 0.0
2.696LysVal: 2.696 ± 0.0
0.674LysTrp: 0.674 ± 0.0
4.044LysTyr: 4.044 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
5.393LeuAla: 5.393 ± 0.0
2.696LeuCys: 2.696 ± 0.0
3.033LeuAsp: 3.033 ± 0.0
4.382LeuGlu: 4.382 ± 0.0
3.37LeuPhe: 3.37 ± 0.0
3.033LeuGly: 3.033 ± 0.0
3.033LeuHis: 3.033 ± 0.0
4.382LeuIle: 4.382 ± 0.0
9.1LeuLys: 9.1 ± 0.0
7.752LeuLeu: 7.752 ± 0.0
1.685LeuMet: 1.685 ± 0.0
6.404LeuAsn: 6.404 ± 0.0
5.393LeuPro: 5.393 ± 0.0
3.707LeuGln: 3.707 ± 0.0
5.056LeuArg: 5.056 ± 0.0
7.415LeuSer: 7.415 ± 0.0
6.067LeuThr: 6.067 ± 0.0
3.707LeuVal: 3.707 ± 0.0
1.348LeuTrp: 1.348 ± 0.0
2.696LeuTyr: 2.696 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.685MetAla: 1.685 ± 0.0
0.337MetCys: 0.337 ± 0.0
2.022MetAsp: 2.022 ± 0.0
1.685MetGlu: 1.685 ± 0.0
1.685MetPhe: 1.685 ± 0.0
0.337MetGly: 0.337 ± 0.0
0.337MetHis: 0.337 ± 0.0
2.359MetIle: 2.359 ± 0.0
1.348MetLys: 1.348 ± 0.0
1.348MetLeu: 1.348 ± 0.0
0.0MetMet: 0.0 ± 0.0
1.011MetAsn: 1.011 ± 0.0
1.348MetPro: 1.348 ± 0.0
0.337MetGln: 0.337 ± 0.0
1.685MetArg: 1.685 ± 0.0
1.348MetSer: 1.348 ± 0.0
0.674MetThr: 0.674 ± 0.0
2.359MetVal: 2.359 ± 0.0
0.337MetTrp: 0.337 ± 0.0
1.011MetTyr: 1.011 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.033AsnAla: 3.033 ± 0.0
0.337AsnCys: 0.337 ± 0.0
1.348AsnAsp: 1.348 ± 0.0
1.348AsnGlu: 1.348 ± 0.0
2.696AsnPhe: 2.696 ± 0.0
2.022AsnGly: 2.022 ± 0.0
1.348AsnHis: 1.348 ± 0.0
3.37AsnIle: 3.37 ± 0.0
2.696AsnLys: 2.696 ± 0.0
3.707AsnLeu: 3.707 ± 0.0
1.011AsnMet: 1.011 ± 0.0
0.674AsnAsn: 0.674 ± 0.0
2.022AsnPro: 2.022 ± 0.0
2.022AsnGln: 2.022 ± 0.0
2.359AsnArg: 2.359 ± 0.0
4.719AsnSer: 4.719 ± 0.0
3.033AsnThr: 3.033 ± 0.0
4.382AsnVal: 4.382 ± 0.0
1.348AsnTrp: 1.348 ± 0.0
0.337AsnTyr: 0.337 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.022ProAla: 2.022 ± 0.0
1.348ProCys: 1.348 ± 0.0
1.348ProAsp: 1.348 ± 0.0
4.044ProGlu: 4.044 ± 0.0
2.359ProPhe: 2.359 ± 0.0
3.033ProGly: 3.033 ± 0.0
1.685ProHis: 1.685 ± 0.0
5.73ProIle: 5.73 ± 0.0
3.033ProLys: 3.033 ± 0.0
5.056ProLeu: 5.056 ± 0.0
1.685ProMet: 1.685 ± 0.0
1.348ProAsn: 1.348 ± 0.0
2.696ProPro: 2.696 ± 0.0
2.359ProGln: 2.359 ± 0.0
2.359ProArg: 2.359 ± 0.0
4.382ProSer: 4.382 ± 0.0
2.359ProThr: 2.359 ± 0.0
3.033ProVal: 3.033 ± 0.0
1.348ProTrp: 1.348 ± 0.0
1.685ProTyr: 1.685 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.022GlnAla: 2.022 ± 0.0
1.011GlnCys: 1.011 ± 0.0
1.348GlnAsp: 1.348 ± 0.0
2.696GlnGlu: 2.696 ± 0.0
2.359GlnPhe: 2.359 ± 0.0
2.359GlnGly: 2.359 ± 0.0
1.685GlnHis: 1.685 ± 0.0
3.033GlnIle: 3.033 ± 0.0
2.359GlnLys: 2.359 ± 0.0
3.707GlnLeu: 3.707 ± 0.0
1.348GlnMet: 1.348 ± 0.0
3.033GlnAsn: 3.033 ± 0.0
1.685GlnPro: 1.685 ± 0.0
0.674GlnGln: 0.674 ± 0.0
1.348GlnArg: 1.348 ± 0.0
2.359GlnSer: 2.359 ± 0.0
1.348GlnThr: 1.348 ± 0.0
1.011GlnVal: 1.011 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
1.685GlnTyr: 1.685 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.359ArgAla: 2.359 ± 0.0
0.674ArgCys: 0.674 ± 0.0
3.707ArgAsp: 3.707 ± 0.0
2.696ArgGlu: 2.696 ± 0.0
2.359ArgPhe: 2.359 ± 0.0
2.696ArgGly: 2.696 ± 0.0
1.011ArgHis: 1.011 ± 0.0
3.707ArgIle: 3.707 ± 0.0
3.033ArgLys: 3.033 ± 0.0
5.73ArgLeu: 5.73 ± 0.0
0.674ArgMet: 0.674 ± 0.0
1.685ArgAsn: 1.685 ± 0.0
0.674ArgPro: 0.674 ± 0.0
1.685ArgGln: 1.685 ± 0.0
2.359ArgArg: 2.359 ± 0.0
1.348ArgSer: 1.348 ± 0.0
3.033ArgThr: 3.033 ± 0.0
3.37ArgVal: 3.37 ± 0.0
0.674ArgTrp: 0.674 ± 0.0
2.359ArgTyr: 2.359 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.033SerAla: 3.033 ± 0.0
0.337SerCys: 0.337 ± 0.0
3.37SerAsp: 3.37 ± 0.0
5.73SerGlu: 5.73 ± 0.0
4.719SerPhe: 4.719 ± 0.0
5.393SerGly: 5.393 ± 0.0
1.348SerHis: 1.348 ± 0.0
7.415SerIle: 7.415 ± 0.0
5.393SerLys: 5.393 ± 0.0
7.752SerLeu: 7.752 ± 0.0
1.685SerMet: 1.685 ± 0.0
5.73SerAsn: 5.73 ± 0.0
4.044SerPro: 4.044 ± 0.0
3.033SerGln: 3.033 ± 0.0
1.348SerArg: 1.348 ± 0.0
6.067SerSer: 6.067 ± 0.0
4.382SerThr: 4.382 ± 0.0
3.37SerVal: 3.37 ± 0.0
2.022SerTrp: 2.022 ± 0.0
2.359SerTyr: 2.359 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.033ThrAla: 3.033 ± 0.0
1.685ThrCys: 1.685 ± 0.0
3.033ThrAsp: 3.033 ± 0.0
3.707ThrGlu: 3.707 ± 0.0
4.044ThrPhe: 4.044 ± 0.0
2.359ThrGly: 2.359 ± 0.0
1.685ThrHis: 1.685 ± 0.0
2.022ThrIle: 2.022 ± 0.0
2.696ThrLys: 2.696 ± 0.0
4.719ThrLeu: 4.719 ± 0.0
1.685ThrMet: 1.685 ± 0.0
1.348ThrAsn: 1.348 ± 0.0
2.359ThrPro: 2.359 ± 0.0
2.359ThrGln: 2.359 ± 0.0
2.359ThrArg: 2.359 ± 0.0
3.37ThrSer: 3.37 ± 0.0
3.033ThrThr: 3.033 ± 0.0
3.37ThrVal: 3.37 ± 0.0
0.674ThrTrp: 0.674 ± 0.0
4.044ThrTyr: 4.044 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
2.696ValAla: 2.696 ± 0.0
0.674ValCys: 0.674 ± 0.0
2.696ValAsp: 2.696 ± 0.0
4.719ValGlu: 4.719 ± 0.0
2.022ValPhe: 2.022 ± 0.0
2.359ValGly: 2.359 ± 0.0
1.685ValHis: 1.685 ± 0.0
3.37ValIle: 3.37 ± 0.0
3.033ValLys: 3.033 ± 0.0
5.73ValLeu: 5.73 ± 0.0
1.685ValMet: 1.685 ± 0.0
2.022ValAsn: 2.022 ± 0.0
5.393ValPro: 5.393 ± 0.0
2.022ValGln: 2.022 ± 0.0
2.022ValArg: 2.022 ± 0.0
6.741ValSer: 6.741 ± 0.0
2.359ValThr: 2.359 ± 0.0
3.707ValVal: 3.707 ± 0.0
1.011ValTrp: 1.011 ± 0.0
2.696ValTyr: 2.696 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
2.022TrpAla: 2.022 ± 0.0
1.011TrpCys: 1.011 ± 0.0
1.011TrpAsp: 1.011 ± 0.0
0.674TrpGlu: 0.674 ± 0.0
0.674TrpPhe: 0.674 ± 0.0
1.011TrpGly: 1.011 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.674TrpIle: 0.674 ± 0.0
1.348TrpLys: 1.348 ± 0.0
2.359TrpLeu: 2.359 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.337TrpAsn: 0.337 ± 0.0
0.337TrpPro: 0.337 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.348TrpArg: 1.348 ± 0.0
1.348TrpSer: 1.348 ± 0.0
1.348TrpThr: 1.348 ± 0.0
0.337TrpVal: 0.337 ± 0.0
0.337TrpTrp: 0.337 ± 0.0
1.011TrpTyr: 1.011 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.685TyrAla: 1.685 ± 0.0
1.685TyrCys: 1.685 ± 0.0
2.359TyrAsp: 2.359 ± 0.0
2.022TyrGlu: 2.022 ± 0.0
0.337TyrPhe: 0.337 ± 0.0
3.033TyrGly: 3.033 ± 0.0
0.674TyrHis: 0.674 ± 0.0
5.056TyrIle: 5.056 ± 0.0
2.022TyrLys: 2.022 ± 0.0
3.37TyrLeu: 3.37 ± 0.0
0.674TyrMet: 0.674 ± 0.0
2.022TyrAsn: 2.022 ± 0.0
3.707TyrPro: 3.707 ± 0.0
1.685TyrGln: 1.685 ± 0.0
2.359TyrArg: 2.359 ± 0.0
3.37TyrSer: 3.37 ± 0.0
2.696TyrThr: 2.696 ± 0.0
3.707TyrVal: 3.707 ± 0.0
1.348TyrTrp: 1.348 ± 0.0
3.033TyrTyr: 3.033 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2968 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski