Amino acid dipepetide frequency for Sanxia water strider virus 21

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.618AlaAla: 6.618 ± 0.0
1.324AlaCys: 1.324 ± 0.0
2.316AlaAsp: 2.316 ± 0.0
2.978AlaGlu: 2.978 ± 0.0
2.647AlaPhe: 2.647 ± 0.0
4.964AlaGly: 4.964 ± 0.0
1.655AlaHis: 1.655 ± 0.0
5.295AlaIle: 5.295 ± 0.0
3.64AlaLys: 3.64 ± 0.0
5.295AlaLeu: 5.295 ± 0.0
1.655AlaMet: 1.655 ± 0.0
1.655AlaAsn: 1.655 ± 0.0
2.316AlaPro: 2.316 ± 0.0
2.647AlaGln: 2.647 ± 0.0
3.309AlaArg: 3.309 ± 0.0
5.956AlaSer: 5.956 ± 0.0
4.302AlaThr: 4.302 ± 0.0
2.316AlaVal: 2.316 ± 0.0
1.985AlaTrp: 1.985 ± 0.0
1.655AlaTyr: 1.655 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.655CysAla: 1.655 ± 0.0
0.662CysCys: 0.662 ± 0.0
0.331CysAsp: 0.331 ± 0.0
1.324CysGlu: 1.324 ± 0.0
0.662CysPhe: 0.662 ± 0.0
1.324CysGly: 1.324 ± 0.0
0.331CysHis: 0.331 ± 0.0
1.985CysIle: 1.985 ± 0.0
0.662CysLys: 0.662 ± 0.0
2.316CysLeu: 2.316 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.662CysAsn: 0.662 ± 0.0
1.655CysPro: 1.655 ± 0.0
0.331CysGln: 0.331 ± 0.0
0.993CysArg: 0.993 ± 0.0
0.662CysSer: 0.662 ± 0.0
0.331CysThr: 0.331 ± 0.0
1.324CysVal: 1.324 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.662CysTyr: 0.662 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.985AspAla: 1.985 ± 0.0
1.324AspCys: 1.324 ± 0.0
2.647AspAsp: 2.647 ± 0.0
5.295AspGlu: 5.295 ± 0.0
2.647AspPhe: 2.647 ± 0.0
2.316AspGly: 2.316 ± 0.0
0.331AspHis: 0.331 ± 0.0
3.64AspIle: 3.64 ± 0.0
3.309AspLys: 3.309 ± 0.0
3.971AspLeu: 3.971 ± 0.0
1.655AspMet: 1.655 ± 0.0
0.993AspAsn: 0.993 ± 0.0
2.978AspPro: 2.978 ± 0.0
0.993AspGln: 0.993 ± 0.0
1.985AspArg: 1.985 ± 0.0
4.633AspSer: 4.633 ± 0.0
1.324AspThr: 1.324 ± 0.0
2.647AspVal: 2.647 ± 0.0
1.324AspTrp: 1.324 ± 0.0
2.647AspTyr: 2.647 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.302GluAla: 4.302 ± 0.0
1.985GluCys: 1.985 ± 0.0
2.978GluAsp: 2.978 ± 0.0
4.964GluGlu: 4.964 ± 0.0
2.316GluPhe: 2.316 ± 0.0
3.64GluGly: 3.64 ± 0.0
3.309GluHis: 3.309 ± 0.0
2.647GluIle: 2.647 ± 0.0
4.633GluLys: 4.633 ± 0.0
6.287GluLeu: 6.287 ± 0.0
3.64GluMet: 3.64 ± 0.0
3.309GluAsn: 3.309 ± 0.0
1.985GluPro: 1.985 ± 0.0
1.985GluGln: 1.985 ± 0.0
3.971GluArg: 3.971 ± 0.0
2.978GluSer: 2.978 ± 0.0
2.316GluThr: 2.316 ± 0.0
7.611GluVal: 7.611 ± 0.0
0.331GluTrp: 0.331 ± 0.0
2.647GluTyr: 2.647 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.324PheAla: 1.324 ± 0.0
0.662PheCys: 0.662 ± 0.0
2.316PheAsp: 2.316 ± 0.0
3.971PheGlu: 3.971 ± 0.0
2.647PhePhe: 2.647 ± 0.0
1.324PheGly: 1.324 ± 0.0
0.662PheHis: 0.662 ± 0.0
2.316PheIle: 2.316 ± 0.0
4.302PheLys: 4.302 ± 0.0
2.647PheLeu: 2.647 ± 0.0
0.0PheMet: 0.0 ± 0.0
1.985PheAsn: 1.985 ± 0.0
1.655PhePro: 1.655 ± 0.0
1.655PheGln: 1.655 ± 0.0
1.324PheArg: 1.324 ± 0.0
4.302PheSer: 4.302 ± 0.0
2.647PheThr: 2.647 ± 0.0
3.309PheVal: 3.309 ± 0.0
0.993PheTrp: 0.993 ± 0.0
1.655PheTyr: 1.655 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.978GlyAla: 2.978 ± 0.0
1.985GlyCys: 1.985 ± 0.0
3.309GlyAsp: 3.309 ± 0.0
1.985GlyGlu: 1.985 ± 0.0
1.985GlyPhe: 1.985 ± 0.0
2.647GlyGly: 2.647 ± 0.0
0.993GlyHis: 0.993 ± 0.0
4.964GlyIle: 4.964 ± 0.0
4.633GlyLys: 4.633 ± 0.0
5.295GlyLeu: 5.295 ± 0.0
1.655GlyMet: 1.655 ± 0.0
2.316GlyAsn: 2.316 ± 0.0
1.655GlyPro: 1.655 ± 0.0
3.971GlyGln: 3.971 ± 0.0
1.655GlyArg: 1.655 ± 0.0
2.978GlySer: 2.978 ± 0.0
2.978GlyThr: 2.978 ± 0.0
1.655GlyVal: 1.655 ± 0.0
0.331GlyTrp: 0.331 ± 0.0
1.985GlyTyr: 1.985 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.993HisAla: 0.993 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.993HisAsp: 0.993 ± 0.0
1.324HisGlu: 1.324 ± 0.0
0.331HisPhe: 0.331 ± 0.0
2.978HisGly: 2.978 ± 0.0
0.662HisHis: 0.662 ± 0.0
0.662HisIle: 0.662 ± 0.0
0.993HisLys: 0.993 ± 0.0
2.978HisLeu: 2.978 ± 0.0
0.993HisMet: 0.993 ± 0.0
0.662HisAsn: 0.662 ± 0.0
1.985HisPro: 1.985 ± 0.0
0.993HisGln: 0.993 ± 0.0
2.316HisArg: 2.316 ± 0.0
0.662HisSer: 0.662 ± 0.0
0.331HisThr: 0.331 ± 0.0
1.985HisVal: 1.985 ± 0.0
0.331HisTrp: 0.331 ± 0.0
0.662HisTyr: 0.662 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
6.618IleAla: 6.618 ± 0.0
0.662IleCys: 0.662 ± 0.0
2.316IleAsp: 2.316 ± 0.0
3.309IleGlu: 3.309 ± 0.0
2.647IlePhe: 2.647 ± 0.0
2.647IleGly: 2.647 ± 0.0
1.985IleHis: 1.985 ± 0.0
2.647IleIle: 2.647 ± 0.0
3.64IleLys: 3.64 ± 0.0
7.611IleLeu: 7.611 ± 0.0
1.655IleMet: 1.655 ± 0.0
2.978IleAsn: 2.978 ± 0.0
4.964IlePro: 4.964 ± 0.0
2.647IleGln: 2.647 ± 0.0
4.633IleArg: 4.633 ± 0.0
2.316IleSer: 2.316 ± 0.0
3.64IleThr: 3.64 ± 0.0
5.956IleVal: 5.956 ± 0.0
0.331IleTrp: 0.331 ± 0.0
2.316IleTyr: 2.316 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.971LysAla: 3.971 ± 0.0
0.993LysCys: 0.993 ± 0.0
4.633LysAsp: 4.633 ± 0.0
2.978LysGlu: 2.978 ± 0.0
3.309LysPhe: 3.309 ± 0.0
1.985LysGly: 1.985 ± 0.0
1.324LysHis: 1.324 ± 0.0
4.964LysIle: 4.964 ± 0.0
4.302LysLys: 4.302 ± 0.0
5.295LysLeu: 5.295 ± 0.0
0.993LysMet: 0.993 ± 0.0
3.309LysAsn: 3.309 ± 0.0
2.978LysPro: 2.978 ± 0.0
2.978LysGln: 2.978 ± 0.0
2.316LysArg: 2.316 ± 0.0
4.633LysSer: 4.633 ± 0.0
4.964LysThr: 4.964 ± 0.0
3.971LysVal: 3.971 ± 0.0
0.662LysTrp: 0.662 ± 0.0
1.985LysTyr: 1.985 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.949LeuAla: 6.949 ± 0.0
1.985LeuCys: 1.985 ± 0.0
6.949LeuAsp: 6.949 ± 0.0
7.28LeuGlu: 7.28 ± 0.0
2.316LeuPhe: 2.316 ± 0.0
4.964LeuGly: 4.964 ± 0.0
0.993LeuHis: 0.993 ± 0.0
3.309LeuIle: 3.309 ± 0.0
5.956LeuLys: 5.956 ± 0.0
7.28LeuLeu: 7.28 ± 0.0
1.985LeuMet: 1.985 ± 0.0
4.964LeuAsn: 4.964 ± 0.0
3.64LeuPro: 3.64 ± 0.0
2.978LeuGln: 2.978 ± 0.0
5.295LeuArg: 5.295 ± 0.0
5.625LeuSer: 5.625 ± 0.0
6.949LeuThr: 6.949 ± 0.0
5.956LeuVal: 5.956 ± 0.0
0.662LeuTrp: 0.662 ± 0.0
2.647LeuTyr: 2.647 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.324MetAla: 1.324 ± 0.0
0.331MetCys: 0.331 ± 0.0
0.993MetAsp: 0.993 ± 0.0
1.324MetGlu: 1.324 ± 0.0
1.985MetPhe: 1.985 ± 0.0
2.647MetGly: 2.647 ± 0.0
0.331MetHis: 0.331 ± 0.0
1.985MetIle: 1.985 ± 0.0
1.324MetLys: 1.324 ± 0.0
3.971MetLeu: 3.971 ± 0.0
0.662MetMet: 0.662 ± 0.0
0.331MetAsn: 0.331 ± 0.0
0.662MetPro: 0.662 ± 0.0
0.993MetGln: 0.993 ± 0.0
1.655MetArg: 1.655 ± 0.0
2.316MetSer: 2.316 ± 0.0
0.993MetThr: 0.993 ± 0.0
2.316MetVal: 2.316 ± 0.0
0.331MetTrp: 0.331 ± 0.0
1.655MetTyr: 1.655 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.985AsnAla: 1.985 ± 0.0
0.662AsnCys: 0.662 ± 0.0
0.993AsnAsp: 0.993 ± 0.0
3.309AsnGlu: 3.309 ± 0.0
1.324AsnPhe: 1.324 ± 0.0
1.655AsnGly: 1.655 ± 0.0
0.0AsnHis: 0.0 ± 0.0
4.633AsnIle: 4.633 ± 0.0
2.316AsnLys: 2.316 ± 0.0
2.978AsnLeu: 2.978 ± 0.0
1.985AsnMet: 1.985 ± 0.0
2.978AsnAsn: 2.978 ± 0.0
2.978AsnPro: 2.978 ± 0.0
0.993AsnGln: 0.993 ± 0.0
1.655AsnArg: 1.655 ± 0.0
3.309AsnSer: 3.309 ± 0.0
2.647AsnThr: 2.647 ± 0.0
4.964AsnVal: 4.964 ± 0.0
0.331AsnTrp: 0.331 ± 0.0
1.324AsnTyr: 1.324 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.647ProAla: 2.647 ± 0.0
0.331ProCys: 0.331 ± 0.0
2.316ProAsp: 2.316 ± 0.0
2.647ProGlu: 2.647 ± 0.0
3.309ProPhe: 3.309 ± 0.0
2.316ProGly: 2.316 ± 0.0
2.316ProHis: 2.316 ± 0.0
4.302ProIle: 4.302 ± 0.0
3.971ProLys: 3.971 ± 0.0
3.971ProLeu: 3.971 ± 0.0
1.985ProMet: 1.985 ± 0.0
2.647ProAsn: 2.647 ± 0.0
2.978ProPro: 2.978 ± 0.0
1.985ProGln: 1.985 ± 0.0
3.309ProArg: 3.309 ± 0.0
2.647ProSer: 2.647 ± 0.0
3.971ProThr: 3.971 ± 0.0
3.309ProVal: 3.309 ± 0.0
0.331ProTrp: 0.331 ± 0.0
1.985ProTyr: 1.985 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.316GlnAla: 2.316 ± 0.0
0.662GlnCys: 0.662 ± 0.0
0.662GlnAsp: 0.662 ± 0.0
2.316GlnGlu: 2.316 ± 0.0
1.985GlnPhe: 1.985 ± 0.0
1.655GlnGly: 1.655 ± 0.0
1.324GlnHis: 1.324 ± 0.0
2.316GlnIle: 2.316 ± 0.0
1.655GlnLys: 1.655 ± 0.0
3.971GlnLeu: 3.971 ± 0.0
0.331GlnMet: 0.331 ± 0.0
1.324GlnAsn: 1.324 ± 0.0
1.324GlnPro: 1.324 ± 0.0
0.993GlnGln: 0.993 ± 0.0
3.64GlnArg: 3.64 ± 0.0
1.985GlnSer: 1.985 ± 0.0
2.978GlnThr: 2.978 ± 0.0
2.647GlnVal: 2.647 ± 0.0
0.331GlnTrp: 0.331 ± 0.0
3.309GlnTyr: 3.309 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.64ArgAla: 3.64 ± 0.0
0.993ArgCys: 0.993 ± 0.0
3.971ArgAsp: 3.971 ± 0.0
3.971ArgGlu: 3.971 ± 0.0
1.985ArgPhe: 1.985 ± 0.0
2.978ArgGly: 2.978 ± 0.0
0.662ArgHis: 0.662 ± 0.0
4.633ArgIle: 4.633 ± 0.0
4.964ArgLys: 4.964 ± 0.0
3.971ArgLeu: 3.971 ± 0.0
1.655ArgMet: 1.655 ± 0.0
1.655ArgAsn: 1.655 ± 0.0
4.302ArgPro: 4.302 ± 0.0
2.316ArgGln: 2.316 ± 0.0
5.956ArgArg: 5.956 ± 0.0
3.309ArgSer: 3.309 ± 0.0
2.978ArgThr: 2.978 ± 0.0
2.978ArgVal: 2.978 ± 0.0
0.331ArgTrp: 0.331 ± 0.0
2.316ArgTyr: 2.316 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.64SerAla: 3.64 ± 0.0
1.655SerCys: 1.655 ± 0.0
1.324SerAsp: 1.324 ± 0.0
6.287SerGlu: 6.287 ± 0.0
3.309SerPhe: 3.309 ± 0.0
2.978SerGly: 2.978 ± 0.0
0.993SerHis: 0.993 ± 0.0
4.302SerIle: 4.302 ± 0.0
4.964SerLys: 4.964 ± 0.0
5.956SerLeu: 5.956 ± 0.0
1.985SerMet: 1.985 ± 0.0
3.64SerAsn: 3.64 ± 0.0
4.633SerPro: 4.633 ± 0.0
1.985SerGln: 1.985 ± 0.0
3.309SerArg: 3.309 ± 0.0
5.295SerSer: 5.295 ± 0.0
4.633SerThr: 4.633 ± 0.0
4.964SerVal: 4.964 ± 0.0
0.993SerTrp: 0.993 ± 0.0
2.647SerTyr: 2.647 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.309ThrAla: 3.309 ± 0.0
0.331ThrCys: 0.331 ± 0.0
2.316ThrAsp: 2.316 ± 0.0
3.971ThrGlu: 3.971 ± 0.0
2.647ThrPhe: 2.647 ± 0.0
3.971ThrGly: 3.971 ± 0.0
0.993ThrHis: 0.993 ± 0.0
2.978ThrIle: 2.978 ± 0.0
1.655ThrLys: 1.655 ± 0.0
4.964ThrLeu: 4.964 ± 0.0
1.324ThrMet: 1.324 ± 0.0
2.316ThrAsn: 2.316 ± 0.0
5.956ThrPro: 5.956 ± 0.0
1.985ThrGln: 1.985 ± 0.0
3.971ThrArg: 3.971 ± 0.0
4.302ThrSer: 4.302 ± 0.0
3.64ThrThr: 3.64 ± 0.0
3.971ThrVal: 3.971 ± 0.0
0.662ThrTrp: 0.662 ± 0.0
1.655ThrTyr: 1.655 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
3.64ValAla: 3.64 ± 0.0
0.331ValCys: 0.331 ± 0.0
4.633ValAsp: 4.633 ± 0.0
5.625ValGlu: 5.625 ± 0.0
1.324ValPhe: 1.324 ± 0.0
2.647ValGly: 2.647 ± 0.0
1.655ValHis: 1.655 ± 0.0
4.633ValIle: 4.633 ± 0.0
4.633ValLys: 4.633 ± 0.0
4.633ValLeu: 4.633 ± 0.0
2.316ValMet: 2.316 ± 0.0
3.971ValAsn: 3.971 ± 0.0
2.316ValPro: 2.316 ± 0.0
3.309ValGln: 3.309 ± 0.0
4.302ValArg: 4.302 ± 0.0
6.949ValSer: 6.949 ± 0.0
2.647ValThr: 2.647 ± 0.0
2.647ValVal: 2.647 ± 0.0
0.993ValTrp: 0.993 ± 0.0
3.309ValTyr: 3.309 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.324TrpAla: 1.324 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.662TrpAsp: 0.662 ± 0.0
0.662TrpGlu: 0.662 ± 0.0
0.662TrpPhe: 0.662 ± 0.0
0.331TrpGly: 0.331 ± 0.0
0.993TrpHis: 0.993 ± 0.0
0.662TrpIle: 0.662 ± 0.0
0.993TrpLys: 0.993 ± 0.0
1.985TrpLeu: 1.985 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.331TrpAsn: 0.331 ± 0.0
0.331TrpPro: 0.331 ± 0.0
0.331TrpGln: 0.331 ± 0.0
0.662TrpArg: 0.662 ± 0.0
1.655TrpSer: 1.655 ± 0.0
0.331TrpThr: 0.331 ± 0.0
0.331TrpVal: 0.331 ± 0.0
0.331TrpTrp: 0.331 ± 0.0
0.662TrpTyr: 0.662 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.309TyrAla: 3.309 ± 0.0
0.993TyrCys: 0.993 ± 0.0
2.316TyrAsp: 2.316 ± 0.0
2.316TyrGlu: 2.316 ± 0.0
1.655TyrPhe: 1.655 ± 0.0
1.985TyrGly: 1.985 ± 0.0
1.324TyrHis: 1.324 ± 0.0
2.647TyrIle: 2.647 ± 0.0
0.0TyrLys: 0.0 ± 0.0
3.309TyrLeu: 3.309 ± 0.0
1.324TyrMet: 1.324 ± 0.0
0.993TyrAsn: 0.993 ± 0.0
1.985TyrPro: 1.985 ± 0.0
1.655TyrGln: 1.655 ± 0.0
3.309TyrArg: 3.309 ± 0.0
2.978TyrSer: 2.978 ± 0.0
2.316TyrThr: 2.316 ± 0.0
1.655TyrVal: 1.655 ± 0.0
1.655TyrTrp: 1.655 ± 0.0
2.316TyrTyr: 2.316 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3023 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski