Amino acid dipeptide frequency for Sanxia water strider virus 8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
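As a minimal sketch of how such a frequency can be computed, the snippet below counts overlapping residue pairs and reports them in per mille. It assumes one-letter residue codes and overlapping windows; the function name dipeptide_per_mille and the toy sequence are hypothetical and are not taken from Proteome-pI. The overlapping-window assumption is at least consistent with this page: one occurrence among 2925 pairs (2926 residues minus 1) gives 0.342‰, the smallest non-zero value reported.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: dipeptides are counted with a sliding window of length 2,
    so a protein of n residues contributes n - 1 dipeptides.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequence (not the viral protein): 9 residues, 8 dipeptides.
freqs = dipeptide_per_mille(["MKVLAAGLV"])
print(freqs["AA"])
```

Each of the eight dipeptides in the toy sequence occurs once, so every entry is one eighth of the total.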

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
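The uniform expectation and the over/under classification can be sketched as follows. This is an illustration under the stated uniform-random assumption, not the page's actual colouring code; the names expected_per_mille and classify are hypothetical, and the two sample values (ThrSer at 8.889‰, AlaMet at 0.342‰) are taken from the table.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation: every ordered pair is equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, expected=None):
    """Label a dipeptide relative to the uniform expectation."""
    if expected is None:
        expected = expected_per_mille()
    if observed_per_mille > expected:
        return "over"    # would be marked in red
    if observed_per_mille < expected:
        return "under"   # would be marked in blue
    return "as expected"


print(expected_per_mille())      # 20 x 20 = 400 dipeptides
print(expected_per_mille(21))    # 21 x 21 = 441 dipeptides, with Xaa
print(classify(8.889))           # ThrSer
print(classify(0.342))           # AlaMet
```

With 20 residue types the expectation is 2.5‰; counting Xaa as a 21st type lowers it to 1000/441‰.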

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
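For illustration, the conversion can be written out as below. The helper names are hypothetical, and the total of 2925 dipeptides is an assumption: it supposes overlapping pairs within the single 2926-residue protein reported on this page.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def per_mille_to_count(value, n_dipeptides):
    """Recover an approximate occurrence count from a rounded per mille value."""
    return round(value * 1e-3 * n_dipeptides)


# Assumed total: 2926 residues in one protein -> 2925 overlapping dipeptides.
N_DIPEPTIDES = 2925
print(per_mille_to_fraction(2.5))
print(per_mille_to_count(8.889, N_DIPEPTIDES))  # ThrSer
print(per_mille_to_count(0.342, N_DIPEPTIDES))  # AlaMet
```

Under that assumption, 8.889‰ corresponds to 26 occurrences and 0.342‰ to a single occurrence.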

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns: second residue. Values in ‰; the reported error is ± 0.0 for every entry.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  2.393  1.026  2.735  1.709  2.051  3.077  0.684  5.128  2.051  6.496  0.342  3.077  2.393  4.444  1.368  4.103  5.128  5.812  1.368  2.393    0.0
Cys  0.684  0.684  0.342  0.684  0.684  1.368  0.342  1.368  0.684  1.709  0.342  2.051  0.684  1.026  1.368  1.709  1.026  1.026    0.0  0.684    0.0
Asp  1.026  0.342  2.735  3.077  2.051  2.735  1.026  1.709  3.419  6.154  1.368  2.393  2.051  2.393  1.368  1.709  5.128   5.47    0.0  3.077    0.0
Glu  6.154  2.051  4.103  1.709  3.761  3.761    0.0  2.051  4.103  4.444  2.393  2.051  2.735  3.077  2.735  3.419  1.709  3.077  1.709  1.709    0.0
Phe  1.026  1.026  0.684  1.368  1.026  2.051  2.735  2.735  2.393  2.393  1.709  2.735  2.393  2.051  1.368  1.368  4.103  4.444  1.709  1.368    0.0
Gly  1.709  1.026  3.077  3.761  3.419  2.051  0.684  4.786  2.393  4.444  0.684  2.051  3.419  1.709  1.709  3.761  5.812  6.838  1.026  4.444    0.0
His  1.368  0.342  1.026  0.684  0.684  1.368  0.684  0.342  1.026  2.735  0.342  1.026  0.684  0.342  1.026  0.684  1.709  2.051    0.0  1.368    0.0
Ile  5.128  2.051  3.419  2.393  1.368   5.47  0.684  2.735  3.419  4.103  1.709  2.393  5.128  3.419  0.342  3.761  3.761  6.496  0.684  2.393    0.0
Lys  2.051  1.368  1.709  3.761  3.419  2.393  1.709  3.761  2.051  4.103  1.026  2.393  1.026  1.709  3.077  5.128  2.735  4.103  1.026  2.051    0.0
Leu  4.786  2.735  2.735   5.47  1.368  3.761  1.709  4.786  4.103  7.521  1.709  3.077  2.051  3.077   5.47  4.786  5.812  8.547  0.684  4.444    0.0
Met  1.709  0.684  3.077  1.368  1.026  0.684  1.368  1.026  1.026  1.709  0.342  1.368  0.342  0.342  1.709  2.051  2.735  1.026  0.342  1.026    0.0
Asn  3.761  0.684  1.368  3.419  5.812  3.419  1.026  1.709  3.077  3.419  2.051  3.077  3.077  2.393  2.051  2.393  3.077  4.103  0.342  1.368    0.0
Pro  2.393  0.684    0.0  3.077  1.709  2.051  1.709  2.735  1.709  5.128  1.368  2.051  0.684  1.709  3.419  1.709  6.838  3.077  0.342  2.735    0.0
Gln  2.393    0.0  3.419  2.393  1.368  1.709  0.684  3.077  2.393  3.761  1.368  2.735  1.368  1.368  1.709  4.786  2.393  3.761  1.709  1.368    0.0
Arg  2.051  0.342  2.393  1.368  3.077  1.368  0.684  1.368  4.444  2.051  1.709  3.761  3.419  2.051  4.444  1.368  2.393  4.444  0.684  3.419    0.0
Ser  6.154    0.0  3.419  6.496  2.051  5.812  0.684  5.812  2.735  3.761  1.026  3.761  3.761  2.735  2.051  6.496  4.444   5.47  1.709  1.368    0.0
Thr  4.444  1.026  4.786  2.393  2.735   5.47    0.0  6.496  3.077  4.786  2.051  3.761  2.735  2.393  3.419  8.889  7.179  7.179  1.368  2.735    0.0
Val  5.812  2.051  3.761  6.496  2.393  5.812  2.393  6.154  3.077  6.154  2.393  4.786  4.444  4.103  4.444  7.521  4.786  7.863  2.393  2.051    0.0
Trp  1.026    0.0  1.026  1.709  0.342  1.368    0.0  0.684  2.051  1.026  0.342  1.026  0.342  0.684  0.684  1.709  1.709  1.026  0.684  0.342    0.0
Tyr  1.709  0.342  3.419  3.419  0.684  3.077  0.684  2.393  1.368  2.735  0.342  2.051  2.393  2.735  3.077  2.393  4.444  3.077    0.0  2.735    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (2926 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
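The page does not describe the bootstrap procedure in detail, so the following is only a sketch under assumptions: whole proteins are resampled with replacement, the frequency is recomputed on each of 100 resamples, and the standard deviation of those estimates is reported as the error. All names and the toy sequences are hypothetical. Under these assumptions a proteome with a single protein always resamples to itself, which would explain why every error on this page is ± 0.0.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each round draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        resample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(resample, pair))
    return statistics.pstdev(estimates)


# One protein: every resample is identical, so the spread is zero.
print(bootstrap_error(["MKVLAAGLV"], "AA"))
# Two different (toy) proteins: resamples differ, so the spread is non-zero.
print(bootstrap_error(["MKVLAAGLV", "GGGGGG"], "AA"))
```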

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski