Amino acid dipepetide frequency for Sanxia water strider virus 9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.214AlaAla: 3.214 ± 0.0
0.357AlaCys: 0.357 ± 0.0
5.357AlaAsp: 5.357 ± 0.0
5.0AlaGlu: 5.0 ± 0.0
2.5AlaPhe: 2.5 ± 0.0
3.929AlaGly: 3.929 ± 0.0
0.0AlaHis: 0.0 ± 0.0
4.286AlaIle: 4.286 ± 0.0
4.286AlaLys: 4.286 ± 0.0
4.643AlaLeu: 4.643 ± 0.0
1.429AlaMet: 1.429 ± 0.0
3.214AlaAsn: 3.214 ± 0.0
4.286AlaPro: 4.286 ± 0.0
3.214AlaGln: 3.214 ± 0.0
3.571AlaArg: 3.571 ± 0.0
2.5AlaSer: 2.5 ± 0.0
5.357AlaThr: 5.357 ± 0.0
4.286AlaVal: 4.286 ± 0.0
0.714AlaTrp: 0.714 ± 0.0
3.214AlaTyr: 3.214 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.071CysAla: 1.071 ± 0.0
0.357CysCys: 0.357 ± 0.0
1.786CysAsp: 1.786 ± 0.0
0.357CysGlu: 0.357 ± 0.0
1.071CysPhe: 1.071 ± 0.0
1.071CysGly: 1.071 ± 0.0
0.357CysHis: 0.357 ± 0.0
1.071CysIle: 1.071 ± 0.0
0.714CysLys: 0.714 ± 0.0
2.143CysLeu: 2.143 ± 0.0
0.714CysMet: 0.714 ± 0.0
1.071CysAsn: 1.071 ± 0.0
0.357CysPro: 0.357 ± 0.0
0.714CysGln: 0.714 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.714CysSer: 0.714 ± 0.0
0.714CysThr: 0.714 ± 0.0
1.071CysVal: 1.071 ± 0.0
0.357CysTrp: 0.357 ± 0.0
0.714CysTyr: 0.714 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.429AspAla: 6.429 ± 0.0
1.071AspCys: 1.071 ± 0.0
4.286AspAsp: 4.286 ± 0.0
4.286AspGlu: 4.286 ± 0.0
3.214AspPhe: 3.214 ± 0.0
4.286AspGly: 4.286 ± 0.0
0.0AspHis: 0.0 ± 0.0
5.714AspIle: 5.714 ± 0.0
3.929AspLys: 3.929 ± 0.0
3.929AspLeu: 3.929 ± 0.0
1.786AspMet: 1.786 ± 0.0
3.929AspAsn: 3.929 ± 0.0
1.786AspPro: 1.786 ± 0.0
2.5AspGln: 2.5 ± 0.0
1.429AspArg: 1.429 ± 0.0
2.5AspSer: 2.5 ± 0.0
4.643AspThr: 4.643 ± 0.0
2.5AspVal: 2.5 ± 0.0
1.429AspTrp: 1.429 ± 0.0
1.071AspTyr: 1.071 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.571GluAla: 3.571 ± 0.0
1.429GluCys: 1.429 ± 0.0
3.929GluAsp: 3.929 ± 0.0
7.857GluGlu: 7.857 ± 0.0
3.929GluPhe: 3.929 ± 0.0
4.286GluGly: 4.286 ± 0.0
0.357GluHis: 0.357 ± 0.0
4.643GluIle: 4.643 ± 0.0
5.0GluLys: 5.0 ± 0.0
5.0GluLeu: 5.0 ± 0.0
3.214GluMet: 3.214 ± 0.0
2.5GluAsn: 2.5 ± 0.0
1.071GluPro: 1.071 ± 0.0
7.143GluGln: 7.143 ± 0.0
2.5GluArg: 2.5 ± 0.0
3.214GluSer: 3.214 ± 0.0
2.857GluThr: 2.857 ± 0.0
2.5GluVal: 2.5 ± 0.0
2.143GluTrp: 2.143 ± 0.0
1.786GluTyr: 1.786 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.5PheAla: 2.5 ± 0.0
1.071PheCys: 1.071 ± 0.0
3.214PheAsp: 3.214 ± 0.0
3.571PheGlu: 3.571 ± 0.0
3.214PhePhe: 3.214 ± 0.0
2.143PheGly: 2.143 ± 0.0
1.786PheHis: 1.786 ± 0.0
2.143PheIle: 2.143 ± 0.0
1.071PheLys: 1.071 ± 0.0
3.571PheLeu: 3.571 ± 0.0
1.071PheMet: 1.071 ± 0.0
3.214PheAsn: 3.214 ± 0.0
0.357PhePro: 0.357 ± 0.0
0.357PheGln: 0.357 ± 0.0
3.571PheArg: 3.571 ± 0.0
3.571PheSer: 3.571 ± 0.0
3.571PheThr: 3.571 ± 0.0
5.0PheVal: 5.0 ± 0.0
0.357PheTrp: 0.357 ± 0.0
2.857PheTyr: 2.857 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.929GlyAla: 3.929 ± 0.0
0.0GlyCys: 0.0 ± 0.0
5.0GlyAsp: 5.0 ± 0.0
5.357GlyGlu: 5.357 ± 0.0
3.571GlyPhe: 3.571 ± 0.0
2.857GlyGly: 2.857 ± 0.0
0.0GlyHis: 0.0 ± 0.0
5.357GlyIle: 5.357 ± 0.0
5.714GlyLys: 5.714 ± 0.0
6.071GlyLeu: 6.071 ± 0.0
0.357GlyMet: 0.357 ± 0.0
3.214GlyAsn: 3.214 ± 0.0
0.714GlyPro: 0.714 ± 0.0
2.857GlyGln: 2.857 ± 0.0
2.857GlyArg: 2.857 ± 0.0
3.571GlySer: 3.571 ± 0.0
4.643GlyThr: 4.643 ± 0.0
2.857GlyVal: 2.857 ± 0.0
1.429GlyTrp: 1.429 ± 0.0
1.429GlyTyr: 1.429 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.357HisAla: 0.357 ± 0.0
0.357HisCys: 0.357 ± 0.0
0.714HisAsp: 0.714 ± 0.0
0.714HisGlu: 0.714 ± 0.0
1.786HisPhe: 1.786 ± 0.0
2.143HisGly: 2.143 ± 0.0
0.357HisHis: 0.357 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.714HisLys: 0.714 ± 0.0
1.071HisLeu: 1.071 ± 0.0
0.0HisMet: 0.0 ± 0.0
1.786HisAsn: 1.786 ± 0.0
0.714HisPro: 0.714 ± 0.0
1.071HisGln: 1.071 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.786HisSer: 1.786 ± 0.0
1.786HisThr: 1.786 ± 0.0
1.429HisVal: 1.429 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.714HisTyr: 0.714 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.286IleAla: 4.286 ± 0.0
1.429IleCys: 1.429 ± 0.0
4.286IleAsp: 4.286 ± 0.0
3.929IleGlu: 3.929 ± 0.0
2.857IlePhe: 2.857 ± 0.0
3.214IleGly: 3.214 ± 0.0
2.857IleHis: 2.857 ± 0.0
4.643IleIle: 4.643 ± 0.0
2.857IleLys: 2.857 ± 0.0
3.571IleLeu: 3.571 ± 0.0
1.786IleMet: 1.786 ± 0.0
6.071IleAsn: 6.071 ± 0.0
4.286IlePro: 4.286 ± 0.0
2.857IleGln: 2.857 ± 0.0
2.857IleArg: 2.857 ± 0.0
2.143IleSer: 2.143 ± 0.0
5.0IleThr: 5.0 ± 0.0
5.0IleVal: 5.0 ± 0.0
0.357IleTrp: 0.357 ± 0.0
3.214IleTyr: 3.214 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.214LysAla: 3.214 ± 0.0
1.071LysCys: 1.071 ± 0.0
3.929LysAsp: 3.929 ± 0.0
3.214LysGlu: 3.214 ± 0.0
2.857LysPhe: 2.857 ± 0.0
3.929LysGly: 3.929 ± 0.0
0.357LysHis: 0.357 ± 0.0
7.143LysIle: 7.143 ± 0.0
3.214LysLys: 3.214 ± 0.0
3.214LysLeu: 3.214 ± 0.0
1.429LysMet: 1.429 ± 0.0
1.071LysAsn: 1.071 ± 0.0
2.5LysPro: 2.5 ± 0.0
2.857LysGln: 2.857 ± 0.0
2.143LysArg: 2.143 ± 0.0
3.214LysSer: 3.214 ± 0.0
1.786LysThr: 1.786 ± 0.0
4.643LysVal: 4.643 ± 0.0
1.429LysTrp: 1.429 ± 0.0
3.929LysTyr: 3.929 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
5.357LeuAla: 5.357 ± 0.0
1.429LeuCys: 1.429 ± 0.0
3.929LeuAsp: 3.929 ± 0.0
2.857LeuGlu: 2.857 ± 0.0
2.857LeuPhe: 2.857 ± 0.0
4.286LeuGly: 4.286 ± 0.0
1.429LeuHis: 1.429 ± 0.0
2.857LeuIle: 2.857 ± 0.0
5.0LeuLys: 5.0 ± 0.0
3.571LeuLeu: 3.571 ± 0.0
2.5LeuMet: 2.5 ± 0.0
5.357LeuAsn: 5.357 ± 0.0
4.286LeuPro: 4.286 ± 0.0
4.286LeuGln: 4.286 ± 0.0
4.286LeuArg: 4.286 ± 0.0
3.214LeuSer: 3.214 ± 0.0
7.143LeuThr: 7.143 ± 0.0
7.5LeuVal: 7.5 ± 0.0
0.357LeuTrp: 0.357 ± 0.0
3.929LeuTyr: 3.929 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
0.714MetAla: 0.714 ± 0.0
1.071MetCys: 1.071 ± 0.0
2.143MetAsp: 2.143 ± 0.0
2.143MetGlu: 2.143 ± 0.0
2.143MetPhe: 2.143 ± 0.0
1.071MetGly: 1.071 ± 0.0
1.429MetHis: 1.429 ± 0.0
2.143MetIle: 2.143 ± 0.0
0.714MetLys: 0.714 ± 0.0
2.143MetLeu: 2.143 ± 0.0
0.714MetMet: 0.714 ± 0.0
1.786MetAsn: 1.786 ± 0.0
1.071MetPro: 1.071 ± 0.0
0.357MetGln: 0.357 ± 0.0
1.786MetArg: 1.786 ± 0.0
2.143MetSer: 2.143 ± 0.0
1.786MetThr: 1.786 ± 0.0
1.071MetVal: 1.071 ± 0.0
0.357MetTrp: 0.357 ± 0.0
1.429MetTyr: 1.429 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.214AsnAla: 3.214 ± 0.0
1.071AsnCys: 1.071 ± 0.0
2.143AsnAsp: 2.143 ± 0.0
2.857AsnGlu: 2.857 ± 0.0
2.5AsnPhe: 2.5 ± 0.0
3.571AsnGly: 3.571 ± 0.0
0.714AsnHis: 0.714 ± 0.0
3.571AsnIle: 3.571 ± 0.0
2.857AsnLys: 2.857 ± 0.0
4.643AsnLeu: 4.643 ± 0.0
2.143AsnMet: 2.143 ± 0.0
2.857AsnAsn: 2.857 ± 0.0
3.571AsnPro: 3.571 ± 0.0
3.571AsnGln: 3.571 ± 0.0
3.571AsnArg: 3.571 ± 0.0
3.214AsnSer: 3.214 ± 0.0
1.786AsnThr: 1.786 ± 0.0
4.286AsnVal: 4.286 ± 0.0
1.429AsnTrp: 1.429 ± 0.0
2.5AsnTyr: 2.5 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.143ProAla: 2.143 ± 0.0
1.071ProCys: 1.071 ± 0.0
1.429ProAsp: 1.429 ± 0.0
2.5ProGlu: 2.5 ± 0.0
1.786ProPhe: 1.786 ± 0.0
1.786ProGly: 1.786 ± 0.0
1.071ProHis: 1.071 ± 0.0
2.143ProIle: 2.143 ± 0.0
2.143ProLys: 2.143 ± 0.0
3.571ProLeu: 3.571 ± 0.0
0.0ProMet: 0.0 ± 0.0
2.857ProAsn: 2.857 ± 0.0
1.786ProPro: 1.786 ± 0.0
3.571ProGln: 3.571 ± 0.0
0.714ProArg: 0.714 ± 0.0
2.5ProSer: 2.5 ± 0.0
2.143ProThr: 2.143 ± 0.0
4.643ProVal: 4.643 ± 0.0
0.714ProTrp: 0.714 ± 0.0
2.143ProTyr: 2.143 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.857GlnAla: 2.857 ± 0.0
0.714GlnCys: 0.714 ± 0.0
3.571GlnAsp: 3.571 ± 0.0
2.5GlnGlu: 2.5 ± 0.0
2.143GlnPhe: 2.143 ± 0.0
6.786GlnGly: 6.786 ± 0.0
0.0GlnHis: 0.0 ± 0.0
3.929GlnIle: 3.929 ± 0.0
3.214GlnLys: 3.214 ± 0.0
4.643GlnLeu: 4.643 ± 0.0
1.071GlnMet: 1.071 ± 0.0
1.786GlnAsn: 1.786 ± 0.0
0.714GlnPro: 0.714 ± 0.0
2.143GlnGln: 2.143 ± 0.0
1.429GlnArg: 1.429 ± 0.0
2.857GlnSer: 2.857 ± 0.0
1.786GlnThr: 1.786 ± 0.0
2.5GlnVal: 2.5 ± 0.0
0.714GlnTrp: 0.714 ± 0.0
2.143GlnTyr: 2.143 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
5.357ArgAla: 5.357 ± 0.0
0.714ArgCys: 0.714 ± 0.0
2.5ArgAsp: 2.5 ± 0.0
3.571ArgGlu: 3.571 ± 0.0
2.143ArgPhe: 2.143 ± 0.0
2.857ArgGly: 2.857 ± 0.0
1.786ArgHis: 1.786 ± 0.0
3.214ArgIle: 3.214 ± 0.0
3.571ArgLys: 3.571 ± 0.0
5.714ArgLeu: 5.714 ± 0.0
0.357ArgMet: 0.357 ± 0.0
3.214ArgAsn: 3.214 ± 0.0
1.786ArgPro: 1.786 ± 0.0
1.071ArgGln: 1.071 ± 0.0
2.5ArgArg: 2.5 ± 0.0
2.5ArgSer: 2.5 ± 0.0
0.357ArgThr: 0.357 ± 0.0
1.786ArgVal: 1.786 ± 0.0
0.714ArgTrp: 0.714 ± 0.0
2.5ArgTyr: 2.5 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.929SerAla: 3.929 ± 0.0
0.714SerCys: 0.714 ± 0.0
1.786SerAsp: 1.786 ± 0.0
3.571SerGlu: 3.571 ± 0.0
4.286SerPhe: 4.286 ± 0.0
3.214SerGly: 3.214 ± 0.0
2.143SerHis: 2.143 ± 0.0
4.643SerIle: 4.643 ± 0.0
2.143SerLys: 2.143 ± 0.0
4.643SerLeu: 4.643 ± 0.0
0.357SerMet: 0.357 ± 0.0
2.5SerAsn: 2.5 ± 0.0
2.5SerPro: 2.5 ± 0.0
2.143SerGln: 2.143 ± 0.0
3.214SerArg: 3.214 ± 0.0
2.857SerSer: 2.857 ± 0.0
3.929SerThr: 3.929 ± 0.0
1.786SerVal: 1.786 ± 0.0
0.714SerTrp: 0.714 ± 0.0
2.5SerTyr: 2.5 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.286ThrAla: 4.286 ± 0.0
0.714ThrCys: 0.714 ± 0.0
3.571ThrAsp: 3.571 ± 0.0
3.929ThrGlu: 3.929 ± 0.0
1.429ThrPhe: 1.429 ± 0.0
4.643ThrGly: 4.643 ± 0.0
0.714ThrHis: 0.714 ± 0.0
3.214ThrIle: 3.214 ± 0.0
2.143ThrLys: 2.143 ± 0.0
4.643ThrLeu: 4.643 ± 0.0
4.286ThrMet: 4.286 ± 0.0
3.571ThrAsn: 3.571 ± 0.0
2.857ThrPro: 2.857 ± 0.0
1.786ThrGln: 1.786 ± 0.0
3.929ThrArg: 3.929 ± 0.0
4.286ThrSer: 4.286 ± 0.0
4.286ThrThr: 4.286 ± 0.0
5.0ThrVal: 5.0 ± 0.0
0.357ThrTrp: 0.357 ± 0.0
2.143ThrTyr: 2.143 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
5.714ValAla: 5.714 ± 0.0
0.714ValCys: 0.714 ± 0.0
3.214ValAsp: 3.214 ± 0.0
4.286ValGlu: 4.286 ± 0.0
1.429ValPhe: 1.429 ± 0.0
3.214ValGly: 3.214 ± 0.0
1.071ValHis: 1.071 ± 0.0
5.357ValIle: 5.357 ± 0.0
4.643ValLys: 4.643 ± 0.0
4.643ValLeu: 4.643 ± 0.0
3.214ValMet: 3.214 ± 0.0
2.857ValAsn: 2.857 ± 0.0
2.5ValPro: 2.5 ± 0.0
2.857ValGln: 2.857 ± 0.0
4.286ValArg: 4.286 ± 0.0
3.571ValSer: 3.571 ± 0.0
2.857ValThr: 2.857 ± 0.0
2.5ValVal: 2.5 ± 0.0
0.714ValTrp: 0.714 ± 0.0
3.929ValTyr: 3.929 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.071TrpAla: 1.071 ± 0.0
0.357TrpCys: 0.357 ± 0.0
0.714TrpAsp: 0.714 ± 0.0
0.357TrpGlu: 0.357 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.357TrpGly: 0.357 ± 0.0
0.357TrpHis: 0.357 ± 0.0
0.714TrpIle: 0.714 ± 0.0
1.429TrpLys: 1.429 ± 0.0
1.429TrpLeu: 1.429 ± 0.0
0.357TrpMet: 0.357 ± 0.0
1.071TrpAsn: 1.071 ± 0.0
0.714TrpPro: 0.714 ± 0.0
0.357TrpGln: 0.357 ± 0.0
1.786TrpArg: 1.786 ± 0.0
0.714TrpSer: 0.714 ± 0.0
2.143TrpThr: 2.143 ± 0.0
0.714TrpVal: 0.714 ± 0.0
0.357TrpTrp: 0.357 ± 0.0
1.071TrpTyr: 1.071 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.143TyrAla: 2.143 ± 0.0
0.714TyrCys: 0.714 ± 0.0
3.214TyrAsp: 3.214 ± 0.0
6.071TyrGlu: 6.071 ± 0.0
2.857TyrPhe: 2.857 ± 0.0
2.143TyrGly: 2.143 ± 0.0
0.714TyrHis: 0.714 ± 0.0
0.714TyrIle: 0.714 ± 0.0
1.786TyrLys: 1.786 ± 0.0
3.571TyrLeu: 3.571 ± 0.0
1.071TyrMet: 1.071 ± 0.0
2.143TyrAsn: 2.143 ± 0.0
2.857TyrPro: 2.857 ± 0.0
1.786TyrGln: 1.786 ± 0.0
1.786TyrArg: 1.786 ± 0.0
2.5TyrSer: 2.5 ± 0.0
3.214TyrThr: 3.214 ± 0.0
2.5TyrVal: 2.5 ± 0.0
1.429TyrTrp: 1.429 ± 0.0
1.071TyrTyr: 1.071 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2801 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski