Amino acid dipeptide frequency for Hot pepper alphaendornavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. the frequency of each ordered pair of adjacent amino acids.
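A minimal sketch of such a calculation is given below. It assumes one-letter amino acid codes and overlapping windows (every pair of adjacent residues is counted); the function name `dipeptide_per_mille` and the toy sequence are illustrative only, and the database's own pipeline may differ in detail.

```python
from collections import Counter


def dipeptide_per_mille(sequence):
    """Return {dipeptide: frequency in per mille} for one sequence.

    Assumption: overlapping windows, so a sequence of length n
    contributes n - 1 dipeptides.
    """
    seq = sequence.upper()
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    total = len(pairs)
    if total == 0:
        return {}
    counts = Counter(pairs)
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequence (not from the virus proteome): 7 residues, 6 dipeptides.
freqs = dipeptide_per_mille("MKLLKLA")
```

Here `freqs["KL"]` corresponds to 2 of the 6 windows, and all frequencies together sum to 1000‰.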

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
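The expected value can be reproduced with a short sketch. The helper names are hypothetical, and the comparison against a uniform expectation is an assumption: the page does not state the exact threshold used for the red and blue marking.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, alphabet_size=20):
    """Compare an observed frequency with the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


uniform_20 = expected_per_mille(20)   # 400 dipeptides -> 2.5 per mille
uniform_21 = expected_per_mille(21)   # 441 dipeptides, Xaa included
lys_leu = classify(10.649)            # LysLeu value from the table
phe_phe = classify(0.205)             # PhePhe value from the table
```

Under this assumption LysLeu (10.649‰) counts as over-represented and PhePhe (0.205‰) as underrepresented.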

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
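As a worked example of the conversion, the sketch below turns a per mille value into a fraction and an approximate count. It assumes overlapping windows, so the single 4884-residue protein reported in the statistics would yield 4883 dipeptides; that window count and the function names are assumptions, not something the page states.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction."""
    return value_per_mille * 1e-3


def approx_count(value_per_mille, n_dipeptides):
    """Approximate number of occurrences behind a per mille value."""
    return round(per_mille_to_fraction(value_per_mille) * n_dipeptides)


# Assumption: one protein of 4884 residues -> 4883 overlapping dipeptides.
N_DIPEPTIDES = 4884 - 1

lys_leu_fraction = per_mille_to_fraction(10.649)   # about 0.010649
lys_leu_count = approx_count(10.649, N_DIPEPTIDES)
phe_phe_count = approx_count(0.205, N_DIPEPTIDES)
```

With these assumptions, 10.649‰ corresponds to roughly 52 occurrences of LysLeu, and 0.205‰ to a single occurrence.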

For more information, see the sequence space article on Wikipedia.

Rows: first amino acid of the dipeptide; columns: second amino acid. Values in ‰; the estimated error is ± 0.0 for every dipeptide.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  3.072  1.024  3.072  3.481  1.229  2.867  1.434  3.277  4.096  6.758  2.048  3.686  1.229  2.253  2.253  2.458  3.481  3.891  1.229  1.229    0.0
Cys  2.048  1.638  1.638  0.819   0.41  1.229  0.819  1.638  2.253  2.253  0.819  0.614  0.819  0.614  1.024  1.434  0.819  2.458    0.0  1.024    0.0
Asp  2.458  1.843  3.481  3.481  1.638  3.891  0.614  3.481  3.481  4.915  1.229  2.458  2.048  2.458  2.662  2.048  2.867  4.505  1.843  1.024    0.0
Glu  3.277  2.867  3.277   4.71  1.638  2.458  2.253  3.277  3.072 10.444  2.458  1.229  3.891  3.277  3.481  2.867  3.277  4.505  1.434  1.229    0.0
Phe  1.843  1.024  2.048  3.072  0.205  2.867    0.0  1.024  3.072  1.229  0.614  1.024  0.614  0.819   0.41  3.072  2.458  1.843  0.819  0.819    0.0
Gly  2.867  1.229  3.891  4.505  1.843  4.915  1.434  2.048  4.301  5.529  2.458  3.891  2.662  1.434  1.229  2.867  3.481  4.301   0.41  1.843    0.0
His  1.434  0.614  1.638  2.048  0.819  2.458  0.819  1.638  2.458  1.024  1.024  1.843  1.024  1.229  1.024  1.229  1.638  1.638  0.614  0.205    0.0
Ile  4.301  1.434  3.686  4.301  1.024  3.277  1.024  3.481  4.301   4.71  2.458  2.867  1.024  2.867  2.048   4.71  4.915  4.096  0.205  1.024    0.0
Lys  2.867  1.638  1.024  4.096  4.301  1.843  2.048  5.529  3.481 10.649  2.253  2.458  2.253  3.891  2.458  3.481  3.481  5.529  0.819  3.891    0.0
Leu  5.734  1.843  5.529  5.325  2.048  5.734  2.048   5.12  6.144  8.396  2.253  6.758   5.12  6.349  5.734  6.144  9.216  6.553  2.048  2.048    0.0
Met  1.638   0.41  1.434  1.843  2.048  2.048  1.229  2.253  1.843  4.301  0.819  1.229  1.843  1.434  1.434  3.072  2.048  2.253    0.0  1.024    0.0
Asn  2.048  1.434  3.072  3.481  1.024  2.048   0.41  2.662  3.686  5.325  3.072  3.686  1.843  2.253  3.072  3.686  2.867  4.505  0.614  1.843    0.0
Pro  2.253  1.229  2.458  3.072  0.819  2.048  2.253  3.277  2.253  3.072  2.458  3.891  1.638  2.253  1.024  2.867  2.662  1.024  0.819  1.024    0.0
Gln  1.638  1.229  2.458  2.253  2.048  2.048  1.434  2.458  2.662  6.349  0.614  2.458  4.096  2.867  1.434  3.481  2.458  3.481  1.024  1.229    0.0
Arg  3.072   0.41  1.229  3.277  1.434  2.048  0.819  2.253  2.662  4.915  0.614  2.048  2.458  1.843  1.843  2.458  2.867  3.072  1.638  1.229    0.0
Ser  3.072  0.819  2.458  3.481  1.229  3.481  2.048  3.277   4.71  4.915  2.458  3.072  1.434  3.072  2.662  4.915  5.325  4.505  1.638  2.253    0.0
Thr  3.072  1.843  3.277  3.891  1.024  4.915  2.048  5.734  4.301  6.144  2.662  2.662  3.072  3.481  3.072  3.891  7.577  3.891  0.614  1.843    0.0
Val  4.301  1.434  4.505  5.529  1.843  3.891  1.638  4.301  4.915  6.144  2.253  3.891  3.072  3.072  3.891  2.867  5.529  4.096  1.434  1.843    0.0
Trp  0.819  0.205  0.614  1.229  1.229  0.819  0.614   0.41  1.434  1.843  1.024  0.614   0.41  1.229  0.614  1.229  1.024  1.229   0.41  0.819    0.0
Tyr  2.253  0.205  1.638  1.024  1.024  2.253  1.434  0.614  2.867  1.229  0.614  2.253  1.843  1.024  0.819  2.048  1.024  3.072  0.205  1.229    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (4884 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
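One plausible reading of this note is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The exact procedure used by the database is not described here, so the function names, the use of the population standard deviation and the fixed seed are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_fraction(proteins, dipeptide):
    """Fraction of all overlapping dipeptides in `proteins` equal to `dipeptide`."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread (in per mille) of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(1000.0 * dipeptide_fraction(sample, dipeptide))
    return statistics.pstdev(estimates)


# Toy data: with a single protein every resample is identical.
single_protein_error = bootstrap_error(["MKLLKLA"], "KL")
two_protein_error = bootstrap_error(["MKLLKLA", "AAAAAAA"], "KL")
```

With only one protein in the proteome, every resample contains the same sequence, so the estimated spread is zero; this would be consistent with the ± 0.0 errors reported above.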

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski