Amino acid dipepetide frequency for Pegivirus A (isolate Saguinus labiatus/-/GBV-A-lab/1996)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.812AlaAla: 12.812 ± 0.0
1.686AlaCys: 1.686 ± 0.0
1.686AlaAsp: 1.686 ± 0.0
6.406AlaGlu: 6.406 ± 0.0
2.023AlaPhe: 2.023 ± 0.0
10.115AlaGly: 10.115 ± 0.0
1.011AlaHis: 1.011 ± 0.0
3.372AlaIle: 3.372 ± 0.0
3.034AlaLys: 3.034 ± 0.0
11.8AlaLeu: 11.8 ± 0.0
2.36AlaMet: 2.36 ± 0.0
0.674AlaAsn: 0.674 ± 0.0
5.732AlaPro: 5.732 ± 0.0
1.349AlaGln: 1.349 ± 0.0
6.069AlaArg: 6.069 ± 0.0
7.08AlaSer: 7.08 ± 0.0
5.732AlaThr: 5.732 ± 0.0
6.406AlaVal: 6.406 ± 0.0
2.36AlaTrp: 2.36 ± 0.0
5.057AlaTyr: 5.057 ± 0.0
0.674AlaXaa: 0.674 ± 0.0
Cys
3.372CysAla: 3.372 ± 0.0
2.023CysCys: 2.023 ± 0.0
3.372CysAsp: 3.372 ± 0.0
1.686CysGlu: 1.686 ± 0.0
2.023CysPhe: 2.023 ± 0.0
4.046CysGly: 4.046 ± 0.0
1.349CysHis: 1.349 ± 0.0
1.011CysIle: 1.011 ± 0.0
0.674CysLys: 0.674 ± 0.0
2.697CysLeu: 2.697 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.011CysAsn: 1.011 ± 0.0
1.349CysPro: 1.349 ± 0.0
0.674CysGln: 0.674 ± 0.0
2.36CysArg: 2.36 ± 0.0
2.697CysSer: 2.697 ± 0.0
1.686CysThr: 1.686 ± 0.0
4.383CysVal: 4.383 ± 0.0
0.337CysTrp: 0.337 ± 0.0
0.337CysTyr: 0.337 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.023AspAla: 2.023 ± 0.0
2.023AspCys: 2.023 ± 0.0
2.023AspAsp: 2.023 ± 0.0
4.046AspGlu: 4.046 ± 0.0
2.697AspPhe: 2.697 ± 0.0
3.372AspGly: 3.372 ± 0.0
1.349AspHis: 1.349 ± 0.0
1.686AspIle: 1.686 ± 0.0
1.349AspLys: 1.349 ± 0.0
4.046AspLeu: 4.046 ± 0.0
1.349AspMet: 1.349 ± 0.0
1.011AspAsn: 1.011 ± 0.0
2.697AspPro: 2.697 ± 0.0
0.337AspGln: 0.337 ± 0.0
1.011AspArg: 1.011 ± 0.0
2.697AspSer: 2.697 ± 0.0
2.36AspThr: 2.36 ± 0.0
2.697AspVal: 2.697 ± 0.0
2.023AspTrp: 2.023 ± 0.0
1.011AspTyr: 1.011 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
9.44GluAla: 9.44 ± 0.0
2.697GluCys: 2.697 ± 0.0
1.349GluAsp: 1.349 ± 0.0
3.034GluGlu: 3.034 ± 0.0
0.0GluPhe: 0.0 ± 0.0
3.709GluGly: 3.709 ± 0.0
1.686GluHis: 1.686 ± 0.0
1.349GluIle: 1.349 ± 0.0
1.349GluLys: 1.349 ± 0.0
3.709GluLeu: 3.709 ± 0.0
0.674GluMet: 0.674 ± 0.0
0.337GluAsn: 0.337 ± 0.0
2.697GluPro: 2.697 ± 0.0
1.011GluGln: 1.011 ± 0.0
3.034GluArg: 3.034 ± 0.0
2.023GluSer: 2.023 ± 0.0
3.034GluThr: 3.034 ± 0.0
6.406GluVal: 6.406 ± 0.0
1.011GluTrp: 1.011 ± 0.0
1.686GluTyr: 1.686 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.697PheAla: 2.697 ± 0.0
2.023PheCys: 2.023 ± 0.0
3.372PheAsp: 3.372 ± 0.0
1.349PheGlu: 1.349 ± 0.0
1.686PhePhe: 1.686 ± 0.0
2.023PheGly: 2.023 ± 0.0
0.337PheHis: 0.337 ± 0.0
1.011PheIle: 1.011 ± 0.0
0.0PheLys: 0.0 ± 0.0
2.023PheLeu: 2.023 ± 0.0
0.337PheMet: 0.337 ± 0.0
0.337PheAsn: 0.337 ± 0.0
1.686PhePro: 1.686 ± 0.0
0.674PheGln: 0.674 ± 0.0
0.674PheArg: 0.674 ± 0.0
2.023PheSer: 2.023 ± 0.0
2.697PheThr: 2.697 ± 0.0
1.686PheVal: 1.686 ± 0.0
0.337PheTrp: 0.337 ± 0.0
1.686PheTyr: 1.686 ± 0.0
0.337PheXaa: 0.337 ± 0.0
Gly
7.417GlyAla: 7.417 ± 0.0
3.034GlyCys: 3.034 ± 0.0
4.72GlyAsp: 4.72 ± 0.0
3.709GlyGlu: 3.709 ± 0.0
3.709GlyPhe: 3.709 ± 0.0
7.755GlyGly: 7.755 ± 0.0
4.383GlyHis: 4.383 ± 0.0
2.36GlyIle: 2.36 ± 0.0
4.72GlyLys: 4.72 ± 0.0
5.057GlyLeu: 5.057 ± 0.0
1.349GlyMet: 1.349 ± 0.0
3.034GlyAsn: 3.034 ± 0.0
5.394GlyPro: 5.394 ± 0.0
3.372GlyGln: 3.372 ± 0.0
5.732GlyArg: 5.732 ± 0.0
6.743GlySer: 6.743 ± 0.0
6.069GlyThr: 6.069 ± 0.0
11.463GlyVal: 11.463 ± 0.0
3.372GlyTrp: 3.372 ± 0.0
1.686GlyTyr: 1.686 ± 0.0
0.337GlyXaa: 0.337 ± 0.0
His
2.023HisAla: 2.023 ± 0.0
0.674HisCys: 0.674 ± 0.0
2.023HisAsp: 2.023 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.674HisPhe: 0.674 ± 0.0
1.686HisGly: 1.686 ± 0.0
0.674HisHis: 0.674 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.674HisLys: 0.674 ± 0.0
2.36HisLeu: 2.36 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.674HisAsn: 0.674 ± 0.0
2.36HisPro: 2.36 ± 0.0
0.337HisGln: 0.337 ± 0.0
1.349HisArg: 1.349 ± 0.0
1.349HisSer: 1.349 ± 0.0
0.0HisThr: 0.0 ± 0.0
3.372HisVal: 3.372 ± 0.0
1.011HisTrp: 1.011 ± 0.0
1.349HisTyr: 1.349 ± 0.0
0.337HisXaa: 0.337 ± 0.0
Ile
2.023IleAla: 2.023 ± 0.0
0.674IleCys: 0.674 ± 0.0
0.337IleAsp: 0.337 ± 0.0
1.686IleGlu: 1.686 ± 0.0
1.686IlePhe: 1.686 ± 0.0
1.686IleGly: 1.686 ± 0.0
1.011IleHis: 1.011 ± 0.0
2.023IleIle: 2.023 ± 0.0
1.686IleLys: 1.686 ± 0.0
3.034IleLeu: 3.034 ± 0.0
1.011IleMet: 1.011 ± 0.0
0.674IleAsn: 0.674 ± 0.0
2.697IlePro: 2.697 ± 0.0
1.011IleGln: 1.011 ± 0.0
0.674IleArg: 0.674 ± 0.0
1.349IleSer: 1.349 ± 0.0
4.383IleThr: 4.383 ± 0.0
1.349IleVal: 1.349 ± 0.0
0.337IleTrp: 0.337 ± 0.0
0.674IleTyr: 0.674 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.046LysAla: 4.046 ± 0.0
1.349LysCys: 1.349 ± 0.0
0.674LysAsp: 0.674 ± 0.0
3.372LysGlu: 3.372 ± 0.0
0.337LysPhe: 0.337 ± 0.0
2.697LysGly: 2.697 ± 0.0
0.674LysHis: 0.674 ± 0.0
1.011LysIle: 1.011 ± 0.0
2.023LysLys: 2.023 ± 0.0
2.697LysLeu: 2.697 ± 0.0
1.011LysMet: 1.011 ± 0.0
0.0LysAsn: 0.0 ± 0.0
2.36LysPro: 2.36 ± 0.0
0.674LysGln: 0.674 ± 0.0
1.349LysArg: 1.349 ± 0.0
1.011LysSer: 1.011 ± 0.0
3.034LysThr: 3.034 ± 0.0
2.697LysVal: 2.697 ± 0.0
0.337LysTrp: 0.337 ± 0.0
1.011LysTyr: 1.011 ± 0.0
1.011LysXaa: 1.011 ± 0.0
Leu
9.777LeuAla: 9.777 ± 0.0
2.36LeuCys: 2.36 ± 0.0
4.383LeuAsp: 4.383 ± 0.0
4.72LeuGlu: 4.72 ± 0.0
2.36LeuPhe: 2.36 ± 0.0
12.138LeuGly: 12.138 ± 0.0
1.686LeuHis: 1.686 ± 0.0
2.697LeuIle: 2.697 ± 0.0
2.023LeuLys: 2.023 ± 0.0
14.16LeuLeu: 14.16 ± 0.0
1.349LeuMet: 1.349 ± 0.0
1.011LeuAsn: 1.011 ± 0.0
8.092LeuPro: 8.092 ± 0.0
1.349LeuGln: 1.349 ± 0.0
5.394LeuArg: 5.394 ± 0.0
4.046LeuSer: 4.046 ± 0.0
5.732LeuThr: 5.732 ± 0.0
10.452LeuVal: 10.452 ± 0.0
3.034LeuTrp: 3.034 ± 0.0
1.349LeuTyr: 1.349 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
3.372MetAla: 3.372 ± 0.0
0.674MetCys: 0.674 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.349MetGlu: 1.349 ± 0.0
0.674MetPhe: 0.674 ± 0.0
3.034MetGly: 3.034 ± 0.0
0.674MetHis: 0.674 ± 0.0
0.337MetIle: 0.337 ± 0.0
0.674MetLys: 0.674 ± 0.0
3.034MetLeu: 3.034 ± 0.0
0.337MetMet: 0.337 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.349MetPro: 1.349 ± 0.0
0.674MetGln: 0.674 ± 0.0
0.674MetArg: 0.674 ± 0.0
0.674MetSer: 0.674 ± 0.0
0.674MetThr: 0.674 ± 0.0
1.686MetVal: 1.686 ± 0.0
0.337MetTrp: 0.337 ± 0.0
0.674MetTyr: 0.674 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.023AsnAla: 2.023 ± 0.0
1.011AsnCys: 1.011 ± 0.0
1.011AsnAsp: 1.011 ± 0.0
0.0AsnGlu: 0.0 ± 0.0
0.674AsnPhe: 0.674 ± 0.0
3.709AsnGly: 3.709 ± 0.0
0.337AsnHis: 0.337 ± 0.0
0.337AsnIle: 0.337 ± 0.0
0.0AsnLys: 0.0 ± 0.0
1.349AsnLeu: 1.349 ± 0.0
0.337AsnMet: 0.337 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
1.686AsnPro: 1.686 ± 0.0
0.337AsnGln: 0.337 ± 0.0
1.349AsnArg: 1.349 ± 0.0
0.674AsnSer: 0.674 ± 0.0
0.0AsnThr: 0.0 ± 0.0
1.686AsnVal: 1.686 ± 0.0
0.674AsnTrp: 0.674 ± 0.0
0.674AsnTyr: 0.674 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
5.732ProAla: 5.732 ± 0.0
3.372ProCys: 3.372 ± 0.0
2.36ProAsp: 2.36 ± 0.0
2.697ProGlu: 2.697 ± 0.0
1.011ProPhe: 1.011 ± 0.0
6.743ProGly: 6.743 ± 0.0
1.349ProHis: 1.349 ± 0.0
2.36ProIle: 2.36 ± 0.0
1.011ProLys: 1.011 ± 0.0
7.417ProLeu: 7.417 ± 0.0
2.023ProMet: 2.023 ± 0.0
1.349ProAsn: 1.349 ± 0.0
7.08ProPro: 7.08 ± 0.0
0.674ProGln: 0.674 ± 0.0
4.72ProArg: 4.72 ± 0.0
3.034ProSer: 3.034 ± 0.0
3.372ProThr: 3.372 ± 0.0
7.755ProVal: 7.755 ± 0.0
1.686ProTrp: 1.686 ± 0.0
1.011ProTyr: 1.011 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.349GlnAla: 1.349 ± 0.0
0.674GlnCys: 0.674 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
1.011GlnGlu: 1.011 ± 0.0
0.0GlnPhe: 0.0 ± 0.0
2.023GlnGly: 2.023 ± 0.0
0.674GlnHis: 0.674 ± 0.0
0.674GlnIle: 0.674 ± 0.0
0.674GlnLys: 0.674 ± 0.0
1.349GlnLeu: 1.349 ± 0.0
0.674GlnMet: 0.674 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
1.011GlnPro: 1.011 ± 0.0
0.337GlnGln: 0.337 ± 0.0
2.697GlnArg: 2.697 ± 0.0
1.011GlnSer: 1.011 ± 0.0
1.686GlnThr: 1.686 ± 0.0
2.36GlnVal: 2.36 ± 0.0
0.337GlnTrp: 0.337 ± 0.0
0.337GlnTyr: 0.337 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
6.069ArgAla: 6.069 ± 0.0
1.686ArgCys: 1.686 ± 0.0
2.36ArgAsp: 2.36 ± 0.0
3.372ArgGlu: 3.372 ± 0.0
1.349ArgPhe: 1.349 ± 0.0
5.732ArgGly: 5.732 ± 0.0
1.011ArgHis: 1.011 ± 0.0
1.349ArgIle: 1.349 ± 0.0
2.697ArgLys: 2.697 ± 0.0
6.406ArgLeu: 6.406 ± 0.0
1.011ArgMet: 1.011 ± 0.0
1.011ArgAsn: 1.011 ± 0.0
3.709ArgPro: 3.709 ± 0.0
1.011ArgGln: 1.011 ± 0.0
4.72ArgArg: 4.72 ± 0.0
3.034ArgSer: 3.034 ± 0.0
3.034ArgThr: 3.034 ± 0.0
5.057ArgVal: 5.057 ± 0.0
2.023ArgTrp: 2.023 ± 0.0
2.697ArgTyr: 2.697 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
6.406SerAla: 6.406 ± 0.0
2.023SerCys: 2.023 ± 0.0
2.023SerAsp: 2.023 ± 0.0
1.686SerGlu: 1.686 ± 0.0
1.686SerPhe: 1.686 ± 0.0
5.732SerGly: 5.732 ± 0.0
0.337SerHis: 0.337 ± 0.0
2.36SerIle: 2.36 ± 0.0
2.36SerLys: 2.36 ± 0.0
5.732SerLeu: 5.732 ± 0.0
1.686SerMet: 1.686 ± 0.0
1.349SerAsn: 1.349 ± 0.0
4.046SerPro: 4.046 ± 0.0
1.349SerGln: 1.349 ± 0.0
2.023SerArg: 2.023 ± 0.0
5.394SerSer: 5.394 ± 0.0
4.383SerThr: 4.383 ± 0.0
5.394SerVal: 5.394 ± 0.0
2.023SerTrp: 2.023 ± 0.0
1.686SerTyr: 1.686 ± 0.0
0.337SerXaa: 0.337 ± 0.0
Thr
5.057ThrAla: 5.057 ± 0.0
2.36ThrCys: 2.36 ± 0.0
3.709ThrAsp: 3.709 ± 0.0
2.36ThrGlu: 2.36 ± 0.0
3.034ThrPhe: 3.034 ± 0.0
5.732ThrGly: 5.732 ± 0.0
1.686ThrHis: 1.686 ± 0.0
2.023ThrIle: 2.023 ± 0.0
2.697ThrLys: 2.697 ± 0.0
5.394ThrLeu: 5.394 ± 0.0
1.686ThrMet: 1.686 ± 0.0
1.011ThrAsn: 1.011 ± 0.0
3.372ThrPro: 3.372 ± 0.0
1.686ThrGln: 1.686 ± 0.0
3.372ThrArg: 3.372 ± 0.0
4.046ThrSer: 4.046 ± 0.0
4.046ThrThr: 4.046 ± 0.0
6.743ThrVal: 6.743 ± 0.0
1.686ThrTrp: 1.686 ± 0.0
2.023ThrTyr: 2.023 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
6.743ValAla: 6.743 ± 0.0
4.383ValCys: 4.383 ± 0.0
3.709ValAsp: 3.709 ± 0.0
3.709ValGlu: 3.709 ± 0.0
2.023ValPhe: 2.023 ± 0.0
8.429ValGly: 8.429 ± 0.0
1.011ValHis: 1.011 ± 0.0
2.023ValIle: 2.023 ± 0.0
5.057ValLys: 5.057 ± 0.0
10.452ValLeu: 10.452 ± 0.0
3.034ValMet: 3.034 ± 0.0
3.709ValAsn: 3.709 ± 0.0
6.069ValPro: 6.069 ± 0.0
0.674ValGln: 0.674 ± 0.0
7.08ValArg: 7.08 ± 0.0
6.406ValSer: 6.406 ± 0.0
6.743ValThr: 6.743 ± 0.0
10.115ValVal: 10.115 ± 0.0
1.686ValTrp: 1.686 ± 0.0
2.023ValTyr: 2.023 ± 0.0
0.337ValXaa: 0.337 ± 0.0
Trp
2.36TrpAla: 2.36 ± 0.0
1.686TrpCys: 1.686 ± 0.0
0.674TrpAsp: 0.674 ± 0.0
2.36TrpGlu: 2.36 ± 0.0
0.674TrpPhe: 0.674 ± 0.0
3.034TrpGly: 3.034 ± 0.0
1.349TrpHis: 1.349 ± 0.0
1.349TrpIle: 1.349 ± 0.0
0.337TrpLys: 0.337 ± 0.0
2.023TrpLeu: 2.023 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.337TrpAsn: 0.337 ± 0.0
1.349TrpPro: 1.349 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.349TrpArg: 1.349 ± 0.0
2.023TrpSer: 2.023 ± 0.0
2.36TrpThr: 2.36 ± 0.0
2.023TrpVal: 2.023 ± 0.0
1.349TrpTrp: 1.349 ± 0.0
1.011TrpTyr: 1.011 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.36TyrAla: 2.36 ± 0.0
0.674TyrCys: 0.674 ± 0.0
2.023TyrAsp: 2.023 ± 0.0
1.011TyrGlu: 1.011 ± 0.0
0.674TyrPhe: 0.674 ± 0.0
2.023TyrGly: 2.023 ± 0.0
0.337TyrHis: 0.337 ± 0.0
0.337TyrIle: 0.337 ± 0.0
0.0TyrLys: 0.0 ± 0.0
3.372TyrLeu: 3.372 ± 0.0
0.337TyrMet: 0.337 ± 0.0
0.337TyrAsn: 0.337 ± 0.0
2.023TyrPro: 2.023 ± 0.0
1.349TyrGln: 1.349 ± 0.0
3.034TyrArg: 3.034 ± 0.0
2.697TyrSer: 2.697 ± 0.0
2.023TyrThr: 2.023 ± 0.0
1.686TyrVal: 1.686 ± 0.0
1.349TyrTrp: 1.349 ± 0.0
1.686TyrTyr: 1.686 ± 0.0
0.337TyrXaa: 0.337 ± 0.0
Xaa
0.337XaaAla: 0.337 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.674XaaGlu: 0.674 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.337XaaIle: 0.337 ± 0.0
0.337XaaLys: 0.337 ± 0.0
0.337XaaLeu: 0.337 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.674XaaArg: 0.674 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.674XaaThr: 0.674 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.337XaaTrp: 0.337 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.674XaaXaa: 0.674 ± 0.0
Statistics based on 1 proteins (2967 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski