Amino acid dipepetide frequency for Yerba mate alphaendornavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.437AlaAla: 0.437 ± 0.0
0.655AlaCys: 0.655 ± 0.0
1.965AlaAsp: 1.965 ± 0.0
1.965AlaGlu: 1.965 ± 0.0
0.437AlaPhe: 0.437 ± 0.0
0.874AlaGly: 0.874 ± 0.0
0.218AlaHis: 0.218 ± 0.0
3.931AlaIle: 3.931 ± 0.0
2.839AlaLys: 2.839 ± 0.0
4.805AlaLeu: 4.805 ± 0.0
1.965AlaMet: 1.965 ± 0.0
2.839AlaAsn: 2.839 ± 0.0
2.184AlaPro: 2.184 ± 0.0
0.437AlaGln: 0.437 ± 0.0
1.965AlaArg: 1.965 ± 0.0
1.529AlaSer: 1.529 ± 0.0
3.494AlaThr: 3.494 ± 0.0
3.057AlaVal: 3.057 ± 0.0
0.437AlaTrp: 0.437 ± 0.0
2.402AlaTyr: 2.402 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.437CysAla: 0.437 ± 0.0
0.437CysCys: 0.437 ± 0.0
1.092CysAsp: 1.092 ± 0.0
1.092CysGlu: 1.092 ± 0.0
0.655CysPhe: 0.655 ± 0.0
0.655CysGly: 0.655 ± 0.0
0.874CysHis: 0.874 ± 0.0
1.747CysIle: 1.747 ± 0.0
1.965CysLys: 1.965 ± 0.0
2.402CysLeu: 2.402 ± 0.0
0.874CysMet: 0.874 ± 0.0
1.747CysAsn: 1.747 ± 0.0
0.655CysPro: 0.655 ± 0.0
1.092CysGln: 1.092 ± 0.0
0.437CysArg: 0.437 ± 0.0
1.092CysSer: 1.092 ± 0.0
0.655CysThr: 0.655 ± 0.0
0.655CysVal: 0.655 ± 0.0
0.218CysTrp: 0.218 ± 0.0
0.437CysTyr: 0.437 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.184AspAla: 2.184 ± 0.0
0.437AspCys: 0.437 ± 0.0
5.241AspAsp: 5.241 ± 0.0
5.896AspGlu: 5.896 ± 0.0
0.874AspPhe: 0.874 ± 0.0
2.184AspGly: 2.184 ± 0.0
1.965AspHis: 1.965 ± 0.0
6.115AspIle: 6.115 ± 0.0
7.207AspLys: 7.207 ± 0.0
6.333AspLeu: 6.333 ± 0.0
2.621AspMet: 2.621 ± 0.0
3.276AspAsn: 3.276 ± 0.0
0.874AspPro: 0.874 ± 0.0
1.092AspGln: 1.092 ± 0.0
3.276AspArg: 3.276 ± 0.0
3.276AspSer: 3.276 ± 0.0
2.839AspThr: 2.839 ± 0.0
5.023AspVal: 5.023 ± 0.0
1.529AspTrp: 1.529 ± 0.0
1.965AspTyr: 1.965 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.057GluAla: 3.057 ± 0.0
0.0GluCys: 0.0 ± 0.0
3.494GluAsp: 3.494 ± 0.0
9.391GluGlu: 9.391 ± 0.0
2.184GluPhe: 2.184 ± 0.0
3.713GluGly: 3.713 ± 0.0
0.874GluHis: 0.874 ± 0.0
5.896GluIle: 5.896 ± 0.0
3.494GluLys: 3.494 ± 0.0
8.517GluLeu: 8.517 ± 0.0
1.529GluMet: 1.529 ± 0.0
5.896GluAsn: 5.896 ± 0.0
3.494GluPro: 3.494 ± 0.0
2.402GluGln: 2.402 ± 0.0
2.621GluArg: 2.621 ± 0.0
5.896GluSer: 5.896 ± 0.0
4.149GluThr: 4.149 ± 0.0
6.333GluVal: 6.333 ± 0.0
1.529GluTrp: 1.529 ± 0.0
1.965GluTyr: 1.965 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.31PheAla: 1.31 ± 0.0
0.874PheCys: 0.874 ± 0.0
3.276PheAsp: 3.276 ± 0.0
2.621PheGlu: 2.621 ± 0.0
1.092PhePhe: 1.092 ± 0.0
1.747PheGly: 1.747 ± 0.0
0.437PheHis: 0.437 ± 0.0
1.747PheIle: 1.747 ± 0.0
3.057PheLys: 3.057 ± 0.0
1.529PheLeu: 1.529 ± 0.0
0.437PheMet: 0.437 ± 0.0
1.529PheAsn: 1.529 ± 0.0
0.437PhePro: 0.437 ± 0.0
1.092PheGln: 1.092 ± 0.0
1.31PheArg: 1.31 ± 0.0
1.092PheSer: 1.092 ± 0.0
1.092PheThr: 1.092 ± 0.0
2.402PheVal: 2.402 ± 0.0
0.437PheTrp: 0.437 ± 0.0
0.437PheTyr: 0.437 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.621GlyAla: 2.621 ± 0.0
1.092GlyCys: 1.092 ± 0.0
5.023GlyAsp: 5.023 ± 0.0
3.494GlyGlu: 3.494 ± 0.0
1.092GlyPhe: 1.092 ± 0.0
1.747GlyGly: 1.747 ± 0.0
1.092GlyHis: 1.092 ± 0.0
3.494GlyIle: 3.494 ± 0.0
4.586GlyLys: 4.586 ± 0.0
4.368GlyLeu: 4.368 ± 0.0
1.092GlyMet: 1.092 ± 0.0
1.747GlyAsn: 1.747 ± 0.0
1.092GlyPro: 1.092 ± 0.0
1.092GlyGln: 1.092 ± 0.0
1.31GlyArg: 1.31 ± 0.0
1.965GlySer: 1.965 ± 0.0
1.747GlyThr: 1.747 ± 0.0
3.931GlyVal: 3.931 ± 0.0
1.092GlyTrp: 1.092 ± 0.0
3.057GlyTyr: 3.057 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.218HisAla: 0.218 ± 0.0
0.437HisCys: 0.437 ± 0.0
1.092HisAsp: 1.092 ± 0.0
1.092HisGlu: 1.092 ± 0.0
0.874HisPhe: 0.874 ± 0.0
1.092HisGly: 1.092 ± 0.0
0.437HisHis: 0.437 ± 0.0
1.092HisIle: 1.092 ± 0.0
2.402HisLys: 2.402 ± 0.0
0.874HisLeu: 0.874 ± 0.0
0.437HisMet: 0.437 ± 0.0
2.402HisAsn: 2.402 ± 0.0
0.218HisPro: 0.218 ± 0.0
1.31HisGln: 1.31 ± 0.0
0.874HisArg: 0.874 ± 0.0
1.529HisSer: 1.529 ± 0.0
1.31HisThr: 1.31 ± 0.0
1.965HisVal: 1.965 ± 0.0
0.218HisTrp: 0.218 ± 0.0
1.529HisTyr: 1.529 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.149IleAla: 4.149 ± 0.0
2.184IleCys: 2.184 ± 0.0
5.241IleAsp: 5.241 ± 0.0
6.988IleGlu: 6.988 ± 0.0
1.747IlePhe: 1.747 ± 0.0
5.241IleGly: 5.241 ± 0.0
2.621IleHis: 2.621 ± 0.0
8.736IleIle: 8.736 ± 0.0
8.08IleLys: 8.08 ± 0.0
5.678IleLeu: 5.678 ± 0.0
3.057IleMet: 3.057 ± 0.0
6.333IleAsn: 6.333 ± 0.0
3.057IlePro: 3.057 ± 0.0
0.655IleGln: 0.655 ± 0.0
5.46IleArg: 5.46 ± 0.0
3.931IleSer: 3.931 ± 0.0
7.207IleThr: 7.207 ± 0.0
4.368IleVal: 4.368 ± 0.0
0.655IleTrp: 0.655 ± 0.0
2.621IleTyr: 2.621 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.621LysAla: 2.621 ± 0.0
2.402LysCys: 2.402 ± 0.0
3.713LysAsp: 3.713 ± 0.0
6.77LysGlu: 6.77 ± 0.0
5.023LysPhe: 5.023 ± 0.0
3.057LysGly: 3.057 ± 0.0
2.402LysHis: 2.402 ± 0.0
6.988LysIle: 6.988 ± 0.0
4.805LysLys: 4.805 ± 0.0
8.954LysLeu: 8.954 ± 0.0
3.276LysMet: 3.276 ± 0.0
5.678LysAsn: 5.678 ± 0.0
3.057LysPro: 3.057 ± 0.0
1.965LysGln: 1.965 ± 0.0
1.965LysArg: 1.965 ± 0.0
5.896LysSer: 5.896 ± 0.0
4.586LysThr: 4.586 ± 0.0
4.368LysVal: 4.368 ± 0.0
0.874LysTrp: 0.874 ± 0.0
4.368LysTyr: 4.368 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
4.586LeuAla: 4.586 ± 0.0
2.621LeuCys: 2.621 ± 0.0
3.494LeuAsp: 3.494 ± 0.0
4.149LeuGlu: 4.149 ± 0.0
3.494LeuPhe: 3.494 ± 0.0
3.713LeuGly: 3.713 ± 0.0
1.31LeuHis: 1.31 ± 0.0
9.391LeuIle: 9.391 ± 0.0
6.988LeuLys: 6.988 ± 0.0
5.896LeuLeu: 5.896 ± 0.0
4.805LeuMet: 4.805 ± 0.0
7.207LeuAsn: 7.207 ± 0.0
4.805LeuPro: 4.805 ± 0.0
2.402LeuGln: 2.402 ± 0.0
5.896LeuArg: 5.896 ± 0.0
5.896LeuSer: 5.896 ± 0.0
7.644LeuThr: 7.644 ± 0.0
5.46LeuVal: 5.46 ± 0.0
1.31LeuTrp: 1.31 ± 0.0
3.713LeuTyr: 3.713 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
0.874MetAla: 0.874 ± 0.0
0.218MetCys: 0.218 ± 0.0
2.621MetAsp: 2.621 ± 0.0
2.402MetGlu: 2.402 ± 0.0
1.092MetPhe: 1.092 ± 0.0
1.092MetGly: 1.092 ± 0.0
0.218MetHis: 0.218 ± 0.0
1.747MetIle: 1.747 ± 0.0
2.184MetLys: 2.184 ± 0.0
4.586MetLeu: 4.586 ± 0.0
1.747MetMet: 1.747 ± 0.0
2.621MetAsn: 2.621 ± 0.0
0.655MetPro: 0.655 ± 0.0
1.31MetGln: 1.31 ± 0.0
3.276MetArg: 3.276 ± 0.0
2.621MetSer: 2.621 ± 0.0
2.184MetThr: 2.184 ± 0.0
2.621MetVal: 2.621 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.31MetTyr: 1.31 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.184AsnAla: 2.184 ± 0.0
1.092AsnCys: 1.092 ± 0.0
3.494AsnAsp: 3.494 ± 0.0
5.241AsnGlu: 5.241 ± 0.0
0.437AsnPhe: 0.437 ± 0.0
2.839AsnGly: 2.839 ± 0.0
1.092AsnHis: 1.092 ± 0.0
6.552AsnIle: 6.552 ± 0.0
5.023AsnLys: 5.023 ± 0.0
8.954AsnLeu: 8.954 ± 0.0
2.839AsnMet: 2.839 ± 0.0
5.241AsnAsn: 5.241 ± 0.0
2.402AsnPro: 2.402 ± 0.0
1.31AsnGln: 1.31 ± 0.0
2.402AsnArg: 2.402 ± 0.0
3.057AsnSer: 3.057 ± 0.0
3.713AsnThr: 3.713 ± 0.0
5.023AsnVal: 5.023 ± 0.0
1.31AsnTrp: 1.31 ± 0.0
2.839AsnTyr: 2.839 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.31ProAla: 1.31 ± 0.0
0.218ProCys: 0.218 ± 0.0
2.621ProAsp: 2.621 ± 0.0
2.184ProGlu: 2.184 ± 0.0
0.437ProPhe: 0.437 ± 0.0
2.184ProGly: 2.184 ± 0.0
0.437ProHis: 0.437 ± 0.0
2.839ProIle: 2.839 ± 0.0
2.184ProLys: 2.184 ± 0.0
3.494ProLeu: 3.494 ± 0.0
0.874ProMet: 0.874 ± 0.0
3.276ProAsn: 3.276 ± 0.0
0.218ProPro: 0.218 ± 0.0
0.218ProGln: 0.218 ± 0.0
0.874ProArg: 0.874 ± 0.0
2.402ProSer: 2.402 ± 0.0
0.655ProThr: 0.655 ± 0.0
3.057ProVal: 3.057 ± 0.0
0.655ProTrp: 0.655 ± 0.0
0.874ProTyr: 0.874 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.529GlnAla: 1.529 ± 0.0
0.218GlnCys: 0.218 ± 0.0
1.747GlnAsp: 1.747 ± 0.0
1.965GlnGlu: 1.965 ± 0.0
0.655GlnPhe: 0.655 ± 0.0
1.092GlnGly: 1.092 ± 0.0
0.218GlnHis: 0.218 ± 0.0
1.965GlnIle: 1.965 ± 0.0
1.092GlnLys: 1.092 ± 0.0
3.057GlnLeu: 3.057 ± 0.0
0.655GlnMet: 0.655 ± 0.0
0.655GlnAsn: 0.655 ± 0.0
0.874GlnPro: 0.874 ± 0.0
1.092GlnGln: 1.092 ± 0.0
1.31GlnArg: 1.31 ± 0.0
2.184GlnSer: 2.184 ± 0.0
1.092GlnThr: 1.092 ± 0.0
1.529GlnVal: 1.529 ± 0.0
0.655GlnTrp: 0.655 ± 0.0
0.655GlnTyr: 0.655 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
1.965ArgAla: 1.965 ± 0.0
0.874ArgCys: 0.874 ± 0.0
1.529ArgAsp: 1.529 ± 0.0
3.494ArgGlu: 3.494 ± 0.0
0.874ArgPhe: 0.874 ± 0.0
3.276ArgGly: 3.276 ± 0.0
1.092ArgHis: 1.092 ± 0.0
3.494ArgIle: 3.494 ± 0.0
4.149ArgLys: 4.149 ± 0.0
5.896ArgLeu: 5.896 ± 0.0
1.092ArgMet: 1.092 ± 0.0
2.402ArgAsn: 2.402 ± 0.0
1.092ArgPro: 1.092 ± 0.0
1.747ArgGln: 1.747 ± 0.0
3.057ArgArg: 3.057 ± 0.0
2.839ArgSer: 2.839 ± 0.0
2.184ArgThr: 2.184 ± 0.0
2.621ArgVal: 2.621 ± 0.0
0.218ArgTrp: 0.218 ± 0.0
1.965ArgTyr: 1.965 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
2.402SerAla: 2.402 ± 0.0
1.529SerCys: 1.529 ± 0.0
3.931SerAsp: 3.931 ± 0.0
4.368SerGlu: 4.368 ± 0.0
0.874SerPhe: 0.874 ± 0.0
3.057SerGly: 3.057 ± 0.0
1.092SerHis: 1.092 ± 0.0
6.115SerIle: 6.115 ± 0.0
6.115SerLys: 6.115 ± 0.0
6.115SerLeu: 6.115 ± 0.0
1.965SerMet: 1.965 ± 0.0
2.839SerAsn: 2.839 ± 0.0
1.092SerPro: 1.092 ± 0.0
1.747SerGln: 1.747 ± 0.0
1.965SerArg: 1.965 ± 0.0
4.149SerSer: 4.149 ± 0.0
2.839SerThr: 2.839 ± 0.0
3.276SerVal: 3.276 ± 0.0
0.655SerTrp: 0.655 ± 0.0
3.494SerTyr: 3.494 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
2.621ThrAla: 2.621 ± 0.0
0.437ThrCys: 0.437 ± 0.0
4.805ThrAsp: 4.805 ± 0.0
4.586ThrGlu: 4.586 ± 0.0
2.839ThrPhe: 2.839 ± 0.0
3.713ThrGly: 3.713 ± 0.0
1.529ThrHis: 1.529 ± 0.0
6.115ThrIle: 6.115 ± 0.0
5.896ThrLys: 5.896 ± 0.0
4.805ThrLeu: 4.805 ± 0.0
1.965ThrMet: 1.965 ± 0.0
2.839ThrAsn: 2.839 ± 0.0
1.529ThrPro: 1.529 ± 0.0
0.655ThrGln: 0.655 ± 0.0
2.621ThrArg: 2.621 ± 0.0
3.713ThrSer: 3.713 ± 0.0
2.184ThrThr: 2.184 ± 0.0
2.621ThrVal: 2.621 ± 0.0
0.874ThrTrp: 0.874 ± 0.0
1.965ThrTyr: 1.965 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
2.184ValAla: 2.184 ± 0.0
1.965ValCys: 1.965 ± 0.0
4.586ValAsp: 4.586 ± 0.0
5.023ValGlu: 5.023 ± 0.0
1.31ValPhe: 1.31 ± 0.0
2.184ValGly: 2.184 ± 0.0
1.747ValHis: 1.747 ± 0.0
7.425ValIle: 7.425 ± 0.0
6.77ValLys: 6.77 ± 0.0
4.368ValLeu: 4.368 ± 0.0
1.965ValMet: 1.965 ± 0.0
4.586ValAsn: 4.586 ± 0.0
1.965ValPro: 1.965 ± 0.0
1.747ValGln: 1.747 ± 0.0
1.965ValArg: 1.965 ± 0.0
3.276ValSer: 3.276 ± 0.0
6.333ValThr: 6.333 ± 0.0
3.494ValVal: 3.494 ± 0.0
0.655ValTrp: 0.655 ± 0.0
1.31ValTyr: 1.31 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.437TrpAla: 0.437 ± 0.0
0.655TrpCys: 0.655 ± 0.0
1.092TrpAsp: 1.092 ± 0.0
0.218TrpGlu: 0.218 ± 0.0
0.655TrpPhe: 0.655 ± 0.0
0.437TrpGly: 0.437 ± 0.0
0.655TrpHis: 0.655 ± 0.0
1.31TrpIle: 1.31 ± 0.0
0.655TrpLys: 0.655 ± 0.0
1.092TrpLeu: 1.092 ± 0.0
0.437TrpMet: 0.437 ± 0.0
0.655TrpAsn: 0.655 ± 0.0
0.218TrpPro: 0.218 ± 0.0
0.655TrpGln: 0.655 ± 0.0
1.31TrpArg: 1.31 ± 0.0
0.655TrpSer: 0.655 ± 0.0
0.437TrpThr: 0.437 ± 0.0
1.529TrpVal: 1.529 ± 0.0
0.437TrpTrp: 0.437 ± 0.0
0.874TrpTyr: 0.874 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.31TyrAla: 1.31 ± 0.0
0.874TyrCys: 0.874 ± 0.0
4.149TyrAsp: 4.149 ± 0.0
3.494TyrGlu: 3.494 ± 0.0
1.31TyrPhe: 1.31 ± 0.0
2.839TyrGly: 2.839 ± 0.0
1.092TyrHis: 1.092 ± 0.0
1.31TyrIle: 1.31 ± 0.0
3.276TyrLys: 3.276 ± 0.0
3.057TyrLeu: 3.057 ± 0.0
1.529TyrMet: 1.529 ± 0.0
3.276TyrAsn: 3.276 ± 0.0
1.092TyrPro: 1.092 ± 0.0
0.437TyrGln: 0.437 ± 0.0
1.965TyrArg: 1.965 ± 0.0
2.621TyrSer: 2.621 ± 0.0
1.965TyrThr: 1.965 ± 0.0
1.529TyrVal: 1.529 ± 0.0
0.655TyrTrp: 0.655 ± 0.0
1.529TyrTyr: 1.529 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (4580 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski