Amino acid dipepetide frequency for Persea americana alphaendornavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.049AlaAla: 2.049 ± 0.0
2.049AlaCys: 2.049 ± 0.0
3.643AlaAsp: 3.643 ± 0.0
4.326AlaGlu: 4.326 ± 0.0
0.0AlaPhe: 0.0 ± 0.0
2.505AlaGly: 2.505 ± 0.0
0.683AlaHis: 0.683 ± 0.0
3.643AlaIle: 3.643 ± 0.0
3.415AlaLys: 3.415 ± 0.0
4.098AlaLeu: 4.098 ± 0.0
2.277AlaMet: 2.277 ± 0.0
3.871AlaAsn: 3.871 ± 0.0
1.138AlaPro: 1.138 ± 0.0
1.138AlaGln: 1.138 ± 0.0
2.732AlaArg: 2.732 ± 0.0
2.96AlaSer: 2.96 ± 0.0
2.732AlaThr: 2.732 ± 0.0
3.188AlaVal: 3.188 ± 0.0
0.683AlaTrp: 0.683 ± 0.0
1.138AlaTyr: 1.138 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
2.277CysAla: 2.277 ± 0.0
0.455CysCys: 0.455 ± 0.0
1.138CysAsp: 1.138 ± 0.0
1.821CysGlu: 1.821 ± 0.0
0.455CysPhe: 0.455 ± 0.0
0.911CysGly: 0.911 ± 0.0
0.455CysHis: 0.455 ± 0.0
1.821CysIle: 1.821 ± 0.0
2.049CysLys: 2.049 ± 0.0
1.366CysLeu: 1.366 ± 0.0
0.911CysMet: 0.911 ± 0.0
1.594CysAsn: 1.594 ± 0.0
0.455CysPro: 0.455 ± 0.0
0.228CysGln: 0.228 ± 0.0
1.138CysArg: 1.138 ± 0.0
1.821CysSer: 1.821 ± 0.0
1.138CysThr: 1.138 ± 0.0
0.911CysVal: 0.911 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.683CysTyr: 0.683 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.277AspAla: 2.277 ± 0.0
0.911AspCys: 0.911 ± 0.0
4.781AspAsp: 4.781 ± 0.0
5.237AspGlu: 5.237 ± 0.0
2.049AspPhe: 2.049 ± 0.0
3.415AspGly: 3.415 ± 0.0
1.138AspHis: 1.138 ± 0.0
6.603AspIle: 6.603 ± 0.0
4.098AspLys: 4.098 ± 0.0
4.781AspLeu: 4.781 ± 0.0
2.277AspMet: 2.277 ± 0.0
2.96AspAsn: 2.96 ± 0.0
1.138AspPro: 1.138 ± 0.0
1.594AspGln: 1.594 ± 0.0
3.643AspArg: 3.643 ± 0.0
3.871AspSer: 3.871 ± 0.0
3.871AspThr: 3.871 ± 0.0
3.871AspVal: 3.871 ± 0.0
0.911AspTrp: 0.911 ± 0.0
2.277AspTyr: 2.277 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.643GluAla: 3.643 ± 0.0
1.821GluCys: 1.821 ± 0.0
3.643GluAsp: 3.643 ± 0.0
4.781GluGlu: 4.781 ± 0.0
2.732GluPhe: 2.732 ± 0.0
2.049GluGly: 2.049 ± 0.0
1.366GluHis: 1.366 ± 0.0
3.871GluIle: 3.871 ± 0.0
4.098GluLys: 4.098 ± 0.0
6.603GluLeu: 6.603 ± 0.0
3.415GluMet: 3.415 ± 0.0
2.277GluAsn: 2.277 ± 0.0
3.415GluPro: 3.415 ± 0.0
2.049GluGln: 2.049 ± 0.0
2.277GluArg: 2.277 ± 0.0
3.188GluSer: 3.188 ± 0.0
3.871GluThr: 3.871 ± 0.0
4.098GluVal: 4.098 ± 0.0
0.911GluTrp: 0.911 ± 0.0
2.049GluTyr: 2.049 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.594PheAla: 1.594 ± 0.0
0.455PheCys: 0.455 ± 0.0
1.366PheAsp: 1.366 ± 0.0
1.821PheGlu: 1.821 ± 0.0
0.455PhePhe: 0.455 ± 0.0
2.277PheGly: 2.277 ± 0.0
0.228PheHis: 0.228 ± 0.0
0.683PheIle: 0.683 ± 0.0
1.594PheLys: 1.594 ± 0.0
0.911PheLeu: 0.911 ± 0.0
1.138PheMet: 1.138 ± 0.0
2.049PheAsn: 2.049 ± 0.0
1.138PhePro: 1.138 ± 0.0
0.455PheGln: 0.455 ± 0.0
0.911PheArg: 0.911 ± 0.0
1.594PheSer: 1.594 ± 0.0
1.821PheThr: 1.821 ± 0.0
1.821PheVal: 1.821 ± 0.0
0.228PheTrp: 0.228 ± 0.0
0.911PheTyr: 0.911 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.049GlyAla: 2.049 ± 0.0
0.228GlyCys: 0.228 ± 0.0
2.96GlyAsp: 2.96 ± 0.0
2.049GlyGlu: 2.049 ± 0.0
1.366GlyPhe: 1.366 ± 0.0
2.049GlyGly: 2.049 ± 0.0
0.911GlyHis: 0.911 ± 0.0
3.415GlyIle: 3.415 ± 0.0
3.415GlyLys: 3.415 ± 0.0
3.871GlyLeu: 3.871 ± 0.0
1.366GlyMet: 1.366 ± 0.0
3.871GlyAsn: 3.871 ± 0.0
1.821GlyPro: 1.821 ± 0.0
2.049GlyGln: 2.049 ± 0.0
2.049GlyArg: 2.049 ± 0.0
2.277GlySer: 2.277 ± 0.0
3.188GlyThr: 3.188 ± 0.0
2.505GlyVal: 2.505 ± 0.0
0.911GlyTrp: 0.911 ± 0.0
2.96GlyTyr: 2.96 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.455HisAla: 0.455 ± 0.0
1.138HisCys: 1.138 ± 0.0
2.505HisAsp: 2.505 ± 0.0
1.594HisGlu: 1.594 ± 0.0
0.683HisPhe: 0.683 ± 0.0
1.594HisGly: 1.594 ± 0.0
1.138HisHis: 1.138 ± 0.0
1.138HisIle: 1.138 ± 0.0
1.366HisLys: 1.366 ± 0.0
1.821HisLeu: 1.821 ± 0.0
0.911HisMet: 0.911 ± 0.0
1.821HisAsn: 1.821 ± 0.0
0.228HisPro: 0.228 ± 0.0
1.138HisGln: 1.138 ± 0.0
0.455HisArg: 0.455 ± 0.0
1.366HisSer: 1.366 ± 0.0
1.594HisThr: 1.594 ± 0.0
1.366HisVal: 1.366 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.911HisTyr: 0.911 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.505IleAla: 2.505 ± 0.0
2.049IleCys: 2.049 ± 0.0
5.009IleAsp: 5.009 ± 0.0
5.92IleGlu: 5.92 ± 0.0
1.366IlePhe: 1.366 ± 0.0
4.781IleGly: 4.781 ± 0.0
1.821IleHis: 1.821 ± 0.0
7.969IleIle: 7.969 ± 0.0
7.969IleLys: 7.969 ± 0.0
5.464IleLeu: 5.464 ± 0.0
3.643IleMet: 3.643 ± 0.0
6.375IleAsn: 6.375 ± 0.0
5.237IlePro: 5.237 ± 0.0
1.594IleGln: 1.594 ± 0.0
2.505IleArg: 2.505 ± 0.0
4.326IleSer: 4.326 ± 0.0
6.603IleThr: 6.603 ± 0.0
7.741IleVal: 7.741 ± 0.0
0.683IleTrp: 0.683 ± 0.0
2.505IleTyr: 2.505 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.96LysAla: 2.96 ± 0.0
0.911LysCys: 0.911 ± 0.0
3.871LysAsp: 3.871 ± 0.0
4.098LysGlu: 4.098 ± 0.0
2.505LysPhe: 2.505 ± 0.0
1.594LysGly: 1.594 ± 0.0
2.277LysHis: 2.277 ± 0.0
5.92LysIle: 5.92 ± 0.0
2.96LysLys: 2.96 ± 0.0
7.969LysLeu: 7.969 ± 0.0
1.594LysMet: 1.594 ± 0.0
2.049LysAsn: 2.049 ± 0.0
3.643LysPro: 3.643 ± 0.0
2.049LysGln: 2.049 ± 0.0
2.96LysArg: 2.96 ± 0.0
5.237LysSer: 5.237 ± 0.0
5.009LysThr: 5.009 ± 0.0
5.237LysVal: 5.237 ± 0.0
1.366LysTrp: 1.366 ± 0.0
3.188LysTyr: 3.188 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
5.692LeuAla: 5.692 ± 0.0
3.415LeuCys: 3.415 ± 0.0
3.871LeuAsp: 3.871 ± 0.0
5.464LeuGlu: 5.464 ± 0.0
2.049LeuPhe: 2.049 ± 0.0
3.871LeuGly: 3.871 ± 0.0
1.594LeuHis: 1.594 ± 0.0
7.741LeuIle: 7.741 ± 0.0
5.92LeuLys: 5.92 ± 0.0
6.375LeuLeu: 6.375 ± 0.0
2.732LeuMet: 2.732 ± 0.0
5.692LeuAsn: 5.692 ± 0.0
3.188LeuPro: 3.188 ± 0.0
2.96LeuGln: 2.96 ± 0.0
6.603LeuArg: 6.603 ± 0.0
7.514LeuSer: 7.514 ± 0.0
7.286LeuThr: 7.286 ± 0.0
5.464LeuVal: 5.464 ± 0.0
0.683LeuTrp: 0.683 ± 0.0
2.732LeuTyr: 2.732 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.821MetAla: 1.821 ± 0.0
0.683MetCys: 0.683 ± 0.0
1.594MetAsp: 1.594 ± 0.0
1.138MetGlu: 1.138 ± 0.0
0.455MetPhe: 0.455 ± 0.0
1.138MetGly: 1.138 ± 0.0
1.138MetHis: 1.138 ± 0.0
5.009MetIle: 5.009 ± 0.0
2.732MetLys: 2.732 ± 0.0
2.732MetLeu: 2.732 ± 0.0
1.366MetMet: 1.366 ± 0.0
2.732MetAsn: 2.732 ± 0.0
1.821MetPro: 1.821 ± 0.0
2.505MetGln: 2.505 ± 0.0
2.505MetArg: 2.505 ± 0.0
2.049MetSer: 2.049 ± 0.0
2.505MetThr: 2.505 ± 0.0
2.505MetVal: 2.505 ± 0.0
0.455MetTrp: 0.455 ± 0.0
1.138MetTyr: 1.138 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.821AsnAla: 1.821 ± 0.0
0.911AsnCys: 0.911 ± 0.0
4.326AsnAsp: 4.326 ± 0.0
3.415AsnGlu: 3.415 ± 0.0
1.138AsnPhe: 1.138 ± 0.0
2.505AsnGly: 2.505 ± 0.0
0.683AsnHis: 0.683 ± 0.0
5.464AsnIle: 5.464 ± 0.0
4.781AsnLys: 4.781 ± 0.0
6.375AsnLeu: 6.375 ± 0.0
2.505AsnMet: 2.505 ± 0.0
4.326AsnAsn: 4.326 ± 0.0
2.277AsnPro: 2.277 ± 0.0
1.821AsnGln: 1.821 ± 0.0
2.96AsnArg: 2.96 ± 0.0
4.098AsnSer: 4.098 ± 0.0
2.732AsnThr: 2.732 ± 0.0
4.781AsnVal: 4.781 ± 0.0
1.594AsnTrp: 1.594 ± 0.0
3.415AsnTyr: 3.415 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
0.911ProAla: 0.911 ± 0.0
0.0ProCys: 0.0 ± 0.0
2.277ProAsp: 2.277 ± 0.0
2.277ProGlu: 2.277 ± 0.0
0.683ProPhe: 0.683 ± 0.0
2.049ProGly: 2.049 ± 0.0
0.683ProHis: 0.683 ± 0.0
3.871ProIle: 3.871 ± 0.0
2.505ProLys: 2.505 ± 0.0
4.554ProLeu: 4.554 ± 0.0
2.049ProMet: 2.049 ± 0.0
1.821ProAsn: 1.821 ± 0.0
1.594ProPro: 1.594 ± 0.0
1.138ProGln: 1.138 ± 0.0
0.911ProArg: 0.911 ± 0.0
2.277ProSer: 2.277 ± 0.0
2.732ProThr: 2.732 ± 0.0
2.96ProVal: 2.96 ± 0.0
0.683ProTrp: 0.683 ± 0.0
1.138ProTyr: 1.138 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.188GlnAla: 3.188 ± 0.0
0.228GlnCys: 0.228 ± 0.0
1.366GlnAsp: 1.366 ± 0.0
2.732GlnGlu: 2.732 ± 0.0
0.911GlnPhe: 0.911 ± 0.0
1.366GlnGly: 1.366 ± 0.0
0.683GlnHis: 0.683 ± 0.0
3.188GlnIle: 3.188 ± 0.0
0.683GlnLys: 0.683 ± 0.0
3.188GlnLeu: 3.188 ± 0.0
1.594GlnMet: 1.594 ± 0.0
1.138GlnAsn: 1.138 ± 0.0
0.911GlnPro: 0.911 ± 0.0
1.821GlnGln: 1.821 ± 0.0
1.138GlnArg: 1.138 ± 0.0
1.594GlnSer: 1.594 ± 0.0
1.821GlnThr: 1.821 ± 0.0
1.821GlnVal: 1.821 ± 0.0
0.911GlnTrp: 0.911 ± 0.0
2.049GlnTyr: 2.049 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.277ArgAla: 2.277 ± 0.0
1.594ArgCys: 1.594 ± 0.0
2.732ArgAsp: 2.732 ± 0.0
2.732ArgGlu: 2.732 ± 0.0
1.366ArgPhe: 1.366 ± 0.0
2.049ArgGly: 2.049 ± 0.0
0.911ArgHis: 0.911 ± 0.0
4.781ArgIle: 4.781 ± 0.0
1.821ArgLys: 1.821 ± 0.0
7.514ArgLeu: 7.514 ± 0.0
1.366ArgMet: 1.366 ± 0.0
2.505ArgAsn: 2.505 ± 0.0
2.277ArgPro: 2.277 ± 0.0
2.505ArgGln: 2.505 ± 0.0
1.138ArgArg: 1.138 ± 0.0
4.781ArgSer: 4.781 ± 0.0
2.505ArgThr: 2.505 ± 0.0
4.098ArgVal: 4.098 ± 0.0
0.911ArgTrp: 0.911 ± 0.0
0.911ArgTyr: 0.911 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
2.505SerAla: 2.505 ± 0.0
0.683SerCys: 0.683 ± 0.0
3.643SerAsp: 3.643 ± 0.0
2.96SerGlu: 2.96 ± 0.0
1.138SerPhe: 1.138 ± 0.0
4.098SerGly: 4.098 ± 0.0
2.049SerHis: 2.049 ± 0.0
5.92SerIle: 5.92 ± 0.0
5.92SerLys: 5.92 ± 0.0
7.058SerLeu: 7.058 ± 0.0
2.505SerMet: 2.505 ± 0.0
4.554SerAsn: 4.554 ± 0.0
0.683SerPro: 0.683 ± 0.0
1.366SerGln: 1.366 ± 0.0
4.781SerArg: 4.781 ± 0.0
2.505SerSer: 2.505 ± 0.0
3.188SerThr: 3.188 ± 0.0
4.781SerVal: 4.781 ± 0.0
1.138SerTrp: 1.138 ± 0.0
1.594SerTyr: 1.594 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.871ThrAla: 3.871 ± 0.0
1.366ThrCys: 1.366 ± 0.0
5.92ThrAsp: 5.92 ± 0.0
3.415ThrGlu: 3.415 ± 0.0
1.138ThrPhe: 1.138 ± 0.0
3.188ThrGly: 3.188 ± 0.0
2.277ThrHis: 2.277 ± 0.0
5.464ThrIle: 5.464 ± 0.0
5.692ThrLys: 5.692 ± 0.0
4.781ThrLeu: 4.781 ± 0.0
1.821ThrMet: 1.821 ± 0.0
4.098ThrAsn: 4.098 ± 0.0
2.277ThrPro: 2.277 ± 0.0
2.049ThrGln: 2.049 ± 0.0
4.781ThrArg: 4.781 ± 0.0
3.871ThrSer: 3.871 ± 0.0
5.009ThrThr: 5.009 ± 0.0
4.098ThrVal: 4.098 ± 0.0
0.683ThrTrp: 0.683 ± 0.0
1.821ThrTyr: 1.821 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
3.188ValAla: 3.188 ± 0.0
1.594ValCys: 1.594 ± 0.0
4.781ValAsp: 4.781 ± 0.0
3.643ValGlu: 3.643 ± 0.0
1.138ValPhe: 1.138 ± 0.0
1.821ValGly: 1.821 ± 0.0
1.366ValHis: 1.366 ± 0.0
7.058ValIle: 7.058 ± 0.0
3.643ValLys: 3.643 ± 0.0
5.92ValLeu: 5.92 ± 0.0
2.505ValMet: 2.505 ± 0.0
3.643ValAsn: 3.643 ± 0.0
3.188ValPro: 3.188 ± 0.0
1.594ValGln: 1.594 ± 0.0
4.098ValArg: 4.098 ± 0.0
4.554ValSer: 4.554 ± 0.0
6.603ValThr: 6.603 ± 0.0
4.781ValVal: 4.781 ± 0.0
0.911ValTrp: 0.911 ± 0.0
2.277ValTyr: 2.277 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.911TrpAla: 0.911 ± 0.0
0.228TrpCys: 0.228 ± 0.0
0.911TrpAsp: 0.911 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.455TrpPhe: 0.455 ± 0.0
0.455TrpGly: 0.455 ± 0.0
0.455TrpHis: 0.455 ± 0.0
0.455TrpIle: 0.455 ± 0.0
0.455TrpLys: 0.455 ± 0.0
2.96TrpLeu: 2.96 ± 0.0
0.911TrpMet: 0.911 ± 0.0
0.455TrpAsn: 0.455 ± 0.0
0.228TrpPro: 0.228 ± 0.0
0.683TrpGln: 0.683 ± 0.0
1.366TrpArg: 1.366 ± 0.0
0.683TrpSer: 0.683 ± 0.0
0.911TrpThr: 0.911 ± 0.0
0.455TrpVal: 0.455 ± 0.0
0.228TrpTrp: 0.228 ± 0.0
1.138TrpTyr: 1.138 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.277TyrAla: 2.277 ± 0.0
0.911TyrCys: 0.911 ± 0.0
1.366TyrAsp: 1.366 ± 0.0
2.732TyrGlu: 2.732 ± 0.0
1.366TyrPhe: 1.366 ± 0.0
1.594TyrGly: 1.594 ± 0.0
1.366TyrHis: 1.366 ± 0.0
2.049TyrIle: 2.049 ± 0.0
2.277TyrLys: 2.277 ± 0.0
2.505TyrLeu: 2.505 ± 0.0
0.911TyrMet: 0.911 ± 0.0
4.098TyrAsn: 4.098 ± 0.0
0.683TyrPro: 0.683 ± 0.0
1.594TyrGln: 1.594 ± 0.0
1.821TyrArg: 1.821 ± 0.0
2.505TyrSer: 2.505 ± 0.0
2.505TyrThr: 2.505 ± 0.0
1.821TyrVal: 1.821 ± 0.0
0.455TyrTrp: 0.455 ± 0.0
1.594TyrTyr: 1.594 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (4393 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski