Amino acid dipepetide frequency for Phytophthora infestans RNA virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.01AlaAla: 7.01 ± 0.0
2.157AlaCys: 2.157 ± 0.0
3.775AlaAsp: 3.775 ± 0.0
2.157AlaGlu: 2.157 ± 0.0
3.235AlaPhe: 3.235 ± 0.0
4.314AlaGly: 4.314 ± 0.0
1.618AlaHis: 1.618 ± 0.0
7.549AlaIle: 7.549 ± 0.0
2.696AlaLys: 2.696 ± 0.0
7.28AlaLeu: 7.28 ± 0.0
1.348AlaMet: 1.348 ± 0.0
5.123AlaAsn: 5.123 ± 0.0
3.775AlaPro: 3.775 ± 0.0
4.314AlaGln: 4.314 ± 0.0
3.505AlaArg: 3.505 ± 0.0
5.932AlaSer: 5.932 ± 0.0
5.662AlaThr: 5.662 ± 0.0
6.74AlaVal: 6.74 ± 0.0
1.348AlaTrp: 1.348 ± 0.0
1.618AlaTyr: 1.618 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.887CysAla: 1.887 ± 0.0
0.809CysCys: 0.809 ± 0.0
1.078CysAsp: 1.078 ± 0.0
0.539CysGlu: 0.539 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.078CysGly: 1.078 ± 0.0
0.539CysHis: 0.539 ± 0.0
1.078CysIle: 1.078 ± 0.0
0.809CysLys: 0.809 ± 0.0
1.078CysLeu: 1.078 ± 0.0
1.618CysMet: 1.618 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.809CysPro: 0.809 ± 0.0
1.078CysGln: 1.078 ± 0.0
0.539CysArg: 0.539 ± 0.0
1.887CysSer: 1.887 ± 0.0
1.078CysThr: 1.078 ± 0.0
2.427CysVal: 2.427 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.27CysTyr: 0.27 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.966AspAla: 2.966 ± 0.0
0.539AspCys: 0.539 ± 0.0
3.505AspAsp: 3.505 ± 0.0
3.505AspGlu: 3.505 ± 0.0
0.539AspPhe: 0.539 ± 0.0
4.314AspGly: 4.314 ± 0.0
1.887AspHis: 1.887 ± 0.0
4.314AspIle: 4.314 ± 0.0
2.966AspLys: 2.966 ± 0.0
2.696AspLeu: 2.696 ± 0.0
2.427AspMet: 2.427 ± 0.0
3.505AspAsn: 3.505 ± 0.0
1.348AspPro: 1.348 ± 0.0
1.887AspGln: 1.887 ± 0.0
2.696AspArg: 2.696 ± 0.0
3.775AspSer: 3.775 ± 0.0
2.427AspThr: 2.427 ± 0.0
3.505AspVal: 3.505 ± 0.0
1.618AspTrp: 1.618 ± 0.0
1.348AspTyr: 1.348 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
1.887GluAla: 1.887 ± 0.0
1.078GluCys: 1.078 ± 0.0
2.966GluAsp: 2.966 ± 0.0
2.427GluGlu: 2.427 ± 0.0
1.078GluPhe: 1.078 ± 0.0
4.583GluGly: 4.583 ± 0.0
0.539GluHis: 0.539 ± 0.0
4.853GluIle: 4.853 ± 0.0
2.966GluLys: 2.966 ± 0.0
2.696GluLeu: 2.696 ± 0.0
3.235GluMet: 3.235 ± 0.0
1.348GluAsn: 1.348 ± 0.0
3.235GluPro: 3.235 ± 0.0
2.157GluGln: 2.157 ± 0.0
4.044GluArg: 4.044 ± 0.0
1.348GluSer: 1.348 ± 0.0
2.427GluThr: 2.427 ± 0.0
1.887GluVal: 1.887 ± 0.0
1.348GluTrp: 1.348 ± 0.0
0.539GluTyr: 0.539 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.427PheAla: 2.427 ± 0.0
0.809PheCys: 0.809 ± 0.0
0.27PheAsp: 0.27 ± 0.0
0.809PheGlu: 0.809 ± 0.0
0.809PhePhe: 0.809 ± 0.0
2.157PheGly: 2.157 ± 0.0
1.078PheHis: 1.078 ± 0.0
1.618PheIle: 1.618 ± 0.0
1.078PheLys: 1.078 ± 0.0
1.348PheLeu: 1.348 ± 0.0
0.809PheMet: 0.809 ± 0.0
1.348PheAsn: 1.348 ± 0.0
0.0PhePro: 0.0 ± 0.0
0.809PheGln: 0.809 ± 0.0
1.618PheArg: 1.618 ± 0.0
2.157PheSer: 2.157 ± 0.0
2.696PheThr: 2.696 ± 0.0
1.348PheVal: 1.348 ± 0.0
0.539PheTrp: 0.539 ± 0.0
0.539PheTyr: 0.539 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
8.628GlyAla: 8.628 ± 0.0
1.887GlyCys: 1.887 ± 0.0
4.853GlyAsp: 4.853 ± 0.0
2.157GlyGlu: 2.157 ± 0.0
1.078GlyPhe: 1.078 ± 0.0
3.775GlyGly: 3.775 ± 0.0
1.618GlyHis: 1.618 ± 0.0
2.696GlyIle: 2.696 ± 0.0
3.505GlyLys: 3.505 ± 0.0
4.314GlyLeu: 4.314 ± 0.0
2.157GlyMet: 2.157 ± 0.0
2.427GlyAsn: 2.427 ± 0.0
2.157GlyPro: 2.157 ± 0.0
2.427GlyGln: 2.427 ± 0.0
2.157GlyArg: 2.157 ± 0.0
4.583GlySer: 4.583 ± 0.0
5.932GlyThr: 5.932 ± 0.0
4.044GlyVal: 4.044 ± 0.0
0.539GlyTrp: 0.539 ± 0.0
2.157GlyTyr: 2.157 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.618HisAla: 1.618 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.809HisAsp: 0.809 ± 0.0
1.078HisGlu: 1.078 ± 0.0
0.539HisPhe: 0.539 ± 0.0
1.348HisGly: 1.348 ± 0.0
1.078HisHis: 1.078 ± 0.0
1.618HisIle: 1.618 ± 0.0
0.539HisLys: 0.539 ± 0.0
1.887HisLeu: 1.887 ± 0.0
0.809HisMet: 0.809 ± 0.0
0.809HisAsn: 0.809 ± 0.0
0.27HisPro: 0.27 ± 0.0
0.27HisGln: 0.27 ± 0.0
1.348HisArg: 1.348 ± 0.0
2.427HisSer: 2.427 ± 0.0
4.044HisThr: 4.044 ± 0.0
1.618HisVal: 1.618 ± 0.0
0.27HisTrp: 0.27 ± 0.0
0.27HisTyr: 0.27 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.123IleAla: 5.123 ± 0.0
1.618IleCys: 1.618 ± 0.0
3.235IleAsp: 3.235 ± 0.0
3.505IleGlu: 3.505 ± 0.0
1.078IlePhe: 1.078 ± 0.0
5.392IleGly: 5.392 ± 0.0
0.27IleHis: 0.27 ± 0.0
3.235IleIle: 3.235 ± 0.0
3.505IleLys: 3.505 ± 0.0
5.123IleLeu: 5.123 ± 0.0
2.157IleMet: 2.157 ± 0.0
2.157IleAsn: 2.157 ± 0.0
2.966IlePro: 2.966 ± 0.0
1.348IleGln: 1.348 ± 0.0
5.392IleArg: 5.392 ± 0.0
7.01IleSer: 7.01 ± 0.0
5.392IleThr: 5.392 ± 0.0
5.662IleVal: 5.662 ± 0.0
0.539IleTrp: 0.539 ± 0.0
2.157IleTyr: 2.157 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.696LysAla: 2.696 ± 0.0
1.078LysCys: 1.078 ± 0.0
2.966LysAsp: 2.966 ± 0.0
3.775LysGlu: 3.775 ± 0.0
1.887LysPhe: 1.887 ± 0.0
2.157LysGly: 2.157 ± 0.0
1.887LysHis: 1.887 ± 0.0
1.887LysIle: 1.887 ± 0.0
2.427LysLys: 2.427 ± 0.0
2.696LysLeu: 2.696 ± 0.0
1.887LysMet: 1.887 ± 0.0
1.618LysAsn: 1.618 ± 0.0
1.887LysPro: 1.887 ± 0.0
3.505LysGln: 3.505 ± 0.0
5.662LysArg: 5.662 ± 0.0
1.618LysSer: 1.618 ± 0.0
3.775LysThr: 3.775 ± 0.0
2.966LysVal: 2.966 ± 0.0
1.348LysTrp: 1.348 ± 0.0
1.887LysTyr: 1.887 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
7.819LeuAla: 7.819 ± 0.0
1.618LeuCys: 1.618 ± 0.0
2.696LeuAsp: 2.696 ± 0.0
4.044LeuGlu: 4.044 ± 0.0
0.809LeuPhe: 0.809 ± 0.0
4.044LeuGly: 4.044 ± 0.0
2.427LeuHis: 2.427 ± 0.0
4.044LeuIle: 4.044 ± 0.0
4.853LeuLys: 4.853 ± 0.0
7.819LeuLeu: 7.819 ± 0.0
3.505LeuMet: 3.505 ± 0.0
2.427LeuAsn: 2.427 ± 0.0
2.427LeuPro: 2.427 ± 0.0
1.078LeuGln: 1.078 ± 0.0
4.583LeuArg: 4.583 ± 0.0
6.471LeuSer: 6.471 ± 0.0
8.088LeuThr: 8.088 ± 0.0
9.706LeuVal: 9.706 ± 0.0
0.809LeuTrp: 0.809 ± 0.0
2.966LeuTyr: 2.966 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
4.853MetAla: 4.853 ± 0.0
0.539MetCys: 0.539 ± 0.0
1.348MetAsp: 1.348 ± 0.0
1.887MetGlu: 1.887 ± 0.0
0.809MetPhe: 0.809 ± 0.0
2.966MetGly: 2.966 ± 0.0
0.27MetHis: 0.27 ± 0.0
3.505MetIle: 3.505 ± 0.0
1.618MetLys: 1.618 ± 0.0
4.583MetLeu: 4.583 ± 0.0
1.348MetMet: 1.348 ± 0.0
0.809MetAsn: 0.809 ± 0.0
2.427MetPro: 2.427 ± 0.0
0.809MetGln: 0.809 ± 0.0
1.887MetArg: 1.887 ± 0.0
4.583MetSer: 4.583 ± 0.0
2.696MetThr: 2.696 ± 0.0
2.966MetVal: 2.966 ± 0.0
1.348MetTrp: 1.348 ± 0.0
1.348MetTyr: 1.348 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
5.123AsnAla: 5.123 ± 0.0
0.539AsnCys: 0.539 ± 0.0
2.966AsnAsp: 2.966 ± 0.0
1.887AsnGlu: 1.887 ± 0.0
0.27AsnPhe: 0.27 ± 0.0
1.348AsnGly: 1.348 ± 0.0
0.809AsnHis: 0.809 ± 0.0
1.078AsnIle: 1.078 ± 0.0
1.618AsnLys: 1.618 ± 0.0
2.966AsnLeu: 2.966 ± 0.0
1.348AsnMet: 1.348 ± 0.0
1.887AsnAsn: 1.887 ± 0.0
1.887AsnPro: 1.887 ± 0.0
0.539AsnGln: 0.539 ± 0.0
3.505AsnArg: 3.505 ± 0.0
3.775AsnSer: 3.775 ± 0.0
3.775AsnThr: 3.775 ± 0.0
3.235AsnVal: 3.235 ± 0.0
0.539AsnTrp: 0.539 ± 0.0
0.539AsnTyr: 0.539 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.157ProAla: 2.157 ± 0.0
0.539ProCys: 0.539 ± 0.0
2.157ProAsp: 2.157 ± 0.0
2.696ProGlu: 2.696 ± 0.0
0.809ProPhe: 0.809 ± 0.0
2.157ProGly: 2.157 ± 0.0
0.539ProHis: 0.539 ± 0.0
2.427ProIle: 2.427 ± 0.0
2.696ProLys: 2.696 ± 0.0
3.235ProLeu: 3.235 ± 0.0
1.618ProMet: 1.618 ± 0.0
2.696ProAsn: 2.696 ± 0.0
2.427ProPro: 2.427 ± 0.0
0.539ProGln: 0.539 ± 0.0
2.157ProArg: 2.157 ± 0.0
2.696ProSer: 2.696 ± 0.0
4.314ProThr: 4.314 ± 0.0
2.157ProVal: 2.157 ± 0.0
0.0ProTrp: 0.0 ± 0.0
2.427ProTyr: 2.427 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.618GlnAla: 1.618 ± 0.0
1.078GlnCys: 1.078 ± 0.0
1.618GlnAsp: 1.618 ± 0.0
1.618GlnGlu: 1.618 ± 0.0
1.618GlnPhe: 1.618 ± 0.0
1.078GlnGly: 1.078 ± 0.0
0.539GlnHis: 0.539 ± 0.0
1.618GlnIle: 1.618 ± 0.0
1.887GlnLys: 1.887 ± 0.0
4.583GlnLeu: 4.583 ± 0.0
1.618GlnMet: 1.618 ± 0.0
1.078GlnAsn: 1.078 ± 0.0
1.618GlnPro: 1.618 ± 0.0
1.348GlnGln: 1.348 ± 0.0
1.348GlnArg: 1.348 ± 0.0
2.696GlnSer: 2.696 ± 0.0
2.696GlnThr: 2.696 ± 0.0
2.966GlnVal: 2.966 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
1.348GlnTyr: 1.348 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.775ArgAla: 3.775 ± 0.0
0.809ArgCys: 0.809 ± 0.0
2.427ArgAsp: 2.427 ± 0.0
2.696ArgGlu: 2.696 ± 0.0
2.157ArgPhe: 2.157 ± 0.0
3.235ArgGly: 3.235 ± 0.0
1.078ArgHis: 1.078 ± 0.0
3.775ArgIle: 3.775 ± 0.0
3.235ArgLys: 3.235 ± 0.0
5.932ArgLeu: 5.932 ± 0.0
3.235ArgMet: 3.235 ± 0.0
1.887ArgAsn: 1.887 ± 0.0
1.618ArgPro: 1.618 ± 0.0
2.157ArgGln: 2.157 ± 0.0
4.314ArgArg: 4.314 ± 0.0
4.044ArgSer: 4.044 ± 0.0
4.583ArgThr: 4.583 ± 0.0
4.853ArgVal: 4.853 ± 0.0
0.27ArgTrp: 0.27 ± 0.0
3.235ArgTyr: 3.235 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
6.471SerAla: 6.471 ± 0.0
0.809SerCys: 0.809 ± 0.0
3.235SerAsp: 3.235 ± 0.0
3.235SerGlu: 3.235 ± 0.0
1.887SerPhe: 1.887 ± 0.0
4.583SerGly: 4.583 ± 0.0
2.427SerHis: 2.427 ± 0.0
6.74SerIle: 6.74 ± 0.0
2.966SerLys: 2.966 ± 0.0
7.549SerLeu: 7.549 ± 0.0
3.235SerMet: 3.235 ± 0.0
2.696SerAsn: 2.696 ± 0.0
2.427SerPro: 2.427 ± 0.0
2.157SerGln: 2.157 ± 0.0
5.123SerArg: 5.123 ± 0.0
3.235SerSer: 3.235 ± 0.0
7.01SerThr: 7.01 ± 0.0
3.505SerVal: 3.505 ± 0.0
1.348SerTrp: 1.348 ± 0.0
2.966SerTyr: 2.966 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.505ThrAla: 3.505 ± 0.0
0.539ThrCys: 0.539 ± 0.0
6.471ThrAsp: 6.471 ± 0.0
2.966ThrGlu: 2.966 ± 0.0
2.427ThrPhe: 2.427 ± 0.0
4.044ThrGly: 4.044 ± 0.0
1.618ThrHis: 1.618 ± 0.0
5.123ThrIle: 5.123 ± 0.0
4.583ThrLys: 4.583 ± 0.0
6.201ThrLeu: 6.201 ± 0.0
4.314ThrMet: 4.314 ± 0.0
2.966ThrAsn: 2.966 ± 0.0
4.853ThrPro: 4.853 ± 0.0
3.235ThrGln: 3.235 ± 0.0
3.505ThrArg: 3.505 ± 0.0
9.167ThrSer: 9.167 ± 0.0
7.819ThrThr: 7.819 ± 0.0
5.662ThrVal: 5.662 ± 0.0
0.539ThrTrp: 0.539 ± 0.0
1.887ThrTyr: 1.887 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
7.28ValAla: 7.28 ± 0.0
0.809ValCys: 0.809 ± 0.0
3.235ValAsp: 3.235 ± 0.0
3.235ValGlu: 3.235 ± 0.0
2.157ValPhe: 2.157 ± 0.0
7.819ValGly: 7.819 ± 0.0
1.348ValHis: 1.348 ± 0.0
5.932ValIle: 5.932 ± 0.0
3.775ValLys: 3.775 ± 0.0
6.201ValLeu: 6.201 ± 0.0
3.235ValMet: 3.235 ± 0.0
3.235ValAsn: 3.235 ± 0.0
2.966ValPro: 2.966 ± 0.0
3.775ValGln: 3.775 ± 0.0
4.044ValArg: 4.044 ± 0.0
3.235ValSer: 3.235 ± 0.0
4.314ValThr: 4.314 ± 0.0
9.437ValVal: 9.437 ± 0.0
0.539ValTrp: 0.539 ± 0.0
1.348ValTyr: 1.348 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.618TrpAla: 1.618 ± 0.0
0.539TrpCys: 0.539 ± 0.0
0.27TrpAsp: 0.27 ± 0.0
1.078TrpGlu: 1.078 ± 0.0
0.809TrpPhe: 0.809 ± 0.0
0.539TrpGly: 0.539 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.27TrpIle: 0.27 ± 0.0
0.539TrpLys: 0.539 ± 0.0
2.157TrpLeu: 2.157 ± 0.0
0.539TrpMet: 0.539 ± 0.0
0.27TrpAsn: 0.27 ± 0.0
0.539TrpPro: 0.539 ± 0.0
0.27TrpGln: 0.27 ± 0.0
0.809TrpArg: 0.809 ± 0.0
1.887TrpSer: 1.887 ± 0.0
0.27TrpThr: 0.27 ± 0.0
0.809TrpVal: 0.809 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.809TrpTyr: 0.809 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.696TyrAla: 2.696 ± 0.0
0.809TyrCys: 0.809 ± 0.0
1.887TyrAsp: 1.887 ± 0.0
1.348TyrGlu: 1.348 ± 0.0
0.27TyrPhe: 0.27 ± 0.0
2.157TyrGly: 2.157 ± 0.0
1.348TyrHis: 1.348 ± 0.0
3.505TyrIle: 3.505 ± 0.0
1.348TyrLys: 1.348 ± 0.0
1.887TyrLeu: 1.887 ± 0.0
1.887TyrMet: 1.887 ± 0.0
1.348TyrAsn: 1.348 ± 0.0
0.809TyrPro: 0.809 ± 0.0
0.27TyrGln: 0.27 ± 0.0
1.078TyrArg: 1.078 ± 0.0
1.078TyrSer: 1.078 ± 0.0
2.427TyrThr: 2.427 ± 0.0
2.427TyrVal: 2.427 ± 0.0
1.078TyrTrp: 1.078 ± 0.0
0.539TyrTyr: 0.539 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3710 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski