Amino acid dipepetide frequency for Beihai zhaovirus-like virus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.747AlaAla: 0.747 ± 0.0
1.12AlaCys: 1.12 ± 0.0
2.24AlaAsp: 2.24 ± 0.0
4.479AlaGlu: 4.479 ± 0.0
2.986AlaPhe: 2.986 ± 0.0
4.106AlaGly: 4.106 ± 0.0
1.12AlaHis: 1.12 ± 0.0
4.479AlaIle: 4.479 ± 0.0
5.972AlaLys: 5.972 ± 0.0
5.599AlaLeu: 5.599 ± 0.0
1.493AlaMet: 1.493 ± 0.0
3.733AlaAsn: 3.733 ± 0.0
2.986AlaPro: 2.986 ± 0.0
3.733AlaGln: 3.733 ± 0.0
2.24AlaArg: 2.24 ± 0.0
3.733AlaSer: 3.733 ± 0.0
1.12AlaThr: 1.12 ± 0.0
3.359AlaVal: 3.359 ± 0.0
2.24AlaTrp: 2.24 ± 0.0
2.613AlaTyr: 2.613 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.12CysAla: 1.12 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.747CysAsp: 0.747 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.747CysPhe: 0.747 ± 0.0
0.373CysGly: 0.373 ± 0.0
0.0CysHis: 0.0 ± 0.0
2.24CysIle: 2.24 ± 0.0
1.12CysLys: 1.12 ± 0.0
3.359CysLeu: 3.359 ± 0.0
0.373CysMet: 0.373 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.747CysPro: 0.747 ± 0.0
1.493CysGln: 1.493 ± 0.0
0.747CysArg: 0.747 ± 0.0
1.12CysSer: 1.12 ± 0.0
1.493CysThr: 1.493 ± 0.0
0.747CysVal: 0.747 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.373CysTyr: 0.373 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.479AspAla: 4.479 ± 0.0
1.12AspCys: 1.12 ± 0.0
2.24AspAsp: 2.24 ± 0.0
3.359AspGlu: 3.359 ± 0.0
2.24AspPhe: 2.24 ± 0.0
3.359AspGly: 3.359 ± 0.0
1.12AspHis: 1.12 ± 0.0
1.866AspIle: 1.866 ± 0.0
3.359AspLys: 3.359 ± 0.0
6.346AspLeu: 6.346 ± 0.0
1.12AspMet: 1.12 ± 0.0
2.986AspAsn: 2.986 ± 0.0
2.986AspPro: 2.986 ± 0.0
2.613AspGln: 2.613 ± 0.0
2.24AspArg: 2.24 ± 0.0
3.733AspSer: 3.733 ± 0.0
1.12AspThr: 1.12 ± 0.0
4.479AspVal: 4.479 ± 0.0
2.24AspTrp: 2.24 ± 0.0
2.24AspTyr: 2.24 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.359GluAla: 3.359 ± 0.0
0.747GluCys: 0.747 ± 0.0
4.479GluAsp: 4.479 ± 0.0
1.493GluGlu: 1.493 ± 0.0
2.986GluPhe: 2.986 ± 0.0
2.613GluGly: 2.613 ± 0.0
1.12GluHis: 1.12 ± 0.0
3.359GluIle: 3.359 ± 0.0
3.733GluLys: 3.733 ± 0.0
2.613GluLeu: 2.613 ± 0.0
1.12GluMet: 1.12 ± 0.0
1.866GluAsn: 1.866 ± 0.0
2.613GluPro: 2.613 ± 0.0
4.479GluGln: 4.479 ± 0.0
4.479GluArg: 4.479 ± 0.0
5.599GluSer: 5.599 ± 0.0
1.866GluThr: 1.866 ± 0.0
4.106GluVal: 4.106 ± 0.0
0.747GluTrp: 0.747 ± 0.0
1.866GluTyr: 1.866 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.866PheAla: 1.866 ± 0.0
1.866PheCys: 1.866 ± 0.0
1.493PheAsp: 1.493 ± 0.0
1.866PheGlu: 1.866 ± 0.0
2.986PhePhe: 2.986 ± 0.0
2.613PheGly: 2.613 ± 0.0
0.373PheHis: 0.373 ± 0.0
1.12PheIle: 1.12 ± 0.0
3.733PheLys: 3.733 ± 0.0
2.613PheLeu: 2.613 ± 0.0
0.747PheMet: 0.747 ± 0.0
4.106PheAsn: 4.106 ± 0.0
1.493PhePro: 1.493 ± 0.0
1.866PheGln: 1.866 ± 0.0
0.747PheArg: 0.747 ± 0.0
2.613PheSer: 2.613 ± 0.0
1.866PheThr: 1.866 ± 0.0
2.986PheVal: 2.986 ± 0.0
1.493PheTrp: 1.493 ± 0.0
1.12PheTyr: 1.12 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
1.493GlyAla: 1.493 ± 0.0
1.12GlyCys: 1.12 ± 0.0
3.733GlyAsp: 3.733 ± 0.0
1.866GlyGlu: 1.866 ± 0.0
2.613GlyPhe: 2.613 ± 0.0
1.866GlyGly: 1.866 ± 0.0
1.866GlyHis: 1.866 ± 0.0
3.359GlyIle: 3.359 ± 0.0
1.866GlyLys: 1.866 ± 0.0
5.599GlyLeu: 5.599 ± 0.0
2.24GlyMet: 2.24 ± 0.0
4.479GlyAsn: 4.479 ± 0.0
1.493GlyPro: 1.493 ± 0.0
4.479GlyGln: 4.479 ± 0.0
3.733GlyArg: 3.733 ± 0.0
4.106GlySer: 4.106 ± 0.0
3.733GlyThr: 3.733 ± 0.0
2.24GlyVal: 2.24 ± 0.0
0.747GlyTrp: 0.747 ± 0.0
2.24GlyTyr: 2.24 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.866HisAla: 1.866 ± 0.0
0.0HisCys: 0.0 ± 0.0
2.24HisAsp: 2.24 ± 0.0
1.12HisGlu: 1.12 ± 0.0
0.747HisPhe: 0.747 ± 0.0
0.373HisGly: 0.373 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.12HisIle: 1.12 ± 0.0
1.866HisLys: 1.866 ± 0.0
2.613HisLeu: 2.613 ± 0.0
0.0HisMet: 0.0 ± 0.0
2.986HisAsn: 2.986 ± 0.0
0.747HisPro: 0.747 ± 0.0
0.747HisGln: 0.747 ± 0.0
1.493HisArg: 1.493 ± 0.0
1.493HisSer: 1.493 ± 0.0
1.493HisThr: 1.493 ± 0.0
1.493HisVal: 1.493 ± 0.0
0.373HisTrp: 0.373 ± 0.0
1.493HisTyr: 1.493 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.359IleAla: 3.359 ± 0.0
1.12IleCys: 1.12 ± 0.0
3.359IleAsp: 3.359 ± 0.0
2.986IleGlu: 2.986 ± 0.0
2.24IlePhe: 2.24 ± 0.0
1.12IleGly: 1.12 ± 0.0
2.24IleHis: 2.24 ± 0.0
1.493IleIle: 1.493 ± 0.0
4.853IleLys: 4.853 ± 0.0
4.479IleLeu: 4.479 ± 0.0
1.493IleMet: 1.493 ± 0.0
2.24IleAsn: 2.24 ± 0.0
2.613IlePro: 2.613 ± 0.0
5.599IleGln: 5.599 ± 0.0
2.613IleArg: 2.613 ± 0.0
2.986IleSer: 2.986 ± 0.0
3.733IleThr: 3.733 ± 0.0
2.986IleVal: 2.986 ± 0.0
0.373IleTrp: 0.373 ± 0.0
1.493IleTyr: 1.493 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.853LysAla: 4.853 ± 0.0
0.747LysCys: 0.747 ± 0.0
2.986LysAsp: 2.986 ± 0.0
3.733LysGlu: 3.733 ± 0.0
2.613LysPhe: 2.613 ± 0.0
3.359LysGly: 3.359 ± 0.0
1.866LysHis: 1.866 ± 0.0
3.359LysIle: 3.359 ± 0.0
5.972LysLys: 5.972 ± 0.0
4.853LysLeu: 4.853 ± 0.0
2.613LysMet: 2.613 ± 0.0
3.733LysAsn: 3.733 ± 0.0
2.986LysPro: 2.986 ± 0.0
6.346LysGln: 6.346 ± 0.0
3.359LysArg: 3.359 ± 0.0
3.359LysSer: 3.359 ± 0.0
4.106LysThr: 4.106 ± 0.0
5.599LysVal: 5.599 ± 0.0
1.12LysTrp: 1.12 ± 0.0
2.613LysTyr: 2.613 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.346LeuAla: 6.346 ± 0.0
2.613LeuCys: 2.613 ± 0.0
4.479LeuAsp: 4.479 ± 0.0
5.599LeuGlu: 5.599 ± 0.0
1.866LeuPhe: 1.866 ± 0.0
4.853LeuGly: 4.853 ± 0.0
1.866LeuHis: 1.866 ± 0.0
4.106LeuIle: 4.106 ± 0.0
4.106LeuLys: 4.106 ± 0.0
5.226LeuLeu: 5.226 ± 0.0
4.106LeuMet: 4.106 ± 0.0
2.24LeuAsn: 2.24 ± 0.0
4.853LeuPro: 4.853 ± 0.0
5.972LeuGln: 5.972 ± 0.0
4.106LeuArg: 4.106 ± 0.0
7.092LeuSer: 7.092 ± 0.0
2.613LeuThr: 2.613 ± 0.0
6.719LeuVal: 6.719 ± 0.0
1.12LeuTrp: 1.12 ± 0.0
4.106LeuTyr: 4.106 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.12MetAla: 1.12 ± 0.0
0.747MetCys: 0.747 ± 0.0
1.493MetAsp: 1.493 ± 0.0
1.866MetGlu: 1.866 ± 0.0
0.0MetPhe: 0.0 ± 0.0
0.373MetGly: 0.373 ± 0.0
0.373MetHis: 0.373 ± 0.0
1.493MetIle: 1.493 ± 0.0
3.359MetLys: 3.359 ± 0.0
1.866MetLeu: 1.866 ± 0.0
0.373MetMet: 0.373 ± 0.0
1.866MetAsn: 1.866 ± 0.0
0.373MetPro: 0.373 ± 0.0
2.24MetGln: 2.24 ± 0.0
1.866MetArg: 1.866 ± 0.0
2.613MetSer: 2.613 ± 0.0
2.986MetThr: 2.986 ± 0.0
2.24MetVal: 2.24 ± 0.0
0.373MetTrp: 0.373 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.986AsnAla: 2.986 ± 0.0
0.747AsnCys: 0.747 ± 0.0
1.866AsnAsp: 1.866 ± 0.0
4.853AsnGlu: 4.853 ± 0.0
1.866AsnPhe: 1.866 ± 0.0
3.359AsnGly: 3.359 ± 0.0
1.866AsnHis: 1.866 ± 0.0
3.359AsnIle: 3.359 ± 0.0
2.24AsnLys: 2.24 ± 0.0
4.853AsnLeu: 4.853 ± 0.0
1.493AsnMet: 1.493 ± 0.0
2.986AsnAsn: 2.986 ± 0.0
3.733AsnPro: 3.733 ± 0.0
2.24AsnGln: 2.24 ± 0.0
2.613AsnArg: 2.613 ± 0.0
2.613AsnSer: 2.613 ± 0.0
4.853AsnThr: 4.853 ± 0.0
2.613AsnVal: 2.613 ± 0.0
1.12AsnTrp: 1.12 ± 0.0
1.493AsnTyr: 1.493 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.12ProAla: 1.12 ± 0.0
0.373ProCys: 0.373 ± 0.0
1.866ProAsp: 1.866 ± 0.0
4.853ProGlu: 4.853 ± 0.0
1.866ProPhe: 1.866 ± 0.0
4.106ProGly: 4.106 ± 0.0
1.866ProHis: 1.866 ± 0.0
2.613ProIle: 2.613 ± 0.0
2.986ProLys: 2.986 ± 0.0
4.479ProLeu: 4.479 ± 0.0
1.866ProMet: 1.866 ± 0.0
1.493ProAsn: 1.493 ± 0.0
2.613ProPro: 2.613 ± 0.0
3.359ProGln: 3.359 ± 0.0
1.493ProArg: 1.493 ± 0.0
2.613ProSer: 2.613 ± 0.0
1.12ProThr: 1.12 ± 0.0
2.24ProVal: 2.24 ± 0.0
1.12ProTrp: 1.12 ± 0.0
1.866ProTyr: 1.866 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.733GlnAla: 3.733 ± 0.0
1.12GlnCys: 1.12 ± 0.0
2.613GlnAsp: 2.613 ± 0.0
5.226GlnGlu: 5.226 ± 0.0
2.613GlnPhe: 2.613 ± 0.0
3.359GlnGly: 3.359 ± 0.0
1.866GlnHis: 1.866 ± 0.0
5.226GlnIle: 5.226 ± 0.0
4.479GlnLys: 4.479 ± 0.0
7.092GlnLeu: 7.092 ± 0.0
2.24GlnMet: 2.24 ± 0.0
4.106GlnAsn: 4.106 ± 0.0
1.493GlnPro: 1.493 ± 0.0
6.719GlnGln: 6.719 ± 0.0
3.733GlnArg: 3.733 ± 0.0
4.853GlnSer: 4.853 ± 0.0
2.613GlnThr: 2.613 ± 0.0
4.853GlnVal: 4.853 ± 0.0
0.373GlnTrp: 0.373 ± 0.0
2.613GlnTyr: 2.613 ± 0.0
0.373GlnXaa: 0.373 ± 0.0
Arg
2.613ArgAla: 2.613 ± 0.0
0.373ArgCys: 0.373 ± 0.0
3.733ArgAsp: 3.733 ± 0.0
1.866ArgGlu: 1.866 ± 0.0
1.12ArgPhe: 1.12 ± 0.0
2.613ArgGly: 2.613 ± 0.0
1.12ArgHis: 1.12 ± 0.0
2.24ArgIle: 2.24 ± 0.0
4.479ArgLys: 4.479 ± 0.0
4.106ArgLeu: 4.106 ± 0.0
1.12ArgMet: 1.12 ± 0.0
2.24ArgAsn: 2.24 ± 0.0
4.106ArgPro: 4.106 ± 0.0
4.106ArgGln: 4.106 ± 0.0
1.493ArgArg: 1.493 ± 0.0
2.613ArgSer: 2.613 ± 0.0
2.24ArgThr: 2.24 ± 0.0
4.479ArgVal: 4.479 ± 0.0
0.373ArgTrp: 0.373 ± 0.0
0.747ArgTyr: 0.747 ± 0.0
0.373ArgXaa: 0.373 ± 0.0
Ser
4.479SerAla: 4.479 ± 0.0
0.747SerCys: 0.747 ± 0.0
4.853SerAsp: 4.853 ± 0.0
2.613SerGlu: 2.613 ± 0.0
2.24SerPhe: 2.24 ± 0.0
6.346SerGly: 6.346 ± 0.0
1.866SerHis: 1.866 ± 0.0
2.613SerIle: 2.613 ± 0.0
5.226SerLys: 5.226 ± 0.0
5.599SerLeu: 5.599 ± 0.0
1.866SerMet: 1.866 ± 0.0
2.986SerAsn: 2.986 ± 0.0
2.986SerPro: 2.986 ± 0.0
4.853SerGln: 4.853 ± 0.0
4.106SerArg: 4.106 ± 0.0
7.465SerSer: 7.465 ± 0.0
3.359SerThr: 3.359 ± 0.0
2.613SerVal: 2.613 ± 0.0
1.12SerTrp: 1.12 ± 0.0
3.733SerTyr: 3.733 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.359ThrAla: 3.359 ± 0.0
0.373ThrCys: 0.373 ± 0.0
3.733ThrAsp: 3.733 ± 0.0
2.24ThrGlu: 2.24 ± 0.0
3.733ThrPhe: 3.733 ± 0.0
3.359ThrGly: 3.359 ± 0.0
1.866ThrHis: 1.866 ± 0.0
2.986ThrIle: 2.986 ± 0.0
4.106ThrLys: 4.106 ± 0.0
5.226ThrLeu: 5.226 ± 0.0
1.12ThrMet: 1.12 ± 0.0
1.866ThrAsn: 1.866 ± 0.0
2.24ThrPro: 2.24 ± 0.0
1.12ThrGln: 1.12 ± 0.0
2.613ThrArg: 2.613 ± 0.0
3.359ThrSer: 3.359 ± 0.0
3.733ThrThr: 3.733 ± 0.0
2.613ThrVal: 2.613 ± 0.0
0.747ThrTrp: 0.747 ± 0.0
0.373ThrTyr: 0.373 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
5.599ValAla: 5.599 ± 0.0
0.747ValCys: 0.747 ± 0.0
3.359ValAsp: 3.359 ± 0.0
2.986ValGlu: 2.986 ± 0.0
1.866ValPhe: 1.866 ± 0.0
2.986ValGly: 2.986 ± 0.0
1.493ValHis: 1.493 ± 0.0
2.986ValIle: 2.986 ± 0.0
4.853ValLys: 4.853 ± 0.0
3.359ValLeu: 3.359 ± 0.0
1.12ValMet: 1.12 ± 0.0
5.226ValAsn: 5.226 ± 0.0
1.493ValPro: 1.493 ± 0.0
4.853ValGln: 4.853 ± 0.0
2.986ValArg: 2.986 ± 0.0
5.599ValSer: 5.599 ± 0.0
3.359ValThr: 3.359 ± 0.0
3.359ValVal: 3.359 ± 0.0
1.12ValTrp: 1.12 ± 0.0
3.359ValTyr: 3.359 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.12TrpAla: 1.12 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.493TrpAsp: 1.493 ± 0.0
1.12TrpGlu: 1.12 ± 0.0
0.747TrpPhe: 0.747 ± 0.0
0.373TrpGly: 0.373 ± 0.0
0.373TrpHis: 0.373 ± 0.0
1.12TrpIle: 1.12 ± 0.0
1.12TrpLys: 1.12 ± 0.0
1.12TrpLeu: 1.12 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.12TrpAsn: 1.12 ± 0.0
1.12TrpPro: 1.12 ± 0.0
1.12TrpGln: 1.12 ± 0.0
0.373TrpArg: 0.373 ± 0.0
1.12TrpSer: 1.12 ± 0.0
1.493TrpThr: 1.493 ± 0.0
0.373TrpVal: 0.373 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.493TrpTyr: 1.493 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.479TyrAla: 4.479 ± 0.0
1.12TyrCys: 1.12 ± 0.0
2.24TyrAsp: 2.24 ± 0.0
1.12TyrGlu: 1.12 ± 0.0
1.493TyrPhe: 1.493 ± 0.0
3.359TyrGly: 3.359 ± 0.0
0.0TyrHis: 0.0 ± 0.0
2.24TyrIle: 2.24 ± 0.0
1.12TyrLys: 1.12 ± 0.0
2.986TyrLeu: 2.986 ± 0.0
0.373TyrMet: 0.373 ± 0.0
1.493TyrAsn: 1.493 ± 0.0
2.24TyrPro: 2.24 ± 0.0
3.359TyrGln: 3.359 ± 0.0
1.12TyrArg: 1.12 ± 0.0
2.986TyrSer: 2.986 ± 0.0
1.866TyrThr: 1.866 ± 0.0
2.24TyrVal: 2.24 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
1.12TyrTyr: 1.12 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.373XaaMet: 0.373 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.373XaaVal: 0.373 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2680 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski