Amino acid dipeptide frequency for Zhejiang mosquito virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
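The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: it assumes dipeptides are counted over overlapping windows, that any residue outside the 20 standard one-letter codes is mapped to X (Xaa), and the function name `dipeptide_permille` and the toy sequence are made up for the example.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Expected frequency of each of the 400 standard dipeptides if all were equally likely.
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(seq):
    """Return {dipeptide: frequency in per mille} over overlapping windows."""
    # Assumption: anything that is not a standard residue becomes 'X' (Xaa).
    cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
    pairs = [cleaned[i:i + 2] for i in range(len(cleaned) - 1)]
    if not pairs:
        return {}
    counts = Counter(pairs)
    return {dp: 1000.0 * n / len(pairs) for dp, n in counts.items()}


# Hypothetical toy sequence, not taken from the virus proteome.
toy = "MKTLLTAKTLLA"
freqs = dipeptide_permille(toy)
over_represented = sorted(dp for dp, f in freqs.items() if f > UNIFORM_PERMILLE)
```

In a sequence this short every observed dipeptide exceeds 2.5‰, so the comparison with the uniform expectation only becomes informative for sequences with thousands of residues.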

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
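As a small worked example of this unit conversion, the sketch below turns a per mille value into a fraction and into an implied count. It assumes overlapping windows within a single protein of 3113 residues (the size reported in the statistics line), which gives 3112 dipeptide windows; that window count is an inference, not something the page states, and the helper name `permille_to_fraction` is made up.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


N_RESIDUES = 3113            # total amino acids reported for this proteome
N_WINDOWS = N_RESIDUES - 1   # assumption: one protein, overlapping dipeptides

ala_ala_fraction = permille_to_fraction(3.213)        # AlaAla value from the table
implied_count = round(ala_ala_fraction * N_WINDOWS)   # occurrences this would imply

uniform_permille = 1000 / 400  # 0.25 % expressed in per mille
```

Under these assumptions 3.213‰ corresponds to 10 occurrences of AlaAla, and the smallest non-zero value in the table, 0.321‰, corresponds to a single occurrence.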

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰). Rows give the first residue, columns the second. The estimated error of every value is ± 0.0.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  3.213  1.607  2.571  4.177  2.892  5.141  1.607  5.141  6.105  7.069  1.928  2.571  2.571  3.535  4.499  6.748  5.463   4.82    0.0  3.213    0.0
Cys  0.643    0.0  1.285  0.643  0.643  0.964    0.0  0.964  0.321  0.964  0.964    0.0  0.643    0.0  0.643  1.928  0.964  1.285  0.964  0.643    0.0
Asp  5.463  0.964  3.213  2.249  3.856  1.928  1.285   4.82  3.856  3.535  0.321  5.463  2.892  0.964  2.571  2.892  2.892  3.213  0.643  0.964    0.0
Glu   4.82  0.964  3.856  3.856  2.249  2.571  0.643  3.213  3.213  5.141  1.928  1.285  1.928  2.571  2.892  3.213  1.607  3.535  1.607  3.535    0.0
Phe  2.249  1.285  1.607  1.928  2.249  1.607  0.643  2.892  3.535  3.213  0.643  3.213  1.607  2.249  2.892  5.463  3.535  3.213  0.321  1.607    0.0
Gly  3.535    0.0  3.213  0.964  1.928  1.607  0.643  5.141  3.213  7.069  1.928  4.499  1.928  1.928  1.607  4.177  1.607  2.571  0.964  1.285    0.0
His  0.643  1.285  0.321  1.607  1.285  1.607  0.643  0.643  1.285  1.928  0.643  1.285  1.285  0.643  0.321  0.964  2.571  2.249    0.0  0.964    0.0
Ile  3.535  0.643  4.499  4.499  1.607  3.856  2.571   4.82   4.82  6.427  2.571  3.856  1.928  2.249  2.249  5.784  4.499  2.571  0.964  2.892    0.0
Lys  3.535  1.607  5.784  5.784  3.535  2.571  2.892  2.892  3.213  4.499  2.571  3.213  2.571  1.607  0.964  4.177  7.069  5.141  0.321  3.213    0.0
Leu  7.069  1.285  5.463  3.535  3.856  4.177  1.928  5.463  6.105  3.213  0.964  5.141  6.427  3.856  4.177  4.177  8.997  2.892  0.643  2.571    0.0
Met  3.213    0.0  2.571  0.964  0.964  1.285  0.321  0.643  1.285  3.213  0.643  3.213    0.0  1.285  0.643  2.892  1.607  1.285  0.643  0.321    0.0
Asn  5.463  0.964  2.571  3.535  1.928  1.607  1.285  4.499  3.213  5.784  3.856  3.535  2.892  4.177  0.643  4.177  2.249  3.856  0.321  1.285    0.0
Pro  3.535  0.321  1.607  2.249  1.285  2.892  1.607  2.892  2.892  4.177  0.321  1.607  1.285  1.285  1.285  2.571  4.177  3.213  0.321  1.928    0.0
Gln  3.213  0.643  2.249  2.571  2.571  3.213  0.643  1.607  1.928  1.928  0.643  1.607  2.249  3.213  1.607  1.928  3.213  2.249  0.321  1.607    0.0
Arg  1.285  0.964  1.928  3.213  2.249  1.285  0.964  2.571  2.892  5.463  1.285  1.285  2.249  0.643  1.928  3.535  0.964  3.213  0.964  2.892    0.0
Ser  5.463  0.964  4.177  1.928  2.249   4.82  1.285  7.069  5.463  4.499  0.643  6.748  2.249  2.571  3.535   4.82  5.784  4.177  1.928  2.892    0.0
Thr  8.355    0.0  2.249  3.213  4.177  3.856  0.643  2.892  5.141  7.069  1.928  3.535  1.928  2.249  4.177  7.712  7.069  3.213  0.964  2.571    0.0
Val  5.463  0.321  2.249  4.177  2.892  2.892  1.607  5.141  5.141  4.499  1.285  3.535  1.928  2.249  1.607  3.856  6.427  3.535  0.321  2.892    0.0
Trp  0.643    0.0  0.964  0.643  1.285  0.643    0.0  0.321  0.643  0.643  0.964  0.321  1.285  0.643  0.964  0.643  0.321  0.643    0.0  0.964    0.0
Tyr  3.535  0.643  1.607  2.892  2.249  1.285  0.964  3.213  2.892  1.607  0.643  1.928  1.607  1.285  2.571  1.928  1.928  5.141  0.321  1.928    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (3113 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
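One way to picture protein-level bootstrapping is the sketch below. It is a guess at the general technique, not the Proteome-pI implementation: it assumes that whole proteins are resampled with replacement, that the dipeptide frequency is recomputed for each resample, and that the spread is reported as a standard deviation. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(protein):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(protein[i:i + 2] for i in range(len(protein) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread (population std dev, in per mille) of a dipeptide frequency
    when whole proteins are resampled with replacement.
    Assumes every protein has at least two residues."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        hits = sum(dipeptide_counts(p)[dipeptide] for p in sample)
        total = sum(len(p) - 1 for p in sample)
        estimates.append(1000.0 * hits / total)
    return statistics.pstdev(estimates)


# With a single protein every resample is identical, so the spread is zero.
single = bootstrap_error(["MKTLLTAKTLLA"], "KT")

# With several different proteins the resamples differ and the spread is positive.
several = bootstrap_error(["AAAA", "CCCC"], "AA")
```

If the error is computed along these lines, a proteome consisting of one protein would always yield an error of 0.0, which would be consistent with the ± 0.0 reported for every dipeptide here.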

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski