Amino acid dipeptide frequency for Changjiang crawfish virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with all amino acids equally frequent, each dipeptide would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that are more frequent than expected are marked in red and underrepresented ones are marked in blue in the table.
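The calculation described above can be sketched in Python. This is a minimal illustration, not the database's own code: the function names `dipeptide_per_mille` and `classify` are hypothetical, and it assumes that dipeptides are counted as overlapping pairs within each protein, that any residue outside the 20 standard one-letter codes is mapped to Xaa (X), and that "expected" means the uniform 1/400 baseline.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard one-letter codes
EXPECTED = 1000.0 / 400               # 2.5 per mille under the uniform assumption


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs (hypothetical helper)."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = AMINO_ACIDS + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


def classify(freq):
    """Label a per-mille value relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"
    if freq < EXPECTED:
        return "under"
    return "expected"
```

For example, `dipeptide_per_mille(["MKKLA"])` sees four overlapping pairs (MK, KK, KL, LA), so each of them gets 250‰ and every other pair gets 0.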

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
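As a small worked example of the unit conversion, the sketch below turns a per-mille value into a fraction, a percentage, and an approximate raw count. The helper names are hypothetical. The total of 3020 pairs is an assumption: it takes the single 3021-residue protein reported in the statistics line of this page and counts overlapping pairs, which is consistent with the smallest non-zero table value (0.331‰ is about 1/3020).

```python
# Assumption: one protein of 3021 residues yields 3021 - 1 overlapping pairs.
TOTAL_PAIRS = 3021 - 1


def per_mille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0


def approx_count(value, total_pairs=TOTAL_PAIRS):
    """Estimate how many times a dipeptide occurs, given its per-mille value."""
    return round(value * 1e-3 * total_pairs)
```

Under these assumptions, `approx_count(3.311)` gives 10 occurrences of AlaAla and `approx_count(0.331)` gives 1.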

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second. The bootstrap error is ± 0.0 for every cell.

1st\2nd   Ala   Cys   Asp   Glu   Phe   Gly   His   Ile   Lys   Leu   Met   Asn   Pro   Gln   Arg   Ser   Thr   Val   Trp   Tyr   Xaa
Ala     3.311 0.331 2.318  2.98  2.98 4.305 1.325 3.642 3.974 3.974 0.993 2.649 4.305 2.318 0.662 3.974 2.318 3.311 0.662 1.987   0.0
Cys     0.993   0.0 0.993 0.993 0.662 0.662   0.0 0.662   0.0 0.993 0.331 1.656 1.325 0.662 0.993 2.649 0.331 1.325   0.0 0.662   0.0
Asp     0.993 0.331 4.305 3.642 5.298 1.656   0.0 0.662  2.98 4.636 1.325 3.974 2.318 0.993 2.318 1.987 1.987 6.291 0.993  2.98   0.0
Glu     1.656 0.993 4.305 6.623 4.305 3.311 0.993 4.967 4.967 3.642 3.974 3.642 1.987 1.325 1.987 9.603 6.291 6.291 0.662  2.98   0.0
Phe     3.311 1.325 1.656 7.947 2.318 3.642 1.656 3.642 5.298 6.623 2.318 0.662 1.325 0.993 2.318 6.623 2.649 4.967 0.993 2.649   0.0
Gly     1.325 1.325  2.98  2.98  2.98 0.662 0.331 2.318 4.636 3.642 1.656 3.311 1.656 1.656 1.656 4.636 3.311 4.636 0.331 2.649   0.0
His     1.325   0.0 2.318 1.325 1.325 0.331 0.662 1.656 1.987 1.325 1.987 0.331 0.662 0.331 0.993 1.656 0.331 2.318 0.331 1.656   0.0
Ile      2.98 1.987 1.987 4.967 3.974  2.98 1.987 1.987  2.98 3.974 1.325 1.987 4.305 1.656 4.636  2.98 1.656 3.311 0.662  2.98   0.0
Lys     4.305 0.662  2.98 4.636 4.636 4.636 1.656  5.96  8.94 4.305 1.987 5.298 3.642 1.656 4.967 4.305 2.318 5.298 0.662 4.636   0.0
Leu     5.298 1.656 3.642 6.291 6.623  2.98 1.656  2.98 7.285 6.291 2.649 3.974 5.298 1.325 3.974 6.291 4.305 3.311 0.993 3.642   0.0
Met     1.987 0.662 1.325 0.993 1.656 0.662 0.662 0.662 2.318 1.325 0.662  2.98 1.987 0.993 1.987  2.98 1.656 0.662 0.993 1.325   0.0
Asn     3.311 0.331 1.325 4.636 4.636 3.642 1.325 3.974 5.629 6.623 0.662 6.623 3.642 1.656 1.325 3.311 2.318 1.656   0.0 2.318   0.0
Pro     2.318 0.662 1.656 2.649 2.649 1.325 2.318 3.311 2.318 4.636 0.993 1.987 0.993 1.656 1.325 3.642 1.325 3.642 0.331 1.987   0.0
Gln     2.318   0.0 2.318 1.987 1.656 1.656 0.331  2.98 0.993 1.325 0.993 1.325 0.662 1.656 1.656 1.656 1.987 0.993 0.331 1.325   0.0
Arg     1.325 0.331 0.993 3.974 1.987 0.993 1.656 2.649 4.967 6.291 0.331 3.642 0.331 2.318 1.987 2.318 1.987 3.642   0.0 1.987   0.0
Ser     4.636 1.656 3.642 3.642 4.636 5.629 1.987 4.967 6.954  5.96 1.325 4.305 1.325 1.987 2.318 6.623 6.291 6.623   0.0 3.642   0.0
Thr     4.305 0.993 2.318 2.318  2.98 2.318 0.993 3.642 2.318 3.974 1.325 1.987 1.987 1.987 1.656 3.311 2.649 5.298   0.0 2.649   0.0
Val     4.967 1.325 4.636 7.285 4.636 4.305 0.993  2.98 4.305 5.629 1.987 3.974 2.649 1.656 4.967 6.954 2.649 3.311 1.325 2.649   0.0
Trp     0.331   0.0 0.331 0.662 0.993 0.331 0.662 0.331 0.662 0.993 0.331 0.331   0.0 0.331 0.331 0.662 0.331 1.656 0.331 0.662   0.0
Tyr     1.656 1.325 3.642 3.974 1.987 2.649 1.656 1.325 3.974 4.305 0.993 4.305 1.325 0.993 1.656 1.987 2.318 4.636 0.662  2.98   0.0
Xaa       0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0

Statistics based on 1 protein (3021 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
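A rough sketch of what a protein-level bootstrap could look like is given below. The details are assumptions rather than the database's documented procedure: whole proteins are resampled with replacement, the frequency is recomputed on each of 100 resamples, and the spread is reported as a population standard deviation. The function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_sd(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency across protein-level resamples (assumed estimator)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.pstdev(estimates)
```

With a proteome of a single protein, every resample is that same protein, so a scheme like this returns 0.0 for every dipeptide, which would explain the ± 0.0 errors reported here.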

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski