Amino acid dipepetide frequency for Wuhan house centipede virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.218AlaAla: 6.218 ± 0.0
0.811AlaCys: 0.811 ± 0.0
3.514AlaAsp: 3.514 ± 0.0
3.785AlaGlu: 3.785 ± 0.0
2.433AlaPhe: 2.433 ± 0.0
2.703AlaGly: 2.703 ± 0.0
1.352AlaHis: 1.352 ± 0.0
5.137AlaIle: 5.137 ± 0.0
7.029AlaLys: 7.029 ± 0.0
5.948AlaLeu: 5.948 ± 0.0
1.622AlaMet: 1.622 ± 0.0
5.137AlaAsn: 5.137 ± 0.0
2.974AlaPro: 2.974 ± 0.0
1.892AlaGln: 1.892 ± 0.0
4.055AlaArg: 4.055 ± 0.0
5.677AlaSer: 5.677 ± 0.0
5.137AlaThr: 5.137 ± 0.0
2.703AlaVal: 2.703 ± 0.0
1.081AlaTrp: 1.081 ± 0.0
2.433AlaTyr: 2.433 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.811CysAla: 0.811 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.541CysAsp: 0.541 ± 0.0
1.622CysGlu: 1.622 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.352CysGly: 1.352 ± 0.0
0.27CysHis: 0.27 ± 0.0
1.352CysIle: 1.352 ± 0.0
0.27CysLys: 0.27 ± 0.0
1.352CysLeu: 1.352 ± 0.0
0.541CysMet: 0.541 ± 0.0
1.622CysAsn: 1.622 ± 0.0
0.811CysPro: 0.811 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.541CysArg: 0.541 ± 0.0
0.811CysSer: 0.811 ± 0.0
0.811CysThr: 0.811 ± 0.0
0.541CysVal: 0.541 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.811CysTyr: 0.811 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.892AspAla: 1.892 ± 0.0
1.352AspCys: 1.352 ± 0.0
2.703AspAsp: 2.703 ± 0.0
3.785AspGlu: 3.785 ± 0.0
2.433AspPhe: 2.433 ± 0.0
3.514AspGly: 3.514 ± 0.0
0.541AspHis: 0.541 ± 0.0
3.244AspIle: 3.244 ± 0.0
1.352AspLys: 1.352 ± 0.0
2.974AspLeu: 2.974 ± 0.0
0.811AspMet: 0.811 ± 0.0
2.163AspAsn: 2.163 ± 0.0
1.081AspPro: 1.081 ± 0.0
1.892AspGln: 1.892 ± 0.0
2.703AspArg: 2.703 ± 0.0
3.244AspSer: 3.244 ± 0.0
4.055AspThr: 4.055 ± 0.0
2.163AspVal: 2.163 ± 0.0
0.811AspTrp: 0.811 ± 0.0
1.892AspTyr: 1.892 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.785GluAla: 3.785 ± 0.0
0.811GluCys: 0.811 ± 0.0
5.137GluAsp: 5.137 ± 0.0
7.57GluGlu: 7.57 ± 0.0
2.433GluPhe: 2.433 ± 0.0
2.974GluGly: 2.974 ± 0.0
2.163GluHis: 2.163 ± 0.0
6.218GluIle: 6.218 ± 0.0
4.866GluLys: 4.866 ± 0.0
9.192GluLeu: 9.192 ± 0.0
2.703GluMet: 2.703 ± 0.0
2.703GluAsn: 2.703 ± 0.0
2.163GluPro: 2.163 ± 0.0
2.703GluGln: 2.703 ± 0.0
3.785GluArg: 3.785 ± 0.0
2.974GluSer: 2.974 ± 0.0
2.974GluThr: 2.974 ± 0.0
5.137GluVal: 5.137 ± 0.0
0.811GluTrp: 0.811 ± 0.0
3.514GluTyr: 3.514 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.433PheAla: 2.433 ± 0.0
0.541PheCys: 0.541 ± 0.0
1.892PheAsp: 1.892 ± 0.0
2.163PheGlu: 2.163 ± 0.0
0.541PhePhe: 0.541 ± 0.0
2.703PheGly: 2.703 ± 0.0
0.0PheHis: 0.0 ± 0.0
3.785PheIle: 3.785 ± 0.0
2.703PheLys: 2.703 ± 0.0
1.892PheLeu: 1.892 ± 0.0
1.892PheMet: 1.892 ± 0.0
3.514PheAsn: 3.514 ± 0.0
1.622PhePro: 1.622 ± 0.0
0.811PheGln: 0.811 ± 0.0
1.622PheArg: 1.622 ± 0.0
2.703PheSer: 2.703 ± 0.0
1.352PheThr: 1.352 ± 0.0
3.514PheVal: 3.514 ± 0.0
0.27PheTrp: 0.27 ± 0.0
0.811PheTyr: 0.811 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.514GlyAla: 3.514 ± 0.0
0.27GlyCys: 0.27 ± 0.0
3.244GlyAsp: 3.244 ± 0.0
4.866GlyGlu: 4.866 ± 0.0
2.974GlyPhe: 2.974 ± 0.0
3.514GlyGly: 3.514 ± 0.0
0.541GlyHis: 0.541 ± 0.0
4.596GlyIle: 4.596 ± 0.0
4.055GlyLys: 4.055 ± 0.0
3.514GlyLeu: 3.514 ± 0.0
1.892GlyMet: 1.892 ± 0.0
3.785GlyAsn: 3.785 ± 0.0
1.892GlyPro: 1.892 ± 0.0
3.244GlyGln: 3.244 ± 0.0
2.703GlyArg: 2.703 ± 0.0
4.055GlySer: 4.055 ± 0.0
2.974GlyThr: 2.974 ± 0.0
3.514GlyVal: 3.514 ± 0.0
0.541GlyTrp: 0.541 ± 0.0
1.892GlyTyr: 1.892 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.811HisAla: 0.811 ± 0.0
0.541HisCys: 0.541 ± 0.0
0.811HisAsp: 0.811 ± 0.0
1.352HisGlu: 1.352 ± 0.0
1.892HisPhe: 1.892 ± 0.0
1.352HisGly: 1.352 ± 0.0
0.27HisHis: 0.27 ± 0.0
2.163HisIle: 2.163 ± 0.0
2.433HisLys: 2.433 ± 0.0
1.081HisLeu: 1.081 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.811HisAsn: 0.811 ± 0.0
0.541HisPro: 0.541 ± 0.0
0.811HisGln: 0.811 ± 0.0
1.081HisArg: 1.081 ± 0.0
0.541HisSer: 0.541 ± 0.0
1.352HisThr: 1.352 ± 0.0
0.811HisVal: 0.811 ± 0.0
0.811HisTrp: 0.811 ± 0.0
1.622HisTyr: 1.622 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.785IleAla: 3.785 ± 0.0
0.541IleCys: 0.541 ± 0.0
1.352IleAsp: 1.352 ± 0.0
4.866IleGlu: 4.866 ± 0.0
3.244IlePhe: 3.244 ± 0.0
5.948IleGly: 5.948 ± 0.0
0.811IleHis: 0.811 ± 0.0
5.407IleIle: 5.407 ± 0.0
6.218IleLys: 6.218 ± 0.0
6.218IleLeu: 6.218 ± 0.0
1.352IleMet: 1.352 ± 0.0
3.244IleAsn: 3.244 ± 0.0
4.325IlePro: 4.325 ± 0.0
2.703IleGln: 2.703 ± 0.0
3.785IleArg: 3.785 ± 0.0
4.055IleSer: 4.055 ± 0.0
5.407IleThr: 5.407 ± 0.0
5.677IleVal: 5.677 ± 0.0
1.081IleTrp: 1.081 ± 0.0
2.974IleTyr: 2.974 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.325LysAla: 4.325 ± 0.0
0.541LysCys: 0.541 ± 0.0
2.974LysAsp: 2.974 ± 0.0
6.488LysGlu: 6.488 ± 0.0
2.703LysPhe: 2.703 ± 0.0
2.433LysGly: 2.433 ± 0.0
2.163LysHis: 2.163 ± 0.0
5.677LysIle: 5.677 ± 0.0
4.596LysLys: 4.596 ± 0.0
3.785LysLeu: 3.785 ± 0.0
1.892LysMet: 1.892 ± 0.0
4.055LysAsn: 4.055 ± 0.0
3.785LysPro: 3.785 ± 0.0
2.703LysGln: 2.703 ± 0.0
2.974LysArg: 2.974 ± 0.0
4.325LysSer: 4.325 ± 0.0
4.055LysThr: 4.055 ± 0.0
4.325LysVal: 4.325 ± 0.0
0.541LysTrp: 0.541 ± 0.0
3.244LysTyr: 3.244 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
5.677LeuAla: 5.677 ± 0.0
1.081LeuCys: 1.081 ± 0.0
5.407LeuAsp: 5.407 ± 0.0
5.137LeuGlu: 5.137 ± 0.0
2.433LeuPhe: 2.433 ± 0.0
3.244LeuGly: 3.244 ± 0.0
1.892LeuHis: 1.892 ± 0.0
4.325LeuIle: 4.325 ± 0.0
4.866LeuLys: 4.866 ± 0.0
6.488LeuLeu: 6.488 ± 0.0
1.081LeuMet: 1.081 ± 0.0
4.055LeuAsn: 4.055 ± 0.0
5.407LeuPro: 5.407 ± 0.0
4.325LeuGln: 4.325 ± 0.0
3.244LeuArg: 3.244 ± 0.0
5.137LeuSer: 5.137 ± 0.0
7.029LeuThr: 7.029 ± 0.0
3.785LeuVal: 3.785 ± 0.0
0.0LeuTrp: 0.0 ± 0.0
2.703LeuTyr: 2.703 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.433MetAla: 2.433 ± 0.0
0.27MetCys: 0.27 ± 0.0
0.811MetAsp: 0.811 ± 0.0
1.622MetGlu: 1.622 ± 0.0
0.27MetPhe: 0.27 ± 0.0
0.811MetGly: 0.811 ± 0.0
0.541MetHis: 0.541 ± 0.0
1.892MetIle: 1.892 ± 0.0
1.622MetLys: 1.622 ± 0.0
2.163MetLeu: 2.163 ± 0.0
0.541MetMet: 0.541 ± 0.0
1.622MetAsn: 1.622 ± 0.0
1.352MetPro: 1.352 ± 0.0
0.811MetGln: 0.811 ± 0.0
1.352MetArg: 1.352 ± 0.0
1.622MetSer: 1.622 ± 0.0
2.163MetThr: 2.163 ± 0.0
1.352MetVal: 1.352 ± 0.0
0.541MetTrp: 0.541 ± 0.0
1.892MetTyr: 1.892 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
6.759AsnAla: 6.759 ± 0.0
0.27AsnCys: 0.27 ± 0.0
2.433AsnAsp: 2.433 ± 0.0
3.785AsnGlu: 3.785 ± 0.0
2.163AsnPhe: 2.163 ± 0.0
2.163AsnGly: 2.163 ± 0.0
1.081AsnHis: 1.081 ± 0.0
5.677AsnIle: 5.677 ± 0.0
2.433AsnLys: 2.433 ± 0.0
4.596AsnLeu: 4.596 ± 0.0
2.703AsnMet: 2.703 ± 0.0
1.622AsnAsn: 1.622 ± 0.0
2.703AsnPro: 2.703 ± 0.0
1.622AsnGln: 1.622 ± 0.0
2.703AsnArg: 2.703 ± 0.0
4.866AsnSer: 4.866 ± 0.0
3.244AsnThr: 3.244 ± 0.0
2.703AsnVal: 2.703 ± 0.0
0.27AsnTrp: 0.27 ± 0.0
2.163AsnTyr: 2.163 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.703ProAla: 2.703 ± 0.0
0.541ProCys: 0.541 ± 0.0
1.352ProAsp: 1.352 ± 0.0
3.514ProGlu: 3.514 ± 0.0
0.541ProPhe: 0.541 ± 0.0
3.514ProGly: 3.514 ± 0.0
0.811ProHis: 0.811 ± 0.0
3.244ProIle: 3.244 ± 0.0
3.244ProLys: 3.244 ± 0.0
3.785ProLeu: 3.785 ± 0.0
1.352ProMet: 1.352 ± 0.0
2.703ProAsn: 2.703 ± 0.0
1.081ProPro: 1.081 ± 0.0
2.703ProGln: 2.703 ± 0.0
0.811ProArg: 0.811 ± 0.0
2.163ProSer: 2.163 ± 0.0
4.325ProThr: 4.325 ± 0.0
2.433ProVal: 2.433 ± 0.0
0.27ProTrp: 0.27 ± 0.0
2.163ProTyr: 2.163 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.703GlnAla: 2.703 ± 0.0
0.541GlnCys: 0.541 ± 0.0
0.811GlnAsp: 0.811 ± 0.0
4.325GlnGlu: 4.325 ± 0.0
1.622GlnPhe: 1.622 ± 0.0
2.703GlnGly: 2.703 ± 0.0
1.081GlnHis: 1.081 ± 0.0
2.163GlnIle: 2.163 ± 0.0
3.244GlnLys: 3.244 ± 0.0
3.514GlnLeu: 3.514 ± 0.0
1.892GlnMet: 1.892 ± 0.0
1.892GlnAsn: 1.892 ± 0.0
1.081GlnPro: 1.081 ± 0.0
3.785GlnGln: 3.785 ± 0.0
2.163GlnArg: 2.163 ± 0.0
2.163GlnSer: 2.163 ± 0.0
2.163GlnThr: 2.163 ± 0.0
1.622GlnVal: 1.622 ± 0.0
0.27GlnTrp: 0.27 ± 0.0
1.892GlnTyr: 1.892 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.325ArgAla: 4.325 ± 0.0
1.352ArgCys: 1.352 ± 0.0
2.163ArgAsp: 2.163 ± 0.0
4.055ArgGlu: 4.055 ± 0.0
2.433ArgPhe: 2.433 ± 0.0
4.596ArgGly: 4.596 ± 0.0
0.811ArgHis: 0.811 ± 0.0
1.622ArgIle: 1.622 ± 0.0
3.244ArgLys: 3.244 ± 0.0
1.892ArgLeu: 1.892 ± 0.0
1.081ArgMet: 1.081 ± 0.0
2.974ArgAsn: 2.974 ± 0.0
2.433ArgPro: 2.433 ± 0.0
1.352ArgGln: 1.352 ± 0.0
2.163ArgArg: 2.163 ± 0.0
2.974ArgSer: 2.974 ± 0.0
3.785ArgThr: 3.785 ± 0.0
2.163ArgVal: 2.163 ± 0.0
1.081ArgTrp: 1.081 ± 0.0
1.081ArgTyr: 1.081 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.866SerAla: 4.866 ± 0.0
0.811SerCys: 0.811 ± 0.0
2.433SerAsp: 2.433 ± 0.0
4.055SerGlu: 4.055 ± 0.0
2.974SerPhe: 2.974 ± 0.0
4.325SerGly: 4.325 ± 0.0
0.811SerHis: 0.811 ± 0.0
4.866SerIle: 4.866 ± 0.0
4.055SerLys: 4.055 ± 0.0
4.596SerLeu: 4.596 ± 0.0
0.811SerMet: 0.811 ± 0.0
3.785SerAsn: 3.785 ± 0.0
1.081SerPro: 1.081 ± 0.0
3.244SerGln: 3.244 ± 0.0
3.785SerArg: 3.785 ± 0.0
6.488SerSer: 6.488 ± 0.0
5.948SerThr: 5.948 ± 0.0
2.703SerVal: 2.703 ± 0.0
0.541SerTrp: 0.541 ± 0.0
3.514SerTyr: 3.514 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.866ThrAla: 4.866 ± 0.0
1.081ThrCys: 1.081 ± 0.0
2.163ThrAsp: 2.163 ± 0.0
2.974ThrGlu: 2.974 ± 0.0
1.892ThrPhe: 1.892 ± 0.0
3.514ThrGly: 3.514 ± 0.0
1.622ThrHis: 1.622 ± 0.0
4.596ThrIle: 4.596 ± 0.0
4.325ThrLys: 4.325 ± 0.0
6.218ThrLeu: 6.218 ± 0.0
1.622ThrMet: 1.622 ± 0.0
6.218ThrAsn: 6.218 ± 0.0
3.244ThrPro: 3.244 ± 0.0
3.244ThrGln: 3.244 ± 0.0
4.325ThrArg: 4.325 ± 0.0
4.325ThrSer: 4.325 ± 0.0
6.488ThrThr: 6.488 ± 0.0
2.703ThrVal: 2.703 ± 0.0
1.622ThrTrp: 1.622 ± 0.0
3.514ThrTyr: 3.514 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
3.514ValAla: 3.514 ± 0.0
0.811ValCys: 0.811 ± 0.0
2.433ValAsp: 2.433 ± 0.0
4.055ValGlu: 4.055 ± 0.0
1.892ValPhe: 1.892 ± 0.0
4.325ValGly: 4.325 ± 0.0
1.622ValHis: 1.622 ± 0.0
3.244ValIle: 3.244 ± 0.0
4.055ValLys: 4.055 ± 0.0
4.325ValLeu: 4.325 ± 0.0
1.081ValMet: 1.081 ± 0.0
1.622ValAsn: 1.622 ± 0.0
3.785ValPro: 3.785 ± 0.0
2.433ValGln: 2.433 ± 0.0
1.622ValArg: 1.622 ± 0.0
4.325ValSer: 4.325 ± 0.0
3.785ValThr: 3.785 ± 0.0
3.244ValVal: 3.244 ± 0.0
0.27ValTrp: 0.27 ± 0.0
2.703ValTyr: 2.703 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.541TrpAla: 0.541 ± 0.0
0.541TrpCys: 0.541 ± 0.0
1.081TrpAsp: 1.081 ± 0.0
2.163TrpGlu: 2.163 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.27TrpGly: 0.27 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.541TrpIle: 0.541 ± 0.0
0.541TrpLys: 0.541 ± 0.0
0.541TrpLeu: 0.541 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.27TrpPro: 0.27 ± 0.0
0.541TrpGln: 0.541 ± 0.0
0.811TrpArg: 0.811 ± 0.0
0.811TrpSer: 0.811 ± 0.0
0.811TrpThr: 0.811 ± 0.0
1.081TrpVal: 1.081 ± 0.0
0.27TrpTrp: 0.27 ± 0.0
0.541TrpTyr: 0.541 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
5.677TyrAla: 5.677 ± 0.0
1.352TyrCys: 1.352 ± 0.0
1.352TyrAsp: 1.352 ± 0.0
2.433TyrGlu: 2.433 ± 0.0
1.892TyrPhe: 1.892 ± 0.0
1.622TyrGly: 1.622 ± 0.0
2.703TyrHis: 2.703 ± 0.0
3.244TyrIle: 3.244 ± 0.0
2.433TyrLys: 2.433 ± 0.0
3.244TyrLeu: 3.244 ± 0.0
0.541TyrMet: 0.541 ± 0.0
2.703TyrAsn: 2.703 ± 0.0
1.622TyrPro: 1.622 ± 0.0
0.811TyrGln: 0.811 ± 0.0
1.352TyrArg: 1.352 ± 0.0
2.433TyrSer: 2.433 ± 0.0
2.703TyrThr: 2.703 ± 0.0
2.974TyrVal: 2.974 ± 0.0
0.27TyrTrp: 0.27 ± 0.0
3.514TyrTyr: 3.514 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3700 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski