Amino acid dipeptide frequency for Hubei odonate virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red and all under-represented ones in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
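As a rough illustration of the calculation described above, the following Python sketch counts overlapping residue pairs and reports them in per mille. It is not the Proteome-pI implementation: the function names (`dipeptide_per_mille`, `classify`), the use of one-letter codes, the pooling of unknown residues into X (Xaa), and the toy sequences are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X before pairing.
        cleaned = "".join(r if r in STANDARD else "X" for r in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 pairs, including those that never occur.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a value relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


toy = ["MKKLAAX", "AAKK"]  # made-up sequences, for illustration only
freqs = dipeptide_per_mille(toy)
print(len(freqs), round(sum(freqs.values()), 6))
```

With the uniform baseline of 2.5‰, a table value such as 2.61 would be labelled over-represented and 0.373 under-represented. Whether the original page uses exactly this threshold for its red and blue marking is not stated.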

For more information, see the sequence space article on Wikipedia.

Rows: first amino acid of the dipeptide; columns: second amino acid. Values in ‰; the reported error is ± 0.0 for every entry.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala   2.61  0.746  3.356  3.729  2.237  4.474  1.491  4.847   2.61  3.729  1.119  1.864  1.864  4.101  2.237  4.847  2.237  4.101  1.119  1.864    0.0
Cys  0.373  0.373  1.119  1.119  1.491  1.491    0.0  1.119  1.864  2.237  0.373  0.746  0.373  0.373  1.491  3.356  1.491  1.119  0.373    0.0    0.0
Asp   2.61  1.491   2.61  4.101  4.101   2.61  0.746  1.119  3.729  4.474  1.491  2.237  0.373  1.491   2.61  4.847   2.61  6.339  1.119  1.864    0.0
Glu  4.474  1.864  2.983  8.949  4.101  1.864  1.119  5.966  4.101  4.847   2.61  2.237  2.983  0.746  1.491  3.729  4.474  4.847  1.864  1.864    0.0
Phe  2.983    0.0  3.356  3.729  1.119   2.61  1.864   2.61  4.474  6.339  0.373  2.983  1.491  1.491  2.237  6.711  1.864  4.101  0.746  1.491    0.0
Gly  2.983  0.746  4.101  5.593  3.356  4.101  1.119  5.966  2.983  5.593  1.864  2.983  3.356  1.864  2.983  3.729  5.593  2.983  0.746  2.983    0.0
His  0.746  1.491  1.491   2.61  0.746  1.864  0.373  0.746  0.746  1.864    0.0  1.119  0.746  0.746  1.864  1.864  0.746  1.119  0.373  0.373    0.0
Ile  1.864  1.491  1.491  4.474  4.474  3.729  1.864  3.356  2.237  4.474  1.119  3.356  4.101  1.864  3.729  7.084  3.356  4.474  0.373   5.22    0.0
Lys  5.966  1.864  2.983  2.237  3.356   2.61  0.746  4.101  1.491  4.847  0.373   2.61  1.119  0.746  2.237  3.729   2.61  3.729    0.0  1.119    0.0
Leu  4.847  2.237  5.593  3.356  2.983  6.339  1.491  2.983  3.356  5.966  1.491  7.084  3.356  1.119  5.966  4.474  5.593  4.101  0.746  2.983    0.0
Met  0.373  0.373  0.746  0.373  0.746  3.729    0.0  1.864  0.746  1.491  0.373  1.491  1.864  0.373    0.0   2.61  1.119  1.491  0.373  1.491    0.0
Asn  2.237  1.864  2.237  3.356  2.983  3.356  1.491  4.474   2.61  2.983  3.356  5.966  4.847  1.491  1.491  3.729   2.61  4.847  1.491  1.119    0.0
Pro  1.119  0.373   2.61  1.491  1.864  3.729  1.119  3.356  1.119  2.983  0.373  2.237  2.983  1.119  2.237  4.847  2.237  3.356  1.119  2.983    0.0
Gln  2.237  0.373  1.491  2.237  1.491  1.864  0.746  1.119  2.237  2.237  0.746  3.729  0.373  0.746  1.119  2.983  1.491  1.491  0.373  1.119    0.0
Arg   2.61  1.119  2.983   2.61  4.101  0.746  2.237   5.22   2.61  6.711  0.373  1.119  1.864  1.491  2.983  3.729  3.729  3.356  1.119  3.356    0.0
Ser  4.474  1.119  4.474  7.084  4.474  8.576  1.491  4.474  4.101  4.101  1.864  3.729   2.61  3.729  6.711  7.457  4.474  6.711  0.373  1.864    0.0
Thr   2.61  1.119  4.101  4.101   2.61  4.101  1.491  4.101  4.101   2.61  1.119  4.101  0.746  1.864  3.356   2.61   5.22  4.101  1.119  3.356    0.0
Val  6.711  1.119  2.983  3.356  4.101   5.22  1.119  4.101  1.119  3.729  1.119  5.966  6.711  1.864  5.593   5.22  2.983  5.593  0.373   2.61    0.0
Trp  0.373  0.373  0.373  0.373    0.0    0.0  0.373  0.746  1.491  0.746  0.373  1.119  0.373    0.0  1.119   2.61  1.864  1.864    0.0  0.373    0.0
Tyr  2.983  0.746  1.491  2.237  2.237   2.61  0.746  1.864  0.746  4.101  0.746  1.864  1.491  2.983   2.61  3.729  2.237  1.864  0.746  1.491    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (2683 amino acids)
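The per mille values can be turned back into approximate raw counts. The sketch below assumes that the table counts overlapping pairs within the single protein, so that 2683 amino acids give 2682 dipeptides; the page does not state this, and the helper name `per_mille_to_count` is hypothetical. The assumption is at least consistent with the smallest non-zero entry, 0.373‰, which is about 1/2682.

```python
def per_mille_to_count(value_per_mille, n_residues):
    """Estimate a raw dipeptide count from a per mille frequency."""
    # Assumption: a single chain of n_residues yields n_residues - 1 overlapping pairs.
    n_dipeptides = n_residues - 1
    return round(value_per_mille / 1000.0 * n_dipeptides)


N_RESIDUES = 2683  # from the statistics line above
for name, value in [("GluGlu", 8.949), ("AlaAla", 2.61), ("CysAla", 0.373)]:
    print(name, per_mille_to_count(value, N_RESIDUES))
# prints: GluGlu 24, AlaAla 7, CysAla 1 (one pair per line)
```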

Note: The error was estimated by bootstrapping (×100) at the protein level.
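A minimal sketch of what protein-level bootstrapping could look like is given below. It assumes that whole proteins are resampled with replacement and that the error is the standard deviation of the resampled frequencies; neither detail is confirmed by the page, and `bootstrap_error` and its input format are hypothetical. Under these assumptions, a proteome with a single protein always resamples to itself, which would explain why every error here is ± 0.0.

```python
import random
import statistics


def bootstrap_error(per_protein_counts, n_boot=100, seed=0):
    """Spread of a dipeptide frequency (per mille) under protein-level resampling.

    per_protein_counts: list of (target_dipeptide_count, total_dipeptides),
    one tuple per protein.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein_counts, k=len(per_protein_counts))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the population standard deviation of the replicates.
    return statistics.pstdev(estimates)


# One protein: every replicate is identical, so the estimated error is zero.
single = bootstrap_error([(24, 2682)])
# Two made-up proteins: replicates differ, so the error is positive.
double = bootstrap_error([(24, 2682), (1, 500)])
print(single, double > 0)
```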

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski