Amino acid dipeptide frequency for Hubei endorna-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
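The calculation described above can be sketched in Python. This is a minimal illustration rather than the database's actual pipeline: the function names `dipeptide_per_mille` and `classify` are hypothetical, counting over all overlapping residue pairs is an assumption, and so is the use of the uniform expectation (1/400 = 2.5‰) as the threshold for over- and underrepresentation. The overlapping-pair assumption is at least consistent with the table: a 4518-residue protein has 4517 overlapping pairs, and a single occurrence would give 1000/4517 ≈ 0.221‰, which matches the smallest nonzero value listed.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the same order as the table header.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}


def dipeptide_per_mille(sequence):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: every overlapping pair of adjacent residues counts once,
    and any symbol outside the 20 standard amino acids is mapped to Xaa.
    """
    residues = [ONE_TO_THREE.get(aa, "Xaa") for aa in sequence.upper()]
    pairs = Counter(a + b for a, b in zip(residues, residues[1:]))
    total = sum(pairs.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * pairs[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy sequence (not the viral protein): three pairs, each one third of the total.
freqs = dipeptide_per_mille("AAKL")
print(round(freqs["AlaAla"], 3), classify(freqs["AlaAla"]))
```

Multiplying any returned value by 10⁻³ gives the corresponding fraction of all dipeptides.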

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. The estimated error is ± 0.0 for every entry.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  0.886  0.886  2.878  2.878  0.664  1.771  0.886  3.099  2.435  3.542  1.107  2.657  0.886  0.664  1.107  0.886  2.878  1.992  0.221  0.664    0.0
Cys  0.886  0.443  0.443   1.55  0.664  0.664  1.328  2.214  2.878  1.328  0.443  0.886  0.443  0.443  0.221  1.328  1.107  1.328  0.221  1.107    0.0
Asp   1.55  0.886  4.649  3.542  1.107  3.321  0.886  5.313  5.313  7.527  1.107  3.764  1.328  1.771  3.985  3.099  2.878  2.878  1.107  1.771    0.0
Glu   1.55  0.443  1.992  5.313  3.321  3.764  0.886   6.42  4.649   9.52  1.328  4.206  2.214  2.435  1.992  4.206  3.542  4.428  1.107  1.992    0.0
Phe  0.886  0.886  1.107  3.099  0.664  2.435  0.664  2.214  3.985  1.992   1.55  3.985  1.107  0.443  2.214  2.214  0.443  2.435    0.0  0.664    0.0
Gly  1.992  0.664  3.764  3.321   1.55  2.214  1.771  4.428  5.092  4.428  1.107  2.435  1.328  1.771   1.55  1.992  2.657  3.542  1.107  2.435    0.0
His  0.443  0.886   1.55  1.328  0.886  1.771  0.664  1.328  3.542   1.55  1.107  2.214  0.664   1.55  0.443  1.771  1.328   1.55  0.443  1.107    0.0
Ile  3.764  1.771  4.428  3.985  1.328  5.535  2.214  8.413   7.97  5.977  1.771  8.191  3.321  1.771  4.206  6.642  9.298  5.313  1.107  1.992    0.0
Lys  3.764  1.107  3.542  5.092  5.092  2.214  3.099  5.092  3.099 11.512   1.55  2.435  4.206  3.985  1.328  6.642  5.535  5.756  1.771  4.649    0.0
Leu  2.878  1.992  5.977  5.977  3.542  3.542  1.771 10.184  7.749  7.527  3.542  7.084  5.977  3.321  4.428   6.42  8.855  3.321  1.107  4.428    0.0
Met  0.886   1.55   1.55  1.771  0.443   1.55  0.664  3.099   1.55  3.764  0.886  2.214  1.107  1.328  1.992  2.214  1.992  1.107  0.443  0.886    0.0
Asn   1.55  1.328  3.321  3.985  2.214  3.099  1.107   7.97  6.199  8.413  3.321  6.199  1.107  2.878  2.657  4.649  2.214  5.092  1.328  2.435    0.0
Pro  1.107    0.0  2.657  2.657  1.107  3.099  0.443  4.649  3.542  2.878  1.328  2.878  0.664  0.221  0.664   1.55  1.107  2.214  1.107  1.107    0.0
Gln  1.107  0.221  1.771  2.214  1.107  1.107  0.221  2.878  1.328  4.206  0.221  0.664  1.107  1.107   1.55  2.214  2.657  2.435  1.107  2.214    0.0
Arg  1.107  0.886  1.107  2.657  1.992   1.55  1.328  2.214  2.878  5.092  1.107  2.435  0.886  1.771  0.664  3.321  2.435  1.771    0.0   1.55    0.0
Ser  2.878  1.771  3.764   4.87  1.771  1.328  1.107  3.764  5.977  7.306  3.542  3.764  1.107  2.878  1.771  5.092  4.428  3.542  0.886  2.657    0.0
Thr  2.657  0.664  4.649  4.649  1.771  4.428  2.435  5.977   4.87  4.649  2.657  5.313  3.099  0.664  2.435  2.878  4.428  3.985  0.443  2.878    0.0
Val  1.107  2.657  4.428  3.764  1.328  2.435  2.435  5.092   4.87  3.764   1.55  5.313  3.099  1.107  1.992  2.435  5.756  3.764  0.443  0.886    0.0
Trp  0.443  0.443  0.664  0.221  1.328  0.443  0.886  0.886  0.443  1.992  1.107  0.221  0.443  0.886  0.886  1.328    0.0  1.328  0.443  0.443    0.0
Tyr   1.55  0.443  3.542  2.435  1.107  2.878  1.328  3.764  3.099  2.657  0.443  4.428  0.886  0.443  0.664  3.321  1.992  0.443  0.443  1.771    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0
Statistics based on 1 protein (4518 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
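A protein-level bootstrap can be sketched as follows. The details are assumptions, since the page does not spell out the procedure: the names `pair_frequency` and `bootstrap_error` are hypothetical, proteins are resampled with replacement, and the error is taken as the standard deviation of the resampled frequencies. Under these assumptions, a proteome with a single protein yields the same resample every time, which would explain why every error in the table is ± 0.0.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of a two-letter pair over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the pair frequency across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# With one protein every resample is identical, so the spread is zero.
print(bootstrap_error(["MKLLKL"], "KL"))
```

The toy sequences here use one-letter codes and are not taken from the viral proteome.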

The dipeptide statistics above (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski