Amino acid dipepetide frequency for Hubei noda-like virus 17

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.136AlaAla: 7.136 ± 0.0
1.019AlaCys: 1.019 ± 0.0
5.097AlaAsp: 5.097 ± 0.0
6.116AlaGlu: 6.116 ± 0.0
6.116AlaPhe: 6.116 ± 0.0
4.077AlaGly: 4.077 ± 0.0
4.077AlaHis: 4.077 ± 0.0
4.077AlaIle: 4.077 ± 0.0
2.039AlaLys: 2.039 ± 0.0
2.039AlaLeu: 2.039 ± 0.0
2.039AlaMet: 2.039 ± 0.0
4.077AlaAsn: 4.077 ± 0.0
4.077AlaPro: 4.077 ± 0.0
5.097AlaGln: 5.097 ± 0.0
11.213AlaArg: 11.213 ± 0.0
4.077AlaSer: 4.077 ± 0.0
7.136AlaThr: 7.136 ± 0.0
5.097AlaVal: 5.097 ± 0.0
1.019AlaTrp: 1.019 ± 0.0
4.077AlaTyr: 4.077 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.019CysAsp: 1.019 ± 0.0
0.0CysGlu: 0.0 ± 0.0
2.039CysPhe: 2.039 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.019CysIle: 1.019 ± 0.0
1.019CysLys: 1.019 ± 0.0
1.019CysLeu: 1.019 ± 0.0
0.0CysMet: 0.0 ± 0.0
2.039CysAsn: 2.039 ± 0.0
1.019CysPro: 1.019 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
3.058CysTyr: 3.058 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.116AspAla: 6.116 ± 0.0
0.0AspCys: 0.0 ± 0.0
1.019AspAsp: 1.019 ± 0.0
1.019AspGlu: 1.019 ± 0.0
5.097AspPhe: 5.097 ± 0.0
7.136AspGly: 7.136 ± 0.0
0.0AspHis: 0.0 ± 0.0
3.058AspIle: 3.058 ± 0.0
0.0AspLys: 0.0 ± 0.0
3.058AspLeu: 3.058 ± 0.0
2.039AspMet: 2.039 ± 0.0
2.039AspAsn: 2.039 ± 0.0
3.058AspPro: 3.058 ± 0.0
2.039AspGln: 2.039 ± 0.0
1.019AspArg: 1.019 ± 0.0
4.077AspSer: 4.077 ± 0.0
4.077AspThr: 4.077 ± 0.0
4.077AspVal: 4.077 ± 0.0
2.039AspTrp: 2.039 ± 0.0
3.058AspTyr: 3.058 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
5.097GluAla: 5.097 ± 0.0
0.0GluCys: 0.0 ± 0.0
3.058GluAsp: 3.058 ± 0.0
1.019GluGlu: 1.019 ± 0.0
4.077GluPhe: 4.077 ± 0.0
2.039GluGly: 2.039 ± 0.0
4.077GluHis: 4.077 ± 0.0
3.058GluIle: 3.058 ± 0.0
0.0GluLys: 0.0 ± 0.0
6.116GluLeu: 6.116 ± 0.0
1.019GluMet: 1.019 ± 0.0
1.019GluAsn: 1.019 ± 0.0
4.077GluPro: 4.077 ± 0.0
2.039GluGln: 2.039 ± 0.0
2.039GluArg: 2.039 ± 0.0
3.058GluSer: 3.058 ± 0.0
3.058GluThr: 3.058 ± 0.0
0.0GluVal: 0.0 ± 0.0
1.019GluTrp: 1.019 ± 0.0
5.097GluTyr: 5.097 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.039PheAla: 2.039 ± 0.0
0.0PheCys: 0.0 ± 0.0
1.019PheAsp: 1.019 ± 0.0
1.019PheGlu: 1.019 ± 0.0
2.039PhePhe: 2.039 ± 0.0
3.058PheGly: 3.058 ± 0.0
0.0PheHis: 0.0 ± 0.0
3.058PheIle: 3.058 ± 0.0
3.058PheLys: 3.058 ± 0.0
3.058PheLeu: 3.058 ± 0.0
1.019PheMet: 1.019 ± 0.0
1.019PheAsn: 1.019 ± 0.0
1.019PhePro: 1.019 ± 0.0
4.077PheGln: 4.077 ± 0.0
1.019PheArg: 1.019 ± 0.0
5.097PheSer: 5.097 ± 0.0
5.097PheThr: 5.097 ± 0.0
3.058PheVal: 3.058 ± 0.0
0.0PheTrp: 0.0 ± 0.0
1.019PheTyr: 1.019 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.058GlyAla: 3.058 ± 0.0
1.019GlyCys: 1.019 ± 0.0
4.077GlyAsp: 4.077 ± 0.0
5.097GlyGlu: 5.097 ± 0.0
0.0GlyPhe: 0.0 ± 0.0
3.058GlyGly: 3.058 ± 0.0
2.039GlyHis: 2.039 ± 0.0
1.019GlyIle: 1.019 ± 0.0
2.039GlyLys: 2.039 ± 0.0
6.116GlyLeu: 6.116 ± 0.0
2.039GlyMet: 2.039 ± 0.0
5.097GlyAsn: 5.097 ± 0.0
0.0GlyPro: 0.0 ± 0.0
2.039GlyGln: 2.039 ± 0.0
1.019GlyArg: 1.019 ± 0.0
4.077GlySer: 4.077 ± 0.0
4.077GlyThr: 4.077 ± 0.0
2.039GlyVal: 2.039 ± 0.0
2.039GlyTrp: 2.039 ± 0.0
3.058GlyTyr: 3.058 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
5.097HisAla: 5.097 ± 0.0
1.019HisCys: 1.019 ± 0.0
1.019HisAsp: 1.019 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.019HisPhe: 1.019 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.019HisIle: 1.019 ± 0.0
1.019HisLys: 1.019 ± 0.0
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
1.019HisAsn: 1.019 ± 0.0
1.019HisPro: 1.019 ± 0.0
4.077HisGln: 4.077 ± 0.0
2.039HisArg: 2.039 ± 0.0
3.058HisSer: 3.058 ± 0.0
2.039HisThr: 2.039 ± 0.0
2.039HisVal: 2.039 ± 0.0
1.019HisTrp: 1.019 ± 0.0
6.116HisTyr: 6.116 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.097IleAla: 5.097 ± 0.0
0.0IleCys: 0.0 ± 0.0
3.058IleAsp: 3.058 ± 0.0
3.058IleGlu: 3.058 ± 0.0
0.0IlePhe: 0.0 ± 0.0
3.058IleGly: 3.058 ± 0.0
1.019IleHis: 1.019 ± 0.0
0.0IleIle: 0.0 ± 0.0
4.077IleLys: 4.077 ± 0.0
4.077IleLeu: 4.077 ± 0.0
2.039IleMet: 2.039 ± 0.0
4.077IleAsn: 4.077 ± 0.0
1.019IlePro: 1.019 ± 0.0
1.019IleGln: 1.019 ± 0.0
2.039IleArg: 2.039 ± 0.0
6.116IleSer: 6.116 ± 0.0
2.039IleThr: 2.039 ± 0.0
5.097IleVal: 5.097 ± 0.0
1.019IleTrp: 1.019 ± 0.0
2.039IleTyr: 2.039 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.039LysAla: 2.039 ± 0.0
0.0LysCys: 0.0 ± 0.0
2.039LysAsp: 2.039 ± 0.0
2.039LysGlu: 2.039 ± 0.0
1.019LysPhe: 1.019 ± 0.0
1.019LysGly: 1.019 ± 0.0
5.097LysHis: 5.097 ± 0.0
1.019LysIle: 1.019 ± 0.0
1.019LysLys: 1.019 ± 0.0
3.058LysLeu: 3.058 ± 0.0
1.019LysMet: 1.019 ± 0.0
2.039LysAsn: 2.039 ± 0.0
3.058LysPro: 3.058 ± 0.0
3.058LysGln: 3.058 ± 0.0
2.039LysArg: 2.039 ± 0.0
2.039LysSer: 2.039 ± 0.0
4.077LysThr: 4.077 ± 0.0
2.039LysVal: 2.039 ± 0.0
0.0LysTrp: 0.0 ± 0.0
1.019LysTyr: 1.019 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
9.174LeuAla: 9.174 ± 0.0
1.019LeuCys: 1.019 ± 0.0
4.077LeuAsp: 4.077 ± 0.0
3.058LeuGlu: 3.058 ± 0.0
2.039LeuPhe: 2.039 ± 0.0
3.058LeuGly: 3.058 ± 0.0
4.077LeuHis: 4.077 ± 0.0
1.019LeuIle: 1.019 ± 0.0
1.019LeuLys: 1.019 ± 0.0
4.077LeuLeu: 4.077 ± 0.0
4.077LeuMet: 4.077 ± 0.0
2.039LeuAsn: 2.039 ± 0.0
5.097LeuPro: 5.097 ± 0.0
3.058LeuGln: 3.058 ± 0.0
5.097LeuArg: 5.097 ± 0.0
8.155LeuSer: 8.155 ± 0.0
3.058LeuThr: 3.058 ± 0.0
3.058LeuVal: 3.058 ± 0.0
1.019LeuTrp: 1.019 ± 0.0
2.039LeuTyr: 2.039 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.019MetAla: 1.019 ± 0.0
0.0MetCys: 0.0 ± 0.0
1.019MetAsp: 1.019 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.019MetGly: 1.019 ± 0.0
2.039MetHis: 2.039 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.019MetLys: 1.019 ± 0.0
1.019MetLeu: 1.019 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
3.058MetPro: 3.058 ± 0.0
5.097MetGln: 5.097 ± 0.0
2.039MetArg: 2.039 ± 0.0
3.058MetSer: 3.058 ± 0.0
1.019MetThr: 1.019 ± 0.0
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.019MetTyr: 1.019 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.058AsnAla: 3.058 ± 0.0
0.0AsnCys: 0.0 ± 0.0
2.039AsnAsp: 2.039 ± 0.0
5.097AsnGlu: 5.097 ± 0.0
2.039AsnPhe: 2.039 ± 0.0
2.039AsnGly: 2.039 ± 0.0
0.0AsnHis: 0.0 ± 0.0
4.077AsnIle: 4.077 ± 0.0
3.058AsnLys: 3.058 ± 0.0
2.039AsnLeu: 2.039 ± 0.0
0.0AsnMet: 0.0 ± 0.0
2.039AsnAsn: 2.039 ± 0.0
1.019AsnPro: 1.019 ± 0.0
4.077AsnGln: 4.077 ± 0.0
3.058AsnArg: 3.058 ± 0.0
5.097AsnSer: 5.097 ± 0.0
3.058AsnThr: 3.058 ± 0.0
3.058AsnVal: 3.058 ± 0.0
1.019AsnTrp: 1.019 ± 0.0
2.039AsnTyr: 2.039 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
5.097ProAla: 5.097 ± 0.0
2.039ProCys: 2.039 ± 0.0
2.039ProAsp: 2.039 ± 0.0
4.077ProGlu: 4.077 ± 0.0
0.0ProPhe: 0.0 ± 0.0
2.039ProGly: 2.039 ± 0.0
2.039ProHis: 2.039 ± 0.0
1.019ProIle: 1.019 ± 0.0
5.097ProLys: 5.097 ± 0.0
2.039ProLeu: 2.039 ± 0.0
0.0ProMet: 0.0 ± 0.0
2.039ProAsn: 2.039 ± 0.0
1.019ProPro: 1.019 ± 0.0
3.058ProGln: 3.058 ± 0.0
4.077ProArg: 4.077 ± 0.0
3.058ProSer: 3.058 ± 0.0
4.077ProThr: 4.077 ± 0.0
6.116ProVal: 6.116 ± 0.0
3.058ProTrp: 3.058 ± 0.0
2.039ProTyr: 2.039 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.039GlnAla: 2.039 ± 0.0
2.039GlnCys: 2.039 ± 0.0
3.058GlnAsp: 3.058 ± 0.0
4.077GlnGlu: 4.077 ± 0.0
1.019GlnPhe: 1.019 ± 0.0
3.058GlnGly: 3.058 ± 0.0
1.019GlnHis: 1.019 ± 0.0
2.039GlnIle: 2.039 ± 0.0
1.019GlnLys: 1.019 ± 0.0
7.136GlnLeu: 7.136 ± 0.0
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
8.155GlnPro: 8.155 ± 0.0
1.019GlnGln: 1.019 ± 0.0
5.097GlnArg: 5.097 ± 0.0
2.039GlnSer: 2.039 ± 0.0
5.097GlnThr: 5.097 ± 0.0
0.0GlnVal: 0.0 ± 0.0
1.019GlnTrp: 1.019 ± 0.0
2.039GlnTyr: 2.039 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
5.097ArgAla: 5.097 ± 0.0
1.019ArgCys: 1.019 ± 0.0
3.058ArgAsp: 3.058 ± 0.0
3.058ArgGlu: 3.058 ± 0.0
1.019ArgPhe: 1.019 ± 0.0
2.039ArgGly: 2.039 ± 0.0
2.039ArgHis: 2.039 ± 0.0
3.058ArgIle: 3.058 ± 0.0
3.058ArgLys: 3.058 ± 0.0
2.039ArgLeu: 2.039 ± 0.0
2.039ArgMet: 2.039 ± 0.0
6.116ArgAsn: 6.116 ± 0.0
3.058ArgPro: 3.058 ± 0.0
3.058ArgGln: 3.058 ± 0.0
6.116ArgArg: 6.116 ± 0.0
6.116ArgSer: 6.116 ± 0.0
4.077ArgThr: 4.077 ± 0.0
5.097ArgVal: 5.097 ± 0.0
1.019ArgTrp: 1.019 ± 0.0
3.058ArgTyr: 3.058 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
7.136SerAla: 7.136 ± 0.0
0.0SerCys: 0.0 ± 0.0
4.077SerAsp: 4.077 ± 0.0
1.019SerGlu: 1.019 ± 0.0
3.058SerPhe: 3.058 ± 0.0
9.174SerGly: 9.174 ± 0.0
1.019SerHis: 1.019 ± 0.0
8.155SerIle: 8.155 ± 0.0
3.058SerLys: 3.058 ± 0.0
2.039SerLeu: 2.039 ± 0.0
0.0SerMet: 0.0 ± 0.0
3.058SerAsn: 3.058 ± 0.0
5.097SerPro: 5.097 ± 0.0
2.039SerGln: 2.039 ± 0.0
6.116SerArg: 6.116 ± 0.0
4.077SerSer: 4.077 ± 0.0
10.194SerThr: 10.194 ± 0.0
4.077SerVal: 4.077 ± 0.0
0.0SerTrp: 0.0 ± 0.0
1.019SerTyr: 1.019 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
8.155ThrAla: 8.155 ± 0.0
1.019ThrCys: 1.019 ± 0.0
5.097ThrAsp: 5.097 ± 0.0
4.077ThrGlu: 4.077 ± 0.0
4.077ThrPhe: 4.077 ± 0.0
2.039ThrGly: 2.039 ± 0.0
0.0ThrHis: 0.0 ± 0.0
6.116ThrIle: 6.116 ± 0.0
3.058ThrLys: 3.058 ± 0.0
8.155ThrLeu: 8.155 ± 0.0
3.058ThrMet: 3.058 ± 0.0
5.097ThrAsn: 5.097 ± 0.0
5.097ThrPro: 5.097 ± 0.0
3.058ThrGln: 3.058 ± 0.0
4.077ThrArg: 4.077 ± 0.0
2.039ThrSer: 2.039 ± 0.0
6.116ThrThr: 6.116 ± 0.0
4.077ThrVal: 4.077 ± 0.0
0.0ThrTrp: 0.0 ± 0.0
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
7.136ValAla: 7.136 ± 0.0
1.019ValCys: 1.019 ± 0.0
6.116ValAsp: 6.116 ± 0.0
3.058ValGlu: 3.058 ± 0.0
3.058ValPhe: 3.058 ± 0.0
1.019ValGly: 1.019 ± 0.0
1.019ValHis: 1.019 ± 0.0
6.116ValIle: 6.116 ± 0.0
3.058ValLys: 3.058 ± 0.0
4.077ValLeu: 4.077 ± 0.0
1.019ValMet: 1.019 ± 0.0
3.058ValAsn: 3.058 ± 0.0
2.039ValPro: 2.039 ± 0.0
1.019ValGln: 1.019 ± 0.0
3.058ValArg: 3.058 ± 0.0
5.097ValSer: 5.097 ± 0.0
4.077ValThr: 4.077 ± 0.0
3.058ValVal: 3.058 ± 0.0
0.0ValTrp: 0.0 ± 0.0
1.019ValTyr: 1.019 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.019TrpAla: 1.019 ± 0.0
1.019TrpCys: 1.019 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.019TrpGlu: 1.019 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
1.019TrpGly: 1.019 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
3.058TrpLeu: 3.058 ± 0.0
0.0TrpMet: 0.0 ± 0.0
2.039TrpAsn: 2.039 ± 0.0
1.019TrpPro: 1.019 ± 0.0
1.019TrpGln: 1.019 ± 0.0
1.019TrpArg: 1.019 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
2.039TrpVal: 2.039 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.019TrpTyr: 1.019 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
5.097TyrAla: 5.097 ± 0.0
0.0TyrCys: 0.0 ± 0.0
2.039TyrAsp: 2.039 ± 0.0
3.058TyrGlu: 3.058 ± 0.0
3.058TyrPhe: 3.058 ± 0.0
3.058TyrGly: 3.058 ± 0.0
3.058TyrHis: 3.058 ± 0.0
1.019TyrIle: 1.019 ± 0.0
1.019TyrLys: 1.019 ± 0.0
5.097TyrLeu: 5.097 ± 0.0
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
1.019TyrPro: 1.019 ± 0.0
1.019TyrGln: 1.019 ± 0.0
3.058TyrArg: 3.058 ± 0.0
4.077TyrSer: 4.077 ± 0.0
3.058TyrThr: 3.058 ± 0.0
5.097TyrVal: 5.097 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
1.019TyrTyr: 1.019 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (982 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski