Amino acid dipeptide frequency for Hubei macula-like virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
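As a minimal sketch of how such frequencies can be computed, the snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, and the mapping of any non-standard letter to `X` (Xaa) are assumptions made for illustration; this is not the code used by Proteome-pI.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not the viral proteome).
freqs = dipeptide_frequencies(["MALLK", "LLA"])
print(round(freqs["LL"], 3))
```

A protein of length n contributes n - 1 dipeptides, so the denominator is the total number of adjacent pairs rather than the number of amino acids.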

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
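The arithmetic behind the expected value, and the over/under-representation rule, can be sketched as follows. The helper `classify` and its labels are hypothetical; the page states only that values above the expectation are red and values below it are blue, so the exact threshold used for colouring is an assumption here.

```python
# Uniform expectation: every dipeptide equally likely.
expected_permille = 1000.0 / 20 ** 2      # 400 dipeptides -> 2.5 per mille (0.25 %)
expected_with_xaa = 1000.0 / 21 ** 2      # 441 dipeptides when Xaa is included

def classify(freq_permille, expected=expected_permille):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"    # would be marked red
    if freq_permille < expected:
        return "under"   # would be marked blue
    return "expected"

# Two values taken from the table: LeuLeu and AlaAla.
print(classify(17.316), classify(1.082))
```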

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.082 ± 1.615
AlaCys: 1.082 ± 1.615
AlaAsp: 1.082 ± 0.512
AlaGlu: 1.082 ± 0.512
AlaPhe: 3.247 ± 0.591
AlaGly: 1.623 ± 0.768
AlaHis: 1.623 ± 1.359
AlaIle: 3.788 ± 2.462
AlaLys: 4.87 ± 1.949
AlaLeu: 8.117 ± 0.413
AlaMet: 1.082 ± 0.699
AlaAsn: 5.952 ± 0.69
AlaPro: 4.87 ± 1.949
AlaGln: 2.165 ± 1.024
AlaArg: 0.541 ± 0.256
AlaSer: 3.788 ± 0.334
AlaThr: 2.706 ± 0.847
AlaVal: 4.329 ± 2.206
AlaTrp: 0.541 ± 0.256
AlaTyr: 0.541 ± 0.256
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.541 ± 1.871
CysCys: 0.0 ± 0.0
CysAsp: 0.541 ± 1.871
CysGlu: 1.082 ± 0.512
CysPhe: 1.082 ± 1.615
CysGly: 1.082 ± 0.512
CysHis: 0.0 ± 0.0
CysIle: 1.623 ± 0.768
CysLys: 0.541 ± 0.256
CysLeu: 2.165 ± 1.103
CysMet: 0.541 ± 0.256
CysAsn: 1.082 ± 0.512
CysPro: 1.082 ± 1.615
CysGln: 2.165 ± 1.024
CysArg: 0.541 ± 0.256
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.541 ± 0.256
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.788 ± 0.334
AspCys: 0.541 ± 1.871
AspAsp: 2.165 ± 1.103
AspGlu: 1.082 ± 0.512
AspPhe: 2.165 ± 1.024
AspGly: 1.082 ± 1.615
AspHis: 1.082 ± 0.512
AspIle: 2.706 ± 1.281
AspLys: 1.623 ± 0.768
AspLeu: 3.788 ± 2.462
AspMet: 0.0 ± 0.0
AspAsn: 3.247 ± 1.537
AspPro: 6.494 ± 0.946
AspGln: 1.623 ± 0.768
AspArg: 2.165 ± 1.103
AspSer: 3.788 ± 1.793
AspThr: 0.541 ± 1.871
AspVal: 0.0 ± 0.0
AspTrp: 0.541 ± 0.256
AspTyr: 2.165 ± 1.024
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.541 ± 1.871
GluCys: 0.541 ± 0.256
GluAsp: 0.541 ± 0.256
GluGlu: 1.082 ± 0.512
GluPhe: 0.541 ± 0.256
GluGly: 0.541 ± 0.256
GluHis: 1.082 ± 1.615
GluIle: 1.082 ± 1.615
GluLys: 2.165 ± 1.103
GluLeu: 4.87 ± 0.178
GluMet: 0.0 ± 0.0
GluAsn: 1.623 ± 0.768
GluPro: 2.706 ± 1.281
GluGln: 1.082 ± 0.512
GluArg: 0.0 ± 0.0
GluSer: 4.329 ± 0.078
GluThr: 3.247 ± 1.537
GluVal: 2.165 ± 1.024
GluTrp: 0.541 ± 0.256
GluTyr: 1.623 ± 0.768
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.411 ± 2.561
PheCys: 1.623 ± 0.768
PheAsp: 4.329 ± 0.078
PheGlu: 1.623 ± 0.768
PhePhe: 1.082 ± 1.615
PheGly: 1.623 ± 1.359
PheHis: 1.623 ± 0.768
PheIle: 3.788 ± 1.793
PheLys: 0.541 ± 0.256
PheLeu: 3.788 ± 0.334
PheMet: 1.082 ± 1.615
PheAsn: 3.247 ± 1.537
PhePro: 4.329 ± 0.078
PheGln: 2.706 ± 0.847
PheArg: 4.329 ± 0.078
PheSer: 2.706 ± 1.281
PheThr: 3.788 ± 0.334
PheVal: 1.082 ± 0.512
PheTrp: 0.541 ± 0.256
PheTyr: 2.165 ± 1.103
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.623 ± 1.359
GlyCys: 0.541 ± 0.256
GlyAsp: 1.623 ± 0.768
GlyGlu: 2.165 ± 1.024
GlyPhe: 1.623 ± 0.768
GlyGly: 1.623 ± 1.359
GlyHis: 1.082 ± 0.512
GlyIle: 2.706 ± 2.974
GlyLys: 1.623 ± 0.768
GlyLeu: 2.165 ± 1.103
GlyMet: 0.0 ± 0.0
GlyAsn: 2.165 ± 1.024
GlyPro: 3.247 ± 1.537
GlyGln: 1.082 ± 1.615
GlyArg: 0.0 ± 0.0
GlySer: 2.706 ± 0.847
GlyThr: 2.706 ± 0.847
GlyVal: 1.082 ± 0.512
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.165 ± 1.024
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.247 ± 0.591
HisCys: 0.0 ± 0.0
HisAsp: 2.165 ± 1.103
HisGlu: 1.082 ± 0.512
HisPhe: 2.165 ± 1.103
HisGly: 1.082 ± 0.512
HisHis: 2.165 ± 1.024
HisIle: 1.623 ± 1.359
HisLys: 2.706 ± 1.281
HisLeu: 5.411 ± 2.561
HisMet: 0.0 ± 0.0
HisAsn: 2.706 ± 1.281
HisPro: 4.87 ± 1.949
HisGln: 1.082 ± 0.512
HisArg: 1.623 ± 1.359
HisSer: 3.247 ± 1.537
HisThr: 2.165 ± 1.024
HisVal: 2.706 ± 1.281
HisTrp: 1.082 ± 0.512
HisTyr: 2.165 ± 1.024
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.247 ± 2.718
IleCys: 0.0 ± 0.0
IleAsp: 3.788 ± 1.793
IleGlu: 3.247 ± 0.591
IlePhe: 3.788 ± 1.793
IleGly: 1.082 ± 0.512
IleHis: 3.788 ± 1.793
IleIle: 6.494 ± 9.69
IleLys: 1.623 ± 1.359
IleLeu: 8.117 ± 0.413
IleMet: 0.541 ± 0.256
IleAsn: 4.329 ± 2.049
IlePro: 4.87 ± 4.077
IleGln: 0.0 ± 0.0
IleArg: 1.082 ± 1.615
IleSer: 4.87 ± 1.949
IleThr: 5.952 ± 1.437
IleVal: 2.165 ± 1.024
IleTrp: 0.541 ± 0.256
IleTyr: 1.082 ± 0.512
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.165 ± 1.103
LysCys: 1.082 ± 0.512
LysAsp: 2.706 ± 1.281
LysGlu: 1.623 ± 0.768
LysPhe: 2.165 ± 1.103
LysGly: 1.082 ± 0.512
LysHis: 1.623 ± 0.768
LysIle: 2.706 ± 0.847
LysLys: 3.247 ± 0.591
LysLeu: 5.952 ± 0.69
LysMet: 1.082 ± 0.512
LysAsn: 3.788 ± 2.462
LysPro: 3.247 ± 0.591
LysGln: 1.082 ± 0.512
LysArg: 2.165 ± 1.024
LysSer: 2.165 ± 1.103
LysThr: 4.329 ± 2.049
LysVal: 2.165 ± 1.103
LysTrp: 0.541 ± 0.256
LysTyr: 1.623 ± 0.768
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.576 ± 1.458
LeuCys: 1.082 ± 0.512
LeuAsp: 3.247 ± 0.591
LeuGlu: 4.87 ± 1.949
LeuPhe: 8.117 ± 3.842
LeuGly: 4.329 ± 2.049
LeuHis: 5.952 ± 2.817
LeuIle: 3.788 ± 1.793
LeuLys: 4.329 ± 2.049
LeuLeu: 17.316 ± 0.313
LeuMet: 2.165 ± 1.103
LeuAsn: 7.576 ± 2.796
LeuPro: 11.905 ± 2.874
LeuGln: 4.329 ± 2.206
LeuArg: 7.576 ± 3.586
LeuSer: 13.528 ± 4.276
LeuThr: 8.658 ± 4.098
LeuVal: 5.952 ± 0.69
LeuTrp: 1.082 ± 0.512
LeuTyr: 3.247 ± 1.537
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.541 ± 0.256
MetCys: 0.0 ± 0.0
MetAsp: 0.541 ± 0.256
MetGlu: 0.0 ± 0.0
MetPhe: 0.541 ± 0.256
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.082 ± 0.512
MetLeu: 0.541 ± 0.256
MetMet: 0.541 ± 0.256
MetAsn: 1.623 ± 3.486
MetPro: 1.082 ± 0.512
MetGln: 0.541 ± 0.256
MetArg: 0.541 ± 0.256
MetSer: 1.623 ± 0.768
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.541 ± 1.871
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.788 ± 2.462
AsnCys: 1.082 ± 1.615
AsnAsp: 3.247 ± 1.537
AsnGlu: 1.082 ± 0.512
AsnPhe: 3.247 ± 0.591
AsnGly: 3.247 ± 1.537
AsnHis: 3.788 ± 0.334
AsnIle: 1.623 ± 0.768
AsnLys: 2.706 ± 1.281
AsnLeu: 10.281 ± 2.739
AsnMet: 0.541 ± 0.42
AsnAsn: 2.706 ± 1.281
AsnPro: 5.952 ± 0.69
AsnGln: 3.247 ± 0.591
AsnArg: 0.541 ± 0.256
AsnSer: 4.87 ± 0.178
AsnThr: 4.329 ± 0.078
AsnVal: 2.165 ± 1.024
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.706 ± 1.281
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.788 ± 2.462
ProCys: 1.082 ± 1.615
ProAsp: 4.329 ± 2.049
ProGlu: 2.706 ± 0.847
ProPhe: 6.494 ± 0.946
ProGly: 3.247 ± 0.591
ProHis: 4.329 ± 0.078
ProIle: 7.035 ± 3.052
ProLys: 3.788 ± 1.793
ProLeu: 7.576 ± 3.586
ProMet: 0.0 ± 0.0
ProAsn: 5.411 ± 2.561
ProPro: 11.364 ± 1.124
ProGln: 2.706 ± 1.281
ProArg: 1.623 ± 1.359
ProSer: 13.528 ± 10.615
ProThr: 8.658 ± 1.971
ProVal: 4.87 ± 0.178
ProTrp: 1.082 ± 0.512
ProTyr: 2.706 ± 0.847
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.082 ± 0.512
GlnCys: 0.0 ± 0.0
GlnAsp: 0.541 ± 0.256
GlnGlu: 0.541 ± 0.256
GlnPhe: 3.247 ± 0.591
GlnGly: 1.082 ± 0.512
GlnHis: 1.623 ± 0.768
GlnIle: 3.788 ± 1.793
GlnLys: 0.0 ± 0.0
GlnLeu: 4.329 ± 2.049
GlnMet: 0.0 ± 0.0
GlnAsn: 1.082 ± 0.512
GlnPro: 6.494 ± 3.073
GlnGln: 2.706 ± 1.281
GlnArg: 1.082 ± 0.512
GlnSer: 4.87 ± 4.077
GlnThr: 3.788 ± 2.462
GlnVal: 2.165 ± 1.024
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.165 ± 1.103
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.706 ± 1.281
ArgCys: 0.541 ± 0.256
ArgAsp: 2.165 ± 1.103
ArgGlu: 0.0 ± 0.0
ArgPhe: 1.623 ± 0.768
ArgGly: 2.706 ± 0.847
ArgHis: 0.541 ± 0.256
ArgIle: 2.165 ± 1.024
ArgLys: 2.165 ± 1.103
ArgLeu: 4.87 ± 1.949
ArgMet: 0.0 ± 0.0
ArgAsn: 1.623 ± 0.768
ArgPro: 0.541 ± 0.256
ArgGln: 1.623 ± 0.768
ArgArg: 0.0 ± 0.0
ArgSer: 5.411 ± 2.561
ArgThr: 1.623 ± 0.768
ArgVal: 1.082 ± 0.512
ArgTrp: 1.082 ± 0.512
ArgTyr: 1.623 ± 0.768
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.329 ± 2.206
SerCys: 0.541 ± 0.256
SerAsp: 2.706 ± 2.974
SerGlu: 2.165 ± 1.024
SerPhe: 3.788 ± 0.334
SerGly: 2.706 ± 2.974
SerHis: 2.706 ± 2.974
SerIle: 7.576 ± 0.669
SerLys: 5.952 ± 0.69
SerLeu: 12.446 ± 3.764
SerMet: 1.623 ± 0.768
SerAsn: 5.411 ± 2.561
SerPro: 8.658 ± 0.157
SerGln: 4.329 ± 2.206
SerArg: 3.788 ± 1.793
SerSer: 9.74 ± 8.153
SerThr: 8.658 ± 2.284
SerVal: 3.247 ± 2.718
SerTrp: 1.623 ± 0.768
SerTyr: 3.247 ± 1.537
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.706 ± 0.847
ThrCys: 1.082 ± 0.512
ThrAsp: 2.165 ± 1.024
ThrGlu: 2.706 ± 2.974
ThrPhe: 2.706 ± 1.281
ThrGly: 1.082 ± 0.512
ThrHis: 4.329 ± 2.049
ThrIle: 3.247 ± 2.718
ThrLys: 2.706 ± 1.281
ThrLeu: 11.905 ± 3.507
ThrMet: 0.0 ± 0.0
ThrAsn: 3.247 ± 0.591
ThrPro: 7.035 ± 1.202
ThrGln: 3.788 ± 1.793
ThrArg: 1.623 ± 0.768
ThrSer: 8.117 ± 0.413
ThrThr: 2.165 ± 1.024
ThrVal: 3.247 ± 6.972
ThrTrp: 1.082 ± 0.512
ThrTyr: 4.329 ± 2.049
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.87 ± 2.305
ValCys: 2.165 ± 1.103
ValAsp: 1.082 ± 1.615
ValGlu: 1.623 ± 0.768
ValPhe: 0.541 ± 0.256
ValGly: 2.165 ± 1.103
ValHis: 3.247 ± 0.591
ValIle: 3.247 ± 1.537
ValLys: 1.623 ± 3.486
ValLeu: 7.035 ± 3.33
ValMet: 0.0 ± 0.0
ValAsn: 2.165 ± 1.103
ValPro: 3.247 ± 2.718
ValGln: 1.623 ± 0.768
ValArg: 2.165 ± 1.024
ValSer: 2.165 ± 1.024
ValThr: 1.082 ± 0.512
ValVal: 2.165 ± 1.024
ValTrp: 0.541 ± 1.871
ValTyr: 2.165 ± 3.23
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 1.082 ± 0.512
TrpGlu: 0.541 ± 0.256
TrpPhe: 0.541 ± 1.871
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.082 ± 0.512
TrpLeu: 2.706 ± 1.281
TrpMet: 0.0 ± 0.0
TrpAsn: 1.082 ± 0.512
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.082 ± 0.512
TrpSer: 1.623 ± 1.359
TrpThr: 1.082 ± 0.512
TrpVal: 0.541 ± 0.256
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.541 ± 0.256
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.082 ± 1.615
TyrCys: 1.623 ± 0.768
TyrAsp: 0.541 ± 0.256
TyrGlu: 0.0 ± 0.0
TyrPhe: 2.706 ± 1.281
TyrGly: 0.541 ± 0.256
TyrHis: 2.706 ± 1.281
TyrIle: 1.623 ± 0.768
TyrLys: 2.165 ± 1.103
TyrLeu: 3.247 ± 1.537
TyrMet: 0.0 ± 0.0
TyrAsn: 1.623 ± 0.768
TyrPro: 4.329 ± 0.078
TyrGln: 2.706 ± 1.281
TyrArg: 1.623 ± 0.768
TyrSer: 2.165 ± 1.024
TyrThr: 3.788 ± 0.334
TyrVal: 3.247 ± 0.591
TyrTrp: 0.541 ± 0.256
TyrTyr: 0.541 ± 0.256
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1849 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
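The page does not describe the bootstrap procedure in detail, so the following is only a generic sketch of protein-level bootstrapping: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the standard deviation of those estimates is taken as the error. The function names, the fixed seed, the toy sequences, and the choice of the sample standard deviation are all assumptions.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled (assumed scheme)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy sequences, not the viral proteome.
error = bootstrap_error(["MALLK", "LLA", "GGLLA"], "LL")
print(error >= 0.0)
```

With only two proteins, as in this proteome, each resample can take just a few distinct compositions, which is one plausible reason why some errors in the table exceed the frequencies themselves.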

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski