Amino acid dipeptide frequency for Hubei sobemo-like virus 8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
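The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the table: the function names, the one-letter input alphabet, and the toy sequences are hypothetical, and every residue outside the 20 standard ones is simply mapped to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
EXPECTED_PERMILLE = 1000.0 / 400    # uniform expectation: 2.5 per mille (0.25 %)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything that is not a standard residue becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"       # would be marked red in the table
    if freq_permille < expected:
        return "under"      # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    toy_proteins = ["MKVLAAG", "MSTAVG"]   # hypothetical sequences
    for dp, freq in sorted(dipeptide_permille(toy_proteins).items()):
        print(dp, round(freq, 3), classify(freq))
```

Dipeptides that never occur are absent from the returned dictionary; treating them as 0.0 reproduces the zero entries seen in the table.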

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
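As a small worked example of this unit conversion, the snippet below turns the AlaAla value from the table (5.741‰) into a fraction and a percentage, and compares it with the uniform expectation of 1/400. The helper names are hypothetical and chosen only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_over_uniform(value_permille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation 1/n_dipeptides."""
    return permille_to_fraction(value_permille) * n_dipeptides


if __name__ == "__main__":
    ala_ala = 5.741  # AlaAla frequency from the table, in per mille
    print("fraction:", permille_to_fraction(ala_ala))
    print("percent:", permille_to_fraction(ala_ala) * 100)
    print("fold over uniform expectation:", fold_over_uniform(ala_ala))
```

A ratio above 1 corresponds to an overrepresented dipeptide, a ratio below 1 to an underrepresented one.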

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.741 ± 3.199
AlaCys: 0.0 ± 0.0
AlaAsp: 2.296 ± 1.937
AlaGlu: 1.148 ± 0.64
AlaPhe: 4.592 ± 0.657
AlaGly: 10.333 ± 4.15
AlaHis: 1.148 ± 0.969
AlaIle: 2.296 ± 1.937
AlaLys: 3.444 ± 1.297
AlaLeu: 4.592 ± 0.951
AlaMet: 2.296 ± 0.91
AlaAsn: 1.148 ± 0.969
AlaPro: 3.444 ± 1.92
AlaGln: 4.592 ± 0.657
AlaArg: 6.889 ± 3.839
AlaSer: 4.592 ± 2.559
AlaThr: 2.296 ± 0.329
AlaVal: 9.185 ± 1.902
AlaTrp: 1.148 ± 0.969
AlaTyr: 8.037 ± 0.346
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.148 ± 0.64
CysCys: 1.148 ± 0.969
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 1.148 ± 0.969
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.148 ± 0.969
CysLys: 0.0 ± 0.0
CysLeu: 2.296 ± 1.937
CysMet: 1.148 ± 0.64
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 2.296 ± 0.329
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 1.148 ± 0.969
CysTyr: 1.148 ± 0.969
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.296 ± 0.329
AspCys: 0.0 ± 0.0
AspAsp: 2.296 ± 0.329
AspGlu: 2.296 ± 1.28
AspPhe: 1.148 ± 0.969
AspGly: 4.592 ± 0.657
AspHis: 1.148 ± 0.64
AspIle: 1.148 ± 0.969
AspLys: 2.296 ± 0.329
AspLeu: 6.889 ± 2.231
AspMet: 1.148 ± 0.969
AspAsn: 0.0 ± 0.0
AspPro: 4.592 ± 0.657
AspGln: 2.296 ± 0.329
AspArg: 4.592 ± 0.657
AspSer: 3.444 ± 0.311
AspThr: 3.444 ± 1.297
AspVal: 1.148 ± 0.64
AspTrp: 2.296 ± 1.937
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.444 ± 0.311
GluCys: 1.148 ± 0.969
GluAsp: 5.741 ± 1.591
GluGlu: 5.741 ± 1.591
GluPhe: 1.148 ± 0.969
GluGly: 2.296 ± 0.329
GluHis: 1.148 ± 0.64
GluIle: 5.741 ± 1.591
GluLys: 0.0 ± 0.0
GluLeu: 2.296 ± 1.937
GluMet: 1.148 ± 0.969
GluAsn: 3.444 ± 1.92
GluPro: 4.592 ± 2.266
GluGln: 2.296 ± 1.28
GluArg: 3.444 ± 1.92
GluSer: 4.592 ± 0.951
GluThr: 0.0 ± 0.0
GluVal: 5.741 ± 0.018
GluTrp: 1.148 ± 0.64
GluTyr: 1.148 ± 0.64
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.296 ± 1.937
PheCys: 2.296 ± 0.329
PheAsp: 0.0 ± 0.0
PheGlu: 6.889 ± 0.986
PhePhe: 0.0 ± 0.0
PheGly: 4.592 ± 2.266
PheHis: 0.0 ± 0.0
PheIle: 2.296 ± 1.28
PheLys: 3.444 ± 2.906
PheLeu: 3.444 ± 1.92
PheMet: 2.296 ± 1.28
PheAsn: 1.148 ± 0.969
PhePro: 0.0 ± 0.0
PheGln: 4.592 ± 0.657
PheArg: 1.148 ± 0.969
PheSer: 2.296 ± 1.28
PheThr: 1.148 ± 0.64
PheVal: 10.333 ± 0.933
PheTrp: 1.148 ± 0.64
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.592 ± 2.266
GlyCys: 2.296 ± 1.937
GlyAsp: 3.444 ± 0.311
GlyGlu: 5.741 ± 1.626
GlyPhe: 5.741 ± 0.018
GlyGly: 3.444 ± 0.311
GlyHis: 2.296 ± 0.329
GlyIle: 4.592 ± 0.951
GlyLys: 8.037 ± 1.262
GlyLeu: 5.741 ± 3.199
GlyMet: 0.0 ± 0.0
GlyAsn: 0.0 ± 0.0
GlyPro: 2.296 ± 1.28
GlyGln: 3.444 ± 0.311
GlyArg: 4.592 ± 3.874
GlySer: 11.481 ± 3.182
GlyThr: 1.148 ± 0.969
GlyVal: 1.148 ± 0.969
GlyTrp: 4.592 ± 3.874
GlyTyr: 2.296 ± 1.28
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.148 ± 0.64
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 2.296 ± 0.329
HisLys: 0.0 ± 0.0
HisLeu: 1.148 ± 0.969
HisMet: 0.0 ± 0.0
HisAsn: 1.148 ± 0.64
HisPro: 0.0 ± 0.0
HisGln: 1.148 ± 0.64
HisArg: 0.0 ± 0.0
HisSer: 3.444 ± 2.906
HisThr: 2.296 ± 0.329
HisVal: 3.444 ± 1.92
HisTrp: 0.0 ± 0.0
HisTyr: 1.148 ± 0.969
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.148 ± 0.64
IleCys: 0.0 ± 0.0
IleAsp: 1.148 ± 0.969
IleGlu: 1.148 ± 0.64
IlePhe: 2.296 ± 1.937
IleGly: 4.592 ± 0.951
IleHis: 0.0 ± 0.0
IleIle: 1.148 ± 0.64
IleLys: 2.296 ± 0.329
IleLeu: 4.592 ± 0.657
IleMet: 3.444 ± 1.297
IleAsn: 1.148 ± 0.64
IlePro: 4.592 ± 2.559
IleGln: 0.0 ± 0.0
IleArg: 3.444 ± 0.311
IleSer: 5.741 ± 0.018
IleThr: 1.148 ± 0.64
IleVal: 1.148 ± 0.64
IleTrp: 1.148 ± 0.64
IleTyr: 2.296 ± 1.28
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.296 ± 0.329
LysCys: 0.0 ± 0.0
LysAsp: 1.148 ± 0.969
LysGlu: 1.148 ± 0.969
LysPhe: 0.0 ± 0.0
LysGly: 5.741 ± 1.591
LysHis: 2.296 ± 0.329
LysIle: 1.148 ± 0.64
LysLys: 1.148 ± 0.969
LysLeu: 3.444 ± 1.297
LysMet: 4.592 ± 0.657
LysAsn: 1.148 ± 0.969
LysPro: 1.148 ± 0.969
LysGln: 1.148 ± 0.64
LysArg: 2.296 ± 0.329
LysSer: 8.037 ± 1.262
LysThr: 5.741 ± 0.018
LysVal: 2.296 ± 1.28
LysTrp: 1.148 ± 0.969
LysTyr: 2.296 ± 1.28
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.185 ± 1.315
LeuCys: 1.148 ± 0.969
LeuAsp: 6.889 ± 0.622
LeuGlu: 4.592 ± 0.951
LeuPhe: 5.741 ± 1.626
LeuGly: 6.889 ± 4.203
LeuHis: 1.148 ± 0.64
LeuIle: 3.444 ± 0.311
LeuLys: 4.592 ± 0.951
LeuLeu: 10.333 ± 0.675
LeuMet: 1.148 ± 0.64
LeuAsn: 4.592 ± 2.266
LeuPro: 2.296 ± 1.28
LeuGln: 1.148 ± 0.64
LeuArg: 5.741 ± 1.591
LeuSer: 5.741 ± 0.018
LeuThr: 4.592 ± 0.657
LeuVal: 9.185 ± 0.294
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.296 ± 1.937
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.296 ± 0.329
MetCys: 0.0 ± 0.0
MetAsp: 2.296 ± 0.329
MetGlu: 3.444 ± 0.311
MetPhe: 3.444 ± 0.311
MetGly: 1.148 ± 0.64
MetHis: 1.148 ± 0.969
MetIle: 0.0 ± 0.0
MetLys: 1.148 ± 0.969
MetLeu: 0.0 ± 0.0
MetMet: 2.296 ± 0.329
MetAsn: 1.148 ± 0.969
MetPro: 2.296 ± 0.329
MetGln: 1.148 ± 0.64
MetArg: 2.296 ± 1.28
MetSer: 3.444 ± 1.92
MetThr: 3.444 ± 0.311
MetVal: 3.444 ± 1.297
MetTrp: 1.148 ± 0.969
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.148 ± 0.969
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 1.148 ± 0.969
AsnPhe: 1.148 ± 0.64
AsnGly: 2.296 ± 0.329
AsnHis: 0.0 ± 0.0
AsnIle: 0.0 ± 0.0
AsnLys: 0.0 ± 0.0
AsnLeu: 6.889 ± 0.622
AsnMet: 0.0 ± 0.0
AsnAsn: 0.0 ± 0.0
AsnPro: 2.296 ± 1.937
AsnGln: 1.148 ± 0.969
AsnArg: 0.0 ± 0.0
AsnSer: 1.148 ± 0.969
AsnThr: 0.0 ± 0.0
AsnVal: 1.148 ± 0.969
AsnTrp: 1.148 ± 0.64
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.889 ± 0.622
ProCys: 1.148 ± 0.969
ProAsp: 2.296 ± 0.329
ProGlu: 4.592 ± 0.657
ProPhe: 0.0 ± 0.0
ProGly: 3.444 ± 1.297
ProHis: 3.444 ± 1.297
ProIle: 1.148 ± 0.969
ProLys: 1.148 ± 0.64
ProLeu: 5.741 ± 0.018
ProMet: 0.0 ± 0.0
ProAsn: 0.0 ± 0.0
ProPro: 0.0 ± 0.0
ProGln: 1.148 ± 0.64
ProArg: 4.592 ± 2.559
ProSer: 5.741 ± 3.199
ProThr: 1.148 ± 0.969
ProVal: 4.592 ± 0.657
ProTrp: 0.0 ± 0.0
ProTyr: 2.296 ± 1.28
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.741 ± 3.199
GlnCys: 0.0 ± 0.0
GlnAsp: 2.296 ± 0.329
GlnGlu: 3.444 ± 1.92
GlnPhe: 1.148 ± 0.969
GlnGly: 5.741 ± 1.626
GlnHis: 0.0 ± 0.0
GlnIle: 3.444 ± 0.311
GlnLys: 5.741 ± 1.591
GlnLeu: 3.444 ± 0.311
GlnMet: 1.148 ± 0.52
GlnAsn: 1.148 ± 0.969
GlnPro: 3.444 ± 0.311
GlnGln: 1.148 ± 0.64
GlnArg: 2.296 ± 1.937
GlnSer: 1.148 ± 0.969
GlnThr: 0.0 ± 0.0
GlnVal: 1.148 ± 0.969
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.592 ± 0.951
ArgCys: 0.0 ± 0.0
ArgAsp: 2.296 ± 1.28
ArgGlu: 4.592 ± 0.657
ArgPhe: 5.741 ± 1.591
ArgGly: 5.741 ± 1.591
ArgHis: 0.0 ± 0.0
ArgIle: 0.0 ± 0.0
ArgLys: 2.296 ± 0.329
ArgLeu: 6.889 ± 0.986
ArgMet: 1.148 ± 0.64
ArgAsn: 1.148 ± 0.64
ArgPro: 0.0 ± 0.0
ArgGln: 3.444 ± 1.297
ArgArg: 8.037 ± 2.871
ArgSer: 3.444 ± 1.92
ArgThr: 4.592 ± 0.657
ArgVal: 6.889 ± 2.595
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.148 ± 0.969
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 10.333 ± 0.933
SerCys: 0.0 ± 0.0
SerAsp: 3.444 ± 0.311
SerGlu: 2.296 ± 1.28
SerPhe: 6.889 ± 2.231
SerGly: 5.741 ± 0.018
SerHis: 0.0 ± 0.0
SerIle: 3.444 ± 1.92
SerLys: 2.296 ± 1.28
SerLeu: 9.185 ± 1.315
SerMet: 3.444 ± 1.92
SerAsn: 0.0 ± 0.0
SerPro: 10.333 ± 0.933
SerGln: 3.444 ± 1.92
SerArg: 1.148 ± 0.64
SerSer: 9.185 ± 5.119
SerThr: 9.185 ± 1.902
SerVal: 4.592 ± 2.266
SerTrp: 0.0 ± 0.0
SerTyr: 1.148 ± 0.969
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.741 ± 3.199
ThrCys: 2.296 ± 0.329
ThrAsp: 0.0 ± 0.0
ThrGlu: 1.148 ± 0.64
ThrPhe: 3.444 ± 0.311
ThrGly: 1.148 ± 0.64
ThrHis: 0.0 ± 0.0
ThrIle: 4.592 ± 0.657
ThrLys: 0.0 ± 0.0
ThrLeu: 4.592 ± 0.657
ThrMet: 2.296 ± 0.329
ThrAsn: 1.148 ± 0.969
ThrPro: 4.592 ± 0.657
ThrGln: 3.444 ± 1.297
ThrArg: 1.148 ± 0.64
ThrSer: 2.296 ± 0.329
ThrThr: 0.0 ± 0.0
ThrVal: 5.741 ± 1.626
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.148 ± 0.64
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.037 ± 1.262
ValCys: 0.0 ± 0.0
ValAsp: 6.889 ± 2.595
ValGlu: 4.592 ± 0.951
ValPhe: 3.444 ± 1.297
ValGly: 6.889 ± 2.595
ValHis: 2.296 ± 0.329
ValIle: 3.444 ± 1.92
ValLys: 8.037 ± 0.346
ValLeu: 4.592 ± 2.266
ValMet: 3.444 ± 0.311
ValAsn: 1.148 ± 0.969
ValPro: 1.148 ± 0.64
ValGln: 4.592 ± 0.657
ValArg: 4.592 ± 0.657
ValSer: 6.889 ± 0.622
ValThr: 3.444 ± 0.311
ValVal: 4.592 ± 0.951
ValTrp: 0.0 ± 0.0
ValTyr: 6.889 ± 2.231
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.148 ± 0.64
TrpCys: 0.0 ± 0.0
TrpAsp: 2.296 ± 1.937
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 1.148 ± 0.969
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.148 ± 0.969
TrpLeu: 2.296 ± 0.329
TrpMet: 1.148 ± 0.969
TrpAsn: 0.0 ± 0.0
TrpPro: 2.296 ± 0.329
TrpGln: 1.148 ± 0.969
TrpArg: 1.148 ± 0.969
TrpSer: 2.296 ± 1.937
TrpThr: 0.0 ± 0.0
TrpVal: 1.148 ± 0.64
TrpTrp: 1.148 ± 0.969
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.296 ± 0.329
TyrCys: 1.148 ± 0.64
TyrAsp: 2.296 ± 1.28
TyrGlu: 1.148 ± 0.64
TyrPhe: 2.296 ± 1.28
TyrGly: 1.148 ± 0.64
TyrHis: 1.148 ± 0.64
TyrIle: 1.148 ± 0.64
TyrLys: 1.148 ± 0.64
TyrLeu: 2.296 ± 1.937
TyrMet: 2.296 ± 0.329
TyrAsn: 0.0 ± 0.0
TyrPro: 0.0 ± 0.0
TyrGln: 1.148 ± 0.64
TyrArg: 2.296 ± 1.937
TyrSer: 1.148 ± 0.64
TyrThr: 1.148 ± 0.64
TyrVal: 8.037 ± 0.346
TyrTrp: 1.148 ± 0.969
TyrTyr: 3.444 ± 1.92
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (872 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level
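One plausible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for every replicate, and the spread of the replicates is reported as the error. This is an assumption made for illustration only. The page does not state the exact estimator, and the function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(proteins) proteins with replacement, so the
    resampling unit is the protein rather than the individual residue.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        counts = Counter()
        for seq in sample:
            counts.update(dipeptide_counts(seq))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[dipeptide] / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteins = ["MKVLAAGSS", "MSTAVGAAL"]  # hypothetical sequences
    mean, error = bootstrap_permille(toy_proteins, "AA")
    print(f"AA: {mean:.3f} ± {error:.3f}")
```

With only two proteins, each replicate can contain just three distinct compositions (the first protein twice, the second twice, or one of each), which is why so few distinct error values recur throughout the table.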

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski