Amino acid dipeptide frequency for Hubei sobemo-like virus 35

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
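As an illustration of the idea, the following is a minimal Python sketch of how dipeptide frequencies could be counted. It assumes one-letter residue codes, overlapping pairs counted within each protein, and normalization by the total number of pairs; the function name `dipeptide_permille` is hypothetical, and the exact procedure used to produce the table on this page is not stated here.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: overlapping pairs are counted within each sequence and the
    counts are divided by the total number of pairs across all sequences.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not taken from this proteome.
freqs = dipeptide_permille(["MKKLA", "AAXK"])
```

The returned dictionary always has 441 keys, so dipeptides that never occur appear with a frequency of 0.0, as in the table.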

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
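The unit conversion and the comparison against the uniform expectation can be sketched as follows. This is only an illustration: it assumes that "expected" means the uniform value of 1/400 (2.5‰, or about 2.268‰ if Xaa is included), and the helper names are hypothetical.

```python
UNIFORM_20 = 1000.0 / 20 ** 2  # 2.5 per mille, i.e. 0.25%
UNIFORM_21 = 1000.0 / 21 ** 2  # about 2.268 per mille when Xaa is included


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def representation(value, expected=UNIFORM_20):
    """Classify a per mille frequency against the assumed uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Two values from the table: AlaPhe (8.167) and AlaPro (0.907).
print(representation(8.167), permille_to_fraction(8.167))
print(representation(0.907), permille_to_fraction(0.907))
```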

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.537 ± 2.499
AlaCys: 3.63 ± 0.633
AlaAsp: 0.0 ± 0.0
AlaGlu: 3.63 ± 0.633
AlaPhe: 8.167 ± 0.766
AlaGly: 7.26 ± 2.682
AlaHis: 0.0 ± 0.0
AlaIle: 3.63 ± 0.683
AlaLys: 1.815 ± 1.0
AlaLeu: 5.445 ± 2.999
AlaMet: 3.63 ± 1.949
AlaAsn: 1.815 ± 1.0
AlaPro: 0.907 ± 0.5
AlaGln: 1.815 ± 1.0
AlaArg: 3.63 ± 0.683
AlaSer: 4.537 ± 2.499
AlaThr: 2.722 ± 1.499
AlaVal: 0.907 ± 0.5
AlaTrp: 1.815 ± 0.316
AlaTyr: 4.537 ± 2.499
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.907 ± 0.5
CysCys: 0.0 ± 0.0
CysAsp: 0.907 ± 0.816
CysGlu: 2.722 ± 1.499
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.907 ± 0.816
CysLys: 0.0 ± 0.0
CysLeu: 1.815 ± 1.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.907 ± 0.5
CysArg: 0.0 ± 0.0
CysSer: 1.815 ± 1.0
CysThr: 0.0 ± 0.0
CysVal: 0.907 ± 0.816
CysTrp: 0.907 ± 0.816
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.722 ± 1.133
AspCys: 0.907 ± 0.5
AspAsp: 2.722 ± 0.183
AspGlu: 1.815 ± 1.0
AspPhe: 2.722 ± 1.133
AspGly: 2.722 ± 2.449
AspHis: 0.0 ± 0.0
AspIle: 2.722 ± 0.183
AspLys: 2.722 ± 2.449
AspLeu: 5.445 ± 0.367
AspMet: 0.0 ± 0.0
AspAsn: 1.815 ± 0.316
AspPro: 2.722 ± 1.133
AspGln: 2.722 ± 1.133
AspArg: 1.815 ± 0.316
AspSer: 3.63 ± 1.949
AspThr: 2.722 ± 0.183
AspVal: 1.815 ± 0.316
AspTrp: 1.815 ± 1.633
AspTyr: 0.907 ± 0.816
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.26 ± 2.682
GluCys: 0.0 ± 0.0
GluAsp: 2.722 ± 1.133
GluGlu: 9.074 ± 2.366
GluPhe: 1.815 ± 1.633
GluGly: 2.722 ± 1.499
GluHis: 0.907 ± 0.5
GluIle: 0.907 ± 0.816
GluLys: 6.352 ± 1.766
GluLeu: 10.889 ± 3.215
GluMet: 0.0 ± 0.535
GluAsn: 2.722 ± 1.499
GluPro: 2.722 ± 1.133
GluGln: 3.63 ± 0.633
GluArg: 6.352 ± 0.866
GluSer: 7.26 ± 3.998
GluThr: 5.445 ± 1.683
GluVal: 2.722 ± 1.499
GluTrp: 1.815 ± 0.316
GluTyr: 3.63 ± 1.949
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.815 ± 0.316
PheCys: 0.0 ± 0.0
PheAsp: 0.907 ± 0.5
PheGlu: 1.815 ± 1.0
PhePhe: 1.815 ± 0.316
PheGly: 2.722 ± 2.449
PheHis: 0.0 ± 0.0
PheIle: 2.722 ± 2.449
PheLys: 0.907 ± 0.5
PheLeu: 5.445 ± 0.949
PheMet: 2.722 ± 2.449
PheAsn: 0.907 ± 0.5
PhePro: 1.815 ± 0.316
PheGln: 2.722 ± 1.133
PheArg: 4.537 ± 4.081
PheSer: 2.722 ± 0.183
PheThr: 1.815 ± 0.316
PheVal: 0.907 ± 0.5
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.815 ± 1.0
GlyCys: 1.815 ± 0.316
GlyAsp: 4.537 ± 1.449
GlyGlu: 4.537 ± 1.449
GlyPhe: 4.537 ± 2.765
GlyGly: 2.722 ± 1.133
GlyHis: 0.907 ± 0.816
GlyIle: 1.815 ± 1.0
GlyLys: 4.537 ± 2.499
GlyLeu: 8.167 ± 0.55
GlyMet: 0.907 ± 0.816
GlyAsn: 1.815 ± 1.0
GlyPro: 0.0 ± 0.0
GlyGln: 4.537 ± 1.183
GlyArg: 3.63 ± 0.683
GlySer: 1.815 ± 1.0
GlyThr: 0.907 ± 0.816
GlyVal: 4.537 ± 2.499
GlyTrp: 3.63 ± 3.265
GlyTyr: 2.722 ± 1.499
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.815 ± 0.316
HisCys: 0.0 ± 0.0
HisAsp: 0.907 ± 0.5
HisGlu: 0.907 ± 0.816
HisPhe: 0.0 ± 0.0
HisGly: 0.907 ± 0.5
HisHis: 1.815 ± 1.633
HisIle: 1.815 ± 1.0
HisLys: 1.815 ± 1.0
HisLeu: 2.722 ± 0.183
HisMet: 0.0 ± 0.0
HisAsn: 0.907 ± 0.816
HisPro: 0.907 ± 0.5
HisGln: 0.0 ± 0.0
HisArg: 0.907 ± 0.5
HisSer: 0.907 ± 0.816
HisThr: 2.722 ± 2.449
HisVal: 2.722 ± 1.133
HisTrp: 0.0 ± 0.0
HisTyr: 0.907 ± 0.816
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.722 ± 1.499
IleCys: 0.907 ± 0.5
IleAsp: 2.722 ± 1.499
IleGlu: 4.537 ± 1.449
IlePhe: 1.815 ± 1.633
IleGly: 3.63 ± 0.683
IleHis: 2.722 ± 1.133
IleIle: 1.815 ± 1.0
IleLys: 2.722 ± 1.133
IleLeu: 6.352 ± 1.766
IleMet: 0.0 ± 0.0
IleAsn: 0.0 ± 0.0
IlePro: 2.722 ± 0.183
IleGln: 1.815 ± 1.633
IleArg: 2.722 ± 0.183
IleSer: 4.537 ± 1.183
IleThr: 0.907 ± 0.5
IleVal: 2.722 ± 0.183
IleTrp: 1.815 ± 1.633
IleTyr: 1.815 ± 1.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.445 ± 2.999
LysCys: 0.0 ± 0.0
LysAsp: 2.722 ± 1.133
LysGlu: 6.352 ± 1.766
LysPhe: 0.907 ± 0.5
LysGly: 0.907 ± 0.816
LysHis: 2.722 ± 0.183
LysIle: 3.63 ± 0.683
LysLys: 6.352 ± 0.45
LysLeu: 1.815 ± 1.0
LysMet: 0.907 ± 0.5
LysAsn: 5.445 ± 0.367
LysPro: 1.815 ± 0.316
LysGln: 5.445 ± 1.683
LysArg: 4.537 ± 1.183
LysSer: 4.537 ± 2.765
LysThr: 1.815 ± 0.316
LysVal: 2.722 ± 0.183
LysTrp: 0.907 ± 0.5
LysTyr: 0.907 ± 0.816
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.352 ± 2.183
LeuCys: 0.0 ± 0.0
LeuAsp: 2.722 ± 1.133
LeuGlu: 9.982 ± 0.233
LeuPhe: 3.63 ± 0.633
LeuGly: 5.445 ± 0.949
LeuHis: 3.63 ± 0.633
LeuIle: 7.26 ± 1.366
LeuLys: 6.352 ± 0.866
LeuLeu: 7.26 ± 1.266
LeuMet: 4.537 ± 1.449
LeuAsn: 0.0 ± 0.0
LeuPro: 5.445 ± 2.266
LeuGln: 5.445 ± 2.999
LeuArg: 9.982 ± 1.55
LeuSer: 3.63 ± 0.633
LeuThr: 3.63 ± 1.999
LeuVal: 6.352 ± 0.866
LeuTrp: 3.63 ± 0.683
LeuTyr: 1.815 ± 1.633
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.815 ± 0.316
MetCys: 0.0 ± 0.0
MetAsp: 1.815 ± 1.0
MetGlu: 1.815 ± 1.0
MetPhe: 0.0 ± 0.0
MetGly: 1.815 ± 0.316
MetHis: 0.0 ± 0.0
MetIle: 0.907 ± 0.816
MetLys: 2.722 ± 1.133
MetLeu: 0.907 ± 0.5
MetMet: 0.907 ± 0.5
MetAsn: 0.0 ± 0.0
MetPro: 3.63 ± 1.949
MetGln: 2.722 ± 1.133
MetArg: 0.907 ± 0.5
MetSer: 0.907 ± 0.5
MetThr: 1.815 ± 0.316
MetVal: 4.537 ± 0.133
MetTrp: 0.0 ± 0.0
MetTyr: 2.722 ± 0.183
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.722 ± 1.499
AsnCys: 0.907 ± 0.5
AsnAsp: 0.0 ± 0.0
AsnGlu: 2.722 ± 0.183
AsnPhe: 0.0 ± 0.0
AsnGly: 0.0 ± 0.0
AsnHis: 1.815 ± 0.316
AsnIle: 0.907 ± 0.816
AsnLys: 2.722 ± 1.499
AsnLeu: 2.722 ± 0.183
AsnMet: 1.815 ± 1.0
AsnAsn: 0.907 ± 0.816
AsnPro: 0.907 ± 0.5
AsnGln: 1.815 ± 0.316
AsnArg: 2.722 ± 1.133
AsnSer: 4.537 ± 0.133
AsnThr: 0.907 ± 0.816
AsnVal: 4.537 ± 2.499
AsnTrp: 0.907 ± 0.5
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.722 ± 1.499
ProCys: 0.0 ± 0.0
ProAsp: 2.722 ± 1.133
ProGlu: 6.352 ± 3.082
ProPhe: 0.907 ± 0.816
ProGly: 1.815 ± 1.633
ProHis: 0.907 ± 0.816
ProIle: 1.815 ± 0.316
ProLys: 0.907 ± 0.816
ProLeu: 6.352 ± 0.866
ProMet: 1.815 ± 1.0
ProAsn: 0.907 ± 0.816
ProPro: 1.815 ± 0.316
ProGln: 3.63 ± 0.633
ProArg: 1.815 ± 1.0
ProSer: 4.537 ± 1.183
ProThr: 0.907 ± 0.5
ProVal: 3.63 ± 0.633
ProTrp: 0.0 ± 0.0
ProTyr: 1.815 ± 1.633
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.722 ± 0.183
GlnCys: 0.0 ± 0.0
GlnAsp: 3.63 ± 1.949
GlnGlu: 2.722 ± 0.183
GlnPhe: 0.907 ± 0.5
GlnGly: 2.722 ± 1.499
GlnHis: 1.815 ± 1.0
GlnIle: 4.537 ± 1.449
GlnLys: 2.722 ± 1.499
GlnLeu: 4.537 ± 1.183
GlnMet: 1.815 ± 0.654
GlnAsn: 4.537 ± 1.183
GlnPro: 0.907 ± 0.816
GlnGln: 1.815 ± 1.0
GlnArg: 4.537 ± 0.133
GlnSer: 2.722 ± 1.499
GlnThr: 0.907 ± 0.816
GlnVal: 4.537 ± 1.183
GlnTrp: 0.907 ± 0.5
GlnTyr: 2.722 ± 0.183
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.537 ± 1.183
ArgCys: 0.0 ± 0.0
ArgAsp: 2.722 ± 1.133
ArgGlu: 3.63 ± 1.999
ArgPhe: 2.722 ± 2.449
ArgGly: 4.537 ± 2.499
ArgHis: 0.907 ± 0.5
ArgIle: 4.537 ± 1.183
ArgLys: 3.63 ± 1.999
ArgLeu: 8.167 ± 2.082
ArgMet: 0.907 ± 0.5
ArgAsn: 2.722 ± 1.499
ArgPro: 2.722 ± 1.499
ArgGln: 1.815 ± 0.316
ArgArg: 5.445 ± 2.999
ArgSer: 2.722 ± 0.183
ArgThr: 2.722 ± 1.499
ArgVal: 9.982 ± 3.715
ArgTrp: 1.815 ± 1.633
ArgTyr: 3.63 ± 1.949
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.63 ± 0.683
SerCys: 1.815 ± 1.0
SerAsp: 1.815 ± 0.316
SerGlu: 2.722 ± 0.183
SerPhe: 1.815 ± 0.316
SerGly: 8.167 ± 0.766
SerHis: 1.815 ± 0.316
SerIle: 1.815 ± 0.316
SerLys: 3.63 ± 0.683
SerLeu: 8.167 ± 0.766
SerMet: 3.63 ± 1.999
SerAsn: 2.722 ± 0.183
SerPro: 1.815 ± 0.316
SerGln: 0.907 ± 0.5
SerArg: 7.26 ± 1.366
SerSer: 7.26 ± 1.366
SerThr: 6.352 ± 3.499
SerVal: 5.445 ± 0.949
SerTrp: 1.815 ± 0.316
SerTyr: 2.722 ± 1.133
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.815 ± 0.316
ThrCys: 0.907 ± 0.5
ThrAsp: 0.907 ± 0.816
ThrGlu: 3.63 ± 0.683
ThrPhe: 1.815 ± 1.0
ThrGly: 2.722 ± 0.183
ThrHis: 0.907 ± 0.5
ThrIle: 1.815 ± 1.633
ThrLys: 3.63 ± 0.633
ThrLeu: 2.722 ± 1.499
ThrMet: 0.0 ± 0.0
ThrAsn: 1.815 ± 1.0
ThrPro: 5.445 ± 0.367
ThrGln: 3.63 ± 1.999
ThrArg: 3.63 ± 0.683
ThrSer: 4.537 ± 1.183
ThrThr: 0.907 ± 0.816
ThrVal: 1.815 ± 0.316
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.722 ± 0.183
ValCys: 0.0 ± 0.0
ValAsp: 3.63 ± 1.949
ValGlu: 7.26 ± 2.682
ValPhe: 1.815 ± 1.633
ValGly: 3.63 ± 0.683
ValHis: 0.0 ± 0.0
ValIle: 2.722 ± 1.499
ValLys: 3.63 ± 1.949
ValLeu: 3.63 ± 1.999
ValMet: 2.722 ± 1.499
ValAsn: 2.722 ± 0.183
ValPro: 5.445 ± 0.367
ValGln: 5.445 ± 1.683
ValArg: 4.537 ± 2.765
ValSer: 9.074 ± 2.899
ValThr: 4.537 ± 1.183
ValVal: 5.445 ± 0.367
ValTrp: 0.907 ± 0.5
ValTyr: 0.907 ± 0.5
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.63 ± 0.633
TrpCys: 0.0 ± 0.0
TrpAsp: 4.537 ± 2.765
TrpGlu: 1.815 ± 0.316
TrpPhe: 0.0 ± 0.0
TrpGly: 1.815 ± 1.633
TrpHis: 0.907 ± 0.816
TrpIle: 0.907 ± 0.5
TrpLys: 0.907 ± 0.816
TrpLeu: 2.722 ± 0.183
TrpMet: 0.907 ± 0.816
TrpAsn: 0.907 ± 0.816
TrpPro: 1.815 ± 0.316
TrpGln: 0.907 ± 0.5
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 0.907 ± 0.816
TrpVal: 1.815 ± 1.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.722 ± 0.183
TyrCys: 0.907 ± 0.816
TyrAsp: 1.815 ± 0.316
TyrGlu: 1.815 ± 0.316
TyrPhe: 0.907 ± 0.5
TyrGly: 3.63 ± 1.999
TyrHis: 0.907 ± 0.816
TyrIle: 1.815 ± 1.633
TyrLys: 1.815 ± 1.0
TyrLeu: 1.815 ± 0.316
TyrMet: 1.815 ± 0.316
TyrAsn: 0.907 ± 0.816
TyrPro: 1.815 ± 1.633
TyrGln: 0.0 ± 0.0
TyrArg: 0.907 ± 0.5
TyrSer: 3.63 ± 0.633
TyrThr: 0.0 ± 0.0
TyrVal: 2.722 ± 1.133
TyrTrp: 1.815 ± 0.316
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1103 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
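A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration, not the database's actual code: it resamples whole proteins with replacement 100 times, recomputes the per mille frequency of one dipeptide each time, and reports the standard deviation of those estimates as the error. The helper names, the use of the standard deviation, and the fixed seed are all assumptions.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Per mille frequency of one dipeptide (one-letter codes, overlapping pairs)."""
    total = sum(max(len(s) - 1, 0) for s in sequences)
    hits = sum(
        1 for s in sequences for i in range(len(s) - 1) if s[i:i + 2] == pair
    )
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the statistic over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteins, not taken from this proteome.
toy = ["MKKLAAK", "AAGLLKK"]
print(pair_permille(toy, "KK"), bootstrap_error(toy, "KK"))
```

With only two proteins, as on this page, each resample can contain the first protein twice, the second twice, or one of each, so the error mostly reflects how much the two proteins differ from each other.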

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski