Amino acid dipeptide frequency for Hubei sobemo-like virus 16

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
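
The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's actual pipeline: the function names `dipeptide_permille` and `classify` are hypothetical, dipeptides are assumed to be counted as overlapping adjacent pairs within each protein, and any letter outside the 20 standard amino acids is mapped to Xaa. The red/blue marking is approximated by comparing each value with the uniform expectation of 1/400 (2.5‰).

```python
from collections import Counter
from itertools import product

# Three-letter codes in the column order used by the table; Xaa is the catch-all.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}
RESIDUES = list(ONE_TO_THREE.values()) + ["Xaa"]


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 residue pairs."""
    counts = Counter()
    for seq in proteins:
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Assumption: overlapping adjacent pairs, counted within each protein.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(RESIDUES, repeat=2)
    }


def classify(value_permille, expected_permille=1000.0 / 400):
    """Label a value relative to the uniform expectation (2.5 per mille)."""
    if value_permille > expected_permille:
        return "over"
    if value_permille < expected_permille:
        return "under"
    return "expected"


# Toy sequences for illustration only, not the viral proteins.
freqs = dipeptide_permille(["MKALLV", "AAXK"])
print(len(freqs), round(freqs["AlaLeu"], 1), classify(freqs["AlaLeu"]))
# prints: 441 125.0 over
```

With this convention, a table value such as 9.828‰ would be labelled "over" and 2.457‰ would be labelled "under".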

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is frequency ± error (‰).

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
| Ala | 2.457 ± 1.329 | 0.0 ± 0.0 | 6.143 ± 0.548 | 2.457 ± 0.606 | 0.0 ± 0.0 | 4.914 ± 0.723 | 0.0 ± 0.0 | 2.457 ± 0.606 | 7.371 ± 1.819 | 6.143 ± 0.548 | 3.686 ± 0.058 | 2.457 ± 1.329 | 4.914 ± 2.658 | 3.686 ± 1.993 | 3.686 ± 0.058 | 9.828 ± 5.315 | 2.457 ± 1.329 | 3.686 ± 1.877 | 1.229 ± 1.271 | 1.229 ± 1.271 | 0.0 ± 0.0 |
| Cys | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.229 ± 1.271 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.229 ± 0.664 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.229 ± 0.664 | 1.229 ± 1.271 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.229 ± 0.664 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.686 ± 1.877 | 0.0 ± 0.0 | 1.229 ± 1.271 | 0.0 ± 0.0 |
| Asp | 3.686 ± 0.058 | 1.229 ± 1.271 | 2.457 ± 0.606 | 3.686 ± 0.058 | 6.143 ± 1.387 | 1.229 ± 0.664 | 0.0 ± 0.0 | 2.457 ± 1.329 | 3.686 ± 1.877 | 7.371 ± 2.052 | 0.0 ± 0.0 | 3.686 ± 1.877 | 3.686 ± 0.058 | 1.229 ± 1.271 | 2.457 ± 0.606 | 2.457 ± 0.606 | 2.457 ± 2.541 | 4.914 ± 0.723 | 3.686 ± 0.058 | 3.686 ± 1.877 | 0.0 ± 0.0 |
| Glu | 4.914 ± 0.723 | 0.0 ± 0.0 | 9.828 ± 1.445 | 3.686 ± 1.993 | 0.0 ± 0.0 | 2.457 ± 0.606 | 0.0 ± 0.0 | 2.457 ± 1.329 | 1.229 ± 0.664 | 6.143 ± 1.387 | 1.229 ± 0.664 | 2.457 ± 0.606 | 2.457 ± 0.606 | 4.914 ± 0.723 | 1.229 ± 0.664 | 1.229 ± 0.664 | 6.143 ± 2.483 | 3.686 ± 1.993 | 0.0 ± 0.0 | 3.686 ± 1.993 | 0.0 ± 0.0 |
| Phe | 1.229 ± 0.664 | 0.0 ± 0.0 | 2.457 ± 0.606 | 0.0 ± 0.0 | 0.0 ± 0.0 | 6.143 ± 0.548 | 1.229 ± 1.271 | 1.229 ± 1.271 | 1.229 ± 0.664 | 3.686 ± 0.058 | 0.0 ± 0.0 | 1.229 ± 1.271 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.686 ± 0.058 | 2.457 ± 1.329 | 1.229 ± 1.271 | 2.457 ± 0.606 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Gly | 2.457 ± 0.606 | 1.229 ± 1.271 | 3.686 ± 0.058 | 3.686 ± 1.993 | 3.686 ± 1.993 | 4.914 ± 1.212 | 1.229 ± 1.271 | 4.914 ± 1.212 | 2.457 ± 1.329 | 3.686 ± 1.993 | 0.0 ± 0.0 | 4.914 ± 1.212 | 1.229 ± 1.271 | 4.914 ± 1.212 | 2.457 ± 1.329 | 3.686 ± 1.993 | 4.914 ± 0.723 | 7.371 ± 0.116 | 1.229 ± 1.271 | 4.914 ± 3.147 | 0.0 ± 0.0 |
| His | 3.686 ± 0.058 | 1.229 ± 1.271 | 1.229 ± 1.271 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.229 ± 1.271 | 0.0 ± 0.0 | 3.686 ± 0.058 | 0.0 ± 0.0 | 3.686 ± 1.877 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.686 ± 1.993 | 2.457 ± 0.606 | 2.457 ± 0.606 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.457 ± 1.329 | 0.0 ± 0.0 |
| Ile | 4.914 ± 1.212 | 0.0 ± 0.0 | 2.457 ± 0.606 | 1.229 ± 0.664 | 1.229 ± 0.664 | 4.914 ± 0.723 | 6.143 ± 2.483 | 3.686 ± 1.877 | 2.457 ± 0.606 | 6.143 ± 0.548 | 2.457 ± 0.606 | 1.229 ± 0.664 | 1.229 ± 0.664 | 3.686 ± 0.058 | 2.457 ± 0.606 | 4.914 ± 0.723 | 0.0 ± 0.0 | 2.457 ± 1.329 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Lys | 8.6 ± 0.781 | 1.229 ± 0.664 | 1.229 ± 0.664 | 2.457 ± 1.329 | 2.457 ± 2.541 | 6.143 ± 1.387 | 1.229 ± 1.271 | 2.457 ± 0.606 | 6.143 ± 1.387 | 6.143 ± 1.387 | 0.0 ± 0.0 | 2.457 ± 1.329 | 0.0 ± 0.0 | 3.686 ± 0.058 | 2.457 ± 1.329 | 4.914 ± 1.212 | 2.457 ± 1.329 | 2.457 ± 0.606 | 0.0 ± 0.0 | 3.686 ± 0.058 | 0.0 ± 0.0 |
| Leu | 6.143 ± 0.548 | 0.0 ± 0.0 | 1.229 ± 1.271 | 7.371 ± 0.116 | 6.143 ± 2.483 | 7.371 ± 3.754 | 2.457 ± 0.606 | 3.686 ± 0.058 | 4.914 ± 0.723 | 8.6 ± 0.781 | 2.457 ± 0.606 | 4.914 ± 0.723 | 7.371 ± 0.116 | 2.457 ± 0.606 | 6.143 ± 0.548 | 6.143 ± 3.322 | 3.686 ± 1.993 | 9.828 ± 0.49 | 1.229 ± 0.664 | 6.143 ± 1.387 | 0.0 ± 0.0 |
| Met | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.229 ± 0.664 | 1.229 ± 0.664 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.229 ± 1.271 | 1.229 ± 1.271 | 3.686 ± 1.877 | 1.229 ± 1.271 | 0.0 ± 0.0 | 2.457 ± 1.329 | 2.457 ± 0.606 | 4.914 ± 0.723 | 3.686 ± 0.058 | 1.229 ± 0.664 | 1.229 ± 1.271 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asn | 2.457 ± 1.329 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.229 ± 1.271 | 2.457 ± 1.329 | 1.229 ± 0.664 | 1.229 ± 1.271 | 2.457 ± 1.329 | 1.229 ± 0.664 | 6.143 ± 2.483 | 0.0 ± 0.0 | 1.229 ± 0.664 | 1.229 ± 0.664 | 3.686 ± 0.058 | 1.229 ± 0.664 | 4.914 ± 0.723 | 1.229 ± 1.271 | 3.686 ± 1.993 | 1.229 ± 1.271 | 1.229 ± 1.271 | 0.0 ± 0.0 |
| Pro | 2.457 ± 1.329 | 0.0 ± 0.0 | 1.229 ± 1.271 | 6.143 ± 2.483 | 0.0 ± 0.0 | 3.686 ± 0.058 | 2.457 ± 0.606 | 0.0 ± 0.0 | 2.457 ± 1.329 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.457 ± 0.606 | 2.457 ± 1.329 | 1.229 ± 1.271 | 2.457 ± 1.329 | 9.828 ± 1.445 | 2.457 ± 1.329 | 2.457 ± 1.329 | 1.229 ± 1.271 | 2.457 ± 2.541 | 0.0 ± 0.0 |
| Gln | 3.686 ± 1.993 | 1.229 ± 1.271 | 4.914 ± 1.212 | 4.914 ± 2.658 | 1.229 ± 1.271 | 4.914 ± 1.212 | 0.0 ± 0.0 | 1.229 ± 0.664 | 1.229 ± 0.664 | 8.6 ± 2.716 | 0.0 ± 0.0 | 2.457 ± 1.329 | 2.457 ± 0.606 | 3.686 ± 0.058 | 2.457 ± 2.541 | 0.0 ± 0.0 | 2.457 ± 0.606 | 2.457 ± 1.329 | 1.229 ± 0.664 | 2.457 ± 2.541 | 0.0 ± 0.0 |
| Arg | 1.229 ± 0.664 | 0.0 ± 0.0 | 2.457 ± 1.329 | 6.143 ± 3.322 | 1.229 ± 1.271 | 1.229 ± 0.664 | 1.229 ± 0.664 | 3.686 ± 0.058 | 2.457 ± 1.329 | 6.143 ± 2.483 | 1.229 ± 1.271 | 2.457 ± 0.606 | 2.457 ± 0.606 | 0.0 ± 0.0 | 2.457 ± 0.606 | 3.686 ± 1.877 | 3.686 ± 0.058 | 4.914 ± 1.212 | 1.229 ± 0.664 | 1.229 ± 1.271 | 0.0 ± 0.0 |
| Ser | 3.686 ± 1.877 | 1.229 ± 0.664 | 3.686 ± 1.993 | 1.229 ± 0.664 | 0.0 ± 0.0 | 3.686 ± 0.058 | 2.457 ± 1.329 | 7.371 ± 0.116 | 8.6 ± 0.781 | 3.686 ± 1.993 | 3.686 ± 2.26 | 4.914 ± 0.723 | 3.686 ± 0.058 | 1.229 ± 0.664 | 2.457 ± 0.606 | 4.914 ± 2.658 | 6.143 ± 3.322 | 8.6 ± 0.781 | 1.229 ± 0.664 | 3.686 ± 0.058 | 0.0 ± 0.0 |
| Thr | 4.914 ± 0.723 | 0.0 ± 0.0 | 1.229 ± 0.664 | 1.229 ± 0.664 | 0.0 ± 0.0 | 2.457 ± 1.329 | 1.229 ± 0.664 | 4.914 ± 1.212 | 4.914 ± 0.723 | 4.914 ± 1.212 | 1.229 ± 1.271 | 0.0 ± 0.0 | 2.457 ± 0.606 | 4.914 ± 0.723 | 1.229 ± 0.664 | 1.229 ± 0.664 | 1.229 ± 0.664 | 6.143 ± 0.548 | 0.0 ± 0.0 | 1.229 ± 1.271 | 0.0 ± 0.0 |
| Val | 9.828 ± 1.445 | 2.457 ± 1.329 | 8.6 ± 3.089 | 4.914 ± 1.212 | 2.457 ± 0.606 | 8.6 ± 1.154 | 2.457 ± 1.329 | 2.457 ± 0.606 | 4.914 ± 0.723 | 3.686 ± 0.058 | 4.914 ± 2.078 | 1.229 ± 0.664 | 3.686 ± 0.058 | 1.229 ± 1.271 | 1.229 ± 0.664 | 3.686 ± 0.058 | 1.229 ± 0.664 | 8.6 ± 4.651 | 0.0 ± 0.0 | 3.686 ± 1.993 | 0.0 ± 0.0 |
| Trp | 0.0 ± 0.0 | 1.229 ± 1.271 | 1.229 ± 1.271 | 2.457 ± 1.329 | 0.0 ± 0.0 | 1.229 ± 0.664 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.457 ± 1.329 | 1.229 ± 0.664 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.229 ± 1.271 | 2.457 ± 2.541 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.229 ± 1.271 | 0.0 ± 0.0 |
| Tyr | 2.457 ± 0.606 | 0.0 ± 0.0 | 3.686 ± 0.058 | 3.686 ± 0.058 | 1.229 ± 1.271 | 0.0 ± 0.0 | 2.457 ± 1.329 | 1.229 ± 0.664 | 2.457 ± 0.606 | 6.143 ± 2.483 | 1.229 ± 0.664 | 0.0 ± 0.0 | 3.686 ± 3.812 | 3.686 ± 0.058 | 3.686 ± 3.812 | 4.914 ± 1.212 | 0.0 ± 0.0 | 2.457 ± 1.329 | 1.229 ± 1.271 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 2 proteins (815 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
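
To make the note above concrete, the following is a minimal sketch of a protein-level bootstrap. It rests on assumptions not stated on this page: whole proteins are resampled with replacement, the proteome size is kept fixed in each replicate, and the reported error is taken to be the standard deviation across replicates. The function names `dipeptide_freq` and `bootstrap_error` are hypothetical, and sequences use one-letter codes.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, pair):
    """Frequency (per mille) of one dipeptide, given one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences for illustration only, not the viral proteins.
toy = ["MKALLV", "AAAK"]
print(dipeptide_freq(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only two proteins, as in this proteome, each replicate can contain the first protein twice, the second twice, or one of each, so the estimated errors are necessarily coarse.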

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski