Amino acid dipeptide frequency for Hubei sobemo-like virus 27

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
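As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs with a sliding window and reports them in per mille. It is a minimal example under stated assumptions, not the database's own code: the function name `dipeptide_frequencies`, the toy sequences, and the choice not to count pairs across protein boundaries are all hypothetical, and the page does not state which counting convention Proteome-pI actually uses.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences.

    Assumptions of this sketch: dipeptides are counted with a sliding
    window of width 2 inside each protein, no pair is formed across
    protein boundaries, and any letter outside the 20 standard ones is
    mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides:
# 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400

# Hypothetical toy sequences, not taken from this proteome.
freqs = dipeptide_frequencies(["MKVLAAGLV", "MAVLKK"])
```

Here `freqs["VL"]` is the per mille share of Val-Leu pairs among all pairs counted, which can then be compared with `UNIFORM_PER_MILLE`.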

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. On this scale, the 0.25% uniform expectation corresponds to 2.5 ‰.
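The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and the threshold of 2.5 ‰ assumes that "expected" means the uniform 1/400 frequency described above; the page does not spell out the exact rule behind its colouring.

```python
EXPECTED_PER_MILLE = 2.5  # assumed: 0.25% uniform expectation over 400 dipeptides


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


def representation(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a dipeptide relative to the assumed uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Two values from the table: AlaThr (8.604) and AlaCys (0.956).
ala_thr = representation(8.604)
ala_cys = representation(0.956)
```

With these definitions AlaThr falls on the overrepresented side and AlaCys on the underrepresented side of the assumed threshold.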

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.824AlaAla: 3.824 ± 0.602
0.956AlaCys: 0.956 ± 0.502
1.912AlaAsp: 1.912 ± 0.402
2.868AlaGlu: 2.868 ± 0.1
3.824AlaPhe: 3.824 ± 0.602
4.78AlaGly: 4.78 ± 0.302
2.868AlaHis: 2.868 ± 1.506
1.912AlaIle: 1.912 ± 0.402
6.692AlaLys: 6.692 ± 2.108
4.78AlaLeu: 4.78 ± 2.509
1.912AlaMet: 1.912 ± 0.402
3.824AlaAsn: 3.824 ± 0.602
0.956AlaPro: 0.956 ± 0.502
0.0AlaGln: 0.0 ± 0.0
1.912AlaArg: 1.912 ± 1.807
2.868AlaSer: 2.868 ± 1.506
8.604AlaThr: 8.604 ± 1.706
7.648AlaVal: 7.648 ± 1.204
0.0AlaTrp: 0.0 ± 0.0
2.868AlaTyr: 2.868 ± 1.506
0.0AlaXaa: 0.0 ± 0.0
Cys
0.956CysAla: 0.956 ± 0.502
0.0CysCys: 0.0 ± 0.0
0.956CysAsp: 0.956 ± 0.904
1.912CysGlu: 1.912 ± 1.004
0.0CysPhe: 0.0 ± 0.0
1.912CysGly: 1.912 ± 0.402
0.956CysHis: 0.956 ± 0.502
0.956CysIle: 0.956 ± 0.502
0.956CysLys: 0.956 ± 0.502
0.956CysLeu: 0.956 ± 0.904
0.0CysMet: 0.0 ± 0.0
0.956CysAsn: 0.956 ± 0.904
0.956CysPro: 0.956 ± 0.502
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.912CysSer: 1.912 ± 0.402
0.0CysThr: 0.0 ± 0.0
0.956CysVal: 0.956 ± 0.904
0.956CysTrp: 0.956 ± 0.502
0.956CysTyr: 0.956 ± 0.502
0.0CysXaa: 0.0 ± 0.0
Asp
1.912AspAla: 1.912 ± 1.004
1.912AspCys: 1.912 ± 0.402
4.78AspAsp: 4.78 ± 3.113
3.824AspGlu: 3.824 ± 0.602
1.912AspPhe: 1.912 ± 1.807
3.824AspGly: 3.824 ± 0.804
4.78AspHis: 4.78 ± 1.707
0.956AspIle: 0.956 ± 0.904
0.956AspLys: 0.956 ± 0.904
1.912AspLeu: 1.912 ± 0.402
0.956AspMet: 0.956 ± 0.502
2.868AspAsn: 2.868 ± 2.711
2.868AspPro: 2.868 ± 1.305
0.956AspGln: 0.956 ± 0.904
0.0AspArg: 0.0 ± 0.0
2.868AspSer: 2.868 ± 0.1
0.956AspThr: 0.956 ± 0.904
3.824AspVal: 3.824 ± 0.602
2.868AspTrp: 2.868 ± 0.1
3.824AspTyr: 3.824 ± 2.209
0.0AspXaa: 0.0 ± 0.0
Glu
5.736GluAla: 5.736 ± 1.205
0.956GluCys: 0.956 ± 0.502
1.912GluAsp: 1.912 ± 0.402
6.692GluGlu: 6.692 ± 0.703
0.956GluPhe: 0.956 ± 0.904
3.824GluGly: 3.824 ± 2.007
0.0GluHis: 0.0 ± 0.0
4.78GluIle: 4.78 ± 0.302
3.824GluLys: 3.824 ± 0.602
0.956GluLeu: 0.956 ± 0.502
1.912GluMet: 1.912 ± 1.004
4.78GluAsn: 4.78 ± 1.104
4.78GluPro: 4.78 ± 1.707
2.868GluGln: 2.868 ± 1.305
2.868GluArg: 2.868 ± 0.1
3.824GluSer: 3.824 ± 0.602
3.824GluThr: 3.824 ± 0.602
2.868GluVal: 2.868 ± 0.1
0.956GluTrp: 0.956 ± 0.502
1.912GluTyr: 1.912 ± 1.004
0.0GluXaa: 0.0 ± 0.0
Phe
1.912PheAla: 1.912 ± 0.402
1.912PheCys: 1.912 ± 1.004
1.912PheAsp: 1.912 ± 1.807
2.868PheGlu: 2.868 ± 1.305
1.912PhePhe: 1.912 ± 0.402
1.912PheGly: 1.912 ± 0.402
0.0PheHis: 0.0 ± 0.0
1.912PheIle: 1.912 ± 0.402
0.956PheLys: 0.956 ± 0.502
6.692PheLeu: 6.692 ± 2.109
0.0PheMet: 0.0 ± 0.0
1.912PheAsn: 1.912 ± 1.807
0.956PhePro: 0.956 ± 0.904
1.912PheGln: 1.912 ± 1.004
1.912PheArg: 1.912 ± 1.807
2.868PheSer: 2.868 ± 0.1
0.0PheThr: 0.0 ± 0.0
1.912PheVal: 1.912 ± 1.004
0.956PheTrp: 0.956 ± 0.502
0.956PheTyr: 0.956 ± 0.904
0.0PheXaa: 0.0 ± 0.0
Gly
3.824GlyAla: 3.824 ± 2.007
0.956GlyCys: 0.956 ± 0.904
2.868GlyAsp: 2.868 ± 1.305
1.912GlyGlu: 1.912 ± 0.402
0.0GlyPhe: 0.0 ± 0.0
3.824GlyGly: 3.824 ± 3.614
0.0GlyHis: 0.0 ± 0.0
4.78GlyIle: 4.78 ± 2.509
4.78GlyLys: 4.78 ± 1.104
6.692GlyLeu: 6.692 ± 0.702
0.956GlyMet: 0.956 ± 0.904
0.0GlyAsn: 0.0 ± 0.0
2.868GlyPro: 2.868 ± 1.506
4.78GlyGln: 4.78 ± 2.509
1.912GlyArg: 1.912 ± 1.004
3.824GlySer: 3.824 ± 2.007
1.912GlyThr: 1.912 ± 1.004
5.736GlyVal: 5.736 ± 1.205
4.78GlyTrp: 4.78 ± 3.113
2.868GlyTyr: 2.868 ± 0.1
0.0GlyXaa: 0.0 ± 0.0
His
0.956HisAla: 0.956 ± 0.904
0.0HisCys: 0.0 ± 0.0
0.956HisAsp: 0.956 ± 0.502
0.956HisGlu: 0.956 ± 0.904
1.912HisPhe: 1.912 ± 1.807
0.0HisGly: 0.0 ± 0.0
0.956HisHis: 0.956 ± 0.502
3.824HisIle: 3.824 ± 0.602
1.912HisLys: 1.912 ± 1.807
1.912HisLeu: 1.912 ± 1.807
1.912HisMet: 1.912 ± 1.807
0.956HisAsn: 0.956 ± 0.502
1.912HisPro: 1.912 ± 1.004
0.956HisGln: 0.956 ± 0.502
3.824HisArg: 3.824 ± 2.209
0.956HisSer: 0.956 ± 0.502
0.0HisThr: 0.0 ± 0.0
3.824HisVal: 3.824 ± 2.007
0.0HisTrp: 0.0 ± 0.0
0.956HisTyr: 0.956 ± 0.502
0.0HisXaa: 0.0 ± 0.0
Ile
4.78IleAla: 4.78 ± 1.104
0.0IleCys: 0.0 ± 0.0
4.78IleAsp: 4.78 ± 1.707
3.824IleGlu: 3.824 ± 0.602
2.868IlePhe: 2.868 ± 1.305
3.824IleGly: 3.824 ± 2.007
0.0IleHis: 0.0 ± 0.0
4.78IleIle: 4.78 ± 1.104
4.78IleLys: 4.78 ± 1.707
1.912IleLeu: 1.912 ± 1.004
1.912IleMet: 1.912 ± 1.807
1.912IleAsn: 1.912 ± 0.402
3.824IlePro: 3.824 ± 0.804
1.912IleGln: 1.912 ± 1.004
5.736IleArg: 5.736 ± 0.2
3.824IleSer: 3.824 ± 0.602
1.912IleThr: 1.912 ± 1.004
4.78IleVal: 4.78 ± 1.707
0.956IleTrp: 0.956 ± 0.904
0.956IleTyr: 0.956 ± 0.502
0.0IleXaa: 0.0 ± 0.0
Lys
1.912LysAla: 1.912 ± 0.402
1.912LysCys: 1.912 ± 0.402
0.956LysAsp: 0.956 ± 0.502
4.78LysGlu: 4.78 ± 1.707
1.912LysPhe: 1.912 ± 0.402
2.868LysGly: 2.868 ± 0.1
1.912LysHis: 1.912 ± 1.807
0.956LysIle: 0.956 ± 0.502
2.868LysLys: 2.868 ± 0.1
5.736LysLeu: 5.736 ± 1.205
0.956LysMet: 0.956 ± 0.502
2.868LysAsn: 2.868 ± 1.506
3.824LysPro: 3.824 ± 2.007
1.912LysGln: 1.912 ± 1.807
2.868LysArg: 2.868 ± 1.506
3.824LysSer: 3.824 ± 2.209
1.912LysThr: 1.912 ± 1.004
8.604LysVal: 8.604 ± 3.111
0.0LysTrp: 0.0 ± 0.0
0.956LysTyr: 0.956 ± 0.904
0.0LysXaa: 0.0 ± 0.0
Leu
6.692LeuAla: 6.692 ± 0.702
0.956LeuCys: 0.956 ± 0.904
3.824LeuAsp: 3.824 ± 0.804
6.692LeuGlu: 6.692 ± 0.702
6.692LeuPhe: 6.692 ± 4.92
5.736LeuGly: 5.736 ± 1.606
3.824LeuHis: 3.824 ± 2.209
3.824LeuIle: 3.824 ± 0.804
3.824LeuLys: 3.824 ± 0.804
11.472LeuLeu: 11.472 ± 3.816
2.868LeuMet: 2.868 ± 1.506
3.824LeuAsn: 3.824 ± 0.804
0.0LeuPro: 0.0 ± 0.0
2.868LeuGln: 2.868 ± 1.506
5.736LeuArg: 5.736 ± 1.205
6.692LeuSer: 6.692 ± 2.108
4.78LeuThr: 4.78 ± 1.104
3.824LeuVal: 3.824 ± 0.804
1.912LeuTrp: 1.912 ± 1.004
2.868LeuTyr: 2.868 ± 1.305
0.0LeuXaa: 0.0 ± 0.0
Met
2.868MetAla: 2.868 ± 1.506
0.0MetCys: 0.0 ± 0.0
0.956MetAsp: 0.956 ± 0.502
0.956MetGlu: 0.956 ± 0.502
0.0MetPhe: 0.0 ± 0.0
0.956MetGly: 0.956 ± 0.904
0.956MetHis: 0.956 ± 0.904
0.956MetIle: 0.956 ± 0.502
1.912MetLys: 1.912 ± 0.402
4.78MetLeu: 4.78 ± 1.707
0.0MetMet: 0.0 ± 0.0
4.78MetAsn: 4.78 ± 0.302
0.0MetPro: 0.0 ± 0.0
1.912MetGln: 1.912 ± 1.807
0.956MetArg: 0.956 ± 0.502
0.0MetSer: 0.0 ± 0.0
0.956MetThr: 0.956 ± 0.502
4.78MetVal: 4.78 ± 0.302
0.0MetTrp: 0.0 ± 0.0
1.912MetTyr: 1.912 ± 0.402
0.0MetXaa: 0.0 ± 0.0
Asn
3.824AsnAla: 3.824 ± 2.007
0.0AsnCys: 0.0 ± 0.0
2.868AsnAsp: 2.868 ± 1.305
1.912AsnGlu: 1.912 ± 0.402
1.912AsnPhe: 1.912 ± 1.004
0.956AsnGly: 0.956 ± 0.502
2.868AsnHis: 2.868 ± 0.1
3.824AsnIle: 3.824 ± 0.804
0.0AsnLys: 0.0 ± 0.0
3.824AsnLeu: 3.824 ± 0.602
1.912AsnMet: 1.912 ± 1.3
0.956AsnAsn: 0.956 ± 0.904
5.736AsnPro: 5.736 ± 1.205
0.956AsnGln: 0.956 ± 0.904
3.824AsnArg: 3.824 ± 0.602
5.736AsnSer: 5.736 ± 1.205
0.956AsnThr: 0.956 ± 0.904
4.78AsnVal: 4.78 ± 2.509
0.956AsnTrp: 0.956 ± 0.904
1.912AsnTyr: 1.912 ± 0.402
0.0AsnXaa: 0.0 ± 0.0
Pro
1.912ProAla: 1.912 ± 0.402
0.0ProCys: 0.0 ± 0.0
1.912ProAsp: 1.912 ± 0.402
4.78ProGlu: 4.78 ± 1.104
0.956ProPhe: 0.956 ± 0.502
4.78ProGly: 4.78 ± 1.104
1.912ProHis: 1.912 ± 0.402
2.868ProIle: 2.868 ± 0.1
1.912ProLys: 1.912 ± 1.004
2.868ProLeu: 2.868 ± 1.305
1.912ProMet: 1.912 ± 1.807
3.824ProAsn: 3.824 ± 0.602
4.78ProPro: 4.78 ± 2.509
1.912ProGln: 1.912 ± 1.004
1.912ProArg: 1.912 ± 1.004
5.736ProSer: 5.736 ± 0.2
0.956ProThr: 0.956 ± 0.904
3.824ProVal: 3.824 ± 0.602
0.0ProTrp: 0.0 ± 0.0
2.868ProTyr: 2.868 ± 1.305
0.0ProXaa: 0.0 ± 0.0
Gln
1.912GlnAla: 1.912 ± 0.402
0.956GlnCys: 0.956 ± 0.502
1.912GlnAsp: 1.912 ± 0.402
1.912GlnGlu: 1.912 ± 1.004
2.868GlnPhe: 2.868 ± 1.305
0.956GlnGly: 0.956 ± 0.502
0.956GlnHis: 0.956 ± 0.502
2.868GlnIle: 2.868 ± 2.711
0.956GlnLys: 0.956 ± 0.904
2.868GlnLeu: 2.868 ± 0.1
0.956GlnMet: 0.956 ± 0.502
1.912GlnAsn: 1.912 ± 1.004
2.868GlnPro: 2.868 ± 0.1
0.956GlnGln: 0.956 ± 0.502
3.824GlnArg: 3.824 ± 0.602
2.868GlnSer: 2.868 ± 1.506
0.956GlnThr: 0.956 ± 0.904
3.824GlnVal: 3.824 ± 0.804
0.0GlnTrp: 0.0 ± 0.0
0.956GlnTyr: 0.956 ± 0.502
0.0GlnXaa: 0.0 ± 0.0
Arg
1.912ArgAla: 1.912 ± 0.402
1.912ArgCys: 1.912 ± 1.004
1.912ArgAsp: 1.912 ± 1.807
0.0ArgGlu: 0.0 ± 0.0
0.956ArgPhe: 0.956 ± 0.502
3.824ArgGly: 3.824 ± 0.804
0.0ArgHis: 0.0 ± 0.0
5.736ArgIle: 5.736 ± 0.2
1.912ArgLys: 1.912 ± 1.004
10.516ArgLeu: 10.516 ± 4.318
1.912ArgMet: 1.912 ± 1.004
0.956ArgAsn: 0.956 ± 0.904
4.78ArgPro: 4.78 ± 2.509
0.0ArgGln: 0.0 ± 0.0
0.956ArgArg: 0.956 ± 0.502
4.78ArgSer: 4.78 ± 1.104
3.824ArgThr: 3.824 ± 0.602
4.78ArgVal: 4.78 ± 1.104
0.956ArgTrp: 0.956 ± 0.904
0.956ArgTyr: 0.956 ± 0.904
0.0ArgXaa: 0.0 ± 0.0
Ser
4.78SerAla: 4.78 ± 2.509
0.0SerCys: 0.0 ± 0.0
2.868SerAsp: 2.868 ± 0.1
0.956SerGlu: 0.956 ± 0.502
1.912SerPhe: 1.912 ± 1.004
4.78SerGly: 4.78 ± 0.302
2.868SerHis: 2.868 ± 1.506
5.736SerIle: 5.736 ± 0.2
2.868SerLys: 2.868 ± 1.305
7.648SerLeu: 7.648 ± 1.204
1.912SerMet: 1.912 ± 0.342
6.692SerAsn: 6.692 ± 2.108
4.78SerPro: 4.78 ± 0.302
2.868SerGln: 2.868 ± 0.1
3.824SerArg: 3.824 ± 0.602
7.648SerSer: 7.648 ± 1.607
5.736SerThr: 5.736 ± 3.011
2.868SerVal: 2.868 ± 1.305
1.912SerTrp: 1.912 ± 1.807
2.868SerTyr: 2.868 ± 0.1
0.0SerXaa: 0.0 ± 0.0
Thr
4.78ThrAla: 4.78 ± 1.104
0.956ThrCys: 0.956 ± 0.502
2.868ThrAsp: 2.868 ± 1.506
3.824ThrGlu: 3.824 ± 0.602
1.912ThrPhe: 1.912 ± 1.004
2.868ThrGly: 2.868 ± 1.506
0.956ThrHis: 0.956 ± 0.904
1.912ThrIle: 1.912 ± 0.402
2.868ThrLys: 2.868 ± 1.506
3.824ThrLeu: 3.824 ± 2.007
0.956ThrMet: 0.956 ± 0.904
2.868ThrAsn: 2.868 ± 0.1
0.956ThrPro: 0.956 ± 0.502
4.78ThrGln: 4.78 ± 0.302
0.956ThrArg: 0.956 ± 0.502
6.692ThrSer: 6.692 ± 2.108
0.956ThrThr: 0.956 ± 0.502
4.78ThrVal: 4.78 ± 0.302
0.0ThrTrp: 0.0 ± 0.0
1.912ThrTyr: 1.912 ± 1.004
0.0ThrXaa: 0.0 ± 0.0
Val
8.604ValAla: 8.604 ± 1.706
1.912ValCys: 1.912 ± 1.807
5.736ValAsp: 5.736 ± 1.205
6.692ValGlu: 6.692 ± 0.702
1.912ValPhe: 1.912 ± 1.004
4.78ValGly: 4.78 ± 1.104
0.956ValHis: 0.956 ± 0.502
4.78ValIle: 4.78 ± 1.104
4.78ValLys: 4.78 ± 1.104
6.692ValLeu: 6.692 ± 2.108
2.868ValMet: 2.868 ± 0.1
2.868ValAsn: 2.868 ± 0.1
3.824ValPro: 3.824 ± 0.804
4.78ValGln: 4.78 ± 1.707
4.78ValArg: 4.78 ± 0.302
4.78ValSer: 4.78 ± 0.302
4.78ValThr: 4.78 ± 2.509
9.56ValVal: 9.56 ± 3.613
0.956ValTrp: 0.956 ± 0.502
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.912TrpAsp: 1.912 ± 1.807
0.956TrpGlu: 0.956 ± 0.904
0.956TrpPhe: 0.956 ± 0.904
0.956TrpGly: 0.956 ± 0.502
0.956TrpHis: 0.956 ± 0.904
0.956TrpIle: 0.956 ± 0.904
0.956TrpLys: 0.956 ± 0.502
1.912TrpLeu: 1.912 ± 0.402
0.0TrpMet: 0.0 ± 0.0
0.956TrpAsn: 0.956 ± 0.502
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.912TrpArg: 1.912 ± 0.402
1.912TrpSer: 1.912 ± 0.402
2.868TrpThr: 2.868 ± 0.1
0.956TrpVal: 0.956 ± 0.502
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.912TyrAla: 1.912 ± 1.004
0.956TyrCys: 0.956 ± 0.502
0.956TyrAsp: 0.956 ± 0.904
1.912TyrGlu: 1.912 ± 1.004
0.0TyrPhe: 0.0 ± 0.0
1.912TyrGly: 1.912 ± 0.402
0.956TyrHis: 0.956 ± 0.904
0.956TyrIle: 0.956 ± 0.904
3.824TyrLys: 3.824 ± 2.209
0.956TyrLeu: 0.956 ± 0.904
2.868TyrMet: 2.868 ± 1.305
0.956TyrAsn: 0.956 ± 0.502
0.956TyrPro: 0.956 ± 0.904
0.956TyrGln: 0.956 ± 0.502
2.868TyrArg: 2.868 ± 0.1
1.912TyrSer: 1.912 ± 0.402
5.736TyrThr: 5.736 ± 1.606
1.912TyrVal: 1.912 ± 1.004
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1047 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
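One plausible reading of protein-level bootstrapping is sketched below: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of those estimates is reported. This is an assumption-laden sketch rather than the database's procedure; the function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide, given as two one-letter codes."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resample, dipeptide))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not taken from this proteome.
proteins = ["MKVLAAGLV", "MAVLKK"]
point = dipeptide_per_mille(proteins, "VL")
error = bootstrap_error(proteins, "VL")
```

With only two proteins, as in this proteome, each resample can only be one of a handful of combinations, so an error estimated this way is necessarily coarse.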

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski