Amino acid dipeptide frequency for Hubei sobemo-like virus 28

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally common, each of the 400 dipeptides would be present with a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
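As a rough illustration of how such a table could be computed, the Python sketch below counts overlapping dipeptides in a set of protein sequences and reports them in per mille, next to the uniform expectation of 1/400. It is not the Proteome-pI implementation: the one-letter amino acid codes, the function name dipeptide_per_mille and the rule of mapping every non-standard letter to X are assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids, one-letter codes (assumed)
ALPHABET = STANDARD + "X"           # X plays the role of Xaa

# Uniform expectation over the 400 standard dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters counts as X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["MKLLA", "LLAB"])  # toy sequences, not real data
    for pair in ("LL", "LA", "AX"):
        status = "over" if freqs[pair] > UNIFORM_PER_MILLE else "under"
        print(pair, round(freqs[pair], 3), status)
```

Comparing against a single uniform value keeps the sketch close to the text; a stricter baseline would use the product of the two single amino acid frequencies, which the page does not describe.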

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
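The unit conversion is a single multiplication; the short Python sketch below spells it out. The helper names are illustrative assumptions, and the only figures used are the LeuLeu value from the table and the 0.25% uniform expectation from the text.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value / 10.0


leu_leu = 14.257  # LeuLeu frequency from the table, in per mille
print(per_mille_to_fraction(leu_leu))
print(per_mille_to_percent(leu_leu))
# Ratio to the uniform expectation of 0.25% (2.5 per mille)
print(per_mille_to_percent(leu_leu) / 0.25)
```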

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.11 ± 2.178
AlaCys: 2.037 ± 0.375
AlaAsp: 5.092 ± 1.763
AlaGlu: 3.055 ± 1.915
AlaPhe: 2.037 ± 1.276
AlaGly: 2.037 ± 0.375
AlaHis: 2.037 ± 2.026
AlaIle: 5.092 ± 1.763
AlaLys: 6.11 ± 2.178
AlaLeu: 6.11 ± 2.178
AlaMet: 1.018 ± 1.013
AlaAsn: 3.055 ± 1.388
AlaPro: 0.0 ± 0.0
AlaGln: 4.073 ± 0.901
AlaArg: 5.092 ± 1.763
AlaSer: 5.092 ± 3.191
AlaThr: 3.055 ± 0.263
AlaVal: 6.11 ± 1.125
AlaTrp: 2.037 ± 0.375
AlaTyr: 4.073 ± 0.901
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.018 ± 0.638
CysCys: 0.0 ± 0.0
CysAsp: 3.055 ± 1.388
CysGlu: 2.037 ± 1.276
CysPhe: 1.018 ± 1.013
CysGly: 2.037 ± 1.276
CysHis: 0.0 ± 0.0
CysIle: 1.018 ± 1.013
CysLys: 0.0 ± 0.0
CysLeu: 1.018 ± 1.013
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.018 ± 0.638
CysGln: 0.0 ± 0.0
CysArg: 3.055 ± 1.915
CysSer: 0.0 ± 0.0
CysThr: 2.037 ± 0.375
CysVal: 3.055 ± 1.388
CysTrp: 0.0 ± 0.0
CysTyr: 1.018 ± 0.638
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.073 ± 4.053
AspCys: 3.055 ± 0.263
AspAsp: 3.055 ± 1.388
AspGlu: 1.018 ± 0.638
AspPhe: 4.073 ± 0.75
AspGly: 3.055 ± 3.04
AspHis: 1.018 ± 1.013
AspIle: 2.037 ± 1.276
AspLys: 2.037 ± 0.375
AspLeu: 3.055 ± 1.388
AspMet: 2.037 ± 2.546
AspAsn: 3.055 ± 3.04
AspPro: 2.037 ± 2.026
AspGln: 3.055 ± 1.388
AspArg: 4.073 ± 0.75
AspSer: 5.092 ± 3.191
AspThr: 2.037 ± 0.375
AspVal: 3.055 ± 1.915
AspTrp: 1.018 ± 1.013
AspTyr: 1.018 ± 0.638
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.092 ± 1.763
GluCys: 1.018 ± 0.638
GluAsp: 4.073 ± 2.401
GluGlu: 8.147 ± 1.803
GluPhe: 2.037 ± 0.375
GluGly: 1.018 ± 0.638
GluHis: 1.018 ± 1.013
GluIle: 3.055 ± 1.388
GluLys: 5.092 ± 0.112
GluLeu: 9.165 ± 2.441
GluMet: 0.0 ± 0.0
GluAsn: 1.018 ± 1.013
GluPro: 3.055 ± 1.388
GluGln: 4.073 ± 2.553
GluArg: 3.055 ± 0.263
GluSer: 4.073 ± 2.553
GluThr: 4.073 ± 0.901
GluVal: 3.055 ± 1.915
GluTrp: 0.0 ± 0.0
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.055 ± 0.263
PheCys: 2.037 ± 0.375
PheAsp: 3.055 ± 1.388
PheGlu: 2.037 ± 2.026
PhePhe: 0.0 ± 0.0
PheGly: 4.073 ± 0.901
PheHis: 1.018 ± 1.013
PheIle: 2.037 ± 2.026
PheLys: 1.018 ± 0.638
PheLeu: 1.018 ± 0.638
PheMet: 0.0 ± 0.0
PheAsn: 1.018 ± 1.013
PhePro: 0.0 ± 0.0
PheGln: 0.0 ± 0.0
PheArg: 1.018 ± 1.013
PheSer: 0.0 ± 0.0
PheThr: 3.055 ± 0.263
PheVal: 2.037 ± 0.375
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.037 ± 1.276
GlyCys: 2.037 ± 0.375
GlyAsp: 5.092 ± 1.54
GlyGlu: 6.11 ± 2.776
GlyPhe: 1.018 ± 0.638
GlyGly: 2.037 ± 2.026
GlyHis: 0.0 ± 0.0
GlyIle: 5.092 ± 1.54
GlyLys: 4.073 ± 0.75
GlyLeu: 9.165 ± 4.092
GlyMet: 3.055 ± 0.263
GlyAsn: 0.0 ± 0.0
GlyPro: 4.073 ± 2.553
GlyGln: 2.037 ± 1.276
GlyArg: 2.037 ± 0.375
GlySer: 6.11 ± 3.829
GlyThr: 0.0 ± 0.0
GlyVal: 5.092 ± 1.763
GlyTrp: 4.073 ± 0.75
GlyTyr: 4.073 ± 0.75
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.055 ± 1.388
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.018 ± 0.638
HisPhe: 1.018 ± 1.013
HisGly: 1.018 ± 1.013
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 1.018 ± 1.013
HisLeu: 6.11 ± 1.125
HisMet: 1.018 ± 0.638
HisAsn: 1.018 ± 1.013
HisPro: 1.018 ± 1.013
HisGln: 1.018 ± 0.638
HisArg: 4.073 ± 4.053
HisSer: 0.0 ± 0.0
HisThr: 2.037 ± 1.276
HisVal: 1.018 ± 0.638
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.055 ± 1.388
IleCys: 0.0 ± 0.0
IleAsp: 5.092 ± 3.415
IleGlu: 3.055 ± 1.915
IlePhe: 0.0 ± 0.0
IleGly: 3.055 ± 0.263
IleHis: 1.018 ± 0.638
IleIle: 4.073 ± 2.401
IleLys: 3.055 ± 0.263
IleLeu: 3.055 ± 3.04
IleMet: 1.018 ± 0.638
IleAsn: 0.0 ± 0.0
IlePro: 4.073 ± 0.75
IleGln: 0.0 ± 0.0
IleArg: 2.037 ± 0.375
IleSer: 5.092 ± 0.112
IleThr: 1.018 ± 0.638
IleVal: 3.055 ± 1.388
IleTrp: 2.037 ± 1.276
IleTyr: 2.037 ± 0.375
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.037 ± 1.276
LysCys: 2.037 ± 1.276
LysAsp: 1.018 ± 1.013
LysGlu: 5.092 ± 1.54
LysPhe: 2.037 ± 0.375
LysGly: 1.018 ± 0.638
LysHis: 1.018 ± 1.013
LysIle: 2.037 ± 1.276
LysLys: 5.092 ± 1.54
LysLeu: 9.165 ± 2.513
LysMet: 1.018 ± 0.638
LysAsn: 3.055 ± 1.915
LysPro: 4.073 ± 2.401
LysGln: 4.073 ± 0.901
LysArg: 4.073 ± 0.901
LysSer: 3.055 ± 1.388
LysThr: 4.073 ± 0.901
LysVal: 5.092 ± 1.54
LysTrp: 3.055 ± 1.388
LysTyr: 1.018 ± 1.013
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.238 ± 3.342
LeuCys: 2.037 ± 0.375
LeuAsp: 6.11 ± 4.428
LeuGlu: 7.128 ± 1.165
LeuPhe: 7.128 ± 3.79
LeuGly: 8.147 ± 3.454
LeuHis: 4.073 ± 0.901
LeuIle: 1.018 ± 0.638
LeuLys: 7.128 ± 0.487
LeuLeu: 14.257 ± 0.974
LeuMet: 7.128 ± 2.816
LeuAsn: 3.055 ± 1.915
LeuPro: 0.0 ± 0.0
LeuGln: 6.11 ± 1.125
LeuArg: 7.128 ± 0.487
LeuSer: 7.128 ± 2.816
LeuThr: 3.055 ± 1.388
LeuVal: 6.11 ± 2.178
LeuTrp: 0.0 ± 0.0
LeuTyr: 5.092 ± 1.763
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 1.018 ± 0.638
MetAsp: 1.018 ± 0.638
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 6.11 ± 1.125
MetHis: 0.0 ± 0.0
MetIle: 1.018 ± 0.638
MetLys: 1.018 ± 1.013
MetLeu: 4.073 ± 0.901
MetMet: 0.0 ± 0.0
MetAsn: 2.037 ± 2.026
MetPro: 1.018 ± 1.013
MetGln: 2.037 ± 0.375
MetArg: 2.037 ± 1.276
MetSer: 2.037 ± 0.375
MetThr: 3.055 ± 1.915
MetVal: 4.073 ± 0.901
MetTrp: 0.0 ± 0.0
MetTyr: 1.018 ± 1.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.092 ± 1.763
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 2.037 ± 2.026
AsnPhe: 0.0 ± 0.0
AsnGly: 1.018 ± 1.013
AsnHis: 1.018 ± 0.638
AsnIle: 0.0 ± 0.0
AsnLys: 1.018 ± 0.638
AsnLeu: 1.018 ± 1.013
AsnMet: 3.055 ± 0.338
AsnAsn: 0.0 ± 0.0
AsnPro: 3.055 ± 0.263
AsnGln: 1.018 ± 0.638
AsnArg: 0.0 ± 0.0
AsnSer: 4.073 ± 0.901
AsnThr: 2.037 ± 0.375
AsnVal: 2.037 ± 2.026
AsnTrp: 2.037 ± 2.026
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.0 ± 0.0
ProCys: 0.0 ± 0.0
ProAsp: 4.073 ± 0.901
ProGlu: 4.073 ± 2.401
ProPhe: 2.037 ± 0.375
ProGly: 7.128 ± 0.487
ProHis: 0.0 ± 0.0
ProIle: 1.018 ± 1.013
ProLys: 3.055 ± 0.263
ProLeu: 8.147 ± 1.803
ProMet: 0.0 ± 0.0
ProAsn: 0.0 ± 0.0
ProPro: 3.055 ± 1.915
ProGln: 0.0 ± 0.0
ProArg: 4.073 ± 0.75
ProSer: 4.073 ± 0.901
ProThr: 1.018 ± 0.638
ProVal: 2.037 ± 2.026
ProTrp: 1.018 ± 0.638
ProTyr: 1.018 ± 1.013
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.018 ± 1.013
GlnCys: 1.018 ± 1.013
GlnAsp: 1.018 ± 0.638
GlnGlu: 1.018 ± 0.638
GlnPhe: 0.0 ± 0.0
GlnGly: 3.055 ± 1.915
GlnHis: 3.055 ± 1.388
GlnIle: 4.073 ± 0.901
GlnLys: 7.128 ± 2.138
GlnLeu: 1.018 ± 0.638
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 3.055 ± 1.915
GlnGln: 4.073 ± 0.75
GlnArg: 0.0 ± 0.0
GlnSer: 4.073 ± 0.901
GlnThr: 1.018 ± 0.638
GlnVal: 1.018 ± 0.638
GlnTrp: 1.018 ± 0.638
GlnTyr: 3.055 ± 0.263
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.092 ± 1.763
ArgCys: 2.037 ± 1.276
ArgAsp: 2.037 ± 2.026
ArgGlu: 1.018 ± 0.638
ArgPhe: 0.0 ± 0.0
ArgGly: 3.055 ± 1.915
ArgHis: 1.018 ± 0.638
ArgIle: 4.073 ± 0.75
ArgLys: 2.037 ± 1.276
ArgLeu: 6.11 ± 1.125
ArgMet: 0.0 ± 0.0
ArgAsn: 2.037 ± 2.026
ArgPro: 1.018 ± 0.638
ArgGln: 1.018 ± 1.013
ArgArg: 2.037 ± 1.276
ArgSer: 7.128 ± 1.165
ArgThr: 4.073 ± 0.901
ArgVal: 7.128 ± 0.487
ArgTrp: 2.037 ± 2.026
ArgTyr: 2.037 ± 2.026
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.147 ± 3.454
SerCys: 0.0 ± 0.0
SerAsp: 3.055 ± 1.915
SerGlu: 3.055 ± 1.915
SerPhe: 2.037 ± 0.375
SerGly: 8.147 ± 1.803
SerHis: 2.037 ± 1.276
SerIle: 3.055 ± 1.388
SerLys: 5.092 ± 1.54
SerLeu: 7.128 ± 4.467
SerMet: 6.11 ± 0.526
SerAsn: 1.018 ± 0.638
SerPro: 6.11 ± 0.526
SerGln: 1.018 ± 0.638
SerArg: 6.11 ± 2.178
SerSer: 11.202 ± 0.415
SerThr: 2.037 ± 1.276
SerVal: 2.037 ± 0.375
SerTrp: 3.055 ± 0.263
SerTyr: 2.037 ± 1.276
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.11 ± 2.178
ThrCys: 1.018 ± 0.638
ThrAsp: 0.0 ± 0.0
ThrGlu: 3.055 ± 1.388
ThrPhe: 1.018 ± 0.638
ThrGly: 4.073 ± 2.553
ThrHis: 0.0 ± 0.0
ThrIle: 3.055 ± 1.388
ThrLys: 3.055 ± 1.915
ThrLeu: 7.128 ± 1.165
ThrMet: 2.037 ± 0.375
ThrAsn: 3.055 ± 1.915
ThrPro: 3.055 ± 1.388
ThrGln: 1.018 ± 0.638
ThrArg: 1.018 ± 1.013
ThrSer: 3.055 ± 0.263
ThrThr: 2.037 ± 1.276
ThrVal: 2.037 ± 0.375
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.055 ± 1.915
ValCys: 1.018 ± 1.013
ValAsp: 2.037 ± 0.375
ValGlu: 5.092 ± 0.112
ValPhe: 0.0 ± 0.0
ValGly: 5.092 ± 0.112
ValHis: 1.018 ± 1.013
ValIle: 2.037 ± 0.375
ValLys: 3.055 ± 0.263
ValLeu: 9.165 ± 0.79
ValMet: 3.055 ± 1.388
ValAsn: 4.073 ± 2.401
ValPro: 4.073 ± 0.901
ValGln: 4.073 ± 0.75
ValArg: 2.037 ± 0.375
ValSer: 5.092 ± 1.54
ValThr: 4.073 ± 0.901
ValVal: 5.092 ± 3.191
ValTrp: 1.018 ± 1.013
ValTyr: 2.037 ± 1.276
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 2.037 ± 0.375
TrpGlu: 1.018 ± 1.013
TrpPhe: 1.018 ± 1.013
TrpGly: 0.0 ± 0.0
TrpHis: 2.037 ± 0.375
TrpIle: 2.037 ± 0.375
TrpLys: 2.037 ± 2.026
TrpLeu: 4.073 ± 0.75
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 3.055 ± 0.263
TrpThr: 1.018 ± 1.013
TrpVal: 3.055 ± 0.263
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.018 ± 0.638
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.037 ± 0.375
TyrCys: 1.018 ± 1.013
TyrAsp: 3.055 ± 1.388
TyrGlu: 3.055 ± 1.915
TyrPhe: 0.0 ± 0.0
TyrGly: 2.037 ± 1.276
TyrHis: 3.055 ± 3.04
TyrIle: 0.0 ± 0.0
TyrLys: 1.018 ± 0.638
TyrLeu: 5.092 ± 1.763
TyrMet: 0.0 ± 0.0
TyrAsn: 1.018 ± 0.638
TyrPro: 2.037 ± 0.375
TyrGln: 1.018 ± 0.638
TyrArg: 2.037 ± 0.375
TyrSer: 3.055 ± 1.915
TyrThr: 1.018 ± 1.013
TyrVal: 0.0 ± 0.0
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (983 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
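For readers unfamiliar with the technique, the Python sketch below shows one way a protein-level bootstrap with 100 rounds could produce such an error value. It is an illustration under stated assumptions, not the procedure used by Proteome-pI: the function names, the fixed random seed, the one-letter sequence codes and the choice of the sample standard deviation as the reported error are all assumptions.

```python
import random
import statistics


def one_dipeptide_per_mille(sequences, dipeptide):
    """Frequency of one dipeptide (e.g. "LL") in per mille over all sequences."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):  # overlapping windows of length 2
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed (assumed) for reproducibility
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome holds, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(one_dipeptide_per_mille(resample, dipeptide))
    # Assumption: the error is the standard deviation of the resampled values.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKLLAG", "LLSSRV"]  # toy sequences, not real data
    print(bootstrap_error(toy_proteome, "LL"))
```

Under this scheme a proteome of two proteins yields only three distinct resample compositions (the first protein twice, the second twice, or one of each), so the resulting error is a coarse estimate. A dipeptide that never occurs gets an error of exactly zero, as in the 0.0 ± 0.0 entries.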

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski