Amino acid dipeptide frequency for Hubei sobemo-like virus 38

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all 20 standard amino acids were equally common, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and underrepresented ones in blue in the table.
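As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_permille`, the one-letter input alphabet, and the choice to count pairs only within each protein (never across protein boundaries) are assumptions made for this example; the database may compute its values differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X (Xaa) stands for any non-standard or unknown residue


def dipeptide_permille(proteins):
    """Return a dict with all 441 dipeptides and their frequency in per mille.

    Assumption: pairs are counted within each sequence only, and the
    denominator is the total number of counted pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the standard alphabet to X.
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Uniform baseline for the 400 standard dipeptides, in per mille (0.25%).
EXPECTED_UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2
```

Calling `dipeptide_permille(["AAC", "ACU"])` on these two toy sequences counts four pairs in total, so `AC` (seen twice) comes out at 500‰ and `AA` and `CX` at 250‰ each.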

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
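The unit conversion can be sketched as follows. The helper names `permille_to_fraction` and `fold_vs_uniform` are hypothetical, and the 2.5‰ baseline assumes that all 400 standard dipeptides are equally likely.

```python
EXPECTED_UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def permille_to_fraction(value_permille):
    """Convert a per mille value from the table into a plain fraction."""
    return value_permille * 1e-3


def fold_vs_uniform(value_permille):
    """Ratio of an observed per mille value to the uniform 2.5 per mille baseline."""
    return value_permille / EXPECTED_UNIFORM_PERMILLE
```

For example, the LeuLeu value of 16.949‰ corresponds to a fraction of about 0.0169, roughly 6.8 times the uniform baseline.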

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.847 ± 0.579
AlaCys: 0.847 ± 0.579
AlaAsp: 2.542 ± 0.977
AlaGlu: 2.542 ± 2.261
AlaPhe: 0.847 ± 0.579
AlaGly: 3.39 ± 1.468
AlaHis: 0.0 ± 0.0
AlaIle: 0.847 ± 0.579
AlaLys: 3.39 ± 1.473
AlaLeu: 4.237 ± 1.582
AlaMet: 1.695 ± 1.159
AlaAsn: 1.695 ± 0.637
AlaPro: 1.695 ± 0.652
AlaGln: 1.695 ± 0.811
AlaArg: 4.237 ± 2.014
AlaSer: 4.237 ± 0.593
AlaThr: 1.695 ± 1.159
AlaVal: 4.237 ± 1.66
AlaTrp: 0.847 ± 0.579
AlaTyr: 0.847 ± 0.754
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.847 ± 0.579
CysCys: 0.0 ± 0.0
CysAsp: 2.542 ± 1.285
CysGlu: 0.0 ± 0.0
CysPhe: 1.695 ± 0.652
CysGly: 2.542 ± 0.232
CysHis: 0.0 ± 0.0
CysIle: 2.542 ± 1.346
CysLys: 0.847 ± 0.754
CysLeu: 4.237 ± 0.593
CysMet: 0.847 ± 0.579
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 3.39 ± 1.622
CysArg: 0.0 ± 0.0
CysSer: 1.695 ± 0.637
CysThr: 0.0 ± 0.0
CysVal: 1.695 ± 1.459
CysTrp: 0.847 ± 0.754
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.237 ± 1.557
AspCys: 0.847 ± 0.754
AspAsp: 7.627 ± 2.931
AspGlu: 5.085 ± 1.957
AspPhe: 2.542 ± 1.738
AspGly: 3.39 ± 0.349
AspHis: 0.0 ± 0.0
AspIle: 2.542 ± 0.976
AspLys: 3.39 ± 1.305
AspLeu: 3.39 ± 3.015
AspMet: 0.847 ± 0.754
AspAsn: 4.237 ± 2.741
AspPro: 3.39 ± 2.003
AspGln: 0.847 ± 0.729
AspArg: 0.847 ± 0.579
AspSer: 2.542 ± 1.738
AspThr: 0.847 ± 0.729
AspVal: 3.39 ± 1.468
AspTrp: 1.695 ± 0.652
AspTyr: 2.542 ± 1.385
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.0 ± 0.0
GluCys: 0.847 ± 0.729
GluAsp: 3.39 ± 2.317
GluGlu: 0.847 ± 0.579
GluPhe: 0.847 ± 0.579
GluGly: 0.847 ± 0.579
GluHis: 0.847 ± 0.754
GluIle: 0.847 ± 0.579
GluLys: 1.695 ± 0.652
GluLeu: 5.085 ± 1.367
GluMet: 1.695 ± 0.652
GluAsn: 0.847 ± 0.579
GluPro: 4.237 ± 1.66
GluGln: 1.695 ± 0.652
GluArg: 1.695 ± 1.159
GluSer: 7.627 ± 3.245
GluThr: 3.39 ± 0.864
GluVal: 2.542 ± 0.977
GluTrp: 0.847 ± 0.729
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.542 ± 1.385
PheCys: 0.847 ± 0.754
PheAsp: 5.085 ± 1.954
PheGlu: 1.695 ± 0.652
PhePhe: 0.0 ± 0.0
PheGly: 2.542 ± 0.976
PheHis: 0.0 ± 0.0
PheIle: 1.695 ± 1.507
PheLys: 4.237 ± 0.593
PheLeu: 5.085 ± 0.794
PheMet: 0.847 ± 1.246
PheAsn: 3.39 ± 2.078
PhePro: 1.695 ± 1.159
PheGln: 4.237 ± 1.894
PheArg: 1.695 ± 0.652
PheSer: 3.39 ± 1.622
PheThr: 1.695 ± 1.159
PheVal: 4.237 ± 3.646
PheTrp: 3.39 ± 1.622
PheTyr: 2.542 ± 1.285
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.237 ± 2.007
GlyCys: 1.695 ± 0.637
GlyAsp: 4.237 ± 2.007
GlyGlu: 0.847 ± 0.579
GlyPhe: 5.932 ± 1.102
GlyGly: 5.085 ± 0.794
GlyHis: 2.542 ± 1.285
GlyIle: 0.847 ± 0.579
GlyLys: 2.542 ± 1.738
GlyLeu: 6.78 ± 2.318
GlyMet: 2.542 ± 0.976
GlyAsn: 1.695 ± 1.159
GlyPro: 1.695 ± 0.637
GlyGln: 1.695 ± 0.652
GlyArg: 1.695 ± 0.652
GlySer: 1.695 ± 1.159
GlyThr: 2.542 ± 1.241
GlyVal: 1.695 ± 0.652
GlyTrp: 0.847 ± 0.579
GlyTyr: 1.695 ± 0.652
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 1.695 ± 0.652
HisAsp: 1.695 ± 0.637
HisGlu: 0.847 ± 0.754
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 1.695 ± 0.811
HisLys: 1.695 ± 1.507
HisLeu: 1.695 ± 0.811
HisMet: 0.847 ± 0.579
HisAsn: 0.847 ± 0.729
HisPro: 0.847 ± 0.754
HisGln: 0.847 ± 0.754
HisArg: 0.0 ± 0.0
HisSer: 2.542 ± 0.977
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 0.847 ± 0.754
HisTyr: 1.695 ± 0.652
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.847 ± 0.579
IleCys: 1.695 ± 1.459
IleAsp: 0.0 ± 0.0
IleGlu: 0.0 ± 0.0
IlePhe: 3.39 ± 2.078
IleGly: 1.695 ± 0.652
IleHis: 0.847 ± 0.754
IleIle: 5.085 ± 2.482
IleLys: 2.542 ± 0.232
IleLeu: 9.322 ± 1.276
IleMet: 1.695 ± 1.459
IleAsn: 2.542 ± 0.232
IlePro: 3.39 ± 0.864
IleGln: 5.085 ± 1.507
IleArg: 3.39 ± 0.349
IleSer: 2.542 ± 0.976
IleThr: 1.695 ± 1.159
IleVal: 2.542 ± 1.241
IleTrp: 1.695 ± 0.637
IleTyr: 1.695 ± 1.507
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.542 ± 2.261
LysCys: 0.847 ± 0.754
LysAsp: 3.39 ± 0.864
LysGlu: 1.695 ± 0.652
LysPhe: 4.237 ± 2.142
LysGly: 1.695 ± 0.652
LysHis: 1.695 ± 1.507
LysIle: 2.542 ± 0.977
LysLys: 5.932 ± 0.123
LysLeu: 6.78 ± 0.637
LysMet: 3.39 ± 2.05
LysAsn: 0.0 ± 0.0
LysPro: 2.542 ± 0.977
LysGln: 4.237 ± 0.654
LysArg: 3.39 ± 0.349
LysSer: 2.542 ± 0.977
LysThr: 5.932 ± 1.285
LysVal: 2.542 ± 0.977
LysTrp: 1.695 ± 0.811
LysTyr: 3.39 ± 1.473
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.932 ± 2.001
LeuCys: 5.085 ± 1.283
LeuAsp: 5.932 ± 5.275
LeuGlu: 5.932 ± 3.034
LeuPhe: 5.932 ± 2.186
LeuGly: 7.627 ± 1.84
LeuHis: 1.695 ± 1.459
LeuIle: 7.627 ± 0.697
LeuLys: 5.085 ± 1.602
LeuLeu: 16.949 ± 7.74
LeuMet: 4.237 ± 1.582
LeuAsn: 2.542 ± 2.188
LeuPro: 5.932 ± 1.033
LeuGln: 1.695 ± 0.637
LeuArg: 14.407 ± 2.385
LeuSer: 6.78 ± 1.727
LeuThr: 5.932 ± 1.256
LeuVal: 10.169 ± 1.976
LeuTrp: 3.39 ± 1.275
LeuTyr: 6.78 ± 2.368
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.847 ± 0.729
MetCys: 0.0 ± 0.0
MetAsp: 0.847 ± 0.754
MetGlu: 2.542 ± 0.977
MetPhe: 3.39 ± 0.864
MetGly: 0.847 ± 0.754
MetHis: 0.847 ± 0.579
MetIle: 0.0 ± 0.0
MetLys: 2.542 ± 1.346
MetLeu: 2.542 ± 1.346
MetMet: 4.237 ± 3.646
MetAsn: 1.695 ± 1.459
MetPro: 0.847 ± 0.579
MetGln: 1.695 ± 1.459
MetArg: 1.695 ± 0.637
MetSer: 1.695 ± 1.159
MetThr: 2.542 ± 2.188
MetVal: 5.085 ± 1.912
MetTrp: 0.0 ± 0.0
MetTyr: 1.695 ± 1.159
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.39 ± 0.915
AsnCys: 0.847 ± 0.754
AsnAsp: 1.695 ± 0.652
AsnGlu: 0.847 ± 0.579
AsnPhe: 3.39 ± 2.078
AsnGly: 1.695 ± 1.159
AsnHis: 1.695 ± 0.637
AsnIle: 3.39 ± 0.915
AsnLys: 3.39 ± 1.305
AsnLeu: 8.475 ± 4.277
AsnMet: 1.695 ± 1.329
AsnAsn: 0.847 ± 0.729
AsnPro: 2.542 ± 1.346
AsnGln: 0.0 ± 0.0
AsnArg: 0.0 ± 0.0
AsnSer: 2.542 ± 0.232
AsnThr: 0.847 ± 0.579
AsnVal: 1.695 ± 0.637
AsnTrp: 0.847 ± 0.729
AsnTyr: 1.695 ± 0.652
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.542 ± 1.738
ProCys: 0.847 ± 0.754
ProAsp: 1.695 ± 0.652
ProGlu: 2.542 ± 1.738
ProPhe: 4.237 ± 1.894
ProGly: 3.39 ± 2.003
ProHis: 1.695 ± 0.811
ProIle: 1.695 ± 1.159
ProLys: 0.847 ± 0.729
ProLeu: 6.78 ± 2.318
ProMet: 0.0 ± 0.0
ProAsn: 2.542 ± 0.232
ProPro: 3.39 ± 1.473
ProGln: 1.695 ± 1.459
ProArg: 0.0 ± 0.0
ProSer: 4.237 ± 0.593
ProThr: 3.39 ± 1.305
ProVal: 4.237 ± 0.593
ProTrp: 0.0 ± 0.0
ProTyr: 1.695 ± 0.637
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.695 ± 0.811
GlnCys: 1.695 ± 1.507
GlnAsp: 1.695 ± 0.811
GlnGlu: 0.0 ± 0.0
GlnPhe: 2.542 ± 2.261
GlnGly: 0.0 ± 0.0
GlnHis: 0.0 ± 0.0
GlnIle: 1.695 ± 0.652
GlnLys: 5.085 ± 1.602
GlnLeu: 4.237 ± 1.543
GlnMet: 4.237 ± 1.582
GlnAsn: 0.847 ± 0.579
GlnPro: 2.542 ± 0.976
GlnGln: 4.237 ± 0.928
GlnArg: 1.695 ± 0.652
GlnSer: 3.39 ± 1.473
GlnThr: 0.0 ± 0.0
GlnVal: 6.78 ± 1.795
GlnTrp: 2.542 ± 1.346
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.847 ± 0.579
ArgCys: 0.0 ± 0.0
ArgAsp: 0.847 ± 0.579
ArgGlu: 0.847 ± 0.579
ArgPhe: 1.695 ± 0.637
ArgGly: 1.695 ± 1.159
ArgHis: 0.0 ± 0.0
ArgIle: 3.39 ± 0.349
ArgLys: 2.542 ± 1.738
ArgLeu: 11.017 ± 2.205
ArgMet: 0.0 ± 0.0
ArgAsn: 3.39 ± 1.473
ArgPro: 0.0 ± 0.0
ArgGln: 1.695 ± 0.637
ArgArg: 1.695 ± 1.159
ArgSer: 5.085 ± 0.829
ArgThr: 2.542 ± 1.346
ArgVal: 2.542 ± 0.976
ArgTrp: 2.542 ± 2.261
ArgTyr: 4.237 ± 2.007
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.695 ± 1.159
SerCys: 0.0 ± 0.0
SerAsp: 2.542 ± 1.285
SerGlu: 4.237 ± 2.897
SerPhe: 2.542 ± 1.738
SerGly: 5.932 ± 1.273
SerHis: 2.542 ± 1.285
SerIle: 2.542 ± 0.976
SerLys: 3.39 ± 1.305
SerLeu: 11.864 ± 3.099
SerMet: 0.847 ± 0.729
SerAsn: 4.237 ± 2.142
SerPro: 3.39 ± 1.305
SerGln: 3.39 ± 0.349
SerArg: 3.39 ± 1.473
SerSer: 5.085 ± 0.829
SerThr: 5.932 ± 2.086
SerVal: 5.085 ± 1.951
SerTrp: 0.847 ± 0.729
SerTyr: 1.695 ± 0.637
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.39 ± 2.317
ThrCys: 0.0 ± 0.0
ThrAsp: 1.695 ± 1.159
ThrGlu: 2.542 ± 0.976
ThrPhe: 2.542 ± 2.188
ThrGly: 3.39 ± 0.349
ThrHis: 0.847 ± 0.579
ThrIle: 2.542 ± 1.346
ThrLys: 3.39 ± 2.003
ThrLeu: 5.085 ± 0.829
ThrMet: 0.847 ± 0.729
ThrAsn: 3.39 ± 0.915
ThrPro: 5.085 ± 1.507
ThrGln: 0.847 ± 0.579
ThrArg: 0.847 ± 0.729
ThrSer: 5.085 ± 0.794
ThrThr: 5.932 ± 4.055
ThrVal: 4.237 ± 0.593
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.695 ± 1.159
ValCys: 4.237 ± 1.543
ValAsp: 2.542 ± 1.285
ValGlu: 5.085 ± 0.829
ValPhe: 3.39 ± 0.349
ValGly: 3.39 ± 1.473
ValHis: 0.0 ± 0.0
ValIle: 6.78 ± 3.762
ValLys: 5.085 ± 0.464
ValLeu: 6.78 ± 1.833
ValMet: 3.39 ± 1.275
ValAsn: 2.542 ± 1.385
ValPro: 1.695 ± 1.459
ValGln: 3.39 ± 1.468
ValArg: 2.542 ± 0.232
ValSer: 4.237 ± 0.654
ValThr: 3.39 ± 1.275
ValVal: 5.932 ± 3.166
ValTrp: 1.695 ± 1.159
ValTyr: 4.237 ± 1.043
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.39 ± 0.864
TrpCys: 0.0 ± 0.0
TrpAsp: 1.695 ± 1.507
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.695 ± 0.637
TrpGly: 0.847 ± 0.754
TrpHis: 1.695 ± 1.507
TrpIle: 1.695 ± 1.159
TrpLys: 0.847 ± 0.729
TrpLeu: 5.085 ± 2.691
TrpMet: 0.0 ± 0.0
TrpAsn: 0.847 ± 0.729
TrpPro: 1.695 ± 0.637
TrpGln: 0.847 ± 0.729
TrpArg: 1.695 ± 0.652
TrpSer: 0.847 ± 0.754
TrpThr: 1.695 ± 1.507
TrpVal: 0.847 ± 0.579
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.847 ± 0.579
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.847 ± 0.579
TyrCys: 1.695 ± 0.811
TyrAsp: 2.542 ± 0.232
TyrGlu: 1.695 ± 1.159
TyrPhe: 0.847 ± 0.579
TyrGly: 2.542 ± 0.976
TyrHis: 0.847 ± 0.579
TyrIle: 1.695 ± 0.637
TyrLys: 2.542 ± 0.232
TyrLeu: 4.237 ± 2.099
TyrMet: 0.847 ± 0.579
TyrAsn: 3.39 ± 1.305
TyrPro: 0.847 ± 0.754
TyrGln: 1.695 ± 1.507
TyrArg: 1.695 ± 0.652
TyrSer: 3.39 ± 1.468
TyrThr: 1.695 ± 0.652
TyrVal: 2.542 ± 1.385
TyrTrp: 1.695 ± 1.507
TyrTyr: 1.695 ± 0.652
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1181 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
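A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the statistic is recomputed on each resample, and the spread of the 100 estimates serves as the error. The function names, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions for illustration; the database's exact procedure is not described here.

```python
import random
import statistics


def dipeptide_permille_single(proteins, dipeptide):
    """Per mille frequency of one dipeptide (one-letter codes) in a list of sequences."""
    count = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == dipeptide:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille_single(sample, dipeptide))
    # Assumption: the reported error is the standard deviation across replicates.
    return statistics.stdev(estimates)
```

With only 3 proteins, each resample can differ markedly from the original proteome, which may explain why many errors in the table are of the same order as the values themselves. A dipeptide that never occurs gets an error of 0.0 under this scheme, matching the `0.0 ± 0.0` entries.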

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski