Amino acid dipeptide frequency for Hubei picorna-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
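
As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping dipeptides and compares each frequency with the uniform expectation of 2.5‰. The function names, the toy sequences, and the choice to count pairs only within a protein (never across protein boundaries) are assumptions made for this example; the page does not state how Proteome-pI implements the calculation.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (Xaa / "X" is left out here).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1 of 400 dipeptides = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (sequences shorter than 2)")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"   # would be marked in red
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"  # would be marked in blue
    return "expected"


if __name__ == "__main__":
    toy_proteins = ["AAAS", "SV"]  # hypothetical sequences, not the viral proteome
    freqs = dipeptide_per_mille(toy_proteins)
    for name in ("AA", "AS", "CC"):
        print(name, freqs[name], classify(freqs[name]))
```

Applied to the table, a value such as 9.36‰ would fall in the "over" class and 0.374‰ in the "under" class.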

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
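
The unit conversion can be sketched in a few lines of Python. The second helper turns a per mille value back into an approximate dipeptide count. It assumes that dipeptides are counted as overlapping pairs within each protein, so that the 2 proteins with 2672 amino acids reported below the table give 2672 − 2 = 2670 dipeptides. That assumption is not stated on the page, and the helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def approx_count(value, n_dipeptides):
    """Estimate how many occurrences a per mille value corresponds to."""
    return round(per_mille_to_fraction(value) * n_dipeptides)


# Assumption: each protein of length L contributes L - 1 overlapping dipeptides.
N_DIPEPTIDES = 2672 - 2

if __name__ == "__main__":
    for value in (0.374, 5.99, 9.36):
        print(value, per_mille_to_fraction(value), approx_count(value, N_DIPEPTIDES))
```

Under this assumption the smallest non-zero entry in the table, 0.374‰, corresponds to a single observed dipeptide, which gives a sense of how small the underlying sample is.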

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.99 ± 0.726
AlaCys: 1.498 ± 0.146
AlaAsp: 2.995 ± 0.292
AlaGlu: 3.744 ± 0.037
AlaPhe: 2.995 ± 0.947
AlaGly: 6.365 ± 0.526
AlaHis: 1.123 ± 0.601
AlaIle: 2.246 ± 1.202
AlaLys: 3.744 ± 1.347
AlaLeu: 5.99 ± 1.894
AlaMet: 2.246 ± 0.546
AlaAsn: 5.99 ± 1.382
AlaPro: 5.616 ± 2.237
AlaGln: 2.246 ± 0.546
AlaArg: 2.246 ± 0.546
AlaSer: 7.862 ± 0.275
AlaThr: 5.241 ± 3.092
AlaVal: 7.113 ± 0.126
AlaTrp: 1.498 ± 0.509
AlaTyr: 1.872 ± 1.001
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.872 ± 0.346
CysCys: 0.0 ± 0.0
CysAsp: 1.123 ± 0.709
CysGlu: 1.123 ± 0.054
CysPhe: 0.749 ± 0.91
CysGly: 0.374 ± 0.2
CysHis: 0.374 ± 0.2
CysIle: 1.123 ± 0.601
CysLys: 0.749 ± 0.401
CysLeu: 0.749 ± 0.401
CysMet: 1.498 ± 0.801
CysAsn: 1.498 ± 0.146
CysPro: 0.374 ± 0.2
CysGln: 1.498 ± 0.146
CysArg: 1.498 ± 0.146
CysSer: 0.749 ± 0.401
CysThr: 1.498 ± 0.146
CysVal: 1.872 ± 1.001
CysTrp: 0.374 ± 0.2
CysTyr: 0.749 ± 0.401
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.493 ± 1.093
AspCys: 0.374 ± 0.455
AspAsp: 2.995 ± 0.947
AspGlu: 3.37 ± 0.492
AspPhe: 2.995 ± 1.018
AspGly: 2.621 ± 0.564
AspHis: 1.123 ± 0.601
AspIle: 4.118 ± 0.238
AspLys: 1.498 ± 0.146
AspLeu: 3.744 ± 0.692
AspMet: 3.744 ± 0.692
AspAsn: 1.498 ± 0.801
AspPro: 3.37 ± 0.818
AspGln: 1.498 ± 0.801
AspArg: 1.123 ± 0.601
AspSer: 2.246 ± 1.419
AspThr: 2.246 ± 0.764
AspVal: 4.493 ± 0.872
AspTrp: 0.374 ± 0.2
AspTyr: 1.498 ± 0.146
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.241 ± 2.149
GluCys: 1.872 ± 0.346
GluAsp: 2.995 ± 1.018
GluGlu: 2.246 ± 0.109
GluPhe: 2.246 ± 0.109
GluGly: 2.621 ± 0.747
GluHis: 0.749 ± 0.401
GluIle: 2.995 ± 0.292
GluLys: 2.995 ± 1.602
GluLeu: 4.493 ± 0.872
GluMet: 0.374 ± 0.2
GluAsn: 1.123 ± 0.709
GluPro: 0.749 ± 0.401
GluGln: 1.872 ± 0.309
GluArg: 2.995 ± 0.292
GluSer: 3.744 ± 0.037
GluThr: 3.744 ± 0.692
GluVal: 1.872 ± 1.001
GluTrp: 1.123 ± 0.601
GluTyr: 2.995 ± 0.947
GluXaa: 0.0 ± 0.0

Phe
PheAla: 5.241 ± 0.183
PheCys: 1.498 ± 0.509
PheAsp: 2.621 ± 0.747
PheGlu: 1.498 ± 0.801
PhePhe: 1.123 ± 0.709
PheGly: 1.872 ± 0.309
PheHis: 0.749 ± 0.255
PheIle: 0.374 ± 0.2
PheLys: 2.246 ± 0.109
PheLeu: 3.744 ± 0.692
PheMet: 1.872 ± 0.346
PheAsn: 0.749 ± 0.255
PhePro: 0.749 ± 0.255
PheGln: 1.123 ± 1.365
PheArg: 2.246 ± 0.109
PheSer: 4.493 ± 1.528
PheThr: 2.246 ± 0.764
PheVal: 2.995 ± 1.673
PheTrp: 0.374 ± 0.2
PheTyr: 1.872 ± 0.964
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.621 ± 1.219
GlyCys: 1.498 ± 0.146
GlyAsp: 2.621 ± 0.564
GlyGlu: 3.744 ± 0.037
GlyPhe: 2.621 ± 0.092
GlyGly: 2.995 ± 0.947
GlyHis: 1.123 ± 0.601
GlyIle: 6.365 ± 0.526
GlyLys: 1.123 ± 0.601
GlyLeu: 5.616 ± 1.694
GlyMet: 2.246 ± 0.109
GlyAsn: 1.872 ± 0.964
GlyPro: 1.498 ± 0.146
GlyGln: 1.872 ± 0.346
GlyArg: 2.621 ± 1.874
GlySer: 2.995 ± 0.363
GlyThr: 4.493 ± 0.872
GlyVal: 5.241 ± 0.472
GlyTrp: 1.123 ± 0.054
GlyTyr: 1.498 ± 0.146
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.995 ± 0.292
HisCys: 0.749 ± 0.401
HisAsp: 1.498 ± 0.509
HisGlu: 0.749 ± 0.401
HisPhe: 0.374 ± 0.2
HisGly: 0.749 ± 0.401
HisHis: 0.749 ± 0.401
HisIle: 1.498 ± 0.801
HisLys: 1.872 ± 0.346
HisLeu: 2.246 ± 0.109
HisMet: 0.0 ± 0.0
HisAsn: 1.498 ± 0.146
HisPro: 1.498 ± 0.801
HisGln: 0.374 ± 0.2
HisArg: 1.498 ± 0.801
HisSer: 0.749 ± 0.401
HisThr: 2.621 ± 0.747
HisVal: 0.374 ± 0.455
HisTrp: 0.374 ± 0.2
HisTyr: 0.374 ± 0.2
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.867 ± 0.017
IleCys: 1.872 ± 1.001
IleAsp: 2.246 ± 0.109
IleGlu: 3.37 ± 1.147
IlePhe: 2.621 ± 0.564
IleGly: 3.37 ± 1.147
IleHis: 1.498 ± 0.801
IleIle: 1.872 ± 0.346
IleLys: 2.246 ± 1.202
IleLeu: 4.867 ± 0.017
IleMet: 1.872 ± 1.001
IleAsn: 3.37 ± 0.492
IlePro: 3.37 ± 0.818
IleGln: 0.749 ± 0.401
IleArg: 2.246 ± 0.546
IleSer: 4.867 ± 0.672
IleThr: 4.118 ± 1.728
IleVal: 4.867 ± 0.638
IleTrp: 1.498 ± 0.801
IleTyr: 1.498 ± 0.509
IleXaa: 0.0 ± 0.0

Lys
LysAla: 1.872 ± 1.001
LysCys: 0.749 ± 0.401
LysAsp: 3.37 ± 1.147
LysGlu: 2.246 ± 1.202
LysPhe: 0.749 ± 0.401
LysGly: 2.621 ± 0.747
LysHis: 1.872 ± 0.346
LysIle: 2.995 ± 0.947
LysLys: 1.498 ± 0.801
LysLeu: 4.118 ± 1.073
LysMet: 1.123 ± 0.601
LysAsn: 1.498 ± 0.801
LysPro: 2.621 ± 1.402
LysGln: 2.621 ± 0.092
LysArg: 3.37 ± 1.147
LysSer: 3.37 ± 1.802
LysThr: 2.621 ± 1.402
LysVal: 3.37 ± 0.492
LysTrp: 0.374 ± 0.2
LysTyr: 3.744 ± 0.618
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.365 ± 0.784
LeuCys: 2.246 ± 0.546
LeuAsp: 4.867 ± 1.948
LeuGlu: 4.118 ± 0.893
LeuPhe: 4.118 ± 1.728
LeuGly: 2.621 ± 1.874
LeuHis: 1.872 ± 1.001
LeuIle: 4.493 ± 2.403
LeuLys: 3.744 ± 0.692
LeuLeu: 5.616 ± 2.349
LeuMet: 2.995 ± 0.947
LeuAsn: 1.872 ± 0.309
LeuPro: 2.621 ± 0.564
LeuGln: 3.37 ± 1.147
LeuArg: 2.995 ± 0.947
LeuSer: 6.739 ± 2.291
LeuThr: 9.36 ± 0.421
LeuVal: 5.241 ± 1.127
LeuTrp: 0.749 ± 0.255
LeuTyr: 2.621 ± 0.747
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.995 ± 1.602
MetCys: 1.498 ± 0.146
MetAsp: 0.749 ± 0.91
MetGlu: 3.37 ± 0.492
MetPhe: 1.123 ± 0.601
MetGly: 4.118 ± 0.238
MetHis: 1.872 ± 0.346
MetIle: 1.872 ± 0.346
MetLys: 2.995 ± 1.018
MetLeu: 1.123 ± 0.054
MetMet: 1.123 ± 0.141
MetAsn: 1.872 ± 0.346
MetPro: 1.123 ± 0.054
MetGln: 1.123 ± 0.601
MetArg: 2.995 ± 0.292
MetSer: 2.621 ± 1.219
MetThr: 2.246 ± 1.202
MetVal: 1.872 ± 0.964
MetTrp: 0.0 ± 0.0
MetTyr: 0.749 ± 0.401
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.621 ± 0.564
AsnCys: 1.123 ± 0.601
AsnAsp: 1.123 ± 1.365
AsnGlu: 1.123 ± 1.365
AsnPhe: 1.872 ± 0.309
AsnGly: 1.872 ± 1.619
AsnHis: 1.123 ± 0.601
AsnIle: 2.246 ± 0.764
AsnLys: 1.872 ± 0.346
AsnLeu: 2.246 ± 1.202
AsnMet: 1.498 ± 0.801
AsnAsn: 3.744 ± 0.037
AsnPro: 1.123 ± 0.054
AsnGln: 1.872 ± 0.346
AsnArg: 2.246 ± 0.546
AsnSer: 5.241 ± 0.472
AsnThr: 6.365 ± 1.836
AsnVal: 6.739 ± 2.946
AsnTrp: 0.749 ± 0.401
AsnTyr: 1.498 ± 1.164
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.872 ± 1.619
ProCys: 0.374 ± 0.455
ProAsp: 2.246 ± 0.546
ProGlu: 1.498 ± 0.801
ProPhe: 2.246 ± 1.419
ProGly: 1.498 ± 1.164
ProHis: 0.374 ± 0.2
ProIle: 3.37 ± 1.473
ProLys: 1.123 ± 0.601
ProLeu: 4.118 ± 0.893
ProMet: 1.498 ± 0.509
ProAsn: 1.872 ± 0.964
ProPro: 1.123 ± 0.601
ProGln: 4.867 ± 0.017
ProArg: 2.621 ± 0.092
ProSer: 3.37 ± 0.818
ProThr: 3.37 ± 0.818
ProVal: 4.118 ± 0.418
ProTrp: 1.498 ± 0.146
ProTyr: 2.995 ± 0.363
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.498 ± 0.509
GlnCys: 0.374 ± 0.2
GlnAsp: 0.749 ± 0.401
GlnGlu: 1.498 ± 0.146
GlnPhe: 0.749 ± 0.401
GlnGly: 1.123 ± 0.054
GlnHis: 0.374 ± 0.2
GlnIle: 2.621 ± 0.092
GlnLys: 1.123 ± 0.601
GlnLeu: 3.37 ± 0.492
GlnMet: 1.123 ± 0.054
GlnAsn: 1.498 ± 1.164
GlnPro: 2.621 ± 0.564
GlnGln: 0.374 ± 0.2
GlnArg: 2.995 ± 0.292
GlnSer: 2.621 ± 0.092
GlnThr: 2.621 ± 1.402
GlnVal: 1.498 ± 0.146
GlnTrp: 1.498 ± 0.146
GlnTyr: 1.123 ± 0.709
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.493 ± 1.748
ArgCys: 1.123 ± 0.601
ArgAsp: 3.37 ± 0.492
ArgGlu: 1.872 ± 0.346
ArgPhe: 2.621 ± 0.564
ArgGly: 2.246 ± 0.764
ArgHis: 0.374 ± 0.2
ArgIle: 2.995 ± 0.363
ArgLys: 2.246 ± 0.546
ArgLeu: 4.118 ± 0.893
ArgMet: 2.995 ± 0.363
ArgAsn: 2.621 ± 0.092
ArgPro: 2.995 ± 1.018
ArgGln: 1.498 ± 0.509
ArgArg: 3.744 ± 0.037
ArgSer: 3.37 ± 0.492
ArgThr: 2.995 ± 0.292
ArgVal: 2.995 ± 0.292
ArgTrp: 0.374 ± 0.2
ArgTyr: 2.995 ± 1.018
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.99 ± 0.726
SerCys: 1.498 ± 0.146
SerAsp: 3.37 ± 0.492
SerGlu: 5.616 ± 2.349
SerPhe: 2.621 ± 1.219
SerGly: 4.867 ± 0.017
SerHis: 1.872 ± 0.346
SerIle: 6.739 ± 0.981
SerLys: 3.37 ± 1.147
SerLeu: 5.241 ± 2.437
SerMet: 2.621 ± 0.564
SerAsn: 4.867 ± 1.982
SerPro: 4.867 ± 1.982
SerGln: 1.498 ± 0.509
SerArg: 2.995 ± 0.363
SerSer: 7.862 ± 1.691
SerThr: 3.744 ± 1.928
SerVal: 7.488 ± 2.546
SerTrp: 1.498 ± 0.146
SerTyr: 1.872 ± 0.346
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 7.862 ± 0.38
ThrCys: 0.374 ± 0.2
ThrAsp: 2.246 ± 0.764
ThrGlu: 3.744 ± 0.618
ThrPhe: 3.37 ± 0.818
ThrGly: 3.37 ± 0.163
ThrHis: 1.123 ± 0.601
ThrIle: 4.867 ± 0.638
ThrLys: 2.995 ± 0.947
ThrLeu: 6.739 ± 0.981
ThrMet: 2.995 ± 1.018
ThrAsn: 2.621 ± 0.564
ThrPro: 5.616 ± 1.582
ThrGln: 0.749 ± 0.401
ThrArg: 3.744 ± 0.618
ThrSer: 6.739 ± 0.981
ThrThr: 5.616 ± 3.547
ThrVal: 3.744 ± 0.618
ThrTrp: 1.872 ± 0.309
ThrTyr: 2.246 ± 0.764
ThrXaa: 0.0 ± 0.0

Val
ValAla: 7.113 ± 2.091
ValCys: 0.374 ± 0.2
ValAsp: 4.493 ± 0.872
ValGlu: 3.744 ± 0.618
ValPhe: 2.246 ± 0.109
ValGly: 5.241 ± 0.838
ValHis: 2.995 ± 1.018
ValIle: 1.123 ± 0.601
ValLys: 5.616 ± 1.039
ValLeu: 4.493 ± 0.438
ValMet: 3.37 ± 0.144
ValAsn: 3.37 ± 1.473
ValPro: 3.37 ± 0.163
ValGln: 2.246 ± 0.764
ValArg: 4.118 ± 0.238
ValSer: 8.237 ± 4.111
ValThr: 3.744 ± 0.037
ValVal: 5.241 ± 1.493
ValTrp: 1.872 ± 0.346
ValTyr: 3.37 ± 0.492
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.123 ± 0.054
TrpCys: 0.374 ± 0.2
TrpAsp: 1.123 ± 0.054
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.498 ± 0.801
TrpGly: 1.498 ± 0.146
TrpHis: 1.123 ± 0.054
TrpIle: 1.123 ± 0.601
TrpLys: 0.749 ± 0.401
TrpLeu: 2.246 ± 0.546
TrpMet: 0.374 ± 0.455
TrpAsn: 0.749 ± 0.401
TrpPro: 0.374 ± 0.2
TrpGln: 0.0 ± 0.0
TrpArg: 0.749 ± 0.255
TrpSer: 1.123 ± 0.054
TrpThr: 1.498 ± 1.164
TrpVal: 1.498 ± 0.801
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.123 ± 0.054
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.498 ± 0.509
TyrCys: 0.374 ± 0.2
TyrAsp: 2.621 ± 1.402
TyrGlu: 0.374 ± 0.2
TyrPhe: 0.749 ± 0.401
TyrGly: 3.744 ± 0.618
TyrHis: 0.374 ± 0.455
TyrIle: 2.246 ± 0.109
TyrLys: 2.995 ± 1.602
TyrLeu: 3.744 ± 0.037
TyrMet: 1.872 ± 0.309
TyrAsn: 4.118 ± 1.073
TyrPro: 0.374 ± 0.2
TyrGln: 0.0 ± 0.0
TyrArg: 2.995 ± 1.673
TyrSer: 1.498 ± 0.146
TyrThr: 2.246 ± 0.764
TyrVal: 3.744 ± 0.692
TyrTrp: 1.123 ± 0.709
TyrTyr: 2.246 ± 0.109
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2 proteins (2672 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
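
A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported. Using the population standard deviation as the error, the fixed seed, the function names, and the toy sequences are all assumptions for illustration; the page only states that bootstrapping (×100) was done at the protein level.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        # Overlapping pairs within each protein only.
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    # Assumption: the reported error is a standard deviation of the replicates.
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy_proteins = ["AAAASV", "SVSVAA"]  # hypothetical sequences
    print(dipeptide_frequency(toy_proteins, "AA"))
    print(bootstrap_error(toy_proteins, "AA"))
```

With only two proteins, each replicate is one of just three distinct combinations (first protein twice, second protein twice, or one of each), so the resulting error estimates are necessarily coarse.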

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski