Amino acid dipepetide frequency for Human coronavirus HKU1 (HCoV-HKU1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.41AlaAla: 3.41 ± 1.072
1.805AlaCys: 1.805 ± 0.367
3.209AlaAsp: 3.209 ± 0.835
1.404AlaGlu: 1.404 ± 0.631
3.008AlaPhe: 3.008 ± 0.828
2.708AlaGly: 2.708 ± 0.337
0.602AlaHis: 0.602 ± 0.241
4.011AlaIle: 4.011 ± 0.952
3.008AlaLys: 3.008 ± 0.671
4.112AlaLeu: 4.112 ± 0.796
1.404AlaMet: 1.404 ± 0.248
3.71AlaAsn: 3.71 ± 1.208
1.705AlaPro: 1.705 ± 0.483
1.504AlaGln: 1.504 ± 0.283
1.504AlaArg: 1.504 ± 0.793
4.312AlaSer: 4.312 ± 0.771
3.008AlaThr: 3.008 ± 0.612
4.011AlaVal: 4.011 ± 1.144
0.602AlaTrp: 0.602 ± 0.26
2.306AlaTyr: 2.306 ± 0.44
0.0AlaXaa: 0.0 ± 0.0
Cys
1.705CysAla: 1.705 ± 0.538
1.304CysCys: 1.304 ± 0.422
2.006CysAsp: 2.006 ± 0.657
1.304CysGlu: 1.304 ± 0.424
2.206CysPhe: 2.206 ± 0.321
2.206CysGly: 2.206 ± 0.564
0.903CysHis: 0.903 ± 0.288
2.206CysIle: 2.206 ± 0.539
2.006CysLys: 2.006 ± 0.433
2.908CysLeu: 2.908 ± 0.676
0.602CysMet: 0.602 ± 0.258
2.708CysAsn: 2.708 ± 0.356
0.903CysPro: 0.903 ± 0.24
1.103CysGln: 1.103 ± 0.283
1.304CysArg: 1.304 ± 0.384
3.41CysSer: 3.41 ± 1.05
1.805CysThr: 1.805 ± 0.442
2.607CysVal: 2.607 ± 0.675
0.501CysTrp: 0.501 ± 0.264
2.106CysTyr: 2.106 ± 0.511
0.0CysXaa: 0.0 ± 0.0
Asp
2.908AspAla: 2.908 ± 1.209
2.607AspCys: 2.607 ± 0.544
4.713AspAsp: 4.713 ± 2.156
3.109AspGlu: 3.109 ± 0.895
4.212AspPhe: 4.212 ± 0.912
3.811AspGly: 3.811 ± 1.342
0.602AspHis: 0.602 ± 0.291
4.112AspIle: 4.112 ± 0.996
3.51AspLys: 3.51 ± 0.53
6.017AspLeu: 6.017 ± 0.905
1.304AspMet: 1.304 ± 0.287
4.011AspAsn: 4.011 ± 1.145
1.304AspPro: 1.304 ± 0.614
1.103AspGln: 1.103 ± 0.578
1.203AspArg: 1.203 ± 0.384
4.112AspSer: 4.112 ± 1.681
2.708AspThr: 2.708 ± 0.395
6.819AspVal: 6.819 ± 1.615
0.602AspTrp: 0.602 ± 0.339
3.71AspTyr: 3.71 ± 0.879
0.0AspXaa: 0.0 ± 0.0
Glu
2.407GluAla: 2.407 ± 1.012
0.602GluCys: 0.602 ± 0.192
3.51GluAsp: 3.51 ± 1.062
1.404GluGlu: 1.404 ± 0.316
2.006GluPhe: 2.006 ± 0.535
1.404GluGly: 1.404 ± 0.778
0.501GluHis: 0.501 ± 0.157
2.206GluIle: 2.206 ± 0.903
1.203GluLys: 1.203 ± 0.199
3.008GluLeu: 3.008 ± 1.135
0.401GluMet: 0.401 ± 0.369
2.006GluAsn: 2.006 ± 0.917
1.203GluPro: 1.203 ± 0.332
0.702GluGln: 0.702 ± 0.279
1.203GluArg: 1.203 ± 0.232
2.708GluSer: 2.708 ± 0.764
2.106GluThr: 2.106 ± 0.482
2.808GluVal: 2.808 ± 0.537
0.201GluTrp: 0.201 ± 0.308
1.905GluTyr: 1.905 ± 0.404
0.0GluXaa: 0.0 ± 0.0
Phe
2.106PheAla: 2.106 ± 0.425
2.708PheCys: 2.708 ± 0.562
4.412PheAsp: 4.412 ± 0.719
2.106PheGlu: 2.106 ± 0.44
2.708PhePhe: 2.708 ± 0.722
3.309PheGly: 3.309 ± 1.202
0.802PheHis: 0.802 ± 0.454
4.312PheIle: 4.312 ± 0.947
4.513PheLys: 4.513 ± 0.529
4.513PheLeu: 4.513 ± 1.675
1.203PheMet: 1.203 ± 0.384
5.014PheAsn: 5.014 ± 1.106
1.705PhePro: 1.705 ± 0.668
1.203PheGln: 1.203 ± 0.409
1.604PheArg: 1.604 ± 0.806
5.114PheSer: 5.114 ± 1.307
4.412PheThr: 4.412 ± 0.83
6.318PheVal: 6.318 ± 1.179
0.903PheTrp: 0.903 ± 0.365
4.212PheTyr: 4.212 ± 0.961
0.0PheXaa: 0.0 ± 0.0
Gly
1.905GlyAla: 1.905 ± 0.455
2.607GlyCys: 2.607 ± 0.693
4.613GlyAsp: 4.613 ± 1.404
1.003GlyGlu: 1.003 ± 0.235
4.813GlyPhe: 4.813 ± 0.526
2.607GlyGly: 2.607 ± 0.932
1.003GlyHis: 1.003 ± 0.232
3.209GlyIle: 3.209 ± 0.918
2.607GlyLys: 2.607 ± 0.788
4.312GlyLeu: 4.312 ± 0.564
1.003GlyMet: 1.003 ± 0.581
2.908GlyAsn: 2.908 ± 0.443
1.304GlyPro: 1.304 ± 0.477
1.103GlyGln: 1.103 ± 0.412
2.206GlyArg: 2.206 ± 0.49
5.515GlySer: 5.515 ± 1.362
3.109GlyThr: 3.109 ± 0.945
6.518GlyVal: 6.518 ± 0.487
0.802GlyTrp: 0.802 ± 0.279
2.908GlyTyr: 2.908 ± 0.529
0.0GlyXaa: 0.0 ± 0.0
His
1.003HisAla: 1.003 ± 0.529
0.602HisCys: 0.602 ± 0.291
1.304HisAsp: 1.304 ± 0.346
0.501HisGlu: 0.501 ± 0.264
1.905HisPhe: 1.905 ± 0.332
0.401HisGly: 0.401 ± 0.225
0.301HisHis: 0.301 ± 0.131
1.203HisIle: 1.203 ± 0.367
1.304HisLys: 1.304 ± 0.268
1.504HisLeu: 1.504 ± 0.492
0.602HisMet: 0.602 ± 0.635
0.702HisAsn: 0.702 ± 0.422
0.903HisPro: 0.903 ± 0.581
0.501HisGln: 0.501 ± 0.166
0.401HisArg: 0.401 ± 0.261
1.604HisSer: 1.604 ± 0.736
1.003HisThr: 1.003 ± 0.627
1.905HisVal: 1.905 ± 0.521
0.401HisTrp: 0.401 ± 0.296
1.404HisTyr: 1.404 ± 0.484
0.0HisXaa: 0.0 ± 0.0
Ile
2.106IleAla: 2.106 ± 0.44
2.407IleCys: 2.407 ± 0.724
3.109IleAsp: 3.109 ± 0.382
2.006IleGlu: 2.006 ± 0.555
3.51IlePhe: 3.51 ± 0.964
3.309IleGly: 3.309 ± 0.653
0.401IleHis: 0.401 ± 0.225
3.209IleIle: 3.209 ± 1.568
4.112IleLys: 4.112 ± 0.622
6.318IleLeu: 6.318 ± 1.443
1.103IleMet: 1.103 ± 0.388
4.312IleAsn: 4.312 ± 1.372
2.407IlePro: 2.407 ± 0.471
2.306IleGln: 2.306 ± 0.617
2.306IleArg: 2.306 ± 0.917
5.114IleSer: 5.114 ± 1.886
3.309IleThr: 3.309 ± 0.57
6.117IleVal: 6.117 ± 1.43
0.401IleTrp: 0.401 ± 0.411
1.705IleTyr: 1.705 ± 0.528
0.0IleXaa: 0.0 ± 0.0
Lys
3.109LysAla: 3.109 ± 0.89
1.905LysCys: 1.905 ± 0.398
2.607LysAsp: 2.607 ± 0.663
2.306LysGlu: 2.306 ± 0.704
4.011LysPhe: 4.011 ± 0.644
3.008LysGly: 3.008 ± 0.587
1.604LysHis: 1.604 ± 0.369
3.41LysIle: 3.41 ± 0.546
2.006LysLys: 2.006 ± 0.378
6.117LysLeu: 6.117 ± 1.118
0.903LysMet: 0.903 ± 0.58
2.306LysAsn: 2.306 ± 0.672
3.109LysPro: 3.109 ± 0.94
2.106LysGln: 2.106 ± 0.809
2.407LysArg: 2.407 ± 0.606
5.315LysSer: 5.315 ± 1.22
2.407LysThr: 2.407 ± 0.57
3.911LysVal: 3.911 ± 0.415
1.003LysTrp: 1.003 ± 0.314
2.908LysTyr: 2.908 ± 1.1
0.0LysXaa: 0.0 ± 0.0
Leu
5.315LeuAla: 5.315 ± 0.779
4.312LeuCys: 4.312 ± 0.944
4.112LeuAsp: 4.112 ± 0.631
3.109LeuGlu: 3.109 ± 0.696
6.719LeuPhe: 6.719 ± 0.923
4.813LeuGly: 4.813 ± 0.468
1.504LeuHis: 1.504 ± 0.357
4.914LeuIle: 4.914 ± 0.709
5.816LeuLys: 5.816 ± 1.225
10.028LeuLeu: 10.028 ± 3.742
1.604LeuMet: 1.604 ± 0.613
6.719LeuAsn: 6.719 ± 2.18
4.513LeuPro: 4.513 ± 0.909
2.808LeuGln: 2.808 ± 0.587
2.507LeuArg: 2.507 ± 0.674
8.123LeuSer: 8.123 ± 1.715
5.114LeuThr: 5.114 ± 0.92
6.418LeuVal: 6.418 ± 0.662
1.203LeuTrp: 1.203 ± 0.364
5.114LeuTyr: 5.114 ± 1.2
0.0LeuXaa: 0.0 ± 0.0
Met
0.903MetAla: 0.903 ± 0.328
0.702MetCys: 0.702 ± 0.27
1.003MetAsp: 1.003 ± 0.391
0.501MetGlu: 0.501 ± 0.157
1.304MetPhe: 1.304 ± 0.659
0.802MetGly: 0.802 ± 0.279
0.903MetHis: 0.903 ± 0.178
0.702MetIle: 0.702 ± 0.329
0.802MetLys: 0.802 ± 0.306
2.607MetLeu: 2.607 ± 0.624
0.602MetMet: 0.602 ± 0.317
1.203MetAsn: 1.203 ± 0.316
1.203MetPro: 1.203 ± 0.279
1.003MetGln: 1.003 ± 0.436
0.602MetArg: 0.602 ± 0.317
1.203MetSer: 1.203 ± 0.509
1.304MetThr: 1.304 ± 0.687
1.304MetVal: 1.304 ± 0.759
0.401MetTrp: 0.401 ± 0.289
1.103MetTyr: 1.103 ± 0.426
0.0MetXaa: 0.0 ± 0.0
Asn
3.008AsnAla: 3.008 ± 1.175
2.507AsnCys: 2.507 ± 0.636
5.215AsnAsp: 5.215 ± 1.144
2.006AsnGlu: 2.006 ± 0.367
4.914AsnPhe: 4.914 ± 2.263
4.112AsnGly: 4.112 ± 1.09
1.805AsnHis: 1.805 ± 1.021
3.209AsnIle: 3.209 ± 0.784
3.209AsnLys: 3.209 ± 0.274
5.716AsnLeu: 5.716 ± 0.553
1.404AsnMet: 1.404 ± 0.561
3.71AsnAsn: 3.71 ± 1.977
1.805AsnPro: 1.805 ± 0.52
1.404AsnGln: 1.404 ± 0.316
1.805AsnArg: 1.805 ± 0.454
4.513AsnSer: 4.513 ± 1.149
2.708AsnThr: 2.708 ± 1.098
5.616AsnVal: 5.616 ± 0.797
0.602AsnTrp: 0.602 ± 0.192
2.507AsnTyr: 2.507 ± 0.753
0.0AsnXaa: 0.0 ± 0.0
Pro
1.604ProAla: 1.604 ± 0.467
1.003ProCys: 1.003 ± 0.314
1.905ProAsp: 1.905 ± 0.273
0.903ProGlu: 0.903 ± 0.388
2.006ProPhe: 2.006 ± 0.433
2.206ProGly: 2.206 ± 1.198
1.203ProHis: 1.203 ± 0.329
2.808ProIle: 2.808 ± 0.968
1.805ProLys: 1.805 ± 0.454
3.209ProLeu: 3.209 ± 1.28
0.702ProMet: 0.702 ± 0.753
1.604ProAsn: 1.604 ± 0.698
1.404ProPro: 1.404 ± 0.609
1.103ProGln: 1.103 ± 0.967
1.304ProArg: 1.304 ± 0.384
2.607ProSer: 2.607 ± 1.374
2.407ProThr: 2.407 ± 0.643
2.407ProVal: 2.407 ± 0.573
0.201ProTrp: 0.201 ± 0.359
1.705ProTyr: 1.705 ± 1.095
0.0ProXaa: 0.0 ± 0.0
Gln
1.304GlnAla: 1.304 ± 0.27
0.501GlnCys: 0.501 ± 0.166
0.903GlnAsp: 0.903 ± 0.476
1.905GlnGlu: 1.905 ± 0.477
1.504GlnPhe: 1.504 ± 0.807
2.006GlnGly: 2.006 ± 0.737
0.702GlnHis: 0.702 ± 0.278
1.705GlnIle: 1.705 ± 0.279
1.604GlnLys: 1.604 ± 0.597
3.51GlnLeu: 3.51 ± 0.397
0.401GlnMet: 0.401 ± 0.227
1.705GlnAsn: 1.705 ± 0.567
0.903GlnPro: 0.903 ± 0.644
1.504GlnGln: 1.504 ± 0.446
0.602GlnArg: 0.602 ± 0.24
2.407GlnSer: 2.407 ± 0.5
1.905GlnThr: 1.905 ± 0.54
2.206GlnVal: 2.206 ± 0.512
0.702GlnTrp: 0.702 ± 0.37
1.504GlnTyr: 1.504 ± 0.445
0.0GlnXaa: 0.0 ± 0.0
Arg
2.206ArgAla: 2.206 ± 0.82
0.903ArgCys: 0.903 ± 0.24
2.106ArgAsp: 2.106 ± 0.493
0.802ArgGlu: 0.802 ± 0.35
2.206ArgPhe: 2.206 ± 0.848
1.805ArgGly: 1.805 ± 0.738
1.203ArgHis: 1.203 ± 0.56
1.705ArgIle: 1.705 ± 0.556
1.805ArgLys: 1.805 ± 0.465
3.109ArgLeu: 3.109 ± 0.648
0.602ArgMet: 0.602 ± 0.192
1.304ArgAsn: 1.304 ± 0.214
1.103ArgPro: 1.103 ± 0.478
1.203ArgGln: 1.203 ± 0.248
1.003ArgArg: 1.003 ± 0.765
2.407ArgSer: 2.407 ± 1.636
1.103ArgThr: 1.103 ± 0.368
2.607ArgVal: 2.607 ± 0.932
0.401ArgTrp: 0.401 ± 0.359
1.705ArgTyr: 1.705 ± 0.387
0.0ArgXaa: 0.0 ± 0.0
Ser
3.811SerAla: 3.811 ± 0.72
3.008SerCys: 3.008 ± 1.206
4.713SerAsp: 4.713 ± 0.709
2.808SerGlu: 2.808 ± 1.343
4.212SerPhe: 4.212 ± 1.017
4.011SerGly: 4.011 ± 1.26
2.106SerHis: 2.106 ± 1.214
4.813SerIle: 4.813 ± 0.922
4.312SerLys: 4.312 ± 0.658
9.226SerLeu: 9.226 ± 1.655
1.705SerMet: 1.705 ± 0.744
3.811SerAsn: 3.811 ± 1.836
1.905SerPro: 1.905 ± 0.605
2.306SerGln: 2.306 ± 0.363
2.908SerArg: 2.908 ± 1.969
7.12SerSer: 7.12 ± 2.94
3.811SerThr: 3.811 ± 0.461
7.722SerVal: 7.722 ± 1.916
0.903SerTrp: 0.903 ± 0.641
4.212SerTyr: 4.212 ± 0.968
0.0SerXaa: 0.0 ± 0.0
Thr
3.309ThrAla: 3.309 ± 0.615
1.304ThrCys: 1.304 ± 0.521
2.908ThrAsp: 2.908 ± 0.836
1.103ThrGlu: 1.103 ± 0.426
3.71ThrPhe: 3.71 ± 0.55
5.114ThrGly: 5.114 ± 0.731
0.903ThrHis: 0.903 ± 0.387
4.112ThrIle: 4.112 ± 1.282
3.109ThrLys: 3.109 ± 0.457
4.412ThrLeu: 4.412 ± 1.28
1.604ThrMet: 1.604 ± 0.558
3.109ThrAsn: 3.109 ± 0.848
1.504ThrPro: 1.504 ± 0.845
1.805ThrGln: 1.805 ± 0.8
1.504ThrArg: 1.504 ± 0.517
3.51ThrSer: 3.51 ± 0.707
3.51ThrThr: 3.51 ± 0.862
4.613ThrVal: 4.613 ± 0.859
0.401ThrTrp: 0.401 ± 0.211
2.908ThrTyr: 2.908 ± 0.587
0.0ThrXaa: 0.0 ± 0.0
Val
5.716ValAla: 5.716 ± 1.563
2.708ValCys: 2.708 ± 1.106
6.217ValAsp: 6.217 ± 1.169
3.109ValGlu: 3.109 ± 0.867
4.011ValPhe: 4.011 ± 0.403
4.312ValGly: 4.312 ± 0.72
1.203ValHis: 1.203 ± 0.377
4.914ValIle: 4.914 ± 1.402
6.117ValLys: 6.117 ± 1.017
8.323ValLeu: 8.323 ± 1.455
2.306ValMet: 2.306 ± 1.216
5.616ValAsn: 5.616 ± 0.649
3.41ValPro: 3.41 ± 0.526
2.908ValGln: 2.908 ± 0.27
2.206ValArg: 2.206 ± 0.683
5.816ValSer: 5.816 ± 0.729
5.014ValThr: 5.014 ± 0.844
9.326ValVal: 9.326 ± 2.825
1.504ValTrp: 1.504 ± 0.766
5.315ValTyr: 5.315 ± 0.828
0.0ValXaa: 0.0 ± 0.0
Trp
0.401TrpAla: 0.401 ± 0.187
0.602TrpCys: 0.602 ± 0.215
0.602TrpAsp: 0.602 ± 0.317
0.201TrpGlu: 0.201 ± 0.148
1.304TrpPhe: 1.304 ± 0.266
0.301TrpGly: 0.301 ± 0.159
0.401TrpHis: 0.401 ± 0.135
0.602TrpIle: 0.602 ± 0.307
0.201TrpLys: 0.201 ± 0.106
1.905TrpLeu: 1.905 ± 0.282
0.1TrpMet: 0.1 ± 0.179
1.103TrpAsn: 1.103 ± 0.267
0.501TrpPro: 0.501 ± 0.439
0.401TrpGln: 0.401 ± 0.135
0.702TrpArg: 0.702 ± 0.52
0.903TrpSer: 0.903 ± 0.321
0.401TrpThr: 0.401 ± 0.211
0.903TrpVal: 0.903 ± 0.207
0.1TrpTrp: 0.1 ± 0.33
1.003TrpTyr: 1.003 ± 0.45
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.209TyrAla: 3.209 ± 0.566
1.705TyrCys: 1.705 ± 0.51
3.309TyrAsp: 3.309 ± 0.614
1.905TyrGlu: 1.905 ± 0.398
2.908TyrPhe: 2.908 ± 0.648
3.811TyrGly: 3.811 ± 0.663
0.802TyrHis: 0.802 ± 0.342
2.206TyrIle: 2.206 ± 0.414
3.41TyrLys: 3.41 ± 0.842
3.911TyrLeu: 3.911 ± 0.532
0.501TyrMet: 0.501 ± 0.264
4.212TyrAsn: 4.212 ± 0.894
1.304TyrPro: 1.304 ± 0.229
1.404TyrGln: 1.404 ± 0.316
2.006TyrArg: 2.006 ± 0.497
3.61TyrSer: 3.61 ± 0.531
3.309TyrThr: 3.309 ± 0.524
5.917TyrVal: 5.917 ± 0.888
0.802TyrTrp: 0.802 ± 0.175
3.811TyrTyr: 3.811 ± 0.989
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (9973 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski