Amino acid dipepetide frequency for Sparrow coronavirus HKU17

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.218AlaAla: 5.218 ± 0.721
2.372AlaCys: 2.372 ± 0.621
4.625AlaAsp: 4.625 ± 1.175
2.609AlaGlu: 2.609 ± 0.736
4.388AlaPhe: 4.388 ± 0.928
3.439AlaGly: 3.439 ± 0.578
2.372AlaHis: 2.372 ± 0.858
5.93AlaIle: 5.93 ± 0.411
5.218AlaLys: 5.218 ± 1.62
8.539AlaLeu: 8.539 ± 0.695
1.779AlaMet: 1.779 ± 0.307
4.625AlaAsn: 4.625 ± 0.969
2.609AlaPro: 2.609 ± 0.502
3.439AlaGln: 3.439 ± 0.638
3.558AlaArg: 3.558 ± 0.561
4.388AlaSer: 4.388 ± 0.603
5.455AlaThr: 5.455 ± 0.826
5.574AlaVal: 5.574 ± 1.005
0.356AlaTrp: 0.356 ± 0.582
2.609AlaTyr: 2.609 ± 0.761
0.0AlaXaa: 0.0 ± 0.0
Cys
1.542CysAla: 1.542 ± 0.603
1.067CysCys: 1.067 ± 0.383
1.423CysAsp: 1.423 ± 0.544
1.067CysGlu: 1.067 ± 0.478
1.779CysPhe: 1.779 ± 0.634
1.542CysGly: 1.542 ± 0.454
0.237CysHis: 0.237 ± 0.18
1.898CysIle: 1.898 ± 0.741
0.949CysLys: 0.949 ± 0.487
1.779CysLeu: 1.779 ± 1.295
0.712CysMet: 0.712 ± 0.221
1.779CysAsn: 1.779 ± 0.862
1.66CysPro: 1.66 ± 0.521
0.712CysGln: 0.712 ± 0.365
1.067CysArg: 1.067 ± 0.353
1.186CysSer: 1.186 ± 0.367
1.542CysThr: 1.542 ± 0.826
1.898CysVal: 1.898 ± 0.491
0.356CysTrp: 0.356 ± 0.183
1.423CysTyr: 1.423 ± 0.902
0.0CysXaa: 0.0 ± 0.0
Asp
4.981AspAla: 4.981 ± 1.207
1.305AspCys: 1.305 ± 0.486
3.083AspAsp: 3.083 ± 0.568
2.253AspGlu: 2.253 ± 0.99
2.491AspPhe: 2.491 ± 0.611
3.914AspGly: 3.914 ± 0.869
0.83AspHis: 0.83 ± 0.709
3.321AspIle: 3.321 ± 1.174
2.253AspLys: 2.253 ± 0.739
4.507AspLeu: 4.507 ± 0.86
0.712AspMet: 0.712 ± 0.352
3.083AspAsn: 3.083 ± 1.205
2.253AspPro: 2.253 ± 1.127
2.016AspGln: 2.016 ± 0.603
1.66AspArg: 1.66 ± 0.428
3.202AspSer: 3.202 ± 1.058
2.965AspThr: 2.965 ± 0.922
4.151AspVal: 4.151 ± 1.061
0.593AspTrp: 0.593 ± 0.358
3.202AspTyr: 3.202 ± 0.661
0.0AspXaa: 0.0 ± 0.0
Glu
3.202GluAla: 3.202 ± 0.777
1.067GluCys: 1.067 ± 0.373
1.66GluAsp: 1.66 ± 0.523
2.846GluGlu: 2.846 ± 1.699
2.016GluPhe: 2.016 ± 0.486
2.372GluGly: 2.372 ± 0.661
1.305GluHis: 1.305 ± 0.403
1.779GluIle: 1.779 ± 0.47
2.135GluLys: 2.135 ± 0.675
4.507GluLeu: 4.507 ± 1.362
0.949GluMet: 0.949 ± 0.247
1.542GluAsn: 1.542 ± 0.364
2.491GluPro: 2.491 ± 0.679
2.016GluGln: 2.016 ± 0.691
1.66GluArg: 1.66 ± 0.802
2.253GluSer: 2.253 ± 1.312
2.372GluThr: 2.372 ± 0.754
2.728GluVal: 2.728 ± 1.03
0.712GluTrp: 0.712 ± 0.634
2.016GluTyr: 2.016 ± 0.68
0.0GluXaa: 0.0 ± 0.0
Phe
2.728PheAla: 2.728 ± 0.861
1.067PheCys: 1.067 ± 0.373
2.491PheAsp: 2.491 ± 0.72
1.542PheGlu: 1.542 ± 0.587
0.83PhePhe: 0.83 ± 0.344
2.491PheGly: 2.491 ± 1.426
1.067PheHis: 1.067 ± 0.513
2.965PheIle: 2.965 ± 1.4
2.253PheLys: 2.253 ± 0.704
4.032PheLeu: 4.032 ± 1.108
0.593PheMet: 0.593 ± 0.388
2.846PheAsn: 2.846 ± 0.508
1.779PhePro: 1.779 ± 0.438
1.898PheGln: 1.898 ± 1.217
1.067PheArg: 1.067 ± 0.68
3.439PheSer: 3.439 ± 1.052
3.676PheThr: 3.676 ± 1.142
3.202PheVal: 3.202 ± 0.987
0.237PheTrp: 0.237 ± 0.122
2.965PheTyr: 2.965 ± 1.03
0.0PheXaa: 0.0 ± 0.0
Gly
3.083GlyAla: 3.083 ± 0.933
1.898GlyCys: 1.898 ± 1.495
2.965GlyAsp: 2.965 ± 0.832
1.423GlyGlu: 1.423 ± 0.527
2.135GlyPhe: 2.135 ± 0.464
3.439GlyGly: 3.439 ± 1.501
1.305GlyHis: 1.305 ± 1.116
3.558GlyIle: 3.558 ± 0.67
3.083GlyLys: 3.083 ± 0.62
3.202GlyLeu: 3.202 ± 1.374
0.356GlyMet: 0.356 ± 0.361
3.439GlyAsn: 3.439 ± 1.106
2.016GlyPro: 2.016 ± 1.803
1.305GlyGln: 1.305 ± 0.459
1.66GlyArg: 1.66 ± 0.423
3.795GlySer: 3.795 ± 0.741
5.455GlyThr: 5.455 ± 0.809
5.1GlyVal: 5.1 ± 0.667
0.593GlyTrp: 0.593 ± 0.183
2.016GlyTyr: 2.016 ± 0.532
0.0GlyXaa: 0.0 ± 0.0
His
2.135HisAla: 2.135 ± 0.726
0.593HisCys: 0.593 ± 0.792
1.067HisAsp: 1.067 ± 0.373
0.949HisGlu: 0.949 ± 0.455
1.305HisPhe: 1.305 ± 0.486
0.83HisGly: 0.83 ± 1.015
0.593HisHis: 0.593 ± 0.183
1.779HisIle: 1.779 ± 0.571
1.067HisLys: 1.067 ± 0.548
3.083HisLeu: 3.083 ± 0.62
0.593HisMet: 0.593 ± 0.331
0.712HisAsn: 0.712 ± 0.365
1.305HisPro: 1.305 ± 0.451
1.423HisGln: 1.423 ± 1.167
0.474HisArg: 0.474 ± 0.244
1.186HisSer: 1.186 ± 0.715
1.779HisThr: 1.779 ± 0.428
2.728HisVal: 2.728 ± 0.803
0.119HisTrp: 0.119 ± 0.427
1.305HisTyr: 1.305 ± 0.937
0.0HisXaa: 0.0 ± 0.0
Ile
4.388IleAla: 4.388 ± 0.962
1.305IleCys: 1.305 ± 0.374
4.032IleAsp: 4.032 ± 0.89
2.609IleGlu: 2.609 ± 1.106
2.253IlePhe: 2.253 ± 0.645
2.728IleGly: 2.728 ± 0.655
1.305IleHis: 1.305 ± 0.401
3.558IleIle: 3.558 ± 1.28
3.083IleLys: 3.083 ± 0.893
5.811IleLeu: 5.811 ± 1.84
1.186IleMet: 1.186 ± 0.596
3.439IleAsn: 3.439 ± 0.9
3.321IlePro: 3.321 ± 0.468
2.135IleGln: 2.135 ± 0.955
2.135IleArg: 2.135 ± 0.726
4.032IleSer: 4.032 ± 1.119
3.914IleThr: 3.914 ± 2.421
4.507IleVal: 4.507 ± 1.27
0.593IleTrp: 0.593 ± 0.308
3.321IleTyr: 3.321 ± 0.887
0.0IleXaa: 0.0 ± 0.0
Lys
4.507LysAla: 4.507 ± 1.153
1.779LysCys: 1.779 ± 0.526
2.253LysAsp: 2.253 ± 0.855
1.779LysGlu: 1.779 ± 0.462
2.253LysPhe: 2.253 ± 0.676
1.898LysGly: 1.898 ± 0.642
1.305LysHis: 1.305 ± 0.66
2.372LysIle: 2.372 ± 0.577
3.202LysLys: 3.202 ± 1.29
5.574LysLeu: 5.574 ± 1.044
0.474LysMet: 0.474 ± 0.244
1.542LysAsn: 1.542 ± 0.386
4.151LysPro: 4.151 ± 3.031
1.423LysGln: 1.423 ± 1.446
1.66LysArg: 1.66 ± 0.948
2.135LysSer: 2.135 ± 0.726
4.981LysThr: 4.981 ± 0.787
3.558LysVal: 3.558 ± 0.541
0.356LysTrp: 0.356 ± 0.159
3.202LysTyr: 3.202 ± 0.709
0.119LysXaa: 0.119 ± 0.061
Leu
10.911LeuAla: 10.911 ± 1.641
1.66LeuCys: 1.66 ± 0.503
4.032LeuAsp: 4.032 ± 1.134
4.388LeuGlu: 4.388 ± 2.169
4.388LeuPhe: 4.388 ± 0.767
3.795LeuGly: 3.795 ± 1.084
1.186LeuHis: 1.186 ± 0.661
3.676LeuIle: 3.676 ± 1.727
4.625LeuLys: 4.625 ± 1.11
8.42LeuLeu: 8.42 ± 1.88
1.542LeuMet: 1.542 ± 0.461
5.811LeuAsn: 5.811 ± 1.792
5.811LeuPro: 5.811 ± 1.368
5.1LeuGln: 5.1 ± 1.113
3.083LeuArg: 3.083 ± 0.69
5.337LeuSer: 5.337 ± 0.884
8.302LeuThr: 8.302 ± 1.269
6.048LeuVal: 6.048 ± 1.692
0.593LeuTrp: 0.593 ± 0.534
4.388LeuTyr: 4.388 ± 1.585
0.119LeuXaa: 0.119 ± 0.061
Met
2.135MetAla: 2.135 ± 1.294
0.712MetCys: 0.712 ± 0.479
0.356MetAsp: 0.356 ± 0.321
0.474MetGlu: 0.474 ± 0.244
0.949MetPhe: 0.949 ± 0.321
1.067MetGly: 1.067 ± 0.339
0.593MetHis: 0.593 ± 0.304
0.593MetIle: 0.593 ± 0.475
0.474MetLys: 0.474 ± 0.32
2.491MetLeu: 2.491 ± 0.878
0.356MetMet: 0.356 ± 0.301
0.593MetAsn: 0.593 ± 0.183
0.83MetPro: 0.83 ± 0.342
0.712MetGln: 0.712 ± 0.221
0.593MetArg: 0.593 ± 0.304
0.949MetSer: 0.949 ± 0.418
0.949MetThr: 0.949 ± 0.487
1.542MetVal: 1.542 ± 0.601
0.119MetTrp: 0.119 ± 0.061
0.712MetTyr: 0.712 ± 0.318
0.0MetXaa: 0.0 ± 0.0
Asn
4.507AsnAla: 4.507 ± 0.41
1.186AsnCys: 1.186 ± 0.568
1.66AsnAsp: 1.66 ± 0.53
2.491AsnGlu: 2.491 ± 0.712
2.253AsnPhe: 2.253 ± 0.991
4.269AsnGly: 4.269 ± 1.511
1.305AsnHis: 1.305 ± 0.425
3.321AsnIle: 3.321 ± 1.733
2.728AsnLys: 2.728 ± 0.64
5.1AsnLeu: 5.1 ± 0.999
1.186AsnMet: 1.186 ± 0.386
3.202AsnAsn: 3.202 ± 0.869
2.965AsnPro: 2.965 ± 1.679
2.846AsnGln: 2.846 ± 0.627
2.728AsnArg: 2.728 ± 0.344
2.491AsnSer: 2.491 ± 0.411
3.558AsnThr: 3.558 ± 0.78
4.269AsnVal: 4.269 ± 1.13
0.237AsnTrp: 0.237 ± 0.333
2.135AsnTyr: 2.135 ± 0.678
0.0AsnXaa: 0.0 ± 0.0
Pro
2.846ProAla: 2.846 ± 1.126
0.83ProCys: 0.83 ± 0.342
2.728ProAsp: 2.728 ± 0.51
3.558ProGlu: 3.558 ± 0.722
1.898ProPhe: 1.898 ± 0.58
3.795ProGly: 3.795 ± 1.655
1.423ProHis: 1.423 ± 0.56
3.795ProIle: 3.795 ± 0.717
2.609ProLys: 2.609 ± 2.085
3.558ProLeu: 3.558 ± 0.938
0.712ProMet: 0.712 ± 0.289
2.609ProAsn: 2.609 ± 0.73
3.202ProPro: 3.202 ± 0.814
2.253ProGln: 2.253 ± 0.512
2.609ProArg: 2.609 ± 2.636
3.202ProSer: 3.202 ± 0.772
4.269ProThr: 4.269 ± 1.172
2.965ProVal: 2.965 ± 0.626
0.474ProTrp: 0.474 ± 0.36
1.66ProTyr: 1.66 ± 0.41
0.0ProXaa: 0.0 ± 0.0
Gln
3.676GlnAla: 3.676 ± 0.919
0.474GlnCys: 0.474 ± 0.244
1.66GlnAsp: 1.66 ± 0.642
2.372GlnGlu: 2.372 ± 0.942
0.949GlnPhe: 0.949 ± 0.513
1.898GlnGly: 1.898 ± 0.773
1.305GlnHis: 1.305 ± 0.492
2.491GlnIle: 2.491 ± 0.778
1.542GlnLys: 1.542 ± 0.549
4.981GlnLeu: 4.981 ± 1.04
0.949GlnMet: 0.949 ± 0.602
2.135GlnAsn: 2.135 ± 0.433
2.491GlnPro: 2.491 ± 0.612
2.372GlnGln: 2.372 ± 0.443
1.423GlnArg: 1.423 ± 0.438
4.388GlnSer: 4.388 ± 1.541
2.609GlnThr: 2.609 ± 0.579
2.372GlnVal: 2.372 ± 0.652
0.593GlnTrp: 0.593 ± 0.183
2.609GlnTyr: 2.609 ± 1.047
0.0GlnXaa: 0.0 ± 0.0
Arg
2.491ArgAla: 2.491 ± 0.897
1.542ArgCys: 1.542 ± 0.541
1.898ArgAsp: 1.898 ± 0.535
1.423ArgGlu: 1.423 ± 0.643
2.016ArgPhe: 2.016 ± 0.798
1.779ArgGly: 1.779 ± 1.973
1.66ArgHis: 1.66 ± 0.423
1.66ArgIle: 1.66 ± 0.411
1.66ArgLys: 1.66 ± 1.137
2.965ArgLeu: 2.965 ± 0.59
0.712ArgMet: 0.712 ± 0.487
2.609ArgAsn: 2.609 ± 0.922
1.305ArgPro: 1.305 ± 0.83
2.016ArgGln: 2.016 ± 0.345
0.949ArgArg: 0.949 ± 0.536
2.253ArgSer: 2.253 ± 1.079
2.491ArgThr: 2.491 ± 0.452
2.135ArgVal: 2.135 ± 0.626
0.237ArgTrp: 0.237 ± 0.122
2.016ArgTyr: 2.016 ± 0.771
0.0ArgXaa: 0.0 ± 0.0
Ser
5.693SerAla: 5.693 ± 1.755
0.83SerCys: 0.83 ± 0.435
4.151SerAsp: 4.151 ± 1.078
1.542SerGlu: 1.542 ± 0.678
2.491SerPhe: 2.491 ± 0.6
3.558SerGly: 3.558 ± 0.59
0.949SerHis: 0.949 ± 0.77
4.151SerIle: 4.151 ± 2.573
1.779SerLys: 1.779 ± 0.539
5.574SerLeu: 5.574 ± 1.347
1.067SerMet: 1.067 ± 0.373
2.728SerAsn: 2.728 ± 0.54
3.676SerPro: 3.676 ± 1.328
2.491SerGln: 2.491 ± 0.87
2.016SerArg: 2.016 ± 0.705
4.744SerSer: 4.744 ± 2.562
5.337SerThr: 5.337 ± 2.04
4.151SerVal: 4.151 ± 1.419
1.067SerTrp: 1.067 ± 0.472
3.439SerTyr: 3.439 ± 0.683
0.0SerXaa: 0.0 ± 0.0
Thr
5.218ThrAla: 5.218 ± 1.704
1.779ThrCys: 1.779 ± 0.436
4.625ThrAsp: 4.625 ± 1.168
2.609ThrGlu: 2.609 ± 0.761
3.676ThrPhe: 3.676 ± 1.746
3.439ThrGly: 3.439 ± 1.696
2.728ThrHis: 2.728 ± 0.538
5.574ThrIle: 5.574 ± 0.937
3.914ThrLys: 3.914 ± 0.629
6.641ThrLeu: 6.641 ± 0.876
1.186ThrMet: 1.186 ± 0.609
4.032ThrAsn: 4.032 ± 1.189
4.507ThrPro: 4.507 ± 0.65
3.083ThrGln: 3.083 ± 1.115
2.135ThrArg: 2.135 ± 2.05
4.507ThrSer: 4.507 ± 0.93
7.472ThrThr: 7.472 ± 1.438
7.472ThrVal: 7.472 ± 1.587
0.83ThrTrp: 0.83 ± 0.75
3.202ThrTyr: 3.202 ± 0.459
0.0ThrXaa: 0.0 ± 0.0
Val
6.404ValAla: 6.404 ± 0.931
2.372ValCys: 2.372 ± 0.564
4.862ValAsp: 4.862 ± 0.922
3.676ValGlu: 3.676 ± 1.0
2.491ValPhe: 2.491 ± 0.684
3.439ValGly: 3.439 ± 1.087
1.898ValHis: 1.898 ± 0.44
3.676ValIle: 3.676 ± 0.946
4.862ValLys: 4.862 ± 1.135
6.879ValLeu: 6.879 ± 1.33
0.83ValMet: 0.83 ± 0.392
4.032ValAsn: 4.032 ± 1.116
2.491ValPro: 2.491 ± 0.99
2.965ValGln: 2.965 ± 0.574
2.728ValArg: 2.728 ± 0.774
4.269ValSer: 4.269 ± 0.99
5.811ValThr: 5.811 ± 0.626
9.962ValVal: 9.962 ± 2.3
0.474ValTrp: 0.474 ± 0.32
3.676ValTyr: 3.676 ± 0.887
0.119ValXaa: 0.119 ± 0.061
Trp
1.067TrpAla: 1.067 ± 1.608
0.0TrpCys: 0.0 ± 0.0
0.83TrpAsp: 0.83 ± 0.357
0.237TrpGlu: 0.237 ± 0.122
0.83TrpPhe: 0.83 ± 0.46
0.237TrpGly: 0.237 ± 0.18
0.237TrpHis: 0.237 ± 0.122
0.593TrpIle: 0.593 ± 0.37
0.237TrpLys: 0.237 ± 0.404
1.305TrpLeu: 1.305 ± 0.634
0.119TrpMet: 0.119 ± 0.206
0.237TrpAsn: 0.237 ± 0.18
0.237TrpPro: 0.237 ± 0.18
0.356TrpGln: 0.356 ± 0.649
0.237TrpArg: 0.237 ± 0.333
0.712TrpSer: 0.712 ± 0.318
0.712TrpThr: 0.712 ± 0.365
0.474TrpVal: 0.474 ± 0.344
0.119TrpTrp: 0.119 ± 0.061
0.237TrpTyr: 0.237 ± 0.18
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.372TyrAla: 2.372 ± 0.367
1.66TyrCys: 1.66 ± 0.535
2.372TyrAsp: 2.372 ± 0.858
1.779TyrGlu: 1.779 ± 0.805
1.779TyrPhe: 1.779 ± 0.721
1.898TyrGly: 1.898 ± 0.999
1.423TyrHis: 1.423 ± 0.544
2.965TyrIle: 2.965 ± 1.245
2.846TyrLys: 2.846 ± 0.754
4.269TyrLeu: 4.269 ± 1.049
1.067TyrMet: 1.067 ± 0.457
3.439TyrAsn: 3.439 ± 0.873
1.898TyrPro: 1.898 ± 0.621
2.609TyrGln: 2.609 ± 0.775
2.372TyrArg: 2.372 ± 0.6
2.965TyrSer: 2.965 ± 1.302
4.981TyrThr: 4.981 ± 0.754
3.083TyrVal: 3.083 ± 0.665
0.356TyrTrp: 0.356 ± 0.579
2.253TyrTyr: 2.253 ± 0.567
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.119XaaCys: 0.119 ± 0.061
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.119XaaIle: 0.119 ± 0.061
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.119XaaGln: 0.119 ± 0.061
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (8433 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski