Amino acid dipepetide frequency for Bat coronavirus HKU9-4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.032AlaAla: 5.032 ± 1.369
2.726AlaCys: 2.726 ± 0.806
4.508AlaAsp: 4.508 ± 0.715
1.677AlaGlu: 1.677 ± 0.489
4.403AlaPhe: 4.403 ± 0.867
3.984AlaGly: 3.984 ± 1.138
1.258AlaHis: 1.258 ± 0.406
4.613AlaIle: 4.613 ± 1.329
3.459AlaLys: 3.459 ± 0.801
6.08AlaLeu: 6.08 ± 0.81
2.83AlaMet: 2.83 ± 0.801
4.193AlaAsn: 4.193 ± 0.767
3.04AlaPro: 3.04 ± 1.264
2.83AlaGln: 2.83 ± 0.519
3.25AlaArg: 3.25 ± 0.755
5.032AlaSer: 5.032 ± 1.6
4.717AlaThr: 4.717 ± 1.058
7.653AlaVal: 7.653 ± 1.651
0.524AlaTrp: 0.524 ± 0.384
3.25AlaTyr: 3.25 ± 1.1
0.0AlaXaa: 0.0 ± 0.0
Cys
1.887CysAla: 1.887 ± 0.463
0.943CysCys: 0.943 ± 0.375
2.201CysAsp: 2.201 ± 0.449
0.629CysGlu: 0.629 ± 0.334
1.572CysPhe: 1.572 ± 0.705
2.411CysGly: 2.411 ± 0.692
0.419CysHis: 0.419 ± 0.222
1.153CysIle: 1.153 ± 1.316
1.468CysLys: 1.468 ± 0.558
2.306CysLeu: 2.306 ± 0.69
0.839CysMet: 0.839 ± 0.297
1.258CysAsn: 1.258 ± 0.406
0.943CysPro: 0.943 ± 0.375
0.839CysGln: 0.839 ± 0.323
0.943CysArg: 0.943 ± 0.348
2.516CysSer: 2.516 ± 0.965
2.411CysThr: 2.411 ± 0.769
3.25CysVal: 3.25 ± 0.856
0.419CysTrp: 0.419 ± 0.222
2.516CysTyr: 2.516 ± 0.792
0.0CysXaa: 0.0 ± 0.0
Asp
3.669AspAla: 3.669 ± 0.797
0.943AspCys: 0.943 ± 0.332
2.726AspAsp: 2.726 ± 0.871
2.306AspGlu: 2.306 ± 0.739
3.04AspPhe: 3.04 ± 0.932
3.984AspGly: 3.984 ± 1.159
0.21AspHis: 0.21 ± 0.111
2.935AspIle: 2.935 ± 1.026
2.516AspLys: 2.516 ± 0.998
4.508AspLeu: 4.508 ± 0.962
1.153AspMet: 1.153 ± 0.397
1.992AspAsn: 1.992 ± 0.82
2.516AspPro: 2.516 ± 1.521
1.258AspGln: 1.258 ± 0.383
1.782AspArg: 1.782 ± 0.431
3.25AspSer: 3.25 ± 1.123
4.717AspThr: 4.717 ± 1.153
4.927AspVal: 4.927 ± 1.613
0.839AspTrp: 0.839 ± 0.345
3.459AspTyr: 3.459 ± 1.241
0.0AspXaa: 0.0 ± 0.0
Glu
2.83GluAla: 2.83 ± 0.795
1.048GluCys: 1.048 ± 0.369
2.097GluAsp: 2.097 ± 0.936
2.621GluGlu: 2.621 ± 1.514
1.363GluPhe: 1.363 ± 0.44
3.774GluGly: 3.774 ± 1.141
1.153GluHis: 1.153 ± 0.368
1.048GluIle: 1.048 ± 0.382
1.153GluLys: 1.153 ± 0.477
4.403GluLeu: 4.403 ± 0.696
0.524GluMet: 0.524 ± 0.503
2.201GluAsn: 2.201 ± 0.585
1.887GluPro: 1.887 ± 0.514
1.677GluGln: 1.677 ± 0.312
1.677GluArg: 1.677 ± 0.489
3.25GluSer: 3.25 ± 0.642
1.782GluThr: 1.782 ± 0.415
3.355GluVal: 3.355 ± 0.999
0.105GluTrp: 0.105 ± 0.309
1.048GluTyr: 1.048 ± 0.454
0.0GluXaa: 0.0 ± 0.0
Phe
2.621PheAla: 2.621 ± 1.734
1.468PheCys: 1.468 ± 0.516
2.621PheAsp: 2.621 ± 0.922
1.677PheGlu: 1.677 ± 0.605
0.839PhePhe: 0.839 ± 0.328
3.355PheGly: 3.355 ± 1.459
0.629PheHis: 0.629 ± 0.334
2.621PheIle: 2.621 ± 1.007
2.726PheLys: 2.726 ± 0.703
2.411PheLeu: 2.411 ± 0.772
2.097PheMet: 2.097 ± 0.768
3.355PheAsn: 3.355 ± 1.032
1.048PhePro: 1.048 ± 0.842
1.153PheGln: 1.153 ± 0.368
1.572PheArg: 1.572 ± 0.73
3.25PheSer: 3.25 ± 0.614
3.564PheThr: 3.564 ± 0.874
4.298PheVal: 4.298 ± 1.042
0.734PheTrp: 0.734 ± 0.298
2.83PheTyr: 2.83 ± 0.537
0.0PheXaa: 0.0 ± 0.0
Gly
4.822GlyAla: 4.822 ± 1.196
1.572GlyCys: 1.572 ± 0.608
3.459GlyAsp: 3.459 ± 0.648
2.411GlyGlu: 2.411 ± 0.825
3.774GlyPhe: 3.774 ± 0.658
4.088GlyGly: 4.088 ± 1.111
0.943GlyHis: 0.943 ± 0.375
2.411GlyIle: 2.411 ± 0.575
2.411GlyLys: 2.411 ± 0.677
4.403GlyLeu: 4.403 ± 0.875
0.839GlyMet: 0.839 ± 0.302
3.355GlyAsn: 3.355 ± 2.009
2.201GlyPro: 2.201 ± 0.837
1.468GlyGln: 1.468 ± 0.389
2.83GlyArg: 2.83 ± 1.774
4.822GlySer: 4.822 ± 1.304
5.451GlyThr: 5.451 ± 0.601
7.653GlyVal: 7.653 ± 1.207
1.153GlyTrp: 1.153 ± 0.484
3.04GlyTyr: 3.04 ± 0.862
0.0GlyXaa: 0.0 ± 0.0
His
1.677HisAla: 1.677 ± 0.465
0.524HisCys: 0.524 ± 0.475
0.629HisAsp: 0.629 ± 0.226
0.314HisGlu: 0.314 ± 0.167
0.839HisPhe: 0.839 ± 0.351
1.258HisGly: 1.258 ± 0.747
0.314HisHis: 0.314 ± 0.167
1.048HisIle: 1.048 ± 0.435
1.048HisLys: 1.048 ± 0.946
1.468HisLeu: 1.468 ± 0.642
0.314HisMet: 0.314 ± 0.392
0.943HisAsn: 0.943 ± 0.54
0.943HisPro: 0.943 ± 0.39
0.419HisGln: 0.419 ± 0.222
0.629HisArg: 0.629 ± 0.634
0.839HisSer: 0.839 ± 0.29
1.782HisThr: 1.782 ± 0.612
2.097HisVal: 2.097 ± 0.497
0.21HisTrp: 0.21 ± 0.686
0.943HisTyr: 0.943 ± 0.306
0.0HisXaa: 0.0 ± 0.0
Ile
3.669IleAla: 3.669 ± 2.306
1.258IleCys: 1.258 ± 0.887
2.201IleAsp: 2.201 ± 0.777
1.572IleGlu: 1.572 ± 0.455
1.363IlePhe: 1.363 ± 0.346
2.83IleGly: 2.83 ± 0.744
1.048IleHis: 1.048 ± 0.344
1.887IleIle: 1.887 ± 0.934
2.411IleLys: 2.411 ± 1.009
4.927IleLeu: 4.927 ± 1.887
1.153IleMet: 1.153 ± 0.54
2.83IleAsn: 2.83 ± 1.194
1.992IlePro: 1.992 ± 1.152
1.258IleGln: 1.258 ± 0.795
1.992IleArg: 1.992 ± 0.673
3.25IleSer: 3.25 ± 0.962
2.411IleThr: 2.411 ± 0.667
3.355IleVal: 3.355 ± 1.025
0.629IleTrp: 0.629 ± 0.29
1.258IleTyr: 1.258 ± 0.371
0.0IleXaa: 0.0 ± 0.0
Lys
3.25LysAla: 3.25 ± 0.964
1.782LysCys: 1.782 ± 0.447
2.411LysAsp: 2.411 ± 0.714
2.306LysGlu: 2.306 ± 0.415
1.992LysPhe: 1.992 ± 0.558
3.145LysGly: 3.145 ± 0.866
1.468LysHis: 1.468 ± 0.728
1.258LysIle: 1.258 ± 0.569
1.782LysLys: 1.782 ± 1.645
4.613LysLeu: 4.613 ± 1.223
1.048LysMet: 1.048 ± 0.453
1.992LysAsn: 1.992 ± 0.648
3.25LysPro: 3.25 ± 1.09
1.782LysGln: 1.782 ± 0.518
3.04LysArg: 3.04 ± 0.885
2.621LysSer: 2.621 ± 0.423
2.306LysThr: 2.306 ± 0.873
4.927LysVal: 4.927 ± 0.689
0.419LysTrp: 0.419 ± 0.151
2.621LysTyr: 2.621 ± 0.872
0.0LysXaa: 0.0 ± 0.0
Leu
7.548LeuAla: 7.548 ± 1.314
3.459LeuCys: 3.459 ± 0.913
4.403LeuAsp: 4.403 ± 1.414
4.508LeuGlu: 4.508 ± 1.156
3.04LeuPhe: 3.04 ± 0.886
4.822LeuGly: 4.822 ± 0.92
2.411LeuHis: 2.411 ± 0.723
3.669LeuIle: 3.669 ± 0.897
4.822LeuLys: 4.822 ± 1.466
10.903LeuLeu: 10.903 ± 4.327
1.887LeuMet: 1.887 ± 0.916
4.193LeuAsn: 4.193 ± 0.645
4.717LeuPro: 4.717 ± 1.443
4.403LeuGln: 4.403 ± 0.888
4.403LeuArg: 4.403 ± 1.119
6.5LeuSer: 6.5 ± 1.22
4.508LeuThr: 4.508 ± 1.144
8.806LeuVal: 8.806 ± 1.382
1.468LeuTrp: 1.468 ± 1.196
5.032LeuTyr: 5.032 ± 1.208
0.0LeuXaa: 0.0 ± 0.0
Met
1.782MetAla: 1.782 ± 0.68
1.363MetCys: 1.363 ± 0.72
0.839MetAsp: 0.839 ± 0.323
0.839MetGlu: 0.839 ± 0.385
1.048MetPhe: 1.048 ± 0.825
1.258MetGly: 1.258 ± 0.36
0.419MetHis: 0.419 ± 0.375
0.734MetIle: 0.734 ± 0.273
0.314MetLys: 0.314 ± 0.167
3.25MetLeu: 3.25 ± 0.576
0.524MetMet: 0.524 ± 0.365
1.572MetAsn: 1.572 ± 0.382
1.258MetPro: 1.258 ± 0.748
1.363MetGln: 1.363 ± 0.448
1.363MetArg: 1.363 ± 0.349
1.782MetSer: 1.782 ± 0.691
1.363MetThr: 1.363 ± 0.441
1.677MetVal: 1.677 ± 0.751
0.314MetTrp: 0.314 ± 0.395
0.943MetTyr: 0.943 ± 0.363
0.0MetXaa: 0.0 ± 0.0
Asn
4.508AsnAla: 4.508 ± 0.835
1.258AsnCys: 1.258 ± 0.453
1.468AsnAsp: 1.468 ± 0.515
1.887AsnGlu: 1.887 ± 0.934
2.621AsnPhe: 2.621 ± 1.599
3.879AsnGly: 3.879 ± 0.648
0.419AsnHis: 0.419 ± 0.38
2.306AsnIle: 2.306 ± 0.429
2.83AsnLys: 2.83 ± 0.733
4.613AsnLeu: 4.613 ± 0.793
1.048AsnMet: 1.048 ± 0.548
2.83AsnAsn: 2.83 ± 1.026
2.201AsnPro: 2.201 ± 0.817
1.048AsnGln: 1.048 ± 0.758
2.411AsnArg: 2.411 ± 0.69
4.088AsnSer: 4.088 ± 2.276
3.04AsnThr: 3.04 ± 1.409
4.613AsnVal: 4.613 ± 0.331
0.524AsnTrp: 0.524 ± 0.278
2.83AsnTyr: 2.83 ± 0.833
0.0AsnXaa: 0.0 ± 0.0
Pro
4.193ProAla: 4.193 ± 0.747
0.943ProCys: 0.943 ± 0.306
2.935ProAsp: 2.935 ± 0.431
2.306ProGlu: 2.306 ± 1.096
1.992ProPhe: 1.992 ± 0.429
3.04ProGly: 3.04 ± 0.586
1.153ProHis: 1.153 ± 0.391
2.726ProIle: 2.726 ± 0.592
3.04ProLys: 3.04 ± 1.747
4.927ProLeu: 4.927 ± 1.132
1.153ProMet: 1.153 ± 0.477
2.097ProAsn: 2.097 ± 1.249
1.992ProPro: 1.992 ± 0.688
1.468ProGln: 1.468 ± 0.515
2.097ProArg: 2.097 ± 1.226
1.572ProSer: 1.572 ± 0.512
2.935ProThr: 2.935 ± 0.696
3.459ProVal: 3.459 ± 0.639
0.839ProTrp: 0.839 ± 0.285
1.887ProTyr: 1.887 ± 0.575
0.0ProXaa: 0.0 ± 0.0
Gln
2.201GlnAla: 2.201 ± 0.476
0.524GlnCys: 0.524 ± 0.184
2.097GlnAsp: 2.097 ± 0.481
1.677GlnGlu: 1.677 ± 0.86
1.992GlnPhe: 1.992 ± 0.893
1.572GlnGly: 1.572 ± 0.517
0.943GlnHis: 0.943 ± 0.306
0.629GlnIle: 0.629 ± 0.331
1.677GlnLys: 1.677 ± 0.808
3.879GlnLeu: 3.879 ± 0.977
0.839GlnMet: 0.839 ± 0.43
1.468GlnAsn: 1.468 ± 0.76
1.782GlnPro: 1.782 ± 0.566
1.363GlnGln: 1.363 ± 0.368
1.258GlnArg: 1.258 ± 0.491
2.306GlnSer: 2.306 ± 0.414
2.726GlnThr: 2.726 ± 1.09
2.621GlnVal: 2.621 ± 0.322
1.048GlnTrp: 1.048 ± 0.328
1.048GlnTyr: 1.048 ± 0.398
0.0GlnXaa: 0.0 ± 0.0
Arg
2.726ArgAla: 2.726 ± 0.528
1.887ArgCys: 1.887 ± 0.56
1.572ArgAsp: 1.572 ± 0.401
1.782ArgGlu: 1.782 ± 0.697
1.677ArgPhe: 1.677 ± 0.562
2.097ArgGly: 2.097 ± 1.903
1.363ArgHis: 1.363 ± 1.187
2.201ArgIle: 2.201 ± 0.895
1.782ArgLys: 1.782 ± 0.945
3.879ArgLeu: 3.879 ± 1.075
1.468ArgMet: 1.468 ± 0.449
2.201ArgAsn: 2.201 ± 2.768
2.306ArgPro: 2.306 ± 1.329
1.572ArgGln: 1.572 ± 0.401
2.516ArgArg: 2.516 ± 0.916
2.201ArgSer: 2.201 ± 0.602
3.145ArgThr: 3.145 ± 0.712
3.669ArgVal: 3.669 ± 0.321
0.839ArgTrp: 0.839 ± 0.774
2.201ArgTyr: 2.201 ± 0.659
0.0ArgXaa: 0.0 ± 0.0
Ser
5.346SerAla: 5.346 ± 1.117
2.726SerCys: 2.726 ± 0.731
4.298SerAsp: 4.298 ± 0.863
2.935SerGlu: 2.935 ± 1.129
2.306SerPhe: 2.306 ± 1.309
3.984SerGly: 3.984 ± 1.088
1.468SerHis: 1.468 ± 0.419
2.516SerIle: 2.516 ± 0.458
4.193SerLys: 4.193 ± 0.878
6.814SerLeu: 6.814 ± 1.045
1.677SerMet: 1.677 ± 0.579
2.935SerAsn: 2.935 ± 0.798
3.04SerPro: 3.04 ± 0.655
1.782SerGln: 1.782 ± 0.786
3.459SerArg: 3.459 ± 2.104
5.032SerSer: 5.032 ± 1.462
3.669SerThr: 3.669 ± 0.9
6.814SerVal: 6.814 ± 1.544
1.048SerTrp: 1.048 ± 0.303
1.992SerTyr: 1.992 ± 0.715
0.105SerXaa: 0.105 ± 0.056
Thr
5.451ThrAla: 5.451 ± 0.498
1.992ThrCys: 1.992 ± 0.61
3.04ThrAsp: 3.04 ± 0.803
1.677ThrGlu: 1.677 ± 0.481
3.355ThrPhe: 3.355 ± 0.782
4.613ThrGly: 4.613 ± 0.777
0.943ThrHis: 0.943 ± 0.525
3.459ThrIle: 3.459 ± 1.271
3.355ThrLys: 3.355 ± 1.133
5.556ThrLeu: 5.556 ± 1.515
1.572ThrMet: 1.572 ± 0.596
2.516ThrAsn: 2.516 ± 0.745
4.193ThrPro: 4.193 ± 0.815
2.621ThrGln: 2.621 ± 1.492
2.097ThrArg: 2.097 ± 0.864
4.403ThrSer: 4.403 ± 0.652
4.298ThrThr: 4.298 ± 1.447
7.129ThrVal: 7.129 ± 1.167
0.524ThrTrp: 0.524 ± 0.329
2.411ThrTyr: 2.411 ± 0.746
0.0ThrXaa: 0.0 ± 0.0
Val
7.443ValAla: 7.443 ± 1.063
2.935ValCys: 2.935 ± 0.809
4.822ValAsp: 4.822 ± 1.39
3.669ValGlu: 3.669 ± 1.453
4.613ValPhe: 4.613 ± 1.403
5.137ValGly: 5.137 ± 0.672
1.153ValHis: 1.153 ± 0.441
3.879ValIle: 3.879 ± 1.932
4.822ValLys: 4.822 ± 0.675
10.693ValLeu: 10.693 ± 1.919
2.097ValMet: 2.097 ± 0.622
4.822ValAsn: 4.822 ± 1.188
4.403ValPro: 4.403 ± 0.765
3.879ValGln: 3.879 ± 1.135
3.669ValArg: 3.669 ± 0.618
7.233ValSer: 7.233 ± 0.971
5.871ValThr: 5.871 ± 0.816
10.274ValVal: 10.274 ± 2.366
0.839ValTrp: 0.839 ± 0.249
4.613ValTyr: 4.613 ± 0.999
0.0ValXaa: 0.0 ± 0.0
Trp
0.629TrpAla: 0.629 ± 0.334
0.21TrpCys: 0.21 ± 0.111
1.153TrpAsp: 1.153 ± 0.612
0.314TrpGlu: 0.314 ± 0.36
0.629TrpPhe: 0.629 ± 0.29
0.629TrpGly: 0.629 ± 0.821
0.0TrpHis: 0.0 ± 0.0
0.314TrpIle: 0.314 ± 0.134
0.314TrpLys: 0.314 ± 0.167
2.097TrpLeu: 2.097 ± 0.783
0.21TrpMet: 0.21 ± 0.346
0.524TrpAsn: 0.524 ± 0.373
0.734TrpPro: 0.734 ± 0.83
0.419TrpGln: 0.419 ± 0.222
0.419TrpArg: 0.419 ± 0.151
0.839TrpSer: 0.839 ± 0.526
0.839TrpThr: 0.839 ± 0.443
1.572TrpVal: 1.572 ± 0.734
0.105TrpTrp: 0.105 ± 0.056
1.153TrpTyr: 1.153 ± 0.347
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.774TyrAla: 3.774 ± 0.551
1.363TyrCys: 1.363 ± 0.998
3.25TyrAsp: 3.25 ± 1.113
1.887TyrGlu: 1.887 ± 0.442
2.411TyrPhe: 2.411 ± 0.542
3.25TyrGly: 3.25 ± 0.648
0.524TyrHis: 0.524 ± 0.278
1.782TyrIle: 1.782 ± 0.422
1.887TyrLys: 1.887 ± 0.61
3.774TyrLeu: 3.774 ± 0.976
0.629TyrMet: 0.629 ± 0.226
3.145TyrAsn: 3.145 ± 1.276
2.306TyrPro: 2.306 ± 0.742
1.048TyrGln: 1.048 ± 0.427
1.677TyrArg: 1.677 ± 0.403
3.355TyrSer: 3.355 ± 1.045
3.669TyrThr: 3.669 ± 0.783
4.822TyrVal: 4.822 ± 1.564
0.524TyrTrp: 0.524 ± 0.364
2.83TyrTyr: 2.83 ± 0.57
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.105XaaIle: 0.105 ± 0.056
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (9540 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski