Amino acid dipeptide frequency for Bat coronavirus HKU9 (BtCoV) (BtCoV/HKU9)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
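As an illustration of the idea, here is a minimal sketch of how dipeptide frequencies could be counted from protein sequences. It assumes one-letter amino acid codes, overlapping pairs counted within each protein only, and normalization by the total number of dipeptides; the function name `dipeptide_per_mille` is hypothetical, and the exact procedure used by Proteome-pI is not stated on this page.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: frequencies are normalized by the total number of dipeptides.
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real BtCoV/HKU9 sequences): pairs are MA, AA, AL and AL, LV.
freqs = dipeptide_per_mille(["MAAL", "ALV"])
print(freqs["AL"])
```

In this toy input "AL" accounts for two of the five dipeptides.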

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
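The expectation described above can be sketched in a few lines. This assumes a uniform model in which every dipeptide is equally likely; whether the page's colouring uses the 400-dipeptide or the 441-dipeptide baseline is not stated, so the `alphabet_size` parameter and both function names are assumptions made for illustration.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"    # would be shown in red
    if observed_per_mille < expected:
        return "under"   # would be shown in blue
    return "expected"


print(expected_per_mille(20))  # 400 dipeptides
print(expected_per_mille(21))  # 441 dipeptides, with Xaa
print(classify(11.398))        # LeuLeu value from the table
print(classify(0.073))         # TrpTrp value from the table
```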

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
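A small sketch of the unit conversion, using hypothetical helper names; it only restates the arithmetic above (per mille times 10⁻³ gives a fraction, per mille divided by 10 gives a percentage).

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage."""
    return value / 10.0


# LeuLeu is listed as 11.398 per mille in the table.
print(per_mille_to_fraction(11.398))
print(per_mille_to_percent(2.5))
```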

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.953 ± 1.411
AlaCys: 2.541 ± 0.586
AlaAsp: 3.92 ± 0.486
AlaGlu: 2.541 ± 0.426
AlaPhe: 3.63 ± 0.359
AlaGly: 3.993 ± 0.502
AlaHis: 1.089 ± 0.212
AlaIle: 4.864 ± 0.561
AlaLys: 2.977 ± 0.616
AlaLeu: 6.171 ± 1.264
AlaMet: 3.049 ± 0.349
AlaAsn: 3.993 ± 0.375
AlaPro: 3.34 ± 0.473
AlaGln: 3.194 ± 0.564
AlaArg: 2.977 ± 0.483
AlaSer: 4.864 ± 0.677
AlaThr: 4.864 ± 0.622
AlaVal: 6.534 ± 0.708
AlaTrp: 0.726 ± 0.22
AlaTyr: 4.138 ± 0.637
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.178 ± 0.271
CysCys: 0.871 ± 0.245
CysAsp: 2.178 ± 0.307
CysGlu: 0.581 ± 0.21
CysPhe: 1.597 ± 0.564
CysGly: 2.396 ± 0.364
CysHis: 0.653 ± 0.1
CysIle: 0.799 ± 0.301
CysLys: 1.307 ± 0.242
CysLeu: 2.178 ± 0.327
CysMet: 0.871 ± 0.627
CysAsn: 1.089 ± 0.352
CysPro: 1.089 ± 0.162
CysGln: 0.944 ± 0.179
CysArg: 1.162 ± 0.17
CysSer: 2.251 ± 0.343
CysThr: 3.194 ± 0.42
CysVal: 2.977 ± 0.503
CysTrp: 0.436 ± 0.133
CysTyr: 2.396 ± 0.473
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.501 ± 0.637
AspCys: 1.379 ± 0.322
AspAsp: 2.323 ± 0.514
AspGlu: 2.251 ± 0.539
AspPhe: 3.34 ± 0.371
AspGly: 4.066 ± 0.477
AspHis: 0.218 ± 0.105
AspIle: 3.63 ± 0.54
AspLys: 2.686 ± 0.449
AspLeu: 4.066 ± 0.503
AspMet: 0.799 ± 0.414
AspAsn: 2.468 ± 1.238
AspPro: 2.396 ± 0.492
AspGln: 1.307 ± 0.297
AspArg: 1.742 ± 0.609
AspSer: 3.122 ± 0.455
AspThr: 4.066 ± 0.609
AspVal: 5.227 ± 0.487
AspTrp: 0.799 ± 0.206
AspTyr: 2.759 ± 0.51
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.323 ± 0.508
GluCys: 1.016 ± 0.203
GluAsp: 2.251 ± 0.768
GluGlu: 1.96 ± 0.216
GluPhe: 1.452 ± 0.34
GluGly: 2.904 ± 0.485
GluHis: 1.452 ± 0.25
GluIle: 0.871 ± 0.183
GluLys: 0.944 ± 0.123
GluLeu: 3.557 ± 0.474
GluMet: 0.581 ± 0.12
GluAsn: 1.742 ± 0.205
GluPro: 1.888 ± 0.438
GluGln: 1.162 ± 0.265
GluArg: 1.96 ± 0.653
GluSer: 3.267 ± 0.589
GluThr: 1.888 ± 0.388
GluVal: 3.485 ± 0.486
GluTrp: 0.363 ± 0.116
GluTyr: 1.452 ± 0.287
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.396 ± 0.998
PheCys: 1.307 ± 0.307
PheAsp: 2.614 ± 0.402
PheGlu: 1.379 ± 0.216
PhePhe: 0.944 ± 0.32
PheGly: 3.049 ± 0.37
PheHis: 0.944 ± 0.488
PheIle: 2.251 ± 0.665
PheLys: 2.178 ± 0.436
PheLeu: 3.485 ± 0.426
PheMet: 1.888 ± 0.6
PheAsn: 2.396 ± 0.744
PhePro: 1.089 ± 0.504
PheGln: 1.452 ± 0.206
PheArg: 1.67 ± 0.499
PheSer: 4.574 ± 0.498
PheThr: 3.557 ± 0.367
PheVal: 4.429 ± 0.448
PheTrp: 0.436 ± 0.192
PheTyr: 2.323 ± 0.248
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.864 ± 0.464
GlyCys: 1.815 ± 0.307
GlyAsp: 4.283 ± 0.703
GlyGlu: 1.089 ± 0.307
GlyPhe: 3.848 ± 0.385
GlyGly: 4.429 ± 0.556
GlyHis: 1.234 ± 0.355
GlyIle: 2.904 ± 0.298
GlyLys: 2.396 ± 0.42
GlyLeu: 5.009 ± 0.877
GlyMet: 0.871 ± 0.221
GlyAsn: 2.396 ± 0.875
GlyPro: 2.614 ± 0.489
GlyGln: 1.234 ± 0.201
GlyArg: 2.614 ± 0.859
GlySer: 5.082 ± 0.861
GlyThr: 5.372 ± 0.672
GlyVal: 8.857 ± 0.911
GlyTrp: 1.234 ± 0.28
GlyTyr: 2.614 ± 0.173
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.089 ± 0.361
HisCys: 0.218 ± 0.067
HisAsp: 0.581 ± 0.201
HisGlu: 0.363 ± 0.149
HisPhe: 0.799 ± 0.332
HisGly: 1.307 ± 0.429
HisHis: 0.363 ± 0.104
HisIle: 1.162 ± 0.563
HisLys: 0.653 ± 0.494
HisLeu: 1.96 ± 0.242
HisMet: 0.29 ± 0.091
HisAsn: 0.726 ± 0.152
HisPro: 0.944 ± 0.131
HisGln: 0.508 ± 0.098
HisArg: 0.726 ± 0.325
HisSer: 0.944 ± 0.289
HisThr: 1.888 ± 0.308
HisVal: 2.468 ± 0.442
HisTrp: 0.073 ± 0.192
HisTyr: 1.016 ± 0.238
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.122 ± 0.827
IleCys: 1.089 ± 0.251
IleAsp: 1.888 ± 0.409
IleGlu: 1.379 ± 0.397
IlePhe: 1.307 ± 0.234
IleGly: 2.831 ± 0.635
IleHis: 0.581 ± 0.22
IleIle: 1.742 ± 0.581
IleLys: 2.977 ± 1.177
IleLeu: 5.082 ± 1.145
IleMet: 1.525 ± 0.189
IleAsn: 2.977 ± 1.101
IlePro: 2.105 ± 0.356
IleGln: 0.871 ± 0.253
IleArg: 2.614 ± 0.512
IleSer: 3.267 ± 0.394
IleThr: 2.251 ± 0.337
IleVal: 3.848 ± 0.359
IleTrp: 1.234 ± 0.359
IleTyr: 1.452 ± 0.646
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.557 ± 0.523
LysCys: 1.379 ± 0.294
LysAsp: 2.033 ± 0.502
LysGlu: 2.033 ± 0.37
LysPhe: 1.815 ± 0.329
LysGly: 3.485 ± 0.326
LysHis: 1.089 ± 0.379
LysIle: 1.162 ± 0.177
LysLys: 1.742 ± 0.717
LysLeu: 4.864 ± 0.57
LysMet: 0.944 ± 0.155
LysAsn: 1.742 ± 0.568
LysPro: 2.759 ± 0.379
LysGln: 1.67 ± 0.33
LysArg: 3.194 ± 0.446
LysSer: 1.815 ± 0.389
LysThr: 2.033 ± 0.338
LysVal: 4.646 ± 0.517
LysTrp: 0.508 ± 0.101
LysTyr: 2.831 ± 0.543
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.913 ± 0.836
LeuCys: 4.574 ± 0.498
LeuAsp: 4.138 ± 0.9
LeuGlu: 4.429 ± 0.448
LeuPhe: 3.267 ± 1.156
LeuGly: 4.719 ± 0.404
LeuHis: 2.977 ± 0.243
LeuIle: 4.138 ± 0.469
LeuLys: 5.808 ± 1.214
LeuLeu: 11.398 ± 1.518
LeuMet: 1.597 ± 0.301
LeuAsn: 4.138 ± 0.561
LeuPro: 4.937 ± 0.685
LeuGln: 4.211 ± 0.498
LeuArg: 4.138 ± 0.776
LeuSer: 6.389 ± 1.378
LeuThr: 4.283 ± 0.774
LeuVal: 8.639 ± 1.003
LeuTrp: 1.307 ± 0.323
LeuTyr: 5.3 ± 0.805
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.597 ± 0.581
MetCys: 1.016 ± 0.361
MetAsp: 0.871 ± 0.208
MetGlu: 0.799 ± 0.235
MetPhe: 0.871 ± 0.274
MetGly: 0.944 ± 0.123
MetHis: 0.363 ± 0.147
MetIle: 0.581 ± 0.113
MetLys: 0.436 ± 0.318
MetLeu: 3.63 ± 0.604
MetMet: 0.363 ± 0.093
MetAsn: 1.089 ± 0.351
MetPro: 1.452 ± 0.199
MetGln: 1.525 ± 0.281
MetArg: 1.379 ± 0.223
MetSer: 2.178 ± 0.265
MetThr: 1.452 ± 0.284
MetVal: 2.759 ± 0.566
MetTrp: 0.29 ± 0.194
MetTyr: 1.452 ± 0.229
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.429 ± 0.393
AsnCys: 1.597 ± 0.18
AsnAsp: 1.089 ± 0.446
AsnGlu: 1.67 ± 0.324
AsnPhe: 2.105 ± 1.072
AsnGly: 4.356 ± 0.789
AsnHis: 0.581 ± 0.287
AsnIle: 2.251 ± 0.403
AsnLys: 1.888 ± 0.243
AsnLeu: 3.92 ± 0.794
AsnMet: 0.871 ± 0.092
AsnAsn: 3.122 ± 0.613
AsnPro: 2.759 ± 0.515
AsnGln: 0.581 ± 0.485
AsnArg: 1.888 ± 0.427
AsnSer: 3.412 ± 0.665
AsnThr: 3.412 ± 0.922
AsnVal: 4.429 ± 0.478
AsnTrp: 0.653 ± 0.2
AsnTyr: 3.194 ± 0.245
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.63 ± 0.371
ProCys: 1.162 ± 0.278
ProAsp: 2.977 ± 0.304
ProGlu: 1.815 ± 0.391
ProPhe: 2.323 ± 0.337
ProGly: 3.557 ± 0.348
ProHis: 1.016 ± 0.282
ProIle: 3.557 ± 0.502
ProLys: 2.178 ± 0.947
ProLeu: 4.574 ± 0.581
ProMet: 1.307 ± 0.312
ProAsn: 1.815 ± 0.447
ProPro: 2.033 ± 0.428
ProGln: 1.597 ± 0.226
ProArg: 1.815 ± 0.675
ProSer: 1.888 ± 0.71
ProThr: 3.194 ± 0.472
ProVal: 3.485 ± 0.614
ProTrp: 0.653 ± 0.109
ProTyr: 2.105 ± 0.321
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.888 ± 0.289
GlnCys: 0.508 ± 0.308
GlnAsp: 2.033 ± 0.266
GlnGlu: 1.307 ± 0.2
GlnPhe: 1.96 ± 0.301
GlnGly: 1.016 ± 0.149
GlnHis: 0.653 ± 0.111
GlnIle: 0.726 ± 0.152
GlnLys: 1.452 ± 0.448
GlnLeu: 3.848 ± 0.671
GlnMet: 0.653 ± 0.184
GlnAsn: 1.452 ± 0.402
GlnPro: 1.597 ± 0.299
GlnGln: 1.452 ± 0.374
GlnArg: 1.452 ± 0.361
GlnSer: 1.888 ± 0.378
GlnThr: 2.468 ± 0.39
GlnVal: 2.614 ± 0.251
GlnTrp: 1.016 ± 0.183
GlnTyr: 0.726 ± 0.141
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.703 ± 0.269
ArgCys: 1.742 ± 0.228
ArgAsp: 1.96 ± 0.269
ArgGlu: 1.597 ± 0.32
ArgPhe: 2.396 ± 0.699
ArgGly: 2.831 ± 1.015
ArgHis: 1.016 ± 0.444
ArgIle: 1.379 ± 0.317
ArgLys: 1.815 ± 0.567
ArgLeu: 3.92 ± 0.343
ArgMet: 1.67 ± 0.416
ArgAsn: 1.67 ± 0.836
ArgPro: 1.888 ± 0.898
ArgGln: 1.525 ± 0.336
ArgArg: 2.105 ± 0.305
ArgSer: 2.251 ± 0.886
ArgThr: 3.34 ± 0.936
ArgVal: 3.122 ± 0.455
ArgTrp: 0.436 ± 0.143
ArgTyr: 2.105 ± 0.314
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.009 ± 0.656
SerCys: 2.178 ± 0.31
SerAsp: 4.066 ± 0.583
SerGlu: 2.686 ± 0.307
SerPhe: 2.977 ± 0.492
SerGly: 4.066 ± 0.435
SerHis: 1.379 ± 0.245
SerIle: 3.194 ± 0.817
SerLys: 3.775 ± 0.921
SerLeu: 7.26 ± 0.929
SerMet: 2.251 ± 0.879
SerAsn: 3.194 ± 0.46
SerPro: 2.759 ± 0.292
SerGln: 1.379 ± 0.369
SerArg: 3.485 ± 1.62
SerSer: 5.3 ± 0.451
SerThr: 4.138 ± 0.648
SerVal: 6.534 ± 1.01
SerTrp: 1.089 ± 0.173
SerTyr: 2.323 ± 0.38
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.155 ± 0.611
ThrCys: 1.96 ± 0.338
ThrAsp: 3.34 ± 0.421
ThrGlu: 2.033 ± 0.369
ThrPhe: 2.541 ± 0.513
ThrGly: 5.009 ± 0.569
ThrHis: 1.162 ± 0.252
ThrIle: 3.267 ± 0.448
ThrLys: 2.904 ± 0.633
ThrLeu: 5.082 ± 0.341
ThrMet: 1.67 ± 0.407
ThrAsn: 2.759 ± 0.257
ThrPro: 4.066 ± 0.647
ThrGln: 1.379 ± 0.681
ThrArg: 2.541 ± 0.497
ThrSer: 5.082 ± 0.706
ThrThr: 4.429 ± 0.461
ThrVal: 8.567 ± 1.325
ThrTrp: 0.436 ± 0.088
ThrTyr: 2.904 ± 0.45
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.115 ± 0.831
ValCys: 3.122 ± 0.654
ValAsp: 5.808 ± 0.73
ValGlu: 4.283 ± 1.038
ValPhe: 3.775 ± 0.962
ValGly: 5.663 ± 0.627
ValHis: 0.726 ± 0.235
ValIle: 3.485 ± 0.833
ValLys: 4.864 ± 0.602
ValLeu: 11.253 ± 1.078
ValMet: 2.831 ± 0.405
ValAsn: 6.244 ± 0.834
ValPro: 4.574 ± 0.651
ValGln: 3.412 ± 0.604
ValArg: 2.977 ± 0.375
ValSer: 7.333 ± 1.144
ValThr: 6.026 ± 0.849
ValVal: 10.309 ± 1.285
ValTrp: 0.944 ± 0.26
ValTyr: 3.703 ± 0.377
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.581 ± 0.169
TrpCys: 0.363 ± 0.128
TrpAsp: 1.016 ± 0.339
TrpGlu: 0.363 ± 0.149
TrpPhe: 0.799 ± 0.18
TrpGly: 0.29 ± 0.276
TrpHis: 0.073 ± 0.106
TrpIle: 0.0 ± 0.0
TrpLys: 0.363 ± 0.104
TrpLeu: 2.251 ± 0.592
TrpMet: 0.145 ± 0.196
TrpAsn: 0.581 ± 0.154
TrpPro: 0.508 ± 0.244
TrpGln: 0.581 ± 0.169
TrpArg: 0.436 ± 0.088
TrpSer: 1.016 ± 0.157
TrpThr: 1.016 ± 0.215
TrpVal: 1.379 ± 0.281
TrpTrp: 0.073 ± 0.046
TrpTyr: 1.162 ± 0.169
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.356 ± 0.559
TyrCys: 1.307 ± 0.265
TyrAsp: 3.775 ± 0.424
TyrGlu: 1.96 ± 0.32
TyrPhe: 2.614 ± 0.287
TyrGly: 3.848 ± 0.669
TyrHis: 0.508 ± 0.175
TyrIle: 2.178 ± 0.297
TyrLys: 1.96 ± 0.489
TyrLeu: 4.792 ± 0.793
TyrMet: 0.799 ± 0.143
TyrAsn: 2.614 ± 0.416
TyrPro: 1.888 ± 0.244
TyrGln: 0.726 ± 0.152
TyrArg: 1.597 ± 0.329
TyrSer: 3.122 ± 0.497
TyrThr: 3.412 ± 0.325
TyrVal: 4.211 ± 0.907
TyrTrp: 0.29 ± 0.173
TyrTyr: 3.049 ± 0.656
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (13775 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
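For readers unfamiliar with the technique, here is a minimal sketch of protein-level bootstrapping: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function name `bootstrap_error`, the use of the sample standard deviation as the error, and the fixed seed are assumptions for illustration; the page does not describe the exact estimator used by Proteome-pI.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille frequency
    over bootstrap resamples drawn at the protein level."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = [rng.choice(proteins) for _ in proteins]
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[pair] / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, not the real BtCoV/HKU9 proteome.
mean, err = bootstrap_error(["MAAL", "ALV", "LLLL"], "LL")
print(f"LL: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so a proteome with few proteins (nine here) can show comparatively large errors.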

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski