Amino acid dipeptide frequency for Human coronavirus 229E (HCoV-229E)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
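As a rough illustration of how such a frequency can be obtained, the sketch below counts overlapping residue pairs within each protein and pools them over the whole proteome. It is only a minimal sketch: the function name `dipeptide_per_mille`, the use of one-letter residue codes, and the choice of pooling over all proteins are assumptions made for the example, not a description of the Proteome-pI pipeline. The two sequences are toy data, not HCoV-229E proteins.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    `proteins` is a list of sequences in one-letter code (an assumption
    of this sketch; the table on this page uses three-letter codes).
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "MAAVL" yields MA, AA, AV, VL and "AVK" yields AV, VK.
freqs = dipeptide_per_mille(["MAAVL", "AVK"])
```

Here `freqs["AV"]` is 2 of 6 pairs, so one third of the total, expressed in per mille.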

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
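The comparison against the uniform expectation can be sketched as follows. This is an assumption-laden illustration: the helper names `expected_per_mille` and `classify` are hypothetical, and the exact threshold and colouring rule used on this page are not documented here, so the sketch simply compares a value with 1000/400 = 2.5‰ and ignores the error estimate.

```python
def expected_per_mille(n_residues=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_residues ** 2)


def classify(per_mille, n_residues=20):
    """Label a dipeptide frequency relative to the uniform expectation."""
    expected = expected_per_mille(n_residues)
    if per_mille > expected:
        return "over"    # would be marked red
    if per_mille < expected:
        return "under"   # would be marked blue
    return "expected"


# Two values taken from the table on this page.
leu_leu = classify(9.824)   # LeuLeu
his_his = classify(0.077)   # HisHis
```

With 21 symbols (including Xaa) the expectation drops to 1000/441, about 2.27‰, which can be obtained with `expected_per_mille(21)`.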

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.106 ± 0.468
AlaCys: 1.547 ± 0.318
AlaAsp: 2.398 ± 0.696
AlaGlu: 3.558 ± 0.275
AlaPhe: 4.796 ± 0.523
AlaGly: 3.713 ± 0.571
AlaHis: 0.851 ± 0.165
AlaIle: 4.951 ± 0.776
AlaLys: 4.641 ± 0.709
AlaLeu: 5.647 ± 1.116
AlaMet: 2.321 ± 0.272
AlaAsn: 3.868 ± 0.246
AlaPro: 1.779 ± 0.216
AlaGln: 2.321 ± 0.481
AlaArg: 2.166 ± 0.496
AlaSer: 3.791 ± 0.263
AlaThr: 3.249 ± 0.608
AlaVal: 6.498 ± 0.544
AlaTrp: 1.006 ± 0.275
AlaTyr: 3.791 ± 0.611
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.553 ± 0.494
CysCys: 1.083 ± 0.382
CysAsp: 1.702 ± 0.399
CysGlu: 1.16 ± 0.149
CysPhe: 2.166 ± 0.38
CysGly: 2.166 ± 0.44
CysHis: 0.155 ± 0.19
CysIle: 1.625 ± 0.293
CysLys: 1.857 ± 0.405
CysLeu: 1.547 ± 0.368
CysMet: 0.464 ± 0.15
CysAsn: 2.475 ± 0.496
CysPro: 0.696 ± 0.13
CysGln: 0.387 ± 0.106
CysArg: 1.392 ± 0.406
CysSer: 1.625 ± 0.337
CysThr: 3.791 ± 0.493
CysVal: 3.868 ± 0.427
CysTrp: 0.851 ± 0.344
CysTyr: 1.934 ± 0.374
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.1 ± 0.472
AspCys: 1.779 ± 0.408
AspAsp: 2.321 ± 0.327
AspGlu: 2.089 ± 0.349
AspPhe: 4.023 ± 0.478
AspGly: 4.487 ± 0.245
AspHis: 1.547 ± 0.307
AspIle: 4.255 ± 0.498
AspLys: 3.404 ± 0.356
AspLeu: 3.868 ± 0.367
AspMet: 0.542 ± 0.152
AspAsn: 2.63 ± 0.493
AspPro: 1.315 ± 0.326
AspGln: 1.006 ± 0.298
AspArg: 1.47 ± 0.247
AspSer: 2.475 ± 0.474
AspThr: 2.398 ± 0.32
AspVal: 5.647 ± 0.895
AspTrp: 1.006 ± 0.279
AspTyr: 3.636 ± 0.862
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.934 ± 0.406
GluCys: 1.083 ± 0.16
GluAsp: 2.398 ± 0.5
GluGlu: 2.63 ± 0.328
GluPhe: 2.553 ± 0.44
GluGly: 2.94 ± 0.367
GluHis: 1.392 ± 0.273
GluIle: 1.934 ± 0.23
GluLys: 2.708 ± 0.489
GluLeu: 3.558 ± 0.304
GluMet: 1.16 ± 0.244
GluAsn: 3.094 ± 0.315
GluPro: 1.625 ± 0.819
GluGln: 1.392 ± 0.203
GluArg: 1.702 ± 0.208
GluSer: 3.094 ± 0.219
GluThr: 1.779 ± 0.392
GluVal: 3.249 ± 0.389
GluTrp: 0.851 ± 0.252
GluTyr: 1.16 ± 0.121
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.63 ± 0.157
PheCys: 2.862 ± 0.384
PheAsp: 4.719 ± 0.612
PheGlu: 3.094 ± 0.704
PhePhe: 2.166 ± 0.304
PheGly: 4.951 ± 0.463
PheHis: 0.232 ± 0.206
PheIle: 2.63 ± 0.374
PheLys: 3.094 ± 0.392
PheLeu: 3.713 ± 0.446
PheMet: 1.315 ± 0.377
PheAsn: 3.404 ± 0.558
PhePro: 0.774 ± 0.203
PheGln: 0.542 ± 0.21
PheArg: 1.47 ± 0.68
PheSer: 4.177 ± 0.368
PheThr: 3.481 ± 0.436
PheVal: 9.592 ± 1.192
PheTrp: 0.696 ± 0.217
PheTyr: 2.708 ± 0.598
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.481 ± 0.622
GlyCys: 2.089 ± 0.458
GlyAsp: 4.796 ± 0.45
GlyGlu: 1.857 ± 0.321
GlyPhe: 4.564 ± 0.671
GlyGly: 4.487 ± 0.601
GlyHis: 0.928 ± 0.367
GlyIle: 3.094 ± 0.178
GlyLys: 4.1 ± 0.416
GlyLeu: 5.338 ± 0.616
GlyMet: 1.006 ± 0.298
GlyAsn: 3.558 ± 0.715
GlyPro: 2.011 ± 0.23
GlyGln: 0.928 ± 0.241
GlyArg: 2.089 ± 1.186
GlySer: 4.796 ± 0.811
GlyThr: 3.713 ± 0.547
GlyVal: 7.813 ± 0.523
GlyTrp: 0.774 ± 0.409
GlyTyr: 3.017 ± 0.256
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.006 ± 0.371
HisCys: 0.464 ± 0.196
HisAsp: 1.315 ± 0.369
HisGlu: 1.083 ± 0.191
HisPhe: 1.315 ± 0.459
HisGly: 1.238 ± 0.301
HisHis: 0.077 ± 0.047
HisIle: 1.083 ± 0.637
HisLys: 0.928 ± 0.206
HisLeu: 1.238 ± 0.253
HisMet: 0.155 ± 0.401
HisAsn: 1.16 ± 0.468
HisPro: 0.619 ± 0.581
HisGln: 0.232 ± 0.066
HisArg: 0.619 ± 0.312
HisSer: 0.928 ± 0.262
HisThr: 1.238 ± 0.256
HisVal: 1.625 ± 0.464
HisTrp: 0.155 ± 0.05
HisTyr: 1.083 ± 0.265
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.636 ± 0.545
IleCys: 0.928 ± 0.262
IleAsp: 3.094 ± 0.375
IleGlu: 2.089 ± 0.317
IlePhe: 3.326 ± 0.486
IleGly: 2.862 ± 0.361
IleHis: 0.619 ± 0.234
IleIle: 2.089 ± 0.22
IleLys: 3.868 ± 0.836
IleLeu: 5.26 ± 0.538
IleMet: 1.16 ± 0.131
IleAsn: 2.785 ± 0.327
IlePro: 1.934 ± 0.826
IleGln: 2.63 ± 0.388
IleArg: 1.392 ± 0.285
IleSer: 4.023 ± 0.764
IleThr: 3.249 ± 0.564
IleVal: 5.802 ± 0.694
IleTrp: 0.542 ± 0.155
IleTyr: 1.392 ± 0.502
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.487 ± 0.458
LysCys: 2.089 ± 0.342
LysAsp: 3.945 ± 0.561
LysGlu: 2.708 ± 0.338
LysPhe: 3.481 ± 0.404
LysGly: 2.708 ± 0.219
LysHis: 2.475 ± 0.369
LysIle: 2.166 ± 0.333
LysLys: 1.625 ± 0.196
LysLeu: 5.647 ± 0.503
LysMet: 1.315 ± 0.299
LysAsn: 2.398 ± 0.448
LysPro: 4.177 ± 0.555
LysGln: 1.702 ± 0.399
LysArg: 2.398 ± 0.36
LysSer: 3.791 ± 0.397
LysThr: 3.249 ± 0.208
LysVal: 5.338 ± 0.866
LysTrp: 1.083 ± 0.163
LysTyr: 3.249 ± 0.403
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.183 ± 0.398
LeuCys: 3.713 ± 0.498
LeuAsp: 4.332 ± 0.872
LeuGlu: 3.249 ± 0.431
LeuPhe: 4.641 ± 0.912
LeuGly: 4.641 ± 0.383
LeuHis: 2.089 ± 0.147
LeuIle: 2.708 ± 0.792
LeuLys: 6.575 ± 0.78
LeuLeu: 9.824 ± 0.65
LeuMet: 1.16 ± 0.239
LeuAsn: 5.415 ± 0.786
LeuPro: 3.249 ± 0.678
LeuGln: 3.791 ± 0.4
LeuArg: 2.475 ± 0.448
LeuSer: 7.194 ± 0.372
LeuThr: 5.338 ± 0.567
LeuVal: 5.647 ± 0.957
LeuTrp: 1.315 ± 0.395
LeuTyr: 3.558 ± 0.973
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.702 ± 0.316
MetCys: 0.928 ± 0.265
MetAsp: 1.16 ± 0.373
MetGlu: 0.309 ± 0.103
MetPhe: 1.392 ± 0.354
MetGly: 1.315 ± 0.369
MetHis: 0.851 ± 0.241
MetIle: 1.315 ± 0.369
MetLys: 0.851 ± 0.131
MetLeu: 2.708 ± 0.143
MetMet: 0.387 ± 0.146
MetAsn: 0.542 ± 0.13
MetPro: 1.16 ± 0.243
MetGln: 0.774 ± 0.549
MetArg: 0.928 ± 0.265
MetSer: 1.083 ± 0.48
MetThr: 1.16 ± 0.358
MetVal: 1.16 ± 0.251
MetTrp: 0.077 ± 0.24
MetTyr: 1.547 ± 0.206
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.332 ± 0.662
AsnCys: 2.166 ± 0.339
AsnAsp: 2.321 ± 0.311
AsnGlu: 2.862 ± 0.398
AsnPhe: 2.553 ± 0.514
AsnGly: 6.653 ± 0.716
AsnHis: 0.542 ± 0.182
AsnIle: 3.017 ± 0.613
AsnLys: 2.94 ± 0.449
AsnLeu: 4.487 ± 0.315
AsnMet: 1.315 ± 0.25
AsnAsn: 3.017 ± 0.618
AsnPro: 1.238 ± 0.673
AsnGln: 1.47 ± 0.453
AsnArg: 1.702 ± 0.27
AsnSer: 4.023 ± 0.495
AsnThr: 3.172 ± 0.229
AsnVal: 7.117 ± 0.649
AsnTrp: 0.928 ± 0.659
AsnTyr: 1.315 ± 0.417
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.779 ± 0.384
ProCys: 0.851 ± 0.165
ProAsp: 1.006 ± 0.278
ProGlu: 2.089 ± 0.366
ProPhe: 1.779 ± 0.294
ProGly: 2.011 ± 0.464
ProHis: 0.851 ± 0.513
ProIle: 2.166 ± 0.235
ProLys: 1.702 ± 0.507
ProLeu: 3.094 ± 0.31
ProMet: 0.232 ± 0.227
ProAsn: 1.083 ± 0.314
ProPro: 1.006 ± 0.264
ProGln: 1.083 ± 0.757
ProArg: 1.315 ± 0.478
ProSer: 2.785 ± 1.069
ProThr: 2.089 ± 0.135
ProVal: 3.249 ± 0.476
ProTrp: 0.774 ± 0.1
ProTyr: 1.392 ± 0.16
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.63 ± 0.388
GlnCys: 0.542 ± 0.132
GlnAsp: 1.006 ± 0.307
GlnGlu: 1.238 ± 0.161
GlnPhe: 1.083 ± 0.514
GlnGly: 2.321 ± 0.254
GlnHis: 0.155 ± 0.198
GlnIle: 1.625 ± 0.32
GlnLys: 1.47 ± 0.727
GlnLeu: 2.785 ± 0.388
GlnMet: 0.851 ± 0.175
GlnAsn: 1.083 ± 0.142
GlnPro: 1.625 ± 0.532
GlnGln: 1.547 ± 0.372
GlnArg: 1.16 ± 0.227
GlnSer: 2.321 ± 1.01
GlnThr: 2.243 ± 0.822
GlnVal: 2.398 ± 0.198
GlnTrp: 0.077 ± 0.047
GlnTyr: 1.083 ± 0.373
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.475 ± 0.402
ArgCys: 1.47 ± 0.28
ArgAsp: 1.315 ± 0.248
ArgGlu: 0.774 ± 0.326
ArgPhe: 2.708 ± 0.342
ArgGly: 2.166 ± 0.727
ArgHis: 0.542 ± 0.155
ArgIle: 1.392 ± 0.607
ArgLys: 1.779 ± 0.406
ArgLeu: 3.713 ± 0.531
ArgMet: 1.006 ± 0.124
ArgAsn: 1.934 ± 0.601
ArgPro: 0.696 ± 0.199
ArgGln: 1.547 ± 0.434
ArgArg: 1.16 ± 0.258
ArgSer: 1.857 ± 0.758
ArgThr: 1.934 ± 0.272
ArgVal: 3.017 ± 0.675
ArgTrp: 0.464 ± 0.184
ArgTyr: 1.16 ± 0.277
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.647 ± 0.634
SerCys: 1.779 ± 0.441
SerAsp: 3.172 ± 0.557
SerGlu: 2.785 ± 0.556
SerPhe: 4.874 ± 0.522
SerGly: 4.796 ± 0.461
SerHis: 1.238 ± 0.295
SerIle: 4.409 ± 0.766
SerLys: 3.945 ± 0.457
SerLeu: 5.492 ± 0.376
SerMet: 1.47 ± 0.38
SerAsn: 4.487 ± 0.397
SerPro: 1.702 ± 0.804
SerGln: 2.166 ± 1.418
SerArg: 2.166 ± 1.926
SerSer: 4.719 ± 0.506
SerThr: 4.332 ± 0.646
SerVal: 7.272 ± 1.134
SerTrp: 0.464 ± 0.095
SerTyr: 3.172 ± 0.412
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.481 ± 0.205
ThrCys: 1.779 ± 0.418
ThrAsp: 2.321 ± 0.197
ThrGlu: 1.934 ± 0.222
ThrPhe: 2.553 ± 0.256
ThrGly: 3.636 ± 1.035
ThrHis: 0.851 ± 0.132
ThrIle: 4.332 ± 0.536
ThrLys: 4.487 ± 0.634
ThrLeu: 5.028 ± 0.779
ThrMet: 1.315 ± 0.439
ThrAsn: 3.172 ± 0.369
ThrPro: 2.475 ± 0.378
ThrGln: 1.779 ± 0.22
ThrArg: 1.547 ± 0.473
ThrSer: 5.106 ± 1.888
ThrThr: 4.177 ± 0.413
ThrVal: 7.117 ± 0.764
ThrTrp: 0.928 ± 0.281
ThrTyr: 2.243 ± 0.354
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.736 ± 0.331
ValCys: 4.177 ± 0.523
ValAsp: 6.266 ± 0.412
ValGlu: 5.183 ± 0.884
ValPhe: 4.564 ± 0.62
ValGly: 4.641 ± 0.476
ValHis: 1.238 ± 0.263
ValIle: 5.183 ± 0.564
ValLys: 7.117 ± 1.134
ValLeu: 8.2 ± 0.42
ValMet: 2.785 ± 0.632
ValAsn: 6.807 ± 0.469
ValPro: 2.862 ± 0.564
ValGln: 2.785 ± 0.164
ValArg: 3.791 ± 0.174
ValSer: 7.504 ± 1.079
ValThr: 6.189 ± 0.761
ValVal: 9.592 ± 0.669
ValTrp: 1.006 ± 0.113
ValTyr: 4.023 ± 0.531
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.928 ± 0.797
TrpCys: 0.464 ± 0.169
TrpAsp: 0.928 ± 0.293
TrpGlu: 0.232 ± 0.066
TrpPhe: 1.083 ± 0.16
TrpGly: 0.155 ± 0.094
TrpHis: 0.464 ± 0.162
TrpIle: 0.464 ± 0.15
TrpLys: 0.387 ± 0.372
TrpLeu: 2.321 ± 0.294
TrpMet: 0.077 ± 0.047
TrpAsn: 1.006 ± 0.782
TrpPro: 0.542 ± 0.172
TrpGln: 0.232 ± 0.066
TrpArg: 0.619 ± 0.172
TrpSer: 1.006 ± 0.428
TrpThr: 0.851 ± 0.165
TrpVal: 1.238 ± 0.492
TrpTrp: 0.542 ± 0.13
TrpTyr: 0.387 ± 0.108
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.172 ± 0.376
TyrCys: 1.702 ± 0.321
TyrAsp: 3.481 ± 0.729
TyrGlu: 1.315 ± 0.455
TyrPhe: 2.708 ± 0.292
TyrGly: 2.553 ± 0.371
TyrHis: 0.464 ± 0.203
TyrIle: 2.243 ± 0.308
TyrLys: 2.553 ± 0.162
TyrLeu: 2.63 ± 0.314
TyrMet: 1.238 ± 0.261
TyrAsn: 3.249 ± 0.48
TyrPro: 0.851 ± 0.274
TyrGln: 0.928 ± 0.231
TyrArg: 1.547 ± 0.306
TyrSer: 3.791 ± 0.381
TyrThr: 2.63 ± 0.583
TyrVal: 4.409 ± 0.269
TyrTrp: 0.387 ± 0.331
TyrTyr: 2.243 ± 0.26
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (12928 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
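One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the frequency for each resampled proteome, and report the spread of those estimates. The sketch below follows that reading. Everything in it is an assumption made for illustration: the function names, the use of the sample standard deviation as the error, the fixed random seed, and the one-letter toy sequences; the procedure actually used for this page may differ in its details.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, pooled over `proteins`."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the per-mille frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code (not HCoV-229E data).
toy = ["MAAVL", "AVK", "KKAV"]
err = bootstrap_error(toy, "AV")
```

Resampling whole proteins rather than individual residue pairs keeps the pairs from one protein together, so the error reflects protein-to-protein variation. With only 8 proteins in this proteome, such estimates can be expected to be fairly coarse.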

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski