Amino acid dipeptide frequency for Equine coronavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
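The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build this page: the function names, the one-letter alphabet, the mapping of any non-standard letter to `X` (Xaa), and the use of the uniform 2.5‰ value as the "expected" threshold are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation over the 400 standard dipeptides, in per mille.
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5


def dipeptide_permille(sequences):
    """Return the frequency (in per mille) of all 441 dipeptides.

    Dipeptides are counted as overlapping pairs of adjacent residues
    within each protein; pairs never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Assumption: every non-standard letter is treated as Xaa.
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(value_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Toy input, not real Equine coronavirus sequences.
toy_freqs = dipeptide_permille(["AAAA", "ACB"])
```

With the toy input, the five counted pairs are AA (three times), AC and CX, so `toy_freqs["AA"]` is classified as over-represented and every dipeptide that never occurs as under-represented.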

All values are given in per mille (‰); to obtain fractions, multiply them by 10^-3.
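As a small worked example of the unit conversion, the sketch below turns per mille values into fractions and percentages. The helper names are hypothetical; the two input values are the ValVal and HisArg entries from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


val_val = permille_to_fraction(10.145)  # ValVal, the largest value in the table
his_arg = permille_to_fraction(0.069)   # HisArg, one of the smallest non-zero values
uniform = permille_to_percent(2.5)      # uniform expectation for 400 dipeptides
```

So the ValVal entry of 10.145‰ corresponds to a fraction of about 0.010145, and the uniform expectation of 2.5‰ is the same as 0.25%.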

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.661 ± 0.526
AlaCys: 2.262 ± 0.355
AlaAsp: 4.113 ± 0.504
AlaGlu: 1.508 ± 0.391
AlaPhe: 3.701 ± 0.55
AlaGly: 3.976 ± 0.237
AlaHis: 1.028 ± 0.222
AlaIle: 3.701 ± 0.534
AlaLys: 4.181 ± 0.583
AlaLeu: 6.306 ± 0.636
AlaMet: 1.165 ± 0.126
AlaAsn: 5.346 ± 0.531
AlaPro: 1.577 ± 0.669
AlaGln: 1.988 ± 0.263
AlaArg: 2.399 ± 0.28
AlaSer: 5.072 ± 0.42
AlaThr: 3.77 ± 0.231
AlaVal: 4.798 ± 0.356
AlaTrp: 1.234 ± 0.195
AlaTyr: 2.742 ± 0.237
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.125 ± 0.274
CysCys: 1.165 ± 0.249
CysAsp: 2.81 ± 0.497
CysGlu: 0.617 ± 0.15
CysPhe: 2.193 ± 0.239
CysGly: 2.399 ± 0.318
CysHis: 0.343 ± 0.127
CysIle: 1.782 ± 0.29
CysLys: 2.399 ± 0.473
CysLeu: 2.262 ± 0.256
CysMet: 0.343 ± 0.145
CysAsn: 2.399 ± 0.496
CysPro: 1.097 ± 0.303
CysGln: 0.891 ± 0.131
CysArg: 1.302 ± 0.223
CysSer: 3.222 ± 0.577
CysThr: 1.782 ± 0.377
CysVal: 3.085 ± 0.566
CysTrp: 0.548 ± 0.139
CysTyr: 2.536 ± 0.509
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.29 ± 0.262
AspCys: 2.399 ± 0.317
AspAsp: 3.427 ± 0.31
AspGlu: 2.399 ± 0.343
AspPhe: 4.181 ± 0.588
AspGly: 3.976 ± 0.362
AspHis: 0.891 ± 0.245
AspIle: 2.056 ± 0.263
AspLys: 3.29 ± 0.649
AspLeu: 4.935 ± 0.454
AspMet: 1.508 ± 0.261
AspAsn: 3.222 ± 0.282
AspPro: 1.302 ± 0.265
AspGln: 2.193 ± 0.295
AspArg: 1.439 ± 0.16
AspSer: 4.318 ± 0.409
AspThr: 3.016 ± 0.256
AspVal: 8.362 ± 1.101
AspTrp: 0.617 ± 0.163
AspTyr: 2.879 ± 0.323
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.29 ± 0.33
GluCys: 0.685 ± 0.133
GluAsp: 2.742 ± 0.324
GluGlu: 2.331 ± 0.266
GluPhe: 1.988 ± 0.235
GluGly: 1.782 ± 0.494
GluHis: 0.411 ± 0.141
GluIle: 2.331 ± 0.345
GluLys: 1.919 ± 0.27
GluLeu: 4.318 ± 0.481
GluMet: 0.823 ± 0.162
GluAsn: 1.577 ± 0.171
GluPro: 1.302 ± 0.528
GluGln: 0.96 ± 0.151
GluArg: 1.645 ± 0.195
GluSer: 1.302 ± 0.361
GluThr: 1.851 ± 0.324
GluVal: 3.085 ± 0.403
GluTrp: 0.206 ± 0.144
GluTyr: 1.645 ± 0.311
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.468 ± 0.321
PheCys: 1.577 ± 0.211
PheAsp: 3.77 ± 0.711
PheGlu: 2.331 ± 0.343
PhePhe: 1.165 ± 0.265
PheGly: 3.633 ± 0.412
PheHis: 0.617 ± 0.159
PheIle: 3.29 ± 0.401
PheLys: 3.633 ± 0.378
PheLeu: 3.359 ± 0.453
PheMet: 1.302 ± 0.209
PheAsn: 4.455 ± 1.34
PhePro: 0.96 ± 0.34
PheGln: 1.851 ± 0.276
PheArg: 1.851 ± 0.361
PheSer: 4.181 ± 0.378
PheThr: 3.77 ± 0.509
PheVal: 6.649 ± 1.586
PheTrp: 0.617 ± 0.18
PheTyr: 3.701 ± 0.549
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.947 ± 0.394
GlyCys: 3.427 ± 0.486
GlyAsp: 2.742 ± 0.291
GlyGlu: 1.439 ± 0.182
GlyPhe: 3.77 ± 0.994
GlyGly: 3.427 ± 0.404
GlyHis: 0.823 ± 0.174
GlyIle: 3.427 ± 1.141
GlyLys: 3.77 ± 0.517
GlyLeu: 5.209 ± 0.393
GlyMet: 1.165 ± 0.32
GlyAsn: 2.947 ± 0.548
GlyPro: 1.165 ± 0.275
GlyGln: 1.234 ± 0.336
GlyArg: 1.577 ± 0.29
GlySer: 5.072 ± 0.334
GlyThr: 4.181 ± 0.755
GlyVal: 6.306 ± 0.339
GlyTrp: 0.823 ± 0.26
GlyTyr: 3.153 ± 0.596
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.028 ± 0.281
HisCys: 0.343 ± 0.139
HisAsp: 0.754 ± 0.131
HisGlu: 0.685 ± 0.142
HisPhe: 1.371 ± 0.119
HisGly: 0.617 ± 0.232
HisHis: 0.137 ± 0.148
HisIle: 0.823 ± 0.242
HisLys: 1.097 ± 0.349
HisLeu: 1.508 ± 0.403
HisMet: 0.206 ± 0.068
HisAsn: 0.411 ± 0.097
HisPro: 0.48 ± 0.174
HisGln: 0.48 ± 0.128
HisArg: 0.069 ± 0.044
HisSer: 0.891 ± 0.195
HisThr: 0.617 ± 0.174
HisVal: 2.947 ± 0.857
HisTrp: 0.411 ± 0.094
HisTyr: 0.617 ± 0.198
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.77 ± 0.234
IleCys: 2.262 ± 0.255
IleAsp: 3.564 ± 0.293
IleGlu: 1.508 ± 0.289
IlePhe: 1.714 ± 0.501
IleGly: 2.947 ± 0.699
IleHis: 0.617 ± 0.14
IleIle: 3.77 ± 1.377
IleLys: 4.044 ± 0.739
IleLeu: 3.907 ± 0.629
IleMet: 1.302 ± 0.328
IleAsn: 3.427 ± 1.073
IlePro: 2.125 ± 0.592
IleGln: 2.536 ± 0.34
IleArg: 2.056 ± 0.373
IleSer: 3.701 ± 0.255
IleThr: 3.085 ± 0.223
IleVal: 5.415 ± 0.805
IleTrp: 0.617 ± 0.219
IleTyr: 1.851 ± 0.322
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.496 ± 0.49
LysCys: 2.331 ± 0.208
LysAsp: 2.673 ± 0.38
LysGlu: 2.399 ± 0.31
LysPhe: 3.153 ± 0.382
LysGly: 3.77 ± 0.621
LysHis: 1.371 ± 0.417
LysIle: 4.25 ± 0.761
LysLys: 1.508 ± 0.206
LysLeu: 7.129 ± 1.169
LysMet: 0.823 ± 0.262
LysAsn: 2.331 ± 0.386
LysPro: 3.085 ± 0.268
LysGln: 3.153 ± 0.596
LysArg: 2.056 ± 0.402
LysSer: 3.907 ± 0.365
LysThr: 1.851 ± 0.195
LysVal: 4.798 ± 0.485
LysTrp: 0.96 ± 0.13
LysTyr: 2.947 ± 0.563
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.032 ± 0.659
LeuCys: 2.605 ± 0.449
LeuAsp: 4.798 ± 0.681
LeuGlu: 3.976 ± 0.435
LeuPhe: 5.963 ± 0.58
LeuGly: 4.318 ± 0.756
LeuHis: 1.714 ± 0.283
LeuIle: 4.113 ± 0.611
LeuLys: 5.689 ± 0.679
LeuLeu: 8.225 ± 1.688
LeuMet: 2.056 ± 0.389
LeuAsn: 5.415 ± 0.441
LeuPro: 3.564 ± 0.425
LeuGln: 4.044 ± 0.737
LeuArg: 3.427 ± 0.548
LeuSer: 7.608 ± 0.466
LeuThr: 5.963 ± 0.488
LeuVal: 6.375 ± 0.468
LeuTrp: 1.439 ± 0.194
LeuTyr: 4.593 ± 0.366
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.468 ± 0.429
MetCys: 0.685 ± 0.304
MetAsp: 1.302 ± 0.206
MetGlu: 0.891 ± 0.129
MetPhe: 1.165 ± 0.193
MetGly: 0.891 ± 0.123
MetHis: 0.617 ± 0.198
MetIle: 0.823 ± 0.144
MetLys: 0.754 ± 0.181
MetLeu: 3.016 ± 0.512
MetMet: 0.343 ± 0.14
MetAsn: 0.96 ± 0.21
MetPro: 1.234 ± 0.408
MetGln: 0.823 ± 0.182
MetArg: 0.685 ± 0.173
MetSer: 1.165 ± 0.265
MetThr: 1.028 ± 0.245
MetVal: 1.782 ± 0.193
MetTrp: 0.48 ± 0.296
MetTyr: 0.891 ± 0.19
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.839 ± 0.78
AsnCys: 1.508 ± 0.16
AsnAsp: 1.851 ± 0.386
AsnGlu: 1.234 ± 0.223
AsnPhe: 3.29 ± 0.577
AsnGly: 4.935 ± 0.617
AsnHis: 0.96 ± 0.306
AsnIle: 1.919 ± 0.393
AsnLys: 2.947 ± 0.447
AsnLeu: 4.867 ± 0.579
AsnMet: 2.331 ± 0.254
AsnAsn: 3.222 ± 0.414
AsnPro: 2.262 ± 0.525
AsnGln: 2.056 ± 0.532
AsnArg: 1.988 ± 0.435
AsnSer: 4.113 ± 0.922
AsnThr: 3.085 ± 0.819
AsnVal: 5.963 ± 0.476
AsnTrp: 0.411 ± 0.066
AsnTyr: 1.919 ± 0.842
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.056 ± 0.353
ProCys: 1.371 ± 0.154
ProAsp: 1.577 ± 0.241
ProGlu: 1.508 ± 0.348
ProPhe: 1.371 ± 0.185
ProGly: 1.988 ± 0.395
ProHis: 0.548 ± 0.122
ProIle: 2.125 ± 0.474
ProLys: 2.331 ± 0.273
ProLeu: 2.673 ± 0.409
ProMet: 0.274 ± 0.155
ProAsn: 1.028 ± 0.659
ProPro: 1.577 ± 0.197
ProGln: 1.577 ± 0.483
ProArg: 1.097 ± 0.31
ProSer: 2.536 ± 0.505
ProThr: 3.222 ± 0.771
ProVal: 2.742 ± 0.252
ProTrp: 0.48 ± 0.222
ProTyr: 0.685 ± 0.2
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.399 ± 0.282
GlnCys: 0.617 ± 0.253
GlnAsp: 2.262 ± 0.173
GlnGlu: 2.056 ± 0.345
GlnPhe: 2.056 ± 0.395
GlnGly: 2.742 ± 0.284
GlnHis: 0.823 ± 0.26
GlnIle: 2.262 ± 0.318
GlnLys: 1.988 ± 0.45
GlnLeu: 3.496 ± 0.236
GlnMet: 0.548 ± 0.122
GlnAsn: 1.577 ± 0.686
GlnPro: 0.891 ± 0.375
GlnGln: 1.714 ± 0.65
GlnArg: 1.234 ± 0.242
GlnSer: 3.496 ± 0.375
GlnThr: 1.782 ± 0.237
GlnVal: 2.056 ± 0.173
GlnTrp: 1.097 ± 0.244
GlnTyr: 1.165 ± 0.164
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.399 ± 0.57
ArgCys: 0.891 ± 0.173
ArgAsp: 1.371 ± 0.174
ArgGlu: 1.577 ± 0.254
ArgPhe: 1.439 ± 0.382
ArgGly: 2.193 ± 0.47
ArgHis: 1.234 ± 0.314
ArgIle: 2.056 ± 0.283
ArgLys: 1.508 ± 0.207
ArgLeu: 3.359 ± 0.911
ArgMet: 0.411 ± 0.125
ArgAsn: 1.302 ± 0.365
ArgPro: 1.097 ± 0.338
ArgGln: 1.165 ± 0.572
ArgArg: 1.714 ± 0.495
ArgSer: 3.701 ± 1.007
ArgThr: 1.782 ± 0.188
ArgVal: 2.879 ± 0.427
ArgTrp: 0.343 ± 0.155
ArgTyr: 1.851 ± 0.312
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.661 ± 0.56
SerCys: 2.879 ± 0.508
SerAsp: 5.415 ± 0.454
SerGlu: 1.782 ± 0.454
SerPhe: 3.427 ± 0.318
SerGly: 3.976 ± 1.959
SerHis: 1.234 ± 0.279
SerIle: 4.593 ± 0.628
SerLys: 4.455 ± 0.575
SerLeu: 8.362 ± 0.575
SerMet: 1.919 ± 0.265
SerAsn: 3.633 ± 0.525
SerPro: 1.302 ± 0.868
SerGln: 2.468 ± 0.262
SerArg: 2.056 ± 1.302
SerSer: 5.346 ± 0.556
SerThr: 3.976 ± 0.657
SerVal: 7.266 ± 1.191
SerTrp: 0.548 ± 0.262
SerTyr: 4.318 ± 0.656
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.593 ± 0.754
ThrCys: 1.714 ± 0.516
ThrAsp: 3.222 ± 0.405
ThrGlu: 1.851 ± 0.285
ThrPhe: 5.141 ± 0.587
ThrGly: 3.976 ± 0.65
ThrHis: 0.617 ± 0.132
ThrIle: 3.496 ± 0.796
ThrLys: 2.125 ± 0.334
ThrLeu: 3.701 ± 0.35
ThrMet: 1.782 ± 0.212
ThrAsn: 2.605 ± 0.583
ThrPro: 1.782 ± 0.451
ThrGln: 1.714 ± 0.154
ThrArg: 2.125 ± 0.353
ThrSer: 4.044 ± 1.232
ThrThr: 4.113 ± 0.581
ThrVal: 5.209 ± 0.233
ThrTrp: 0.48 ± 0.201
ThrTyr: 2.673 ± 0.358
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.552 ± 0.372
ValCys: 3.976 ± 0.729
ValAsp: 7.677 ± 0.795
ValGlu: 3.496 ± 0.588
ValPhe: 4.25 ± 0.643
ValGly: 3.907 ± 0.373
ValHis: 0.274 ± 0.094
ValIle: 4.798 ± 0.66
ValLys: 6.854 ± 0.799
ValLeu: 9.459 ± 1.148
ValMet: 2.125 ± 0.317
ValAsn: 5.072 ± 1.011
ValPro: 4.25 ± 0.422
ValGln: 3.29 ± 0.366
ValArg: 2.673 ± 0.369
ValSer: 6.375 ± 0.701
ValThr: 3.77 ± 0.639
ValVal: 10.145 ± 1.278
ValTrp: 1.028 ± 0.247
ValTyr: 6.306 ± 0.756
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.548 ± 0.123
TrpCys: 0.411 ± 0.109
TrpAsp: 0.411 ± 0.211
TrpGlu: 0.206 ± 0.185
TrpPhe: 0.754 ± 0.142
TrpGly: 0.343 ± 0.108
TrpHis: 0.411 ± 0.066
TrpIle: 0.685 ± 0.191
TrpLys: 0.137 ± 0.087
TrpLeu: 2.125 ± 0.382
TrpMet: 0.343 ± 0.231
TrpAsn: 0.891 ± 0.209
TrpPro: 0.48 ± 0.208
TrpGln: 0.823 ± 0.133
TrpArg: 0.823 ± 0.218
TrpSer: 0.754 ± 0.137
TrpThr: 0.685 ± 0.121
TrpVal: 1.165 ± 0.184
TrpTrp: 0.069 ± 0.155
TrpTyr: 0.891 ± 0.296
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.907 ± 0.745
TyrCys: 2.193 ± 0.293
TyrAsp: 3.222 ± 0.478
TyrGlu: 2.399 ± 0.243
TyrPhe: 3.085 ± 0.402
TyrGly: 2.81 ± 0.389
TyrHis: 0.823 ± 0.206
TyrIle: 1.919 ± 0.254
TyrLys: 3.359 ± 0.7
TyrLeu: 3.701 ± 0.237
TyrMet: 1.234 ± 0.201
TyrAsn: 2.879 ± 0.32
TyrPro: 1.234 ± 0.268
TyrGln: 1.302 ± 0.149
TyrArg: 2.056 ± 0.877
TyrSer: 2.81 ± 0.485
TyrThr: 3.633 ± 0.324
TyrVal: 4.25 ± 0.587
TyrTrp: 0.548 ± 0.101
TyrTyr: 3.701 ± 0.619
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (14590 amino acids)

Note: The error was estimated by bootstrapping (100 rounds) at the protein level.
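A protein-level bootstrap can be sketched as follows. The exact procedure used for this page is not described here, so the sketch makes its own assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the error is taken as the sample standard deviation over the rounds. The function names, the seed and the toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_permille(sequences, dipeptide):
    """Frequency (in per mille) of one dipeptide over a set of proteins."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by protein-level bootstrap.

    Assumption: each round draws len(sequences) whole proteins with
    replacement; the error is the standard deviation of the estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        resample = [rng.choice(sequences) for _ in sequences]
        estimates.append(dipeptide_permille(resample, dipeptide))
    return statistics.stdev(estimates)


# Toy proteins in one-letter code, not real Equine coronavirus sequences.
toy_proteins = ["MKVLAAGVV", "MVVLSDVV", "MAAKLLVV"]
vv_error = bootstrap_error(toy_proteins, "VV")
ww_error = bootstrap_error(toy_proteins, "WW")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is one plausible reading of "at the protein level". A dipeptide that is absent from every protein, such as WW in the toy set, gets an error of zero, matching the `0.0 ± 0.0` entries in the Xaa row and column.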

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski