Amino acid dipepetide frequency for Betacoronavirus Erinaceus/VMC/DEU/2012

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.012AlaAla: 6.012 ± 0.512
1.845AlaCys: 1.845 ± 0.298
2.323AlaAsp: 2.323 ± 0.345
2.528AlaGlu: 2.528 ± 0.492
3.143AlaPhe: 3.143 ± 0.371
3.757AlaGly: 3.757 ± 0.582
1.845AlaHis: 1.845 ± 0.627
3.552AlaIle: 3.552 ± 0.446
3.757AlaLys: 3.757 ± 0.549
6.763AlaLeu: 6.763 ± 1.006
1.981AlaMet: 1.981 ± 0.478
4.44AlaAsn: 4.44 ± 0.593
2.733AlaPro: 2.733 ± 1.049
1.981AlaGln: 1.981 ± 0.359
2.459AlaArg: 2.459 ± 0.574
6.558AlaSer: 6.558 ± 0.577
4.714AlaThr: 4.714 ± 0.602
5.807AlaVal: 5.807 ± 0.866
0.82AlaTrp: 0.82 ± 0.223
3.211AlaTyr: 3.211 ± 0.372
0.0AlaXaa: 0.0 ± 0.0
Cys
1.161CysAla: 1.161 ± 0.241
0.956CysCys: 0.956 ± 0.22
1.571CysAsp: 1.571 ± 0.39
1.23CysGlu: 1.23 ± 0.248
1.64CysPhe: 1.64 ± 0.462
2.254CysGly: 2.254 ± 0.503
0.41CysHis: 0.41 ± 0.285
1.298CysIle: 1.298 ± 0.55
1.64CysLys: 1.64 ± 0.37
3.143CysLeu: 3.143 ± 0.376
0.547CysMet: 0.547 ± 0.197
1.845CysAsn: 1.845 ± 0.366
0.751CysPro: 0.751 ± 0.161
0.615CysGln: 0.615 ± 0.14
1.161CysArg: 1.161 ± 0.195
1.913CysSer: 1.913 ± 0.325
3.006CysThr: 3.006 ± 0.364
5.124CysVal: 5.124 ± 1.324
0.205CysTrp: 0.205 ± 0.13
1.776CysTyr: 1.776 ± 0.562
0.0CysXaa: 0.0 ± 0.0
Asp
4.645AspAla: 4.645 ± 0.651
1.776AspCys: 1.776 ± 0.326
2.254AspAsp: 2.254 ± 0.375
2.186AspGlu: 2.186 ± 0.385
2.459AspPhe: 2.459 ± 0.5
3.689AspGly: 3.689 ± 0.397
0.547AspHis: 0.547 ± 0.23
2.323AspIle: 2.323 ± 0.61
3.143AspLys: 3.143 ± 0.481
3.962AspLeu: 3.962 ± 0.54
0.956AspMet: 0.956 ± 0.586
2.459AspAsn: 2.459 ± 0.687
1.981AspPro: 1.981 ± 0.618
1.435AspGln: 1.435 ± 0.407
1.503AspArg: 1.503 ± 0.309
3.826AspSer: 3.826 ± 0.496
2.596AspThr: 2.596 ± 0.323
5.329AspVal: 5.329 ± 0.966
1.093AspTrp: 1.093 ± 0.335
2.869AspTyr: 2.869 ± 0.559
0.0AspXaa: 0.0 ± 0.0
Glu
3.211GluAla: 3.211 ± 0.694
1.435GluCys: 1.435 ± 0.361
3.347GluAsp: 3.347 ± 0.328
3.689GluGlu: 3.689 ± 0.707
2.391GluPhe: 2.391 ± 0.376
2.664GluGly: 2.664 ± 0.537
0.683GluHis: 0.683 ± 0.364
1.776GluIle: 1.776 ± 0.313
2.869GluLys: 2.869 ± 0.341
4.509GluLeu: 4.509 ± 0.531
0.615GluMet: 0.615 ± 0.14
1.025GluAsn: 1.025 ± 0.209
1.776GluPro: 1.776 ± 0.222
2.049GluGln: 2.049 ± 0.33
1.708GluArg: 1.708 ± 0.342
1.845GluSer: 1.845 ± 0.272
1.981GluThr: 1.981 ± 0.432
3.552GluVal: 3.552 ± 0.747
0.615GluTrp: 0.615 ± 0.127
1.435GluTyr: 1.435 ± 0.467
0.0GluXaa: 0.0 ± 0.0
Phe
2.528PheAla: 2.528 ± 0.319
1.503PheCys: 1.503 ± 0.288
2.254PheAsp: 2.254 ± 0.477
1.776PheGlu: 1.776 ± 0.625
1.23PhePhe: 1.23 ± 0.253
2.391PheGly: 2.391 ± 0.93
0.342PheHis: 0.342 ± 0.201
3.689PheIle: 3.689 ± 0.745
3.552PheLys: 3.552 ± 1.053
3.757PheLeu: 3.757 ± 0.671
1.025PheMet: 1.025 ± 0.289
3.826PheAsn: 3.826 ± 0.536
1.025PhePro: 1.025 ± 0.521
1.503PheGln: 1.503 ± 0.729
1.366PheArg: 1.366 ± 0.375
4.577PheSer: 4.577 ± 0.575
3.006PheThr: 3.006 ± 0.496
6.217PheVal: 6.217 ± 1.47
0.751PheTrp: 0.751 ± 0.23
2.391PheTyr: 2.391 ± 0.515
0.0PheXaa: 0.0 ± 0.0
Gly
3.826GlyAla: 3.826 ± 0.374
1.708GlyCys: 1.708 ± 0.384
3.211GlyAsp: 3.211 ± 0.368
1.708GlyGlu: 1.708 ± 0.659
3.484GlyPhe: 3.484 ± 0.744
2.938GlyGly: 2.938 ± 0.637
1.23GlyHis: 1.23 ± 0.414
4.167GlyIle: 4.167 ± 0.513
2.323GlyLys: 2.323 ± 0.262
4.782GlyLeu: 4.782 ± 0.392
0.683GlyMet: 0.683 ± 0.195
3.279GlyAsn: 3.279 ± 0.836
1.571GlyPro: 1.571 ± 0.808
2.323GlyGln: 2.323 ± 0.439
1.093GlyArg: 1.093 ± 0.608
5.397GlySer: 5.397 ± 0.552
4.167GlyThr: 4.167 ± 0.41
6.49GlyVal: 6.49 ± 0.598
0.137GlyTrp: 0.137 ± 0.087
2.664GlyTyr: 2.664 ± 1.381
0.0GlyXaa: 0.0 ± 0.0
His
1.845HisAla: 1.845 ± 0.328
1.161HisCys: 1.161 ± 0.339
0.956HisAsp: 0.956 ± 0.333
0.41HisGlu: 0.41 ± 0.16
0.956HisPhe: 0.956 ± 0.379
1.298HisGly: 1.298 ± 0.149
0.205HisHis: 0.205 ± 0.185
1.435HisIle: 1.435 ± 0.491
1.435HisLys: 1.435 ± 0.212
1.366HisLeu: 1.366 ± 0.238
0.615HisMet: 0.615 ± 0.194
0.342HisAsn: 0.342 ± 0.132
1.093HisPro: 1.093 ± 0.237
0.547HisGln: 0.547 ± 0.375
0.273HisArg: 0.273 ± 0.115
1.845HisSer: 1.845 ± 0.662
1.23HisThr: 1.23 ± 0.329
1.708HisVal: 1.708 ± 0.444
0.478HisTrp: 0.478 ± 0.187
0.888HisTyr: 0.888 ± 0.264
0.0HisXaa: 0.0 ± 0.0
Ile
3.621IleAla: 3.621 ± 0.735
1.776IleCys: 1.776 ± 0.483
2.459IleAsp: 2.459 ± 0.547
1.708IleGlu: 1.708 ± 0.288
1.64IlePhe: 1.64 ± 0.361
2.254IleGly: 2.254 ± 0.714
0.751IleHis: 0.751 ± 0.641
1.64IleIle: 1.64 ± 0.71
3.894IleLys: 3.894 ± 0.341
4.577IleLeu: 4.577 ± 0.705
0.683IleMet: 0.683 ± 0.113
2.596IleAsn: 2.596 ± 0.985
2.528IlePro: 2.528 ± 0.346
1.298IleGln: 1.298 ± 0.336
1.366IleArg: 1.366 ± 0.474
3.074IleSer: 3.074 ± 0.716
2.596IleThr: 2.596 ± 1.253
4.577IleVal: 4.577 ± 0.817
0.478IleTrp: 0.478 ± 0.181
1.571IleTyr: 1.571 ± 0.327
0.0IleXaa: 0.0 ± 0.0
Lys
3.826LysAla: 3.826 ± 0.545
1.981LysCys: 1.981 ± 0.413
2.869LysAsp: 2.869 ± 0.341
3.074LysGlu: 3.074 ± 0.715
3.211LysPhe: 3.211 ± 0.99
3.552LysGly: 3.552 ± 0.741
2.049LysHis: 2.049 ± 0.714
1.913LysIle: 1.913 ± 0.456
2.459LysLys: 2.459 ± 0.74
6.832LysLeu: 6.832 ± 0.903
1.913LysMet: 1.913 ± 0.669
3.621LysAsn: 3.621 ± 0.912
3.621LysPro: 3.621 ± 0.574
3.211LysGln: 3.211 ± 1.102
1.503LysArg: 1.503 ± 0.465
3.484LysSer: 3.484 ± 0.461
3.279LysThr: 3.279 ± 0.447
4.236LysVal: 4.236 ± 0.413
0.547LysTrp: 0.547 ± 0.181
2.118LysTyr: 2.118 ± 0.38
0.0LysXaa: 0.0 ± 0.0
Leu
7.788LeuAla: 7.788 ± 1.089
3.621LeuCys: 3.621 ± 0.872
3.962LeuAsp: 3.962 ± 0.544
3.826LeuGlu: 3.826 ± 0.793
4.304LeuPhe: 4.304 ± 0.844
4.44LeuGly: 4.44 ± 0.406
1.845LeuHis: 1.845 ± 0.599
2.664LeuIle: 2.664 ± 0.444
5.67LeuLys: 5.67 ± 0.584
10.111LeuLeu: 10.111 ± 1.275
2.733LeuMet: 2.733 ± 0.463
5.67LeuAsn: 5.67 ± 1.046
3.211LeuPro: 3.211 ± 0.672
4.919LeuGln: 4.919 ± 0.538
3.621LeuArg: 3.621 ± 0.568
7.788LeuSer: 7.788 ± 0.861
7.378LeuThr: 7.378 ± 0.873
7.241LeuVal: 7.241 ± 0.642
1.025LeuTrp: 1.025 ± 0.274
4.031LeuTyr: 4.031 ± 0.739
0.0LeuXaa: 0.0 ± 0.0
Met
1.435MetAla: 1.435 ± 0.453
1.298MetCys: 1.298 ± 0.388
0.683MetAsp: 0.683 ± 0.139
0.956MetGlu: 0.956 ± 0.287
1.64MetPhe: 1.64 ± 0.559
1.435MetGly: 1.435 ± 0.38
1.503MetHis: 1.503 ± 0.359
0.478MetIle: 0.478 ± 0.146
0.683MetLys: 0.683 ± 0.635
3.894MetLeu: 3.894 ± 0.619
0.82MetMet: 0.82 ± 0.311
0.888MetAsn: 0.888 ± 0.211
0.683MetPro: 0.683 ± 0.474
1.23MetGln: 1.23 ± 0.254
0.888MetArg: 0.888 ± 0.255
1.435MetSer: 1.435 ± 0.451
1.025MetThr: 1.025 ± 0.251
1.845MetVal: 1.845 ± 0.344
0.342MetTrp: 0.342 ± 0.13
2.118MetTyr: 2.118 ± 0.606
0.0MetXaa: 0.0 ± 0.0
Asn
3.347AsnAla: 3.347 ± 0.435
2.049AsnCys: 2.049 ± 0.159
2.664AsnAsp: 2.664 ± 0.501
2.049AsnGlu: 2.049 ± 0.581
3.552AsnPhe: 3.552 ± 0.449
5.192AsnGly: 5.192 ± 0.569
1.025AsnHis: 1.025 ± 0.225
2.869AsnIle: 2.869 ± 0.546
3.552AsnLys: 3.552 ± 0.738
4.099AsnLeu: 4.099 ± 0.553
1.503AsnMet: 1.503 ± 0.286
3.757AsnAsn: 3.757 ± 0.496
1.981AsnPro: 1.981 ± 0.7
1.503AsnGln: 1.503 ± 0.87
1.503AsnArg: 1.503 ± 0.462
3.689AsnSer: 3.689 ± 0.539
2.869AsnThr: 2.869 ± 0.95
4.987AsnVal: 4.987 ± 0.785
0.956AsnTrp: 0.956 ± 0.23
2.733AsnTyr: 2.733 ± 1.306
0.0AsnXaa: 0.0 ± 0.0
Pro
2.733ProAla: 2.733 ± 0.529
1.161ProCys: 1.161 ± 0.207
1.913ProAsp: 1.913 ± 0.38
2.254ProGlu: 2.254 ± 0.775
2.118ProPhe: 2.118 ± 0.236
2.254ProGly: 2.254 ± 0.651
0.888ProHis: 0.888 ± 0.155
2.049ProIle: 2.049 ± 0.825
2.323ProLys: 2.323 ± 1.784
3.962ProLeu: 3.962 ± 0.609
0.956ProMet: 0.956 ± 0.33
2.049ProAsn: 2.049 ± 0.237
1.776ProPro: 1.776 ± 0.83
1.366ProGln: 1.366 ± 0.934
1.161ProArg: 1.161 ± 0.904
3.143ProSer: 3.143 ± 0.455
2.049ProThr: 2.049 ± 0.814
3.074ProVal: 3.074 ± 0.931
0.478ProTrp: 0.478 ± 0.12
1.366ProTyr: 1.366 ± 0.379
0.0ProXaa: 0.0 ± 0.0
Gln
2.049GlnAla: 2.049 ± 0.453
1.366GlnCys: 1.366 ± 0.261
2.118GlnAsp: 2.118 ± 0.684
2.049GlnGlu: 2.049 ± 0.315
0.956GlnPhe: 0.956 ± 0.369
2.664GlnGly: 2.664 ± 0.266
0.205GlnHis: 0.205 ± 0.28
2.323GlnIle: 2.323 ± 0.37
2.254GlnLys: 2.254 ± 0.506
5.192GlnLeu: 5.192 ± 0.639
1.298GlnMet: 1.298 ± 0.378
1.366GlnAsn: 1.366 ± 0.592
2.049GlnPro: 2.049 ± 1.276
2.049GlnGln: 2.049 ± 0.562
0.751GlnArg: 0.751 ± 0.445
2.664GlnSer: 2.664 ± 0.721
1.708GlnThr: 1.708 ± 0.421
1.845GlnVal: 1.845 ± 0.324
0.547GlnTrp: 0.547 ± 0.377
1.503GlnTyr: 1.503 ± 0.539
0.0GlnXaa: 0.0 ± 0.0
Arg
2.869ArgAla: 2.869 ± 0.473
0.956ArgCys: 0.956 ± 0.31
1.435ArgAsp: 1.435 ± 0.349
1.435ArgGlu: 1.435 ± 0.316
1.298ArgPhe: 1.298 ± 0.347
1.571ArgGly: 1.571 ± 0.751
1.161ArgHis: 1.161 ± 0.341
1.298ArgIle: 1.298 ± 0.303
1.64ArgLys: 1.64 ± 0.252
2.186ArgLeu: 2.186 ± 0.465
0.41ArgMet: 0.41 ± 0.095
1.981ArgAsn: 1.981 ± 0.328
1.571ArgPro: 1.571 ± 0.846
1.298ArgGln: 1.298 ± 0.3
1.571ArgArg: 1.571 ± 0.824
3.006ArgSer: 3.006 ± 1.363
1.298ArgThr: 1.298 ± 0.216
3.347ArgVal: 3.347 ± 0.732
0.273ArgTrp: 0.273 ± 0.393
0.956ArgTyr: 0.956 ± 0.451
0.0ArgXaa: 0.0 ± 0.0
Ser
6.148SerAla: 6.148 ± 0.47
1.025SerCys: 1.025 ± 0.248
4.987SerAsp: 4.987 ± 0.638
3.074SerGlu: 3.074 ± 0.38
4.236SerPhe: 4.236 ± 0.665
4.304SerGly: 4.304 ± 0.526
1.845SerHis: 1.845 ± 0.501
2.869SerIle: 2.869 ± 0.82
4.44SerLys: 4.44 ± 0.327
6.558SerLeu: 6.558 ± 0.85
2.801SerMet: 2.801 ± 0.251
4.44SerAsn: 4.44 ± 0.929
1.776SerPro: 1.776 ± 0.218
2.596SerGln: 2.596 ± 0.882
2.938SerArg: 2.938 ± 1.604
5.329SerSer: 5.329 ± 1.465
4.44SerThr: 4.44 ± 0.254
7.31SerVal: 7.31 ± 0.768
1.025SerTrp: 1.025 ± 0.138
3.621SerTyr: 3.621 ± 0.576
0.0SerXaa: 0.0 ± 0.0
Thr
4.236ThrAla: 4.236 ± 0.946
1.503ThrCys: 1.503 ± 0.632
1.913ThrAsp: 1.913 ± 0.315
1.776ThrGlu: 1.776 ± 0.357
3.279ThrPhe: 3.279 ± 0.302
4.714ThrGly: 4.714 ± 0.913
1.161ThrHis: 1.161 ± 0.189
2.186ThrIle: 2.186 ± 0.604
2.459ThrLys: 2.459 ± 0.525
6.49ThrLeu: 6.49 ± 0.659
2.049ThrMet: 2.049 ± 0.409
3.962ThrAsn: 3.962 ± 0.444
2.596ThrPro: 2.596 ± 0.471
2.049ThrGln: 2.049 ± 1.394
1.64ThrArg: 1.64 ± 0.277
4.099ThrSer: 4.099 ± 0.644
3.894ThrThr: 3.894 ± 0.571
6.148ThrVal: 6.148 ± 0.579
0.273ThrTrp: 0.273 ± 0.115
3.484ThrTyr: 3.484 ± 0.816
0.0ThrXaa: 0.0 ± 0.0
Val
5.875ValAla: 5.875 ± 1.162
3.074ValCys: 3.074 ± 0.492
6.285ValAsp: 6.285 ± 0.762
5.124ValGlu: 5.124 ± 0.973
3.416ValPhe: 3.416 ± 0.471
3.621ValGly: 3.621 ± 0.313
1.298ValHis: 1.298 ± 0.235
4.577ValIle: 4.577 ± 0.438
6.968ValLys: 6.968 ± 1.64
8.608ValLeu: 8.608 ± 0.957
1.981ValMet: 1.981 ± 0.443
5.124ValAsn: 5.124 ± 1.03
4.167ValPro: 4.167 ± 0.46
3.757ValGln: 3.757 ± 0.431
3.143ValArg: 3.143 ± 0.746
7.173ValSer: 7.173 ± 0.746
4.919ValThr: 4.919 ± 0.219
10.384ValVal: 10.384 ± 2.366
0.956ValTrp: 0.956 ± 0.213
4.44ValTyr: 4.44 ± 0.636
0.0ValXaa: 0.0 ± 0.0
Trp
0.82TrpAla: 0.82 ± 0.319
0.547TrpCys: 0.547 ± 0.211
0.82TrpAsp: 0.82 ± 0.264
0.342TrpGlu: 0.342 ± 0.181
1.298TrpPhe: 1.298 ± 0.326
0.137TrpGly: 0.137 ± 0.087
0.068TrpHis: 0.068 ± 0.043
0.205TrpIle: 0.205 ± 0.08
0.751TrpLys: 0.751 ± 0.3
1.571TrpLeu: 1.571 ± 0.408
0.273TrpMet: 0.273 ± 0.115
0.547TrpAsn: 0.547 ± 0.152
0.41TrpPro: 0.41 ± 0.321
0.273TrpGln: 0.273 ± 0.282
0.41TrpArg: 0.41 ± 0.17
1.025TrpSer: 1.025 ± 0.281
0.205TrpThr: 0.205 ± 0.239
1.025TrpVal: 1.025 ± 0.34
0.137TrpTrp: 0.137 ± 0.134
0.547TrpTyr: 0.547 ± 0.358
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.254TyrAla: 2.254 ± 0.582
1.503TyrCys: 1.503 ± 0.245
3.006TyrAsp: 3.006 ± 0.704
1.981TyrGlu: 1.981 ± 0.539
2.459TyrPhe: 2.459 ± 0.46
2.049TyrGly: 2.049 ± 0.332
1.025TyrHis: 1.025 ± 0.242
1.776TyrIle: 1.776 ± 0.404
3.484TyrLys: 3.484 ± 0.578
3.484TyrLeu: 3.484 ± 0.669
1.366TyrMet: 1.366 ± 0.161
2.733TyrAsn: 2.733 ± 0.283
1.776TyrPro: 1.776 ± 0.634
0.956TyrGln: 0.956 ± 0.194
1.503TyrArg: 1.503 ± 0.292
3.894TyrSer: 3.894 ± 0.733
3.552TyrThr: 3.552 ± 0.396
4.645TyrVal: 4.645 ± 0.753
0.273TyrTrp: 0.273 ± 0.393
2.664TyrTyr: 2.664 ± 0.4
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (14639 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski