Amino acid dipepetide frequency for Porcine coronavirus HKU15

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.903AlaAla: 4.903 ± 0.724
2.272AlaCys: 2.272 ± 0.531
3.946AlaAsp: 3.946 ± 0.877
2.511AlaGlu: 2.511 ± 0.531
4.066AlaPhe: 4.066 ± 0.731
3.468AlaGly: 3.468 ± 1.233
2.392AlaHis: 2.392 ± 0.771
5.979AlaIle: 5.979 ± 0.79
4.664AlaLys: 4.664 ± 1.52
8.371AlaLeu: 8.371 ± 0.719
1.794AlaMet: 1.794 ± 0.407
4.784AlaAsn: 4.784 ± 0.752
2.392AlaPro: 2.392 ± 0.802
3.348AlaGln: 3.348 ± 1.329
3.348AlaArg: 3.348 ± 0.467
4.425AlaSer: 4.425 ± 1.294
6.099AlaThr: 6.099 ± 1.07
5.262AlaVal: 5.262 ± 1.404
0.239AlaTrp: 0.239 ± 0.456
2.751AlaTyr: 2.751 ± 0.82
0.0AlaXaa: 0.0 ± 0.0
Cys
1.315CysAla: 1.315 ± 0.678
0.957CysCys: 0.957 ± 0.331
1.196CysAsp: 1.196 ± 0.437
0.957CysGlu: 0.957 ± 0.331
1.794CysPhe: 1.794 ± 0.467
1.435CysGly: 1.435 ± 0.654
0.239CysHis: 0.239 ± 0.468
1.794CysIle: 1.794 ± 0.774
0.957CysLys: 0.957 ± 0.493
1.555CysLeu: 1.555 ± 0.679
0.598CysMet: 0.598 ± 0.308
1.674CysAsn: 1.674 ± 0.759
1.435CysPro: 1.435 ± 0.553
0.837CysGln: 0.837 ± 0.273
0.837CysArg: 0.837 ± 0.431
1.794CysSer: 1.794 ± 1.258
1.674CysThr: 1.674 ± 0.83
2.272CysVal: 2.272 ± 0.451
0.359CysTrp: 0.359 ± 0.185
1.196CysTyr: 1.196 ± 0.598
0.0CysXaa: 0.0 ± 0.0
Asp
4.664AspAla: 4.664 ± 1.107
1.435AspCys: 1.435 ± 0.453
2.99AspAsp: 2.99 ± 0.726
2.033AspGlu: 2.033 ± 0.803
2.392AspPhe: 2.392 ± 0.528
3.348AspGly: 3.348 ± 0.938
0.598AspHis: 0.598 ± 0.41
3.348AspIle: 3.348 ± 1.196
2.153AspLys: 2.153 ± 0.815
4.425AspLeu: 4.425 ± 1.496
0.837AspMet: 0.837 ± 0.356
3.229AspAsn: 3.229 ± 1.14
2.392AspPro: 2.392 ± 1.086
1.794AspGln: 1.794 ± 0.418
1.555AspArg: 1.555 ± 0.82
3.946AspSer: 3.946 ± 0.85
3.468AspThr: 3.468 ± 0.504
4.425AspVal: 4.425 ± 1.233
0.718AspTrp: 0.718 ± 0.637
3.109AspTyr: 3.109 ± 0.826
0.0AspXaa: 0.0 ± 0.0
Glu
2.511GluAla: 2.511 ± 0.705
1.196GluCys: 1.196 ± 0.437
2.511GluAsp: 2.511 ± 1.035
2.153GluGlu: 2.153 ± 0.682
2.033GluPhe: 2.033 ± 0.827
1.913GluGly: 1.913 ± 0.587
1.196GluHis: 1.196 ± 0.616
2.033GluIle: 2.033 ± 0.527
1.794GluLys: 1.794 ± 0.53
4.544GluLeu: 4.544 ± 0.935
1.076GluMet: 1.076 ± 0.381
1.794GluAsn: 1.794 ± 0.365
2.033GluPro: 2.033 ± 0.551
1.913GluGln: 1.913 ± 0.65
1.555GluArg: 1.555 ± 0.419
2.153GluSer: 2.153 ± 0.845
2.272GluThr: 2.272 ± 0.725
3.109GluVal: 3.109 ± 0.737
0.718GluTrp: 0.718 ± 0.663
2.033GluTyr: 2.033 ± 0.608
0.0GluXaa: 0.0 ± 0.0
Phe
3.348PheAla: 3.348 ± 0.629
1.196PheCys: 1.196 ± 0.377
2.751PheAsp: 2.751 ± 1.047
1.794PheGlu: 1.794 ± 0.752
1.076PhePhe: 1.076 ± 0.491
2.272PheGly: 2.272 ± 0.51
0.598PheHis: 0.598 ± 0.409
2.87PheIle: 2.87 ± 1.628
2.153PheLys: 2.153 ± 0.711
3.946PheLeu: 3.946 ± 1.099
0.598PheMet: 0.598 ± 0.342
3.229PheAsn: 3.229 ± 0.756
1.555PhePro: 1.555 ± 0.511
1.435PheGln: 1.435 ± 0.472
1.435PheArg: 1.435 ± 0.6
3.109PheSer: 3.109 ± 0.972
3.468PheThr: 3.468 ± 0.773
2.87PheVal: 2.87 ± 0.857
0.239PheTrp: 0.239 ± 0.123
3.468PheTyr: 3.468 ± 0.618
0.0PheXaa: 0.0 ± 0.0
Gly
3.468GlyAla: 3.468 ± 0.822
1.555GlyCys: 1.555 ± 0.904
2.87GlyAsp: 2.87 ± 0.844
2.033GlyGlu: 2.033 ± 0.838
1.913GlyPhe: 1.913 ± 0.418
2.631GlyGly: 2.631 ± 0.518
0.837GlyHis: 0.837 ± 0.207
3.946GlyIle: 3.946 ± 0.933
2.99GlyLys: 2.99 ± 0.706
2.99GlyLeu: 2.99 ± 0.977
0.478GlyMet: 0.478 ± 0.282
2.99GlyAsn: 2.99 ± 0.972
1.674GlyPro: 1.674 ± 0.74
1.555GlyGln: 1.555 ± 0.707
1.794GlyArg: 1.794 ± 0.369
3.348GlySer: 3.348 ± 1.247
4.903GlyThr: 4.903 ± 0.561
5.023GlyVal: 5.023 ± 1.037
0.478GlyTrp: 0.478 ± 0.247
1.913GlyTyr: 1.913 ± 0.325
0.0GlyXaa: 0.0 ± 0.0
His
2.033HisAla: 2.033 ± 0.705
0.359HisCys: 0.359 ± 0.185
0.837HisAsp: 0.837 ± 0.273
0.957HisGlu: 0.957 ± 0.325
1.315HisPhe: 1.315 ± 0.494
0.837HisGly: 0.837 ± 0.997
0.478HisHis: 0.478 ± 0.247
2.033HisIle: 2.033 ± 0.474
1.196HisLys: 1.196 ± 0.437
2.751HisLeu: 2.751 ± 0.606
0.718HisMet: 0.718 ± 0.379
0.957HisAsn: 0.957 ± 0.331
1.076HisPro: 1.076 ± 0.436
1.196HisGln: 1.196 ± 1.022
0.718HisArg: 0.718 ± 0.327
0.837HisSer: 0.837 ± 0.273
1.794HisThr: 1.794 ± 0.384
2.511HisVal: 2.511 ± 1.011
0.0HisTrp: 0.0 ± 0.0
0.837HisTyr: 0.837 ± 0.751
0.0HisXaa: 0.0 ± 0.0
Ile
4.425IleAla: 4.425 ± 0.72
1.315IleCys: 1.315 ± 0.459
3.588IleAsp: 3.588 ± 0.655
2.033IleGlu: 2.033 ± 0.705
2.272IlePhe: 2.272 ± 0.463
2.392IleGly: 2.392 ± 0.829
0.957IleHis: 0.957 ± 0.493
4.066IleIle: 4.066 ± 2.05
3.229IleLys: 3.229 ± 0.753
6.338IleLeu: 6.338 ± 2.43
1.315IleMet: 1.315 ± 0.403
2.99IleAsn: 2.99 ± 0.848
3.588IlePro: 3.588 ± 0.506
2.511IleGln: 2.511 ± 0.843
2.631IleArg: 2.631 ± 0.807
4.186IleSer: 4.186 ± 0.872
3.707IleThr: 3.707 ± 1.43
5.023IleVal: 5.023 ± 1.088
0.598IleTrp: 0.598 ± 0.921
2.751IleTyr: 2.751 ± 0.662
0.0IleXaa: 0.0 ± 0.0
Lys
4.784LysAla: 4.784 ± 1.824
1.555LysCys: 1.555 ± 0.693
2.153LysAsp: 2.153 ± 0.902
1.674LysGlu: 1.674 ± 0.474
2.272LysPhe: 2.272 ± 0.725
1.794LysGly: 1.794 ± 0.534
1.196LysHis: 1.196 ± 0.502
2.99LysIle: 2.99 ± 0.726
2.99LysLys: 2.99 ± 1.181
4.305LysLeu: 4.305 ± 1.461
0.598LysMet: 0.598 ± 0.189
1.913LysAsn: 1.913 ± 0.6
4.664LysPro: 4.664 ± 1.974
1.555LysGln: 1.555 ± 1.124
1.794LysArg: 1.794 ± 1.006
2.392LysSer: 2.392 ± 0.557
4.784LysThr: 4.784 ± 1.012
2.99LysVal: 2.99 ± 0.751
0.359LysTrp: 0.359 ± 0.164
2.631LysTyr: 2.631 ± 0.794
0.0LysXaa: 0.0 ± 0.0
Leu
9.926LeuAla: 9.926 ± 0.956
1.674LeuCys: 1.674 ± 0.395
3.946LeuAsp: 3.946 ± 1.355
3.707LeuGlu: 3.707 ± 0.78
4.186LeuPhe: 4.186 ± 1.08
3.707LeuGly: 3.707 ± 0.618
2.272LeuHis: 2.272 ± 1.399
3.827LeuIle: 3.827 ± 1.371
4.784LeuLys: 4.784 ± 1.224
8.73LeuLeu: 8.73 ± 2.723
2.033LeuMet: 2.033 ± 1.001
5.74LeuAsn: 5.74 ± 1.332
5.979LeuPro: 5.979 ± 1.988
5.979LeuGln: 5.979 ± 1.152
3.468LeuArg: 3.468 ± 1.895
5.381LeuSer: 5.381 ± 1.238
8.371LeuThr: 8.371 ± 1.589
6.338LeuVal: 6.338 ± 1.422
0.478LeuTrp: 0.478 ± 0.416
5.262LeuTyr: 5.262 ± 0.704
0.0LeuXaa: 0.0 ± 0.0
Met
2.392MetAla: 2.392 ± 1.239
0.718MetCys: 0.718 ± 0.409
0.478MetAsp: 0.478 ± 0.447
0.598MetGlu: 0.598 ± 0.299
0.837MetPhe: 0.837 ± 0.273
0.957MetGly: 0.957 ± 0.441
0.718MetHis: 0.718 ± 0.227
0.478MetIle: 0.478 ± 0.308
0.598MetLys: 0.598 ± 0.41
2.272MetLeu: 2.272 ± 0.526
0.359MetMet: 0.359 ± 0.346
0.837MetAsn: 0.837 ± 0.273
0.598MetPro: 0.598 ± 0.299
0.718MetGln: 0.718 ± 0.227
0.598MetArg: 0.598 ± 0.308
1.196MetSer: 1.196 ± 0.594
0.957MetThr: 0.957 ± 0.325
1.913MetVal: 1.913 ± 0.807
0.12MetTrp: 0.12 ± 0.062
0.837MetTyr: 0.837 ± 0.525
0.0MetXaa: 0.0 ± 0.0
Asn
4.186AsnAla: 4.186 ± 0.769
1.435AsnCys: 1.435 ± 0.385
1.555AsnAsp: 1.555 ± 0.462
2.153AsnGlu: 2.153 ± 0.913
2.153AsnPhe: 2.153 ± 0.673
4.305AsnGly: 4.305 ± 1.061
1.196AsnHis: 1.196 ± 0.274
2.631AsnIle: 2.631 ± 0.883
2.87AsnLys: 2.87 ± 0.65
5.86AsnLeu: 5.86 ± 0.972
1.315AsnMet: 1.315 ± 0.358
3.588AsnAsn: 3.588 ± 1.458
2.272AsnPro: 2.272 ± 1.396
2.99AsnGln: 2.99 ± 0.739
2.751AsnArg: 2.751 ± 0.346
3.109AsnSer: 3.109 ± 1.21
4.066AsnThr: 4.066 ± 1.508
4.425AsnVal: 4.425 ± 1.651
0.12AsnTrp: 0.12 ± 0.062
2.392AsnTyr: 2.392 ± 0.97
0.0AsnXaa: 0.0 ± 0.0
Pro
2.392ProAla: 2.392 ± 1.167
0.718ProCys: 0.718 ± 0.227
2.511ProAsp: 2.511 ± 0.465
3.348ProGlu: 3.348 ± 1.661
1.794ProPhe: 1.794 ± 1.275
3.468ProGly: 3.468 ± 0.545
1.315ProHis: 1.315 ± 0.531
3.946ProIle: 3.946 ± 0.868
2.87ProLys: 2.87 ± 1.206
3.588ProLeu: 3.588 ± 1.209
0.837ProMet: 0.837 ± 0.948
2.99ProAsn: 2.99 ± 0.57
3.348ProPro: 3.348 ± 0.749
2.392ProGln: 2.392 ± 0.629
2.153ProArg: 2.153 ± 1.86
3.588ProSer: 3.588 ± 1.844
4.305ProThr: 4.305 ± 0.818
2.99ProVal: 2.99 ± 1.112
0.359ProTrp: 0.359 ± 0.164
1.435ProTyr: 1.435 ± 0.553
0.0ProXaa: 0.0 ± 0.0
Gln
3.588GlnAla: 3.588 ± 0.485
0.478GlnCys: 0.478 ± 0.247
1.913GlnAsp: 1.913 ± 0.743
2.153GlnGlu: 2.153 ± 0.749
1.435GlnPhe: 1.435 ± 0.798
1.435GlnGly: 1.435 ± 0.827
1.435GlnHis: 1.435 ± 0.367
2.153GlnIle: 2.153 ± 0.438
1.435GlnLys: 1.435 ± 0.436
5.979GlnLeu: 5.979 ± 0.949
0.837GlnMet: 0.837 ± 0.273
2.153GlnAsn: 2.153 ± 0.522
2.631GlnPro: 2.631 ± 0.689
2.631GlnGln: 2.631 ± 0.403
1.555GlnArg: 1.555 ± 0.647
3.707GlnSer: 3.707 ± 0.864
3.348GlnThr: 3.348 ± 1.346
2.392GlnVal: 2.392 ± 1.246
0.598GlnTrp: 0.598 ± 0.189
2.153GlnTyr: 2.153 ± 0.509
0.0GlnXaa: 0.0 ± 0.0
Arg
2.87ArgAla: 2.87 ± 0.608
1.435ArgCys: 1.435 ± 0.479
1.794ArgAsp: 1.794 ± 0.505
1.674ArgGlu: 1.674 ± 0.455
2.272ArgPhe: 2.272 ± 0.578
1.794ArgGly: 1.794 ± 1.358
1.315ArgHis: 1.315 ± 0.587
1.794ArgIle: 1.794 ± 0.369
1.435ArgLys: 1.435 ± 0.812
4.544ArgLeu: 4.544 ± 2.047
0.239ArgMet: 0.239 ± 0.123
2.272ArgAsn: 2.272 ± 0.986
1.435ArgPro: 1.435 ± 1.059
2.153ArgGln: 2.153 ± 0.502
1.315ArgArg: 1.315 ± 1.046
2.272ArgSer: 2.272 ± 1.234
2.631ArgThr: 2.631 ± 0.503
2.751ArgVal: 2.751 ± 1.283
0.239ArgTrp: 0.239 ± 0.123
1.794ArgTyr: 1.794 ± 0.627
0.0ArgXaa: 0.0 ± 0.0
Ser
5.621SerAla: 5.621 ± 1.273
0.718SerCys: 0.718 ± 0.481
4.066SerAsp: 4.066 ± 0.721
1.555SerGlu: 1.555 ± 0.661
2.751SerPhe: 2.751 ± 0.864
3.468SerGly: 3.468 ± 0.557
0.718SerHis: 0.718 ± 0.327
2.99SerIle: 2.99 ± 1.699
2.033SerLys: 2.033 ± 0.937
6.458SerLeu: 6.458 ± 1.24
1.076SerMet: 1.076 ± 0.257
2.511SerAsn: 2.511 ± 0.821
2.99SerPro: 2.99 ± 0.727
2.87SerGln: 2.87 ± 1.288
2.153SerArg: 2.153 ± 1.047
3.946SerSer: 3.946 ± 1.272
5.142SerThr: 5.142 ± 1.331
5.501SerVal: 5.501 ± 0.905
1.076SerTrp: 1.076 ± 0.721
3.468SerTyr: 3.468 ± 0.821
0.0SerXaa: 0.0 ± 0.0
Thr
5.74ThrAla: 5.74 ± 1.457
1.794ThrCys: 1.794 ± 0.589
5.262ThrAsp: 5.262 ± 1.409
2.631ThrGlu: 2.631 ± 0.613
3.348ThrPhe: 3.348 ± 1.028
3.946ThrGly: 3.946 ± 2.377
2.631ThrHis: 2.631 ± 0.462
5.621ThrIle: 5.621 ± 0.68
3.468ThrLys: 3.468 ± 0.694
6.697ThrLeu: 6.697 ± 0.942
1.555ThrMet: 1.555 ± 0.372
3.946ThrAsn: 3.946 ± 1.165
5.023ThrPro: 5.023 ± 1.507
2.392ThrGln: 2.392 ± 0.518
3.109ThrArg: 3.109 ± 1.038
4.664ThrSer: 4.664 ± 1.448
6.936ThrThr: 6.936 ± 1.266
6.936ThrVal: 6.936 ± 1.75
0.718ThrTrp: 0.718 ± 0.53
3.468ThrTyr: 3.468 ± 0.825
0.0ThrXaa: 0.0 ± 0.0
Val
5.621ValAla: 5.621 ± 1.683
2.272ValCys: 2.272 ± 0.816
4.903ValAsp: 4.903 ± 0.648
4.425ValGlu: 4.425 ± 0.602
2.511ValPhe: 2.511 ± 0.65
4.305ValGly: 4.305 ± 1.086
1.794ValHis: 1.794 ± 0.589
4.784ValIle: 4.784 ± 1.562
5.023ValLys: 5.023 ± 1.347
6.697ValLeu: 6.697 ± 0.765
0.718ValMet: 0.718 ± 0.48
4.066ValAsn: 4.066 ± 0.733
2.99ValPro: 2.99 ± 1.503
2.99ValGln: 2.99 ± 0.687
2.87ValArg: 2.87 ± 1.041
4.186ValSer: 4.186 ± 0.877
6.817ValThr: 6.817 ± 0.833
10.165ValVal: 10.165 ± 1.42
0.598ValTrp: 0.598 ± 0.339
3.229ValTyr: 3.229 ± 1.217
0.0ValXaa: 0.0 ± 0.0
Trp
0.837TrpAla: 0.837 ± 1.679
0.0TrpCys: 0.0 ± 0.0
0.837TrpAsp: 0.837 ± 0.207
0.478TrpGlu: 0.478 ± 0.281
0.837TrpPhe: 0.837 ± 0.347
0.12TrpGly: 0.12 ± 0.062
0.239TrpHis: 0.239 ± 0.123
0.478TrpIle: 0.478 ± 0.247
0.12TrpLys: 0.12 ± 0.062
1.315TrpLeu: 1.315 ± 1.113
0.12TrpMet: 0.12 ± 0.187
0.359TrpAsn: 0.359 ± 0.164
0.239TrpPro: 0.239 ± 0.184
0.12TrpGln: 0.12 ± 0.487
0.359TrpArg: 0.359 ± 0.49
0.239TrpSer: 0.239 ± 0.123
0.837TrpThr: 0.837 ± 0.422
0.478TrpVal: 0.478 ± 0.281
0.12TrpTrp: 0.12 ± 0.062
0.359TrpTyr: 0.359 ± 0.49
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.392TyrAla: 2.392 ± 0.375
1.794TyrCys: 1.794 ± 0.566
2.751TyrAsp: 2.751 ± 0.75
1.674TyrGlu: 1.674 ± 0.474
2.153TyrPhe: 2.153 ± 0.913
1.794TyrGly: 1.794 ± 0.999
1.315TyrHis: 1.315 ± 0.678
2.153TyrIle: 2.153 ± 0.568
2.631TyrLys: 2.631 ± 0.8
4.544TyrLeu: 4.544 ± 0.903
0.957TyrMet: 0.957 ± 0.439
3.348TyrAsn: 3.348 ± 1.05
2.033TyrPro: 2.033 ± 0.577
2.631TyrGln: 2.631 ± 0.572
2.033TyrArg: 2.033 ± 0.474
2.631TyrSer: 2.631 ± 0.828
4.305TyrThr: 4.305 ± 0.534
3.588TyrVal: 3.588 ± 0.912
0.359TyrTrp: 0.359 ± 0.619
1.913TyrTyr: 1.913 ± 0.56
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (8363 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski