Amino acid dipepetide frequency for Night heron coronavirus HKU19

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.255AlaAla: 4.255 ± 1.519
1.773AlaCys: 1.773 ± 0.431
3.546AlaAsp: 3.546 ± 1.249
2.955AlaGlu: 2.955 ± 0.828
2.837AlaPhe: 2.837 ± 0.432
2.719AlaGly: 2.719 ± 0.586
1.537AlaHis: 1.537 ± 0.469
5.319AlaIle: 5.319 ± 1.439
4.019AlaLys: 4.019 ± 1.254
6.738AlaLeu: 6.738 ± 1.269
1.182AlaMet: 1.182 ± 0.391
3.073AlaAsn: 3.073 ± 0.975
2.364AlaPro: 2.364 ± 1.593
2.128AlaGln: 2.128 ± 0.833
2.719AlaArg: 2.719 ± 0.835
4.846AlaSer: 4.846 ± 0.977
5.083AlaThr: 5.083 ± 1.278
4.61AlaVal: 4.61 ± 0.654
0.118AlaTrp: 0.118 ± 0.059
3.073AlaTyr: 3.073 ± 0.637
0.0AlaXaa: 0.0 ± 0.0
Cys
1.418CysAla: 1.418 ± 0.759
1.3CysCys: 1.3 ± 0.7
2.009CysAsp: 2.009 ± 0.521
0.591CysGlu: 0.591 ± 0.294
1.891CysPhe: 1.891 ± 0.424
1.418CysGly: 1.418 ± 0.566
0.355CysHis: 0.355 ± 0.177
1.418CysIle: 1.418 ± 1.077
0.827CysLys: 0.827 ± 0.309
2.128CysLeu: 2.128 ± 1.386
0.473CysMet: 0.473 ± 0.235
1.3CysAsn: 1.3 ± 0.319
0.709CysPro: 0.709 ± 0.48
0.591CysGln: 0.591 ± 0.172
0.827CysArg: 0.827 ± 0.436
2.246CysSer: 2.246 ± 0.545
2.246CysThr: 2.246 ± 0.717
2.009CysVal: 2.009 ± 0.702
0.236CysTrp: 0.236 ± 0.118
1.655CysTyr: 1.655 ± 0.433
0.0CysXaa: 0.0 ± 0.0
Asp
3.428AspAla: 3.428 ± 0.496
1.3AspCys: 1.3 ± 0.381
2.837AspAsp: 2.837 ± 0.77
2.128AspGlu: 2.128 ± 0.753
3.073AspPhe: 3.073 ± 0.789
3.546AspGly: 3.546 ± 0.558
0.827AspHis: 0.827 ± 1.219
3.901AspIle: 3.901 ± 0.855
2.364AspLys: 2.364 ± 0.429
4.374AspLeu: 4.374 ± 1.118
0.827AspMet: 0.827 ± 0.463
4.61AspAsn: 4.61 ± 1.05
1.773AspPro: 1.773 ± 1.482
1.418AspGln: 1.418 ± 0.343
1.773AspArg: 1.773 ± 1.377
3.783AspSer: 3.783 ± 0.829
3.664AspThr: 3.664 ± 0.897
4.728AspVal: 4.728 ± 1.549
0.473AspTrp: 0.473 ± 0.435
3.664AspTyr: 3.664 ± 1.109
0.0AspXaa: 0.0 ± 0.0
Glu
2.482GluAla: 2.482 ± 0.714
1.182GluCys: 1.182 ± 0.344
2.6GluAsp: 2.6 ± 0.921
2.482GluGlu: 2.482 ± 0.892
1.537GluPhe: 1.537 ± 0.438
1.655GluGly: 1.655 ± 0.498
0.946GluHis: 0.946 ± 0.471
1.655GluIle: 1.655 ± 0.502
2.009GluLys: 2.009 ± 0.684
2.955GluLeu: 2.955 ± 0.46
0.591GluMet: 0.591 ± 0.294
2.837GluAsn: 2.837 ± 0.877
1.773GluPro: 1.773 ± 0.321
2.719GluGln: 2.719 ± 0.338
0.709GluArg: 0.709 ± 0.348
0.946GluSer: 0.946 ± 0.268
3.783GluThr: 3.783 ± 0.43
1.3GluVal: 1.3 ± 0.319
0.591GluTrp: 0.591 ± 0.693
2.6GluTyr: 2.6 ± 0.763
0.0GluXaa: 0.0 ± 0.0
Phe
2.128PheAla: 2.128 ± 0.413
1.418PheCys: 1.418 ± 0.419
1.891PheAsp: 1.891 ± 0.457
1.3PheGlu: 1.3 ± 0.381
1.3PhePhe: 1.3 ± 0.457
2.009PheGly: 2.009 ± 0.785
0.946PheHis: 0.946 ± 0.473
2.482PheIle: 2.482 ± 1.079
3.073PheLys: 3.073 ± 0.491
3.191PheLeu: 3.191 ± 0.636
0.827PheMet: 0.827 ± 0.419
3.546PheAsn: 3.546 ± 0.694
1.182PhePro: 1.182 ± 0.477
1.3PheGln: 1.3 ± 0.709
1.182PheArg: 1.182 ± 0.589
2.482PheSer: 2.482 ± 0.467
4.61PheThr: 4.61 ± 0.958
4.374PheVal: 4.374 ± 1.192
0.355PheTrp: 0.355 ± 0.177
3.31PheTyr: 3.31 ± 0.705
0.0PheXaa: 0.0 ± 0.0
Gly
2.128GlyAla: 2.128 ± 0.285
2.128GlyCys: 2.128 ± 1.166
3.546GlyAsp: 3.546 ± 0.939
1.891GlyGlu: 1.891 ± 0.476
2.246GlyPhe: 2.246 ± 0.988
3.428GlyGly: 3.428 ± 0.764
1.182GlyHis: 1.182 ± 0.27
3.664GlyIle: 3.664 ± 0.941
3.073GlyLys: 3.073 ± 0.767
4.137GlyLeu: 4.137 ± 0.652
0.946GlyMet: 0.946 ± 0.348
2.837GlyAsn: 2.837 ± 0.788
1.3GlyPro: 1.3 ± 1.035
1.064GlyGln: 1.064 ± 0.711
1.418GlyArg: 1.418 ± 0.897
4.492GlySer: 4.492 ± 1.48
4.846GlyThr: 4.846 ± 0.535
3.31GlyVal: 3.31 ± 0.761
0.709GlyTrp: 0.709 ± 0.345
2.009GlyTyr: 2.009 ± 0.476
0.0GlyXaa: 0.0 ± 0.0
His
1.537HisAla: 1.537 ± 0.587
0.236HisCys: 0.236 ± 0.118
1.064HisAsp: 1.064 ± 0.27
0.709HisGlu: 0.709 ± 0.657
1.064HisPhe: 1.064 ± 0.488
0.946HisGly: 0.946 ± 0.331
0.591HisHis: 0.591 ± 1.231
1.418HisIle: 1.418 ± 0.572
1.773HisLys: 1.773 ± 0.618
3.073HisLeu: 3.073 ± 0.763
0.473HisMet: 0.473 ± 0.235
1.182HisAsn: 1.182 ± 0.344
0.591HisPro: 0.591 ± 0.344
0.591HisGln: 0.591 ± 0.465
0.355HisArg: 0.355 ± 0.177
1.3HisSer: 1.3 ± 0.317
2.246HisThr: 2.246 ± 0.873
2.009HisVal: 2.009 ± 0.686
0.355HisTrp: 0.355 ± 0.409
1.182HisTyr: 1.182 ± 0.421
0.0HisXaa: 0.0 ± 0.0
Ile
3.664IleAla: 3.664 ± 1.223
1.537IleCys: 1.537 ± 0.469
2.719IleAsp: 2.719 ± 0.515
2.128IleGlu: 2.128 ± 1.308
1.891IlePhe: 1.891 ± 0.702
2.719IleGly: 2.719 ± 0.804
0.709IleHis: 0.709 ± 0.398
4.019IleIle: 4.019 ± 1.288
4.728IleLys: 4.728 ± 1.294
5.556IleLeu: 5.556 ± 2.359
1.064IleMet: 1.064 ± 0.363
5.201IleAsn: 5.201 ± 1.143
3.546IlePro: 3.546 ± 0.973
2.6IleGln: 2.6 ± 1.175
1.773IleArg: 1.773 ± 0.466
4.019IleSer: 4.019 ± 0.948
4.728IleThr: 4.728 ± 1.816
5.91IleVal: 5.91 ± 1.977
1.064IleTrp: 1.064 ± 1.22
3.31IleTyr: 3.31 ± 1.053
0.0IleXaa: 0.0 ± 0.0
Lys
4.019LysAla: 4.019 ± 0.717
1.537LysCys: 1.537 ± 0.405
2.6LysAsp: 2.6 ± 0.484
1.773LysGlu: 1.773 ± 1.36
2.6LysPhe: 2.6 ± 0.978
1.891LysGly: 1.891 ± 0.535
1.891LysHis: 1.891 ± 0.718
3.073LysIle: 3.073 ± 1.604
3.191LysLys: 3.191 ± 1.716
5.437LysLeu: 5.437 ± 2.327
0.473LysMet: 0.473 ± 0.235
3.073LysAsn: 3.073 ± 0.882
5.201LysPro: 5.201 ± 1.618
2.955LysGln: 2.955 ± 0.515
1.773LysArg: 1.773 ± 1.382
4.492LysSer: 4.492 ± 1.112
4.965LysThr: 4.965 ± 0.57
4.374LysVal: 4.374 ± 0.873
0.236LysTrp: 0.236 ± 0.16
3.191LysTyr: 3.191 ± 0.944
0.0LysXaa: 0.0 ± 0.0
Leu
8.038LeuAla: 8.038 ± 1.144
1.418LeuCys: 1.418 ± 0.531
4.255LeuAsp: 4.255 ± 1.65
2.364LeuGlu: 2.364 ± 0.689
4.374LeuPhe: 4.374 ± 1.165
2.837LeuGly: 2.837 ± 0.945
2.009LeuHis: 2.009 ± 0.528
3.428LeuIle: 3.428 ± 1.789
4.374LeuLys: 4.374 ± 0.946
9.22LeuLeu: 9.22 ± 2.613
2.009LeuMet: 2.009 ± 0.704
4.846LeuAsn: 4.846 ± 0.912
4.374LeuPro: 4.374 ± 0.84
5.201LeuGln: 5.201 ± 1.186
2.837LeuArg: 2.837 ± 0.914
6.619LeuSer: 6.619 ± 1.656
8.983LeuThr: 8.983 ± 2.246
6.501LeuVal: 6.501 ± 2.965
1.064LeuTrp: 1.064 ± 1.111
5.083LeuTyr: 5.083 ± 1.274
0.0LeuXaa: 0.0 ± 0.0
Met
1.537MetAla: 1.537 ± 0.324
0.827MetCys: 0.827 ± 0.863
1.418MetAsp: 1.418 ± 0.536
0.946MetGlu: 0.946 ± 0.473
1.3MetPhe: 1.3 ± 0.647
0.946MetGly: 0.946 ± 0.762
0.473MetHis: 0.473 ± 0.235
0.709MetIle: 0.709 ± 0.283
0.473MetLys: 0.473 ± 0.235
2.719MetLeu: 2.719 ± 0.689
0.118MetMet: 0.118 ± 0.059
0.591MetAsn: 0.591 ± 0.172
1.064MetPro: 1.064 ± 0.363
0.827MetGln: 0.827 ± 0.259
0.709MetArg: 0.709 ± 0.3
2.009MetSer: 2.009 ± 0.686
1.655MetThr: 1.655 ± 0.54
1.655MetVal: 1.655 ± 0.517
0.118MetTrp: 0.118 ± 0.514
1.064MetTyr: 1.064 ± 0.334
0.0MetXaa: 0.0 ± 0.0
Asn
4.374AsnAla: 4.374 ± 0.766
1.537AsnCys: 1.537 ± 0.398
2.246AsnAsp: 2.246 ± 0.574
1.891AsnGlu: 1.891 ± 0.424
2.837AsnPhe: 2.837 ± 0.847
5.556AsnGly: 5.556 ± 0.828
1.064AsnHis: 1.064 ± 0.363
4.492AsnIle: 4.492 ± 1.764
4.137AsnLys: 4.137 ± 0.738
5.437AsnLeu: 5.437 ± 0.769
2.009AsnMet: 2.009 ± 0.592
4.61AsnAsn: 4.61 ± 0.766
2.364AsnPro: 2.364 ± 0.516
3.31AsnGln: 3.31 ± 1.356
1.773AsnArg: 1.773 ± 1.091
2.364AsnSer: 2.364 ± 1.161
3.546AsnThr: 3.546 ± 1.084
4.965AsnVal: 4.965 ± 0.947
0.827AsnTrp: 0.827 ± 0.402
2.837AsnTyr: 2.837 ± 0.648
0.0AsnXaa: 0.0 ± 0.0
Pro
1.418ProAla: 1.418 ± 1.193
1.3ProCys: 1.3 ± 0.435
1.418ProAsp: 1.418 ± 0.3
1.891ProGlu: 1.891 ± 0.893
1.418ProPhe: 1.418 ± 0.508
3.428ProGly: 3.428 ± 1.373
1.418ProHis: 1.418 ± 0.438
2.955ProIle: 2.955 ± 0.511
2.719ProLys: 2.719 ± 2.5
3.191ProLeu: 3.191 ± 0.649
1.064ProMet: 1.064 ± 0.356
2.482ProAsn: 2.482 ± 1.072
2.128ProPro: 2.128 ± 0.659
2.246ProGln: 2.246 ± 0.548
1.537ProArg: 1.537 ± 1.384
3.191ProSer: 3.191 ± 2.214
2.955ProThr: 2.955 ± 0.727
4.137ProVal: 4.137 ± 1.072
0.355ProTrp: 0.355 ± 0.352
1.537ProTyr: 1.537 ± 0.726
0.0ProXaa: 0.0 ± 0.0
Gln
3.31GlnAla: 3.31 ± 1.822
0.827GlnCys: 0.827 ± 0.339
1.537GlnAsp: 1.537 ± 0.666
1.891GlnGlu: 1.891 ± 0.619
1.182GlnPhe: 1.182 ± 0.421
2.128GlnGly: 2.128 ± 0.59
1.418GlnHis: 1.418 ± 0.604
1.655GlnIle: 1.655 ± 0.479
2.364GlnLys: 2.364 ± 1.27
4.728GlnLeu: 4.728 ± 1.171
0.473GlnMet: 0.473 ± 0.235
2.009GlnAsn: 2.009 ± 0.852
3.073GlnPro: 3.073 ± 1.181
2.128GlnGln: 2.128 ± 0.988
1.418GlnArg: 1.418 ± 1.447
2.364GlnSer: 2.364 ± 0.807
3.073GlnThr: 3.073 ± 1.259
2.955GlnVal: 2.955 ± 0.859
0.473GlnTrp: 0.473 ± 0.747
2.6GlnTyr: 2.6 ± 0.772
0.0GlnXaa: 0.0 ± 0.0
Arg
1.655ArgAla: 1.655 ± 1.035
0.946ArgCys: 0.946 ± 0.31
1.655ArgAsp: 1.655 ± 0.6
0.827ArgGlu: 0.827 ± 0.259
1.182ArgPhe: 1.182 ± 0.37
1.418ArgGly: 1.418 ± 0.697
1.182ArgHis: 1.182 ± 0.751
1.3ArgIle: 1.3 ± 0.708
1.064ArgLys: 1.064 ± 0.356
2.719ArgLeu: 2.719 ± 1.258
0.118ArgMet: 0.118 ± 0.059
2.009ArgAsn: 2.009 ± 0.412
1.064ArgPro: 1.064 ± 0.314
1.3ArgGln: 1.3 ± 0.381
0.827ArgArg: 0.827 ± 0.436
2.246ArgSer: 2.246 ± 1.587
2.482ArgThr: 2.482 ± 1.81
1.891ArgVal: 1.891 ± 1.193
0.236ArgTrp: 0.236 ± 0.118
1.891ArgTyr: 1.891 ± 0.429
0.0ArgXaa: 0.0 ± 0.0
Ser
4.255SerAla: 4.255 ± 1.331
0.709SerCys: 0.709 ± 0.353
4.61SerAsp: 4.61 ± 0.992
3.191SerGlu: 3.191 ± 1.05
2.719SerPhe: 2.719 ± 1.002
3.664SerGly: 3.664 ± 0.606
1.182SerHis: 1.182 ± 0.289
4.492SerIle: 4.492 ± 0.828
4.492SerLys: 4.492 ± 0.521
4.492SerLeu: 4.492 ± 0.964
2.482SerMet: 2.482 ± 0.523
3.664SerAsn: 3.664 ± 1.58
1.418SerPro: 1.418 ± 0.415
2.482SerGln: 2.482 ± 0.628
0.946SerArg: 0.946 ± 0.512
4.137SerSer: 4.137 ± 1.073
6.974SerThr: 6.974 ± 1.686
5.201SerVal: 5.201 ± 1.365
0.709SerTrp: 0.709 ± 0.417
3.783SerTyr: 3.783 ± 0.52
0.0SerXaa: 0.0 ± 0.0
Thr
4.846ThrAla: 4.846 ± 1.602
2.009ThrCys: 2.009 ± 1.029
4.492ThrAsp: 4.492 ± 0.468
2.6ThrGlu: 2.6 ± 0.581
3.428ThrPhe: 3.428 ± 0.919
4.61ThrGly: 4.61 ± 1.245
2.009ThrHis: 2.009 ± 0.482
6.738ThrIle: 6.738 ± 1.376
4.374ThrLys: 4.374 ± 1.474
7.447ThrLeu: 7.447 ± 1.079
2.364ThrMet: 2.364 ± 0.421
5.437ThrAsn: 5.437 ± 0.958
4.374ThrPro: 4.374 ± 2.007
2.719ThrGln: 2.719 ± 0.505
1.773ThrArg: 1.773 ± 0.749
6.028ThrSer: 6.028 ± 1.768
6.265ThrThr: 6.265 ± 0.795
7.092ThrVal: 7.092 ± 1.589
0.827ThrTrp: 0.827 ± 0.363
4.728ThrTyr: 4.728 ± 0.759
0.0ThrXaa: 0.0 ± 0.0
Val
6.147ValAla: 6.147 ± 0.972
1.537ValCys: 1.537 ± 0.579
5.556ValAsp: 5.556 ± 0.79
3.546ValGlu: 3.546 ± 1.405
3.428ValPhe: 3.428 ± 0.692
3.428ValGly: 3.428 ± 0.724
1.064ValHis: 1.064 ± 0.363
5.91ValIle: 5.91 ± 1.466
5.556ValLys: 5.556 ± 0.816
6.501ValLeu: 6.501 ± 1.446
1.891ValMet: 1.891 ± 0.963
4.019ValAsn: 4.019 ± 0.748
2.482ValPro: 2.482 ± 0.862
3.546ValGln: 3.546 ± 0.72
2.009ValArg: 2.009 ± 0.78
4.61ValSer: 4.61 ± 1.125
6.501ValThr: 6.501 ± 1.249
7.801ValVal: 7.801 ± 1.718
0.591ValTrp: 0.591 ± 0.382
4.019ValTyr: 4.019 ± 1.129
0.0ValXaa: 0.0 ± 0.0
Trp
0.827TrpAla: 0.827 ± 0.536
0.236TrpCys: 0.236 ± 0.16
0.591TrpAsp: 0.591 ± 0.294
0.236TrpGlu: 0.236 ± 0.118
0.591TrpPhe: 0.591 ± 0.658
0.236TrpGly: 0.236 ± 0.118
0.355TrpHis: 0.355 ± 0.498
0.591TrpIle: 0.591 ± 0.172
0.473TrpLys: 0.473 ± 0.666
1.891TrpLeu: 1.891 ± 1.994
0.118TrpMet: 0.118 ± 0.187
0.355TrpAsn: 0.355 ± 0.177
0.355TrpPro: 0.355 ± 0.452
0.355TrpGln: 0.355 ± 0.61
0.473TrpArg: 0.473 ± 0.235
0.473TrpSer: 0.473 ± 0.369
0.473TrpThr: 0.473 ± 0.381
0.591TrpVal: 0.591 ± 0.562
0.118TrpTrp: 0.118 ± 0.059
0.591TrpTyr: 0.591 ± 0.359
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.428TyrAla: 3.428 ± 0.506
1.891TyrCys: 1.891 ± 0.424
4.019TyrAsp: 4.019 ± 1.083
2.009TyrGlu: 2.009 ± 0.673
2.009TyrPhe: 2.009 ± 0.577
1.773TyrGly: 1.773 ± 0.409
1.537TyrHis: 1.537 ± 0.496
3.664TyrIle: 3.664 ± 1.095
3.664TyrLys: 3.664 ± 1.121
3.901TyrLeu: 3.901 ± 0.435
1.537TyrMet: 1.537 ± 0.458
4.492TyrAsn: 4.492 ± 0.735
1.655TyrPro: 1.655 ± 0.517
2.128TyrGln: 2.128 ± 0.491
1.182TyrArg: 1.182 ± 0.418
2.955TyrSer: 2.955 ± 0.937
5.083TyrThr: 5.083 ± 0.792
4.728TyrVal: 4.728 ± 1.254
0.473TyrTrp: 0.473 ± 0.509
2.482TyrTyr: 2.482 ± 0.774
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (8461 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski