Amino acid dipeptide frequency for Common moorhen coronavirus HKU21

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each would be present at a frequency of 0.25% (1/400), which is 2.5‰. As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
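As a minimal sketch of how such frequencies could be computed, overlapping residue pairs can be counted within each protein and normalised by the total number of pairs. The function name `dipeptide_frequencies` and the toy sequences are illustrative assumptions, not the Proteome-pI code, and whether the database counts overlapping pairs in exactly this way is also an assumption.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return overlapping dipeptide frequencies in per mille.

    Residues outside the 20 standard letters are mapped to 'X' (Xaa).
    Pairs are counted within each protein and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 1/400 of all pairs (1/441 if Xaa is included).
expected_400 = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%
expected_441 = 1000.0 / 441  # roughly 2.27 per mille

toy = ["MAAL", "AAV"]  # hypothetical sequences, for illustration only
freqs = dipeptide_frequencies(toy)
```

With the toy input there are five overlapping pairs (MA, AA, AL, AA, AV), so `freqs["AA"]` is 2/5 of all pairs, expressed in per mille.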

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
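The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and treating 2.5‰ (1/400) as the exact threshold for the red and blue marking is an assumption; the page does not state the precise rule it uses.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a dipeptide as over- or underrepresented relative to `expected`."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Three values taken from the table on this page.
examples = {"ValVal": 10.245, "LeuAla": 9.892, "TrpTrp": 0.0}
labels = {pair: classify(v) for pair, v in examples.items()}
```

For instance, ValVal at 10.245‰ corresponds to a fraction of about 0.0102, roughly four times the uniform expectation, while TrpTrp does not occur at all in this proteome.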

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.299 ± 0.452
AlaCys: 1.766 ± 0.431
AlaAsp: 3.533 ± 1.208
AlaGlu: 2.237 ± 0.585
AlaPhe: 4.122 ± 0.935
AlaGly: 3.65 ± 1.006
AlaHis: 2.355 ± 0.592
AlaIle: 4.71 ± 0.914
AlaLys: 3.768 ± 1.171
AlaLeu: 6.123 ± 1.09
AlaMet: 1.766 ± 0.572
AlaAsn: 4.122 ± 1.189
AlaPro: 2.355 ± 0.35
AlaGln: 2.12 ± 0.892
AlaArg: 2.591 ± 0.473
AlaSer: 4.593 ± 0.918
AlaThr: 3.886 ± 0.633
AlaVal: 5.535 ± 1.425
AlaTrp: 0.353 ± 0.363
AlaTyr: 2.355 ± 0.75
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.942 ± 0.319
CysCys: 0.824 ± 0.305
CysAsp: 1.766 ± 0.73
CysGlu: 0.589 ± 0.308
CysPhe: 1.766 ± 0.415
CysGly: 1.531 ± 0.804
CysHis: 0.589 ± 0.531
CysIle: 1.649 ± 0.459
CysLys: 0.824 ± 0.381
CysLeu: 1.531 ± 0.486
CysMet: 0.589 ± 0.308
CysAsn: 1.531 ± 0.485
CysPro: 1.06 ± 0.331
CysGln: 1.06 ± 0.374
CysArg: 1.06 ± 0.291
CysSer: 3.062 ± 0.79
CysThr: 1.531 ± 0.347
CysVal: 3.062 ± 0.485
CysTrp: 0.707 ± 0.369
CysTyr: 1.178 ± 0.409
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.71 ± 0.942
AspCys: 1.413 ± 0.548
AspAsp: 3.533 ± 1.128
AspGlu: 2.591 ± 0.974
AspPhe: 2.591 ± 0.532
AspGly: 4.239 ± 0.686
AspHis: 0.707 ± 0.184
AspIle: 4.122 ± 1.283
AspLys: 2.237 ± 0.83
AspLeu: 4.475 ± 1.399
AspMet: 0.942 ± 0.41
AspAsn: 3.415 ± 0.807
AspPro: 1.531 ± 0.674
AspGln: 2.12 ± 0.889
AspArg: 1.649 ± 0.501
AspSer: 3.415 ± 1.317
AspThr: 3.179 ± 0.994
AspVal: 6.006 ± 2.143
AspTrp: 0.589 ± 0.327
AspTyr: 3.768 ± 0.644
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.002 ± 0.34
GluCys: 1.178 ± 0.36
GluAsp: 2.002 ± 0.928
GluGlu: 1.884 ± 0.61
GluPhe: 1.766 ± 0.593
GluGly: 1.649 ± 0.667
GluHis: 1.295 ± 0.4
GluIle: 1.413 ± 0.422
GluLys: 1.413 ± 0.512
GluLeu: 3.65 ± 0.523
GluMet: 0.353 ± 0.219
GluAsn: 0.824 ± 0.305
GluPro: 1.884 ± 0.638
GluGln: 1.884 ± 0.453
GluArg: 0.942 ± 0.339
GluSer: 2.944 ± 2.725
GluThr: 2.237 ± 0.534
GluVal: 1.766 ± 0.614
GluTrp: 0.589 ± 0.48
GluTyr: 2.473 ± 0.92
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.062 ± 0.684
PheCys: 1.531 ± 0.607
PheAsp: 3.297 ± 0.849
PheGlu: 1.295 ± 0.33
PhePhe: 0.471 ± 0.156
PheGly: 2.355 ± 0.842
PheHis: 0.589 ± 0.33
PheIle: 3.415 ± 0.799
PheLys: 2.12 ± 0.525
PheLeu: 4.239 ± 1.613
PheMet: 0.707 ± 0.399
PheAsn: 3.886 ± 1.008
PhePro: 1.413 ± 0.593
PheGln: 1.649 ± 0.889
PheArg: 1.766 ± 0.527
PheSer: 4.004 ± 2.345
PheThr: 3.886 ± 0.681
PheVal: 3.65 ± 0.506
PheTrp: 0.353 ± 0.368
PheTyr: 3.65 ± 0.898
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.884 ± 0.5
GlyCys: 1.649 ± 1.083
GlyAsp: 2.355 ± 0.706
GlyGlu: 1.531 ± 0.825
GlyPhe: 2.355 ± 0.803
GlyGly: 3.886 ± 1.298
GlyHis: 1.531 ± 0.549
GlyIle: 3.886 ± 1.02
GlyLys: 3.179 ± 0.61
GlyLeu: 2.591 ± 1.11
GlyMet: 0.824 ± 0.564
GlyAsn: 4.239 ± 1.683
GlyPro: 1.884 ± 1.373
GlyGln: 1.178 ± 0.408
GlyArg: 1.649 ± 0.457
GlySer: 3.062 ± 0.979
GlyThr: 4.828 ± 0.581
GlyVal: 6.359 ± 1.901
GlyTrp: 0.471 ± 0.246
GlyTyr: 2.708 ± 0.48
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.237 ± 0.559
HisCys: 0.707 ± 0.366
HisAsp: 0.942 ± 0.492
HisGlu: 1.06 ± 0.331
HisPhe: 0.942 ± 0.492
HisGly: 1.178 ± 0.741
HisHis: 0.471 ± 0.156
HisIle: 2.002 ± 0.662
HisLys: 1.178 ± 0.243
HisLeu: 3.062 ± 0.721
HisMet: 0.707 ± 0.362
HisAsn: 1.531 ± 0.628
HisPro: 1.06 ± 0.389
HisGln: 0.589 ± 0.838
HisArg: 0.236 ± 0.123
HisSer: 0.942 ± 0.696
HisThr: 1.766 ± 1.073
HisVal: 2.708 ± 0.603
HisTrp: 0.236 ± 0.364
HisTyr: 1.178 ± 0.703
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.768 ± 1.018
IleCys: 1.766 ± 0.566
IleAsp: 4.357 ± 0.816
IleGlu: 1.413 ± 0.509
IlePhe: 3.179 ± 0.807
IleGly: 3.062 ± 1.215
IleHis: 1.178 ± 0.498
IleIle: 3.768 ± 1.582
IleLys: 4.004 ± 1.142
IleLeu: 5.77 ± 0.941
IleMet: 0.824 ± 0.327
IleAsn: 4.71 ± 1.107
IlePro: 4.357 ± 1.012
IleGln: 3.415 ± 0.903
IleArg: 2.591 ± 0.704
IleSer: 3.768 ± 1.398
IleThr: 5.064 ± 2.141
IleVal: 5.417 ± 0.795
IleTrp: 0.589 ± 0.33
IleTyr: 2.944 ± 1.141
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.475 ± 1.19
LysCys: 1.766 ± 0.446
LysAsp: 2.473 ± 0.531
LysGlu: 2.237 ± 0.475
LysPhe: 2.355 ± 0.75
LysGly: 2.591 ± 0.778
LysHis: 1.413 ± 0.685
LysIle: 2.591 ± 0.703
LysLys: 2.473 ± 0.685
LysLeu: 5.77 ± 1.894
LysMet: 0.707 ± 0.369
LysAsn: 1.531 ± 0.367
LysPro: 3.062 ± 1.096
LysGln: 1.766 ± 1.122
LysArg: 1.649 ± 0.831
LysSer: 3.886 ± 0.557
LysThr: 2.944 ± 1.043
LysVal: 4.475 ± 0.959
LysTrp: 0.471 ± 0.156
LysTyr: 3.533 ± 0.63
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.892 ± 1.4
LeuCys: 1.531 ± 0.607
LeuAsp: 5.77 ± 0.963
LeuGlu: 2.473 ± 0.455
LeuPhe: 4.828 ± 0.82
LeuGly: 2.591 ± 0.557
LeuHis: 1.884 ± 0.646
LeuIle: 3.65 ± 2.311
LeuLys: 3.886 ± 1.165
LeuLeu: 8.008 ± 0.738
LeuMet: 1.295 ± 0.549
LeuAsn: 5.064 ± 1.135
LeuPro: 4.593 ± 0.588
LeuGln: 4.475 ± 0.715
LeuArg: 3.533 ± 1.144
LeuSer: 5.417 ± 2.674
LeuThr: 7.89 ± 0.69
LeuVal: 6.948 ± 1.391
LeuTrp: 0.353 ± 0.363
LeuTyr: 4.357 ± 1.056
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.766 ± 1.011
MetCys: 0.471 ± 0.3
MetAsp: 0.824 ± 0.392
MetGlu: 0.471 ± 0.215
MetPhe: 1.295 ± 0.396
MetGly: 0.589 ± 0.613
MetHis: 0.353 ± 0.397
MetIle: 0.589 ± 0.308
MetLys: 0.236 ± 0.448
MetLeu: 2.237 ± 0.482
MetMet: 0.353 ± 0.738
MetAsn: 0.707 ± 0.219
MetPro: 0.942 ± 0.274
MetGln: 1.178 ± 0.626
MetArg: 0.589 ± 0.308
MetSer: 1.531 ± 0.485
MetThr: 1.295 ± 0.529
MetVal: 2.002 ± 0.506
MetTrp: 0.0 ± 0.0
MetTyr: 1.06 ± 0.309
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.533 ± 0.696
AsnCys: 1.766 ± 0.415
AsnAsp: 2.12 ± 0.687
AsnGlu: 2.237 ± 0.513
AsnPhe: 2.708 ± 1.44
AsnGly: 4.946 ± 2.157
AsnHis: 1.295 ± 0.4
AsnIle: 5.181 ± 1.712
AsnLys: 3.533 ± 0.675
AsnLeu: 4.828 ± 0.999
AsnMet: 1.06 ± 0.483
AsnAsn: 3.65 ± 1.37
AsnPro: 2.355 ± 0.807
AsnGln: 2.355 ± 0.795
AsnArg: 2.826 ± 0.837
AsnSer: 3.415 ± 1.342
AsnThr: 3.297 ± 1.411
AsnVal: 5.064 ± 0.944
AsnTrp: 0.236 ± 0.362
AsnTyr: 3.062 ± 0.507
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.473 ± 0.894
ProCys: 0.942 ± 0.295
ProAsp: 2.591 ± 0.454
ProGlu: 1.649 ± 0.422
ProPhe: 1.884 ± 0.732
ProGly: 2.708 ± 1.052
ProHis: 1.295 ± 0.485
ProIle: 3.297 ± 0.341
ProLys: 2.944 ± 1.483
ProLeu: 2.944 ± 0.695
ProMet: 0.589 ± 0.326
ProAsn: 2.944 ± 0.546
ProPro: 2.473 ± 0.601
ProGln: 1.649 ± 0.667
ProArg: 1.531 ± 0.558
ProSer: 2.237 ± 0.552
ProThr: 3.886 ± 0.869
ProVal: 3.533 ± 0.707
ProTrp: 0.236 ± 0.176
ProTyr: 1.766 ± 0.815
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.708 ± 0.631
GlnCys: 0.942 ± 0.492
GlnAsp: 1.884 ± 0.452
GlnGlu: 2.12 ± 0.76
GlnPhe: 0.942 ± 0.635
GlnGly: 2.002 ± 1.111
GlnHis: 1.178 ± 0.615
GlnIle: 0.942 ± 0.845
GlnLys: 1.649 ± 0.587
GlnLeu: 3.533 ± 0.526
GlnMet: 1.178 ± 0.431
GlnAsn: 2.944 ± 1.193
GlnPro: 1.884 ± 0.968
GlnGln: 1.649 ± 0.318
GlnArg: 1.649 ± 1.106
GlnSer: 4.004 ± 0.628
GlnThr: 3.297 ± 0.528
GlnVal: 2.944 ± 0.504
GlnTrp: 0.236 ± 0.123
GlnTyr: 1.649 ± 0.609
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.884 ± 0.899
ArgCys: 1.531 ± 0.503
ArgAsp: 1.06 ± 0.554
ArgGlu: 0.707 ± 0.409
ArgPhe: 2.002 ± 0.857
ArgGly: 1.649 ± 1.072
ArgHis: 1.413 ± 0.548
ArgIle: 2.944 ± 0.737
ArgLys: 1.884 ± 1.071
ArgLeu: 3.415 ± 0.875
ArgMet: 0.353 ± 0.155
ArgAsn: 2.708 ± 0.678
ArgPro: 0.824 ± 0.293
ArgGln: 1.295 ± 0.822
ArgArg: 1.06 ± 0.613
ArgSer: 1.649 ± 1.129
ArgThr: 2.826 ± 0.731
ArgVal: 2.355 ± 0.534
ArgTrp: 0.353 ± 0.185
ArgTyr: 1.649 ± 0.46
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.65 ± 1.381
SerCys: 0.824 ± 0.663
SerAsp: 4.828 ± 0.874
SerGlu: 2.355 ± 1.981
SerPhe: 3.886 ± 1.682
SerGly: 3.415 ± 0.827
SerHis: 1.295 ± 0.713
SerIle: 4.239 ± 3.432
SerLys: 3.415 ± 0.578
SerLeu: 6.006 ± 1.094
SerMet: 1.413 ± 0.378
SerAsn: 3.886 ± 2.432
SerPro: 2.708 ± 0.729
SerGln: 2.002 ± 0.558
SerArg: 2.12 ± 0.88
SerSer: 4.71 ± 1.383
SerThr: 5.417 ± 0.939
SerVal: 5.181 ± 0.682
SerTrp: 0.589 ± 0.47
SerTyr: 4.122 ± 0.724
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.357 ± 1.12
ThrCys: 1.531 ± 0.681
ThrAsp: 3.768 ± 0.764
ThrGlu: 1.531 ± 0.619
ThrPhe: 3.65 ± 0.782
ThrGly: 4.239 ± 0.51
ThrHis: 2.473 ± 0.863
ThrIle: 7.419 ± 0.449
ThrLys: 3.533 ± 1.174
ThrLeu: 6.241 ± 0.902
ThrMet: 1.649 ± 0.641
ThrAsn: 3.768 ± 2.248
ThrPro: 3.65 ± 0.712
ThrGln: 3.062 ± 0.727
ThrArg: 1.884 ± 0.628
ThrSer: 5.417 ± 0.705
ThrThr: 4.828 ± 0.946
ThrVal: 6.241 ± 0.809
ThrTrp: 0.942 ± 0.673
ThrTyr: 3.65 ± 1.101
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.652 ± 1.513
ValCys: 2.473 ± 0.92
ValAsp: 6.241 ± 0.791
ValGlu: 3.415 ± 0.869
ValPhe: 4.004 ± 0.722
ValGly: 3.65 ± 1.104
ValHis: 1.649 ± 0.667
ValIle: 5.652 ± 0.938
ValLys: 5.535 ± 1.36
ValLeu: 8.95 ± 1.233
ValMet: 1.178 ± 0.784
ValAsn: 4.357 ± 1.165
ValPro: 3.297 ± 0.824
ValGln: 3.297 ± 0.385
ValArg: 1.884 ± 0.487
ValSer: 4.71 ± 0.896
ValThr: 7.537 ± 1.567
ValVal: 10.245 ± 3.706
ValTrp: 0.471 ± 0.344
ValTyr: 4.71 ± 0.72
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.471 ± 0.352
TrpCys: 0.236 ± 0.384
TrpAsp: 0.824 ± 0.457
TrpGlu: 0.353 ± 0.368
TrpPhe: 0.471 ± 0.215
TrpGly: 0.236 ± 0.176
TrpHis: 0.353 ± 0.348
TrpIle: 0.471 ± 0.246
TrpLys: 0.353 ± 0.348
TrpLeu: 0.942 ± 0.494
TrpMet: 0.118 ± 0.205
TrpAsn: 0.118 ± 0.062
TrpPro: 0.353 ± 0.155
TrpGln: 0.236 ± 0.364
TrpArg: 0.236 ± 0.123
TrpSer: 0.589 ± 0.745
TrpThr: 0.589 ± 0.228
TrpVal: 0.824 ± 0.169
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.236 ± 0.176
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.473 ± 0.802
TyrCys: 2.002 ± 0.494
TyrAsp: 3.062 ± 1.17
TyrGlu: 2.002 ± 0.46
TyrPhe: 2.12 ± 0.38
TyrGly: 1.884 ± 0.766
TyrHis: 1.413 ± 0.577
TyrIle: 4.004 ± 0.971
TyrLys: 4.004 ± 1.019
TyrLeu: 4.122 ± 0.921
TyrMet: 1.766 ± 0.967
TyrAsn: 3.65 ± 1.236
TyrPro: 1.884 ± 0.448
TyrGln: 2.355 ± 0.879
TyrArg: 2.12 ± 0.457
TyrSer: 2.708 ± 0.65
TyrThr: 3.65 ± 0.705
TyrVal: 4.593 ± 0.609
TyrTrp: 0.236 ± 0.36
TyrTyr: 2.591 ± 0.722
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (8493 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
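One way a protein-level bootstrap might look is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. The function names, the fixed seed, the toy sequences, and the use of the standard deviation as the reported error are all assumptions; the page does not describe the procedure in more detail.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate draws len(proteins) proteins with replacement, so the
    resampling unit is the protein, not the individual residue pair.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy = ["MAAL", "AAVAA", "MKV"]  # hypothetical sequences, for illustration only
mean_aa, sd_aa = bootstrap_error(toy, "AA")
```

With only 9 proteins in this proteome, a single long protein can dominate a replicate, which may explain why some entries in the table have errors comparable to the values themselves.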

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski