Amino acid dipepetide frequency for Thrush coronavirus HKU12 (isolate Grey-backed thrush/Hong Kong/HKU12-600/2007) (ThCoV-HKU12) (Thrush coronavirus HKU12-600)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.063AlaAla: 5.063 ± 1.426
2.002AlaCys: 2.002 ± 0.453
3.768AlaAsp: 3.768 ± 1.163
3.179AlaGlu: 3.179 ± 0.935
3.532AlaPhe: 3.532 ± 1.104
3.886AlaGly: 3.886 ± 0.916
2.002AlaHis: 2.002 ± 0.854
5.298AlaIle: 5.298 ± 1.256
4.121AlaLys: 4.121 ± 1.497
7.653AlaLeu: 7.653 ± 0.713
1.766AlaMet: 1.766 ± 0.487
4.003AlaAsn: 4.003 ± 0.802
3.297AlaPro: 3.297 ± 1.336
2.59AlaGln: 2.59 ± 0.773
3.179AlaArg: 3.179 ± 0.877
4.828AlaSer: 4.828 ± 0.638
4.357AlaThr: 4.357 ± 1.084
4.357AlaVal: 4.357 ± 1.922
0.471AlaTrp: 0.471 ± 0.528
3.297AlaTyr: 3.297 ± 0.971
0.0AlaXaa: 0.0 ± 0.0
Cys
1.413CysAla: 1.413 ± 0.556
0.942CysCys: 0.942 ± 0.403
1.884CysAsp: 1.884 ± 0.665
0.824CysGlu: 0.824 ± 0.282
2.237CysPhe: 2.237 ± 0.522
1.531CysGly: 1.531 ± 0.742
0.235CysHis: 0.235 ± 0.123
1.766CysIle: 1.766 ± 0.897
1.06CysLys: 1.06 ± 0.454
1.884CysLeu: 1.884 ± 0.5
0.471CysMet: 0.471 ± 0.421
1.413CysAsn: 1.413 ± 0.723
1.531CysPro: 1.531 ± 0.518
1.06CysGln: 1.06 ± 0.688
0.824CysArg: 0.824 ± 0.43
1.766CysSer: 1.766 ± 0.614
1.648CysThr: 1.648 ± 0.424
2.826CysVal: 2.826 ± 0.65
0.589CysTrp: 0.589 ± 0.202
1.413CysTyr: 1.413 ± 0.379
0.0CysXaa: 0.0 ± 0.0
Asp
4.003AspAla: 4.003 ± 0.593
1.531AspCys: 1.531 ± 0.42
3.415AspAsp: 3.415 ± 1.63
2.355AspGlu: 2.355 ± 0.686
2.473AspPhe: 2.473 ± 0.87
3.768AspGly: 3.768 ± 1.021
1.06AspHis: 1.06 ± 0.418
3.179AspIle: 3.179 ± 0.783
2.708AspLys: 2.708 ± 0.74
3.415AspLeu: 3.415 ± 0.969
0.824AspMet: 0.824 ± 0.43
3.179AspAsn: 3.179 ± 0.741
2.002AspPro: 2.002 ± 0.765
1.295AspGln: 1.295 ± 0.358
2.002AspArg: 2.002 ± 0.55
3.532AspSer: 3.532 ± 0.688
3.65AspThr: 3.65 ± 1.227
5.063AspVal: 5.063 ± 1.457
0.589AspTrp: 0.589 ± 0.346
3.179AspTyr: 3.179 ± 0.899
0.0AspXaa: 0.0 ± 0.0
Glu
2.119GluAla: 2.119 ± 0.772
1.531GluCys: 1.531 ± 0.682
2.355GluAsp: 2.355 ± 0.782
2.355GluGlu: 2.355 ± 1.648
2.002GluPhe: 2.002 ± 0.83
1.884GluGly: 1.884 ± 0.617
1.06GluHis: 1.06 ± 0.567
1.295GluIle: 1.295 ± 0.818
1.766GluLys: 1.766 ± 0.919
3.886GluLeu: 3.886 ± 0.855
0.942GluMet: 0.942 ± 0.411
1.531GluAsn: 1.531 ± 0.749
2.119GluPro: 2.119 ± 0.821
2.826GluGln: 2.826 ± 0.559
1.177GluArg: 1.177 ± 0.405
2.355GluSer: 2.355 ± 0.863
2.237GluThr: 2.237 ± 0.496
2.826GluVal: 2.826 ± 0.761
0.942GluTrp: 0.942 ± 0.543
2.237GluTyr: 2.237 ± 0.894
0.0GluXaa: 0.0 ± 0.0
Phe
2.59PheAla: 2.59 ± 1.285
1.06PheCys: 1.06 ± 0.386
2.708PheAsp: 2.708 ± 0.954
1.531PheGlu: 1.531 ± 0.619
0.824PhePhe: 0.824 ± 0.566
2.826PheGly: 2.826 ± 1.168
0.471PheHis: 0.471 ± 0.528
3.297PheIle: 3.297 ± 1.534
2.473PheLys: 2.473 ± 0.749
3.532PheLeu: 3.532 ± 0.559
0.589PheMet: 0.589 ± 0.202
2.944PheAsn: 2.944 ± 0.817
1.06PhePro: 1.06 ± 0.402
2.237PheGln: 2.237 ± 0.894
1.295PheArg: 1.295 ± 0.685
3.415PheSer: 3.415 ± 1.475
3.532PheThr: 3.532 ± 0.589
3.297PheVal: 3.297 ± 0.637
0.235PheTrp: 0.235 ± 0.343
3.061PheTyr: 3.061 ± 0.977
0.0PheXaa: 0.0 ± 0.0
Gly
3.179GlyAla: 3.179 ± 0.891
1.295GlyCys: 1.295 ± 0.725
2.708GlyAsp: 2.708 ± 0.977
1.295GlyGlu: 1.295 ± 0.58
2.355GlyPhe: 2.355 ± 0.602
3.297GlyGly: 3.297 ± 0.617
1.295GlyHis: 1.295 ± 0.674
3.65GlyIle: 3.65 ± 0.973
3.061GlyLys: 3.061 ± 1.067
3.179GlyLeu: 3.179 ± 0.643
0.589GlyMet: 0.589 ± 0.458
3.297GlyAsn: 3.297 ± 1.98
1.648GlyPro: 1.648 ± 1.118
1.648GlyGln: 1.648 ± 0.65
1.766GlyArg: 1.766 ± 0.475
4.003GlySer: 4.003 ± 1.351
4.592GlyThr: 4.592 ± 0.854
6.123GlyVal: 6.123 ± 1.601
0.471GlyTrp: 0.471 ± 0.18
1.884GlyTyr: 1.884 ± 0.721
0.0GlyXaa: 0.0 ± 0.0
His
1.884HisAla: 1.884 ± 0.844
0.824HisCys: 0.824 ± 0.382
1.06HisAsp: 1.06 ± 0.377
0.706HisGlu: 0.706 ± 0.365
1.177HisPhe: 1.177 ± 0.442
1.177HisGly: 1.177 ± 1.408
0.471HisHis: 0.471 ± 0.246
2.119HisIle: 2.119 ± 0.914
1.177HisLys: 1.177 ± 0.615
3.179HisLeu: 3.179 ± 0.665
0.706HisMet: 0.706 ± 0.514
0.942HisAsn: 0.942 ± 0.492
1.177HisPro: 1.177 ± 0.407
0.706HisGln: 0.706 ± 0.493
0.824HisArg: 0.824 ± 0.28
1.531HisSer: 1.531 ± 1.459
1.884HisThr: 1.884 ± 0.72
2.237HisVal: 2.237 ± 0.937
0.235HisTrp: 0.235 ± 0.396
1.06HisTyr: 1.06 ± 0.926
0.0HisXaa: 0.0 ± 0.0
Ile
4.357IleAla: 4.357 ± 0.995
1.648IleCys: 1.648 ± 0.577
3.061IleAsp: 3.061 ± 1.287
2.473IleGlu: 2.473 ± 0.593
2.002IlePhe: 2.002 ± 0.428
3.532IleGly: 3.532 ± 1.209
1.177IleHis: 1.177 ± 1.822
4.357IleIle: 4.357 ± 3.529
2.826IleLys: 2.826 ± 1.255
6.594IleLeu: 6.594 ± 2.096
0.824IleMet: 0.824 ± 0.861
3.297IleAsn: 3.297 ± 1.16
4.003IlePro: 4.003 ± 0.623
2.708IleGln: 2.708 ± 1.337
2.002IleArg: 2.002 ± 0.635
4.003IleSer: 4.003 ± 1.059
4.357IleThr: 4.357 ± 1.996
5.063IleVal: 5.063 ± 1.161
0.706IleTrp: 0.706 ± 1.354
3.179IleTyr: 3.179 ± 0.77
0.0IleXaa: 0.0 ± 0.0
Lys
4.592LysAla: 4.592 ± 1.595
1.884LysCys: 1.884 ± 0.456
2.59LysAsp: 2.59 ± 0.802
2.237LysGlu: 2.237 ± 0.596
2.355LysPhe: 2.355 ± 0.719
2.119LysGly: 2.119 ± 0.609
1.884LysHis: 1.884 ± 0.442
2.944LysIle: 2.944 ± 0.672
2.237LysLys: 2.237 ± 1.634
5.181LysLeu: 5.181 ± 1.895
0.706LysMet: 0.706 ± 0.237
1.295LysAsn: 1.295 ± 0.358
3.768LysPro: 3.768 ± 1.968
1.531LysGln: 1.531 ± 0.637
1.884LysArg: 1.884 ± 0.904
2.826LysSer: 2.826 ± 0.437
4.71LysThr: 4.71 ± 2.183
2.826LysVal: 2.826 ± 1.113
0.353LysTrp: 0.353 ± 0.424
2.355LysTyr: 2.355 ± 0.776
0.0LysXaa: 0.0 ± 0.0
Leu
9.89LeuAla: 9.89 ± 1.678
2.119LeuCys: 2.119 ± 0.712
3.886LeuAsp: 3.886 ± 0.93
2.708LeuGlu: 2.708 ± 1.021
5.181LeuPhe: 5.181 ± 1.028
4.121LeuGly: 4.121 ± 0.998
2.708LeuHis: 2.708 ± 1.034
3.532LeuIle: 3.532 ± 2.764
5.298LeuLys: 5.298 ± 0.696
9.42LeuLeu: 9.42 ± 4.057
1.177LeuMet: 1.177 ± 0.578
4.828LeuAsn: 4.828 ± 1.828
5.181LeuPro: 5.181 ± 1.192
5.534LeuGln: 5.534 ± 1.329
4.003LeuArg: 4.003 ± 0.533
4.71LeuSer: 4.71 ± 0.736
7.536LeuThr: 7.536 ± 1.191
6.711LeuVal: 6.711 ± 2.341
0.589LeuTrp: 0.589 ± 0.517
4.239LeuTyr: 4.239 ± 0.845
0.0LeuXaa: 0.0 ± 0.0
Met
1.766MetAla: 1.766 ± 0.784
0.589MetCys: 0.589 ± 0.507
0.706MetAsp: 0.706 ± 0.358
0.589MetGlu: 0.589 ± 0.307
0.706MetPhe: 0.706 ± 0.31
0.942MetGly: 0.942 ± 0.801
0.471MetHis: 0.471 ± 0.36
0.589MetIle: 0.589 ± 0.517
0.353MetLys: 0.353 ± 0.184
2.119MetLeu: 2.119 ± 0.974
0.235MetMet: 0.235 ± 0.123
0.942MetAsn: 0.942 ± 0.333
0.706MetPro: 0.706 ± 0.369
0.824MetGln: 0.824 ± 0.66
0.471MetArg: 0.471 ± 0.246
1.531MetSer: 1.531 ± 0.565
1.06MetThr: 1.06 ± 0.553
1.766MetVal: 1.766 ± 0.436
0.235MetTrp: 0.235 ± 0.123
0.471MetTyr: 0.471 ± 0.18
0.0MetXaa: 0.0 ± 0.0
Asn
4.828AsnAla: 4.828 ± 1.037
1.531AsnCys: 1.531 ± 0.503
1.648AsnAsp: 1.648 ± 0.51
1.884AsnGlu: 1.884 ± 0.794
1.884AsnPhe: 1.884 ± 0.72
4.592AsnGly: 4.592 ± 1.451
1.295AsnHis: 1.295 ± 0.358
3.886AsnIle: 3.886 ± 1.393
3.061AsnLys: 3.061 ± 0.7
4.592AsnLeu: 4.592 ± 1.515
1.06AsnMet: 1.06 ± 0.553
3.65AsnAsn: 3.65 ± 0.703
2.355AsnPro: 2.355 ± 0.957
3.415AsnGln: 3.415 ± 1.6
1.884AsnArg: 1.884 ± 0.654
2.826AsnSer: 2.826 ± 1.152
3.179AsnThr: 3.179 ± 0.671
4.474AsnVal: 4.474 ± 1.489
0.353AsnTrp: 0.353 ± 0.9
2.59AsnTyr: 2.59 ± 0.741
0.0AsnXaa: 0.0 ± 0.0
Pro
2.826ProAla: 2.826 ± 1.111
0.824ProCys: 0.824 ± 0.308
2.708ProAsp: 2.708 ± 0.732
2.944ProGlu: 2.944 ± 0.875
1.531ProPhe: 1.531 ± 0.312
2.826ProGly: 2.826 ± 1.183
1.295ProHis: 1.295 ± 0.401
3.061ProIle: 3.061 ± 1.346
2.002ProLys: 2.002 ± 0.743
4.592ProLeu: 4.592 ± 1.261
0.471ProMet: 0.471 ± 0.528
3.179ProAsn: 3.179 ± 0.695
2.237ProPro: 2.237 ± 1.106
1.766ProGln: 1.766 ± 0.638
2.355ProArg: 2.355 ± 1.483
2.59ProSer: 2.59 ± 1.116
4.357ProThr: 4.357 ± 0.898
3.415ProVal: 3.415 ± 0.479
0.235ProTrp: 0.235 ± 0.197
2.002ProTyr: 2.002 ± 0.87
0.0ProXaa: 0.0 ± 0.0
Gln
3.65GlnAla: 3.65 ± 0.795
0.942GlnCys: 0.942 ± 0.414
1.295GlnAsp: 1.295 ± 0.431
2.355GlnGlu: 2.355 ± 1.537
0.942GlnPhe: 0.942 ± 0.814
2.237GlnGly: 2.237 ± 1.353
1.766GlnHis: 1.766 ± 0.502
2.59GlnIle: 2.59 ± 1.364
1.766GlnLys: 1.766 ± 0.867
5.063GlnLeu: 5.063 ± 1.586
0.942GlnMet: 0.942 ± 0.492
2.237GlnAsn: 2.237 ± 0.424
2.708GlnPro: 2.708 ± 0.6
2.237GlnGln: 2.237 ± 0.995
1.884GlnArg: 1.884 ± 1.076
4.121GlnSer: 4.121 ± 0.986
3.65GlnThr: 3.65 ± 1.881
2.944GlnVal: 2.944 ± 1.718
0.353GlnTrp: 0.353 ± 0.178
2.002GlnTyr: 2.002 ± 1.292
0.0GlnXaa: 0.0 ± 0.0
Arg
2.473ArgAla: 2.473 ± 0.615
1.884ArgCys: 1.884 ± 0.836
1.766ArgAsp: 1.766 ± 0.414
1.06ArgGlu: 1.06 ± 0.377
1.413ArgPhe: 1.413 ± 0.376
1.413ArgGly: 1.413 ± 1.025
1.177ArgHis: 1.177 ± 0.615
1.884ArgIle: 1.884 ± 0.822
2.002ArgLys: 2.002 ± 0.586
4.357ArgLeu: 4.357 ± 2.166
0.353ArgMet: 0.353 ± 0.184
2.473ArgAsn: 2.473 ± 0.717
1.295ArgPro: 1.295 ± 0.724
1.413ArgGln: 1.413 ± 0.556
1.531ArgArg: 1.531 ± 1.183
1.531ArgSer: 1.531 ± 0.599
2.59ArgThr: 2.59 ± 0.808
3.179ArgVal: 3.179 ± 0.46
0.589ArgTrp: 0.589 ± 0.37
1.648ArgTyr: 1.648 ± 0.903
0.0ArgXaa: 0.0 ± 0.0
Ser
4.828SerAla: 4.828 ± 1.725
0.824SerCys: 0.824 ± 0.528
3.532SerAsp: 3.532 ± 1.256
2.708SerGlu: 2.708 ± 1.025
3.061SerPhe: 3.061 ± 1.232
3.768SerGly: 3.768 ± 0.5
1.06SerHis: 1.06 ± 1.105
4.71SerIle: 4.71 ± 2.036
2.708SerLys: 2.708 ± 0.577
4.945SerLeu: 4.945 ± 1.413
1.648SerMet: 1.648 ± 0.709
2.944SerAsn: 2.944 ± 0.943
2.826SerPro: 2.826 ± 0.87
2.826SerGln: 2.826 ± 1.075
2.355SerArg: 2.355 ± 0.719
4.239SerSer: 4.239 ± 1.192
5.416SerThr: 5.416 ± 1.389
4.71SerVal: 4.71 ± 1.578
0.824SerTrp: 0.824 ± 0.501
3.532SerTyr: 3.532 ± 1.222
0.0SerXaa: 0.0 ± 0.0
Thr
4.474ThrAla: 4.474 ± 1.503
1.413ThrCys: 1.413 ± 0.565
4.71ThrAsp: 4.71 ± 0.516
2.355ThrGlu: 2.355 ± 0.733
3.768ThrPhe: 3.768 ± 1.371
2.708ThrGly: 2.708 ± 0.657
2.355ThrHis: 2.355 ± 1.011
5.063ThrIle: 5.063 ± 0.834
3.415ThrLys: 3.415 ± 0.796
7.065ThrLeu: 7.065 ± 0.739
1.884ThrMet: 1.884 ± 0.535
3.886ThrAsn: 3.886 ± 1.018
4.357ThrPro: 4.357 ± 1.422
3.179ThrGln: 3.179 ± 1.246
1.884ThrArg: 1.884 ± 1.041
4.828ThrSer: 4.828 ± 1.499
5.416ThrThr: 5.416 ± 1.133
6.711ThrVal: 6.711 ± 1.085
0.471ThrTrp: 0.471 ± 0.311
4.239ThrTyr: 4.239 ± 1.192
0.0ThrXaa: 0.0 ± 0.0
Val
4.945ValAla: 4.945 ± 1.26
2.59ValCys: 2.59 ± 0.761
6.005ValAsp: 6.005 ± 1.962
3.297ValGlu: 3.297 ± 0.694
3.768ValPhe: 3.768 ± 1.125
3.179ValGly: 3.179 ± 0.975
1.413ValHis: 1.413 ± 0.556
5.652ValIle: 5.652 ± 1.841
5.298ValLys: 5.298 ± 1.228
6.829ValLeu: 6.829 ± 0.991
0.471ValMet: 0.471 ± 0.309
5.181ValAsn: 5.181 ± 0.905
2.59ValPro: 2.59 ± 1.099
4.828ValGln: 4.828 ± 1.145
2.119ValArg: 2.119 ± 0.906
5.769ValSer: 5.769 ± 1.2
5.534ValThr: 5.534 ± 0.897
10.479ValVal: 10.479 ± 2.653
0.353ValTrp: 0.353 ± 0.178
4.003ValTyr: 4.003 ± 1.175
0.0ValXaa: 0.0 ± 0.0
Trp
0.706TrpAla: 0.706 ± 0.465
0.118TrpCys: 0.118 ± 0.061
0.824TrpAsp: 0.824 ± 0.393
0.589TrpGlu: 0.589 ± 0.358
0.353TrpPhe: 0.353 ± 0.346
0.118TrpGly: 0.118 ± 0.061
0.471TrpHis: 0.471 ± 0.324
0.471TrpIle: 0.471 ± 0.246
0.235TrpLys: 0.235 ± 0.396
1.531TrpLeu: 1.531 ± 1.296
0.118TrpMet: 0.118 ± 0.216
0.235TrpAsn: 0.235 ± 0.197
0.235TrpPro: 0.235 ± 0.461
0.353TrpGln: 0.353 ± 0.852
0.235TrpArg: 0.235 ± 0.568
0.589TrpSer: 0.589 ± 1.411
0.471TrpThr: 0.471 ± 0.246
1.06TrpVal: 1.06 ± 0.546
0.353TrpTrp: 0.353 ± 0.424
0.353TrpTyr: 0.353 ± 0.545
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.061TyrAla: 3.061 ± 0.81
1.766TyrCys: 1.766 ± 0.605
2.59TyrAsp: 2.59 ± 0.684
2.002TyrGlu: 2.002 ± 0.71
1.413TyrPhe: 1.413 ± 0.38
1.413TyrGly: 1.413 ± 0.968
1.413TyrHis: 1.413 ± 0.475
2.944TyrIle: 2.944 ± 0.756
2.708TyrLys: 2.708 ± 0.625
4.357TyrLeu: 4.357 ± 0.79
1.177TyrMet: 1.177 ± 0.603
3.532TyrAsn: 3.532 ± 1.307
2.119TyrPro: 2.119 ± 0.581
2.708TyrGln: 2.708 ± 1.012
2.355TyrArg: 2.355 ± 1.046
2.473TyrSer: 2.473 ± 0.955
4.003TyrThr: 4.003 ± 1.07
4.239TyrVal: 4.239 ± 1.07
0.471TyrTrp: 0.471 ± 0.637
3.061TyrTyr: 3.061 ± 0.912
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (8494 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski