Amino acid dipeptide frequency for Bat coronavirus HKU9-3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each of the 400 would occur with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
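The calculation described above can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function names, the one-letter input format, and the choice to normalize by the total number of counted dipeptides are all assumptions, and the database may normalize differently.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}

# Uniform expectation for 400 equally likely dipeptides: 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000 / 400


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Dipeptides are overlapping residue pairs counted within each protein,
    never across protein boundaries. Any letter outside the 20 standard
    amino acids is mapped to Xaa.
    """
    counts = Counter()
    for seq in sequences:
        codes = [THREE_LETTER.get(res, "Xaa") for res in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the number of dipeptides counted.
    return {pair: 1000 * n / total for pair, n in counts.items()}


def representation(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Toy input for illustration only, not the HKU9-3 proteome.
toy = ["MALLV", "AAX"]
freqs = dipeptide_frequencies(toy)
```

With the table values, `representation(6.085)` labels AlaAla as overrepresented and `representation(0.944)` labels AlaHis as underrepresented relative to 2.5‰.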

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
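As a small worked example of the unit conversion, the sketch below converts a per mille value to a fraction and to a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10


ala_ala = 6.085  # AlaAla frequency from the table, in per mille
ala_ala_fraction = per_mille_to_fraction(ala_ala)  # about 0.006085
ala_ala_percent = per_mille_to_percent(ala_ala)    # about 0.6085 %
```

By the same conversion, the uniform expectation of 0.25% corresponds to 2.5‰.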

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.085 ± 0.739
AlaCys: 2.938 ± 0.639
AlaAsp: 4.512 ± 0.502
AlaGlu: 1.889 ± 0.864
AlaPhe: 3.882 ± 0.608
AlaGly: 4.407 ± 0.859
AlaHis: 0.944 ± 0.471
AlaIle: 4.407 ± 1.247
AlaLys: 3.357 ± 0.751
AlaLeu: 7.449 ± 1.078
AlaMet: 2.938 ± 0.641
AlaAsn: 3.882 ± 0.604
AlaPro: 3.043 ± 1.169
AlaGln: 3.148 ± 0.603
AlaArg: 3.777 ± 0.568
AlaSer: 4.302 ± 1.219
AlaThr: 5.456 ± 1.044
AlaVal: 6.61 ± 0.804
AlaTrp: 0.734 ± 0.458
AlaTyr: 3.777 ± 1.049
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.203 ± 0.612
CysCys: 0.944 ± 0.325
CysAsp: 2.413 ± 0.344
CysGlu: 0.734 ± 0.367
CysPhe: 1.259 ± 0.324
CysGly: 2.413 ± 0.566
CysHis: 0.734 ± 0.435
CysIle: 1.049 ± 0.501
CysLys: 1.574 ± 0.432
CysLeu: 2.098 ± 0.67
CysMet: 1.049 ± 0.31
CysAsn: 1.049 ± 0.433
CysPro: 0.839 ± 0.278
CysGln: 0.839 ± 0.214
CysArg: 1.154 ± 0.456
CysSer: 1.784 ± 0.658
CysThr: 2.518 ± 0.713
CysVal: 2.938 ± 0.556
CysTrp: 0.42 ± 0.21
CysTyr: 2.623 ± 0.734
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.987 ± 0.465
AspCys: 0.839 ± 0.395
AspAsp: 2.413 ± 1.0
AspGlu: 2.728 ± 0.295
AspPhe: 2.518 ± 0.581
AspGly: 4.302 ± 0.524
AspHis: 0.63 ± 0.329
AspIle: 3.357 ± 0.546
AspLys: 1.889 ± 0.598
AspLeu: 4.512 ± 0.89
AspMet: 1.049 ± 0.394
AspAsn: 1.993 ± 0.66
AspPro: 2.518 ± 1.049
AspGln: 1.469 ± 0.539
AspArg: 2.098 ± 0.599
AspSer: 2.518 ± 0.612
AspThr: 4.721 ± 0.741
AspVal: 4.931 ± 1.621
AspTrp: 0.839 ± 0.358
AspTyr: 3.462 ± 1.1
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.413 ± 0.592
GluCys: 1.049 ± 0.374
GluAsp: 1.993 ± 0.619
GluGlu: 2.203 ± 1.048
GluPhe: 1.259 ± 0.358
GluGly: 2.938 ± 1.079
GluHis: 0.944 ± 0.279
GluIle: 1.259 ± 0.235
GluLys: 1.154 ± 0.429
GluLeu: 4.721 ± 1.227
GluMet: 0.42 ± 0.301
GluAsn: 1.889 ± 0.9
GluPro: 1.574 ± 0.439
GluGln: 1.679 ± 0.398
GluArg: 1.784 ± 0.442
GluSer: 3.357 ± 0.959
GluThr: 1.889 ± 0.534
GluVal: 3.567 ± 0.45
GluTrp: 0.105 ± 0.264
GluTyr: 0.944 ± 0.325
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.098 ± 1.472
PheCys: 1.469 ± 0.44
PheAsp: 2.833 ± 0.765
PheGlu: 1.679 ± 0.643
PhePhe: 0.734 ± 0.256
PheGly: 2.623 ± 0.434
PheHis: 0.839 ± 0.328
PheIle: 2.098 ± 1.121
PheLys: 2.413 ± 0.539
PheLeu: 2.938 ± 0.581
PheMet: 1.364 ± 0.206
PheAsn: 3.148 ± 0.677
PhePro: 0.839 ± 0.81
PheGln: 1.364 ± 0.218
PheArg: 1.469 ± 0.604
PheSer: 2.728 ± 0.693
PheThr: 3.043 ± 0.519
PheVal: 4.617 ± 0.738
PheTrp: 0.42 ± 0.21
PheTyr: 2.728 ± 0.544
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.931 ± 1.398
GlyCys: 1.574 ± 0.502
GlyAsp: 3.253 ± 1.019
GlyGlu: 1.679 ± 0.811
GlyPhe: 3.777 ± 0.742
GlyGly: 4.197 ± 0.804
GlyHis: 0.839 ± 0.419
GlyIle: 1.889 ± 0.575
GlyLys: 2.203 ± 0.648
GlyLeu: 4.931 ± 1.196
GlyMet: 0.839 ± 0.395
GlyAsn: 3.357 ± 1.007
GlyPro: 2.413 ± 0.85
GlyGln: 1.364 ± 0.36
GlyArg: 2.938 ± 1.575
GlySer: 4.617 ± 0.808
GlyThr: 5.561 ± 0.792
GlyVal: 8.708 ± 0.559
GlyTrp: 1.154 ± 0.405
GlyTyr: 2.728 ± 0.419
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.364 ± 0.417
HisCys: 0.21 ± 0.105
HisAsp: 0.525 ± 0.262
HisGlu: 0.315 ± 0.157
HisPhe: 1.154 ± 0.345
HisGly: 1.154 ± 0.554
HisHis: 0.42 ± 0.21
HisIle: 1.469 ± 0.582
HisLys: 0.839 ± 0.676
HisLeu: 1.889 ± 0.878
HisMet: 0.42 ± 0.204
HisAsn: 0.839 ± 0.278
HisPro: 0.839 ± 0.367
HisGln: 0.315 ± 0.157
HisArg: 0.63 ± 0.376
HisSer: 0.839 ± 0.278
HisThr: 1.469 ± 0.44
HisVal: 2.098 ± 0.607
HisTrp: 0.21 ± 0.324
HisTyr: 1.364 ± 0.617
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.253 ± 1.718
IleCys: 1.259 ± 0.948
IleAsp: 1.889 ± 0.651
IleGlu: 0.944 ± 0.471
IlePhe: 1.679 ± 0.277
IleGly: 2.833 ± 0.508
IleHis: 0.315 ± 0.207
IleIle: 1.259 ± 1.169
IleLys: 2.098 ± 0.518
IleLeu: 4.931 ± 2.301
IleMet: 1.049 ± 0.438
IleAsn: 2.518 ± 0.978
IlePro: 1.889 ± 0.478
IleGln: 0.944 ± 0.797
IleArg: 2.098 ± 0.398
IleSer: 3.148 ± 0.85
IleThr: 2.098 ± 0.469
IleVal: 4.512 ± 1.053
IleTrp: 0.525 ± 0.156
IleTyr: 1.679 ± 0.404
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.462 ± 0.512
LysCys: 1.154 ± 0.345
LysAsp: 1.889 ± 0.919
LysGlu: 1.889 ± 0.769
LysPhe: 1.889 ± 0.655
LysGly: 2.938 ± 0.835
LysHis: 1.574 ± 0.508
LysIle: 1.154 ± 0.341
LysLys: 1.574 ± 1.399
LysLeu: 5.351 ± 0.911
LysMet: 1.364 ± 0.36
LysAsn: 1.364 ± 0.402
LysPro: 3.777 ± 0.778
LysGln: 2.203 ± 0.493
LysArg: 2.728 ± 0.72
LysSer: 1.784 ± 0.739
LysThr: 1.889 ± 0.311
LysVal: 3.882 ± 0.389
LysTrp: 0.734 ± 0.256
LysTyr: 2.623 ± 0.669
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.394 ± 0.752
LeuCys: 3.462 ± 0.533
LeuAsp: 4.407 ± 1.346
LeuGlu: 4.407 ± 0.873
LeuPhe: 2.938 ± 0.659
LeuGly: 4.407 ± 0.824
LeuHis: 2.938 ± 0.723
LeuIle: 3.672 ± 0.854
LeuLys: 4.512 ± 1.182
LeuLeu: 10.597 ± 3.085
LeuMet: 2.308 ± 0.398
LeuAsn: 4.407 ± 0.791
LeuPro: 5.141 ± 1.354
LeuGln: 4.512 ± 0.577
LeuArg: 4.721 ± 0.904
LeuSer: 7.24 ± 0.817
LeuThr: 5.036 ± 0.737
LeuVal: 9.338 ± 1.453
LeuTrp: 1.574 ± 0.992
LeuTyr: 4.512 ± 1.022
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.098 ± 0.595
MetCys: 1.259 ± 0.367
MetAsp: 0.734 ± 0.288
MetGlu: 0.944 ± 0.372
MetPhe: 0.944 ± 0.766
MetGly: 1.154 ± 0.278
MetHis: 0.315 ± 0.157
MetIle: 0.63 ± 0.194
MetLys: 0.315 ± 0.157
MetLeu: 3.043 ± 0.766
MetMet: 0.315 ± 0.157
MetAsn: 1.469 ± 0.45
MetPro: 1.574 ± 0.665
MetGln: 1.574 ± 0.439
MetArg: 1.469 ± 0.405
MetSer: 1.889 ± 0.424
MetThr: 1.364 ± 0.394
MetVal: 1.993 ± 0.613
MetTrp: 0.21 ± 0.324
MetTyr: 1.364 ± 0.398
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.721 ± 0.505
AsnCys: 1.574 ± 0.293
AsnAsp: 1.364 ± 0.49
AsnGlu: 1.574 ± 0.502
AsnPhe: 1.784 ± 0.685
AsnGly: 3.672 ± 0.861
AsnHis: 0.839 ± 0.527
AsnIle: 2.308 ± 0.419
AsnLys: 2.938 ± 0.841
AsnLeu: 4.826 ± 0.884
AsnMet: 1.364 ± 0.408
AsnAsn: 2.623 ± 0.971
AsnPro: 2.098 ± 0.804
AsnGln: 0.839 ± 0.578
AsnArg: 2.413 ± 0.419
AsnSer: 3.567 ± 1.219
AsnThr: 2.938 ± 0.568
AsnVal: 5.141 ± 0.909
AsnTrp: 0.734 ± 0.233
AsnTyr: 2.623 ± 0.572
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.302 ± 0.699
ProCys: 0.734 ± 0.237
ProAsp: 3.043 ± 0.518
ProGlu: 1.679 ± 0.537
ProPhe: 1.679 ± 0.267
ProGly: 3.357 ± 0.448
ProHis: 0.944 ± 0.343
ProIle: 2.833 ± 0.562
ProLys: 3.148 ± 1.586
ProLeu: 4.826 ± 0.761
ProMet: 1.154 ± 0.429
ProAsn: 1.889 ± 0.679
ProPro: 1.784 ± 0.712
ProGln: 1.154 ± 0.443
ProArg: 1.889 ± 0.973
ProSer: 2.623 ± 1.158
ProThr: 3.043 ± 0.543
ProVal: 3.777 ± 0.293
ProTrp: 0.734 ± 0.194
ProTyr: 1.993 ± 0.404
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.728 ± 0.469
GlnCys: 0.525 ± 0.268
GlnAsp: 1.784 ± 0.421
GlnGlu: 1.259 ± 0.446
GlnPhe: 1.889 ± 0.716
GlnGly: 1.679 ± 0.831
GlnHis: 0.63 ± 0.191
GlnIle: 0.63 ± 0.314
GlnLys: 1.259 ± 0.525
GlnLeu: 4.302 ± 1.213
GlnMet: 0.63 ± 0.345
GlnAsn: 1.364 ± 0.308
GlnPro: 2.098 ± 1.013
GlnGln: 1.574 ± 0.439
GlnArg: 1.574 ± 0.572
GlnSer: 1.574 ± 0.619
GlnThr: 2.623 ± 0.813
GlnVal: 2.413 ± 0.55
GlnTrp: 1.259 ± 0.326
GlnTyr: 1.154 ± 0.281
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.567 ± 0.717
ArgCys: 2.098 ± 0.585
ArgAsp: 1.679 ± 0.462
ArgGlu: 1.889 ± 0.598
ArgPhe: 1.364 ± 0.423
ArgGly: 2.938 ± 1.755
ArgHis: 0.944 ± 0.436
ArgIle: 1.469 ± 0.56
ArgLys: 1.364 ± 0.681
ArgLeu: 3.987 ± 0.71
ArgMet: 1.364 ± 0.539
ArgAsn: 2.938 ± 2.186
ArgPro: 1.784 ± 0.835
ArgGln: 1.679 ± 0.4
ArgArg: 2.518 ± 0.877
ArgSer: 3.043 ± 0.832
ArgThr: 2.938 ± 0.594
ArgVal: 3.567 ± 0.779
ArgTrp: 0.525 ± 0.379
ArgTyr: 2.728 ± 0.402
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.036 ± 0.722
SerCys: 2.728 ± 0.791
SerAsp: 4.931 ± 0.638
SerGlu: 3.462 ± 0.648
SerPhe: 2.728 ± 0.988
SerGly: 3.567 ± 1.0
SerHis: 1.259 ± 0.289
SerIle: 3.043 ± 1.146
SerLys: 3.357 ± 0.76
SerLeu: 6.295 ± 0.851
SerMet: 1.784 ± 0.37
SerAsn: 2.623 ± 0.59
SerPro: 2.938 ± 0.724
SerGln: 1.679 ± 0.479
SerArg: 3.253 ± 2.172
SerSer: 4.721 ± 0.988
SerThr: 3.882 ± 0.678
SerVal: 6.82 ± 0.801
SerTrp: 0.839 ± 0.198
SerTyr: 2.623 ± 0.773
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.302 ± 0.471
ThrCys: 1.889 ± 0.539
ThrAsp: 3.043 ± 0.397
ThrGlu: 1.574 ± 0.246
ThrPhe: 3.357 ± 0.552
ThrGly: 4.826 ± 0.68
ThrHis: 0.944 ± 0.527
ThrIle: 2.623 ± 0.284
ThrLys: 3.672 ± 1.091
ThrLeu: 5.456 ± 1.022
ThrMet: 1.889 ± 0.466
ThrAsn: 2.518 ± 0.652
ThrPro: 4.197 ± 0.492
ThrGln: 2.203 ± 1.003
ThrArg: 2.203 ± 0.449
ThrSer: 5.141 ± 0.973
ThrThr: 3.987 ± 1.095
ThrVal: 7.554 ± 0.972
ThrTrp: 0.42 ± 0.21
ThrTyr: 3.253 ± 0.863
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.135 ± 0.688
ValCys: 3.148 ± 0.659
ValAsp: 5.561 ± 0.599
ValGlu: 4.092 ± 1.328
ValPhe: 3.357 ± 0.997
ValGly: 5.246 ± 0.401
ValHis: 1.469 ± 0.73
ValIle: 3.567 ± 1.16
ValLys: 4.617 ± 0.715
ValLeu: 10.387 ± 1.864
ValMet: 2.413 ± 0.689
ValAsn: 5.876 ± 1.173
ValPro: 4.092 ± 0.578
ValGln: 3.253 ± 0.442
ValArg: 3.672 ± 0.569
ValSer: 8.813 ± 1.745
ValThr: 6.295 ± 1.092
ValVal: 11.017 ± 1.443
ValTrp: 1.049 ± 0.296
ValTyr: 4.721 ± 0.873
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.63 ± 0.211
TrpCys: 0.315 ± 0.129
TrpAsp: 1.154 ± 0.576
TrpGlu: 0.315 ± 0.207
TrpPhe: 0.734 ± 0.367
TrpGly: 0.525 ± 0.641
TrpHis: 0.21 ± 0.144
TrpIle: 0.315 ± 0.157
TrpLys: 0.42 ± 0.21
TrpLeu: 2.308 ± 0.588
TrpMet: 0.105 ± 0.264
TrpAsn: 0.525 ± 0.303
TrpPro: 0.525 ± 0.428
TrpGln: 0.315 ± 0.157
TrpArg: 0.315 ± 0.129
TrpSer: 0.734 ± 0.332
TrpThr: 0.944 ± 0.215
TrpVal: 1.469 ± 0.601
TrpTrp: 0.105 ± 0.052
TrpTyr: 1.154 ± 0.271
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.826 ± 0.701
TyrCys: 1.469 ± 0.725
TyrAsp: 3.357 ± 1.004
TyrGlu: 1.889 ± 0.414
TyrPhe: 2.518 ± 0.755
TyrGly: 3.777 ± 0.579
TyrHis: 0.63 ± 0.191
TyrIle: 1.889 ± 0.577
TyrLys: 2.518 ± 0.576
TyrLeu: 3.672 ± 0.694
TyrMet: 0.734 ± 0.233
TyrAsn: 3.672 ± 1.516
TyrPro: 2.623 ± 0.889
TyrGln: 0.839 ± 0.278
TyrArg: 1.679 ± 0.395
TyrSer: 3.148 ± 0.776
TyrThr: 3.357 ± 0.52
TyrVal: 4.931 ± 1.082
TyrTrp: 0.525 ± 0.626
TyrTyr: 2.413 ± 0.469
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (9532 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
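A protein-level bootstrap of this kind can be sketched as below. The source states only that the error comes from bootstrapping (x100) at the protein level; reporting the standard deviation across resamples, the function names, the fixed seed, and the one-letter pair notation are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_frequency(sequences, pair):
    """Frequency (per mille) of a one-letter pair such as 'AA'
    among all overlapping residue pairs in the given proteins."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling
    whole proteins with replacement.

    Assumption: the error is taken as the sample standard deviation
    of the frequency over the resampled proteomes.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome for illustration only.
toy_proteome = ["AAAA", "ACAC", "CCCC"]
toy_error = bootstrap_error(toy_proteome, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins are present. With only 8 proteins, as here, a single unusual protein can dominate that spread.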

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski