Amino acid dipepetide frequency for Hubei picorna-like virus 73

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.051AlaAla: 7.051 ± 0.432
0.307AlaCys: 0.307 ± 0.465
4.905AlaAsp: 4.905 ± 0.817
3.372AlaGlu: 3.372 ± 1.031
4.598AlaPhe: 4.598 ± 1.812
2.759AlaGly: 2.759 ± 0.475
1.226AlaHis: 1.226 ± 1.073
3.679AlaIle: 3.679 ± 1.111
2.146AlaLys: 2.146 ± 1.239
6.131AlaLeu: 6.131 ± 1.035
2.146AlaMet: 2.146 ± 0.667
3.372AlaAsn: 3.372 ± 0.538
4.905AlaPro: 4.905 ± 1.481
2.759AlaGln: 2.759 ± 0.952
3.066AlaArg: 3.066 ± 0.646
5.212AlaSer: 5.212 ± 1.657
3.985AlaThr: 3.985 ± 2.164
3.985AlaVal: 3.985 ± 0.504
2.146AlaTrp: 2.146 ± 0.805
1.533AlaTyr: 1.533 ± 0.532
0.0AlaXaa: 0.0 ± 0.0
Cys
3.066CysAla: 3.066 ± 1.582
0.0CysCys: 0.0 ± 0.0
1.226CysAsp: 1.226 ± 0.471
0.92CysGlu: 0.92 ± 0.473
0.613CysPhe: 0.613 ± 0.536
0.92CysGly: 0.92 ± 0.531
0.307CysHis: 0.307 ± 0.644
0.307CysIle: 0.307 ± 0.177
0.92CysLys: 0.92 ± 0.531
1.226CysLeu: 1.226 ± 0.471
0.307CysMet: 0.307 ± 0.374
0.307CysAsn: 0.307 ± 0.644
0.613CysPro: 0.613 ± 0.8
0.613CysGln: 0.613 ± 0.536
0.92CysArg: 0.92 ± 0.531
0.92CysSer: 0.92 ± 0.531
1.839CysThr: 1.839 ± 0.631
1.839CysVal: 1.839 ± 0.93
0.0CysTrp: 0.0 ± 0.0
0.307CysTyr: 0.307 ± 0.177
0.0CysXaa: 0.0 ± 0.0
Asp
2.452AspAla: 2.452 ± 1.508
2.759AspCys: 2.759 ± 1.515
3.985AspAsp: 3.985 ± 1.417
3.372AspGlu: 3.372 ± 1.402
4.598AspPhe: 4.598 ± 1.251
3.679AspGly: 3.679 ± 0.491
1.226AspHis: 1.226 ± 0.457
5.212AspIle: 5.212 ± 1.58
2.759AspLys: 2.759 ± 0.633
5.212AspLeu: 5.212 ± 0.603
1.226AspMet: 1.226 ± 0.708
0.613AspAsn: 0.613 ± 0.354
3.679AspPro: 3.679 ± 0.497
0.92AspGln: 0.92 ± 0.809
2.452AspArg: 2.452 ± 0.741
2.759AspSer: 2.759 ± 0.748
3.985AspThr: 3.985 ± 0.504
7.357AspVal: 7.357 ± 0.935
0.307AspTrp: 0.307 ± 0.177
4.598AspTyr: 4.598 ± 0.796
0.0AspXaa: 0.0 ± 0.0
Glu
4.905GluAla: 4.905 ± 1.187
0.613GluCys: 0.613 ± 0.354
2.759GluAsp: 2.759 ± 0.475
3.372GluGlu: 3.372 ± 1.465
3.372GluPhe: 3.372 ± 0.336
2.759GluGly: 2.759 ± 1.122
2.452GluHis: 2.452 ± 0.942
3.679GluIle: 3.679 ± 1.412
2.452GluLys: 2.452 ± 0.942
4.905GluLeu: 4.905 ± 1.161
1.839GluMet: 1.839 ± 0.223
1.839GluAsn: 1.839 ± 1.062
3.066GluPro: 3.066 ± 1.239
1.533GluGln: 1.533 ± 0.995
3.066GluArg: 3.066 ± 1.063
2.146GluSer: 2.146 ± 0.789
2.452GluThr: 2.452 ± 0.902
3.066GluVal: 3.066 ± 1.236
0.92GluTrp: 0.92 ± 0.531
2.146GluTyr: 2.146 ± 0.176
0.0GluXaa: 0.0 ± 0.0
Phe
2.759PheAla: 2.759 ± 1.206
1.226PheCys: 1.226 ± 1.073
4.598PheAsp: 4.598 ± 0.796
3.372PheGlu: 3.372 ± 0.336
3.066PhePhe: 3.066 ± 1.239
2.759PheGly: 2.759 ± 0.633
0.92PheHis: 0.92 ± 0.317
1.839PheIle: 1.839 ± 1.993
3.372PheLys: 3.372 ± 0.538
3.372PheLeu: 3.372 ± 1.16
0.92PheMet: 0.92 ± 0.809
3.066PheAsn: 3.066 ± 1.239
2.759PhePro: 2.759 ± 0.759
2.452PheGln: 2.452 ± 0.917
3.372PheArg: 3.372 ± 1.16
5.212PheSer: 5.212 ± 1.58
3.985PheThr: 3.985 ± 1.07
3.679PheVal: 3.679 ± 1.412
0.307PheTrp: 0.307 ± 0.177
3.066PheTyr: 3.066 ± 1.029
0.0PheXaa: 0.0 ± 0.0
Gly
4.598GlyAla: 4.598 ± 0.146
1.226GlyCys: 1.226 ± 0.708
3.372GlyAsp: 3.372 ± 1.274
3.372GlyGlu: 3.372 ± 1.996
2.759GlyPhe: 2.759 ± 0.183
2.146GlyGly: 2.146 ± 2.079
0.307GlyHis: 0.307 ± 0.177
4.292GlyIle: 4.292 ± 1.578
4.292GlyLys: 4.292 ± 1.345
4.598GlyLeu: 4.598 ± 1.501
0.613GlyMet: 0.613 ± 0.354
3.985GlyAsn: 3.985 ± 0.498
1.226GlyPro: 1.226 ± 0.708
2.759GlyGln: 2.759 ± 0.759
2.759GlyArg: 2.759 ± 0.633
1.226GlySer: 1.226 ± 0.457
3.985GlyThr: 3.985 ± 0.498
5.518GlyVal: 5.518 ± 1.122
0.613GlyTrp: 0.613 ± 0.356
1.226GlyTyr: 1.226 ± 0.37
0.0GlyXaa: 0.0 ± 0.0
His
0.92HisAla: 0.92 ± 0.531
0.0HisCys: 0.0 ± 0.0
1.533HisAsp: 1.533 ± 0.532
1.839HisGlu: 1.839 ± 1.062
1.226HisPhe: 1.226 ± 0.965
1.533HisGly: 1.533 ± 1.07
0.613HisHis: 0.613 ± 0.536
1.533HisIle: 1.533 ± 0.486
1.226HisLys: 1.226 ± 0.708
1.533HisLeu: 1.533 ± 0.295
0.307HisMet: 0.307 ± 0.177
0.307HisAsn: 0.307 ± 0.465
1.533HisPro: 1.533 ± 0.295
1.533HisGln: 1.533 ± 0.885
1.226HisArg: 1.226 ± 0.471
2.146HisSer: 2.146 ± 0.176
0.92HisThr: 0.92 ± 0.317
2.146HisVal: 2.146 ± 0.452
0.92HisTrp: 0.92 ± 0.473
0.92HisTyr: 0.92 ± 0.627
0.0HisXaa: 0.0 ± 0.0
Ile
4.598IleAla: 4.598 ± 1.587
0.92IleCys: 0.92 ± 1.172
4.905IleAsp: 4.905 ± 0.623
3.985IleGlu: 3.985 ± 1.74
2.452IlePhe: 2.452 ± 0.942
3.679IleGly: 3.679 ± 1.262
3.372IleHis: 3.372 ± 0.538
1.839IleIle: 1.839 ± 0.637
3.679IleLys: 3.679 ± 0.497
3.985IleLeu: 3.985 ± 1.965
0.613IleMet: 0.613 ± 0.354
3.679IleAsn: 3.679 ± 2.138
4.905IlePro: 4.905 ± 1.636
1.533IleGln: 1.533 ± 0.651
3.066IleArg: 3.066 ± 0.205
3.679IleSer: 3.679 ± 0.497
3.066IleThr: 3.066 ± 1.77
3.372IleVal: 3.372 ± 0.636
1.226IleTrp: 1.226 ± 0.37
0.613IleTyr: 0.613 ± 0.354
0.0IleXaa: 0.0 ± 0.0
Lys
2.759LysAla: 2.759 ± 1.122
0.307LysCys: 0.307 ± 0.177
5.212LysAsp: 5.212 ± 2.512
3.372LysGlu: 3.372 ± 0.819
3.985LysPhe: 3.985 ± 1.402
3.679LysGly: 3.679 ± 1.89
1.839LysHis: 1.839 ± 1.062
4.292LysIle: 4.292 ± 2.375
3.372LysLys: 3.372 ± 1.465
4.292LysLeu: 4.292 ± 0.938
1.226LysMet: 1.226 ± 0.471
2.452LysAsn: 2.452 ± 0.954
1.839LysPro: 1.839 ± 0.631
1.839LysGln: 1.839 ± 1.062
2.146LysArg: 2.146 ± 0.789
3.372LysSer: 3.372 ± 1.947
3.679LysThr: 3.679 ± 0.491
3.372LysVal: 3.372 ± 1.41
0.613LysTrp: 0.613 ± 0.536
2.759LysTyr: 2.759 ± 1.122
0.0LysXaa: 0.0 ± 0.0
Leu
4.598LeuAla: 4.598 ± 2.44
1.226LeuCys: 1.226 ± 0.37
5.825LeuAsp: 5.825 ± 1.132
3.985LeuGlu: 3.985 ± 2.457
4.598LeuPhe: 4.598 ± 0.474
3.066LeuGly: 3.066 ± 1.063
1.533LeuHis: 1.533 ± 0.532
6.131LeuIle: 6.131 ± 0.742
7.664LeuLys: 7.664 ± 1.697
7.051LeuLeu: 7.051 ± 1.273
1.839LeuMet: 1.839 ± 0.576
4.598LeuAsn: 4.598 ± 1.996
5.518LeuPro: 5.518 ± 0.763
3.679LeuGln: 3.679 ± 0.497
4.905LeuArg: 4.905 ± 2.897
7.357LeuSer: 7.357 ± 0.888
3.679LeuThr: 3.679 ± 0.994
7.971LeuVal: 7.971 ± 1.804
0.613LeuTrp: 0.613 ± 0.8
3.679LeuTyr: 3.679 ± 1.861
0.0LeuXaa: 0.0 ± 0.0
Met
1.839MetAla: 1.839 ± 0.166
0.307MetCys: 0.307 ± 0.177
1.226MetAsp: 1.226 ± 0.713
0.307MetGlu: 0.307 ± 0.177
1.533MetPhe: 1.533 ± 0.295
1.533MetGly: 1.533 ± 0.885
0.613MetHis: 0.613 ± 0.356
1.839MetIle: 1.839 ± 1.707
1.839MetLys: 1.839 ± 1.062
2.452MetLeu: 2.452 ± 0.954
0.613MetMet: 0.613 ± 0.354
0.307MetAsn: 0.307 ± 0.177
0.307MetPro: 0.307 ± 0.177
1.226MetGln: 1.226 ± 0.708
0.613MetArg: 0.613 ± 0.354
1.839MetSer: 1.839 ± 0.635
0.92MetThr: 0.92 ± 0.531
0.613MetVal: 0.613 ± 0.356
0.307MetTrp: 0.307 ± 0.465
1.839MetTyr: 1.839 ± 0.635
0.0MetXaa: 0.0 ± 0.0
Asn
4.905AsnAla: 4.905 ± 1.183
0.613AsnCys: 0.613 ± 0.354
2.452AsnAsp: 2.452 ± 1.426
1.533AsnGlu: 1.533 ± 0.486
2.452AsnPhe: 2.452 ± 0.914
2.146AsnGly: 2.146 ± 1.56
0.307AsnHis: 0.307 ± 0.177
1.226AsnIle: 1.226 ± 1.601
1.533AsnLys: 1.533 ± 0.995
4.598AsnLeu: 4.598 ± 1.684
0.307AsnMet: 0.307 ± 0.177
1.839AsnAsn: 1.839 ± 1.254
0.613AsnPro: 0.613 ± 0.354
0.307AsnGln: 0.307 ± 0.644
2.452AsnArg: 2.452 ± 0.312
2.452AsnSer: 2.452 ± 0.902
3.066AsnThr: 3.066 ± 0.634
4.292AsnVal: 4.292 ± 1.987
0.92AsnTrp: 0.92 ± 0.317
1.839AsnTyr: 1.839 ± 0.631
0.0AsnXaa: 0.0 ± 0.0
Pro
4.292ProAla: 4.292 ± 0.321
0.92ProCys: 0.92 ± 0.317
2.146ProAsp: 2.146 ± 0.452
2.146ProGlu: 2.146 ± 0.77
2.146ProPhe: 2.146 ± 0.927
4.292ProGly: 4.292 ± 0.833
1.226ProHis: 1.226 ± 1.545
2.146ProIle: 2.146 ± 1.001
2.452ProLys: 2.452 ± 0.296
6.744ProLeu: 6.744 ± 2.237
1.839ProMet: 1.839 ± 0.166
2.146ProAsn: 2.146 ± 1.001
3.985ProPro: 3.985 ± 1.611
2.146ProGln: 2.146 ± 0.452
1.226ProArg: 1.226 ± 0.471
3.679ProSer: 3.679 ± 1.262
1.533ProThr: 1.533 ± 1.07
3.985ProVal: 3.985 ± 1.212
1.533ProTrp: 1.533 ± 0.295
0.92ProTyr: 0.92 ± 0.317
0.0ProXaa: 0.0 ± 0.0
Gln
3.372GlnAla: 3.372 ± 0.819
0.92GlnCys: 0.92 ± 0.473
2.146GlnAsp: 2.146 ± 0.805
1.839GlnGlu: 1.839 ± 0.166
3.066GlnPhe: 3.066 ± 0.589
1.226GlnGly: 1.226 ± 0.708
0.92GlnHis: 0.92 ± 0.473
1.533GlnIle: 1.533 ± 0.885
1.839GlnLys: 1.839 ± 0.945
3.985GlnLeu: 3.985 ± 1.731
0.307GlnMet: 0.307 ± 0.465
0.613GlnAsn: 0.613 ± 0.354
0.92GlnPro: 0.92 ± 0.317
1.226GlnGln: 1.226 ± 1.813
1.839GlnArg: 1.839 ± 0.631
2.759GlnSer: 2.759 ± 0.748
2.759GlnThr: 2.759 ± 0.846
3.679GlnVal: 3.679 ± 1.56
0.92GlnTrp: 0.92 ± 1.139
0.92GlnTyr: 0.92 ± 1.139
0.0GlnXaa: 0.0 ± 0.0
Arg
3.372ArgAla: 3.372 ± 1.16
1.533ArgCys: 1.533 ± 0.486
3.372ArgAsp: 3.372 ± 0.444
3.372ArgGlu: 3.372 ± 1.16
1.533ArgPhe: 1.533 ± 0.295
2.759ArgGly: 2.759 ± 0.475
2.146ArgHis: 2.146 ± 1.239
3.066ArgIle: 3.066 ± 0.976
1.839ArgLys: 1.839 ± 0.166
4.905ArgLeu: 4.905 ± 0.736
2.759ArgMet: 2.759 ± 1.593
1.533ArgAsn: 1.533 ± 0.885
2.146ArgPro: 2.146 ± 0.452
1.839ArgGln: 1.839 ± 2.344
1.226ArgArg: 1.226 ± 0.708
2.146ArgSer: 2.146 ± 1.239
2.759ArgThr: 2.759 ± 0.846
3.679ArgVal: 3.679 ± 0.332
0.307ArgTrp: 0.307 ± 0.177
1.839ArgTyr: 1.839 ± 0.637
0.0ArgXaa: 0.0 ± 0.0
Ser
2.759SerAla: 2.759 ± 0.952
0.92SerCys: 0.92 ± 1.172
1.533SerAsp: 1.533 ± 0.651
4.905SerGlu: 4.905 ± 2.832
3.985SerPhe: 3.985 ± 0.666
4.598SerGly: 4.598 ± 1.955
1.533SerHis: 1.533 ± 0.486
5.825SerIle: 5.825 ± 1.714
3.985SerLys: 3.985 ± 0.293
5.212SerLeu: 5.212 ± 1.14
0.92SerMet: 0.92 ± 0.531
2.146SerAsn: 2.146 ± 0.452
2.146SerPro: 2.146 ± 0.667
4.292SerGln: 4.292 ± 2.547
4.292SerArg: 4.292 ± 1.329
5.825SerSer: 5.825 ± 0.345
4.905SerThr: 4.905 ± 2.41
4.598SerVal: 4.598 ± 1.012
1.533SerTrp: 1.533 ± 0.532
1.533SerTyr: 1.533 ± 0.295
0.0SerXaa: 0.0 ± 0.0
Thr
3.372ThrAla: 3.372 ± 0.538
0.92ThrCys: 0.92 ± 0.531
3.985ThrAsp: 3.985 ± 1.443
3.679ThrGlu: 3.679 ± 1.412
3.679ThrPhe: 3.679 ± 0.675
3.985ThrGly: 3.985 ± 2.581
0.613ThrHis: 0.613 ± 0.356
3.372ThrIle: 3.372 ± 1.031
2.452ThrLys: 2.452 ± 0.296
5.518ThrLeu: 5.518 ± 1.496
1.226ThrMet: 1.226 ± 0.37
1.226ThrAsn: 1.226 ± 0.457
4.598ThrPro: 4.598 ± 1.458
1.839ThrGln: 1.839 ± 0.93
3.679ThrArg: 3.679 ± 0.491
4.598ThrSer: 4.598 ± 1.398
4.292ThrThr: 4.292 ± 1.333
5.212ThrVal: 5.212 ± 1.375
0.307ThrTrp: 0.307 ± 0.465
1.839ThrTyr: 1.839 ± 0.635
0.0ThrXaa: 0.0 ± 0.0
Val
4.292ValAla: 4.292 ± 1.987
0.613ValCys: 0.613 ± 0.536
4.905ValAsp: 4.905 ± 1.187
2.759ValGlu: 2.759 ± 1.122
4.292ValPhe: 4.292 ± 0.352
4.905ValGly: 4.905 ± 0.817
1.533ValHis: 1.533 ± 0.532
4.905ValIle: 4.905 ± 0.819
5.518ValLys: 5.518 ± 2.688
8.277ValLeu: 8.277 ± 0.623
1.839ValMet: 1.839 ± 0.635
3.679ValAsn: 3.679 ± 0.332
4.905ValPro: 4.905 ± 2.252
2.759ValGln: 2.759 ± 0.989
3.372ValArg: 3.372 ± 1.16
5.518ValSer: 5.518 ± 0.365
5.518ValThr: 5.518 ± 1.913
6.131ValVal: 6.131 ± 1.45
0.613ValTrp: 0.613 ± 0.8
3.066ValTyr: 3.066 ± 0.634
0.0ValXaa: 0.0 ± 0.0
Trp
1.226TrpAla: 1.226 ± 0.37
0.613TrpCys: 0.613 ± 0.536
0.307TrpAsp: 0.307 ± 0.177
0.307TrpGlu: 0.307 ± 0.644
0.613TrpPhe: 0.613 ± 1.288
0.307TrpGly: 0.307 ± 0.465
0.0TrpHis: 0.0 ± 0.0
0.613TrpIle: 0.613 ± 0.356
0.92TrpLys: 0.92 ± 0.531
1.839TrpLeu: 1.839 ± 0.93
0.307TrpMet: 0.307 ± 0.177
0.613TrpAsn: 0.613 ± 0.354
0.307TrpPro: 0.307 ± 0.465
0.613TrpGln: 0.613 ± 0.356
0.92TrpArg: 0.92 ± 0.473
1.533TrpSer: 1.533 ± 1.161
1.533TrpThr: 1.533 ± 0.486
0.613TrpVal: 0.613 ± 0.356
0.0TrpTrp: 0.0 ± 0.0
0.92TrpTyr: 0.92 ± 1.172
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.146TyrAla: 2.146 ± 0.452
1.533TyrCys: 1.533 ± 0.295
2.759TyrAsp: 2.759 ± 1.515
1.533TyrGlu: 1.533 ± 0.995
1.533TyrPhe: 1.533 ± 1.07
2.146TyrGly: 2.146 ± 0.176
1.226TyrHis: 1.226 ± 0.37
1.839TyrIle: 1.839 ± 0.631
2.146TyrLys: 2.146 ± 1.239
3.679TyrLeu: 3.679 ± 1.105
0.613TyrMet: 0.613 ± 0.8
1.533TyrAsn: 1.533 ± 0.885
1.533TyrPro: 1.533 ± 0.486
0.92TyrGln: 0.92 ± 0.317
1.533TyrArg: 1.533 ± 0.651
3.066TyrSer: 3.066 ± 1.71
1.533TyrThr: 1.533 ± 0.651
4.292TyrVal: 4.292 ± 0.575
0.0TyrTrp: 0.0 ± 0.0
1.839TyrTyr: 1.839 ± 0.945
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3263 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski