Amino acid dipepetide frequency for Hubei picorna-like virus 66

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.869AlaAla: 4.869 ± 1.503
1.025AlaCys: 1.025 ± 0.47
3.844AlaAsp: 3.844 ± 0.167
4.869AlaGlu: 4.869 ± 0.433
2.307AlaPhe: 2.307 ± 0.704
3.332AlaGly: 3.332 ± 1.23
1.794AlaHis: 1.794 ± 0.612
5.126AlaIle: 5.126 ± 0.016
3.332AlaLys: 3.332 ± 0.503
4.869AlaLeu: 4.869 ± 0.636
0.256AlaMet: 0.256 ± 0.168
2.819AlaAsn: 2.819 ± 0.295
3.588AlaPro: 3.588 ± 0.532
2.307AlaGln: 2.307 ± 0.566
4.613AlaArg: 4.613 ± 1.153
2.819AlaSer: 2.819 ± 0.295
3.075AlaThr: 3.075 ± 0.424
4.869AlaVal: 4.869 ± 0.499
0.769AlaTrp: 0.769 ± 0.332
1.281AlaTyr: 1.281 ± 0.174
0.0AlaXaa: 0.0 ± 0.0
Cys
0.513CysAla: 0.513 ± 0.335
0.0CysCys: 0.0 ± 0.0
1.281CysAsp: 1.281 ± 0.537
1.281CysGlu: 1.281 ± 0.838
0.256CysPhe: 0.256 ± 0.168
1.281CysGly: 1.281 ± 0.537
0.0CysHis: 0.0 ± 0.0
1.538CysIle: 1.538 ± 0.7
0.513CysLys: 0.513 ± 0.335
2.05CysLeu: 2.05 ± 1.341
0.256CysMet: 0.256 ± 0.294
1.538CysAsn: 1.538 ± 1.005
0.513CysPro: 0.513 ± 0.151
0.513CysGln: 0.513 ± 0.151
0.256CysArg: 0.256 ± 0.168
0.513CysSer: 0.513 ± 0.229
0.513CysThr: 0.513 ± 0.229
1.025CysVal: 1.025 ± 0.379
0.256CysTrp: 0.256 ± 0.216
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.307AspAla: 2.307 ± 0.704
0.256AspCys: 0.256 ± 0.168
5.638AspAsp: 5.638 ± 0.14
4.869AspGlu: 4.869 ± 0.579
2.563AspPhe: 2.563 ± 0.718
2.563AspGly: 2.563 ± 0.363
1.794AspHis: 1.794 ± 0.852
3.588AspIle: 3.588 ± 1.097
3.588AspLys: 3.588 ± 1.355
6.663AspLeu: 6.663 ± 1.363
2.05AspMet: 2.05 ± 0.479
3.332AspAsn: 3.332 ± 0.654
3.075AspPro: 3.075 ± 1.017
2.05AspGln: 2.05 ± 0.352
2.819AspArg: 2.819 ± 0.57
5.126AspSer: 5.126 ± 0.342
1.794AspThr: 1.794 ± 0.948
4.869AspVal: 4.869 ± 1.021
1.281AspTrp: 1.281 ± 0.181
2.05AspTyr: 2.05 ± 0.586
0.0AspXaa: 0.0 ± 0.0
Glu
5.126GluAla: 5.126 ± 0.342
0.769GluCys: 0.769 ± 0.503
4.613GluAsp: 4.613 ± 0.76
11.02GluGlu: 11.02 ± 4.871
3.075GluPhe: 3.075 ± 0.494
4.1GluGly: 4.1 ± 0.067
2.05GluHis: 2.05 ± 0.352
7.432GluIle: 7.432 ± 1.511
6.151GluLys: 6.151 ± 4.376
4.869GluLeu: 4.869 ± 0.821
1.538GluMet: 1.538 ± 0.709
3.332GluAsn: 3.332 ± 1.287
2.307GluPro: 2.307 ± 0.82
5.126GluGln: 5.126 ± 0.67
4.613GluArg: 4.613 ± 0.528
1.794GluSer: 1.794 ± 0.608
5.382GluThr: 5.382 ± 1.025
4.613GluVal: 4.613 ± 0.325
1.025GluTrp: 1.025 ± 0.078
1.538GluTyr: 1.538 ± 1.005
0.0GluXaa: 0.0 ± 0.0
Phe
1.538PheAla: 1.538 ± 0.384
0.513PheCys: 0.513 ± 0.151
3.332PheAsp: 3.332 ± 0.508
3.332PheGlu: 3.332 ± 0.503
0.769PhePhe: 0.769 ± 0.235
3.844PheGly: 3.844 ± 0.759
0.256PheHis: 0.256 ± 0.168
2.819PheIle: 2.819 ± 0.424
2.05PheLys: 2.05 ± 1.029
3.075PheLeu: 3.075 ± 0.145
0.256PheMet: 0.256 ± 0.168
2.819PheAsn: 2.819 ± 0.536
1.025PhePro: 1.025 ± 0.392
0.513PheGln: 0.513 ± 0.335
2.563PheArg: 2.563 ± 1.005
2.05PheSer: 2.05 ± 0.603
2.05PheThr: 2.05 ± 0.352
3.844PheVal: 3.844 ± 0.167
0.769PheTrp: 0.769 ± 0.274
0.769PheTyr: 0.769 ± 0.274
0.0PheXaa: 0.0 ± 0.0
Gly
4.613GlyAla: 4.613 ± 0.528
0.0GlyCys: 0.0 ± 0.0
5.382GlyAsp: 5.382 ± 1.275
4.869GlyGlu: 4.869 ± 0.843
1.538GlyPhe: 1.538 ± 0.452
4.1GlyGly: 4.1 ± 1.597
1.794GlyHis: 1.794 ± 0.608
4.613GlyIle: 4.613 ± 1.021
5.126GlyLys: 5.126 ± 1.076
4.357GlyLeu: 4.357 ± 0.511
0.513GlyMet: 0.513 ± 0.348
4.613GlyAsn: 4.613 ± 1.071
1.794GlyPro: 1.794 ± 0.612
1.538GlyGln: 1.538 ± 0.47
1.538GlyArg: 1.538 ± 0.7
3.332GlySer: 3.332 ± 0.639
1.794GlyThr: 1.794 ± 0.266
4.357GlyVal: 4.357 ± 1.467
0.513GlyTrp: 0.513 ± 0.335
1.281GlyTyr: 1.281 ± 0.537
0.0GlyXaa: 0.0 ± 0.0
His
1.025HisAla: 1.025 ± 0.379
0.256HisCys: 0.256 ± 0.168
0.256HisAsp: 0.256 ± 0.216
1.025HisGlu: 1.025 ± 0.302
0.256HisPhe: 0.256 ± 0.168
2.819HisGly: 2.819 ± 1.236
0.0HisHis: 0.0 ± 0.0
1.794HisIle: 1.794 ± 0.864
1.281HisLys: 1.281 ± 0.838
1.538HisLeu: 1.538 ± 0.692
1.025HisMet: 1.025 ± 0.302
1.538HisAsn: 1.538 ± 0.547
1.281HisPro: 1.281 ± 0.181
0.769HisGln: 0.769 ± 0.235
0.256HisArg: 0.256 ± 0.168
0.769HisSer: 0.769 ± 0.503
0.256HisThr: 0.256 ± 0.168
2.819HisVal: 2.819 ± 0.536
0.0HisTrp: 0.0 ± 0.0
0.256HisTyr: 0.256 ± 0.294
0.0HisXaa: 0.0 ± 0.0
Ile
3.844IleAla: 3.844 ± 0.829
1.794IleCys: 1.794 ± 0.864
3.588IleAsp: 3.588 ± 0.648
6.663IleGlu: 6.663 ± 1.743
1.538IlePhe: 1.538 ± 0.452
4.357IleGly: 4.357 ± 0.605
0.769IleHis: 0.769 ± 0.189
2.307IleIle: 2.307 ± 0.517
4.1IleLys: 4.1 ± 0.878
3.844IleLeu: 3.844 ± 0.523
0.769IleMet: 0.769 ± 0.235
3.332IleAsn: 3.332 ± 0.654
3.588IlePro: 3.588 ± 1.306
1.794IleGln: 1.794 ± 0.653
3.332IleArg: 3.332 ± 0.934
4.613IleSer: 4.613 ± 1.504
5.382IleThr: 5.382 ± 0.987
8.201IleVal: 8.201 ± 1.357
1.281IleTrp: 1.281 ± 0.36
2.05IleTyr: 2.05 ± 0.668
0.0IleXaa: 0.0 ± 0.0
Lys
6.407LysAla: 6.407 ± 0.855
1.538LysCys: 1.538 ± 0.7
3.844LysAsp: 3.844 ± 1.094
6.407LysGlu: 6.407 ± 1.984
2.819LysPhe: 2.819 ± 0.556
3.332LysGly: 3.332 ± 0.885
1.025LysHis: 1.025 ± 0.67
4.613LysIle: 4.613 ± 1.021
5.638LysLys: 5.638 ± 2.516
7.688LysLeu: 7.688 ± 1.447
1.025LysMet: 1.025 ± 0.392
3.332LysAsn: 3.332 ± 0.997
3.588LysPro: 3.588 ± 2.343
3.588LysGln: 3.588 ± 0.47
4.357LysArg: 4.357 ± 2.38
4.613LysSer: 4.613 ± 1.101
5.638LysThr: 5.638 ± 2.74
4.613LysVal: 4.613 ± 1.467
0.256LysTrp: 0.256 ± 0.216
1.025LysTyr: 1.025 ± 0.67
0.0LysXaa: 0.0 ± 0.0
Leu
4.869LeuAla: 4.869 ± 1.149
1.281LeuCys: 1.281 ± 0.537
6.92LeuAsp: 6.92 ± 1.786
7.688LeuGlu: 7.688 ± 0.418
2.819LeuPhe: 2.819 ± 0.41
4.1LeuGly: 4.1 ± 0.703
1.281LeuHis: 1.281 ± 0.537
4.1LeuIle: 4.1 ± 0.47
7.688LeuLys: 7.688 ± 1.619
7.688LeuLeu: 7.688 ± 1.967
1.281LeuMet: 1.281 ± 0.181
4.1LeuAsn: 4.1 ± 0.703
4.613LeuPro: 4.613 ± 1.148
2.819LeuGln: 2.819 ± 0.85
3.844LeuArg: 3.844 ± 0.878
6.663LeuSer: 6.663 ± 0.707
5.638LeuThr: 5.638 ± 1.073
5.382LeuVal: 5.382 ± 0.798
1.281LeuTrp: 1.281 ± 0.181
1.794LeuTyr: 1.794 ± 0.852
0.0LeuXaa: 0.0 ± 0.0
Met
1.281MetAla: 1.281 ± 0.537
0.513MetCys: 0.513 ± 0.335
1.794MetAsp: 1.794 ± 0.852
1.281MetGlu: 1.281 ± 1.077
0.513MetPhe: 0.513 ± 0.151
1.281MetGly: 1.281 ± 0.174
0.513MetHis: 0.513 ± 0.335
0.769MetIle: 0.769 ± 0.274
1.025MetLys: 1.025 ± 0.379
1.025MetLeu: 1.025 ± 0.379
0.513MetMet: 0.513 ± 0.151
0.513MetAsn: 0.513 ± 0.151
0.513MetPro: 0.513 ± 0.151
0.513MetGln: 0.513 ± 0.431
0.513MetArg: 0.513 ± 0.151
2.819MetSer: 2.819 ± 0.536
0.769MetThr: 0.769 ± 0.274
1.281MetVal: 1.281 ± 0.537
0.513MetTrp: 0.513 ± 0.335
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.819AsnAla: 2.819 ± 0.705
0.769AsnCys: 0.769 ± 0.503
3.588AsnAsp: 3.588 ± 0.328
2.307AsnGlu: 2.307 ± 0.704
2.563AsnPhe: 2.563 ± 0.363
2.563AsnGly: 2.563 ± 0.349
1.025AsnHis: 1.025 ± 0.392
2.819AsnIle: 2.819 ± 1.265
3.332AsnLys: 3.332 ± 0.871
3.588AsnLeu: 3.588 ± 0.47
0.769AsnMet: 0.769 ± 0.274
4.869AsnAsn: 4.869 ± 1.156
2.05AsnPro: 2.05 ± 0.33
2.307AsnGln: 2.307 ± 0.292
2.307AsnArg: 2.307 ± 0.506
3.332AsnSer: 3.332 ± 0.281
5.894AsnThr: 5.894 ± 1.662
4.357AsnVal: 4.357 ± 0.609
1.538AsnTrp: 1.538 ± 0.34
1.281AsnTyr: 1.281 ± 0.357
0.0AsnXaa: 0.0 ± 0.0
Pro
2.819ProAla: 2.819 ± 1.802
0.256ProCys: 0.256 ± 0.168
1.794ProAsp: 1.794 ± 0.235
2.307ProGlu: 2.307 ± 1.161
1.794ProPhe: 1.794 ± 0.34
2.307ProGly: 2.307 ± 0.396
0.513ProHis: 0.513 ± 0.335
3.844ProIle: 3.844 ± 2.321
4.1ProLys: 4.1 ± 2.874
3.588ProLeu: 3.588 ± 0.675
1.281ProMet: 1.281 ± 0.127
1.281ProAsn: 1.281 ± 0.76
1.281ProPro: 1.281 ± 0.469
1.281ProGln: 1.281 ± 0.719
1.281ProArg: 1.281 ± 1.078
2.819ProSer: 2.819 ± 0.909
3.844ProThr: 3.844 ± 1.059
3.075ProVal: 3.075 ± 0.145
1.281ProTrp: 1.281 ± 0.537
1.025ProTyr: 1.025 ± 0.334
0.0ProXaa: 0.0 ± 0.0
Gln
3.075GlnAla: 3.075 ± 1.675
0.0GlnCys: 0.0 ± 0.0
1.794GlnAsp: 1.794 ± 0.948
2.307GlnGlu: 2.307 ± 0.795
1.794GlnPhe: 1.794 ± 0.266
1.025GlnGly: 1.025 ± 0.392
1.025GlnHis: 1.025 ± 0.392
2.307GlnIle: 2.307 ± 0.838
3.332GlnLys: 3.332 ± 0.187
3.075GlnLeu: 3.075 ± 0.681
0.256GlnMet: 0.256 ± 0.216
1.281GlnAsn: 1.281 ± 0.469
1.281GlnPro: 1.281 ± 0.174
1.281GlnGln: 1.281 ± 0.36
2.05GlnArg: 2.05 ± 0.857
1.794GlnSer: 1.794 ± 0.505
3.332GlnThr: 3.332 ± 1.287
3.075GlnVal: 3.075 ± 0.525
0.0GlnTrp: 0.0 ± 0.0
0.769GlnTyr: 0.769 ± 0.235
0.0GlnXaa: 0.0 ± 0.0
Arg
3.332ArgAla: 3.332 ± 0.235
0.513ArgCys: 0.513 ± 0.151
3.332ArgAsp: 3.332 ± 0.187
2.819ArgGlu: 2.819 ± 0.772
2.307ArgPhe: 2.307 ± 0.704
2.307ArgGly: 2.307 ± 0.396
0.256ArgHis: 0.256 ± 0.168
3.844ArgIle: 3.844 ± 0.943
4.357ArgLys: 4.357 ± 1.289
5.638ArgLeu: 5.638 ± 0.654
0.769ArgMet: 0.769 ± 0.332
2.563ArgAsn: 2.563 ± 0.421
2.05ArgPro: 2.05 ± 0.668
1.538ArgGln: 1.538 ± 0.384
1.794ArgArg: 1.794 ± 0.34
3.844ArgSer: 3.844 ± 1.097
2.819ArgThr: 2.819 ± 1.409
3.075ArgVal: 3.075 ± 0.525
1.538ArgTrp: 1.538 ± 0.073
1.794ArgTyr: 1.794 ± 0.87
0.0ArgXaa: 0.0 ± 0.0
Ser
2.819SerAla: 2.819 ± 1.124
1.281SerCys: 1.281 ± 0.537
4.1SerAsp: 4.1 ± 0.47
3.844SerGlu: 3.844 ± 0.531
3.332SerPhe: 3.332 ± 0.997
4.869SerGly: 4.869 ± 1.333
1.025SerHis: 1.025 ± 0.379
4.613SerIle: 4.613 ± 0.677
6.92SerLys: 6.92 ± 2.357
6.92SerLeu: 6.92 ± 1.164
0.513SerMet: 0.513 ± 0.151
1.538SerAsn: 1.538 ± 0.998
2.563SerPro: 2.563 ± 0.838
1.025SerGln: 1.025 ± 0.392
3.844SerArg: 3.844 ± 0.355
3.075SerSer: 3.075 ± 0.66
2.563SerThr: 2.563 ± 0.715
4.869SerVal: 4.869 ± 0.8
0.513SerTrp: 0.513 ± 0.335
2.05SerTyr: 2.05 ± 0.352
0.0SerXaa: 0.0 ± 0.0
Thr
4.1ThrAla: 4.1 ± 0.313
1.281ThrCys: 1.281 ± 0.838
2.563ThrAsp: 2.563 ± 0.374
3.588ThrGlu: 3.588 ± 1.097
2.563ThrPhe: 2.563 ± 0.715
3.588ThrGly: 3.588 ± 1.325
1.281ThrHis: 1.281 ± 0.357
3.844ThrIle: 3.844 ± 0.523
5.126ThrLys: 5.126 ± 1.887
6.151ThrLeu: 6.151 ± 1.05
1.281ThrMet: 1.281 ± 0.537
2.819ThrAsn: 2.819 ± 0.208
2.563ThrPro: 2.563 ± 0.838
2.307ThrGln: 2.307 ± 0.693
2.819ThrArg: 2.819 ± 0.41
2.819ThrSer: 2.819 ± 0.208
5.382ThrThr: 5.382 ± 0.782
6.151ThrVal: 6.151 ± 0.47
0.0ThrTrp: 0.0 ± 0.0
1.281ThrTyr: 1.281 ± 0.357
0.0ThrXaa: 0.0 ± 0.0
Val
4.613ValAla: 4.613 ± 0.662
1.025ValCys: 1.025 ± 0.379
2.307ValAsp: 2.307 ± 0.292
7.432ValGlu: 7.432 ± 1.223
2.563ValPhe: 2.563 ± 1.074
3.844ValGly: 3.844 ± 1.174
2.05ValHis: 2.05 ± 0.223
5.382ValIle: 5.382 ± 0.782
4.613ValLys: 4.613 ± 0.542
5.126ValLeu: 5.126 ± 1.261
1.794ValMet: 1.794 ± 0.608
6.151ValAsn: 6.151 ± 1.668
3.844ValPro: 3.844 ± 1.345
2.563ValGln: 2.563 ± 0.838
4.357ValArg: 4.357 ± 0.258
6.407ValSer: 6.407 ± 0.941
3.332ValThr: 3.332 ± 0.941
4.1ValVal: 4.1 ± 0.447
2.05ValTrp: 2.05 ± 0.352
2.307ValTyr: 2.307 ± 0.571
0.0ValXaa: 0.0 ± 0.0
Trp
1.025TrpAla: 1.025 ± 0.302
0.513TrpCys: 0.513 ± 0.335
0.769TrpAsp: 0.769 ± 0.274
0.769TrpGlu: 0.769 ± 0.607
0.256TrpPhe: 0.256 ± 0.216
1.025TrpGly: 1.025 ± 0.078
0.513TrpHis: 0.513 ± 0.335
0.769TrpIle: 0.769 ± 0.235
2.05TrpLys: 2.05 ± 0.352
0.769TrpLeu: 0.769 ± 0.235
0.769TrpMet: 0.769 ± 0.503
0.769TrpAsn: 0.769 ± 0.881
0.256TrpPro: 0.256 ± 0.168
0.256TrpGln: 0.256 ± 0.168
1.794TrpArg: 1.794 ± 0.505
1.025TrpSer: 1.025 ± 0.078
0.769TrpThr: 0.769 ± 0.235
0.513TrpVal: 0.513 ± 0.151
0.256TrpTrp: 0.256 ± 0.168
0.513TrpTyr: 0.513 ± 0.151
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.769TyrAla: 0.769 ± 0.235
0.513TyrCys: 0.513 ± 0.335
1.538TyrAsp: 1.538 ± 0.34
1.794TyrGlu: 1.794 ± 0.612
2.563TyrPhe: 2.563 ± 0.715
1.025TyrGly: 1.025 ± 0.458
0.256TyrHis: 0.256 ± 0.168
1.025TyrIle: 1.025 ± 0.379
1.025TyrLys: 1.025 ± 0.392
3.332TyrLeu: 3.332 ± 1.184
0.769TyrMet: 0.769 ± 0.503
1.538TyrAsn: 1.538 ± 0.547
0.256TyrPro: 0.256 ± 0.216
0.769TyrGln: 0.769 ± 0.332
1.538TyrArg: 1.538 ± 0.965
2.05TyrSer: 2.05 ± 0.223
1.281TyrThr: 1.281 ± 0.181
0.769TyrVal: 0.769 ± 0.235
0.256TyrTrp: 0.256 ± 0.294
0.769TyrTyr: 0.769 ± 0.274
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3903 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski