Amino acid dipepetide frequency for Hubei picorna-like virus 75

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.601AlaAla: 7.601 ± 2.109
1.52AlaCys: 1.52 ± 0.682
1.824AlaAsp: 1.824 ± 1.034
3.649AlaGlu: 3.649 ± 1.083
1.824AlaPhe: 1.824 ± 0.916
3.953AlaGly: 3.953 ± 3.195
0.912AlaHis: 0.912 ± 1.321
3.344AlaIle: 3.344 ± 1.99
2.432AlaLys: 2.432 ± 0.728
5.169AlaLeu: 5.169 ± 2.067
3.344AlaMet: 3.344 ± 2.907
3.649AlaAsn: 3.649 ± 2.196
3.344AlaPro: 3.344 ± 2.03
2.432AlaGln: 2.432 ± 0.81
3.344AlaArg: 3.344 ± 1.37
6.385AlaSer: 6.385 ± 1.248
4.865AlaThr: 4.865 ± 0.71
3.04AlaVal: 3.04 ± 1.44
1.52AlaTrp: 1.52 ± 0.696
1.52AlaTyr: 1.52 ± 0.675
0.0AlaXaa: 0.0 ± 0.0
Cys
0.608CysAla: 0.608 ± 0.541
1.216CysCys: 1.216 ± 1.116
0.608CysAsp: 0.608 ± 0.681
1.216CysGlu: 1.216 ± 0.587
0.912CysPhe: 0.912 ± 1.131
1.824CysGly: 1.824 ± 0.825
1.216CysHis: 1.216 ± 0.617
0.608CysIle: 0.608 ± 0.357
0.608CysLys: 0.608 ± 0.357
0.608CysLeu: 0.608 ± 0.357
0.608CysMet: 0.608 ± 0.405
0.608CysAsn: 0.608 ± 0.681
0.608CysPro: 0.608 ± 0.541
0.304CysGln: 0.304 ± 0.402
2.432CysArg: 2.432 ± 0.654
1.824CysSer: 1.824 ± 1.088
1.824CysThr: 1.824 ± 0.66
2.432CysVal: 2.432 ± 1.15
0.0CysTrp: 0.0 ± 0.0
0.304CysTyr: 0.304 ± 0.44
0.0CysXaa: 0.0 ± 0.0
Asp
2.736AspAla: 2.736 ± 0.49
0.912AspCys: 0.912 ± 0.536
3.649AspAsp: 3.649 ± 0.934
4.561AspGlu: 4.561 ± 1.81
2.736AspPhe: 2.736 ± 1.188
3.953AspGly: 3.953 ± 1.263
2.128AspHis: 2.128 ± 0.927
3.04AspIle: 3.04 ± 0.911
3.344AspLys: 3.344 ± 0.47
4.257AspLeu: 4.257 ± 1.474
1.824AspMet: 1.824 ± 0.542
2.128AspAsn: 2.128 ± 0.461
3.953AspPro: 3.953 ± 1.353
2.128AspGln: 2.128 ± 0.688
1.216AspArg: 1.216 ± 0.543
2.128AspSer: 2.128 ± 0.775
2.128AspThr: 2.128 ± 0.927
3.344AspVal: 3.344 ± 1.141
1.216AspTrp: 1.216 ± 0.783
1.216AspTyr: 1.216 ± 0.809
0.0AspXaa: 0.0 ± 0.0
Glu
3.344GluAla: 3.344 ± 1.612
0.912GluCys: 0.912 ± 0.444
3.649GluAsp: 3.649 ± 1.337
2.736GluGlu: 2.736 ± 1.26
5.777GluPhe: 5.777 ± 1.613
3.649GluGly: 3.649 ± 1.296
2.128GluHis: 2.128 ± 0.63
2.736GluIle: 2.736 ± 1.26
2.432GluLys: 2.432 ± 1.15
5.473GluLeu: 5.473 ± 1.571
1.216GluMet: 1.216 ± 0.543
2.432GluAsn: 2.432 ± 0.96
2.432GluPro: 2.432 ± 0.728
1.216GluGln: 1.216 ± 0.587
0.912GluArg: 0.912 ± 0.67
2.432GluSer: 2.432 ± 1.222
3.04GluThr: 3.04 ± 1.028
4.257GluVal: 4.257 ± 0.931
1.216GluTrp: 1.216 ± 0.689
3.04GluTyr: 3.04 ± 0.955
0.0GluXaa: 0.0 ± 0.0
Phe
2.432PheAla: 2.432 ± 0.946
0.912PheCys: 0.912 ± 0.373
4.561PheAsp: 4.561 ± 2.283
2.432PheGlu: 2.432 ± 0.654
2.432PhePhe: 2.432 ± 1.101
1.824PheGly: 1.824 ± 0.517
1.216PheHis: 1.216 ± 0.488
2.432PheIle: 2.432 ± 0.728
4.257PheLys: 4.257 ± 1.426
3.953PheLeu: 3.953 ± 0.806
1.216PheMet: 1.216 ± 0.689
2.128PheAsn: 2.128 ± 0.855
0.912PhePro: 0.912 ± 0.776
1.824PheGln: 1.824 ± 1.034
1.52PheArg: 1.52 ± 0.893
2.432PheSer: 2.432 ± 1.729
3.649PheThr: 3.649 ± 0.974
4.257PheVal: 4.257 ± 1.343
0.608PheTrp: 0.608 ± 0.357
0.608PheTyr: 0.608 ± 0.357
0.0PheXaa: 0.0 ± 0.0
Gly
3.344GlyAla: 3.344 ± 0.754
0.912GlyCys: 0.912 ± 0.535
2.736GlyAsp: 2.736 ± 0.725
2.128GlyGlu: 2.128 ± 0.985
2.432GlyPhe: 2.432 ± 1.062
3.344GlyGly: 3.344 ± 3.853
2.736GlyHis: 2.736 ± 1.318
3.344GlyIle: 3.344 ± 1.043
3.649GlyLys: 3.649 ± 1.459
5.777GlyLeu: 5.777 ± 0.743
3.04GlyMet: 3.04 ± 1.395
3.04GlyAsn: 3.04 ± 2.475
1.52GlyPro: 1.52 ± 0.696
2.432GlyGln: 2.432 ± 0.511
0.608GlyArg: 0.608 ± 0.357
4.257GlySer: 4.257 ± 1.005
6.081GlyThr: 6.081 ± 4.976
3.649GlyVal: 3.649 ± 0.767
0.608GlyTrp: 0.608 ± 0.405
3.344GlyTyr: 3.344 ± 0.533
0.0GlyXaa: 0.0 ± 0.0
His
1.52HisAla: 1.52 ± 0.786
0.608HisCys: 0.608 ± 0.357
0.608HisAsp: 0.608 ± 0.92
1.52HisGlu: 1.52 ± 0.831
1.216HisPhe: 1.216 ± 0.672
1.824HisGly: 1.824 ± 0.806
0.304HisHis: 0.304 ± 0.179
1.216HisIle: 1.216 ± 0.809
2.128HisLys: 2.128 ± 1.25
2.128HisLeu: 2.128 ± 0.666
0.912HisMet: 0.912 ± 0.805
0.912HisAsn: 0.912 ± 0.476
1.52HisPro: 1.52 ± 0.704
0.912HisGln: 0.912 ± 0.943
0.912HisArg: 0.912 ± 0.444
3.953HisSer: 3.953 ± 1.935
1.52HisThr: 1.52 ± 0.47
0.304HisVal: 0.304 ± 0.44
0.0HisTrp: 0.0 ± 0.0
0.912HisTyr: 0.912 ± 0.373
0.0HisXaa: 0.0 ± 0.0
Ile
4.257IleAla: 4.257 ± 1.366
2.128IleCys: 2.128 ± 0.942
3.344IleAsp: 3.344 ± 1.37
2.736IleGlu: 2.736 ± 0.99
2.128IlePhe: 2.128 ± 0.614
5.777IleGly: 5.777 ± 1.718
1.824IleHis: 1.824 ± 0.951
1.52IleIle: 1.52 ± 0.696
3.953IleLys: 3.953 ± 1.131
4.865IleLeu: 4.865 ± 1.167
1.52IleMet: 1.52 ± 0.893
1.52IleAsn: 1.52 ± 0.667
3.04IlePro: 3.04 ± 0.798
2.736IleGln: 2.736 ± 1.172
2.432IleArg: 2.432 ± 0.946
3.04IleSer: 3.04 ± 1.001
3.344IleThr: 3.344 ± 0.955
4.561IleVal: 4.561 ± 1.366
0.608IleTrp: 0.608 ± 0.541
1.52IleTyr: 1.52 ± 0.61
0.0IleXaa: 0.0 ± 0.0
Lys
3.04LysAla: 3.04 ± 1.438
1.216LysCys: 1.216 ± 0.617
3.04LysAsp: 3.04 ± 1.05
3.953LysGlu: 3.953 ± 1.962
3.344LysPhe: 3.344 ± 1.397
1.216LysGly: 1.216 ± 0.714
1.824LysHis: 1.824 ± 0.806
4.257LysIle: 4.257 ± 2.138
3.04LysLys: 3.04 ± 1.438
3.953LysLeu: 3.953 ± 1.121
2.432LysMet: 2.432 ± 0.946
3.649LysAsn: 3.649 ± 0.645
3.649LysPro: 3.649 ± 1.835
2.128LysGln: 2.128 ± 0.614
2.128LysArg: 2.128 ± 0.461
2.736LysSer: 2.736 ± 1.19
4.257LysThr: 4.257 ± 1.349
1.824LysVal: 1.824 ± 1.727
0.608LysTrp: 0.608 ± 0.357
1.52LysTyr: 1.52 ± 0.893
0.0LysXaa: 0.0 ± 0.0
Leu
6.993LeuAla: 6.993 ± 1.376
0.912LeuCys: 0.912 ± 0.535
6.081LeuAsp: 6.081 ± 1.659
3.649LeuGlu: 3.649 ± 0.565
3.04LeuPhe: 3.04 ± 1.363
6.081LeuGly: 6.081 ± 1.808
1.824LeuHis: 1.824 ± 1.214
3.953LeuIle: 3.953 ± 1.491
3.649LeuLys: 3.649 ± 0.767
4.561LeuLeu: 4.561 ± 1.501
1.824LeuMet: 1.824 ± 0.873
5.169LeuAsn: 5.169 ± 1.775
6.081LeuPro: 6.081 ± 1.582
4.257LeuGln: 4.257 ± 1.791
3.649LeuArg: 3.649 ± 1.304
5.777LeuSer: 5.777 ± 1.804
4.865LeuThr: 4.865 ± 0.998
5.777LeuVal: 5.777 ± 1.941
1.824LeuTrp: 1.824 ± 1.202
3.04LeuTyr: 3.04 ± 0.761
0.0LeuXaa: 0.0 ± 0.0
Met
3.04MetAla: 3.04 ± 2.075
1.216MetCys: 1.216 ± 0.43
1.216MetAsp: 1.216 ± 0.43
1.824MetGlu: 1.824 ± 0.662
1.216MetPhe: 1.216 ± 0.707
1.52MetGly: 1.52 ± 1.464
0.0MetHis: 0.0 ± 0.0
1.824MetIle: 1.824 ± 1.071
2.128MetLys: 2.128 ± 0.666
0.912MetLeu: 0.912 ± 0.373
0.608MetMet: 0.608 ± 0.325
2.736MetAsn: 2.736 ± 1.32
1.52MetPro: 1.52 ± 1.484
1.216MetGln: 1.216 ± 0.43
2.128MetArg: 2.128 ± 0.614
3.953MetSer: 3.953 ± 1.579
2.432MetThr: 2.432 ± 0.665
2.128MetVal: 2.128 ± 0.927
0.608MetTrp: 0.608 ± 0.405
0.912MetTyr: 0.912 ± 0.713
0.0MetXaa: 0.0 ± 0.0
Asn
2.432AsnAla: 2.432 ± 0.511
0.912AsnCys: 0.912 ± 0.67
0.608AsnAsp: 0.608 ± 0.345
2.128AsnGlu: 2.128 ± 0.748
2.432AsnPhe: 2.432 ± 1.32
2.128AsnGly: 2.128 ± 1.033
1.216AsnHis: 1.216 ± 0.43
1.824AsnIle: 1.824 ± 1.551
1.824AsnLys: 1.824 ± 0.889
2.128AsnLeu: 2.128 ± 1.002
2.736AsnMet: 2.736 ± 1.303
3.04AsnAsn: 3.04 ± 1.378
2.736AsnPro: 2.736 ± 3.022
2.432AsnGln: 2.432 ± 2.146
1.52AsnArg: 1.52 ± 0.455
4.257AsnSer: 4.257 ± 1.582
4.561AsnThr: 4.561 ± 1.252
5.169AsnVal: 5.169 ± 2.574
1.216AsnTrp: 1.216 ± 1.124
3.953AsnTyr: 3.953 ± 1.748
0.0AsnXaa: 0.0 ± 0.0
Pro
2.432ProAla: 2.432 ± 1.389
0.608ProCys: 0.608 ± 0.405
2.128ProAsp: 2.128 ± 0.833
3.649ProGlu: 3.649 ± 1.012
3.953ProPhe: 3.953 ± 0.903
2.128ProGly: 2.128 ± 1.328
0.608ProHis: 0.608 ± 0.541
3.344ProIle: 3.344 ± 1.253
3.344ProLys: 3.344 ± 0.955
3.953ProLeu: 3.953 ± 0.995
0.608ProMet: 0.608 ± 0.761
1.824ProAsn: 1.824 ± 1.149
4.257ProPro: 4.257 ± 1.632
2.432ProGln: 2.432 ± 0.711
2.432ProArg: 2.432 ± 2.083
7.297ProSer: 7.297 ± 2.937
4.865ProThr: 4.865 ± 0.957
2.736ProVal: 2.736 ± 1.096
0.912ProTrp: 0.912 ± 0.373
0.608ProTyr: 0.608 ± 0.405
0.0ProXaa: 0.0 ± 0.0
Gln
2.736GlnAla: 2.736 ± 0.698
0.0GlnCys: 0.0 ± 0.0
3.04GlnAsp: 3.04 ± 1.438
2.432GlnGlu: 2.432 ± 0.946
1.52GlnPhe: 1.52 ± 0.696
0.608GlnGly: 0.608 ± 0.357
0.304GlnHis: 0.304 ± 0.846
2.432GlnIle: 2.432 ± 0.654
3.344GlnLys: 3.344 ± 0.644
5.777GlnLeu: 5.777 ± 2.332
0.912GlnMet: 0.912 ± 1.074
2.128GlnAsn: 2.128 ± 1.558
2.128GlnPro: 2.128 ± 1.033
3.344GlnGln: 3.344 ± 2.907
2.128GlnArg: 2.128 ± 1.12
0.608GlnSer: 0.608 ± 0.357
3.953GlnThr: 3.953 ± 1.911
1.824GlnVal: 1.824 ± 0.827
0.912GlnTrp: 0.912 ± 0.444
1.216GlnTyr: 1.216 ± 0.473
0.0GlnXaa: 0.0 ± 0.0
Arg
1.824ArgAla: 1.824 ± 0.666
0.912ArgCys: 0.912 ± 0.536
3.649ArgAsp: 3.649 ± 1.083
2.128ArgGlu: 2.128 ± 0.674
1.216ArgPhe: 1.216 ± 0.543
3.649ArgGly: 3.649 ± 1.251
1.216ArgHis: 1.216 ± 0.587
3.04ArgIle: 3.04 ± 0.731
1.216ArgLys: 1.216 ± 0.714
3.649ArgLeu: 3.649 ± 1.373
2.128ArgMet: 2.128 ± 0.927
2.736ArgAsn: 2.736 ± 0.536
0.608ArgPro: 0.608 ± 1.205
1.216ArgGln: 1.216 ± 0.672
3.344ArgArg: 3.344 ± 1.296
5.169ArgSer: 5.169 ± 0.839
2.128ArgThr: 2.128 ± 0.904
2.128ArgVal: 2.128 ± 1.093
0.608ArgTrp: 0.608 ± 0.357
0.912ArgTyr: 0.912 ± 0.67
0.0ArgXaa: 0.0 ± 0.0
Ser
4.257SerAla: 4.257 ± 3.38
1.52SerCys: 1.52 ± 2.33
2.736SerAsp: 2.736 ± 1.117
3.953SerGlu: 3.953 ± 0.289
1.216SerPhe: 1.216 ± 0.543
5.169SerGly: 5.169 ± 1.733
1.216SerHis: 1.216 ± 1.113
6.689SerIle: 6.689 ± 1.902
4.257SerLys: 4.257 ± 1.129
7.905SerLeu: 7.905 ± 1.887
2.128SerMet: 2.128 ± 1.398
3.953SerAsn: 3.953 ± 1.504
4.865SerPro: 4.865 ± 1.167
3.953SerGln: 3.953 ± 2.019
2.736SerArg: 2.736 ± 0.998
5.473SerSer: 5.473 ± 2.064
6.993SerThr: 6.993 ± 3.895
3.953SerVal: 3.953 ± 1.207
0.304SerTrp: 0.304 ± 0.402
1.52SerTyr: 1.52 ± 0.704
0.0SerXaa: 0.0 ± 0.0
Thr
5.473ThrAla: 5.473 ± 1.455
1.52ThrCys: 1.52 ± 0.675
2.432ThrAsp: 2.432 ± 0.728
3.953ThrGlu: 3.953 ± 0.969
3.649ThrPhe: 3.649 ± 1.965
3.953ThrGly: 3.953 ± 1.211
0.912ThrHis: 0.912 ± 0.728
4.257ThrIle: 4.257 ± 1.236
3.344ThrLys: 3.344 ± 1.612
7.905ThrLeu: 7.905 ± 2.388
1.52ThrMet: 1.52 ± 1.504
2.736ThrAsn: 2.736 ± 1.082
5.473ThrPro: 5.473 ± 2.379
2.736ThrGln: 2.736 ± 2.144
4.561ThrArg: 4.561 ± 1.334
5.473ThrSer: 5.473 ± 3.511
4.561ThrThr: 4.561 ± 1.959
4.257ThrVal: 4.257 ± 0.94
1.52ThrTrp: 1.52 ± 0.555
2.432ThrTyr: 2.432 ± 1.101
0.0ThrXaa: 0.0 ± 0.0
Val
5.473ValAla: 5.473 ± 1.876
1.52ValCys: 1.52 ± 0.555
3.953ValAsp: 3.953 ± 1.282
3.344ValGlu: 3.344 ± 1.964
3.344ValPhe: 3.344 ± 1.37
3.649ValGly: 3.649 ± 0.757
2.128ValHis: 2.128 ± 0.891
3.649ValIle: 3.649 ± 0.601
2.128ValLys: 2.128 ± 0.927
6.385ValLeu: 6.385 ± 2.694
2.128ValMet: 2.128 ± 1.224
3.04ValAsn: 3.04 ± 0.748
3.344ValPro: 3.344 ± 1.289
1.824ValGln: 1.824 ± 1.034
1.824ValArg: 1.824 ± 0.825
3.344ValSer: 3.344 ± 1.704
3.953ValThr: 3.953 ± 0.542
6.689ValVal: 6.689 ± 1.345
1.216ValTrp: 1.216 ± 0.849
2.736ValTyr: 2.736 ± 0.984
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.304TrpCys: 0.304 ± 0.402
1.216TrpAsp: 1.216 ± 0.473
0.912TrpGlu: 0.912 ± 0.535
0.608TrpPhe: 0.608 ± 0.405
0.608TrpGly: 0.608 ± 0.576
0.304TrpHis: 0.304 ± 0.44
0.0TrpIle: 0.0 ± 0.0
2.128TrpLys: 2.128 ± 1.033
1.216TrpLeu: 1.216 ± 0.488
0.304TrpMet: 0.304 ± 0.402
0.608TrpAsn: 0.608 ± 0.761
0.608TrpPro: 0.608 ± 0.805
0.608TrpGln: 0.608 ± 0.881
0.912TrpArg: 0.912 ± 0.373
1.216TrpSer: 1.216 ± 0.783
2.736TrpThr: 2.736 ± 0.61
1.216TrpVal: 1.216 ± 0.543
0.0TrpTrp: 0.0 ± 0.0
0.304TrpTyr: 0.304 ± 0.402
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.128TyrAla: 2.128 ± 0.775
0.608TyrCys: 0.608 ± 0.357
1.52TyrAsp: 1.52 ± 0.667
2.128TyrGlu: 2.128 ± 0.976
0.608TyrPhe: 0.608 ± 0.345
2.128TyrGly: 2.128 ± 0.748
0.912TyrHis: 0.912 ± 0.476
3.344TyrIle: 3.344 ± 0.971
0.608TyrLys: 0.608 ± 0.357
2.432TyrLeu: 2.432 ± 1.235
1.824TyrMet: 1.824 ± 0.822
1.216TyrAsn: 1.216 ± 0.473
1.52TyrPro: 1.52 ± 0.929
1.216TyrGln: 1.216 ± 0.714
2.736TyrArg: 2.736 ± 0.823
3.344TyrSer: 3.344 ± 0.887
0.912TyrThr: 0.912 ± 0.536
2.128TyrVal: 2.128 ± 0.776
0.304TyrTrp: 0.304 ± 0.179
1.216TyrTyr: 1.216 ± 0.543
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3290 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski