Amino acid dipeptide frequency for Hubei picorna-like virus 71

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would occur with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
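The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the database: the function names, the one-letter residue codes and the toy sequences are assumptions, and so is the choice to count pairs within each protein only (the page does not say how protein boundaries are handled).

```python
from collections import Counter

# Uniform expectation: 1/400 of all dipeptides, expressed in per mille.
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted inside each sequence only, never across
    the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def classify(freq_permille):
    """Compare a frequency (in per mille) with the uniform expectation."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "as expected"

# Hypothetical toy input, one-letter codes (A = Ala, C = Cys).
freqs = dipeptide_permille(["ACAC", "AA"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

Applied to values from the table, `classify(8.593)` (ThrLeu) reports an overrepresented dipeptide and `classify(0.269)` (TrpTrp) an underrepresented one.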

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error). Rows: first residue; columns: second residue.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.565 ± 0.829 | 0.806 ± 0.414 | 2.954 ± 0.365 | 4.834 ± 0.502 | 3.222 ± 0.807 | 3.491 ± 2.252 | 1.611 ± 0.837 | 4.028 ± 1.486 | 4.028 ± 1.021 | 4.565 ± 0.801 | 1.88 ± 0.263 | 3.491 ± 1.652 | 3.491 ± 1.469 | 2.954 ± 1.264 | 3.491 ± 0.272 | 4.565 ± 0.448 | 5.639 ± 0.631 | 4.565 ± 1.095 | 0.537 ± 0.261 | 2.685 ± 0.355 | 0.0 ± 0.0
Cys | 0.806 ± 0.304 | 0.269 ± 0.13 | 1.343 ± 0.651 | 0.806 ± 0.391 | 0.537 ± 0.261 | 1.611 ± 0.539 | 1.074 ± 0.353 | 1.88 ± 0.651 | 1.074 ± 0.567 | 2.148 ± 0.998 | 0.269 ± 0.13 | 0.537 ± 0.196 | 0.806 ± 0.688 | 1.074 ± 0.377 | 0.537 ± 0.261 | 1.343 ± 0.436 | 0.806 ± 0.435 | 2.148 ± 0.705 | 0.0 ± 0.0 | 1.074 ± 0.752 | 0.0 ± 0.0
Asp | 2.417 ± 0.541 | 0.269 ± 0.13 | 2.148 ± 0.581 | 4.565 ± 1.254 | 3.491 ± 0.623 | 1.343 ± 0.538 | 1.343 ± 0.467 | 4.834 ± 0.909 | 2.954 ± 0.365 | 3.759 ± 1.049 | 1.074 ± 0.38 | 2.954 ± 0.562 | 4.296 ± 0.474 | 2.148 ± 0.433 | 3.491 ± 0.858 | 2.417 ± 0.646 | 2.954 ± 0.455 | 3.222 ± 0.711 | 1.074 ± 0.521 | 2.685 ± 0.376 | 0.0 ± 0.0
Glu | 4.296 ± 0.957 | 0.806 ± 0.304 | 4.028 ± 1.308 | 4.565 ± 1.533 | 3.222 ± 0.375 | 4.028 ± 1.021 | 1.343 ± 0.436 | 3.759 ± 0.398 | 3.222 ± 1.563 | 4.834 ± 0.907 | 0.806 ± 0.291 | 3.222 ± 0.462 | 2.954 ± 1.173 | 2.148 ± 1.162 | 3.491 ± 1.024 | 3.759 ± 1.446 | 2.954 ± 0.886 | 2.417 ± 0.223 | 1.611 ± 0.231 | 2.685 ± 0.408 | 0.0 ± 0.0
Phe | 2.954 ± 1.264 | 0.537 ± 0.196 | 1.88 ± 0.524 | 3.222 ± 0.462 | 1.074 ± 0.177 | 1.88 ± 0.499 | 1.343 ± 0.28 | 1.88 ± 0.723 | 4.296 ± 0.957 | 2.685 ± 0.273 | 0.537 ± 0.261 | 3.222 ± 1.174 | 1.611 ± 0.4 | 1.88 ± 0.284 | 2.148 ± 0.814 | 2.954 ± 0.678 | 3.222 ± 0.286 | 3.491 ± 0.858 | 0.269 ± 0.13 | 1.611 ± 0.828 | 0.0 ± 0.0
Gly | 4.565 ± 2.039 | 2.148 ± 0.705 | 2.417 ± 0.4 | 2.417 ± 0.223 | 3.222 ± 0.476 | 2.417 ± 1.078 | 1.611 ± 0.539 | 2.685 ± 0.56 | 3.759 ± 1.824 | 2.685 ± 1.586 | 1.611 ± 0.4 | 2.685 ± 0.551 | 2.148 ± 0.602 | 0.806 ± 0.133 | 2.954 ± 0.825 | 4.296 ± 0.632 | 4.834 ± 0.933 | 3.491 ± 1.393 | 1.074 ± 0.391 | 3.491 ± 0.858 | 0.0 ± 0.0
His | 2.417 ± 0.305 | 0.537 ± 0.307 | 1.343 ± 0.436 | 2.417 ± 0.891 | 1.611 ± 0.869 | 2.417 ± 0.891 | 1.611 ± 1.314 | 1.611 ± 0.423 | 2.148 ± 0.651 | 1.611 ± 0.837 | 0.537 ± 0.194 | 1.074 ± 0.313 | 1.88 ± 0.343 | 0.806 ± 0.133 | 0.537 ± 0.307 | 0.806 ± 0.657 | 2.148 ± 0.808 | 1.611 ± 0.539 | 0.269 ± 0.13 | 0.806 ± 0.414 | 0.0 ± 0.0
Ile | 2.685 ± 0.56 | 1.611 ± 0.589 | 4.296 ± 0.454 | 4.028 ± 1.469 | 1.611 ± 0.231 | 2.954 ± 0.647 | 1.343 ± 0.651 | 3.222 ± 0.542 | 2.148 ± 0.353 | 3.491 ± 1.426 | 2.685 ± 0.426 | 2.685 ± 0.656 | 3.759 ± 0.468 | 3.222 ± 0.83 | 1.88 ± 0.319 | 5.908 ± 1.057 | 4.296 ± 0.46 | 1.88 ± 0.343 | 0.806 ± 0.391 | 2.148 ± 0.651 | 0.0 ± 0.0
Lys | 3.491 ± 1.024 | 0.537 ± 0.261 | 5.102 ± 0.729 | 4.565 ± 1.095 | 3.222 ± 0.846 | 2.417 ± 0.292 | 0.537 ± 0.528 | 6.713 ± 1.816 | 3.491 ± 1.296 | 6.982 ± 0.964 | 1.074 ± 0.313 | 3.491 ± 0.594 | 1.343 ± 0.28 | 2.685 ± 0.908 | 1.074 ± 0.521 | 3.759 ± 1.049 | 4.296 ± 1.405 | 4.565 ± 1.43 | 0.269 ± 0.13 | 3.222 ± 0.9 | 0.0 ± 0.0
Leu | 5.371 ± 0.699 | 2.954 ± 1.071 | 3.759 ± 0.728 | 3.759 ± 1.205 | 3.222 ± 0.476 | 5.102 ± 0.466 | 2.685 ± 0.273 | 2.417 ± 0.891 | 5.371 ± 1.152 | 4.834 ± 0.902 | 1.611 ± 0.782 | 4.565 ± 0.779 | 3.491 ± 1.22 | 4.028 ± 1.806 | 3.491 ± 0.744 | 6.445 ± 1.083 | 5.371 ± 0.546 | 4.028 ± 0.387 | 0.806 ± 0.133 | 3.759 ± 0.607 | 0.0 ± 0.0
Met | 2.685 ± 0.273 | 0.806 ± 0.657 | 1.074 ± 0.177 | 2.148 ± 0.651 | 0.806 ± 0.495 | 0.537 ± 0.261 | 0.537 ± 0.261 | 0.537 ± 0.721 | 2.417 ± 0.599 | 2.148 ± 0.602 | 0.537 ± 0.313 | 1.611 ± 1.174 | 2.685 ± 1.167 | 1.88 ± 0.263 | 1.611 ± 0.4 | 1.611 ± 0.4 | 1.611 ± 0.571 | 1.611 ± 0.284 | 0.0 ± 0.0 | 1.88 ± 0.645 | 0.0 ± 0.0
Asn | 2.685 ± 0.581 | 1.611 ± 0.612 | 2.685 ± 1.077 | 1.88 ± 1.058 | 2.148 ± 0.433 | 3.491 ± 0.623 | 1.88 ± 0.454 | 3.222 ± 0.846 | 3.222 ± 0.799 | 2.954 ± 0.365 | 1.611 ± 0.4 | 1.611 ± 0.587 | 3.759 ± 0.483 | 2.417 ± 1.486 | 2.685 ± 0.551 | 3.491 ± 0.798 | 5.371 ± 1.45 | 2.954 ± 0.454 | 1.343 ± 0.651 | 3.759 ± 2.174 | 0.0 ± 0.0
Pro | 2.954 ± 0.562 | 0.806 ± 0.391 | 2.148 ± 0.433 | 2.685 ± 0.273 | 1.88 ± 0.94 | 3.491 ± 0.594 | 1.074 ± 0.391 | 2.417 ± 0.523 | 2.148 ± 0.353 | 6.176 ± 1.664 | 0.806 ± 0.841 | 4.565 ± 0.684 | 1.88 ± 0.883 | 3.222 ± 0.96 | 1.88 ± 0.743 | 4.296 ± 1.922 | 4.028 ± 0.378 | 2.417 ± 0.779 | 0.806 ± 0.304 | 1.074 ± 0.177 | 0.0 ± 0.0
Gln | 2.954 ± 1.102 | 0.537 ± 0.261 | 2.417 ± 0.265 | 3.222 ± 0.9 | 2.148 ± 0.58 | 1.074 ± 0.567 | 2.685 ± 0.178 | 2.148 ± 0.416 | 1.88 ± 0.543 | 1.343 ± 0.467 | 2.148 ± 0.416 | 2.954 ± 1.679 | 2.148 ± 0.988 | 4.028 ± 1.295 | 2.417 ± 0.506 | 2.685 ± 0.759 | 3.222 ± 0.548 | 3.759 ± 0.482 | 0.537 ± 0.71 | 1.611 ± 0.4 | 0.0 ± 0.0
Arg | 3.222 ± 1.36 | 0.806 ± 0.414 | 2.954 ± 0.971 | 2.954 ± 0.562 | 1.611 ± 0.817 | 4.565 ± 0.873 | 1.343 ± 0.597 | 2.417 ± 0.541 | 1.611 ± 0.284 | 3.759 ± 0.338 | 3.222 ± 0.833 | 2.954 ± 0.108 | 1.343 ± 0.28 | 2.148 ± 0.434 | 3.222 ± 0.572 | 1.611 ± 0.828 | 3.222 ± 0.906 | 1.074 ± 0.353 | 0.537 ± 0.261 | 0.806 ± 0.391 | 0.0 ± 0.0
Ser | 5.102 ± 0.192 | 1.074 ± 0.353 | 3.759 ± 0.753 | 3.491 ± 0.616 | 2.954 ± 0.731 | 3.222 ± 0.938 | 1.611 ± 0.539 | 4.028 ± 0.129 | 4.296 ± 0.957 | 4.296 ± 0.474 | 2.685 ± 0.755 | 3.222 ± 0.715 | 4.296 ± 1.34 | 2.954 ± 1.264 | 2.417 ± 1.039 | 3.759 ± 0.91 | 4.565 ± 1.88 | 4.028 ± 0.575 | 1.074 ± 0.391 | 2.954 ± 0.737 | 0.0 ± 0.0
Thr | 4.834 ± 1.9 | 1.074 ± 0.353 | 3.491 ± 0.982 | 2.954 ± 0.365 | 1.88 ± 0.684 | 3.759 ± 0.482 | 1.611 ± 0.749 | 4.565 ± 0.356 | 4.296 ± 0.5 | 8.593 ± 1.07 | 1.343 ± 0.673 | 2.148 ± 0.808 | 4.028 ± 0.749 | 3.491 ± 0.86 | 2.685 ± 0.563 | 5.102 ± 0.995 | 4.834 ± 1.832 | 4.296 ± 1.325 | 0.806 ± 0.495 | 3.222 ± 0.462 | 0.0 ± 0.0
Val | 4.834 ± 0.632 | 1.611 ± 0.346 | 3.222 ± 0.476 | 1.88 ± 0.684 | 2.954 ± 0.647 | 4.028 ± 1.555 | 0.537 ± 0.196 | 2.148 ± 0.353 | 4.296 ± 0.957 | 5.639 ± 1.558 | 1.343 ± 0.689 | 3.491 ± 0.807 | 3.491 ± 0.572 | 2.417 ± 0.45 | 2.148 ± 0.754 | 4.296 ± 0.531 | 2.417 ± 0.265 | 2.417 ± 0.45 | 0.806 ± 0.133 | 3.222 ± 1.166 | 0.0 ± 0.0
Trp | 1.074 ± 0.799 | 0.537 ± 0.261 | 0.537 ± 0.196 | 0.269 ± 0.13 | 0.537 ± 0.261 | 1.074 ± 0.177 | 1.074 ± 0.377 | 0.269 ± 0.355 | 1.343 ± 0.436 | 1.88 ± 0.524 | 0.0 ± 0.0 | 1.88 ± 0.912 | 0.269 ± 0.13 | 0.537 ± 0.261 | 0.269 ± 0.13 | 0.269 ± 0.305 | 0.537 ± 0.609 | 0.537 ± 0.261 | 0.269 ± 0.13 | 0.269 ± 0.13 | 0.0 ± 0.0
Tyr | 3.491 ± 0.538 | 0.806 ± 0.304 | 1.88 ± 0.524 | 3.491 ± 0.982 | 1.343 ± 0.538 | 2.417 ± 0.471 | 1.343 ± 0.538 | 2.148 ± 1.042 | 4.834 ± 1.373 | 2.685 ± 0.936 | 2.685 ± 0.777 | 2.417 ± 0.693 | 1.074 ± 0.353 | 1.074 ± 0.614 | 2.954 ± 0.307 | 2.417 ± 0.523 | 2.685 ± 0.551 | 2.685 ± 0.56 | 0.537 ± 0.261 | 2.148 ± 0.434 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 4 proteins (3725 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
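A protein-level bootstrap of this kind could look roughly like the sketch below. It is not the database's own code: the function names, the one-letter residue codes, the fixed seed and the use of the sample standard deviation as the "error" are all assumptions made for illustration. Whole proteins are resampled with replacement, and the dipeptide frequency is recomputed for each resample.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_permille(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, spread) of a dipeptide frequency in per mille.

    Assumptions: proteins are the resampling unit, and the spread is the
    sample standard deviation over the resampled estimates.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Hypothetical toy proteome of four short proteins.
mean, err = bootstrap_permille(["ACACAA", "CCAC", "AACC", "ACCA"], "AC")
print(f"AC: {mean:.3f} ± {err:.3f} per mille")
```

With only 4 proteins, as in this proteome, each resample contains few distinct proteins, so the estimated error can be large relative to the value itself.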

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski