Amino acid dipepetide frequency for Hubei hepe-like virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.024AlaAla: 4.024 ± 1.48
1.341AlaCys: 1.341 ± 0.384
3.018AlaAsp: 3.018 ± 1.598
3.353AlaGlu: 3.353 ± 1.234
0.671AlaPhe: 0.671 ± 0.353
2.683AlaGly: 2.683 ± 0.767
1.341AlaHis: 1.341 ± 0.707
4.359AlaIle: 4.359 ± 1.172
2.683AlaLys: 2.683 ± 0.767
5.366AlaLeu: 5.366 ± 0.435
1.006AlaMet: 1.006 ± 0.53
2.347AlaAsn: 2.347 ± 1.422
2.012AlaPro: 2.012 ± 0.532
1.677AlaGln: 1.677 ± 0.429
3.018AlaArg: 3.018 ± 0.947
3.689AlaSer: 3.689 ± 0.678
6.372AlaThr: 6.372 ± 1.919
3.018AlaVal: 3.018 ± 2.337
1.341AlaTrp: 1.341 ± 0.707
2.347AlaTyr: 2.347 ± 0.873
0.0AlaXaa: 0.0 ± 0.0
Cys
0.335CysAla: 0.335 ± 0.177
0.0CysCys: 0.0 ± 0.0
1.006CysAsp: 1.006 ± 0.53
2.347CysGlu: 2.347 ± 1.237
0.671CysPhe: 0.671 ± 0.353
1.341CysGly: 1.341 ± 0.707
0.0CysHis: 0.0 ± 0.0
1.341CysIle: 1.341 ± 0.707
0.671CysLys: 0.671 ± 0.353
0.335CysLeu: 0.335 ± 0.177
0.671CysMet: 0.671 ± 0.353
0.335CysAsn: 0.335 ± 0.177
0.671CysPro: 0.671 ± 0.353
0.0CysGln: 0.0 ± 0.0
0.671CysArg: 0.671 ± 0.511
0.671CysSer: 0.671 ± 0.353
0.335CysThr: 0.335 ± 0.177
0.335CysVal: 0.335 ± 0.177
0.335CysTrp: 0.335 ± 0.177
1.006CysTyr: 1.006 ± 0.53
0.0CysXaa: 0.0 ± 0.0
Asp
3.689AspAla: 3.689 ± 0.618
1.006AspCys: 1.006 ± 0.53
6.036AspAsp: 6.036 ± 1.955
4.024AspGlu: 4.024 ± 0.735
2.683AspPhe: 2.683 ± 1.413
2.012AspGly: 2.012 ± 1.048
0.0AspHis: 0.0 ± 0.0
6.707AspIle: 6.707 ± 1.076
4.024AspLys: 4.024 ± 2.096
3.689AspLeu: 3.689 ± 0.618
1.006AspMet: 1.006 ± 0.53
3.353AspAsn: 3.353 ± 0.665
3.018AspPro: 3.018 ± 0.977
2.347AspGln: 2.347 ± 0.667
2.683AspArg: 2.683 ± 1.413
2.012AspSer: 2.012 ± 1.461
5.701AspThr: 5.701 ± 1.48
3.353AspVal: 3.353 ± 0.78
1.006AspTrp: 1.006 ± 0.416
2.347AspTyr: 2.347 ± 1.237
0.0AspXaa: 0.0 ± 0.0
Glu
4.695GluAla: 4.695 ± 2.473
0.335GluCys: 0.335 ± 0.177
5.03GluAsp: 5.03 ± 1.482
2.683GluGlu: 2.683 ± 0.818
1.677GluPhe: 1.677 ± 0.883
3.018GluGly: 3.018 ± 0.947
2.012GluHis: 2.012 ± 1.06
4.024GluIle: 4.024 ± 1.064
4.024GluLys: 4.024 ± 1.064
4.695GluLeu: 4.695 ± 2.473
1.341GluMet: 1.341 ± 0.707
6.036GluAsn: 6.036 ± 4.28
2.012GluPro: 2.012 ± 0.532
1.677GluGln: 1.677 ± 1.654
4.359GluArg: 4.359 ± 2.297
3.689GluSer: 3.689 ± 0.678
4.359GluThr: 4.359 ± 0.343
2.347GluVal: 2.347 ± 0.78
0.671GluTrp: 0.671 ± 0.353
1.341GluTyr: 1.341 ± 0.707
0.0GluXaa: 0.0 ± 0.0
Phe
2.012PheAla: 2.012 ± 1.461
0.671PheCys: 0.671 ± 0.353
2.683PheAsp: 2.683 ± 0.818
3.018PheGlu: 3.018 ± 1.59
2.347PhePhe: 2.347 ± 1.237
0.671PheGly: 0.671 ± 0.353
0.671PheHis: 0.671 ± 1.293
3.018PheIle: 3.018 ± 1.151
5.366PheLys: 5.366 ± 2.008
1.006PheLeu: 1.006 ± 0.53
0.0PheMet: 0.0 ± 0.0
2.683PheAsn: 2.683 ± 1.117
0.671PhePro: 0.671 ± 0.353
2.012PheGln: 2.012 ± 0.532
1.677PheArg: 1.677 ± 0.883
2.012PheSer: 2.012 ± 0.532
2.012PheThr: 2.012 ± 0.996
0.671PheVal: 0.671 ± 0.353
0.0PheTrp: 0.0 ± 0.0
1.341PheTyr: 1.341 ± 0.707
0.0PheXaa: 0.0 ± 0.0
Gly
3.353GlyAla: 3.353 ± 0.858
0.671GlyCys: 0.671 ± 0.353
3.018GlyAsp: 3.018 ± 0.977
4.024GlyGlu: 4.024 ± 1.151
2.012GlyPhe: 2.012 ± 1.06
3.689GlyGly: 3.689 ± 3.096
0.671GlyHis: 0.671 ± 0.353
2.012GlyIle: 2.012 ± 0.832
4.024GlyLys: 4.024 ± 0.735
3.353GlyLeu: 3.353 ± 0.78
1.341GlyMet: 1.341 ± 0.707
3.018GlyAsn: 3.018 ± 3.594
2.683GlyPro: 2.683 ± 0.767
2.347GlyGln: 2.347 ± 0.667
2.012GlyArg: 2.012 ± 0.832
1.677GlySer: 1.677 ± 0.914
4.024GlyThr: 4.024 ± 2.373
1.677GlyVal: 1.677 ± 0.429
0.0GlyTrp: 0.0 ± 0.0
3.689GlyTyr: 3.689 ± 3.436
0.0GlyXaa: 0.0 ± 0.0
His
0.671HisAla: 0.671 ± 0.353
0.0HisCys: 0.0 ± 0.0
2.683HisAsp: 2.683 ± 1.413
0.671HisGlu: 0.671 ± 0.353
1.006HisPhe: 1.006 ± 0.53
1.006HisGly: 1.006 ± 0.53
0.0HisHis: 0.0 ± 0.0
2.347HisIle: 2.347 ± 1.054
1.341HisLys: 1.341 ± 1.123
2.012HisLeu: 2.012 ± 1.048
1.006HisMet: 1.006 ± 0.53
1.341HisAsn: 1.341 ± 0.707
0.0HisPro: 0.0 ± 0.0
1.677HisGln: 1.677 ± 0.883
0.671HisArg: 0.671 ± 0.353
2.347HisSer: 2.347 ± 0.873
1.341HisThr: 1.341 ± 0.707
1.677HisVal: 1.677 ± 1.072
0.0HisTrp: 0.0 ± 0.0
0.671HisTyr: 0.671 ± 0.353
0.0HisXaa: 0.0 ± 0.0
Ile
5.03IleAla: 5.03 ± 1.538
1.006IleCys: 1.006 ± 0.53
6.707IleAsp: 6.707 ± 3.533
4.695IleGlu: 4.695 ± 1.334
1.341IlePhe: 1.341 ± 0.384
6.036IleGly: 6.036 ± 1.528
3.353IleHis: 3.353 ± 1.234
6.707IleIle: 6.707 ± 1.716
7.042IleLys: 7.042 ± 3.043
6.036IleLeu: 6.036 ± 0.103
1.006IleMet: 1.006 ± 0.776
4.359IleAsn: 4.359 ± 3.695
3.018IlePro: 3.018 ± 0.795
4.024IleGln: 4.024 ± 1.064
4.695IleArg: 4.695 ± 1.717
6.707IleSer: 6.707 ± 1.559
7.713IleThr: 7.713 ± 4.119
5.366IleVal: 5.366 ± 4.491
0.335IleTrp: 0.335 ± 0.641
2.347IleTyr: 2.347 ± 1.054
0.0IleXaa: 0.0 ± 0.0
Lys
3.689LysAla: 3.689 ± 0.678
1.677LysCys: 1.677 ± 0.883
2.683LysAsp: 2.683 ± 1.413
4.695LysGlu: 4.695 ± 1.823
3.018LysPhe: 3.018 ± 1.59
4.359LysGly: 4.359 ± 0.343
2.683LysHis: 2.683 ± 1.413
7.378LysIle: 7.378 ± 1.42
4.024LysLys: 4.024 ± 2.12
3.689LysLeu: 3.689 ± 1.31
2.347LysMet: 2.347 ± 1.237
6.036LysAsn: 6.036 ± 1.955
3.018LysPro: 3.018 ± 0.795
3.689LysGln: 3.689 ± 1.31
5.03LysArg: 5.03 ± 1.287
3.689LysSer: 3.689 ± 1.336
5.366LysThr: 5.366 ± 1.219
3.689LysVal: 3.689 ± 0.678
1.341LysTrp: 1.341 ± 1.123
4.359LysTyr: 4.359 ± 1.651
0.0LysXaa: 0.0 ± 0.0
Leu
3.353LeuAla: 3.353 ± 1.19
0.335LeuCys: 0.335 ± 0.177
4.695LeuAsp: 4.695 ± 1.767
5.701LeuGlu: 5.701 ± 1.371
3.018LeuPhe: 3.018 ± 1.151
2.012LeuGly: 2.012 ± 0.532
1.006LeuHis: 1.006 ± 0.53
6.372LeuIle: 6.372 ± 0.907
6.036LeuLys: 6.036 ± 2.518
2.683LeuLeu: 2.683 ± 1.326
1.677LeuMet: 1.677 ± 1.635
3.689LeuAsn: 3.689 ± 1.31
2.683LeuPro: 2.683 ± 0.767
1.677LeuGln: 1.677 ± 0.883
3.018LeuArg: 3.018 ± 1.59
3.018LeuSer: 3.018 ± 0.697
6.707LeuThr: 6.707 ± 4.662
3.353LeuVal: 3.353 ± 1.829
0.671LeuTrp: 0.671 ± 0.353
2.683LeuTyr: 2.683 ± 1.089
0.0LeuXaa: 0.0 ± 0.0
Met
0.671MetAla: 0.671 ± 0.353
0.0MetCys: 0.0 ± 0.0
1.341MetAsp: 1.341 ± 0.707
2.012MetGlu: 2.012 ± 1.06
2.012MetPhe: 2.012 ± 1.06
1.341MetGly: 1.341 ± 0.384
1.677MetHis: 1.677 ± 0.883
3.018MetIle: 3.018 ± 2.188
2.012MetLys: 2.012 ± 0.532
1.006MetLeu: 1.006 ± 0.416
1.341MetMet: 1.341 ± 0.707
0.671MetAsn: 0.671 ± 0.353
0.0MetPro: 0.0 ± 0.0
0.671MetGln: 0.671 ± 0.353
0.335MetArg: 0.335 ± 0.641
0.671MetSer: 0.671 ± 1.595
2.012MetThr: 2.012 ± 1.532
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.677MetTyr: 1.677 ± 1.654
0.0MetXaa: 0.0 ± 0.0
Asn
2.012AsnAla: 2.012 ± 0.532
1.006AsnCys: 1.006 ± 0.53
3.018AsnAsp: 3.018 ± 0.697
2.347AsnGlu: 2.347 ± 0.78
1.677AsnPhe: 1.677 ± 1.072
4.024AsnGly: 4.024 ± 1.993
1.677AsnHis: 1.677 ± 1.072
5.701AsnIle: 5.701 ± 5.839
5.366AsnLys: 5.366 ± 2.826
3.689AsnLeu: 3.689 ± 1.31
0.335AsnMet: 0.335 ± 0.177
7.713AsnAsn: 7.713 ± 7.473
2.347AsnPro: 2.347 ± 1.054
5.03AsnGln: 5.03 ± 4.963
1.677AsnArg: 1.677 ± 0.914
4.024AsnSer: 4.024 ± 1.064
4.695AsnThr: 4.695 ± 1.745
5.03AsnVal: 5.03 ± 1.538
1.341AsnTrp: 1.341 ± 0.707
5.701AsnTyr: 5.701 ± 3.49
0.0AsnXaa: 0.0 ± 0.0
Pro
2.347ProAla: 2.347 ± 1.288
0.0ProCys: 0.0 ± 0.0
3.018ProAsp: 3.018 ± 1.151
2.012ProGlu: 2.012 ± 1.06
0.671ProPhe: 0.671 ± 0.353
1.677ProGly: 1.677 ± 0.883
0.335ProHis: 0.335 ± 0.177
3.353ProIle: 3.353 ± 0.858
3.353ProLys: 3.353 ± 0.858
3.018ProLeu: 3.018 ± 3.438
1.341ProMet: 1.341 ± 1.682
2.347ProAsn: 2.347 ± 2.397
1.677ProPro: 1.677 ± 0.914
1.341ProGln: 1.341 ± 0.384
2.683ProArg: 2.683 ± 0.818
1.341ProSer: 1.341 ± 1.022
3.689ProThr: 3.689 ± 2.764
2.683ProVal: 2.683 ± 2.043
0.671ProTrp: 0.671 ± 0.353
1.341ProTyr: 1.341 ± 0.707
0.0ProXaa: 0.0 ± 0.0
Gln
3.353GlnAla: 3.353 ± 1.142
0.671GlnCys: 0.671 ± 0.353
1.677GlnAsp: 1.677 ± 0.914
2.683GlnGlu: 2.683 ± 1.326
2.347GlnPhe: 2.347 ± 0.78
2.012GlnGly: 2.012 ± 0.532
1.006GlnHis: 1.006 ± 0.53
2.683GlnIle: 2.683 ± 1.413
2.347GlnLys: 2.347 ± 0.667
1.677GlnLeu: 1.677 ± 0.883
0.671GlnMet: 0.671 ± 0.511
2.012GlnAsn: 2.012 ± 2.292
4.024GlnPro: 4.024 ± 3.379
2.347GlnGln: 2.347 ± 2.164
2.347GlnArg: 2.347 ± 2.164
4.024GlnSer: 4.024 ± 1.151
3.689GlnThr: 3.689 ± 1.993
1.341GlnVal: 1.341 ± 2.565
0.671GlnTrp: 0.671 ± 0.511
2.347GlnTyr: 2.347 ± 0.667
0.0GlnXaa: 0.0 ± 0.0
Arg
2.012ArgAla: 2.012 ± 0.532
1.006ArgCys: 1.006 ± 0.53
1.006ArgAsp: 1.006 ± 0.53
2.683ArgGlu: 2.683 ± 1.413
1.341ArgPhe: 1.341 ± 0.384
3.353ArgGly: 3.353 ± 0.858
1.341ArgHis: 1.341 ± 1.123
5.701ArgIle: 5.701 ± 1.794
3.353ArgLys: 3.353 ± 1.142
3.353ArgLeu: 3.353 ± 0.858
1.341ArgMet: 1.341 ± 0.384
5.03ArgAsn: 5.03 ± 1.482
1.677ArgPro: 1.677 ± 0.914
1.006ArgGln: 1.006 ± 0.416
5.701ArgArg: 5.701 ± 1.794
4.359ArgSer: 4.359 ± 1.651
4.695ArgThr: 4.695 ± 1.485
2.012ArgVal: 2.012 ± 1.06
0.0ArgTrp: 0.0 ± 0.0
1.006ArgTyr: 1.006 ± 0.53
0.0ArgXaa: 0.0 ± 0.0
Ser
3.353SerAla: 3.353 ± 1.19
1.341SerCys: 1.341 ± 0.707
2.012SerAsp: 2.012 ± 1.532
5.03SerGlu: 5.03 ± 0.545
1.677SerPhe: 1.677 ± 0.914
3.018SerGly: 3.018 ± 0.947
1.677SerHis: 1.677 ± 0.429
7.042SerIle: 7.042 ± 1.331
3.018SerLys: 3.018 ± 1.59
3.353SerLeu: 3.353 ± 2.061
1.341SerMet: 1.341 ± 0.384
3.689SerAsn: 3.689 ± 1.943
1.677SerPro: 1.677 ± 1.635
2.347SerGln: 2.347 ± 0.873
1.677SerArg: 1.677 ± 0.429
3.018SerSer: 3.018 ± 1.247
6.372SerThr: 6.372 ± 0.712
3.353SerVal: 3.353 ± 1.19
0.335SerTrp: 0.335 ± 0.641
3.018SerTyr: 3.018 ± 0.697
0.0SerXaa: 0.0 ± 0.0
Thr
4.359ThrAla: 4.359 ± 3.215
0.0ThrCys: 0.0 ± 0.0
4.024ThrAsp: 4.024 ± 3.369
2.683ThrGlu: 2.683 ± 0.818
2.683ThrPhe: 2.683 ± 1.089
5.701ThrGly: 5.701 ± 3.123
1.006ThrHis: 1.006 ± 1.198
7.378ThrIle: 7.378 ± 2.492
5.03ThrLys: 5.03 ± 1.076
4.695ThrLeu: 4.695 ± 1.717
2.683ThrMet: 2.683 ± 0.818
6.036ThrAsn: 6.036 ± 1.637
3.018ThrPro: 3.018 ± 3.928
4.359ThrGln: 4.359 ± 4.454
4.359ThrArg: 4.359 ± 1.651
5.366ThrSer: 5.366 ± 2.735
7.378ThrThr: 7.378 ± 4.311
6.036ThrVal: 6.036 ± 0.849
0.671ThrTrp: 0.671 ± 1.283
6.036ThrTyr: 6.036 ± 1.893
0.0ThrXaa: 0.0 ± 0.0
Val
2.683ValAla: 2.683 ± 1.326
0.671ValCys: 0.671 ± 0.353
2.683ValAsp: 2.683 ± 1.117
3.018ValGlu: 3.018 ± 2.188
2.012ValPhe: 2.012 ± 2.396
1.006ValGly: 1.006 ± 1.146
1.677ValHis: 1.677 ± 0.883
2.347ValIle: 2.347 ± 0.667
7.042ValLys: 7.042 ± 0.514
4.024ValLeu: 4.024 ± 2.373
0.671ValMet: 0.671 ± 1.283
4.024ValAsn: 4.024 ± 1.663
3.689ValPro: 3.689 ± 1.739
2.347ValGln: 2.347 ± 0.78
2.347ValArg: 2.347 ± 1.237
2.347ValSer: 2.347 ± 0.78
3.689ValThr: 3.689 ± 0.95
2.683ValVal: 2.683 ± 1.326
0.671ValTrp: 0.671 ± 0.511
1.006ValTyr: 1.006 ± 1.984
0.0ValXaa: 0.0 ± 0.0
Trp
1.006TrpAla: 1.006 ± 0.53
0.0TrpCys: 0.0 ± 0.0
1.006TrpAsp: 1.006 ± 1.146
0.0TrpGlu: 0.0 ± 0.0
0.335TrpPhe: 0.335 ± 0.641
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.341TrpIle: 1.341 ± 0.384
2.012TrpLys: 2.012 ± 1.06
0.671TrpLeu: 0.671 ± 0.511
0.671TrpMet: 0.671 ± 0.511
0.335TrpAsn: 0.335 ± 0.177
0.335TrpPro: 0.335 ± 0.177
0.0TrpGln: 0.0 ± 0.0
1.006TrpArg: 1.006 ± 0.416
1.006TrpSer: 1.006 ± 0.53
0.671TrpThr: 0.671 ± 1.293
0.335TrpVal: 0.335 ± 0.177
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.018TyrAla: 3.018 ± 2.188
1.341TyrCys: 1.341 ± 0.707
3.353TyrAsp: 3.353 ± 1.234
2.347TyrGlu: 2.347 ± 0.873
1.341TyrPhe: 1.341 ± 1.282
0.335TyrGly: 0.335 ± 0.641
0.335TyrHis: 0.335 ± 0.177
5.03TyrIle: 5.03 ± 3.215
3.689TyrLys: 3.689 ± 1.31
6.036TyrLeu: 6.036 ± 1.528
0.335TyrMet: 0.335 ± 0.177
3.353TyrAsn: 3.353 ± 0.78
0.671TyrPro: 0.671 ± 0.511
3.353TyrGln: 3.353 ± 3.709
2.012TyrArg: 2.012 ± 0.532
2.683TyrSer: 2.683 ± 0.818
2.683TyrThr: 2.683 ± 2.246
1.677TyrVal: 1.677 ± 0.429
0.671TyrTrp: 0.671 ± 0.353
2.683TyrTyr: 2.683 ± 0.818
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2983 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski