Amino acid dipeptide frequency for Beihai hepe-like virus 8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
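A minimal sketch of such a calculation is given here, under assumptions that are not stated on this page: dipeptides are counted as overlapping windows of two residues, pairs never span two proteins, and the frequency is the count divided by the total number of dipeptides in the proteome. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes of the table.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale fractions to per mille, the unit used in the table.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not the real proteome): dipeptides are MA, AA, AV and AV, VK.
freqs = dipeptide_frequencies(["MAAV", "AVK"])
```

With this toy input, "AV" accounts for two of the five dipeptides, so its frequency is 400 ‰.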

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
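The uniform expectation and the over/under marking can be sketched as follows. This is only an illustration: the page does not say which threshold or tolerance decides the colouring, so a plain comparison against the uniform expectation is assumed, and the function names are hypothetical. The three example values are taken from the table on this page.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected frequency (in per mille) if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected_permille:
        return "over"   # would be marked red
    if freq_permille < expected_permille:
        return "under"  # would be marked blue
    return "expected"


expected = uniform_expectation_permille()  # 1000 / 400 = 2.5 per mille
examples = {"AlaVal": 8.401, "AlaAla": 4.032, "CysCys": 0.0}
labels = {pair: classify(value, expected) for pair, value in examples.items()}
```

Passing `alphabet_size=21` gives the expectation for the 441-dipeptide case that includes Xaa.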

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue; second residues are listed in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.032 ± 1.629
AlaCys: 0.672 ± 0.352
AlaAsp: 3.36 ± 0.785
AlaGlu: 4.032 ± 0.613
AlaPhe: 2.016 ± 0.452
AlaGly: 3.024 ± 0.544
AlaHis: 2.016 ± 0.544
AlaIle: 5.04 ± 0.629
AlaLys: 4.368 ± 1.124
AlaLeu: 3.36 ± 0.715
AlaMet: 2.016 ± 0.615
AlaAsn: 3.024 ± 2.181
AlaPro: 3.024 ± 0.11
AlaGln: 2.016 ± 0.796
AlaArg: 3.696 ± 0.956
AlaSer: 5.04 ± 1.39
AlaThr: 4.368 ± 1.034
AlaVal: 8.401 ± 2.11
AlaTrp: 0.336 ± 0.424
AlaTyr: 1.344 ± 0.337
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.008 ± 0.272
CysCys: 0.0 ± 0.0
CysAsp: 0.672 ± 0.352
CysGlu: 0.672 ± 0.352
CysPhe: 1.008 ± 0.527
CysGly: 1.344 ± 0.337
CysHis: 0.0 ± 0.0
CysIle: 0.672 ± 0.352
CysLys: 1.344 ± 0.512
CysLeu: 0.672 ± 0.352
CysMet: 0.336 ± 0.176
CysAsn: 1.008 ± 0.272
CysPro: 1.008 ± 0.272
CysGln: 0.336 ± 0.176
CysArg: 0.336 ± 0.176
CysSer: 1.008 ± 0.51
CysThr: 1.344 ± 0.62
CysVal: 1.344 ± 0.62
CysTrp: 0.0 ± 0.0
CysTyr: 2.016 ± 0.544
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.688 ± 0.709
AspCys: 1.344 ± 0.62
AspAsp: 5.712 ± 0.782
AspGlu: 3.024 ± 0.616
AspPhe: 5.04 ± 0.629
AspGly: 4.704 ± 1.476
AspHis: 0.672 ± 0.352
AspIle: 6.048 ± 1.211
AspLys: 3.696 ± 0.439
AspLeu: 2.688 ± 0.67
AspMet: 1.344 ± 0.337
AspAsn: 3.36 ± 1.347
AspPro: 2.352 ± 1.274
AspGln: 2.016 ± 0.93
AspArg: 2.016 ± 0.615
AspSer: 5.376 ± 1.379
AspThr: 4.368 ± 2.158
AspVal: 6.384 ± 0.356
AspTrp: 0.672 ± 0.489
AspTyr: 3.696 ± 0.351
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.024 ± 0.791
GluCys: 0.672 ± 0.352
GluAsp: 4.032 ± 1.087
GluGlu: 1.344 ± 0.703
GluPhe: 1.68 ± 0.879
GluGly: 2.352 ± 0.305
GluHis: 0.336 ± 0.566
GluIle: 4.032 ± 0.732
GluLys: 2.016 ± 0.729
GluLeu: 5.04 ± 1.719
GluMet: 0.672 ± 0.352
GluAsn: 2.016 ± 1.055
GluPro: 2.016 ± 0.205
GluGln: 1.008 ± 0.527
GluArg: 1.68 ± 0.463
GluSer: 3.36 ± 0.715
GluThr: 2.688 ± 0.943
GluVal: 3.696 ± 1.085
GluTrp: 0.0 ± 0.0
GluTyr: 2.688 ± 0.12
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.36 ± 0.969
PheCys: 0.672 ± 0.352
PheAsp: 4.032 ± 0.732
PheGlu: 1.344 ± 0.62
PhePhe: 2.352 ± 0.965
PheGly: 4.368 ± 1.034
PheHis: 1.008 ± 0.722
PheIle: 4.032 ± 1.86
PheLys: 4.368 ± 0.445
PheLeu: 3.36 ± 0.423
PheMet: 0.672 ± 0.352
PheAsn: 2.352 ± 0.858
PhePro: 1.008 ± 0.272
PheGln: 1.344 ± 0.337
PheArg: 3.696 ± 0.797
PheSer: 5.04 ± 1.534
PheThr: 1.68 ± 0.626
PheVal: 3.36 ± 0.785
PheTrp: 0.672 ± 0.352
PheTyr: 2.016 ± 1.009
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.016 ± 1.055
GlyCys: 2.352 ± 0.776
GlyAsp: 3.36 ± 0.849
GlyGlu: 1.68 ± 0.879
GlyPhe: 2.688 ± 1.24
GlyGly: 1.68 ± 0.905
GlyHis: 0.672 ± 0.489
GlyIle: 3.696 ± 0.578
GlyLys: 3.024 ± 1.582
GlyLeu: 4.032 ± 0.732
GlyMet: 0.672 ± 0.352
GlyAsn: 2.688 ± 0.689
GlyPro: 2.016 ± 0.205
GlyGln: 1.344 ± 0.703
GlyArg: 3.024 ± 0.791
GlySer: 3.696 ± 1.133
GlyThr: 4.368 ± 4.096
GlyVal: 4.368 ± 0.489
GlyTrp: 0.672 ± 0.352
GlyTyr: 3.36 ± 0.849
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.016 ± 1.055
HisCys: 0.336 ± 0.176
HisAsp: 2.016 ± 1.055
HisGlu: 2.016 ± 0.544
HisPhe: 1.008 ± 0.272
HisGly: 1.008 ± 0.272
HisHis: 1.008 ± 0.722
HisIle: 0.672 ± 0.489
HisLys: 1.344 ± 0.337
HisLeu: 2.016 ± 0.452
HisMet: 0.672 ± 0.352
HisAsn: 1.344 ± 0.62
HisPro: 1.008 ± 0.272
HisGln: 0.672 ± 0.352
HisArg: 1.008 ± 0.272
HisSer: 1.344 ± 0.337
HisThr: 1.68 ± 0.879
HisVal: 2.688 ± 0.673
HisTrp: 0.0 ± 0.0
HisTyr: 1.008 ± 0.527
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.368 ± 1.144
IleCys: 0.672 ± 0.31
IleAsp: 3.36 ± 0.785
IleGlu: 3.024 ± 1.582
IlePhe: 2.352 ± 0.858
IleGly: 3.696 ± 0.439
IleHis: 2.352 ± 0.776
IleIle: 3.024 ± 0.815
IleLys: 5.04 ± 0.721
IleLeu: 5.712 ± 0.782
IleMet: 1.344 ± 0.337
IleAsn: 5.04 ± 0.393
IlePro: 2.688 ± 0.453
IleGln: 0.0 ± 0.0
IleArg: 2.352 ± 0.28
IleSer: 7.728 ± 1.194
IleThr: 4.704 ± 0.56
IleVal: 2.688 ± 0.689
IleTrp: 0.336 ± 0.424
IleTyr: 2.688 ± 0.818
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.032 ± 1.458
LysCys: 0.336 ± 0.176
LysAsp: 3.696 ± 0.797
LysGlu: 2.688 ± 0.943
LysPhe: 4.704 ± 1.354
LysGly: 2.352 ± 0.871
LysHis: 2.016 ± 0.615
LysIle: 4.704 ± 2.461
LysLys: 3.696 ± 1.074
LysLeu: 6.048 ± 0.615
LysMet: 2.016 ± 0.615
LysAsn: 4.704 ± 1.553
LysPro: 2.352 ± 0.861
LysGln: 2.352 ± 0.871
LysArg: 1.008 ± 0.527
LysSer: 3.696 ± 0.578
LysThr: 4.368 ± 1.372
LysVal: 5.376 ± 1.314
LysTrp: 1.344 ± 0.354
LysTyr: 3.024 ± 0.571
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.04 ± 0.917
LeuCys: 1.68 ± 0.463
LeuAsp: 7.056 ± 0.914
LeuGlu: 2.688 ± 1.099
LeuPhe: 3.696 ± 0.956
LeuGly: 3.36 ± 0.46
LeuHis: 1.344 ± 0.703
LeuIle: 3.024 ± 1.113
LeuLys: 5.712 ± 0.782
LeuLeu: 4.032 ± 0.906
LeuMet: 2.352 ± 0.28
LeuAsn: 3.36 ± 0.268
LeuPro: 3.024 ± 1.933
LeuGln: 2.016 ± 0.615
LeuArg: 3.696 ± 1.085
LeuSer: 6.72 ± 0.32
LeuThr: 6.048 ± 1.427
LeuVal: 5.376 ± 0.919
LeuTrp: 0.336 ± 0.176
LeuTyr: 2.352 ± 0.861
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.68 ± 0.463
MetCys: 1.008 ± 0.272
MetAsp: 1.008 ± 0.469
MetGlu: 1.68 ± 0.556
MetPhe: 1.008 ± 0.527
MetGly: 0.336 ± 0.176
MetHis: 1.008 ± 0.272
MetIle: 1.68 ± 0.879
MetLys: 1.344 ± 0.337
MetLeu: 1.344 ± 0.512
MetMet: 0.336 ± 0.424
MetAsn: 2.688 ± 0.12
MetPro: 0.672 ± 0.31
MetGln: 0.672 ± 0.352
MetArg: 0.672 ± 0.352
MetSer: 2.016 ± 0.729
MetThr: 0.672 ± 0.31
MetVal: 1.344 ± 0.337
MetTrp: 0.0 ± 0.0
MetTyr: 2.016 ± 0.544
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.696 ± 1.026
AsnCys: 1.344 ± 0.62
AsnAsp: 4.032 ± 1.412
AsnGlu: 1.008 ± 0.51
AsnPhe: 3.024 ± 0.791
AsnGly: 3.024 ± 0.544
AsnHis: 1.008 ± 0.272
AsnIle: 3.696 ± 0.351
AsnLys: 3.36 ± 0.785
AsnLeu: 2.016 ± 0.205
AsnMet: 1.344 ± 0.354
AsnAsn: 3.696 ± 0.578
AsnPro: 3.696 ± 1.393
AsnGln: 2.016 ± 1.055
AsnArg: 2.352 ± 1.231
AsnSer: 4.704 ± 2.098
AsnThr: 4.704 ± 1.084
AsnVal: 6.384 ± 0.661
AsnTrp: 1.344 ± 0.703
AsnTyr: 2.016 ± 0.452
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.352 ± 0.861
ProCys: 1.344 ± 0.801
ProAsp: 2.352 ± 0.28
ProGlu: 1.344 ± 0.337
ProPhe: 3.36 ± 2.101
ProGly: 1.344 ± 0.62
ProHis: 0.672 ± 0.352
ProIle: 2.688 ± 0.943
ProLys: 2.352 ± 0.871
ProLeu: 4.032 ± 0.36
ProMet: 0.672 ± 0.352
ProAsn: 2.352 ± 1.865
ProPro: 1.344 ± 0.512
ProGln: 1.344 ± 0.512
ProArg: 2.688 ± 2.744
ProSer: 4.704 ± 1.475
ProThr: 3.36 ± 1.756
ProVal: 2.352 ± 0.305
ProTrp: 0.672 ± 0.31
ProTyr: 1.344 ± 1.144
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.352 ± 0.846
GlnCys: 0.0 ± 0.0
GlnAsp: 2.352 ± 0.861
GlnGlu: 1.344 ± 0.703
GlnPhe: 1.68 ± 0.905
GlnGly: 2.688 ± 0.673
GlnHis: 1.008 ± 0.272
GlnIle: 1.008 ± 0.272
GlnLys: 1.68 ± 0.23
GlnLeu: 1.68 ± 0.463
GlnMet: 0.672 ± 0.489
GlnAsn: 2.016 ± 0.615
GlnPro: 1.344 ± 0.801
GlnGln: 0.336 ± 0.176
GlnArg: 0.672 ± 0.31
GlnSer: 2.352 ± 1.231
GlnThr: 2.352 ± 0.714
GlnVal: 1.68 ± 0.605
GlnTrp: 0.336 ± 0.566
GlnTyr: 1.008 ± 0.527
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.024 ± 1.183
ArgCys: 1.008 ± 0.272
ArgAsp: 2.352 ± 1.231
ArgGlu: 1.344 ± 0.337
ArgPhe: 2.688 ± 1.099
ArgGly: 1.008 ± 0.272
ArgHis: 0.672 ± 0.352
ArgIle: 2.016 ± 1.009
ArgLys: 4.032 ± 1.458
ArgLeu: 4.704 ± 0.862
ArgMet: 1.344 ± 0.337
ArgAsn: 3.024 ± 0.571
ArgPro: 2.688 ± 0.12
ArgGln: 0.672 ± 0.676
ArgArg: 2.688 ± 0.818
ArgSer: 3.36 ± 1.796
ArgThr: 2.352 ± 0.776
ArgVal: 3.696 ± 0.439
ArgTrp: 0.672 ± 0.352
ArgTyr: 2.688 ± 0.943
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 7.056 ± 1.261
SerCys: 0.672 ± 0.352
SerAsp: 5.04 ± 1.358
SerGlu: 5.04 ± 0.752
SerPhe: 4.032 ± 1.138
SerGly: 5.376 ± 1.916
SerHis: 3.696 ± 1.456
SerIle: 5.376 ± 3.204
SerLys: 4.368 ± 0.788
SerLeu: 5.712 ± 1.275
SerMet: 1.68 ± 0.879
SerAsn: 3.36 ± 1.37
SerPro: 2.352 ± 1.526
SerGln: 1.68 ± 0.23
SerArg: 4.704 ± 0.129
SerSer: 7.056 ± 1.809
SerThr: 3.36 ± 0.751
SerVal: 6.048 ± 1.107
SerTrp: 0.336 ± 0.176
SerTyr: 2.016 ± 0.544
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.704 ± 0.609
ThrCys: 0.0 ± 0.0
ThrAsp: 3.024 ± 2.535
ThrGlu: 1.008 ± 0.527
ThrPhe: 1.68 ± 0.463
ThrGly: 2.016 ± 1.467
ThrHis: 1.008 ± 0.272
ThrIle: 6.048 ± 1.233
ThrLys: 5.376 ± 0.618
ThrLeu: 5.712 ± 3.162
ThrMet: 2.016 ± 0.536
ThrAsn: 3.696 ± 0.398
ThrPro: 3.36 ± 0.715
ThrGln: 5.376 ± 2.198
ThrArg: 2.352 ± 0.305
ThrSer: 3.36 ± 1.809
ThrThr: 5.376 ± 2.198
ThrVal: 8.065 ± 3.09
ThrTrp: 0.336 ± 0.176
ThrTyr: 1.68 ± 0.463
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.712 ± 1.043
ValCys: 1.008 ± 0.527
ValAsp: 6.048 ± 1.427
ValGlu: 5.04 ± 1.139
ValPhe: 3.36 ± 1.533
ValGly: 4.368 ± 0.788
ValHis: 4.032 ± 1.01
ValIle: 4.704 ± 0.609
ValLys: 4.368 ± 0.445
ValLeu: 5.04 ± 0.185
ValMet: 1.344 ± 0.34
ValAsn: 5.04 ± 1.878
ValPro: 3.36 ± 0.715
ValGln: 2.352 ± 0.305
ValArg: 3.36 ± 0.969
ValSer: 5.376 ± 1.916
ValThr: 6.72 ± 3.618
ValVal: 8.065 ± 3.284
ValTrp: 0.672 ± 0.849
ValTyr: 4.032 ± 0.906
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.336 ± 0.424
TrpCys: 0.672 ± 0.489
TrpAsp: 0.336 ± 0.424
TrpGlu: 0.672 ± 0.352
TrpPhe: 1.008 ± 0.272
TrpGly: 0.0 ± 0.0
TrpHis: 0.336 ± 0.176
TrpIle: 0.0 ± 0.0
TrpLys: 0.336 ± 0.566
TrpLeu: 2.016 ± 0.615
TrpMet: 0.336 ± 0.424
TrpAsn: 0.672 ± 0.489
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.008 ± 0.527
TrpSer: 0.672 ± 0.352
TrpThr: 0.336 ± 0.176
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.336 ± 0.424
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.688 ± 0.453
TyrCys: 0.336 ± 0.176
TyrAsp: 3.36 ± 1.758
TyrGlu: 3.36 ± 0.849
TyrPhe: 2.352 ± 0.846
TyrGly: 3.36 ± 0.849
TyrHis: 0.0 ± 0.0
TyrIle: 1.008 ± 0.272
TyrLys: 2.688 ± 0.673
TyrLeu: 3.696 ± 1.472
TyrMet: 1.008 ± 0.196
TyrAsn: 3.024 ± 0.791
TyrPro: 3.36 ± 2.055
TyrGln: 1.008 ± 0.976
TyrArg: 3.024 ± 1.252
TyrSer: 2.352 ± 0.305
TyrThr: 1.344 ± 0.337
TyrVal: 2.688 ± 0.67
TyrTrp: 0.336 ± 0.176
TyrTyr: 1.344 ± 0.801
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 3 proteins (2977 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
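One way such an error could be obtained is sketched below. It assumes that whole proteins are resampled with replacement, that the dipeptide frequency is recomputed for every resample, and that the spread is reported as a standard deviation; none of these details are specified on this page. The function names, the fixed seed and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter(
        seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome of three short sequences (not the real data).
toy = ["MAAVK", "AVAVL", "KKLAA"]
error_av = bootstrap_error(toy, "AV")
```

With only three proteins, as in this proteome, each resample contains few distinct proteins, which may explain why some errors in the table are as large as the frequencies themselves.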

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski