Amino acid dipepetide frequency for Hepatitis E virus genotype 1 (isolate Human/China/HeBei/1987) (HEV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.001AlaAla: 18.001 ± 2.287
0.321AlaCys: 0.321 ± 0.207
3.536AlaAsp: 3.536 ± 0.574
4.5AlaGlu: 4.5 ± 0.882
3.214AlaPhe: 3.214 ± 1.157
9.965AlaGly: 9.965 ± 1.267
3.214AlaHis: 3.214 ± 0.325
8.036AlaIle: 8.036 ± 0.813
0.643AlaLys: 0.643 ± 0.413
7.715AlaLeu: 7.715 ± 0.08
0.643AlaMet: 0.643 ± 0.413
4.5AlaAsn: 4.5 ± 0.773
11.572AlaPro: 11.572 ± 0.529
4.5AlaGln: 4.5 ± 0.882
6.107AlaArg: 6.107 ± 0.519
5.464AlaSer: 5.464 ± 0.465
6.429AlaThr: 6.429 ± 1.505
9.965AlaVal: 9.965 ± 1.457
1.929AlaTrp: 1.929 ± 0.344
1.929AlaTyr: 1.929 ± 1.239
0.0AlaXaa: 0.0 ± 0.0
Cys
1.929CysAla: 1.929 ± 0.519
1.607CysCys: 1.607 ± 1.879
0.321CysAsp: 0.321 ± 0.207
0.964CysGlu: 0.964 ± 0.62
1.607CysPhe: 1.607 ± 0.885
0.964CysGly: 0.964 ± 0.62
1.286CysHis: 1.286 ± 0.826
0.643CysIle: 0.643 ± 0.513
0.0CysLys: 0.0 ± 0.0
0.643CysLeu: 0.643 ± 0.623
0.0CysMet: 0.0 ± 0.0
0.964CysAsn: 0.964 ± 0.62
1.929CysPro: 1.929 ± 0.519
0.321CysGln: 0.321 ± 0.207
2.893CysArg: 2.893 ± 0.952
1.929CysSer: 1.929 ± 1.042
0.643CysThr: 0.643 ± 0.413
0.643CysVal: 0.643 ± 0.413
0.0CysTrp: 0.0 ± 0.0
0.321CysTyr: 0.321 ± 0.207
0.0CysXaa: 0.0 ± 0.0
Asp
2.893AspAla: 2.893 ± 1.859
0.643AspCys: 0.643 ± 0.413
1.607AspAsp: 1.607 ± 0.163
0.964AspGlu: 0.964 ± 0.62
2.893AspPhe: 2.893 ± 0.224
4.5AspGly: 4.5 ± 0.365
1.286AspHis: 1.286 ± 0.753
1.286AspIle: 1.286 ± 0.141
0.643AspLys: 0.643 ± 0.513
3.857AspLeu: 3.857 ± 1.038
0.964AspMet: 0.964 ± 0.314
1.286AspAsn: 1.286 ± 0.141
1.286AspPro: 1.286 ± 0.826
2.572AspGln: 2.572 ± 1.138
1.929AspArg: 1.929 ± 0.628
3.536AspSer: 3.536 ± 0.574
2.893AspThr: 2.893 ± 0.942
6.75AspVal: 6.75 ± 0.814
0.643AspTrp: 0.643 ± 0.513
1.929AspTyr: 1.929 ± 0.628
0.0AspXaa: 0.0 ± 0.0
Glu
4.822GluAla: 4.822 ± 0.488
1.607GluCys: 1.607 ± 0.163
0.964GluAsp: 0.964 ± 0.314
1.607GluGlu: 1.607 ± 0.825
1.286GluPhe: 1.286 ± 0.141
1.607GluGly: 1.607 ± 1.033
1.607GluHis: 1.607 ± 0.163
1.929GluIle: 1.929 ± 0.344
1.607GluLys: 1.607 ± 1.033
7.715GluLeu: 7.715 ± 0.677
0.321GluMet: 0.321 ± 0.192
1.607GluAsn: 1.607 ± 0.825
2.25GluPro: 2.25 ± 0.544
1.286GluGln: 1.286 ± 0.141
2.25GluArg: 2.25 ± 0.544
2.893GluSer: 2.893 ± 0.952
2.572GluThr: 2.572 ± 0.747
2.572GluVal: 2.572 ± 1.652
0.0GluTrp: 0.0 ± 0.0
0.964GluTyr: 0.964 ± 0.314
0.0GluXaa: 0.0 ± 0.0
Phe
3.857PheAla: 3.857 ± 0.728
2.893PheCys: 2.893 ± 1.144
2.893PheAsp: 2.893 ± 0.952
1.929PheGlu: 1.929 ± 1.239
1.286PhePhe: 1.286 ± 0.141
0.964PheGly: 0.964 ± 0.62
1.286PheHis: 1.286 ± 0.826
1.607PheIle: 1.607 ± 0.885
0.321PheLys: 0.321 ± 0.207
1.929PheLeu: 1.929 ± 0.793
0.964PheMet: 0.964 ± 0.314
0.964PheAsn: 0.964 ± 0.314
0.964PhePro: 0.964 ± 0.62
0.964PheGln: 0.964 ± 0.314
2.572PheArg: 2.572 ± 0.747
2.572PheSer: 2.572 ± 0.747
2.572PheThr: 2.572 ± 0.747
1.607PheVal: 1.607 ± 0.163
3.214PheTrp: 3.214 ± 0.752
1.607PheTyr: 1.607 ± 0.163
0.0PheXaa: 0.0 ± 0.0
Gly
8.036GlyAla: 8.036 ± 1.643
1.929GlyCys: 1.929 ± 0.344
2.893GlyAsp: 2.893 ± 0.952
4.179GlyGlu: 4.179 ± 0.337
2.25GlyPhe: 2.25 ± 0.544
5.464GlyGly: 5.464 ± 2.375
2.572GlyHis: 2.572 ± 0.747
3.536GlyIle: 3.536 ± 0.574
2.25GlyLys: 2.25 ± 0.441
8.036GlyLeu: 8.036 ± 1.459
1.286GlyMet: 1.286 ± 0.826
1.607GlyAsn: 1.607 ± 0.163
5.143GlyPro: 5.143 ± 0.333
2.572GlyGln: 2.572 ± 0.282
6.107GlyArg: 6.107 ± 0.519
5.143GlySer: 5.143 ± 0.776
4.822GlyThr: 4.822 ± 0.711
4.179GlyVal: 4.179 ± 1.027
0.964GlyTrp: 0.964 ± 0.314
1.607GlyTyr: 1.607 ± 0.825
0.0GlyXaa: 0.0 ± 0.0
His
1.929HisAla: 1.929 ± 1.239
0.964HisCys: 0.964 ± 0.62
2.572HisAsp: 2.572 ± 0.282
0.964HisGlu: 0.964 ± 0.314
0.643HisPhe: 0.643 ± 0.413
1.286HisGly: 1.286 ± 0.141
0.0HisHis: 0.0 ± 0.0
0.964HisIle: 0.964 ± 0.314
0.0HisLys: 0.0 ± 0.0
1.607HisLeu: 1.607 ± 1.033
0.321HisMet: 0.321 ± 0.207
1.286HisAsn: 1.286 ± 0.826
1.929HisPro: 1.929 ± 0.344
1.929HisGln: 1.929 ± 1.239
3.857HisArg: 3.857 ± 0.472
1.607HisSer: 1.607 ± 0.587
1.286HisThr: 1.286 ± 0.141
1.929HisVal: 1.929 ± 0.519
0.0HisTrp: 0.0 ± 0.0
0.964HisTyr: 0.964 ± 0.314
0.0HisXaa: 0.0 ± 0.0
Ile
2.893IleAla: 2.893 ± 0.942
0.321IleCys: 0.321 ± 0.207
1.929IleAsp: 1.929 ± 0.344
1.607IleGlu: 1.607 ± 1.033
0.643IlePhe: 0.643 ± 0.623
2.893IleGly: 2.893 ± 0.952
2.25IleHis: 2.25 ± 0.441
0.964IleIle: 0.964 ± 0.62
0.321IleLys: 0.321 ± 0.207
3.214IleLeu: 3.214 ± 0.837
0.964IleMet: 0.964 ± 0.314
0.643IleAsn: 0.643 ± 0.413
3.536IlePro: 3.536 ± 0.574
1.929IleGln: 1.929 ± 0.519
2.572IleArg: 2.572 ± 0.747
5.786IleSer: 5.786 ± 1.014
3.214IleThr: 3.214 ± 1.157
1.607IleVal: 1.607 ± 1.033
0.321IleTrp: 0.321 ± 0.207
0.321IleTyr: 0.321 ± 0.207
0.0IleXaa: 0.0 ± 0.0
Lys
2.572LysAla: 2.572 ± 0.747
0.321LysCys: 0.321 ± 0.207
0.964LysAsp: 0.964 ± 0.314
0.321LysGlu: 0.321 ± 0.207
0.964LysPhe: 0.964 ± 0.62
1.607LysGly: 1.607 ± 0.163
0.321LysHis: 0.321 ± 0.207
0.0LysIle: 0.0 ± 0.0
0.321LysLys: 0.321 ± 0.207
2.572LysLeu: 2.572 ± 0.282
0.964LysMet: 0.964 ± 0.314
0.321LysAsn: 0.321 ± 0.207
0.321LysPro: 0.321 ± 0.207
0.321LysGln: 0.321 ± 0.207
0.0LysArg: 0.0 ± 0.0
0.964LysSer: 0.964 ± 0.62
1.929LysThr: 1.929 ± 0.628
3.214LysVal: 3.214 ± 0.325
0.321LysTrp: 0.321 ± 0.207
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
5.786LeuAla: 5.786 ± 0.643
2.25LeuCys: 2.25 ± 0.529
4.5LeuAsp: 4.5 ± 0.387
5.464LeuGlu: 5.464 ± 0.839
3.857LeuPhe: 3.857 ± 0.472
8.036LeuGly: 8.036 ± 1.95
1.607LeuHis: 1.607 ± 0.163
3.536LeuIle: 3.536 ± 1.083
1.286LeuLys: 1.286 ± 0.141
9.643LeuLeu: 9.643 ± 1.268
1.607LeuMet: 1.607 ± 0.371
0.0LeuAsn: 0.0 ± 0.0
7.072LeuPro: 7.072 ± 0.964
3.536LeuGln: 3.536 ± 0.574
6.429LeuArg: 6.429 ± 0.197
5.143LeuSer: 5.143 ± 1.669
8.679LeuThr: 8.679 ± 0.671
8.036LeuVal: 8.036 ± 1.643
0.964LeuTrp: 0.964 ± 0.314
6.107LeuTyr: 6.107 ± 1.229
0.0LeuXaa: 0.0 ± 0.0
Met
1.607MetAla: 1.607 ± 0.163
0.321MetCys: 0.321 ± 0.207
0.643MetAsp: 0.643 ± 0.513
0.321MetGlu: 0.321 ± 0.207
0.643MetPhe: 0.643 ± 0.302
0.321MetGly: 0.321 ± 0.654
0.0MetHis: 0.0 ± 0.0
0.321MetIle: 0.321 ± 0.207
1.607MetLys: 1.607 ± 0.825
2.25MetLeu: 2.25 ± 0.441
0.0MetMet: 0.0 ± 0.0
0.643MetAsn: 0.643 ± 0.513
0.321MetPro: 0.321 ± 0.207
0.643MetGln: 0.643 ± 0.413
0.964MetArg: 0.964 ± 0.313
0.643MetSer: 0.643 ± 0.623
0.643MetThr: 0.643 ± 0.413
0.964MetVal: 0.964 ± 0.62
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.536AsnAla: 3.536 ± 0.574
0.321AsnCys: 0.321 ± 0.207
1.286AsnAsp: 1.286 ± 0.141
0.964AsnGlu: 0.964 ± 0.62
1.286AsnPhe: 1.286 ± 0.826
1.607AsnGly: 1.607 ± 0.825
0.964AsnHis: 0.964 ± 0.62
0.0AsnIle: 0.0 ± 0.0
0.643AsnLys: 0.643 ± 0.413
4.179AsnLeu: 4.179 ± 1.065
0.321AsnMet: 0.321 ± 0.207
0.643AsnAsn: 0.643 ± 0.413
2.572AsnPro: 2.572 ± 0.612
1.607AsnGln: 1.607 ± 0.825
0.964AsnArg: 0.964 ± 0.62
1.929AsnSer: 1.929 ± 0.628
3.536AsnThr: 3.536 ± 2.361
1.929AsnVal: 1.929 ± 0.344
0.321AsnTrp: 0.321 ± 0.207
1.286AsnTyr: 1.286 ± 1.025
0.0AsnXaa: 0.0 ± 0.0
Pro
10.929ProAla: 10.929 ± 1.138
0.964ProCys: 0.964 ± 0.659
3.857ProAsp: 3.857 ± 0.472
3.214ProGlu: 3.214 ± 1.157
3.536ProPhe: 3.536 ± 0.574
8.679ProGly: 8.679 ± 2.449
2.25ProHis: 2.25 ± 0.883
1.929ProIle: 1.929 ± 0.782
0.643ProLys: 0.643 ± 0.413
8.679ProLeu: 8.679 ± 4.833
0.964ProMet: 0.964 ± 0.763
1.286ProAsn: 1.286 ± 0.141
5.143ProPro: 5.143 ± 1.67
1.929ProGln: 1.929 ± 0.519
3.857ProArg: 3.857 ± 1.219
7.393ProSer: 7.393 ± 1.433
6.75ProThr: 6.75 ± 1.549
3.536ProVal: 3.536 ± 1.796
0.321ProTrp: 0.321 ± 0.207
2.572ProTyr: 2.572 ± 1.138
0.0ProXaa: 0.0 ± 0.0
Gln
4.179GlnAla: 4.179 ± 0.886
0.321GlnCys: 0.321 ± 0.207
2.893GlnAsp: 2.893 ± 1.849
0.643GlnGlu: 0.643 ± 0.413
0.643GlnPhe: 0.643 ± 0.413
3.214GlnGly: 3.214 ± 0.325
0.643GlnHis: 0.643 ± 0.513
0.964GlnIle: 0.964 ± 0.62
0.964GlnLys: 0.964 ± 0.62
2.893GlnLeu: 2.893 ± 0.549
0.321GlnMet: 0.321 ± 0.207
0.321GlnAsn: 0.321 ± 0.207
4.179GlnPro: 4.179 ± 1.386
1.607GlnGln: 1.607 ± 0.825
3.857GlnArg: 3.857 ± 0.688
2.572GlnSer: 2.572 ± 0.705
0.964GlnThr: 0.964 ± 0.314
1.286GlnVal: 1.286 ± 0.826
0.321GlnTrp: 0.321 ± 0.207
2.25GlnTyr: 2.25 ± 1.337
0.0GlnXaa: 0.0 ± 0.0
Arg
6.429ArgAla: 6.429 ± 0.651
0.964ArgCys: 0.964 ± 0.62
2.572ArgAsp: 2.572 ± 0.747
2.25ArgGlu: 2.25 ± 0.544
1.607ArgPhe: 1.607 ± 0.163
6.107ArgGly: 6.107 ± 0.849
2.25ArgHis: 2.25 ± 0.529
1.286ArgIle: 1.286 ± 0.141
0.964ArgLys: 0.964 ± 0.62
6.429ArgLeu: 6.429 ± 1.028
0.643ArgMet: 0.643 ± 0.534
1.929ArgAsn: 1.929 ± 0.628
11.25ArgPro: 11.25 ± 3.962
2.25ArgGln: 2.25 ± 0.441
7.715ArgArg: 7.715 ± 1.849
5.143ArgSer: 5.143 ± 0.654
2.893ArgThr: 2.893 ± 1.859
6.429ArgVal: 6.429 ± 0.705
0.964ArgTrp: 0.964 ± 0.62
2.25ArgTyr: 2.25 ± 0.441
0.0ArgXaa: 0.0 ± 0.0
Ser
9.322SerAla: 9.322 ± 0.927
0.964SerCys: 0.964 ± 0.659
3.214SerAsp: 3.214 ± 0.325
2.893SerGlu: 2.893 ± 0.224
2.893SerPhe: 2.893 ± 0.224
5.464SerGly: 5.464 ± 1.463
0.643SerHis: 0.643 ± 0.413
3.857SerIle: 3.857 ± 0.423
1.929SerLys: 1.929 ± 0.344
5.786SerLeu: 5.786 ± 0.447
0.321SerMet: 0.321 ± 0.207
1.286SerAsn: 1.286 ± 0.141
5.786SerPro: 5.786 ± 2.255
1.929SerGln: 1.929 ± 0.519
7.072SerArg: 7.072 ± 1.842
3.536SerSer: 3.536 ± 1.796
8.357SerThr: 8.357 ± 3.021
5.143SerVal: 5.143 ± 2.276
0.964SerTrp: 0.964 ± 0.62
1.607SerTyr: 1.607 ± 0.163
0.0SerXaa: 0.0 ± 0.0
Thr
10.286ThrAla: 10.286 ± 1.128
1.929ThrCys: 1.929 ± 1.239
1.286ThrAsp: 1.286 ± 0.141
3.214ThrGlu: 3.214 ± 0.325
2.893ThrPhe: 2.893 ± 0.224
4.5ThrGly: 4.5 ± 1.181
1.607ThrHis: 1.607 ± 0.163
1.929ThrIle: 1.929 ± 0.344
2.25ThrLys: 2.25 ± 0.441
5.786ThrLeu: 5.786 ± 0.447
0.964ThrMet: 0.964 ± 0.314
4.822ThrAsn: 4.822 ± 2.474
5.786ThrPro: 5.786 ± 1.257
2.25ThrGln: 2.25 ± 0.544
4.179ThrArg: 4.179 ± 0.545
7.393ThrSer: 7.393 ± 3.611
8.679ThrThr: 8.679 ± 3.727
3.214ThrVal: 3.214 ± 0.325
0.643ThrTrp: 0.643 ± 0.413
3.536ThrTyr: 3.536 ± 0.497
0.0ThrXaa: 0.0 ± 0.0
Val
9.965ValAla: 9.965 ± 1.267
0.321ValCys: 0.321 ± 0.207
4.5ValAsp: 4.5 ± 0.773
3.214ValGlu: 3.214 ± 0.752
3.214ValPhe: 3.214 ± 1.774
4.5ValGly: 4.5 ± 0.387
1.607ValHis: 1.607 ± 1.033
3.536ValIle: 3.536 ± 0.497
1.286ValLys: 1.286 ± 0.141
6.75ValLeu: 6.75 ± 0.6
0.964ValMet: 0.964 ± 0.314
1.929ValAsn: 1.929 ± 0.628
3.857ValPro: 3.857 ± 0.472
1.607ValGln: 1.607 ± 0.163
3.857ValArg: 3.857 ± 0.423
6.107ValSer: 6.107 ± 0.969
5.786ValThr: 5.786 ± 0.931
7.715ValVal: 7.715 ± 1.679
0.643ValTrp: 0.643 ± 0.413
1.286ValTyr: 1.286 ± 0.141
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.321TrpAsp: 0.321 ± 0.207
1.286TrpGlu: 1.286 ± 0.141
0.964TrpPhe: 0.964 ± 0.62
1.286TrpGly: 1.286 ± 0.141
0.321TrpHis: 0.321 ± 0.207
0.643TrpIle: 0.643 ± 0.413
0.321TrpLys: 0.321 ± 0.207
2.25TrpLeu: 2.25 ± 0.544
0.0TrpMet: 0.0 ± 0.0
1.286TrpAsn: 1.286 ± 0.826
0.643TrpPro: 0.643 ± 0.513
0.0TrpGln: 0.0 ± 0.0
1.607TrpArg: 1.607 ± 0.825
0.643TrpSer: 0.643 ± 0.413
0.643TrpThr: 0.643 ± 0.513
0.0TrpVal: 0.0 ± 0.0
0.321TrpTrp: 0.321 ± 0.207
0.321TrpTyr: 0.321 ± 0.207
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.214TyrAla: 3.214 ± 0.752
0.643TyrCys: 0.643 ± 0.413
1.607TyrAsp: 1.607 ± 0.825
0.964TyrGlu: 0.964 ± 0.62
0.643TyrPhe: 0.643 ± 0.513
1.607TyrGly: 1.607 ± 0.163
0.321TyrHis: 0.321 ± 0.207
0.964TyrIle: 0.964 ± 0.314
0.321TyrLys: 0.321 ± 0.207
1.929TyrLeu: 1.929 ± 1.239
0.0TyrMet: 0.0 ± 0.0
2.572TyrAsn: 2.572 ± 1.138
2.893TyrPro: 2.893 ± 0.224
1.286TyrGln: 1.286 ± 0.826
3.536TyrArg: 3.536 ± 0.574
2.572TyrSer: 2.572 ± 1.138
3.857TyrThr: 3.857 ± 1.256
1.929TyrVal: 1.929 ± 0.344
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3112 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski