Amino acid dipepetide frequency for Hubei hepe-like virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.763AlaAla: 3.763 ± 3.478
2.151AlaCys: 2.151 ± 1.179
6.452AlaAsp: 6.452 ± 0.677
1.613AlaGlu: 1.613 ± 0.885
2.151AlaPhe: 2.151 ± 0.617
3.226AlaGly: 3.226 ± 1.12
2.151AlaHis: 2.151 ± 0.904
3.226AlaIle: 3.226 ± 1.433
7.527AlaLys: 7.527 ± 1.775
1.613AlaLeu: 1.613 ± 0.733
2.151AlaMet: 2.151 ± 1.184
2.688AlaAsn: 2.688 ± 0.891
1.075AlaPro: 1.075 ± 0.59
0.538AlaGln: 0.538 ± 0.295
3.226AlaArg: 3.226 ± 1.12
2.688AlaSer: 2.688 ± 0.856
3.763AlaThr: 3.763 ± 2.279
2.688AlaVal: 2.688 ± 0.339
0.0AlaTrp: 0.0 ± 0.0
2.151AlaTyr: 2.151 ± 0.789
0.0AlaXaa: 0.0 ± 0.0
Cys
2.151CysAla: 2.151 ± 1.687
0.538CysCys: 0.538 ± 0.295
0.538CysAsp: 0.538 ± 0.295
0.538CysGlu: 0.538 ± 0.295
2.151CysPhe: 2.151 ± 0.617
2.151CysGly: 2.151 ± 1.582
0.0CysHis: 0.0 ± 0.0
0.538CysIle: 0.538 ± 0.614
1.075CysLys: 1.075 ± 0.59
1.613CysLeu: 1.613 ± 0.885
0.538CysMet: 0.538 ± 0.295
1.613CysAsn: 1.613 ± 0.885
1.613CysPro: 1.613 ± 1.715
1.075CysGln: 1.075 ± 0.452
2.151CysArg: 2.151 ± 1.184
1.075CysSer: 1.075 ± 0.452
1.075CysThr: 1.075 ± 0.59
1.075CysVal: 1.075 ± 0.452
0.538CysTrp: 0.538 ± 0.295
1.075CysTyr: 1.075 ± 0.59
0.0CysXaa: 0.0 ± 0.0
Asp
5.376AspAla: 5.376 ± 1.361
1.613AspCys: 1.613 ± 0.733
6.452AspAsp: 6.452 ± 2.55
4.301AspGlu: 4.301 ± 0.797
4.839AspPhe: 4.839 ± 1.076
2.688AspGly: 2.688 ± 1.709
1.613AspHis: 1.613 ± 0.798
6.452AspIle: 6.452 ± 3.538
1.613AspLys: 1.613 ± 0.454
6.452AspLeu: 6.452 ± 4.747
1.075AspMet: 1.075 ± 0.59
3.226AspAsn: 3.226 ± 0.907
1.613AspPro: 1.613 ± 0.454
1.613AspGln: 1.613 ± 0.454
2.151AspArg: 2.151 ± 0.789
2.151AspSer: 2.151 ± 0.617
5.914AspThr: 5.914 ± 0.758
5.914AspVal: 5.914 ± 0.572
0.538AspTrp: 0.538 ± 0.614
1.613AspTyr: 1.613 ± 0.885
0.0AspXaa: 0.0 ± 0.0
Glu
3.226GluAla: 3.226 ± 1.769
1.613GluCys: 1.613 ± 0.885
3.226GluAsp: 3.226 ± 1.769
4.839GluGlu: 4.839 ± 1.461
2.151GluPhe: 2.151 ± 0.789
1.075GluGly: 1.075 ± 0.59
1.075GluHis: 1.075 ± 0.59
3.763GluIle: 3.763 ± 1.386
3.226GluLys: 3.226 ± 1.12
3.763GluLeu: 3.763 ± 1.386
0.538GluMet: 0.538 ± 0.295
3.763GluAsn: 3.763 ± 0.318
0.538GluPro: 0.538 ± 0.943
2.151GluGln: 2.151 ± 0.789
1.075GluArg: 1.075 ± 1.076
2.151GluSer: 2.151 ± 1.179
3.226GluThr: 3.226 ± 1.12
1.075GluVal: 1.075 ± 0.791
1.075GluTrp: 1.075 ± 1.228
4.839GluTyr: 4.839 ± 0.322
0.0GluXaa: 0.0 ± 0.0
Phe
4.301PheAla: 4.301 ± 2.359
0.0PheCys: 0.0 ± 0.0
6.989PheAsp: 6.989 ± 0.848
2.688PheGlu: 2.688 ± 1.474
1.613PhePhe: 1.613 ± 0.885
2.151PheGly: 2.151 ± 0.617
0.538PheHis: 0.538 ± 0.295
3.226PheIle: 3.226 ± 0.907
3.226PheLys: 3.226 ± 1.147
1.613PheLeu: 1.613 ± 0.733
1.075PheMet: 1.075 ± 0.389
6.452PheAsn: 6.452 ± 1.612
3.226PhePro: 3.226 ± 1.147
1.613PheGln: 1.613 ± 0.454
1.075PheArg: 1.075 ± 0.452
2.151PheSer: 2.151 ± 0.537
5.376PheThr: 5.376 ± 0.677
2.688PheVal: 2.688 ± 1.466
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.301GlyAla: 4.301 ± 1.808
2.151GlyCys: 2.151 ± 0.789
3.226GlyAsp: 3.226 ± 0.338
1.613GlyGlu: 1.613 ± 0.885
3.763GlyPhe: 3.763 ± 2.064
2.688GlyGly: 2.688 ± 0.855
1.613GlyHis: 1.613 ± 0.733
3.763GlyIle: 3.763 ± 0.318
4.301GlyLys: 4.301 ± 0.917
2.151GlyLeu: 2.151 ± 0.904
2.688GlyMet: 2.688 ± 0.339
1.613GlyAsn: 1.613 ± 0.798
1.075GlyPro: 1.075 ± 0.59
5.376GlyGln: 5.376 ± 0.604
0.538GlyArg: 0.538 ± 0.295
5.914GlySer: 5.914 ± 2.821
3.226GlyThr: 3.226 ± 2.278
2.688GlyVal: 2.688 ± 0.339
0.538GlyTrp: 0.538 ± 0.614
2.151GlyTyr: 2.151 ± 0.789
0.0GlyXaa: 0.0 ± 0.0
His
1.613HisAla: 1.613 ± 0.885
0.538HisCys: 0.538 ± 0.614
2.688HisAsp: 2.688 ± 1.496
1.075HisGlu: 1.075 ± 0.59
1.075HisPhe: 1.075 ± 0.791
2.688HisGly: 2.688 ± 0.855
0.0HisHis: 0.0 ± 0.0
2.151HisIle: 2.151 ± 1.179
1.613HisLys: 1.613 ± 0.885
2.688HisLeu: 2.688 ± 1.496
0.0HisMet: 0.0 ± 0.0
1.075HisAsn: 1.075 ± 0.452
1.075HisPro: 1.075 ± 0.791
0.0HisGln: 0.0 ± 0.0
1.075HisArg: 1.075 ± 0.452
1.075HisSer: 1.075 ± 0.59
0.538HisThr: 0.538 ± 0.295
0.538HisVal: 0.538 ± 0.614
0.538HisTrp: 0.538 ± 0.614
1.613HisTyr: 1.613 ± 0.885
0.0HisXaa: 0.0 ± 0.0
Ile
3.226IleAla: 3.226 ± 1.433
1.613IleCys: 1.613 ± 0.454
4.301IleAsp: 4.301 ± 1.68
2.688IleGlu: 2.688 ± 1.474
2.151IlePhe: 2.151 ± 1.582
6.452IleGly: 6.452 ± 1.74
2.151IleHis: 2.151 ± 1.179
3.763IleIle: 3.763 ± 2.064
4.839IleLys: 4.839 ± 1.912
4.301IleLeu: 4.301 ± 1.075
2.688IleMet: 2.688 ± 1.474
4.839IleAsn: 4.839 ± 1.912
3.226IlePro: 3.226 ± 3.127
4.301IleGln: 4.301 ± 1.808
3.226IleArg: 3.226 ± 1.275
6.452IleSer: 6.452 ± 1.776
6.989IleThr: 6.989 ± 0.98
2.688IleVal: 2.688 ± 0.939
0.538IleTrp: 0.538 ± 0.614
2.151IleTyr: 2.151 ± 0.904
0.0IleXaa: 0.0 ± 0.0
Lys
2.151LysAla: 2.151 ± 1.179
2.688LysCys: 2.688 ± 0.855
5.914LysAsp: 5.914 ± 3.243
1.613LysGlu: 1.613 ± 0.885
6.452LysPhe: 6.452 ± 2.836
4.839LysGly: 4.839 ± 1.966
3.226LysHis: 3.226 ± 0.907
5.914LysIle: 5.914 ± 0.895
3.763LysLys: 3.763 ± 1.397
8.602LysLeu: 8.602 ± 2.44
0.538LysMet: 0.538 ± 0.295
3.763LysAsn: 3.763 ± 1.397
2.151LysPro: 2.151 ± 0.904
3.763LysGln: 3.763 ± 1.923
3.763LysArg: 3.763 ± 0.537
4.301LysSer: 4.301 ± 1.63
3.763LysThr: 3.763 ± 1.043
2.151LysVal: 2.151 ± 0.617
0.0LysTrp: 0.0 ± 0.0
3.226LysTyr: 3.226 ± 1.275
0.0LysXaa: 0.0 ± 0.0
Leu
3.763LeuAla: 3.763 ± 0.318
1.613LeuCys: 1.613 ± 0.798
3.226LeuAsp: 3.226 ± 1.356
3.226LeuGlu: 3.226 ± 1.147
2.151LeuPhe: 2.151 ± 1.582
4.301LeuGly: 4.301 ± 2.053
0.0LeuHis: 0.0 ± 0.0
5.376LeuIle: 5.376 ± 0.677
8.602LeuLys: 8.602 ± 2.382
7.527LeuLeu: 7.527 ± 0.235
0.538LeuMet: 0.538 ± 0.295
8.065LeuAsn: 8.065 ± 2.046
3.226LeuPro: 3.226 ± 1.275
1.613LeuGln: 1.613 ± 0.733
2.151LeuArg: 2.151 ± 1.184
5.376LeuSer: 5.376 ± 1.361
6.989LeuThr: 6.989 ± 1.38
5.376LeuVal: 5.376 ± 5.043
1.613LeuTrp: 1.613 ± 0.885
1.613LeuTyr: 1.613 ± 0.733
0.0LeuXaa: 0.0 ± 0.0
Met
1.613MetAla: 1.613 ± 0.454
0.538MetCys: 0.538 ± 0.614
2.151MetAsp: 2.151 ± 0.537
1.613MetGlu: 1.613 ± 0.733
0.538MetPhe: 0.538 ± 0.614
1.075MetGly: 1.075 ± 0.59
0.538MetHis: 0.538 ± 0.943
2.151MetIle: 2.151 ± 0.617
1.613MetLys: 1.613 ± 0.885
1.075MetLeu: 1.075 ± 0.59
1.613MetMet: 1.613 ± 1.119
2.688MetAsn: 2.688 ± 1.477
0.538MetPro: 0.538 ± 0.295
0.538MetGln: 0.538 ± 0.295
1.075MetArg: 1.075 ± 0.59
3.226MetSer: 3.226 ± 0.907
2.688MetThr: 2.688 ± 0.939
1.075MetVal: 1.075 ± 0.452
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.151AsnAla: 2.151 ± 0.617
1.075AsnCys: 1.075 ± 1.076
4.301AsnAsp: 4.301 ± 0.917
4.301AsnGlu: 4.301 ± 2.053
2.688AsnPhe: 2.688 ± 1.474
3.226AsnGly: 3.226 ± 1.356
1.075AsnHis: 1.075 ± 0.59
6.989AsnIle: 6.989 ± 1.38
5.376AsnLys: 5.376 ± 0.57
5.914AsnLeu: 5.914 ± 0.61
2.688AsnMet: 2.688 ± 1.265
6.452AsnAsn: 6.452 ± 2.865
2.151AsnPro: 2.151 ± 1.582
2.688AsnGln: 2.688 ± 0.939
3.226AsnArg: 3.226 ± 2.074
3.763AsnSer: 3.763 ± 1.923
5.914AsnThr: 5.914 ± 1.61
3.226AsnVal: 3.226 ± 0.907
0.538AsnTrp: 0.538 ± 0.614
3.763AsnTyr: 3.763 ± 2.064
0.0AsnXaa: 0.0 ± 0.0
Pro
0.0ProAla: 0.0 ± 0.0
1.613ProCys: 1.613 ± 0.798
1.075ProAsp: 1.075 ± 0.452
1.613ProGlu: 1.613 ± 1.037
1.075ProPhe: 1.075 ± 1.076
1.075ProGly: 1.075 ± 0.59
1.613ProHis: 1.613 ± 0.798
8.065ProIle: 8.065 ± 1.016
2.688ProLys: 2.688 ± 1.474
3.763ProLeu: 3.763 ± 1.923
0.0ProMet: 0.0 ± 0.0
3.763ProAsn: 3.763 ± 0.318
1.613ProPro: 1.613 ± 0.798
0.538ProGln: 0.538 ± 0.614
0.0ProArg: 0.0 ± 0.0
2.151ProSer: 2.151 ± 1.643
1.075ProThr: 1.075 ± 1.228
1.613ProVal: 1.613 ± 0.798
0.538ProTrp: 0.538 ± 0.295
2.688ProTyr: 2.688 ± 0.891
0.0ProXaa: 0.0 ± 0.0
Gln
1.075GlnAla: 1.075 ± 0.452
0.538GlnCys: 0.538 ± 0.295
1.075GlnAsp: 1.075 ± 0.791
1.613GlnGlu: 1.613 ± 0.733
2.151GlnPhe: 2.151 ± 0.789
1.613GlnGly: 1.613 ± 0.733
1.075GlnHis: 1.075 ± 0.59
1.075GlnIle: 1.075 ± 1.228
3.226GlnLys: 3.226 ± 1.769
2.151GlnLeu: 2.151 ± 1.179
0.538GlnMet: 0.538 ± 0.295
2.688GlnAsn: 2.688 ± 0.856
2.688GlnPro: 2.688 ± 1.477
2.688GlnGln: 2.688 ± 2.253
2.688GlnArg: 2.688 ± 1.709
3.763GlnSer: 3.763 ± 1.923
2.151GlnThr: 2.151 ± 1.582
3.226GlnVal: 3.226 ± 2.278
0.0GlnTrp: 0.0 ± 0.0
2.151GlnTyr: 2.151 ± 0.537
0.0GlnXaa: 0.0 ± 0.0
Arg
1.075ArgAla: 1.075 ± 1.076
0.538ArgCys: 0.538 ± 0.295
2.151ArgAsp: 2.151 ± 0.617
2.151ArgGlu: 2.151 ± 1.179
2.151ArgPhe: 2.151 ± 1.582
2.151ArgGly: 2.151 ± 1.99
1.613ArgHis: 1.613 ± 0.885
1.613ArgIle: 1.613 ± 0.454
5.376ArgLys: 5.376 ± 2.26
4.301ArgLeu: 4.301 ± 2.224
1.613ArgMet: 1.613 ± 1.037
2.688ArgAsn: 2.688 ± 0.339
2.688ArgPro: 2.688 ± 1.477
2.151ArgGln: 2.151 ± 0.537
2.688ArgArg: 2.688 ± 0.339
2.151ArgSer: 2.151 ± 0.617
2.688ArgThr: 2.688 ± 0.891
1.613ArgVal: 1.613 ± 0.885
0.0ArgTrp: 0.0 ± 0.0
1.613ArgTyr: 1.613 ± 0.733
0.0ArgXaa: 0.0 ± 0.0
Ser
1.613SerAla: 1.613 ± 0.454
1.613SerCys: 1.613 ± 0.733
3.763SerAsp: 3.763 ± 1.292
4.301SerGlu: 4.301 ± 1.235
1.075SerPhe: 1.075 ± 0.452
4.301SerGly: 4.301 ± 2.512
1.075SerHis: 1.075 ± 0.791
4.839SerIle: 4.839 ± 2.371
5.914SerLys: 5.914 ± 1.271
3.226SerLeu: 3.226 ± 1.433
1.613SerMet: 1.613 ± 0.454
2.151SerAsn: 2.151 ± 0.904
2.688SerPro: 2.688 ± 0.856
2.151SerGln: 2.151 ± 0.904
5.376SerArg: 5.376 ± 1.127
3.763SerSer: 3.763 ± 1.043
4.301SerThr: 4.301 ± 1.075
2.151SerVal: 2.151 ± 0.904
0.538SerTrp: 0.538 ± 0.614
2.151SerTyr: 2.151 ± 1.179
0.0SerXaa: 0.0 ± 0.0
Thr
2.688ThrAla: 2.688 ± 1.496
1.075ThrCys: 1.075 ± 0.791
5.914ThrAsp: 5.914 ± 1.61
1.613ThrGlu: 1.613 ± 0.733
6.452ThrPhe: 6.452 ± 1.74
5.914ThrGly: 5.914 ± 0.758
1.613ThrHis: 1.613 ± 0.885
4.301ThrIle: 4.301 ± 0.123
4.839ThrLys: 4.839 ± 1.461
8.065ThrLeu: 8.065 ± 0.381
2.688ThrMet: 2.688 ± 1.474
5.376ThrAsn: 5.376 ± 2.932
2.151ThrPro: 2.151 ± 1.99
0.538ThrGln: 0.538 ± 0.614
2.151ThrArg: 2.151 ± 0.789
3.226ThrSer: 3.226 ± 0.907
4.301ThrThr: 4.301 ± 2.368
6.452ThrVal: 6.452 ± 1.201
1.613ThrTrp: 1.613 ± 0.733
4.839ThrTyr: 4.839 ± 1.461
0.0ThrXaa: 0.0 ± 0.0
Val
5.376ValAla: 5.376 ± 1.827
0.0ValCys: 0.0 ± 0.0
3.226ValAsp: 3.226 ± 1.466
4.301ValGlu: 4.301 ± 0.123
3.226ValPhe: 3.226 ± 0.907
1.613ValGly: 1.613 ± 0.454
0.538ValHis: 0.538 ± 0.943
3.763ValIle: 3.763 ± 1.129
0.538ValLys: 0.538 ± 0.295
3.763ValLeu: 3.763 ± 1.166
2.151ValMet: 2.151 ± 0.904
3.226ValAsn: 3.226 ± 2.217
2.151ValPro: 2.151 ± 0.617
3.226ValGln: 3.226 ± 1.596
3.226ValArg: 3.226 ± 1.356
1.613ValSer: 1.613 ± 0.733
5.914ValThr: 5.914 ± 1.14
2.151ValVal: 2.151 ± 0.789
0.0ValTrp: 0.0 ± 0.0
3.763ValTyr: 3.763 ± 2.465
0.0ValXaa: 0.0 ± 0.0
Trp
0.538TrpAla: 0.538 ± 0.295
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.538TrpPhe: 0.538 ± 0.614
0.0TrpGly: 0.0 ± 0.0
0.538TrpHis: 0.538 ± 0.295
0.0TrpIle: 0.0 ± 0.0
1.613TrpLys: 1.613 ± 0.885
0.538TrpLeu: 0.538 ± 0.614
0.0TrpMet: 0.0 ± 0.0
0.538TrpAsn: 0.538 ± 0.614
0.538TrpPro: 0.538 ± 0.614
0.0TrpGln: 0.0 ± 0.0
1.075TrpArg: 1.075 ± 0.452
1.075TrpSer: 1.075 ± 1.228
1.075TrpThr: 1.075 ± 0.452
0.538TrpVal: 0.538 ± 0.943
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.839TyrAla: 4.839 ± 0.322
1.613TyrCys: 1.613 ± 0.454
0.538TyrAsp: 0.538 ± 0.943
2.688TyrGlu: 2.688 ± 0.855
1.613TyrPhe: 1.613 ± 0.733
2.151TyrGly: 2.151 ± 1.179
1.613TyrHis: 1.613 ± 0.885
1.613TyrIle: 1.613 ± 0.733
1.613TyrLys: 1.613 ± 0.733
3.226TyrLeu: 3.226 ± 1.466
1.075TyrMet: 1.075 ± 0.59
5.376TyrAsn: 5.376 ± 0.57
0.538TyrPro: 0.538 ± 0.614
1.613TyrGln: 1.613 ± 1.715
1.075TyrArg: 1.075 ± 0.452
0.538TyrSer: 0.538 ± 0.943
4.839TyrThr: 4.839 ± 2.654
4.839TyrVal: 4.839 ± 0.704
0.0TyrTrp: 0.0 ± 0.0
2.151TyrTyr: 2.151 ± 1.179
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1861 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski