Amino acid dipepetide frequency for Bat Middle East Hepe-Astrovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.906AlaAla: 12.906 ± 4.362
2.082AlaCys: 2.082 ± 0.861
5.412AlaAsp: 5.412 ± 1.407
4.996AlaGlu: 4.996 ± 1.51
4.163AlaPhe: 4.163 ± 1.486
6.245AlaGly: 6.245 ± 1.908
4.163AlaHis: 4.163 ± 0.289
4.163AlaIle: 4.163 ± 2.426
2.082AlaLys: 2.082 ± 0.819
9.992AlaLeu: 9.992 ± 3.806
1.249AlaMet: 1.249 ± 0.462
3.747AlaAsn: 3.747 ± 1.385
3.747AlaPro: 3.747 ± 2.886
5.828AlaGln: 5.828 ± 2.505
8.326AlaArg: 8.326 ± 0.758
4.163AlaSer: 4.163 ± 2.789
7.077AlaThr: 7.077 ± 3.324
6.661AlaVal: 6.661 ± 2.1
2.498AlaTrp: 2.498 ± 1.15
2.082AlaTyr: 2.082 ± 0.849
0.0AlaXaa: 0.0 ± 0.0
Cys
1.665CysAla: 1.665 ± 0.076
0.0CysCys: 0.0 ± 0.0
0.833CysAsp: 0.833 ± 0.51
0.833CysGlu: 0.833 ± 0.51
0.416CysPhe: 0.416 ± 0.537
1.665CysGly: 1.665 ± 1.49
0.416CysHis: 0.416 ± 0.537
1.249CysIle: 1.249 ± 0.505
0.416CysLys: 0.416 ± 0.255
1.665CysLeu: 1.665 ± 0.076
0.416CysMet: 0.416 ± 0.255
2.082CysAsn: 2.082 ± 0.861
1.665CysPro: 1.665 ± 1.49
0.0CysGln: 0.0 ± 0.0
1.665CysArg: 1.665 ± 0.634
2.498CysSer: 2.498 ± 2.535
1.665CysThr: 1.665 ± 1.49
2.498CysVal: 2.498 ± 1.086
0.0CysTrp: 0.0 ± 0.0
0.833CysTyr: 0.833 ± 0.51
0.416CysXaa: 0.416 ± 0.537
Asp
7.494AspAla: 7.494 ± 2.904
0.0AspCys: 0.0 ± 0.0
2.082AspAsp: 2.082 ± 1.275
4.163AspGlu: 4.163 ± 1.019
1.665AspPhe: 1.665 ± 0.634
2.498AspGly: 2.498 ± 0.926
0.416AspHis: 0.416 ± 0.255
2.082AspIle: 2.082 ± 0.849
0.833AspLys: 0.833 ± 0.393
5.412AspLeu: 5.412 ± 1.76
0.0AspMet: 0.0 ± 0.0
1.249AspAsn: 1.249 ± 0.462
3.331AspPro: 3.331 ± 0.977
1.665AspGln: 1.665 ± 0.076
1.665AspArg: 1.665 ± 1.02
2.498AspSer: 2.498 ± 0.469
2.498AspThr: 2.498 ± 1.666
5.828AspVal: 5.828 ± 2.186
2.082AspTrp: 2.082 ± 1.275
0.416AspTyr: 0.416 ± 0.474
0.0AspXaa: 0.0 ± 0.0
Glu
5.412GluAla: 5.412 ± 2.403
0.833GluCys: 0.833 ± 0.51
2.082GluAsp: 2.082 ± 1.275
2.498GluGlu: 2.498 ± 1.53
2.082GluPhe: 2.082 ± 0.849
3.331GluGly: 3.331 ± 1.568
0.833GluHis: 0.833 ± 0.51
0.833GluIle: 0.833 ± 0.455
1.665GluLys: 1.665 ± 0.634
5.412GluLeu: 5.412 ± 0.717
0.416GluMet: 0.416 ± 0.474
0.416GluAsn: 0.416 ± 0.255
3.331GluPro: 3.331 ± 1.269
2.914GluGln: 2.914 ± 1.323
2.914GluArg: 2.914 ± 0.365
1.665GluSer: 1.665 ± 0.785
1.665GluThr: 1.665 ± 0.781
1.249GluVal: 1.249 ± 0.462
1.249GluTrp: 1.249 ± 0.462
0.416GluTyr: 0.416 ± 0.474
0.0GluXaa: 0.0 ± 0.0
Phe
2.498PheAla: 2.498 ± 0.469
1.249PheCys: 1.249 ± 0.306
2.498PheAsp: 2.498 ± 0.924
1.665PheGlu: 1.665 ± 1.02
0.833PhePhe: 0.833 ± 0.393
2.914PheGly: 2.914 ± 0.387
0.833PheHis: 0.833 ± 0.558
0.416PheIle: 0.416 ± 0.255
1.665PheLys: 1.665 ± 0.785
2.498PheLeu: 2.498 ± 1.178
0.416PheMet: 0.416 ± 0.255
0.833PheAsn: 0.833 ± 0.393
1.665PhePro: 1.665 ± 0.658
1.249PheGln: 1.249 ± 0.962
2.082PheArg: 2.082 ± 0.219
1.665PheSer: 1.665 ± 0.785
3.331PheThr: 3.331 ± 1.571
1.249PheVal: 1.249 ± 0.462
0.416PheTrp: 0.416 ± 0.255
1.249PheTyr: 1.249 ± 0.462
0.0PheXaa: 0.0 ± 0.0
Gly
6.661GlyAla: 6.661 ± 2.297
1.249GlyCys: 1.249 ± 0.962
5.412GlyAsp: 5.412 ± 2.457
1.249GlyGlu: 1.249 ± 0.765
1.249GlyPhe: 1.249 ± 0.833
3.747GlyGly: 3.747 ± 1.232
0.416GlyHis: 0.416 ± 0.255
2.914GlyIle: 2.914 ± 1.08
3.331GlyLys: 3.331 ± 0.562
4.163GlyLeu: 4.163 ± 1.637
0.416GlyMet: 0.416 ± 0.255
2.498GlyAsn: 2.498 ± 0.924
4.163GlyPro: 4.163 ± 0.669
4.163GlyGln: 4.163 ± 1.346
5.412GlyArg: 5.412 ± 2.172
7.91GlySer: 7.91 ± 3.27
7.494GlyThr: 7.494 ± 1.009
4.996GlyVal: 4.996 ± 1.847
2.082GlyTrp: 2.082 ± 0.455
2.082GlyTyr: 2.082 ± 0.219
0.0GlyXaa: 0.0 ± 0.0
His
1.665HisAla: 1.665 ± 0.634
0.416HisCys: 0.416 ± 0.537
0.416HisAsp: 0.416 ± 0.255
1.249HisGlu: 1.249 ± 0.462
1.249HisPhe: 1.249 ± 0.885
1.249HisGly: 1.249 ± 0.505
0.833HisHis: 0.833 ± 0.455
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
0.833HisLeu: 0.833 ± 0.51
0.0HisMet: 0.0 ± 0.0
0.833HisAsn: 0.833 ± 0.51
2.498HisPro: 2.498 ± 1.15
1.665HisGln: 1.665 ± 0.076
2.082HisArg: 2.082 ± 0.861
1.249HisSer: 1.249 ± 0.462
3.331HisThr: 3.331 ± 1.27
4.996HisVal: 4.996 ± 1.903
0.416HisTrp: 0.416 ± 0.255
0.416HisTyr: 0.416 ± 0.255
0.0HisXaa: 0.0 ± 0.0
Ile
1.665IleAla: 1.665 ± 0.658
0.416IleCys: 0.416 ± 0.255
1.249IleAsp: 1.249 ± 0.765
1.249IleGlu: 1.249 ± 0.765
1.249IlePhe: 1.249 ± 0.462
2.082IleGly: 2.082 ± 1.213
0.833IleHis: 0.833 ± 0.393
1.665IleIle: 1.665 ± 1.02
1.665IleLys: 1.665 ± 0.634
2.498IleLeu: 2.498 ± 0.52
0.416IleMet: 0.416 ± 0.228
0.833IleAsn: 0.833 ± 0.393
2.498IlePro: 2.498 ± 1.666
1.249IleGln: 1.249 ± 0.462
2.082IleArg: 2.082 ± 0.849
2.914IleSer: 2.914 ± 2.451
5.412IleThr: 5.412 ± 2.768
1.249IleVal: 1.249 ± 0.765
0.416IleTrp: 0.416 ± 0.255
0.416IleTyr: 0.416 ± 0.474
0.0IleXaa: 0.0 ± 0.0
Lys
4.58LysAla: 4.58 ± 1.71
0.416LysCys: 0.416 ± 0.255
1.249LysAsp: 1.249 ± 0.462
0.833LysGlu: 0.833 ± 0.393
1.249LysPhe: 1.249 ± 0.505
1.665LysGly: 1.665 ± 0.785
0.416LysHis: 0.416 ± 0.255
0.833LysIle: 0.833 ± 0.51
1.249LysLys: 1.249 ± 0.462
1.665LysLeu: 1.665 ± 0.634
0.0LysMet: 0.0 ± 0.0
1.249LysAsn: 1.249 ± 0.765
3.747LysPro: 3.747 ± 1.735
1.665LysGln: 1.665 ± 1.02
1.665LysArg: 1.665 ± 0.656
1.249LysSer: 1.249 ± 0.462
1.665LysThr: 1.665 ± 0.785
2.498LysVal: 2.498 ± 1.178
0.416LysTrp: 0.416 ± 0.474
0.416LysTyr: 0.416 ± 0.255
0.0LysXaa: 0.0 ± 0.0
Leu
9.575LeuAla: 9.575 ± 0.781
0.833LeuCys: 0.833 ± 0.393
4.996LeuAsp: 4.996 ± 2.164
2.498LeuGlu: 2.498 ± 1.178
2.914LeuPhe: 2.914 ± 0.387
7.91LeuGly: 7.91 ± 2.002
2.498LeuHis: 2.498 ± 1.082
3.747LeuIle: 3.747 ± 0.858
2.914LeuLys: 2.914 ± 0.723
9.159LeuLeu: 9.159 ± 0.418
1.249LeuMet: 1.249 ± 0.539
0.416LeuAsn: 0.416 ± 0.255
5.412LeuPro: 5.412 ± 1.641
2.914LeuGln: 2.914 ± 0.723
9.575LeuArg: 9.575 ± 2.842
4.58LeuSer: 4.58 ± 0.59
6.661LeuThr: 6.661 ± 1.279
6.661LeuVal: 6.661 ± 0.905
0.833LeuTrp: 0.833 ± 0.949
0.416LeuTyr: 0.416 ± 0.255
0.416LeuXaa: 0.416 ± 0.474
Met
2.082MetAla: 2.082 ± 0.219
0.416MetCys: 0.416 ± 0.537
0.833MetAsp: 0.833 ± 0.393
0.0MetGlu: 0.0 ± 0.0
0.416MetPhe: 0.416 ± 0.474
0.833MetGly: 0.833 ± 0.558
0.416MetHis: 0.416 ± 0.474
0.416MetIle: 0.416 ± 0.255
0.0MetLys: 0.0 ± 0.0
0.833MetLeu: 0.833 ± 0.455
0.833MetMet: 0.833 ± 0.51
0.0MetAsn: 0.0 ± 0.0
1.249MetPro: 1.249 ± 0.987
1.249MetGln: 1.249 ± 0.306
1.665MetArg: 1.665 ± 0.658
1.249MetSer: 1.249 ± 0.505
0.416MetThr: 0.416 ± 0.255
0.416MetVal: 0.416 ± 0.255
0.0MetTrp: 0.0 ± 0.0
0.833MetTyr: 0.833 ± 0.558
0.0MetXaa: 0.0 ± 0.0
Asn
1.665AsnAla: 1.665 ± 1.02
0.416AsnCys: 0.416 ± 0.255
0.833AsnAsp: 0.833 ± 0.51
0.833AsnGlu: 0.833 ± 0.558
1.249AsnPhe: 1.249 ± 0.462
2.082AsnGly: 2.082 ± 0.455
1.249AsnHis: 1.249 ± 0.765
0.416AsnIle: 0.416 ± 0.474
0.833AsnLys: 0.833 ± 0.51
4.58AsnLeu: 4.58 ± 0.686
0.0AsnMet: 0.0 ± 0.0
0.833AsnAsn: 0.833 ± 0.949
1.665AsnPro: 1.665 ± 0.785
0.416AsnGln: 0.416 ± 0.474
1.249AsnArg: 1.249 ± 0.462
2.082AsnSer: 2.082 ± 1.103
3.331AsnThr: 3.331 ± 1.568
1.249AsnVal: 1.249 ± 0.765
0.416AsnTrp: 0.416 ± 0.474
0.416AsnTyr: 0.416 ± 0.474
0.0AsnXaa: 0.0 ± 0.0
Pro
7.494ProAla: 7.494 ± 0.411
1.249ProCys: 1.249 ± 1.611
2.914ProAsp: 2.914 ± 1.198
4.996ProGlu: 4.996 ± 1.996
1.249ProPhe: 1.249 ± 0.962
4.996ProGly: 4.996 ± 0.929
0.416ProHis: 0.416 ± 0.255
1.249ProIle: 1.249 ± 0.306
2.082ProLys: 2.082 ± 0.455
4.58ProLeu: 4.58 ± 0.758
2.082ProMet: 2.082 ± 1.406
1.249ProAsn: 1.249 ± 0.462
8.743ProPro: 8.743 ± 7.133
2.498ProGln: 2.498 ± 0.52
8.326ProArg: 8.326 ± 5.888
7.91ProSer: 7.91 ± 6.928
7.91ProThr: 7.91 ± 2.641
3.747ProVal: 3.747 ± 1.576
0.416ProTrp: 0.416 ± 0.537
1.665ProTyr: 1.665 ± 0.076
0.0ProXaa: 0.0 ± 0.0
Gln
3.747GlnAla: 3.747 ± 0.748
1.665GlnCys: 1.665 ± 0.658
2.082GlnAsp: 2.082 ± 0.819
2.498GlnGlu: 2.498 ± 0.926
2.082GlnPhe: 2.082 ± 0.219
4.163GlnGly: 4.163 ± 1.047
2.498GlnHis: 2.498 ± 1.082
2.082GlnIle: 2.082 ± 0.849
0.833GlnLys: 0.833 ± 0.51
2.914GlnLeu: 2.914 ± 1.4
0.833GlnMet: 0.833 ± 1.074
1.249GlnAsn: 1.249 ± 0.505
2.082GlnPro: 2.082 ± 1.511
3.747GlnGln: 3.747 ± 0.917
3.331GlnArg: 3.331 ± 0.974
3.747GlnSer: 3.747 ± 1.38
4.58GlnThr: 4.58 ± 0.59
1.249GlnVal: 1.249 ± 0.505
1.249GlnTrp: 1.249 ± 0.765
0.416GlnTyr: 0.416 ± 0.474
0.0GlnXaa: 0.0 ± 0.0
Arg
8.743ArgAla: 8.743 ± 2.178
2.914ArgCys: 2.914 ± 1.855
3.331ArgAsp: 3.331 ± 0.562
0.833ArgGlu: 0.833 ± 0.51
3.331ArgPhe: 3.331 ± 1.269
5.412ArgGly: 5.412 ± 1.034
1.665ArgHis: 1.665 ± 1.02
2.498ArgIle: 2.498 ± 0.336
2.082ArgLys: 2.082 ± 0.819
7.077ArgLeu: 7.077 ± 0.873
2.498ArgMet: 2.498 ± 1.279
2.498ArgAsn: 2.498 ± 0.336
4.996ArgPro: 4.996 ± 3.055
4.163ArgGln: 4.163 ± 2.044
6.661ArgArg: 6.661 ± 3.187
7.077ArgSer: 7.077 ± 4.985
5.412ArgThr: 5.412 ± 3.211
5.828ArgVal: 5.828 ± 1.566
1.249ArgTrp: 1.249 ± 1.611
1.249ArgTyr: 1.249 ± 0.462
0.0ArgXaa: 0.0 ± 0.0
Ser
5.828SerAla: 5.828 ± 2.708
2.082SerCys: 2.082 ± 2.686
2.498SerAsp: 2.498 ± 0.336
2.498SerGlu: 2.498 ± 0.926
1.249SerPhe: 1.249 ± 0.833
5.828SerGly: 5.828 ± 2.439
1.665SerHis: 1.665 ± 0.076
2.498SerIle: 2.498 ± 0.924
1.249SerLys: 1.249 ± 0.765
8.326SerLeu: 8.326 ± 5.22
1.249SerMet: 1.249 ± 0.306
1.665SerAsn: 1.665 ± 0.076
7.91SerPro: 7.91 ± 7.56
4.996SerGln: 4.996 ± 2.306
4.58SerArg: 4.58 ± 3.94
8.743SerSer: 8.743 ± 7.248
9.159SerThr: 9.159 ± 4.596
2.498SerVal: 2.498 ± 1.924
2.082SerTrp: 2.082 ± 1.103
1.665SerTyr: 1.665 ± 1.49
0.0SerXaa: 0.0 ± 0.0
Thr
8.743ThrAla: 8.743 ± 0.594
1.249ThrCys: 1.249 ± 0.505
4.163ThrAsp: 4.163 ± 1.12
1.249ThrGlu: 1.249 ± 0.505
0.833ThrPhe: 0.833 ± 0.51
6.245ThrGly: 6.245 ± 2.054
2.498ThrHis: 2.498 ± 0.926
2.082ThrIle: 2.082 ± 0.863
3.747ThrLys: 3.747 ± 1.988
6.245ThrLeu: 6.245 ± 0.657
0.833ThrMet: 0.833 ± 0.393
2.498ThrAsn: 2.498 ± 1.666
8.743ThrPro: 8.743 ± 4.47
4.163ThrGln: 4.163 ± 1.073
7.494ThrArg: 7.494 ± 2.063
8.326ThrSer: 8.326 ± 5.921
6.661ThrThr: 6.661 ± 1.834
7.077ThrVal: 7.077 ± 3.209
1.665ThrTrp: 1.665 ± 0.076
2.082ThrTyr: 2.082 ± 1.213
0.0ThrXaa: 0.0 ± 0.0
Val
4.996ValAla: 4.996 ± 4.397
3.747ValCys: 3.747 ± 1.232
3.331ValAsp: 3.331 ± 1.269
4.58ValGlu: 4.58 ± 1.928
2.082ValPhe: 2.082 ± 1.213
4.996ValGly: 4.996 ± 1.51
2.914ValHis: 2.914 ± 0.723
2.498ValIle: 2.498 ± 1.178
0.833ValLys: 0.833 ± 0.949
4.163ValLeu: 4.163 ± 1.699
0.833ValMet: 0.833 ± 0.51
1.249ValAsn: 1.249 ± 0.462
4.996ValPro: 4.996 ± 0.938
1.249ValGln: 1.249 ± 0.462
5.828ValArg: 5.828 ± 0.255
6.661ValSer: 6.661 ± 1.948
4.996ValThr: 4.996 ± 0.694
4.996ValVal: 4.996 ± 2.568
1.249ValTrp: 1.249 ± 0.765
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.665TrpAla: 1.665 ± 0.076
0.416TrpCys: 0.416 ± 0.255
0.416TrpAsp: 0.416 ± 0.255
1.665TrpGlu: 1.665 ± 0.634
1.249TrpPhe: 1.249 ± 0.833
1.665TrpGly: 1.665 ± 0.634
0.0TrpHis: 0.0 ± 0.0
0.416TrpIle: 0.416 ± 0.255
0.0TrpLys: 0.0 ± 0.0
3.331TrpLeu: 3.331 ± 0.151
0.0TrpMet: 0.0 ± 0.0
0.416TrpAsn: 0.416 ± 0.537
1.665TrpPro: 1.665 ± 0.656
0.416TrpGln: 0.416 ± 0.255
1.249TrpArg: 1.249 ± 0.987
2.082TrpSer: 2.082 ± 0.613
1.249TrpThr: 1.249 ± 0.462
0.416TrpVal: 0.416 ± 0.537
0.416TrpTrp: 0.416 ± 0.537
0.416TrpTyr: 0.416 ± 0.255
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.498TyrAla: 2.498 ± 0.469
1.249TyrCys: 1.249 ± 0.306
1.249TyrAsp: 1.249 ± 0.505
0.833TyrGlu: 0.833 ± 0.51
0.416TyrPhe: 0.416 ± 0.255
2.082TyrGly: 2.082 ± 1.213
0.416TyrHis: 0.416 ± 0.537
0.0TyrIle: 0.0 ± 0.0
1.249TyrLys: 1.249 ± 0.462
0.416TyrLeu: 0.416 ± 0.474
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
1.665TyrPro: 1.665 ± 0.634
0.833TyrGln: 0.833 ± 0.558
1.665TyrArg: 1.665 ± 0.785
0.0TyrSer: 0.0 ± 0.0
2.082TyrThr: 2.082 ± 1.213
0.833TyrVal: 0.833 ± 0.455
0.0TyrTrp: 0.0 ± 0.0
0.416TyrTyr: 0.416 ± 0.255
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.833XaaLeu: 0.833 ± 0.558
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2403 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski