Amino acid dipeptide frequency for Hubei picorna-like virus 78

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
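The counting described above can be sketched in Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function names (dipeptide_permille, expected_permille, classify) are hypothetical, sequences are given in one-letter code, every non-standard letter is mapped to X (Xaa), dipeptides are counted as overlapping pairs within each protein, and the colour threshold is assumed to be the uniform expectation.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter code
ALPHABET = STANDARD + "X"          # X plays the role of Xaa (assumption)


def dipeptide_permille(sequences):
    """Frequency of each of the 441 dipeptides, in per mille."""
    counts = Counter()
    for seq in sequences:
        # Anything that is not a standard residue is treated as Xaa.
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping pairs; a pair never spans two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one sequence with two residues")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def expected_permille(n_residue_types=20):
    """Uniform expectation: 2.5 per mille for 20 types, about 2.27 for 21."""
    return 1000.0 / n_residue_types ** 2


def classify(freq_permille, n_residue_types=20):
    """Label a value the way the table colours it (red = over, blue = under)."""
    expected = expected_permille(n_residue_types)
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


toy = ["MKKLA", "AKKM"]            # made-up sequences, not the viral proteome
freqs = dipeptide_permille(toy)
kk_fraction = freqs["KK"] * 1e-3   # per mille back to a plain fraction
```

With these definitions, a table value such as 7.703‰ would be classified as "over" and 0.35‰ as "under" relative to the 2.5‰ expectation.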

For more information, see the sequence space article on Wikipedia.

Values are frequency ± error in ‰, grouped by first residue. Second-residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 1.751 ± 0.323
AlaCys: 1.401 ± 0.771
AlaAsp: 2.101 ± 1.156
AlaGlu: 1.401 ± 0.127
AlaPhe: 3.151 ± 0.448
AlaGly: 2.801 ± 1.032
AlaHis: 2.101 ± 0.13
AlaIle: 3.852 ± 0.19
AlaLys: 3.151 ± 0.448
AlaLeu: 7.703 ± 1.023
AlaMet: 1.401 ± 0.127
AlaAsn: 1.751 ± 0.32
AlaPro: 2.451 ± 0.062
AlaGln: 1.401 ± 0.127
AlaArg: 2.801 ± 0.898
AlaSer: 3.501 ± 0.646
AlaThr: 2.801 ± 0.255
AlaVal: 2.451 ± 0.581
AlaTrp: 0.7 ± 0.258
AlaTyr: 1.05 ± 0.578
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.35 ± 0.193
CysCys: 0.0 ± 0.0
CysAsp: 0.7 ± 0.385
CysGlu: 0.7 ± 0.385
CysPhe: 0.7 ± 0.385
CysGly: 1.05 ± 0.578
CysHis: 0.35 ± 0.193
CysIle: 0.35 ± 0.193
CysLys: 1.05 ± 0.578
CysLeu: 1.401 ± 0.771
CysMet: 1.401 ± 0.771
CysAsn: 0.35 ± 0.193
CysPro: 0.0 ± 0.0
CysGln: 0.7 ± 0.385
CysArg: 0.35 ± 0.193
CysSer: 0.7 ± 0.385
CysThr: 0.0 ± 0.0
CysVal: 1.751 ± 0.32
CysTrp: 0.0 ± 0.0
CysTyr: 0.35 ± 0.193
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.902 ± 0.768
AspCys: 0.35 ± 0.193
AspAsp: 3.501 ± 0.003
AspGlu: 1.751 ± 0.323
AspPhe: 4.552 ± 1.355
AspGly: 3.151 ± 0.196
AspHis: 1.401 ± 0.771
AspIle: 4.902 ± 0.768
AspLys: 2.451 ± 1.349
AspLeu: 7.003 ± 0.006
AspMet: 0.7 ± 0.385
AspAsn: 3.151 ± 1.734
AspPro: 3.852 ± 2.383
AspGln: 2.801 ± 0.255
AspArg: 1.751 ± 0.32
AspSer: 3.501 ± 0.646
AspThr: 3.151 ± 0.448
AspVal: 3.151 ± 1.482
AspTrp: 1.401 ± 0.771
AspTyr: 4.202 ± 0.382
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.151 ± 0.448
GluCys: 0.35 ± 0.193
GluAsp: 3.501 ± 0.64
GluGlu: 4.202 ± 1.669
GluPhe: 4.552 ± 1.862
GluGly: 1.401 ± 0.127
GluHis: 2.801 ± 1.032
GluIle: 3.852 ± 0.19
GluLys: 4.902 ± 2.054
GluLeu: 5.602 ± 1.797
GluMet: 2.451 ± 0.062
GluAsn: 2.451 ± 1.349
GluPro: 2.801 ± 1.032
GluGln: 1.05 ± 0.709
GluArg: 1.05 ± 0.065
GluSer: 2.451 ± 0.062
GluThr: 3.852 ± 0.19
GluVal: 2.101 ± 0.513
GluTrp: 1.401 ± 0.516
GluTyr: 3.151 ± 1.734
GluXaa: 0.0 ± 0.0

Phe
PheAla: 4.202 ± 0.261
PheCys: 2.101 ± 1.156
PheAsp: 4.552 ± 1.218
PheGlu: 4.202 ± 1.669
PhePhe: 3.501 ± 0.64
PheGly: 4.552 ± 1.218
PheHis: 2.101 ± 0.774
PheIle: 2.451 ± 0.706
PheLys: 2.101 ± 0.13
PheLeu: 3.501 ± 0.64
PheMet: 1.05 ± 0.495
PheAsn: 3.852 ± 0.19
PhePro: 2.101 ± 0.13
PheGln: 1.751 ± 0.966
PheArg: 4.902 ± 1.805
PheSer: 4.202 ± 0.904
PheThr: 3.151 ± 0.448
PheVal: 3.501 ± 0.64
PheTrp: 0.7 ± 0.901
PheTyr: 1.751 ± 0.32
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.751 ± 0.32
GlyCys: 0.35 ± 0.193
GlyAsp: 2.451 ± 0.581
GlyGlu: 2.451 ± 0.062
GlyPhe: 2.101 ± 0.513
GlyGly: 2.451 ± 1.224
GlyHis: 0.0 ± 0.0
GlyIle: 2.101 ± 0.513
GlyLys: 5.602 ± 0.51
GlyLeu: 3.151 ± 0.196
GlyMet: 1.401 ± 0.516
GlyAsn: 3.501 ± 0.646
GlyPro: 1.751 ± 0.32
GlyGln: 2.451 ± 1.224
GlyArg: 2.801 ± 0.255
GlySer: 3.151 ± 0.839
GlyThr: 3.852 ± 1.74
GlyVal: 4.902 ± 0.125
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.501 ± 1.284
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.05 ± 0.578
HisCys: 0.35 ± 0.193
HisAsp: 1.751 ± 0.323
HisGlu: 1.05 ± 0.578
HisPhe: 3.151 ± 0.448
HisGly: 0.0 ± 0.0
HisHis: 0.35 ± 0.193
HisIle: 3.151 ± 1.091
HisLys: 2.101 ± 0.13
HisLeu: 2.451 ± 0.706
HisMet: 0.7 ± 0.258
HisAsn: 1.401 ± 1.159
HisPro: 2.451 ± 1.224
HisGln: 0.0 ± 0.0
HisArg: 1.05 ± 0.065
HisSer: 2.101 ± 0.513
HisThr: 1.401 ± 0.127
HisVal: 1.401 ± 0.127
HisTrp: 0.0 ± 0.0
HisTyr: 0.7 ± 0.385
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.902 ± 0.519
IleCys: 0.0 ± 0.0
IleAsp: 3.501 ± 0.003
IleGlu: 4.552 ± 0.068
IlePhe: 3.501 ± 0.003
IleGly: 3.852 ± 1.097
IleHis: 1.401 ± 0.771
IleIle: 3.852 ± 0.833
IleLys: 3.151 ± 1.091
IleLeu: 4.902 ± 0.125
IleMet: 1.401 ± 0.771
IleAsn: 3.852 ± 0.454
IlePro: 3.852 ± 1.097
IleGln: 4.202 ± 0.261
IleArg: 2.801 ± 0.388
IleSer: 4.902 ± 2.449
IleThr: 3.151 ± 0.196
IleVal: 2.101 ± 0.513
IleTrp: 2.101 ± 1.156
IleTyr: 3.501 ± 0.64
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.151 ± 0.448
LysCys: 0.0 ± 0.0
LysAsp: 3.852 ± 1.476
LysGlu: 3.501 ± 0.64
LysPhe: 3.151 ± 1.091
LysGly: 2.451 ± 0.706
LysHis: 2.101 ± 0.513
LysIle: 3.501 ± 1.284
LysLys: 2.801 ± 1.542
LysLeu: 3.501 ± 0.003
LysMet: 1.05 ± 0.578
LysAsn: 1.751 ± 0.963
LysPro: 4.202 ± 1.026
LysGln: 1.401 ± 0.771
LysArg: 2.801 ± 0.898
LysSer: 5.252 ± 0.961
LysThr: 4.202 ± 0.382
LysVal: 5.252 ± 0.326
LysTrp: 2.101 ± 1.156
LysTyr: 3.501 ± 0.64
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.252 ± 0.961
LeuCys: 1.401 ± 0.771
LeuAsp: 7.703 ± 0.264
LeuGlu: 3.852 ± 0.19
LeuPhe: 6.653 ± 0.445
LeuGly: 2.801 ± 0.388
LeuHis: 1.751 ± 0.323
LeuIle: 4.552 ± 0.068
LeuLys: 7.353 ± 0.83
LeuLeu: 5.252 ± 2.247
LeuMet: 1.401 ± 0.516
LeuAsn: 5.952 ± 1.227
LeuPro: 3.501 ± 0.646
LeuGln: 4.202 ± 1.026
LeuArg: 4.202 ± 0.382
LeuSer: 7.353 ± 1.1
LeuThr: 6.653 ± 0.445
LeuVal: 3.852 ± 1.097
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.151 ± 0.196
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.451 ± 0.062
MetCys: 0.7 ± 0.385
MetAsp: 1.05 ± 0.578
MetGlu: 0.7 ± 0.258
MetPhe: 1.751 ± 0.963
MetGly: 0.7 ± 0.385
MetHis: 0.0 ± 0.0
MetIle: 2.101 ± 0.513
MetLys: 2.101 ± 1.156
MetLeu: 1.751 ± 0.323
MetMet: 1.05 ± 0.578
MetAsn: 2.801 ± 0.255
MetPro: 0.0 ± 0.0
MetGln: 0.35 ± 0.193
MetArg: 0.0 ± 0.0
MetSer: 2.451 ± 0.062
MetThr: 1.05 ± 0.578
MetVal: 1.401 ± 0.771
MetTrp: 0.0 ± 0.0
MetTyr: 1.05 ± 0.578
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.401 ± 0.771
AsnCys: 0.7 ± 0.385
AsnAsp: 4.202 ± 1.026
AsnGlu: 2.801 ± 0.255
AsnPhe: 1.751 ± 0.32
AsnGly: 1.751 ± 0.963
AsnHis: 0.7 ± 0.385
AsnIle: 4.902 ± 1.805
AsnLys: 3.151 ± 0.448
AsnLeu: 6.303 ± 0.391
AsnMet: 1.05 ± 0.578
AsnAsn: 3.852 ± 1.097
AsnPro: 5.952 ± 2.514
AsnGln: 2.801 ± 1.032
AsnArg: 2.101 ± 0.513
AsnSer: 4.202 ± 1.547
AsnThr: 4.902 ± 3.092
AsnVal: 2.801 ± 0.388
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.101 ± 0.13
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.101 ± 0.513
ProCys: 1.05 ± 0.065
ProAsp: 3.151 ± 0.839
ProGlu: 3.852 ± 1.476
ProPhe: 2.451 ± 2.511
ProGly: 3.151 ± 0.839
ProHis: 1.751 ± 0.323
ProIle: 2.801 ± 0.255
ProLys: 2.801 ± 0.255
ProLeu: 3.852 ± 0.454
ProMet: 0.35 ± 0.451
ProAsn: 1.751 ± 1.61
ProPro: 2.101 ± 0.774
ProGln: 2.451 ± 0.062
ProArg: 2.801 ± 0.898
ProSer: 3.852 ± 1.097
ProThr: 5.952 ± 4.444
ProVal: 4.902 ± 0.519
ProTrp: 0.7 ± 0.385
ProTyr: 1.751 ± 0.323
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.101 ± 0.774
GlnCys: 0.35 ± 0.193
GlnAsp: 1.751 ± 0.323
GlnGlu: 2.101 ± 0.513
GlnPhe: 3.852 ± 1.74
GlnGly: 2.801 ± 1.032
GlnHis: 2.451 ± 0.706
GlnIle: 2.101 ± 0.13
GlnLys: 1.05 ± 0.065
GlnLeu: 2.451 ± 0.062
GlnMet: 0.35 ± 0.193
GlnAsn: 4.902 ± 1.162
GlnPro: 1.751 ± 0.32
GlnGln: 0.35 ± 0.193
GlnArg: 1.401 ± 0.127
GlnSer: 2.451 ± 1.224
GlnThr: 1.751 ± 0.32
GlnVal: 2.451 ± 0.706
GlnTrp: 0.7 ± 0.385
GlnTyr: 0.35 ± 0.451
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.101 ± 0.13
ArgCys: 1.05 ± 0.578
ArgAsp: 1.401 ± 0.516
ArgGlu: 4.552 ± 0.068
ArgPhe: 1.05 ± 0.578
ArgGly: 2.801 ± 0.255
ArgHis: 1.401 ± 0.771
ArgIle: 2.451 ± 0.581
ArgLys: 2.101 ± 1.156
ArgLeu: 3.501 ± 1.284
ArgMet: 1.401 ± 0.56
ArgAsn: 2.451 ± 0.062
ArgPro: 1.751 ± 0.323
ArgGln: 1.751 ± 0.32
ArgArg: 3.501 ± 0.64
ArgSer: 2.801 ± 1.032
ArgThr: 3.151 ± 0.448
ArgVal: 3.852 ± 1.097
ArgTrp: 0.7 ± 0.258
ArgTyr: 0.35 ± 0.193
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.101 ± 0.13
SerCys: 0.35 ± 0.193
SerAsp: 4.202 ± 0.904
SerGlu: 4.552 ± 0.068
SerPhe: 4.202 ± 0.261
SerGly: 3.501 ± 1.933
SerHis: 2.101 ± 1.417
SerIle: 6.303 ± 3.608
SerLys: 4.202 ± 1.026
SerLeu: 9.104 ± 2.066
SerMet: 2.451 ± 1.349
SerAsn: 2.101 ± 0.774
SerPro: 2.101 ± 0.13
SerGln: 2.101 ± 0.513
SerArg: 1.751 ± 0.32
SerSer: 3.852 ± 1.74
SerThr: 7.353 ± 4.96
SerVal: 6.653 ± 1.485
SerTrp: 0.7 ± 0.385
SerTyr: 1.401 ± 0.771
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 1.751 ± 0.32
ThrCys: 0.7 ± 0.385
ThrAsp: 4.202 ± 2.191
ThrGlu: 4.552 ± 0.575
ThrPhe: 4.552 ± 1.218
ThrGly: 4.552 ± 1.355
ThrHis: 1.401 ± 0.771
ThrIle: 3.852 ± 1.097
ThrLys: 4.202 ± 1.026
ThrLeu: 5.952 ± 3.157
ThrMet: 1.401 ± 0.771
ThrAsn: 2.801 ± 0.255
ThrPro: 5.252 ± 1.613
ThrGln: 2.801 ± 1.675
ThrArg: 4.552 ± 0.068
ThrSer: 6.303 ± 2.964
ThrThr: 4.552 ± 3.285
ThrVal: 3.501 ± 1.933
ThrTrp: 0.7 ± 0.385
ThrTyr: 1.05 ± 0.065
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.451 ± 0.062
ValCys: 0.35 ± 0.193
ValAsp: 4.552 ± 0.711
ValGlu: 3.501 ± 1.284
ValPhe: 3.501 ± 0.003
ValGly: 2.451 ± 0.062
ValHis: 1.401 ± 0.127
ValIle: 4.202 ± 0.904
ValLys: 3.151 ± 0.448
ValLeu: 5.252 ± 1.613
ValMet: 1.05 ± 0.578
ValAsn: 4.202 ± 2.191
ValPro: 4.902 ± 0.519
ValGln: 1.751 ± 0.966
ValArg: 1.751 ± 0.32
ValSer: 3.501 ± 1.29
ValThr: 5.602 ± 1.153
ValVal: 2.101 ± 0.13
ValTrp: 0.35 ± 0.193
ValTyr: 4.202 ± 2.834
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.35 ± 0.193
TrpAsp: 0.7 ± 0.385
TrpGlu: 0.35 ± 0.193
TrpPhe: 0.7 ± 0.385
TrpGly: 0.7 ± 0.385
TrpHis: 0.7 ± 0.385
TrpIle: 1.751 ± 0.32
TrpLys: 0.35 ± 0.193
TrpLeu: 0.7 ± 0.385
TrpMet: 0.7 ± 0.385
TrpAsn: 1.401 ± 0.127
TrpPro: 0.7 ± 0.385
TrpGln: 0.7 ± 0.258
TrpArg: 0.7 ± 0.901
TrpSer: 1.05 ± 0.578
TrpThr: 0.35 ± 0.451
TrpVal: 0.35 ± 0.193
TrpTrp: 0.35 ± 0.193
TrpTyr: 1.401 ± 0.771
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.801 ± 0.255
TyrCys: 0.35 ± 0.193
TyrAsp: 2.801 ± 1.542
TyrGlu: 2.451 ± 0.706
TyrPhe: 1.401 ± 0.127
TyrGly: 2.801 ± 1.542
TyrHis: 0.35 ± 0.193
TyrIle: 2.451 ± 0.706
TyrLys: 1.751 ± 0.963
TyrLeu: 3.501 ± 0.64
TyrMet: 0.35 ± 0.193
TyrAsn: 3.151 ± 1.482
TyrPro: 2.101 ± 0.13
TyrGln: 2.451 ± 0.062
TyrArg: 1.401 ± 0.127
TyrSer: 3.501 ± 0.003
TyrThr: 1.751 ± 0.323
TyrVal: 1.751 ± 1.61
TyrTrp: 1.401 ± 0.771
TyrTyr: 2.451 ± 1.224
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2 proteins (2857 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
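A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration only, not the Proteome-pI implementation: the helper names (dipeptide_freq, bootstrap_error) are hypothetical, sequences are in one-letter code, and the reported error is assumed here to be the standard deviation over 100 resampled proteomes, which the page itself does not specify.

```python
import random
import statistics


def dipeptide_freq(sequences, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping pairs."""
    hits = total = 0
    for seq in sequences:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        hits += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw proteins with replacement, keeping the number of proteins.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_freq(sample, dipeptide))
    # Assumption: the error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)


toy = ["MKKLA", "AKKM"]  # made-up sequences, not the viral proteome
value = dipeptide_freq(toy, "KK")
error = bootstrap_error(toy, "KK")
print(f"KK: {value:.3f} ± {error:.3f}")
```

Because whole proteins are resampled, a dipeptide that occurs in none of them gets an error of exactly zero, which is consistent with the 0.0 ± 0.0 entries in the table.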

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski