Amino acid dipeptide frequency for Wenzhou picorna-like virus 29

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
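As a rough illustration of how such frequencies can be computed, the following Python sketch counts overlapping residue pairs within each protein and reports them in per mille. It assumes one-letter amino acid codes (as in FASTA files) rather than the three-letter codes used in the table. The function names, the toy sequences, and the counting convention are illustrative assumptions, not the actual Proteome-pI implementation.

```python
from collections import Counter
from itertools import product

# Assumption: one-letter codes for the 20 standard amino acids.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = AMINO_ACIDS + "X"  # X stands in for Xaa (non-standard or unknown)

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; pairs never span two proteins.
        for a, b in zip(seq, seq[1:]):
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(value_permille, expected=UNIFORM_PERMILLE):
    """Label a value as over- or underrepresented relative to `expected`."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Made-up toy sequences, not taken from the proteome.
toy = ["MKKLA", "AKKA"]
freqs = dipeptide_permille(toy)
print(round(freqs["KK"], 3), classify(freqs["KK"]))
```

With this convention, a table value such as AlaSer (8.165‰) would be labelled "over" and AlaCys (0.71‰) "under" relative to the uniform expectation of 2.5‰.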

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
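The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical; the example value is the AlaSer entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ser = 8.165  # AlaSer frequency from the table, in per mille
print(f"{permille_to_fraction(ala_ser):.6f}")  # 0.008165
print(f"{permille_to_percent(ala_ser):.4f}")   # 0.8165
```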

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.325 ± 0.698
AlaCys: 0.71 ± 0.388
AlaAsp: 3.55 ± 1.153
AlaGlu: 3.195 ± 0.2
AlaPhe: 3.905 ± 0.959
AlaGly: 5.325 ± 0.698
AlaHis: 1.065 ± 0.067
AlaIle: 1.775 ± 0.061
AlaLys: 3.55 ± 0.394
AlaLeu: 6.035 ± 1.754
AlaMet: 1.42 ± 0.161
AlaAsn: 3.905 ± 1.104
AlaPro: 4.26 ± 2.312
AlaGln: 4.97 ± 0.376
AlaArg: 2.84 ± 1.026
AlaSer: 8.165 ± 3.787
AlaThr: 5.325 ± 0.182
AlaVal: 4.26 ± 0.765
AlaTrp: 1.065 ± 0.067
AlaTyr: 4.615 ± 0.055
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.065 ± 0.583
CysCys: 0.355 ± 0.194
CysAsp: 0.355 ± 0.194
CysGlu: 1.42 ± 0.255
CysPhe: 0.71 ± 0.388
CysGly: 1.42 ± 0.777
CysHis: 0.0 ± 0.0
CysIle: 0.71 ± 0.643
CysLys: 0.0 ± 0.0
CysLeu: 1.775 ± 0.971
CysMet: 0.0 ± 0.0
CysAsn: 1.065 ± 0.067
CysPro: 0.71 ± 0.388
CysGln: 0.0 ± 0.0
CysArg: 0.71 ± 0.127
CysSer: 0.71 ± 0.388
CysThr: 0.355 ± 0.194
CysVal: 1.42 ± 0.261
CysTrp: 0.0 ± 0.0
CysTyr: 0.71 ± 0.127
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.97 ± 1.924
AspCys: 1.065 ± 0.583
AspAsp: 6.035 ± 0.722
AspGlu: 2.13 ± 0.649
AspPhe: 1.775 ± 0.061
AspGly: 3.195 ± 0.716
AspHis: 2.13 ± 0.133
AspIle: 4.97 ± 2.44
AspLys: 2.485 ± 0.844
AspLeu: 7.1 ± 0.789
AspMet: 2.13 ± 0.132
AspAsn: 3.195 ± 1.748
AspPro: 3.905 ± 1.104
AspGln: 1.065 ± 0.449
AspArg: 1.775 ± 0.061
AspSer: 5.325 ± 0.334
AspThr: 3.905 ± 1.475
AspVal: 4.97 ± 0.376
AspTrp: 0.0 ± 0.0
AspTyr: 1.775 ± 0.577
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.485 ± 1.359
GluCys: 0.71 ± 0.388
GluAsp: 2.485 ± 0.188
GluGlu: 3.905 ± 2.136
GluPhe: 3.195 ± 1.232
GluGly: 2.13 ± 0.133
GluHis: 0.71 ± 0.127
GluIle: 4.26 ± 0.765
GluLys: 2.13 ± 0.649
GluLeu: 5.68 ± 2.591
GluMet: 1.42 ± 0.261
GluAsn: 1.42 ± 0.255
GluPro: 1.065 ± 0.583
GluGln: 1.42 ± 0.255
GluArg: 1.775 ± 0.455
GluSer: 1.775 ± 0.455
GluThr: 2.13 ± 0.382
GluVal: 4.615 ± 0.055
GluTrp: 1.065 ± 0.067
GluTyr: 2.13 ± 0.649
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.26 ± 0.765
PheCys: 0.355 ± 0.194
PheAsp: 3.905 ± 0.443
PheGlu: 2.84 ± 1.554
PhePhe: 2.13 ± 0.649
PheGly: 2.485 ± 1.22
PheHis: 1.775 ± 0.971
PheIle: 2.485 ± 0.844
PheLys: 2.13 ± 0.133
PheLeu: 4.615 ± 0.977
PheMet: 0.355 ± 0.322
PheAsn: 3.905 ± 0.443
PhePro: 3.55 ± 0.637
PheGln: 1.775 ± 0.577
PheArg: 1.42 ± 0.255
PheSer: 3.195 ± 0.2
PheThr: 1.775 ± 0.577
PheVal: 4.97 ± 0.376
PheTrp: 0.0 ± 0.0
PheTyr: 1.42 ± 0.255
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.615 ± 1.602
GlyCys: 0.0 ± 0.0
GlyAsp: 7.81 ± 0.146
GlyGlu: 2.84 ± 0.51
GlyPhe: 2.13 ± 0.382
GlyGly: 3.195 ± 0.831
GlyHis: 1.42 ± 0.255
GlyIle: 2.485 ± 0.328
GlyLys: 4.26 ± 0.783
GlyLeu: 2.84 ± 0.51
GlyMet: 1.42 ± 0.777
GlyAsn: 2.84 ± 0.51
GlyPro: 1.42 ± 0.255
GlyGln: 2.13 ± 1.165
GlyArg: 2.13 ± 0.382
GlySer: 4.615 ± 2.118
GlyThr: 4.26 ± 2.312
GlyVal: 2.84 ± 0.006
GlyTrp: 0.71 ± 0.127
GlyTyr: 3.55 ± 0.394
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.13 ± 0.382
HisCys: 0.0 ± 0.0
HisAsp: 0.71 ± 0.388
HisGlu: 1.775 ± 0.455
HisPhe: 1.775 ± 0.061
HisGly: 0.71 ± 0.388
HisHis: 0.0 ± 0.0
HisIle: 2.13 ± 0.382
HisLys: 0.355 ± 0.194
HisLeu: 3.905 ± 1.62
HisMet: 0.0 ± 0.0
HisAsn: 0.355 ± 0.194
HisPro: 0.71 ± 0.127
HisGln: 0.355 ± 0.194
HisArg: 1.775 ± 0.061
HisSer: 1.775 ± 0.061
HisThr: 0.71 ± 0.388
HisVal: 1.42 ± 0.261
HisTrp: 0.355 ± 0.322
HisTyr: 0.71 ± 0.388
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.195 ± 0.831
IleCys: 0.71 ± 0.388
IleAsp: 2.485 ± 0.328
IleGlu: 4.26 ± 1.299
IlePhe: 1.775 ± 0.577
IleGly: 5.325 ± 1.73
IleHis: 2.84 ± 0.522
IleIle: 1.065 ± 0.067
IleLys: 1.775 ± 0.455
IleLeu: 5.325 ± 0.85
IleMet: 0.355 ± 0.194
IleAsn: 2.485 ± 0.188
IlePro: 3.195 ± 0.831
IleGln: 3.195 ± 0.831
IleArg: 2.13 ± 0.649
IleSer: 6.745 ± 4.048
IleThr: 4.615 ± 1.086
IleVal: 3.195 ± 0.831
IleTrp: 0.0 ± 0.0
IleTyr: 2.485 ± 0.844
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.39 ± 2.464
LysCys: 1.065 ± 0.067
LysAsp: 1.775 ± 0.455
LysGlu: 0.71 ± 0.127
LysPhe: 1.775 ± 0.061
LysGly: 1.42 ± 0.777
LysHis: 1.065 ± 0.583
LysIle: 2.84 ± 0.51
LysLys: 0.71 ± 0.388
LysLeu: 2.84 ± 0.522
LysMet: 0.71 ± 0.127
LysAsn: 1.065 ± 0.583
LysPro: 2.13 ± 0.382
LysGln: 1.775 ± 0.455
LysArg: 0.71 ± 0.388
LysSer: 2.13 ± 1.165
LysThr: 2.485 ± 0.844
LysVal: 3.55 ± 1.942
LysTrp: 0.71 ± 0.388
LysTyr: 2.13 ± 0.649
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.1 ± 1.305
LeuCys: 1.775 ± 0.455
LeuAsp: 6.39 ± 0.631
LeuGlu: 4.26 ± 0.249
LeuPhe: 5.68 ± 2.076
LeuGly: 5.68 ± 1.02
LeuHis: 2.84 ± 1.038
LeuIle: 3.905 ± 1.104
LeuLys: 3.905 ± 1.62
LeuLeu: 6.745 ± 1.626
LeuMet: 1.42 ± 0.777
LeuAsn: 3.905 ± 0.073
LeuPro: 5.68 ± 0.528
LeuGln: 2.13 ± 0.382
LeuArg: 6.39 ± 0.631
LeuSer: 9.585 ± 0.085
LeuThr: 6.745 ± 0.437
LeuVal: 4.615 ± 0.571
LeuTrp: 1.065 ± 0.583
LeuTyr: 2.485 ± 0.844
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.775 ± 0.061
MetCys: 0.0 ± 0.0
MetAsp: 2.13 ± 0.133
MetGlu: 0.355 ± 0.322
MetPhe: 2.485 ± 0.328
MetGly: 1.42 ± 0.255
MetHis: 0.0 ± 0.0
MetIle: 1.065 ± 0.583
MetLys: 1.065 ± 0.583
MetLeu: 1.065 ± 0.583
MetMet: 0.355 ± 0.194
MetAsn: 0.71 ± 0.127
MetPro: 0.355 ± 0.194
MetGln: 0.355 ± 0.322
MetArg: 1.42 ± 0.777
MetSer: 2.84 ± 0.51
MetThr: 1.065 ± 0.583
MetVal: 0.71 ± 0.388
MetTrp: 0.355 ± 0.194
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.485 ± 0.328
AsnCys: 0.71 ± 0.388
AsnAsp: 1.42 ± 0.777
AsnGlu: 2.485 ± 0.844
AsnPhe: 1.42 ± 0.261
AsnGly: 1.065 ± 0.583
AsnHis: 0.355 ± 0.194
AsnIle: 2.485 ± 0.188
AsnLys: 0.355 ± 0.194
AsnLeu: 6.035 ± 1.341
AsnMet: 1.065 ± 0.583
AsnAsn: 3.195 ± 0.316
AsnPro: 4.97 ± 0.655
AsnGln: 1.065 ± 0.067
AsnArg: 2.13 ± 0.133
AsnSer: 8.165 ± 1.208
AsnThr: 3.195 ± 0.316
AsnVal: 5.325 ± 0.182
AsnTrp: 0.355 ± 0.322
AsnTyr: 2.84 ± 0.522
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.68 ± 2.051
ProCys: 0.0 ± 0.0
ProAsp: 3.195 ± 0.831
ProGlu: 3.195 ± 0.2
ProPhe: 1.775 ± 1.092
ProGly: 3.55 ± 0.121
ProHis: 0.71 ± 0.127
ProIle: 4.26 ± 0.765
ProLys: 1.42 ± 0.261
ProLeu: 5.325 ± 1.365
ProMet: 0.355 ± 0.194
ProAsn: 1.42 ± 0.255
ProPro: 2.84 ± 0.006
ProGln: 2.485 ± 0.188
ProArg: 3.905 ± 1.62
ProSer: 4.615 ± 0.571
ProThr: 3.905 ± 0.589
ProVal: 3.195 ± 1.863
ProTrp: 0.0 ± 0.0
ProTyr: 1.42 ± 0.255
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.55 ± 1.669
GlnCys: 0.0 ± 0.0
GlnAsp: 2.84 ± 0.522
GlnGlu: 1.775 ± 0.455
GlnPhe: 2.485 ± 0.188
GlnGly: 0.71 ± 0.127
GlnHis: 0.71 ± 0.127
GlnIle: 1.065 ± 0.067
GlnLys: 1.42 ± 0.777
GlnLeu: 2.485 ± 0.704
GlnMet: 0.71 ± 0.388
GlnAsn: 3.195 ± 0.316
GlnPro: 1.065 ± 0.449
GlnGln: 1.42 ± 0.255
GlnArg: 1.065 ± 0.449
GlnSer: 3.905 ± 0.589
GlnThr: 2.13 ± 0.382
GlnVal: 3.55 ± 0.637
GlnTrp: 0.71 ± 0.388
GlnTyr: 0.71 ± 0.127
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.84 ± 0.522
ArgCys: 0.71 ± 0.388
ArgAsp: 2.485 ± 0.844
ArgGlu: 1.065 ± 0.583
ArgPhe: 2.485 ± 1.22
ArgGly: 3.55 ± 0.637
ArgHis: 1.775 ± 0.971
ArgIle: 4.26 ± 0.249
ArgLys: 2.485 ± 0.328
ArgLeu: 3.905 ± 1.475
ArgMet: 1.065 ± 0.067
ArgAsn: 1.42 ± 0.261
ArgPro: 2.84 ± 1.038
ArgGln: 1.42 ± 0.261
ArgArg: 4.97 ± 0.655
ArgSer: 2.13 ± 0.382
ArgThr: 2.485 ± 0.188
ArgVal: 4.26 ± 1.299
ArgTrp: 0.355 ± 0.194
ArgTyr: 2.13 ± 0.382
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.68 ± 1.535
SerCys: 2.13 ± 0.382
SerAsp: 4.615 ± 0.055
SerGlu: 3.195 ± 0.831
SerPhe: 3.55 ± 2.185
SerGly: 7.1 ± 0.759
SerHis: 0.71 ± 0.388
SerIle: 5.325 ± 1.214
SerLys: 2.84 ± 0.522
SerLeu: 9.585 ± 0.431
SerMet: 4.615 ± 0.571
SerAsn: 4.97 ± 0.655
SerPro: 4.97 ± 1.408
SerGln: 3.55 ± 0.637
SerArg: 2.13 ± 0.382
SerSer: 6.39 ± 0.4
SerThr: 5.68 ± 2.051
SerVal: 8.165 ± 0.34
SerTrp: 1.065 ± 0.449
SerTyr: 2.485 ± 0.328
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.97 ± 1.924
ThrCys: 1.065 ± 0.449
ThrAsp: 4.26 ± 0.267
ThrGlu: 1.065 ± 0.583
ThrPhe: 2.84 ± 0.522
ThrGly: 3.55 ± 1.153
ThrHis: 1.42 ± 0.255
ThrIle: 4.26 ± 0.249
ThrLys: 2.485 ± 1.359
ThrLeu: 7.455 ± 1.08
ThrMet: 0.355 ± 0.322
ThrAsn: 3.195 ± 0.316
ThrPro: 2.485 ± 0.704
ThrGln: 3.195 ± 1.347
ThrArg: 1.775 ± 0.455
ThrSer: 6.39 ± 1.147
ThrThr: 2.13 ± 0.382
ThrVal: 5.68 ± 1.535
ThrTrp: 0.71 ± 0.127
ThrTyr: 1.775 ± 0.455
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.615 ± 2.118
ValCys: 1.775 ± 0.455
ValAsp: 2.84 ± 1.542
ValGlu: 3.195 ± 0.716
ValPhe: 4.26 ± 1.299
ValGly: 2.84 ± 1.026
ValHis: 1.42 ± 0.771
ValIle: 3.905 ± 0.959
ValLys: 2.485 ± 0.844
ValLeu: 5.68 ± 0.012
ValMet: 0.71 ± 0.388
ValAsn: 6.035 ± 0.31
ValPro: 5.325 ± 1.214
ValGln: 2.13 ± 0.649
ValArg: 5.325 ± 0.85
ValSer: 6.39 ± 0.631
ValThr: 5.68 ± 1.02
ValVal: 6.035 ± 0.31
ValTrp: 1.065 ± 0.067
ValTyr: 4.26 ± 1.299
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.71 ± 0.127
TrpCys: 0.0 ± 0.0
TrpAsp: 0.71 ± 0.643
TrpGlu: 0.355 ± 0.194
TrpPhe: 0.355 ± 0.322
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.71 ± 0.643
TrpLys: 0.355 ± 0.194
TrpLeu: 1.065 ± 0.067
TrpMet: 0.355 ± 0.194
TrpAsn: 0.355 ± 0.322
TrpPro: 0.0 ± 0.0
TrpGln: 0.355 ± 0.194
TrpArg: 1.065 ± 0.067
TrpSer: 0.71 ± 0.388
TrpThr: 1.065 ± 0.583
TrpVal: 1.065 ± 0.067
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.065 ± 0.583
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.775 ± 0.971
TyrCys: 0.355 ± 0.194
TyrAsp: 3.905 ± 1.104
TyrGlu: 2.485 ± 1.359
TyrPhe: 2.84 ± 0.006
TyrGly: 2.84 ± 0.006
TyrHis: 0.71 ± 0.127
TyrIle: 3.195 ± 0.716
TyrLys: 1.775 ± 0.061
TyrLeu: 2.84 ± 0.006
TyrMet: 0.355 ± 0.194
TyrAsn: 2.485 ± 0.328
TyrPro: 1.775 ± 0.061
TyrGln: 0.71 ± 0.388
TyrArg: 3.195 ± 0.2
TyrSer: 3.195 ± 0.2
TyrThr: 1.42 ± 0.261
TyrVal: 2.13 ± 0.382
TyrTrp: 0.71 ± 0.127
TyrTyr: 2.13 ± 0.382
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2818 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
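A minimal sketch of what protein-level bootstrapping could look like is given below. It resamples whole proteins with replacement 100 times, recomputes the per mille frequency of one dipeptide in each resample, and reports the spread of those estimates. The function names, the use of the population standard deviation as the error measure, one-letter residue codes, and the toy sequences are all assumptions; the exact procedure used by Proteome-pI is not described here.

```python
import random
from collections import Counter
from statistics import pstdev


def dipeptide_counts(seq):
    """Count overlapping residue pairs within a single protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the spread is summarised as a population standard deviation.
    return pstdev(estimates)


# Made-up toy proteome of two short sequences.
toy_proteins = ["MKKLAKK", "ASSLSS"]
print(round(bootstrap_error(toy_proteins, "KK"), 3))
```

With only two proteins, as in this proteome, each resample can contain just three distinct combinations of proteins, so the resulting error estimates should be read with caution.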

You can download the dipeptide statistics above (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski