Amino acid dipeptide frequency for Wenzhou picorna-like virus 15

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
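The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function names, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (assumption).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED = 1000.0 / (len(STANDARD) ** 2)

def classify(freq, expected=EXPECTED):
    """Label a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"

toy = ["MKVLAAGLA", "AAGXLA"]  # made-up sequences, not the real proteome
freqs = dipeptide_per_mille(toy)
```

Here `classify(freqs["LA"])` labels the pair as overrepresented relative to 2.5‰, which corresponds to the red marking described above; the exact threshold used for the colours on the original page is assumed to be this uniform expectation.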

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions; for example, 7.9‰ corresponds to 0.0079 (0.79%).

For more information, see the sequence space article on Wikipedia.

Values are listed as dipeptide: frequency (‰) ± error, grouped by first residue. Within each group the second residue follows this order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.9 ± 0.962
AlaCys: 0.832 ± 0.205
AlaAsp: 5.405 ± 1.662
AlaGlu: 4.158 ± 0.948
AlaPhe: 5.821 ± 0.537
AlaGly: 4.99 ± 0.743
AlaHis: 2.079 ± 0.184
AlaIle: 3.742 ± 0.064
AlaLys: 6.653 ± 0.99
AlaLeu: 8.316 ± 0.58
AlaMet: 2.079 ± 0.474
AlaAsn: 3.326 ± 0.821
AlaPro: 4.574 ± 0.142
AlaGln: 3.326 ± 0.495
AlaArg: 1.663 ± 0.41
AlaSer: 5.821 ± 2.094
AlaThr: 3.742 ± 0.594
AlaVal: 7.069 ± 0.757
AlaTrp: 1.663 ± 1.068
AlaTyr: 1.663 ± 0.41
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.079 ± 0.474
CysCys: 0.0 ± 0.0
CysAsp: 0.832 ± 0.205
CysGlu: 0.416 ± 0.226
CysPhe: 0.0 ± 0.0
CysGly: 1.663 ± 0.905
CysHis: 0.416 ± 0.226
CysIle: 0.416 ± 0.226
CysLys: 0.832 ± 0.453
CysLeu: 1.247 ± 0.679
CysMet: 0.0 ± 0.0
CysAsn: 0.416 ± 0.226
CysPro: 0.832 ± 0.453
CysGln: 0.0 ± 0.0
CysArg: 0.416 ± 0.226
CysSer: 0.416 ± 0.226
CysThr: 0.0 ± 0.0
CysVal: 1.247 ± 0.679
CysTrp: 0.0 ± 0.0
CysTyr: 0.832 ± 0.453
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.742 ± 0.064
AspCys: 0.832 ± 0.453
AspAsp: 2.911 ± 0.269
AspGlu: 4.158 ± 0.948
AspPhe: 3.326 ± 0.163
AspGly: 2.911 ± 0.927
AspHis: 0.416 ± 0.226
AspIle: 4.574 ± 1.457
AspLys: 2.495 ± 0.7
AspLeu: 5.821 ± 0.537
AspMet: 1.247 ± 0.021
AspAsn: 2.079 ± 0.184
AspPro: 3.326 ± 1.478
AspGln: 0.832 ± 0.205
AspArg: 2.079 ± 1.132
AspSer: 3.742 ± 0.594
AspThr: 2.911 ± 1.047
AspVal: 5.405 ± 2.942
AspTrp: 1.247 ± 0.637
AspTyr: 2.495 ± 0.042
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.405 ± 0.969
GluCys: 0.416 ± 0.226
GluAsp: 2.079 ± 1.132
GluGlu: 2.495 ± 0.7
GluPhe: 2.495 ± 1.358
GluGly: 2.079 ± 0.184
GluHis: 0.832 ± 0.453
GluIle: 3.742 ± 1.379
GluLys: 5.405 ± 0.969
GluLeu: 4.99 ± 1.4
GluMet: 0.832 ± 0.205
GluAsn: 2.079 ± 0.184
GluPro: 0.832 ± 0.453
GluGln: 1.663 ± 0.248
GluArg: 3.742 ± 0.721
GluSer: 1.663 ± 0.248
GluThr: 2.079 ± 0.474
GluVal: 3.742 ± 0.594
GluTrp: 0.416 ± 0.226
GluTyr: 1.663 ± 0.41
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.742 ± 0.594
PheCys: 1.247 ± 0.679
PheAsp: 2.079 ± 1.132
PheGlu: 4.158 ± 0.948
PhePhe: 4.158 ± 0.29
PheGly: 2.495 ± 1.273
PheHis: 1.247 ± 0.637
PheIle: 0.832 ± 0.453
PheLys: 2.495 ± 0.042
PheLeu: 3.326 ± 0.163
PheMet: 0.832 ± 0.453
PheAsn: 3.742 ± 1.252
PhePro: 2.079 ± 0.474
PheGln: 1.663 ± 0.41
PheArg: 4.158 ± 0.29
PheSer: 4.158 ± 0.368
PheThr: 3.742 ± 1.252
PheVal: 5.821 ± 0.12
PheTrp: 0.416 ± 0.226
PheTyr: 1.663 ± 0.248
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.99 ± 1.231
GlyCys: 0.416 ± 0.226
GlyAsp: 3.326 ± 0.495
GlyGlu: 2.079 ± 0.842
GlyPhe: 3.742 ± 0.721
GlyGly: 3.326 ± 1.478
GlyHis: 1.663 ± 0.248
GlyIle: 4.574 ± 0.799
GlyLys: 3.326 ± 0.495
GlyLeu: 4.158 ± 2.263
GlyMet: 0.832 ± 0.453
GlyAsn: 1.663 ± 0.41
GlyPro: 3.742 ± 0.594
GlyGln: 2.495 ± 0.042
GlyArg: 3.326 ± 0.495
GlySer: 6.237 ± 2.525
GlyThr: 4.574 ± 1.457
GlyVal: 4.574 ± 0.516
GlyTrp: 1.663 ± 0.248
GlyTyr: 3.326 ± 0.495
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.416 ± 0.226
HisCys: 0.0 ± 0.0
HisAsp: 1.247 ± 0.637
HisGlu: 0.416 ± 0.226
HisPhe: 0.832 ± 0.453
HisGly: 1.247 ± 0.679
HisHis: 0.832 ± 0.453
HisIle: 0.416 ± 0.431
HisLys: 1.247 ± 0.021
HisLeu: 1.247 ± 0.679
HisMet: 0.416 ± 0.226
HisAsn: 0.416 ± 0.226
HisPro: 0.416 ± 0.431
HisGln: 0.832 ± 0.205
HisArg: 1.247 ± 0.679
HisSer: 0.832 ± 0.453
HisThr: 1.663 ± 0.248
HisVal: 1.663 ± 0.248
HisTrp: 2.079 ± 0.474
HisTyr: 0.416 ± 0.226
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.574 ± 0.142
IleCys: 0.416 ± 0.226
IleAsp: 2.079 ± 1.132
IleGlu: 3.742 ± 0.721
IlePhe: 2.495 ± 0.615
IleGly: 4.158 ± 0.368
IleHis: 0.416 ± 0.226
IleIle: 1.663 ± 0.41
IleLys: 2.495 ± 0.042
IleLeu: 3.326 ± 1.153
IleMet: 2.079 ± 0.474
IleAsn: 3.326 ± 1.478
IlePro: 1.663 ± 0.248
IleGln: 2.079 ± 0.474
IleArg: 1.247 ± 0.679
IleSer: 4.158 ± 1.026
IleThr: 2.911 ± 0.927
IleVal: 3.742 ± 2.568
IleTrp: 1.247 ± 0.021
IleTyr: 1.247 ± 1.294
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.495 ± 0.7
LysCys: 0.832 ± 0.453
LysAsp: 3.742 ± 1.379
LysGlu: 2.495 ± 1.358
LysPhe: 1.663 ± 1.068
LysGly: 4.99 ± 0.743
LysHis: 1.247 ± 0.679
LysIle: 2.911 ± 0.389
LysLys: 0.832 ± 0.453
LysLeu: 4.99 ± 1.4
LysMet: 0.416 ± 0.226
LysAsn: 1.663 ± 0.248
LysPro: 2.911 ± 0.269
LysGln: 1.663 ± 0.248
LysArg: 3.742 ± 1.379
LysSer: 7.069 ± 1.874
LysThr: 2.911 ± 0.389
LysVal: 0.832 ± 0.205
LysTrp: 0.832 ± 0.453
LysTyr: 2.495 ± 0.042
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.563 ± 0.057
LeuCys: 0.832 ± 0.453
LeuAsp: 4.99 ± 1.4
LeuGlu: 3.326 ± 0.495
LeuPhe: 3.326 ± 0.163
LeuGly: 4.158 ± 0.948
LeuHis: 2.079 ± 0.474
LeuIle: 6.653 ± 2.306
LeuLys: 4.574 ± 1.832
LeuLeu: 7.069 ± 1.874
LeuMet: 2.079 ± 0.474
LeuAsn: 4.158 ± 1.026
LeuPro: 4.158 ± 0.368
LeuGln: 3.326 ± 1.153
LeuArg: 5.821 ± 0.12
LeuSer: 6.237 ± 2.525
LeuThr: 3.326 ± 0.163
LeuVal: 4.99 ± 1.4
LeuTrp: 1.247 ± 0.637
LeuTyr: 3.326 ± 0.495
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.079 ± 0.184
MetCys: 0.832 ± 0.453
MetAsp: 2.079 ± 0.474
MetGlu: 0.832 ± 0.453
MetPhe: 1.663 ± 0.41
MetGly: 2.079 ± 0.184
MetHis: 0.416 ± 0.226
MetIle: 1.247 ± 0.021
MetLys: 2.079 ± 1.132
MetLeu: 3.326 ± 0.495
MetMet: 0.832 ± 0.453
MetAsn: 0.0 ± 0.0
MetPro: 0.416 ± 0.226
MetGln: 0.416 ± 0.431
MetArg: 2.495 ± 0.615
MetSer: 2.079 ± 0.842
MetThr: 1.247 ± 0.021
MetVal: 2.079 ± 0.474
MetTrp: 0.0 ± 0.0
MetTyr: 0.832 ± 0.453
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.079 ± 1.499
AsnCys: 0.832 ± 0.453
AsnAsp: 0.832 ± 0.453
AsnGlu: 0.832 ± 0.453
AsnPhe: 2.079 ± 1.499
AsnGly: 3.326 ± 0.495
AsnHis: 0.832 ± 0.453
AsnIle: 3.742 ± 0.064
AsnLys: 0.416 ± 0.226
AsnLeu: 4.158 ± 1.026
AsnMet: 1.247 ± 0.308
AsnAsn: 2.911 ± 0.269
AsnPro: 4.158 ± 0.948
AsnGln: 1.663 ± 0.248
AsnArg: 0.416 ± 0.226
AsnSer: 3.326 ± 1.478
AsnThr: 4.574 ± 2.115
AsnVal: 2.911 ± 0.269
AsnTrp: 0.832 ± 0.205
AsnTyr: 0.832 ± 0.205
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.574 ± 1.174
ProCys: 0.0 ± 0.0
ProAsp: 2.911 ± 0.269
ProGlu: 2.495 ± 0.042
ProPhe: 2.495 ± 1.273
ProGly: 2.911 ± 0.389
ProHis: 1.247 ± 0.679
ProIle: 2.495 ± 0.615
ProLys: 1.663 ± 0.248
ProLeu: 3.742 ± 1.252
ProMet: 1.663 ± 0.41
ProAsn: 2.079 ± 0.184
ProPro: 1.663 ± 1.068
ProGln: 3.326 ± 0.821
ProArg: 3.326 ± 1.153
ProSer: 3.742 ± 2.568
ProThr: 2.079 ± 0.842
ProVal: 7.9 ± 1.011
ProTrp: 0.832 ± 0.205
ProTyr: 1.247 ± 0.021
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.326 ± 0.821
GlnCys: 1.247 ± 0.679
GlnAsp: 2.911 ± 0.269
GlnGlu: 1.247 ± 0.679
GlnPhe: 1.247 ± 0.021
GlnGly: 1.663 ± 0.248
GlnHis: 0.416 ± 0.431
GlnIle: 0.832 ± 0.205
GlnLys: 1.663 ± 0.41
GlnLeu: 2.079 ± 0.184
GlnMet: 0.0 ± 0.0
GlnAsn: 2.079 ± 0.184
GlnPro: 3.742 ± 1.91
GlnGln: 0.416 ± 0.226
GlnArg: 2.079 ± 0.842
GlnSer: 2.911 ± 0.389
GlnThr: 2.911 ± 0.269
GlnVal: 2.079 ± 0.842
GlnTrp: 1.247 ± 0.021
GlnTyr: 1.663 ± 0.248
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.742 ± 0.721
ArgCys: 0.416 ± 0.226
ArgAsp: 2.079 ± 0.474
ArgGlu: 2.495 ± 0.042
ArgPhe: 5.821 ± 0.537
ArgGly: 4.574 ± 1.457
ArgHis: 1.247 ± 0.679
ArgIle: 1.663 ± 1.068
ArgLys: 2.495 ± 0.7
ArgLeu: 4.158 ± 2.263
ArgMet: 0.832 ± 0.453
ArgAsn: 2.911 ± 0.927
ArgPro: 3.326 ± 0.495
ArgGln: 1.247 ± 0.021
ArgArg: 2.495 ± 0.7
ArgSer: 2.495 ± 0.042
ArgThr: 3.326 ± 1.153
ArgVal: 4.574 ± 0.799
ArgTrp: 0.416 ± 0.226
ArgTyr: 1.247 ± 0.679
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.484 ± 0.531
SerCys: 1.663 ± 0.905
SerAsp: 5.821 ± 0.778
SerGlu: 3.326 ± 0.163
SerPhe: 4.158 ± 1.026
SerGly: 2.911 ± 0.269
SerHis: 0.416 ± 0.226
SerIle: 4.158 ± 0.29
SerLys: 4.158 ± 0.29
SerLeu: 5.821 ± 2.751
SerMet: 1.663 ± 0.248
SerAsn: 1.247 ± 0.679
SerPro: 0.832 ± 0.205
SerGln: 1.663 ± 0.41
SerArg: 3.742 ± 0.721
SerSer: 4.158 ± 2.341
SerThr: 9.148 ± 5.545
SerVal: 4.99 ± 3.204
SerTrp: 1.247 ± 1.294
SerTyr: 4.574 ± 0.142
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.911 ± 1.705
ThrCys: 0.0 ± 0.0
ThrAsp: 2.911 ± 0.389
ThrGlu: 2.495 ± 1.358
ThrPhe: 3.326 ± 0.495
ThrGly: 3.742 ± 1.91
ThrHis: 0.416 ± 0.226
ThrIle: 2.495 ± 0.042
ThrLys: 2.079 ± 0.474
ThrLeu: 4.99 ± 0.085
ThrMet: 2.911 ± 1.047
ThrAsn: 2.911 ± 0.389
ThrPro: 5.821 ± 2.094
ThrGln: 3.742 ± 2.568
ThrArg: 2.495 ± 0.042
ThrSer: 5.405 ± 0.311
ThrThr: 4.158 ± 1.026
ThrVal: 4.99 ± 0.573
ThrTrp: 1.663 ± 1.068
ThrTyr: 1.247 ± 0.637
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.148 ± 0.283
ValCys: 0.832 ± 0.453
ValAsp: 4.574 ± 2.773
ValGlu: 4.99 ± 0.085
ValPhe: 3.326 ± 1.153
ValGly: 7.9 ± 1.62
ValHis: 0.832 ± 0.453
ValIle: 2.079 ± 0.842
ValLys: 3.326 ± 1.153
ValLeu: 6.653 ± 0.332
ValMet: 3.742 ± 0.721
ValAsn: 1.247 ± 0.679
ValPro: 5.821 ± 0.12
ValGln: 2.495 ± 1.273
ValArg: 3.742 ± 0.721
ValSer: 4.574 ± 0.799
ValThr: 2.911 ± 0.269
ValVal: 7.069 ± 0.099
ValTrp: 2.079 ± 1.132
ValTyr: 2.495 ± 0.042
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.832 ± 0.863
TrpCys: 0.0 ± 0.0
TrpAsp: 0.832 ± 0.205
TrpGlu: 0.416 ± 0.226
TrpPhe: 0.832 ± 0.453
TrpGly: 0.416 ± 0.226
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.247 ± 0.637
TrpLeu: 2.911 ± 0.269
TrpMet: 1.663 ± 0.41
TrpAsn: 1.663 ± 0.41
TrpPro: 0.416 ± 0.431
TrpGln: 1.247 ± 0.637
TrpArg: 2.079 ± 0.184
TrpSer: 2.079 ± 0.474
TrpThr: 0.416 ± 0.431
TrpVal: 1.663 ± 0.248
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.832 ± 0.453
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.742 ± 0.064
TyrCys: 0.416 ± 0.431
TyrAsp: 2.911 ± 0.269
TyrGlu: 2.079 ± 0.474
TyrPhe: 1.247 ± 0.637
TyrGly: 2.495 ± 0.042
TyrHis: 0.832 ± 0.863
TyrIle: 0.832 ± 0.205
TyrLys: 1.247 ± 0.679
TyrLeu: 2.911 ± 0.389
TyrMet: 1.247 ± 0.021
TyrAsn: 2.495 ± 0.615
TyrPro: 1.663 ± 0.248
TyrGln: 1.663 ± 0.248
TyrArg: 1.247 ± 0.021
TyrSer: 2.079 ± 0.842
TyrThr: 2.079 ± 1.132
TyrVal: 2.495 ± 1.358
TyrTrp: 0.416 ± 0.226
TyrTyr: 0.832 ± 0.453
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2406 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
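A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: the function names are hypothetical, whole proteins are resampled with replacement, and the spread is reported as a standard deviation over 100 resamples. The exact procedure and error statistic used by Proteome-pI are not given on this page.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread (population std. dev.) of the per-mille frequency of `pair`
    across bootstrap resamples drawn at the protein level."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)
```

With only 2 proteins in this proteome, each resample can contain just three distinct combinations of proteins, so errors estimated this way are coarse and should be read as rough indications.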

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski