Amino acid dipeptide frequency for Wenzhou picorna-like virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
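As a rough illustration of what such a calculation involves, the sketch below counts overlapping residue pairs in a set of protein sequences and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter input sequences, and the choice to normalize by the total number of dipeptides are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: frequencies are normalized by the total number of dipeptides
    counted across all sequences; the database may normalize differently.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along each protein separately,
        # so that no pair spans the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter codes).
freqs = dipeptide_per_mille(["MAAK", "AAC"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

In this toy input there are five dipeptides in total, two of which are AA, so AA comes out at 400 per mille.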

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
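The uniform expectation and the over/under marking can be sketched as follows. This assumes the threshold is the uniform expectation over the 400 standard dipeptides (2.5‰) and that the comparison is a plain greater-than or less-than test; the page does not state the exact rule it uses for coloring, and the `classify` helper is hypothetical. The three example values are taken from the table.

```python
STANDARD_AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, Xaa excluded

n_pairs = len(STANDARD_AMINO_ACIDS) ** 2   # 20 * 20 = 400 ordered pairs
expected_per_mille = 1000.0 / n_pairs      # 2.5 per mille, i.e. 0.25%


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"    # would be marked red
    if observed_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"


# Values copied from the table (per mille).
examples = {"AlaAla": 11.852, "CysCys": 0.0, "AlaAsn": 2.593}
labels = {name: classify(value) for name, value in examples.items()}
print(n_pairs, expected_per_mille, labels)
```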

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
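For example, the AlaAla entry of 11.852‰ corresponds to a fraction of about 0.011852, or about 1.1852%. A minimal sketch of the conversion (the helper names are made up for this example):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 11.852  # AlaAla entry from the table, in per mille
print(per_mille_to_fraction(ala_ala), per_mille_to_percent(ala_ala))
```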

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by first residue; each line gives the dipeptide and its frequency ± error (‰).

Ala
AlaAla: 11.852 ± 0.432
AlaCys: 0.741 ± 0.421
AlaAsp: 4.444 ± 0.627
AlaGlu: 5.185 ± 0.205
AlaPhe: 2.963 ± 0.838
AlaGly: 4.444 ± 1.258
AlaHis: 1.111 ± 0.632
AlaIle: 5.556 ± 0.626
AlaLys: 6.296 ± 1.689
AlaLeu: 4.444 ± 1.266
AlaMet: 2.222 ± 1.264
AlaAsn: 2.593 ± 1.68
AlaPro: 4.444 ± 1.889
AlaGln: 2.222 ± 1.264
AlaArg: 5.185 ± 1.057
AlaSer: 3.704 ± 1.679
AlaThr: 4.074 ± 2.099
AlaVal: 6.667 ± 0.625
AlaTrp: 1.111 ± 0.001
AlaTyr: 4.074 ± 0.837
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.37 ± 0.42
CysCys: 0.0 ± 0.0
CysAsp: 0.741 ± 0.421
CysGlu: 1.481 ± 0.843
CysPhe: 2.222 ± 0.629
CysGly: 0.37 ± 0.211
CysHis: 0.0 ± 0.0
CysIle: 0.741 ± 0.421
CysLys: 1.481 ± 0.212
CysLeu: 1.111 ± 0.001
CysMet: 1.111 ± 0.001
CysAsn: 1.481 ± 0.843
CysPro: 1.852 ± 1.053
CysGln: 0.0 ± 0.0
CysArg: 0.741 ± 0.21
CysSer: 1.111 ± 0.63
CysThr: 1.111 ± 0.632
CysVal: 1.111 ± 0.632
CysTrp: 0.37 ± 0.211
CysTyr: 0.37 ± 0.211
CysXaa: 0.0 ± 0.0

Asp
AspAla: 7.037 ± 0.217
AspCys: 0.741 ± 0.21
AspAsp: 6.296 ± 1.689
AspGlu: 4.074 ± 0.206
AspPhe: 4.444 ± 1.266
AspGly: 3.704 ± 0.214
AspHis: 1.111 ± 0.001
AspIle: 4.444 ± 0.004
AspLys: 2.593 ± 1.475
AspLeu: 4.815 ± 1.477
AspMet: 1.852 ± 0.209
AspAsn: 2.963 ± 0.838
AspPro: 2.963 ± 2.1
AspGln: 1.481 ± 0.212
AspArg: 1.111 ± 0.63
AspSer: 3.704 ± 0.214
AspThr: 1.481 ± 0.212
AspVal: 4.444 ± 1.266
AspTrp: 1.481 ± 1.05
AspTyr: 2.593 ± 0.213
AspXaa: 0.0 ± 0.0

Glu
GluAla: 2.222 ± 0.002
GluCys: 0.741 ± 0.421
GluAsp: 3.333 ± 1.265
GluGlu: 2.963 ± 0.208
GluPhe: 2.222 ± 0.629
GluGly: 2.593 ± 0.844
GluHis: 2.222 ± 0.633
GluIle: 2.963 ± 0.423
GluLys: 5.185 ± 2.318
GluLeu: 6.667 ± 0.625
GluMet: 2.593 ± 0.213
GluAsn: 1.111 ± 0.001
GluPro: 2.963 ± 0.208
GluGln: 1.852 ± 1.053
GluArg: 4.074 ± 1.055
GluSer: 4.074 ± 1.468
GluThr: 4.815 ± 0.846
GluVal: 6.296 ± 0.204
GluTrp: 0.741 ± 0.421
GluTyr: 3.704 ± 0.845
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.704 ± 1.679
PheCys: 1.111 ± 0.001
PheAsp: 2.593 ± 0.418
PheGlu: 2.963 ± 1.469
PhePhe: 2.593 ± 1.049
PheGly: 2.222 ± 0.002
PheHis: 1.481 ± 0.212
PheIle: 2.222 ± 1.264
PheLys: 3.333 ± 1.265
PheLeu: 2.222 ± 0.002
PheMet: 2.593 ± 0.213
PheAsn: 1.852 ± 0.209
PhePro: 1.852 ± 0.209
PheGln: 2.593 ± 0.213
PheArg: 2.963 ± 1.054
PheSer: 2.593 ± 0.418
PheThr: 1.852 ± 1.47
PheVal: 4.815 ± 0.846
PheTrp: 0.0 ± 0.0
PheTyr: 0.37 ± 0.211
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.667 ± 1.887
GlyCys: 1.481 ± 0.212
GlyAsp: 1.852 ± 0.84
GlyGlu: 3.704 ± 1.679
GlyPhe: 2.222 ± 0.633
GlyGly: 3.333 ± 1.259
GlyHis: 1.111 ± 0.001
GlyIle: 4.444 ± 1.897
GlyLys: 2.593 ± 0.844
GlyLeu: 3.704 ± 0.214
GlyMet: 1.481 ± 0.212
GlyAsn: 2.963 ± 0.423
GlyPro: 0.741 ± 0.21
GlyGln: 1.481 ± 1.681
GlyArg: 3.333 ± 0.634
GlySer: 3.333 ± 1.89
GlyThr: 5.185 ± 0.836
GlyVal: 4.074 ± 1.055
GlyTrp: 2.963 ± 0.208
GlyTyr: 3.333 ± 0.634
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.222 ± 0.002
HisCys: 0.37 ± 0.211
HisAsp: 1.481 ± 0.212
HisGlu: 1.111 ± 0.632
HisPhe: 0.37 ± 0.42
HisGly: 2.593 ± 0.844
HisHis: 0.0 ± 0.0
HisIle: 1.852 ± 0.422
HisLys: 1.111 ± 0.001
HisLeu: 1.111 ± 0.001
HisMet: 0.741 ± 0.841
HisAsn: 0.741 ± 0.421
HisPro: 1.481 ± 0.212
HisGln: 0.37 ± 0.211
HisArg: 0.0 ± 0.0
HisSer: 1.481 ± 1.05
HisThr: 1.111 ± 0.63
HisVal: 1.481 ± 0.419
HisTrp: 0.0 ± 0.0
HisTyr: 0.741 ± 0.841
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.667 ± 0.006
IleCys: 0.37 ± 0.42
IleAsp: 6.296 ± 1.058
IleGlu: 2.963 ± 0.423
IlePhe: 0.741 ± 0.421
IleGly: 3.333 ± 0.628
IleHis: 1.481 ± 0.419
IleIle: 2.963 ± 1.054
IleLys: 4.074 ± 1.055
IleLeu: 2.963 ± 1.054
IleMet: 2.963 ± 1.054
IleAsn: 2.593 ± 0.418
IlePro: 3.704 ± 0.845
IleGln: 2.593 ± 0.213
IleArg: 2.593 ± 0.844
IleSer: 2.963 ± 0.208
IleThr: 2.222 ± 0.002
IleVal: 2.222 ± 0.633
IleTrp: 0.37 ± 0.211
IleTyr: 1.111 ± 0.001
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.704 ± 2.107
LysCys: 2.222 ± 1.264
LysAsp: 3.333 ± 0.634
LysGlu: 3.333 ± 1.896
LysPhe: 5.556 ± 1.267
LysGly: 3.333 ± 1.265
LysHis: 1.852 ± 0.422
LysIle: 3.333 ± 1.265
LysLys: 5.926 ± 2.74
LysLeu: 5.185 ± 0.205
LysMet: 1.481 ± 0.212
LysAsn: 2.963 ± 0.423
LysPro: 1.111 ± 0.001
LysGln: 1.481 ± 0.212
LysArg: 5.185 ± 1.057
LysSer: 4.444 ± 2.528
LysThr: 4.815 ± 2.108
LysVal: 1.111 ± 0.632
LysTrp: 0.37 ± 0.211
LysTyr: 3.704 ± 1.048
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.074 ± 1.055
LeuCys: 1.111 ± 0.632
LeuAsp: 5.185 ± 0.836
LeuGlu: 2.593 ± 0.213
LeuPhe: 2.963 ± 0.423
LeuGly: 5.185 ± 1.057
LeuHis: 1.852 ± 0.84
LeuIle: 2.963 ± 1.054
LeuLys: 4.074 ± 0.424
LeuLeu: 3.333 ± 0.003
LeuMet: 1.111 ± 0.161
LeuAsn: 2.963 ± 1.054
LeuPro: 3.333 ± 0.628
LeuGln: 2.593 ± 0.844
LeuArg: 5.556 ± 0.636
LeuSer: 4.815 ± 1.047
LeuThr: 4.815 ± 0.416
LeuVal: 5.185 ± 1.467
LeuTrp: 1.852 ± 0.209
LeuTyr: 2.222 ± 0.629
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.593 ± 0.844
MetCys: 0.0 ± 0.0
MetAsp: 2.593 ± 0.213
MetGlu: 1.852 ± 0.422
MetPhe: 1.111 ± 0.001
MetGly: 2.963 ± 0.423
MetHis: 0.741 ± 0.421
MetIle: 1.111 ± 0.001
MetLys: 2.222 ± 0.633
MetLeu: 1.111 ± 0.001
MetMet: 1.852 ± 1.053
MetAsn: 2.222 ± 0.633
MetPro: 0.741 ± 0.841
MetGln: 1.852 ± 0.84
MetArg: 1.111 ± 0.001
MetSer: 2.222 ± 0.002
MetThr: 2.963 ± 0.423
MetVal: 1.852 ± 0.209
MetTrp: 1.111 ± 0.001
MetTyr: 1.481 ± 1.05
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.074 ± 0.424
AsnCys: 0.37 ± 0.211
AsnAsp: 2.222 ± 0.629
AsnGlu: 3.333 ± 0.634
AsnPhe: 2.222 ± 0.629
AsnGly: 2.593 ± 1.049
AsnHis: 0.741 ± 0.21
AsnIle: 4.074 ± 0.424
AsnLys: 1.852 ± 1.053
AsnLeu: 2.593 ± 1.049
AsnMet: 1.111 ± 0.632
AsnAsn: 2.222 ± 0.633
AsnPro: 2.222 ± 0.002
AsnGln: 1.481 ± 0.419
AsnArg: 0.741 ± 0.21
AsnSer: 3.333 ± 0.003
AsnThr: 4.815 ± 1.678
AsnVal: 3.704 ± 1.679
AsnTrp: 1.111 ± 0.632
AsnTyr: 1.481 ± 1.05
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.593 ± 1.68
ProCys: 0.741 ± 0.21
ProAsp: 2.222 ± 0.002
ProGlu: 3.704 ± 2.107
ProPhe: 1.481 ± 1.05
ProGly: 4.444 ± 2.52
ProHis: 2.963 ± 0.208
ProIle: 2.963 ± 0.423
ProLys: 1.852 ± 0.422
ProLeu: 3.704 ± 0.417
ProMet: 1.111 ± 0.001
ProAsn: 2.593 ± 0.418
ProPro: 1.111 ± 0.632
ProGln: 1.852 ± 0.209
ProArg: 1.111 ± 0.001
ProSer: 3.333 ± 1.259
ProThr: 4.074 ± 0.837
ProVal: 3.333 ± 0.628
ProTrp: 0.741 ± 0.841
ProTyr: 1.111 ± 1.261
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.333 ± 0.628
GlnCys: 0.37 ± 0.211
GlnAsp: 2.222 ± 0.633
GlnGlu: 3.704 ± 0.417
GlnPhe: 0.37 ± 0.42
GlnGly: 2.593 ± 0.418
GlnHis: 0.0 ± 0.0
GlnIle: 1.481 ± 0.843
GlnLys: 1.852 ± 1.053
GlnLeu: 4.815 ± 0.215
GlnMet: 1.111 ± 0.63
GlnAsn: 0.741 ± 0.21
GlnPro: 2.963 ± 0.208
GlnGln: 1.111 ± 0.632
GlnArg: 2.593 ± 1.049
GlnSer: 1.481 ± 0.419
GlnThr: 0.37 ± 0.42
GlnVal: 0.37 ± 0.211
GlnTrp: 0.37 ± 0.211
GlnTyr: 0.741 ± 0.421
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.444 ± 1.258
ArgCys: 0.741 ± 0.421
ArgAsp: 4.074 ± 1.055
ArgGlu: 4.074 ± 0.424
ArgPhe: 4.074 ± 0.206
ArgGly: 2.593 ± 0.213
ArgHis: 0.741 ± 0.841
ArgIle: 3.704 ± 0.214
ArgLys: 3.333 ± 1.265
ArgLeu: 3.333 ± 0.628
ArgMet: 1.481 ± 0.843
ArgAsn: 2.593 ± 0.418
ArgPro: 4.074 ± 0.837
ArgGln: 1.852 ± 0.422
ArgArg: 3.704 ± 1.476
ArgSer: 2.222 ± 1.264
ArgThr: 2.222 ± 0.002
ArgVal: 3.704 ± 0.845
ArgTrp: 1.111 ± 0.001
ArgTyr: 1.852 ± 0.84
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.704 ± 1.679
SerCys: 1.111 ± 0.001
SerAsp: 4.815 ± 0.416
SerGlu: 4.074 ± 0.206
SerPhe: 1.481 ± 0.212
SerGly: 2.222 ± 1.26
SerHis: 0.37 ± 0.42
SerIle: 3.333 ± 0.628
SerLys: 5.185 ± 0.426
SerLeu: 4.444 ± 1.266
SerMet: 1.852 ± 1.059
SerAsn: 1.481 ± 0.212
SerPro: 1.481 ± 0.419
SerGln: 3.333 ± 1.259
SerArg: 3.333 ± 0.628
SerSer: 4.444 ± 1.889
SerThr: 2.593 ± 0.418
SerVal: 7.037 ± 0.217
SerTrp: 1.111 ± 0.001
SerTyr: 1.481 ± 0.419
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.333 ± 1.89
ThrCys: 1.481 ± 0.843
ThrAsp: 2.593 ± 0.213
ThrGlu: 4.815 ± 1.477
ThrPhe: 2.963 ± 0.208
ThrGly: 2.963 ± 1.054
ThrHis: 1.111 ± 0.001
ThrIle: 2.222 ± 1.891
ThrLys: 3.333 ± 1.896
ThrLeu: 5.556 ± 3.15
ThrMet: 2.963 ± 0.423
ThrAsn: 3.704 ± 1.679
ThrPro: 3.704 ± 1.048
ThrGln: 2.222 ± 0.002
ThrArg: 1.852 ± 0.209
ThrSer: 3.704 ± 2.107
ThrThr: 6.667 ± 1.887
ThrVal: 4.074 ± 1.468
ThrTrp: 0.37 ± 0.211
ThrTyr: 2.222 ± 0.002
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.667 ± 2.53
ValCys: 2.222 ± 0.002
ValAsp: 4.074 ± 0.837
ValGlu: 3.704 ± 2.107
ValPhe: 3.333 ± 0.634
ValGly: 4.444 ± 0.627
ValHis: 1.111 ± 0.001
ValIle: 3.704 ± 0.845
ValLys: 3.704 ± 0.214
ValLeu: 2.963 ± 1.054
ValMet: 1.852 ± 0.84
ValAsn: 4.444 ± 1.889
ValPro: 4.444 ± 2.52
ValGln: 0.0 ± 0.0
ValArg: 6.296 ± 0.427
ValSer: 1.852 ± 1.47
ValThr: 4.074 ± 0.424
ValVal: 4.074 ± 0.837
ValTrp: 1.111 ± 0.632
ValTyr: 3.333 ± 1.259
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.111 ± 1.261
TrpCys: 0.741 ± 0.421
TrpAsp: 1.111 ± 0.632
TrpGlu: 1.111 ± 0.632
TrpPhe: 1.111 ± 0.632
TrpGly: 1.111 ± 0.63
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 2.222 ± 1.264
TrpLeu: 1.111 ± 0.632
TrpMet: 0.741 ± 0.841
TrpAsn: 1.481 ± 1.05
TrpPro: 0.37 ± 0.42
TrpGln: 0.741 ± 0.21
TrpArg: 1.481 ± 1.05
TrpSer: 0.37 ± 0.211
TrpThr: 1.481 ± 0.843
TrpVal: 0.37 ± 0.211
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.37 ± 0.42
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.963 ± 0.423
TyrCys: 1.111 ± 0.63
TyrAsp: 1.852 ± 0.209
TyrGlu: 2.593 ± 0.418
TyrPhe: 1.481 ± 1.05
TyrGly: 2.963 ± 0.208
TyrHis: 0.37 ± 0.42
TyrIle: 1.111 ± 0.63
TyrLys: 2.222 ± 1.26
TyrLeu: 2.593 ± 0.213
TyrMet: 1.111 ± 0.632
TyrAsn: 2.593 ± 0.213
TyrPro: 1.481 ± 0.843
TyrGln: 1.481 ± 1.05
TyrArg: 2.963 ± 0.838
TyrSer: 4.074 ± 1.468
TyrThr: 1.111 ± 0.001
TyrVal: 1.481 ± 0.419
TyrTrp: 0.741 ± 0.841
TyrTyr: 1.481 ± 1.05
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2 proteins (2701 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
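A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values serves as the error. This is only an illustration under stated assumptions. The function names, the one-letter input sequences, the fixed random seed, and the use of the sample standard deviation as the reported error are all choices made here, not details confirmed by the source.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: the error is the standard deviation of the frequency over
    resamples in which whole proteins are drawn with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = [rng.choice(per_protein) for _ in proteins]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy proteome of two hypothetical proteins.
print(bootstrap_error(["MAAKAA", "MKKLV"], "AA"))
```

With only two proteins, as in this proteome, each resample is one of very few possible combinations, so error estimates obtained this way are necessarily coarse.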

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski