Amino acid dipepetide frequency for Wenzhou picorna-like virus 31

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.918AlaAla: 8.918 ± 3.175
0.811AlaCys: 0.811 ± 0.416
3.243AlaAsp: 3.243 ± 0.255
4.054AlaGlu: 4.054 ± 1.377
2.837AlaPhe: 2.837 ± 0.658
8.107AlaGly: 8.107 ± 0.771
1.216AlaHis: 1.216 ± 0.625
4.459AlaIle: 4.459 ± 0.175
1.621AlaLys: 1.621 ± 0.128
6.486AlaLeu: 6.486 ± 0.899
2.432AlaMet: 2.432 ± 0.161
4.054AlaAsn: 4.054 ± 1.443
3.648AlaPro: 3.648 ± 0.241
1.621AlaGln: 1.621 ± 0.128
2.432AlaArg: 2.432 ± 1.249
6.08AlaSer: 6.08 ± 4.632
6.486AlaThr: 6.486 ± 3.719
8.918AlaVal: 8.918 ± 0.35
0.811AlaTrp: 0.811 ± 0.289
3.243AlaTyr: 3.243 ± 0.45
0.0AlaXaa: 0.0 ± 0.0
Cys
0.811CysAla: 0.811 ± 0.289
0.405CysCys: 0.405 ± 0.208
0.0CysAsp: 0.0 ± 0.0
1.621CysGlu: 1.621 ± 0.833
2.027CysPhe: 2.027 ± 1.074
1.216CysGly: 1.216 ± 0.08
0.405CysHis: 0.405 ± 0.208
1.216CysIle: 1.216 ± 0.08
0.405CysLys: 0.405 ± 0.208
1.216CysLeu: 1.216 ± 0.625
0.0CysMet: 0.0 ± 0.0
0.405CysAsn: 0.405 ± 0.497
1.621CysPro: 1.621 ± 0.128
0.405CysGln: 0.405 ± 0.208
0.811CysArg: 0.811 ± 0.416
1.216CysSer: 1.216 ± 0.625
0.405CysThr: 0.405 ± 0.208
0.811CysVal: 0.811 ± 0.416
0.0CysTrp: 0.0 ± 0.0
0.405CysTyr: 0.405 ± 0.208
0.0CysXaa: 0.0 ± 0.0
Asp
4.054AspAla: 4.054 ± 0.672
1.621AspCys: 1.621 ± 0.833
3.648AspAsp: 3.648 ± 0.464
2.432AspGlu: 2.432 ± 0.161
2.837AspPhe: 2.837 ± 0.752
4.054AspGly: 4.054 ± 0.738
0.405AspHis: 0.405 ± 0.208
2.837AspIle: 2.837 ± 0.047
1.621AspLys: 1.621 ± 0.833
3.648AspLeu: 3.648 ± 1.169
2.432AspMet: 2.432 ± 0.161
3.243AspAsn: 3.243 ± 0.45
0.811AspPro: 0.811 ± 0.289
1.621AspGln: 1.621 ± 0.128
0.811AspArg: 0.811 ± 0.416
4.054AspSer: 4.054 ± 0.672
0.811AspThr: 0.811 ± 0.416
6.891AspVal: 6.891 ± 1.424
1.621AspTrp: 1.621 ± 0.128
0.811AspTyr: 0.811 ± 0.416
0.0AspXaa: 0.0 ± 0.0
Glu
4.054GluAla: 4.054 ± 1.377
0.811GluCys: 0.811 ± 0.416
2.432GluAsp: 2.432 ± 0.544
3.648GluGlu: 3.648 ± 1.874
3.243GluPhe: 3.243 ± 0.45
3.648GluGly: 3.648 ± 1.874
0.811GluHis: 0.811 ± 0.416
3.648GluIle: 3.648 ± 1.874
3.243GluLys: 3.243 ± 0.96
4.459GluLeu: 4.459 ± 0.175
1.216GluMet: 1.216 ± 0.785
1.216GluAsn: 1.216 ± 0.625
0.811GluPro: 0.811 ± 0.416
3.243GluGln: 3.243 ± 0.96
2.432GluArg: 2.432 ± 1.249
4.054GluSer: 4.054 ± 0.033
2.027GluThr: 2.027 ± 1.074
4.459GluVal: 4.459 ± 1.585
1.216GluTrp: 1.216 ± 0.08
1.621GluTyr: 1.621 ± 0.128
0.0GluXaa: 0.0 ± 0.0
Phe
2.837PheAla: 2.837 ± 1.363
0.0PheCys: 0.0 ± 0.0
3.243PheAsp: 3.243 ± 0.255
2.027PheGlu: 2.027 ± 1.074
0.811PhePhe: 0.811 ± 0.416
3.648PheGly: 3.648 ± 0.464
0.811PheHis: 0.811 ± 0.289
2.432PheIle: 2.432 ± 0.544
2.027PheLys: 2.027 ± 1.074
4.864PheLeu: 4.864 ± 2.498
1.216PheMet: 1.216 ± 0.272
1.621PheAsn: 1.621 ± 0.128
2.837PhePro: 2.837 ± 0.047
3.243PheGln: 3.243 ± 1.155
3.648PheArg: 3.648 ± 2.356
3.648PheSer: 3.648 ± 1.651
2.837PheThr: 2.837 ± 0.658
4.054PheVal: 4.054 ± 0.672
0.405PheTrp: 0.405 ± 0.497
0.405PheTyr: 0.405 ± 0.208
0.0PheXaa: 0.0 ± 0.0
Gly
6.891GlyAla: 6.891 ± 1.396
0.811GlyCys: 0.811 ± 0.289
4.054GlyAsp: 4.054 ± 1.377
2.837GlyGlu: 2.837 ± 0.047
3.243GlyPhe: 3.243 ± 1.155
6.486GlyGly: 6.486 ± 0.194
2.027GlyHis: 2.027 ± 0.336
2.432GlyIle: 2.432 ± 0.544
3.648GlyLys: 3.648 ± 1.874
6.891GlyLeu: 6.891 ± 0.719
1.621GlyMet: 1.621 ± 0.128
3.648GlyAsn: 3.648 ± 0.946
2.837GlyPro: 2.837 ± 0.658
2.027GlyGln: 2.027 ± 0.369
4.054GlyArg: 4.054 ± 0.738
7.296GlySer: 7.296 ± 3.303
5.675GlyThr: 5.675 ± 2.02
6.08GlyVal: 6.08 ± 0.402
2.027GlyTrp: 2.027 ± 1.041
2.837GlyTyr: 2.837 ± 0.752
0.0GlyXaa: 0.0 ± 0.0
His
0.405HisAla: 0.405 ± 0.208
0.405HisCys: 0.405 ± 0.208
1.216HisAsp: 1.216 ± 0.625
0.811HisGlu: 0.811 ± 0.416
1.621HisPhe: 1.621 ± 0.833
0.811HisGly: 0.811 ± 0.289
0.0HisHis: 0.0 ± 0.0
1.216HisIle: 1.216 ± 0.08
0.811HisLys: 0.811 ± 0.416
3.243HisLeu: 3.243 ± 0.255
0.811HisMet: 0.811 ± 0.416
0.811HisAsn: 0.811 ± 0.416
0.811HisPro: 0.811 ± 0.416
0.405HisGln: 0.405 ± 0.497
0.811HisArg: 0.811 ± 0.416
1.621HisSer: 1.621 ± 0.833
0.811HisThr: 0.811 ± 0.416
2.027HisVal: 2.027 ± 1.074
0.0HisTrp: 0.0 ± 0.0
1.216HisTyr: 1.216 ± 0.08
0.0HisXaa: 0.0 ± 0.0
Ile
2.432IleAla: 2.432 ± 0.161
0.405IleCys: 0.405 ± 0.208
1.621IleAsp: 1.621 ± 0.128
1.216IleGlu: 1.216 ± 0.625
2.027IlePhe: 2.027 ± 1.041
4.054IleGly: 4.054 ± 2.148
0.405IleHis: 0.405 ± 0.208
2.837IleIle: 2.837 ± 0.047
2.432IleLys: 2.432 ± 1.249
2.837IleLeu: 2.837 ± 0.752
2.027IleMet: 2.027 ± 1.041
2.837IleAsn: 2.837 ± 0.658
3.243IlePro: 3.243 ± 1.155
0.0IleGln: 0.0 ± 0.0
2.027IleArg: 2.027 ± 1.041
4.459IleSer: 4.459 ± 0.175
3.648IleThr: 3.648 ± 0.241
4.459IleVal: 4.459 ± 0.175
0.405IleTrp: 0.405 ± 0.208
1.621IleTyr: 1.621 ± 1.282
0.0IleXaa: 0.0 ± 0.0
Lys
2.432LysAla: 2.432 ± 0.161
0.405LysCys: 0.405 ± 0.497
2.432LysAsp: 2.432 ± 1.249
2.432LysGlu: 2.432 ± 1.249
2.837LysPhe: 2.837 ± 0.752
3.648LysGly: 3.648 ± 1.169
1.216LysHis: 1.216 ± 0.625
1.216LysIle: 1.216 ± 0.08
2.432LysLys: 2.432 ± 1.249
4.054LysLeu: 4.054 ± 1.377
1.216LysMet: 1.216 ± 0.785
1.621LysAsn: 1.621 ± 0.128
2.837LysPro: 2.837 ± 0.752
0.811LysGln: 0.811 ± 0.289
2.837LysArg: 2.837 ± 0.752
2.027LysSer: 2.027 ± 0.336
4.054LysThr: 4.054 ± 2.082
2.837LysVal: 2.837 ± 0.752
0.405LysTrp: 0.405 ± 0.208
2.432LysTyr: 2.432 ± 0.161
0.0LysXaa: 0.0 ± 0.0
Leu
7.296LeuAla: 7.296 ± 0.483
2.027LeuCys: 2.027 ± 1.779
3.648LeuAsp: 3.648 ± 0.464
4.864LeuGlu: 4.864 ± 1.088
2.027LeuPhe: 2.027 ± 0.369
6.891LeuGly: 6.891 ± 1.424
1.621LeuHis: 1.621 ± 0.833
3.243LeuIle: 3.243 ± 0.255
3.648LeuLys: 3.648 ± 1.874
4.459LeuLeu: 4.459 ± 1.585
3.243LeuMet: 3.243 ± 0.96
5.27LeuAsn: 5.27 ± 1.296
3.243LeuPro: 3.243 ± 1.155
3.648LeuGln: 3.648 ± 0.464
5.675LeuArg: 5.675 ± 0.61
6.891LeuSer: 6.891 ± 0.014
6.891LeuThr: 6.891 ± 1.424
4.459LeuVal: 4.459 ± 1.585
1.621LeuTrp: 1.621 ± 0.833
1.216LeuTyr: 1.216 ± 1.49
0.0LeuXaa: 0.0 ± 0.0
Met
2.027MetAla: 2.027 ± 1.041
0.0MetCys: 0.0 ± 0.0
1.621MetAsp: 1.621 ± 0.128
1.216MetGlu: 1.216 ± 0.625
0.811MetPhe: 0.811 ± 0.994
2.027MetGly: 2.027 ± 1.074
1.621MetHis: 1.621 ± 0.128
1.621MetIle: 1.621 ± 1.282
1.216MetLys: 1.216 ± 0.625
2.432MetLeu: 2.432 ± 0.161
0.811MetMet: 0.811 ± 0.416
0.0MetAsn: 0.0 ± 0.0
3.648MetPro: 3.648 ± 1.169
0.811MetGln: 0.811 ± 0.289
0.811MetArg: 0.811 ± 0.416
2.432MetSer: 2.432 ± 0.161
1.216MetThr: 1.216 ± 0.08
2.027MetVal: 2.027 ± 0.336
0.811MetTrp: 0.811 ± 0.416
2.027MetTyr: 2.027 ± 0.336
0.0MetXaa: 0.0 ± 0.0
Asn
2.027AsnAla: 2.027 ± 1.779
1.216AsnCys: 1.216 ± 0.625
2.837AsnAsp: 2.837 ± 1.457
2.432AsnGlu: 2.432 ± 0.544
3.243AsnPhe: 3.243 ± 1.155
3.648AsnGly: 3.648 ± 0.946
0.405AsnHis: 0.405 ± 0.208
2.432AsnIle: 2.432 ± 1.249
0.811AsnLys: 0.811 ± 0.416
3.648AsnLeu: 3.648 ± 0.946
1.216AsnMet: 1.216 ± 0.08
1.621AsnAsn: 1.621 ± 1.987
2.837AsnPro: 2.837 ± 0.047
0.405AsnGln: 0.405 ± 0.208
3.648AsnArg: 3.648 ± 0.464
2.837AsnSer: 2.837 ± 0.047
2.432AsnThr: 2.432 ± 1.571
3.243AsnVal: 3.243 ± 0.96
0.811AsnTrp: 0.811 ± 0.416
2.027AsnTyr: 2.027 ± 1.074
0.0AsnXaa: 0.0 ± 0.0
Pro
0.811ProAla: 0.811 ± 0.289
0.0ProCys: 0.0 ± 0.0
1.621ProAsp: 1.621 ± 1.282
2.027ProGlu: 2.027 ± 1.041
2.837ProPhe: 2.837 ± 2.068
3.648ProGly: 3.648 ± 0.464
2.027ProHis: 2.027 ± 1.074
2.432ProIle: 2.432 ± 0.866
2.027ProLys: 2.027 ± 1.041
4.054ProLeu: 4.054 ± 0.672
1.621ProMet: 1.621 ± 0.833
2.432ProAsn: 2.432 ± 0.161
2.432ProPro: 2.432 ± 1.571
0.405ProGln: 0.405 ± 0.208
2.027ProArg: 2.027 ± 1.041
6.486ProSer: 6.486 ± 3.014
2.027ProThr: 2.027 ± 0.336
3.243ProVal: 3.243 ± 0.45
1.216ProTrp: 1.216 ± 0.08
2.837ProTyr: 2.837 ± 1.363
0.0ProXaa: 0.0 ± 0.0
Gln
3.243GlnAla: 3.243 ± 0.45
0.0GlnCys: 0.0 ± 0.0
1.216GlnAsp: 1.216 ± 0.785
2.432GlnGlu: 2.432 ± 0.544
1.216GlnPhe: 1.216 ± 0.08
1.621GlnGly: 1.621 ± 0.128
1.621GlnHis: 1.621 ± 0.833
0.0GlnIle: 0.0 ± 0.0
0.405GlnLys: 0.405 ± 0.208
2.432GlnLeu: 2.432 ± 0.544
0.811GlnMet: 0.811 ± 0.289
0.811GlnAsn: 0.811 ± 0.416
1.216GlnPro: 1.216 ± 0.08
0.405GlnGln: 0.405 ± 0.208
2.837GlnArg: 2.837 ± 0.752
2.837GlnSer: 2.837 ± 0.047
1.621GlnThr: 1.621 ± 0.577
2.027GlnVal: 2.027 ± 0.336
1.216GlnTrp: 1.216 ± 0.625
1.216GlnTyr: 1.216 ± 1.49
0.0GlnXaa: 0.0 ± 0.0
Arg
6.08ArgAla: 6.08 ± 0.303
0.405ArgCys: 0.405 ± 0.208
3.243ArgAsp: 3.243 ± 0.255
6.486ArgGlu: 6.486 ± 3.331
2.432ArgPhe: 2.432 ± 0.544
4.054ArgGly: 4.054 ± 0.738
1.216ArgHis: 1.216 ± 0.625
2.432ArgIle: 2.432 ± 0.161
2.837ArgLys: 2.837 ± 0.658
2.837ArgLeu: 2.837 ± 0.047
2.837ArgMet: 2.837 ± 0.047
2.837ArgAsn: 2.837 ± 0.047
1.621ArgPro: 1.621 ± 0.128
0.0ArgGln: 0.0 ± 0.0
4.459ArgArg: 4.459 ± 0.175
4.054ArgSer: 4.054 ± 0.672
2.027ArgThr: 2.027 ± 0.336
4.459ArgVal: 4.459 ± 0.175
0.405ArgTrp: 0.405 ± 0.208
3.243ArgTyr: 3.243 ± 0.255
0.0ArgXaa: 0.0 ± 0.0
Ser
6.08SerAla: 6.08 ± 0.303
1.216SerCys: 1.216 ± 0.785
3.648SerAsp: 3.648 ± 0.464
4.459SerGlu: 4.459 ± 0.175
4.864SerPhe: 4.864 ± 0.322
5.27SerGly: 5.27 ± 0.819
1.216SerHis: 1.216 ± 0.785
2.837SerIle: 2.837 ± 0.047
5.675SerLys: 5.675 ± 1.315
5.675SerLeu: 5.675 ± 0.61
2.432SerMet: 2.432 ± 0.544
4.864SerAsn: 4.864 ± 0.383
3.243SerPro: 3.243 ± 1.86
1.621SerGln: 1.621 ± 0.128
3.243SerArg: 3.243 ± 0.45
9.323SerSer: 9.323 ± 1.557
4.864SerThr: 4.864 ± 1.732
6.891SerVal: 6.891 ± 2.806
0.811SerTrp: 0.811 ± 0.416
2.027SerTyr: 2.027 ± 1.779
0.0SerXaa: 0.0 ± 0.0
Thr
8.107ThrAla: 8.107 ± 5.706
1.621ThrCys: 1.621 ± 0.833
2.432ThrAsp: 2.432 ± 0.161
1.216ThrGlu: 1.216 ± 0.08
2.432ThrPhe: 2.432 ± 0.544
4.459ThrGly: 4.459 ± 1.235
0.405ThrHis: 0.405 ± 0.208
2.027ThrIle: 2.027 ± 0.369
2.027ThrLys: 2.027 ± 0.336
8.512ThrLeu: 8.512 ± 0.563
0.811ThrMet: 0.811 ± 0.289
2.432ThrAsn: 2.432 ± 0.161
4.054ThrPro: 4.054 ± 0.738
1.621ThrGln: 1.621 ± 0.128
4.459ThrArg: 4.459 ± 0.175
4.459ThrSer: 4.459 ± 1.235
2.027ThrThr: 2.027 ± 0.369
5.27ThrVal: 5.27 ± 0.819
0.405ThrTrp: 0.405 ± 0.497
2.027ThrTyr: 2.027 ± 0.336
0.0ThrXaa: 0.0 ± 0.0
Val
8.918ValAla: 8.918 ± 0.355
2.837ValCys: 2.837 ± 1.457
5.675ValAsp: 5.675 ± 0.095
4.054ValGlu: 4.054 ± 0.738
2.027ValPhe: 2.027 ± 0.369
7.702ValGly: 7.702 ± 0.431
1.621ValHis: 1.621 ± 0.833
2.837ValIle: 2.837 ± 0.047
4.864ValLys: 4.864 ± 1.088
4.864ValLeu: 4.864 ± 1.088
0.811ValMet: 0.811 ± 0.289
1.621ValAsn: 1.621 ± 0.833
2.432ValPro: 2.432 ± 1.571
4.054ValGln: 4.054 ± 0.033
7.296ValArg: 7.296 ± 1.632
4.864ValSer: 4.864 ± 0.322
4.864ValThr: 4.864 ± 1.027
7.702ValVal: 7.702 ± 0.431
1.621ValTrp: 1.621 ± 0.128
2.837ValTyr: 2.837 ± 0.658
0.0ValXaa: 0.0 ± 0.0
Trp
1.216TrpAla: 1.216 ± 0.625
0.405TrpCys: 0.405 ± 0.208
0.811TrpAsp: 0.811 ± 0.416
1.216TrpGlu: 1.216 ± 0.625
1.216TrpPhe: 1.216 ± 0.785
0.405TrpGly: 0.405 ± 0.208
0.811TrpHis: 0.811 ± 0.289
1.216TrpIle: 1.216 ± 0.625
1.216TrpLys: 1.216 ± 0.625
1.216TrpLeu: 1.216 ± 0.08
0.0TrpMet: 0.0 ± 0.0
1.216TrpAsn: 1.216 ± 0.625
0.405TrpPro: 0.405 ± 0.208
0.811TrpGln: 0.811 ± 0.416
1.621TrpArg: 1.621 ± 0.128
0.0TrpSer: 0.0 ± 0.0
0.405TrpThr: 0.405 ± 0.497
1.621TrpVal: 1.621 ± 0.833
0.405TrpTrp: 0.405 ± 0.208
2.027TrpTyr: 2.027 ± 0.369
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.837TyrAla: 2.837 ± 0.658
0.0TyrCys: 0.0 ± 0.0
1.621TyrAsp: 1.621 ± 0.833
0.811TyrGlu: 0.811 ± 0.289
2.432TyrPhe: 2.432 ± 0.161
2.027TyrGly: 2.027 ± 1.779
0.0TyrHis: 0.0 ± 0.0
1.621TyrIle: 1.621 ± 1.282
1.216TyrLys: 1.216 ± 0.625
4.054TyrLeu: 4.054 ± 0.033
1.216TyrMet: 1.216 ± 0.08
1.216TyrAsn: 1.216 ± 1.49
1.621TyrPro: 1.621 ± 0.577
2.027TyrGln: 2.027 ± 0.336
2.027TyrArg: 2.027 ± 0.369
1.621TyrSer: 1.621 ± 0.128
5.675TyrThr: 5.675 ± 1.315
2.027TyrVal: 2.027 ± 1.779
2.027TyrTrp: 2.027 ± 0.336
0.405TyrTyr: 0.405 ± 0.208
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2468 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski