Amino acid dipeptide frequency for Wenzhou picorna-like virus 28

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
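As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name, the mapping of non-standard residues to X (Xaa), and the choice not to count pairs across protein boundaries are assumptions made for this example; the exact procedure used by the database is not described on this page.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Hypothetical helper: residues outside the standard 20 are mapped to 'X'.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown residue becomes 'X' (Xaa).
        clean = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences, not taken from the viral proteome.
freqs = dipeptide_frequencies(["MAAK", "AAB"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The two toy sequences contain five dipeptides in total, two of which are AlaAla (AA), so that pair accounts for 400‰ of the counts.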

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.
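The expected value and the red/blue distinction can be sketched as follows. The page does not state the exact rule behind the colouring, so the simple comparison against the uniform expectation, the `classify` helper, and the label names are assumptions; the three sample values are copied from the table.

```python
N_AMINO_ACIDS = 20

# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / N_AMINO_ACIDS ** 2


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"      # would be shown in red
    if value_permille < expected:
        return "under"     # would be shown in blue
    return "expected"


# Values in per mille, taken from the table.
sample = {"AlaAla": 7.669, "CysAla": 0.333, "AlaPro": 2.668}
labels = {pair: classify(value) for pair, value in sample.items()}
print(EXPECTED_PERMILLE, labels)
```

Under this assumed rule AlaAla and AlaPro lie above the 2.5‰ expectation, while CysAla lies below it.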

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
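A short sketch of the unit conversion, using the AlaAla value from the table; the helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 7.669  # per mille, AlaAla entry from the table
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```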

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.669AlaAla: 7.669 ± 1.425
1.334AlaCys: 1.334 ± 0.243
6.002AlaAsp: 6.002 ± 0.251
4.335AlaGlu: 4.335 ± 0.386
2.001AlaPhe: 2.001 ± 0.442
3.001AlaGly: 3.001 ± 1.756
2.334AlaHis: 2.334 ± 0.828
3.001AlaIle: 3.001 ± 0.143
5.669AlaLys: 5.669 ± 2.241
6.669AlaLeu: 6.669 ± 0.677
1.667AlaMet: 1.667 ± 0.438
1.667AlaAsn: 1.667 ± 0.637
2.668AlaPro: 2.668 ± 0.589
3.001AlaGln: 3.001 ± 1.218
3.668AlaArg: 3.668 ± 0.533
7.336AlaSer: 7.336 ± 2.158
5.002AlaThr: 5.002 ± 1.314
4.001AlaVal: 4.001 ± 0.191
0.333AlaTrp: 0.333 ± 0.342
2.668AlaTyr: 2.668 ± 1.127
0.0AlaXaa: 0.0 ± 0.0
Cys
0.333CysAla: 0.333 ± 0.195
0.667CysCys: 0.667 ± 0.39
2.001CysAsp: 2.001 ± 0.442
1.0CysGlu: 1.0 ± 0.048
0.333CysPhe: 0.333 ± 0.195
1.0CysGly: 1.0 ± 0.585
0.333CysHis: 0.333 ± 0.195
1.0CysIle: 1.0 ± 0.585
2.334CysLys: 2.334 ± 0.784
1.667CysLeu: 1.667 ± 0.975
0.0CysMet: 0.0 ± 0.0
0.333CysAsn: 0.333 ± 0.342
0.667CysPro: 0.667 ± 0.147
0.333CysGln: 0.333 ± 0.195
0.667CysArg: 0.667 ± 0.39
0.333CysSer: 0.333 ± 0.195
0.667CysThr: 0.667 ± 0.147
0.333CysVal: 0.333 ± 0.342
1.0CysTrp: 1.0 ± 0.048
0.667CysTyr: 0.667 ± 0.147
0.0CysXaa: 0.0 ± 0.0
Asp
6.002AspAla: 6.002 ± 0.287
2.334AspCys: 2.334 ± 0.291
7.336AspAsp: 7.336 ± 0.546
4.668AspGlu: 4.668 ± 0.044
4.668AspPhe: 4.668 ± 0.044
4.001AspGly: 4.001 ± 0.346
1.0AspHis: 1.0 ± 0.585
6.002AspIle: 6.002 ± 0.287
2.668AspLys: 2.668 ± 0.589
5.669AspLeu: 5.669 ± 0.446
1.667AspMet: 1.667 ± 0.637
2.334AspAsn: 2.334 ± 0.291
4.335AspPro: 4.335 ± 0.151
3.001AspGln: 3.001 ± 2.006
2.668AspArg: 2.668 ± 0.486
2.668AspSer: 2.668 ± 1.561
3.668AspThr: 3.668 ± 1.616
5.335AspVal: 5.335 ± 0.971
0.667AspTrp: 0.667 ± 0.39
2.334AspTyr: 2.334 ± 0.784
0.0AspXaa: 0.0 ± 0.0
Glu
3.668GluAla: 3.668 ± 0.533
0.667GluCys: 0.667 ± 0.39
4.001GluAsp: 4.001 ± 0.346
3.001GluGlu: 3.001 ± 0.143
4.001GluPhe: 4.001 ± 0.346
2.334GluGly: 2.334 ± 0.828
2.001GluHis: 2.001 ± 0.096
4.335GluIle: 4.335 ± 0.689
4.668GluLys: 4.668 ± 1.119
4.668GluLeu: 4.668 ± 1.656
2.001GluMet: 2.001 ± 0.096
3.001GluAsn: 3.001 ± 0.394
2.334GluPro: 2.334 ± 0.291
2.001GluGln: 2.001 ± 0.633
4.001GluArg: 4.001 ± 0.191
2.334GluSer: 2.334 ± 0.291
2.001GluThr: 2.001 ± 0.979
4.001GluVal: 4.001 ± 0.884
1.667GluTrp: 1.667 ± 0.438
3.001GluTyr: 3.001 ± 0.394
0.0GluXaa: 0.0 ± 0.0
Phe
2.668PheAla: 2.668 ± 0.486
1.0PheCys: 1.0 ± 0.048
3.334PheAsp: 3.334 ± 0.338
2.668PheGlu: 2.668 ± 0.486
1.0PhePhe: 1.0 ± 0.048
3.668PheGly: 3.668 ± 1.616
1.334PheHis: 1.334 ± 0.243
1.667PheIle: 1.667 ± 0.1
2.001PheLys: 2.001 ± 0.442
3.334PheLeu: 3.334 ± 1.413
0.333PheMet: 0.333 ± 0.31
1.334PheAsn: 1.334 ± 0.243
1.334PhePro: 1.334 ± 0.832
2.334PheGln: 2.334 ± 2.397
3.668PheArg: 3.668 ± 1.071
5.335PheSer: 5.335 ± 0.641
4.001PheThr: 4.001 ± 0.191
3.001PheVal: 3.001 ± 0.932
0.333PheTrp: 0.333 ± 0.195
2.334PheTyr: 2.334 ± 0.784
0.0PheXaa: 0.0 ± 0.0
Gly
3.334GlyAla: 3.334 ± 0.199
0.0GlyCys: 0.0 ± 0.0
4.668GlyAsp: 4.668 ± 0.581
3.001GlyGlu: 3.001 ± 0.394
4.001GlyPhe: 4.001 ± 0.884
2.334GlyGly: 2.334 ± 0.784
0.667GlyHis: 0.667 ± 0.39
3.334GlyIle: 3.334 ± 0.338
5.669GlyLys: 5.669 ± 1.704
3.668GlyLeu: 3.668 ± 1.616
2.001GlyMet: 2.001 ± 0.096
2.668GlyAsn: 2.668 ± 1.127
1.0GlyPro: 1.0 ± 0.49
1.667GlyGln: 1.667 ± 0.438
2.334GlyArg: 2.334 ± 0.291
2.334GlySer: 2.334 ± 0.247
3.334GlyThr: 3.334 ± 0.737
3.001GlyVal: 3.001 ± 0.932
0.333GlyTrp: 0.333 ± 0.342
2.334GlyTyr: 2.334 ± 0.828
0.0GlyXaa: 0.0 ± 0.0
His
0.667HisAla: 0.667 ± 0.39
0.333HisCys: 0.333 ± 0.195
2.001HisAsp: 2.001 ± 0.633
1.0HisGlu: 1.0 ± 0.585
1.334HisPhe: 1.334 ± 0.78
1.0HisGly: 1.0 ± 0.048
0.667HisHis: 0.667 ± 0.39
1.0HisIle: 1.0 ± 0.49
1.334HisLys: 1.334 ± 0.243
1.667HisLeu: 1.667 ± 0.438
0.667HisMet: 0.667 ± 0.39
0.333HisAsn: 0.333 ± 0.195
1.667HisPro: 1.667 ± 0.438
0.667HisGln: 0.667 ± 0.39
1.667HisArg: 1.667 ± 0.637
2.001HisSer: 2.001 ± 0.096
2.001HisThr: 2.001 ± 0.442
1.667HisVal: 1.667 ± 0.438
0.0HisTrp: 0.0 ± 0.0
0.667HisTyr: 0.667 ± 0.685
0.0HisXaa: 0.0 ± 0.0
Ile
5.669IleAla: 5.669 ± 0.091
2.334IleCys: 2.334 ± 0.247
4.335IleAsp: 4.335 ± 1.461
4.335IleGlu: 4.335 ± 0.386
1.334IlePhe: 1.334 ± 0.243
3.668IleGly: 3.668 ± 0.541
1.334IleHis: 1.334 ± 0.78
3.668IleIle: 3.668 ± 0.533
4.335IleLys: 4.335 ± 0.386
5.335IleLeu: 5.335 ± 0.104
1.0IleMet: 1.0 ± 0.048
4.335IleAsn: 4.335 ± 0.386
3.668IlePro: 3.668 ± 0.004
3.001IleGln: 3.001 ± 0.143
3.334IleArg: 3.334 ± 0.199
3.001IleSer: 3.001 ± 0.143
3.668IleThr: 3.668 ± 0.004
4.001IleVal: 4.001 ± 0.728
0.333IleTrp: 0.333 ± 0.195
2.001IleTyr: 2.001 ± 0.096
0.0IleXaa: 0.0 ± 0.0
Lys
4.335LysAla: 4.335 ± 1.461
0.667LysCys: 0.667 ± 0.39
3.668LysAsp: 3.668 ± 0.004
5.002LysGlu: 5.002 ± 1.911
2.668LysPhe: 2.668 ± 0.486
3.334LysGly: 3.334 ± 0.199
0.333LysHis: 0.333 ± 0.195
3.334LysIle: 3.334 ± 1.413
4.668LysLys: 4.668 ± 0.581
6.335LysLeu: 6.335 ± 2.094
1.334LysMet: 1.334 ± 0.165
2.668LysAsn: 2.668 ± 0.589
1.334LysPro: 1.334 ± 0.832
1.334LysGln: 1.334 ± 0.243
3.334LysArg: 3.334 ± 0.199
3.001LysSer: 3.001 ± 0.932
6.669LysThr: 6.669 ± 1.752
4.001LysVal: 4.001 ± 0.191
0.667LysTrp: 0.667 ± 0.147
3.668LysTyr: 3.668 ± 1.071
0.0LysXaa: 0.0 ± 0.0
Leu
6.335LeuAla: 6.335 ± 0.056
0.0LeuCys: 0.0 ± 0.0
7.002LeuAsp: 7.002 ± 0.334
4.001LeuGlu: 4.001 ± 0.191
3.668LeuPhe: 3.668 ± 1.079
4.335LeuGly: 4.335 ± 0.924
1.667LeuHis: 1.667 ± 0.975
4.335LeuIle: 4.335 ± 0.689
5.002LeuLys: 5.002 ± 0.239
4.668LeuLeu: 4.668 ± 1.656
1.334LeuMet: 1.334 ± 0.243
4.335LeuAsn: 4.335 ± 1.764
5.335LeuPro: 5.335 ± 0.641
3.668LeuGln: 3.668 ± 1.071
7.002LeuArg: 7.002 ± 0.741
6.002LeuSer: 6.002 ± 2.436
7.002LeuThr: 7.002 ± 1.278
6.002LeuVal: 6.002 ± 1.899
0.0LeuTrp: 0.0 ± 0.0
2.334LeuTyr: 2.334 ± 1.365
0.0LeuXaa: 0.0 ± 0.0
Met
2.334MetAla: 2.334 ± 1.365
0.667MetCys: 0.667 ± 0.39
1.667MetAsp: 1.667 ± 0.438
1.667MetGlu: 1.667 ± 0.1
0.333MetPhe: 0.333 ± 0.342
0.333MetGly: 0.333 ± 0.342
0.667MetHis: 0.667 ± 0.39
1.667MetIle: 1.667 ± 0.1
1.0MetLys: 1.0 ± 0.048
2.334MetLeu: 2.334 ± 0.247
0.333MetMet: 0.333 ± 0.195
0.667MetAsn: 0.667 ± 0.147
1.667MetPro: 1.667 ± 0.637
0.333MetGln: 0.333 ± 0.342
1.334MetArg: 1.334 ± 0.78
0.667MetSer: 0.667 ± 0.39
1.334MetThr: 1.334 ± 0.295
0.667MetVal: 0.667 ± 0.39
0.667MetTrp: 0.667 ± 0.147
1.0MetTyr: 1.0 ± 0.585
0.0MetXaa: 0.0 ± 0.0
Asn
4.001AsnAla: 4.001 ± 0.346
0.0AsnCys: 0.0 ± 0.0
2.001AsnAsp: 2.001 ± 0.096
2.001AsnGlu: 2.001 ± 0.096
2.668AsnPhe: 2.668 ± 0.589
1.667AsnGly: 1.667 ± 0.637
1.0AsnHis: 1.0 ± 0.49
3.334AsnIle: 3.334 ± 0.199
1.0AsnLys: 1.0 ± 0.048
3.334AsnLeu: 3.334 ± 0.737
1.334AsnMet: 1.334 ± 0.243
0.333AsnAsn: 0.333 ± 0.342
2.668AsnPro: 2.668 ± 1.127
2.334AsnGln: 2.334 ± 0.247
2.668AsnArg: 2.668 ± 1.023
2.668AsnSer: 2.668 ± 0.589
3.668AsnThr: 3.668 ± 0.541
3.001AsnVal: 3.001 ± 0.681
0.0AsnTrp: 0.0 ± 0.0
1.0AsnTyr: 1.0 ± 0.048
0.0AsnXaa: 0.0 ± 0.0
Pro
2.334ProAla: 2.334 ± 0.247
0.667ProCys: 0.667 ± 0.39
4.668ProAsp: 4.668 ± 1.569
3.334ProGlu: 3.334 ± 0.199
3.001ProPhe: 3.001 ± 1.469
2.668ProGly: 2.668 ± 0.589
0.333ProHis: 0.333 ± 0.342
3.668ProIle: 3.668 ± 1.071
3.334ProLys: 3.334 ± 1.274
4.001ProLeu: 4.001 ± 0.191
1.334ProMet: 1.334 ± 0.243
1.667ProAsn: 1.667 ± 0.438
0.667ProPro: 0.667 ± 0.685
1.334ProGln: 1.334 ± 0.295
2.668ProArg: 2.668 ± 0.589
2.668ProSer: 2.668 ± 0.589
3.001ProThr: 3.001 ± 0.932
1.667ProVal: 1.667 ± 1.174
0.333ProTrp: 0.333 ± 0.195
2.334ProTyr: 2.334 ± 1.322
0.0ProXaa: 0.0 ± 0.0
Gln
2.668GlnAla: 2.668 ± 1.127
1.667GlnCys: 1.667 ± 0.1
2.001GlnAsp: 2.001 ± 0.442
1.0GlnGlu: 1.0 ± 0.048
0.333GlnPhe: 0.333 ± 0.195
3.668GlnGly: 3.668 ± 0.533
1.334GlnHis: 1.334 ± 0.243
4.668GlnIle: 4.668 ± 0.044
3.334GlnLys: 3.334 ± 0.199
3.668GlnLeu: 3.668 ± 0.004
1.0GlnMet: 1.0 ± 0.048
2.334GlnAsn: 2.334 ± 0.247
2.668GlnPro: 2.668 ± 0.052
1.0GlnGln: 1.0 ± 1.027
3.001GlnArg: 3.001 ± 0.681
1.334GlnSer: 1.334 ± 0.243
2.001GlnThr: 2.001 ± 0.442
2.334GlnVal: 2.334 ± 0.247
0.0GlnTrp: 0.0 ± 0.0
2.001GlnTyr: 2.001 ± 0.096
0.0GlnXaa: 0.0 ± 0.0
Arg
4.668ArgAla: 4.668 ± 1.119
1.0ArgCys: 1.0 ± 0.49
1.0ArgAsp: 1.0 ± 0.585
3.001ArgGlu: 3.001 ± 1.218
2.001ArgPhe: 2.001 ± 0.442
3.668ArgGly: 3.668 ± 1.616
1.0ArgHis: 1.0 ± 0.585
4.668ArgIle: 4.668 ± 0.044
3.001ArgLys: 3.001 ± 0.681
6.002ArgLeu: 6.002 ± 0.287
0.667ArgMet: 0.667 ± 0.147
1.0ArgAsn: 1.0 ± 0.048
2.668ArgPro: 2.668 ± 1.127
2.334ArgGln: 2.334 ± 0.828
2.668ArgArg: 2.668 ± 1.023
5.002ArgSer: 5.002 ± 0.776
4.668ArgThr: 4.668 ± 1.656
3.001ArgVal: 3.001 ± 0.394
0.333ArgTrp: 0.333 ± 0.195
2.668ArgTyr: 2.668 ± 1.127
0.0ArgXaa: 0.0 ± 0.0
Ser
4.668SerAla: 4.668 ± 1.119
1.0SerCys: 1.0 ± 0.048
4.668SerAsp: 4.668 ± 1.569
5.002SerGlu: 5.002 ± 1.314
2.668SerPhe: 2.668 ± 0.052
4.001SerGly: 4.001 ± 1.959
1.0SerHis: 1.0 ± 0.49
4.335SerIle: 4.335 ± 1.998
3.001SerLys: 3.001 ± 0.681
5.002SerLeu: 5.002 ± 0.299
1.0SerMet: 1.0 ± 0.585
3.334SerAsn: 3.334 ± 0.876
3.334SerPro: 3.334 ± 0.876
5.335SerGln: 5.335 ± 0.104
2.334SerArg: 2.334 ± 0.291
3.668SerSer: 3.668 ± 0.533
5.002SerThr: 5.002 ± 2.986
2.668SerVal: 2.668 ± 1.127
1.0SerTrp: 1.0 ± 0.49
2.001SerTyr: 2.001 ± 1.17
0.0SerXaa: 0.0 ± 0.0
Thr
4.335ThrAla: 4.335 ± 1.226
0.667ThrCys: 0.667 ± 0.147
4.335ThrAsp: 4.335 ± 1.226
4.668ThrGlu: 4.668 ± 0.044
4.335ThrPhe: 4.335 ± 0.386
2.668ThrGly: 2.668 ± 0.052
3.001ThrHis: 3.001 ± 0.394
4.335ThrIle: 4.335 ± 0.386
2.668ThrLys: 2.668 ± 0.052
7.336ThrLeu: 7.336 ± 0.008
0.333ThrMet: 0.333 ± 0.342
1.667ThrAsn: 1.667 ± 0.637
3.668ThrPro: 3.668 ± 1.079
2.668ThrGln: 2.668 ± 1.127
1.667ThrArg: 1.667 ± 0.1
7.002ThrSer: 7.002 ± 1.278
3.334ThrThr: 3.334 ± 1.274
4.001ThrVal: 4.001 ± 1.421
0.333ThrTrp: 0.333 ± 0.342
3.334ThrTyr: 3.334 ± 0.876
0.0ThrXaa: 0.0 ± 0.0
Val
5.002ValAla: 5.002 ± 0.299
0.667ValCys: 0.667 ± 0.147
3.334ValAsp: 3.334 ± 1.811
3.001ValGlu: 3.001 ± 0.394
2.334ValPhe: 2.334 ± 0.291
4.001ValGly: 4.001 ± 0.728
2.001ValHis: 2.001 ± 0.442
2.668ValIle: 2.668 ± 0.486
4.668ValLys: 4.668 ± 1.656
5.002ValLeu: 5.002 ± 1.374
2.334ValMet: 2.334 ± 0.828
3.334ValAsn: 3.334 ± 0.876
3.001ValPro: 3.001 ± 2.544
2.001ValGln: 2.001 ± 0.096
2.668ValArg: 2.668 ± 0.052
4.668ValSer: 4.668 ± 0.044
2.001ValThr: 2.001 ± 1.517
4.335ValVal: 4.335 ± 0.151
0.333ValTrp: 0.333 ± 0.195
2.668ValTyr: 2.668 ± 1.561
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.333TrpAsp: 0.333 ± 0.195
0.667TrpGlu: 0.667 ± 0.39
0.667TrpPhe: 0.667 ± 0.685
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
2.001TrpIle: 2.001 ± 0.442
0.0TrpLys: 0.0 ± 0.0
1.0TrpLeu: 1.0 ± 0.048
0.0TrpMet: 0.0 ± 0.0
0.667TrpAsn: 0.667 ± 0.39
0.0TrpPro: 0.0 ± 0.0
0.333TrpGln: 0.333 ± 0.195
1.334TrpArg: 1.334 ± 0.243
0.333TrpSer: 0.333 ± 0.342
1.0TrpThr: 1.0 ± 0.048
0.667TrpVal: 0.667 ± 0.147
0.0TrpTrp: 0.0 ± 0.0
0.333TrpTyr: 0.333 ± 0.195
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.334TyrAla: 3.334 ± 0.876
0.0TyrCys: 0.0 ± 0.0
4.668TyrAsp: 4.668 ± 1.656
2.334TyrGlu: 2.334 ± 0.291
3.668TyrPhe: 3.668 ± 1.071
0.333TyrGly: 0.333 ± 0.195
0.333TyrHis: 0.333 ± 0.342
1.667TyrIle: 1.667 ± 0.1
2.001TyrLys: 2.001 ± 1.517
2.668TyrLeu: 2.668 ± 0.486
0.667TyrMet: 0.667 ± 0.147
2.668TyrAsn: 2.668 ± 1.127
1.0TyrPro: 1.0 ± 0.585
3.334TyrGln: 3.334 ± 0.338
2.334TyrArg: 2.334 ± 0.784
2.668TyrSer: 2.668 ± 0.486
2.334TyrThr: 2.334 ± 0.784
2.334TyrVal: 2.334 ± 0.291
1.0TyrTrp: 1.0 ± 0.048
1.0TyrTyr: 1.0 ± 0.49
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3000 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
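One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. Taking the standard deviation as the error measure, the function names, and the fixed random seed are assumptions for this example; the page only states that 100 bootstrap rounds at the protein level were used.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Per mille frequency of every overlapping dipeptide in the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of one dipeptide's frequency over bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.pstdev(estimates)


# Toy sequences, not the actual viral proteins.
proteins = ["MAAKAAL", "MKKLAAG"]
err = bootstrap_error(proteins, "AA")
print(f"AA error estimate: {err:.3f} per mille")
```

With only two proteins, as in this proteome, each replicate can take just a few distinct compositions, so error estimates obtained this way are necessarily coarse.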

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski