Amino acid dipepetide frequency for Beihai picorna-like virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.285AlaAla: 3.285 ± 1.282
0.821AlaCys: 0.821 ± 0.321
6.571AlaAsp: 6.571 ± 0.509
2.875AlaGlu: 2.875 ± 0.738
3.285AlaPhe: 3.285 ± 0.254
6.16AlaGly: 6.16 ± 2.02
1.232AlaHis: 1.232 ± 0.097
3.696AlaIle: 3.696 ± 0.29
4.928AlaLys: 4.928 ± 0.381
5.749AlaLeu: 5.749 ± 1.476
3.285AlaMet: 3.285 ± 0.254
5.339AlaAsn: 5.339 ± 0.931
5.749AlaPro: 5.749 ± 0.707
3.696AlaGln: 3.696 ± 1.058
4.107AlaArg: 4.107 ± 2.238
6.982AlaSer: 6.982 ± 0.036
4.517AlaThr: 4.517 ± 1.379
4.928AlaVal: 4.928 ± 0.387
0.821AlaTrp: 0.821 ± 1.089
1.643AlaTyr: 1.643 ± 0.895
0.0AlaXaa: 0.0 ± 0.0
Cys
2.053CysAla: 2.053 ± 1.119
0.0CysCys: 0.0 ± 0.0
1.232CysAsp: 1.232 ± 0.672
0.821CysGlu: 0.821 ± 0.448
2.053CysPhe: 2.053 ± 0.351
2.053CysGly: 2.053 ± 0.351
0.821CysHis: 0.821 ± 0.448
0.821CysIle: 0.821 ± 0.448
0.821CysLys: 0.821 ± 0.448
0.411CysLeu: 0.411 ± 0.224
0.0CysMet: 0.0 ± 0.0
1.232CysAsn: 1.232 ± 0.672
0.411CysPro: 0.411 ± 0.224
0.411CysGln: 0.411 ± 0.224
0.411CysArg: 0.411 ± 0.544
1.643CysSer: 1.643 ± 0.641
0.411CysThr: 0.411 ± 0.224
2.875CysVal: 2.875 ± 0.799
0.411CysTrp: 0.411 ± 0.544
1.643CysTyr: 1.643 ± 0.127
0.0CysXaa: 0.0 ± 0.0
Asp
4.107AspAla: 4.107 ± 0.066
1.232AspCys: 1.232 ± 0.672
3.696AspAsp: 3.696 ± 1.058
4.928AspGlu: 4.928 ± 0.387
4.517AspPhe: 4.517 ± 0.158
3.696AspGly: 3.696 ± 0.478
0.821AspHis: 0.821 ± 0.321
6.571AspIle: 6.571 ± 1.796
3.285AspLys: 3.285 ± 0.254
5.339AspLeu: 5.339 ± 1.374
2.053AspMet: 2.053 ± 1.119
0.821AspAsn: 0.821 ± 0.321
2.875AspPro: 2.875 ± 0.738
3.696AspGln: 3.696 ± 0.478
1.232AspArg: 1.232 ± 0.097
3.285AspSer: 3.285 ± 0.254
2.875AspThr: 2.875 ± 1.506
3.285AspVal: 3.285 ± 0.514
1.643AspTrp: 1.643 ± 0.127
2.875AspTyr: 2.875 ± 0.799
0.0AspXaa: 0.0 ± 0.0
Glu
2.464GluAla: 2.464 ± 0.962
1.643GluCys: 1.643 ± 0.895
3.285GluAsp: 3.285 ± 1.282
1.232GluGlu: 1.232 ± 0.672
2.053GluPhe: 2.053 ± 0.417
2.464GluGly: 2.464 ± 0.575
0.0GluHis: 0.0 ± 0.0
1.643GluIle: 1.643 ± 0.127
2.464GluLys: 2.464 ± 1.343
5.749GluLeu: 5.749 ± 0.707
2.464GluMet: 2.464 ± 0.193
3.696GluAsn: 3.696 ± 0.29
2.053GluPro: 2.053 ± 1.119
0.411GluGln: 0.411 ± 0.224
2.464GluArg: 2.464 ± 0.575
4.928GluSer: 4.928 ± 1.15
3.285GluThr: 3.285 ± 0.254
7.392GluVal: 7.392 ± 0.188
0.821GluTrp: 0.821 ± 0.448
4.107GluTyr: 4.107 ± 0.066
0.0GluXaa: 0.0 ± 0.0
Phe
3.696PheAla: 3.696 ± 0.29
1.643PheCys: 1.643 ± 0.641
2.875PheAsp: 2.875 ± 0.03
2.875PheGlu: 2.875 ± 0.03
0.821PhePhe: 0.821 ± 0.448
2.464PheGly: 2.464 ± 0.962
0.821PheHis: 0.821 ± 0.448
2.053PheIle: 2.053 ± 0.351
2.053PheLys: 2.053 ± 1.119
1.643PheLeu: 1.643 ± 0.127
0.0PheMet: 0.0 ± 0.0
1.643PheAsn: 1.643 ± 0.641
5.339PhePro: 5.339 ± 0.605
1.643PheGln: 1.643 ± 0.127
2.464PheArg: 2.464 ± 0.193
2.875PheSer: 2.875 ± 1.567
1.643PheThr: 1.643 ± 0.895
3.285PheVal: 3.285 ± 0.514
0.0PheTrp: 0.0 ± 0.0
1.232PheTyr: 1.232 ± 0.865
0.0PheXaa: 0.0 ± 0.0
Gly
4.928GlyAla: 4.928 ± 1.15
1.232GlyCys: 1.232 ± 0.097
4.517GlyAsp: 4.517 ± 2.147
3.696GlyGlu: 3.696 ± 1.058
1.643GlyPhe: 1.643 ± 0.127
6.16GlyGly: 6.16 ± 2.02
0.411GlyHis: 0.411 ± 0.224
3.285GlyIle: 3.285 ± 0.254
4.107GlyLys: 4.107 ± 1.47
2.875GlyLeu: 2.875 ± 2.274
0.821GlyMet: 0.821 ± 0.321
2.464GlyAsn: 2.464 ± 0.193
1.232GlyPro: 1.232 ± 0.672
2.053GlyGln: 2.053 ± 1.954
2.875GlyArg: 2.875 ± 0.738
5.339GlySer: 5.339 ± 0.163
5.339GlyThr: 5.339 ± 0.931
4.517GlyVal: 4.517 ± 0.926
0.821GlyTrp: 0.821 ± 1.089
2.875GlyTyr: 2.875 ± 1.506
0.0GlyXaa: 0.0 ± 0.0
His
3.285HisAla: 3.285 ± 0.254
0.0HisCys: 0.0 ± 0.0
0.821HisAsp: 0.821 ± 0.321
0.411HisGlu: 0.411 ± 0.544
0.411HisPhe: 0.411 ± 0.224
0.821HisGly: 0.821 ± 0.448
0.0HisHis: 0.0 ± 0.0
1.232HisIle: 1.232 ± 0.672
0.821HisLys: 0.821 ± 0.321
2.464HisLeu: 2.464 ± 0.575
0.411HisMet: 0.411 ± 0.367
2.875HisAsn: 2.875 ± 0.738
0.821HisPro: 0.821 ± 0.321
0.411HisGln: 0.411 ± 0.224
0.411HisArg: 0.411 ± 0.544
2.053HisSer: 2.053 ± 1.119
1.643HisThr: 1.643 ± 0.895
2.875HisVal: 2.875 ± 0.03
0.0HisTrp: 0.0 ± 0.0
1.232HisTyr: 1.232 ± 0.672
0.0HisXaa: 0.0 ± 0.0
Ile
4.107IleAla: 4.107 ± 0.834
0.821IleCys: 0.821 ± 0.448
4.107IleAsp: 4.107 ± 0.702
2.053IleGlu: 2.053 ± 1.119
3.285IlePhe: 3.285 ± 1.023
2.053IleGly: 2.053 ± 0.351
4.107IleHis: 4.107 ± 0.702
3.285IleIle: 3.285 ± 1.791
2.875IleLys: 2.875 ± 0.738
3.285IleLeu: 3.285 ± 0.254
0.821IleMet: 0.821 ± 0.448
2.053IleAsn: 2.053 ± 0.351
2.875IlePro: 2.875 ± 0.03
0.821IleGln: 0.821 ± 0.321
3.696IleArg: 3.696 ± 1.827
4.517IleSer: 4.517 ± 0.611
3.696IleThr: 3.696 ± 0.478
2.053IleVal: 2.053 ± 0.417
0.821IleTrp: 0.821 ± 0.321
1.643IleTyr: 1.643 ± 0.127
0.0IleXaa: 0.0 ± 0.0
Lys
6.16LysAla: 6.16 ± 0.484
1.232LysCys: 1.232 ± 0.672
3.285LysAsp: 3.285 ± 0.254
3.285LysGlu: 3.285 ± 0.254
2.464LysPhe: 2.464 ± 0.575
3.696LysGly: 3.696 ± 1.246
0.411LysHis: 0.411 ± 0.224
3.696LysIle: 3.696 ± 0.29
3.285LysLys: 3.285 ± 1.023
1.232LysLeu: 1.232 ± 0.097
0.821LysMet: 0.821 ± 0.448
3.696LysAsn: 3.696 ± 0.478
1.643LysPro: 1.643 ± 0.641
2.875LysGln: 2.875 ± 0.799
2.053LysArg: 2.053 ± 0.351
5.749LysSer: 5.749 ± 3.134
1.643LysThr: 1.643 ± 0.127
2.875LysVal: 2.875 ± 0.03
0.821LysTrp: 0.821 ± 0.321
3.285LysTyr: 3.285 ± 1.023
0.0LysXaa: 0.0 ± 0.0
Leu
8.214LeuAla: 8.214 ± 1.404
2.464LeuCys: 2.464 ± 0.575
4.517LeuAsp: 4.517 ± 2.915
4.928LeuGlu: 4.928 ± 1.15
3.285LeuPhe: 3.285 ± 1.791
5.339LeuGly: 5.339 ± 0.931
1.643LeuHis: 1.643 ± 0.127
2.875LeuIle: 2.875 ± 0.799
4.107LeuLys: 4.107 ± 2.238
6.571LeuLeu: 6.571 ± 2.045
2.053LeuMet: 2.053 ± 0.417
2.464LeuAsn: 2.464 ± 0.575
3.285LeuPro: 3.285 ± 0.254
2.053LeuGln: 2.053 ± 1.119
4.107LeuArg: 4.107 ± 0.066
4.928LeuSer: 4.928 ± 0.387
4.107LeuThr: 4.107 ± 3.139
4.928LeuVal: 4.928 ± 0.381
0.411LeuTrp: 0.411 ± 0.224
3.285LeuTyr: 3.285 ± 0.254
0.0LeuXaa: 0.0 ± 0.0
Met
2.464MetAla: 2.464 ± 0.962
1.232MetCys: 1.232 ± 0.672
0.821MetAsp: 0.821 ± 0.448
2.053MetGlu: 2.053 ± 1.119
0.411MetPhe: 0.411 ± 0.544
1.643MetGly: 1.643 ± 1.409
0.411MetHis: 0.411 ± 0.224
2.053MetIle: 2.053 ± 0.351
2.464MetLys: 2.464 ± 0.575
2.464MetLeu: 2.464 ± 0.193
0.821MetMet: 0.821 ± 1.089
0.821MetAsn: 0.821 ± 0.321
1.232MetPro: 1.232 ± 0.672
0.0MetGln: 0.0 ± 0.0
1.643MetArg: 1.643 ± 0.895
1.232MetSer: 1.232 ± 0.672
2.464MetThr: 2.464 ± 0.575
1.232MetVal: 1.232 ± 0.672
0.411MetTrp: 0.411 ± 0.224
1.232MetTyr: 1.232 ± 0.672
0.0MetXaa: 0.0 ± 0.0
Asn
3.696AsnAla: 3.696 ± 0.29
1.232AsnCys: 1.232 ± 0.097
3.696AsnAsp: 3.696 ± 2.015
2.464AsnGlu: 2.464 ± 0.962
1.232AsnPhe: 1.232 ± 0.865
2.875AsnGly: 2.875 ± 1.506
0.411AsnHis: 0.411 ± 0.224
2.053AsnIle: 2.053 ± 0.417
0.821AsnLys: 0.821 ± 0.448
5.339AsnLeu: 5.339 ± 0.163
1.643AsnMet: 1.643 ± 0.127
2.053AsnAsn: 2.053 ± 1.119
4.107AsnPro: 4.107 ± 0.834
0.821AsnGln: 0.821 ± 0.321
1.643AsnArg: 1.643 ± 0.127
5.339AsnSer: 5.339 ± 0.931
2.464AsnThr: 2.464 ± 1.343
2.464AsnVal: 2.464 ± 0.193
0.821AsnTrp: 0.821 ± 0.448
3.285AsnTyr: 3.285 ± 2.05
0.0AsnXaa: 0.0 ± 0.0
Pro
2.464ProAla: 2.464 ± 0.193
0.411ProCys: 0.411 ± 0.224
3.285ProAsp: 3.285 ± 1.791
3.285ProGlu: 3.285 ± 0.514
3.696ProPhe: 3.696 ± 2.595
2.875ProGly: 2.875 ± 1.506
0.411ProHis: 0.411 ± 0.224
3.285ProIle: 3.285 ± 1.023
2.464ProLys: 2.464 ± 0.575
2.875ProLeu: 2.875 ± 0.799
0.821ProMet: 0.821 ± 0.448
2.053ProAsn: 2.053 ± 1.185
0.821ProPro: 0.821 ± 0.448
1.643ProGln: 1.643 ± 0.127
2.875ProArg: 2.875 ± 0.799
4.517ProSer: 4.517 ± 0.611
4.107ProThr: 4.107 ± 1.603
4.107ProVal: 4.107 ± 1.603
0.411ProTrp: 0.411 ± 0.224
2.464ProTyr: 2.464 ± 0.962
0.0ProXaa: 0.0 ± 0.0
Gln
3.696GlnAla: 3.696 ± 1.246
0.411GlnCys: 0.411 ± 0.224
2.053GlnAsp: 2.053 ± 0.351
0.0GlnGlu: 0.0 ± 0.0
1.643GlnPhe: 1.643 ± 0.127
3.285GlnGly: 3.285 ± 1.282
1.232GlnHis: 1.232 ± 0.672
0.411GlnIle: 0.411 ± 0.224
0.411GlnLys: 0.411 ± 0.224
2.875GlnLeu: 2.875 ± 0.799
0.821GlnMet: 0.821 ± 0.448
1.232GlnAsn: 1.232 ± 0.672
0.0GlnPro: 0.0 ± 0.0
1.232GlnGln: 1.232 ± 0.097
2.875GlnArg: 2.875 ± 0.03
3.285GlnSer: 3.285 ± 0.514
2.053GlnThr: 2.053 ± 1.954
3.285GlnVal: 3.285 ± 0.514
0.0GlnTrp: 0.0 ± 0.0
1.232GlnTyr: 1.232 ± 0.865
0.0GlnXaa: 0.0 ± 0.0
Arg
4.517ArgAla: 4.517 ± 1.694
1.643ArgCys: 1.643 ± 0.895
3.696ArgAsp: 3.696 ± 1.246
2.053ArgGlu: 2.053 ± 0.351
2.053ArgPhe: 2.053 ± 1.185
1.643ArgGly: 1.643 ± 1.409
3.285ArgHis: 3.285 ± 0.514
3.285ArgIle: 3.285 ± 2.05
2.053ArgLys: 2.053 ± 0.351
3.696ArgLeu: 3.696 ± 0.478
2.464ArgMet: 2.464 ± 1.343
1.643ArgAsn: 1.643 ± 0.895
1.643ArgPro: 1.643 ± 0.641
1.643ArgGln: 1.643 ± 0.641
4.928ArgArg: 4.928 ± 0.381
3.285ArgSer: 3.285 ± 1.023
1.232ArgThr: 1.232 ± 0.672
2.053ArgVal: 2.053 ± 0.417
0.411ArgTrp: 0.411 ± 0.544
1.643ArgTyr: 1.643 ± 0.127
0.0ArgXaa: 0.0 ± 0.0
Ser
5.749SerAla: 5.749 ± 1.476
1.232SerCys: 1.232 ± 0.672
4.928SerAsp: 4.928 ± 0.381
6.16SerGlu: 6.16 ± 1.053
1.232SerPhe: 1.232 ± 0.672
5.339SerGly: 5.339 ± 1.374
2.464SerHis: 2.464 ± 1.343
5.339SerIle: 5.339 ± 0.931
5.339SerLys: 5.339 ± 0.931
8.214SerLeu: 8.214 ± 1.404
3.285SerMet: 3.285 ± 1.024
4.107SerAsn: 4.107 ± 0.834
3.696SerPro: 3.696 ± 1.827
2.053SerGln: 2.053 ± 1.119
3.696SerArg: 3.696 ± 0.29
4.517SerSer: 4.517 ± 0.926
6.16SerThr: 6.16 ± 0.285
6.571SerVal: 6.571 ± 1.277
0.411SerTrp: 0.411 ± 0.224
2.464SerTyr: 2.464 ± 0.193
0.0SerXaa: 0.0 ± 0.0
Thr
5.339ThrAla: 5.339 ± 4.004
1.232ThrCys: 1.232 ± 0.097
3.696ThrAsp: 3.696 ± 0.29
2.464ThrGlu: 2.464 ± 0.575
2.464ThrPhe: 2.464 ± 1.343
3.696ThrGly: 3.696 ± 0.29
0.821ThrHis: 0.821 ± 0.321
2.053ThrIle: 2.053 ± 1.119
3.696ThrLys: 3.696 ± 0.478
4.928ThrLeu: 4.928 ± 1.15
2.464ThrMet: 2.464 ± 0.575
2.464ThrAsn: 2.464 ± 0.193
3.696ThrPro: 3.696 ± 1.058
1.232ThrGln: 1.232 ± 0.672
2.875ThrArg: 2.875 ± 0.738
5.339ThrSer: 5.339 ± 1.699
3.285ThrThr: 3.285 ± 2.819
5.749ThrVal: 5.749 ± 1.476
0.411ThrTrp: 0.411 ± 0.224
2.464ThrTyr: 2.464 ± 0.575
0.0ThrXaa: 0.0 ± 0.0
Val
4.517ValAla: 4.517 ± 2.147
1.232ValCys: 1.232 ± 0.097
1.643ValAsp: 1.643 ± 1.409
6.571ValGlu: 6.571 ± 2.045
2.053ValPhe: 2.053 ± 0.351
2.464ValGly: 2.464 ± 0.193
1.643ValHis: 1.643 ± 0.895
4.107ValIle: 4.107 ± 0.702
5.749ValLys: 5.749 ± 1.476
4.107ValLeu: 4.107 ± 0.066
1.232ValMet: 1.232 ± 0.672
4.107ValAsn: 4.107 ± 0.834
4.517ValPro: 4.517 ± 0.611
2.875ValGln: 2.875 ± 0.03
2.053ValArg: 2.053 ± 1.119
7.803ValSer: 7.803 ± 1.125
6.571ValThr: 6.571 ± 1.277
5.749ValVal: 5.749 ± 0.061
1.232ValTrp: 1.232 ± 0.865
2.464ValTyr: 2.464 ± 0.962
0.0ValXaa: 0.0 ± 0.0
Trp
1.643TrpAla: 1.643 ± 1.409
0.411TrpCys: 0.411 ± 0.224
0.821TrpAsp: 0.821 ± 0.448
0.411TrpGlu: 0.411 ± 0.544
0.411TrpPhe: 0.411 ± 0.224
0.821TrpGly: 0.821 ± 0.321
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.411TrpLys: 0.411 ± 0.224
0.411TrpLeu: 0.411 ± 0.224
0.0TrpMet: 0.0 ± 0.0
0.821TrpAsn: 0.821 ± 0.448
1.232TrpPro: 1.232 ± 1.633
1.643TrpGln: 1.643 ± 0.127
0.0TrpArg: 0.0 ± 0.0
0.821TrpSer: 0.821 ± 0.321
0.0TrpThr: 0.0 ± 0.0
0.821TrpVal: 0.821 ± 0.321
0.411TrpTrp: 0.411 ± 0.224
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.285TyrAla: 3.285 ± 0.254
0.411TyrCys: 0.411 ± 0.224
3.285TyrAsp: 3.285 ± 0.514
2.053TyrGlu: 2.053 ± 0.417
2.053TyrPhe: 2.053 ± 0.417
1.643TyrGly: 1.643 ± 0.641
2.464TyrHis: 2.464 ± 1.73
1.232TyrIle: 1.232 ± 0.672
2.053TyrLys: 2.053 ± 1.119
5.339TyrLeu: 5.339 ± 0.605
0.411TyrMet: 0.411 ± 0.224
2.875TyrAsn: 2.875 ± 1.506
1.643TyrPro: 1.643 ± 0.895
0.821TyrGln: 0.821 ± 1.089
2.464TyrArg: 2.464 ± 0.575
4.517TyrSer: 4.517 ± 0.158
2.875TyrThr: 2.875 ± 0.03
1.643TyrVal: 1.643 ± 0.641
0.0TyrTrp: 0.0 ± 0.0
0.821TyrTyr: 0.821 ± 0.448
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2436 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski