Amino acid dipepetide frequency for Beihai picorna-like virus 81

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.401AlaAla: 7.401 ± 0.011
0.822AlaCys: 0.822 ± 0.507
2.878AlaAsp: 2.878 ± 1.493
3.701AlaGlu: 3.701 ± 0.975
1.645AlaPhe: 1.645 ± 0.293
5.757AlaGly: 5.757 ± 0.936
0.822AlaHis: 0.822 ± 0.146
4.112AlaIle: 4.112 ± 0.079
3.289AlaLys: 3.289 ± 0.586
4.934AlaLeu: 4.934 ± 1.736
0.822AlaMet: 0.822 ± 0.146
3.701AlaAsn: 3.701 ± 0.321
3.289AlaPro: 3.289 ± 0.586
2.467AlaGln: 2.467 ± 0.439
5.345AlaArg: 5.345 ± 0.028
3.289AlaSer: 3.289 ± 1.24
4.934AlaThr: 4.934 ± 0.428
1.645AlaVal: 1.645 ± 1.6
0.0AlaTrp: 0.0 ± 0.0
2.467AlaTyr: 2.467 ± 0.214
0.0AlaXaa: 0.0 ± 0.0
Cys
0.411CysAla: 0.411 ± 0.4
0.0CysCys: 0.0 ± 0.0
1.234CysAsp: 1.234 ± 0.761
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.234CysGly: 1.234 ± 0.761
0.0CysHis: 0.0 ± 0.0
0.822CysIle: 0.822 ± 0.507
1.645CysLys: 1.645 ± 1.014
1.234CysLeu: 1.234 ± 0.547
0.411CysMet: 0.411 ± 0.254
0.411CysAsn: 0.411 ± 0.254
1.234CysPro: 1.234 ± 0.107
0.0CysGln: 0.0 ± 0.0
1.234CysArg: 1.234 ± 0.761
0.822CysSer: 0.822 ± 0.146
0.411CysThr: 0.411 ± 0.254
1.234CysVal: 1.234 ± 0.547
0.0CysTrp: 0.0 ± 0.0
1.234CysTyr: 1.234 ± 0.761
0.0CysXaa: 0.0 ± 0.0
Asp
2.878AspAla: 2.878 ± 0.468
0.411AspCys: 0.411 ± 0.254
4.934AspAsp: 4.934 ± 2.39
3.701AspGlu: 3.701 ± 0.321
3.701AspPhe: 3.701 ± 0.332
4.934AspGly: 4.934 ± 0.225
1.234AspHis: 1.234 ± 0.107
6.168AspIle: 6.168 ± 0.118
2.878AspLys: 2.878 ± 1.122
6.168AspLeu: 6.168 ± 0.536
2.056AspMet: 2.056 ± 0.039
3.701AspAsn: 3.701 ± 0.986
2.056AspPro: 2.056 ± 0.614
1.645AspGln: 1.645 ± 0.293
0.822AspArg: 0.822 ± 0.507
3.701AspSer: 3.701 ± 0.986
3.289AspThr: 3.289 ± 0.068
5.345AspVal: 5.345 ± 1.933
2.056AspTrp: 2.056 ± 0.614
2.467AspTyr: 2.467 ± 0.214
0.0AspXaa: 0.0 ± 0.0
Glu
4.523GluAla: 4.523 ± 0.829
0.0GluCys: 0.0 ± 0.0
4.112GluAsp: 4.112 ± 0.575
2.878GluGlu: 2.878 ± 1.122
2.878GluPhe: 2.878 ± 0.186
2.878GluGly: 2.878 ± 0.468
0.411GluHis: 0.411 ± 0.254
4.934GluIle: 4.934 ± 1.736
2.878GluLys: 2.878 ± 0.468
7.401GluLeu: 7.401 ± 0.011
2.467GluMet: 2.467 ± 0.214
2.056GluAsn: 2.056 ± 0.039
0.822GluPro: 0.822 ± 0.8
3.289GluGln: 3.289 ± 0.068
2.056GluArg: 2.056 ± 0.693
4.523GluSer: 4.523 ± 0.829
3.701GluThr: 3.701 ± 0.321
4.112GluVal: 4.112 ± 1.882
1.645GluTrp: 1.645 ± 0.361
0.822GluTyr: 0.822 ± 0.146
0.0GluXaa: 0.0 ± 0.0
Phe
1.645PheAla: 1.645 ± 0.361
0.822PheCys: 0.822 ± 0.8
3.289PheAsp: 3.289 ± 0.721
3.701PheGlu: 3.701 ± 1.629
0.411PhePhe: 0.411 ± 0.254
2.056PheGly: 2.056 ± 0.039
0.822PheHis: 0.822 ± 0.146
2.056PheIle: 2.056 ± 0.693
1.645PheLys: 1.645 ± 0.947
2.878PheLeu: 2.878 ± 1.122
1.645PheMet: 1.645 ± 0.444
2.467PheAsn: 2.467 ± 0.868
1.234PhePro: 1.234 ± 0.547
2.056PheGln: 2.056 ± 0.693
1.645PheArg: 1.645 ± 0.947
5.757PheSer: 5.757 ± 1.679
3.701PheThr: 3.701 ± 0.332
4.934PheVal: 4.934 ± 1.082
1.234PheTrp: 1.234 ± 0.107
2.056PheTyr: 2.056 ± 0.614
0.0PheXaa: 0.0 ± 0.0
Gly
1.234GlyAla: 1.234 ± 0.547
0.822GlyCys: 0.822 ± 0.146
3.701GlyAsp: 3.701 ± 0.321
4.523GlyGlu: 4.523 ± 0.479
3.701GlyPhe: 3.701 ± 0.975
3.289GlyGly: 3.289 ± 0.586
0.0GlyHis: 0.0 ± 0.0
1.645GlyIle: 1.645 ± 0.361
3.701GlyLys: 3.701 ± 2.282
4.523GlyLeu: 4.523 ± 0.479
1.234GlyMet: 1.234 ± 0.761
3.289GlyAsn: 3.289 ± 0.068
2.467GlyPro: 2.467 ± 0.214
4.523GlyGln: 4.523 ± 0.479
2.878GlyArg: 2.878 ± 1.122
3.289GlySer: 3.289 ± 1.24
5.757GlyThr: 5.757 ± 0.282
3.701GlyVal: 3.701 ± 0.332
1.645GlyTrp: 1.645 ± 0.293
2.467GlyTyr: 2.467 ± 0.868
0.0GlyXaa: 0.0 ± 0.0
His
1.645HisAla: 1.645 ± 0.293
0.0HisCys: 0.0 ± 0.0
1.645HisAsp: 1.645 ± 0.947
0.411HisGlu: 0.411 ± 0.4
0.822HisPhe: 0.822 ± 0.507
1.234HisGly: 1.234 ± 0.107
0.0HisHis: 0.0 ± 0.0
0.822HisIle: 0.822 ± 0.8
1.234HisLys: 1.234 ± 0.107
2.056HisLeu: 2.056 ± 1.268
0.822HisMet: 0.822 ± 0.146
0.822HisAsn: 0.822 ± 0.507
0.822HisPro: 0.822 ± 0.146
0.411HisGln: 0.411 ± 0.254
1.234HisArg: 1.234 ± 0.547
1.645HisSer: 1.645 ± 0.293
0.411HisThr: 0.411 ± 0.254
0.822HisVal: 0.822 ± 0.507
0.0HisTrp: 0.0 ± 0.0
1.234HisTyr: 1.234 ± 0.107
0.0HisXaa: 0.0 ± 0.0
Ile
2.056IleAla: 2.056 ± 0.614
1.234IleCys: 1.234 ± 0.761
6.579IleAsp: 6.579 ± 2.479
4.523IleGlu: 4.523 ± 0.175
4.112IlePhe: 4.112 ± 1.229
4.934IleGly: 4.934 ± 0.225
0.822IleHis: 0.822 ± 0.146
0.822IleIle: 0.822 ± 0.146
3.701IleLys: 3.701 ± 0.332
4.523IleLeu: 4.523 ± 0.175
0.411IleMet: 0.411 ± 0.4
2.878IleAsn: 2.878 ± 0.186
2.467IlePro: 2.467 ± 2.401
1.645IleGln: 1.645 ± 0.293
3.701IleArg: 3.701 ± 0.321
4.523IleSer: 4.523 ± 1.786
2.056IleThr: 2.056 ± 0.614
3.289IleVal: 3.289 ± 0.068
0.0IleTrp: 0.0 ± 0.0
0.822IleTyr: 0.822 ± 0.507
0.0IleXaa: 0.0 ± 0.0
Lys
2.056LysAla: 2.056 ± 0.614
0.411LysCys: 0.411 ± 0.254
2.056LysAsp: 2.056 ± 1.268
2.056LysGlu: 2.056 ± 1.268
1.645LysPhe: 1.645 ± 1.014
2.467LysGly: 2.467 ± 0.214
0.0LysHis: 0.0 ± 0.0
2.467LysIle: 2.467 ± 0.214
1.645LysLys: 1.645 ± 1.014
4.112LysLeu: 4.112 ± 1.229
0.411LysMet: 0.411 ± 0.254
2.878LysAsn: 2.878 ± 1.122
2.878LysPro: 2.878 ± 0.84
1.645LysGln: 1.645 ± 0.293
3.289LysArg: 3.289 ± 0.068
3.289LysSer: 3.289 ± 0.068
4.934LysThr: 4.934 ± 0.879
4.523LysVal: 4.523 ± 0.479
0.411LysTrp: 0.411 ± 0.4
2.878LysTyr: 2.878 ± 0.186
0.0LysXaa: 0.0 ± 0.0
Leu
4.934LeuAla: 4.934 ± 0.879
1.645LeuCys: 1.645 ± 1.014
7.812LeuAsp: 7.812 ± 1.55
3.701LeuGlu: 3.701 ± 0.332
3.701LeuPhe: 3.701 ± 0.332
3.701LeuGly: 3.701 ± 0.975
2.467LeuHis: 2.467 ± 0.214
5.757LeuIle: 5.757 ± 1.679
4.112LeuLys: 4.112 ± 0.079
8.224LeuLeu: 8.224 ± 0.158
1.234LeuMet: 1.234 ± 0.547
5.345LeuAsn: 5.345 ± 1.933
4.934LeuPro: 4.934 ± 0.225
3.701LeuGln: 3.701 ± 0.975
6.168LeuArg: 6.168 ± 1.843
5.757LeuSer: 5.757 ± 0.282
7.812LeuThr: 7.812 ± 2.372
6.168LeuVal: 6.168 ± 0.536
0.822LeuTrp: 0.822 ± 0.507
4.112LeuTyr: 4.112 ± 0.575
0.0LeuXaa: 0.0 ± 0.0
Met
2.878MetAla: 2.878 ± 1.122
0.411MetCys: 0.411 ± 0.254
1.645MetAsp: 1.645 ± 0.361
1.645MetGlu: 1.645 ± 0.293
0.411MetPhe: 0.411 ± 0.254
0.411MetGly: 0.411 ± 0.4
0.411MetHis: 0.411 ± 0.4
2.056MetIle: 2.056 ± 0.039
1.234MetLys: 1.234 ± 0.107
2.878MetLeu: 2.878 ± 0.186
0.822MetMet: 0.822 ± 0.507
0.822MetAsn: 0.822 ± 0.146
2.056MetPro: 2.056 ± 0.693
0.822MetGln: 0.822 ± 0.146
2.467MetArg: 2.467 ± 0.214
0.822MetSer: 0.822 ± 0.146
3.701MetThr: 3.701 ± 0.321
0.411MetVal: 0.411 ± 0.254
0.0MetTrp: 0.0 ± 0.0
0.411MetTyr: 0.411 ± 0.254
0.0MetXaa: 0.0 ± 0.0
Asn
4.934AsnAla: 4.934 ± 0.879
1.645AsnCys: 1.645 ± 1.014
2.056AsnAsp: 2.056 ± 0.039
4.112AsnGlu: 4.112 ± 0.575
0.822AsnPhe: 0.822 ± 0.146
2.878AsnGly: 2.878 ± 0.468
0.411AsnHis: 0.411 ± 0.254
0.822AsnIle: 0.822 ± 0.8
0.822AsnLys: 0.822 ± 0.507
3.701AsnLeu: 3.701 ± 0.986
1.234AsnMet: 1.234 ± 0.107
2.056AsnAsn: 2.056 ± 1.347
4.112AsnPro: 4.112 ± 1.229
2.467AsnGln: 2.467 ± 0.868
2.467AsnArg: 2.467 ± 0.214
5.345AsnSer: 5.345 ± 1.279
6.168AsnThr: 6.168 ± 2.733
3.701AsnVal: 3.701 ± 0.321
0.822AsnTrp: 0.822 ± 0.507
1.645AsnTyr: 1.645 ± 0.361
0.0AsnXaa: 0.0 ± 0.0
Pro
3.701ProAla: 3.701 ± 1.64
0.822ProCys: 0.822 ± 0.507
3.289ProAsp: 3.289 ± 0.068
3.289ProGlu: 3.289 ± 1.375
2.056ProPhe: 2.056 ± 0.039
1.645ProGly: 1.645 ± 1.014
0.822ProHis: 0.822 ± 0.8
1.645ProIle: 1.645 ± 0.293
1.234ProLys: 1.234 ± 0.547
5.345ProLeu: 5.345 ± 0.028
1.645ProMet: 1.645 ± 0.361
2.878ProAsn: 2.878 ± 0.84
2.467ProPro: 2.467 ± 1.747
2.878ProGln: 2.878 ± 0.186
1.645ProArg: 1.645 ± 0.293
3.701ProSer: 3.701 ± 0.975
4.523ProThr: 4.523 ± 2.44
4.112ProVal: 4.112 ± 1.386
0.411ProTrp: 0.411 ± 0.4
3.701ProTyr: 3.701 ± 0.986
0.0ProXaa: 0.0 ± 0.0
Gln
4.112GlnAla: 4.112 ± 2.04
1.234GlnCys: 1.234 ± 0.761
2.467GlnAsp: 2.467 ± 0.439
2.878GlnGlu: 2.878 ± 1.775
0.822GlnPhe: 0.822 ± 0.146
2.467GlnGly: 2.467 ± 1.093
1.234GlnHis: 1.234 ± 0.107
2.878GlnIle: 2.878 ± 1.493
1.645GlnLys: 1.645 ± 0.361
2.467GlnLeu: 2.467 ± 0.439
1.645GlnMet: 1.645 ± 0.293
2.056GlnAsn: 2.056 ± 0.039
3.701GlnPro: 3.701 ± 0.975
2.878GlnGln: 2.878 ± 0.186
2.467GlnArg: 2.467 ± 0.214
2.056GlnSer: 2.056 ± 0.039
1.645GlnThr: 1.645 ± 0.947
1.234GlnVal: 1.234 ± 0.761
0.822GlnTrp: 0.822 ± 0.507
1.234GlnTyr: 1.234 ± 0.547
0.0GlnXaa: 0.0 ± 0.0
Arg
2.878ArgAla: 2.878 ± 0.468
0.822ArgCys: 0.822 ± 0.146
1.234ArgAsp: 1.234 ± 0.547
4.112ArgGlu: 4.112 ± 1.882
4.112ArgPhe: 4.112 ± 0.732
2.878ArgGly: 2.878 ± 0.468
1.234ArgHis: 1.234 ± 0.761
5.345ArgIle: 5.345 ± 0.028
2.467ArgLys: 2.467 ± 0.868
4.112ArgLeu: 4.112 ± 1.882
1.645ArgMet: 1.645 ± 0.213
2.878ArgAsn: 2.878 ± 1.122
3.701ArgPro: 3.701 ± 0.321
1.645ArgGln: 1.645 ± 0.947
3.701ArgArg: 3.701 ± 2.282
3.289ArgSer: 3.289 ± 1.24
4.112ArgThr: 4.112 ± 0.575
2.056ArgVal: 2.056 ± 0.614
0.0ArgTrp: 0.0 ± 0.0
2.467ArgTyr: 2.467 ± 0.868
0.0ArgXaa: 0.0 ± 0.0
Ser
3.701SerAla: 3.701 ± 0.332
0.822SerCys: 0.822 ± 0.146
3.701SerAsp: 3.701 ± 0.332
2.878SerGlu: 2.878 ± 0.186
4.112SerPhe: 4.112 ± 2.04
2.878SerGly: 2.878 ± 0.468
2.467SerHis: 2.467 ± 0.439
3.701SerIle: 3.701 ± 0.321
3.289SerLys: 3.289 ± 0.586
9.868SerLeu: 9.868 ± 5.026
2.467SerMet: 2.467 ± 0.214
5.345SerAsn: 5.345 ± 0.028
4.112SerPro: 4.112 ± 0.079
1.234SerGln: 1.234 ± 0.107
3.289SerArg: 3.289 ± 2.029
4.112SerSer: 4.112 ± 0.575
2.878SerThr: 2.878 ± 2.801
5.345SerVal: 5.345 ± 0.625
1.234SerTrp: 1.234 ± 0.547
1.645SerTyr: 1.645 ± 0.361
0.0SerXaa: 0.0 ± 0.0
Thr
4.934ThrAla: 4.934 ± 1.082
0.0ThrCys: 0.0 ± 0.0
2.056ThrAsp: 2.056 ± 0.039
3.289ThrGlu: 3.289 ± 2.547
3.701ThrPhe: 3.701 ± 0.332
4.112ThrGly: 4.112 ± 2.04
1.645ThrHis: 1.645 ± 0.293
2.056ThrIle: 2.056 ± 0.039
3.701ThrLys: 3.701 ± 0.332
8.224ThrLeu: 8.224 ± 0.158
2.056ThrMet: 2.056 ± 0.039
3.289ThrAsn: 3.289 ± 0.586
4.934ThrPro: 4.934 ± 2.186
4.934ThrGln: 4.934 ± 0.225
4.934ThrArg: 4.934 ± 1.736
6.168ThrSer: 6.168 ± 2.733
3.701ThrThr: 3.701 ± 1.64
5.345ThrVal: 5.345 ± 1.279
0.0ThrTrp: 0.0 ± 0.0
2.878ThrTyr: 2.878 ± 0.468
0.0ThrXaa: 0.0 ± 0.0
Val
3.289ValAla: 3.289 ± 0.721
0.822ValCys: 0.822 ± 0.146
4.934ValAsp: 4.934 ± 0.879
3.289ValGlu: 3.289 ± 0.721
2.056ValPhe: 2.056 ± 0.693
3.289ValGly: 3.289 ± 0.586
1.234ValHis: 1.234 ± 0.107
4.112ValIle: 4.112 ± 0.732
2.878ValLys: 2.878 ± 0.468
6.168ValLeu: 6.168 ± 0.118
1.645ValMet: 1.645 ± 0.361
2.467ValAsn: 2.467 ± 0.214
3.701ValPro: 3.701 ± 0.332
2.467ValGln: 2.467 ± 1.093
2.878ValArg: 2.878 ± 0.84
5.345ValSer: 5.345 ± 0.682
4.523ValThr: 4.523 ± 0.479
2.878ValVal: 2.878 ± 1.122
0.822ValTrp: 0.822 ± 0.507
4.112ValTyr: 4.112 ± 0.575
0.0ValXaa: 0.0 ± 0.0
Trp
0.411TrpAla: 0.411 ± 0.4
0.0TrpCys: 0.0 ± 0.0
0.822TrpAsp: 0.822 ± 0.507
0.411TrpGlu: 0.411 ± 0.254
2.056TrpPhe: 2.056 ± 0.039
0.411TrpGly: 0.411 ± 0.254
0.411TrpHis: 0.411 ± 0.254
1.234TrpIle: 1.234 ± 0.107
1.234TrpLys: 1.234 ± 0.107
0.822TrpLeu: 0.822 ± 0.8
0.0TrpMet: 0.0 ± 0.0
0.822TrpAsn: 0.822 ± 0.146
0.0TrpPro: 0.0 ± 0.0
0.411TrpGln: 0.411 ± 0.254
0.822TrpArg: 0.822 ± 0.146
0.411TrpSer: 0.411 ± 0.4
1.645TrpThr: 1.645 ± 1.014
0.0TrpVal: 0.0 ± 0.0
0.411TrpTrp: 0.411 ± 0.254
0.822TrpTyr: 0.822 ± 0.507
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.701TyrAla: 3.701 ± 0.332
0.822TyrCys: 0.822 ± 0.146
3.701TyrAsp: 3.701 ± 0.975
3.289TyrGlu: 3.289 ± 0.586
3.289TyrPhe: 3.289 ± 0.721
4.934TyrGly: 4.934 ± 1.736
1.645TyrHis: 1.645 ± 0.361
1.645TyrIle: 1.645 ± 0.293
0.822TyrLys: 0.822 ± 0.507
2.467TyrLeu: 2.467 ± 0.868
1.234TyrMet: 1.234 ± 0.107
2.056TyrAsn: 2.056 ± 0.039
0.822TyrPro: 0.822 ± 0.146
0.822TyrGln: 0.822 ± 0.507
2.056TyrArg: 2.056 ± 0.614
1.234TyrSer: 1.234 ± 0.761
2.056TyrThr: 2.056 ± 0.693
2.056TyrVal: 2.056 ± 0.614
0.822TyrTrp: 0.822 ± 0.146
1.234TyrTyr: 1.234 ± 0.547
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2433 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski