Amino acid dipeptide frequency for Beihai picorna-like virus 74

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
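As a minimal sketch of how such a frequency could be computed, overlapping residue pairs can be counted inside each protein and divided by the total number of pairs. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and this is not necessarily how Proteome-pI builds its table.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences.

    Pairs are counted with a sliding window of width 2 inside each protein,
    so no pair spans two different proteins (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not the viral proteome).
freqs = dipeptide_frequencies(["AAC", "ACA"])
```

Because the pairs are ordered, "AC" and "CA" are tallied separately, which is why the full table has 20 × 20 entries rather than 210.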

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random and all amino acids were equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
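The arithmetic above can be sketched as follows. The helper names `permille_to_fraction` and `classify` are hypothetical, and the simple comparison against the uniform expectation is an assumption of this sketch; the exact rule the page uses for its red and blue marking is not stated.

```python
# Uniform expectation over the 20 standard amino acids: 1/400 of all pairs.
N_STANDARD = 20
expected_permille = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille = 0.25%


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Two values taken from the table: LeuSer and CysCys.
leu_ser = 7.94
cys_cys = 0.0
labels = (classify(leu_ser), classify(cys_cys))
```

Under this assumed rule LeuSer (7.94‰) counts as overrepresented and CysCys (0.0‰) as underrepresented.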

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency ± bootstrap error in ‰.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.367 ± 0.544 | 1.985 ± 1.039 | 2.779 ± 0.868 | 4.367 ± 0.037 | 4.764 ± 0.171 | 2.779 ± 0.287 | 0.794 ± 0.416 | 5.161 ± 0.959 | 4.764 ± 1.913 | 5.558 ± 2.328 | 1.191 ± 1.118 | 4.367 ± 1.198 | 3.97 ± 1.986 | 2.779 ± 0.868 | 3.176 ± 1.082 | 5.558 ± 1.736 | 2.779 ± 0.868 | 2.779 ± 0.868 | 1.191 ± 1.118 | 3.176 ± 1.241 | 0.0 ± 0.0 |
| Cys | 2.779 ± 0.874 | 0.0 ± 0.0 | 0.794 ± 0.165 | 0.794 ± 0.416 | 0.397 ± 0.208 | 1.985 ± 1.039 | 0.0 ± 0.0 | 0.794 ± 0.165 | 0.0 ± 0.0 | 1.588 ± 0.33 | 0.794 ± 0.416 | 0.794 ± 0.165 | 0.397 ± 0.208 | 0.397 ± 0.208 | 1.191 ± 0.043 | 1.985 ± 0.458 | 0.0 ± 0.0 | 1.191 ± 0.043 | 0.397 ± 0.208 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 2.382 ± 0.495 | 0.794 ± 0.416 | 3.97 ± 0.245 | 5.558 ± 1.167 | 4.764 ± 0.752 | 2.779 ± 0.874 | 0.397 ± 0.208 | 3.573 ± 0.128 | 1.985 ± 0.122 | 4.367 ± 0.037 | 0.0 ± 0.18 | 2.779 ± 1.449 | 4.367 ± 1.124 | 3.176 ± 0.501 | 0.0 ± 0.0 | 4.367 ± 1.779 | 1.985 ± 0.703 | 4.764 ± 0.41 | 1.191 ± 0.623 | 1.985 ± 0.458 | 0.0 ± 0.0 |
| Glu | 3.97 ± 0.336 | 0.794 ± 0.165 | 3.176 ± 0.08 | 3.176 ± 1.082 | 3.176 ± 1.082 | 2.382 ± 0.666 | 0.397 ± 0.208 | 6.749 ± 0.049 | 2.779 ± 0.287 | 4.764 ± 0.171 | 4.764 ± 1.913 | 3.573 ± 0.128 | 1.985 ± 0.122 | 2.779 ± 0.293 | 2.382 ± 0.086 | 3.176 ± 0.66 | 2.779 ± 0.874 | 5.558 ± 0.006 | 1.588 ± 0.33 | 3.97 ± 0.917 | 0.0 ± 0.0 |
| Phe | 3.573 ± 1.614 | 0.397 ± 0.208 | 1.985 ± 0.458 | 3.573 ± 0.452 | 1.985 ± 1.039 | 2.779 ± 0.287 | 1.191 ± 0.043 | 1.191 ± 0.043 | 2.382 ± 1.247 | 2.779 ± 1.455 | 1.985 ± 0.161 | 1.191 ± 0.043 | 1.985 ± 0.122 | 1.985 ± 0.703 | 3.573 ± 0.709 | 7.146 ± 0.324 | 2.779 ± 0.287 | 4.367 ± 0.617 | 0.0 ± 0.0 | 1.588 ± 0.251 | 0.0 ± 0.0 |
| Gly | 3.97 ± 1.406 | 0.794 ± 0.165 | 5.161 ± 2.121 | 5.558 ± 0.575 | 1.985 ± 0.703 | 2.382 ± 1.656 | 0.794 ± 0.416 | 1.588 ± 0.33 | 3.573 ± 0.709 | 2.382 ± 0.086 | 1.985 ± 0.458 | 3.176 ± 0.501 | 3.573 ± 0.452 | 1.191 ± 0.623 | 3.176 ± 1.821 | 3.176 ± 0.08 | 3.176 ± 0.501 | 3.176 ± 0.08 | 0.794 ± 0.746 | 4.367 ± 1.198 | 0.0 ± 0.0 |
| His | 0.397 ± 0.208 | 0.794 ± 0.416 | 0.397 ± 0.208 | 0.794 ± 0.165 | 1.191 ± 0.043 | 1.588 ± 0.251 | 0.0 ± 0.0 | 1.191 ± 0.043 | 0.794 ± 0.416 | 1.191 ± 0.043 | 0.397 ± 0.208 | 1.588 ± 0.33 | 0.794 ± 0.416 | 0.794 ± 0.165 | 0.794 ± 0.416 | 1.588 ± 0.251 | 0.397 ± 0.373 | 0.794 ± 0.416 | 0.397 ± 0.208 | 1.588 ± 0.911 | 0.0 ± 0.0 |
| Ile | 3.176 ± 1.662 | 0.397 ± 0.208 | 5.161 ± 0.379 | 3.97 ± 1.497 | 1.191 ± 0.538 | 4.367 ± 1.779 | 1.588 ± 0.33 | 2.779 ± 0.293 | 1.985 ± 0.458 | 6.749 ± 1.21 | 0.397 ± 0.208 | 1.985 ± 0.703 | 5.161 ± 1.944 | 3.573 ± 1.033 | 3.176 ± 1.082 | 4.367 ± 1.198 | 4.764 ± 0.41 | 4.367 ± 0.037 | 0.794 ± 0.165 | 0.794 ± 0.416 | 0.0 ± 0.0 |
| Lys | 3.97 ± 0.917 | 1.588 ± 0.831 | 3.573 ± 1.29 | 3.97 ± 1.497 | 2.382 ± 0.666 | 2.779 ± 0.874 | 0.794 ± 0.416 | 5.161 ± 0.959 | 3.97 ± 0.336 | 4.764 ± 0.752 | 1.588 ± 0.251 | 1.191 ± 0.043 | 2.382 ± 0.495 | 1.191 ± 0.538 | 2.382 ± 0.666 | 3.176 ± 0.66 | 5.558 ± 1.167 | 5.955 ± 2.536 | 1.191 ± 0.623 | 2.779 ± 0.874 | 0.0 ± 0.0 |
| Leu | 4.764 ± 1.571 | 0.794 ± 0.416 | 3.573 ± 0.128 | 3.176 ± 0.501 | 2.779 ± 1.455 | 3.176 ± 0.501 | 2.382 ± 0.086 | 3.97 ± 1.497 | 6.352 ± 2.163 | 3.573 ± 1.87 | 1.588 ± 0.251 | 5.161 ± 0.959 | 3.97 ± 1.406 | 2.382 ± 0.666 | 3.573 ± 0.452 | 7.94 ± 0.672 | 6.352 ± 1.583 | 6.749 ± 1.112 | 0.794 ± 0.416 | 6.352 ± 1.002 | 0.0 ± 0.0 |
| Met | 3.176 ± 0.501 | 0.397 ± 0.208 | 2.382 ± 0.086 | 0.794 ± 0.416 | 1.985 ± 0.703 | 1.191 ± 0.538 | 0.0 ± 0.0 | 0.397 ± 0.373 | 2.382 ± 0.666 | 2.779 ± 0.874 | 0.794 ± 0.416 | 1.588 ± 0.251 | 1.191 ± 1.118 | 0.397 ± 0.208 | 0.397 ± 0.208 | 1.191 ± 0.043 | 3.176 ± 0.08 | 2.382 ± 1.247 | 0.0 ± 0.0 | 1.191 ± 0.043 | 0.0 ± 0.0 |
| Asn | 3.573 ± 0.452 | 0.397 ± 0.373 | 2.779 ± 0.287 | 1.985 ± 0.122 | 2.779 ± 0.287 | 2.779 ± 0.868 | 0.397 ± 0.373 | 3.97 ± 0.336 | 2.382 ± 0.666 | 4.367 ± 0.544 | 1.191 ± 0.538 | 0.794 ± 0.416 | 1.985 ± 0.703 | 1.191 ± 0.043 | 3.176 ± 0.501 | 4.367 ± 1.198 | 0.794 ± 0.416 | 4.764 ± 1.332 | 1.191 ± 0.538 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Pro | 3.97 ± 0.825 | 0.397 ± 0.208 | 3.573 ± 2.194 | 3.176 ± 0.501 | 1.588 ± 0.33 | 3.176 ± 1.241 | 0.0 ± 0.0 | 2.382 ± 1.076 | 3.176 ± 0.08 | 7.146 ± 0.905 | 1.985 ± 0.122 | 1.985 ± 0.122 | 0.794 ± 0.165 | 1.191 ± 0.623 | 1.191 ± 0.043 | 2.382 ± 0.495 | 4.367 ± 2.359 | 4.367 ± 2.359 | 0.397 ± 0.208 | 4.367 ± 2.359 | 0.0 ± 0.0 |
| Gln | 5.161 ± 0.782 | 0.397 ± 0.208 | 1.985 ± 0.122 | 2.779 ± 0.293 | 0.0 ± 0.0 | 2.382 ± 0.495 | 0.397 ± 0.208 | 1.588 ± 0.251 | 1.588 ± 0.251 | 2.382 ± 0.086 | 0.794 ± 0.416 | 1.191 ± 0.043 | 2.779 ± 0.868 | 1.588 ± 0.251 | 2.779 ± 1.455 | 2.382 ± 0.495 | 0.794 ± 0.416 | 3.573 ± 1.614 | 0.397 ± 0.208 | 1.191 ± 0.623 | 0.0 ± 0.0 |
| Arg | 1.985 ± 0.122 | 0.397 ± 0.208 | 1.588 ± 0.911 | 2.779 ± 0.874 | 3.97 ± 1.406 | 2.382 ± 1.076 | 1.191 ± 0.623 | 3.573 ± 0.709 | 5.955 ± 2.536 | 1.985 ± 0.458 | 0.794 ± 0.165 | 2.779 ± 1.455 | 2.382 ± 0.666 | 1.588 ± 0.251 | 3.176 ± 1.082 | 3.573 ± 0.128 | 1.985 ± 0.122 | 2.779 ± 0.293 | 0.397 ± 0.208 | 2.382 ± 1.076 | 0.0 ± 0.0 |
| Ser | 3.176 ± 0.66 | 0.794 ± 0.746 | 2.779 ± 0.293 | 3.97 ± 0.825 | 2.779 ± 0.868 | 6.352 ± 0.74 | 1.985 ± 0.122 | 6.352 ± 1.901 | 5.558 ± 1.167 | 7.543 ± 0.116 | 1.588 ± 0.251 | 3.573 ± 1.033 | 3.97 ± 1.406 | 3.176 ± 0.08 | 2.382 ± 0.495 | 6.749 ± 3.435 | 7.543 ± 2.439 | 3.97 ± 0.917 | 0.794 ± 0.746 | 1.191 ± 0.043 | 0.0 ± 0.0 |
| Thr | 4.367 ± 0.037 | 1.588 ± 0.831 | 4.367 ± 0.544 | 2.382 ± 0.086 | 3.97 ± 0.917 | 5.161 ± 0.782 | 2.382 ± 0.086 | 3.573 ± 1.033 | 1.588 ± 0.251 | 3.97 ± 0.917 | 1.588 ± 0.911 | 2.779 ± 0.868 | 1.985 ± 1.284 | 2.779 ± 0.293 | 1.985 ± 0.122 | 4.764 ± 1.571 | 2.779 ± 0.287 | 4.764 ± 0.41 | 0.0 ± 0.0 | 3.176 ± 0.08 | 0.0 ± 0.0 |
| Val | 6.352 ± 1.901 | 1.191 ± 0.623 | 2.779 ± 0.287 | 8.337 ± 0.281 | 2.779 ± 0.293 | 3.573 ± 0.709 | 1.985 ± 0.703 | 4.367 ± 0.617 | 4.764 ± 0.171 | 5.955 ± 1.375 | 1.985 ± 1.039 | 1.588 ± 0.33 | 3.573 ± 2.194 | 2.382 ± 1.076 | 5.955 ± 1.375 | 5.161 ± 0.202 | 3.573 ± 1.033 | 1.985 ± 0.458 | 0.397 ± 0.208 | 2.779 ± 0.874 | 0.0 ± 0.0 |
| Trp | 0.794 ± 0.416 | 0.397 ± 0.373 | 0.794 ± 0.416 | 0.397 ± 0.208 | 1.191 ± 0.043 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.794 ± 0.746 | 0.0 ± 0.0 | 1.588 ± 0.911 | 1.191 ± 0.043 | 0.397 ± 0.208 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.397 ± 0.373 | 0.794 ± 0.165 | 2.382 ± 0.666 | 1.191 ± 1.118 | 0.397 ± 0.373 | 1.191 ± 0.623 | 0.0 ± 0.0 |
| Tyr | 2.382 ± 0.666 | 1.985 ± 0.703 | 1.985 ± 0.458 | 2.382 ± 0.086 | 2.779 ± 0.868 | 1.985 ± 1.039 | 0.794 ± 0.416 | 1.588 ± 0.251 | 4.367 ± 0.544 | 3.97 ± 0.245 | 0.794 ± 0.746 | 2.382 ± 0.666 | 4.367 ± 1.198 | 1.588 ± 0.831 | 2.779 ± 0.868 | 2.382 ± 0.495 | 1.985 ± 0.122 | 1.985 ± 0.122 | 1.588 ± 0.33 | 0.397 ± 0.373 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 2 proteins (2520 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
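A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The names `pair_counts` and `bootstrap_error`, the fixed seed, and the use of the sample standard deviation are assumptions of this sketch, not details confirmed by the source.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Each round draws len(proteins) proteins with replacement and recomputes
    the frequency of `pair`; the result is the standard deviation across
    rounds (an assumed choice of spread measure).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(per_protein) for _ in proteins]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input (hypothetical sequences): identical proteins give zero spread.
err_same = bootstrap_error(["AAC", "AAC"], "AA")
err_diff = bootstrap_error(["AAAA", "CCCC"], "AA")
```

With only two proteins there are just three distinct resamples (the first protein twice, the second twice, or one of each), so the estimated errors can take only a coarse set of values.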

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski