Amino acid dipepetide frequency for Beihai picorna-like virus 71

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.433AlaAla: 6.433 ± 1.268
0.715AlaCys: 0.715 ± 0.348
2.859AlaAsp: 2.859 ± 0.293
3.931AlaGlu: 3.931 ± 0.265
2.859AlaPhe: 2.859 ± 0.808
5.004AlaGly: 5.004 ± 1.338
1.072AlaHis: 1.072 ± 0.523
5.718AlaIle: 5.718 ± 1.066
3.931AlaLys: 3.931 ± 0.815
4.646AlaLeu: 4.646 ± 1.714
2.144AlaMet: 2.144 ± 0.056
2.144AlaAsn: 2.144 ± 0.606
4.289AlaPro: 4.289 ± 0.111
6.076AlaGln: 6.076 ± 0.341
4.646AlaArg: 4.646 ± 0.063
3.574AlaSer: 3.574 ± 1.01
6.433AlaThr: 6.433 ± 0.167
6.076AlaVal: 6.076 ± 0.341
1.43AlaTrp: 1.43 ± 0.146
3.217AlaTyr: 3.217 ± 3.386
0.0AlaXaa: 0.0 ± 0.0
Cys
1.072CysAla: 1.072 ± 0.523
0.357CysCys: 0.357 ± 0.174
1.072CysAsp: 1.072 ± 0.523
0.715CysGlu: 0.715 ± 0.348
0.715CysPhe: 0.715 ± 0.202
1.072CysGly: 1.072 ± 0.523
0.0CysHis: 0.0 ± 0.0
0.357CysIle: 0.357 ± 0.376
0.357CysLys: 0.357 ± 0.174
1.072CysLeu: 1.072 ± 0.028
0.0CysMet: 0.0 ± 0.0
0.357CysAsn: 0.357 ± 0.174
1.43CysPro: 1.43 ± 0.404
0.0CysGln: 0.0 ± 0.0
1.072CysArg: 1.072 ± 0.523
1.43CysSer: 1.43 ± 0.146
2.144CysThr: 2.144 ± 0.495
0.715CysVal: 0.715 ± 0.348
0.0CysTrp: 0.0 ± 0.0
0.357CysTyr: 0.357 ± 0.174
0.0CysXaa: 0.0 ± 0.0
Asp
2.859AspAla: 2.859 ± 0.258
1.43AspCys: 1.43 ± 0.697
5.004AspAsp: 5.004 ± 0.237
5.004AspGlu: 5.004 ± 0.787
3.574AspPhe: 3.574 ± 0.641
5.004AspGly: 5.004 ± 0.237
3.574AspHis: 3.574 ± 1.191
3.574AspIle: 3.574 ± 1.56
1.787AspLys: 1.787 ± 1.331
3.574AspLeu: 3.574 ± 0.46
0.357AspMet: 0.357 ± 0.174
1.43AspAsn: 1.43 ± 0.404
4.646AspPro: 4.646 ± 0.487
0.715AspGln: 0.715 ± 0.348
1.787AspArg: 1.787 ± 0.321
2.144AspSer: 2.144 ± 0.606
3.574AspThr: 3.574 ± 0.641
5.004AspVal: 5.004 ± 0.237
0.357AspTrp: 0.357 ± 0.174
1.787AspTyr: 1.787 ± 1.331
0.0AspXaa: 0.0 ± 0.0
Glu
3.931GluAla: 3.931 ± 0.265
0.0GluCys: 0.0 ± 0.0
2.502GluAsp: 2.502 ± 1.533
5.718GluGlu: 5.718 ± 0.585
2.502GluPhe: 2.502 ± 0.432
3.931GluGly: 3.931 ± 0.815
2.502GluHis: 2.502 ± 0.432
7.148GluIle: 7.148 ± 2.383
3.931GluLys: 3.931 ± 0.815
7.148GluLeu: 7.148 ± 0.181
2.144GluMet: 2.144 ± 0.056
2.502GluAsn: 2.502 ± 0.119
2.144GluPro: 2.144 ± 0.495
4.289GluGln: 4.289 ± 0.439
4.289GluArg: 4.289 ± 2.09
3.217GluSer: 3.217 ± 1.568
2.859GluThr: 2.859 ± 0.293
3.931GluVal: 3.931 ± 1.386
1.43GluTrp: 1.43 ± 0.404
2.502GluTyr: 2.502 ± 0.669
0.0GluXaa: 0.0 ± 0.0
Phe
5.004PheAla: 5.004 ± 0.864
0.357PheCys: 0.357 ± 0.174
1.43PheAsp: 1.43 ± 0.146
3.931PheGlu: 3.931 ± 1.386
1.072PhePhe: 1.072 ± 0.578
2.859PheGly: 2.859 ± 0.808
0.715PheHis: 0.715 ± 0.202
0.357PheIle: 0.357 ± 0.174
1.072PheLys: 1.072 ± 0.523
1.43PheLeu: 1.43 ± 0.697
1.072PheMet: 1.072 ± 0.028
3.574PheAsn: 3.574 ± 1.01
2.859PhePro: 2.859 ± 1.358
1.43PheGln: 1.43 ± 0.404
3.574PheArg: 3.574 ± 0.641
3.217PheSer: 3.217 ± 0.467
2.144PheThr: 2.144 ± 1.156
2.144PheVal: 2.144 ± 0.056
0.357PheTrp: 0.357 ± 0.376
0.357PheTyr: 0.357 ± 0.174
0.0PheXaa: 0.0 ± 0.0
Gly
3.574GlyAla: 3.574 ± 1.56
1.43GlyCys: 1.43 ± 0.697
6.791GlyAsp: 6.791 ± 2.759
5.718GlyGlu: 5.718 ± 1.616
2.859GlyPhe: 2.859 ± 0.258
3.217GlyGly: 3.217 ± 0.634
1.43GlyHis: 1.43 ± 0.697
4.646GlyIle: 4.646 ± 1.038
5.004GlyLys: 5.004 ± 1.338
5.718GlyLeu: 5.718 ± 0.585
3.217GlyMet: 3.217 ± 0.634
1.43GlyAsn: 1.43 ± 0.146
3.574GlyPro: 3.574 ± 0.46
1.43GlyGln: 1.43 ± 0.146
2.502GlyArg: 2.502 ± 0.669
5.718GlySer: 5.718 ± 1.066
5.361GlyThr: 5.361 ± 1.24
4.646GlyVal: 4.646 ± 1.164
1.43GlyTrp: 1.43 ± 0.146
2.502GlyTyr: 2.502 ± 0.432
0.0GlyXaa: 0.0 ± 0.0
His
2.144HisAla: 2.144 ± 0.056
0.715HisCys: 0.715 ± 0.348
1.072HisAsp: 1.072 ± 0.028
0.715HisGlu: 0.715 ± 0.348
1.072HisPhe: 1.072 ± 0.028
1.072HisGly: 1.072 ± 0.523
0.357HisHis: 0.357 ± 0.174
1.072HisIle: 1.072 ± 0.028
0.715HisLys: 0.715 ± 0.348
1.072HisLeu: 1.072 ± 0.523
1.072HisMet: 1.072 ± 0.523
0.715HisAsn: 0.715 ± 0.202
1.072HisPro: 1.072 ± 0.028
0.357HisGln: 0.357 ± 0.174
1.072HisArg: 1.072 ± 0.523
1.072HisSer: 1.072 ± 0.028
3.217HisThr: 3.217 ± 0.467
1.787HisVal: 1.787 ± 0.871
0.357HisTrp: 0.357 ± 0.174
1.072HisTyr: 1.072 ± 0.578
0.0HisXaa: 0.0 ± 0.0
Ile
4.646IleAla: 4.646 ± 1.164
1.072IleCys: 1.072 ± 0.523
3.574IleAsp: 3.574 ± 0.46
3.574IleGlu: 3.574 ± 0.46
2.859IlePhe: 2.859 ± 0.808
4.289IleGly: 4.289 ± 0.111
1.43IleHis: 1.43 ± 0.146
2.502IleIle: 2.502 ± 0.669
3.574IleLys: 3.574 ± 0.091
3.217IleLeu: 3.217 ± 0.083
1.072IleMet: 1.072 ± 0.253
2.144IleAsn: 2.144 ± 0.056
3.574IlePro: 3.574 ± 0.091
1.787IleGln: 1.787 ± 0.321
5.004IleArg: 5.004 ± 1.414
4.289IleSer: 4.289 ± 0.662
3.574IleThr: 3.574 ± 0.091
3.931IleVal: 3.931 ± 0.815
0.715IleTrp: 0.715 ± 0.348
1.43IleTyr: 1.43 ± 0.146
0.0IleXaa: 0.0 ± 0.0
Lys
4.289LysAla: 4.289 ± 1.54
0.357LysCys: 0.357 ± 0.174
2.502LysAsp: 2.502 ± 1.219
4.289LysGlu: 4.289 ± 0.989
2.144LysPhe: 2.144 ± 0.056
2.502LysGly: 2.502 ± 0.119
0.357LysHis: 0.357 ± 0.174
4.289LysIle: 4.289 ± 0.989
4.646LysLys: 4.646 ± 1.714
4.646LysLeu: 4.646 ± 0.613
1.43LysMet: 1.43 ± 0.146
1.787LysAsn: 1.787 ± 0.321
3.217LysPro: 3.217 ± 0.634
1.072LysGln: 1.072 ± 0.523
3.217LysArg: 3.217 ± 1.017
3.574LysSer: 3.574 ± 0.641
2.144LysThr: 2.144 ± 0.495
3.931LysVal: 3.931 ± 0.815
0.357LysTrp: 0.357 ± 0.174
2.144LysTyr: 2.144 ± 0.606
0.0LysXaa: 0.0 ± 0.0
Leu
7.505LeuAla: 7.505 ± 2.557
2.859LeuCys: 2.859 ± 0.843
7.148LeuAsp: 7.148 ± 0.369
4.289LeuGlu: 4.289 ± 0.989
2.502LeuPhe: 2.502 ± 0.432
5.361LeuGly: 5.361 ± 0.69
2.144LeuHis: 2.144 ± 0.495
3.931LeuIle: 3.931 ± 0.285
4.646LeuLys: 4.646 ± 1.714
3.931LeuLeu: 3.931 ± 1.366
1.43LeuMet: 1.43 ± 0.697
2.859LeuAsn: 2.859 ± 0.808
2.144LeuPro: 2.144 ± 0.495
1.43LeuGln: 1.43 ± 0.146
5.718LeuArg: 5.718 ± 0.035
3.574LeuSer: 3.574 ± 0.091
8.22LeuThr: 8.22 ± 0.154
6.433LeuVal: 6.433 ± 1.484
1.43LeuTrp: 1.43 ± 0.697
2.859LeuTyr: 2.859 ± 0.258
0.0LeuXaa: 0.0 ± 0.0
Met
1.787MetAla: 1.787 ± 0.23
0.0MetCys: 0.0 ± 0.0
1.43MetAsp: 1.43 ± 1.505
2.502MetGlu: 2.502 ± 1.219
0.357MetPhe: 0.357 ± 0.376
2.502MetGly: 2.502 ± 0.432
0.0MetHis: 0.0 ± 0.0
0.715MetIle: 0.715 ± 0.348
2.502MetLys: 2.502 ± 1.219
1.787MetLeu: 1.787 ± 0.871
1.072MetMet: 1.072 ± 0.028
0.0MetAsn: 0.0 ± 0.0
1.787MetPro: 1.787 ± 0.78
1.43MetGln: 1.43 ± 0.697
3.574MetArg: 3.574 ± 0.091
0.715MetSer: 0.715 ± 0.202
0.715MetThr: 0.715 ± 0.348
1.43MetVal: 1.43 ± 0.697
1.43MetTrp: 1.43 ± 0.146
1.072MetTyr: 1.072 ± 0.578
0.0MetXaa: 0.0 ± 0.0
Asn
5.718AsnAla: 5.718 ± 2.166
1.072AsnCys: 1.072 ± 0.578
1.43AsnAsp: 1.43 ± 0.404
1.787AsnGlu: 1.787 ± 0.78
1.072AsnPhe: 1.072 ± 0.523
2.859AsnGly: 2.859 ± 1.358
0.715AsnHis: 0.715 ± 0.348
1.072AsnIle: 1.072 ± 0.028
0.715AsnLys: 0.715 ± 0.202
4.646AsnLeu: 4.646 ± 1.588
2.859AsnMet: 2.859 ± 0.244
1.787AsnAsn: 1.787 ± 0.23
0.715AsnPro: 0.715 ± 0.202
1.787AsnGln: 1.787 ± 0.23
1.43AsnArg: 1.43 ± 0.146
2.859AsnSer: 2.859 ± 0.843
1.072AsnThr: 1.072 ± 0.028
3.574AsnVal: 3.574 ± 1.56
0.715AsnTrp: 0.715 ± 0.202
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.931ProAla: 3.931 ± 0.836
0.0ProCys: 0.0 ± 0.0
2.502ProAsp: 2.502 ± 0.982
3.217ProGlu: 3.217 ± 1.017
1.787ProPhe: 1.787 ± 0.78
2.502ProGly: 2.502 ± 0.982
0.715ProHis: 0.715 ± 0.348
3.931ProIle: 3.931 ± 1.937
0.715ProLys: 0.715 ± 0.202
8.22ProLeu: 8.22 ± 0.154
1.787ProMet: 1.787 ± 0.23
1.43ProAsn: 1.43 ± 0.404
2.502ProPro: 2.502 ± 0.982
0.357ProGln: 0.357 ± 0.174
1.787ProArg: 1.787 ± 0.321
3.217ProSer: 3.217 ± 1.017
3.217ProThr: 3.217 ± 1.735
5.361ProVal: 5.361 ± 1.79
0.715ProTrp: 0.715 ± 0.752
3.574ProTyr: 3.574 ± 1.56
0.0ProXaa: 0.0 ± 0.0
Gln
1.787GlnAla: 1.787 ± 0.871
0.0GlnCys: 0.0 ± 0.0
1.072GlnAsp: 1.072 ± 0.028
2.502GlnGlu: 2.502 ± 0.669
0.715GlnPhe: 0.715 ± 0.348
2.502GlnGly: 2.502 ± 0.119
0.357GlnHis: 0.357 ± 0.174
2.144GlnIle: 2.144 ± 1.045
1.43GlnLys: 1.43 ± 0.697
4.646GlnLeu: 4.646 ± 1.164
0.715GlnMet: 0.715 ± 0.348
0.715GlnAsn: 0.715 ± 0.752
1.43GlnPro: 1.43 ± 0.404
1.43GlnGln: 1.43 ± 0.146
0.715GlnArg: 0.715 ± 0.348
0.715GlnSer: 0.715 ± 0.752
1.43GlnThr: 1.43 ± 0.404
3.574GlnVal: 3.574 ± 1.01
1.43GlnTrp: 1.43 ± 0.404
1.787GlnTyr: 1.787 ± 0.321
0.0GlnXaa: 0.0 ± 0.0
Arg
3.574ArgAla: 3.574 ± 0.091
1.072ArgCys: 1.072 ± 0.578
2.502ArgAsp: 2.502 ± 1.219
5.004ArgGlu: 5.004 ± 1.338
3.574ArgPhe: 3.574 ± 0.091
4.289ArgGly: 4.289 ± 0.439
1.072ArgHis: 1.072 ± 0.028
3.931ArgIle: 3.931 ± 0.265
2.859ArgLys: 2.859 ± 1.393
5.004ArgLeu: 5.004 ± 0.787
0.0ArgMet: 0.0 ± 0.0
1.072ArgAsn: 1.072 ± 0.028
3.217ArgPro: 3.217 ± 1.184
0.715ArgGln: 0.715 ± 0.202
5.004ArgArg: 5.004 ± 0.864
3.931ArgSer: 3.931 ± 0.285
4.646ArgThr: 4.646 ± 2.264
4.289ArgVal: 4.289 ± 1.54
1.43ArgTrp: 1.43 ± 0.146
1.43ArgTyr: 1.43 ± 0.146
0.0ArgXaa: 0.0 ± 0.0
Ser
3.574SerAla: 3.574 ± 0.46
0.357SerCys: 0.357 ± 0.376
2.502SerAsp: 2.502 ± 0.432
4.289SerGlu: 4.289 ± 0.662
1.787SerPhe: 1.787 ± 0.321
9.65SerGly: 9.65 ± 0.85
1.072SerHis: 1.072 ± 0.578
2.502SerIle: 2.502 ± 1.219
3.931SerLys: 3.931 ± 0.285
4.646SerLeu: 4.646 ± 0.063
1.43SerMet: 1.43 ± 0.404
3.217SerAsn: 3.217 ± 1.184
2.502SerPro: 2.502 ± 0.982
1.43SerGln: 1.43 ± 0.697
1.43SerArg: 1.43 ± 0.404
3.217SerSer: 3.217 ± 0.634
6.076SerThr: 6.076 ± 1.442
4.289SerVal: 4.289 ± 0.111
0.357SerTrp: 0.357 ± 0.376
2.144SerTyr: 2.144 ± 0.495
0.0SerXaa: 0.0 ± 0.0
Thr
5.718ThrAla: 5.718 ± 1.066
0.0ThrCys: 0.0 ± 0.0
4.289ThrAsp: 4.289 ± 0.111
4.289ThrGlu: 4.289 ± 1.54
1.787ThrPhe: 1.787 ± 1.331
6.076ThrGly: 6.076 ± 0.209
1.072ThrHis: 1.072 ± 0.523
5.004ThrIle: 5.004 ± 0.313
3.217ThrLys: 3.217 ± 1.568
5.004ThrLeu: 5.004 ± 1.888
2.144ThrMet: 2.144 ± 0.495
3.574ThrAsn: 3.574 ± 2.111
2.859ThrPro: 2.859 ± 0.258
1.072ThrGln: 1.072 ± 0.523
4.289ThrArg: 4.289 ± 2.09
4.646ThrSer: 4.646 ± 2.139
3.574ThrThr: 3.574 ± 1.56
5.718ThrVal: 5.718 ± 1.066
0.0ThrTrp: 0.0 ± 0.0
2.502ThrTyr: 2.502 ± 0.982
0.0ThrXaa: 0.0 ± 0.0
Val
6.791ValAla: 6.791 ± 0.543
1.072ValCys: 1.072 ± 0.523
4.289ValAsp: 4.289 ± 0.662
6.076ValGlu: 6.076 ± 1.31
2.144ValPhe: 2.144 ± 0.056
6.076ValGly: 6.076 ± 0.892
2.859ValHis: 2.859 ± 0.843
2.502ValIle: 2.502 ± 0.669
3.574ValLys: 3.574 ± 0.091
4.646ValLeu: 4.646 ± 0.613
0.715ValMet: 0.715 ± 0.348
3.574ValAsn: 3.574 ± 0.641
5.361ValPro: 5.361 ± 2.891
2.859ValGln: 2.859 ± 0.808
4.289ValArg: 4.289 ± 0.989
6.433ValSer: 6.433 ± 1.268
1.787ValThr: 1.787 ± 0.321
5.004ValVal: 5.004 ± 1.338
1.787ValTrp: 1.787 ± 0.78
2.502ValTyr: 2.502 ± 0.669
0.0ValXaa: 0.0 ± 0.0
Trp
0.357TrpAla: 0.357 ± 0.174
0.357TrpCys: 0.357 ± 0.174
1.072TrpAsp: 1.072 ± 0.028
0.0TrpGlu: 0.0 ± 0.0
0.715TrpPhe: 0.715 ± 0.202
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.43TrpIle: 1.43 ± 0.404
1.43TrpLys: 1.43 ± 0.404
2.502TrpLeu: 2.502 ± 0.119
0.715TrpMet: 0.715 ± 0.202
1.072TrpAsn: 1.072 ± 0.028
0.357TrpPro: 0.357 ± 0.174
0.715TrpGln: 0.715 ± 0.202
0.715TrpArg: 0.715 ± 0.752
1.43TrpSer: 1.43 ± 0.146
2.859TrpThr: 2.859 ± 0.293
0.715TrpVal: 0.715 ± 0.202
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.502TyrAla: 2.502 ± 1.533
1.072TyrCys: 1.072 ± 0.028
2.144TyrAsp: 2.144 ± 0.606
1.072TyrGlu: 1.072 ± 0.028
2.859TyrPhe: 2.859 ± 0.808
1.43TyrGly: 1.43 ± 0.146
0.357TyrHis: 0.357 ± 0.376
1.787TyrIle: 1.787 ± 0.78
2.859TyrLys: 2.859 ± 0.293
2.859TyrLeu: 2.859 ± 0.258
1.072TyrMet: 1.072 ± 0.523
2.502TyrAsn: 2.502 ± 0.982
1.787TyrPro: 1.787 ± 0.23
0.715TyrGln: 0.715 ± 0.348
2.502TyrArg: 2.502 ± 0.119
1.43TyrSer: 1.43 ± 0.954
1.787TyrThr: 1.787 ± 0.23
1.787TyrVal: 1.787 ± 1.331
0.715TyrTrp: 0.715 ± 0.202
1.072TyrTyr: 1.072 ± 1.129
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2799 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski