Amino acid dipeptide frequency for Beihai picorna-like virus 90

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred in proteins with equal probability, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
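The counting described above can be sketched in a few lines of Python. This is only an illustration: the exact counting convention used by Proteome-pI is not stated on this page, so the sketch assumes overlapping windows of two residues that never span two proteins, one-letter codes as input, and normalization by the total number of dipeptides. The function name `dipeptide_frequencies` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; anything else is folded into "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: dipeptides are counted as overlapping windows inside each
    protein and divided by the total number of dipeptides observed.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, never spanning two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    pairs = [a + b for a, b in product(ALPHABET, repeat=2)]
    if total == 0:
        return {pair: 0.0 for pair in pairs}
    return {pair: 1000.0 * counts[pair] / total for pair in pairs}


# Toy input, not the real Beihai picorna-like virus 90 sequences.
toy = ["MKKLA", "AAB"]
freqs = dipeptide_frequencies(toy)
print(len(freqs))      # number of ordered pairs, including Xaa
print(freqs["KK"])     # frequency of Lys-Lys in per mille
print(1000.0 / 400)    # uniform expectation in per mille
```

Keeping every pair in the result, including those with a count of zero, mirrors the table, which lists all 441 combinations.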

All values are presented in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
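As a small worked example of the unit conversion and the comparison against the uniform expectation, the sketch below converts per mille values to fractions and labels them as over- or underrepresented. The three values are taken from the table; the helper names and the assumption that the colour threshold is exactly the uniform expectation of 2.5‰ are illustrative and not confirmed by this page.

```python
# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Values copied from the table (per mille).
for name, value in [("LeuSer", 10.059), ("AlaGlu", 2.775), ("CysCys", 0.0)]:
    print(name, per_mille_to_fraction(value), classify(value))
```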

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.815 ± 0.336
AlaCys: 1.387 ± 0.684
AlaAsp: 3.122 ± 1.231
AlaGlu: 2.775 ± 0.295
AlaPhe: 3.122 ± 0.984
AlaGly: 4.162 ± 1.273
AlaHis: 1.734 ± 0.301
AlaIle: 2.428 ± 0.088
AlaLys: 4.509 ± 0.548
AlaLeu: 6.59 ± 0.63
AlaMet: 1.041 ± 0.513
AlaAsn: 2.081 ± 0.082
AlaPro: 4.162 ± 0.719
AlaGln: 2.428 ± 0.088
AlaArg: 3.469 ± 1.06
AlaSer: 7.978 ± 0.501
AlaThr: 5.203 ± 0.76
AlaVal: 5.897 ± 0.972
AlaTrp: 0.347 ± 0.383
AlaTyr: 1.387 ± 0.424
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.347 ± 0.171
CysCys: 0.0 ± 0.0
CysAsp: 0.347 ± 0.171
CysGlu: 1.387 ± 0.684
CysPhe: 0.694 ± 0.342
CysGly: 0.347 ± 0.171
CysHis: 0.0 ± 0.0
CysIle: 1.041 ± 0.041
CysLys: 1.387 ± 0.684
CysLeu: 0.347 ± 0.171
CysMet: 1.041 ± 0.513
CysAsn: 0.694 ± 0.342
CysPro: 0.694 ± 0.212
CysGln: 0.694 ± 0.342
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 1.041 ± 0.513
CysVal: 1.041 ± 0.041
CysTrp: 0.0 ± 0.0
CysTyr: 0.347 ± 0.171
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.122 ± 0.43
AspCys: 0.694 ± 0.342
AspAsp: 2.775 ± 0.848
AspGlu: 2.428 ± 0.088
AspPhe: 3.815 ± 0.218
AspGly: 3.815 ± 0.336
AspHis: 1.734 ± 0.253
AspIle: 4.509 ± 0.56
AspLys: 2.428 ± 0.642
AspLeu: 3.815 ± 0.89
AspMet: 2.428 ± 0.088
AspAsn: 2.081 ± 1.025
AspPro: 1.734 ± 0.253
AspGln: 1.387 ± 0.978
AspArg: 2.081 ± 0.471
AspSer: 2.775 ± 0.848
AspThr: 5.55 ± 0.035
AspVal: 4.162 ± 0.389
AspTrp: 0.347 ± 0.171
AspTyr: 2.081 ± 0.471
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.428 ± 0.642
GluCys: 1.387 ± 0.684
GluAsp: 4.509 ± 0.548
GluGlu: 4.856 ± 0.731
GluPhe: 2.428 ± 0.642
GluGly: 6.243 ± 0.307
GluHis: 1.041 ± 0.595
GluIle: 4.509 ± 0.56
GluLys: 3.122 ± 0.124
GluLeu: 4.856 ± 0.931
GluMet: 2.428 ± 0.088
GluAsn: 2.081 ± 0.471
GluPro: 1.387 ± 0.13
GluGln: 0.694 ± 0.342
GluArg: 2.081 ± 0.471
GluSer: 2.081 ± 1.025
GluThr: 3.122 ± 0.677
GluVal: 4.509 ± 0.006
GluTrp: 0.694 ± 0.342
GluTyr: 3.122 ± 1.538
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.734 ± 0.301
PheCys: 0.0 ± 0.0
PheAsp: 3.469 ± 1.155
PheGlu: 1.734 ± 0.301
PhePhe: 0.347 ± 0.171
PheGly: 2.428 ± 0.088
PheHis: 1.387 ± 0.424
PheIle: 3.815 ± 1.326
PheLys: 2.081 ± 0.636
PheLeu: 3.815 ± 0.218
PheMet: 1.041 ± 0.595
PheAsn: 1.734 ± 0.301
PhePro: 0.694 ± 0.342
PheGln: 2.081 ± 0.082
PheArg: 3.469 ± 0.507
PheSer: 2.428 ± 0.465
PheThr: 4.162 ± 0.165
PheVal: 3.122 ± 1.231
PheTrp: 0.347 ± 0.171
PheTyr: 1.041 ± 0.041
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.775 ± 1.956
GlyCys: 0.694 ± 0.212
GlyAsp: 3.122 ± 0.677
GlyGlu: 4.856 ± 0.177
GlyPhe: 2.428 ± 0.465
GlyGly: 1.734 ± 0.301
GlyHis: 1.041 ± 0.041
GlyIle: 4.856 ± 1.485
GlyLys: 4.509 ± 0.56
GlyLeu: 4.162 ± 1.497
GlyMet: 1.387 ± 0.14
GlyAsn: 2.428 ± 0.088
GlyPro: 0.694 ± 0.342
GlyGln: 1.041 ± 0.041
GlyArg: 2.428 ± 0.465
GlySer: 4.856 ± 2.039
GlyThr: 3.122 ± 0.984
GlyVal: 5.897 ± 1.526
GlyTrp: 0.694 ± 0.342
GlyTyr: 1.041 ± 0.513
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.041 ± 0.595
HisCys: 0.347 ± 0.171
HisAsp: 0.0 ± 0.0
HisGlu: 1.387 ± 0.424
HisPhe: 1.734 ± 0.807
HisGly: 1.734 ± 0.301
HisHis: 0.694 ± 0.212
HisIle: 1.387 ± 0.424
HisLys: 0.694 ± 0.342
HisLeu: 3.122 ± 0.124
HisMet: 0.0 ± 0.0
HisAsn: 1.734 ± 0.301
HisPro: 1.734 ± 0.253
HisGln: 0.694 ± 0.766
HisArg: 0.0 ± 0.0
HisSer: 1.734 ± 0.253
HisThr: 1.734 ± 0.807
HisVal: 1.387 ± 0.684
HisTrp: 0.347 ± 0.383
HisTyr: 0.694 ± 0.342
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.897 ± 0.972
IleCys: 0.347 ± 0.171
IleAsp: 5.203 ± 0.76
IleGlu: 3.815 ± 0.772
IlePhe: 2.081 ± 0.471
IleGly: 2.428 ± 0.088
IleHis: 0.0 ± 0.0
IleIle: 2.775 ± 0.259
IleLys: 5.897 ± 1.797
IleLeu: 4.856 ± 2.392
IleMet: 0.347 ± 0.171
IleAsn: 3.815 ± 0.772
IlePro: 3.122 ± 0.124
IleGln: 0.694 ± 0.212
IleArg: 2.081 ± 0.082
IleSer: 5.55 ± 0.589
IleThr: 5.55 ± 1.626
IleVal: 5.897 ± 0.136
IleTrp: 0.0 ± 0.0
IleTyr: 3.122 ± 0.43
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.815 ± 0.218
LysCys: 0.694 ± 0.342
LysAsp: 3.122 ± 0.984
LysGlu: 3.122 ± 1.538
LysPhe: 2.428 ± 0.088
LysGly: 3.122 ± 0.124
LysHis: 2.775 ± 0.295
LysIle: 4.509 ± 1.668
LysLys: 4.509 ± 2.222
LysLeu: 4.509 ± 1.114
LysMet: 1.387 ± 0.684
LysAsn: 5.203 ± 1.456
LysPro: 1.734 ± 1.361
LysGln: 1.041 ± 0.513
LysArg: 2.428 ± 0.088
LysSer: 2.775 ± 1.367
LysThr: 4.856 ± 0.731
LysVal: 4.856 ± 0.377
LysTrp: 1.387 ± 0.424
LysTyr: 1.734 ± 0.253
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.937 ± 0.095
LeuCys: 1.387 ± 0.684
LeuAsp: 3.815 ± 1.88
LeuGlu: 6.243 ± 0.801
LeuPhe: 3.815 ± 1.326
LeuGly: 2.775 ± 0.259
LeuHis: 2.428 ± 0.088
LeuIle: 4.856 ± 1.839
LeuLys: 4.509 ± 1.114
LeuLeu: 6.59 ± 1.585
LeuMet: 2.081 ± 0.471
LeuAsn: 5.203 ± 0.206
LeuPro: 4.509 ± 1.102
LeuGln: 3.469 ± 0.601
LeuArg: 5.897 ± 0.418
LeuSer: 10.059 ± 2.798
LeuThr: 3.122 ± 0.984
LeuVal: 4.509 ± 0.56
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.775 ± 0.848
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.428 ± 0.642
MetCys: 0.347 ± 0.171
MetAsp: 1.387 ± 0.13
MetGlu: 1.734 ± 0.807
MetPhe: 1.041 ± 0.041
MetGly: 0.694 ± 0.212
MetHis: 0.347 ± 0.383
MetIle: 0.694 ± 0.212
MetLys: 1.041 ± 0.041
MetLeu: 2.775 ± 1.367
MetMet: 0.0 ± 0.0
MetAsn: 0.694 ± 0.342
MetPro: 1.734 ± 0.253
MetGln: 0.347 ± 0.171
MetArg: 1.387 ± 0.13
MetSer: 2.428 ± 0.642
MetThr: 2.428 ± 1.019
MetVal: 3.469 ± 0.507
MetTrp: 1.041 ± 0.041
MetTyr: 1.734 ± 0.854
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.509 ± 0.006
AsnCys: 0.694 ± 0.342
AsnAsp: 2.775 ± 0.813
AsnGlu: 1.734 ± 0.301
AsnPhe: 2.428 ± 1.196
AsnGly: 2.081 ± 0.471
AsnHis: 2.081 ± 0.082
AsnIle: 5.897 ± 2.351
AsnLys: 4.162 ± 0.943
AsnLeu: 4.162 ± 1.273
AsnMet: 3.469 ± 0.047
AsnAsn: 2.428 ± 1.019
AsnPro: 1.734 ± 0.807
AsnGln: 1.387 ± 0.684
AsnArg: 1.734 ± 0.854
AsnSer: 1.387 ± 0.13
AsnThr: 3.122 ± 0.124
AsnVal: 4.856 ± 0.177
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.041 ± 0.513
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.509 ± 1.102
ProCys: 0.694 ± 0.212
ProAsp: 2.081 ± 0.082
ProGlu: 1.734 ± 0.854
ProPhe: 1.041 ± 1.149
ProGly: 3.122 ± 1.231
ProHis: 0.0 ± 0.0
ProIle: 2.081 ± 0.636
ProLys: 1.734 ± 0.301
ProLeu: 4.162 ± 1.497
ProMet: 2.775 ± 0.295
ProAsn: 3.815 ± 0.336
ProPro: 1.041 ± 1.149
ProGln: 0.0 ± 0.0
ProArg: 1.387 ± 0.13
ProSer: 4.509 ± 0.006
ProThr: 4.162 ± 2.38
ProVal: 2.428 ± 1.573
ProTrp: 1.041 ± 0.595
ProTyr: 2.775 ± 0.295
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.734 ± 0.301
GlnCys: 0.0 ± 0.0
GlnAsp: 1.387 ± 0.13
GlnGlu: 0.694 ± 0.342
GlnPhe: 1.734 ± 1.361
GlnGly: 1.734 ± 1.361
GlnHis: 1.041 ± 0.595
GlnIle: 2.428 ± 0.642
GlnLys: 1.041 ± 0.513
GlnLeu: 3.122 ± 0.124
GlnMet: 1.734 ± 0.17
GlnAsn: 1.041 ± 0.513
GlnPro: 2.428 ± 0.642
GlnGln: 2.775 ± 0.295
GlnArg: 1.387 ± 0.424
GlnSer: 2.428 ± 1.019
GlnThr: 3.469 ± 1.155
GlnVal: 1.734 ± 0.301
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.387 ± 0.424
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.122 ± 0.677
ArgCys: 0.0 ± 0.0
ArgAsp: 2.081 ± 0.471
ArgGlu: 2.081 ± 0.082
ArgPhe: 2.081 ± 0.636
ArgGly: 2.428 ± 0.465
ArgHis: 0.694 ± 0.342
ArgIle: 1.734 ± 0.854
ArgLys: 2.428 ± 0.642
ArgLeu: 4.162 ± 2.38
ArgMet: 1.387 ± 0.684
ArgAsn: 2.775 ± 1.367
ArgPro: 2.428 ± 1.019
ArgGln: 2.428 ± 0.465
ArgArg: 3.469 ± 1.155
ArgSer: 1.734 ± 0.807
ArgThr: 3.122 ± 1.538
ArgVal: 4.162 ± 0.943
ArgTrp: 0.694 ± 0.766
ArgTyr: 1.734 ± 0.301
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.509 ± 1.102
SerCys: 0.347 ± 0.171
SerAsp: 1.041 ± 0.041
SerGlu: 3.122 ± 0.124
SerPhe: 2.775 ± 0.295
SerGly: 4.162 ± 1.826
SerHis: 1.387 ± 0.978
SerIle: 4.509 ± 0.006
SerLys: 4.856 ± 0.931
SerLeu: 7.631 ± 0.99
SerMet: 0.694 ± 0.766
SerAsn: 4.856 ± 0.177
SerPro: 4.856 ± 0.931
SerGln: 4.162 ± 0.165
SerArg: 3.122 ± 0.124
SerSer: 5.55 ± 1.143
SerThr: 4.856 ± 0.177
SerVal: 7.978 ± 1.608
SerTrp: 1.734 ± 0.253
SerTyr: 3.469 ± 0.507
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.55 ± 0.035
ThrCys: 0.694 ± 0.342
ThrAsp: 4.162 ± 0.719
ThrGlu: 4.509 ± 0.006
ThrPhe: 2.081 ± 0.471
ThrGly: 4.509 ± 1.114
ThrHis: 0.347 ± 0.171
ThrIle: 3.815 ± 0.336
ThrLys: 4.509 ± 1.668
ThrLeu: 5.897 ± 0.69
ThrMet: 0.694 ± 0.212
ThrAsn: 3.469 ± 0.047
ThrPro: 3.469 ± 0.507
ThrGln: 4.509 ± 0.548
ThrArg: 3.815 ± 0.772
ThrSer: 6.59 ± 0.076
ThrThr: 7.631 ± 0.118
ThrVal: 5.897 ± 2.08
ThrTrp: 1.387 ± 0.978
ThrTyr: 2.428 ± 0.642
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.59 ± 3.4
ValCys: 0.694 ± 0.342
ValAsp: 5.203 ± 1.314
ValGlu: 7.978 ± 1.715
ValPhe: 2.428 ± 0.465
ValGly: 4.856 ± 0.377
ValHis: 3.122 ± 0.124
ValIle: 4.162 ± 0.165
ValLys: 4.509 ± 0.56
ValLeu: 5.897 ± 0.418
ValMet: 1.734 ± 0.253
ValAsn: 3.122 ± 0.124
ValPro: 5.55 ± 0.035
ValGln: 1.734 ± 0.807
ValArg: 1.387 ± 0.684
ValSer: 5.897 ± 0.972
ValThr: 5.55 ± 0.589
ValVal: 7.284 ± 2.504
ValTrp: 0.694 ± 0.342
ValTyr: 3.122 ± 0.124
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.694 ± 0.342
TrpCys: 0.0 ± 0.0
TrpAsp: 0.694 ± 0.212
TrpGlu: 0.347 ± 0.383
TrpPhe: 0.347 ± 0.171
TrpGly: 0.347 ± 0.171
TrpHis: 0.347 ± 0.171
TrpIle: 0.694 ± 0.212
TrpLys: 0.0 ± 0.0
TrpLeu: 1.041 ± 0.041
TrpMet: 0.694 ± 0.212
TrpAsn: 1.041 ± 0.041
TrpPro: 0.0 ± 0.0
TrpGln: 0.347 ± 0.383
TrpArg: 1.734 ± 0.807
TrpSer: 1.041 ± 0.595
TrpThr: 1.041 ± 0.041
TrpVal: 0.347 ± 0.171
TrpTrp: 0.347 ± 0.383
TrpTyr: 0.694 ± 0.212
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.081 ± 0.471
TyrCys: 1.041 ± 0.041
TyrAsp: 3.469 ± 1.155
TyrGlu: 1.041 ± 0.513
TyrPhe: 2.081 ± 0.636
TyrGly: 1.041 ± 0.041
TyrHis: 0.0 ± 0.0
TyrIle: 2.775 ± 0.295
TyrLys: 2.081 ± 0.471
TyrLeu: 3.469 ± 0.601
TyrMet: 1.041 ± 0.041
TyrAsn: 1.387 ± 0.13
TyrPro: 1.387 ± 0.978
TyrGln: 1.734 ± 0.854
TyrArg: 1.387 ± 0.13
TyrSer: 3.815 ± 0.336
TyrThr: 3.122 ± 0.677
TyrVal: 2.081 ± 1.025
TyrTrp: 0.694 ± 0.342
TyrTyr: 1.041 ± 0.513
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2884 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
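A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. Which spread measure Proteome-pI uses is not stated here, so the standard deviation is an assumption, as are the function names, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 inside each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the reported error is the standard deviation of the
    bootstrap replicates; the page does not say which statistic is used.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.pstdev(estimates)


# Toy input, not the real Beihai picorna-like virus 90 sequences.
toy = ["MKKLAKK", "AALLKA"]
print(dipeptide_per_mille(toy, "KK"), bootstrap_error(toy, "KK"))
```

With only two proteins, as in this proteome, each replicate can only be one of a handful of protein combinations, so the resulting error estimates are coarse.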

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski