Amino acid dipepetide frequency for Beihai picorna-like virus 24

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.165AlaAla: 4.165 ± 0.085
1.515AlaCys: 1.515 ± 0.869
4.165AlaAsp: 4.165 ± 1.153
3.408AlaGlu: 3.408 ± 0.099
2.651AlaPhe: 2.651 ± 0.335
4.922AlaGly: 4.922 ± 0.888
1.136AlaHis: 1.136 ± 0.586
2.272AlaIle: 2.272 ± 1.79
3.408AlaLys: 3.408 ± 0.519
4.165AlaLeu: 4.165 ± 1.771
0.757AlaMet: 0.757 ± 0.703
3.786AlaAsn: 3.786 ± 0.921
3.408AlaPro: 3.408 ± 0.099
2.651AlaGln: 2.651 ± 0.902
3.786AlaArg: 3.786 ± 0.302
4.165AlaSer: 4.165 ± 1.322
5.301AlaThr: 5.301 ± 1.289
5.68AlaVal: 5.68 ± 0.453
1.515AlaTrp: 1.515 ± 0.368
1.893AlaTyr: 1.893 ± 0.468
0.0AlaXaa: 0.0 ± 0.0
Cys
0.757CysAla: 0.757 ± 0.184
0.0CysCys: 0.0 ± 0.0
0.379CysAsp: 0.379 ± 0.217
0.379CysGlu: 0.379 ± 0.217
0.379CysPhe: 0.379 ± 0.217
1.893CysGly: 1.893 ± 0.468
0.379CysHis: 0.379 ± 0.401
0.379CysIle: 0.379 ± 0.217
0.0CysLys: 0.0 ± 0.0
1.136CysLeu: 1.136 ± 0.652
1.136CysMet: 1.136 ± 0.652
0.379CysAsn: 0.379 ± 0.217
0.0CysPro: 0.0 ± 0.0
0.379CysGln: 0.379 ± 0.217
0.379CysArg: 0.379 ± 0.217
2.272CysSer: 2.272 ± 0.066
0.379CysThr: 0.379 ± 0.217
1.136CysVal: 1.136 ± 0.652
0.0CysTrp: 0.0 ± 0.0
0.757CysTyr: 0.757 ± 0.435
0.0CysXaa: 0.0 ± 0.0
Asp
3.408AspAla: 3.408 ± 0.099
0.757AspCys: 0.757 ± 0.435
4.544AspAsp: 4.544 ± 1.37
3.029AspGlu: 3.029 ± 1.355
3.786AspPhe: 3.786 ± 0.317
3.029AspGly: 3.029 ± 0.501
2.272AspHis: 2.272 ± 0.066
4.544AspIle: 4.544 ± 0.486
4.544AspLys: 4.544 ± 1.989
4.922AspLeu: 4.922 ± 0.35
3.029AspMet: 3.029 ± 0.118
1.515AspAsn: 1.515 ± 0.869
3.408AspPro: 3.408 ± 0.099
3.029AspGln: 3.029 ± 0.118
1.136AspArg: 1.136 ± 0.586
4.544AspSer: 4.544 ± 0.751
3.029AspThr: 3.029 ± 0.501
3.029AspVal: 3.029 ± 1.12
1.136AspTrp: 1.136 ± 0.652
2.651AspTyr: 2.651 ± 0.284
0.0AspXaa: 0.0 ± 0.0
Glu
3.029GluAla: 3.029 ± 0.118
0.757GluCys: 0.757 ± 0.435
4.544GluAsp: 4.544 ± 1.105
3.786GluGlu: 3.786 ± 0.935
2.272GluPhe: 2.272 ± 1.79
1.515GluGly: 1.515 ± 0.368
1.136GluHis: 1.136 ± 0.033
4.922GluIle: 4.922 ± 0.969
3.029GluLys: 3.029 ± 1.12
7.194GluLeu: 7.194 ± 2.059
1.893GluMet: 1.893 ± 0.151
1.893GluAsn: 1.893 ± 0.468
3.029GluPro: 3.029 ± 0.737
3.029GluGln: 3.029 ± 1.12
3.029GluArg: 3.029 ± 1.12
2.272GluSer: 2.272 ± 0.685
3.408GluThr: 3.408 ± 1.138
3.029GluVal: 3.029 ± 0.737
1.515GluTrp: 1.515 ± 0.368
2.651GluTyr: 2.651 ± 0.284
0.0GluXaa: 0.0 ± 0.0
Phe
3.029PheAla: 3.029 ± 0.501
0.379PheCys: 0.379 ± 0.217
3.408PheAsp: 3.408 ± 0.718
2.272PheGlu: 2.272 ± 0.685
2.272PhePhe: 2.272 ± 0.066
1.893PheGly: 1.893 ± 0.77
1.515PheHis: 1.515 ± 0.25
1.515PheIle: 1.515 ± 0.25
4.922PheLys: 4.922 ± 0.969
4.165PheLeu: 4.165 ± 0.703
1.136PheMet: 1.136 ± 0.652
2.272PheAsn: 2.272 ± 0.552
2.651PhePro: 2.651 ± 0.284
3.408PheGln: 3.408 ± 1.138
1.136PheArg: 1.136 ± 0.033
3.029PheSer: 3.029 ± 0.118
2.272PheThr: 2.272 ± 0.066
2.651PheVal: 2.651 ± 0.335
1.136PheTrp: 1.136 ± 0.652
3.029PheTyr: 3.029 ± 0.501
0.0PheXaa: 0.0 ± 0.0
Gly
3.408GlyAla: 3.408 ± 1.757
0.379GlyCys: 0.379 ± 0.401
3.786GlyAsp: 3.786 ± 1.554
4.922GlyGlu: 4.922 ± 2.744
3.786GlyPhe: 3.786 ± 0.921
4.922GlyGly: 4.922 ± 0.888
0.757GlyHis: 0.757 ± 0.803
6.058GlyIle: 6.058 ± 2.092
6.058GlyLys: 6.058 ± 1.62
2.272GlyLeu: 2.272 ± 0.685
0.757GlyMet: 0.757 ± 0.184
2.272GlyAsn: 2.272 ± 0.552
3.029GlyPro: 3.029 ± 1.355
1.893GlyGln: 1.893 ± 0.151
2.651GlyArg: 2.651 ± 1.573
3.786GlySer: 3.786 ± 0.921
3.408GlyThr: 3.408 ± 2.375
4.544GlyVal: 4.544 ± 0.133
0.379GlyTrp: 0.379 ± 0.217
1.893GlyTyr: 1.893 ± 0.151
0.0GlyXaa: 0.0 ± 0.0
His
2.272HisAla: 2.272 ± 0.552
1.515HisCys: 1.515 ± 0.25
0.757HisAsp: 0.757 ± 0.184
1.136HisGlu: 1.136 ± 0.652
1.136HisPhe: 1.136 ± 0.652
1.515HisGly: 1.515 ± 0.368
0.379HisHis: 0.379 ± 0.217
1.515HisIle: 1.515 ± 0.869
0.379HisLys: 0.379 ± 0.217
3.408HisLeu: 3.408 ± 1.337
0.379HisMet: 0.379 ± 0.217
1.515HisAsn: 1.515 ± 1.606
0.757HisPro: 0.757 ± 0.435
0.379HisGln: 0.379 ± 0.401
0.379HisArg: 0.379 ± 0.401
1.136HisSer: 1.136 ± 0.586
1.136HisThr: 1.136 ± 0.586
2.272HisVal: 2.272 ± 0.066
0.0HisTrp: 0.0 ± 0.0
1.136HisTyr: 1.136 ± 0.586
0.0HisXaa: 0.0 ± 0.0
Ile
6.437IleAla: 6.437 ± 1.838
1.893IleCys: 1.893 ± 0.468
2.272IleAsp: 2.272 ± 1.171
4.165IleGlu: 4.165 ± 1.771
2.651IlePhe: 2.651 ± 0.902
3.786IleGly: 3.786 ± 2.158
0.0IleHis: 0.0 ± 0.0
2.272IleIle: 2.272 ± 0.552
3.408IleLys: 3.408 ± 0.099
3.786IleLeu: 3.786 ± 1.539
0.757IleMet: 0.757 ± 0.435
6.816IleAsn: 6.816 ± 0.42
1.515IlePro: 1.515 ± 0.869
1.136IleGln: 1.136 ± 0.652
2.651IleArg: 2.651 ± 0.335
4.922IleSer: 4.922 ± 0.35
3.786IleThr: 3.786 ± 0.921
3.786IleVal: 3.786 ± 0.317
0.757IleTrp: 0.757 ± 0.435
2.651IleTyr: 2.651 ± 0.335
0.0IleXaa: 0.0 ± 0.0
Lys
5.301LysAla: 5.301 ± 0.052
0.0LysCys: 0.0 ± 0.0
6.437LysAsp: 6.437 ± 2.456
4.922LysGlu: 4.922 ± 2.206
3.408LysPhe: 3.408 ± 0.718
2.272LysGly: 2.272 ± 0.066
2.272LysHis: 2.272 ± 0.066
3.029LysIle: 3.029 ± 1.12
4.922LysLys: 4.922 ± 1.587
4.922LysLeu: 4.922 ± 1.587
1.136LysMet: 1.136 ± 0.652
2.651LysAsn: 2.651 ± 0.284
3.029LysPro: 3.029 ± 0.118
2.651LysGln: 2.651 ± 0.284
2.272LysArg: 2.272 ± 0.685
4.922LysSer: 4.922 ± 1.587
2.651LysThr: 2.651 ± 0.284
1.893LysVal: 1.893 ± 0.468
0.379LysTrp: 0.379 ± 0.217
3.786LysTyr: 3.786 ± 0.921
0.0LysXaa: 0.0 ± 0.0
Leu
4.544LeuAla: 4.544 ± 0.133
1.515LeuCys: 1.515 ± 0.869
5.68LeuAsp: 5.68 ± 0.453
4.544LeuGlu: 4.544 ± 0.486
2.272LeuPhe: 2.272 ± 1.304
2.651LeuGly: 2.651 ± 0.284
2.272LeuHis: 2.272 ± 0.066
3.786LeuIle: 3.786 ± 0.302
5.68LeuLys: 5.68 ± 2.022
6.816LeuLeu: 6.816 ± 1.039
1.136LeuMet: 1.136 ± 0.652
2.651LeuAsn: 2.651 ± 0.954
3.408LeuPro: 3.408 ± 1.138
2.272LeuGln: 2.272 ± 0.066
4.922LeuArg: 4.922 ± 0.969
8.33LeuSer: 8.33 ± 0.449
7.194LeuThr: 7.194 ± 0.203
4.544LeuVal: 4.544 ± 0.133
0.757LeuTrp: 0.757 ± 0.184
2.651LeuTyr: 2.651 ± 1.521
0.0LeuXaa: 0.0 ± 0.0
Met
2.651MetAla: 2.651 ± 0.954
0.379MetCys: 0.379 ± 0.217
1.136MetAsp: 1.136 ± 0.652
2.651MetGlu: 2.651 ± 0.335
1.515MetPhe: 1.515 ± 0.25
0.757MetGly: 0.757 ± 0.184
0.757MetHis: 0.757 ± 0.803
1.893MetIle: 1.893 ± 0.151
2.272MetLys: 2.272 ± 1.304
1.515MetLeu: 1.515 ± 0.869
1.136MetMet: 1.136 ± 0.652
0.379MetAsn: 0.379 ± 0.217
0.757MetPro: 0.757 ± 0.184
0.379MetGln: 0.379 ± 0.217
1.136MetArg: 1.136 ± 0.033
3.029MetSer: 3.029 ± 1.12
1.515MetThr: 1.515 ± 0.368
0.757MetVal: 0.757 ± 0.435
0.0MetTrp: 0.0 ± 0.0
1.515MetTyr: 1.515 ± 0.368
0.0MetXaa: 0.0 ± 0.0
Asn
1.515AsnAla: 1.515 ± 0.368
0.757AsnCys: 0.757 ± 0.184
2.272AsnAsp: 2.272 ± 0.066
1.515AsnGlu: 1.515 ± 0.368
2.272AsnPhe: 2.272 ± 0.685
4.922AsnGly: 4.922 ± 2.744
1.515AsnHis: 1.515 ± 0.368
3.786AsnIle: 3.786 ± 0.921
4.544AsnLys: 4.544 ± 1.37
4.165AsnLeu: 4.165 ± 0.534
0.757AsnMet: 0.757 ± 0.435
2.272AsnAsn: 2.272 ± 1.79
2.272AsnPro: 2.272 ± 0.552
0.757AsnGln: 0.757 ± 0.184
1.515AsnArg: 1.515 ± 0.368
4.544AsnSer: 4.544 ± 1.105
3.029AsnThr: 3.029 ± 0.118
2.651AsnVal: 2.651 ± 0.335
1.893AsnTrp: 1.893 ± 0.151
3.408AsnTyr: 3.408 ± 0.519
0.0AsnXaa: 0.0 ± 0.0
Pro
3.029ProAla: 3.029 ± 0.737
0.379ProCys: 0.379 ± 0.401
3.786ProAsp: 3.786 ± 0.921
1.893ProGlu: 1.893 ± 0.468
2.272ProPhe: 2.272 ± 0.552
1.893ProGly: 1.893 ± 0.468
0.757ProHis: 0.757 ± 0.435
3.408ProIle: 3.408 ± 0.519
1.893ProLys: 1.893 ± 0.468
3.786ProLeu: 3.786 ± 0.317
1.136ProMet: 1.136 ± 0.033
1.893ProAsn: 1.893 ± 0.151
0.379ProPro: 0.379 ± 0.217
1.893ProGln: 1.893 ± 1.086
2.272ProArg: 2.272 ± 0.066
2.651ProSer: 2.651 ± 0.954
4.165ProThr: 4.165 ± 0.085
1.893ProVal: 1.893 ± 0.468
0.757ProTrp: 0.757 ± 0.184
1.893ProTyr: 1.893 ± 0.77
0.0ProXaa: 0.0 ± 0.0
Gln
3.029GlnAla: 3.029 ± 1.12
0.757GlnCys: 0.757 ± 0.435
1.136GlnAsp: 1.136 ± 0.652
3.029GlnGlu: 3.029 ± 1.355
1.893GlnPhe: 1.893 ± 1.086
2.272GlnGly: 2.272 ± 0.066
1.893GlnHis: 1.893 ± 0.151
1.136GlnIle: 1.136 ± 0.586
1.515GlnLys: 1.515 ± 0.869
2.651GlnLeu: 2.651 ± 0.284
1.136GlnMet: 1.136 ± 1.013
2.651GlnAsn: 2.651 ± 0.902
0.379GlnPro: 0.379 ± 0.217
1.136GlnGln: 1.136 ± 0.586
3.029GlnArg: 3.029 ± 0.118
6.058GlnSer: 6.058 ± 0.854
3.029GlnThr: 3.029 ± 1.355
2.272GlnVal: 2.272 ± 0.685
1.136GlnTrp: 1.136 ± 0.033
1.136GlnTyr: 1.136 ± 0.586
0.0GlnXaa: 0.0 ± 0.0
Arg
2.651ArgAla: 2.651 ± 0.335
0.379ArgCys: 0.379 ± 0.217
2.272ArgAsp: 2.272 ± 0.066
1.893ArgGlu: 1.893 ± 0.468
1.893ArgPhe: 1.893 ± 0.468
4.922ArgGly: 4.922 ± 2.744
0.757ArgHis: 0.757 ± 0.184
3.408ArgIle: 3.408 ± 0.099
2.651ArgLys: 2.651 ± 0.902
3.029ArgLeu: 3.029 ± 0.737
1.136ArgMet: 1.136 ± 0.652
3.786ArgAsn: 3.786 ± 0.935
1.893ArgPro: 1.893 ± 0.77
2.651ArgGln: 2.651 ± 0.954
3.408ArgArg: 3.408 ± 0.718
3.786ArgSer: 3.786 ± 0.935
2.651ArgThr: 2.651 ± 0.284
1.893ArgVal: 1.893 ± 0.468
0.379ArgTrp: 0.379 ± 0.401
3.029ArgTyr: 3.029 ± 0.737
0.0ArgXaa: 0.0 ± 0.0
Ser
4.544SerAla: 4.544 ± 1.105
0.0SerCys: 0.0 ± 0.0
4.165SerAsp: 4.165 ± 0.534
3.029SerGlu: 3.029 ± 0.737
2.272SerPhe: 2.272 ± 0.552
7.573SerGly: 7.573 ± 1.841
1.893SerHis: 1.893 ± 0.468
4.922SerIle: 4.922 ± 2.206
4.165SerLys: 4.165 ± 0.085
6.058SerLeu: 6.058 ± 1.473
1.893SerMet: 1.893 ± 0.151
3.029SerAsn: 3.029 ± 0.737
1.515SerPro: 1.515 ± 0.869
6.437SerGln: 6.437 ± 0.637
4.544SerArg: 4.544 ± 1.105
4.165SerSer: 4.165 ± 1.771
9.087SerThr: 9.087 ± 0.265
4.922SerVal: 4.922 ± 0.969
1.136SerTrp: 1.136 ± 0.033
1.893SerTyr: 1.893 ± 0.151
0.0SerXaa: 0.0 ± 0.0
Thr
5.301ThrAla: 5.301 ± 0.67
0.0ThrCys: 0.0 ± 0.0
4.165ThrAsp: 4.165 ± 1.771
2.651ThrGlu: 2.651 ± 0.954
4.165ThrPhe: 4.165 ± 0.085
3.786ThrGly: 3.786 ± 2.158
1.136ThrHis: 1.136 ± 0.033
4.165ThrIle: 4.165 ± 1.153
2.651ThrLys: 2.651 ± 0.284
3.408ThrLeu: 3.408 ± 0.718
2.651ThrMet: 2.651 ± 0.284
3.408ThrAsn: 3.408 ± 2.375
4.165ThrPro: 4.165 ± 0.703
3.029ThrGln: 3.029 ± 0.118
4.922ThrArg: 4.922 ± 0.35
6.437ThrSer: 6.437 ± 1.874
5.68ThrThr: 5.68 ± 0.166
3.786ThrVal: 3.786 ± 0.302
0.757ThrTrp: 0.757 ± 0.184
1.515ThrTyr: 1.515 ± 0.368
0.0ThrXaa: 0.0 ± 0.0
Val
3.786ValAla: 3.786 ± 0.317
0.379ValCys: 0.379 ± 0.217
3.408ValAsp: 3.408 ± 0.519
6.437ValGlu: 6.437 ± 1.256
3.786ValPhe: 3.786 ± 0.935
2.651ValGly: 2.651 ± 0.902
1.136ValHis: 1.136 ± 0.652
2.651ValIle: 2.651 ± 0.902
2.651ValLys: 2.651 ± 0.954
4.922ValLeu: 4.922 ± 0.35
1.515ValMet: 1.515 ± 0.25
2.651ValAsn: 2.651 ± 0.335
4.165ValPro: 4.165 ± 0.534
2.272ValGln: 2.272 ± 1.171
2.272ValArg: 2.272 ± 0.066
4.544ValSer: 4.544 ± 0.751
2.651ValThr: 2.651 ± 0.284
4.165ValVal: 4.165 ± 0.085
0.757ValTrp: 0.757 ± 0.435
2.651ValTyr: 2.651 ± 0.284
0.0ValXaa: 0.0 ± 0.0
Trp
0.757TrpAla: 0.757 ± 0.803
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.136TrpGlu: 1.136 ± 0.652
1.515TrpPhe: 1.515 ± 0.25
0.379TrpGly: 0.379 ± 0.401
0.0TrpHis: 0.0 ± 0.0
0.757TrpIle: 0.757 ± 0.184
0.757TrpLys: 0.757 ± 0.184
2.272TrpLeu: 2.272 ± 1.304
1.136TrpMet: 1.136 ± 0.586
0.757TrpAsn: 0.757 ± 0.184
0.379TrpPro: 0.379 ± 0.217
0.757TrpGln: 0.757 ± 0.435
1.515TrpArg: 1.515 ± 0.25
1.136TrpSer: 1.136 ± 0.586
0.379TrpThr: 0.379 ± 0.217
1.136TrpVal: 1.136 ± 0.033
0.0TrpTrp: 0.0 ± 0.0
0.379TrpTyr: 0.379 ± 0.401
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.515TyrAla: 1.515 ± 0.368
0.757TyrCys: 0.757 ± 0.435
3.029TyrAsp: 3.029 ± 0.118
1.136TyrGlu: 1.136 ± 0.033
1.893TyrPhe: 1.893 ± 1.388
3.408TyrGly: 3.408 ± 0.099
1.136TyrHis: 1.136 ± 0.652
3.029TyrIle: 3.029 ± 0.501
3.408TyrLys: 3.408 ± 0.099
3.029TyrLeu: 3.029 ± 0.501
1.136TyrMet: 1.136 ± 0.033
3.786TyrAsn: 3.786 ± 0.302
1.893TyrPro: 1.893 ± 0.468
1.136TyrGln: 1.136 ± 0.033
1.515TyrArg: 1.515 ± 0.368
1.515TyrSer: 1.515 ± 1.606
3.029TyrThr: 3.029 ± 1.12
3.408TyrVal: 3.408 ± 1.138
0.757TyrTrp: 0.757 ± 0.803
3.029TyrTyr: 3.029 ± 0.501
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2642 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski