Amino acid dipeptide frequency for Beihai picorna-like virus 115

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
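As a minimal sketch of how such frequencies could be computed (not necessarily how Proteome-pI does it), the function below counts overlapping residue pairs. The function name, the toy sequences, and the choice to count pairs only within a protein are assumptions made for illustration. It uses one-letter codes, whereas the table uses three-letter codes.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption of this sketch: dipeptides are overlapping pairs of adjacent
    residues, counted within each protein and never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every pair of adjacent residues
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy example with hypothetical sequences (not the viral proteins)
freqs = dipeptide_frequencies(["MKLLA", "MKL"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

A protein of length n contributes n - 1 dipeptides, so sequences shorter than two residues contribute nothing.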

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, overrepresented dipeptides are marked in red and underrepresented ones in blue in the table for better visibility.
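The expected value and the over/under labelling can be illustrated as follows. This is only a sketch: the page does not state the exact rule behind the colouring, so the strict comparison against the uniform expectation, and the names `classify` and `expected_per_mille`, are assumptions.

```python
N_STANDARD = 20

# Uniform expectation: each of the 20 * 20 = 400 dipeptides is equally likely
expected_fraction = 1.0 / (N_STANDARD ** 2)      # 0.0025
expected_percent = 100.0 * expected_fraction      # 0.25 %
expected_per_mille = 1000.0 * expected_fraction   # 2.5 per mille

def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"

# Values taken from the table
examples = {"LeuLeu": 7.006, "AlaAla": 3.822, "CysCys": 0.318, "TrpPro": 0.0}
labels = {pair: classify(value) for pair, value in examples.items()}
print(labels)
```

With Xaa included, the same calculation would use 21 * 21 = 441 combinations instead of 400.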

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
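The unit conversion is simple arithmetic; a short sketch with hypothetical helper names, using the SerLeu value from the table:

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

ser_leu = 7.325  # SerLeu value from the table, in per mille
fraction = per_mille_to_fraction(ser_leu)  # about 0.007325
percent = per_mille_to_percent(ser_leu)    # about 0.7325 %
print(fraction, percent)
```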

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.822 ± 0.3
AlaCys: 0.955 ± 0.443
AlaAsp: 1.911 ± 0.368
AlaGlu: 6.051 ± 0.131
AlaPhe: 3.822 ± 0.217
AlaGly: 3.185 ± 0.095
AlaHis: 1.274 ± 0.763
AlaIle: 1.592 ± 0.047
AlaLys: 2.866 ± 0.292
AlaLeu: 3.503 ± 0.103
AlaMet: 0.955 ± 0.075
AlaAsn: 2.229 ± 0.17
AlaPro: 2.229 ± 0.688
AlaGln: 3.822 ± 1.854
AlaArg: 3.503 ± 1.451
AlaSer: 2.866 ± 1.779
AlaThr: 4.459 ± 1.375
AlaVal: 5.414 ± 0.265
AlaTrp: 0.318 ± 0.32
AlaTyr: 2.229 ± 0.348
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.274 ± 0.791
CysCys: 0.318 ± 0.32
CysAsp: 0.955 ± 0.075
CysGlu: 1.274 ± 0.273
CysPhe: 0.955 ± 0.075
CysGly: 1.274 ± 0.245
CysHis: 0.637 ± 0.123
CysIle: 0.318 ± 0.198
CysLys: 2.866 ± 0.81
CysLeu: 0.955 ± 0.593
CysMet: 1.274 ± 0.154
CysAsn: 1.274 ± 0.763
CysPro: 0.637 ± 0.123
CysGln: 2.229 ± 0.348
CysArg: 0.318 ± 0.198
CysSer: 1.274 ± 0.245
CysThr: 0.955 ± 0.593
CysVal: 0.955 ± 0.593
CysTrp: 0.318 ± 0.198
CysTyr: 0.318 ± 0.32
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.185 ± 0.423
AspCys: 1.592 ± 0.988
AspAsp: 5.732 ± 1.103
AspGlu: 3.185 ± 0.095
AspPhe: 2.866 ± 0.225
AspGly: 4.14 ± 0.02
AspHis: 2.866 ± 0.743
AspIle: 3.185 ± 0.613
AspLys: 2.229 ± 0.348
AspLeu: 5.096 ± 1.498
AspMet: 1.911 ± 0.668
AspAsn: 3.185 ± 0.095
AspPro: 5.732 ± 0.585
AspGln: 2.229 ± 1.206
AspArg: 3.185 ± 1.13
AspSer: 4.14 ± 2.091
AspThr: 2.229 ± 0.348
AspVal: 3.185 ± 0.941
AspTrp: 0.0 ± 0.0
AspTyr: 2.866 ± 0.743
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.185 ± 1.13
GluCys: 0.318 ± 0.198
GluAsp: 3.185 ± 0.095
GluGlu: 4.459 ± 0.34
GluPhe: 1.274 ± 0.763
GluGly: 2.229 ± 0.348
GluHis: 1.592 ± 0.047
GluIle: 5.096 ± 1.609
GluLys: 3.503 ± 0.415
GluLeu: 5.414 ± 2.324
GluMet: 2.229 ± 0.866
GluAsn: 4.459 ± 1.214
GluPro: 1.592 ± 0.988
GluGln: 4.459 ± 0.34
GluArg: 2.866 ± 0.743
GluSer: 3.503 ± 0.103
GluThr: 4.14 ± 0.02
GluVal: 2.548 ± 0.545
GluTrp: 1.274 ± 0.763
GluTyr: 1.592 ± 0.47
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.822 ± 0.3
PheCys: 0.637 ± 0.123
PheAsp: 3.503 ± 0.103
PheGlu: 3.503 ± 0.933
PhePhe: 2.229 ± 0.866
PheGly: 4.14 ± 0.498
PheHis: 0.955 ± 0.075
PheIle: 0.637 ± 0.395
PheLys: 3.185 ± 0.613
PheLeu: 3.185 ± 1.976
PheMet: 1.592 ± 0.47
PheAsn: 2.229 ± 0.348
PhePro: 3.822 ± 0.735
PheGln: 0.637 ± 0.123
PheArg: 1.911 ± 0.668
PheSer: 3.503 ± 0.415
PheThr: 3.503 ± 1.138
PheVal: 3.503 ± 0.103
PheTrp: 0.637 ± 0.395
PheTyr: 1.911 ± 0.368
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.732 ± 0.451
GlyCys: 0.637 ± 0.123
GlyAsp: 4.14 ± 1.016
GlyGlu: 4.777 ± 0.376
GlyPhe: 3.503 ± 1.138
GlyGly: 1.911 ± 0.368
GlyHis: 1.592 ± 0.47
GlyIle: 2.229 ± 1.383
GlyLys: 4.777 ± 1.929
GlyLeu: 4.777 ± 1.411
GlyMet: 1.911 ± 0.15
GlyAsn: 3.185 ± 0.095
GlyPro: 1.911 ± 0.885
GlyGln: 1.911 ± 0.668
GlyArg: 0.955 ± 0.075
GlySer: 3.185 ± 1.648
GlyThr: 2.866 ± 0.81
GlyVal: 5.732 ± 2.138
GlyTrp: 0.637 ± 0.123
GlyTyr: 1.911 ± 0.885
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.637 ± 0.395
HisCys: 0.955 ± 0.96
HisAsp: 0.955 ± 0.593
HisGlu: 0.955 ± 0.593
HisPhe: 1.592 ± 0.47
HisGly: 1.592 ± 0.988
HisHis: 0.318 ± 0.32
HisIle: 1.274 ± 0.791
HisLys: 0.318 ± 0.32
HisLeu: 3.503 ± 2.174
HisMet: 0.318 ± 0.32
HisAsn: 0.318 ± 0.198
HisPro: 2.548 ± 0.028
HisGln: 0.637 ± 0.123
HisArg: 0.955 ± 0.443
HisSer: 1.274 ± 0.273
HisThr: 0.637 ± 0.64
HisVal: 2.548 ± 0.49
HisTrp: 1.274 ± 0.245
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.229 ± 0.866
IleCys: 0.0 ± 0.0
IleAsp: 4.459 ± 1.214
IleGlu: 3.185 ± 1.459
IlePhe: 1.592 ± 0.047
IleGly: 4.459 ± 0.696
IleHis: 1.592 ± 0.47
IleIle: 3.503 ± 1.138
IleLys: 4.777 ± 1.178
IleLeu: 3.822 ± 1.336
IleMet: 1.274 ± 0.791
IleAsn: 3.503 ± 2.486
IlePro: 4.459 ± 1.893
IleGln: 1.911 ± 0.368
IleArg: 1.274 ± 0.245
IleSer: 5.414 ± 1.806
IleThr: 3.185 ± 0.613
IleVal: 4.14 ± 0.02
IleTrp: 0.318 ± 0.198
IleTyr: 0.955 ± 0.443
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.503 ± 0.621
LysCys: 2.229 ± 0.348
LysAsp: 3.822 ± 0.217
LysGlu: 2.866 ± 0.743
LysPhe: 4.14 ± 1.016
LysGly: 2.866 ± 0.292
LysHis: 1.911 ± 1.186
LysIle: 2.866 ± 1.328
LysLys: 4.14 ± 1.016
LysLeu: 6.051 ± 0.387
LysMet: 2.866 ± 1.166
LysAsn: 3.185 ± 2.166
LysPro: 5.096 ± 0.055
LysGln: 1.911 ± 0.368
LysArg: 3.185 ± 0.095
LysSer: 3.822 ± 1.336
LysThr: 3.822 ± 0.818
LysVal: 1.911 ± 0.885
LysTrp: 1.274 ± 0.273
LysTyr: 2.866 ± 1.846
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.096 ± 0.462
LeuCys: 1.274 ± 0.273
LeuAsp: 5.414 ± 0.253
LeuGlu: 3.185 ± 1.976
LeuPhe: 4.777 ± 0.893
LeuGly: 4.777 ± 0.376
LeuHis: 0.955 ± 0.075
LeuIle: 5.414 ± 1.806
LeuLys: 7.006 ± 1.759
LeuLeu: 7.006 ± 2.795
LeuMet: 3.185 ± 0.941
LeuAsn: 3.503 ± 0.933
LeuPro: 3.822 ± 0.217
LeuGln: 2.229 ± 0.866
LeuArg: 3.185 ± 0.941
LeuSer: 4.14 ± 0.498
LeuThr: 6.369 ± 0.328
LeuVal: 3.822 ± 0.735
LeuTrp: 1.911 ± 0.368
LeuTyr: 4.459 ± 0.178
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.274 ± 0.245
MetCys: 1.274 ± 0.273
MetAsp: 0.318 ± 0.32
MetGlu: 0.637 ± 0.395
MetPhe: 2.229 ± 0.348
MetGly: 1.592 ± 0.047
MetHis: 0.637 ± 0.395
MetIle: 1.274 ± 0.245
MetLys: 1.911 ± 0.368
MetLeu: 2.229 ± 0.348
MetMet: 0.318 ± 0.198
MetAsn: 2.229 ± 0.866
MetPro: 1.592 ± 0.565
MetGln: 0.637 ± 0.395
MetArg: 0.955 ± 0.075
MetSer: 4.777 ± 0.893
MetThr: 2.229 ± 0.866
MetVal: 0.955 ± 0.075
MetTrp: 0.955 ± 0.593
MetTyr: 1.592 ± 0.47
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.866 ± 1.328
AsnCys: 1.592 ± 0.047
AsnAsp: 1.274 ± 0.245
AsnGlu: 1.911 ± 0.885
AsnPhe: 2.229 ± 0.17
AsnGly: 2.548 ± 0.49
AsnHis: 1.592 ± 0.047
AsnIle: 3.185 ± 0.613
AsnLys: 1.274 ± 0.245
AsnLeu: 5.096 ± 1.091
AsnMet: 2.866 ± 0.292
AsnAsn: 3.185 ± 3.202
AsnPro: 3.822 ± 1.771
AsnGln: 2.229 ± 0.348
AsnArg: 2.229 ± 0.688
AsnSer: 3.503 ± 1.451
AsnThr: 3.503 ± 1.451
AsnVal: 4.14 ± 0.537
AsnTrp: 0.637 ± 0.123
AsnTyr: 0.955 ± 0.075
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.185 ± 0.095
ProCys: 1.911 ± 1.186
ProAsp: 4.459 ± 0.858
ProGlu: 2.548 ± 0.49
ProPhe: 1.274 ± 0.273
ProGly: 3.503 ± 1.968
ProHis: 0.318 ± 0.32
ProIle: 3.822 ± 0.735
ProLys: 4.14 ± 2.091
ProLeu: 3.822 ± 0.217
ProMet: 0.637 ± 0.123
ProAsn: 1.592 ± 0.47
ProPro: 0.955 ± 0.443
ProGln: 2.866 ± 0.292
ProArg: 2.548 ± 0.49
ProSer: 3.822 ± 0.735
ProThr: 3.503 ± 0.415
ProVal: 5.096 ± 0.462
ProTrp: 0.637 ± 0.64
ProTyr: 1.911 ± 0.15
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.592 ± 0.47
GlnCys: 0.637 ± 0.123
GlnAsp: 2.866 ± 0.225
GlnGlu: 2.866 ± 1.261
GlnPhe: 1.911 ± 0.368
GlnGly: 1.592 ± 0.047
GlnHis: 0.955 ± 0.593
GlnIle: 2.866 ± 0.292
GlnLys: 0.955 ± 0.075
GlnLeu: 2.548 ± 1.063
GlnMet: 1.911 ± 0.15
GlnAsn: 2.866 ± 0.292
GlnPro: 1.911 ± 0.885
GlnGln: 2.548 ± 1.063
GlnArg: 2.866 ± 1.779
GlnSer: 3.185 ± 0.941
GlnThr: 3.822 ± 0.735
GlnVal: 0.955 ± 0.443
GlnTrp: 0.637 ± 0.395
GlnTyr: 2.866 ± 0.225
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.548 ± 0.028
ArgCys: 0.955 ± 0.075
ArgAsp: 2.548 ± 0.545
ArgGlu: 1.592 ± 0.047
ArgPhe: 2.866 ± 0.292
ArgGly: 2.548 ± 1.063
ArgHis: 1.592 ± 0.565
ArgIle: 3.503 ± 0.415
ArgLys: 3.185 ± 0.095
ArgLeu: 4.14 ± 0.537
ArgMet: 0.955 ± 0.075
ArgAsn: 0.318 ± 0.32
ArgPro: 1.911 ± 0.368
ArgGln: 1.911 ± 0.15
ArgArg: 2.548 ± 0.49
ArgSer: 3.503 ± 0.933
ArgThr: 2.229 ± 0.17
ArgVal: 3.503 ± 0.621
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.274 ± 0.245
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.822 ± 0.735
SerCys: 1.592 ± 0.047
SerAsp: 5.096 ± 1.498
SerGlu: 4.777 ± 1.411
SerPhe: 3.185 ± 1.648
SerGly: 4.777 ± 1.411
SerHis: 0.637 ± 0.123
SerIle: 4.777 ± 0.893
SerLys: 7.006 ± 1.241
SerLeu: 7.325 ± 1.668
SerMet: 1.274 ± 0.763
SerAsn: 2.866 ± 1.328
SerPro: 2.229 ± 0.348
SerGln: 2.548 ± 1.063
SerArg: 2.548 ± 0.028
SerSer: 5.732 ± 0.067
SerThr: 3.185 ± 0.095
SerVal: 5.096 ± 1.091
SerTrp: 0.637 ± 0.123
SerTyr: 2.866 ± 0.225
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.866 ± 0.81
ThrCys: 1.274 ± 0.273
ThrAsp: 4.459 ± 0.34
ThrGlu: 2.548 ± 0.545
ThrPhe: 2.866 ± 1.779
ThrGly: 4.14 ± 0.537
ThrHis: 0.637 ± 0.395
ThrIle: 4.14 ± 0.537
ThrLys: 3.822 ± 0.3
ThrLeu: 6.688 ± 0.51
ThrMet: 0.955 ± 0.443
ThrAsn: 2.866 ± 0.292
ThrPro: 3.822 ± 0.217
ThrGln: 3.503 ± 0.103
ThrArg: 2.548 ± 0.028
ThrSer: 3.503 ± 0.933
ThrThr: 4.14 ± 1.573
ThrVal: 4.459 ± 0.34
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.229 ± 0.17
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.503 ± 0.933
ValCys: 0.955 ± 0.075
ValAsp: 4.777 ± 1.178
ValGlu: 5.096 ± 0.462
ValPhe: 3.503 ± 0.621
ValGly: 5.096 ± 1.091
ValHis: 1.274 ± 0.273
ValIle: 2.866 ± 0.81
ValLys: 4.14 ± 0.498
ValLeu: 3.503 ± 0.621
ValMet: 1.592 ± 0.047
ValAsn: 5.096 ± 1.498
ValPro: 2.866 ± 0.292
ValGln: 1.592 ± 0.565
ValArg: 2.548 ± 0.49
ValSer: 6.369 ± 0.19
ValThr: 3.503 ± 0.103
ValVal: 6.051 ± 1.423
ValTrp: 1.274 ± 0.245
ValTyr: 1.592 ± 0.047
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.955 ± 0.075
TrpCys: 0.955 ± 0.075
TrpAsp: 1.592 ± 0.565
TrpGlu: 0.637 ± 0.64
TrpPhe: 0.637 ± 0.395
TrpGly: 0.318 ± 0.198
TrpHis: 0.318 ± 0.32
TrpIle: 1.592 ± 0.565
TrpLys: 0.955 ± 0.593
TrpLeu: 0.637 ± 0.395
TrpMet: 0.955 ± 0.593
TrpAsn: 0.318 ± 0.32
TrpPro: 0.0 ± 0.0
TrpGln: 0.637 ± 0.395
TrpArg: 1.592 ± 1.083
TrpSer: 0.955 ± 0.075
TrpThr: 0.318 ± 0.32
TrpVal: 0.318 ± 0.198
TrpTrp: 0.318 ± 0.32
TrpTyr: 0.318 ± 0.32
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.911 ± 0.885
TyrCys: 0.637 ± 0.64
TyrAsp: 1.274 ± 0.245
TyrGlu: 2.866 ± 0.225
TyrPhe: 1.592 ± 0.565
TyrGly: 1.592 ± 0.565
TyrHis: 0.955 ± 0.593
TyrIle: 1.911 ± 0.668
TyrLys: 1.274 ± 0.245
TyrLeu: 2.229 ± 0.17
TyrMet: 0.318 ± 0.198
TyrAsn: 2.229 ± 1.206
TyrPro: 1.592 ± 0.047
TyrGln: 1.911 ± 1.186
TyrArg: 1.911 ± 0.15
TyrSer: 3.503 ± 0.621
TyrThr: 2.866 ± 0.743
TyrVal: 2.866 ± 1.328
TyrTrp: 1.274 ± 0.245
TyrTyr: 0.637 ± 0.64
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3141 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
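A protein-level bootstrap of this kind might look like the sketch below. The exact procedure used by Proteome-pI is not described here, so the resampling scheme (drawing whole proteins with replacement), the use of the standard deviation as the error, and all function names and toy sequences are assumptions.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as the proteome has, with replacement
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.pstdev(estimates)

# Hypothetical two-protein proteome in one-letter code
proteome = ["MKLLAKL", "MAAKLLG"]
error = bootstrap_error(proteome, "KL")
print(f"KL: {dipeptide_per_mille(proteome, 'KL'):.3f} ± {error:.3f}")
```

With only two proteins, each replicate can take just a few distinct compositions, so estimates of this kind are coarse.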

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski