Amino acid dipeptide frequency for Beihai picorna-like virus 72

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
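As an illustration of how such frequencies can be obtained, the following minimal sketch counts every overlapping pair of adjacent residues and reports the result in per mille. The function name `dipeptide_permille`, the one-letter input alphabet, and the mapping of any non-standard letter to X (Xaa) are assumptions made for this example and not a description of the Proteome-pI pipeline. Pairs are counted within each protein only, never across protein boundaries.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows: residues i and i+1 form one dipeptide.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences (hypothetical data): 7 + 3 = 10 dipeptides in total.
freqs = dipeptide_permille(["MKVLAAGA", "AAGX"])
```

Counting within proteins appears consistent with the table: the smallest non-zero value, 0.359‰, corresponds to one occurrence among 2782 dipeptides (2784 amino acids in 2 proteins give 2784 - 2 pairs).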

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
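The arithmetic behind the expected value and the unit conversion can be sketched as follows. The helper names `permille_to_fraction` and `classify` are hypothetical, and the simple comparison against the uniform expectation is an assumption about how over- and underrepresentation is decided; the colouring used on the original page may be more graded.

```python
import math

N_STANDARD = 20

# Uniform expectation: one of 20 * 20 = 400 equally likely dipeptides.
expected_fraction = 1 / N_STANDARD**2            # 0.0025
expected_percent = 100 * expected_fraction       # 0.25 %
expected_permille = 1000 * expected_fraction     # 2.5 per mille


def permille_to_fraction(value_permille):
    """Convert a table value in per mille to a plain fraction."""
    return value_permille * 1e-3


def classify(value_permille, expected=expected_permille):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    if math.isclose(value_permille, expected):
        return "expected"
    return "over" if value_permille > expected else "under"


# Values taken from the table: AlaAla = 6.827, AlaCys = 0.719 (per mille).
ala_ala = classify(6.827)
ala_cys = classify(0.719)
```

Under this rule AlaAla (6.827‰) counts as overrepresented and AlaCys (0.719‰) as underrepresented.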

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.827 ± 2.817
AlaCys: 0.719 ± 0.395
AlaAsp: 2.875 ± 0.211
AlaGlu: 3.593 ± 0.414
AlaPhe: 3.953 ± 0.216
AlaGly: 6.109 ± 0.372
AlaHis: 1.437 ± 0.79
AlaIle: 5.031 ± 0.377
AlaLys: 5.031 ± 0.974
AlaLeu: 5.39 ± 1.172
AlaMet: 2.156 ± 0.588
AlaAsn: 2.875 ± 2.004
AlaPro: 4.671 ± 1.016
AlaGln: 4.671 ± 1.374
AlaArg: 2.875 ± 0.386
AlaSer: 3.234 ± 1.209
AlaThr: 4.671 ± 0.418
AlaVal: 2.515 ± 1.006
AlaTrp: 1.437 ± 0.404
AlaTyr: 1.078 ± 0.602
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.078 ± 0.005
CysCys: 0.0 ± 0.0
CysAsp: 0.359 ± 0.198
CysGlu: 1.437 ± 0.79
CysPhe: 1.078 ± 0.602
CysGly: 2.156 ± 0.588
CysHis: 0.0 ± 0.0
CysIle: 0.719 ± 0.202
CysLys: 0.719 ± 0.395
CysLeu: 0.719 ± 0.395
CysMet: 0.359 ± 0.198
CysAsn: 0.359 ± 0.198
CysPro: 1.078 ± 0.602
CysGln: 1.078 ± 0.593
CysArg: 0.719 ± 0.395
CysSer: 0.719 ± 0.395
CysThr: 0.359 ± 0.198
CysVal: 1.078 ± 0.005
CysTrp: 0.0 ± 0.0
CysTyr: 0.719 ± 0.395
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.875 ± 0.386
AspCys: 0.719 ± 0.395
AspAsp: 3.953 ± 1.411
AspGlu: 6.468 ± 1.167
AspPhe: 3.593 ± 0.781
AspGly: 5.39 ± 0.574
AspHis: 2.156 ± 1.186
AspIle: 3.953 ± 0.813
AspLys: 2.515 ± 0.188
AspLeu: 5.749 ± 0.175
AspMet: 1.437 ± 0.79
AspAsn: 2.515 ± 1.006
AspPro: 2.875 ± 0.386
AspGln: 2.515 ± 0.188
AspArg: 3.234 ± 1.181
AspSer: 1.437 ± 1.002
AspThr: 2.156 ± 0.607
AspVal: 2.515 ± 1.006
AspTrp: 0.359 ± 0.198
AspTyr: 2.515 ± 0.409
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.39 ± 0.023
GluCys: 0.719 ± 0.395
GluAsp: 2.875 ± 1.406
GluGlu: 7.546 ± 2.357
GluPhe: 4.312 ± 0.579
GluGly: 2.156 ± 0.607
GluHis: 1.797 ± 0.207
GluIle: 3.234 ± 0.611
GluLys: 5.031 ± 0.377
GluLeu: 6.827 ± 0.767
GluMet: 1.797 ± 0.391
GluAsn: 1.078 ± 0.005
GluPro: 1.078 ± 0.593
GluGln: 3.593 ± 0.781
GluArg: 4.671 ± 2.569
GluSer: 3.953 ± 1.576
GluThr: 1.797 ± 0.804
GluVal: 7.905 ± 0.432
GluTrp: 2.156 ± 0.588
GluTyr: 1.437 ± 0.79
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.953 ± 0.216
PheCys: 0.0 ± 0.0
PheAsp: 4.671 ± 1.374
PheGlu: 4.671 ± 0.418
PhePhe: 1.437 ± 0.193
PheGly: 3.593 ± 0.414
PheHis: 0.359 ± 0.4
PheIle: 1.797 ± 0.207
PheLys: 1.437 ± 0.79
PheLeu: 2.156 ± 1.186
PheMet: 0.359 ± 0.325
PheAsn: 2.515 ± 1.006
PhePro: 2.515 ± 1.604
PheGln: 1.797 ± 0.988
PheArg: 4.312 ± 0.579
PheSer: 2.875 ± 0.809
PheThr: 3.953 ± 0.216
PheVal: 2.156 ± 0.607
PheTrp: 0.359 ± 0.198
PheTyr: 0.719 ± 0.202
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.031 ± 0.818
GlyCys: 1.797 ± 0.391
GlyAsp: 3.953 ± 0.381
GlyGlu: 5.031 ± 0.377
GlyPhe: 2.156 ± 0.607
GlyGly: 3.593 ± 2.803
GlyHis: 2.156 ± 0.588
GlyIle: 3.234 ± 1.209
GlyLys: 5.031 ± 1.571
GlyLeu: 3.593 ± 0.184
GlyMet: 1.797 ± 0.391
GlyAsn: 4.312 ± 0.018
GlyPro: 2.875 ± 1.406
GlyGln: 1.797 ± 0.988
GlyArg: 2.875 ± 0.211
GlySer: 3.953 ± 0.813
GlyThr: 5.39 ± 1.815
GlyVal: 3.593 ± 0.184
GlyTrp: 1.797 ± 0.804
GlyTyr: 2.515 ± 0.409
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.515 ± 0.188
HisCys: 0.0 ± 0.0
HisAsp: 1.437 ± 0.404
HisGlu: 0.359 ± 0.198
HisPhe: 1.797 ± 0.988
HisGly: 1.797 ± 0.988
HisHis: 0.719 ± 0.395
HisIle: 1.437 ± 0.193
HisLys: 0.359 ± 0.198
HisLeu: 2.515 ± 0.786
HisMet: 1.078 ± 0.593
HisAsn: 1.797 ± 0.391
HisPro: 1.078 ± 0.005
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 1.437 ± 0.193
HisThr: 1.797 ± 0.391
HisVal: 2.875 ± 1.581
HisTrp: 0.0 ± 0.0
HisTyr: 1.078 ± 1.199
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.515 ± 0.409
IleCys: 1.078 ± 0.593
IleAsp: 4.671 ± 0.179
IleGlu: 3.953 ± 0.381
IlePhe: 2.515 ± 0.409
IleGly: 5.39 ± 0.62
IleHis: 2.515 ± 0.188
IleIle: 0.359 ± 0.198
IleLys: 2.875 ± 0.211
IleLeu: 3.953 ± 0.381
IleMet: 0.359 ± 0.198
IleAsn: 2.875 ± 0.809
IlePro: 2.875 ± 0.809
IleGln: 1.437 ± 0.193
IleArg: 3.234 ± 0.014
IleSer: 7.186 ± 0.23
IleThr: 3.234 ± 0.014
IleVal: 3.593 ± 1.011
IleTrp: 0.719 ± 0.395
IleTyr: 1.437 ± 0.404
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.593 ± 0.781
LysCys: 0.719 ± 0.395
LysAsp: 2.875 ± 1.581
LysGlu: 2.156 ± 0.009
LysPhe: 3.234 ± 0.014
LysGly: 2.875 ± 0.386
LysHis: 0.719 ± 0.395
LysIle: 7.186 ± 0.23
LysLys: 5.749 ± 2.564
LysLeu: 5.749 ± 0.175
LysMet: 1.797 ± 0.988
LysAsn: 1.797 ± 0.207
LysPro: 3.234 ± 0.611
LysGln: 1.078 ± 0.593
LysArg: 2.515 ± 0.786
LysSer: 2.875 ± 0.386
LysThr: 5.031 ± 2.169
LysVal: 3.234 ± 1.181
LysTrp: 0.359 ± 0.198
LysTyr: 2.515 ± 0.188
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.953 ± 0.381
LeuCys: 1.078 ± 0.005
LeuAsp: 8.624 ± 0.037
LeuGlu: 7.905 ± 0.763
LeuPhe: 1.797 ± 0.207
LeuGly: 4.312 ± 0.616
LeuHis: 2.515 ± 0.409
LeuIle: 3.593 ± 0.184
LeuLys: 5.031 ± 2.169
LeuLeu: 5.39 ± 1.172
LeuMet: 1.797 ± 0.391
LeuAsn: 3.593 ± 0.184
LeuPro: 4.312 ± 0.616
LeuGln: 4.312 ± 1.176
LeuArg: 5.031 ± 0.221
LeuSer: 4.312 ± 0.579
LeuThr: 5.749 ± 1.02
LeuVal: 5.031 ± 0.974
LeuTrp: 1.437 ± 0.79
LeuTyr: 4.312 ± 0.579
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.875 ± 0.386
MetCys: 0.359 ± 0.198
MetAsp: 2.156 ± 0.588
MetGlu: 2.156 ± 1.186
MetPhe: 1.437 ± 1.002
MetGly: 1.078 ± 1.199
MetHis: 0.359 ± 0.198
MetIle: 1.437 ± 0.79
MetLys: 1.437 ± 0.79
MetLeu: 1.078 ± 0.593
MetMet: 1.078 ± 0.005
MetAsn: 0.719 ± 0.395
MetPro: 1.078 ± 0.593
MetGln: 1.437 ± 0.193
MetArg: 2.156 ± 0.588
MetSer: 1.797 ± 1.999
MetThr: 1.437 ± 0.404
MetVal: 1.437 ± 0.79
MetTrp: 0.359 ± 0.4
MetTyr: 1.078 ± 0.005
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.593 ± 1.011
AsnCys: 1.437 ± 0.404
AsnAsp: 3.593 ± 0.184
AsnGlu: 2.515 ± 0.786
AsnPhe: 1.078 ± 0.005
AsnGly: 1.797 ± 1.402
AsnHis: 1.078 ± 0.593
AsnIle: 2.515 ± 0.409
AsnLys: 1.797 ± 0.391
AsnLeu: 5.031 ± 1.415
AsnMet: 3.593 ± 1.608
AsnAsn: 2.515 ± 0.409
AsnPro: 1.797 ± 0.804
AsnGln: 1.797 ± 0.804
AsnArg: 1.078 ± 0.005
AsnSer: 5.749 ± 1.369
AsnThr: 1.078 ± 0.602
AsnVal: 3.953 ± 2.606
AsnTrp: 0.719 ± 0.202
AsnTyr: 1.078 ± 0.005
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.875 ± 0.809
ProCys: 0.359 ± 0.4
ProAsp: 2.156 ± 1.204
ProGlu: 1.797 ± 0.391
ProPhe: 2.875 ± 0.809
ProGly: 2.515 ± 1.604
ProHis: 0.359 ± 0.198
ProIle: 3.234 ± 1.209
ProLys: 2.156 ± 0.607
ProLeu: 5.39 ± 0.62
ProMet: 2.156 ± 0.426
ProAsn: 2.156 ± 0.588
ProPro: 1.437 ± 0.79
ProGln: 0.0 ± 0.0
ProArg: 3.234 ± 0.584
ProSer: 2.875 ± 0.386
ProThr: 2.875 ± 2.601
ProVal: 2.875 ± 0.809
ProTrp: 1.437 ± 0.193
ProTyr: 2.515 ± 2.201
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.437 ± 0.79
GlnCys: 0.719 ± 0.395
GlnAsp: 1.797 ± 0.391
GlnGlu: 2.156 ± 0.588
GlnPhe: 0.359 ± 0.198
GlnGly: 3.234 ± 1.778
GlnHis: 0.719 ± 0.395
GlnIle: 3.593 ± 0.184
GlnLys: 1.797 ± 0.988
GlnLeu: 5.031 ± 0.818
GlnMet: 1.437 ± 0.79
GlnAsn: 1.078 ± 0.602
GlnPro: 1.797 ± 0.207
GlnGln: 1.078 ± 0.005
GlnArg: 1.437 ± 0.193
GlnSer: 1.437 ± 0.79
GlnThr: 2.515 ± 0.786
GlnVal: 3.234 ± 0.014
GlnTrp: 1.797 ± 0.988
GlnTyr: 0.359 ± 0.198
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.312 ± 0.018
ArgCys: 0.359 ± 0.198
ArgAsp: 2.156 ± 0.009
ArgGlu: 2.875 ± 0.983
ArgPhe: 1.437 ± 0.404
ArgGly: 3.953 ± 0.216
ArgHis: 2.875 ± 1.581
ArgIle: 2.875 ± 0.983
ArgLys: 3.234 ± 1.181
ArgLeu: 3.953 ± 0.381
ArgMet: 1.078 ± 0.593
ArgAsn: 1.797 ± 0.391
ArgPro: 3.593 ± 1.011
ArgGln: 2.515 ± 0.188
ArgArg: 2.875 ± 0.211
ArgSer: 2.515 ± 0.188
ArgThr: 4.671 ± 0.776
ArgVal: 5.39 ± 1.769
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.875 ± 0.211
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.234 ± 0.584
SerCys: 1.078 ± 0.005
SerAsp: 3.234 ± 1.181
SerGlu: 2.515 ± 0.409
SerPhe: 3.593 ± 0.184
SerGly: 6.109 ± 0.372
SerHis: 1.078 ± 0.602
SerIle: 1.797 ± 0.207
SerLys: 5.031 ± 0.221
SerLeu: 7.186 ± 0.965
SerMet: 2.156 ± 1.204
SerAsn: 2.875 ± 0.809
SerPro: 1.078 ± 0.593
SerGln: 3.593 ± 0.414
SerArg: 2.875 ± 0.386
SerSer: 1.797 ± 0.391
SerThr: 3.593 ± 2.206
SerVal: 2.875 ± 0.386
SerTrp: 0.0 ± 0.0
SerTyr: 2.875 ± 0.211
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.186 ± 0.23
ThrCys: 0.719 ± 0.395
ThrAsp: 1.797 ± 0.207
ThrGlu: 3.953 ± 1.411
ThrPhe: 2.875 ± 0.386
ThrGly: 5.749 ± 1.618
ThrHis: 1.797 ± 0.391
ThrIle: 5.031 ± 0.221
ThrLys: 3.234 ± 0.611
ThrLeu: 5.031 ± 0.377
ThrMet: 1.078 ± 0.005
ThrAsn: 5.39 ± 3.01
ThrPro: 1.797 ± 1.999
ThrGln: 2.156 ± 1.186
ThrArg: 2.515 ± 0.786
ThrSer: 4.671 ± 0.418
ThrThr: 2.515 ± 1.604
ThrVal: 3.953 ± 0.813
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.437 ± 1.599
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.186 ± 0.827
ValCys: 1.437 ± 0.79
ValAsp: 2.156 ± 0.607
ValGlu: 6.109 ± 0.372
ValPhe: 2.156 ± 0.009
ValGly: 3.593 ± 0.414
ValHis: 0.719 ± 0.395
ValIle: 3.953 ± 0.216
ValLys: 4.312 ± 1.176
ValLeu: 4.671 ± 0.179
ValMet: 0.359 ± 0.198
ValAsn: 3.593 ± 0.414
ValPro: 4.312 ± 1.213
ValGln: 1.797 ± 0.391
ValArg: 4.312 ± 0.018
ValSer: 3.234 ± 1.209
ValThr: 5.39 ± 2.413
ValVal: 2.875 ± 0.211
ValTrp: 0.719 ± 0.202
ValTyr: 2.875 ± 0.809
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 1.078 ± 0.593
TrpGlu: 0.719 ± 0.395
TrpPhe: 1.078 ± 0.005
TrpGly: 0.0 ± 0.0
TrpHis: 0.359 ± 0.198
TrpIle: 1.078 ± 0.005
TrpLys: 1.078 ± 0.005
TrpLeu: 2.156 ± 0.588
TrpMet: 0.359 ± 0.4
TrpAsn: 1.078 ± 0.005
TrpPro: 0.0 ± 0.0
TrpGln: 0.359 ± 0.198
TrpArg: 1.437 ± 0.404
TrpSer: 1.078 ± 0.005
TrpThr: 1.437 ± 0.193
TrpVal: 1.437 ± 0.404
TrpTrp: 0.359 ± 0.198
TrpTyr: 0.359 ± 0.198
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.719 ± 0.202
TyrCys: 1.437 ± 0.404
TyrAsp: 2.515 ± 0.188
TyrGlu: 1.437 ± 0.404
TyrPhe: 3.234 ± 0.014
TyrGly: 1.078 ± 0.593
TyrHis: 0.359 ± 0.4
TyrIle: 0.719 ± 0.395
TyrLys: 1.437 ± 0.404
TyrLeu: 2.515 ± 0.409
TyrMet: 0.359 ± 0.198
TyrAsn: 2.875 ± 0.211
TyrPro: 1.797 ± 1.402
TyrGln: 0.0 ± 0.0
TyrArg: 3.953 ± 0.216
TyrSer: 1.437 ± 0.193
TyrThr: 3.234 ± 0.014
TyrVal: 3.234 ± 3.001
TyrTrp: 1.437 ± 0.404
TyrTyr: 0.719 ± 0.202
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2784 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
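The exact bootstrap procedure is not described here, so the following is only one plausible reading of "bootstrapping at the protein level": whole proteins are resampled with replacement, the pooled per mille frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The function names, the fixed seed, and the use of the population standard deviation are assumptions made for this sketch.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, spread) of the per mille frequency of `pair`.

    Assumed procedure: resample whole proteins with replacement,
    pool their dipeptide counts, and repeat n_rounds times.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.pstdev(estimates)


# Toy proteome of two hypothetical sequences.
mean_aa, err_aa = bootstrap_error(["AAGA", "GGGG"], "AA")
```

With this scheme a dipeptide that is absent from every protein always yields 0.0 ± 0.0, which matches how the empty cells in the table are reported.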

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski