Amino acid dipepetide frequency for Wuhan insect virus 31

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.306AlaAla: 7.306 ± 0.959
2.578AlaCys: 2.578 ± 0.018
4.727AlaAsp: 4.727 ± 0.421
2.578AlaGlu: 2.578 ± 0.018
1.719AlaPhe: 1.719 ± 1.147
2.578AlaGly: 2.578 ± 0.699
0.0AlaHis: 0.0 ± 0.0
5.587AlaIle: 5.587 ± 0.188
4.297AlaLys: 4.297 ± 1.165
8.165AlaLeu: 8.165 ± 0.852
0.859AlaMet: 0.859 ± 0.448
3.868AlaAsn: 3.868 ± 0.655
2.149AlaPro: 2.149 ± 1.121
0.43AlaGln: 0.43 ± 0.224
4.727AlaArg: 4.727 ± 0.941
4.727AlaSer: 4.727 ± 0.421
7.735AlaThr: 7.735 ± 1.99
6.016AlaVal: 6.016 ± 1.094
0.859AlaTrp: 0.859 ± 0.448
3.438AlaTyr: 3.438 ± 0.43
0.0AlaXaa: 0.0 ± 0.0
Cys
0.859CysAla: 0.859 ± 0.233
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.859CysGlu: 0.859 ± 0.914
0.0CysPhe: 0.0 ± 0.0
1.719CysGly: 1.719 ± 0.215
0.0CysHis: 0.0 ± 0.0
1.289CysIle: 1.289 ± 0.672
0.859CysLys: 0.859 ± 0.448
1.289CysLeu: 1.289 ± 0.009
1.289CysMet: 1.289 ± 0.009
1.289CysAsn: 1.289 ± 0.69
0.0CysPro: 0.0 ± 0.0
0.43CysGln: 0.43 ± 0.224
1.719CysArg: 1.719 ± 0.466
0.859CysSer: 0.859 ± 0.448
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.43CysTrp: 0.43 ± 0.224
0.43CysTyr: 0.43 ± 0.457
0.0CysXaa: 0.0 ± 0.0
Asp
4.727AspAla: 4.727 ± 1.103
0.43AspCys: 0.43 ± 0.224
3.868AspAsp: 3.868 ± 0.655
3.868AspGlu: 3.868 ± 0.708
1.719AspPhe: 1.719 ± 0.466
2.578AspGly: 2.578 ± 1.38
0.859AspHis: 0.859 ± 0.233
2.578AspIle: 2.578 ± 1.38
3.868AspLys: 3.868 ± 1.389
4.297AspLeu: 4.297 ± 0.879
0.859AspMet: 0.859 ± 0.233
1.289AspAsn: 1.289 ± 0.009
3.008AspPro: 3.008 ± 0.888
1.719AspGln: 1.719 ± 0.215
3.868AspArg: 3.868 ± 0.655
3.008AspSer: 3.008 ± 1.156
0.859AspThr: 0.859 ± 0.448
6.446AspVal: 6.446 ± 1.318
0.43AspTrp: 0.43 ± 0.224
4.727AspTyr: 4.727 ± 0.421
0.0AspXaa: 0.0 ± 0.0
Glu
3.008GluAla: 3.008 ± 0.206
0.0GluCys: 0.0 ± 0.0
2.578GluAsp: 2.578 ± 1.345
1.289GluGlu: 1.289 ± 0.009
2.578GluPhe: 2.578 ± 2.062
4.727GluGly: 4.727 ± 0.26
1.289GluHis: 1.289 ± 0.009
4.727GluIle: 4.727 ± 0.26
2.149GluLys: 2.149 ± 0.242
4.727GluLeu: 4.727 ± 0.941
1.289GluMet: 1.289 ± 0.69
1.289GluAsn: 1.289 ± 0.672
2.578GluPro: 2.578 ± 0.699
1.719GluGln: 1.719 ± 1.147
3.438GluArg: 3.438 ± 1.112
2.149GluSer: 2.149 ± 0.439
4.727GluThr: 4.727 ± 0.941
1.719GluVal: 1.719 ± 0.466
1.289GluTrp: 1.289 ± 0.69
2.149GluTyr: 2.149 ± 0.439
0.0GluXaa: 0.0 ± 0.0
Phe
0.859PheAla: 0.859 ± 0.233
0.43PheCys: 0.43 ± 0.224
5.157PheAsp: 5.157 ± 1.398
2.578PheGlu: 2.578 ± 0.663
0.0PhePhe: 0.0 ± 0.0
3.438PheGly: 3.438 ± 0.251
2.149PheHis: 2.149 ± 0.439
1.289PheIle: 1.289 ± 1.372
2.149PheLys: 2.149 ± 0.242
1.719PheLeu: 1.719 ± 0.466
1.289PheMet: 1.289 ± 0.69
0.0PheAsn: 0.0 ± 0.0
1.719PhePro: 1.719 ± 0.466
1.719PheGln: 1.719 ± 0.215
1.289PheArg: 1.289 ± 0.672
2.149PheSer: 2.149 ± 0.242
1.289PheThr: 1.289 ± 0.672
2.149PheVal: 2.149 ± 1.121
0.0PheTrp: 0.0 ± 0.0
1.719PheTyr: 1.719 ± 0.896
0.0PheXaa: 0.0 ± 0.0
Gly
3.008GlyAla: 3.008 ± 1.569
0.0GlyCys: 0.0 ± 0.0
4.297GlyAsp: 4.297 ± 0.197
2.578GlyGlu: 2.578 ± 0.018
3.008GlyPhe: 3.008 ± 0.888
3.008GlyGly: 3.008 ± 0.206
1.719GlyHis: 1.719 ± 1.829
2.578GlyIle: 2.578 ± 1.38
2.578GlyLys: 2.578 ± 2.062
7.306GlyLeu: 7.306 ± 1.085
1.289GlyMet: 1.289 ± 0.009
5.157GlyAsn: 5.157 ± 0.646
4.297GlyPro: 4.297 ± 0.879
2.578GlyGln: 2.578 ± 1.345
3.008GlyArg: 3.008 ± 0.206
3.868GlySer: 3.868 ± 0.027
6.876GlyThr: 6.876 ± 2.223
6.446GlyVal: 6.446 ± 0.726
2.578GlyTrp: 2.578 ± 1.38
3.008GlyTyr: 3.008 ± 0.206
0.0GlyXaa: 0.0 ± 0.0
His
0.43HisAla: 0.43 ± 0.224
0.0HisCys: 0.0 ± 0.0
0.43HisAsp: 0.43 ± 0.224
0.43HisGlu: 0.43 ± 0.224
0.859HisPhe: 0.859 ± 0.233
1.289HisGly: 1.289 ± 0.69
0.0HisHis: 0.0 ± 0.0
0.859HisIle: 0.859 ± 0.914
0.43HisLys: 0.43 ± 0.224
0.43HisLeu: 0.43 ± 0.224
0.859HisMet: 0.859 ± 0.233
0.43HisAsn: 0.43 ± 0.224
0.43HisPro: 0.43 ± 0.224
0.43HisGln: 0.43 ± 0.457
0.43HisArg: 0.43 ± 0.457
0.859HisSer: 0.859 ± 0.233
1.289HisThr: 1.289 ± 0.009
3.008HisVal: 3.008 ± 0.206
0.43HisTrp: 0.43 ± 0.224
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
6.016IleAla: 6.016 ± 0.95
0.43IleCys: 0.43 ± 0.457
3.438IleAsp: 3.438 ± 0.251
3.008IleGlu: 3.008 ± 0.475
1.289IlePhe: 1.289 ± 0.672
4.297IleGly: 4.297 ± 0.484
0.43IleHis: 0.43 ± 0.224
1.719IleIle: 1.719 ± 1.147
3.438IleLys: 3.438 ± 0.932
4.297IleLeu: 4.297 ± 0.484
1.289IleMet: 1.289 ± 0.009
3.008IleAsn: 3.008 ± 0.206
4.727IlePro: 4.727 ± 0.941
3.008IleGln: 3.008 ± 0.888
3.438IleArg: 3.438 ± 1.112
5.157IleSer: 5.157 ± 1.398
3.008IleThr: 3.008 ± 0.206
4.297IleVal: 4.297 ± 1.165
1.719IleTrp: 1.719 ± 0.466
0.43IleTyr: 0.43 ± 0.457
0.0IleXaa: 0.0 ± 0.0
Lys
2.578LysAla: 2.578 ± 0.663
0.0LysCys: 0.0 ± 0.0
1.289LysAsp: 1.289 ± 0.69
1.289LysGlu: 1.289 ± 0.009
1.719LysPhe: 1.719 ± 0.215
3.008LysGly: 3.008 ± 1.156
0.859LysHis: 0.859 ± 0.233
2.578LysIle: 2.578 ± 1.38
3.008LysLys: 3.008 ± 0.888
2.578LysLeu: 2.578 ± 0.699
0.859LysMet: 0.859 ± 0.448
1.719LysAsn: 1.719 ± 0.466
3.438LysPro: 3.438 ± 0.932
2.578LysGln: 2.578 ± 0.018
3.438LysArg: 3.438 ± 2.976
2.578LysSer: 2.578 ± 1.345
3.438LysThr: 3.438 ± 0.932
4.727LysVal: 4.727 ± 0.421
0.859LysTrp: 0.859 ± 0.914
2.149LysTyr: 2.149 ± 0.242
0.0LysXaa: 0.0 ± 0.0
Leu
5.587LeuAla: 5.587 ± 1.174
1.719LeuCys: 1.719 ± 0.466
4.297LeuAsp: 4.297 ± 0.197
3.868LeuGlu: 3.868 ± 0.027
2.149LeuPhe: 2.149 ± 0.439
3.438LeuGly: 3.438 ± 1.112
0.859LeuHis: 0.859 ± 0.233
5.157LeuIle: 5.157 ± 0.036
2.149LeuLys: 2.149 ± 0.439
4.727LeuLeu: 4.727 ± 0.421
3.008LeuMet: 3.008 ± 0.475
4.297LeuAsn: 4.297 ± 0.197
6.876LeuPro: 6.876 ± 1.542
3.008LeuGln: 3.008 ± 1.569
4.727LeuArg: 4.727 ± 1.622
8.595LeuSer: 8.595 ± 1.076
5.157LeuThr: 5.157 ± 0.036
4.727LeuVal: 4.727 ± 0.941
0.859LeuTrp: 0.859 ± 0.233
3.438LeuTyr: 3.438 ± 0.932
0.0LeuXaa: 0.0 ± 0.0
Met
1.719MetAla: 1.719 ± 0.466
1.289MetCys: 1.289 ± 0.69
2.149MetAsp: 2.149 ± 0.242
1.289MetGlu: 1.289 ± 0.009
0.859MetPhe: 0.859 ± 0.233
1.289MetGly: 1.289 ± 0.69
0.859MetHis: 0.859 ± 0.448
2.149MetIle: 2.149 ± 0.242
0.859MetLys: 0.859 ± 0.233
2.578MetLeu: 2.578 ± 0.699
0.43MetMet: 0.43 ± 0.457
1.289MetAsn: 1.289 ± 0.009
1.719MetPro: 1.719 ± 0.466
0.0MetGln: 0.0 ± 0.0
0.43MetArg: 0.43 ± 0.457
3.438MetSer: 3.438 ± 0.251
1.719MetThr: 1.719 ± 0.466
2.578MetVal: 2.578 ± 1.345
0.43MetTrp: 0.43 ± 0.457
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
6.446AsnAla: 6.446 ± 0.726
1.289AsnCys: 1.289 ± 0.69
1.719AsnAsp: 1.719 ± 0.466
0.859AsnGlu: 0.859 ± 0.448
0.859AsnPhe: 0.859 ± 0.233
3.008AsnGly: 3.008 ± 0.888
1.289AsnHis: 1.289 ± 0.009
3.438AsnIle: 3.438 ± 0.932
2.578AsnLys: 2.578 ± 1.345
2.149AsnLeu: 2.149 ± 1.605
1.719AsnMet: 1.719 ± 1.147
0.859AsnAsn: 0.859 ± 0.448
3.008AsnPro: 3.008 ± 0.888
1.289AsnGln: 1.289 ± 0.009
1.719AsnArg: 1.719 ± 0.896
3.868AsnSer: 3.868 ± 0.655
3.438AsnThr: 3.438 ± 1.112
3.008AsnVal: 3.008 ± 0.888
1.289AsnTrp: 1.289 ± 0.69
2.149AsnTyr: 2.149 ± 1.121
0.0AsnXaa: 0.0 ± 0.0
Pro
3.868ProAla: 3.868 ± 0.655
0.0ProCys: 0.0 ± 0.0
1.719ProAsp: 1.719 ± 0.466
6.876ProGlu: 6.876 ± 1.183
3.008ProPhe: 3.008 ± 0.206
4.727ProGly: 4.727 ± 1.784
0.0ProHis: 0.0 ± 0.0
2.149ProIle: 2.149 ± 0.242
0.859ProLys: 0.859 ± 0.233
5.157ProLeu: 5.157 ± 1.327
1.719ProMet: 1.719 ± 0.215
2.578ProAsn: 2.578 ± 1.345
5.157ProPro: 5.157 ± 2.008
0.859ProGln: 0.859 ± 0.448
3.008ProArg: 3.008 ± 0.475
2.149ProSer: 2.149 ± 0.242
8.595ProThr: 8.595 ± 1.757
3.868ProVal: 3.868 ± 1.336
1.289ProTrp: 1.289 ± 0.69
2.578ProTyr: 2.578 ± 0.018
0.0ProXaa: 0.0 ± 0.0
Gln
3.868GlnAla: 3.868 ± 0.655
0.0GlnCys: 0.0 ± 0.0
0.859GlnAsp: 0.859 ± 0.233
1.719GlnGlu: 1.719 ± 0.466
1.719GlnPhe: 1.719 ± 1.147
2.149GlnGly: 2.149 ± 1.121
0.43GlnHis: 0.43 ± 0.457
3.438GlnIle: 3.438 ± 1.112
1.289GlnLys: 1.289 ± 0.009
2.149GlnLeu: 2.149 ± 0.242
1.289GlnMet: 1.289 ± 0.672
0.859GlnAsn: 0.859 ± 0.448
2.578GlnPro: 2.578 ± 0.018
1.719GlnGln: 1.719 ± 1.147
2.578GlnArg: 2.578 ± 0.018
1.289GlnSer: 1.289 ± 0.009
1.719GlnThr: 1.719 ± 0.215
1.719GlnVal: 1.719 ± 0.896
0.43GlnTrp: 0.43 ± 0.457
2.578GlnTyr: 2.578 ± 0.018
0.0GlnXaa: 0.0 ± 0.0
Arg
2.578ArgAla: 2.578 ± 0.699
1.289ArgCys: 1.289 ± 0.009
1.289ArgAsp: 1.289 ± 0.009
3.868ArgGlu: 3.868 ± 0.027
2.578ArgPhe: 2.578 ± 0.663
3.438ArgGly: 3.438 ± 2.295
0.43ArgHis: 0.43 ± 0.224
3.868ArgIle: 3.868 ± 0.027
3.008ArgLys: 3.008 ± 1.156
4.297ArgLeu: 4.297 ± 0.879
1.289ArgMet: 1.289 ± 1.372
2.149ArgAsn: 2.149 ± 0.242
3.868ArgPro: 3.868 ± 1.336
2.578ArgGln: 2.578 ± 0.018
2.149ArgArg: 2.149 ± 2.286
3.868ArgSer: 3.868 ± 0.655
4.727ArgThr: 4.727 ± 1.622
3.438ArgVal: 3.438 ± 0.932
2.578ArgTrp: 2.578 ± 1.38
2.578ArgTyr: 2.578 ± 1.345
0.0ArgXaa: 0.0 ± 0.0
Ser
3.008SerAla: 3.008 ± 1.569
0.859SerCys: 0.859 ± 0.448
5.157SerAsp: 5.157 ± 0.646
1.719SerGlu: 1.719 ± 0.466
2.149SerPhe: 2.149 ± 0.439
7.735SerGly: 7.735 ± 0.628
0.859SerHis: 0.859 ± 0.448
3.868SerIle: 3.868 ± 1.389
3.868SerLys: 3.868 ± 0.027
6.016SerLeu: 6.016 ± 1.094
1.289SerMet: 1.289 ± 0.185
3.438SerAsn: 3.438 ± 2.295
3.868SerPro: 3.868 ± 0.708
3.438SerGln: 3.438 ± 0.251
3.438SerArg: 3.438 ± 0.43
5.157SerSer: 5.157 ± 0.036
5.587SerThr: 5.587 ± 1.551
5.587SerVal: 5.587 ± 0.87
0.859SerTrp: 0.859 ± 0.448
3.868SerTyr: 3.868 ± 1.389
0.0SerXaa: 0.0 ± 0.0
Thr
6.446ThrAla: 6.446 ± 0.637
1.289ThrCys: 1.289 ± 0.009
5.587ThrAsp: 5.587 ± 0.87
4.297ThrGlu: 4.297 ± 1.165
1.719ThrPhe: 1.719 ± 0.896
6.446ThrGly: 6.446 ± 0.637
0.0ThrHis: 0.0 ± 0.0
5.587ThrIle: 5.587 ± 0.87
0.859ThrLys: 0.859 ± 0.233
5.587ThrLeu: 5.587 ± 1.174
2.149ThrMet: 2.149 ± 0.439
3.868ThrAsn: 3.868 ± 0.655
4.297ThrPro: 4.297 ± 1.56
1.289ThrGln: 1.289 ± 0.672
3.868ThrArg: 3.868 ± 1.389
6.876ThrSer: 6.876 ± 0.861
6.876ThrThr: 6.876 ± 1.542
7.735ThrVal: 7.735 ± 1.99
0.859ThrTrp: 0.859 ± 0.448
1.719ThrTyr: 1.719 ± 0.466
0.0ThrXaa: 0.0 ± 0.0
Val
6.016ValAla: 6.016 ± 1.775
0.43ValCys: 0.43 ± 0.224
4.297ValAsp: 4.297 ± 0.197
4.727ValGlu: 4.727 ± 0.26
3.008ValPhe: 3.008 ± 0.206
6.876ValGly: 6.876 ± 2.223
0.43ValHis: 0.43 ± 0.224
3.008ValIle: 3.008 ± 1.569
4.727ValLys: 4.727 ± 0.941
4.297ValLeu: 4.297 ± 0.879
2.149ValMet: 2.149 ± 0.923
4.297ValAsn: 4.297 ± 0.879
4.727ValPro: 4.727 ± 0.421
2.578ValGln: 2.578 ± 0.018
5.157ValArg: 5.157 ± 0.717
8.595ValSer: 8.595 ± 1.076
4.297ValThr: 4.297 ± 1.165
4.727ValVal: 4.727 ± 1.784
2.578ValTrp: 2.578 ± 0.663
1.289ValTyr: 1.289 ± 0.009
0.0ValXaa: 0.0 ± 0.0
Trp
1.289TrpAla: 1.289 ± 0.009
0.43TrpCys: 0.43 ± 0.224
1.719TrpAsp: 1.719 ± 1.147
0.43TrpGlu: 0.43 ± 0.457
0.43TrpPhe: 0.43 ± 0.224
1.289TrpGly: 1.289 ± 0.672
0.43TrpHis: 0.43 ± 0.457
0.0TrpIle: 0.0 ± 0.0
0.43TrpLys: 0.43 ± 0.457
3.008TrpLeu: 3.008 ± 0.475
0.859TrpMet: 0.859 ± 0.448
1.719TrpAsn: 1.719 ± 1.147
0.43TrpPro: 0.43 ± 0.224
0.859TrpGln: 0.859 ± 0.914
1.719TrpArg: 1.719 ± 0.466
0.859TrpSer: 0.859 ± 0.914
1.289TrpThr: 1.289 ± 0.009
1.719TrpVal: 1.719 ± 0.215
0.0TrpTrp: 0.0 ± 0.0
0.859TrpTyr: 0.859 ± 0.233
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.727TyrAla: 4.727 ± 0.26
0.859TyrCys: 0.859 ± 0.448
0.43TyrAsp: 0.43 ± 0.224
1.289TyrGlu: 1.289 ± 0.672
1.719TyrPhe: 1.719 ± 0.215
2.149TyrGly: 2.149 ± 0.439
0.43TyrHis: 0.43 ± 0.224
2.578TyrIle: 2.578 ± 0.018
1.289TyrLys: 1.289 ± 0.69
3.868TyrLeu: 3.868 ± 0.027
0.859TyrMet: 0.859 ± 0.704
2.578TyrAsn: 2.578 ± 0.663
1.289TyrPro: 1.289 ± 0.672
2.149TyrGln: 2.149 ± 0.923
1.719TyrArg: 1.719 ± 0.215
2.149TyrSer: 2.149 ± 0.242
4.727TyrThr: 4.727 ± 1.103
4.297TyrVal: 4.297 ± 1.165
0.0TyrTrp: 0.0 ± 0.0
0.859TyrTyr: 0.859 ± 0.233
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2328 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski