Amino acid dipepetide frequency for Wuhan millipede virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.251AlaAla: 4.251 ± 0.194
2.289AlaCys: 2.289 ± 0.632
1.962AlaAsp: 1.962 ± 0.826
3.924AlaGlu: 3.924 ± 1.54
5.232AlaPhe: 5.232 ± 0.351
4.251AlaGly: 4.251 ± 0.832
2.616AlaHis: 2.616 ± 0.814
4.578AlaIle: 4.578 ± 0.651
4.578AlaLys: 4.578 ± 0.626
8.829AlaLeu: 8.829 ± 0.206
0.981AlaMet: 0.981 ± 0.647
3.597AlaAsn: 3.597 ± 1.196
4.251AlaPro: 4.251 ± 0.832
1.308AlaGln: 1.308 ± 0.088
2.289AlaArg: 2.289 ± 0.006
5.559AlaSer: 5.559 ± 2.022
5.886AlaThr: 5.886 ± 2.479
2.943AlaVal: 2.943 ± 0.357
0.654AlaTrp: 0.654 ± 0.363
1.635AlaTyr: 1.635 ± 0.269
0.0AlaXaa: 0.0 ± 0.0
Cys
1.308CysAla: 1.308 ± 0.726
0.327CysCys: 0.327 ± 0.182
0.327CysAsp: 0.327 ± 0.182
0.981CysGlu: 0.981 ± 0.545
0.327CysPhe: 0.327 ± 0.457
3.27CysGly: 3.27 ± 1.177
0.0CysHis: 0.0 ± 0.0
1.962CysIle: 1.962 ± 1.089
0.654CysLys: 0.654 ± 0.363
0.981CysLeu: 0.981 ± 0.545
0.981CysMet: 0.981 ± 0.732
0.327CysAsn: 0.327 ± 0.182
1.962CysPro: 1.962 ± 0.188
0.327CysGln: 0.327 ± 0.182
1.308CysArg: 1.308 ± 0.088
0.327CysSer: 0.327 ± 0.182
0.327CysThr: 0.327 ± 0.457
0.654CysVal: 0.654 ± 0.363
0.0CysTrp: 0.0 ± 0.0
0.981CysTyr: 0.981 ± 0.094
0.0CysXaa: 0.0 ± 0.0
Asp
5.559AspAla: 5.559 ± 1.171
0.327AspCys: 0.327 ± 0.182
3.27AspAsp: 3.27 ± 0.539
4.905AspGlu: 4.905 ± 0.808
5.232AspPhe: 5.232 ± 2.204
4.251AspGly: 4.251 ± 0.445
0.327AspHis: 0.327 ± 0.182
3.27AspIle: 3.27 ± 2.016
3.597AspLys: 3.597 ± 0.72
2.616AspLeu: 2.616 ± 0.463
0.327AspMet: 0.327 ± 0.182
2.616AspAsn: 2.616 ± 0.463
1.962AspPro: 1.962 ± 0.451
1.308AspGln: 1.308 ± 0.726
1.308AspArg: 1.308 ± 0.551
2.943AspSer: 2.943 ± 0.996
2.943AspThr: 2.943 ± 0.282
2.943AspVal: 2.943 ± 0.357
0.981AspTrp: 0.981 ± 0.545
0.981AspTyr: 0.981 ± 0.732
0.0AspXaa: 0.0 ± 0.0
Glu
2.943GluAla: 2.943 ± 1.634
0.0GluCys: 0.0 ± 0.0
2.943GluAsp: 2.943 ± 0.282
4.251GluGlu: 4.251 ± 0.445
2.943GluPhe: 2.943 ± 0.282
3.924GluGly: 3.924 ± 0.375
1.635GluHis: 1.635 ± 0.269
7.848GluIle: 7.848 ± 2.442
3.597GluLys: 3.597 ± 1.997
6.213GluLeu: 6.213 ± 0.382
2.616GluMet: 2.616 ± 0.175
2.289GluAsn: 2.289 ± 1.271
1.308GluPro: 1.308 ± 0.088
4.251GluGln: 4.251 ± 1.471
5.232GluArg: 5.232 ± 0.351
2.289GluSer: 2.289 ± 0.632
2.943GluThr: 2.943 ± 1.634
4.905GluVal: 4.905 ± 0.169
1.308GluTrp: 1.308 ± 0.726
2.289GluTyr: 2.289 ± 0.632
0.0GluXaa: 0.0 ± 0.0
Phe
3.597PheAla: 3.597 ± 1.196
1.308PheCys: 1.308 ± 0.088
4.251PheAsp: 4.251 ± 1.083
3.924PheGlu: 3.924 ± 0.375
3.597PhePhe: 3.597 ± 0.082
5.232PheGly: 5.232 ± 0.989
1.635PheHis: 1.635 ± 0.369
3.27PheIle: 3.27 ± 1.177
2.289PheLys: 2.289 ± 0.006
1.962PheLeu: 1.962 ± 0.451
0.654PheMet: 0.654 ± 0.363
2.616PheAsn: 2.616 ± 0.463
2.289PhePro: 2.289 ± 0.645
1.635PheGln: 1.635 ± 1.008
2.616PheArg: 2.616 ± 0.463
3.27PheSer: 3.27 ± 1.377
4.251PheThr: 4.251 ± 0.832
2.289PheVal: 2.289 ± 0.006
0.0PheTrp: 0.0 ± 0.0
1.962PheTyr: 1.962 ± 0.188
0.0PheXaa: 0.0 ± 0.0
Gly
2.289GlyAla: 2.289 ± 0.632
0.654GlyCys: 0.654 ± 0.275
3.27GlyAsp: 3.27 ± 0.539
4.578GlyGlu: 4.578 ± 0.626
2.616GlyPhe: 2.616 ± 0.814
1.962GlyGly: 1.962 ± 0.826
0.981GlyHis: 0.981 ± 0.094
3.597GlyIle: 3.597 ± 0.72
4.251GlyLys: 4.251 ± 1.722
3.27GlyLeu: 3.27 ± 1.177
0.981GlyMet: 0.981 ± 0.545
1.635GlyAsn: 1.635 ± 0.369
2.289GlyPro: 2.289 ± 0.645
2.289GlyGln: 2.289 ± 0.006
1.635GlyArg: 1.635 ± 0.269
3.597GlySer: 3.597 ± 1.196
7.848GlyThr: 7.848 ± 3.944
4.578GlyVal: 4.578 ± 1.29
0.654GlyTrp: 0.654 ± 0.363
0.654GlyTyr: 0.654 ± 0.275
0.0GlyXaa: 0.0 ± 0.0
His
1.635HisAla: 1.635 ± 0.269
0.981HisCys: 0.981 ± 0.094
1.308HisAsp: 1.308 ± 0.551
2.616HisGlu: 2.616 ± 1.453
1.308HisPhe: 1.308 ± 0.726
0.654HisGly: 0.654 ± 0.363
0.327HisHis: 0.327 ± 0.182
0.654HisIle: 0.654 ± 0.275
0.981HisLys: 0.981 ± 0.094
2.289HisLeu: 2.289 ± 0.632
0.654HisMet: 0.654 ± 0.275
0.654HisAsn: 0.654 ± 0.914
1.635HisPro: 1.635 ± 0.908
1.635HisGln: 1.635 ± 0.369
1.635HisArg: 1.635 ± 0.269
0.654HisSer: 0.654 ± 0.275
1.635HisThr: 1.635 ± 0.269
2.616HisVal: 2.616 ± 0.814
0.0HisTrp: 0.0 ± 0.0
1.308HisTyr: 1.308 ± 0.088
0.0HisXaa: 0.0 ± 0.0
Ile
6.213IleAla: 6.213 ± 0.257
0.981IleCys: 0.981 ± 0.545
1.962IleAsp: 1.962 ± 0.826
1.635IleGlu: 1.635 ± 0.908
2.616IlePhe: 2.616 ± 1.453
3.597IleGly: 3.597 ± 0.082
1.635IleHis: 1.635 ± 0.369
3.924IleIle: 3.924 ± 0.902
3.597IleLys: 3.597 ± 0.557
5.559IleLeu: 5.559 ± 1.171
1.962IleMet: 1.962 ± 0.685
4.905IleAsn: 4.905 ± 1.446
3.27IlePro: 3.27 ± 0.1
2.943IleGln: 2.943 ± 0.282
4.905IleArg: 4.905 ± 1.446
3.924IleSer: 3.924 ± 1.014
7.194IleThr: 7.194 ± 0.476
3.924IleVal: 3.924 ± 0.263
0.981IleTrp: 0.981 ± 0.545
2.943IleTyr: 2.943 ± 0.357
0.0IleXaa: 0.0 ± 0.0
Lys
4.578LysAla: 4.578 ± 0.012
0.981LysCys: 0.981 ± 0.545
4.251LysAsp: 4.251 ± 1.722
1.962LysGlu: 1.962 ± 0.188
4.251LysPhe: 4.251 ± 1.722
0.981LysGly: 0.981 ± 0.545
2.289LysHis: 2.289 ± 0.006
6.213LysIle: 6.213 ± 0.896
1.308LysLys: 1.308 ± 0.726
3.597LysLeu: 3.597 ± 0.72
2.943LysMet: 2.943 ± 1.634
2.943LysAsn: 2.943 ± 0.357
1.308LysPro: 1.308 ± 0.088
3.27LysGln: 3.27 ± 0.1
1.962LysArg: 1.962 ± 1.089
3.597LysSer: 3.597 ± 0.557
4.905LysThr: 4.905 ± 0.469
3.597LysVal: 3.597 ± 1.997
0.981LysTrp: 0.981 ± 0.732
1.308LysTyr: 1.308 ± 0.088
0.0LysXaa: 0.0 ± 0.0
Leu
5.886LeuAla: 5.886 ± 0.075
0.981LeuCys: 0.981 ± 0.732
7.521LeuAsp: 7.521 ± 2.26
6.213LeuGlu: 6.213 ± 0.257
3.924LeuPhe: 3.924 ± 1.54
3.597LeuGly: 3.597 ± 1.196
1.962LeuHis: 1.962 ± 0.451
3.924LeuIle: 3.924 ± 0.263
5.559LeuLys: 5.559 ± 0.532
6.213LeuLeu: 6.213 ± 0.896
2.289LeuMet: 2.289 ± 0.006
3.924LeuAsn: 3.924 ± 1.653
1.962LeuPro: 1.962 ± 1.465
2.943LeuGln: 2.943 ± 0.282
2.943LeuArg: 2.943 ± 0.357
5.232LeuSer: 5.232 ± 0.989
5.886LeuThr: 5.886 ± 0.563
6.213LeuVal: 6.213 ± 2.297
0.654LeuTrp: 0.654 ± 0.363
4.578LeuTyr: 4.578 ± 0.626
0.0LeuXaa: 0.0 ± 0.0
Met
2.616MetAla: 2.616 ± 0.175
0.327MetCys: 0.327 ± 0.182
0.981MetAsp: 0.981 ± 0.732
1.962MetGlu: 1.962 ± 1.089
0.327MetPhe: 0.327 ± 0.457
2.289MetGly: 2.289 ± 0.645
0.654MetHis: 0.654 ± 0.363
2.616MetIle: 2.616 ± 0.814
1.635MetLys: 1.635 ± 0.269
0.327MetLeu: 0.327 ± 0.182
0.327MetMet: 0.327 ± 0.182
0.981MetAsn: 0.981 ± 0.094
2.616MetPro: 2.616 ± 0.175
2.289MetGln: 2.289 ± 0.632
2.616MetArg: 2.616 ± 0.814
1.635MetSer: 1.635 ± 0.269
1.962MetThr: 1.962 ± 0.188
0.327MetVal: 0.327 ± 0.182
0.0MetTrp: 0.0 ± 0.0
1.308MetTyr: 1.308 ± 0.726
0.0MetXaa: 0.0 ± 0.0
Asn
3.597AsnAla: 3.597 ± 2.473
0.981AsnCys: 0.981 ± 0.545
2.289AsnAsp: 2.289 ± 1.271
3.27AsnGlu: 3.27 ± 1.177
3.924AsnPhe: 3.924 ± 1.014
1.635AsnGly: 1.635 ± 0.269
1.308AsnHis: 1.308 ± 0.088
4.905AsnIle: 4.905 ± 0.169
1.635AsnLys: 1.635 ± 0.369
7.848AsnLeu: 7.848 ± 1.39
2.616AsnMet: 2.616 ± 0.463
3.597AsnAsn: 3.597 ± 1.196
1.635AsnPro: 1.635 ± 0.369
2.616AsnGln: 2.616 ± 1.102
1.962AsnArg: 1.962 ± 1.089
4.251AsnSer: 4.251 ± 0.832
3.597AsnThr: 3.597 ± 3.111
2.943AsnVal: 2.943 ± 0.996
0.0AsnTrp: 0.0 ± 0.0
1.635AsnTyr: 1.635 ± 0.369
0.0AsnXaa: 0.0 ± 0.0
Pro
3.27ProAla: 3.27 ± 0.1
0.654ProCys: 0.654 ± 0.275
2.289ProAsp: 2.289 ± 1.922
3.924ProGlu: 3.924 ± 0.375
1.962ProPhe: 1.962 ± 1.465
1.962ProGly: 1.962 ± 0.188
0.981ProHis: 0.981 ± 0.094
1.962ProIle: 1.962 ± 0.188
1.635ProLys: 1.635 ± 0.908
4.905ProLeu: 4.905 ± 0.469
0.981ProMet: 0.981 ± 0.545
2.616ProAsn: 2.616 ± 1.74
0.327ProPro: 0.327 ± 0.182
1.635ProGln: 1.635 ± 0.369
1.962ProArg: 1.962 ± 0.188
2.289ProSer: 2.289 ± 0.632
2.943ProThr: 2.943 ± 0.282
1.635ProVal: 1.635 ± 0.369
0.981ProTrp: 0.981 ± 0.094
1.308ProTyr: 1.308 ± 0.088
0.0ProXaa: 0.0 ± 0.0
Gln
2.289GlnAla: 2.289 ± 0.645
0.654GlnCys: 0.654 ± 0.363
1.962GlnAsp: 1.962 ± 0.826
2.616GlnGlu: 2.616 ± 0.814
1.635GlnPhe: 1.635 ± 0.369
0.654GlnGly: 0.654 ± 0.275
1.308GlnHis: 1.308 ± 0.726
4.251GlnIle: 4.251 ± 0.194
3.924GlnLys: 3.924 ± 0.263
2.943GlnLeu: 2.943 ± 0.92
1.308GlnMet: 1.308 ± 0.726
1.635GlnAsn: 1.635 ± 0.269
1.962GlnPro: 1.962 ± 0.188
1.635GlnGln: 1.635 ± 0.908
2.943GlnArg: 2.943 ± 0.357
3.597GlnSer: 3.597 ± 2.473
1.308GlnThr: 1.308 ± 0.551
3.597GlnVal: 3.597 ± 0.082
0.327GlnTrp: 0.327 ± 0.182
0.654GlnTyr: 0.654 ± 0.363
0.0GlnXaa: 0.0 ± 0.0
Arg
3.27ArgAla: 3.27 ± 1.377
0.654ArgCys: 0.654 ± 0.363
1.635ArgAsp: 1.635 ± 0.269
4.251ArgGlu: 4.251 ± 2.36
3.27ArgPhe: 3.27 ± 0.1
4.251ArgGly: 4.251 ± 0.445
0.654ArgHis: 0.654 ± 0.275
2.289ArgIle: 2.289 ± 0.006
4.251ArgLys: 4.251 ± 1.722
5.232ArgLeu: 5.232 ± 2.267
0.654ArgMet: 0.654 ± 0.363
4.578ArgAsn: 4.578 ± 0.012
1.635ArgPro: 1.635 ± 0.369
1.308ArgGln: 1.308 ± 0.726
2.943ArgArg: 2.943 ± 0.996
1.962ArgSer: 1.962 ± 0.826
2.943ArgThr: 2.943 ± 0.357
1.962ArgVal: 1.962 ± 0.826
1.308ArgTrp: 1.308 ± 0.726
1.308ArgTyr: 1.308 ± 0.551
0.0ArgXaa: 0.0 ± 0.0
Ser
4.578SerAla: 4.578 ± 0.012
1.962SerCys: 1.962 ± 0.451
2.289SerAsp: 2.289 ± 0.645
2.943SerGlu: 2.943 ± 0.282
3.27SerPhe: 3.27 ± 2.016
2.616SerGly: 2.616 ± 1.102
0.981SerHis: 0.981 ± 0.094
4.578SerIle: 4.578 ± 0.012
3.924SerLys: 3.924 ± 0.375
5.559SerLeu: 5.559 ± 2.022
2.616SerMet: 2.616 ± 0.814
2.289SerAsn: 2.289 ± 0.006
1.635SerPro: 1.635 ± 1.008
3.924SerGln: 3.924 ± 0.263
2.289SerArg: 2.289 ± 0.632
4.905SerSer: 4.905 ± 0.169
2.616SerThr: 2.616 ± 1.74
3.924SerVal: 3.924 ± 1.014
0.0SerTrp: 0.0 ± 0.0
3.597SerTyr: 3.597 ± 0.557
0.0SerXaa: 0.0 ± 0.0
Thr
4.251ThrAla: 4.251 ± 0.832
0.0ThrCys: 0.0 ± 0.0
3.924ThrAsp: 3.924 ± 0.375
5.886ThrGlu: 5.886 ± 0.563
3.27ThrPhe: 3.27 ± 0.739
3.924ThrGly: 3.924 ± 0.375
2.289ThrHis: 2.289 ± 0.645
4.578ThrIle: 4.578 ± 0.012
4.578ThrLys: 4.578 ± 0.012
5.232ThrLeu: 5.232 ± 0.926
1.962ThrMet: 1.962 ± 0.826
5.886ThrAsn: 5.886 ± 1.84
3.27ThrPro: 3.27 ± 0.1
2.943ThrGln: 2.943 ± 0.282
2.616ThrArg: 2.616 ± 0.463
3.27ThrSer: 3.27 ± 1.377
4.251ThrThr: 4.251 ± 0.445
5.886ThrVal: 5.886 ± 2.479
0.654ThrTrp: 0.654 ± 0.275
1.308ThrTyr: 1.308 ± 1.189
0.0ThrXaa: 0.0 ± 0.0
Val
5.559ValAla: 5.559 ± 0.745
1.635ValCys: 1.635 ± 0.908
2.943ValAsp: 2.943 ± 0.92
3.597ValGlu: 3.597 ± 1.196
0.981ValPhe: 0.981 ± 0.094
2.943ValGly: 2.943 ± 0.282
2.289ValHis: 2.289 ± 1.271
2.289ValIle: 2.289 ± 0.632
2.289ValLys: 2.289 ± 0.006
4.905ValLeu: 4.905 ± 0.169
1.635ValMet: 1.635 ± 1.008
4.905ValAsn: 4.905 ± 1.108
3.924ValPro: 3.924 ± 1.014
0.654ValGln: 0.654 ± 0.275
4.251ValArg: 4.251 ± 1.083
3.27ValSer: 3.27 ± 0.1
2.943ValThr: 2.943 ± 0.357
3.597ValVal: 3.597 ± 0.082
1.308ValTrp: 1.308 ± 0.551
4.251ValTyr: 4.251 ± 0.194
0.0ValXaa: 0.0 ± 0.0
Trp
1.308TrpAla: 1.308 ± 0.088
0.0TrpCys: 0.0 ± 0.0
1.635TrpAsp: 1.635 ± 0.269
0.0TrpGlu: 0.0 ± 0.0
0.654TrpPhe: 0.654 ± 0.275
0.0TrpGly: 0.0 ± 0.0
1.308TrpHis: 1.308 ± 0.726
0.981TrpIle: 0.981 ± 0.545
0.981TrpLys: 0.981 ± 0.545
0.327TrpLeu: 0.327 ± 0.457
0.0TrpMet: 0.0 ± 0.0
0.654TrpAsn: 0.654 ± 0.363
0.0TrpPro: 0.0 ± 0.0
0.981TrpGln: 0.981 ± 0.545
0.654TrpArg: 0.654 ± 0.275
1.308TrpSer: 1.308 ± 0.726
1.308TrpThr: 1.308 ± 0.088
0.327TrpVal: 0.327 ± 0.182
0.0TrpTrp: 0.0 ± 0.0
0.327TrpTyr: 0.327 ± 0.182
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.943TyrAla: 2.943 ± 0.92
1.635TyrCys: 1.635 ± 0.269
0.654TyrAsp: 0.654 ± 0.363
2.289TyrGlu: 2.289 ± 0.632
1.308TyrPhe: 1.308 ± 0.088
1.635TyrGly: 1.635 ± 0.269
0.0TyrHis: 0.0 ± 0.0
0.981TyrIle: 0.981 ± 0.094
1.962TyrLys: 1.962 ± 1.089
3.597TyrLeu: 3.597 ± 0.557
0.981TyrMet: 0.981 ± 0.545
3.597TyrAsn: 3.597 ± 0.082
0.981TyrPro: 0.981 ± 0.732
1.308TyrGln: 1.308 ± 0.088
2.289TyrArg: 2.289 ± 0.645
2.616TyrSer: 2.616 ± 1.102
2.289TyrThr: 2.289 ± 0.645
1.308TyrVal: 1.308 ± 0.551
1.962TyrTrp: 1.962 ± 1.089
0.327TyrTyr: 0.327 ± 0.457
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3059 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski