Amino acid dipeptide frequency for Wenzhou shrimp virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
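As an illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It is only a minimal sketch: the function name `dipeptide_per_mille`, the one-letter input sequences, and the choice to normalize by the total number of counted dipeptides are assumptions, and the exact normalization used by the database is not stated on this page.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for one-letter protein sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences (hypothetical, not taken from the proteome).
freqs = dipeptide_per_mille(["MKVLA", "AAKV"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The toy input yields seven dipeptides in total, of which "KV" occurs twice, so the frequencies sum to 1000‰.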

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
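The expected value quoted above follows from simple arithmetic, and the short sketch below reproduces it and labels a few values from the table as over- or underrepresented. The helper `classify` and the use of the 400-dipeptide baseline as the threshold are assumptions made for illustration; the page does not say which baseline drives the red and blue marking.

```python
# Uniform expectation in per mille for 20 and 21 residue types.
EXPECTED_PER_MILLE_20 = 1000.0 / (20 * 20)  # 400 dipeptides -> 2.5 per mille (0.25%)
EXPECTED_PER_MILLE_21 = 1000.0 / (21 * 21)  # 441 dipeptides -> about 2.268 per mille


def classify(value_per_mille, expected=EXPECTED_PER_MILLE_20):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# A few values copied from the table on this page (per mille).
examples = {"AlaAla": 5.403, "AlaCys": 0.416, "AlaLeu": 7.897, "CysPhe": 0.0}
labels = {pair: classify(value) for pair, value in examples.items()}
print(labels)
```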

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 5.403‰ corresponds to a fraction of 0.005403, or 0.5403%.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.403 ± 1.357
AlaCys: 0.416 ± 0.215
AlaAsp: 4.156 ± 2.173
AlaGlu: 3.325 ± 1.002
AlaPhe: 4.572 ± 0.205
AlaGly: 4.572 ± 1.237
AlaHis: 1.663 ± 0.861
AlaIle: 4.988 ± 0.421
AlaLys: 7.066 ± 1.498
AlaLeu: 7.897 ± 1.928
AlaMet: 2.078 ± 0.366
AlaAsn: 2.909 ± 1.508
AlaPro: 3.741 ± 0.946
AlaGln: 1.663 ± 1.302
AlaArg: 1.247 ± 0.646
AlaSer: 4.988 ± 1.142
AlaThr: 2.078 ± 1.087
AlaVal: 5.403 ± 0.636
AlaTrp: 0.831 ± 0.29
AlaTyr: 2.494 ± 0.871
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.663 ± 0.14
CysCys: 0.416 ± 0.215
CysAsp: 2.078 ± 1.077
CysGlu: 2.909 ± 0.786
CysPhe: 0.0 ± 0.0
CysGly: 1.247 ± 0.646
CysHis: 0.416 ± 0.215
CysIle: 0.416 ± 0.506
CysLys: 0.831 ± 0.431
CysLeu: 2.909 ± 0.656
CysMet: 0.831 ± 0.29
CysAsn: 1.247 ± 0.075
CysPro: 0.0 ± 0.0
CysGln: 0.416 ± 0.215
CysArg: 0.416 ± 0.215
CysSer: 1.247 ± 0.646
CysThr: 0.831 ± 0.431
CysVal: 0.831 ± 0.431
CysTrp: 0.0 ± 0.0
CysTyr: 0.416 ± 0.506
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.494 ± 0.15
AspCys: 2.078 ± 1.077
AspAsp: 4.988 ± 1.142
AspGlu: 4.156 ± 0.01
AspPhe: 5.403 ± 0.806
AspGly: 5.403 ± 0.636
AspHis: 0.831 ± 0.29
AspIle: 3.741 ± 0.225
AspLys: 2.909 ± 0.065
AspLeu: 4.572 ± 0.927
AspMet: 2.078 ± 0.186
AspAsn: 2.909 ± 1.377
AspPro: 2.909 ± 0.656
AspGln: 0.831 ± 0.29
AspArg: 2.078 ± 0.366
AspSer: 4.572 ± 0.516
AspThr: 5.403 ± 1.527
AspVal: 5.403 ± 1.527
AspTrp: 0.0 ± 0.0
AspTyr: 0.831 ± 0.431
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.403 ± 1.357
GluCys: 0.831 ± 0.29
GluAsp: 2.078 ± 0.366
GluGlu: 4.156 ± 0.711
GluPhe: 2.909 ± 0.065
GluGly: 1.247 ± 0.075
GluHis: 1.247 ± 0.646
GluIle: 2.909 ± 0.786
GluLys: 2.078 ± 0.356
GluLeu: 5.819 ± 1.573
GluMet: 2.078 ± 0.356
GluAsn: 2.078 ± 1.077
GluPro: 2.078 ± 0.366
GluGln: 1.663 ± 0.861
GluArg: 4.156 ± 1.432
GluSer: 1.663 ± 0.861
GluThr: 4.156 ± 0.731
GluVal: 5.403 ± 0.085
GluTrp: 1.663 ± 0.861
GluTyr: 2.078 ± 0.366
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.078 ± 0.356
PheCys: 1.663 ± 0.861
PheAsp: 3.741 ± 0.496
PheGlu: 2.494 ± 0.15
PhePhe: 2.078 ± 0.356
PheGly: 2.494 ± 0.871
PheHis: 1.247 ± 0.796
PheIle: 0.0 ± 0.0
PheLys: 1.247 ± 0.796
PheLeu: 4.156 ± 0.01
PheMet: 0.0 ± 0.182
PheAsn: 5.403 ± 0.085
PhePro: 2.909 ± 2.098
PheGln: 2.078 ± 0.356
PheArg: 2.909 ± 0.065
PheSer: 4.988 ± 0.421
PheThr: 3.325 ± 0.441
PheVal: 4.572 ± 0.205
PheTrp: 0.0 ± 0.0
PheTyr: 2.078 ± 0.356
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.572 ± 0.205
GlyCys: 1.247 ± 0.075
GlyAsp: 6.65 ± 0.16
GlyGlu: 4.156 ± 0.711
GlyPhe: 4.572 ± 0.205
GlyGly: 1.247 ± 0.796
GlyHis: 1.247 ± 0.646
GlyIle: 2.494 ± 0.571
GlyLys: 3.741 ± 1.217
GlyLeu: 4.156 ± 0.731
GlyMet: 0.831 ± 0.29
GlyAsn: 2.494 ± 0.15
GlyPro: 2.494 ± 1.593
GlyGln: 1.663 ± 0.861
GlyArg: 2.494 ± 0.871
GlySer: 3.741 ± 1.668
GlyThr: 1.663 ± 0.581
GlyVal: 4.988 ± 1.022
GlyTrp: 0.831 ± 0.431
GlyTyr: 2.909 ± 0.065
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.831 ± 0.29
HisCys: 0.416 ± 0.215
HisAsp: 3.325 ± 0.441
HisGlu: 0.831 ± 0.431
HisPhe: 0.416 ± 0.215
HisGly: 1.663 ± 0.861
HisHis: 0.831 ± 0.431
HisIle: 0.831 ± 0.431
HisLys: 0.416 ± 0.506
HisLeu: 2.494 ± 0.571
HisMet: 0.416 ± 0.215
HisAsn: 0.416 ± 0.215
HisPro: 0.831 ± 1.012
HisGln: 0.831 ± 0.431
HisArg: 0.416 ± 0.215
HisSer: 1.247 ± 0.075
HisThr: 1.663 ± 0.14
HisVal: 1.247 ± 0.075
HisTrp: 0.416 ± 0.215
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.325 ± 0.441
IleCys: 1.247 ± 0.075
IleAsp: 3.325 ± 1.002
IleGlu: 3.325 ± 0.281
IlePhe: 2.078 ± 0.356
IleGly: 1.663 ± 0.14
IleHis: 1.247 ± 0.075
IleIle: 1.247 ± 0.796
IleLys: 2.494 ± 0.871
IleLeu: 3.741 ± 1.217
IleMet: 1.247 ± 0.075
IleAsn: 2.494 ± 1.593
IlePro: 3.325 ± 1.162
IleGln: 2.078 ± 1.087
IleArg: 3.325 ± 1.002
IleSer: 4.572 ± 0.516
IleThr: 2.909 ± 0.065
IleVal: 4.572 ± 0.927
IleTrp: 0.831 ± 0.29
IleTyr: 2.909 ± 0.786
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.572 ± 1.648
LysCys: 0.416 ± 0.215
LysAsp: 1.247 ± 0.646
LysGlu: 3.325 ± 1.002
LysPhe: 3.325 ± 1.162
LysGly: 2.494 ± 0.15
LysHis: 1.663 ± 0.14
LysIle: 4.572 ± 0.205
LysLys: 3.325 ± 1.002
LysLeu: 3.741 ± 1.217
LysMet: 0.831 ± 0.431
LysAsn: 2.909 ± 0.786
LysPro: 4.572 ± 0.516
LysGln: 0.831 ± 0.29
LysArg: 2.494 ± 1.292
LysSer: 5.403 ± 0.085
LysThr: 5.403 ± 1.527
LysVal: 4.988 ± 1.022
LysTrp: 1.663 ± 0.581
LysTyr: 1.247 ± 0.075
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.65 ± 0.561
LeuCys: 0.416 ± 0.215
LeuAsp: 4.988 ± 0.421
LeuGlu: 4.988 ± 0.421
LeuPhe: 4.988 ± 1.863
LeuGly: 2.909 ± 1.508
LeuHis: 1.663 ± 0.861
LeuIle: 3.741 ± 1.668
LeuLys: 4.572 ± 0.927
LeuLeu: 4.988 ± 0.421
LeuMet: 2.078 ± 0.356
LeuAsn: 4.156 ± 0.731
LeuPro: 4.988 ± 1.022
LeuGln: 3.741 ± 0.496
LeuArg: 3.325 ± 0.281
LeuSer: 6.65 ± 1.282
LeuThr: 6.65 ± 0.881
LeuVal: 5.819 ± 0.13
LeuTrp: 2.078 ± 0.366
LeuTyr: 2.909 ± 0.065
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.416 ± 0.215
MetCys: 0.416 ± 0.215
MetAsp: 1.663 ± 0.581
MetGlu: 1.663 ± 0.14
MetPhe: 1.247 ± 0.646
MetGly: 3.325 ± 1.162
MetHis: 0.0 ± 0.0
MetIle: 1.247 ± 0.646
MetLys: 2.909 ± 0.786
MetLeu: 0.831 ± 0.431
MetMet: 1.663 ± 0.861
MetAsn: 1.247 ± 0.075
MetPro: 0.0 ± 0.0
MetGln: 1.663 ± 0.581
MetArg: 2.078 ± 1.087
MetSer: 0.831 ± 0.431
MetThr: 0.0 ± 0.0
MetVal: 2.494 ± 0.571
MetTrp: 0.416 ± 0.215
MetTyr: 1.663 ± 0.14
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.325 ± 1.162
AsnCys: 1.663 ± 0.861
AsnAsp: 1.247 ± 0.075
AsnGlu: 3.741 ± 0.225
AsnPhe: 2.494 ± 0.871
AsnGly: 3.741 ± 1.217
AsnHis: 1.663 ± 0.581
AsnIle: 3.325 ± 0.281
AsnLys: 2.078 ± 0.366
AsnLeu: 3.325 ± 1.002
AsnMet: 0.0 ± 0.0
AsnAsn: 1.663 ± 0.14
AsnPro: 2.494 ± 0.15
AsnGln: 3.741 ± 0.496
AsnArg: 1.663 ± 0.14
AsnSer: 3.741 ± 1.938
AsnThr: 2.494 ± 0.15
AsnVal: 4.156 ± 0.01
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.247 ± 0.796
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.078 ± 1.087
ProCys: 0.831 ± 0.29
ProAsp: 3.325 ± 1.883
ProGlu: 3.325 ± 1.002
ProPhe: 2.909 ± 2.098
ProGly: 4.572 ± 1.958
ProHis: 0.831 ± 0.431
ProIle: 4.156 ± 0.731
ProLys: 2.909 ± 1.508
ProLeu: 6.65 ± 1.602
ProMet: 0.831 ± 0.29
ProAsn: 2.494 ± 0.15
ProPro: 2.078 ± 1.087
ProGln: 1.247 ± 0.075
ProArg: 2.494 ± 0.871
ProSer: 3.741 ± 0.496
ProThr: 2.494 ± 2.314
ProVal: 4.572 ± 0.516
ProTrp: 0.416 ± 0.215
ProTyr: 1.247 ± 0.075
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.741 ± 1.938
GlnCys: 0.831 ± 0.29
GlnAsp: 2.078 ± 0.356
GlnGlu: 2.078 ± 1.087
GlnPhe: 1.663 ± 0.14
GlnGly: 2.494 ± 0.15
GlnHis: 0.416 ± 0.215
GlnIle: 2.078 ± 1.087
GlnLys: 1.247 ± 0.646
GlnLeu: 2.909 ± 1.377
GlnMet: 2.494 ± 0.15
GlnAsn: 0.416 ± 0.215
GlnPro: 2.494 ± 0.571
GlnGln: 2.494 ± 0.871
GlnArg: 1.663 ± 0.861
GlnSer: 2.494 ± 0.871
GlnThr: 3.325 ± 0.281
GlnVal: 3.325 ± 0.441
GlnTrp: 0.416 ± 0.215
GlnTyr: 0.831 ± 0.431
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.494 ± 0.15
ArgCys: 1.247 ± 0.075
ArgAsp: 2.494 ± 0.571
ArgGlu: 1.247 ± 0.646
ArgPhe: 2.078 ± 1.087
ArgGly: 3.741 ± 1.668
ArgHis: 0.416 ± 0.215
ArgIle: 2.494 ± 0.571
ArgLys: 2.494 ± 0.571
ArgLeu: 2.494 ± 0.15
ArgMet: 1.663 ± 0.14
ArgAsn: 1.247 ± 0.646
ArgPro: 3.741 ± 1.217
ArgGln: 2.909 ± 0.065
ArgArg: 4.156 ± 2.154
ArgSer: 1.663 ± 0.14
ArgThr: 3.325 ± 1.002
ArgVal: 4.572 ± 0.205
ArgTrp: 0.416 ± 0.506
ArgTyr: 1.663 ± 0.861
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.65 ± 1.602
SerCys: 1.247 ± 0.075
SerAsp: 2.909 ± 1.377
SerGlu: 3.325 ± 1.723
SerPhe: 2.909 ± 0.786
SerGly: 5.819 ± 0.13
SerHis: 0.0 ± 0.0
SerIle: 4.988 ± 0.3
SerLys: 4.156 ± 1.452
SerLeu: 4.572 ± 0.927
SerMet: 2.078 ± 0.366
SerAsn: 4.156 ± 0.731
SerPro: 4.572 ± 2.369
SerGln: 1.247 ± 0.646
SerArg: 3.325 ± 0.281
SerSer: 4.988 ± 0.3
SerThr: 1.663 ± 0.581
SerVal: 6.234 ± 1.067
SerTrp: 1.663 ± 0.581
SerTyr: 1.663 ± 0.14
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.741 ± 0.225
ThrCys: 1.247 ± 0.075
ThrAsp: 4.988 ± 1.022
ThrGlu: 1.247 ± 0.646
ThrPhe: 2.494 ± 0.15
ThrGly: 1.663 ± 0.581
ThrHis: 2.494 ± 0.871
ThrIle: 2.078 ± 0.356
ThrLys: 5.403 ± 2.248
ThrLeu: 4.156 ± 0.731
ThrMet: 1.247 ± 0.075
ThrAsn: 0.831 ± 0.29
ThrPro: 2.494 ± 0.871
ThrGln: 4.988 ± 1.022
ThrArg: 3.325 ± 0.281
ThrSer: 4.156 ± 0.711
ThrThr: 5.403 ± 0.806
ThrVal: 3.741 ± 1.668
ThrTrp: 0.416 ± 0.506
ThrTyr: 4.156 ± 0.731
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.897 ± 0.235
ValCys: 0.416 ± 0.215
ValAsp: 5.819 ± 0.591
ValGlu: 3.325 ± 1.002
ValPhe: 1.247 ± 0.075
ValGly: 5.403 ± 0.806
ValHis: 0.0 ± 0.0
ValIle: 4.572 ± 1.648
ValLys: 7.897 ± 0.486
ValLeu: 7.897 ± 0.486
ValMet: 1.663 ± 0.861
ValAsn: 5.819 ± 2.294
ValPro: 4.572 ± 2.679
ValGln: 3.325 ± 0.281
ValArg: 2.494 ± 0.15
ValSer: 6.234 ± 3.26
ValThr: 2.909 ± 0.065
ValVal: 4.572 ± 0.516
ValTrp: 0.416 ± 0.215
ValTyr: 2.909 ± 0.656
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.416 ± 0.215
TrpCys: 0.831 ± 0.431
TrpAsp: 0.831 ± 1.012
TrpGlu: 0.416 ± 0.215
TrpPhe: 0.416 ± 0.215
TrpGly: 0.0 ± 0.0
TrpHis: 0.416 ± 0.215
TrpIle: 0.831 ± 1.012
TrpLys: 0.831 ± 0.29
TrpLeu: 2.078 ± 0.356
TrpMet: 0.416 ± 0.215
TrpAsn: 0.831 ± 0.29
TrpPro: 0.416 ± 0.215
TrpGln: 0.0 ± 0.0
TrpArg: 0.831 ± 0.29
TrpSer: 0.416 ± 0.215
TrpThr: 0.831 ± 0.431
TrpVal: 0.416 ± 0.215
TrpTrp: 0.416 ± 0.215
TrpTyr: 1.663 ± 1.302
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.741 ± 0.496
TyrCys: 1.247 ± 0.075
TyrAsp: 1.663 ± 0.14
TyrGlu: 1.663 ± 0.581
TyrPhe: 2.078 ± 0.366
TyrGly: 2.494 ± 1.292
TyrHis: 1.247 ± 0.075
TyrIle: 1.247 ± 0.796
TyrLys: 0.416 ± 0.215
TyrLeu: 2.909 ± 0.656
TyrMet: 0.831 ± 0.431
TyrAsn: 2.078 ± 0.356
TyrPro: 2.494 ± 1.593
TyrGln: 2.078 ± 0.356
TyrArg: 1.663 ± 0.14
TyrSer: 0.831 ± 0.29
TyrThr: 3.741 ± 1.668
TyrVal: 2.078 ± 1.077
TyrTrp: 0.416 ± 0.506
TyrTyr: 0.831 ± 0.431
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2407 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
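To make the note above concrete, the following sketch shows one way a protein-level bootstrap could work: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is a hedged illustration, not the database's actual procedure; the function names, the fixed seed, and the use of the sample standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences):
    """Per-mille frequency of each dipeptide, counted within each protein."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread of the per-mille frequency of `pair` over protein-level resamples.

    Assumption: the error is the sample standard deviation of the estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy two-protein "proteome" (hypothetical sequences).
toy = ["MKVLA", "AAKV"]
err = bootstrap_error(toy, "KV")
print(round(err, 3))
```

With only two proteins, as in this proteome, each resample can take very few distinct compositions, so the resulting error estimates are necessarily coarse.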

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski