Amino acid dipepetide frequency for Wenzhou gastropodes virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.362AlaAla: 3.362 ± 1.303
2.241AlaCys: 2.241 ± 0.032
3.362AlaAsp: 3.362 ± 0.048
4.856AlaGlu: 4.856 ± 0.767
3.362AlaPhe: 3.362 ± 0.048
3.736AlaGly: 3.736 ± 0.472
1.868AlaHis: 1.868 ± 0.392
3.362AlaIle: 3.362 ± 1.303
2.615AlaLys: 2.615 ± 0.456
5.603AlaLeu: 5.603 ± 0.547
0.374AlaMet: 0.374 ± 0.424
2.615AlaAsn: 2.615 ± 0.456
5.23AlaPro: 5.23 ± 2.167
2.988AlaGln: 2.988 ± 0.252
2.988AlaArg: 2.988 ± 0.376
5.977AlaSer: 5.977 ± 1.759
4.109AlaThr: 4.109 ± 1.523
2.615AlaVal: 2.615 ± 0.172
1.868AlaTrp: 1.868 ± 0.236
1.494AlaTyr: 1.494 ± 0.815
0.0AlaXaa: 0.0 ± 0.0
Cys
1.868CysAla: 1.868 ± 0.864
0.747CysCys: 0.747 ± 0.408
0.747CysAsp: 0.747 ± 0.408
1.868CysGlu: 1.868 ± 1.019
1.868CysPhe: 1.868 ± 0.236
1.494CysGly: 1.494 ± 0.815
0.747CysHis: 0.747 ± 0.408
0.747CysIle: 0.747 ± 0.22
1.121CysLys: 1.121 ± 0.612
1.494CysLeu: 1.494 ± 0.188
0.747CysMet: 0.747 ± 0.408
1.494CysAsn: 1.494 ± 0.44
0.374CysPro: 0.374 ± 0.424
0.374CysGln: 0.374 ± 0.204
0.374CysArg: 0.374 ± 0.204
0.747CysSer: 0.747 ± 0.22
0.747CysThr: 0.747 ± 0.408
2.241CysVal: 2.241 ± 0.596
0.747CysTrp: 0.747 ± 0.408
0.747CysTyr: 0.747 ± 0.408
0.0CysXaa: 0.0 ± 0.0
Asp
3.362AspAla: 3.362 ± 0.676
1.121AspCys: 1.121 ± 0.612
4.483AspAsp: 4.483 ± 0.064
3.736AspGlu: 3.736 ± 0.783
4.109AspPhe: 4.109 ± 0.268
1.868AspGly: 1.868 ± 1.019
0.747AspHis: 0.747 ± 0.408
4.109AspIle: 4.109 ± 0.268
3.362AspLys: 3.362 ± 1.835
4.856AspLeu: 4.856 ± 1.395
1.868AspMet: 1.868 ± 1.019
1.494AspAsn: 1.494 ± 0.44
4.109AspPro: 4.109 ± 2.779
1.868AspGln: 1.868 ± 0.236
2.615AspArg: 2.615 ± 0.799
3.736AspSer: 3.736 ± 0.472
2.615AspThr: 2.615 ± 1.711
2.241AspVal: 2.241 ± 0.032
2.615AspTrp: 2.615 ± 0.172
2.615AspTyr: 2.615 ± 1.084
0.0AspXaa: 0.0 ± 0.0
Glu
3.736GluAla: 3.736 ± 0.156
1.494GluCys: 1.494 ± 0.188
4.109GluAsp: 4.109 ± 0.36
4.483GluGlu: 4.483 ± 2.446
1.121GluPhe: 1.121 ± 0.612
2.241GluGly: 2.241 ± 1.223
1.494GluHis: 1.494 ± 0.815
4.856GluIle: 4.856 ± 0.14
4.483GluLys: 4.483 ± 1.819
4.856GluLeu: 4.856 ± 0.14
2.615GluMet: 2.615 ± 0.456
2.988GluAsn: 2.988 ± 0.252
1.868GluPro: 1.868 ± 0.864
3.362GluGln: 3.362 ± 1.207
2.615GluArg: 2.615 ± 0.799
4.856GluSer: 4.856 ± 0.488
2.241GluThr: 2.241 ± 0.032
5.23GluVal: 5.23 ± 0.912
1.121GluTrp: 1.121 ± 0.644
1.494GluTyr: 1.494 ± 0.188
0.0GluXaa: 0.0 ± 0.0
Phe
5.23PheAla: 5.23 ± 0.971
1.121PheCys: 1.121 ± 0.016
2.615PheAsp: 2.615 ± 1.427
2.615PheGlu: 2.615 ± 0.456
2.988PhePhe: 2.988 ± 1.003
3.362PheGly: 3.362 ± 0.048
1.868PheHis: 1.868 ± 0.236
4.483PheIle: 4.483 ± 2.446
2.241PheLys: 2.241 ± 0.032
3.362PheLeu: 3.362 ± 1.835
1.868PheMet: 1.868 ± 1.019
0.747PheAsn: 0.747 ± 0.408
0.374PhePro: 0.374 ± 0.424
2.615PheGln: 2.615 ± 1.084
2.615PheArg: 2.615 ± 1.711
3.362PheSer: 3.362 ± 1.931
3.362PheThr: 3.362 ± 1.303
3.736PheVal: 3.736 ± 0.156
1.494PheTrp: 1.494 ± 0.44
2.615PheTyr: 2.615 ± 0.172
0.0PheXaa: 0.0 ± 0.0
Gly
2.615GlyAla: 2.615 ± 1.084
1.121GlyCys: 1.121 ± 0.016
2.988GlyAsp: 2.988 ± 1.003
5.603GlyGlu: 5.603 ± 1.336
2.988GlyPhe: 2.988 ± 1.003
3.736GlyGly: 3.736 ± 1.1
1.494GlyHis: 1.494 ± 0.188
2.241GlyIle: 2.241 ± 0.596
4.109GlyLys: 4.109 ± 0.36
3.736GlyLeu: 3.736 ± 0.156
1.494GlyMet: 1.494 ± 0.815
3.362GlyAsn: 3.362 ± 0.676
1.868GlyPro: 1.868 ± 0.236
1.868GlyGln: 1.868 ± 0.236
3.362GlyArg: 3.362 ± 0.579
4.856GlySer: 4.856 ± 2.999
4.109GlyThr: 4.109 ± 0.268
4.483GlyVal: 4.483 ± 0.064
1.494GlyTrp: 1.494 ± 0.815
1.494GlyTyr: 1.494 ± 0.188
0.0GlyXaa: 0.0 ± 0.0
His
1.868HisAla: 1.868 ± 0.864
0.374HisCys: 0.374 ± 0.424
2.241HisAsp: 2.241 ± 0.032
1.121HisGlu: 1.121 ± 0.644
0.747HisPhe: 0.747 ± 0.408
1.121HisGly: 1.121 ± 0.612
0.374HisHis: 0.374 ± 0.424
1.868HisIle: 1.868 ± 0.236
0.374HisLys: 0.374 ± 0.424
3.362HisLeu: 3.362 ± 1.207
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.494HisPro: 1.494 ± 0.815
0.374HisGln: 0.374 ± 0.424
1.121HisArg: 1.121 ± 0.612
1.868HisSer: 1.868 ± 0.236
0.0HisThr: 0.0 ± 0.0
1.494HisVal: 1.494 ± 0.44
0.747HisTrp: 0.747 ± 0.22
0.747HisTyr: 0.747 ± 0.408
0.0HisXaa: 0.0 ± 0.0
Ile
6.724IleAla: 6.724 ± 1.979
0.747IleCys: 0.747 ± 0.408
3.736IleAsp: 3.736 ± 0.472
3.736IleGlu: 3.736 ± 0.783
2.615IlePhe: 2.615 ± 1.427
2.615IleGly: 2.615 ± 0.456
1.868IleHis: 1.868 ± 0.236
4.109IleIle: 4.109 ± 0.987
1.121IleLys: 1.121 ± 0.612
4.109IleLeu: 4.109 ± 0.36
0.374IleMet: 0.374 ± 0.252
3.362IleAsn: 3.362 ± 0.676
1.868IlePro: 1.868 ± 1.491
1.868IleGln: 1.868 ± 0.392
4.109IleArg: 4.109 ± 0.36
1.868IleSer: 1.868 ± 0.392
2.988IleThr: 2.988 ± 0.252
4.483IleVal: 4.483 ± 0.064
1.868IleTrp: 1.868 ± 0.392
1.868IleTyr: 1.868 ± 0.236
0.0IleXaa: 0.0 ± 0.0
Lys
2.988LysAla: 2.988 ± 0.252
1.121LysCys: 1.121 ± 0.016
3.736LysAsp: 3.736 ± 0.783
5.603LysGlu: 5.603 ± 1.803
1.868LysPhe: 1.868 ± 0.392
3.362LysGly: 3.362 ± 0.579
1.121LysHis: 1.121 ± 0.016
3.362LysIle: 3.362 ± 0.579
3.362LysLys: 3.362 ± 1.835
5.23LysLeu: 5.23 ± 0.912
0.747LysMet: 0.747 ± 0.22
1.494LysAsn: 1.494 ± 0.815
2.615LysPro: 2.615 ± 0.456
0.747LysGln: 0.747 ± 0.22
2.615LysArg: 2.615 ± 0.799
2.241LysSer: 2.241 ± 1.223
2.241LysThr: 2.241 ± 1.223
4.483LysVal: 4.483 ± 1.191
1.121LysTrp: 1.121 ± 0.612
3.362LysTyr: 3.362 ± 1.207
0.0LysXaa: 0.0 ± 0.0
Leu
4.483LeuAla: 4.483 ± 0.064
2.988LeuCys: 2.988 ± 1.631
3.362LeuAsp: 3.362 ± 0.676
1.494LeuGlu: 1.494 ± 0.188
4.483LeuPhe: 4.483 ± 0.563
4.483LeuGly: 4.483 ± 0.064
3.736LeuHis: 3.736 ± 1.727
3.736LeuIle: 3.736 ± 0.783
7.097LeuLys: 7.097 ± 0.107
10.086LeuLeu: 10.086 ± 3.621
1.868LeuMet: 1.868 ± 1.019
4.109LeuAsn: 4.109 ± 0.987
5.603LeuPro: 5.603 ± 0.547
3.736LeuGln: 3.736 ± 0.472
4.109LeuArg: 4.109 ± 1.615
4.109LeuSer: 4.109 ± 0.268
7.097LeuThr: 7.097 ± 0.107
6.724LeuVal: 6.724 ± 1.787
0.747LeuTrp: 0.747 ± 0.22
2.615LeuTyr: 2.615 ± 0.172
0.0LeuXaa: 0.0 ± 0.0
Met
2.615MetAla: 2.615 ± 0.456
1.121MetCys: 1.121 ± 0.612
2.241MetAsp: 2.241 ± 0.596
0.374MetGlu: 0.374 ± 0.424
2.241MetPhe: 2.241 ± 0.66
2.615MetGly: 2.615 ± 1.084
0.0MetHis: 0.0 ± 0.0
2.241MetIle: 2.241 ± 0.032
4.109MetLys: 4.109 ± 1.615
2.988MetLeu: 2.988 ± 0.88
0.0MetMet: 0.0 ± 0.0
0.747MetAsn: 0.747 ± 0.22
0.747MetPro: 0.747 ± 0.408
0.747MetGln: 0.747 ± 0.22
0.747MetArg: 0.747 ± 0.408
1.868MetSer: 1.868 ± 1.019
0.747MetThr: 0.747 ± 0.408
0.747MetVal: 0.747 ± 0.408
0.374MetTrp: 0.374 ± 0.204
1.121MetTyr: 1.121 ± 0.016
0.0MetXaa: 0.0 ± 0.0
Asn
2.241AsnAla: 2.241 ± 0.032
0.747AsnCys: 0.747 ± 0.22
2.615AsnAsp: 2.615 ± 0.456
1.868AsnGlu: 1.868 ± 0.236
2.615AsnPhe: 2.615 ± 0.172
2.241AsnGly: 2.241 ± 1.915
0.747AsnHis: 0.747 ± 0.408
2.615AsnIle: 2.615 ± 0.456
1.494AsnLys: 1.494 ± 0.44
2.988AsnLeu: 2.988 ± 1.003
0.0AsnMet: 0.0 ± 0.168
0.747AsnAsn: 0.747 ± 0.408
2.988AsnPro: 2.988 ± 1.507
0.747AsnGln: 0.747 ± 0.408
2.988AsnArg: 2.988 ± 1.507
2.615AsnSer: 2.615 ± 0.172
2.988AsnThr: 2.988 ± 0.252
3.736AsnVal: 3.736 ± 0.472
1.494AsnTrp: 1.494 ± 0.815
2.241AsnTyr: 2.241 ± 0.66
0.0AsnXaa: 0.0 ± 0.0
Pro
2.241ProAla: 2.241 ± 0.66
0.0ProCys: 0.0 ± 0.0
4.856ProAsp: 4.856 ± 1.743
3.736ProGlu: 3.736 ± 0.783
3.736ProPhe: 3.736 ± 2.355
2.615ProGly: 2.615 ± 0.456
0.374ProHis: 0.374 ± 0.424
1.494ProIle: 1.494 ± 0.188
2.241ProLys: 2.241 ± 0.596
5.23ProLeu: 5.23 ± 0.343
2.615ProMet: 2.615 ± 0.456
0.747ProAsn: 0.747 ± 0.22
1.494ProPro: 1.494 ± 0.44
2.615ProGln: 2.615 ± 1.084
1.494ProArg: 1.494 ± 1.067
3.362ProSer: 3.362 ± 0.579
2.615ProThr: 2.615 ± 1.084
3.362ProVal: 3.362 ± 0.676
0.747ProTrp: 0.747 ± 0.22
1.868ProTyr: 1.868 ± 1.491
0.0ProXaa: 0.0 ± 0.0
Gln
2.241GlnAla: 2.241 ± 0.596
0.374GlnCys: 0.374 ± 0.204
1.868GlnAsp: 1.868 ± 0.864
1.121GlnGlu: 1.121 ± 1.271
1.868GlnPhe: 1.868 ± 0.392
1.494GlnGly: 1.494 ± 0.44
0.374GlnHis: 0.374 ± 0.204
0.747GlnIle: 0.747 ± 0.22
0.747GlnLys: 0.747 ± 0.408
4.483GlnLeu: 4.483 ± 1.191
2.988GlnMet: 2.988 ± 2.763
1.121GlnAsn: 1.121 ± 0.016
1.121GlnPro: 1.121 ± 0.016
0.747GlnGln: 0.747 ± 0.408
2.241GlnArg: 2.241 ± 0.032
0.747GlnSer: 0.747 ± 0.22
1.868GlnThr: 1.868 ± 0.236
2.615GlnVal: 2.615 ± 0.799
0.0GlnTrp: 0.0 ± 0.0
2.241GlnTyr: 2.241 ± 0.032
0.0GlnXaa: 0.0 ± 0.0
Arg
2.241ArgAla: 2.241 ± 0.66
1.494ArgCys: 1.494 ± 0.815
2.241ArgAsp: 2.241 ± 0.596
4.109ArgGlu: 4.109 ± 0.36
1.868ArgPhe: 1.868 ± 0.236
2.241ArgGly: 2.241 ± 0.032
1.121ArgHis: 1.121 ± 0.016
2.241ArgIle: 2.241 ± 0.66
2.241ArgLys: 2.241 ± 1.223
5.23ArgLeu: 5.23 ± 1.599
2.241ArgMet: 2.241 ± 0.596
2.615ArgAsn: 2.615 ± 1.427
3.362ArgPro: 3.362 ± 1.931
0.374ArgGln: 0.374 ± 0.424
3.362ArgArg: 3.362 ± 1.835
4.856ArgSer: 4.856 ± 1.116
4.109ArgThr: 4.109 ± 1.615
2.615ArgVal: 2.615 ± 0.172
1.868ArgTrp: 1.868 ± 0.392
2.241ArgTyr: 2.241 ± 0.032
0.0ArgXaa: 0.0 ± 0.0
Ser
4.483SerAla: 4.483 ± 1.32
0.747SerCys: 0.747 ± 0.22
3.736SerAsp: 3.736 ± 0.156
3.736SerGlu: 3.736 ± 0.783
4.483SerPhe: 4.483 ± 0.692
5.977SerGly: 5.977 ± 0.504
0.747SerHis: 0.747 ± 0.408
5.603SerIle: 5.603 ± 0.08
2.241SerLys: 2.241 ± 0.596
5.977SerLeu: 5.977 ± 0.504
1.868SerMet: 1.868 ± 0.236
2.615SerAsn: 2.615 ± 0.456
2.988SerPro: 2.988 ± 1.631
1.494SerGln: 1.494 ± 0.44
4.483SerArg: 4.483 ± 1.947
5.23SerSer: 5.23 ± 2.167
3.736SerThr: 3.736 ± 1.727
5.23SerVal: 5.23 ± 0.284
0.374SerTrp: 0.374 ± 0.424
1.868SerTyr: 1.868 ± 0.864
0.0SerXaa: 0.0 ± 0.0
Thr
4.109ThrAla: 4.109 ± 0.268
0.0ThrCys: 0.0 ± 0.0
2.241ThrAsp: 2.241 ± 0.032
1.868ThrGlu: 1.868 ± 0.864
3.362ThrPhe: 3.362 ± 0.579
5.23ThrGly: 5.23 ± 0.912
1.121ThrHis: 1.121 ± 0.644
2.241ThrIle: 2.241 ± 0.66
2.988ThrLys: 2.988 ± 0.252
4.483ThrLeu: 4.483 ± 1.947
2.241ThrMet: 2.241 ± 1.223
3.736ThrAsn: 3.736 ± 1.727
2.988ThrPro: 2.988 ± 0.88
2.615ThrGln: 2.615 ± 0.172
2.615ThrArg: 2.615 ± 0.172
5.977ThrSer: 5.977 ± 0.124
6.35ThrThr: 6.35 ± 1.556
5.603ThrVal: 5.603 ± 1.963
0.374ThrTrp: 0.374 ± 0.424
2.615ThrTyr: 2.615 ± 1.427
0.0ThrXaa: 0.0 ± 0.0
Val
4.483ValAla: 4.483 ± 1.191
1.494ValCys: 1.494 ± 0.188
4.109ValAsp: 4.109 ± 0.896
4.856ValGlu: 4.856 ± 0.767
3.362ValPhe: 3.362 ± 0.676
4.483ValGly: 4.483 ± 0.064
0.747ValHis: 0.747 ± 0.22
4.109ValIle: 4.109 ± 1.523
4.109ValLys: 4.109 ± 0.987
3.362ValLeu: 3.362 ± 1.207
2.615ValMet: 2.615 ± 0.456
2.241ValAsn: 2.241 ± 1.287
4.109ValPro: 4.109 ± 0.896
1.121ValGln: 1.121 ± 0.016
3.362ValArg: 3.362 ± 1.207
5.977ValSer: 5.977 ± 0.751
5.23ValThr: 5.23 ± 1.539
4.856ValVal: 4.856 ± 0.488
1.494ValTrp: 1.494 ± 0.815
5.23ValTyr: 5.23 ± 0.343
0.0ValXaa: 0.0 ± 0.0
Trp
2.241TrpAla: 2.241 ± 0.032
0.747TrpCys: 0.747 ± 0.22
0.374TrpAsp: 0.374 ± 0.204
1.494TrpGlu: 1.494 ± 0.188
1.121TrpPhe: 1.121 ± 0.612
0.747TrpGly: 0.747 ± 0.22
0.0TrpHis: 0.0 ± 0.0
0.747TrpIle: 0.747 ± 0.408
1.121TrpLys: 1.121 ± 0.016
1.494TrpLeu: 1.494 ± 0.815
0.747TrpMet: 0.747 ± 0.408
1.868TrpAsn: 1.868 ± 0.864
0.747TrpPro: 0.747 ± 0.408
0.374TrpGln: 0.374 ± 0.204
2.615TrpArg: 2.615 ± 0.799
0.0TrpSer: 0.0 ± 0.0
2.241TrpThr: 2.241 ± 1.287
1.494TrpVal: 1.494 ± 0.188
0.747TrpTrp: 0.747 ± 0.408
1.494TrpTyr: 1.494 ± 0.188
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.494TyrAla: 1.494 ± 0.188
1.494TyrCys: 1.494 ± 0.188
1.868TyrAsp: 1.868 ± 0.236
2.988TyrGlu: 2.988 ± 1.003
1.868TyrPhe: 1.868 ± 0.236
3.362TyrGly: 3.362 ± 1.207
0.747TyrHis: 0.747 ± 0.408
1.121TyrIle: 1.121 ± 0.644
2.241TyrLys: 2.241 ± 0.032
3.736TyrLeu: 3.736 ± 1.727
0.747TyrMet: 0.747 ± 0.408
2.988TyrAsn: 2.988 ± 0.88
1.494TyrPro: 1.494 ± 0.815
0.374TyrGln: 0.374 ± 0.204
2.241TyrArg: 2.241 ± 1.223
3.362TyrSer: 3.362 ± 0.676
3.362TyrThr: 3.362 ± 0.579
3.362TyrVal: 3.362 ± 0.676
1.121TyrTrp: 1.121 ± 0.016
1.868TyrTyr: 1.868 ± 0.392
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2678 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski