Amino acid dipeptide frequency for Wenzhou crab virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
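As an illustration, a dipeptide frequency can be computed by sliding a window of width two over each protein sequence. The sketch below is only one possible approach and is not taken from the database itself: the function name `dipeptide_per_mille` is hypothetical, sequences are assumed to use one-letter codes (the table uses three-letter codes), and pairs are assumed not to span protein boundaries. How the database handles protein boundaries and normalization is not stated on this page.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: overlapping adjacent pairs are counted within each protein,
    and the counts are normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Overlapping window of width 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real viral proteins.
freqs = dipeptide_per_mille(["MAAGL", "AGK"])
```

With the toy input, the pair `AG` occurs twice among six pairs in total, so its frequency is one third, or about 333‰.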

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
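The expected value and the over/under comparison can be sketched as follows. This is a minimal illustration, assuming the threshold is simply the uniform expectation for a 20-letter alphabet; the function names are hypothetical, and the exact rule used for the colouring on the page is not stated.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"    # would be marked red
    if observed_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"


uniform_20 = expected_per_mille(20)  # 400 dipeptides
uniform_21 = expected_per_mille(21)  # 441 dipeptides, counting Xaa
```

For 20 amino acids the expectation is 2.5‰; with Xaa included it drops to roughly 2.27‰. Under this rule AlaGly (9.731‰) is overrepresented and CysAla (0.286‰) is underrepresented.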

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
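As a worked example of the unit conversion, using the AlaAla value from the table:

```latex
4.007\,\text{‰} \;=\; 4.007 \times 10^{-3} \;=\; 0.004007 \;=\; 0.4007\,\%
```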

For more information, see the sequence space article on Wikipedia.

Residue order (rows and columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.007 ± 1.959
AlaCys: 2.003 ± 0.549
AlaAsp: 2.29 ± 0.851
AlaGlu: 4.293 ± 1.531
AlaPhe: 4.007 ± 1.944
AlaGly: 9.731 ± 5.274
AlaHis: 2.003 ± 0.26
AlaIle: 1.717 ± 0.602
AlaLys: 2.862 ± 0.561
AlaLeu: 7.441 ± 1.53
AlaMet: 2.003 ± 0.647
AlaAsn: 1.431 ± 0.527
AlaPro: 5.438 ± 3.672
AlaGln: 4.293 ± 1.795
AlaArg: 2.29 ± 0.843
AlaSer: 8.586 ± 2.782
AlaThr: 3.434 ± 0.345
AlaVal: 4.579 ± 2.357
AlaTrp: 1.717 ± 0.496
AlaTyr: 3.434 ± 0.797
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.286 ± 0.165
CysCys: 0.0 ± 0.0
CysAsp: 1.717 ± 0.541
CysGlu: 0.859 ± 0.762
CysPhe: 1.145 ± 0.656
CysGly: 0.286 ± 0.444
CysHis: 0.572 ± 0.328
CysIle: 0.859 ± 0.762
CysLys: 1.717 ± 0.988
CysLeu: 2.003 ± 0.366
CysMet: 0.0 ± 0.0
CysAsn: 0.572 ± 0.329
CysPro: 1.431 ± 0.406
CysGln: 0.572 ± 0.329
CysArg: 0.859 ± 0.762
CysSer: 2.29 ± 1.311
CysThr: 1.145 ± 0.303
CysVal: 0.859 ± 0.494
CysTrp: 0.286 ± 0.165
CysTyr: 1.431 ± 0.406
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.721 ± 1.141
AspCys: 1.145 ± 0.234
AspAsp: 4.007 ± 0.932
AspGlu: 2.29 ± 0.843
AspPhe: 0.572 ± 0.328
AspGly: 1.717 ± 0.496
AspHis: 2.29 ± 0.843
AspIle: 2.862 ± 0.351
AspLys: 1.431 ± 0.823
AspLeu: 5.152 ± 1.252
AspMet: 0.859 ± 0.494
AspAsn: 2.003 ± 0.26
AspPro: 2.003 ± 0.899
AspGln: 2.003 ± 1.178
AspArg: 2.29 ± 0.425
AspSer: 2.29 ± 0.468
AspThr: 3.148 ± 1.321
AspVal: 3.434 ± 0.925
AspTrp: 0.286 ± 0.165
AspTyr: 2.29 ± 0.425
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.583 ± 1.346
GluCys: 1.431 ± 0.577
GluAsp: 1.717 ± 0.096
GluGlu: 3.148 ± 1.104
GluPhe: 2.003 ± 0.826
GluGly: 4.007 ± 0.869
GluHis: 0.859 ± 0.269
GluIle: 2.576 ± 0.356
GluLys: 2.29 ± 0.276
GluLeu: 7.441 ± 1.995
GluMet: 2.003 ± 0.689
GluAsn: 1.717 ± 0.673
GluPro: 1.145 ± 0.303
GluGln: 1.717 ± 1.662
GluArg: 2.003 ± 0.689
GluSer: 3.148 ± 0.541
GluThr: 3.148 ± 0.845
GluVal: 5.724 ± 0.829
GluTrp: 1.145 ± 0.234
GluTyr: 1.431 ± 0.406
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.003 ± 0.26
PheCys: 0.859 ± 0.494
PheAsp: 1.145 ± 0.656
PheGlu: 1.431 ± 0.406
PhePhe: 1.145 ± 0.796
PheGly: 3.721 ± 0.569
PheHis: 1.145 ± 0.396
PheIle: 2.003 ± 0.306
PheLys: 0.859 ± 0.953
PheLeu: 4.007 ± 1.413
PheMet: 0.859 ± 0.494
PheAsn: 2.003 ± 0.829
PhePro: 2.29 ± 1.12
PheGln: 1.145 ± 0.234
PheArg: 3.148 ± 1.198
PheSer: 2.29 ± 1.318
PheThr: 2.576 ± 0.583
PheVal: 1.145 ± 0.234
PheTrp: 0.286 ± 0.165
PheTyr: 3.148 ± 0.861
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.721 ± 2.327
GlyCys: 1.145 ± 0.656
GlyAsp: 2.576 ± 0.27
GlyGlu: 4.579 ± 0.553
GlyPhe: 3.434 ± 0.653
GlyGly: 3.434 ± 1.811
GlyHis: 2.29 ± 0.276
GlyIle: 2.003 ± 0.306
GlyLys: 2.576 ± 1.825
GlyLeu: 4.579 ± 0.514
GlyMet: 1.431 ± 1.087
GlyAsn: 2.576 ± 0.903
GlyPro: 4.293 ± 2.975
GlyGln: 2.003 ± 0.366
GlyArg: 4.007 ± 0.732
GlySer: 4.865 ± 0.955
GlyThr: 5.152 ± 0.607
GlyVal: 4.007 ± 0.885
GlyTrp: 1.145 ± 0.95
GlyTyr: 1.145 ± 0.396
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.431 ± 0.527
HisCys: 1.431 ± 0.577
HisAsp: 1.431 ± 0.406
HisGlu: 0.572 ± 0.329
HisPhe: 1.717 ± 0.541
HisGly: 0.859 ± 0.399
HisHis: 2.003 ± 0.899
HisIle: 1.717 ± 0.096
HisLys: 0.286 ± 0.165
HisLeu: 2.29 ± 1.318
HisMet: 0.572 ± 0.328
HisAsn: 1.145 ± 0.659
HisPro: 2.862 ± 1.054
HisGln: 1.431 ± 0.577
HisArg: 0.859 ± 0.269
HisSer: 2.576 ± 1.251
HisThr: 1.431 ± 0.406
HisVal: 1.431 ± 0.406
HisTrp: 1.145 ± 0.396
HisTyr: 2.003 ± 1.153
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.721 ± 1.228
IleCys: 0.0 ± 0.0
IleAsp: 1.431 ± 0.07
IleGlu: 3.721 ± 0.919
IlePhe: 1.717 ± 0.673
IleGly: 2.29 ± 0.276
IleHis: 1.431 ± 0.406
IleIle: 3.721 ± 0.3
IleLys: 1.717 ± 0.541
IleLeu: 4.007 ± 0.805
IleMet: 2.29 ± 0.289
IleAsn: 1.717 ± 0.84
IlePro: 3.434 ± 0.702
IleGln: 3.434 ± 0.653
IleArg: 4.007 ± 0.252
IleSer: 4.007 ± 1.877
IleThr: 2.003 ± 0.26
IleVal: 2.29 ± 0.839
IleTrp: 0.0 ± 0.0
IleTyr: 1.431 ± 0.406
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.29 ± 0.425
LysCys: 0.859 ± 0.269
LysAsp: 1.145 ± 0.396
LysGlu: 3.721 ± 0.856
LysPhe: 1.431 ± 0.823
LysGly: 2.29 ± 0.468
LysHis: 1.145 ± 0.659
LysIle: 3.148 ± 0.919
LysLys: 4.007 ± 0.252
LysLeu: 5.724 ± 1.114
LysMet: 1.145 ± 0.656
LysAsn: 1.431 ± 0.557
LysPro: 1.717 ± 0.539
LysGln: 1.431 ± 0.527
LysArg: 2.003 ± 0.826
LysSer: 2.29 ± 0.606
LysThr: 3.148 ± 0.541
LysVal: 4.007 ± 0.869
LysTrp: 0.859 ± 0.762
LysTyr: 2.003 ± 0.826
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.155 ± 1.44
LeuCys: 2.29 ± 0.606
LeuAsp: 8.014 ± 1.497
LeuGlu: 5.724 ± 1.114
LeuPhe: 2.576 ± 0.356
LeuGly: 6.01 ± 0.797
LeuHis: 1.717 ± 0.988
LeuIle: 1.145 ± 0.659
LeuLys: 6.583 ± 1.377
LeuLeu: 9.159 ± 2.089
LeuMet: 2.003 ± 0.26
LeuAsn: 3.148 ± 1.385
LeuPro: 6.869 ± 0.866
LeuGln: 4.579 ± 1.057
LeuArg: 7.441 ± 0.185
LeuSer: 7.441 ± 0.782
LeuThr: 5.724 ± 1.121
LeuVal: 7.728 ± 1.755
LeuTrp: 1.431 ± 0.07
LeuTyr: 3.148 ± 0.541
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.717 ± 0.496
MetCys: 0.0 ± 0.0
MetAsp: 1.145 ± 0.396
MetGlu: 2.29 ± 0.289
MetPhe: 0.572 ± 0.328
MetGly: 0.859 ± 0.494
MetHis: 0.572 ± 0.328
MetIle: 1.145 ± 0.303
MetLys: 1.431 ± 0.406
MetLeu: 2.003 ± 0.689
MetMet: 1.717 ± 0.096
MetAsn: 1.431 ± 0.406
MetPro: 0.572 ± 0.329
MetGln: 0.572 ± 0.328
MetArg: 2.29 ± 0.843
MetSer: 2.003 ± 0.632
MetThr: 2.003 ± 0.826
MetVal: 3.721 ± 0.293
MetTrp: 0.0 ± 0.0
MetTyr: 2.003 ± 0.744
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.293 ± 1.116
AsnCys: 1.145 ± 0.659
AsnAsp: 0.572 ± 0.329
AsnGlu: 2.003 ± 0.744
AsnPhe: 1.431 ± 0.527
AsnGly: 0.572 ± 0.28
AsnHis: 1.145 ± 0.396
AsnIle: 2.862 ± 0.351
AsnLys: 2.576 ± 0.356
AsnLeu: 4.293 ± 1.472
AsnMet: 2.003 ± 0.27
AsnAsn: 1.145 ± 0.396
AsnPro: 1.145 ± 0.95
AsnGln: 1.431 ± 0.406
AsnArg: 1.717 ± 0.673
AsnSer: 1.431 ± 0.406
AsnThr: 1.431 ± 0.527
AsnVal: 1.431 ± 0.07
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.572 ± 0.329
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 5.724 ± 3.967
ProCys: 0.859 ± 0.762
ProAsp: 3.148 ± 0.478
ProGlu: 2.003 ± 0.899
ProPhe: 1.431 ± 0.823
ProGly: 4.007 ± 0.613
ProHis: 1.145 ± 0.234
ProIle: 3.434 ± 0.764
ProLys: 1.431 ± 0.406
ProLeu: 4.007 ± 0.885
ProMet: 2.003 ± 0.744
ProAsn: 0.859 ± 0.762
ProPro: 5.152 ± 1.403
ProGln: 2.862 ± 1.783
ProArg: 2.862 ± 0.351
ProSer: 4.865 ± 1.186
ProThr: 2.576 ± 1.196
ProVal: 3.434 ± 1.258
ProTrp: 1.145 ± 0.659
ProTyr: 3.148 ± 1.148
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 6.01 ± 2.902
GlnCys: 0.286 ± 0.165
GlnAsp: 2.576 ± 0.692
GlnGlu: 2.576 ± 0.583
GlnPhe: 0.572 ± 0.887
GlnGly: 4.007 ± 1.863
GlnHis: 2.003 ± 0.549
GlnIle: 1.717 ± 0.988
GlnLys: 2.003 ± 0.899
GlnLeu: 4.007 ± 1.252
GlnMet: 1.717 ± 0.415
GlnAsn: 2.576 ± 0.356
GlnPro: 1.431 ± 0.527
GlnGln: 1.145 ± 0.396
GlnArg: 2.003 ± 0.26
GlnSer: 3.434 ± 0.192
GlnThr: 3.148 ± 0.552
GlnVal: 2.29 ± 0.468
GlnTrp: 0.572 ± 0.28
GlnTyr: 0.286 ± 0.444
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.293 ± 0.952
ArgCys: 0.859 ± 0.762
ArgAsp: 0.859 ± 0.301
ArgGlu: 3.148 ± 0.845
ArgPhe: 2.003 ± 0.684
ArgGly: 4.007 ± 0.373
ArgHis: 1.717 ± 0.988
ArgIle: 2.003 ± 0.26
ArgLys: 3.148 ± 0.614
ArgLeu: 8.014 ± 1.034
ArgMet: 0.859 ± 0.269
ArgAsn: 2.29 ± 0.843
ArgPro: 1.717 ± 0.539
ArgGln: 1.145 ± 0.659
ArgArg: 3.721 ± 0.569
ArgSer: 4.007 ± 0.252
ArgThr: 3.434 ± 0.53
ArgVal: 6.01 ± 1.836
ArgTrp: 1.717 ± 0.496
ArgTyr: 1.145 ± 0.303
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.297 ± 1.688
SerCys: 2.003 ± 0.549
SerAsp: 3.148 ± 0.552
SerGlu: 3.434 ± 0.53
SerPhe: 1.431 ± 0.406
SerGly: 5.152 ± 1.383
SerHis: 2.29 ± 0.839
SerIle: 3.434 ± 0.53
SerLys: 3.148 ± 1.812
SerLeu: 7.155 ± 1.749
SerMet: 1.431 ± 0.406
SerAsn: 2.862 ± 0.754
SerPro: 3.721 ± 1.248
SerGln: 4.007 ± 1.038
SerArg: 3.434 ± 1.453
SerSer: 5.438 ± 2.227
SerThr: 6.297 ± 0.841
SerVal: 6.01 ± 1.331
SerTrp: 0.572 ± 0.328
SerTyr: 2.576 ± 0.59
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.297 ± 1.839
ThrCys: 1.145 ± 0.656
ThrAsp: 1.717 ± 0.539
ThrGlu: 2.576 ± 1.142
ThrPhe: 3.721 ± 1.278
ThrGly: 1.717 ± 0.797
ThrHis: 2.29 ± 0.468
ThrIle: 6.01 ± 2.973
ThrLys: 2.576 ± 0.698
ThrLeu: 6.583 ± 1.098
ThrMet: 1.431 ± 0.557
ThrAsn: 1.717 ± 0.861
ThrPro: 3.148 ± 0.919
ThrGln: 2.576 ± 0.27
ThrArg: 4.865 ± 0.124
ThrSer: 3.721 ± 0.775
ThrThr: 6.583 ± 1.194
ThrVal: 2.862 ± 0.754
ThrTrp: 0.572 ± 0.329
ThrTyr: 3.434 ± 1.453
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.293 ± 1.385
ValCys: 1.431 ± 0.823
ValAsp: 3.148 ± 0.614
ValGlu: 2.862 ± 0.81
ValPhe: 4.007 ± 0.869
ValGly: 2.862 ± 1.115
ValHis: 1.145 ± 0.672
ValIle: 2.862 ± 0.703
ValLys: 3.434 ± 0.192
ValLeu: 5.724 ± 0.79
ValMet: 2.29 ± 0.425
ValAsn: 1.431 ± 0.527
ValPro: 4.579 ± 0.558
ValGln: 4.293 ± 0.303
ValArg: 4.293 ± 1.1
ValSer: 4.579 ± 1.58
ValThr: 7.155 ± 1.77
ValVal: 5.152 ± 2.07
ValTrp: 1.717 ± 0.602
ValTyr: 2.576 ± 0.27
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.572 ± 0.329
TrpCys: 0.0 ± 0.0
TrpAsp: 2.003 ± 0.744
TrpGlu: 1.145 ± 0.303
TrpPhe: 0.572 ± 0.329
TrpGly: 0.572 ± 0.28
TrpHis: 0.0 ± 0.0
TrpIle: 0.859 ± 0.269
TrpLys: 0.859 ± 0.269
TrpLeu: 1.717 ± 1.207
TrpMet: 0.0 ± 0.0
TrpAsn: 0.286 ± 0.165
TrpPro: 1.431 ± 0.527
TrpGln: 0.0 ± 0.0
TrpArg: 0.572 ± 0.328
TrpSer: 2.576 ± 0.27
TrpThr: 0.572 ± 0.694
TrpVal: 1.717 ± 0.541
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.286 ± 0.165
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.148 ± 0.861
TyrCys: 0.0 ± 0.0
TyrAsp: 2.003 ± 0.26
TyrGlu: 2.862 ± 1.054
TyrPhe: 1.717 ± 0.541
TyrGly: 2.862 ± 0.915
TyrHis: 1.431 ± 0.406
TyrIle: 2.29 ± 0.425
TyrLys: 0.859 ± 0.399
TyrLeu: 4.579 ± 0.85
TyrMet: 1.145 ± 0.659
TyrAsn: 1.145 ± 0.396
TyrPro: 1.717 ± 0.096
TyrGln: 3.721 ± 0.537
TyrArg: 1.431 ± 0.406
TyrSer: 2.003 ± 0.899
TyrThr: 1.431 ± 0.577
TyrVal: 2.003 ± 0.306
TyrTrp: 1.145 ± 0.303
TyrTyr: 0.859 ± 0.269
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 3 proteins (3495 amino acids)
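Because the proteome is small, each table value corresponds to only a handful of observed dipeptides. The sketch below converts a per mille value back into an approximate raw count. It assumes the normalizing total is close to the 3495 amino acids stated above; the exact denominator used by the database is not given on this page, and the function name is hypothetical.

```python
def approx_count(freq_per_mille, total_positions=3495):
    """Approximate number of occurrences behind a per mille value.

    Assumption: frequencies were normalized by roughly `total_positions`.
    """
    return round(freq_per_mille * total_positions / 1000.0)


single = approx_count(0.286)   # smallest non-zero value in the table
ala_gly = approx_count(9.731)  # largest value in the table
```

Under this assumption the smallest non-zero entry, 0.286‰, corresponds to a single occurrence, and AlaGly (9.731‰) to about 34 occurrences.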

Note: The error was estimated by bootstrapping (×100) at the protein level.
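A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the database's actual code: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the error is taken as the sample standard deviation of those estimates. The function names, the one-letter sequence encoding, and the choice of standard deviation as the spread measure are all assumptions.

```python
import random
from statistics import stdev


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide in per mille (pairs do not span proteins)."""
    total = sum(max(len(s) - 1, 0) for s in sequences)
    hits = sum(
        1 for s in sequences for i in range(len(s) - 1) if s[i:i + 2] == pair
    )
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return stdev(estimates)


# Toy input, not real viral proteins.
proteins = ["MAAGL", "AGKAA", "GGGGG"]
error_aa = bootstrap_error(proteins, "AA")
```

With only three proteins, each resample can differ substantially from the original proteome, which would be consistent with the fairly large errors reported for some dipeptides in the table.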

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski