Amino acid dipeptide frequency for Xincheng Mosquito Virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue, in the table.
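The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function names `dipeptide_frequencies` and `uniform_expectation` are hypothetical, one-letter codes are used with "X" standing in for Xaa, and it is an assumption that pairs are counted as overlapping windows within each protein (never across protein boundaries).

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins, include_xaa=True):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: dipeptides are overlapping pairs counted inside each
    protein. Any letter outside STANDARD is mapped to "X" (Xaa).
    """
    alphabet = STANDARD + ("X" if include_xaa else "")
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for a, b in zip(cleaned, cleaned[1:]):
            # Pairs containing "X" are skipped when Xaa is excluded.
            if a in alphabet and b in alphabet:
                counts[a + b] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def uniform_expectation(n_letters):
    """Per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / n_letters ** 2


# 20 letters give 400 dipeptides, each expected at 2.5 per mille (0.25%).
print(uniform_expectation(20))
```

With 21 letters (Xaa included) the same function gives roughly 2.27‰ per dipeptide, which is why the choice of alphabet matters when deciding what counts as over- or underrepresented.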

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
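As a small worked example of the unit conversion, the sketch below turns two values from the table (LeuLeu at 9.239‰ and TrpTrp at 0.0‰) into fractions and compares them with the 2.5‰ expected under equal likelihood of the 400 standard dipeptides. The helper names are hypothetical, and using exactly 2.5‰ as the cut-off for "over" and "under" is an assumption; the page does not state the threshold behind its colouring.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PER_MILLE):
    """Classify a per-mille frequency against the assumed expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values copied from the table: LeuLeu and TrpTrp.
for name, value in [("LeuLeu", 9.239), ("TrpTrp", 0.0)]:
    fraction = per_mille_to_fraction(value)
    print(f"{name}: {value} per mille = {fraction:.6f} "
          f"({fraction * 100:.4f}%), {representation(value)}represented")
```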

For more information, see the sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by first residue and given as dipeptide: frequency ± error (‰).
Ala
AlaAla: 2.52 ± 1.346
AlaCys: 0.56 ± 0.323
AlaAsp: 2.52 ± 0.784
AlaGlu: 5.599 ± 1.511
AlaPhe: 3.359 ± 0.297
AlaGly: 2.24 ± 0.559
AlaHis: 2.24 ± 1.268
AlaIle: 3.08 ± 0.831
AlaLys: 4.759 ± 1.198
AlaLeu: 4.479 ± 1.787
AlaMet: 2.52 ± 0.708
AlaAsn: 1.68 ± 0.339
AlaPro: 1.68 ± 0.638
AlaGln: 0.84 ± 0.484
AlaArg: 2.24 ± 0.19
AlaSer: 2.8 ± 0.89
AlaThr: 3.359 ± 0.818
AlaVal: 3.359 ± 1.177
AlaTrp: 0.28 ± 0.161
AlaTyr: 1.4 ± 0.646
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.84 ± 0.484
CysCys: 0.0 ± 0.0
CysAsp: 0.84 ± 0.459
CysGlu: 1.4 ± 0.523
CysPhe: 1.4 ± 0.492
CysGly: 1.12 ± 0.477
CysHis: 0.0 ± 0.0
CysIle: 0.56 ± 0.341
CysLys: 1.96 ± 0.781
CysLeu: 2.52 ± 0.568
CysMet: 0.28 ± 0.161
CysAsn: 1.96 ± 0.374
CysPro: 0.84 ± 0.319
CysGln: 0.0 ± 0.0
CysArg: 1.96 ± 0.805
CysSer: 2.8 ± 1.047
CysThr: 1.68 ± 0.38
CysVal: 0.56 ± 0.323
CysTrp: 0.28 ± 0.161
CysTyr: 0.84 ± 0.514
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.96 ± 0.556
AspCys: 0.84 ± 0.514
AspAsp: 2.8 ± 1.156
AspGlu: 3.919 ± 1.331
AspPhe: 3.08 ± 0.907
AspGly: 3.359 ± 0.392
AspHis: 0.84 ± 0.338
AspIle: 4.479 ± 1.333
AspLys: 3.639 ± 1.088
AspLeu: 6.439 ± 0.682
AspMet: 2.52 ± 0.648
AspAsn: 5.039 ± 1.569
AspPro: 1.68 ± 0.638
AspGln: 1.4 ± 0.665
AspArg: 3.359 ± 0.759
AspSer: 3.919 ± 0.208
AspThr: 2.24 ± 0.641
AspVal: 2.8 ± 0.846
AspTrp: 0.56 ± 0.238
AspTyr: 2.24 ± 0.465
AspXaa: 0.28 ± 0.375
Glu
GluAla: 4.479 ± 1.179
GluCys: 1.96 ± 0.721
GluAsp: 4.479 ± 0.733
GluGlu: 3.919 ± 1.68
GluPhe: 2.24 ± 0.975
GluGly: 3.919 ± 1.331
GluHis: 2.24 ± 0.601
GluIle: 4.759 ± 1.417
GluLys: 3.08 ± 0.374
GluLeu: 6.719 ± 1.524
GluMet: 1.12 ± 0.4
GluAsn: 1.96 ± 0.544
GluPro: 1.96 ± 0.706
GluGln: 2.24 ± 0.574
GluArg: 1.96 ± 0.805
GluSer: 6.719 ± 1.283
GluThr: 1.4 ± 0.744
GluVal: 4.199 ± 1.274
GluTrp: 0.84 ± 0.738
GluTyr: 2.24 ± 0.958
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.96 ± 0.441
PheCys: 0.28 ± 0.161
PheAsp: 1.96 ± 0.73
PheGlu: 3.359 ± 1.215
PhePhe: 1.68 ± 0.671
PheGly: 2.24 ± 0.574
PheHis: 0.56 ± 0.323
PheIle: 3.919 ± 1.067
PheLys: 1.68 ± 0.194
PheLeu: 3.08 ± 0.453
PheMet: 0.84 ± 0.278
PheAsn: 1.12 ± 0.646
PhePro: 3.08 ± 0.834
PheGln: 1.68 ± 0.339
PheArg: 1.12 ± 0.371
PheSer: 4.479 ± 0.557
PheThr: 2.24 ± 0.226
PheVal: 3.359 ± 1.114
PheTrp: 1.12 ± 0.579
PheTyr: 1.68 ± 0.968
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.24 ± 0.226
GlyCys: 2.52 ± 1.137
GlyAsp: 4.199 ± 2.4
GlyGlu: 5.039 ± 1.087
GlyPhe: 1.96 ± 0.634
GlyGly: 3.919 ± 0.619
GlyHis: 1.96 ± 0.568
GlyIle: 4.759 ± 1.746
GlyLys: 3.919 ± 0.652
GlyLeu: 5.599 ± 0.845
GlyMet: 0.56 ± 0.341
GlyAsn: 3.919 ± 0.93
GlyPro: 0.56 ± 0.323
GlyGln: 2.24 ± 0.958
GlyArg: 1.96 ± 0.696
GlySer: 2.8 ± 1.501
GlyThr: 3.919 ± 0.495
GlyVal: 1.68 ± 0.591
GlyTrp: 0.28 ± 0.161
GlyTyr: 1.68 ± 1.029
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.4 ± 0.518
HisCys: 0.28 ± 0.161
HisAsp: 1.68 ± 0.661
HisGlu: 1.4 ± 0.302
HisPhe: 0.84 ± 0.484
HisGly: 1.12 ± 0.477
HisHis: 0.84 ± 0.319
HisIle: 0.84 ± 0.278
HisLys: 1.4 ± 0.807
HisLeu: 2.52 ± 0.65
HisMet: 1.12 ± 0.646
HisAsn: 1.12 ± 0.406
HisPro: 0.84 ± 0.377
HisGln: 1.12 ± 0.371
HisArg: 1.12 ± 0.4
HisSer: 2.8 ± 0.846
HisThr: 2.24 ± 0.226
HisVal: 0.84 ± 0.484
HisTrp: 0.28 ± 0.161
HisTyr: 0.84 ± 0.459
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.479 ± 1.2
IleCys: 1.4 ± 0.341
IleAsp: 5.039 ± 0.307
IleGlu: 5.039 ± 1.235
IlePhe: 1.96 ± 0.706
IleGly: 3.919 ± 0.619
IleHis: 1.12 ± 0.4
IleIle: 6.719 ± 1.798
IleLys: 5.319 ± 1.02
IleLeu: 5.599 ± 0.994
IleMet: 1.4 ± 0.649
IleAsn: 4.199 ± 0.854
IlePro: 3.359 ± 1.258
IleGln: 2.8 ± 0.643
IleArg: 3.639 ± 0.649
IleSer: 8.119 ± 1.978
IleThr: 5.879 ± 0.657
IleVal: 4.759 ± 1.17
IleTrp: 0.56 ± 0.323
IleTyr: 3.359 ± 0.297
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.199 ± 0.952
LysCys: 1.4 ± 0.679
LysAsp: 4.479 ± 1.333
LysGlu: 5.879 ± 0.666
LysPhe: 1.68 ± 0.659
LysGly: 3.08 ± 0.374
LysHis: 0.56 ± 0.309
LysIle: 5.879 ± 1.233
LysLys: 6.719 ± 1.927
LysLeu: 6.719 ± 1.347
LysMet: 1.96 ± 0.653
LysAsn: 2.52 ± 1.62
LysPro: 4.479 ± 0.38
LysGln: 2.8 ± 0.728
LysArg: 3.08 ± 0.861
LysSer: 4.479 ± 0.453
LysThr: 2.8 ± 0.673
LysVal: 3.919 ± 1.051
LysTrp: 0.84 ± 0.484
LysTyr: 2.52 ± 0.403
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.599 ± 1.131
LeuCys: 1.4 ± 0.807
LeuAsp: 4.759 ± 1.724
LeuGlu: 5.599 ± 1.365
LeuPhe: 3.919 ± 0.882
LeuGly: 4.199 ± 1.319
LeuHis: 1.96 ± 0.906
LeuIle: 7.559 ± 1.406
LeuLys: 5.879 ± 2.27
LeuLeu: 9.239 ± 1.003
LeuMet: 4.199 ± 2.018
LeuAsn: 3.919 ± 1.137
LeuPro: 3.919 ± 0.581
LeuGln: 3.08 ± 1.041
LeuArg: 3.919 ± 0.917
LeuSer: 7.839 ± 1.334
LeuThr: 5.599 ± 1.083
LeuVal: 5.039 ± 2.438
LeuTrp: 0.56 ± 0.341
LeuTyr: 3.919 ± 1.307
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.8 ± 0.827
MetCys: 1.12 ± 0.406
MetAsp: 1.4 ± 0.606
MetGlu: 1.12 ± 0.387
MetPhe: 1.4 ± 0.523
MetGly: 1.12 ± 0.746
MetHis: 0.28 ± 0.161
MetIle: 2.52 ± 1.386
MetLys: 1.68 ± 0.677
MetLeu: 2.24 ± 0.557
MetMet: 0.84 ± 0.37
MetAsn: 0.84 ± 0.278
MetPro: 1.4 ± 0.523
MetGln: 0.56 ± 0.595
MetArg: 0.84 ± 0.514
MetSer: 3.359 ± 0.392
MetThr: 2.24 ± 0.601
MetVal: 1.68 ± 0.677
MetTrp: 0.84 ± 0.459
MetTyr: 0.84 ± 0.933
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.8 ± 1.001
AsnCys: 0.28 ± 0.161
AsnAsp: 3.08 ± 1.173
AsnGlu: 2.52 ± 0.519
AsnPhe: 3.359 ± 0.67
AsnGly: 3.08 ± 1.19
AsnHis: 1.96 ± 0.104
AsnIle: 5.319 ± 0.545
AsnLys: 2.8 ± 1.078
AsnLeu: 3.08 ± 0.587
AsnMet: 1.68 ± 0.451
AsnAsn: 1.96 ± 0.794
AsnPro: 1.4 ± 0.66
AsnGln: 1.68 ± 0.384
AsnArg: 2.8 ± 0.53
AsnSer: 5.039 ± 0.572
AsnThr: 2.24 ± 0.566
AsnVal: 2.52 ± 0.52
AsnTrp: 0.56 ± 0.323
AsnTyr: 2.8 ± 0.802
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.68 ± 0.194
ProCys: 0.56 ± 0.323
ProAsp: 3.919 ± 0.495
ProGlu: 2.52 ± 0.495
ProPhe: 1.4 ± 0.807
ProGly: 1.68 ± 0.659
ProHis: 0.28 ± 0.161
ProIle: 3.919 ± 0.729
ProLys: 2.24 ± 0.903
ProLeu: 2.24 ± 0.19
ProMet: 0.84 ± 0.37
ProAsn: 2.24 ± 0.65
ProPro: 2.52 ± 0.644
ProGln: 0.84 ± 0.319
ProArg: 0.56 ± 0.238
ProSer: 5.319 ± 1.97
ProThr: 1.96 ± 0.369
ProVal: 2.52 ± 0.495
ProTrp: 0.0 ± 0.0
ProTyr: 2.52 ± 0.784
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.84 ± 1.235
GlnCys: 0.56 ± 0.341
GlnAsp: 0.56 ± 0.498
GlnGlu: 1.12 ± 0.301
GlnPhe: 1.68 ± 0.575
GlnGly: 3.359 ± 0.858
GlnHis: 0.56 ± 0.238
GlnIle: 1.96 ± 0.441
GlnLys: 2.52 ± 0.744
GlnLeu: 2.24 ± 0.637
GlnMet: 0.84 ± 0.484
GlnAsn: 0.56 ± 0.309
GlnPro: 1.4 ± 0.971
GlnGln: 1.68 ± 1.633
GlnArg: 2.24 ± 0.992
GlnSer: 2.52 ± 1.111
GlnThr: 2.52 ± 1.111
GlnVal: 1.96 ± 0.499
GlnTrp: 0.28 ± 0.375
GlnTyr: 0.28 ± 0.161
GlnXaa: 0.28 ± 0.375
Arg
ArgAla: 1.68 ± 0.642
ArgCys: 1.68 ± 0.447
ArgAsp: 2.24 ± 0.943
ArgGlu: 3.359 ± 0.559
ArgPhe: 3.08 ± 1.037
ArgGly: 3.08 ± 1.789
ArgHis: 1.68 ± 0.391
ArgIle: 3.639 ± 0.727
ArgLys: 1.68 ± 0.671
ArgLeu: 5.599 ± 1.143
ArgMet: 1.4 ± 0.341
ArgAsn: 3.08 ± 0.787
ArgPro: 0.84 ± 0.484
ArgGln: 1.68 ± 0.671
ArgArg: 1.68 ± 0.384
ArgSer: 3.919 ± 0.581
ArgThr: 1.68 ± 0.74
ArgVal: 1.96 ± 0.104
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.12 ± 0.278
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.919 ± 1.083
SerCys: 3.359 ± 0.648
SerAsp: 4.199 ± 0.435
SerGlu: 2.8 ± 0.423
SerPhe: 3.919 ± 0.378
SerGly: 5.599 ± 1.391
SerHis: 1.96 ± 0.824
SerIle: 7.559 ± 1.512
SerLys: 6.439 ± 0.572
SerLeu: 9.798 ± 1.638
SerMet: 1.96 ± 0.544
SerAsn: 5.879 ± 0.805
SerPro: 3.919 ± 0.939
SerGln: 1.4 ± 0.606
SerArg: 3.359 ± 1.211
SerSer: 8.399 ± 1.026
SerThr: 3.08 ± 0.458
SerVal: 5.319 ± 1.132
SerTrp: 2.52 ± 0.868
SerTyr: 3.639 ± 1.495
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.52 ± 1.609
ThrCys: 1.12 ± 0.387
ThrAsp: 1.96 ± 0.441
ThrGlu: 3.08 ± 0.702
ThrPhe: 1.12 ± 0.301
ThrGly: 2.52 ± 0.832
ThrHis: 2.24 ± 0.605
ThrIle: 3.359 ± 0.392
ThrLys: 5.599 ± 1.044
ThrLeu: 3.919 ± 0.955
ThrMet: 1.68 ± 0.555
ThrAsn: 1.68 ± 0.339
ThrPro: 2.8 ± 0.724
ThrGln: 1.96 ± 0.104
ThrArg: 4.759 ± 1.043
ThrSer: 5.879 ± 0.952
ThrThr: 4.479 ± 1.959
ThrVal: 2.24 ± 0.742
ThrTrp: 1.12 ± 0.646
ThrTyr: 1.4 ± 0.507
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.359 ± 1.164
ValCys: 1.4 ± 0.302
ValAsp: 3.919 ± 1.49
ValGlu: 2.52 ± 1.277
ValPhe: 1.68 ± 0.194
ValGly: 3.919 ± 0.348
ValHis: 1.68 ± 0.65
ValIle: 4.479 ± 1.18
ValLys: 4.759 ± 0.401
ValLeu: 3.919 ± 0.967
ValMet: 1.4 ± 0.606
ValAsn: 4.479 ± 1.582
ValPro: 2.52 ± 0.744
ValGln: 0.28 ± 0.375
ValArg: 1.68 ± 0.339
ValSer: 4.199 ± 1.006
ValThr: 2.52 ± 0.708
ValVal: 3.08 ± 1.67
ValTrp: 0.56 ± 0.595
ValTyr: 2.52 ± 0.52
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.12 ± 0.646
TrpCys: 0.56 ± 0.323
TrpAsp: 0.28 ± 0.412
TrpGlu: 0.56 ± 0.496
TrpPhe: 0.56 ± 0.238
TrpGly: 0.28 ± 0.161
TrpHis: 0.56 ± 0.323
TrpIle: 1.12 ± 0.646
TrpLys: 1.12 ± 0.278
TrpLeu: 1.68 ± 0.677
TrpMet: 0.56 ± 0.498
TrpAsn: 0.28 ± 0.412
TrpPro: 0.0 ± 0.0
TrpGln: 0.28 ± 0.161
TrpArg: 0.84 ± 0.738
TrpSer: 0.28 ± 0.375
TrpThr: 1.12 ± 0.387
TrpVal: 0.28 ± 0.161
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.28 ± 0.298
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.84 ± 0.278
TyrCys: 0.56 ± 0.309
TyrAsp: 3.08 ± 0.739
TyrGlu: 1.4 ± 0.523
TyrPhe: 1.12 ± 0.4
TyrGly: 1.96 ± 0.634
TyrHis: 1.12 ± 0.387
TyrIle: 2.52 ± 0.823
TyrLys: 2.8 ± 0.682
TyrLeu: 4.479 ± 0.536
TyrMet: 1.12 ± 0.681
TyrAsn: 2.52 ± 0.519
TyrPro: 0.56 ± 0.323
TyrGln: 1.4 ± 0.466
TyrArg: 1.96 ± 0.461
TyrSer: 3.359 ± 0.804
TyrThr: 2.24 ± 0.773
TyrVal: 2.8 ± 0.674
TyrTrp: 0.28 ± 0.161
TyrTyr: 1.68 ± 0.65
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.28 ± 0.375
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.28 ± 0.375
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3573 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
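A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 replicates, and the error is taken to be the standard deviation across replicates. The page does not say which spread statistic is reported, and the function names `pair_frequency` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, dipeptide):
    """Per-mille frequency of one dipeptide (one-letter codes),
    counting overlapping pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Spread of the frequency when proteins are resampled with replacement.

    Assumption: the error is the population standard deviation of the
    replicate estimates; the source does not specify the statistic.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, dipeptide))
    return statistics.pstdev(estimates)
```

With only 4 proteins in this proteome, each replicate is drawn from very few units, so the resulting error bars are best read as rough indications rather than precise uncertainties.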

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski