Amino acid dipeptide frequency for Wuhan Mosquito Virus 9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
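As an illustration, dipeptide frequencies could be counted along the following lines. This is a minimal sketch and not the code used to build this page: the function name dipeptide_per_mille, the use of overlapping windows, the decision not to count pairs across protein boundaries, and the mapping of every non-standard letter to X (Xaa) are all assumptions.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Hypothetical helper: pairs are counted in overlapping windows within each
    protein, and any letter outside the 20 standard residues becomes 'X'.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Residues i and i+1 form one dipeptide; windows overlap.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not the real proteome): two short made-up sequences.
freqs = dipeptide_per_mille(["MKLLSA", "LLSB"])
print(freqs["LL"])
```

The result uses one-letter codes (for example "LS" for LeuSer); converting to three-letter names is left out to keep the sketch short.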

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
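The uniform expectation can be checked with a few lines of arithmetic. The sketch below is only illustrative: the helper names are hypothetical, and the exact threshold the page uses for its red and blue marking is not stated beyond the 0.25% figure, so comparing directly against 2.5‰ is an assumption.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency of one dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, expected):
    """Label a dipeptide as over- or underrepresented relative to `expected`."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille()   # 20 x 20 = 400 dipeptides
print(expected)
print(classify(13.977, expected))            # LeuSer value from the table
print(classify(0.254, expected))             # CysCys value from the table
```

With Xaa included as a 21st residue (441 dipeptides), uniform_expectation_per_mille(21) gives roughly 2.27‰ instead.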

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error (‰).

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.05 ± 0.911 | 1.017 ± 0.499 | 2.287 ± 0.377 | 3.812 ± 0.522 | 1.271 ± 0.484 | 1.779 ± 0.692 | 1.271 ± 0.696 | 3.558 ± 1.233 | 5.591 ± 1.245 | 4.574 ± 0.783 | 2.541 ± 1.113 | 1.017 ± 0.488 | 3.304 ± 0.74 | 1.271 ± 0.732 | 4.066 ± 0.732 | 2.795 ± 0.817 | 4.574 ± 0.887 | 3.304 ± 0.726 | 1.017 ± 0.655 | 2.287 ± 0.667 | 0.0 ± 0.0
Cys | 0.762 ± 0.31 | 0.254 ± 0.157 | 1.017 ± 0.434 | 0.254 ± 0.157 | 0.508 ± 0.242 | 0.762 ± 0.667 | 0.762 ± 0.314 | 1.017 ± 0.849 | 0.762 ± 0.491 | 2.033 ± 0.631 | 0.0 ± 0.0 | 0.508 ± 0.237 | 1.017 ± 0.518 | 1.525 ± 0.745 | 1.779 ± 1.036 | 1.779 ± 0.352 | 1.271 ± 0.612 | 2.541 ± 0.581 | 0.508 ± 0.242 | 2.033 ± 0.823 | 0.0 ± 0.0
Asp | 2.795 ± 0.892 | 0.254 ± 0.157 | 3.05 ± 0.961 | 3.05 ± 0.912 | 1.779 ± 0.425 | 3.304 ± 0.435 | 2.541 ± 0.655 | 3.304 ± 0.493 | 2.795 ± 0.503 | 6.099 ± 1.04 | 1.017 ± 0.504 | 2.033 ± 0.492 | 3.05 ± 0.832 | 2.287 ± 0.775 | 1.525 ± 0.503 | 2.795 ± 0.497 | 0.762 ± 0.47 | 2.287 ± 1.013 | 1.525 ± 0.257 | 1.017 ± 0.385 | 0.0 ± 0.0
Glu | 4.574 ± 0.967 | 0.762 ± 0.825 | 1.779 ± 0.674 | 6.099 ± 0.787 | 1.017 ± 0.365 | 5.337 ± 1.069 | 2.033 ± 0.451 | 6.607 ± 0.954 | 3.812 ± 0.737 | 6.099 ± 0.825 | 0.508 ± 0.248 | 1.779 ± 0.669 | 1.271 ± 1.09 | 2.795 ± 1.196 | 1.525 ± 0.629 | 5.591 ± 0.841 | 3.05 ± 1.013 | 3.304 ± 1.871 | 1.271 ± 0.46 | 1.525 ± 0.628 | 0.0 ± 0.0
Phe | 1.525 ± 0.665 | 1.525 ± 0.71 | 1.271 ± 0.541 | 2.033 ± 0.386 | 2.795 ± 0.472 | 2.287 ± 0.718 | 0.508 ± 0.237 | 2.795 ± 0.578 | 2.287 ± 1.013 | 5.845 ± 1.385 | 0.762 ± 0.475 | 1.017 ± 0.297 | 2.287 ± 0.877 | 2.033 ± 0.726 | 2.795 ± 0.555 | 4.066 ± 0.943 | 3.05 ± 0.908 | 2.033 ± 0.467 | 0.0 ± 0.0 | 1.017 ± 0.735 | 0.254 ± 0.26
Gly | 2.541 ± 0.553 | 0.508 ± 0.437 | 2.541 ± 0.819 | 3.558 ± 1.932 | 3.304 ± 0.525 | 3.304 ± 1.059 | 2.033 ± 0.798 | 1.271 ± 0.51 | 3.05 ± 0.485 | 7.878 ± 1.031 | 0.762 ± 0.31 | 2.287 ± 0.553 | 2.287 ± 0.684 | 1.779 ± 0.473 | 2.541 ± 1.006 | 4.32 ± 0.601 | 2.541 ± 0.541 | 2.795 ± 0.595 | 1.017 ± 0.385 | 2.795 ± 0.637 | 0.0 ± 0.0
His | 1.017 ± 0.432 | 0.762 ± 0.61 | 1.271 ± 0.424 | 0.762 ± 0.389 | 1.779 ± 0.203 | 1.779 ± 0.86 | 0.508 ± 0.337 | 0.762 ± 0.47 | 1.271 ± 0.381 | 4.828 ± 0.591 | 0.254 ± 0.157 | 0.762 ± 0.47 | 2.033 ± 0.747 | 1.017 ± 0.434 | 1.779 ± 0.475 | 1.525 ± 0.978 | 2.033 ± 0.77 | 1.271 ± 0.612 | 0.508 ± 0.313 | 1.525 ± 0.47 | 0.0 ± 0.0
Ile | 3.304 ± 0.636 | 1.017 ± 0.363 | 1.525 ± 0.686 | 3.558 ± 0.703 | 2.033 ± 0.485 | 3.812 ± 0.698 | 1.017 ± 0.488 | 3.304 ± 1.081 | 4.828 ± 0.561 | 4.32 ± 1.085 | 1.525 ± 0.418 | 3.558 ± 1.066 | 4.066 ± 0.547 | 2.541 ± 0.651 | 4.574 ± 1.064 | 6.607 ± 2.015 | 2.287 ± 0.426 | 2.033 ± 1.017 | 1.525 ± 0.503 | 2.287 ± 0.481 | 0.254 ± 0.276
Lys | 3.558 ± 1.136 | 0.762 ± 0.389 | 2.541 ± 0.651 | 2.287 ± 0.762 | 2.795 ± 0.952 | 3.304 ± 0.976 | 1.017 ± 0.627 | 5.083 ± 0.972 | 3.558 ± 1.557 | 5.591 ± 1.005 | 2.287 ± 0.357 | 3.812 ± 0.4 | 2.287 ± 0.645 | 1.525 ± 0.742 | 3.05 ± 0.71 | 3.304 ± 1.478 | 4.066 ± 0.998 | 5.083 ± 0.935 | 0.762 ± 0.317 | 0.762 ± 0.315 | 0.0 ± 0.0
Leu | 6.861 ± 1.648 | 2.033 ± 0.31 | 5.591 ± 0.892 | 5.337 ± 1.213 | 4.574 ± 1.293 | 4.32 ± 0.894 | 3.558 ± 0.951 | 6.099 ± 0.844 | 5.337 ± 1.232 | 10.165 ± 1.352 | 2.795 ± 0.952 | 5.083 ± 1.371 | 5.337 ± 1.02 | 3.812 ± 0.665 | 6.099 ± 1.573 | 13.977 ± 2.12 | 5.845 ± 1.028 | 6.607 ± 1.25 | 0.762 ± 0.31 | 2.795 ± 0.635 | 0.254 ± 0.276
Met | 1.779 ± 0.54 | 0.762 ± 0.31 | 1.525 ± 0.53 | 1.525 ± 0.761 | 0.762 ± 0.294 | 1.525 ± 0.422 | 0.508 ± 0.253 | 1.017 ± 0.627 | 0.762 ± 0.389 | 1.779 ± 0.835 | 0.254 ± 0.312 | 0.762 ± 0.599 | 0.508 ± 0.313 | 0.508 ± 0.313 | 1.271 ± 0.571 | 3.558 ± 1.157 | 1.525 ± 0.335 | 1.525 ± 0.892 | 0.0 ± 0.0 | 0.508 ± 0.313 | 0.0 ± 0.0
Asn | 3.05 ± 0.765 | 2.795 ± 0.321 | 1.525 ± 0.503 | 2.795 ± 0.762 | 2.541 ± 0.956 | 1.271 ± 0.424 | 1.271 ± 0.39 | 2.541 ± 0.898 | 1.271 ± 0.751 | 3.812 ± 1.618 | 1.017 ± 0.434 | 1.017 ± 0.434 | 2.795 ± 0.563 | 2.541 ± 0.394 | 1.525 ± 0.416 | 2.287 ± 0.719 | 2.287 ± 0.499 | 3.304 ± 0.896 | 0.762 ± 0.478 | 1.779 ± 0.674 | 0.0 ± 0.0
Pro | 2.795 ± 0.951 | 2.033 ± 0.485 | 3.558 ± 0.88 | 4.32 ± 0.652 | 1.017 ± 0.285 | 1.525 ± 0.629 | 0.762 ± 0.268 | 1.525 ± 0.503 | 3.304 ± 1.589 | 5.083 ± 0.955 | 0.508 ± 0.403 | 1.525 ± 0.503 | 3.304 ± 0.784 | 1.271 ± 0.39 | 2.541 ± 0.758 | 4.32 ± 0.481 | 2.795 ± 0.786 | 4.828 ± 0.494 | 1.271 ± 0.463 | 2.541 ± 0.363 | 0.0 ± 0.0
Gln | 1.525 ± 0.759 | 0.762 ± 0.371 | 1.017 ± 0.366 | 1.779 ± 0.508 | 1.017 ± 0.488 | 2.033 ± 0.248 | 1.271 ± 0.39 | 1.271 ± 0.621 | 2.287 ± 0.726 | 7.37 ± 1.227 | 1.017 ± 0.347 | 2.033 ± 0.646 | 1.525 ± 0.649 | 1.017 ± 0.413 | 2.033 ± 0.521 | 2.033 ± 0.521 | 2.541 ± 0.332 | 3.05 ± 0.365 | 0.508 ± 0.242 | 0.508 ± 0.237 | 0.0 ± 0.0
Arg | 2.541 ± 1.083 | 1.017 ± 0.488 | 2.287 ± 0.421 | 4.066 ± 0.975 | 3.05 ± 0.741 | 1.271 ± 0.381 | 1.779 ± 1.097 | 4.32 ± 0.597 | 4.066 ± 0.713 | 4.066 ± 0.75 | 1.271 ± 0.428 | 2.795 ± 0.841 | 1.525 ± 0.461 | 2.287 ± 0.521 | 2.541 ± 0.68 | 4.574 ± 1.118 | 3.05 ± 0.748 | 3.558 ± 0.976 | 1.017 ± 0.406 | 1.779 ± 0.671 | 0.0 ± 0.0
Ser | 3.05 ± 1.093 | 1.017 ± 0.672 | 4.828 ± 1.096 | 4.828 ± 1.405 | 4.828 ± 0.903 | 5.083 ± 0.846 | 2.287 ± 1.168 | 6.607 ± 0.897 | 4.066 ± 0.908 | 10.673 ± 1.454 | 2.033 ± 0.621 | 3.812 ± 0.549 | 4.32 ± 0.481 | 3.304 ± 0.487 | 3.812 ± 0.728 | 8.386 ± 1.321 | 4.32 ± 1.365 | 5.591 ± 0.567 | 1.779 ± 0.604 | 3.05 ± 0.828 | 0.0 ± 0.0
Thr | 3.812 ± 1.041 | 0.254 ± 0.276 | 3.05 ± 0.623 | 3.05 ± 1.086 | 2.033 ± 0.278 | 4.828 ± 0.571 | 0.762 ± 0.47 | 2.795 ± 0.648 | 2.033 ± 0.467 | 5.845 ± 0.585 | 1.525 ± 0.535 | 1.779 ± 0.582 | 3.558 ± 0.558 | 1.271 ± 0.766 | 3.812 ± 1.037 | 5.083 ± 1.353 | 3.558 ± 0.619 | 4.574 ± 1.156 | 1.271 ± 0.404 | 1.779 ± 0.363 | 0.254 ± 0.276
Val | 3.05 ± 0.836 | 2.541 ± 0.711 | 4.32 ± 0.996 | 3.558 ± 0.806 | 2.795 ± 1.247 | 3.304 ± 1.118 | 2.287 ± 0.697 | 3.812 ± 0.708 | 3.304 ± 0.47 | 4.574 ± 1.422 | 1.271 ± 0.541 | 3.304 ± 0.597 | 3.558 ± 0.407 | 2.033 ± 0.567 | 2.795 ± 1.535 | 4.828 ± 1.031 | 4.574 ± 0.487 | 4.32 ± 0.388 | 1.779 ± 0.522 | 3.304 ± 1.321 | 0.0 ± 0.0
Trp | 0.762 ± 0.314 | 0.254 ± 0.157 | 0.508 ± 0.358 | 1.779 ± 0.501 | 1.017 ± 0.512 | 0.762 ± 0.47 | 0.508 ± 0.313 | 0.508 ± 0.237 | 1.017 ± 0.589 | 1.525 ± 0.416 | 0.254 ± 0.275 | 0.762 ± 0.291 | 1.017 ± 0.73 | 0.508 ± 0.253 | 0.762 ± 0.47 | 2.541 ± 0.927 | 1.271 ± 0.39 | 1.779 ± 0.733 | 0.0 ± 0.0 | 0.508 ± 0.242 | 0.0 ± 0.0
Tyr | 2.033 ± 0.793 | 0.762 ± 0.607 | 1.779 ± 0.774 | 2.033 ± 0.594 | 1.017 ± 0.742 | 1.525 ± 0.473 | 0.762 ± 0.314 | 2.033 ± 0.553 | 2.033 ± 1.378 | 4.828 ± 0.808 | 0.508 ± 0.253 | 2.541 ± 0.596 | 1.779 ± 0.742 | 1.271 ± 0.51 | 1.779 ± 0.575 | 3.304 ± 0.328 | 1.525 ± 0.923 | 1.271 ± 0.289 | 0.762 ± 0.489 | 1.271 ± 0.336 | 0.254 ± 0.276
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.254 ± 0.276 | 0.254 ± 0.276 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.254 ± 0.26 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.254 ± 0.276 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.254 ± 0.276 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 6 proteins (3936 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
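A protein-level bootstrap of this kind might look like the sketch below. The page does not describe the procedure in detail, so several points are assumptions: proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides (one-letter codes) in a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread (per mille) of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation of the replicate estimates.
    return statistics.pstdev(estimates)


# Toy proteins (made up, not the six proteins of this proteome).
toy = ["MKLLSALLS", "MSSLKV", "MLSLSKK", "MAAKLL"]
print(bootstrap_error(toy, "LS"))
```

With only a handful of proteins, as here, the resampled sets differ a lot from one another, which is consistent with the relatively large errors in the table.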

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski