Amino acid dipeptide frequency for Wuhan Insect virus 6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
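The counting described above can be sketched in Python as follows. This is only an illustration, not the database's own code: the function names, the one-letter alphabet, the use of overlapping windows within each protein, and the normalisation by the total number of dipeptides are all assumptions, since the exact counting convention is not stated on this page.

```python
from collections import Counter
from itertools import product

# Assumption: one-letter codes for the 20 standard amino acids; anything else becomes X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Expected frequency of each dipeptide under the uniform model: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: overlapping pairs are counted within each protein and never
    across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown symbol to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation (red/blue in the table)."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


# Toy input, not real sequences from this proteome.
toy = ["AAC", "ACA"]
freqs = dipeptide_permille(toy)
print(freqs["AC"], classify(freqs["AC"]))
```

With real data, `proteins` would be the list of protein sequences of the proteome; the resulting dictionary has one entry per cell of the table.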

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
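As a worked example of the unit conversion, the short Python sketch below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the input value 11.304 is the LeuSer entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0


leu_ser = 11.304                           # LeuSer value from the table, in per mille
fraction = permille_to_fraction(leu_ser)   # about 0.0113
percent = permille_to_percent(leu_ser)     # about 1.13 %
uniform = 1 / 400                          # 0.0025, i.e. 0.25 %
ratio = fraction / uniform                 # about 4.5 times the uniform expectation

print(f"{fraction:.6f} {percent:.4f}% ratio={ratio:.2f}")
```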

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.103 ± 1.145
AlaCys: 0.789 ± 0.492
AlaAsp: 4.206 ± 1.076
AlaGlu: 2.103 ± 0.966
AlaPhe: 1.577 ± 0.45
AlaGly: 2.629 ± 1.005
AlaHis: 0.263 ± 0.156
AlaIle: 2.892 ± 1.925
AlaLys: 3.417 ± 1.248
AlaLeu: 2.629 ± 0.681
AlaMet: 1.577 ± 0.77
AlaAsn: 1.577 ± 0.59
AlaPro: 0.789 ± 0.362
AlaGln: 0.263 ± 0.415
AlaArg: 2.103 ± 0.781
AlaSer: 3.68 ± 0.608
AlaThr: 2.366 ± 0.55
AlaVal: 3.943 ± 1.473
AlaTrp: 0.526 ± 0.292
AlaTyr: 1.052 ± 0.86
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.789 ± 1.086
CysCys: 0.526 ± 0.292
CysAsp: 1.84 ± 0.486
CysGlu: 1.314 ± 0.406
CysPhe: 0.526 ± 0.311
CysGly: 1.052 ± 0.826
CysHis: 0.526 ± 0.724
CysIle: 1.052 ± 0.345
CysLys: 0.526 ± 0.347
CysLeu: 1.314 ± 0.778
CysMet: 1.052 ± 0.369
CysAsn: 1.314 ± 0.406
CysPro: 1.577 ± 0.64
CysGln: 0.263 ± 0.297
CysArg: 1.052 ± 0.561
CysSer: 0.789 ± 0.358
CysThr: 1.84 ± 1.019
CysVal: 0.263 ± 0.156
CysTrp: 0.263 ± 0.156
CysTyr: 0.789 ± 0.274
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.84 ± 0.823
AspCys: 0.789 ± 0.629
AspAsp: 6.572 ± 1.655
AspGlu: 2.366 ± 0.848
AspPhe: 3.155 ± 0.77
AspGly: 3.155 ± 0.97
AspHis: 1.314 ± 0.538
AspIle: 5.783 ± 1.768
AspLys: 4.469 ± 1.554
AspLeu: 7.624 ± 1.67
AspMet: 1.84 ± 0.486
AspAsn: 4.206 ± 1.574
AspPro: 2.629 ± 1.694
AspGln: 2.103 ± 0.459
AspArg: 2.366 ± 1.192
AspSer: 4.995 ± 0.87
AspThr: 2.629 ± 0.787
AspVal: 6.309 ± 1.267
AspTrp: 0.789 ± 0.467
AspTyr: 2.103 ± 0.459
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.84 ± 0.772
GluCys: 1.052 ± 0.637
GluAsp: 3.68 ± 1.19
GluGlu: 2.892 ± 0.775
GluPhe: 3.155 ± 1.0
GluGly: 3.417 ± 1.309
GluHis: 1.577 ± 1.231
GluIle: 3.155 ± 1.625
GluLys: 2.892 ± 0.947
GluLeu: 5.258 ± 0.992
GluMet: 3.155 ± 1.173
GluAsn: 3.417 ± 1.237
GluPro: 1.314 ± 0.622
GluGln: 0.263 ± 0.156
GluArg: 2.366 ± 0.594
GluSer: 3.68 ± 1.453
GluThr: 3.417 ± 0.842
GluVal: 2.629 ± 0.585
GluTrp: 1.314 ± 0.478
GluTyr: 2.103 ± 1.275
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.314 ± 0.569
PheCys: 1.314 ± 0.741
PheAsp: 2.366 ± 0.708
PheGlu: 1.84 ± 0.373
PhePhe: 0.263 ± 0.415
PheGly: 1.052 ± 0.637
PheHis: 0.0 ± 0.0
PheIle: 1.84 ± 0.687
PheLys: 2.103 ± 0.898
PheLeu: 3.68 ± 0.815
PheMet: 0.789 ± 0.274
PheAsn: 1.577 ± 0.722
PhePro: 2.366 ± 0.769
PheGln: 0.0 ± 0.0
PheArg: 2.629 ± 0.881
PheSer: 2.892 ± 0.984
PheThr: 2.366 ± 0.774
PheVal: 2.892 ± 1.426
PheTrp: 0.263 ± 0.156
PheTyr: 0.789 ± 0.362
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.577 ± 0.431
GlyCys: 0.526 ± 0.724
GlyAsp: 3.943 ± 0.82
GlyGlu: 1.052 ± 0.849
GlyPhe: 1.052 ± 0.561
GlyGly: 1.84 ± 0.58
GlyHis: 3.417 ± 0.756
GlyIle: 3.417 ± 0.771
GlyLys: 4.206 ± 0.976
GlyLeu: 6.835 ± 1.013
GlyMet: 2.629 ± 0.739
GlyAsn: 2.892 ± 0.807
GlyPro: 1.84 ± 1.566
GlyGln: 0.789 ± 0.389
GlyArg: 2.629 ± 0.87
GlySer: 4.732 ± 0.963
GlyThr: 3.155 ± 0.601
GlyVal: 3.943 ± 0.917
GlyTrp: 0.789 ± 0.467
GlyTyr: 1.84 ± 0.574
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.052 ± 0.881
HisCys: 0.0 ± 0.0
HisAsp: 2.103 ± 0.551
HisGlu: 0.263 ± 0.156
HisPhe: 0.526 ± 0.311
HisGly: 0.526 ± 0.281
HisHis: 0.526 ± 0.311
HisIle: 1.577 ± 0.544
HisLys: 1.314 ± 0.713
HisLeu: 2.103 ± 0.795
HisMet: 1.314 ± 0.46
HisAsn: 1.577 ± 0.594
HisPro: 2.366 ± 0.649
HisGln: 0.526 ± 0.281
HisArg: 1.577 ± 0.446
HisSer: 0.526 ± 0.281
HisThr: 1.314 ± 0.778
HisVal: 0.789 ± 0.467
HisTrp: 0.263 ± 0.156
HisTyr: 0.789 ± 0.274
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.84 ± 0.373
IleCys: 1.052 ± 0.345
IleAsp: 2.629 ± 1.094
IleGlu: 2.103 ± 0.858
IlePhe: 2.366 ± 0.809
IleGly: 4.995 ± 1.1
IleHis: 1.314 ± 0.551
IleIle: 3.943 ± 1.13
IleLys: 4.995 ± 0.905
IleLeu: 6.046 ± 1.916
IleMet: 2.892 ± 0.749
IleAsn: 4.469 ± 1.383
IlePro: 2.366 ± 0.69
IleGln: 1.577 ± 0.795
IleArg: 4.206 ± 0.843
IleSer: 5.521 ± 1.092
IleThr: 5.521 ± 1.353
IleVal: 4.732 ± 0.552
IleTrp: 1.052 ± 0.572
IleTyr: 2.892 ± 0.791
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.629 ± 0.806
LysCys: 0.789 ± 0.629
LysAsp: 3.417 ± 0.612
LysGlu: 7.098 ± 2.024
LysPhe: 2.103 ± 0.691
LysGly: 4.732 ± 1.511
LysHis: 1.314 ± 0.639
LysIle: 4.206 ± 0.657
LysLys: 3.155 ± 0.968
LysLeu: 5.521 ± 1.604
LysMet: 2.629 ± 1.091
LysAsn: 3.417 ± 1.346
LysPro: 2.366 ± 0.875
LysGln: 1.314 ± 0.532
LysArg: 3.68 ± 0.607
LysSer: 4.206 ± 1.273
LysThr: 3.943 ± 0.493
LysVal: 6.046 ± 0.879
LysTrp: 0.789 ± 0.467
LysTyr: 2.103 ± 0.924
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.206 ± 0.57
LeuCys: 2.366 ± 0.649
LeuAsp: 5.258 ± 1.65
LeuGlu: 4.469 ± 0.756
LeuPhe: 3.417 ± 0.882
LeuGly: 6.046 ± 1.293
LeuHis: 2.629 ± 0.662
LeuIle: 8.412 ± 1.906
LeuLys: 5.783 ± 1.008
LeuLeu: 8.149 ± 2.139
LeuMet: 3.417 ± 0.789
LeuAsn: 4.732 ± 0.992
LeuPro: 3.943 ± 0.958
LeuGln: 2.629 ± 0.955
LeuArg: 6.572 ± 1.073
LeuSer: 11.304 ± 1.745
LeuThr: 7.886 ± 1.087
LeuVal: 4.469 ± 1.287
LeuTrp: 1.84 ± 0.8
LeuTyr: 3.155 ± 1.347
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.103 ± 0.986
MetCys: 1.052 ± 0.369
MetAsp: 2.629 ± 0.747
MetGlu: 2.892 ± 1.026
MetPhe: 2.103 ± 1.244
MetGly: 1.577 ± 0.724
MetHis: 0.526 ± 0.292
MetIle: 2.629 ± 0.673
MetLys: 2.629 ± 1.207
MetLeu: 2.629 ± 0.921
MetMet: 0.526 ± 0.347
MetAsn: 0.789 ± 1.202
MetPro: 1.052 ± 0.389
MetGln: 0.789 ± 0.467
MetArg: 3.155 ± 1.036
MetSer: 4.206 ± 0.81
MetThr: 2.366 ± 1.025
MetVal: 1.314 ± 0.532
MetTrp: 0.526 ± 0.281
MetTyr: 1.314 ± 0.46
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.68 ± 0.923
AsnCys: 1.577 ± 0.4
AsnAsp: 2.892 ± 0.975
AsnGlu: 1.84 ± 0.546
AsnPhe: 1.577 ± 0.629
AsnGly: 2.892 ± 0.647
AsnHis: 0.263 ± 0.362
AsnIle: 2.103 ± 1.167
AsnLys: 2.892 ± 0.268
AsnLeu: 4.469 ± 1.226
AsnMet: 2.103 ± 0.409
AsnAsn: 1.84 ± 1.211
AsnPro: 4.995 ± 0.892
AsnGln: 2.366 ± 0.573
AsnArg: 3.155 ± 0.422
AsnSer: 4.206 ± 1.398
AsnThr: 3.417 ± 0.69
AsnVal: 2.366 ± 1.119
AsnTrp: 0.789 ± 0.467
AsnTyr: 2.892 ± 0.533
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.526 ± 0.292
ProCys: 0.526 ± 0.58
ProAsp: 4.206 ± 0.945
ProGlu: 2.629 ± 0.662
ProPhe: 0.789 ± 0.339
ProGly: 1.052 ± 0.462
ProHis: 0.789 ± 0.274
ProIle: 3.417 ± 0.786
ProLys: 3.155 ± 0.914
ProLeu: 5.783 ± 1.107
ProMet: 1.052 ± 0.43
ProAsn: 2.892 ± 1.396
ProPro: 1.577 ± 0.446
ProGln: 1.314 ± 0.606
ProArg: 1.84 ± 0.53
ProSer: 3.155 ± 0.95
ProThr: 3.155 ± 0.661
ProVal: 2.366 ± 0.469
ProTrp: 0.526 ± 0.281
ProTyr: 1.84 ± 1.089
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.314 ± 0.854
GlnCys: 0.526 ± 0.281
GlnAsp: 0.789 ± 0.467
GlnGlu: 1.052 ± 0.843
GlnPhe: 1.052 ± 0.78
GlnGly: 0.526 ± 0.281
GlnHis: 0.0 ± 0.0
GlnIle: 2.892 ± 0.62
GlnLys: 1.577 ± 0.747
GlnLeu: 3.943 ± 0.906
GlnMet: 0.789 ± 0.447
GlnAsn: 0.789 ± 0.385
GlnPro: 0.526 ± 0.311
GlnGln: 1.052 ± 0.561
GlnArg: 0.789 ± 0.362
GlnSer: 2.366 ± 0.661
GlnThr: 1.84 ± 0.637
GlnVal: 1.314 ± 0.92
GlnTrp: 0.263 ± 0.362
GlnTyr: 1.052 ± 0.43
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.629 ± 1.468
ArgCys: 0.789 ± 0.467
ArgAsp: 4.206 ± 0.742
ArgGlu: 3.68 ± 0.685
ArgPhe: 1.052 ± 0.289
ArgGly: 4.206 ± 1.122
ArgHis: 0.789 ± 0.339
ArgIle: 4.469 ± 1.106
ArgLys: 2.892 ± 0.71
ArgLeu: 5.521 ± 1.414
ArgMet: 2.103 ± 1.244
ArgAsn: 1.052 ± 0.622
ArgPro: 1.84 ± 1.089
ArgGln: 1.84 ± 0.542
ArgArg: 2.366 ± 0.594
ArgSer: 3.68 ± 0.808
ArgThr: 3.68 ± 1.189
ArgVal: 2.892 ± 0.473
ArgTrp: 1.052 ± 0.462
ArgTyr: 2.892 ± 0.71
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.68 ± 0.942
SerCys: 1.84 ± 0.803
SerAsp: 6.309 ± 2.032
SerGlu: 6.046 ± 1.893
SerPhe: 2.629 ± 0.502
SerGly: 4.206 ± 1.168
SerHis: 2.366 ± 0.691
SerIle: 5.258 ± 0.64
SerLys: 4.995 ± 1.53
SerLeu: 7.886 ± 2.952
SerMet: 2.366 ± 0.798
SerAsn: 6.046 ± 0.932
SerPro: 3.943 ± 1.275
SerGln: 1.577 ± 0.511
SerArg: 4.469 ± 1.393
SerSer: 11.304 ± 2.034
SerThr: 6.046 ± 1.578
SerVal: 4.469 ± 1.25
SerTrp: 1.577 ± 0.594
SerTyr: 2.892 ± 0.662
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.103 ± 0.451
ThrCys: 1.577 ± 0.615
ThrAsp: 4.469 ± 1.346
ThrGlu: 3.68 ± 0.789
ThrPhe: 1.84 ± 0.778
ThrGly: 2.892 ± 0.742
ThrHis: 0.789 ± 0.532
ThrIle: 2.892 ± 0.58
ThrLys: 4.732 ± 1.479
ThrLeu: 6.835 ± 1.191
ThrMet: 2.629 ± 0.502
ThrAsn: 2.892 ± 1.221
ThrPro: 2.366 ± 0.717
ThrGln: 3.155 ± 1.202
ThrArg: 2.892 ± 0.404
ThrSer: 6.835 ± 0.692
ThrThr: 2.629 ± 0.717
ThrVal: 4.732 ± 0.896
ThrTrp: 1.577 ± 0.588
ThrTyr: 2.892 ± 0.471
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.892 ± 0.903
ValCys: 1.314 ± 0.756
ValAsp: 2.629 ± 0.765
ValGlu: 2.103 ± 0.621
ValPhe: 1.84 ± 0.737
ValGly: 3.155 ± 0.392
ValHis: 0.789 ± 0.467
ValIle: 4.732 ± 1.75
ValLys: 5.783 ± 2.298
ValLeu: 5.521 ± 1.585
ValMet: 2.103 ± 0.642
ValAsn: 2.629 ± 0.959
ValPro: 2.892 ± 0.74
ValGln: 1.052 ± 0.766
ValArg: 3.943 ± 1.28
ValSer: 7.361 ± 1.614
ValThr: 4.732 ± 1.084
ValVal: 5.521 ± 1.285
ValTrp: 1.314 ± 0.569
ValTyr: 2.103 ± 0.579
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.526 ± 0.281
TrpCys: 0.0 ± 0.0
TrpAsp: 1.314 ± 0.519
TrpGlu: 1.052 ± 0.423
TrpPhe: 0.526 ± 0.347
TrpGly: 1.314 ± 0.46
TrpHis: 0.263 ± 0.156
TrpIle: 0.789 ± 0.274
TrpLys: 1.314 ± 0.318
TrpLeu: 2.629 ± 1.057
TrpMet: 0.789 ± 0.389
TrpAsn: 1.052 ± 0.345
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.789 ± 0.467
TrpSer: 1.052 ± 0.345
TrpThr: 0.789 ± 0.274
TrpVal: 0.526 ± 0.292
TrpTrp: 0.526 ± 0.311
TrpTyr: 0.789 ± 0.629
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.84 ± 1.424
TyrCys: 0.263 ± 0.156
TyrAsp: 1.577 ± 0.409
TyrGlu: 2.103 ± 0.553
TyrPhe: 0.789 ± 0.702
TyrGly: 1.84 ± 0.611
TyrHis: 1.577 ± 0.446
TyrIle: 1.052 ± 0.637
TyrLys: 2.892 ± 0.984
TyrLeu: 6.309 ± 1.458
TyrMet: 0.789 ± 0.339
TyrAsn: 3.155 ± 0.837
TyrPro: 1.84 ± 0.737
TyrGln: 1.577 ± 0.511
TyrArg: 1.052 ± 0.345
TyrSer: 3.417 ± 0.972
TyrThr: 1.314 ± 0.608
TyrVal: 2.892 ± 1.166
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.314 ± 0.46
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3805 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
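A protein-level bootstrap of this kind can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the database's implementation: the function names are hypothetical, whole proteins are resampled with replacement 100 times, and the error is taken to be the sample standard deviation of the resampled frequencies, which this page does not confirm.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille, counting overlapping pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins.

    Assumption: the error is the standard deviation across resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not real proteins from this proteome.
toy = ["ACAC", "CCCC", "AAAC"]
print(pair_permille(toy, "AC"), bootstrap_error(toy, "AC"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome. With only 6 proteins, as here, that spread can be large relative to the value itself.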

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski