Amino acid dipepetide frequency for Wuhan Insect virus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.533AlaAla: 2.533 ± 1.236
1.407AlaCys: 1.407 ± 0.514
2.252AlaAsp: 2.252 ± 0.801
2.533AlaGlu: 2.533 ± 1.003
3.096AlaPhe: 3.096 ± 1.135
2.533AlaGly: 2.533 ± 0.796
0.563AlaHis: 0.563 ± 0.329
2.252AlaIle: 2.252 ± 1.215
2.252AlaLys: 2.252 ± 1.593
5.066AlaLeu: 5.066 ± 0.746
1.126AlaMet: 1.126 ± 0.789
1.407AlaAsn: 1.407 ± 0.952
1.689AlaPro: 1.689 ± 1.355
2.252AlaGln: 2.252 ± 0.267
1.97AlaArg: 1.97 ± 0.607
2.533AlaSer: 2.533 ± 0.392
2.252AlaThr: 2.252 ± 0.775
3.377AlaVal: 3.377 ± 1.66
0.0AlaTrp: 0.0 ± 0.0
2.252AlaTyr: 2.252 ± 0.969
0.0AlaXaa: 0.0 ± 0.0
Cys
0.844CysAla: 0.844 ± 0.333
0.281CysCys: 0.281 ± 0.164
0.844CysAsp: 0.844 ± 0.493
0.563CysGlu: 0.563 ± 0.329
0.281CysPhe: 0.281 ± 0.164
1.689CysGly: 1.689 ± 0.665
0.844CysHis: 0.844 ± 0.333
0.563CysIle: 0.563 ± 0.329
0.281CysLys: 0.281 ± 0.164
2.252CysLeu: 2.252 ± 0.717
0.0CysMet: 0.0 ± 0.0
1.407CysAsn: 1.407 ± 0.457
0.844CysPro: 0.844 ± 0.741
0.844CysGln: 0.844 ± 1.249
1.407CysArg: 1.407 ± 1.201
1.407CysSer: 1.407 ± 1.075
1.126CysThr: 1.126 ± 0.797
1.126CysVal: 1.126 ± 0.678
0.281CysTrp: 0.281 ± 0.164
0.281CysTyr: 0.281 ± 0.416
0.0CysXaa: 0.0 ± 0.0
Asp
1.97AspAla: 1.97 ± 0.629
1.407AspCys: 1.407 ± 0.381
0.844AspAsp: 0.844 ± 0.412
4.503AspGlu: 4.503 ± 0.78
2.815AspPhe: 2.815 ± 1.424
1.407AspGly: 1.407 ± 0.587
0.844AspHis: 0.844 ± 0.412
1.407AspIle: 1.407 ± 0.798
4.222AspLys: 4.222 ± 0.997
7.599AspLeu: 7.599 ± 1.462
2.252AspMet: 2.252 ± 0.266
2.533AspAsn: 2.533 ± 0.373
4.222AspPro: 4.222 ± 1.758
1.97AspGln: 1.97 ± 0.58
1.97AspArg: 1.97 ± 0.787
5.066AspSer: 5.066 ± 1.216
2.252AspThr: 2.252 ± 0.64
2.252AspVal: 2.252 ± 0.594
1.97AspTrp: 1.97 ± 0.726
1.407AspTyr: 1.407 ± 0.822
0.0AspXaa: 0.0 ± 0.0
Glu
3.096GluAla: 3.096 ± 0.66
1.126GluCys: 1.126 ± 0.499
3.94GluAsp: 3.94 ± 1.033
3.659GluGlu: 3.659 ± 0.539
3.94GluPhe: 3.94 ± 0.929
3.659GluGly: 3.659 ± 0.776
0.844GluHis: 0.844 ± 0.333
4.222GluIle: 4.222 ± 0.909
1.97GluLys: 1.97 ± 0.835
6.192GluLeu: 6.192 ± 2.741
1.126GluMet: 1.126 ± 0.664
2.815GluAsn: 2.815 ± 0.569
1.689GluPro: 1.689 ± 0.964
2.815GluGln: 2.815 ± 1.952
3.096GluArg: 3.096 ± 1.025
3.096GluSer: 3.096 ± 0.986
5.066GluThr: 5.066 ± 1.072
4.503GluVal: 4.503 ± 0.557
0.0GluTrp: 0.0 ± 0.0
1.97GluTyr: 1.97 ± 1.002
0.0GluXaa: 0.0 ± 0.0
Phe
0.844PheAla: 0.844 ± 0.914
0.844PheCys: 0.844 ± 0.741
1.126PheAsp: 1.126 ± 0.401
1.126PheGlu: 1.126 ± 0.658
2.533PhePhe: 2.533 ± 0.756
1.97PheGly: 1.97 ± 0.704
1.407PheHis: 1.407 ± 0.587
1.97PheIle: 1.97 ± 0.664
4.785PheLys: 4.785 ± 1.499
5.91PheLeu: 5.91 ± 1.54
1.689PheMet: 1.689 ± 1.199
1.689PheAsn: 1.689 ± 0.463
3.377PhePro: 3.377 ± 0.935
1.689PheGln: 1.689 ± 0.679
2.252PheArg: 2.252 ± 1.316
3.94PheSer: 3.94 ± 1.087
1.689PheThr: 1.689 ± 1.195
2.533PheVal: 2.533 ± 0.77
0.281PheTrp: 0.281 ± 0.164
0.563PheTyr: 0.563 ± 0.339
0.0PheXaa: 0.0 ± 0.0
Gly
2.533GlyAla: 2.533 ± 1.319
0.844GlyCys: 0.844 ± 0.333
3.659GlyAsp: 3.659 ± 0.337
1.689GlyGlu: 1.689 ± 0.726
2.533GlyPhe: 2.533 ± 0.373
2.815GlyGly: 2.815 ± 0.569
1.689GlyHis: 1.689 ± 0.633
3.659GlyIle: 3.659 ± 0.768
2.252GlyLys: 2.252 ± 0.589
6.192GlyLeu: 6.192 ± 1.183
1.689GlyMet: 1.689 ± 0.423
1.407GlyAsn: 1.407 ± 0.723
2.815GlyPro: 2.815 ± 1.011
3.096GlyGln: 3.096 ± 0.885
2.252GlyArg: 2.252 ± 0.63
3.94GlySer: 3.94 ± 0.564
1.97GlyThr: 1.97 ± 0.56
4.503GlyVal: 4.503 ± 1.62
0.844GlyTrp: 0.844 ± 0.333
3.096GlyTyr: 3.096 ± 0.497
0.0GlyXaa: 0.0 ± 0.0
His
1.126HisAla: 1.126 ± 0.485
0.0HisCys: 0.0 ± 0.0
0.281HisAsp: 0.281 ± 0.164
0.563HisGlu: 0.563 ± 0.329
1.407HisPhe: 1.407 ± 0.381
1.126HisGly: 1.126 ± 0.406
1.689HisHis: 1.689 ± 0.633
1.689HisIle: 1.689 ± 0.987
1.126HisLys: 1.126 ± 0.658
3.94HisLeu: 3.94 ± 1.19
0.563HisMet: 0.563 ± 0.329
0.844HisAsn: 0.844 ± 0.493
2.815HisPro: 2.815 ± 0.761
1.126HisGln: 1.126 ± 0.406
1.126HisArg: 1.126 ± 0.816
3.096HisSer: 3.096 ± 0.977
0.563HisThr: 0.563 ± 0.329
1.407HisVal: 1.407 ± 0.381
1.126HisTrp: 1.126 ± 0.658
1.126HisTyr: 1.126 ± 0.908
0.0HisXaa: 0.0 ± 0.0
Ile
2.815IleAla: 2.815 ± 0.962
2.252IleCys: 2.252 ± 0.717
5.066IleAsp: 5.066 ± 1.29
3.94IleGlu: 3.94 ± 1.752
3.096IlePhe: 3.096 ± 0.942
2.533IleGly: 2.533 ± 1.003
2.533IleHis: 2.533 ± 0.778
7.036IleIle: 7.036 ± 0.823
3.659IleLys: 3.659 ± 0.834
4.785IleLeu: 4.785 ± 1.295
0.844IleMet: 0.844 ± 0.456
3.377IleAsn: 3.377 ± 0.88
3.659IlePro: 3.659 ± 1.341
2.815IleGln: 2.815 ± 0.721
4.785IleArg: 4.785 ± 1.196
3.377IleSer: 3.377 ± 0.93
3.377IleThr: 3.377 ± 1.293
2.533IleVal: 2.533 ± 0.907
0.563IleTrp: 0.563 ± 0.339
3.659IleTyr: 3.659 ± 0.667
0.0IleXaa: 0.0 ± 0.0
Lys
2.252LysAla: 2.252 ± 1.018
0.844LysCys: 0.844 ± 0.741
3.659LysAsp: 3.659 ± 2.412
4.785LysGlu: 4.785 ± 1.805
2.533LysPhe: 2.533 ± 0.802
3.94LysGly: 3.94 ± 0.685
1.689LysHis: 1.689 ± 0.434
5.348LysIle: 5.348 ± 1.549
4.503LysLys: 4.503 ± 1.277
6.473LysLeu: 6.473 ± 1.434
3.096LysMet: 3.096 ± 0.385
2.533LysAsn: 2.533 ± 1.073
3.377LysPro: 3.377 ± 2.09
1.689LysGln: 1.689 ± 0.745
2.815LysArg: 2.815 ± 0.883
2.815LysSer: 2.815 ± 1.19
2.252LysThr: 2.252 ± 1.103
3.096LysVal: 3.096 ± 0.651
0.844LysTrp: 0.844 ± 0.412
1.689LysTyr: 1.689 ± 0.636
0.0LysXaa: 0.0 ± 0.0
Leu
5.066LeuAla: 5.066 ± 1.951
1.97LeuCys: 1.97 ± 0.24
6.755LeuAsp: 6.755 ± 1.201
5.91LeuGlu: 5.91 ± 1.063
3.377LeuPhe: 3.377 ± 1.583
6.473LeuGly: 6.473 ± 1.174
3.659LeuHis: 3.659 ± 0.627
7.881LeuIle: 7.881 ± 3.199
7.599LeuLys: 7.599 ± 2.553
7.881LeuLeu: 7.881 ± 0.821
2.533LeuMet: 2.533 ± 0.464
5.91LeuAsn: 5.91 ± 1.549
4.785LeuPro: 4.785 ± 0.9
1.407LeuGln: 1.407 ± 0.399
5.629LeuArg: 5.629 ± 1.61
8.162LeuSer: 8.162 ± 1.643
7.881LeuThr: 7.881 ± 0.924
3.94LeuVal: 3.94 ± 1.988
2.533LeuTrp: 2.533 ± 0.929
3.659LeuTyr: 3.659 ± 0.444
0.0LeuXaa: 0.0 ± 0.0
Met
0.844MetAla: 0.844 ± 0.833
0.0MetCys: 0.0 ± 0.0
1.407MetAsp: 1.407 ± 0.612
1.689MetGlu: 1.689 ± 1.024
1.689MetPhe: 1.689 ± 0.397
1.407MetGly: 1.407 ± 0.595
1.407MetHis: 1.407 ± 0.819
2.252MetIle: 2.252 ± 1.087
2.815MetLys: 2.815 ± 0.97
2.533MetLeu: 2.533 ± 0.886
0.844MetMet: 0.844 ± 0.446
1.689MetAsn: 1.689 ± 0.423
0.281MetPro: 0.281 ± 0.489
1.407MetGln: 1.407 ± 0.454
1.689MetArg: 1.689 ± 0.484
3.096MetSer: 3.096 ± 1.191
1.407MetThr: 1.407 ± 0.651
2.815MetVal: 2.815 ± 0.511
0.0MetTrp: 0.0 ± 0.0
0.844MetTyr: 0.844 ± 0.333
0.0MetXaa: 0.0 ± 0.0
Asn
2.252AsnAla: 2.252 ± 1.103
0.563AsnCys: 0.563 ± 0.339
2.533AsnAsp: 2.533 ± 0.965
2.252AsnGlu: 2.252 ± 0.909
1.407AsnPhe: 1.407 ± 0.651
2.533AsnGly: 2.533 ± 0.939
0.281AsnHis: 0.281 ± 0.164
2.815AsnIle: 2.815 ± 1.531
1.97AsnLys: 1.97 ± 1.186
5.629AsnLeu: 5.629 ± 1.109
2.815AsnMet: 2.815 ± 0.41
2.252AsnAsn: 2.252 ± 1.593
3.659AsnPro: 3.659 ± 1.181
0.844AsnGln: 0.844 ± 0.333
2.252AsnArg: 2.252 ± 0.302
5.348AsnSer: 5.348 ± 0.696
1.407AsnThr: 1.407 ± 0.381
2.252AsnVal: 2.252 ± 0.733
1.126AsnTrp: 1.126 ± 0.485
2.815AsnTyr: 2.815 ± 0.677
0.0AsnXaa: 0.0 ± 0.0
Pro
3.096ProAla: 3.096 ± 0.787
0.563ProCys: 0.563 ± 0.339
3.659ProAsp: 3.659 ± 0.94
2.533ProGlu: 2.533 ± 0.976
1.407ProPhe: 1.407 ± 0.723
1.689ProGly: 1.689 ± 0.712
0.844ProHis: 0.844 ± 0.412
3.659ProIle: 3.659 ± 1.795
3.377ProLys: 3.377 ± 1.671
4.785ProLeu: 4.785 ± 1.497
0.563ProMet: 0.563 ± 0.438
2.815ProAsn: 2.815 ± 0.651
4.503ProPro: 4.503 ± 6.659
1.689ProGln: 1.689 ± 0.719
1.97ProArg: 1.97 ± 0.769
9.006ProSer: 9.006 ± 1.113
3.659ProThr: 3.659 ± 0.744
3.096ProVal: 3.096 ± 1.306
1.126ProTrp: 1.126 ± 0.488
2.252ProTyr: 2.252 ± 0.804
0.0ProXaa: 0.0 ± 0.0
Gln
1.407GlnAla: 1.407 ± 0.454
0.844GlnCys: 0.844 ± 0.333
1.97GlnAsp: 1.97 ± 0.528
1.407GlnGlu: 1.407 ± 0.859
1.407GlnPhe: 1.407 ± 0.381
3.94GlnGly: 3.94 ± 1.259
0.563GlnHis: 0.563 ± 0.606
3.096GlnIle: 3.096 ± 1.161
2.815GlnLys: 2.815 ± 1.255
1.97GlnLeu: 1.97 ± 0.664
0.563GlnMet: 0.563 ± 0.606
1.97GlnAsn: 1.97 ± 0.623
1.407GlnPro: 1.407 ± 1.347
0.844GlnGln: 0.844 ± 0.512
2.252GlnArg: 2.252 ± 1.018
3.377GlnSer: 3.377 ± 0.892
2.252GlnThr: 2.252 ± 1.342
1.97GlnVal: 1.97 ± 0.884
0.844GlnTrp: 0.844 ± 0.493
1.689GlnTyr: 1.689 ± 1.355
0.0GlnXaa: 0.0 ± 0.0
Arg
1.689ArgAla: 1.689 ± 0.824
0.281ArgCys: 0.281 ± 0.164
2.533ArgAsp: 2.533 ± 0.907
4.222ArgGlu: 4.222 ± 1.296
2.533ArgPhe: 2.533 ± 0.796
2.533ArgGly: 2.533 ± 0.907
1.407ArgHis: 1.407 ± 0.595
3.096ArgIle: 3.096 ± 0.477
1.126ArgLys: 1.126 ± 0.746
5.066ArgLeu: 5.066 ± 0.539
1.689ArgMet: 1.689 ± 0.423
3.096ArgAsn: 3.096 ± 1.025
2.252ArgPro: 2.252 ± 0.843
1.407ArgGln: 1.407 ± 0.587
3.659ArgArg: 3.659 ± 1.459
5.629ArgSer: 5.629 ± 1.451
2.815ArgThr: 2.815 ± 0.578
3.096ArgVal: 3.096 ± 1.809
1.407ArgTrp: 1.407 ± 0.514
1.407ArgTyr: 1.407 ± 0.453
0.0ArgXaa: 0.0 ± 0.0
Ser
3.377SerAla: 3.377 ± 1.002
2.252SerCys: 2.252 ± 0.931
4.785SerAsp: 4.785 ± 1.138
5.629SerGlu: 5.629 ± 0.996
2.815SerPhe: 2.815 ± 1.009
5.629SerGly: 5.629 ± 0.525
1.97SerHis: 1.97 ± 0.797
5.066SerIle: 5.066 ± 1.536
4.222SerLys: 4.222 ± 1.117
10.414SerLeu: 10.414 ± 1.539
2.815SerMet: 2.815 ± 1.041
3.659SerAsn: 3.659 ± 0.67
4.785SerPro: 4.785 ± 1.57
2.252SerGln: 2.252 ± 0.71
3.659SerArg: 3.659 ± 0.61
7.318SerSer: 7.318 ± 2.062
5.91SerThr: 5.91 ± 1.656
3.659SerVal: 3.659 ± 2.485
2.252SerTrp: 2.252 ± 0.621
4.222SerTyr: 4.222 ± 0.806
0.0SerXaa: 0.0 ± 0.0
Thr
2.252ThrAla: 2.252 ± 0.733
0.844ThrCys: 0.844 ± 0.741
2.252ThrAsp: 2.252 ± 0.639
3.94ThrGlu: 3.94 ± 2.191
1.407ThrPhe: 1.407 ± 0.381
1.689ThrGly: 1.689 ± 0.434
1.126ThrHis: 1.126 ± 0.485
3.659ThrIle: 3.659 ± 0.65
3.659ThrLys: 3.659 ± 0.627
6.473ThrLeu: 6.473 ± 1.216
1.689ThrMet: 1.689 ± 0.987
2.815ThrAsn: 2.815 ± 0.726
3.659ThrPro: 3.659 ± 1.798
3.096ThrGln: 3.096 ± 0.477
3.096ThrArg: 3.096 ± 1.106
5.629ThrSer: 5.629 ± 2.345
1.97ThrThr: 1.97 ± 0.516
3.94ThrVal: 3.94 ± 0.371
0.844ThrTrp: 0.844 ± 0.456
1.126ThrTyr: 1.126 ± 0.51
0.0ThrXaa: 0.0 ± 0.0
Val
2.252ValAla: 2.252 ± 0.96
0.563ValCys: 0.563 ± 0.339
2.815ValAsp: 2.815 ± 0.491
4.785ValGlu: 4.785 ± 1.709
1.407ValPhe: 1.407 ± 0.457
3.377ValGly: 3.377 ± 1.742
1.689ValHis: 1.689 ± 0.665
3.659ValIle: 3.659 ± 1.54
3.096ValLys: 3.096 ± 1.583
5.066ValLeu: 5.066 ± 0.539
2.252ValMet: 2.252 ± 0.949
2.815ValAsn: 2.815 ± 0.569
3.659ValPro: 3.659 ± 0.996
2.815ValGln: 2.815 ± 1.956
1.97ValArg: 1.97 ± 0.983
4.503ValSer: 4.503 ± 1.28
4.503ValThr: 4.503 ± 0.528
4.503ValVal: 4.503 ± 1.72
0.563ValTrp: 0.563 ± 0.329
1.97ValTyr: 1.97 ± 0.718
0.0ValXaa: 0.0 ± 0.0
Trp
0.844TrpAla: 0.844 ± 0.528
0.281TrpCys: 0.281 ± 0.164
0.563TrpAsp: 0.563 ± 0.329
0.844TrpGlu: 0.844 ± 0.412
0.563TrpPhe: 0.563 ± 0.47
0.844TrpGly: 0.844 ± 0.493
0.281TrpHis: 0.281 ± 0.164
1.407TrpIle: 1.407 ± 0.794
1.407TrpLys: 1.407 ± 0.822
0.563TrpLeu: 0.563 ± 0.339
0.563TrpMet: 0.563 ± 0.832
1.126TrpAsn: 1.126 ± 0.499
0.563TrpPro: 0.563 ± 0.339
0.0TrpGln: 0.0 ± 0.0
1.126TrpArg: 1.126 ± 0.499
2.815TrpSer: 2.815 ± 0.936
0.563TrpThr: 0.563 ± 0.329
1.407TrpVal: 1.407 ± 0.514
0.0TrpTrp: 0.0 ± 0.0
0.844TrpTyr: 0.844 ± 0.567
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.533TyrAla: 2.533 ± 0.702
0.0TyrCys: 0.0 ± 0.0
1.97TyrAsp: 1.97 ± 0.704
2.252TyrGlu: 2.252 ± 1.028
1.97TyrPhe: 1.97 ± 1.658
1.689TyrGly: 1.689 ± 0.501
1.407TyrHis: 1.407 ± 0.514
1.97TyrIle: 1.97 ± 0.591
3.096TyrLys: 3.096 ± 0.884
4.222TyrLeu: 4.222 ± 0.919
1.126TyrMet: 1.126 ± 1.095
0.844TyrAsn: 0.844 ± 0.741
2.252TyrPro: 2.252 ± 0.575
2.533TyrGln: 2.533 ± 0.775
1.97TyrArg: 1.97 ± 0.623
2.533TyrSer: 2.533 ± 0.796
2.252TyrThr: 2.252 ± 1.289
2.252TyrVal: 2.252 ± 1.174
0.0TyrTrp: 0.0 ± 0.0
0.844TyrTyr: 0.844 ± 0.456
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3554 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski