Amino acid dipepetide frequency for Wuhan insect virus 8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.453AlaAla: 3.453 ± 0.661
1.569AlaCys: 1.569 ± 0.821
2.825AlaAsp: 2.825 ± 0.825
2.197AlaGlu: 2.197 ± 1.149
3.766AlaPhe: 3.766 ± 1.972
1.883AlaGly: 1.883 ± 0.833
0.942AlaHis: 0.942 ± 0.493
3.453AlaIle: 3.453 ± 1.809
1.255AlaLys: 1.255 ± 0.425
5.964AlaLeu: 5.964 ± 3.315
1.883AlaMet: 1.883 ± 0.985
2.825AlaAsn: 2.825 ± 0.825
0.314AlaPro: 0.314 ± 0.164
1.255AlaGln: 1.255 ± 0.657
2.825AlaArg: 2.825 ± 0.477
3.453AlaSer: 3.453 ± 0.661
1.883AlaThr: 1.883 ± 0.468
3.139AlaVal: 3.139 ± 2.762
1.255AlaTrp: 1.255 ± 0.879
1.883AlaTyr: 1.883 ± 0.985
0.0AlaXaa: 0.0 ± 0.0
Cys
0.942CysAla: 0.942 ± 0.493
0.0CysCys: 0.0 ± 0.0
1.883CysAsp: 1.883 ± 0.985
1.569CysGlu: 1.569 ± 0.821
1.883CysPhe: 1.883 ± 0.468
0.942CysGly: 0.942 ± 0.493
0.314CysHis: 0.314 ± 0.164
1.883CysIle: 1.883 ± 0.468
1.569CysLys: 1.569 ± 0.416
2.825CysLeu: 2.825 ± 0.825
0.942CysMet: 0.942 ± 0.493
0.314CysAsn: 0.314 ± 0.164
1.569CysPro: 1.569 ± 0.821
0.942CysGln: 0.942 ± 0.493
1.569CysArg: 1.569 ± 0.416
1.883CysSer: 1.883 ± 0.985
0.942CysThr: 0.942 ± 0.493
2.511CysVal: 2.511 ± 0.687
0.0CysTrp: 0.0 ± 0.0
1.569CysTyr: 1.569 ± 1.085
0.0CysXaa: 0.0 ± 0.0
Asp
4.08AspAla: 4.08 ± 1.432
2.511AspCys: 2.511 ± 0.851
3.453AspAsp: 3.453 ± 0.87
3.453AspGlu: 3.453 ± 1.201
3.139AspPhe: 3.139 ± 1.639
1.569AspGly: 1.569 ± 0.416
1.255AspHis: 1.255 ± 0.657
5.336AspIle: 5.336 ± 1.166
1.569AspLys: 1.569 ± 0.821
4.394AspLeu: 4.394 ± 1.236
1.883AspMet: 1.883 ± 0.935
2.511AspAsn: 2.511 ± 1.759
2.825AspPro: 2.825 ± 0.825
0.628AspGln: 0.628 ± 0.328
3.453AspArg: 3.453 ± 0.87
4.708AspSer: 4.708 ± 0.01
5.336AspThr: 5.336 ± 1.043
6.591AspVal: 6.591 ± 2.091
0.628AspTrp: 0.628 ± 0.328
2.197AspTyr: 2.197 ± 0.565
0.0AspXaa: 0.0 ± 0.0
Glu
1.883GluAla: 1.883 ± 0.985
1.255GluCys: 1.255 ± 0.425
0.942GluAsp: 0.942 ± 0.493
1.883GluGlu: 1.883 ± 0.985
1.569GluPhe: 1.569 ± 0.821
0.628GluGly: 0.628 ± 0.328
1.883GluHis: 1.883 ± 0.468
3.766GluIle: 3.766 ± 1.275
1.255GluLys: 1.255 ± 0.879
5.964GluLeu: 5.964 ± 0.418
1.255GluMet: 1.255 ± 0.657
2.825GluAsn: 2.825 ± 0.825
1.883GluPro: 1.883 ± 0.985
1.255GluGln: 1.255 ± 0.657
1.883GluArg: 1.883 ± 0.985
3.766GluSer: 3.766 ± 1.666
2.197GluThr: 2.197 ± 1.149
4.08GluVal: 4.08 ± 1.168
0.314GluTrp: 0.314 ± 0.164
4.708GluTyr: 4.708 ± 0.868
0.0GluXaa: 0.0 ± 0.0
Phe
1.255PheAla: 1.255 ± 0.657
2.197PheCys: 2.197 ± 1.149
4.708PheAsp: 4.708 ± 1.75
4.394PheGlu: 4.394 ± 1.59
3.453PhePhe: 3.453 ± 2.067
1.569PheGly: 1.569 ± 0.416
1.883PheHis: 1.883 ± 0.833
4.08PheIle: 4.08 ± 0.333
2.197PheLys: 2.197 ± 0.906
6.591PheLeu: 6.591 ± 4.97
0.628PheMet: 0.628 ± 0.328
4.08PheAsn: 4.08 ± 1.455
3.453PhePro: 3.453 ± 3.197
1.883PheGln: 1.883 ± 0.468
2.197PheArg: 2.197 ± 3.207
6.591PheSer: 6.591 ± 2.285
1.569PheThr: 1.569 ± 0.821
4.708PheVal: 4.708 ± 1.247
0.942PheTrp: 0.942 ± 0.493
2.197PheTyr: 2.197 ± 1.682
0.0PheXaa: 0.0 ± 0.0
Gly
0.628GlyAla: 0.628 ± 0.328
0.628GlyCys: 0.628 ± 0.328
3.453GlyAsp: 3.453 ± 1.806
0.0GlyGlu: 0.0 ± 0.0
2.197GlyPhe: 2.197 ± 1.818
0.628GlyGly: 0.628 ± 0.328
1.255GlyHis: 1.255 ± 0.657
2.511GlyIle: 2.511 ± 1.759
2.197GlyLys: 2.197 ± 0.858
2.197GlyLeu: 2.197 ± 1.149
1.569GlyMet: 1.569 ± 0.416
0.942GlyAsn: 0.942 ± 0.945
0.628GlyPro: 0.628 ± 0.328
0.628GlyGln: 0.628 ± 1.439
2.197GlyArg: 2.197 ± 0.696
3.453GlySer: 3.453 ± 0.433
3.453GlyThr: 3.453 ± 0.87
3.139GlyVal: 3.139 ± 1.682
0.314GlyTrp: 0.314 ± 0.164
2.825GlyTyr: 2.825 ± 1.799
0.0GlyXaa: 0.0 ± 0.0
His
0.314HisAla: 0.314 ± 0.164
0.942HisCys: 0.942 ± 0.493
2.197HisAsp: 2.197 ± 0.906
1.569HisGlu: 1.569 ± 0.821
1.569HisPhe: 1.569 ± 0.416
0.314HisGly: 0.314 ± 0.164
1.255HisHis: 1.255 ± 1.198
2.197HisIle: 2.197 ± 0.696
0.942HisLys: 0.942 ± 0.493
2.825HisLeu: 2.825 ± 0.825
0.628HisMet: 0.628 ± 0.328
2.197HisAsn: 2.197 ± 0.858
1.569HisPro: 1.569 ± 0.416
1.255HisGln: 1.255 ± 0.657
1.883HisArg: 1.883 ± 0.985
3.139HisSer: 3.139 ± 0.832
1.883HisThr: 1.883 ± 0.833
2.511HisVal: 2.511 ± 1.314
0.0HisTrp: 0.0 ± 0.0
1.255HisTyr: 1.255 ± 0.425
0.0HisXaa: 0.0 ± 0.0
Ile
4.708IleAla: 4.708 ± 3.777
1.255IleCys: 1.255 ± 0.425
5.336IleAsp: 5.336 ± 2.791
3.453IleGlu: 3.453 ± 0.87
3.139IlePhe: 3.139 ± 0.826
2.197IleGly: 2.197 ± 1.818
0.942IleHis: 0.942 ± 0.493
3.453IleIle: 3.453 ± 0.661
4.08IleLys: 4.08 ± 0.602
3.139IleLeu: 3.139 ± 0.97
0.628IleMet: 0.628 ± 0.328
3.766IleAsn: 3.766 ± 1.276
3.766IlePro: 3.766 ± 1.667
1.883IleGln: 1.883 ± 0.985
2.511IleArg: 2.511 ± 0.687
6.277IleSer: 6.277 ± 2.447
4.708IleThr: 4.708 ± 0.867
4.08IleVal: 4.08 ± 1.887
0.0IleTrp: 0.0 ± 0.0
4.394IleTyr: 4.394 ± 1.016
0.0IleXaa: 0.0 ± 0.0
Lys
1.569LysAla: 1.569 ± 1.085
0.314LysCys: 0.314 ± 0.164
3.453LysAsp: 3.453 ± 1.121
1.883LysGlu: 1.883 ± 0.468
3.139LysPhe: 3.139 ± 0.832
1.255LysGly: 1.255 ± 0.879
0.628LysHis: 0.628 ± 0.328
2.197LysIle: 2.197 ± 1.149
4.394LysLys: 4.394 ± 2.299
5.65LysLeu: 5.65 ± 1.403
0.628LysMet: 0.628 ± 0.663
5.65LysAsn: 5.65 ± 0.955
2.197LysPro: 2.197 ± 1.149
0.314LysGln: 0.314 ± 0.164
3.139LysArg: 3.139 ± 1.088
5.65LysSer: 5.65 ± 1.981
2.825LysThr: 2.825 ± 0.99
1.255LysVal: 1.255 ± 0.879
0.0LysTrp: 0.0 ± 0.0
4.394LysTyr: 4.394 ± 1.59
0.0LysXaa: 0.0 ± 0.0
Leu
5.65LeuAla: 5.65 ± 1.29
1.883LeuCys: 1.883 ± 0.986
3.766LeuAsp: 3.766 ± 0.936
4.08LeuGlu: 4.08 ± 1.245
4.708LeuPhe: 4.708 ± 1.75
3.453LeuGly: 3.453 ± 0.661
2.511LeuHis: 2.511 ± 1.575
5.336LeuIle: 5.336 ± 2.301
8.475LeuLys: 8.475 ± 1.966
8.475LeuLeu: 8.475 ± 4.777
1.569LeuMet: 1.569 ± 0.416
5.964LeuAsn: 5.964 ± 2.392
2.825LeuPro: 2.825 ± 2.106
2.197LeuGln: 2.197 ± 0.696
3.766LeuArg: 3.766 ± 1.324
8.161LeuSer: 8.161 ± 3.713
6.591LeuThr: 6.591 ± 5.455
7.219LeuVal: 7.219 ± 2.115
0.942LeuTrp: 0.942 ± 0.945
4.08LeuTyr: 4.08 ± 1.245
0.0LeuXaa: 0.0 ± 0.0
Met
0.942MetAla: 0.942 ± 0.945
0.628MetCys: 0.628 ± 0.599
0.628MetAsp: 0.628 ± 0.328
0.942MetGlu: 0.942 ± 0.493
1.883MetPhe: 1.883 ± 0.985
0.942MetGly: 0.942 ± 0.493
0.314MetHis: 0.314 ± 0.164
0.942MetIle: 0.942 ± 0.493
1.255MetLys: 1.255 ± 0.657
1.883MetLeu: 1.883 ± 0.468
0.314MetMet: 0.314 ± 0.164
0.314MetAsn: 0.314 ± 0.164
0.942MetPro: 0.942 ± 0.493
0.0MetGln: 0.0 ± 0.0
0.314MetArg: 0.314 ± 0.164
2.511MetSer: 2.511 ± 0.687
1.883MetThr: 1.883 ± 0.985
1.255MetVal: 1.255 ± 0.879
0.0MetTrp: 0.0 ± 0.0
2.197MetTyr: 2.197 ± 1.149
0.0MetXaa: 0.0 ± 0.0
Asn
3.453AsnAla: 3.453 ± 0.87
1.255AsnCys: 1.255 ± 0.657
3.139AsnAsp: 3.139 ± 1.088
1.255AsnGlu: 1.255 ± 0.425
3.766AsnPhe: 3.766 ± 1.423
2.197AsnGly: 2.197 ± 1.951
1.255AsnHis: 1.255 ± 0.425
3.453AsnIle: 3.453 ± 0.433
3.139AsnLys: 3.139 ± 1.605
6.905AsnLeu: 6.905 ± 1.145
0.628AsnMet: 0.628 ± 0.328
3.453AsnAsn: 3.453 ± 1.121
2.825AsnPro: 2.825 ± 0.99
1.569AsnGln: 1.569 ± 0.978
3.139AsnArg: 3.139 ± 1.088
5.336AsnSer: 5.336 ± 1.261
2.825AsnThr: 2.825 ± 1.478
5.022AsnVal: 5.022 ± 0.726
0.0AsnTrp: 0.0 ± 0.0
2.825AsnTyr: 2.825 ± 1.479
0.0AsnXaa: 0.0 ± 0.0
Pro
1.883ProAla: 1.883 ± 1.891
0.628ProCys: 0.628 ± 0.328
3.453ProAsp: 3.453 ± 1.121
1.569ProGlu: 1.569 ± 1.974
2.197ProPhe: 2.197 ± 1.149
2.197ProGly: 2.197 ± 1.149
1.255ProHis: 1.255 ± 0.657
3.139ProIle: 3.139 ± 2.995
1.883ProLys: 1.883 ± 0.985
3.139ProLeu: 3.139 ± 0.826
0.628ProMet: 0.628 ± 0.328
1.883ProAsn: 1.883 ± 0.468
0.628ProPro: 0.628 ± 0.599
0.942ProGln: 0.942 ± 1.282
3.139ProArg: 3.139 ± 1.605
2.511ProSer: 2.511 ± 1.575
1.255ProThr: 1.255 ± 0.879
3.139ProVal: 3.139 ± 0.97
0.0ProTrp: 0.0 ± 0.0
1.255ProTyr: 1.255 ± 0.425
0.0ProXaa: 0.0 ± 0.0
Gln
1.569GlnAla: 1.569 ± 0.841
0.628GlnCys: 0.628 ± 0.328
1.255GlnAsp: 1.255 ± 0.425
2.197GlnGlu: 2.197 ± 0.565
1.569GlnPhe: 1.569 ± 0.821
0.628GlnGly: 0.628 ± 0.328
1.255GlnHis: 1.255 ± 0.425
1.255GlnIle: 1.255 ± 0.425
1.569GlnLys: 1.569 ± 0.841
1.569GlnLeu: 1.569 ± 1.085
0.0GlnMet: 0.0 ± 0.0
1.255GlnAsn: 1.255 ± 1.129
0.942GlnPro: 0.942 ± 0.493
0.942GlnGln: 0.942 ± 0.493
0.628GlnArg: 0.628 ± 0.328
1.569GlnSer: 1.569 ± 0.821
0.942GlnThr: 0.942 ± 0.493
2.825GlnVal: 2.825 ± 0.991
0.0GlnTrp: 0.0 ± 0.0
2.197GlnTyr: 2.197 ± 0.565
0.0GlnXaa: 0.0 ± 0.0
Arg
1.569ArgAla: 1.569 ± 0.841
2.197ArgCys: 2.197 ± 0.565
2.511ArgAsp: 2.511 ± 0.687
2.825ArgGlu: 2.825 ± 0.477
3.139ArgPhe: 3.139 ± 1.642
2.197ArgGly: 2.197 ± 0.565
2.825ArgHis: 2.825 ± 1.478
1.255ArgIle: 1.255 ± 0.425
2.197ArgLys: 2.197 ± 1.149
5.022ArgLeu: 5.022 ± 1.148
1.255ArgMet: 1.255 ± 0.657
4.708ArgAsn: 4.708 ± 4.156
0.942ArgPro: 0.942 ± 0.493
0.628ArgGln: 0.628 ± 0.328
4.394ArgArg: 4.394 ± 1.716
5.336ArgSer: 5.336 ± 3.538
2.511ArgThr: 2.511 ± 1.314
3.766ArgVal: 3.766 ± 0.936
0.314ArgTrp: 0.314 ± 1.138
2.825ArgTyr: 2.825 ± 1.478
0.0ArgXaa: 0.0 ± 0.0
Ser
4.08SerAla: 4.08 ± 1.168
1.255SerCys: 1.255 ± 0.657
5.336SerAsp: 5.336 ± 0.324
3.766SerGlu: 3.766 ± 1.324
8.788SerPhe: 8.788 ± 1.881
5.336SerGly: 5.336 ± 3.468
2.511SerHis: 2.511 ± 0.851
5.65SerIle: 5.65 ± 0.488
3.453SerLys: 3.453 ± 0.661
6.905SerLeu: 6.905 ± 2.07
2.197SerMet: 2.197 ± 1.149
3.139SerAsn: 3.139 ± 1.605
2.825SerPro: 2.825 ± 0.99
2.511SerGln: 2.511 ± 0.687
5.336SerArg: 5.336 ± 3.199
10.986SerSer: 10.986 ± 4.497
5.65SerThr: 5.65 ± 0.488
10.672SerVal: 10.672 ± 3.441
0.314SerTrp: 0.314 ± 0.164
6.591SerTyr: 6.591 ± 0.458
0.0SerXaa: 0.0 ± 0.0
Thr
2.197ThrAla: 2.197 ± 1.149
1.883ThrCys: 1.883 ± 0.985
1.883ThrAsp: 1.883 ± 0.833
1.569ThrGlu: 1.569 ± 0.821
3.139ThrPhe: 3.139 ± 2.49
2.197ThrGly: 2.197 ± 1.149
2.197ThrHis: 2.197 ± 0.858
5.022ThrIle: 5.022 ± 1.253
2.197ThrLys: 2.197 ± 0.858
5.336ThrLeu: 5.336 ± 2.144
0.942ThrMet: 0.942 ± 0.493
1.883ThrAsn: 1.883 ± 0.468
2.825ThrPro: 2.825 ± 0.477
1.569ThrGln: 1.569 ± 0.416
3.453ThrArg: 3.453 ± 1.121
6.591ThrSer: 6.591 ± 0.842
3.139ThrThr: 3.139 ± 1.956
3.139ThrVal: 3.139 ± 0.425
0.314ThrTrp: 0.314 ± 0.164
3.453ThrTyr: 3.453 ± 0.433
0.0ThrXaa: 0.0 ± 0.0
Val
3.766ValAla: 3.766 ± 0.498
2.511ValCys: 2.511 ± 0.687
6.591ValAsp: 6.591 ± 2.572
3.766ValGlu: 3.766 ± 0.497
3.453ValPhe: 3.453 ± 2.335
1.255ValGly: 1.255 ± 0.879
4.708ValHis: 4.708 ± 0.01
4.394ValIle: 4.394 ± 2.646
4.708ValLys: 4.708 ± 1.247
7.219ValLeu: 7.219 ± 0.918
1.569ValMet: 1.569 ± 0.841
4.08ValAsn: 4.08 ± 1.455
3.139ValPro: 3.139 ± 0.425
2.511ValGln: 2.511 ± 1.314
3.453ValArg: 3.453 ± 1.121
9.416ValSer: 9.416 ± 1.239
2.511ValThr: 2.511 ± 0.687
6.905ValVal: 6.905 ± 3.619
0.314ValTrp: 0.314 ± 0.164
4.08ValTyr: 4.08 ± 1.432
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.314TrpCys: 0.314 ± 0.164
0.314TrpAsp: 0.314 ± 0.164
0.0TrpGlu: 0.0 ± 0.0
0.628TrpPhe: 0.628 ± 1.033
0.0TrpGly: 0.0 ± 0.0
0.314TrpHis: 0.314 ± 0.164
0.314TrpIle: 0.314 ± 0.164
0.314TrpLys: 0.314 ± 0.164
0.628TrpLeu: 0.628 ± 0.328
0.0TrpMet: 0.0 ± 0.0
0.942TrpAsn: 0.942 ± 0.493
0.314TrpPro: 0.314 ± 0.164
0.0TrpGln: 0.0 ± 0.0
0.314TrpArg: 0.314 ± 0.164
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.942TrpVal: 0.942 ± 2.168
0.0TrpTrp: 0.0 ± 0.0
0.314TrpTyr: 0.314 ± 0.164
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.08TyrAla: 4.08 ± 1.024
2.197TyrCys: 2.197 ± 0.906
4.394TyrAsp: 4.394 ± 1.716
2.825TyrGlu: 2.825 ± 0.825
3.453TyrPhe: 3.453 ± 3.539
3.453TyrGly: 3.453 ± 0.87
0.942TyrHis: 0.942 ± 0.493
4.08TyrIle: 4.08 ± 1.024
2.197TyrLys: 2.197 ± 0.565
5.022TyrLeu: 5.022 ± 1.281
0.628TyrMet: 0.628 ± 0.473
4.708TyrAsn: 4.708 ± 1.248
0.314TyrPro: 0.314 ± 0.164
1.883TyrGln: 1.883 ± 0.985
2.825TyrArg: 2.825 ± 0.825
5.964TyrSer: 5.964 ± 1.488
2.825TyrThr: 2.825 ± 0.825
3.453TyrVal: 3.453 ± 1.121
0.0TyrTrp: 0.0 ± 0.0
2.511TyrTyr: 2.511 ± 1.314
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3187 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski