Amino acid dipepetide frequency for Hubei Wuhan insect virus 9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.718AlaAla: 1.718 ± 0.696
0.491AlaCys: 0.491 ± 0.542
2.699AlaAsp: 2.699 ± 0.882
1.227AlaGlu: 1.227 ± 0.494
1.227AlaPhe: 1.227 ± 0.817
1.472AlaGly: 1.472 ± 0.563
0.982AlaHis: 0.982 ± 0.361
4.908AlaIle: 4.908 ± 0.878
1.472AlaLys: 1.472 ± 0.666
3.681AlaLeu: 3.681 ± 1.028
1.963AlaMet: 1.963 ± 0.838
1.718AlaAsn: 1.718 ± 0.414
1.963AlaPro: 1.963 ± 0.731
0.736AlaGln: 0.736 ± 0.341
0.245AlaArg: 0.245 ± 0.136
4.417AlaSer: 4.417 ± 1.58
3.926AlaThr: 3.926 ± 0.986
3.19AlaVal: 3.19 ± 0.615
0.491AlaTrp: 0.491 ± 0.338
1.472AlaTyr: 1.472 ± 0.817
0.0AlaXaa: 0.0 ± 0.0
Cys
0.491CysAla: 0.491 ± 0.272
0.245CysCys: 0.245 ± 0.136
1.227CysAsp: 1.227 ± 0.665
0.245CysGlu: 0.245 ± 0.136
0.736CysPhe: 0.736 ± 0.341
1.472CysGly: 1.472 ± 0.349
0.0CysHis: 0.0 ± 0.0
0.982CysIle: 0.982 ± 0.572
1.472CysLys: 1.472 ± 0.391
0.736CysLeu: 0.736 ± 0.306
1.227CysMet: 1.227 ± 0.46
1.718CysAsn: 1.718 ± 0.414
0.982CysPro: 0.982 ± 0.394
0.0CysGln: 0.0 ± 0.0
0.245CysArg: 0.245 ± 0.136
1.718CysSer: 1.718 ± 1.113
1.227CysThr: 1.227 ± 1.085
1.963CysVal: 1.963 ± 0.544
0.491CysTrp: 0.491 ± 0.272
0.245CysTyr: 0.245 ± 0.361
0.0CysXaa: 0.0 ± 0.0
Asp
2.945AspAla: 2.945 ± 0.602
1.963AspCys: 1.963 ± 0.542
4.172AspAsp: 4.172 ± 0.912
3.19AspGlu: 3.19 ± 0.885
4.417AspPhe: 4.417 ± 0.946
2.209AspGly: 2.209 ± 0.793
0.736AspHis: 0.736 ± 0.471
5.644AspIle: 5.644 ± 1.329
6.135AspLys: 6.135 ± 0.948
4.908AspLeu: 4.908 ± 1.171
3.19AspMet: 3.19 ± 1.288
3.436AspAsn: 3.436 ± 1.02
2.209AspPro: 2.209 ± 0.633
0.982AspGln: 0.982 ± 0.545
1.718AspArg: 1.718 ± 0.465
7.362AspSer: 7.362 ± 0.96
4.908AspThr: 4.908 ± 1.064
6.871AspVal: 6.871 ± 2.652
0.982AspTrp: 0.982 ± 0.438
3.436AspTyr: 3.436 ± 0.552
0.0AspXaa: 0.0 ± 0.0
Glu
1.227GluAla: 1.227 ± 0.451
0.736GluCys: 0.736 ± 0.456
2.209GluAsp: 2.209 ± 0.434
2.454GluGlu: 2.454 ± 1.362
1.718GluPhe: 1.718 ± 0.465
0.491GluGly: 0.491 ± 0.272
0.982GluHis: 0.982 ± 0.545
3.436GluIle: 3.436 ± 0.644
3.19GluLys: 3.19 ± 0.957
4.663GluLeu: 4.663 ± 1.542
0.982GluMet: 0.982 ± 0.366
4.172GluAsn: 4.172 ± 0.622
0.736GluPro: 0.736 ± 0.409
0.491GluGln: 0.491 ± 0.285
1.227GluArg: 1.227 ± 0.681
1.472GluSer: 1.472 ± 0.349
2.699GluThr: 2.699 ± 0.623
1.227GluVal: 1.227 ± 0.373
0.245GluTrp: 0.245 ± 0.319
2.945GluTyr: 2.945 ± 0.796
0.0GluXaa: 0.0 ± 0.0
Phe
1.963PheAla: 1.963 ± 1.239
1.718PheCys: 1.718 ± 0.959
3.681PheAsp: 3.681 ± 0.675
2.454PheGlu: 2.454 ± 0.507
1.472PhePhe: 1.472 ± 1.103
1.227PheGly: 1.227 ± 0.534
1.963PheHis: 1.963 ± 1.123
3.926PheIle: 3.926 ± 1.382
3.436PheLys: 3.436 ± 0.764
3.681PheLeu: 3.681 ± 1.165
1.472PheMet: 1.472 ± 0.787
2.699PheAsn: 2.699 ± 0.957
0.736PhePro: 0.736 ± 0.456
1.718PheGln: 1.718 ± 0.64
2.454PheArg: 2.454 ± 0.437
5.153PheSer: 5.153 ± 1.359
5.153PheThr: 5.153 ± 0.814
5.153PheVal: 5.153 ± 1.159
0.0PheTrp: 0.0 ± 0.0
3.926PheTyr: 3.926 ± 1.474
0.0PheXaa: 0.0 ± 0.0
Gly
2.209GlyAla: 2.209 ± 0.935
0.491GlyCys: 0.491 ± 0.272
2.699GlyAsp: 2.699 ± 1.064
0.0GlyGlu: 0.0 ± 0.0
1.472GlyPhe: 1.472 ± 0.542
1.472GlyGly: 1.472 ± 0.554
0.736GlyHis: 0.736 ± 0.341
3.436GlyIle: 3.436 ± 0.415
2.209GlyLys: 2.209 ± 0.66
2.454GlyLeu: 2.454 ± 0.552
0.982GlyMet: 0.982 ± 0.365
1.227GlyAsn: 1.227 ± 0.443
1.227GlyPro: 1.227 ± 1.225
0.736GlyGln: 0.736 ± 0.409
1.227GlyArg: 1.227 ± 0.42
2.454GlySer: 2.454 ± 0.71
1.227GlyThr: 1.227 ± 0.871
1.227GlyVal: 1.227 ± 0.584
0.0GlyTrp: 0.0 ± 0.0
1.718GlyTyr: 1.718 ± 0.954
0.0GlyXaa: 0.0 ± 0.0
His
0.982HisAla: 0.982 ± 0.366
0.491HisCys: 0.491 ± 0.338
3.436HisAsp: 3.436 ± 0.85
1.227HisGlu: 1.227 ± 0.681
0.736HisPhe: 0.736 ± 0.456
1.227HisGly: 1.227 ± 0.453
0.736HisHis: 0.736 ± 0.306
1.227HisIle: 1.227 ± 0.919
2.209HisLys: 2.209 ± 0.792
2.699HisLeu: 2.699 ± 1.141
1.227HisMet: 1.227 ± 0.449
1.718HisAsn: 1.718 ± 0.401
0.736HisPro: 0.736 ± 0.32
0.245HisGln: 0.245 ± 0.436
1.227HisArg: 1.227 ± 0.531
2.699HisSer: 2.699 ± 0.619
1.227HisThr: 1.227 ± 0.98
1.718HisVal: 1.718 ± 0.616
0.0HisTrp: 0.0 ± 0.0
2.945HisTyr: 2.945 ± 0.407
0.0HisXaa: 0.0 ± 0.0
Ile
3.926IleAla: 3.926 ± 1.195
1.472IleCys: 1.472 ± 0.398
5.399IleAsp: 5.399 ± 1.33
2.945IleGlu: 2.945 ± 0.821
4.908IlePhe: 4.908 ± 1.537
1.963IleGly: 1.963 ± 0.809
2.945IleHis: 2.945 ± 1.591
7.117IleIle: 7.117 ± 2.307
7.362IleLys: 7.362 ± 2.084
9.325IleLeu: 9.325 ± 3.101
3.436IleMet: 3.436 ± 1.087
4.417IleAsn: 4.417 ± 1.103
3.436IlePro: 3.436 ± 0.916
1.718IleGln: 1.718 ± 0.7
2.699IleArg: 2.699 ± 0.604
5.89IleSer: 5.89 ± 0.98
4.663IleThr: 4.663 ± 1.752
6.135IleVal: 6.135 ± 0.645
0.736IleTrp: 0.736 ± 0.818
3.681IleTyr: 3.681 ± 0.859
0.0IleXaa: 0.0 ± 0.0
Lys
2.699LysAla: 2.699 ± 0.957
1.472LysCys: 1.472 ± 0.884
3.681LysAsp: 3.681 ± 0.675
2.454LysGlu: 2.454 ± 0.731
9.571LysPhe: 9.571 ± 1.222
1.472LysGly: 1.472 ± 0.554
2.209LysHis: 2.209 ± 0.572
6.135LysIle: 6.135 ± 1.434
8.589LysLys: 8.589 ± 1.158
7.362LysLeu: 7.362 ± 2.687
1.227LysMet: 1.227 ± 1.038
5.153LysAsn: 5.153 ± 1.781
2.209LysPro: 2.209 ± 0.569
1.472LysGln: 1.472 ± 0.682
2.454LysArg: 2.454 ± 0.894
6.38LysSer: 6.38 ± 1.015
6.38LysThr: 6.38 ± 1.753
2.699LysVal: 2.699 ± 0.806
0.245LysTrp: 0.245 ± 0.361
8.098LysTyr: 8.098 ± 1.693
0.0LysXaa: 0.0 ± 0.0
Leu
3.436LeuAla: 3.436 ± 1.087
0.982LeuCys: 0.982 ± 0.394
4.172LeuAsp: 4.172 ± 0.876
2.209LeuGlu: 2.209 ± 0.665
3.436LeuPhe: 3.436 ± 0.536
1.718LeuGly: 1.718 ± 0.304
2.454LeuHis: 2.454 ± 0.494
6.626LeuIle: 6.626 ± 1.935
6.871LeuLys: 6.871 ± 0.696
5.153LeuLeu: 5.153 ± 1.643
2.454LeuMet: 2.454 ± 0.931
4.417LeuAsn: 4.417 ± 0.576
3.19LeuPro: 3.19 ± 0.243
3.681LeuGln: 3.681 ± 0.993
0.491LeuArg: 0.491 ± 0.542
6.871LeuSer: 6.871 ± 0.962
6.626LeuThr: 6.626 ± 1.259
4.908LeuVal: 4.908 ± 1.417
0.491LeuTrp: 0.491 ± 0.272
6.871LeuTyr: 6.871 ± 0.637
0.0LeuXaa: 0.0 ± 0.0
Met
0.491MetAla: 0.491 ± 0.605
1.227MetCys: 1.227 ± 0.652
1.718MetAsp: 1.718 ± 0.507
0.245MetGlu: 0.245 ± 0.136
1.963MetPhe: 1.963 ± 0.54
0.0MetGly: 0.0 ± 0.0
0.982MetHis: 0.982 ± 0.361
4.663MetIle: 4.663 ± 1.287
2.699MetLys: 2.699 ± 0.805
2.454MetLeu: 2.454 ± 0.622
1.227MetMet: 1.227 ± 0.331
0.982MetAsn: 0.982 ± 0.364
2.699MetPro: 2.699 ± 1.82
0.982MetGln: 0.982 ± 0.361
1.963MetArg: 1.963 ± 0.597
3.926MetSer: 3.926 ± 0.741
2.945MetThr: 2.945 ± 0.697
0.491MetVal: 0.491 ± 0.393
0.982MetTrp: 0.982 ± 0.393
3.19MetTyr: 3.19 ± 1.132
0.0MetXaa: 0.0 ± 0.0
Asn
1.718AsnAla: 1.718 ± 0.495
0.736AsnCys: 0.736 ± 0.341
5.153AsnAsp: 5.153 ± 1.086
1.227AsnGlu: 1.227 ± 0.42
1.718AsnPhe: 1.718 ± 0.917
1.227AsnGly: 1.227 ± 0.481
1.227AsnHis: 1.227 ± 0.636
5.644AsnIle: 5.644 ± 1.236
5.399AsnLys: 5.399 ± 1.702
4.417AsnLeu: 4.417 ± 1.129
1.718AsnMet: 1.718 ± 0.414
3.681AsnAsn: 3.681 ± 1.235
0.982AsnPro: 0.982 ± 0.855
1.472AsnGln: 1.472 ± 0.641
2.699AsnArg: 2.699 ± 0.461
4.417AsnSer: 4.417 ± 0.913
3.681AsnThr: 3.681 ± 1.247
4.908AsnVal: 4.908 ± 1.871
0.245AsnTrp: 0.245 ± 0.136
3.926AsnTyr: 3.926 ± 2.106
0.0AsnXaa: 0.0 ± 0.0
Pro
1.718ProAla: 1.718 ± 0.452
0.245ProCys: 0.245 ± 0.136
1.963ProAsp: 1.963 ± 1.001
1.718ProGlu: 1.718 ± 0.703
0.982ProPhe: 0.982 ± 1.019
2.209ProGly: 2.209 ± 0.762
0.736ProHis: 0.736 ± 0.341
2.945ProIle: 2.945 ± 0.966
2.209ProLys: 2.209 ± 1.259
1.718ProLeu: 1.718 ± 1.441
1.472ProMet: 1.472 ± 0.721
1.718ProAsn: 1.718 ± 0.413
1.472ProPro: 1.472 ± 0.365
0.982ProGln: 0.982 ± 0.365
1.227ProArg: 1.227 ± 0.507
2.945ProSer: 2.945 ± 0.685
2.454ProThr: 2.454 ± 0.789
2.209ProVal: 2.209 ± 0.683
0.0ProTrp: 0.0 ± 0.0
1.718ProTyr: 1.718 ± 0.91
0.0ProXaa: 0.0 ± 0.0
Gln
0.982GlnAla: 0.982 ± 0.366
0.491GlnCys: 0.491 ± 0.542
1.718GlnAsp: 1.718 ± 0.724
1.718GlnGlu: 1.718 ± 0.682
1.227GlnPhe: 1.227 ± 0.889
0.491GlnGly: 0.491 ± 0.272
0.245GlnHis: 0.245 ± 0.361
2.945GlnIle: 2.945 ± 0.852
0.736GlnLys: 0.736 ± 0.314
1.227GlnLeu: 1.227 ± 0.681
0.245GlnMet: 0.245 ± 0.136
1.718GlnAsn: 1.718 ± 0.387
0.982GlnPro: 0.982 ± 0.365
0.736GlnGln: 0.736 ± 0.341
1.718GlnArg: 1.718 ± 0.738
1.227GlnSer: 1.227 ± 0.525
1.718GlnThr: 1.718 ± 0.703
1.472GlnVal: 1.472 ± 0.61
0.0GlnTrp: 0.0 ± 0.0
1.963GlnTyr: 1.963 ± 1.706
0.0GlnXaa: 0.0 ± 0.0
Arg
1.472ArgAla: 1.472 ± 0.817
0.982ArgCys: 0.982 ± 0.545
0.736ArgAsp: 0.736 ± 0.494
0.736ArgGlu: 0.736 ± 0.314
1.718ArgPhe: 1.718 ± 0.676
0.491ArgGly: 0.491 ± 0.393
1.227ArgHis: 1.227 ± 0.665
2.699ArgIle: 2.699 ± 0.671
3.19ArgLys: 3.19 ± 1.066
2.699ArgLeu: 2.699 ± 0.884
1.227ArgMet: 1.227 ± 0.481
2.699ArgAsn: 2.699 ± 0.649
1.472ArgPro: 1.472 ± 0.539
0.245ArgGln: 0.245 ± 0.136
1.472ArgArg: 1.472 ± 1.085
5.644ArgSer: 5.644 ± 3.106
2.945ArgThr: 2.945 ± 1.446
1.227ArgVal: 1.227 ± 0.681
0.0ArgTrp: 0.0 ± 0.0
3.19ArgTyr: 3.19 ± 0.243
0.0ArgXaa: 0.0 ± 0.0
Ser
3.681SerAla: 3.681 ± 0.877
0.982SerCys: 0.982 ± 0.366
6.626SerAsp: 6.626 ± 1.747
3.436SerGlu: 3.436 ± 1.407
3.436SerPhe: 3.436 ± 0.594
2.454SerGly: 2.454 ± 0.962
1.718SerHis: 1.718 ± 0.954
8.589SerIle: 8.589 ± 2.253
7.607SerLys: 7.607 ± 0.865
6.626SerLeu: 6.626 ± 1.19
2.945SerMet: 2.945 ± 1.056
5.153SerAsn: 5.153 ± 1.216
0.982SerPro: 0.982 ± 0.441
1.718SerGln: 1.718 ± 0.452
5.399SerArg: 5.399 ± 2.829
3.681SerSer: 3.681 ± 1.004
4.908SerThr: 4.908 ± 1.325
5.153SerVal: 5.153 ± 0.563
0.491SerTrp: 0.491 ± 0.557
4.663SerTyr: 4.663 ± 0.998
0.0SerXaa: 0.0 ± 0.0
Thr
0.982ThrAla: 0.982 ± 0.361
0.245ThrCys: 0.245 ± 0.136
5.89ThrAsp: 5.89 ± 1.434
3.436ThrGlu: 3.436 ± 0.783
3.681ThrPhe: 3.681 ± 1.66
2.699ThrGly: 2.699 ± 0.546
1.718ThrHis: 1.718 ± 0.682
4.908ThrIle: 4.908 ± 1.537
7.117ThrLys: 7.117 ± 0.991
5.153ThrLeu: 5.153 ± 1.274
3.19ThrMet: 3.19 ± 0.84
1.472ThrAsn: 1.472 ± 0.472
2.945ThrPro: 2.945 ± 0.54
3.436ThrGln: 3.436 ± 0.779
1.718ThrArg: 1.718 ± 0.522
4.908ThrSer: 4.908 ± 0.623
3.681ThrThr: 3.681 ± 0.804
6.871ThrVal: 6.871 ± 1.544
0.491ThrTrp: 0.491 ± 0.338
5.89ThrTyr: 5.89 ± 1.184
0.0ThrXaa: 0.0 ± 0.0
Val
4.417ValAla: 4.417 ± 0.559
1.227ValCys: 1.227 ± 0.453
5.399ValAsp: 5.399 ± 1.104
2.945ValGlu: 2.945 ± 0.848
3.19ValPhe: 3.19 ± 0.643
1.963ValGly: 1.963 ± 0.574
2.454ValHis: 2.454 ± 0.437
2.945ValIle: 2.945 ± 0.885
6.626ValLys: 6.626 ± 1.135
3.681ValLeu: 3.681 ± 0.86
1.963ValMet: 1.963 ± 0.598
4.908ValAsn: 4.908 ± 0.833
2.454ValPro: 2.454 ± 0.485
0.245ValGln: 0.245 ± 0.136
2.209ValArg: 2.209 ± 0.541
4.417ValSer: 4.417 ± 2.144
4.908ValThr: 4.908 ± 1.999
4.908ValVal: 4.908 ± 0.989
0.736ValTrp: 0.736 ± 0.341
5.644ValTyr: 5.644 ± 1.361
0.0ValXaa: 0.0 ± 0.0
Trp
0.491TrpAla: 0.491 ± 0.773
0.0TrpCys: 0.0 ± 0.0
0.736TrpAsp: 0.736 ± 0.409
0.245TrpGlu: 0.245 ± 0.136
1.227TrpPhe: 1.227 ± 0.981
0.0TrpGly: 0.0 ± 0.0
0.245TrpHis: 0.245 ± 0.136
0.982TrpIle: 0.982 ± 0.572
0.0TrpLys: 0.0 ± 0.0
1.227TrpLeu: 1.227 ± 0.371
0.245TrpMet: 0.245 ± 0.347
0.245TrpAsn: 0.245 ± 0.136
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.491TrpArg: 0.491 ± 0.272
0.491TrpSer: 0.491 ± 0.393
0.0TrpThr: 0.0 ± 0.0
0.491TrpVal: 0.491 ± 0.557
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.454TyrAla: 2.454 ± 0.458
1.227TyrCys: 1.227 ± 0.46
8.098TyrAsp: 8.098 ± 1.659
3.19TyrGlu: 3.19 ± 0.732
5.153TyrPhe: 5.153 ± 1.427
2.945TyrGly: 2.945 ± 0.527
4.417TyrHis: 4.417 ± 1.47
4.417TyrIle: 4.417 ± 0.947
3.436TyrLys: 3.436 ± 0.718
3.436TyrLeu: 3.436 ± 1.018
3.19TyrMet: 3.19 ± 0.79
2.945TyrAsn: 2.945 ± 0.607
1.227TyrPro: 1.227 ± 0.453
1.963TyrGln: 1.963 ± 0.736
3.19TyrArg: 3.19 ± 0.966
4.172TyrSer: 4.172 ± 0.667
5.153TyrThr: 5.153 ± 1.159
4.417TyrVal: 4.417 ± 0.932
0.491TyrTrp: 0.491 ± 0.542
3.926TyrTyr: 3.926 ± 0.655
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4076 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski