Amino acid dipepetide frequency for Hubei insect virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.238AlaAla: 2.238 ± 0.599
0.852AlaCys: 0.852 ± 0.202
3.303AlaAsp: 3.303 ± 0.463
2.557AlaGlu: 2.557 ± 0.477
1.385AlaPhe: 1.385 ± 0.211
2.344AlaGly: 2.344 ± 0.835
1.705AlaHis: 1.705 ± 0.438
4.049AlaIle: 4.049 ± 0.473
2.131AlaLys: 2.131 ± 0.706
6.18AlaLeu: 6.18 ± 0.78
1.172AlaMet: 1.172 ± 0.352
3.729AlaAsn: 3.729 ± 0.764
1.811AlaPro: 1.811 ± 0.525
1.492AlaGln: 1.492 ± 0.369
2.344AlaArg: 2.344 ± 0.299
4.049AlaSer: 4.049 ± 0.71
3.836AlaThr: 3.836 ± 0.937
2.877AlaVal: 2.877 ± 0.508
0.32AlaTrp: 0.32 ± 0.126
2.664AlaTyr: 2.664 ± 0.524
0.0AlaXaa: 0.0 ± 0.0
Cys
1.066CysAla: 1.066 ± 0.408
0.533CysCys: 0.533 ± 0.209
0.959CysAsp: 0.959 ± 0.375
1.172CysGlu: 1.172 ± 0.28
0.639CysPhe: 0.639 ± 0.171
1.492CysGly: 1.492 ± 0.488
0.426CysHis: 0.426 ± 0.213
0.852CysIle: 0.852 ± 0.257
0.852CysLys: 0.852 ± 0.303
1.598CysLeu: 1.598 ± 0.516
0.852CysMet: 0.852 ± 0.278
1.385CysAsn: 1.385 ± 0.411
0.426CysPro: 0.426 ± 0.211
0.852CysGln: 0.852 ± 0.292
0.32CysArg: 0.32 ± 0.194
0.959CysSer: 0.959 ± 0.2
0.959CysThr: 0.959 ± 0.328
0.852CysVal: 0.852 ± 0.341
0.107CysTrp: 0.107 ± 0.091
1.811CysTyr: 1.811 ± 0.645
0.0CysXaa: 0.0 ± 0.0
Asp
4.262AspAla: 4.262 ± 0.548
0.426AspCys: 0.426 ± 0.246
3.41AspAsp: 3.41 ± 0.603
4.582AspGlu: 4.582 ± 0.389
3.197AspPhe: 3.197 ± 0.464
2.983AspGly: 2.983 ± 0.398
1.279AspHis: 1.279 ± 0.381
6.713AspIle: 6.713 ± 0.699
2.557AspLys: 2.557 ± 0.397
5.115AspLeu: 5.115 ± 0.712
1.598AspMet: 1.598 ± 0.507
2.983AspAsn: 2.983 ± 0.608
1.066AspPro: 1.066 ± 0.284
1.811AspGln: 1.811 ± 0.344
2.344AspArg: 2.344 ± 0.608
4.262AspSer: 4.262 ± 0.511
3.303AspThr: 3.303 ± 0.557
5.754AspVal: 5.754 ± 0.402
1.066AspTrp: 1.066 ± 0.395
2.238AspTyr: 2.238 ± 0.436
0.0AspXaa: 0.0 ± 0.0
Glu
1.705GluAla: 1.705 ± 0.468
0.852GluCys: 0.852 ± 0.278
1.705GluAsp: 1.705 ± 0.507
1.705GluGlu: 1.705 ± 0.603
2.238GluPhe: 2.238 ± 0.565
1.811GluGly: 1.811 ± 0.237
0.959GluHis: 0.959 ± 0.201
4.262GluIle: 4.262 ± 0.615
3.516GluLys: 3.516 ± 0.486
7.246GluLeu: 7.246 ± 0.558
0.852GluMet: 0.852 ± 0.359
3.197GluAsn: 3.197 ± 0.542
1.066GluPro: 1.066 ± 0.359
2.344GluGln: 2.344 ± 0.367
2.131GluArg: 2.131 ± 0.615
3.41GluSer: 3.41 ± 0.356
3.729GluThr: 3.729 ± 0.532
3.09GluVal: 3.09 ± 0.438
0.639GluTrp: 0.639 ± 0.217
3.836GluTyr: 3.836 ± 0.714
0.0GluXaa: 0.0 ± 0.0
Phe
2.451PheAla: 2.451 ± 0.528
0.852PheCys: 0.852 ± 0.245
3.09PheAsp: 3.09 ± 0.326
1.598PheGlu: 1.598 ± 0.393
1.811PhePhe: 1.811 ± 0.332
3.516PheGly: 3.516 ± 0.791
0.639PheHis: 0.639 ± 0.262
3.09PheIle: 3.09 ± 0.606
2.238PheLys: 2.238 ± 0.332
3.09PheLeu: 3.09 ± 0.673
0.746PheMet: 0.746 ± 0.294
3.623PheAsn: 3.623 ± 0.773
1.066PhePro: 1.066 ± 0.315
0.746PheGln: 0.746 ± 0.207
2.451PheArg: 2.451 ± 0.485
5.328PheSer: 5.328 ± 0.368
2.664PheThr: 2.664 ± 0.56
2.557PheVal: 2.557 ± 0.432
0.107PheTrp: 0.107 ± 0.095
1.172PheTyr: 1.172 ± 0.374
0.0PheXaa: 0.0 ± 0.0
Gly
1.918GlyAla: 1.918 ± 0.435
1.385GlyCys: 1.385 ± 0.195
2.664GlyAsp: 2.664 ± 0.517
2.131GlyGlu: 2.131 ± 0.512
1.598GlyPhe: 1.598 ± 0.317
2.451GlyGly: 2.451 ± 0.575
0.426GlyHis: 0.426 ± 0.195
4.475GlyIle: 4.475 ± 0.746
2.025GlyLys: 2.025 ± 0.524
4.262GlyLeu: 4.262 ± 0.618
1.279GlyMet: 1.279 ± 0.476
3.942GlyAsn: 3.942 ± 0.591
0.959GlyPro: 0.959 ± 0.265
1.598GlyGln: 1.598 ± 0.213
1.598GlyArg: 1.598 ± 0.629
3.836GlySer: 3.836 ± 0.382
3.41GlyThr: 3.41 ± 0.531
4.049GlyVal: 4.049 ± 0.459
0.639GlyTrp: 0.639 ± 0.244
2.025GlyTyr: 2.025 ± 0.573
0.0GlyXaa: 0.0 ± 0.0
His
1.492HisAla: 1.492 ± 0.306
0.213HisCys: 0.213 ± 0.125
1.811HisAsp: 1.811 ± 0.318
1.705HisGlu: 1.705 ± 0.561
0.32HisPhe: 0.32 ± 0.219
1.279HisGly: 1.279 ± 0.267
0.107HisHis: 0.107 ± 0.106
1.492HisIle: 1.492 ± 0.448
0.852HisLys: 0.852 ± 0.463
2.451HisLeu: 2.451 ± 0.367
0.426HisMet: 0.426 ± 0.159
0.746HisAsn: 0.746 ± 0.235
0.533HisPro: 0.533 ± 0.34
0.959HisGln: 0.959 ± 0.371
0.533HisArg: 0.533 ± 0.136
1.492HisSer: 1.492 ± 0.473
1.598HisThr: 1.598 ± 0.357
1.705HisVal: 1.705 ± 0.389
0.0HisTrp: 0.0 ± 0.0
1.066HisTyr: 1.066 ± 0.173
0.0HisXaa: 0.0 ± 0.0
Ile
4.795IleAla: 4.795 ± 0.691
1.705IleCys: 1.705 ± 0.683
6.287IleAsp: 6.287 ± 0.627
2.877IleGlu: 2.877 ± 0.573
3.197IlePhe: 3.197 ± 0.502
3.623IleGly: 3.623 ± 0.661
1.598IleHis: 1.598 ± 0.421
6.18IleIle: 6.18 ± 0.665
2.983IleLys: 2.983 ± 0.523
6.713IleLeu: 6.713 ± 0.561
1.279IleMet: 1.279 ± 0.356
6.606IleAsn: 6.606 ± 1.229
3.729IlePro: 3.729 ± 0.674
3.09IleGln: 3.09 ± 0.271
3.41IleArg: 3.41 ± 0.685
7.565IleSer: 7.565 ± 0.652
7.459IleThr: 7.459 ± 0.899
5.221IleVal: 5.221 ± 0.609
0.746IleTrp: 0.746 ± 0.28
3.729IleTyr: 3.729 ± 0.554
0.0IleXaa: 0.0 ± 0.0
Lys
1.811LysAla: 1.811 ± 0.346
0.533LysCys: 0.533 ± 0.172
2.77LysAsp: 2.77 ± 0.677
2.77LysGlu: 2.77 ± 0.519
2.877LysPhe: 2.877 ± 0.409
0.959LysGly: 0.959 ± 0.256
1.598LysHis: 1.598 ± 0.486
5.008LysIle: 5.008 ± 0.506
2.664LysLys: 2.664 ± 0.429
5.754LysLeu: 5.754 ± 0.525
2.025LysMet: 2.025 ± 0.651
3.303LysAsn: 3.303 ± 0.565
2.557LysPro: 2.557 ± 0.568
1.918LysGln: 1.918 ± 0.583
2.664LysArg: 2.664 ± 0.383
3.729LysSer: 3.729 ± 0.526
3.303LysThr: 3.303 ± 0.348
2.025LysVal: 2.025 ± 0.285
0.213LysTrp: 0.213 ± 0.114
3.623LysTyr: 3.623 ± 1.002
0.0LysXaa: 0.0 ± 0.0
Leu
4.688LeuAla: 4.688 ± 0.461
2.344LeuCys: 2.344 ± 0.517
4.475LeuAsp: 4.475 ± 0.634
5.115LeuGlu: 5.115 ± 0.573
3.836LeuPhe: 3.836 ± 0.595
4.582LeuGly: 4.582 ± 0.704
1.705LeuHis: 1.705 ± 0.439
6.713LeuIle: 6.713 ± 0.699
6.393LeuLys: 6.393 ± 0.788
9.164LeuLeu: 9.164 ± 0.805
1.598LeuMet: 1.598 ± 0.592
7.672LeuAsn: 7.672 ± 0.603
4.795LeuPro: 4.795 ± 0.644
3.197LeuGln: 3.197 ± 0.523
4.795LeuArg: 4.795 ± 0.541
9.483LeuSer: 9.483 ± 1.184
6.074LeuThr: 6.074 ± 0.641
5.86LeuVal: 5.86 ± 0.825
0.426LeuTrp: 0.426 ± 0.198
4.795LeuTyr: 4.795 ± 0.692
0.0LeuXaa: 0.0 ± 0.0
Met
1.811MetAla: 1.811 ± 0.414
0.32MetCys: 0.32 ± 0.159
1.385MetAsp: 1.385 ± 0.452
0.959MetGlu: 0.959 ± 0.212
1.279MetPhe: 1.279 ± 0.433
1.066MetGly: 1.066 ± 0.239
0.746MetHis: 0.746 ± 0.207
1.492MetIle: 1.492 ± 0.336
0.959MetLys: 0.959 ± 0.428
2.557MetLeu: 2.557 ± 0.376
0.852MetMet: 0.852 ± 0.313
1.492MetAsn: 1.492 ± 0.523
1.066MetPro: 1.066 ± 0.18
0.959MetGln: 0.959 ± 0.207
0.746MetArg: 0.746 ± 0.331
1.918MetSer: 1.918 ± 0.479
1.492MetThr: 1.492 ± 0.234
1.385MetVal: 1.385 ± 0.316
0.213MetTrp: 0.213 ± 0.19
1.492MetTyr: 1.492 ± 0.445
0.0MetXaa: 0.0 ± 0.0
Asn
4.369AsnAla: 4.369 ± 0.54
0.852AsnCys: 0.852 ± 0.247
4.262AsnAsp: 4.262 ± 0.573
4.901AsnGlu: 4.901 ± 0.59
3.41AsnPhe: 3.41 ± 0.82
4.262AsnGly: 4.262 ± 0.588
1.066AsnHis: 1.066 ± 0.22
6.606AsnIle: 6.606 ± 0.786
4.049AsnLys: 4.049 ± 0.282
6.393AsnLeu: 6.393 ± 0.741
1.598AsnMet: 1.598 ± 0.221
4.156AsnAsn: 4.156 ± 0.61
1.492AsnPro: 1.492 ± 0.612
1.811AsnGln: 1.811 ± 0.418
2.983AsnArg: 2.983 ± 0.635
4.795AsnSer: 4.795 ± 0.688
5.221AsnThr: 5.221 ± 0.689
4.795AsnVal: 4.795 ± 0.685
0.32AsnTrp: 0.32 ± 0.161
4.049AsnTyr: 4.049 ± 0.631
0.0AsnXaa: 0.0 ± 0.0
Pro
2.025ProAla: 2.025 ± 0.349
0.533ProCys: 0.533 ± 0.136
1.598ProAsp: 1.598 ± 0.345
1.598ProGlu: 1.598 ± 0.361
1.172ProPhe: 1.172 ± 0.249
1.279ProGly: 1.279 ± 0.306
0.852ProHis: 0.852 ± 0.232
3.303ProIle: 3.303 ± 0.746
2.131ProLys: 2.131 ± 0.593
2.877ProLeu: 2.877 ± 0.59
0.639ProMet: 0.639 ± 0.21
2.238ProAsn: 2.238 ± 0.538
2.025ProPro: 2.025 ± 1.406
0.959ProGln: 0.959 ± 0.249
1.172ProArg: 1.172 ± 0.357
2.238ProSer: 2.238 ± 0.452
3.623ProThr: 3.623 ± 0.772
2.238ProVal: 2.238 ± 0.374
0.107ProTrp: 0.107 ± 0.091
1.385ProTyr: 1.385 ± 0.281
0.0ProXaa: 0.0 ± 0.0
Gln
1.811GlnAla: 1.811 ± 0.486
1.066GlnCys: 1.066 ± 0.613
1.705GlnAsp: 1.705 ± 0.389
1.385GlnGlu: 1.385 ± 0.243
1.598GlnPhe: 1.598 ± 0.249
0.426GlnGly: 0.426 ± 0.119
0.959GlnHis: 0.959 ± 0.43
3.197GlnIle: 3.197 ± 0.63
1.705GlnLys: 1.705 ± 0.206
3.942GlnLeu: 3.942 ± 0.697
0.746GlnMet: 0.746 ± 0.232
2.238GlnAsn: 2.238 ± 0.645
1.279GlnPro: 1.279 ± 0.604
1.492GlnGln: 1.492 ± 0.446
2.238GlnArg: 2.238 ± 0.44
1.705GlnSer: 1.705 ± 0.594
1.918GlnThr: 1.918 ± 0.574
2.238GlnVal: 2.238 ± 0.585
0.426GlnTrp: 0.426 ± 0.126
1.492GlnTyr: 1.492 ± 0.304
0.0GlnXaa: 0.0 ± 0.0
Arg
2.131ArgAla: 2.131 ± 0.369
1.066ArgCys: 1.066 ± 0.359
2.664ArgAsp: 2.664 ± 0.386
1.705ArgGlu: 1.705 ± 0.296
2.664ArgPhe: 2.664 ± 0.491
1.918ArgGly: 1.918 ± 0.377
0.852ArgHis: 0.852 ± 0.259
2.344ArgIle: 2.344 ± 0.392
2.238ArgLys: 2.238 ± 0.343
5.008ArgLeu: 5.008 ± 0.353
1.172ArgMet: 1.172 ± 0.374
3.942ArgAsn: 3.942 ± 0.568
1.172ArgPro: 1.172 ± 0.5
2.025ArgGln: 2.025 ± 0.517
2.77ArgArg: 2.77 ± 0.539
2.877ArgSer: 2.877 ± 0.628
2.025ArgThr: 2.025 ± 0.649
2.77ArgVal: 2.77 ± 0.442
0.32ArgTrp: 0.32 ± 0.134
2.77ArgTyr: 2.77 ± 0.501
0.0ArgXaa: 0.0 ± 0.0
Ser
3.836SerAla: 3.836 ± 0.589
1.279SerCys: 1.279 ± 0.416
5.647SerAsp: 5.647 ± 0.806
4.901SerGlu: 4.901 ± 0.588
3.41SerPhe: 3.41 ± 0.567
4.582SerGly: 4.582 ± 0.647
1.811SerHis: 1.811 ± 0.401
7.139SerIle: 7.139 ± 0.913
5.221SerLys: 5.221 ± 0.447
7.565SerLeu: 7.565 ± 1.006
1.598SerMet: 1.598 ± 0.421
5.115SerAsn: 5.115 ± 0.614
2.238SerPro: 2.238 ± 0.904
2.025SerGln: 2.025 ± 0.308
3.197SerArg: 3.197 ± 0.608
5.221SerSer: 5.221 ± 0.881
5.541SerThr: 5.541 ± 0.981
4.049SerVal: 4.049 ± 0.641
0.746SerTrp: 0.746 ± 0.226
2.77SerTyr: 2.77 ± 0.271
0.0SerXaa: 0.0 ± 0.0
Thr
2.77ThrAla: 2.77 ± 0.661
0.639ThrCys: 0.639 ± 0.205
4.582ThrAsp: 4.582 ± 0.863
2.451ThrGlu: 2.451 ± 0.383
3.09ThrPhe: 3.09 ± 0.587
2.77ThrGly: 2.77 ± 0.515
1.705ThrHis: 1.705 ± 0.354
6.819ThrIle: 6.819 ± 0.717
3.41ThrLys: 3.41 ± 0.579
7.778ThrLeu: 7.778 ± 1.328
1.918ThrMet: 1.918 ± 0.694
3.942ThrAsn: 3.942 ± 0.597
2.877ThrPro: 2.877 ± 0.511
3.516ThrGln: 3.516 ± 0.538
2.557ThrArg: 2.557 ± 0.502
5.647ThrSer: 5.647 ± 0.869
4.901ThrThr: 4.901 ± 1.532
4.369ThrVal: 4.369 ± 0.534
0.533ThrTrp: 0.533 ± 0.205
3.942ThrTyr: 3.942 ± 0.38
0.0ThrXaa: 0.0 ± 0.0
Val
2.451ValAla: 2.451 ± 0.657
1.279ValCys: 1.279 ± 0.295
4.049ValAsp: 4.049 ± 0.339
3.197ValGlu: 3.197 ± 0.586
2.557ValPhe: 2.557 ± 0.654
2.557ValGly: 2.557 ± 0.45
1.172ValHis: 1.172 ± 0.356
4.688ValIle: 4.688 ± 0.803
3.303ValLys: 3.303 ± 0.58
4.688ValLeu: 4.688 ± 0.658
2.025ValMet: 2.025 ± 0.312
5.541ValAsn: 5.541 ± 0.58
1.918ValPro: 1.918 ± 0.275
1.705ValGln: 1.705 ± 0.388
3.303ValArg: 3.303 ± 0.499
5.541ValSer: 5.541 ± 0.657
5.008ValThr: 5.008 ± 1.089
4.795ValVal: 4.795 ± 0.624
0.639ValTrp: 0.639 ± 0.258
2.983ValTyr: 2.983 ± 0.61
0.0ValXaa: 0.0 ± 0.0
Trp
0.32TrpAla: 0.32 ± 0.183
0.0TrpCys: 0.0 ± 0.0
0.426TrpAsp: 0.426 ± 0.18
0.852TrpGlu: 0.852 ± 0.274
0.639TrpPhe: 0.639 ± 0.213
0.213TrpGly: 0.213 ± 0.12
0.107TrpHis: 0.107 ± 0.092
0.426TrpIle: 0.426 ± 0.141
0.426TrpLys: 0.426 ± 0.191
0.746TrpLeu: 0.746 ± 0.31
0.32TrpMet: 0.32 ± 0.209
0.852TrpAsn: 0.852 ± 0.126
0.213TrpPro: 0.213 ± 0.118
0.213TrpGln: 0.213 ± 0.119
0.533TrpArg: 0.533 ± 0.248
0.746TrpSer: 0.746 ± 0.257
0.32TrpThr: 0.32 ± 0.199
0.107TrpVal: 0.107 ± 0.091
0.107TrpTrp: 0.107 ± 0.075
0.426TrpTyr: 0.426 ± 0.291
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.09TyrAla: 3.09 ± 0.63
1.385TyrCys: 1.385 ± 0.388
4.156TyrAsp: 4.156 ± 0.445
2.131TyrGlu: 2.131 ± 0.427
1.918TyrPhe: 1.918 ± 0.346
2.557TyrGly: 2.557 ± 0.386
1.066TyrHis: 1.066 ± 0.293
4.049TyrIle: 4.049 ± 0.734
2.451TyrLys: 2.451 ± 0.439
4.582TyrLeu: 4.582 ± 0.81
1.385TyrMet: 1.385 ± 0.265
4.688TyrAsn: 4.688 ± 0.893
1.705TyrPro: 1.705 ± 0.325
0.852TyrGln: 0.852 ± 0.251
2.344TyrArg: 2.344 ± 0.466
3.09TyrSer: 3.09 ± 0.503
3.836TyrThr: 3.836 ± 0.651
2.664TyrVal: 2.664 ± 0.303
0.32TyrTrp: 0.32 ± 0.163
1.385TyrTyr: 1.385 ± 0.449
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (9386 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski