Amino acid dipeptide frequency for Hubei lepidoptera virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each amino acid dipeptide would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
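The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used by Proteome-pI: the function names, the toy sequences, and the choice to count overlapping residue pairs within each protein (never across two proteins) are assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids

def dipeptide_counts(proteins):
    """Count overlapping dipeptides; assumption: pairs never span two proteins."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is treated as X (Xaa).
            first = first if first in STANDARD else "X"
            second = second if second in STANDARD else "X"
            counts[first + second] += 1
    return counts

def per_mille(counts):
    """Convert raw counts into frequencies expressed in per mille."""
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED = 1000.0 / 400

def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"
    if freq < EXPECTED:
        return "under"
    return "expected"

# Hypothetical toy sequences, not taken from the proteome.
toy_proteins = ["MKLLSA", "MLLB"]
freqs = per_mille(dipeptide_counts(toy_proteins))
```

With this labelling, a table value such as AlaAla (2.796‰) falls above the 2.5‰ expectation and AlaCys (0.699‰) falls below it.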

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 2.796 ± 0.647 | 0.699 ± 0.363 | 3.029 ± 1.075 | 3.262 ± 1.041 | 2.563 ± 0.62 | 1.631 ± 0.377 | 0.233 ± 0.137 | 2.097 ± 0.512 | 1.398 ± 0.642 | 5.126 ± 1.559 | 0.699 ± 0.321 | 1.631 ± 0.454 | 1.165 ± 0.863 | 1.631 ± 0.771 | 1.398 ± 0.31 | 1.864 ± 0.868 | 2.796 ± 0.505 | 2.33 ± 1.053 | 0.932 ± 0.421 | 1.864 ± 0.743 | 0.0 ± 0.0 |
| Cys | 0.699 ± 0.453 | 0.466 ± 0.518 | 0.466 ± 0.252 | 0.233 ± 0.311 | 0.466 ± 0.274 | 1.165 ± 0.234 | 0.466 ± 0.274 | 2.33 ± 0.347 | 1.631 ± 0.517 | 1.631 ± 0.679 | 0.466 ± 0.254 | 0.699 ± 0.549 | 1.165 ± 0.234 | 0.932 ± 0.837 | 1.631 ± 0.517 | 1.398 ± 0.599 | 1.165 ± 0.416 | 0.466 ± 0.271 | 0.233 ± 0.137 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 1.398 ± 0.591 | 1.165 ± 0.638 | 3.262 ± 0.654 | 3.961 ± 0.83 | 1.398 ± 0.306 | 2.33 ± 0.331 | 0.699 ± 0.371 | 3.029 ± 0.573 | 3.961 ± 0.662 | 6.291 ± 0.93 | 0.932 ± 0.421 | 2.097 ± 0.695 | 3.495 ± 1.137 | 2.563 ± 0.771 | 3.728 ± 0.609 | 3.961 ± 0.907 | 2.796 ± 0.272 | 3.495 ± 0.897 | 1.165 ± 0.752 | 2.33 ± 0.907 | 0.0 ± 0.0 |
| Glu | 2.563 ± 0.881 | 0.466 ± 0.405 | 3.029 ± 0.807 | 4.66 ± 0.976 | 2.796 ± 0.996 | 4.427 ± 0.907 | 1.165 ± 0.748 | 7.223 ± 0.939 | 3.728 ± 0.988 | 4.893 ± 0.568 | 1.398 ± 0.383 | 2.097 ± 0.708 | 1.165 ± 0.658 | 1.631 ± 0.796 | 3.728 ± 0.857 | 6.99 ± 1.16 | 4.194 ± 0.865 | 3.262 ± 0.768 | 0.466 ± 0.274 | 2.563 ± 0.937 | 0.0 ± 0.0 |
| Phe | 1.864 ± 0.63 | 0.932 ± 0.231 | 1.398 ± 0.345 | 2.33 ± 0.339 | 1.631 ± 0.719 | 2.563 ± 0.883 | 1.631 ± 0.783 | 1.864 ± 1.019 | 1.864 ± 0.3 | 7.223 ± 1.307 | 0.699 ± 0.477 | 3.728 ± 1.098 | 2.563 ± 0.639 | 2.796 ± 0.743 | 1.631 ± 0.214 | 4.427 ± 1.236 | 1.864 ± 0.474 | 2.33 ± 0.305 | 0.466 ± 0.274 | 0.699 ± 0.522 | 0.0 ± 0.0 |
| Gly | 1.864 ± 0.666 | 0.466 ± 0.36 | 3.495 ± 0.615 | 3.029 ± 0.525 | 2.563 ± 0.674 | 1.864 ± 0.58 | 1.165 ± 0.381 | 4.427 ± 1.099 | 3.728 ± 1.014 | 7.456 ± 1.201 | 1.165 ± 0.436 | 2.563 ± 0.606 | 2.33 ± 0.626 | 3.495 ± 0.917 | 3.029 ± 0.601 | 4.427 ± 1.3 | 2.097 ± 0.544 | 5.359 ± 0.232 | 0.466 ± 0.274 | 2.796 ± 0.848 | 0.0 ± 0.0 |
| His | 0.932 ± 0.404 | 0.0 ± 0.0 | 0.932 ± 0.504 | 3.495 ± 0.872 | 0.932 ± 0.337 | 2.097 ± 0.831 | 0.699 ± 0.43 | 1.864 ± 0.625 | 0.699 ± 0.321 | 1.864 ± 0.551 | 0.932 ± 0.273 | 0.466 ± 0.252 | 1.864 ± 0.672 | 1.631 ± 0.537 | 1.398 ± 0.823 | 1.864 ± 0.679 | 0.466 ± 0.252 | 0.466 ± 0.36 | 0.932 ± 0.334 | 1.631 ± 0.272 | 0.0 ± 0.0 |
| Ile | 2.097 ± 1.112 | 1.631 ± 0.419 | 3.495 ± 0.962 | 3.728 ± 0.404 | 2.563 ± 0.322 | 5.126 ± 1.363 | 2.097 ± 1.119 | 2.796 ± 1.328 | 7.689 ± 1.482 | 6.757 ± 0.752 | 1.165 ± 0.367 | 3.961 ± 0.83 | 5.592 ± 1.036 | 3.961 ± 0.598 | 3.961 ± 0.653 | 5.825 ± 1.003 | 3.961 ± 0.842 | 3.495 ± 0.656 | 0.233 ± 0.137 | 4.194 ± 1.088 | 0.0 ± 0.0 |
| Lys | 3.262 ± 0.848 | 1.398 ± 0.498 | 4.194 ± 0.997 | 6.058 ± 1.191 | 1.864 ± 0.741 | 3.262 ± 1.12 | 0.932 ± 0.29 | 4.66 ± 0.38 | 5.592 ± 1.158 | 5.359 ± 0.983 | 2.097 ± 0.493 | 3.029 ± 1.058 | 3.961 ± 1.521 | 1.165 ± 0.344 | 3.029 ± 0.916 | 4.194 ± 0.917 | 4.194 ± 0.911 | 2.563 ± 0.848 | 0.466 ± 0.274 | 2.097 ± 0.63 | 0.0 ± 0.0 |
| Leu | 3.495 ± 1.081 | 1.631 ± 0.773 | 4.194 ± 0.446 | 6.99 ± 0.84 | 5.359 ± 1.234 | 6.524 ± 1.289 | 3.029 ± 0.448 | 8.854 ± 1.289 | 3.728 ± 0.493 | 10.252 ± 1.588 | 1.398 ± 0.435 | 7.223 ± 1.377 | 5.126 ± 0.896 | 2.563 ± 0.744 | 5.126 ± 2.089 | 8.854 ± 1.676 | 6.291 ± 0.248 | 4.194 ± 1.284 | 1.165 ± 0.235 | 4.427 ± 0.678 | 0.0 ± 0.0 |
| Met | 1.398 ± 0.663 | 0.0 ± 0.0 | 1.165 ± 0.343 | 0.699 ± 0.308 | 0.932 ± 0.42 | 0.932 ± 0.273 | 0.233 ± 0.137 | 2.33 ± 0.731 | 2.33 ± 0.617 | 2.097 ± 0.49 | 0.699 ± 0.289 | 1.165 ± 0.31 | 0.0 ± 0.0 | 0.466 ± 0.254 | 1.165 ± 0.519 | 1.864 ± 0.447 | 1.631 ± 0.505 | 0.699 ± 0.321 | 0.233 ± 0.314 | 0.466 ± 0.271 | 0.0 ± 0.0 |
| Asn | 3.728 ± 0.941 | 1.165 ± 0.235 | 2.563 ± 0.694 | 1.631 ± 0.36 | 2.33 ± 0.801 | 2.796 ± 0.825 | 2.33 ± 0.405 | 3.961 ± 1.488 | 3.029 ± 0.582 | 5.359 ± 0.446 | 0.466 ± 0.401 | 2.796 ± 0.602 | 4.194 ± 0.978 | 1.631 ± 0.617 | 2.097 ± 0.509 | 4.427 ± 1.478 | 2.563 ± 0.572 | 2.563 ± 0.714 | 1.398 ± 0.303 | 2.33 ± 0.536 | 0.0 ± 0.0 |
| Pro | 2.33 ± 0.793 | 0.699 ± 0.246 | 2.097 ± 1.034 | 3.262 ± 1.233 | 2.097 ± 0.478 | 2.796 ± 1.553 | 1.631 ± 0.419 | 2.563 ± 0.508 | 3.961 ± 1.391 | 5.359 ± 0.863 | 0.932 ± 0.448 | 2.33 ± 1.138 | 2.097 ± 0.666 | 1.398 ± 0.717 | 1.631 ± 0.706 | 5.592 ± 1.612 | 3.728 ± 0.406 | 2.097 ± 0.558 | 0.699 ± 0.289 | 3.029 ± 0.528 | 0.0 ± 0.0 |
| Gln | 2.563 ± 0.642 | 0.699 ± 0.261 | 2.796 ± 0.981 | 2.33 ± 0.985 | 2.33 ± 0.816 | 2.796 ± 0.557 | 0.699 ± 0.411 | 3.029 ± 0.616 | 1.631 ± 0.237 | 2.33 ± 0.807 | 0.466 ± 0.274 | 2.563 ± 0.63 | 0.932 ± 0.786 | 1.165 ± 0.297 | 2.33 ± 0.728 | 3.728 ± 0.921 | 1.864 ± 0.679 | 2.33 ± 0.589 | 0.233 ± 0.259 | 0.699 ± 0.246 | 0.0 ± 0.0 |
| Arg | 1.165 ± 0.436 | 0.466 ± 0.274 | 2.563 ± 0.222 | 4.66 ± 1.058 | 2.563 ± 0.722 | 2.563 ± 0.893 | 1.864 ± 0.539 | 3.495 ± 0.784 | 2.33 ± 0.368 | 4.427 ± 1.266 | 1.165 ± 0.538 | 1.864 ± 0.728 | 1.631 ± 0.816 | 3.495 ± 0.916 | 3.495 ± 1.204 | 5.126 ± 0.508 | 3.495 ± 0.726 | 2.796 ± 0.797 | 0.699 ± 0.289 | 1.165 ± 0.518 | 0.0 ± 0.0 |
| Ser | 3.495 ± 1.014 | 2.097 ± 0.7 | 5.359 ± 1.473 | 4.66 ± 0.421 | 3.961 ± 1.187 | 5.825 ± 0.998 | 1.864 ± 0.289 | 4.66 ± 0.83 | 5.126 ± 1.189 | 9.087 ± 1.223 | 0.932 ± 0.337 | 4.66 ± 1.946 | 4.427 ± 0.98 | 1.398 ± 0.612 | 4.194 ± 1.38 | 9.32 ± 1.686 | 5.359 ± 1.486 | 5.359 ± 0.988 | 2.33 ± 0.398 | 2.33 ± 0.422 | 0.0 ± 0.0 |
| Thr | 1.165 ± 0.82 | 1.165 ± 0.49 | 4.194 ± 0.461 | 3.262 ± 0.743 | 2.796 ± 1.538 | 3.262 ± 0.518 | 1.864 ± 0.581 | 5.359 ± 0.628 | 3.262 ± 0.849 | 4.194 ± 0.709 | 1.864 ± 0.61 | 3.029 ± 0.978 | 3.262 ± 0.483 | 1.631 ± 0.377 | 3.728 ± 1.09 | 3.961 ± 0.689 | 3.728 ± 0.316 | 3.495 ± 0.816 | 1.864 ± 0.838 | 1.631 ± 1.158 | 0.0 ± 0.0 |
| Val | 0.932 ± 0.616 | 1.631 ± 0.412 | 2.796 ± 0.729 | 2.097 ± 0.491 | 1.864 ± 0.462 | 2.33 ± 0.399 | 0.932 ± 0.573 | 3.961 ± 0.903 | 4.194 ± 0.485 | 5.825 ± 0.918 | 1.398 ± 0.443 | 3.262 ± 1.279 | 2.563 ± 0.801 | 1.631 ± 0.574 | 1.864 ± 0.67 | 5.592 ± 0.97 | 3.728 ± 1.425 | 2.33 ± 1.108 | 0.699 ± 0.411 | 1.398 ± 0.554 | 0.0 ± 0.0 |
| Trp | 0.466 ± 0.257 | 0.0 ± 0.0 | 1.165 ± 0.463 | 0.932 ± 0.548 | 2.097 ± 1.084 | 1.165 ± 0.519 | 0.233 ± 0.137 | 2.097 ± 0.534 | 0.699 ± 0.411 | 0.699 ± 0.299 | 0.233 ± 0.137 | 1.398 ± 0.617 | 0.233 ± 0.137 | 0.466 ± 0.212 | 0.466 ± 0.274 | 1.165 ± 0.538 | 0.932 ± 0.42 | 0.233 ± 0.259 | 0.466 ± 0.377 | 0.233 ± 0.311 | 0.0 ± 0.0 |
| Tyr | 0.932 ± 0.548 | 0.932 ± 0.701 | 1.864 ± 0.718 | 1.165 ± 0.489 | 1.398 ± 0.505 | 2.33 ± 0.586 | 1.398 ± 0.554 | 3.728 ± 1.083 | 2.796 ± 0.93 | 4.194 ± 0.596 | 1.165 ± 0.392 | 2.796 ± 1.741 | 2.563 ± 1.156 | 1.864 ± 0.635 | 1.398 ± 0.655 | 1.864 ± 1.043 | 1.864 ± 1.389 | 1.165 ± 0.548 | 0.233 ± 0.311 | 1.165 ± 0.302 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 6 proteins (4293 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
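A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration rather than the database's actual procedure: the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are all choices made for the example.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins):
    """Per-mille frequency of each overlapping dipeptide, pooled over proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement and return the spread of one frequency.

    Assumption: the error is the sample standard deviation over the replicates.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    values = []
    for _ in range(replicates):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.stdev(values)

# Hypothetical toy sequences, not taken from the proteome.
toy_proteins = ["MKLLSALL", "MLLGAV", "MSTAKV", "MGLLKE"]
err = bootstrap_error(toy_proteins, "LL")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins are included, which matters for a proteome of only 6 proteins.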

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski