Amino acid dipeptide frequency for Hubei rhabdo-like virus 9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
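As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the one-letter sequence codes, the toy sequences, and the choice to normalize by the total number of counted pairs are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the number of counted pairs, scale to per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the proteome.
toy_proteins = ["MKLLA", "LLK"]
frequencies = dipeptide_frequencies(toy_proteins)
```

Here the pairs are MK, KL, LL, LA from the first sequence and LL, LK from the second, so LL accounts for 2 of the 6 counted pairs.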

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
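The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names are hypothetical, and using exactly 1/400 (2.5‰) as the cut-off between over- and underrepresented dipeptides is an assumption; the two example values are the SerLeu and CysCys entries from the table.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the assumed expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


ser_leu = 10.818  # SerLeu, from the table
cys_cys = 0.235   # CysCys, from the table
ser_leu_fraction = per_mille_to_fraction(ser_leu)
labels = (classify(ser_leu), classify(cys_cys))
```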

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.704 ± 1.503
AlaCys: 1.881 ± 0.671
AlaAsp: 3.057 ± 0.781
AlaGlu: 3.293 ± 1.175
AlaPhe: 1.881 ± 0.389
AlaGly: 2.822 ± 0.959
AlaHis: 1.411 ± 0.483
AlaIle: 4.939 ± 1.556
AlaLys: 1.646 ± 0.734
AlaLeu: 6.115 ± 1.07
AlaMet: 1.411 ± 0.647
AlaAsn: 1.881 ± 0.325
AlaPro: 2.352 ± 0.744
AlaGln: 3.293 ± 0.809
AlaArg: 3.528 ± 0.495
AlaSer: 3.293 ± 1.142
AlaThr: 4.468 ± 0.96
AlaVal: 5.409 ± 0.961
AlaTrp: 1.176 ± 0.532
AlaTyr: 1.646 ± 0.581
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.235 ± 0.142
CysCys: 0.235 ± 0.142
CysAsp: 2.352 ± 1.071
CysGlu: 0.941 ± 0.542
CysPhe: 1.646 ± 0.473
CysGly: 0.235 ± 0.142
CysHis: 0.235 ± 0.405
CysIle: 1.176 ± 0.461
CysLys: 2.117 ± 0.691
CysLeu: 1.176 ± 0.674
CysMet: 0.47 ± 0.499
CysAsn: 0.706 ± 0.639
CysPro: 1.646 ± 0.773
CysGln: 0.706 ± 0.426
CysArg: 1.881 ± 0.916
CysSer: 3.293 ± 0.579
CysThr: 0.941 ± 0.646
CysVal: 1.646 ± 0.579
CysTrp: 0.0 ± 0.0
CysTyr: 0.235 ± 0.307
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.998 ± 1.343
AspCys: 1.411 ± 1.284
AspAsp: 1.411 ± 0.483
AspGlu: 3.057 ± 0.725
AspPhe: 1.881 ± 0.521
AspGly: 2.587 ± 0.613
AspHis: 1.411 ± 0.565
AspIle: 1.646 ± 0.53
AspLys: 3.293 ± 0.79
AspLeu: 5.174 ± 1.568
AspMet: 2.117 ± 0.308
AspAsn: 1.646 ± 0.612
AspPro: 4.704 ± 1.46
AspGln: 1.411 ± 0.429
AspArg: 2.822 ± 1.405
AspSer: 5.88 ± 0.744
AspThr: 2.352 ± 0.705
AspVal: 2.587 ± 0.499
AspTrp: 0.941 ± 0.35
AspTyr: 3.057 ± 0.725
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.822 ± 0.786
GluCys: 2.117 ± 0.704
GluAsp: 2.117 ± 0.552
GluGlu: 3.293 ± 1.034
GluPhe: 1.646 ± 0.53
GluGly: 2.352 ± 0.466
GluHis: 1.646 ± 0.521
GluIle: 3.293 ± 0.965
GluLys: 2.587 ± 0.909
GluLeu: 6.35 ± 0.564
GluMet: 1.881 ± 0.772
GluAsn: 2.117 ± 1.054
GluPro: 1.176 ± 0.399
GluGln: 1.881 ± 0.719
GluArg: 3.528 ± 1.285
GluSer: 5.644 ± 0.623
GluThr: 4.468 ± 1.445
GluVal: 2.117 ± 0.678
GluTrp: 0.941 ± 0.414
GluTyr: 3.057 ± 1.057
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.528 ± 0.719
PheCys: 0.706 ± 0.27
PheAsp: 2.117 ± 0.766
PheGlu: 2.117 ± 0.813
PhePhe: 2.587 ± 0.993
PheGly: 1.176 ± 0.48
PheHis: 1.411 ± 0.698
PheIle: 2.117 ± 1.226
PheLys: 2.587 ± 0.791
PheLeu: 2.822 ± 0.741
PheMet: 0.0 ± 0.0
PheAsn: 1.176 ± 0.343
PhePro: 1.881 ± 0.681
PheGln: 2.117 ± 0.56
PheArg: 2.117 ± 0.55
PheSer: 5.88 ± 1.165
PheThr: 1.881 ± 0.845
PheVal: 2.352 ± 0.922
PheTrp: 0.47 ± 0.252
PheTyr: 1.176 ± 0.287
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.881 ± 0.325
GlyCys: 0.235 ± 0.381
GlyAsp: 3.528 ± 0.85
GlyGlu: 1.646 ± 0.612
GlyPhe: 2.822 ± 1.136
GlyGly: 2.587 ± 0.632
GlyHis: 1.176 ± 0.502
GlyIle: 3.057 ± 0.563
GlyLys: 3.528 ± 0.624
GlyLeu: 6.35 ± 1.272
GlyMet: 0.706 ± 0.818
GlyAsn: 1.646 ± 0.485
GlyPro: 2.822 ± 0.952
GlyGln: 1.881 ± 0.893
GlyArg: 1.646 ± 0.425
GlySer: 2.822 ± 0.518
GlyThr: 2.117 ± 1.366
GlyVal: 4.233 ± 0.931
GlyTrp: 0.941 ± 0.413
GlyTyr: 1.646 ± 0.538
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.176 ± 0.55
HisCys: 0.941 ± 0.44
HisAsp: 1.411 ± 0.734
HisGlu: 0.706 ± 0.427
HisPhe: 1.646 ± 0.612
HisGly: 1.176 ± 0.666
HisHis: 0.706 ± 0.543
HisIle: 1.881 ± 0.394
HisLys: 0.47 ± 0.284
HisLeu: 2.117 ± 0.557
HisMet: 0.47 ± 0.307
HisAsn: 1.176 ± 0.514
HisPro: 1.411 ± 0.646
HisGln: 1.411 ± 0.808
HisArg: 1.646 ± 0.316
HisSer: 2.822 ± 0.857
HisThr: 1.176 ± 0.502
HisVal: 0.941 ± 0.503
HisTrp: 0.0 ± 0.0
HisTyr: 1.176 ± 0.401
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.528 ± 1.316
IleCys: 1.176 ± 0.287
IleAsp: 4.233 ± 1.303
IleGlu: 3.293 ± 0.94
IlePhe: 2.352 ± 0.94
IleGly: 4.704 ± 1.198
IleHis: 2.587 ± 0.693
IleIle: 4.233 ± 0.97
IleLys: 3.528 ± 1.075
IleLeu: 5.644 ± 0.599
IleMet: 1.646 ± 0.579
IleAsn: 2.352 ± 0.475
IlePro: 4.233 ± 0.95
IleGln: 1.646 ± 0.993
IleArg: 2.117 ± 0.679
IleSer: 6.35 ± 1.064
IleThr: 5.409 ± 1.232
IleVal: 3.528 ± 1.062
IleTrp: 1.176 ± 0.758
IleTyr: 2.117 ± 0.806
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.233 ± 0.742
LysCys: 1.646 ± 0.573
LysAsp: 3.293 ± 1.042
LysGlu: 1.881 ± 0.517
LysPhe: 1.176 ± 0.406
LysGly: 2.587 ± 0.832
LysHis: 1.646 ± 0.581
LysIle: 3.293 ± 0.864
LysLys: 2.117 ± 0.493
LysLeu: 4.939 ± 1.424
LysMet: 1.881 ± 0.648
LysAsn: 1.176 ± 0.401
LysPro: 2.117 ± 0.362
LysGln: 2.822 ± 0.921
LysArg: 3.057 ± 0.723
LysSer: 4.704 ± 0.593
LysThr: 3.998 ± 0.914
LysVal: 3.057 ± 0.649
LysTrp: 1.411 ± 0.708
LysTyr: 1.881 ± 0.622
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.409 ± 0.944
LeuCys: 2.117 ± 0.987
LeuAsp: 5.88 ± 1.28
LeuGlu: 4.704 ± 1.289
LeuPhe: 3.057 ± 0.677
LeuGly: 3.528 ± 0.726
LeuHis: 2.117 ± 0.428
LeuIle: 7.291 ± 1.959
LeuLys: 6.585 ± 1.325
LeuLeu: 7.526 ± 1.191
LeuMet: 3.057 ± 0.949
LeuAsn: 4.233 ± 0.98
LeuPro: 5.409 ± 1.1
LeuGln: 2.117 ± 0.379
LeuArg: 7.056 ± 0.871
LeuSer: 7.996 ± 2.333
LeuThr: 8.937 ± 1.353
LeuVal: 6.35 ± 1.427
LeuTrp: 1.646 ± 0.609
LeuTyr: 3.528 ± 0.986
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.117 ± 0.503
MetCys: 0.235 ± 0.142
MetAsp: 0.706 ± 0.354
MetGlu: 1.646 ± 0.624
MetPhe: 1.411 ± 0.851
MetGly: 0.235 ± 0.314
MetHis: 0.235 ± 0.142
MetIle: 1.646 ± 0.499
MetLys: 2.352 ± 1.127
MetLeu: 2.587 ± 1.305
MetMet: 0.941 ± 0.906
MetAsn: 0.47 ± 0.307
MetPro: 1.176 ± 0.353
MetGln: 0.706 ± 0.637
MetArg: 2.117 ± 0.82
MetSer: 2.117 ± 0.634
MetThr: 2.587 ± 0.342
MetVal: 2.587 ± 1.205
MetTrp: 0.47 ± 0.252
MetTyr: 0.941 ± 0.316
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.176 ± 0.535
AsnCys: 0.706 ± 0.92
AsnAsp: 1.411 ± 0.309
AsnGlu: 3.528 ± 0.9
AsnPhe: 1.176 ± 0.287
AsnGly: 0.706 ± 0.334
AsnHis: 0.47 ± 0.284
AsnIle: 2.822 ± 0.768
AsnLys: 1.646 ± 1.019
AsnLeu: 4.233 ± 0.891
AsnMet: 1.176 ± 0.757
AsnAsn: 1.411 ± 0.639
AsnPro: 2.822 ± 0.976
AsnGln: 1.411 ± 0.851
AsnArg: 1.881 ± 0.383
AsnSer: 3.763 ± 0.662
AsnThr: 0.941 ± 0.414
AsnVal: 2.117 ± 1.142
AsnTrp: 1.176 ± 0.461
AsnTyr: 1.411 ± 0.528
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.352 ± 1.399
ProCys: 0.941 ± 0.562
ProAsp: 3.293 ± 1.188
ProGlu: 3.998 ± 0.997
ProPhe: 2.117 ± 0.524
ProGly: 3.763 ± 0.902
ProHis: 1.176 ± 0.468
ProIle: 3.998 ± 1.17
ProLys: 1.881 ± 1.114
ProLeu: 4.704 ± 1.509
ProMet: 1.646 ± 0.513
ProAsn: 1.646 ± 0.442
ProPro: 2.352 ± 0.771
ProGln: 1.646 ± 0.93
ProArg: 2.822 ± 1.042
ProSer: 3.998 ± 0.896
ProThr: 3.998 ± 1.222
ProVal: 3.763 ± 1.342
ProTrp: 0.706 ± 0.409
ProTyr: 1.881 ± 0.721
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.057 ± 1.117
GlnCys: 0.706 ± 0.381
GlnAsp: 1.411 ± 0.633
GlnGlu: 2.352 ± 0.954
GlnPhe: 1.176 ± 0.618
GlnGly: 0.941 ± 0.611
GlnHis: 0.941 ± 0.551
GlnIle: 2.587 ± 1.093
GlnLys: 0.941 ± 0.579
GlnLeu: 4.468 ± 0.646
GlnMet: 1.411 ± 0.601
GlnAsn: 0.706 ± 0.381
GlnPro: 1.646 ± 1.252
GlnGln: 1.881 ± 0.543
GlnArg: 1.411 ± 0.358
GlnSer: 6.35 ± 1.131
GlnThr: 1.411 ± 0.579
GlnVal: 1.411 ± 0.413
GlnTrp: 0.235 ± 0.142
GlnTyr: 1.881 ± 0.412
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.822 ± 0.776
ArgCys: 0.47 ± 0.252
ArgAsp: 2.352 ± 0.961
ArgGlu: 2.587 ± 0.75
ArgPhe: 2.822 ± 0.613
ArgGly: 2.352 ± 1.05
ArgHis: 1.176 ± 0.564
ArgIle: 3.998 ± 0.797
ArgLys: 1.411 ± 0.633
ArgLeu: 6.82 ± 2.227
ArgMet: 2.352 ± 0.891
ArgAsn: 1.881 ± 0.518
ArgPro: 3.057 ± 0.815
ArgGln: 2.587 ± 0.813
ArgArg: 3.528 ± 0.87
ArgSer: 5.174 ± 0.906
ArgThr: 2.352 ± 0.859
ArgVal: 3.763 ± 0.883
ArgTrp: 0.47 ± 0.367
ArgTyr: 1.646 ± 0.692
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.174 ± 1.888
SerCys: 1.411 ± 0.698
SerAsp: 4.233 ± 1.055
SerGlu: 7.526 ± 1.983
SerPhe: 4.468 ± 0.99
SerGly: 4.704 ± 1.674
SerHis: 2.822 ± 1.361
SerIle: 5.644 ± 1.332
SerLys: 5.88 ± 0.82
SerLeu: 10.818 ± 1.189
SerMet: 1.411 ± 0.492
SerAsn: 3.528 ± 0.644
SerPro: 3.528 ± 1.844
SerGln: 2.587 ± 0.488
SerArg: 4.468 ± 1.216
SerSer: 9.172 ± 1.491
SerThr: 4.939 ± 0.834
SerVal: 6.82 ± 1.362
SerTrp: 1.881 ± 1.053
SerTyr: 3.763 ± 0.684
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.998 ± 1.285
ThrCys: 1.881 ± 1.586
ThrAsp: 4.468 ± 0.852
ThrGlu: 2.352 ± 0.722
ThrPhe: 1.646 ± 0.616
ThrGly: 4.233 ± 0.977
ThrHis: 1.411 ± 0.298
ThrIle: 3.998 ± 0.765
ThrLys: 3.763 ± 0.849
ThrLeu: 5.644 ± 1.61
ThrMet: 1.411 ± 0.428
ThrAsn: 3.057 ± 1.053
ThrPro: 3.998 ± 0.976
ThrGln: 2.352 ± 0.891
ThrArg: 3.998 ± 0.537
ThrSer: 6.82 ± 1.105
ThrThr: 3.057 ± 1.127
ThrVal: 4.468 ± 0.947
ThrTrp: 0.941 ± 0.419
ThrTyr: 1.176 ± 0.287
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.939 ± 1.141
ValCys: 0.706 ± 0.381
ValAsp: 3.293 ± 1.114
ValGlu: 3.763 ± 0.892
ValPhe: 2.822 ± 0.936
ValGly: 3.057 ± 1.001
ValHis: 0.47 ± 0.284
ValIle: 5.174 ± 0.989
ValLys: 3.763 ± 0.689
ValLeu: 5.409 ± 1.877
ValMet: 2.587 ± 0.53
ValAsn: 3.057 ± 0.831
ValPro: 3.057 ± 1.128
ValGln: 3.528 ± 1.493
ValArg: 1.176 ± 0.399
ValSer: 5.174 ± 1.464
ValThr: 5.174 ± 0.512
ValVal: 3.763 ± 0.937
ValTrp: 0.235 ± 0.307
ValTyr: 2.587 ± 0.903
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.235 ± 0.306
TrpCys: 0.706 ± 0.443
TrpAsp: 1.176 ± 0.48
TrpGlu: 0.706 ± 0.426
TrpPhe: 0.235 ± 0.142
TrpGly: 1.411 ± 0.413
TrpHis: 0.235 ± 0.142
TrpIle: 0.47 ± 0.613
TrpLys: 1.411 ± 0.809
TrpLeu: 1.646 ± 0.463
TrpMet: 0.0 ± 0.0
TrpAsn: 1.411 ± 0.358
TrpPro: 1.176 ± 0.502
TrpGln: 0.235 ± 0.142
TrpArg: 0.47 ± 0.548
TrpSer: 0.235 ± 0.142
TrpThr: 2.352 ± 0.486
TrpVal: 0.941 ± 0.646
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.235 ± 0.314
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.822 ± 0.855
TyrCys: 1.646 ± 0.692
TyrAsp: 1.646 ± 0.335
TyrGlu: 1.646 ± 1.179
TyrPhe: 1.646 ± 0.55
TyrGly: 2.117 ± 0.519
TyrHis: 1.176 ± 0.461
TyrIle: 2.587 ± 0.91
TyrLys: 1.411 ± 0.585
TyrLeu: 4.233 ± 0.965
TyrMet: 0.47 ± 0.284
TyrAsn: 0.706 ± 0.27
TyrPro: 2.117 ± 1.026
TyrGln: 0.706 ± 0.27
TyrArg: 2.352 ± 0.382
TyrSer: 3.293 ± 0.49
TyrThr: 2.117 ± 0.76
TyrVal: 1.881 ± 0.385
TyrTrp: 0.47 ± 0.29
TyrTyr: 0.235 ± 0.142
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4253 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
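One plausible reading of a protein-level bootstrap is to resample whole proteins with replacement, recompute the dipeptide frequency for each resample, and report the spread of those estimates. The sketch below follows that reading; it is an assumption rather than the documented Proteome-pI procedure, and the function names, the fixed seed, the use of the sample standard deviation, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the example repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the proteome.
toy_proteins = ["MKLLA", "LLK", "AAKLL"]
error = bootstrap_error(toy_proteins, "LL")
```

With only 7 proteins in this proteome, a resample can easily omit or repeat a long protein, which is consistent with the fairly large errors next to many of the values in the table.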

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski