Amino acid dipeptide frequency for Hubei orthoptera virus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
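As a rough illustration of how such a frequency could be computed, here is a minimal Python sketch. It assumes one-letter amino acid codes, overlapping pairs counted within each protein only, and normalization by the total number of pairs; the page does not state the exact counting or normalization used, and the function name and toy sequences are hypothetical.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs (0,1), (1,2), ...; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}  # no sequence is long enough to contain a dipeptide
    # Assumption: normalize by the total number of pairs, then scale to per mille.
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Toy input (not real viral proteins): 7 pairs in total, "LL" occurs twice.
freqs = dipeptide_per_mille(["MKLLA", "ALLK"])
print(round(freqs["LL"], 3))
```

Counting ordered pairs means that, for example, AlaCys and CysAla are tracked separately, which matches the layout of the table.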

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
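The arithmetic behind the expected value, and a possible way to flag cells, can be sketched as follows. This assumes the threshold is simply the uniform expectation; the page does not say exactly which threshold drives the red and blue marking, so the `mark` helper and its labels are hypothetical.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected frequency of each dipeptide, in per mille, if all were equally likely."""
    return 1000.0 / alphabet_size ** 2

def mark(value_per_mille, expected_per_mille):
    """Label a table value relative to the expectation (assumed rule, not the site's)."""
    if value_per_mille > expected_per_mille:
        return "over"    # would be shown in red
    if value_per_mille < expected_per_mille:
        return "under"   # would be shown in blue
    return "as expected"

expected = uniform_expectation_per_mille()   # 20 standard amino acids -> 400 dipeptides
print(expected)                              # 2.5 per mille, i.e. 0.25%
print(mark(13.897, expected))                # LeuLeu value from the table
print(mark(0.296, expected))                 # CysCys value from the table
```

With Xaa included, `uniform_expectation_per_mille(21)` gives 1000/441, roughly 2.27‰.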

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 7.688 ± 3.484 | 1.183 ± 0.434 | 3.844 ± 1.045 | 2.957 ± 1.063 | 0.887 ± 0.244 | 4.731 ± 1.835 | 1.183 ± 0.788 | 3.253 ± 0.589 | 2.365 ± 1.236 | 7.096 ± 1.653 | 1.478 ± 1.154 | 2.957 ± 0.469 | 2.957 ± 1.621 | 2.661 ± 0.97 | 4.435 ± 1.809 | 7.688 ± 1.977 | 5.322 ± 1.399 | 4.435 ± 0.716 | 1.478 ± 0.901 | 2.661 ± 1.073 | 0.0 ± 0.0
Cys | 1.478 ± 0.603 | 0.296 ± 0.172 | 1.183 ± 0.434 | 1.183 ± 0.631 | 0.887 ± 0.244 | 1.774 ± 0.645 | 0.887 ± 0.515 | 0.887 ± 0.244 | 0.591 ± 0.359 | 0.887 ± 0.515 | 0.0 ± 0.0 | 0.591 ± 0.343 | 2.661 ± 0.516 | 0.887 ± 0.515 | 0.0 ± 0.0 | 2.957 ± 0.445 | 1.183 ± 0.903 | 0.296 ± 0.341 | 0.296 ± 0.172 | 1.183 ± 0.346 | 0.0 ± 0.0
Asp | 1.774 ± 0.381 | 0.591 ± 0.242 | 1.478 ± 0.424 | 2.957 ± 0.848 | 1.774 ± 0.596 | 2.07 ± 0.512 | 1.183 ± 1.046 | 3.844 ± 1.339 | 1.478 ± 0.836 | 5.618 ± 1.043 | 1.478 ± 0.263 | 2.07 ± 0.581 | 3.844 ± 1.169 | 2.661 ± 0.724 | 2.365 ± 1.221 | 3.548 ± 0.659 | 3.253 ± 0.589 | 3.253 ± 0.883 | 0.887 ± 0.751 | 2.661 ± 0.909 | 0.0 ± 0.0
Glu | 3.548 ± 1.061 | 0.887 ± 0.385 | 1.774 ± 0.799 | 2.957 ± 0.953 | 1.183 ± 0.346 | 3.844 ± 0.728 | 0.296 ± 0.341 | 2.957 ± 0.59 | 2.661 ± 0.457 | 8.279 ± 1.005 | 0.887 ± 0.244 | 1.183 ± 0.717 | 1.478 ± 0.603 | 0.591 ± 0.359 | 2.661 ± 0.457 | 4.435 ± 1.538 | 2.07 ± 0.477 | 5.027 ± 0.809 | 0.296 ± 0.41 | 0.296 ± 0.341 | 0.0 ± 0.0
Phe | 3.548 ± 0.238 | 0.0 ± 0.0 | 2.365 ± 1.374 | 1.183 ± 0.476 | 1.478 ± 0.603 | 2.07 ± 1.458 | 1.478 ± 0.454 | 1.774 ± 0.799 | 1.183 ± 0.28 | 4.14 ± 0.904 | 0.0 ± 0.0 | 1.183 ± 0.346 | 2.661 ± 0.516 | 1.774 ± 1.502 | 2.661 ± 0.828 | 2.661 ± 0.828 | 3.548 ± 0.659 | 2.07 ± 0.43 | 0.0 ± 0.0 | 0.591 ± 0.441 | 0.0 ± 0.0
Gly | 3.548 ± 1.055 | 3.253 ± 1.119 | 5.322 ± 1.034 | 5.027 ± 0.906 | 3.844 ± 1.137 | 5.322 ± 1.141 | 0.887 ± 0.515 | 3.253 ± 0.826 | 2.957 ± 0.897 | 7.096 ± 1.145 | 1.183 ± 0.436 | 1.478 ± 0.659 | 1.774 ± 0.479 | 2.07 ± 0.809 | 3.548 ± 0.647 | 4.435 ± 2.091 | 2.365 ± 0.726 | 6.505 ± 1.44 | 0.887 ± 0.244 | 1.183 ± 0.788 | 0.0 ± 0.0
His | 2.957 ± 0.593 | 0.296 ± 0.172 | 1.478 ± 0.801 | 1.183 ± 1.366 | 0.296 ± 0.172 | 1.774 ± 0.517 | 0.591 ± 0.242 | 0.296 ± 0.172 | 0.296 ± 0.172 | 2.957 ± 0.81 | 1.183 ± 0.54 | 0.591 ± 0.343 | 1.774 ± 0.345 | 1.183 ± 0.631 | 1.183 ± 0.28 | 2.957 ± 0.204 | 2.07 ± 0.649 | 0.887 ± 0.385 | 0.0 ± 0.0 | 0.296 ± 0.172 | 0.0 ± 0.0
Ile | 3.844 ± 1.155 | 1.774 ± 1.132 | 2.661 ± 0.97 | 2.661 ± 1.154 | 0.887 ± 0.244 | 4.435 ± 0.46 | 1.478 ± 0.263 | 3.253 ± 0.739 | 2.661 ± 0.789 | 5.914 ± 1.24 | 1.183 ± 0.39 | 1.183 ± 0.346 | 2.661 ± 0.828 | 2.365 ± 0.31 | 1.774 ± 0.766 | 7.096 ± 1.077 | 3.548 ± 1.037 | 2.957 ± 0.624 | 0.887 ± 0.489 | 2.957 ± 0.848 | 0.0 ± 0.0
Lys | 3.253 ± 0.989 | 0.296 ± 0.172 | 2.957 ± 0.445 | 1.478 ± 0.488 | 2.661 ± 0.657 | 2.661 ± 0.856 | 1.774 ± 0.915 | 2.661 ± 0.603 | 5.322 ± 1.407 | 3.548 ± 0.323 | 0.296 ± 0.545 | 1.478 ± 0.709 | 1.774 ± 0.835 | 0.591 ± 0.359 | 2.661 ± 0.885 | 2.365 ± 0.711 | 2.661 ± 0.577 | 2.957 ± 1.295 | 1.183 ± 0.346 | 2.365 ± 1.102 | 0.0 ± 0.0
Leu | 7.983 ± 2.555 | 2.365 ± 0.56 | 4.731 ± 1.058 | 5.027 ± 1.041 | 3.253 ± 1.48 | 8.87 ± 1.048 | 3.253 ± 0.555 | 7.983 ± 1.482 | 4.731 ± 1.619 | 13.897 ± 1.314 | 2.957 ± 0.984 | 4.14 ± 0.77 | 7.392 ± 0.81 | 3.253 ± 0.739 | 8.279 ± 0.842 | 7.688 ± 0.663 | 7.983 ± 1.766 | 7.096 ± 1.425 | 1.478 ± 0.488 | 3.253 ± 0.732 | 0.0 ± 0.0
Met | 2.365 ± 0.468 | 0.296 ± 0.172 | 0.887 ± 0.385 | 1.478 ± 0.446 | 0.887 ± 0.385 | 0.591 ± 0.62 | 0.0 ± 0.0 | 1.183 ± 0.28 | 1.478 ± 0.392 | 1.478 ± 0.603 | 0.887 ± 0.383 | 0.591 ± 0.242 | 1.183 ± 0.484 | 0.296 ± 0.172 | 1.478 ± 0.488 | 1.478 ± 0.424 | 1.774 ± 0.517 | 1.183 ± 0.346 | 0.591 ± 0.343 | 0.591 ± 0.343 | 0.0 ± 0.0
Asn | 2.661 ± 0.174 | 0.296 ± 0.172 | 0.591 ± 0.62 | 0.591 ± 0.343 | 1.478 ± 1.154 | 0.296 ± 0.484 | 0.591 ± 0.343 | 2.661 ± 1.142 | 2.365 ± 0.574 | 4.731 ± 0.286 | 1.183 ± 0.756 | 0.591 ± 0.683 | 2.07 ± 1.371 | 1.183 ± 0.28 | 1.183 ± 0.54 | 3.253 ± 0.236 | 2.365 ± 0.691 | 2.365 ± 1.016 | 0.296 ± 0.341 | 0.887 ± 0.515 | 0.0 ± 0.0
Pro | 2.957 ± 1.008 | 2.365 ± 0.31 | 3.253 ± 1.109 | 2.365 ± 0.952 | 0.296 ± 0.172 | 2.661 ± 0.902 | 1.183 ± 0.476 | 2.07 ± 0.58 | 3.548 ± 0.862 | 6.209 ± 1.034 | 0.296 ± 0.172 | 1.478 ± 0.392 | 6.505 ± 2.098 | 2.365 ± 0.359 | 2.661 ± 0.724 | 7.096 ± 1.657 | 5.027 ± 1.376 | 2.957 ± 0.687 | 1.183 ± 0.346 | 1.774 ± 0.345 | 0.0 ± 0.0
Gln | 2.365 ± 0.56 | 0.0 ± 0.0 | 1.478 ± 0.263 | 3.253 ± 0.559 | 1.478 ± 0.454 | 2.957 ± 0.784 | 0.887 ± 0.244 | 0.887 ± 0.876 | 1.478 ± 1.157 | 5.027 ± 1.331 | 0.591 ± 0.343 | 0.296 ± 0.172 | 1.478 ± 0.446 | 0.0 ± 0.0 | 1.183 ± 0.476 | 2.957 ± 0.469 | 1.478 ± 0.492 | 4.731 ± 2.332 | 0.887 ± 0.244 | 1.183 ± 0.756 | 0.0 ± 0.0
Arg | 3.548 ± 1.645 | 0.591 ± 0.343 | 2.957 ± 0.992 | 2.365 ± 0.727 | 2.957 ± 0.506 | 5.027 ± 1.075 | 0.887 ± 0.515 | 2.957 ± 0.795 | 1.478 ± 0.454 | 6.505 ± 0.668 | 1.478 ± 0.603 | 1.478 ± 0.659 | 2.07 ± 0.477 | 2.365 ± 1.114 | 6.209 ± 1.528 | 5.027 ± 1.083 | 2.957 ± 0.81 | 2.957 ± 0.59 | 0.591 ± 0.343 | 1.774 ± 0.488 | 0.0 ± 0.0
Ser | 6.801 ± 1.257 | 2.365 ± 0.968 | 4.435 ± 1.957 | 2.957 ± 0.59 | 4.435 ± 0.864 | 5.618 ± 2.733 | 2.957 ± 0.632 | 5.027 ± 1.527 | 2.957 ± 1.264 | 11.827 ± 2.483 | 1.478 ± 0.392 | 4.731 ± 1.894 | 4.435 ± 0.773 | 3.548 ± 0.919 | 3.548 ± 0.707 | 7.096 ± 2.606 | 5.322 ± 1.154 | 3.548 ± 0.891 | 1.774 ± 0.645 | 2.661 ± 0.724 | 0.0 ± 0.0
Thr | 2.661 ± 1.431 | 1.183 ± 0.484 | 3.548 ± 0.659 | 3.253 ± 1.356 | 2.661 ± 0.982 | 2.661 ± 1.252 | 1.478 ± 0.454 | 5.618 ± 1.042 | 2.365 ± 0.832 | 9.166 ± 1.309 | 0.591 ± 0.343 | 0.887 ± 0.515 | 3.844 ± 1.465 | 2.365 ± 1.063 | 3.844 ± 1.305 | 6.209 ± 0.608 | 5.027 ± 1.142 | 5.618 ± 1.677 | 1.774 ± 1.03 | 2.365 ± 0.56 | 0.0 ± 0.0
Val | 4.435 ± 1.192 | 0.887 ± 0.515 | 2.07 ± 1.011 | 2.957 ± 0.59 | 2.661 ± 0.719 | 4.435 ± 0.545 | 1.183 ± 0.28 | 3.844 ± 0.815 | 2.957 ± 0.794 | 7.392 ± 2.004 | 1.774 ± 0.791 | 2.957 ± 1.383 | 4.14 ± 1.049 | 2.07 ± 0.665 | 3.548 ± 0.585 | 5.914 ± 2.653 | 5.914 ± 1.247 | 4.14 ± 0.955 | 0.887 ± 0.383 | 1.478 ± 0.63 | 0.0 ± 0.0
Trp | 2.07 ± 0.848 | 0.0 ± 0.0 | 0.296 ± 0.341 | 0.887 ± 0.385 | 0.887 ± 0.385 | 1.774 ± 0.479 | 0.296 ± 0.172 | 0.591 ± 0.82 | 0.296 ± 0.172 | 1.183 ± 0.28 | 0.296 ± 0.172 | 0.296 ± 0.172 | 1.478 ± 0.859 | 0.887 ± 0.244 | 1.183 ± 0.484 | 0.591 ± 0.683 | 1.478 ± 0.488 | 0.887 ± 0.383 | 0.0 ± 0.0 | 0.296 ± 0.172 | 0.0 ± 0.0
Tyr | 1.774 ± 1.076 | 1.183 ± 0.476 | 1.478 ± 0.392 | 0.591 ± 0.242 | 1.478 ± 0.492 | 2.365 ± 0.629 | 1.774 ± 0.645 | 1.183 ± 0.28 | 2.07 ± 0.581 | 2.661 ± 0.457 | 1.183 ± 0.484 | 1.478 ± 0.603 | 2.07 ± 0.91 | 1.478 ± 0.859 | 1.774 ± 0.345 | 2.07 ± 0.884 | 1.774 ± 1.249 | 1.774 ± 0.345 | 0.296 ± 0.41 | 1.478 ± 0.488 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 4 proteins (3383 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
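A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the sample standard deviation of those estimates is reported as the error. The exact estimator behind the table is not described on the page; the function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics

def per_mille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, counting overlapping pairs per protein."""
    pairs = sum(len(p) - 1 for p in proteins if len(p) > 1)
    hits = sum(p[i:i + 2] == dipeptide for p in proteins for i in range(len(p) - 1))
    return 1000.0 * hits / pairs if pairs else 0.0

def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample, dipeptide))
    return statistics.stdev(estimates)

# Toy proteome of 4 short sequences (not the real viral proteins).
toy = ["MKLLA", "ALLK", "GGLLG", "MKTA"]
print(per_mille(toy, "LL"), bootstrap_error(toy, "LL"))
```

A dipeptide that never occurs has the same estimate (zero) in every resample, so its error is also zero, which is consistent with the 0.0 ± 0.0 cells in the table.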

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski