Amino acid dipepetide frequency for Hubei odonate virus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.054AlaAla: 4.054 ± 0.88
0.541AlaCys: 0.541 ± 0.291
1.892AlaAsp: 1.892 ± 0.878
3.514AlaGlu: 3.514 ± 0.786
2.973AlaPhe: 2.973 ± 0.713
2.973AlaGly: 2.973 ± 0.739
1.622AlaHis: 1.622 ± 0.631
2.432AlaIle: 2.432 ± 1.19
2.973AlaLys: 2.973 ± 0.603
3.514AlaLeu: 3.514 ± 0.063
0.27AlaMet: 0.27 ± 0.146
1.622AlaAsn: 1.622 ± 0.352
3.514AlaPro: 3.514 ± 1.696
2.162AlaGln: 2.162 ± 0.457
2.703AlaArg: 2.703 ± 0.68
3.243AlaSer: 3.243 ± 0.941
2.432AlaThr: 2.432 ± 1.241
1.622AlaVal: 1.622 ± 0.352
0.541AlaTrp: 0.541 ± 0.291
1.351AlaTyr: 1.351 ± 0.508
0.0AlaXaa: 0.0 ± 0.0
Cys
0.541CysAla: 0.541 ± 0.229
0.27CysCys: 0.27 ± 0.146
1.081CysAsp: 1.081 ± 0.582
0.811CysGlu: 0.811 ± 0.437
0.541CysPhe: 0.541 ± 0.291
1.081CysGly: 1.081 ± 0.582
0.27CysHis: 0.27 ± 0.146
0.27CysIle: 0.27 ± 0.146
1.081CysLys: 1.081 ± 0.582
2.973CysLeu: 2.973 ± 0.808
0.0CysMet: 0.0 ± 0.0
0.811CysAsn: 0.811 ± 0.437
1.351CysPro: 1.351 ± 0.34
0.27CysGln: 0.27 ± 0.34
1.081CysArg: 1.081 ± 0.582
1.081CysSer: 1.081 ± 0.582
0.541CysThr: 0.541 ± 0.229
1.081CysVal: 1.081 ± 0.228
0.27CysTrp: 0.27 ± 0.146
0.27CysTyr: 0.27 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
3.784AspAla: 3.784 ± 0.786
0.541AspCys: 0.541 ± 0.291
3.784AspAsp: 3.784 ± 1.611
2.973AspGlu: 2.973 ± 0.279
1.081AspPhe: 1.081 ± 0.228
2.973AspGly: 2.973 ± 0.739
1.081AspHis: 1.081 ± 0.228
3.784AspIle: 3.784 ± 1.537
2.162AspLys: 2.162 ± 0.828
4.054AspLeu: 4.054 ± 0.764
1.081AspMet: 1.081 ± 0.9
3.243AspAsn: 3.243 ± 0.941
4.324AspPro: 4.324 ± 0.671
2.703AspGln: 2.703 ± 1.345
2.432AspArg: 2.432 ± 1.936
5.405AspSer: 5.405 ± 1.767
2.703AspThr: 2.703 ± 0.444
5.676AspVal: 5.676 ± 1.239
1.351AspTrp: 1.351 ± 0.728
2.162AspTyr: 2.162 ± 0.439
0.0AspXaa: 0.0 ± 0.0
Glu
1.622GluAla: 1.622 ± 0.44
1.622GluCys: 1.622 ± 0.471
3.784GluAsp: 3.784 ± 0.786
5.135GluGlu: 5.135 ± 1.925
2.973GluPhe: 2.973 ± 0.3
2.973GluGly: 2.973 ± 0.279
0.27GluHis: 0.27 ± 0.146
5.405GluIle: 5.405 ± 2.189
6.757GluLys: 6.757 ± 1.313
5.405GluLeu: 5.405 ± 0.323
1.622GluMet: 1.622 ± 0.631
4.324GluAsn: 4.324 ± 1.239
3.514GluPro: 3.514 ± 0.454
1.081GluGln: 1.081 ± 0.582
4.595GluArg: 4.595 ± 0.182
4.865GluSer: 4.865 ± 0.31
3.784GluThr: 3.784 ± 0.999
4.054GluVal: 4.054 ± 0.88
0.811GluTrp: 0.811 ± 0.437
3.243GluTyr: 3.243 ± 0.981
0.0GluXaa: 0.0 ± 0.0
Phe
1.081PheAla: 1.081 ± 0.228
0.541PheCys: 0.541 ± 0.291
2.973PheAsp: 2.973 ± 0.3
3.243PheGlu: 3.243 ± 0.161
1.081PhePhe: 1.081 ± 0.582
4.865PheGly: 4.865 ± 1.658
0.541PheHis: 0.541 ± 0.291
2.432PheIle: 2.432 ± 0.506
2.973PheLys: 2.973 ± 0.713
3.243PheLeu: 3.243 ± 1.747
0.541PheMet: 0.541 ± 0.291
1.622PheAsn: 1.622 ± 1.244
2.432PhePro: 2.432 ± 0.829
1.351PheGln: 1.351 ± 0.34
2.432PheArg: 2.432 ± 0.506
3.784PheSer: 3.784 ± 1.6
2.162PheThr: 2.162 ± 0.749
1.892PheVal: 1.892 ± 1.019
0.811PheTrp: 0.811 ± 0.176
1.351PheTyr: 1.351 ± 0.728
0.0PheXaa: 0.0 ± 0.0
Gly
1.622GlyAla: 1.622 ± 0.471
1.081GlyCys: 1.081 ± 0.582
4.324GlyAsp: 4.324 ± 1.462
5.405GlyGlu: 5.405 ± 1.767
2.432GlyPhe: 2.432 ± 0.829
4.865GlyGly: 4.865 ± 1.673
0.0GlyHis: 0.0 ± 0.0
4.595GlyIle: 4.595 ± 1.801
2.973GlyLys: 2.973 ± 0.3
6.757GlyLeu: 6.757 ± 0.928
1.351GlyMet: 1.351 ± 0.728
2.703GlyAsn: 2.703 ± 1.345
2.973GlyPro: 2.973 ± 1.056
1.351GlyGln: 1.351 ± 0.34
2.703GlyArg: 2.703 ± 1.053
4.324GlySer: 4.324 ± 2.695
2.973GlyThr: 2.973 ± 0.3
3.514GlyVal: 3.514 ± 0.496
0.0GlyTrp: 0.0 ± 0.0
1.892GlyTyr: 1.892 ± 0.381
0.0GlyXaa: 0.0 ± 0.0
His
0.541HisAla: 0.541 ± 0.291
0.541HisCys: 0.541 ± 0.291
0.27HisAsp: 0.27 ± 0.146
0.541HisGlu: 0.541 ± 0.659
1.351HisPhe: 1.351 ± 0.728
1.351HisGly: 1.351 ± 0.728
0.27HisHis: 0.27 ± 0.146
0.811HisIle: 0.811 ± 0.437
0.811HisLys: 0.811 ± 0.176
2.162HisLeu: 2.162 ± 0.548
0.27HisMet: 0.27 ± 0.146
0.27HisAsn: 0.27 ± 0.146
1.081HisPro: 1.081 ± 0.582
1.081HisGln: 1.081 ± 0.604
1.622HisArg: 1.622 ± 0.471
0.541HisSer: 0.541 ± 0.291
0.27HisThr: 0.27 ± 0.146
1.351HisVal: 1.351 ± 0.381
0.0HisTrp: 0.0 ± 0.0
0.27HisTyr: 0.27 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
4.054IleAla: 4.054 ± 1.511
0.27IleCys: 0.27 ± 0.146
4.865IleAsp: 4.865 ± 1.607
4.324IleGlu: 4.324 ± 1.898
0.811IlePhe: 0.811 ± 0.176
3.243IleGly: 3.243 ± 1.062
1.081IleHis: 1.081 ± 0.582
4.324IleIle: 4.324 ± 1.373
5.405IleLys: 5.405 ± 1.254
6.486IleLeu: 6.486 ± 0.547
0.541IleMet: 0.541 ± 0.229
5.676IleAsn: 5.676 ± 2.695
3.243IlePro: 3.243 ± 0.983
5.405IleGln: 5.405 ± 3.681
4.324IleArg: 4.324 ± 0.439
4.595IleSer: 4.595 ± 0.866
4.324IleThr: 4.324 ± 0.439
2.973IleVal: 2.973 ± 0.279
1.622IleTrp: 1.622 ± 0.471
1.081IleTyr: 1.081 ± 0.228
0.0IleXaa: 0.0 ± 0.0
Lys
2.162LysAla: 2.162 ± 0.378
0.811LysCys: 0.811 ± 0.437
3.243LysAsp: 3.243 ± 1.322
5.135LysGlu: 5.135 ± 0.449
1.892LysPhe: 1.892 ± 0.381
2.432LysGly: 2.432 ± 1.19
0.541LysHis: 0.541 ± 0.291
6.757LysIle: 6.757 ± 1.313
4.865LysLys: 4.865 ± 2.12
6.757LysLeu: 6.757 ± 1.026
1.081LysMet: 1.081 ± 0.457
2.703LysAsn: 2.703 ± 1.456
4.324LysPro: 4.324 ± 0.913
2.162LysGln: 2.162 ± 1.165
2.432LysArg: 2.432 ± 0.561
4.595LysSer: 4.595 ± 2.208
5.405LysThr: 5.405 ± 1.473
3.784LysVal: 3.784 ± 0.156
0.541LysTrp: 0.541 ± 0.291
1.081LysTyr: 1.081 ± 0.228
0.0LysXaa: 0.0 ± 0.0
Leu
3.514LeuAla: 3.514 ± 0.063
0.811LeuCys: 0.811 ± 0.437
6.486LeuAsp: 6.486 ± 0.547
6.757LeuGlu: 6.757 ± 1.8
3.243LeuPhe: 3.243 ± 1.322
5.676LeuGly: 5.676 ± 2.693
1.351LeuHis: 1.351 ± 0.34
6.486LeuIle: 6.486 ± 1.408
4.595LeuLys: 4.595 ± 1.012
7.838LeuLeu: 7.838 ± 1.513
2.162LeuMet: 2.162 ± 0.565
6.216LeuAsn: 6.216 ± 1.815
6.757LeuPro: 6.757 ± 0.416
3.514LeuGln: 3.514 ± 0.927
5.946LeuArg: 5.946 ± 1.205
10.811LeuSer: 10.811 ± 3.205
2.432LeuThr: 2.432 ± 0.561
5.405LeuVal: 5.405 ± 0.784
0.811LeuTrp: 0.811 ± 0.437
2.162LeuTyr: 2.162 ± 1.165
0.0LeuXaa: 0.0 ± 0.0
Met
0.811MetAla: 0.811 ± 0.437
0.541MetCys: 0.541 ± 0.291
1.351MetAsp: 1.351 ± 0.787
1.081MetGlu: 1.081 ± 0.591
0.811MetPhe: 0.811 ± 0.561
0.811MetGly: 0.811 ± 0.561
0.811MetHis: 0.811 ± 0.437
1.351MetIle: 1.351 ± 1.239
0.27MetLys: 0.27 ± 0.146
0.541MetLeu: 0.541 ± 0.229
0.0MetMet: 0.0 ± 0.0
1.351MetAsn: 1.351 ± 0.787
1.081MetPro: 1.081 ± 0.228
0.27MetGln: 0.27 ± 0.34
1.622MetArg: 1.622 ± 0.44
1.081MetSer: 1.081 ± 0.582
1.622MetThr: 1.622 ± 0.352
1.081MetVal: 1.081 ± 0.457
0.0MetTrp: 0.0 ± 0.0
0.811MetTyr: 0.811 ± 0.176
0.0MetXaa: 0.0 ± 0.0
Asn
1.892AsnAla: 1.892 ± 0.415
0.27AsnCys: 0.27 ± 0.146
3.784AsnAsp: 3.784 ± 1.604
3.243AsnGlu: 3.243 ± 0.375
2.703AsnPhe: 2.703 ± 1.034
1.081AsnGly: 1.081 ± 1.314
1.622AsnHis: 1.622 ± 0.471
2.162AsnIle: 2.162 ± 0.548
3.514AsnLys: 3.514 ± 0.965
8.919AsnLeu: 8.919 ± 3.283
0.27AsnMet: 0.27 ± 0.455
3.514AsnAsn: 3.514 ± 0.843
3.784AsnPro: 3.784 ± 0.156
2.432AsnGln: 2.432 ± 1.822
2.973AsnArg: 2.973 ± 0.603
3.514AsnSer: 3.514 ± 2.231
2.703AsnThr: 2.703 ± 2.033
3.784AsnVal: 3.784 ± 0.83
0.811AsnTrp: 0.811 ± 0.437
1.081AsnTyr: 1.081 ± 0.228
0.0AsnXaa: 0.0 ± 0.0
Pro
2.162ProAla: 2.162 ± 0.457
0.811ProCys: 0.811 ± 0.437
2.162ProAsp: 2.162 ± 0.457
5.135ProGlu: 5.135 ± 1.086
2.432ProPhe: 2.432 ± 0.829
3.514ProGly: 3.514 ± 2.134
0.811ProHis: 0.811 ± 0.717
4.054ProIle: 4.054 ± 0.717
3.243ProLys: 3.243 ± 1.322
4.865ProLeu: 4.865 ± 1.121
0.27ProMet: 0.27 ± 0.34
2.432ProAsn: 2.432 ± 1.339
2.973ProPro: 2.973 ± 1.056
2.432ProGln: 2.432 ± 0.561
2.973ProArg: 2.973 ± 0.719
5.676ProSer: 5.676 ± 1.239
4.324ProThr: 4.324 ± 0.992
6.486ProVal: 6.486 ± 0.333
0.541ProTrp: 0.541 ± 0.229
1.622ProTyr: 1.622 ± 0.44
0.0ProXaa: 0.0 ± 0.0
Gln
1.351GlnAla: 1.351 ± 0.787
1.351GlnCys: 1.351 ± 0.34
1.622GlnAsp: 1.622 ± 1.433
3.243GlnGlu: 3.243 ± 0.161
1.622GlnPhe: 1.622 ± 0.686
1.892GlnGly: 1.892 ± 0.603
0.27GlnHis: 0.27 ± 0.146
3.243GlnIle: 3.243 ± 0.981
2.162GlnLys: 2.162 ± 0.548
3.514GlnLeu: 3.514 ± 0.454
0.0GlnMet: 0.0 ± 0.0
0.811GlnAsn: 0.811 ± 1.39
0.811GlnPro: 0.811 ± 0.437
1.622GlnGln: 1.622 ± 0.44
3.784GlnArg: 3.784 ± 0.627
3.243GlnSer: 3.243 ± 0.983
1.622GlnThr: 1.622 ± 2.069
2.162GlnVal: 2.162 ± 0.457
0.541GlnTrp: 0.541 ± 0.291
1.622GlnTyr: 1.622 ± 0.471
0.0GlnXaa: 0.0 ± 0.0
Arg
4.595ArgAla: 4.595 ± 1.639
0.27ArgCys: 0.27 ± 0.146
2.973ArgAsp: 2.973 ± 1.178
2.973ArgGlu: 2.973 ± 1.006
3.514ArgPhe: 3.514 ± 1.893
2.703ArgGly: 2.703 ± 0.68
0.541ArgHis: 0.541 ± 0.291
5.405ArgIle: 5.405 ± 0.323
5.405ArgLys: 5.405 ± 1.649
5.135ArgLeu: 5.135 ± 1.33
2.703ArgMet: 2.703 ± 1.143
2.703ArgAsn: 2.703 ± 0.762
4.054ArgPro: 4.054 ± 2.065
0.811ArgGln: 0.811 ± 0.176
2.432ArgArg: 2.432 ± 0.281
3.243ArgSer: 3.243 ± 0.983
2.703ArgThr: 2.703 ± 0.239
4.865ArgVal: 4.865 ± 0.562
0.811ArgTrp: 0.811 ± 0.561
1.622ArgTyr: 1.622 ± 0.686
0.0ArgXaa: 0.0 ± 0.0
Ser
2.162SerAla: 2.162 ± 0.733
1.081SerCys: 1.081 ± 0.457
5.405SerAsp: 5.405 ± 1.254
3.784SerGlu: 3.784 ± 0.892
3.514SerPhe: 3.514 ± 0.454
5.135SerGly: 5.135 ± 1.547
1.081SerHis: 1.081 ± 0.582
5.676SerIle: 5.676 ± 2.221
4.054SerLys: 4.054 ± 0.296
7.568SerLeu: 7.568 ± 0.774
0.541SerMet: 0.541 ± 0.205
5.405SerAsn: 5.405 ± 3.567
4.324SerPro: 4.324 ± 1.466
3.514SerGln: 3.514 ± 0.063
4.054SerArg: 4.054 ± 0.828
9.189SerSer: 9.189 ± 1.172
5.946SerThr: 5.946 ± 1.025
4.865SerVal: 4.865 ± 0.31
1.351SerTrp: 1.351 ± 0.728
4.054SerTyr: 4.054 ± 0.842
0.0SerXaa: 0.0 ± 0.0
Thr
2.973ThrAla: 2.973 ± 0.739
1.622ThrCys: 1.622 ± 0.874
2.432ThrAsp: 2.432 ± 0.281
3.784ThrGlu: 3.784 ± 2.098
2.703ThrPhe: 2.703 ± 1.214
4.324ThrGly: 4.324 ± 0.913
2.162ThrHis: 2.162 ± 0.457
3.784ThrIle: 3.784 ± 0.788
2.432ThrLys: 2.432 ± 1.31
4.054ThrLeu: 4.054 ± 0.842
1.622ThrMet: 1.622 ± 0.352
3.784ThrAsn: 3.784 ± 1.207
2.432ThrPro: 2.432 ± 0.281
1.351ThrGln: 1.351 ± 0.381
2.432ThrArg: 2.432 ± 0.588
5.135ThrSer: 5.135 ± 1.279
5.946ThrThr: 5.946 ± 0.878
3.243ThrVal: 3.243 ± 1.062
0.541ThrTrp: 0.541 ± 0.291
1.622ThrTyr: 1.622 ± 0.658
0.0ThrXaa: 0.0 ± 0.0
Val
4.054ValAla: 4.054 ± 1.02
1.351ValCys: 1.351 ± 0.381
3.784ValAsp: 3.784 ± 0.788
4.054ValGlu: 4.054 ± 1.235
2.703ValPhe: 2.703 ± 0.444
2.973ValGly: 2.973 ± 0.3
0.541ValHis: 0.541 ± 0.291
3.243ValIle: 3.243 ± 0.596
3.784ValLys: 3.784 ± 0.83
4.324ValLeu: 4.324 ± 1.096
1.622ValMet: 1.622 ± 0.352
3.243ValAsn: 3.243 ± 0.881
4.595ValPro: 4.595 ± 1.639
1.622ValGln: 1.622 ± 0.686
5.676ValArg: 5.676 ± 1.142
5.676ValSer: 5.676 ± 1.253
2.973ValThr: 2.973 ± 0.603
3.784ValVal: 3.784 ± 0.899
0.811ValTrp: 0.811 ± 0.176
1.892ValTyr: 1.892 ± 0.608
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.541TrpCys: 0.541 ± 0.291
0.541TrpAsp: 0.541 ± 0.291
1.622TrpGlu: 1.622 ± 0.874
1.351TrpPhe: 1.351 ± 0.728
0.811TrpGly: 0.811 ± 0.437
0.0TrpHis: 0.0 ± 0.0
0.541TrpIle: 0.541 ± 0.229
1.081TrpLys: 1.081 ± 0.457
0.811TrpLeu: 0.811 ± 0.437
0.541TrpMet: 0.541 ± 0.291
0.811TrpAsn: 0.811 ± 0.437
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.541TrpArg: 0.541 ± 0.291
1.351TrpSer: 1.351 ± 0.728
1.081TrpThr: 1.081 ± 0.582
0.811TrpVal: 0.811 ± 1.021
0.0TrpTrp: 0.0 ± 0.0
0.27TrpTyr: 0.27 ± 0.146
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.973TyrAla: 2.973 ± 0.3
0.811TyrCys: 0.811 ± 0.176
0.541TyrAsp: 0.541 ± 0.291
1.081TyrGlu: 1.081 ± 0.582
1.622TyrPhe: 1.622 ± 0.658
2.162TyrGly: 2.162 ± 1.165
0.541TyrHis: 0.541 ± 0.291
1.892TyrIle: 1.892 ± 0.878
2.162TyrLys: 2.162 ± 0.378
4.054TyrLeu: 4.054 ± 1.705
0.541TyrMet: 0.541 ± 0.229
1.081TyrAsn: 1.081 ± 0.228
1.351TyrPro: 1.351 ± 0.728
1.351TyrGln: 1.351 ± 0.787
2.703TyrArg: 2.703 ± 0.762
1.351TyrSer: 1.351 ± 0.508
2.432TyrThr: 2.432 ± 1.19
0.27TyrVal: 0.27 ± 0.146
0.541TyrTrp: 0.541 ± 0.291
2.162TyrTyr: 2.162 ± 0.828
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3701 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski