Amino acid dipepetide frequency for Hubei hepe-like virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.771AlaAla: 5.771 ± 1.745
2.061AlaCys: 2.061 ± 1.119
1.649AlaAsp: 1.649 ± 0.914
1.649AlaGlu: 1.649 ± 0.462
4.946AlaPhe: 4.946 ± 1.41
1.237AlaGly: 1.237 ± 0.38
1.237AlaHis: 1.237 ± 0.38
1.649AlaIle: 1.649 ± 2.268
1.649AlaLys: 1.649 ± 0.914
7.007AlaLeu: 7.007 ± 2.177
0.824AlaMet: 0.824 ± 0.424
2.473AlaAsn: 2.473 ± 0.816
4.122AlaPro: 4.122 ± 2.948
2.885AlaGln: 2.885 ± 1.227
5.771AlaArg: 5.771 ± 1.008
5.771AlaSer: 5.771 ± 4.064
3.298AlaThr: 3.298 ± 1.365
2.885AlaVal: 2.885 ± 1.024
1.649AlaTrp: 1.649 ± 1.03
3.298AlaTyr: 3.298 ± 1.365
0.0AlaXaa: 0.0 ± 0.0
Cys
2.061CysAla: 2.061 ± 0.622
0.0CysCys: 0.0 ± 0.0
0.824CysAsp: 0.824 ± 0.457
0.824CysGlu: 0.824 ± 0.457
2.473CysPhe: 2.473 ± 0.899
0.412CysGly: 0.412 ± 0.228
0.824CysHis: 0.824 ± 0.457
1.649CysIle: 1.649 ± 0.914
0.824CysLys: 0.824 ± 0.457
3.298CysLeu: 3.298 ± 1.696
0.0CysMet: 0.0 ± 0.0
1.237CysAsn: 1.237 ± 0.38
1.237CysPro: 1.237 ± 0.38
1.649CysGln: 1.649 ± 0.931
0.412CysArg: 0.412 ± 0.565
0.824CysSer: 0.824 ± 0.424
0.0CysThr: 0.0 ± 0.0
0.412CysVal: 0.412 ± 0.565
0.0CysTrp: 0.0 ± 0.0
0.824CysTyr: 0.824 ± 0.457
0.0CysXaa: 0.0 ± 0.0
Asp
0.824AspAla: 0.824 ± 0.457
1.649AspCys: 1.649 ± 0.848
3.71AspAsp: 3.71 ± 2.056
1.237AspGlu: 1.237 ± 0.685
4.534AspPhe: 4.534 ± 1.887
0.824AspGly: 0.824 ± 0.457
2.473AspHis: 2.473 ± 0.816
4.534AspIle: 4.534 ± 2.512
2.061AspLys: 2.061 ± 1.142
3.71AspLeu: 3.71 ± 1.458
0.0AspMet: 0.0 ± 0.469
2.061AspAsn: 2.061 ± 1.142
5.771AspPro: 5.771 ± 1.692
0.824AspGln: 0.824 ± 0.424
1.649AspArg: 1.649 ± 0.914
4.534AspSer: 4.534 ± 1.903
2.885AspThr: 2.885 ± 0.815
3.298AspVal: 3.298 ± 1.827
0.412AspTrp: 0.412 ± 0.228
2.885AspTyr: 2.885 ± 1.024
0.0AspXaa: 0.0 ± 0.0
Glu
3.298GluAla: 3.298 ± 1.141
1.237GluCys: 1.237 ± 0.685
2.885GluAsp: 2.885 ± 1.599
1.237GluGlu: 1.237 ± 0.685
2.061GluPhe: 2.061 ± 0.622
1.237GluGly: 1.237 ± 0.685
2.885GluHis: 2.885 ± 0.815
2.061GluIle: 2.061 ± 1.142
3.298GluLys: 3.298 ± 1.239
4.534GluLeu: 4.534 ± 1.903
0.412GluMet: 0.412 ± 0.228
2.061GluAsn: 2.061 ± 2.182
1.237GluPro: 1.237 ± 0.685
1.649GluGln: 1.649 ± 0.914
0.824GluArg: 0.824 ± 0.424
1.237GluSer: 1.237 ± 0.685
2.885GluThr: 2.885 ± 2.032
1.649GluVal: 1.649 ± 0.462
0.824GluTrp: 0.824 ± 1.134
0.824GluTyr: 0.824 ± 0.457
0.0GluXaa: 0.0 ± 0.0
Phe
5.771PheAla: 5.771 ± 1.107
2.061PheCys: 2.061 ± 0.772
5.771PheAsp: 5.771 ± 2.047
1.649PheGlu: 1.649 ± 0.914
5.771PhePhe: 5.771 ± 3.535
0.824PheGly: 0.824 ± 0.457
1.649PheHis: 1.649 ± 0.914
4.122PheIle: 4.122 ± 0.279
2.473PheLys: 2.473 ± 0.816
6.595PheLeu: 6.595 ± 2.806
0.824PheMet: 0.824 ± 1.178
3.298PheAsn: 3.298 ± 1.696
5.359PhePro: 5.359 ± 1.53
1.649PheGln: 1.649 ± 0.462
2.885PheArg: 2.885 ± 3.278
6.595PheSer: 6.595 ± 1.457
5.359PheThr: 5.359 ± 1.912
2.885PheVal: 2.885 ± 3.313
0.0PheTrp: 0.0 ± 0.0
2.061PheTyr: 2.061 ± 1.051
0.0PheXaa: 0.0 ± 0.0
Gly
2.061GlyAla: 2.061 ± 0.785
0.0GlyCys: 0.0 ± 0.0
2.473GlyAsp: 2.473 ± 1.37
2.061GlyGlu: 2.061 ± 0.622
2.885GlyPhe: 2.885 ± 0.652
0.824GlyGly: 0.824 ± 0.457
0.824GlyHis: 0.824 ± 1.134
4.946GlyIle: 4.946 ± 1.387
1.237GlyLys: 1.237 ± 0.38
2.885GlyLeu: 2.885 ± 1.024
0.412GlyMet: 0.412 ± 0.228
2.061GlyAsn: 2.061 ± 1.051
1.237GlyPro: 1.237 ± 0.685
1.649GlyGln: 1.649 ± 0.914
4.122GlyArg: 4.122 ± 4.285
4.122GlySer: 4.122 ± 1.545
2.885GlyThr: 2.885 ± 1.599
2.061GlyVal: 2.061 ± 1.051
0.412GlyTrp: 0.412 ± 0.228
1.649GlyTyr: 1.649 ± 1.03
0.0GlyXaa: 0.0 ± 0.0
His
0.824HisAla: 0.824 ± 1.134
0.412HisCys: 0.412 ± 0.228
0.412HisAsp: 0.412 ± 0.228
1.649HisGlu: 1.649 ± 0.462
2.473HisPhe: 2.473 ± 0.899
2.061HisGly: 2.061 ± 0.622
2.473HisHis: 2.473 ± 0.761
1.649HisIle: 1.649 ± 0.848
0.412HisLys: 0.412 ± 0.228
2.885HisLeu: 2.885 ± 1.984
0.0HisMet: 0.0 ± 0.0
2.061HisAsn: 2.061 ± 0.622
1.237HisPro: 1.237 ± 0.685
2.885HisGln: 2.885 ± 0.652
2.885HisArg: 2.885 ± 1.81
3.298HisSer: 3.298 ± 1.141
4.122HisThr: 4.122 ± 1.545
2.473HisVal: 2.473 ± 1.12
0.412HisTrp: 0.412 ± 0.565
2.473HisTyr: 2.473 ± 1.37
0.0HisXaa: 0.0 ± 0.0
Ile
3.71IleAla: 3.71 ± 1.458
0.824IleCys: 0.824 ± 0.424
6.595IleAsp: 6.595 ± 3.03
4.122IleGlu: 4.122 ± 1.679
4.122IlePhe: 4.122 ± 1.245
2.473IleGly: 2.473 ± 0.816
2.061IleHis: 2.061 ± 1.051
2.473IleIle: 2.473 ± 1.272
1.237IleLys: 1.237 ± 0.38
8.244IleLeu: 8.244 ± 4.755
0.824IleMet: 0.824 ± 0.457
1.237IleAsn: 1.237 ± 1.059
4.122IlePro: 4.122 ± 0.279
1.237IleGln: 1.237 ± 0.685
2.473IleArg: 2.473 ± 0.684
6.183IleSer: 6.183 ± 1.33
2.885IleThr: 2.885 ± 0.652
5.359IleVal: 5.359 ± 2.352
1.649IleTrp: 1.649 ± 1.341
3.298IleTyr: 3.298 ± 1.239
0.0IleXaa: 0.0 ± 0.0
Lys
1.649LysAla: 1.649 ± 0.914
2.061LysCys: 2.061 ± 0.622
1.649LysAsp: 1.649 ± 0.914
0.412LysGlu: 0.412 ± 1.247
1.649LysPhe: 1.649 ± 0.914
1.649LysGly: 1.649 ± 0.914
0.0LysHis: 0.0 ± 0.0
2.061LysIle: 2.061 ± 0.622
0.824LysLys: 0.824 ± 0.457
3.298LysLeu: 3.298 ± 1.239
0.824LysMet: 0.824 ± 0.457
1.237LysAsn: 1.237 ± 0.38
1.649LysPro: 1.649 ± 0.462
2.473LysGln: 2.473 ± 0.684
1.649LysArg: 1.649 ± 0.914
1.649LysSer: 1.649 ± 0.914
4.122LysThr: 4.122 ± 2.103
1.649LysVal: 1.649 ± 1.03
0.412LysTrp: 0.412 ± 1.247
1.237LysTyr: 1.237 ± 0.685
0.0LysXaa: 0.0 ± 0.0
Leu
7.007LeuAla: 7.007 ± 2.342
2.061LeuCys: 2.061 ± 0.622
4.534LeuAsp: 4.534 ± 1.516
5.359LeuGlu: 5.359 ± 0.142
6.183LeuPhe: 6.183 ± 2.333
4.946LeuGly: 4.946 ± 0.582
4.122LeuHis: 4.122 ± 1.545
7.832LeuIle: 7.832 ± 0.879
3.298LeuLys: 3.298 ± 1.239
14.427LeuLeu: 14.427 ± 6.402
0.824LeuMet: 0.824 ± 0.457
3.71LeuAsn: 3.71 ± 1.073
6.595LeuPro: 6.595 ± 4.722
3.298LeuGln: 3.298 ± 1.239
6.183LeuArg: 6.183 ± 2.605
9.068LeuSer: 9.068 ± 6.737
6.595LeuThr: 6.595 ± 4.257
9.481LeuVal: 9.481 ± 2.122
0.824LeuTrp: 0.824 ± 1.13
2.885LeuTyr: 2.885 ± 0.652
0.0LeuXaa: 0.0 ± 0.0
Met
1.237MetAla: 1.237 ± 0.38
0.824MetCys: 0.824 ± 0.424
0.412MetAsp: 0.412 ± 0.228
0.412MetGlu: 0.412 ± 0.565
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.412MetHis: 0.412 ± 0.228
0.0MetIle: 0.0 ± 0.0
0.412MetLys: 0.412 ± 0.228
0.824MetLeu: 0.824 ± 1.134
0.412MetMet: 0.412 ± 0.228
0.824MetAsn: 0.824 ± 0.457
0.824MetPro: 0.824 ± 1.13
0.412MetGln: 0.412 ± 0.228
1.649MetArg: 1.649 ± 1.03
0.824MetSer: 0.824 ± 1.134
0.824MetThr: 0.824 ± 0.457
0.0MetVal: 0.0 ± 0.0
0.412MetTrp: 0.412 ± 0.228
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.298AsnAla: 3.298 ± 1.863
0.412AsnCys: 0.412 ± 0.228
2.061AsnAsp: 2.061 ± 1.142
0.412AsnGlu: 0.412 ± 0.228
2.473AsnPhe: 2.473 ± 0.761
2.885AsnGly: 2.885 ± 0.652
3.298AsnHis: 3.298 ± 0.697
3.71AsnIle: 3.71 ± 0.807
2.473AsnLys: 2.473 ± 1.37
5.359AsnLeu: 5.359 ± 3.077
0.412AsnMet: 0.412 ± 0.228
1.237AsnAsn: 1.237 ± 0.38
2.473AsnPro: 2.473 ± 0.684
2.061AsnGln: 2.061 ± 1.142
2.473AsnArg: 2.473 ± 1.12
2.061AsnSer: 2.061 ± 0.622
2.885AsnThr: 2.885 ± 1.024
2.061AsnVal: 2.061 ± 1.142
0.412AsnTrp: 0.412 ± 0.228
0.824AsnTyr: 0.824 ± 0.457
0.0AsnXaa: 0.0 ± 0.0
Pro
3.298ProAla: 3.298 ± 1.365
0.824ProCys: 0.824 ± 0.424
2.061ProAsp: 2.061 ± 0.772
1.649ProGlu: 1.649 ± 0.914
6.183ProPhe: 6.183 ± 1.0
4.534ProGly: 4.534 ± 1.138
0.824ProHis: 0.824 ± 1.13
4.122ProIle: 4.122 ± 0.6
1.649ProLys: 1.649 ± 1.03
7.42ProLeu: 7.42 ± 3.56
0.0ProMet: 0.0 ± 0.0
3.298ProAsn: 3.298 ± 0.925
8.656ProPro: 8.656 ± 7.474
2.473ProGln: 2.473 ± 0.684
4.122ProArg: 4.122 ± 1.545
6.183ProSer: 6.183 ± 3.349
4.122ProThr: 4.122 ± 1.781
3.298ProVal: 3.298 ± 1.141
0.412ProTrp: 0.412 ± 0.565
1.237ProTyr: 1.237 ± 0.38
0.0ProXaa: 0.0 ± 0.0
Gln
1.649GlnAla: 1.649 ± 0.914
0.0GlnCys: 0.0 ± 0.0
1.237GlnAsp: 1.237 ± 0.685
1.649GlnGlu: 1.649 ± 1.03
2.885GlnPhe: 2.885 ± 1.024
1.237GlnGly: 1.237 ± 1.059
1.237GlnHis: 1.237 ± 1.059
2.885GlnIle: 2.885 ± 1.024
1.237GlnLys: 1.237 ± 2.373
2.473GlnLeu: 2.473 ± 0.816
0.0GlnMet: 0.0 ± 0.0
2.061GlnAsn: 2.061 ± 0.622
2.473GlnPro: 2.473 ± 0.684
1.649GlnGln: 1.649 ± 0.914
2.885GlnArg: 2.885 ± 1.599
2.885GlnSer: 2.885 ± 1.187
2.885GlnThr: 2.885 ± 0.685
3.298GlnVal: 3.298 ± 0.697
0.824GlnTrp: 0.824 ± 0.457
2.061GlnTyr: 2.061 ± 1.119
0.0GlnXaa: 0.0 ± 0.0
Arg
2.885ArgAla: 2.885 ± 1.913
0.412ArgCys: 0.412 ± 0.228
3.71ArgAsp: 3.71 ± 1.458
2.061ArgGlu: 2.061 ± 0.622
4.946ArgPhe: 4.946 ± 1.798
2.885ArgGly: 2.885 ± 4.636
2.885ArgHis: 2.885 ± 1.187
3.298ArgIle: 3.298 ± 1.239
2.473ArgLys: 2.473 ± 0.816
12.366ArgLeu: 12.366 ± 0.987
1.649ArgMet: 1.649 ± 1.533
2.061ArgAsn: 2.061 ± 0.622
2.473ArgPro: 2.473 ± 2.605
2.473ArgGln: 2.473 ± 2.031
2.885ArgArg: 2.885 ± 0.815
3.298ArgSer: 3.298 ± 1.827
4.122ArgThr: 4.122 ± 1.185
3.298ArgVal: 3.298 ± 1.827
0.0ArgTrp: 0.0 ± 0.0
3.298ArgTyr: 3.298 ± 1.141
0.0ArgXaa: 0.0 ± 0.0
Ser
3.71SerAla: 3.71 ± 1.073
3.298SerCys: 3.298 ± 0.484
2.061SerAsp: 2.061 ± 0.622
2.885SerGlu: 2.885 ± 2.032
6.183SerPhe: 6.183 ± 1.33
5.771SerGly: 5.771 ± 1.008
2.061SerHis: 2.061 ± 0.772
4.534SerIle: 4.534 ± 1.433
2.885SerLys: 2.885 ± 1.599
9.068SerLeu: 9.068 ± 5.237
0.0SerMet: 0.0 ± 0.0
3.71SerAsn: 3.71 ± 0.807
7.42SerPro: 7.42 ± 4.297
1.237SerGln: 1.237 ± 0.38
6.595SerArg: 6.595 ± 1.701
8.656SerSer: 8.656 ± 5.991
4.122SerThr: 4.122 ± 2.12
4.946SerVal: 4.946 ± 2.42
0.0SerTrp: 0.0 ± 0.0
3.298SerTyr: 3.298 ± 0.697
0.0SerXaa: 0.0 ± 0.0
Thr
3.71ThrAla: 3.71 ± 0.322
0.824ThrCys: 0.824 ± 0.457
2.061ThrAsp: 2.061 ± 1.051
4.946ThrGlu: 4.946 ± 2.127
4.122ThrPhe: 4.122 ± 1.545
1.649ThrGly: 1.649 ± 0.914
1.649ThrHis: 1.649 ± 1.341
7.007ThrIle: 7.007 ± 1.23
0.824ThrLys: 0.824 ± 2.494
4.122ThrLeu: 4.122 ± 2.78
1.649ThrMet: 1.649 ± 0.914
2.473ThrAsn: 2.473 ± 2.118
3.298ThrPro: 3.298 ± 1.141
2.061ThrGln: 2.061 ± 0.772
5.771ThrArg: 5.771 ± 1.692
7.007ThrSer: 7.007 ± 2.728
7.007ThrThr: 7.007 ± 1.554
3.71ThrVal: 3.71 ± 1.073
1.237ThrTrp: 1.237 ± 1.566
1.237ThrTyr: 1.237 ± 1.059
0.0ThrXaa: 0.0 ± 0.0
Val
4.534ValAla: 4.534 ± 2.816
0.824ValCys: 0.824 ± 0.457
4.534ValAsp: 4.534 ± 1.903
2.473ValGlu: 2.473 ± 1.37
1.237ValPhe: 1.237 ± 0.38
3.298ValGly: 3.298 ± 0.484
2.473ValHis: 2.473 ± 0.816
4.122ValIle: 4.122 ± 2.948
2.473ValLys: 2.473 ± 1.12
6.183ValLeu: 6.183 ± 1.729
1.237ValMet: 1.237 ± 1.963
2.885ValAsn: 2.885 ± 1.024
3.71ValPro: 3.71 ± 0.322
2.061ValGln: 2.061 ± 1.051
5.771ValArg: 5.771 ± 1.008
4.946ValSer: 4.946 ± 1.387
2.061ValThr: 2.061 ± 0.772
4.946ValVal: 4.946 ± 2.8
0.412ValTrp: 0.412 ± 0.228
0.824ValTyr: 0.824 ± 0.424
0.0ValXaa: 0.0 ± 0.0
Trp
1.237TrpAla: 1.237 ± 2.373
0.0TrpCys: 0.0 ± 0.0
0.824TrpAsp: 0.824 ± 0.424
0.824TrpGlu: 0.824 ± 0.457
0.0TrpPhe: 0.0 ± 0.0
0.824TrpGly: 0.824 ± 0.424
0.0TrpHis: 0.0 ± 0.0
0.824TrpIle: 0.824 ± 0.424
0.0TrpLys: 0.0 ± 0.0
0.824TrpLeu: 0.824 ± 1.134
0.0TrpMet: 0.0 ± 0.0
0.412TrpAsn: 0.412 ± 0.228
0.412TrpPro: 0.412 ± 0.565
1.649TrpGln: 1.649 ± 1.341
0.824TrpArg: 0.824 ± 0.457
0.0TrpSer: 0.0 ± 0.0
0.412TrpThr: 0.412 ± 0.228
0.412TrpVal: 0.412 ± 1.247
0.0TrpTrp: 0.0 ± 0.0
0.824TrpTyr: 0.824 ± 1.13
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.298TyrAla: 3.298 ± 1.815
0.0TyrCys: 0.0 ± 0.0
0.824TyrAsp: 0.824 ± 0.457
0.824TyrGlu: 0.824 ± 0.457
2.473TyrPhe: 2.473 ± 0.899
1.237TyrGly: 1.237 ± 0.38
3.71TyrHis: 3.71 ± 2.232
1.649TyrIle: 1.649 ± 0.914
0.412TyrLys: 0.412 ± 0.228
3.71TyrLeu: 3.71 ± 0.807
0.0TyrMet: 0.0 ± 0.0
2.885TyrAsn: 2.885 ± 1.024
2.061TyrPro: 2.061 ± 1.051
1.237TyrGln: 1.237 ± 0.685
1.649TyrArg: 1.649 ± 0.462
3.298TyrSer: 3.298 ± 0.697
2.885TyrThr: 2.885 ± 1.599
2.885TyrVal: 2.885 ± 1.227
0.0TyrTrp: 0.0 ± 0.0
1.649TyrTyr: 1.649 ± 2.319
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2427 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski