Amino acid dipepetide frequency for Hubei picorna-like virus 64

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.565AlaAla: 4.565 ± 1.66
0.652AlaCys: 0.652 ± 0.343
3.913AlaAsp: 3.913 ± 0.026
2.934AlaGlu: 2.934 ± 0.527
3.261AlaPhe: 3.261 ± 0.316
3.587AlaGly: 3.587 ± 0.652
1.304AlaHis: 1.304 ± 0.329
4.239AlaIle: 4.239 ± 1.324
6.847AlaLys: 6.847 ± 1.568
6.195AlaLeu: 6.195 ± 0.297
2.608AlaMet: 2.608 ± 0.356
2.934AlaAsn: 2.934 ± 1.502
2.608AlaPro: 2.608 ± 1.673
1.956AlaGln: 1.956 ± 0.52
3.913AlaArg: 3.913 ± 0.026
3.261AlaSer: 3.261 ± 1.331
6.521AlaThr: 6.521 ± 1.14
3.913AlaVal: 3.913 ± 0.533
0.0AlaTrp: 0.0 ± 0.0
2.282AlaTyr: 2.282 ± 0.692
0.0AlaXaa: 0.0 ± 0.0
Cys
0.652CysAla: 0.652 ± 0.165
0.0CysCys: 0.0 ± 0.0
0.326CysAsp: 0.326 ± 0.171
1.956CysGlu: 1.956 ± 0.52
1.304CysPhe: 1.304 ± 0.178
0.978CysGly: 0.978 ± 0.514
0.652CysHis: 0.652 ± 0.165
0.652CysIle: 0.652 ± 0.165
0.978CysLys: 0.978 ± 0.514
0.0CysLeu: 0.0 ± 0.0
0.326CysMet: 0.326 ± 0.171
0.326CysAsn: 0.326 ± 0.171
0.0CysPro: 0.0 ± 0.0
0.978CysGln: 0.978 ± 0.514
0.652CysArg: 0.652 ± 0.343
0.978CysSer: 0.978 ± 0.514
0.326CysThr: 0.326 ± 0.171
0.978CysVal: 0.978 ± 0.514
0.326CysTrp: 0.326 ± 0.171
0.326CysTyr: 0.326 ± 0.171
0.0CysXaa: 0.0 ± 0.0
Asp
2.608AspAla: 2.608 ± 1.166
1.63AspCys: 1.63 ± 0.856
3.587AspAsp: 3.587 ± 1.377
4.565AspGlu: 4.565 ± 0.876
1.956AspPhe: 1.956 ± 0.52
1.304AspGly: 1.304 ± 0.329
1.304AspHis: 1.304 ± 0.329
4.565AspIle: 4.565 ± 0.139
2.608AspLys: 2.608 ± 0.356
7.173AspLeu: 7.173 ± 0.798
1.63AspMet: 1.63 ± 0.272
2.608AspAsn: 2.608 ± 0.152
4.891AspPro: 4.891 ± 0.982
1.63AspGln: 1.63 ± 0.349
1.956AspArg: 1.956 ± 0.013
2.934AspSer: 2.934 ± 0.488
5.217AspThr: 5.217 ± 0.811
3.261AspVal: 3.261 ± 0.698
0.326AspTrp: 0.326 ± 0.171
1.956AspTyr: 1.956 ± 0.52
0.0AspXaa: 0.0 ± 0.0
Glu
2.282GluAla: 2.282 ± 0.692
0.326GluCys: 0.326 ± 0.171
3.587GluAsp: 3.587 ± 0.869
4.239GluGlu: 4.239 ± 1.212
1.956GluPhe: 1.956 ± 0.013
1.956GluGly: 1.956 ± 0.013
1.63GluHis: 1.63 ± 0.349
4.891GluIle: 4.891 ± 0.475
4.565GluLys: 4.565 ± 0.369
5.217GluLeu: 5.217 ± 1.218
1.63GluMet: 1.63 ± 0.856
2.608GluAsn: 2.608 ± 0.152
2.282GluPro: 2.282 ± 0.323
0.652GluGln: 0.652 ± 0.343
4.891GluArg: 4.891 ± 1.047
2.282GluSer: 2.282 ± 0.323
2.608GluThr: 2.608 ± 0.152
3.261GluVal: 3.261 ± 0.191
0.326GluTrp: 0.326 ± 0.171
3.261GluTyr: 3.261 ± 0.698
0.0GluXaa: 0.0 ± 0.0
Phe
1.63PheAla: 1.63 ± 0.158
0.326PheCys: 0.326 ± 0.171
1.63PheAsp: 1.63 ± 0.158
1.956PheGlu: 1.956 ± 0.494
0.652PhePhe: 0.652 ± 0.343
1.956PheGly: 1.956 ± 1.028
1.304PheHis: 1.304 ± 0.685
4.565PheIle: 4.565 ± 0.139
4.565PheLys: 4.565 ± 0.876
2.608PheLeu: 2.608 ± 0.356
0.978PheMet: 0.978 ± 0.514
4.565PheAsn: 4.565 ± 0.369
1.304PhePro: 1.304 ± 0.837
1.956PheGln: 1.956 ± 1.509
1.63PheArg: 1.63 ± 0.856
1.956PheSer: 1.956 ± 1.001
3.261PheThr: 3.261 ± 0.698
2.282PheVal: 2.282 ± 0.184
0.0PheTrp: 0.0 ± 0.0
1.63PheTyr: 1.63 ± 0.665
0.0PheXaa: 0.0 ± 0.0
Gly
3.587GlyAla: 3.587 ± 0.362
1.63GlyCys: 1.63 ± 0.856
3.587GlyAsp: 3.587 ± 0.652
1.956GlyGlu: 1.956 ± 0.013
2.282GlyPhe: 2.282 ± 0.83
1.63GlyGly: 1.63 ± 0.856
0.978GlyHis: 0.978 ± 0.514
2.934GlyIle: 2.934 ± 0.995
4.239GlyLys: 4.239 ± 1.212
5.869GlyLeu: 5.869 ± 1.483
1.63GlyMet: 1.63 ± 0.349
2.608GlyAsn: 2.608 ± 0.863
0.978GlyPro: 0.978 ± 0.007
2.608GlyGln: 2.608 ± 0.863
1.63GlyArg: 1.63 ± 0.349
3.587GlySer: 3.587 ± 1.16
4.891GlyThr: 4.891 ± 0.475
3.913GlyVal: 3.913 ± 0.026
0.978GlyTrp: 0.978 ± 0.501
1.63GlyTyr: 1.63 ± 0.349
0.0GlyXaa: 0.0 ± 0.0
His
1.63HisAla: 1.63 ± 0.158
0.0HisCys: 0.0 ± 0.0
1.63HisAsp: 1.63 ± 0.158
0.978HisGlu: 0.978 ± 0.007
0.652HisPhe: 0.652 ± 0.165
1.956HisGly: 1.956 ± 0.52
0.326HisHis: 0.326 ± 0.171
2.282HisIle: 2.282 ± 0.692
1.63HisLys: 1.63 ± 0.349
1.956HisLeu: 1.956 ± 0.52
1.304HisMet: 1.304 ± 0.685
0.652HisAsn: 0.652 ± 0.165
1.63HisPro: 1.63 ± 0.158
0.652HisGln: 0.652 ± 0.343
1.63HisArg: 1.63 ± 0.856
1.63HisSer: 1.63 ± 0.158
1.956HisThr: 1.956 ± 0.52
0.652HisVal: 0.652 ± 0.343
0.0HisTrp: 0.0 ± 0.0
0.978HisTyr: 0.978 ± 0.514
0.0HisXaa: 0.0 ± 0.0
Ile
6.195IleAla: 6.195 ± 1.311
0.652IleCys: 0.652 ± 0.343
2.282IleAsp: 2.282 ± 0.323
5.217IleGlu: 5.217 ± 1.218
3.261IlePhe: 3.261 ± 0.698
3.913IleGly: 3.913 ± 0.026
0.978IleHis: 0.978 ± 0.514
2.934IleIle: 2.934 ± 0.488
3.587IleLys: 3.587 ± 0.145
7.173IleLeu: 7.173 ± 0.217
1.956IleMet: 1.956 ± 0.52
2.934IleAsn: 2.934 ± 0.02
5.217IlePro: 5.217 ± 1.825
1.956IleGln: 1.956 ± 0.52
1.956IleArg: 1.956 ± 0.013
6.847IleSer: 6.847 ± 0.046
3.913IleThr: 3.913 ± 0.533
2.934IleVal: 2.934 ± 0.02
1.304IleTrp: 1.304 ± 0.685
2.282IleTyr: 2.282 ± 0.184
0.0IleXaa: 0.0 ± 0.0
Lys
2.934LysAla: 2.934 ± 0.02
0.652LysCys: 0.652 ± 0.343
4.239LysAsp: 4.239 ± 0.705
2.608LysGlu: 2.608 ± 0.152
2.934LysPhe: 2.934 ± 0.488
4.239LysGly: 4.239 ± 0.705
2.282LysHis: 2.282 ± 0.184
2.608LysIle: 2.608 ± 1.37
4.239LysLys: 4.239 ± 1.719
4.565LysLeu: 4.565 ± 1.383
1.63LysMet: 1.63 ± 0.218
5.543LysAsn: 5.543 ± 0.375
3.587LysPro: 3.587 ± 1.16
3.587LysGln: 3.587 ± 0.869
3.913LysArg: 3.913 ± 1.041
4.239LysSer: 4.239 ± 0.31
5.217LysThr: 5.217 ± 0.711
4.565LysVal: 4.565 ± 0.139
0.652LysTrp: 0.652 ± 0.343
3.913LysTyr: 3.913 ± 0.533
0.0LysXaa: 0.0 ± 0.0
Leu
6.521LeuAla: 6.521 ± 0.633
2.282LeuCys: 2.282 ± 1.199
4.891LeuAsp: 4.891 ± 0.033
3.587LeuGlu: 3.587 ± 0.869
2.608LeuPhe: 2.608 ± 0.863
4.891LeuGly: 4.891 ± 0.475
3.261LeuHis: 3.261 ± 0.191
4.239LeuIle: 4.239 ± 0.197
6.521LeuLys: 6.521 ± 0.889
3.587LeuLeu: 3.587 ± 0.869
3.587LeuMet: 3.587 ± 0.652
4.565LeuAsn: 4.565 ± 0.369
3.913LeuPro: 3.913 ± 2.003
3.913LeuGln: 3.913 ± 0.481
3.913LeuArg: 3.913 ± 0.481
2.282LeuSer: 2.282 ± 0.323
3.261LeuThr: 3.261 ± 0.191
4.565LeuVal: 4.565 ± 0.139
0.326LeuTrp: 0.326 ± 0.171
3.261LeuTyr: 3.261 ± 1.205
0.0LeuXaa: 0.0 ± 0.0
Met
2.608MetAla: 2.608 ± 0.659
0.652MetCys: 0.652 ± 0.343
1.63MetAsp: 1.63 ± 0.349
2.282MetGlu: 2.282 ± 0.692
0.978MetPhe: 0.978 ± 0.007
1.304MetGly: 1.304 ± 0.329
1.304MetHis: 1.304 ± 0.685
0.978MetIle: 0.978 ± 0.007
1.304MetLys: 1.304 ± 0.685
0.652MetLeu: 0.652 ± 0.165
0.652MetMet: 0.652 ± 0.672
1.304MetAsn: 1.304 ± 0.329
1.956MetPro: 1.956 ± 0.52
0.978MetGln: 0.978 ± 0.501
1.304MetArg: 1.304 ± 0.329
1.63MetSer: 1.63 ± 0.665
2.934MetThr: 2.934 ± 0.527
1.63MetVal: 1.63 ± 0.349
0.0MetTrp: 0.0 ± 0.0
0.978MetTyr: 0.978 ± 0.501
0.0MetXaa: 0.0 ± 0.0
Asn
2.934AsnAla: 2.934 ± 0.02
0.326AsnCys: 0.326 ± 0.336
3.587AsnAsp: 3.587 ± 1.16
3.261AsnGlu: 3.261 ± 0.698
2.934AsnPhe: 2.934 ± 0.02
2.608AsnGly: 2.608 ± 0.659
1.63AsnHis: 1.63 ± 0.349
5.543AsnIle: 5.543 ± 0.375
4.891AsnLys: 4.891 ± 1.554
3.587AsnLeu: 3.587 ± 0.652
1.956AsnMet: 1.956 ± 1.001
3.587AsnAsn: 3.587 ± 0.145
4.239AsnPro: 4.239 ± 0.197
3.587AsnGln: 3.587 ± 1.16
1.304AsnArg: 1.304 ± 0.178
4.891AsnSer: 4.891 ± 0.033
2.608AsnThr: 2.608 ± 0.659
3.587AsnVal: 3.587 ± 0.652
0.326AsnTrp: 0.326 ± 0.171
1.956AsnTyr: 1.956 ± 1.001
0.0AsnXaa: 0.0 ± 0.0
Pro
3.587ProAla: 3.587 ± 0.145
0.326ProCys: 0.326 ± 0.336
0.978ProAsp: 0.978 ± 0.501
3.587ProGlu: 3.587 ± 0.652
1.63ProPhe: 1.63 ± 0.158
2.934ProGly: 2.934 ± 0.02
0.652ProHis: 0.652 ± 0.165
3.913ProIle: 3.913 ± 1.496
3.261ProLys: 3.261 ± 0.824
4.239ProLeu: 4.239 ± 0.817
1.304ProMet: 1.304 ± 0.329
2.934ProAsn: 2.934 ± 0.488
1.956ProPro: 1.956 ± 0.494
1.304ProGln: 1.304 ± 1.344
1.63ProArg: 1.63 ± 0.349
6.195ProSer: 6.195 ± 2.833
6.195ProThr: 6.195 ± 1.819
2.282ProVal: 2.282 ± 1.337
0.0ProTrp: 0.0 ± 0.0
2.934ProTyr: 2.934 ± 0.488
0.0ProXaa: 0.0 ± 0.0
Gln
4.239GlnAla: 4.239 ± 0.705
0.326GlnCys: 0.326 ± 0.171
1.956GlnAsp: 1.956 ± 0.494
2.282GlnGlu: 2.282 ± 0.692
2.934GlnPhe: 2.934 ± 0.527
1.63GlnGly: 1.63 ± 0.158
0.0GlnHis: 0.0 ± 0.0
1.956GlnIle: 1.956 ± 0.494
1.63GlnLys: 1.63 ± 0.349
3.913GlnLeu: 3.913 ± 1.041
1.304GlnMet: 1.304 ± 0.329
1.304GlnAsn: 1.304 ± 0.837
2.282GlnPro: 2.282 ± 0.323
2.282GlnGln: 2.282 ± 1.337
3.261GlnArg: 3.261 ± 0.824
4.565GlnSer: 4.565 ± 0.369
1.956GlnThr: 1.956 ± 1.509
1.63GlnVal: 1.63 ± 0.349
0.0GlnTrp: 0.0 ± 0.0
2.608GlnTyr: 2.608 ± 1.37
0.0GlnXaa: 0.0 ± 0.0
Arg
1.63ArgAla: 1.63 ± 0.158
0.326ArgCys: 0.326 ± 0.336
2.934ArgAsp: 2.934 ± 1.034
3.587ArgGlu: 3.587 ± 0.362
2.608ArgPhe: 2.608 ± 0.863
2.934ArgGly: 2.934 ± 1.034
0.652ArgHis: 0.652 ± 0.343
3.587ArgIle: 3.587 ± 0.869
2.282ArgLys: 2.282 ± 0.184
3.587ArgLeu: 3.587 ± 0.145
0.326ArgMet: 0.326 ± 0.336
4.565ArgAsn: 4.565 ± 0.139
1.956ArgPro: 1.956 ± 0.494
3.261ArgGln: 3.261 ± 0.698
2.934ArgArg: 2.934 ± 1.541
3.587ArgSer: 3.587 ± 0.145
1.956ArgThr: 1.956 ± 0.013
2.282ArgVal: 2.282 ± 0.323
0.326ArgTrp: 0.326 ± 0.336
2.608ArgTyr: 2.608 ± 1.37
0.0ArgXaa: 0.0 ± 0.0
Ser
5.217SerAla: 5.217 ± 0.204
0.652SerCys: 0.652 ± 0.165
3.913SerAsp: 3.913 ± 0.988
2.934SerGlu: 2.934 ± 0.995
2.282SerPhe: 2.282 ± 0.83
4.565SerGly: 4.565 ± 0.646
1.304SerHis: 1.304 ± 0.178
6.195SerIle: 6.195 ± 1.225
4.565SerLys: 4.565 ± 1.66
5.217SerLeu: 5.217 ± 1.825
1.304SerMet: 1.304 ± 0.329
4.239SerAsn: 4.239 ± 0.31
2.934SerPro: 2.934 ± 0.488
3.261SerGln: 3.261 ± 0.698
3.587SerArg: 3.587 ± 0.652
6.847SerSer: 6.847 ± 0.969
4.239SerThr: 4.239 ± 0.817
5.217SerVal: 5.217 ± 1.318
0.978SerTrp: 0.978 ± 0.007
1.304SerTyr: 1.304 ± 0.329
0.0SerXaa: 0.0 ± 0.0
Thr
5.217ThrAla: 5.217 ± 1.825
1.304ThrCys: 1.304 ± 0.178
5.217ThrAsp: 5.217 ± 0.303
2.934ThrGlu: 2.934 ± 0.02
3.913ThrPhe: 3.913 ± 0.026
4.239ThrGly: 4.239 ± 2.339
1.304ThrHis: 1.304 ± 0.178
5.217ThrIle: 5.217 ± 1.726
4.565ThrLys: 4.565 ± 1.153
3.261ThrLeu: 3.261 ± 0.698
0.978ThrMet: 0.978 ± 0.514
5.217ThrAsn: 5.217 ± 1.318
3.587ThrPro: 3.587 ± 1.667
3.261ThrGln: 3.261 ± 0.191
3.587ThrArg: 3.587 ± 1.884
4.891ThrSer: 4.891 ± 1.996
4.239ThrThr: 4.239 ± 2.339
3.261ThrVal: 3.261 ± 0.316
0.326ThrTrp: 0.326 ± 0.171
2.934ThrTyr: 2.934 ± 0.527
0.0ThrXaa: 0.0 ± 0.0
Val
5.543ValAla: 5.543 ± 0.375
0.0ValCys: 0.0 ± 0.0
3.261ValAsp: 3.261 ± 0.316
1.63ValGlu: 1.63 ± 0.856
1.304ValPhe: 1.304 ± 0.329
2.934ValGly: 2.934 ± 0.527
1.63ValHis: 1.63 ± 0.349
3.261ValIle: 3.261 ± 0.316
0.978ValLys: 0.978 ± 0.007
3.913ValLeu: 3.913 ± 1.041
1.304ValMet: 1.304 ± 0.178
3.913ValAsn: 3.913 ± 0.026
4.565ValPro: 4.565 ± 1.153
2.934ValGln: 2.934 ± 0.488
2.934ValArg: 2.934 ± 0.488
5.217ValSer: 5.217 ± 0.711
5.217ValThr: 5.217 ± 0.811
5.217ValVal: 5.217 ± 0.711
0.652ValTrp: 0.652 ± 0.672
2.934ValTyr: 2.934 ± 0.02
0.0ValXaa: 0.0 ± 0.0
Trp
1.304TrpAla: 1.304 ± 0.329
0.0TrpCys: 0.0 ± 0.0
0.652TrpAsp: 0.652 ± 0.672
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.652TrpGly: 0.652 ± 0.343
0.326TrpHis: 0.326 ± 0.171
0.652TrpIle: 0.652 ± 0.343
0.978TrpLys: 0.978 ± 0.514
0.326TrpLeu: 0.326 ± 0.336
0.0TrpMet: 0.0 ± 0.0
0.326TrpAsn: 0.326 ± 0.171
0.652TrpPro: 0.652 ± 0.165
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.652TrpSer: 0.652 ± 0.343
0.0TrpThr: 0.0 ± 0.0
0.652TrpVal: 0.652 ± 0.165
0.0TrpTrp: 0.0 ± 0.0
0.326TrpTyr: 0.326 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.608TyrAla: 2.608 ± 0.356
0.652TyrCys: 0.652 ± 0.343
3.913TyrAsp: 3.913 ± 1.548
1.956TyrGlu: 1.956 ± 0.52
1.304TyrPhe: 1.304 ± 0.178
2.934TyrGly: 2.934 ± 0.527
1.304TyrHis: 1.304 ± 0.685
2.282TyrIle: 2.282 ± 0.692
4.239TyrLys: 4.239 ± 0.705
2.934TyrLeu: 2.934 ± 1.034
0.326TyrMet: 0.326 ± 0.336
2.608TyrAsn: 2.608 ± 0.659
1.304TyrPro: 1.304 ± 0.837
1.304TyrGln: 1.304 ± 0.178
1.304TyrArg: 1.304 ± 0.178
2.282TyrSer: 2.282 ± 0.323
2.934TyrThr: 2.934 ± 0.02
3.261TyrVal: 3.261 ± 0.698
0.652TyrTrp: 0.652 ± 0.165
2.608TyrTyr: 2.608 ± 0.356
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3068 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski