Amino acid dipeptide frequency for Hubei virga-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
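As a minimal sketch of the calculation described above, the following Python snippet counts overlapping dipeptides and reports them in per mille next to the uniform expectation of 1/400. The function name, the one-letter encoding, and the toy sequences are assumptions made for illustration; the exact counting rules used by the database (for example at protein boundaries) are not stated on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Dipeptides are overlapping windows of length 2 counted within each
    sequence; any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation for 20 x 20 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (len(STANDARD) ** 2)

# Toy sequences (hypothetical, not the viral proteins).
freqs = dipeptide_per_mille(["AAAC", "ACA"])
print(freqs)
print(EXPECTED_PER_MILLE)
```

Because the table reports per mille values, the 0.25% expectation corresponds to 2.5 in the table's units.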

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
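The unit conversion can be illustrated with a short Python sketch that turns per mille values into fractions and compares them with the uniform expectation of 2.5‰ (0.25%). The helper names are hypothetical, and the simple greater/less comparison is an assumption; the page does not say whether its red/blue marking also takes the error estimate into account.

```python
# Uniform expectation for 400 dipeptides, in per mille (0.25% = 2.5 per mille).
EXPECTED_PER_MILLE = 1000.0 / 400

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"

# Values taken from the table: AlaAla and CysCys.
for name, value in [("AlaAla", 7.44), ("CysCys", 0.676)]:
    print(name, per_mille_to_fraction(value), classify(value))
```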

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.44 ± 3.521
AlaCys: 1.015 ± 0.48
AlaAsp: 4.058 ± 1.92
AlaGlu: 5.411 ± 3.222
AlaPhe: 1.353 ± 0.64
AlaGly: 5.411 ± 2.561
AlaHis: 3.382 ± 0.327
AlaIle: 4.058 ± 1.92
AlaLys: 4.735 ± 0.313
AlaLeu: 7.44 ± 0.334
AlaMet: 3.044 ± 0.487
AlaAsn: 3.044 ± 1.44
AlaPro: 6.087 ± 4.83
AlaGln: 2.367 ± 2.735
AlaArg: 4.735 ± 3.542
AlaSer: 5.073 ± 1.455
AlaThr: 4.735 ± 1.615
AlaVal: 5.749 ± 2.721
AlaTrp: 1.015 ± 0.48
AlaTyr: 4.058 ± 1.935
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.353 ± 1.287
CysCys: 0.676 ± 1.607
CysAsp: 1.015 ± 1.447
CysGlu: 0.676 ± 0.32
CysPhe: 0.676 ± 0.32
CysGly: 1.015 ± 0.48
CysHis: 0.0 ± 0.0
CysIle: 1.691 ± 0.8
CysLys: 1.353 ± 0.64
CysLeu: 0.338 ± 0.16
CysMet: 0.338 ± 0.16
CysAsn: 0.676 ± 0.32
CysPro: 1.015 ± 0.48
CysGln: 0.676 ± 0.32
CysArg: 1.015 ± 0.48
CysSer: 3.044 ± 0.487
CysThr: 1.691 ± 1.127
CysVal: 1.015 ± 0.48
CysTrp: 0.0 ± 0.0
CysTyr: 0.676 ± 0.32
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.764 ± 3.201
AspCys: 1.015 ± 1.447
AspAsp: 2.029 ± 0.967
AspGlu: 4.396 ± 3.702
AspPhe: 1.353 ± 0.64
AspGly: 2.367 ± 1.12
AspHis: 1.015 ± 0.48
AspIle: 4.058 ± 1.92
AspLys: 3.72 ± 1.76
AspLeu: 6.764 ± 0.654
AspMet: 2.367 ± 1.12
AspAsn: 2.029 ± 0.96
AspPro: 4.396 ± 2.08
AspGln: 1.353 ± 0.64
AspArg: 3.044 ± 0.487
AspSer: 4.735 ± 2.24
AspThr: 4.058 ± 0.007
AspVal: 6.087 ± 0.953
AspTrp: 0.0 ± 0.0
AspTyr: 0.676 ± 0.32
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.029 ± 2.895
GluCys: 0.676 ± 0.32
GluAsp: 4.396 ± 2.08
GluGlu: 3.044 ± 1.44
GluPhe: 1.691 ± 0.8
GluGly: 2.705 ± 0.647
GluHis: 1.015 ± 0.48
GluIle: 2.705 ± 1.28
GluLys: 4.735 ± 2.24
GluLeu: 6.764 ± 4.51
GluMet: 1.691 ± 3.055
GluAsn: 2.367 ± 0.807
GluPro: 3.382 ± 0.327
GluGln: 0.338 ± 0.16
GluArg: 2.367 ± 1.12
GluSer: 4.058 ± 3.862
GluThr: 4.058 ± 0.007
GluVal: 4.735 ± 1.615
GluTrp: 0.676 ± 0.32
GluTyr: 2.029 ± 0.96
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.72 ± 0.167
PheCys: 0.676 ± 0.32
PheAsp: 2.367 ± 1.12
PheGlu: 2.705 ± 0.647
PhePhe: 1.353 ± 0.64
PheGly: 1.353 ± 0.64
PheHis: 0.338 ± 0.16
PheIle: 1.015 ± 0.48
PheLys: 1.691 ± 0.8
PheLeu: 2.367 ± 1.12
PheMet: 0.676 ± 0.32
PheAsn: 2.029 ± 0.96
PhePro: 1.691 ± 0.8
PheGln: 0.0 ± 0.0
PheArg: 1.015 ± 0.48
PheSer: 3.382 ± 1.6
PheThr: 1.691 ± 0.8
PheVal: 3.044 ± 0.487
PheTrp: 0.338 ± 0.16
PheTyr: 0.676 ± 1.607
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.705 ± 1.28
GlyCys: 1.353 ± 0.64
GlyAsp: 5.749 ± 2.721
GlyGlu: 2.029 ± 0.96
GlyPhe: 1.353 ± 1.287
GlyGly: 1.691 ± 0.8
GlyHis: 1.691 ± 1.127
GlyIle: 3.382 ± 0.327
GlyLys: 3.382 ± 1.6
GlyLeu: 2.705 ± 1.28
GlyMet: 2.029 ± 0.96
GlyAsn: 2.367 ± 0.807
GlyPro: 0.676 ± 0.32
GlyGln: 1.015 ± 1.447
GlyArg: 4.058 ± 1.92
GlySer: 4.058 ± 1.92
GlyThr: 2.029 ± 0.96
GlyVal: 4.396 ± 2.08
GlyTrp: 1.015 ± 1.447
GlyTyr: 2.367 ± 1.12
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.029 ± 0.967
HisCys: 1.015 ± 0.48
HisAsp: 1.015 ± 0.48
HisGlu: 0.676 ± 0.32
HisPhe: 0.0 ± 0.0
HisGly: 1.015 ± 1.447
HisHis: 0.0 ± 0.0
HisIle: 1.353 ± 0.64
HisLys: 0.676 ± 0.32
HisLeu: 3.72 ± 1.76
HisMet: 0.676 ± 0.32
HisAsn: 0.676 ± 0.32
HisPro: 1.353 ± 1.287
HisGln: 1.015 ± 0.48
HisArg: 1.691 ± 0.8
HisSer: 0.676 ± 3.535
HisThr: 1.015 ± 1.447
HisVal: 4.396 ± 2.08
HisTrp: 0.338 ± 0.16
HisTyr: 1.015 ± 1.447
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.073 ± 2.4
IleCys: 1.015 ± 0.48
IleAsp: 1.353 ± 0.64
IleGlu: 2.367 ± 1.12
IlePhe: 1.353 ± 0.64
IleGly: 2.705 ± 1.28
IleHis: 1.015 ± 0.48
IleIle: 2.705 ± 0.647
IleLys: 1.353 ± 0.64
IleLeu: 3.044 ± 1.44
IleMet: 1.691 ± 1.127
IleAsn: 1.353 ± 0.64
IlePro: 2.029 ± 0.96
IleGln: 1.691 ± 0.8
IleArg: 3.382 ± 0.327
IleSer: 4.735 ± 3.542
IleThr: 3.72 ± 0.167
IleVal: 4.058 ± 1.92
IleTrp: 0.676 ± 0.32
IleTyr: 0.338 ± 0.16
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.073 ± 0.473
LysCys: 0.0 ± 0.0
LysAsp: 2.029 ± 0.96
LysGlu: 4.058 ± 1.92
LysPhe: 1.691 ± 0.8
LysGly: 2.367 ± 1.12
LysHis: 1.353 ± 0.64
LysIle: 0.0 ± 0.0
LysLys: 2.367 ± 2.735
LysLeu: 5.749 ± 2.721
LysMet: 2.029 ± 0.96
LysAsn: 3.044 ± 1.44
LysPro: 4.396 ± 2.08
LysGln: 1.015 ± 0.48
LysArg: 5.749 ± 0.793
LysSer: 4.396 ± 0.153
LysThr: 2.367 ± 1.12
LysVal: 2.367 ± 0.807
LysTrp: 0.338 ± 0.16
LysTyr: 1.691 ± 0.8
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.764 ± 2.582
LeuCys: 2.367 ± 1.12
LeuAsp: 5.073 ± 2.4
LeuGlu: 3.72 ± 2.095
LeuPhe: 4.058 ± 1.92
LeuGly: 4.396 ± 0.153
LeuHis: 2.705 ± 1.28
LeuIle: 3.044 ± 1.44
LeuLys: 3.382 ± 0.327
LeuLeu: 8.116 ± 1.913
LeuMet: 2.705 ± 0.647
LeuAsn: 3.72 ± 0.167
LeuPro: 4.735 ± 2.24
LeuGln: 3.72 ± 0.167
LeuArg: 7.778 ± 4.029
LeuSer: 8.793 ± 2.233
LeuThr: 5.411 ± 0.633
LeuVal: 8.116 ± 1.942
LeuTrp: 0.676 ± 0.32
LeuTyr: 1.015 ± 0.48
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.382 ± 0.327
MetCys: 1.015 ± 0.48
MetAsp: 1.353 ± 0.64
MetGlu: 2.029 ± 0.967
MetPhe: 1.015 ± 1.447
MetGly: 0.676 ± 0.32
MetHis: 0.0 ± 0.0
MetIle: 1.015 ± 0.48
MetLys: 1.015 ± 0.48
MetLeu: 3.72 ± 1.76
MetMet: 0.338 ± 0.16
MetAsn: 0.676 ± 0.32
MetPro: 1.691 ± 3.055
MetGln: 0.0 ± 0.0
MetArg: 1.353 ± 0.64
MetSer: 3.044 ± 1.44
MetThr: 2.029 ± 0.967
MetVal: 2.029 ± 0.96
MetTrp: 0.338 ± 0.16
MetTyr: 0.338 ± 1.768
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.411 ± 0.633
AsnCys: 0.676 ± 0.32
AsnAsp: 2.705 ± 2.575
AsnGlu: 1.015 ± 0.48
AsnPhe: 2.029 ± 2.895
AsnGly: 2.705 ± 0.647
AsnHis: 1.015 ± 0.48
AsnIle: 2.029 ± 0.967
AsnLys: 0.0 ± 0.0
AsnLeu: 4.058 ± 0.007
AsnMet: 0.676 ± 0.414
AsnAsn: 0.676 ± 1.607
AsnPro: 1.691 ± 0.8
AsnGln: 1.015 ± 0.48
AsnArg: 3.044 ± 0.487
AsnSer: 2.029 ± 0.967
AsnThr: 2.029 ± 0.96
AsnVal: 2.705 ± 0.647
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.015 ± 0.48
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.396 ± 3.702
ProCys: 0.338 ± 0.16
ProAsp: 3.72 ± 1.76
ProGlu: 3.382 ± 2.255
ProPhe: 0.338 ± 0.16
ProGly: 3.382 ± 0.327
ProHis: 1.691 ± 0.8
ProIle: 4.058 ± 3.862
ProLys: 4.396 ± 0.153
ProLeu: 3.382 ± 0.327
ProMet: 2.029 ± 0.96
ProAsn: 0.338 ± 0.16
ProPro: 2.029 ± 6.75
ProGln: 2.029 ± 0.967
ProArg: 2.367 ± 0.807
ProSer: 4.058 ± 0.007
ProThr: 1.353 ± 0.64
ProVal: 5.411 ± 0.633
ProTrp: 0.338 ± 0.16
ProTyr: 2.367 ± 0.807
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.353 ± 0.64
GlnCys: 0.0 ± 0.0
GlnAsp: 1.353 ± 0.64
GlnGlu: 2.367 ± 1.12
GlnPhe: 0.676 ± 0.32
GlnGly: 1.015 ± 0.48
GlnHis: 0.338 ± 0.16
GlnIle: 1.015 ± 0.48
GlnLys: 1.015 ± 0.48
GlnLeu: 4.396 ± 0.153
GlnMet: 0.676 ± 0.732
GlnAsn: 0.676 ± 1.607
GlnPro: 1.353 ± 1.287
GlnGln: 1.015 ± 3.375
GlnArg: 3.72 ± 0.167
GlnSer: 1.353 ± 1.287
GlnThr: 3.72 ± 4.022
GlnVal: 1.015 ± 0.48
GlnTrp: 0.338 ± 0.16
GlnTyr: 0.338 ± 0.16
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.058 ± 5.79
ArgCys: 1.691 ± 0.8
ArgAsp: 4.058 ± 1.92
ArgGlu: 3.382 ± 0.327
ArgPhe: 3.044 ± 1.44
ArgGly: 1.353 ± 1.287
ArgHis: 1.015 ± 0.48
ArgIle: 3.382 ± 1.6
ArgLys: 4.058 ± 0.007
ArgLeu: 6.087 ± 2.881
ArgMet: 1.691 ± 0.8
ArgAsn: 2.705 ± 2.575
ArgPro: 3.044 ± 2.415
ArgGln: 1.691 ± 0.8
ArgArg: 3.72 ± 0.167
ArgSer: 7.44 ± 6.117
ArgThr: 6.764 ± 3.201
ArgVal: 5.749 ± 0.793
ArgTrp: 0.338 ± 0.16
ArgTyr: 2.367 ± 0.807
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.778 ± 0.174
SerCys: 1.015 ± 0.48
SerAsp: 4.058 ± 1.935
SerGlu: 4.058 ± 0.007
SerPhe: 3.72 ± 1.76
SerGly: 5.411 ± 2.561
SerHis: 2.029 ± 0.967
SerIle: 3.382 ± 0.327
SerLys: 6.087 ± 2.881
SerLeu: 6.764 ± 2.582
SerMet: 0.0 ± 0.0
SerAsn: 2.705 ± 2.575
SerPro: 2.367 ± 2.735
SerGln: 2.029 ± 0.967
SerArg: 4.396 ± 1.775
SerSer: 4.735 ± 0.313
SerThr: 4.735 ± 5.47
SerVal: 8.455 ± 0.146
SerTrp: 0.0 ± 0.0
SerTyr: 3.382 ± 2.255
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.749 ± 1.135
ThrCys: 0.676 ± 0.32
ThrAsp: 4.735 ± 0.313
ThrGlu: 4.735 ± 0.313
ThrPhe: 2.705 ± 1.28
ThrGly: 4.735 ± 2.24
ThrHis: 2.367 ± 1.12
ThrIle: 3.044 ± 1.44
ThrLys: 3.044 ± 1.44
ThrLeu: 4.396 ± 1.775
ThrMet: 1.691 ± 0.8
ThrAsn: 2.029 ± 0.967
ThrPro: 3.044 ± 0.487
ThrGln: 3.044 ± 2.415
ThrArg: 2.367 ± 0.807
ThrSer: 2.029 ± 2.895
ThrThr: 4.396 ± 0.153
ThrVal: 7.102 ± 1.433
ThrTrp: 1.015 ± 3.375
ThrTyr: 3.044 ± 0.487
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.425 ± 3.041
ValCys: 1.691 ± 0.8
ValAsp: 7.778 ± 2.102
ValGlu: 3.382 ± 1.6
ValPhe: 3.382 ± 1.6
ValGly: 3.382 ± 1.6
ValHis: 2.367 ± 6.59
ValIle: 3.044 ± 1.44
ValLys: 3.044 ± 1.44
ValLeu: 7.778 ± 0.174
ValMet: 1.015 ± 0.48
ValAsn: 3.382 ± 0.327
ValPro: 5.073 ± 0.473
ValGln: 2.029 ± 0.96
ValArg: 9.131 ± 0.466
ValSer: 6.764 ± 1.273
ValThr: 6.425 ± 1.113
ValVal: 7.44 ± 3.521
ValTrp: 1.353 ± 0.64
ValTyr: 3.382 ± 1.6
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 1.015 ± 0.48
TrpGlu: 0.676 ± 1.607
TrpPhe: 0.0 ± 0.0
TrpGly: 0.338 ± 0.16
TrpHis: 0.0 ± 0.0
TrpIle: 0.338 ± 0.16
TrpLys: 0.676 ± 0.32
TrpLeu: 0.676 ± 0.32
TrpMet: 0.338 ± 0.16
TrpAsn: 1.353 ± 1.287
TrpPro: 0.0 ± 0.0
TrpGln: 0.676 ± 1.607
TrpArg: 0.676 ± 0.32
TrpSer: 0.0 ± 0.0
TrpThr: 1.353 ± 0.64
TrpVal: 0.338 ± 0.16
TrpTrp: 0.676 ± 0.32
TrpTyr: 0.676 ± 0.32
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.705 ± 0.647
TyrCys: 2.029 ± 4.822
TyrAsp: 2.367 ± 1.12
TyrGlu: 1.353 ± 1.287
TyrPhe: 0.338 ± 0.16
TyrGly: 2.029 ± 0.96
TyrHis: 1.015 ± 0.48
TyrIle: 0.338 ± 0.16
TyrLys: 1.353 ± 0.64
TyrLeu: 2.029 ± 0.967
TyrMet: 1.015 ± 1.447
TyrAsn: 1.015 ± 0.48
TyrPro: 1.353 ± 1.287
TyrGln: 1.015 ± 0.48
TyrArg: 2.029 ± 0.96
TyrSer: 2.705 ± 0.647
TyrThr: 2.367 ± 1.12
TyrVal: 4.058 ± 0.007
TyrTrp: 0.338 ± 0.16
TyrTyr: 1.691 ± 0.8
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2958 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
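A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the 100 estimates is taken as the error. This is only an illustration under stated assumptions; the function names, the use of the population standard deviation, and the fixed seed are hypothetical choices, not the database's documented procedure.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(sequences, pair):
    """Per mille frequency of one dipeptide (one-letter codes, e.g. 'AA')."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over resampled proteomes."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)

# Toy proteome (hypothetical sequences, not the viral proteins).
toy = ["AAAA", "CCCC"]
print(bootstrap_error(toy, "AA"))
```

With only two proteins, each resample can take just three distinct compositions, so the resulting error estimates are necessarily coarse.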

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski