Amino acid dipeptide frequency for Haloprofundus marisrubri

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
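A minimal sketch of how such a frequency could be computed is given here. It assumes one-letter amino acid codes, overlapping pairs counted within each protein (never across protein boundaries), and normalisation by the total number of pairs; the page does not state the exact procedure used by the database, and the function name and toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein and the
    counts are normalised by the total number of pairs in the whole set.
    """
    counts = Counter()
    for seq in proteins:
        # A sequence of length n contributes n - 1 overlapping pairs.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences, not taken from the proteome.
example = dipeptide_frequencies(["AAV", "AV"])
```

With the toy input, the pairs are AA (once) and AV (twice), so the resulting frequencies sum to 1000‰.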

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
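The expected value and the over/under classification can be sketched as follows. This is an illustration only: the function names are hypothetical, and the page does not say whether the colouring uses a plain comparison against the uniform expectation (as assumed here) or also takes the error estimate into account.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected per-mille frequency of one dipeptide under a uniform model."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation.

    Assumption: a simple comparison, ignoring the reported error.
    """
    expected = expected_uniform_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Values taken from the table: AlaAla = 12.507, CysCys = 0.093.
ala_ala = classify(12.507)
cys_cys = classify(0.093)
```

Under these assumptions AlaAla is classified as overrepresented and CysCys as underrepresented. Passing `alphabet_size=21` gives the expectation for the 441-dipeptide case (about 2.27‰).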

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
12.507AlaAla: 12.507 ± 0.157
0.67AlaCys: 0.67 ± 0.027
8.855AlaAsp: 8.855 ± 0.12
8.381AlaGlu: 8.381 ± 0.1
4.044AlaPhe: 4.044 ± 0.07
8.785AlaGly: 8.785 ± 0.119
1.807AlaHis: 1.807 ± 0.043
4.239AlaIle: 4.239 ± 0.073
2.009AlaLys: 2.009 ± 0.05
10.394AlaLeu: 10.394 ± 0.133
2.022AlaMet: 2.022 ± 0.046
2.486AlaAsn: 2.486 ± 0.051
3.673AlaPro: 3.673 ± 0.063
2.279AlaGln: 2.279 ± 0.041
5.896AlaArg: 5.896 ± 0.086
5.686AlaSer: 5.686 ± 0.084
6.473AlaThr: 6.473 ± 0.084
10.319AlaVal: 10.319 ± 0.124
1.035AlaTrp: 1.035 ± 0.035
2.748AlaTyr: 2.748 ± 0.052
0.0AlaXaa: 0.0 ± 0.0
Cys
0.611CysAla: 0.611 ± 0.025
0.093CysCys: 0.093 ± 0.009
0.534CysAsp: 0.534 ± 0.026
0.569CysGlu: 0.569 ± 0.024
0.219CysPhe: 0.219 ± 0.016
0.814CysGly: 0.814 ± 0.031
0.166CysHis: 0.166 ± 0.013
0.275CysIle: 0.275 ± 0.018
0.114CysLys: 0.114 ± 0.012
0.574CysLeu: 0.574 ± 0.025
0.11CysMet: 0.11 ± 0.01
0.168CysAsn: 0.168 ± 0.014
0.473CysPro: 0.473 ± 0.021
0.136CysGln: 0.136 ± 0.01
0.459CysArg: 0.459 ± 0.022
0.399CysSer: 0.399 ± 0.02
0.388CysThr: 0.388 ± 0.02
0.497CysVal: 0.497 ± 0.024
0.074CysTrp: 0.074 ± 0.009
0.197CysTyr: 0.197 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
9.785AspAla: 9.785 ± 0.132
0.648AspCys: 0.648 ± 0.028
7.88AspAsp: 7.88 ± 0.118
8.259AspGlu: 8.259 ± 0.109
1.846AspPhe: 1.846 ± 0.044
7.592AspGly: 7.592 ± 0.103
1.762AspHis: 1.762 ± 0.042
3.257AspIle: 3.257 ± 0.063
1.0AspLys: 1.0 ± 0.036
5.931AspLeu: 5.931 ± 0.07
1.161AspMet: 1.161 ± 0.031
1.422AspAsn: 1.422 ± 0.046
4.047AspPro: 4.047 ± 0.067
1.429AspGln: 1.429 ± 0.043
5.241AspArg: 5.241 ± 0.077
4.426AspSer: 4.426 ± 0.076
4.153AspThr: 4.153 ± 0.064
7.289AspVal: 7.289 ± 0.097
0.974AspTrp: 0.974 ± 0.032
1.859AspTyr: 1.859 ± 0.043
0.0AspXaa: 0.0 ± 0.0
Glu
7.826GluAla: 7.826 ± 0.091
0.456GluCys: 0.456 ± 0.022
4.697GluAsp: 4.697 ± 0.08
6.505GluGlu: 6.505 ± 0.099
3.364GluPhe: 3.364 ± 0.059
5.046GluGly: 5.046 ± 0.075
1.837GluHis: 1.837 ± 0.048
3.379GluIle: 3.379 ± 0.061
2.228GluLys: 2.228 ± 0.056
7.863GluLeu: 7.863 ± 0.097
2.012GluMet: 2.012 ± 0.044
2.83GluAsn: 2.83 ± 0.057
3.103GluPro: 3.103 ± 0.052
2.767GluGln: 2.767 ± 0.054
7.26GluArg: 7.26 ± 0.091
5.78GluSer: 5.78 ± 0.085
6.894GluThr: 6.894 ± 0.1
5.377GluVal: 5.377 ± 0.076
1.157GluTrp: 1.157 ± 0.033
2.612GluTyr: 2.612 ± 0.053
0.0GluXaa: 0.0 ± 0.0
Phe
3.866PheAla: 3.866 ± 0.073
0.302PheCys: 0.302 ± 0.015
3.47PheAsp: 3.47 ± 0.058
3.337PheGlu: 3.337 ± 0.065
1.232PhePhe: 1.232 ± 0.036
3.678PheGly: 3.678 ± 0.065
0.619PheHis: 0.619 ± 0.024
1.051PheIle: 1.051 ± 0.033
0.504PheLys: 0.504 ± 0.024
3.034PheLeu: 3.034 ± 0.058
0.491PheMet: 0.491 ± 0.02
0.723PheAsn: 0.723 ± 0.028
1.437PhePro: 1.437 ± 0.037
0.747PheGln: 0.747 ± 0.023
1.905PheArg: 1.905 ± 0.041
2.051PheSer: 2.051 ± 0.041
1.889PheThr: 1.889 ± 0.042
3.888PheVal: 3.888 ± 0.073
0.427PheTrp: 0.427 ± 0.02
0.883PheTyr: 0.883 ± 0.026
0.0PheXaa: 0.0 ± 0.0
Gly
7.616GlyAla: 7.616 ± 0.108
0.695GlyCys: 0.695 ± 0.03
6.532GlyAsp: 6.532 ± 0.09
6.922GlyGlu: 6.922 ± 0.087
3.31GlyPhe: 3.31 ± 0.064
7.42GlyGly: 7.42 ± 0.115
1.67GlyHis: 1.67 ± 0.041
3.941GlyIle: 3.941 ± 0.07
1.952GlyLys: 1.952 ± 0.044
7.316GlyLeu: 7.316 ± 0.092
1.645GlyMet: 1.645 ± 0.04
2.134GlyAsn: 2.134 ± 0.055
3.212GlyPro: 3.212 ± 0.058
2.097GlyGln: 2.097 ± 0.05
5.034GlyArg: 5.034 ± 0.068
5.27GlySer: 5.27 ± 0.088
5.305GlyThr: 5.305 ± 0.075
8.251GlyVal: 8.251 ± 0.107
1.114GlyTrp: 1.114 ± 0.037
2.831GlyTyr: 2.831 ± 0.048
0.0GlyXaa: 0.0 ± 0.0
His
2.02HisAla: 2.02 ± 0.053
0.192HisCys: 0.192 ± 0.013
1.79HisAsp: 1.79 ± 0.043
1.652HisGlu: 1.652 ± 0.039
0.502HisPhe: 0.502 ± 0.026
1.887HisGly: 1.887 ± 0.05
0.549HisHis: 0.549 ± 0.024
0.763HisIle: 0.763 ± 0.027
0.301HisLys: 0.301 ± 0.02
1.774HisLeu: 1.774 ± 0.04
0.237HisMet: 0.237 ± 0.015
0.435HisAsn: 0.435 ± 0.02
1.249HisPro: 1.249 ± 0.036
0.43HisGln: 0.43 ± 0.022
1.295HisArg: 1.295 ± 0.039
0.932HisSer: 0.932 ± 0.032
1.088HisThr: 1.088 ± 0.035
1.881HisVal: 1.881 ± 0.039
0.243HisTrp: 0.243 ± 0.015
0.557HisTyr: 0.557 ± 0.022
0.0HisXaa: 0.0 ± 0.0
Ile
4.101IleAla: 4.101 ± 0.067
0.233IleCys: 0.233 ± 0.016
3.422IleAsp: 3.422 ± 0.06
3.474IleGlu: 3.474 ± 0.06
1.014IlePhe: 1.014 ± 0.038
3.394IleGly: 3.394 ± 0.061
0.796IleHis: 0.796 ± 0.027
1.233IleIle: 1.233 ± 0.038
0.644IleLys: 0.644 ± 0.027
3.137IleLeu: 3.137 ± 0.063
0.436IleMet: 0.436 ± 0.022
0.881IleAsn: 0.881 ± 0.028
2.13IlePro: 2.13 ± 0.046
0.998IleGln: 0.998 ± 0.032
2.611IleArg: 2.611 ± 0.053
2.187IleSer: 2.187 ± 0.052
2.051IleThr: 2.051 ± 0.043
3.496IleVal: 3.496 ± 0.067
0.304IleTrp: 0.304 ± 0.018
0.841IleTyr: 0.841 ± 0.025
0.0IleXaa: 0.0 ± 0.0
Lys
1.877LysAla: 1.877 ± 0.049
0.122LysCys: 0.122 ± 0.011
1.069LysAsp: 1.069 ± 0.036
1.441LysGlu: 1.441 ± 0.045
0.609LysPhe: 0.609 ± 0.023
1.39LysGly: 1.39 ± 0.041
0.531LysHis: 0.531 ± 0.024
0.794LysIle: 0.794 ± 0.031
0.594LysLys: 0.594 ± 0.03
1.925LysLeu: 1.925 ± 0.046
0.429LysMet: 0.429 ± 0.021
0.677LysAsn: 0.677 ± 0.029
1.043LysPro: 1.043 ± 0.035
0.89LysGln: 0.89 ± 0.029
1.796LysArg: 1.796 ± 0.038
1.278LysSer: 1.278 ± 0.033
1.484LysThr: 1.484 ± 0.034
1.226LysVal: 1.226 ± 0.03
0.235LysTrp: 0.235 ± 0.014
0.588LysTyr: 0.588 ± 0.026
0.0LysXaa: 0.0 ± 0.0
Leu
10.404LeuAla: 10.404 ± 0.113
0.636LeuCys: 0.636 ± 0.027
7.596LeuAsp: 7.596 ± 0.108
5.919LeuGlu: 5.919 ± 0.08
3.533LeuPhe: 3.533 ± 0.073
7.865LeuGly: 7.865 ± 0.095
1.551LeuHis: 1.551 ± 0.044
2.586LeuIle: 2.586 ± 0.064
1.648LeuLys: 1.648 ± 0.043
9.03LeuLeu: 9.03 ± 0.122
1.336LeuMet: 1.336 ± 0.037
1.942LeuAsn: 1.942 ± 0.046
4.186LeuPro: 4.186 ± 0.064
2.174LeuGln: 2.174 ± 0.046
5.973LeuArg: 5.973 ± 0.082
6.783LeuSer: 6.783 ± 0.086
5.552LeuThr: 5.552 ± 0.073
9.461LeuVal: 9.461 ± 0.116
0.954LeuTrp: 0.954 ± 0.032
2.477LeuTyr: 2.477 ± 0.053
0.0LeuXaa: 0.0 ± 0.0
Met
1.738MetAla: 1.738 ± 0.042
0.113MetCys: 0.113 ± 0.011
1.219MetAsp: 1.219 ± 0.036
1.175MetGlu: 1.175 ± 0.033
0.557MetPhe: 0.557 ± 0.024
1.382MetGly: 1.382 ± 0.035
0.358MetHis: 0.358 ± 0.018
0.504MetIle: 0.504 ± 0.021
0.472MetLys: 0.472 ± 0.023
1.696MetLeu: 1.696 ± 0.042
0.299MetMet: 0.299 ± 0.016
0.638MetAsn: 0.638 ± 0.025
0.826MetPro: 0.826 ± 0.027
0.601MetGln: 0.601 ± 0.025
1.201MetArg: 1.201 ± 0.032
1.541MetSer: 1.541 ± 0.038
1.443MetThr: 1.443 ± 0.033
1.385MetVal: 1.385 ± 0.035
0.162MetTrp: 0.162 ± 0.013
0.433MetTyr: 0.433 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
2.791AsnAla: 2.791 ± 0.056
0.234AsnCys: 0.234 ± 0.014
1.828AsnAsp: 1.828 ± 0.042
1.877AsnGlu: 1.877 ± 0.046
0.722AsnPhe: 0.722 ± 0.028
2.313AsnGly: 2.313 ± 0.058
0.503AsnHis: 0.503 ± 0.025
0.986AsnIle: 0.986 ± 0.033
0.438AsnLys: 0.438 ± 0.021
2.18AsnLeu: 2.18 ± 0.042
0.468AsnMet: 0.468 ± 0.019
0.626AsnAsn: 0.626 ± 0.032
1.689AsnPro: 1.689 ± 0.04
0.693AsnGln: 0.693 ± 0.033
1.841AsnArg: 1.841 ± 0.043
1.269AsnSer: 1.269 ± 0.036
1.411AsnThr: 1.411 ± 0.032
2.572AsnVal: 2.572 ± 0.059
0.33AsnTrp: 0.33 ± 0.019
0.787AsnTyr: 0.787 ± 0.028
0.0AsnXaa: 0.0 ± 0.0
Pro
4.138ProAla: 4.138 ± 0.064
0.213ProCys: 0.213 ± 0.013
4.043ProAsp: 4.043 ± 0.068
4.557ProGlu: 4.557 ± 0.075
1.654ProPhe: 1.654 ± 0.04
3.512ProGly: 3.512 ± 0.066
0.881ProHis: 0.881 ± 0.032
1.683ProIle: 1.683 ± 0.042
1.043ProLys: 1.043 ± 0.03
3.835ProLeu: 3.835 ± 0.071
0.883ProMet: 0.883 ± 0.028
1.298ProAsn: 1.298 ± 0.036
2.02ProPro: 2.02 ± 0.04
1.143ProGln: 1.143 ± 0.037
2.23ProArg: 2.23 ± 0.045
2.717ProSer: 2.717 ± 0.05
3.372ProThr: 3.372 ± 0.062
4.021ProVal: 4.021 ± 0.066
0.528ProTrp: 0.528 ± 0.023
1.162ProTyr: 1.162 ± 0.036
0.0ProXaa: 0.0 ± 0.0
Gln
2.336GlnAla: 2.336 ± 0.047
0.141GlnCys: 0.141 ± 0.01
1.103GlnAsp: 1.103 ± 0.034
1.638GlnGlu: 1.638 ± 0.046
1.281GlnPhe: 1.281 ± 0.036
1.457GlnGly: 1.457 ± 0.038
0.533GlnHis: 0.533 ± 0.022
1.041GlnIle: 1.041 ± 0.032
0.691GlnLys: 0.691 ± 0.026
2.511GlnLeu: 2.511 ± 0.057
0.648GlnMet: 0.648 ± 0.026
0.88GlnAsn: 0.88 ± 0.035
1.079GlnPro: 1.079 ± 0.034
1.184GlnGln: 1.184 ± 0.041
2.07GlnArg: 2.07 ± 0.049
1.969GlnSer: 1.969 ± 0.046
1.86GlnThr: 1.86 ± 0.041
1.811GlnVal: 1.811 ± 0.04
0.419GlnTrp: 0.419 ± 0.021
0.943GlnTyr: 0.943 ± 0.027
0.0GlnXaa: 0.0 ± 0.0
Arg
6.089ArgAla: 6.089 ± 0.084
0.45ArgCys: 0.45 ± 0.023
4.693ArgAsp: 4.693 ± 0.077
6.371ArgGlu: 6.371 ± 0.087
2.416ArgPhe: 2.416 ± 0.047
4.575ArgGly: 4.575 ± 0.066
1.248ArgHis: 1.248 ± 0.033
2.892ArgIle: 2.892 ± 0.053
1.511ArgLys: 1.511 ± 0.04
6.32ArgLeu: 6.32 ± 0.09
1.373ArgMet: 1.373 ± 0.037
1.808ArgAsn: 1.808 ± 0.042
2.649ArgPro: 2.649 ± 0.049
1.896ArgGln: 1.896 ± 0.049
5.272ArgArg: 5.272 ± 0.086
3.593ArgSer: 3.593 ± 0.055
4.11ArgThr: 4.11 ± 0.064
5.54ArgVal: 5.54 ± 0.076
0.781ArgTrp: 0.781 ± 0.03
1.985ArgTyr: 1.985 ± 0.043
0.0ArgXaa: 0.0 ± 0.0
Ser
5.86SerAla: 5.86 ± 0.066
0.319SerCys: 0.319 ± 0.019
4.747SerAsp: 4.747 ± 0.072
4.886SerGlu: 4.886 ± 0.072
2.189SerPhe: 2.189 ± 0.049
5.85SerGly: 5.85 ± 0.072
1.085SerHis: 1.085 ± 0.034
2.343SerIle: 2.343 ± 0.046
1.359SerLys: 1.359 ± 0.038
5.825SerLeu: 5.825 ± 0.078
1.143SerMet: 1.143 ± 0.033
1.691SerAsn: 1.691 ± 0.044
2.786SerPro: 2.786 ± 0.059
1.568SerGln: 1.568 ± 0.041
3.478SerArg: 3.478 ± 0.052
3.638SerSer: 3.638 ± 0.076
3.928SerThr: 3.928 ± 0.07
5.715SerVal: 5.715 ± 0.083
0.662SerTrp: 0.662 ± 0.025
1.61SerTyr: 1.61 ± 0.037
0.0SerXaa: 0.0 ± 0.0
Thr
6.509ThrAla: 6.509 ± 0.086
0.328ThrCys: 0.328 ± 0.018
5.274ThrAsp: 5.274 ± 0.083
4.975ThrGlu: 4.975 ± 0.082
2.304ThrPhe: 2.304 ± 0.05
5.428ThrGly: 5.428 ± 0.07
1.222ThrHis: 1.222 ± 0.034
2.509ThrIle: 2.509 ± 0.054
1.242ThrLys: 1.242 ± 0.036
6.248ThrLeu: 6.248 ± 0.084
1.069ThrMet: 1.069 ± 0.031
1.693ThrAsn: 1.693 ± 0.039
3.484ThrPro: 3.484 ± 0.078
1.52ThrGln: 1.52 ± 0.045
3.527ThrArg: 3.527 ± 0.054
3.112ThrSer: 3.112 ± 0.055
4.511ThrThr: 4.511 ± 0.081
6.883ThrVal: 6.883 ± 0.09
0.685ThrTrp: 0.685 ± 0.024
1.75ThrTyr: 1.75 ± 0.037
0.0ThrXaa: 0.0 ± 0.0
Val
10.472ValAla: 10.472 ± 0.134
0.735ValCys: 0.735 ± 0.03
7.861ValAsp: 7.861 ± 0.097
7.751ValGlu: 7.751 ± 0.092
3.277ValPhe: 3.277 ± 0.057
8.583ValGly: 8.583 ± 0.108
1.735ValHis: 1.735 ± 0.04
2.719ValIle: 2.719 ± 0.062
1.5ValLys: 1.5 ± 0.036
8.13ValLeu: 8.13 ± 0.101
1.347ValMet: 1.347 ± 0.038
2.127ValAsn: 2.127 ± 0.046
4.086ValPro: 4.086 ± 0.064
1.909ValGln: 1.909 ± 0.047
5.554ValArg: 5.554 ± 0.087
5.986ValSer: 5.986 ± 0.08
6.034ValThr: 6.034 ± 0.084
10.635ValVal: 10.635 ± 0.127
0.834ValTrp: 0.834 ± 0.03
2.339ValTyr: 2.339 ± 0.05
0.0ValXaa: 0.0 ± 0.0
Trp
0.96TrpAla: 0.96 ± 0.031
0.089TrpCys: 0.089 ± 0.01
0.691TrpAsp: 0.691 ± 0.025
0.865TrpGlu: 0.865 ± 0.031
0.516TrpPhe: 0.516 ± 0.022
0.823TrpGly: 0.823 ± 0.029
0.235TrpHis: 0.235 ± 0.017
0.377TrpIle: 0.377 ± 0.019
0.289TrpLys: 0.289 ± 0.016
1.189TrpLeu: 1.189 ± 0.039
0.231TrpMet: 0.231 ± 0.014
0.43TrpAsn: 0.43 ± 0.018
0.477TrpPro: 0.477 ± 0.02
0.42TrpGln: 0.42 ± 0.019
0.895TrpArg: 0.895 ± 0.033
0.679TrpSer: 0.679 ± 0.026
0.808TrpThr: 0.808 ± 0.025
0.905TrpVal: 0.905 ± 0.029
0.209TrpTrp: 0.209 ± 0.014
0.382TrpTyr: 0.382 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.805TyrAla: 2.805 ± 0.047
0.203TyrCys: 0.203 ± 0.014
2.653TyrAsp: 2.653 ± 0.053
2.443TyrGlu: 2.443 ± 0.052
0.949TyrPhe: 0.949 ± 0.031
2.471TyrGly: 2.471 ± 0.046
0.686TyrHis: 0.686 ± 0.022
0.812TyrIle: 0.812 ± 0.03
0.455TyrLys: 0.455 ± 0.02
2.694TyrLeu: 2.694 ± 0.053
0.4TyrMet: 0.4 ± 0.023
0.667TyrAsn: 0.667 ± 0.027
1.286TyrPro: 1.286 ± 0.031
0.715TyrGln: 0.715 ± 0.029
2.034TyrArg: 2.034 ± 0.045
1.386TyrSer: 1.386 ± 0.036
1.522TyrThr: 1.522 ± 0.04
2.46TyrVal: 2.46 ± 0.051
0.343TyrTrp: 0.343 ± 0.016
0.875TyrTyr: 0.875 ± 0.03
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3761 proteins (1074478 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
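A protein-level bootstrap of this kind could look roughly like the sketch below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the error is the standard deviation across replicates; the page does not give these details, and the function name, the seed, and the toy input are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: whole proteins are resampled with replacement and the
    error is the sample standard deviation over the replicates.
    """
    rng = random.Random(seed)
    # Pre-count pairs per protein so each replicate only sums counters.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input: identical proteins give the same estimate in every replicate.
identical = bootstrap_error(["AAV"] * 5, "AA")
# Toy input with variation between proteins.
varied = bootstrap_error(["AAAA", "VVVV", "AVAV", "AAVV"], "AA")
```

Resampling at the protein level rather than at the level of individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.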

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski