Amino acid dipepetide frequency for Desulfarculus baarsii (strain ATCC 33931 / DSM 2075 / LMG 7858 / VKM B-1802 / 2st14)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.208AlaAla: 18.208 ± 0.244
1.597AlaCys: 1.597 ± 0.043
6.749AlaAsp: 6.749 ± 0.086
7.518AlaGlu: 7.518 ± 0.091
3.715AlaPhe: 3.715 ± 0.066
11.008AlaGly: 11.008 ± 0.118
2.377AlaHis: 2.377 ± 0.055
5.114AlaIle: 5.114 ± 0.079
4.898AlaLys: 4.898 ± 0.084
14.575AlaLeu: 14.575 ± 0.17
4.203AlaMet: 4.203 ± 0.061
2.703AlaAsn: 2.703 ± 0.056
5.795AlaPro: 5.795 ± 0.1
6.084AlaGln: 6.084 ± 0.105
8.983AlaArg: 8.983 ± 0.102
5.369AlaSer: 5.369 ± 0.093
4.644AlaThr: 4.644 ± 0.074
9.166AlaVal: 9.166 ± 0.112
1.666AlaTrp: 1.666 ± 0.045
2.446AlaTyr: 2.446 ± 0.046
0.0AlaXaa: 0.0 ± 0.0
Cys
1.437CysAla: 1.437 ± 0.035
0.242CysCys: 0.242 ± 0.015
0.562CysAsp: 0.562 ± 0.025
0.471CysGlu: 0.471 ± 0.022
0.447CysPhe: 0.447 ± 0.018
1.411CysGly: 1.411 ± 0.042
0.399CysHis: 0.399 ± 0.03
0.376CysIle: 0.376 ± 0.021
0.339CysLys: 0.339 ± 0.017
1.612CysLeu: 1.612 ± 0.039
0.233CysMet: 0.233 ± 0.015
0.294CysAsn: 0.294 ± 0.017
0.92CysPro: 0.92 ± 0.035
0.672CysGln: 0.672 ± 0.024
1.051CysArg: 1.051 ± 0.034
0.54CysSer: 0.54 ± 0.024
0.473CysThr: 0.473 ± 0.024
0.748CysVal: 0.748 ± 0.026
0.176CysTrp: 0.176 ± 0.011
0.299CysTyr: 0.299 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
5.214AspAla: 5.214 ± 0.064
0.681AspCys: 0.681 ± 0.028
3.502AspAsp: 3.502 ± 0.061
3.959AspGlu: 3.959 ± 0.072
2.192AspPhe: 2.192 ± 0.049
5.011AspGly: 5.011 ± 0.076
1.236AspHis: 1.236 ± 0.031
2.668AspIle: 2.668 ± 0.054
2.172AspLys: 2.172 ± 0.044
6.269AspLeu: 6.269 ± 0.085
1.55AspMet: 1.55 ± 0.045
1.371AspAsn: 1.371 ± 0.039
3.217AspPro: 3.217 ± 0.059
2.739AspGln: 2.739 ± 0.055
3.238AspArg: 3.238 ± 0.06
2.066AspSer: 2.066 ± 0.049
1.651AspThr: 1.651 ± 0.04
3.805AspVal: 3.805 ± 0.06
0.826AspTrp: 0.826 ± 0.031
1.675AspTyr: 1.675 ± 0.045
0.0AspXaa: 0.0 ± 0.0
Glu
7.751GluAla: 7.751 ± 0.093
0.428GluCys: 0.428 ± 0.021
2.671GluAsp: 2.671 ± 0.049
3.276GluGlu: 3.276 ± 0.064
1.782GluPhe: 1.782 ± 0.044
4.099GluGly: 4.099 ± 0.063
1.222GluHis: 1.222 ± 0.029
3.006GluIle: 3.006 ± 0.059
2.501GluLys: 2.501 ± 0.054
7.31GluLeu: 7.31 ± 0.095
1.779GluMet: 1.779 ± 0.042
1.636GluAsn: 1.636 ± 0.044
2.78GluPro: 2.78 ± 0.053
2.578GluGln: 2.578 ± 0.053
4.756GluArg: 4.756 ± 0.07
2.406GluSer: 2.406 ± 0.05
2.333GluThr: 2.333 ± 0.045
4.044GluVal: 4.044 ± 0.065
0.448GluTrp: 0.448 ± 0.021
1.121GluTyr: 1.121 ± 0.033
0.0GluXaa: 0.0 ± 0.0
Phe
3.615PheAla: 3.615 ± 0.065
0.636PheCys: 0.636 ± 0.026
2.299PheAsp: 2.299 ± 0.048
1.883PheGlu: 1.883 ± 0.042
1.565PhePhe: 1.565 ± 0.04
3.274PheGly: 3.274 ± 0.063
0.68PheHis: 0.68 ± 0.027
1.61PheIle: 1.61 ± 0.04
1.189PheLys: 1.189 ± 0.037
3.243PheLeu: 3.243 ± 0.063
0.949PheMet: 0.949 ± 0.034
1.143PheAsn: 1.143 ± 0.033
1.288PhePro: 1.288 ± 0.038
1.052PheGln: 1.052 ± 0.027
1.928PheArg: 1.928 ± 0.047
2.008PheSer: 2.008 ± 0.046
1.568PheThr: 1.568 ± 0.039
2.556PheVal: 2.556 ± 0.052
0.559PheTrp: 0.559 ± 0.021
0.976PheTyr: 0.976 ± 0.032
0.0PheXaa: 0.0 ± 0.0
Gly
9.051GlyAla: 9.051 ± 0.113
1.243GlyCys: 1.243 ± 0.036
4.324GlyAsp: 4.324 ± 0.067
4.264GlyGlu: 4.264 ± 0.06
3.098GlyPhe: 3.098 ± 0.055
8.273GlyGly: 8.273 ± 0.135
2.02GlyHis: 2.02 ± 0.045
2.358GlyIle: 2.358 ± 0.055
3.454GlyLys: 3.454 ± 0.063
11.782GlyLeu: 11.782 ± 0.119
2.24GlyMet: 2.24 ± 0.05
1.754GlyAsn: 1.754 ± 0.044
4.55GlyPro: 4.55 ± 0.079
5.329GlyGln: 5.329 ± 0.081
7.78GlyArg: 7.78 ± 0.097
3.537GlySer: 3.537 ± 0.085
2.402GlyThr: 2.402 ± 0.052
7.351GlyVal: 7.351 ± 0.087
1.359GlyTrp: 1.359 ± 0.041
2.247GlyTyr: 2.247 ± 0.051
0.0GlyXaa: 0.0 ± 0.0
His
2.066HisAla: 2.066 ± 0.041
0.328HisCys: 0.328 ± 0.017
1.302HisAsp: 1.302 ± 0.038
1.093HisGlu: 1.093 ± 0.03
0.821HisPhe: 0.821 ± 0.026
2.177HisGly: 2.177 ± 0.041
0.54HisHis: 0.54 ± 0.028
0.855HisIle: 0.855 ± 0.03
0.669HisLys: 0.669 ± 0.024
2.207HisLeu: 2.207 ± 0.048
0.478HisMet: 0.478 ± 0.021
0.564HisAsn: 0.564 ± 0.022
1.181HisPro: 1.181 ± 0.031
0.806HisGln: 0.806 ± 0.026
1.231HisArg: 1.231 ± 0.032
0.818HisSer: 0.818 ± 0.029
0.74HisThr: 0.74 ± 0.025
1.301HisVal: 1.301 ± 0.033
0.266HisTrp: 0.266 ± 0.016
0.571HisTyr: 0.571 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
4.806IleAla: 4.806 ± 0.057
0.689IleCys: 0.689 ± 0.027
3.215IleAsp: 3.215 ± 0.06
2.919IleGlu: 2.919 ± 0.049
1.707IlePhe: 1.707 ± 0.042
3.758IleGly: 3.758 ± 0.065
0.959IleHis: 0.959 ± 0.029
2.448IleIle: 2.448 ± 0.053
1.956IleLys: 1.956 ± 0.044
4.033IleLeu: 4.033 ± 0.066
1.212IleMet: 1.212 ± 0.042
1.625IleAsn: 1.625 ± 0.04
1.731IlePro: 1.731 ± 0.048
1.335IleGln: 1.335 ± 0.033
2.294IleArg: 2.294 ± 0.055
2.512IleSer: 2.512 ± 0.052
2.233IleThr: 2.233 ± 0.055
3.425IleVal: 3.425 ± 0.06
0.505IleTrp: 0.505 ± 0.019
1.157IleTyr: 1.157 ± 0.038
0.0IleXaa: 0.0 ± 0.0
Lys
4.694LysAla: 4.694 ± 0.085
0.346LysCys: 0.346 ± 0.019
1.951LysAsp: 1.951 ± 0.043
1.91LysGlu: 1.91 ± 0.046
0.981LysPhe: 0.981 ± 0.035
2.822LysGly: 2.822 ± 0.05
0.636LysHis: 0.636 ± 0.024
2.016LysIle: 2.016 ± 0.042
1.912LysLys: 1.912 ± 0.053
3.894LysLeu: 3.894 ± 0.064
1.057LysMet: 1.057 ± 0.034
1.16LysAsn: 1.16 ± 0.034
2.05LysPro: 2.05 ± 0.049
1.265LysGln: 1.265 ± 0.041
2.453LysArg: 2.453 ± 0.051
1.696LysSer: 1.696 ± 0.041
1.838LysThr: 1.838 ± 0.042
2.783LysVal: 2.783 ± 0.051
0.359LysTrp: 0.359 ± 0.019
0.833LysTyr: 0.833 ± 0.025
0.0LysXaa: 0.0 ± 0.0
Leu
18.075LeuAla: 18.075 ± 0.192
1.646LeuCys: 1.646 ± 0.035
6.533LeuAsp: 6.533 ± 0.088
7.111LeuGlu: 7.111 ± 0.098
3.369LeuPhe: 3.369 ± 0.057
10.662LeuGly: 10.662 ± 0.128
1.883LeuHis: 1.883 ± 0.048
4.822LeuIle: 4.822 ± 0.076
3.565LeuLys: 3.565 ± 0.055
11.205LeuLeu: 11.205 ± 0.143
2.449LeuMet: 2.449 ± 0.049
2.448LeuAsn: 2.448 ± 0.042
5.681LeuPro: 5.681 ± 0.083
2.824LeuGln: 2.824 ± 0.056
8.125LeuArg: 8.125 ± 0.094
5.713LeuSer: 5.713 ± 0.071
5.399LeuThr: 5.399 ± 0.081
7.388LeuVal: 7.388 ± 0.104
1.485LeuTrp: 1.485 ± 0.045
2.066LeuTyr: 2.066 ± 0.045
0.0LeuXaa: 0.0 ± 0.0
Met
4.16MetAla: 4.16 ± 0.065
0.227MetCys: 0.227 ± 0.014
1.483MetAsp: 1.483 ± 0.034
1.52MetGlu: 1.52 ± 0.043
0.724MetPhe: 0.724 ± 0.031
2.432MetGly: 2.432 ± 0.052
0.439MetHis: 0.439 ± 0.02
1.44MetIle: 1.44 ± 0.036
1.061MetLys: 1.061 ± 0.034
2.659MetLeu: 2.659 ± 0.054
0.559MetMet: 0.559 ± 0.025
0.804MetAsn: 0.804 ± 0.027
1.349MetPro: 1.349 ± 0.035
0.76MetGln: 0.76 ± 0.027
1.597MetArg: 1.597 ± 0.039
1.553MetSer: 1.553 ± 0.039
1.446MetThr: 1.446 ± 0.035
1.928MetVal: 1.928 ± 0.045
0.183MetTrp: 0.183 ± 0.011
0.348MetTyr: 0.348 ± 0.017
0.0MetXaa: 0.0 ± 0.0
Asn
2.589AsnAla: 2.589 ± 0.059
0.353AsnCys: 0.353 ± 0.02
1.327AsnAsp: 1.327 ± 0.037
1.193AsnGlu: 1.193 ± 0.031
0.974AsnPhe: 0.974 ± 0.03
1.898AsnGly: 1.898 ± 0.053
0.583AsnHis: 0.583 ± 0.022
1.501AsnIle: 1.501 ± 0.038
0.971AsnLys: 0.971 ± 0.028
3.001AsnLeu: 3.001 ± 0.053
0.711AsnMet: 0.711 ± 0.026
0.784AsnAsn: 0.784 ± 0.029
1.729AsnPro: 1.729 ± 0.041
1.113AsnGln: 1.113 ± 0.038
1.487AsnArg: 1.487 ± 0.036
1.104AsnSer: 1.104 ± 0.042
1.057AsnThr: 1.057 ± 0.038
1.886AsnVal: 1.886 ± 0.049
0.351AsnTrp: 0.351 ± 0.02
0.765AsnTyr: 0.765 ± 0.03
0.0AsnXaa: 0.0 ± 0.0
Pro
6.868ProAla: 6.868 ± 0.106
0.547ProCys: 0.547 ± 0.023
3.108ProAsp: 3.108 ± 0.056
3.878ProGlu: 3.878 ± 0.059
1.612ProPhe: 1.612 ± 0.039
5.116ProGly: 5.116 ± 0.082
0.982ProHis: 0.982 ± 0.028
1.804ProIle: 1.804 ± 0.038
1.755ProLys: 1.755 ± 0.042
5.368ProLeu: 5.368 ± 0.08
1.233ProMet: 1.233 ± 0.032
1.171ProAsn: 1.171 ± 0.034
3.244ProPro: 3.244 ± 0.065
1.924ProGln: 1.924 ± 0.038
3.195ProArg: 3.195 ± 0.054
2.243ProSer: 2.243 ± 0.048
2.303ProThr: 2.303 ± 0.047
3.49ProVal: 3.49 ± 0.05
0.837ProTrp: 0.837 ± 0.031
1.108ProTyr: 1.108 ± 0.035
0.0ProXaa: 0.0 ± 0.0
Gln
6.94GlnAla: 6.94 ± 0.111
0.438GlnCys: 0.438 ± 0.024
1.842GlnAsp: 1.842 ± 0.044
2.195GlnGlu: 2.195 ± 0.045
1.013GlnPhe: 1.013 ± 0.03
3.438GlnGly: 3.438 ± 0.058
0.562GlnHis: 0.562 ± 0.026
1.978GlnIle: 1.978 ± 0.036
1.668GlnLys: 1.668 ± 0.041
3.371GlnLeu: 3.371 ± 0.052
1.141GlnMet: 1.141 ± 0.037
1.199GlnAsn: 1.199 ± 0.035
2.053GlnPro: 2.053 ± 0.047
1.389GlnGln: 1.389 ± 0.047
3.308GlnArg: 3.308 ± 0.07
1.965GlnSer: 1.965 ± 0.043
2.136GlnThr: 2.136 ± 0.038
2.927GlnVal: 2.927 ± 0.05
0.578GlnTrp: 0.578 ± 0.023
0.795GlnTyr: 0.795 ± 0.028
0.0GlnXaa: 0.0 ± 0.0
Arg
8.261ArgAla: 8.261 ± 0.106
0.718ArgCys: 0.718 ± 0.028
3.264ArgAsp: 3.264 ± 0.056
3.769ArgGlu: 3.769 ± 0.062
2.607ArgPhe: 2.607 ± 0.05
5.085ArgGly: 5.085 ± 0.076
1.605ArgHis: 1.605 ± 0.04
2.98ArgIle: 2.98 ± 0.058
2.236ArgLys: 2.236 ± 0.049
9.78ArgLeu: 9.78 ± 0.127
1.864ArgMet: 1.864 ± 0.038
1.42ArgAsn: 1.42 ± 0.039
4.516ArgPro: 4.516 ± 0.069
4.306ArgGln: 4.306 ± 0.074
6.398ArgArg: 6.398 ± 0.091
2.391ArgSer: 2.391 ± 0.046
2.274ArgThr: 2.274 ± 0.048
4.852ArgVal: 4.852 ± 0.064
1.042ArgTrp: 1.042 ± 0.033
1.574ArgTyr: 1.574 ± 0.038
0.0ArgXaa: 0.0 ± 0.0
Ser
5.193SerAla: 5.193 ± 0.079
0.568SerCys: 0.568 ± 0.022
2.368SerAsp: 2.368 ± 0.047
2.257SerGlu: 2.257 ± 0.055
1.821SerPhe: 1.821 ± 0.04
4.759SerGly: 4.759 ± 0.083
1.025SerHis: 1.025 ± 0.031
1.907SerIle: 1.907 ± 0.045
1.353SerLys: 1.353 ± 0.036
5.532SerLeu: 5.532 ± 0.082
1.186SerMet: 1.186 ± 0.036
1.026SerAsn: 1.026 ± 0.037
2.604SerPro: 2.604 ± 0.051
2.039SerGln: 2.039 ± 0.044
3.126SerArg: 3.126 ± 0.05
2.245SerSer: 2.245 ± 0.062
1.97SerThr: 1.97 ± 0.047
3.099SerVal: 3.099 ± 0.061
0.614SerTrp: 0.614 ± 0.024
1.191SerTyr: 1.191 ± 0.04
0.0SerXaa: 0.0 ± 0.0
Thr
5.044ThrAla: 5.044 ± 0.075
0.479ThrCys: 0.479 ± 0.021
1.977ThrAsp: 1.977 ± 0.041
1.913ThrGlu: 1.913 ± 0.038
1.362ThrPhe: 1.362 ± 0.039
3.883ThrGly: 3.883 ± 0.061
0.796ThrHis: 0.796 ± 0.027
2.341ThrIle: 2.341 ± 0.046
1.411ThrLys: 1.411 ± 0.039
4.357ThrLeu: 4.357 ± 0.069
1.214ThrMet: 1.214 ± 0.035
1.166ThrAsn: 1.166 ± 0.035
2.857ThrPro: 2.857 ± 0.057
1.243ThrGln: 1.243 ± 0.034
2.361ThrArg: 2.361 ± 0.051
2.107ThrSer: 2.107 ± 0.049
2.099ThrThr: 2.099 ± 0.051
3.192ThrVal: 3.192 ± 0.065
0.47ThrTrp: 0.47 ± 0.022
0.965ThrTyr: 0.965 ± 0.032
0.0ThrXaa: 0.0 ± 0.0
Val
9.102ValAla: 9.102 ± 0.09
1.057ValCys: 1.057 ± 0.029
4.458ValAsp: 4.458 ± 0.063
5.014ValGlu: 5.014 ± 0.065
2.734ValPhe: 2.734 ± 0.054
6.307ValGly: 6.307 ± 0.077
1.274ValHis: 1.274 ± 0.035
3.939ValIle: 3.939 ± 0.067
2.466ValLys: 2.466 ± 0.05
7.647ValLeu: 7.647 ± 0.095
1.865ValMet: 1.865 ± 0.047
2.044ValAsn: 2.044 ± 0.048
2.282ValPro: 2.282 ± 0.054
1.91ValGln: 1.91 ± 0.044
4.574ValArg: 4.574 ± 0.075
3.842ValSer: 3.842 ± 0.067
3.244ValThr: 3.244 ± 0.059
6.348ValVal: 6.348 ± 0.085
0.846ValTrp: 0.846 ± 0.033
1.655ValTyr: 1.655 ± 0.037
0.0ValXaa: 0.0 ± 0.0
Trp
1.538TrpAla: 1.538 ± 0.038
0.135TrpCys: 0.135 ± 0.011
0.666TrpAsp: 0.666 ± 0.028
0.588TrpGlu: 0.588 ± 0.023
0.448TrpPhe: 0.448 ± 0.022
0.987TrpGly: 0.987 ± 0.035
0.229TrpHis: 0.229 ± 0.015
0.451TrpIle: 0.451 ± 0.023
0.318TrpLys: 0.318 ± 0.018
1.929TrpLeu: 1.929 ± 0.049
0.262TrpMet: 0.262 ± 0.018
0.325TrpAsn: 0.325 ± 0.017
0.887TrpPro: 0.887 ± 0.035
0.524TrpGln: 0.524 ± 0.021
1.483TrpArg: 1.483 ± 0.046
0.695TrpSer: 0.695 ± 0.028
0.5TrpThr: 0.5 ± 0.022
0.745TrpVal: 0.745 ± 0.028
0.249TrpTrp: 0.249 ± 0.018
0.222TrpTyr: 0.222 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.204TyrAla: 2.204 ± 0.041
0.343TyrCys: 0.343 ± 0.017
1.441TyrAsp: 1.441 ± 0.034
1.194TyrGlu: 1.194 ± 0.039
1.05TyrPhe: 1.05 ± 0.032
2.017TyrGly: 2.017 ± 0.053
0.564TyrHis: 0.564 ± 0.021
0.902TyrIle: 0.902 ± 0.029
0.761TyrLys: 0.761 ± 0.029
2.679TyrLeu: 2.679 ± 0.052
0.49TyrMet: 0.49 ± 0.022
0.741TyrAsn: 0.741 ± 0.033
1.069TyrPro: 1.069 ± 0.034
1.043TyrGln: 1.043 ± 0.027
1.577TyrArg: 1.577 ± 0.041
1.073TyrSer: 1.073 ± 0.033
0.927TyrThr: 0.927 ± 0.037
1.576TyrVal: 1.576 ± 0.039
0.361TyrTrp: 0.361 ± 0.018
0.797TyrTyr: 0.797 ± 0.029
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3268 proteins (1096967 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski