Amino acid dipepetide frequency for Terriglobus saanensis (strain ATCC BAA-1853 / DSM 23119 / SP1PR4)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.517AlaAla: 12.517 ± 0.133
0.958AlaCys: 0.958 ± 0.03
4.984AlaAsp: 4.984 ± 0.058
6.1AlaGlu: 6.1 ± 0.083
3.871AlaPhe: 3.871 ± 0.052
8.407AlaGly: 8.407 ± 0.077
2.194AlaHis: 2.194 ± 0.041
5.627AlaIle: 5.627 ± 0.059
4.328AlaLys: 4.328 ± 0.075
10.788AlaLeu: 10.788 ± 0.109
2.913AlaMet: 2.913 ± 0.052
3.569AlaAsn: 3.569 ± 0.062
4.955AlaPro: 4.955 ± 0.067
4.28AlaGln: 4.28 ± 0.065
5.748AlaArg: 5.748 ± 0.076
6.883AlaSer: 6.883 ± 0.063
6.797AlaThr: 6.797 ± 0.102
7.641AlaVal: 7.641 ± 0.08
1.359AlaTrp: 1.359 ± 0.03
2.521AlaTyr: 2.521 ± 0.043
0.0AlaXaa: 0.0 ± 0.0
Cys
0.843CysAla: 0.843 ± 0.024
0.128CysCys: 0.128 ± 0.011
0.416CysAsp: 0.416 ± 0.014
0.367CysGlu: 0.367 ± 0.016
0.38CysPhe: 0.38 ± 0.016
0.893CysGly: 0.893 ± 0.028
0.209CysHis: 0.209 ± 0.015
0.442CysIle: 0.442 ± 0.018
0.245CysLys: 0.245 ± 0.013
0.746CysLeu: 0.746 ± 0.025
0.187CysMet: 0.187 ± 0.011
0.238CysAsn: 0.238 ± 0.013
0.375CysPro: 0.375 ± 0.016
0.205CysGln: 0.205 ± 0.012
0.417CysArg: 0.417 ± 0.017
0.578CysSer: 0.578 ± 0.02
0.512CysThr: 0.512 ± 0.019
0.625CysVal: 0.625 ± 0.022
0.102CysTrp: 0.102 ± 0.009
0.234CysTyr: 0.234 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
5.658AspAla: 5.658 ± 0.069
0.353AspCys: 0.353 ± 0.015
2.315AspAsp: 2.315 ± 0.049
2.888AspGlu: 2.888 ± 0.049
2.25AspPhe: 2.25 ± 0.038
4.26AspGly: 4.26 ± 0.067
1.156AspHis: 1.156 ± 0.026
2.252AspIle: 2.252 ± 0.038
1.745AspLys: 1.745 ± 0.032
5.312AspLeu: 5.312 ± 0.061
0.933AspMet: 0.933 ± 0.024
1.346AspAsn: 1.346 ± 0.035
3.173AspPro: 3.173 ± 0.047
1.697AspGln: 1.697 ± 0.032
3.163AspArg: 3.163 ± 0.049
2.673AspSer: 2.673 ± 0.045
2.769AspThr: 2.769 ± 0.042
3.706AspVal: 3.706 ± 0.054
0.797AspTrp: 0.797 ± 0.023
1.439AspTyr: 1.439 ± 0.03
0.0AspXaa: 0.0 ± 0.0
Glu
5.734GluAla: 5.734 ± 0.078
0.341GluCys: 0.341 ± 0.016
2.625GluAsp: 2.625 ± 0.048
3.465GluGlu: 3.465 ± 0.063
2.05GluPhe: 2.05 ± 0.041
3.689GluGly: 3.689 ± 0.051
1.346GluHis: 1.346 ± 0.035
3.224GluIle: 3.224 ± 0.053
2.577GluLys: 2.577 ± 0.051
5.151GluLeu: 5.151 ± 0.065
1.551GluMet: 1.551 ± 0.04
1.731GluAsn: 1.731 ± 0.033
2.112GluPro: 2.112 ± 0.04
2.315GluGln: 2.315 ± 0.04
3.822GluArg: 3.822 ± 0.074
2.936GluSer: 2.936 ± 0.046
3.162GluThr: 3.162 ± 0.046
3.722GluVal: 3.722 ± 0.049
0.691GluTrp: 0.691 ± 0.023
1.382GluTyr: 1.382 ± 0.031
0.0GluXaa: 0.0 ± 0.0
Phe
4.421PheAla: 4.421 ± 0.065
0.393PheCys: 0.393 ± 0.019
2.452PheAsp: 2.452 ± 0.043
1.945PheGlu: 1.945 ± 0.037
1.865PhePhe: 1.865 ± 0.043
3.715PheGly: 3.715 ± 0.06
1.095PheHis: 1.095 ± 0.027
1.578PheIle: 1.578 ± 0.037
1.1PheLys: 1.1 ± 0.028
3.954PheLeu: 3.954 ± 0.067
0.703PheMet: 0.703 ± 0.022
1.572PheAsn: 1.572 ± 0.043
1.851PhePro: 1.851 ± 0.035
1.37PheGln: 1.37 ± 0.03
2.365PheArg: 2.365 ± 0.042
2.98PheSer: 2.98 ± 0.061
2.635PheThr: 2.635 ± 0.042
2.926PheVal: 2.926 ± 0.051
0.575PheTrp: 0.575 ± 0.021
1.187PheTyr: 1.187 ± 0.03
0.0PheXaa: 0.0 ± 0.0
Gly
7.479GlyAla: 7.479 ± 0.083
0.744GlyCys: 0.744 ± 0.024
3.798GlyAsp: 3.798 ± 0.054
3.856GlyGlu: 3.856 ± 0.055
3.457GlyPhe: 3.457 ± 0.058
6.701GlyGly: 6.701 ± 0.1
1.824GlyHis: 1.824 ± 0.037
4.543GlyIle: 4.543 ± 0.067
3.626GlyLys: 3.626 ± 0.057
7.419GlyLeu: 7.419 ± 0.07
1.985GlyMet: 1.985 ± 0.042
2.839GlyAsn: 2.839 ± 0.057
3.043GlyPro: 3.043 ± 0.054
2.816GlyGln: 2.816 ± 0.05
4.412GlyArg: 4.412 ± 0.056
5.447GlySer: 5.447 ± 0.081
5.629GlyThr: 5.629 ± 0.081
6.048GlyVal: 6.048 ± 0.073
1.225GlyTrp: 1.225 ± 0.03
2.546GlyTyr: 2.546 ± 0.043
0.0GlyXaa: 0.0 ± 0.0
His
2.313HisAla: 2.313 ± 0.043
0.202HisCys: 0.202 ± 0.012
1.179HisAsp: 1.179 ± 0.028
1.154HisGlu: 1.154 ± 0.029
1.055HisPhe: 1.055 ± 0.028
2.034HisGly: 2.034 ± 0.044
0.659HisHis: 0.659 ± 0.023
1.125HisIle: 1.125 ± 0.03
0.646HisLys: 0.646 ± 0.022
2.489HisLeu: 2.489 ± 0.045
0.492HisMet: 0.492 ± 0.018
0.66HisAsn: 0.66 ± 0.019
1.416HisPro: 1.416 ± 0.033
0.722HisGln: 0.722 ± 0.021
1.412HisArg: 1.412 ± 0.029
1.274HisSer: 1.274 ± 0.032
1.329HisThr: 1.329 ± 0.03
1.562HisVal: 1.562 ± 0.036
0.354HisTrp: 0.354 ± 0.015
0.667HisTyr: 0.667 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
6.426IleAla: 6.426 ± 0.075
0.475IleCys: 0.475 ± 0.018
3.009IleAsp: 3.009 ± 0.045
3.021IleGlu: 3.021 ± 0.053
2.019IlePhe: 2.019 ± 0.039
4.226IleGly: 4.226 ± 0.058
1.168IleHis: 1.168 ± 0.026
1.713IleIle: 1.713 ± 0.038
1.491IleLys: 1.491 ± 0.032
4.678IleLeu: 4.678 ± 0.067
0.719IleMet: 0.719 ± 0.027
1.634IleAsn: 1.634 ± 0.039
2.911IlePro: 2.911 ± 0.046
1.66IleGln: 1.66 ± 0.03
2.904IleArg: 2.904 ± 0.049
3.477IleSer: 3.477 ± 0.046
3.286IleThr: 3.286 ± 0.053
3.91IleVal: 3.91 ± 0.053
0.561IleTrp: 0.561 ± 0.021
1.373IleTyr: 1.373 ± 0.031
0.0IleXaa: 0.0 ± 0.0
Lys
3.907LysAla: 3.907 ± 0.068
0.184LysCys: 0.184 ± 0.012
1.98LysAsp: 1.98 ± 0.041
2.024LysGlu: 2.024 ± 0.044
1.291LysPhe: 1.291 ± 0.031
2.514LysGly: 2.514 ± 0.041
0.837LysHis: 0.837 ± 0.026
2.132LysIle: 2.132 ± 0.043
1.998LysLys: 1.998 ± 0.049
3.854LysLeu: 3.854 ± 0.054
1.085LysMet: 1.085 ± 0.027
1.431LysAsn: 1.431 ± 0.026
2.223LysPro: 2.223 ± 0.036
1.586LysGln: 1.586 ± 0.034
2.348LysArg: 2.348 ± 0.036
2.352LysSer: 2.352 ± 0.042
2.543LysThr: 2.543 ± 0.038
2.752LysVal: 2.752 ± 0.052
0.472LysTrp: 0.472 ± 0.02
1.061LysTyr: 1.061 ± 0.03
0.0LysXaa: 0.0 ± 0.0
Leu
10.896LeuAla: 10.896 ± 0.112
0.947LeuCys: 0.947 ± 0.027
5.02LeuAsp: 5.02 ± 0.051
5.079LeuGlu: 5.079 ± 0.068
3.831LeuPhe: 3.831 ± 0.055
7.417LeuGly: 7.417 ± 0.082
2.398LeuHis: 2.398 ± 0.04
4.606LeuIle: 4.606 ± 0.055
3.863LeuLys: 3.863 ± 0.054
10.581LeuLeu: 10.581 ± 0.123
2.01LeuMet: 2.01 ± 0.044
3.391LeuAsn: 3.391 ± 0.053
5.646LeuPro: 5.646 ± 0.066
3.782LeuGln: 3.782 ± 0.047
6.919LeuArg: 6.919 ± 0.086
6.874LeuSer: 6.874 ± 0.069
6.304LeuThr: 6.304 ± 0.07
6.58LeuVal: 6.58 ± 0.067
1.283LeuTrp: 1.283 ± 0.036
2.495LeuTyr: 2.495 ± 0.041
0.0LeuXaa: 0.0 ± 0.0
Met
2.424MetAla: 2.424 ± 0.043
0.151MetCys: 0.151 ± 0.01
1.104MetAsp: 1.104 ± 0.028
1.26MetGlu: 1.26 ± 0.031
0.665MetPhe: 0.665 ± 0.023
1.609MetGly: 1.609 ± 0.033
0.591MetHis: 0.591 ± 0.022
1.089MetIle: 1.089 ± 0.027
1.156MetLys: 1.156 ± 0.029
2.301MetLeu: 2.301 ± 0.039
0.575MetMet: 0.575 ± 0.018
0.848MetAsn: 0.848 ± 0.023
1.351MetPro: 1.351 ± 0.034
1.078MetGln: 1.078 ± 0.025
1.692MetArg: 1.692 ± 0.035
1.474MetSer: 1.474 ± 0.031
1.472MetThr: 1.472 ± 0.038
1.492MetVal: 1.492 ± 0.035
0.228MetTrp: 0.228 ± 0.012
0.45MetTyr: 0.45 ± 0.016
0.0MetXaa: 0.0 ± 0.0
Asn
3.617AsnAla: 3.617 ± 0.057
0.284AsnCys: 0.284 ± 0.014
1.545AsnAsp: 1.545 ± 0.033
1.543AsnGlu: 1.543 ± 0.035
1.584AsnPhe: 1.584 ± 0.038
3.2AsnGly: 3.2 ± 0.07
0.724AsnHis: 0.724 ± 0.023
1.734AsnIle: 1.734 ± 0.037
1.039AsnLys: 1.039 ± 0.031
3.553AsnLeu: 3.553 ± 0.052
0.618AsnMet: 0.618 ± 0.02
1.293AsnAsn: 1.293 ± 0.042
2.328AsnPro: 2.328 ± 0.048
1.297AsnGln: 1.297 ± 0.036
1.899AsnArg: 1.899 ± 0.034
2.204AsnSer: 2.204 ± 0.046
2.092AsnThr: 2.092 ± 0.047
2.517AsnVal: 2.517 ± 0.052
0.474AsnTrp: 0.474 ± 0.02
1.249AsnTyr: 1.249 ± 0.037
0.0AsnXaa: 0.0 ± 0.0
Pro
5.542ProAla: 5.542 ± 0.06
0.328ProCys: 0.328 ± 0.017
2.845ProAsp: 2.845 ± 0.05
3.377ProGlu: 3.377 ± 0.053
2.04ProPhe: 2.04 ± 0.044
4.268ProGly: 4.268 ± 0.062
1.122ProHis: 1.122 ± 0.032
2.49ProIle: 2.49 ± 0.043
2.048ProLys: 2.048 ± 0.038
4.814ProLeu: 4.814 ± 0.065
1.215ProMet: 1.215 ± 0.028
1.992ProAsn: 1.992 ± 0.046
2.429ProPro: 2.429 ± 0.056
2.046ProGln: 2.046 ± 0.04
2.507ProArg: 2.507 ± 0.044
3.485ProSer: 3.485 ± 0.046
3.301ProThr: 3.301 ± 0.054
3.928ProVal: 3.928 ± 0.049
0.645ProTrp: 0.645 ± 0.023
1.351ProTyr: 1.351 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
3.815GlnAla: 3.815 ± 0.063
0.217GlnCys: 0.217 ± 0.011
1.619GlnAsp: 1.619 ± 0.033
1.742GlnGlu: 1.742 ± 0.04
1.495GlnPhe: 1.495 ± 0.037
2.515GlnGly: 2.515 ± 0.052
0.889GlnHis: 0.889 ± 0.024
2.346GlnIle: 2.346 ± 0.038
1.676GlnLys: 1.676 ± 0.036
3.387GlnLeu: 3.387 ± 0.052
1.074GlnMet: 1.074 ± 0.03
1.433GlnAsn: 1.433 ± 0.034
1.994GlnPro: 1.994 ± 0.038
2.134GlnGln: 2.134 ± 0.06
2.548GlnArg: 2.548 ± 0.042
2.344GlnSer: 2.344 ± 0.045
2.506GlnThr: 2.506 ± 0.049
2.662GlnVal: 2.662 ± 0.043
0.512GlnTrp: 0.512 ± 0.02
1.031GlnTyr: 1.031 ± 0.029
0.0GlnXaa: 0.0 ± 0.0
Arg
5.644ArgAla: 5.644 ± 0.071
0.426ArgCys: 0.426 ± 0.017
2.979ArgAsp: 2.979 ± 0.045
3.696ArgGlu: 3.696 ± 0.066
2.667ArgPhe: 2.667 ± 0.041
3.927ArgGly: 3.927 ± 0.053
1.277ArgHis: 1.277 ± 0.027
3.542ArgIle: 3.542 ± 0.061
2.501ArgLys: 2.501 ± 0.047
5.859ArgLeu: 5.859 ± 0.071
1.763ArgMet: 1.763 ± 0.03
2.169ArgAsn: 2.169 ± 0.045
2.675ArgPro: 2.675 ± 0.051
2.234ArgGln: 2.234 ± 0.04
3.998ArgArg: 3.998 ± 0.067
3.861ArgSer: 3.861 ± 0.05
3.401ArgThr: 3.401 ± 0.051
4.35ArgVal: 4.35 ± 0.062
0.941ArgTrp: 0.941 ± 0.027
1.902ArgTyr: 1.902 ± 0.043
0.0ArgXaa: 0.0 ± 0.0
Ser
6.942SerAla: 6.942 ± 0.07
0.498SerCys: 0.498 ± 0.021
3.028SerAsp: 3.028 ± 0.052
3.126SerGlu: 3.126 ± 0.046
2.825SerPhe: 2.825 ± 0.046
5.836SerGly: 5.836 ± 0.076
1.352SerHis: 1.352 ± 0.033
3.468SerIle: 3.468 ± 0.052
2.256SerLys: 2.256 ± 0.041
6.568SerLeu: 6.568 ± 0.073
1.463SerMet: 1.463 ± 0.033
2.314SerAsn: 2.314 ± 0.052
3.486SerPro: 3.486 ± 0.048
2.25SerGln: 2.25 ± 0.046
3.508SerArg: 3.508 ± 0.048
4.813SerSer: 4.813 ± 0.081
4.341SerThr: 4.341 ± 0.063
4.708SerVal: 4.708 ± 0.058
0.791SerTrp: 0.791 ± 0.026
1.913SerTyr: 1.913 ± 0.038
0.0SerXaa: 0.0 ± 0.0
Thr
6.712ThrAla: 6.712 ± 0.087
0.523ThrCys: 0.523 ± 0.019
2.979ThrAsp: 2.979 ± 0.052
3.051ThrGlu: 3.051 ± 0.042
2.663ThrPhe: 2.663 ± 0.057
5.694ThrGly: 5.694 ± 0.087
1.364ThrHis: 1.364 ± 0.034
3.377ThrIle: 3.377 ± 0.06
2.105ThrLys: 2.105 ± 0.043
6.715ThrLeu: 6.715 ± 0.074
1.217ThrMet: 1.217 ± 0.028
2.194ThrAsn: 2.194 ± 0.054
4.06ThrPro: 4.06 ± 0.06
2.227ThrGln: 2.227 ± 0.047
3.024ThrArg: 3.024 ± 0.042
4.231ThrSer: 4.231 ± 0.065
4.205ThrThr: 4.205 ± 0.084
4.939ThrVal: 4.939 ± 0.072
0.792ThrTrp: 0.792 ± 0.025
1.757ThrTyr: 1.757 ± 0.04
0.0ThrXaa: 0.0 ± 0.0
Val
7.76ValAla: 7.76 ± 0.089
0.663ValCys: 0.663 ± 0.021
3.818ValAsp: 3.818 ± 0.055
3.929ValGlu: 3.929 ± 0.061
2.845ValPhe: 2.845 ± 0.051
5.27ValGly: 5.27 ± 0.061
1.612ValHis: 1.612 ± 0.034
3.482ValIle: 3.482 ± 0.062
2.589ValLys: 2.589 ± 0.042
7.539ValLeu: 7.539 ± 0.082
1.582ValMet: 1.582 ± 0.029
2.492ValAsn: 2.492 ± 0.041
3.853ValPro: 3.853 ± 0.056
2.596ValGln: 2.596 ± 0.042
4.389ValArg: 4.389 ± 0.061
4.855ValSer: 4.855 ± 0.065
4.88ValThr: 4.88 ± 0.07
5.631ValVal: 5.631 ± 0.074
0.901ValTrp: 0.901 ± 0.028
1.763ValTyr: 1.763 ± 0.033
0.0ValXaa: 0.0 ± 0.0
Trp
1.028TrpAla: 1.028 ± 0.027
0.121TrpCys: 0.121 ± 0.008
0.607TrpAsp: 0.607 ± 0.02
0.608TrpGlu: 0.608 ± 0.021
0.544TrpPhe: 0.544 ± 0.02
0.881TrpGly: 0.881 ± 0.026
0.384TrpHis: 0.384 ± 0.018
0.769TrpIle: 0.769 ± 0.024
0.689TrpLys: 0.689 ± 0.023
1.393TrpLeu: 1.393 ± 0.032
0.463TrpMet: 0.463 ± 0.019
0.613TrpAsn: 0.613 ± 0.022
0.546TrpPro: 0.546 ± 0.023
0.646TrpGln: 0.646 ± 0.023
0.838TrpArg: 0.838 ± 0.022
0.941TrpSer: 0.941 ± 0.026
0.849TrpThr: 0.849 ± 0.027
0.821TrpVal: 0.821 ± 0.023
0.247TrpTrp: 0.247 ± 0.014
0.353TrpTyr: 0.353 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.762TyrAla: 2.762 ± 0.044
0.207TyrCys: 0.207 ± 0.013
1.582TyrAsp: 1.582 ± 0.039
1.395TyrGlu: 1.395 ± 0.032
1.286TyrPhe: 1.286 ± 0.033
2.383TyrGly: 2.383 ± 0.043
0.545TyrHis: 0.545 ± 0.019
1.123TyrIle: 1.123 ± 0.031
0.896TyrLys: 0.896 ± 0.029
2.821TyrLeu: 2.821 ± 0.045
0.463TyrMet: 0.463 ± 0.019
1.033TyrAsn: 1.033 ± 0.035
1.342TyrPro: 1.342 ± 0.03
0.999TyrGln: 0.999 ± 0.029
1.899TyrArg: 1.899 ± 0.037
1.804TyrSer: 1.804 ± 0.038
1.818TyrThr: 1.818 ± 0.041
1.926TyrVal: 1.926 ± 0.045
0.392TyrTrp: 0.392 ± 0.017
0.769TyrTyr: 0.769 ± 0.027
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4177 proteins (1519388 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski