Amino acid dipepetide frequency for Sphaerobacter thermophilus (strain DSM 20745 / S 6022)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.308AlaAla: 17.308 ± 0.192
0.928AlaCys: 0.928 ± 0.033
6.27AlaAsp: 6.27 ± 0.072
8.053AlaGlu: 8.053 ± 0.102
3.558AlaPhe: 3.558 ± 0.062
10.657AlaGly: 10.657 ± 0.129
2.083AlaHis: 2.083 ± 0.049
6.213AlaIle: 6.213 ± 0.098
1.93AlaLys: 1.93 ± 0.044
13.696AlaLeu: 13.696 ± 0.143
2.745AlaMet: 2.745 ± 0.053
2.191AlaAsn: 2.191 ± 0.048
5.626AlaPro: 5.626 ± 0.083
3.53AlaGln: 3.53 ± 0.067
10.178AlaArg: 10.178 ± 0.109
5.148AlaSer: 5.148 ± 0.073
6.647AlaThr: 6.647 ± 0.088
10.276AlaVal: 10.276 ± 0.109
1.645AlaTrp: 1.645 ± 0.045
2.675AlaTyr: 2.675 ± 0.049
0.0AlaXaa: 0.0 ± 0.0
Cys
0.706CysAla: 0.706 ± 0.022
0.094CysCys: 0.094 ± 0.009
0.447CysAsp: 0.447 ± 0.019
0.331CysGlu: 0.331 ± 0.016
0.217CysPhe: 0.217 ± 0.014
0.845CysGly: 0.845 ± 0.031
0.188CysHis: 0.188 ± 0.014
0.247CysIle: 0.247 ± 0.017
0.087CysLys: 0.087 ± 0.009
0.639CysLeu: 0.639 ± 0.025
0.094CysMet: 0.094 ± 0.009
0.145CysAsn: 0.145 ± 0.013
0.494CysPro: 0.494 ± 0.023
0.229CysGln: 0.229 ± 0.013
0.567CysArg: 0.567 ± 0.023
0.301CysSer: 0.301 ± 0.017
0.36CysThr: 0.36 ± 0.021
0.49CysVal: 0.49 ± 0.023
0.098CysTrp: 0.098 ± 0.008
0.21CysTyr: 0.21 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
6.489AspAla: 6.489 ± 0.092
0.334AspCys: 0.334 ± 0.017
3.24AspAsp: 3.24 ± 0.068
4.02AspGlu: 4.02 ± 0.059
1.668AspPhe: 1.668 ± 0.039
5.245AspGly: 5.245 ± 0.074
1.105AspHis: 1.105 ± 0.032
2.5AspIle: 2.5 ± 0.05
0.799AspLys: 0.799 ± 0.029
6.624AspLeu: 6.624 ± 0.085
0.92AspMet: 0.92 ± 0.029
0.963AspAsn: 0.963 ± 0.032
4.576AspPro: 4.576 ± 0.065
1.751AspGln: 1.751 ± 0.043
4.622AspArg: 4.622 ± 0.061
1.812AspSer: 1.812 ± 0.038
2.599AspThr: 2.599 ± 0.053
4.607AspVal: 4.607 ± 0.062
0.828AspTrp: 0.828 ± 0.027
1.47AspTyr: 1.47 ± 0.04
0.0AspXaa: 0.0 ± 0.0
Glu
8.044GluAla: 8.044 ± 0.1
0.288GluCys: 0.288 ± 0.014
2.767GluAsp: 2.767 ± 0.06
4.626GluGlu: 4.626 ± 0.078
1.755GluPhe: 1.755 ± 0.04
4.362GluGly: 4.362 ± 0.07
1.577GluHis: 1.577 ± 0.044
3.649GluIle: 3.649 ± 0.066
1.137GluLys: 1.137 ± 0.04
5.784GluLeu: 5.784 ± 0.09
1.523GluMet: 1.523 ± 0.041
1.153GluAsn: 1.153 ± 0.036
3.908GluPro: 3.908 ± 0.066
2.852GluGln: 2.852 ± 0.059
7.249GluArg: 7.249 ± 0.099
2.54GluSer: 2.54 ± 0.048
3.274GluThr: 3.274 ± 0.052
5.26GluVal: 5.26 ± 0.072
0.861GluTrp: 0.861 ± 0.029
1.36GluTyr: 1.36 ± 0.037
0.0GluXaa: 0.0 ± 0.0
Phe
3.556PheAla: 3.556 ± 0.055
0.239PheCys: 0.239 ± 0.013
2.12PheAsp: 2.12 ± 0.046
1.769PheGlu: 1.769 ± 0.04
1.12PhePhe: 1.12 ± 0.035
3.18PheGly: 3.18 ± 0.053
0.677PheHis: 0.677 ± 0.026
1.429PheIle: 1.429 ± 0.036
0.523PheLys: 0.523 ± 0.025
3.125PheLeu: 3.125 ± 0.062
0.502PheMet: 0.502 ± 0.021
0.789PheAsn: 0.789 ± 0.025
1.712PhePro: 1.712 ± 0.038
0.942PheGln: 0.942 ± 0.031
2.214PheArg: 2.214 ± 0.045
1.516PheSer: 1.516 ± 0.034
1.905PheThr: 1.905 ± 0.048
2.593PheVal: 2.593 ± 0.059
0.564PheTrp: 0.564 ± 0.021
0.875PheTyr: 0.875 ± 0.027
0.0PheXaa: 0.0 ± 0.0
Gly
9.019GlyAla: 9.019 ± 0.108
0.683GlyCys: 0.683 ± 0.025
4.597GlyAsp: 4.597 ± 0.059
5.142GlyGlu: 5.142 ± 0.071
2.986GlyPhe: 2.986 ± 0.06
7.154GlyGly: 7.154 ± 0.109
1.91GlyHis: 1.91 ± 0.046
4.712GlyIle: 4.712 ± 0.077
1.95GlyLys: 1.95 ± 0.044
8.613GlyLeu: 8.613 ± 0.098
2.107GlyMet: 2.107 ± 0.048
1.842GlyAsn: 1.842 ± 0.038
4.254GlyPro: 4.254 ± 0.06
2.868GlyGln: 2.868 ± 0.05
6.835GlyArg: 6.835 ± 0.083
4.376GlySer: 4.376 ± 0.065
5.347GlyThr: 5.347 ± 0.081
7.268GlyVal: 7.268 ± 0.087
1.505GlyTrp: 1.505 ± 0.042
2.532GlyTyr: 2.532 ± 0.048
0.0GlyXaa: 0.0 ± 0.0
His
2.172HisAla: 2.172 ± 0.046
0.175HisCys: 0.175 ± 0.013
1.226HisAsp: 1.226 ± 0.03
1.127HisGlu: 1.127 ± 0.031
0.69HisPhe: 0.69 ± 0.024
1.945HisGly: 1.945 ± 0.038
0.564HisHis: 0.564 ± 0.026
0.858HisIle: 0.858 ± 0.029
0.271HisLys: 0.271 ± 0.017
2.363HisLeu: 2.363 ± 0.048
0.33HisMet: 0.33 ± 0.018
0.445HisAsn: 0.445 ± 0.02
1.64HisPro: 1.64 ± 0.04
0.635HisGln: 0.635 ± 0.022
1.725HisArg: 1.725 ± 0.04
0.785HisSer: 0.785 ± 0.026
1.018HisThr: 1.018 ± 0.031
1.603HisVal: 1.603 ± 0.039
0.314HisTrp: 0.314 ± 0.016
0.608HisTyr: 0.608 ± 0.024
0.0HisXaa: 0.0 ± 0.0
Ile
6.558IleAla: 6.558 ± 0.084
0.376IleCys: 0.376 ± 0.018
3.459IleAsp: 3.459 ± 0.056
3.432IleGlu: 3.432 ± 0.062
1.5IlePhe: 1.5 ± 0.043
4.56IleGly: 4.56 ± 0.076
0.996IleHis: 0.996 ± 0.033
2.184IleIle: 2.184 ± 0.046
0.899IleLys: 0.899 ± 0.032
4.509IleLeu: 4.509 ± 0.065
0.724IleMet: 0.724 ± 0.025
1.186IleAsn: 1.186 ± 0.039
2.983IlePro: 2.983 ± 0.053
1.448IleGln: 1.448 ± 0.038
3.445IleArg: 3.445 ± 0.06
2.221IleSer: 2.221 ± 0.052
2.977IleThr: 2.977 ± 0.058
4.453IleVal: 4.453 ± 0.065
0.608IleTrp: 0.608 ± 0.022
1.208IleTyr: 1.208 ± 0.034
0.0IleXaa: 0.0 ± 0.0
Lys
1.808LysAla: 1.808 ± 0.048
0.077LysCys: 0.077 ± 0.008
0.844LysAsp: 0.844 ± 0.03
1.088LysGlu: 1.088 ± 0.035
0.431LysPhe: 0.431 ± 0.02
1.283LysGly: 1.283 ± 0.04
0.368LysHis: 0.368 ± 0.019
0.932LysIle: 0.932 ± 0.029
0.403LysLys: 0.403 ± 0.023
1.62LysLeu: 1.62 ± 0.046
0.34LysMet: 0.34 ± 0.018
0.376LysAsn: 0.376 ± 0.02
1.168LysPro: 1.168 ± 0.035
0.59LysGln: 0.59 ± 0.027
1.485LysArg: 1.485 ± 0.038
0.773LysSer: 0.773 ± 0.028
1.069LysThr: 1.069 ± 0.034
1.416LysVal: 1.416 ± 0.041
0.203LysTrp: 0.203 ± 0.014
0.427LysTyr: 0.427 ± 0.021
0.0LysXaa: 0.0 ± 0.0
Leu
14.587LeuAla: 14.587 ± 0.163
0.657LeuCys: 0.657 ± 0.027
6.139LeuAsp: 6.139 ± 0.092
5.841LeuGlu: 5.841 ± 0.091
3.018LeuPhe: 3.018 ± 0.062
9.14LeuGly: 9.14 ± 0.097
1.987LeuHis: 1.987 ± 0.038
5.076LeuIle: 5.076 ± 0.072
1.728LeuLys: 1.728 ± 0.043
11.541LeuLeu: 11.541 ± 0.143
1.927LeuMet: 1.927 ± 0.04
2.042LeuAsn: 2.042 ± 0.046
6.147LeuPro: 6.147 ± 0.073
2.832LeuGln: 2.832 ± 0.054
8.965LeuArg: 8.965 ± 0.094
5.189LeuSer: 5.189 ± 0.075
6.135LeuThr: 6.135 ± 0.081
8.86LeuVal: 8.86 ± 0.101
1.324LeuTrp: 1.324 ± 0.036
2.326LeuTyr: 2.326 ± 0.051
0.0LeuXaa: 0.0 ± 0.0
Met
2.545MetAla: 2.545 ± 0.047
0.104MetCys: 0.104 ± 0.01
0.968MetAsp: 0.968 ± 0.027
1.054MetGlu: 1.054 ± 0.032
0.483MetPhe: 0.483 ± 0.019
1.452MetGly: 1.452 ± 0.041
0.387MetHis: 0.387 ± 0.017
1.075MetIle: 1.075 ± 0.032
0.393MetLys: 0.393 ± 0.018
2.14MetLeu: 2.14 ± 0.045
0.451MetMet: 0.451 ± 0.02
0.44MetAsn: 0.44 ± 0.019
1.252MetPro: 1.252 ± 0.028
0.658MetGln: 0.658 ± 0.024
1.72MetArg: 1.72 ± 0.044
1.169MetSer: 1.169 ± 0.033
1.452MetThr: 1.452 ± 0.034
1.62MetVal: 1.62 ± 0.037
0.166MetTrp: 0.166 ± 0.011
0.367MetTyr: 0.367 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
2.306AsnAla: 2.306 ± 0.047
0.166AsnCys: 0.166 ± 0.011
0.998AsnAsp: 0.998 ± 0.032
1.073AsnGlu: 1.073 ± 0.031
0.633AsnPhe: 0.633 ± 0.027
1.935AsnGly: 1.935 ± 0.044
0.397AsnHis: 0.397 ± 0.018
1.01AsnIle: 1.01 ± 0.03
0.335AsnLys: 0.335 ± 0.018
2.241AsnLeu: 2.241 ± 0.046
0.357AsnMet: 0.357 ± 0.017
0.468AsnAsn: 0.468 ± 0.027
1.835AsnPro: 1.835 ± 0.048
0.66AsnGln: 0.66 ± 0.029
1.581AsnArg: 1.581 ± 0.037
0.831AsnSer: 0.831 ± 0.03
1.109AsnThr: 1.109 ± 0.031
1.728AsnVal: 1.728 ± 0.042
0.326AsnTrp: 0.326 ± 0.017
0.579AsnTyr: 0.579 ± 0.023
0.0AsnXaa: 0.0 ± 0.0
Pro
6.994ProAla: 6.994 ± 0.095
0.299ProCys: 0.299 ± 0.016
4.55ProAsp: 4.55 ± 0.074
5.095ProGlu: 5.095 ± 0.071
1.853ProPhe: 1.853 ± 0.037
5.703ProGly: 5.703 ± 0.084
1.171ProHis: 1.171 ± 0.035
2.638ProIle: 2.638 ± 0.048
0.937ProLys: 0.937 ± 0.032
5.474ProLeu: 5.474 ± 0.079
1.063ProMet: 1.063 ± 0.028
1.449ProAsn: 1.449 ± 0.036
3.625ProPro: 3.625 ± 0.066
1.464ProGln: 1.464 ± 0.035
3.978ProArg: 3.978 ± 0.061
3.046ProSer: 3.046 ± 0.053
3.527ProThr: 3.527 ± 0.075
5.186ProVal: 5.186 ± 0.077
0.882ProTrp: 0.882 ± 0.033
1.429ProTyr: 1.429 ± 0.035
0.0ProXaa: 0.0 ± 0.0
Gln
4.063GlnAla: 4.063 ± 0.066
0.164GlnCys: 0.164 ± 0.012
1.364GlnAsp: 1.364 ± 0.035
2.08GlnGlu: 2.08 ± 0.047
0.923GlnPhe: 0.923 ± 0.03
2.27GlnGly: 2.27 ± 0.047
0.715GlnHis: 0.715 ± 0.024
1.789GlnIle: 1.789 ± 0.038
0.458GlnLys: 0.458 ± 0.021
2.746GlnLeu: 2.746 ± 0.047
0.74GlnMet: 0.74 ± 0.025
0.579GlnAsn: 0.579 ± 0.023
2.231GlnPro: 2.231 ± 0.046
1.389GlnGln: 1.389 ± 0.04
3.156GlnArg: 3.156 ± 0.05
1.324GlnSer: 1.324 ± 0.034
1.629GlnThr: 1.629 ± 0.042
2.923GlnVal: 2.923 ± 0.056
0.461GlnTrp: 0.461 ± 0.018
0.714GlnTyr: 0.714 ± 0.024
0.0GlnXaa: 0.0 ± 0.0
Arg
9.431ArgAla: 9.431 ± 0.108
0.579ArgCys: 0.579 ± 0.024
4.592ArgAsp: 4.592 ± 0.071
5.87ArgGlu: 5.87 ± 0.092
2.975ArgPhe: 2.975 ± 0.052
5.977ArgGly: 5.977 ± 0.073
1.856ArgHis: 1.856 ± 0.043
4.115ArgIle: 4.115 ± 0.053
1.316ArgLys: 1.316 ± 0.036
9.503ArgLeu: 9.503 ± 0.11
1.757ArgMet: 1.757 ± 0.042
1.601ArgAsn: 1.601 ± 0.039
4.525ArgPro: 4.525 ± 0.067
3.029ArgGln: 3.029 ± 0.056
8.147ArgArg: 8.147 ± 0.105
3.614ArgSer: 3.614 ± 0.071
3.969ArgThr: 3.969 ± 0.059
6.778ArgVal: 6.778 ± 0.087
1.476ArgTrp: 1.476 ± 0.041
2.441ArgTyr: 2.441 ± 0.054
0.0ArgXaa: 0.0 ± 0.0
Ser
4.825SerAla: 4.825 ± 0.059
0.275SerCys: 0.275 ± 0.014
2.529SerAsp: 2.529 ± 0.048
2.502SerGlu: 2.502 ± 0.05
1.579SerPhe: 1.579 ± 0.043
4.709SerGly: 4.709 ± 0.068
0.889SerHis: 0.889 ± 0.026
2.156SerIle: 2.156 ± 0.049
0.726SerLys: 0.726 ± 0.029
4.863SerLeu: 4.863 ± 0.076
0.988SerMet: 0.988 ± 0.028
0.994SerAsn: 0.994 ± 0.035
3.033SerPro: 3.033 ± 0.058
1.392SerGln: 1.392 ± 0.037
3.542SerArg: 3.542 ± 0.067
2.195SerSer: 2.195 ± 0.05
2.545SerThr: 2.545 ± 0.051
3.528SerVal: 3.528 ± 0.059
0.719SerTrp: 0.719 ± 0.024
1.125SerTyr: 1.125 ± 0.035
0.0SerXaa: 0.0 ± 0.0
Thr
6.33ThrAla: 6.33 ± 0.088
0.404ThrCys: 0.404 ± 0.023
2.755ThrAsp: 2.755 ± 0.057
2.917ThrGlu: 2.917 ± 0.055
1.937ThrPhe: 1.937 ± 0.046
5.384ThrGly: 5.384 ± 0.081
1.031ThrHis: 1.031 ± 0.033
3.129ThrIle: 3.129 ± 0.053
0.815ThrLys: 0.815 ± 0.029
6.286ThrLeu: 6.286 ± 0.082
1.038ThrMet: 1.038 ± 0.03
1.157ThrAsn: 1.157 ± 0.037
4.183ThrPro: 4.183 ± 0.079
1.455ThrGln: 1.455 ± 0.038
3.846ThrArg: 3.846 ± 0.065
2.582ThrSer: 2.582 ± 0.043
3.267ThrThr: 3.267 ± 0.067
5.437ThrVal: 5.437 ± 0.094
0.85ThrTrp: 0.85 ± 0.031
1.387ThrTyr: 1.387 ± 0.038
0.0ThrXaa: 0.0 ± 0.0
Val
10.706ValAla: 10.706 ± 0.108
0.589ValCys: 0.589 ± 0.021
4.981ValAsp: 4.981 ± 0.071
5.536ValGlu: 5.536 ± 0.063
2.595ValPhe: 2.595 ± 0.056
6.633ValGly: 6.633 ± 0.092
1.559ValHis: 1.559 ± 0.041
4.363ValIle: 4.363 ± 0.067
1.406ValLys: 1.406 ± 0.043
9.063ValLeu: 9.063 ± 0.107
1.606ValMet: 1.606 ± 0.041
1.817ValAsn: 1.817 ± 0.039
5.002ValPro: 5.002 ± 0.072
2.476ValGln: 2.476 ± 0.048
6.67ValArg: 6.67 ± 0.082
3.863ValSer: 3.863 ± 0.051
5.272ValThr: 5.272 ± 0.083
8.315ValVal: 8.315 ± 0.102
1.065ValTrp: 1.065 ± 0.033
1.934ValTyr: 1.934 ± 0.041
0.0ValXaa: 0.0 ± 0.0
Trp
1.298TrpAla: 1.298 ± 0.037
0.128TrpCys: 0.128 ± 0.01
0.801TrpAsp: 0.801 ± 0.029
0.946TrpGlu: 0.946 ± 0.029
0.505TrpPhe: 0.505 ± 0.02
1.025TrpGly: 1.025 ± 0.03
0.388TrpHis: 0.388 ± 0.021
0.689TrpIle: 0.689 ± 0.027
0.253TrpLys: 0.253 ± 0.017
1.873TrpLeu: 1.873 ± 0.044
0.319TrpMet: 0.319 ± 0.017
0.359TrpAsn: 0.359 ± 0.018
0.705TrpPro: 0.705 ± 0.025
0.576TrpGln: 0.576 ± 0.025
1.284TrpArg: 1.284 ± 0.037
0.798TrpSer: 0.798 ± 0.029
0.8TrpThr: 0.8 ± 0.024
1.19TrpVal: 1.19 ± 0.035
0.323TrpTrp: 0.323 ± 0.017
0.399TrpTyr: 0.399 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.613TyrAla: 2.613 ± 0.048
0.232TyrCys: 0.232 ± 0.014
1.524TyrAsp: 1.524 ± 0.039
1.47TyrGlu: 1.47 ± 0.039
0.923TyrPhe: 0.923 ± 0.033
2.24TyrGly: 2.24 ± 0.044
0.645TyrHis: 0.645 ± 0.027
0.961TyrIle: 0.961 ± 0.029
0.338TyrLys: 0.338 ± 0.02
2.757TyrLeu: 2.757 ± 0.053
0.366TyrMet: 0.366 ± 0.019
0.569TyrAsn: 0.569 ± 0.026
1.464TyrPro: 1.464 ± 0.036
0.944TyrGln: 0.944 ± 0.03
2.342TyrArg: 2.342 ± 0.048
1.028TyrSer: 1.028 ± 0.033
1.29TyrThr: 1.29 ± 0.037
1.92TyrVal: 1.92 ± 0.049
0.44TyrTrp: 0.44 ± 0.02
0.697TyrTyr: 0.697 ± 0.025
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3471 proteins (1135642 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski