Amino acid dipeptide frequency for Telmatospirillum siberiense

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
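The counting described above can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter alphabet, the use of overlapping pairs within each protein, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter

# 20 standard residues in one-letter code; anything else is treated as Xaa ("X").
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Frequency of each dipeptide if all 400 were equally likely, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Pairs are counted within each protein only (assumption); no pair spans
    the end of one protein and the start of the next.
    """
    counts = Counter()
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, invented for illustration only.
toy_proteome = ["MAALAG", "AAGL"]
freqs = dipeptide_per_mille(toy_proteome)
for pair, value in sorted(freqs.items()):
    label = "over" if value > UNIFORM_PER_MILLE else "under"
    print(f"{pair}: {value:.1f} per mille ({label}-represented vs. uniform)")
```

On a real proteome the input would be the protein sequences from a FASTA file; the comparison against 2.5‰ mirrors the red/blue marking described in the text.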

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
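As a small worked example of the unit conversion, the sketch below turns per mille values into fractions and compares them with the uniform expectation of 1/400. The helper names are hypothetical, and treating 2.5‰ as the over/under-representation threshold is an assumption based on the description above. The two input values are taken from the table (AlaAla and CysCys).

```python
# Uniform expectation for one of 400 dipeptides, expressed in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_over_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PER_MILLE


ala_ala = 16.869  # AlaAla, per mille, from the table
cys_cys = 0.147   # CysCys, per mille, from the table

print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides that are AlaAla
print(fold_over_uniform(ala_ala) > 1)  # more frequent than the uniform expectation
print(fold_over_uniform(cys_cys) < 1)  # less frequent than the uniform expectation
```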

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 16.869 ± 0.156
AlaCys: 1.213 ± 0.027
AlaAsp: 6.746 ± 0.066
AlaGlu: 7.41 ± 0.079
AlaPhe: 4.131 ± 0.055
AlaGly: 11.107 ± 0.118
AlaHis: 2.232 ± 0.044
AlaIle: 6.132 ± 0.058
AlaLys: 3.877 ± 0.054
AlaLeu: 13.336 ± 0.163
AlaMet: 3.16 ± 0.048
AlaAsn: 2.902 ± 0.048
AlaPro: 5.167 ± 0.067
AlaGln: 3.767 ± 0.053
AlaArg: 7.954 ± 0.099
AlaSer: 6.442 ± 0.079
AlaThr: 6.035 ± 0.088
AlaVal: 9.344 ± 0.089
AlaTrp: 1.507 ± 0.031
AlaTyr: 2.445 ± 0.038
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.995 ± 0.027
CysCys: 0.147 ± 0.01
CysAsp: 0.558 ± 0.018
CysGlu: 0.405 ± 0.016
CysPhe: 0.37 ± 0.016
CysGly: 1.062 ± 0.026
CysHis: 0.311 ± 0.018
CysIle: 0.396 ± 0.014
CysLys: 0.219 ± 0.012
CysLeu: 1.048 ± 0.024
CysMet: 0.168 ± 0.01
CysAsn: 0.229 ± 0.011
CysPro: 0.549 ± 0.019
CysGln: 0.268 ± 0.012
CysArg: 0.816 ± 0.024
CysSer: 0.518 ± 0.016
CysThr: 0.431 ± 0.016
CysVal: 0.665 ± 0.021
CysTrp: 0.127 ± 0.011
CysTyr: 0.226 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.178 ± 0.064
AspCys: 0.516 ± 0.019
AspAsp: 3.095 ± 0.053
AspGlu: 3.197 ± 0.043
AspPhe: 2.176 ± 0.037
AspGly: 5.373 ± 0.06
AspHis: 1.329 ± 0.03
AspIle: 3.121 ± 0.039
AspLys: 1.615 ± 0.033
AspLeu: 6.292 ± 0.061
AspMet: 1.196 ± 0.028
AspAsn: 1.291 ± 0.032
AspPro: 3.293 ± 0.05
AspGln: 1.769 ± 0.032
AspArg: 4.087 ± 0.054
AspSer: 2.483 ± 0.038
AspThr: 2.383 ± 0.059
AspVal: 4.143 ± 0.049
AspTrp: 0.933 ± 0.027
AspTyr: 1.532 ± 0.032
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.117 ± 0.098
GluCys: 0.374 ± 0.016
GluAsp: 2.671 ± 0.044
GluGlu: 3.31 ± 0.056
GluPhe: 1.546 ± 0.031
GluGly: 3.96 ± 0.052
GluHis: 1.076 ± 0.029
GluIle: 3.395 ± 0.047
GluLys: 2.108 ± 0.043
GluLeu: 4.905 ± 0.054
GluMet: 1.534 ± 0.032
GluAsn: 1.455 ± 0.032
GluPro: 2.254 ± 0.044
GluGln: 2.07 ± 0.038
GluArg: 4.765 ± 0.057
GluSer: 2.525 ± 0.042
GluThr: 3.439 ± 0.049
GluVal: 3.686 ± 0.05
GluTrp: 0.614 ± 0.02
GluTyr: 0.815 ± 0.022
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.155 ± 0.055
PheCys: 0.459 ± 0.016
PheAsp: 2.511 ± 0.043
PheGlu: 1.846 ± 0.033
PhePhe: 1.465 ± 0.032
PheGly: 3.64 ± 0.047
PheHis: 0.834 ± 0.027
PheIle: 1.643 ± 0.032
PheLys: 0.938 ± 0.023
PheLeu: 3.769 ± 0.054
PheMet: 0.679 ± 0.021
PheAsn: 1.005 ± 0.023
PhePro: 1.702 ± 0.035
PheGln: 1.031 ± 0.025
PheArg: 2.292 ± 0.039
PheSer: 2.414 ± 0.035
PheThr: 1.887 ± 0.037
PheVal: 2.624 ± 0.038
PheTrp: 0.494 ± 0.017
PheTyr: 0.872 ± 0.022
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.008 ± 0.084
GlyCys: 1.012 ± 0.027
GlyAsp: 4.45 ± 0.071
GlyGlu: 4.639 ± 0.068
GlyPhe: 3.475 ± 0.048
GlyGly: 7.775 ± 0.119
GlyHis: 1.98 ± 0.033
GlyIle: 4.742 ± 0.049
GlyLys: 3.255 ± 0.048
GlyLeu: 9.885 ± 0.108
GlyMet: 2.224 ± 0.036
GlyAsn: 2.296 ± 0.063
GlyPro: 3.296 ± 0.049
GlyGln: 2.93 ± 0.042
GlyArg: 6.548 ± 0.068
GlySer: 4.918 ± 0.105
GlyThr: 4.873 ± 0.114
GlyVal: 6.354 ± 0.066
GlyTrp: 1.394 ± 0.034
GlyTyr: 2.25 ± 0.041
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.259 ± 0.04
HisCys: 0.254 ± 0.012
HisAsp: 1.24 ± 0.027
HisGlu: 0.95 ± 0.026
HisPhe: 0.911 ± 0.025
HisGly: 1.942 ± 0.037
HisHis: 0.652 ± 0.021
HisIle: 0.945 ± 0.023
HisLys: 0.48 ± 0.018
HisLeu: 2.381 ± 0.043
HisMet: 0.461 ± 0.018
HisAsn: 0.475 ± 0.018
HisPro: 1.472 ± 0.036
HisGln: 0.687 ± 0.018
HisArg: 1.578 ± 0.029
HisSer: 1.013 ± 0.024
HisThr: 0.777 ± 0.019
HisVal: 1.455 ± 0.034
HisTrp: 0.333 ± 0.014
HisTyr: 0.591 ± 0.019
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.831 ± 0.07
IleCys: 0.557 ± 0.018
IleAsp: 3.648 ± 0.047
IleGlu: 3.334 ± 0.05
IlePhe: 1.718 ± 0.03
IleGly: 5.269 ± 0.072
IleHis: 1.015 ± 0.025
IleIle: 2.261 ± 0.039
IleLys: 1.449 ± 0.031
IleLeu: 4.908 ± 0.053
IleMet: 0.869 ± 0.02
IleAsn: 1.552 ± 0.033
IlePro: 2.438 ± 0.036
IleGln: 1.289 ± 0.024
IleArg: 3.269 ± 0.047
IleSer: 3.239 ± 0.051
IleThr: 2.71 ± 0.054
IleVal: 3.958 ± 0.041
IleTrp: 0.532 ± 0.016
IleTyr: 1.042 ± 0.021
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.97 ± 0.052
LysCys: 0.202 ± 0.012
LysAsp: 1.712 ± 0.035
LysGlu: 1.528 ± 0.034
LysPhe: 0.838 ± 0.023
LysGly: 2.625 ± 0.043
LysHis: 0.512 ± 0.016
LysIle: 1.856 ± 0.037
LysLys: 1.191 ± 0.031
LysLeu: 2.956 ± 0.043
LysMet: 0.805 ± 0.021
LysAsn: 0.881 ± 0.026
LysPro: 1.774 ± 0.029
LysGln: 0.888 ± 0.025
LysArg: 2.077 ± 0.037
LysSer: 1.841 ± 0.037
LysThr: 2.024 ± 0.038
LysVal: 2.43 ± 0.046
LysTrp: 0.319 ± 0.016
LysTyr: 0.594 ± 0.018
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.581 ± 0.133
LeuCys: 1.019 ± 0.024
LeuAsp: 6.202 ± 0.067
LeuGlu: 5.029 ± 0.064
LeuPhe: 3.937 ± 0.056
LeuGly: 8.835 ± 0.086
LeuHis: 2.042 ± 0.036
LeuIle: 5.214 ± 0.079
LeuLys: 3.426 ± 0.048
LeuLeu: 10.877 ± 0.125
LeuMet: 2.222 ± 0.037
LeuAsn: 2.553 ± 0.038
LeuPro: 6.026 ± 0.069
LeuGln: 2.871 ± 0.045
LeuArg: 7.351 ± 0.072
LeuSer: 7.57 ± 0.103
LeuThr: 6.201 ± 0.182
LeuVal: 7.679 ± 0.074
LeuTrp: 1.267 ± 0.027
LeuTyr: 2.051 ± 0.036
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.135 ± 0.04
MetCys: 0.145 ± 0.009
MetAsp: 1.099 ± 0.025
MetGlu: 1.101 ± 0.025
MetPhe: 0.675 ± 0.021
MetGly: 1.775 ± 0.033
MetHis: 0.391 ± 0.015
MetIle: 1.318 ± 0.028
MetLys: 0.815 ± 0.024
MetLeu: 2.266 ± 0.037
MetMet: 0.603 ± 0.021
MetAsn: 0.678 ± 0.021
MetPro: 1.35 ± 0.029
MetGln: 0.65 ± 0.019
MetArg: 1.603 ± 0.03
MetSer: 1.657 ± 0.027
MetThr: 1.774 ± 0.033
MetVal: 1.854 ± 0.037
MetTrp: 0.187 ± 0.01
MetTyr: 0.284 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.034 ± 0.044
AsnCys: 0.25 ± 0.013
AsnAsp: 1.477 ± 0.033
AsnGlu: 1.064 ± 0.025
AsnPhe: 0.976 ± 0.022
AsnGly: 2.452 ± 0.056
AsnHis: 0.523 ± 0.017
AsnIle: 1.362 ± 0.033
AsnLys: 0.668 ± 0.021
AsnLeu: 2.777 ± 0.049
AsnMet: 0.546 ± 0.017
AsnAsn: 0.773 ± 0.028
AsnPro: 1.714 ± 0.031
AsnGln: 0.856 ± 0.025
AsnArg: 1.76 ± 0.031
AsnSer: 1.37 ± 0.039
AsnThr: 1.307 ± 0.038
AsnVal: 1.904 ± 0.036
AsnTrp: 0.376 ± 0.017
AsnTyr: 0.673 ± 0.021
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.205 ± 0.076
ProCys: 0.397 ± 0.018
ProAsp: 3.441 ± 0.054
ProGlu: 3.258 ± 0.054
ProPhe: 1.965 ± 0.037
ProGly: 4.369 ± 0.067
ProHis: 1.036 ± 0.026
ProIle: 2.281 ± 0.034
ProLys: 1.533 ± 0.03
ProLeu: 5.305 ± 0.07
ProMet: 1.158 ± 0.029
ProAsn: 1.217 ± 0.029
ProPro: 3.043 ± 0.063
ProGln: 1.558 ± 0.032
ProArg: 2.835 ± 0.051
ProSer: 3.07 ± 0.046
ProThr: 2.486 ± 0.041
ProVal: 4.164 ± 0.05
ProTrp: 0.729 ± 0.022
ProTyr: 1.14 ± 0.028
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.27 ± 0.05
GlnCys: 0.237 ± 0.011
GlnAsp: 1.456 ± 0.025
GlnGlu: 1.445 ± 0.03
GlnPhe: 1.031 ± 0.026
GlnGly: 2.515 ± 0.043
GlnHis: 0.595 ± 0.016
GlnIle: 1.927 ± 0.073
GlnLys: 1.019 ± 0.027
GlnLeu: 2.924 ± 0.077
GlnMet: 0.947 ± 0.026
GlnAsn: 0.799 ± 0.022
GlnPro: 1.721 ± 0.038
GlnGln: 1.183 ± 0.03
GlnArg: 2.238 ± 0.043
GlnSer: 1.96 ± 0.037
GlnThr: 2.051 ± 0.042
GlnVal: 2.424 ± 0.045
GlnTrp: 0.417 ± 0.016
GlnTyr: 0.623 ± 0.019
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.135 ± 0.074
ArgCys: 0.61 ± 0.02
ArgAsp: 3.84 ± 0.047
ArgGlu: 3.746 ± 0.059
ArgPhe: 2.77 ± 0.047
ArgGly: 4.529 ± 0.054
ArgHis: 1.985 ± 0.042
ArgIle: 3.885 ± 0.052
ArgLys: 2.254 ± 0.041
ArgLeu: 8.727 ± 0.091
ArgMet: 1.798 ± 0.036
ArgAsn: 1.818 ± 0.034
ArgPro: 3.674 ± 0.061
ArgGln: 2.984 ± 0.041
ArgArg: 6.304 ± 0.092
ArgSer: 3.662 ± 0.044
ArgThr: 3.303 ± 0.041
ArgVal: 4.479 ± 0.06
ArgTrp: 1.035 ± 0.029
ArgTyr: 1.655 ± 0.032
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.662 ± 0.075
SerCys: 0.553 ± 0.019
SerAsp: 3.022 ± 0.047
SerGlu: 2.744 ± 0.041
SerPhe: 2.309 ± 0.041
SerGly: 6.08 ± 0.113
SerHis: 1.201 ± 0.027
SerIle: 2.873 ± 0.048
SerLys: 1.474 ± 0.03
SerLeu: 6.556 ± 0.075
SerMet: 1.359 ± 0.025
SerAsn: 1.489 ± 0.039
SerPro: 3.081 ± 0.041
SerGln: 1.816 ± 0.033
SerArg: 3.807 ± 0.051
SerSer: 3.822 ± 0.095
SerThr: 3.006 ± 0.091
SerVal: 4.444 ± 0.06
SerTrp: 0.787 ± 0.024
SerTyr: 1.328 ± 0.033
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.598 ± 0.104
ThrCys: 0.465 ± 0.017
ThrAsp: 2.786 ± 0.039
ThrGlu: 2.614 ± 0.05
ThrPhe: 1.848 ± 0.037
ThrGly: 5.166 ± 0.086
ThrHis: 0.933 ± 0.025
ThrIle: 2.985 ± 0.063
ThrLys: 1.335 ± 0.027
ThrLeu: 6.166 ± 0.131
ThrMet: 1.119 ± 0.027
ThrAsn: 1.433 ± 0.037
ThrPro: 3.209 ± 0.048
ThrGln: 1.71 ± 0.149
ThrArg: 3.089 ± 0.041
ThrSer: 3.118 ± 0.082
ThrThr: 3.252 ± 0.169
ThrVal: 4.673 ± 0.082
ThrTrp: 0.616 ± 0.021
ThrTyr: 1.137 ± 0.034
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.283 ± 0.079
ValCys: 0.725 ± 0.022
ValAsp: 4.11 ± 0.049
ValGlu: 4.421 ± 0.051
ValPhe: 2.687 ± 0.044
ValGly: 5.862 ± 0.069
ValHis: 1.355 ± 0.029
ValIle: 4.078 ± 0.045
ValLys: 2.368 ± 0.042
ValLeu: 7.777 ± 0.073
ValMet: 1.823 ± 0.033
ValAsn: 1.986 ± 0.044
ValPro: 3.671 ± 0.053
ValGln: 2.09 ± 0.041
ValArg: 4.795 ± 0.065
ValSer: 4.737 ± 0.062
ValThr: 4.564 ± 0.087
ValVal: 6.427 ± 0.07
ValTrp: 0.878 ± 0.025
ValTyr: 1.433 ± 0.03
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.184 ± 0.029
TrpCys: 0.127 ± 0.008
TrpAsp: 0.62 ± 0.02
TrpGlu: 0.521 ± 0.02
TrpPhe: 0.504 ± 0.018
TrpGly: 0.926 ± 0.027
TrpHis: 0.332 ± 0.015
TrpIle: 0.63 ± 0.023
TrpLys: 0.42 ± 0.017
TrpLeu: 1.689 ± 0.034
TrpMet: 0.325 ± 0.014
TrpAsn: 0.437 ± 0.017
TrpPro: 0.684 ± 0.021
TrpGln: 0.562 ± 0.016
TrpArg: 1.242 ± 0.037
TrpSer: 0.814 ± 0.021
TrpThr: 0.754 ± 0.022
TrpVal: 0.801 ± 0.023
TrpTrp: 0.223 ± 0.012
TrpTyr: 0.294 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.309 ± 0.04
TyrCys: 0.244 ± 0.012
TyrAsp: 1.318 ± 0.027
TyrGlu: 1.056 ± 0.027
TyrPhe: 0.928 ± 0.023
TyrGly: 2.043 ± 0.038
TyrHis: 0.522 ± 0.019
TyrIle: 0.842 ± 0.021
TyrLys: 0.578 ± 0.019
TyrLeu: 2.41 ± 0.04
TyrMet: 0.366 ± 0.014
TyrAsn: 0.594 ± 0.022
TyrPro: 1.071 ± 0.029
TyrGln: 0.805 ± 0.023
TyrArg: 1.781 ± 0.032
TyrSer: 1.242 ± 0.033
TyrThr: 0.996 ± 0.027
TyrVal: 1.561 ± 0.031
TyrTrp: 0.321 ± 0.016
TyrTyr: 0.617 ± 0.025
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5353 proteins (1820509 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
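A protein-level bootstrap of this kind can be sketched as below. This is an illustration only: the source states just that 100 bootstrap rounds were run at the protein level, so the function name `bootstrap_error`, the use of the sample standard deviation as the reported error, the fixed seed, and the toy sequences are all assumptions.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide's frequency.

    Whole proteins are resampled with replacement, so residues within a
    protein always stay together. Returns the standard deviation of the
    per mille frequency across the bootstrap replicates (assumed error measure).
    """
    rng = random.Random(seed)
    # Pre-compute per-protein dipeptide counts once; resampling reuses them.
    per_protein = [Counter(a + b for a, b in zip(s, s[1:])) for s in sequences]
    totals = [sum(c.values()) for c in per_protein]
    indices = range(len(sequences))

    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(indices, k=len(sequences))
        total = sum(totals[i] for i in sample)
        hits = sum(per_protein[i][pair] for i in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input, invented for illustration only.
toy_proteome = ["MAALAG", "AAGL", "GLLAA", "MKAAV"]
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: bootstrap error of {error:.3f} per mille over 100 resamples")
```

Resampling proteins rather than individual dipeptides keeps the dependence between neighbouring residues inside each protein intact, which is presumably why the error is described as being estimated at the protein level.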

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski