Amino acid dipeptide frequency for Singulisphaera acidiphila (strain ATCC BAA-1392 / DSM 18658 / VKM B-2454 / MOB10)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would have a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
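As a rough illustration of the calculation described above, the following Python sketch counts dipeptides and compares them with the uniform 0.25% baseline. It rests on several assumptions that are not stated on this page: dipeptides are counted as overlapping pairs within each protein, pairs never span two proteins, frequencies are normalised by the total number of pairs, and any letter outside the 20 standard amino acids is mapped to Xaa. The function names are hypothetical and are not part of Proteome-pI.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}

# Uniform expectation: 1 of 400 dipeptides, i.e. 0.25 % or 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything that is not a standard residue becomes Xaa.
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping pairs inside one protein only.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform baseline."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    toy_proteome = ["MALLAG", "MKLAAL"]  # made-up sequences, not real data
    for pair, freq in sorted(dipeptide_per_mille(toy_proteome).items()):
        print(f"{pair}: {freq:.1f} per mille ({classify(freq)})")
```

With the values in the table, `classify(12.055)` labels AlaAla as over-represented and `classify(0.167)` labels CysCys as under-represented relative to the uniform baseline.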

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
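The unit conversion can be sketched in a few lines of Python. The helper names below are hypothetical and only illustrate the arithmetic; the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    ala_ala = 12.055  # AlaAla frequency in per mille, taken from the table
    print(per_mille_to_fraction(ala_ala))
    print(per_mille_to_percent(ala_ala))
```

In these units, the 0.25% expectation for a uniformly random dipeptide corresponds to 2.5‰.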

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Within each group the second residue follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 12.055 ± 0.104
AlaCys: 1.159 ± 0.023
AlaAsp: 5.938 ± 0.058
AlaGlu: 6.736 ± 0.066
AlaPhe: 3.716 ± 0.043
AlaGly: 8.749 ± 0.078
AlaHis: 1.972 ± 0.033
AlaIle: 5.371 ± 0.054
AlaLys: 4.041 ± 0.052
AlaLeu: 11.463 ± 0.095
AlaMet: 2.388 ± 0.039
AlaAsn: 2.972 ± 0.05
AlaPro: 5.411 ± 0.056
AlaGln: 3.368 ± 0.037
AlaArg: 7.776 ± 0.077
AlaSer: 6.463 ± 0.06
AlaThr: 6.256 ± 0.077
AlaVal: 7.659 ± 0.063
AlaTrp: 1.667 ± 0.035
AlaTyr: 2.278 ± 0.034
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.764 ± 0.018
CysCys: 0.167 ± 0.008
CysAsp: 0.54 ± 0.014
CysGlu: 0.503 ± 0.015
CysPhe: 0.348 ± 0.011
CysGly: 0.918 ± 0.022
CysHis: 0.39 ± 0.017
CysIle: 0.33 ± 0.01
CysLys: 0.225 ± 0.011
CysLeu: 1.08 ± 0.022
CysMet: 0.141 ± 0.007
CysAsn: 0.245 ± 0.01
CysPro: 0.632 ± 0.019
CysGln: 0.324 ± 0.011
CysArg: 0.804 ± 0.022
CysSer: 0.499 ± 0.015
CysThr: 0.438 ± 0.014
CysVal: 0.645 ± 0.018
CysTrp: 0.175 ± 0.008
CysTyr: 0.263 ± 0.01
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.65 ± 0.059
AspCys: 0.467 ± 0.012
AspAsp: 3.291 ± 0.043
AspGlu: 3.543 ± 0.042
AspPhe: 2.008 ± 0.032
AspGly: 5.118 ± 0.06
AspHis: 1.366 ± 0.025
AspIle: 1.911 ± 0.03
AspLys: 1.493 ± 0.031
AspLeu: 6.375 ± 0.065
AspMet: 0.772 ± 0.017
AspAsn: 1.203 ± 0.024
AspPro: 4.381 ± 0.054
AspGln: 2.12 ± 0.03
AspArg: 4.999 ± 0.052
AspSer: 2.887 ± 0.036
AspThr: 2.195 ± 0.035
AspVal: 3.727 ± 0.041
AspTrp: 1.039 ± 0.021
AspTyr: 1.368 ± 0.026
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.946 ± 0.08
GluCys: 0.407 ± 0.013
GluAsp: 2.485 ± 0.034
GluGlu: 3.229 ± 0.045
GluPhe: 1.973 ± 0.03
GluGly: 4.295 ± 0.044
GluHis: 1.265 ± 0.025
GluIle: 2.932 ± 0.038
GluLys: 2.123 ± 0.035
GluLeu: 5.795 ± 0.06
GluMet: 1.172 ± 0.023
GluAsn: 1.494 ± 0.027
GluPro: 3.483 ± 0.045
GluGln: 2.301 ± 0.032
GluArg: 4.873 ± 0.058
GluSer: 3.539 ± 0.037
GluThr: 3.346 ± 0.042
GluVal: 4.309 ± 0.05
GluTrp: 0.769 ± 0.018
GluTyr: 1.095 ± 0.021
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.781 ± 0.039
PheCys: 0.367 ± 0.012
PheAsp: 2.447 ± 0.03
PheGlu: 2.052 ± 0.031
PhePhe: 1.275 ± 0.025
PheGly: 3.263 ± 0.037
PheHis: 0.831 ± 0.019
PheIle: 1.21 ± 0.022
PheLys: 0.971 ± 0.02
PheLeu: 3.705 ± 0.045
PheMet: 0.593 ± 0.014
PheAsn: 1.065 ± 0.022
PhePro: 1.81 ± 0.027
PheGln: 1.254 ± 0.024
PheArg: 2.57 ± 0.034
PheSer: 2.12 ± 0.03
PheThr: 1.929 ± 0.036
PheVal: 2.577 ± 0.028
PheTrp: 0.579 ± 0.017
PheTyr: 0.825 ± 0.019
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 7.109 ± 0.061
GlyCys: 0.932 ± 0.021
GlyAsp: 4.326 ± 0.062
GlyGlu: 4.484 ± 0.045
GlyPhe: 3.245 ± 0.037
GlyGly: 7.381 ± 0.095
GlyHis: 1.921 ± 0.029
GlyIle: 3.799 ± 0.042
GlyLys: 3.318 ± 0.045
GlyLeu: 9.027 ± 0.057
GlyMet: 1.876 ± 0.041
GlyAsn: 2.378 ± 0.058
GlyPro: 4.481 ± 0.045
GlyGln: 3.294 ± 0.039
GlyArg: 6.632 ± 0.065
GlySer: 5.206 ± 0.059
GlyThr: 5.075 ± 0.078
GlyVal: 5.873 ± 0.049
GlyTrp: 1.57 ± 0.03
GlyTyr: 2.196 ± 0.041
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.069 ± 0.034
HisCys: 0.247 ± 0.009
HisAsp: 1.309 ± 0.023
HisGlu: 1.269 ± 0.022
HisPhe: 0.84 ± 0.018
HisGly: 1.907 ± 0.031
HisHis: 0.675 ± 0.016
HisIle: 0.704 ± 0.019
HisLys: 0.552 ± 0.014
HisLeu: 2.293 ± 0.036
HisMet: 0.333 ± 0.012
HisAsn: 0.599 ± 0.015
HisPro: 1.615 ± 0.023
HisGln: 0.739 ± 0.018
HisArg: 1.717 ± 0.027
HisSer: 1.119 ± 0.022
HisThr: 0.95 ± 0.021
HisVal: 1.44 ± 0.025
HisTrp: 0.449 ± 0.012
HisTyr: 0.587 ± 0.015
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.477 ± 0.051
IleCys: 0.435 ± 0.011
IleAsp: 3.205 ± 0.038
IleGlu: 3.177 ± 0.039
IlePhe: 1.313 ± 0.023
IleGly: 3.873 ± 0.049
IleHis: 1.025 ± 0.022
IleIle: 1.605 ± 0.026
IleLys: 1.467 ± 0.028
IleLeu: 4.494 ± 0.048
IleMet: 0.598 ± 0.016
IleAsn: 1.343 ± 0.026
IlePro: 2.682 ± 0.036
IleGln: 1.466 ± 0.021
IleArg: 3.451 ± 0.038
IleSer: 2.419 ± 0.032
IleThr: 2.274 ± 0.035
IleVal: 3.591 ± 0.044
IleTrp: 0.572 ± 0.017
IleTyr: 1.072 ± 0.026
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.917 ± 0.049
LysCys: 0.208 ± 0.01
LysAsp: 1.783 ± 0.036
LysGlu: 1.829 ± 0.033
LysPhe: 0.991 ± 0.021
LysGly: 2.652 ± 0.036
LysHis: 0.627 ± 0.016
LysIle: 1.59 ± 0.03
LysLys: 1.454 ± 0.035
LysLeu: 3.187 ± 0.039
LysMet: 0.706 ± 0.015
LysAsn: 1.009 ± 0.022
LysPro: 2.442 ± 0.041
LysGln: 1.218 ± 0.026
LysArg: 2.287 ± 0.032
LysSer: 2.022 ± 0.036
LysThr: 2.154 ± 0.035
LysVal: 2.569 ± 0.033
LysTrp: 0.345 ± 0.011
LysTyr: 0.714 ± 0.019
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 13.413 ± 0.088
LeuCys: 0.976 ± 0.021
LeuAsp: 6.247 ± 0.064
LeuGlu: 6.048 ± 0.066
LeuPhe: 3.51 ± 0.044
LeuGly: 9.01 ± 0.07
LeuHis: 1.932 ± 0.034
LeuIle: 5.067 ± 0.052
LeuLys: 3.748 ± 0.051
LeuLeu: 10.315 ± 0.086
LeuMet: 1.875 ± 0.026
LeuAsn: 2.872 ± 0.035
LeuPro: 5.936 ± 0.056
LeuGln: 2.861 ± 0.034
LeuArg: 7.389 ± 0.07
LeuSer: 6.049 ± 0.048
LeuThr: 6.172 ± 0.067
LeuVal: 7.864 ± 0.059
LeuTrp: 1.391 ± 0.026
LeuTyr: 2.088 ± 0.032
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.283 ± 0.031
MetCys: 0.127 ± 0.008
MetAsp: 0.862 ± 0.019
MetGlu: 0.889 ± 0.022
MetPhe: 0.57 ± 0.018
MetGly: 1.496 ± 0.032
MetHis: 0.364 ± 0.014
MetIle: 1.1 ± 0.023
MetLys: 0.769 ± 0.018
MetLeu: 1.898 ± 0.029
MetMet: 0.473 ± 0.018
MetAsn: 0.698 ± 0.016
MetPro: 1.236 ± 0.022
MetGln: 0.517 ± 0.014
MetArg: 1.325 ± 0.024
MetSer: 1.353 ± 0.025
MetThr: 1.388 ± 0.024
MetVal: 1.379 ± 0.024
MetTrp: 0.157 ± 0.008
MetTyr: 0.286 ± 0.012
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.759 ± 0.045
AsnCys: 0.252 ± 0.011
AsnAsp: 1.661 ± 0.039
AsnGlu: 1.379 ± 0.023
AsnPhe: 1.024 ± 0.024
AsnGly: 2.539 ± 0.054
AsnHis: 0.667 ± 0.016
AsnIle: 1.057 ± 0.024
AsnLys: 0.706 ± 0.02
AsnLeu: 3.041 ± 0.037
AsnMet: 0.423 ± 0.013
AsnAsn: 0.932 ± 0.03
AsnPro: 2.216 ± 0.038
AsnGln: 1.122 ± 0.023
AsnArg: 2.125 ± 0.034
AsnSer: 1.59 ± 0.036
AsnThr: 1.419 ± 0.033
AsnVal: 1.993 ± 0.035
AsnTrp: 0.436 ± 0.015
AsnTyr: 0.806 ± 0.021
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 6.617 ± 0.064
ProCys: 0.447 ± 0.015
ProAsp: 3.876 ± 0.05
ProGlu: 4.222 ± 0.045
ProPhe: 2.017 ± 0.029
ProGly: 5.389 ± 0.059
ProHis: 1.186 ± 0.019
ProIle: 2.805 ± 0.035
ProLys: 2.243 ± 0.031
ProLeu: 5.543 ± 0.056
ProMet: 1.144 ± 0.022
ProAsn: 1.92 ± 0.036
ProPro: 3.973 ± 0.06
ProGln: 1.774 ± 0.028
ProArg: 3.724 ± 0.045
ProSer: 3.966 ± 0.05
ProThr: 3.661 ± 0.065
ProVal: 4.272 ± 0.049
ProTrp: 0.953 ± 0.023
ProTyr: 1.252 ± 0.026
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.359 ± 0.046
GlnCys: 0.326 ± 0.011
GlnAsp: 1.427 ± 0.025
GlnGlu: 1.675 ± 0.027
GlnPhe: 1.238 ± 0.023
GlnGly: 3.019 ± 0.037
GlnHis: 0.609 ± 0.015
GlnIle: 1.71 ± 0.029
GlnLys: 1.056 ± 0.031
GlnLeu: 3.199 ± 0.041
GlnMet: 0.684 ± 0.015
GlnAsn: 0.884 ± 0.02
GlnPro: 2.01 ± 0.029
GlnGln: 1.151 ± 0.03
GlnArg: 2.419 ± 0.039
GlnSer: 1.887 ± 0.029
GlnThr: 1.88 ± 0.029
GlnVal: 2.848 ± 0.042
GlnTrp: 0.478 ± 0.015
GlnTyr: 0.756 ± 0.018
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 6.808 ± 0.065
ArgCys: 0.681 ± 0.018
ArgAsp: 4.075 ± 0.045
ArgGlu: 4.438 ± 0.054
ArgPhe: 3.013 ± 0.031
ArgGly: 5.172 ± 0.055
ArgHis: 1.685 ± 0.027
ArgIle: 3.74 ± 0.04
ArgLys: 2.456 ± 0.038
ArgLeu: 8.697 ± 0.079
ArgMet: 1.654 ± 0.03
ArgAsn: 1.933 ± 0.029
ArgPro: 4.596 ± 0.047
ArgGln: 2.771 ± 0.034
ArgArg: 6.471 ± 0.076
ArgSer: 4.503 ± 0.043
ArgThr: 3.836 ± 0.042
ArgVal: 5.157 ± 0.051
ArgTrp: 1.438 ± 0.021
ArgTyr: 1.926 ± 0.029
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.739 ± 0.065
SerCys: 0.549 ± 0.016
SerAsp: 3.14 ± 0.034
SerGlu: 3.063 ± 0.04
SerPhe: 2.138 ± 0.034
SerGly: 5.468 ± 0.076
SerHis: 1.298 ± 0.025
SerIle: 2.662 ± 0.028
SerLys: 1.896 ± 0.031
SerLeu: 6.447 ± 0.059
SerMet: 1.256 ± 0.021
SerAsn: 1.677 ± 0.037
SerPro: 3.935 ± 0.046
SerGln: 2.048 ± 0.03
SerArg: 4.358 ± 0.043
SerSer: 3.88 ± 0.066
SerThr: 3.407 ± 0.053
SerVal: 4.086 ± 0.043
SerTrp: 0.96 ± 0.018
SerTyr: 1.438 ± 0.037
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.76 ± 0.07
ThrCys: 0.504 ± 0.014
ThrAsp: 2.912 ± 0.045
ThrGlu: 2.623 ± 0.033
ThrPhe: 2.118 ± 0.032
ThrGly: 4.915 ± 0.072
ThrHis: 1.146 ± 0.022
ThrIle: 2.994 ± 0.041
ThrLys: 1.732 ± 0.031
ThrLeu: 6.349 ± 0.061
ThrMet: 1.009 ± 0.02
ThrAsn: 1.599 ± 0.041
ThrPro: 3.953 ± 0.063
ThrGln: 1.596 ± 0.03
ThrArg: 3.516 ± 0.038
ThrSer: 3.34 ± 0.056
ThrThr: 3.539 ± 0.072
ThrVal: 4.338 ± 0.061
ThrTrp: 0.894 ± 0.023
ThrTyr: 1.428 ± 0.043
ThrXaa: 0.0 ± 0.0

Val
ValAla: 8.415 ± 0.066
ValCys: 0.763 ± 0.019
ValAsp: 4.087 ± 0.041
ValGlu: 4.624 ± 0.049
ValPhe: 2.42 ± 0.028
ValGly: 5.828 ± 0.061
ValHis: 1.436 ± 0.023
ValIle: 3.472 ± 0.046
ValLys: 2.281 ± 0.034
ValLeu: 7.547 ± 0.065
ValMet: 1.367 ± 0.022
ValAsn: 2.054 ± 0.036
ValPro: 4.099 ± 0.049
ValGln: 2.152 ± 0.029
ValArg: 5.374 ± 0.054
ValSer: 4.195 ± 0.045
ValThr: 4.28 ± 0.064
ValVal: 6.297 ± 0.058
ValTrp: 1.014 ± 0.019
ValTyr: 1.589 ± 0.029
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.376 ± 0.025
TrpCys: 0.167 ± 0.009
TrpAsp: 0.882 ± 0.026
TrpGlu: 0.729 ± 0.017
TrpPhe: 0.573 ± 0.016
TrpGly: 1.141 ± 0.02
TrpHis: 0.374 ± 0.011
TrpIle: 0.77 ± 0.017
TrpLys: 0.569 ± 0.015
TrpLeu: 1.79 ± 0.029
TrpMet: 0.345 ± 0.014
TrpAsn: 0.527 ± 0.014
TrpPro: 0.828 ± 0.02
TrpGln: 0.55 ± 0.015
TrpArg: 1.147 ± 0.023
TrpSer: 1.121 ± 0.025
TrpThr: 0.901 ± 0.025
TrpVal: 1.093 ± 0.021
TrpTrp: 0.302 ± 0.012
TrpTyr: 0.367 ± 0.012
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.141 ± 0.033
TyrCys: 0.25 ± 0.01
TyrAsp: 1.422 ± 0.039
TyrGlu: 1.316 ± 0.022
TyrPhe: 0.895 ± 0.024
TyrGly: 1.993 ± 0.029
TyrHis: 0.641 ± 0.016
TyrIle: 0.711 ± 0.018
TyrLys: 0.604 ± 0.017
TyrLeu: 2.515 ± 0.036
TyrMet: 0.361 ± 0.012
TyrAsn: 0.746 ± 0.023
TyrPro: 1.267 ± 0.023
TyrGln: 0.986 ± 0.019
TyrArg: 2.092 ± 0.032
TyrSer: 1.287 ± 0.036
TyrThr: 1.165 ± 0.026
TyrVal: 1.576 ± 0.026
TyrTrp: 0.365 ± 0.012
TyrTyr: 0.766 ± 0.028
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 7126 proteins (2607485 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
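A protein-level bootstrap of this kind might look like the Python sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the reported error is taken to be the standard deviation of those estimates. The page does not say which statistic is reported as the error, and the function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one sequence (one-letter codes)."""
    return Counter(zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    `pair` is a two-letter string such as "AL". Resampling is done on whole
    proteins, so pairs within one protein always stay together.
    """
    rng = random.Random(seed)
    key = (pair[0], pair[1])
    # Precompute per-protein hit count and total pair count once.
    per_protein = []
    for seq in proteins:
        counts = pair_counts(seq)
        per_protein.append((counts[key], sum(counts.values())))
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MALLAG", "MKLAAL", "GGALGG"]  # made-up sequences
    mean, err = bootstrap_error(toy_proteome, "AL")
    print(f"AlaLeu: {mean:.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the correlation between neighbouring pairs within a protein, which is presumably why the error is estimated at the protein level.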

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski