Amino acid dipepetide frequency for Baekduia soli

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
24.957AlaAla: 24.957 ± 0.231
1.295AlaCys: 1.295 ± 0.03
8.303AlaAsp: 8.303 ± 0.087
7.899AlaGlu: 7.899 ± 0.114
4.1AlaPhe: 4.1 ± 0.06
15.274AlaGly: 15.274 ± 0.135
3.084AlaHis: 3.084 ± 0.052
5.264AlaIle: 5.264 ± 0.067
2.219AlaLys: 2.219 ± 0.047
16.027AlaLeu: 16.027 ± 0.138
3.167AlaMet: 3.167 ± 0.048
1.812AlaAsn: 1.812 ± 0.042
8.267AlaPro: 8.267 ± 0.109
4.03AlaGln: 4.03 ± 0.061
12.919AlaArg: 12.919 ± 0.128
6.1AlaSer: 6.1 ± 0.07
7.697AlaThr: 7.697 ± 0.085
12.234AlaVal: 12.234 ± 0.114
2.013AlaTrp: 2.013 ± 0.038
2.457AlaTyr: 2.457 ± 0.043
0.0AlaXaa: 0.0 ± 0.0
Cys
1.283CysAla: 1.283 ± 0.031
0.141CysCys: 0.141 ± 0.01
0.542CysAsp: 0.542 ± 0.018
0.475CysGlu: 0.475 ± 0.019
0.266CysPhe: 0.266 ± 0.013
1.039CysGly: 1.039 ± 0.027
0.216CysHis: 0.216 ± 0.012
0.268CysIle: 0.268 ± 0.014
0.101CysLys: 0.101 ± 0.009
0.712CysLeu: 0.712 ± 0.021
0.122CysMet: 0.122 ± 0.01
0.124CysAsn: 0.124 ± 0.01
0.556CysPro: 0.556 ± 0.02
0.158CysGln: 0.158 ± 0.011
0.704CysArg: 0.704 ± 0.022
0.449CysSer: 0.449 ± 0.019
0.46CysThr: 0.46 ± 0.018
0.704CysVal: 0.704 ± 0.024
0.116CysTrp: 0.116 ± 0.01
0.143CysTyr: 0.143 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
9.06AspAla: 9.06 ± 0.081
0.378AspCys: 0.378 ± 0.015
4.05AspAsp: 4.05 ± 0.063
3.908AspGlu: 3.908 ± 0.062
1.522AspPhe: 1.522 ± 0.034
6.494AspGly: 6.494 ± 0.069
1.501AspHis: 1.501 ± 0.032
1.833AspIle: 1.833 ± 0.042
0.745AspLys: 0.745 ± 0.029
6.526AspLeu: 6.526 ± 0.073
0.723AspMet: 0.723 ± 0.021
0.688AspAsn: 0.688 ± 0.021
4.768AspPro: 4.768 ± 0.063
1.328AspGln: 1.328 ± 0.032
5.304AspArg: 5.304 ± 0.06
1.661AspSer: 1.661 ± 0.038
2.46AspThr: 2.46 ± 0.044
5.711AspVal: 5.711 ± 0.062
0.774AspTrp: 0.774 ± 0.021
0.984AspTyr: 0.984 ± 0.027
0.0AspXaa: 0.0 ± 0.0
Glu
7.884GluAla: 7.884 ± 0.106
0.281GluCys: 0.281 ± 0.013
2.844GluAsp: 2.844 ± 0.043
2.875GluGlu: 2.875 ± 0.048
1.235GluPhe: 1.235 ± 0.029
4.248GluGly: 4.248 ± 0.069
1.838GluHis: 1.838 ± 0.037
2.492GluIle: 2.492 ± 0.045
0.798GluLys: 0.798 ± 0.028
7.049GluLeu: 7.049 ± 0.082
0.793GluMet: 0.793 ± 0.024
0.68GluAsn: 0.68 ± 0.022
3.258GluPro: 3.258 ± 0.056
1.933GluGln: 1.933 ± 0.035
6.058GluArg: 6.058 ± 0.078
1.893GluSer: 1.893 ± 0.044
2.395GluThr: 2.395 ± 0.04
4.691GluVal: 4.691 ± 0.063
0.548GluTrp: 0.548 ± 0.022
0.813GluTyr: 0.813 ± 0.028
0.0GluXaa: 0.0 ± 0.0
Phe
4.0PheAla: 4.0 ± 0.055
0.348PheCys: 0.348 ± 0.016
2.007PheAsp: 2.007 ± 0.036
1.598PheGlu: 1.598 ± 0.034
0.912PhePhe: 0.912 ± 0.026
3.138PheGly: 3.138 ± 0.049
0.628PheHis: 0.628 ± 0.02
0.83PheIle: 0.83 ± 0.026
0.513PheLys: 0.513 ± 0.021
2.382PheLeu: 2.382 ± 0.044
0.41PheMet: 0.41 ± 0.016
0.538PheAsn: 0.538 ± 0.018
1.291PhePro: 1.291 ± 0.032
0.599PheGln: 0.599 ± 0.02
1.756PheArg: 1.756 ± 0.029
1.446PheSer: 1.446 ± 0.036
1.684PheThr: 1.684 ± 0.031
2.386PheVal: 2.386 ± 0.038
0.384PheTrp: 0.384 ± 0.016
0.564PheTyr: 0.564 ± 0.019
0.0PheXaa: 0.0 ± 0.0
Gly
13.326GlyAla: 13.326 ± 0.123
0.907GlyCys: 0.907 ± 0.028
5.718GlyAsp: 5.718 ± 0.064
5.03GlyGlu: 5.03 ± 0.062
2.977GlyPhe: 2.977 ± 0.044
9.506GlyGly: 9.506 ± 0.115
2.397GlyHis: 2.397 ± 0.046
3.661GlyIle: 3.661 ± 0.052
1.532GlyLys: 1.532 ± 0.036
9.851GlyLeu: 9.851 ± 0.095
2.022GlyMet: 2.022 ± 0.036
1.402GlyAsn: 1.402 ± 0.041
5.541GlyPro: 5.541 ± 0.067
2.344GlyGln: 2.344 ± 0.038
8.94GlyArg: 8.94 ± 0.089
4.86GlySer: 4.86 ± 0.061
5.685GlyThr: 5.685 ± 0.06
8.003GlyVal: 8.003 ± 0.078
1.57GlyTrp: 1.57 ± 0.032
2.019GlyTyr: 2.019 ± 0.037
0.0GlyXaa: 0.0 ± 0.0
His
3.234HisAla: 3.234 ± 0.051
0.208HisCys: 0.208 ± 0.012
1.636HisAsp: 1.636 ± 0.035
1.466HisGlu: 1.466 ± 0.032
0.597HisPhe: 0.597 ± 0.021
2.653HisGly: 2.653 ± 0.044
0.789HisHis: 0.789 ± 0.026
0.633HisIle: 0.633 ± 0.02
0.298HisLys: 0.298 ± 0.016
2.304HisLeu: 2.304 ± 0.04
0.299HisMet: 0.299 ± 0.013
0.341HisAsn: 0.341 ± 0.017
1.751HisPro: 1.751 ± 0.032
0.572HisGln: 0.572 ± 0.022
2.16HisArg: 2.16 ± 0.046
0.793HisSer: 0.793 ± 0.023
1.023HisThr: 1.023 ± 0.024
2.129HisVal: 2.129 ± 0.035
0.313HisTrp: 0.313 ± 0.014
0.419HisTyr: 0.419 ± 0.017
0.0HisXaa: 0.0 ± 0.0
Ile
5.904IleAla: 5.904 ± 0.062
0.327IleCys: 0.327 ± 0.014
2.724IleAsp: 2.724 ± 0.047
2.282IleGlu: 2.282 ± 0.047
0.915IlePhe: 0.915 ± 0.024
3.782IleGly: 3.782 ± 0.06
0.691IleHis: 0.691 ± 0.02
1.129IleIle: 1.129 ± 0.032
0.644IleLys: 0.644 ± 0.023
2.869IleLeu: 2.869 ± 0.042
0.508IleMet: 0.508 ± 0.017
0.704IleAsn: 0.704 ± 0.023
1.772IlePro: 1.772 ± 0.035
0.749IleGln: 0.749 ± 0.024
2.261IleArg: 2.261 ± 0.039
1.597IleSer: 1.597 ± 0.032
2.12IleThr: 2.12 ± 0.035
3.62IleVal: 3.62 ± 0.057
0.36IleTrp: 0.36 ± 0.014
0.641IleTyr: 0.641 ± 0.022
0.0IleXaa: 0.0 ± 0.0
Lys
2.159LysAla: 2.159 ± 0.047
0.082LysCys: 0.082 ± 0.008
0.912LysAsp: 0.912 ± 0.03
0.708LysGlu: 0.708 ± 0.027
0.346LysPhe: 0.346 ± 0.016
1.441LysGly: 1.441 ± 0.039
0.342LysHis: 0.342 ± 0.014
0.733LysIle: 0.733 ± 0.028
0.477LysLys: 0.477 ± 0.024
1.711LysLeu: 1.711 ± 0.043
0.276LysMet: 0.276 ± 0.016
0.293LysAsn: 0.293 ± 0.012
0.961LysPro: 0.961 ± 0.032
0.481LysGln: 0.481 ± 0.021
1.2LysArg: 1.2 ± 0.032
0.685LysSer: 0.685 ± 0.022
0.962LysThr: 0.962 ± 0.033
1.482LysVal: 1.482 ± 0.031
0.163LysTrp: 0.163 ± 0.011
0.303LysTyr: 0.303 ± 0.016
0.0LysXaa: 0.0 ± 0.0
Leu
16.357LeuAla: 16.357 ± 0.155
0.905LeuCys: 0.905 ± 0.026
6.803LeuAsp: 6.803 ± 0.07
5.795LeuGlu: 5.795 ± 0.08
2.417LeuPhe: 2.417 ± 0.049
10.163LeuGly: 10.163 ± 0.103
2.448LeuHis: 2.448 ± 0.044
3.136LeuIle: 3.136 ± 0.05
1.539LeuLys: 1.539 ± 0.039
10.438LeuLeu: 10.438 ± 0.115
1.526LeuMet: 1.526 ± 0.03
1.431LeuAsn: 1.431 ± 0.03
5.733LeuPro: 5.733 ± 0.066
2.72LeuGln: 2.72 ± 0.045
9.449LeuArg: 9.449 ± 0.086
4.677LeuSer: 4.677 ± 0.062
5.527LeuThr: 5.527 ± 0.068
9.146LeuVal: 9.146 ± 0.086
1.209LeuTrp: 1.209 ± 0.035
1.605LeuTyr: 1.605 ± 0.034
0.0LeuXaa: 0.0 ± 0.0
Met
2.469MetAla: 2.469 ± 0.041
0.126MetCys: 0.126 ± 0.008
0.931MetAsp: 0.931 ± 0.023
0.71MetGlu: 0.71 ± 0.025
0.437MetPhe: 0.437 ± 0.017
1.387MetGly: 1.387 ± 0.032
0.384MetHis: 0.384 ± 0.015
0.651MetIle: 0.651 ± 0.021
0.345MetLys: 0.345 ± 0.016
1.822MetLeu: 1.822 ± 0.032
0.289MetMet: 0.289 ± 0.016
0.347MetAsn: 0.347 ± 0.015
1.252MetPro: 1.252 ± 0.03
0.46MetGln: 0.46 ± 0.019
1.561MetArg: 1.561 ± 0.034
1.311MetSer: 1.311 ± 0.027
1.4MetThr: 1.4 ± 0.03
1.353MetVal: 1.353 ± 0.029
0.168MetTrp: 0.168 ± 0.01
0.25MetTyr: 0.25 ± 0.012
0.0MetXaa: 0.0 ± 0.0
Asn
2.079AsnAla: 2.079 ± 0.036
0.138AsnCys: 0.138 ± 0.009
0.874AsnAsp: 0.874 ± 0.027
0.68AsnGlu: 0.68 ± 0.023
0.416AsnPhe: 0.416 ± 0.018
1.425AsnGly: 1.425 ± 0.034
0.328AsnHis: 0.328 ± 0.015
0.574AsnIle: 0.574 ± 0.019
0.279AsnLys: 0.279 ± 0.016
1.506AsnLeu: 1.506 ± 0.036
0.239AsnMet: 0.239 ± 0.011
0.325AsnAsn: 0.325 ± 0.017
1.084AsnPro: 1.084 ± 0.027
0.396AsnGln: 0.396 ± 0.015
1.112AsnArg: 1.112 ± 0.025
0.59AsnSer: 0.59 ± 0.023
0.797AsnThr: 0.797 ± 0.025
1.431AsnVal: 1.431 ± 0.034
0.198AsnTrp: 0.198 ± 0.011
0.329AsnTyr: 0.329 ± 0.016
0.0AsnXaa: 0.0 ± 0.0
Pro
9.468ProAla: 9.468 ± 0.11
0.432ProCys: 0.432 ± 0.019
4.126ProAsp: 4.126 ± 0.055
3.884ProGlu: 3.884 ± 0.058
1.604ProPhe: 1.604 ± 0.033
6.897ProGly: 6.897 ± 0.08
1.221ProHis: 1.221 ± 0.031
1.724ProIle: 1.724 ± 0.032
0.983ProLys: 0.983 ± 0.029
5.09ProLeu: 5.09 ± 0.068
1.069ProMet: 1.069 ± 0.024
0.828ProAsn: 0.828 ± 0.025
3.783ProPro: 3.783 ± 0.074
1.539ProGln: 1.539 ± 0.034
4.751ProArg: 4.751 ± 0.073
3.036ProSer: 3.036 ± 0.047
3.036ProThr: 3.036 ± 0.053
5.083ProVal: 5.083 ± 0.056
0.868ProTrp: 0.868 ± 0.023
1.006ProTyr: 1.006 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
4.018GlnAla: 4.018 ± 0.056
0.158GlnCys: 0.158 ± 0.011
1.389GlnAsp: 1.389 ± 0.037
1.404GlnGlu: 1.404 ± 0.031
0.619GlnPhe: 0.619 ± 0.021
2.187GlnGly: 2.187 ± 0.035
0.621GlnHis: 0.621 ± 0.021
1.06GlnIle: 1.06 ± 0.028
0.429GlnLys: 0.429 ± 0.019
2.622GlnLeu: 2.622 ± 0.043
0.435GlnMet: 0.435 ± 0.02
0.38GlnAsn: 0.38 ± 0.016
1.433GlnPro: 1.433 ± 0.033
0.898GlnGln: 0.898 ± 0.031
2.757GlnArg: 2.757 ± 0.049
1.03GlnSer: 1.03 ± 0.028
1.362GlnThr: 1.362 ± 0.031
2.145GlnVal: 2.145 ± 0.038
0.309GlnTrp: 0.309 ± 0.015
0.441GlnTyr: 0.441 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
12.194ArgAla: 12.194 ± 0.116
0.768ArgCys: 0.768 ± 0.022
5.003ArgAsp: 5.003 ± 0.058
5.163ArgGlu: 5.163 ± 0.071
2.706ArgPhe: 2.706 ± 0.043
6.892ArgGly: 6.892 ± 0.076
2.367ArgHis: 2.367 ± 0.039
3.36ArgIle: 3.36 ± 0.05
1.158ArgLys: 1.158 ± 0.029
9.377ArgLeu: 9.377 ± 0.085
1.874ArgMet: 1.874 ± 0.036
1.129ArgAsn: 1.129 ± 0.028
5.616ArgPro: 5.616 ± 0.077
2.067ArgGln: 2.067 ± 0.037
10.534ArgArg: 10.534 ± 0.134
4.186ArgSer: 4.186 ± 0.06
4.587ArgThr: 4.587 ± 0.055
6.789ArgVal: 6.789 ± 0.06
1.408ArgTrp: 1.408 ± 0.033
1.642ArgTyr: 1.642 ± 0.035
0.0ArgXaa: 0.0 ± 0.0
Ser
6.076SerAla: 6.076 ± 0.068
0.445SerCys: 0.445 ± 0.018
2.433SerAsp: 2.433 ± 0.043
2.083SerGlu: 2.083 ± 0.035
1.387SerPhe: 1.387 ± 0.033
5.033SerGly: 5.033 ± 0.066
0.934SerHis: 0.934 ± 0.025
1.759SerIle: 1.759 ± 0.038
0.811SerLys: 0.811 ± 0.025
4.024SerLeu: 4.024 ± 0.063
1.032SerMet: 1.032 ± 0.024
0.724SerAsn: 0.724 ± 0.024
2.939SerPro: 2.939 ± 0.05
1.06SerGln: 1.06 ± 0.03
3.532SerArg: 3.532 ± 0.054
2.738SerSer: 2.738 ± 0.056
2.816SerThr: 2.816 ± 0.053
3.611SerVal: 3.611 ± 0.046
0.735SerTrp: 0.735 ± 0.024
0.981SerTyr: 0.981 ± 0.029
0.0SerXaa: 0.0 ± 0.0
Thr
7.323ThrAla: 7.323 ± 0.085
0.468ThrCys: 0.468 ± 0.019
2.881ThrAsp: 2.881 ± 0.043
2.464ThrGlu: 2.464 ± 0.049
1.734ThrPhe: 1.734 ± 0.037
5.632ThrGly: 5.632 ± 0.068
1.1ThrHis: 1.1 ± 0.03
2.356ThrIle: 2.356 ± 0.041
0.932ThrLys: 0.932 ± 0.029
5.446ThrLeu: 5.446 ± 0.059
1.028ThrMet: 1.028 ± 0.024
0.935ThrAsn: 0.935 ± 0.023
3.677ThrPro: 3.677 ± 0.054
1.235ThrGln: 1.235 ± 0.028
3.846ThrArg: 3.846 ± 0.052
2.795ThrSer: 2.795 ± 0.048
3.558ThrThr: 3.558 ± 0.063
4.89ThrVal: 4.89 ± 0.06
0.773ThrTrp: 0.773 ± 0.024
1.155ThrTyr: 1.155 ± 0.029
0.0ThrXaa: 0.0 ± 0.0
Val
13.32ValAla: 13.32 ± 0.114
0.836ValCys: 0.836 ± 0.024
5.232ValAsp: 5.232 ± 0.053
4.407ValGlu: 4.407 ± 0.053
2.26ValPhe: 2.26 ± 0.037
7.215ValGly: 7.215 ± 0.073
2.003ValHis: 2.003 ± 0.033
3.11ValIle: 3.11 ± 0.041
1.386ValLys: 1.386 ± 0.039
9.877ValLeu: 9.877 ± 0.101
1.287ValMet: 1.287 ± 0.033
1.517ValAsn: 1.517 ± 0.038
5.103ValPro: 5.103 ± 0.056
2.328ValGln: 2.328 ± 0.04
7.063ValArg: 7.063 ± 0.065
3.749ValSer: 3.749 ± 0.053
4.92ValThr: 4.92 ± 0.061
8.849ValVal: 8.849 ± 0.093
0.921ValTrp: 0.921 ± 0.025
1.354ValTyr: 1.354 ± 0.031
0.0ValXaa: 0.0 ± 0.0
Trp
1.562TrpAla: 1.562 ± 0.032
0.153TrpCys: 0.153 ± 0.01
0.77TrpAsp: 0.77 ± 0.028
0.667TrpGlu: 0.667 ± 0.019
0.453TrpPhe: 0.453 ± 0.018
1.006TrpGly: 1.006 ± 0.026
0.347TrpHis: 0.347 ± 0.014
0.558TrpIle: 0.558 ± 0.018
0.211TrpLys: 0.211 ± 0.011
1.531TrpLeu: 1.531 ± 0.031
0.317TrpMet: 0.317 ± 0.014
0.277TrpAsn: 0.277 ± 0.013
0.746TrpPro: 0.746 ± 0.021
0.383TrpGln: 0.383 ± 0.018
1.319TrpArg: 1.319 ± 0.034
0.791TrpSer: 0.791 ± 0.024
0.848TrpThr: 0.848 ± 0.026
0.892TrpVal: 0.892 ± 0.025
0.289TrpTrp: 0.289 ± 0.013
0.254TrpTyr: 0.254 ± 0.014
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.447TyrAla: 2.447 ± 0.046
0.183TyrCys: 0.183 ± 0.011
1.242TyrAsp: 1.242 ± 0.028
1.109TyrGlu: 1.109 ± 0.026
0.51TyrPhe: 0.51 ± 0.019
1.879TyrGly: 1.879 ± 0.038
0.382TyrHis: 0.382 ± 0.016
0.466TyrIle: 0.466 ± 0.019
0.317TyrLys: 0.317 ± 0.016
1.863TyrLeu: 1.863 ± 0.04
0.236TyrMet: 0.236 ± 0.013
0.321TyrAsn: 0.321 ± 0.014
0.896TyrPro: 0.896 ± 0.024
0.416TyrGln: 0.416 ± 0.015
1.593TyrArg: 1.593 ± 0.037
0.765TyrSer: 0.765 ± 0.023
0.893TyrThr: 0.893 ± 0.025
1.589TyrVal: 1.589 ± 0.036
0.254TyrTrp: 0.254 ± 0.014
0.367TyrTyr: 0.367 ± 0.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4785 proteins (1506352 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski