Amino acid dipeptide frequency for Shinella sp. MEC087

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency: how often each ordered pair of adjacent residues occurs in the proteome.
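
As an illustration, dipeptide frequencies can be obtained by sliding a window of two residues over every protein sequence. The sketch below is not the database's own code; it assumes one-letter amino acid codes (the table uses three-letter codes), overlapping pairs, and pairs that never span two proteins. The function name and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return per-mille frequencies of overlapping dipeptides (one-letter codes)."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome, for illustration only.
toy_proteome = ["MAAL", "MAL"]
freqs = dipeptide_frequencies(toy_proteome)
```

By construction, the frequencies returned for a proteome add up to 1000‰.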

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random with equal probability, each of the 400 would therefore have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
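
The uniform expectation and the colour rule can be sketched as follows. This is a minimal illustration, assuming the baseline is the 400-dipeptide uniform value and that any deviation above or below it triggers a colour; the page does not state the exact threshold it uses, and the function names are hypothetical.

```python
def expected_uniform_permille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide as over- or underrepresented versus the uniform baseline."""
    expected = expected_uniform_permille(alphabet_size)
    if observed_permille > expected:
        return "red"    # more frequent than expected
    if observed_permille < expected:
        return "blue"   # less frequent than expected
    return "neutral"


# Two values taken from the table: AlaAla and CysCys.
ala_ala = classify(17.133)
cys_cys = classify(0.116)
```

With 21 symbols (Xaa included), `expected_uniform_permille(21)` gives roughly 2.27‰ instead of 2.5‰.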

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
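
For example, the conversion can be written out as below. The count estimate is only a rough sketch: it assumes that the number of dipeptides equals the number of amino acids minus the number of proteins (overlapping pairs, none spanning two proteins), which the page does not confirm. The function names are hypothetical; the totals are those given in the statistics line of this page.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction."""
    return value_permille * 1e-3


def estimated_count(value_permille, n_residues, n_proteins):
    """Rough number of occurrences, assuming n_residues - n_proteins dipeptides."""
    total_pairs = n_residues - n_proteins  # assumption, see the text above
    return permille_to_fraction(value_permille) * total_pairs


# AlaAla from the table, with the proteome totals reported on this page.
ala_ala_fraction = permille_to_fraction(17.133)
ala_ala_count = estimated_count(17.133, 1578796, 5159)
```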

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.133 ± 0.141
AlaCys: 1.026 ± 0.027
AlaAsp: 7.202 ± 0.068
AlaGlu: 7.719 ± 0.089
AlaPhe: 4.859 ± 0.057
AlaGly: 11.22 ± 0.091
AlaHis: 2.142 ± 0.034
AlaIle: 7.0 ± 0.067
AlaLys: 4.191 ± 0.062
AlaLeu: 13.24 ± 0.124
AlaMet: 3.589 ± 0.052
AlaAsn: 3.012 ± 0.04
AlaPro: 4.708 ± 0.065
AlaGln: 3.464 ± 0.052
AlaArg: 8.39 ± 0.084
AlaSer: 6.462 ± 0.066
AlaThr: 6.107 ± 0.066
AlaVal: 9.149 ± 0.092
AlaTrp: 1.406 ± 0.033
AlaTyr: 2.652 ± 0.04
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.898 ± 0.026
CysCys: 0.116 ± 0.01
CysAsp: 0.516 ± 0.018
CysGlu: 0.444 ± 0.019
CysPhe: 0.296 ± 0.014
CysGly: 0.903 ± 0.028
CysHis: 0.213 ± 0.011
CysIle: 0.417 ± 0.017
CysLys: 0.188 ± 0.012
CysLeu: 0.775 ± 0.023
CysMet: 0.153 ± 0.01
CysAsn: 0.198 ± 0.013
CysPro: 0.379 ± 0.016
CysGln: 0.193 ± 0.012
CysArg: 0.6 ± 0.019
CysSer: 0.415 ± 0.018
CysThr: 0.412 ± 0.016
CysVal: 0.562 ± 0.018
CysTrp: 0.106 ± 0.009
CysTyr: 0.199 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.015 ± 0.064
AspCys: 0.497 ± 0.018
AspAsp: 3.19 ± 0.051
AspGlu: 3.51 ± 0.05
AspPhe: 2.316 ± 0.035
AspGly: 5.302 ± 0.057
AspHis: 1.218 ± 0.029
AspIle: 3.543 ± 0.054
AspLys: 1.858 ± 0.043
AspLeu: 5.735 ± 0.058
AspMet: 1.494 ± 0.033
AspAsn: 1.44 ± 0.031
AspPro: 3.155 ± 0.046
AspGln: 1.577 ± 0.035
AspArg: 4.22 ± 0.058
AspSer: 2.026 ± 0.032
AspThr: 2.656 ± 0.037
AspVal: 4.417 ± 0.055
AspTrp: 0.932 ± 0.027
AspTyr: 1.542 ± 0.037
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.727 ± 0.079
GluCys: 0.383 ± 0.016
GluAsp: 2.881 ± 0.04
GluGlu: 3.443 ± 0.06
GluPhe: 1.808 ± 0.036
GluGly: 4.353 ± 0.054
GluHis: 1.133 ± 0.027
GluIle: 3.724 ± 0.048
GluLys: 2.784 ± 0.045
GluLeu: 4.899 ± 0.064
GluMet: 1.543 ± 0.03
GluAsn: 1.905 ± 0.037
GluPro: 2.634 ± 0.048
GluGln: 1.993 ± 0.036
GluArg: 4.906 ± 0.063
GluSer: 2.307 ± 0.04
GluThr: 3.928 ± 0.05
GluVal: 3.579 ± 0.052
GluTrp: 0.704 ± 0.021
GluTyr: 1.025 ± 0.027
GluXaa: 0.001 ± 0.001
Phe
PheAla: 4.587 ± 0.06
PheCys: 0.399 ± 0.018
PheAsp: 2.725 ± 0.039
PheGlu: 2.219 ± 0.039
PhePhe: 1.545 ± 0.038
PheGly: 3.746 ± 0.054
PheHis: 0.751 ± 0.022
PheIle: 1.844 ± 0.036
PheLys: 1.103 ± 0.032
PheLeu: 3.672 ± 0.054
PheMet: 0.889 ± 0.024
PheAsn: 1.116 ± 0.028
PhePro: 1.597 ± 0.033
PheGln: 1.025 ± 0.026
PheArg: 2.378 ± 0.032
PheSer: 2.428 ± 0.043
PheThr: 2.022 ± 0.035
PheVal: 2.978 ± 0.048
PheTrp: 0.55 ± 0.021
PheTyr: 0.956 ± 0.023
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.159 ± 0.077
GlyCys: 0.761 ± 0.023
GlyAsp: 4.317 ± 0.057
GlyGlu: 4.961 ± 0.064
GlyPhe: 3.642 ± 0.049
GlyGly: 7.136 ± 0.079
GlyHis: 1.882 ± 0.04
GlyIle: 5.264 ± 0.066
GlyLys: 3.705 ± 0.061
GlyLeu: 8.845 ± 0.088
GlyMet: 2.343 ± 0.041
GlyAsn: 2.356 ± 0.034
GlyPro: 3.179 ± 0.046
GlyGln: 2.586 ± 0.045
GlyArg: 6.076 ± 0.068
GlySer: 4.489 ± 0.052
GlyThr: 4.763 ± 0.062
GlyVal: 6.233 ± 0.066
GlyTrp: 1.255 ± 0.034
GlyTyr: 2.394 ± 0.039
GlyXaa: 0.001 ± 0.001
His
HisAla: 2.366 ± 0.044
HisCys: 0.235 ± 0.012
HisAsp: 1.255 ± 0.034
HisGlu: 1.043 ± 0.028
HisPhe: 0.908 ± 0.028
HisGly: 1.863 ± 0.039
HisHis: 0.559 ± 0.021
HisIle: 1.045 ± 0.028
HisLys: 0.528 ± 0.017
HisLeu: 1.927 ± 0.038
HisMet: 0.53 ± 0.016
HisAsn: 0.481 ± 0.016
HisPro: 1.225 ± 0.03
HisGln: 0.552 ± 0.017
HisArg: 1.359 ± 0.031
HisSer: 0.88 ± 0.023
HisThr: 0.778 ± 0.019
HisVal: 1.496 ± 0.035
HisTrp: 0.284 ± 0.013
HisTyr: 0.543 ± 0.015
HisXaa: 0.0 ± 0.0
Ile
IleAla: 8.128 ± 0.089
IleCys: 0.536 ± 0.017
IleAsp: 3.88 ± 0.047
IleGlu: 3.707 ± 0.056
IlePhe: 1.858 ± 0.036
IleGly: 5.362 ± 0.066
IleHis: 0.972 ± 0.026
IleIle: 2.611 ± 0.044
IleLys: 1.567 ± 0.033
IleLeu: 4.795 ± 0.065
IleMet: 1.128 ± 0.026
IleAsn: 1.583 ± 0.034
IlePro: 2.402 ± 0.043
IleGln: 1.226 ± 0.027
IleArg: 3.526 ± 0.044
IleSer: 3.14 ± 0.05
IleThr: 2.856 ± 0.046
IleVal: 4.585 ± 0.055
IleTrp: 0.631 ± 0.021
IleTyr: 1.216 ± 0.033
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.909 ± 0.064
LysCys: 0.156 ± 0.011
LysAsp: 2.033 ± 0.033
LysGlu: 1.853 ± 0.034
LysPhe: 0.936 ± 0.026
LysGly: 2.886 ± 0.05
LysHis: 0.623 ± 0.022
LysIle: 1.94 ± 0.04
LysLys: 1.487 ± 0.039
LysLeu: 3.408 ± 0.05
LysMet: 0.825 ± 0.025
LysAsn: 0.973 ± 0.027
LysPro: 2.154 ± 0.037
LysGln: 1.031 ± 0.027
LysArg: 2.485 ± 0.044
LysSer: 2.067 ± 0.039
LysThr: 2.242 ± 0.044
LysVal: 2.679 ± 0.048
LysTrp: 0.382 ± 0.016
LysTyr: 0.667 ± 0.024
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.197 ± 0.11
LeuCys: 0.846 ± 0.023
LeuAsp: 5.877 ± 0.064
LeuGlu: 5.203 ± 0.061
LeuPhe: 3.774 ± 0.06
LeuGly: 8.01 ± 0.073
LeuHis: 1.772 ± 0.034
LeuIle: 5.07 ± 0.064
LeuLys: 3.987 ± 0.056
LeuLeu: 9.239 ± 0.101
LeuMet: 2.349 ± 0.039
LeuAsn: 2.551 ± 0.041
LeuPro: 5.201 ± 0.068
LeuGln: 2.591 ± 0.033
LeuArg: 6.076 ± 0.078
LeuSer: 6.955 ± 0.088
LeuThr: 5.445 ± 0.075
LeuVal: 7.544 ± 0.078
LeuTrp: 1.043 ± 0.028
LeuTyr: 2.144 ± 0.036
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.191 ± 0.05
MetCys: 0.146 ± 0.009
MetAsp: 1.196 ± 0.029
MetGlu: 1.212 ± 0.028
MetPhe: 0.681 ± 0.021
MetGly: 1.791 ± 0.035
MetHis: 0.472 ± 0.019
MetIle: 1.454 ± 0.028
MetLys: 1.129 ± 0.024
MetLeu: 2.565 ± 0.044
MetMet: 0.685 ± 0.022
MetAsn: 0.859 ± 0.026
MetPro: 1.461 ± 0.034
MetGln: 0.872 ± 0.023
MetArg: 1.833 ± 0.033
MetSer: 1.733 ± 0.034
MetThr: 2.005 ± 0.038
MetVal: 1.768 ± 0.038
MetTrp: 0.194 ± 0.013
MetTyr: 0.277 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.441 ± 0.055
AsnCys: 0.21 ± 0.011
AsnAsp: 1.564 ± 0.033
AsnGlu: 1.33 ± 0.034
AsnPhe: 1.0 ± 0.023
AsnGly: 2.674 ± 0.044
AsnHis: 0.543 ± 0.02
AsnIle: 1.508 ± 0.032
AsnLys: 0.759 ± 0.023
AsnLeu: 2.573 ± 0.035
AsnMet: 0.661 ± 0.02
AsnAsn: 0.758 ± 0.026
AsnPro: 1.841 ± 0.037
AsnGln: 0.78 ± 0.021
AsnArg: 1.962 ± 0.036
AsnSer: 1.229 ± 0.029
AsnThr: 1.32 ± 0.027
AsnVal: 1.999 ± 0.037
AsnTrp: 0.445 ± 0.016
AsnTyr: 0.665 ± 0.019
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.928 ± 0.068
ProCys: 0.279 ± 0.016
ProAsp: 3.502 ± 0.038
ProGlu: 3.456 ± 0.048
ProPhe: 2.074 ± 0.037
ProGly: 4.031 ± 0.056
ProHis: 1.019 ± 0.029
ProIle: 2.256 ± 0.041
ProLys: 1.662 ± 0.039
ProLeu: 4.458 ± 0.058
ProMet: 1.11 ± 0.027
ProAsn: 1.234 ± 0.028
ProPro: 1.938 ± 0.04
ProGln: 1.509 ± 0.033
ProArg: 2.519 ± 0.046
ProSer: 2.637 ± 0.041
ProThr: 2.322 ± 0.04
ProVal: 4.272 ± 0.053
ProTrp: 0.604 ± 0.02
ProTyr: 1.2 ± 0.03
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.788 ± 0.054
GlnCys: 0.181 ± 0.011
GlnAsp: 1.4 ± 0.032
GlnGlu: 1.543 ± 0.029
GlnPhe: 1.005 ± 0.024
GlnGly: 2.169 ± 0.039
GlnHis: 0.571 ± 0.02
GlnIle: 1.78 ± 0.055
GlnLys: 1.191 ± 0.029
GlnLeu: 2.614 ± 0.041
GlnMet: 0.837 ± 0.02
GlnAsn: 0.867 ± 0.026
GlnPro: 1.605 ± 0.029
GlnGln: 1.202 ± 0.034
GlnArg: 2.147 ± 0.039
GlnSer: 1.745 ± 0.03
GlnThr: 1.694 ± 0.035
GlnVal: 2.038 ± 0.041
GlnTrp: 0.375 ± 0.015
GlnTyr: 0.581 ± 0.02
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.487 ± 0.082
ArgCys: 0.447 ± 0.017
ArgAsp: 4.001 ± 0.056
ArgGlu: 4.03 ± 0.056
ArgPhe: 2.913 ± 0.042
ArgGly: 4.521 ± 0.052
ArgHis: 1.667 ± 0.033
ArgIle: 4.037 ± 0.053
ArgLys: 2.613 ± 0.043
ArgLeu: 7.395 ± 0.087
ArgMet: 1.931 ± 0.037
ArgAsn: 2.01 ± 0.033
ArgPro: 3.294 ± 0.049
ArgGln: 2.513 ± 0.047
ArgArg: 5.253 ± 0.078
ArgSer: 3.581 ± 0.052
ArgThr: 3.442 ± 0.048
ArgVal: 4.507 ± 0.062
ArgTrp: 0.887 ± 0.027
ArgTyr: 1.742 ± 0.034
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.437 ± 0.066
SerCys: 0.37 ± 0.014
SerAsp: 3.024 ± 0.048
SerGlu: 2.877 ± 0.048
SerPhe: 2.284 ± 0.039
SerGly: 5.771 ± 0.063
SerHis: 1.128 ± 0.028
SerIle: 3.025 ± 0.047
SerLys: 1.656 ± 0.036
SerLeu: 5.523 ± 0.071
SerMet: 1.373 ± 0.033
SerAsn: 1.416 ± 0.032
SerPro: 2.667 ± 0.043
SerGln: 1.571 ± 0.032
SerArg: 3.699 ± 0.042
SerSer: 2.983 ± 0.052
SerThr: 2.779 ± 0.046
SerVal: 4.115 ± 0.05
SerTrp: 0.696 ± 0.02
SerTyr: 1.336 ± 0.028
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.452 ± 0.079
ThrCys: 0.409 ± 0.017
ThrAsp: 2.864 ± 0.044
ThrGlu: 2.701 ± 0.04
ThrPhe: 2.195 ± 0.038
ThrGly: 5.166 ± 0.061
ThrHis: 1.005 ± 0.026
ThrIle: 3.235 ± 0.045
ThrLys: 1.599 ± 0.034
ThrLeu: 5.75 ± 0.069
ThrMet: 1.345 ± 0.035
ThrAsn: 1.328 ± 0.031
ThrPro: 3.161 ± 0.042
ThrGln: 1.423 ± 0.037
ThrArg: 3.221 ± 0.044
ThrSer: 2.883 ± 0.043
ThrThr: 3.011 ± 0.053
ThrVal: 4.686 ± 0.057
ThrTrp: 0.629 ± 0.021
ThrTyr: 1.266 ± 0.027
ThrXaa: 0.001 ± 0.001
Val
ValAla: 9.108 ± 0.064
ValCys: 0.621 ± 0.021
ValAsp: 4.122 ± 0.054
ValGlu: 4.585 ± 0.055
ValPhe: 2.935 ± 0.047
ValGly: 5.488 ± 0.066
ValHis: 1.396 ± 0.034
ValIle: 4.232 ± 0.051
ValLys: 2.567 ± 0.046
ValLeu: 7.58 ± 0.082
ValMet: 1.915 ± 0.036
ValAsn: 2.085 ± 0.041
ValPro: 3.667 ± 0.054
ValGln: 1.919 ± 0.036
ValArg: 4.878 ± 0.052
ValSer: 4.734 ± 0.052
ValThr: 4.7 ± 0.057
ValVal: 6.057 ± 0.074
ValTrp: 0.863 ± 0.024
ValTyr: 1.581 ± 0.037
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.122 ± 0.03
TrpCys: 0.122 ± 0.008
TrpAsp: 0.588 ± 0.018
TrpGlu: 0.57 ± 0.018
TrpPhe: 0.524 ± 0.023
TrpGly: 0.782 ± 0.023
TrpHis: 0.301 ± 0.015
TrpIle: 0.666 ± 0.02
TrpLys: 0.487 ± 0.018
TrpLeu: 1.594 ± 0.036
TrpMet: 0.35 ± 0.016
TrpAsn: 0.469 ± 0.017
TrpPro: 0.652 ± 0.02
TrpGln: 0.585 ± 0.021
TrpArg: 0.991 ± 0.028
TrpSer: 0.807 ± 0.026
TrpThr: 0.715 ± 0.023
TrpVal: 0.724 ± 0.021
TrpTrp: 0.213 ± 0.011
TrpTyr: 0.304 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.599 ± 0.039
TyrCys: 0.243 ± 0.013
TyrAsp: 1.501 ± 0.033
TyrGlu: 1.254 ± 0.03
TyrPhe: 0.977 ± 0.027
TyrGly: 2.172 ± 0.04
TyrHis: 0.49 ± 0.018
TyrIle: 1.062 ± 0.026
TyrLys: 0.672 ± 0.023
TyrLeu: 2.288 ± 0.037
TyrMet: 0.478 ± 0.019
TyrAsn: 0.622 ± 0.022
TyrPro: 1.089 ± 0.026
TyrGln: 0.723 ± 0.022
TyrArg: 1.75 ± 0.033
TyrSer: 1.228 ± 0.026
TyrThr: 1.131 ± 0.029
TyrVal: 1.644 ± 0.034
TyrTrp: 0.366 ± 0.016
TyrTyr: 0.639 ± 0.021
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.001 ± 0.001
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.001 ± 0.001
XaaPhe: 0.001 ± 0.001
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.001 ± 0.001
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5159 proteins (1578796 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
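
A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. This is only an illustration under stated assumptions. The page does not say which spread measure it reports, so the standard deviation used here is an assumption, as are the one-letter codes, the function names and the fixed seed.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    total = 0
    hits = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, not individual residues.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(resample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, for illustration only.
toy_proteome = ["MAAL", "MAL", "MKAA", "AAAA"]
error = bootstrap_error(toy_proteome, "AA")
```

Resampling proteins rather than residues keeps the dependence between neighbouring positions within each protein intact.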

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski