Amino acid dipeptide frequency for Agromyces sp. NDB4Y10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
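As a rough illustration, dipeptide frequencies can be computed along the following lines. This is a minimal sketch, not the Proteome-pI implementation: the function names are hypothetical, one-letter codes are used instead of the three-letter codes shown in the table, and it assumes that overlapping pairs are counted within each protein and that any non-standard residue is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

def dipeptide_counts(proteins):
    """Count overlapping dipeptides; residues outside the standard 20 become 'X'."""
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Pairs never span two proteins: each sequence is scanned on its own.
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    return counts

def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille (‰) of all counted dipeptides."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (assumption: two made-up sequences, not taken from the proteome).
toy = ["MAAL", "AAG"]
freqs = dipeptide_per_mille(toy)
print(freqs["AA"])  # 2 of the 5 dipeptides are AA
```

In this toy input the pairs are MA, AA, AL, AA and AG, so AA accounts for two of five dipeptides.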

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability, each of the 400 dipeptides would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
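The expectation above is simple arithmetic and can be sketched as follows. The helper names are hypothetical, and the strict greater-than/less-than comparison against the uniform baseline is an assumption; the page does not state the exact rule used for colouring. The two example values are the AlaAla and CysCys entries from the table.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Frequency (in ‰) of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expectation (assumed colouring rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"   # would be marked red
    if observed_per_mille < expected_per_mille:
        return "under"  # would be marked blue
    return "as expected"

expected = uniform_expectation_per_mille()   # 1000 / 400 = 2.5 ‰, i.e. 0.25%
expected_with_xaa = uniform_expectation_per_mille(21)  # 1000 / 441 ‰

print(expected)                   # 2.5
print(classify(22.859, expected)) # over  (AlaAla)
print(classify(0.041, expected))  # under (CysCys)
```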

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
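As a quick check of the unit, the conversion can be sketched in Python. The helper names are illustrative only and are not part of Proteome-pI; the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value (‰) to a plain fraction."""
    return value_per_mille * 1e-3

def per_mille_to_percent(value_per_mille):
    """Convert a per mille value (‰) to percent (%)."""
    return value_per_mille / 10.0

ala_ala = 22.859  # AlaAla frequency in ‰, taken from the table
print(per_mille_to_fraction(ala_ala))  # roughly 0.0229 as a fraction
print(per_mille_to_percent(ala_ala))   # roughly 2.29 percent
```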

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 22.859 ± 0.24
AlaCys: 0.645 ± 0.026
AlaAsp: 9.25 ± 0.114
AlaGlu: 9.44 ± 0.11
AlaPhe: 4.129 ± 0.078
AlaGly: 13.051 ± 0.129
AlaHis: 2.528 ± 0.055
AlaIle: 5.907 ± 0.086
AlaLys: 2.545 ± 0.06
AlaLeu: 13.907 ± 0.136
AlaMet: 2.814 ± 0.057
AlaAsn: 2.319 ± 0.053
AlaPro: 6.963 ± 0.09
AlaGln: 3.121 ± 0.063
AlaArg: 10.22 ± 0.142
AlaSer: 7.701 ± 0.085
AlaThr: 7.643 ± 0.1
AlaVal: 12.303 ± 0.143
AlaTrp: 2.047 ± 0.044
AlaTyr: 2.402 ± 0.046
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.58 ± 0.027
CysCys: 0.041 ± 0.007
CysAsp: 0.303 ± 0.019
CysGlu: 0.248 ± 0.016
CysPhe: 0.146 ± 0.014
CysGly: 0.533 ± 0.025
CysHis: 0.134 ± 0.011
CysIle: 0.164 ± 0.012
CysLys: 0.046 ± 0.006
CysLeu: 0.371 ± 0.023
CysMet: 0.049 ± 0.007
CysAsn: 0.083 ± 0.01
CysPro: 0.283 ± 0.017
CysGln: 0.103 ± 0.012
CysArg: 0.338 ± 0.021
CysSer: 0.304 ± 0.016
CysThr: 0.296 ± 0.018
CysVal: 0.371 ± 0.02
CysTrp: 0.071 ± 0.008
CysTyr: 0.085 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 10.194 ± 0.12
AspCys: 0.232 ± 0.016
AspAsp: 4.749 ± 0.088
AspGlu: 5.061 ± 0.087
AspPhe: 1.634 ± 0.042
AspGly: 6.535 ± 0.104
AspHis: 1.272 ± 0.045
AspIle: 1.91 ± 0.048
AspLys: 0.713 ± 0.037
AspLeu: 6.735 ± 0.102
AspMet: 0.725 ± 0.027
AspAsn: 0.76 ± 0.03
AspPro: 4.52 ± 0.08
AspGln: 1.416 ± 0.044
AspArg: 5.475 ± 0.091
AspSer: 2.213 ± 0.05
AspThr: 2.608 ± 0.051
AspVal: 5.586 ± 0.075
AspTrp: 1.025 ± 0.032
AspTyr: 1.181 ± 0.036
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.034 ± 0.111
GluCys: 0.252 ± 0.017
GluAsp: 2.534 ± 0.057
GluGlu: 3.276 ± 0.069
GluPhe: 1.893 ± 0.042
GluGly: 4.444 ± 0.077
GluHis: 1.745 ± 0.042
GluIle: 2.553 ± 0.06
GluLys: 1.039 ± 0.041
GluLeu: 7.485 ± 0.1
GluMet: 0.871 ± 0.03
GluAsn: 0.994 ± 0.038
GluPro: 3.826 ± 0.068
GluGln: 2.33 ± 0.06
GluArg: 6.505 ± 0.097
GluSer: 2.74 ± 0.051
GluThr: 3.098 ± 0.061
GluVal: 5.301 ± 0.087
GluTrp: 1.02 ± 0.037
GluTyr: 1.202 ± 0.036
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.38 ± 0.069
PheCys: 0.143 ± 0.011
PheAsp: 2.432 ± 0.052
PheGlu: 1.842 ± 0.039
PhePhe: 1.026 ± 0.034
PheGly: 3.417 ± 0.059
PheHis: 0.549 ± 0.025
PheIle: 1.144 ± 0.039
PheLys: 0.37 ± 0.02
PheLeu: 2.722 ± 0.053
PheMet: 0.439 ± 0.022
PheAsn: 0.615 ± 0.026
PhePro: 1.361 ± 0.042
PheGln: 0.706 ± 0.026
PheArg: 1.976 ± 0.048
PheSer: 1.479 ± 0.043
PheThr: 2.005 ± 0.047
PheVal: 2.713 ± 0.064
PheTrp: 0.496 ± 0.026
PheTyr: 0.6 ± 0.027
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.501 ± 0.124
GlyCys: 0.551 ± 0.026
GlyAsp: 5.594 ± 0.079
GlyGlu: 5.475 ± 0.084
GlyPhe: 3.304 ± 0.06
GlyGly: 8.104 ± 0.12
GlyHis: 1.79 ± 0.046
GlyIle: 4.643 ± 0.077
GlyLys: 1.742 ± 0.055
GlyLeu: 8.904 ± 0.107
GlyMet: 1.928 ± 0.049
GlyAsn: 1.536 ± 0.044
GlyPro: 4.235 ± 0.078
GlyGln: 2.151 ± 0.046
GlyArg: 7.443 ± 0.094
GlySer: 5.107 ± 0.076
GlyThr: 5.548 ± 0.089
GlyVal: 8.163 ± 0.109
GlyTrp: 1.743 ± 0.048
GlyTyr: 2.204 ± 0.045
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.634 ± 0.055
HisCys: 0.116 ± 0.01
HisAsp: 1.454 ± 0.039
HisGlu: 1.305 ± 0.039
HisPhe: 0.529 ± 0.024
HisGly: 2.126 ± 0.049
HisHis: 0.531 ± 0.025
HisIle: 0.557 ± 0.026
HisLys: 0.224 ± 0.014
HisLeu: 2.148 ± 0.045
HisMet: 0.276 ± 0.016
HisAsn: 0.306 ± 0.016
HisPro: 1.619 ± 0.038
HisGln: 0.431 ± 0.021
HisArg: 1.748 ± 0.044
HisSer: 0.857 ± 0.03
HisThr: 0.829 ± 0.031
HisVal: 1.844 ± 0.046
HisTrp: 0.283 ± 0.018
HisTyr: 0.414 ± 0.019
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.749 ± 0.09
IleCys: 0.198 ± 0.013
IleAsp: 3.312 ± 0.053
IleGlu: 2.941 ± 0.069
IlePhe: 0.977 ± 0.043
IleGly: 4.321 ± 0.074
IleHis: 0.625 ± 0.023
IleIle: 1.477 ± 0.046
IleLys: 0.597 ± 0.031
IleLeu: 3.317 ± 0.066
IleMet: 0.536 ± 0.021
IleAsn: 0.644 ± 0.027
IlePro: 2.24 ± 0.049
IleGln: 0.816 ± 0.031
IleArg: 2.992 ± 0.063
IleSer: 1.841 ± 0.042
IleThr: 2.413 ± 0.05
IleVal: 4.287 ± 0.074
IleTrp: 0.49 ± 0.025
IleTyr: 0.627 ± 0.025
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.998 ± 0.054
LysCys: 0.051 ± 0.007
LysAsp: 0.807 ± 0.033
LysGlu: 0.803 ± 0.036
LysPhe: 0.426 ± 0.022
LysGly: 1.344 ± 0.049
LysHis: 0.398 ± 0.021
LysIle: 0.66 ± 0.032
LysLys: 0.533 ± 0.03
LysLeu: 1.724 ± 0.055
LysMet: 0.284 ± 0.018
LysAsn: 0.366 ± 0.02
LysPro: 1.064 ± 0.039
LysGln: 0.635 ± 0.03
LysArg: 1.439 ± 0.045
LysSer: 0.916 ± 0.032
LysThr: 0.987 ± 0.034
LysVal: 1.479 ± 0.043
LysTrp: 0.233 ± 0.018
LysTyr: 0.388 ± 0.023
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.861 ± 0.157
LeuCys: 0.388 ± 0.019
LeuAsp: 7.149 ± 0.101
LeuGlu: 6.014 ± 0.086
LeuPhe: 2.618 ± 0.058
LeuGly: 9.571 ± 0.12
LeuHis: 1.821 ± 0.045
LeuIle: 3.918 ± 0.078
LeuLys: 1.556 ± 0.047
LeuLeu: 9.919 ± 0.122
LeuMet: 1.478 ± 0.039
LeuAsn: 1.6 ± 0.041
LeuPro: 5.385 ± 0.079
LeuGln: 2.305 ± 0.052
LeuArg: 7.557 ± 0.09
LeuSer: 4.767 ± 0.076
LeuThr: 5.432 ± 0.07
LeuVal: 10.035 ± 0.122
LeuTrp: 1.203 ± 0.035
LeuTyr: 1.563 ± 0.047
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.056 ± 0.044
MetCys: 0.091 ± 0.011
MetAsp: 0.774 ± 0.028
MetGlu: 0.587 ± 0.026
MetPhe: 0.497 ± 0.021
MetGly: 1.247 ± 0.039
MetHis: 0.361 ± 0.018
MetIle: 0.709 ± 0.025
MetLys: 0.362 ± 0.019
MetLeu: 1.785 ± 0.04
MetMet: 0.263 ± 0.019
MetAsn: 0.448 ± 0.025
MetPro: 1.235 ± 0.038
MetGln: 0.54 ± 0.022
MetArg: 1.428 ± 0.039
MetSer: 1.397 ± 0.037
MetThr: 1.595 ± 0.04
MetVal: 1.262 ± 0.036
MetTrp: 0.18 ± 0.012
MetTyr: 0.282 ± 0.017
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.279 ± 0.052
AsnCys: 0.106 ± 0.009
AsnAsp: 1.004 ± 0.038
AsnGlu: 0.915 ± 0.033
AsnPhe: 0.56 ± 0.026
AsnGly: 1.792 ± 0.047
AsnHis: 0.362 ± 0.019
AsnIle: 0.662 ± 0.028
AsnLys: 0.289 ± 0.02
AsnLeu: 1.767 ± 0.043
AsnMet: 0.265 ± 0.017
AsnAsn: 0.365 ± 0.021
AsnPro: 1.424 ± 0.039
AsnGln: 0.446 ± 0.02
AsnArg: 1.298 ± 0.037
AsnSer: 0.831 ± 0.033
AsnThr: 0.93 ± 0.037
AsnVal: 1.467 ± 0.04
AsnTrp: 0.287 ± 0.017
AsnTyr: 0.395 ± 0.02
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.968 ± 0.103
ProCys: 0.192 ± 0.014
ProAsp: 4.287 ± 0.074
ProGlu: 4.262 ± 0.067
ProPhe: 1.688 ± 0.044
ProGly: 5.544 ± 0.089
ProHis: 1.133 ± 0.037
ProIle: 2.174 ± 0.046
ProLys: 0.972 ± 0.039
ProLeu: 4.715 ± 0.076
ProMet: 0.915 ± 0.03
ProAsn: 1.023 ± 0.027
ProPro: 2.596 ± 0.068
ProGln: 1.377 ± 0.036
ProArg: 3.602 ± 0.068
ProSer: 3.028 ± 0.058
ProThr: 3.186 ± 0.06
ProVal: 5.322 ± 0.084
ProTrp: 0.917 ± 0.029
ProTyr: 1.052 ± 0.036
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.349 ± 0.064
GlnCys: 0.107 ± 0.01
GlnAsp: 1.002 ± 0.033
GlnGlu: 1.18 ± 0.04
GlnPhe: 0.793 ± 0.031
GlnGly: 1.914 ± 0.044
GlnHis: 0.595 ± 0.024
GlnIle: 0.985 ± 0.032
GlnLys: 0.506 ± 0.024
GlnLeu: 2.796 ± 0.053
GlnMet: 0.399 ± 0.02
GlnAsn: 0.517 ± 0.026
GlnPro: 1.518 ± 0.041
GlnGln: 0.997 ± 0.032
GlnArg: 2.328 ± 0.051
GlnSer: 1.232 ± 0.036
GlnThr: 1.225 ± 0.039
GlnVal: 2.403 ± 0.053
GlnTrp: 0.392 ± 0.019
GlnTyr: 0.557 ± 0.026
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.022 ± 0.13
ArgCys: 0.304 ± 0.02
ArgAsp: 4.869 ± 0.08
ArgGlu: 4.902 ± 0.081
ArgPhe: 2.666 ± 0.055
ArgGly: 5.926 ± 0.089
ArgHis: 1.838 ± 0.045
ArgIle: 3.819 ± 0.064
ArgLys: 1.159 ± 0.04
ArgLeu: 8.27 ± 0.119
ArgMet: 1.942 ± 0.046
ArgAsn: 1.376 ± 0.041
ArgPro: 4.368 ± 0.074
ArgGln: 1.868 ± 0.047
ArgArg: 7.789 ± 0.129
ArgSer: 4.086 ± 0.069
ArgThr: 4.349 ± 0.067
ArgVal: 6.749 ± 0.087
ArgTrp: 1.267 ± 0.036
ArgTyr: 1.717 ± 0.043
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.638 ± 0.093
SerCys: 0.231 ± 0.014
SerAsp: 2.837 ± 0.05
SerGlu: 2.55 ± 0.05
SerPhe: 1.643 ± 0.049
SerGly: 5.438 ± 0.079
SerHis: 0.92 ± 0.031
SerIle: 2.273 ± 0.057
SerLys: 0.907 ± 0.039
SerLeu: 4.717 ± 0.072
SerMet: 1.06 ± 0.031
SerAsn: 0.917 ± 0.025
SerPro: 2.926 ± 0.055
SerGln: 1.175 ± 0.039
SerArg: 3.672 ± 0.064
SerSer: 2.9 ± 0.073
SerThr: 3.284 ± 0.054
SerVal: 4.616 ± 0.07
SerTrp: 0.807 ± 0.028
SerTyr: 1.05 ± 0.038
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.539 ± 0.097
ThrCys: 0.248 ± 0.016
ThrAsp: 3.622 ± 0.068
ThrGlu: 2.911 ± 0.058
ThrPhe: 1.698 ± 0.042
ThrGly: 5.83 ± 0.079
ThrHis: 1.04 ± 0.031
ThrIle: 2.521 ± 0.048
ThrLys: 0.967 ± 0.032
ThrLeu: 5.358 ± 0.079
ThrMet: 0.933 ± 0.036
ThrAsn: 1.031 ± 0.035
ThrPro: 3.716 ± 0.071
ThrGln: 1.228 ± 0.036
ThrArg: 3.701 ± 0.066
ThrSer: 2.954 ± 0.055
ThrThr: 3.644 ± 0.072
ThrVal: 5.507 ± 0.083
ThrTrp: 0.829 ± 0.029
ThrTyr: 1.057 ± 0.038
ThrXaa: 0.0 ± 0.0
Val
ValAla: 12.985 ± 0.141
ValCys: 0.442 ± 0.02
ValAsp: 6.453 ± 0.087
ValGlu: 5.574 ± 0.08
ValPhe: 2.907 ± 0.059
ValGly: 7.515 ± 0.084
ValHis: 1.904 ± 0.045
ValIle: 4.118 ± 0.072
ValLys: 1.473 ± 0.04
ValLeu: 9.635 ± 0.126
ValMet: 1.347 ± 0.039
ValAsn: 1.754 ± 0.046
ValPro: 4.997 ± 0.07
ValGln: 2.115 ± 0.05
ValArg: 6.732 ± 0.083
ValSer: 4.309 ± 0.065
ValThr: 5.357 ± 0.073
ValVal: 9.784 ± 0.134
ValTrp: 1.09 ± 0.038
ValTyr: 1.497 ± 0.04
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.764 ± 0.046
TrpCys: 0.091 ± 0.011
TrpAsp: 0.727 ± 0.028
TrpGlu: 0.656 ± 0.026
TrpPhe: 0.604 ± 0.028
TrpGly: 1.089 ± 0.037
TrpHis: 0.346 ± 0.019
TrpIle: 0.709 ± 0.028
TrpLys: 0.262 ± 0.016
TrpLeu: 1.75 ± 0.042
TrpMet: 0.355 ± 0.021
TrpAsn: 0.409 ± 0.023
TrpPro: 0.723 ± 0.027
TrpGln: 0.54 ± 0.024
TrpArg: 1.318 ± 0.04
TrpSer: 0.936 ± 0.031
TrpThr: 0.891 ± 0.034
TrpVal: 1.183 ± 0.037
TrpTrp: 0.387 ± 0.022
TrpTyr: 0.338 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.4 ± 0.054
TyrCys: 0.122 ± 0.01
TyrAsp: 1.387 ± 0.042
TyrGlu: 1.198 ± 0.036
TyrPhe: 0.678 ± 0.025
TyrGly: 1.927 ± 0.046
TyrHis: 0.34 ± 0.018
TyrIle: 0.498 ± 0.025
TyrLys: 0.269 ± 0.016
TyrLeu: 2.116 ± 0.052
TyrMet: 0.22 ± 0.016
TyrAsn: 0.379 ± 0.022
TyrPro: 0.953 ± 0.031
TyrGln: 0.495 ± 0.022
TyrArg: 1.725 ± 0.039
TyrSer: 0.963 ± 0.032
TyrThr: 1.016 ± 0.037
TyrVal: 1.612 ± 0.039
TyrTrp: 0.312 ± 0.019
TyrTyr: 0.433 ± 0.021
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3157 proteins (1011233 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
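A protein-level bootstrap of this kind could be sketched as below. This is an illustration under stated assumptions, not the Proteome-pI code: the function names are hypothetical, one-letter codes are used, whole proteins are resampled with replacement, and the reported error is taken to be the standard deviation across replicates (the page does not say which spread measure is used).

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in ‰.

    Each round resamples whole proteins with replacement, so residues from
    the same protein always stay together (bootstrap at the protein level).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # a Counter returns 0 for missing pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input (assumption: made-up sequences, not taken from the proteome).
toy = ["MAAL", "AAG", "GGGA", "LLAA"]
mean, err = bootstrap_error(toy, "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual dipeptides keeps the within-protein composition intact, which is presumably why the error is described as being estimated at the protein level.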

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski