Amino acid dipepetide frequency for Pseudonocardia sp. HH130630-07

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
23.983AlaAla: 23.983 ± 0.217
0.988AlaCys: 0.988 ± 0.027
9.571AlaAsp: 9.571 ± 0.087
8.753AlaGlu: 8.753 ± 0.083
3.338AlaPhe: 3.338 ± 0.05
16.439AlaGly: 16.439 ± 0.132
2.806AlaHis: 2.806 ± 0.043
3.586AlaIle: 3.586 ± 0.051
1.484AlaLys: 1.484 ± 0.033
14.596AlaLeu: 14.596 ± 0.124
2.404AlaMet: 2.404 ± 0.038
1.639AlaAsn: 1.639 ± 0.032
8.103AlaPro: 8.103 ± 0.1
3.186AlaGln: 3.186 ± 0.05
11.552AlaArg: 11.552 ± 0.104
5.282AlaSer: 5.282 ± 0.061
7.873AlaThr: 7.873 ± 0.093
14.211AlaVal: 14.211 ± 0.119
1.837AlaTrp: 1.837 ± 0.036
2.039AlaTyr: 2.039 ± 0.034
0.0AlaXaa: 0.0 ± 0.0
Cys
1.084CysAla: 1.084 ± 0.024
0.085CysCys: 0.085 ± 0.007
0.457CysAsp: 0.457 ± 0.015
0.348CysGlu: 0.348 ± 0.013
0.174CysPhe: 0.174 ± 0.01
0.904CysGly: 0.904 ± 0.026
0.185CysHis: 0.185 ± 0.01
0.157CysIle: 0.157 ± 0.01
0.063CysLys: 0.063 ± 0.007
0.61CysLeu: 0.61 ± 0.019
0.087CysMet: 0.087 ± 0.007
0.1CysAsn: 0.1 ± 0.007
0.474CysPro: 0.474 ± 0.018
0.105CysGln: 0.105 ± 0.008
0.631CysArg: 0.631 ± 0.022
0.424CysSer: 0.424 ± 0.017
0.483CysThr: 0.483 ± 0.022
0.609CysVal: 0.609 ± 0.019
0.114CysTrp: 0.114 ± 0.009
0.14CysTyr: 0.14 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
8.858AspAla: 8.858 ± 0.105
0.351AspCys: 0.351 ± 0.019
4.649AspAsp: 4.649 ± 0.06
3.901AspGlu: 3.901 ± 0.052
1.235AspPhe: 1.235 ± 0.03
7.503AspGly: 7.503 ± 0.077
1.424AspHis: 1.424 ± 0.031
1.371AspIle: 1.371 ± 0.035
0.599AspLys: 0.599 ± 0.023
6.786AspLeu: 6.786 ± 0.066
0.69AspMet: 0.69 ± 0.022
0.671AspAsn: 0.671 ± 0.02
5.939AspPro: 5.939 ± 0.067
1.346AspGln: 1.346 ± 0.028
6.693AspArg: 6.693 ± 0.076
2.172AspSer: 2.172 ± 0.044
3.323AspThr: 3.323 ± 0.047
5.729AspVal: 5.729 ± 0.06
0.873AspTrp: 0.873 ± 0.022
0.992AspTyr: 0.992 ± 0.027
0.0AspXaa: 0.0 ± 0.0
Glu
5.644GluAla: 5.644 ± 0.074
0.308GluCys: 0.308 ± 0.015
2.395GluAsp: 2.395 ± 0.043
2.31GluGlu: 2.31 ± 0.042
1.515GluPhe: 1.515 ± 0.032
3.173GluGly: 3.173 ± 0.051
1.716GluHis: 1.716 ± 0.035
2.425GluIle: 2.425 ± 0.034
0.89GluLys: 0.89 ± 0.03
7.077GluLeu: 7.077 ± 0.081
0.781GluMet: 0.781 ± 0.021
0.881GluAsn: 0.881 ± 0.023
3.736GluPro: 3.736 ± 0.054
2.343GluGln: 2.343 ± 0.04
5.641GluArg: 5.641 ± 0.06
2.288GluSer: 2.288 ± 0.038
2.662GluThr: 2.662 ± 0.045
4.528GluVal: 4.528 ± 0.047
0.693GluTrp: 0.693 ± 0.021
0.85GluTyr: 0.85 ± 0.026
0.0GluXaa: 0.0 ± 0.0
Phe
3.778PheAla: 3.778 ± 0.049
0.257PheCys: 0.257 ± 0.012
2.036PheAsp: 2.036 ± 0.037
1.072PheGlu: 1.072 ± 0.027
0.727PhePhe: 0.727 ± 0.023
3.058PheGly: 3.058 ± 0.044
0.565PheHis: 0.565 ± 0.018
0.462PheIle: 0.462 ± 0.02
0.24PheLys: 0.24 ± 0.011
2.396PheLeu: 2.396 ± 0.041
0.282PheMet: 0.282 ± 0.013
0.367PheAsn: 0.367 ± 0.015
1.285PhePro: 1.285 ± 0.032
0.436PheGln: 0.436 ± 0.016
1.819PheArg: 1.819 ± 0.036
1.278PheSer: 1.278 ± 0.029
1.75PheThr: 1.75 ± 0.038
2.394PheVal: 2.394 ± 0.037
0.385PheTrp: 0.385 ± 0.016
0.515PheTyr: 0.515 ± 0.018
0.0PheXaa: 0.0 ± 0.0
Gly
13.105GlyAla: 13.105 ± 0.107
0.844GlyCys: 0.844 ± 0.026
5.752GlyAsp: 5.752 ± 0.066
5.086GlyGlu: 5.086 ± 0.055
2.841GlyPhe: 2.841 ± 0.039
9.617GlyGly: 9.617 ± 0.118
2.336GlyHis: 2.336 ± 0.046
3.584GlyIle: 3.584 ± 0.062
1.359GlyLys: 1.359 ± 0.034
10.042GlyLeu: 10.042 ± 0.094
2.093GlyMet: 2.093 ± 0.035
1.455GlyAsn: 1.455 ± 0.034
6.893GlyPro: 6.893 ± 0.089
2.249GlyGln: 2.249 ± 0.041
8.993GlyArg: 8.993 ± 0.104
5.711GlySer: 5.711 ± 0.08
7.041GlyThr: 7.041 ± 0.077
8.584GlyVal: 8.584 ± 0.084
1.801GlyTrp: 1.801 ± 0.035
2.206GlyTyr: 2.206 ± 0.042
0.0GlyXaa: 0.0 ± 0.0
His
2.777HisAla: 2.777 ± 0.049
0.184HisCys: 0.184 ± 0.01
1.571HisAsp: 1.571 ± 0.03
1.161HisGlu: 1.161 ± 0.026
0.487HisPhe: 0.487 ± 0.017
2.462HisGly: 2.462 ± 0.037
0.673HisHis: 0.673 ± 0.021
0.448HisIle: 0.448 ± 0.019
0.193HisLys: 0.193 ± 0.011
2.387HisLeu: 2.387 ± 0.047
0.253HisMet: 0.253 ± 0.011
0.289HisAsn: 0.289 ± 0.014
1.819HisPro: 1.819 ± 0.035
0.496HisGln: 0.496 ± 0.017
2.364HisArg: 2.364 ± 0.041
0.891HisSer: 0.891 ± 0.024
1.153HisThr: 1.153 ± 0.027
1.859HisVal: 1.859 ± 0.034
0.314HisTrp: 0.314 ± 0.015
0.431HisTyr: 0.431 ± 0.018
0.0HisXaa: 0.0 ± 0.0
Ile
4.737IleAla: 4.737 ± 0.055
0.237IleCys: 0.237 ± 0.012
2.291IleAsp: 2.291 ± 0.037
1.826IleGlu: 1.826 ± 0.038
0.552IlePhe: 0.552 ± 0.021
3.69IleGly: 3.69 ± 0.051
0.466IleHis: 0.466 ± 0.016
0.662IleIle: 0.662 ± 0.026
0.382IleLys: 0.382 ± 0.016
2.108IleLeu: 2.108 ± 0.047
0.368IleMet: 0.368 ± 0.015
0.496IleAsn: 0.496 ± 0.019
1.566IlePro: 1.566 ± 0.032
0.481IleGln: 0.481 ± 0.018
2.134IleArg: 2.134 ± 0.038
1.502IleSer: 1.502 ± 0.032
1.962IleThr: 1.962 ± 0.035
2.931IleVal: 2.931 ± 0.05
0.326IleTrp: 0.326 ± 0.014
0.446IleTyr: 0.446 ± 0.017
0.0IleXaa: 0.0 ± 0.0
Lys
1.493LysAla: 1.493 ± 0.033
0.05LysCys: 0.05 ± 0.005
0.645LysAsp: 0.645 ± 0.022
0.529LysGlu: 0.529 ± 0.019
0.247LysPhe: 0.247 ± 0.014
0.916LysGly: 0.916 ± 0.025
0.252LysHis: 0.252 ± 0.012
0.569LysIle: 0.569 ± 0.02
0.352LysLys: 0.352 ± 0.019
1.236LysLeu: 1.236 ± 0.037
0.219LysMet: 0.219 ± 0.011
0.229LysAsn: 0.229 ± 0.013
0.713LysPro: 0.713 ± 0.024
0.39LysGln: 0.39 ± 0.017
0.907LysArg: 0.907 ± 0.026
0.62LysSer: 0.62 ± 0.021
0.694LysThr: 0.694 ± 0.022
1.086LysVal: 1.086 ± 0.028
0.13LysTrp: 0.13 ± 0.007
0.218LysTyr: 0.218 ± 0.012
0.0LysXaa: 0.0 ± 0.0
Leu
16.335LeuAla: 16.335 ± 0.143
0.746LeuCys: 0.746 ± 0.022
7.694LeuAsp: 7.694 ± 0.08
4.096LeuGlu: 4.096 ± 0.049
2.463LeuPhe: 2.463 ± 0.042
9.917LeuGly: 9.917 ± 0.079
2.401LeuHis: 2.401 ± 0.039
2.509LeuIle: 2.509 ± 0.045
0.969LeuLys: 0.969 ± 0.028
11.412LeuLeu: 11.412 ± 0.111
1.315LeuMet: 1.315 ± 0.025
1.283LeuAsn: 1.283 ± 0.029
6.371LeuPro: 6.371 ± 0.069
2.155LeuGln: 2.155 ± 0.036
9.204LeuArg: 9.204 ± 0.082
4.928LeuSer: 4.928 ± 0.054
6.423LeuThr: 6.423 ± 0.06
10.381LeuVal: 10.381 ± 0.103
1.16LeuTrp: 1.16 ± 0.028
1.48LeuTyr: 1.48 ± 0.034
0.0LeuXaa: 0.0 ± 0.0
Met
1.98MetAla: 1.98 ± 0.036
0.108MetCys: 0.108 ± 0.007
0.77MetAsp: 0.77 ± 0.02
0.573MetGlu: 0.573 ± 0.019
0.452MetPhe: 0.452 ± 0.017
1.161MetGly: 1.161 ± 0.028
0.316MetHis: 0.316 ± 0.016
0.694MetIle: 0.694 ± 0.023
0.243MetLys: 0.243 ± 0.013
1.743MetLeu: 1.743 ± 0.033
0.227MetMet: 0.227 ± 0.013
0.327MetAsn: 0.327 ± 0.013
1.022MetPro: 1.022 ± 0.025
0.397MetGln: 0.397 ± 0.014
1.315MetArg: 1.315 ± 0.028
1.218MetSer: 1.218 ± 0.024
1.535MetThr: 1.535 ± 0.034
1.319MetVal: 1.319 ± 0.03
0.186MetTrp: 0.186 ± 0.01
0.248MetTyr: 0.248 ± 0.011
0.0MetXaa: 0.0 ± 0.0
Asn
1.796AsnAla: 1.796 ± 0.037
0.109AsnCys: 0.109 ± 0.008
0.785AsnAsp: 0.785 ± 0.022
0.579AsnGlu: 0.579 ± 0.018
0.358AsnPhe: 0.358 ± 0.015
1.491AsnGly: 1.491 ± 0.034
0.294AsnHis: 0.294 ± 0.013
0.469AsnIle: 0.469 ± 0.017
0.193AsnLys: 0.193 ± 0.011
1.42AsnLeu: 1.42 ± 0.034
0.216AsnMet: 0.216 ± 0.011
0.286AsnAsn: 0.286 ± 0.014
1.126AsnPro: 1.126 ± 0.03
0.395AsnGln: 0.395 ± 0.017
1.199AsnArg: 1.199 ± 0.027
0.682AsnSer: 0.682 ± 0.02
0.811AsnThr: 0.811 ± 0.024
1.22AsnVal: 1.22 ± 0.034
0.212AsnTrp: 0.212 ± 0.011
0.307AsnTyr: 0.307 ± 0.013
0.0AsnXaa: 0.0 ± 0.0
Pro
10.357ProAla: 10.357 ± 0.121
0.329ProCys: 0.329 ± 0.014
5.614ProAsp: 5.614 ± 0.065
4.238ProGlu: 4.238 ± 0.055
1.484ProPhe: 1.484 ± 0.037
8.398ProGly: 8.398 ± 0.082
1.294ProHis: 1.294 ± 0.03
1.347ProIle: 1.347 ± 0.025
0.666ProLys: 0.666 ± 0.022
5.141ProLeu: 5.141 ± 0.058
1.007ProMet: 1.007 ± 0.023
0.765ProAsn: 0.765 ± 0.022
3.992ProPro: 3.992 ± 0.075
1.556ProGln: 1.556 ± 0.03
4.296ProArg: 4.296 ± 0.057
3.109ProSer: 3.109 ± 0.052
3.254ProThr: 3.254 ± 0.046
6.538ProVal: 6.538 ± 0.069
0.909ProTrp: 0.909 ± 0.025
0.991ProTyr: 0.991 ± 0.023
0.0ProXaa: 0.0 ± 0.0
Gln
3.059GlnAla: 3.059 ± 0.049
0.145GlnCys: 0.145 ± 0.01
1.216GlnAsp: 1.216 ± 0.032
1.076GlnGlu: 1.076 ± 0.029
0.564GlnPhe: 0.564 ± 0.019
1.844GlnGly: 1.844 ± 0.038
0.539GlnHis: 0.539 ± 0.018
0.962GlnIle: 0.962 ± 0.024
0.349GlnLys: 0.349 ± 0.016
2.668GlnLeu: 2.668 ± 0.04
0.434GlnMet: 0.434 ± 0.016
0.422GlnAsn: 0.422 ± 0.019
1.344GlnPro: 1.344 ± 0.03
0.973GlnGln: 0.973 ± 0.031
2.367GlnArg: 2.367 ± 0.038
0.953GlnSer: 0.953 ± 0.027
1.076GlnThr: 1.076 ± 0.027
2.371GlnVal: 2.371 ± 0.035
0.39GlnTrp: 0.39 ± 0.013
0.43GlnTyr: 0.43 ± 0.018
0.0GlnXaa: 0.0 ± 0.0
Arg
11.579ArgAla: 11.579 ± 0.113
0.664ArgCys: 0.664 ± 0.019
5.154ArgAsp: 5.154 ± 0.06
4.581ArgGlu: 4.581 ± 0.056
2.536ArgPhe: 2.536 ± 0.037
6.47ArgGly: 6.47 ± 0.074
2.18ArgHis: 2.18 ± 0.034
3.591ArgIle: 3.591 ± 0.051
0.967ArgLys: 0.967 ± 0.033
8.746ArgLeu: 8.746 ± 0.09
1.913ArgMet: 1.913 ± 0.034
1.275ArgAsn: 1.275 ± 0.03
5.892ArgPro: 5.892 ± 0.069
1.827ArgGln: 1.827 ± 0.035
9.152ArgArg: 9.152 ± 0.092
4.837ArgSer: 4.837 ± 0.074
5.915ArgThr: 5.915 ± 0.064
6.765ArgVal: 6.765 ± 0.066
1.565ArgTrp: 1.565 ± 0.03
1.778ArgTyr: 1.778 ± 0.031
0.0ArgXaa: 0.0 ± 0.0
Ser
6.544SerAla: 6.544 ± 0.062
0.362SerCys: 0.362 ± 0.021
2.659SerAsp: 2.659 ± 0.048
2.21SerGlu: 2.21 ± 0.038
1.375SerPhe: 1.375 ± 0.029
6.231SerGly: 6.231 ± 0.073
0.808SerHis: 0.808 ± 0.022
1.341SerIle: 1.341 ± 0.028
0.563SerLys: 0.563 ± 0.018
4.102SerLeu: 4.102 ± 0.052
1.041SerMet: 1.041 ± 0.025
0.661SerAsn: 0.661 ± 0.022
2.924SerPro: 2.924 ± 0.043
0.969SerGln: 0.969 ± 0.025
3.769SerArg: 3.769 ± 0.056
2.546SerSer: 2.546 ± 0.057
3.131SerThr: 3.131 ± 0.053
4.195SerVal: 4.195 ± 0.055
0.875SerTrp: 0.875 ± 0.023
0.884SerTyr: 0.884 ± 0.024
0.0SerXaa: 0.0 ± 0.0
Thr
9.277ThrAla: 9.277 ± 0.091
0.418ThrCys: 0.418 ± 0.017
3.986ThrAsp: 3.986 ± 0.05
3.229ThrGlu: 3.229 ± 0.048
1.451ThrPhe: 1.451 ± 0.031
7.72ThrGly: 7.72 ± 0.08
1.093ThrHis: 1.093 ± 0.027
1.724ThrIle: 1.724 ± 0.036
0.633ThrLys: 0.633 ± 0.02
5.433ThrLeu: 5.433 ± 0.057
0.916ThrMet: 0.916 ± 0.026
0.809ThrAsn: 0.809 ± 0.022
4.032ThrPro: 4.032 ± 0.051
0.967ThrGln: 0.967 ± 0.024
4.242ThrArg: 4.242 ± 0.052
2.786ThrSer: 2.786 ± 0.044
3.975ThrThr: 3.975 ± 0.057
6.588ThrVal: 6.588 ± 0.07
0.857ThrTrp: 0.857 ± 0.021
1.007ThrTyr: 1.007 ± 0.025
0.0ThrXaa: 0.0 ± 0.0
Val
13.436ValAla: 13.436 ± 0.11
0.736ValCys: 0.736 ± 0.02
5.921ValAsp: 5.921 ± 0.061
4.897ValGlu: 4.897 ± 0.058
2.354ValPhe: 2.354 ± 0.039
7.759ValGly: 7.759 ± 0.07
2.183ValHis: 2.183 ± 0.037
2.42ValIle: 2.42 ± 0.042
0.937ValLys: 0.937 ± 0.027
11.691ValLeu: 11.691 ± 0.106
1.129ValMet: 1.129 ± 0.027
1.391ValAsn: 1.391 ± 0.03
6.182ValPro: 6.182 ± 0.065
2.034ValGln: 2.034 ± 0.035
7.998ValArg: 7.998 ± 0.08
4.207ValSer: 4.207 ± 0.047
6.084ValThr: 6.084 ± 0.053
10.333ValVal: 10.333 ± 0.105
1.111ValTrp: 1.111 ± 0.026
1.32ValTyr: 1.32 ± 0.026
0.0ValXaa: 0.0 ± 0.0
Trp
1.709TrpAla: 1.709 ± 0.029
0.154TrpCys: 0.154 ± 0.009
0.776TrpAsp: 0.776 ± 0.022
0.604TrpGlu: 0.604 ± 0.018
0.505TrpPhe: 0.505 ± 0.018
1.029TrpGly: 1.029 ± 0.028
0.337TrpHis: 0.337 ± 0.015
0.524TrpIle: 0.524 ± 0.017
0.206TrpLys: 0.206 ± 0.011
1.675TrpLeu: 1.675 ± 0.031
0.273TrpMet: 0.273 ± 0.012
0.309TrpAsn: 0.309 ± 0.014
0.85TrpPro: 0.85 ± 0.023
0.455TrpGln: 0.455 ± 0.017
1.359TrpArg: 1.359 ± 0.029
0.956TrpSer: 0.956 ± 0.023
1.021TrpThr: 1.021 ± 0.023
1.053TrpVal: 1.053 ± 0.024
0.359TrpTrp: 0.359 ± 0.016
0.244TrpTyr: 0.244 ± 0.013
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.14TyrAla: 2.14 ± 0.038
0.159TyrCys: 0.159 ± 0.009
1.16TyrAsp: 1.16 ± 0.029
0.788TyrGlu: 0.788 ± 0.025
0.445TyrPhe: 0.445 ± 0.019
1.81TyrGly: 1.81 ± 0.038
0.347TyrHis: 0.347 ± 0.015
0.318TyrIle: 0.318 ± 0.015
0.206TyrLys: 0.206 ± 0.01
1.974TyrLeu: 1.974 ± 0.035
0.185TyrMet: 0.185 ± 0.01
0.291TyrAsn: 0.291 ± 0.013
1.023TyrPro: 1.023 ± 0.027
0.42TyrGln: 0.42 ± 0.015
1.754TyrArg: 1.754 ± 0.03
0.8TyrSer: 0.8 ± 0.02
0.979TyrThr: 0.979 ± 0.023
1.427TyrVal: 1.427 ± 0.033
0.302TyrTrp: 0.302 ± 0.014
0.305TyrTyr: 0.305 ± 0.015
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5619 proteins (1797081 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski