Amino acid dipeptide frequency for Deinococcus koreensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
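The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function names `dipeptide_permille` and `classify` and the toy sequences are hypothetical, the sequences use one-letter codes (the table uses three-letter codes), and the sketch assumes that dipeptides are counted as overlapping residue pairs that never span two proteins.

```python
from collections import Counter

# Uniform expectation for 400 equally likely dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_EXPECTED_PERMILLE = 1000 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: pairs overlap and are taken within each protein separately.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


toy = ["AAGL", "ALAA"]  # hypothetical toy sequences, not real proteins
freqs = dipeptide_permille(toy)
```

Applied to values from the table, `classify(17.443)` (AlaAla) gives `"over"` and `classify(0.059)` (CysCys) gives `"under"`, relative to the 2.5‰ uniform expectation.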

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
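As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage. The helper name `permille_to_fraction` is hypothetical; the input is the AlaAla value from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala = permille_to_fraction(17.443)  # AlaAla value taken from the table
print(f"{ala_ala:.6f}")          # fraction of all dipeptides
print(f"{ala_ala * 100:.4f} %")  # the same value expressed in percent
```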

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.443 ± 0.175
AlaCys: 0.852 ± 0.025
AlaAsp: 5.948 ± 0.07
AlaGlu: 6.326 ± 0.083
AlaPhe: 4.14 ± 0.057
AlaGly: 12.401 ± 0.126
AlaHis: 2.703 ± 0.047
AlaIle: 3.493 ± 0.065
AlaLys: 2.159 ± 0.053
AlaLeu: 17.523 ± 0.203
AlaMet: 2.169 ± 0.039
AlaAsn: 2.182 ± 0.046
AlaPro: 7.066 ± 0.101
AlaGln: 6.294 ± 0.088
AlaArg: 10.626 ± 0.108
AlaSer: 6.02 ± 0.079
AlaThr: 6.094 ± 0.093
AlaVal: 8.791 ± 0.089
AlaTrp: 1.838 ± 0.043
AlaTyr: 2.776 ± 0.047
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.782 ± 0.028
CysCys: 0.059 ± 0.008
CysAsp: 0.284 ± 0.014
CysGlu: 0.265 ± 0.017
CysPhe: 0.144 ± 0.011
CysGly: 0.673 ± 0.023
CysHis: 0.13 ± 0.01
CysIle: 0.174 ± 0.011
CysLys: 0.081 ± 0.008
CysLeu: 0.56 ± 0.019
CysMet: 0.086 ± 0.009
CysAsn: 0.104 ± 0.009
CysPro: 0.398 ± 0.018
CysGln: 0.166 ± 0.012
CysArg: 0.383 ± 0.02
CysSer: 0.33 ± 0.017
CysThr: 0.394 ± 0.025
CysVal: 0.445 ± 0.018
CysTrp: 0.083 ± 0.008
CysTyr: 0.122 ± 0.01
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.481 ± 0.067
AspCys: 0.251 ± 0.014
AspAsp: 2.546 ± 0.058
AspGlu: 2.94 ± 0.053
AspPhe: 1.829 ± 0.035
AspGly: 4.721 ± 0.064
AspHis: 1.078 ± 0.028
AspIle: 1.792 ± 0.041
AspLys: 0.913 ± 0.031
AspLeu: 6.163 ± 0.079
AspMet: 0.914 ± 0.024
AspAsn: 0.816 ± 0.031
AspPro: 3.557 ± 0.061
AspGln: 1.567 ± 0.034
AspArg: 3.044 ± 0.041
AspSer: 2.279 ± 0.041
AspThr: 2.838 ± 0.053
AspVal: 4.347 ± 0.067
AspTrp: 0.871 ± 0.03
AspTyr: 1.175 ± 0.036
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.094 ± 0.104
GluCys: 0.228 ± 0.013
GluAsp: 2.243 ± 0.049
GluGlu: 2.67 ± 0.06
GluPhe: 1.702 ± 0.036
GluGly: 4.449 ± 0.063
GluHis: 1.32 ± 0.032
GluIle: 2.018 ± 0.046
GluLys: 1.287 ± 0.04
GluLeu: 6.692 ± 0.088
GluMet: 0.941 ± 0.03
GluAsn: 1.075 ± 0.033
GluPro: 2.482 ± 0.051
GluGln: 2.221 ± 0.046
GluArg: 5.216 ± 0.081
GluSer: 2.188 ± 0.04
GluThr: 2.75 ± 0.044
GluVal: 4.673 ± 0.068
GluTrp: 0.728 ± 0.023
GluTyr: 1.194 ± 0.035
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.475 ± 0.058
PheCys: 0.181 ± 0.012
PheAsp: 1.793 ± 0.038
PheGlu: 1.682 ± 0.041
PhePhe: 0.951 ± 0.031
PheGly: 3.155 ± 0.049
PheHis: 0.599 ± 0.024
PheIle: 1.152 ± 0.037
PheLys: 0.753 ± 0.03
PheLeu: 3.25 ± 0.062
PheMet: 0.58 ± 0.021
PheAsn: 0.831 ± 0.027
PhePro: 1.714 ± 0.037
PheGln: 1.118 ± 0.032
PheArg: 2.038 ± 0.041
PheSer: 1.856 ± 0.037
PheThr: 2.276 ± 0.042
PheVal: 2.442 ± 0.037
PheTrp: 0.446 ± 0.022
PheTyr: 0.736 ± 0.026
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.206 ± 0.108
GlyCys: 0.577 ± 0.022
GlyAsp: 4.297 ± 0.064
GlyGlu: 5.461 ± 0.084
GlyPhe: 3.036 ± 0.061
GlyGly: 9.182 ± 0.113
GlyHis: 2.049 ± 0.044
GlyIle: 3.203 ± 0.06
GlyLys: 2.555 ± 0.06
GlyLeu: 11.325 ± 0.114
GlyMet: 1.928 ± 0.043
GlyAsn: 2.013 ± 0.045
GlyPro: 4.045 ± 0.064
GlyGln: 4.185 ± 0.061
GlyArg: 7.157 ± 0.086
GlySer: 5.015 ± 0.075
GlyThr: 5.796 ± 0.089
GlyVal: 8.163 ± 0.085
GlyTrp: 1.562 ± 0.039
GlyTyr: 2.328 ± 0.046
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.749 ± 0.056
HisCys: 0.126 ± 0.009
HisAsp: 1.196 ± 0.034
HisGlu: 1.093 ± 0.033
HisPhe: 0.674 ± 0.02
HisGly: 2.019 ± 0.044
HisHis: 0.66 ± 0.026
HisIle: 0.643 ± 0.027
HisLys: 0.351 ± 0.016
HisLeu: 2.662 ± 0.044
HisMet: 0.333 ± 0.015
HisAsn: 0.359 ± 0.017
HisPro: 1.634 ± 0.039
HisGln: 0.594 ± 0.022
HisArg: 1.41 ± 0.04
HisSer: 0.977 ± 0.029
HisThr: 1.162 ± 0.029
HisVal: 1.481 ± 0.037
HisTrp: 0.336 ± 0.015
HisTyr: 0.518 ± 0.021
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.883 ± 0.06
IleCys: 0.201 ± 0.014
IleAsp: 1.982 ± 0.039
IleGlu: 2.093 ± 0.047
IlePhe: 0.992 ± 0.033
IleGly: 3.371 ± 0.055
IleHis: 0.801 ± 0.026
IleIle: 1.3 ± 0.041
IleLys: 0.806 ± 0.034
IleLeu: 3.371 ± 0.065
IleMet: 0.577 ± 0.025
IleAsn: 0.863 ± 0.028
IlePro: 1.947 ± 0.039
IleGln: 1.221 ± 0.031
IleArg: 2.552 ± 0.051
IleSer: 1.977 ± 0.047
IleThr: 2.161 ± 0.046
IleVal: 2.801 ± 0.055
IleTrp: 0.337 ± 0.018
IleTyr: 0.747 ± 0.024
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.792 ± 0.062
LysCys: 0.086 ± 0.009
LysAsp: 1.129 ± 0.035
LysGlu: 1.016 ± 0.033
LysPhe: 0.673 ± 0.027
LysGly: 1.713 ± 0.046
LysHis: 0.409 ± 0.018
LysIle: 0.892 ± 0.031
LysLys: 0.817 ± 0.033
LysLeu: 2.396 ± 0.054
LysMet: 0.56 ± 0.02
LysAsn: 0.65 ± 0.029
LysPro: 1.199 ± 0.034
LysGln: 0.672 ± 0.026
LysArg: 1.443 ± 0.035
LysSer: 1.1 ± 0.035
LysThr: 1.54 ± 0.04
LysVal: 1.967 ± 0.049
LysTrp: 0.245 ± 0.014
LysTyr: 0.61 ± 0.021
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.826 ± 0.166
LeuCys: 0.711 ± 0.025
LeuAsp: 6.492 ± 0.071
LeuGlu: 6.51 ± 0.084
LeuPhe: 2.863 ± 0.059
LeuGly: 11.539 ± 0.132
LeuHis: 2.456 ± 0.047
LeuIle: 4.512 ± 0.078
LeuLys: 2.956 ± 0.053
LeuLeu: 13.215 ± 0.176
LeuMet: 2.117 ± 0.042
LeuAsn: 3.198 ± 0.05
LeuPro: 7.363 ± 0.093
LeuGln: 3.835 ± 0.061
LeuArg: 9.319 ± 0.112
LeuSer: 8.271 ± 0.098
LeuThr: 7.33 ± 0.084
LeuVal: 7.365 ± 0.092
LeuTrp: 1.422 ± 0.039
LeuTyr: 2.398 ± 0.044
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.916 ± 0.038
MetCys: 0.062 ± 0.006
MetAsp: 0.843 ± 0.026
MetGlu: 0.784 ± 0.027
MetPhe: 0.498 ± 0.025
MetGly: 1.482 ± 0.035
MetHis: 0.362 ± 0.015
MetIle: 0.766 ± 0.026
MetLys: 0.677 ± 0.023
MetLeu: 2.021 ± 0.043
MetMet: 0.335 ± 0.017
MetAsn: 0.695 ± 0.024
MetPro: 1.147 ± 0.031
MetGln: 0.655 ± 0.02
MetArg: 1.323 ± 0.036
MetSer: 1.213 ± 0.033
MetThr: 1.808 ± 0.035
MetVal: 1.201 ± 0.032
MetTrp: 0.184 ± 0.012
MetTyr: 0.352 ± 0.018
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.943 ± 0.048
AsnCys: 0.143 ± 0.011
AsnAsp: 1.051 ± 0.031
AsnGlu: 0.913 ± 0.027
AsnPhe: 0.82 ± 0.03
AsnGly: 1.834 ± 0.047
AsnHis: 0.43 ± 0.016
AsnIle: 0.936 ± 0.026
AsnLys: 0.504 ± 0.021
AsnLeu: 2.724 ± 0.044
AsnMet: 0.421 ± 0.017
AsnAsn: 0.58 ± 0.027
AsnPro: 1.711 ± 0.046
AsnGln: 0.611 ± 0.023
AsnArg: 1.456 ± 0.035
AsnSer: 1.053 ± 0.033
AsnThr: 1.345 ± 0.045
AsnVal: 2.003 ± 0.049
AsnTrp: 0.311 ± 0.016
AsnTyr: 0.586 ± 0.022
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.471 ± 0.122
ProCys: 0.264 ± 0.015
ProAsp: 4.031 ± 0.061
ProGlu: 4.281 ± 0.06
ProPhe: 1.67 ± 0.037
ProGly: 6.428 ± 0.098
ProHis: 1.285 ± 0.03
ProIle: 1.518 ± 0.038
ProLys: 1.054 ± 0.031
ProLeu: 6.421 ± 0.096
ProMet: 1.13 ± 0.028
ProAsn: 1.194 ± 0.033
ProPro: 3.65 ± 0.072
ProGln: 2.294 ± 0.048
ProArg: 3.575 ± 0.058
ProSer: 2.94 ± 0.051
ProThr: 3.388 ± 0.062
ProVal: 4.604 ± 0.078
ProTrp: 0.805 ± 0.028
ProTyr: 1.268 ± 0.033
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.805 ± 0.077
GlnCys: 0.136 ± 0.012
GlnAsp: 1.874 ± 0.042
GlnGlu: 2.066 ± 0.043
GlnPhe: 0.99 ± 0.029
GlnGly: 3.966 ± 0.06
GlnHis: 0.743 ± 0.028
GlnIle: 1.305 ± 0.034
GlnLys: 0.882 ± 0.026
GlnLeu: 3.601 ± 0.059
GlnMet: 0.626 ± 0.022
GlnAsn: 0.841 ± 0.026
GlnPro: 2.352 ± 0.052
GlnGln: 1.526 ± 0.042
GlnArg: 2.759 ± 0.058
GlnSer: 1.869 ± 0.042
GlnThr: 2.512 ± 0.042
GlnVal: 3.143 ± 0.05
GlnTrp: 0.422 ± 0.019
GlnTyr: 0.759 ± 0.023
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.919 ± 0.111
ArgCys: 0.359 ± 0.017
ArgAsp: 3.738 ± 0.056
ArgGlu: 4.945 ± 0.081
ArgPhe: 2.402 ± 0.046
ArgGly: 6.118 ± 0.083
ArgHis: 1.647 ± 0.033
ArgIle: 2.546 ± 0.051
ArgLys: 1.396 ± 0.035
ArgLeu: 8.843 ± 0.107
ArgMet: 1.453 ± 0.036
ArgAsn: 1.42 ± 0.032
ArgPro: 4.79 ± 0.066
ArgGln: 2.881 ± 0.054
ArgArg: 6.098 ± 0.081
ArgSer: 3.794 ± 0.06
ArgThr: 3.9 ± 0.059
ArgVal: 6.319 ± 0.076
ArgTrp: 1.162 ± 0.031
ArgTyr: 1.796 ± 0.041
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.991 ± 0.089
SerCys: 0.332 ± 0.021
SerAsp: 2.594 ± 0.05
SerGlu: 2.414 ± 0.043
SerPhe: 1.801 ± 0.037
SerGly: 6.576 ± 0.095
SerHis: 0.9 ± 0.027
SerIle: 1.558 ± 0.039
SerLys: 1.021 ± 0.036
SerLeu: 5.907 ± 0.081
SerMet: 0.967 ± 0.03
SerAsn: 1.134 ± 0.038
SerPro: 3.52 ± 0.054
SerGln: 1.568 ± 0.037
SerArg: 3.713 ± 0.053
SerSer: 2.982 ± 0.054
SerThr: 3.032 ± 0.058
SerVal: 4.557 ± 0.066
SerTrp: 0.677 ± 0.022
SerTyr: 1.19 ± 0.033
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.62 ± 0.089
ThrCys: 0.351 ± 0.021
ThrAsp: 2.72 ± 0.048
ThrGlu: 2.323 ± 0.045
ThrPhe: 2.206 ± 0.046
ThrGly: 5.468 ± 0.076
ThrHis: 1.19 ± 0.033
ThrIle: 1.862 ± 0.047
ThrLys: 0.922 ± 0.031
ThrLeu: 8.745 ± 0.101
ThrMet: 0.852 ± 0.026
ThrAsn: 1.184 ± 0.04
ThrPro: 4.981 ± 0.088
ThrGln: 1.899 ± 0.038
ThrArg: 4.245 ± 0.062
ThrSer: 2.964 ± 0.056
ThrThr: 3.33 ± 0.074
ThrVal: 5.312 ± 0.089
ThrTrp: 0.759 ± 0.021
ThrTyr: 1.479 ± 0.038
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.895 ± 0.083
ValCys: 0.513 ± 0.021
ValAsp: 3.443 ± 0.06
ValGlu: 3.729 ± 0.065
ValPhe: 2.46 ± 0.041
ValGly: 6.621 ± 0.074
ValHis: 1.388 ± 0.036
ValIle: 3.239 ± 0.056
ValLys: 2.0 ± 0.053
ValLeu: 8.998 ± 0.086
ValMet: 1.683 ± 0.038
ValAsn: 2.272 ± 0.046
ValPro: 4.615 ± 0.068
ValGln: 3.378 ± 0.061
ValArg: 5.987 ± 0.073
ValSer: 4.648 ± 0.066
ValThr: 5.373 ± 0.091
ValVal: 6.113 ± 0.093
ValTrp: 1.161 ± 0.031
ValTyr: 1.84 ± 0.032
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.633 ± 0.036
TrpCys: 0.088 ± 0.009
TrpAsp: 0.65 ± 0.022
TrpGlu: 0.582 ± 0.022
TrpPhe: 0.407 ± 0.017
TrpGly: 1.105 ± 0.036
TrpHis: 0.33 ± 0.016
TrpIle: 0.449 ± 0.021
TrpLys: 0.348 ± 0.018
TrpLeu: 1.735 ± 0.043
TrpMet: 0.332 ± 0.016
TrpAsn: 0.486 ± 0.021
TrpPro: 0.777 ± 0.027
TrpGln: 0.674 ± 0.022
TrpArg: 1.173 ± 0.033
TrpSer: 0.714 ± 0.022
TrpThr: 0.953 ± 0.031
TrpVal: 0.991 ± 0.033
TrpTrp: 0.26 ± 0.015
TrpTyr: 0.26 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.921 ± 0.051
TyrCys: 0.142 ± 0.011
TyrAsp: 1.27 ± 0.037
TyrGlu: 1.077 ± 0.038
TyrPhe: 0.769 ± 0.024
TyrGly: 2.261 ± 0.04
TyrHis: 0.498 ± 0.02
TyrIle: 0.624 ± 0.024
TyrLys: 0.429 ± 0.021
TyrLeu: 2.546 ± 0.048
TyrMet: 0.319 ± 0.015
TyrAsn: 0.48 ± 0.022
TyrPro: 1.32 ± 0.03
TyrGln: 0.818 ± 0.025
TyrArg: 2.011 ± 0.04
TyrSer: 1.243 ± 0.033
TyrThr: 1.429 ± 0.039
TyrVal: 1.638 ± 0.04
TyrTrp: 0.34 ± 0.019
TyrTyr: 0.573 ± 0.026
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3967 proteins (1274002 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
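A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration rather than the Proteome-pI implementation: the function names are hypothetical, sequences use one-letter codes, and the sketch assumes that the reported error is the standard deviation of the per mille frequency across 100 resamples in which whole proteins are drawn with replacement.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Whole proteins are resampled with replacement, so residues from the
    same protein always stay together (bootstrap at the protein level).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]  # count once, reuse
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation of the resampled estimates.
    return statistics.stdev(estimates)
```

Resampling proteins rather than individual residues keeps the within-protein composition intact, which matters because dipeptide usage can differ strongly between proteins.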

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski