Amino acid dipeptide frequency for Kocuria palustris PEL

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
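As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping adjacent residue pairs in one-letter sequences and reports them in per mille. The function name `dipeptide_permille`, the mapping of every non-standard letter to X (Xaa), and the toy sequences are assumptions made for this example; they are not taken from the database's own pipeline.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        s = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs within one protein: (0,1), (1,2), ...
        counts.update(s[i:i + 2] for i in range(len(s) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Toy input: "AAAG" gives AA, AA, AG; "AGB" becomes "AGX" and gives AG, GX.
freqs = dipeptide_permille(["AAAG", "AGB"])
print(freqs)
```

Pairs are counted only within a protein, never across the boundary between two proteins.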

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
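The uniform expectation can be checked with a few lines of arithmetic. The sketch below assumes that the colouring threshold is simply the uniform value for the 20 standard amino acids; the helper name `classify` is hypothetical, and the three sample values are copied from the table for this proteome.

```python
N_STANDARD = 20

# 1 / 400 of all dipeptides, expressed in per mille (2.5 per mille = 0.25 %).
expected_permille = 1000.0 / (N_STANDARD ** 2)

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Values in per mille, taken from the table.
sample = {"AlaAla": 19.248, "CysCys": 0.077, "AspPhe": 1.631}
labels = {dp: classify(v) for dp, v in sample.items()}
print(expected_permille, labels)
```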

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
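For example, a per mille value converts to a fraction, and from there to an approximate count, as sketched below. The count estimate assumes that dipeptides are overlapping pairs within each protein, so that a protein of length n contributes n - 1 pairs; this is an assumption, not something the page states. The totals are the 2561 proteins and 840818 amino acids reported for this proteome.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

n_proteins = 2561
n_residues = 840818

# Assumption: each protein of length n contributes n - 1 overlapping pairs.
n_dipeptides = n_residues - n_proteins

ala_ala_fraction = permille_to_fraction(19.248)   # AlaAla value from the table
ala_ala_estimate = ala_ala_fraction * n_dipeptides
print(ala_ala_fraction, round(ala_ala_estimate))
```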

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 19.248 ± 0.257
AlaCys: 0.778 ± 0.036
AlaAsp: 7.607 ± 0.093
AlaGlu: 9.303 ± 0.116
AlaPhe: 3.34 ± 0.066
AlaGly: 11.284 ± 0.132
AlaHis: 2.433 ± 0.053
AlaIle: 4.398 ± 0.067
AlaLys: 2.31 ± 0.069
AlaLeu: 12.847 ± 0.157
AlaMet: 2.882 ± 0.065
AlaAsn: 1.779 ± 0.051
AlaPro: 6.402 ± 0.118
AlaGln: 5.609 ± 0.091
AlaArg: 8.609 ± 0.135
AlaSer: 7.189 ± 0.106
AlaThr: 5.322 ± 0.089
AlaVal: 10.911 ± 0.145
AlaTrp: 1.77 ± 0.045
AlaTyr: 1.955 ± 0.051
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.86 ± 0.032
CysCys: 0.077 ± 0.012
CysAsp: 0.354 ± 0.023
CysGlu: 0.339 ± 0.021
CysPhe: 0.234 ± 0.017
CysGly: 0.67 ± 0.032
CysHis: 0.133 ± 0.013
CysIle: 0.211 ± 0.016
CysLys: 0.061 ± 0.008
CysLeu: 0.655 ± 0.027
CysMet: 0.112 ± 0.012
CysAsn: 0.09 ± 0.012
CysPro: 0.325 ± 0.023
CysGln: 0.17 ± 0.015
CysArg: 0.529 ± 0.023
CysSer: 0.409 ± 0.023
CysThr: 0.39 ± 0.022
CysVal: 0.466 ± 0.022
CysTrp: 0.1 ± 0.012
CysTyr: 0.133 ± 0.013
CysXaa: 0.0 ± 0.0

Asp
AspAla: 7.216 ± 0.097
AspCys: 0.266 ± 0.019
AspAsp: 4.753 ± 0.09
AspGlu: 4.557 ± 0.085
AspPhe: 1.631 ± 0.045
AspGly: 6.272 ± 0.088
AspHis: 1.463 ± 0.042
AspIle: 2.023 ± 0.05
AspLys: 0.979 ± 0.043
AspLeu: 6.287 ± 0.092
AspMet: 1.066 ± 0.034
AspAsn: 0.784 ± 0.033
AspPro: 4.923 ± 0.082
AspGln: 2.245 ± 0.06
AspArg: 4.621 ± 0.084
AspSer: 3.425 ± 0.065
AspThr: 2.652 ± 0.055
AspVal: 4.676 ± 0.072
AspTrp: 0.875 ± 0.035
AspTyr: 1.194 ± 0.04
AspXaa: 0.0 ± 0.0

Glu
GluAla: 8.62 ± 0.107
GluCys: 0.335 ± 0.02
GluAsp: 4.5 ± 0.079
GluGlu: 3.746 ± 0.067
GluPhe: 1.651 ± 0.044
GluGly: 4.921 ± 0.083
GluHis: 2.048 ± 0.045
GluIle: 3.386 ± 0.068
GluLys: 1.503 ± 0.059
GluLeu: 7.219 ± 0.112
GluMet: 1.129 ± 0.038
GluAsn: 1.219 ± 0.042
GluPro: 3.607 ± 0.071
GluGln: 3.758 ± 0.085
GluArg: 5.328 ± 0.104
GluSer: 3.51 ± 0.077
GluThr: 3.554 ± 0.063
GluVal: 4.789 ± 0.085
GluTrp: 0.756 ± 0.03
GluTyr: 1.02 ± 0.04
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.454 ± 0.069
PheCys: 0.203 ± 0.015
PheAsp: 2.048 ± 0.046
PheGlu: 1.749 ± 0.045
PhePhe: 0.929 ± 0.039
PheGly: 2.982 ± 0.073
PheHis: 0.588 ± 0.024
PheIle: 0.969 ± 0.038
PheLys: 0.47 ± 0.024
PheLeu: 2.592 ± 0.058
PheMet: 0.545 ± 0.029
PheAsn: 0.56 ± 0.028
PhePro: 1.299 ± 0.045
PheGln: 0.785 ± 0.029
PheArg: 1.741 ± 0.044
PheSer: 1.708 ± 0.048
PheThr: 1.789 ± 0.051
PheVal: 2.38 ± 0.058
PheTrp: 0.452 ± 0.024
PheTyr: 0.626 ± 0.03
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 10.631 ± 0.134
GlyCys: 0.647 ± 0.029
GlyAsp: 4.53 ± 0.087
GlyGlu: 5.221 ± 0.09
GlyPhe: 2.89 ± 0.058
GlyGly: 7.704 ± 0.13
GlyHis: 2.031 ± 0.052
GlyIle: 4.301 ± 0.081
GlyLys: 2.056 ± 0.06
GlyLeu: 9.296 ± 0.123
GlyMet: 2.446 ± 0.051
GlyAsn: 1.365 ± 0.047
GlyPro: 4.55 ± 0.08
GlyGln: 3.47 ± 0.076
GlyArg: 7.166 ± 0.096
GlySer: 6.177 ± 0.089
GlyThr: 5.442 ± 0.088
GlyVal: 6.745 ± 0.098
GlyTrp: 1.556 ± 0.044
GlyTyr: 1.906 ± 0.053
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.153 ± 0.053
HisCys: 0.182 ± 0.015
HisAsp: 1.563 ± 0.042
HisGlu: 1.466 ± 0.04
HisPhe: 0.52 ± 0.03
HisGly: 2.235 ± 0.064
HisHis: 0.612 ± 0.028
HisIle: 0.611 ± 0.027
HisLys: 0.338 ± 0.022
HisLeu: 2.279 ± 0.058
HisMet: 0.362 ± 0.022
HisAsn: 0.335 ± 0.022
HisPro: 1.591 ± 0.043
HisGln: 0.733 ± 0.028
HisArg: 2.055 ± 0.059
HisSer: 1.169 ± 0.038
HisThr: 1.054 ± 0.039
HisVal: 1.597 ± 0.046
HisTrp: 0.338 ± 0.017
HisTyr: 0.438 ± 0.024
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.647 ± 0.117
IleCys: 0.338 ± 0.02
IleAsp: 2.917 ± 0.065
IleGlu: 2.779 ± 0.058
IlePhe: 1.201 ± 0.045
IleGly: 4.172 ± 0.074
IleHis: 0.753 ± 0.032
IleIle: 1.576 ± 0.055
IleLys: 0.702 ± 0.033
IleLeu: 3.575 ± 0.078
IleMet: 0.773 ± 0.033
IleAsn: 0.793 ± 0.033
IlePro: 2.31 ± 0.05
IleGln: 1.079 ± 0.036
IleArg: 2.608 ± 0.057
IleSer: 2.457 ± 0.06
IleThr: 2.537 ± 0.056
IleVal: 3.811 ± 0.086
IleTrp: 0.444 ± 0.023
IleTyr: 0.721 ± 0.032
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.52 ± 0.074
LysCys: 0.069 ± 0.009
LysAsp: 1.276 ± 0.043
LysGlu: 1.037 ± 0.043
LysPhe: 0.423 ± 0.028
LysGly: 1.548 ± 0.053
LysHis: 0.452 ± 0.023
LysIle: 0.918 ± 0.04
LysLys: 0.748 ± 0.036
LysLeu: 1.711 ± 0.049
LysMet: 0.395 ± 0.026
LysAsn: 0.454 ± 0.023
LysPro: 1.119 ± 0.046
LysGln: 0.658 ± 0.028
LysArg: 1.469 ± 0.053
LysSer: 1.068 ± 0.039
LysThr: 1.114 ± 0.043
LysVal: 1.563 ± 0.058
LysTrp: 0.201 ± 0.013
LysTyr: 0.371 ± 0.022
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 12.37 ± 0.158
LeuCys: 0.715 ± 0.028
LeuAsp: 6.291 ± 0.114
LeuGlu: 6.823 ± 0.105
LeuPhe: 2.55 ± 0.066
LeuGly: 9.399 ± 0.138
LeuHis: 2.047 ± 0.055
LeuIle: 4.378 ± 0.086
LeuLys: 1.829 ± 0.056
LeuLeu: 10.421 ± 0.132
LeuMet: 2.048 ± 0.063
LeuAsn: 1.862 ± 0.05
LeuPro: 5.614 ± 0.082
LeuGln: 3.096 ± 0.065
LeuArg: 7.957 ± 0.12
LeuSer: 6.818 ± 0.09
LeuThr: 5.542 ± 0.085
LeuVal: 8.366 ± 0.127
LeuTrp: 1.362 ± 0.052
LeuTyr: 1.503 ± 0.042
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.726 ± 0.067
MetCys: 0.146 ± 0.015
MetAsp: 1.148 ± 0.04
MetGlu: 0.974 ± 0.037
MetPhe: 0.532 ± 0.027
MetGly: 1.799 ± 0.05
MetHis: 0.433 ± 0.023
MetIle: 1.025 ± 0.036
MetLys: 0.431 ± 0.022
MetLeu: 2.267 ± 0.056
MetMet: 0.458 ± 0.025
MetAsn: 0.425 ± 0.023
MetPro: 1.308 ± 0.039
MetGln: 0.664 ± 0.028
MetArg: 1.566 ± 0.048
MetSer: 1.792 ± 0.043
MetThr: 1.696 ± 0.041
MetVal: 1.651 ± 0.045
MetTrp: 0.212 ± 0.018
MetTyr: 0.32 ± 0.016
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.906 ± 0.05
AsnCys: 0.111 ± 0.011
AsnAsp: 0.979 ± 0.038
AsnGlu: 0.856 ± 0.034
AsnPhe: 0.48 ± 0.024
AsnGly: 1.527 ± 0.051
AsnHis: 0.375 ± 0.021
AsnIle: 0.745 ± 0.031
AsnLys: 0.345 ± 0.025
AsnLeu: 1.686 ± 0.048
AsnMet: 0.391 ± 0.021
AsnAsn: 0.359 ± 0.023
AsnPro: 1.313 ± 0.042
AsnGln: 0.566 ± 0.027
AsnArg: 1.245 ± 0.037
AsnSer: 0.859 ± 0.035
AsnThr: 0.913 ± 0.034
AsnVal: 1.336 ± 0.042
AsnTrp: 0.275 ± 0.019
AsnTyr: 0.404 ± 0.024
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 6.896 ± 0.098
ProCys: 0.235 ± 0.018
ProAsp: 3.682 ± 0.082
ProGlu: 5.334 ± 0.084
ProPhe: 1.48 ± 0.048
ProGly: 5.405 ± 0.081
ProHis: 1.207 ± 0.044
ProIle: 1.747 ± 0.046
ProLys: 1.017 ± 0.046
ProLeu: 4.924 ± 0.09
ProMet: 1.068 ± 0.036
ProAsn: 0.884 ± 0.033
ProPro: 2.115 ± 0.066
ProGln: 2.614 ± 0.053
ProArg: 3.655 ± 0.071
ProSer: 3.794 ± 0.075
ProThr: 2.841 ± 0.057
ProVal: 4.907 ± 0.077
ProTrp: 0.929 ± 0.03
ProTyr: 0.947 ± 0.038
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.66 ± 0.092
GlnCys: 0.167 ± 0.015
GlnAsp: 2.501 ± 0.066
GlnGlu: 2.599 ± 0.058
GlnPhe: 0.844 ± 0.036
GlnGly: 2.84 ± 0.063
GlnHis: 0.868 ± 0.033
GlnIle: 1.739 ± 0.044
GlnLys: 0.841 ± 0.036
GlnLeu: 4.083 ± 0.082
GlnMet: 0.76 ± 0.03
GlnAsn: 0.659 ± 0.028
GlnPro: 2.068 ± 0.052
GlnGln: 1.88 ± 0.059
GlnArg: 3.625 ± 0.071
GlnSer: 1.723 ± 0.045
GlnThr: 1.835 ± 0.046
GlnVal: 2.608 ± 0.054
GlnTrp: 0.645 ± 0.029
GlnTyr: 0.574 ± 0.028
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 8.756 ± 0.129
ArgCys: 0.514 ± 0.026
ArgAsp: 4.189 ± 0.069
ArgGlu: 5.2 ± 0.09
ArgPhe: 2.269 ± 0.055
ArgGly: 5.834 ± 0.106
ArgHis: 1.578 ± 0.046
ArgIle: 3.491 ± 0.071
ArgLys: 1.446 ± 0.049
ArgLeu: 7.575 ± 0.126
ArgMet: 1.952 ± 0.05
ArgAsn: 1.079 ± 0.039
ArgPro: 4.361 ± 0.085
ArgGln: 2.632 ± 0.058
ArgArg: 7.735 ± 0.13
ArgSer: 4.952 ± 0.089
ArgThr: 4.776 ± 0.074
ArgVal: 5.257 ± 0.078
ArgTrp: 1.284 ± 0.031
ArgTyr: 1.361 ± 0.044
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 8.098 ± 0.096
SerCys: 0.366 ± 0.021
SerAsp: 3.256 ± 0.071
SerGlu: 3.828 ± 0.073
SerPhe: 1.902 ± 0.048
SerGly: 6.369 ± 0.096
SerHis: 1.15 ± 0.045
SerIle: 2.336 ± 0.057
SerLys: 1.112 ± 0.044
SerLeu: 5.595 ± 0.093
SerMet: 1.633 ± 0.045
SerAsn: 0.992 ± 0.037
SerPro: 3.45 ± 0.058
SerGln: 2.151 ± 0.055
SerArg: 4.351 ± 0.073
SerSer: 4.603 ± 0.092
SerThr: 3.806 ± 0.06
SerVal: 4.673 ± 0.069
SerTrp: 1.031 ± 0.034
SerTyr: 1.118 ± 0.04
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 7.348 ± 0.092
ThrCys: 0.312 ± 0.02
ThrAsp: 3.242 ± 0.057
ThrGlu: 3.409 ± 0.07
ThrPhe: 1.458 ± 0.043
ThrGly: 5.849 ± 0.079
ThrHis: 1.083 ± 0.036
ThrIle: 2.22 ± 0.059
ThrLys: 0.904 ± 0.045
ThrLeu: 4.797 ± 0.078
ThrMet: 1.119 ± 0.035
ThrAsn: 0.873 ± 0.034
ThrPro: 3.394 ± 0.074
ThrGln: 1.684 ± 0.046
ThrArg: 3.56 ± 0.074
ThrSer: 3.285 ± 0.06
ThrThr: 3.346 ± 0.068
ThrVal: 5.057 ± 0.063
ThrTrp: 0.907 ± 0.029
ThrTyr: 1.028 ± 0.033
ThrXaa: 0.0 ± 0.0

Val
ValAla: 9.208 ± 0.112
ValCys: 0.599 ± 0.027
ValAsp: 5.037 ± 0.082
ValGlu: 5.4 ± 0.087
ValPhe: 2.383 ± 0.057
ValGly: 6.51 ± 0.098
ValHis: 1.745 ± 0.043
ValIle: 3.849 ± 0.082
ValLys: 1.363 ± 0.047
ValLeu: 9.705 ± 0.136
ValMet: 1.77 ± 0.049
ValAsn: 1.43 ± 0.045
ValPro: 4.318 ± 0.065
ValGln: 2.514 ± 0.056
ValArg: 5.886 ± 0.092
ValSer: 4.774 ± 0.081
ValThr: 4.478 ± 0.078
ValVal: 8.23 ± 0.129
ValTrp: 1.006 ± 0.036
ValTyr: 1.299 ± 0.041
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.694 ± 0.049
TrpCys: 0.119 ± 0.013
TrpAsp: 0.809 ± 0.033
TrpGlu: 0.846 ± 0.03
TrpPhe: 0.559 ± 0.025
TrpGly: 1.093 ± 0.033
TrpHis: 0.306 ± 0.019
TrpIle: 0.721 ± 0.031
TrpLys: 0.299 ± 0.021
TrpLeu: 1.656 ± 0.05
TrpMet: 0.341 ± 0.02
TrpAsn: 0.378 ± 0.021
TrpPro: 0.695 ± 0.03
TrpGln: 0.603 ± 0.028
TrpArg: 1.091 ± 0.04
TrpSer: 1.037 ± 0.041
TrpThr: 0.831 ± 0.034
TrpVal: 1.081 ± 0.036
TrpTrp: 0.36 ± 0.023
TrpTyr: 0.276 ± 0.02
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.967 ± 0.053
TyrCys: 0.139 ± 0.013
TyrAsp: 1.225 ± 0.046
TyrGlu: 1.135 ± 0.038
TyrPhe: 0.591 ± 0.03
TyrGly: 1.719 ± 0.045
TyrHis: 0.325 ± 0.021
TyrIle: 0.549 ± 0.025
TyrLys: 0.364 ± 0.019
TyrLeu: 1.823 ± 0.044
TyrMet: 0.322 ± 0.019
TyrAsn: 0.351 ± 0.024
TyrPro: 0.918 ± 0.035
TyrGln: 0.608 ± 0.027
TyrArg: 1.444 ± 0.039
TyrSer: 1.069 ± 0.037
TyrThr: 0.955 ± 0.034
TyrVal: 1.4 ± 0.041
TyrTrp: 0.29 ± 0.018
TyrTyr: 0.4 ± 0.023
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2561 proteins (840818 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
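A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values serves as the error. This is a minimal sketch under stated assumptions. Reading "x100" as 100 resamples, using the standard deviation as the error measure, the function names, the fixed seed, and the toy sequences are all assumptions, not the database's documented procedure.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)                      # fixed seed for repeatability
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, not individual dipeptides.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)   # a missing key counts as 0
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy proteome in one-letter code (hypothetical sequences).
proteins = ["MAAGLAA", "MKAAV", "MGGSAL", "MAALAAK"]
mean, err = bootstrap_error(proteins, "AA")
print(f"AA: {mean:.3f} ± {err:.3f} per mille")
```

Resampling at the protein level keeps the dipeptides of one protein together, so correlations within a protein are reflected in the error estimate.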

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski