Amino acid dipeptide frequency for Sinimarinibacterium sp. NLF-5-8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
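As a rough illustration of how such frequencies could be counted, the sketch below tallies overlapping residue pairs within each protein and reports them in per mille. It is only an assumption about the procedure: the function name, the toy sequences, the use of one-letter codes, and the normalisation by the total number of dipeptides are illustrative choices, not details taken from the database.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    Assumption: pairs are counted inside each protein only (never across
    two proteins) and normalised by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" in one-letter code.
toy = ["MAAL", "ALA"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

The toy input yields five dipeptides in total (MA, AA, AL, AL, LA), so each frequency is a multiple of 200‰.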

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
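The comparison against the uniform expectation can be sketched as follows. The threshold of 1000/400 = 2.5‰ follows from the text; the function name and the idea that the colouring uses exactly this cut-off are assumptions, and the sample values are a few entries copied from the table.

```python
# Uniform expectation for 20 standard amino acids, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be shown in red
    if freq_per_mille < expected:
        return "under"  # would be shown in blue
    return "as expected"


# A few values (in per mille) taken from the table.
sample = {"AlaAla": 16.132, "AlaLeu": 14.814, "CysCys": 0.141, "TrpTrp": 0.238}
labels = {pair: classify(value) for pair, value in sample.items()}
for pair, label in labels.items():
    print(pair, sample[pair], label)
```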

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 16.132‰ corresponds to 0.016132.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue and columns the second residue of the dipeptide; each cell is the frequency ± error.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 16.132 ± 0.204 | 1.319 ± 0.041 | 7.084 ± 0.092 | 6.885 ± 0.104 | 3.655 ± 0.071 | 8.832 ± 0.099 | 2.864 ± 0.062 | 5.316 ± 0.08 | 3.221 ± 0.068 | 14.814 ± 0.188 | 2.977 ± 0.06 | 2.845 ± 0.068 | 6.043 ± 0.097 | 7.554 ± 0.11 | 10.628 ± 0.135 | 6.099 ± 0.092 | 5.524 ± 0.088 | 8.362 ± 0.119 | 1.673 ± 0.043 | 2.625 ± 0.058 | 0.0 ± 0.0
Cys | 1.363 ± 0.042 | 0.141 ± 0.015 | 0.544 ± 0.025 | 0.51 ± 0.025 | 0.295 ± 0.017 | 1.04 ± 0.04 | 0.318 ± 0.025 | 0.462 ± 0.021 | 0.243 ± 0.017 | 0.808 ± 0.031 | 0.151 ± 0.013 | 0.272 ± 0.019 | 0.522 ± 0.028 | 0.35 ± 0.021 | 0.592 ± 0.027 | 0.547 ± 0.031 | 0.537 ± 0.027 | 0.758 ± 0.032 | 0.127 ± 0.012 | 0.22 ± 0.014 | 0.0 ± 0.0
Asp | 7.751 ± 0.102 | 0.509 ± 0.023 | 3.369 ± 0.063 | 3.266 ± 0.06 | 2.16 ± 0.049 | 4.702 ± 0.072 | 1.368 ± 0.039 | 2.446 ± 0.051 | 1.519 ± 0.045 | 5.671 ± 0.078 | 1.111 ± 0.04 | 1.349 ± 0.04 | 3.282 ± 0.057 | 2.349 ± 0.045 | 3.363 ± 0.058 | 2.524 ± 0.061 | 2.485 ± 0.051 | 4.192 ± 0.06 | 0.996 ± 0.034 | 1.695 ± 0.041 | 0.0 ± 0.0
Glu | 6.287 ± 0.098 | 0.361 ± 0.02 | 2.353 ± 0.055 | 1.992 ± 0.062 | 1.651 ± 0.051 | 3.461 ± 0.061 | 1.394 ± 0.039 | 2.732 ± 0.064 | 1.902 ± 0.056 | 5.716 ± 0.082 | 1.164 ± 0.036 | 1.529 ± 0.045 | 2.418 ± 0.044 | 3.136 ± 0.074 | 4.073 ± 0.08 | 2.441 ± 0.057 | 2.532 ± 0.054 | 3.72 ± 0.063 | 0.633 ± 0.024 | 1.232 ± 0.038 | 0.0 ± 0.0
Phe | 4.351 ± 0.084 | 0.402 ± 0.022 | 2.607 ± 0.053 | 2.166 ± 0.045 | 1.361 ± 0.046 | 3.097 ± 0.058 | 0.697 ± 0.028 | 1.64 ± 0.053 | 1.166 ± 0.034 | 2.884 ± 0.061 | 0.782 ± 0.031 | 1.236 ± 0.04 | 1.254 ± 0.038 | 0.917 ± 0.033 | 1.657 ± 0.039 | 2.069 ± 0.045 | 1.896 ± 0.045 | 2.527 ± 0.055 | 0.547 ± 0.024 | 0.953 ± 0.034 | 0.0 ± 0.0
Gly | 8.926 ± 0.115 | 0.927 ± 0.037 | 4.162 ± 0.072 | 4.173 ± 0.068 | 3.297 ± 0.058 | 6.491 ± 0.111 | 1.955 ± 0.045 | 4.403 ± 0.072 | 3.009 ± 0.065 | 8.194 ± 0.101 | 2.108 ± 0.048 | 2.271 ± 0.064 | 2.077 ± 0.049 | 3.739 ± 0.064 | 4.8 ± 0.076 | 4.413 ± 0.08 | 4.012 ± 0.089 | 6.262 ± 0.1 | 1.345 ± 0.041 | 2.319 ± 0.063 | 0.0 ± 0.0
His | 2.897 ± 0.063 | 0.326 ± 0.018 | 1.271 ± 0.04 | 0.998 ± 0.032 | 0.894 ± 0.031 | 2.132 ± 0.054 | 0.785 ± 0.03 | 1.147 ± 0.036 | 0.612 ± 0.026 | 2.428 ± 0.054 | 0.487 ± 0.026 | 0.663 ± 0.032 | 1.538 ± 0.043 | 0.995 ± 0.035 | 1.596 ± 0.043 | 1.24 ± 0.039 | 1.067 ± 0.033 | 1.467 ± 0.035 | 0.553 ± 0.022 | 0.749 ± 0.028 | 0.0 ± 0.0
Ile | 6.398 ± 0.087 | 0.437 ± 0.021 | 3.44 ± 0.054 | 3.472 ± 0.078 | 1.414 ± 0.048 | 4.737 ± 0.113 | 1.129 ± 0.03 | 2.054 ± 0.054 | 1.768 ± 0.05 | 3.626 ± 0.067 | 0.808 ± 0.032 | 1.847 ± 0.047 | 2.39 ± 0.056 | 1.738 ± 0.044 | 2.935 ± 0.057 | 2.76 ± 0.054 | 2.794 ± 0.065 | 3.505 ± 0.072 | 0.534 ± 0.025 | 1.127 ± 0.041 | 0.0 ± 0.0
Lys | 3.648 ± 0.082 | 0.189 ± 0.016 | 1.543 ± 0.051 | 1.189 ± 0.043 | 0.802 ± 0.03 | 2.213 ± 0.059 | 0.676 ± 0.029 | 1.661 ± 0.047 | 1.294 ± 0.052 | 3.33 ± 0.06 | 0.758 ± 0.029 | 1.077 ± 0.04 | 1.99 ± 0.046 | 1.383 ± 0.036 | 2.149 ± 0.045 | 1.51 ± 0.035 | 2.077 ± 0.056 | 2.206 ± 0.056 | 0.365 ± 0.019 | 0.644 ± 0.037 | 0.0 ± 0.0
Leu | 13.049 ± 0.157 | 1.17 ± 0.036 | 6.451 ± 0.099 | 5.108 ± 0.086 | 3.422 ± 0.063 | 8.324 ± 0.101 | 2.532 ± 0.056 | 5.314 ± 0.081 | 3.644 ± 0.064 | 11.554 ± 0.181 | 2.486 ± 0.059 | 3.084 ± 0.058 | 6.287 ± 0.086 | 4.547 ± 0.079 | 8.109 ± 0.112 | 6.323 ± 0.088 | 5.885 ± 0.079 | 6.315 ± 0.105 | 1.556 ± 0.056 | 2.355 ± 0.054 | 0.0 ± 0.0
Met | 2.799 ± 0.058 | 0.171 ± 0.012 | 1.163 ± 0.036 | 0.898 ± 0.032 | 0.662 ± 0.027 | 1.671 ± 0.043 | 0.47 ± 0.02 | 1.03 ± 0.032 | 0.803 ± 0.029 | 2.469 ± 0.054 | 0.56 ± 0.03 | 0.848 ± 0.032 | 1.372 ± 0.043 | 1.178 ± 0.035 | 1.639 ± 0.042 | 1.561 ± 0.038 | 1.601 ± 0.044 | 1.433 ± 0.037 | 0.2 ± 0.015 | 0.369 ± 0.02 | 0.0 ± 0.0
Asn | 3.406 ± 0.072 | 0.269 ± 0.017 | 1.488 ± 0.045 | 1.139 ± 0.035 | 0.975 ± 0.037 | 2.531 ± 0.067 | 0.659 ± 0.029 | 1.526 ± 0.033 | 0.818 ± 0.028 | 2.898 ± 0.048 | 0.559 ± 0.022 | 0.92 ± 0.038 | 2.126 ± 0.047 | 1.19 ± 0.038 | 1.828 ± 0.044 | 1.327 ± 0.046 | 1.705 ± 0.05 | 1.947 ± 0.054 | 0.438 ± 0.021 | 0.758 ± 0.031 | 0.0 ± 0.0
Pro | 6.222 ± 0.1 | 0.368 ± 0.021 | 3.208 ± 0.063 | 3.292 ± 0.064 | 1.671 ± 0.043 | 4.063 ± 0.062 | 1.159 ± 0.035 | 2.253 ± 0.046 | 1.435 ± 0.043 | 5.428 ± 0.087 | 1.392 ± 0.039 | 1.355 ± 0.044 | 2.723 ± 0.065 | 2.619 ± 0.052 | 3.02 ± 0.069 | 2.623 ± 0.057 | 2.621 ± 0.064 | 3.825 ± 0.069 | 0.746 ± 0.03 | 1.244 ± 0.036 | 0.0 ± 0.0
Gln | 6.212 ± 0.107 | 0.435 ± 0.024 | 1.942 ± 0.047 | 1.567 ± 0.041 | 1.518 ± 0.041 | 3.473 ± 0.069 | 1.199 ± 0.036 | 2.701 ± 0.052 | 1.432 ± 0.042 | 4.927 ± 0.096 | 1.203 ± 0.035 | 1.227 ± 0.038 | 2.624 ± 0.051 | 2.879 ± 0.083 | 4.023 ± 0.081 | 2.345 ± 0.05 | 2.853 ± 0.056 | 3.461 ± 0.055 | 0.956 ± 0.031 | 1.056 ± 0.034 | 0.0 ± 0.0
Arg | 9.693 ± 0.125 | 0.619 ± 0.027 | 3.594 ± 0.058 | 3.671 ± 0.074 | 2.748 ± 0.056 | 4.375 ± 0.076 | 1.699 ± 0.043 | 3.945 ± 0.068 | 2.087 ± 0.054 | 7.827 ± 0.108 | 1.661 ± 0.048 | 1.934 ± 0.044 | 2.794 ± 0.052 | 3.236 ± 0.068 | 4.783 ± 0.097 | 3.539 ± 0.073 | 2.964 ± 0.057 | 5.051 ± 0.077 | 1.28 ± 0.042 | 2.184 ± 0.047 | 0.0 ± 0.0
Ser | 6.831 ± 0.098 | 0.487 ± 0.028 | 2.952 ± 0.057 | 2.502 ± 0.055 | 1.956 ± 0.051 | 5.291 ± 0.087 | 1.194 ± 0.035 | 2.6 ± 0.064 | 1.507 ± 0.043 | 5.214 ± 0.082 | 1.148 ± 0.035 | 1.516 ± 0.046 | 2.538 ± 0.057 | 2.264 ± 0.054 | 3.284 ± 0.062 | 2.963 ± 0.067 | 2.753 ± 0.063 | 3.813 ± 0.068 | 0.732 ± 0.026 | 1.181 ± 0.042 | 0.0 ± 0.0
Thr | 6.285 ± 0.109 | 0.47 ± 0.026 | 2.665 ± 0.055 | 2.337 ± 0.052 | 1.728 ± 0.048 | 4.127 ± 0.071 | 1.293 ± 0.042 | 2.172 ± 0.048 | 1.041 ± 0.038 | 7.178 ± 0.092 | 0.97 ± 0.035 | 1.179 ± 0.047 | 3.761 ± 0.073 | 2.486 ± 0.06 | 3.357 ± 0.06 | 2.352 ± 0.061 | 2.638 ± 0.066 | 4.012 ± 0.081 | 0.654 ± 0.029 | 1.079 ± 0.037 | 0.0 ± 0.0
Val | 8.017 ± 0.105 | 0.758 ± 0.025 | 4.259 ± 0.071 | 3.934 ± 0.07 | 2.507 ± 0.051 | 5.21 ± 0.076 | 1.463 ± 0.041 | 3.788 ± 0.061 | 2.102 ± 0.054 | 7.804 ± 0.107 | 1.767 ± 0.049 | 2.121 ± 0.058 | 3.588 ± 0.066 | 3.071 ± 0.065 | 4.73 ± 0.074 | 4.064 ± 0.073 | 3.861 ± 0.081 | 5.329 ± 0.095 | 0.902 ± 0.037 | 1.533 ± 0.041 | 0.0 ± 0.0
Trp | 1.367 ± 0.041 | 0.17 ± 0.015 | 0.676 ± 0.028 | 0.511 ± 0.023 | 0.565 ± 0.021 | 0.984 ± 0.035 | 0.405 ± 0.021 | 0.778 ± 0.025 | 0.421 ± 0.023 | 2.14 ± 0.054 | 0.398 ± 0.024 | 0.473 ± 0.023 | 0.691 ± 0.03 | 1.089 ± 0.035 | 1.17 ± 0.041 | 0.785 ± 0.034 | 0.681 ± 0.03 | 1.029 ± 0.035 | 0.238 ± 0.018 | 0.295 ± 0.018 | 0.0 ± 0.0
Tyr | 2.819 ± 0.063 | 0.273 ± 0.017 | 1.337 ± 0.043 | 1.118 ± 0.038 | 0.927 ± 0.03 | 2.131 ± 0.043 | 0.586 ± 0.027 | 0.945 ± 0.029 | 0.68 ± 0.026 | 2.604 ± 0.056 | 0.405 ± 0.021 | 0.759 ± 0.032 | 1.248 ± 0.041 | 1.314 ± 0.038 | 1.952 ± 0.045 | 1.241 ± 0.04 | 1.299 ± 0.039 | 1.593 ± 0.044 | 0.391 ± 0.024 | 0.653 ± 0.028 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 2927 proteins (959076 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
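A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. Taking the standard deviation over 100 rounds as "the error" is an assumption, as are the function names, the fixed seed, and the toy sequences in one-letter code.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(resample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome.
toy = ["MAALAA", "ALAGLA", "MKKLAA", "GAVLAL"]
err = bootstrap_error(toy, "AL")
print(f"AL: {pair_frequency(toy, 'AL'):.1f} ± {err:.1f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides within a protein stay together in every replicate.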

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski