Amino acid dipeptide frequency for Aliidiomarina haloalkalitolerans

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 dipeptides would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
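The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to produce the table: it assumes one-letter amino acid codes, counts overlapping pairs within each protein (pairs never span two proteins), and uses a made-up two-sequence proteome. The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumption: overlapping pairs are counted within each sequence,
    and a pair never spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome (one-letter codes), for illustration only.
toy_proteome = ["MAAL", "ALLA"]
freqs = dipeptide_per_mille(toy_proteome)

# Expected value if all 400 standard dipeptides were equally likely:
# 1/400 = 0.25% = 2.5 per mille.
expected_uniform = 1000.0 / 400

for pair, value in sorted(freqs.items()):
    label = "over" if value > expected_uniform else "under"
    print(f"{pair}: {value:.3f} per mille ({label}represented)")
```

Comparing each value against 2.5‰ corresponds to the red (overrepresented) and blue (underrepresented) marking mentioned above.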

For more information see sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell is the frequency in ‰ ± its estimated error.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 11.28 ± 0.159 | 0.942 ± 0.041 | 5.701 ± 0.094 | 7.671 ± 0.122 | 3.583 ± 0.063 | 7.389 ± 0.117 | 2.091 ± 0.05 | 5.931 ± 0.096 | 3.793 ± 0.085 | 11.119 ± 0.136 | 2.695 ± 0.052 | 3.682 ± 0.073 | 3.553 ± 0.067 | 4.863 ± 0.095 | 5.496 ± 0.099 | 5.301 ± 0.09 | 5.107 ± 0.092 | 7.096 ± 0.107 | 1.31 ± 0.042 | 2.441 ± 0.058 | 0.0 ± 0.0
Cys | 0.768 ± 0.03 | 0.105 ± 0.011 | 0.495 ± 0.023 | 0.508 ± 0.026 | 0.323 ± 0.019 | 0.753 ± 0.037 | 0.27 ± 0.02 | 0.437 ± 0.023 | 0.3 ± 0.019 | 0.763 ± 0.029 | 0.18 ± 0.014 | 0.271 ± 0.02 | 0.412 ± 0.025 | 0.404 ± 0.023 | 0.477 ± 0.026 | 0.555 ± 0.028 | 0.441 ± 0.024 | 0.544 ± 0.032 | 0.115 ± 0.013 | 0.253 ± 0.017 | 0.0 ± 0.0
Asp | 4.982 ± 0.074 | 0.466 ± 0.024 | 3.215 ± 0.069 | 3.775 ± 0.075 | 2.492 ± 0.061 | 3.925 ± 0.092 | 1.219 ± 0.046 | 3.296 ± 0.071 | 2.0 ± 0.053 | 5.384 ± 0.087 | 1.419 ± 0.046 | 1.853 ± 0.053 | 2.259 ± 0.06 | 2.201 ± 0.053 | 2.871 ± 0.069 | 2.893 ± 0.059 | 2.404 ± 0.054 | 4.019 ± 0.065 | 0.934 ± 0.037 | 2.103 ± 0.054 | 0.0 ± 0.0
Glu | 5.723 ± 0.104 | 0.409 ± 0.023 | 2.587 ± 0.07 | 3.652 ± 0.093 | 2.687 ± 0.066 | 3.578 ± 0.074 | 1.845 ± 0.053 | 3.665 ± 0.078 | 2.69 ± 0.068 | 7.826 ± 0.107 | 1.476 ± 0.044 | 2.31 ± 0.056 | 2.478 ± 0.062 | 4.747 ± 0.105 | 4.909 ± 0.089 | 3.246 ± 0.062 | 3.033 ± 0.061 | 4.455 ± 0.083 | 0.795 ± 0.028 | 1.689 ± 0.05 | 0.0 ± 0.0
Phe | 4.555 ± 0.084 | 0.395 ± 0.022 | 2.452 ± 0.055 | 2.214 ± 0.059 | 1.681 ± 0.052 | 3.315 ± 0.075 | 0.893 ± 0.031 | 2.374 ± 0.062 | 1.352 ± 0.048 | 3.507 ± 0.077 | 0.991 ± 0.037 | 1.756 ± 0.052 | 1.547 ± 0.052 | 1.476 ± 0.042 | 2.131 ± 0.055 | 2.778 ± 0.066 | 2.315 ± 0.056 | 2.962 ± 0.071 | 0.547 ± 0.028 | 1.29 ± 0.042 | 0.0 ± 0.0
Gly | 6.054 ± 0.106 | 0.767 ± 0.033 | 3.777 ± 0.071 | 4.471 ± 0.09 | 3.375 ± 0.067 | 5.2 ± 0.103 | 1.764 ± 0.05 | 4.285 ± 0.077 | 3.069 ± 0.075 | 7.219 ± 0.11 | 2.058 ± 0.057 | 2.455 ± 0.062 | 2.05 ± 0.06 | 3.325 ± 0.069 | 4.066 ± 0.08 | 4.093 ± 0.086 | 3.536 ± 0.076 | 5.595 ± 0.089 | 0.992 ± 0.037 | 2.51 ± 0.052 | 0.0 ± 0.0
His | 2.24 ± 0.06 | 0.296 ± 0.021 | 1.346 ± 0.041 | 1.499 ± 0.047 | 1.084 ± 0.038 | 1.937 ± 0.058 | 0.658 ± 0.028 | 1.281 ± 0.045 | 0.743 ± 0.032 | 2.273 ± 0.048 | 0.482 ± 0.026 | 0.767 ± 0.033 | 1.303 ± 0.038 | 1.149 ± 0.037 | 1.397 ± 0.047 | 1.39 ± 0.04 | 1.064 ± 0.04 | 1.57 ± 0.042 | 0.411 ± 0.019 | 0.873 ± 0.04 | 0.0 ± 0.0
Ile | 6.402 ± 0.102 | 0.498 ± 0.024 | 3.625 ± 0.059 | 4.023 ± 0.086 | 1.963 ± 0.058 | 4.452 ± 0.097 | 1.246 ± 0.044 | 2.945 ± 0.064 | 1.911 ± 0.061 | 5.006 ± 0.091 | 1.184 ± 0.042 | 2.107 ± 0.061 | 2.565 ± 0.069 | 2.394 ± 0.049 | 3.296 ± 0.063 | 3.278 ± 0.065 | 3.045 ± 0.062 | 3.842 ± 0.082 | 0.642 ± 0.03 | 1.487 ± 0.042 | 0.0 ± 0.0
Lys | 3.822 ± 0.085 | 0.204 ± 0.015 | 1.829 ± 0.063 | 2.273 ± 0.068 | 1.229 ± 0.041 | 2.227 ± 0.064 | 0.884 ± 0.032 | 1.933 ± 0.053 | 1.942 ± 0.064 | 3.98 ± 0.078 | 0.884 ± 0.035 | 1.246 ± 0.047 | 1.845 ± 0.055 | 1.995 ± 0.053 | 2.39 ± 0.059 | 2.077 ± 0.06 | 2.041 ± 0.058 | 2.803 ± 0.068 | 0.396 ± 0.023 | 0.874 ± 0.033 | 0.0 ± 0.0
Leu | 11.949 ± 0.15 | 0.848 ± 0.032 | 5.428 ± 0.103 | 6.306 ± 0.097 | 4.019 ± 0.084 | 7.044 ± 0.102 | 2.443 ± 0.065 | 5.584 ± 0.104 | 3.828 ± 0.077 | 11.099 ± 0.174 | 2.569 ± 0.063 | 4.027 ± 0.076 | 5.323 ± 0.09 | 5.367 ± 0.111 | 6.387 ± 0.093 | 6.548 ± 0.11 | 6.17 ± 0.094 | 7.508 ± 0.099 | 1.232 ± 0.045 | 2.468 ± 0.061 | 0.0 ± 0.0
Met | 2.711 ± 0.057 | 0.15 ± 0.014 | 1.049 ± 0.037 | 1.12 ± 0.039 | 0.814 ± 0.03 | 1.502 ± 0.047 | 0.552 ± 0.027 | 1.237 ± 0.04 | 1.133 ± 0.035 | 2.735 ± 0.068 | 0.601 ± 0.027 | 0.981 ± 0.035 | 1.216 ± 0.041 | 1.36 ± 0.044 | 1.523 ± 0.045 | 1.751 ± 0.049 | 1.605 ± 0.041 | 1.708 ± 0.048 | 0.222 ± 0.017 | 0.509 ± 0.026 | 0.0 ± 0.0
Asn | 3.314 ± 0.065 | 0.315 ± 0.022 | 2.076 ± 0.06 | 2.286 ± 0.058 | 1.419 ± 0.047 | 2.716 ± 0.063 | 0.871 ± 0.034 | 1.915 ± 0.055 | 1.273 ± 0.039 | 3.641 ± 0.077 | 0.869 ± 0.034 | 1.362 ± 0.051 | 2.133 ± 0.052 | 1.799 ± 0.052 | 2.157 ± 0.052 | 1.937 ± 0.049 | 1.916 ± 0.054 | 2.334 ± 0.055 | 0.582 ± 0.032 | 1.132 ± 0.039 | 0.0 ± 0.0
Pro | 4.291 ± 0.086 | 0.298 ± 0.021 | 2.652 ± 0.059 | 3.596 ± 0.071 | 1.698 ± 0.044 | 3.017 ± 0.066 | 0.923 ± 0.034 | 2.254 ± 0.053 | 1.562 ± 0.048 | 4.473 ± 0.084 | 1.032 ± 0.032 | 1.656 ± 0.049 | 1.44 ± 0.053 | 2.04 ± 0.054 | 1.941 ± 0.054 | 2.29 ± 0.051 | 2.222 ± 0.048 | 3.506 ± 0.067 | 0.645 ± 0.033 | 1.231 ± 0.037 | 0.0 ± 0.0
Gln | 5.571 ± 0.124 | 0.317 ± 0.019 | 2.061 ± 0.054 | 2.907 ± 0.071 | 1.884 ± 0.051 | 3.267 ± 0.067 | 1.456 ± 0.048 | 2.543 ± 0.057 | 1.608 ± 0.047 | 5.948 ± 0.111 | 1.056 ± 0.042 | 1.522 ± 0.049 | 2.406 ± 0.051 | 4.098 ± 0.123 | 3.931 ± 0.084 | 2.606 ± 0.052 | 2.368 ± 0.057 | 3.885 ± 0.069 | 0.764 ± 0.033 | 1.221 ± 0.04 | 0.0 ± 0.0
Arg | 5.174 ± 0.077 | 0.448 ± 0.026 | 3.44 ± 0.072 | 4.212 ± 0.079 | 2.751 ± 0.062 | 3.817 ± 0.075 | 1.485 ± 0.048 | 3.573 ± 0.062 | 2.26 ± 0.058 | 6.644 ± 0.118 | 1.575 ± 0.047 | 2.143 ± 0.051 | 2.201 ± 0.052 | 3.299 ± 0.065 | 3.642 ± 0.083 | 3.253 ± 0.065 | 2.68 ± 0.055 | 4.442 ± 0.081 | 1.027 ± 0.036 | 2.107 ± 0.057 | 0.0 ± 0.0
Ser | 5.864 ± 0.095 | 0.456 ± 0.023 | 3.167 ± 0.071 | 3.644 ± 0.071 | 2.474 ± 0.058 | 4.746 ± 0.081 | 1.31 ± 0.042 | 3.084 ± 0.08 | 2.081 ± 0.054 | 5.832 ± 0.096 | 1.534 ± 0.044 | 1.965 ± 0.052 | 2.446 ± 0.056 | 2.68 ± 0.062 | 3.36 ± 0.068 | 3.548 ± 0.067 | 2.728 ± 0.057 | 4.067 ± 0.08 | 0.793 ± 0.034 | 1.683 ± 0.055 | 0.0 ± 0.0
Thr | 5.257 ± 0.089 | 0.415 ± 0.02 | 2.787 ± 0.062 | 3.132 ± 0.065 | 2.025 ± 0.056 | 4.185 ± 0.085 | 1.175 ± 0.037 | 2.988 ± 0.064 | 1.625 ± 0.05 | 5.936 ± 0.095 | 1.1 ± 0.039 | 1.751 ± 0.054 | 2.686 ± 0.052 | 2.226 ± 0.055 | 2.798 ± 0.062 | 2.925 ± 0.071 | 2.919 ± 0.056 | 3.92 ± 0.074 | 0.675 ± 0.031 | 1.365 ± 0.043 | 0.0 ± 0.0
Val | 7.682 ± 0.103 | 0.617 ± 0.027 | 3.894 ± 0.077 | 4.419 ± 0.077 | 2.977 ± 0.067 | 4.768 ± 0.083 | 1.528 ± 0.045 | 4.585 ± 0.091 | 2.571 ± 0.074 | 7.438 ± 0.115 | 1.879 ± 0.051 | 2.841 ± 0.066 | 3.035 ± 0.073 | 2.966 ± 0.059 | 4.323 ± 0.073 | 4.496 ± 0.083 | 4.399 ± 0.082 | 5.76 ± 0.102 | 0.846 ± 0.032 | 1.873 ± 0.046 | 0.0 ± 0.0
Trp | 0.866 ± 0.038 | 0.125 ± 0.012 | 0.592 ± 0.029 | 0.542 ± 0.028 | 0.695 ± 0.033 | 0.768 ± 0.029 | 0.394 ± 0.02 | 0.655 ± 0.026 | 0.307 ± 0.022 | 2.146 ± 0.065 | 0.348 ± 0.019 | 0.41 ± 0.025 | 0.557 ± 0.027 | 1.279 ± 0.043 | 1.011 ± 0.036 | 0.754 ± 0.035 | 0.503 ± 0.028 | 0.964 ± 0.037 | 0.246 ± 0.02 | 0.441 ± 0.024 | 0.0 ± 0.0
Tyr | 2.54 ± 0.062 | 0.302 ± 0.021 | 1.534 ± 0.044 | 1.552 ± 0.045 | 1.362 ± 0.046 | 2.055 ± 0.056 | 0.754 ± 0.03 | 1.345 ± 0.045 | 0.829 ± 0.035 | 3.179 ± 0.077 | 0.545 ± 0.03 | 0.944 ± 0.036 | 1.31 ± 0.039 | 1.749 ± 0.046 | 2.068 ± 0.057 | 1.744 ± 0.051 | 1.393 ± 0.04 | 1.917 ± 0.046 | 0.429 ± 0.023 | 0.9 ± 0.037 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 2391 proteins (807456 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
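Protein-level bootstrapping can be illustrated roughly as follows. This is only a sketch under stated assumptions: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the reported error is taken to be the standard deviation across resamples. The source does not say which spread statistic was used, so that last point is an assumption, and the names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of one dipeptide frequency (in per mille).

    Whole proteins are resampled with replacement, so residues within
    a protein stay together. The standard deviation across resamples is
    returned (an assumed choice of spread statistic).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), for illustration only.
toy_proteome = ["MAAL", "ALLA", "AAAA", "MLLK"]
print(f"AA error: {bootstrap_error(toy_proteome, 'AA'):.3f} per mille")
```

Resampling proteins rather than individual residues keeps neighbouring residues together, which matters because a dipeptide count depends on adjacency within a sequence.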

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski