Amino acid dipeptide frequency for Pisciglobus halotolerans

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
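As an illustration, dipeptide frequencies can be computed by sliding a two-residue window along every protein sequence. The sketch below is a minimal example and not the Proteome-pI implementation: the function name `dipeptide_per_mille` is hypothetical, the sequences are toy data in one-letter code (the table below uses three-letter codes), and counting overlapping windows within each protein is an assumption.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumption: dipeptides are overlapping windows, so a protein of
    length n contributes n - 1 dipeptides and none span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy proteome (hypothetical sequences, one-letter code).
freqs = dipeptide_per_mille(["MKKLA", "MKA"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

By construction, the frequencies of all observed dipeptides sum to 1000‰.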

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
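The uniform expectation and the over/under labelling can be sketched as follows. This is a minimal example assuming the reference value is the uniform expectation described above; the function names are hypothetical, and the four example values are taken from the table below.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, expected):
    """Label a dipeptide relative to the expected frequency."""
    if value_per_mille > expected:
        return "over"   # marked in red on the original page
    if value_per_mille < expected:
        return "under"  # marked in blue on the original page
    return "as expected"


expected = uniform_expectation_per_mille()         # 20 x 20 = 400 dipeptides
expected_with_xaa = uniform_expectation_per_mille(21)  # 21 x 21 = 441 dipeptides

# A few values copied from the table (in per mille).
examples = {"LeuLeu": 10.07, "AlaAla": 6.593, "CysCys": 0.065, "TrpTrp": 0.113}
labels = {pair: classify(value, expected) for pair, value in examples.items()}
print(expected, labels)
```

With 20 amino acids the expectation is 2.5‰; including Xaa lowers it to roughly 2.27‰.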

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
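As a worked example of the unit conversion, the snippet below turns the AlaAla value from the table into a fraction and a rough absolute count. The count is only an estimate under a stated assumption: that dipeptides are overlapping pairs within each protein, so the proteome totals given in the statistics line (2245 proteins, 660971 amino acids) yield 660971 - 2245 dipeptides. The helper name is hypothetical.

```python
PER_MILLE = 1e-3


def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction."""
    return value_per_mille * PER_MILLE


ala_ala_fraction = per_mille_to_fraction(6.593)  # AlaAla value from the table
ala_ala_percent = ala_ala_fraction * 100

# Assumption: each protein of length n contributes n - 1 dipeptides.
n_proteins, n_residues = 2245, 660971
n_dipeptides = n_residues - n_proteins
approx_ala_ala_count = ala_ala_fraction * n_dipeptides

print(ala_ala_fraction, ala_ala_percent, round(approx_ala_ala_count))
```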

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by the first residue. Within each group the second residue follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.593 ± 0.138
AlaCys: 0.474 ± 0.028
AlaAsp: 4.002 ± 0.082
AlaGlu: 5.445 ± 0.106
AlaPhe: 3.521 ± 0.083
AlaGly: 5.22 ± 0.106
AlaHis: 1.268 ± 0.048
AlaIle: 5.785 ± 0.107
AlaLys: 4.856 ± 0.099
AlaLeu: 7.447 ± 0.113
AlaMet: 1.994 ± 0.056
AlaAsn: 2.928 ± 0.077
AlaPro: 1.855 ± 0.066
AlaGln: 2.637 ± 0.066
AlaArg: 2.381 ± 0.068
AlaSer: 4.153 ± 0.08
AlaThr: 3.823 ± 0.074
AlaVal: 5.555 ± 0.099
AlaTrp: 0.566 ± 0.03
AlaTyr: 2.643 ± 0.067
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.318 ± 0.025
CysCys: 0.065 ± 0.011
CysAsp: 0.236 ± 0.02
CysGlu: 0.33 ± 0.024
CysPhe: 0.268 ± 0.02
CysGly: 0.493 ± 0.029
CysHis: 0.147 ± 0.015
CysIle: 0.374 ± 0.024
CysLys: 0.239 ± 0.021
CysLeu: 0.573 ± 0.031
CysMet: 0.135 ± 0.014
CysAsn: 0.186 ± 0.017
CysPro: 0.241 ± 0.021
CysGln: 0.2 ± 0.017
CysArg: 0.233 ± 0.018
CysSer: 0.38 ± 0.027
CysThr: 0.254 ± 0.018
CysVal: 0.336 ± 0.022
CysTrp: 0.047 ± 0.009
CysTyr: 0.233 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.569 ± 0.082
AspCys: 0.3 ± 0.022
AspAsp: 2.583 ± 0.073
AspGlu: 4.714 ± 0.083
AspPhe: 2.746 ± 0.055
AspGly: 3.345 ± 0.083
AspHis: 1.245 ± 0.052
AspIle: 4.239 ± 0.086
AspLys: 3.51 ± 0.077
AspLeu: 5.335 ± 0.088
AspMet: 1.437 ± 0.048
AspAsn: 1.97 ± 0.049
AspPro: 2.167 ± 0.055
AspGln: 2.614 ± 0.057
AspArg: 2.238 ± 0.07
AspSer: 2.586 ± 0.072
AspThr: 3.095 ± 0.074
AspVal: 3.661 ± 0.074
AspTrp: 0.61 ± 0.032
AspTyr: 2.481 ± 0.061
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.05 ± 0.107
GluCys: 0.28 ± 0.022
GluAsp: 4.279 ± 0.083
GluGlu: 7.816 ± 0.145
GluPhe: 2.039 ± 0.053
GluGly: 4.164 ± 0.088
GluHis: 1.353 ± 0.045
GluIle: 5.459 ± 0.1
GluLys: 7.664 ± 0.118
GluLeu: 6.559 ± 0.109
GluMet: 2.542 ± 0.069
GluAsn: 4.006 ± 0.09
GluPro: 2.103 ± 0.056
GluGln: 3.893 ± 0.084
GluArg: 3.409 ± 0.077
GluSer: 3.636 ± 0.078
GluThr: 4.837 ± 0.086
GluVal: 5.064 ± 0.104
GluTrp: 0.784 ± 0.033
GluTyr: 2.044 ± 0.058
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.0 ± 0.065
PheCys: 0.28 ± 0.021
PheAsp: 2.701 ± 0.063
PheGlu: 2.757 ± 0.066
PhePhe: 2.43 ± 0.073
PheGly: 3.18 ± 0.078
PheHis: 0.812 ± 0.037
PheIle: 3.492 ± 0.084
PheLys: 2.584 ± 0.06
PheLeu: 4.332 ± 0.102
PheMet: 1.168 ± 0.051
PheAsn: 1.914 ± 0.058
PhePro: 1.508 ± 0.046
PheGln: 1.549 ± 0.052
PheArg: 1.396 ± 0.04
PheSer: 3.437 ± 0.08
PheThr: 2.484 ± 0.061
PheVal: 3.023 ± 0.072
PheTrp: 0.458 ± 0.031
PheTyr: 1.865 ± 0.061
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.551 ± 0.099
GlyCys: 0.399 ± 0.025
GlyAsp: 3.105 ± 0.079
GlyGlu: 4.288 ± 0.083
GlyPhe: 3.161 ± 0.072
GlyGly: 4.357 ± 0.109
GlyHis: 1.265 ± 0.049
GlyIle: 5.664 ± 0.112
GlyLys: 5.121 ± 0.092
GlyLeu: 6.348 ± 0.12
GlyMet: 1.961 ± 0.06
GlyAsn: 2.682 ± 0.064
GlyPro: 1.502 ± 0.05
GlyGln: 2.254 ± 0.052
GlyArg: 2.38 ± 0.068
GlySer: 3.993 ± 0.074
GlyThr: 4.079 ± 0.087
GlyVal: 4.419 ± 0.091
GlyTrp: 0.661 ± 0.038
GlyTyr: 2.734 ± 0.065
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.247 ± 0.038
HisCys: 0.141 ± 0.015
HisAsp: 0.967 ± 0.041
HisGlu: 1.281 ± 0.054
HisPhe: 1.107 ± 0.048
HisGly: 1.147 ± 0.044
HisHis: 0.719 ± 0.036
HisIle: 1.359 ± 0.047
HisLys: 0.947 ± 0.035
HisLeu: 2.209 ± 0.059
HisMet: 0.474 ± 0.027
HisAsn: 0.584 ± 0.029
HisPro: 0.996 ± 0.042
HisGln: 1.106 ± 0.049
HisArg: 0.799 ± 0.036
HisSer: 1.162 ± 0.04
HisThr: 1.136 ± 0.041
HisVal: 1.197 ± 0.042
HisTrp: 0.207 ± 0.017
HisTyr: 0.979 ± 0.043
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.801 ± 0.117
IleCys: 0.519 ± 0.031
IleAsp: 4.572 ± 0.08
IleGlu: 5.743 ± 0.105
IlePhe: 3.283 ± 0.08
IleGly: 5.563 ± 0.115
IleHis: 1.493 ± 0.051
IleIle: 5.032 ± 0.093
IleLys: 4.633 ± 0.095
IleLeu: 7.031 ± 0.113
IleMet: 1.607 ± 0.053
IleAsn: 3.0 ± 0.077
IlePro: 3.073 ± 0.079
IleGln: 3.318 ± 0.073
IleArg: 2.993 ± 0.069
IleSer: 4.672 ± 0.093
IleThr: 3.943 ± 0.081
IleVal: 5.345 ± 0.096
IleTrp: 0.575 ± 0.036
IleTyr: 2.436 ± 0.064
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.192 ± 0.104
LysCys: 0.227 ± 0.021
LysAsp: 4.332 ± 0.079
LysGlu: 7.913 ± 0.137
LysPhe: 1.732 ± 0.054
LysGly: 4.454 ± 0.082
LysHis: 1.115 ± 0.039
LysIle: 4.537 ± 0.088
LysLys: 6.813 ± 0.115
LysLeu: 5.256 ± 0.097
LysMet: 2.459 ± 0.063
LysAsn: 3.658 ± 0.089
LysPro: 2.047 ± 0.054
LysGln: 3.412 ± 0.082
LysArg: 3.363 ± 0.079
LysSer: 3.351 ± 0.08
LysThr: 4.095 ± 0.087
LysVal: 4.578 ± 0.086
LysTrp: 0.676 ± 0.029
LysTyr: 1.993 ± 0.058
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.445 ± 0.12
LeuCys: 0.48 ± 0.032
LeuAsp: 5.263 ± 0.099
LeuGlu: 6.61 ± 0.112
LeuPhe: 4.758 ± 0.11
LeuGly: 5.909 ± 0.11
LeuHis: 1.723 ± 0.051
LeuIle: 7.058 ± 0.119
LeuLys: 6.64 ± 0.108
LeuLeu: 10.07 ± 0.163
LeuMet: 2.702 ± 0.059
LeuAsn: 4.549 ± 0.082
LeuPro: 3.837 ± 0.075
LeuGln: 3.602 ± 0.072
LeuArg: 3.186 ± 0.075
LeuSer: 7.085 ± 0.1
LeuThr: 6.043 ± 0.114
LeuVal: 6.068 ± 0.099
LeuTrp: 0.707 ± 0.035
LeuTyr: 3.213 ± 0.072
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.064 ± 0.057
MetCys: 0.115 ± 0.014
MetAsp: 1.648 ± 0.054
MetGlu: 2.042 ± 0.06
MetPhe: 0.832 ± 0.033
MetGly: 1.705 ± 0.052
MetHis: 0.434 ± 0.026
MetIle: 2.212 ± 0.062
MetLys: 2.413 ± 0.059
MetLeu: 2.359 ± 0.061
MetMet: 0.905 ± 0.04
MetAsn: 1.552 ± 0.047
MetPro: 1.042 ± 0.038
MetGln: 1.082 ± 0.045
MetArg: 1.051 ± 0.04
MetSer: 1.757 ± 0.05
MetThr: 1.676 ± 0.05
MetVal: 1.832 ± 0.048
MetTrp: 0.163 ± 0.015
MetTyr: 0.69 ± 0.031
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.829 ± 0.064
AsnCys: 0.256 ± 0.021
AsnAsp: 2.433 ± 0.071
AsnGlu: 3.716 ± 0.083
AsnPhe: 1.728 ± 0.048
AsnGly: 3.407 ± 0.074
AsnHis: 1.056 ± 0.039
AsnIle: 3.233 ± 0.076
AsnLys: 3.006 ± 0.081
AsnLeu: 3.784 ± 0.076
AsnMet: 1.15 ± 0.041
AsnAsn: 1.837 ± 0.063
AsnPro: 1.767 ± 0.055
AsnGln: 2.424 ± 0.07
AsnArg: 1.97 ± 0.063
AsnSer: 2.05 ± 0.053
AsnThr: 2.133 ± 0.054
AsnVal: 2.778 ± 0.065
AsnTrp: 0.576 ± 0.029
AsnTyr: 1.513 ± 0.05
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.409 ± 0.068
ProCys: 0.162 ± 0.017
ProAsp: 1.97 ± 0.057
ProGlu: 3.147 ± 0.071
ProPhe: 1.796 ± 0.058
ProGly: 1.958 ± 0.056
ProHis: 0.72 ± 0.036
ProIle: 2.695 ± 0.067
ProLys: 2.13 ± 0.054
ProLeu: 3.272 ± 0.087
ProMet: 0.847 ± 0.031
ProAsn: 1.54 ± 0.054
ProPro: 0.716 ± 0.035
ProGln: 1.133 ± 0.043
ProArg: 1.038 ± 0.044
ProSer: 2.106 ± 0.061
ProThr: 1.92 ± 0.059
ProVal: 2.688 ± 0.064
ProTrp: 0.313 ± 0.023
ProTyr: 1.398 ± 0.053
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.092 ± 0.072
GlnCys: 0.132 ± 0.015
GlnAsp: 1.979 ± 0.055
GlnGlu: 3.398 ± 0.076
GlnPhe: 1.698 ± 0.053
GlnGly: 1.988 ± 0.055
GlnHis: 0.994 ± 0.04
GlnIle: 2.855 ± 0.078
GlnLys: 3.427 ± 0.073
GlnLeu: 4.973 ± 0.1
GlnMet: 1.288 ± 0.045
GlnAsn: 1.764 ± 0.056
GlnPro: 1.327 ± 0.05
GlnGln: 2.61 ± 0.079
GlnArg: 1.67 ± 0.05
GlnSer: 2.36 ± 0.063
GlnThr: 2.628 ± 0.07
GlnVal: 2.67 ± 0.068
GlnTrp: 0.407 ± 0.028
GlnTyr: 1.654 ± 0.052
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.281 ± 0.063
ArgCys: 0.169 ± 0.016
ArgAsp: 1.964 ± 0.06
ArgGlu: 2.973 ± 0.078
ArgPhe: 1.799 ± 0.058
ArgGly: 2.156 ± 0.056
ArgHis: 0.746 ± 0.036
ArgIle: 2.881 ± 0.078
ArgLys: 3.139 ± 0.079
ArgLeu: 3.95 ± 0.091
ArgMet: 1.245 ± 0.038
ArgAsn: 1.805 ± 0.054
ArgPro: 1.321 ± 0.045
ArgGln: 1.664 ± 0.056
ArgArg: 1.787 ± 0.059
ArgSer: 2.219 ± 0.055
ArgThr: 2.118 ± 0.054
ArgVal: 2.428 ± 0.066
ArgTrp: 0.301 ± 0.022
ArgTyr: 1.743 ± 0.056
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.139 ± 0.081
SerCys: 0.309 ± 0.022
SerAsp: 2.991 ± 0.068
SerGlu: 4.059 ± 0.079
SerPhe: 3.295 ± 0.074
SerGly: 4.344 ± 0.095
SerHis: 1.156 ± 0.039
SerIle: 4.614 ± 0.089
SerLys: 3.861 ± 0.085
SerLeu: 6.099 ± 0.101
SerMet: 1.483 ± 0.049
SerAsn: 2.572 ± 0.06
SerPro: 1.905 ± 0.05
SerGln: 2.25 ± 0.054
SerArg: 2.339 ± 0.064
SerSer: 4.38 ± 0.114
SerThr: 3.148 ± 0.068
SerVal: 4.129 ± 0.083
SerTrp: 0.584 ± 0.031
SerTyr: 2.313 ± 0.06
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.581 ± 0.084
ThrCys: 0.257 ± 0.021
ThrAsp: 3.194 ± 0.083
ThrGlu: 3.903 ± 0.077
ThrPhe: 2.8 ± 0.065
ThrGly: 4.263 ± 0.087
ThrHis: 1.18 ± 0.048
ThrIle: 4.864 ± 0.089
ThrLys: 3.583 ± 0.076
ThrLeu: 5.507 ± 0.1
ThrMet: 1.309 ± 0.043
ThrAsn: 2.49 ± 0.069
ThrPro: 2.16 ± 0.055
ThrGln: 1.859 ± 0.058
ThrArg: 1.871 ± 0.054
ThrSer: 3.527 ± 0.073
ThrThr: 3.31 ± 0.075
ThrVal: 4.439 ± 0.094
ThrTrp: 0.434 ± 0.026
ThrTyr: 1.973 ± 0.057
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.965 ± 0.091
ValCys: 0.446 ± 0.029
ValAsp: 3.689 ± 0.097
ValGlu: 4.822 ± 0.102
ValPhe: 3.03 ± 0.084
ValGly: 4.36 ± 0.083
ValHis: 1.239 ± 0.048
ValIle: 5.22 ± 0.101
ValLys: 4.31 ± 0.078
ValLeu: 6.851 ± 0.114
ValMet: 1.831 ± 0.057
ValAsn: 2.711 ± 0.068
ValPro: 2.679 ± 0.066
ValGln: 2.636 ± 0.064
ValArg: 2.484 ± 0.058
ValSer: 4.642 ± 0.088
ValThr: 4.118 ± 0.079
ValVal: 4.866 ± 0.096
ValTrp: 0.558 ± 0.029
ValTyr: 2.4 ± 0.066
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.504 ± 0.027
TrpCys: 0.038 ± 0.006
TrpAsp: 0.471 ± 0.029
TrpGlu: 0.607 ± 0.031
TrpPhe: 0.436 ± 0.028
TrpGly: 0.575 ± 0.036
TrpHis: 0.186 ± 0.019
TrpIle: 0.749 ± 0.033
TrpLys: 0.629 ± 0.03
TrpLeu: 1.041 ± 0.048
TrpMet: 0.265 ± 0.025
TrpAsn: 0.516 ± 0.031
TrpPro: 0.236 ± 0.022
TrpGln: 0.49 ± 0.024
TrpArg: 0.348 ± 0.024
TrpSer: 0.525 ± 0.031
TrpThr: 0.498 ± 0.032
TrpVal: 0.542 ± 0.029
TrpTrp: 0.113 ± 0.013
TrpTyr: 0.348 ± 0.024
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.514 ± 0.062
TyrCys: 0.239 ± 0.019
TyrAsp: 2.07 ± 0.062
TyrGlu: 2.415 ± 0.058
TyrPhe: 1.912 ± 0.055
TyrGly: 2.336 ± 0.058
TyrHis: 0.911 ± 0.038
TyrIle: 2.309 ± 0.062
TyrLys: 1.696 ± 0.056
TyrLeu: 3.956 ± 0.081
TyrMet: 0.775 ± 0.029
TyrAsn: 1.38 ± 0.048
TyrPro: 1.477 ± 0.048
TyrGln: 2.182 ± 0.063
TyrArg: 1.753 ± 0.052
TyrSer: 1.95 ± 0.057
TyrThr: 2.167 ± 0.059
TyrVal: 2.238 ± 0.058
TyrTrp: 0.374 ± 0.027
TyrTyr: 1.427 ± 0.054
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2245 proteins (660971 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
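Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. This is a minimal sketch and not the Proteome-pI code. Using the standard deviation of the replicates as the reported error is an assumption, the function names are hypothetical, and the sequences are toy data in one-letter code.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter pair in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation (in per mille) of the pair frequency over replicates.

    Each replicate resamples whole proteins with replacement, so residues
    from the same protein are always kept together.
    """
    rng = random.Random(seed)
    # Pre-compute (pair hits, number of dipeptides) once per protein.
    per_protein = [(count_pair(s, pair), max(len(s) - 1, 0)) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(n for _, n in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences).
toy_proteome = ["MKKLAKK", "MAAAL", "MKLKK", "MLLA"]
error_kk = bootstrap_error(toy_proteome, "KK")
print(f"KK error: {error_kk:.1f} per mille")
```

Resampling proteins rather than individual residues preserves within-protein composition, which is why the resulting error reflects variation between proteins.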

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski