Amino acid dipeptide frequency for Balneola sp. EhC07

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
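As a rough illustration of the calculation, the Python sketch below counts overlapping pairs of adjacent residues within each protein and converts the counts to per mille. The function names, the one-letter codes (with X standing in for Xaa), and the choice not to count pairs across protein boundaries are assumptions made for this example; they do not describe how Proteome-pI computes its table.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumption: the input
# sequences use one-letter codes, unlike the three-letter labels in the table).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(sequences):
    """Count overlapping dipeptides; non-standard residues are mapped to 'X'."""
    counts = Counter()
    for seq in sequences:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping windows: residues (0,1), (1,2), ... within one protein.
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    return counts


def dipeptide_frequencies_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the Balneola sp. EhC07 proteome.
freqs = dipeptide_frequencies_per_mille(["MKKLA", "MKA"])
```

Because the windows overlap, a protein of length n contributes n - 1 dipeptides, and the frequencies of all observed dipeptides sum to 1000‰.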

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
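The comparison against the uniform expectation can be sketched in a few lines of Python. This assumes the colouring simply compares each value with 1/400 (or 1/441 when Xaa is included); the page does not state an exact threshold, and the function names are hypothetical. The three example values are taken from the table on this page.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(freq_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_per_mille > expected_per_mille:
        return "over"   # would be marked red under this assumption
    if freq_per_mille < expected_per_mille:
        return "under"  # would be marked blue under this assumption
    return "as expected"


expected = uniform_expectation_per_mille()  # 20 x 20 = 400 dipeptides

# Per-mille values copied from the table for three dipeptides.
examples = {"LeuLeu": 8.298, "CysCys": 0.082, "AlaAla": 4.222}
labels = {pair: classify(value, expected) for pair, value in examples.items()}

# Converting a per-mille value to a plain fraction means multiplying by 1e-3.
leu_leu_fraction = examples["LeuLeu"] * 1e-3
```

A uniform baseline ignores the fact that single amino acids are themselves unequally frequent, so a dipeptide of two common residues will tend to look over-represented under this simple comparison.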

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.222 ± 0.08
AlaCys: 0.446 ± 0.023
AlaAsp: 3.689 ± 0.065
AlaGlu: 4.494 ± 0.077
AlaPhe: 3.293 ± 0.063
AlaGly: 4.654 ± 0.09
AlaHis: 1.043 ± 0.035
AlaIle: 4.615 ± 0.086
AlaLys: 3.939 ± 0.078
AlaLeu: 6.203 ± 0.105
AlaMet: 1.442 ± 0.043
AlaAsn: 2.969 ± 0.057
AlaPro: 1.966 ± 0.047
AlaGln: 2.422 ± 0.049
AlaArg: 2.343 ± 0.054
AlaSer: 4.518 ± 0.087
AlaThr: 3.512 ± 0.077
AlaVal: 4.308 ± 0.077
AlaTrp: 0.689 ± 0.025
AlaTyr: 2.289 ± 0.047
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.384 ± 0.022
CysCys: 0.082 ± 0.009
CysAsp: 0.363 ± 0.018
CysGlu: 0.357 ± 0.019
CysPhe: 0.336 ± 0.018
CysGly: 0.497 ± 0.026
CysHis: 0.143 ± 0.013
CysIle: 0.441 ± 0.02
CysLys: 0.303 ± 0.018
CysLeu: 0.509 ± 0.023
CysMet: 0.102 ± 0.01
CysAsn: 0.239 ± 0.016
CysPro: 0.237 ± 0.018
CysGln: 0.134 ± 0.013
CysArg: 0.197 ± 0.013
CysSer: 0.529 ± 0.023
CysThr: 0.326 ± 0.017
CysVal: 0.36 ± 0.02
CysTrp: 0.054 ± 0.007
CysTyr: 0.199 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.62 ± 0.068
AspCys: 0.3 ± 0.019
AspAsp: 3.26 ± 0.073
AspGlu: 4.759 ± 0.07
AspPhe: 3.497 ± 0.059
AspGly: 4.415 ± 0.098
AspHis: 1.138 ± 0.033
AspIle: 4.479 ± 0.075
AspLys: 3.52 ± 0.073
AspLeu: 5.908 ± 0.076
AspMet: 1.221 ± 0.038
AspAsn: 2.761 ± 0.066
AspPro: 2.536 ± 0.053
AspGln: 2.16 ± 0.048
AspArg: 2.441 ± 0.053
AspSer: 4.444 ± 0.073
AspThr: 2.823 ± 0.067
AspVal: 3.808 ± 0.073
AspTrp: 0.879 ± 0.036
AspTyr: 2.369 ± 0.055
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.792 ± 0.082
GluCys: 0.328 ± 0.019
GluAsp: 4.165 ± 0.069
GluGlu: 6.229 ± 0.12
GluPhe: 3.593 ± 0.056
GluGly: 4.348 ± 0.065
GluHis: 1.265 ± 0.04
GluIle: 5.916 ± 0.078
GluLys: 5.305 ± 0.103
GluLeu: 7.522 ± 0.106
GluMet: 1.812 ± 0.049
GluAsn: 4.495 ± 0.066
GluPro: 1.984 ± 0.043
GluGln: 2.639 ± 0.058
GluArg: 3.042 ± 0.058
GluSer: 4.476 ± 0.066
GluThr: 3.761 ± 0.062
GluVal: 4.933 ± 0.066
GluTrp: 0.846 ± 0.029
GluTyr: 2.74 ± 0.051
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.991 ± 0.055
PheCys: 0.353 ± 0.02
PheAsp: 3.583 ± 0.06
PheGlu: 3.868 ± 0.069
PhePhe: 2.606 ± 0.061
PheGly: 3.98 ± 0.064
PheHis: 0.774 ± 0.027
PheIle: 3.661 ± 0.063
PheLys: 3.092 ± 0.063
PheLeu: 4.654 ± 0.088
PheMet: 1.076 ± 0.036
PheAsn: 2.836 ± 0.074
PhePro: 1.632 ± 0.036
PheGln: 1.513 ± 0.042
PheArg: 1.966 ± 0.041
PheSer: 4.43 ± 0.079
PheThr: 3.022 ± 0.053
PheVal: 3.097 ± 0.055
PheTrp: 0.667 ± 0.026
PheTyr: 1.922 ± 0.039
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.434 ± 0.075
GlyCys: 0.486 ± 0.022
GlyAsp: 3.852 ± 0.081
GlyGlu: 4.275 ± 0.065
GlyPhe: 4.013 ± 0.068
GlyGly: 4.98 ± 0.114
GlyHis: 1.078 ± 0.037
GlyIle: 5.739 ± 0.08
GlyLys: 4.331 ± 0.075
GlyLeu: 6.46 ± 0.098
GlyMet: 1.739 ± 0.048
GlyAsn: 3.691 ± 0.084
GlyPro: 1.492 ± 0.04
GlyGln: 1.848 ± 0.049
GlyArg: 2.496 ± 0.051
GlySer: 5.07 ± 0.088
GlyThr: 4.335 ± 0.116
GlyVal: 4.67 ± 0.063
GlyTrp: 0.903 ± 0.037
GlyTyr: 2.768 ± 0.047
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.966 ± 0.03
HisCys: 0.136 ± 0.013
HisAsp: 0.9 ± 0.03
HisGlu: 1.061 ± 0.034
HisPhe: 1.008 ± 0.038
HisGly: 1.076 ± 0.035
HisHis: 0.44 ± 0.022
HisIle: 1.234 ± 0.04
HisLys: 0.98 ± 0.03
HisLeu: 1.659 ± 0.047
HisMet: 0.311 ± 0.017
HisAsn: 0.749 ± 0.028
HisPro: 0.941 ± 0.032
HisGln: 0.613 ± 0.026
HisArg: 0.744 ± 0.029
HisSer: 1.209 ± 0.042
HisThr: 0.892 ± 0.024
HisVal: 0.948 ± 0.03
HisTrp: 0.225 ± 0.013
HisTyr: 0.679 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.127 ± 0.083
IleCys: 0.521 ± 0.023
IleAsp: 4.911 ± 0.072
IleGlu: 5.857 ± 0.085
IlePhe: 3.388 ± 0.067
IleGly: 5.187 ± 0.088
IleHis: 1.327 ± 0.044
IleIle: 5.205 ± 0.092
IleLys: 4.637 ± 0.076
IleLeu: 6.493 ± 0.111
IleMet: 1.319 ± 0.035
IleAsn: 4.032 ± 0.062
IlePro: 3.308 ± 0.065
IleGln: 2.607 ± 0.053
IleArg: 3.05 ± 0.057
IleSer: 6.277 ± 0.088
IleThr: 4.61 ± 0.081
IleVal: 4.32 ± 0.072
IleTrp: 0.779 ± 0.035
IleTyr: 2.485 ± 0.053
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.445 ± 0.08
LysCys: 0.217 ± 0.014
LysAsp: 3.823 ± 0.062
LysGlu: 5.947 ± 0.103
LysPhe: 2.491 ± 0.052
LysGly: 3.757 ± 0.07
LysHis: 1.071 ± 0.032
LysIle: 4.708 ± 0.078
LysLys: 5.487 ± 0.11
LysLeu: 5.684 ± 0.084
LysMet: 1.654 ± 0.04
LysAsn: 3.845 ± 0.073
LysPro: 2.203 ± 0.053
LysGln: 2.107 ± 0.053
LysArg: 2.68 ± 0.056
LysSer: 4.135 ± 0.07
LysThr: 3.631 ± 0.065
LysVal: 4.2 ± 0.06
LysTrp: 0.697 ± 0.027
LysTyr: 2.329 ± 0.051
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.829 ± 0.099
LeuCys: 0.584 ± 0.021
LeuAsp: 5.462 ± 0.07
LeuGlu: 6.636 ± 0.094
LeuPhe: 4.801 ± 0.087
LeuGly: 5.92 ± 0.075
LeuHis: 1.46 ± 0.041
LeuIle: 7.37 ± 0.119
LeuLys: 6.549 ± 0.09
LeuLeu: 8.298 ± 0.137
LeuMet: 2.037 ± 0.046
LeuAsn: 5.373 ± 0.079
LeuPro: 3.451 ± 0.061
LeuGln: 2.763 ± 0.055
LeuArg: 3.658 ± 0.066
LeuSer: 7.413 ± 0.097
LeuThr: 5.141 ± 0.08
LeuVal: 5.686 ± 0.081
LeuTrp: 0.853 ± 0.031
LeuTyr: 2.981 ± 0.059
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.488 ± 0.039
MetCys: 0.108 ± 0.009
MetAsp: 1.261 ± 0.04
MetGlu: 1.418 ± 0.04
MetPhe: 0.897 ± 0.058
MetGly: 1.419 ± 0.044
MetHis: 0.351 ± 0.017
MetIle: 1.707 ± 0.042
MetLys: 1.997 ± 0.046
MetLeu: 1.852 ± 0.049
MetMet: 0.603 ± 0.026
MetAsn: 1.423 ± 0.038
MetPro: 0.849 ± 0.029
MetGln: 0.684 ± 0.027
MetArg: 0.853 ± 0.028
MetSer: 1.561 ± 0.042
MetThr: 1.08 ± 0.031
MetVal: 1.329 ± 0.039
MetTrp: 0.168 ± 0.012
MetTyr: 0.646 ± 0.025
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.291 ± 0.065
AsnCys: 0.293 ± 0.018
AsnAsp: 3.032 ± 0.063
AsnGlu: 3.974 ± 0.07
AsnPhe: 2.682 ± 0.063
AsnGly: 3.999 ± 0.076
AsnHis: 0.902 ± 0.032
AsnIle: 4.024 ± 0.067
AsnLys: 3.204 ± 0.055
AsnLeu: 4.745 ± 0.065
AsnMet: 1.177 ± 0.04
AsnAsn: 2.947 ± 0.074
AsnPro: 2.749 ± 0.059
AsnGln: 1.968 ± 0.047
AsnArg: 2.323 ± 0.051
AsnSer: 4.172 ± 0.082
AsnThr: 3.05 ± 0.071
AsnVal: 3.1 ± 0.055
AsnTrp: 0.745 ± 0.034
AsnTyr: 2.287 ± 0.051
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.064 ± 0.043
ProCys: 0.178 ± 0.012
ProAsp: 2.701 ± 0.057
ProGlu: 3.364 ± 0.063
ProPhe: 2.065 ± 0.049
ProGly: 2.408 ± 0.05
ProHis: 0.606 ± 0.026
ProIle: 2.515 ± 0.049
ProLys: 2.152 ± 0.046
ProLeu: 2.882 ± 0.059
ProMet: 0.689 ± 0.027
ProAsn: 1.99 ± 0.043
ProPro: 1.0 ± 0.035
ProGln: 1.039 ± 0.034
ProArg: 1.11 ± 0.031
ProSer: 2.466 ± 0.053
ProThr: 1.897 ± 0.047
ProVal: 2.776 ± 0.049
ProTrp: 0.361 ± 0.019
ProTyr: 1.315 ± 0.037
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.095 ± 0.047
GlnCys: 0.105 ± 0.008
GlnAsp: 1.86 ± 0.048
GlnGlu: 2.445 ± 0.056
GlnPhe: 1.62 ± 0.036
GlnGly: 1.835 ± 0.041
GlnHis: 0.53 ± 0.023
GlnIle: 2.69 ± 0.053
GlnLys: 2.474 ± 0.057
GlnLeu: 3.03 ± 0.056
GlnMet: 0.691 ± 0.025
GlnAsn: 2.092 ± 0.045
GlnPro: 1.073 ± 0.031
GlnGln: 1.34 ± 0.041
GlnArg: 1.369 ± 0.038
GlnSer: 2.237 ± 0.047
GlnThr: 1.942 ± 0.041
GlnVal: 2.031 ± 0.044
GlnTrp: 0.333 ± 0.023
GlnTyr: 1.082 ± 0.032
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.382 ± 0.055
ArgCys: 0.172 ± 0.011
ArgAsp: 2.161 ± 0.052
ArgGlu: 2.675 ± 0.057
ArgPhe: 2.274 ± 0.054
ArgGly: 2.247 ± 0.052
ArgHis: 0.633 ± 0.029
ArgIle: 3.189 ± 0.062
ArgLys: 2.692 ± 0.061
ArgLeu: 3.579 ± 0.062
ArgMet: 1.018 ± 0.032
ArgAsn: 2.232 ± 0.049
ArgPro: 1.293 ± 0.036
ArgGln: 1.175 ± 0.039
ArgArg: 1.679 ± 0.05
ArgSer: 2.829 ± 0.049
ArgThr: 2.171 ± 0.043
ArgVal: 2.612 ± 0.059
ArgTrp: 0.426 ± 0.02
ArgTyr: 1.637 ± 0.038
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.382 ± 0.075
SerCys: 0.441 ± 0.019
SerAsp: 4.738 ± 0.083
SerGlu: 5.379 ± 0.077
SerPhe: 4.22 ± 0.076
SerGly: 6.062 ± 0.129
SerHis: 1.107 ± 0.039
SerIle: 5.662 ± 0.088
SerLys: 4.528 ± 0.071
SerLeu: 6.819 ± 0.094
SerMet: 1.446 ± 0.037
SerAsn: 3.935 ± 0.084
SerPro: 2.44 ± 0.049
SerGln: 2.14 ± 0.046
SerArg: 2.708 ± 0.052
SerSer: 5.907 ± 0.129
SerThr: 4.14 ± 0.073
SerVal: 4.976 ± 0.085
SerTrp: 0.854 ± 0.033
SerTyr: 2.818 ± 0.07
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.506 ± 0.083
ThrCys: 0.282 ± 0.018
ThrAsp: 3.479 ± 0.076
ThrGlu: 3.74 ± 0.058
ThrPhe: 2.952 ± 0.066
ThrGly: 4.48 ± 0.097
ThrHis: 0.906 ± 0.033
ThrIle: 4.385 ± 0.073
ThrLys: 3.208 ± 0.055
ThrLeu: 5.257 ± 0.079
ThrMet: 1.025 ± 0.03
ThrAsn: 2.977 ± 0.075
ThrPro: 2.359 ± 0.053
ThrGln: 1.802 ± 0.046
ThrArg: 1.843 ± 0.04
ThrSer: 4.171 ± 0.082
ThrThr: 3.223 ± 0.08
ThrVal: 3.78 ± 0.088
ThrTrp: 0.608 ± 0.023
ThrTyr: 2.014 ± 0.055
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.18 ± 0.072
ValCys: 0.451 ± 0.022
ValAsp: 3.986 ± 0.066
ValGlu: 4.517 ± 0.07
ValPhe: 3.303 ± 0.056
ValGly: 4.188 ± 0.076
ValHis: 1.082 ± 0.029
ValIle: 4.8 ± 0.073
ValLys: 3.756 ± 0.059
ValLeu: 6.108 ± 0.086
ValMet: 1.353 ± 0.038
ValAsn: 3.337 ± 0.058
ValPro: 2.293 ± 0.046
ValGln: 2.166 ± 0.048
ValArg: 2.421 ± 0.054
ValSer: 5.011 ± 0.086
ValThr: 3.745 ± 0.087
ValVal: 4.184 ± 0.071
ValTrp: 0.72 ± 0.029
ValTyr: 2.261 ± 0.05
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.699 ± 0.027
TrpCys: 0.07 ± 0.008
TrpAsp: 0.792 ± 0.048
TrpGlu: 0.776 ± 0.027
TrpPhe: 0.583 ± 0.028
TrpGly: 0.75 ± 0.026
TrpHis: 0.211 ± 0.013
TrpIle: 0.857 ± 0.029
TrpLys: 0.742 ± 0.027
TrpLeu: 1.053 ± 0.036
TrpMet: 0.328 ± 0.019
TrpAsn: 0.771 ± 0.032
TrpPro: 0.286 ± 0.018
TrpGln: 0.333 ± 0.017
TrpArg: 0.466 ± 0.026
TrpSer: 0.749 ± 0.031
TrpThr: 0.618 ± 0.027
TrpVal: 0.756 ± 0.032
TrpTrp: 0.167 ± 0.013
TrpTyr: 0.456 ± 0.026
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.162 ± 0.042
TyrCys: 0.24 ± 0.017
TyrAsp: 2.32 ± 0.059
TyrGlu: 2.426 ± 0.057
TyrPhe: 2.102 ± 0.051
TyrGly: 2.457 ± 0.048
TyrHis: 0.704 ± 0.027
TyrIle: 2.245 ± 0.048
TyrLys: 2.212 ± 0.04
TyrLeu: 3.554 ± 0.055
TyrMet: 0.651 ± 0.024
TyrAsn: 1.969 ± 0.047
TyrPro: 1.484 ± 0.041
TyrGln: 1.419 ± 0.034
TyrArg: 1.688 ± 0.04
TyrSer: 3.093 ± 0.069
TyrThr: 2.076 ± 0.052
TyrVal: 1.986 ± 0.045
TyrTrp: 0.489 ± 0.022
TyrTyr: 1.477 ± 0.043
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3097 proteins (1091554 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
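A protein-level bootstrap of this kind can be sketched in Python as follows. Whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. Using the sample standard deviation as the error, the fixed seed, and the function names are all assumptions made for illustration; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def pair_frequency_per_mille(sequences, pair):
    """Per-mille frequency of one dipeptide (one-letter codes) in a protein set."""
    hits = 0
    total = 0
    for seq in sequences:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins (not individual residues) with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteins, not taken from the Balneola sp. EhC07 proteome.
proteins = ["MKKLA", "MKA", "MLLKK", "MAAKL"]
error = bootstrap_error(proteins, "KK")
```

Resampling at the protein level keeps each sequence intact, so the estimate reflects variation between proteins rather than treating every dipeptide as an independent observation.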

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski