Amino acid dipepetide frequency for Halioglobus lutimaris

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.214AlaAla: 12.214 ± 0.145
1.214AlaCys: 1.214 ± 0.036
6.196AlaAsp: 6.196 ± 0.067
7.086AlaGlu: 7.086 ± 0.09
3.51AlaPhe: 3.51 ± 0.049
8.88AlaGly: 8.88 ± 0.09
1.936AlaHis: 1.936 ± 0.037
5.262AlaIle: 5.262 ± 0.066
2.73AlaLys: 2.73 ± 0.055
11.645AlaLeu: 11.645 ± 0.105
3.119AlaMet: 3.119 ± 0.049
2.996AlaAsn: 2.996 ± 0.053
4.125AlaPro: 4.125 ± 0.064
4.298AlaGln: 4.298 ± 0.055
6.688AlaArg: 6.688 ± 0.083
6.117AlaSer: 6.117 ± 0.081
5.133AlaThr: 5.133 ± 0.066
7.736AlaVal: 7.736 ± 0.078
1.373AlaTrp: 1.373 ± 0.033
2.665AlaTyr: 2.665 ± 0.045
0.0AlaXaa: 0.0 ± 0.0
Cys
1.129CysAla: 1.129 ± 0.03
0.161CysCys: 0.161 ± 0.009
0.616CysAsp: 0.616 ± 0.024
0.639CysGlu: 0.639 ± 0.021
0.384CysPhe: 0.384 ± 0.016
1.012CysGly: 1.012 ± 0.033
0.342CysHis: 0.342 ± 0.023
0.527CysIle: 0.527 ± 0.019
0.315CysLys: 0.315 ± 0.015
1.063CysLeu: 1.063 ± 0.03
0.224CysMet: 0.224 ± 0.013
0.312CysAsn: 0.312 ± 0.015
0.543CysPro: 0.543 ± 0.022
0.396CysGln: 0.396 ± 0.018
0.634CysArg: 0.634 ± 0.021
0.737CysSer: 0.737 ± 0.026
0.526CysThr: 0.526 ± 0.02
0.687CysVal: 0.687 ± 0.023
0.142CysTrp: 0.142 ± 0.012
0.304CysTyr: 0.304 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
6.011AspAla: 6.011 ± 0.08
0.668AspCys: 0.668 ± 0.022
3.365AspAsp: 3.365 ± 0.06
3.957AspGlu: 3.957 ± 0.06
2.385AspPhe: 2.385 ± 0.042
4.528AspGly: 4.528 ± 0.066
1.252AspHis: 1.252 ± 0.031
3.455AspIle: 3.455 ± 0.053
2.068AspLys: 2.068 ± 0.039
5.745AspLeu: 5.745 ± 0.077
1.553AspMet: 1.553 ± 0.03
1.974AspAsn: 1.974 ± 0.04
2.99AspPro: 2.99 ± 0.047
2.067AspGln: 2.067 ± 0.039
3.382AspArg: 3.382 ± 0.049
3.415AspSer: 3.415 ± 0.059
2.945AspThr: 2.945 ± 0.051
4.041AspVal: 4.041 ± 0.057
1.105AspTrp: 1.105 ± 0.027
2.14AspTyr: 2.14 ± 0.047
0.0AspXaa: 0.0 ± 0.0
Glu
6.534GluAla: 6.534 ± 0.07
0.492GluCys: 0.492 ± 0.019
3.493GluAsp: 3.493 ± 0.049
4.31GluGlu: 4.31 ± 0.074
2.234GluPhe: 2.234 ± 0.043
4.399GluGly: 4.399 ± 0.065
1.533GluHis: 1.533 ± 0.036
3.441GluIle: 3.441 ± 0.051
2.561GluLys: 2.561 ± 0.046
7.242GluLeu: 7.242 ± 0.097
1.668GluMet: 1.668 ± 0.039
2.121GluAsn: 2.121 ± 0.035
2.563GluPro: 2.563 ± 0.051
3.652GluGln: 3.652 ± 0.06
4.376GluArg: 4.376 ± 0.061
3.544GluSer: 3.544 ± 0.054
3.085GluThr: 3.085 ± 0.046
4.615GluVal: 4.615 ± 0.069
0.77GluTrp: 0.77 ± 0.025
1.748GluTyr: 1.748 ± 0.04
0.0GluXaa: 0.0 ± 0.0
Phe
3.617PheAla: 3.617 ± 0.044
0.49PheCys: 0.49 ± 0.02
2.558PheAsp: 2.558 ± 0.044
2.24PheGlu: 2.24 ± 0.042
1.501PhePhe: 1.501 ± 0.037
3.131PheGly: 3.131 ± 0.052
0.832PheHis: 0.832 ± 0.026
1.783PheIle: 1.783 ± 0.037
1.102PheLys: 1.102 ± 0.025
3.371PheLeu: 3.371 ± 0.058
0.893PheMet: 0.893 ± 0.027
1.398PheAsn: 1.398 ± 0.033
1.588PhePro: 1.588 ± 0.034
1.181PheGln: 1.181 ± 0.03
1.992PheArg: 1.992 ± 0.041
2.801PheSer: 2.801 ± 0.052
2.175PheThr: 2.175 ± 0.047
2.433PheVal: 2.433 ± 0.045
0.461PheTrp: 0.461 ± 0.02
1.159PheTyr: 1.159 ± 0.03
0.0PheXaa: 0.0 ± 0.0
Gly
7.404GlyAla: 7.404 ± 0.094
0.956GlyCys: 0.956 ± 0.032
4.734GlyAsp: 4.734 ± 0.071
5.486GlyGlu: 5.486 ± 0.06
3.32GlyPhe: 3.32 ± 0.05
6.467GlyGly: 6.467 ± 0.097
1.647GlyHis: 1.647 ± 0.037
4.467GlyIle: 4.467 ± 0.055
3.443GlyLys: 3.443 ± 0.06
8.15GlyLeu: 8.15 ± 0.086
2.374GlyMet: 2.374 ± 0.038
2.576GlyAsn: 2.576 ± 0.042
2.7GlyPro: 2.7 ± 0.047
2.959GlyGln: 2.959 ± 0.047
4.738GlyArg: 4.738 ± 0.064
4.932GlySer: 4.932 ± 0.062
3.967GlyThr: 3.967 ± 0.055
5.979GlyVal: 5.979 ± 0.074
1.202GlyTrp: 1.202 ± 0.032
2.567GlyTyr: 2.567 ± 0.045
0.0GlyXaa: 0.0 ± 0.0
His
2.007HisAla: 2.007 ± 0.041
0.348HisCys: 0.348 ± 0.017
1.077HisAsp: 1.077 ± 0.029
1.089HisGlu: 1.089 ± 0.03
0.924HisPhe: 0.924 ± 0.029
1.829HisGly: 1.829 ± 0.04
0.611HisHis: 0.611 ± 0.024
1.135HisIle: 1.135 ± 0.03
0.644HisLys: 0.644 ± 0.023
2.23HisLeu: 2.23 ± 0.047
0.528HisMet: 0.528 ± 0.022
0.692HisAsn: 0.692 ± 0.024
1.339HisPro: 1.339 ± 0.033
0.902HisGln: 0.902 ± 0.028
1.523HisArg: 1.523 ± 0.034
1.361HisSer: 1.361 ± 0.035
1.05HisThr: 1.05 ± 0.026
1.305HisVal: 1.305 ± 0.03
0.392HisTrp: 0.392 ± 0.017
0.827HisTyr: 0.827 ± 0.025
0.0HisXaa: 0.0 ± 0.0
Ile
5.829IleAla: 5.829 ± 0.072
0.584IleCys: 0.584 ± 0.022
3.606IleAsp: 3.606 ± 0.055
3.52IleGlu: 3.52 ± 0.058
1.632IlePhe: 1.632 ± 0.032
4.059IleGly: 4.059 ± 0.057
1.12IleHis: 1.12 ± 0.029
2.301IleIle: 2.301 ± 0.046
1.725IleLys: 1.725 ± 0.037
4.208IleLeu: 4.208 ± 0.052
1.033IleMet: 1.033 ± 0.024
1.997IleAsn: 1.997 ± 0.037
2.52IlePro: 2.52 ± 0.04
1.523IleGln: 1.523 ± 0.035
2.848IleArg: 2.848 ± 0.045
3.363IleSer: 3.363 ± 0.051
2.944IleThr: 2.944 ± 0.043
3.503IleVal: 3.503 ± 0.054
0.572IleTrp: 0.572 ± 0.021
1.381IleTyr: 1.381 ± 0.035
0.0IleXaa: 0.0 ± 0.0
Lys
3.387LysAla: 3.387 ± 0.056
0.224LysCys: 0.224 ± 0.013
1.637LysAsp: 1.637 ± 0.037
1.799LysGlu: 1.799 ± 0.038
0.951LysPhe: 0.951 ± 0.026
2.408LysGly: 2.408 ± 0.043
0.742LysHis: 0.742 ± 0.026
1.627LysIle: 1.627 ± 0.036
1.453LysLys: 1.453 ± 0.048
3.551LysLeu: 3.551 ± 0.053
0.814LysMet: 0.814 ± 0.027
1.07LysAsn: 1.07 ± 0.029
1.779LysPro: 1.779 ± 0.038
1.505LysGln: 1.505 ± 0.034
2.272LysArg: 2.272 ± 0.043
1.948LysSer: 1.948 ± 0.046
1.763LysThr: 1.763 ± 0.03
2.472LysVal: 2.472 ± 0.044
0.407LysTrp: 0.407 ± 0.017
0.817LysTyr: 0.817 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
12.059LeuAla: 12.059 ± 0.117
1.176LeuCys: 1.176 ± 0.032
6.437LeuAsp: 6.437 ± 0.072
6.681LeuGlu: 6.681 ± 0.087
3.763LeuPhe: 3.763 ± 0.06
8.214LeuGly: 8.214 ± 0.083
2.232LeuHis: 2.232 ± 0.048
4.626LeuIle: 4.626 ± 0.069
3.408LeuLys: 3.408 ± 0.055
11.477LeuLeu: 11.477 ± 0.128
2.455LeuMet: 2.455 ± 0.047
3.122LeuAsn: 3.122 ± 0.043
5.396LeuPro: 5.396 ± 0.066
4.672LeuGln: 4.672 ± 0.064
6.675LeuArg: 6.675 ± 0.078
6.667LeuSer: 6.667 ± 0.072
5.118LeuThr: 5.118 ± 0.07
7.437LeuVal: 7.437 ± 0.083
1.284LeuTrp: 1.284 ± 0.032
2.582LeuTyr: 2.582 ± 0.047
0.0LeuXaa: 0.0 ± 0.0
Met
2.808MetAla: 2.808 ± 0.048
0.192MetCys: 0.192 ± 0.012
1.427MetAsp: 1.427 ± 0.035
1.515MetGlu: 1.515 ± 0.029
0.796MetPhe: 0.796 ± 0.027
2.002MetGly: 2.002 ± 0.037
0.559MetHis: 0.559 ± 0.022
1.213MetIle: 1.213 ± 0.03
1.008MetLys: 1.008 ± 0.022
2.619MetLeu: 2.619 ± 0.05
0.669MetMet: 0.669 ± 0.026
0.887MetAsn: 0.887 ± 0.025
1.325MetPro: 1.325 ± 0.03
1.136MetGln: 1.136 ± 0.027
1.607MetArg: 1.607 ± 0.033
1.778MetSer: 1.778 ± 0.034
1.415MetThr: 1.415 ± 0.03
1.758MetVal: 1.758 ± 0.037
0.237MetTrp: 0.237 ± 0.013
0.499MetTyr: 0.499 ± 0.018
0.0MetXaa: 0.0 ± 0.0
Asn
3.29AsnAla: 3.29 ± 0.053
0.366AsnCys: 0.366 ± 0.02
1.683AsnAsp: 1.683 ± 0.041
1.713AsnGlu: 1.713 ± 0.032
1.158AsnPhe: 1.158 ± 0.032
2.455AsnGly: 2.455 ± 0.044
0.668AsnHis: 0.668 ± 0.024
1.723AsnIle: 1.723 ± 0.041
1.012AsnLys: 1.012 ± 0.03
3.118AsnLeu: 3.118 ± 0.045
0.799AsnMet: 0.799 ± 0.021
1.133AsnAsn: 1.133 ± 0.029
2.084AsnPro: 2.084 ± 0.034
1.161AsnGln: 1.161 ± 0.031
2.028AsnArg: 2.028 ± 0.037
1.819AsnSer: 1.819 ± 0.038
1.835AsnThr: 1.835 ± 0.04
2.149AsnVal: 2.149 ± 0.044
0.548AsnTrp: 0.548 ± 0.022
1.019AsnTyr: 1.019 ± 0.027
0.0AsnXaa: 0.0 ± 0.0
Pro
5.126ProAla: 5.126 ± 0.077
0.372ProCys: 0.372 ± 0.016
3.138ProAsp: 3.138 ± 0.051
3.775ProGlu: 3.775 ± 0.048
1.689ProPhe: 1.689 ± 0.032
4.601ProGly: 4.601 ± 0.067
1.008ProHis: 1.008 ± 0.028
1.952ProIle: 1.952 ± 0.035
1.189ProLys: 1.189 ± 0.032
4.769ProLeu: 4.769 ± 0.065
1.201ProMet: 1.201 ± 0.028
1.25ProAsn: 1.25 ± 0.036
1.985ProPro: 1.985 ± 0.043
1.831ProGln: 1.831 ± 0.038
2.612ProArg: 2.612 ± 0.05
2.4ProSer: 2.4 ± 0.042
2.11ProThr: 2.11 ± 0.042
3.865ProVal: 3.865 ± 0.047
0.71ProTrp: 0.71 ± 0.025
1.15ProTyr: 1.15 ± 0.028
0.0ProXaa: 0.0 ± 0.0
Gln
4.472GlnAla: 4.472 ± 0.049
0.358GlnCys: 0.358 ± 0.017
1.955GlnAsp: 1.955 ± 0.04
2.279GlnGlu: 2.279 ± 0.041
1.422GlnPhe: 1.422 ± 0.032
3.125GlnGly: 3.125 ± 0.047
0.946GlnHis: 0.946 ± 0.028
1.841GlnIle: 1.841 ± 0.035
1.168GlnLys: 1.168 ± 0.031
4.809GlnLeu: 4.809 ± 0.07
1.001GlnMet: 1.001 ± 0.025
0.995GlnAsn: 0.995 ± 0.028
1.948GlnPro: 1.948 ± 0.037
2.346GlnGln: 2.346 ± 0.044
3.265GlnArg: 3.265 ± 0.054
2.262GlnSer: 2.262 ± 0.043
1.798GlnThr: 1.798 ± 0.038
3.065GlnVal: 3.065 ± 0.051
0.677GlnTrp: 0.677 ± 0.021
1.089GlnTyr: 1.089 ± 0.029
0.0GlnXaa: 0.0 ± 0.0
Arg
5.806ArgAla: 5.806 ± 0.07
0.68ArgCys: 0.68 ± 0.024
3.768ArgAsp: 3.768 ± 0.054
4.726ArgGlu: 4.726 ± 0.061
2.644ArgPhe: 2.644 ± 0.048
4.371ArgGly: 4.371 ± 0.057
1.505ArgHis: 1.505 ± 0.031
3.361ArgIle: 3.361 ± 0.051
2.384ArgLys: 2.384 ± 0.05
6.766ArgLeu: 6.766 ± 0.069
1.649ArgMet: 1.649 ± 0.033
2.059ArgAsn: 2.059 ± 0.039
2.637ArgPro: 2.637 ± 0.045
2.837ArgGln: 2.837 ± 0.051
4.286ArgArg: 4.286 ± 0.067
3.707ArgSer: 3.707 ± 0.046
2.65ArgThr: 2.65 ± 0.043
4.472ArgVal: 4.472 ± 0.053
1.008ArgTrp: 1.008 ± 0.026
2.065ArgTyr: 2.065 ± 0.041
0.0ArgXaa: 0.0 ± 0.0
Ser
6.431SerAla: 6.431 ± 0.072
0.623SerCys: 0.623 ± 0.022
3.331SerAsp: 3.331 ± 0.052
3.581SerGlu: 3.581 ± 0.058
2.297SerPhe: 2.297 ± 0.044
5.589SerGly: 5.589 ± 0.059
1.287SerHis: 1.287 ± 0.026
3.142SerIle: 3.142 ± 0.048
1.811SerLys: 1.811 ± 0.041
6.633SerLeu: 6.633 ± 0.079
1.587SerMet: 1.587 ± 0.034
1.935SerAsn: 1.935 ± 0.034
2.942SerPro: 2.942 ± 0.052
2.159SerGln: 2.159 ± 0.039
3.932SerArg: 3.932 ± 0.059
3.743SerSer: 3.743 ± 0.06
3.037SerThr: 3.037 ± 0.045
4.384SerVal: 4.384 ± 0.064
0.92SerTrp: 0.92 ± 0.024
1.648SerTyr: 1.648 ± 0.034
0.0SerXaa: 0.0 ± 0.0
Thr
5.347ThrAla: 5.347 ± 0.056
0.49ThrCys: 0.49 ± 0.023
2.993ThrAsp: 2.993 ± 0.047
2.911ThrGlu: 2.911 ± 0.048
1.703ThrPhe: 1.703 ± 0.036
4.762ThrGly: 4.762 ± 0.057
1.047ThrHis: 1.047 ± 0.027
2.41ThrIle: 2.41 ± 0.043
1.02ThrLys: 1.02 ± 0.03
5.976ThrLeu: 5.976 ± 0.077
1.072ThrMet: 1.072 ± 0.029
1.377ThrAsn: 1.377 ± 0.034
2.931ThrPro: 2.931 ± 0.054
1.647ThrGln: 1.647 ± 0.039
3.231ThrArg: 3.231 ± 0.05
2.894ThrSer: 2.894 ± 0.045
2.612ThrThr: 2.612 ± 0.046
3.938ThrVal: 3.938 ± 0.057
0.681ThrTrp: 0.681 ± 0.023
1.289ThrTyr: 1.289 ± 0.03
0.0ThrXaa: 0.0 ± 0.0
Val
7.754ValAla: 7.754 ± 0.086
0.772ValCys: 0.772 ± 0.025
4.609ValAsp: 4.609 ± 0.054
4.749ValGlu: 4.749 ± 0.072
2.698ValPhe: 2.698 ± 0.044
5.096ValGly: 5.096 ± 0.066
1.502ValHis: 1.502 ± 0.032
4.133ValIle: 4.133 ± 0.052
2.365ValLys: 2.365 ± 0.046
7.272ValLeu: 7.272 ± 0.081
1.811ValMet: 1.811 ± 0.04
2.506ValAsn: 2.506 ± 0.041
3.387ValPro: 3.387 ± 0.045
2.379ValGln: 2.379 ± 0.04
3.966ValArg: 3.966 ± 0.052
4.828ValSer: 4.828 ± 0.067
4.049ValThr: 4.049 ± 0.055
5.724ValVal: 5.724 ± 0.076
0.805ValTrp: 0.805 ± 0.024
1.882ValTyr: 1.882 ± 0.033
0.0ValXaa: 0.0 ± 0.0
Trp
1.08TrpAla: 1.08 ± 0.031
0.183TrpCys: 0.183 ± 0.011
0.748TrpAsp: 0.748 ± 0.028
0.784TrpGlu: 0.784 ± 0.023
0.533TrpPhe: 0.533 ± 0.02
0.947TrpGly: 0.947 ± 0.029
0.369TrpHis: 0.369 ± 0.018
0.64TrpIle: 0.64 ± 0.023
0.424TrpLys: 0.424 ± 0.019
1.713TrpLeu: 1.713 ± 0.04
0.39TrpMet: 0.39 ± 0.016
0.447TrpAsn: 0.447 ± 0.019
0.601TrpPro: 0.601 ± 0.021
0.867TrpGln: 0.867 ± 0.027
1.047TrpArg: 1.047 ± 0.029
0.873TrpSer: 0.873 ± 0.028
0.673TrpThr: 0.673 ± 0.024
0.991TrpVal: 0.991 ± 0.028
0.226TrpTrp: 0.226 ± 0.014
0.442TrpTyr: 0.442 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.62TyrAla: 2.62 ± 0.048
0.345TyrCys: 0.345 ± 0.017
1.675TyrAsp: 1.675 ± 0.037
1.538TyrGlu: 1.538 ± 0.034
1.159TyrPhe: 1.159 ± 0.031
2.194TyrGly: 2.194 ± 0.045
0.676TyrHis: 0.676 ± 0.024
1.232TyrIle: 1.232 ± 0.031
0.794TyrLys: 0.794 ± 0.025
3.124TyrLeu: 3.124 ± 0.053
0.609TyrMet: 0.609 ± 0.02
0.915TyrAsn: 0.915 ± 0.024
1.349TyrPro: 1.349 ± 0.037
1.304TyrGln: 1.304 ± 0.034
2.28TyrArg: 2.28 ± 0.039
1.822TyrSer: 1.822 ± 0.037
1.447TyrThr: 1.447 ± 0.033
1.734TyrVal: 1.734 ± 0.035
0.457TyrTrp: 0.457 ± 0.017
0.951TyrTyr: 0.951 ± 0.027
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4080 proteins (1394984 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski