Amino acid dipepetide frequency for Acidipila sp. 4G-K13

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.966AlaAla: 14.966 ± 0.157
1.049AlaCys: 1.049 ± 0.03
5.253AlaAsp: 5.253 ± 0.065
6.609AlaGlu: 6.609 ± 0.088
3.931AlaPhe: 3.931 ± 0.057
9.719AlaGly: 9.719 ± 0.087
2.382AlaHis: 2.382 ± 0.044
5.462AlaIle: 5.462 ± 0.07
3.654AlaLys: 3.654 ± 0.072
11.064AlaLeu: 11.064 ± 0.109
2.593AlaMet: 2.593 ± 0.049
3.009AlaAsn: 3.009 ± 0.058
5.286AlaPro: 5.286 ± 0.08
4.678AlaGln: 4.678 ± 0.063
6.268AlaArg: 6.268 ± 0.091
6.913AlaSer: 6.913 ± 0.077
5.688AlaThr: 5.688 ± 0.079
8.036AlaVal: 8.036 ± 0.082
1.583AlaTrp: 1.583 ± 0.034
2.477AlaTyr: 2.477 ± 0.038
0.0AlaXaa: 0.0 ± 0.0
Cys
0.966CysAla: 0.966 ± 0.028
0.166CysCys: 0.166 ± 0.014
0.435CysAsp: 0.435 ± 0.018
0.444CysGlu: 0.444 ± 0.017
0.387CysPhe: 0.387 ± 0.018
1.011CysGly: 1.011 ± 0.031
0.253CysHis: 0.253 ± 0.016
0.43CysIle: 0.43 ± 0.019
0.16CysLys: 0.16 ± 0.01
0.844CysLeu: 0.844 ± 0.024
0.189CysMet: 0.189 ± 0.012
0.228CysAsn: 0.228 ± 0.011
0.471CysPro: 0.471 ± 0.021
0.218CysGln: 0.218 ± 0.012
0.603CysArg: 0.603 ± 0.022
0.663CysSer: 0.663 ± 0.026
0.457CysThr: 0.457 ± 0.018
0.637CysVal: 0.637 ± 0.022
0.101CysTrp: 0.101 ± 0.008
0.254CysTyr: 0.254 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
5.76AspAla: 5.76 ± 0.071
0.402AspCys: 0.402 ± 0.016
2.477AspAsp: 2.477 ± 0.05
2.957AspGlu: 2.957 ± 0.054
2.187AspPhe: 2.187 ± 0.041
4.354AspGly: 4.354 ± 0.066
1.298AspHis: 1.298 ± 0.034
2.384AspIle: 2.384 ± 0.047
1.445AspLys: 1.445 ± 0.032
5.17AspLeu: 5.17 ± 0.063
0.955AspMet: 0.955 ± 0.025
1.436AspAsn: 1.436 ± 0.034
3.38AspPro: 3.38 ± 0.049
1.853AspGln: 1.853 ± 0.035
3.355AspArg: 3.355 ± 0.049
2.892AspSer: 2.892 ± 0.045
2.673AspThr: 2.673 ± 0.047
3.428AspVal: 3.428 ± 0.055
0.835AspTrp: 0.835 ± 0.025
1.652AspTyr: 1.652 ± 0.034
0.0AspXaa: 0.0 ± 0.0
Glu
6.281GluAla: 6.281 ± 0.09
0.396GluCys: 0.396 ± 0.015
2.571GluAsp: 2.571 ± 0.049
3.564GluGlu: 3.564 ± 0.076
1.927GluPhe: 1.927 ± 0.038
3.691GluGly: 3.691 ± 0.056
1.442GluHis: 1.442 ± 0.032
3.331GluIle: 3.331 ± 0.057
2.501GluLys: 2.501 ± 0.052
5.296GluLeu: 5.296 ± 0.08
1.486GluMet: 1.486 ± 0.031
1.8GluAsn: 1.8 ± 0.034
2.455GluPro: 2.455 ± 0.039
2.634GluGln: 2.634 ± 0.051
4.17GluArg: 4.17 ± 0.067
3.086GluSer: 3.086 ± 0.047
3.187GluThr: 3.187 ± 0.048
3.537GluVal: 3.537 ± 0.054
0.769GluTrp: 0.769 ± 0.025
1.527GluTyr: 1.527 ± 0.041
0.0GluXaa: 0.0 ± 0.0
Phe
4.237PheAla: 4.237 ± 0.061
0.425PheCys: 0.425 ± 0.017
2.381PheAsp: 2.381 ± 0.042
2.066PheGlu: 2.066 ± 0.043
1.733PhePhe: 1.733 ± 0.04
3.551PheGly: 3.551 ± 0.062
1.082PheHis: 1.082 ± 0.028
1.514PheIle: 1.514 ± 0.037
0.726PheLys: 0.726 ± 0.025
3.856PheLeu: 3.856 ± 0.054
0.665PheMet: 0.665 ± 0.023
1.338PheAsn: 1.338 ± 0.044
1.951PhePro: 1.951 ± 0.036
1.272PheGln: 1.272 ± 0.032
2.562PheArg: 2.562 ± 0.044
2.953PheSer: 2.953 ± 0.043
2.517PheThr: 2.517 ± 0.057
2.537PheVal: 2.537 ± 0.046
0.555PheTrp: 0.555 ± 0.02
1.178PheTyr: 1.178 ± 0.029
0.0PheXaa: 0.0 ± 0.0
Gly
7.847GlyAla: 7.847 ± 0.087
0.839GlyCys: 0.839 ± 0.026
3.934GlyAsp: 3.934 ± 0.063
4.144GlyGlu: 4.144 ± 0.061
3.481GlyPhe: 3.481 ± 0.059
6.822GlyGly: 6.822 ± 0.116
2.017GlyHis: 2.017 ± 0.042
4.783GlyIle: 4.783 ± 0.068
3.27GlyLys: 3.27 ± 0.049
7.368GlyLeu: 7.368 ± 0.073
1.986GlyMet: 1.986 ± 0.042
2.977GlyAsn: 2.977 ± 0.074
3.221GlyPro: 3.221 ± 0.05
2.994GlyGln: 2.994 ± 0.052
5.057GlyArg: 5.057 ± 0.07
5.767GlySer: 5.767 ± 0.093
5.297GlyThr: 5.297 ± 0.093
5.733GlyVal: 5.733 ± 0.061
1.343GlyTrp: 1.343 ± 0.033
2.654GlyTyr: 2.654 ± 0.049
0.001GlyXaa: 0.001 ± 0.001
His
2.5HisAla: 2.5 ± 0.044
0.22HisCys: 0.22 ± 0.012
1.229HisAsp: 1.229 ± 0.032
1.211HisGlu: 1.211 ± 0.032
1.064HisPhe: 1.064 ± 0.027
2.116HisGly: 2.116 ± 0.042
0.668HisHis: 0.668 ± 0.021
1.165HisIle: 1.165 ± 0.027
0.517HisLys: 0.517 ± 0.018
2.532HisLeu: 2.532 ± 0.047
0.492HisMet: 0.492 ± 0.018
0.692HisAsn: 0.692 ± 0.023
1.688HisPro: 1.688 ± 0.039
0.769HisGln: 0.769 ± 0.025
1.561HisArg: 1.561 ± 0.031
1.399HisSer: 1.399 ± 0.031
1.313HisThr: 1.313 ± 0.029
1.535HisVal: 1.535 ± 0.033
0.4HisTrp: 0.4 ± 0.018
0.745HisTyr: 0.745 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
6.28IleAla: 6.28 ± 0.084
0.494IleCys: 0.494 ± 0.019
3.1IleAsp: 3.1 ± 0.046
3.134IleGlu: 3.134 ± 0.055
2.019IlePhe: 2.019 ± 0.041
4.239IleGly: 4.239 ± 0.056
1.275IleHis: 1.275 ± 0.03
1.745IleIle: 1.745 ± 0.042
1.068IleLys: 1.068 ± 0.03
4.656IleLeu: 4.656 ± 0.065
0.704IleMet: 0.704 ± 0.023
1.506IleAsn: 1.506 ± 0.037
2.896IlePro: 2.896 ± 0.05
1.637IleGln: 1.637 ± 0.031
3.431IleArg: 3.431 ± 0.053
3.305IleSer: 3.305 ± 0.051
3.112IleThr: 3.112 ± 0.058
3.731IleVal: 3.731 ± 0.054
0.573IleTrp: 0.573 ± 0.023
1.463IleTyr: 1.463 ± 0.032
0.0IleXaa: 0.0 ± 0.0
Lys
3.48LysAla: 3.48 ± 0.058
0.195LysCys: 0.195 ± 0.011
1.689LysAsp: 1.689 ± 0.044
1.8LysGlu: 1.8 ± 0.042
1.039LysPhe: 1.039 ± 0.03
2.19LysGly: 2.19 ± 0.042
0.699LysHis: 0.699 ± 0.022
1.781LysIle: 1.781 ± 0.038
1.614LysLys: 1.614 ± 0.046
3.128LysLeu: 3.128 ± 0.046
0.841LysMet: 0.841 ± 0.026
1.151LysAsn: 1.151 ± 0.03
1.947LysPro: 1.947 ± 0.046
1.448LysGln: 1.448 ± 0.033
1.958LysArg: 1.958 ± 0.041
1.873LysSer: 1.873 ± 0.039
2.09LysThr: 2.09 ± 0.042
2.149LysVal: 2.149 ± 0.038
0.39LysTrp: 0.39 ± 0.016
0.996LysTyr: 0.996 ± 0.027
0.0LysXaa: 0.0 ± 0.0
Leu
11.099LeuAla: 11.099 ± 0.121
0.999LeuCys: 0.999 ± 0.026
5.204LeuAsp: 5.204 ± 0.068
5.182LeuGlu: 5.182 ± 0.075
3.741LeuPhe: 3.741 ± 0.059
7.073LeuGly: 7.073 ± 0.084
2.497LeuHis: 2.497 ± 0.042
4.705LeuIle: 4.705 ± 0.064
3.429LeuLys: 3.429 ± 0.055
10.467LeuLeu: 10.467 ± 0.129
1.993LeuMet: 1.993 ± 0.039
3.275LeuAsn: 3.275 ± 0.055
5.798LeuPro: 5.798 ± 0.054
3.562LeuGln: 3.562 ± 0.053
7.252LeuArg: 7.252 ± 0.089
6.572LeuSer: 6.572 ± 0.059
6.065LeuThr: 6.065 ± 0.073
6.093LeuVal: 6.093 ± 0.078
1.283LeuTrp: 1.283 ± 0.035
2.646LeuTyr: 2.646 ± 0.046
0.0LeuXaa: 0.0 ± 0.0
Met
2.362MetAla: 2.362 ± 0.045
0.139MetCys: 0.139 ± 0.009
0.977MetAsp: 0.977 ± 0.03
1.132MetGlu: 1.132 ± 0.026
0.625MetPhe: 0.625 ± 0.022
1.476MetGly: 1.476 ± 0.032
0.545MetHis: 0.545 ± 0.02
0.961MetIle: 0.961 ± 0.027
0.988MetLys: 0.988 ± 0.028
2.291MetLeu: 2.291 ± 0.042
0.524MetMet: 0.524 ± 0.02
0.802MetAsn: 0.802 ± 0.024
1.417MetPro: 1.417 ± 0.034
1.03MetGln: 1.03 ± 0.029
1.678MetArg: 1.678 ± 0.041
1.365MetSer: 1.365 ± 0.029
1.385MetThr: 1.385 ± 0.035
1.413MetVal: 1.413 ± 0.031
0.212MetTrp: 0.212 ± 0.011
0.43MetTyr: 0.43 ± 0.017
0.0MetXaa: 0.0 ± 0.0
Asn
3.412AsnAla: 3.412 ± 0.063
0.284AsnCys: 0.284 ± 0.014
1.555AsnAsp: 1.555 ± 0.036
1.515AsnGlu: 1.515 ± 0.035
1.355AsnPhe: 1.355 ± 0.042
3.158AsnGly: 3.158 ± 0.073
0.736AsnHis: 0.736 ± 0.023
1.63AsnIle: 1.63 ± 0.042
0.757AsnLys: 0.757 ± 0.026
3.2AsnLeu: 3.2 ± 0.055
0.618AsnMet: 0.618 ± 0.02
1.221AsnAsn: 1.221 ± 0.044
2.424AsnPro: 2.424 ± 0.052
1.24AsnGln: 1.24 ± 0.033
1.843AsnArg: 1.843 ± 0.04
1.902AsnSer: 1.902 ± 0.048
1.976AsnThr: 1.976 ± 0.056
2.217AsnVal: 2.217 ± 0.055
0.474AsnTrp: 0.474 ± 0.019
1.149AsnTyr: 1.149 ± 0.038
0.0AsnXaa: 0.0 ± 0.0
Pro
6.646ProAla: 6.646 ± 0.08
0.375ProCys: 0.375 ± 0.015
3.282ProAsp: 3.282 ± 0.054
3.874ProGlu: 3.874 ± 0.057
2.016ProPhe: 2.016 ± 0.039
5.21ProGly: 5.21 ± 0.077
1.259ProHis: 1.259 ± 0.033
2.142ProIle: 2.142 ± 0.037
1.658ProLys: 1.658 ± 0.034
4.771ProLeu: 4.771 ± 0.064
1.024ProMet: 1.024 ± 0.029
1.698ProAsn: 1.698 ± 0.042
2.722ProPro: 2.722 ± 0.06
2.23ProGln: 2.23 ± 0.047
2.703ProArg: 2.703 ± 0.047
3.503ProSer: 3.503 ± 0.063
2.425ProThr: 2.425 ± 0.056
4.512ProVal: 4.512 ± 0.06
0.733ProTrp: 0.733 ± 0.022
1.423ProTyr: 1.423 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
4.068GlnAla: 4.068 ± 0.056
0.265GlnCys: 0.265 ± 0.015
1.697GlnAsp: 1.697 ± 0.038
1.914GlnGlu: 1.914 ± 0.039
1.44GlnPhe: 1.44 ± 0.038
2.538GlnGly: 2.538 ± 0.043
0.915GlnHis: 0.915 ± 0.026
2.313GlnIle: 2.313 ± 0.043
1.572GlnLys: 1.572 ± 0.034
3.489GlnLeu: 3.489 ± 0.061
1.05GlnMet: 1.05 ± 0.028
1.457GlnAsn: 1.457 ± 0.039
2.327GlnPro: 2.327 ± 0.047
2.287GlnGln: 2.287 ± 0.054
2.601GlnArg: 2.601 ± 0.051
2.496GlnSer: 2.496 ± 0.048
2.549GlnThr: 2.549 ± 0.047
2.532GlnVal: 2.532 ± 0.042
0.526GlnTrp: 0.526 ± 0.02
1.181GlnTyr: 1.181 ± 0.032
0.0GlnXaa: 0.0 ± 0.0
Arg
5.895ArgAla: 5.895 ± 0.079
0.51ArgCys: 0.51 ± 0.019
3.182ArgAsp: 3.182 ± 0.053
4.125ArgGlu: 4.125 ± 0.067
2.68ArgPhe: 2.68 ± 0.046
4.05ArgGly: 4.05 ± 0.062
1.61ArgHis: 1.61 ± 0.028
3.832ArgIle: 3.832 ± 0.056
2.266ArgLys: 2.266 ± 0.045
6.722ArgLeu: 6.722 ± 0.085
1.777ArgMet: 1.777 ± 0.038
2.157ArgAsn: 2.157 ± 0.045
3.044ArgPro: 3.044 ± 0.053
2.674ArgGln: 2.674 ± 0.05
5.084ArgArg: 5.084 ± 0.083
3.994ArgSer: 3.994 ± 0.058
3.461ArgThr: 3.461 ± 0.049
4.334ArgVal: 4.334 ± 0.058
1.049ArgTrp: 1.049 ± 0.03
1.969ArgTyr: 1.969 ± 0.038
0.0ArgXaa: 0.0 ± 0.0
Ser
7.103SerAla: 7.103 ± 0.085
0.583SerCys: 0.583 ± 0.021
3.171SerAsp: 3.171 ± 0.055
3.157SerGlu: 3.157 ± 0.055
2.625SerPhe: 2.625 ± 0.048
6.739SerGly: 6.739 ± 0.107
1.352SerHis: 1.352 ± 0.031
3.315SerIle: 3.315 ± 0.052
1.762SerLys: 1.762 ± 0.036
6.457SerLeu: 6.457 ± 0.073
1.345SerMet: 1.345 ± 0.031
1.995SerAsn: 1.995 ± 0.057
3.512SerPro: 3.512 ± 0.056
2.282SerGln: 2.282 ± 0.047
3.687SerArg: 3.687 ± 0.048
4.756SerSer: 4.756 ± 0.077
3.747SerThr: 3.747 ± 0.072
4.509SerVal: 4.509 ± 0.063
0.894SerTrp: 0.894 ± 0.03
1.816SerTyr: 1.816 ± 0.037
0.0SerXaa: 0.0 ± 0.0
Thr
6.81ThrAla: 6.81 ± 0.094
0.491ThrCys: 0.491 ± 0.021
2.817ThrAsp: 2.817 ± 0.042
2.794ThrGlu: 2.794 ± 0.051
2.243ThrPhe: 2.243 ± 0.048
5.797ThrGly: 5.797 ± 0.093
1.223ThrHis: 1.223 ± 0.032
3.178ThrIle: 3.178 ± 0.062
1.565ThrLys: 1.565 ± 0.036
6.132ThrLeu: 6.132 ± 0.073
1.093ThrMet: 1.093 ± 0.03
1.849ThrAsn: 1.849 ± 0.053
3.755ThrPro: 3.755 ± 0.056
2.075ThrGln: 2.075 ± 0.035
3.031ThrArg: 3.031 ± 0.047
3.62ThrSer: 3.62 ± 0.067
3.536ThrThr: 3.536 ± 0.079
4.591ThrVal: 4.591 ± 0.085
0.855ThrTrp: 0.855 ± 0.026
1.542ThrTyr: 1.542 ± 0.041
0.001ThrXaa: 0.001 ± 0.001
Val
7.078ValAla: 7.078 ± 0.074
0.687ValCys: 0.687 ± 0.023
3.665ValAsp: 3.665 ± 0.049
3.894ValGlu: 3.894 ± 0.052
2.683ValPhe: 2.683 ± 0.046
4.362ValGly: 4.362 ± 0.056
1.544ValHis: 1.544 ± 0.035
3.713ValIle: 3.713 ± 0.051
2.098ValLys: 2.098 ± 0.043
7.26ValLeu: 7.26 ± 0.09
1.533ValMet: 1.533 ± 0.035
2.455ValAsn: 2.455 ± 0.048
3.816ValPro: 3.816 ± 0.051
2.48ValGln: 2.48 ± 0.044
4.543ValArg: 4.543 ± 0.053
4.864ValSer: 4.864 ± 0.059
4.818ValThr: 4.818 ± 0.089
5.109ValVal: 5.109 ± 0.066
0.899ValTrp: 0.899 ± 0.028
1.813ValTyr: 1.813 ± 0.036
0.0ValXaa: 0.0 ± 0.0
Trp
1.035TrpAla: 1.035 ± 0.032
0.127TrpCys: 0.127 ± 0.01
0.604TrpAsp: 0.604 ± 0.021
0.66TrpGlu: 0.66 ± 0.025
0.601TrpPhe: 0.601 ± 0.021
0.907TrpGly: 0.907 ± 0.024
0.377TrpHis: 0.377 ± 0.017
0.817TrpIle: 0.817 ± 0.024
0.695TrpLys: 0.695 ± 0.02
1.556TrpLeu: 1.556 ± 0.038
0.409TrpMet: 0.409 ± 0.019
0.603TrpAsn: 0.603 ± 0.021
0.635TrpPro: 0.635 ± 0.023
0.716TrpGln: 0.716 ± 0.021
0.963TrpArg: 0.963 ± 0.027
1.017TrpSer: 1.017 ± 0.029
0.916TrpThr: 0.916 ± 0.029
0.843TrpVal: 0.843 ± 0.024
0.225TrpTrp: 0.225 ± 0.015
0.408TrpTyr: 0.408 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.795TyrAla: 2.795 ± 0.042
0.269TyrCys: 0.269 ± 0.014
1.668TyrAsp: 1.668 ± 0.041
1.475TyrGlu: 1.475 ± 0.038
1.322TyrPhe: 1.322 ± 0.03
2.535TyrGly: 2.535 ± 0.041
0.644TyrHis: 0.644 ± 0.021
1.165TyrIle: 1.165 ± 0.031
0.713TyrLys: 0.713 ± 0.024
2.673TyrLeu: 2.673 ± 0.042
0.475TyrMet: 0.475 ± 0.018
1.017TyrAsn: 1.017 ± 0.035
1.462TyrPro: 1.462 ± 0.031
1.139TyrGln: 1.139 ± 0.03
2.064TyrArg: 2.064 ± 0.038
1.866TyrSer: 1.866 ± 0.044
1.786TyrThr: 1.786 ± 0.045
1.837TyrVal: 1.837 ± 0.04
0.416TyrTrp: 0.416 ± 0.018
0.893TyrTyr: 0.893 ± 0.031
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.001XaaMet: 0.001 ± 0.001
0.001XaaAsn: 0.001 ± 0.001
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.003XaaXaa: 0.003 ± 0.002
Statistics based on 3979 proteins (1463424 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski