Amino acid dipeptide frequency for Azospirillum sp. RU38E

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
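As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping residue pairs within each protein and normalises by the total number of pairs. The function name `dipeptide_permille`, the use of one-letter codes, and the overlapping-window convention are assumptions made for this example; the page itself does not state how the counting was done.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: dipeptides are counted as overlapping windows of length 2,
    and a window never spans two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not taken from the proteome): two short made-up sequences.
toy_proteins = ["AAGL", "ALA"]
toy_freqs = dipeptide_permille(toy_proteins)
```

With the toy input, five dipeptides are found (AA, AG, GL, AL, LA), so each one gets 200‰.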

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
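The expected value and the over/under comparison can be sketched as follows. The baseline of 1000/400 = 2.5‰ follows from the 400 equally likely dipeptides; the function name `classify` and the simple greater-than/less-than rule are assumptions, since the exact colouring rule used on the page is not stated.

```python
N_STANDARD = 20

# Uniform expectation in per mille: 1000 / 400 = 2.5
expected_permille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: AlaAla and CysCys.
ala_ala_label = classify(18.989)
cys_cys_label = classify(0.09)
```

Under this assumed rule, AlaAla (18.989‰) is overrepresented and CysCys (0.09‰) is underrepresented.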

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
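For example, the unit conversion for the AlaAla entry can be written as below; the helper name `permille_to_fraction` is hypothetical and only serves this illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala_fraction = permille_to_fraction(18.989)  # AlaAla value from the table
ala_ala_percent = ala_ala_fraction * 100.0
```

So 18.989‰ corresponds to a fraction of about 0.018989, or roughly 1.9% of all dipeptides.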

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.989 ± 0.167
AlaCys: 1.03 ± 0.03
AlaAsp: 8.109 ± 0.085
AlaGlu: 7.16 ± 0.084
AlaPhe: 3.783 ± 0.048
AlaGly: 12.473 ± 0.152
AlaHis: 2.116 ± 0.038
AlaIle: 5.621 ± 0.057
AlaLys: 3.777 ± 0.053
AlaLeu: 14.498 ± 0.139
AlaMet: 3.307 ± 0.056
AlaAsn: 3.2 ± 0.06
AlaPro: 6.024 ± 0.092
AlaGln: 4.499 ± 0.06
AlaArg: 8.649 ± 0.099
AlaSer: 6.069 ± 0.094
AlaThr: 6.395 ± 0.118
AlaVal: 8.803 ± 0.091
AlaTrp: 1.525 ± 0.031
AlaTyr: 2.546 ± 0.04
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.856 ± 0.025
CysCys: 0.09 ± 0.008
CysAsp: 0.488 ± 0.017
CysGlu: 0.296 ± 0.012
CysPhe: 0.313 ± 0.014
CysGly: 0.818 ± 0.021
CysHis: 0.239 ± 0.014
CysIle: 0.321 ± 0.015
CysLys: 0.145 ± 0.008
CysLeu: 0.849 ± 0.023
CysMet: 0.151 ± 0.011
CysAsn: 0.18 ± 0.012
CysPro: 0.442 ± 0.018
CysGln: 0.274 ± 0.01
CysArg: 0.618 ± 0.021
CysSer: 0.376 ± 0.015
CysThr: 0.371 ± 0.015
CysVal: 0.51 ± 0.016
CysTrp: 0.132 ± 0.01
CysTyr: 0.161 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.904 ± 0.072
AspCys: 0.417 ± 0.017
AspAsp: 2.969 ± 0.047
AspGlu: 2.751 ± 0.05
AspPhe: 2.117 ± 0.04
AspGly: 6.001 ± 0.077
AspHis: 1.309 ± 0.029
AspIle: 2.978 ± 0.044
AspLys: 1.763 ± 0.036
AspLeu: 6.531 ± 0.072
AspMet: 1.352 ± 0.026
AspAsn: 1.427 ± 0.039
AspPro: 3.531 ± 0.055
AspGln: 2.134 ± 0.042
AspArg: 4.476 ± 0.065
AspSer: 2.417 ± 0.041
AspThr: 2.517 ± 0.05
AspVal: 3.816 ± 0.055
AspTrp: 1.102 ± 0.025
AspTyr: 1.556 ± 0.029
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.668 ± 0.082
GluCys: 0.298 ± 0.014
GluAsp: 2.146 ± 0.042
GluGlu: 2.863 ± 0.057
GluPhe: 1.448 ± 0.029
GluGly: 4.164 ± 0.052
GluHis: 0.833 ± 0.027
GluIle: 2.736 ± 0.047
GluLys: 2.158 ± 0.043
GluLeu: 5.034 ± 0.07
GluMet: 1.48 ± 0.033
GluAsn: 1.425 ± 0.032
GluPro: 2.357 ± 0.039
GluGln: 2.287 ± 0.046
GluArg: 4.15 ± 0.071
GluSer: 1.994 ± 0.035
GluThr: 2.914 ± 0.041
GluVal: 3.501 ± 0.057
GluTrp: 0.662 ± 0.02
GluTyr: 0.884 ± 0.022
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.97 ± 0.057
PheCys: 0.363 ± 0.015
PheAsp: 2.487 ± 0.04
PheGlu: 1.482 ± 0.032
PhePhe: 1.233 ± 0.03
PheGly: 3.183 ± 0.044
PheHis: 0.768 ± 0.022
PheIle: 1.638 ± 0.033
PheLys: 0.925 ± 0.025
PheLeu: 3.153 ± 0.046
PheMet: 0.737 ± 0.021
PheAsn: 1.2 ± 0.034
PhePro: 1.485 ± 0.031
PheGln: 1.228 ± 0.029
PheArg: 2.085 ± 0.036
PheSer: 2.068 ± 0.037
PheThr: 2.236 ± 0.038
PheVal: 2.133 ± 0.038
PheTrp: 0.505 ± 0.02
PheTyr: 0.939 ± 0.028
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.671 ± 0.154
GlyCys: 0.812 ± 0.023
GlyAsp: 4.907 ± 0.08
GlyGlu: 4.439 ± 0.051
GlyPhe: 3.545 ± 0.043
GlyGly: 8.333 ± 0.168
GlyHis: 1.884 ± 0.036
GlyIle: 4.337 ± 0.056
GlyLys: 3.101 ± 0.044
GlyLeu: 10.086 ± 0.093
GlyMet: 2.465 ± 0.04
GlyAsn: 2.631 ± 0.08
GlyPro: 4.017 ± 0.065
GlyGln: 3.512 ± 0.048
GlyArg: 6.516 ± 0.068
GlySer: 4.842 ± 0.129
GlyThr: 5.502 ± 0.205
GlyVal: 6.195 ± 0.067
GlyTrp: 1.648 ± 0.038
GlyTyr: 2.273 ± 0.041
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.156 ± 0.038
HisCys: 0.217 ± 0.01
HisAsp: 1.169 ± 0.033
HisGlu: 0.885 ± 0.023
HisPhe: 0.763 ± 0.022
HisGly: 1.878 ± 0.037
HisHis: 0.569 ± 0.023
HisIle: 0.905 ± 0.022
HisLys: 0.446 ± 0.015
HisLeu: 2.219 ± 0.048
HisMet: 0.44 ± 0.016
HisAsn: 0.456 ± 0.018
HisPro: 1.353 ± 0.038
HisGln: 0.657 ± 0.018
HisArg: 1.45 ± 0.031
HisSer: 0.882 ± 0.024
HisThr: 0.784 ± 0.022
HisVal: 1.39 ± 0.028
HisTrp: 0.354 ± 0.014
HisTyr: 0.637 ± 0.019
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.223 ± 0.064
IleCys: 0.422 ± 0.018
IleAsp: 3.255 ± 0.051
IleGlu: 2.337 ± 0.039
IlePhe: 1.42 ± 0.034
IleGly: 4.571 ± 0.055
IleHis: 0.895 ± 0.025
IleIle: 2.178 ± 0.04
IleLys: 1.283 ± 0.027
IleLeu: 4.04 ± 0.053
IleMet: 0.88 ± 0.026
IleAsn: 1.608 ± 0.043
IlePro: 2.166 ± 0.04
IleGln: 1.47 ± 0.029
IleArg: 3.079 ± 0.043
IleSer: 2.826 ± 0.067
IleThr: 2.886 ± 0.084
IleVal: 3.186 ± 0.045
IleTrp: 0.544 ± 0.019
IleTyr: 0.99 ± 0.024
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.967 ± 0.054
LysCys: 0.128 ± 0.01
LysAsp: 1.499 ± 0.032
LysGlu: 1.433 ± 0.034
LysPhe: 0.799 ± 0.025
LysGly: 2.685 ± 0.046
LysHis: 0.473 ± 0.015
LysIle: 1.327 ± 0.032
LysLys: 1.0 ± 0.027
LysLeu: 3.212 ± 0.053
LysMet: 0.645 ± 0.023
LysAsn: 0.763 ± 0.026
LysPro: 2.054 ± 0.04
LysGln: 1.084 ± 0.024
LysArg: 1.948 ± 0.04
LysSer: 1.437 ± 0.032
LysThr: 1.522 ± 0.035
LysVal: 2.339 ± 0.043
LysTrp: 0.317 ± 0.014
LysTyr: 0.544 ± 0.023
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.365 ± 0.136
LeuCys: 0.902 ± 0.025
LeuAsp: 6.402 ± 0.067
LeuGlu: 5.38 ± 0.077
LeuPhe: 3.724 ± 0.058
LeuGly: 8.75 ± 0.074
LeuHis: 2.035 ± 0.038
LeuIle: 4.444 ± 0.053
LeuLys: 3.06 ± 0.042
LeuLeu: 11.691 ± 0.154
LeuMet: 2.178 ± 0.039
LeuAsn: 2.857 ± 0.044
LeuPro: 6.536 ± 0.084
LeuGln: 2.762 ± 0.046
LeuArg: 7.771 ± 0.091
LeuSer: 7.215 ± 0.098
LeuThr: 6.781 ± 0.151
LeuVal: 7.787 ± 0.069
LeuTrp: 1.261 ± 0.03
LeuTyr: 2.255 ± 0.041
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.374 ± 0.058
MetCys: 0.114 ± 0.008
MetAsp: 1.22 ± 0.028
MetGlu: 1.2 ± 0.03
MetPhe: 0.601 ± 0.021
MetGly: 1.859 ± 0.035
MetHis: 0.358 ± 0.015
MetIle: 1.026 ± 0.025
MetLys: 0.766 ± 0.023
MetLeu: 2.559 ± 0.047
MetMet: 0.604 ± 0.02
MetAsn: 0.615 ± 0.021
MetPro: 1.451 ± 0.028
MetGln: 0.816 ± 0.021
MetArg: 1.65 ± 0.032
MetSer: 1.396 ± 0.03
MetThr: 1.399 ± 0.026
MetVal: 1.801 ± 0.036
MetTrp: 0.175 ± 0.008
MetTyr: 0.26 ± 0.012
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.246 ± 0.063
AsnCys: 0.225 ± 0.012
AsnAsp: 1.51 ± 0.043
AsnGlu: 1.101 ± 0.028
AsnPhe: 0.944 ± 0.03
AsnGly: 2.723 ± 0.077
AsnHis: 0.53 ± 0.018
AsnIle: 1.44 ± 0.04
AsnLys: 0.707 ± 0.023
AsnLeu: 2.998 ± 0.056
AsnMet: 0.573 ± 0.019
AsnAsn: 0.872 ± 0.034
AsnPro: 1.904 ± 0.037
AsnGln: 0.926 ± 0.024
AsnArg: 1.893 ± 0.036
AsnSer: 1.451 ± 0.049
AsnThr: 1.269 ± 0.049
AsnVal: 1.947 ± 0.05
AsnTrp: 0.403 ± 0.016
AsnTyr: 0.782 ± 0.028
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.996 ± 0.107
ProCys: 0.353 ± 0.014
ProAsp: 4.001 ± 0.058
ProGlu: 2.943 ± 0.051
ProPhe: 1.949 ± 0.038
ProGly: 5.197 ± 0.073
ProHis: 1.062 ± 0.03
ProIle: 1.964 ± 0.036
ProLys: 1.373 ± 0.034
ProLeu: 5.683 ± 0.075
ProMet: 1.166 ± 0.029
ProAsn: 1.251 ± 0.03
ProPro: 3.55 ± 0.072
ProGln: 1.774 ± 0.035
ProArg: 2.841 ± 0.052
ProSer: 2.604 ± 0.046
ProThr: 2.774 ± 0.048
ProVal: 4.722 ± 0.061
ProTrp: 0.816 ± 0.023
ProTyr: 1.179 ± 0.029
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.135 ± 0.067
GlnCys: 0.191 ± 0.011
GlnAsp: 1.622 ± 0.033
GlnGlu: 1.684 ± 0.038
GlnPhe: 1.062 ± 0.028
GlnGly: 3.265 ± 0.046
GlnHis: 0.702 ± 0.02
GlnIle: 1.699 ± 0.039
GlnLys: 0.99 ± 0.024
GlnLeu: 3.207 ± 0.062
GlnMet: 0.904 ± 0.024
GlnAsn: 0.913 ± 0.026
GlnPro: 2.273 ± 0.042
GlnGln: 1.681 ± 0.035
GlnArg: 2.726 ± 0.049
GlnSer: 1.777 ± 0.036
GlnThr: 1.948 ± 0.038
GlnVal: 2.897 ± 0.046
GlnTrp: 0.46 ± 0.02
GlnTyr: 0.665 ± 0.02
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.791 ± 0.092
ArgCys: 0.472 ± 0.017
ArgAsp: 4.103 ± 0.065
ArgGlu: 3.514 ± 0.058
ArgPhe: 2.884 ± 0.049
ArgGly: 4.744 ± 0.053
ArgHis: 1.881 ± 0.039
ArgIle: 3.535 ± 0.056
ArgLys: 1.717 ± 0.034
ArgLeu: 9.16 ± 0.11
ArgMet: 1.657 ± 0.036
ArgAsn: 1.856 ± 0.039
ArgPro: 3.901 ± 0.067
ArgGln: 3.173 ± 0.053
ArgArg: 5.908 ± 0.085
ArgSer: 3.162 ± 0.043
ArgThr: 3.377 ± 0.046
ArgVal: 4.735 ± 0.054
ArgTrp: 1.134 ± 0.028
ArgTyr: 1.77 ± 0.035
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.08 ± 0.09
SerCys: 0.362 ± 0.014
SerAsp: 2.831 ± 0.043
SerGlu: 1.943 ± 0.035
SerPhe: 2.228 ± 0.039
SerGly: 5.438 ± 0.172
SerHis: 1.056 ± 0.026
SerIle: 2.69 ± 0.059
SerLys: 1.289 ± 0.033
SerLeu: 5.947 ± 0.084
SerMet: 1.114 ± 0.028
SerAsn: 1.539 ± 0.058
SerPro: 2.895 ± 0.048
SerGln: 1.749 ± 0.039
SerArg: 3.46 ± 0.056
SerSer: 2.731 ± 0.088
SerThr: 2.811 ± 0.076
SerVal: 3.78 ± 0.072
SerTrp: 0.734 ± 0.024
SerTyr: 1.357 ± 0.035
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.053 ± 0.107
ThrCys: 0.365 ± 0.016
ThrAsp: 3.111 ± 0.054
ThrGlu: 2.43 ± 0.044
ThrPhe: 1.616 ± 0.035
ThrGly: 5.914 ± 0.158
ThrHis: 0.922 ± 0.022
ThrIle: 2.867 ± 0.093
ThrLys: 1.365 ± 0.032
ThrLeu: 6.801 ± 0.19
ThrMet: 1.101 ± 0.028
ThrAsn: 1.431 ± 0.052
ThrPro: 3.423 ± 0.051
ThrGln: 1.719 ± 0.048
ThrArg: 3.211 ± 0.048
ThrSer: 2.804 ± 0.087
ThrThr: 2.774 ± 0.124
ThrVal: 4.416 ± 0.107
ThrTrp: 0.59 ± 0.018
ThrTyr: 1.342 ± 0.035
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.282 ± 0.087
ValCys: 0.521 ± 0.017
ValAsp: 4.093 ± 0.058
ValGlu: 4.485 ± 0.059
ValPhe: 1.986 ± 0.039
ValGly: 5.584 ± 0.069
ValHis: 1.144 ± 0.026
ValIle: 3.085 ± 0.046
ValLys: 2.225 ± 0.045
ValLeu: 7.371 ± 0.077
ValMet: 1.749 ± 0.036
ValAsn: 2.126 ± 0.051
ValPro: 4.088 ± 0.059
ValGln: 2.39 ± 0.036
ValArg: 4.829 ± 0.061
ValSer: 4.057 ± 0.08
ValThr: 5.106 ± 0.106
ValVal: 5.129 ± 0.065
ValTrp: 0.885 ± 0.024
ValTyr: 1.261 ± 0.026
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.351 ± 0.03
TrpCys: 0.137 ± 0.009
TrpAsp: 0.692 ± 0.021
TrpGlu: 0.561 ± 0.017
TrpPhe: 0.516 ± 0.02
TrpGly: 0.965 ± 0.023
TrpHis: 0.344 ± 0.013
TrpIle: 0.561 ± 0.018
TrpLys: 0.436 ± 0.016
TrpLeu: 1.798 ± 0.04
TrpMet: 0.337 ± 0.017
TrpAsn: 0.429 ± 0.016
TrpPro: 0.696 ± 0.022
TrpGln: 0.781 ± 0.025
TrpArg: 1.301 ± 0.032
TrpSer: 0.742 ± 0.021
TrpThr: 0.766 ± 0.023
TrpVal: 0.874 ± 0.022
TrpTrp: 0.234 ± 0.01
TrpTyr: 0.304 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.501 ± 0.037
TyrCys: 0.209 ± 0.012
TyrAsp: 1.454 ± 0.032
TyrGlu: 1.112 ± 0.028
TyrPhe: 0.886 ± 0.024
TyrGly: 2.169 ± 0.039
TyrHis: 0.491 ± 0.019
TyrIle: 0.907 ± 0.023
TyrLys: 0.653 ± 0.022
TyrLeu: 2.318 ± 0.037
TyrMet: 0.405 ± 0.014
TyrAsn: 0.664 ± 0.024
TyrPro: 1.054 ± 0.025
TyrGln: 0.887 ± 0.024
TyrArg: 1.915 ± 0.043
TyrSer: 1.183 ± 0.028
TyrThr: 1.119 ± 0.035
TyrVal: 1.435 ± 0.03
TyrTrp: 0.343 ± 0.015
TyrTyr: 0.619 ± 0.02
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5137 proteins (1820581 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
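A minimal sketch of a protein-level bootstrap is given below, assuming that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation across resamples. The page confirms neither detail, and the function names `pair_permille` and `bootstrap_error` are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_permille(sequences, pair):
    """Per mille frequency of one dipeptide over overlapping windows (assumed)."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Resample whole proteins with replacement; return (mean, std) in per mille."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(resample, pair))
    return mean(estimates), stdev(estimates)


# Toy input (not taken from the proteome): made-up one-letter sequences.
toy_proteins = ["AAGLA", "ALAAG", "GGLLA", "AAAAL"]
toy_mean, toy_err = bootstrap_error(toy_proteins, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread of the estimates reflects variation between proteins.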

The dipeptide statistics above (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski