Amino acid dipepetide frequency for Legionella tucsonensis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.113AlaAla: 6.113 ± 0.101
1.049AlaCys: 1.049 ± 0.035
3.195AlaAsp: 3.195 ± 0.061
4.115AlaGlu: 4.115 ± 0.074
3.122AlaPhe: 3.122 ± 0.068
4.752AlaGly: 4.752 ± 0.086
1.824AlaHis: 1.824 ± 0.048
5.818AlaIle: 5.818 ± 0.085
4.808AlaLys: 4.808 ± 0.071
9.0AlaLeu: 9.0 ± 0.121
1.966AlaMet: 1.966 ± 0.05
3.311AlaAsn: 3.311 ± 0.069
2.479AlaPro: 2.479 ± 0.05
3.53AlaGln: 3.53 ± 0.067
3.075AlaArg: 3.075 ± 0.062
4.535AlaSer: 4.535 ± 0.07
3.718AlaThr: 3.718 ± 0.063
4.744AlaVal: 4.744 ± 0.077
0.741AlaTrp: 0.741 ± 0.03
2.594AlaTyr: 2.594 ± 0.052
0.0AlaXaa: 0.0 ± 0.0
Cys
0.812CysAla: 0.812 ± 0.029
0.212CysCys: 0.212 ± 0.015
0.567CysAsp: 0.567 ± 0.027
0.594CysGlu: 0.594 ± 0.026
0.699CysPhe: 0.699 ± 0.029
0.856CysGly: 0.856 ± 0.031
0.322CysHis: 0.322 ± 0.019
0.884CysIle: 0.884 ± 0.03
0.616CysLys: 0.616 ± 0.028
1.274CysLeu: 1.274 ± 0.036
0.279CysMet: 0.279 ± 0.016
0.499CysAsn: 0.499 ± 0.026
0.475CysPro: 0.475 ± 0.022
0.518CysGln: 0.518 ± 0.022
0.492CysArg: 0.492 ± 0.024
0.812CysSer: 0.812 ± 0.028
0.611CysThr: 0.611 ± 0.025
0.693CysVal: 0.693 ± 0.027
0.154CysTrp: 0.154 ± 0.011
0.495CysTyr: 0.495 ± 0.023
0.0CysXaa: 0.0 ± 0.0
Asp
3.439AspAla: 3.439 ± 0.062
0.575AspCys: 0.575 ± 0.025
2.203AspAsp: 2.203 ± 0.054
3.461AspGlu: 3.461 ± 0.071
2.526AspPhe: 2.526 ± 0.058
2.505AspGly: 2.505 ± 0.056
0.988AspHis: 0.988 ± 0.029
3.411AspIle: 3.411 ± 0.068
3.345AspLys: 3.345 ± 0.063
5.251AspLeu: 5.251 ± 0.082
1.022AspMet: 1.022 ± 0.034
2.177AspAsn: 2.177 ± 0.039
2.005AspPro: 2.005 ± 0.053
1.55AspGln: 1.55 ± 0.04
1.822AspArg: 1.822 ± 0.043
2.838AspSer: 2.838 ± 0.061
2.303AspThr: 2.303 ± 0.052
3.021AspVal: 3.021 ± 0.058
0.661AspTrp: 0.661 ± 0.025
2.013AspTyr: 2.013 ± 0.047
0.001AspXaa: 0.001 ± 0.001
Glu
4.196GluAla: 4.196 ± 0.067
0.587GluCys: 0.587 ± 0.024
2.589GluAsp: 2.589 ± 0.057
4.453GluGlu: 4.453 ± 0.096
2.546GluPhe: 2.546 ± 0.057
2.997GluGly: 2.997 ± 0.056
1.788GluHis: 1.788 ± 0.052
4.609GluIle: 4.609 ± 0.076
4.905GluLys: 4.905 ± 0.084
6.87GluLeu: 6.87 ± 0.111
1.533GluMet: 1.533 ± 0.044
3.082GluAsn: 3.082 ± 0.059
1.778GluPro: 1.778 ± 0.043
3.583GluGln: 3.583 ± 0.077
2.935GluArg: 2.935 ± 0.072
3.385GluSer: 3.385 ± 0.057
2.969GluThr: 2.969 ± 0.061
3.501GluVal: 3.501 ± 0.063
0.62GluTrp: 0.62 ± 0.025
1.973GluTyr: 1.973 ± 0.052
0.0GluXaa: 0.0 ± 0.0
Phe
3.527PheAla: 3.527 ± 0.063
0.662PheCys: 0.662 ± 0.027
2.446PheAsp: 2.446 ± 0.058
2.213PheGlu: 2.213 ± 0.051
2.677PhePhe: 2.677 ± 0.065
2.605PheGly: 2.605 ± 0.059
1.076PheHis: 1.076 ± 0.037
3.843PheIle: 3.843 ± 0.072
2.617PheLys: 2.617 ± 0.063
4.904PheLeu: 4.904 ± 0.084
1.012PheMet: 1.012 ± 0.035
2.442PheAsn: 2.442 ± 0.055
1.795PhePro: 1.795 ± 0.046
1.548PheGln: 1.548 ± 0.038
1.528PheArg: 1.528 ± 0.041
3.66PheSer: 3.66 ± 0.072
2.558PheThr: 2.558 ± 0.053
2.526PheVal: 2.526 ± 0.055
0.576PheTrp: 0.576 ± 0.025
1.807PheTyr: 1.807 ± 0.042
0.0PheXaa: 0.0 ± 0.0
Gly
4.301GlyAla: 4.301 ± 0.091
0.791GlyCys: 0.791 ± 0.033
2.64GlyAsp: 2.64 ± 0.056
3.153GlyGlu: 3.153 ± 0.065
3.115GlyPhe: 3.115 ± 0.075
3.993GlyGly: 3.993 ± 0.088
1.472GlyHis: 1.472 ± 0.044
5.039GlyIle: 5.039 ± 0.077
3.819GlyLys: 3.819 ± 0.06
6.391GlyLeu: 6.391 ± 0.086
1.7GlyMet: 1.7 ± 0.046
2.402GlyAsn: 2.402 ± 0.052
1.653GlyPro: 1.653 ± 0.043
2.199GlyGln: 2.199 ± 0.057
2.45GlyArg: 2.45 ± 0.058
3.68GlySer: 3.68 ± 0.077
3.033GlyThr: 3.033 ± 0.059
4.213GlyVal: 4.213 ± 0.078
0.837GlyTrp: 0.837 ± 0.032
2.49GlyTyr: 2.49 ± 0.059
0.001GlyXaa: 0.001 ± 0.001
His
1.917HisAla: 1.917 ± 0.05
0.385HisCys: 0.385 ± 0.021
1.169HisAsp: 1.169 ± 0.035
1.564HisGlu: 1.564 ± 0.045
1.351HisPhe: 1.351 ± 0.036
1.625HisGly: 1.625 ± 0.047
0.925HisHis: 0.925 ± 0.035
1.691HisIle: 1.691 ± 0.041
1.182HisLys: 1.182 ± 0.034
2.866HisLeu: 2.866 ± 0.056
0.535HisMet: 0.535 ± 0.023
1.062HisAsn: 1.062 ± 0.032
1.308HisPro: 1.308 ± 0.04
1.368HisGln: 1.368 ± 0.039
1.075HisArg: 1.075 ± 0.038
1.626HisSer: 1.626 ± 0.041
1.195HisThr: 1.195 ± 0.037
1.408HisVal: 1.408 ± 0.042
0.39HisTrp: 0.39 ± 0.019
1.142HisTyr: 1.142 ± 0.036
0.0HisXaa: 0.0 ± 0.0
Ile
6.351IleAla: 6.351 ± 0.079
0.902IleCys: 0.902 ± 0.029
4.24IleAsp: 4.24 ± 0.064
4.871IleGlu: 4.871 ± 0.077
2.924IlePhe: 2.924 ± 0.076
4.641IleGly: 4.641 ± 0.092
1.895IleHis: 1.895 ± 0.043
5.662IleIle: 5.662 ± 0.094
5.089IleLys: 5.089 ± 0.087
7.391IleLeu: 7.391 ± 0.094
1.57IleMet: 1.57 ± 0.038
4.239IleAsn: 4.239 ± 0.073
3.523IlePro: 3.523 ± 0.068
2.979IleGln: 2.979 ± 0.06
3.008IleArg: 3.008 ± 0.05
5.209IleSer: 5.209 ± 0.074
4.26IleThr: 4.26 ± 0.071
4.084IleVal: 4.084 ± 0.071
0.662IleTrp: 0.662 ± 0.027
2.37IleTyr: 2.37 ± 0.047
0.0IleXaa: 0.0 ± 0.0
Lys
4.533LysAla: 4.533 ± 0.076
0.414LysCys: 0.414 ± 0.025
2.992LysAsp: 2.992 ± 0.076
5.011LysGlu: 5.011 ± 0.101
1.887LysPhe: 1.887 ± 0.05
3.226LysGly: 3.226 ± 0.063
1.521LysHis: 1.521 ± 0.043
5.152LysIle: 5.152 ± 0.076
5.64LysLys: 5.64 ± 0.096
6.257LysLeu: 6.257 ± 0.092
1.605LysMet: 1.605 ± 0.039
4.017LysAsn: 4.017 ± 0.079
2.786LysPro: 2.786 ± 0.056
3.322LysGln: 3.322 ± 0.066
2.839LysArg: 2.839 ± 0.052
3.895LysSer: 3.895 ± 0.074
3.662LysThr: 3.662 ± 0.06
3.384LysVal: 3.384 ± 0.061
0.602LysTrp: 0.602 ± 0.026
1.856LysTyr: 1.856 ± 0.052
0.0LysXaa: 0.0 ± 0.0
Leu
8.762LeuAla: 8.762 ± 0.118
1.365LeuCys: 1.365 ± 0.035
5.133LeuAsp: 5.133 ± 0.081
6.047LeuGlu: 6.047 ± 0.099
5.412LeuPhe: 5.412 ± 0.085
6.552LeuGly: 6.552 ± 0.108
2.692LeuHis: 2.692 ± 0.064
8.583LeuIle: 8.583 ± 0.119
6.943LeuLys: 6.943 ± 0.091
11.852LeuLeu: 11.852 ± 0.163
2.646LeuMet: 2.646 ± 0.053
5.596LeuAsn: 5.596 ± 0.084
4.893LeuPro: 4.893 ± 0.072
4.468LeuGln: 4.468 ± 0.078
4.401LeuArg: 4.401 ± 0.076
8.218LeuSer: 8.218 ± 0.109
6.045LeuThr: 6.045 ± 0.072
5.984LeuVal: 5.984 ± 0.085
1.034LeuTrp: 1.034 ± 0.036
3.242LeuTyr: 3.242 ± 0.068
0.0LeuXaa: 0.0 ± 0.0
Met
1.941MetAla: 1.941 ± 0.047
0.185MetCys: 0.185 ± 0.014
1.166MetAsp: 1.166 ± 0.036
1.328MetGlu: 1.328 ± 0.036
0.8MetPhe: 0.8 ± 0.031
1.492MetGly: 1.492 ± 0.04
0.659MetHis: 0.659 ± 0.026
1.679MetIle: 1.679 ± 0.039
1.602MetLys: 1.602 ± 0.047
2.539MetLeu: 2.539 ± 0.055
0.706MetMet: 0.706 ± 0.028
1.283MetAsn: 1.283 ± 0.04
1.122MetPro: 1.122 ± 0.038
1.158MetGln: 1.158 ± 0.034
1.176MetArg: 1.176 ± 0.035
1.693MetSer: 1.693 ± 0.043
1.355MetThr: 1.355 ± 0.041
1.404MetVal: 1.404 ± 0.04
0.161MetTrp: 0.161 ± 0.014
0.57MetTyr: 0.57 ± 0.027
0.0MetXaa: 0.0 ± 0.0
Asn
3.351AsnAla: 3.351 ± 0.063
0.55AsnCys: 0.55 ± 0.027
2.224AsnAsp: 2.224 ± 0.046
3.08AsnGlu: 3.08 ± 0.052
2.031AsnPhe: 2.031 ± 0.044
2.511AsnGly: 2.511 ± 0.067
1.386AsnHis: 1.386 ± 0.043
3.393AsnIle: 3.393 ± 0.055
3.355AsnLys: 3.355 ± 0.06
4.963AsnLeu: 4.963 ± 0.074
1.041AsnMet: 1.041 ± 0.034
2.646AsnAsn: 2.646 ± 0.066
2.687AsnPro: 2.687 ± 0.053
2.715AsnGln: 2.715 ± 0.056
1.993AsnArg: 1.993 ± 0.046
3.155AsnSer: 3.155 ± 0.064
2.694AsnThr: 2.694 ± 0.058
2.425AsnVal: 2.425 ± 0.056
0.636AsnTrp: 0.636 ± 0.027
2.005AsnTyr: 2.005 ± 0.049
0.0AsnXaa: 0.0 ± 0.0
Pro
2.716ProAla: 2.716 ± 0.055
0.417ProCys: 0.417 ± 0.022
2.201ProAsp: 2.201 ± 0.049
3.191ProGlu: 3.191 ± 0.064
1.99ProPhe: 1.99 ± 0.05
2.562ProGly: 2.562 ± 0.056
0.996ProHis: 0.996 ± 0.034
2.904ProIle: 2.904 ± 0.062
2.434ProLys: 2.434 ± 0.046
4.338ProLeu: 4.338 ± 0.067
0.933ProMet: 0.933 ± 0.035
1.917ProAsn: 1.917 ± 0.045
1.526ProPro: 1.526 ± 0.045
1.858ProGln: 1.858 ± 0.04
1.36ProArg: 1.36 ± 0.034
2.577ProSer: 2.577 ± 0.05
2.119ProThr: 2.119 ± 0.049
2.954ProVal: 2.954 ± 0.066
0.45ProTrp: 0.45 ± 0.022
1.454ProTyr: 1.454 ± 0.036
0.0ProXaa: 0.0 ± 0.0
Gln
3.525GlnAla: 3.525 ± 0.077
0.513GlnCys: 0.513 ± 0.026
1.813GlnAsp: 1.813 ± 0.036
2.832GlnGlu: 2.832 ± 0.058
2.118GlnPhe: 2.118 ± 0.05
2.539GlnGly: 2.539 ± 0.056
1.238GlnHis: 1.238 ± 0.04
3.45GlnIle: 3.45 ± 0.062
3.135GlnLys: 3.135 ± 0.069
5.318GlnLeu: 5.318 ± 0.085
1.056GlnMet: 1.056 ± 0.035
2.186GlnAsn: 2.186 ± 0.053
1.51GlnPro: 1.51 ± 0.038
2.672GlnGln: 2.672 ± 0.071
1.959GlnArg: 1.959 ± 0.049
2.754GlnSer: 2.754 ± 0.062
2.242GlnThr: 2.242 ± 0.051
2.544GlnVal: 2.544 ± 0.053
0.587GlnTrp: 0.587 ± 0.027
1.472GlnTyr: 1.472 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
2.974ArgAla: 2.974 ± 0.054
0.465ArgCys: 0.465 ± 0.024
2.039ArgAsp: 2.039 ± 0.043
2.666ArgGlu: 2.666 ± 0.065
2.137ArgPhe: 2.137 ± 0.056
2.246ArgGly: 2.246 ± 0.051
1.098ArgHis: 1.098 ± 0.034
3.172ArgIle: 3.172 ± 0.06
2.626ArgLys: 2.626 ± 0.055
4.687ArgLeu: 4.687 ± 0.076
1.008ArgMet: 1.008 ± 0.032
1.881ArgAsn: 1.881 ± 0.045
1.397ArgPro: 1.397 ± 0.035
1.776ArgGln: 1.776 ± 0.044
1.873ArgArg: 1.873 ± 0.05
2.366ArgSer: 2.366 ± 0.055
1.955ArgThr: 1.955 ± 0.045
2.614ArgVal: 2.614 ± 0.055
0.546ArgTrp: 0.546 ± 0.024
1.668ArgTyr: 1.668 ± 0.039
0.0ArgXaa: 0.0 ± 0.0
Ser
4.429SerAla: 4.429 ± 0.073
0.797SerCys: 0.797 ± 0.035
2.895SerAsp: 2.895 ± 0.061
3.589SerGlu: 3.589 ± 0.067
3.211SerPhe: 3.211 ± 0.065
4.351SerGly: 4.351 ± 0.074
1.659SerHis: 1.659 ± 0.037
4.942SerIle: 4.942 ± 0.084
3.824SerLys: 3.824 ± 0.067
7.733SerLeu: 7.733 ± 0.122
1.754SerMet: 1.754 ± 0.042
2.945SerAsn: 2.945 ± 0.06
2.767SerPro: 2.767 ± 0.064
2.897SerGln: 2.897 ± 0.063
2.62SerArg: 2.62 ± 0.053
4.731SerSer: 4.731 ± 0.099
3.498SerThr: 3.498 ± 0.064
3.711SerVal: 3.711 ± 0.056
0.833SerTrp: 0.833 ± 0.032
2.329SerTyr: 2.329 ± 0.06
0.0SerXaa: 0.0 ± 0.0
Thr
3.975ThrAla: 3.975 ± 0.077
0.587ThrCys: 0.587 ± 0.026
2.347ThrAsp: 2.347 ± 0.046
2.901ThrGlu: 2.901 ± 0.058
2.115ThrPhe: 2.115 ± 0.047
3.53ThrGly: 3.53 ± 0.066
1.524ThrHis: 1.524 ± 0.041
3.87ThrIle: 3.87 ± 0.073
2.758ThrLys: 2.758 ± 0.051
6.331ThrLeu: 6.331 ± 0.097
1.129ThrMet: 1.129 ± 0.036
2.248ThrAsn: 2.248 ± 0.047
2.779ThrPro: 2.779 ± 0.054
2.559ThrGln: 2.559 ± 0.053
2.145ThrArg: 2.145 ± 0.048
3.287ThrSer: 3.287 ± 0.062
2.941ThrThr: 2.941 ± 0.069
3.381ThrVal: 3.381 ± 0.058
0.536ThrTrp: 0.536 ± 0.022
1.789ThrTyr: 1.789 ± 0.045
0.0ThrXaa: 0.0 ± 0.0
Val
4.53ValAla: 4.53 ± 0.077
0.752ValCys: 0.752 ± 0.03
3.125ValAsp: 3.125 ± 0.062
3.314ValGlu: 3.314 ± 0.063
2.839ValPhe: 2.839 ± 0.057
3.601ValGly: 3.601 ± 0.076
1.428ValHis: 1.428 ± 0.046
4.766ValIle: 4.766 ± 0.075
3.392ValLys: 3.392 ± 0.061
6.333ValLeu: 6.333 ± 0.081
1.568ValMet: 1.568 ± 0.042
2.942ValAsn: 2.942 ± 0.059
2.341ValPro: 2.341 ± 0.045
2.063ValGln: 2.063 ± 0.05
2.32ValArg: 2.32 ± 0.054
4.06ValSer: 4.06 ± 0.076
3.3ValThr: 3.3 ± 0.053
4.065ValVal: 4.065 ± 0.092
0.561ValTrp: 0.561 ± 0.028
1.864ValTyr: 1.864 ± 0.043
0.001ValXaa: 0.001 ± 0.001
Trp
0.642TrpAla: 0.642 ± 0.024
0.13TrpCys: 0.13 ± 0.012
0.493TrpAsp: 0.493 ± 0.022
0.574TrpGlu: 0.574 ± 0.024
0.614TrpPhe: 0.614 ± 0.028
0.727TrpGly: 0.727 ± 0.028
0.346TrpHis: 0.346 ± 0.02
0.83TrpIle: 0.83 ± 0.027
0.575TrpLys: 0.575 ± 0.027
1.421TrpLeu: 1.421 ± 0.044
0.305TrpMet: 0.305 ± 0.02
0.515TrpAsn: 0.515 ± 0.024
0.435TrpPro: 0.435 ± 0.021
0.665TrpGln: 0.665 ± 0.027
0.538TrpArg: 0.538 ± 0.023
0.73TrpSer: 0.73 ± 0.033
0.484TrpThr: 0.484 ± 0.025
0.719TrpVal: 0.719 ± 0.026
0.177TrpTrp: 0.177 ± 0.014
0.37TrpTyr: 0.37 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.457TyrAla: 2.457 ± 0.049
0.525TyrCys: 0.525 ± 0.022
1.641TyrAsp: 1.641 ± 0.045
1.942TyrGlu: 1.942 ± 0.047
1.909TyrPhe: 1.909 ± 0.054
2.061TyrGly: 2.061 ± 0.05
0.944TyrHis: 0.944 ± 0.033
2.172TyrIle: 2.172 ± 0.051
1.934TyrLys: 1.934 ± 0.051
4.149TyrLeu: 4.149 ± 0.075
0.651TyrMet: 0.651 ± 0.025
1.451TyrAsn: 1.451 ± 0.043
1.639TyrPro: 1.639 ± 0.046
2.036TyrGln: 2.036 ± 0.041
1.586TyrArg: 1.586 ± 0.042
2.304TyrSer: 2.304 ± 0.054
1.788TyrThr: 1.788 ± 0.044
1.791TyrVal: 1.791 ± 0.045
0.523TyrTrp: 0.523 ± 0.026
1.473TyrTyr: 1.473 ± 0.048
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.001XaaCys: 0.001 ± 0.001
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.001
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.001XaaVal: 0.001 ± 0.001
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.009XaaXaa: 0.009 ± 0.007
Statistics based on 2936 proteins (972462 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski