Amino acid dipeptide frequency for bacterium D16-59

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
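A minimal sketch of how such frequencies could be computed is given below. It assumes that overlapping pairs of adjacent residues are counted within each protein and pooled over the whole proteome; this page does not confirm that Proteome-pI uses exactly this procedure. The function name dipeptide_frequencies and the toy sequences are hypothetical, and one-letter amino acid codes are used for brevity.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs: a protein of length n yields n - 1 dipeptides.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up sequences, for illustration only.
toy_proteome = ["MKKLA", "MKA"]
freqs = dipeptide_frequencies(toy_proteome)
```

Pairs are never counted across protein boundaries here, so the last residue of one protein and the first residue of the next do not form a dipeptide.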

There are 400 possibilities (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
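The expected value and the over/under-representation labels can be sketched as follows. The page does not state the exact rule behind the colouring, so the plain comparison against the uniform expectation is an assumption; the function name classify is hypothetical, and the two example values are taken from the table (LeuLeu and TrpTrp).

```python
N_STANDARD = 20                      # standard amino acids
N_DIPEPTIDES = N_STANDARD ** 2       # 400 ordered pairs

# Uniform expectation: 1/400 = 0.25% = 2.5 per mille.
expected_per_mille = 1000 / N_DIPEPTIDES


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


examples = {"LeuLeu": 8.792, "TrpTrp": 0.13}   # values in per mille from the table
labels = {pair: classify(value) for pair, value in examples.items()}
```

A stricter rule would also take the bootstrap error into account before calling a dipeptide over- or underrepresented.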

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.075AlaAla: 7.075 ± 0.089
1.032AlaCys: 1.032 ± 0.024
4.254AlaAsp: 4.254 ± 0.055
5.69AlaGlu: 5.69 ± 0.074
3.059AlaPhe: 3.059 ± 0.046
6.166AlaGly: 6.166 ± 0.082
0.959AlaHis: 0.959 ± 0.026
4.36AlaIle: 4.36 ± 0.065
5.295AlaLys: 5.295 ± 0.058
6.224AlaLeu: 6.224 ± 0.066
2.047AlaMet: 2.047 ± 0.042
2.451AlaAsn: 2.451 ± 0.042
1.7AlaPro: 1.7 ± 0.038
2.131AlaGln: 2.131 ± 0.039
2.645AlaArg: 2.645 ± 0.04
3.931AlaSer: 3.931 ± 0.061
2.552AlaThr: 2.552 ± 0.048
6.389AlaVal: 6.389 ± 0.076
0.609AlaTrp: 0.609 ± 0.023
2.764AlaTyr: 2.764 ± 0.043
0.0AlaXaa: 0.0 ± 0.0
Cys
0.949CysAla: 0.949 ± 0.024
0.333CysCys: 0.333 ± 0.015
0.831CysAsp: 0.831 ± 0.022
0.865CysGlu: 0.865 ± 0.025
0.759CysPhe: 0.759 ± 0.022
1.336CysGly: 1.336 ± 0.035
0.331CysHis: 0.331 ± 0.014
1.267CysIle: 1.267 ± 0.031
1.01CysLys: 1.01 ± 0.029
1.329CysLeu: 1.329 ± 0.032
0.537CysMet: 0.537 ± 0.02
0.69CysAsn: 0.69 ± 0.019
0.627CysPro: 0.627 ± 0.023
0.48CysGln: 0.48 ± 0.018
0.874CysArg: 0.874 ± 0.025
1.06CysSer: 1.06 ± 0.029
0.753CysThr: 0.753 ± 0.023
0.958CysVal: 0.958 ± 0.023
0.132CysTrp: 0.132 ± 0.011
0.693CysTyr: 0.693 ± 0.021
0.0CysXaa: 0.0 ± 0.0
Asp
3.937AspAla: 3.937 ± 0.055
0.882AspCys: 0.882 ± 0.025
2.702AspAsp: 2.702 ± 0.064
4.035AspGlu: 4.035 ± 0.054
2.803AspPhe: 2.803 ± 0.047
4.5AspGly: 4.5 ± 0.063
0.605AspHis: 0.605 ± 0.02
4.81AspIle: 4.81 ± 0.061
3.983AspLys: 3.983 ± 0.053
4.107AspLeu: 4.107 ± 0.052
1.841AspMet: 1.841 ± 0.038
2.462AspAsn: 2.462 ± 0.038
1.113AspPro: 1.113 ± 0.029
0.931AspGln: 0.931 ± 0.025
2.323AspArg: 2.323 ± 0.047
3.45AspSer: 3.45 ± 0.058
3.042AspThr: 3.042 ± 0.046
3.439AspVal: 3.439 ± 0.057
0.609AspTrp: 0.609 ± 0.021
2.883AspTyr: 2.883 ± 0.044
0.0AspXaa: 0.0 ± 0.0
Glu
5.259GluAla: 5.259 ± 0.065
0.949GluCys: 0.949 ± 0.03
4.21GluAsp: 4.21 ± 0.062
7.812GluGlu: 7.812 ± 0.113
2.698GluPhe: 2.698 ± 0.041
4.627GluGly: 4.627 ± 0.053
1.305GluHis: 1.305 ± 0.032
6.009GluIle: 6.009 ± 0.077
7.771GluLys: 7.771 ± 0.085
6.724GluLeu: 6.724 ± 0.072
2.495GluMet: 2.495 ± 0.041
4.516GluAsn: 4.516 ± 0.063
1.84GluPro: 1.84 ± 0.039
3.202GluGln: 3.202 ± 0.061
3.71GluArg: 3.71 ± 0.057
3.488GluSer: 3.488 ± 0.049
3.728GluThr: 3.728 ± 0.057
4.4GluVal: 4.4 ± 0.055
0.794GluTrp: 0.794 ± 0.026
3.683GluTyr: 3.683 ± 0.056
0.0GluXaa: 0.0 ± 0.0
Phe
2.895PheAla: 2.895 ± 0.049
0.982PheCys: 0.982 ± 0.027
2.488PheAsp: 2.488 ± 0.045
2.717PheGlu: 2.717 ± 0.041
2.056PhePhe: 2.056 ± 0.046
2.929PheGly: 2.929 ± 0.053
0.891PheHis: 0.891 ± 0.023
2.916PheIle: 2.916 ± 0.049
2.101PheLys: 2.101 ± 0.04
4.333PheLeu: 4.333 ± 0.065
1.158PheMet: 1.158 ± 0.032
1.552PheAsn: 1.552 ± 0.033
1.387PhePro: 1.387 ± 0.033
1.547PheGln: 1.547 ± 0.034
1.827PheArg: 1.827 ± 0.036
3.238PheSer: 3.238 ± 0.056
2.311PheThr: 2.311 ± 0.042
2.756PheVal: 2.756 ± 0.046
0.496PheTrp: 0.496 ± 0.021
1.998PheTyr: 1.998 ± 0.04
0.0PheXaa: 0.0 ± 0.0
Gly
4.182GlyAla: 4.182 ± 0.059
1.242GlyCys: 1.242 ± 0.028
3.328GlyAsp: 3.328 ± 0.049
4.84GlyGlu: 4.84 ± 0.063
3.022GlyPhe: 3.022 ± 0.051
4.3GlyGly: 4.3 ± 0.061
1.116GlyHis: 1.116 ± 0.028
5.997GlyIle: 5.997 ± 0.064
6.559GlyLys: 6.559 ± 0.071
5.291GlyLeu: 5.291 ± 0.064
2.454GlyMet: 2.454 ± 0.047
3.416GlyAsn: 3.416 ± 0.047
0.806GlyPro: 0.806 ± 0.026
2.312GlyGln: 2.312 ± 0.045
3.351GlyArg: 3.351 ± 0.05
4.035GlySer: 4.035 ± 0.054
3.831GlyThr: 3.831 ± 0.049
4.336GlyVal: 4.336 ± 0.063
0.747GlyTrp: 0.747 ± 0.024
3.304GlyTyr: 3.304 ± 0.045
0.0GlyXaa: 0.0 ± 0.0
His
0.99HisAla: 0.99 ± 0.026
0.319HisCys: 0.319 ± 0.014
0.817HisAsp: 0.817 ± 0.022
0.989HisGlu: 0.989 ± 0.026
0.883HisPhe: 0.883 ± 0.024
1.186HisGly: 1.186 ± 0.028
0.372HisHis: 0.372 ± 0.019
1.439HisIle: 1.439 ± 0.031
1.047HisLys: 1.047 ± 0.027
1.421HisLeu: 1.421 ± 0.03
0.506HisMet: 0.506 ± 0.02
0.787HisAsn: 0.787 ± 0.024
0.76HisPro: 0.76 ± 0.023
0.525HisGln: 0.525 ± 0.018
0.757HisArg: 0.757 ± 0.025
1.0HisSer: 1.0 ± 0.024
0.865HisThr: 0.865 ± 0.019
0.913HisVal: 0.913 ± 0.025
0.179HisTrp: 0.179 ± 0.012
0.816HisTyr: 0.816 ± 0.025
0.0HisXaa: 0.0 ± 0.0
Ile
5.242IleAla: 5.242 ± 0.068
1.409IleCys: 1.409 ± 0.031
3.945IleAsp: 3.945 ± 0.054
4.967IleGlu: 4.967 ± 0.069
3.038IlePhe: 3.038 ± 0.057
4.748IleGly: 4.748 ± 0.068
1.329IleHis: 1.329 ± 0.027
5.007IleIle: 5.007 ± 0.071
4.608IleLys: 4.608 ± 0.063
6.901IleLeu: 6.901 ± 0.093
1.975IleMet: 1.975 ± 0.038
2.983IleAsn: 2.983 ± 0.054
2.897IlePro: 2.897 ± 0.048
2.522IleGln: 2.522 ± 0.045
3.544IleArg: 3.544 ± 0.054
5.24IleSer: 5.24 ± 0.06
4.125IleThr: 4.125 ± 0.057
4.566IleVal: 4.566 ± 0.063
0.701IleTrp: 0.701 ± 0.022
2.937IleTyr: 2.937 ± 0.047
0.0IleXaa: 0.0 ± 0.0
Lys
5.331LysAla: 5.331 ± 0.069
0.853LysCys: 0.853 ± 0.027
4.109LysAsp: 4.109 ± 0.064
8.048LysGlu: 8.048 ± 0.086
2.113LysPhe: 2.113 ± 0.04
4.666LysGly: 4.666 ± 0.054
1.1LysHis: 1.1 ± 0.027
5.58LysIle: 5.58 ± 0.06
8.02LysLys: 8.02 ± 0.112
5.937LysLeu: 5.937 ± 0.067
2.334LysMet: 2.334 ± 0.041
4.19LysAsn: 4.19 ± 0.059
2.084LysPro: 2.084 ± 0.043
2.873LysGln: 2.873 ± 0.042
3.677LysArg: 3.677 ± 0.055
3.947LysSer: 3.947 ± 0.05
4.023LysThr: 4.023 ± 0.063
4.394LysVal: 4.394 ± 0.066
0.71LysTrp: 0.71 ± 0.021
3.376LysTyr: 3.376 ± 0.05
0.0LysXaa: 0.0 ± 0.0
Leu
6.445LeuAla: 6.445 ± 0.078
1.648LeuCys: 1.648 ± 0.033
4.738LeuAsp: 4.738 ± 0.053
6.811LeuGlu: 6.811 ± 0.087
4.189LeuPhe: 4.189 ± 0.059
5.383LeuGly: 5.383 ± 0.064
1.562LeuHis: 1.562 ± 0.033
5.145LeuIle: 5.145 ± 0.073
6.44LeuLys: 6.44 ± 0.07
8.792LeuLeu: 8.792 ± 0.101
2.35LeuMet: 2.35 ± 0.046
3.73LeuAsn: 3.73 ± 0.053
3.332LeuPro: 3.332 ± 0.051
3.221LeuGln: 3.221 ± 0.053
3.593LeuArg: 3.593 ± 0.054
6.573LeuSer: 6.573 ± 0.073
4.437LeuThr: 4.437 ± 0.053
5.053LeuVal: 5.053 ± 0.058
0.814LeuTrp: 0.814 ± 0.025
3.622LeuTyr: 3.622 ± 0.052
0.0LeuXaa: 0.0 ± 0.0
Met
2.481MetAla: 2.481 ± 0.04
0.338MetCys: 0.338 ± 0.015
1.885MetAsp: 1.885 ± 0.035
2.918MetGlu: 2.918 ± 0.046
0.989MetPhe: 0.989 ± 0.028
1.991MetGly: 1.991 ± 0.04
0.429MetHis: 0.429 ± 0.016
1.814MetIle: 1.814 ± 0.037
2.485MetLys: 2.485 ± 0.042
2.71MetLeu: 2.71 ± 0.047
0.851MetMet: 0.851 ± 0.023
1.367MetAsn: 1.367 ± 0.034
1.105MetPro: 1.105 ± 0.028
1.269MetGln: 1.269 ± 0.03
1.25MetArg: 1.25 ± 0.028
1.519MetSer: 1.519 ± 0.034
1.453MetThr: 1.453 ± 0.034
1.914MetVal: 1.914 ± 0.039
0.203MetTrp: 0.203 ± 0.013
0.929MetTyr: 0.929 ± 0.027
0.0MetXaa: 0.0 ± 0.0
Asn
3.243AsnAla: 3.243 ± 0.048
0.714AsnCys: 0.714 ± 0.026
2.13AsnAsp: 2.13 ± 0.037
2.795AsnGlu: 2.795 ± 0.045
1.72AsnPhe: 1.72 ± 0.039
3.663AsnGly: 3.663 ± 0.058
0.816AsnHis: 0.816 ± 0.026
3.789AsnIle: 3.789 ± 0.057
2.972AsnLys: 2.972 ± 0.044
3.915AsnLeu: 3.915 ± 0.052
1.404AsnMet: 1.404 ± 0.029
2.067AsnAsn: 2.067 ± 0.042
1.883AsnPro: 1.883 ± 0.04
1.627AsnGln: 1.627 ± 0.037
2.18AsnArg: 2.18 ± 0.039
2.532AsnSer: 2.532 ± 0.045
2.42AsnThr: 2.42 ± 0.045
2.835AsnVal: 2.835 ± 0.042
0.421AsnTrp: 0.421 ± 0.019
2.001AsnTyr: 2.001 ± 0.031
0.0AsnXaa: 0.0 ± 0.0
Pro
2.291ProAla: 2.291 ± 0.048
0.425ProCys: 0.425 ± 0.018
2.1ProAsp: 2.1 ± 0.036
2.997ProGlu: 2.997 ± 0.055
1.471ProPhe: 1.471 ± 0.03
1.918ProGly: 1.918 ± 0.038
0.46ProHis: 0.46 ± 0.018
1.765ProIle: 1.765 ± 0.038
2.049ProLys: 2.049 ± 0.04
2.407ProLeu: 2.407 ± 0.039
0.744ProMet: 0.744 ± 0.023
1.089ProAsn: 1.089 ± 0.026
0.738ProPro: 0.738 ± 0.024
0.983ProGln: 0.983 ± 0.028
0.879ProArg: 0.879 ± 0.027
1.734ProSer: 1.734 ± 0.041
1.32ProThr: 1.32 ± 0.038
2.694ProVal: 2.694 ± 0.042
0.28ProTrp: 0.28 ± 0.014
1.432ProTyr: 1.432 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
2.548GlnAla: 2.548 ± 0.047
0.402GlnCys: 0.402 ± 0.015
1.72GlnAsp: 1.72 ± 0.038
3.357GlnGlu: 3.357 ± 0.064
1.265GlnPhe: 1.265 ± 0.032
2.112GlnGly: 2.112 ± 0.039
0.461GlnHis: 0.461 ± 0.018
2.659GlnIle: 2.659 ± 0.045
3.443GlnLys: 3.443 ± 0.055
2.694GlnLeu: 2.694 ± 0.043
1.222GlnMet: 1.222 ± 0.03
1.738GlnAsn: 1.738 ± 0.035
0.933GlnPro: 0.933 ± 0.027
1.28GlnGln: 1.28 ± 0.032
1.546GlnArg: 1.546 ± 0.028
1.795GlnSer: 1.795 ± 0.035
1.676GlnThr: 1.676 ± 0.037
1.957GlnVal: 1.957 ± 0.037
0.304GlnTrp: 0.304 ± 0.014
1.58GlnTyr: 1.58 ± 0.033
0.0GlnXaa: 0.0 ± 0.0
Arg
2.555ArgAla: 2.555 ± 0.044
0.627ArgCys: 0.627 ± 0.024
2.161ArgAsp: 2.161 ± 0.042
4.107ArgGlu: 4.107 ± 0.067
1.932ArgPhe: 1.932 ± 0.039
2.325ArgGly: 2.325 ± 0.045
0.826ArgHis: 0.826 ± 0.022
3.464ArgIle: 3.464 ± 0.055
4.078ArgLys: 4.078 ± 0.055
3.986ArgLeu: 3.986 ± 0.051
1.553ArgMet: 1.553 ± 0.032
2.224ArgAsn: 2.224 ± 0.039
1.171ArgPro: 1.171 ± 0.029
1.915ArgGln: 1.915 ± 0.039
2.342ArgArg: 2.342 ± 0.047
2.078ArgSer: 2.078 ± 0.037
1.99ArgThr: 1.99 ± 0.037
2.538ArgVal: 2.538 ± 0.044
0.374ArgTrp: 0.374 ± 0.018
2.086ArgTyr: 2.086 ± 0.043
0.0ArgXaa: 0.0 ± 0.0
Ser
4.272SerAla: 4.272 ± 0.058
0.909SerCys: 0.909 ± 0.026
3.464SerAsp: 3.464 ± 0.059
4.17SerGlu: 4.17 ± 0.062
2.928SerPhe: 2.928 ± 0.047
4.996SerGly: 4.996 ± 0.06
1.04SerHis: 1.04 ± 0.026
4.392SerIle: 4.392 ± 0.054
3.886SerLys: 3.886 ± 0.055
5.392SerLeu: 5.392 ± 0.068
1.75SerMet: 1.75 ± 0.039
2.501SerAsn: 2.501 ± 0.052
1.699SerPro: 1.699 ± 0.035
1.909SerGln: 1.909 ± 0.038
2.721SerArg: 2.721 ± 0.052
3.901SerSer: 3.901 ± 0.072
2.624SerThr: 2.624 ± 0.048
4.668SerVal: 4.668 ± 0.067
0.566SerTrp: 0.566 ± 0.022
2.689SerTyr: 2.689 ± 0.047
0.0SerXaa: 0.0 ± 0.0
Thr
4.248ThrAla: 4.248 ± 0.065
0.609ThrCys: 0.609 ± 0.022
2.995ThrAsp: 2.995 ± 0.054
4.094ThrGlu: 4.094 ± 0.058
1.977ThrPhe: 1.977 ± 0.035
4.189ThrGly: 4.189 ± 0.057
0.722ThrHis: 0.722 ± 0.02
3.535ThrIle: 3.535 ± 0.059
3.331ThrLys: 3.331 ± 0.05
4.293ThrLeu: 4.293 ± 0.057
1.28ThrMet: 1.28 ± 0.029
1.943ThrAsn: 1.943 ± 0.044
1.862ThrPro: 1.862 ± 0.046
1.42ThrGln: 1.42 ± 0.032
1.696ThrArg: 1.696 ± 0.037
2.819ThrSer: 2.819 ± 0.052
2.428ThrThr: 2.428 ± 0.047
4.303ThrVal: 4.303 ± 0.065
0.45ThrTrp: 0.45 ± 0.016
1.979ThrTyr: 1.979 ± 0.036
0.0ThrXaa: 0.0 ± 0.0
Val
4.078ValAla: 4.078 ± 0.052
1.168ValCys: 1.168 ± 0.029
3.421ValAsp: 3.421 ± 0.052
4.173ValGlu: 4.173 ± 0.056
3.222ValPhe: 3.222 ± 0.058
3.587ValGly: 3.587 ± 0.059
1.059ValHis: 1.059 ± 0.025
4.889ValIle: 4.889 ± 0.062
4.739ValLys: 4.739 ± 0.062
6.411ValLeu: 6.411 ± 0.07
1.94ValMet: 1.94 ± 0.038
2.826ValAsn: 2.826 ± 0.042
2.275ValPro: 2.275 ± 0.042
2.191ValGln: 2.191 ± 0.048
2.782ValArg: 2.782 ± 0.047
4.904ValSer: 4.904 ± 0.073
3.955ValThr: 3.955 ± 0.064
4.271ValVal: 4.271 ± 0.058
0.63ValTrp: 0.63 ± 0.025
2.77ValTyr: 2.77 ± 0.05
0.0ValXaa: 0.0 ± 0.0
Trp
0.536TrpAla: 0.536 ± 0.021
0.161TrpCys: 0.161 ± 0.01
0.563TrpAsp: 0.563 ± 0.02
0.842TrpGlu: 0.842 ± 0.027
0.398TrpPhe: 0.398 ± 0.017
0.701TrpGly: 0.701 ± 0.024
0.209TrpHis: 0.209 ± 0.014
0.658TrpIle: 0.658 ± 0.023
0.866TrpLys: 0.866 ± 0.025
0.915TrpLeu: 0.915 ± 0.026
0.276TrpMet: 0.276 ± 0.014
0.56TrpAsn: 0.56 ± 0.022
0.144TrpPro: 0.144 ± 0.01
0.413TrpGln: 0.413 ± 0.017
0.41TrpArg: 0.41 ± 0.017
0.502TrpSer: 0.502 ± 0.017
0.384TrpThr: 0.384 ± 0.016
0.486TrpVal: 0.486 ± 0.02
0.13TrpTrp: 0.13 ± 0.01
0.449TrpTyr: 0.449 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.775TyrAla: 2.775 ± 0.047
0.812TyrCys: 0.812 ± 0.023
2.597TyrAsp: 2.597 ± 0.038
2.993TyrGlu: 2.993 ± 0.05
2.057TyrPhe: 2.057 ± 0.033
3.149TyrGly: 3.149 ± 0.049
0.982TyrHis: 0.982 ± 0.028
3.187TyrIle: 3.187 ± 0.051
2.682TyrLys: 2.682 ± 0.044
4.095TyrLeu: 4.095 ± 0.059
1.182TyrMet: 1.182 ± 0.03
2.033TyrAsn: 2.033 ± 0.037
1.421TyrPro: 1.421 ± 0.031
1.94TyrGln: 1.94 ± 0.036
2.227TyrArg: 2.227 ± 0.036
2.732TyrSer: 2.732 ± 0.048
2.255TyrThr: 2.255 ± 0.04
2.417TyrVal: 2.417 ± 0.042
0.456TyrTrp: 0.456 ± 0.018
2.27TyrTyr: 2.27 ± 0.044
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5054 proteins (1488312 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
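A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. Using the standard deviation of the replicates as the error is an assumption, since the page does not say which statistic is shown. The names pair_counts and bootstrap_error, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation (per mille) of a dipeptide frequency over replicates."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Made-up sequences (one-letter codes), for illustration only.
toy_proteome = ["MKKLA", "MKA", "AAAA"]
err = bootstrap_error(toy_proteome, "MK")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so dependence between pairs within a protein is reflected in the error estimate.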

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski