Amino acid dipeptide frequency for Halocynthiibacter arcticus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
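As an illustration of what a dipeptide frequency is, the following minimal sketch counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the use of one-letter codes, the choice of overlapping windows, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_per_mille(["MAAL", "AAG"])
```

Here the two toy sequences yield five dipeptides in total (MA, AA, AL, AA, AG), so `freqs["AA"]` is 2 out of 5.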

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 400 equally likely, each one should be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
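The expected value and the colour marking can be sketched as follows. The baseline of 1/400 comes from the text above; the function name `mark` and the exact comparison rule (strictly above or below the baseline, with no tolerance) are assumptions, since the page does not state how ties or near-baseline values are handled. The three observed values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids, giving 20 * 20 = 400 dipeptides

# Uniform baseline: 1/400 = 0.25 % = 2.5 per mille.
expected_per_mille = 1000.0 / N_STANDARD ** 2


def mark(value_per_mille, expected=expected_per_mille):
    """Return the assumed table colour for one dipeptide frequency."""
    if value_per_mille > expected:
        return "red"   # more frequent than expected
    if value_per_mille < expected:
        return "blue"  # underrepresented
    return "none"


# Values in per mille, copied from the table.
observed = {"AlaAla": 12.937, "CysCys": 0.103, "LysIle": 2.569}
marks = {pair: mark(value) for pair, value in observed.items()}
```

Under this rule AlaAla and LysIle come out above the baseline and CysCys below it.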

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
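A short worked conversion, using the AlaAla value from the table; the helper name `per_mille_to_fraction` is chosen only for this example.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala_fraction = per_mille_to_fraction(12.937)  # about 0.012937
ala_ala_percent = ala_ala_fraction * 100.0        # about 1.2937 %
```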

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.937 ± 0.142
AlaCys: 1.071 ± 0.038
AlaAsp: 5.999 ± 0.08
AlaGlu: 6.844 ± 0.087
AlaPhe: 4.23 ± 0.063
AlaGly: 8.688 ± 0.092
AlaHis: 2.113 ± 0.046
AlaIle: 6.499 ± 0.074
AlaLys: 4.622 ± 0.069
AlaLeu: 12.209 ± 0.122
AlaMet: 3.376 ± 0.049
AlaAsn: 3.157 ± 0.055
AlaPro: 4.864 ± 0.08
AlaGln: 4.421 ± 0.067
AlaArg: 6.665 ± 0.091
AlaSer: 5.972 ± 0.071
AlaThr: 5.983 ± 0.076
AlaVal: 7.444 ± 0.091
AlaTrp: 1.263 ± 0.035
AlaTyr: 2.474 ± 0.045
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.009 ± 0.034
CysCys: 0.103 ± 0.009
CysAsp: 0.654 ± 0.022
CysGlu: 0.511 ± 0.021
CysPhe: 0.391 ± 0.017
CysGly: 0.911 ± 0.026
CysHis: 0.248 ± 0.016
CysIle: 0.464 ± 0.021
CysLys: 0.298 ± 0.016
CysLeu: 0.899 ± 0.028
CysMet: 0.182 ± 0.013
CysAsn: 0.251 ± 0.014
CysPro: 0.46 ± 0.02
CysGln: 0.263 ± 0.013
CysArg: 0.513 ± 0.019
CysSer: 0.524 ± 0.023
CysThr: 0.471 ± 0.019
CysVal: 0.704 ± 0.027
CysTrp: 0.103 ± 0.009
CysTyr: 0.219 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.424 ± 0.075
AspCys: 0.524 ± 0.021
AspAsp: 3.064 ± 0.06
AspGlu: 3.327 ± 0.068
AspPhe: 2.568 ± 0.045
AspGly: 4.968 ± 0.07
AspHis: 1.24 ± 0.039
AspIle: 3.503 ± 0.058
AspLys: 1.82 ± 0.047
AspLeu: 6.234 ± 0.065
AspMet: 1.597 ± 0.034
AspAsn: 1.458 ± 0.04
AspPro: 3.156 ± 0.048
AspGln: 1.888 ± 0.04
AspArg: 3.374 ± 0.055
AspSer: 2.477 ± 0.043
AspThr: 2.974 ± 0.052
AspVal: 4.428 ± 0.065
AspTrp: 0.977 ± 0.03
AspTyr: 1.464 ± 0.026
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.172 ± 0.1
GluCys: 0.423 ± 0.018
GluAsp: 3.141 ± 0.048
GluGlu: 3.358 ± 0.062
GluPhe: 2.183 ± 0.044
GluGly: 4.474 ± 0.059
GluHis: 1.141 ± 0.03
GluIle: 4.247 ± 0.065
GluLys: 2.749 ± 0.051
GluLeu: 5.281 ± 0.071
GluMet: 1.859 ± 0.037
GluAsn: 2.389 ± 0.041
GluPro: 2.171 ± 0.048
GluGln: 1.946 ± 0.041
GluArg: 3.821 ± 0.058
GluSer: 2.7 ± 0.046
GluThr: 3.96 ± 0.063
GluVal: 4.38 ± 0.067
GluTrp: 0.675 ± 0.024
GluTyr: 1.191 ± 0.036
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.447 ± 0.063
PheCys: 0.445 ± 0.02
PheAsp: 2.948 ± 0.05
PheGlu: 2.477 ± 0.05
PhePhe: 1.603 ± 0.043
PheGly: 3.756 ± 0.058
PheHis: 0.786 ± 0.025
PheIle: 1.986 ± 0.036
PheLys: 1.253 ± 0.04
PheLeu: 3.851 ± 0.065
PheMet: 0.995 ± 0.03
PheAsn: 1.292 ± 0.036
PhePro: 1.665 ± 0.039
PheGln: 1.251 ± 0.038
PheArg: 1.966 ± 0.042
PheSer: 2.733 ± 0.051
PheThr: 2.333 ± 0.052
PheVal: 3.01 ± 0.052
PheTrp: 0.602 ± 0.027
PheTyr: 1.077 ± 0.034
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.471 ± 0.094
GlyCys: 0.884 ± 0.029
GlyAsp: 4.196 ± 0.061
GlyGlu: 4.41 ± 0.06
GlyPhe: 3.718 ± 0.062
GlyGly: 6.533 ± 0.11
GlyHis: 1.838 ± 0.042
GlyIle: 4.775 ± 0.065
GlyLys: 3.634 ± 0.07
GlyLeu: 8.329 ± 0.092
GlyMet: 2.395 ± 0.046
GlyAsn: 2.408 ± 0.055
GlyPro: 3.084 ± 0.046
GlyGln: 2.837 ± 0.051
GlyArg: 4.509 ± 0.058
GlySer: 4.444 ± 0.068
GlyThr: 4.487 ± 0.07
GlyVal: 6.241 ± 0.071
GlyTrp: 1.22 ± 0.033
GlyTyr: 2.261 ± 0.045
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.016 ± 0.042
HisCys: 0.224 ± 0.017
HisAsp: 1.129 ± 0.033
HisGlu: 1.042 ± 0.032
HisPhe: 0.893 ± 0.026
HisGly: 1.683 ± 0.042
HisHis: 0.565 ± 0.024
HisIle: 1.119 ± 0.04
HisLys: 0.665 ± 0.025
HisLeu: 2.066 ± 0.04
HisMet: 0.527 ± 0.023
HisAsn: 0.561 ± 0.021
HisPro: 1.272 ± 0.036
HisGln: 0.628 ± 0.019
HisArg: 1.158 ± 0.035
HisSer: 1.169 ± 0.03
HisThr: 0.931 ± 0.027
HisVal: 1.522 ± 0.041
HisTrp: 0.323 ± 0.016
HisTyr: 0.541 ± 0.021
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.148 ± 0.072
IleCys: 0.676 ± 0.024
IleAsp: 3.693 ± 0.059
IleGlu: 3.973 ± 0.056
IlePhe: 2.166 ± 0.044
IleGly: 5.087 ± 0.074
IleHis: 0.964 ± 0.03
IleIle: 2.873 ± 0.052
IleLys: 2.001 ± 0.041
IleLeu: 5.766 ± 0.07
IleMet: 1.27 ± 0.037
IleAsn: 1.785 ± 0.038
IlePro: 2.664 ± 0.051
IleGln: 1.571 ± 0.038
IleArg: 3.096 ± 0.052
IleSer: 4.065 ± 0.058
IleThr: 3.299 ± 0.054
IleVal: 4.319 ± 0.064
IleTrp: 0.749 ± 0.027
IleTyr: 1.387 ± 0.035
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.236 ± 0.065
LysCys: 0.274 ± 0.014
LysAsp: 2.105 ± 0.043
LysGlu: 1.993 ± 0.043
LysPhe: 1.355 ± 0.037
LysGly: 2.953 ± 0.053
LysHis: 0.766 ± 0.026
LysIle: 2.569 ± 0.053
LysLys: 1.855 ± 0.042
LysLeu: 3.673 ± 0.053
LysMet: 1.23 ± 0.035
LysAsn: 1.441 ± 0.035
LysPro: 2.038 ± 0.046
LysGln: 1.209 ± 0.035
LysArg: 2.62 ± 0.05
LysSer: 2.734 ± 0.052
LysThr: 2.793 ± 0.052
LysVal: 2.609 ± 0.05
LysTrp: 0.518 ± 0.023
LysTyr: 0.909 ± 0.029
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.556 ± 0.114
LeuCys: 0.954 ± 0.031
LeuAsp: 5.566 ± 0.075
LeuGlu: 5.851 ± 0.071
LeuPhe: 3.81 ± 0.065
LeuGly: 8.044 ± 0.103
LeuHis: 1.793 ± 0.043
LeuIle: 5.471 ± 0.07
LeuLys: 4.138 ± 0.064
LeuLeu: 9.055 ± 0.114
LeuMet: 2.613 ± 0.045
LeuAsn: 3.216 ± 0.049
LeuPro: 5.141 ± 0.073
LeuGln: 2.947 ± 0.053
LeuArg: 6.254 ± 0.079
LeuSer: 7.099 ± 0.089
LeuThr: 6.073 ± 0.073
LeuVal: 6.859 ± 0.098
LeuTrp: 1.217 ± 0.037
LeuTyr: 2.012 ± 0.042
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.175 ± 0.056
MetCys: 0.208 ± 0.013
MetAsp: 1.46 ± 0.034
MetGlu: 1.349 ± 0.036
MetPhe: 0.934 ± 0.028
MetGly: 2.211 ± 0.044
MetHis: 0.507 ± 0.02
MetIle: 1.753 ± 0.041
MetLys: 1.28 ± 0.036
MetLeu: 2.45 ± 0.051
MetMet: 0.781 ± 0.032
MetAsn: 0.986 ± 0.027
MetPro: 1.416 ± 0.035
MetGln: 0.961 ± 0.031
MetArg: 1.745 ± 0.039
MetSer: 1.897 ± 0.037
MetThr: 1.998 ± 0.047
MetVal: 1.912 ± 0.042
MetTrp: 0.24 ± 0.017
MetTyr: 0.346 ± 0.018
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.562 ± 0.062
AsnCys: 0.323 ± 0.018
AsnAsp: 1.813 ± 0.041
AsnGlu: 1.625 ± 0.039
AsnPhe: 1.281 ± 0.033
AsnGly: 2.84 ± 0.063
AsnHis: 0.632 ± 0.021
AsnIle: 1.878 ± 0.042
AsnLys: 1.061 ± 0.032
AsnLeu: 3.146 ± 0.052
AsnMet: 0.847 ± 0.028
AsnAsn: 0.937 ± 0.032
AsnPro: 2.016 ± 0.044
AsnGln: 0.942 ± 0.029
AsnArg: 1.825 ± 0.039
AsnSer: 1.8 ± 0.043
AsnThr: 1.687 ± 0.038
AsnVal: 2.268 ± 0.043
AsnTrp: 0.54 ± 0.022
AsnTyr: 0.803 ± 0.029
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.59 ± 0.072
ProCys: 0.345 ± 0.016
ProAsp: 3.266 ± 0.056
ProGlu: 3.776 ± 0.058
ProPhe: 2.041 ± 0.045
ProGly: 2.826 ± 0.055
ProHis: 1.002 ± 0.029
ProIle: 2.764 ± 0.044
ProLys: 2.149 ± 0.047
ProLeu: 4.4 ± 0.066
ProMet: 1.315 ± 0.031
ProAsn: 1.737 ± 0.038
ProPro: 1.851 ± 0.038
ProGln: 1.571 ± 0.039
ProArg: 2.33 ± 0.042
ProSer: 2.952 ± 0.052
ProThr: 2.678 ± 0.048
ProVal: 3.565 ± 0.052
ProTrp: 0.59 ± 0.02
ProTyr: 1.131 ± 0.038
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.716 ± 0.066
GlnCys: 0.234 ± 0.015
GlnAsp: 1.807 ± 0.038
GlnGlu: 1.851 ± 0.045
GlnPhe: 1.229 ± 0.034
GlnGly: 2.48 ± 0.04
GlnHis: 0.602 ± 0.023
GlnIle: 2.324 ± 0.051
GlnLys: 1.55 ± 0.036
GlnLeu: 2.81 ± 0.054
GlnMet: 1.061 ± 0.03
GlnAsn: 1.277 ± 0.032
GlnPro: 1.377 ± 0.039
GlnGln: 1.048 ± 0.034
GlnArg: 2.087 ± 0.047
GlnSer: 2.267 ± 0.042
GlnThr: 2.047 ± 0.045
GlnVal: 2.323 ± 0.045
GlnTrp: 0.382 ± 0.017
GlnTyr: 0.678 ± 0.024
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.524 ± 0.089
ArgCys: 0.447 ± 0.021
ArgAsp: 3.535 ± 0.059
ArgGlu: 3.51 ± 0.061
ArgPhe: 2.384 ± 0.051
ArgGly: 3.841 ± 0.058
ArgHis: 1.35 ± 0.036
ArgIle: 3.59 ± 0.056
ArgLys: 2.491 ± 0.048
ArgLeu: 6.155 ± 0.079
ArgMet: 1.739 ± 0.041
ArgAsn: 1.937 ± 0.044
ArgPro: 2.582 ± 0.048
ArgGln: 2.082 ± 0.048
ArgArg: 3.859 ± 0.06
ArgSer: 3.144 ± 0.052
ArgThr: 2.8 ± 0.049
ArgVal: 4.278 ± 0.065
ArgTrp: 0.749 ± 0.026
ArgTyr: 1.395 ± 0.032
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.083 ± 0.073
SerCys: 0.509 ± 0.018
SerAsp: 3.655 ± 0.059
SerGlu: 3.725 ± 0.056
SerPhe: 2.658 ± 0.054
SerGly: 5.68 ± 0.074
SerHis: 1.225 ± 0.036
SerIle: 3.32 ± 0.057
SerLys: 2.379 ± 0.049
SerLeu: 5.914 ± 0.077
SerMet: 1.619 ± 0.039
SerAsn: 1.889 ± 0.042
SerPro: 2.73 ± 0.049
SerGln: 2.016 ± 0.036
SerArg: 3.236 ± 0.058
SerSer: 3.345 ± 0.061
SerThr: 3.177 ± 0.054
SerVal: 4.352 ± 0.065
SerTrp: 0.693 ± 0.027
SerTyr: 1.474 ± 0.035
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.901 ± 0.079
ThrCys: 0.517 ± 0.022
ThrAsp: 3.116 ± 0.05
ThrGlu: 3.088 ± 0.057
ThrPhe: 2.209 ± 0.043
ThrGly: 5.175 ± 0.079
ThrHis: 1.176 ± 0.033
ThrIle: 3.234 ± 0.049
ThrLys: 2.129 ± 0.041
ThrLeu: 6.264 ± 0.075
ThrMet: 1.346 ± 0.036
ThrAsn: 1.664 ± 0.042
ThrPro: 3.387 ± 0.066
ThrGln: 1.974 ± 0.041
ThrArg: 3.116 ± 0.045
ThrSer: 3.475 ± 0.061
ThrThr: 3.317 ± 0.068
ThrVal: 4.223 ± 0.074
ThrTrp: 0.699 ± 0.027
ThrTyr: 1.447 ± 0.038
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.212 ± 0.098
ValCys: 0.621 ± 0.024
ValAsp: 4.09 ± 0.057
ValGlu: 4.519 ± 0.063
ValPhe: 3.182 ± 0.05
ValGly: 5.461 ± 0.069
ValHis: 1.309 ± 0.035
ValIle: 4.335 ± 0.055
ValLys: 2.539 ± 0.05
ValLeu: 7.349 ± 0.099
ValMet: 1.955 ± 0.042
ValAsn: 2.257 ± 0.047
ValPro: 3.276 ± 0.055
ValGln: 2.226 ± 0.046
ValArg: 3.829 ± 0.06
ValSer: 4.697 ± 0.065
ValThr: 4.531 ± 0.065
ValVal: 5.662 ± 0.077
ValTrp: 0.95 ± 0.03
ValTyr: 1.595 ± 0.039
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.208 ± 0.035
TrpCys: 0.126 ± 0.011
TrpAsp: 0.72 ± 0.024
TrpGlu: 0.682 ± 0.026
TrpPhe: 0.546 ± 0.023
TrpGly: 0.945 ± 0.031
TrpHis: 0.3 ± 0.017
TrpIle: 0.708 ± 0.025
TrpLys: 0.513 ± 0.021
TrpLeu: 1.378 ± 0.043
TrpMet: 0.418 ± 0.022
TrpAsn: 0.487 ± 0.02
TrpPro: 0.631 ± 0.023
TrpGln: 0.575 ± 0.023
TrpArg: 0.955 ± 0.028
TrpSer: 0.792 ± 0.026
TrpThr: 0.716 ± 0.029
TrpVal: 0.892 ± 0.028
TrpTrp: 0.212 ± 0.013
TrpTyr: 0.261 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.443 ± 0.05
TyrCys: 0.271 ± 0.016
TyrAsp: 1.508 ± 0.036
TyrGlu: 1.348 ± 0.034
TyrPhe: 1.097 ± 0.03
TyrGly: 2.119 ± 0.052
TyrHis: 0.477 ± 0.022
TyrIle: 1.142 ± 0.03
TyrLys: 0.755 ± 0.028
TyrLeu: 2.438 ± 0.047
TyrMet: 0.483 ± 0.018
TyrAsn: 0.696 ± 0.023
TyrPro: 1.1 ± 0.031
TyrGln: 0.794 ± 0.024
TyrArg: 1.432 ± 0.037
TyrSer: 1.393 ± 0.034
TyrThr: 1.202 ± 0.034
TyrVal: 1.607 ± 0.037
TyrTrp: 0.36 ± 0.021
TyrTyr: 0.666 ± 0.023
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3882 proteins (1158560 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
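A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is only an illustration under assumptions: the page does not say which spread statistic is reported (the sample standard deviation is assumed here), and the function names, the fixed seed, the one-letter codes, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of
    `pair` over n_boot resamples of whole proteins."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins (not residues) with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
mean, err = bootstrap_error(["MAAL", "AAGA", "MKLV", "AALA"], "AA")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within a protein together, so the error reflects variation between proteins.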

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski