Amino acid dipepetide frequency for Puniceibacterium antarcticum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.751AlaAla: 15.751 ± 0.14
1.118AlaCys: 1.118 ± 0.027
6.843AlaAsp: 6.843 ± 0.074
7.423AlaGlu: 7.423 ± 0.065
4.294AlaPhe: 4.294 ± 0.056
9.782AlaGly: 9.782 ± 0.09
2.268AlaHis: 2.268 ± 0.039
5.788AlaIle: 5.788 ± 0.06
3.796AlaLys: 3.796 ± 0.058
14.037AlaLeu: 14.037 ± 0.119
3.835AlaMet: 3.835 ± 0.051
2.688AlaAsn: 2.688 ± 0.042
5.56AlaPro: 5.56 ± 0.07
4.863AlaGln: 4.863 ± 0.057
8.475AlaArg: 8.475 ± 0.083
5.762AlaSer: 5.762 ± 0.064
6.129AlaThr: 6.129 ± 0.057
8.34AlaVal: 8.34 ± 0.087
1.41AlaTrp: 1.41 ± 0.03
2.494AlaTyr: 2.494 ± 0.04
0.0AlaXaa: 0.0 ± 0.0
Cys
1.14CysAla: 1.14 ± 0.028
0.129CysCys: 0.129 ± 0.011
0.607CysAsp: 0.607 ± 0.02
0.417CysGlu: 0.417 ± 0.016
0.321CysPhe: 0.321 ± 0.015
0.96CysGly: 0.96 ± 0.028
0.283CysHis: 0.283 ± 0.014
0.451CysIle: 0.451 ± 0.02
0.225CysLys: 0.225 ± 0.014
0.894CysLeu: 0.894 ± 0.022
0.184CysMet: 0.184 ± 0.011
0.236CysAsn: 0.236 ± 0.012
0.527CysPro: 0.527 ± 0.018
0.265CysGln: 0.265 ± 0.012
0.583CysArg: 0.583 ± 0.018
0.486CysSer: 0.486 ± 0.018
0.473CysThr: 0.473 ± 0.017
0.664CysVal: 0.664 ± 0.021
0.142CysTrp: 0.142 ± 0.008
0.199CysTyr: 0.199 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
7.18AspAla: 7.18 ± 0.08
0.529AspCys: 0.529 ± 0.02
3.096AspAsp: 3.096 ± 0.044
3.124AspGlu: 3.124 ± 0.043
2.322AspPhe: 2.322 ± 0.042
5.469AspGly: 5.469 ± 0.072
1.405AspHis: 1.405 ± 0.032
3.189AspIle: 3.189 ± 0.047
1.75AspLys: 1.75 ± 0.039
6.718AspLeu: 6.718 ± 0.064
1.749AspMet: 1.749 ± 0.033
1.243AspAsn: 1.243 ± 0.028
3.532AspPro: 3.532 ± 0.052
2.041AspGln: 2.041 ± 0.034
4.077AspArg: 4.077 ± 0.05
2.534AspSer: 2.534 ± 0.043
3.237AspThr: 3.237 ± 0.052
4.287AspVal: 4.287 ± 0.06
1.179AspTrp: 1.179 ± 0.029
1.465AspTyr: 1.465 ± 0.028
0.0AspXaa: 0.0 ± 0.0
Glu
7.186GluAla: 7.186 ± 0.078
0.329GluCys: 0.329 ± 0.015
3.117GluAsp: 3.117 ± 0.043
3.085GluGlu: 3.085 ± 0.059
1.666GluPhe: 1.666 ± 0.04
4.282GluGly: 4.282 ± 0.048
1.121GluHis: 1.121 ± 0.025
3.356GluIle: 3.356 ± 0.048
2.065GluLys: 2.065 ± 0.042
4.876GluLeu: 4.876 ± 0.059
1.78GluMet: 1.78 ± 0.034
1.64GluAsn: 1.64 ± 0.035
2.252GluPro: 2.252 ± 0.04
1.957GluGln: 1.957 ± 0.035
4.0GluArg: 4.0 ± 0.055
2.33GluSer: 2.33 ± 0.037
3.647GluThr: 3.647 ± 0.052
4.142GluVal: 4.142 ± 0.055
0.623GluTrp: 0.623 ± 0.021
1.02GluTyr: 1.02 ± 0.024
0.0GluXaa: 0.0 ± 0.0
Phe
4.522PheAla: 4.522 ± 0.066
0.417PheCys: 0.417 ± 0.015
2.862PheAsp: 2.862 ± 0.048
2.135PheGlu: 2.135 ± 0.035
1.455PhePhe: 1.455 ± 0.038
3.811PheGly: 3.811 ± 0.053
0.776PheHis: 0.776 ± 0.023
1.625PheIle: 1.625 ± 0.034
0.94PheLys: 0.94 ± 0.027
3.467PheLeu: 3.467 ± 0.047
0.9PheMet: 0.9 ± 0.018
1.052PheAsn: 1.052 ± 0.028
1.518PhePro: 1.518 ± 0.031
1.113PheGln: 1.113 ± 0.027
2.106PheArg: 2.106 ± 0.037
2.207PheSer: 2.207 ± 0.043
2.065PheThr: 2.065 ± 0.035
2.716PheVal: 2.716 ± 0.042
0.595PheTrp: 0.595 ± 0.022
0.897PheTyr: 0.897 ± 0.029
0.0PheXaa: 0.0 ± 0.0
Gly
9.697GlyAla: 9.697 ± 0.083
0.874GlyCys: 0.874 ± 0.025
4.556GlyAsp: 4.556 ± 0.054
4.194GlyGlu: 4.194 ± 0.049
3.668GlyPhe: 3.668 ± 0.05
7.145GlyGly: 7.145 ± 0.091
1.917GlyHis: 1.917 ± 0.037
4.521GlyIle: 4.521 ± 0.056
3.08GlyLys: 3.08 ± 0.046
9.197GlyLeu: 9.197 ± 0.102
2.656GlyMet: 2.656 ± 0.049
2.15GlyAsn: 2.15 ± 0.044
3.509GlyPro: 3.509 ± 0.047
3.317GlyGln: 3.317 ± 0.05
5.522GlyArg: 5.522 ± 0.066
4.42GlySer: 4.42 ± 0.062
4.979GlyThr: 4.979 ± 0.054
6.39GlyVal: 6.39 ± 0.062
1.475GlyTrp: 1.475 ± 0.03
2.305GlyTyr: 2.305 ± 0.041
0.0GlyXaa: 0.0 ± 0.0
His
2.316HisAla: 2.316 ± 0.037
0.253HisCys: 0.253 ± 0.012
1.309HisAsp: 1.309 ± 0.027
1.019HisGlu: 1.019 ± 0.029
0.836HisPhe: 0.836 ± 0.024
1.838HisGly: 1.838 ± 0.04
0.562HisHis: 0.562 ± 0.019
0.981HisIle: 0.981 ± 0.025
0.58HisLys: 0.58 ± 0.018
2.217HisLeu: 2.217 ± 0.038
0.585HisMet: 0.585 ± 0.019
0.471HisAsn: 0.471 ± 0.019
1.334HisPro: 1.334 ± 0.03
0.596HisGln: 0.596 ± 0.019
1.355HisArg: 1.355 ± 0.031
1.026HisSer: 1.026 ± 0.026
0.84HisThr: 0.84 ± 0.023
1.568HisVal: 1.568 ± 0.031
0.344HisTrp: 0.344 ± 0.015
0.51HisTyr: 0.51 ± 0.018
0.0HisXaa: 0.0 ± 0.0
Ile
7.125IleAla: 7.125 ± 0.076
0.652IleCys: 0.652 ± 0.019
3.472IleAsp: 3.472 ± 0.051
3.369IleGlu: 3.369 ± 0.052
1.842IlePhe: 1.842 ± 0.041
4.868IleGly: 4.868 ± 0.066
0.898IleHis: 0.898 ± 0.023
2.281IleIle: 2.281 ± 0.042
1.511IleLys: 1.511 ± 0.031
5.021IleLeu: 5.021 ± 0.068
1.115IleMet: 1.115 ± 0.028
1.408IleAsn: 1.408 ± 0.031
2.493IlePro: 2.493 ± 0.047
1.251IleGln: 1.251 ± 0.028
3.276IleArg: 3.276 ± 0.045
3.219IleSer: 3.219 ± 0.056
3.143IleThr: 3.143 ± 0.047
3.691IleVal: 3.691 ± 0.056
0.764IleTrp: 0.764 ± 0.021
1.208IleTyr: 1.208 ± 0.031
0.0IleXaa: 0.0 ± 0.0
Lys
3.839LysAla: 3.839 ± 0.055
0.19LysCys: 0.19 ± 0.011
1.828LysAsp: 1.828 ± 0.037
1.492LysGlu: 1.492 ± 0.034
0.932LysPhe: 0.932 ± 0.026
2.754LysGly: 2.754 ± 0.04
0.605LysHis: 0.605 ± 0.021
1.717LysIle: 1.717 ± 0.034
1.161LysLys: 1.161 ± 0.029
2.975LysLeu: 2.975 ± 0.054
0.911LysMet: 0.911 ± 0.025
0.8LysAsn: 0.8 ± 0.024
1.766LysPro: 1.766 ± 0.037
0.924LysGln: 0.924 ± 0.027
2.212LysArg: 2.212 ± 0.04
1.943LysSer: 1.943 ± 0.035
2.091LysThr: 2.091 ± 0.034
2.395LysVal: 2.395 ± 0.046
0.37LysTrp: 0.37 ± 0.015
0.64LysTyr: 0.64 ± 0.021
0.0LysXaa: 0.0 ± 0.0
Leu
12.568LeuAla: 12.568 ± 0.11
1.06LeuCys: 1.06 ± 0.029
6.06LeuAsp: 6.06 ± 0.07
5.105LeuGlu: 5.105 ± 0.058
3.616LeuPhe: 3.616 ± 0.065
8.595LeuGly: 8.595 ± 0.085
1.947LeuHis: 1.947 ± 0.037
5.562LeuIle: 5.562 ± 0.071
3.216LeuLys: 3.216 ± 0.049
9.456LeuLeu: 9.456 ± 0.099
2.911LeuMet: 2.911 ± 0.046
2.876LeuAsn: 2.876 ± 0.049
5.573LeuPro: 5.573 ± 0.061
3.075LeuGln: 3.075 ± 0.045
7.35LeuArg: 7.35 ± 0.088
7.38LeuSer: 7.38 ± 0.084
6.252LeuThr: 6.252 ± 0.071
6.909LeuVal: 6.909 ± 0.071
1.331LeuTrp: 1.331 ± 0.034
1.998LeuTyr: 1.998 ± 0.035
0.0LeuXaa: 0.0 ± 0.0
Met
3.494MetAla: 3.494 ± 0.046
0.201MetCys: 0.201 ± 0.011
1.462MetAsp: 1.462 ± 0.031
1.289MetGlu: 1.289 ± 0.032
0.867MetPhe: 0.867 ± 0.028
2.314MetGly: 2.314 ± 0.046
0.51MetHis: 0.51 ± 0.018
1.696MetIle: 1.696 ± 0.035
1.078MetLys: 1.078 ± 0.026
2.805MetLeu: 2.805 ± 0.048
0.856MetMet: 0.856 ± 0.028
0.942MetAsn: 0.942 ± 0.025
1.622MetPro: 1.622 ± 0.029
1.074MetGln: 1.074 ± 0.026
1.95MetArg: 1.95 ± 0.037
1.901MetSer: 1.901 ± 0.032
2.275MetThr: 2.275 ± 0.039
1.921MetVal: 1.921 ± 0.036
0.222MetTrp: 0.222 ± 0.011
0.358MetTyr: 0.358 ± 0.017
0.0MetXaa: 0.0 ± 0.0
Asn
3.205AsnAla: 3.205 ± 0.043
0.245AsnCys: 0.245 ± 0.012
1.505AsnAsp: 1.505 ± 0.032
1.133AsnGlu: 1.133 ± 0.025
0.96AsnPhe: 0.96 ± 0.021
2.399AsnGly: 2.399 ± 0.04
0.51AsnHis: 0.51 ± 0.018
1.478AsnIle: 1.478 ± 0.035
0.688AsnLys: 0.688 ± 0.024
2.642AsnLeu: 2.642 ± 0.038
0.758AsnMet: 0.758 ± 0.02
0.678AsnAsn: 0.678 ± 0.024
1.813AsnPro: 1.813 ± 0.039
0.713AsnGln: 0.713 ± 0.023
1.778AsnArg: 1.778 ± 0.035
1.251AsnSer: 1.251 ± 0.03
1.462AsnThr: 1.462 ± 0.034
1.844AsnVal: 1.844 ± 0.035
0.441AsnTrp: 0.441 ± 0.017
0.617AsnTyr: 0.617 ± 0.019
0.0AsnXaa: 0.0 ± 0.0
Pro
5.355ProAla: 5.355 ± 0.061
0.381ProCys: 0.381 ± 0.014
3.882ProAsp: 3.882 ± 0.055
3.813ProGlu: 3.813 ± 0.055
1.935ProPhe: 1.935 ± 0.037
4.335ProGly: 4.335 ± 0.059
1.033ProHis: 1.033 ± 0.023
2.261ProIle: 2.261 ± 0.041
1.689ProLys: 1.689 ± 0.034
4.795ProLeu: 4.795 ± 0.071
1.348ProMet: 1.348 ± 0.029
1.307ProAsn: 1.307 ± 0.032
2.233ProPro: 2.233 ± 0.043
1.8ProGln: 1.8 ± 0.034
2.73ProArg: 2.73 ± 0.045
2.664ProSer: 2.664 ± 0.046
2.42ProThr: 2.42 ± 0.036
4.147ProVal: 4.147 ± 0.054
0.655ProTrp: 0.655 ± 0.022
1.16ProTyr: 1.16 ± 0.026
0.0ProXaa: 0.0 ± 0.0
Gln
4.079GlnAla: 4.079 ± 0.051
0.209GlnCys: 0.209 ± 0.009
1.891GlnAsp: 1.891 ± 0.036
1.6GlnGlu: 1.6 ± 0.03
1.132GlnPhe: 1.132 ± 0.028
2.752GlnGly: 2.752 ± 0.039
0.678GlnHis: 0.678 ± 0.022
2.302GlnIle: 2.302 ± 0.04
1.15GlnLys: 1.15 ± 0.028
3.021GlnLeu: 3.021 ± 0.047
1.152GlnMet: 1.152 ± 0.026
1.025GlnAsn: 1.025 ± 0.027
1.657GlnPro: 1.657 ± 0.035
1.207GlnGln: 1.207 ± 0.034
2.355GlnArg: 2.355 ± 0.044
2.051GlnSer: 2.051 ± 0.037
2.084GlnThr: 2.084 ± 0.038
2.402GlnVal: 2.402 ± 0.04
0.428GlnTrp: 0.428 ± 0.018
0.604GlnTyr: 0.604 ± 0.018
0.0GlnXaa: 0.0 ± 0.0
Arg
7.83ArgAla: 7.83 ± 0.089
0.526ArgCys: 0.526 ± 0.021
4.402ArgAsp: 4.402 ± 0.063
3.432ArgGlu: 3.432 ± 0.05
2.623ArgPhe: 2.623 ± 0.036
4.556ArgGly: 4.556 ± 0.059
1.573ArgHis: 1.573 ± 0.032
3.967ArgIle: 3.967 ± 0.054
2.329ArgLys: 2.329 ± 0.041
7.194ArgLeu: 7.194 ± 0.075
2.026ArgMet: 2.026 ± 0.033
1.804ArgAsn: 1.804 ± 0.038
3.137ArgPro: 3.137 ± 0.048
2.36ArgGln: 2.36 ± 0.044
4.971ArgArg: 4.971 ± 0.071
3.467ArgSer: 3.467 ± 0.045
3.204ArgThr: 3.204 ± 0.045
4.615ArgVal: 4.615 ± 0.059
0.932ArgTrp: 0.932 ± 0.023
1.542ArgTyr: 1.542 ± 0.029
0.0ArgXaa: 0.0 ± 0.0
Ser
6.274SerAla: 6.274 ± 0.057
0.519SerCys: 0.519 ± 0.02
3.633SerAsp: 3.633 ± 0.054
3.148SerGlu: 3.148 ± 0.048
2.308SerPhe: 2.308 ± 0.039
5.691SerGly: 5.691 ± 0.062
1.158SerHis: 1.158 ± 0.025
2.68SerIle: 2.68 ± 0.043
1.612SerLys: 1.612 ± 0.035
5.377SerLeu: 5.377 ± 0.058
1.532SerMet: 1.532 ± 0.03
1.47SerAsn: 1.47 ± 0.032
2.55SerPro: 2.55 ± 0.046
1.773SerGln: 1.773 ± 0.035
3.399SerArg: 3.399 ± 0.052
2.921SerSer: 2.921 ± 0.049
2.663SerThr: 2.663 ± 0.041
4.153SerVal: 4.153 ± 0.053
0.762SerTrp: 0.762 ± 0.021
1.413SerTyr: 1.413 ± 0.03
0.0SerXaa: 0.0 ± 0.0
Thr
6.546ThrAla: 6.546 ± 0.079
0.513ThrCys: 0.513 ± 0.018
3.234ThrAsp: 3.234 ± 0.049
2.942ThrGlu: 2.942 ± 0.043
2.091ThrPhe: 2.091 ± 0.041
5.513ThrGly: 5.513 ± 0.069
1.175ThrHis: 1.175 ± 0.027
2.796ThrIle: 2.796 ± 0.043
1.534ThrLys: 1.534 ± 0.03
6.611ThrLeu: 6.611 ± 0.073
1.356ThrMet: 1.356 ± 0.033
1.364ThrAsn: 1.364 ± 0.031
3.531ThrPro: 3.531 ± 0.048
1.82ThrGln: 1.82 ± 0.036
3.505ThrArg: 3.505 ± 0.051
3.052ThrSer: 3.052 ± 0.045
3.059ThrThr: 3.059 ± 0.051
4.186ThrVal: 4.186 ± 0.055
0.66ThrTrp: 0.66 ± 0.022
1.285ThrTyr: 1.285 ± 0.03
0.0ThrXaa: 0.0 ± 0.0
Val
8.69ValAla: 8.69 ± 0.082
0.666ValCys: 0.666 ± 0.021
3.984ValAsp: 3.984 ± 0.051
3.963ValGlu: 3.963 ± 0.051
2.872ValPhe: 2.872 ± 0.041
5.412ValGly: 5.412 ± 0.069
1.315ValHis: 1.315 ± 0.03
4.271ValIle: 4.271 ± 0.06
2.101ValLys: 2.101 ± 0.04
7.662ValLeu: 7.662 ± 0.088
2.25ValMet: 2.25 ± 0.038
1.915ValAsn: 1.915 ± 0.036
3.542ValPro: 3.542 ± 0.047
2.309ValGln: 2.309 ± 0.041
4.22ValArg: 4.22 ± 0.048
4.409ValSer: 4.409 ± 0.056
4.792ValThr: 4.792 ± 0.061
5.663ValVal: 5.663 ± 0.069
0.962ValTrp: 0.962 ± 0.03
1.463ValTyr: 1.463 ± 0.032
0.0ValXaa: 0.0 ± 0.0
Trp
1.322TrpAla: 1.322 ± 0.029
0.153TrpCys: 0.153 ± 0.009
0.785TrpAsp: 0.785 ± 0.021
0.597TrpGlu: 0.597 ± 0.019
0.534TrpPhe: 0.534 ± 0.018
0.994TrpGly: 0.994 ± 0.03
0.338TrpHis: 0.338 ± 0.014
0.724TrpIle: 0.724 ± 0.022
0.458TrpLys: 0.458 ± 0.017
1.589TrpLeu: 1.589 ± 0.034
0.439TrpMet: 0.439 ± 0.018
0.449TrpAsn: 0.449 ± 0.016
0.721TrpPro: 0.721 ± 0.026
0.629TrpGln: 0.629 ± 0.02
1.094TrpArg: 1.094 ± 0.027
0.85TrpSer: 0.85 ± 0.022
0.791TrpThr: 0.791 ± 0.021
0.905TrpVal: 0.905 ± 0.024
0.216TrpTrp: 0.216 ± 0.011
0.261TrpTyr: 0.261 ± 0.012
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.537TyrAla: 2.537 ± 0.038
0.222TyrCys: 0.222 ± 0.012
1.598TyrAsp: 1.598 ± 0.033
1.196TyrGlu: 1.196 ± 0.03
0.906TyrPhe: 0.906 ± 0.024
2.102TyrGly: 2.102 ± 0.037
0.469TyrHis: 0.469 ± 0.021
0.961TyrIle: 0.961 ± 0.026
0.583TyrLys: 0.583 ± 0.022
2.285TyrLeu: 2.285 ± 0.041
0.493TyrMet: 0.493 ± 0.018
0.602TyrAsn: 0.602 ± 0.021
1.074TyrPro: 1.074 ± 0.029
0.693TyrGln: 0.693 ± 0.023
1.528TyrArg: 1.528 ± 0.03
1.163TyrSer: 1.163 ± 0.029
1.167TyrThr: 1.167 ± 0.024
1.521TyrVal: 1.521 ± 0.03
0.339TyrTrp: 0.339 ± 0.017
0.554TyrTyr: 0.554 ± 0.021
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5603 proteins (1590982 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski