Amino acid dipeptide frequency for Cuneatibacter caecimuris

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each amino acid dipeptide would be expected at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
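The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_permille`, the toy sequences, and the choice to count overlapping residue pairs within each protein are all assumptions made for the example, and the input is assumed to contain only the 20 standard one-letter codes.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Hypothetical helper: assumes sequences use only the 20 standard letters.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    pairs = ("".join(p) for p in product(AMINO_ACIDS, repeat=2))
    return {dp: 1000.0 * counts[dp] / total for dp in pairs}


# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / len(AMINO_ACIDS) ** 2

# Toy sequences, invented for illustration only.
freqs = dipeptide_permille(["MAALLK", "MKAAG"])
over = sorted(dp for dp, f in freqs.items() if f > UNIFORM_PERMILLE)
```

Comparing each value with `UNIFORM_PERMILLE` mirrors the over/under-representation split described in the text; with only nine pairs in the toy input, every observed dipeptide ends up above the uniform baseline.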

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
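As a small worked example of the unit conversion, the snippet below turns the AlaAla value from the table into a plain fraction and compares it with the uniform expectation of 1/400. The helper name `permille_to_fraction` is hypothetical and used only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala = permille_to_fraction(9.675)  # AlaAla value taken from the table
uniform = permille_to_fraction(2.5)    # 1/400 expressed in per mille
ratio = ala_ala / uniform              # fold difference vs. the uniform expectation
```

Here `ala_ala` is about 0.009675, and `ratio` is about 3.87, meaning AlaAla occurs roughly 3.9 times more often than it would under a uniform distribution.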

For more information, see the sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 9.675 ± 0.151 | 1.312 ± 0.037 | 4.617 ± 0.076 | 6.731 ± 0.092 | 3.062 ± 0.06 | 7.931 ± 0.118 | 1.167 ± 0.032 | 3.997 ± 0.083 | 3.854 ± 0.061 | 7.73 ± 0.095 | 2.383 ± 0.054 | 2.234 ± 0.053 | 2.317 ± 0.062 | 2.552 ± 0.058 | 3.877 ± 0.066 | 4.307 ± 0.063 | 2.81 ± 0.06 | 7.506 ± 0.093 | 0.818 ± 0.03 | 2.749 ± 0.052 | 0.0 ± 0.0
Cys | 1.106 ± 0.035 | 0.359 ± 0.021 | 0.726 ± 0.03 | 0.848 ± 0.029 | 0.677 ± 0.025 | 1.713 ± 0.041 | 0.289 ± 0.018 | 0.941 ± 0.031 | 0.605 ± 0.028 | 1.444 ± 0.04 | 0.447 ± 0.019 | 0.476 ± 0.023 | 0.694 ± 0.033 | 0.46 ± 0.02 | 1.204 ± 0.04 | 1.049 ± 0.034 | 0.724 ± 0.028 | 0.981 ± 0.032 | 0.187 ± 0.014 | 0.584 ± 0.025 | 0.0 ± 0.0
Asp | 4.135 ± 0.071 | 0.88 ± 0.032 | 2.215 ± 0.062 | 3.889 ± 0.067 | 2.545 ± 0.05 | 4.605 ± 0.092 | 0.914 ± 0.037 | 3.537 ± 0.066 | 2.721 ± 0.058 | 4.784 ± 0.075 | 1.743 ± 0.04 | 1.736 ± 0.047 | 2.026 ± 0.051 | 1.78 ± 0.045 | 2.873 ± 0.054 | 3.173 ± 0.059 | 2.77 ± 0.053 | 3.432 ± 0.068 | 0.707 ± 0.03 | 2.519 ± 0.051 | 0.0 ± 0.0
Glu | 6.292 ± 0.094 | 0.812 ± 0.028 | 4.359 ± 0.072 | 8.011 ± 0.114 | 2.56 ± 0.054 | 5.064 ± 0.085 | 1.242 ± 0.033 | 5.53 ± 0.08 | 6.113 ± 0.094 | 7.157 ± 0.096 | 2.573 ± 0.051 | 3.879 ± 0.067 | 2.077 ± 0.044 | 3.367 ± 0.072 | 3.622 ± 0.07 | 3.571 ± 0.064 | 4.055 ± 0.077 | 4.327 ± 0.072 | 0.763 ± 0.032 | 2.982 ± 0.054 | 0.0 ± 0.0
Phe | 3.082 ± 0.058 | 0.796 ± 0.031 | 2.368 ± 0.051 | 2.469 ± 0.058 | 1.69 ± 0.046 | 3.118 ± 0.067 | 0.835 ± 0.03 | 2.329 ± 0.054 | 1.466 ± 0.042 | 4.166 ± 0.077 | 1.063 ± 0.036 | 1.181 ± 0.034 | 1.487 ± 0.041 | 1.523 ± 0.037 | 2.237 ± 0.049 | 3.1 ± 0.062 | 2.31 ± 0.051 | 2.477 ± 0.055 | 0.547 ± 0.026 | 1.725 ± 0.045 | 0.0 ± 0.0
Gly | 5.858 ± 0.102 | 1.438 ± 0.045 | 3.691 ± 0.069 | 5.303 ± 0.074 | 3.194 ± 0.065 | 5.652 ± 0.119 | 1.29 ± 0.038 | 5.861 ± 0.084 | 5.364 ± 0.082 | 6.829 ± 0.095 | 2.684 ± 0.056 | 3.129 ± 0.073 | 1.708 ± 0.057 | 2.593 ± 0.061 | 4.288 ± 0.076 | 4.517 ± 0.083 | 4.271 ± 0.084 | 5.092 ± 0.079 | 0.916 ± 0.037 | 3.293 ± 0.071 | 0.0 ± 0.0
His | 1.072 ± 0.03 | 0.343 ± 0.019 | 0.842 ± 0.028 | 1.009 ± 0.028 | 0.852 ± 0.028 | 1.251 ± 0.039 | 0.405 ± 0.024 | 1.227 ± 0.033 | 0.744 ± 0.026 | 1.569 ± 0.048 | 0.51 ± 0.021 | 0.644 ± 0.027 | 0.908 ± 0.029 | 0.609 ± 0.028 | 0.859 ± 0.034 | 0.963 ± 0.032 | 0.928 ± 0.03 | 1.115 ± 0.033 | 0.207 ± 0.016 | 0.8 ± 0.029 | 0.0 ± 0.0
Ile | 5.01 ± 0.081 | 1.166 ± 0.041 | 3.239 ± 0.064 | 3.766 ± 0.067 | 2.744 ± 0.058 | 4.501 ± 0.08 | 1.2 ± 0.035 | 3.75 ± 0.066 | 2.959 ± 0.059 | 6.638 ± 0.088 | 1.605 ± 0.042 | 2.315 ± 0.049 | 3.01 ± 0.059 | 2.359 ± 0.047 | 3.941 ± 0.07 | 4.743 ± 0.071 | 3.484 ± 0.06 | 4.009 ± 0.066 | 0.673 ± 0.024 | 2.433 ± 0.048 | 0.0 ± 0.0
Lys | 4.5 ± 0.07 | 0.605 ± 0.027 | 3.074 ± 0.055 | 5.456 ± 0.09 | 1.707 ± 0.048 | 3.745 ± 0.064 | 0.838 ± 0.029 | 4.094 ± 0.062 | 4.862 ± 0.076 | 5.001 ± 0.078 | 1.921 ± 0.048 | 2.936 ± 0.054 | 1.954 ± 0.045 | 2.159 ± 0.047 | 2.933 ± 0.062 | 2.975 ± 0.058 | 3.258 ± 0.062 | 3.455 ± 0.058 | 0.64 ± 0.027 | 2.307 ± 0.047 | 0.0 ± 0.0
Leu | 7.88 ± 0.115 | 1.613 ± 0.044 | 5.244 ± 0.088 | 7.549 ± 0.108 | 3.829 ± 0.072 | 6.327 ± 0.089 | 1.547 ± 0.039 | 5.6 ± 0.079 | 5.993 ± 0.076 | 9.455 ± 0.136 | 2.685 ± 0.057 | 3.62 ± 0.065 | 4.032 ± 0.065 | 3.462 ± 0.066 | 4.719 ± 0.07 | 6.245 ± 0.085 | 5.08 ± 0.074 | 5.733 ± 0.09 | 0.925 ± 0.029 | 3.448 ± 0.061 | 0.0 ± 0.0
Met | 2.517 ± 0.055 | 0.322 ± 0.018 | 1.771 ± 0.041 | 2.655 ± 0.053 | 0.996 ± 0.033 | 2.135 ± 0.047 | 0.391 ± 0.02 | 1.932 ± 0.05 | 2.401 ± 0.046 | 2.749 ± 0.051 | 0.965 ± 0.031 | 1.456 ± 0.039 | 1.107 ± 0.027 | 1.065 ± 0.034 | 1.495 ± 0.042 | 1.583 ± 0.05 | 1.696 ± 0.047 | 1.907 ± 0.046 | 0.226 ± 0.015 | 0.879 ± 0.03 | 0.0 ± 0.0
Asn | 2.9 ± 0.063 | 0.617 ± 0.025 | 1.692 ± 0.047 | 2.356 ± 0.047 | 1.448 ± 0.037 | 3.314 ± 0.069 | 0.747 ± 0.03 | 2.718 ± 0.06 | 1.96 ± 0.051 | 3.522 ± 0.057 | 1.186 ± 0.033 | 1.406 ± 0.045 | 1.972 ± 0.043 | 1.487 ± 0.034 | 2.145 ± 0.053 | 2.187 ± 0.05 | 2.106 ± 0.047 | 2.411 ± 0.054 | 0.472 ± 0.02 | 1.603 ± 0.038 | 0.0 ± 0.0
Pro | 3.199 ± 0.073 | 0.551 ± 0.025 | 2.436 ± 0.056 | 4.249 ± 0.071 | 1.558 ± 0.044 | 3.198 ± 0.064 | 0.641 ± 0.028 | 1.615 ± 0.043 | 1.515 ± 0.041 | 2.924 ± 0.06 | 0.903 ± 0.027 | 1.086 ± 0.033 | 0.921 ± 0.036 | 1.155 ± 0.033 | 1.299 ± 0.037 | 1.981 ± 0.052 | 1.424 ± 0.048 | 3.468 ± 0.062 | 0.402 ± 0.02 | 1.423 ± 0.043 | 0.0 ± 0.0
Gln | 3.071 ± 0.053 | 0.365 ± 0.018 | 1.807 ± 0.044 | 3.302 ± 0.062 | 1.312 ± 0.035 | 2.36 ± 0.05 | 0.465 ± 0.022 | 2.585 ± 0.056 | 2.709 ± 0.061 | 3.201 ± 0.061 | 1.313 ± 0.041 | 1.623 ± 0.045 | 1.161 ± 0.037 | 1.299 ± 0.039 | 1.574 ± 0.041 | 1.742 ± 0.042 | 1.661 ± 0.045 | 2.542 ± 0.05 | 0.387 ± 0.021 | 1.439 ± 0.035 | 0.0 ± 0.0
Arg | 3.538 ± 0.056 | 0.721 ± 0.027 | 2.565 ± 0.055 | 4.612 ± 0.081 | 2.109 ± 0.045 | 3.114 ± 0.054 | 0.875 ± 0.027 | 3.718 ± 0.066 | 3.635 ± 0.063 | 5.165 ± 0.088 | 1.816 ± 0.047 | 2.175 ± 0.045 | 1.826 ± 0.052 | 2.237 ± 0.052 | 3.485 ± 0.068 | 2.551 ± 0.055 | 2.566 ± 0.049 | 3.12 ± 0.066 | 0.511 ± 0.026 | 2.173 ± 0.046 | 0.0 ± 0.0
Ser | 4.924 ± 0.088 | 0.931 ± 0.031 | 2.907 ± 0.059 | 3.913 ± 0.067 | 2.643 ± 0.055 | 5.829 ± 0.105 | 0.984 ± 0.034 | 3.448 ± 0.074 | 2.604 ± 0.058 | 5.626 ± 0.082 | 1.735 ± 0.042 | 1.916 ± 0.049 | 2.075 ± 0.045 | 1.951 ± 0.05 | 3.475 ± 0.054 | 3.869 ± 0.075 | 2.57 ± 0.062 | 4.366 ± 0.068 | 0.735 ± 0.033 | 2.235 ± 0.047 | 0.0 ± 0.0
Thr | 4.954 ± 0.088 | 0.613 ± 0.025 | 2.985 ± 0.052 | 4.176 ± 0.091 | 1.911 ± 0.045 | 5.189 ± 0.106 | 0.78 ± 0.031 | 2.973 ± 0.057 | 2.324 ± 0.058 | 4.513 ± 0.067 | 1.283 ± 0.036 | 1.583 ± 0.044 | 2.074 ± 0.045 | 1.382 ± 0.038 | 2.025 ± 0.042 | 2.624 ± 0.065 | 2.347 ± 0.059 | 4.615 ± 0.079 | 0.559 ± 0.028 | 1.884 ± 0.047 | 0.0 ± 0.0
Val | 4.502 ± 0.07 | 1.228 ± 0.037 | 3.513 ± 0.067 | 4.478 ± 0.066 | 2.935 ± 0.064 | 4.071 ± 0.069 | 1.164 ± 0.032 | 4.645 ± 0.064 | 4.043 ± 0.065 | 7.333 ± 0.086 | 2.06 ± 0.044 | 2.74 ± 0.046 | 2.799 ± 0.049 | 2.237 ± 0.052 | 3.596 ± 0.06 | 4.666 ± 0.075 | 4.148 ± 0.068 | 4.396 ± 0.074 | 0.737 ± 0.024 | 2.677 ± 0.059 | 0.0 ± 0.0
Trp | 0.721 ± 0.03 | 0.168 ± 0.013 | 0.655 ± 0.027 | 0.827 ± 0.031 | 0.454 ± 0.024 | 0.771 ± 0.031 | 0.195 ± 0.015 | 0.717 ± 0.029 | 0.888 ± 0.032 | 1.138 ± 0.036 | 0.396 ± 0.019 | 0.582 ± 0.025 | 0.281 ± 0.017 | 0.414 ± 0.02 | 0.497 ± 0.024 | 0.554 ± 0.024 | 0.532 ± 0.023 | 0.606 ± 0.024 | 0.153 ± 0.014 | 0.449 ± 0.023 | 0.0 ± 0.0
Tyr | 2.692 ± 0.054 | 0.675 ± 0.025 | 2.277 ± 0.053 | 2.756 ± 0.057 | 1.74 ± 0.045 | 3.084 ± 0.065 | 0.891 ± 0.028 | 2.326 ± 0.051 | 1.661 ± 0.042 | 4.043 ± 0.069 | 0.977 ± 0.031 | 1.529 ± 0.039 | 1.518 ± 0.039 | 1.827 ± 0.046 | 2.368 ± 0.047 | 2.336 ± 0.052 | 2.056 ± 0.06 | 2.412 ± 0.053 | 0.435 ± 0.021 | 1.797 ± 0.041 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 3170 proteins (1036976 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
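A protein-level bootstrap of this kind could look roughly like the sketch below. The source states only that the error comes from 100 bootstrap rounds at the protein level; taking the standard deviation of the resampled frequencies as the error is an assumption here, as are the function names, the fixed seed, and the toy sequences.

```python
import random
import statistics


def pair_count(seq, dipeptide):
    """Count overlapping occurrences of a two-letter dipeptide in one sequence."""
    return sum(seq[i:i + 2] == dipeptide for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the per-mille frequency over protein-level resamples.

    Assumption: the reported error is the spread of the bootstrap estimates.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    # Precompute (hits, number of pairs) per protein so resampling is cheap.
    stats = [(pair_count(s, dipeptide), max(len(s) - 1, 0)) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw whole proteins with replacement, keeping the sample size fixed.
        sample = rng.choices(stats, k=len(stats))
        total = sum(n for _, n in sample)
        if total:
            estimates.append(1000.0 * sum(h for h, _ in sample) / total)
    return statistics.stdev(estimates)


# Toy sequences, invented for illustration only.
proteins = ["MAALLK", "MKAAG", "MLLLAA", "MGGKAL"]
err = bootstrap_error(proteins, "AA")
```

Resampling whole proteins rather than individual residue pairs keeps the pairs from one protein together, so the estimate reflects variation between proteins.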

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski