Amino acid dipeptide frequency for [Clostridium] polysaccharolyticum

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies, i.e. how often each pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red, and under-represented ones are marked in blue, in the table.
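The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names `dipeptide_per_mille` and `classify`, the use of one-letter codes, and the rule that any non-standard residue is mapped to X (Xaa) are assumptions made for the example. Frequencies are compared against the uniform expectation of 1/400 = 2.5‰.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Each sequence is scanned separately, so pairs never span two proteins.
        for first, second in zip(seq, seq[1:]):
            # Assumption: anything outside the 20 standard residues becomes X (Xaa).
            first = first if first in STANDARD else "X"
            second = second if second in STANDARD else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2 residues)")
    alphabet = STANDARD + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"
    if freq < EXPECTED:
        return "under"
    return "expected"


# Toy input, not real proteome data.
freqs = dipeptide_per_mille(["MKKLA", "MKE"])
print(len(freqs), classify(freqs["MK"]), classify(freqs["AA"]))
```

On a real proteome the same function would be fed all protein sequences at once; the toy input only shows the mechanics.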

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
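As a small worked example of the unit conversion, the snippet below turns the AlaAla entry of the table (5.413‰) into a fraction and a percentage. The helper name `per_mille_to_fraction` is made up for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = 5.413                             # AlaAla entry from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)   # about 0.005413
percent = fraction * 100                    # about 0.5413 %
print(f"{fraction:.6f} {percent:.4f}%")
```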

For more information, see the sequence space article on Wikipedia.

Dipeptide (first residue followed by second residue): frequency ± error, in ‰. Entries are grouped by first residue.

Ala
AlaAla: 5.413 ± 0.111
AlaCys: 1.045 ± 0.037
AlaAsp: 3.869 ± 0.078
AlaGlu: 4.174 ± 0.078
AlaPhe: 3.016 ± 0.059
AlaGly: 5.267 ± 0.086
AlaHis: 0.959 ± 0.034
AlaIle: 5.06 ± 0.086
AlaLys: 5.33 ± 0.087
AlaLeu: 5.76 ± 0.078
AlaMet: 1.981 ± 0.047
AlaAsn: 2.909 ± 0.065
AlaPro: 1.867 ± 0.058
AlaGln: 2.059 ± 0.056
AlaArg: 2.313 ± 0.05
AlaSer: 4.086 ± 0.066
AlaThr: 2.999 ± 0.064
AlaVal: 5.328 ± 0.074
AlaTrp: 0.544 ± 0.027
AlaTyr: 2.796 ± 0.063
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.866 ± 0.03
CysCys: 0.276 ± 0.019
CysAsp: 0.849 ± 0.033
CysGlu: 0.875 ± 0.031
CysPhe: 0.729 ± 0.029
CysGly: 1.256 ± 0.034
CysHis: 0.264 ± 0.017
CysIle: 1.27 ± 0.041
CysLys: 1.083 ± 0.032
CysLeu: 1.223 ± 0.035
CysMet: 0.47 ± 0.021
CysAsn: 0.772 ± 0.028
CysPro: 0.518 ± 0.023
CysGln: 0.439 ± 0.02
CysArg: 0.509 ± 0.022
CysSer: 0.988 ± 0.027
CysThr: 0.756 ± 0.028
CysVal: 1.0 ± 0.037
CysTrp: 0.113 ± 0.011
CysTyr: 0.682 ± 0.029
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.481 ± 0.069
AspCys: 0.807 ± 0.031
AspAsp: 2.423 ± 0.056
AspGlu: 4.122 ± 0.075
AspPhe: 2.639 ± 0.047
AspGly: 4.027 ± 0.082
AspHis: 0.739 ± 0.035
AspIle: 4.76 ± 0.08
AspLys: 4.259 ± 0.074
AspLeu: 4.468 ± 0.062
AspMet: 1.65 ± 0.041
AspAsn: 2.488 ± 0.054
AspPro: 1.446 ± 0.039
AspGln: 1.282 ± 0.037
AspArg: 1.875 ± 0.042
AspSer: 3.378 ± 0.067
AspThr: 2.766 ± 0.052
AspVal: 3.858 ± 0.061
AspTrp: 0.607 ± 0.027
AspTyr: 2.895 ± 0.056
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.067 ± 0.079
GluCys: 0.904 ± 0.03
GluAsp: 4.106 ± 0.072
GluGlu: 7.557 ± 0.125
GluPhe: 2.923 ± 0.061
GluGly: 4.094 ± 0.069
GluHis: 1.435 ± 0.033
GluIle: 5.855 ± 0.098
GluLys: 7.433 ± 0.107
GluLeu: 6.912 ± 0.094
GluMet: 2.271 ± 0.049
GluAsn: 4.431 ± 0.073
GluPro: 1.914 ± 0.055
GluGln: 3.297 ± 0.07
GluArg: 3.05 ± 0.058
GluSer: 3.609 ± 0.069
GluThr: 3.712 ± 0.069
GluVal: 4.61 ± 0.077
GluTrp: 0.613 ± 0.027
GluTyr: 3.328 ± 0.06
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.84 ± 0.052
PheCys: 0.787 ± 0.03
PheAsp: 2.556 ± 0.06
PheGlu: 2.855 ± 0.056
PhePhe: 1.897 ± 0.048
PheGly: 2.949 ± 0.057
PheHis: 0.943 ± 0.032
PheIle: 3.245 ± 0.072
PheLys: 2.315 ± 0.047
PheLeu: 3.944 ± 0.076
PheMet: 1.18 ± 0.036
PheAsn: 1.756 ± 0.043
PhePro: 1.259 ± 0.032
PheGln: 1.633 ± 0.044
PheArg: 1.571 ± 0.04
PheSer: 2.897 ± 0.051
PheThr: 2.421 ± 0.056
PheVal: 2.985 ± 0.055
PheTrp: 0.417 ± 0.026
PheTyr: 1.967 ± 0.043
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.215 ± 0.079
GlyCys: 1.181 ± 0.042
GlyAsp: 3.25 ± 0.069
GlyGlu: 4.282 ± 0.074
GlyPhe: 3.05 ± 0.057
GlyGly: 4.256 ± 0.075
GlyHis: 1.039 ± 0.033
GlyIle: 6.006 ± 0.084
GlyLys: 6.063 ± 0.082
GlyLeu: 5.182 ± 0.076
GlyMet: 2.123 ± 0.052
GlyAsn: 3.395 ± 0.076
GlyPro: 1.118 ± 0.037
GlyGln: 2.024 ± 0.049
GlyArg: 2.522 ± 0.048
GlySer: 3.901 ± 0.079
GlyThr: 4.178 ± 0.075
GlyVal: 4.667 ± 0.067
GlyTrp: 0.63 ± 0.03
GlyTyr: 3.235 ± 0.059
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.013 ± 0.032
HisCys: 0.283 ± 0.015
HisAsp: 0.884 ± 0.031
HisGlu: 0.967 ± 0.028
HisPhe: 0.86 ± 0.029
HisGly: 1.135 ± 0.04
HisHis: 0.389 ± 0.025
HisIle: 1.415 ± 0.039
HisLys: 1.154 ± 0.039
HisLeu: 1.362 ± 0.037
HisMet: 0.515 ± 0.023
HisAsn: 0.907 ± 0.031
HisPro: 0.716 ± 0.026
HisGln: 0.49 ± 0.021
HisArg: 0.598 ± 0.025
HisSer: 1.043 ± 0.031
HisThr: 0.843 ± 0.027
HisVal: 1.147 ± 0.034
HisTrp: 0.158 ± 0.014
HisTyr: 0.764 ± 0.031
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.69 ± 0.094
IleCys: 1.374 ± 0.037
IleAsp: 4.135 ± 0.071
IleGlu: 5.72 ± 0.087
IlePhe: 2.837 ± 0.058
IleGly: 5.19 ± 0.083
IleHis: 1.472 ± 0.041
IleIle: 5.674 ± 0.099
IleLys: 5.399 ± 0.086
IleLeu: 7.072 ± 0.105
IleMet: 2.092 ± 0.043
IleAsn: 3.488 ± 0.064
IlePro: 2.891 ± 0.053
IleGln: 2.926 ± 0.058
IleArg: 3.262 ± 0.059
IleSer: 5.213 ± 0.074
IleThr: 4.505 ± 0.068
IleVal: 5.293 ± 0.081
IleTrp: 0.638 ± 0.026
IleTyr: 3.002 ± 0.056
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.345 ± 0.084
LysCys: 0.831 ± 0.027
LysAsp: 4.541 ± 0.065
LysGlu: 8.808 ± 0.14
LysPhe: 2.374 ± 0.052
LysGly: 4.626 ± 0.065
LysHis: 1.141 ± 0.031
LysIle: 5.681 ± 0.082
LysLys: 7.689 ± 0.108
LysLeu: 6.285 ± 0.08
LysMet: 2.434 ± 0.041
LysAsn: 4.407 ± 0.069
LysPro: 2.198 ± 0.05
LysGln: 3.063 ± 0.06
LysArg: 3.382 ± 0.06
LysSer: 4.021 ± 0.065
LysThr: 4.173 ± 0.074
LysVal: 5.163 ± 0.077
LysTrp: 0.694 ± 0.027
LysTyr: 3.34 ± 0.058
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.989 ± 0.088
LeuCys: 1.382 ± 0.036
LeuAsp: 4.908 ± 0.07
LeuGlu: 6.873 ± 0.1
LeuPhe: 3.775 ± 0.067
LeuGly: 5.602 ± 0.082
LeuHis: 1.538 ± 0.048
LeuIle: 6.045 ± 0.101
LeuLys: 6.676 ± 0.088
LeuLeu: 7.871 ± 0.121
LeuMet: 2.237 ± 0.041
LeuAsn: 4.349 ± 0.074
LeuPro: 2.889 ± 0.062
LeuGln: 2.794 ± 0.054
LeuArg: 3.207 ± 0.063
LeuSer: 6.279 ± 0.082
LeuThr: 4.493 ± 0.072
LeuVal: 5.465 ± 0.074
LeuTrp: 0.694 ± 0.026
LeuTyr: 3.448 ± 0.056
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.008 ± 0.049
MetCys: 0.375 ± 0.019
MetAsp: 1.771 ± 0.036
MetGlu: 2.621 ± 0.051
MetPhe: 1.086 ± 0.036
MetGly: 1.738 ± 0.044
MetHis: 0.426 ± 0.021
MetIle: 2.047 ± 0.045
MetLys: 2.834 ± 0.051
MetLeu: 2.622 ± 0.063
MetMet: 0.802 ± 0.028
MetAsn: 1.641 ± 0.038
MetPro: 1.01 ± 0.03
MetGln: 1.009 ± 0.031
MetArg: 1.069 ± 0.035
MetSer: 1.695 ± 0.034
MetThr: 1.354 ± 0.034
MetVal: 1.777 ± 0.043
MetTrp: 0.197 ± 0.014
MetTyr: 1.002 ± 0.031
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.262 ± 0.059
AsnCys: 0.766 ± 0.029
AsnAsp: 2.296 ± 0.047
AsnGlu: 3.458 ± 0.064
AsnPhe: 1.86 ± 0.049
AsnGly: 4.061 ± 0.077
AsnHis: 0.871 ± 0.03
AsnIle: 4.033 ± 0.056
AsnLys: 3.413 ± 0.064
AsnLeu: 4.089 ± 0.07
AsnMet: 1.558 ± 0.041
AsnAsn: 2.339 ± 0.06
AsnPro: 1.962 ± 0.043
AsnGln: 2.124 ± 0.053
AsnArg: 2.122 ± 0.048
AsnSer: 2.877 ± 0.061
AsnThr: 2.679 ± 0.059
AsnVal: 3.458 ± 0.068
AsnTrp: 0.514 ± 0.024
AsnTyr: 2.182 ± 0.049
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.118 ± 0.057
ProCys: 0.423 ± 0.021
ProAsp: 2.051 ± 0.048
ProGlu: 2.63 ± 0.052
ProPhe: 1.445 ± 0.038
ProGly: 1.741 ± 0.043
ProHis: 0.451 ± 0.022
ProIle: 2.009 ± 0.047
ProLys: 1.974 ± 0.044
ProLeu: 2.377 ± 0.046
ProMet: 0.801 ± 0.027
ProAsn: 1.263 ± 0.036
ProPro: 0.56 ± 0.028
ProGln: 0.805 ± 0.028
ProArg: 0.845 ± 0.03
ProSer: 2.239 ± 0.079
ProThr: 1.192 ± 0.04
ProVal: 2.828 ± 0.067
ProTrp: 0.268 ± 0.015
ProTyr: 1.462 ± 0.045
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.244 ± 0.046
GlnCys: 0.418 ± 0.02
GlnAsp: 1.646 ± 0.042
GlnGlu: 2.865 ± 0.063
GlnPhe: 1.474 ± 0.04
GlnGly: 1.918 ± 0.046
GlnHis: 0.488 ± 0.018
GlnIle: 2.835 ± 0.053
GlnLys: 3.201 ± 0.056
GlnLeu: 2.984 ± 0.055
GlnMet: 1.168 ± 0.036
GlnAsn: 1.949 ± 0.041
GlnPro: 0.843 ± 0.028
GlnGln: 1.252 ± 0.044
GlnArg: 1.246 ± 0.034
GlnSer: 1.823 ± 0.042
GlnThr: 1.667 ± 0.045
GlnVal: 2.376 ± 0.052
GlnTrp: 0.338 ± 0.021
GlnTyr: 1.638 ± 0.044
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.047 ± 0.045
ArgCys: 0.472 ± 0.023
ArgAsp: 1.811 ± 0.043
ArgGlu: 3.103 ± 0.068
ArgPhe: 1.694 ± 0.041
ArgGly: 2.083 ± 0.045
ArgHis: 0.619 ± 0.022
ArgIle: 3.426 ± 0.063
ArgLys: 3.652 ± 0.066
ArgLeu: 3.142 ± 0.063
ArgMet: 1.27 ± 0.032
ArgAsn: 2.248 ± 0.044
ArgPro: 1.006 ± 0.037
ArgGln: 1.384 ± 0.037
ArgArg: 1.622 ± 0.044
ArgSer: 1.897 ± 0.042
ArgThr: 2.025 ± 0.047
ArgVal: 2.312 ± 0.045
ArgTrp: 0.31 ± 0.017
ArgTyr: 1.753 ± 0.041
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.854 ± 0.072
SerCys: 0.914 ± 0.031
SerAsp: 3.261 ± 0.066
SerGlu: 3.997 ± 0.064
SerPhe: 2.916 ± 0.053
SerGly: 4.679 ± 0.082
SerHis: 0.952 ± 0.031
SerIle: 5.021 ± 0.079
SerLys: 4.53 ± 0.064
SerLeu: 5.391 ± 0.079
SerMet: 1.816 ± 0.046
SerAsn: 2.997 ± 0.066
SerPro: 1.555 ± 0.039
SerGln: 2.108 ± 0.044
SerArg: 2.336 ± 0.051
SerSer: 4.294 ± 0.102
SerThr: 3.042 ± 0.059
SerVal: 4.495 ± 0.085
SerTrp: 0.581 ± 0.026
SerTyr: 2.902 ± 0.063
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.747 ± 0.067
ThrCys: 0.706 ± 0.025
ThrAsp: 2.938 ± 0.068
ThrGlu: 3.478 ± 0.063
ThrPhe: 2.278 ± 0.051
ThrGly: 4.357 ± 0.072
ThrHis: 0.77 ± 0.029
ThrIle: 4.272 ± 0.073
ThrLys: 3.756 ± 0.061
ThrLeu: 4.5 ± 0.072
ThrMet: 1.377 ± 0.036
ThrAsn: 2.248 ± 0.048
ThrPro: 1.759 ± 0.05
ThrGln: 1.448 ± 0.039
ThrArg: 1.753 ± 0.038
ThrSer: 3.43 ± 0.07
ThrThr: 2.61 ± 0.07
ThrVal: 4.219 ± 0.078
ThrTrp: 0.473 ± 0.021
ThrTyr: 2.376 ± 0.064
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.459 ± 0.074
ValCys: 1.114 ± 0.03
ValAsp: 3.522 ± 0.06
ValGlu: 4.843 ± 0.071
ValPhe: 3.132 ± 0.064
ValGly: 3.658 ± 0.075
ValHis: 1.07 ± 0.033
ValIle: 5.353 ± 0.084
ValLys: 5.602 ± 0.085
ValLeu: 6.684 ± 0.097
ValMet: 1.96 ± 0.042
ValAsn: 3.496 ± 0.062
ValPro: 2.373 ± 0.051
ValGln: 2.096 ± 0.044
ValArg: 2.499 ± 0.054
ValSer: 4.864 ± 0.069
ValThr: 4.235 ± 0.086
ValVal: 4.747 ± 0.081
ValTrp: 0.55 ± 0.025
ValTyr: 2.919 ± 0.063
ValXaa: 0.001 ± 0.001

Trp
TrpAla: 0.489 ± 0.021
TrpCys: 0.161 ± 0.013
TrpAsp: 0.516 ± 0.024
TrpGlu: 0.613 ± 0.024
TrpPhe: 0.414 ± 0.02
TrpGly: 0.635 ± 0.03
TrpHis: 0.178 ± 0.013
TrpIle: 0.657 ± 0.026
TrpLys: 0.811 ± 0.026
TrpLeu: 0.705 ± 0.027
TrpMet: 0.263 ± 0.017
TrpAsn: 0.638 ± 0.025
TrpPro: 0.201 ± 0.016
TrpGln: 0.28 ± 0.017
TrpArg: 0.315 ± 0.018
TrpSer: 0.522 ± 0.026
TrpThr: 0.427 ± 0.022
TrpVal: 0.463 ± 0.023
TrpTrp: 0.113 ± 0.011
TrpTyr: 0.427 ± 0.022
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.627 ± 0.059
TyrCys: 0.719 ± 0.031
TyrAsp: 2.639 ± 0.066
TyrGlu: 3.279 ± 0.064
TyrPhe: 2.015 ± 0.054
TyrGly: 3.043 ± 0.061
TyrHis: 0.901 ± 0.026
TyrIle: 3.211 ± 0.055
TyrLys: 3.023 ± 0.05
TyrLeu: 3.94 ± 0.077
TyrMet: 1.112 ± 0.031
TyrAsn: 2.204 ± 0.055
TyrPro: 1.397 ± 0.043
TyrGln: 1.862 ± 0.051
TyrArg: 1.779 ± 0.044
TyrSer: 2.587 ± 0.051
TyrThr: 2.415 ± 0.053
TyrVal: 2.989 ± 0.058
TyrTrp: 0.376 ± 0.02
TyrTyr: 2.23 ± 0.06
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.001 ± 0.001
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.002 ± 0.002

Statistics based on 3018 proteins (1022225 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
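A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is taken as the error. The function names, the fixed seed, and the choice of standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs within one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Resampling is done at the protein level: each resample draws
    len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real proteome data.
mean, err = bootstrap_error(["MKKLA", "MKE", "AKKA", "LLKK"], "KK")
print(f"KK: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps residues from the same protein together, so the error reflects variation between proteins.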

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski