Amino acid dipepetide frequency for Caproiciproducens galactitolivorans

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.185AlaAla: 9.185 ± 0.147
1.297AlaCys: 1.297 ± 0.045
4.576AlaAsp: 4.576 ± 0.066
5.559AlaGlu: 5.559 ± 0.113
3.396AlaPhe: 3.396 ± 0.072
6.422AlaGly: 6.422 ± 0.104
1.178AlaHis: 1.178 ± 0.041
5.471AlaIle: 5.471 ± 0.106
5.455AlaLys: 5.455 ± 0.107
7.928AlaLeu: 7.928 ± 0.115
2.423AlaMet: 2.423 ± 0.054
2.949AlaAsn: 2.949 ± 0.065
2.475AlaPro: 2.475 ± 0.071
3.197AlaGln: 3.197 ± 0.077
3.357AlaArg: 3.357 ± 0.076
4.682AlaSer: 4.682 ± 0.096
3.488AlaThr: 3.488 ± 0.077
7.225AlaVal: 7.225 ± 0.114
0.599AlaTrp: 0.599 ± 0.029
2.564AlaTyr: 2.564 ± 0.058
0.0AlaXaa: 0.0 ± 0.0
Cys
1.355CysAla: 1.355 ± 0.05
0.372CysCys: 0.372 ± 0.024
0.931CysAsp: 0.931 ± 0.036
0.951CysGlu: 0.951 ± 0.034
0.638CysPhe: 0.638 ± 0.034
1.736CysGly: 1.736 ± 0.06
0.308CysHis: 0.308 ± 0.021
1.054CysIle: 1.054 ± 0.035
0.865CysLys: 0.865 ± 0.037
1.143CysLeu: 1.143 ± 0.04
0.417CysMet: 0.417 ± 0.028
0.607CysAsn: 0.607 ± 0.027
0.731CysPro: 0.731 ± 0.034
0.331CysGln: 0.331 ± 0.022
0.833CysArg: 0.833 ± 0.039
1.058CysSer: 1.058 ± 0.041
0.818CysThr: 0.818 ± 0.034
1.149CysVal: 1.149 ± 0.039
0.113CysTrp: 0.113 ± 0.013
0.529CysTyr: 0.529 ± 0.027
0.0CysXaa: 0.0 ± 0.0
Asp
4.228AspAla: 4.228 ± 0.08
0.933AspCys: 0.933 ± 0.037
2.682AspAsp: 2.682 ± 0.072
3.919AspGlu: 3.919 ± 0.091
2.647AspPhe: 2.647 ± 0.061
4.31AspGly: 4.31 ± 0.09
0.85AspHis: 0.85 ± 0.034
4.307AspIle: 4.307 ± 0.081
3.268AspLys: 3.268 ± 0.073
4.472AspLeu: 4.472 ± 0.079
1.586AspMet: 1.586 ± 0.045
2.022AspAsn: 2.022 ± 0.053
2.001AspPro: 2.001 ± 0.049
1.241AspGln: 1.241 ± 0.04
2.54AspArg: 2.54 ± 0.064
3.232AspSer: 3.232 ± 0.07
3.134AspThr: 3.134 ± 0.069
3.668AspVal: 3.668 ± 0.068
0.506AspTrp: 0.506 ± 0.029
2.294AspTyr: 2.294 ± 0.059
0.0AspXaa: 0.0 ± 0.0
Glu
4.949GluAla: 4.949 ± 0.09
0.828GluCys: 0.828 ± 0.031
3.21GluAsp: 3.21 ± 0.073
4.994GluGlu: 4.994 ± 0.101
2.239GluPhe: 2.239 ± 0.049
3.683GluGly: 3.683 ± 0.078
1.12GluHis: 1.12 ± 0.043
5.155GluIle: 5.155 ± 0.094
5.757GluLys: 5.757 ± 0.098
6.195GluLeu: 6.195 ± 0.099
2.041GluMet: 2.041 ± 0.054
3.937GluAsn: 3.937 ± 0.073
2.027GluPro: 2.027 ± 0.057
2.775GluGln: 2.775 ± 0.07
3.319GluArg: 3.319 ± 0.081
3.217GluSer: 3.217 ± 0.072
3.255GluThr: 3.255 ± 0.073
3.838GluVal: 3.838 ± 0.083
0.533GluTrp: 0.533 ± 0.031
2.422GluTyr: 2.422 ± 0.057
0.0GluXaa: 0.0 ± 0.0
Phe
3.254PheAla: 3.254 ± 0.075
0.788PheCys: 0.788 ± 0.032
2.33PheAsp: 2.33 ± 0.059
2.397PheGlu: 2.397 ± 0.059
1.909PhePhe: 1.909 ± 0.061
3.284PheGly: 3.284 ± 0.079
0.824PheHis: 0.824 ± 0.033
2.779PheIle: 2.779 ± 0.063
2.115PheLys: 2.115 ± 0.056
3.968PheLeu: 3.968 ± 0.098
1.055PheMet: 1.055 ± 0.038
1.622PheAsn: 1.622 ± 0.047
1.519PhePro: 1.519 ± 0.047
1.355PheGln: 1.355 ± 0.042
1.726PheArg: 1.726 ± 0.044
3.086PheSer: 3.086 ± 0.069
2.569PheThr: 2.569 ± 0.065
2.732PheVal: 2.732 ± 0.061
0.316PheTrp: 0.316 ± 0.025
1.659PheTyr: 1.659 ± 0.053
0.0PheXaa: 0.0 ± 0.0
Gly
5.413GlyAla: 5.413 ± 0.101
1.345GlyCys: 1.345 ± 0.05
3.596GlyAsp: 3.596 ± 0.071
4.412GlyGlu: 4.412 ± 0.082
3.151GlyPhe: 3.151 ± 0.067
5.342GlyGly: 5.342 ± 0.096
1.226GlyHis: 1.226 ± 0.043
5.928GlyIle: 5.928 ± 0.098
5.466GlyLys: 5.466 ± 0.101
6.158GlyLeu: 6.158 ± 0.096
2.352GlyMet: 2.352 ± 0.07
3.186GlyAsn: 3.186 ± 0.075
1.468GlyPro: 1.468 ± 0.046
2.086GlyGln: 2.086 ± 0.054
3.28GlyArg: 3.28 ± 0.075
4.309GlySer: 4.309 ± 0.083
4.483GlyThr: 4.483 ± 0.086
5.3GlyVal: 5.3 ± 0.078
0.676GlyTrp: 0.676 ± 0.027
2.908GlyTyr: 2.908 ± 0.061
0.0GlyXaa: 0.0 ± 0.0
His
1.21HisAla: 1.21 ± 0.047
0.319HisCys: 0.319 ± 0.025
0.865HisAsp: 0.865 ± 0.033
0.947HisGlu: 0.947 ± 0.036
0.854HisPhe: 0.854 ± 0.033
1.331HisGly: 1.331 ± 0.045
0.349HisHis: 0.349 ± 0.021
1.338HisIle: 1.338 ± 0.044
0.913HisLys: 0.913 ± 0.039
1.499HisLeu: 1.499 ± 0.046
0.492HisMet: 0.492 ± 0.028
0.743HisAsn: 0.743 ± 0.03
0.978HisPro: 0.978 ± 0.039
0.485HisGln: 0.485 ± 0.022
0.809HisArg: 0.809 ± 0.034
1.015HisSer: 1.015 ± 0.043
0.979HisThr: 0.979 ± 0.038
0.998HisVal: 0.998 ± 0.037
0.138HisTrp: 0.138 ± 0.013
0.647HisTyr: 0.647 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
6.402IleAla: 6.402 ± 0.108
1.254IleCys: 1.254 ± 0.043
4.17IleAsp: 4.17 ± 0.084
4.285IleGlu: 4.285 ± 0.082
2.677IlePhe: 2.677 ± 0.072
5.042IleGly: 5.042 ± 0.089
1.297IleHis: 1.297 ± 0.04
4.776IleIle: 4.776 ± 0.096
4.12IleLys: 4.12 ± 0.075
6.763IleLeu: 6.763 ± 0.112
1.776IleMet: 1.776 ± 0.052
2.895IleAsn: 2.895 ± 0.072
3.283IlePro: 3.283 ± 0.063
2.366IleGln: 2.366 ± 0.059
3.749IleArg: 3.749 ± 0.074
5.13IleSer: 5.13 ± 0.102
4.274IleThr: 4.274 ± 0.078
4.775IleVal: 4.775 ± 0.08
0.497IleTrp: 0.497 ± 0.028
2.334IleTyr: 2.334 ± 0.054
0.0IleXaa: 0.0 ± 0.0
Lys
5.648LysAla: 5.648 ± 0.094
0.764LysCys: 0.764 ± 0.032
3.472LysAsp: 3.472 ± 0.074
5.155LysGlu: 5.155 ± 0.103
1.832LysPhe: 1.832 ± 0.044
3.959LysGly: 3.959 ± 0.078
1.007LysHis: 1.007 ± 0.034
5.058LysIle: 5.058 ± 0.087
5.471LysLys: 5.471 ± 0.082
5.639LysLeu: 5.639 ± 0.085
2.16LysMet: 2.16 ± 0.046
3.595LysAsn: 3.595 ± 0.065
2.508LysPro: 2.508 ± 0.055
2.409LysGln: 2.409 ± 0.061
3.215LysArg: 3.215 ± 0.07
3.759LysSer: 3.759 ± 0.076
4.046LysThr: 4.046 ± 0.083
4.141LysVal: 4.141 ± 0.084
0.555LysTrp: 0.555 ± 0.026
2.434LysTyr: 2.434 ± 0.061
0.0LysXaa: 0.0 ± 0.0
Leu
7.306LeuAla: 7.306 ± 0.114
1.784LeuCys: 1.784 ± 0.054
4.792LeuAsp: 4.792 ± 0.079
5.415LeuGlu: 5.415 ± 0.096
3.957LeuPhe: 3.957 ± 0.093
5.947LeuGly: 5.947 ± 0.107
1.6LeuHis: 1.6 ± 0.048
5.971LeuIle: 5.971 ± 0.1
6.402LeuLys: 6.402 ± 0.097
8.818LeuLeu: 8.818 ± 0.147
2.391LeuMet: 2.391 ± 0.051
4.197LeuAsn: 4.197 ± 0.086
4.185LeuPro: 4.185 ± 0.086
2.886LeuGln: 2.886 ± 0.066
4.278LeuArg: 4.278 ± 0.086
6.832LeuSer: 6.832 ± 0.112
5.41LeuThr: 5.41 ± 0.081
5.395LeuVal: 5.395 ± 0.093
0.651LeuTrp: 0.651 ± 0.03
3.05LeuTyr: 3.05 ± 0.072
0.0LeuXaa: 0.0 ± 0.0
Met
2.394MetAla: 2.394 ± 0.059
0.326MetCys: 0.326 ± 0.021
1.658MetAsp: 1.658 ± 0.05
2.039MetGlu: 2.039 ± 0.051
0.942MetPhe: 0.942 ± 0.041
1.941MetGly: 1.941 ± 0.057
0.453MetHis: 0.453 ± 0.024
1.791MetIle: 1.791 ± 0.049
2.314MetLys: 2.314 ± 0.051
2.835MetLeu: 2.835 ± 0.059
0.821MetMet: 0.821 ± 0.036
1.444MetAsn: 1.444 ± 0.052
1.178MetPro: 1.178 ± 0.039
1.052MetGln: 1.052 ± 0.037
1.301MetArg: 1.301 ± 0.038
1.625MetSer: 1.625 ± 0.049
1.481MetThr: 1.481 ± 0.042
1.847MetVal: 1.847 ± 0.047
0.206MetTrp: 0.206 ± 0.02
0.675MetTyr: 0.675 ± 0.034
0.0MetXaa: 0.0 ± 0.0
Asn
3.671AsnAla: 3.671 ± 0.073
0.74AsnCys: 0.74 ± 0.035
2.172AsnAsp: 2.172 ± 0.061
2.674AsnGlu: 2.674 ± 0.061
1.641AsnPhe: 1.641 ± 0.052
3.826AsnGly: 3.826 ± 0.081
0.733AsnHis: 0.733 ± 0.035
3.381AsnIle: 3.381 ± 0.073
2.558AsnLys: 2.558 ± 0.057
3.922AsnLeu: 3.922 ± 0.088
1.221AsnMet: 1.221 ± 0.041
1.873AsnAsn: 1.873 ± 0.06
2.338AsnPro: 2.338 ± 0.057
1.402AsnGln: 1.402 ± 0.041
2.191AsnArg: 2.191 ± 0.053
2.689AsnSer: 2.689 ± 0.067
2.414AsnThr: 2.414 ± 0.065
3.007AsnVal: 3.007 ± 0.067
0.396AsnTrp: 0.396 ± 0.025
1.629AsnTyr: 1.629 ± 0.055
0.0AsnXaa: 0.0 ± 0.0
Pro
3.157ProAla: 3.157 ± 0.068
0.566ProCys: 0.566 ± 0.026
2.616ProAsp: 2.616 ± 0.059
3.217ProGlu: 3.217 ± 0.065
1.764ProPhe: 1.764 ± 0.052
2.47ProGly: 2.47 ± 0.07
0.662ProHis: 0.662 ± 0.033
2.305ProIle: 2.305 ± 0.055
2.224ProLys: 2.224 ± 0.058
3.177ProLeu: 3.177 ± 0.068
0.914ProMet: 0.914 ± 0.031
1.573ProAsn: 1.573 ± 0.046
1.184ProPro: 1.184 ± 0.047
1.396ProGln: 1.396 ± 0.046
1.29ProArg: 1.29 ± 0.041
2.163ProSer: 2.163 ± 0.054
1.836ProThr: 1.836 ± 0.053
3.46ProVal: 3.46 ± 0.063
0.337ProTrp: 0.337 ± 0.023
1.444ProTyr: 1.444 ± 0.043
0.0ProXaa: 0.0 ± 0.0
Gln
2.752GlnAla: 2.752 ± 0.067
0.349GlnCys: 0.349 ± 0.021
1.505GlnAsp: 1.505 ± 0.047
2.184GlnGlu: 2.184 ± 0.064
1.248GlnPhe: 1.248 ± 0.04
1.989GlnGly: 1.989 ± 0.044
0.535GlnHis: 0.535 ± 0.025
2.676GlnIle: 2.676 ± 0.065
2.847GlnLys: 2.847 ± 0.07
2.823GlnLeu: 2.823 ± 0.056
1.058GlnMet: 1.058 ± 0.043
1.916GlnAsn: 1.916 ± 0.05
1.113GlnPro: 1.113 ± 0.045
1.326GlnGln: 1.326 ± 0.051
1.578GlnArg: 1.578 ± 0.048
1.942GlnSer: 1.942 ± 0.049
1.791GlnThr: 1.791 ± 0.049
1.944GlnVal: 1.944 ± 0.052
0.312GlnTrp: 0.312 ± 0.022
1.318GlnTyr: 1.318 ± 0.037
0.0GlnXaa: 0.0 ± 0.0
Arg
3.345ArgAla: 3.345 ± 0.075
0.686ArgCys: 0.686 ± 0.03
2.316ArgAsp: 2.316 ± 0.055
3.506ArgGlu: 3.506 ± 0.073
2.144ArgPhe: 2.144 ± 0.053
2.758ArgGly: 2.758 ± 0.052
0.794ArgHis: 0.794 ± 0.039
3.648ArgIle: 3.648 ± 0.072
3.405ArgLys: 3.405 ± 0.071
4.492ArgLeu: 4.492 ± 0.088
1.527ArgMet: 1.527 ± 0.047
2.103ArgAsn: 2.103 ± 0.065
1.531ArgPro: 1.531 ± 0.048
1.621ArgGln: 1.621 ± 0.049
2.673ArgArg: 2.673 ± 0.068
2.584ArgSer: 2.584 ± 0.062
2.273ArgThr: 2.273 ± 0.062
2.931ArgVal: 2.931 ± 0.071
0.353ArgTrp: 0.353 ± 0.022
1.816ArgTyr: 1.816 ± 0.049
0.0ArgXaa: 0.0 ± 0.0
Ser
5.352SerAla: 5.352 ± 0.107
0.878SerCys: 0.878 ± 0.037
3.462SerAsp: 3.462 ± 0.074
3.852SerGlu: 3.852 ± 0.068
2.802SerPhe: 2.802 ± 0.065
5.542SerGly: 5.542 ± 0.101
0.988SerHis: 0.988 ± 0.034
4.371SerIle: 4.371 ± 0.095
3.594SerLys: 3.594 ± 0.077
5.607SerLeu: 5.607 ± 0.091
1.747SerMet: 1.747 ± 0.055
2.503SerAsn: 2.503 ± 0.069
2.239SerPro: 2.239 ± 0.06
1.956SerGln: 1.956 ± 0.068
2.837SerArg: 2.837 ± 0.063
4.212SerSer: 4.212 ± 0.105
3.114SerThr: 3.114 ± 0.068
4.766SerVal: 4.766 ± 0.093
0.454SerTrp: 0.454 ± 0.025
2.152SerTyr: 2.152 ± 0.06
0.0SerXaa: 0.0 ± 0.0
Thr
5.268ThrAla: 5.268 ± 0.102
0.728ThrCys: 0.728 ± 0.032
3.157ThrAsp: 3.157 ± 0.073
3.291ThrGlu: 3.291 ± 0.071
2.209ThrPhe: 2.209 ± 0.054
4.798ThrGly: 4.798 ± 0.099
0.877ThrHis: 0.877 ± 0.04
3.78ThrIle: 3.78 ± 0.074
2.976ThrLys: 2.976 ± 0.068
5.045ThrLeu: 5.045 ± 0.093
1.351ThrMet: 1.351 ± 0.042
2.132ThrAsn: 2.132 ± 0.062
2.515ThrPro: 2.515 ± 0.057
1.657ThrGln: 1.657 ± 0.051
2.098ThrArg: 2.098 ± 0.058
3.09ThrSer: 3.09 ± 0.077
2.831ThrThr: 2.831 ± 0.072
4.932ThrVal: 4.932 ± 0.092
0.421ThrTrp: 0.421 ± 0.028
1.714ThrTyr: 1.714 ± 0.05
0.0ThrXaa: 0.0 ± 0.0
Val
5.232ValAla: 5.232 ± 0.084
1.232ValCys: 1.232 ± 0.041
3.716ValAsp: 3.716 ± 0.069
4.095ValGlu: 4.095 ± 0.072
3.106ValPhe: 3.106 ± 0.073
4.465ValGly: 4.465 ± 0.087
1.165ValHis: 1.165 ± 0.039
5.011ValIle: 5.011 ± 0.09
4.497ValLys: 4.497 ± 0.077
6.814ValLeu: 6.814 ± 0.096
1.907ValMet: 1.907 ± 0.059
3.025ValAsn: 3.025 ± 0.06
2.866ValPro: 2.866 ± 0.062
2.171ValGln: 2.171 ± 0.059
3.13ValArg: 3.13 ± 0.066
5.013ValSer: 5.013 ± 0.092
4.285ValThr: 4.285 ± 0.087
4.879ValVal: 4.879 ± 0.085
0.573ValTrp: 0.573 ± 0.031
2.406ValTyr: 2.406 ± 0.052
0.0ValXaa: 0.0 ± 0.0
Trp
0.569TrpAla: 0.569 ± 0.028
0.149TrpCys: 0.149 ± 0.015
0.482TrpAsp: 0.482 ± 0.026
0.505TrpGlu: 0.505 ± 0.03
0.383TrpPhe: 0.383 ± 0.024
0.573TrpGly: 0.573 ± 0.031
0.181TrpHis: 0.181 ± 0.015
0.559TrpIle: 0.559 ± 0.03
0.551TrpLys: 0.551 ± 0.029
0.814TrpLeu: 0.814 ± 0.035
0.251TrpMet: 0.251 ± 0.018
0.482TrpAsn: 0.482 ± 0.028
0.186TrpPro: 0.186 ± 0.018
0.298TrpGln: 0.298 ± 0.022
0.361TrpArg: 0.361 ± 0.024
0.401TrpSer: 0.401 ± 0.026
0.377TrpThr: 0.377 ± 0.026
0.531TrpVal: 0.531 ± 0.032
0.11TrpTrp: 0.11 ± 0.012
0.306TrpTyr: 0.306 ± 0.023
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.738TyrAla: 2.738 ± 0.055
0.603TyrCys: 0.603 ± 0.03
2.132TyrAsp: 2.132 ± 0.054
2.095TyrGlu: 2.095 ± 0.048
1.75TyrPhe: 1.75 ± 0.054
2.666TyrGly: 2.666 ± 0.063
0.76TyrHis: 0.76 ± 0.033
2.506TyrIle: 2.506 ± 0.065
2.027TyrLys: 2.027 ± 0.056
3.157TyrLeu: 3.157 ± 0.077
0.833TyrMet: 0.833 ± 0.033
1.674TyrAsn: 1.674 ± 0.059
1.43TyrPro: 1.43 ± 0.048
1.198TyrGln: 1.198 ± 0.044
1.941TyrArg: 1.941 ± 0.053
2.389TyrSer: 2.389 ± 0.067
2.01TyrThr: 2.01 ± 0.061
2.071TyrVal: 2.071 ± 0.054
0.322TyrTrp: 0.322 ± 0.02
1.529TyrTyr: 1.529 ± 0.045
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2438 proteins (752679 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski