Amino acid dipepetide frequency for Erysipelothrix sp. HDW6B

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.711AlaAla: 4.711 ± 0.104
0.445AlaCys: 0.445 ± 0.025
3.37AlaAsp: 3.37 ± 0.077
3.432AlaGlu: 3.432 ± 0.081
3.255AlaPhe: 3.255 ± 0.07
4.187AlaGly: 4.187 ± 0.091
1.463AlaHis: 1.463 ± 0.06
6.535AlaIle: 6.535 ± 0.114
3.918AlaLys: 3.918 ± 0.079
7.532AlaLeu: 7.532 ± 0.117
2.441AlaMet: 2.441 ± 0.068
3.062AlaAsn: 3.062 ± 0.07
2.045AlaPro: 2.045 ± 0.07
2.713AlaGln: 2.713 ± 0.07
2.558AlaArg: 2.558 ± 0.061
4.669AlaSer: 4.669 ± 0.097
4.452AlaThr: 4.452 ± 0.096
5.106AlaVal: 5.106 ± 0.111
0.555AlaTrp: 0.555 ± 0.035
2.779AlaTyr: 2.779 ± 0.07
0.0AlaXaa: 0.0 ± 0.0
Cys
0.356CysAla: 0.356 ± 0.025
0.044CysCys: 0.044 ± 0.009
0.396CysAsp: 0.396 ± 0.026
0.318CysGlu: 0.318 ± 0.022
0.258CysPhe: 0.258 ± 0.02
0.472CysGly: 0.472 ± 0.024
0.14CysHis: 0.14 ± 0.014
0.44CysIle: 0.44 ± 0.025
0.225CysLys: 0.225 ± 0.021
0.454CysLeu: 0.454 ± 0.03
0.142CysMet: 0.142 ± 0.015
0.222CysAsn: 0.222 ± 0.021
0.202CysPro: 0.202 ± 0.02
0.121CysGln: 0.121 ± 0.013
0.158CysArg: 0.158 ± 0.015
0.31CysSer: 0.31 ± 0.022
0.3CysThr: 0.3 ± 0.022
0.502CysVal: 0.502 ± 0.03
0.032CysTrp: 0.032 ± 0.007
0.214CysTyr: 0.214 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
5.022AspAla: 5.022 ± 0.118
0.261AspCys: 0.261 ± 0.022
3.379AspAsp: 3.379 ± 0.078
4.051AspGlu: 4.051 ± 0.086
2.885AspPhe: 2.885 ± 0.074
3.991AspGly: 3.991 ± 0.104
1.216AspHis: 1.216 ± 0.051
4.901AspIle: 4.901 ± 0.092
3.233AspLys: 3.233 ± 0.073
5.204AspLeu: 5.204 ± 0.104
1.543AspMet: 1.543 ± 0.053
2.568AspAsn: 2.568 ± 0.066
2.241AspPro: 2.241 ± 0.087
1.903AspGln: 1.903 ± 0.058
2.258AspArg: 2.258 ± 0.064
3.23AspSer: 3.23 ± 0.083
3.667AspThr: 3.667 ± 0.08
5.183AspVal: 5.183 ± 0.104
0.387AspTrp: 0.387 ± 0.023
2.788AspTyr: 2.788 ± 0.077
0.0AspXaa: 0.0 ± 0.0
Glu
5.299GluAla: 5.299 ± 0.105
0.274GluCys: 0.274 ± 0.02
3.486GluAsp: 3.486 ± 0.072
3.712GluGlu: 3.712 ± 0.105
2.655GluPhe: 2.655 ± 0.074
3.831GluGly: 3.831 ± 0.086
1.37GluHis: 1.37 ± 0.046
5.504GluIle: 5.504 ± 0.097
3.949GluLys: 3.949 ± 0.09
5.757GluLeu: 5.757 ± 0.112
1.962GluMet: 1.962 ± 0.067
3.631GluAsn: 3.631 ± 0.073
1.908GluPro: 1.908 ± 0.057
2.038GluGln: 2.038 ± 0.069
2.912GluArg: 2.912 ± 0.08
4.164GluSer: 4.164 ± 0.082
4.66GluThr: 4.66 ± 0.103
4.563GluVal: 4.563 ± 0.093
0.529GluTrp: 0.529 ± 0.029
2.426GluTyr: 2.426 ± 0.062
0.0GluXaa: 0.0 ± 0.0
Phe
3.031PheAla: 3.031 ± 0.076
0.2PheCys: 0.2 ± 0.015
3.412PheAsp: 3.412 ± 0.08
3.528PheGlu: 3.528 ± 0.079
1.87PhePhe: 1.87 ± 0.069
3.305PheGly: 3.305 ± 0.073
0.681PheHis: 0.681 ± 0.035
3.56PheIle: 3.56 ± 0.09
2.936PheLys: 2.936 ± 0.071
3.459PheLeu: 3.459 ± 0.091
1.311PheMet: 1.311 ± 0.046
2.34PheAsn: 2.34 ± 0.065
1.308PhePro: 1.308 ± 0.047
1.242PheGln: 1.242 ± 0.042
1.34PheArg: 1.34 ± 0.053
2.797PheSer: 2.797 ± 0.073
2.894PheThr: 2.894 ± 0.069
3.617PheVal: 3.617 ± 0.082
0.369PheTrp: 0.369 ± 0.026
1.762PheTyr: 1.762 ± 0.053
0.0PheXaa: 0.0 ± 0.0
Gly
4.212GlyAla: 4.212 ± 0.098
0.454GlyCys: 0.454 ± 0.028
3.284GlyAsp: 3.284 ± 0.08
3.317GlyGlu: 3.317 ± 0.08
3.297GlyPhe: 3.297 ± 0.074
4.235GlyGly: 4.235 ± 0.198
1.32GlyHis: 1.32 ± 0.042
5.683GlyIle: 5.683 ± 0.102
3.947GlyLys: 3.947 ± 0.079
5.606GlyLeu: 5.606 ± 0.105
1.938GlyMet: 1.938 ± 0.064
3.002GlyAsn: 3.002 ± 0.105
1.424GlyPro: 1.424 ± 0.044
1.947GlyGln: 1.947 ± 0.059
2.292GlyArg: 2.292 ± 0.08
4.404GlySer: 4.404 ± 0.1
4.221GlyThr: 4.221 ± 0.112
5.022GlyVal: 5.022 ± 0.102
0.683GlyTrp: 0.683 ± 0.039
3.364GlyTyr: 3.364 ± 0.084
0.0GlyXaa: 0.0 ± 0.0
His
1.521HisAla: 1.521 ± 0.055
0.128HisCys: 0.128 ± 0.015
1.477HisAsp: 1.477 ± 0.054
1.552HisGlu: 1.552 ± 0.057
0.963HisPhe: 0.963 ± 0.036
1.367HisGly: 1.367 ± 0.044
0.625HisHis: 0.625 ± 0.036
1.463HisIle: 1.463 ± 0.051
1.126HisLys: 1.126 ± 0.043
1.756HisLeu: 1.756 ± 0.064
0.496HisMet: 0.496 ± 0.026
1.007HisAsn: 1.007 ± 0.034
0.87HisPro: 0.87 ± 0.038
0.85HisGln: 0.85 ± 0.035
0.919HisArg: 0.919 ± 0.038
1.096HisSer: 1.096 ± 0.048
1.176HisThr: 1.176 ± 0.052
1.515HisVal: 1.515 ± 0.052
0.17HisTrp: 0.17 ± 0.017
0.9HisTyr: 0.9 ± 0.036
0.0HisXaa: 0.0 ± 0.0
Ile
6.536IleAla: 6.536 ± 0.121
0.479IleCys: 0.479 ± 0.029
5.386IleAsp: 5.386 ± 0.102
5.855IleGlu: 5.855 ± 0.099
3.251IlePhe: 3.251 ± 0.087
5.306IleGly: 5.306 ± 0.091
1.735IleHis: 1.735 ± 0.055
6.346IleIle: 6.346 ± 0.111
4.559IleLys: 4.559 ± 0.089
7.342IleLeu: 7.342 ± 0.147
2.191IleMet: 2.191 ± 0.073
4.063IleAsn: 4.063 ± 0.082
3.043IlePro: 3.043 ± 0.07
3.091IleGln: 3.091 ± 0.082
3.26IleArg: 3.26 ± 0.07
5.144IleSer: 5.144 ± 0.102
4.907IleThr: 4.907 ± 0.093
6.411IleVal: 6.411 ± 0.101
0.482IleTrp: 0.482 ± 0.029
2.775IleTyr: 2.775 ± 0.068
0.0IleXaa: 0.0 ± 0.0
Lys
3.932LysAla: 3.932 ± 0.08
0.228LysCys: 0.228 ± 0.022
3.822LysAsp: 3.822 ± 0.091
4.545LysGlu: 4.545 ± 0.088
1.997LysPhe: 1.997 ± 0.055
3.257LysGly: 3.257 ± 0.075
1.444LysHis: 1.444 ± 0.045
4.363LysIle: 4.363 ± 0.093
3.894LysLys: 3.894 ± 0.087
4.93LysLeu: 4.93 ± 0.1
1.813LysMet: 1.813 ± 0.058
3.2LysAsn: 3.2 ± 0.077
2.288LysPro: 2.288 ± 0.061
2.619LysGln: 2.619 ± 0.065
2.865LysArg: 2.865 ± 0.062
3.531LysSer: 3.531 ± 0.078
3.858LysThr: 3.858 ± 0.077
3.988LysVal: 3.988 ± 0.094
0.478LysTrp: 0.478 ± 0.03
2.339LysTyr: 2.339 ± 0.067
0.0LysXaa: 0.0 ± 0.0
Leu
6.257LeuAla: 6.257 ± 0.119
0.505LeuCys: 0.505 ± 0.03
5.749LeuAsp: 5.749 ± 0.103
6.382LeuGlu: 6.382 ± 0.104
3.836LeuPhe: 3.836 ± 0.087
5.769LeuGly: 5.769 ± 0.11
1.629LeuHis: 1.629 ± 0.056
7.062LeuIle: 7.062 ± 0.137
5.819LeuLys: 5.819 ± 0.087
7.873LeuLeu: 7.873 ± 0.16
2.782LeuMet: 2.782 ± 0.072
5.046LeuAsn: 5.046 ± 0.088
3.163LeuPro: 3.163 ± 0.067
2.99LeuGln: 2.99 ± 0.074
3.493LeuArg: 3.493 ± 0.09
6.618LeuSer: 6.618 ± 0.106
5.484LeuThr: 5.484 ± 0.096
6.542LeuVal: 6.542 ± 0.102
0.669LeuTrp: 0.669 ± 0.032
3.008LeuTyr: 3.008 ± 0.078
0.0LeuXaa: 0.0 ± 0.0
Met
1.86MetAla: 1.86 ± 0.055
0.137MetCys: 0.137 ± 0.014
1.629MetAsp: 1.629 ± 0.056
1.579MetGlu: 1.579 ± 0.052
1.305MetPhe: 1.305 ± 0.049
1.756MetGly: 1.756 ± 0.051
0.591MetHis: 0.591 ± 0.032
2.665MetIle: 2.665 ± 0.07
2.353MetLys: 2.353 ± 0.06
2.517MetLeu: 2.517 ± 0.067
1.176MetMet: 1.176 ± 0.049
1.83MetAsn: 1.83 ± 0.057
1.038MetPro: 1.038 ± 0.039
0.993MetGln: 0.993 ± 0.036
1.224MetArg: 1.224 ± 0.044
1.968MetSer: 1.968 ± 0.058
1.866MetThr: 1.866 ± 0.057
1.726MetVal: 1.726 ± 0.062
0.19MetTrp: 0.19 ± 0.016
0.943MetTyr: 0.943 ± 0.039
0.0MetXaa: 0.0 ± 0.0
Asn
3.625AsnAla: 3.625 ± 0.077
0.226AsnCys: 0.226 ± 0.017
3.013AsnAsp: 3.013 ± 0.074
3.261AsnGlu: 3.261 ± 0.075
2.012AsnPhe: 2.012 ± 0.06
3.436AsnGly: 3.436 ± 0.093
1.266AsnHis: 1.266 ± 0.039
3.843AsnIle: 3.843 ± 0.077
2.863AsnLys: 2.863 ± 0.073
4.328AsnLeu: 4.328 ± 0.09
1.225AsnMet: 1.225 ± 0.046
2.565AsnAsn: 2.565 ± 0.089
2.273AsnPro: 2.273 ± 0.068
2.212AsnGln: 2.212 ± 0.062
2.169AsnArg: 2.169 ± 0.06
2.669AsnSer: 2.669 ± 0.075
3.18AsnThr: 3.18 ± 0.075
3.866AsnVal: 3.866 ± 0.095
0.449AsnTrp: 0.449 ± 0.025
2.23AsnTyr: 2.23 ± 0.069
0.0AsnXaa: 0.0 ± 0.0
Pro
1.852ProAla: 1.852 ± 0.063
0.134ProCys: 0.134 ± 0.014
1.711ProAsp: 1.711 ± 0.047
2.752ProGlu: 2.752 ± 0.084
1.605ProPhe: 1.605 ± 0.059
1.86ProGly: 1.86 ± 0.066
0.726ProHis: 0.726 ± 0.032
2.866ProIle: 2.866 ± 0.092
1.949ProLys: 1.949 ± 0.053
3.114ProLeu: 3.114 ± 0.074
0.94ProMet: 0.94 ± 0.037
1.834ProAsn: 1.834 ± 0.054
0.538ProPro: 0.538 ± 0.032
1.349ProGln: 1.349 ± 0.05
1.026ProArg: 1.026 ± 0.039
2.089ProSer: 2.089 ± 0.05
2.224ProThr: 2.224 ± 0.072
2.61ProVal: 2.61 ± 0.079
0.292ProTrp: 0.292 ± 0.021
1.447ProTyr: 1.447 ± 0.047
0.0ProXaa: 0.0 ± 0.0
Gln
2.625GlnAla: 2.625 ± 0.06
0.155GlnCys: 0.155 ± 0.017
2.0GlnAsp: 2.0 ± 0.057
2.517GlnGlu: 2.517 ± 0.065
1.661GlnPhe: 1.661 ± 0.057
2.134GlnGly: 2.134 ± 0.059
0.764GlnHis: 0.764 ± 0.035
2.634GlnIle: 2.634 ± 0.065
2.279GlnLys: 2.279 ± 0.058
3.159GlnLeu: 3.159 ± 0.082
1.076GlnMet: 1.076 ± 0.041
1.906GlnAsn: 1.906 ± 0.056
1.078GlnPro: 1.078 ± 0.04
1.573GlnGln: 1.573 ± 0.058
1.723GlnArg: 1.723 ± 0.062
2.414GlnSer: 2.414 ± 0.06
2.396GlnThr: 2.396 ± 0.072
2.182GlnVal: 2.182 ± 0.054
0.389GlnTrp: 0.389 ± 0.024
1.486GlnTyr: 1.486 ± 0.051
0.0GlnXaa: 0.0 ± 0.0
Arg
2.283ArgAla: 2.283 ± 0.068
0.237ArgCys: 0.237 ± 0.018
2.57ArgAsp: 2.57 ± 0.068
2.74ArgGlu: 2.74 ± 0.071
1.979ArgPhe: 1.979 ± 0.053
2.117ArgGly: 2.117 ± 0.062
0.87ArgHis: 0.87 ± 0.035
3.54ArgIle: 3.54 ± 0.068
2.625ArgLys: 2.625 ± 0.062
3.573ArgLeu: 3.573 ± 0.083
1.225ArgMet: 1.225 ± 0.047
2.172ArgAsn: 2.172 ± 0.061
1.142ArgPro: 1.142 ± 0.043
1.377ArgGln: 1.377 ± 0.047
1.802ArgArg: 1.802 ± 0.062
2.151ArgSer: 2.151 ± 0.055
2.175ArgThr: 2.175 ± 0.063
2.907ArgVal: 2.907 ± 0.064
0.306ArgTrp: 0.306 ± 0.022
1.934ArgTyr: 1.934 ± 0.06
0.0ArgXaa: 0.0 ± 0.0
Ser
3.73SerAla: 3.73 ± 0.081
0.341SerCys: 0.341 ± 0.023
3.548SerAsp: 3.548 ± 0.068
3.638SerGlu: 3.638 ± 0.09
3.255SerPhe: 3.255 ± 0.075
4.532SerGly: 4.532 ± 0.134
1.284SerHis: 1.284 ± 0.05
5.585SerIle: 5.585 ± 0.099
3.897SerLys: 3.897 ± 0.087
6.042SerLeu: 6.042 ± 0.102
1.915SerMet: 1.915 ± 0.058
3.364SerAsn: 3.364 ± 0.088
1.7SerPro: 1.7 ± 0.054
2.291SerGln: 2.291 ± 0.062
2.451SerArg: 2.451 ± 0.062
3.804SerSer: 3.804 ± 0.08
3.621SerThr: 3.621 ± 0.086
4.764SerVal: 4.764 ± 0.088
0.562SerTrp: 0.562 ± 0.031
2.665SerTyr: 2.665 ± 0.066
0.0SerXaa: 0.0 ± 0.0
Thr
3.625ThrAla: 3.625 ± 0.085
0.244ThrCys: 0.244 ± 0.023
3.388ThrAsp: 3.388 ± 0.087
3.453ThrGlu: 3.453 ± 0.08
3.151ThrPhe: 3.151 ± 0.083
4.063ThrGly: 4.063 ± 0.114
1.48ThrHis: 1.48 ± 0.054
5.499ThrIle: 5.499 ± 0.098
3.328ThrLys: 3.328 ± 0.088
6.711ThrLeu: 6.711 ± 0.111
1.783ThrMet: 1.783 ± 0.056
2.916ThrAsn: 2.916 ± 0.08
2.488ThrPro: 2.488 ± 0.068
2.58ThrGln: 2.58 ± 0.064
2.291ThrArg: 2.291 ± 0.064
3.721ThrSer: 3.721 ± 0.073
3.995ThrThr: 3.995 ± 0.099
5.338ThrVal: 5.338 ± 0.14
0.535ThrTrp: 0.535 ± 0.029
2.722ThrTyr: 2.722 ± 0.094
0.0ThrXaa: 0.0 ± 0.0
Val
5.51ValAla: 5.51 ± 0.099
0.515ValCys: 0.515 ± 0.028
4.875ValAsp: 4.875 ± 0.102
4.714ValGlu: 4.714 ± 0.109
3.374ValPhe: 3.374 ± 0.073
4.863ValGly: 4.863 ± 0.109
1.313ValHis: 1.313 ± 0.047
6.077ValIle: 6.077 ± 0.114
4.072ValLys: 4.072 ± 0.093
6.934ValLeu: 6.934 ± 0.12
2.182ValMet: 2.182 ± 0.058
3.513ValAsn: 3.513 ± 0.081
2.46ValPro: 2.46 ± 0.061
2.227ValGln: 2.227 ± 0.06
2.729ValArg: 2.729 ± 0.072
5.401ValSer: 5.401 ± 0.1
5.026ValThr: 5.026 ± 0.107
6.173ValVal: 6.173 ± 0.112
0.511ValTrp: 0.511 ± 0.028
2.818ValTyr: 2.818 ± 0.063
0.0ValXaa: 0.0 ± 0.0
Trp
0.386TrpAla: 0.386 ± 0.024
0.059TrpCys: 0.059 ± 0.009
0.52TrpAsp: 0.52 ± 0.03
0.481TrpGlu: 0.481 ± 0.03
0.475TrpPhe: 0.475 ± 0.03
0.508TrpGly: 0.508 ± 0.028
0.167TrpHis: 0.167 ± 0.017
0.744TrpIle: 0.744 ± 0.035
0.402TrpLys: 0.402 ± 0.025
0.773TrpLeu: 0.773 ± 0.042
0.249TrpMet: 0.249 ± 0.021
0.553TrpAsn: 0.553 ± 0.029
0.158TrpPro: 0.158 ± 0.016
0.333TrpGln: 0.333 ± 0.027
0.295TrpArg: 0.295 ± 0.022
0.455TrpSer: 0.455 ± 0.024
0.479TrpThr: 0.479 ± 0.03
0.527TrpVal: 0.527 ± 0.032
0.102TrpTrp: 0.102 ± 0.012
0.356TrpTyr: 0.356 ± 0.028
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.856TyrAla: 2.856 ± 0.067
0.238TyrCys: 0.238 ± 0.019
2.895TyrAsp: 2.895 ± 0.099
2.802TyrGlu: 2.802 ± 0.066
1.878TyrPhe: 1.878 ± 0.053
2.665TyrGly: 2.665 ± 0.065
0.891TyrHis: 0.891 ± 0.038
2.851TyrIle: 2.851 ± 0.084
2.016TyrLys: 2.016 ± 0.065
3.614TyrLeu: 3.614 ± 0.083
0.951TyrMet: 0.951 ± 0.039
1.919TyrAsn: 1.919 ± 0.058
1.552TyrPro: 1.552 ± 0.05
1.701TyrGln: 1.701 ± 0.051
1.961TyrArg: 1.961 ± 0.054
2.354TyrSer: 2.354 ± 0.067
2.729TyrThr: 2.729 ± 0.093
2.749TyrVal: 2.749 ± 0.068
0.333TyrTrp: 0.333 ± 0.024
1.694TyrTyr: 1.694 ± 0.051
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2061 proteins (663539 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski