Amino acid dipeptide frequency for Erysipelothrix sp. HDW6A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
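A minimal sketch of how such frequencies could be computed is given here. It assumes that overlapping adjacent pairs are counted within each protein (never across protein boundaries), that counts are pooled over the whole proteome, and that any non-standard letter is mapped to X (Xaa). The function names and the toy sequences are hypothetical, and the code uses one-letter codes rather than the three-letter codes of the table; the exact procedure used by Proteome-pI may differ.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids


def dipeptide_counts(sequence):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    # Assumption: every non-standard or unknown letter becomes 'X' (Xaa).
    seq = [aa if aa in STANDARD else "X" for aa in sequence.upper()]
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def dipeptide_per_mille(sequences):
    """Pool pair counts over all proteins and express them in per mille."""
    total = Counter()
    for s in sequences:
        total.update(dipeptide_counts(s))
    n = sum(total.values())
    if n == 0:
        return {}
    return {pair: 1000.0 * c / n for pair, c in total.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKLLA", "AKLL"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

By construction the per mille values over all observed pairs sum to 1000.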

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
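The arithmetic behind the uniform expectation can be sketched as follows. The three observed values are copied from the table; the `classify` helper is hypothetical, and it assumes that a value is simply compared against the uniform baseline of 1/400, which may not be exactly how the colouring on the page is decided.

```python
# Uniform expectation: 400 equally likely dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Observed frequencies in per mille, taken from the table.
examples = {"LeuLeu": 8.591, "AlaAsp": 2.875, "TrpTrp": 0.071}
labels = {name: classify(value) for name, value in examples.items()}
ratios = {name: value / EXPECTED_PER_MILLE for name, value in examples.items()}

for name in examples:
    print(name, labels[name], round(ratios[name], 3))
```

A ratio above 1 means the pair occurs more often than the uniform model predicts, and a ratio below 1 means it occurs less often.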

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
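As a small worked example of this unit conversion, the sketch below turns the AlaAla value from the table into a fraction and a percentage, and then into a rough pair count. The count relies on an assumption that is not stated on the page: that pairs are counted within proteins only, so that the total number of pairs equals the number of amino acids minus the number of proteins (both totals are taken from the statistics line below the table).

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


ala_ala = 3.764  # AlaAla frequency in per mille, from the table

fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)

# Assumption: one pair per adjacent position inside each protein.
n_amino_acids = 589313
n_proteins = 1961
n_pairs = n_amino_acids - n_proteins
approx_count = fraction * n_pairs

print(fraction, percent, n_pairs, round(approx_count))
```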

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.764AlaAla: 3.764 ± 0.1
0.533AlaCys: 0.533 ± 0.03
2.875AlaAsp: 2.875 ± 0.076
3.351AlaGlu: 3.351 ± 0.085
2.725AlaPhe: 2.725 ± 0.072
3.701AlaGly: 3.701 ± 0.096
1.152AlaHis: 1.152 ± 0.048
5.797AlaIle: 5.797 ± 0.12
4.09AlaLys: 4.09 ± 0.104
6.557AlaLeu: 6.557 ± 0.116
1.946AlaMet: 1.946 ± 0.064
2.868AlaAsn: 2.868 ± 0.078
1.598AlaPro: 1.598 ± 0.059
2.169AlaGln: 2.169 ± 0.065
2.245AlaArg: 2.245 ± 0.068
4.259AlaSer: 4.259 ± 0.088
3.367AlaThr: 3.367 ± 0.093
4.439AlaVal: 4.439 ± 0.097
0.495AlaTrp: 0.495 ± 0.034
2.62AlaTyr: 2.62 ± 0.068
0.0AlaXaa: 0.0 ± 0.0
Cys
0.438CysAla: 0.438 ± 0.029
0.054CysCys: 0.054 ± 0.011
0.482CysAsp: 0.482 ± 0.031
0.4CysGlu: 0.4 ± 0.028
0.27CysPhe: 0.27 ± 0.022
0.55CysGly: 0.55 ± 0.034
0.165CysHis: 0.165 ± 0.017
0.56CysIle: 0.56 ± 0.033
0.37CysLys: 0.37 ± 0.026
0.575CysLeu: 0.575 ± 0.033
0.144CysMet: 0.144 ± 0.015
0.312CysAsn: 0.312 ± 0.022
0.231CysPro: 0.231 ± 0.019
0.183CysGln: 0.183 ± 0.018
0.231CysArg: 0.231 ± 0.02
0.467CysSer: 0.467 ± 0.026
0.328CysThr: 0.328 ± 0.024
0.545CysVal: 0.545 ± 0.028
0.046CysTrp: 0.046 ± 0.008
0.299CysTyr: 0.299 ± 0.022
0.0CysXaa: 0.0 ± 0.0
Asp
3.845AspAla: 3.845 ± 0.094
0.334AspCys: 0.334 ± 0.026
3.609AspAsp: 3.609 ± 0.085
4.901AspGlu: 4.901 ± 0.103
3.139AspPhe: 3.139 ± 0.08
3.665AspGly: 3.665 ± 0.091
1.028AspHis: 1.028 ± 0.044
5.389AspIle: 5.389 ± 0.114
4.015AspLys: 4.015 ± 0.091
5.367AspLeu: 5.367 ± 0.111
1.566AspMet: 1.566 ± 0.054
2.829AspAsn: 2.829 ± 0.081
1.782AspPro: 1.782 ± 0.063
1.497AspGln: 1.497 ± 0.05
2.175AspArg: 2.175 ± 0.061
3.742AspSer: 3.742 ± 0.091
3.394AspThr: 3.394 ± 0.086
4.838AspVal: 4.838 ± 0.11
0.468AspTrp: 0.468 ± 0.03
2.946AspTyr: 2.946 ± 0.072
0.0AspXaa: 0.0 ± 0.0
Glu
4.554GluAla: 4.554 ± 0.115
0.439GluCys: 0.439 ± 0.027
4.125GluAsp: 4.125 ± 0.092
5.759GluGlu: 5.759 ± 0.13
2.978GluPhe: 2.978 ± 0.066
4.195GluGly: 4.195 ± 0.101
1.32GluHis: 1.32 ± 0.046
6.172GluIle: 6.172 ± 0.099
4.972GluLys: 4.972 ± 0.103
6.803GluLeu: 6.803 ± 0.119
2.328GluMet: 2.328 ± 0.063
4.03GluAsn: 4.03 ± 0.084
1.86GluPro: 1.86 ± 0.066
2.006GluGln: 2.006 ± 0.062
2.959GluArg: 2.959 ± 0.077
4.354GluSer: 4.354 ± 0.097
3.996GluThr: 3.996 ± 0.086
5.106GluVal: 5.106 ± 0.096
0.538GluTrp: 0.538 ± 0.033
3.037GluTyr: 3.037 ± 0.076
0.0GluXaa: 0.0 ± 0.0
Phe
2.812PheAla: 2.812 ± 0.082
0.295PheCys: 0.295 ± 0.021
3.282PheAsp: 3.282 ± 0.074
3.397PheGlu: 3.397 ± 0.075
1.98PhePhe: 1.98 ± 0.069
3.226PheGly: 3.226 ± 0.078
0.658PheHis: 0.658 ± 0.031
3.911PheIle: 3.911 ± 0.095
2.993PheLys: 2.993 ± 0.075
3.606PheLeu: 3.606 ± 0.083
1.23PheMet: 1.23 ± 0.051
2.569PheAsn: 2.569 ± 0.066
1.363PhePro: 1.363 ± 0.049
1.117PheGln: 1.117 ± 0.051
1.374PheArg: 1.374 ± 0.054
3.012PheSer: 3.012 ± 0.074
2.656PheThr: 2.656 ± 0.072
3.526PheVal: 3.526 ± 0.08
0.358PheTrp: 0.358 ± 0.023
1.821PheTyr: 1.821 ± 0.057
0.0PheXaa: 0.0 ± 0.0
Gly
3.748GlyAla: 3.748 ± 0.09
0.48GlyCys: 0.48 ± 0.029
3.351GlyAsp: 3.351 ± 0.083
3.562GlyGlu: 3.562 ± 0.088
3.1GlyPhe: 3.1 ± 0.078
3.837GlyGly: 3.837 ± 0.125
1.206GlyHis: 1.206 ± 0.053
5.961GlyIle: 5.961 ± 0.104
4.191GlyLys: 4.191 ± 0.089
5.68GlyLeu: 5.68 ± 0.109
1.799GlyMet: 1.799 ± 0.06
2.846GlyAsn: 2.846 ± 0.068
1.303GlyPro: 1.303 ± 0.049
1.802GlyGln: 1.802 ± 0.057
2.203GlyArg: 2.203 ± 0.066
4.04GlySer: 4.04 ± 0.09
3.614GlyThr: 3.614 ± 0.091
5.05GlyVal: 5.05 ± 0.098
0.602GlyTrp: 0.602 ± 0.035
3.165GlyTyr: 3.165 ± 0.076
0.0GlyXaa: 0.0 ± 0.0
His
1.12HisAla: 1.12 ± 0.043
0.18HisCys: 0.18 ± 0.019
1.317HisAsp: 1.317 ± 0.057
1.458HisGlu: 1.458 ± 0.048
0.947HisPhe: 0.947 ± 0.042
1.259HisGly: 1.259 ± 0.05
0.563HisHis: 0.563 ± 0.032
1.464HisIle: 1.464 ± 0.058
1.113HisLys: 1.113 ± 0.04
1.744HisLeu: 1.744 ± 0.062
0.402HisMet: 0.402 ± 0.026
0.962HisAsn: 0.962 ± 0.044
0.777HisPro: 0.777 ± 0.036
0.753HisGln: 0.753 ± 0.036
0.803HisArg: 0.803 ± 0.037
1.147HisSer: 1.147 ± 0.044
0.981HisThr: 0.981 ± 0.042
1.358HisVal: 1.358 ± 0.05
0.146HisTrp: 0.146 ± 0.015
0.889HisTyr: 0.889 ± 0.038
0.0HisXaa: 0.0 ± 0.0
Ile
6.075IleAla: 6.075 ± 0.108
0.563IleCys: 0.563 ± 0.031
5.851IleAsp: 5.851 ± 0.097
6.273IleGlu: 6.273 ± 0.1
3.355IlePhe: 3.355 ± 0.089
5.092IleGly: 5.092 ± 0.103
1.707IleHis: 1.707 ± 0.052
6.626IleIle: 6.626 ± 0.152
5.291IleLys: 5.291 ± 0.099
7.651IleLeu: 7.651 ± 0.135
2.114IleMet: 2.114 ± 0.063
4.663IleAsn: 4.663 ± 0.096
3.136IlePro: 3.136 ± 0.081
3.044IleGln: 3.044 ± 0.071
3.244IleArg: 3.244 ± 0.077
5.873IleSer: 5.873 ± 0.093
4.889IleThr: 4.889 ± 0.103
6.416IleVal: 6.416 ± 0.108
0.426IleTrp: 0.426 ± 0.03
3.007IleTyr: 3.007 ± 0.084
0.0IleXaa: 0.0 ± 0.0
Lys
4.001LysAla: 4.001 ± 0.094
0.311LysCys: 0.311 ± 0.025
4.429LysAsp: 4.429 ± 0.093
6.134LysGlu: 6.134 ± 0.108
2.238LysPhe: 2.238 ± 0.059
3.648LysGly: 3.648 ± 0.076
1.517LysHis: 1.517 ± 0.055
4.778LysIle: 4.778 ± 0.101
5.072LysLys: 5.072 ± 0.107
5.775LysLeu: 5.775 ± 0.097
1.938LysMet: 1.938 ± 0.061
3.754LysAsn: 3.754 ± 0.087
2.403LysPro: 2.403 ± 0.074
2.501LysGln: 2.501 ± 0.072
3.204LysArg: 3.204 ± 0.071
3.791LysSer: 3.791 ± 0.081
3.874LysThr: 3.874 ± 0.084
4.48LysVal: 4.48 ± 0.08
0.485LysTrp: 0.485 ± 0.032
2.778LysTyr: 2.778 ± 0.07
0.0LysXaa: 0.0 ± 0.0
Leu
5.78LeuAla: 5.78 ± 0.108
0.675LeuCys: 0.675 ± 0.033
5.859LeuAsp: 5.859 ± 0.105
6.774LeuGlu: 6.774 ± 0.114
4.147LeuPhe: 4.147 ± 0.09
5.958LeuGly: 5.958 ± 0.102
1.527LeuHis: 1.527 ± 0.049
7.69LeuIle: 7.69 ± 0.121
6.27LeuLys: 6.27 ± 0.094
8.591LeuLeu: 8.591 ± 0.166
2.572LeuMet: 2.572 ± 0.067
5.325LeuAsn: 5.325 ± 0.088
3.114LeuPro: 3.114 ± 0.085
2.807LeuGln: 2.807 ± 0.072
3.535LeuArg: 3.535 ± 0.08
7.21LeuSer: 7.21 ± 0.124
4.901LeuThr: 4.901 ± 0.094
6.837LeuVal: 6.837 ± 0.122
0.658LeuTrp: 0.658 ± 0.033
3.217LeuTyr: 3.217 ± 0.077
0.0LeuXaa: 0.0 ± 0.0
Met
1.505MetAla: 1.505 ± 0.053
0.185MetCys: 0.185 ± 0.018
1.415MetAsp: 1.415 ± 0.048
1.558MetGlu: 1.558 ± 0.049
1.251MetPhe: 1.251 ± 0.044
1.649MetGly: 1.649 ± 0.063
0.409MetHis: 0.409 ± 0.025
2.489MetIle: 2.489 ± 0.07
2.591MetLys: 2.591 ± 0.066
2.41MetLeu: 2.41 ± 0.074
0.999MetMet: 0.999 ± 0.043
1.901MetAsn: 1.901 ± 0.063
0.943MetPro: 0.943 ± 0.039
0.815MetGln: 0.815 ± 0.036
1.149MetArg: 1.149 ± 0.042
1.931MetSer: 1.931 ± 0.053
1.688MetThr: 1.688 ± 0.051
1.641MetVal: 1.641 ± 0.055
0.17MetTrp: 0.17 ± 0.016
0.923MetTyr: 0.923 ± 0.041
0.0MetXaa: 0.0 ± 0.0
Asn
3.127AsnAla: 3.127 ± 0.074
0.314AsnCys: 0.314 ± 0.023
3.265AsnAsp: 3.265 ± 0.081
3.757AsnGlu: 3.757 ± 0.083
2.075AsnPhe: 2.075 ± 0.061
3.363AsnGly: 3.363 ± 0.097
1.166AsnHis: 1.166 ± 0.043
4.325AsnIle: 4.325 ± 0.093
3.636AsnLys: 3.636 ± 0.088
4.483AsnLeu: 4.483 ± 0.082
1.307AsnMet: 1.307 ± 0.049
2.922AsnAsn: 2.922 ± 0.078
2.245AsnPro: 2.245 ± 0.064
1.985AsnGln: 1.985 ± 0.051
2.347AsnArg: 2.347 ± 0.063
3.18AsnSer: 3.18 ± 0.089
3.095AsnThr: 3.095 ± 0.085
3.665AsnVal: 3.665 ± 0.079
0.36AsnTrp: 0.36 ± 0.027
2.315AsnTyr: 2.315 ± 0.072
0.0AsnXaa: 0.0 ± 0.0
Pro
1.59ProAla: 1.59 ± 0.059
0.188ProCys: 0.188 ± 0.02
1.58ProAsp: 1.58 ± 0.054
2.639ProGlu: 2.639 ± 0.067
1.598ProPhe: 1.598 ± 0.056
1.699ProGly: 1.699 ± 0.057
0.655ProHis: 0.655 ± 0.035
2.739ProIle: 2.739 ± 0.078
2.004ProLys: 2.004 ± 0.059
2.839ProLeu: 2.839 ± 0.068
0.845ProMet: 0.845 ± 0.038
1.697ProAsn: 1.697 ± 0.052
0.489ProPro: 0.489 ± 0.034
1.047ProGln: 1.047 ± 0.045
0.915ProArg: 0.915 ± 0.041
2.153ProSer: 2.153 ± 0.067
1.885ProThr: 1.885 ± 0.088
2.335ProVal: 2.335 ± 0.065
0.282ProTrp: 0.282 ± 0.02
1.346ProTyr: 1.346 ± 0.052
0.0ProXaa: 0.0 ± 0.0
Gln
2.274GlnAla: 2.274 ± 0.062
0.2GlnCys: 0.2 ± 0.016
1.789GlnAsp: 1.789 ± 0.063
2.842GlnGlu: 2.842 ± 0.069
1.502GlnPhe: 1.502 ± 0.054
1.945GlnGly: 1.945 ± 0.065
0.628GlnHis: 0.628 ± 0.033
2.535GlnIle: 2.535 ± 0.067
2.299GlnLys: 2.299 ± 0.056
2.802GlnLeu: 2.802 ± 0.068
0.848GlnMet: 0.848 ± 0.037
1.707GlnAsn: 1.707 ± 0.055
0.663GlnPro: 0.663 ± 0.031
1.106GlnGln: 1.106 ± 0.055
1.363GlnArg: 1.363 ± 0.051
2.303GlnSer: 2.303 ± 0.074
1.843GlnThr: 1.843 ± 0.049
2.13GlnVal: 2.13 ± 0.06
0.275GlnTrp: 0.275 ± 0.023
1.269GlnTyr: 1.269 ± 0.044
0.0GlnXaa: 0.0 ± 0.0
Arg
1.901ArgAla: 1.901 ± 0.061
0.224ArgCys: 0.224 ± 0.02
2.34ArgAsp: 2.34 ± 0.069
2.854ArgGlu: 2.854 ± 0.075
1.909ArgPhe: 1.909 ± 0.052
2.028ArgGly: 2.028 ± 0.071
0.881ArgHis: 0.881 ± 0.037
3.54ArgIle: 3.54 ± 0.08
2.778ArgLys: 2.778 ± 0.069
3.596ArgLeu: 3.596 ± 0.083
1.144ArgMet: 1.144 ± 0.045
2.199ArgAsn: 2.199 ± 0.057
1.04ArgPro: 1.04 ± 0.041
1.337ArgGln: 1.337 ± 0.044
1.821ArgArg: 1.821 ± 0.067
2.214ArgSer: 2.214 ± 0.066
2.031ArgThr: 2.031 ± 0.072
2.837ArgVal: 2.837 ± 0.079
0.263ArgTrp: 0.263 ± 0.019
2.08ArgTyr: 2.08 ± 0.071
0.0ArgXaa: 0.0 ± 0.0
Ser
3.355SerAla: 3.355 ± 0.085
0.377SerCys: 0.377 ± 0.027
3.825SerAsp: 3.825 ± 0.083
4.042SerGlu: 4.042 ± 0.079
3.433SerPhe: 3.433 ± 0.085
4.314SerGly: 4.314 ± 0.096
1.217SerHis: 1.217 ± 0.042
6.078SerIle: 6.078 ± 0.104
4.69SerLys: 4.69 ± 0.103
6.917SerLeu: 6.917 ± 0.125
1.917SerMet: 1.917 ± 0.059
3.502SerAsn: 3.502 ± 0.093
1.71SerPro: 1.71 ± 0.063
2.257SerGln: 2.257 ± 0.069
2.503SerArg: 2.503 ± 0.076
4.522SerSer: 4.522 ± 0.109
3.528SerThr: 3.528 ± 0.097
4.987SerVal: 4.987 ± 0.103
0.531SerTrp: 0.531 ± 0.028
3.014SerTyr: 3.014 ± 0.07
0.0SerXaa: 0.0 ± 0.0
Thr
3.102ThrAla: 3.102 ± 0.088
0.317ThrCys: 0.317 ± 0.022
3.026ThrAsp: 3.026 ± 0.094
3.278ThrGlu: 3.278 ± 0.081
2.589ThrPhe: 2.589 ± 0.073
3.652ThrGly: 3.652 ± 0.085
1.217ThrHis: 1.217 ± 0.046
5.228ThrIle: 5.228 ± 0.097
3.329ThrLys: 3.329 ± 0.077
5.966ThrLeu: 5.966 ± 0.108
1.446ThrMet: 1.446 ± 0.05
2.652ThrAsn: 2.652 ± 0.071
2.067ThrPro: 2.067 ± 0.064
2.001ThrGln: 2.001 ± 0.065
2.041ThrArg: 2.041 ± 0.064
3.677ThrSer: 3.677 ± 0.079
3.005ThrThr: 3.005 ± 0.087
4.51ThrVal: 4.51 ± 0.126
0.482ThrTrp: 0.482 ± 0.032
2.452ThrTyr: 2.452 ± 0.075
0.0ThrXaa: 0.0 ± 0.0
Val
4.736ValAla: 4.736 ± 0.093
0.584ValCys: 0.584 ± 0.032
4.755ValAsp: 4.755 ± 0.098
5.06ValGlu: 5.06 ± 0.112
3.548ValPhe: 3.548 ± 0.074
4.609ValGly: 4.609 ± 0.09
1.273ValHis: 1.273 ± 0.052
6.212ValIle: 6.212 ± 0.094
4.402ValLys: 4.402 ± 0.102
7.266ValLeu: 7.266 ± 0.12
1.997ValMet: 1.997 ± 0.067
3.565ValAsn: 3.565 ± 0.081
2.174ValPro: 2.174 ± 0.058
1.916ValGln: 1.916 ± 0.059
2.652ValArg: 2.652 ± 0.068
5.554ValSer: 5.554 ± 0.102
4.164ValThr: 4.164 ± 0.109
5.905ValVal: 5.905 ± 0.12
0.523ValTrp: 0.523 ± 0.035
2.798ValTyr: 2.798 ± 0.061
0.0ValXaa: 0.0 ± 0.0
Trp
0.382TrpAla: 0.382 ± 0.027
0.066TrpCys: 0.066 ± 0.011
0.463TrpAsp: 0.463 ± 0.03
0.494TrpGlu: 0.494 ± 0.033
0.421TrpPhe: 0.421 ± 0.024
0.489TrpGly: 0.489 ± 0.033
0.141TrpHis: 0.141 ± 0.013
0.713TrpIle: 0.713 ± 0.038
0.494TrpLys: 0.494 ± 0.032
0.784TrpLeu: 0.784 ± 0.038
0.224TrpMet: 0.224 ± 0.017
0.512TrpAsn: 0.512 ± 0.035
0.127TrpPro: 0.127 ± 0.014
0.232TrpGln: 0.232 ± 0.018
0.239TrpArg: 0.239 ± 0.021
0.495TrpSer: 0.495 ± 0.031
0.378TrpThr: 0.378 ± 0.024
0.484TrpVal: 0.484 ± 0.03
0.071TrpTrp: 0.071 ± 0.012
0.344TrpTyr: 0.344 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.442TyrAla: 2.442 ± 0.065
0.328TyrCys: 0.328 ± 0.022
2.893TyrAsp: 2.893 ± 0.075
2.998TyrGlu: 2.998 ± 0.076
1.98TyrPhe: 1.98 ± 0.068
2.664TyrGly: 2.664 ± 0.068
0.954TyrHis: 0.954 ± 0.038
3.09TyrIle: 3.09 ± 0.078
2.51TyrLys: 2.51 ± 0.071
4.037TyrLeu: 4.037 ± 0.097
0.852TyrMet: 0.852 ± 0.041
2.118TyrAsn: 2.118 ± 0.061
1.498TyrPro: 1.498 ± 0.054
1.748TyrGln: 1.748 ± 0.049
2.014TyrArg: 2.014 ± 0.07
2.793TyrSer: 2.793 ± 0.075
2.421TyrThr: 2.421 ± 0.084
2.605TyrVal: 2.605 ± 0.076
0.375TyrTrp: 0.375 ± 0.027
2.029TyrTyr: 2.029 ± 0.07
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1961 proteins (589313 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
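One way such a protein-level bootstrap could look is sketched below: whole proteins are resampled with replacement, the frequency of a chosen dipeptide is recomputed for each resample, and the spread of the 100 estimates is reported. The function names, the fixed seed, the toy sequences, and the use of the standard deviation as the reported error are all assumptions; the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a pair's per mille frequency
    over n_boot resamples of whole proteins drawn with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not individual residues or pairs.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKLLA", "AKLL", "LLLL", "MKTAY"]
mean_ll, err_ll = bootstrap_error(toy, "LL")
print(round(mean_ll, 1), "±", round(err_ll, 1))
```

Resampling whole proteins keeps the pairs within each protein together, so the estimate reflects protein-to-protein variation rather than treating every pair as independent.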

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski