Amino acid dipeptide frequency for Clostridium novyi (strain NT)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
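As a rough illustration of how such a frequency could be computed, here is a minimal Python sketch. It assumes one-letter amino acid codes, overlapping windows of length two that never span two proteins, and normalization over all counted pairs. These choices, the function name and the toy sequences are assumptions for illustration; the page does not describe the exact counting procedure used by Proteome-pI.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping windows of length 2, counted within each
    protein only (a pair never spans two proteins).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale to per mille so the numbers are comparable to the table.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Invented toy sequences, not real proteome data.
toy_proteins = ["MKKLI", "MKI"]
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

With this convention, a protein of length n contributes n - 1 dipeptides, so the frequencies over all observed pairs sum to 1000 ‰.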

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
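The arithmetic behind the expected value and the unit conversion can be sketched as follows. The uniform expectation of 1/400 corresponds to 2.5 ‰, and a table value is compared against it. The plain greater-than/less-than comparison is an assumption for illustration: the page does not say whether the red/blue marking also takes the error margin into account. The function names are hypothetical.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / (N_STANDARD ** 2)


def permille_to_fraction(value):
    """Convert a per mille table value to a plain fraction."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PERMILLE):
    """Label a per mille value relative to the uniform expectation."""
    if value > expected:
        return "overrepresented"
    if value < expected:
        return "underrepresented"
    return "as expected"


# Values taken from the table on this page.
examples = {"IleLys": 10.078, "AlaAla": 3.605, "TrpTrp": 0.06}
for pair, value in examples.items():
    print(pair, permille_to_fraction(value), classify(value))
```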

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.605 ± 0.103
AlaCys: 0.699 ± 0.029
AlaAsp: 2.367 ± 0.058
AlaGlu: 3.363 ± 0.072
AlaPhe: 2.39 ± 0.069
AlaGly: 3.519 ± 0.093
AlaHis: 0.824 ± 0.038
AlaIle: 5.81 ± 0.105
AlaLys: 4.972 ± 0.097
AlaLeu: 5.469 ± 0.11
AlaMet: 1.698 ± 0.043
AlaAsn: 2.551 ± 0.053
AlaPro: 1.389 ± 0.047
AlaGln: 1.338 ± 0.045
AlaArg: 1.925 ± 0.05
AlaSer: 3.206 ± 0.072
AlaThr: 2.741 ± 0.06
AlaVal: 3.844 ± 0.083
AlaTrp: 0.296 ± 0.019
AlaTyr: 2.1 ± 0.059
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.646 ± 0.036
CysCys: 0.204 ± 0.018
CysAsp: 0.81 ± 0.033
CysGlu: 0.83 ± 0.038
CysPhe: 0.506 ± 0.029
CysGly: 1.107 ± 0.043
CysHis: 0.237 ± 0.018
CysIle: 1.23 ± 0.041
CysLys: 1.071 ± 0.042
CysLeu: 0.88 ± 0.034
CysMet: 0.306 ± 0.02
CysAsn: 0.852 ± 0.032
CysPro: 0.479 ± 0.026
CysGln: 0.189 ± 0.015
CysArg: 0.416 ± 0.025
CysSer: 0.808 ± 0.036
CysThr: 0.616 ± 0.03
CysVal: 0.738 ± 0.027
CysTrp: 0.07 ± 0.01
CysTyr: 0.476 ± 0.028
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.703 ± 0.059
AspCys: 0.59 ± 0.024
AspAsp: 2.84 ± 0.062
AspGlu: 4.702 ± 0.104
AspPhe: 2.563 ± 0.065
AspGly: 3.306 ± 0.077
AspHis: 0.599 ± 0.03
AspIle: 6.636 ± 0.086
AspLys: 5.843 ± 0.103
AspLeu: 4.536 ± 0.083
AspMet: 1.53 ± 0.047
AspAsn: 3.548 ± 0.083
AspPro: 1.341 ± 0.04
AspGln: 0.668 ± 0.027
AspArg: 1.727 ± 0.052
AspSer: 3.12 ± 0.066
AspThr: 2.521 ± 0.063
AspVal: 3.907 ± 0.074
AspTrp: 0.335 ± 0.023
AspTyr: 2.564 ± 0.066
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.026 ± 0.084
GluCys: 0.769 ± 0.036
GluAsp: 4.624 ± 0.084
GluGlu: 6.958 ± 0.125
GluPhe: 3.061 ± 0.064
GluGly: 3.922 ± 0.079
GluHis: 0.905 ± 0.033
GluIle: 7.049 ± 0.109
GluLys: 8.231 ± 0.124
GluLeu: 6.622 ± 0.113
GluMet: 1.84 ± 0.061
GluAsn: 5.717 ± 0.105
GluPro: 1.469 ± 0.055
GluGln: 1.803 ± 0.055
GluArg: 2.558 ± 0.062
GluSer: 3.432 ± 0.07
GluThr: 2.867 ± 0.067
GluVal: 5.237 ± 0.093
GluTrp: 0.359 ± 0.023
GluTyr: 2.977 ± 0.079
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.169 ± 0.061
PheCys: 0.558 ± 0.025
PheAsp: 2.394 ± 0.061
PheGlu: 2.564 ± 0.058
PhePhe: 1.802 ± 0.058
PheGly: 2.756 ± 0.061
PheHis: 0.546 ± 0.025
PheIle: 4.578 ± 0.093
PheLys: 4.262 ± 0.078
PheLeu: 3.823 ± 0.072
PheMet: 1.133 ± 0.04
PheAsn: 3.173 ± 0.065
PhePro: 1.119 ± 0.044
PheGln: 0.982 ± 0.037
PheArg: 1.267 ± 0.045
PheSer: 2.909 ± 0.068
PheThr: 2.227 ± 0.052
PheVal: 2.545 ± 0.061
PheTrp: 0.256 ± 0.02
PheTyr: 1.692 ± 0.05
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.996 ± 0.099
GlyCys: 0.905 ± 0.039
GlyAsp: 3.135 ± 0.069
GlyGlu: 4.229 ± 0.08
GlyPhe: 3.072 ± 0.063
GlyGly: 4.183 ± 0.094
GlyHis: 1.052 ± 0.038
GlyIle: 6.676 ± 0.098
GlyLys: 6.035 ± 0.11
GlyLeu: 5.255 ± 0.089
GlyMet: 1.72 ± 0.057
GlyAsn: 3.26 ± 0.068
GlyPro: 1.256 ± 0.049
GlyGln: 1.428 ± 0.043
GlyArg: 2.212 ± 0.054
GlySer: 3.43 ± 0.075
GlyThr: 3.254 ± 0.076
GlyVal: 4.73 ± 0.093
GlyTrp: 0.409 ± 0.027
GlyTyr: 2.91 ± 0.076
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.68 ± 0.033
HisCys: 0.195 ± 0.017
HisAsp: 0.742 ± 0.034
HisGlu: 0.812 ± 0.034
HisPhe: 0.597 ± 0.025
HisGly: 1.002 ± 0.035
HisHis: 0.273 ± 0.02
HisIle: 1.371 ± 0.04
HisLys: 1.237 ± 0.042
HisLeu: 1.12 ± 0.044
HisMet: 0.358 ± 0.019
HisAsn: 0.841 ± 0.029
HisPro: 0.601 ± 0.03
HisGln: 0.329 ± 0.02
HisArg: 0.522 ± 0.024
HisSer: 0.846 ± 0.033
HisThr: 0.653 ± 0.033
HisVal: 0.762 ± 0.035
HisTrp: 0.111 ± 0.011
HisTyr: 0.506 ± 0.026
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.636 ± 0.099
IleCys: 1.343 ± 0.047
IleAsp: 6.056 ± 0.097
IleGlu: 7.18 ± 0.126
IlePhe: 4.306 ± 0.102
IleGly: 6.167 ± 0.111
IleHis: 1.245 ± 0.04
IleIle: 9.525 ± 0.155
IleLys: 10.078 ± 0.139
IleLeu: 9.336 ± 0.149
IleMet: 2.438 ± 0.063
IleAsn: 6.817 ± 0.105
IlePro: 3.389 ± 0.066
IleGln: 2.296 ± 0.055
IleArg: 3.028 ± 0.067
IleSer: 6.663 ± 0.106
IleThr: 4.937 ± 0.092
IleVal: 6.456 ± 0.106
IleTrp: 0.47 ± 0.029
IleTyr: 3.523 ± 0.072
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.914 ± 0.094
LysCys: 1.001 ± 0.044
LysAsp: 6.709 ± 0.124
LysGlu: 9.955 ± 0.16
LysPhe: 3.742 ± 0.081
LysGly: 5.531 ± 0.106
LysHis: 1.234 ± 0.045
LysIle: 9.15 ± 0.11
LysLys: 9.476 ± 0.161
LysLeu: 8.175 ± 0.097
LysMet: 2.575 ± 0.063
LysAsn: 7.456 ± 0.137
LysPro: 2.257 ± 0.06
LysGln: 2.405 ± 0.057
LysArg: 3.312 ± 0.078
LysSer: 5.448 ± 0.092
LysThr: 4.285 ± 0.07
LysVal: 6.791 ± 0.123
LysTrp: 0.603 ± 0.028
LysTyr: 4.605 ± 0.094
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.675 ± 0.094
LeuCys: 1.096 ± 0.04
LeuAsp: 4.944 ± 0.084
LeuGlu: 6.138 ± 0.105
LeuPhe: 3.309 ± 0.071
LeuGly: 5.998 ± 0.108
LeuHis: 1.002 ± 0.039
LeuIle: 7.961 ± 0.133
LeuLys: 9.481 ± 0.124
LeuLeu: 7.105 ± 0.111
LeuMet: 2.233 ± 0.056
LeuAsn: 6.233 ± 0.116
LeuPro: 2.601 ± 0.065
LeuGln: 2.063 ± 0.049
LeuArg: 2.953 ± 0.071
LeuSer: 6.081 ± 0.099
LeuThr: 4.331 ± 0.092
LeuVal: 5.476 ± 0.094
LeuTrp: 0.481 ± 0.028
LeuTyr: 3.201 ± 0.069
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.803 ± 0.052
MetCys: 0.298 ± 0.021
MetAsp: 1.599 ± 0.043
MetGlu: 1.942 ± 0.048
MetPhe: 1.03 ± 0.041
MetGly: 1.903 ± 0.051
MetHis: 0.357 ± 0.023
MetIle: 2.253 ± 0.063
MetLys: 2.563 ± 0.068
MetLeu: 2.239 ± 0.056
MetMet: 0.65 ± 0.031
MetAsn: 1.791 ± 0.045
MetPro: 0.876 ± 0.036
MetGln: 0.625 ± 0.031
MetArg: 0.89 ± 0.034
MetSer: 1.635 ± 0.049
MetThr: 1.138 ± 0.039
MetVal: 1.559 ± 0.044
MetTrp: 0.155 ± 0.014
MetTyr: 0.967 ± 0.036
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.042 ± 0.075
AsnCys: 0.895 ± 0.041
AsnAsp: 3.154 ± 0.076
AsnGlu: 4.443 ± 0.085
AsnPhe: 2.684 ± 0.063
AsnGly: 3.657 ± 0.074
AsnHis: 0.872 ± 0.036
AsnIle: 7.736 ± 0.129
AsnLys: 7.4 ± 0.124
AsnLeu: 5.765 ± 0.11
AsnMet: 1.783 ± 0.046
AsnAsn: 4.982 ± 0.12
AsnPro: 2.139 ± 0.056
AsnGln: 1.24 ± 0.042
AsnArg: 1.991 ± 0.051
AsnSer: 3.911 ± 0.084
AsnThr: 2.914 ± 0.06
AsnVal: 4.043 ± 0.074
AsnTrp: 0.444 ± 0.023
AsnTyr: 2.846 ± 0.068
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.336 ± 0.052
ProCys: 0.381 ± 0.021
ProAsp: 1.306 ± 0.043
ProGlu: 2.233 ± 0.08
ProPhe: 1.288 ± 0.051
ProGly: 1.769 ± 0.048
ProHis: 0.48 ± 0.026
ProIle: 2.835 ± 0.059
ProLys: 2.659 ± 0.065
ProLeu: 2.275 ± 0.051
ProMet: 0.76 ± 0.03
ProAsn: 1.655 ± 0.05
ProPro: 0.632 ± 0.028
ProGln: 0.739 ± 0.034
ProArg: 0.809 ± 0.035
ProSer: 1.739 ± 0.054
ProThr: 1.455 ± 0.046
ProVal: 2.008 ± 0.054
ProTrp: 0.195 ± 0.018
ProTyr: 1.271 ± 0.044
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.203 ± 0.044
GlnCys: 0.296 ± 0.019
GlnAsp: 1.159 ± 0.038
GlnGlu: 1.731 ± 0.062
GlnPhe: 0.942 ± 0.034
GlnGly: 1.426 ± 0.049
GlnHis: 0.315 ± 0.022
GlnIle: 2.149 ± 0.059
GlnLys: 1.991 ± 0.058
GlnLeu: 2.006 ± 0.056
GlnMet: 0.669 ± 0.031
GlnAsn: 1.674 ± 0.051
GlnPro: 0.512 ± 0.026
GlnGln: 0.662 ± 0.037
GlnArg: 0.849 ± 0.037
GlnSer: 1.292 ± 0.041
GlnThr: 0.883 ± 0.036
GlnVal: 1.418 ± 0.044
GlnTrp: 0.207 ± 0.018
GlnTyr: 1.033 ± 0.041
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.788 ± 0.051
ArgCys: 0.402 ± 0.027
ArgAsp: 1.879 ± 0.049
ArgGlu: 2.992 ± 0.072
ArgPhe: 1.392 ± 0.043
ArgGly: 2.046 ± 0.059
ArgHis: 0.495 ± 0.028
ArgIle: 3.112 ± 0.062
ArgLys: 3.184 ± 0.06
ArgLeu: 2.648 ± 0.07
ArgMet: 0.97 ± 0.042
ArgAsn: 1.919 ± 0.052
ArgPro: 0.879 ± 0.034
ArgGln: 0.906 ± 0.038
ArgArg: 1.447 ± 0.049
ArgSer: 1.581 ± 0.05
ArgThr: 1.587 ± 0.046
ArgVal: 2.216 ± 0.061
ArgTrp: 0.248 ± 0.02
ArgTyr: 1.373 ± 0.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.054 ± 0.066
SerCys: 0.68 ± 0.033
SerAsp: 2.814 ± 0.056
SerGlu: 3.859 ± 0.077
SerPhe: 2.728 ± 0.063
SerGly: 4.006 ± 0.073
SerHis: 0.917 ± 0.034
SerIle: 6.798 ± 0.104
SerLys: 6.119 ± 0.098
SerLeu: 5.458 ± 0.109
SerMet: 1.652 ± 0.048
SerAsn: 3.807 ± 0.088
SerPro: 1.596 ± 0.049
SerGln: 1.452 ± 0.046
SerArg: 1.958 ± 0.048
SerSer: 3.977 ± 0.082
SerThr: 3.032 ± 0.07
SerVal: 3.748 ± 0.076
SerTrp: 0.369 ± 0.024
SerTyr: 2.334 ± 0.055
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.766 ± 0.067
ThrCys: 0.55 ± 0.031
ThrAsp: 2.27 ± 0.056
ThrGlu: 2.944 ± 0.067
ThrPhe: 2.075 ± 0.05
ThrGly: 3.52 ± 0.078
ThrHis: 0.717 ± 0.026
ThrIle: 4.853 ± 0.081
ThrLys: 4.169 ± 0.092
ThrLeu: 4.649 ± 0.084
ThrMet: 1.129 ± 0.038
ThrAsn: 2.61 ± 0.059
ThrPro: 1.759 ± 0.049
ThrGln: 0.952 ± 0.034
ThrArg: 1.584 ± 0.047
ThrSer: 2.946 ± 0.063
ThrThr: 2.473 ± 0.065
ThrVal: 3.364 ± 0.079
ThrTrp: 0.304 ± 0.022
ThrTyr: 1.727 ± 0.048
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.871 ± 0.082
ValCys: 0.928 ± 0.033
ValAsp: 4.062 ± 0.079
ValGlu: 4.52 ± 0.089
ValPhe: 2.98 ± 0.068
ValGly: 4.402 ± 0.09
ValHis: 0.901 ± 0.035
ValIle: 6.366 ± 0.108
ValLys: 6.108 ± 0.103
ValLeu: 6.075 ± 0.106
ValMet: 1.639 ± 0.043
ValAsn: 3.681 ± 0.066
ValPro: 2.18 ± 0.061
ValGln: 1.506 ± 0.052
ValArg: 1.984 ± 0.065
ValSer: 4.357 ± 0.087
ValThr: 3.231 ± 0.069
ValVal: 4.841 ± 0.106
ValTrp: 0.4 ± 0.026
ValTyr: 2.383 ± 0.068
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.315 ± 0.019
TrpCys: 0.104 ± 0.012
TrpAsp: 0.35 ± 0.021
TrpGlu: 0.398 ± 0.022
TrpPhe: 0.33 ± 0.019
TrpGly: 0.428 ± 0.027
TrpHis: 0.112 ± 0.014
TrpIle: 0.657 ± 0.031
TrpLys: 0.51 ± 0.028
TrpLeu: 0.473 ± 0.028
TrpMet: 0.171 ± 0.016
TrpAsn: 0.458 ± 0.032
TrpPro: 0.134 ± 0.014
TrpGln: 0.182 ± 0.016
TrpArg: 0.221 ± 0.018
TrpSer: 0.298 ± 0.019
TrpThr: 0.251 ± 0.019
TrpVal: 0.355 ± 0.021
TrpTrp: 0.06 ± 0.008
TrpTyr: 0.228 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.879 ± 0.056
TyrCys: 0.575 ± 0.031
TyrAsp: 2.367 ± 0.064
TyrGlu: 2.636 ± 0.068
TyrPhe: 1.961 ± 0.055
TyrGly: 2.496 ± 0.054
TyrHis: 0.473 ± 0.027
TyrIle: 4.147 ± 0.081
TyrLys: 4.233 ± 0.089
TyrLeu: 3.449 ± 0.073
TyrMet: 1.019 ± 0.038
TyrAsn: 2.918 ± 0.082
TyrPro: 1.214 ± 0.048
TyrGln: 0.651 ± 0.029
TyrArg: 1.411 ± 0.05
TyrSer: 2.681 ± 0.065
TyrThr: 1.964 ± 0.048
TyrVal: 2.378 ± 0.049
TyrTrp: 0.262 ± 0.021
TyrTyr: 1.68 ± 0.056
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2305 proteins (729213 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
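A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed on each resample, and the spread of the resulting estimates is reported. Using the sample standard deviation as the error, the fixed seed, the function names and the toy sequences are all assumptions for illustration; the page only states that bootstrapping with 100 rounds at the protein level was used.

```python
import random
import statistics


def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency estimate under protein-level resampling.

    Assumption: the error is the sample standard deviation of the
    per-round estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)


# Invented toy sequences, not real proteome data.
toy_proteins = ["MKKLI", "MKI", "AKKA", "LIKK"]
value = dipeptide_permille(toy_proteins, "KK")
error = bootstrap_error(toy_proteins, "KK")
print(f"KK: {value:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that co-occur within one protein stay together in every resample.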

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski