Amino acid dipepetide frequency for Herbaspirillum sp. CF444

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.23AlaAla: 15.23 ± 0.146
1.09AlaCys: 1.09 ± 0.026
6.199AlaAsp: 6.199 ± 0.068
5.96AlaGlu: 5.96 ± 0.074
3.877AlaPhe: 3.877 ± 0.05
9.616AlaGly: 9.616 ± 0.089
2.25AlaHis: 2.25 ± 0.037
6.259AlaIle: 6.259 ± 0.081
4.265AlaLys: 4.265 ± 0.067
12.845AlaLeu: 12.845 ± 0.116
3.4AlaMet: 3.4 ± 0.05
3.39AlaAsn: 3.39 ± 0.057
4.897AlaPro: 4.897 ± 0.063
5.158AlaGln: 5.158 ± 0.064
6.973AlaArg: 6.973 ± 0.08
6.639AlaSer: 6.639 ± 0.07
5.89AlaThr: 5.89 ± 0.069
8.118AlaVal: 8.118 ± 0.075
1.452AlaTrp: 1.452 ± 0.031
2.766AlaTyr: 2.766 ± 0.039
0.0AlaXaa: 0.0 ± 0.0
Cys
0.995CysAla: 0.995 ± 0.025
0.111CysCys: 0.111 ± 0.008
0.497CysAsp: 0.497 ± 0.021
0.436CysGlu: 0.436 ± 0.016
0.312CysPhe: 0.312 ± 0.015
0.874CysGly: 0.874 ± 0.027
0.228CysHis: 0.228 ± 0.012
0.46CysIle: 0.46 ± 0.02
0.276CysLys: 0.276 ± 0.013
0.83CysLeu: 0.83 ± 0.024
0.187CysMet: 0.187 ± 0.011
0.267CysAsn: 0.267 ± 0.013
0.372CysPro: 0.372 ± 0.018
0.223CysGln: 0.223 ± 0.012
0.518CysArg: 0.518 ± 0.018
0.529CysSer: 0.529 ± 0.02
0.429CysThr: 0.429 ± 0.017
0.656CysVal: 0.656 ± 0.021
0.081CysTrp: 0.081 ± 0.008
0.215CysTyr: 0.215 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
6.538AspAla: 6.538 ± 0.075
0.452AspCys: 0.452 ± 0.016
2.931AspAsp: 2.931 ± 0.044
2.975AspGlu: 2.975 ± 0.054
2.188AspPhe: 2.188 ± 0.039
4.515AspGly: 4.515 ± 0.061
1.062AspHis: 1.062 ± 0.029
3.287AspIle: 3.287 ± 0.045
2.356AspLys: 2.356 ± 0.043
5.157AspLeu: 5.157 ± 0.06
1.405AspMet: 1.405 ± 0.032
1.688AspAsn: 1.688 ± 0.032
2.462AspPro: 2.462 ± 0.04
1.785AspGln: 1.785 ± 0.036
2.847AspArg: 2.847 ± 0.041
2.577AspSer: 2.577 ± 0.039
2.602AspThr: 2.602 ± 0.039
4.071AspVal: 4.071 ± 0.051
0.834AspTrp: 0.834 ± 0.022
1.619AspTyr: 1.619 ± 0.035
0.0AspXaa: 0.0 ± 0.0
Glu
5.765GluAla: 5.765 ± 0.075
0.377GluCys: 0.377 ± 0.017
2.227GluAsp: 2.227 ± 0.038
2.989GluGlu: 2.989 ± 0.052
1.862GluPhe: 1.862 ± 0.036
3.151GluGly: 3.151 ± 0.047
1.319GluHis: 1.319 ± 0.025
3.233GluIle: 3.233 ± 0.051
2.591GluLys: 2.591 ± 0.053
5.549GluLeu: 5.549 ± 0.067
1.385GluMet: 1.385 ± 0.033
1.714GluAsn: 1.714 ± 0.034
1.96GluPro: 1.96 ± 0.037
2.969GluGln: 2.969 ± 0.058
3.792GluArg: 3.792 ± 0.061
2.688GluSer: 2.688 ± 0.043
2.676GluThr: 2.676 ± 0.048
3.69GluVal: 3.69 ± 0.05
0.649GluTrp: 0.649 ± 0.02
1.208GluTyr: 1.208 ± 0.026
0.0GluXaa: 0.0 ± 0.0
Phe
4.023PheAla: 4.023 ± 0.05
0.403PheCys: 0.403 ± 0.019
2.463PheAsp: 2.463 ± 0.041
1.913PheGlu: 1.913 ± 0.031
1.611PhePhe: 1.611 ± 0.036
3.477PheGly: 3.477 ± 0.046
0.782PheHis: 0.782 ± 0.024
1.976PheIle: 1.976 ± 0.037
1.42PheLys: 1.42 ± 0.033
3.351PheLeu: 3.351 ± 0.052
0.908PheMet: 0.908 ± 0.025
1.42PheAsn: 1.42 ± 0.03
1.622PhePro: 1.622 ± 0.028
1.186PheGln: 1.186 ± 0.022
1.909PheArg: 1.909 ± 0.035
2.803PheSer: 2.803 ± 0.049
1.977PheThr: 1.977 ± 0.037
2.778PheVal: 2.778 ± 0.045
0.516PheTrp: 0.516 ± 0.019
1.043PheTyr: 1.043 ± 0.025
0.0PheXaa: 0.0 ± 0.0
Gly
8.043GlyAla: 8.043 ± 0.094
0.72GlyCys: 0.72 ± 0.022
3.943GlyAsp: 3.943 ± 0.064
3.95GlyGlu: 3.95 ± 0.055
3.337GlyPhe: 3.337 ± 0.048
6.228GlyGly: 6.228 ± 0.095
1.634GlyHis: 1.634 ± 0.034
4.829GlyIle: 4.829 ± 0.057
4.309GlyLys: 4.309 ± 0.06
7.999GlyLeu: 7.999 ± 0.072
2.465GlyMet: 2.465 ± 0.041
2.796GlyAsn: 2.796 ± 0.048
2.448GlyPro: 2.448 ± 0.043
2.9GlyGln: 2.9 ± 0.043
4.403GlyArg: 4.403 ± 0.061
4.703GlySer: 4.703 ± 0.074
4.145GlyThr: 4.145 ± 0.071
6.121GlyVal: 6.121 ± 0.074
1.169GlyTrp: 1.169 ± 0.027
2.368GlyTyr: 2.368 ± 0.046
0.0GlyXaa: 0.0 ± 0.0
His
2.591HisAla: 2.591 ± 0.047
0.243HisCys: 0.243 ± 0.013
1.23HisAsp: 1.23 ± 0.028
1.065HisGlu: 1.065 ± 0.026
0.976HisPhe: 0.976 ± 0.025
1.945HisGly: 1.945 ± 0.035
0.613HisHis: 0.613 ± 0.023
1.182HisIle: 1.182 ± 0.028
0.664HisLys: 0.664 ± 0.021
2.151HisLeu: 2.151 ± 0.037
0.514HisMet: 0.514 ± 0.018
0.577HisAsn: 0.577 ± 0.023
1.338HisPro: 1.338 ± 0.031
0.837HisGln: 0.837 ± 0.022
1.221HisArg: 1.221 ± 0.03
1.035HisSer: 1.035 ± 0.027
0.947HisThr: 0.947 ± 0.025
1.561HisVal: 1.561 ± 0.032
0.327HisTrp: 0.327 ± 0.014
0.686HisTyr: 0.686 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
7.411IleAla: 7.411 ± 0.084
0.514IleCys: 0.514 ± 0.019
3.624IleAsp: 3.624 ± 0.044
3.214IleGlu: 3.214 ± 0.047
1.791IlePhe: 1.791 ± 0.033
5.091IleGly: 5.091 ± 0.052
0.998IleHis: 0.998 ± 0.027
2.37IleIle: 2.37 ± 0.045
2.189IleLys: 2.189 ± 0.04
4.685IleLeu: 4.685 ± 0.062
1.054IleMet: 1.054 ± 0.026
1.972IleAsn: 1.972 ± 0.042
2.534IlePro: 2.534 ± 0.043
1.541IleGln: 1.541 ± 0.038
3.13IleArg: 3.13 ± 0.042
3.411IleSer: 3.411 ± 0.048
2.953IleThr: 2.953 ± 0.046
4.53IleVal: 4.53 ± 0.056
0.622IleTrp: 0.622 ± 0.021
1.308IleTyr: 1.308 ± 0.031
0.0IleXaa: 0.0 ± 0.0
Lys
4.281LysAla: 4.281 ± 0.062
0.185LysCys: 0.185 ± 0.013
2.091LysAsp: 2.091 ± 0.041
2.17LysGlu: 2.17 ± 0.041
1.282LysPhe: 1.282 ± 0.029
2.622LysGly: 2.622 ± 0.048
0.784LysHis: 0.784 ± 0.024
2.452LysIle: 2.452 ± 0.041
2.151LysLys: 2.151 ± 0.048
4.374LysLeu: 4.374 ± 0.054
1.221LysMet: 1.221 ± 0.032
1.585LysAsn: 1.585 ± 0.032
2.274LysPro: 2.274 ± 0.038
1.794LysGln: 1.794 ± 0.036
2.405LysArg: 2.405 ± 0.037
2.397LysSer: 2.397 ± 0.043
2.503LysThr: 2.503 ± 0.041
2.861LysVal: 2.861 ± 0.049
0.438LysTrp: 0.438 ± 0.016
0.889LysTyr: 0.889 ± 0.025
0.0LysXaa: 0.0 ± 0.0
Leu
12.796LeuAla: 12.796 ± 0.123
0.972LeuCys: 0.972 ± 0.025
5.58LeuAsp: 5.58 ± 0.057
5.168LeuGlu: 5.168 ± 0.068
3.838LeuPhe: 3.838 ± 0.069
7.763LeuGly: 7.763 ± 0.086
2.362LeuHis: 2.362 ± 0.041
5.403LeuIle: 5.403 ± 0.074
4.244LeuLys: 4.244 ± 0.055
11.071LeuLeu: 11.071 ± 0.123
2.531LeuMet: 2.531 ± 0.042
3.446LeuAsn: 3.446 ± 0.049
5.711LeuPro: 5.711 ± 0.076
4.372LeuGln: 4.372 ± 0.055
6.456LeuArg: 6.456 ± 0.072
7.083LeuSer: 7.083 ± 0.07
5.694LeuThr: 5.694 ± 0.058
6.939LeuVal: 6.939 ± 0.079
1.06LeuTrp: 1.06 ± 0.028
2.233LeuTyr: 2.233 ± 0.039
0.0LeuXaa: 0.0 ± 0.0
Met
2.788MetAla: 2.788 ± 0.043
0.176MetCys: 0.176 ± 0.011
1.19MetAsp: 1.19 ± 0.027
1.215MetGlu: 1.215 ± 0.028
0.84MetPhe: 0.84 ± 0.026
1.662MetGly: 1.662 ± 0.039
0.626MetHis: 0.626 ± 0.021
1.319MetIle: 1.319 ± 0.029
1.223MetLys: 1.223 ± 0.027
2.962MetLeu: 2.962 ± 0.049
0.711MetMet: 0.711 ± 0.024
1.002MetAsn: 1.002 ± 0.027
1.537MetPro: 1.537 ± 0.029
1.303MetGln: 1.303 ± 0.029
1.748MetArg: 1.748 ± 0.036
1.889MetSer: 1.889 ± 0.035
1.742MetThr: 1.742 ± 0.035
1.715MetVal: 1.715 ± 0.037
0.224MetTrp: 0.224 ± 0.011
0.503MetTyr: 0.503 ± 0.018
0.0MetXaa: 0.0 ± 0.0
Asn
3.912AsnAla: 3.912 ± 0.058
0.282AsnCys: 0.282 ± 0.014
1.787AsnAsp: 1.787 ± 0.036
1.468AsnGlu: 1.468 ± 0.031
1.176AsnPhe: 1.176 ± 0.032
2.839AsnGly: 2.839 ± 0.055
0.618AsnHis: 0.618 ± 0.018
1.91AsnIle: 1.91 ± 0.039
1.303AsnLys: 1.303 ± 0.03
3.328AsnLeu: 3.328 ± 0.051
0.873AsnMet: 0.873 ± 0.028
1.184AsnAsn: 1.184 ± 0.037
1.9AsnPro: 1.9 ± 0.036
1.24AsnGln: 1.24 ± 0.032
1.804AsnArg: 1.804 ± 0.035
1.709AsnSer: 1.709 ± 0.039
1.862AsnThr: 1.862 ± 0.039
2.405AsnVal: 2.405 ± 0.042
0.476AsnTrp: 0.476 ± 0.02
0.912AsnTyr: 0.912 ± 0.024
0.0AsnXaa: 0.0 ± 0.0
Pro
5.776ProAla: 5.776 ± 0.068
0.302ProCys: 0.302 ± 0.012
2.869ProAsp: 2.869 ± 0.044
2.92ProGlu: 2.92 ± 0.041
1.811ProPhe: 1.811 ± 0.035
3.679ProGly: 3.679 ± 0.055
1.099ProHis: 1.099 ± 0.026
2.056ProIle: 2.056 ± 0.04
1.619ProLys: 1.619 ± 0.034
4.852ProLeu: 4.852 ± 0.06
1.112ProMet: 1.112 ± 0.024
1.407ProAsn: 1.407 ± 0.032
2.005ProPro: 2.005 ± 0.045
2.101ProGln: 2.101 ± 0.036
2.21ProArg: 2.21 ± 0.043
2.707ProSer: 2.707 ± 0.041
2.397ProThr: 2.397 ± 0.044
3.763ProVal: 3.763 ± 0.053
0.577ProTrp: 0.577 ± 0.02
1.219ProTyr: 1.219 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
5.16GlnAla: 5.16 ± 0.072
0.27GlnCys: 0.27 ± 0.013
1.831GlnAsp: 1.831 ± 0.035
2.01GlnGlu: 2.01 ± 0.044
1.452GlnPhe: 1.452 ± 0.028
2.808GlnGly: 2.808 ± 0.038
0.998GlnHis: 0.998 ± 0.024
2.297GlnIle: 2.297 ± 0.038
1.537GlnLys: 1.537 ± 0.035
4.438GlnLeu: 4.438 ± 0.06
1.073GlnMet: 1.073 ± 0.024
1.247GlnAsn: 1.247 ± 0.028
1.862GlnPro: 1.862 ± 0.036
2.259GlnGln: 2.259 ± 0.042
2.932GlnArg: 2.932 ± 0.046
2.368GlnSer: 2.368 ± 0.043
2.151GlnThr: 2.151 ± 0.037
2.887GlnVal: 2.887 ± 0.047
0.556GlnTrp: 0.556 ± 0.016
1.001GlnTyr: 1.001 ± 0.028
0.0GlnXaa: 0.0 ± 0.0
Arg
5.944ArgAla: 5.944 ± 0.068
0.482ArgCys: 0.482 ± 0.017
3.245ArgAsp: 3.245 ± 0.045
3.381ArgGlu: 3.381 ± 0.051
2.449ArgPhe: 2.449 ± 0.043
3.77ArgGly: 3.77 ± 0.051
1.66ArgHis: 1.66 ± 0.033
3.739ArgIle: 3.739 ± 0.049
2.477ArgLys: 2.477 ± 0.039
6.562ArgLeu: 6.562 ± 0.075
1.731ArgMet: 1.731 ± 0.031
2.104ArgAsn: 2.104 ± 0.036
2.389ArgPro: 2.389 ± 0.04
2.841ArgGln: 2.841 ± 0.045
4.047ArgArg: 4.047 ± 0.066
3.315ArgSer: 3.315 ± 0.048
2.992ArgThr: 2.992 ± 0.047
4.11ArgVal: 4.11 ± 0.052
0.824ArgTrp: 0.824 ± 0.024
1.785ArgTyr: 1.785 ± 0.034
0.0ArgXaa: 0.0 ± 0.0
Ser
6.756SerAla: 6.756 ± 0.071
0.463SerCys: 0.463 ± 0.016
3.09SerAsp: 3.09 ± 0.043
2.836SerGlu: 2.836 ± 0.042
2.443SerPhe: 2.443 ± 0.042
5.634SerGly: 5.634 ± 0.077
1.271SerHis: 1.271 ± 0.027
3.257SerIle: 3.257 ± 0.051
2.182SerLys: 2.182 ± 0.036
6.4SerLeu: 6.4 ± 0.071
1.569SerMet: 1.569 ± 0.033
1.902SerAsn: 1.902 ± 0.042
2.775SerPro: 2.775 ± 0.046
2.078SerGln: 2.078 ± 0.04
3.351SerArg: 3.351 ± 0.05
4.044SerSer: 4.044 ± 0.069
3.269SerThr: 3.269 ± 0.057
4.393SerVal: 4.393 ± 0.053
0.764SerTrp: 0.764 ± 0.024
1.627SerTyr: 1.627 ± 0.039
0.0SerXaa: 0.0 ± 0.0
Thr
5.885ThrAla: 5.885 ± 0.066
0.401ThrCys: 0.401 ± 0.018
2.686ThrAsp: 2.686 ± 0.046
2.468ThrGlu: 2.468 ± 0.044
1.863ThrPhe: 1.863 ± 0.036
4.545ThrGly: 4.545 ± 0.075
1.175ThrHis: 1.175 ± 0.029
2.948ThrIle: 2.948 ± 0.051
1.555ThrLys: 1.555 ± 0.033
6.06ThrLeu: 6.06 ± 0.067
1.338ThrMet: 1.338 ± 0.029
1.571ThrAsn: 1.571 ± 0.04
3.313ThrPro: 3.313 ± 0.051
2.074ThrGln: 2.074 ± 0.036
3.037ThrArg: 3.037 ± 0.045
3.255ThrSer: 3.255 ± 0.056
3.039ThrThr: 3.039 ± 0.054
4.261ThrVal: 4.261 ± 0.059
0.586ThrTrp: 0.586 ± 0.019
1.375ThrTyr: 1.375 ± 0.034
0.0ThrXaa: 0.0 ± 0.0
Val
8.675ValAla: 8.675 ± 0.086
0.672ValCys: 0.672 ± 0.023
3.945ValAsp: 3.945 ± 0.055
3.921ValGlu: 3.921 ± 0.052
2.742ValPhe: 2.742 ± 0.047
5.407ValGly: 5.407 ± 0.068
1.369ValHis: 1.369 ± 0.029
4.163ValIle: 4.163 ± 0.064
2.961ValLys: 2.961 ± 0.046
7.64ValLeu: 7.64 ± 0.09
1.961ValMet: 1.961 ± 0.037
2.452ValAsn: 2.452 ± 0.045
3.264ValPro: 3.264 ± 0.049
2.676ValGln: 2.676 ± 0.039
4.266ValArg: 4.266 ± 0.057
4.68ValSer: 4.68 ± 0.063
4.215ValThr: 4.215 ± 0.054
5.975ValVal: 5.975 ± 0.074
0.834ValTrp: 0.834 ± 0.021
1.646ValTyr: 1.646 ± 0.031
0.0ValXaa: 0.0 ± 0.0
Trp
0.969TrpAla: 0.969 ± 0.024
0.118TrpCys: 0.118 ± 0.008
0.533TrpAsp: 0.533 ± 0.019
0.544TrpGlu: 0.544 ± 0.02
0.557TrpPhe: 0.557 ± 0.02
0.758TrpGly: 0.758 ± 0.02
0.36TrpHis: 0.36 ± 0.014
0.726TrpIle: 0.726 ± 0.02
0.508TrpLys: 0.508 ± 0.017
1.719TrpLeu: 1.719 ± 0.04
0.42TrpMet: 0.42 ± 0.016
0.496TrpAsn: 0.496 ± 0.017
0.513TrpPro: 0.513 ± 0.021
0.708TrpGln: 0.708 ± 0.021
1.005TrpArg: 1.005 ± 0.029
0.78TrpSer: 0.78 ± 0.022
0.593TrpThr: 0.593 ± 0.021
0.769TrpVal: 0.769 ± 0.021
0.193TrpTrp: 0.193 ± 0.011
0.317TrpTyr: 0.317 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.739TyrAla: 2.739 ± 0.044
0.262TyrCys: 0.262 ± 0.014
1.389TyrAsp: 1.389 ± 0.032
1.191TyrGlu: 1.191 ± 0.025
1.173TyrPhe: 1.173 ± 0.028
2.129TyrGly: 2.129 ± 0.04
0.494TyrHis: 0.494 ± 0.018
1.075TyrIle: 1.075 ± 0.028
0.928TyrLys: 0.928 ± 0.026
2.751TyrLeu: 2.751 ± 0.046
0.517TyrMet: 0.517 ± 0.015
0.766TyrAsn: 0.766 ± 0.024
1.306TyrPro: 1.306 ± 0.029
1.09TyrGln: 1.09 ± 0.027
1.791TyrArg: 1.791 ± 0.034
1.493TyrSer: 1.493 ± 0.029
1.361TyrThr: 1.361 ± 0.038
1.86TyrVal: 1.86 ± 0.037
0.404TyrTrp: 0.404 ± 0.016
0.723TyrTyr: 0.723 ± 0.021
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4973 proteins (1635139 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski