Amino acid dipepetide frequency for Enterobacter sp. BIGb0383

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.834AlaAla: 10.834 ± 0.105
1.122AlaCys: 1.122 ± 0.031
5.369AlaAsp: 5.369 ± 0.068
5.739AlaGlu: 5.739 ± 0.076
3.695AlaPhe: 3.695 ± 0.053
8.202AlaGly: 8.202 ± 0.105
1.809AlaHis: 1.809 ± 0.037
5.928AlaIle: 5.928 ± 0.069
3.553AlaLys: 3.553 ± 0.053
12.462AlaLeu: 12.462 ± 0.111
2.894AlaMet: 2.894 ± 0.048
2.894AlaAsn: 2.894 ± 0.048
3.69AlaPro: 3.69 ± 0.053
4.514AlaGln: 4.514 ± 0.058
5.994AlaArg: 5.994 ± 0.068
5.935AlaSer: 5.935 ± 0.08
5.006AlaThr: 5.006 ± 0.071
6.967AlaVal: 6.967 ± 0.08
1.746AlaTrp: 1.746 ± 0.04
1.908AlaTyr: 1.908 ± 0.04
0.0AlaXaa: 0.0 ± 0.0
Cys
0.984CysAla: 0.984 ± 0.023
0.162CysCys: 0.162 ± 0.011
0.596CysAsp: 0.596 ± 0.022
0.544CysGlu: 0.544 ± 0.021
0.423CysPhe: 0.423 ± 0.019
1.015CysGly: 1.015 ± 0.027
0.305CysHis: 0.305 ± 0.016
0.529CysIle: 0.529 ± 0.02
0.271CysLys: 0.271 ± 0.013
0.978CysLeu: 0.978 ± 0.03
0.216CysMet: 0.216 ± 0.013
0.293CysAsn: 0.293 ± 0.014
0.487CysPro: 0.487 ± 0.022
0.424CysGln: 0.424 ± 0.018
0.615CysArg: 0.615 ± 0.018
0.596CysSer: 0.596 ± 0.021
0.501CysThr: 0.501 ± 0.019
0.701CysVal: 0.701 ± 0.023
0.174CysTrp: 0.174 ± 0.01
0.308CysTyr: 0.308 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
5.59AspAla: 5.59 ± 0.064
0.507AspCys: 0.507 ± 0.02
3.033AspAsp: 3.033 ± 0.048
3.557AspGlu: 3.557 ± 0.054
2.124AspPhe: 2.124 ± 0.039
4.085AspGly: 4.085 ± 0.063
0.949AspHis: 0.949 ± 0.027
3.476AspIle: 3.476 ± 0.051
2.501AspLys: 2.501 ± 0.047
4.818AspLeu: 4.818 ± 0.057
1.339AspMet: 1.339 ± 0.034
2.26AspAsn: 2.26 ± 0.04
2.352AspPro: 2.352 ± 0.042
1.531AspGln: 1.531 ± 0.035
2.889AspArg: 2.889 ± 0.045
2.992AspSer: 2.992 ± 0.052
2.68AspThr: 2.68 ± 0.05
3.84AspVal: 3.84 ± 0.055
0.811AspTrp: 0.811 ± 0.023
1.91AspTyr: 1.91 ± 0.037
0.0AspXaa: 0.0 ± 0.0
Glu
5.421GluAla: 5.421 ± 0.069
0.426GluCys: 0.426 ± 0.02
2.343GluAsp: 2.343 ± 0.044
3.286GluGlu: 3.286 ± 0.049
1.763GluPhe: 1.763 ± 0.034
3.675GluGly: 3.675 ± 0.051
1.304GluHis: 1.304 ± 0.029
3.193GluIle: 3.193 ± 0.057
3.122GluLys: 3.122 ± 0.056
5.72GluLeu: 5.72 ± 0.058
1.783GluMet: 1.783 ± 0.035
2.246GluAsn: 2.246 ± 0.036
2.116GluPro: 2.116 ± 0.037
3.381GluGln: 3.381 ± 0.054
3.588GluArg: 3.588 ± 0.051
3.083GluSer: 3.083 ± 0.049
2.97GluThr: 2.97 ± 0.045
3.691GluVal: 3.691 ± 0.045
0.796GluTrp: 0.796 ± 0.024
1.434GluTyr: 1.434 ± 0.027
0.0GluXaa: 0.0 ± 0.0
Phe
3.765PheAla: 3.765 ± 0.052
0.507PheCys: 0.507 ± 0.018
2.367PheAsp: 2.367 ± 0.04
1.668PheGlu: 1.668 ± 0.036
1.578PhePhe: 1.578 ± 0.034
3.155PheGly: 3.155 ± 0.045
0.77PheHis: 0.77 ± 0.022
2.507PheIle: 2.507 ± 0.039
1.189PheLys: 1.189 ± 0.028
3.236PheLeu: 3.236 ± 0.054
0.921PheMet: 0.921 ± 0.027
1.71PheAsn: 1.71 ± 0.034
1.517PhePro: 1.517 ± 0.04
1.135PheGln: 1.135 ± 0.026
1.947PheArg: 1.947 ± 0.042
3.239PheSer: 3.239 ± 0.044
2.397PheThr: 2.397 ± 0.036
2.485PheVal: 2.485 ± 0.048
0.634PheTrp: 0.634 ± 0.022
1.211PheTyr: 1.211 ± 0.028
0.0PheXaa: 0.0 ± 0.0
Gly
6.344GlyAla: 6.344 ± 0.077
0.972GlyCys: 0.972 ± 0.027
3.987GlyAsp: 3.987 ± 0.064
4.783GlyGlu: 4.783 ± 0.06
3.288GlyPhe: 3.288 ± 0.054
5.635GlyGly: 5.635 ± 0.086
1.58GlyHis: 1.58 ± 0.033
4.97GlyIle: 4.97 ± 0.066
3.852GlyLys: 3.852 ± 0.064
7.982GlyLeu: 7.982 ± 0.08
2.358GlyMet: 2.358 ± 0.045
2.783GlyAsn: 2.783 ± 0.059
2.083GlyPro: 2.083 ± 0.039
3.171GlyGln: 3.171 ± 0.056
3.997GlyArg: 3.997 ± 0.055
4.54GlySer: 4.54 ± 0.073
3.964GlyThr: 3.964 ± 0.081
5.774GlyVal: 5.774 ± 0.064
1.311GlyTrp: 1.311 ± 0.031
2.573GlyTyr: 2.573 ± 0.048
0.0GlyXaa: 0.0 ± 0.0
His
1.872HisAla: 1.872 ± 0.04
0.294HisCys: 0.294 ± 0.014
1.119HisAsp: 1.119 ± 0.029
1.04HisGlu: 1.04 ± 0.028
1.05HisPhe: 1.05 ± 0.027
1.644HisGly: 1.644 ± 0.034
0.724HisHis: 0.724 ± 0.024
1.266HisIle: 1.266 ± 0.029
0.693HisLys: 0.693 ± 0.023
2.173HisLeu: 2.173 ± 0.042
0.511HisMet: 0.511 ± 0.019
0.784HisAsn: 0.784 ± 0.025
1.284HisPro: 1.284 ± 0.031
1.123HisGln: 1.123 ± 0.026
1.229HisArg: 1.229 ± 0.032
1.272HisSer: 1.272 ± 0.028
1.032HisThr: 1.032 ± 0.024
1.196HisVal: 1.196 ± 0.029
0.389HisTrp: 0.389 ± 0.015
0.879HisTyr: 0.879 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
6.585IleAla: 6.585 ± 0.074
0.628IleCys: 0.628 ± 0.021
3.571IleAsp: 3.571 ± 0.061
3.162IleGlu: 3.162 ± 0.048
1.97IlePhe: 1.97 ± 0.04
4.52IleGly: 4.52 ± 0.059
1.126IleHis: 1.126 ± 0.031
3.191IleIle: 3.191 ± 0.052
2.151IleLys: 2.151 ± 0.043
4.936IleLeu: 4.936 ± 0.069
1.169IleMet: 1.169 ± 0.031
2.456IleAsn: 2.456 ± 0.046
2.641IlePro: 2.641 ± 0.042
1.789IleGln: 1.789 ± 0.035
2.997IleArg: 2.997 ± 0.049
3.709IleSer: 3.709 ± 0.053
3.485IleThr: 3.485 ± 0.062
3.906IleVal: 3.906 ± 0.059
0.692IleTrp: 0.692 ± 0.022
1.47IleTyr: 1.47 ± 0.033
0.0IleXaa: 0.0 ± 0.0
Lys
4.089LysAla: 4.089 ± 0.051
0.254LysCys: 0.254 ± 0.014
1.942LysAsp: 1.942 ± 0.041
2.328LysGlu: 2.328 ± 0.043
1.026LysPhe: 1.026 ± 0.026
2.876LysGly: 2.876 ± 0.05
0.764LysHis: 0.764 ± 0.025
2.235LysIle: 2.235 ± 0.041
2.077LysLys: 2.077 ± 0.04
3.871LysLeu: 3.871 ± 0.058
1.194LysMet: 1.194 ± 0.03
1.645LysAsn: 1.645 ± 0.037
1.94LysPro: 1.94 ± 0.041
1.76LysGln: 1.76 ± 0.034
2.38LysArg: 2.38 ± 0.039
2.251LysSer: 2.251 ± 0.04
2.501LysThr: 2.501 ± 0.045
2.848LysVal: 2.848 ± 0.049
0.428LysTrp: 0.428 ± 0.018
1.027LysTyr: 1.027 ± 0.029
0.0LysXaa: 0.0 ± 0.0
Leu
11.991LeuAla: 11.991 ± 0.118
1.265LeuCys: 1.265 ± 0.033
5.648LeuAsp: 5.648 ± 0.07
5.424LeuGlu: 5.424 ± 0.065
4.267LeuPhe: 4.267 ± 0.071
7.447LeuGly: 7.447 ± 0.081
2.249LeuHis: 2.249 ± 0.041
5.731LeuIle: 5.731 ± 0.064
4.393LeuLys: 4.393 ± 0.059
12.429LeuLeu: 12.429 ± 0.15
2.943LeuMet: 2.943 ± 0.045
4.305LeuAsn: 4.305 ± 0.065
5.705LeuPro: 5.705 ± 0.078
4.368LeuGln: 4.368 ± 0.057
6.366LeuArg: 6.366 ± 0.073
7.737LeuSer: 7.737 ± 0.082
6.683LeuThr: 6.683 ± 0.079
7.215LeuVal: 7.215 ± 0.082
1.588LeuTrp: 1.588 ± 0.033
2.608LeuTyr: 2.608 ± 0.045
0.0LeuXaa: 0.0 ± 0.0
Met
2.855MetAla: 2.855 ± 0.04
0.166MetCys: 0.166 ± 0.011
1.182MetAsp: 1.182 ± 0.032
1.204MetGlu: 1.204 ± 0.026
0.846MetPhe: 0.846 ± 0.025
1.866MetGly: 1.866 ± 0.038
0.513MetHis: 0.513 ± 0.019
1.364MetIle: 1.364 ± 0.036
1.281MetLys: 1.281 ± 0.028
3.112MetLeu: 3.112 ± 0.05
0.886MetMet: 0.886 ± 0.027
1.051MetAsn: 1.051 ± 0.027
1.349MetPro: 1.349 ± 0.035
1.201MetGln: 1.201 ± 0.031
1.459MetArg: 1.459 ± 0.035
1.827MetSer: 1.827 ± 0.036
1.8MetThr: 1.8 ± 0.036
1.951MetVal: 1.951 ± 0.038
0.241MetTrp: 0.241 ± 0.012
0.49MetTyr: 0.49 ± 0.02
0.0MetXaa: 0.0 ± 0.0
Asn
3.611AsnAla: 3.611 ± 0.056
0.308AsnCys: 0.308 ± 0.014
2.005AsnAsp: 2.005 ± 0.044
1.79AsnGlu: 1.79 ± 0.035
1.27AsnPhe: 1.27 ± 0.029
3.081AsnGly: 3.081 ± 0.07
0.768AsnHis: 0.768 ± 0.024
2.248AsnIle: 2.248 ± 0.043
1.42AsnLys: 1.42 ± 0.035
3.531AsnLeu: 3.531 ± 0.052
0.84AsnMet: 0.84 ± 0.022
1.596AsnAsn: 1.596 ± 0.045
2.103AsnPro: 2.103 ± 0.033
1.605AsnGln: 1.605 ± 0.034
1.992AsnArg: 1.992 ± 0.039
1.97AsnSer: 1.97 ± 0.044
2.003AsnThr: 2.003 ± 0.044
2.533AsnVal: 2.533 ± 0.045
0.556AsnTrp: 0.556 ± 0.024
1.107AsnTyr: 1.107 ± 0.028
0.0AsnXaa: 0.0 ± 0.0
Pro
4.673ProAla: 4.673 ± 0.062
0.346ProCys: 0.346 ± 0.013
2.875ProAsp: 2.875 ± 0.053
3.356ProGlu: 3.356 ± 0.055
1.814ProPhe: 1.814 ± 0.037
3.58ProGly: 3.58 ± 0.05
0.993ProHis: 0.993 ± 0.026
1.692ProIle: 1.692 ± 0.035
1.368ProLys: 1.368 ± 0.037
5.265ProLeu: 5.265 ± 0.076
1.093ProMet: 1.093 ± 0.029
1.232ProAsn: 1.232 ± 0.029
1.724ProPro: 1.724 ± 0.037
2.322ProGln: 2.322 ± 0.041
2.06ProArg: 2.06 ± 0.037
2.158ProSer: 2.158 ± 0.04
2.069ProThr: 2.069 ± 0.039
3.982ProVal: 3.982 ± 0.056
0.813ProTrp: 0.813 ± 0.027
1.119ProTyr: 1.119 ± 0.03
0.0ProXaa: 0.0 ± 0.0
Gln
4.893GlnAla: 4.893 ± 0.064
0.341GlnCys: 0.341 ± 0.015
1.982GlnAsp: 1.982 ± 0.034
2.144GlnGlu: 2.144 ± 0.046
1.458GlnPhe: 1.458 ± 0.03
3.226GlnGly: 3.226 ± 0.052
1.202GlnHis: 1.202 ± 0.029
2.254GlnIle: 2.254 ± 0.041
1.771GlnLys: 1.771 ± 0.041
4.991GlnLeu: 4.991 ± 0.067
1.216GlnMet: 1.216 ± 0.028
1.417GlnAsn: 1.417 ± 0.03
2.211GlnPro: 2.211 ± 0.045
3.381GlnGln: 3.381 ± 0.073
3.074GlnArg: 3.074 ± 0.053
2.487GlnSer: 2.487 ± 0.043
2.35GlnThr: 2.35 ± 0.05
3.038GlnVal: 3.038 ± 0.043
0.634GlnTrp: 0.634 ± 0.024
1.177GlnTyr: 1.177 ± 0.03
0.0GlnXaa: 0.0 ± 0.0
Arg
4.751ArgAla: 4.751 ± 0.056
0.57ArgCys: 0.57 ± 0.017
3.219ArgAsp: 3.219 ± 0.05
3.745ArgGlu: 3.745 ± 0.056
2.619ArgPhe: 2.619 ± 0.047
3.44ArgGly: 3.44 ± 0.057
1.595ArgHis: 1.595 ± 0.035
3.403ArgIle: 3.403 ± 0.046
2.141ArgLys: 2.141 ± 0.047
6.888ArgLeu: 6.888 ± 0.083
1.598ArgMet: 1.598 ± 0.034
2.006ArgAsn: 2.006 ± 0.034
2.34ArgPro: 2.34 ± 0.04
3.322ArgGln: 3.322 ± 0.051
3.648ArgArg: 3.648 ± 0.065
2.956ArgSer: 2.956 ± 0.044
2.676ArgThr: 2.676 ± 0.043
3.997ArgVal: 3.997 ± 0.052
0.996ArgTrp: 0.996 ± 0.027
2.193ArgTyr: 2.193 ± 0.045
0.0ArgXaa: 0.0 ± 0.0
Ser
6.058SerAla: 6.058 ± 0.082
0.553SerCys: 0.553 ± 0.018
3.196SerAsp: 3.196 ± 0.058
3.23SerGlu: 3.23 ± 0.055
2.28SerPhe: 2.28 ± 0.042
5.742SerGly: 5.742 ± 0.096
1.385SerHis: 1.385 ± 0.032
3.035SerIle: 3.035 ± 0.049
1.912SerLys: 1.912 ± 0.038
7.195SerLeu: 7.195 ± 0.089
1.472SerMet: 1.472 ± 0.028
1.875SerAsn: 1.875 ± 0.04
2.689SerPro: 2.689 ± 0.043
2.635SerGln: 2.635 ± 0.046
3.734SerArg: 3.734 ± 0.05
3.642SerSer: 3.642 ± 0.068
3.097SerThr: 3.097 ± 0.053
4.528SerVal: 4.528 ± 0.066
1.071SerTrp: 1.071 ± 0.031
1.563SerTyr: 1.563 ± 0.039
0.0SerXaa: 0.0 ± 0.0
Thr
5.243ThrAla: 5.243 ± 0.07
0.474ThrCys: 0.474 ± 0.018
2.763ThrAsp: 2.763 ± 0.048
2.657ThrGlu: 2.657 ± 0.044
2.055ThrPhe: 2.055 ± 0.037
4.717ThrGly: 4.717 ± 0.08
1.216ThrHis: 1.216 ± 0.03
2.801ThrIle: 2.801 ± 0.051
1.354ThrLys: 1.354 ± 0.034
7.92ThrLeu: 7.92 ± 0.105
1.099ThrMet: 1.099 ± 0.029
1.508ThrAsn: 1.508 ± 0.038
3.255ThrPro: 3.255 ± 0.05
2.26ThrGln: 2.26 ± 0.05
3.381ThrArg: 3.381 ± 0.053
3.005ThrSer: 3.005 ± 0.052
2.986ThrThr: 2.986 ± 0.057
4.006ThrVal: 4.006 ± 0.066
0.836ThrTrp: 0.836 ± 0.027
1.093ThrTyr: 1.093 ± 0.03
0.0ThrXaa: 0.0 ± 0.0
Val
7.165ValAla: 7.165 ± 0.07
0.714ValCys: 0.714 ± 0.023
3.807ValAsp: 3.807 ± 0.064
3.911ValGlu: 3.911 ± 0.049
2.573ValPhe: 2.573 ± 0.047
4.893ValGly: 4.893 ± 0.058
1.224ValHis: 1.224 ± 0.027
4.256ValIle: 4.256 ± 0.061
2.976ValLys: 2.976 ± 0.047
7.448ValLeu: 7.448 ± 0.078
2.171ValMet: 2.171 ± 0.04
2.813ValAsn: 2.813 ± 0.05
3.047ValPro: 3.047 ± 0.054
2.454ValGln: 2.454 ± 0.04
3.819ValArg: 3.819 ± 0.057
4.873ValSer: 4.873 ± 0.064
4.398ValThr: 4.398 ± 0.066
5.668ValVal: 5.668 ± 0.08
0.969ValTrp: 0.969 ± 0.03
1.664ValTyr: 1.664 ± 0.034
0.0ValXaa: 0.0 ± 0.0
Trp
1.07TrpAla: 1.07 ± 0.029
0.166TrpCys: 0.166 ± 0.011
0.666TrpAsp: 0.666 ± 0.023
0.602TrpGlu: 0.602 ± 0.02
0.705TrpPhe: 0.705 ± 0.027
0.938TrpGly: 0.938 ± 0.027
0.46TrpHis: 0.46 ± 0.018
0.721TrpIle: 0.721 ± 0.022
0.509TrpLys: 0.509 ± 0.019
2.463TrpLeu: 2.463 ± 0.051
0.416TrpMet: 0.416 ± 0.018
0.495TrpAsn: 0.495 ± 0.017
0.692TrpPro: 0.692 ± 0.022
1.241TrpGln: 1.241 ± 0.029
1.099TrpArg: 1.099 ± 0.031
0.881TrpSer: 0.881 ± 0.028
0.617TrpThr: 0.617 ± 0.021
0.951TrpVal: 0.951 ± 0.026
0.231TrpTrp: 0.231 ± 0.014
0.42TrpTyr: 0.42 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.465TyrAla: 2.465 ± 0.043
0.346TyrCys: 0.346 ± 0.017
1.574TyrAsp: 1.574 ± 0.039
1.169TyrGlu: 1.169 ± 0.035
1.135TyrPhe: 1.135 ± 0.029
2.211TyrGly: 2.211 ± 0.039
0.638TyrHis: 0.638 ± 0.024
1.355TyrIle: 1.355 ± 0.031
0.902TyrLys: 0.902 ± 0.024
2.944TyrLeu: 2.944 ± 0.048
0.527TyrMet: 0.527 ± 0.02
0.967TyrAsn: 0.967 ± 0.031
1.31TyrPro: 1.31 ± 0.03
1.632TyrGln: 1.632 ± 0.036
1.833TyrArg: 1.833 ± 0.037
1.741TyrSer: 1.741 ± 0.038
1.41TyrThr: 1.41 ± 0.037
1.568TyrVal: 1.568 ± 0.033
0.425TyrTrp: 0.425 ± 0.019
0.896TyrTyr: 0.896 ± 0.023
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4596 proteins (1513582 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski