Amino acid dipepetide frequency for Mycobacterium haemophilum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.822AlaAla: 20.822 ± 0.187
1.07AlaCys: 1.07 ± 0.03
7.861AlaAsp: 7.861 ± 0.096
7.409AlaGlu: 7.409 ± 0.09
3.305AlaPhe: 3.305 ± 0.055
11.88AlaGly: 11.88 ± 0.133
2.653AlaHis: 2.653 ± 0.05
5.605AlaIle: 5.605 ± 0.076
3.192AlaLys: 3.192 ± 0.074
12.992AlaLeu: 12.992 ± 0.117
2.937AlaMet: 2.937 ± 0.045
2.907AlaAsn: 2.907 ± 0.055
6.036AlaPro: 6.036 ± 0.078
4.387AlaGln: 4.387 ± 0.059
8.624AlaArg: 8.624 ± 0.095
6.42AlaSer: 6.42 ± 0.073
7.406AlaThr: 7.406 ± 0.079
11.508AlaVal: 11.508 ± 0.114
1.637AlaTrp: 1.637 ± 0.037
2.321AlaTyr: 2.321 ± 0.04
0.0AlaXaa: 0.0 ± 0.0
Cys
1.17CysAla: 1.17 ± 0.03
0.109CysCys: 0.109 ± 0.011
0.59CysAsp: 0.59 ± 0.022
0.442CysGlu: 0.442 ± 0.02
0.226CysPhe: 0.226 ± 0.013
1.028CysGly: 1.028 ± 0.028
0.194CysHis: 0.194 ± 0.013
0.276CysIle: 0.276 ± 0.017
0.139CysLys: 0.139 ± 0.01
0.703CysLeu: 0.703 ± 0.027
0.137CysMet: 0.137 ± 0.013
0.192CysAsn: 0.192 ± 0.013
0.506CysPro: 0.506 ± 0.022
0.233CysGln: 0.233 ± 0.013
0.619CysArg: 0.619 ± 0.025
0.535CysSer: 0.535 ± 0.021
0.526CysThr: 0.526 ± 0.019
0.714CysVal: 0.714 ± 0.023
0.169CysTrp: 0.169 ± 0.012
0.225CysTyr: 0.225 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
7.33AspAla: 7.33 ± 0.088
0.458AspCys: 0.458 ± 0.021
3.875AspAsp: 3.875 ± 0.059
3.696AspGlu: 3.696 ± 0.064
1.681AspPhe: 1.681 ± 0.043
5.476AspGly: 5.476 ± 0.082
1.396AspHis: 1.396 ± 0.041
2.729AspIle: 2.729 ± 0.05
1.308AspLys: 1.308 ± 0.035
5.746AspLeu: 5.746 ± 0.074
1.009AspMet: 1.009 ± 0.026
1.292AspAsn: 1.292 ± 0.032
4.15AspPro: 4.15 ± 0.062
2.008AspGln: 2.008 ± 0.042
4.375AspArg: 4.375 ± 0.064
2.851AspSer: 2.851 ± 0.047
3.127AspThr: 3.127 ± 0.053
5.128AspVal: 5.128 ± 0.069
0.885AspTrp: 0.885 ± 0.026
1.416AspTyr: 1.416 ± 0.036
0.0AspXaa: 0.0 ± 0.0
Glu
5.781GluAla: 5.781 ± 0.088
0.37GluCys: 0.37 ± 0.018
2.101GluAsp: 2.101 ± 0.041
2.159GluGlu: 2.159 ± 0.05
1.765GluPhe: 1.765 ± 0.043
2.865GluGly: 2.865 ± 0.058
1.514GluHis: 1.514 ± 0.033
2.522GluIle: 2.522 ± 0.045
1.157GluLys: 1.157 ± 0.038
6.461GluLeu: 6.461 ± 0.079
1.039GluMet: 1.039 ± 0.031
1.012GluAsn: 1.012 ± 0.027
2.771GluPro: 2.771 ± 0.051
2.216GluGln: 2.216 ± 0.047
3.937GluArg: 3.937 ± 0.058
2.662GluSer: 2.662 ± 0.052
2.538GluThr: 2.538 ± 0.049
4.354GluVal: 4.354 ± 0.068
0.688GluTrp: 0.688 ± 0.025
1.091GluTyr: 1.091 ± 0.033
0.0GluXaa: 0.0 ± 0.0
Phe
3.903PheAla: 3.903 ± 0.059
0.377PheCys: 0.377 ± 0.018
2.284PheAsp: 2.284 ± 0.041
1.444PheGlu: 1.444 ± 0.033
0.956PhePhe: 0.956 ± 0.037
3.319PheGly: 3.319 ± 0.061
0.628PheHis: 0.628 ± 0.023
1.105PheIle: 1.105 ± 0.033
0.516PheLys: 0.516 ± 0.024
2.364PheLeu: 2.364 ± 0.048
0.511PheMet: 0.511 ± 0.023
0.754PheAsn: 0.754 ± 0.026
1.392PhePro: 1.392 ± 0.031
0.713PheGln: 0.713 ± 0.022
1.613PheArg: 1.613 ± 0.036
1.743PheSer: 1.743 ± 0.036
1.949PheThr: 1.949 ± 0.047
2.514PheVal: 2.514 ± 0.048
0.427PheTrp: 0.427 ± 0.021
0.707PheTyr: 0.707 ± 0.024
0.0PheXaa: 0.0 ± 0.0
Gly
9.953GlyAla: 9.953 ± 0.134
0.921GlyCys: 0.921 ± 0.03
4.709GlyAsp: 4.709 ± 0.063
3.954GlyGlu: 3.954 ± 0.062
2.909GlyPhe: 2.909 ± 0.056
7.861GlyGly: 7.861 ± 0.213
2.075GlyHis: 2.075 ± 0.042
4.153GlyIle: 4.153 ± 0.061
2.417GlyLys: 2.417 ± 0.045
8.732GlyLeu: 8.732 ± 0.09
2.182GlyMet: 2.182 ± 0.041
2.082GlyAsn: 2.082 ± 0.06
4.515GlyPro: 4.515 ± 0.074
2.981GlyGln: 2.981 ± 0.044
6.04GlyArg: 6.04 ± 0.079
5.427GlySer: 5.427 ± 0.069
5.075GlyThr: 5.075 ± 0.067
7.512GlyVal: 7.512 ± 0.094
1.579GlyTrp: 1.579 ± 0.039
2.37GlyTyr: 2.37 ± 0.049
0.0GlyXaa: 0.0 ± 0.0
His
2.666HisAla: 2.666 ± 0.051
0.254HisCys: 0.254 ± 0.015
1.432HisAsp: 1.432 ± 0.036
1.042HisGlu: 1.042 ± 0.032
0.631HisPhe: 0.631 ± 0.022
2.165HisGly: 2.165 ± 0.048
0.699HisHis: 0.699 ± 0.028
0.903HisIle: 0.903 ± 0.026
0.41HisLys: 0.41 ± 0.018
2.092HisLeu: 2.092 ± 0.049
0.354HisMet: 0.354 ± 0.017
0.582HisAsn: 0.582 ± 0.024
1.691HisPro: 1.691 ± 0.039
0.811HisGln: 0.811 ± 0.026
1.943HisArg: 1.943 ± 0.04
1.148HisSer: 1.148 ± 0.031
1.276HisThr: 1.276 ± 0.032
1.747HisVal: 1.747 ± 0.03
0.383HisTrp: 0.383 ± 0.018
0.589HisTyr: 0.589 ± 0.02
0.0HisXaa: 0.0 ± 0.0
Ile
6.533IleAla: 6.533 ± 0.078
0.424IleCys: 0.424 ± 0.017
3.448IleAsp: 3.448 ± 0.052
2.599IleGlu: 2.599 ± 0.052
1.064IlePhe: 1.064 ± 0.034
4.569IleGly: 4.569 ± 0.067
0.831IleHis: 0.831 ± 0.023
1.583IleIle: 1.583 ± 0.041
0.961IleLys: 0.961 ± 0.031
3.079IleLeu: 3.079 ± 0.055
0.681IleMet: 0.681 ± 0.024
1.297IleAsn: 1.297 ± 0.034
2.475IlePro: 2.475 ± 0.045
1.003IleGln: 1.003 ± 0.028
2.763IleArg: 2.763 ± 0.052
2.507IleSer: 2.507 ± 0.045
3.018IleThr: 3.018 ± 0.052
3.878IleVal: 3.878 ± 0.066
0.47IleTrp: 0.47 ± 0.02
0.869IleTyr: 0.869 ± 0.03
0.0IleXaa: 0.0 ± 0.0
Lys
2.694LysAla: 2.694 ± 0.061
0.122LysCys: 0.122 ± 0.009
1.114LysAsp: 1.114 ± 0.033
0.92LysGlu: 0.92 ± 0.031
0.623LysPhe: 0.623 ± 0.024
1.498LysGly: 1.498 ± 0.042
0.509LysHis: 0.509 ± 0.02
1.047LysIle: 1.047 ± 0.036
0.662LysLys: 0.662 ± 0.033
2.331LysLeu: 2.331 ± 0.051
0.507LysMet: 0.507 ± 0.023
0.551LysAsn: 0.551 ± 0.026
1.524LysPro: 1.524 ± 0.034
0.77LysGln: 0.77 ± 0.031
1.621LysArg: 1.621 ± 0.043
1.34LysSer: 1.34 ± 0.039
1.468LysThr: 1.468 ± 0.035
1.925LysVal: 1.925 ± 0.043
0.312LysTrp: 0.312 ± 0.016
0.5LysTyr: 0.5 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
14.481LeuAla: 14.481 ± 0.14
0.875LeuCys: 0.875 ± 0.029
6.276LeuAsp: 6.276 ± 0.079
4.093LeuGlu: 4.093 ± 0.069
2.593LeuPhe: 2.593 ± 0.053
8.759LeuGly: 8.759 ± 0.096
2.184LeuHis: 2.184 ± 0.041
4.244LeuIle: 4.244 ± 0.067
1.821LeuLys: 1.821 ± 0.043
10.071LeuLeu: 10.071 ± 0.114
1.831LeuMet: 1.831 ± 0.041
2.234LeuAsn: 2.234 ± 0.05
5.893LeuPro: 5.893 ± 0.077
2.828LeuGln: 2.828 ± 0.053
7.805LeuArg: 7.805 ± 0.106
5.835LeuSer: 5.835 ± 0.075
6.462LeuThr: 6.462 ± 0.069
8.674LeuVal: 8.674 ± 0.098
1.26LeuTrp: 1.26 ± 0.034
1.672LeuTyr: 1.672 ± 0.038
0.0LeuXaa: 0.0 ± 0.0
Met
2.657MetAla: 2.657 ± 0.046
0.194MetCys: 0.194 ± 0.012
0.844MetAsp: 0.844 ± 0.028
0.688MetGlu: 0.688 ± 0.024
0.619MetPhe: 0.619 ± 0.024
1.483MetGly: 1.483 ± 0.04
0.433MetHis: 0.433 ± 0.02
0.899MetIle: 0.899 ± 0.03
0.446MetLys: 0.446 ± 0.019
2.216MetLeu: 2.216 ± 0.047
0.435MetMet: 0.435 ± 0.022
0.51MetAsn: 0.51 ± 0.019
1.257MetPro: 1.257 ± 0.031
0.592MetGln: 0.592 ± 0.023
1.584MetArg: 1.584 ± 0.034
1.604MetSer: 1.604 ± 0.04
1.803MetThr: 1.803 ± 0.038
1.718MetVal: 1.718 ± 0.039
0.303MetTrp: 0.303 ± 0.016
0.358MetTyr: 0.358 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
2.897AsnAla: 2.897 ± 0.056
0.224AsnCys: 0.224 ± 0.013
1.308AsnAsp: 1.308 ± 0.037
1.03AsnGlu: 1.03 ± 0.03
0.712AsnPhe: 0.712 ± 0.023
2.208AsnGly: 2.208 ± 0.062
0.53AsnHis: 0.53 ± 0.02
1.095AsnIle: 1.095 ± 0.034
0.482AsnLys: 0.482 ± 0.02
2.179AsnLeu: 2.179 ± 0.045
0.437AsnMet: 0.437 ± 0.018
0.668AsnAsn: 0.668 ± 0.027
1.796AsnPro: 1.796 ± 0.042
0.786AsnGln: 0.786 ± 0.026
1.675AsnArg: 1.675 ± 0.04
1.316AsnSer: 1.316 ± 0.039
1.378AsnThr: 1.378 ± 0.033
1.768AsnVal: 1.768 ± 0.032
0.413AsnTrp: 0.413 ± 0.019
0.618AsnTyr: 0.618 ± 0.022
0.0AsnXaa: 0.0 ± 0.0
Pro
7.232ProAla: 7.232 ± 0.099
0.328ProCys: 0.328 ± 0.019
4.162ProAsp: 4.162 ± 0.062
3.295ProGlu: 3.295 ± 0.047
1.566ProPhe: 1.566 ± 0.033
5.633ProGly: 5.633 ± 0.093
1.266ProHis: 1.266 ± 0.033
2.255ProIle: 2.255 ± 0.044
1.285ProLys: 1.285 ± 0.035
4.885ProLeu: 4.885 ± 0.073
1.186ProMet: 1.186 ± 0.03
1.37ProAsn: 1.37 ± 0.035
3.823ProPro: 3.823 ± 0.092
1.957ProGln: 1.957 ± 0.045
3.413ProArg: 3.413 ± 0.063
3.19ProSer: 3.19 ± 0.055
3.565ProThr: 3.565 ± 0.067
4.828ProVal: 4.828 ± 0.071
0.895ProTrp: 0.895 ± 0.026
1.17ProTyr: 1.17 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
4.053GlnAla: 4.053 ± 0.063
0.247GlnCys: 0.247 ± 0.016
1.261GlnAsp: 1.261 ± 0.035
1.159GlnGlu: 1.159 ± 0.032
1.043GlnPhe: 1.043 ± 0.029
2.027GlnGly: 2.027 ± 0.045
0.899GlnHis: 0.899 ± 0.03
1.636GlnIle: 1.636 ± 0.038
0.625GlnLys: 0.625 ± 0.024
4.109GlnLeu: 4.109 ± 0.055
0.707GlnMet: 0.707 ± 0.029
0.652GlnAsn: 0.652 ± 0.021
2.111GlnPro: 2.111 ± 0.048
1.548GlnGln: 1.548 ± 0.037
3.095GlnArg: 3.095 ± 0.055
1.574GlnSer: 1.574 ± 0.039
1.729GlnThr: 1.729 ± 0.042
2.912GlnVal: 2.912 ± 0.045
0.558GlnTrp: 0.558 ± 0.021
0.695GlnTyr: 0.695 ± 0.021
0.0GlnXaa: 0.0 ± 0.0
Arg
8.225ArgAla: 8.225 ± 0.096
0.673ArgCys: 0.673 ± 0.024
4.2ArgAsp: 4.2 ± 0.065
3.496ArgGlu: 3.496 ± 0.061
2.419ArgPhe: 2.419 ± 0.045
5.188ArgGly: 5.188 ± 0.069
1.803ArgHis: 1.803 ± 0.04
3.494ArgIle: 3.494 ± 0.057
1.66ArgLys: 1.66 ± 0.036
7.718ArgLeu: 7.718 ± 0.093
1.768ArgMet: 1.768 ± 0.038
1.741ArgAsn: 1.741 ± 0.034
3.913ArgPro: 3.913 ± 0.06
2.546ArgGln: 2.546 ± 0.05
6.743ArgArg: 6.743 ± 0.105
4.021ArgSer: 4.021 ± 0.064
3.885ArgThr: 3.885 ± 0.063
5.602ArgVal: 5.602 ± 0.074
1.381ArgTrp: 1.381 ± 0.037
1.968ArgTyr: 1.968 ± 0.042
0.0ArgXaa: 0.0 ± 0.0
Ser
7.21SerAla: 7.21 ± 0.094
0.439SerCys: 0.439 ± 0.02
3.254SerAsp: 3.254 ± 0.051
2.592SerGlu: 2.592 ± 0.043
1.632SerPhe: 1.632 ± 0.041
5.825SerGly: 5.825 ± 0.081
1.179SerHis: 1.179 ± 0.033
2.352SerIle: 2.352 ± 0.047
1.268SerLys: 1.268 ± 0.033
5.041SerLeu: 5.041 ± 0.068
1.391SerMet: 1.391 ± 0.034
1.27SerAsn: 1.27 ± 0.037
3.127SerPro: 3.127 ± 0.048
1.725SerGln: 1.725 ± 0.034
3.908SerArg: 3.908 ± 0.052
3.397SerSer: 3.397 ± 0.06
3.392SerThr: 3.392 ± 0.052
4.678SerVal: 4.678 ± 0.066
0.991SerTrp: 0.991 ± 0.032
1.34SerTyr: 1.34 ± 0.036
0.0SerXaa: 0.0 ± 0.0
Thr
7.917ThrAla: 7.917 ± 0.095
0.434ThrCys: 0.434 ± 0.021
3.581ThrAsp: 3.581 ± 0.053
2.998ThrGlu: 2.998 ± 0.043
1.689ThrPhe: 1.689 ± 0.037
5.638ThrGly: 5.638 ± 0.074
1.275ThrHis: 1.275 ± 0.03
2.648ThrIle: 2.648 ± 0.05
1.354ThrLys: 1.354 ± 0.034
5.693ThrLeu: 5.693 ± 0.07
1.283ThrMet: 1.283 ± 0.034
1.442ThrAsn: 1.442 ± 0.037
3.717ThrPro: 3.717 ± 0.067
1.672ThrGln: 1.672 ± 0.041
3.645ThrArg: 3.645 ± 0.057
3.319ThrSer: 3.319 ± 0.061
3.962ThrThr: 3.962 ± 0.074
5.773ThrVal: 5.773 ± 0.081
0.817ThrTrp: 0.817 ± 0.026
1.254ThrTyr: 1.254 ± 0.031
0.0ThrXaa: 0.0 ± 0.0
Val
11.489ValAla: 11.489 ± 0.096
0.81ValCys: 0.81 ± 0.027
5.554ValAsp: 5.554 ± 0.073
4.307ValGlu: 4.307 ± 0.068
2.532ValPhe: 2.532 ± 0.05
7.043ValGly: 7.043 ± 0.085
1.881ValHis: 1.881 ± 0.04
4.182ValIle: 4.182 ± 0.07
1.694ValLys: 1.694 ± 0.05
9.201ValLeu: 9.201 ± 0.095
1.623ValMet: 1.623 ± 0.036
2.1ValAsn: 2.1 ± 0.044
4.519ValPro: 4.519 ± 0.06
2.3ValGln: 2.3 ± 0.045
5.851ValArg: 5.851 ± 0.075
4.958ValSer: 4.958 ± 0.063
5.443ValThr: 5.443 ± 0.077
8.698ValVal: 8.698 ± 0.103
1.047ValTrp: 1.047 ± 0.033
1.515ValTyr: 1.515 ± 0.033
0.0ValXaa: 0.0 ± 0.0
Trp
1.485TrpAla: 1.485 ± 0.037
0.169TrpCys: 0.169 ± 0.012
0.76TrpAsp: 0.76 ± 0.026
0.622TrpGlu: 0.622 ± 0.023
0.49TrpPhe: 0.49 ± 0.021
0.963TrpGly: 0.963 ± 0.03
0.366TrpHis: 0.366 ± 0.017
0.664TrpIle: 0.664 ± 0.026
0.298TrpLys: 0.298 ± 0.014
1.848TrpLeu: 1.848 ± 0.048
0.335TrpMet: 0.335 ± 0.019
0.402TrpAsn: 0.402 ± 0.019
0.821TrpPro: 0.821 ± 0.028
0.672TrpGln: 0.672 ± 0.024
1.315TrpArg: 1.315 ± 0.038
0.969TrpSer: 0.969 ± 0.032
0.899TrpThr: 0.899 ± 0.03
1.167TrpVal: 1.167 ± 0.036
0.369TrpTrp: 0.369 ± 0.017
0.342TrpTyr: 0.342 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.472TyrAla: 2.472 ± 0.046
0.235TyrCys: 0.235 ± 0.014
1.325TyrAsp: 1.325 ± 0.037
1.057TyrGlu: 1.057 ± 0.032
0.766TyrPhe: 0.766 ± 0.026
2.014TyrGly: 2.014 ± 0.041
0.506TyrHis: 0.506 ± 0.021
0.659TyrIle: 0.659 ± 0.025
0.344TyrLys: 0.344 ± 0.017
2.431TyrLeu: 2.431 ± 0.043
0.285TyrMet: 0.285 ± 0.016
0.461TyrAsn: 0.461 ± 0.018
1.266TyrPro: 1.266 ± 0.038
0.895TyrGln: 0.895 ± 0.026
1.878TyrArg: 1.878 ± 0.044
1.194TyrSer: 1.194 ± 0.032
1.208TyrThr: 1.208 ± 0.035
1.651TyrVal: 1.651 ± 0.039
0.371TyrTrp: 0.371 ± 0.016
0.501TyrTyr: 0.501 ± 0.018
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3743 proteins (1208662 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski