Amino acid dipepetide frequency for Thioalbus denitrificans

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.331AlaAla: 17.331 ± 0.191
1.278AlaCys: 1.278 ± 0.036
6.064AlaAsp: 6.064 ± 0.073
8.217AlaGlu: 8.217 ± 0.103
3.707AlaPhe: 3.707 ± 0.063
12.358AlaGly: 12.358 ± 0.138
2.324AlaHis: 2.324 ± 0.043
4.645AlaIle: 4.645 ± 0.068
2.136AlaLys: 2.136 ± 0.048
14.468AlaLeu: 14.468 ± 0.141
2.835AlaMet: 2.835 ± 0.053
2.192AlaAsn: 2.192 ± 0.051
5.813AlaPro: 5.813 ± 0.11
3.247AlaGln: 3.247 ± 0.054
9.965AlaArg: 9.965 ± 0.102
4.42AlaSer: 4.42 ± 0.058
5.015AlaThr: 5.015 ± 0.066
9.128AlaVal: 9.128 ± 0.107
1.921AlaTrp: 1.921 ± 0.044
2.504AlaTyr: 2.504 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
1.066CysAla: 1.066 ± 0.032
0.158CysCys: 0.158 ± 0.012
0.54CysAsp: 0.54 ± 0.023
0.502CysGlu: 0.502 ± 0.024
0.292CysPhe: 0.292 ± 0.016
1.075CysGly: 1.075 ± 0.034
0.383CysHis: 0.383 ± 0.026
0.382CysIle: 0.382 ± 0.018
0.179CysLys: 0.179 ± 0.012
0.947CysLeu: 0.947 ± 0.027
0.175CysMet: 0.175 ± 0.013
0.254CysAsn: 0.254 ± 0.015
0.69CysPro: 0.69 ± 0.027
0.25CysGln: 0.25 ± 0.016
0.909CysArg: 0.909 ± 0.025
0.475CysSer: 0.475 ± 0.02
0.515CysThr: 0.515 ± 0.024
0.655CysVal: 0.655 ± 0.027
0.117CysTrp: 0.117 ± 0.012
0.239CysTyr: 0.239 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
6.135AspAla: 6.135 ± 0.081
0.497AspCys: 0.497 ± 0.021
2.616AspAsp: 2.616 ± 0.061
3.267AspGlu: 3.267 ± 0.065
1.925AspPhe: 1.925 ± 0.041
4.97AspGly: 4.97 ± 0.085
1.216AspHis: 1.216 ± 0.032
2.43AspIle: 2.43 ± 0.048
1.169AspLys: 1.169 ± 0.034
5.898AspLeu: 5.898 ± 0.068
1.086AspMet: 1.086 ± 0.029
1.259AspAsn: 1.259 ± 0.036
3.757AspPro: 3.757 ± 0.052
1.504AspGln: 1.504 ± 0.034
4.549AspArg: 4.549 ± 0.056
2.365AspSer: 2.365 ± 0.042
2.594AspThr: 2.594 ± 0.05
3.234AspVal: 3.234 ± 0.061
0.904AspTrp: 0.904 ± 0.028
1.685AspTyr: 1.685 ± 0.039
0.0AspXaa: 0.0 ± 0.0
Glu
8.504GluAla: 8.504 ± 0.1
0.483GluCys: 0.483 ± 0.019
2.711GluAsp: 2.711 ± 0.05
4.175GluGlu: 4.175 ± 0.073
1.915GluPhe: 1.915 ± 0.047
5.031GluGly: 5.031 ± 0.073
1.533GluHis: 1.533 ± 0.033
2.745GluIle: 2.745 ± 0.045
1.762GluLys: 1.762 ± 0.043
7.249GluLeu: 7.249 ± 0.085
1.583GluMet: 1.583 ± 0.032
1.519GluAsn: 1.519 ± 0.032
3.478GluPro: 3.478 ± 0.059
2.866GluGln: 2.866 ± 0.063
6.802GluArg: 6.802 ± 0.096
3.202GluSer: 3.202 ± 0.058
3.459GluThr: 3.459 ± 0.051
5.05GluVal: 5.05 ± 0.085
0.829GluTrp: 0.829 ± 0.027
1.386GluTyr: 1.386 ± 0.035
0.0GluXaa: 0.0 ± 0.0
Phe
3.479PheAla: 3.479 ± 0.064
0.394PheCys: 0.394 ± 0.018
2.167PheAsp: 2.167 ± 0.04
2.083PheGlu: 2.083 ± 0.044
1.179PhePhe: 1.179 ± 0.037
3.169PheGly: 3.169 ± 0.053
0.808PheHis: 0.808 ± 0.029
1.52PheIle: 1.52 ± 0.042
0.647PheLys: 0.647 ± 0.025
3.608PheLeu: 3.608 ± 0.06
0.786PheMet: 0.786 ± 0.024
0.997PheAsn: 0.997 ± 0.031
1.616PhePro: 1.616 ± 0.036
0.973PheGln: 0.973 ± 0.032
2.528PheArg: 2.528 ± 0.043
1.907PheSer: 1.907 ± 0.046
1.938PheThr: 1.938 ± 0.038
2.251PheVal: 2.251 ± 0.045
0.454PheTrp: 0.454 ± 0.02
0.961PheTyr: 0.961 ± 0.031
0.0PheXaa: 0.0 ± 0.0
Gly
9.449GlyAla: 9.449 ± 0.111
1.143GlyCys: 1.143 ± 0.032
4.567GlyAsp: 4.567 ± 0.079
6.215GlyGlu: 6.215 ± 0.076
3.345GlyPhe: 3.345 ± 0.059
8.49GlyGly: 8.49 ± 0.115
2.172GlyHis: 2.172 ± 0.044
4.232GlyIle: 4.232 ± 0.063
2.401GlyLys: 2.401 ± 0.053
10.263GlyLeu: 10.263 ± 0.112
2.482GlyMet: 2.482 ± 0.056
2.217GlyAsn: 2.217 ± 0.053
3.741GlyPro: 3.741 ± 0.065
2.772GlyGln: 2.772 ± 0.051
7.817GlyArg: 7.817 ± 0.095
4.367GlySer: 4.367 ± 0.06
4.492GlyThr: 4.492 ± 0.069
6.925GlyVal: 6.925 ± 0.08
1.569GlyTrp: 1.569 ± 0.04
2.579GlyTyr: 2.579 ± 0.055
0.0GlyXaa: 0.0 ± 0.0
His
2.443HisAla: 2.443 ± 0.041
0.337HisCys: 0.337 ± 0.018
1.288HisAsp: 1.288 ± 0.035
1.212HisGlu: 1.212 ± 0.03
0.866HisPhe: 0.866 ± 0.025
2.267HisGly: 2.267 ± 0.047
0.722HisHis: 0.722 ± 0.026
0.909HisIle: 0.909 ± 0.026
0.481HisLys: 0.481 ± 0.019
2.629HisLeu: 2.629 ± 0.047
0.442HisMet: 0.442 ± 0.02
0.525HisAsn: 0.525 ± 0.022
1.742HisPro: 1.742 ± 0.049
0.718HisGln: 0.718 ± 0.026
1.82HisArg: 1.82 ± 0.038
0.923HisSer: 0.923 ± 0.026
1.092HisThr: 1.092 ± 0.029
1.491HisVal: 1.491 ± 0.036
0.437HisTrp: 0.437 ± 0.021
0.787HisTyr: 0.787 ± 0.025
0.0HisXaa: 0.0 ± 0.0
Ile
4.744IleAla: 4.744 ± 0.069
0.407IleCys: 0.407 ± 0.019
2.694IleAsp: 2.694 ± 0.052
2.786IleGlu: 2.786 ± 0.054
1.221IlePhe: 1.221 ± 0.035
3.76IleGly: 3.76 ± 0.065
1.122IleHis: 1.122 ± 0.03
1.876IleIle: 1.876 ± 0.048
1.036IleLys: 1.036 ± 0.031
4.218IleLeu: 4.218 ± 0.073
0.751IleMet: 0.751 ± 0.028
1.176IleAsn: 1.176 ± 0.031
2.468IlePro: 2.468 ± 0.05
1.175IleGln: 1.175 ± 0.034
3.434IleArg: 3.434 ± 0.054
2.074IleSer: 2.074 ± 0.04
2.353IleThr: 2.353 ± 0.045
2.746IleVal: 2.746 ± 0.05
0.421IleTrp: 0.421 ± 0.019
1.012IleTyr: 1.012 ± 0.031
0.0IleXaa: 0.0 ± 0.0
Lys
2.706LysAla: 2.706 ± 0.063
0.165LysCys: 0.165 ± 0.012
1.144LysAsp: 1.144 ± 0.033
1.363LysGlu: 1.363 ± 0.044
0.623LysPhe: 0.623 ± 0.022
1.914LysGly: 1.914 ± 0.047
0.513LysHis: 0.513 ± 0.025
0.99LysIle: 0.99 ± 0.029
0.937LysLys: 0.937 ± 0.043
2.286LysLeu: 2.286 ± 0.047
0.511LysMet: 0.511 ± 0.025
0.6LysAsn: 0.6 ± 0.026
1.37LysPro: 1.37 ± 0.041
0.784LysGln: 0.784 ± 0.026
1.882LysArg: 1.882 ± 0.035
1.322LysSer: 1.322 ± 0.034
1.313LysThr: 1.313 ± 0.033
1.932LysVal: 1.932 ± 0.046
0.278LysTrp: 0.278 ± 0.015
0.612LysTyr: 0.612 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
15.383LeuAla: 15.383 ± 0.141
1.009LeuCys: 1.009 ± 0.032
6.44LeuAsp: 6.44 ± 0.08
8.564LeuGlu: 8.564 ± 0.098
4.108LeuPhe: 4.108 ± 0.067
10.012LeuGly: 10.012 ± 0.108
2.518LeuHis: 2.518 ± 0.049
3.98LeuIle: 3.98 ± 0.061
2.765LeuLys: 2.765 ± 0.055
13.639LeuLeu: 13.639 ± 0.19
2.211LeuMet: 2.211 ± 0.044
2.504LeuAsn: 2.504 ± 0.046
6.529LeuPro: 6.529 ± 0.088
3.553LeuGln: 3.553 ± 0.057
9.157LeuArg: 9.157 ± 0.087
5.445LeuSer: 5.445 ± 0.065
5.285LeuThr: 5.285 ± 0.066
8.798LeuVal: 8.798 ± 0.099
1.424LeuTrp: 1.424 ± 0.039
2.441LeuTyr: 2.441 ± 0.044
0.0LeuXaa: 0.0 ± 0.0
Met
2.733MetAla: 2.733 ± 0.05
0.14MetCys: 0.14 ± 0.011
1.263MetAsp: 1.263 ± 0.031
1.52MetGlu: 1.52 ± 0.034
0.576MetPhe: 0.576 ± 0.023
1.859MetGly: 1.859 ± 0.046
0.495MetHis: 0.495 ± 0.02
0.81MetIle: 0.81 ± 0.028
0.844MetLys: 0.844 ± 0.025
2.18MetLeu: 2.18 ± 0.045
0.481MetMet: 0.481 ± 0.026
0.783MetAsn: 0.783 ± 0.026
1.232MetPro: 1.232 ± 0.035
0.743MetGln: 0.743 ± 0.027
1.506MetArg: 1.506 ± 0.035
1.362MetSer: 1.362 ± 0.034
1.339MetThr: 1.339 ± 0.037
1.584MetVal: 1.584 ± 0.036
0.148MetTrp: 0.148 ± 0.012
0.341MetTyr: 0.341 ± 0.018
0.0MetXaa: 0.0 ± 0.0
Asn
2.463AsnAla: 2.463 ± 0.052
0.245AsnCys: 0.245 ± 0.013
1.182AsnAsp: 1.182 ± 0.038
1.137AsnGlu: 1.137 ± 0.029
0.85AsnPhe: 0.85 ± 0.027
2.111AsnGly: 2.111 ± 0.053
0.528AsnHis: 0.528 ± 0.022
1.187AsnIle: 1.187 ± 0.034
0.547AsnLys: 0.547 ± 0.021
2.787AsnLeu: 2.787 ± 0.055
0.46AsnMet: 0.46 ± 0.017
0.624AsnAsn: 0.624 ± 0.026
1.866AsnPro: 1.866 ± 0.049
0.66AsnGln: 0.66 ± 0.021
1.931AsnArg: 1.931 ± 0.041
0.947AsnSer: 0.947 ± 0.033
1.181AsnThr: 1.181 ± 0.034
1.583AsnVal: 1.583 ± 0.043
0.333AsnTrp: 0.333 ± 0.018
0.645AsnTyr: 0.645 ± 0.023
0.0AsnXaa: 0.0 ± 0.0
Pro
7.324ProAla: 7.324 ± 0.104
0.471ProCys: 0.471 ± 0.022
3.547ProAsp: 3.547 ± 0.064
4.822ProGlu: 4.822 ± 0.073
1.83ProPhe: 1.83 ± 0.04
6.061ProGly: 6.061 ± 0.079
1.093ProHis: 1.093 ± 0.034
1.732ProIle: 1.732 ± 0.04
1.071ProLys: 1.071 ± 0.053
5.758ProLeu: 5.758 ± 0.071
1.135ProMet: 1.135 ± 0.031
0.997ProAsn: 0.997 ± 0.033
3.128ProPro: 3.128 ± 0.066
1.378ProGln: 1.378 ± 0.037
3.718ProArg: 3.718 ± 0.058
1.986ProSer: 1.986 ± 0.043
2.091ProThr: 2.091 ± 0.048
4.751ProVal: 4.751 ± 0.066
0.856ProTrp: 0.856 ± 0.029
1.316ProTyr: 1.316 ± 0.038
0.0ProXaa: 0.0 ± 0.0
Gln
4.296GlnAla: 4.296 ± 0.067
0.248GlnCys: 0.248 ± 0.016
1.273GlnAsp: 1.273 ± 0.035
1.892GlnGlu: 1.892 ± 0.042
0.908GlnPhe: 0.908 ± 0.026
2.624GlnGly: 2.624 ± 0.048
0.662GlnHis: 0.662 ± 0.022
1.114GlnIle: 1.114 ± 0.031
0.766GlnLys: 0.766 ± 0.027
3.25GlnLeu: 3.25 ± 0.065
0.771GlnMet: 0.771 ± 0.027
0.583GlnAsn: 0.583 ± 0.024
1.828GlnPro: 1.828 ± 0.044
1.242GlnGln: 1.242 ± 0.037
2.821GlnArg: 2.821 ± 0.059
1.398GlnSer: 1.398 ± 0.037
1.336GlnThr: 1.336 ± 0.035
2.724GlnVal: 2.724 ± 0.048
0.416GlnTrp: 0.416 ± 0.017
0.631GlnTyr: 0.631 ± 0.025
0.0GlnXaa: 0.0 ± 0.0
Arg
8.411ArgAla: 8.411 ± 0.092
0.755ArgCys: 0.755 ± 0.029
4.381ArgAsp: 4.381 ± 0.066
6.057ArgGlu: 6.057 ± 0.079
3.242ArgPhe: 3.242 ± 0.05
6.049ArgGly: 6.049 ± 0.07
2.287ArgHis: 2.287 ± 0.056
4.093ArgIle: 4.093 ± 0.054
1.923ArgLys: 1.923 ± 0.046
10.858ArgLeu: 10.858 ± 0.123
1.942ArgMet: 1.942 ± 0.039
1.961ArgAsn: 1.961 ± 0.038
4.272ArgPro: 4.272 ± 0.072
2.961ArgGln: 2.961 ± 0.057
7.617ArgArg: 7.617 ± 0.095
3.48ArgSer: 3.48 ± 0.065
3.657ArgThr: 3.657 ± 0.06
6.224ArgVal: 6.224 ± 0.079
1.294ArgTrp: 1.294 ± 0.039
2.282ArgTyr: 2.282 ± 0.046
0.0ArgXaa: 0.0 ± 0.0
Ser
5.041SerAla: 5.041 ± 0.064
0.463SerCys: 0.463 ± 0.02
2.325SerAsp: 2.325 ± 0.051
2.468SerGlu: 2.468 ± 0.051
1.546SerPhe: 1.546 ± 0.036
5.19SerGly: 5.19 ± 0.071
1.103SerHis: 1.103 ± 0.031
1.961SerIle: 1.961 ± 0.041
0.959SerLys: 0.959 ± 0.035
5.41SerLeu: 5.41 ± 0.076
1.011SerMet: 1.011 ± 0.031
1.044SerAsn: 1.044 ± 0.033
2.43SerPro: 2.43 ± 0.043
1.251SerGln: 1.251 ± 0.039
3.812SerArg: 3.812 ± 0.056
2.102SerSer: 2.102 ± 0.055
2.125SerThr: 2.125 ± 0.045
3.405SerVal: 3.405 ± 0.051
0.618SerTrp: 0.618 ± 0.024
1.173SerTyr: 1.173 ± 0.035
0.0SerXaa: 0.0 ± 0.0
Thr
5.582ThrAla: 5.582 ± 0.078
0.495ThrCys: 0.495 ± 0.022
2.494ThrAsp: 2.494 ± 0.045
2.574ThrGlu: 2.574 ± 0.046
1.524ThrPhe: 1.524 ± 0.035
5.22ThrGly: 5.22 ± 0.066
1.049ThrHis: 1.049 ± 0.03
1.927ThrIle: 1.927 ± 0.042
0.815ThrLys: 0.815 ± 0.028
6.531ThrLeu: 6.531 ± 0.071
0.789ThrMet: 0.789 ± 0.03
0.955ThrAsn: 0.955 ± 0.032
3.24ThrPro: 3.24 ± 0.043
1.178ThrGln: 1.178 ± 0.037
3.742ThrArg: 3.742 ± 0.055
1.927ThrSer: 1.927 ± 0.04
2.453ThrThr: 2.453 ± 0.05
4.033ThrVal: 4.033 ± 0.059
0.66ThrTrp: 0.66 ± 0.022
1.108ThrTyr: 1.108 ± 0.033
0.0ThrXaa: 0.0 ± 0.0
Val
8.881ValAla: 8.881 ± 0.094
0.675ValCys: 0.675 ± 0.026
4.269ValAsp: 4.269 ± 0.066
5.401ValGlu: 5.401 ± 0.075
2.436ValPhe: 2.436 ± 0.047
5.768ValGly: 5.768 ± 0.071
1.623ValHis: 1.623 ± 0.04
3.505ValIle: 3.505 ± 0.055
1.928ValLys: 1.928 ± 0.045
8.587ValLeu: 8.587 ± 0.111
1.728ValMet: 1.728 ± 0.045
2.049ValAsn: 2.049 ± 0.042
3.817ValPro: 3.817 ± 0.058
2.125ValGln: 2.125 ± 0.041
5.865ValArg: 5.865 ± 0.081
3.782ValSer: 3.782 ± 0.063
4.145ValThr: 4.145 ± 0.057
6.455ValVal: 6.455 ± 0.089
0.862ValTrp: 0.862 ± 0.028
1.752ValTyr: 1.752 ± 0.038
0.001ValXaa: 0.001 ± 0.001
Trp
1.177TrpAla: 1.177 ± 0.035
0.172TrpCys: 0.172 ± 0.012
0.698TrpAsp: 0.698 ± 0.021
0.789TrpGlu: 0.789 ± 0.027
0.523TrpPhe: 0.523 ± 0.023
0.996TrpGly: 0.996 ± 0.032
0.353TrpHis: 0.353 ± 0.019
0.599TrpIle: 0.599 ± 0.024
0.339TrpLys: 0.339 ± 0.018
2.101TrpLeu: 2.101 ± 0.048
0.339TrpMet: 0.339 ± 0.014
0.419TrpAsn: 0.419 ± 0.02
0.682TrpPro: 0.682 ± 0.023
0.525TrpGln: 0.525 ± 0.023
1.345TrpArg: 1.345 ± 0.038
0.811TrpSer: 0.811 ± 0.029
0.619TrpThr: 0.619 ± 0.024
1.059TrpVal: 1.059 ± 0.026
0.279TrpTrp: 0.279 ± 0.015
0.37TrpTyr: 0.37 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.421TyrAla: 2.421 ± 0.047
0.269TyrCys: 0.269 ± 0.013
1.398TyrAsp: 1.398 ± 0.039
1.238TyrGlu: 1.238 ± 0.037
0.851TyrPhe: 0.851 ± 0.03
2.316TyrGly: 2.316 ± 0.059
0.628TyrHis: 0.628 ± 0.024
0.836TyrIle: 0.836 ± 0.029
0.538TyrLys: 0.538 ± 0.02
3.098TyrLeu: 3.098 ± 0.054
0.42TyrMet: 0.42 ± 0.02
0.608TyrAsn: 0.608 ± 0.027
1.366TyrPro: 1.366 ± 0.038
0.877TyrGln: 0.877 ± 0.026
2.525TyrArg: 2.525 ± 0.051
1.142TyrSer: 1.142 ± 0.033
1.296TyrThr: 1.296 ± 0.034
1.624TyrVal: 1.624 ± 0.036
0.374TyrTrp: 0.374 ± 0.018
0.757TyrTyr: 0.757 ± 0.027
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.001
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.002XaaXaa: 0.002 ± 0.002
Statistics based on 3827 proteins (1199143 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski