Amino acid dipeptide frequency for Bacillus manliponensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
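The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the table: the function names, the one-letter alphabet, and the mapping of every non-standard residue to `X` (for Xaa) are hypothetical choices, and the baseline is the uniform 1/400 expectation mentioned in the text.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues (one-letter codes)
ALPHABET = STANDARD + "X"           # assumption: X stands for Xaa

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille
UNIFORM_EXPECTED = 1000.0 / len(STANDARD) ** 2


def normalise(seq):
    """Upper-case the sequence and map any non-standard residue to X."""
    return "".join(c if c in STANDARD else "X" for c in seq.upper())


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in sequences:
        seq = normalise(seq)
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=UNIFORM_EXPECTED):
    """Label a frequency relative to the expected value."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"
```

For example, `classify(10.371)` labels the LeuLeu value from the table as overrepresented, and `classify(0.14)` labels CysCys as underrepresented, relative to the uniform 2.5‰ baseline.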

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
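As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the LeuLeu value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


leu_leu = 10.371  # LeuLeu frequency from the table, in per mille
fraction = permille_to_fraction(leu_leu)
percent = permille_to_percent(leu_leu)
print(f"LeuLeu: {fraction:.6f} as a fraction, {percent:.4f}%")
```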

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.439 ± 0.087
AlaCys: 0.674 ± 0.024
AlaAsp: 2.883 ± 0.053
AlaGlu: 4.501 ± 0.07
AlaPhe: 3.271 ± 0.063
AlaGly: 4.682 ± 0.083
AlaHis: 1.377 ± 0.034
AlaIle: 5.649 ± 0.082
AlaLys: 5.042 ± 0.069
AlaLeu: 7.085 ± 0.08
AlaMet: 2.128 ± 0.051
AlaAsn: 2.886 ± 0.054
AlaPro: 1.969 ± 0.043
AlaGln: 2.078 ± 0.045
AlaArg: 2.316 ± 0.054
AlaSer: 3.819 ± 0.051
AlaThr: 3.622 ± 0.059
AlaVal: 5.25 ± 0.071
AlaTrp: 0.631 ± 0.021
AlaTyr: 2.46 ± 0.048
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.494 ± 0.018
CysCys: 0.14 ± 0.01
CysAsp: 0.408 ± 0.016
CysGlu: 0.609 ± 0.025
CysPhe: 0.445 ± 0.02
CysGly: 0.718 ± 0.025
CysHis: 0.222 ± 0.015
CysIle: 0.8 ± 0.026
CysLys: 0.512 ± 0.024
CysLeu: 0.815 ± 0.026
CysMet: 0.263 ± 0.015
CysAsn: 0.411 ± 0.019
CysPro: 0.324 ± 0.015
CysGln: 0.23 ± 0.015
CysArg: 0.309 ± 0.016
CysSer: 0.577 ± 0.022
CysThr: 0.517 ± 0.021
CysVal: 0.533 ± 0.021
CysTrp: 0.085 ± 0.007
CysTyr: 0.34 ± 0.018
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.108 ± 0.049
AspCys: 0.435 ± 0.019
AspAsp: 2.139 ± 0.046
AspGlu: 4.314 ± 0.061
AspPhe: 2.193 ± 0.043
AspGly: 3.291 ± 0.062
AspHis: 0.915 ± 0.026
AspIle: 3.989 ± 0.063
AspLys: 2.957 ± 0.056
AspLeu: 4.125 ± 0.069
AspMet: 1.364 ± 0.032
AspAsn: 1.734 ± 0.041
AspPro: 1.529 ± 0.037
AspGln: 1.309 ± 0.035
AspArg: 1.9 ± 0.041
AspSer: 2.317 ± 0.041
AspThr: 2.348 ± 0.045
AspVal: 4.019 ± 0.062
AspTrp: 0.605 ± 0.023
AspTyr: 1.994 ± 0.038
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.414 ± 0.077
GluCys: 0.508 ± 0.02
GluAsp: 3.733 ± 0.053
GluGlu: 8.085 ± 0.127
GluPhe: 2.56 ± 0.047
GluGly: 4.456 ± 0.067
GluHis: 1.777 ± 0.034
GluIle: 5.654 ± 0.073
GluLys: 7.331 ± 0.087
GluLeu: 7.102 ± 0.093
GluMet: 2.508 ± 0.05
GluAsn: 3.682 ± 0.058
GluPro: 1.759 ± 0.05
GluGln: 3.834 ± 0.072
GluArg: 3.77 ± 0.052
GluSer: 3.324 ± 0.054
GluThr: 4.045 ± 0.06
GluVal: 5.664 ± 0.087
GluTrp: 0.819 ± 0.027
GluTyr: 2.452 ± 0.048
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.105 ± 0.055
PheCys: 0.462 ± 0.019
PheAsp: 2.115 ± 0.043
PheGlu: 2.847 ± 0.053
PhePhe: 2.548 ± 0.063
PheGly: 3.256 ± 0.052
PheHis: 1.181 ± 0.037
PheIle: 4.141 ± 0.08
PheLys: 2.186 ± 0.038
PheLeu: 4.91 ± 0.082
PheMet: 1.26 ± 0.031
PheAsn: 1.666 ± 0.04
PhePro: 1.623 ± 0.034
PheGln: 1.869 ± 0.039
PheArg: 1.578 ± 0.036
PheSer: 3.299 ± 0.062
PheThr: 2.722 ± 0.053
PheVal: 3.628 ± 0.058
PheTrp: 0.496 ± 0.021
PheTyr: 1.773 ± 0.043
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.763 ± 0.074
GlyCys: 0.631 ± 0.02
GlyAsp: 3.036 ± 0.057
GlyGlu: 4.671 ± 0.068
GlyPhe: 3.19 ± 0.053
GlyGly: 4.476 ± 0.078
GlyHis: 1.326 ± 0.037
GlyIle: 5.792 ± 0.082
GlyLys: 5.112 ± 0.06
GlyLeu: 6.141 ± 0.085
GlyMet: 2.109 ± 0.04
GlyAsn: 2.676 ± 0.046
GlyPro: 1.52 ± 0.038
GlyGln: 1.973 ± 0.043
GlyArg: 2.452 ± 0.056
GlySer: 3.573 ± 0.054
GlyThr: 4.056 ± 0.07
GlyVal: 5.184 ± 0.067
GlyTrp: 0.816 ± 0.033
GlyTyr: 2.784 ± 0.057
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.371 ± 0.036
HisCys: 0.232 ± 0.014
HisAsp: 1.093 ± 0.033
HisGlu: 1.648 ± 0.041
HisPhe: 1.08 ± 0.035
HisGly: 1.382 ± 0.035
HisHis: 0.676 ± 0.028
HisIle: 2.057 ± 0.044
HisLys: 1.182 ± 0.031
HisLeu: 2.073 ± 0.043
HisMet: 0.659 ± 0.024
HisAsn: 0.941 ± 0.028
HisPro: 1.089 ± 0.035
HisGln: 0.687 ± 0.025
HisArg: 0.913 ± 0.026
HisSer: 1.242 ± 0.031
HisThr: 1.286 ± 0.031
HisVal: 1.762 ± 0.038
HisTrp: 0.213 ± 0.013
HisTyr: 0.961 ± 0.028
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.194 ± 0.087
IleCys: 0.831 ± 0.025
IleAsp: 4.098 ± 0.063
IleGlu: 6.014 ± 0.076
IlePhe: 3.523 ± 0.068
IleGly: 6.228 ± 0.084
IleHis: 1.891 ± 0.042
IleIle: 5.949 ± 0.101
IleLys: 4.194 ± 0.059
IleLeu: 6.947 ± 0.082
IleMet: 1.844 ± 0.041
IleAsn: 2.93 ± 0.058
IlePro: 3.262 ± 0.051
IleGln: 3.026 ± 0.051
IleArg: 3.034 ± 0.051
IleSer: 4.893 ± 0.069
IleThr: 4.608 ± 0.066
IleVal: 6.147 ± 0.078
IleTrp: 0.687 ± 0.025
IleTyr: 2.593 ± 0.048
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.366 ± 0.068
LysCys: 0.398 ± 0.018
LysAsp: 3.737 ± 0.065
LysGlu: 7.968 ± 0.089
LysPhe: 2.084 ± 0.045
LysGly: 4.439 ± 0.067
LysHis: 1.533 ± 0.032
LysIle: 4.583 ± 0.071
LysLys: 6.389 ± 0.091
LysLeu: 6.017 ± 0.081
LysMet: 2.421 ± 0.042
LysAsn: 3.158 ± 0.053
LysPro: 2.048 ± 0.039
LysGln: 3.772 ± 0.058
LysArg: 3.55 ± 0.051
LysSer: 3.261 ± 0.054
LysThr: 3.531 ± 0.052
LysVal: 4.965 ± 0.074
LysTrp: 0.85 ± 0.026
LysTyr: 2.408 ± 0.051
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.818 ± 0.077
LeuCys: 0.925 ± 0.025
LeuAsp: 4.183 ± 0.057
LeuGlu: 6.668 ± 0.089
LeuPhe: 5.089 ± 0.094
LeuGly: 6.194 ± 0.072
LeuHis: 2.446 ± 0.051
LeuIle: 6.6 ± 0.092
LeuLys: 6.117 ± 0.08
LeuLeu: 10.371 ± 0.145
LeuMet: 2.342 ± 0.046
LeuAsn: 3.795 ± 0.061
LeuPro: 3.745 ± 0.047
LeuGln: 4.92 ± 0.071
LeuArg: 3.61 ± 0.052
LeuSer: 6.288 ± 0.076
LeuThr: 5.326 ± 0.07
LeuVal: 6.093 ± 0.067
LeuTrp: 0.816 ± 0.029
LeuTyr: 3.563 ± 0.053
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.816 ± 0.04
MetCys: 0.18 ± 0.011
MetAsp: 1.322 ± 0.035
MetGlu: 2.013 ± 0.042
MetPhe: 1.221 ± 0.035
MetGly: 1.706 ± 0.04
MetHis: 0.534 ± 0.023
MetIle: 2.242 ± 0.04
MetLys: 3.025 ± 0.055
MetLeu: 2.837 ± 0.053
MetMet: 1.065 ± 0.033
MetAsn: 1.717 ± 0.032
MetPro: 0.986 ± 0.033
MetGln: 1.135 ± 0.03
MetArg: 1.229 ± 0.032
MetSer: 1.692 ± 0.038
MetThr: 1.631 ± 0.033
MetVal: 1.663 ± 0.039
MetTrp: 0.241 ± 0.013
MetTyr: 0.982 ± 0.03
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.61 ± 0.051
AsnCys: 0.357 ± 0.017
AsnAsp: 2.067 ± 0.044
AsnGlu: 3.787 ± 0.067
AsnPhe: 1.677 ± 0.048
AsnGly: 3.175 ± 0.058
AsnHis: 0.996 ± 0.03
AsnIle: 3.735 ± 0.051
AsnLys: 3.062 ± 0.053
AsnLeu: 3.489 ± 0.058
AsnMet: 1.343 ± 0.03
AsnAsn: 1.963 ± 0.048
AsnPro: 1.948 ± 0.042
AsnGln: 1.497 ± 0.038
AsnArg: 1.855 ± 0.045
AsnSer: 2.122 ± 0.044
AsnThr: 2.151 ± 0.045
AsnVal: 3.305 ± 0.056
AsnTrp: 0.501 ± 0.02
AsnTyr: 1.619 ± 0.037
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.073 ± 0.041
ProCys: 0.241 ± 0.014
ProAsp: 1.519 ± 0.036
ProGlu: 2.483 ± 0.052
ProPhe: 2.019 ± 0.044
ProGly: 1.919 ± 0.051
ProHis: 0.829 ± 0.029
ProIle: 2.758 ± 0.048
ProLys: 2.272 ± 0.037
ProLeu: 3.415 ± 0.058
ProMet: 0.782 ± 0.028
ProAsn: 1.685 ± 0.036
ProPro: 0.848 ± 0.028
ProGln: 0.969 ± 0.031
ProArg: 1.009 ± 0.026
ProSer: 2.135 ± 0.039
ProThr: 1.873 ± 0.04
ProVal: 2.617 ± 0.057
ProTrp: 0.328 ± 0.014
ProTyr: 1.481 ± 0.037
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.491 ± 0.048
GlnCys: 0.277 ± 0.016
GlnAsp: 1.594 ± 0.036
GlnGlu: 2.953 ± 0.064
GlnPhe: 1.785 ± 0.038
GlnGly: 2.099 ± 0.048
GlnHis: 0.93 ± 0.033
GlnIle: 2.594 ± 0.043
GlnLys: 3.049 ± 0.05
GlnLeu: 4.067 ± 0.063
GlnMet: 1.142 ± 0.034
GlnAsn: 1.784 ± 0.041
GlnPro: 1.13 ± 0.029
GlnGln: 1.858 ± 0.044
GlnArg: 1.431 ± 0.033
GlnSer: 2.145 ± 0.04
GlnThr: 2.15 ± 0.045
GlnVal: 2.572 ± 0.047
GlnTrp: 0.362 ± 0.015
GlnTyr: 1.584 ± 0.039
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.473 ± 0.05
ArgCys: 0.254 ± 0.014
ArgAsp: 1.959 ± 0.038
ArgGlu: 3.351 ± 0.069
ArgPhe: 1.836 ± 0.039
ArgGly: 2.242 ± 0.045
ArgHis: 0.814 ± 0.025
ArgIle: 2.99 ± 0.048
ArgLys: 3.268 ± 0.049
ArgLeu: 3.738 ± 0.061
ArgMet: 1.275 ± 0.036
ArgAsn: 1.894 ± 0.038
ArgPro: 1.148 ± 0.03
ArgGln: 1.386 ± 0.032
ArgArg: 1.657 ± 0.037
ArgSer: 2.008 ± 0.039
ArgThr: 2.115 ± 0.045
ArgVal: 2.725 ± 0.048
ArgTrp: 0.39 ± 0.017
ArgTyr: 1.608 ± 0.036
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.38 ± 0.048
SerCys: 0.494 ± 0.023
SerAsp: 2.306 ± 0.047
SerGlu: 3.617 ± 0.055
SerPhe: 3.483 ± 0.059
SerGly: 3.916 ± 0.055
SerHis: 1.34 ± 0.034
SerIle: 5.131 ± 0.071
SerLys: 3.615 ± 0.059
SerLeu: 5.984 ± 0.07
SerMet: 1.745 ± 0.037
SerAsn: 2.437 ± 0.049
SerPro: 1.949 ± 0.04
SerGln: 1.762 ± 0.038
SerArg: 2.04 ± 0.042
SerSer: 3.673 ± 0.062
SerThr: 2.93 ± 0.045
SerVal: 4.062 ± 0.063
SerTrp: 0.606 ± 0.021
SerTyr: 2.39 ± 0.044
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.697 ± 0.068
ThrCys: 0.466 ± 0.021
ThrAsp: 2.464 ± 0.047
ThrGlu: 3.839 ± 0.061
ThrPhe: 2.871 ± 0.051
ThrGly: 4.004 ± 0.07
ThrHis: 1.137 ± 0.033
ThrIle: 4.666 ± 0.071
ThrLys: 3.843 ± 0.052
ThrLeu: 5.459 ± 0.076
ThrMet: 1.451 ± 0.039
ThrAsn: 2.644 ± 0.051
ThrPro: 2.131 ± 0.038
ThrGln: 1.334 ± 0.032
ThrArg: 1.688 ± 0.038
ThrSer: 3.212 ± 0.052
ThrThr: 3.036 ± 0.048
ThrVal: 4.478 ± 0.063
ThrTrp: 0.523 ± 0.022
ThrTyr: 2.276 ± 0.042
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.263 ± 0.077
ValCys: 0.781 ± 0.029
ValAsp: 3.337 ± 0.063
ValGlu: 5.178 ± 0.081
ValPhe: 3.289 ± 0.063
ValGly: 4.79 ± 0.079
ValHis: 1.541 ± 0.039
ValIle: 5.696 ± 0.07
ValLys: 5.002 ± 0.077
ValLeu: 7.056 ± 0.087
ValMet: 2.108 ± 0.039
ValAsn: 3.081 ± 0.06
ValPro: 2.715 ± 0.043
ValGln: 2.819 ± 0.047
ValArg: 2.804 ± 0.048
ValSer: 4.671 ± 0.057
ValThr: 4.704 ± 0.063
ValVal: 5.554 ± 0.081
ValTrp: 0.715 ± 0.028
ValTyr: 2.519 ± 0.049
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.525 ± 0.023
TrpCys: 0.095 ± 0.009
TrpAsp: 0.555 ± 0.02
TrpGlu: 0.73 ± 0.028
TrpPhe: 0.551 ± 0.026
TrpGly: 0.681 ± 0.024
TrpHis: 0.205 ± 0.014
TrpIle: 0.926 ± 0.029
TrpLys: 0.8 ± 0.026
TrpLeu: 1.124 ± 0.034
TrpMet: 0.339 ± 0.018
TrpAsn: 0.573 ± 0.024
TrpPro: 0.201 ± 0.014
TrpGln: 0.353 ± 0.017
TrpArg: 0.421 ± 0.018
TrpSer: 0.544 ± 0.019
TrpThr: 0.462 ± 0.022
TrpVal: 0.615 ± 0.023
TrpTrp: 0.124 ± 0.01
TrpTyr: 0.39 ± 0.02
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.359 ± 0.046
TyrCys: 0.373 ± 0.018
TyrAsp: 2.04 ± 0.043
TyrGlu: 3.201 ± 0.049
TyrPhe: 1.95 ± 0.043
TyrGly: 2.63 ± 0.048
TyrHis: 0.878 ± 0.027
TyrIle: 2.939 ± 0.046
TyrLys: 2.518 ± 0.051
TyrLeu: 3.154 ± 0.057
TyrMet: 1.09 ± 0.033
TyrAsn: 1.6 ± 0.036
TyrPro: 1.332 ± 0.032
TyrGln: 1.184 ± 0.027
TyrArg: 1.565 ± 0.039
TyrSer: 2.065 ± 0.042
TyrThr: 2.108 ± 0.043
TyrVal: 2.785 ± 0.041
TyrTrp: 0.407 ± 0.016
TyrTyr: 1.68 ± 0.055
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4374 proteins (1221822 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
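The page does not describe the bootstrap procedure in detail, so the following is only a sketch of one common way to do it: resample whole proteins with replacement, recompute the dipeptide frequency for each replicate, and report the standard deviation across replicates as the error. The function names, the one-letter sequence encoding, and the fixed random seed are assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Proteins, not individual dipeptides, are the resampling unit.
    """
    rng = random.Random(seed)
    # Pre-compute per-protein counts once: (pair counter, number of pairs).
    per_protein = []
    for seq in sequences:
        counts = pair_counts(seq)
        per_protein.append((counts, sum(counts.values())))

    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(n for _, n in sample)
        hits = sum(counts[dipeptide] for counts, _ in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Calling `bootstrap_error(proteins, "LL")` on a list of one-letter sequences would yield a pair comparable in form to the `10.371 ± 0.145` entry reported for LeuLeu, although the exact numbers depend on the resampling details.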

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski