Amino acid dipeptide frequency for Paenibacillus anaericanus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs in the proteome.
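As a minimal sketch of how such frequencies could be computed (not necessarily the exact procedure used by Proteome-pI), the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name dipeptide_frequencies and the toy sequences are hypothetical, and one-letter codes are used for brevity, whereas the table uses three-letter codes.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Pairs are counted within each protein only, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every overlapping pair of adjacent residues
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, one-letter codes)
freqs = dipeptide_frequencies(["MALLA", "MLL"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

Because the pairs overlap, a protein of length n contributes n - 1 dipeptides, and the frequencies of all observed dipeptides sum to 1000 per mille.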

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
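The arithmetic behind the expected value can be sketched as follows. The classify helper is hypothetical, and using the uniform expectation as the exact threshold for the red and blue marking is an assumption based on the description above, not a confirmed detail of the page.

```python
N_STANDARD = 20  # standard amino acids

# Uniform expectation: each of the 20 * 20 dipeptides is equally likely
expected_fraction = 1 / N_STANDARD**2
expected_percent = 100 * expected_fraction    # 0.25 %
expected_permille = 1000 * expected_fraction  # 2.5 per mille

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if observed_permille > expected:
        return "overrepresented"
    if observed_permille < expected:
        return "underrepresented"
    return "as expected"

# Two values taken from the table: LeuLeu 10.707 and CysCys 0.109
print("LeuLeu:", classify(10.707))
print("CysCys:", classify(0.109))
```

A comparison against the product of the two single amino acid frequencies would be a stricter notion of over- and underrepresentation, but that is not the expectation described here.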

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
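As a small worked example of the unit conversion, the snippet below turns a tabulated per mille value into a fraction, a percentage, and an approximate count. The dipeptide total is an assumption: it takes the protein and amino acid totals reported below the table and assumes that pairs are counted within proteins only, so each protein of length n contributes n - 1 dipeptides.

```python
N_PROTEINS = 5415
N_AMINO_ACIDS = 1728414

# Assumption: overlapping pairs counted within each protein only
n_dipeptides = N_AMINO_ACIDS - N_PROTEINS

leuleu_permille = 10.707               # LeuLeu value from the table
leuleu_fraction = leuleu_permille * 1e-3
leuleu_percent = leuleu_fraction * 100

# Approximate number of LeuLeu occurrences implied by the frequency
approx_count = leuleu_fraction * n_dipeptides

print(f"fraction: {leuleu_fraction:.6f}")
print(f"percent:  {leuleu_percent:.4f}")
print(f"approximate count: {approx_count:.0f}")
```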

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.088 ± 0.086
AlaCys: 0.622 ± 0.02
AlaAsp: 3.6 ± 0.052
AlaGlu: 4.855 ± 0.062
AlaPhe: 2.889 ± 0.05
AlaGly: 5.402 ± 0.062
AlaHis: 1.183 ± 0.028
AlaIle: 5.348 ± 0.061
AlaLys: 4.086 ± 0.052
AlaLeu: 7.26 ± 0.074
AlaMet: 2.006 ± 0.036
AlaAsn: 2.638 ± 0.043
AlaPro: 2.178 ± 0.045
AlaGln: 2.335 ± 0.039
AlaArg: 2.825 ± 0.042
AlaSer: 4.432 ± 0.062
AlaThr: 3.831 ± 0.098
AlaVal: 5.396 ± 0.065
AlaTrp: 0.782 ± 0.023
AlaTyr: 2.358 ± 0.042
AlaXaa: 0.001 ± 0.001
Cys
CysAla: 0.463 ± 0.018
CysCys: 0.109 ± 0.008
CysAsp: 0.417 ± 0.019
CysGlu: 0.45 ± 0.018
CysPhe: 0.326 ± 0.014
CysGly: 0.772 ± 0.024
CysHis: 0.201 ± 0.012
CysIle: 0.61 ± 0.022
CysLys: 0.386 ± 0.015
CysLeu: 0.774 ± 0.024
CysMet: 0.221 ± 0.012
CysAsn: 0.349 ± 0.015
CysPro: 0.331 ± 0.016
CysGln: 0.234 ± 0.013
CysArg: 0.402 ± 0.015
CysSer: 0.6 ± 0.017
CysThr: 0.391 ± 0.016
CysVal: 0.475 ± 0.016
CysTrp: 0.086 ± 0.008
CysTyr: 0.285 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.342 ± 0.048
AspCys: 0.383 ± 0.018
AspAsp: 2.55 ± 0.041
AspGlu: 3.836 ± 0.051
AspPhe: 2.304 ± 0.04
AspGly: 3.941 ± 0.057
AspHis: 1.151 ± 0.027
AspIle: 4.176 ± 0.052
AspLys: 3.129 ± 0.046
AspLeu: 5.12 ± 0.057
AspMet: 1.439 ± 0.03
AspAsn: 2.266 ± 0.037
AspPro: 2.223 ± 0.037
AspGln: 2.007 ± 0.032
AspArg: 2.488 ± 0.045
AspSer: 3.076 ± 0.047
AspThr: 2.8 ± 0.068
AspVal: 3.629 ± 0.049
AspTrp: 0.788 ± 0.022
AspTyr: 2.213 ± 0.037
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.083 ± 0.062
GluCys: 0.417 ± 0.018
GluAsp: 3.394 ± 0.049
GluGlu: 5.381 ± 0.067
GluPhe: 2.381 ± 0.034
GluGly: 4.415 ± 0.056
GluHis: 1.469 ± 0.031
GluIle: 5.045 ± 0.063
GluLys: 4.167 ± 0.055
GluLeu: 7.176 ± 0.084
GluMet: 2.127 ± 0.033
GluAsn: 2.87 ± 0.041
GluPro: 2.042 ± 0.039
GluGln: 3.573 ± 0.048
GluArg: 3.487 ± 0.052
GluSer: 3.919 ± 0.052
GluThr: 3.276 ± 0.047
GluVal: 4.83 ± 0.059
GluTrp: 0.895 ± 0.024
GluTyr: 2.244 ± 0.039
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.031 ± 0.054
PheCys: 0.393 ± 0.016
PheAsp: 2.387 ± 0.037
PheGlu: 2.562 ± 0.044
PhePhe: 1.886 ± 0.043
PheGly: 3.191 ± 0.054
PheHis: 0.865 ± 0.026
PheIle: 3.429 ± 0.051
PheLys: 2.277 ± 0.043
PheLeu: 3.879 ± 0.059
PheMet: 1.195 ± 0.028
PheAsn: 1.94 ± 0.033
PhePro: 1.516 ± 0.032
PheGln: 1.445 ± 0.035
PheArg: 1.858 ± 0.037
PheSer: 3.05 ± 0.045
PheThr: 2.535 ± 0.038
PheVal: 2.921 ± 0.043
PheTrp: 0.555 ± 0.019
PheTyr: 1.566 ± 0.032
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.994 ± 0.121
GlyCys: 0.688 ± 0.023
GlyAsp: 3.799 ± 0.072
GlyGlu: 4.399 ± 0.056
GlyPhe: 3.249 ± 0.046
GlyGly: 5.161 ± 0.074
GlyHis: 1.4 ± 0.032
GlyIle: 5.993 ± 0.067
GlyLys: 4.531 ± 0.06
GlyLeu: 6.98 ± 0.079
GlyMet: 2.241 ± 0.034
GlyAsn: 3.074 ± 0.052
GlyPro: 1.828 ± 0.055
GlyGln: 2.491 ± 0.042
GlyArg: 2.981 ± 0.048
GlySer: 4.713 ± 0.064
GlyThr: 4.346 ± 0.068
GlyVal: 5.193 ± 0.066
GlyTrp: 0.942 ± 0.028
GlyTyr: 3.039 ± 0.045
GlyXaa: 0.001 ± 0.001
His
HisAla: 1.276 ± 0.032
HisCys: 0.191 ± 0.011
HisAsp: 1.017 ± 0.026
HisGlu: 1.287 ± 0.029
HisPhe: 1.002 ± 0.026
HisGly: 1.427 ± 0.028
HisHis: 0.591 ± 0.019
HisIle: 1.46 ± 0.034
HisLys: 0.941 ± 0.022
HisLeu: 2.018 ± 0.042
HisMet: 0.524 ± 0.016
HisAsn: 0.78 ± 0.02
HisPro: 1.088 ± 0.024
HisGln: 0.746 ± 0.026
HisArg: 0.971 ± 0.024
HisSer: 1.254 ± 0.026
HisThr: 1.029 ± 0.028
HisVal: 1.311 ± 0.032
HisTrp: 0.297 ± 0.013
HisTyr: 0.853 ± 0.027
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.716 ± 0.063
IleCys: 0.707 ± 0.02
IleAsp: 4.085 ± 0.054
IleGlu: 4.929 ± 0.051
IlePhe: 2.926 ± 0.047
IleGly: 5.572 ± 0.078
IleHis: 1.614 ± 0.028
IleIle: 5.44 ± 0.072
IleLys: 3.827 ± 0.052
IleLeu: 6.831 ± 0.08
IleMet: 1.983 ± 0.035
IleAsn: 3.137 ± 0.052
IlePro: 3.39 ± 0.048
IleGln: 2.843 ± 0.047
IleArg: 3.4 ± 0.055
IleSer: 5.576 ± 0.061
IleThr: 4.561 ± 0.059
IleVal: 5.484 ± 0.063
IleTrp: 0.774 ± 0.027
IleTyr: 2.46 ± 0.04
IleXaa: 0.001 ± 0.001
Lys
LysAla: 3.831 ± 0.056
LysCys: 0.322 ± 0.015
LysAsp: 3.309 ± 0.05
LysGlu: 4.761 ± 0.063
LysPhe: 1.899 ± 0.036
LysGly: 3.832 ± 0.053
LysHis: 1.092 ± 0.03
LysIle: 3.782 ± 0.055
LysLys: 3.675 ± 0.056
LysLeu: 5.887 ± 0.058
LysMet: 1.794 ± 0.033
LysAsn: 2.42 ± 0.036
LysPro: 2.086 ± 0.039
LysGln: 2.537 ± 0.038
LysArg: 2.784 ± 0.047
LysSer: 3.566 ± 0.043
LysThr: 2.936 ± 0.049
LysVal: 4.215 ± 0.05
LysTrp: 0.801 ± 0.025
LysTyr: 2.157 ± 0.036
LysXaa: 0.001 ± 0.001
Leu
LeuAla: 7.056 ± 0.079
LeuCys: 0.86 ± 0.027
LeuAsp: 5.198 ± 0.052
LeuGlu: 6.334 ± 0.069
LeuPhe: 4.565 ± 0.072
LeuGly: 6.717 ± 0.071
LeuHis: 2.015 ± 0.036
LeuIle: 7.433 ± 0.093
LeuLys: 5.655 ± 0.062
LeuLeu: 10.707 ± 0.135
LeuMet: 2.621 ± 0.049
LeuAsn: 4.421 ± 0.053
LeuPro: 4.202 ± 0.053
LeuGln: 3.997 ± 0.061
LeuArg: 4.55 ± 0.064
LeuSer: 7.502 ± 0.075
LeuThr: 5.654 ± 0.065
LeuVal: 6.345 ± 0.07
LeuTrp: 1.001 ± 0.028
LeuTyr: 3.278 ± 0.043
LeuXaa: 0.001 ± 0.001
Met
MetAla: 1.993 ± 0.033
MetCys: 0.178 ± 0.012
MetAsp: 1.619 ± 0.03
MetGlu: 1.93 ± 0.035
MetPhe: 1.113 ± 0.027
MetGly: 1.853 ± 0.032
MetHis: 0.416 ± 0.015
MetIle: 2.151 ± 0.04
MetLys: 2.126 ± 0.037
MetLeu: 2.851 ± 0.043
MetMet: 0.934 ± 0.025
MetAsn: 1.643 ± 0.03
MetPro: 1.005 ± 0.026
MetGln: 0.942 ± 0.023
MetArg: 1.157 ± 0.024
MetSer: 1.898 ± 0.033
MetThr: 1.618 ± 0.031
MetVal: 1.799 ± 0.036
MetTrp: 0.243 ± 0.011
MetTyr: 0.798 ± 0.023
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.726 ± 0.043
AsnCys: 0.343 ± 0.016
AsnAsp: 2.216 ± 0.034
AsnGlu: 3.104 ± 0.048
AsnPhe: 1.644 ± 0.034
AsnGly: 3.332 ± 0.052
AsnHis: 0.988 ± 0.024
AsnIle: 3.131 ± 0.044
AsnLys: 2.686 ± 0.043
AsnLeu: 3.902 ± 0.049
AsnMet: 1.199 ± 0.026
AsnAsn: 2.21 ± 0.047
AsnPro: 2.087 ± 0.036
AsnGln: 1.703 ± 0.031
AsnArg: 2.045 ± 0.037
AsnSer: 2.756 ± 0.054
AsnThr: 2.31 ± 0.038
AsnVal: 2.912 ± 0.054
AsnTrp: 0.601 ± 0.017
AsnTyr: 1.706 ± 0.04
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.52 ± 0.041
ProCys: 0.255 ± 0.012
ProAsp: 2.339 ± 0.036
ProGlu: 3.083 ± 0.045
ProPhe: 1.83 ± 0.035
ProGly: 2.5 ± 0.043
ProHis: 0.802 ± 0.024
ProIle: 2.706 ± 0.04
ProLys: 1.865 ± 0.037
ProLeu: 3.692 ± 0.054
ProMet: 0.887 ± 0.023
ProAsn: 1.646 ± 0.036
ProPro: 1.058 ± 0.03
ProGln: 1.324 ± 0.028
ProArg: 1.25 ± 0.028
ProSer: 2.576 ± 0.063
ProThr: 2.218 ± 0.082
ProVal: 2.829 ± 0.043
ProTrp: 0.461 ± 0.016
ProTyr: 1.447 ± 0.026
ProXaa: 0.001 ± 0.001
Gln
GlnAla: 2.746 ± 0.044
GlnCys: 0.221 ± 0.012
GlnAsp: 1.884 ± 0.037
GlnGlu: 2.719 ± 0.044
GlnPhe: 1.636 ± 0.03
GlnGly: 2.678 ± 0.05
GlnHis: 0.794 ± 0.023
GlnIle: 2.732 ± 0.042
GlnLys: 1.99 ± 0.036
GlnLeu: 4.013 ± 0.05
GlnMet: 1.168 ± 0.03
GlnAsn: 1.494 ± 0.029
GlnPro: 1.31 ± 0.033
GlnGln: 1.781 ± 0.035
GlnArg: 1.8 ± 0.038
GlnSer: 2.418 ± 0.045
GlnThr: 1.914 ± 0.034
GlnVal: 2.569 ± 0.042
GlnTrp: 0.518 ± 0.018
GlnTyr: 1.411 ± 0.027
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.681 ± 0.04
ArgCys: 0.335 ± 0.015
ArgAsp: 2.368 ± 0.04
ArgGlu: 3.323 ± 0.051
ArgPhe: 1.963 ± 0.034
ArgGly: 2.772 ± 0.04
ArgHis: 0.913 ± 0.024
ArgIle: 3.512 ± 0.051
ArgLys: 2.954 ± 0.04
ArgLeu: 4.545 ± 0.065
ArgMet: 1.417 ± 0.03
ArgAsn: 2.089 ± 0.034
ArgPro: 1.444 ± 0.029
ArgGln: 1.722 ± 0.036
ArgArg: 2.344 ± 0.05
ArgSer: 2.848 ± 0.045
ArgThr: 2.373 ± 0.039
ArgVal: 2.852 ± 0.045
ArgTrp: 0.56 ± 0.02
ArgTyr: 1.752 ± 0.031
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.409 ± 0.054
SerCys: 0.473 ± 0.017
SerAsp: 3.447 ± 0.053
SerGlu: 4.1 ± 0.049
SerPhe: 3.187 ± 0.053
SerGly: 5.364 ± 0.063
SerHis: 1.251 ± 0.028
SerIle: 5.159 ± 0.058
SerLys: 3.846 ± 0.045
SerLeu: 6.883 ± 0.072
SerMet: 1.861 ± 0.032
SerAsn: 2.934 ± 0.053
SerPro: 2.498 ± 0.057
SerGln: 2.292 ± 0.043
SerArg: 2.907 ± 0.048
SerSer: 4.834 ± 0.069
SerThr: 3.558 ± 0.053
SerVal: 4.64 ± 0.057
SerTrp: 0.839 ± 0.021
SerTyr: 2.525 ± 0.041
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 4.147 ± 0.058
ThrCys: 0.349 ± 0.016
ThrAsp: 2.767 ± 0.049
ThrGlu: 3.519 ± 0.047
ThrPhe: 2.417 ± 0.035
ThrGly: 4.891 ± 0.26
ThrHis: 1.02 ± 0.024
ThrIle: 4.153 ± 0.052
ThrLys: 2.88 ± 0.043
ThrLeu: 5.742 ± 0.067
ThrMet: 1.357 ± 0.026
ThrAsn: 2.229 ± 0.047
ThrPro: 2.504 ± 0.08
ThrGln: 1.643 ± 0.031
ThrArg: 2.107 ± 0.034
ThrSer: 3.694 ± 0.057
ThrThr: 3.321 ± 0.059
ThrVal: 4.424 ± 0.058
ThrTrp: 0.687 ± 0.02
ThrTyr: 2.02 ± 0.042
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.977 ± 0.065
ValCys: 0.586 ± 0.018
ValAsp: 3.773 ± 0.051
ValGlu: 4.413 ± 0.061
ValPhe: 2.954 ± 0.045
ValGly: 4.673 ± 0.052
ValHis: 1.314 ± 0.026
ValIle: 5.455 ± 0.066
ValLys: 3.982 ± 0.045
ValLeu: 6.977 ± 0.069
ValMet: 1.968 ± 0.034
ValAsn: 3.076 ± 0.045
ValPro: 2.712 ± 0.042
ValGln: 2.418 ± 0.039
ValArg: 2.938 ± 0.042
ValSer: 5.001 ± 0.065
ValThr: 4.569 ± 0.067
ValVal: 5.076 ± 0.066
ValTrp: 0.801 ± 0.025
ValTyr: 2.363 ± 0.041
ValXaa: 0.001 ± 0.001
Trp
TrpAla: 0.709 ± 0.023
TrpCys: 0.111 ± 0.008
TrpAsp: 0.71 ± 0.023
TrpGlu: 0.784 ± 0.023
TrpPhe: 0.562 ± 0.02
TrpGly: 0.841 ± 0.024
TrpHis: 0.233 ± 0.011
TrpIle: 0.992 ± 0.024
TrpLys: 0.714 ± 0.019
TrpLeu: 1.279 ± 0.03
TrpMet: 0.406 ± 0.017
TrpAsn: 0.724 ± 0.022
TrpPro: 0.307 ± 0.013
TrpGln: 0.429 ± 0.014
TrpArg: 0.543 ± 0.022
TrpSer: 0.847 ± 0.025
TrpThr: 0.648 ± 0.02
TrpVal: 0.788 ± 0.023
TrpTrp: 0.182 ± 0.011
TrpTyr: 0.451 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.329 ± 0.036
TyrCys: 0.338 ± 0.013
TyrAsp: 1.983 ± 0.041
TyrGlu: 2.423 ± 0.037
TyrPhe: 1.745 ± 0.033
TyrGly: 2.707 ± 0.044
TyrHis: 0.753 ± 0.021
TyrIle: 2.548 ± 0.041
TyrLys: 1.965 ± 0.036
TyrLeu: 3.597 ± 0.047
TyrMet: 0.919 ± 0.025
TyrAsn: 1.66 ± 0.037
TyrPro: 1.478 ± 0.032
TyrGln: 1.335 ± 0.035
TyrArg: 1.932 ± 0.037
TyrSer: 2.447 ± 0.042
TyrThr: 1.983 ± 0.038
TyrVal: 2.339 ± 0.037
TyrTrp: 0.447 ± 0.016
TyrTyr: 1.489 ± 0.036
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.001 ± 0.001
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.002 ± 0.001
XaaLys: 0.001 ± 0.001
XaaLeu: 0.001 ± 0.001
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.001
XaaGln: 0.001 ± 0.001
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.002 ± 0.001
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5415 proteins (1728414 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
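A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is reported. This is an illustration under stated assumptions, not the database's actual code: the function names are hypothetical, the sequences are toy data in one-letter codes, and taking the standard deviation as the reported error is an assumption.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Std. dev. of the per mille frequency of `pair` over protein-level resamples."""
    rng = random.Random(seed)
    # Per protein: (occurrences of `pair`, total number of dipeptides)
    per_protein = [(pair_counts(s)[pair], max(len(s) - 1, 0)) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

# Toy proteome (hypothetical sequences)
proteins = ["MALLA", "MLLKLL", "MKKA", "MLLLL"]
err = bootstrap_error(proteins, "LL")
print(f"LL error estimate: {err:.1f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins instead of treating every dipeptide as independent.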

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski