Amino acid dipeptide frequency for Alkalihalobacillus okhensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
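The counting described above can be sketched in Python as follows. This is a minimal illustration, not the code used to build this page: the function names, the toy sequences, the use of overlapping windows within each protein, and the choice of the uniform value (1/400) as the over/under threshold are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); anything else is mapped to X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to X before counting.
        clean = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2

def label(freq_per_mille):
    """Classify a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "as expected"

# Hypothetical toy sequences, only to show the call.
toy = ["MKLLA", "MAXLL"]
freqs = dipeptide_per_mille(toy)
```

With the values from the table, `label(10.452)` (LeuLeu) gives "over" and `label(0.1)` (CysCys) gives "under".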

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
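As a small worked example of the unit conversion, the helpers below turn a per mille value into a plain fraction or a percentage. The function names are illustrative only.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0

# AlaAla is listed as 5.393 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(5.393)
ala_ala_percent = per_mille_to_percent(5.393)
```

For AlaAla this gives a fraction of about 0.005393, or about 0.5393%, compared with 0.25% under the uniform expectation.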

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.393 ± 0.086
AlaCys: 0.637 ± 0.024
AlaAsp: 3.131 ± 0.051
AlaGlu: 4.487 ± 0.066
AlaPhe: 3.329 ± 0.06
AlaGly: 5.017 ± 0.071
AlaHis: 1.315 ± 0.034
AlaIle: 6.024 ± 0.077
AlaLys: 4.465 ± 0.067
AlaLeu: 7.229 ± 0.083
AlaMet: 2.098 ± 0.047
AlaAsn: 2.814 ± 0.047
AlaPro: 1.993 ± 0.044
AlaGln: 2.189 ± 0.042
AlaArg: 2.663 ± 0.047
AlaSer: 4.085 ± 0.064
AlaThr: 3.801 ± 0.052
AlaVal: 5.187 ± 0.072
AlaTrp: 0.619 ± 0.024
AlaTyr: 2.244 ± 0.042
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.423 ± 0.02
CysCys: 0.1 ± 0.01
CysAsp: 0.378 ± 0.015
CysGlu: 0.469 ± 0.02
CysPhe: 0.338 ± 0.016
CysGly: 0.669 ± 0.023
CysHis: 0.198 ± 0.013
CysIle: 0.531 ± 0.021
CysLys: 0.383 ± 0.019
CysLeu: 0.706 ± 0.029
CysMet: 0.156 ± 0.011
CysAsn: 0.303 ± 0.014
CysPro: 0.381 ± 0.021
CysGln: 0.263 ± 0.015
CysArg: 0.284 ± 0.016
CysSer: 0.498 ± 0.02
CysThr: 0.395 ± 0.02
CysVal: 0.453 ± 0.02
CysTrp: 0.06 ± 0.007
CysTyr: 0.269 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.093 ± 0.052
AspCys: 0.365 ± 0.017
AspAsp: 2.552 ± 0.05
AspGlu: 4.605 ± 0.067
AspPhe: 2.265 ± 0.041
AspGly: 3.352 ± 0.057
AspHis: 1.357 ± 0.034
AspIle: 3.826 ± 0.06
AspLys: 2.697 ± 0.049
AspLeu: 5.203 ± 0.07
AspMet: 1.303 ± 0.034
AspAsn: 1.603 ± 0.037
AspPro: 2.006 ± 0.042
AspGln: 2.305 ± 0.046
AspArg: 2.283 ± 0.044
AspSer: 2.552 ± 0.041
AspThr: 2.34 ± 0.043
AspVal: 4.149 ± 0.064
AspTrp: 0.614 ± 0.024
AspTyr: 2.17 ± 0.041
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.508 ± 0.084
GluCys: 0.414 ± 0.018
GluAsp: 3.886 ± 0.063
GluGlu: 7.726 ± 0.096
GluPhe: 2.542 ± 0.042
GluGly: 4.647 ± 0.063
GluHis: 1.703 ± 0.037
GluIle: 5.382 ± 0.072
GluLys: 5.96 ± 0.071
GluLeu: 7.655 ± 0.083
GluMet: 2.396 ± 0.042
GluAsn: 3.407 ± 0.055
GluPro: 2.119 ± 0.041
GluGln: 3.862 ± 0.064
GluArg: 3.909 ± 0.07
GluSer: 3.714 ± 0.059
GluThr: 3.902 ± 0.055
GluVal: 5.662 ± 0.076
GluTrp: 0.874 ± 0.03
GluTyr: 2.229 ± 0.045
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.104 ± 0.06
PheCys: 0.345 ± 0.018
PheAsp: 2.415 ± 0.04
PheGlu: 3.024 ± 0.049
PhePhe: 2.444 ± 0.052
PheGly: 3.359 ± 0.058
PheHis: 1.007 ± 0.026
PheIle: 4.062 ± 0.081
PheLys: 2.165 ± 0.043
PheLeu: 4.618 ± 0.076
PheMet: 1.17 ± 0.035
PheAsn: 1.826 ± 0.04
PhePro: 1.613 ± 0.037
PheGln: 1.666 ± 0.036
PheArg: 1.569 ± 0.032
PheSer: 3.184 ± 0.056
PheThr: 2.537 ± 0.046
PheVal: 3.383 ± 0.054
PheTrp: 0.476 ± 0.019
PheTyr: 1.693 ± 0.04
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.964 ± 0.079
GlyCys: 0.629 ± 0.022
GlyAsp: 3.112 ± 0.062
GlyGlu: 4.685 ± 0.068
GlyPhe: 3.488 ± 0.063
GlyGly: 4.834 ± 0.077
GlyHis: 1.458 ± 0.038
GlyIle: 5.732 ± 0.068
GlyLys: 4.393 ± 0.068
GlyLeu: 6.803 ± 0.089
GlyMet: 2.206 ± 0.047
GlyAsn: 2.503 ± 0.045
GlyPro: 1.854 ± 0.047
GlyGln: 2.38 ± 0.044
GlyArg: 2.731 ± 0.046
GlySer: 3.852 ± 0.05
GlyThr: 3.954 ± 0.058
GlyVal: 5.486 ± 0.073
GlyTrp: 0.786 ± 0.021
GlyTyr: 2.789 ± 0.046
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.403 ± 0.036
HisCys: 0.215 ± 0.014
HisAsp: 1.094 ± 0.027
HisGlu: 1.551 ± 0.031
HisPhe: 1.144 ± 0.031
HisGly: 1.423 ± 0.033
HisHis: 0.706 ± 0.027
HisIle: 1.554 ± 0.035
HisLys: 1.042 ± 0.035
HisLeu: 2.248 ± 0.044
HisMet: 0.531 ± 0.024
HisAsn: 0.809 ± 0.023
HisPro: 1.137 ± 0.031
HisGln: 0.889 ± 0.029
HisArg: 0.868 ± 0.027
HisSer: 1.387 ± 0.034
HisThr: 1.071 ± 0.032
HisVal: 1.624 ± 0.039
HisTrp: 0.24 ± 0.014
HisTyr: 0.908 ± 0.027
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.882 ± 0.073
IleCys: 0.622 ± 0.026
IleAsp: 4.419 ± 0.051
IleGlu: 6.044 ± 0.08
IlePhe: 3.155 ± 0.061
IleGly: 6.255 ± 0.087
IleHis: 1.703 ± 0.035
IleIle: 5.866 ± 0.077
IleLys: 4.232 ± 0.057
IleLeu: 6.953 ± 0.096
IleMet: 1.838 ± 0.039
IleAsn: 3.256 ± 0.059
IlePro: 3.355 ± 0.051
IleGln: 2.796 ± 0.053
IleArg: 3.227 ± 0.05
IleSer: 4.932 ± 0.061
IleThr: 4.342 ± 0.05
IleVal: 5.634 ± 0.067
IleTrp: 0.743 ± 0.026
IleTyr: 2.397 ± 0.044
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.211 ± 0.065
LysCys: 0.29 ± 0.016
LysAsp: 3.326 ± 0.051
LysGlu: 6.509 ± 0.081
LysPhe: 1.658 ± 0.035
LysGly: 4.234 ± 0.065
LysHis: 1.371 ± 0.031
LysIle: 3.876 ± 0.057
LysLys: 5.463 ± 0.073
LysLeu: 5.426 ± 0.063
LysMet: 2.018 ± 0.043
LysAsn: 2.819 ± 0.045
LysPro: 2.018 ± 0.039
LysGln: 3.131 ± 0.056
LysArg: 3.372 ± 0.06
LysSer: 3.318 ± 0.064
LysThr: 3.193 ± 0.053
LysVal: 4.628 ± 0.059
LysTrp: 0.794 ± 0.027
LysTyr: 1.783 ± 0.04
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.532 ± 0.087
LeuCys: 0.699 ± 0.025
LeuAsp: 4.973 ± 0.065
LeuGlu: 7.135 ± 0.083
LeuPhe: 5.045 ± 0.084
LeuGly: 6.53 ± 0.071
LeuHis: 2.114 ± 0.046
LeuIle: 7.41 ± 0.085
LeuLys: 5.953 ± 0.071
LeuLeu: 10.452 ± 0.107
LeuMet: 2.649 ± 0.038
LeuAsn: 4.079 ± 0.055
LeuPro: 3.887 ± 0.058
LeuGln: 3.666 ± 0.054
LeuArg: 3.702 ± 0.066
LeuSer: 6.725 ± 0.077
LeuThr: 5.77 ± 0.068
LeuVal: 6.796 ± 0.071
LeuTrp: 0.834 ± 0.028
LeuTyr: 3.089 ± 0.053
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.007 ± 0.038
MetCys: 0.146 ± 0.011
MetAsp: 1.447 ± 0.038
MetGlu: 1.989 ± 0.04
MetPhe: 1.212 ± 0.036
MetGly: 1.799 ± 0.041
MetHis: 0.415 ± 0.021
MetIle: 2.302 ± 0.049
MetLys: 2.384 ± 0.04
MetLeu: 2.606 ± 0.05
MetMet: 0.898 ± 0.03
MetAsn: 1.642 ± 0.035
MetPro: 0.979 ± 0.029
MetGln: 0.869 ± 0.024
MetArg: 1.083 ± 0.03
MetSer: 1.788 ± 0.038
MetThr: 1.791 ± 0.042
MetVal: 1.941 ± 0.04
MetTrp: 0.228 ± 0.014
MetTyr: 0.764 ± 0.024
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.437 ± 0.044
AsnCys: 0.291 ± 0.017
AsnAsp: 2.233 ± 0.05
AsnGlu: 3.684 ± 0.058
AsnPhe: 1.493 ± 0.037
AsnGly: 3.115 ± 0.056
AsnHis: 1.119 ± 0.033
AsnIle: 3.154 ± 0.047
AsnLys: 2.67 ± 0.051
AsnLeu: 3.694 ± 0.054
AsnMet: 1.223 ± 0.035
AsnAsn: 1.821 ± 0.045
AsnPro: 2.023 ± 0.045
AsnGln: 2.1 ± 0.045
AsnArg: 1.941 ± 0.036
AsnSer: 2.192 ± 0.046
AsnThr: 1.967 ± 0.047
AsnVal: 3.187 ± 0.049
AsnTrp: 0.592 ± 0.022
AsnTyr: 1.443 ± 0.031
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.17 ± 0.04
ProCys: 0.211 ± 0.014
ProAsp: 1.964 ± 0.044
ProGlu: 2.869 ± 0.053
ProPhe: 2.031 ± 0.044
ProGly: 2.206 ± 0.048
ProHis: 0.833 ± 0.03
ProIle: 2.942 ± 0.049
ProLys: 2.022 ± 0.044
ProLeu: 3.589 ± 0.061
ProMet: 0.854 ± 0.025
ProAsn: 1.738 ± 0.043
ProPro: 1.03 ± 0.029
ProGln: 1.069 ± 0.031
ProArg: 1.157 ± 0.03
ProSer: 2.249 ± 0.046
ProThr: 2.18 ± 0.044
ProVal: 2.775 ± 0.055
ProTrp: 0.409 ± 0.018
ProTyr: 1.405 ± 0.034
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.914 ± 0.053
GlnCys: 0.208 ± 0.014
GlnAsp: 1.681 ± 0.037
GlnGlu: 2.986 ± 0.054
GlnPhe: 1.741 ± 0.034
GlnGly: 2.238 ± 0.044
GlnHis: 0.818 ± 0.025
GlnIle: 2.704 ± 0.05
GlnLys: 2.588 ± 0.053
GlnLeu: 4.26 ± 0.066
GlnMet: 1.244 ± 0.035
GlnAsn: 1.506 ± 0.037
GlnPro: 1.227 ± 0.036
GlnGln: 1.81 ± 0.054
GlnArg: 1.616 ± 0.037
GlnSer: 2.277 ± 0.042
GlnThr: 2.197 ± 0.038
GlnVal: 2.854 ± 0.048
GlnTrp: 0.461 ± 0.019
GlnTyr: 1.248 ± 0.033
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.562 ± 0.049
ArgCys: 0.29 ± 0.016
ArgAsp: 2.142 ± 0.038
ArgGlu: 3.268 ± 0.058
ArgPhe: 1.98 ± 0.038
ArgGly: 2.523 ± 0.048
ArgHis: 0.791 ± 0.028
ArgIle: 3.096 ± 0.051
ArgLys: 3.131 ± 0.048
ArgLeu: 4.117 ± 0.061
ArgMet: 1.346 ± 0.032
ArgAsn: 1.953 ± 0.04
ArgPro: 1.349 ± 0.034
ArgGln: 1.552 ± 0.03
ArgArg: 1.897 ± 0.05
ArgSer: 2.349 ± 0.039
ArgThr: 2.155 ± 0.041
ArgVal: 2.907 ± 0.051
ArgTrp: 0.47 ± 0.021
ArgTyr: 1.607 ± 0.038
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.472 ± 0.06
SerCys: 0.439 ± 0.018
SerAsp: 2.82 ± 0.054
SerGlu: 4.146 ± 0.055
SerPhe: 3.346 ± 0.055
SerGly: 4.261 ± 0.065
SerHis: 1.281 ± 0.028
SerIle: 5.099 ± 0.066
SerLys: 3.669 ± 0.057
SerLeu: 6.194 ± 0.083
SerMet: 1.736 ± 0.038
SerAsn: 2.606 ± 0.047
SerPro: 2.052 ± 0.038
SerGln: 2.019 ± 0.038
SerArg: 2.363 ± 0.04
SerSer: 3.913 ± 0.061
SerThr: 3.274 ± 0.055
SerVal: 4.188 ± 0.062
SerTrp: 0.657 ± 0.023
SerTyr: 2.238 ± 0.046
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.663 ± 0.052
ThrCys: 0.381 ± 0.018
ThrAsp: 2.715 ± 0.051
ThrGlu: 3.626 ± 0.058
ThrPhe: 2.791 ± 0.053
ThrGly: 3.987 ± 0.056
ThrHis: 1.086 ± 0.033
ThrIle: 4.814 ± 0.066
ThrLys: 3.279 ± 0.051
ThrLeu: 5.468 ± 0.064
ThrMet: 1.405 ± 0.037
ThrAsn: 2.54 ± 0.047
ThrPro: 2.214 ± 0.038
ThrGln: 1.518 ± 0.033
ThrArg: 1.893 ± 0.038
ThrSer: 3.278 ± 0.048
ThrThr: 3.076 ± 0.052
ThrVal: 4.274 ± 0.055
ThrTrp: 0.574 ± 0.023
ThrTyr: 2.064 ± 0.037
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.435 ± 0.081
ValCys: 0.613 ± 0.022
ValAsp: 3.969 ± 0.06
ValGlu: 5.362 ± 0.082
ValPhe: 3.285 ± 0.057
ValGly: 5.071 ± 0.07
ValHis: 1.441 ± 0.033
ValIle: 5.995 ± 0.078
ValLys: 4.353 ± 0.064
ValLeu: 7.117 ± 0.076
ValMet: 2.016 ± 0.042
ValAsn: 3.215 ± 0.055
ValPro: 2.8 ± 0.051
ValGln: 2.528 ± 0.046
ValArg: 2.899 ± 0.046
ValSer: 4.717 ± 0.065
ValThr: 4.477 ± 0.055
ValVal: 5.517 ± 0.071
ValTrp: 0.641 ± 0.021
ValTyr: 2.314 ± 0.044
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.596 ± 0.023
TrpCys: 0.074 ± 0.007
TrpAsp: 0.529 ± 0.021
TrpGlu: 0.729 ± 0.026
TrpPhe: 0.553 ± 0.022
TrpGly: 0.746 ± 0.027
TrpHis: 0.229 ± 0.014
TrpIle: 0.822 ± 0.026
TrpLys: 0.696 ± 0.023
TrpLeu: 1.228 ± 0.034
TrpMet: 0.347 ± 0.018
TrpAsn: 0.525 ± 0.021
TrpPro: 0.294 ± 0.015
TrpGln: 0.401 ± 0.019
TrpArg: 0.426 ± 0.02
TrpSer: 0.675 ± 0.02
TrpThr: 0.512 ± 0.02
TrpVal: 0.729 ± 0.022
TrpTrp: 0.165 ± 0.012
TrpTyr: 0.342 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.947 ± 0.037
TyrCys: 0.287 ± 0.016
TyrAsp: 1.856 ± 0.039
TyrGlu: 2.603 ± 0.045
TyrPhe: 1.821 ± 0.036
TyrGly: 2.375 ± 0.041
TyrHis: 0.899 ± 0.029
TyrIle: 2.461 ± 0.048
TyrLys: 1.894 ± 0.042
TyrLeu: 3.473 ± 0.051
TyrMet: 0.853 ± 0.022
TyrAsn: 1.413 ± 0.035
TyrPro: 1.37 ± 0.037
TyrGln: 1.569 ± 0.035
TyrArg: 1.601 ± 0.036
TyrSer: 2.091 ± 0.048
TyrThr: 1.708 ± 0.036
TyrVal: 2.393 ± 0.045
TyrTrp: 0.382 ± 0.017
TyrTyr: 1.383 ± 0.036
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4332 proteins (1252721 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
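A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the standard deviation of those estimates is taken as the error. The exact procedure used for this page is not described here, and the function names are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`
    over `n_rounds` resamples of whole proteins drawn with replacement."""
    rng = random.Random(seed)
    # Count once per protein so each resample only sums precomputed counters.
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Hypothetical toy proteome, only to show the call.
mean_ll, err_ll = bootstrap_error(["MKLLA", "MALLL", "MKTAY"], "LL")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the spread of the estimates reflects variation between proteins.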

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski