Amino acid dipepetide frequency for Neorhizobium sp. NCHU2750

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.517AlaAla: 15.517 ± 0.157
0.955AlaCys: 0.955 ± 0.022
6.821AlaAsp: 6.821 ± 0.077
7.149AlaGlu: 7.149 ± 0.082
4.488AlaPhe: 4.488 ± 0.056
10.136AlaGly: 10.136 ± 0.095
2.007AlaHis: 2.007 ± 0.036
7.075AlaIle: 7.075 ± 0.066
4.585AlaLys: 4.585 ± 0.066
12.266AlaLeu: 12.266 ± 0.109
3.638AlaMet: 3.638 ± 0.049
2.99AlaAsn: 2.99 ± 0.051
4.666AlaPro: 4.666 ± 0.077
3.562AlaGln: 3.562 ± 0.052
7.305AlaArg: 7.305 ± 0.071
7.066AlaSer: 7.066 ± 0.066
5.852AlaThr: 5.852 ± 0.061
8.301AlaVal: 8.301 ± 0.079
1.291AlaTrp: 1.291 ± 0.028
2.543AlaTyr: 2.543 ± 0.04
0.0AlaXaa: 0.0 ± 0.0
Cys
0.794CysAla: 0.794 ± 0.02
0.104CysCys: 0.104 ± 0.008
0.488CysAsp: 0.488 ± 0.017
0.407CysGlu: 0.407 ± 0.014
0.308CysPhe: 0.308 ± 0.013
0.876CysGly: 0.876 ± 0.024
0.218CysHis: 0.218 ± 0.011
0.426CysIle: 0.426 ± 0.016
0.176CysLys: 0.176 ± 0.01
0.817CysLeu: 0.817 ± 0.024
0.174CysMet: 0.174 ± 0.01
0.223CysAsn: 0.223 ± 0.01
0.352CysPro: 0.352 ± 0.014
0.23CysGln: 0.23 ± 0.011
0.572CysArg: 0.572 ± 0.016
0.451CysSer: 0.451 ± 0.017
0.36CysThr: 0.36 ± 0.013
0.552CysVal: 0.552 ± 0.02
0.101CysTrp: 0.101 ± 0.008
0.189CysTyr: 0.189 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
6.506AspAla: 6.506 ± 0.071
0.464AspCys: 0.464 ± 0.015
3.305AspAsp: 3.305 ± 0.051
3.649AspGlu: 3.649 ± 0.051
2.383AspPhe: 2.383 ± 0.039
5.14AspGly: 5.14 ± 0.071
1.271AspHis: 1.271 ± 0.031
3.505AspIle: 3.505 ± 0.047
2.047AspLys: 2.047 ± 0.038
5.974AspLeu: 5.974 ± 0.06
1.59AspMet: 1.59 ± 0.032
1.567AspAsn: 1.567 ± 0.033
3.182AspPro: 3.182 ± 0.047
1.837AspGln: 1.837 ± 0.032
4.156AspArg: 4.156 ± 0.048
2.073AspSer: 2.073 ± 0.032
2.653AspThr: 2.653 ± 0.04
4.109AspVal: 4.109 ± 0.049
0.925AspTrp: 0.925 ± 0.023
1.545AspTyr: 1.545 ± 0.031
0.0AspXaa: 0.0 ± 0.0
Glu
7.068GluAla: 7.068 ± 0.068
0.334GluCys: 0.334 ± 0.013
2.897GluAsp: 2.897 ± 0.046
3.361GluGlu: 3.361 ± 0.054
1.837GluPhe: 1.837 ± 0.037
4.065GluGly: 4.065 ± 0.047
1.151GluHis: 1.151 ± 0.027
3.813GluIle: 3.813 ± 0.054
2.838GluLys: 2.838 ± 0.046
5.15GluLeu: 5.15 ± 0.064
1.604GluMet: 1.604 ± 0.031
1.842GluAsn: 1.842 ± 0.035
2.598GluPro: 2.598 ± 0.042
2.114GluGln: 2.114 ± 0.035
4.367GluArg: 4.367 ± 0.069
2.213GluSer: 2.213 ± 0.032
3.542GluThr: 3.542 ± 0.045
3.621GluVal: 3.621 ± 0.05
0.677GluTrp: 0.677 ± 0.019
1.009GluTyr: 1.009 ± 0.022
0.0GluXaa: 0.0 ± 0.0
Phe
4.44PheAla: 4.44 ± 0.058
0.414PheCys: 0.414 ± 0.014
2.808PheAsp: 2.808 ± 0.043
2.115PheGlu: 2.115 ± 0.032
1.573PhePhe: 1.573 ± 0.032
3.761PheGly: 3.761 ± 0.052
0.754PheHis: 0.754 ± 0.019
2.018PheIle: 2.018 ± 0.037
1.156PheLys: 1.156 ± 0.028
3.508PheLeu: 3.508 ± 0.046
0.916PheMet: 0.916 ± 0.024
1.142PheAsn: 1.142 ± 0.024
1.616PhePro: 1.616 ± 0.031
1.108PheGln: 1.108 ± 0.024
2.291PheArg: 2.291 ± 0.035
2.706PheSer: 2.706 ± 0.033
1.966PheThr: 1.966 ± 0.031
2.908PheVal: 2.908 ± 0.042
0.546PheTrp: 0.546 ± 0.019
0.971PheTyr: 0.971 ± 0.023
0.0PheXaa: 0.0 ± 0.0
Gly
8.356GlyAla: 8.356 ± 0.08
0.74GlyCys: 0.74 ± 0.022
4.278GlyAsp: 4.278 ± 0.047
4.717GlyGlu: 4.717 ± 0.055
3.639GlyPhe: 3.639 ± 0.044
6.965GlyGly: 6.965 ± 0.075
1.903GlyHis: 1.903 ± 0.036
5.066GlyIle: 5.066 ± 0.057
3.771GlyLys: 3.771 ± 0.046
8.536GlyLeu: 8.536 ± 0.069
2.484GlyMet: 2.484 ± 0.043
2.41GlyAsn: 2.41 ± 0.039
3.168GlyPro: 3.168 ± 0.042
2.811GlyGln: 2.811 ± 0.043
5.624GlyArg: 5.624 ± 0.062
4.981GlySer: 4.981 ± 0.056
4.525GlyThr: 4.525 ± 0.073
5.611GlyVal: 5.611 ± 0.063
1.228GlyTrp: 1.228 ± 0.024
2.379GlyTyr: 2.379 ± 0.04
0.0GlyXaa: 0.0 ± 0.0
His
2.098HisAla: 2.098 ± 0.036
0.204HisCys: 0.204 ± 0.01
1.271HisAsp: 1.271 ± 0.032
1.062HisGlu: 1.062 ± 0.025
0.894HisPhe: 0.894 ± 0.025
1.816HisGly: 1.816 ± 0.035
0.598HisHis: 0.598 ± 0.023
1.023HisIle: 1.023 ± 0.027
0.539HisLys: 0.539 ± 0.019
2.003HisLeu: 2.003 ± 0.033
0.549HisMet: 0.549 ± 0.017
0.477HisAsn: 0.477 ± 0.016
1.179HisPro: 1.179 ± 0.027
0.61HisGln: 0.61 ± 0.018
1.325HisArg: 1.325 ± 0.03
0.994HisSer: 0.994 ± 0.026
0.79HisThr: 0.79 ± 0.02
1.458HisVal: 1.458 ± 0.028
0.28HisTrp: 0.28 ± 0.015
0.569HisTyr: 0.569 ± 0.021
0.0HisXaa: 0.0 ± 0.0
Ile
7.893IleAla: 7.893 ± 0.072
0.587IleCys: 0.587 ± 0.015
3.993IleAsp: 3.993 ± 0.048
3.772IleGlu: 3.772 ± 0.05
2.088IlePhe: 2.088 ± 0.035
5.526IleGly: 5.526 ± 0.057
0.979IleHis: 0.979 ± 0.024
2.929IleIle: 2.929 ± 0.052
1.695IleLys: 1.695 ± 0.034
4.985IleLeu: 4.985 ± 0.069
1.227IleMet: 1.227 ± 0.026
1.636IleAsn: 1.636 ± 0.032
2.489IlePro: 2.489 ± 0.041
1.327IleGln: 1.327 ± 0.028
3.569IleArg: 3.569 ± 0.046
3.653IleSer: 3.653 ± 0.044
3.013IleThr: 3.013 ± 0.045
4.701IleVal: 4.701 ± 0.05
0.671IleTrp: 0.671 ± 0.02
1.29IleTyr: 1.29 ± 0.027
0.0IleXaa: 0.0 ± 0.0
Lys
4.959LysAla: 4.959 ± 0.058
0.156LysCys: 0.156 ± 0.008
2.11LysAsp: 2.11 ± 0.039
1.896LysGlu: 1.896 ± 0.036
1.081LysPhe: 1.081 ± 0.026
2.932LysGly: 2.932 ± 0.043
0.62LysHis: 0.62 ± 0.021
2.171LysIle: 2.171 ± 0.035
1.657LysLys: 1.657 ± 0.037
3.831LysLeu: 3.831 ± 0.054
0.976LysMet: 0.976 ± 0.027
1.124LysAsn: 1.124 ± 0.025
2.436LysPro: 2.436 ± 0.041
1.228LysGln: 1.228 ± 0.026
2.464LysArg: 2.464 ± 0.041
2.355LysSer: 2.355 ± 0.039
2.406LysThr: 2.406 ± 0.04
2.751LysVal: 2.751 ± 0.045
0.448LysTrp: 0.448 ± 0.015
0.701LysTyr: 0.701 ± 0.022
0.0LysXaa: 0.0 ± 0.0
Leu
12.514LeuAla: 12.514 ± 0.107
0.865LeuCys: 0.865 ± 0.023
5.863LeuAsp: 5.863 ± 0.066
4.866LeuGlu: 4.866 ± 0.053
3.632LeuPhe: 3.632 ± 0.048
7.885LeuGly: 7.885 ± 0.07
1.694LeuHis: 1.694 ± 0.033
5.396LeuIle: 5.396 ± 0.06
4.234LeuLys: 4.234 ± 0.051
9.011LeuLeu: 9.011 ± 0.095
2.503LeuMet: 2.503 ± 0.039
2.754LeuAsn: 2.754 ± 0.039
5.275LeuPro: 5.275 ± 0.059
2.879LeuGln: 2.879 ± 0.036
5.888LeuArg: 5.888 ± 0.062
7.354LeuSer: 7.354 ± 0.088
5.644LeuThr: 5.644 ± 0.069
7.177LeuVal: 7.177 ± 0.068
1.041LeuTrp: 1.041 ± 0.024
2.039LeuTyr: 2.039 ± 0.037
0.0LeuXaa: 0.0 ± 0.0
Met
3.347MetAla: 3.347 ± 0.046
0.143MetCys: 0.143 ± 0.008
1.271MetAsp: 1.271 ± 0.028
1.239MetGlu: 1.239 ± 0.025
0.797MetPhe: 0.797 ± 0.02
1.756MetGly: 1.756 ± 0.033
0.428MetHis: 0.428 ± 0.015
1.632MetIle: 1.632 ± 0.033
1.222MetLys: 1.222 ± 0.025
2.696MetLeu: 2.696 ± 0.041
0.771MetMet: 0.771 ± 0.022
0.869MetAsn: 0.869 ± 0.024
1.616MetPro: 1.616 ± 0.032
0.933MetGln: 0.933 ± 0.023
1.804MetArg: 1.804 ± 0.032
1.897MetSer: 1.897 ± 0.034
2.112MetThr: 2.112 ± 0.032
1.858MetVal: 1.858 ± 0.033
0.213MetTrp: 0.213 ± 0.012
0.318MetTyr: 0.318 ± 0.014
0.0MetXaa: 0.0 ± 0.0
Asn
3.381AsnAla: 3.381 ± 0.051
0.232AsnCys: 0.232 ± 0.011
1.588AsnAsp: 1.588 ± 0.033
1.375AsnGlu: 1.375 ± 0.027
1.116AsnPhe: 1.116 ± 0.029
2.638AsnGly: 2.638 ± 0.043
0.548AsnHis: 0.548 ± 0.02
1.614AsnIle: 1.614 ± 0.031
0.833AsnLys: 0.833 ± 0.021
2.79AsnLeu: 2.79 ± 0.041
0.743AsnMet: 0.743 ± 0.021
0.835AsnAsn: 0.835 ± 0.022
1.902AsnPro: 1.902 ± 0.033
0.883AsnGln: 0.883 ± 0.027
1.921AsnArg: 1.921 ± 0.034
1.554AsnSer: 1.554 ± 0.032
1.398AsnThr: 1.398 ± 0.03
2.071AsnVal: 2.071 ± 0.037
0.456AsnTrp: 0.456 ± 0.017
0.756AsnTyr: 0.756 ± 0.019
0.0AsnXaa: 0.0 ± 0.0
Pro
5.671ProAla: 5.671 ± 0.075
0.259ProCys: 0.259 ± 0.012
3.391ProAsp: 3.391 ± 0.05
3.324ProGlu: 3.324 ± 0.043
1.999ProPhe: 1.999 ± 0.037
3.93ProGly: 3.93 ± 0.048
1.045ProHis: 1.045 ± 0.024
2.5ProIle: 2.5 ± 0.04
1.932ProLys: 1.932 ± 0.041
4.474ProLeu: 4.474 ± 0.059
1.173ProMet: 1.173 ± 0.025
1.364ProAsn: 1.364 ± 0.03
2.137ProPro: 2.137 ± 0.042
1.665ProGln: 1.665 ± 0.028
2.475ProArg: 2.475 ± 0.037
2.819ProSer: 2.819 ± 0.042
2.477ProThr: 2.477 ± 0.041
4.07ProVal: 4.07 ± 0.051
0.588ProTrp: 0.588 ± 0.017
1.244ProTyr: 1.244 ± 0.026
0.0ProXaa: 0.0 ± 0.0
Gln
3.916GlnAla: 3.916 ± 0.051
0.176GlnCys: 0.176 ± 0.011
1.499GlnAsp: 1.499 ± 0.03
1.526GlnGlu: 1.526 ± 0.031
1.135GlnPhe: 1.135 ± 0.026
2.273GlnGly: 2.273 ± 0.035
0.627GlnHis: 0.627 ± 0.019
2.023GlnIle: 2.023 ± 0.044
1.342GlnLys: 1.342 ± 0.026
2.862GlnLeu: 2.862 ± 0.047
1.008GlnMet: 1.008 ± 0.023
1.035GlnAsn: 1.035 ± 0.024
1.734GlnPro: 1.734 ± 0.034
1.454GlnGln: 1.454 ± 0.042
2.199GlnArg: 2.199 ± 0.037
2.031GlnSer: 2.031 ± 0.033
1.816GlnThr: 1.816 ± 0.032
2.135GlnVal: 2.135 ± 0.043
0.398GlnTrp: 0.398 ± 0.013
0.639GlnTyr: 0.639 ± 0.018
0.0GlnXaa: 0.0 ± 0.0
Arg
6.46ArgAla: 6.46 ± 0.067
0.428ArgCys: 0.428 ± 0.017
3.77ArgAsp: 3.77 ± 0.05
3.714ArgGlu: 3.714 ± 0.052
2.73ArgPhe: 2.73 ± 0.037
4.272ArgGly: 4.272 ± 0.056
1.65ArgHis: 1.65 ± 0.03
3.953ArgIle: 3.953 ± 0.047
2.564ArgLys: 2.564 ± 0.044
7.067ArgLeu: 7.067 ± 0.068
1.93ArgMet: 1.93 ± 0.033
2.035ArgAsn: 2.035 ± 0.035
3.118ArgPro: 3.118 ± 0.046
2.648ArgGln: 2.648 ± 0.041
4.895ArgArg: 4.895 ± 0.069
3.793ArgSer: 3.793 ± 0.045
3.092ArgThr: 3.092 ± 0.043
4.113ArgVal: 4.113 ± 0.052
0.845ArgTrp: 0.845 ± 0.02
1.737ArgTyr: 1.737 ± 0.033
0.0ArgXaa: 0.0 ± 0.0
Ser
6.697SerAla: 6.697 ± 0.073
0.415SerCys: 0.415 ± 0.016
3.384SerAsp: 3.384 ± 0.042
3.137SerGlu: 3.137 ± 0.044
2.581SerPhe: 2.581 ± 0.036
5.895SerGly: 5.895 ± 0.071
1.172SerHis: 1.172 ± 0.026
3.361SerIle: 3.361 ± 0.048
2.017SerLys: 2.017 ± 0.035
6.082SerLeu: 6.082 ± 0.063
1.536SerMet: 1.536 ± 0.029
1.663SerAsn: 1.663 ± 0.032
2.868SerPro: 2.868 ± 0.044
1.914SerGln: 1.914 ± 0.037
3.787SerArg: 3.787 ± 0.051
3.8SerSer: 3.8 ± 0.06
3.163SerThr: 3.163 ± 0.072
4.385SerVal: 4.385 ± 0.052
0.789SerTrp: 0.789 ± 0.02
1.518SerTyr: 1.518 ± 0.027
0.0SerXaa: 0.0 ± 0.0
Thr
6.298ThrAla: 6.298 ± 0.078
0.397ThrCys: 0.397 ± 0.015
2.995ThrAsp: 2.995 ± 0.049
2.665ThrGlu: 2.665 ± 0.038
2.1ThrPhe: 2.1 ± 0.04
5.031ThrGly: 5.031 ± 0.063
0.987ThrHis: 0.987 ± 0.025
3.339ThrIle: 3.339 ± 0.042
1.807ThrLys: 1.807 ± 0.03
5.546ThrLeu: 5.546 ± 0.053
1.314ThrMet: 1.314 ± 0.026
1.457ThrAsn: 1.457 ± 0.034
3.032ThrPro: 3.032 ± 0.048
1.39ThrGln: 1.39 ± 0.031
3.126ThrArg: 3.126 ± 0.041
3.31ThrSer: 3.31 ± 0.045
3.052ThrThr: 3.052 ± 0.055
4.542ThrVal: 4.542 ± 0.064
0.64ThrTrp: 0.64 ± 0.018
1.312ThrTyr: 1.312 ± 0.028
0.0ThrXaa: 0.0 ± 0.0
Val
8.745ValAla: 8.745 ± 0.082
0.599ValCys: 0.599 ± 0.019
4.039ValAsp: 4.039 ± 0.05
4.348ValGlu: 4.348 ± 0.053
2.835ValPhe: 2.835 ± 0.043
5.346ValGly: 5.346 ± 0.065
1.288ValHis: 1.288 ± 0.026
4.445ValIle: 4.445 ± 0.054
2.615ValLys: 2.615 ± 0.042
7.039ValLeu: 7.039 ± 0.072
1.934ValMet: 1.934 ± 0.034
2.063ValAsn: 2.063 ± 0.034
3.394ValPro: 3.394 ± 0.045
1.873ValGln: 1.873 ± 0.034
4.321ValArg: 4.321 ± 0.058
4.918ValSer: 4.918 ± 0.053
4.616ValThr: 4.616 ± 0.058
5.756ValVal: 5.756 ± 0.065
0.834ValTrp: 0.834 ± 0.022
1.497ValTyr: 1.497 ± 0.035
0.0ValXaa: 0.0 ± 0.0
Trp
1.078TrpAla: 1.078 ± 0.025
0.116TrpCys: 0.116 ± 0.008
0.624TrpAsp: 0.624 ± 0.018
0.543TrpGlu: 0.543 ± 0.018
0.528TrpPhe: 0.528 ± 0.016
0.81TrpGly: 0.81 ± 0.02
0.304TrpHis: 0.304 ± 0.015
0.683TrpIle: 0.683 ± 0.021
0.542TrpLys: 0.542 ± 0.02
1.505TrpLeu: 1.505 ± 0.029
0.352TrpMet: 0.352 ± 0.016
0.485TrpAsn: 0.485 ± 0.017
0.635TrpPro: 0.635 ± 0.021
0.558TrpGln: 0.558 ± 0.018
0.987TrpArg: 0.987 ± 0.023
0.829TrpSer: 0.829 ± 0.023
0.721TrpThr: 0.721 ± 0.022
0.736TrpVal: 0.736 ± 0.019
0.189TrpTrp: 0.189 ± 0.012
0.302TrpTyr: 0.302 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.474TyrAla: 2.474 ± 0.04
0.231TyrCys: 0.231 ± 0.013
1.485TyrAsp: 1.485 ± 0.03
1.237TyrGlu: 1.237 ± 0.027
0.972TyrPhe: 0.972 ± 0.027
2.139TyrGly: 2.139 ± 0.035
0.496TyrHis: 0.496 ± 0.016
1.051TyrIle: 1.051 ± 0.025
0.731TyrLys: 0.731 ± 0.021
2.372TyrLeu: 2.372 ± 0.044
0.499TyrMet: 0.499 ± 0.016
0.663TyrAsn: 0.663 ± 0.023
1.139TyrPro: 1.139 ± 0.024
0.805TyrGln: 0.805 ± 0.021
1.741TyrArg: 1.741 ± 0.032
1.365TyrSer: 1.365 ± 0.03
1.142TyrThr: 1.142 ± 0.027
1.649TyrVal: 1.649 ± 0.031
0.368TyrTrp: 0.368 ± 0.016
0.655TyrTyr: 0.655 ± 0.021
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5901 proteins (1837426 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski