Amino acid dipeptide frequency for Rhizobium sp. 24NR

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
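
As an illustration only, the calculation can be sketched in Python as follows. The sketch assumes one-letter residue codes and counts overlapping pairs within each protein (pairs never span two proteins); the function name `dipeptide_permille` and the toy sequences are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping adjacent pairs are counted inside each sequence,
    and the frequency is normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        # Every position except the last one starts a dipeptide.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not real Rhizobium sequences.
toy_proteins = ["MAAL", "AALG"]
toy_freqs = dipeptide_permille(toy_proteins)
```

For the toy input, the six pairs are MA, AA, AL, AA, AL and LG, so AA and AL each account for two sixths of all pairs.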

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
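
The comparison against the uniform expectation can be sketched as below. This is a minimal sketch that assumes a plain greater-than or less-than comparison with 1000/400 = 2.5‰; the exact rule used for colouring the table is not stated here, and the helper names are hypothetical. The three example values are taken from the table.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_permille(alphabet_size)
    if observed_permille > expected:
        return "over"   # would be marked in red
    if observed_permille < expected:
        return "under"  # would be marked in blue
    return "as expected"


# Values in per mille, copied from the table for this proteome.
examples = {"AlaAla": 15.295, "CysCys": 0.087, "AspVal": 4.308}
labels = {pair: classify(value) for pair, value in examples.items()}
```

Passing `alphabet_size=21` gives the expectation for the 441-dipeptide case that includes Xaa.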

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
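
A short worked conversion, using the AlaAla value from the table, may make the unit concrete. The estimate of the absolute count rests on an assumption that is not stated on this page: that the total number of dipeptides equals the number of amino acids minus the number of proteins (overlapping pairs within each protein). All variable names are illustrative.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction."""
    return value_permille * 1e-3


ala_ala_fraction = permille_to_fraction(15.295)  # AlaAla value from the table
ala_ala_percent = 100.0 * ala_ala_fraction

# Totals reported for this proteome.
n_proteins = 4834
n_residues = 1524728

# Assumption: each protein of length L contributes L - 1 overlapping pairs.
n_pairs_assumed = n_residues - n_proteins
approx_ala_ala_count = round(ala_ala_fraction * n_pairs_assumed)
```

The resulting count is only a rough estimate, since the table values are rounded and the normalisation used by the database may differ.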

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.295 ± 0.152
AlaCys: 0.868 ± 0.027
AlaAsp: 6.671 ± 0.073
AlaGlu: 7.21 ± 0.078
AlaPhe: 4.432 ± 0.058
AlaGly: 9.862 ± 0.083
AlaHis: 2.035 ± 0.043
AlaIle: 6.772 ± 0.076
AlaLys: 4.395 ± 0.056
AlaLeu: 12.377 ± 0.127
AlaMet: 3.488 ± 0.054
AlaAsn: 2.955 ± 0.048
AlaPro: 4.685 ± 0.067
AlaGln: 3.738 ± 0.057
AlaArg: 7.37 ± 0.088
AlaSer: 6.816 ± 0.068
AlaThr: 5.66 ± 0.07
AlaVal: 8.557 ± 0.071
AlaTrp: 1.327 ± 0.031
AlaTyr: 2.561 ± 0.039
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.725 ± 0.02
CysCys: 0.087 ± 0.009
CysAsp: 0.48 ± 0.017
CysGlu: 0.389 ± 0.016
CysPhe: 0.31 ± 0.015
CysGly: 0.851 ± 0.026
CysHis: 0.203 ± 0.013
CysIle: 0.396 ± 0.017
CysLys: 0.164 ± 0.01
CysLeu: 0.765 ± 0.025
CysMet: 0.15 ± 0.009
CysAsn: 0.205 ± 0.011
CysPro: 0.357 ± 0.019
CysGln: 0.218 ± 0.013
CysArg: 0.528 ± 0.02
CysSer: 0.425 ± 0.018
CysThr: 0.356 ± 0.015
CysVal: 0.565 ± 0.022
CysTrp: 0.099 ± 0.008
CysTyr: 0.186 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.433 ± 0.071
AspCys: 0.472 ± 0.017
AspAsp: 3.032 ± 0.043
AspGlu: 3.514 ± 0.055
AspPhe: 2.333 ± 0.041
AspGly: 4.946 ± 0.07
AspHis: 1.233 ± 0.03
AspIle: 3.463 ± 0.049
AspLys: 1.962 ± 0.042
AspLeu: 6.106 ± 0.067
AspMet: 1.504 ± 0.033
AspAsn: 1.472 ± 0.033
AspPro: 3.262 ± 0.05
AspGln: 1.877 ± 0.043
AspArg: 4.119 ± 0.064
AspSer: 2.226 ± 0.041
AspThr: 2.542 ± 0.047
AspVal: 4.308 ± 0.055
AspTrp: 0.897 ± 0.028
AspTyr: 1.534 ± 0.034
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.249 ± 0.081
GluCys: 0.318 ± 0.014
GluAsp: 3.008 ± 0.049
GluGlu: 3.453 ± 0.057
GluPhe: 1.893 ± 0.036
GluGly: 4.224 ± 0.059
GluHis: 1.198 ± 0.028
GluIle: 3.791 ± 0.053
GluLys: 2.675 ± 0.051
GluLeu: 5.291 ± 0.065
GluMet: 1.547 ± 0.031
GluAsn: 1.798 ± 0.037
GluPro: 2.666 ± 0.051
GluGln: 2.161 ± 0.047
GluArg: 4.542 ± 0.076
GluSer: 2.374 ± 0.04
GluThr: 3.655 ± 0.056
GluVal: 3.732 ± 0.052
GluTrp: 0.685 ± 0.023
GluTyr: 1.009 ± 0.026
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.392 ± 0.063
PheCys: 0.377 ± 0.015
PheAsp: 2.736 ± 0.043
PheGlu: 2.218 ± 0.035
PhePhe: 1.549 ± 0.036
PheGly: 3.809 ± 0.057
PheHis: 0.795 ± 0.023
PheIle: 1.936 ± 0.039
PheLys: 1.163 ± 0.033
PheLeu: 3.689 ± 0.058
PheMet: 0.851 ± 0.025
PheAsn: 1.135 ± 0.028
PhePro: 1.654 ± 0.032
PheGln: 1.146 ± 0.026
PheArg: 2.267 ± 0.044
PheSer: 2.637 ± 0.041
PheThr: 1.976 ± 0.037
PheVal: 2.832 ± 0.047
PheTrp: 0.551 ± 0.017
PheTyr: 0.951 ± 0.029
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.266 ± 0.089
GlyCys: 0.74 ± 0.026
GlyAsp: 4.199 ± 0.064
GlyGlu: 4.765 ± 0.055
GlyPhe: 3.663 ± 0.056
GlyGly: 6.819 ± 0.093
GlyHis: 1.846 ± 0.042
GlyIle: 5.183 ± 0.067
GlyLys: 3.622 ± 0.045
GlyLeu: 8.702 ± 0.089
GlyMet: 2.331 ± 0.044
GlyAsn: 2.336 ± 0.039
GlyPro: 3.22 ± 0.054
GlyGln: 2.913 ± 0.047
GlyArg: 5.735 ± 0.061
GlySer: 5.023 ± 0.059
GlyThr: 4.586 ± 0.063
GlyVal: 5.701 ± 0.068
GlyTrp: 1.236 ± 0.031
GlyTyr: 2.287 ± 0.038
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.073 ± 0.039
HisCys: 0.211 ± 0.014
HisAsp: 1.27 ± 0.035
HisGlu: 1.097 ± 0.03
HisPhe: 0.883 ± 0.023
HisGly: 1.836 ± 0.038
HisHis: 0.577 ± 0.021
HisIle: 1.008 ± 0.025
HisLys: 0.502 ± 0.018
HisLeu: 2.021 ± 0.033
HisMet: 0.529 ± 0.018
HisAsn: 0.485 ± 0.017
HisPro: 1.303 ± 0.029
HisGln: 0.618 ± 0.021
HisArg: 1.344 ± 0.03
HisSer: 1.024 ± 0.026
HisThr: 0.789 ± 0.022
HisVal: 1.521 ± 0.031
HisTrp: 0.279 ± 0.014
HisTyr: 0.552 ± 0.021
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.694 ± 0.091
IleCys: 0.534 ± 0.02
IleAsp: 3.795 ± 0.049
IleGlu: 3.637 ± 0.055
IlePhe: 2.096 ± 0.037
IleGly: 5.427 ± 0.072
IleHis: 0.985 ± 0.026
IleIle: 2.745 ± 0.05
IleLys: 1.764 ± 0.033
IleLeu: 5.08 ± 0.062
IleMet: 1.211 ± 0.033
IleAsn: 1.604 ± 0.037
IlePro: 2.497 ± 0.044
IleGln: 1.389 ± 0.028
IleArg: 3.614 ± 0.044
IleSer: 3.597 ± 0.051
IleThr: 2.919 ± 0.044
IleVal: 4.59 ± 0.062
IleTrp: 0.651 ± 0.02
IleTyr: 1.275 ± 0.027
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.71 ± 0.062
LysCys: 0.132 ± 0.01
LysAsp: 2.0 ± 0.037
LysGlu: 1.866 ± 0.038
LysPhe: 1.026 ± 0.025
LysGly: 2.881 ± 0.054
LysHis: 0.677 ± 0.021
LysIle: 2.142 ± 0.043
LysLys: 1.508 ± 0.04
LysLeu: 3.775 ± 0.06
LysMet: 0.925 ± 0.023
LysAsn: 1.092 ± 0.029
LysPro: 2.309 ± 0.051
LysGln: 1.211 ± 0.033
LysArg: 2.469 ± 0.037
LysSer: 2.231 ± 0.041
LysThr: 2.265 ± 0.039
LysVal: 2.686 ± 0.052
LysTrp: 0.393 ± 0.014
LysTyr: 0.691 ± 0.022
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.596 ± 0.104
LeuCys: 0.824 ± 0.025
LeuAsp: 6.04 ± 0.068
LeuGlu: 5.224 ± 0.056
LeuPhe: 3.676 ± 0.059
LeuGly: 8.092 ± 0.09
LeuHis: 1.798 ± 0.035
LeuIle: 5.311 ± 0.075
LeuLys: 4.092 ± 0.057
LeuLeu: 9.407 ± 0.12
LeuMet: 2.462 ± 0.042
LeuAsn: 2.754 ± 0.047
LeuPro: 5.409 ± 0.063
LeuGln: 3.003 ± 0.042
LeuArg: 6.268 ± 0.076
LeuSer: 7.351 ± 0.105
LeuThr: 5.607 ± 0.067
LeuVal: 7.488 ± 0.089
LeuTrp: 1.084 ± 0.025
LeuTyr: 2.071 ± 0.037
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.126 ± 0.05
MetCys: 0.13 ± 0.009
MetAsp: 1.24 ± 0.027
MetGlu: 1.284 ± 0.028
MetPhe: 0.712 ± 0.025
MetGly: 1.771 ± 0.038
MetHis: 0.378 ± 0.016
MetIle: 1.594 ± 0.034
MetLys: 1.15 ± 0.026
MetLeu: 2.634 ± 0.044
MetMet: 0.698 ± 0.025
MetAsn: 0.892 ± 0.025
MetPro: 1.55 ± 0.033
MetGln: 0.954 ± 0.026
MetArg: 1.793 ± 0.036
MetSer: 1.808 ± 0.034
MetThr: 1.951 ± 0.036
MetVal: 1.825 ± 0.034
MetTrp: 0.182 ± 0.011
MetTyr: 0.316 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.195 ± 0.056
AsnCys: 0.238 ± 0.013
AsnAsp: 1.533 ± 0.03
AsnGlu: 1.38 ± 0.035
AsnPhe: 1.099 ± 0.026
AsnGly: 2.616 ± 0.043
AsnHis: 0.557 ± 0.019
AsnIle: 1.562 ± 0.033
AsnLys: 0.788 ± 0.023
AsnLeu: 2.846 ± 0.046
AsnMet: 0.717 ± 0.025
AsnAsn: 0.836 ± 0.03
AsnPro: 1.903 ± 0.038
AsnGln: 0.868 ± 0.023
AsnArg: 1.967 ± 0.037
AsnSer: 1.499 ± 0.032
AsnThr: 1.35 ± 0.029
AsnVal: 2.031 ± 0.036
AsnTrp: 0.452 ± 0.019
AsnTyr: 0.732 ± 0.024
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.692 ± 0.072
ProCys: 0.252 ± 0.014
ProAsp: 3.441 ± 0.042
ProGlu: 3.459 ± 0.054
ProPhe: 2.028 ± 0.039
ProGly: 4.002 ± 0.058
ProHis: 1.035 ± 0.026
ProIle: 2.416 ± 0.043
ProLys: 1.89 ± 0.043
ProLeu: 4.653 ± 0.06
ProMet: 1.145 ± 0.03
ProAsn: 1.409 ± 0.033
ProPro: 2.216 ± 0.042
ProGln: 1.782 ± 0.039
ProArg: 2.528 ± 0.042
ProSer: 2.98 ± 0.045
ProThr: 2.36 ± 0.039
ProVal: 4.231 ± 0.053
ProTrp: 0.59 ± 0.018
ProTyr: 1.19 ± 0.033
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.154 ± 0.066
GlnCys: 0.167 ± 0.011
GlnAsp: 1.621 ± 0.032
GlnGlu: 1.703 ± 0.032
GlnPhe: 1.083 ± 0.027
GlnGly: 2.37 ± 0.038
GlnHis: 0.656 ± 0.02
GlnIle: 2.064 ± 0.061
GlnLys: 1.297 ± 0.029
GlnLeu: 2.953 ± 0.047
GlnMet: 0.99 ± 0.026
GlnAsn: 0.978 ± 0.025
GlnPro: 1.857 ± 0.044
GlnGln: 1.367 ± 0.038
GlnArg: 2.248 ± 0.046
GlnSer: 2.021 ± 0.042
GlnThr: 1.811 ± 0.038
GlnVal: 2.322 ± 0.056
GlnTrp: 0.394 ± 0.018
GlnTyr: 0.63 ± 0.021
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.482 ± 0.076
ArgCys: 0.407 ± 0.017
ArgAsp: 3.818 ± 0.055
ArgGlu: 3.918 ± 0.055
ArgPhe: 2.81 ± 0.044
ArgGly: 4.316 ± 0.06
ArgHis: 1.646 ± 0.04
ArgIle: 4.098 ± 0.049
ArgLys: 2.546 ± 0.04
ArgLeu: 7.42 ± 0.091
ArgMet: 1.909 ± 0.035
ArgAsn: 1.981 ± 0.037
ArgPro: 3.096 ± 0.05
ArgGln: 2.69 ± 0.043
ArgArg: 5.015 ± 0.077
ArgSer: 3.886 ± 0.052
ArgThr: 3.165 ± 0.048
ArgVal: 4.216 ± 0.055
ArgTrp: 0.847 ± 0.024
ArgTyr: 1.726 ± 0.036
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.561 ± 0.082
SerCys: 0.388 ± 0.016
SerAsp: 3.374 ± 0.048
SerGlu: 3.125 ± 0.049
SerPhe: 2.632 ± 0.042
SerGly: 5.998 ± 0.067
SerHis: 1.171 ± 0.028
SerIle: 3.351 ± 0.048
SerLys: 1.953 ± 0.039
SerLeu: 6.064 ± 0.065
SerMet: 1.508 ± 0.033
SerAsn: 1.642 ± 0.037
SerPro: 2.915 ± 0.047
SerGln: 1.851 ± 0.036
SerArg: 3.925 ± 0.053
SerSer: 3.778 ± 0.061
SerThr: 3.048 ± 0.075
SerVal: 4.417 ± 0.05
SerTrp: 0.773 ± 0.023
SerTyr: 1.423 ± 0.032
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.226 ± 0.074
ThrCys: 0.386 ± 0.016
ThrAsp: 2.897 ± 0.052
ThrGlu: 2.713 ± 0.045
ThrPhe: 2.034 ± 0.043
ThrGly: 5.035 ± 0.06
ThrHis: 1.001 ± 0.027
ThrIle: 3.241 ± 0.048
ThrLys: 1.71 ± 0.037
ThrLeu: 5.512 ± 0.073
ThrMet: 1.273 ± 0.032
ThrAsn: 1.417 ± 0.032
ThrPro: 2.973 ± 0.046
ThrGln: 1.475 ± 0.04
ThrArg: 3.016 ± 0.043
ThrSer: 3.175 ± 0.052
ThrThr: 2.878 ± 0.052
ThrVal: 4.53 ± 0.067
ThrTrp: 0.631 ± 0.023
ThrTyr: 1.264 ± 0.031
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.705 ± 0.085
ValCys: 0.594 ± 0.022
ValAsp: 3.971 ± 0.063
ValGlu: 4.565 ± 0.059
ValPhe: 2.905 ± 0.042
ValGly: 5.423 ± 0.07
ValHis: 1.358 ± 0.031
ValIle: 4.358 ± 0.071
ValLys: 2.579 ± 0.045
ValLeu: 7.386 ± 0.092
ValMet: 1.938 ± 0.037
ValAsn: 2.057 ± 0.039
ValPro: 3.628 ± 0.049
ValGln: 2.078 ± 0.039
ValArg: 4.533 ± 0.057
ValSer: 4.896 ± 0.064
ValThr: 4.668 ± 0.057
ValVal: 5.93 ± 0.063
ValTrp: 0.826 ± 0.025
ValTyr: 1.484 ± 0.033
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.072 ± 0.027
TrpCys: 0.116 ± 0.009
TrpAsp: 0.607 ± 0.021
TrpGlu: 0.527 ± 0.02
TrpPhe: 0.538 ± 0.018
TrpGly: 0.817 ± 0.026
TrpHis: 0.298 ± 0.015
TrpIle: 0.657 ± 0.02
TrpLys: 0.508 ± 0.018
TrpLeu: 1.54 ± 0.036
TrpMet: 0.338 ± 0.014
TrpAsn: 0.496 ± 0.017
TrpPro: 0.661 ± 0.021
TrpGln: 0.573 ± 0.019
TrpArg: 0.994 ± 0.024
TrpSer: 0.826 ± 0.026
TrpThr: 0.693 ± 0.021
TrpVal: 0.72 ± 0.021
TrpTrp: 0.204 ± 0.011
TrpTyr: 0.284 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.429 ± 0.04
TyrCys: 0.22 ± 0.012
TyrAsp: 1.499 ± 0.032
TyrGlu: 1.224 ± 0.027
TyrPhe: 0.961 ± 0.023
TyrGly: 2.083 ± 0.039
TyrHis: 0.475 ± 0.017
TyrIle: 1.016 ± 0.024
TyrLys: 0.72 ± 0.025
TyrLeu: 2.337 ± 0.04
TyrMet: 0.47 ± 0.018
TyrAsn: 0.623 ± 0.02
TyrPro: 1.14 ± 0.03
TyrGln: 0.771 ± 0.02
TyrArg: 1.715 ± 0.033
TyrSer: 1.325 ± 0.032
TyrThr: 1.108 ± 0.033
TyrVal: 1.679 ± 0.035
TyrTrp: 0.372 ± 0.016
TyrTyr: 0.625 ± 0.02
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4834 proteins (1524728 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
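
A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the error is taken as the standard deviation across resamples. The page does not say which spread measure is used, and the function names and toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_permille(sequences, pair):
    """Per-mille frequency of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of the frequency across resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return mean(estimates), stdev(estimates)


# Hypothetical toy proteins in one-letter code.
toy_proteins = ["MAALG", "AAAL", "MKLV", "GALA", "LLAA"]
boot_mean, boot_err = bootstrap_error(toy_proteins, "AA")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, which is presumably why the error is quoted "at the protein level".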

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski