Amino acid dipepetide frequency for Ochrobactrum anthropi (strain ATCC 49188 / DSM 6882 / JCM 21032 / NBRC 15819 / NCTC 12168)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.936AlaAla: 13.936 ± 0.132
0.898AlaCys: 0.898 ± 0.026
6.222AlaAsp: 6.222 ± 0.066
7.041AlaGlu: 7.041 ± 0.08
4.436AlaPhe: 4.436 ± 0.061
9.335AlaGly: 9.335 ± 0.111
1.994AlaHis: 1.994 ± 0.034
6.921AlaIle: 6.921 ± 0.086
4.621AlaLys: 4.621 ± 0.075
11.923AlaLeu: 11.923 ± 0.099
3.266AlaMet: 3.266 ± 0.052
3.195AlaAsn: 3.195 ± 0.054
4.502AlaPro: 4.502 ± 0.061
3.608AlaGln: 3.608 ± 0.056
6.981AlaArg: 6.981 ± 0.069
6.518AlaSer: 6.518 ± 0.074
5.52AlaThr: 5.52 ± 0.053
7.995AlaVal: 7.995 ± 0.085
1.23AlaTrp: 1.23 ± 0.032
2.635AlaTyr: 2.635 ± 0.042
0.0AlaXaa: 0.0 ± 0.0
Cys
0.812CysAla: 0.812 ± 0.027
0.112CysCys: 0.112 ± 0.009
0.488CysAsp: 0.488 ± 0.02
0.404CysGlu: 0.404 ± 0.016
0.34CysPhe: 0.34 ± 0.016
0.872CysGly: 0.872 ± 0.024
0.214CysHis: 0.214 ± 0.012
0.431CysIle: 0.431 ± 0.017
0.234CysLys: 0.234 ± 0.013
0.724CysLeu: 0.724 ± 0.021
0.165CysMet: 0.165 ± 0.01
0.202CysAsn: 0.202 ± 0.012
0.389CysPro: 0.389 ± 0.019
0.21CysGln: 0.21 ± 0.012
0.515CysArg: 0.515 ± 0.019
0.46CysSer: 0.46 ± 0.015
0.395CysThr: 0.395 ± 0.017
0.538CysVal: 0.538 ± 0.018
0.103CysTrp: 0.103 ± 0.009
0.179CysTyr: 0.179 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
6.206AspAla: 6.206 ± 0.078
0.463AspCys: 0.463 ± 0.019
3.118AspAsp: 3.118 ± 0.057
3.744AspGlu: 3.744 ± 0.055
2.267AspPhe: 2.267 ± 0.041
4.901AspGly: 4.901 ± 0.068
1.208AspHis: 1.208 ± 0.033
3.523AspIle: 3.523 ± 0.048
2.293AspLys: 2.293 ± 0.042
5.383AspLeu: 5.383 ± 0.065
1.472AspMet: 1.472 ± 0.031
1.711AspAsn: 1.711 ± 0.036
2.985AspPro: 2.985 ± 0.047
1.781AspGln: 1.781 ± 0.038
3.986AspArg: 3.986 ± 0.058
2.238AspSer: 2.238 ± 0.049
2.563AspThr: 2.563 ± 0.045
4.115AspVal: 4.115 ± 0.055
0.909AspTrp: 0.909 ± 0.027
1.579AspTyr: 1.579 ± 0.036
0.0AspXaa: 0.0 ± 0.0
Glu
7.102GluAla: 7.102 ± 0.088
0.355GluCys: 0.355 ± 0.016
2.803GluAsp: 2.803 ± 0.045
3.557GluGlu: 3.557 ± 0.059
1.896GluPhe: 1.896 ± 0.038
4.227GluGly: 4.227 ± 0.054
1.214GluHis: 1.214 ± 0.026
3.84GluIle: 3.84 ± 0.048
3.295GluLys: 3.295 ± 0.055
5.532GluLeu: 5.532 ± 0.07
1.61GluMet: 1.61 ± 0.038
2.122GluAsn: 2.122 ± 0.041
2.637GluPro: 2.637 ± 0.044
2.278GluGln: 2.278 ± 0.04
4.694GluArg: 4.694 ± 0.065
2.431GluSer: 2.431 ± 0.043
3.764GluThr: 3.764 ± 0.049
3.629GluVal: 3.629 ± 0.047
0.723GluTrp: 0.723 ± 0.023
1.135GluTyr: 1.135 ± 0.029
0.0GluXaa: 0.0 ± 0.0
Phe
4.483PheAla: 4.483 ± 0.07
0.383PheCys: 0.383 ± 0.019
2.706PheAsp: 2.706 ± 0.045
2.222PheGlu: 2.222 ± 0.045
1.615PhePhe: 1.615 ± 0.042
3.774PheGly: 3.774 ± 0.055
0.804PheHis: 0.804 ± 0.024
2.167PheIle: 2.167 ± 0.044
1.253PheLys: 1.253 ± 0.036
3.699PheLeu: 3.699 ± 0.058
0.899PheMet: 0.899 ± 0.025
1.331PheAsn: 1.331 ± 0.032
1.608PhePro: 1.608 ± 0.033
1.096PheGln: 1.096 ± 0.027
2.258PheArg: 2.258 ± 0.045
2.657PheSer: 2.657 ± 0.042
2.092PheThr: 2.092 ± 0.04
2.941PheVal: 2.941 ± 0.051
0.59PheTrp: 0.59 ± 0.022
1.032PheTyr: 1.032 ± 0.029
0.0PheXaa: 0.0 ± 0.0
Gly
8.072GlyAla: 8.072 ± 0.094
0.758GlyCys: 0.758 ± 0.024
4.196GlyAsp: 4.196 ± 0.067
4.713GlyGlu: 4.713 ± 0.058
3.739GlyPhe: 3.739 ± 0.05
6.848GlyGly: 6.848 ± 0.133
1.713GlyHis: 1.713 ± 0.044
5.25GlyIle: 5.25 ± 0.057
4.071GlyLys: 4.071 ± 0.061
8.456GlyLeu: 8.456 ± 0.089
2.268GlyMet: 2.268 ± 0.039
2.623GlyAsn: 2.623 ± 0.052
2.921GlyPro: 2.921 ± 0.043
2.698GlyGln: 2.698 ± 0.046
5.159GlyArg: 5.159 ± 0.063
4.817GlySer: 4.817 ± 0.073
4.616GlyThr: 4.616 ± 0.126
5.946GlyVal: 5.946 ± 0.067
1.268GlyTrp: 1.268 ± 0.029
2.334GlyTyr: 2.334 ± 0.046
0.0GlyXaa: 0.0 ± 0.0
His
1.961HisAla: 1.961 ± 0.038
0.203HisCys: 0.203 ± 0.012
1.237HisAsp: 1.237 ± 0.034
1.147HisGlu: 1.147 ± 0.028
0.877HisPhe: 0.877 ± 0.025
1.785HisGly: 1.785 ± 0.035
0.532HisHis: 0.532 ± 0.024
1.091HisIle: 1.091 ± 0.025
0.632HisLys: 0.632 ± 0.022
1.935HisLeu: 1.935 ± 0.04
0.53HisMet: 0.53 ± 0.016
0.558HisAsn: 0.558 ± 0.021
1.187HisPro: 1.187 ± 0.028
0.592HisGln: 0.592 ± 0.02
1.326HisArg: 1.326 ± 0.036
1.036HisSer: 1.036 ± 0.027
0.808HisThr: 0.808 ± 0.023
1.425HisVal: 1.425 ± 0.033
0.311HisTrp: 0.311 ± 0.014
0.575HisTyr: 0.575 ± 0.022
0.0HisXaa: 0.0 ± 0.0
Ile
7.951IleAla: 7.951 ± 0.079
0.566IleCys: 0.566 ± 0.019
3.92IleAsp: 3.92 ± 0.055
3.986IleGlu: 3.986 ± 0.053
2.149IlePhe: 2.149 ± 0.043
5.415IleGly: 5.415 ± 0.068
1.039IleHis: 1.039 ± 0.026
3.18IleIle: 3.18 ± 0.058
1.921IleLys: 1.921 ± 0.039
5.196IleLeu: 5.196 ± 0.064
1.227IleMet: 1.227 ± 0.028
1.876IleAsn: 1.876 ± 0.04
2.549IlePro: 2.549 ± 0.041
1.446IleGln: 1.446 ± 0.032
3.519IleArg: 3.519 ± 0.053
3.594IleSer: 3.594 ± 0.05
2.97IleThr: 2.97 ± 0.046
4.794IleVal: 4.794 ± 0.05
0.663IleTrp: 0.663 ± 0.02
1.346IleTyr: 1.346 ± 0.035
0.0IleXaa: 0.0 ± 0.0
Lys
5.015LysAla: 5.015 ± 0.063
0.174LysCys: 0.174 ± 0.011
2.258LysAsp: 2.258 ± 0.047
2.109LysGlu: 2.109 ± 0.038
1.137LysPhe: 1.137 ± 0.028
3.188LysGly: 3.188 ± 0.053
0.75LysHis: 0.75 ± 0.025
2.335LysIle: 2.335 ± 0.045
1.88LysLys: 1.88 ± 0.045
4.217LysLeu: 4.217 ± 0.05
1.038LysMet: 1.038 ± 0.026
1.366LysAsn: 1.366 ± 0.031
2.592LysPro: 2.592 ± 0.049
1.447LysGln: 1.447 ± 0.031
2.908LysArg: 2.908 ± 0.05
2.433LysSer: 2.433 ± 0.042
2.432LysThr: 2.432 ± 0.041
2.813LysVal: 2.813 ± 0.049
0.459LysTrp: 0.459 ± 0.017
0.8LysTyr: 0.8 ± 0.026
0.0LysXaa: 0.0 ± 0.0
Leu
11.865LeuAla: 11.865 ± 0.108
0.838LeuCys: 0.838 ± 0.023
5.775LeuAsp: 5.775 ± 0.064
5.374LeuGlu: 5.374 ± 0.077
3.847LeuPhe: 3.847 ± 0.058
7.936LeuGly: 7.936 ± 0.091
1.789LeuHis: 1.789 ± 0.04
5.6LeuIle: 5.6 ± 0.069
4.422LeuLys: 4.422 ± 0.052
9.255LeuLeu: 9.255 ± 0.111
2.457LeuMet: 2.457 ± 0.043
3.079LeuAsn: 3.079 ± 0.048
5.169LeuPro: 5.169 ± 0.066
2.924LeuGln: 2.924 ± 0.044
5.976LeuArg: 5.976 ± 0.07
6.785LeuSer: 6.785 ± 0.071
5.543LeuThr: 5.543 ± 0.075
7.191LeuVal: 7.191 ± 0.08
1.044LeuTrp: 1.044 ± 0.029
2.19LeuTyr: 2.19 ± 0.04
0.0LeuXaa: 0.0 ± 0.0
Met
2.962MetAla: 2.962 ± 0.047
0.152MetCys: 0.152 ± 0.01
1.14MetAsp: 1.14 ± 0.027
1.245MetGlu: 1.245 ± 0.032
0.761MetPhe: 0.761 ± 0.024
1.783MetGly: 1.783 ± 0.034
0.465MetHis: 0.465 ± 0.016
1.578MetIle: 1.578 ± 0.04
1.238MetLys: 1.238 ± 0.034
2.549MetLeu: 2.549 ± 0.041
0.729MetMet: 0.729 ± 0.026
0.954MetAsn: 0.954 ± 0.023
1.491MetPro: 1.491 ± 0.034
0.957MetGln: 0.957 ± 0.03
1.876MetArg: 1.876 ± 0.038
1.771MetSer: 1.771 ± 0.036
1.89MetThr: 1.89 ± 0.035
1.749MetVal: 1.749 ± 0.04
0.228MetTrp: 0.228 ± 0.013
0.334MetTyr: 0.334 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
3.498AsnAla: 3.498 ± 0.058
0.237AsnCys: 0.237 ± 0.013
1.787AsnAsp: 1.787 ± 0.036
1.64AsnGlu: 1.64 ± 0.034
1.155AsnPhe: 1.155 ± 0.029
2.962AsnGly: 2.962 ± 0.053
0.606AsnHis: 0.606 ± 0.019
1.88AsnIle: 1.88 ± 0.036
1.022AsnLys: 1.022 ± 0.027
2.977AsnLeu: 2.977 ± 0.042
0.814AsnMet: 0.814 ± 0.023
0.97AsnAsn: 0.97 ± 0.03
1.993AsnPro: 1.993 ± 0.038
0.958AsnGln: 0.958 ± 0.026
2.073AsnArg: 2.073 ± 0.039
1.685AsnSer: 1.685 ± 0.04
1.511AsnThr: 1.511 ± 0.038
2.28AsnVal: 2.28 ± 0.042
0.528AsnTrp: 0.528 ± 0.018
0.831AsnTyr: 0.831 ± 0.021
0.0AsnXaa: 0.0 ± 0.0
Pro
5.051ProAla: 5.051 ± 0.07
0.28ProCys: 0.28 ± 0.014
3.316ProAsp: 3.316 ± 0.048
3.55ProGlu: 3.55 ± 0.053
2.028ProPhe: 2.028 ± 0.04
3.693ProGly: 3.693 ± 0.056
1.027ProHis: 1.027 ± 0.026
2.423ProIle: 2.423 ± 0.04
1.907ProLys: 1.907 ± 0.042
4.391ProLeu: 4.391 ± 0.066
1.099ProMet: 1.099 ± 0.026
1.434ProAsn: 1.434 ± 0.029
1.86ProPro: 1.86 ± 0.044
1.748ProGln: 1.748 ± 0.038
2.399ProArg: 2.399 ± 0.045
2.789ProSer: 2.789 ± 0.046
2.291ProThr: 2.291 ± 0.044
4.186ProVal: 4.186 ± 0.056
0.581ProTrp: 0.581 ± 0.018
1.222ProTyr: 1.222 ± 0.028
0.0ProXaa: 0.0 ± 0.0
Gln
3.697GlnAla: 3.697 ± 0.055
0.198GlnCys: 0.198 ± 0.012
1.523GlnAsp: 1.523 ± 0.033
1.631GlnGlu: 1.631 ± 0.036
1.188GlnPhe: 1.188 ± 0.03
2.231GlnGly: 2.231 ± 0.041
0.684GlnHis: 0.684 ± 0.021
2.042GlnIle: 2.042 ± 0.036
1.6GlnLys: 1.6 ± 0.033
3.036GlnLeu: 3.036 ± 0.044
0.994GlnMet: 0.994 ± 0.028
1.163GlnAsn: 1.163 ± 0.028
1.654GlnPro: 1.654 ± 0.033
1.311GlnGln: 1.311 ± 0.035
2.31GlnArg: 2.31 ± 0.037
2.054GlnSer: 2.054 ± 0.035
1.829GlnThr: 1.829 ± 0.039
2.09GlnVal: 2.09 ± 0.043
0.429GlnTrp: 0.429 ± 0.02
0.708GlnTyr: 0.708 ± 0.022
0.0GlnXaa: 0.0 ± 0.0
Arg
6.25ArgAla: 6.25 ± 0.064
0.41ArgCys: 0.41 ± 0.018
3.678ArgAsp: 3.678 ± 0.054
4.053ArgGlu: 4.053 ± 0.066
2.914ArgPhe: 2.914 ± 0.047
4.234ArgGly: 4.234 ± 0.053
1.46ArgHis: 1.46 ± 0.03
4.119ArgIle: 4.119 ± 0.054
2.839ArgLys: 2.839 ± 0.042
6.96ArgLeu: 6.96 ± 0.082
1.761ArgMet: 1.761 ± 0.038
2.173ArgAsn: 2.173 ± 0.039
2.928ArgPro: 2.928 ± 0.051
2.502ArgGln: 2.502 ± 0.043
4.7ArgArg: 4.7 ± 0.067
3.69ArgSer: 3.69 ± 0.046
3.05ArgThr: 3.05 ± 0.048
4.192ArgVal: 4.192 ± 0.056
0.849ArgTrp: 0.849 ± 0.023
1.704ArgTyr: 1.704 ± 0.034
0.0ArgXaa: 0.0 ± 0.0
Ser
6.241SerAla: 6.241 ± 0.081
0.43SerCys: 0.43 ± 0.017
3.247SerAsp: 3.247 ± 0.051
3.159SerGlu: 3.159 ± 0.052
2.701SerPhe: 2.701 ± 0.038
5.898SerGly: 5.898 ± 0.094
1.165SerHis: 1.165 ± 0.031
3.374SerIle: 3.374 ± 0.049
2.102SerLys: 2.102 ± 0.044
5.868SerLeu: 5.868 ± 0.063
1.475SerMet: 1.475 ± 0.033
1.699SerAsn: 1.699 ± 0.035
2.676SerPro: 2.676 ± 0.044
1.847SerGln: 1.847 ± 0.035
3.578SerArg: 3.578 ± 0.055
3.404SerSer: 3.404 ± 0.054
2.949SerThr: 2.949 ± 0.049
4.299SerVal: 4.299 ± 0.048
0.806SerTrp: 0.806 ± 0.023
1.448SerTyr: 1.448 ± 0.03
0.0SerXaa: 0.0 ± 0.0
Thr
5.862ThrAla: 5.862 ± 0.063
0.403ThrCys: 0.403 ± 0.017
2.877ThrAsp: 2.877 ± 0.046
2.858ThrGlu: 2.858 ± 0.036
2.062ThrPhe: 2.062 ± 0.041
5.121ThrGly: 5.121 ± 0.099
1.008ThrHis: 1.008 ± 0.028
3.33ThrIle: 3.33 ± 0.053
1.836ThrLys: 1.836 ± 0.031
5.657ThrLeu: 5.657 ± 0.086
1.271ThrMet: 1.271 ± 0.029
1.495ThrAsn: 1.495 ± 0.039
2.946ThrPro: 2.946 ± 0.049
1.446ThrGln: 1.446 ± 0.032
2.991ThrArg: 2.991 ± 0.048
2.997ThrSer: 2.997 ± 0.05
2.91ThrThr: 2.91 ± 0.058
4.512ThrVal: 4.512 ± 0.058
0.584ThrTrp: 0.584 ± 0.021
1.31ThrTyr: 1.31 ± 0.034
0.0ThrXaa: 0.0 ± 0.0
Val
8.287ValAla: 8.287 ± 0.082
0.561ValCys: 0.561 ± 0.022
3.996ValAsp: 3.996 ± 0.055
4.572ValGlu: 4.572 ± 0.06
2.916ValPhe: 2.916 ± 0.054
5.246ValGly: 5.246 ± 0.056
1.309ValHis: 1.309 ± 0.03
4.377ValIle: 4.377 ± 0.061
2.791ValLys: 2.791 ± 0.05
7.334ValLeu: 7.334 ± 0.085
1.892ValMet: 1.892 ± 0.033
2.197ValAsn: 2.197 ± 0.048
3.435ValPro: 3.435 ± 0.05
2.083ValGln: 2.083 ± 0.039
4.532ValArg: 4.532 ± 0.058
4.759ValSer: 4.759 ± 0.057
4.436ValThr: 4.436 ± 0.058
5.542ValVal: 5.542 ± 0.077
0.859ValTrp: 0.859 ± 0.022
1.581ValTyr: 1.581 ± 0.031
0.0ValXaa: 0.0 ± 0.0
Trp
1.073TrpAla: 1.073 ± 0.027
0.118TrpCys: 0.118 ± 0.009
0.639TrpAsp: 0.639 ± 0.021
0.52TrpGlu: 0.52 ± 0.019
0.544TrpPhe: 0.544 ± 0.022
0.839TrpGly: 0.839 ± 0.026
0.294TrpHis: 0.294 ± 0.013
0.724TrpIle: 0.724 ± 0.025
0.589TrpLys: 0.589 ± 0.019
1.605TrpLeu: 1.605 ± 0.034
0.341TrpMet: 0.341 ± 0.016
0.494TrpAsn: 0.494 ± 0.018
0.625TrpPro: 0.625 ± 0.022
0.588TrpGln: 0.588 ± 0.019
0.954TrpArg: 0.954 ± 0.026
0.824TrpSer: 0.824 ± 0.026
0.712TrpThr: 0.712 ± 0.023
0.755TrpVal: 0.755 ± 0.024
0.217TrpTrp: 0.217 ± 0.014
0.298TrpTyr: 0.298 ± 0.012
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.455TyrAla: 2.455 ± 0.035
0.245TyrCys: 0.245 ± 0.013
1.517TyrAsp: 1.517 ± 0.037
1.319TyrGlu: 1.319 ± 0.03
1.038TyrPhe: 1.038 ± 0.029
2.18TyrGly: 2.18 ± 0.043
0.469TyrHis: 0.469 ± 0.018
1.123TyrIle: 1.123 ± 0.031
0.806TyrLys: 0.806 ± 0.024
2.365TyrLeu: 2.365 ± 0.04
0.544TyrMet: 0.544 ± 0.021
0.764TyrAsn: 0.764 ± 0.025
1.127TyrPro: 1.127 ± 0.031
0.85TyrGln: 0.85 ± 0.029
1.733TyrArg: 1.733 ± 0.031
1.423TyrSer: 1.423 ± 0.032
1.196TyrThr: 1.196 ± 0.032
1.715TyrVal: 1.715 ± 0.032
0.371TyrTrp: 0.371 ± 0.019
0.687TyrTyr: 0.687 ± 0.021
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4757 proteins (1495400 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski