Amino acid dipepetide frequency for Chryseobacterium sp. 3008163

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.9AlaAla: 3.9 ± 0.09
0.479AlaCys: 0.479 ± 0.023
3.334AlaAsp: 3.334 ± 0.056
4.12AlaGlu: 4.12 ± 0.073
2.997AlaPhe: 2.997 ± 0.057
4.291AlaGly: 4.291 ± 0.077
0.978AlaHis: 0.978 ± 0.031
4.643AlaIle: 4.643 ± 0.077
4.789AlaLys: 4.789 ± 0.076
5.473AlaLeu: 5.473 ± 0.077
1.472AlaMet: 1.472 ± 0.039
3.427AlaAsn: 3.427 ± 0.067
1.707AlaPro: 1.707 ± 0.048
2.471AlaGln: 2.471 ± 0.05
1.674AlaArg: 1.674 ± 0.037
3.94AlaSer: 3.94 ± 0.073
3.54AlaThr: 3.54 ± 0.075
4.023AlaVal: 4.023 ± 0.069
0.592AlaTrp: 0.592 ± 0.023
2.187AlaTyr: 2.187 ± 0.043
0.0AlaXaa: 0.0 ± 0.0
Cys
0.407CysAla: 0.407 ± 0.021
0.098CysCys: 0.098 ± 0.009
0.376CysAsp: 0.376 ± 0.019
0.427CysGlu: 0.427 ± 0.019
0.446CysPhe: 0.446 ± 0.022
0.615CysGly: 0.615 ± 0.024
0.166CysHis: 0.166 ± 0.014
0.581CysIle: 0.581 ± 0.022
0.476CysLys: 0.476 ± 0.019
0.641CysLeu: 0.641 ± 0.022
0.166CysMet: 0.166 ± 0.013
0.424CysAsn: 0.424 ± 0.018
0.312CysPro: 0.312 ± 0.021
0.22CysGln: 0.22 ± 0.014
0.195CysArg: 0.195 ± 0.013
0.534CysSer: 0.534 ± 0.022
0.429CysThr: 0.429 ± 0.02
0.414CysVal: 0.414 ± 0.019
0.063CysTrp: 0.063 ± 0.007
0.299CysTyr: 0.299 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
3.306AspAla: 3.306 ± 0.057
0.387AspCys: 0.387 ± 0.021
2.791AspAsp: 2.791 ± 0.05
3.984AspGlu: 3.984 ± 0.07
3.963AspPhe: 3.963 ± 0.053
3.521AspGly: 3.521 ± 0.06
0.915AspHis: 0.915 ± 0.032
4.169AspIle: 4.169 ± 0.064
4.396AspLys: 4.396 ± 0.067
5.126AspLeu: 5.126 ± 0.083
1.048AspMet: 1.048 ± 0.03
3.008AspAsn: 3.008 ± 0.059
1.597AspPro: 1.597 ± 0.042
1.777AspGln: 1.777 ± 0.038
1.721AspArg: 1.721 ± 0.044
3.16AspSer: 3.16 ± 0.055
2.238AspThr: 2.238 ± 0.042
3.322AspVal: 3.322 ± 0.062
0.774AspTrp: 0.774 ± 0.024
2.834AspTyr: 2.834 ± 0.05
0.0AspXaa: 0.0 ± 0.0
Glu
3.668GluAla: 3.668 ± 0.077
0.363GluCys: 0.363 ± 0.018
3.453GluAsp: 3.453 ± 0.059
4.718GluGlu: 4.718 ± 0.092
3.284GluPhe: 3.284 ± 0.057
3.33GluGly: 3.33 ± 0.063
0.966GluHis: 0.966 ± 0.031
6.141GluIle: 6.141 ± 0.104
7.025GluLys: 7.025 ± 0.107
5.614GluLeu: 5.614 ± 0.08
1.72GluMet: 1.72 ± 0.039
5.675GluAsn: 5.675 ± 0.083
1.345GluPro: 1.345 ± 0.034
2.095GluGln: 2.095 ± 0.043
2.239GluArg: 2.239 ± 0.048
3.472GluSer: 3.472 ± 0.059
3.419GluThr: 3.419 ± 0.058
3.902GluVal: 3.902 ± 0.063
0.662GluTrp: 0.662 ± 0.024
2.512GluTyr: 2.512 ± 0.046
0.0GluXaa: 0.0 ± 0.0
Phe
3.373PheAla: 3.373 ± 0.058
0.47PheCys: 0.47 ± 0.022
3.206PheAsp: 3.206 ± 0.046
3.431PheGlu: 3.431 ± 0.07
2.988PhePhe: 2.988 ± 0.054
3.722PheGly: 3.722 ± 0.063
0.955PheHis: 0.955 ± 0.031
4.221PheIle: 4.221 ± 0.074
3.989PheLys: 3.989 ± 0.072
5.025PheLeu: 5.025 ± 0.079
1.186PheMet: 1.186 ± 0.034
3.365PheAsn: 3.365 ± 0.057
1.894PhePro: 1.894 ± 0.04
1.719PheGln: 1.719 ± 0.043
1.725PheArg: 1.725 ± 0.045
4.437PheSer: 4.437 ± 0.064
3.312PheThr: 3.312 ± 0.064
3.092PheVal: 3.092 ± 0.058
0.616PheTrp: 0.616 ± 0.026
2.47PheTyr: 2.47 ± 0.044
0.0PheXaa: 0.0 ± 0.0
Gly
3.79GlyAla: 3.79 ± 0.067
0.547GlyCys: 0.547 ± 0.023
2.958GlyAsp: 2.958 ± 0.055
3.459GlyGlu: 3.459 ± 0.061
3.651GlyPhe: 3.651 ± 0.062
4.476GlyGly: 4.476 ± 0.084
0.962GlyHis: 0.962 ± 0.032
5.36GlyIle: 5.36 ± 0.071
5.536GlyLys: 5.536 ± 0.086
5.276GlyLeu: 5.276 ± 0.075
1.558GlyMet: 1.558 ± 0.041
4.265GlyAsn: 4.265 ± 0.081
1.142GlyPro: 1.142 ± 0.037
1.908GlyGln: 1.908 ± 0.045
1.942GlyArg: 1.942 ± 0.045
4.069GlySer: 4.069 ± 0.073
4.005GlyThr: 4.005 ± 0.085
3.915GlyVal: 3.915 ± 0.064
0.756GlyTrp: 0.756 ± 0.029
2.75GlyTyr: 2.75 ± 0.057
0.0GlyXaa: 0.0 ± 0.0
His
0.839HisAla: 0.839 ± 0.032
0.139HisCys: 0.139 ± 0.009
0.808HisAsp: 0.808 ± 0.028
0.951HisGlu: 0.951 ± 0.033
1.173HisPhe: 1.173 ± 0.035
0.961HisGly: 0.961 ± 0.031
0.462HisHis: 0.462 ± 0.022
1.227HisIle: 1.227 ± 0.034
1.106HisLys: 1.106 ± 0.035
1.696HisLeu: 1.696 ± 0.048
0.302HisMet: 0.302 ± 0.017
0.88HisAsn: 0.88 ± 0.027
0.79HisPro: 0.79 ± 0.031
0.753HisGln: 0.753 ± 0.025
0.576HisArg: 0.576 ± 0.022
1.124HisSer: 1.124 ± 0.034
0.842HisThr: 0.842 ± 0.026
0.731HisVal: 0.731 ± 0.025
0.216HisTrp: 0.216 ± 0.014
0.79HisTyr: 0.79 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
5.14IleAla: 5.14 ± 0.077
0.687IleCys: 0.687 ± 0.028
4.66IleAsp: 4.66 ± 0.071
5.162IleGlu: 5.162 ± 0.078
4.25IlePhe: 4.25 ± 0.066
4.863IleGly: 4.863 ± 0.08
1.299IleHis: 1.299 ± 0.036
6.486IleIle: 6.486 ± 0.097
6.097IleLys: 6.097 ± 0.084
7.626IleLeu: 7.626 ± 0.102
1.436IleMet: 1.436 ± 0.038
5.055IleAsn: 5.055 ± 0.072
3.333IlePro: 3.333 ± 0.059
2.715IleGln: 2.715 ± 0.055
2.307IleArg: 2.307 ± 0.05
6.311IleSer: 6.311 ± 0.067
4.568IleThr: 4.568 ± 0.075
4.54IleVal: 4.54 ± 0.068
0.717IleTrp: 0.717 ± 0.026
3.104IleTyr: 3.104 ± 0.062
0.0IleXaa: 0.0 ± 0.0
Lys
4.696LysAla: 4.696 ± 0.068
0.376LysCys: 0.376 ± 0.022
5.124LysAsp: 5.124 ± 0.074
6.162LysGlu: 6.162 ± 0.09
3.718LysPhe: 3.718 ± 0.064
4.381LysGly: 4.381 ± 0.062
1.233LysHis: 1.233 ± 0.04
7.698LysIle: 7.698 ± 0.087
7.71LysLys: 7.71 ± 0.097
6.756LysLeu: 6.756 ± 0.073
2.507LysMet: 2.507 ± 0.041
6.768LysAsn: 6.768 ± 0.088
2.606LysPro: 2.606 ± 0.053
2.554LysGln: 2.554 ± 0.044
2.438LysArg: 2.438 ± 0.046
5.135LysSer: 5.135 ± 0.079
5.039LysThr: 5.039 ± 0.073
4.836LysVal: 4.836 ± 0.064
0.817LysTrp: 0.817 ± 0.026
3.482LysTyr: 3.482 ± 0.058
0.0LysXaa: 0.0 ± 0.0
Leu
5.303LeuAla: 5.303 ± 0.074
0.674LeuCys: 0.674 ± 0.022
4.721LeuAsp: 4.721 ± 0.058
5.699LeuGlu: 5.699 ± 0.095
4.772LeuPhe: 4.772 ± 0.084
5.626LeuGly: 5.626 ± 0.082
1.401LeuHis: 1.401 ± 0.038
6.797LeuIle: 6.797 ± 0.088
8.174LeuLys: 8.174 ± 0.098
8.065LeuLeu: 8.065 ± 0.113
2.256LeuMet: 2.256 ± 0.041
5.921LeuAsn: 5.921 ± 0.087
3.528LeuPro: 3.528 ± 0.056
3.349LeuGln: 3.349 ± 0.056
2.829LeuArg: 2.829 ± 0.05
6.721LeuSer: 6.721 ± 0.079
4.859LeuThr: 4.859 ± 0.071
4.989LeuVal: 4.989 ± 0.076
0.838LeuTrp: 0.838 ± 0.027
3.137LeuTyr: 3.137 ± 0.048
0.0LeuXaa: 0.0 ± 0.0
Met
1.398MetAla: 1.398 ± 0.038
0.137MetCys: 0.137 ± 0.012
1.065MetAsp: 1.065 ± 0.03
1.482MetGlu: 1.482 ± 0.038
0.937MetPhe: 0.937 ± 0.032
1.388MetGly: 1.388 ± 0.039
0.365MetHis: 0.365 ± 0.017
1.857MetIle: 1.857 ± 0.048
2.656MetLys: 2.656 ± 0.042
1.96MetLeu: 1.96 ± 0.044
0.761MetMet: 0.761 ± 0.028
1.626MetAsn: 1.626 ± 0.031
0.833MetPro: 0.833 ± 0.029
0.828MetGln: 0.828 ± 0.026
0.793MetArg: 0.793 ± 0.031
1.53MetSer: 1.53 ± 0.037
1.301MetThr: 1.301 ± 0.038
1.322MetVal: 1.322 ± 0.037
0.159MetTrp: 0.159 ± 0.01
0.751MetTyr: 0.751 ± 0.025
0.0MetXaa: 0.0 ± 0.0
Asn
3.994AsnAla: 3.994 ± 0.064
0.442AsnCys: 0.442 ± 0.022
3.346AsnAsp: 3.346 ± 0.07
4.004AsnGlu: 4.004 ± 0.068
3.822AsnPhe: 3.822 ± 0.061
4.306AsnGly: 4.306 ± 0.082
1.164AsnHis: 1.164 ± 0.034
5.54AsnIle: 5.54 ± 0.067
4.608AsnLys: 4.608 ± 0.066
6.203AsnLeu: 6.203 ± 0.085
1.36AsnMet: 1.36 ± 0.036
4.429AsnAsn: 4.429 ± 0.088
3.025AsnPro: 3.025 ± 0.058
2.647AsnGln: 2.647 ± 0.045
1.985AsnArg: 1.985 ± 0.043
4.456AsnSer: 4.456 ± 0.085
3.581AsnThr: 3.581 ± 0.087
3.889AsnVal: 3.889 ± 0.07
0.804AsnTrp: 0.804 ± 0.028
3.161AsnTyr: 3.161 ± 0.069
0.0AsnXaa: 0.0 ± 0.0
Pro
2.087ProAla: 2.087 ± 0.052
0.201ProCys: 0.201 ± 0.013
1.971ProAsp: 1.971 ± 0.044
2.728ProGlu: 2.728 ± 0.055
1.821ProPhe: 1.821 ± 0.042
1.801ProGly: 1.801 ± 0.046
0.545ProHis: 0.545 ± 0.02
2.454ProIle: 2.454 ± 0.05
2.602ProLys: 2.602 ± 0.053
2.764ProLeu: 2.764 ± 0.047
0.773ProMet: 0.773 ± 0.032
2.308ProAsn: 2.308 ± 0.046
0.827ProPro: 0.827 ± 0.036
1.252ProGln: 1.252 ± 0.037
0.846ProArg: 0.846 ± 0.028
2.203ProSer: 2.203 ± 0.044
2.117ProThr: 2.117 ± 0.053
2.342ProVal: 2.342 ± 0.05
0.288ProTrp: 0.288 ± 0.017
1.359ProTyr: 1.359 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
1.8GlnAla: 1.8 ± 0.04
0.185GlnCys: 0.185 ± 0.012
1.626GlnAsp: 1.626 ± 0.036
2.135GlnGlu: 2.135 ± 0.052
1.892GlnPhe: 1.892 ± 0.041
1.742GlnGly: 1.742 ± 0.043
0.594GlnHis: 0.594 ± 0.022
2.96GlnIle: 2.96 ± 0.049
3.463GlnLys: 3.463 ± 0.064
3.247GlnLeu: 3.247 ± 0.051
0.904GlnMet: 0.904 ± 0.028
2.915GlnAsn: 2.915 ± 0.057
1.118GlnPro: 1.118 ± 0.032
1.515GlnGln: 1.515 ± 0.041
1.166GlnArg: 1.166 ± 0.032
2.248GlnSer: 2.248 ± 0.048
2.055GlnThr: 2.055 ± 0.048
1.841GlnVal: 1.841 ± 0.042
0.376GlnTrp: 0.376 ± 0.021
1.521GlnTyr: 1.521 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
1.656ArgAla: 1.656 ± 0.044
0.189ArgCys: 0.189 ± 0.013
1.57ArgAsp: 1.57 ± 0.041
2.101ArgGlu: 2.101 ± 0.048
1.713ArgPhe: 1.713 ± 0.036
1.644ArgGly: 1.644 ± 0.04
0.486ArgHis: 0.486 ± 0.02
2.688ArgIle: 2.688 ± 0.048
2.921ArgLys: 2.921 ± 0.054
2.717ArgLeu: 2.717 ± 0.051
0.846ArgMet: 0.846 ± 0.022
2.169ArgAsn: 2.169 ± 0.048
0.989ArgPro: 0.989 ± 0.028
1.029ArgGln: 1.029 ± 0.035
1.205ArgArg: 1.205 ± 0.038
1.789ArgSer: 1.789 ± 0.042
1.602ArgThr: 1.602 ± 0.043
1.772ArgVal: 1.772 ± 0.036
0.327ArgTrp: 0.327 ± 0.018
1.276ArgTyr: 1.276 ± 0.036
0.0ArgXaa: 0.0 ± 0.0
Ser
4.135SerAla: 4.135 ± 0.063
0.642SerCys: 0.642 ± 0.022
3.7SerAsp: 3.7 ± 0.06
4.526SerGlu: 4.526 ± 0.061
4.009SerPhe: 4.009 ± 0.068
4.816SerGly: 4.816 ± 0.066
1.085SerHis: 1.085 ± 0.03
5.061SerIle: 5.061 ± 0.067
5.31SerLys: 5.31 ± 0.074
6.113SerLeu: 6.113 ± 0.083
1.409SerMet: 1.409 ± 0.034
3.951SerAsn: 3.951 ± 0.077
2.174SerPro: 2.174 ± 0.045
2.541SerGln: 2.541 ± 0.052
1.917SerArg: 1.917 ± 0.043
4.58SerSer: 4.58 ± 0.091
3.783SerThr: 3.783 ± 0.075
4.301SerVal: 4.301 ± 0.06
0.72SerTrp: 0.72 ± 0.029
2.873SerTyr: 2.873 ± 0.06
0.0SerXaa: 0.0 ± 0.0
Thr
3.951ThrAla: 3.951 ± 0.082
0.345ThrCys: 0.345 ± 0.019
3.168ThrAsp: 3.168 ± 0.048
3.679ThrGlu: 3.679 ± 0.062
3.012ThrPhe: 3.012 ± 0.052
4.013ThrGly: 4.013 ± 0.079
0.894ThrHis: 0.894 ± 0.026
4.251ThrIle: 4.251 ± 0.069
4.132ThrLys: 4.132 ± 0.062
4.977ThrLeu: 4.977 ± 0.068
0.982ThrMet: 0.982 ± 0.032
3.358ThrAsn: 3.358 ± 0.07
2.309ThrPro: 2.309 ± 0.05
1.935ThrGln: 1.935 ± 0.047
1.415ThrArg: 1.415 ± 0.033
3.85ThrSer: 3.85 ± 0.078
3.605ThrThr: 3.605 ± 0.094
3.724ThrVal: 3.724 ± 0.076
0.589ThrTrp: 0.589 ± 0.026
2.345ThrTyr: 2.345 ± 0.057
0.0ThrXaa: 0.0 ± 0.0
Val
3.745ValAla: 3.745 ± 0.065
0.493ValCys: 0.493 ± 0.022
3.165ValAsp: 3.165 ± 0.058
3.785ValGlu: 3.785 ± 0.063
3.412ValPhe: 3.412 ± 0.062
3.616ValGly: 3.616 ± 0.065
0.869ValHis: 0.869 ± 0.029
4.561ValIle: 4.561 ± 0.074
4.921ValLys: 4.921 ± 0.075
5.444ValLeu: 5.444 ± 0.066
1.384ValMet: 1.384 ± 0.037
3.704ValAsn: 3.704 ± 0.074
2.044ValPro: 2.044 ± 0.043
1.835ValGln: 1.835 ± 0.044
1.853ValArg: 1.853 ± 0.038
4.456ValSer: 4.456 ± 0.063
3.298ValThr: 3.298 ± 0.069
3.73ValVal: 3.73 ± 0.07
0.633ValTrp: 0.633 ± 0.024
2.38ValTyr: 2.38 ± 0.051
0.0ValXaa: 0.0 ± 0.0
Trp
0.591TrpAla: 0.591 ± 0.02
0.082TrpCys: 0.082 ± 0.007
0.618TrpAsp: 0.618 ± 0.024
0.686TrpGlu: 0.686 ± 0.023
0.57TrpPhe: 0.57 ± 0.024
0.698TrpGly: 0.698 ± 0.031
0.191TrpHis: 0.191 ± 0.013
0.764TrpIle: 0.764 ± 0.028
1.023TrpLys: 1.023 ± 0.035
0.933TrpLeu: 0.933 ± 0.029
0.35TrpMet: 0.35 ± 0.019
0.76TrpAsn: 0.76 ± 0.028
0.183TrpPro: 0.183 ± 0.014
0.409TrpGln: 0.409 ± 0.02
0.339TrpArg: 0.339 ± 0.016
0.657TrpSer: 0.657 ± 0.026
0.583TrpThr: 0.583 ± 0.025
0.584TrpVal: 0.584 ± 0.022
0.153TrpTrp: 0.153 ± 0.011
0.431TrpTyr: 0.431 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.26TyrAla: 2.26 ± 0.048
0.352TyrCys: 0.352 ± 0.017
2.373TyrAsp: 2.373 ± 0.053
2.364TyrGlu: 2.364 ± 0.05
2.767TyrPhe: 2.767 ± 0.054
2.517TyrGly: 2.517 ± 0.051
0.837TyrHis: 0.837 ± 0.028
2.898TyrIle: 2.898 ± 0.059
3.108TyrLys: 3.108 ± 0.057
4.001TyrLeu: 4.001 ± 0.06
0.731TyrMet: 0.731 ± 0.021
2.758TyrAsn: 2.758 ± 0.06
1.533TyrPro: 1.533 ± 0.038
1.752TyrGln: 1.752 ± 0.037
1.522TyrArg: 1.522 ± 0.037
2.972TyrSer: 2.972 ± 0.054
2.356TyrThr: 2.356 ± 0.058
2.057TyrVal: 2.057 ± 0.04
0.505TyrTrp: 0.505 ± 0.021
2.121TyrTyr: 2.121 ± 0.055
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3789 proteins (1171329 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski