Amino acid dipepetide frequency for Oscillibacter valericigenes (strain DSM 18026 / NBRC 101213 / Sjm18-20)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.507AlaAla: 11.507 ± 0.132
1.501AlaCys: 1.501 ± 0.038
5.272AlaAsp: 5.272 ± 0.077
6.506AlaGlu: 6.506 ± 0.092
3.429AlaPhe: 3.429 ± 0.056
7.335AlaGly: 7.335 ± 0.074
1.404AlaHis: 1.404 ± 0.037
5.011AlaIle: 5.011 ± 0.073
4.479AlaLys: 4.479 ± 0.075
9.338AlaLeu: 9.338 ± 0.126
2.837AlaMet: 2.837 ± 0.048
2.734AlaAsn: 2.734 ± 0.047
3.191AlaPro: 3.191 ± 0.058
3.365AlaGln: 3.365 ± 0.058
4.227AlaArg: 4.227 ± 0.065
4.809AlaSer: 4.809 ± 0.069
4.017AlaThr: 4.017 ± 0.079
7.975AlaVal: 7.975 ± 0.092
0.811AlaTrp: 0.811 ± 0.029
2.789AlaTyr: 2.789 ± 0.049
0.0AlaXaa: 0.0 ± 0.0
Cys
1.528CysAla: 1.528 ± 0.039
0.383CysCys: 0.383 ± 0.018
0.953CysAsp: 0.953 ± 0.03
0.858CysGlu: 0.858 ± 0.029
0.72CysPhe: 0.72 ± 0.025
1.887CysGly: 1.887 ± 0.045
0.394CysHis: 0.394 ± 0.018
1.037CysIle: 1.037 ± 0.031
0.795CysLys: 0.795 ± 0.027
1.39CysLeu: 1.39 ± 0.033
0.473CysMet: 0.473 ± 0.022
0.546CysAsn: 0.546 ± 0.022
0.824CysPro: 0.824 ± 0.029
0.523CysGln: 0.523 ± 0.021
1.062CysArg: 1.062 ± 0.033
0.948CysSer: 0.948 ± 0.029
0.942CysThr: 0.942 ± 0.031
1.195CysVal: 1.195 ± 0.03
0.15CysTrp: 0.15 ± 0.01
0.588CysTyr: 0.588 ± 0.022
0.0CysXaa: 0.0 ± 0.0
Asp
5.178AspAla: 5.178 ± 0.079
0.993AspCys: 0.993 ± 0.028
2.754AspAsp: 2.754 ± 0.056
3.773AspGlu: 3.773 ± 0.06
2.523AspPhe: 2.523 ± 0.044
4.972AspGly: 4.972 ± 0.084
0.997AspHis: 0.997 ± 0.027
3.401AspIle: 3.401 ± 0.057
2.603AspLys: 2.603 ± 0.048
4.728AspLeu: 4.728 ± 0.064
1.551AspMet: 1.551 ± 0.037
1.919AspAsn: 1.919 ± 0.049
2.151AspPro: 2.151 ± 0.047
1.494AspGln: 1.494 ± 0.038
3.006AspArg: 3.006 ± 0.058
3.118AspSer: 3.118 ± 0.057
3.155AspThr: 3.155 ± 0.059
3.771AspVal: 3.771 ± 0.063
0.744AspTrp: 0.744 ± 0.027
2.196AspTyr: 2.196 ± 0.044
0.0AspXaa: 0.0 ± 0.0
Glu
5.393GluAla: 5.393 ± 0.072
0.804GluCys: 0.804 ± 0.029
3.378GluAsp: 3.378 ± 0.055
4.501GluGlu: 4.501 ± 0.069
2.216GluPhe: 2.216 ± 0.044
3.729GluGly: 3.729 ± 0.057
1.235GluHis: 1.235 ± 0.036
4.355GluIle: 4.355 ± 0.058
4.846GluLys: 4.846 ± 0.075
6.46GluLeu: 6.46 ± 0.072
1.883GluMet: 1.883 ± 0.041
3.329GluAsn: 3.329 ± 0.056
2.053GluPro: 2.053 ± 0.046
2.736GluGln: 2.736 ± 0.058
3.699GluArg: 3.699 ± 0.065
3.321GluSer: 3.321 ± 0.054
3.689GluThr: 3.689 ± 0.06
3.653GluVal: 3.653 ± 0.053
0.575GluTrp: 0.575 ± 0.02
2.223GluTyr: 2.223 ± 0.042
0.0GluXaa: 0.0 ± 0.0
Phe
3.417PheAla: 3.417 ± 0.05
0.84PheCys: 0.84 ± 0.028
2.31PheAsp: 2.31 ± 0.042
2.088PheGlu: 2.088 ± 0.048
1.803PhePhe: 1.803 ± 0.041
3.095PheGly: 3.095 ± 0.057
0.808PheHis: 0.808 ± 0.026
2.147PheIle: 2.147 ± 0.051
1.562PheLys: 1.562 ± 0.04
3.981PheLeu: 3.981 ± 0.062
0.922PheMet: 0.922 ± 0.033
1.408PheAsn: 1.408 ± 0.036
1.635PhePro: 1.635 ± 0.037
1.39PheGln: 1.39 ± 0.031
2.041PheArg: 2.041 ± 0.043
3.101PheSer: 3.101 ± 0.06
2.495PheThr: 2.495 ± 0.042
2.458PheVal: 2.458 ± 0.049
0.444PheTrp: 0.444 ± 0.023
1.454PheTyr: 1.454 ± 0.03
0.0PheXaa: 0.0 ± 0.0
Gly
6.47GlyAla: 6.47 ± 0.094
1.412GlyCys: 1.412 ± 0.035
3.905GlyAsp: 3.905 ± 0.059
4.634GlyGlu: 4.634 ± 0.077
3.129GlyPhe: 3.129 ± 0.055
6.346GlyGly: 6.346 ± 0.103
1.323GlyHis: 1.323 ± 0.036
5.166GlyIle: 5.166 ± 0.071
5.008GlyLys: 5.008 ± 0.065
6.726GlyLeu: 6.726 ± 0.087
2.447GlyMet: 2.447 ± 0.049
2.826GlyAsn: 2.826 ± 0.047
1.668GlyPro: 1.668 ± 0.045
2.458GlyGln: 2.458 ± 0.049
3.89GlyArg: 3.89 ± 0.062
4.589GlySer: 4.589 ± 0.075
4.919GlyThr: 4.919 ± 0.076
5.639GlyVal: 5.639 ± 0.081
0.817GlyTrp: 0.817 ± 0.025
3.058GlyTyr: 3.058 ± 0.058
0.0GlyXaa: 0.0 ± 0.0
His
1.385HisAla: 1.385 ± 0.035
0.419HisCys: 0.419 ± 0.018
0.914HisAsp: 0.914 ± 0.028
0.918HisGlu: 0.918 ± 0.029
0.812HisPhe: 0.812 ± 0.026
1.386HisGly: 1.386 ± 0.034
0.456HisHis: 0.456 ± 0.026
1.285HisIle: 1.285 ± 0.034
0.759HisLys: 0.759 ± 0.023
1.625HisLeu: 1.625 ± 0.037
0.507HisMet: 0.507 ± 0.022
0.629HisAsn: 0.629 ± 0.024
1.002HisPro: 1.002 ± 0.033
0.531HisGln: 0.531 ± 0.023
1.06HisArg: 1.06 ± 0.035
1.105HisSer: 1.105 ± 0.03
1.065HisThr: 1.065 ± 0.031
1.111HisVal: 1.111 ± 0.027
0.211HisTrp: 0.211 ± 0.015
0.713HisTyr: 0.713 ± 0.024
0.0HisXaa: 0.0 ± 0.0
Ile
5.385IleAla: 5.385 ± 0.068
1.159IleCys: 1.159 ± 0.036
3.279IleAsp: 3.279 ± 0.052
3.303IleGlu: 3.303 ± 0.052
2.272IlePhe: 2.272 ± 0.052
4.383IleGly: 4.383 ± 0.074
1.161IleHis: 1.161 ± 0.033
3.292IleIle: 3.292 ± 0.068
2.587IleLys: 2.587 ± 0.042
6.122IleLeu: 6.122 ± 0.097
1.404IleMet: 1.404 ± 0.037
2.128IleAsn: 2.128 ± 0.044
2.892IlePro: 2.892 ± 0.046
2.044IleGln: 2.044 ± 0.038
3.555IleArg: 3.555 ± 0.056
4.149IleSer: 4.149 ± 0.063
3.62IleThr: 3.62 ± 0.067
4.057IleVal: 4.057 ± 0.061
0.584IleTrp: 0.584 ± 0.025
2.015IleTyr: 2.015 ± 0.04
0.0IleXaa: 0.0 ± 0.0
Lys
4.744LysAla: 4.744 ± 0.072
0.657LysCys: 0.657 ± 0.025
2.8LysAsp: 2.8 ± 0.054
3.818LysGlu: 3.818 ± 0.057
1.507LysPhe: 1.507 ± 0.039
3.272LysGly: 3.272 ± 0.046
0.78LysHis: 0.78 ± 0.029
3.358LysIle: 3.358 ± 0.05
4.079LysLys: 4.079 ± 0.07
5.159LysLeu: 5.159 ± 0.068
1.624LysMet: 1.624 ± 0.039
2.489LysAsn: 2.489 ± 0.047
2.131LysPro: 2.131 ± 0.053
1.971LysGln: 1.971 ± 0.044
3.055LysArg: 3.055 ± 0.051
3.185LysSer: 3.185 ± 0.053
3.466LysThr: 3.466 ± 0.058
3.121LysVal: 3.121 ± 0.061
0.555LysTrp: 0.555 ± 0.022
2.023LysTyr: 2.023 ± 0.044
0.0LysXaa: 0.0 ± 0.0
Leu
8.521LeuAla: 8.521 ± 0.101
1.985LeuCys: 1.985 ± 0.041
5.273LeuAsp: 5.273 ± 0.067
5.579LeuGlu: 5.579 ± 0.077
3.884LeuPhe: 3.884 ± 0.072
6.536LeuGly: 6.536 ± 0.091
1.813LeuHis: 1.813 ± 0.038
5.179LeuIle: 5.179 ± 0.077
5.194LeuLys: 5.194 ± 0.065
9.937LeuLeu: 9.937 ± 0.137
2.64LeuMet: 2.64 ± 0.049
3.56LeuAsn: 3.56 ± 0.058
4.496LeuPro: 4.496 ± 0.066
2.897LeuGln: 2.897 ± 0.054
5.636LeuArg: 5.636 ± 0.084
7.143LeuSer: 7.143 ± 0.09
6.064LeuThr: 6.064 ± 0.082
5.638LeuVal: 5.638 ± 0.079
0.971LeuTrp: 0.971 ± 0.029
3.131LeuTyr: 3.131 ± 0.057
0.0LeuXaa: 0.0 ± 0.0
Met
2.635MetAla: 2.635 ± 0.058
0.355MetCys: 0.355 ± 0.016
1.833MetAsp: 1.833 ± 0.039
2.205MetGlu: 2.205 ± 0.046
0.865MetPhe: 0.865 ± 0.027
1.998MetGly: 1.998 ± 0.046
0.398MetHis: 0.398 ± 0.018
1.501MetIle: 1.501 ± 0.043
2.038MetLys: 2.038 ± 0.044
2.691MetLeu: 2.691 ± 0.058
0.808MetMet: 0.808 ± 0.026
1.279MetAsn: 1.279 ± 0.031
1.183MetPro: 1.183 ± 0.029
0.963MetGln: 0.963 ± 0.033
1.469MetArg: 1.469 ± 0.036
1.801MetSer: 1.801 ± 0.038
1.76MetThr: 1.76 ± 0.037
1.764MetVal: 1.764 ± 0.036
0.181MetTrp: 0.181 ± 0.012
0.692MetTyr: 0.692 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
3.334AsnAla: 3.334 ± 0.06
0.663AsnCys: 0.663 ± 0.027
1.703AsnAsp: 1.703 ± 0.038
2.049AsnGlu: 2.049 ± 0.038
1.372AsnPhe: 1.372 ± 0.034
3.326AsnGly: 3.326 ± 0.066
0.721AsnHis: 0.721 ± 0.024
2.481AsnIle: 2.481 ± 0.05
1.628AsnLys: 1.628 ± 0.035
3.656AsnLeu: 3.656 ± 0.056
1.049AsnMet: 1.049 ± 0.028
1.277AsnAsn: 1.277 ± 0.039
1.915AsnPro: 1.915 ± 0.045
1.26AsnGln: 1.26 ± 0.032
2.229AsnArg: 2.229 ± 0.042
2.171AsnSer: 2.171 ± 0.051
2.12AsnThr: 2.12 ± 0.047
2.53AsnVal: 2.53 ± 0.039
0.41AsnTrp: 0.41 ± 0.019
1.48AsnTyr: 1.48 ± 0.038
0.0AsnXaa: 0.0 ± 0.0
Pro
3.859ProAla: 3.859 ± 0.063
0.599ProCys: 0.599 ± 0.025
2.64ProAsp: 2.64 ± 0.04
3.537ProGlu: 3.537 ± 0.061
1.655ProPhe: 1.655 ± 0.037
2.936ProGly: 2.936 ± 0.054
0.72ProHis: 0.72 ± 0.023
2.07ProIle: 2.07 ± 0.038
1.85ProLys: 1.85 ± 0.044
3.235ProLeu: 3.235 ± 0.056
1.107ProMet: 1.107 ± 0.03
1.333ProAsn: 1.333 ± 0.034
1.425ProPro: 1.425 ± 0.043
1.366ProGln: 1.366 ± 0.031
1.659ProArg: 1.659 ± 0.04
2.269ProSer: 2.269 ± 0.051
2.005ProThr: 2.005 ± 0.049
3.319ProVal: 3.319 ± 0.054
0.432ProTrp: 0.432 ± 0.019
1.418ProTyr: 1.418 ± 0.035
0.0ProXaa: 0.0 ± 0.0
Gln
3.092GlnAla: 3.092 ± 0.056
0.467GlnCys: 0.467 ± 0.02
1.569GlnAsp: 1.569 ± 0.036
2.303GlnGlu: 2.303 ± 0.052
1.236GlnPhe: 1.236 ± 0.029
2.159GlnGly: 2.159 ± 0.048
0.592GlnHis: 0.592 ± 0.019
2.27GlnIle: 2.27 ± 0.049
2.276GlnLys: 2.276 ± 0.049
3.082GlnLeu: 3.082 ± 0.057
1.113GlnMet: 1.113 ± 0.031
1.563GlnAsn: 1.563 ± 0.04
1.195GlnPro: 1.195 ± 0.029
1.372GlnGln: 1.372 ± 0.037
1.968GlnArg: 1.968 ± 0.042
2.167GlnSer: 2.167 ± 0.044
2.011GlnThr: 2.011 ± 0.046
2.085GlnVal: 2.085 ± 0.038
0.359GlnTrp: 0.359 ± 0.017
1.304GlnTyr: 1.304 ± 0.031
0.0GlnXaa: 0.0 ± 0.0
Arg
4.288ArgAla: 4.288 ± 0.064
0.918ArgCys: 0.918 ± 0.027
2.876ArgAsp: 2.876 ± 0.052
4.162ArgGlu: 4.162 ± 0.066
2.241ArgPhe: 2.241 ± 0.054
3.421ArgGly: 3.421 ± 0.06
1.089ArgHis: 1.089 ± 0.029
3.319ArgIle: 3.319 ± 0.05
3.343ArgLys: 3.343 ± 0.05
5.224ArgLeu: 5.224 ± 0.074
1.716ArgMet: 1.716 ± 0.037
2.06ArgAsn: 2.06 ± 0.042
2.02ArgPro: 2.02 ± 0.049
2.252ArgGln: 2.252 ± 0.048
3.962ArgArg: 3.962 ± 0.072
2.82ArgSer: 2.82 ± 0.052
2.831ArgThr: 2.831 ± 0.048
3.299ArgVal: 3.299 ± 0.053
0.606ArgTrp: 0.606 ± 0.024
2.022ArgTyr: 2.022 ± 0.041
0.0ArgXaa: 0.0 ± 0.0
Ser
6.153SerAla: 6.153 ± 0.081
0.959SerCys: 0.959 ± 0.028
3.486SerAsp: 3.486 ± 0.058
3.575SerGlu: 3.575 ± 0.055
2.606SerPhe: 2.606 ± 0.046
6.06SerGly: 6.06 ± 0.082
1.032SerHis: 1.032 ± 0.031
3.657SerIle: 3.657 ± 0.061
2.751SerLys: 2.751 ± 0.058
5.513SerLeu: 5.513 ± 0.07
1.768SerMet: 1.768 ± 0.035
2.136SerAsn: 2.136 ± 0.049
2.304SerPro: 2.304 ± 0.044
1.915SerGln: 1.915 ± 0.039
3.177SerArg: 3.177 ± 0.058
4.092SerSer: 4.092 ± 0.085
3.276SerThr: 3.276 ± 0.066
4.65SerVal: 4.65 ± 0.073
0.602SerTrp: 0.602 ± 0.023
2.087SerTyr: 2.087 ± 0.044
0.0SerXaa: 0.0 ± 0.0
Thr
6.254ThrAla: 6.254 ± 0.085
0.791ThrCys: 0.791 ± 0.03
3.341ThrAsp: 3.341 ± 0.058
3.458ThrGlu: 3.458 ± 0.058
2.176ThrPhe: 2.176 ± 0.049
5.19ThrGly: 5.19 ± 0.092
0.982ThrHis: 0.982 ± 0.029
3.317ThrIle: 3.317 ± 0.063
2.31ThrLys: 2.31 ± 0.046
5.669ThrLeu: 5.669 ± 0.074
1.53ThrMet: 1.53 ± 0.038
1.754ThrAsn: 1.754 ± 0.045
2.711ThrPro: 2.711 ± 0.049
1.741ThrGln: 1.741 ± 0.042
2.396ThrArg: 2.396 ± 0.05
3.062ThrSer: 3.062 ± 0.053
2.994ThrThr: 2.994 ± 0.078
5.441ThrVal: 5.441 ± 0.083
0.537ThrTrp: 0.537 ± 0.022
1.971ThrTyr: 1.971 ± 0.053
0.0ThrXaa: 0.0 ± 0.0
Val
5.631ValAla: 5.631 ± 0.081
1.382ValCys: 1.382 ± 0.038
3.846ValAsp: 3.846 ± 0.057
4.131ValGlu: 4.131 ± 0.057
2.974ValPhe: 2.974 ± 0.058
4.832ValGly: 4.832 ± 0.068
1.056ValHis: 1.056 ± 0.027
4.015ValIle: 4.015 ± 0.069
3.54ValLys: 3.54 ± 0.052
7.111ValLeu: 7.111 ± 0.087
1.944ValMet: 1.944 ± 0.037
2.513ValAsn: 2.513 ± 0.045
2.963ValPro: 2.963 ± 0.057
2.211ValGln: 2.211 ± 0.038
3.63ValArg: 3.63 ± 0.062
5.187ValSer: 5.187 ± 0.067
4.482ValThr: 4.482 ± 0.076
4.833ValVal: 4.833 ± 0.071
0.69ValTrp: 0.69 ± 0.025
2.36ValTyr: 2.36 ± 0.041
0.0ValXaa: 0.0 ± 0.0
Trp
0.798TrpAla: 0.798 ± 0.026
0.207TrpCys: 0.207 ± 0.012
0.604TrpAsp: 0.604 ± 0.021
0.582TrpGlu: 0.582 ± 0.023
0.429TrpPhe: 0.429 ± 0.022
0.735TrpGly: 0.735 ± 0.025
0.186TrpHis: 0.186 ± 0.012
0.516TrpIle: 0.516 ± 0.023
0.625TrpLys: 0.625 ± 0.023
1.052TrpLeu: 1.052 ± 0.028
0.315TrpMet: 0.315 ± 0.016
0.515TrpAsn: 0.515 ± 0.021
0.299TrpPro: 0.299 ± 0.017
0.394TrpGln: 0.394 ± 0.019
0.572TrpArg: 0.572 ± 0.024
0.605TrpSer: 0.605 ± 0.025
0.542TrpThr: 0.542 ± 0.025
0.66TrpVal: 0.66 ± 0.024
0.112TrpTrp: 0.112 ± 0.011
0.397TrpTyr: 0.397 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.966TyrAla: 2.966 ± 0.057
0.698TyrCys: 0.698 ± 0.027
2.289TyrAsp: 2.289 ± 0.046
2.098TyrGlu: 2.098 ± 0.036
1.549TyrPhe: 1.549 ± 0.037
2.831TyrGly: 2.831 ± 0.051
0.743TyrHis: 0.743 ± 0.026
2.013TyrIle: 2.013 ± 0.042
1.524TyrLys: 1.524 ± 0.038
3.38TyrLeu: 3.38 ± 0.066
0.782TyrMet: 0.782 ± 0.026
1.375TyrAsn: 1.375 ± 0.038
1.375TyrPro: 1.375 ± 0.033
1.304TyrGln: 1.304 ± 0.031
2.179TyrArg: 2.179 ± 0.045
2.154TyrSer: 2.154 ± 0.046
2.171TyrThr: 2.171 ± 0.058
2.134TyrVal: 2.134 ± 0.041
0.353TyrTrp: 0.353 ± 0.016
1.479TyrTyr: 1.479 ± 0.04
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4593 proteins (1245691 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski