Amino acid dipeptide frequency for Bacteroidaceae bacterium HV4-6-C5C

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
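The counting step can be sketched in Python as follows. This is only an illustration, not the code used by Proteome-pI: the function name dipeptide_per_mille is hypothetical, and it assumes one-letter sequences, overlapping adjacent pairs counted within each protein (never across two proteins), and any letter outside the 20 standard ones mapped to X (Xaa).

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (made-up sequences, not taken from this proteome).
freqs = dipeptide_per_mille(["MKLLA", "MKUL"])
```

With the toy input above, the two sequences contribute 4 and 3 pairs respectively, so each observed pair is reported as a share of 7 pairs in total.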

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
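The comparison against the uniform expectation can be sketched as below. The 2.5‰ baseline follows directly from 1/400; whether the page colours cells using exactly this threshold is an assumption, and the classify helper is hypothetical. The three example values are taken from the table.

```python
N_STANDARD = 20
# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)

def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_per_mille > expected:
        return "over"      # would be marked red under this assumption
    if observed_per_mille < expected:
        return "under"     # would be marked blue under this assumption
    return "as expected"

# Values in per mille, copied from the table.
examples = {"LeuLeu": 9.117, "AlaArg": 2.763, "CysCys": 0.18}
labels = {name: classify(value) for name, value in examples.items()}
```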

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
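As a small worked example of the unit conversion, the helpers below (hypothetical names) turn a per mille value from the table into a plain fraction or a percentage.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

# LeuLeu is listed as 9.117 per mille in the table.
leu_leu_fraction = per_mille_to_fraction(9.117)
leu_leu_percent = per_mille_to_percent(9.117)
```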

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.017 ± 0.087
AlaCys: 0.916 ± 0.028
AlaAsp: 3.831 ± 0.06
AlaGlu: 4.131 ± 0.063
AlaPhe: 3.068 ± 0.059
AlaGly: 4.794 ± 0.077
AlaHis: 1.157 ± 0.034
AlaIle: 5.026 ± 0.077
AlaLys: 4.334 ± 0.069
AlaLeu: 6.467 ± 0.086
AlaMet: 1.735 ± 0.041
AlaAsn: 3.439 ± 0.067
AlaPro: 2.027 ± 0.043
AlaGln: 2.482 ± 0.048
AlaArg: 2.763 ± 0.054
AlaSer: 4.543 ± 0.066
AlaThr: 3.829 ± 0.065
AlaVal: 4.286 ± 0.07
AlaTrp: 0.798 ± 0.027
AlaTyr: 2.967 ± 0.053
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.668 ± 0.022
CysCys: 0.18 ± 0.015
CysAsp: 0.596 ± 0.023
CysGlu: 0.606 ± 0.024
CysPhe: 0.594 ± 0.026
CysGly: 0.936 ± 0.028
CysHis: 0.251 ± 0.016
CysIle: 0.864 ± 0.028
CysLys: 0.665 ± 0.023
CysLeu: 1.028 ± 0.029
CysMet: 0.297 ± 0.016
CysAsn: 0.569 ± 0.02
CysPro: 0.478 ± 0.027
CysGln: 0.331 ± 0.018
CysArg: 0.542 ± 0.021
CysSer: 0.759 ± 0.026
CysThr: 0.616 ± 0.023
CysVal: 0.67 ± 0.028
CysTrp: 0.129 ± 0.011
CysTyr: 0.514 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.791 ± 0.063
AspCys: 0.574 ± 0.023
AspAsp: 2.613 ± 0.052
AspGlu: 3.629 ± 0.064
AspPhe: 3.044 ± 0.049
AspGly: 3.996 ± 0.065
AspHis: 0.925 ± 0.032
AspIle: 4.381 ± 0.066
AspLys: 4.197 ± 0.065
AspLeu: 4.832 ± 0.067
AspMet: 1.514 ± 0.032
AspAsn: 2.899 ± 0.061
AspPro: 1.939 ± 0.041
AspGln: 1.424 ± 0.038
AspArg: 2.435 ± 0.051
AspSer: 3.228 ± 0.058
AspThr: 2.633 ± 0.055
AspVal: 3.407 ± 0.061
AspTrp: 0.898 ± 0.032
AspTyr: 2.852 ± 0.048
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.353 ± 0.074
GluCys: 0.606 ± 0.023
GluAsp: 3.025 ± 0.06
GluGlu: 4.431 ± 0.081
GluPhe: 2.33 ± 0.046
GluGly: 3.852 ± 0.061
GluHis: 1.205 ± 0.038
GluIle: 4.548 ± 0.069
GluLys: 5.04 ± 0.078
GluLeu: 5.832 ± 0.086
GluMet: 1.791 ± 0.043
GluAsn: 3.621 ± 0.053
GluPro: 1.589 ± 0.037
GluGln: 2.294 ± 0.05
GluArg: 2.93 ± 0.058
GluSer: 3.395 ± 0.054
GluThr: 2.976 ± 0.052
GluVal: 3.992 ± 0.058
GluTrp: 0.821 ± 0.027
GluTyr: 2.676 ± 0.048
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.948 ± 0.052
PheCys: 0.606 ± 0.023
PheAsp: 2.73 ± 0.05
PheGlu: 2.416 ± 0.047
PhePhe: 2.26 ± 0.051
PheGly: 3.31 ± 0.061
PheHis: 0.884 ± 0.028
PheIle: 3.467 ± 0.06
PheLys: 2.636 ± 0.056
PheLeu: 4.027 ± 0.072
PheMet: 1.178 ± 0.031
PheAsn: 2.578 ± 0.049
PhePro: 1.741 ± 0.038
PheGln: 1.28 ± 0.034
PheArg: 2.005 ± 0.043
PheSer: 3.7 ± 0.061
PheThr: 2.705 ± 0.056
PheVal: 2.78 ± 0.057
PheTrp: 0.572 ± 0.024
PheTyr: 2.08 ± 0.049
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.277 ± 0.071
GlyCys: 0.829 ± 0.03
GlyAsp: 3.518 ± 0.057
GlyGlu: 3.935 ± 0.062
GlyPhe: 3.194 ± 0.052
GlyGly: 4.787 ± 0.086
GlyHis: 1.238 ± 0.035
GlyIle: 5.558 ± 0.063
GlyLys: 5.367 ± 0.076
GlyLeu: 5.917 ± 0.08
GlyMet: 2.077 ± 0.049
GlyAsn: 3.804 ± 0.069
GlyPro: 1.244 ± 0.032
GlyGln: 2.021 ± 0.047
GlyArg: 2.809 ± 0.058
GlySer: 4.289 ± 0.073
GlyThr: 4.128 ± 0.077
GlyVal: 4.828 ± 0.064
GlyTrp: 1.053 ± 0.033
GlyTyr: 3.588 ± 0.065
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.156 ± 0.036
HisCys: 0.264 ± 0.018
HisAsp: 0.941 ± 0.027
HisGlu: 0.988 ± 0.028
HisPhe: 1.073 ± 0.031
HisGly: 1.174 ± 0.032
HisHis: 0.459 ± 0.023
HisIle: 1.467 ± 0.038
HisLys: 1.031 ± 0.028
HisLeu: 1.856 ± 0.046
HisMet: 0.337 ± 0.017
HisAsn: 0.938 ± 0.028
HisPro: 1.061 ± 0.034
HisGln: 0.619 ± 0.022
HisArg: 0.815 ± 0.029
HisSer: 1.265 ± 0.033
HisThr: 0.982 ± 0.024
HisVal: 0.953 ± 0.027
HisTrp: 0.258 ± 0.015
HisTyr: 0.942 ± 0.03
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.42 ± 0.074
IleCys: 0.939 ± 0.03
IleAsp: 4.347 ± 0.063
IleGlu: 4.514 ± 0.07
IlePhe: 2.949 ± 0.062
IleGly: 4.99 ± 0.08
IleHis: 1.482 ± 0.037
IleIle: 5.146 ± 0.081
IleLys: 4.722 ± 0.065
IleLeu: 6.509 ± 0.081
IleMet: 1.541 ± 0.038
IleAsn: 3.977 ± 0.063
IlePro: 3.362 ± 0.057
IleGln: 2.273 ± 0.053
IleArg: 3.422 ± 0.061
IleSer: 5.319 ± 0.073
IleThr: 4.235 ± 0.067
IleVal: 4.372 ± 0.074
IleTrp: 0.817 ± 0.029
IleTyr: 3.058 ± 0.053
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.908 ± 0.081
LysCys: 0.528 ± 0.021
LysAsp: 4.317 ± 0.058
LysGlu: 5.567 ± 0.085
LysPhe: 2.182 ± 0.046
LysGly: 4.877 ± 0.075
LysHis: 1.28 ± 0.036
LysIle: 4.681 ± 0.068
LysLys: 5.404 ± 0.084
LysLeu: 5.641 ± 0.071
LysMet: 2.157 ± 0.047
LysAsn: 3.864 ± 0.064
LysPro: 2.087 ± 0.048
LysGln: 2.644 ± 0.055
LysArg: 3.172 ± 0.055
LysSer: 3.931 ± 0.058
LysThr: 3.702 ± 0.064
LysVal: 4.41 ± 0.062
LysTrp: 0.805 ± 0.027
LysTyr: 3.175 ± 0.061
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.991 ± 0.085
LeuCys: 1.203 ± 0.036
LeuAsp: 4.672 ± 0.065
LeuGlu: 4.882 ± 0.08
LeuPhe: 4.706 ± 0.071
LeuGly: 5.594 ± 0.088
LeuHis: 1.743 ± 0.036
LeuIle: 6.131 ± 0.083
LeuLys: 6.837 ± 0.078
LeuLeu: 9.117 ± 0.143
LeuMet: 2.365 ± 0.047
LeuAsn: 5.177 ± 0.07
LeuPro: 3.975 ± 0.062
LeuGln: 3.266 ± 0.056
LeuArg: 4.039 ± 0.063
LeuSer: 7.162 ± 0.088
LeuThr: 5.079 ± 0.082
LeuVal: 4.965 ± 0.072
LeuTrp: 1.052 ± 0.035
LeuTyr: 3.846 ± 0.065
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.805 ± 0.041
MetCys: 0.196 ± 0.014
MetAsp: 1.483 ± 0.038
MetGlu: 1.647 ± 0.039
MetPhe: 0.962 ± 0.034
MetGly: 1.802 ± 0.04
MetHis: 0.455 ± 0.017
MetIle: 1.666 ± 0.044
MetLys: 2.462 ± 0.042
MetLeu: 2.358 ± 0.05
MetMet: 0.7 ± 0.026
MetAsn: 1.587 ± 0.037
MetPro: 1.161 ± 0.031
MetGln: 1.0 ± 0.027
MetArg: 1.22 ± 0.03
MetSer: 1.535 ± 0.037
MetThr: 1.381 ± 0.037
MetVal: 1.485 ± 0.036
MetTrp: 0.236 ± 0.013
MetTyr: 0.832 ± 0.031
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.683 ± 0.063
AsnCys: 0.503 ± 0.024
AsnAsp: 2.844 ± 0.053
AsnGlu: 3.275 ± 0.057
AsnPhe: 2.373 ± 0.043
AsnGly: 4.234 ± 0.07
AsnHis: 0.99 ± 0.031
AsnIle: 4.478 ± 0.072
AsnLys: 3.709 ± 0.06
AsnLeu: 4.661 ± 0.069
AsnMet: 1.351 ± 0.038
AsnAsn: 3.083 ± 0.063
AsnPro: 2.634 ± 0.047
AsnGln: 1.742 ± 0.042
AsnArg: 2.424 ± 0.041
AsnSer: 3.392 ± 0.068
AsnThr: 3.073 ± 0.059
AsnVal: 3.26 ± 0.062
AsnTrp: 0.732 ± 0.026
AsnTyr: 2.67 ± 0.051
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.518 ± 0.048
ProCys: 0.338 ± 0.015
ProAsp: 2.509 ± 0.051
ProGlu: 2.921 ± 0.051
ProPhe: 1.8 ± 0.038
ProGly: 2.363 ± 0.047
ProHis: 0.705 ± 0.025
ProIle: 2.443 ± 0.05
ProLys: 2.037 ± 0.038
ProLeu: 3.299 ± 0.059
ProMet: 0.873 ± 0.027
ProAsn: 1.874 ± 0.044
ProPro: 0.775 ± 0.024
ProGln: 1.378 ± 0.035
ProArg: 1.232 ± 0.037
ProSer: 2.347 ± 0.05
ProThr: 1.982 ± 0.045
ProVal: 2.803 ± 0.056
ProTrp: 0.469 ± 0.02
ProTyr: 1.781 ± 0.038
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.292 ± 0.039
GlnCys: 0.275 ± 0.016
GlnAsp: 1.611 ± 0.037
GlnGlu: 2.052 ± 0.045
GlnPhe: 1.306 ± 0.032
GlnGly: 1.947 ± 0.037
GlnHis: 0.629 ± 0.028
GlnIle: 2.438 ± 0.047
GlnLys: 2.487 ± 0.046
GlnLeu: 3.37 ± 0.062
GlnMet: 0.963 ± 0.03
GlnAsn: 1.924 ± 0.045
GlnPro: 1.141 ± 0.027
GlnGln: 1.493 ± 0.041
GlnArg: 1.598 ± 0.043
GlnSer: 2.098 ± 0.045
GlnThr: 1.976 ± 0.046
GlnVal: 2.051 ± 0.042
GlnTrp: 0.489 ± 0.021
GlnTyr: 1.511 ± 0.037
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.541 ± 0.057
ArgCys: 0.418 ± 0.02
ArgAsp: 2.153 ± 0.044
ArgGlu: 2.728 ± 0.055
ArgPhe: 2.14 ± 0.046
ArgGly: 2.368 ± 0.049
ArgHis: 0.833 ± 0.026
ArgIle: 3.614 ± 0.056
ArgLys: 3.325 ± 0.061
ArgLeu: 4.324 ± 0.067
ArgMet: 1.374 ± 0.036
ArgAsn: 2.49 ± 0.047
ArgPro: 1.466 ± 0.043
ArgGln: 1.632 ± 0.041
ArgArg: 1.99 ± 0.05
ArgSer: 2.56 ± 0.05
ArgThr: 2.282 ± 0.043
ArgVal: 2.563 ± 0.05
ArgTrp: 0.669 ± 0.024
ArgTyr: 2.128 ± 0.039
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.339 ± 0.07
SerCys: 0.825 ± 0.026
SerAsp: 3.608 ± 0.056
SerGlu: 3.658 ± 0.055
SerPhe: 3.662 ± 0.057
SerGly: 4.974 ± 0.069
SerHis: 1.148 ± 0.034
SerIle: 5.07 ± 0.073
SerLys: 3.864 ± 0.055
SerLeu: 6.441 ± 0.083
SerMet: 1.523 ± 0.038
SerAsn: 3.269 ± 0.06
SerPro: 2.489 ± 0.048
SerGln: 2.01 ± 0.047
SerArg: 2.704 ± 0.049
SerSer: 4.702 ± 0.083
SerThr: 3.64 ± 0.056
SerVal: 4.322 ± 0.061
SerTrp: 0.912 ± 0.036
SerTyr: 3.121 ± 0.057
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.793 ± 0.068
ThrCys: 0.519 ± 0.022
ThrAsp: 3.396 ± 0.054
ThrGlu: 3.087 ± 0.054
ThrPhe: 2.682 ± 0.054
ThrGly: 4.38 ± 0.071
ThrHis: 1.031 ± 0.031
ThrIle: 3.955 ± 0.062
ThrLys: 3.094 ± 0.051
ThrLeu: 5.325 ± 0.072
ThrMet: 1.116 ± 0.031
ThrAsn: 2.805 ± 0.05
ThrPro: 2.686 ± 0.045
ThrGln: 1.833 ± 0.044
ThrArg: 2.06 ± 0.043
ThrSer: 3.583 ± 0.066
ThrThr: 3.117 ± 0.07
ThrVal: 3.622 ± 0.061
ThrTrp: 0.657 ± 0.024
ThrTyr: 2.537 ± 0.049
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.336 ± 0.069
ValCys: 0.874 ± 0.029
ValAsp: 3.641 ± 0.064
ValGlu: 3.759 ± 0.064
ValPhe: 2.809 ± 0.053
ValGly: 4.004 ± 0.062
ValHis: 1.026 ± 0.029
ValIle: 4.5 ± 0.073
ValLys: 4.237 ± 0.06
ValLeu: 5.514 ± 0.077
ValMet: 1.577 ± 0.035
ValAsn: 3.475 ± 0.063
ValPro: 2.336 ± 0.054
ValGln: 1.822 ± 0.038
ValArg: 2.72 ± 0.053
ValSer: 4.537 ± 0.063
ValThr: 3.512 ± 0.066
ValVal: 4.087 ± 0.074
ValTrp: 0.687 ± 0.024
ValTyr: 2.697 ± 0.05
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.764 ± 0.025
TrpCys: 0.167 ± 0.013
TrpAsp: 0.739 ± 0.027
TrpGlu: 0.751 ± 0.03
TrpPhe: 0.563 ± 0.02
TrpGly: 1.015 ± 0.034
TrpHis: 0.262 ± 0.015
TrpIle: 0.865 ± 0.028
TrpLys: 0.93 ± 0.031
TrpLeu: 1.137 ± 0.037
TrpMet: 0.425 ± 0.018
TrpAsn: 0.93 ± 0.029
TrpPro: 0.299 ± 0.018
TrpGln: 0.488 ± 0.025
TrpArg: 0.565 ± 0.022
TrpSer: 0.767 ± 0.03
TrpThr: 0.701 ± 0.028
TrpVal: 0.78 ± 0.026
TrpTrp: 0.189 ± 0.012
TrpTyr: 0.528 ± 0.02
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.011 ± 0.052
TyrCys: 0.532 ± 0.02
TyrAsp: 2.639 ± 0.055
TyrGlu: 2.33 ± 0.05
TyrPhe: 2.209 ± 0.039
TyrGly: 3.065 ± 0.062
TyrHis: 0.879 ± 0.029
TyrIle: 3.094 ± 0.061
TyrLys: 2.95 ± 0.049
TyrLeu: 4.148 ± 0.064
TyrMet: 1.079 ± 0.029
TyrAsn: 2.807 ± 0.059
TyrPro: 1.957 ± 0.047
TyrGln: 1.618 ± 0.042
TyrArg: 2.151 ± 0.048
TyrSer: 3.167 ± 0.056
TyrThr: 2.732 ± 0.048
TyrVal: 2.513 ± 0.049
TyrTrp: 0.622 ± 0.022
TyrTyr: 2.251 ± 0.06
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2881 proteins (1119731 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
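A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the sample standard deviation of those estimates is taken as the error. The function names, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over all adjacent pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, same sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome (made-up sequences).
proteins = ["MKLLA", "MLLKV", "MAKTG", "MLLLL"]
point = pair_per_mille(proteins, "LL")
error = bootstrap_error(proteins, "LL")
```

Resampling whole proteins rather than individual residue pairs keeps pairs from the same protein together, which is one reasonable reading of "at the protein level".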

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski