Amino acid dipepetide frequency for Ileibacterium valens

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.86AlaAla: 6.86 ± 0.141
0.956AlaCys: 0.956 ± 0.038
4.371AlaAsp: 4.371 ± 0.089
4.148AlaGlu: 4.148 ± 0.092
2.923AlaPhe: 2.923 ± 0.076
5.179AlaGly: 5.179 ± 0.107
1.204AlaHis: 1.204 ± 0.041
5.386AlaIle: 5.386 ± 0.097
4.8AlaLys: 4.8 ± 0.113
7.388AlaLeu: 7.388 ± 0.112
2.269AlaMet: 2.269 ± 0.066
3.068AlaAsn: 3.068 ± 0.069
2.064AlaPro: 2.064 ± 0.058
2.433AlaGln: 2.433 ± 0.058
3.083AlaArg: 3.083 ± 0.073
4.733AlaSer: 4.733 ± 0.092
2.496AlaThr: 2.496 ± 0.069
4.583AlaVal: 4.583 ± 0.1
0.668AlaTrp: 0.668 ± 0.032
2.336AlaTyr: 2.336 ± 0.06
0.0AlaXaa: 0.0 ± 0.0
Cys
0.9CysAla: 0.9 ± 0.041
0.183CysCys: 0.183 ± 0.019
0.595CysAsp: 0.595 ± 0.027
0.653CysGlu: 0.653 ± 0.031
0.591CysPhe: 0.591 ± 0.028
1.004CysGly: 1.004 ± 0.042
0.328CysHis: 0.328 ± 0.023
0.85CysIle: 0.85 ± 0.038
0.63CysLys: 0.63 ± 0.033
1.284CysLeu: 1.284 ± 0.047
0.359CysMet: 0.359 ± 0.019
0.395CysAsn: 0.395 ± 0.024
0.542CysPro: 0.542 ± 0.026
0.444CysGln: 0.444 ± 0.03
0.569CysArg: 0.569 ± 0.031
0.991CysSer: 0.991 ± 0.039
0.579CysThr: 0.579 ± 0.03
0.606CysVal: 0.606 ± 0.033
0.151CysTrp: 0.151 ± 0.018
0.398CysTyr: 0.398 ± 0.022
0.0CysXaa: 0.0 ± 0.0
Asp
3.734AspAla: 3.734 ± 0.076
0.699AspCys: 0.699 ± 0.034
3.355AspAsp: 3.355 ± 0.092
5.181AspGlu: 5.181 ± 0.109
2.958AspPhe: 2.958 ± 0.068
4.222AspGly: 4.222 ± 0.087
1.652AspHis: 1.652 ± 0.051
3.794AspIle: 3.794 ± 0.075
3.181AspLys: 3.181 ± 0.077
6.716AspLeu: 6.716 ± 0.114
1.469AspMet: 1.469 ± 0.039
2.157AspAsn: 2.157 ± 0.056
2.901AspPro: 2.901 ± 0.064
3.269AspGln: 3.269 ± 0.078
2.666AspArg: 2.666 ± 0.059
3.903AspSer: 3.903 ± 0.094
2.549AspThr: 2.549 ± 0.068
3.579AspVal: 3.579 ± 0.068
0.723AspTrp: 0.723 ± 0.033
2.582AspTyr: 2.582 ± 0.063
0.0AspXaa: 0.0 ± 0.0
Glu
5.235GluAla: 5.235 ± 0.088
0.732GluCys: 0.732 ± 0.037
4.65GluAsp: 4.65 ± 0.081
6.141GluGlu: 6.141 ± 0.129
2.879GluPhe: 2.879 ± 0.078
3.746GluGly: 3.746 ± 0.083
1.184GluHis: 1.184 ± 0.043
5.64GluIle: 5.64 ± 0.101
5.944GluLys: 5.944 ± 0.101
6.286GluLeu: 6.286 ± 0.107
2.435GluMet: 2.435 ± 0.06
4.807GluAsn: 4.807 ± 0.094
2.118GluPro: 2.118 ± 0.065
3.13GluGln: 3.13 ± 0.078
2.586GluArg: 2.586 ± 0.072
3.99GluSer: 3.99 ± 0.088
3.518GluThr: 3.518 ± 0.074
4.218GluVal: 4.218 ± 0.081
0.662GluTrp: 0.662 ± 0.03
2.583GluTyr: 2.583 ± 0.06
0.0GluXaa: 0.0 ± 0.0
Phe
2.951PheAla: 2.951 ± 0.077
0.683PheCys: 0.683 ± 0.03
3.061PheAsp: 3.061 ± 0.071
3.157PheGlu: 3.157 ± 0.07
1.981PhePhe: 1.981 ± 0.064
3.172PheGly: 3.172 ± 0.068
0.834PheHis: 0.834 ± 0.033
3.011PheIle: 3.011 ± 0.078
2.443PheLys: 2.443 ± 0.063
4.115PheLeu: 4.115 ± 0.099
1.21PheMet: 1.21 ± 0.044
1.903PheAsn: 1.903 ± 0.054
1.388PhePro: 1.388 ± 0.043
1.339PheGln: 1.339 ± 0.042
1.735PheArg: 1.735 ± 0.053
3.372PheSer: 3.372 ± 0.094
2.298PheThr: 2.298 ± 0.074
2.653PheVal: 2.653 ± 0.064
0.469PheTrp: 0.469 ± 0.026
1.642PheTyr: 1.642 ± 0.056
0.0PheXaa: 0.0 ± 0.0
Gly
4.05GlyAla: 4.05 ± 0.102
1.007GlyCys: 1.007 ± 0.046
2.966GlyAsp: 2.966 ± 0.07
3.208GlyGlu: 3.208 ± 0.08
3.251GlyPhe: 3.251 ± 0.073
3.605GlyGly: 3.605 ± 0.099
1.175GlyHis: 1.175 ± 0.046
5.311GlyIle: 5.311 ± 0.095
4.779GlyLys: 4.779 ± 0.096
6.095GlyLeu: 6.095 ± 0.112
2.081GlyMet: 2.081 ± 0.058
2.857GlyAsn: 2.857 ± 0.078
1.375GlyPro: 1.375 ± 0.048
2.054GlyGln: 2.054 ± 0.061
2.653GlyArg: 2.653 ± 0.067
4.14GlySer: 4.14 ± 0.094
3.672GlyThr: 3.672 ± 0.095
3.963GlyVal: 3.963 ± 0.071
0.757GlyTrp: 0.757 ± 0.036
2.877GlyTyr: 2.877 ± 0.071
0.0GlyXaa: 0.0 ± 0.0
His
1.087HisAla: 1.087 ± 0.039
0.245HisCys: 0.245 ± 0.019
1.05HisAsp: 1.05 ± 0.037
1.409HisGlu: 1.409 ± 0.035
0.93HisPhe: 0.93 ± 0.041
1.258HisGly: 1.258 ± 0.045
0.501HisHis: 0.501 ± 0.029
1.242HisIle: 1.242 ± 0.039
0.957HisLys: 0.957 ± 0.033
1.903HisLeu: 1.903 ± 0.059
0.462HisMet: 0.462 ± 0.026
0.745HisAsn: 0.745 ± 0.029
1.188HisPro: 1.188 ± 0.049
0.934HisGln: 0.934 ± 0.038
0.743HisArg: 0.743 ± 0.031
1.329HisSer: 1.329 ± 0.046
0.892HisThr: 0.892 ± 0.036
1.063HisVal: 1.063 ± 0.037
0.175HisTrp: 0.175 ± 0.015
0.789HisTyr: 0.789 ± 0.032
0.0HisXaa: 0.0 ± 0.0
Ile
5.395IleAla: 5.395 ± 0.101
1.144IleCys: 1.144 ± 0.043
4.597IleAsp: 4.597 ± 0.087
5.699IleGlu: 5.699 ± 0.089
2.94IlePhe: 2.94 ± 0.073
4.478IleGly: 4.478 ± 0.09
1.723IleHis: 1.723 ± 0.051
4.603IleIle: 4.603 ± 0.085
3.753IleLys: 3.753 ± 0.078
7.583IleLeu: 7.583 ± 0.127
1.762IleMet: 1.762 ± 0.048
3.251IleAsn: 3.251 ± 0.066
3.532IlePro: 3.532 ± 0.081
2.964IleGln: 2.964 ± 0.063
3.884IleArg: 3.884 ± 0.086
5.463IleSer: 5.463 ± 0.096
3.771IleThr: 3.771 ± 0.075
4.382IleVal: 4.382 ± 0.089
0.666IleTrp: 0.666 ± 0.034
2.619IleTyr: 2.619 ± 0.065
0.0IleXaa: 0.0 ± 0.0
Lys
5.525LysAla: 5.525 ± 0.091
0.397LysCys: 0.397 ± 0.024
4.644LysAsp: 4.644 ± 0.096
6.124LysGlu: 6.124 ± 0.097
1.869LysPhe: 1.869 ± 0.056
3.686LysGly: 3.686 ± 0.088
1.038LysHis: 1.038 ± 0.035
4.724LysIle: 4.724 ± 0.099
6.425LysLys: 6.425 ± 0.112
5.145LysLeu: 5.145 ± 0.082
2.093LysMet: 2.093 ± 0.062
4.13LysAsn: 4.13 ± 0.073
2.493LysPro: 2.493 ± 0.066
2.629LysGln: 2.629 ± 0.061
2.98LysArg: 2.98 ± 0.076
3.917LysSer: 3.917 ± 0.077
4.188LysThr: 4.188 ± 0.078
3.897LysVal: 3.897 ± 0.081
0.545LysTrp: 0.545 ± 0.028
2.249LysTyr: 2.249 ± 0.065
0.0LysXaa: 0.0 ± 0.0
Leu
6.727LeuAla: 6.727 ± 0.107
1.319LeuCys: 1.319 ± 0.046
5.995LeuAsp: 5.995 ± 0.094
6.686LeuGlu: 6.686 ± 0.093
4.556LeuPhe: 4.556 ± 0.11
5.335LeuGly: 5.335 ± 0.1
1.656LeuHis: 1.656 ± 0.052
7.594LeuIle: 7.594 ± 0.138
7.175LeuLys: 7.175 ± 0.116
8.891LeuLeu: 8.891 ± 0.159
2.933LeuMet: 2.933 ± 0.08
5.159LeuAsn: 5.159 ± 0.087
3.763LeuPro: 3.763 ± 0.076
3.145LeuGln: 3.145 ± 0.07
3.774LeuArg: 3.774 ± 0.075
7.299LeuSer: 7.299 ± 0.114
4.335LeuThr: 4.335 ± 0.08
5.462LeuVal: 5.462 ± 0.096
0.859LeuTrp: 0.859 ± 0.034
3.107LeuTyr: 3.107 ± 0.068
0.0LeuXaa: 0.0 ± 0.0
Met
2.438MetAla: 2.438 ± 0.064
0.217MetCys: 0.217 ± 0.018
2.057MetAsp: 2.057 ± 0.057
2.117MetGlu: 2.117 ± 0.049
1.051MetPhe: 1.051 ± 0.041
1.622MetGly: 1.622 ± 0.051
0.404MetHis: 0.404 ± 0.023
2.54MetIle: 2.54 ± 0.065
2.66MetLys: 2.66 ± 0.065
2.414MetLeu: 2.414 ± 0.06
0.944MetMet: 0.944 ± 0.039
2.16MetAsn: 2.16 ± 0.062
1.158MetPro: 1.158 ± 0.046
1.04MetGln: 1.04 ± 0.036
0.959MetArg: 0.959 ± 0.034
1.881MetSer: 1.881 ± 0.055
1.515MetThr: 1.515 ± 0.051
1.81MetVal: 1.81 ± 0.055
0.154MetTrp: 0.154 ± 0.014
0.668MetTyr: 0.668 ± 0.034
0.0MetXaa: 0.0 ± 0.0
Asn
3.312AsnAla: 3.312 ± 0.084
0.499AsnCys: 0.499 ± 0.026
2.834AsnAsp: 2.834 ± 0.07
3.569AsnGlu: 3.569 ± 0.083
1.649AsnPhe: 1.649 ± 0.053
3.552AsnGly: 3.552 ± 0.074
1.09AsnHis: 1.09 ± 0.046
3.08AsnIle: 3.08 ± 0.066
2.918AsnLys: 2.918 ± 0.062
4.602AsnLeu: 4.602 ± 0.074
1.439AsnMet: 1.439 ± 0.04
2.143AsnAsn: 2.143 ± 0.067
2.821AsnPro: 2.821 ± 0.071
2.553AsnGln: 2.553 ± 0.07
2.486AsnArg: 2.486 ± 0.069
3.041AsnSer: 3.041 ± 0.076
2.493AsnThr: 2.493 ± 0.067
2.763AsnVal: 2.763 ± 0.068
0.511AsnTrp: 0.511 ± 0.031
1.752AsnTyr: 1.752 ± 0.057
0.0AsnXaa: 0.0 ± 0.0
Pro
2.894ProAla: 2.894 ± 0.074
0.352ProCys: 0.352 ± 0.023
2.81ProAsp: 2.81 ± 0.076
3.961ProGlu: 3.961 ± 0.09
1.816ProPhe: 1.816 ± 0.049
2.418ProGly: 2.418 ± 0.057
0.556ProHis: 0.556 ± 0.033
2.499ProIle: 2.499 ± 0.06
2.242ProLys: 2.242 ± 0.06
3.195ProLeu: 3.195 ± 0.07
0.96ProMet: 0.96 ± 0.039
1.592ProAsn: 1.592 ± 0.046
0.602ProPro: 0.602 ± 0.029
1.171ProGln: 1.171 ± 0.046
1.044ProArg: 1.044 ± 0.04
2.421ProSer: 2.421 ± 0.058
1.658ProThr: 1.658 ± 0.05
3.252ProVal: 3.252 ± 0.095
0.382ProTrp: 0.382 ± 0.022
1.438ProTyr: 1.438 ± 0.045
0.0ProXaa: 0.0 ± 0.0
Gln
3.071GlnAla: 3.071 ± 0.074
0.272GlnCys: 0.272 ± 0.022
2.134GlnAsp: 2.134 ± 0.06
2.783GlnGlu: 2.783 ± 0.071
1.411GlnPhe: 1.411 ± 0.048
1.978GlnGly: 1.978 ± 0.059
0.427GlnHis: 0.427 ± 0.025
3.365GlnIle: 3.365 ± 0.072
3.516GlnLys: 3.516 ± 0.075
3.154GlnLeu: 3.154 ± 0.065
1.271GlnMet: 1.271 ± 0.049
2.435GlnAsn: 2.435 ± 0.072
1.241GlnPro: 1.241 ± 0.044
1.284GlnGln: 1.284 ± 0.049
1.368GlnArg: 1.368 ± 0.049
2.553GlnSer: 2.553 ± 0.064
2.167GlnThr: 2.167 ± 0.054
2.228GlnVal: 2.228 ± 0.059
0.351GlnTrp: 0.351 ± 0.026
1.211GlnTyr: 1.211 ± 0.037
0.0GlnXaa: 0.0 ± 0.0
Arg
2.175ArgAla: 2.175 ± 0.056
0.399ArgCys: 0.399 ± 0.023
2.238ArgAsp: 2.238 ± 0.06
2.819ArgGlu: 2.819 ± 0.066
2.02ArgPhe: 2.02 ± 0.053
1.837ArgGly: 1.837 ± 0.054
0.85ArgHis: 0.85 ± 0.034
3.752ArgIle: 3.752 ± 0.078
3.903ArgLys: 3.903 ± 0.092
4.205ArgLeu: 4.205 ± 0.08
1.673ArgMet: 1.673 ± 0.049
2.212ArgAsn: 2.212 ± 0.058
1.415ArgPro: 1.415 ± 0.048
1.794ArgGln: 1.794 ± 0.051
1.957ArgArg: 1.957 ± 0.062
2.559ArgSer: 2.559 ± 0.064
2.087ArgThr: 2.087 ± 0.055
2.274ArgVal: 2.274 ± 0.057
0.422ArgTrp: 0.422 ± 0.028
1.769ArgTyr: 1.769 ± 0.059
0.0ArgXaa: 0.0 ± 0.0
Ser
4.627SerAla: 4.627 ± 0.089
0.782SerCys: 0.782 ± 0.034
4.292SerAsp: 4.292 ± 0.093
4.702SerGlu: 4.702 ± 0.09
3.112SerPhe: 3.112 ± 0.073
4.694SerGly: 4.694 ± 0.096
1.25SerHis: 1.25 ± 0.04
4.793SerIle: 4.793 ± 0.089
4.393SerLys: 4.393 ± 0.084
6.767SerLeu: 6.767 ± 0.126
2.12SerMet: 2.12 ± 0.057
2.937SerAsn: 2.937 ± 0.07
2.013SerPro: 2.013 ± 0.056
2.463SerGln: 2.463 ± 0.067
3.127SerArg: 3.127 ± 0.076
4.994SerSer: 4.994 ± 0.107
3.073SerThr: 3.073 ± 0.075
4.276SerVal: 4.276 ± 0.083
0.746SerTrp: 0.746 ± 0.037
2.342SerTyr: 2.342 ± 0.058
0.0SerXaa: 0.0 ± 0.0
Thr
3.549ThrAla: 3.549 ± 0.079
0.535ThrCys: 0.535 ± 0.027
3.15ThrAsp: 3.15 ± 0.083
3.158ThrGlu: 3.158 ± 0.07
2.249ThrPhe: 2.249 ± 0.065
3.754ThrGly: 3.754 ± 0.087
0.95ThrHis: 0.95 ± 0.038
3.789ThrIle: 3.789 ± 0.074
2.772ThrLys: 2.772 ± 0.072
4.864ThrLeu: 4.864 ± 0.085
1.309ThrMet: 1.309 ± 0.042
2.164ThrAsn: 2.164 ± 0.065
2.321ThrPro: 2.321 ± 0.07
1.562ThrGln: 1.562 ± 0.048
1.97ThrArg: 1.97 ± 0.052
3.053ThrSer: 3.053 ± 0.076
2.371ThrThr: 2.371 ± 0.07
3.418ThrVal: 3.418 ± 0.096
0.525ThrTrp: 0.525 ± 0.031
1.81ThrTyr: 1.81 ± 0.076
0.0ThrXaa: 0.0 ± 0.0
Val
3.492ValAla: 3.492 ± 0.079
1.01ValCys: 1.01 ± 0.038
3.69ValAsp: 3.69 ± 0.087
3.934ValGlu: 3.934 ± 0.082
3.058ValPhe: 3.058 ± 0.07
3.16ValGly: 3.16 ± 0.083
1.18ValHis: 1.18 ± 0.039
5.007ValIle: 5.007 ± 0.075
3.609ValLys: 3.609 ± 0.072
6.559ValLeu: 6.559 ± 0.102
1.752ValMet: 1.752 ± 0.052
2.926ValAsn: 2.926 ± 0.07
2.344ValPro: 2.344 ± 0.067
1.92ValGln: 1.92 ± 0.054
2.68ValArg: 2.68 ± 0.066
4.693ValSer: 4.693 ± 0.088
3.067ValThr: 3.067 ± 0.096
3.809ValVal: 3.809 ± 0.08
0.539ValTrp: 0.539 ± 0.028
2.351ValTyr: 2.351 ± 0.066
0.0ValXaa: 0.0 ± 0.0
Trp
0.552TrpAla: 0.552 ± 0.027
0.117TrpCys: 0.117 ± 0.012
0.506TrpAsp: 0.506 ± 0.026
0.441TrpGlu: 0.441 ± 0.026
0.452TrpPhe: 0.452 ± 0.03
0.502TrpGly: 0.502 ± 0.027
0.167TrpHis: 0.167 ± 0.017
1.024TrpIle: 1.024 ± 0.047
0.727TrpLys: 0.727 ± 0.033
1.063TrpLeu: 1.063 ± 0.04
0.452TrpMet: 0.452 ± 0.029
0.581TrpAsn: 0.581 ± 0.035
0.307TrpPro: 0.307 ± 0.019
0.481TrpGln: 0.481 ± 0.029
0.314TrpArg: 0.314 ± 0.022
0.542TrpSer: 0.542 ± 0.027
0.595TrpThr: 0.595 ± 0.033
0.498TrpVal: 0.498 ± 0.03
0.094TrpTrp: 0.094 ± 0.013
0.355TrpTyr: 0.355 ± 0.025
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.374TyrAla: 2.374 ± 0.068
0.505TyrCys: 0.505 ± 0.028
2.295TyrAsp: 2.295 ± 0.058
2.596TyrGlu: 2.596 ± 0.064
1.723TyrPhe: 1.723 ± 0.057
2.672TyrGly: 2.672 ± 0.068
0.735TyrHis: 0.735 ± 0.035
2.204TyrIle: 2.204 ± 0.052
1.869TyrLys: 1.869 ± 0.055
3.543TyrLeu: 3.543 ± 0.088
0.813TyrMet: 0.813 ± 0.038
1.488TyrAsn: 1.488 ± 0.052
1.677TyrPro: 1.677 ± 0.054
1.609TyrGln: 1.609 ± 0.047
1.783TyrArg: 1.783 ± 0.046
2.629TyrSer: 2.629 ± 0.064
1.95TyrThr: 1.95 ± 0.053
2.044TyrVal: 2.044 ± 0.047
0.369TyrTrp: 0.369 ± 0.022
1.391TyrTyr: 1.391 ± 0.05
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2365 proteins (701050 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski