Amino acid dipeptide frequency for Flavonifractor sp. An10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red and all under-represented ones in blue in the table.
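
The uniform expectation described above (1/400 of all dipeptides, i.e. 2.5‰) can be illustrated with a minimal Python sketch. It is not the database's actual code: the function names, the use of overlapping residue pairs, and the mapping of every non-standard letter to Xaa are assumptions made for illustration only.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the same order as the table header.
# "X" stands for Xaa (assumed to collect all non-standard or unknown residues).
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    freqs = {}
    for a, b in product(STANDARD + "X", repeat=2):
        name = ONE_TO_THREE[a] + ONE_TO_THREE[b]
        freqs[name] = 1000.0 * counts[a + b] / total if total else 0.0
    return freqs


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAALLA", "AAGL"]
freqs = dipeptide_permille(toy)
```

With this labelling, the table value for AlaAla (13.932‰) would count as over-represented and the value for CysCys (0.317‰) as under-represented. Whether the database colours cells by exactly this threshold is an assumption.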

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions; for example, 13.932‰ corresponds to 0.013932 (about 1.39%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.932 ± 0.183
AlaCys: 1.59 ± 0.042
AlaAsp: 5.715 ± 0.081
AlaGlu: 7.411 ± 0.089
AlaPhe: 3.44 ± 0.056
AlaGly: 8.811 ± 0.123
AlaHis: 1.639 ± 0.039
AlaIle: 4.704 ± 0.078
AlaLys: 3.615 ± 0.071
AlaLeu: 11.562 ± 0.16
AlaMet: 2.645 ± 0.05
AlaAsn: 2.383 ± 0.054
AlaPro: 3.933 ± 0.067
AlaGln: 3.567 ± 0.054
AlaArg: 6.026 ± 0.095
AlaSer: 4.397 ± 0.078
AlaThr: 3.49 ± 0.06
AlaVal: 8.62 ± 0.111
AlaTrp: 1.146 ± 0.031
AlaTyr: 2.841 ± 0.053
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.614 ± 0.042
CysCys: 0.317 ± 0.018
CysAsp: 0.877 ± 0.029
CysGlu: 0.758 ± 0.024
CysPhe: 0.627 ± 0.027
CysGly: 1.723 ± 0.042
CysHis: 0.297 ± 0.016
CysIle: 0.752 ± 0.026
CysLys: 0.511 ± 0.025
CysLeu: 1.454 ± 0.04
CysMet: 0.395 ± 0.018
CysAsn: 0.384 ± 0.02
CysPro: 0.867 ± 0.038
CysGln: 0.484 ± 0.021
CysArg: 1.185 ± 0.035
CysSer: 0.866 ± 0.031
CysThr: 0.81 ± 0.03
CysVal: 1.057 ± 0.036
CysTrp: 0.194 ± 0.014
CysTyr: 0.534 ± 0.023
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.406 ± 0.073
AspCys: 0.888 ± 0.029
AspAsp: 2.807 ± 0.084
AspGlu: 4.012 ± 0.082
AspPhe: 2.499 ± 0.053
AspGly: 5.676 ± 0.085
AspHis: 0.976 ± 0.033
AspIle: 3.11 ± 0.054
AspLys: 2.316 ± 0.062
AspLeu: 4.7 ± 0.072
AspMet: 1.48 ± 0.038
AspAsn: 1.657 ± 0.047
AspPro: 2.612 ± 0.047
AspGln: 1.632 ± 0.037
AspArg: 3.116 ± 0.053
AspSer: 2.741 ± 0.051
AspThr: 3.299 ± 0.061
AspVal: 3.92 ± 0.079
AspTrp: 0.805 ± 0.031
AspTyr: 2.435 ± 0.059
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.036 ± 0.097
GluCys: 0.689 ± 0.027
GluAsp: 3.886 ± 0.06
GluGlu: 5.957 ± 0.093
GluPhe: 2.023 ± 0.047
GluGly: 5.177 ± 0.074
GluHis: 1.546 ± 0.042
GluIle: 3.621 ± 0.075
GluLys: 3.855 ± 0.073
GluLeu: 7.946 ± 0.102
GluMet: 1.777 ± 0.042
GluAsn: 2.605 ± 0.051
GluPro: 2.62 ± 0.054
GluGln: 3.275 ± 0.066
GluArg: 4.658 ± 0.083
GluSer: 2.886 ± 0.052
GluThr: 3.364 ± 0.063
GluVal: 4.141 ± 0.073
GluTrp: 0.686 ± 0.025
GluTyr: 2.231 ± 0.054
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.427 ± 0.053
PheCys: 0.745 ± 0.026
PheAsp: 2.36 ± 0.047
PheGlu: 1.914 ± 0.041
PhePhe: 1.543 ± 0.041
PheGly: 2.917 ± 0.049
PheHis: 0.815 ± 0.027
PheIle: 1.651 ± 0.043
PheLys: 1.074 ± 0.032
PheLeu: 4.095 ± 0.066
PheMet: 0.701 ± 0.028
PheAsn: 1.097 ± 0.033
PhePro: 1.665 ± 0.037
PheGln: 1.511 ± 0.036
PheArg: 2.204 ± 0.051
PheSer: 2.502 ± 0.055
PheThr: 2.354 ± 0.053
PheVal: 2.418 ± 0.051
PheTrp: 0.467 ± 0.021
PheTyr: 1.37 ± 0.038
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.358 ± 0.109
GlyCys: 1.421 ± 0.04
GlyAsp: 4.259 ± 0.073
GlyGlu: 5.714 ± 0.082
GlyPhe: 2.976 ± 0.067
GlyGly: 7.074 ± 0.12
GlyHis: 1.429 ± 0.034
GlyIle: 4.592 ± 0.067
GlyLys: 3.994 ± 0.064
GlyLeu: 8.05 ± 0.096
GlyMet: 2.432 ± 0.044
GlyAsn: 2.303 ± 0.055
GlyPro: 2.242 ± 0.06
GlyGln: 2.907 ± 0.059
GlyArg: 5.257 ± 0.075
GlySer: 4.364 ± 0.068
GlyThr: 4.815 ± 0.088
GlyVal: 6.231 ± 0.084
GlyTrp: 1.061 ± 0.031
GlyTyr: 3.139 ± 0.056
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.439 ± 0.041
HisCys: 0.37 ± 0.017
HisAsp: 0.928 ± 0.031
HisGlu: 0.925 ± 0.028
HisPhe: 0.832 ± 0.029
HisGly: 1.616 ± 0.042
HisHis: 0.428 ± 0.021
HisIle: 1.242 ± 0.033
HisLys: 0.656 ± 0.025
HisLeu: 1.89 ± 0.04
HisMet: 0.461 ± 0.022
HisAsn: 0.586 ± 0.024
HisPro: 1.167 ± 0.034
HisGln: 0.602 ± 0.026
HisArg: 1.215 ± 0.032
HisSer: 0.916 ± 0.029
HisThr: 1.134 ± 0.035
HisVal: 1.123 ± 0.028
HisTrp: 0.265 ± 0.015
HisTyr: 0.712 ± 0.023
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.785 ± 0.077
IleCys: 0.9 ± 0.032
IleAsp: 2.99 ± 0.054
IleGlu: 2.761 ± 0.059
IlePhe: 1.894 ± 0.047
IleGly: 3.828 ± 0.073
IleHis: 1.082 ± 0.032
IleIle: 2.615 ± 0.06
IleLys: 1.783 ± 0.053
IleLeu: 5.42 ± 0.085
IleMet: 1.05 ± 0.034
IleAsn: 1.611 ± 0.041
IlePro: 2.776 ± 0.056
IleGln: 1.967 ± 0.05
IleArg: 3.269 ± 0.06
IleSer: 3.099 ± 0.058
IleThr: 3.231 ± 0.049
IleVal: 3.565 ± 0.064
IleTrp: 0.503 ± 0.021
IleTyr: 1.758 ± 0.043
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.119 ± 0.085
LysCys: 0.435 ± 0.021
LysAsp: 2.152 ± 0.053
LysGlu: 3.285 ± 0.074
LysPhe: 1.132 ± 0.031
LysGly: 3.023 ± 0.061
LysHis: 0.735 ± 0.028
LysIle: 2.186 ± 0.052
LysLys: 2.855 ± 0.07
LysLeu: 4.015 ± 0.069
LysMet: 1.163 ± 0.034
LysAsn: 1.618 ± 0.043
LysPro: 1.645 ± 0.039
LysGln: 1.499 ± 0.046
LysArg: 2.639 ± 0.051
LysSer: 2.059 ± 0.049
LysThr: 2.45 ± 0.058
LysVal: 2.696 ± 0.059
LysTrp: 0.42 ± 0.021
LysTyr: 1.527 ± 0.042
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.828 ± 0.139
LeuCys: 2.041 ± 0.041
LeuAsp: 6.133 ± 0.087
LeuGlu: 7.411 ± 0.097
LeuPhe: 3.897 ± 0.073
LeuGly: 7.557 ± 0.107
LeuHis: 1.88 ± 0.044
LeuIle: 4.705 ± 0.07
LeuLys: 3.973 ± 0.079
LeuLeu: 11.307 ± 0.181
LeuMet: 2.505 ± 0.046
LeuAsn: 3.105 ± 0.047
LeuPro: 5.278 ± 0.073
LeuGln: 2.862 ± 0.06
LeuArg: 6.705 ± 0.095
LeuSer: 6.917 ± 0.087
LeuThr: 6.605 ± 0.094
LeuVal: 6.372 ± 0.073
LeuTrp: 1.185 ± 0.035
LeuTyr: 3.304 ± 0.055
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.671 ± 0.053
MetCys: 0.285 ± 0.018
MetAsp: 1.711 ± 0.044
MetGlu: 2.203 ± 0.041
MetPhe: 0.718 ± 0.027
MetGly: 2.12 ± 0.052
MetHis: 0.351 ± 0.018
MetIle: 1.145 ± 0.037
MetLys: 1.569 ± 0.037
MetLeu: 2.475 ± 0.05
MetMet: 0.655 ± 0.023
MetAsn: 0.992 ± 0.03
MetPro: 1.091 ± 0.03
MetGln: 0.715 ± 0.024
MetArg: 1.401 ± 0.037
MetSer: 1.448 ± 0.039
MetThr: 1.499 ± 0.039
MetVal: 1.724 ± 0.044
MetTrp: 0.194 ± 0.013
MetTyr: 0.57 ± 0.02
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.706 ± 0.06
AsnCys: 0.452 ± 0.02
AsnAsp: 1.48 ± 0.048
AsnGlu: 1.642 ± 0.042
AsnPhe: 1.103 ± 0.034
AsnGly: 2.819 ± 0.071
AsnHis: 0.601 ± 0.027
AsnIle: 1.777 ± 0.041
AsnLys: 1.126 ± 0.037
AsnLeu: 3.16 ± 0.053
AsnMet: 0.787 ± 0.029
AsnAsn: 1.001 ± 0.036
AsnPro: 1.865 ± 0.044
AsnGln: 1.105 ± 0.029
AsnArg: 1.944 ± 0.041
AsnSer: 1.456 ± 0.04
AsnThr: 1.801 ± 0.05
AsnVal: 2.008 ± 0.058
AsnTrp: 0.417 ± 0.02
AsnTyr: 1.168 ± 0.035
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.176 ± 0.094
ProCys: 0.623 ± 0.025
ProAsp: 3.008 ± 0.047
ProGlu: 4.389 ± 0.066
ProPhe: 1.62 ± 0.038
ProGly: 3.69 ± 0.071
ProHis: 0.825 ± 0.025
ProIle: 1.954 ± 0.05
ProLys: 1.487 ± 0.047
ProLeu: 4.044 ± 0.065
ProMet: 1.081 ± 0.032
ProAsn: 1.184 ± 0.036
ProPro: 1.843 ± 0.054
ProGln: 1.459 ± 0.034
ProArg: 2.244 ± 0.049
ProSer: 2.136 ± 0.049
ProThr: 1.984 ± 0.043
ProVal: 3.644 ± 0.064
ProTrp: 0.545 ± 0.024
ProTyr: 1.44 ± 0.036
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.791 ± 0.055
GlnCys: 0.412 ± 0.021
GlnAsp: 1.492 ± 0.043
GlnGlu: 2.617 ± 0.049
GlnPhe: 1.238 ± 0.036
GlnGly: 2.509 ± 0.054
GlnHis: 0.571 ± 0.024
GlnIle: 1.874 ± 0.041
GlnLys: 1.826 ± 0.055
GlnLeu: 3.396 ± 0.061
GlnMet: 1.069 ± 0.032
GlnAsn: 1.216 ± 0.036
GlnPro: 1.332 ± 0.037
GlnGln: 1.327 ± 0.039
GlnArg: 2.231 ± 0.053
GlnSer: 1.772 ± 0.04
GlnThr: 1.761 ± 0.042
GlnVal: 2.572 ± 0.039
GlnTrp: 0.458 ± 0.019
GlnTyr: 1.328 ± 0.037
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.005 ± 0.093
ArgCys: 0.944 ± 0.034
ArgAsp: 3.327 ± 0.059
ArgGlu: 4.788 ± 0.083
ArgPhe: 2.41 ± 0.053
ArgGly: 4.066 ± 0.066
ArgHis: 1.194 ± 0.035
ArgIle: 3.21 ± 0.057
ArgLys: 2.817 ± 0.06
ArgLeu: 6.714 ± 0.093
ArgMet: 1.771 ± 0.04
ArgAsn: 1.78 ± 0.038
ArgPro: 2.826 ± 0.065
ArgGln: 2.639 ± 0.053
ArgArg: 5.256 ± 0.096
ArgSer: 2.833 ± 0.055
ArgThr: 3.111 ± 0.059
ArgVal: 3.949 ± 0.066
ArgTrp: 0.873 ± 0.029
ArgTyr: 2.226 ± 0.047
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.198 ± 0.067
SerCys: 0.769 ± 0.025
SerAsp: 2.828 ± 0.055
SerGlu: 2.837 ± 0.055
SerPhe: 2.206 ± 0.044
SerGly: 5.198 ± 0.07
SerHis: 1.062 ± 0.033
SerIle: 2.699 ± 0.051
SerLys: 1.865 ± 0.047
SerLeu: 5.362 ± 0.084
SerMet: 1.472 ± 0.037
SerAsn: 1.551 ± 0.042
SerPro: 2.447 ± 0.05
SerGln: 1.702 ± 0.038
SerArg: 3.366 ± 0.057
SerSer: 2.806 ± 0.06
SerThr: 2.77 ± 0.054
SerVal: 3.864 ± 0.057
SerTrp: 0.64 ± 0.024
SerTyr: 1.89 ± 0.048
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.974 ± 0.093
ThrCys: 0.727 ± 0.027
ThrAsp: 2.981 ± 0.061
ThrGlu: 3.202 ± 0.064
ThrPhe: 2.061 ± 0.048
ThrGly: 5.246 ± 0.089
ThrHis: 0.963 ± 0.031
ThrIle: 2.975 ± 0.053
ThrLys: 1.66 ± 0.043
ThrLeu: 5.672 ± 0.079
ThrMet: 1.382 ± 0.036
ThrAsn: 1.44 ± 0.041
ThrPro: 2.988 ± 0.063
ThrGln: 1.552 ± 0.037
ThrArg: 2.812 ± 0.049
ThrSer: 2.485 ± 0.061
ThrThr: 2.824 ± 0.063
ThrVal: 5.084 ± 0.101
ThrTrp: 0.564 ± 0.023
ThrTyr: 1.843 ± 0.046
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.884 ± 0.084
ValCys: 1.352 ± 0.037
ValAsp: 4.219 ± 0.076
ValGlu: 5.079 ± 0.073
ValPhe: 2.716 ± 0.053
ValGly: 4.946 ± 0.075
ValHis: 1.132 ± 0.03
ValIle: 3.793 ± 0.074
ValLys: 2.978 ± 0.059
ValLeu: 8.136 ± 0.107
ValMet: 1.681 ± 0.039
ValAsn: 2.308 ± 0.06
ValPro: 3.273 ± 0.057
ValGln: 2.047 ± 0.04
ValArg: 4.217 ± 0.065
ValSer: 4.468 ± 0.081
ValThr: 4.438 ± 0.097
ValVal: 5.391 ± 0.09
ValTrp: 0.888 ± 0.033
ValTyr: 2.395 ± 0.05
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.086 ± 0.03
TrpCys: 0.194 ± 0.014
TrpAsp: 0.693 ± 0.03
TrpGlu: 0.821 ± 0.028
TrpPhe: 0.505 ± 0.022
TrpGly: 0.901 ± 0.031
TrpHis: 0.229 ± 0.015
TrpIle: 0.494 ± 0.023
TrpLys: 0.547 ± 0.024
TrpLeu: 1.449 ± 0.045
TrpMet: 0.333 ± 0.015
TrpAsn: 0.419 ± 0.023
TrpPro: 0.416 ± 0.018
TrpGln: 0.509 ± 0.023
TrpArg: 0.768 ± 0.031
TrpSer: 0.626 ± 0.027
TrpThr: 0.539 ± 0.024
TrpVal: 0.686 ± 0.023
TrpTrp: 0.146 ± 0.01
TrpTyr: 0.489 ± 0.027
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.032 ± 0.054
TyrCys: 0.554 ± 0.023
TyrAsp: 2.241 ± 0.051
TyrGlu: 2.251 ± 0.054
TyrPhe: 1.388 ± 0.037
TyrGly: 2.724 ± 0.059
TyrHis: 0.756 ± 0.024
TyrIle: 1.787 ± 0.044
TyrLys: 1.147 ± 0.039
TyrLeu: 3.721 ± 0.06
TyrMet: 0.694 ± 0.026
TyrAsn: 1.176 ± 0.043
TyrPro: 1.489 ± 0.041
TyrGln: 1.43 ± 0.037
TyrArg: 2.227 ± 0.046
TyrSer: 1.755 ± 0.046
TyrThr: 2.158 ± 0.05
TyrVal: 2.277 ± 0.053
TyrTrp: 0.392 ± 0.022
TyrTyr: 1.482 ± 0.04
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3501 proteins (1082429 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
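
Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of the 100 replicates, and the spread of the replicates is taken as the error. This is a minimal illustration, not the database's implementation; the function names, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (frequency in per mille, bootstrap standard deviation) for one dipeptide.

    `dipeptide` is given in one-letter code, e.g. "AA" for AlaAla.
    """
    rng = random.Random(seed)
    # Pre-compute (hits, total pairs) per protein so each replicate is cheap.
    per_protein = [
        (pair_counts(seq)[dipeptide], max(len(seq) - 1, 0)) for seq in proteins
    ]

    def permille(sample):
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        return 1000.0 * hits / total if total else 0.0

    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        resample = rng.choices(per_protein, k=len(per_protein))
        replicates.append(permille(resample))

    # Assumption: the reported error is the standard deviation of the replicates.
    return permille(per_protein), statistics.stdev(replicates)
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein intact, which is presumably why the error is estimated at the protein level.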

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski