Amino acid dipeptide frequency for Eubacterium sp. CAG:86

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
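The counting described above can be sketched in Python. This is a minimal illustration, not the database's actual pipeline: it assumes dipeptides are counted as overlapping residue pairs within each protein, that any non-standard letter is mapped to Xaa (written `X`), and that the over/under colouring compares each value against the uniform expectation of 1/400. The names `dipeptide_permille` and `classify` and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping pairs; a pair never spans two different proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input, not real proteome data.
toy = ["MKKLAA", "AAKXL"]
freqs = dipeptide_permille(toy)
```

Under this threshold, a table value such as AlaAla (7.138‰) would be classified as overrepresented and AlaTrp (0.456‰) as underrepresented.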

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
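As a small worked example of the unit conversion, the snippet below turns a per mille value from the table into a fraction and a percentage. The helper names are hypothetical and the AlaAla value is used only as an illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value_permille / 10.0


ala_ala = 7.138  # AlaAla frequency in per mille, taken from the table
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
```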

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.138 ± 0.131
AlaCys: 1.099 ± 0.037
AlaAsp: 5.192 ± 0.091
AlaGlu: 4.318 ± 0.084
AlaPhe: 2.736 ± 0.069
AlaGly: 6.065 ± 0.114
AlaHis: 0.937 ± 0.037
AlaIle: 5.105 ± 0.103
AlaLys: 5.059 ± 0.099
AlaLeu: 6.028 ± 0.097
AlaMet: 2.17 ± 0.058
AlaAsn: 2.692 ± 0.07
AlaPro: 1.627 ± 0.05
AlaGln: 2.2 ± 0.065
AlaArg: 2.425 ± 0.065
AlaSer: 4.248 ± 0.083
AlaThr: 2.951 ± 0.091
AlaVal: 6.635 ± 0.11
AlaTrp: 0.456 ± 0.029
AlaTyr: 2.764 ± 0.064
AlaXaa: 0.001 ± 0.001
Cys
CysAla: 1.0 ± 0.043
CysCys: 0.245 ± 0.019
CysAsp: 0.973 ± 0.032
CysGlu: 0.923 ± 0.036
CysPhe: 0.564 ± 0.025
CysGly: 1.379 ± 0.053
CysHis: 0.287 ± 0.021
CysIle: 1.406 ± 0.053
CysLys: 0.893 ± 0.032
CysLeu: 1.009 ± 0.042
CysMet: 0.463 ± 0.029
CysAsn: 0.749 ± 0.03
CysPro: 0.539 ± 0.031
CysGln: 0.319 ± 0.02
CysArg: 0.62 ± 0.033
CysSer: 0.918 ± 0.039
CysThr: 0.759 ± 0.039
CysVal: 1.164 ± 0.04
CysTrp: 0.112 ± 0.014
CysTyr: 0.57 ± 0.029
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.426 ± 0.076
AspCys: 0.837 ± 0.04
AspAsp: 4.143 ± 0.076
AspGlu: 5.76 ± 0.09
AspPhe: 2.65 ± 0.068
AspGly: 4.51 ± 0.113
AspHis: 0.617 ± 0.03
AspIle: 6.005 ± 0.084
AspLys: 5.137 ± 0.099
AspLeu: 3.793 ± 0.071
AspMet: 2.268 ± 0.058
AspAsn: 3.712 ± 0.09
AspPro: 1.185 ± 0.045
AspGln: 0.731 ± 0.032
AspArg: 2.145 ± 0.068
AspSer: 3.92 ± 0.09
AspThr: 3.46 ± 0.068
AspVal: 4.464 ± 0.081
AspTrp: 0.441 ± 0.03
AspTyr: 3.264 ± 0.068
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.647 ± 0.086
GluCys: 0.953 ± 0.036
GluAsp: 3.873 ± 0.077
GluGlu: 5.354 ± 0.121
GluPhe: 2.583 ± 0.055
GluGly: 3.46 ± 0.079
GluHis: 1.267 ± 0.046
GluIle: 5.44 ± 0.107
GluLys: 6.715 ± 0.114
GluLeu: 6.288 ± 0.105
GluMet: 2.004 ± 0.063
GluAsn: 4.818 ± 0.091
GluPro: 1.584 ± 0.049
GluGln: 2.37 ± 0.066
GluArg: 2.659 ± 0.064
GluSer: 3.654 ± 0.082
GluThr: 3.317 ± 0.062
GluVal: 4.013 ± 0.091
GluTrp: 0.542 ± 0.026
GluTyr: 3.397 ± 0.078
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.079 ± 0.07
PheCys: 0.671 ± 0.03
PheAsp: 2.908 ± 0.068
PheGlu: 2.669 ± 0.062
PhePhe: 1.633 ± 0.059
PheGly: 2.996 ± 0.069
PheHis: 0.603 ± 0.029
PheIle: 3.475 ± 0.081
PheLys: 2.691 ± 0.061
PheLeu: 3.253 ± 0.073
PheMet: 1.29 ± 0.045
PheAsn: 2.102 ± 0.065
PhePro: 1.131 ± 0.044
PheGln: 0.774 ± 0.031
PheArg: 1.334 ± 0.047
PheSer: 2.774 ± 0.062
PheThr: 2.461 ± 0.062
PheVal: 3.041 ± 0.081
PheTrp: 0.283 ± 0.02
PheTyr: 1.757 ± 0.056
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.506 ± 0.094
GlyCys: 1.229 ± 0.046
GlyAsp: 3.654 ± 0.079
GlyGlu: 4.065 ± 0.081
GlyPhe: 3.145 ± 0.072
GlyGly: 4.417 ± 0.106
GlyHis: 1.22 ± 0.041
GlyIle: 6.574 ± 0.101
GlyLys: 5.273 ± 0.086
GlyLeu: 5.039 ± 0.091
GlyMet: 2.38 ± 0.061
GlyAsn: 3.661 ± 0.082
GlyPro: 1.15 ± 0.042
GlyGln: 1.763 ± 0.05
GlyArg: 2.795 ± 0.075
GlySer: 4.135 ± 0.084
GlyThr: 3.962 ± 0.091
GlyVal: 4.861 ± 0.079
GlyTrp: 0.553 ± 0.03
GlyTyr: 3.247 ± 0.072
GlyXaa: 0.001 ± 0.002
His
HisAla: 0.895 ± 0.033
HisCys: 0.269 ± 0.02
HisAsp: 0.976 ± 0.038
HisGlu: 1.031 ± 0.044
HisPhe: 0.743 ± 0.034
HisGly: 1.08 ± 0.04
HisHis: 0.298 ± 0.031
HisIle: 1.371 ± 0.045
HisLys: 0.981 ± 0.039
HisLeu: 1.183 ± 0.043
HisMet: 0.465 ± 0.025
HisAsn: 0.854 ± 0.038
HisPro: 0.64 ± 0.033
HisGln: 0.386 ± 0.023
HisArg: 0.641 ± 0.028
HisSer: 0.92 ± 0.037
HisThr: 0.749 ± 0.033
HisVal: 1.089 ± 0.043
HisTrp: 0.097 ± 0.011
HisTyr: 0.58 ± 0.029
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.177 ± 0.098
IleCys: 1.428 ± 0.052
IleAsp: 5.2 ± 0.089
IleGlu: 5.35 ± 0.097
IlePhe: 3.444 ± 0.081
IleGly: 5.214 ± 0.093
IleHis: 1.218 ± 0.043
IleIle: 7.253 ± 0.138
IleLys: 6.105 ± 0.099
IleLeu: 6.687 ± 0.125
IleMet: 2.482 ± 0.06
IleAsn: 4.492 ± 0.081
IlePro: 2.902 ± 0.058
IleGln: 1.992 ± 0.056
IleArg: 3.135 ± 0.074
IleSer: 5.839 ± 0.086
IleThr: 4.878 ± 0.098
IleVal: 5.591 ± 0.104
IleTrp: 0.526 ± 0.028
IleTyr: 3.395 ± 0.072
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.186 ± 0.09
LysCys: 0.949 ± 0.042
LysAsp: 4.857 ± 0.09
LysGlu: 6.422 ± 0.107
LysPhe: 2.39 ± 0.058
LysGly: 4.198 ± 0.081
LysHis: 1.115 ± 0.039
LysIle: 6.002 ± 0.101
LysLys: 7.389 ± 0.111
LysLeu: 5.82 ± 0.095
LysMet: 2.481 ± 0.057
LysAsn: 5.071 ± 0.093
LysPro: 2.089 ± 0.064
LysGln: 2.341 ± 0.064
LysArg: 2.925 ± 0.072
LysSer: 4.262 ± 0.085
LysThr: 4.128 ± 0.09
LysVal: 4.564 ± 0.095
LysTrp: 0.578 ± 0.029
LysTyr: 3.666 ± 0.06
LysXaa: 0.001 ± 0.001
Leu
LeuAla: 5.997 ± 0.082
LeuCys: 1.287 ± 0.039
LeuAsp: 5.036 ± 0.081
LeuGlu: 5.092 ± 0.087
LeuPhe: 3.387 ± 0.077
LeuGly: 5.168 ± 0.089
LeuHis: 1.283 ± 0.043
LeuIle: 6.327 ± 0.118
LeuLys: 6.282 ± 0.104
LeuLeu: 6.38 ± 0.112
LeuMet: 2.418 ± 0.058
LeuAsn: 4.261 ± 0.074
LeuPro: 2.72 ± 0.065
LeuGln: 1.858 ± 0.054
LeuArg: 2.937 ± 0.058
LeuSer: 5.977 ± 0.112
LeuThr: 4.409 ± 0.081
LeuVal: 5.165 ± 0.096
LeuTrp: 0.525 ± 0.025
LeuTyr: 3.09 ± 0.069
LeuXaa: 0.001 ± 0.001
Met
MetAla: 2.429 ± 0.06
MetCys: 0.421 ± 0.023
MetAsp: 1.745 ± 0.056
MetGlu: 2.041 ± 0.065
MetPhe: 1.197 ± 0.042
MetGly: 1.99 ± 0.056
MetHis: 0.475 ± 0.025
MetIle: 2.324 ± 0.064
MetLys: 2.533 ± 0.068
MetLeu: 2.704 ± 0.062
MetMet: 1.0 ± 0.043
MetAsn: 1.749 ± 0.045
MetPro: 1.175 ± 0.039
MetGln: 1.071 ± 0.039
MetArg: 1.204 ± 0.043
MetSer: 2.156 ± 0.053
MetThr: 1.815 ± 0.051
MetVal: 1.89 ± 0.05
MetTrp: 0.213 ± 0.018
MetTyr: 1.176 ± 0.04
MetXaa: 0.003 ± 0.002
Asn
AsnAla: 3.761 ± 0.069
AsnCys: 0.731 ± 0.037
AsnAsp: 3.138 ± 0.077
AsnGlu: 3.615 ± 0.075
AsnPhe: 1.845 ± 0.056
AsnGly: 3.948 ± 0.094
AsnHis: 0.84 ± 0.035
AsnIle: 5.098 ± 0.083
AsnLys: 3.785 ± 0.083
AsnLeu: 3.926 ± 0.077
AsnMet: 1.868 ± 0.047
AsnAsn: 3.411 ± 0.109
AsnPro: 2.198 ± 0.05
AsnGln: 1.346 ± 0.047
AsnArg: 1.935 ± 0.062
AsnSer: 3.316 ± 0.068
AsnThr: 2.914 ± 0.074
AsnVal: 4.014 ± 0.086
AsnTrp: 0.322 ± 0.023
AsnTyr: 2.195 ± 0.06
AsnXaa: 0.001 ± 0.001
Pro
ProAla: 1.985 ± 0.057
ProCys: 0.41 ± 0.022
ProAsp: 2.432 ± 0.069
ProGlu: 2.555 ± 0.06
ProPhe: 1.309 ± 0.05
ProGly: 1.987 ± 0.052
ProHis: 0.461 ± 0.028
ProIle: 1.684 ± 0.053
ProLys: 1.704 ± 0.055
ProLeu: 2.348 ± 0.051
ProMet: 0.741 ± 0.035
ProAsn: 1.011 ± 0.042
ProPro: 0.542 ± 0.028
ProGln: 0.973 ± 0.037
ProArg: 0.777 ± 0.037
ProSer: 1.68 ± 0.046
ProThr: 1.312 ± 0.048
ProVal: 3.003 ± 0.054
ProTrp: 0.231 ± 0.02
ProTyr: 1.278 ± 0.05
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.936 ± 0.066
GlnCys: 0.321 ± 0.02
GlnAsp: 1.341 ± 0.042
GlnGlu: 1.864 ± 0.062
GlnPhe: 1.106 ± 0.039
GlnGly: 1.465 ± 0.05
GlnHis: 0.41 ± 0.024
GlnIle: 2.509 ± 0.063
GlnLys: 2.218 ± 0.054
GlnLeu: 2.488 ± 0.063
GlnMet: 1.039 ± 0.042
GlnAsn: 1.476 ± 0.053
GlnPro: 0.773 ± 0.036
GlnGln: 0.98 ± 0.043
GlnArg: 1.091 ± 0.045
GlnSer: 1.556 ± 0.047
GlnThr: 1.49 ± 0.056
GlnVal: 1.591 ± 0.043
GlnTrp: 0.196 ± 0.018
GlnTyr: 1.231 ± 0.047
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.135 ± 0.054
ArgCys: 0.51 ± 0.033
ArgAsp: 1.955 ± 0.052
ArgGlu: 2.66 ± 0.075
ArgPhe: 1.743 ± 0.054
ArgGly: 2.096 ± 0.062
ArgHis: 0.644 ± 0.033
ArgIle: 3.345 ± 0.079
ArgLys: 3.129 ± 0.077
ArgLeu: 3.211 ± 0.074
ArgMet: 1.354 ± 0.049
ArgAsn: 1.983 ± 0.055
ArgPro: 1.067 ± 0.04
ArgGln: 1.437 ± 0.048
ArgArg: 1.661 ± 0.057
ArgSer: 1.796 ± 0.062
ArgThr: 1.942 ± 0.058
ArgVal: 2.436 ± 0.061
ArgTrp: 0.287 ± 0.019
ArgTyr: 1.707 ± 0.052
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.423 ± 0.078
SerCys: 0.833 ± 0.033
SerAsp: 4.583 ± 0.088
SerGlu: 4.203 ± 0.083
SerPhe: 2.88 ± 0.07
SerGly: 5.342 ± 0.108
SerHis: 0.952 ± 0.036
SerIle: 4.606 ± 0.077
SerLys: 4.077 ± 0.081
SerLeu: 5.066 ± 0.086
SerMet: 1.798 ± 0.052
SerAsn: 2.733 ± 0.071
SerPro: 1.49 ± 0.043
SerGln: 2.062 ± 0.055
SerArg: 2.478 ± 0.064
SerSer: 4.445 ± 0.107
SerThr: 3.09 ± 0.077
SerVal: 4.966 ± 0.097
SerTrp: 0.473 ± 0.028
SerTyr: 2.849 ± 0.065
SerXaa: 0.003 ± 0.002
Thr
ThrAla: 4.123 ± 0.089
ThrCys: 0.687 ± 0.032
ThrAsp: 3.839 ± 0.08
ThrGlu: 3.237 ± 0.083
ThrPhe: 2.31 ± 0.063
ThrGly: 4.641 ± 0.087
ThrHis: 0.851 ± 0.041
ThrIle: 4.179 ± 0.078
ThrLys: 3.45 ± 0.082
ThrLeu: 4.31 ± 0.074
ThrMet: 1.343 ± 0.046
ThrAsn: 2.268 ± 0.058
ThrPro: 1.932 ± 0.063
ThrGln: 1.652 ± 0.055
ThrArg: 1.717 ± 0.046
ThrSer: 3.355 ± 0.074
ThrThr: 2.902 ± 0.093
ThrVal: 4.451 ± 0.093
ThrTrp: 0.377 ± 0.024
ThrTyr: 2.282 ± 0.061
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.829 ± 0.101
ValCys: 1.263 ± 0.046
ValAsp: 4.137 ± 0.078
ValGlu: 4.249 ± 0.084
ValPhe: 3.007 ± 0.07
ValGly: 4.154 ± 0.087
ValHis: 0.994 ± 0.039
ValIle: 6.375 ± 0.12
ValLys: 5.33 ± 0.108
ValLeu: 6.208 ± 0.101
ValMet: 2.167 ± 0.052
ValAsn: 3.864 ± 0.075
ValPro: 2.258 ± 0.058
ValGln: 1.551 ± 0.053
ValArg: 2.593 ± 0.064
ValSer: 5.179 ± 0.095
ValThr: 4.542 ± 0.097
ValVal: 5.043 ± 0.097
ValTrp: 0.473 ± 0.028
ValTyr: 3.069 ± 0.066
ValXaa: 0.001 ± 0.001
Trp
TrpAla: 0.391 ± 0.024
TrpCys: 0.122 ± 0.015
TrpAsp: 0.448 ± 0.029
TrpGlu: 0.483 ± 0.028
TrpPhe: 0.346 ± 0.023
TrpGly: 0.511 ± 0.026
TrpHis: 0.151 ± 0.015
TrpIle: 0.56 ± 0.026
TrpLys: 0.535 ± 0.03
TrpLeu: 0.679 ± 0.034
TrpMet: 0.207 ± 0.016
TrpAsn: 0.482 ± 0.03
TrpPro: 0.127 ± 0.013
TrpGln: 0.249 ± 0.02
TrpArg: 0.258 ± 0.019
TrpSer: 0.372 ± 0.024
TrpThr: 0.291 ± 0.023
TrpVal: 0.427 ± 0.027
TrpTrp: 0.06 ± 0.011
TrpTyr: 0.365 ± 0.024
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.781 ± 0.064
TyrCys: 0.627 ± 0.031
TyrAsp: 3.038 ± 0.072
TyrGlu: 3.045 ± 0.067
TyrPhe: 1.904 ± 0.056
TyrGly: 3.01 ± 0.069
TyrHis: 0.624 ± 0.03
TyrIle: 3.671 ± 0.073
TyrLys: 3.146 ± 0.071
TyrLeu: 3.18 ± 0.072
TyrMet: 1.371 ± 0.043
TyrAsn: 2.818 ± 0.076
TyrPro: 1.295 ± 0.045
TyrGln: 1.019 ± 0.037
TyrArg: 1.767 ± 0.053
TyrSer: 2.851 ± 0.065
TyrThr: 2.534 ± 0.067
TyrVal: 2.88 ± 0.068
TyrTrp: 0.319 ± 0.021
TyrTyr: 2.06 ± 0.061
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.003 ± 0.002
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.003 ± 0.002
XaaLeu: 0.003 ± 0.002
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.003 ± 0.002
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.001 ± 0.001
XaaVal: 0.001 ± 0.001
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.018 ± 0.006
Statistics based on 2189 proteins (714217 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
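A protein-level bootstrap of this kind could look roughly like the sketch below. It rests on assumptions not stated on this page: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 resamples, and the reported ± value is the standard deviation across resamples. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_permille(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille frequency.

    Assumption: each replicate resamples whole proteins with replacement,
    so residues from one protein always stay together.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real proteome data.
toy = ["MKKLAA", "AAKAL", "MAAAK", "KLKLK"]
mean_aa, sd_aa = bootstrap_permille(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, so the spread reflects variation between proteins.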

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski