Amino acid dipeptide frequency for Ruminococcus sp. CAG:353

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
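
As an illustration of the idea, here is a minimal Python sketch that counts overlapping adjacent residue pairs and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the rule that folds every non-standard letter into `X` (Xaa) are assumptions made for this example; the database's own pipeline may differ.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Pairs overlap and never span two proteins. Any residue outside the
    20 standard letters is folded into 'X' (Xaa); this mapping is an
    assumption of the sketch.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # zip(s, s[1:]) yields every pair of neighbouring residues.
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (made-up sequences); 'U' is non-standard and becomes 'X'.
freqs = dipeptide_per_mille(["MAAK", "MKU"])
```

With the two toy sequences there are five pairs in total (MA, AA, AK, MK, KX), so each one accounts for a fifth of the counts.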

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and underrepresented ones in blue in the table.
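
The expected value and the over/under labelling can be sketched in a few lines of Python. The helper names are hypothetical, and comparing directly against the uniform expectation (ignoring the error margins) is an assumption about how the colouring is decided. The two observed values are taken from the table.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed, expected):
    """Label a frequency relative to the uniform expectation."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


expected = expected_per_mille()  # 20 x 20 = 400 dipeptides
observed = {"AlaAla": 10.454, "TrpTrp": 0.1}  # values from the table
labels = {pair: classify(value, expected) for pair, value in observed.items()}
```

Calling `expected_per_mille(21)` gives the corresponding expectation when Xaa is counted as a 21st residue (1000/441, roughly 2.27‰).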

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
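
For readers who prefer fractions or percentages, the unit conversion looks like this. This is a small sketch with hypothetical helper names, using the AlaAla value from the table as input.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala_fraction = per_mille_to_fraction(10.454)  # AlaAla from the table
uniform_percent = per_mille_to_percent(2.5)       # the uniform 1/400 expectation
```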

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.454 ± 0.188
AlaCys: 1.293 ± 0.05
AlaAsp: 6.238 ± 0.107
AlaGlu: 7.324 ± 0.129
AlaPhe: 3.098 ± 0.074
AlaGly: 5.828 ± 0.107
AlaHis: 1.197 ± 0.046
AlaIle: 4.589 ± 0.082
AlaLys: 5.051 ± 0.104
AlaLeu: 7.026 ± 0.104
AlaMet: 2.53 ± 0.064
AlaAsn: 2.663 ± 0.054
AlaPro: 2.587 ± 0.078
AlaGln: 2.457 ± 0.071
AlaArg: 2.811 ± 0.057
AlaSer: 4.34 ± 0.083
AlaThr: 2.721 ± 0.081
AlaVal: 8.141 ± 0.122
AlaTrp: 0.517 ± 0.031
AlaTyr: 2.948 ± 0.068
AlaXaa: 0.011 ± 0.004
Cys
CysAla: 1.555 ± 0.059
CysCys: 0.435 ± 0.026
CysAsp: 1.128 ± 0.037
CysGlu: 1.098 ± 0.038
CysPhe: 0.797 ± 0.033
CysGly: 2.096 ± 0.056
CysHis: 0.347 ± 0.023
CysIle: 1.24 ± 0.044
CysLys: 0.916 ± 0.037
CysLeu: 1.14 ± 0.043
CysMet: 0.428 ± 0.026
CysAsn: 0.608 ± 0.033
CysPro: 0.804 ± 0.042
CysGln: 0.284 ± 0.019
CysArg: 0.949 ± 0.04
CysSer: 1.282 ± 0.046
CysThr: 1.088 ± 0.047
CysVal: 1.164 ± 0.04
CysTrp: 0.13 ± 0.013
CysTyr: 0.654 ± 0.03
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.962 ± 0.086
AspCys: 1.042 ± 0.042
AspAsp: 4.008 ± 0.091
AspGlu: 4.995 ± 0.103
AspPhe: 3.01 ± 0.073
AspGly: 4.864 ± 0.092
AspHis: 0.833 ± 0.038
AspIle: 6.05 ± 0.095
AspLys: 4.516 ± 0.086
AspLeu: 4.012 ± 0.062
AspMet: 2.309 ± 0.057
AspAsn: 2.996 ± 0.068
AspPro: 2.084 ± 0.053
AspGln: 0.826 ± 0.034
AspArg: 2.64 ± 0.056
AspSer: 3.883 ± 0.095
AspThr: 3.497 ± 0.073
AspVal: 3.808 ± 0.073
AspTrp: 0.544 ± 0.029
AspTyr: 2.925 ± 0.074
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.469 ± 0.102
GluCys: 0.932 ± 0.036
GluAsp: 3.67 ± 0.076
GluGlu: 5.242 ± 0.099
GluPhe: 2.523 ± 0.062
GluGly: 3.894 ± 0.079
GluHis: 1.279 ± 0.039
GluIle: 5.25 ± 0.087
GluLys: 6.291 ± 0.11
GluLeu: 6.461 ± 0.112
GluMet: 2.319 ± 0.062
GluAsn: 4.393 ± 0.081
GluPro: 1.855 ± 0.05
GluGln: 2.24 ± 0.066
GluArg: 3.073 ± 0.061
GluSer: 3.371 ± 0.061
GluThr: 3.805 ± 0.086
GluVal: 3.483 ± 0.077
GluTrp: 0.51 ± 0.026
GluTyr: 3.231 ± 0.073
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.44 ± 0.077
PheCys: 0.842 ± 0.033
PheAsp: 2.998 ± 0.066
PheGlu: 2.818 ± 0.069
PhePhe: 1.881 ± 0.06
PheGly: 3.388 ± 0.072
PheHis: 0.59 ± 0.031
PheIle: 3.238 ± 0.079
PheLys: 2.142 ± 0.053
PheLeu: 3.065 ± 0.068
PheMet: 1.215 ± 0.041
PheAsn: 1.674 ± 0.048
PhePro: 1.37 ± 0.045
PheGln: 0.715 ± 0.029
PheArg: 1.919 ± 0.054
PheSer: 3.41 ± 0.077
PheThr: 2.663 ± 0.071
PheVal: 2.723 ± 0.067
PheTrp: 0.332 ± 0.025
PheTyr: 1.724 ± 0.053
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.364 ± 0.096
GlyCys: 1.572 ± 0.051
GlyAsp: 4.043 ± 0.083
GlyGlu: 4.545 ± 0.095
GlyPhe: 3.3 ± 0.074
GlyGly: 5.29 ± 0.112
GlyHis: 1.196 ± 0.042
GlyIle: 6.029 ± 0.102
GlyLys: 5.197 ± 0.083
GlyLeu: 5.423 ± 0.09
GlyMet: 2.364 ± 0.061
GlyAsn: 2.992 ± 0.076
GlyPro: 1.008 ± 0.037
GlyGln: 1.599 ± 0.049
GlyArg: 3.165 ± 0.079
GlySer: 4.55 ± 0.08
GlyThr: 4.261 ± 0.076
GlyVal: 4.846 ± 0.076
GlyTrp: 0.637 ± 0.035
GlyTyr: 3.302 ± 0.07
GlyXaa: 0.001 ± 0.001
His
HisAla: 0.964 ± 0.032
HisCys: 0.348 ± 0.023
HisAsp: 0.992 ± 0.038
HisGlu: 0.952 ± 0.037
HisPhe: 0.824 ± 0.034
HisGly: 1.275 ± 0.044
HisHis: 0.309 ± 0.024
HisIle: 1.318 ± 0.044
HisLys: 0.935 ± 0.035
HisLeu: 1.213 ± 0.041
HisMet: 0.443 ± 0.024
HisAsn: 0.722 ± 0.03
HisPro: 0.709 ± 0.033
HisGln: 0.392 ± 0.022
HisArg: 0.728 ± 0.034
HisSer: 1.072 ± 0.043
HisThr: 1.01 ± 0.041
HisVal: 0.8 ± 0.034
HisTrp: 0.146 ± 0.014
HisTyr: 0.709 ± 0.03
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.167 ± 0.101
IleCys: 1.744 ± 0.052
IleAsp: 4.596 ± 0.079
IleGlu: 4.365 ± 0.083
IlePhe: 3.21 ± 0.074
IleGly: 4.93 ± 0.089
IleHis: 1.104 ± 0.04
IleIle: 6.286 ± 0.115
IleLys: 4.556 ± 0.082
IleLeu: 5.626 ± 0.11
IleMet: 2.206 ± 0.056
IleAsn: 3.345 ± 0.074
IlePro: 3.298 ± 0.061
IleGln: 1.357 ± 0.044
IleArg: 3.271 ± 0.069
IleSer: 6.333 ± 0.103
IleThr: 4.992 ± 0.081
IleVal: 4.776 ± 0.083
IleTrp: 0.532 ± 0.026
IleTyr: 2.924 ± 0.063
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.674 ± 0.09
LysCys: 1.02 ± 0.041
LysAsp: 3.49 ± 0.057
LysGlu: 4.541 ± 0.08
LysPhe: 2.287 ± 0.056
LysGly: 4.226 ± 0.076
LysHis: 0.946 ± 0.039
LysIle: 4.879 ± 0.097
LysLys: 5.364 ± 0.086
LysLeu: 5.989 ± 0.083
LysMet: 2.259 ± 0.053
LysAsn: 3.475 ± 0.079
LysPro: 2.223 ± 0.062
LysGln: 1.824 ± 0.057
LysArg: 2.938 ± 0.067
LysSer: 3.965 ± 0.078
LysThr: 3.541 ± 0.073
LysVal: 3.706 ± 0.082
LysTrp: 0.576 ± 0.031
LysTyr: 3.052 ± 0.067
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.733 ± 0.112
LeuCys: 1.757 ± 0.054
LeuAsp: 4.932 ± 0.093
LeuGlu: 4.891 ± 0.088
LeuPhe: 3.434 ± 0.081
LeuGly: 5.357 ± 0.09
LeuHis: 1.261 ± 0.041
LeuIle: 6.28 ± 0.114
LeuLys: 5.303 ± 0.085
LeuLeu: 6.773 ± 0.139
LeuMet: 2.559 ± 0.062
LeuAsn: 3.526 ± 0.075
LeuPro: 3.288 ± 0.073
LeuGln: 1.567 ± 0.05
LeuArg: 3.631 ± 0.077
LeuSer: 6.428 ± 0.112
LeuThr: 5.284 ± 0.085
LeuVal: 4.407 ± 0.081
LeuTrp: 0.638 ± 0.034
LeuTyr: 3.063 ± 0.071
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.454 ± 0.063
MetCys: 0.492 ± 0.03
MetAsp: 1.639 ± 0.044
MetGlu: 1.799 ± 0.051
MetPhe: 1.167 ± 0.044
MetGly: 2.36 ± 0.067
MetHis: 0.438 ± 0.027
MetIle: 2.102 ± 0.062
MetLys: 2.392 ± 0.056
MetLeu: 2.927 ± 0.074
MetMet: 0.905 ± 0.038
MetAsn: 1.691 ± 0.049
MetPro: 1.3 ± 0.047
MetGln: 0.884 ± 0.031
MetArg: 1.325 ± 0.046
MetSer: 2.169 ± 0.059
MetThr: 1.894 ± 0.055
MetVal: 1.647 ± 0.052
MetTrp: 0.243 ± 0.021
MetTyr: 1.009 ± 0.041
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.649 ± 0.065
AsnCys: 0.715 ± 0.034
AsnAsp: 2.772 ± 0.077
AsnGlu: 2.97 ± 0.066
AsnPhe: 1.717 ± 0.055
AsnGly: 4.003 ± 0.1
AsnHis: 0.585 ± 0.033
AsnIle: 3.98 ± 0.077
AsnLys: 2.583 ± 0.06
AsnLeu: 2.923 ± 0.06
AsnMet: 1.356 ± 0.041
AsnAsn: 1.955 ± 0.066
AsnPro: 1.792 ± 0.052
AsnGln: 0.886 ± 0.04
AsnArg: 1.792 ± 0.051
AsnSer: 2.899 ± 0.08
AsnThr: 2.484 ± 0.06
AsnVal: 3.079 ± 0.068
AsnTrp: 0.329 ± 0.021
AsnTyr: 1.824 ± 0.05
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.14 ± 0.079
ProCys: 0.56 ± 0.029
ProAsp: 2.87 ± 0.063
ProGlu: 3.462 ± 0.078
ProPhe: 1.525 ± 0.045
ProGly: 1.406 ± 0.05
ProHis: 0.612 ± 0.033
ProIle: 1.922 ± 0.049
ProLys: 2.026 ± 0.058
ProLeu: 2.737 ± 0.061
ProMet: 1.026 ± 0.036
ProAsn: 1.292 ± 0.047
ProPro: 0.9 ± 0.035
ProGln: 1.146 ± 0.037
ProArg: 1.076 ± 0.039
ProSer: 2.067 ± 0.06
ProThr: 1.582 ± 0.047
ProVal: 3.076 ± 0.07
ProTrp: 0.273 ± 0.023
ProTyr: 1.417 ± 0.046
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.999 ± 0.06
GlnCys: 0.334 ± 0.022
GlnAsp: 1.019 ± 0.036
GlnGlu: 1.21 ± 0.043
GlnPhe: 0.877 ± 0.035
GlnGly: 1.468 ± 0.043
GlnHis: 0.385 ± 0.024
GlnIle: 1.883 ± 0.053
GlnLys: 1.82 ± 0.061
GlnLeu: 2.327 ± 0.052
GlnMet: 0.949 ± 0.039
GlnAsn: 1.33 ± 0.054
GlnPro: 0.769 ± 0.037
GlnGln: 0.859 ± 0.044
GlnArg: 1.147 ± 0.042
GlnSer: 1.564 ± 0.048
GlnThr: 1.384 ± 0.043
GlnVal: 1.391 ± 0.047
GlnTrp: 0.245 ± 0.022
GlnTyr: 1.008 ± 0.043
GlnXaa: 0.001 ± 0.001
Arg
ArgAla: 2.786 ± 0.065
ArgCys: 0.757 ± 0.037
ArgAsp: 2.261 ± 0.057
ArgGlu: 3.083 ± 0.078
ArgPhe: 1.908 ± 0.053
ArgGly: 2.226 ± 0.067
ArgHis: 0.79 ± 0.036
ArgIle: 3.518 ± 0.071
ArgLys: 3.1 ± 0.065
ArgLeu: 4.121 ± 0.091
ArgMet: 1.316 ± 0.048
ArgAsn: 1.829 ± 0.048
ArgPro: 1.25 ± 0.04
ArgGln: 1.387 ± 0.044
ArgArg: 2.275 ± 0.062
ArgSer: 2.794 ± 0.068
ArgThr: 2.371 ± 0.059
ArgVal: 2.365 ± 0.053
ArgTrp: 0.343 ± 0.022
ArgTyr: 1.808 ± 0.053
ArgXaa: 0.001 ± 0.001
Ser
SerAla: 6.199 ± 0.102
SerCys: 1.154 ± 0.038
SerAsp: 4.95 ± 0.098
SerGlu: 5.173 ± 0.106
SerPhe: 3.019 ± 0.063
SerGly: 5.766 ± 0.109
SerHis: 1.073 ± 0.034
SerIle: 3.941 ± 0.075
SerLys: 3.621 ± 0.073
SerLeu: 5.281 ± 0.088
SerMet: 1.803 ± 0.043
SerAsn: 2.273 ± 0.067
SerPro: 2.293 ± 0.055
SerGln: 1.677 ± 0.051
SerArg: 2.803 ± 0.063
SerSer: 4.464 ± 0.102
SerThr: 3.107 ± 0.072
SerVal: 5.105 ± 0.076
SerTrp: 0.52 ± 0.028
SerTyr: 2.598 ± 0.068
SerXaa: 0.001 ± 0.002
Thr
ThrAla: 6.063 ± 0.111
ThrCys: 0.769 ± 0.031
ThrAsp: 4.156 ± 0.075
ThrGlu: 4.228 ± 0.085
ThrPhe: 2.188 ± 0.065
ThrGly: 4.578 ± 0.101
ThrHis: 0.886 ± 0.033
ThrIle: 3.547 ± 0.075
ThrLys: 2.707 ± 0.06
ThrLeu: 4.332 ± 0.08
ThrMet: 1.331 ± 0.044
ThrAsn: 1.893 ± 0.063
ThrPro: 2.301 ± 0.059
ThrGln: 1.189 ± 0.042
ThrArg: 1.774 ± 0.048
ThrSer: 3.316 ± 0.078
ThrThr: 2.786 ± 0.1
ThrVal: 5.472 ± 0.096
ThrTrp: 0.393 ± 0.026
ThrTyr: 2.071 ± 0.06
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.164 ± 0.096
ValCys: 1.483 ± 0.05
ValAsp: 3.468 ± 0.074
ValGlu: 3.527 ± 0.086
ValPhe: 3.154 ± 0.058
ValGly: 4.031 ± 0.072
ValHis: 1.048 ± 0.039
ValIle: 5.87 ± 0.089
ValLys: 4.633 ± 0.097
ValLeu: 5.946 ± 0.094
ValMet: 2.231 ± 0.061
ValAsn: 3.077 ± 0.07
ValPro: 2.644 ± 0.053
ValGln: 1.526 ± 0.046
ValArg: 2.899 ± 0.066
ValSer: 5.303 ± 0.097
ValThr: 4.687 ± 0.1
ValVal: 4.026 ± 0.087
ValTrp: 0.576 ± 0.026
ValTyr: 2.707 ± 0.071
ValXaa: 0.001 ± 0.001
Trp
TrpAla: 0.52 ± 0.029
TrpCys: 0.19 ± 0.016
TrpAsp: 0.52 ± 0.03
TrpGlu: 0.488 ± 0.027
TrpPhe: 0.36 ± 0.025
TrpGly: 0.63 ± 0.032
TrpHis: 0.209 ± 0.018
TrpIle: 0.507 ± 0.031
TrpLys: 0.475 ± 0.026
TrpLeu: 0.786 ± 0.032
TrpMet: 0.192 ± 0.016
TrpAsn: 0.499 ± 0.029
TrpPro: 0.127 ± 0.016
TrpGln: 0.322 ± 0.018
TrpArg: 0.297 ± 0.021
TrpSer: 0.514 ± 0.03
TrpThr: 0.4 ± 0.029
TrpVal: 0.407 ± 0.024
TrpTrp: 0.1 ± 0.012
TrpTyr: 0.347 ± 0.023
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.066 ± 0.079
TyrCys: 0.702 ± 0.031
TyrAsp: 3.056 ± 0.072
TyrGlu: 2.58 ± 0.063
TyrPhe: 1.868 ± 0.056
TyrGly: 3.069 ± 0.071
TyrHis: 0.774 ± 0.035
TyrIle: 3.186 ± 0.066
TyrLys: 2.348 ± 0.068
TyrLeu: 3.002 ± 0.068
TyrMet: 1.091 ± 0.039
TyrAsn: 2.063 ± 0.054
TyrPro: 1.512 ± 0.049
TyrGln: 0.831 ± 0.036
TyrArg: 1.77 ± 0.046
TyrSer: 3.097 ± 0.073
TyrThr: 2.422 ± 0.071
TyrVal: 2.58 ± 0.065
TyrTrp: 0.307 ± 0.019
TyrTyr: 1.979 ± 0.074
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.001 ± 0.001
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.001 ± 0.001
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.001 ± 0.002
XaaLys: 0.0 ± 0.0
XaaLeu: 0.003 ± 0.002
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.001
XaaGln: 0.001 ± 0.001
XaaArg: 0.004 ± 0.003
XaaSer: 0.0 ± 0.0
XaaThr: 0.001 ± 0.001
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.003 ± 0.002
XaaXaa: 0.028 ± 0.01
Statistics based on 2331 proteins (717514 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
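
Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resampled proteome, and the spread of those estimates is reported. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are all assumptions of this sketch; the note above does not say which spread statistic the database uses.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (e.g. 'AA') in per mille, overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            hits += (first + second == pair)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over resampled proteomes."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MAAK", "MKAAL", "MGG", "MAAAV"]  # made-up sequences, not real data
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.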

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski