Amino acid dipeptide frequency for Ruminococcaceae bacterium CPB6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
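A minimal sketch of how such a frequency could be computed is shown here. It assumes one-letter sequences, counts overlapping adjacent pairs within each protein (never across protein boundaries), and reports values in per mille; the function name and the toy sequences are hypothetical, and the exact procedure used for this page is not documented here.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs, counted inside each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" (one-letter codes), not data from this page.
toy_proteome = ["MAAL", "MKLA"]
freqs = dipeptide_per_mille(toy_proteome)
```

The pairs are ordered, so "AL" and "LA" are counted separately, which matches the table where, for example, AlaCys and CysAla have different values.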

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
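The expectation can be checked with a short calculation. The sketch below assumes the simplest possible rule, a direct comparison against the uniform expectation of 1/400; the actual colouring rule used for the table is not stated, and the `classify` helper is hypothetical. The example values are taken from the table.

```python
N_STANDARD = 20

# Uniform expectation: 1 / (20 * 20) = 0.25% = 2.5 per mille.
expected_per_mille = 1000.0 / N_STANDARD ** 2


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values in per mille, copied from the table.
examples = {"AlaAla": 11.813, "LeuLeu": 9.994, "TrpTrp": 0.095, "CysCys": 0.457}
labels = {name: classify(value) for name, value in examples.items()}
```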

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
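As a small worked example of the unit conversion, using the AlaAla value from the table (the helper names are hypothetical):

```python
def per_mille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0


ala_ala = 11.813  # per mille, from the table
ala_ala_fraction = per_mille_to_fraction(ala_ala)
ala_ala_percent = per_mille_to_percent(ala_ala)
```

So an AlaAla value of 11.813‰ corresponds to roughly 1.18% of all dipeptides.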

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.813 ± 0.21
AlaCys: 1.602 ± 0.055
AlaAsp: 5.349 ± 0.102
AlaGlu: 5.531 ± 0.103
AlaPhe: 3.538 ± 0.078
AlaGly: 7.476 ± 0.127
AlaHis: 1.622 ± 0.057
AlaIle: 4.38 ± 0.1
AlaLys: 4.914 ± 0.098
AlaLeu: 8.644 ± 0.141
AlaMet: 2.533 ± 0.073
AlaAsn: 2.671 ± 0.073
AlaPro: 3.049 ± 0.083
AlaGln: 3.606 ± 0.089
AlaArg: 3.909 ± 0.094
AlaSer: 4.871 ± 0.126
AlaThr: 3.542 ± 0.078
AlaVal: 8.173 ± 0.152
AlaTrp: 0.621 ± 0.034
AlaTyr: 2.64 ± 0.07
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.683 ± 0.059
CysCys: 0.457 ± 0.031
CysAsp: 0.991 ± 0.039
CysGlu: 0.85 ± 0.04
CysPhe: 0.741 ± 0.034
CysGly: 2.018 ± 0.066
CysHis: 0.418 ± 0.028
CysIle: 1.034 ± 0.046
CysLys: 0.801 ± 0.04
CysLeu: 1.389 ± 0.047
CysMet: 0.5 ± 0.032
CysAsn: 0.527 ± 0.029
CysPro: 0.922 ± 0.048
CysGln: 0.57 ± 0.033
CysArg: 1.194 ± 0.045
CysSer: 1.219 ± 0.054
CysThr: 1.102 ± 0.044
CysVal: 1.2 ± 0.049
CysTrp: 0.15 ± 0.017
CysTyr: 0.5 ± 0.027
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.625 ± 0.086
AspCys: 0.956 ± 0.04
AspAsp: 2.695 ± 0.074
AspGlu: 3.638 ± 0.087
AspPhe: 2.372 ± 0.066
AspGly: 4.37 ± 0.098
AspHis: 1.098 ± 0.048
AspIle: 3.78 ± 0.084
AspLys: 2.994 ± 0.081
AspLeu: 4.716 ± 0.101
AspMet: 1.741 ± 0.054
AspAsn: 1.845 ± 0.069
AspPro: 2.047 ± 0.068
AspGln: 1.348 ± 0.044
AspArg: 2.763 ± 0.078
AspSer: 3.312 ± 0.076
AspThr: 3.4 ± 0.09
AspVal: 3.548 ± 0.078
AspTrp: 0.568 ± 0.037
AspTyr: 2.118 ± 0.066
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.881 ± 0.089
GluCys: 0.714 ± 0.036
GluAsp: 3.113 ± 0.086
GluGlu: 4.521 ± 0.11
GluPhe: 1.673 ± 0.055
GluGly: 3.44 ± 0.081
GluHis: 1.236 ± 0.05
GluIle: 3.718 ± 0.087
GluLys: 5.107 ± 0.102
GluLeu: 5.492 ± 0.109
GluMet: 1.911 ± 0.059
GluAsn: 3.168 ± 0.073
GluPro: 2.096 ± 0.061
GluGln: 2.872 ± 0.071
GluArg: 3.011 ± 0.078
GluSer: 2.97 ± 0.07
GluThr: 3.169 ± 0.07
GluVal: 3.577 ± 0.082
GluTrp: 0.462 ± 0.033
GluTyr: 1.727 ± 0.058
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.307 ± 0.085
PheCys: 0.888 ± 0.043
PheAsp: 2.139 ± 0.06
PheGlu: 1.79 ± 0.056
PhePhe: 1.709 ± 0.066
PheGly: 3.159 ± 0.078
PheHis: 0.976 ± 0.044
PheIle: 2.164 ± 0.065
PheLys: 1.389 ± 0.054
PheLeu: 3.945 ± 0.089
PheMet: 0.956 ± 0.047
PheAsn: 1.234 ± 0.052
PhePro: 1.629 ± 0.054
PheGln: 1.413 ± 0.052
PheArg: 1.807 ± 0.057
PheSer: 3.198 ± 0.075
PheThr: 2.484 ± 0.071
PheVal: 2.533 ± 0.072
PheTrp: 0.372 ± 0.028
PheTyr: 1.455 ± 0.052
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.932 ± 0.124
GlyCys: 1.615 ± 0.051
GlyAsp: 3.494 ± 0.08
GlyGlu: 3.846 ± 0.081
GlyPhe: 2.953 ± 0.073
GlyGly: 5.75 ± 0.118
GlyHis: 1.554 ± 0.05
GlyIle: 5.516 ± 0.123
GlyLys: 5.271 ± 0.087
GlyLeu: 6.471 ± 0.125
GlyMet: 2.408 ± 0.064
GlyAsn: 2.612 ± 0.076
GlyPro: 1.884 ± 0.065
GlyGln: 2.51 ± 0.084
GlyArg: 4.019 ± 0.098
GlySer: 4.773 ± 0.098
GlyThr: 4.941 ± 0.104
GlyVal: 5.347 ± 0.102
GlyTrp: 0.78 ± 0.034
GlyTyr: 2.712 ± 0.073
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.605 ± 0.061
HisCys: 0.493 ± 0.032
HisAsp: 1.052 ± 0.04
HisGlu: 0.913 ± 0.039
HisPhe: 0.996 ± 0.044
HisGly: 1.687 ± 0.052
HisHis: 0.527 ± 0.035
HisIle: 1.42 ± 0.046
HisLys: 1.046 ± 0.044
HisLeu: 2.008 ± 0.065
HisMet: 0.536 ± 0.031
HisAsn: 0.765 ± 0.038
HisPro: 1.175 ± 0.045
HisGln: 0.639 ± 0.041
HisArg: 1.188 ± 0.047
HisSer: 1.255 ± 0.048
HisThr: 1.46 ± 0.052
HisVal: 1.409 ± 0.049
HisTrp: 0.255 ± 0.02
HisTyr: 0.692 ± 0.033
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.325 ± 0.099
IleCys: 1.221 ± 0.044
IleAsp: 3.436 ± 0.09
IleGlu: 2.884 ± 0.087
IlePhe: 2.282 ± 0.073
IleGly: 4.302 ± 0.111
IleHis: 1.294 ± 0.051
IleIle: 3.501 ± 0.093
IleLys: 2.654 ± 0.087
IleLeu: 5.918 ± 0.124
IleMet: 1.423 ± 0.05
IleAsn: 2.078 ± 0.06
IlePro: 3.015 ± 0.077
IleGln: 2.164 ± 0.062
IleArg: 3.166 ± 0.074
IleSer: 4.329 ± 0.095
IleThr: 3.632 ± 0.078
IleVal: 4.055 ± 0.097
IleTrp: 0.517 ± 0.033
IleTyr: 1.96 ± 0.065
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.306 ± 0.108
LysCys: 0.687 ± 0.035
LysAsp: 3.332 ± 0.079
LysGlu: 4.4 ± 0.095
LysPhe: 1.389 ± 0.045
LysGly: 3.616 ± 0.09
LysHis: 1.078 ± 0.043
LysIle: 3.693 ± 0.092
LysLys: 4.803 ± 0.095
LysLeu: 4.825 ± 0.105
LysMet: 1.998 ± 0.056
LysAsn: 2.787 ± 0.077
LysPro: 2.294 ± 0.063
LysGln: 2.367 ± 0.073
LysArg: 2.766 ± 0.067
LysSer: 3.149 ± 0.086
LysThr: 3.705 ± 0.083
LysVal: 3.912 ± 0.082
LysTrp: 0.512 ± 0.031
LysTyr: 1.972 ± 0.058
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.964 ± 0.116
LeuCys: 2.149 ± 0.072
LeuAsp: 4.851 ± 0.089
LeuGlu: 5.019 ± 0.106
LeuPhe: 4.006 ± 0.11
LeuGly: 6.284 ± 0.096
LeuHis: 2.34 ± 0.058
LeuIle: 4.808 ± 0.1
LeuLys: 5.031 ± 0.1
LeuLeu: 9.994 ± 0.19
LeuMet: 2.498 ± 0.073
LeuAsn: 3.472 ± 0.073
LeuPro: 4.854 ± 0.095
LeuGln: 4.468 ± 0.089
LeuArg: 5.148 ± 0.099
LeuSer: 6.497 ± 0.137
LeuThr: 5.765 ± 0.094
LeuVal: 5.713 ± 0.097
LeuTrp: 0.838 ± 0.039
LeuTyr: 3.106 ± 0.083
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.586 ± 0.073
MetCys: 0.36 ± 0.025
MetAsp: 1.687 ± 0.058
MetGlu: 1.843 ± 0.054
MetPhe: 0.786 ± 0.032
MetGly: 1.993 ± 0.061
MetHis: 0.568 ± 0.035
MetIle: 1.447 ± 0.053
MetLys: 2.071 ± 0.059
MetLeu: 2.844 ± 0.089
MetMet: 0.743 ± 0.039
MetAsn: 1.296 ± 0.047
MetPro: 1.326 ± 0.046
MetGln: 1.401 ± 0.05
MetArg: 1.472 ± 0.044
MetSer: 1.517 ± 0.055
MetThr: 1.784 ± 0.066
MetVal: 1.858 ± 0.061
MetTrp: 0.163 ± 0.017
MetTyr: 0.741 ± 0.034
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.417 ± 0.083
AsnCys: 0.675 ± 0.036
AsnAsp: 1.872 ± 0.053
AsnGlu: 1.942 ± 0.06
AsnPhe: 1.358 ± 0.05
AsnGly: 3.526 ± 0.082
AsnHis: 0.772 ± 0.039
AsnIle: 2.491 ± 0.074
AsnLys: 1.823 ± 0.07
AsnLeu: 3.429 ± 0.083
AsnMet: 1.068 ± 0.043
AsnAsn: 1.301 ± 0.046
AsnPro: 2.005 ± 0.057
AsnGln: 1.212 ± 0.054
AsnArg: 1.914 ± 0.054
AsnSer: 2.156 ± 0.073
AsnThr: 2.132 ± 0.055
AsnVal: 2.63 ± 0.066
AsnTrp: 0.388 ± 0.028
AsnTyr: 1.35 ± 0.055
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.878 ± 0.089
ProCys: 0.678 ± 0.038
ProAsp: 2.846 ± 0.068
ProGlu: 3.351 ± 0.078
ProPhe: 1.704 ± 0.061
ProGly: 2.912 ± 0.074
ProHis: 0.865 ± 0.037
ProIle: 2.127 ± 0.071
ProLys: 2.272 ± 0.069
ProLeu: 3.678 ± 0.095
ProMet: 1.069 ± 0.043
ProAsn: 1.547 ± 0.058
ProPro: 1.457 ± 0.054
ProGln: 1.986 ± 0.063
ProArg: 1.648 ± 0.054
ProSer: 2.173 ± 0.058
ProThr: 2.059 ± 0.062
ProVal: 3.489 ± 0.084
ProTrp: 0.384 ± 0.026
ProTyr: 1.488 ± 0.051
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.463 ± 0.084
GlnCys: 0.471 ± 0.033
GlnAsp: 1.826 ± 0.053
GlnGlu: 2.476 ± 0.056
GlnPhe: 1.33 ± 0.045
GlnGly: 2.156 ± 0.069
GlnHis: 0.818 ± 0.038
GlnIle: 2.391 ± 0.055
GlnLys: 3.48 ± 0.095
GlnLeu: 3.775 ± 0.094
GlnMet: 1.44 ± 0.05
GlnAsn: 1.882 ± 0.061
GlnPro: 1.644 ± 0.051
GlnGln: 2.345 ± 0.082
GlnArg: 2.039 ± 0.059
GlnSer: 1.923 ± 0.058
GlnThr: 2.34 ± 0.08
GlnVal: 2.552 ± 0.073
GlnTrp: 0.357 ± 0.025
GlnTyr: 1.433 ± 0.055
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.909 ± 0.092
ArgCys: 0.933 ± 0.039
ArgAsp: 2.423 ± 0.083
ArgGlu: 3.351 ± 0.087
ArgPhe: 2.176 ± 0.068
ArgGly: 3.152 ± 0.077
ArgHis: 1.078 ± 0.042
ArgIle: 3.331 ± 0.085
ArgLys: 3.239 ± 0.077
ArgLeu: 4.985 ± 0.103
ArgMet: 1.6 ± 0.05
ArgAsn: 1.829 ± 0.067
ArgPro: 2.027 ± 0.065
ArgGln: 2.3 ± 0.069
ArgArg: 3.572 ± 0.108
ArgSer: 2.88 ± 0.078
ArgThr: 2.916 ± 0.075
ArgVal: 3.37 ± 0.076
ArgTrp: 0.568 ± 0.032
ArgTyr: 1.855 ± 0.058
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.963 ± 0.124
SerCys: 1.115 ± 0.042
SerAsp: 3.159 ± 0.09
SerGlu: 3.118 ± 0.076
SerPhe: 2.676 ± 0.06
SerGly: 5.711 ± 0.105
SerHis: 1.256 ± 0.049
SerIle: 3.535 ± 0.078
SerLys: 3.013 ± 0.081
SerLeu: 5.611 ± 0.114
SerMet: 1.695 ± 0.05
SerAsn: 2.01 ± 0.069
SerPro: 2.391 ± 0.062
SerGln: 2.079 ± 0.064
SerArg: 3.278 ± 0.083
SerSer: 4.674 ± 0.164
SerThr: 3.409 ± 0.101
SerVal: 4.56 ± 0.098
SerTrp: 0.609 ± 0.037
SerTyr: 2.052 ± 0.068
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.289 ± 0.129
ThrCys: 0.916 ± 0.042
ThrAsp: 3.433 ± 0.084
ThrGlu: 3.297 ± 0.071
ThrPhe: 2.285 ± 0.067
ThrGly: 5.012 ± 0.102
ThrHis: 1.148 ± 0.045
ThrIle: 3.31 ± 0.092
ThrLys: 2.697 ± 0.067
ThrLeu: 5.488 ± 0.097
ThrMet: 1.411 ± 0.044
ThrAsn: 2.047 ± 0.057
ThrPro: 2.69 ± 0.079
ThrGln: 1.943 ± 0.066
ThrArg: 2.263 ± 0.068
ThrSer: 3.21 ± 0.084
ThrThr: 2.926 ± 0.086
ThrVal: 4.994 ± 0.117
ThrTrp: 0.481 ± 0.033
ThrTyr: 1.838 ± 0.064
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.104 ± 0.099
ValCys: 1.488 ± 0.05
ValAsp: 3.557 ± 0.088
ValGlu: 3.701 ± 0.085
ValPhe: 2.771 ± 0.073
ValGly: 4.572 ± 0.116
ValHis: 1.433 ± 0.054
ValIle: 4.089 ± 0.085
ValLys: 3.742 ± 0.084
ValLeu: 7.704 ± 0.127
ValMet: 1.841 ± 0.057
ValAsn: 2.562 ± 0.061
ValPro: 3.457 ± 0.087
ValGln: 3.033 ± 0.085
ValArg: 3.771 ± 0.082
ValSer: 5.201 ± 0.109
ValThr: 4.393 ± 0.088
ValVal: 4.718 ± 0.105
ValTrp: 0.729 ± 0.033
ValTyr: 2.316 ± 0.072
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.675 ± 0.038
TrpCys: 0.197 ± 0.02
TrpAsp: 0.58 ± 0.035
TrpGlu: 0.571 ± 0.031
TrpPhe: 0.394 ± 0.029
TrpGly: 0.612 ± 0.032
TrpHis: 0.211 ± 0.02
TrpIle: 0.544 ± 0.031
TrpLys: 0.602 ± 0.032
TrpLeu: 0.85 ± 0.038
TrpMet: 0.315 ± 0.024
TrpAsn: 0.505 ± 0.037
TrpPro: 0.252 ± 0.022
TrpGln: 0.496 ± 0.031
TrpArg: 0.445 ± 0.031
TrpSer: 0.527 ± 0.034
TrpThr: 0.418 ± 0.029
TrpVal: 0.547 ± 0.03
TrpTrp: 0.095 ± 0.014
TrpTyr: 0.364 ± 0.025
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.756 ± 0.075
TyrCys: 0.649 ± 0.031
TyrAsp: 2.1 ± 0.061
TyrGlu: 1.818 ± 0.056
TyrPhe: 1.408 ± 0.052
TyrGly: 2.639 ± 0.063
TyrHis: 0.83 ± 0.04
TyrIle: 1.877 ± 0.064
TyrLys: 1.644 ± 0.059
TyrLeu: 3.038 ± 0.081
TyrMet: 0.799 ± 0.039
TyrAsn: 1.324 ± 0.051
TyrPro: 1.401 ± 0.047
TyrGln: 1.386 ± 0.047
TyrArg: 2.122 ± 0.072
TyrSer: 2.081 ± 0.064
TyrThr: 2.101 ± 0.069
TyrVal: 1.996 ± 0.058
TyrTrp: 0.352 ± 0.029
TyrTyr: 1.358 ± 0.06
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1921 proteins (588157 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
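One way such a protein-level bootstrap could look is sketched below. It assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resampled frequencies; neither detail is confirmed by this page, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not data from this page.
mean_aa, err_aa = bootstrap_error(["MAAL", "MKLA", "AAAA"], "AA")
```

Resampling at the protein level keeps each sequence intact, so the estimate reflects variation between proteins rather than between individual positions.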

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski