Amino acid dipeptide frequency for Megasphaera sp. AM44-1BH

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred equally often in proteins, each would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
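The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function name `dipeptide_frequencies`, the one-letter alphabet, the mapping of every non-standard letter to `X` (Xaa), and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; any other letter is treated as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total) if total else 0.0
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy input, not taken from the proteome.
toy_proteins = ["MKAAL", "AAGX"]
freqs = dipeptide_frequencies(toy_proteins)

# Uniform expectation over the 400 standard dipeptides, in per mille.
uniform_expectation = 1000.0 / 400
print(round(freqs["AA"], 3), uniform_expectation)
```

The frequencies are normalised by the total number of adjacent pairs, so the 441 values always sum to 1000‰.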

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
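As a small worked example of the unit conversion, the sketch below turns two values from the table (AlaAla and CysCys) into fractions and compares them with the uniform expectation of 2.5‰. The helper names are hypothetical, and using the uniform expectation as the threshold for the red and blue marking is an assumption based on the description above, not a confirmed detail of the page.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 0.25% expressed in per mille


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a dipeptide relative to the assumed expected frequency."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


ala_ala = 10.06   # AlaAla, in per mille, from the table
cys_cys = 0.257   # CysCys, in per mille, from the table

print(per_mille_to_fraction(ala_ala), classify(ala_ala))
print(per_mille_to_fraction(cys_cys), classify(cys_cys))
```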

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.06 ± 0.168
AlaCys: 1.384 ± 0.045
AlaAsp: 5.396 ± 0.078
AlaGlu: 5.013 ± 0.096
AlaPhe: 3.4 ± 0.074
AlaGly: 7.573 ± 0.118
AlaHis: 1.756 ± 0.05
AlaIle: 5.846 ± 0.092
AlaLys: 4.119 ± 0.085
AlaLeu: 8.095 ± 0.1
AlaMet: 2.845 ± 0.06
AlaAsn: 2.64 ± 0.074
AlaPro: 2.727 ± 0.066
AlaGln: 2.901 ± 0.067
AlaArg: 4.321 ± 0.084
AlaSer: 4.808 ± 0.09
AlaThr: 3.721 ± 0.091
AlaVal: 7.563 ± 0.113
AlaTrp: 0.846 ± 0.034
AlaTyr: 3.017 ± 0.069
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.002 ± 0.041
CysCys: 0.257 ± 0.019
CysAsp: 0.782 ± 0.035
CysGlu: 0.624 ± 0.031
CysPhe: 0.513 ± 0.025
CysGly: 1.478 ± 0.051
CysHis: 0.499 ± 0.024
CysIle: 1.041 ± 0.033
CysLys: 0.572 ± 0.026
CysLeu: 1.309 ± 0.044
CysMet: 0.426 ± 0.025
CysAsn: 0.462 ± 0.022
CysPro: 0.74 ± 0.035
CysGln: 0.628 ± 0.031
CysArg: 0.885 ± 0.033
CysSer: 0.808 ± 0.03
CysThr: 0.772 ± 0.028
CysVal: 0.881 ± 0.038
CysTrp: 0.159 ± 0.013
CysTyr: 0.467 ± 0.024
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.128 ± 0.078
AspCys: 0.809 ± 0.027
AspAsp: 3.355 ± 0.06
AspGlu: 3.938 ± 0.072
AspPhe: 2.498 ± 0.047
AspGly: 4.593 ± 0.097
AspHis: 1.27 ± 0.039
AspIle: 4.581 ± 0.082
AspLys: 3.361 ± 0.074
AspLeu: 4.718 ± 0.08
AspMet: 2.012 ± 0.054
AspAsn: 1.994 ± 0.062
AspPro: 2.105 ± 0.049
AspGln: 1.568 ± 0.056
AspArg: 2.631 ± 0.061
AspSer: 2.801 ± 0.062
AspThr: 3.627 ± 0.07
AspVal: 4.443 ± 0.073
AspTrp: 0.709 ± 0.029
AspTyr: 2.479 ± 0.056
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.265 ± 0.1
GluCys: 0.615 ± 0.029
GluAsp: 3.454 ± 0.072
GluGlu: 5.009 ± 0.102
GluPhe: 1.937 ± 0.046
GluGly: 3.844 ± 0.069
GluHis: 1.418 ± 0.042
GluIle: 3.994 ± 0.084
GluLys: 4.944 ± 0.081
GluLeu: 5.646 ± 0.086
GluMet: 1.844 ± 0.051
GluAsn: 2.604 ± 0.054
GluPro: 1.853 ± 0.047
GluGln: 2.699 ± 0.065
GluArg: 3.112 ± 0.077
GluSer: 2.65 ± 0.056
GluThr: 3.354 ± 0.063
GluVal: 3.435 ± 0.068
GluTrp: 0.559 ± 0.024
GluTyr: 1.996 ± 0.05
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.172 ± 0.071
PheCys: 0.676 ± 0.029
PheAsp: 2.318 ± 0.055
PheGlu: 1.673 ± 0.047
PhePhe: 1.828 ± 0.05
PheGly: 2.967 ± 0.06
PheHis: 1.023 ± 0.036
PheIle: 2.821 ± 0.058
PheLys: 1.591 ± 0.047
PheLeu: 3.846 ± 0.082
PheMet: 1.151 ± 0.037
PheAsn: 1.318 ± 0.043
PhePro: 1.569 ± 0.046
PheGln: 1.222 ± 0.039
PheArg: 1.749 ± 0.045
PheSer: 2.632 ± 0.063
PheThr: 2.425 ± 0.053
PheVal: 2.691 ± 0.07
PheTrp: 0.476 ± 0.024
PheTyr: 1.429 ± 0.044
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.213 ± 0.11
GlyCys: 1.309 ± 0.051
GlyAsp: 3.994 ± 0.095
GlyGlu: 3.674 ± 0.071
GlyPhe: 3.022 ± 0.069
GlyGly: 5.67 ± 0.118
GlyHis: 1.86 ± 0.045
GlyIle: 6.062 ± 0.095
GlyLys: 4.925 ± 0.079
GlyLeu: 6.803 ± 0.102
GlyMet: 2.563 ± 0.064
GlyAsn: 2.967 ± 0.082
GlyPro: 2.037 ± 0.051
GlyGln: 2.673 ± 0.07
GlyArg: 3.949 ± 0.077
GlySer: 4.528 ± 0.104
GlyThr: 5.069 ± 0.115
GlyVal: 4.959 ± 0.074
GlyTrp: 0.82 ± 0.035
GlyTyr: 3.11 ± 0.062
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.64 ± 0.048
HisCys: 0.408 ± 0.022
HisAsp: 1.405 ± 0.046
HisGlu: 1.239 ± 0.04
HisPhe: 1.05 ± 0.033
HisGly: 1.818 ± 0.048
HisHis: 0.81 ± 0.036
HisIle: 2.002 ± 0.048
HisLys: 1.04 ± 0.032
HisLeu: 2.233 ± 0.053
HisMet: 0.821 ± 0.032
HisAsn: 0.87 ± 0.031
HisPro: 1.258 ± 0.044
HisGln: 0.786 ± 0.03
HisArg: 1.147 ± 0.043
HisSer: 1.205 ± 0.041
HisThr: 1.416 ± 0.042
HisVal: 1.67 ± 0.046
HisTrp: 0.274 ± 0.019
HisTyr: 0.868 ± 0.034
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.223 ± 0.104
IleCys: 1.125 ± 0.042
IleAsp: 4.085 ± 0.074
IleGlu: 3.745 ± 0.073
IlePhe: 2.466 ± 0.062
IleGly: 5.39 ± 0.094
IleHis: 1.733 ± 0.047
IleIle: 4.7 ± 0.087
IleLys: 3.066 ± 0.071
IleLeu: 6.318 ± 0.109
IleMet: 1.937 ± 0.051
IleAsn: 2.367 ± 0.057
IlePro: 3.441 ± 0.073
IleGln: 2.302 ± 0.058
IleArg: 3.718 ± 0.07
IleSer: 4.272 ± 0.081
IleThr: 4.055 ± 0.087
IleVal: 5.013 ± 0.088
IleTrp: 0.613 ± 0.029
IleTyr: 2.288 ± 0.055
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.137 ± 0.089
LysCys: 0.486 ± 0.023
LysAsp: 3.502 ± 0.076
LysGlu: 4.776 ± 0.085
LysPhe: 1.47 ± 0.042
LysGly: 4.105 ± 0.074
LysHis: 1.028 ± 0.036
LysIle: 3.611 ± 0.069
LysLys: 4.76 ± 0.096
LysLeu: 4.348 ± 0.082
LysMet: 1.814 ± 0.045
LysAsn: 2.536 ± 0.061
LysPro: 1.939 ± 0.052
LysGln: 2.105 ± 0.053
LysArg: 2.662 ± 0.06
LysSer: 2.534 ± 0.056
LysThr: 3.178 ± 0.07
LysVal: 3.591 ± 0.079
LysTrp: 0.535 ± 0.031
LysTyr: 2.055 ± 0.06
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.685 ± 0.118
LeuCys: 1.412 ± 0.048
LeuAsp: 5.241 ± 0.081
LeuGlu: 4.977 ± 0.083
LeuPhe: 3.703 ± 0.079
LeuGly: 6.77 ± 0.113
LeuHis: 2.245 ± 0.054
LeuIle: 5.538 ± 0.1
LeuLys: 5.056 ± 0.078
LeuLeu: 8.279 ± 0.119
LeuMet: 2.512 ± 0.059
LeuAsn: 3.05 ± 0.065
LeuPro: 4.409 ± 0.082
LeuGln: 3.654 ± 0.077
LeuArg: 4.547 ± 0.083
LeuSer: 6.049 ± 0.092
LeuThr: 5.637 ± 0.101
LeuVal: 5.677 ± 0.09
LeuTrp: 0.84 ± 0.036
LeuTyr: 3.218 ± 0.066
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.118 ± 0.073
MetCys: 0.321 ± 0.02
MetAsp: 1.883 ± 0.054
MetGlu: 2.099 ± 0.048
MetPhe: 0.941 ± 0.032
MetGly: 2.194 ± 0.054
MetHis: 0.644 ± 0.028
MetIle: 1.888 ± 0.051
MetLys: 2.281 ± 0.047
MetLeu: 2.494 ± 0.06
MetMet: 0.937 ± 0.04
MetAsn: 1.372 ± 0.051
MetPro: 1.404 ± 0.044
MetGln: 0.998 ± 0.031
MetArg: 1.38 ± 0.036
MetSer: 1.591 ± 0.042
MetThr: 2.011 ± 0.054
MetVal: 1.798 ± 0.049
MetTrp: 0.199 ± 0.017
MetTyr: 0.88 ± 0.035
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.892 ± 0.081
AsnCys: 0.481 ± 0.024
AsnAsp: 2.005 ± 0.054
AsnGlu: 1.928 ± 0.049
AsnPhe: 1.313 ± 0.043
AsnGly: 2.984 ± 0.08
AsnHis: 0.89 ± 0.031
AsnIle: 2.64 ± 0.072
AsnLys: 1.952 ± 0.059
AsnLeu: 3.271 ± 0.062
AsnMet: 1.062 ± 0.037
AsnAsn: 1.323 ± 0.059
AsnPro: 2.025 ± 0.053
AsnGln: 1.314 ± 0.042
AsnArg: 1.873 ± 0.048
AsnSer: 1.818 ± 0.061
AsnThr: 2.116 ± 0.079
AsnVal: 2.561 ± 0.063
AsnTrp: 0.388 ± 0.024
AsnTyr: 1.295 ± 0.043
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.576 ± 0.08
ProCys: 0.522 ± 0.026
ProAsp: 3.129 ± 0.064
ProGlu: 3.266 ± 0.065
ProPhe: 1.74 ± 0.045
ProGly: 3.021 ± 0.06
ProHis: 0.907 ± 0.034
ProIle: 2.158 ± 0.053
ProLys: 1.666 ± 0.048
ProLeu: 3.607 ± 0.07
ProMet: 1.132 ± 0.038
ProAsn: 1.169 ± 0.036
ProPro: 1.183 ± 0.04
ProGln: 1.473 ± 0.049
ProArg: 1.593 ± 0.05
ProSer: 2.17 ± 0.053
ProThr: 1.664 ± 0.046
ProVal: 3.905 ± 0.066
ProTrp: 0.408 ± 0.024
ProTyr: 1.736 ± 0.047
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.542 ± 0.075
GlnCys: 0.456 ± 0.024
GlnAsp: 1.94 ± 0.056
GlnGlu: 2.53 ± 0.061
GlnPhe: 1.309 ± 0.038
GlnGly: 2.351 ± 0.054
GlnHis: 0.868 ± 0.032
GlnIle: 2.424 ± 0.051
GlnLys: 2.515 ± 0.058
GlnLeu: 3.177 ± 0.065
GlnMet: 1.173 ± 0.042
GlnAsn: 1.447 ± 0.042
GlnPro: 1.382 ± 0.044
GlnGln: 1.838 ± 0.062
GlnArg: 1.876 ± 0.056
GlnSer: 2.0 ± 0.053
GlnThr: 1.949 ± 0.053
GlnVal: 2.273 ± 0.057
GlnTrp: 0.405 ± 0.021
GlnTyr: 1.494 ± 0.046
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.404 ± 0.063
ArgCys: 0.679 ± 0.031
ArgAsp: 2.772 ± 0.063
ArgGlu: 3.198 ± 0.068
ArgPhe: 1.992 ± 0.048
ArgGly: 2.81 ± 0.057
ArgHis: 1.485 ± 0.043
ArgIle: 3.64 ± 0.075
ArgLys: 3.09 ± 0.067
ArgLeu: 4.788 ± 0.089
ArgMet: 1.555 ± 0.041
ArgAsn: 2.023 ± 0.043
ArgPro: 2.036 ± 0.051
ArgGln: 2.604 ± 0.071
ArgArg: 3.069 ± 0.08
ArgSer: 2.56 ± 0.05
ArgThr: 2.795 ± 0.062
ArgVal: 3.1 ± 0.052
ArgTrp: 0.54 ± 0.029
ArgTyr: 1.962 ± 0.051
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.304 ± 0.074
SerCys: 0.803 ± 0.034
SerAsp: 3.147 ± 0.066
SerGlu: 2.809 ± 0.066
SerPhe: 2.401 ± 0.049
SerGly: 4.912 ± 0.088
SerHis: 1.472 ± 0.047
SerIle: 3.612 ± 0.07
SerLys: 2.527 ± 0.056
SerLeu: 5.78 ± 0.1
SerMet: 1.611 ± 0.045
SerAsn: 1.73 ± 0.061
SerPro: 2.141 ± 0.047
SerGln: 2.126 ± 0.053
SerArg: 3.139 ± 0.071
SerSer: 3.217 ± 0.062
SerThr: 2.73 ± 0.076
SerVal: 4.121 ± 0.074
SerTrp: 0.634 ± 0.029
SerTyr: 2.15 ± 0.054
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.472 ± 0.104
ThrCys: 0.716 ± 0.037
ThrAsp: 3.362 ± 0.081
ThrGlu: 3.229 ± 0.066
ThrPhe: 2.107 ± 0.052
ThrGly: 5.337 ± 0.103
ThrHis: 1.154 ± 0.035
ThrIle: 4.029 ± 0.074
ThrLys: 2.722 ± 0.063
ThrLeu: 5.366 ± 0.081
ThrMet: 1.591 ± 0.044
ThrAsn: 1.952 ± 0.073
ThrPro: 2.619 ± 0.058
ThrGln: 1.604 ± 0.043
ThrArg: 2.302 ± 0.053
ThrSer: 2.79 ± 0.073
ThrThr: 3.118 ± 0.109
ThrVal: 4.636 ± 0.099
ThrTrp: 0.591 ± 0.028
ThrTyr: 2.132 ± 0.059
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.573 ± 0.1
ValCys: 1.126 ± 0.036
ValAsp: 3.684 ± 0.084
ValGlu: 3.764 ± 0.063
ValPhe: 2.877 ± 0.064
ValGly: 4.492 ± 0.097
ValHis: 1.555 ± 0.043
ValIle: 5.005 ± 0.092
ValLys: 3.545 ± 0.079
ValLeu: 7.062 ± 0.108
ValMet: 2.111 ± 0.051
ValAsn: 2.478 ± 0.073
ValPro: 3.348 ± 0.068
ValGln: 2.482 ± 0.058
ValArg: 3.871 ± 0.07
ValSer: 4.46 ± 0.078
ValThr: 4.521 ± 0.088
ValVal: 5.165 ± 0.085
ValTrp: 0.68 ± 0.03
ValTyr: 2.627 ± 0.064
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.668 ± 0.032
TrpCys: 0.142 ± 0.013
TrpAsp: 0.632 ± 0.027
TrpGlu: 0.497 ± 0.028
TrpPhe: 0.482 ± 0.024
TrpGly: 0.718 ± 0.033
TrpHis: 0.345 ± 0.022
TrpIle: 0.674 ± 0.028
TrpLys: 0.654 ± 0.028
TrpLeu: 1.035 ± 0.041
TrpMet: 0.304 ± 0.02
TrpAsn: 0.461 ± 0.023
TrpPro: 0.354 ± 0.02
TrpGln: 0.679 ± 0.033
TrpArg: 0.503 ± 0.025
TrpSer: 0.549 ± 0.026
TrpThr: 0.481 ± 0.024
TrpVal: 0.498 ± 0.026
TrpTrp: 0.15 ± 0.014
TrpTyr: 0.401 ± 0.026
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.958 ± 0.067
TyrCys: 0.577 ± 0.028
TyrAsp: 2.531 ± 0.071
TyrGlu: 2.245 ± 0.053
TyrPhe: 1.528 ± 0.046
TyrGly: 3.18 ± 0.072
TyrHis: 0.997 ± 0.039
TyrIle: 2.59 ± 0.06
TyrLys: 1.787 ± 0.05
TyrLeu: 3.325 ± 0.068
TyrMet: 1.028 ± 0.028
TyrAsn: 1.369 ± 0.043
TyrPro: 1.457 ± 0.044
TyrGln: 1.289 ± 0.038
TyrArg: 1.779 ± 0.048
TyrSer: 1.925 ± 0.048
TyrThr: 2.189 ± 0.053
TyrVal: 2.448 ± 0.045
TyrTrp: 0.401 ± 0.025
TyrTyr: 1.454 ± 0.038
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2578 proteins (835493 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
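A protein-level bootstrap of this kind could look roughly like the sketch below. It is a hedged illustration only: the source states just that bootstrapping was done 100 times at the protein level, so reporting the standard deviation of the resampled frequencies as the error, the function names, the fixed seed, and the toy sequences are all assumptions.

```python
import random
import statistics


def frequency(proteins, pair):
    """Frequency of one dipeptide in per mille over all adjacent residue pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the spread of the estimate."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Each resample has as many proteins as the original set.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency(sample, pair))
    # Assumption: the error is the sample standard deviation of the resampled values.
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy_proteins = ["AAA", "AG", "KKAA", "MAAL"]
point_estimate = frequency(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {point_estimate:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residue pairs keeps the pairs within one protein together, which is what "at the protein level" suggests.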

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski