Amino acid dipepetide frequency for Streptococcus minor

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.76AlaAla: 5.76 ± 0.143
0.487AlaCys: 0.487 ± 0.025
4.596AlaAsp: 4.596 ± 0.105
4.849AlaGlu: 4.849 ± 0.1
3.373AlaPhe: 3.373 ± 0.094
5.753AlaGly: 5.753 ± 0.112
1.427AlaHis: 1.427 ± 0.048
6.14AlaIle: 6.14 ± 0.113
4.982AlaLys: 4.982 ± 0.099
7.45AlaLeu: 7.45 ± 0.129
1.934AlaMet: 1.934 ± 0.057
3.08AlaAsn: 3.08 ± 0.076
2.2AlaPro: 2.2 ± 0.066
3.22AlaGln: 3.22 ± 0.096
3.096AlaArg: 3.096 ± 0.078
4.671AlaSer: 4.671 ± 0.101
4.556AlaThr: 4.556 ± 0.105
5.308AlaVal: 5.308 ± 0.11
0.626AlaTrp: 0.626 ± 0.037
2.932AlaTyr: 2.932 ± 0.074
0.0AlaXaa: 0.0 ± 0.0
Cys
0.324CysAla: 0.324 ± 0.025
0.052CysCys: 0.052 ± 0.01
0.277CysAsp: 0.277 ± 0.023
0.263CysGlu: 0.263 ± 0.022
0.251CysPhe: 0.251 ± 0.025
0.497CysGly: 0.497 ± 0.027
0.183CysHis: 0.183 ± 0.018
0.291CysIle: 0.291 ± 0.02
0.206CysLys: 0.206 ± 0.02
0.574CysLeu: 0.574 ± 0.031
0.126CysMet: 0.126 ± 0.014
0.199CysAsn: 0.199 ± 0.024
0.241CysPro: 0.241 ± 0.022
0.31CysGln: 0.31 ± 0.024
0.213CysArg: 0.213 ± 0.02
0.347CysSer: 0.347 ± 0.025
0.234CysThr: 0.234 ± 0.024
0.263CysVal: 0.263 ± 0.025
0.047CysTrp: 0.047 ± 0.009
0.173CysTyr: 0.173 ± 0.019
0.0CysXaa: 0.0 ± 0.0
Asp
3.82AspAla: 3.82 ± 0.073
0.298AspCys: 0.298 ± 0.022
2.695AspAsp: 2.695 ± 0.074
4.219AspGlu: 4.219 ± 0.105
3.318AspPhe: 3.318 ± 0.081
3.994AspGly: 3.994 ± 0.103
0.951AspHis: 0.951 ± 0.042
4.553AspIle: 4.553 ± 0.095
3.898AspLys: 3.898 ± 0.095
5.992AspLeu: 5.992 ± 0.107
1.558AspMet: 1.558 ± 0.056
2.266AspAsn: 2.266 ± 0.063
1.593AspPro: 1.593 ± 0.058
1.966AspGln: 1.966 ± 0.054
2.001AspArg: 2.001 ± 0.061
3.046AspSer: 3.046 ± 0.081
2.759AspThr: 2.759 ± 0.073
3.731AspVal: 3.731 ± 0.083
0.715AspTrp: 0.715 ± 0.033
2.817AspTyr: 2.817 ± 0.074
0.0AspXaa: 0.0 ± 0.0
Glu
5.814GluAla: 5.814 ± 0.105
0.249GluCys: 0.249 ± 0.021
3.738GluAsp: 3.738 ± 0.084
6.006GluGlu: 6.006 ± 0.111
2.585GluPhe: 2.585 ± 0.074
3.811GluGly: 3.811 ± 0.084
1.409GluHis: 1.409 ± 0.046
5.315GluIle: 5.315 ± 0.112
5.796GluLys: 5.796 ± 0.11
6.934GluLeu: 6.934 ± 0.135
1.886GluMet: 1.886 ± 0.06
3.555GluAsn: 3.555 ± 0.079
1.753GluPro: 1.753 ± 0.057
2.834GluGln: 2.834 ± 0.078
3.255GluArg: 3.255 ± 0.085
3.236GluSer: 3.236 ± 0.073
3.773GluThr: 3.773 ± 0.092
4.84GluVal: 4.84 ± 0.11
0.544GluTrp: 0.544 ± 0.028
2.027GluTyr: 2.027 ± 0.052
0.0GluXaa: 0.0 ± 0.0
Phe
3.462PheAla: 3.462 ± 0.085
0.263PheCys: 0.263 ± 0.023
2.913PheAsp: 2.913 ± 0.073
3.07PheGlu: 3.07 ± 0.077
2.259PhePhe: 2.259 ± 0.082
3.401PheGly: 3.401 ± 0.079
0.923PheHis: 0.923 ± 0.042
3.173PheIle: 3.173 ± 0.096
2.358PheLys: 2.358 ± 0.073
4.863PheLeu: 4.863 ± 0.129
1.048PheMet: 1.048 ± 0.045
1.798PheAsn: 1.798 ± 0.059
1.533PhePro: 1.533 ± 0.056
1.678PheGln: 1.678 ± 0.068
1.606PheArg: 1.606 ± 0.054
3.29PheSer: 3.29 ± 0.085
2.402PheThr: 2.402 ± 0.07
3.211PheVal: 3.211 ± 0.087
0.499PheTrp: 0.499 ± 0.034
1.859PheTyr: 1.859 ± 0.067
0.0PheXaa: 0.0 ± 0.0
Gly
4.619GlyAla: 4.619 ± 0.109
0.373GlyCys: 0.373 ± 0.027
3.302GlyAsp: 3.302 ± 0.074
3.644GlyGlu: 3.644 ± 0.08
3.373GlyPhe: 3.373 ± 0.081
4.342GlyGly: 4.342 ± 0.097
1.434GlyHis: 1.434 ± 0.061
5.444GlyIle: 5.444 ± 0.107
4.561GlyLys: 4.561 ± 0.088
7.019GlyLeu: 7.019 ± 0.121
1.884GlyMet: 1.884 ± 0.072
2.756GlyAsn: 2.756 ± 0.075
1.463GlyPro: 1.463 ± 0.056
3.483GlyGln: 3.483 ± 0.087
2.824GlyArg: 2.824 ± 0.072
3.836GlySer: 3.836 ± 0.091
3.773GlyThr: 3.773 ± 0.097
4.893GlyVal: 4.893 ± 0.091
0.694GlyTrp: 0.694 ± 0.036
2.779GlyTyr: 2.779 ± 0.07
0.0GlyXaa: 0.0 ± 0.0
His
1.338HisAla: 1.338 ± 0.047
0.129HisCys: 0.129 ± 0.016
1.01HisAsp: 1.01 ± 0.047
1.158HisGlu: 1.158 ± 0.043
1.23HisPhe: 1.23 ± 0.049
1.273HisGly: 1.273 ± 0.048
0.598HisHis: 0.598 ± 0.035
1.401HisIle: 1.401 ± 0.054
1.015HisLys: 1.015 ± 0.039
2.364HisLeu: 2.364 ± 0.069
0.419HisMet: 0.419 ± 0.028
0.733HisAsn: 0.733 ± 0.036
1.019HisPro: 1.019 ± 0.049
1.031HisGln: 1.031 ± 0.041
0.93HisArg: 0.93 ± 0.042
1.186HisSer: 1.186 ± 0.046
0.984HisThr: 0.984 ± 0.037
1.261HisVal: 1.261 ± 0.046
0.131HisTrp: 0.131 ± 0.015
0.987HisTyr: 0.987 ± 0.047
0.0HisXaa: 0.0 ± 0.0
Ile
5.941IleAla: 5.941 ± 0.1
0.546IleCys: 0.546 ± 0.035
4.172IleAsp: 4.172 ± 0.07
5.065IleGlu: 5.065 ± 0.105
3.452IlePhe: 3.452 ± 0.088
5.055IleGly: 5.055 ± 0.094
1.423IleHis: 1.423 ± 0.047
5.004IleIle: 5.004 ± 0.112
4.23IleLys: 4.23 ± 0.098
7.841IleLeu: 7.841 ± 0.168
1.582IleMet: 1.582 ± 0.056
2.882IleAsn: 2.882 ± 0.072
2.993IlePro: 2.993 ± 0.081
3.002IleGln: 3.002 ± 0.072
3.054IleArg: 3.054 ± 0.062
5.119IleSer: 5.119 ± 0.112
3.965IleThr: 3.965 ± 0.099
4.962IleVal: 4.962 ± 0.093
0.668IleTrp: 0.668 ± 0.035
2.65IleTyr: 2.65 ± 0.084
0.0IleXaa: 0.0 ± 0.0
Lys
4.669LysAla: 4.669 ± 0.093
0.199LysCys: 0.199 ± 0.019
3.925LysAsp: 3.925 ± 0.082
5.672LysGlu: 5.672 ± 0.104
1.931LysPhe: 1.931 ± 0.071
3.806LysGly: 3.806 ± 0.086
1.218LysHis: 1.218 ± 0.042
4.642LysIle: 4.642 ± 0.099
5.242LysLys: 5.242 ± 0.11
5.561LysLeu: 5.561 ± 0.108
2.016LysMet: 2.016 ± 0.072
3.096LysAsn: 3.096 ± 0.079
2.055LysPro: 2.055 ± 0.073
2.7LysGln: 2.7 ± 0.068
2.845LysArg: 2.845 ± 0.064
3.513LysSer: 3.513 ± 0.085
3.806LysThr: 3.806 ± 0.073
4.364LysVal: 4.364 ± 0.091
0.63LysTrp: 0.63 ± 0.03
2.102LysTyr: 2.102 ± 0.068
0.0LysXaa: 0.0 ± 0.0
Leu
9.226LeuAla: 9.226 ± 0.146
0.467LeuCys: 0.467 ± 0.03
5.946LeuAsp: 5.946 ± 0.121
7.082LeuGlu: 7.082 ± 0.127
4.481LeuPhe: 4.481 ± 0.111
6.639LeuGly: 6.639 ± 0.129
1.838LeuHis: 1.838 ± 0.057
6.764LeuIle: 6.764 ± 0.128
5.932LeuLys: 5.932 ± 0.103
10.29LeuLeu: 10.29 ± 0.191
2.482LeuMet: 2.482 ± 0.06
3.923LeuAsn: 3.923 ± 0.078
4.211LeuPro: 4.211 ± 0.098
3.677LeuGln: 3.677 ± 0.072
3.898LeuArg: 3.898 ± 0.077
7.394LeuSer: 7.394 ± 0.129
6.29LeuThr: 6.29 ± 0.116
7.241LeuVal: 7.241 ± 0.13
0.665LeuTrp: 0.665 ± 0.037
3.438LeuTyr: 3.438 ± 0.091
0.0LeuXaa: 0.0 ± 0.0
Met
2.104MetAla: 2.104 ± 0.061
0.105MetCys: 0.105 ± 0.014
1.42MetAsp: 1.42 ± 0.049
1.596MetGlu: 1.596 ± 0.052
0.788MetPhe: 0.788 ± 0.042
1.683MetGly: 1.683 ± 0.064
0.382MetHis: 0.382 ± 0.024
1.845MetIle: 1.845 ± 0.071
2.029MetLys: 2.029 ± 0.054
2.247MetLeu: 2.247 ± 0.068
0.776MetMet: 0.776 ± 0.041
1.212MetAsn: 1.212 ± 0.047
0.844MetPro: 0.844 ± 0.044
0.928MetGln: 0.928 ± 0.043
1.047MetArg: 1.047 ± 0.043
1.573MetSer: 1.573 ± 0.053
1.954MetThr: 1.954 ± 0.065
1.809MetVal: 1.809 ± 0.053
0.171MetTrp: 0.171 ± 0.018
0.586MetTyr: 0.586 ± 0.032
0.0MetXaa: 0.0 ± 0.0
Asn
2.73AsnAla: 2.73 ± 0.062
0.223AsnCys: 0.223 ± 0.02
2.039AsnAsp: 2.039 ± 0.065
2.447AsnGlu: 2.447 ± 0.064
1.947AsnPhe: 1.947 ± 0.063
3.133AsnGly: 3.133 ± 0.098
1.064AsnHis: 1.064 ± 0.045
3.293AsnIle: 3.293 ± 0.074
2.515AsnLys: 2.515 ± 0.067
4.598AsnLeu: 4.598 ± 0.093
1.054AsnMet: 1.054 ± 0.054
1.83AsnAsn: 1.83 ± 0.068
2.159AsnPro: 2.159 ± 0.067
2.357AsnGln: 2.357 ± 0.071
2.004AsnArg: 2.004 ± 0.068
2.198AsnSer: 2.198 ± 0.065
2.039AsnThr: 2.039 ± 0.061
2.545AsnVal: 2.545 ± 0.072
0.488AsnTrp: 0.488 ± 0.033
1.786AsnTyr: 1.786 ± 0.062
0.0AsnXaa: 0.0 ± 0.0
Pro
2.587ProAla: 2.587 ± 0.075
0.134ProCys: 0.134 ± 0.018
2.14ProAsp: 2.14 ± 0.069
2.723ProGlu: 2.723 ± 0.067
1.624ProPhe: 1.624 ± 0.059
1.92ProGly: 1.92 ± 0.058
0.754ProHis: 0.754 ± 0.035
2.564ProIle: 2.564 ± 0.066
1.995ProLys: 1.995 ± 0.073
3.094ProLeu: 3.094 ± 0.073
0.774ProMet: 0.774 ± 0.032
1.713ProAsn: 1.713 ± 0.055
0.703ProPro: 0.703 ± 0.053
1.334ProGln: 1.334 ± 0.05
1.095ProArg: 1.095 ± 0.042
2.268ProSer: 2.268 ± 0.07
2.121ProThr: 2.121 ± 0.059
2.789ProVal: 2.789 ± 0.073
0.276ProTrp: 0.276 ± 0.023
1.345ProTyr: 1.345 ± 0.045
0.0ProXaa: 0.0 ± 0.0
Gln
3.919GlnAla: 3.919 ± 0.093
0.129GlnCys: 0.129 ± 0.015
2.156GlnAsp: 2.156 ± 0.066
3.576GlnGlu: 3.576 ± 0.083
1.713GlnPhe: 1.713 ± 0.05
2.407GlnGly: 2.407 ± 0.067
0.902GlnHis: 0.902 ± 0.038
3.009GlnIle: 3.009 ± 0.075
2.862GlnLys: 2.862 ± 0.076
4.418GlnLeu: 4.418 ± 0.092
1.109GlnMet: 1.109 ± 0.042
1.68GlnAsn: 1.68 ± 0.055
1.35GlnPro: 1.35 ± 0.044
1.849GlnGln: 1.849 ± 0.055
1.545GlnArg: 1.545 ± 0.067
2.376GlnSer: 2.376 ± 0.065
2.578GlnThr: 2.578 ± 0.072
3.48GlnVal: 3.48 ± 0.088
0.314GlnTrp: 0.314 ± 0.026
1.443GlnTyr: 1.443 ± 0.054
0.0GlnXaa: 0.0 ± 0.0
Arg
2.613ArgAla: 2.613 ± 0.081
0.157ArgCys: 0.157 ± 0.016
2.245ArgAsp: 2.245 ± 0.068
3.035ArgGlu: 3.035 ± 0.079
1.997ArgPhe: 1.997 ± 0.062
2.285ArgGly: 2.285 ± 0.077
0.867ArgHis: 0.867 ± 0.041
3.016ArgIle: 3.016 ± 0.072
2.852ArgLys: 2.852 ± 0.069
4.317ArgLeu: 4.317 ± 0.095
1.13ArgMet: 1.13 ± 0.044
1.776ArgAsn: 1.776 ± 0.056
1.347ArgPro: 1.347 ± 0.053
2.156ArgGln: 2.156 ± 0.069
1.962ArgArg: 1.962 ± 0.07
2.165ArgSer: 2.165 ± 0.061
2.077ArgThr: 2.077 ± 0.054
2.838ArgVal: 2.838 ± 0.072
0.288ArgTrp: 0.288 ± 0.025
1.634ArgTyr: 1.634 ± 0.053
0.0ArgXaa: 0.0 ± 0.0
Ser
3.939SerAla: 3.939 ± 0.077
0.307SerCys: 0.307 ± 0.024
3.342SerAsp: 3.342 ± 0.089
3.548SerGlu: 3.548 ± 0.076
3.133SerPhe: 3.133 ± 0.087
4.42SerGly: 4.42 ± 0.087
1.364SerHis: 1.364 ± 0.045
4.572SerIle: 4.572 ± 0.099
3.67SerLys: 3.67 ± 0.079
6.649SerLeu: 6.649 ± 0.113
1.441SerMet: 1.441 ± 0.05
2.533SerAsn: 2.533 ± 0.079
2.083SerPro: 2.083 ± 0.06
3.234SerGln: 3.234 ± 0.094
2.512SerArg: 2.512 ± 0.067
3.989SerSer: 3.989 ± 0.118
3.264SerThr: 3.264 ± 0.078
3.9SerVal: 3.9 ± 0.082
0.651SerTrp: 0.651 ± 0.034
2.59SerTyr: 2.59 ± 0.074
0.0SerXaa: 0.0 ± 0.0
Thr
4.314ThrAla: 4.314 ± 0.101
0.314ThrCys: 0.314 ± 0.025
3.29ThrAsp: 3.29 ± 0.072
3.607ThrGlu: 3.607 ± 0.079
2.693ThrPhe: 2.693 ± 0.067
4.289ThrGly: 4.289 ± 0.085
1.04ThrHis: 1.04 ± 0.044
4.669ThrIle: 4.669 ± 0.094
3.246ThrLys: 3.246 ± 0.087
5.467ThrLeu: 5.467 ± 0.092
1.219ThrMet: 1.219 ± 0.045
2.416ThrAsn: 2.416 ± 0.07
2.334ThrPro: 2.334 ± 0.071
1.879ThrGln: 1.879 ± 0.058
2.011ThrArg: 2.011 ± 0.063
3.64ThrSer: 3.64 ± 0.083
3.853ThrThr: 3.853 ± 0.225
4.633ThrVal: 4.633 ± 0.108
0.523ThrTrp: 0.523 ± 0.035
2.397ThrTyr: 2.397 ± 0.073
0.0ThrXaa: 0.0 ± 0.0
Val
6.098ValAla: 6.098 ± 0.115
0.361ValCys: 0.361 ± 0.025
4.263ValAsp: 4.263 ± 0.101
4.938ValGlu: 4.938 ± 0.106
3.094ValPhe: 3.094 ± 0.093
4.621ValGly: 4.621 ± 0.094
1.286ValHis: 1.286 ± 0.05
4.776ValIle: 4.776 ± 0.096
3.973ValLys: 3.973 ± 0.08
6.869ValLeu: 6.869 ± 0.13
1.629ValMet: 1.629 ± 0.053
2.969ValAsn: 2.969 ± 0.075
2.456ValPro: 2.456 ± 0.069
2.48ValGln: 2.48 ± 0.072
2.747ValArg: 2.747 ± 0.075
4.448ValSer: 4.448 ± 0.089
4.636ValThr: 4.636 ± 0.116
5.187ValVal: 5.187 ± 0.099
0.612ValTrp: 0.612 ± 0.03
2.505ValTyr: 2.505 ± 0.071
0.0ValXaa: 0.0 ± 0.0
Trp
0.569TrpAla: 0.569 ± 0.032
0.044TrpCys: 0.044 ± 0.009
0.488TrpAsp: 0.488 ± 0.035
0.572TrpGlu: 0.572 ± 0.033
0.438TrpPhe: 0.438 ± 0.03
0.605TrpGly: 0.605 ± 0.03
0.185TrpHis: 0.185 ± 0.018
0.64TrpIle: 0.64 ± 0.038
0.535TrpLys: 0.535 ± 0.03
1.066TrpLeu: 1.066 ± 0.049
0.23TrpMet: 0.23 ± 0.021
0.49TrpAsn: 0.49 ± 0.032
0.195TrpPro: 0.195 ± 0.019
0.476TrpGln: 0.476 ± 0.03
0.358TrpArg: 0.358 ± 0.023
0.527TrpSer: 0.527 ± 0.029
0.617TrpThr: 0.617 ± 0.034
0.548TrpVal: 0.548 ± 0.028
0.106TrpTrp: 0.106 ± 0.013
0.344TrpTyr: 0.344 ± 0.028
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.595TyrAla: 2.595 ± 0.071
0.234TyrCys: 0.234 ± 0.021
2.231TyrAsp: 2.231 ± 0.066
2.288TyrGlu: 2.288 ± 0.061
1.933TyrPhe: 1.933 ± 0.06
2.561TyrGly: 2.561 ± 0.07
0.949TyrHis: 0.949 ± 0.047
2.496TyrIle: 2.496 ± 0.071
2.091TyrLys: 2.091 ± 0.074
4.273TyrLeu: 4.273 ± 0.114
0.799TyrMet: 0.799 ± 0.037
1.544TyrAsn: 1.544 ± 0.058
1.423TyrPro: 1.423 ± 0.058
2.22TyrGln: 2.22 ± 0.07
1.753TyrArg: 1.753 ± 0.059
2.318TyrSer: 2.318 ± 0.075
2.152TyrThr: 2.152 ± 0.077
2.145TyrVal: 2.145 ± 0.071
0.385TyrTrp: 0.385 ± 0.027
1.736TyrTyr: 1.736 ± 0.059
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1898 proteins (573303 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski