Amino acid dipeptide frequency for Streptococcus sp. oral taxon 056 str. F0418

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
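The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function name `dipeptide_per_mille` and the toy sequences are hypothetical, sequences use one-letter codes, and dipeptides are assumed to be counted as overlapping windows of two residues that never span two proteins.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a window never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Hypothetical toy input (one-letter codes), not taken from the proteome.
toy = ["MKLLA", "MALL"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation for the 20 standard amino acids: 1/400 = 2.5 per mille.
uniform = 1000.0 / 400

for dp, f in sorted(freqs.items()):
    label = "over" if f > uniform else "under"
    print(dp, round(f, 1), label)
```

With only seven dipeptides in the toy input, every observed dipeptide is far above the uniform expectation; the comparison becomes meaningful only for a whole proteome.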

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
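As a small worked example of this unit conversion, the snippet below converts the LeuLeu value from the table (10.88‰) to a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper name `per_mille_to_fraction` is hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


leu_leu = 10.88          # LeuLeu frequency from the table, in per mille
uniform = 1000.0 / 400   # uniform expectation for 400 dipeptides, in per mille

fraction = per_mille_to_fraction(leu_leu)  # about 0.01088
percent = fraction * 100                   # about 1.088 %
enrichment = leu_leu / uniform             # about 4.35 times the uniform expectation

print(fraction, percent, enrichment)
```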

For more information, see the article on sequence space on Wikipedia.

Dipeptide frequencies (‰), listed as FirstSecond: frequency ± error and grouped by the first amino acid.
Amino acids: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.621 ± 0.137
AlaCys: 0.474 ± 0.031
AlaAsp: 4.005 ± 0.094
AlaGlu: 5.197 ± 0.141
AlaPhe: 3.501 ± 0.082
AlaGly: 5.423 ± 0.126
AlaHis: 1.304 ± 0.053
AlaIle: 5.578 ± 0.116
AlaLys: 4.899 ± 0.114
AlaLeu: 7.643 ± 0.151
AlaMet: 1.819 ± 0.065
AlaAsn: 2.96 ± 0.073
AlaPro: 2.056 ± 0.07
AlaGln: 3.189 ± 0.091
AlaArg: 2.907 ± 0.071
AlaSer: 5.104 ± 0.503
AlaThr: 4.187 ± 0.112
AlaVal: 5.252 ± 0.129
AlaTrp: 0.635 ± 0.037
AlaTyr: 2.709 ± 0.074
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.357 ± 0.025
CysCys: 0.048 ± 0.009
CysAsp: 0.316 ± 0.023
CysGlu: 0.276 ± 0.025
CysPhe: 0.319 ± 0.026
CysGly: 0.469 ± 0.034
CysHis: 0.171 ± 0.018
CysIle: 0.33 ± 0.026
CysLys: 0.243 ± 0.02
CysLeu: 0.692 ± 0.036
CysMet: 0.157 ± 0.017
CysAsn: 0.21 ± 0.019
CysPro: 0.218 ± 0.021
CysGln: 0.284 ± 0.026
CysArg: 0.218 ± 0.02
CysSer: 0.423 ± 0.03
CysThr: 0.225 ± 0.018
CysVal: 0.33 ± 0.024
CysTrp: 0.073 ± 0.013
CysTyr: 0.214 ± 0.019
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.641 ± 0.161
AspCys: 0.303 ± 0.022
AspAsp: 2.625 ± 0.077
AspGlu: 4.121 ± 0.094
AspPhe: 3.288 ± 0.086
AspGly: 3.773 ± 0.106
AspHis: 0.986 ± 0.041
AspIle: 4.169 ± 0.105
AspLys: 4.102 ± 0.094
AspLeu: 5.88 ± 0.118
AspMet: 1.391 ± 0.049
AspAsn: 2.209 ± 0.063
AspPro: 1.573 ± 0.056
AspGln: 2.124 ± 0.065
AspArg: 2.104 ± 0.065
AspSer: 3.017 ± 0.085
AspThr: 2.682 ± 0.072
AspVal: 3.624 ± 0.093
AspTrp: 0.653 ± 0.035
AspTyr: 2.894 ± 0.083
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.418 ± 0.12
GluCys: 0.307 ± 0.026
GluAsp: 3.618 ± 0.087
GluGlu: 6.133 ± 0.13
GluPhe: 2.732 ± 0.066
GluGly: 3.663 ± 0.089
GluHis: 1.307 ± 0.047
GluIle: 5.594 ± 0.119
GluLys: 6.379 ± 0.127
GluLeu: 7.065 ± 0.141
GluMet: 1.864 ± 0.068
GluAsn: 4.003 ± 0.097
GluPro: 1.582 ± 0.051
GluGln: 2.935 ± 0.094
GluArg: 3.333 ± 0.095
GluSer: 3.438 ± 0.089
GluThr: 3.556 ± 0.098
GluVal: 4.786 ± 0.125
GluTrp: 0.63 ± 0.033
GluTyr: 2.131 ± 0.072
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.296 ± 0.088
PheCys: 0.305 ± 0.026
PheAsp: 2.985 ± 0.082
PheGlu: 3.121 ± 0.071
PhePhe: 2.4 ± 0.081
PheGly: 3.224 ± 0.08
PheHis: 0.92 ± 0.039
PheIle: 3.436 ± 0.085
PheLys: 2.548 ± 0.07
PheLeu: 4.895 ± 0.114
PheMet: 1.152 ± 0.046
PheAsn: 1.951 ± 0.063
PhePro: 1.543 ± 0.053
PheGln: 1.719 ± 0.065
PheArg: 1.683 ± 0.055
PheSer: 3.51 ± 0.1
PheThr: 2.532 ± 0.084
PheVal: 3.365 ± 0.09
PheTrp: 0.56 ± 0.033
PheTyr: 2.019 ± 0.06
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.517 ± 0.12
GlyCys: 0.426 ± 0.029
GlyAsp: 3.242 ± 0.082
GlyGlu: 3.695 ± 0.083
GlyPhe: 3.208 ± 0.077
GlyGly: 4.162 ± 0.105
GlyHis: 1.337 ± 0.045
GlyIle: 5.129 ± 0.119
GlyLys: 4.465 ± 0.099
GlyLeu: 6.807 ± 0.1
GlyMet: 1.78 ± 0.068
GlyAsn: 2.796 ± 0.099
GlyPro: 1.343 ± 0.054
GlyGln: 3.158 ± 0.093
GlyArg: 2.819 ± 0.078
GlySer: 3.743 ± 0.095
GlyThr: 3.741 ± 0.133
GlyVal: 4.662 ± 0.111
GlyTrp: 0.676 ± 0.037
GlyTyr: 2.648 ± 0.072
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.138 ± 0.041
HisCys: 0.137 ± 0.015
HisAsp: 1.033 ± 0.04
HisGlu: 1.145 ± 0.058
HisPhe: 1.243 ± 0.046
HisGly: 1.222 ± 0.051
HisHis: 0.581 ± 0.033
HisIle: 1.361 ± 0.051
HisLys: 1.043 ± 0.04
HisLeu: 2.197 ± 0.067
HisMet: 0.426 ± 0.029
HisAsn: 0.719 ± 0.036
HisPro: 0.926 ± 0.041
HisGln: 0.949 ± 0.046
HisArg: 0.883 ± 0.044
HisSer: 1.202 ± 0.048
HisThr: 0.945 ± 0.042
HisVal: 1.079 ± 0.049
HisTrp: 0.182 ± 0.018
HisTyr: 1.0 ± 0.045
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.899 ± 0.101
IleCys: 0.624 ± 0.033
IleAsp: 4.096 ± 0.089
IleGlu: 5.063 ± 0.115
IlePhe: 3.675 ± 0.1
IleGly: 4.82 ± 0.112
IleHis: 1.362 ± 0.049
IleIle: 5.175 ± 0.127
IleLys: 4.398 ± 0.095
IleLeu: 7.873 ± 0.166
IleMet: 1.687 ± 0.056
IleAsn: 2.962 ± 0.075
IlePro: 2.835 ± 0.075
IleGln: 2.844 ± 0.075
IleArg: 2.953 ± 0.078
IleSer: 5.238 ± 0.169
IleThr: 3.7 ± 0.087
IleVal: 4.795 ± 0.113
IleTrp: 0.681 ± 0.038
IleTyr: 2.807 ± 0.078
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.765 ± 0.105
LysCys: 0.207 ± 0.019
LysAsp: 3.861 ± 0.083
LysGlu: 6.085 ± 0.128
LysPhe: 2.12 ± 0.065
LysGly: 3.768 ± 0.078
LysHis: 1.186 ± 0.045
LysIle: 4.929 ± 0.105
LysLys: 5.805 ± 0.123
LysLeu: 6.036 ± 0.112
LysMet: 2.202 ± 0.063
LysAsn: 3.577 ± 0.089
LysPro: 2.083 ± 0.075
LysGln: 2.716 ± 0.072
LysArg: 3.11 ± 0.09
LysSer: 3.848 ± 0.091
LysThr: 3.893 ± 0.081
LysVal: 4.499 ± 0.092
LysTrp: 0.644 ± 0.036
LysTyr: 2.311 ± 0.066
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.758 ± 0.158
LeuCys: 0.592 ± 0.034
LeuAsp: 5.938 ± 0.132
LeuGlu: 7.203 ± 0.158
LeuPhe: 4.754 ± 0.128
LeuGly: 6.334 ± 0.115
LeuHis: 1.712 ± 0.063
LeuIle: 6.837 ± 0.143
LeuLys: 6.297 ± 0.114
LeuLeu: 10.88 ± 0.222
LeuMet: 2.5 ± 0.076
LeuAsn: 4.034 ± 0.085
LeuPro: 4.148 ± 0.094
LeuGln: 3.747 ± 0.086
LeuArg: 3.943 ± 0.091
LeuSer: 7.508 ± 0.128
LeuThr: 6.327 ± 0.149
LeuVal: 6.921 ± 0.115
LeuTrp: 0.695 ± 0.036
LeuTyr: 3.376 ± 0.086
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.949 ± 0.064
MetCys: 0.121 ± 0.016
MetAsp: 1.318 ± 0.052
MetGlu: 1.589 ± 0.056
MetPhe: 0.858 ± 0.045
MetGly: 1.619 ± 0.063
MetHis: 0.317 ± 0.024
MetIle: 2.024 ± 0.074
MetLys: 2.156 ± 0.061
MetLeu: 2.361 ± 0.071
MetMet: 0.763 ± 0.038
MetAsn: 1.266 ± 0.046
MetPro: 0.765 ± 0.038
MetGln: 0.936 ± 0.043
MetArg: 1.066 ± 0.048
MetSer: 1.591 ± 0.05
MetThr: 1.851 ± 0.056
MetVal: 1.63 ± 0.055
MetTrp: 0.16 ± 0.018
MetTyr: 0.631 ± 0.034
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.866 ± 0.071
AsnCys: 0.239 ± 0.021
AsnAsp: 2.181 ± 0.061
AsnGlu: 2.423 ± 0.062
AsnPhe: 2.119 ± 0.065
AsnGly: 3.319 ± 0.097
AsnHis: 1.115 ± 0.047
AsnIle: 3.337 ± 0.093
AsnLys: 2.652 ± 0.07
AsnLeu: 4.51 ± 0.105
AsnMet: 1.056 ± 0.046
AsnAsn: 1.828 ± 0.068
AsnPro: 2.115 ± 0.056
AsnGln: 2.418 ± 0.073
AsnArg: 2.161 ± 0.078
AsnSer: 2.737 ± 0.228
AsnThr: 2.063 ± 0.067
AsnVal: 2.662 ± 0.08
AsnTrp: 0.496 ± 0.031
AsnTyr: 1.739 ± 0.06
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.415 ± 0.077
ProCys: 0.15 ± 0.018
ProAsp: 2.124 ± 0.061
ProGlu: 2.766 ± 0.079
ProPhe: 1.716 ± 0.059
ProGly: 1.803 ± 0.065
ProHis: 0.736 ± 0.036
ProIle: 2.415 ± 0.069
ProLys: 1.965 ± 0.062
ProLeu: 2.928 ± 0.071
ProMet: 0.71 ± 0.038
ProAsn: 1.489 ± 0.051
ProPro: 0.599 ± 0.05
ProGln: 1.368 ± 0.051
ProArg: 0.988 ± 0.045
ProSer: 2.161 ± 0.069
ProThr: 1.956 ± 0.087
ProVal: 2.55 ± 0.083
ProTrp: 0.314 ± 0.024
ProTyr: 1.288 ± 0.05
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.795 ± 0.105
GlnCys: 0.118 ± 0.016
GlnAsp: 2.145 ± 0.074
GlnGlu: 3.65 ± 0.097
GlnPhe: 1.703 ± 0.06
GlnGly: 2.461 ± 0.069
GlnHis: 0.849 ± 0.041
GlnIle: 3.115 ± 0.079
GlnLys: 3.094 ± 0.079
GlnLeu: 4.225 ± 0.103
GlnMet: 0.997 ± 0.049
GlnAsn: 1.815 ± 0.065
GlnPro: 1.291 ± 0.052
GlnGln: 1.685 ± 0.065
GlnArg: 1.539 ± 0.059
GlnSer: 2.249 ± 0.069
GlnThr: 2.397 ± 0.078
GlnVal: 3.217 ± 0.078
GlnTrp: 0.307 ± 0.024
GlnTyr: 1.461 ± 0.056
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.557 ± 0.073
ArgCys: 0.182 ± 0.018
ArgAsp: 2.233 ± 0.07
ArgGlu: 3.181 ± 0.1
ArgPhe: 2.069 ± 0.057
ArgGly: 2.24 ± 0.066
ArgHis: 0.833 ± 0.038
ArgIle: 2.991 ± 0.078
ArgLys: 3.051 ± 0.081
ArgLeu: 4.344 ± 0.108
ArgMet: 1.163 ± 0.052
ArgAsn: 1.842 ± 0.053
ArgPro: 1.273 ± 0.052
ArgGln: 1.969 ± 0.055
ArgArg: 2.003 ± 0.068
ArgSer: 2.236 ± 0.06
ArgThr: 1.99 ± 0.064
ArgVal: 2.843 ± 0.072
ArgTrp: 0.303 ± 0.027
ArgTyr: 1.796 ± 0.058
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.38 ± 0.453
SerCys: 0.357 ± 0.027
SerAsp: 3.463 ± 0.079
SerGlu: 3.715 ± 0.097
SerPhe: 3.099 ± 0.088
SerGly: 4.476 ± 0.105
SerHis: 1.373 ± 0.057
SerIle: 4.633 ± 0.163
SerLys: 4.205 ± 0.1
SerLeu: 6.743 ± 0.119
SerMet: 1.546 ± 0.055
SerAsn: 2.944 ± 0.22
SerPro: 1.889 ± 0.062
SerGln: 3.249 ± 0.095
SerArg: 2.53 ± 0.065
SerSer: 4.353 ± 0.129
SerThr: 3.226 ± 0.093
SerVal: 4.414 ± 0.344
SerTrp: 0.596 ± 0.032
SerTyr: 2.721 ± 0.109
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.207 ± 0.116
ThrCys: 0.291 ± 0.025
ThrAsp: 3.221 ± 0.118
ThrGlu: 3.367 ± 0.084
ThrPhe: 2.695 ± 0.101
ThrGly: 4.045 ± 0.101
ThrHis: 1.016 ± 0.045
ThrIle: 4.497 ± 0.085
ThrLys: 3.29 ± 0.092
ThrLeu: 5.216 ± 0.1
ThrMet: 1.136 ± 0.038
ThrAsn: 2.438 ± 0.072
ThrPro: 2.263 ± 0.121
ThrGln: 1.678 ± 0.056
ThrArg: 1.99 ± 0.063
ThrSer: 3.665 ± 0.12
ThrThr: 3.094 ± 0.1
ThrVal: 4.237 ± 0.126
ThrTrp: 0.519 ± 0.037
ThrTyr: 2.283 ± 0.134
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.797 ± 0.115
ValCys: 0.398 ± 0.027
ValAsp: 3.864 ± 0.091
ValGlu: 4.801 ± 0.118
ValPhe: 3.076 ± 0.077
ValGly: 4.471 ± 0.095
ValHis: 1.198 ± 0.052
ValIle: 4.686 ± 0.113
ValLys: 4.326 ± 0.095
ValLeu: 6.983 ± 0.141
ValMet: 1.568 ± 0.061
ValAsn: 2.859 ± 0.077
ValPro: 2.297 ± 0.071
ValGln: 2.32 ± 0.062
ValArg: 2.746 ± 0.084
ValSer: 4.968 ± 0.357
ValThr: 4.257 ± 0.137
ValVal: 4.931 ± 0.11
ValTrp: 0.556 ± 0.033
ValTyr: 2.365 ± 0.065
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.547 ± 0.031
TrpCys: 0.053 ± 0.01
TrpAsp: 0.505 ± 0.035
TrpGlu: 0.542 ± 0.035
TrpPhe: 0.483 ± 0.031
TrpGly: 0.571 ± 0.036
TrpHis: 0.173 ± 0.019
TrpIle: 0.678 ± 0.039
TrpLys: 0.551 ± 0.031
TrpLeu: 1.088 ± 0.047
TrpMet: 0.241 ± 0.022
TrpAsn: 0.54 ± 0.038
TrpPro: 0.21 ± 0.021
TrpGln: 0.485 ± 0.031
TrpArg: 0.382 ± 0.028
TrpSer: 0.624 ± 0.037
TrpThr: 0.483 ± 0.031
TrpVal: 0.494 ± 0.03
TrpTrp: 0.116 ± 0.015
TrpTyr: 0.367 ± 0.026
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.539 ± 0.078
TyrCys: 0.239 ± 0.028
TyrAsp: 2.388 ± 0.143
TyrGlu: 2.402 ± 0.075
TyrPhe: 2.106 ± 0.064
TyrGly: 2.493 ± 0.067
TyrHis: 0.931 ± 0.045
TyrIle: 2.575 ± 0.076
TyrLys: 2.179 ± 0.065
TyrLeu: 4.137 ± 0.092
TyrMet: 0.815 ± 0.039
TyrAsn: 1.591 ± 0.072
TyrPro: 1.432 ± 0.056
TyrGln: 2.34 ± 0.062
TyrArg: 1.739 ± 0.061
TyrSer: 2.297 ± 0.077
TyrThr: 2.04 ± 0.087
TyrVal: 2.177 ± 0.062
TyrTrp: 0.339 ± 0.028
TyrTyr: 1.689 ± 0.058
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 1957 proteins (560762 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
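A protein-level bootstrap of this kind could look roughly like the sketch below. It is a minimal illustration, not the Proteome-pI implementation: it assumes that whole proteins are resampled with replacement, that the dipeptide frequencies are recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, n_rounds=100, seed=0):
    """Estimate per-dipeptide error by resampling whole proteins.

    Assumption: the error is the sample standard deviation of the
    frequency across n_rounds bootstrap replicates (n_rounds >= 2).
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        resampled = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_per_mille(resampled))
    keys = set().union(*replicates)
    # A dipeptide absent from a replicate counts as frequency 0 there.
    return {
        dp: statistics.stdev([rep.get(dp, 0.0) for rep in replicates])
        for dp in keys
    }


# Hypothetical toy input (one-letter codes), not taken from the proteome.
toy = ["MKLLA", "MALL", "GGGG"]
errors = bootstrap_error(toy, n_rounds=100, seed=1)
print(sorted(errors.items()))
```

Resampling at the protein level rather than at the residue level keeps each sequence intact, so the estimate reflects variation between proteins.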

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
This proteome is also available in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski