Amino acid dipeptide frequency for Streptococcus henryi

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
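As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. It is not the database's own code: the function name `dipeptide_per_mille`, the toy sequences, and the use of one-letter residue codes (the table uses three-letter codes) are assumptions made for the example.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences.

    Hypothetical helper: pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real Streptococcus henryi proteins).
toy = ["MKKLA", "MKA"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Because each protein of length n contributes n - 1 pairs, the frequencies over all observed dipeptides sum to 1000 per mille.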

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
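The comparison against the uniform expectation can be sketched as follows. The baseline of 1000/400 = 2.5‰ follows from the text above; the `classify` helper and the choice to ignore the error estimates are assumptions for illustration, and the exact rule used for the page's colouring is not stated here. The four example values are taken from the table.

```python
STANDARD_AMINO_ACIDS = 20
# Uniform expectation in per mille: 1000 / 400 = 2.5
UNIFORM_PER_MILLE = 1000.0 / (STANDARD_AMINO_ACIDS ** 2)


def classify(value_per_mille, baseline=UNIFORM_PER_MILLE):
    """Hypothetical helper: label a dipeptide relative to the uniform baseline."""
    if value_per_mille > baseline:
        return "over"    # would be marked red
    if value_per_mille < baseline:
        return "under"   # would be marked blue
    return "expected"


# Values copied from the table (per mille).
examples = {"LeuLeu": 9.482, "AlaAla": 5.681, "CysCys": 0.061, "TrpTrp": 0.11}
labels = {pair: classify(value) for pair, value in examples.items()}
print(labels)
```

If Xaa is counted as a 21st residue, the baseline becomes 1000/441, roughly 2.27‰.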

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
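A small sketch of the unit conversion, using the AlaAla value from the table; the helper names are hypothetical.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 5.681  # AlaAla, per mille, from the table
print(per_mille_to_fraction(ala_ala))  # about 0.005681
print(per_mille_to_percent(ala_ala))   # about 0.5681
```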

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.681 ± 0.142
AlaCys: 0.445 ± 0.026
AlaAsp: 4.499 ± 0.105
AlaGlu: 5.282 ± 0.107
AlaPhe: 3.236 ± 0.083
AlaGly: 5.22 ± 0.096
AlaHis: 1.214 ± 0.044
AlaIle: 5.987 ± 0.104
AlaLys: 5.166 ± 0.093
AlaLeu: 7.326 ± 0.13
AlaMet: 1.901 ± 0.05
AlaAsn: 3.246 ± 0.074
AlaPro: 2.112 ± 0.063
AlaGln: 3.187 ± 0.085
AlaArg: 2.788 ± 0.071
AlaSer: 4.809 ± 0.122
AlaThr: 4.153 ± 0.084
AlaVal: 5.21 ± 0.103
AlaTrp: 0.667 ± 0.037
AlaTyr: 2.897 ± 0.067
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.309 ± 0.024
CysCys: 0.061 ± 0.009
CysAsp: 0.337 ± 0.024
CysGlu: 0.251 ± 0.019
CysPhe: 0.253 ± 0.019
CysGly: 0.483 ± 0.026
CysHis: 0.17 ± 0.018
CysIle: 0.318 ± 0.022
CysLys: 0.241 ± 0.02
CysLeu: 0.597 ± 0.029
CysMet: 0.126 ± 0.015
CysAsn: 0.237 ± 0.021
CysPro: 0.26 ± 0.02
CysGln: 0.318 ± 0.023
CysArg: 0.209 ± 0.019
CysSer: 0.319 ± 0.02
CysThr: 0.25 ± 0.019
CysVal: 0.298 ± 0.021
CysTrp: 0.051 ± 0.008
CysTyr: 0.221 ± 0.021
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.957 ± 0.086
AspCys: 0.327 ± 0.023
AspAsp: 3.473 ± 0.076
AspGlu: 4.146 ± 0.088
AspPhe: 3.763 ± 0.085
AspGly: 4.143 ± 0.095
AspHis: 0.878 ± 0.039
AspIle: 4.624 ± 0.072
AspLys: 4.467 ± 0.079
AspLeu: 6.089 ± 0.102
AspMet: 1.67 ± 0.051
AspAsn: 2.983 ± 0.062
AspPro: 1.553 ± 0.043
AspGln: 1.965 ± 0.067
AspArg: 2.086 ± 0.062
AspSer: 3.596 ± 0.094
AspThr: 2.933 ± 0.077
AspVal: 4.003 ± 0.076
AspTrp: 0.715 ± 0.038
AspTyr: 3.214 ± 0.085
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.509 ± 0.113
GluCys: 0.276 ± 0.021
GluAsp: 4.194 ± 0.072
GluGlu: 6.078 ± 0.128
GluPhe: 2.548 ± 0.062
GluGly: 3.644 ± 0.074
GluHis: 1.172 ± 0.039
GluIle: 5.224 ± 0.086
GluLys: 5.8 ± 0.103
GluLeu: 7.043 ± 0.116
GluMet: 1.813 ± 0.057
GluAsn: 4.109 ± 0.075
GluPro: 1.692 ± 0.057
GluGln: 2.509 ± 0.066
GluArg: 2.998 ± 0.07
GluSer: 3.762 ± 0.111
GluThr: 4.042 ± 0.091
GluVal: 4.95 ± 0.094
GluTrp: 0.571 ± 0.028
GluTyr: 1.96 ± 0.048
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.32 ± 0.076
PheCys: 0.3 ± 0.022
PheAsp: 3.331 ± 0.063
PheGlu: 3.281 ± 0.064
PhePhe: 2.089 ± 0.065
PheGly: 3.44 ± 0.08
PheHis: 0.751 ± 0.03
PheIle: 3.178 ± 0.078
PheLys: 2.684 ± 0.062
PheLeu: 4.401 ± 0.112
PheMet: 1.141 ± 0.041
PheAsn: 2.154 ± 0.056
PhePro: 1.472 ± 0.054
PheGln: 1.471 ± 0.047
PheArg: 1.554 ± 0.047
PheSer: 3.311 ± 0.087
PheThr: 2.697 ± 0.064
PheVal: 3.128 ± 0.073
PheTrp: 0.454 ± 0.026
PheTyr: 1.988 ± 0.056
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.604 ± 0.097
GlyCys: 0.413 ± 0.025
GlyAsp: 3.511 ± 0.082
GlyGlu: 3.61 ± 0.08
GlyPhe: 3.3 ± 0.069
GlyGly: 4.191 ± 0.094
GlyHis: 1.332 ± 0.044
GlyIle: 5.24 ± 0.1
GlyLys: 4.63 ± 0.081
GlyLeu: 6.674 ± 0.111
GlyMet: 1.721 ± 0.055
GlyAsn: 2.904 ± 0.071
GlyPro: 1.289 ± 0.042
GlyGln: 3.112 ± 0.068
GlyArg: 2.65 ± 0.074
GlySer: 3.878 ± 0.089
GlyThr: 3.631 ± 0.09
GlyVal: 4.674 ± 0.093
GlyTrp: 0.718 ± 0.035
GlyTyr: 2.746 ± 0.063
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.069 ± 0.046
HisCys: 0.133 ± 0.015
HisAsp: 0.982 ± 0.041
HisGlu: 1.102 ± 0.042
HisPhe: 1.092 ± 0.048
HisGly: 1.151 ± 0.04
HisHis: 0.484 ± 0.033
HisIle: 1.255 ± 0.043
HisLys: 0.969 ± 0.038
HisLeu: 1.903 ± 0.059
HisMet: 0.431 ± 0.021
HisAsn: 0.67 ± 0.032
HisPro: 0.787 ± 0.039
HisGln: 0.767 ± 0.032
HisArg: 0.756 ± 0.033
HisSer: 1.019 ± 0.035
HisThr: 0.909 ± 0.036
HisVal: 1.157 ± 0.04
HisTrp: 0.168 ± 0.018
HisTyr: 0.891 ± 0.039
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.049 ± 0.114
IleCys: 0.487 ± 0.031
IleAsp: 4.552 ± 0.08
IleGlu: 4.905 ± 0.09
IlePhe: 3.451 ± 0.087
IleGly: 4.885 ± 0.087
IleHis: 1.14 ± 0.041
IleIle: 5.21 ± 0.1
IleLys: 4.713 ± 0.085
IleLeu: 7.223 ± 0.122
IleMet: 1.703 ± 0.052
IleAsn: 3.396 ± 0.071
IlePro: 2.782 ± 0.06
IleGln: 2.442 ± 0.07
IleArg: 2.658 ± 0.071
IleSer: 5.215 ± 0.097
IleThr: 4.096 ± 0.079
IleVal: 4.793 ± 0.092
IleTrp: 0.627 ± 0.03
IleTyr: 2.665 ± 0.064
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.149 ± 0.105
LysCys: 0.228 ± 0.021
LysAsp: 4.174 ± 0.078
LysGlu: 5.76 ± 0.101
LysPhe: 2.226 ± 0.058
LysGly: 3.742 ± 0.069
LysHis: 1.208 ± 0.04
LysIle: 4.926 ± 0.1
LysLys: 5.316 ± 0.114
LysLeu: 5.958 ± 0.097
LysMet: 2.017 ± 0.058
LysAsn: 3.547 ± 0.068
LysPro: 2.001 ± 0.06
LysGln: 2.551 ± 0.069
LysArg: 2.999 ± 0.069
LysSer: 4.171 ± 0.082
LysThr: 4.091 ± 0.077
LysVal: 4.662 ± 0.093
LysTrp: 0.652 ± 0.029
LysTyr: 2.511 ± 0.067
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.463 ± 0.133
LeuCys: 0.488 ± 0.027
LeuAsp: 5.991 ± 0.105
LeuGlu: 7.007 ± 0.112
LeuPhe: 4.223 ± 0.097
LeuGly: 6.217 ± 0.111
LeuHis: 1.423 ± 0.049
LeuIle: 6.868 ± 0.139
LeuLys: 6.39 ± 0.106
LeuLeu: 9.482 ± 0.177
LeuMet: 2.437 ± 0.067
LeuAsn: 4.363 ± 0.086
LeuPro: 3.729 ± 0.074
LeuGln: 3.271 ± 0.07
LeuArg: 3.477 ± 0.071
LeuSer: 7.504 ± 0.13
LeuThr: 6.316 ± 0.111
LeuVal: 6.881 ± 0.115
LeuTrp: 0.695 ± 0.033
LeuTyr: 3.256 ± 0.077
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.153 ± 0.062
MetCys: 0.136 ± 0.014
MetAsp: 1.403 ± 0.047
MetGlu: 1.41 ± 0.046
MetPhe: 0.877 ± 0.038
MetGly: 1.602 ± 0.053
MetHis: 0.322 ± 0.023
MetIle: 1.865 ± 0.047
MetLys: 1.93 ± 0.049
MetLeu: 2.193 ± 0.057
MetMet: 0.682 ± 0.03
MetAsn: 1.203 ± 0.037
MetPro: 0.823 ± 0.035
MetGln: 0.835 ± 0.036
MetArg: 0.952 ± 0.033
MetSer: 1.774 ± 0.055
MetThr: 2.154 ± 0.055
MetVal: 1.696 ± 0.054
MetTrp: 0.16 ± 0.015
MetTyr: 0.641 ± 0.03
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.086 ± 0.078
AsnCys: 0.267 ± 0.021
AsnAsp: 2.599 ± 0.057
AsnGlu: 2.687 ± 0.069
AsnPhe: 2.151 ± 0.061
AsnGly: 3.49 ± 0.079
AsnHis: 1.115 ± 0.041
AsnIle: 3.349 ± 0.075
AsnLys: 2.868 ± 0.076
AsnLeu: 4.863 ± 0.101
AsnMet: 1.195 ± 0.039
AsnAsn: 2.232 ± 0.075
AsnPro: 2.34 ± 0.064
AsnGln: 2.411 ± 0.062
AsnArg: 1.973 ± 0.057
AsnSer: 2.859 ± 0.074
AsnThr: 2.381 ± 0.077
AsnVal: 2.962 ± 0.073
AsnTrp: 0.536 ± 0.035
AsnTyr: 2.089 ± 0.057
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.271 ± 0.071
ProCys: 0.129 ± 0.013
ProAsp: 2.167 ± 0.062
ProGlu: 2.819 ± 0.066
ProPhe: 1.579 ± 0.048
ProGly: 1.663 ± 0.048
ProHis: 0.585 ± 0.027
ProIle: 2.335 ± 0.061
ProLys: 2.051 ± 0.068
ProLeu: 2.847 ± 0.065
ProMet: 0.696 ± 0.034
ProAsn: 1.7 ± 0.055
ProPro: 0.5 ± 0.03
ProGln: 1.241 ± 0.048
ProArg: 0.955 ± 0.033
ProSer: 2.037 ± 0.062
ProThr: 1.994 ± 0.057
ProVal: 2.439 ± 0.064
ProTrp: 0.243 ± 0.019
ProTyr: 1.303 ± 0.041
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.548 ± 0.078
GlnCys: 0.144 ± 0.017
GlnAsp: 2.168 ± 0.056
GlnGlu: 3.352 ± 0.074
GlnPhe: 1.533 ± 0.045
GlnGly: 2.286 ± 0.059
GlnHis: 0.64 ± 0.03
GlnIle: 2.845 ± 0.067
GlnLys: 2.975 ± 0.073
GlnLeu: 3.776 ± 0.084
GlnMet: 0.994 ± 0.037
GlnAsn: 1.79 ± 0.055
GlnPro: 1.05 ± 0.041
GlnGln: 1.277 ± 0.048
GlnArg: 1.432 ± 0.055
GlnSer: 2.325 ± 0.063
GlnThr: 2.385 ± 0.063
GlnVal: 3.109 ± 0.076
GlnTrp: 0.298 ± 0.021
GlnTyr: 1.24 ± 0.047
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.476 ± 0.071
ArgCys: 0.172 ± 0.016
ArgAsp: 2.303 ± 0.058
ArgGlu: 2.751 ± 0.073
ArgPhe: 1.955 ± 0.057
ArgGly: 2.244 ± 0.062
ArgHis: 0.8 ± 0.037
ArgIle: 2.762 ± 0.062
ArgLys: 2.781 ± 0.069
ArgLeu: 4.084 ± 0.081
ArgMet: 1.049 ± 0.041
ArgAsn: 1.862 ± 0.057
ArgPro: 1.215 ± 0.041
ArgGln: 1.986 ± 0.061
ArgArg: 1.846 ± 0.058
ArgSer: 1.906 ± 0.058
ArgThr: 1.865 ± 0.046
ArgVal: 2.573 ± 0.062
ArgTrp: 0.28 ± 0.022
ArgTyr: 1.617 ± 0.059
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.333 ± 0.101
SerCys: 0.312 ± 0.025
SerAsp: 4.084 ± 0.091
SerGlu: 4.148 ± 0.108
SerPhe: 3.213 ± 0.08
SerGly: 4.785 ± 0.107
SerHis: 1.319 ± 0.042
SerIle: 4.393 ± 0.086
SerLys: 4.205 ± 0.082
SerLeu: 6.709 ± 0.12
SerMet: 1.424 ± 0.041
SerAsn: 2.993 ± 0.076
SerPro: 1.83 ± 0.056
SerGln: 3.083 ± 0.084
SerArg: 2.544 ± 0.068
SerSer: 4.583 ± 0.129
SerThr: 3.343 ± 0.107
SerVal: 4.207 ± 0.09
SerTrp: 0.65 ± 0.035
SerTyr: 2.699 ± 0.07
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.307 ± 0.096
ThrCys: 0.293 ± 0.024
ThrAsp: 3.421 ± 0.081
ThrGlu: 3.675 ± 0.08
ThrPhe: 2.762 ± 0.069
ThrGly: 4.181 ± 0.082
ThrHis: 1.036 ± 0.036
ThrIle: 4.502 ± 0.084
ThrLys: 3.487 ± 0.078
ThrLeu: 5.572 ± 0.098
ThrMet: 1.186 ± 0.041
ThrAsn: 2.608 ± 0.074
ThrPro: 2.206 ± 0.069
ThrGln: 1.916 ± 0.06
ThrArg: 1.934 ± 0.051
ThrSer: 4.002 ± 0.11
ThrThr: 3.298 ± 0.102
ThrVal: 4.473 ± 0.099
ThrTrp: 0.565 ± 0.033
ThrTyr: 2.478 ± 0.069
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.685 ± 0.108
ValCys: 0.394 ± 0.025
ValAsp: 4.247 ± 0.079
ValGlu: 4.817 ± 0.094
ValPhe: 3.252 ± 0.071
ValGly: 4.275 ± 0.087
ValHis: 1.04 ± 0.037
ValIle: 5.019 ± 0.079
ValLys: 4.455 ± 0.084
ValLeu: 6.599 ± 0.112
ValMet: 1.641 ± 0.045
ValAsn: 3.082 ± 0.067
ValPro: 2.314 ± 0.063
ValGln: 2.155 ± 0.058
ValArg: 2.593 ± 0.072
ValSer: 4.798 ± 0.097
ValThr: 4.795 ± 0.105
ValVal: 4.658 ± 0.095
ValTrp: 0.566 ± 0.031
ValTyr: 2.388 ± 0.06
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.576 ± 0.032
TrpCys: 0.056 ± 0.01
TrpAsp: 0.523 ± 0.026
TrpGlu: 0.553 ± 0.029
TrpPhe: 0.446 ± 0.024
TrpGly: 0.643 ± 0.032
TrpHis: 0.185 ± 0.016
TrpIle: 0.636 ± 0.034
TrpLys: 0.582 ± 0.031
TrpLeu: 1.02 ± 0.053
TrpMet: 0.217 ± 0.018
TrpAsn: 0.507 ± 0.026
TrpPro: 0.157 ± 0.018
TrpGln: 0.441 ± 0.023
TrpArg: 0.342 ± 0.021
TrpSer: 0.607 ± 0.033
TrpThr: 0.51 ± 0.034
TrpVal: 0.574 ± 0.03
TrpTrp: 0.11 ± 0.012
TrpTyr: 0.383 ± 0.023
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.482 ± 0.058
TyrCys: 0.246 ± 0.02
TyrAsp: 2.628 ± 0.062
TyrGlu: 2.259 ± 0.056
TyrPhe: 2.196 ± 0.063
TyrGly: 2.544 ± 0.065
TyrHis: 0.881 ± 0.032
TyrIle: 2.457 ± 0.066
TyrLys: 2.177 ± 0.062
TyrLeu: 4.12 ± 0.085
TyrMet: 0.767 ± 0.039
TyrAsn: 1.867 ± 0.06
TyrPro: 1.439 ± 0.051
TyrGln: 2.228 ± 0.067
TyrArg: 1.778 ± 0.055
TyrSer: 2.385 ± 0.063
TyrThr: 2.059 ± 0.067
TyrVal: 2.33 ± 0.059
TyrTrp: 0.344 ± 0.022
TyrTyr: 1.852 ± 0.063
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2315 proteins (692190 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
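One way such a protein-level bootstrap could look is sketched below: whole proteins are resampled with replacement, the pooled frequency is recomputed for each replicate, and the spread across replicates is reported. This is an assumption-laden illustration rather than the database's procedure; in particular, treating the ± value as a standard deviation over 100 replicates, the helper names, and the toy one-letter sequences are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent pairs within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_per_mille(sequences, pair, n_rounds=100, seed=0):
    """Hypothetical helper: mean and standard deviation (per mille) of one
    dipeptide's frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (not real proteome data).
toy = ["MKKLA", "MKA", "AKKLM", "MLKKA"]
mean, sd = bootstrap_per_mille(toy, "KK")
print(round(mean, 3), round(sd, 3))
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.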

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski