Amino acid dipeptide frequency for Streptococcus cristatus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
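
The counting and the comparison against the uniform expectation can be sketched as follows. This is only a minimal illustration, not the method used to build the table: it assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that any non-standard letter is mapped to X (Xaa), and that the red/blue marking uses 1/400 as its threshold. The names `dipeptide_permille` and `colour` are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille = 0.25 %.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def colour(freq_permille):
    """Hypothetical marking rule: red above the expectation, blue below it."""
    if freq_permille > EXPECTED_PERMILLE:
        return "red"
    if freq_permille < EXPECTED_PERMILLE:
        return "blue"
    return "none"


if __name__ == "__main__":
    freqs = dipeptide_permille(["MKLLAA", "MAAC"])
    for pair in ("AA", "LL", "CC"):
        print(pair, round(freqs[pair], 3), colour(freqs[pair]))
```

With this rule, a table value such as 6.688‰ would be marked red and 0.508‰ blue.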

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
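
As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical, and the sample value is the LeuLeu entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value_permille / 10.0


if __name__ == "__main__":
    leu_leu = 10.803  # LeuLeu frequency in per mille, taken from the table
    print(permille_to_fraction(leu_leu))  # roughly 0.0108 as a fraction
    print(permille_to_percent(leu_leu))   # roughly 1.08 percent
```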

For more information, see the sequence space article on Wikipedia.

1st\2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 6.688 ± 0.128 | 0.508 ± 0.031 | 4.399 ± 0.094 | 5.954 ± 0.135 | 3.555 ± 0.084 | 5.795 ± 0.111 | 1.302 ± 0.05 | 5.477 ± 0.125 | 5.206 ± 0.146 | 8.146 ± 0.235 | 1.838 ± 0.066 | 3.107 ± 0.08 | 2.299 ± 0.093 | 3.501 ± 0.105 | 2.982 ± 0.08 | 4.912 ± 0.121 | 4.205 ± 0.194 | 5.456 ± 0.116 | 0.647 ± 0.03 | 2.844 ± 0.071 | 0.0 ± 0.0
Cys | 0.311 ± 0.021 | 0.065 ± 0.011 | 0.305 ± 0.024 | 0.263 ± 0.024 | 0.27 ± 0.022 | 0.479 ± 0.029 | 0.164 ± 0.018 | 0.303 ± 0.022 | 0.219 ± 0.021 | 0.549 ± 0.034 | 0.12 ± 0.014 | 0.175 ± 0.019 | 0.222 ± 0.021 | 0.299 ± 0.024 | 0.222 ± 0.02 | 0.371 ± 0.027 | 0.209 ± 0.02 | 0.306 ± 0.027 | 0.05 ± 0.009 | 0.188 ± 0.02 | 0.0 ± 0.0
Asp | 3.612 ± 0.112 | 0.308 ± 0.025 | 2.722 ± 0.09 | 4.224 ± 0.105 | 3.487 ± 0.087 | 3.992 ± 0.223 | 0.967 ± 0.041 | 4.053 ± 0.088 | 4.272 ± 0.131 | 6.111 ± 0.129 | 1.365 ± 0.054 | 2.195 ± 0.094 | 1.79 ± 0.092 | 2.238 ± 0.064 | 2.13 ± 0.063 | 3.109 ± 0.084 | 2.654 ± 0.085 | 3.545 ± 0.088 | 0.715 ± 0.046 | 2.951 ± 0.075 | 0.0 ± 0.0
Glu | 6.235 ± 0.189 | 0.258 ± 0.023 | 3.879 ± 0.083 | 6.582 ± 0.132 | 2.758 ± 0.071 | 3.832 ± 0.099 | 1.295 ± 0.049 | 5.555 ± 0.123 | 6.294 ± 0.122 | 7.376 ± 0.134 | 1.832 ± 0.061 | 3.908 ± 0.094 | 1.692 ± 0.056 | 3.049 ± 0.084 | 3.441 ± 0.086 | 3.79 ± 0.179 | 3.79 ± 0.09 | 4.915 ± 0.104 | 0.618 ± 0.038 | 2.205 ± 0.066 | 0.0 ± 0.0
Phe | 3.463 ± 0.09 | 0.243 ± 0.023 | 2.989 ± 0.082 | 3.172 ± 0.088 | 2.286 ± 0.079 | 3.261 ± 0.074 | 0.862 ± 0.044 | 3.057 ± 0.088 | 2.483 ± 0.078 | 4.919 ± 0.122 | 1.054 ± 0.045 | 1.885 ± 0.069 | 1.627 ± 0.063 | 1.607 ± 0.054 | 1.629 ± 0.057 | 3.324 ± 0.086 | 2.4 ± 0.073 | 3.15 ± 0.085 | 0.486 ± 0.03 | 1.906 ± 0.07 | 0.0 ± 0.0
Gly | 5.102 ± 0.278 | 0.397 ± 0.026 | 3.34 ± 0.098 | 3.735 ± 0.104 | 3.232 ± 0.075 | 4.212 ± 0.114 | 1.281 ± 0.049 | 4.907 ± 0.12 | 4.832 ± 0.172 | 7.042 ± 0.126 | 1.771 ± 0.06 | 2.592 ± 0.088 | 1.523 ± 0.084 | 3.374 ± 0.094 | 2.838 ± 0.076 | 3.79 ± 0.079 | 3.728 ± 0.133 | 4.749 ± 0.103 | 0.642 ± 0.037 | 2.587 ± 0.08 | 0.0 ± 0.0
His | 1.117 ± 0.049 | 0.113 ± 0.015 | 0.912 ± 0.039 | 1.216 ± 0.061 | 1.086 ± 0.046 | 1.211 ± 0.05 | 0.488 ± 0.029 | 1.237 ± 0.049 | 1.015 ± 0.052 | 2.111 ± 0.091 | 0.445 ± 0.024 | 0.71 ± 0.037 | 0.924 ± 0.045 | 0.939 ± 0.047 | 0.855 ± 0.042 | 1.049 ± 0.053 | 0.939 ± 0.073 | 1.071 ± 0.048 | 0.18 ± 0.018 | 0.956 ± 0.046 | 0.0 ± 0.0
Ile | 5.92 ± 0.116 | 0.496 ± 0.031 | 3.897 ± 0.086 | 4.979 ± 0.119 | 3.465 ± 0.103 | 4.684 ± 0.102 | 1.239 ± 0.051 | 4.702 ± 0.111 | 4.204 ± 0.088 | 7.472 ± 0.164 | 1.538 ± 0.058 | 2.782 ± 0.084 | 2.705 ± 0.063 | 2.761 ± 0.075 | 2.977 ± 0.078 | 4.874 ± 0.119 | 3.49 ± 0.086 | 4.649 ± 0.098 | 0.606 ± 0.033 | 2.618 ± 0.077 | 0.0 ± 0.0
Lys | 4.97 ± 0.111 | 0.175 ± 0.016 | 4.046 ± 0.159 | 6.01 ± 0.112 | 2.187 ± 0.064 | 3.983 ± 0.138 | 1.086 ± 0.055 | 4.835 ± 0.11 | 5.41 ± 0.135 | 6.005 ± 0.152 | 2.048 ± 0.069 | 3.417 ± 0.1 | 2.27 ± 0.088 | 2.554 ± 0.061 | 3.015 ± 0.08 | 3.79 ± 0.088 | 3.738 ± 0.084 | 4.459 ± 0.099 | 0.601 ± 0.033 | 2.274 ± 0.069 | 0.0 ± 0.0
Leu | 9.615 ± 0.158 | 0.477 ± 0.031 | 6.192 ± 0.143 | 7.559 ± 0.147 | 4.467 ± 0.116 | 6.572 ± 0.133 | 1.733 ± 0.072 | 6.635 ± 0.146 | 6.335 ± 0.174 | 10.803 ± 0.223 | 2.382 ± 0.074 | 3.915 ± 0.093 | 4.185 ± 0.076 | 3.673 ± 0.098 | 4.178 ± 0.156 | 7.29 ± 0.202 | 6.3 ± 0.116 | 7.244 ± 0.231 | 0.715 ± 0.039 | 3.305 ± 0.095 | 0.0 ± 0.0
Met | 1.909 ± 0.062 | 0.098 ± 0.013 | 1.374 ± 0.054 | 1.588 ± 0.059 | 0.797 ± 0.041 | 1.6 ± 0.051 | 0.339 ± 0.025 | 1.819 ± 0.062 | 1.988 ± 0.07 | 2.315 ± 0.084 | 0.689 ± 0.038 | 1.148 ± 0.047 | 0.741 ± 0.033 | 0.922 ± 0.038 | 1.068 ± 0.05 | 1.538 ± 0.056 | 1.795 ± 0.054 | 1.535 ± 0.056 | 0.157 ± 0.015 | 0.541 ± 0.027 | 0.0 ± 0.0
Asn | 2.874 ± 0.079 | 0.229 ± 0.019 | 2.171 ± 0.075 | 2.358 ± 0.068 | 2.031 ± 0.065 | 3.305 ± 0.164 | 0.936 ± 0.047 | 3.097 ± 0.088 | 2.484 ± 0.073 | 4.481 ± 0.106 | 1.006 ± 0.04 | 1.839 ± 0.127 | 2.176 ± 0.067 | 2.193 ± 0.066 | 1.988 ± 0.068 | 2.404 ± 0.065 | 1.877 ± 0.065 | 2.705 ± 0.077 | 0.452 ± 0.032 | 1.737 ± 0.056 | 0.0 ± 0.0
Pro | 2.813 ± 0.092 | 0.147 ± 0.015 | 2.294 ± 0.081 | 3.081 ± 0.099 | 1.666 ± 0.054 | 1.755 ± 0.061 | 0.737 ± 0.038 | 2.339 ± 0.064 | 2.048 ± 0.065 | 2.934 ± 0.071 | 0.657 ± 0.037 | 1.581 ± 0.057 | 0.566 ± 0.033 | 1.434 ± 0.052 | 0.985 ± 0.039 | 2.356 ± 0.078 | 2.11 ± 0.081 | 2.751 ± 0.1 | 0.322 ± 0.024 | 1.3 ± 0.046 | 0.0 ± 0.0
Gln | 3.925 ± 0.102 | 0.106 ± 0.016 | 2.207 ± 0.081 | 3.766 ± 0.091 | 1.564 ± 0.056 | 2.518 ± 0.069 | 0.867 ± 0.045 | 2.987 ± 0.08 | 2.984 ± 0.083 | 4.406 ± 0.097 | 1.006 ± 0.044 | 1.808 ± 0.059 | 1.381 ± 0.059 | 1.769 ± 0.067 | 1.519 ± 0.052 | 2.311 ± 0.111 | 2.347 ± 0.067 | 3.36 ± 0.097 | 0.286 ± 0.023 | 1.36 ± 0.047 | 0.0 ± 0.0
Arg | 2.763 ± 0.072 | 0.207 ± 0.018 | 2.334 ± 0.072 | 3.297 ± 0.085 | 2.014 ± 0.063 | 2.287 ± 0.068 | 0.755 ± 0.038 | 3.006 ± 0.079 | 2.878 ± 0.075 | 4.395 ± 0.145 | 1.129 ± 0.048 | 1.841 ± 0.06 | 1.372 ± 0.056 | 2.075 ± 0.055 | 2.003 ± 0.067 | 2.216 ± 0.071 | 1.985 ± 0.056 | 2.806 ± 0.074 | 0.323 ± 0.025 | 1.641 ± 0.059 | 0.0 ± 0.0
Ser | 4.164 ± 0.11 | 0.327 ± 0.028 | 3.427 ± 0.089 | 3.981 ± 0.181 | 2.984 ± 0.09 | 4.164 ± 0.087 | 1.278 ± 0.047 | 4.236 ± 0.134 | 4.019 ± 0.088 | 7.063 ± 0.204 | 1.495 ± 0.064 | 2.57 ± 0.077 | 2.089 ± 0.06 | 3.266 ± 0.11 | 2.558 ± 0.066 | 4.452 ± 0.153 | 3.145 ± 0.095 | 4.039 ± 0.093 | 0.613 ± 0.037 | 2.522 ± 0.073 | 0.0 ± 0.0
Thr | 4.33 ± 0.113 | 0.277 ± 0.022 | 3.222 ± 0.087 | 3.598 ± 0.076 | 2.474 ± 0.078 | 4.445 ± 0.287 | 1.028 ± 0.074 | 4.241 ± 0.108 | 3.056 ± 0.077 | 5.087 ± 0.111 | 1.061 ± 0.042 | 2.13 ± 0.062 | 2.185 ± 0.081 | 1.757 ± 0.054 | 2.009 ± 0.073 | 3.571 ± 0.106 | 2.857 ± 0.086 | 4.26 ± 0.124 | 0.527 ± 0.038 | 2.161 ± 0.062 | 0.0 ± 0.0
Val | 5.815 ± 0.118 | 0.397 ± 0.027 | 3.913 ± 0.1 | 4.948 ± 0.13 | 3.021 ± 0.082 | 4.45 ± 0.101 | 1.216 ± 0.066 | 4.623 ± 0.094 | 4.416 ± 0.103 | 7.093 ± 0.237 | 1.523 ± 0.056 | 2.748 ± 0.073 | 2.506 ± 0.13 | 2.378 ± 0.076 | 2.784 ± 0.08 | 4.464 ± 0.099 | 4.368 ± 0.129 | 5.013 ± 0.119 | 0.57 ± 0.039 | 2.37 ± 0.067 | 0.0 ± 0.0
Trp | 0.583 ± 0.033 | 0.07 ± 0.016 | 0.474 ± 0.033 | 0.532 ± 0.027 | 0.443 ± 0.032 | 0.565 ± 0.032 | 0.192 ± 0.018 | 0.621 ± 0.036 | 0.534 ± 0.037 | 1.158 ± 0.059 | 0.233 ± 0.018 | 0.464 ± 0.03 | 0.185 ± 0.017 | 0.472 ± 0.031 | 0.363 ± 0.025 | 0.583 ± 0.039 | 0.517 ± 0.03 | 0.481 ± 0.027 | 0.12 ± 0.014 | 0.397 ± 0.035 | 0.0 ± 0.0
Tyr | 2.609 ± 0.068 | 0.193 ± 0.019 | 2.344 ± 0.069 | 2.46 ± 0.078 | 2.015 ± 0.069 | 2.563 ± 0.086 | 0.809 ± 0.038 | 2.327 ± 0.07 | 2.197 ± 0.07 | 4.11 ± 0.1 | 0.766 ± 0.044 | 1.53 ± 0.066 | 1.439 ± 0.056 | 2.216 ± 0.074 | 1.783 ± 0.056 | 2.199 ± 0.073 | 1.822 ± 0.061 | 2.122 ± 0.063 | 0.359 ± 0.026 | 1.675 ± 0.067 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 1914 proteins (584487 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
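
A protein-level bootstrap of this kind can be sketched as below. This is a minimal illustration under stated assumptions, not the database's actual procedure: it assumes that whole proteins are resampled with replacement 100 times, that the frequency is recomputed on each resample, and that the reported error is the standard deviation of those estimates. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Proteins, not individual residues, are the resampling unit: each
    resample draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKLLAALLK", "MAALLGG", "MLLKKAA", "MGGAALL"]
    mean, err = bootstrap_error(toy_proteome, "LL")
    print(f"LL: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than single dipeptides keeps the pairs within a protein together, so the spread reflects protein-to-protein variation.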

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski