Amino acid dipepetide frequency for Vulcanisaeta sp. SCGC AB-777_J10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.585AlaAla: 4.585 ± 0.162
0.486AlaCys: 0.486 ± 0.048
2.655AlaAsp: 2.655 ± 0.103
3.769AlaGlu: 3.769 ± 0.128
2.82AlaPhe: 2.82 ± 0.124
4.511AlaGly: 4.511 ± 0.146
1.126AlaHis: 1.126 ± 0.071
6.24AlaIle: 6.24 ± 0.18
3.534AlaLys: 3.534 ± 0.121
9.108AlaLeu: 9.108 ± 0.211
2.31AlaMet: 2.31 ± 0.099
2.146AlaAsn: 2.146 ± 0.095
2.302AlaPro: 2.302 ± 0.102
1.538AlaGln: 1.538 ± 0.076
4.922AlaArg: 4.922 ± 0.142
4.209AlaSer: 4.209 ± 0.155
3.063AlaThr: 3.063 ± 0.123
6.535AlaVal: 6.535 ± 0.175
1.02AlaTrp: 1.02 ± 0.067
3.279AlaTyr: 3.279 ± 0.116
0.0AlaXaa: 0.0 ± 0.0
Cys
0.337CysAla: 0.337 ± 0.037
0.098CysCys: 0.098 ± 0.025
0.357CysAsp: 0.357 ± 0.034
0.431CysGlu: 0.431 ± 0.035
0.22CysPhe: 0.22 ± 0.029
0.879CysGly: 0.879 ± 0.065
0.086CysHis: 0.086 ± 0.018
0.518CysIle: 0.518 ± 0.047
0.337CysLys: 0.337 ± 0.038
0.549CysLeu: 0.549 ± 0.049
0.184CysMet: 0.184 ± 0.029
0.306CysAsn: 0.306 ± 0.034
0.781CysPro: 0.781 ± 0.069
0.161CysGln: 0.161 ± 0.024
0.428CysArg: 0.428 ± 0.041
0.373CysSer: 0.373 ± 0.032
0.447CysThr: 0.447 ± 0.046
0.561CysVal: 0.561 ± 0.056
0.075CysTrp: 0.075 ± 0.017
0.208CysTyr: 0.208 ± 0.029
0.0CysXaa: 0.0 ± 0.0
Asp
3.679AspAla: 3.679 ± 0.121
0.459AspCys: 0.459 ± 0.043
2.455AspAsp: 2.455 ± 0.11
3.93AspGlu: 3.93 ± 0.128
1.851AspPhe: 1.851 ± 0.084
3.357AspGly: 3.357 ± 0.119
0.592AspHis: 0.592 ± 0.047
3.891AspIle: 3.891 ± 0.133
2.659AspLys: 2.659 ± 0.115
5.625AspLeu: 5.625 ± 0.152
1.035AspMet: 1.035 ± 0.062
1.463AspAsn: 1.463 ± 0.074
2.408AspPro: 2.408 ± 0.099
0.749AspGln: 0.749 ± 0.053
2.914AspArg: 2.914 ± 0.125
2.177AspSer: 2.177 ± 0.094
1.816AspThr: 1.816 ± 0.08
5.099AspVal: 5.099 ± 0.144
0.796AspTrp: 0.796 ± 0.056
2.914AspTyr: 2.914 ± 0.105
0.0AspXaa: 0.0 ± 0.0
Glu
4.554GluAla: 4.554 ± 0.162
0.388GluCys: 0.388 ± 0.038
3.585GluAsp: 3.585 ± 0.138
5.166GluGlu: 5.166 ± 0.204
2.459GluPhe: 2.459 ± 0.092
4.126GluGly: 4.126 ± 0.134
0.973GluHis: 0.973 ± 0.062
4.428GluIle: 4.428 ± 0.152
2.785GluLys: 2.785 ± 0.096
8.747GluLeu: 8.747 ± 0.272
1.208GluMet: 1.208 ± 0.054
1.961GluAsn: 1.961 ± 0.078
2.326GluPro: 2.326 ± 0.089
0.992GluGln: 0.992 ± 0.061
3.879GluArg: 3.879 ± 0.135
2.891GluSer: 2.891 ± 0.12
2.589GluThr: 2.589 ± 0.118
6.593GluVal: 6.593 ± 0.207
0.698GluTrp: 0.698 ± 0.052
2.875GluTyr: 2.875 ± 0.106
0.0GluXaa: 0.0 ± 0.0
Phe
2.087PheAla: 2.087 ± 0.099
0.208PheCys: 0.208 ± 0.03
1.863PheAsp: 1.863 ± 0.082
1.914PheGlu: 1.914 ± 0.088
1.055PhePhe: 1.055 ± 0.073
2.84PheGly: 2.84 ± 0.107
0.494PheHis: 0.494 ± 0.042
2.875PheIle: 2.875 ± 0.13
1.804PheLys: 1.804 ± 0.081
3.295PheLeu: 3.295 ± 0.133
0.988PheMet: 0.988 ± 0.054
1.757PheAsn: 1.757 ± 0.088
1.208PhePro: 1.208 ± 0.083
0.69PheGln: 0.69 ± 0.053
2.506PheArg: 2.506 ± 0.118
2.149PheSer: 2.149 ± 0.105
2.087PheThr: 2.087 ± 0.087
3.067PheVal: 3.067 ± 0.121
0.42PheTrp: 0.42 ± 0.045
1.373PheTyr: 1.373 ± 0.09
0.0PheXaa: 0.0 ± 0.0
Gly
4.985GlyAla: 4.985 ± 0.158
0.537GlyCys: 0.537 ± 0.046
3.801GlyAsp: 3.801 ± 0.115
4.475GlyGlu: 4.475 ± 0.152
3.361GlyPhe: 3.361 ± 0.139
5.766GlyGly: 5.766 ± 0.197
1.055GlyHis: 1.055 ± 0.057
6.413GlyIle: 6.413 ± 0.171
4.354GlyLys: 4.354 ± 0.134
7.962GlyLeu: 7.962 ± 0.196
1.761GlyMet: 1.761 ± 0.073
2.926GlyAsn: 2.926 ± 0.112
2.561GlyPro: 2.561 ± 0.107
1.208GlyGln: 1.208 ± 0.082
4.617GlyArg: 4.617 ± 0.153
4.828GlySer: 4.828 ± 0.143
3.471GlyThr: 3.471 ± 0.127
6.95GlyVal: 6.95 ± 0.146
1.173GlyTrp: 1.173 ± 0.071
3.283GlyTyr: 3.283 ± 0.122
0.0GlyXaa: 0.0 ± 0.0
His
1.145HisAla: 1.145 ± 0.073
0.141HisCys: 0.141 ± 0.023
0.718HisAsp: 0.718 ± 0.05
1.102HisGlu: 1.102 ± 0.064
0.49HisPhe: 0.49 ± 0.038
1.381HisGly: 1.381 ± 0.072
0.349HisHis: 0.349 ± 0.043
1.051HisIle: 1.051 ± 0.061
0.482HisLys: 0.482 ± 0.05
1.267HisLeu: 1.267 ± 0.075
0.408HisMet: 0.408 ± 0.043
0.471HisAsn: 0.471 ± 0.042
0.894HisPro: 0.894 ± 0.069
0.22HisGln: 0.22 ± 0.03
0.886HisArg: 0.886 ± 0.057
0.686HisSer: 0.686 ± 0.058
0.628HisThr: 0.628 ± 0.053
1.561HisVal: 1.561 ± 0.084
0.216HisTrp: 0.216 ± 0.028
0.792HisTyr: 0.792 ± 0.048
0.0HisXaa: 0.0 ± 0.0
Ile
6.205IleAla: 6.205 ± 0.167
0.384IleCys: 0.384 ± 0.038
4.632IleAsp: 4.632 ± 0.157
4.966IleGlu: 4.966 ± 0.166
1.902IlePhe: 1.902 ± 0.098
5.546IleGly: 5.546 ± 0.168
1.22IleHis: 1.22 ± 0.066
6.884IleIle: 6.884 ± 0.209
4.675IleLys: 4.675 ± 0.135
6.684IleLeu: 6.684 ± 0.198
2.146IleMet: 2.146 ± 0.094
5.072IleAsn: 5.072 ± 0.176
4.091IlePro: 4.091 ± 0.136
1.279IleGln: 1.279 ± 0.084
5.366IleArg: 5.366 ± 0.133
4.593IleSer: 4.593 ± 0.133
5.213IleThr: 5.213 ± 0.162
6.072IleVal: 6.072 ± 0.168
0.761IleTrp: 0.761 ± 0.061
3.118IleTyr: 3.118 ± 0.119
0.0IleXaa: 0.0 ± 0.0
Lys
4.201LysAla: 4.201 ± 0.116
0.49LysCys: 0.49 ± 0.038
2.589LysAsp: 2.589 ± 0.101
3.044LysGlu: 3.044 ± 0.119
1.479LysPhe: 1.479 ± 0.074
3.31LysGly: 3.31 ± 0.118
0.784LysHis: 0.784 ± 0.06
3.15LysIle: 3.15 ± 0.131
1.722LysLys: 1.722 ± 0.104
5.381LysLeu: 5.381 ± 0.174
1.075LysMet: 1.075 ± 0.066
1.569LysAsn: 1.569 ± 0.081
3.271LysPro: 3.271 ± 0.121
1.012LysGln: 1.012 ± 0.075
2.726LysArg: 2.726 ± 0.133
2.699LysSer: 2.699 ± 0.117
2.185LysThr: 2.185 ± 0.094
4.903LysVal: 4.903 ± 0.144
0.616LysTrp: 0.616 ± 0.044
2.891LysTyr: 2.891 ± 0.084
0.0LysXaa: 0.0 ± 0.0
Leu
7.703LeuAla: 7.703 ± 0.187
0.592LeuCys: 0.592 ± 0.05
4.911LeuAsp: 4.911 ± 0.155
5.715LeuGlu: 5.715 ± 0.187
3.318LeuPhe: 3.318 ± 0.128
9.143LeuGly: 9.143 ± 0.26
1.412LeuHis: 1.412 ± 0.079
8.908LeuIle: 8.908 ± 0.229
4.973LeuLys: 4.973 ± 0.156
10.163LeuLeu: 10.163 ± 0.229
3.428LeuMet: 3.428 ± 0.102
4.828LeuAsn: 4.828 ± 0.153
4.569LeuPro: 4.569 ± 0.139
1.62LeuGln: 1.62 ± 0.085
7.594LeuArg: 7.594 ± 0.206
7.28LeuSer: 7.28 ± 0.176
5.436LeuThr: 5.436 ± 0.15
8.351LeuVal: 8.351 ± 0.178
1.067LeuTrp: 1.067 ± 0.071
3.53LeuTyr: 3.53 ± 0.127
0.0LeuXaa: 0.0 ± 0.0
Met
2.083MetAla: 2.083 ± 0.095
0.126MetCys: 0.126 ± 0.022
1.361MetAsp: 1.361 ± 0.074
1.592MetGlu: 1.592 ± 0.089
0.647MetPhe: 0.647 ± 0.05
2.255MetGly: 2.255 ± 0.084
0.439MetHis: 0.439 ± 0.046
1.785MetIle: 1.785 ± 0.085
1.259MetLys: 1.259 ± 0.066
2.412MetLeu: 2.412 ± 0.096
0.624MetMet: 0.624 ± 0.047
1.306MetAsn: 1.306 ± 0.066
1.334MetPro: 1.334 ± 0.068
0.447MetGln: 0.447 ± 0.044
1.785MetArg: 1.785 ± 0.073
1.82MetSer: 1.82 ± 0.082
1.145MetThr: 1.145 ± 0.07
2.196MetVal: 2.196 ± 0.091
0.204MetTrp: 0.204 ± 0.026
1.0MetTyr: 1.0 ± 0.066
0.0MetXaa: 0.0 ± 0.0
Asn
3.467AsnAla: 3.467 ± 0.123
0.392AsnCys: 0.392 ± 0.041
2.126AsnAsp: 2.126 ± 0.089
3.134AsnGlu: 3.134 ± 0.132
1.173AsnPhe: 1.173 ± 0.063
2.742AsnGly: 2.742 ± 0.137
0.553AsnHis: 0.553 ± 0.043
3.044AsnIle: 3.044 ± 0.127
2.095AsnLys: 2.095 ± 0.106
3.365AsnLeu: 3.365 ± 0.107
0.984AsnMet: 0.984 ± 0.063
2.087AsnAsn: 2.087 ± 0.103
2.538AsnPro: 2.538 ± 0.101
0.828AsnGln: 0.828 ± 0.055
2.055AsnArg: 2.055 ± 0.103
2.047AsnSer: 2.047 ± 0.091
2.247AsnThr: 2.247 ± 0.105
3.542AsnVal: 3.542 ± 0.13
0.518AsnTrp: 0.518 ± 0.044
2.514AsnTyr: 2.514 ± 0.124
0.0AsnXaa: 0.0 ± 0.0
Pro
2.342ProAla: 2.342 ± 0.106
0.314ProCys: 0.314 ± 0.03
2.106ProAsp: 2.106 ± 0.097
2.718ProGlu: 2.718 ± 0.117
1.64ProPhe: 1.64 ± 0.091
3.326ProGly: 3.326 ± 0.125
0.761ProHis: 0.761 ± 0.051
3.962ProIle: 3.962 ± 0.122
2.11ProLys: 2.11 ± 0.09
4.209ProLeu: 4.209 ± 0.138
1.102ProMet: 1.102 ± 0.069
1.871ProAsn: 1.871 ± 0.093
2.342ProPro: 2.342 ± 0.127
1.192ProGln: 1.192 ± 0.08
2.993ProArg: 2.993 ± 0.128
2.942ProSer: 2.942 ± 0.109
2.738ProThr: 2.738 ± 0.117
3.236ProVal: 3.236 ± 0.114
0.737ProTrp: 0.737 ± 0.057
2.142ProTyr: 2.142 ± 0.118
0.0ProXaa: 0.0 ± 0.0
Gln
1.463GlnAla: 1.463 ± 0.078
0.235GlnCys: 0.235 ± 0.027
0.792GlnAsp: 0.792 ± 0.05
1.126GlnGlu: 1.126 ± 0.062
0.773GlnPhe: 0.773 ± 0.058
1.569GlnGly: 1.569 ± 0.079
0.286GlnHis: 0.286 ± 0.034
1.035GlnIle: 1.035 ± 0.075
0.561GlnLys: 0.561 ± 0.046
2.255GlnLeu: 2.255 ± 0.087
0.502GlnMet: 0.502 ± 0.038
0.565GlnAsn: 0.565 ± 0.056
0.824GlnPro: 0.824 ± 0.07
0.608GlnGln: 0.608 ± 0.061
1.169GlnArg: 1.169 ± 0.071
1.067GlnSer: 1.067 ± 0.073
0.792GlnThr: 0.792 ± 0.063
1.812GlnVal: 1.812 ± 0.076
0.302GlnTrp: 0.302 ± 0.039
1.051GlnTyr: 1.051 ± 0.076
0.0GlnXaa: 0.0 ± 0.0
Arg
4.534ArgAla: 4.534 ± 0.143
0.573ArgCys: 0.573 ± 0.053
3.832ArgAsp: 3.832 ± 0.107
5.472ArgGlu: 5.472 ± 0.158
2.667ArgPhe: 2.667 ± 0.09
4.887ArgGly: 4.887 ± 0.151
0.961ArgHis: 0.961 ± 0.06
4.569ArgIle: 4.569 ± 0.156
2.997ArgLys: 2.997 ± 0.131
7.072ArgLeu: 7.072 ± 0.215
1.283ArgMet: 1.283 ± 0.076
2.416ArgAsn: 2.416 ± 0.108
2.291ArgPro: 2.291 ± 0.087
1.341ArgGln: 1.341 ± 0.074
4.942ArgArg: 4.942 ± 0.188
3.338ArgSer: 3.338 ± 0.125
2.11ArgThr: 2.11 ± 0.092
6.605ArgVal: 6.605 ± 0.174
0.843ArgTrp: 0.843 ± 0.059
2.71ArgTyr: 2.71 ± 0.112
0.0ArgXaa: 0.0 ± 0.0
Ser
3.448SerAla: 3.448 ± 0.115
0.447SerCys: 0.447 ± 0.05
2.404SerAsp: 2.404 ± 0.091
3.326SerGlu: 3.326 ± 0.119
1.977SerPhe: 1.977 ± 0.092
4.303SerGly: 4.303 ± 0.14
0.765SerHis: 0.765 ± 0.053
5.809SerIle: 5.809 ± 0.18
3.02SerLys: 3.02 ± 0.114
6.276SerLeu: 6.276 ± 0.202
1.887SerMet: 1.887 ± 0.09
2.487SerAsn: 2.487 ± 0.117
2.361SerPro: 2.361 ± 0.108
1.216SerGln: 1.216 ± 0.072
4.107SerArg: 4.107 ± 0.136
3.456SerSer: 3.456 ± 0.126
3.295SerThr: 3.295 ± 0.135
4.401SerVal: 4.401 ± 0.143
0.769SerTrp: 0.769 ± 0.065
1.957SerTyr: 1.957 ± 0.098
0.0SerXaa: 0.0 ± 0.0
Thr
3.42ThrAla: 3.42 ± 0.123
0.412ThrCys: 0.412 ± 0.044
1.8ThrAsp: 1.8 ± 0.08
2.518ThrGlu: 2.518 ± 0.111
1.53ThrPhe: 1.53 ± 0.08
3.765ThrGly: 3.765 ± 0.123
0.906ThrHis: 0.906 ± 0.058
4.287ThrIle: 4.287 ± 0.18
2.349ThrLys: 2.349 ± 0.098
5.174ThrLeu: 5.174 ± 0.17
1.341ThrMet: 1.341 ± 0.071
2.04ThrAsn: 2.04 ± 0.1
2.593ThrPro: 2.593 ± 0.094
1.224ThrGln: 1.224 ± 0.073
2.757ThrArg: 2.757 ± 0.095
2.832ThrSer: 2.832 ± 0.118
2.824ThrThr: 2.824 ± 0.139
4.013ThrVal: 4.013 ± 0.144
0.726ThrTrp: 0.726 ± 0.059
2.444ThrTyr: 2.444 ± 0.126
0.0ThrXaa: 0.0 ± 0.0
Val
5.691ValAla: 5.691 ± 0.183
0.561ValCys: 0.561 ± 0.046
4.993ValAsp: 4.993 ± 0.122
5.26ValGlu: 5.26 ± 0.181
3.283ValPhe: 3.283 ± 0.105
6.782ValGly: 6.782 ± 0.167
1.283ValHis: 1.283 ± 0.075
8.013ValIle: 8.013 ± 0.172
5.24ValLys: 5.24 ± 0.158
8.704ValLeu: 8.704 ± 0.195
2.263ValMet: 2.263 ± 0.087
3.997ValAsn: 3.997 ± 0.133
3.66ValPro: 3.66 ± 0.143
1.408ValGln: 1.408 ± 0.07
6.338ValArg: 6.338 ± 0.18
5.279ValSer: 5.279 ± 0.15
4.287ValThr: 4.287 ± 0.131
8.507ValVal: 8.507 ± 0.221
0.894ValTrp: 0.894 ± 0.064
3.42ValTyr: 3.42 ± 0.112
0.0ValXaa: 0.0 ± 0.0
Trp
0.934TrpAla: 0.934 ± 0.067
0.086TrpCys: 0.086 ± 0.018
0.706TrpAsp: 0.706 ± 0.055
0.584TrpGlu: 0.584 ± 0.048
0.612TrpPhe: 0.612 ± 0.044
1.102TrpGly: 1.102 ± 0.075
0.224TrpHis: 0.224 ± 0.027
0.816TrpIle: 0.816 ± 0.056
0.514TrpLys: 0.514 ± 0.046
1.439TrpLeu: 1.439 ± 0.087
0.231TrpMet: 0.231 ± 0.029
0.494TrpAsn: 0.494 ± 0.049
0.537TrpPro: 0.537 ± 0.047
0.255TrpGln: 0.255 ± 0.037
0.984TrpArg: 0.984 ± 0.068
0.804TrpSer: 0.804 ± 0.061
0.443TrpThr: 0.443 ± 0.043
1.192TrpVal: 1.192 ± 0.074
0.282TrpTrp: 0.282 ± 0.039
0.612TrpTyr: 0.612 ± 0.056
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.295TyrAla: 3.295 ± 0.103
0.404TyrCys: 0.404 ± 0.042
2.185TyrAsp: 2.185 ± 0.093
2.981TyrGlu: 2.981 ± 0.112
1.424TyrPhe: 1.424 ± 0.082
3.899TyrGly: 3.899 ± 0.133
0.62TyrHis: 0.62 ± 0.04
3.13TyrIle: 3.13 ± 0.107
1.624TyrLys: 1.624 ± 0.082
4.856TyrLeu: 4.856 ± 0.144
1.055TyrMet: 1.055 ± 0.071
1.691TyrAsn: 1.691 ± 0.094
1.891TyrPro: 1.891 ± 0.075
0.8TyrGln: 0.8 ± 0.058
2.479TyrArg: 2.479 ± 0.085
2.24TyrSer: 2.24 ± 0.092
2.138TyrThr: 2.138 ± 0.112
4.668TyrVal: 4.668 ± 0.133
0.737TyrTrp: 0.737 ± 0.052
2.13TyrTyr: 2.13 ± 0.111
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 918 proteins (254953 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski