Amino acid dipepetide frequency for Serratia phage BF

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.155AlaAla: 4.155 ± 0.333
0.453AlaCys: 0.453 ± 0.065
3.035AlaAsp: 3.035 ± 0.183
3.942AlaGlu: 3.942 ± 0.252
2.175AlaPhe: 2.175 ± 0.139
3.738AlaGly: 3.738 ± 0.227
0.879AlaHis: 0.879 ± 0.105
3.701AlaIle: 3.701 ± 0.162
3.738AlaLys: 3.738 ± 0.208
4.608AlaLeu: 4.608 ± 0.224
1.518AlaMet: 1.518 ± 0.114
3.424AlaAsn: 3.424 ± 0.203
1.98AlaPro: 1.98 ± 0.174
1.934AlaGln: 1.934 ± 0.129
2.249AlaArg: 2.249 ± 0.185
3.516AlaSer: 3.516 ± 0.198
3.304AlaThr: 3.304 ± 0.371
3.813AlaVal: 3.813 ± 0.234
0.601AlaTrp: 0.601 ± 0.071
2.267AlaTyr: 2.267 ± 0.144
0.0AlaXaa: 0.0 ± 0.0
Cys
0.537CysAla: 0.537 ± 0.067
0.139CysCys: 0.139 ± 0.041
0.768CysAsp: 0.768 ± 0.087
0.731CysGlu: 0.731 ± 0.081
0.453CysPhe: 0.453 ± 0.061
0.842CysGly: 0.842 ± 0.101
0.231CysHis: 0.231 ± 0.046
0.731CysIle: 0.731 ± 0.081
0.639CysLys: 0.639 ± 0.07
0.824CysLeu: 0.824 ± 0.102
0.333CysMet: 0.333 ± 0.056
0.583CysAsn: 0.583 ± 0.083
0.481CysPro: 0.481 ± 0.079
0.361CysGln: 0.361 ± 0.067
0.398CysArg: 0.398 ± 0.063
0.851CysSer: 0.851 ± 0.101
1.101CysThr: 1.101 ± 0.102
0.787CysVal: 0.787 ± 0.084
0.13CysTrp: 0.13 ± 0.034
0.546CysTyr: 0.546 ± 0.077
0.0CysXaa: 0.0 ± 0.0
Asp
3.813AspAla: 3.813 ± 0.196
0.796AspCys: 0.796 ± 0.083
4.692AspAsp: 4.692 ± 0.265
5.922AspGlu: 5.922 ± 0.274
3.267AspPhe: 3.267 ± 0.168
4.414AspGly: 4.414 ± 0.185
1.175AspHis: 1.175 ± 0.104
5.006AspIle: 5.006 ± 0.218
3.627AspLys: 3.627 ± 0.195
5.08AspLeu: 5.08 ± 0.198
1.906AspMet: 1.906 ± 0.162
3.646AspAsn: 3.646 ± 0.221
2.165AspPro: 2.165 ± 0.142
1.545AspGln: 1.545 ± 0.134
2.101AspArg: 2.101 ± 0.147
4.423AspSer: 4.423 ± 0.211
3.526AspThr: 3.526 ± 0.179
4.201AspVal: 4.201 ± 0.217
0.814AspTrp: 0.814 ± 0.087
3.359AspTyr: 3.359 ± 0.194
0.0AspXaa: 0.0 ± 0.0
Glu
3.627GluAla: 3.627 ± 0.218
0.87GluCys: 0.87 ± 0.113
4.738GluAsp: 4.738 ± 0.23
5.182GluGlu: 5.182 ± 0.268
3.618GluPhe: 3.618 ± 0.178
3.1GluGly: 3.1 ± 0.176
1.684GluHis: 1.684 ± 0.14
4.96GluIle: 4.96 ± 0.256
4.09GluLys: 4.09 ± 0.249
6.515GluLeu: 6.515 ± 0.279
2.647GluMet: 2.647 ± 0.175
3.544GluAsn: 3.544 ± 0.187
1.897GluPro: 1.897 ± 0.124
3.026GluGln: 3.026 ± 0.191
3.202GluArg: 3.202 ± 0.164
4.081GluSer: 4.081 ± 0.231
3.193GluThr: 3.193 ± 0.189
5.136GluVal: 5.136 ± 0.239
0.814GluTrp: 0.814 ± 0.086
4.062GluTyr: 4.062 ± 0.239
0.0GluXaa: 0.0 ± 0.0
Phe
1.925PheAla: 1.925 ± 0.132
0.694PheCys: 0.694 ± 0.08
3.563PheAsp: 3.563 ± 0.188
3.378PheGlu: 3.378 ± 0.181
1.878PhePhe: 1.878 ± 0.165
3.017PheGly: 3.017 ± 0.149
0.768PheHis: 0.768 ± 0.093
3.128PheIle: 3.128 ± 0.184
3.044PheLys: 3.044 ± 0.168
3.146PheLeu: 3.146 ± 0.158
1.342PheMet: 1.342 ± 0.113
2.924PheAsn: 2.924 ± 0.156
1.24PhePro: 1.24 ± 0.11
1.425PheGln: 1.425 ± 0.113
1.397PheArg: 1.397 ± 0.115
3.156PheSer: 3.156 ± 0.155
2.933PheThr: 2.933 ± 0.15
3.035PheVal: 3.035 ± 0.169
0.759PheTrp: 0.759 ± 0.08
2.082PheTyr: 2.082 ± 0.151
0.0PheXaa: 0.0 ± 0.0
Gly
3.23GlyAla: 3.23 ± 0.248
0.713GlyCys: 0.713 ± 0.087
3.535GlyAsp: 3.535 ± 0.2
3.868GlyGlu: 3.868 ± 0.184
2.323GlyPhe: 2.323 ± 0.145
3.859GlyGly: 3.859 ± 0.442
0.953GlyHis: 0.953 ± 0.105
4.053GlyIle: 4.053 ± 0.2
3.924GlyLys: 3.924 ± 0.194
4.701GlyLeu: 4.701 ± 0.203
1.564GlyMet: 1.564 ± 0.129
3.803GlyAsn: 3.803 ± 0.266
0.676GlyPro: 0.676 ± 0.094
2.193GlyGln: 2.193 ± 0.181
2.341GlyArg: 2.341 ± 0.151
5.006GlySer: 5.006 ± 0.297
5.265GlyThr: 5.265 ± 0.339
3.859GlyVal: 3.859 ± 0.192
0.75GlyTrp: 0.75 ± 0.099
3.257GlyTyr: 3.257 ± 0.2
0.0GlyXaa: 0.0 ± 0.0
His
0.944HisAla: 0.944 ± 0.09
0.296HisCys: 0.296 ± 0.06
1.37HisAsp: 1.37 ± 0.115
1.37HisGlu: 1.37 ± 0.126
1.12HisPhe: 1.12 ± 0.121
1.296HisGly: 1.296 ± 0.102
0.416HisHis: 0.416 ± 0.068
1.259HisIle: 1.259 ± 0.099
1.166HisLys: 1.166 ± 0.087
1.629HisLeu: 1.629 ± 0.123
0.5HisMet: 0.5 ± 0.06
1.064HisAsn: 1.064 ± 0.106
0.805HisPro: 0.805 ± 0.085
0.564HisGln: 0.564 ± 0.073
0.657HisArg: 0.657 ± 0.079
1.444HisSer: 1.444 ± 0.124
1.027HisThr: 1.027 ± 0.089
1.231HisVal: 1.231 ± 0.108
0.25HisTrp: 0.25 ± 0.047
0.861HisTyr: 0.861 ± 0.105
0.0HisXaa: 0.0 ± 0.0
Ile
3.85IleAla: 3.85 ± 0.235
0.777IleCys: 0.777 ± 0.087
4.858IleAsp: 4.858 ± 0.238
5.256IleGlu: 5.256 ± 0.256
2.647IlePhe: 2.647 ± 0.175
3.914IleGly: 3.914 ± 0.198
1.536IleHis: 1.536 ± 0.106
4.923IleIle: 4.923 ± 0.234
4.895IleLys: 4.895 ± 0.246
5.265IleLeu: 5.265 ± 0.216
2.054IleMet: 2.054 ± 0.145
4.692IleAsn: 4.692 ± 0.216
2.822IlePro: 2.822 ± 0.168
2.841IleGln: 2.841 ± 0.167
3.193IleArg: 3.193 ± 0.166
4.803IleSer: 4.803 ± 0.221
4.22IleThr: 4.22 ± 0.254
4.59IleVal: 4.59 ± 0.206
0.879IleTrp: 0.879 ± 0.095
2.434IleTyr: 2.434 ± 0.173
0.0IleXaa: 0.0 ± 0.0
Lys
3.424LysAla: 3.424 ± 0.201
0.657LysCys: 0.657 ± 0.093
3.914LysAsp: 3.914 ± 0.215
4.655LysGlu: 4.655 ± 0.26
3.054LysPhe: 3.054 ± 0.19
2.684LysGly: 2.684 ± 0.154
1.462LysHis: 1.462 ± 0.121
4.692LysIle: 4.692 ± 0.177
4.21LysLys: 4.21 ± 0.231
5.358LysLeu: 5.358 ± 0.259
2.175LysMet: 2.175 ± 0.146
3.961LysAsn: 3.961 ± 0.2
1.99LysPro: 1.99 ± 0.127
2.23LysGln: 2.23 ± 0.151
2.776LysArg: 2.776 ± 0.168
3.859LysSer: 3.859 ± 0.192
3.655LysThr: 3.655 ± 0.176
4.636LysVal: 4.636 ± 0.204
0.546LysTrp: 0.546 ± 0.067
3.868LysTyr: 3.868 ± 0.187
0.0LysXaa: 0.0 ± 0.0
Leu
4.84LeuAla: 4.84 ± 0.253
0.907LeuCys: 0.907 ± 0.099
5.302LeuAsp: 5.302 ± 0.248
5.432LeuGlu: 5.432 ± 0.254
3.035LeuPhe: 3.035 ± 0.177
3.97LeuGly: 3.97 ± 0.208
1.573LeuHis: 1.573 ± 0.121
4.969LeuIle: 4.969 ± 0.234
5.395LeuLys: 5.395 ± 0.255
5.876LeuLeu: 5.876 ± 0.268
2.036LeuMet: 2.036 ± 0.132
5.441LeuAsn: 5.441 ± 0.245
3.267LeuPro: 3.267 ± 0.19
3.072LeuGln: 3.072 ± 0.173
3.433LeuArg: 3.433 ± 0.178
6.098LeuSer: 6.098 ± 0.229
4.775LeuThr: 4.775 ± 0.352
4.923LeuVal: 4.923 ± 0.227
0.787LeuTrp: 0.787 ± 0.083
3.6LeuTyr: 3.6 ± 0.199
0.0LeuXaa: 0.0 ± 0.0
Met
1.693MetAla: 1.693 ± 0.124
0.361MetCys: 0.361 ± 0.066
1.767MetAsp: 1.767 ± 0.137
1.638MetGlu: 1.638 ± 0.124
1.545MetPhe: 1.545 ± 0.129
1.499MetGly: 1.499 ± 0.134
0.49MetHis: 0.49 ± 0.072
1.98MetIle: 1.98 ± 0.128
2.508MetLys: 2.508 ± 0.175
1.74MetLeu: 1.74 ± 0.134
0.796MetMet: 0.796 ± 0.082
2.101MetAsn: 2.101 ± 0.144
0.74MetPro: 0.74 ± 0.075
0.972MetGln: 0.972 ± 0.098
1.092MetArg: 1.092 ± 0.095
2.249MetSer: 2.249 ± 0.146
1.693MetThr: 1.693 ± 0.12
1.499MetVal: 1.499 ± 0.108
0.333MetTrp: 0.333 ± 0.057
1.555MetTyr: 1.555 ± 0.126
0.0MetXaa: 0.0 ± 0.0
Asn
3.563AsnAla: 3.563 ± 0.21
0.657AsnCys: 0.657 ± 0.073
3.526AsnAsp: 3.526 ± 0.213
4.201AsnGlu: 4.201 ± 0.211
2.508AsnPhe: 2.508 ± 0.166
4.562AsnGly: 4.562 ± 0.24
1.166AsnHis: 1.166 ± 0.106
4.682AsnIle: 4.682 ± 0.217
4.007AsnLys: 4.007 ± 0.201
4.349AsnLeu: 4.349 ± 0.175
1.647AsnMet: 1.647 ± 0.138
4.247AsnAsn: 4.247 ± 0.195
2.711AsnPro: 2.711 ± 0.192
1.888AsnGln: 1.888 ± 0.124
2.313AsnArg: 2.313 ± 0.145
4.544AsnSer: 4.544 ± 0.243
4.303AsnThr: 4.303 ± 0.259
4.303AsnVal: 4.303 ± 0.213
0.676AsnTrp: 0.676 ± 0.076
2.711AsnTyr: 2.711 ± 0.177
0.0AsnXaa: 0.0 ± 0.0
Pro
1.897ProAla: 1.897 ± 0.147
0.324ProCys: 0.324 ± 0.056
2.563ProAsp: 2.563 ± 0.165
2.896ProGlu: 2.896 ± 0.165
1.629ProPhe: 1.629 ± 0.114
1.323ProGly: 1.323 ± 0.121
0.555ProHis: 0.555 ± 0.079
2.443ProIle: 2.443 ± 0.154
1.897ProLys: 1.897 ± 0.136
2.091ProLeu: 2.091 ± 0.148
0.759ProMet: 0.759 ± 0.079
2.156ProAsn: 2.156 ± 0.142
0.907ProPro: 0.907 ± 0.115
1.064ProGln: 1.064 ± 0.11
1.064ProArg: 1.064 ± 0.115
2.591ProSer: 2.591 ± 0.173
2.221ProThr: 2.221 ± 0.152
2.517ProVal: 2.517 ± 0.146
0.352ProTrp: 0.352 ± 0.068
1.499ProTyr: 1.499 ± 0.114
0.0ProXaa: 0.0 ± 0.0
Gln
2.027GlnAla: 2.027 ± 0.154
0.407GlnCys: 0.407 ± 0.057
1.953GlnAsp: 1.953 ± 0.124
2.998GlnGlu: 2.998 ± 0.203
1.638GlnPhe: 1.638 ± 0.129
1.804GlnGly: 1.804 ± 0.129
0.759GlnHis: 0.759 ± 0.082
2.332GlnIle: 2.332 ± 0.143
2.221GlnLys: 2.221 ± 0.137
3.202GlnLeu: 3.202 ± 0.198
1.221GlnMet: 1.221 ± 0.103
2.406GlnAsn: 2.406 ± 0.181
1.009GlnPro: 1.009 ± 0.103
1.462GlnGln: 1.462 ± 0.144
1.453GlnArg: 1.453 ± 0.112
2.073GlnSer: 2.073 ± 0.142
1.73GlnThr: 1.73 ± 0.143
2.054GlnVal: 2.054 ± 0.142
0.407GlnTrp: 0.407 ± 0.061
2.36GlnTyr: 2.36 ± 0.157
0.0GlnXaa: 0.0 ± 0.0
Arg
1.943ArgAla: 1.943 ± 0.124
0.361ArgCys: 0.361 ± 0.053
2.221ArgAsp: 2.221 ± 0.133
2.647ArgGlu: 2.647 ± 0.196
1.851ArgPhe: 1.851 ± 0.128
2.406ArgGly: 2.406 ± 0.138
0.713ArgHis: 0.713 ± 0.089
3.063ArgIle: 3.063 ± 0.15
2.758ArgLys: 2.758 ± 0.159
3.091ArgLeu: 3.091 ± 0.138
1.157ArgMet: 1.157 ± 0.117
2.785ArgAsn: 2.785 ± 0.15
1.036ArgPro: 1.036 ± 0.105
1.508ArgGln: 1.508 ± 0.12
1.592ArgArg: 1.592 ± 0.128
2.591ArgSer: 2.591 ± 0.143
2.6ArgThr: 2.6 ± 0.163
2.554ArgVal: 2.554 ± 0.14
0.546ArgTrp: 0.546 ± 0.074
1.777ArgTyr: 1.777 ± 0.114
0.0ArgXaa: 0.0 ± 0.0
Ser
3.933SerAla: 3.933 ± 0.225
0.731SerCys: 0.731 ± 0.098
4.451SerAsp: 4.451 ± 0.205
4.109SerGlu: 4.109 ± 0.228
3.285SerPhe: 3.285 ± 0.171
4.821SerGly: 4.821 ± 0.286
1.175SerHis: 1.175 ± 0.106
5.025SerIle: 5.025 ± 0.215
3.97SerLys: 3.97 ± 0.189
5.043SerLeu: 5.043 ± 0.212
1.823SerMet: 1.823 ± 0.146
3.803SerAsn: 3.803 ± 0.195
2.073SerPro: 2.073 ± 0.166
2.202SerGln: 2.202 ± 0.171
2.498SerArg: 2.498 ± 0.156
5.145SerSer: 5.145 ± 0.264
7.181SerThr: 7.181 ± 0.368
4.627SerVal: 4.627 ± 0.212
0.898SerTrp: 0.898 ± 0.086
2.98SerTyr: 2.98 ± 0.185
0.0SerXaa: 0.0 ± 0.0
Thr
3.637ThrAla: 3.637 ± 0.292
0.555ThrCys: 0.555 ± 0.077
4.377ThrAsp: 4.377 ± 0.243
4.053ThrGlu: 4.053 ± 0.195
3.322ThrPhe: 3.322 ± 0.195
5.09ThrGly: 5.09 ± 0.391
1.027ThrHis: 1.027 ± 0.09
4.793ThrIle: 4.793 ± 0.252
3.757ThrLys: 3.757 ± 0.156
5.265ThrLeu: 5.265 ± 0.34
1.379ThrMet: 1.379 ± 0.117
3.646ThrAsn: 3.646 ± 0.217
2.461ThrPro: 2.461 ± 0.147
1.795ThrGln: 1.795 ± 0.138
2.138ThrArg: 2.138 ± 0.115
4.571ThrSer: 4.571 ± 0.265
4.257ThrThr: 4.257 ± 0.373
5.191ThrVal: 5.191 ± 0.367
0.814ThrTrp: 0.814 ± 0.083
2.961ThrTyr: 2.961 ± 0.187
0.0ThrXaa: 0.0 ± 0.0
Val
3.23ValAla: 3.23 ± 0.219
0.75ValCys: 0.75 ± 0.077
4.442ValAsp: 4.442 ± 0.188
4.072ValGlu: 4.072 ± 0.222
2.647ValPhe: 2.647 ± 0.16
3.572ValGly: 3.572 ± 0.209
1.462ValHis: 1.462 ± 0.149
4.433ValIle: 4.433 ± 0.228
4.044ValLys: 4.044 ± 0.195
7.283ValLeu: 7.283 ± 0.273
1.814ValMet: 1.814 ± 0.143
3.979ValAsn: 3.979 ± 0.231
2.591ValPro: 2.591 ± 0.206
3.313ValGln: 3.313 ± 0.162
2.711ValArg: 2.711 ± 0.169
4.682ValSer: 4.682 ± 0.222
4.136ValThr: 4.136 ± 0.362
4.627ValVal: 4.627 ± 0.228
0.676ValTrp: 0.676 ± 0.075
3.211ValTyr: 3.211 ± 0.168
0.0ValXaa: 0.0 ± 0.0
Trp
0.583TrpAla: 0.583 ± 0.072
0.167TrpCys: 0.167 ± 0.039
0.888TrpAsp: 0.888 ± 0.089
0.648TrpGlu: 0.648 ± 0.07
0.555TrpPhe: 0.555 ± 0.072
0.713TrpGly: 0.713 ± 0.091
0.305TrpHis: 0.305 ± 0.05
0.824TrpIle: 0.824 ± 0.103
0.842TrpLys: 0.842 ± 0.095
0.685TrpLeu: 0.685 ± 0.075
0.352TrpMet: 0.352 ± 0.058
0.768TrpAsn: 0.768 ± 0.09
0.194TrpPro: 0.194 ± 0.042
0.342TrpGln: 0.342 ± 0.062
0.416TrpArg: 0.416 ± 0.056
0.74TrpSer: 0.74 ± 0.088
0.851TrpThr: 0.851 ± 0.092
0.944TrpVal: 0.944 ± 0.123
0.148TrpTrp: 0.148 ± 0.033
0.759TrpTyr: 0.759 ± 0.082
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.184TyrAla: 2.184 ± 0.14
0.851TyrCys: 0.851 ± 0.08
3.516TyrAsp: 3.516 ± 0.19
2.989TyrGlu: 2.989 ± 0.217
2.295TyrPhe: 2.295 ± 0.145
3.248TyrGly: 3.248 ± 0.181
0.935TyrHis: 0.935 ± 0.103
3.452TyrIle: 3.452 ± 0.198
3.072TyrLys: 3.072 ± 0.17
2.98TyrLeu: 2.98 ± 0.183
1.221TyrMet: 1.221 ± 0.114
3.59TyrAsn: 3.59 ± 0.197
1.703TyrPro: 1.703 ± 0.134
1.777TyrGln: 1.777 ± 0.129
2.175TyrArg: 2.175 ± 0.163
3.35TyrSer: 3.35 ± 0.21
3.118TyrThr: 3.118 ± 0.176
3.146TyrVal: 3.146 ± 0.196
0.546TyrTrp: 0.546 ± 0.071
2.397TyrTyr: 2.397 ± 0.162
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 549 proteins (108066 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski