Amino acid dipeptide frequency for Salmonella phage SPFM13

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
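As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. It is not the database's own code: the function name `dipeptide_per_mille`, the one-letter codes, the toy sequences, and the choice to normalise by the total number of dipeptides are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping and never span two proteins, and
    frequencies are normalised by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from the SPFM13 proteome.
toy = ["MAAK", "AAL"]
freqs = dipeptide_per_mille(toy)
```

With the toy input there are five dipeptides in total (MA, AA, AK, AA, AL), so AA accounts for two of the five.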

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
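The expected value can be worked out directly: 1/400 = 0.0025, that is 0.25% or 2.5‰. The sketch below compares a few values from the table against that baseline. The `classify` helper and the simple greater-than or less-than rule are assumptions for illustration; the page's actual colouring thresholds are not stated.

```python
STANDARD_AMINO_ACIDS = 20

# Uniform expectation over 20 x 20 = 400 dipeptides, in per mille.
expected_per_mille = 1000.0 / (STANDARD_AMINO_ACIDS ** 2)


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (hypothetical rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
examples = {"AlaAla": 7.375, "CysCys": 0.134, "LeuLeu": 7.402}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this rule AlaAla and LeuLeu come out as overrepresented and CysCys as underrepresented.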

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.375AlaAla: 7.375 ± 0.442
0.75AlaCys: 0.75 ± 0.114
4.832AlaAsp: 4.832 ± 0.277
5.126AlaGlu: 5.126 ± 0.286
3.159AlaPhe: 3.159 ± 0.249
5.019AlaGly: 5.019 ± 0.288
1.472AlaHis: 1.472 ± 0.131
5.086AlaIle: 5.086 ± 0.249
4.109AlaLys: 4.109 ± 0.307
7.174AlaLeu: 7.174 ± 0.368
2.329AlaMet: 2.329 ± 0.171
3.413AlaAsn: 3.413 ± 0.181
3.078AlaPro: 3.078 ± 0.239
2.449AlaGln: 2.449 ± 0.191
3.574AlaArg: 3.574 ± 0.206
4.082AlaSer: 4.082 ± 0.268
4.926AlaThr: 4.926 ± 0.337
5.434AlaVal: 5.434 ± 0.286
1.205AlaTrp: 1.205 ± 0.158
3.012AlaTyr: 3.012 ± 0.207
0.0AlaXaa: 0.0 ± 0.0
Cys
0.509CysAla: 0.509 ± 0.097
0.134CysCys: 0.134 ± 0.042
0.509CysAsp: 0.509 ± 0.081
0.576CysGlu: 0.576 ± 0.097
0.201CysPhe: 0.201 ± 0.054
0.763CysGly: 0.763 ± 0.095
0.308CysHis: 0.308 ± 0.061
0.415CysIle: 0.415 ± 0.084
0.361CysLys: 0.361 ± 0.072
0.843CysLeu: 0.843 ± 0.11
0.254CysMet: 0.254 ± 0.057
0.549CysAsn: 0.549 ± 0.092
0.348CysPro: 0.348 ± 0.071
0.361CysGln: 0.361 ± 0.078
0.602CysArg: 0.602 ± 0.1
0.696CysSer: 0.696 ± 0.094
0.522CysThr: 0.522 ± 0.097
0.642CysVal: 0.642 ± 0.089
0.134CysTrp: 0.134 ± 0.041
0.468CysTyr: 0.468 ± 0.079
0.0CysXaa: 0.0 ± 0.0
Asp
5.073AspAla: 5.073 ± 0.308
0.602AspCys: 0.602 ± 0.109
3.855AspAsp: 3.855 ± 0.218
4.537AspGlu: 4.537 ± 0.269
2.503AspPhe: 2.503 ± 0.181
4.725AspGly: 4.725 ± 0.284
1.044AspHis: 1.044 ± 0.117
3.815AspIle: 3.815 ± 0.243
3.025AspLys: 3.025 ± 0.202
5.581AspLeu: 5.581 ± 0.307
1.593AspMet: 1.593 ± 0.137
2.73AspAsn: 2.73 ± 0.193
3.346AspPro: 3.346 ± 0.166
1.78AspGln: 1.78 ± 0.156
3.159AspArg: 3.159 ± 0.231
3.078AspSer: 3.078 ± 0.199
3.989AspThr: 3.989 ± 0.199
4.818AspVal: 4.818 ± 0.288
1.164AspTrp: 1.164 ± 0.115
2.824AspTyr: 2.824 ± 0.197
0.0AspXaa: 0.0 ± 0.0
Glu
4.752GluAla: 4.752 ± 0.25
0.562GluCys: 0.562 ± 0.09
3.828GluAsp: 3.828 ± 0.277
4.042GluGlu: 4.042 ± 0.299
2.73GluPhe: 2.73 ± 0.192
3.935GluGly: 3.935 ± 0.215
1.338GluHis: 1.338 ± 0.148
3.948GluIle: 3.948 ± 0.271
3.493GluLys: 3.493 ± 0.227
6.947GluLeu: 6.947 ± 0.319
2.021GluMet: 2.021 ± 0.14
2.583GluAsn: 2.583 ± 0.192
2.356GluPro: 2.356 ± 0.224
2.128GluGln: 2.128 ± 0.17
3.788GluArg: 3.788 ± 0.235
3.159GluSer: 3.159 ± 0.208
3.614GluThr: 3.614 ± 0.233
4.363GluVal: 4.363 ± 0.259
1.178GluTrp: 1.178 ± 0.127
2.851GluTyr: 2.851 ± 0.199
0.0GluXaa: 0.0 ± 0.0
Phe
2.543PheAla: 2.543 ± 0.205
0.375PheCys: 0.375 ± 0.068
2.757PheAsp: 2.757 ± 0.172
2.423PheGlu: 2.423 ± 0.188
1.526PhePhe: 1.526 ± 0.131
2.583PheGly: 2.583 ± 0.227
0.763PheHis: 0.763 ± 0.128
1.807PheIle: 1.807 ± 0.143
1.753PheLys: 1.753 ± 0.17
2.918PheLeu: 2.918 ± 0.175
1.044PheMet: 1.044 ± 0.134
2.516PheAsn: 2.516 ± 0.173
1.7PhePro: 1.7 ± 0.163
1.231PheGln: 1.231 ± 0.121
2.034PheArg: 2.034 ± 0.163
2.57PheSer: 2.57 ± 0.169
3.105PheThr: 3.105 ± 0.177
2.838PheVal: 2.838 ± 0.187
0.509PheTrp: 0.509 ± 0.092
1.794PheTyr: 1.794 ± 0.142
0.0PheXaa: 0.0 ± 0.0
Gly
3.989GlyAla: 3.989 ± 0.232
0.602GlyCys: 0.602 ± 0.093
4.069GlyAsp: 4.069 ± 0.25
4.644GlyGlu: 4.644 ± 0.247
3.212GlyPhe: 3.212 ± 0.215
4.805GlyGly: 4.805 ± 0.472
1.205GlyHis: 1.205 ± 0.143
3.841GlyIle: 3.841 ± 0.208
4.149GlyLys: 4.149 ± 0.312
6.184GlyLeu: 6.184 ± 0.294
1.834GlyMet: 1.834 ± 0.148
3.186GlyAsn: 3.186 ± 0.208
1.713GlyPro: 1.713 ± 0.145
2.73GlyGln: 2.73 ± 0.199
3.774GlyArg: 3.774 ± 0.241
3.681GlySer: 3.681 ± 0.258
4.377GlyThr: 4.377 ± 0.273
4.752GlyVal: 4.752 ± 0.227
1.004GlyTrp: 1.004 ± 0.103
2.918GlyTyr: 2.918 ± 0.169
0.0GlyXaa: 0.0 ± 0.0
His
1.392HisAla: 1.392 ± 0.153
0.254HisCys: 0.254 ± 0.055
1.419HisAsp: 1.419 ± 0.128
1.111HisGlu: 1.111 ± 0.125
0.87HisPhe: 0.87 ± 0.105
1.164HisGly: 1.164 ± 0.108
0.576HisHis: 0.576 ± 0.095
1.258HisIle: 1.258 ± 0.122
0.576HisLys: 0.576 ± 0.093
1.767HisLeu: 1.767 ± 0.142
0.495HisMet: 0.495 ± 0.096
0.763HisAsn: 0.763 ± 0.095
1.057HisPro: 1.057 ± 0.105
0.709HisGln: 0.709 ± 0.092
1.272HisArg: 1.272 ± 0.122
0.857HisSer: 0.857 ± 0.099
1.272HisThr: 1.272 ± 0.114
1.312HisVal: 1.312 ± 0.122
0.335HisTrp: 0.335 ± 0.067
1.151HisTyr: 1.151 ± 0.127
0.0HisXaa: 0.0 ± 0.0
Ile
4.497IleAla: 4.497 ± 0.236
0.656IleCys: 0.656 ± 0.098
4.243IleAsp: 4.243 ± 0.201
3.922IleGlu: 3.922 ± 0.276
1.713IlePhe: 1.713 ± 0.156
3.627IleGly: 3.627 ± 0.213
0.95IleHis: 0.95 ± 0.1
2.878IleIle: 2.878 ± 0.202
2.717IleLys: 2.717 ± 0.166
4.029IleLeu: 4.029 ± 0.199
1.057IleMet: 1.057 ± 0.114
3.145IleAsn: 3.145 ± 0.225
3.226IlePro: 3.226 ± 0.2
1.74IleGln: 1.74 ± 0.152
3.587IleArg: 3.587 ± 0.233
3.52IleSer: 3.52 ± 0.203
4.176IleThr: 4.176 ± 0.247
3.748IleVal: 3.748 ± 0.224
0.576IleTrp: 0.576 ± 0.097
2.275IleTyr: 2.275 ± 0.17
0.0IleXaa: 0.0 ± 0.0
Lys
4.243LysAla: 4.243 ± 0.324
0.428LysCys: 0.428 ± 0.07
2.931LysAsp: 2.931 ± 0.256
3.467LysGlu: 3.467 ± 0.24
1.82LysPhe: 1.82 ± 0.152
3.119LysGly: 3.119 ± 0.272
1.031LysHis: 1.031 ± 0.14
2.623LysIle: 2.623 ± 0.167
2.476LysLys: 2.476 ± 0.206
5.675LysLeu: 5.675 ± 0.297
1.74LysMet: 1.74 ± 0.154
2.075LysAsn: 2.075 ± 0.17
2.891LysPro: 2.891 ± 0.198
2.235LysGln: 2.235 ± 0.158
3.266LysArg: 3.266 ± 0.236
2.597LysSer: 2.597 ± 0.207
3.172LysThr: 3.172 ± 0.186
3.748LysVal: 3.748 ± 0.235
0.642LysTrp: 0.642 ± 0.088
1.914LysTyr: 1.914 ± 0.154
0.0LysXaa: 0.0 ± 0.0
Leu
7.549LeuAla: 7.549 ± 0.305
0.937LeuCys: 0.937 ± 0.114
5.876LeuAsp: 5.876 ± 0.273
5.367LeuGlu: 5.367 ± 0.288
3.172LeuPhe: 3.172 ± 0.211
5.274LeuGly: 5.274 ± 0.33
1.887LeuHis: 1.887 ± 0.179
4.203LeuIle: 4.203 ± 0.241
4.912LeuLys: 4.912 ± 0.247
7.402LeuLeu: 7.402 ± 0.393
2.235LeuMet: 2.235 ± 0.18
5.086LeuAsn: 5.086 ± 0.253
5.166LeuPro: 5.166 ± 0.259
3.333LeuGln: 3.333 ± 0.246
5.327LeuArg: 5.327 ± 0.283
6.224LeuSer: 6.224 ± 0.269
6.492LeuThr: 6.492 ± 0.333
5.836LeuVal: 5.836 ± 0.299
0.99LeuTrp: 0.99 ± 0.1
3.346LeuTyr: 3.346 ± 0.271
0.0LeuXaa: 0.0 ± 0.0
Met
2.516MetAla: 2.516 ± 0.199
0.321MetCys: 0.321 ± 0.065
1.231MetAsp: 1.231 ± 0.121
1.419MetGlu: 1.419 ± 0.139
1.178MetPhe: 1.178 ± 0.104
1.646MetGly: 1.646 ± 0.157
0.589MetHis: 0.589 ± 0.09
1.472MetIle: 1.472 ± 0.167
1.486MetLys: 1.486 ± 0.145
2.543MetLeu: 2.543 ± 0.185
0.843MetMet: 0.843 ± 0.109
1.312MetAsn: 1.312 ± 0.136
0.95MetPro: 0.95 ± 0.101
1.004MetGln: 1.004 ± 0.102
1.593MetArg: 1.593 ± 0.132
2.048MetSer: 2.048 ± 0.171
1.673MetThr: 1.673 ± 0.158
1.874MetVal: 1.874 ± 0.177
0.321MetTrp: 0.321 ± 0.059
1.004MetTyr: 1.004 ± 0.136
0.0MetXaa: 0.0 ± 0.0
Asn
4.176AsnAla: 4.176 ± 0.211
0.361AsnCys: 0.361 ± 0.072
2.784AsnAsp: 2.784 ± 0.177
2.864AsnGlu: 2.864 ± 0.177
1.646AsnPhe: 1.646 ± 0.133
4.296AsnGly: 4.296 ± 0.249
0.87AsnHis: 0.87 ± 0.1
2.449AsnIle: 2.449 ± 0.158
2.342AsnLys: 2.342 ± 0.17
4.069AsnLeu: 4.069 ± 0.227
1.057AsnMet: 1.057 ± 0.131
2.409AsnAsn: 2.409 ± 0.183
3.065AsnPro: 3.065 ± 0.186
1.686AsnGln: 1.686 ± 0.171
2.597AsnArg: 2.597 ± 0.179
2.503AsnSer: 2.503 ± 0.181
3.065AsnThr: 3.065 ± 0.224
3.306AsnVal: 3.306 ± 0.187
0.642AsnTrp: 0.642 ± 0.103
1.941AsnTyr: 1.941 ± 0.162
0.0AsnXaa: 0.0 ± 0.0
Pro
3.681ProAla: 3.681 ± 0.271
0.214ProCys: 0.214 ± 0.052
3.44ProAsp: 3.44 ± 0.226
3.534ProGlu: 3.534 ± 0.238
1.606ProPhe: 1.606 ± 0.183
3.092ProGly: 3.092 ± 0.223
0.857ProHis: 0.857 ± 0.104
2.757ProIle: 2.757 ± 0.217
2.53ProLys: 2.53 ± 0.178
3.681ProLeu: 3.681 ± 0.203
1.205ProMet: 1.205 ± 0.123
1.941ProAsn: 1.941 ± 0.149
1.794ProPro: 1.794 ± 0.16
1.78ProGln: 1.78 ± 0.157
2.168ProArg: 2.168 ± 0.197
2.717ProSer: 2.717 ± 0.198
3.132ProThr: 3.132 ± 0.205
3.922ProVal: 3.922 ± 0.232
0.642ProTrp: 0.642 ± 0.104
1.794ProTyr: 1.794 ± 0.158
0.0ProXaa: 0.0 ± 0.0
Gln
2.811GlnAla: 2.811 ± 0.2
0.308GlnCys: 0.308 ± 0.069
1.807GlnAsp: 1.807 ± 0.133
2.155GlnGlu: 2.155 ± 0.174
1.566GlnPhe: 1.566 ± 0.164
2.088GlnGly: 2.088 ± 0.191
0.763GlnHis: 0.763 ± 0.101
2.182GlnIle: 2.182 ± 0.193
1.78GlnLys: 1.78 ± 0.165
3.694GlnLeu: 3.694 ± 0.213
1.178GlnMet: 1.178 ± 0.123
1.539GlnAsn: 1.539 ± 0.126
1.526GlnPro: 1.526 ± 0.152
1.941GlnGln: 1.941 ± 0.203
2.436GlnArg: 2.436 ± 0.185
1.753GlnSer: 1.753 ± 0.146
2.195GlnThr: 2.195 ± 0.156
2.382GlnVal: 2.382 ± 0.216
0.602GlnTrp: 0.602 ± 0.083
1.432GlnTyr: 1.432 ± 0.121
0.0GlnXaa: 0.0 ± 0.0
Arg
3.534ArgAla: 3.534 ± 0.229
0.629ArgCys: 0.629 ± 0.092
3.319ArgAsp: 3.319 ± 0.217
3.212ArgGlu: 3.212 ± 0.228
2.449ArgPhe: 2.449 ± 0.172
3.681ArgGly: 3.681 ± 0.254
1.138ArgHis: 1.138 ± 0.124
3.386ArgIle: 3.386 ± 0.212
3.36ArgLys: 3.36 ± 0.252
5.421ArgLeu: 5.421 ± 0.297
1.633ArgMet: 1.633 ± 0.144
2.824ArgAsn: 2.824 ± 0.168
2.168ArgPro: 2.168 ± 0.212
1.981ArgGln: 1.981 ± 0.169
3.534ArgArg: 3.534 ± 0.248
2.824ArgSer: 2.824 ± 0.216
3.159ArgThr: 3.159 ± 0.231
3.453ArgVal: 3.453 ± 0.258
1.138ArgTrp: 1.138 ± 0.141
2.61ArgTyr: 2.61 ± 0.171
0.0ArgXaa: 0.0 ± 0.0
Ser
4.404SerAla: 4.404 ± 0.257
0.455SerCys: 0.455 ± 0.086
3.132SerAsp: 3.132 ± 0.21
3.239SerGlu: 3.239 ± 0.216
2.101SerPhe: 2.101 ± 0.165
4.323SerGly: 4.323 ± 0.245
0.924SerHis: 0.924 ± 0.114
3.547SerIle: 3.547 ± 0.189
3.212SerLys: 3.212 ± 0.182
5.247SerLeu: 5.247 ± 0.241
1.579SerMet: 1.579 ± 0.155
2.53SerAsn: 2.53 ± 0.171
2.583SerPro: 2.583 ± 0.185
2.168SerGln: 2.168 ± 0.139
2.931SerArg: 2.931 ± 0.197
3.507SerSer: 3.507 ± 0.252
3.56SerThr: 3.56 ± 0.259
4.056SerVal: 4.056 ± 0.204
0.776SerTrp: 0.776 ± 0.098
2.021SerTyr: 2.021 ± 0.153
0.0SerXaa: 0.0 ± 0.0
Thr
5.019ThrAla: 5.019 ± 0.279
0.361ThrCys: 0.361 ± 0.077
4.243ThrAsp: 4.243 ± 0.231
4.002ThrGlu: 4.002 ± 0.244
2.864ThrPhe: 2.864 ± 0.216
4.658ThrGly: 4.658 ± 0.241
1.312ThrHis: 1.312 ± 0.142
3.721ThrIle: 3.721 ± 0.279
2.958ThrLys: 2.958 ± 0.188
6.746ThrLeu: 6.746 ± 0.349
1.579ThrMet: 1.579 ± 0.129
2.904ThrAsn: 2.904 ± 0.203
3.681ThrPro: 3.681 ± 0.214
2.021ThrGln: 2.021 ± 0.181
2.945ThrArg: 2.945 ± 0.227
3.56ThrSer: 3.56 ± 0.243
4.618ThrThr: 4.618 ± 0.318
5.166ThrVal: 5.166 ± 0.32
1.111ThrTrp: 1.111 ± 0.128
2.356ThrTyr: 2.356 ± 0.162
0.0ThrXaa: 0.0 ± 0.0
Val
5.769ValAla: 5.769 ± 0.294
0.642ValCys: 0.642 ± 0.097
5.019ValAsp: 5.019 ± 0.237
4.952ValGlu: 4.952 ± 0.299
2.476ValPhe: 2.476 ± 0.196
4.096ValGly: 4.096 ± 0.206
1.138ValHis: 1.138 ± 0.118
4.082ValIle: 4.082 ± 0.219
4.122ValLys: 4.122 ± 0.218
5.501ValLeu: 5.501 ± 0.29
1.673ValMet: 1.673 ± 0.14
3.681ValAsn: 3.681 ± 0.246
3.306ValPro: 3.306 ± 0.199
2.423ValGln: 2.423 ± 0.192
3.507ValArg: 3.507 ± 0.214
3.841ValSer: 3.841 ± 0.234
4.966ValThr: 4.966 ± 0.26
5.688ValVal: 5.688 ± 0.348
0.937ValTrp: 0.937 ± 0.128
2.945ValTyr: 2.945 ± 0.213
0.0ValXaa: 0.0 ± 0.0
Trp
1.084TrpAla: 1.084 ± 0.125
0.134TrpCys: 0.134 ± 0.037
1.017TrpAsp: 1.017 ± 0.114
0.977TrpGlu: 0.977 ± 0.111
0.602TrpPhe: 0.602 ± 0.088
0.683TrpGly: 0.683 ± 0.104
0.348TrpHis: 0.348 ± 0.067
0.723TrpIle: 0.723 ± 0.097
0.924TrpLys: 0.924 ± 0.108
1.646TrpLeu: 1.646 ± 0.15
0.468TrpMet: 0.468 ± 0.087
0.589TrpAsn: 0.589 ± 0.088
0.549TrpPro: 0.549 ± 0.098
0.535TrpGln: 0.535 ± 0.084
0.763TrpArg: 0.763 ± 0.092
0.75TrpSer: 0.75 ± 0.1
0.924TrpThr: 0.924 ± 0.12
1.004TrpVal: 1.004 ± 0.133
0.294TrpTrp: 0.294 ± 0.073
0.736TrpTyr: 0.736 ± 0.105
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.69TyrAla: 2.69 ± 0.18
0.468TyrCys: 0.468 ± 0.091
2.931TyrAsp: 2.931 ± 0.212
2.048TyrGlu: 2.048 ± 0.163
1.392TyrPhe: 1.392 ± 0.155
3.092TyrGly: 3.092 ± 0.171
1.098TyrHis: 1.098 ± 0.135
2.088TyrIle: 2.088 ± 0.154
1.82TyrLys: 1.82 ± 0.197
3.547TyrLeu: 3.547 ± 0.213
1.191TyrMet: 1.191 ± 0.124
2.356TyrAsn: 2.356 ± 0.223
1.981TyrPro: 1.981 ± 0.163
1.901TyrGln: 1.901 ± 0.156
2.597TyrArg: 2.597 ± 0.231
2.329TyrSer: 2.329 ± 0.178
2.891TyrThr: 2.891 ± 0.206
2.423TyrVal: 2.423 ± 0.187
0.549TyrTrp: 0.549 ± 0.08
2.034TyrTyr: 2.034 ± 0.203
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 261 proteins (74714 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
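A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed on each resample, and the spread of the 100 estimates is taken as the error. This is only a sketch under assumptions: the function names, the fixed seed, the use of the sample standard deviation as the error, and the toy sequences are not taken from the source.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences, not taken from the SPFM13 proteome.
toy = ["MAAK", "AAL", "GGAA", "KLM"]
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.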

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski