Amino acid dipepetide frequency for Aeromonas phage phiAS5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.259AlaAla: 5.259 ± 0.333
0.739AlaCys: 0.739 ± 0.119
4.071AlaAsp: 4.071 ± 0.251
4.419AlaGlu: 4.419 ± 0.283
2.811AlaPhe: 2.811 ± 0.217
4.796AlaGly: 4.796 ± 0.306
1.087AlaHis: 1.087 ± 0.131
4.709AlaIle: 4.709 ± 0.222
5.795AlaLys: 5.795 ± 0.319
4.752AlaLeu: 4.752 ± 0.27
2.506AlaMet: 2.506 ± 0.195
3.912AlaAsn: 3.912 ± 0.294
1.912AlaPro: 1.912 ± 0.187
2.318AlaGln: 2.318 ± 0.209
3.535AlaArg: 3.535 ± 0.246
3.955AlaSer: 3.955 ± 0.236
4.448AlaThr: 4.448 ± 0.368
4.651AlaVal: 4.651 ± 0.253
0.913AlaTrp: 0.913 ± 0.114
2.883AlaTyr: 2.883 ± 0.167
0.0AlaXaa: 0.0 ± 0.0
Cys
0.551CysAla: 0.551 ± 0.1
0.13CysCys: 0.13 ± 0.042
1.029CysAsp: 1.029 ± 0.145
0.797CysGlu: 0.797 ± 0.117
0.391CysPhe: 0.391 ± 0.063
0.782CysGly: 0.782 ± 0.111
0.304CysHis: 0.304 ± 0.076
0.724CysIle: 0.724 ± 0.098
0.942CysLys: 0.942 ± 0.109
0.739CysLeu: 0.739 ± 0.099
0.304CysMet: 0.304 ± 0.065
0.681CysAsn: 0.681 ± 0.108
0.594CysPro: 0.594 ± 0.101
0.435CysGln: 0.435 ± 0.069
0.884CysArg: 0.884 ± 0.117
0.681CysSer: 0.681 ± 0.124
0.464CysThr: 0.464 ± 0.068
0.927CysVal: 0.927 ± 0.129
0.217CysTrp: 0.217 ± 0.051
0.565CysTyr: 0.565 ± 0.083
0.0CysXaa: 0.0 ± 0.0
Asp
4.448AspAla: 4.448 ± 0.254
0.797AspCys: 0.797 ± 0.131
3.767AspAsp: 3.767 ± 0.265
4.158AspGlu: 4.158 ± 0.241
3.028AspPhe: 3.028 ± 0.207
4.738AspGly: 4.738 ± 0.278
1.203AspHis: 1.203 ± 0.138
5.042AspIle: 5.042 ± 0.298
4.144AspLys: 4.144 ± 0.235
4.998AspLeu: 4.998 ± 0.263
2.101AspMet: 2.101 ± 0.194
2.869AspAsn: 2.869 ± 0.234
2.101AspPro: 2.101 ± 0.224
1.84AspGln: 1.84 ± 0.183
2.724AspArg: 2.724 ± 0.183
3.405AspSer: 3.405 ± 0.241
3.303AspThr: 3.303 ± 0.251
5.187AspVal: 5.187 ± 0.283
1.029AspTrp: 1.029 ± 0.118
3.39AspTyr: 3.39 ± 0.222
0.0AspXaa: 0.0 ± 0.0
Glu
4.651GluAla: 4.651 ± 0.321
1.058GluCys: 1.058 ± 0.111
3.289GluAsp: 3.289 ± 0.234
4.158GluGlu: 4.158 ± 0.268
3.231GluPhe: 3.231 ± 0.219
3.043GluGly: 3.043 ± 0.193
1.376GluHis: 1.376 ± 0.155
5.462GluIle: 5.462 ± 0.275
4.071GluLys: 4.071 ± 0.31
6.273GluLeu: 6.273 ± 0.389
2.347GluMet: 2.347 ± 0.17
3.463GluAsn: 3.463 ± 0.208
1.521GluPro: 1.521 ± 0.166
3.014GluGln: 3.014 ± 0.209
3.767GluArg: 3.767 ± 0.246
4.1GluSer: 4.1 ± 0.244
4.1GluThr: 4.1 ± 0.254
4.564GluVal: 4.564 ± 0.282
0.956GluTrp: 0.956 ± 0.113
3.086GluTyr: 3.086 ± 0.211
0.0GluXaa: 0.0 ± 0.0
Phe
3.115PheAla: 3.115 ± 0.22
0.536PheCys: 0.536 ± 0.088
3.665PheAsp: 3.665 ± 0.225
3.463PheGlu: 3.463 ± 0.279
1.405PhePhe: 1.405 ± 0.13
3.1PheGly: 3.1 ± 0.223
0.898PheHis: 0.898 ± 0.116
2.593PheIle: 2.593 ± 0.217
3.202PheLys: 3.202 ± 0.232
2.188PheLeu: 2.188 ± 0.181
1.434PheMet: 1.434 ± 0.156
2.304PheAsn: 2.304 ± 0.183
1.014PhePro: 1.014 ± 0.144
1.0PheGln: 1.0 ± 0.105
2.086PheArg: 2.086 ± 0.169
2.376PheSer: 2.376 ± 0.202
2.651PheThr: 2.651 ± 0.169
3.55PheVal: 3.55 ± 0.23
0.406PheTrp: 0.406 ± 0.069
1.666PheTyr: 1.666 ± 0.165
0.0PheXaa: 0.0 ± 0.0
Gly
4.086GlyAla: 4.086 ± 0.292
0.768GlyCys: 0.768 ± 0.118
4.317GlyAsp: 4.317 ± 0.339
4.346GlyGlu: 4.346 ± 0.218
2.956GlyPhe: 2.956 ± 0.207
3.723GlyGly: 3.723 ± 0.326
1.014GlyHis: 1.014 ± 0.123
4.375GlyIle: 4.375 ± 0.222
5.578GlyLys: 5.578 ± 0.306
4.346GlyLeu: 4.346 ± 0.254
1.985GlyMet: 1.985 ± 0.178
3.883GlyAsn: 3.883 ± 0.275
1.188GlyPro: 1.188 ± 0.122
1.956GlyGln: 1.956 ± 0.19
2.84GlyArg: 2.84 ± 0.186
3.883GlySer: 3.883 ± 0.284
4.462GlyThr: 4.462 ± 0.338
4.651GlyVal: 4.651 ± 0.249
0.869GlyTrp: 0.869 ± 0.118
3.26GlyTyr: 3.26 ± 0.185
0.0GlyXaa: 0.0 ± 0.0
His
1.26HisAla: 1.26 ± 0.15
0.246HisCys: 0.246 ± 0.055
1.26HisAsp: 1.26 ± 0.127
1.42HisGlu: 1.42 ± 0.162
0.884HisPhe: 0.884 ± 0.126
1.376HisGly: 1.376 ± 0.149
0.551HisHis: 0.551 ± 0.117
1.376HisIle: 1.376 ± 0.157
1.145HisLys: 1.145 ± 0.137
1.55HisLeu: 1.55 ± 0.164
0.565HisMet: 0.565 ± 0.083
1.188HisAsn: 1.188 ± 0.126
0.768HisPro: 0.768 ± 0.118
0.536HisGln: 0.536 ± 0.087
0.637HisArg: 0.637 ± 0.105
1.42HisSer: 1.42 ± 0.146
0.84HisThr: 0.84 ± 0.107
1.507HisVal: 1.507 ± 0.153
0.464HisTrp: 0.464 ± 0.09
0.855HisTyr: 0.855 ± 0.117
0.0HisXaa: 0.0 ± 0.0
Ile
5.52IleAla: 5.52 ± 0.286
0.855IleCys: 0.855 ± 0.117
5.375IleAsp: 5.375 ± 0.279
5.824IleGlu: 5.824 ± 0.329
1.623IlePhe: 1.623 ± 0.145
4.361IleGly: 4.361 ± 0.291
1.347IleHis: 1.347 ± 0.142
3.564IleIle: 3.564 ± 0.24
4.94IleLys: 4.94 ± 0.297
3.926IleLeu: 3.926 ± 0.219
2.275IleMet: 2.275 ± 0.197
3.622IleAsn: 3.622 ± 0.213
2.55IlePro: 2.55 ± 0.185
2.26IleGln: 2.26 ± 0.156
4.1IleArg: 4.1 ± 0.265
3.926IleSer: 3.926 ± 0.237
4.144IleThr: 4.144 ± 0.258
4.578IleVal: 4.578 ± 0.245
0.609IleTrp: 0.609 ± 0.083
2.173IleTyr: 2.173 ± 0.169
0.0IleXaa: 0.0 ± 0.0
Lys
5.245LysAla: 5.245 ± 0.309
0.739LysCys: 0.739 ± 0.087
4.723LysAsp: 4.723 ± 0.276
4.854LysGlu: 4.854 ± 0.302
3.55LysPhe: 3.55 ± 0.231
3.55LysGly: 3.55 ± 0.226
1.797LysHis: 1.797 ± 0.154
5.085LysIle: 5.085 ± 0.275
4.491LysLys: 4.491 ± 0.338
5.65LysLeu: 5.65 ± 0.3
2.521LysMet: 2.521 ± 0.199
3.868LysAsn: 3.868 ± 0.228
2.55LysPro: 2.55 ± 0.206
2.97LysGln: 2.97 ± 0.248
4.028LysArg: 4.028 ± 0.255
4.462LysSer: 4.462 ± 0.29
4.245LysThr: 4.245 ± 0.241
4.868LysVal: 4.868 ± 0.251
1.0LysTrp: 1.0 ± 0.113
2.97LysTyr: 2.97 ± 0.247
0.0LysXaa: 0.0 ± 0.0
Leu
4.738LeuAla: 4.738 ± 0.293
1.072LeuCys: 1.072 ± 0.119
4.607LeuAsp: 4.607 ± 0.267
4.883LeuGlu: 4.883 ± 0.278
2.564LeuPhe: 2.564 ± 0.195
4.375LeuGly: 4.375 ± 0.204
1.55LeuHis: 1.55 ± 0.139
4.245LeuIle: 4.245 ± 0.263
5.636LeuLys: 5.636 ± 0.343
4.202LeuLeu: 4.202 ± 0.278
2.072LeuMet: 2.072 ± 0.165
3.767LeuAsn: 3.767 ± 0.237
2.898LeuPro: 2.898 ± 0.204
2.304LeuGln: 2.304 ± 0.192
3.216LeuArg: 3.216 ± 0.196
4.448LeuSer: 4.448 ± 0.253
4.231LeuThr: 4.231 ± 0.226
5.027LeuVal: 5.027 ± 0.231
0.637LeuTrp: 0.637 ± 0.103
2.695LeuTyr: 2.695 ± 0.204
0.0LeuXaa: 0.0 ± 0.0
Met
1.898MetAla: 1.898 ± 0.156
0.435MetCys: 0.435 ± 0.074
1.637MetAsp: 1.637 ± 0.144
1.927MetGlu: 1.927 ± 0.184
1.507MetPhe: 1.507 ± 0.156
1.695MetGly: 1.695 ± 0.156
0.681MetHis: 0.681 ± 0.106
2.55MetIle: 2.55 ± 0.176
3.086MetLys: 3.086 ± 0.216
2.333MetLeu: 2.333 ± 0.228
0.869MetMet: 0.869 ± 0.127
2.072MetAsn: 2.072 ± 0.177
0.913MetPro: 0.913 ± 0.127
1.159MetGln: 1.159 ± 0.137
1.536MetArg: 1.536 ± 0.155
2.521MetSer: 2.521 ± 0.208
2.159MetThr: 2.159 ± 0.175
2.028MetVal: 2.028 ± 0.173
0.246MetTrp: 0.246 ± 0.063
1.014MetTyr: 1.014 ± 0.118
0.0MetXaa: 0.0 ± 0.0
Asn
4.129AsnAla: 4.129 ± 0.247
0.681AsnCys: 0.681 ± 0.096
3.1AsnAsp: 3.1 ± 0.211
3.26AsnGlu: 3.26 ± 0.211
2.405AsnPhe: 2.405 ± 0.22
4.564AsnGly: 4.564 ± 0.339
1.072AsnHis: 1.072 ± 0.141
3.202AsnIle: 3.202 ± 0.214
3.912AsnLys: 3.912 ± 0.284
4.042AsnLeu: 4.042 ± 0.239
1.826AsnMet: 1.826 ± 0.161
2.695AsnAsn: 2.695 ± 0.231
2.115AsnPro: 2.115 ± 0.19
1.666AsnGln: 1.666 ± 0.143
2.666AsnArg: 2.666 ± 0.216
3.129AsnSer: 3.129 ± 0.213
2.927AsnThr: 2.927 ± 0.254
3.796AsnVal: 3.796 ± 0.244
0.637AsnTrp: 0.637 ± 0.087
1.753AsnTyr: 1.753 ± 0.157
0.0AsnXaa: 0.0 ± 0.0
Pro
2.028ProAla: 2.028 ± 0.19
0.319ProCys: 0.319 ± 0.061
2.231ProAsp: 2.231 ± 0.179
1.985ProGlu: 1.985 ± 0.148
1.492ProPhe: 1.492 ± 0.134
1.869ProGly: 1.869 ± 0.223
0.507ProHis: 0.507 ± 0.088
1.985ProIle: 1.985 ± 0.157
2.477ProLys: 2.477 ± 0.205
2.115ProLeu: 2.115 ± 0.192
0.913ProMet: 0.913 ± 0.11
1.84ProAsn: 1.84 ± 0.163
0.739ProPro: 0.739 ± 0.106
0.971ProGln: 0.971 ± 0.092
1.405ProArg: 1.405 ± 0.138
2.115ProSer: 2.115 ± 0.152
2.275ProThr: 2.275 ± 0.183
2.97ProVal: 2.97 ± 0.215
0.406ProTrp: 0.406 ± 0.094
1.188ProTyr: 1.188 ± 0.131
0.0ProXaa: 0.0 ± 0.0
Gln
2.376GlnAla: 2.376 ± 0.213
0.377GlnCys: 0.377 ± 0.084
1.666GlnAsp: 1.666 ± 0.178
2.318GlnGlu: 2.318 ± 0.202
1.608GlnPhe: 1.608 ± 0.158
2.028GlnGly: 2.028 ± 0.184
0.71GlnHis: 0.71 ± 0.091
2.709GlnIle: 2.709 ± 0.215
2.173GlnLys: 2.173 ± 0.193
2.405GlnLeu: 2.405 ± 0.179
1.347GlnMet: 1.347 ± 0.15
1.753GlnAsn: 1.753 ± 0.164
0.971GlnPro: 0.971 ± 0.112
1.174GlnGln: 1.174 ± 0.122
1.826GlnArg: 1.826 ± 0.159
2.173GlnSer: 2.173 ± 0.191
1.883GlnThr: 1.883 ± 0.162
2.463GlnVal: 2.463 ± 0.213
0.464GlnTrp: 0.464 ± 0.084
1.159GlnTyr: 1.159 ± 0.137
0.0GlnXaa: 0.0 ± 0.0
Arg
3.448ArgAla: 3.448 ± 0.2
0.551ArgCys: 0.551 ± 0.095
3.231ArgAsp: 3.231 ± 0.223
3.419ArgGlu: 3.419 ± 0.218
2.246ArgPhe: 2.246 ± 0.214
3.086ArgGly: 3.086 ± 0.174
1.058ArgHis: 1.058 ± 0.127
3.158ArgIle: 3.158 ± 0.229
3.767ArgLys: 3.767 ± 0.248
3.564ArgLeu: 3.564 ± 0.22
1.854ArgMet: 1.854 ± 0.166
2.26ArgAsn: 2.26 ± 0.169
1.434ArgPro: 1.434 ± 0.13
1.739ArgGln: 1.739 ± 0.168
2.506ArgArg: 2.506 ± 0.2
2.782ArgSer: 2.782 ± 0.209
2.767ArgThr: 2.767 ± 0.192
4.173ArgVal: 4.173 ± 0.275
0.855ArgTrp: 0.855 ± 0.117
2.767ArgTyr: 2.767 ± 0.214
0.0ArgXaa: 0.0 ± 0.0
Ser
3.781SerAla: 3.781 ± 0.231
0.695SerCys: 0.695 ± 0.095
3.738SerAsp: 3.738 ± 0.258
3.738SerGlu: 3.738 ± 0.256
2.666SerPhe: 2.666 ± 0.189
4.709SerGly: 4.709 ± 0.327
1.289SerHis: 1.289 ± 0.141
4.071SerIle: 4.071 ± 0.212
4.013SerLys: 4.013 ± 0.244
4.028SerLeu: 4.028 ± 0.283
2.072SerMet: 2.072 ± 0.153
3.071SerAsn: 3.071 ± 0.266
1.854SerPro: 1.854 ± 0.156
1.985SerGln: 1.985 ± 0.144
3.434SerArg: 3.434 ± 0.211
3.825SerSer: 3.825 ± 0.252
3.521SerThr: 3.521 ± 0.269
4.26SerVal: 4.26 ± 0.212
0.753SerTrp: 0.753 ± 0.093
2.753SerTyr: 2.753 ± 0.186
0.0SerXaa: 0.0 ± 0.0
Thr
4.144ThrAla: 4.144 ± 0.385
0.652ThrCys: 0.652 ± 0.101
3.521ThrAsp: 3.521 ± 0.26
3.839ThrGlu: 3.839 ± 0.257
2.898ThrPhe: 2.898 ± 0.194
4.419ThrGly: 4.419 ± 0.357
1.043ThrHis: 1.043 ± 0.127
4.086ThrIle: 4.086 ± 0.29
4.375ThrLys: 4.375 ± 0.242
3.941ThrLeu: 3.941 ± 0.224
1.652ThrMet: 1.652 ± 0.158
3.231ThrAsn: 3.231 ± 0.26
2.622ThrPro: 2.622 ± 0.283
1.681ThrGln: 1.681 ± 0.191
2.753ThrArg: 2.753 ± 0.226
3.231ThrSer: 3.231 ± 0.226
3.521ThrThr: 3.521 ± 0.319
4.897ThrVal: 4.897 ± 0.295
0.724ThrTrp: 0.724 ± 0.11
2.028ThrTyr: 2.028 ± 0.183
0.0ThrXaa: 0.0 ± 0.0
Val
5.114ValAla: 5.114 ± 0.281
0.797ValCys: 0.797 ± 0.111
5.317ValAsp: 5.317 ± 0.253
5.52ValGlu: 5.52 ± 0.332
3.144ValPhe: 3.144 ± 0.216
4.694ValGly: 4.694 ± 0.323
1.26ValHis: 1.26 ± 0.127
5.172ValIle: 5.172 ± 0.257
5.433ValLys: 5.433 ± 0.254
4.549ValLeu: 4.549 ± 0.301
2.043ValMet: 2.043 ± 0.192
3.825ValAsn: 3.825 ± 0.217
2.448ValPro: 2.448 ± 0.195
2.347ValGln: 2.347 ± 0.203
3.637ValArg: 3.637 ± 0.232
4.738ValSer: 4.738 ± 0.231
4.361ValThr: 4.361 ± 0.282
5.578ValVal: 5.578 ± 0.307
0.797ValTrp: 0.797 ± 0.116
3.014ValTyr: 3.014 ± 0.21
0.0ValXaa: 0.0 ± 0.0
Trp
0.797TrpAla: 0.797 ± 0.125
0.072TrpCys: 0.072 ± 0.029
0.971TrpAsp: 0.971 ± 0.114
0.927TrpGlu: 0.927 ± 0.115
0.71TrpPhe: 0.71 ± 0.108
0.565TrpGly: 0.565 ± 0.091
0.348TrpHis: 0.348 ± 0.069
0.826TrpIle: 0.826 ± 0.119
1.072TrpLys: 1.072 ± 0.122
0.753TrpLeu: 0.753 ± 0.099
0.464TrpMet: 0.464 ± 0.067
0.797TrpAsn: 0.797 ± 0.101
0.203TrpPro: 0.203 ± 0.058
0.522TrpGln: 0.522 ± 0.091
0.652TrpArg: 0.652 ± 0.108
0.623TrpSer: 0.623 ± 0.104
0.666TrpThr: 0.666 ± 0.103
1.029TrpVal: 1.029 ± 0.096
0.203TrpTrp: 0.203 ± 0.05
0.71TrpTyr: 0.71 ± 0.104
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.883TyrAla: 2.883 ± 0.192
0.623TyrCys: 0.623 ± 0.098
2.767TyrAsp: 2.767 ± 0.188
2.275TyrGlu: 2.275 ± 0.182
1.536TyrPhe: 1.536 ± 0.156
3.014TyrGly: 3.014 ± 0.184
0.753TyrHis: 0.753 ± 0.099
2.637TyrIle: 2.637 ± 0.248
2.912TyrLys: 2.912 ± 0.215
2.811TyrLeu: 2.811 ± 0.202
1.101TyrMet: 1.101 ± 0.119
2.477TyrAsn: 2.477 ± 0.189
1.434TyrPro: 1.434 ± 0.154
1.71TyrGln: 1.71 ± 0.147
2.434TyrArg: 2.434 ± 0.226
2.463TyrSer: 2.463 ± 0.202
2.289TyrThr: 2.289 ± 0.196
3.086TyrVal: 3.086 ± 0.213
0.724TyrTrp: 0.724 ± 0.111
1.637TyrTyr: 1.637 ± 0.164
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 343 proteins (69023 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski