Amino acid dipeptide frequency for Aeromonas phage Aswh_1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
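As an illustration, the counting step could look roughly like the sketch below. It is not the database's own code: the function name dipeptide_frequencies and the toy sequences are hypothetical, one-letter residue codes are used instead of the three-letter codes in the table, and normalising by the total number of overlapping pairs is an assumption.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: dipeptides are overlapping pairs of adjacent residues,
    counted within each protein and normalised by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        # zip pairs every residue with its successor in the same protein
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the proteome
toy = ["MKKLAE", "MAKK"]
freqs = dipeptide_frequencies(toy)
print(freqs["KK"])
```

Pairs are never formed across protein boundaries here, since each sequence is processed separately.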

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
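The arithmetic above can be checked with a short sketch. The three sample values are copied from the table; the helper name classify is hypothetical, and using exactly the uniform expectation as the red/blue threshold is an assumption, since the page does not state the cut-off it uses.

```python
# Uniform expectation in per mille for 20 and 21 residue types
EXPECTED_20 = 1000.0 / (20 * 20)  # 2.5 per mille, i.e. 0.25%
EXPECTED_21 = 1000.0 / (21 * 21)  # roughly 2.27 per mille when Xaa is included


def classify(freq_permille, expected=EXPECTED_20):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Values taken from the table, in per mille
sample = {"LysLys": 6.974, "AlaAla": 3.058, "CysCys": 0.186}
for name, value in sample.items():
    fraction = value * 1e-3  # per mille to fraction
    print(name, fraction, classify(value))
```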

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error, in ‰. Rows give the first amino acid of the dipeptide, columns the second.

1st\2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.058 ± 0.252 | 0.657 ± 0.096 | 2.615 ± 0.211 | 2.773 ± 0.185 | 1.972 ± 0.138 | 3.501 ± 0.255 | 0.872 ± 0.105 | 3.601 ± 0.19 | 4.073 ± 0.275 | 4.145 ± 0.228 | 1.815 ± 0.148 | 2.858 ± 0.237 | 1.315 ± 0.121 | 1.615 ± 0.178 | 2.515 ± 0.216 | 3.144 ± 0.219 | 3.073 ± 0.312 | 3.073 ± 0.195 | 0.7 ± 0.083 | 1.858 ± 0.168 | 0.0 ± 0.0
Cys | 0.486 ± 0.088 | 0.186 ± 0.053 | 0.972 ± 0.151 | 0.943 ± 0.125 | 0.572 ± 0.096 | 1.158 ± 0.147 | 0.357 ± 0.07 | 1.0 ± 0.128 | 1.058 ± 0.122 | 0.943 ± 0.115 | 0.414 ± 0.081 | 0.629 ± 0.093 | 0.6 ± 0.103 | 0.343 ± 0.07 | 0.529 ± 0.094 | 0.829 ± 0.114 | 0.472 ± 0.097 | 1.058 ± 0.101 | 0.229 ± 0.057 | 0.529 ± 0.093 | 0.0 ± 0.0
Asp | 3.173 ± 0.227 | 0.943 ± 0.114 | 3.802 ± 0.245 | 4.602 ± 0.317 | 3.644 ± 0.207 | 5.374 ± 0.283 | 1.172 ± 0.135 | 5.359 ± 0.284 | 4.116 ± 0.247 | 5.402 ± 0.325 | 2.072 ± 0.166 | 3.33 ± 0.192 | 2.644 ± 0.18 | 2.072 ± 0.161 | 2.372 ± 0.215 | 3.83 ± 0.234 | 3.416 ± 0.253 | 5.316 ± 0.274 | 1.2 ± 0.139 | 3.201 ± 0.216 | 0.0 ± 0.0
Glu | 3.401 ± 0.235 | 0.886 ± 0.115 | 4.216 ± 0.273 | 5.031 ± 0.334 | 3.716 ± 0.235 | 3.301 ± 0.236 | 1.601 ± 0.132 | 5.517 ± 0.302 | 5.459 ± 0.336 | 6.088 ± 0.313 | 2.601 ± 0.21 | 4.13 ± 0.279 | 1.815 ± 0.155 | 2.315 ± 0.199 | 2.815 ± 0.231 | 4.445 ± 0.347 | 4.159 ± 0.251 | 4.988 ± 0.245 | 0.943 ± 0.122 | 3.716 ± 0.266 | 0.0 ± 0.0
Phe | 2.387 ± 0.173 | 0.643 ± 0.116 | 4.33 ± 0.27 | 3.001 ± 0.192 | 2.101 ± 0.166 | 3.201 ± 0.199 | 0.8 ± 0.101 | 3.716 ± 0.25 | 3.444 ± 0.251 | 2.615 ± 0.198 | 1.115 ± 0.14 | 2.773 ± 0.209 | 1.515 ± 0.155 | 1.515 ± 0.143 | 1.672 ± 0.15 | 3.158 ± 0.197 | 2.315 ± 0.159 | 3.902 ± 0.249 | 0.657 ± 0.106 | 2.301 ± 0.175 | 0.0 ± 0.0
Gly | 2.83 ± 0.226 | 1.043 ± 0.132 | 4.273 ± 0.277 | 3.787 ± 0.225 | 2.987 ± 0.208 | 3.93 ± 0.404 | 1.186 ± 0.146 | 4.388 ± 0.241 | 4.53 ± 0.285 | 4.287 ± 0.255 | 2.301 ± 0.173 | 3.687 ± 0.343 | 1.186 ± 0.129 | 1.872 ± 0.192 | 2.815 ± 0.171 | 5.016 ± 0.36 | 3.673 ± 0.274 | 5.316 ± 0.299 | 1.229 ± 0.131 | 3.73 ± 0.232 | 0.0 ± 0.0
His | 0.829 ± 0.112 | 0.257 ± 0.061 | 1.315 ± 0.144 | 1.258 ± 0.124 | 0.929 ± 0.109 | 1.543 ± 0.142 | 0.443 ± 0.082 | 1.501 ± 0.148 | 1.401 ± 0.153 | 1.301 ± 0.123 | 0.786 ± 0.104 | 1.143 ± 0.133 | 0.886 ± 0.117 | 0.443 ± 0.08 | 0.815 ± 0.124 | 1.129 ± 0.127 | 1.2 ± 0.122 | 1.543 ± 0.138 | 0.272 ± 0.06 | 0.786 ± 0.105 | 0.0 ± 0.0
Ile | 3.401 ± 0.253 | 0.958 ± 0.132 | 5.431 ± 0.314 | 5.774 ± 0.35 | 2.572 ± 0.218 | 4.216 ± 0.268 | 1.543 ± 0.146 | 4.502 ± 0.242 | 5.417 ± 0.272 | 4.959 ± 0.264 | 2.015 ± 0.169 | 4.245 ± 0.244 | 2.944 ± 0.206 | 2.587 ± 0.182 | 3.773 ± 0.23 | 4.731 ± 0.26 | 3.93 ± 0.246 | 4.53 ± 0.247 | 0.572 ± 0.096 | 2.93 ± 0.188 | 0.0 ± 0.0
Lys | 4.173 ± 0.271 | 0.7 ± 0.107 | 5.202 ± 0.334 | 6.86 ± 0.366 | 3.759 ± 0.264 | 3.63 ± 0.268 | 1.672 ± 0.172 | 5.288 ± 0.306 | 6.974 ± 0.407 | 5.331 ± 0.268 | 2.944 ± 0.215 | 4.402 ± 0.255 | 2.287 ± 0.203 | 2.344 ± 0.169 | 2.587 ± 0.213 | 4.931 ± 0.25 | 4.216 ± 0.251 | 5.074 ± 0.302 | 0.857 ± 0.095 | 3.83 ± 0.229 | 0.0 ± 0.0
Leu | 3.83 ± 0.255 | 1.072 ± 0.124 | 5.231 ± 0.3 | 5.345 ± 0.314 | 3.073 ± 0.213 | 4.402 ± 0.26 | 1.501 ± 0.138 | 4.302 ± 0.246 | 5.702 ± 0.326 | 4.316 ± 0.253 | 2.015 ± 0.17 | 4.087 ± 0.254 | 2.544 ± 0.184 | 2.044 ± 0.173 | 2.673 ± 0.165 | 5.459 ± 0.292 | 4.002 ± 0.246 | 5.231 ± 0.306 | 0.686 ± 0.107 | 3.073 ± 0.224 | 0.0 ± 0.0
Met | 1.515 ± 0.143 | 0.343 ± 0.074 | 1.829 ± 0.181 | 2.072 ± 0.169 | 1.386 ± 0.139 | 1.601 ± 0.17 | 0.557 ± 0.084 | 2.287 ± 0.176 | 3.544 ± 0.289 | 2.172 ± 0.196 | 0.972 ± 0.151 | 2.144 ± 0.164 | 0.715 ± 0.097 | 0.772 ± 0.091 | 1.029 ± 0.129 | 2.401 ± 0.171 | 2.072 ± 0.166 | 2.287 ± 0.193 | 0.343 ± 0.062 | 1.215 ± 0.139 | 0.0 ± 0.0
Asn | 2.687 ± 0.226 | 0.8 ± 0.1 | 3.058 ± 0.204 | 3.601 ± 0.246 | 2.515 ± 0.222 | 4.43 ± 0.302 | 1.2 ± 0.134 | 3.859 ± 0.22 | 4.359 ± 0.287 | 4.245 ± 0.248 | 1.886 ± 0.167 | 2.63 ± 0.28 | 2.172 ± 0.195 | 2.115 ± 0.185 | 2.815 ± 0.203 | 3.459 ± 0.208 | 3.216 ± 0.283 | 3.416 ± 0.221 | 0.757 ± 0.098 | 2.558 ± 0.197 | 0.0 ± 0.0
Pro | 1.486 ± 0.153 | 0.314 ± 0.076 | 2.687 ± 0.191 | 2.372 ± 0.18 | 1.772 ± 0.159 | 1.315 ± 0.152 | 0.743 ± 0.108 | 1.744 ± 0.147 | 2.33 ± 0.17 | 1.944 ± 0.165 | 0.7 ± 0.109 | 2.115 ± 0.191 | 0.872 ± 0.118 | 1.0 ± 0.116 | 1.143 ± 0.141 | 2.644 ± 0.21 | 2.087 ± 0.212 | 2.901 ± 0.203 | 0.486 ± 0.07 | 1.601 ± 0.151 | 0.0 ± 0.0
Gln | 1.872 ± 0.16 | 0.3 ± 0.06 | 1.944 ± 0.202 | 2.215 ± 0.179 | 1.886 ± 0.166 | 1.586 ± 0.13 | 0.529 ± 0.088 | 2.53 ± 0.207 | 2.315 ± 0.189 | 2.244 ± 0.207 | 0.843 ± 0.123 | 1.901 ± 0.147 | 1.015 ± 0.129 | 1.243 ± 0.15 | 1.272 ± 0.155 | 1.744 ± 0.146 | 1.629 ± 0.158 | 1.929 ± 0.162 | 0.572 ± 0.089 | 1.786 ± 0.152 | 0.0 ± 0.0
Arg | 2.201 ± 0.19 | 0.457 ± 0.072 | 2.973 ± 0.221 | 2.858 ± 0.234 | 2.215 ± 0.171 | 2.715 ± 0.209 | 0.643 ± 0.11 | 3.573 ± 0.229 | 3.187 ± 0.269 | 2.615 ± 0.218 | 1.143 ± 0.132 | 2.258 ± 0.166 | 1.215 ± 0.121 | 1.372 ± 0.137 | 1.844 ± 0.173 | 2.458 ± 0.193 | 2.229 ± 0.165 | 3.573 ± 0.248 | 0.729 ± 0.12 | 1.972 ± 0.161 | 0.0 ± 0.0
Ser | 3.087 ± 0.251 | 0.958 ± 0.124 | 4.373 ± 0.272 | 4.573 ± 0.262 | 3.187 ± 0.231 | 5.502 ± 0.353 | 1.158 ± 0.133 | 4.731 ± 0.265 | 4.888 ± 0.266 | 5.288 ± 0.28 | 2.172 ± 0.169 | 3.187 ± 0.258 | 2.201 ± 0.201 | 1.872 ± 0.153 | 2.815 ± 0.177 | 4.43 ± 0.371 | 3.644 ± 0.284 | 4.902 ± 0.295 | 0.872 ± 0.109 | 3.073 ± 0.216 | 0.0 ± 0.0
Thr | 2.558 ± 0.22 | 0.572 ± 0.1 | 3.087 ± 0.23 | 3.944 ± 0.231 | 2.401 ± 0.184 | 4.116 ± 0.382 | 1.215 ± 0.134 | 4.43 ± 0.252 | 4.273 ± 0.268 | 4.216 ± 0.251 | 1.515 ± 0.148 | 2.93 ± 0.249 | 2.544 ± 0.19 | 1.958 ± 0.198 | 2.53 ± 0.199 | 3.744 ± 0.305 | 3.158 ± 0.329 | 3.887 ± 0.265 | 0.715 ± 0.108 | 2.172 ± 0.206 | 0.0 ± 0.0
Val | 3.673 ± 0.259 | 1.129 ± 0.162 | 4.788 ± 0.236 | 5.488 ± 0.32 | 3.53 ± 0.218 | 4.616 ± 0.275 | 1.358 ± 0.139 | 4.773 ± 0.283 | 5.359 ± 0.257 | 4.688 ± 0.267 | 2.201 ± 0.215 | 3.687 ± 0.232 | 2.015 ± 0.168 | 2.001 ± 0.168 | 3.487 ± 0.191 | 5.517 ± 0.32 | 4.087 ± 0.314 | 5.316 ± 0.27 | 1.229 ± 0.131 | 3.816 ± 0.238 | 0.0 ± 0.0
Trp | 0.672 ± 0.109 | 0.314 ± 0.061 | 1.258 ± 0.142 | 1.129 ± 0.118 | 0.772 ± 0.125 | 0.843 ± 0.116 | 0.214 ± 0.064 | 0.743 ± 0.111 | 1.372 ± 0.131 | 0.872 ± 0.101 | 0.5 ± 0.086 | 0.743 ± 0.091 | 0.229 ± 0.056 | 0.343 ± 0.071 | 0.543 ± 0.087 | 0.829 ± 0.102 | 0.543 ± 0.081 | 1.129 ± 0.141 | 0.286 ± 0.068 | 0.686 ± 0.101 | 0.0 ± 0.0
Tyr | 1.915 ± 0.157 | 0.772 ± 0.094 | 3.659 ± 0.256 | 3.516 ± 0.236 | 2.172 ± 0.193 | 3.087 ± 0.202 | 1.015 ± 0.121 | 3.187 ± 0.218 | 3.258 ± 0.214 | 2.915 ± 0.217 | 1.229 ± 0.145 | 2.901 ± 0.193 | 1.543 ± 0.148 | 1.529 ± 0.156 | 2.187 ± 0.154 | 3.016 ± 0.232 | 2.844 ± 0.205 | 3.387 ± 0.217 | 0.686 ± 0.083 | 2.229 ± 0.188 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 406 proteins (69972 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
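A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the reported error is taken to be the standard deviation across replicates, and the names pair_counts and bootstrap_error, the one-letter toy sequences, and the fixed seed are all hypothetical. The database's actual procedure may differ.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement,
    so the protein, not the residue, is the unit of resampling.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from the database
toy = ["MKKLAEKK", "MAKKLL", "MLLAEE", "MKAKAK"]
mean_kk, err_kk = bootstrap_error(toy, "KK")
print(mean_kk, err_kk)
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so the spread reflects variation between proteins.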

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski