Amino acid dipepetide frequency for Aeromonas phage Asswx_1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.269AlaAla: 3.269 ± 0.251
0.597AlaCys: 0.597 ± 0.099
2.615AlaAsp: 2.615 ± 0.19
3.113AlaGlu: 3.113 ± 0.218
2.047AlaPhe: 2.047 ± 0.174
3.255AlaGly: 3.255 ± 0.284
0.967AlaHis: 0.967 ± 0.12
3.483AlaIle: 3.483 ± 0.184
4.378AlaLys: 4.378 ± 0.271
4.136AlaLeu: 4.136 ± 0.263
1.976AlaMet: 1.976 ± 0.178
2.914AlaAsn: 2.914 ± 0.242
1.194AlaPro: 1.194 ± 0.144
1.692AlaGln: 1.692 ± 0.178
2.416AlaArg: 2.416 ± 0.185
2.971AlaSer: 2.971 ± 0.214
2.928AlaThr: 2.928 ± 0.307
3.113AlaVal: 3.113 ± 0.228
0.512AlaTrp: 0.512 ± 0.087
2.203AlaTyr: 2.203 ± 0.16
0.0AlaXaa: 0.0 ± 0.0
Cys
0.625CysAla: 0.625 ± 0.095
0.213CysCys: 0.213 ± 0.058
1.208CysAsp: 1.208 ± 0.146
0.881CysGlu: 0.881 ± 0.121
0.625CysPhe: 0.625 ± 0.094
1.294CysGly: 1.294 ± 0.161
0.355CysHis: 0.355 ± 0.073
0.952CysIle: 0.952 ± 0.12
0.881CysLys: 0.881 ± 0.121
0.81CysLeu: 0.81 ± 0.11
0.327CysMet: 0.327 ± 0.069
0.64CysAsn: 0.64 ± 0.102
0.583CysPro: 0.583 ± 0.118
0.227CysGln: 0.227 ± 0.058
0.426CysArg: 0.426 ± 0.073
0.881CysSer: 0.881 ± 0.123
0.668CysThr: 0.668 ± 0.107
1.023CysVal: 1.023 ± 0.126
0.256CysTrp: 0.256 ± 0.062
0.668CysTyr: 0.668 ± 0.096
0.0CysXaa: 0.0 ± 0.0
Asp
3.085AspAla: 3.085 ± 0.246
0.896AspCys: 0.896 ± 0.101
4.335AspAsp: 4.335 ± 0.252
4.804AspGlu: 4.804 ± 0.281
3.71AspPhe: 3.71 ± 0.236
5.416AspGly: 5.416 ± 0.258
1.222AspHis: 1.222 ± 0.152
5.629AspIle: 5.629 ± 0.268
3.952AspLys: 3.952 ± 0.258
5.416AspLeu: 5.416 ± 0.282
2.274AspMet: 2.274 ± 0.185
3.085AspAsn: 3.085 ± 0.184
2.758AspPro: 2.758 ± 0.215
1.819AspGln: 1.819 ± 0.157
2.388AspArg: 2.388 ± 0.184
3.952AspSer: 3.952 ± 0.204
3.568AspThr: 3.568 ± 0.229
5.458AspVal: 5.458 ± 0.264
1.208AspTrp: 1.208 ± 0.129
3.44AspTyr: 3.44 ± 0.198
0.0AspXaa: 0.0 ± 0.0
Glu
3.639GluAla: 3.639 ± 0.283
1.052GluCys: 1.052 ± 0.126
4.136GluAsp: 4.136 ± 0.217
5.316GluGlu: 5.316 ± 0.311
4.08GluPhe: 4.08 ± 0.229
3.127GluGly: 3.127 ± 0.179
1.379GluHis: 1.379 ± 0.14
5.231GluIle: 5.231 ± 0.308
5.373GluLys: 5.373 ± 0.306
6.51GluLeu: 6.51 ± 0.298
2.999GluMet: 2.999 ± 0.234
4.051GluAsn: 4.051 ± 0.23
1.677GluPro: 1.677 ± 0.167
2.246GluGln: 2.246 ± 0.219
2.843GluArg: 2.843 ± 0.203
4.662GluSer: 4.662 ± 0.249
3.852GluThr: 3.852 ± 0.264
4.847GluVal: 4.847 ± 0.238
0.952GluTrp: 0.952 ± 0.11
3.838GluTyr: 3.838 ± 0.282
0.0GluXaa: 0.0 ± 0.0
Phe
2.303PheAla: 2.303 ± 0.167
0.711PheCys: 0.711 ± 0.108
3.838PheAsp: 3.838 ± 0.216
3.44PheGlu: 3.44 ± 0.219
1.976PhePhe: 1.976 ± 0.158
3.284PheGly: 3.284 ± 0.201
0.753PheHis: 0.753 ± 0.1
3.298PheIle: 3.298 ± 0.237
3.667PheLys: 3.667 ± 0.216
2.459PheLeu: 2.459 ± 0.198
1.208PheMet: 1.208 ± 0.125
3.17PheAsn: 3.17 ± 0.218
1.365PhePro: 1.365 ± 0.147
1.009PheGln: 1.009 ± 0.098
1.706PheArg: 1.706 ± 0.14
3.07PheSer: 3.07 ± 0.19
2.601PheThr: 2.601 ± 0.203
4.008PheVal: 4.008 ± 0.282
0.654PheTrp: 0.654 ± 0.101
2.118PheTyr: 2.118 ± 0.149
0.0PheXaa: 0.0 ± 0.0
Gly
2.644GlyAla: 2.644 ± 0.231
0.896GlyCys: 0.896 ± 0.113
4.492GlyAsp: 4.492 ± 0.29
3.852GlyGlu: 3.852 ± 0.211
2.999GlyPhe: 2.999 ± 0.193
3.966GlyGly: 3.966 ± 0.306
1.294GlyHis: 1.294 ± 0.145
4.79GlyIle: 4.79 ± 0.254
4.947GlyLys: 4.947 ± 0.286
4.08GlyLeu: 4.08 ± 0.242
2.374GlyMet: 2.374 ± 0.162
3.596GlyAsn: 3.596 ± 0.36
1.038GlyPro: 1.038 ± 0.114
1.834GlyGln: 1.834 ± 0.191
2.928GlyArg: 2.928 ± 0.221
4.335GlySer: 4.335 ± 0.32
3.511GlyThr: 3.511 ± 0.281
5.274GlyVal: 5.274 ± 0.294
1.251GlyTrp: 1.251 ± 0.137
3.539GlyTyr: 3.539 ± 0.263
0.0GlyXaa: 0.0 ± 0.0
His
0.753HisAla: 0.753 ± 0.104
0.313HisCys: 0.313 ± 0.063
1.564HisAsp: 1.564 ± 0.138
1.237HisGlu: 1.237 ± 0.129
0.867HisPhe: 0.867 ± 0.115
1.407HisGly: 1.407 ± 0.16
0.498HisHis: 0.498 ± 0.098
1.521HisIle: 1.521 ± 0.143
1.421HisLys: 1.421 ± 0.137
1.407HisLeu: 1.407 ± 0.142
0.668HisMet: 0.668 ± 0.099
1.066HisAsn: 1.066 ± 0.126
0.839HisPro: 0.839 ± 0.119
0.611HisGln: 0.611 ± 0.087
0.981HisArg: 0.981 ± 0.127
1.151HisSer: 1.151 ± 0.112
1.123HisThr: 1.123 ± 0.116
1.436HisVal: 1.436 ± 0.127
0.313HisTrp: 0.313 ± 0.073
0.924HisTyr: 0.924 ± 0.125
0.0HisXaa: 0.0 ± 0.0
Ile
3.881IleAla: 3.881 ± 0.227
1.08IleCys: 1.08 ± 0.134
5.842IleAsp: 5.842 ± 0.287
5.828IleGlu: 5.828 ± 0.277
2.701IlePhe: 2.701 ± 0.187
4.321IleGly: 4.321 ± 0.296
1.521IleHis: 1.521 ± 0.158
4.264IleIle: 4.264 ± 0.258
5.174IleLys: 5.174 ± 0.272
4.705IleLeu: 4.705 ± 0.237
1.99IleMet: 1.99 ± 0.164
4.321IleAsn: 4.321 ± 0.238
2.928IlePro: 2.928 ± 0.213
2.601IleGln: 2.601 ± 0.198
3.61IleArg: 3.61 ± 0.232
5.146IleSer: 5.146 ± 0.343
3.781IleThr: 3.781 ± 0.226
4.648IleVal: 4.648 ± 0.235
0.611IleTrp: 0.611 ± 0.087
2.772IleTyr: 2.772 ± 0.199
0.0IleXaa: 0.0 ± 0.0
Lys
4.08LysAla: 4.08 ± 0.28
0.725LysCys: 0.725 ± 0.098
5.231LysAsp: 5.231 ± 0.272
6.723LysGlu: 6.723 ± 0.35
3.881LysPhe: 3.881 ± 0.214
3.625LysGly: 3.625 ± 0.235
1.635LysHis: 1.635 ± 0.154
5.43LysIle: 5.43 ± 0.306
6.411LysLys: 6.411 ± 0.308
5.032LysLeu: 5.032 ± 0.255
2.829LysMet: 2.829 ± 0.213
4.293LysAsn: 4.293 ± 0.239
2.53LysPro: 2.53 ± 0.182
2.431LysGln: 2.431 ± 0.186
2.63LysArg: 2.63 ± 0.202
4.804LysSer: 4.804 ± 0.271
4.733LysThr: 4.733 ± 0.272
4.961LysVal: 4.961 ± 0.275
1.009LysTrp: 1.009 ± 0.11
3.682LysTyr: 3.682 ± 0.234
0.0LysXaa: 0.0 ± 0.0
Leu
3.724LeuAla: 3.724 ± 0.253
0.938LeuCys: 0.938 ± 0.106
4.833LeuAsp: 4.833 ± 0.277
5.202LeuGlu: 5.202 ± 0.273
3.212LeuPhe: 3.212 ± 0.222
4.165LeuGly: 4.165 ± 0.251
1.151LeuHis: 1.151 ± 0.131
4.833LeuIle: 4.833 ± 0.261
5.6LeuLys: 5.6 ± 0.277
4.023LeuLeu: 4.023 ± 0.227
2.104LeuMet: 2.104 ± 0.167
4.023LeuAsn: 4.023 ± 0.24
2.559LeuPro: 2.559 ± 0.208
1.635LeuGln: 1.635 ± 0.155
3.141LeuArg: 3.141 ± 0.197
4.804LeuSer: 4.804 ± 0.29
4.094LeuThr: 4.094 ± 0.218
5.842LeuVal: 5.842 ± 0.27
0.697LeuTrp: 0.697 ± 0.104
2.829LeuTyr: 2.829 ± 0.205
0.0LeuXaa: 0.0 ± 0.0
Met
1.748MetAla: 1.748 ± 0.154
0.455MetCys: 0.455 ± 0.086
1.933MetAsp: 1.933 ± 0.174
2.217MetGlu: 2.217 ± 0.232
1.45MetPhe: 1.45 ± 0.129
1.805MetGly: 1.805 ± 0.166
0.398MetHis: 0.398 ± 0.079
2.331MetIle: 2.331 ± 0.175
3.923MetLys: 3.923 ± 0.226
1.99MetLeu: 1.99 ± 0.204
0.924MetMet: 0.924 ± 0.136
2.175MetAsn: 2.175 ± 0.184
0.597MetPro: 0.597 ± 0.093
0.725MetGln: 0.725 ± 0.099
1.052MetArg: 1.052 ± 0.126
2.118MetSer: 2.118 ± 0.17
1.962MetThr: 1.962 ± 0.164
2.559MetVal: 2.559 ± 0.186
0.299MetTrp: 0.299 ± 0.062
1.208MetTyr: 1.208 ± 0.125
0.0MetXaa: 0.0 ± 0.0
Asn
2.772AsnAla: 2.772 ± 0.289
0.668AsnCys: 0.668 ± 0.09
3.298AsnAsp: 3.298 ± 0.22
3.696AsnGlu: 3.696 ± 0.24
2.402AsnPhe: 2.402 ± 0.2
4.122AsnGly: 4.122 ± 0.303
1.251AsnHis: 1.251 ± 0.129
3.596AsnIle: 3.596 ± 0.227
4.406AsnLys: 4.406 ± 0.23
4.406AsnLeu: 4.406 ± 0.263
1.592AsnMet: 1.592 ± 0.152
3.085AsnAsn: 3.085 ± 0.282
2.26AsnPro: 2.26 ± 0.197
2.033AsnGln: 2.033 ± 0.148
2.672AsnArg: 2.672 ± 0.184
3.554AsnSer: 3.554 ± 0.195
3.298AsnThr: 3.298 ± 0.305
3.184AsnVal: 3.184 ± 0.26
0.796AsnTrp: 0.796 ± 0.104
2.416AsnTyr: 2.416 ± 0.183
0.0AsnXaa: 0.0 ± 0.0
Pro
1.35ProAla: 1.35 ± 0.143
0.426ProCys: 0.426 ± 0.081
2.615ProAsp: 2.615 ± 0.162
2.516ProGlu: 2.516 ± 0.191
1.834ProPhe: 1.834 ± 0.159
1.407ProGly: 1.407 ± 0.137
0.824ProHis: 0.824 ± 0.117
1.834ProIle: 1.834 ± 0.155
2.118ProLys: 2.118 ± 0.172
1.834ProLeu: 1.834 ± 0.154
0.753ProMet: 0.753 ± 0.087
1.819ProAsn: 1.819 ± 0.128
0.782ProPro: 0.782 ± 0.102
0.924ProGln: 0.924 ± 0.128
1.251ProArg: 1.251 ± 0.127
2.559ProSer: 2.559 ± 0.216
2.175ProThr: 2.175 ± 0.155
2.971ProVal: 2.971 ± 0.187
0.611ProTrp: 0.611 ± 0.08
1.635ProTyr: 1.635 ± 0.135
0.0ProXaa: 0.0 ± 0.0
Gln
1.919GlnAla: 1.919 ± 0.16
0.327GlnCys: 0.327 ± 0.075
1.493GlnAsp: 1.493 ± 0.15
2.445GlnGlu: 2.445 ± 0.186
1.393GlnPhe: 1.393 ± 0.141
1.649GlnGly: 1.649 ± 0.149
0.441GlnHis: 0.441 ± 0.082
2.573GlnIle: 2.573 ± 0.169
2.217GlnLys: 2.217 ± 0.161
2.189GlnLeu: 2.189 ± 0.168
0.839GlnMet: 0.839 ± 0.105
1.677GlnAsn: 1.677 ± 0.139
1.109GlnPro: 1.109 ± 0.118
0.967GlnGln: 0.967 ± 0.138
1.35GlnArg: 1.35 ± 0.137
1.862GlnSer: 1.862 ± 0.173
1.578GlnThr: 1.578 ± 0.133
1.905GlnVal: 1.905 ± 0.159
0.526GlnTrp: 0.526 ± 0.088
1.535GlnTyr: 1.535 ± 0.158
0.0GlnXaa: 0.0 ± 0.0
Arg
2.132ArgAla: 2.132 ± 0.178
0.554ArgCys: 0.554 ± 0.09
2.772ArgAsp: 2.772 ± 0.199
2.829ArgGlu: 2.829 ± 0.202
2.118ArgPhe: 2.118 ± 0.166
2.8ArgGly: 2.8 ± 0.227
0.896ArgHis: 0.896 ± 0.124
3.255ArgIle: 3.255 ± 0.203
3.113ArgLys: 3.113 ± 0.231
2.772ArgLeu: 2.772 ± 0.178
1.379ArgMet: 1.379 ± 0.124
2.104ArgAsn: 2.104 ± 0.155
1.137ArgPro: 1.137 ± 0.1
1.549ArgGln: 1.549 ± 0.127
1.905ArgArg: 1.905 ± 0.17
2.758ArgSer: 2.758 ± 0.182
2.018ArgThr: 2.018 ± 0.169
3.539ArgVal: 3.539 ± 0.264
0.81ArgTrp: 0.81 ± 0.122
2.161ArgTyr: 2.161 ± 0.187
0.0ArgXaa: 0.0 ± 0.0
Ser
2.772SerAla: 2.772 ± 0.235
0.938SerCys: 0.938 ± 0.125
4.35SerAsp: 4.35 ± 0.309
4.179SerGlu: 4.179 ± 0.244
2.999SerPhe: 2.999 ± 0.207
5.345SerGly: 5.345 ± 0.326
1.222SerHis: 1.222 ± 0.124
4.79SerIle: 4.79 ± 0.203
4.918SerLys: 4.918 ± 0.251
4.733SerLeu: 4.733 ± 0.243
2.075SerMet: 2.075 ± 0.183
3.255SerAsn: 3.255 ± 0.266
2.132SerPro: 2.132 ± 0.189
2.004SerGln: 2.004 ± 0.158
2.928SerArg: 2.928 ± 0.18
4.549SerSer: 4.549 ± 0.326
3.738SerThr: 3.738 ± 0.292
4.534SerVal: 4.534 ± 0.228
0.981SerTrp: 0.981 ± 0.105
2.985SerTyr: 2.985 ± 0.219
0.0SerXaa: 0.0 ± 0.0
Thr
2.701ThrAla: 2.701 ± 0.278
0.54ThrCys: 0.54 ± 0.098
3.582ThrAsp: 3.582 ± 0.218
3.525ThrGlu: 3.525 ± 0.226
2.303ThrPhe: 2.303 ± 0.192
4.165ThrGly: 4.165 ± 0.35
1.379ThrHis: 1.379 ± 0.149
4.591ThrIle: 4.591 ± 0.307
4.065ThrLys: 4.065 ± 0.219
4.122ThrLeu: 4.122 ± 0.269
1.35ThrMet: 1.35 ± 0.154
2.886ThrAsn: 2.886 ± 0.227
2.814ThrPro: 2.814 ± 0.224
1.876ThrGln: 1.876 ± 0.147
2.26ThrArg: 2.26 ± 0.165
3.468ThrSer: 3.468 ± 0.239
3.141ThrThr: 3.141 ± 0.368
4.534ThrVal: 4.534 ± 0.255
0.682ThrTrp: 0.682 ± 0.091
2.146ThrTyr: 2.146 ± 0.172
0.0ThrXaa: 0.0 ± 0.0
Val
3.284ValAla: 3.284 ± 0.241
1.166ValCys: 1.166 ± 0.129
5.43ValAsp: 5.43 ± 0.256
5.728ValGlu: 5.728 ± 0.281
3.298ValPhe: 3.298 ± 0.191
4.648ValGly: 4.648 ± 0.33
1.578ValHis: 1.578 ± 0.128
4.975ValIle: 4.975 ± 0.27
5.6ValLys: 5.6 ± 0.258
4.605ValLeu: 4.605 ± 0.26
2.459ValMet: 2.459 ± 0.191
4.037ValAsn: 4.037 ± 0.233
2.09ValPro: 2.09 ± 0.171
1.99ValGln: 1.99 ± 0.159
3.312ValArg: 3.312 ± 0.19
5.032ValSer: 5.032 ± 0.26
4.165ValThr: 4.165 ± 0.271
5.871ValVal: 5.871 ± 0.277
1.279ValTrp: 1.279 ± 0.152
3.397ValTyr: 3.397 ± 0.262
0.0ValXaa: 0.0 ± 0.0
Trp
0.739TrpAla: 0.739 ± 0.098
0.37TrpCys: 0.37 ± 0.071
1.166TrpAsp: 1.166 ± 0.13
1.08TrpGlu: 1.08 ± 0.117
0.839TrpPhe: 0.839 ± 0.098
0.839TrpGly: 0.839 ± 0.112
0.27TrpHis: 0.27 ± 0.069
0.995TrpIle: 0.995 ± 0.125
1.251TrpLys: 1.251 ± 0.13
0.952TrpLeu: 0.952 ± 0.119
0.398TrpMet: 0.398 ± 0.076
0.711TrpAsn: 0.711 ± 0.098
0.27TrpPro: 0.27 ± 0.062
0.355TrpGln: 0.355 ± 0.075
0.611TrpArg: 0.611 ± 0.084
0.91TrpSer: 0.91 ± 0.095
0.625TrpThr: 0.625 ± 0.087
1.095TrpVal: 1.095 ± 0.119
0.185TrpTrp: 0.185 ± 0.069
0.625TrpTyr: 0.625 ± 0.103
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.36TyrAla: 2.36 ± 0.169
0.682TyrCys: 0.682 ± 0.1
3.682TyrAsp: 3.682 ± 0.206
3.198TyrGlu: 3.198 ± 0.233
1.905TyrPhe: 1.905 ± 0.191
2.999TyrGly: 2.999 ± 0.209
1.066TyrHis: 1.066 ± 0.117
3.369TyrIle: 3.369 ± 0.207
3.255TyrLys: 3.255 ± 0.203
3.227TyrLeu: 3.227 ± 0.209
1.336TyrMet: 1.336 ± 0.133
2.601TyrAsn: 2.601 ± 0.207
1.578TyrPro: 1.578 ± 0.146
1.464TyrGln: 1.464 ± 0.145
2.189TyrArg: 2.189 ± 0.169
2.843TyrSer: 2.843 ± 0.228
2.573TyrThr: 2.573 ± 0.198
3.141TyrVal: 3.141 ± 0.223
0.654TyrTrp: 0.654 ± 0.091
2.146TyrTyr: 2.146 ± 0.185
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 407 proteins (70352 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski