Amino acid dipeptide frequency for Salmonella phage Mutine

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
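As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is a minimal sketch under assumptions: the function name `dipeptide_permille`, the one-letter toy sequences, and the choice of denominator (total number of counted dipeptides) are illustrative and may differ from how the database computes its values.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # A protein of length n contributes n - 1 overlapping dipeptides;
        # pairs never span two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    # Assumption: normalize by the total number of dipeptides counted.
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the Mutine proteome).
freqs = dipeptide_permille(["MAAK", "AAL"])
print(freqs["AA"])  # 2 of 5 dipeptides are "AA" -> 400.0
```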

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
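The comparison against the uniform expectation can be sketched as follows. This is only an illustration: the page does not state the exact rule used for the red/blue marking, so the plain greater-than/less-than test against 1/400 (ignoring the reported error) and the helper name `classify` are assumptions.

```python
# Uniform expectation: 1 of 400 standard dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5

def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# A few values copied from the table (per mille).
observed = {"LeuLeu": 6.064, "AlaAla": 5.043, "CysCys": 0.16, "TrpTrp": 0.18}
labels = {dp: classify(v) for dp, v in observed.items()}
print(labels)
```

With these inputs, LeuLeu and AlaAla fall above 2.5‰ and CysCys and TrpTrp fall below it.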

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
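As a small worked example of the unit conversion, the snippet below turns a per-mille value into a fraction and a percentage. The helper name `permille_to_fraction` is hypothetical; the input is the AlaAla value from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

ala_ala = permille_to_fraction(5.043)  # AlaAla value from the table
print(f"{ala_ala:.6f}")                # as a fraction
print(f"{ala_ala * 100:.4f}%")         # as a percentage
```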

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error (‰).

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.043 ± 0.367 | 0.64 ± 0.126 | 4.102 ± 0.321 | 4.122 ± 0.387 | 2.582 ± 0.218 | 4.162 ± 0.373 | 1.321 ± 0.182 | 4.363 ± 0.345 | 3.942 ± 0.271 | 5.283 ± 0.349 | 1.921 ± 0.171 | 2.922 ± 0.238 | 2.301 ± 0.222 | 2.542 ± 0.271 | 3.382 ± 0.273 | 3.982 ± 0.287 | 4.102 ± 0.346 | 4.663 ± 0.284 | 0.861 ± 0.145 | 2.341 ± 0.26 | 0.0 ± 0.0
Cys | 0.76 ± 0.119 | 0.16 ± 0.069 | 0.841 ± 0.154 | 0.881 ± 0.124 | 0.28 ± 0.067 | 0.82 ± 0.146 | 0.42 ± 0.097 | 0.74 ± 0.105 | 0.78 ± 0.132 | 0.72 ± 0.127 | 0.32 ± 0.081 | 0.7 ± 0.112 | 0.64 ± 0.108 | 0.4 ± 0.094 | 0.54 ± 0.108 | 0.881 ± 0.149 | 0.78 ± 0.139 | 1.001 ± 0.153 | 0.12 ± 0.052 | 0.4 ± 0.1 | 0.0 ± 0.0
Asp | 4.463 ± 0.309 | 0.8 ± 0.129 | 3.742 ± 0.262 | 3.982 ± 0.305 | 2.982 ± 0.235 | 4.983 ± 0.349 | 1.021 ± 0.119 | 4.703 ± 0.267 | 3.662 ± 0.249 | 5.663 ± 0.315 | 2.081 ± 0.222 | 3.042 ± 0.232 | 2.762 ± 0.245 | 1.861 ± 0.153 | 2.341 ± 0.187 | 3.902 ± 0.276 | 3.462 ± 0.26 | 4.122 ± 0.306 | 1.021 ± 0.139 | 3.122 ± 0.244 | 0.0 ± 0.0
Glu | 4.042 ± 0.346 | 0.64 ± 0.119 | 4.343 ± 0.282 | 4.383 ± 0.413 | 3.282 ± 0.255 | 4.443 ± 0.293 | 1.341 ± 0.155 | 4.283 ± 0.311 | 3.722 ± 0.335 | 6.364 ± 0.389 | 2.101 ± 0.198 | 3.122 ± 0.261 | 1.921 ± 0.19 | 2.622 ± 0.272 | 3.722 ± 0.299 | 3.802 ± 0.266 | 3.542 ± 0.265 | 4.263 ± 0.32 | 1.101 ± 0.157 | 2.882 ± 0.26 | 0.0 ± 0.0
Phe | 2.441 ± 0.232 | 0.52 ± 0.106 | 2.822 ± 0.208 | 3.022 ± 0.247 | 1.641 ± 0.195 | 3.442 ± 0.262 | 0.921 ± 0.146 | 2.802 ± 0.222 | 2.802 ± 0.257 | 2.802 ± 0.189 | 1.201 ± 0.147 | 2.602 ± 0.208 | 1.481 ± 0.202 | 1.481 ± 0.139 | 2.381 ± 0.234 | 2.882 ± 0.228 | 2.642 ± 0.265 | 3.162 ± 0.222 | 0.68 ± 0.111 | 1.541 ± 0.164 | 0.0 ± 0.0
Gly | 3.742 ± 0.325 | 0.961 ± 0.147 | 4.102 ± 0.348 | 4.463 ± 0.254 | 2.982 ± 0.237 | 5.103 ± 0.473 | 1.341 ± 0.173 | 4.723 ± 0.328 | 5.243 ± 0.364 | 5.003 ± 0.308 | 2.081 ± 0.202 | 3.582 ± 0.292 | 1.141 ± 0.158 | 2.461 ± 0.189 | 2.722 ± 0.22 | 4.963 ± 0.516 | 3.842 ± 0.37 | 4.943 ± 0.348 | 1.281 ± 0.174 | 2.842 ± 0.243 | 0.0 ± 0.0
His | 1.001 ± 0.138 | 0.3 ± 0.06 | 1.241 ± 0.198 | 0.68 ± 0.105 | 0.981 ± 0.142 | 0.901 ± 0.136 | 0.52 ± 0.118 | 1.541 ± 0.2 | 1.321 ± 0.19 | 1.781 ± 0.149 | 0.58 ± 0.102 | 0.68 ± 0.126 | 1.161 ± 0.179 | 0.62 ± 0.111 | 1.181 ± 0.157 | 1.041 ± 0.142 | 1.201 ± 0.182 | 1.501 ± 0.222 | 0.24 ± 0.082 | 0.961 ± 0.143 | 0.0 ± 0.0
Ile | 3.842 ± 0.283 | 0.861 ± 0.143 | 4.963 ± 0.358 | 4.743 ± 0.335 | 1.841 ± 0.187 | 3.822 ± 0.249 | 1.321 ± 0.196 | 3.802 ± 0.279 | 4.042 ± 0.292 | 4.443 ± 0.296 | 1.661 ± 0.175 | 3.582 ± 0.285 | 2.982 ± 0.247 | 2.862 ± 0.228 | 3.302 ± 0.268 | 3.722 ± 0.314 | 4.503 ± 0.319 | 4.062 ± 0.336 | 0.78 ± 0.147 | 2.141 ± 0.204 | 0.0 ± 0.0
Lys | 4.283 ± 0.38 | 0.64 ± 0.135 | 3.862 ± 0.267 | 4.543 ± 0.362 | 3.202 ± 0.216 | 3.982 ± 0.291 | 1.201 ± 0.174 | 3.982 ± 0.219 | 4.283 ± 0.334 | 5.103 ± 0.349 | 2.301 ± 0.272 | 2.622 ± 0.214 | 2.682 ± 0.249 | 2.862 ± 0.243 | 3.162 ± 0.253 | 4.183 ± 0.29 | 4.263 ± 0.306 | 4.223 ± 0.293 | 0.941 ± 0.149 | 2.441 ± 0.241 | 0.0 ± 0.0
Leu | 5.964 ± 0.331 | 0.76 ± 0.125 | 4.903 ± 0.304 | 5.323 ± 0.401 | 3.502 ± 0.296 | 5.043 ± 0.287 | 1.301 ± 0.167 | 4.002 ± 0.29 | 5.844 ± 0.389 | 6.064 ± 0.405 | 2.061 ± 0.203 | 4.263 ± 0.315 | 3.442 ± 0.298 | 2.922 ± 0.228 | 4.162 ± 0.263 | 5.663 ± 0.337 | 4.903 ± 0.31 | 5.603 ± 0.352 | 0.78 ± 0.156 | 3.242 ± 0.313 | 0.0 ± 0.0
Met | 2.441 ± 0.283 | 0.38 ± 0.086 | 1.621 ± 0.168 | 1.641 ± 0.165 | 1.401 ± 0.188 | 1.421 ± 0.157 | 0.42 ± 0.083 | 1.721 ± 0.199 | 2.341 ± 0.25 | 2.481 ± 0.229 | 1.021 ± 0.137 | 1.581 ± 0.196 | 1.061 ± 0.168 | 0.981 ± 0.126 | 1.641 ± 0.171 | 2.181 ± 0.205 | 1.861 ± 0.211 | 1.521 ± 0.192 | 0.3 ± 0.077 | 0.941 ± 0.157 | 0.0 ± 0.0
Asn | 3.822 ± 0.294 | 0.921 ± 0.159 | 2.922 ± 0.266 | 2.742 ± 0.261 | 2.141 ± 0.198 | 4.243 ± 0.395 | 1.201 ± 0.175 | 3.302 ± 0.267 | 3.262 ± 0.275 | 3.882 ± 0.354 | 1.621 ± 0.175 | 3.162 ± 0.286 | 2.622 ± 0.299 | 2.221 ± 0.179 | 2.401 ± 0.25 | 2.882 ± 0.264 | 2.582 ± 0.242 | 3.502 ± 0.341 | 0.6 ± 0.136 | 1.801 ± 0.237 | 0.0 ± 0.0
Pro | 2.341 ± 0.212 | 0.48 ± 0.1 | 2.962 ± 0.271 | 3.562 ± 0.275 | 1.721 ± 0.169 | 2.602 ± 0.255 | 0.74 ± 0.157 | 2.241 ± 0.231 | 2.201 ± 0.195 | 2.882 ± 0.25 | 1.041 ± 0.151 | 1.981 ± 0.196 | 1.261 ± 0.167 | 1.301 ± 0.168 | 1.721 ± 0.204 | 2.742 ± 0.211 | 2.361 ± 0.242 | 2.742 ± 0.271 | 0.66 ± 0.106 | 1.241 ± 0.163 | 0.0 ± 0.0
Gln | 2.722 ± 0.24 | 0.42 ± 0.095 | 2.021 ± 0.228 | 2.341 ± 0.207 | 1.881 ± 0.186 | 2.361 ± 0.206 | 0.861 ± 0.141 | 2.742 ± 0.221 | 2.161 ± 0.188 | 3.182 ± 0.298 | 1.001 ± 0.133 | 1.821 ± 0.193 | 1.281 ± 0.178 | 1.941 ± 0.267 | 2.161 ± 0.197 | 2.241 ± 0.236 | 2.181 ± 0.195 | 2.702 ± 0.264 | 0.6 ± 0.121 | 1.441 ± 0.17 | 0.0 ± 0.0
Arg | 3.002 ± 0.222 | 0.76 ± 0.112 | 2.862 ± 0.229 | 3.082 ± 0.277 | 2.281 ± 0.196 | 2.742 ± 0.211 | 1.141 ± 0.17 | 3.282 ± 0.229 | 2.922 ± 0.293 | 4.523 ± 0.344 | 1.541 ± 0.168 | 2.862 ± 0.253 | 1.741 ± 0.232 | 2.121 ± 0.24 | 3.022 ± 0.308 | 3.362 ± 0.274 | 2.221 ± 0.27 | 3.322 ± 0.243 | 0.841 ± 0.126 | 2.181 ± 0.213 | 0.0 ± 0.0
Ser | 3.522 ± 0.276 | 0.64 ± 0.127 | 3.762 ± 0.308 | 4.223 ± 0.271 | 2.922 ± 0.258 | 5.103 ± 0.465 | 0.921 ± 0.147 | 4.223 ± 0.348 | 4.122 ± 0.355 | 5.243 ± 0.379 | 1.821 ± 0.207 | 3.742 ± 0.282 | 2.381 ± 0.222 | 2.341 ± 0.224 | 3.262 ± 0.262 | 4.183 ± 0.389 | 3.702 ± 0.391 | 4.803 ± 0.363 | 0.961 ± 0.138 | 2.622 ± 0.236 | 0.0 ± 0.0
Thr | 3.862 ± 0.294 | 0.52 ± 0.108 | 3.542 ± 0.284 | 4.002 ± 0.372 | 2.582 ± 0.243 | 4.323 ± 0.337 | 1.081 ± 0.159 | 4.042 ± 0.318 | 3.622 ± 0.315 | 4.743 ± 0.326 | 1.241 ± 0.143 | 2.642 ± 0.268 | 3.562 ± 0.252 | 2.081 ± 0.166 | 2.862 ± 0.217 | 3.882 ± 0.383 | 3.782 ± 0.349 | 4.463 ± 0.412 | 0.78 ± 0.136 | 1.821 ± 0.244 | 0.0 ± 0.0
Val | 3.922 ± 0.302 | 0.76 ± 0.142 | 5.263 ± 0.357 | 4.863 ± 0.35 | 2.622 ± 0.25 | 4.683 ± 0.28 | 1.161 ± 0.169 | 4.162 ± 0.262 | 5.183 ± 0.345 | 5.183 ± 0.347 | 1.861 ± 0.192 | 3.742 ± 0.261 | 2.301 ± 0.226 | 2.361 ± 0.207 | 2.842 ± 0.228 | 4.903 ± 0.318 | 4.783 ± 0.433 | 5.964 ± 0.409 | 1.201 ± 0.174 | 3.082 ± 0.227 | 0.0 ± 0.0
Trp | 1.021 ± 0.138 | 0.4 ± 0.094 | 1.061 ± 0.153 | 1.041 ± 0.159 | 0.74 ± 0.12 | 0.841 ± 0.121 | 0.22 ± 0.071 | 0.68 ± 0.149 | 0.941 ± 0.164 | 1.441 ± 0.193 | 0.46 ± 0.092 | 0.66 ± 0.111 | 0.44 ± 0.096 | 0.4 ± 0.091 | 0.901 ± 0.14 | 0.66 ± 0.111 | 0.66 ± 0.118 | 1.161 ± 0.172 | 0.18 ± 0.057 | 0.5 ± 0.084 | 0.0 ± 0.0
Tyr | 2.341 ± 0.227 | 0.62 ± 0.118 | 2.782 ± 0.252 | 2.281 ± 0.183 | 1.701 ± 0.187 | 2.502 ± 0.264 | 0.981 ± 0.168 | 1.941 ± 0.204 | 2.381 ± 0.23 | 2.962 ± 0.237 | 1.021 ± 0.131 | 2.522 ± 0.229 | 1.621 ± 0.177 | 1.681 ± 0.187 | 2.101 ± 0.191 | 2.441 ± 0.209 | 2.061 ± 0.282 | 3.122 ± 0.25 | 0.48 ± 0.094 | 1.421 ± 0.153 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 218 proteins (49971 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
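Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is reported. This is a minimal sketch under assumptions: the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are all illustrative, since the page states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    # Assumption: the error is the sample standard deviation of the estimates.
    return statistics.stdev(estimates)

proteins = ["MAAKLL", "AALGG", "MKLLAA", "GGAAL"]  # hypothetical toy sequences
err = bootstrap_error(proteins, "AA")
print(round(err, 3))
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides within a protein stay together in every resample.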

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski