Amino acid dipepetide frequency for Salmonella phage SFP10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.338AlaAla: 5.338 ± 0.34
0.665AlaCys: 0.665 ± 0.105
4.299AlaAsp: 4.299 ± 0.335
4.258AlaGlu: 4.258 ± 0.478
2.513AlaPhe: 2.513 ± 0.223
4.362AlaGly: 4.362 ± 0.363
1.433AlaHis: 1.433 ± 0.169
4.528AlaIle: 4.528 ± 0.318
3.676AlaLys: 3.676 ± 0.325
5.089AlaLeu: 5.089 ± 0.304
2.035AlaMet: 2.035 ± 0.198
3.385AlaAsn: 3.385 ± 0.26
2.492AlaPro: 2.492 ± 0.256
2.472AlaGln: 2.472 ± 0.224
3.406AlaArg: 3.406 ± 0.252
3.988AlaSer: 3.988 ± 0.33
3.946AlaThr: 3.946 ± 0.314
5.109AlaVal: 5.109 ± 0.376
0.831AlaTrp: 0.831 ± 0.126
2.555AlaTyr: 2.555 ± 0.231
0.0AlaXaa: 0.0 ± 0.0
Cys
0.748CysAla: 0.748 ± 0.107
0.104CysCys: 0.104 ± 0.042
0.768CysAsp: 0.768 ± 0.143
0.852CysGlu: 0.852 ± 0.122
0.353CysPhe: 0.353 ± 0.079
0.727CysGly: 0.727 ± 0.123
0.436CysHis: 0.436 ± 0.106
0.748CysIle: 0.748 ± 0.108
0.768CysLys: 0.768 ± 0.114
0.602CysLeu: 0.602 ± 0.107
0.332CysMet: 0.332 ± 0.08
0.519CysAsn: 0.519 ± 0.1
0.623CysPro: 0.623 ± 0.109
0.291CysGln: 0.291 ± 0.062
0.353CysArg: 0.353 ± 0.096
0.81CysSer: 0.81 ± 0.135
0.706CysThr: 0.706 ± 0.118
0.955CysVal: 0.955 ± 0.151
0.125CysTrp: 0.125 ± 0.052
0.395CysTyr: 0.395 ± 0.091
0.0CysXaa: 0.0 ± 0.0
Asp
4.528AspAla: 4.528 ± 0.288
0.768AspCys: 0.768 ± 0.12
3.78AspAsp: 3.78 ± 0.324
3.863AspGlu: 3.863 ± 0.32
3.053AspPhe: 3.053 ± 0.255
5.13AspGly: 5.13 ± 0.343
0.997AspHis: 0.997 ± 0.16
4.569AspIle: 4.569 ± 0.281
3.718AspLys: 3.718 ± 0.251
5.94AspLeu: 5.94 ± 0.371
2.098AspMet: 2.098 ± 0.205
3.136AspAsn: 3.136 ± 0.236
2.804AspPro: 2.804 ± 0.227
1.807AspGln: 1.807 ± 0.204
2.347AspArg: 2.347 ± 0.204
3.884AspSer: 3.884 ± 0.289
3.593AspThr: 3.593 ± 0.303
4.258AspVal: 4.258 ± 0.269
0.976AspTrp: 0.976 ± 0.17
3.157AspTyr: 3.157 ± 0.299
0.0AspXaa: 0.0 ± 0.0
Glu
4.424GluAla: 4.424 ± 0.426
0.644GluCys: 0.644 ± 0.11
4.071GluAsp: 4.071 ± 0.312
4.382GluGlu: 4.382 ± 0.403
2.949GluPhe: 2.949 ± 0.267
4.154GluGly: 4.154 ± 0.27
1.412GluHis: 1.412 ± 0.166
4.486GluIle: 4.486 ± 0.267
3.635GluLys: 3.635 ± 0.368
6.169GluLeu: 6.169 ± 0.367
2.077GluMet: 2.077 ± 0.19
3.24GluAsn: 3.24 ± 0.243
1.932GluPro: 1.932 ± 0.209
2.804GluGln: 2.804 ± 0.271
3.552GluArg: 3.552 ± 0.296
3.635GluSer: 3.635 ± 0.283
3.635GluThr: 3.635 ± 0.26
4.341GluVal: 4.341 ± 0.27
1.101GluTrp: 1.101 ± 0.15
3.053GluTyr: 3.053 ± 0.295
0.0GluXaa: 0.0 ± 0.0
Phe
2.326PheAla: 2.326 ± 0.289
0.353PheCys: 0.353 ± 0.097
2.825PheAsp: 2.825 ± 0.228
3.053PheGlu: 3.053 ± 0.253
1.724PhePhe: 1.724 ± 0.209
3.219PheGly: 3.219 ± 0.272
0.976PheHis: 0.976 ± 0.14
2.721PheIle: 2.721 ± 0.223
2.866PheLys: 2.866 ± 0.241
2.721PheLeu: 2.721 ± 0.242
1.329PheMet: 1.329 ± 0.172
2.617PheAsn: 2.617 ± 0.214
1.308PhePro: 1.308 ± 0.203
1.495PheGln: 1.495 ± 0.162
2.16PheArg: 2.16 ± 0.217
2.845PheSer: 2.845 ± 0.213
2.617PheThr: 2.617 ± 0.248
3.136PheVal: 3.136 ± 0.225
0.768PheTrp: 0.768 ± 0.128
1.599PheTyr: 1.599 ± 0.169
0.0PheXaa: 0.0 ± 0.0
Gly
3.967GlyAla: 3.967 ± 0.374
0.893GlyCys: 0.893 ± 0.13
4.092GlyAsp: 4.092 ± 0.283
4.611GlyGlu: 4.611 ± 0.32
2.825GlyPhe: 2.825 ± 0.219
5.006GlyGly: 5.006 ± 0.447
1.308GlyHis: 1.308 ± 0.17
4.985GlyIle: 4.985 ± 0.344
4.798GlyLys: 4.798 ± 0.317
5.151GlyLeu: 5.151 ± 0.336
1.807GlyMet: 1.807 ± 0.176
3.489GlyAsn: 3.489 ± 0.257
1.101GlyPro: 1.101 ± 0.156
2.534GlyGln: 2.534 ± 0.216
2.762GlyArg: 2.762 ± 0.238
4.964GlySer: 4.964 ± 0.396
3.655GlyThr: 3.655 ± 0.326
5.296GlyVal: 5.296 ± 0.348
1.225GlyTrp: 1.225 ± 0.172
2.638GlyTyr: 2.638 ± 0.226
0.0GlyXaa: 0.0 ± 0.0
His
1.059HisAla: 1.059 ± 0.161
0.291HisCys: 0.291 ± 0.069
1.329HisAsp: 1.329 ± 0.168
0.706HisGlu: 0.706 ± 0.118
0.935HisPhe: 0.935 ± 0.129
0.997HisGly: 0.997 ± 0.147
0.498HisHis: 0.498 ± 0.106
1.475HisIle: 1.475 ± 0.183
1.267HisLys: 1.267 ± 0.161
1.703HisLeu: 1.703 ± 0.172
0.644HisMet: 0.644 ± 0.103
0.706HisAsn: 0.706 ± 0.104
1.018HisPro: 1.018 ± 0.173
0.54HisGln: 0.54 ± 0.11
1.122HisArg: 1.122 ± 0.128
1.038HisSer: 1.038 ± 0.136
1.225HisThr: 1.225 ± 0.191
1.371HisVal: 1.371 ± 0.211
0.166HisTrp: 0.166 ± 0.062
0.997HisTyr: 0.997 ± 0.179
0.0HisXaa: 0.0 ± 0.0
Ile
4.133IleAla: 4.133 ± 0.309
0.706IleCys: 0.706 ± 0.123
5.234IleAsp: 5.234 ± 0.301
4.839IleGlu: 4.839 ± 0.311
1.869IlePhe: 1.869 ± 0.217
3.905IleGly: 3.905 ± 0.272
1.433IleHis: 1.433 ± 0.177
3.676IleIle: 3.676 ± 0.307
4.029IleLys: 4.029 ± 0.29
4.445IleLeu: 4.445 ± 0.313
1.703IleMet: 1.703 ± 0.186
3.78IleAsn: 3.78 ± 0.347
2.929IlePro: 2.929 ± 0.211
2.762IleGln: 2.762 ± 0.262
3.053IleArg: 3.053 ± 0.223
3.78IleSer: 3.78 ± 0.329
4.445IleThr: 4.445 ± 0.329
4.092IleVal: 4.092 ± 0.325
0.727IleTrp: 0.727 ± 0.128
2.264IleTyr: 2.264 ± 0.208
0.0IleXaa: 0.0 ± 0.0
Lys
4.175LysAla: 4.175 ± 0.368
0.498LysCys: 0.498 ± 0.112
3.655LysAsp: 3.655 ± 0.207
4.528LysGlu: 4.528 ± 0.364
3.115LysPhe: 3.115 ± 0.221
3.905LysGly: 3.905 ± 0.329
1.122LysHis: 1.122 ± 0.159
3.946LysIle: 3.946 ± 0.255
4.258LysLys: 4.258 ± 0.376
5.276LysLeu: 5.276 ± 0.339
2.222LysMet: 2.222 ± 0.278
2.617LysAsn: 2.617 ± 0.211
2.762LysPro: 2.762 ± 0.235
3.115LysGln: 3.115 ± 0.239
3.115LysArg: 3.115 ± 0.296
4.009LysSer: 4.009 ± 0.297
3.884LysThr: 3.884 ± 0.256
4.32LysVal: 4.32 ± 0.289
0.955LysTrp: 0.955 ± 0.141
2.202LysTyr: 2.202 ± 0.188
0.0LysXaa: 0.0 ± 0.0
Leu
6.127LeuAla: 6.127 ± 0.386
0.935LeuCys: 0.935 ± 0.157
4.943LeuAsp: 4.943 ± 0.3
5.13LeuGlu: 5.13 ± 0.332
3.531LeuPhe: 3.531 ± 0.281
5.213LeuGly: 5.213 ± 0.292
1.288LeuHis: 1.288 ± 0.172
4.382LeuIle: 4.382 ± 0.335
6.065LeuLys: 6.065 ± 0.392
6.231LeuLeu: 6.231 ± 0.45
1.89LeuMet: 1.89 ± 0.193
4.528LeuAsn: 4.528 ± 0.317
3.489LeuPro: 3.489 ± 0.258
2.887LeuGln: 2.887 ± 0.24
3.78LeuArg: 3.78 ± 0.271
5.462LeuSer: 5.462 ± 0.291
5.089LeuThr: 5.089 ± 0.297
5.4LeuVal: 5.4 ± 0.278
0.685LeuTrp: 0.685 ± 0.134
3.199LeuTyr: 3.199 ± 0.276
0.0LeuXaa: 0.0 ± 0.0
Met
2.43MetAla: 2.43 ± 0.245
0.353MetCys: 0.353 ± 0.094
1.537MetAsp: 1.537 ± 0.19
1.641MetGlu: 1.641 ± 0.18
1.288MetPhe: 1.288 ± 0.147
1.412MetGly: 1.412 ± 0.145
0.436MetHis: 0.436 ± 0.096
1.703MetIle: 1.703 ± 0.157
2.305MetLys: 2.305 ± 0.212
2.43MetLeu: 2.43 ± 0.246
1.018MetMet: 1.018 ± 0.145
1.682MetAsn: 1.682 ± 0.226
1.08MetPro: 1.08 ± 0.153
1.018MetGln: 1.018 ± 0.144
1.641MetArg: 1.641 ± 0.187
2.305MetSer: 2.305 ± 0.189
1.765MetThr: 1.765 ± 0.198
1.786MetVal: 1.786 ± 0.168
0.291MetTrp: 0.291 ± 0.087
0.852MetTyr: 0.852 ± 0.15
0.0MetXaa: 0.0 ± 0.0
Asn
3.842AsnAla: 3.842 ± 0.28
0.706AsnCys: 0.706 ± 0.126
3.199AsnAsp: 3.199 ± 0.272
2.762AsnGlu: 2.762 ± 0.255
2.056AsnPhe: 2.056 ± 0.228
4.237AsnGly: 4.237 ± 0.348
1.059AsnHis: 1.059 ± 0.183
3.448AsnIle: 3.448 ± 0.281
3.365AsnLys: 3.365 ± 0.245
3.635AsnLeu: 3.635 ± 0.273
1.828AsnMet: 1.828 ± 0.205
3.282AsnAsn: 3.282 ± 0.32
2.409AsnPro: 2.409 ± 0.261
2.139AsnGln: 2.139 ± 0.184
2.513AsnArg: 2.513 ± 0.218
2.825AsnSer: 2.825 ± 0.251
2.887AsnThr: 2.887 ± 0.297
3.78AsnVal: 3.78 ± 0.291
0.748AsnTrp: 0.748 ± 0.121
1.89AsnTyr: 1.89 ± 0.204
0.0AsnXaa: 0.0 ± 0.0
Pro
2.492ProAla: 2.492 ± 0.236
0.436ProCys: 0.436 ± 0.098
3.074ProAsp: 3.074 ± 0.244
3.406ProGlu: 3.406 ± 0.275
1.682ProPhe: 1.682 ± 0.2
2.555ProGly: 2.555 ± 0.208
0.644ProHis: 0.644 ± 0.113
2.181ProIle: 2.181 ± 0.203
2.222ProLys: 2.222 ± 0.197
2.991ProLeu: 2.991 ± 0.247
0.893ProMet: 0.893 ± 0.146
1.849ProAsn: 1.849 ± 0.167
1.246ProPro: 1.246 ± 0.192
1.246ProGln: 1.246 ± 0.149
1.62ProArg: 1.62 ± 0.196
2.804ProSer: 2.804 ± 0.253
2.617ProThr: 2.617 ± 0.266
2.825ProVal: 2.825 ± 0.28
0.582ProTrp: 0.582 ± 0.113
1.392ProTyr: 1.392 ± 0.175
0.0ProXaa: 0.0 ± 0.0
Gln
2.638GlnAla: 2.638 ± 0.242
0.395GlnCys: 0.395 ± 0.096
2.077GlnAsp: 2.077 ± 0.224
2.368GlnGlu: 2.368 ± 0.257
1.849GlnPhe: 1.849 ± 0.188
2.326GlnGly: 2.326 ± 0.224
0.893GlnHis: 0.893 ± 0.137
2.472GlnIle: 2.472 ± 0.236
2.119GlnLys: 2.119 ± 0.197
3.302GlnLeu: 3.302 ± 0.299
1.101GlnMet: 1.101 ± 0.141
1.641GlnAsn: 1.641 ± 0.177
1.225GlnPro: 1.225 ± 0.159
1.89GlnGln: 1.89 ± 0.231
2.326GlnArg: 2.326 ± 0.216
2.389GlnSer: 2.389 ± 0.241
2.243GlnThr: 2.243 ± 0.192
2.804GlnVal: 2.804 ± 0.236
0.519GlnTrp: 0.519 ± 0.097
1.558GlnTyr: 1.558 ± 0.17
0.0GlnXaa: 0.0 ± 0.0
Arg
2.929ArgAla: 2.929 ± 0.275
0.665ArgCys: 0.665 ± 0.118
2.908ArgAsp: 2.908 ± 0.265
3.136ArgGlu: 3.136 ± 0.251
2.285ArgPhe: 2.285 ± 0.197
2.762ArgGly: 2.762 ± 0.254
1.018ArgHis: 1.018 ± 0.123
3.261ArgIle: 3.261 ± 0.265
2.804ArgLys: 2.804 ± 0.304
4.632ArgLeu: 4.632 ± 0.326
1.433ArgMet: 1.433 ± 0.176
2.555ArgAsn: 2.555 ± 0.255
1.765ArgPro: 1.765 ± 0.22
2.243ArgGln: 2.243 ± 0.218
3.095ArgArg: 3.095 ± 0.279
2.949ArgSer: 2.949 ± 0.253
2.243ArgThr: 2.243 ± 0.197
3.302ArgVal: 3.302 ± 0.245
0.748ArgTrp: 0.748 ± 0.134
2.119ArgTyr: 2.119 ± 0.226
0.0ArgXaa: 0.0 ± 0.0
Ser
3.863SerAla: 3.863 ± 0.303
0.602SerCys: 0.602 ± 0.123
3.822SerAsp: 3.822 ± 0.255
3.863SerGlu: 3.863 ± 0.239
2.783SerPhe: 2.783 ± 0.219
5.026SerGly: 5.026 ± 0.434
0.685SerHis: 0.685 ± 0.142
4.32SerIle: 4.32 ± 0.303
3.925SerLys: 3.925 ± 0.287
5.379SerLeu: 5.379 ± 0.32
1.869SerMet: 1.869 ± 0.202
3.614SerAsn: 3.614 ± 0.271
2.492SerPro: 2.492 ± 0.216
2.222SerGln: 2.222 ± 0.213
3.095SerArg: 3.095 ± 0.303
4.092SerSer: 4.092 ± 0.341
3.697SerThr: 3.697 ± 0.35
4.673SerVal: 4.673 ± 0.345
0.727SerTrp: 0.727 ± 0.116
2.472SerTyr: 2.472 ± 0.216
0.0SerXaa: 0.0 ± 0.0
Thr
4.133ThrAla: 4.133 ± 0.358
0.519ThrCys: 0.519 ± 0.099
3.531ThrAsp: 3.531 ± 0.282
3.946ThrGlu: 3.946 ± 0.304
2.762ThrPhe: 2.762 ± 0.234
4.258ThrGly: 4.258 ± 0.353
0.976ThrHis: 0.976 ± 0.146
4.029ThrIle: 4.029 ± 0.315
3.655ThrLys: 3.655 ± 0.23
4.777ThrLeu: 4.777 ± 0.351
1.246ThrMet: 1.246 ± 0.168
2.804ThrAsn: 2.804 ± 0.238
3.469ThrPro: 3.469 ± 0.311
2.326ThrGln: 2.326 ± 0.203
2.887ThrArg: 2.887 ± 0.22
3.614ThrSer: 3.614 ± 0.4
4.071ThrThr: 4.071 ± 0.384
4.299ThrVal: 4.299 ± 0.352
0.748ThrTrp: 0.748 ± 0.12
1.849ThrTyr: 1.849 ± 0.246
0.0ThrXaa: 0.0 ± 0.0
Val
3.884ValAla: 3.884 ± 0.288
0.831ValCys: 0.831 ± 0.144
5.4ValAsp: 5.4 ± 0.308
5.13ValGlu: 5.13 ± 0.358
2.721ValPhe: 2.721 ± 0.242
4.735ValGly: 4.735 ± 0.334
1.267ValHis: 1.267 ± 0.158
4.237ValIle: 4.237 ± 0.299
5.172ValLys: 5.172 ± 0.333
5.483ValLeu: 5.483 ± 0.335
1.828ValMet: 1.828 ± 0.207
3.988ValAsn: 3.988 ± 0.286
2.555ValPro: 2.555 ± 0.219
2.451ValGln: 2.451 ± 0.262
2.991ValArg: 2.991 ± 0.21
4.673ValSer: 4.673 ± 0.298
4.735ValThr: 4.735 ± 0.428
6.21ValVal: 6.21 ± 0.387
1.142ValTrp: 1.142 ± 0.165
3.012ValTyr: 3.012 ± 0.216
0.0ValXaa: 0.0 ± 0.0
Trp
0.914TrpAla: 0.914 ± 0.143
0.312TrpCys: 0.312 ± 0.086
1.018TrpAsp: 1.018 ± 0.155
1.122TrpGlu: 1.122 ± 0.168
0.665TrpPhe: 0.665 ± 0.104
0.706TrpGly: 0.706 ± 0.123
0.166TrpHis: 0.166 ± 0.062
0.623TrpIle: 0.623 ± 0.125
0.789TrpLys: 0.789 ± 0.137
1.412TrpLeu: 1.412 ± 0.182
0.436TrpMet: 0.436 ± 0.098
0.665TrpAsn: 0.665 ± 0.104
0.415TrpPro: 0.415 ± 0.094
0.395TrpGln: 0.395 ± 0.089
0.872TrpArg: 0.872 ± 0.158
0.665TrpSer: 0.665 ± 0.116
0.768TrpThr: 0.768 ± 0.131
1.205TrpVal: 1.205 ± 0.152
0.187TrpTrp: 0.187 ± 0.06
0.415TrpTyr: 0.415 ± 0.086
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.43TyrAla: 2.43 ± 0.179
0.54TyrCys: 0.54 ± 0.089
2.845TyrAsp: 2.845 ± 0.256
2.098TyrGlu: 2.098 ± 0.191
1.724TyrPhe: 1.724 ± 0.177
2.472TyrGly: 2.472 ± 0.241
1.038TyrHis: 1.038 ± 0.15
2.015TyrIle: 2.015 ± 0.215
2.243TyrLys: 2.243 ± 0.202
2.887TyrLeu: 2.887 ± 0.233
1.163TyrMet: 1.163 ± 0.138
2.513TyrAsn: 2.513 ± 0.188
1.641TyrPro: 1.641 ± 0.171
1.495TyrGln: 1.495 ± 0.186
2.16TyrArg: 2.16 ± 0.19
2.492TyrSer: 2.492 ± 0.187
2.077TyrThr: 2.077 ± 0.295
3.282TyrVal: 3.282 ± 0.244
0.498TyrTrp: 0.498 ± 0.099
1.454TyrTyr: 1.454 ± 0.145
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 201 proteins (48148 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski