Amino acid dipepetide frequency for Bacillus phage YungSlug

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.969AlaAla: 0.969 ± 0.123
0.624AlaCys: 0.624 ± 0.151
3.659AlaAsp: 3.659 ± 0.267
4.929AlaGlu: 4.929 ± 0.329
2.755AlaPhe: 2.755 ± 0.324
4.327AlaGly: 4.327 ± 0.417
1.098AlaHis: 1.098 ± 0.144
4.133AlaIle: 4.133 ± 0.275
6.565AlaLys: 6.565 ± 0.41
4.951AlaLeu: 4.951 ± 0.359
1.83AlaMet: 1.83 ± 0.267
2.82AlaAsn: 2.82 ± 0.366
2.045AlaPro: 2.045 ± 0.226
2.411AlaGln: 2.411 ± 0.249
2.497AlaArg: 2.497 ± 0.273
2.927AlaSer: 2.927 ± 0.238
3.573AlaThr: 3.573 ± 0.332
3.853AlaVal: 3.853 ± 0.28
0.667AlaTrp: 0.667 ± 0.127
2.54AlaTyr: 2.54 ± 0.211
0.0AlaXaa: 0.0 ± 0.0
Cys
0.667CysAla: 0.667 ± 0.115
0.086CysCys: 0.086 ± 0.039
0.603CysAsp: 0.603 ± 0.119
0.861CysGlu: 0.861 ± 0.137
0.409CysPhe: 0.409 ± 0.094
0.495CysGly: 0.495 ± 0.119
0.129CysHis: 0.129 ± 0.058
0.387CysIle: 0.387 ± 0.088
0.947CysLys: 0.947 ± 0.175
0.603CysLeu: 0.603 ± 0.143
0.215CysMet: 0.215 ± 0.057
0.538CysAsn: 0.538 ± 0.116
0.323CysPro: 0.323 ± 0.087
0.344CysGln: 0.344 ± 0.084
0.301CysArg: 0.301 ± 0.08
0.474CysSer: 0.474 ± 0.11
0.452CysThr: 0.452 ± 0.091
0.667CysVal: 0.667 ± 0.103
0.215CysTrp: 0.215 ± 0.064
0.301CysTyr: 0.301 ± 0.077
0.0CysXaa: 0.0 ± 0.0
Asp
3.724AspAla: 3.724 ± 0.338
0.517AspCys: 0.517 ± 0.108
4.09AspAsp: 4.09 ± 0.267
5.338AspGlu: 5.338 ± 0.444
3.509AspPhe: 3.509 ± 0.258
4.8AspGly: 4.8 ± 0.388
0.646AspHis: 0.646 ± 0.112
4.09AspIle: 4.09 ± 0.271
5.704AspLys: 5.704 ± 0.373
5.381AspLeu: 5.381 ± 0.31
2.066AspMet: 2.066 ± 0.209
3.1AspAsn: 3.1 ± 0.249
1.7AspPro: 1.7 ± 0.224
1.012AspGln: 1.012 ± 0.162
2.368AspArg: 2.368 ± 0.234
3.121AspSer: 3.121 ± 0.237
3.336AspThr: 3.336 ± 0.378
3.702AspVal: 3.702 ± 0.304
1.313AspTrp: 1.313 ± 0.191
2.97AspTyr: 2.97 ± 0.267
0.0AspXaa: 0.0 ± 0.0
Glu
5.317GluAla: 5.317 ± 0.342
0.667GluCys: 0.667 ± 0.142
5.123GluAsp: 5.123 ± 0.38
9.019GluGlu: 9.019 ± 0.678
3.229GluPhe: 3.229 ± 0.266
4.951GluGly: 4.951 ± 0.344
1.485GluHis: 1.485 ± 0.201
5.618GluIle: 5.618 ± 0.405
7.878GluLys: 7.878 ± 0.536
7.728GluLeu: 7.728 ± 0.499
3.25GluMet: 3.25 ± 0.254
4.09GluAsn: 4.09 ± 0.275
2.109GluPro: 2.109 ± 0.243
2.841GluGln: 2.841 ± 0.233
4.004GluArg: 4.004 ± 0.324
3.81GluSer: 3.81 ± 0.271
4.434GluThr: 4.434 ± 0.337
6.737GluVal: 6.737 ± 0.44
1.098GluTrp: 1.098 ± 0.144
3.423GluTyr: 3.423 ± 0.314
0.0GluXaa: 0.0 ± 0.0
Phe
2.088PheAla: 2.088 ± 0.194
0.495PheCys: 0.495 ± 0.105
3.293PheAsp: 3.293 ± 0.268
3.509PheGlu: 3.509 ± 0.351
1.873PhePhe: 1.873 ± 0.206
3.078PheGly: 3.078 ± 0.26
0.624PheHis: 0.624 ± 0.146
2.239PheIle: 2.239 ± 0.257
3.745PheLys: 3.745 ± 0.296
3.595PheLeu: 3.595 ± 0.31
1.076PheMet: 1.076 ± 0.189
2.518PheAsn: 2.518 ± 0.244
1.442PhePro: 1.442 ± 0.211
1.184PheGln: 1.184 ± 0.172
1.873PheArg: 1.873 ± 0.208
2.368PheSer: 2.368 ± 0.228
2.626PheThr: 2.626 ± 0.259
2.691PheVal: 2.691 ± 0.238
0.56PheTrp: 0.56 ± 0.11
1.313PheTyr: 1.313 ± 0.172
0.0PheXaa: 0.0 ± 0.0
Gly
3.681GlyAla: 3.681 ± 0.425
0.689GlyCys: 0.689 ± 0.109
3.767GlyAsp: 3.767 ± 0.337
5.015GlyGlu: 5.015 ± 0.41
3.573GlyPhe: 3.573 ± 0.282
5.726GlyGly: 5.726 ± 0.724
1.292GlyHis: 1.292 ± 0.214
4.348GlyIle: 4.348 ± 0.344
5.919GlyLys: 5.919 ± 0.331
5.446GlyLeu: 5.446 ± 0.404
1.916GlyMet: 1.916 ± 0.258
3.681GlyAsn: 3.681 ± 0.352
0.538GlyPro: 0.538 ± 0.109
2.174GlyGln: 2.174 ± 0.223
3.035GlyArg: 3.035 ± 0.292
3.466GlySer: 3.466 ± 0.377
4.542GlyThr: 4.542 ± 0.466
4.757GlyVal: 4.757 ± 0.325
0.947GlyTrp: 0.947 ± 0.172
2.777GlyTyr: 2.777 ± 0.234
0.0GlyXaa: 0.0 ± 0.0
His
0.775HisAla: 0.775 ± 0.162
0.258HisCys: 0.258 ± 0.07
1.033HisAsp: 1.033 ± 0.151
1.033HisGlu: 1.033 ± 0.177
0.904HisPhe: 0.904 ± 0.146
1.033HisGly: 1.033 ± 0.154
0.323HisHis: 0.323 ± 0.082
1.119HisIle: 1.119 ± 0.184
1.485HisLys: 1.485 ± 0.206
1.571HisLeu: 1.571 ± 0.242
0.409HisMet: 0.409 ± 0.11
0.969HisAsn: 0.969 ± 0.144
0.689HisPro: 0.689 ± 0.111
0.387HisGln: 0.387 ± 0.076
0.71HisArg: 0.71 ± 0.131
1.076HisSer: 1.076 ± 0.177
0.969HisThr: 0.969 ± 0.125
0.796HisVal: 0.796 ± 0.146
0.344HisTrp: 0.344 ± 0.096
0.603HisTyr: 0.603 ± 0.114
0.0HisXaa: 0.0 ± 0.0
Ile
3.616IleAla: 3.616 ± 0.29
0.517IleCys: 0.517 ± 0.103
4.391IleAsp: 4.391 ± 0.341
5.231IleGlu: 5.231 ± 0.355
2.109IlePhe: 2.109 ± 0.236
4.133IleGly: 4.133 ± 0.316
1.076IleHis: 1.076 ± 0.174
3.272IleIle: 3.272 ± 0.31
5.898IleLys: 5.898 ± 0.325
4.499IleLeu: 4.499 ± 0.318
1.528IleMet: 1.528 ± 0.197
3.832IleAsn: 3.832 ± 0.305
2.023IlePro: 2.023 ± 0.218
2.023IleGln: 2.023 ± 0.207
3.078IleArg: 3.078 ± 0.286
3.143IleSer: 3.143 ± 0.257
3.466IleThr: 3.466 ± 0.277
4.047IleVal: 4.047 ± 0.289
0.56IleTrp: 0.56 ± 0.102
1.98IleTyr: 1.98 ± 0.196
0.0IleXaa: 0.0 ± 0.0
Lys
7.017LysAla: 7.017 ± 0.424
0.689LysCys: 0.689 ± 0.153
5.919LysAsp: 5.919 ± 0.404
10.311LysGlu: 10.311 ± 0.626
2.906LysPhe: 2.906 ± 0.259
5.467LysGly: 5.467 ± 0.422
1.744LysHis: 1.744 ± 0.247
5.015LysIle: 5.015 ± 0.286
8.567LysLys: 8.567 ± 0.535
7.555LysLeu: 7.555 ± 0.532
2.863LysMet: 2.863 ± 0.232
4.606LysAsn: 4.606 ± 0.357
3.207LysPro: 3.207 ± 0.276
3.358LysGln: 3.358 ± 0.292
4.413LysArg: 4.413 ± 0.35
4.994LysSer: 4.994 ± 0.38
4.951LysThr: 4.951 ± 0.281
6.285LysVal: 6.285 ± 0.329
1.227LysTrp: 1.227 ± 0.165
3.487LysTyr: 3.487 ± 0.321
0.0LysXaa: 0.0 ± 0.0
Leu
4.929LeuAla: 4.929 ± 0.311
0.624LeuCys: 0.624 ± 0.127
5.252LeuAsp: 5.252 ± 0.33
7.857LeuGlu: 7.857 ± 0.425
2.863LeuPhe: 2.863 ± 0.292
5.295LeuGly: 5.295 ± 0.363
1.076LeuHis: 1.076 ± 0.174
4.219LeuIle: 4.219 ± 0.311
7.964LeuLys: 7.964 ± 0.442
6.135LeuLeu: 6.135 ± 0.391
2.368LeuMet: 2.368 ± 0.228
4.585LeuAsn: 4.585 ± 0.376
2.454LeuPro: 2.454 ± 0.229
2.712LeuGln: 2.712 ± 0.262
3.358LeuArg: 3.358 ± 0.323
5.015LeuSer: 5.015 ± 0.303
5.446LeuThr: 5.446 ± 0.331
4.908LeuVal: 4.908 ± 0.379
0.775LeuTrp: 0.775 ± 0.148
2.239LeuTyr: 2.239 ± 0.236
0.0LeuXaa: 0.0 ± 0.0
Met
1.916MetAla: 1.916 ± 0.257
0.237MetCys: 0.237 ± 0.069
1.593MetAsp: 1.593 ± 0.2
2.411MetGlu: 2.411 ± 0.229
1.335MetPhe: 1.335 ± 0.144
1.851MetGly: 1.851 ± 0.216
0.366MetHis: 0.366 ± 0.092
1.636MetIle: 1.636 ± 0.192
3.207MetLys: 3.207 ± 0.299
2.002MetLeu: 2.002 ± 0.23
0.904MetMet: 0.904 ± 0.166
2.153MetAsn: 2.153 ± 0.204
0.947MetPro: 0.947 ± 0.166
0.732MetGln: 0.732 ± 0.127
1.378MetArg: 1.378 ± 0.168
2.002MetSer: 2.002 ± 0.202
2.002MetThr: 2.002 ± 0.202
1.7MetVal: 1.7 ± 0.181
0.258MetTrp: 0.258 ± 0.08
0.99MetTyr: 0.99 ± 0.117
0.0MetXaa: 0.0 ± 0.0
Asn
3.509AsnAla: 3.509 ± 0.309
0.624AsnCys: 0.624 ± 0.129
2.54AsnAsp: 2.54 ± 0.224
3.573AsnGlu: 3.573 ± 0.286
2.411AsnPhe: 2.411 ± 0.269
4.649AsnGly: 4.649 ± 0.393
0.775AsnHis: 0.775 ± 0.121
3.25AsnIle: 3.25 ± 0.271
4.284AsnLys: 4.284 ± 0.358
4.348AsnLeu: 4.348 ± 0.315
1.787AsnMet: 1.787 ± 0.218
3.121AsnAsn: 3.121 ± 0.345
2.755AsnPro: 2.755 ± 0.268
2.088AsnGln: 2.088 ± 0.205
2.153AsnArg: 2.153 ± 0.217
3.078AsnSer: 3.078 ± 0.315
2.518AsnThr: 2.518 ± 0.226
3.121AsnVal: 3.121 ± 0.26
0.775AsnTrp: 0.775 ± 0.128
2.153AsnTyr: 2.153 ± 0.233
0.0AsnXaa: 0.0 ± 0.0
Pro
2.153ProAla: 2.153 ± 0.214
0.215ProCys: 0.215 ± 0.066
2.368ProAsp: 2.368 ± 0.209
2.97ProGlu: 2.97 ± 0.268
1.248ProPhe: 1.248 ± 0.146
1.442ProGly: 1.442 ± 0.204
0.646ProHis: 0.646 ± 0.124
1.873ProIle: 1.873 ± 0.234
3.164ProLys: 3.164 ± 0.308
2.088ProLeu: 2.088 ± 0.233
0.71ProMet: 0.71 ± 0.127
1.937ProAsn: 1.937 ± 0.217
1.162ProPro: 1.162 ± 0.198
1.055ProGln: 1.055 ± 0.167
0.926ProArg: 0.926 ± 0.147
2.002ProSer: 2.002 ± 0.188
2.088ProThr: 2.088 ± 0.213
1.83ProVal: 1.83 ± 0.218
0.387ProTrp: 0.387 ± 0.077
1.614ProTyr: 1.614 ± 0.179
0.0ProXaa: 0.0 ± 0.0
Gln
2.131GlnAla: 2.131 ± 0.221
0.237GlnCys: 0.237 ± 0.07
1.442GlnAsp: 1.442 ± 0.169
2.282GlnGlu: 2.282 ± 0.214
1.205GlnPhe: 1.205 ± 0.147
2.325GlnGly: 2.325 ± 0.276
0.474GlnHis: 0.474 ± 0.109
2.023GlnIle: 2.023 ± 0.193
3.229GlnLys: 3.229 ± 0.273
3.229GlnLeu: 3.229 ± 0.311
0.775GlnMet: 0.775 ± 0.131
1.528GlnAsn: 1.528 ± 0.198
1.485GlnPro: 1.485 ± 0.183
0.753GlnGln: 0.753 ± 0.215
1.873GlnArg: 1.873 ± 0.212
1.679GlnSer: 1.679 ± 0.216
1.679GlnThr: 1.679 ± 0.175
2.088GlnVal: 2.088 ± 0.231
0.387GlnTrp: 0.387 ± 0.084
0.99GlnTyr: 0.99 ± 0.155
0.0GlnXaa: 0.0 ± 0.0
Arg
2.648ArgAla: 2.648 ± 0.318
0.301ArgCys: 0.301 ± 0.094
2.389ArgAsp: 2.389 ± 0.239
3.466ArgGlu: 3.466 ± 0.279
2.066ArgPhe: 2.066 ± 0.238
2.368ArgGly: 2.368 ± 0.276
0.624ArgHis: 0.624 ± 0.116
2.927ArgIle: 2.927 ± 0.295
4.822ArgLys: 4.822 ± 0.37
3.681ArgLeu: 3.681 ± 0.281
1.7ArgMet: 1.7 ± 0.18
2.432ArgAsn: 2.432 ± 0.229
1.184ArgPro: 1.184 ± 0.169
1.27ArgGln: 1.27 ± 0.16
1.873ArgArg: 1.873 ± 0.249
1.744ArgSer: 1.744 ± 0.201
2.066ArgThr: 2.066 ± 0.208
3.423ArgVal: 3.423 ± 0.223
0.732ArgTrp: 0.732 ± 0.12
1.421ArgTyr: 1.421 ± 0.186
0.0ArgXaa: 0.0 ± 0.0
Ser
3.315SerAla: 3.315 ± 0.289
0.581SerCys: 0.581 ± 0.13
3.25SerAsp: 3.25 ± 0.325
4.111SerGlu: 4.111 ± 0.292
2.518SerPhe: 2.518 ± 0.203
3.315SerGly: 3.315 ± 0.275
0.99SerHis: 0.99 ± 0.147
3.401SerIle: 3.401 ± 0.304
5.338SerLys: 5.338 ± 0.326
4.176SerLeu: 4.176 ± 0.31
1.335SerMet: 1.335 ± 0.176
2.648SerAsn: 2.648 ± 0.23
1.765SerPro: 1.765 ± 0.192
1.937SerGln: 1.937 ± 0.204
2.26SerArg: 2.26 ± 0.238
2.26SerSer: 2.26 ± 0.263
2.755SerThr: 2.755 ± 0.308
3.853SerVal: 3.853 ± 0.323
0.926SerTrp: 0.926 ± 0.151
2.109SerTyr: 2.109 ± 0.198
0.0SerXaa: 0.0 ± 0.0
Thr
3.272ThrAla: 3.272 ± 0.346
0.452ThrCys: 0.452 ± 0.112
3.336ThrAsp: 3.336 ± 0.301
4.025ThrGlu: 4.025 ± 0.324
2.777ThrPhe: 2.777 ± 0.252
4.154ThrGly: 4.154 ± 0.369
1.184ThrHis: 1.184 ± 0.17
3.767ThrIle: 3.767 ± 0.367
5.381ThrLys: 5.381 ± 0.285
4.413ThrLeu: 4.413 ± 0.38
1.442ThrMet: 1.442 ± 0.207
2.863ThrAsn: 2.863 ± 0.257
2.562ThrPro: 2.562 ± 0.354
1.765ThrGln: 1.765 ± 0.177
1.851ThrArg: 1.851 ± 0.205
3.186ThrSer: 3.186 ± 0.299
3.53ThrThr: 3.53 ± 0.395
4.391ThrVal: 4.391 ± 0.339
1.055ThrTrp: 1.055 ± 0.137
2.54ThrTyr: 2.54 ± 0.21
0.0ThrXaa: 0.0 ± 0.0
Val
4.284ValAla: 4.284 ± 0.298
0.581ValCys: 0.581 ± 0.118
4.52ValAsp: 4.52 ± 0.294
6.479ValGlu: 6.479 ± 0.41
2.798ValPhe: 2.798 ± 0.275
4.456ValGly: 4.456 ± 0.353
1.098ValHis: 1.098 ± 0.187
3.659ValIle: 3.659 ± 0.31
6.328ValLys: 6.328 ± 0.281
4.434ValLeu: 4.434 ± 0.329
1.937ValMet: 1.937 ± 0.215
3.143ValAsn: 3.143 ± 0.271
2.153ValPro: 2.153 ± 0.231
2.196ValGln: 2.196 ± 0.187
2.906ValArg: 2.906 ± 0.286
3.81ValSer: 3.81 ± 0.288
4.542ValThr: 4.542 ± 0.437
5.36ValVal: 5.36 ± 0.464
0.624ValTrp: 0.624 ± 0.108
3.057ValTyr: 3.057 ± 0.305
0.0ValXaa: 0.0 ± 0.0
Trp
0.732TrpAla: 0.732 ± 0.13
0.172TrpCys: 0.172 ± 0.061
1.162TrpAsp: 1.162 ± 0.161
1.313TrpGlu: 1.313 ± 0.192
0.387TrpPhe: 0.387 ± 0.087
1.033TrpGly: 1.033 ± 0.182
0.258TrpHis: 0.258 ± 0.084
1.076TrpIle: 1.076 ± 0.2
1.119TrpLys: 1.119 ± 0.164
0.926TrpLeu: 0.926 ± 0.167
0.409TrpMet: 0.409 ± 0.109
0.775TrpAsn: 0.775 ± 0.13
0.28TrpPro: 0.28 ± 0.084
0.323TrpGln: 0.323 ± 0.085
0.517TrpArg: 0.517 ± 0.092
0.796TrpSer: 0.796 ± 0.136
0.732TrpThr: 0.732 ± 0.127
0.947TrpVal: 0.947 ± 0.177
0.237TrpTrp: 0.237 ± 0.084
0.474TrpTyr: 0.474 ± 0.09
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.432TyrAla: 2.432 ± 0.277
0.452TyrCys: 0.452 ± 0.103
2.755TyrAsp: 2.755 ± 0.293
3.078TyrGlu: 3.078 ± 0.254
1.313TyrPhe: 1.313 ± 0.182
2.389TyrGly: 2.389 ± 0.238
0.581TyrHis: 0.581 ± 0.125
2.368TyrIle: 2.368 ± 0.248
3.057TyrLys: 3.057 ± 0.26
3.315TyrLeu: 3.315 ± 0.24
1.076TyrMet: 1.076 ± 0.141
2.174TyrAsn: 2.174 ± 0.254
1.055TyrPro: 1.055 ± 0.138
1.248TyrGln: 1.248 ± 0.154
1.765TyrArg: 1.765 ± 0.244
1.894TyrSer: 1.894 ± 0.208
2.239TyrThr: 2.239 ± 0.23
3.207TyrVal: 3.207 ± 0.292
0.581TyrTrp: 0.581 ± 0.114
1.7TyrTyr: 1.7 ± 0.232
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 227 proteins (46458 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski