Amino acid dipeptide frequency for Vibrio phage BBMuffin

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
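The counting and the uniform baseline described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Proteome-pI implementation: the function names `dipeptide_frequencies` and `classify`, the toy sequences, the choice to count overlapping pairs within each protein, and the use of 1/400 as the red/blue cut-off are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (Xaa / 'X' is left out here).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1 / 400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of two residues inside each protein;
        # pairs never span two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(AMINO_ACIDS, repeat=2)}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (assumed cut-off)."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"   # would be shown in red
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"  # would be shown in blue
    return "expected"


# Hypothetical toy input, not taken from the BBMuffin proteome.
toy_proteins = ["MKLAAK", "AAK"]
freqs = dipeptide_frequencies(toy_proteins)
print(round(freqs["AA"], 1), classify(freqs["AA"]))
```

With real data the input would be the protein sequences of the proteome. Sequences containing non-standard letters would contribute to the total but not to the 400 reported pairs, so the reported values would then sum to slightly less than 1000‰.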

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
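As a small worked example of the unit conversion, the snippet below turns a per mille value from the table into a fraction and a percentage. The helper names are hypothetical and chosen only for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 6.343 per mille in the table.
ala_ala = 6.343
print(round(per_mille_to_fraction(ala_ala), 6))  # fraction of all dipeptides
print(round(per_mille_to_percent(ala_ala), 4))   # same value in percent

# The uniform expectation of 0.25% corresponds to 2.5 per mille.
print(per_mille_to_percent(2.5))
```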

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.343AlaAla: 6.343 ± 0.906
0.862AlaCys: 0.862 ± 0.159
4.674AlaAsp: 4.674 ± 0.392
5.62AlaGlu: 5.62 ± 0.526
3.005AlaPhe: 3.005 ± 0.282
4.424AlaGly: 4.424 ± 0.371
1.057AlaHis: 1.057 ± 0.18
4.451AlaIle: 4.451 ± 0.358
7.067AlaLys: 7.067 ± 0.597
6.148AlaLeu: 6.148 ± 0.453
2.114AlaMet: 2.114 ± 0.271
3.951AlaAsn: 3.951 ± 0.345
2.56AlaPro: 2.56 ± 0.304
3.116AlaGln: 3.116 ± 0.299
3.505AlaArg: 3.505 ± 0.312
5.119AlaSer: 5.119 ± 0.656
4.897AlaThr: 4.897 ± 0.404
4.257AlaVal: 4.257 ± 0.381
1.002AlaTrp: 1.002 ± 0.147
2.977AlaTyr: 2.977 ± 0.282
0.0AlaXaa: 0.0 ± 0.0
Cys
0.89CysAla: 0.89 ± 0.152
0.111CysCys: 0.111 ± 0.058
0.807CysAsp: 0.807 ± 0.18
1.029CysGlu: 1.029 ± 0.172
0.445CysPhe: 0.445 ± 0.112
0.862CysGly: 0.862 ± 0.165
0.278CysHis: 0.278 ± 0.092
0.779CysIle: 0.779 ± 0.137
0.946CysLys: 0.946 ± 0.149
0.696CysLeu: 0.696 ± 0.144
0.334CysMet: 0.334 ± 0.101
0.779CysAsn: 0.779 ± 0.137
0.501CysPro: 0.501 ± 0.125
0.556CysGln: 0.556 ± 0.126
0.612CysArg: 0.612 ± 0.116
0.835CysSer: 0.835 ± 0.203
0.862CysThr: 0.862 ± 0.155
0.807CysVal: 0.807 ± 0.164
0.167CysTrp: 0.167 ± 0.068
0.584CysTyr: 0.584 ± 0.137
0.0CysXaa: 0.0 ± 0.0
Asp
4.897AspAla: 4.897 ± 0.367
0.64AspCys: 0.64 ± 0.141
3.617AspAsp: 3.617 ± 0.344
4.535AspGlu: 4.535 ± 0.378
3.032AspPhe: 3.032 ± 0.331
4.479AspGly: 4.479 ± 0.445
0.946AspHis: 0.946 ± 0.156
4.424AspIle: 4.424 ± 0.325
4.618AspLys: 4.618 ± 0.341
5.314AspLeu: 5.314 ± 0.392
1.864AspMet: 1.864 ± 0.222
3.422AspAsn: 3.422 ± 0.329
2.198AspPro: 2.198 ± 0.267
1.586AspGln: 1.586 ± 0.181
2.448AspArg: 2.448 ± 0.295
3.617AspSer: 3.617 ± 0.36
4.257AspThr: 4.257 ± 0.324
4.173AspVal: 4.173 ± 0.304
1.113AspTrp: 1.113 ± 0.167
2.309AspTyr: 2.309 ± 0.288
0.0AspXaa: 0.0 ± 0.0
Glu
6.009GluAla: 6.009 ± 0.525
1.002GluCys: 1.002 ± 0.19
3.895GluAsp: 3.895 ± 0.293
4.451GluGlu: 4.451 ± 0.428
2.838GluPhe: 2.838 ± 0.312
3.978GluGly: 3.978 ± 0.338
1.53GluHis: 1.53 ± 0.186
4.702GluIle: 4.702 ± 0.364
4.897GluLys: 4.897 ± 0.46
7.122GluLeu: 7.122 ± 0.535
2.254GluMet: 2.254 ± 0.286
3.533GluAsn: 3.533 ± 0.302
1.808GluPro: 1.808 ± 0.242
2.949GluGln: 2.949 ± 0.318
2.866GluArg: 2.866 ± 0.34
2.977GluSer: 2.977 ± 0.338
3.7GluThr: 3.7 ± 0.319
5.23GluVal: 5.23 ± 0.364
1.113GluTrp: 1.113 ± 0.182
3.088GluTyr: 3.088 ± 0.289
0.0GluXaa: 0.0 ± 0.0
Phe
2.532PheAla: 2.532 ± 0.271
0.751PheCys: 0.751 ± 0.136
3.227PheAsp: 3.227 ± 0.299
2.866PheGlu: 2.866 ± 0.332
1.669PhePhe: 1.669 ± 0.197
2.643PheGly: 2.643 ± 0.254
1.196PheHis: 1.196 ± 0.203
2.587PheIle: 2.587 ± 0.278
2.866PheLys: 2.866 ± 0.337
2.587PheLeu: 2.587 ± 0.281
1.113PheMet: 1.113 ± 0.175
2.17PheAsn: 2.17 ± 0.24
1.419PhePro: 1.419 ± 0.204
1.029PheGln: 1.029 ± 0.151
1.614PheArg: 1.614 ± 0.212
2.671PheSer: 2.671 ± 0.285
2.643PheThr: 2.643 ± 0.268
2.281PheVal: 2.281 ± 0.326
0.584PheTrp: 0.584 ± 0.103
1.419PheTyr: 1.419 ± 0.236
0.0PheXaa: 0.0 ± 0.0
Gly
4.201GlyAla: 4.201 ± 0.442
0.946GlyCys: 0.946 ± 0.179
4.785GlyAsp: 4.785 ± 0.455
3.951GlyGlu: 3.951 ± 0.384
3.144GlyPhe: 3.144 ± 0.251
3.617GlyGly: 3.617 ± 0.406
1.029GlyHis: 1.029 ± 0.171
4.09GlyIle: 4.09 ± 0.306
5.147GlyLys: 5.147 ± 0.409
5.147GlyLeu: 5.147 ± 0.417
1.836GlyMet: 1.836 ± 0.186
3.394GlyAsn: 3.394 ± 0.345
0.529GlyPro: 0.529 ± 0.116
2.003GlyGln: 2.003 ± 0.21
3.144GlyArg: 3.144 ± 0.3
4.284GlySer: 4.284 ± 0.443
4.924GlyThr: 4.924 ± 0.412
4.201GlyVal: 4.201 ± 0.319
1.419GlyTrp: 1.419 ± 0.227
2.949GlyTyr: 2.949 ± 0.321
0.0GlyXaa: 0.0 ± 0.0
His
1.085HisAla: 1.085 ± 0.19
0.362HisCys: 0.362 ± 0.091
1.502HisAsp: 1.502 ± 0.221
1.419HisGlu: 1.419 ± 0.199
0.807HisPhe: 0.807 ± 0.132
1.614HisGly: 1.614 ± 0.207
0.64HisHis: 0.64 ± 0.144
1.335HisIle: 1.335 ± 0.163
1.419HisLys: 1.419 ± 0.174
2.281HisLeu: 2.281 ± 0.216
0.584HisMet: 0.584 ± 0.158
0.946HisAsn: 0.946 ± 0.174
0.862HisPro: 0.862 ± 0.153
0.556HisGln: 0.556 ± 0.117
0.946HisArg: 0.946 ± 0.156
0.862HisSer: 0.862 ± 0.129
1.168HisThr: 1.168 ± 0.177
1.113HisVal: 1.113 ± 0.22
0.278HisTrp: 0.278 ± 0.086
0.835HisTyr: 0.835 ± 0.148
0.0HisXaa: 0.0 ± 0.0
Ile
4.424IleAla: 4.424 ± 0.317
0.779IleCys: 0.779 ± 0.17
3.784IleAsp: 3.784 ± 0.404
5.425IleGlu: 5.425 ± 0.419
1.808IlePhe: 1.808 ± 0.222
3.422IleGly: 3.422 ± 0.288
1.113IleHis: 1.113 ± 0.202
3.645IleIle: 3.645 ± 0.327
4.73IleLys: 4.73 ± 0.34
4.368IleLeu: 4.368 ± 0.316
1.808IleMet: 1.808 ± 0.22
3.422IleAsn: 3.422 ± 0.34
2.226IlePro: 2.226 ± 0.25
2.532IleGln: 2.532 ± 0.234
2.893IleArg: 2.893 ± 0.297
4.284IleSer: 4.284 ± 0.317
4.145IleThr: 4.145 ± 0.373
3.672IleVal: 3.672 ± 0.356
0.529IleTrp: 0.529 ± 0.115
2.031IleTyr: 2.031 ± 0.262
0.0IleXaa: 0.0 ± 0.0
Lys
6.594LysAla: 6.594 ± 0.57
0.723LysCys: 0.723 ± 0.135
4.062LysAsp: 4.062 ± 0.391
5.286LysGlu: 5.286 ± 0.452
3.116LysPhe: 3.116 ± 0.303
4.646LysGly: 4.646 ± 0.348
1.641LysHis: 1.641 ± 0.22
4.34LysIle: 4.34 ± 0.319
4.006LysLys: 4.006 ± 0.301
7.484LysLeu: 7.484 ± 0.5
2.365LysMet: 2.365 ± 0.266
2.726LysAsn: 2.726 ± 0.268
3.311LysPro: 3.311 ± 0.432
2.699LysGln: 2.699 ± 0.279
2.949LysArg: 2.949 ± 0.312
3.756LysSer: 3.756 ± 0.325
4.062LysThr: 4.062 ± 0.386
4.73LysVal: 4.73 ± 0.371
0.807LysTrp: 0.807 ± 0.138
3.06LysTyr: 3.06 ± 0.378
0.0LysXaa: 0.0 ± 0.0
Leu
7.706LeuAla: 7.706 ± 0.536
1.057LeuCys: 1.057 ± 0.158
6.399LeuAsp: 6.399 ± 0.357
6.566LeuGlu: 6.566 ± 0.457
2.42LeuPhe: 2.42 ± 0.231
5.898LeuGly: 5.898 ± 0.485
1.947LeuHis: 1.947 ± 0.252
4.897LeuIle: 4.897 ± 0.386
5.008LeuLys: 5.008 ± 0.352
6.315LeuLeu: 6.315 ± 0.447
2.198LeuMet: 2.198 ± 0.211
3.839LeuAsn: 3.839 ± 0.349
3.589LeuPro: 3.589 ± 0.36
3.199LeuGln: 3.199 ± 0.281
3.895LeuArg: 3.895 ± 0.307
4.98LeuSer: 4.98 ± 0.432
4.674LeuThr: 4.674 ± 0.341
6.26LeuVal: 6.26 ± 0.421
0.779LeuTrp: 0.779 ± 0.186
2.726LeuTyr: 2.726 ± 0.294
0.0LeuXaa: 0.0 ± 0.0
Met
2.448MetAla: 2.448 ± 0.302
0.445MetCys: 0.445 ± 0.115
1.475MetAsp: 1.475 ± 0.19
1.92MetGlu: 1.92 ± 0.23
1.168MetPhe: 1.168 ± 0.19
1.586MetGly: 1.586 ± 0.203
0.584MetHis: 0.584 ± 0.138
1.641MetIle: 1.641 ± 0.221
2.142MetLys: 2.142 ± 0.238
2.393MetLeu: 2.393 ± 0.258
0.751MetMet: 0.751 ± 0.157
1.419MetAsn: 1.419 ± 0.222
1.308MetPro: 1.308 ± 0.2
0.974MetGln: 0.974 ± 0.186
1.224MetArg: 1.224 ± 0.179
1.614MetSer: 1.614 ± 0.206
1.697MetThr: 1.697 ± 0.191
1.641MetVal: 1.641 ± 0.242
0.334MetTrp: 0.334 ± 0.084
0.89MetTyr: 0.89 ± 0.148
0.0MetXaa: 0.0 ± 0.0
Asn
3.589AsnAla: 3.589 ± 0.334
0.64AsnCys: 0.64 ± 0.148
2.42AsnAsp: 2.42 ± 0.257
2.893AsnGlu: 2.893 ± 0.284
1.391AsnPhe: 1.391 ± 0.173
3.784AsnGly: 3.784 ± 0.289
1.057AsnHis: 1.057 ± 0.185
3.283AsnIle: 3.283 ± 0.293
4.618AsnLys: 4.618 ± 0.332
4.229AsnLeu: 4.229 ± 0.283
1.308AsnMet: 1.308 ± 0.191
2.309AsnAsn: 2.309 ± 0.277
2.448AsnPro: 2.448 ± 0.304
1.502AsnGln: 1.502 ± 0.217
2.003AsnArg: 2.003 ± 0.284
3.839AsnSer: 3.839 ± 0.35
3.172AsnThr: 3.172 ± 0.343
3.255AsnVal: 3.255 ± 0.295
0.779AsnTrp: 0.779 ± 0.15
2.226AsnTyr: 2.226 ± 0.256
0.0AsnXaa: 0.0 ± 0.0
Pro
2.476ProAla: 2.476 ± 0.312
0.473ProCys: 0.473 ± 0.125
2.448ProAsp: 2.448 ± 0.26
3.255ProGlu: 3.255 ± 0.298
1.308ProPhe: 1.308 ± 0.149
1.864ProGly: 1.864 ± 0.307
0.779ProHis: 0.779 ± 0.129
2.226ProIle: 2.226 ± 0.239
2.031ProLys: 2.031 ± 0.265
2.726ProLeu: 2.726 ± 0.292
1.085ProMet: 1.085 ± 0.161
2.059ProAsn: 2.059 ± 0.287
0.807ProPro: 0.807 ± 0.166
0.974ProGln: 0.974 ± 0.149
1.308ProArg: 1.308 ± 0.204
2.42ProSer: 2.42 ± 0.335
2.059ProThr: 2.059 ± 0.24
2.42ProVal: 2.42 ± 0.243
0.389ProTrp: 0.389 ± 0.1
1.502ProTyr: 1.502 ± 0.237
0.0ProXaa: 0.0 ± 0.0
Gln
3.144GlnAla: 3.144 ± 0.328
0.584GlnCys: 0.584 ± 0.119
2.17GlnAsp: 2.17 ± 0.211
2.337GlnGlu: 2.337 ± 0.278
1.447GlnPhe: 1.447 ± 0.198
2.448GlnGly: 2.448 ± 0.25
0.751GlnHis: 0.751 ± 0.145
2.254GlnIle: 2.254 ± 0.27
2.114GlnLys: 2.114 ± 0.238
3.672GlnLeu: 3.672 ± 0.394
1.002GlnMet: 1.002 ± 0.173
1.141GlnAsn: 1.141 ± 0.198
1.002GlnPro: 1.002 ± 0.175
1.168GlnGln: 1.168 ± 0.249
1.558GlnArg: 1.558 ± 0.234
2.142GlnSer: 2.142 ± 0.367
1.975GlnThr: 1.975 ± 0.204
2.726GlnVal: 2.726 ± 0.249
0.529GlnTrp: 0.529 ± 0.145
1.308GlnTyr: 1.308 ± 0.182
0.0GlnXaa: 0.0 ± 0.0
Arg
3.283ArgAla: 3.283 ± 0.307
0.445ArgCys: 0.445 ± 0.104
3.06ArgAsp: 3.06 ± 0.348
2.726ArgGlu: 2.726 ± 0.316
1.753ArgPhe: 1.753 ± 0.232
2.838ArgGly: 2.838 ± 0.268
0.974ArgHis: 0.974 ± 0.158
2.81ArgIle: 2.81 ± 0.281
3.172ArgLys: 3.172 ± 0.316
3.617ArgLeu: 3.617 ± 0.324
1.085ArgMet: 1.085 ± 0.163
2.254ArgAsn: 2.254 ± 0.276
1.502ArgPro: 1.502 ± 0.203
1.502ArgGln: 1.502 ± 0.189
1.92ArgArg: 1.92 ± 0.193
2.337ArgSer: 2.337 ± 0.235
2.337ArgThr: 2.337 ± 0.239
3.283ArgVal: 3.283 ± 0.31
0.556ArgTrp: 0.556 ± 0.121
1.808ArgTyr: 1.808 ± 0.218
0.0ArgXaa: 0.0 ± 0.0
Ser
4.702SerAla: 4.702 ± 0.531
0.668SerCys: 0.668 ± 0.166
3.533SerAsp: 3.533 ± 0.364
3.617SerGlu: 3.617 ± 0.444
2.476SerPhe: 2.476 ± 0.282
5.564SerGly: 5.564 ± 0.512
1.141SerHis: 1.141 ± 0.186
3.339SerIle: 3.339 ± 0.348
4.897SerLys: 4.897 ± 0.356
5.509SerLeu: 5.509 ± 0.441
1.669SerMet: 1.669 ± 0.215
3.366SerAsn: 3.366 ± 0.317
2.114SerPro: 2.114 ± 0.224
2.087SerGln: 2.087 ± 0.275
2.337SerArg: 2.337 ± 0.279
4.173SerSer: 4.173 ± 0.512
3.645SerThr: 3.645 ± 0.388
3.978SerVal: 3.978 ± 0.362
0.807SerTrp: 0.807 ± 0.179
2.365SerTyr: 2.365 ± 0.237
0.0SerXaa: 0.0 ± 0.0
Thr
4.535ThrAla: 4.535 ± 0.379
0.807ThrCys: 0.807 ± 0.165
3.7ThrAsp: 3.7 ± 0.336
3.7ThrGlu: 3.7 ± 0.316
2.81ThrPhe: 2.81 ± 0.298
4.145ThrGly: 4.145 ± 0.362
1.28ThrHis: 1.28 ± 0.188
4.09ThrIle: 4.09 ± 0.343
4.507ThrLys: 4.507 ± 0.445
4.869ThrLeu: 4.869 ± 0.362
1.141ThrMet: 1.141 ± 0.16
3.45ThrAsn: 3.45 ± 0.406
2.393ThrPro: 2.393 ± 0.273
2.198ThrGln: 2.198 ± 0.306
2.448ThrArg: 2.448 ± 0.25
4.006ThrSer: 4.006 ± 0.468
4.118ThrThr: 4.118 ± 0.427
3.867ThrVal: 3.867 ± 0.311
0.751ThrTrp: 0.751 ± 0.139
2.699ThrTyr: 2.699 ± 0.236
0.0ThrXaa: 0.0 ± 0.0
Val
4.757ValAla: 4.757 ± 0.401
0.668ValCys: 0.668 ± 0.144
4.118ValAsp: 4.118 ± 0.36
4.813ValGlu: 4.813 ± 0.326
3.199ValPhe: 3.199 ± 0.27
3.561ValGly: 3.561 ± 0.334
1.391ValHis: 1.391 ± 0.185
3.06ValIle: 3.06 ± 0.301
4.785ValLys: 4.785 ± 0.386
4.952ValLeu: 4.952 ± 0.294
1.391ValMet: 1.391 ± 0.19
3.505ValAsn: 3.505 ± 0.286
2.198ValPro: 2.198 ± 0.239
2.281ValGln: 2.281 ± 0.26
3.339ValArg: 3.339 ± 0.26
5.203ValSer: 5.203 ± 0.418
4.173ValThr: 4.173 ± 0.328
4.674ValVal: 4.674 ± 0.371
0.807ValTrp: 0.807 ± 0.154
2.532ValTyr: 2.532 ± 0.306
0.0ValXaa: 0.0 ± 0.0
Trp
1.196TrpAla: 1.196 ± 0.197
0.25TrpCys: 0.25 ± 0.084
1.085TrpAsp: 1.085 ± 0.168
0.696TrpGlu: 0.696 ± 0.125
0.556TrpPhe: 0.556 ± 0.118
0.835TrpGly: 0.835 ± 0.136
0.473TrpHis: 0.473 ± 0.111
0.612TrpIle: 0.612 ± 0.15
0.862TrpLys: 0.862 ± 0.162
1.363TrpLeu: 1.363 ± 0.235
0.417TrpMet: 0.417 ± 0.112
0.723TrpAsn: 0.723 ± 0.157
0.278TrpPro: 0.278 ± 0.117
0.612TrpGln: 0.612 ± 0.134
0.501TrpArg: 0.501 ± 0.121
0.862TrpSer: 0.862 ± 0.2
0.445TrpThr: 0.445 ± 0.116
0.89TrpVal: 0.89 ± 0.158
0.195TrpTrp: 0.195 ± 0.077
0.556TrpTyr: 0.556 ± 0.147
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.337TyrAla: 2.337 ± 0.247
0.668TyrCys: 0.668 ± 0.126
2.726TyrAsp: 2.726 ± 0.29
2.838TyrGlu: 2.838 ± 0.298
1.475TyrPhe: 1.475 ± 0.202
2.532TyrGly: 2.532 ± 0.248
0.807TyrHis: 0.807 ± 0.131
2.476TyrIle: 2.476 ± 0.238
2.532TyrLys: 2.532 ± 0.275
3.589TyrLeu: 3.589 ± 0.33
1.308TyrMet: 1.308 ± 0.221
2.337TyrAsn: 2.337 ± 0.293
1.558TyrPro: 1.558 ± 0.207
1.947TyrGln: 1.947 ± 0.263
1.753TyrArg: 1.753 ± 0.201
2.003TyrSer: 2.003 ± 0.275
2.56TyrThr: 2.56 ± 0.272
1.947TyrVal: 1.947 ± 0.245
0.473TyrTrp: 0.473 ± 0.125
1.641TyrTyr: 1.641 ± 0.222
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 183 proteins (35945 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
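One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those values is reported as the error. The sketch below follows that reading. It is an assumption about the procedure, not the database's actual code; the function names, the fixed seed, and the use of the sample standard deviation as the error are choices made for this example.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of the frequency over resamples."""
    rng = random.Random(seed)  # fixed seed only to make the example repeatable
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome, not the 183 BBMuffin proteins.
toy_proteins = ["MKLAAK", "AAK", "GGAAGG", "MKKLV"]
mean, error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {mean:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins instead of treating every dipeptide as an independent observation.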

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski