Amino acid dipepetide frequency for Listeria phage LMTA-57

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.809AlaAla: 0.809 ± 0.269
0.253AlaCys: 0.253 ± 0.085
3.717AlaAsp: 3.717 ± 0.373
4.248AlaGlu: 4.248 ± 0.332
2.124AlaPhe: 2.124 ± 0.2
3.894AlaGly: 3.894 ± 0.384
0.91AlaHis: 0.91 ± 0.132
3.919AlaIle: 3.919 ± 0.31
4.779AlaLys: 4.779 ± 0.338
5.31AlaLeu: 5.31 ± 0.345
1.644AlaMet: 1.644 ± 0.247
3.919AlaAsn: 3.919 ± 0.398
1.644AlaPro: 1.644 ± 0.277
2.048AlaGln: 2.048 ± 0.256
2.63AlaArg: 2.63 ± 0.27
4.703AlaSer: 4.703 ± 0.386
3.717AlaThr: 3.717 ± 0.352
4.096AlaVal: 4.096 ± 0.281
0.405AlaTrp: 0.405 ± 0.096
2.908AlaTyr: 2.908 ± 0.296
0.0AlaXaa: 0.0 ± 0.0
Cys
0.152CysAla: 0.152 ± 0.059
0.126CysCys: 0.126 ± 0.055
0.303CysAsp: 0.303 ± 0.087
0.278CysGlu: 0.278 ± 0.078
0.329CysPhe: 0.329 ± 0.081
0.48CysGly: 0.48 ± 0.103
0.177CysHis: 0.177 ± 0.061
0.354CysIle: 0.354 ± 0.087
0.683CysLys: 0.683 ± 0.137
0.531CysLeu: 0.531 ± 0.112
0.101CysMet: 0.101 ± 0.054
0.405CysAsn: 0.405 ± 0.103
0.329CysPro: 0.329 ± 0.102
0.253CysGln: 0.253 ± 0.075
0.177CysArg: 0.177 ± 0.064
0.531CysSer: 0.531 ± 0.103
0.303CysThr: 0.303 ± 0.084
0.48CysVal: 0.48 ± 0.115
0.051CysTrp: 0.051 ± 0.036
0.329CysTyr: 0.329 ± 0.106
0.0CysXaa: 0.0 ± 0.0
Asp
2.301AspAla: 2.301 ± 0.291
0.506AspCys: 0.506 ± 0.12
2.782AspAsp: 2.782 ± 0.289
4.071AspGlu: 4.071 ± 0.356
3.136AspPhe: 3.136 ± 0.248
3.085AspGly: 3.085 ± 0.319
0.379AspHis: 0.379 ± 0.103
4.349AspIle: 4.349 ± 0.335
5.841AspLys: 5.841 ± 0.39
6.094AspLeu: 6.094 ± 0.379
1.517AspMet: 1.517 ± 0.175
4.375AspAsn: 4.375 ± 0.318
1.492AspPro: 1.492 ± 0.193
1.391AspGln: 1.391 ± 0.259
2.63AspArg: 2.63 ± 0.276
4.273AspSer: 4.273 ± 0.366
3.565AspThr: 3.565 ± 0.268
3.768AspVal: 3.768 ± 0.359
0.733AspTrp: 0.733 ± 0.135
3.616AspTyr: 3.616 ± 0.286
0.0AspXaa: 0.0 ± 0.0
Glu
5.386GluAla: 5.386 ± 0.412
0.48GluCys: 0.48 ± 0.128
5.209GluAsp: 5.209 ± 0.37
9.407GluGlu: 9.407 ± 0.962
3.009GluPhe: 3.009 ± 0.288
4.526GluGly: 4.526 ± 0.385
1.062GluHis: 1.062 ± 0.173
4.906GluIle: 4.906 ± 0.345
7.889GluLys: 7.889 ± 0.589
7.965GluLeu: 7.965 ± 0.629
1.998GluMet: 1.998 ± 0.228
4.779GluAsn: 4.779 ± 0.308
1.77GluPro: 1.77 ± 0.249
2.782GluGln: 2.782 ± 0.287
3.338GluArg: 3.338 ± 0.253
4.552GluSer: 4.552 ± 0.305
4.147GluThr: 4.147 ± 0.335
6.397GluVal: 6.397 ± 0.48
0.632GluTrp: 0.632 ± 0.131
2.908GluTyr: 2.908 ± 0.254
0.0GluXaa: 0.0 ± 0.0
Phe
1.821PheAla: 1.821 ± 0.211
0.253PheCys: 0.253 ± 0.088
2.124PheAsp: 2.124 ± 0.217
2.604PheGlu: 2.604 ± 0.27
1.669PhePhe: 1.669 ± 0.188
2.276PheGly: 2.276 ± 0.253
0.657PheHis: 0.657 ± 0.123
2.832PheIle: 2.832 ± 0.259
2.857PheLys: 2.857 ± 0.222
3.919PheLeu: 3.919 ± 0.342
0.936PheMet: 0.936 ± 0.152
2.124PheAsn: 2.124 ± 0.273
1.062PhePro: 1.062 ± 0.162
1.214PheGln: 1.214 ± 0.208
1.239PheArg: 1.239 ± 0.156
3.136PheSer: 3.136 ± 0.32
2.453PheThr: 2.453 ± 0.221
3.211PheVal: 3.211 ± 0.297
0.202PheTrp: 0.202 ± 0.066
2.326PheTyr: 2.326 ± 0.215
0.0PheXaa: 0.0 ± 0.0
Gly
4.071GlyAla: 4.071 ± 0.592
0.531GlyCys: 0.531 ± 0.123
3.667GlyAsp: 3.667 ± 0.392
3.97GlyGlu: 3.97 ± 0.282
2.124GlyPhe: 2.124 ± 0.212
4.779GlyGly: 4.779 ± 0.657
0.834GlyHis: 0.834 ± 0.133
4.172GlyIle: 4.172 ± 0.396
5.26GlyLys: 5.26 ± 0.382
3.919GlyLeu: 3.919 ± 0.319
1.593GlyMet: 1.593 ± 0.213
3.464GlyAsn: 3.464 ± 0.282
0.025GlyPro: 0.025 ± 0.021
1.517GlyGln: 1.517 ± 0.269
2.175GlyArg: 2.175 ± 0.241
4.476GlySer: 4.476 ± 0.445
4.122GlyThr: 4.122 ± 0.375
4.703GlyVal: 4.703 ± 0.362
0.556GlyTrp: 0.556 ± 0.134
2.933GlyTyr: 2.933 ± 0.242
0.0GlyXaa: 0.0 ± 0.0
His
0.809HisAla: 0.809 ± 0.121
0.152HisCys: 0.152 ± 0.056
0.86HisAsp: 0.86 ± 0.158
0.91HisGlu: 0.91 ± 0.164
0.582HisPhe: 0.582 ± 0.112
0.784HisGly: 0.784 ± 0.135
0.303HisHis: 0.303 ± 0.083
1.214HisIle: 1.214 ± 0.222
1.264HisLys: 1.264 ± 0.205
1.315HisLeu: 1.315 ± 0.175
0.278HisMet: 0.278 ± 0.094
0.759HisAsn: 0.759 ± 0.134
0.379HisPro: 0.379 ± 0.106
0.405HisGln: 0.405 ± 0.089
0.708HisArg: 0.708 ± 0.14
0.986HisSer: 0.986 ± 0.159
1.037HisThr: 1.037 ± 0.15
0.885HisVal: 0.885 ± 0.161
0.303HisTrp: 0.303 ± 0.108
0.809HisTyr: 0.809 ± 0.149
0.0HisXaa: 0.0 ± 0.0
Ile
4.349IleAla: 4.349 ± 0.386
0.405IleCys: 0.405 ± 0.106
4.147IleAsp: 4.147 ± 0.367
5.538IleGlu: 5.538 ± 0.385
1.871IlePhe: 1.871 ± 0.236
3.818IleGly: 3.818 ± 0.329
1.037IleHis: 1.037 ± 0.183
4.577IleIle: 4.577 ± 0.346
5.462IleLys: 5.462 ± 0.412
5.007IleLeu: 5.007 ± 0.353
1.745IleMet: 1.745 ± 0.211
3.515IleAsn: 3.515 ± 0.3
2.529IlePro: 2.529 ± 0.247
2.402IleGln: 2.402 ± 0.261
2.959IleArg: 2.959 ± 0.247
4.248IleSer: 4.248 ± 0.315
3.768IleThr: 3.768 ± 0.306
4.653IleVal: 4.653 ± 0.301
0.531IleTrp: 0.531 ± 0.131
2.099IleTyr: 2.099 ± 0.212
0.0IleXaa: 0.0 ± 0.0
Lys
4.88LysAla: 4.88 ± 0.33
0.354LysCys: 0.354 ± 0.134
5.512LysAsp: 5.512 ± 0.461
9.786LysGlu: 9.786 ± 0.75
2.984LysPhe: 2.984 ± 0.297
5.234LysGly: 5.234 ± 0.414
1.593LysHis: 1.593 ± 0.239
4.627LysIle: 4.627 ± 0.328
9.128LysLys: 9.128 ± 0.614
7.004LysLeu: 7.004 ± 0.48
2.048LysMet: 2.048 ± 0.22
4.906LysAsn: 4.906 ± 0.341
2.579LysPro: 2.579 ± 0.297
3.237LysGln: 3.237 ± 0.28
3.237LysArg: 3.237 ± 0.307
4.931LysSer: 4.931 ± 0.357
4.501LysThr: 4.501 ± 0.355
6.372LysVal: 6.372 ± 0.434
0.961LysTrp: 0.961 ± 0.16
3.439LysTyr: 3.439 ± 0.285
0.0LysXaa: 0.0 ± 0.0
Leu
6.246LeuAla: 6.246 ± 0.361
0.632LeuCys: 0.632 ± 0.118
5.942LeuAsp: 5.942 ± 0.446
7.611LeuGlu: 7.611 ± 0.491
3.161LeuPhe: 3.161 ± 0.27
5.26LeuGly: 5.26 ± 0.333
0.936LeuHis: 0.936 ± 0.174
4.021LeuIle: 4.021 ± 0.359
6.347LeuLys: 6.347 ± 0.33
6.65LeuLeu: 6.65 ± 0.433
1.719LeuMet: 1.719 ± 0.181
4.804LeuAsn: 4.804 ± 0.368
3.161LeuPro: 3.161 ± 0.336
2.807LeuGln: 2.807 ± 0.237
3.869LeuArg: 3.869 ± 0.319
6.069LeuSer: 6.069 ± 0.47
5.184LeuThr: 5.184 ± 0.423
6.069LeuVal: 6.069 ± 0.421
0.809LeuTrp: 0.809 ± 0.123
3.161LeuTyr: 3.161 ± 0.273
0.0LeuXaa: 0.0 ± 0.0
Met
1.644MetAla: 1.644 ± 0.175
0.177MetCys: 0.177 ± 0.09
0.986MetAsp: 0.986 ± 0.131
1.896MetGlu: 1.896 ± 0.228
1.113MetPhe: 1.113 ± 0.156
1.087MetGly: 1.087 ± 0.159
0.152MetHis: 0.152 ± 0.067
1.365MetIle: 1.365 ± 0.175
2.427MetLys: 2.427 ± 0.25
1.947MetLeu: 1.947 ± 0.231
0.303MetMet: 0.303 ± 0.082
1.467MetAsn: 1.467 ± 0.149
1.037MetPro: 1.037 ± 0.144
0.86MetGln: 0.86 ± 0.133
1.315MetArg: 1.315 ± 0.154
1.998MetSer: 1.998 ± 0.232
1.922MetThr: 1.922 ± 0.233
1.416MetVal: 1.416 ± 0.175
0.278MetTrp: 0.278 ± 0.085
1.113MetTyr: 1.113 ± 0.172
0.0MetXaa: 0.0 ± 0.0
Asn
3.034AsnAla: 3.034 ± 0.35
0.202AsnCys: 0.202 ± 0.068
2.503AsnAsp: 2.503 ± 0.216
3.717AsnGlu: 3.717 ± 0.341
2.352AsnPhe: 2.352 ± 0.222
3.338AsnGly: 3.338 ± 0.296
1.037AsnHis: 1.037 ± 0.166
4.223AsnIle: 4.223 ± 0.395
5.614AsnLys: 5.614 ± 0.348
5.26AsnLeu: 5.26 ± 0.391
1.846AsnMet: 1.846 ± 0.165
3.515AsnAsn: 3.515 ± 0.329
2.023AsnPro: 2.023 ± 0.246
1.896AsnGln: 1.896 ± 0.218
2.529AsnArg: 2.529 ± 0.226
3.641AsnSer: 3.641 ± 0.332
4.122AsnThr: 4.122 ± 0.369
3.464AsnVal: 3.464 ± 0.279
0.834AsnTrp: 0.834 ± 0.139
2.63AsnTyr: 2.63 ± 0.263
0.0AsnXaa: 0.0 ± 0.0
Pro
1.694ProAla: 1.694 ± 0.258
0.126ProCys: 0.126 ± 0.057
1.821ProAsp: 1.821 ± 0.219
2.579ProGlu: 2.579 ± 0.272
1.188ProPhe: 1.188 ± 0.16
0.43ProGly: 0.43 ± 0.117
0.379ProHis: 0.379 ± 0.097
1.77ProIle: 1.77 ± 0.224
2.655ProLys: 2.655 ± 0.262
2.301ProLeu: 2.301 ± 0.233
0.885ProMet: 0.885 ± 0.146
1.669ProAsn: 1.669 ± 0.175
0.683ProPro: 0.683 ± 0.181
1.214ProGln: 1.214 ± 0.205
1.011ProArg: 1.011 ± 0.139
2.225ProSer: 2.225 ± 0.266
2.175ProThr: 2.175 ± 0.264
1.871ProVal: 1.871 ± 0.309
0.202ProTrp: 0.202 ± 0.068
1.467ProTyr: 1.467 ± 0.193
0.0ProXaa: 0.0 ± 0.0
Gln
2.908GlnAla: 2.908 ± 0.435
0.126GlnCys: 0.126 ± 0.058
1.846GlnAsp: 1.846 ± 0.191
2.908GlnGlu: 2.908 ± 0.284
1.239GlnPhe: 1.239 ± 0.184
2.402GlnGly: 2.402 ± 0.217
0.354GlnHis: 0.354 ± 0.092
1.821GlnIle: 1.821 ± 0.24
2.63GlnLys: 2.63 ± 0.355
3.034GlnLeu: 3.034 ± 0.246
0.936GlnMet: 0.936 ± 0.173
1.365GlnAsn: 1.365 ± 0.182
0.91GlnPro: 0.91 ± 0.273
2.225GlnGln: 2.225 ± 0.818
1.365GlnArg: 1.365 ± 0.21
2.023GlnSer: 2.023 ± 0.325
1.922GlnThr: 1.922 ± 0.229
2.782GlnVal: 2.782 ± 0.279
0.126GlnTrp: 0.126 ± 0.052
1.138GlnTyr: 1.138 ± 0.181
0.0GlnXaa: 0.0 ± 0.0
Arg
2.554ArgAla: 2.554 ± 0.303
0.152ArgCys: 0.152 ± 0.061
2.706ArgAsp: 2.706 ± 0.269
3.591ArgGlu: 3.591 ± 0.371
1.644ArgPhe: 1.644 ± 0.19
2.478ArgGly: 2.478 ± 0.267
0.556ArgHis: 0.556 ± 0.087
2.832ArgIle: 2.832 ± 0.25
3.667ArgLys: 3.667 ± 0.298
3.414ArgLeu: 3.414 ± 0.278
1.087ArgMet: 1.087 ± 0.165
2.731ArgAsn: 2.731 ± 0.238
0.657ArgPro: 0.657 ± 0.129
1.492ArgGln: 1.492 ± 0.205
1.239ArgArg: 1.239 ± 0.163
1.972ArgSer: 1.972 ± 0.231
2.377ArgThr: 2.377 ± 0.24
3.287ArgVal: 3.287 ± 0.273
0.405ArgTrp: 0.405 ± 0.111
1.922ArgTyr: 1.922 ± 0.224
0.0ArgXaa: 0.0 ± 0.0
Ser
3.54SerAla: 3.54 ± 0.282
0.455SerCys: 0.455 ± 0.111
3.793SerAsp: 3.793 ± 0.343
4.45SerGlu: 4.45 ± 0.358
2.933SerPhe: 2.933 ± 0.261
3.616SerGly: 3.616 ± 0.369
1.34SerHis: 1.34 ± 0.202
5.083SerIle: 5.083 ± 0.318
6.069SerLys: 6.069 ± 0.437
6.347SerLeu: 6.347 ± 0.421
1.821SerMet: 1.821 ± 0.198
3.667SerAsn: 3.667 ± 0.328
1.972SerPro: 1.972 ± 0.217
2.124SerGln: 2.124 ± 0.204
3.034SerArg: 3.034 ± 0.287
5.26SerSer: 5.26 ± 0.441
4.375SerThr: 4.375 ± 0.372
4.754SerVal: 4.754 ± 0.366
0.708SerTrp: 0.708 ± 0.134
3.338SerTyr: 3.338 ± 0.322
0.0SerXaa: 0.0 ± 0.0
Thr
4.071ThrAla: 4.071 ± 0.361
0.278ThrCys: 0.278 ± 0.074
3.793ThrAsp: 3.793 ± 0.291
5.032ThrGlu: 5.032 ± 0.395
2.883ThrPhe: 2.883 ± 0.269
4.349ThrGly: 4.349 ± 0.377
1.138ThrHis: 1.138 ± 0.15
4.122ThrIle: 4.122 ± 0.335
4.83ThrLys: 4.83 ± 0.381
4.981ThrLeu: 4.981 ± 0.368
1.315ThrMet: 1.315 ± 0.197
3.287ThrAsn: 3.287 ± 0.303
2.453ThrPro: 2.453 ± 0.227
1.492ThrGln: 1.492 ± 0.2
2.073ThrArg: 2.073 ± 0.253
4.299ThrSer: 4.299 ± 0.285
4.122ThrThr: 4.122 ± 0.321
5.108ThrVal: 5.108 ± 0.392
0.784ThrTrp: 0.784 ± 0.108
2.706ThrTyr: 2.706 ± 0.254
0.0ThrXaa: 0.0 ± 0.0
Val
4.476ValAla: 4.476 ± 0.307
0.48ValCys: 0.48 ± 0.118
4.88ValAsp: 4.88 ± 0.336
6.17ValGlu: 6.17 ± 0.434
2.706ValPhe: 2.706 ± 0.273
3.869ValGly: 3.869 ± 0.347
0.91ValHis: 0.91 ± 0.145
4.83ValIle: 4.83 ± 0.418
5.74ValLys: 5.74 ± 0.38
5.158ValLeu: 5.158 ± 0.396
1.365ValMet: 1.365 ± 0.215
4.046ValAsn: 4.046 ± 0.315
2.276ValPro: 2.276 ± 0.216
2.579ValGln: 2.579 ± 0.27
2.655ValArg: 2.655 ± 0.243
5.563ValSer: 5.563 ± 0.354
5.209ValThr: 5.209 ± 0.408
5.032ValVal: 5.032 ± 0.375
0.683ValTrp: 0.683 ± 0.121
3.287ValTyr: 3.287 ± 0.328
0.0ValXaa: 0.0 ± 0.0
Trp
0.43TrpAla: 0.43 ± 0.1
0.152TrpCys: 0.152 ± 0.057
0.759TrpAsp: 0.759 ± 0.139
0.834TrpGlu: 0.834 ± 0.145
0.329TrpPhe: 0.329 ± 0.091
0.683TrpGly: 0.683 ± 0.151
0.278TrpHis: 0.278 ± 0.078
0.43TrpIle: 0.43 ± 0.085
0.759TrpLys: 0.759 ± 0.119
0.885TrpLeu: 0.885 ± 0.132
0.076TrpMet: 0.076 ± 0.044
0.48TrpAsn: 0.48 ± 0.117
0.0TrpPro: 0.0 ± 0.0
0.354TrpGln: 0.354 ± 0.102
0.253TrpArg: 0.253 ± 0.068
0.506TrpSer: 0.506 ± 0.1
0.759TrpThr: 0.759 ± 0.164
0.784TrpVal: 0.784 ± 0.161
0.253TrpTrp: 0.253 ± 0.083
0.759TrpTyr: 0.759 ± 0.131
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.503TyrAla: 2.503 ± 0.255
0.531TyrCys: 0.531 ± 0.121
2.554TyrAsp: 2.554 ± 0.254
3.54TyrGlu: 3.54 ± 0.295
1.542TyrPhe: 1.542 ± 0.207
2.503TyrGly: 2.503 ± 0.299
0.708TyrHis: 0.708 ± 0.128
3.211TyrIle: 3.211 ± 0.288
3.414TyrLys: 3.414 ± 0.28
3.439TyrLeu: 3.439 ± 0.317
1.264TyrMet: 1.264 ± 0.184
2.503TyrAsn: 2.503 ± 0.224
1.365TyrPro: 1.365 ± 0.183
1.77TyrGln: 1.77 ± 0.278
2.326TyrArg: 2.326 ± 0.212
3.287TyrSer: 3.287 ± 0.315
3.338TyrThr: 3.338 ± 0.314
2.782TyrVal: 2.782 ± 0.262
0.303TyrTrp: 0.303 ± 0.073
2.124TyrTyr: 2.124 ± 0.221
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 196 proteins (39548 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski