Amino acid dipepetide frequency for Listeria phage LP-125

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.778AlaAla: 0.778 ± 0.327
0.301AlaCys: 0.301 ± 0.079
3.29AlaAsp: 3.29 ± 0.321
4.42AlaGlu: 4.42 ± 0.346
2.134AlaPhe: 2.134 ± 0.211
3.993AlaGly: 3.993 ± 0.476
0.979AlaHis: 0.979 ± 0.141
4.294AlaIle: 4.294 ± 0.37
4.771AlaLys: 4.771 ± 0.352
5.499AlaLeu: 5.499 ± 0.331
1.456AlaMet: 1.456 ± 0.257
3.591AlaAsn: 3.591 ± 0.351
1.783AlaPro: 1.783 ± 0.297
2.084AlaGln: 2.084 ± 0.255
2.561AlaArg: 2.561 ± 0.276
4.57AlaSer: 4.57 ± 0.404
3.817AlaThr: 3.817 ± 0.317
4.018AlaVal: 4.018 ± 0.29
0.377AlaTrp: 0.377 ± 0.094
2.863AlaTyr: 2.863 ± 0.329
0.0AlaXaa: 0.0 ± 0.0
Cys
0.176CysAla: 0.176 ± 0.063
0.126CysCys: 0.126 ± 0.061
0.326CysAsp: 0.326 ± 0.098
0.377CysGlu: 0.377 ± 0.089
0.276CysPhe: 0.276 ± 0.085
0.527CysGly: 0.527 ± 0.102
0.201CysHis: 0.201 ± 0.069
0.301CysIle: 0.301 ± 0.091
0.603CysLys: 0.603 ± 0.102
0.452CysLeu: 0.452 ± 0.1
0.126CysMet: 0.126 ± 0.057
0.427CysAsn: 0.427 ± 0.102
0.352CysPro: 0.352 ± 0.1
0.251CysGln: 0.251 ± 0.064
0.251CysArg: 0.251 ± 0.08
0.502CysSer: 0.502 ± 0.103
0.251CysThr: 0.251 ± 0.079
0.502CysVal: 0.502 ± 0.1
0.0CysTrp: 0.0 ± 0.0
0.452CysTyr: 0.452 ± 0.106
0.0CysXaa: 0.0 ± 0.0
Asp
2.285AspAla: 2.285 ± 0.28
0.502AspCys: 0.502 ± 0.12
2.687AspAsp: 2.687 ± 0.308
3.968AspGlu: 3.968 ± 0.339
3.013AspPhe: 3.013 ± 0.262
3.315AspGly: 3.315 ± 0.368
0.427AspHis: 0.427 ± 0.113
4.52AspIle: 4.52 ± 0.32
6.052AspLys: 6.052 ± 0.431
5.449AspLeu: 5.449 ± 0.313
1.607AspMet: 1.607 ± 0.19
4.294AspAsn: 4.294 ± 0.343
1.532AspPro: 1.532 ± 0.195
1.256AspGln: 1.256 ± 0.231
2.637AspArg: 2.637 ± 0.271
4.244AspSer: 4.244 ± 0.309
3.566AspThr: 3.566 ± 0.295
3.842AspVal: 3.842 ± 0.325
0.728AspTrp: 0.728 ± 0.134
3.591AspTyr: 3.591 ± 0.34
0.0AspXaa: 0.0 ± 0.0
Glu
5.725GluAla: 5.725 ± 0.36
0.452GluCys: 0.452 ± 0.115
5.248GluAsp: 5.248 ± 0.387
9.166GluGlu: 9.166 ± 0.801
3.114GluPhe: 3.114 ± 0.304
4.269GluGly: 4.269 ± 0.395
1.03GluHis: 1.03 ± 0.185
4.57GluIle: 4.57 ± 0.386
7.659GluLys: 7.659 ± 0.59
8.086GluLeu: 8.086 ± 0.534
1.908GluMet: 1.908 ± 0.223
4.445GluAsn: 4.445 ± 0.367
1.632GluPro: 1.632 ± 0.205
2.888GluGln: 2.888 ± 0.243
3.541GluArg: 3.541 ± 0.266
4.57GluSer: 4.57 ± 0.318
3.942GluThr: 3.942 ± 0.326
6.378GluVal: 6.378 ± 0.414
0.728GluTrp: 0.728 ± 0.134
2.963GluTyr: 2.963 ± 0.278
0.0GluXaa: 0.0 ± 0.0
Phe
1.783PheAla: 1.783 ± 0.214
0.276PheCys: 0.276 ± 0.09
2.059PheAsp: 2.059 ± 0.206
2.637PheGlu: 2.637 ± 0.268
1.657PhePhe: 1.657 ± 0.196
2.335PheGly: 2.335 ± 0.277
0.603PheHis: 0.603 ± 0.138
2.988PheIle: 2.988 ± 0.319
2.612PheLys: 2.612 ± 0.228
4.068PheLeu: 4.068 ± 0.346
0.979PheMet: 0.979 ± 0.148
2.185PheAsn: 2.185 ± 0.281
1.13PhePro: 1.13 ± 0.183
1.105PheGln: 1.105 ± 0.164
1.356PheArg: 1.356 ± 0.178
3.239PheSer: 3.239 ± 0.311
2.536PheThr: 2.536 ± 0.215
3.139PheVal: 3.139 ± 0.322
0.226PheTrp: 0.226 ± 0.075
2.335PheTyr: 2.335 ± 0.246
0.0PheXaa: 0.0 ± 0.0
Gly
3.817GlyAla: 3.817 ± 0.727
0.502GlyCys: 0.502 ± 0.123
3.942GlyAsp: 3.942 ± 0.393
3.968GlyGlu: 3.968 ± 0.276
2.009GlyPhe: 2.009 ± 0.231
4.646GlyGly: 4.646 ± 0.739
0.854GlyHis: 0.854 ± 0.165
4.219GlyIle: 4.219 ± 0.375
5.098GlyLys: 5.098 ± 0.469
4.143GlyLeu: 4.143 ± 0.35
1.733GlyMet: 1.733 ± 0.244
3.641GlyAsn: 3.641 ± 0.273
0.025GlyPro: 0.025 ± 0.025
1.482GlyGln: 1.482 ± 0.241
2.285GlyArg: 2.285 ± 0.248
4.42GlySer: 4.42 ± 0.467
3.867GlyThr: 3.867 ± 0.348
4.646GlyVal: 4.646 ± 0.348
0.628GlyTrp: 0.628 ± 0.12
2.762GlyTyr: 2.762 ± 0.244
0.0GlyXaa: 0.0 ± 0.0
His
0.804HisAla: 0.804 ± 0.15
0.201HisCys: 0.201 ± 0.071
0.854HisAsp: 0.854 ± 0.154
0.954HisGlu: 0.954 ± 0.153
0.477HisPhe: 0.477 ± 0.111
0.804HisGly: 0.804 ± 0.149
0.276HisHis: 0.276 ± 0.084
1.13HisIle: 1.13 ± 0.233
1.256HisLys: 1.256 ± 0.164
1.381HisLeu: 1.381 ± 0.183
0.276HisMet: 0.276 ± 0.086
0.628HisAsn: 0.628 ± 0.11
0.402HisPro: 0.402 ± 0.117
0.377HisGln: 0.377 ± 0.078
0.753HisArg: 0.753 ± 0.132
1.055HisSer: 1.055 ± 0.177
1.004HisThr: 1.004 ± 0.182
1.08HisVal: 1.08 ± 0.208
0.301HisTrp: 0.301 ± 0.115
0.854HisTyr: 0.854 ± 0.165
0.0HisXaa: 0.0 ± 0.0
Ile
4.62IleAla: 4.62 ± 0.398
0.427IleCys: 0.427 ± 0.108
4.143IleAsp: 4.143 ± 0.322
5.625IleGlu: 5.625 ± 0.444
1.883IlePhe: 1.883 ± 0.225
3.817IleGly: 3.817 ± 0.352
1.13IleHis: 1.13 ± 0.171
5.072IleIle: 5.072 ± 0.466
5.324IleLys: 5.324 ± 0.4
5.273IleLeu: 5.273 ± 0.359
1.833IleMet: 1.833 ± 0.186
3.541IleAsn: 3.541 ± 0.345
2.436IlePro: 2.436 ± 0.224
2.436IleGln: 2.436 ± 0.27
2.938IleArg: 2.938 ± 0.243
4.52IleSer: 4.52 ± 0.321
4.043IleThr: 4.043 ± 0.342
4.57IleVal: 4.57 ± 0.35
0.502IleTrp: 0.502 ± 0.136
2.26IleTyr: 2.26 ± 0.263
0.0IleXaa: 0.0 ± 0.0
Lys
4.846LysAla: 4.846 ± 0.408
0.352LysCys: 0.352 ± 0.134
5.625LysAsp: 5.625 ± 0.406
9.643LysGlu: 9.643 ± 0.64
3.013LysPhe: 3.013 ± 0.262
5.198LysGly: 5.198 ± 0.438
1.356LysHis: 1.356 ± 0.181
4.846LysIle: 4.846 ± 0.426
8.814LysLys: 8.814 ± 0.586
6.855LysLeu: 6.855 ± 0.418
1.984LysMet: 1.984 ± 0.223
5.148LysAsn: 5.148 ± 0.381
2.486LysPro: 2.486 ± 0.244
3.29LysGln: 3.29 ± 0.267
3.239LysArg: 3.239 ± 0.356
4.771LysSer: 4.771 ± 0.419
4.696LysThr: 4.696 ± 0.359
6.202LysVal: 6.202 ± 0.41
0.829LysTrp: 0.829 ± 0.136
3.315LysTyr: 3.315 ± 0.249
0.0LysXaa: 0.0 ± 0.0
Leu
6.328LeuAla: 6.328 ± 0.436
0.703LeuCys: 0.703 ± 0.132
5.976LeuAsp: 5.976 ± 0.343
7.458LeuGlu: 7.458 ± 0.476
3.44LeuPhe: 3.44 ± 0.31
5.349LeuGly: 5.349 ± 0.356
0.979LeuHis: 0.979 ± 0.181
4.394LeuIle: 4.394 ± 0.405
6.378LeuLys: 6.378 ± 0.407
6.906LeuLeu: 6.906 ± 0.486
1.908LeuMet: 1.908 ± 0.216
4.821LeuAsn: 4.821 ± 0.344
3.214LeuPro: 3.214 ± 0.362
2.913LeuGln: 2.913 ± 0.256
3.892LeuArg: 3.892 ± 0.374
5.625LeuSer: 5.625 ± 0.377
5.273LeuThr: 5.273 ± 0.364
6.152LeuVal: 6.152 ± 0.462
0.854LeuTrp: 0.854 ± 0.143
3.214LeuTyr: 3.214 ± 0.267
0.0LeuXaa: 0.0 ± 0.0
Met
1.632MetAla: 1.632 ± 0.207
0.151MetCys: 0.151 ± 0.076
0.979MetAsp: 0.979 ± 0.137
1.783MetGlu: 1.783 ± 0.216
1.13MetPhe: 1.13 ± 0.164
1.105MetGly: 1.105 ± 0.216
0.151MetHis: 0.151 ± 0.061
1.356MetIle: 1.356 ± 0.181
2.285MetLys: 2.285 ± 0.257
2.084MetLeu: 2.084 ± 0.264
0.402MetMet: 0.402 ± 0.084
1.381MetAsn: 1.381 ± 0.19
1.03MetPro: 1.03 ± 0.148
0.904MetGln: 0.904 ± 0.141
1.281MetArg: 1.281 ± 0.147
2.059MetSer: 2.059 ± 0.218
2.009MetThr: 2.009 ± 0.216
1.431MetVal: 1.431 ± 0.21
0.251MetTrp: 0.251 ± 0.082
1.03MetTyr: 1.03 ± 0.164
0.0MetXaa: 0.0 ± 0.0
Asn
2.863AsnAla: 2.863 ± 0.278
0.226AsnCys: 0.226 ± 0.08
2.486AsnAsp: 2.486 ± 0.216
3.867AsnGlu: 3.867 ± 0.32
2.285AsnPhe: 2.285 ± 0.24
3.239AsnGly: 3.239 ± 0.315
1.004AsnHis: 1.004 ± 0.16
4.244AsnIle: 4.244 ± 0.393
5.7AsnLys: 5.7 ± 0.335
4.771AsnLeu: 4.771 ± 0.368
1.783AsnMet: 1.783 ± 0.184
3.616AsnAsn: 3.616 ± 0.265
2.084AsnPro: 2.084 ± 0.248
1.808AsnGln: 1.808 ± 0.221
2.335AsnArg: 2.335 ± 0.251
3.666AsnSer: 3.666 ± 0.374
4.043AsnThr: 4.043 ± 0.306
3.34AsnVal: 3.34 ± 0.293
0.753AsnTrp: 0.753 ± 0.123
2.612AsnTyr: 2.612 ± 0.25
0.0AsnXaa: 0.0 ± 0.0
Pro
1.682ProAla: 1.682 ± 0.244
0.1ProCys: 0.1 ± 0.055
1.883ProAsp: 1.883 ± 0.219
2.461ProGlu: 2.461 ± 0.221
1.23ProPhe: 1.23 ± 0.158
0.427ProGly: 0.427 ± 0.116
0.377ProHis: 0.377 ± 0.097
1.733ProIle: 1.733 ± 0.212
2.712ProLys: 2.712 ± 0.281
2.461ProLeu: 2.461 ± 0.232
0.904ProMet: 0.904 ± 0.165
1.532ProAsn: 1.532 ± 0.189
0.728ProPro: 0.728 ± 0.151
1.23ProGln: 1.23 ± 0.208
0.904ProArg: 0.904 ± 0.161
2.31ProSer: 2.31 ± 0.275
2.21ProThr: 2.21 ± 0.254
1.908ProVal: 1.908 ± 0.269
0.201ProTrp: 0.201 ± 0.074
1.507ProTyr: 1.507 ± 0.246
0.0ProXaa: 0.0 ± 0.0
Gln
2.812GlnAla: 2.812 ± 0.357
0.075GlnCys: 0.075 ± 0.039
1.808GlnAsp: 1.808 ± 0.215
2.812GlnGlu: 2.812 ± 0.314
1.205GlnPhe: 1.205 ± 0.163
2.285GlnGly: 2.285 ± 0.204
0.427GlnHis: 0.427 ± 0.091
1.783GlnIle: 1.783 ± 0.227
2.637GlnLys: 2.637 ± 0.271
3.114GlnLeu: 3.114 ± 0.246
0.854GlnMet: 0.854 ± 0.174
1.281GlnAsn: 1.281 ± 0.185
0.854GlnPro: 0.854 ± 0.238
2.185GlnGln: 2.185 ± 0.78
1.356GlnArg: 1.356 ± 0.205
2.109GlnSer: 2.109 ± 0.3
1.934GlnThr: 1.934 ± 0.225
2.737GlnVal: 2.737 ± 0.257
0.126GlnTrp: 0.126 ± 0.058
1.23GlnTyr: 1.23 ± 0.183
0.0GlnXaa: 0.0 ± 0.0
Arg
2.461ArgAla: 2.461 ± 0.276
0.126ArgCys: 0.126 ± 0.06
2.662ArgAsp: 2.662 ± 0.249
3.666ArgGlu: 3.666 ± 0.357
1.632ArgPhe: 1.632 ± 0.208
2.461ArgGly: 2.461 ± 0.294
0.678ArgHis: 0.678 ± 0.119
3.013ArgIle: 3.013 ± 0.244
3.742ArgLys: 3.742 ± 0.303
3.691ArgLeu: 3.691 ± 0.281
1.03ArgMet: 1.03 ± 0.145
2.662ArgAsn: 2.662 ± 0.256
0.653ArgPro: 0.653 ± 0.122
1.482ArgGln: 1.482 ± 0.219
1.306ArgArg: 1.306 ± 0.197
1.883ArgSer: 1.883 ± 0.243
2.411ArgThr: 2.411 ± 0.282
3.29ArgVal: 3.29 ± 0.284
0.326ArgTrp: 0.326 ± 0.093
1.858ArgTyr: 1.858 ± 0.225
0.0ArgXaa: 0.0 ± 0.0
Ser
3.616SerAla: 3.616 ± 0.332
0.377SerCys: 0.377 ± 0.104
3.742SerAsp: 3.742 ± 0.333
4.369SerGlu: 4.369 ± 0.348
2.988SerPhe: 2.988 ± 0.278
3.49SerGly: 3.49 ± 0.37
1.306SerHis: 1.306 ± 0.165
5.223SerIle: 5.223 ± 0.376
5.926SerLys: 5.926 ± 0.442
6.378SerLeu: 6.378 ± 0.399
1.657SerMet: 1.657 ± 0.179
3.44SerAsn: 3.44 ± 0.299
1.959SerPro: 1.959 ± 0.241
2.109SerGln: 2.109 ± 0.225
3.239SerArg: 3.239 ± 0.333
4.922SerSer: 4.922 ± 0.427
4.294SerThr: 4.294 ± 0.346
4.972SerVal: 4.972 ± 0.381
0.854SerTrp: 0.854 ± 0.148
3.214SerTyr: 3.214 ± 0.309
0.0SerXaa: 0.0 ± 0.0
Thr
4.168ThrAla: 4.168 ± 0.414
0.301ThrCys: 0.301 ± 0.079
3.716ThrAsp: 3.716 ± 0.274
4.796ThrGlu: 4.796 ± 0.336
3.064ThrPhe: 3.064 ± 0.273
4.369ThrGly: 4.369 ± 0.354
1.155ThrHis: 1.155 ± 0.166
4.194ThrIle: 4.194 ± 0.39
4.721ThrLys: 4.721 ± 0.365
5.273ThrLeu: 5.273 ± 0.348
1.205ThrMet: 1.205 ± 0.161
2.963ThrAsn: 2.963 ± 0.262
2.436ThrPro: 2.436 ± 0.255
1.456ThrGln: 1.456 ± 0.171
2.009ThrArg: 2.009 ± 0.234
4.294ThrSer: 4.294 ± 0.335
3.993ThrThr: 3.993 ± 0.294
5.349ThrVal: 5.349 ± 0.429
0.703ThrTrp: 0.703 ± 0.146
2.938ThrTyr: 2.938 ± 0.277
0.0ThrXaa: 0.0 ± 0.0
Val
4.545ValAla: 4.545 ± 0.348
0.552ValCys: 0.552 ± 0.122
4.696ValAsp: 4.696 ± 0.342
5.926ValGlu: 5.926 ± 0.423
2.888ValPhe: 2.888 ± 0.311
3.917ValGly: 3.917 ± 0.316
0.929ValHis: 0.929 ± 0.16
4.947ValIle: 4.947 ± 0.461
5.826ValLys: 5.826 ± 0.428
5.273ValLeu: 5.273 ± 0.392
1.507ValMet: 1.507 ± 0.23
3.917ValAsn: 3.917 ± 0.313
2.335ValPro: 2.335 ± 0.252
2.386ValGln: 2.386 ± 0.28
2.662ValArg: 2.662 ± 0.241
5.75ValSer: 5.75 ± 0.39
5.123ValThr: 5.123 ± 0.464
5.173ValVal: 5.173 ± 0.395
0.778ValTrp: 0.778 ± 0.149
3.239ValTyr: 3.239 ± 0.346
0.0ValXaa: 0.0 ± 0.0
Trp
0.377TrpAla: 0.377 ± 0.092
0.176TrpCys: 0.176 ± 0.059
0.778TrpAsp: 0.778 ± 0.131
0.829TrpGlu: 0.829 ± 0.152
0.326TrpPhe: 0.326 ± 0.078
0.753TrpGly: 0.753 ± 0.154
0.251TrpHis: 0.251 ± 0.075
0.452TrpIle: 0.452 ± 0.115
0.728TrpLys: 0.728 ± 0.124
0.904TrpLeu: 0.904 ± 0.159
0.1TrpMet: 0.1 ± 0.053
0.502TrpAsn: 0.502 ± 0.111
0.0TrpPro: 0.0 ± 0.0
0.301TrpGln: 0.301 ± 0.085
0.226TrpArg: 0.226 ± 0.078
0.502TrpSer: 0.502 ± 0.104
0.703TrpThr: 0.703 ± 0.154
0.829TrpVal: 0.829 ± 0.146
0.276TrpTrp: 0.276 ± 0.084
0.753TrpTyr: 0.753 ± 0.149
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.26TyrAla: 2.26 ± 0.238
0.552TyrCys: 0.552 ± 0.114
2.612TyrAsp: 2.612 ± 0.235
3.591TyrGlu: 3.591 ± 0.306
1.507TyrPhe: 1.507 ± 0.206
2.386TyrGly: 2.386 ± 0.274
0.753TyrHis: 0.753 ± 0.151
3.114TyrIle: 3.114 ± 0.288
3.666TyrLys: 3.666 ± 0.264
3.716TyrLeu: 3.716 ± 0.381
1.155TyrMet: 1.155 ± 0.17
2.712TyrAsn: 2.712 ± 0.256
1.431TyrPro: 1.431 ± 0.186
1.682TyrGln: 1.682 ± 0.283
2.26TyrArg: 2.26 ± 0.232
3.064TyrSer: 3.064 ± 0.332
3.39TyrThr: 3.39 ± 0.344
2.812TyrVal: 2.812 ± 0.276
0.326TyrTrp: 0.326 ± 0.087
2.009TyrTyr: 2.009 ± 0.224
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 189 proteins (39824 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski