Amino acid dipeptide frequency for Lactococcus phage AM3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
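As a minimal sketch of how such frequencies could be computed: the function below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name, the toy sequences and the use of one-letter codes are assumptions made for illustration (the table uses three-letter codes), and counting pairs only within proteins, never across protein boundaries, is likewise an assumption rather than a documented detail of the database.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of adjacent residue pairs.

    Pairs are counted inside each sequence only, so the last residue of
    one protein is never paired with the first residue of the next.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields overlapping pairs: (s0, s1), (s1, s2), ...
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input in one-letter code, not taken from the proteome.
toy_proteins = ["MKKLE", "MKE"]
freqs = dipeptide_frequencies(toy_proteins)
```

Here the toy input contains six pairs in total, two of which are MK, so `freqs["MK"]` is one third of 1000.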

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
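The arithmetic behind the 0.25% baseline, and the over/under-representation call it implies, can be sketched as follows. The function name and labels are hypothetical, and treating the uniform 1/400 value as the threshold for the red/blue marking is an assumption based on the paragraph above; the three example values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids, giving 20 * 20 = 400 dipeptides

# Uniform expectation: 1/400 = 0.25 % = 2.5 per mille.
uniform_permille = 1000.0 / (N_STANDARD ** 2)


def classify(value_permille, baseline=uniform_permille):
    """Label a dipeptide frequency relative to the assumed uniform baseline."""
    if value_permille > baseline:
        return "over"
    if value_permille < baseline:
        return "under"
    return "as expected"


# Values copied from the table (in per mille).
examples = {"LysGlu": 10.136, "AlaAla": 0.028, "GlyPro": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this assumption LysGlu falls above the baseline, while AlaAla and GlyPro fall below it.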

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.028 ± 0.027
AlaCys: 0.278 ± 0.087
AlaAsp: 2.618 ± 0.259
AlaGlu: 3.147 ± 0.292
AlaPhe: 1.726 ± 0.198
AlaGly: 3.648 ± 0.723
AlaHis: 0.863 ± 0.141
AlaIle: 3.815 ± 0.436
AlaLys: 5.263 ± 0.587
AlaLeu: 4.483 ± 0.597
AlaMet: 1.476 ± 0.251
AlaAsn: 2.84 ± 0.269
AlaPro: 1.03 ± 0.204
AlaGln: 2.534 ± 0.409
AlaArg: 1.726 ± 0.228
AlaSer: 3.23 ± 0.316
AlaThr: 3.23 ± 0.361
AlaVal: 3.286 ± 0.386
AlaTrp: 0.473 ± 0.116
AlaTyr: 2.645 ± 0.281
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.223 ± 0.087
CysCys: 0.111 ± 0.06
CysAsp: 0.446 ± 0.143
CysGlu: 0.585 ± 0.159
CysPhe: 0.278 ± 0.09
CysGly: 0.501 ± 0.183
CysHis: 0.084 ± 0.059
CysIle: 0.557 ± 0.131
CysLys: 0.585 ± 0.145
CysLeu: 0.501 ± 0.119
CysMet: 0.056 ± 0.035
CysAsn: 0.362 ± 0.105
CysPro: 0.306 ± 0.169
CysGln: 0.306 ± 0.09
CysArg: 0.418 ± 0.095
CysSer: 0.752 ± 0.179
CysThr: 0.139 ± 0.06
CysVal: 0.362 ± 0.1
CysTrp: 0.111 ± 0.046
CysTyr: 0.306 ± 0.097
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.478 ± 0.318
AspCys: 0.418 ± 0.123
AspAsp: 4.567 ± 0.423
AspGlu: 6.154 ± 0.434
AspPhe: 4.121 ± 0.338
AspGly: 4.678 ± 0.386
AspHis: 0.501 ± 0.151
AspIle: 6.154 ± 0.467
AspLys: 7.073 ± 0.515
AspLeu: 5.653 ± 0.501
AspMet: 1.81 ± 0.198
AspAsn: 4.762 ± 0.352
AspPro: 0.975 ± 0.167
AspGln: 1.002 ± 0.237
AspArg: 2.116 ± 0.249
AspSer: 4.4 ± 0.415
AspThr: 3.731 ± 0.364
AspVal: 4.455 ± 0.334
AspTrp: 0.724 ± 0.16
AspTyr: 3.091 ± 0.288
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.982 ± 0.457
GluCys: 0.557 ± 0.152
GluAsp: 5.959 ± 0.574
GluGlu: 6.934 ± 0.552
GluPhe: 3.592 ± 0.372
GluGly: 3.147 ± 0.317
GluHis: 1.309 ± 0.199
GluIle: 6.795 ± 0.384
GluLys: 7.407 ± 0.597
GluLeu: 7.741 ± 0.494
GluMet: 2.2 ± 0.256
GluAsn: 5.319 ± 0.373
GluPro: 1.086 ± 0.192
GluGln: 2.868 ± 0.272
GluArg: 2.339 ± 0.253
GluSer: 5.402 ± 0.423
GluThr: 4.233 ± 0.389
GluVal: 5.012 ± 0.362
GluTrp: 0.78 ± 0.133
GluTyr: 4.511 ± 0.43
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.005 ± 0.228
PheCys: 0.39 ± 0.124
PheAsp: 3.871 ± 0.347
PheGlu: 4.177 ± 0.456
PhePhe: 1.309 ± 0.196
PheGly: 2.645 ± 0.299
PheHis: 0.473 ± 0.121
PheIle: 3.007 ± 0.308
PheLys: 4.177 ± 0.318
PheLeu: 3.509 ± 0.379
PheMet: 1.448 ± 0.203
PheAsn: 3.286 ± 0.324
PhePro: 0.919 ± 0.178
PheGln: 1.337 ± 0.173
PheArg: 1.587 ± 0.253
PheSer: 2.896 ± 0.349
PheThr: 3.063 ± 0.316
PheVal: 2.729 ± 0.296
PheTrp: 0.446 ± 0.126
PheTyr: 2.2 ± 0.272
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.451 ± 0.347
GlyCys: 0.306 ± 0.099
GlyAsp: 3.592 ± 0.298
GlyGlu: 4.121 ± 0.321
GlyPhe: 3.314 ± 0.296
GlyGly: 3.564 ± 0.573
GlyHis: 0.919 ± 0.145
GlyIle: 4.817 ± 0.396
GlyLys: 4.845 ± 0.431
GlyLeu: 4.623 ± 0.353
GlyMet: 1.949 ± 0.198
GlyAsn: 4.288 ± 0.401
GlyPro: 0.0 ± 0.0
GlyGln: 1.504 ± 0.213
GlyArg: 2.2 ± 0.289
GlySer: 3.369 ± 0.356
GlyThr: 3.648 ± 0.408
GlyVal: 3.731 ± 0.317
GlyTrp: 0.808 ± 0.212
GlyTyr: 3.091 ± 0.257
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.557 ± 0.123
HisCys: 0.139 ± 0.054
HisAsp: 1.197 ± 0.169
HisGlu: 1.002 ± 0.182
HisPhe: 0.808 ± 0.183
HisGly: 1.058 ± 0.202
HisHis: 0.362 ± 0.122
HisIle: 1.281 ± 0.202
HisLys: 1.392 ± 0.221
HisLeu: 1.002 ± 0.178
HisMet: 0.251 ± 0.086
HisAsn: 1.086 ± 0.253
HisPro: 0.418 ± 0.09
HisGln: 0.501 ± 0.139
HisArg: 0.529 ± 0.118
HisSer: 1.002 ± 0.163
HisThr: 0.863 ± 0.158
HisVal: 0.585 ± 0.145
HisTrp: 0.195 ± 0.075
HisTyr: 0.975 ± 0.205
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.982 ± 0.335
IleCys: 0.557 ± 0.122
IleAsp: 5.82 ± 0.376
IleGlu: 6.516 ± 0.56
IlePhe: 3.147 ± 0.291
IleGly: 3.537 ± 0.377
IleHis: 1.17 ± 0.171
IleIle: 5.374 ± 0.604
IleLys: 8.27 ± 0.479
IleLeu: 5.876 ± 0.556
IleMet: 1.309 ± 0.171
IleAsn: 5.458 ± 0.409
IlePro: 2.451 ± 0.268
IleGln: 2.506 ± 0.314
IleArg: 2.367 ± 0.27
IleSer: 5.235 ± 0.331
IleThr: 4.372 ± 0.416
IleVal: 5.152 ± 0.474
IleTrp: 0.696 ± 0.14
IleTyr: 2.868 ± 0.349
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.929 ± 0.441
LysCys: 0.557 ± 0.152
LysAsp: 6.683 ± 0.471
LysGlu: 10.136 ± 0.532
LysPhe: 3.453 ± 0.417
LysGly: 4.873 ± 0.353
LysHis: 1.782 ± 0.287
LysIle: 6.627 ± 0.44
LysLys: 7.797 ± 0.565
LysLeu: 8.076 ± 0.418
LysMet: 3.063 ± 0.286
LysAsn: 5.931 ± 0.378
LysPro: 1.977 ± 0.262
LysGln: 3.119 ± 0.326
LysArg: 3.369 ± 0.34
LysSer: 4.817 ± 0.452
LysThr: 5.597 ± 0.43
LysVal: 5.152 ± 0.393
LysTrp: 1.03 ± 0.159
LysTyr: 3.926 ± 0.401
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.706 ± 0.41
LeuCys: 0.473 ± 0.118
LeuAsp: 6.293 ± 0.426
LeuGlu: 6.126 ± 0.456
LeuPhe: 3.787 ± 0.451
LeuGly: 4.79 ± 0.517
LeuHis: 1.142 ± 0.203
LeuIle: 5.681 ± 0.509
LeuLys: 7.519 ± 0.535
LeuLeu: 6.21 ± 0.435
LeuMet: 1.977 ± 0.241
LeuAsn: 5.068 ± 0.373
LeuPro: 2.144 ± 0.217
LeuGln: 2.673 ± 0.405
LeuArg: 2.757 ± 0.323
LeuSer: 6.46 ± 0.526
LeuThr: 5.068 ± 0.453
LeuVal: 4.567 ± 0.363
LeuTrp: 0.585 ± 0.117
LeuTyr: 3.175 ± 0.39
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.81 ± 0.181
MetCys: 0.084 ± 0.054
MetAsp: 1.559 ± 0.266
MetGlu: 1.949 ± 0.241
MetPhe: 1.225 ± 0.189
MetGly: 1.364 ± 0.209
MetHis: 0.223 ± 0.084
MetIle: 1.782 ± 0.208
MetLys: 2.562 ± 0.26
MetLeu: 1.894 ± 0.258
MetMet: 0.78 ± 0.15
MetAsn: 1.949 ± 0.212
MetPro: 0.64 ± 0.142
MetGln: 1.225 ± 0.325
MetArg: 0.724 ± 0.138
MetSer: 1.949 ± 0.268
MetThr: 2.033 ± 0.215
MetVal: 1.559 ± 0.21
MetTrp: 0.278 ± 0.09
MetTyr: 1.002 ± 0.17
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.147 ± 0.336
AsnCys: 0.39 ± 0.119
AsnAsp: 3.314 ± 0.264
AsnGlu: 4.957 ± 0.438
AsnPhe: 3.091 ± 0.289
AsnGly: 4.344 ± 0.282
AsnHis: 0.975 ± 0.206
AsnIle: 5.709 ± 0.393
AsnLys: 6.071 ± 0.412
AsnLeu: 5.597 ± 0.479
AsnMet: 1.559 ± 0.222
AsnAsn: 4.038 ± 0.332
AsnPro: 2.2 ± 0.243
AsnGln: 2.84 ± 0.335
AsnArg: 2.116 ± 0.244
AsnSer: 4.261 ± 0.383
AsnThr: 3.286 ± 0.233
AsnVal: 3.871 ± 0.252
AsnTrp: 0.585 ± 0.136
AsnTyr: 2.645 ± 0.259
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.17 ± 0.196
ProCys: 0.167 ± 0.078
ProAsp: 1.726 ± 0.237
ProGlu: 1.559 ± 0.205
ProPhe: 1.142 ± 0.16
ProGly: 0.084 ± 0.053
ProHis: 0.418 ± 0.103
ProIle: 1.615 ± 0.204
ProLys: 2.061 ± 0.291
ProLeu: 1.866 ± 0.224
ProMet: 0.78 ± 0.151
ProAsn: 1.726 ± 0.213
ProPro: 0.334 ± 0.104
ProGln: 0.724 ± 0.156
ProArg: 0.64 ± 0.148
ProSer: 1.699 ± 0.226
ProThr: 1.392 ± 0.257
ProVal: 1.866 ± 0.268
ProTrp: 0.084 ± 0.048
ProTyr: 1.197 ± 0.195
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.2 ± 0.434
GlnCys: 0.223 ± 0.097
GlnAsp: 1.894 ± 0.216
GlnGlu: 2.506 ± 0.226
GlnPhe: 1.281 ± 0.156
GlnGly: 2.033 ± 0.221
GlnHis: 0.724 ± 0.159
GlnIle: 2.311 ± 0.275
GlnLys: 3.119 ± 0.371
GlnLeu: 3.202 ± 0.419
GlnMet: 1.197 ± 0.256
GlnAsn: 2.228 ± 0.286
GlnPro: 0.752 ± 0.128
GlnGln: 1.448 ± 0.492
GlnArg: 1.587 ± 0.212
GlnSer: 2.005 ± 0.272
GlnThr: 2.088 ± 0.38
GlnVal: 1.392 ± 0.204
GlnTrp: 0.39 ± 0.107
GlnTyr: 1.448 ± 0.192
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.309 ± 0.188
ArgCys: 0.473 ± 0.231
ArgAsp: 2.423 ± 0.238
ArgGlu: 2.84 ± 0.288
ArgPhe: 1.587 ± 0.218
ArgGly: 2.2 ± 0.257
ArgHis: 0.613 ± 0.135
ArgIle: 2.673 ± 0.363
ArgLys: 2.98 ± 0.329
ArgLeu: 2.451 ± 0.293
ArgMet: 0.835 ± 0.163
ArgAsn: 2.423 ± 0.264
ArgPro: 0.724 ± 0.135
ArgGln: 1.225 ± 0.24
ArgArg: 1.253 ± 0.203
ArgSer: 1.838 ± 0.253
ArgThr: 2.005 ± 0.24
ArgVal: 2.451 ± 0.257
ArgTrp: 0.585 ± 0.129
ArgTyr: 2.005 ± 0.266
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.676 ± 0.42
SerCys: 0.39 ± 0.103
SerAsp: 4.261 ± 0.345
SerGlu: 5.068 ± 0.339
SerPhe: 3.648 ± 0.406
SerGly: 4.233 ± 0.484
SerHis: 0.808 ± 0.165
SerIle: 5.124 ± 0.398
SerLys: 5.848 ± 0.387
SerLeu: 4.873 ± 0.515
SerMet: 1.476 ± 0.205
SerAsn: 3.731 ± 0.37
SerPro: 1.532 ± 0.198
SerGln: 2.088 ± 0.276
SerArg: 2.952 ± 0.289
SerSer: 4.623 ± 0.718
SerThr: 3.537 ± 0.434
SerVal: 3.954 ± 0.351
SerTrp: 0.696 ± 0.162
SerTyr: 2.673 ± 0.286
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.509 ± 0.576
ThrCys: 0.501 ± 0.162
ThrAsp: 3.982 ± 0.346
ThrGlu: 3.982 ± 0.374
ThrPhe: 2.896 ± 0.261
ThrGly: 3.286 ± 0.341
ThrHis: 1.058 ± 0.178
ThrIle: 4.511 ± 0.413
ThrLys: 5.597 ± 0.383
ThrLeu: 5.012 ± 0.323
ThrMet: 1.448 ± 0.199
ThrAsn: 3.369 ± 0.271
ThrPro: 1.866 ± 0.227
ThrGln: 1.587 ± 0.216
ThrArg: 2.116 ± 0.227
ThrSer: 3.954 ± 0.436
ThrThr: 3.537 ± 0.406
ThrVal: 4.762 ± 0.406
ThrTrp: 0.613 ± 0.182
ThrTyr: 2.423 ± 0.264
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.342 ± 0.386
ValCys: 0.501 ± 0.112
ValAsp: 4.762 ± 0.327
ValGlu: 4.985 ± 0.401
ValPhe: 2.896 ± 0.274
ValGly: 3.564 ± 0.366
ValHis: 0.947 ± 0.171
ValIle: 4.455 ± 0.444
ValLys: 5.876 ± 0.396
ValLeu: 4.177 ± 0.417
ValMet: 1.559 ± 0.248
ValAsn: 3.342 ± 0.313
ValPro: 1.476 ± 0.233
ValGln: 2.256 ± 0.235
ValArg: 1.921 ± 0.246
ValSer: 4.121 ± 0.338
ValThr: 4.511 ± 0.403
ValVal: 4.344 ± 0.401
ValTrp: 0.808 ± 0.176
ValTyr: 2.813 ± 0.349
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.39 ± 0.107
TrpCys: 0.084 ± 0.053
TrpAsp: 0.696 ± 0.151
TrpGlu: 0.835 ± 0.191
TrpPhe: 0.446 ± 0.13
TrpGly: 0.668 ± 0.156
TrpHis: 0.167 ± 0.066
TrpIle: 0.64 ± 0.15
TrpLys: 0.752 ± 0.15
TrpLeu: 0.891 ± 0.171
TrpMet: 0.362 ± 0.093
TrpAsn: 1.086 ± 0.161
TrpPro: 0.0 ± 0.0
TrpGln: 0.306 ± 0.083
TrpArg: 0.418 ± 0.099
TrpSer: 0.501 ± 0.114
TrpThr: 0.724 ± 0.136
TrpVal: 0.835 ± 0.18
TrpTrp: 0.084 ± 0.054
TrpTyr: 0.362 ± 0.095
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.451 ± 0.242
TyrCys: 0.39 ± 0.112
TyrAsp: 3.23 ± 0.282
TyrGlu: 3.592 ± 0.369
TyrPhe: 1.726 ± 0.259
TyrGly: 2.924 ± 0.235
TyrHis: 0.668 ± 0.124
TyrIle: 3.731 ± 0.376
TyrLys: 3.704 ± 0.371
TyrLeu: 3.564 ± 0.322
TyrMet: 1.03 ± 0.164
TyrAsn: 2.701 ± 0.288
TyrPro: 1.42 ± 0.241
TyrGln: 1.977 ± 0.205
TyrArg: 1.754 ± 0.249
TyrSer: 2.673 ± 0.311
TyrThr: 2.952 ± 0.351
TyrVal: 2.59 ± 0.294
TyrTrp: 0.278 ± 0.084
TyrTyr: 1.838 ± 0.222
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 177 proteins (35912 amino acids)
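To relate the per mille values back to raw counts, the following sketch converts a table entry into an approximate number of occurrences. It assumes that dipeptides are counted within proteins only, so that a protein of length L contributes L - 1 pairs and the total is the number of amino acids minus the number of proteins; this denominator is an assumption, not something stated on this page. The helper name is hypothetical.

```python
N_PROTEINS = 177    # from the statistics line above
N_RESIDUES = 35912  # from the statistics line above

# Assumed total number of adjacent pairs: each protein of length L
# contributes L - 1 pairs (valid if every protein has at least one residue).
n_pairs = N_RESIDUES - N_PROTEINS


def permille_to_count(value_permille, total=n_pairs):
    """Convert a per-mille frequency to an approximate occurrence count."""
    return value_permille * 1e-3 * total


approx_alaala = permille_to_count(0.028)   # AlaAla entry from the table
approx_lysglu = permille_to_count(10.136)  # LysGlu entry from the table
```

Under this assumption the AlaAla entry corresponds to roughly one occurrence in the whole proteome, and the LysGlu entry to roughly 362.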

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
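A protein-level bootstrap of this kind could look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is reported. The function names, the toy sequences, the fixed seed and the choice of the sample standard deviation as the error measure are all assumptions for illustration; the page does not state which spread statistic the database uses.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide, counted within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    # Assumption: the error is summarised as the sample standard deviation.
    return statistics.stdev(estimates)


# Hypothetical toy sequences in one-letter code.
toy_proteins = ["MKKLE", "MKE", "MAKKA", "MLEKK"]
error = bootstrap_error(toy_proteins, "KK")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.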

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski