Amino acid dipepetide frequency for Lactobacillus phage LfeInf

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.499AlaAla: 1.499 ± 0.302
0.187AlaCys: 0.187 ± 0.09
3.56AlaAsp: 3.56 ± 0.408
2.061AlaGlu: 2.061 ± 0.309
2.311AlaPhe: 2.311 ± 0.237
3.685AlaGly: 3.685 ± 0.526
1.156AlaHis: 1.156 ± 0.177
3.81AlaIle: 3.81 ± 0.401
5.871AlaLys: 5.871 ± 0.599
5.215AlaLeu: 5.215 ± 0.415
1.249AlaMet: 1.249 ± 0.212
5.122AlaAsn: 5.122 ± 0.487
1.999AlaPro: 1.999 ± 0.301
3.404AlaGln: 3.404 ± 0.451
1.999AlaArg: 1.999 ± 0.273
5.247AlaSer: 5.247 ± 0.674
4.653AlaThr: 4.653 ± 0.446
3.498AlaVal: 3.498 ± 0.332
0.656AlaTrp: 0.656 ± 0.202
3.342AlaTyr: 3.342 ± 0.339
0.0AlaXaa: 0.0 ± 0.0
Cys
0.344CysAla: 0.344 ± 0.104
0.031CysCys: 0.031 ± 0.032
0.312CysAsp: 0.312 ± 0.125
0.25CysGlu: 0.25 ± 0.102
0.312CysPhe: 0.312 ± 0.126
0.812CysGly: 0.812 ± 0.191
0.125CysHis: 0.125 ± 0.055
0.281CysIle: 0.281 ± 0.08
0.312CysLys: 0.312 ± 0.083
0.437CysLeu: 0.437 ± 0.137
0.156CysMet: 0.156 ± 0.062
0.375CysAsn: 0.375 ± 0.122
0.562CysPro: 0.562 ± 0.158
0.156CysGln: 0.156 ± 0.073
0.094CysArg: 0.094 ± 0.065
0.375CysSer: 0.375 ± 0.096
0.593CysThr: 0.593 ± 0.148
0.281CysVal: 0.281 ± 0.093
0.125CysTrp: 0.125 ± 0.064
0.312CysTyr: 0.312 ± 0.09
0.0CysXaa: 0.0 ± 0.0
Asp
2.998AspAla: 2.998 ± 0.306
0.562AspCys: 0.562 ± 0.165
5.215AspAsp: 5.215 ± 0.585
3.654AspGlu: 3.654 ± 0.562
2.592AspPhe: 2.592 ± 0.271
4.934AspGly: 4.934 ± 0.561
1.405AspHis: 1.405 ± 0.252
3.31AspIle: 3.31 ± 0.398
5.653AspLys: 5.653 ± 0.502
6.808AspLeu: 6.808 ± 0.559
1.28AspMet: 1.28 ± 0.214
5.215AspAsn: 5.215 ± 0.415
2.374AspPro: 2.374 ± 0.277
3.467AspGln: 3.467 ± 0.348
1.843AspArg: 1.843 ± 0.257
4.778AspSer: 4.778 ± 0.381
4.528AspThr: 4.528 ± 0.397
3.623AspVal: 3.623 ± 0.359
0.75AspTrp: 0.75 ± 0.149
3.998AspTyr: 3.998 ± 0.4
0.0AspXaa: 0.0 ± 0.0
Glu
2.873GluAla: 2.873 ± 0.337
0.344GluCys: 0.344 ± 0.139
4.06GluAsp: 4.06 ± 0.52
3.716GluGlu: 3.716 ± 0.517
1.53GluPhe: 1.53 ± 0.199
2.811GluGly: 2.811 ± 0.271
1.093GluHis: 1.093 ± 0.232
3.654GluIle: 3.654 ± 0.385
3.154GluLys: 3.154 ± 0.382
5.59GluLeu: 5.59 ± 0.568
1.374GluMet: 1.374 ± 0.2
2.405GluAsn: 2.405 ± 0.269
1.312GluPro: 1.312 ± 0.251
3.998GluGln: 3.998 ± 0.502
1.686GluArg: 1.686 ± 0.304
2.936GluSer: 2.936 ± 0.335
2.842GluThr: 2.842 ± 0.318
3.435GluVal: 3.435 ± 0.413
0.312GluTrp: 0.312 ± 0.084
2.936GluTyr: 2.936 ± 0.358
0.0GluXaa: 0.0 ± 0.0
Phe
1.999PheAla: 1.999 ± 0.324
0.25PheCys: 0.25 ± 0.087
2.842PheAsp: 2.842 ± 0.295
1.686PheGlu: 1.686 ± 0.239
0.937PhePhe: 0.937 ± 0.201
2.186PheGly: 2.186 ± 0.291
0.437PheHis: 0.437 ± 0.11
2.374PheIle: 2.374 ± 0.297
2.092PheLys: 2.092 ± 0.266
2.592PheLeu: 2.592 ± 0.377
0.75PheMet: 0.75 ± 0.162
3.029PheAsn: 3.029 ± 0.385
1.031PhePro: 1.031 ± 0.238
1.187PheGln: 1.187 ± 0.168
1.156PheArg: 1.156 ± 0.173
2.155PheSer: 2.155 ± 0.282
2.811PheThr: 2.811 ± 0.326
2.311PheVal: 2.311 ± 0.296
0.562PheTrp: 0.562 ± 0.146
1.655PheTyr: 1.655 ± 0.232
0.0PheXaa: 0.0 ± 0.0
Gly
3.467GlyAla: 3.467 ± 0.436
0.156GlyCys: 0.156 ± 0.082
5.153GlyAsp: 5.153 ± 0.504
2.686GlyGlu: 2.686 ± 0.277
2.623GlyPhe: 2.623 ± 0.333
4.622GlyGly: 4.622 ± 0.624
1.249GlyHis: 1.249 ± 0.198
3.654GlyIle: 3.654 ± 0.411
4.31GlyLys: 4.31 ± 0.408
4.997GlyLeu: 4.997 ± 0.349
1.718GlyMet: 1.718 ± 0.248
4.653GlyAsn: 4.653 ± 0.644
0.718GlyPro: 0.718 ± 0.172
3.029GlyGln: 3.029 ± 0.351
2.092GlyArg: 2.092 ± 0.244
6.433GlySer: 6.433 ± 0.699
5.122GlyThr: 5.122 ± 0.859
4.06GlyVal: 4.06 ± 0.374
0.843GlyTrp: 0.843 ± 0.181
3.935GlyTyr: 3.935 ± 0.393
0.0GlyXaa: 0.0 ± 0.0
His
0.812HisAla: 0.812 ± 0.19
0.094HisCys: 0.094 ± 0.047
1.374HisAsp: 1.374 ± 0.289
0.843HisGlu: 0.843 ± 0.198
0.656HisPhe: 0.656 ± 0.147
1.031HisGly: 1.031 ± 0.238
0.187HisHis: 0.187 ± 0.069
0.968HisIle: 0.968 ± 0.206
1.124HisLys: 1.124 ± 0.198
1.78HisLeu: 1.78 ± 0.26
0.593HisMet: 0.593 ± 0.131
1.093HisAsn: 1.093 ± 0.202
0.656HisPro: 0.656 ± 0.108
1.031HisGln: 1.031 ± 0.226
0.593HisArg: 0.593 ± 0.137
0.718HisSer: 0.718 ± 0.117
1.312HisThr: 1.312 ± 0.185
1.124HisVal: 1.124 ± 0.191
0.219HisTrp: 0.219 ± 0.068
0.812HisTyr: 0.812 ± 0.16
0.0HisXaa: 0.0 ± 0.0
Ile
3.373IleAla: 3.373 ± 0.347
0.437IleCys: 0.437 ± 0.116
4.56IleAsp: 4.56 ± 0.465
3.279IleGlu: 3.279 ± 0.431
1.999IlePhe: 1.999 ± 0.282
2.967IleGly: 2.967 ± 0.37
0.874IleHis: 0.874 ± 0.181
3.623IleIle: 3.623 ± 0.421
4.622IleLys: 4.622 ± 0.507
4.122IleLeu: 4.122 ± 0.478
1.218IleMet: 1.218 ± 0.193
5.091IleAsn: 5.091 ± 0.359
1.811IlePro: 1.811 ± 0.255
2.467IleGln: 2.467 ± 0.283
2.061IleArg: 2.061 ± 0.276
5.059IleSer: 5.059 ± 0.591
4.466IleThr: 4.466 ± 0.484
3.404IleVal: 3.404 ± 0.284
0.562IleTrp: 0.562 ± 0.145
2.623IleTyr: 2.623 ± 0.348
0.0IleXaa: 0.0 ± 0.0
Lys
4.997LysAla: 4.997 ± 0.541
0.312LysCys: 0.312 ± 0.147
5.215LysAsp: 5.215 ± 0.458
5.34LysGlu: 5.34 ± 0.649
1.78LysPhe: 1.78 ± 0.245
4.341LysGly: 4.341 ± 0.352
1.53LysHis: 1.53 ± 0.303
3.841LysIle: 3.841 ± 0.39
4.778LysLys: 4.778 ± 0.56
6.402LysLeu: 6.402 ± 0.585
1.811LysMet: 1.811 ± 0.254
3.404LysAsn: 3.404 ± 0.263
2.124LysPro: 2.124 ± 0.37
3.498LysGln: 3.498 ± 0.427
1.936LysArg: 1.936 ± 0.226
5.028LysSer: 5.028 ± 0.756
3.873LysThr: 3.873 ± 0.377
5.59LysVal: 5.59 ± 0.574
0.468LysTrp: 0.468 ± 0.122
3.186LysTyr: 3.186 ± 0.423
0.0LysXaa: 0.0 ± 0.0
Leu
6.527LeuAla: 6.527 ± 0.453
0.687LeuCys: 0.687 ± 0.186
6.715LeuAsp: 6.715 ± 0.48
4.997LeuGlu: 4.997 ± 0.604
2.655LeuPhe: 2.655 ± 0.273
5.746LeuGly: 5.746 ± 0.342
1.031LeuHis: 1.031 ± 0.201
5.153LeuIle: 5.153 ± 0.456
6.152LeuLys: 6.152 ± 0.486
6.465LeuLeu: 6.465 ± 0.628
1.968LeuMet: 1.968 ± 0.261
6.09LeuAsn: 6.09 ± 0.552
2.405LeuPro: 2.405 ± 0.333
3.81LeuGln: 3.81 ± 0.255
2.748LeuArg: 2.748 ± 0.294
6.964LeuSer: 6.964 ± 0.498
6.621LeuThr: 6.621 ± 0.542
6.09LeuVal: 6.09 ± 0.516
0.812LeuTrp: 0.812 ± 0.147
3.498LeuTyr: 3.498 ± 0.354
0.0LeuXaa: 0.0 ± 0.0
Met
1.593MetAla: 1.593 ± 0.255
0.125MetCys: 0.125 ± 0.057
1.343MetAsp: 1.343 ± 0.217
1.062MetGlu: 1.062 ± 0.207
0.687MetPhe: 0.687 ± 0.175
1.062MetGly: 1.062 ± 0.195
0.437MetHis: 0.437 ± 0.107
1.312MetIle: 1.312 ± 0.256
1.437MetLys: 1.437 ± 0.206
1.718MetLeu: 1.718 ± 0.215
0.468MetMet: 0.468 ± 0.14
1.593MetAsn: 1.593 ± 0.23
0.531MetPro: 0.531 ± 0.118
1.218MetGln: 1.218 ± 0.155
0.75MetArg: 0.75 ± 0.136
1.343MetSer: 1.343 ± 0.208
1.093MetThr: 1.093 ± 0.206
1.655MetVal: 1.655 ± 0.252
0.094MetTrp: 0.094 ± 0.049
0.75MetTyr: 0.75 ± 0.166
0.0MetXaa: 0.0 ± 0.0
Asn
5.684AsnAla: 5.684 ± 0.663
0.281AsnCys: 0.281 ± 0.092
3.873AsnAsp: 3.873 ± 0.334
3.061AsnGlu: 3.061 ± 0.29
1.905AsnPhe: 1.905 ± 0.245
4.934AsnGly: 4.934 ± 0.566
1.374AsnHis: 1.374 ± 0.239
4.341AsnIle: 4.341 ± 0.298
4.372AsnLys: 4.372 ± 0.473
6.496AsnLeu: 6.496 ± 0.475
1.187AsnMet: 1.187 ± 0.188
4.778AsnAsn: 4.778 ± 0.462
2.498AsnPro: 2.498 ± 0.274
4.528AsnGln: 4.528 ± 0.439
1.905AsnArg: 1.905 ± 0.178
5.778AsnSer: 5.778 ± 0.538
4.154AsnThr: 4.154 ± 0.37
3.56AsnVal: 3.56 ± 0.36
0.718AsnTrp: 0.718 ± 0.153
3.498AsnTyr: 3.498 ± 0.288
0.0AsnXaa: 0.0 ± 0.0
Pro
2.03ProAla: 2.03 ± 0.357
0.094ProCys: 0.094 ± 0.048
2.342ProAsp: 2.342 ± 0.326
1.968ProGlu: 1.968 ± 0.3
0.999ProPhe: 0.999 ± 0.179
1.468ProGly: 1.468 ± 0.252
0.625ProHis: 0.625 ± 0.131
1.686ProIle: 1.686 ± 0.217
2.061ProLys: 2.061 ± 0.213
2.436ProLeu: 2.436 ± 0.268
0.718ProMet: 0.718 ± 0.152
2.03ProAsn: 2.03 ± 0.268
0.468ProPro: 0.468 ± 0.119
1.437ProGln: 1.437 ± 0.241
0.625ProArg: 0.625 ± 0.137
2.436ProSer: 2.436 ± 0.314
2.623ProThr: 2.623 ± 0.284
2.28ProVal: 2.28 ± 0.337
0.5ProTrp: 0.5 ± 0.168
2.061ProTyr: 2.061 ± 0.275
0.0ProXaa: 0.0 ± 0.0
Gln
4.029GlnAla: 4.029 ± 0.514
0.187GlnCys: 0.187 ± 0.068
2.498GlnAsp: 2.498 ± 0.302
3.123GlnGlu: 3.123 ± 0.426
1.53GlnPhe: 1.53 ± 0.296
3.279GlnGly: 3.279 ± 0.319
0.5GlnHis: 0.5 ± 0.121
2.717GlnIle: 2.717 ± 0.28
2.623GlnLys: 2.623 ± 0.347
4.841GlnLeu: 4.841 ± 0.414
0.999GlnMet: 0.999 ± 0.203
2.904GlnAsn: 2.904 ± 0.271
2.124GlnPro: 2.124 ± 0.316
3.841GlnGln: 3.841 ± 0.508
2.03GlnArg: 2.03 ± 0.349
3.623GlnSer: 3.623 ± 0.442
3.435GlnThr: 3.435 ± 0.327
4.029GlnVal: 4.029 ± 0.375
0.375GlnTrp: 0.375 ± 0.101
2.498GlnTyr: 2.498 ± 0.337
0.0GlnXaa: 0.0 ± 0.0
Arg
1.468ArgAla: 1.468 ± 0.194
0.344ArgCys: 0.344 ± 0.136
1.187ArgAsp: 1.187 ± 0.209
1.655ArgGlu: 1.655 ± 0.24
1.437ArgPhe: 1.437 ± 0.191
2.092ArgGly: 2.092 ± 0.262
0.406ArgHis: 0.406 ± 0.098
1.718ArgIle: 1.718 ± 0.255
1.936ArgLys: 1.936 ± 0.286
2.904ArgLeu: 2.904 ± 0.328
0.625ArgMet: 0.625 ± 0.11
1.343ArgAsn: 1.343 ± 0.222
0.874ArgPro: 0.874 ± 0.161
1.499ArgGln: 1.499 ± 0.23
0.999ArgArg: 0.999 ± 0.165
2.467ArgSer: 2.467 ± 0.388
1.999ArgThr: 1.999 ± 0.335
3.123ArgVal: 3.123 ± 0.293
0.625ArgTrp: 0.625 ± 0.151
1.874ArgTyr: 1.874 ± 0.251
0.0ArgXaa: 0.0 ± 0.0
Ser
4.809SerAla: 4.809 ± 0.485
0.406SerCys: 0.406 ± 0.109
4.778SerAsp: 4.778 ± 0.475
3.498SerGlu: 3.498 ± 0.384
2.873SerPhe: 2.873 ± 0.27
6.184SerGly: 6.184 ± 0.914
1.28SerHis: 1.28 ± 0.19
4.622SerIle: 4.622 ± 0.394
5.528SerLys: 5.528 ± 0.379
7.152SerLeu: 7.152 ± 0.533
1.312SerMet: 1.312 ± 0.213
5.715SerAsn: 5.715 ± 0.524
2.686SerPro: 2.686 ± 0.358
3.904SerGln: 3.904 ± 0.492
1.686SerArg: 1.686 ± 0.293
7.745SerSer: 7.745 ± 0.82
5.528SerThr: 5.528 ± 0.709
4.809SerVal: 4.809 ± 0.46
0.75SerTrp: 0.75 ± 0.123
3.81SerTyr: 3.81 ± 0.342
0.0SerXaa: 0.0 ± 0.0
Thr
4.279ThrAla: 4.279 ± 0.506
0.344ThrCys: 0.344 ± 0.124
4.778ThrAsp: 4.778 ± 0.42
2.467ThrGlu: 2.467 ± 0.294
2.842ThrPhe: 2.842 ± 0.348
4.653ThrGly: 4.653 ± 0.413
0.999ThrHis: 0.999 ± 0.19
5.153ThrIle: 5.153 ± 0.548
4.435ThrLys: 4.435 ± 0.378
6.34ThrLeu: 6.34 ± 0.469
0.937ThrMet: 0.937 ± 0.18
4.966ThrAsn: 4.966 ± 0.469
2.904ThrPro: 2.904 ± 0.335
3.186ThrGln: 3.186 ± 0.446
2.186ThrArg: 2.186 ± 0.267
5.746ThrSer: 5.746 ± 0.793
5.746ThrThr: 5.746 ± 0.769
3.654ThrVal: 3.654 ± 0.525
0.874ThrTrp: 0.874 ± 0.185
4.06ThrTyr: 4.06 ± 0.458
0.0ThrXaa: 0.0 ± 0.0
Val
4.466ValAla: 4.466 ± 0.416
0.781ValCys: 0.781 ± 0.214
5.028ValAsp: 5.028 ± 0.399
3.654ValGlu: 3.654 ± 0.492
2.217ValPhe: 2.217 ± 0.291
3.841ValGly: 3.841 ± 0.358
1.031ValHis: 1.031 ± 0.156
3.279ValIle: 3.279 ± 0.309
4.997ValLys: 4.997 ± 0.49
4.903ValLeu: 4.903 ± 0.425
0.999ValMet: 0.999 ± 0.175
4.622ValAsn: 4.622 ± 0.453
2.092ValPro: 2.092 ± 0.237
2.217ValGln: 2.217 ± 0.269
1.811ValArg: 1.811 ± 0.256
5.746ValSer: 5.746 ± 0.509
5.028ValThr: 5.028 ± 0.554
4.685ValVal: 4.685 ± 0.46
0.562ValTrp: 0.562 ± 0.155
4.185ValTyr: 4.185 ± 0.673
0.0ValXaa: 0.0 ± 0.0
Trp
0.5TrpAla: 0.5 ± 0.114
0.187TrpCys: 0.187 ± 0.066
0.531TrpAsp: 0.531 ± 0.164
0.468TrpGlu: 0.468 ± 0.123
0.344TrpPhe: 0.344 ± 0.08
0.75TrpGly: 0.75 ± 0.149
0.187TrpHis: 0.187 ± 0.074
0.718TrpIle: 0.718 ± 0.147
0.687TrpLys: 0.687 ± 0.139
1.156TrpLeu: 1.156 ± 0.197
0.031TrpMet: 0.031 ± 0.028
0.468TrpAsn: 0.468 ± 0.134
0.219TrpPro: 0.219 ± 0.084
0.468TrpGln: 0.468 ± 0.126
0.25TrpArg: 0.25 ± 0.088
1.031TrpSer: 1.031 ± 0.234
0.593TrpThr: 0.593 ± 0.142
1.124TrpVal: 1.124 ± 0.253
0.312TrpTrp: 0.312 ± 0.104
0.687TrpTyr: 0.687 ± 0.152
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.78TyrAla: 2.78 ± 0.242
0.437TyrCys: 0.437 ± 0.111
3.623TyrAsp: 3.623 ± 0.399
2.592TyrGlu: 2.592 ± 0.32
1.811TyrPhe: 1.811 ± 0.252
4.122TyrGly: 4.122 ± 0.585
1.093TyrHis: 1.093 ± 0.163
2.405TyrIle: 2.405 ± 0.295
3.31TyrLys: 3.31 ± 0.324
4.778TyrLeu: 4.778 ± 0.481
0.843TyrMet: 0.843 ± 0.14
4.06TyrAsn: 4.06 ± 0.335
1.499TyrPro: 1.499 ± 0.303
2.842TyrGln: 2.842 ± 0.292
2.124TyrArg: 2.124 ± 0.286
3.467TyrSer: 3.467 ± 0.322
3.529TyrThr: 3.529 ± 0.421
3.81TyrVal: 3.81 ± 0.353
0.593TyrTrp: 0.593 ± 0.129
2.342TyrTyr: 2.342 ± 0.289
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 123 proteins (32021 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski