Amino acid dipepetide frequency for Lactobacillus phage LpeD

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.924AlaAla: 1.924 ± 0.499
0.163AlaCys: 0.163 ± 0.069
3.098AlaAsp: 3.098 ± 0.276
2.935AlaGlu: 2.935 ± 0.283
2.152AlaPhe: 2.152 ± 0.223
3.815AlaGly: 3.815 ± 0.61
0.913AlaHis: 0.913 ± 0.17
3.946AlaIle: 3.946 ± 0.383
4.729AlaLys: 4.729 ± 0.457
4.533AlaLeu: 4.533 ± 0.402
1.207AlaMet: 1.207 ± 0.188
3.391AlaAsn: 3.391 ± 0.346
2.054AlaPro: 2.054 ± 0.281
2.185AlaGln: 2.185 ± 0.279
2.022AlaArg: 2.022 ± 0.261
4.729AlaSer: 4.729 ± 0.592
4.207AlaThr: 4.207 ± 0.413
3.391AlaVal: 3.391 ± 0.297
0.489AlaTrp: 0.489 ± 0.12
2.837AlaTyr: 2.837 ± 0.36
0.0AlaXaa: 0.0 ± 0.0
Cys
0.163CysAla: 0.163 ± 0.079
0.098CysCys: 0.098 ± 0.076
0.261CysAsp: 0.261 ± 0.082
0.163CysGlu: 0.163 ± 0.071
0.261CysPhe: 0.261 ± 0.104
0.457CysGly: 0.457 ± 0.109
0.163CysHis: 0.163 ± 0.075
0.587CysIle: 0.587 ± 0.133
0.489CysLys: 0.489 ± 0.13
0.522CysLeu: 0.522 ± 0.163
0.098CysMet: 0.098 ± 0.05
0.196CysAsn: 0.196 ± 0.082
0.196CysPro: 0.196 ± 0.08
0.261CysGln: 0.261 ± 0.105
0.163CysArg: 0.163 ± 0.066
0.522CysSer: 0.522 ± 0.136
0.228CysThr: 0.228 ± 0.089
0.359CysVal: 0.359 ± 0.108
0.033CysTrp: 0.033 ± 0.034
0.424CysTyr: 0.424 ± 0.116
0.0CysXaa: 0.0 ± 0.0
Asp
3.555AspAla: 3.555 ± 0.427
0.554AspCys: 0.554 ± 0.145
4.468AspAsp: 4.468 ± 0.513
4.207AspGlu: 4.207 ± 0.397
3.391AspPhe: 3.391 ± 0.364
4.598AspGly: 4.598 ± 0.446
0.652AspHis: 0.652 ± 0.141
5.739AspIle: 5.739 ± 0.497
5.935AspLys: 5.935 ± 0.567
7.011AspLeu: 7.011 ± 0.574
1.859AspMet: 1.859 ± 0.221
4.663AspAsn: 4.663 ± 0.436
1.891AspPro: 1.891 ± 0.247
1.337AspGln: 1.337 ± 0.185
1.989AspArg: 1.989 ± 0.232
6.229AspSer: 6.229 ± 0.448
4.663AspThr: 4.663 ± 0.445
3.489AspVal: 3.489 ± 0.358
0.815AspTrp: 0.815 ± 0.177
3.652AspTyr: 3.652 ± 0.359
0.0AspXaa: 0.0 ± 0.0
Glu
3.261GluAla: 3.261 ± 0.402
0.228GluCys: 0.228 ± 0.077
4.761GluAsp: 4.761 ± 0.408
3.913GluGlu: 3.913 ± 0.51
2.185GluPhe: 2.185 ± 0.271
2.805GluGly: 2.805 ± 0.245
0.978GluHis: 0.978 ± 0.167
3.522GluIle: 3.522 ± 0.316
3.946GluLys: 3.946 ± 0.396
5.479GluLeu: 5.479 ± 0.44
1.337GluMet: 1.337 ± 0.229
3.848GluAsn: 3.848 ± 0.413
1.207GluPro: 1.207 ± 0.224
2.609GluGln: 2.609 ± 0.363
1.957GluArg: 1.957 ± 0.252
3.489GluSer: 3.489 ± 0.352
2.805GluThr: 2.805 ± 0.285
3.163GluVal: 3.163 ± 0.365
0.554GluTrp: 0.554 ± 0.127
2.674GluTyr: 2.674 ± 0.276
0.0GluXaa: 0.0 ± 0.0
Phe
1.402PheAla: 1.402 ± 0.256
0.391PheCys: 0.391 ± 0.131
2.609PheAsp: 2.609 ± 0.262
1.728PheGlu: 1.728 ± 0.29
0.75PhePhe: 0.75 ± 0.155
2.772PheGly: 2.772 ± 0.326
0.326PheHis: 0.326 ± 0.138
2.478PheIle: 2.478 ± 0.335
3.783PheLys: 3.783 ± 0.416
2.315PheLeu: 2.315 ± 0.269
0.88PheMet: 0.88 ± 0.16
3.978PheAsn: 3.978 ± 0.375
1.141PhePro: 1.141 ± 0.194
1.37PheGln: 1.37 ± 0.217
1.565PheArg: 1.565 ± 0.209
3.391PheSer: 3.391 ± 0.369
2.511PheThr: 2.511 ± 0.359
2.739PheVal: 2.739 ± 0.42
0.424PheTrp: 0.424 ± 0.117
2.087PheTyr: 2.087 ± 0.302
0.0PheXaa: 0.0 ± 0.0
Gly
3.946GlyAla: 3.946 ± 0.678
0.326GlyCys: 0.326 ± 0.108
4.761GlyAsp: 4.761 ± 0.35
2.805GlyGlu: 2.805 ± 0.293
2.674GlyPhe: 2.674 ± 0.246
4.044GlyGly: 4.044 ± 0.471
1.207GlyHis: 1.207 ± 0.248
4.337GlyIle: 4.337 ± 0.365
5.12GlyLys: 5.12 ± 0.482
5.185GlyLeu: 5.185 ± 0.416
1.533GlyMet: 1.533 ± 0.221
4.533GlyAsn: 4.533 ± 0.467
0.88GlyPro: 0.88 ± 0.218
1.989GlyGln: 1.989 ± 0.313
1.663GlyArg: 1.663 ± 0.202
5.381GlySer: 5.381 ± 0.595
5.022GlyThr: 5.022 ± 0.6
4.37GlyVal: 4.37 ± 0.452
0.75GlyTrp: 0.75 ± 0.147
3.685GlyTyr: 3.685 ± 0.351
0.0GlyXaa: 0.0 ± 0.0
His
0.62HisAla: 0.62 ± 0.155
0.196HisCys: 0.196 ± 0.087
1.044HisAsp: 1.044 ± 0.175
0.815HisGlu: 0.815 ± 0.155
0.783HisPhe: 0.783 ± 0.195
0.783HisGly: 0.783 ± 0.164
0.228HisHis: 0.228 ± 0.086
1.272HisIle: 1.272 ± 0.212
0.946HisLys: 0.946 ± 0.165
1.304HisLeu: 1.304 ± 0.198
0.293HisMet: 0.293 ± 0.118
1.272HisAsn: 1.272 ± 0.223
0.652HisPro: 0.652 ± 0.12
0.326HisGln: 0.326 ± 0.077
0.293HisArg: 0.293 ± 0.095
0.946HisSer: 0.946 ± 0.167
0.685HisThr: 0.685 ± 0.154
1.141HisVal: 1.141 ± 0.22
0.261HisTrp: 0.261 ± 0.1
0.848HisTyr: 0.848 ± 0.156
0.0HisXaa: 0.0 ± 0.0
Ile
3.587IleAla: 3.587 ± 0.407
0.391IleCys: 0.391 ± 0.158
5.283IleAsp: 5.283 ± 0.431
3.815IleGlu: 3.815 ± 0.339
1.402IlePhe: 1.402 ± 0.239
4.142IleGly: 4.142 ± 0.427
0.783IleHis: 0.783 ± 0.181
4.044IleIle: 4.044 ± 0.407
7.076IleLys: 7.076 ± 0.538
4.076IleLeu: 4.076 ± 0.435
1.598IleMet: 1.598 ± 0.263
6.522IleAsn: 6.522 ± 0.575
2.707IlePro: 2.707 ± 0.294
1.989IleGln: 1.989 ± 0.264
2.576IleArg: 2.576 ± 0.256
6.229IleSer: 6.229 ± 0.508
4.435IleThr: 4.435 ± 0.372
3.587IleVal: 3.587 ± 0.338
0.554IleTrp: 0.554 ± 0.15
2.902IleTyr: 2.902 ± 0.313
0.0IleXaa: 0.0 ± 0.0
Lys
5.055LysAla: 5.055 ± 0.373
0.424LysCys: 0.424 ± 0.115
6.131LysAsp: 6.131 ± 0.579
5.739LysGlu: 5.739 ± 0.644
3.131LysPhe: 3.131 ± 0.372
4.5LysGly: 4.5 ± 0.415
1.435LysHis: 1.435 ± 0.253
4.892LysIle: 4.892 ± 0.403
5.022LysLys: 5.022 ± 0.629
7.076LysLeu: 7.076 ± 0.535
1.826LysMet: 1.826 ± 0.233
5.152LysAsn: 5.152 ± 0.473
2.511LysPro: 2.511 ± 0.398
3.228LysGln: 3.228 ± 0.356
2.935LysArg: 2.935 ± 0.372
6.424LysSer: 6.424 ± 0.655
3.522LysThr: 3.522 ± 0.314
5.25LysVal: 5.25 ± 0.405
0.62LysTrp: 0.62 ± 0.147
3.718LysTyr: 3.718 ± 0.46
0.0LysXaa: 0.0 ± 0.0
Leu
4.957LeuAla: 4.957 ± 0.418
0.326LeuCys: 0.326 ± 0.096
6.261LeuAsp: 6.261 ± 0.544
4.696LeuGlu: 4.696 ± 0.452
3.294LeuPhe: 3.294 ± 0.322
4.794LeuGly: 4.794 ± 0.454
1.044LeuHis: 1.044 ± 0.167
4.957LeuIle: 4.957 ± 0.385
6.555LeuLys: 6.555 ± 0.519
5.739LeuLeu: 5.739 ± 0.586
2.054LeuMet: 2.054 ± 0.253
6.0LeuAsn: 6.0 ± 0.455
2.87LeuPro: 2.87 ± 0.396
3.098LeuGln: 3.098 ± 0.361
3.163LeuArg: 3.163 ± 0.405
8.022LeuSer: 8.022 ± 0.661
5.674LeuThr: 5.674 ± 0.425
5.022LeuVal: 5.022 ± 0.433
0.62LeuTrp: 0.62 ± 0.171
3.881LeuTyr: 3.881 ± 0.394
0.0LeuXaa: 0.0 ± 0.0
Met
1.337MetAla: 1.337 ± 0.25
0.033MetCys: 0.033 ± 0.033
1.37MetAsp: 1.37 ± 0.248
1.402MetGlu: 1.402 ± 0.199
0.946MetPhe: 0.946 ± 0.152
1.011MetGly: 1.011 ± 0.209
0.424MetHis: 0.424 ± 0.105
1.467MetIle: 1.467 ± 0.228
1.728MetLys: 1.728 ± 0.319
1.859MetLeu: 1.859 ± 0.212
0.293MetMet: 0.293 ± 0.158
1.174MetAsn: 1.174 ± 0.222
0.652MetPro: 0.652 ± 0.133
1.207MetGln: 1.207 ± 0.206
0.62MetArg: 0.62 ± 0.143
2.218MetSer: 2.218 ± 0.296
1.435MetThr: 1.435 ± 0.247
1.076MetVal: 1.076 ± 0.187
0.163MetTrp: 0.163 ± 0.066
0.913MetTyr: 0.913 ± 0.137
0.0MetXaa: 0.0 ± 0.0
Asn
3.457AsnAla: 3.457 ± 0.322
0.326AsnCys: 0.326 ± 0.085
4.794AsnAsp: 4.794 ± 0.369
3.196AsnGlu: 3.196 ± 0.301
2.511AsnPhe: 2.511 ± 0.386
5.185AsnGly: 5.185 ± 0.524
1.435AsnHis: 1.435 ± 0.21
4.239AsnIle: 4.239 ± 0.392
6.261AsnLys: 6.261 ± 0.527
5.968AsnLeu: 5.968 ± 0.383
1.598AsnMet: 1.598 ± 0.281
6.131AsnAsn: 6.131 ± 0.422
2.478AsnPro: 2.478 ± 0.299
3.261AsnGln: 3.261 ± 0.352
2.544AsnArg: 2.544 ± 0.249
6.979AsnSer: 6.979 ± 0.518
4.761AsnThr: 4.761 ± 0.581
3.163AsnVal: 3.163 ± 0.376
0.783AsnTrp: 0.783 ± 0.137
4.011AsnTyr: 4.011 ± 0.43
0.0AsnXaa: 0.0 ± 0.0
Pro
1.37ProAla: 1.37 ± 0.253
0.065ProCys: 0.065 ± 0.047
2.218ProAsp: 2.218 ± 0.266
2.674ProGlu: 2.674 ± 0.316
1.402ProPhe: 1.402 ± 0.245
1.044ProGly: 1.044 ± 0.171
0.391ProHis: 0.391 ± 0.103
2.218ProIle: 2.218 ± 0.257
2.152ProLys: 2.152 ± 0.306
2.674ProLeu: 2.674 ± 0.331
0.554ProMet: 0.554 ± 0.113
2.218ProAsn: 2.218 ± 0.283
0.554ProPro: 0.554 ± 0.15
0.978ProGln: 0.978 ± 0.223
0.815ProArg: 0.815 ± 0.127
2.087ProSer: 2.087 ± 0.25
2.087ProThr: 2.087 ± 0.392
2.218ProVal: 2.218 ± 0.291
0.293ProTrp: 0.293 ± 0.102
1.891ProTyr: 1.891 ± 0.297
0.0ProXaa: 0.0 ± 0.0
Gln
2.772GlnAla: 2.772 ± 0.309
0.326GlnCys: 0.326 ± 0.107
2.446GlnAsp: 2.446 ± 0.283
1.989GlnGlu: 1.989 ± 0.287
1.37GlnPhe: 1.37 ± 0.215
2.544GlnGly: 2.544 ± 0.254
0.554GlnHis: 0.554 ± 0.146
2.348GlnIle: 2.348 ± 0.288
2.446GlnLys: 2.446 ± 0.34
3.294GlnLeu: 3.294 ± 0.335
0.913GlnMet: 0.913 ± 0.188
1.565GlnAsn: 1.565 ± 0.205
0.946GlnPro: 0.946 ± 0.255
1.598GlnGln: 1.598 ± 0.337
1.239GlnArg: 1.239 ± 0.22
3.0GlnSer: 3.0 ± 0.363
1.891GlnThr: 1.891 ± 0.298
3.033GlnVal: 3.033 ± 0.334
0.457GlnTrp: 0.457 ± 0.123
1.663GlnTyr: 1.663 ± 0.224
0.0GlnXaa: 0.0 ± 0.0
Arg
1.533ArgAla: 1.533 ± 0.193
0.228ArgCys: 0.228 ± 0.072
1.891ArgAsp: 1.891 ± 0.25
1.663ArgGlu: 1.663 ± 0.235
1.598ArgPhe: 1.598 ± 0.193
2.185ArgGly: 2.185 ± 0.283
0.424ArgHis: 0.424 ± 0.106
3.131ArgIle: 3.131 ± 0.314
2.413ArgLys: 2.413 ± 0.317
3.326ArgLeu: 3.326 ± 0.321
0.88ArgMet: 0.88 ± 0.178
1.989ArgAsn: 1.989 ± 0.23
0.913ArgPro: 0.913 ± 0.198
1.304ArgGln: 1.304 ± 0.216
1.207ArgArg: 1.207 ± 0.217
2.283ArgSer: 2.283 ± 0.276
1.891ArgThr: 1.891 ± 0.228
2.544ArgVal: 2.544 ± 0.298
0.391ArgTrp: 0.391 ± 0.131
1.631ArgTyr: 1.631 ± 0.218
0.0ArgXaa: 0.0 ± 0.0
Ser
4.892SerAla: 4.892 ± 0.493
0.163SerCys: 0.163 ± 0.068
6.653SerAsp: 6.653 ± 0.424
4.109SerGlu: 4.109 ± 0.441
3.033SerPhe: 3.033 ± 0.362
6.783SerGly: 6.783 ± 0.691
1.044SerHis: 1.044 ± 0.25
5.381SerIle: 5.381 ± 0.444
7.207SerLys: 7.207 ± 0.734
7.142SerLeu: 7.142 ± 0.572
1.337SerMet: 1.337 ± 0.201
6.424SerAsn: 6.424 ± 0.508
2.12SerPro: 2.12 ± 0.254
3.228SerGln: 3.228 ± 0.362
2.315SerArg: 2.315 ± 0.29
7.24SerSer: 7.24 ± 0.89
5.218SerThr: 5.218 ± 0.525
4.989SerVal: 4.989 ± 0.426
1.011SerTrp: 1.011 ± 0.177
4.207SerTyr: 4.207 ± 0.366
0.0SerXaa: 0.0 ± 0.0
Thr
3.75ThrAla: 3.75 ± 0.489
0.326ThrCys: 0.326 ± 0.118
4.272ThrAsp: 4.272 ± 0.43
2.674ThrGlu: 2.674 ± 0.275
2.772ThrPhe: 2.772 ± 0.306
5.055ThrGly: 5.055 ± 0.687
0.815ThrHis: 0.815 ± 0.156
5.25ThrIle: 5.25 ± 0.423
3.815ThrLys: 3.815 ± 0.35
5.022ThrLeu: 5.022 ± 0.502
0.783ThrMet: 0.783 ± 0.143
4.598ThrAsn: 4.598 ± 0.4
2.348ThrPro: 2.348 ± 0.289
2.478ThrGln: 2.478 ± 0.359
2.087ThrArg: 2.087 ± 0.242
5.12ThrSer: 5.12 ± 0.573
5.087ThrThr: 5.087 ± 1.419
4.663ThrVal: 4.663 ± 0.42
0.587ThrTrp: 0.587 ± 0.136
3.0ThrTyr: 3.0 ± 0.342
0.0ThrXaa: 0.0 ± 0.0
Val
3.391ValAla: 3.391 ± 0.316
0.457ValCys: 0.457 ± 0.13
4.631ValAsp: 4.631 ± 0.385
3.261ValGlu: 3.261 ± 0.394
2.511ValPhe: 2.511 ± 0.261
3.652ValGly: 3.652 ± 0.415
0.913ValHis: 0.913 ± 0.168
4.011ValIle: 4.011 ± 0.311
4.729ValLys: 4.729 ± 0.398
4.859ValLeu: 4.859 ± 0.475
1.239ValMet: 1.239 ± 0.188
4.859ValAsn: 4.859 ± 0.449
2.12ValPro: 2.12 ± 0.272
1.761ValGln: 1.761 ± 0.238
1.794ValArg: 1.794 ± 0.272
5.381ValSer: 5.381 ± 0.403
4.598ValThr: 4.598 ± 0.393
3.75ValVal: 3.75 ± 0.37
0.848ValTrp: 0.848 ± 0.245
3.457ValTyr: 3.457 ± 0.31
0.0ValXaa: 0.0 ± 0.0
Trp
0.652TrpAla: 0.652 ± 0.138
0.065TrpCys: 0.065 ± 0.043
0.587TrpAsp: 0.587 ± 0.144
0.62TrpGlu: 0.62 ± 0.127
0.652TrpPhe: 0.652 ± 0.157
0.913TrpGly: 0.913 ± 0.225
0.098TrpHis: 0.098 ± 0.055
0.652TrpIle: 0.652 ± 0.159
0.75TrpLys: 0.75 ± 0.133
1.076TrpLeu: 1.076 ± 0.298
0.228TrpMet: 0.228 ± 0.076
0.522TrpAsn: 0.522 ± 0.125
0.13TrpPro: 0.13 ± 0.068
0.326TrpGln: 0.326 ± 0.121
0.326TrpArg: 0.326 ± 0.105
0.587TrpSer: 0.587 ± 0.153
0.554TrpThr: 0.554 ± 0.159
0.978TrpVal: 0.978 ± 0.21
0.326TrpTrp: 0.326 ± 0.109
0.652TrpTyr: 0.652 ± 0.156
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.098TyrAla: 3.098 ± 0.316
0.554TyrCys: 0.554 ± 0.158
3.163TyrAsp: 3.163 ± 0.341
2.315TyrGlu: 2.315 ± 0.281
1.826TyrPhe: 1.826 ± 0.219
3.131TyrGly: 3.131 ± 0.36
0.88TyrHis: 0.88 ± 0.171
3.228TyrIle: 3.228 ± 0.343
3.489TyrLys: 3.489 ± 0.439
4.565TyrLeu: 4.565 ± 0.42
0.783TyrMet: 0.783 ± 0.153
4.076TyrAsn: 4.076 ± 0.381
1.533TyrPro: 1.533 ± 0.257
1.859TyrGln: 1.859 ± 0.343
2.152TyrArg: 2.152 ± 0.292
4.239TyrSer: 4.239 ± 0.438
3.228TyrThr: 3.228 ± 0.323
3.294TyrVal: 3.294 ± 0.323
0.717TyrTrp: 0.717 ± 0.159
3.163TyrTyr: 3.163 ± 0.296
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 100 proteins (30666 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski