Amino acid dipeptide frequency for Lactobacillus phage Lv-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red, and under-represented ones in blue, in the table.
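The counting described above can be sketched in Python as follows. This is a minimal illustration, not the code used by Proteome-pI: the function name `dipeptide_frequencies`, the one-letter alphabet, the mapping of any non-standard letter to `X` (Xaa), and the choice to count overlapping pairs within each protein only are all assumptions, and the input sequences are toy data.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are overlapping adjacent pairs counted within
    each protein, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not taken from the phage proteome.
toy = ["MKKLA", "MKAB"]
freqs = dipeptide_frequencies(toy)
uniform = 1000.0 / 400  # 2.5 per mille for each of the 400 standard dipeptides
print(len(freqs), round(freqs["MK"], 3), uniform)
```

Comparing each entry of `freqs` with `uniform` gives the over- or under-representation that the red and blue marking refers to.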

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
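As a small worked example of the unit conversion, the sketch below turns the AlaAla value from the table (4.611‰) into a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper name `per_mille_to_fraction` is hypothetical and only used for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

ala_ala = 4.611  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = fraction * 100
ratio = ala_ala / UNIFORM_PER_MILLE  # > 1 means over-represented
print(f"{fraction:.6f} {percent:.4f}% {ratio:.2f}x")
```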

For more information, see the sequence space article on Wikipedia.

Rows give the first residue, columns the second; each cell is frequency ± error in ‰.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.611 ± 0.849 | 0.348 ± 0.173 | 5.133 ± 0.648 | 4.437 ± 0.616 | 3.045 ± 0.633 | 4.611 ± 1.161 | 0.696 ± 0.235 | 5.916 ± 0.654 | 6.003 ± 1.128 | 4.872 ± 0.663 | 1.566 ± 0.366 | 3.654 ± 0.763 | 1.827 ± 0.324 | 3.741 ± 0.529 | 2.262 ± 0.443 | 5.22 ± 0.644 | 5.307 ± 0.748 | 5.655 ± 0.789 | 1.392 ± 0.404 | 2.523 ± 0.453 | 0.0 ± 0.0
Cys | 0.348 ± 0.187 | 0.0 ± 0.0 | 0.696 ± 0.234 | 0.348 ± 0.23 | 0.348 ± 0.186 | 0.696 ± 0.245 | 0.348 ± 0.18 | 0.609 ± 0.238 | 0.609 ± 0.225 | 1.044 ± 0.278 | 0.087 ± 0.094 | 0.174 ± 0.114 | 0.174 ± 0.169 | 0.087 ± 0.09 | 0.522 ± 0.211 | 0.609 ± 0.197 | 0.522 ± 0.232 | 0.522 ± 0.208 | 0.0 ± 0.0 | 0.174 ± 0.108 | 0.0 ± 0.0
Asp | 4.524 ± 0.642 | 0.522 ± 0.201 | 5.655 ± 1.068 | 4.176 ± 0.633 | 3.48 ± 0.46 | 4.611 ± 0.54 | 1.044 ± 0.305 | 4.524 ± 0.662 | 4.872 ± 0.612 | 6.699 ± 0.841 | 0.696 ± 0.245 | 3.828 ± 0.59 | 1.566 ± 0.382 | 2.262 ± 0.504 | 1.392 ± 0.283 | 4.611 ± 0.964 | 3.915 ± 0.568 | 3.915 ± 0.515 | 1.044 ± 0.319 | 3.132 ± 0.579 | 0.0 ± 0.0
Glu | 4.35 ± 0.478 | 0.174 ± 0.116 | 4.437 ± 0.779 | 4.176 ± 0.645 | 2.262 ± 0.496 | 2.001 ± 0.296 | 1.044 ± 0.259 | 3.132 ± 0.387 | 6.09 ± 0.692 | 5.22 ± 0.739 | 1.653 ± 0.366 | 4.263 ± 0.67 | 1.827 ± 0.424 | 2.871 ± 0.478 | 2.349 ± 0.438 | 2.697 ± 0.433 | 2.523 ± 0.36 | 3.48 ± 0.542 | 0.87 ± 0.349 | 1.566 ± 0.414 | 0.0 ± 0.0
Phe | 2.001 ± 0.385 | 0.348 ± 0.166 | 2.871 ± 0.517 | 2.61 ± 0.65 | 2.175 ± 0.534 | 3.306 ± 0.797 | 0.87 ± 0.258 | 2.262 ± 0.477 | 3.393 ± 0.567 | 2.349 ± 0.525 | 1.044 ± 0.335 | 2.61 ± 0.442 | 1.131 ± 0.356 | 1.653 ± 0.351 | 1.305 ± 0.293 | 3.393 ± 0.461 | 2.784 ± 0.393 | 2.523 ± 0.486 | 0.609 ± 0.164 | 1.566 ± 0.357 | 0.0 ± 0.0
Gly | 3.828 ± 0.778 | 0.522 ± 0.221 | 3.045 ± 0.49 | 3.567 ± 0.544 | 2.523 ± 0.506 | 3.48 ± 0.773 | 1.566 ± 0.342 | 5.046 ± 0.734 | 6.438 ± 1.212 | 5.133 ± 1.121 | 1.392 ± 0.312 | 3.219 ± 0.717 | 0.696 ± 0.165 | 3.132 ± 0.706 | 2.349 ± 0.556 | 4.698 ± 0.741 | 4.176 ± 0.713 | 3.48 ± 0.621 | 1.218 ± 0.353 | 2.958 ± 0.573 | 0.0 ± 0.0
His | 0.783 ± 0.251 | 0.0 ± 0.0 | 1.131 ± 0.359 | 1.044 ± 0.278 | 0.783 ± 0.235 | 0.957 ± 0.373 | 0.348 ± 0.176 | 1.305 ± 0.294 | 1.653 ± 0.317 | 1.392 ± 0.364 | 0.174 ± 0.098 | 0.696 ± 0.241 | 0.348 ± 0.167 | 0.435 ± 0.194 | 0.783 ± 0.268 | 1.305 ± 0.341 | 1.131 ± 0.354 | 1.131 ± 0.358 | 0.348 ± 0.152 | 1.131 ± 0.292 | 0.0 ± 0.0
Ile | 4.872 ± 0.733 | 0.348 ± 0.175 | 5.307 ± 0.772 | 4.089 ± 0.574 | 2.871 ± 0.495 | 4.524 ± 0.552 | 0.87 ± 0.199 | 4.002 ± 0.542 | 5.481 ± 0.726 | 5.22 ± 0.936 | 0.783 ± 0.25 | 4.176 ± 0.567 | 2.262 ± 0.474 | 3.741 ± 0.569 | 1.74 ± 0.357 | 6.177 ± 0.631 | 4.263 ± 0.799 | 3.741 ± 0.668 | 0.609 ± 0.163 | 2.001 ± 0.379 | 0.0 ± 0.0
Lys | 7.395 ± 1.119 | 0.957 ± 0.367 | 6.264 ± 0.645 | 4.437 ± 0.616 | 3.306 ± 0.509 | 5.307 ± 0.773 | 1.479 ± 0.35 | 5.916 ± 0.7 | 8.265 ± 1.017 | 5.916 ± 0.709 | 2.088 ± 0.394 | 5.916 ± 0.897 | 2.871 ± 0.486 | 5.307 ± 0.751 | 3.132 ± 0.756 | 5.742 ± 1.116 | 4.785 ± 0.666 | 4.524 ± 0.48 | 1.653 ± 0.428 | 3.567 ± 0.604 | 0.0 ± 0.0
Leu | 6.96 ± 0.588 | 0.957 ± 0.286 | 4.785 ± 0.747 | 4.437 ± 0.701 | 2.523 ± 0.449 | 4.872 ± 0.709 | 1.653 ± 0.283 | 3.741 ± 0.714 | 8.265 ± 0.751 | 6.09 ± 0.913 | 2.088 ± 0.441 | 4.437 ± 0.557 | 1.653 ± 0.357 | 3.567 ± 0.583 | 3.48 ± 0.511 | 6.351 ± 0.588 | 6.264 ± 0.74 | 4.263 ± 0.65 | 1.218 ± 0.285 | 1.653 ± 0.394 | 0.0 ± 0.0
Met | 2.001 ± 0.486 | 0.348 ± 0.182 | 0.435 ± 0.232 | 1.044 ± 0.261 | 0.957 ± 0.292 | 0.87 ± 0.233 | 0.261 ± 0.139 | 1.827 ± 0.383 | 2.523 ± 0.451 | 2.001 ± 0.397 | 0.435 ± 0.206 | 1.392 ± 0.28 | 0.957 ± 0.413 | 1.131 ± 0.251 | 0.87 ± 0.21 | 1.914 ± 0.406 | 1.479 ± 0.282 | 0.609 ± 0.191 | 0.0 ± 0.0 | 0.522 ± 0.171 | 0.0 ± 0.0
Asn | 5.22 ± 0.744 | 0.609 ± 0.28 | 3.567 ± 0.572 | 3.045 ± 0.413 | 2.697 ± 0.519 | 4.089 ± 0.601 | 1.131 ± 0.401 | 4.176 ± 0.603 | 5.307 ± 0.822 | 4.959 ± 0.921 | 1.392 ± 0.344 | 3.132 ± 0.725 | 1.914 ± 0.37 | 2.871 ± 0.498 | 2.523 ± 0.476 | 3.306 ± 0.561 | 3.306 ± 0.503 | 2.61 ± 0.504 | 1.392 ± 0.319 | 2.349 ± 0.589 | 0.0 ± 0.0
Pro | 1.653 ± 0.339 | 0.087 ± 0.088 | 2.262 ± 0.448 | 2.523 ± 0.383 | 1.131 ± 0.33 | 1.044 ± 0.382 | 0.348 ± 0.191 | 2.436 ± 0.539 | 2.175 ± 0.431 | 2.001 ± 0.459 | 0.609 ± 0.26 | 2.175 ± 0.575 | 0.957 ± 0.266 | 0.87 ± 0.359 | 0.609 ± 0.288 | 1.305 ± 0.247 | 1.827 ± 0.414 | 2.262 ± 0.521 | 0.174 ± 0.124 | 1.479 ± 0.335 | 0.0 ± 0.0
Gln | 4.176 ± 0.492 | 0.261 ± 0.155 | 2.958 ± 0.539 | 1.914 ± 0.419 | 2.001 ± 0.46 | 1.653 ± 0.393 | 0.87 ± 0.267 | 4.611 ± 0.553 | 3.915 ± 0.55 | 4.089 ± 0.603 | 1.479 ± 0.341 | 2.871 ± 0.433 | 1.479 ± 0.484 | 2.262 ± 0.45 | 2.088 ± 0.502 | 2.958 ± 0.496 | 3.306 ± 0.509 | 2.871 ± 0.562 | 0.957 ± 0.328 | 2.001 ± 0.579 | 0.0 ± 0.0
Arg | 2.436 ± 0.419 | 0.609 ± 0.286 | 2.61 ± 0.625 | 1.74 ± 0.455 | 0.609 ± 0.222 | 2.262 ± 0.405 | 1.131 ± 0.355 | 1.827 ± 0.425 | 3.654 ± 0.586 | 3.045 ± 0.365 | 0.696 ± 0.244 | 1.914 ± 0.5 | 0.957 ± 0.279 | 1.914 ± 0.349 | 1.218 ± 0.314 | 2.349 ± 0.414 | 1.74 ± 0.418 | 3.219 ± 0.549 | 0.435 ± 0.174 | 1.74 ± 0.402 | 0.0 ± 0.0
Ser | 4.698 ± 1.14 | 0.174 ± 0.121 | 4.611 ± 0.771 | 4.176 ± 0.589 | 2.523 ± 0.447 | 6.612 ± 0.84 | 1.044 ± 0.296 | 4.959 ± 0.755 | 6.612 ± 1.032 | 5.481 ± 0.499 | 1.914 ± 0.261 | 3.741 ± 0.551 | 1.827 ± 0.361 | 3.219 ± 0.574 | 1.914 ± 0.35 | 5.22 ± 0.995 | 4.524 ± 0.599 | 3.828 ± 0.417 | 0.957 ± 0.251 | 2.088 ± 0.376 | 0.0 ± 0.0
Thr | 5.046 ± 0.549 | 0.435 ± 0.223 | 2.784 ± 0.518 | 4.263 ± 0.497 | 2.523 ± 0.45 | 5.568 ± 0.846 | 0.696 ± 0.301 | 4.524 ± 0.452 | 5.133 ± 0.789 | 4.524 ± 0.731 | 0.87 ± 0.226 | 4.089 ± 0.767 | 2.175 ± 0.527 | 3.741 ± 0.464 | 2.349 ± 0.498 | 3.915 ± 0.586 | 4.002 ± 0.64 | 4.437 ± 0.857 | 0.783 ± 0.289 | 2.61 ± 0.465 | 0.0 ± 0.0
Val | 5.046 ± 0.573 | 0.348 ± 0.149 | 4.959 ± 0.771 | 3.219 ± 0.508 | 2.088 ± 0.463 | 3.132 ± 0.568 | 0.783 ± 0.263 | 3.741 ± 0.48 | 4.176 ± 0.462 | 4.35 ± 0.689 | 1.392 ± 0.319 | 3.48 ± 0.537 | 2.088 ± 0.554 | 2.784 ± 0.398 | 2.349 ± 0.481 | 4.176 ± 0.605 | 5.307 ± 0.608 | 4.002 ± 0.722 | 0.87 ± 0.308 | 1.566 ± 0.428 | 0.0 ± 0.0
Trp | 0.87 ± 0.302 | 0.348 ± 0.181 | 0.609 ± 0.208 | 0.435 ± 0.174 | 0.696 ± 0.23 | 0.609 ± 0.292 | 0.174 ± 0.114 | 0.783 ± 0.315 | 1.827 ± 0.535 | 1.392 ± 0.29 | 0.174 ± 0.124 | 1.566 ± 0.507 | 0.087 ± 0.091 | 1.131 ± 0.268 | 0.87 ± 0.307 | 1.392 ± 0.339 | 0.87 ± 0.284 | 0.87 ± 0.211 | 0.261 ± 0.151 | 0.522 ± 0.265 | 0.0 ± 0.0
Tyr | 2.001 ± 0.363 | 0.522 ± 0.217 | 2.349 ± 0.495 | 1.305 ± 0.289 | 2.262 ± 0.569 | 2.262 ± 0.426 | 0.522 ± 0.197 | 1.74 ± 0.441 | 2.436 ± 0.54 | 3.48 ± 0.625 | 0.957 ± 0.213 | 2.349 ± 0.469 | 1.131 ± 0.338 | 2.001 ± 0.325 | 2.001 ± 0.378 | 2.784 ± 0.67 | 2.436 ± 0.485 | 1.914 ± 0.406 | 0.696 ± 0.246 | 1.479 ± 0.352 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 47 proteins (11495 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
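A protein-level bootstrap of this kind can be sketched as below. The page only states that bootstrapping with 100 replicates was done at the protein level; taking the standard deviation of the replicates as the reported error, the function names, the fixed seed, and the toy sequences are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


toy = ["MKKLAKK", "MAAKL", "MKLLKKA", "MSTKK"]  # toy data, not the phage proteome
error = bootstrap_error(toy, "KK")
print(round(dipeptide_per_mille(toy, "KK"), 3), "±", round(error, 3))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.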

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski