Amino acid dipepetide frequency for Lactobacillus phage LF1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.49AlaAla: 5.49 ± 1.543
0.314AlaCys: 0.314 ± 0.123
5.882AlaAsp: 5.882 ± 0.728
4.47AlaGlu: 4.47 ± 0.609
2.588AlaPhe: 2.588 ± 0.395
5.019AlaGly: 5.019 ± 0.87
0.863AlaHis: 0.863 ± 0.278
6.274AlaIle: 6.274 ± 0.803
7.294AlaLys: 7.294 ± 0.754
5.647AlaLeu: 5.647 ± 0.749
1.882AlaMet: 1.882 ± 0.331
5.254AlaAsn: 5.254 ± 0.868
1.412AlaPro: 1.412 ± 0.267
5.176AlaGln: 5.176 ± 1.005
2.745AlaArg: 2.745 ± 0.472
6.509AlaSer: 6.509 ± 1.112
6.274AlaThr: 6.274 ± 1.03
4.706AlaVal: 4.706 ± 0.671
1.02AlaTrp: 1.02 ± 0.279
2.51AlaTyr: 2.51 ± 0.434
0.0AlaXaa: 0.0 ± 0.0
Cys
0.392CysAla: 0.392 ± 0.184
0.0CysCys: 0.0 ± 0.0
0.157CysAsp: 0.157 ± 0.102
0.549CysGlu: 0.549 ± 0.219
0.235CysPhe: 0.235 ± 0.172
0.471CysGly: 0.471 ± 0.221
0.314CysHis: 0.314 ± 0.166
0.549CysIle: 0.549 ± 0.267
0.471CysLys: 0.471 ± 0.22
0.392CysLeu: 0.392 ± 0.184
0.078CysMet: 0.078 ± 0.09
0.078CysAsn: 0.078 ± 0.073
0.157CysPro: 0.157 ± 0.11
0.314CysGln: 0.314 ± 0.147
0.314CysArg: 0.314 ± 0.155
0.471CysSer: 0.471 ± 0.196
0.471CysThr: 0.471 ± 0.206
0.235CysVal: 0.235 ± 0.139
0.078CysTrp: 0.078 ± 0.086
0.392CysTyr: 0.392 ± 0.238
0.0CysXaa: 0.0 ± 0.0
Asp
4.784AspAla: 4.784 ± 0.623
0.549AspCys: 0.549 ± 0.243
5.803AspAsp: 5.803 ± 0.99
4.392AspGlu: 4.392 ± 0.716
3.137AspPhe: 3.137 ± 0.447
4.862AspGly: 4.862 ± 0.753
1.255AspHis: 1.255 ± 0.382
2.98AspIle: 2.98 ± 0.587
4.862AspLys: 4.862 ± 0.734
5.568AspLeu: 5.568 ± 0.732
1.49AspMet: 1.49 ± 0.355
3.137AspAsn: 3.137 ± 0.508
2.431AspPro: 2.431 ± 0.46
2.823AspGln: 2.823 ± 0.487
2.431AspArg: 2.431 ± 0.546
3.529AspSer: 3.529 ± 0.465
5.254AspThr: 5.254 ± 0.846
4.078AspVal: 4.078 ± 0.725
1.412AspTrp: 1.412 ± 0.352
3.451AspTyr: 3.451 ± 0.516
0.0AspXaa: 0.0 ± 0.0
Glu
4.706GluAla: 4.706 ± 0.617
0.078GluCys: 0.078 ± 0.07
3.843GluAsp: 3.843 ± 0.562
2.666GluGlu: 2.666 ± 0.486
2.431GluPhe: 2.431 ± 0.489
2.353GluGly: 2.353 ± 0.405
1.02GluHis: 1.02 ± 0.326
4.0GluIle: 4.0 ± 0.632
4.784GluLys: 4.784 ± 0.698
4.784GluLeu: 4.784 ± 0.614
1.725GluMet: 1.725 ± 0.371
2.98GluAsn: 2.98 ± 0.655
2.039GluPro: 2.039 ± 0.449
2.51GluGln: 2.51 ± 0.456
3.137GluArg: 3.137 ± 0.543
4.313GluSer: 4.313 ± 0.658
2.588GluThr: 2.588 ± 0.462
3.843GluVal: 3.843 ± 0.536
1.098GluTrp: 1.098 ± 0.381
2.666GluTyr: 2.666 ± 0.515
0.0GluXaa: 0.0 ± 0.0
Phe
2.039PheAla: 2.039 ± 0.41
0.549PheCys: 0.549 ± 0.21
1.882PheAsp: 1.882 ± 0.397
3.059PheGlu: 3.059 ± 0.553
0.784PhePhe: 0.784 ± 0.241
2.666PheGly: 2.666 ± 0.67
0.549PheHis: 0.549 ± 0.233
2.117PheIle: 2.117 ± 0.487
2.588PheLys: 2.588 ± 0.504
2.588PheLeu: 2.588 ± 0.556
0.863PheMet: 0.863 ± 0.266
2.353PheAsn: 2.353 ± 0.436
1.412PhePro: 1.412 ± 0.302
0.863PheGln: 0.863 ± 0.276
0.941PheArg: 0.941 ± 0.3
2.274PheSer: 2.274 ± 0.435
3.372PheThr: 3.372 ± 0.514
2.039PheVal: 2.039 ± 0.288
0.392PheTrp: 0.392 ± 0.183
0.706PheTyr: 0.706 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
3.059GlyAla: 3.059 ± 0.672
0.549GlyCys: 0.549 ± 0.216
3.843GlyAsp: 3.843 ± 0.692
2.745GlyGlu: 2.745 ± 0.416
2.039GlyPhe: 2.039 ± 0.469
3.529GlyGly: 3.529 ± 0.535
1.49GlyHis: 1.49 ± 0.363
3.764GlyIle: 3.764 ± 0.617
6.117GlyLys: 6.117 ± 0.812
6.039GlyLeu: 6.039 ± 0.634
1.882GlyMet: 1.882 ± 0.537
4.078GlyAsn: 4.078 ± 0.542
1.02GlyPro: 1.02 ± 0.325
3.451GlyGln: 3.451 ± 0.477
2.431GlyArg: 2.431 ± 0.563
4.078GlySer: 4.078 ± 0.588
4.862GlyThr: 4.862 ± 0.628
4.47GlyVal: 4.47 ± 0.778
0.941GlyTrp: 0.941 ± 0.377
2.274GlyTyr: 2.274 ± 0.477
0.0GlyXaa: 0.0 ± 0.0
His
1.02HisAla: 1.02 ± 0.237
0.078HisCys: 0.078 ± 0.08
0.941HisAsp: 0.941 ± 0.318
0.784HisGlu: 0.784 ± 0.308
0.627HisPhe: 0.627 ± 0.282
0.627HisGly: 0.627 ± 0.197
0.235HisHis: 0.235 ± 0.162
0.863HisIle: 0.863 ± 0.294
0.784HisLys: 0.784 ± 0.245
1.333HisLeu: 1.333 ± 0.334
0.549HisMet: 0.549 ± 0.183
0.863HisAsn: 0.863 ± 0.212
1.255HisPro: 1.255 ± 0.429
0.549HisGln: 0.549 ± 0.177
0.941HisArg: 0.941 ± 0.308
1.02HisSer: 1.02 ± 0.45
0.784HisThr: 0.784 ± 0.26
0.706HisVal: 0.706 ± 0.29
0.314HisTrp: 0.314 ± 0.151
0.627HisTyr: 0.627 ± 0.307
0.0HisXaa: 0.0 ± 0.0
Ile
4.313IleAla: 4.313 ± 0.624
0.471IleCys: 0.471 ± 0.264
4.862IleAsp: 4.862 ± 0.844
3.529IleGlu: 3.529 ± 0.734
1.725IlePhe: 1.725 ± 0.405
3.372IleGly: 3.372 ± 0.411
0.784IleHis: 0.784 ± 0.314
3.215IleIle: 3.215 ± 0.648
5.49IleLys: 5.49 ± 0.639
4.862IleLeu: 4.862 ± 0.724
1.255IleMet: 1.255 ± 0.319
4.862IleAsn: 4.862 ± 0.79
1.961IlePro: 1.961 ± 0.538
2.902IleGln: 2.902 ± 0.491
2.431IleArg: 2.431 ± 0.388
3.764IleSer: 3.764 ± 0.711
3.686IleThr: 3.686 ± 0.552
2.902IleVal: 2.902 ± 0.418
0.627IleTrp: 0.627 ± 0.186
2.274IleTyr: 2.274 ± 0.572
0.0IleXaa: 0.0 ± 0.0
Lys
7.764LysAla: 7.764 ± 1.359
0.627LysCys: 0.627 ± 0.233
4.078LysAsp: 4.078 ± 0.708
5.333LysGlu: 5.333 ± 0.795
2.117LysPhe: 2.117 ± 0.418
4.627LysGly: 4.627 ± 0.569
1.333LysHis: 1.333 ± 0.308
4.313LysIle: 4.313 ± 0.626
5.411LysLys: 5.411 ± 0.911
6.274LysLeu: 6.274 ± 0.853
2.51LysMet: 2.51 ± 0.5
4.313LysAsn: 4.313 ± 0.534
1.647LysPro: 1.647 ± 0.335
4.0LysGln: 4.0 ± 0.636
3.529LysArg: 3.529 ± 0.619
4.627LysSer: 4.627 ± 0.547
5.254LysThr: 5.254 ± 0.756
4.627LysVal: 4.627 ± 0.457
1.49LysTrp: 1.49 ± 0.331
2.431LysTyr: 2.431 ± 0.419
0.0LysXaa: 0.0 ± 0.0
Leu
7.764LeuAla: 7.764 ± 0.673
0.392LeuCys: 0.392 ± 0.229
6.901LeuAsp: 6.901 ± 0.967
3.843LeuGlu: 3.843 ± 0.612
1.961LeuPhe: 1.961 ± 0.386
4.235LeuGly: 4.235 ± 0.526
1.255LeuHis: 1.255 ± 0.346
4.784LeuIle: 4.784 ± 0.714
6.745LeuLys: 6.745 ± 0.622
6.431LeuLeu: 6.431 ± 1.012
1.961LeuMet: 1.961 ± 0.326
5.019LeuAsn: 5.019 ± 0.52
2.745LeuPro: 2.745 ± 0.484
3.372LeuGln: 3.372 ± 0.452
3.529LeuArg: 3.529 ± 0.592
6.431LeuSer: 6.431 ± 0.71
6.274LeuThr: 6.274 ± 0.744
4.627LeuVal: 4.627 ± 0.636
0.627LeuTrp: 0.627 ± 0.239
2.588LeuTyr: 2.588 ± 0.529
0.0LeuXaa: 0.0 ± 0.0
Met
1.961MetAla: 1.961 ± 0.317
0.078MetCys: 0.078 ± 0.083
2.117MetAsp: 2.117 ± 0.372
1.098MetGlu: 1.098 ± 0.25
1.02MetPhe: 1.02 ± 0.227
1.255MetGly: 1.255 ± 0.423
0.157MetHis: 0.157 ± 0.117
1.961MetIle: 1.961 ± 0.386
1.804MetLys: 1.804 ± 0.389
1.49MetLeu: 1.49 ± 0.336
0.314MetMet: 0.314 ± 0.155
1.882MetAsn: 1.882 ± 0.485
0.784MetPro: 0.784 ± 0.235
1.725MetGln: 1.725 ± 0.333
1.412MetArg: 1.412 ± 0.36
1.647MetSer: 1.647 ± 0.538
1.882MetThr: 1.882 ± 0.375
1.333MetVal: 1.333 ± 0.28
0.157MetTrp: 0.157 ± 0.151
0.863MetTyr: 0.863 ± 0.267
0.0MetXaa: 0.0 ± 0.0
Asn
4.941AsnAla: 4.941 ± 0.717
0.314AsnCys: 0.314 ± 0.15
3.294AsnAsp: 3.294 ± 0.514
4.47AsnGlu: 4.47 ± 0.61
1.725AsnPhe: 1.725 ± 0.334
5.098AsnGly: 5.098 ± 0.635
0.549AsnHis: 0.549 ± 0.244
2.902AsnIle: 2.902 ± 0.465
4.235AsnLys: 4.235 ± 0.731
4.392AsnLeu: 4.392 ± 0.549
1.02AsnMet: 1.02 ± 0.347
3.372AsnAsn: 3.372 ± 0.517
1.569AsnPro: 1.569 ± 0.417
3.294AsnGln: 3.294 ± 0.484
2.902AsnArg: 2.902 ± 0.452
6.117AsnSer: 6.117 ± 0.869
3.294AsnThr: 3.294 ± 0.637
3.137AsnVal: 3.137 ± 0.447
1.176AsnTrp: 1.176 ± 0.291
2.902AsnTyr: 2.902 ± 0.584
0.0AsnXaa: 0.0 ± 0.0
Pro
2.431ProAla: 2.431 ± 0.392
0.078ProCys: 0.078 ± 0.08
2.666ProAsp: 2.666 ± 0.493
2.196ProGlu: 2.196 ± 0.513
1.02ProPhe: 1.02 ± 0.353
1.569ProGly: 1.569 ± 0.4
0.314ProHis: 0.314 ± 0.212
1.804ProIle: 1.804 ± 0.395
1.647ProLys: 1.647 ± 0.328
1.882ProLeu: 1.882 ± 0.421
0.863ProMet: 0.863 ± 0.257
1.882ProAsn: 1.882 ± 0.491
0.627ProPro: 0.627 ± 0.234
1.333ProGln: 1.333 ± 0.404
1.02ProArg: 1.02 ± 0.29
1.882ProSer: 1.882 ± 0.428
2.745ProThr: 2.745 ± 0.552
1.569ProVal: 1.569 ± 0.414
0.157ProTrp: 0.157 ± 0.119
1.02ProTyr: 1.02 ± 0.274
0.0ProXaa: 0.0 ± 0.0
Gln
5.411GlnAla: 5.411 ± 0.828
0.392GlnCys: 0.392 ± 0.162
2.745GlnAsp: 2.745 ± 0.44
3.059GlnGlu: 3.059 ± 0.53
2.274GlnPhe: 2.274 ± 0.466
2.353GlnGly: 2.353 ± 0.592
0.627GlnHis: 0.627 ± 0.216
2.745GlnIle: 2.745 ± 0.469
3.215GlnLys: 3.215 ± 0.498
4.313GlnLeu: 4.313 ± 0.574
1.647GlnMet: 1.647 ± 0.453
2.117GlnAsn: 2.117 ± 0.464
0.784GlnPro: 0.784 ± 0.282
2.353GlnGln: 2.353 ± 0.672
2.117GlnArg: 2.117 ± 0.366
4.392GlnSer: 4.392 ± 0.942
3.843GlnThr: 3.843 ± 0.727
2.196GlnVal: 2.196 ± 0.463
0.549GlnTrp: 0.549 ± 0.221
1.569GlnTyr: 1.569 ± 0.477
0.0GlnXaa: 0.0 ± 0.0
Arg
2.823ArgAla: 2.823 ± 0.462
0.235ArgCys: 0.235 ± 0.134
2.823ArgAsp: 2.823 ± 0.374
2.745ArgGlu: 2.745 ± 0.531
1.647ArgPhe: 1.647 ± 0.414
2.51ArgGly: 2.51 ± 0.404
0.863ArgHis: 0.863 ± 0.407
2.117ArgIle: 2.117 ± 0.42
3.059ArgLys: 3.059 ± 0.444
4.313ArgLeu: 4.313 ± 0.642
1.176ArgMet: 1.176 ± 0.315
2.196ArgAsn: 2.196 ± 0.423
0.941ArgPro: 0.941 ± 0.36
1.804ArgGln: 1.804 ± 0.407
2.431ArgArg: 2.431 ± 0.406
3.294ArgSer: 3.294 ± 0.495
2.196ArgThr: 2.196 ± 0.365
2.274ArgVal: 2.274 ± 0.374
0.392ArgTrp: 0.392 ± 0.181
2.196ArgTyr: 2.196 ± 0.551
0.0ArgXaa: 0.0 ± 0.0
Ser
5.882SerAla: 5.882 ± 1.194
0.235SerCys: 0.235 ± 0.139
5.019SerAsp: 5.019 ± 0.839
3.451SerGlu: 3.451 ± 0.632
3.529SerPhe: 3.529 ± 0.559
5.803SerGly: 5.803 ± 0.82
0.706SerHis: 0.706 ± 0.221
4.549SerIle: 4.549 ± 0.615
4.627SerLys: 4.627 ± 0.712
6.117SerLeu: 6.117 ± 0.824
1.882SerMet: 1.882 ± 0.482
4.392SerAsn: 4.392 ± 0.505
1.961SerPro: 1.961 ± 0.392
4.0SerGln: 4.0 ± 0.652
3.137SerArg: 3.137 ± 0.542
5.019SerSer: 5.019 ± 0.785
5.803SerThr: 5.803 ± 0.645
3.843SerVal: 3.843 ± 0.627
0.941SerTrp: 0.941 ± 0.245
2.431SerTyr: 2.431 ± 0.429
0.0SerXaa: 0.0 ± 0.0
Thr
6.98ThrAla: 6.98 ± 1.007
0.078ThrCys: 0.078 ± 0.087
3.843ThrAsp: 3.843 ± 0.613
3.294ThrGlu: 3.294 ± 0.494
1.725ThrPhe: 1.725 ± 0.411
5.019ThrGly: 5.019 ± 0.556
0.784ThrHis: 0.784 ± 0.217
4.0ThrIle: 4.0 ± 0.523
5.882ThrLys: 5.882 ± 0.678
6.117ThrLeu: 6.117 ± 0.892
1.725ThrMet: 1.725 ± 0.329
3.921ThrAsn: 3.921 ± 0.8
3.137ThrPro: 3.137 ± 0.493
3.059ThrGln: 3.059 ± 0.74
2.039ThrArg: 2.039 ± 0.434
5.49ThrSer: 5.49 ± 0.687
5.333ThrThr: 5.333 ± 0.872
5.254ThrVal: 5.254 ± 0.818
0.863ThrTrp: 0.863 ± 0.26
2.51ThrTyr: 2.51 ± 0.696
0.0ThrXaa: 0.0 ± 0.0
Val
5.803ValAla: 5.803 ± 0.785
0.549ValCys: 0.549 ± 0.223
4.392ValAsp: 4.392 ± 0.801
3.608ValGlu: 3.608 ± 0.445
1.333ValPhe: 1.333 ± 0.277
3.921ValGly: 3.921 ± 0.577
1.098ValHis: 1.098 ± 0.332
3.215ValIle: 3.215 ± 0.613
4.392ValLys: 4.392 ± 0.592
4.392ValLeu: 4.392 ± 0.598
1.176ValMet: 1.176 ± 0.378
4.47ValAsn: 4.47 ± 0.492
1.098ValPro: 1.098 ± 0.358
1.804ValGln: 1.804 ± 0.32
2.117ValArg: 2.117 ± 0.414
4.784ValSer: 4.784 ± 0.79
4.235ValThr: 4.235 ± 0.684
3.294ValVal: 3.294 ± 0.527
0.941ValTrp: 0.941 ± 0.253
1.647ValTyr: 1.647 ± 0.427
0.0ValXaa: 0.0 ± 0.0
Trp
0.549TrpAla: 0.549 ± 0.202
0.078TrpCys: 0.078 ± 0.07
0.941TrpAsp: 0.941 ± 0.265
0.784TrpGlu: 0.784 ± 0.229
0.314TrpPhe: 0.314 ± 0.166
1.176TrpGly: 1.176 ± 0.3
0.157TrpHis: 0.157 ± 0.119
0.941TrpIle: 0.941 ± 0.282
0.941TrpLys: 0.941 ± 0.267
1.333TrpLeu: 1.333 ± 0.384
0.157TrpMet: 0.157 ± 0.109
1.333TrpAsn: 1.333 ± 0.634
0.392TrpPro: 0.392 ± 0.158
0.706TrpGln: 0.706 ± 0.257
0.784TrpArg: 0.784 ± 0.276
1.098TrpSer: 1.098 ± 0.276
0.706TrpThr: 0.706 ± 0.213
0.863TrpVal: 0.863 ± 0.24
0.157TrpTrp: 0.157 ± 0.093
0.314TrpTyr: 0.314 ± 0.158
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.215TyrAla: 3.215 ± 0.57
0.471TyrCys: 0.471 ± 0.206
2.196TyrAsp: 2.196 ± 0.477
1.49TyrGlu: 1.49 ± 0.379
1.804TyrPhe: 1.804 ± 0.508
2.196TyrGly: 2.196 ± 0.59
0.549TyrHis: 0.549 ± 0.24
2.196TyrIle: 2.196 ± 0.421
1.961TyrLys: 1.961 ± 0.466
3.215TyrLeu: 3.215 ± 0.587
0.706TyrMet: 0.706 ± 0.209
2.196TyrAsn: 2.196 ± 0.46
1.333TyrPro: 1.333 ± 0.489
2.666TyrGln: 2.666 ± 0.473
1.725TyrArg: 1.725 ± 0.333
2.588TyrSer: 2.588 ± 0.409
2.196TyrThr: 2.196 ± 0.475
2.274TyrVal: 2.274 ± 0.344
0.392TyrTrp: 0.392 ± 0.167
1.255TyrTyr: 1.255 ± 0.342
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (12752 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski