Amino acid dipepetide frequency for Lactobacillus phage PLE3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.097AlaAla: 8.097 ± 1.3
0.481AlaCys: 0.481 ± 0.209
7.055AlaAsp: 7.055 ± 1.161
5.211AlaGlu: 5.211 ± 0.723
3.527AlaPhe: 3.527 ± 0.576
5.131AlaGly: 5.131 ± 0.988
1.443AlaHis: 1.443 ± 0.377
4.329AlaIle: 4.329 ± 0.919
5.932AlaLys: 5.932 ± 0.763
7.776AlaLeu: 7.776 ± 1.033
2.646AlaMet: 2.646 ± 0.827
5.211AlaAsn: 5.211 ± 1.008
2.084AlaPro: 2.084 ± 0.417
3.367AlaGln: 3.367 ± 0.668
2.886AlaArg: 2.886 ± 0.577
6.494AlaSer: 6.494 ± 0.961
6.013AlaThr: 6.013 ± 0.757
5.692AlaVal: 5.692 ± 0.811
0.802AlaTrp: 0.802 ± 0.23
2.646AlaTyr: 2.646 ± 0.54
0.0AlaXaa: 0.0 ± 0.0
Cys
0.08CysAla: 0.08 ± 0.085
0.08CysCys: 0.08 ± 0.085
0.241CysAsp: 0.241 ± 0.142
0.561CysGlu: 0.561 ± 0.217
0.16CysPhe: 0.16 ± 0.128
0.321CysGly: 0.321 ± 0.163
0.0CysHis: 0.0 ± 0.0
0.241CysIle: 0.241 ± 0.128
0.481CysLys: 0.481 ± 0.227
0.401CysLeu: 0.401 ± 0.205
0.0CysMet: 0.0 ± 0.0
0.561CysAsn: 0.561 ± 0.293
0.08CysPro: 0.08 ± 0.085
0.241CysGln: 0.241 ± 0.132
0.16CysArg: 0.16 ± 0.122
0.241CysSer: 0.241 ± 0.191
0.16CysThr: 0.16 ± 0.114
0.08CysVal: 0.08 ± 0.07
0.08CysTrp: 0.08 ± 0.084
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.57AspAla: 4.57 ± 0.571
0.321AspCys: 0.321 ± 0.169
4.489AspAsp: 4.489 ± 0.875
4.489AspGlu: 4.489 ± 0.86
2.886AspPhe: 2.886 ± 0.5
6.333AspGly: 6.333 ± 1.528
1.042AspHis: 1.042 ± 0.26
3.928AspIle: 3.928 ± 0.721
4.249AspLys: 4.249 ± 0.502
4.81AspLeu: 4.81 ± 0.53
1.924AspMet: 1.924 ± 0.354
3.046AspAsn: 3.046 ± 0.549
3.046AspPro: 3.046 ± 0.552
3.046AspGln: 3.046 ± 0.463
2.405AspArg: 2.405 ± 0.391
5.371AspSer: 5.371 ± 0.727
3.046AspThr: 3.046 ± 0.657
3.688AspVal: 3.688 ± 0.542
1.042AspTrp: 1.042 ± 0.311
2.886AspTyr: 2.886 ± 0.544
0.0AspXaa: 0.0 ± 0.0
Glu
5.371GluAla: 5.371 ± 0.893
0.16GluCys: 0.16 ± 0.129
3.768GluAsp: 3.768 ± 0.61
2.886GluGlu: 2.886 ± 0.585
2.565GluPhe: 2.565 ± 0.452
2.405GluGly: 2.405 ± 0.437
1.203GluHis: 1.203 ± 0.286
3.287GluIle: 3.287 ± 0.552
3.127GluLys: 3.127 ± 0.673
6.173GluLeu: 6.173 ± 0.828
1.283GluMet: 1.283 ± 0.361
2.565GluAsn: 2.565 ± 0.452
1.844GluPro: 1.844 ± 0.503
2.325GluGln: 2.325 ± 0.464
2.726GluArg: 2.726 ± 0.568
3.608GluSer: 3.608 ± 0.702
3.527GluThr: 3.527 ± 0.532
3.046GluVal: 3.046 ± 0.416
0.722GluTrp: 0.722 ± 0.199
2.165GluTyr: 2.165 ± 0.545
0.0GluXaa: 0.0 ± 0.0
Phe
3.127PheAla: 3.127 ± 0.477
0.08PheCys: 0.08 ± 0.084
2.806PheAsp: 2.806 ± 0.473
1.924PheGlu: 1.924 ± 0.355
1.122PhePhe: 1.122 ± 0.326
2.886PheGly: 2.886 ± 0.386
0.641PheHis: 0.641 ± 0.246
2.165PheIle: 2.165 ± 0.383
2.646PheLys: 2.646 ± 0.39
2.084PheLeu: 2.084 ± 0.507
1.042PheMet: 1.042 ± 0.231
2.245PheAsn: 2.245 ± 0.424
1.203PhePro: 1.203 ± 0.306
1.042PheGln: 1.042 ± 0.314
1.122PheArg: 1.122 ± 0.299
3.207PheSer: 3.207 ± 0.408
3.046PheThr: 3.046 ± 0.648
2.886PheVal: 2.886 ± 0.417
0.882PheTrp: 0.882 ± 0.286
1.363PheTyr: 1.363 ± 0.321
0.0PheXaa: 0.0 ± 0.0
Gly
5.211GlyAla: 5.211 ± 1.082
0.0GlyCys: 0.0 ± 0.0
3.447GlyAsp: 3.447 ± 0.651
3.527GlyGlu: 3.527 ± 0.489
3.046GlyPhe: 3.046 ± 0.497
3.848GlyGly: 3.848 ± 0.517
1.443GlyHis: 1.443 ± 0.474
4.81GlyIle: 4.81 ± 0.833
5.131GlyLys: 5.131 ± 0.81
4.81GlyLeu: 4.81 ± 1.21
1.764GlyMet: 1.764 ± 0.36
3.207GlyAsn: 3.207 ± 0.499
2.004GlyPro: 2.004 ± 0.438
2.245GlyGln: 2.245 ± 0.363
2.004GlyArg: 2.004 ± 0.351
4.65GlySer: 4.65 ± 0.576
4.169GlyThr: 4.169 ± 0.798
5.532GlyVal: 5.532 ± 0.691
0.802GlyTrp: 0.802 ± 0.267
3.046GlyTyr: 3.046 ± 0.595
0.0GlyXaa: 0.0 ± 0.0
His
0.561HisAla: 0.561 ± 0.201
0.0HisCys: 0.0 ± 0.0
1.684HisAsp: 1.684 ± 0.334
0.962HisGlu: 0.962 ± 0.348
0.962HisPhe: 0.962 ± 0.303
1.363HisGly: 1.363 ± 0.324
0.561HisHis: 0.561 ± 0.276
0.802HisIle: 0.802 ± 0.289
1.042HisLys: 1.042 ± 0.315
0.962HisLeu: 0.962 ± 0.279
0.321HisMet: 0.321 ± 0.185
0.802HisAsn: 0.802 ± 0.302
0.481HisPro: 0.481 ± 0.195
0.962HisGln: 0.962 ± 0.335
0.882HisArg: 0.882 ± 0.263
1.443HisSer: 1.443 ± 0.341
1.122HisThr: 1.122 ± 0.334
1.042HisVal: 1.042 ± 0.309
0.241HisTrp: 0.241 ± 0.134
1.122HisTyr: 1.122 ± 0.286
0.0HisXaa: 0.0 ± 0.0
Ile
5.852IleAla: 5.852 ± 0.667
0.401IleCys: 0.401 ± 0.202
4.489IleAsp: 4.489 ± 0.538
3.768IleGlu: 3.768 ± 0.658
1.283IlePhe: 1.283 ± 0.341
4.008IleGly: 4.008 ± 0.644
0.962IleHis: 0.962 ± 0.308
3.447IleIle: 3.447 ± 0.652
4.65IleLys: 4.65 ± 0.539
3.768IleLeu: 3.768 ± 0.446
1.523IleMet: 1.523 ± 0.379
3.848IleAsn: 3.848 ± 0.42
2.405IlePro: 2.405 ± 0.408
2.165IleGln: 2.165 ± 0.551
2.325IleArg: 2.325 ± 0.498
4.73IleSer: 4.73 ± 0.562
4.008IleThr: 4.008 ± 0.603
3.608IleVal: 3.608 ± 0.52
0.641IleTrp: 0.641 ± 0.263
2.325IleTyr: 2.325 ± 0.588
0.0IleXaa: 0.0 ± 0.0
Lys
7.856LysAla: 7.856 ± 0.795
0.241LysCys: 0.241 ± 0.15
4.008LysAsp: 4.008 ± 0.646
3.447LysGlu: 3.447 ± 0.619
1.523LysPhe: 1.523 ± 0.3
4.65LysGly: 4.65 ± 0.961
1.122LysHis: 1.122 ± 0.327
4.008LysIle: 4.008 ± 0.521
4.81LysLys: 4.81 ± 0.878
5.932LysLeu: 5.932 ± 0.924
2.966LysMet: 2.966 ± 0.516
3.367LysAsn: 3.367 ± 0.497
3.768LysPro: 3.768 ± 0.614
4.008LysGln: 4.008 ± 0.746
4.409LysArg: 4.409 ± 0.819
4.73LysSer: 4.73 ± 0.848
3.928LysThr: 3.928 ± 0.597
3.367LysVal: 3.367 ± 0.447
0.882LysTrp: 0.882 ± 0.28
2.565LysTyr: 2.565 ± 0.434
0.0LysXaa: 0.0 ± 0.0
Leu
5.532LeuAla: 5.532 ± 0.609
0.241LeuCys: 0.241 ± 0.176
5.371LeuAsp: 5.371 ± 0.665
3.848LeuGlu: 3.848 ± 0.762
2.806LeuPhe: 2.806 ± 0.497
5.051LeuGly: 5.051 ± 0.701
1.443LeuHis: 1.443 ± 0.368
5.692LeuIle: 5.692 ± 0.876
6.894LeuLys: 6.894 ± 0.915
5.051LeuLeu: 5.051 ± 0.558
1.844LeuMet: 1.844 ± 0.33
5.291LeuAsn: 5.291 ± 0.663
3.287LeuPro: 3.287 ± 0.531
3.207LeuGln: 3.207 ± 0.453
3.447LeuArg: 3.447 ± 0.572
5.532LeuSer: 5.532 ± 0.841
5.371LeuThr: 5.371 ± 0.715
4.73LeuVal: 4.73 ± 0.812
0.882LeuTrp: 0.882 ± 0.336
2.485LeuTyr: 2.485 ± 0.454
0.0LeuXaa: 0.0 ± 0.0
Met
2.886MetAla: 2.886 ± 0.613
0.08MetCys: 0.08 ± 0.08
1.603MetAsp: 1.603 ± 0.328
1.203MetGlu: 1.203 ± 0.253
0.641MetPhe: 0.641 ± 0.233
1.443MetGly: 1.443 ± 0.348
0.641MetHis: 0.641 ± 0.229
1.844MetIle: 1.844 ± 0.472
2.084MetLys: 2.084 ± 0.391
1.844MetLeu: 1.844 ± 0.34
0.561MetMet: 0.561 ± 0.258
1.924MetAsn: 1.924 ± 0.465
1.203MetPro: 1.203 ± 0.319
2.084MetGln: 2.084 ± 0.593
1.203MetArg: 1.203 ± 0.283
1.764MetSer: 1.764 ± 0.489
2.325MetThr: 2.325 ± 0.377
1.042MetVal: 1.042 ± 0.378
0.401MetTrp: 0.401 ± 0.193
0.962MetTyr: 0.962 ± 0.304
0.0MetXaa: 0.0 ± 0.0
Asn
4.97AsnAla: 4.97 ± 0.688
0.08AsnCys: 0.08 ± 0.087
3.688AsnAsp: 3.688 ± 0.654
2.325AsnGlu: 2.325 ± 0.486
2.325AsnPhe: 2.325 ± 0.43
4.73AsnGly: 4.73 ± 0.551
0.641AsnHis: 0.641 ± 0.228
2.806AsnIle: 2.806 ± 0.438
2.485AsnLys: 2.485 ± 0.376
3.848AsnLeu: 3.848 ± 0.564
2.004AsnMet: 2.004 ± 0.478
2.405AsnAsn: 2.405 ± 0.39
2.966AsnPro: 2.966 ± 0.504
2.565AsnGln: 2.565 ± 0.434
2.405AsnArg: 2.405 ± 0.403
3.527AsnSer: 3.527 ± 0.559
2.325AsnThr: 2.325 ± 0.489
2.084AsnVal: 2.084 ± 0.418
1.283AsnTrp: 1.283 ± 0.375
1.924AsnTyr: 1.924 ± 0.436
0.0AsnXaa: 0.0 ± 0.0
Pro
3.367ProAla: 3.367 ± 0.576
0.08ProCys: 0.08 ± 0.069
2.565ProAsp: 2.565 ± 0.481
2.886ProGlu: 2.886 ± 0.547
1.122ProPhe: 1.122 ± 0.292
2.084ProGly: 2.084 ± 0.444
0.321ProHis: 0.321 ± 0.167
2.004ProIle: 2.004 ± 0.429
3.608ProLys: 3.608 ± 0.534
2.726ProLeu: 2.726 ± 0.515
1.042ProMet: 1.042 ± 0.285
1.523ProAsn: 1.523 ± 0.437
0.561ProPro: 0.561 ± 0.256
1.443ProGln: 1.443 ± 0.323
1.042ProArg: 1.042 ± 0.292
4.249ProSer: 4.249 ± 0.628
3.046ProThr: 3.046 ± 0.759
2.245ProVal: 2.245 ± 0.651
0.561ProTrp: 0.561 ± 0.203
1.283ProTyr: 1.283 ± 0.298
0.0ProXaa: 0.0 ± 0.0
Gln
5.131GlnAla: 5.131 ± 0.593
0.321GlnCys: 0.321 ± 0.15
2.084GlnAsp: 2.084 ± 0.438
2.084GlnGlu: 2.084 ± 0.492
1.443GlnPhe: 1.443 ± 0.357
2.726GlnGly: 2.726 ± 0.353
0.962GlnHis: 0.962 ± 0.368
2.325GlnIle: 2.325 ± 0.434
2.966GlnLys: 2.966 ± 0.708
4.489GlnLeu: 4.489 ± 0.532
1.283GlnMet: 1.283 ± 0.343
1.523GlnAsn: 1.523 ± 0.313
1.363GlnPro: 1.363 ± 0.442
2.485GlnGln: 2.485 ± 0.468
2.405GlnArg: 2.405 ± 0.544
3.046GlnSer: 3.046 ± 0.452
3.127GlnThr: 3.127 ± 0.421
2.966GlnVal: 2.966 ± 0.482
1.122GlnTrp: 1.122 ± 0.313
1.844GlnTyr: 1.844 ± 0.298
0.0GlnXaa: 0.0 ± 0.0
Arg
2.325ArgAla: 2.325 ± 0.528
0.481ArgCys: 0.481 ± 0.21
1.283ArgAsp: 1.283 ± 0.317
3.127ArgGlu: 3.127 ± 0.621
1.844ArgPhe: 1.844 ± 0.417
2.084ArgGly: 2.084 ± 0.416
0.401ArgHis: 0.401 ± 0.204
2.886ArgIle: 2.886 ± 0.666
3.527ArgLys: 3.527 ± 0.62
3.608ArgLeu: 3.608 ± 0.639
1.603ArgMet: 1.603 ± 0.47
1.924ArgAsn: 1.924 ± 0.415
1.363ArgPro: 1.363 ± 0.364
2.405ArgGln: 2.405 ± 0.777
1.844ArgArg: 1.844 ± 0.455
2.245ArgSer: 2.245 ± 0.438
2.565ArgThr: 2.565 ± 0.481
2.726ArgVal: 2.726 ± 0.411
0.561ArgTrp: 0.561 ± 0.267
2.245ArgTyr: 2.245 ± 0.551
0.0ArgXaa: 0.0 ± 0.0
Ser
5.291SerAla: 5.291 ± 0.901
0.241SerCys: 0.241 ± 0.15
5.131SerAsp: 5.131 ± 0.785
4.008SerGlu: 4.008 ± 0.979
3.928SerPhe: 3.928 ± 0.5
5.131SerGly: 5.131 ± 0.994
1.122SerHis: 1.122 ± 0.309
4.089SerIle: 4.089 ± 0.669
6.013SerLys: 6.013 ± 0.718
6.253SerLeu: 6.253 ± 0.889
2.004SerMet: 2.004 ± 0.315
3.367SerAsn: 3.367 ± 0.576
2.245SerPro: 2.245 ± 0.462
3.046SerGln: 3.046 ± 0.661
2.485SerArg: 2.485 ± 0.712
6.494SerSer: 6.494 ± 1.115
4.97SerThr: 4.97 ± 0.683
4.329SerVal: 4.329 ± 0.501
0.722SerTrp: 0.722 ± 0.25
3.367SerTyr: 3.367 ± 0.652
0.0SerXaa: 0.0 ± 0.0
Thr
7.295ThrAla: 7.295 ± 1.1
0.08ThrCys: 0.08 ± 0.077
4.97ThrAsp: 4.97 ± 0.793
2.646ThrGlu: 2.646 ± 0.392
2.165ThrPhe: 2.165 ± 0.394
3.848ThrGly: 3.848 ± 0.496
1.363ThrHis: 1.363 ± 0.447
4.169ThrIle: 4.169 ± 0.565
4.169ThrLys: 4.169 ± 0.581
5.131ThrLeu: 5.131 ± 0.701
1.283ThrMet: 1.283 ± 0.371
2.565ThrAsn: 2.565 ± 0.518
3.608ThrPro: 3.608 ± 0.609
2.485ThrGln: 2.485 ± 0.369
3.207ThrArg: 3.207 ± 0.542
4.57ThrSer: 4.57 ± 0.826
7.856ThrThr: 7.856 ± 3.759
5.051ThrVal: 5.051 ± 0.716
0.241ThrTrp: 0.241 ± 0.129
2.245ThrTyr: 2.245 ± 0.492
0.0ThrXaa: 0.0 ± 0.0
Val
5.051ValAla: 5.051 ± 0.76
0.401ValCys: 0.401 ± 0.185
4.81ValAsp: 4.81 ± 0.44
3.207ValGlu: 3.207 ± 0.583
1.283ValPhe: 1.283 ± 0.375
4.65ValGly: 4.65 ± 0.501
0.641ValHis: 0.641 ± 0.178
3.127ValIle: 3.127 ± 0.535
3.928ValLys: 3.928 ± 0.674
4.57ValLeu: 4.57 ± 0.57
1.603ValMet: 1.603 ± 0.306
3.046ValAsn: 3.046 ± 0.458
2.726ValPro: 2.726 ± 0.46
3.046ValGln: 3.046 ± 0.594
2.245ValArg: 2.245 ± 0.387
4.65ValSer: 4.65 ± 0.675
4.329ValThr: 4.329 ± 0.685
2.726ValVal: 2.726 ± 0.423
0.882ValTrp: 0.882 ± 0.281
2.004ValTyr: 2.004 ± 0.455
0.0ValXaa: 0.0 ± 0.0
Trp
1.122TrpAla: 1.122 ± 0.362
0.08TrpCys: 0.08 ± 0.085
0.882TrpAsp: 0.882 ± 0.246
0.722TrpGlu: 0.722 ± 0.284
0.561TrpPhe: 0.561 ± 0.211
0.481TrpGly: 0.481 ± 0.189
0.641TrpHis: 0.641 ± 0.243
1.443TrpIle: 1.443 ± 0.305
1.203TrpLys: 1.203 ± 0.328
1.203TrpLeu: 1.203 ± 0.296
0.16TrpMet: 0.16 ± 0.125
0.722TrpAsn: 0.722 ± 0.194
0.08TrpPro: 0.08 ± 0.093
0.802TrpGln: 0.802 ± 0.208
0.481TrpArg: 0.481 ± 0.241
1.042TrpSer: 1.042 ± 0.314
1.122TrpThr: 1.122 ± 0.274
0.481TrpVal: 0.481 ± 0.226
0.241TrpTrp: 0.241 ± 0.135
0.561TrpTyr: 0.561 ± 0.251
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.207TyrAla: 3.207 ± 0.477
0.321TyrCys: 0.321 ± 0.173
2.245TyrAsp: 2.245 ± 0.407
1.924TyrGlu: 1.924 ± 0.388
2.325TyrPhe: 2.325 ± 0.46
1.523TyrGly: 1.523 ± 0.496
0.722TyrHis: 0.722 ± 0.257
2.806TyrIle: 2.806 ± 0.525
2.646TyrLys: 2.646 ± 0.511
2.565TyrLeu: 2.565 ± 0.488
0.802TyrMet: 0.802 ± 0.253
2.245TyrAsn: 2.245 ± 0.465
1.443TyrPro: 1.443 ± 0.319
2.726TyrGln: 2.726 ± 0.525
1.363TyrArg: 1.363 ± 0.315
2.646TyrSer: 2.646 ± 0.609
2.886TyrThr: 2.886 ± 0.597
1.603TyrVal: 1.603 ± 0.375
1.042TyrTrp: 1.042 ± 0.291
1.523TyrTyr: 1.523 ± 0.384
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (12475 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski