Amino acid dipepetide frequency for Listeria phage LWP01

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.005AlaAla: 6.005 ± 0.971
0.468AlaCys: 0.468 ± 0.188
6.005AlaAsp: 6.005 ± 0.695
6.629AlaGlu: 6.629 ± 0.713
3.509AlaPhe: 3.509 ± 0.681
4.913AlaGly: 4.913 ± 0.837
0.936AlaHis: 0.936 ± 0.341
6.005AlaIle: 6.005 ± 0.777
7.253AlaLys: 7.253 ± 0.834
7.721AlaLeu: 7.721 ± 0.668
2.574AlaMet: 2.574 ± 0.366
3.509AlaAsn: 3.509 ± 0.513
2.34AlaPro: 2.34 ± 0.453
2.885AlaGln: 2.885 ± 0.557
4.133AlaArg: 4.133 ± 0.421
3.899AlaSer: 3.899 ± 0.578
4.991AlaThr: 4.991 ± 0.698
5.459AlaVal: 5.459 ± 0.564
1.014AlaTrp: 1.014 ± 0.267
2.651AlaTyr: 2.651 ± 0.415
0.0AlaXaa: 0.0 ± 0.0
Cys
0.39CysAla: 0.39 ± 0.198
0.078CysCys: 0.078 ± 0.067
0.312CysAsp: 0.312 ± 0.167
0.468CysGlu: 0.468 ± 0.189
0.312CysPhe: 0.312 ± 0.155
0.468CysGly: 0.468 ± 0.175
0.078CysHis: 0.078 ± 0.08
0.858CysIle: 0.858 ± 0.233
0.624CysLys: 0.624 ± 0.281
0.546CysLeu: 0.546 ± 0.268
0.156CysMet: 0.156 ± 0.102
0.234CysAsn: 0.234 ± 0.136
0.312CysPro: 0.312 ± 0.196
0.156CysGln: 0.156 ± 0.106
0.312CysArg: 0.312 ± 0.131
0.0CysSer: 0.0 ± 0.0
0.702CysThr: 0.702 ± 0.234
0.312CysVal: 0.312 ± 0.142
0.078CysTrp: 0.078 ± 0.084
0.312CysTyr: 0.312 ± 0.147
0.0CysXaa: 0.0 ± 0.0
Asp
5.069AspAla: 5.069 ± 0.605
0.312AspCys: 0.312 ± 0.141
4.367AspAsp: 4.367 ± 0.89
5.615AspGlu: 5.615 ± 0.772
3.353AspPhe: 3.353 ± 0.545
4.835AspGly: 4.835 ± 0.605
0.624AspHis: 0.624 ± 0.238
5.381AspIle: 5.381 ± 0.579
4.757AspLys: 4.757 ± 0.616
6.317AspLeu: 6.317 ± 0.778
1.716AspMet: 1.716 ± 0.326
2.885AspAsn: 2.885 ± 0.459
1.638AspPro: 1.638 ± 0.437
1.56AspGln: 1.56 ± 0.322
2.729AspArg: 2.729 ± 0.465
2.885AspSer: 2.885 ± 0.484
3.431AspThr: 3.431 ± 0.645
4.679AspVal: 4.679 ± 0.592
1.092AspTrp: 1.092 ± 0.249
2.574AspTyr: 2.574 ± 0.457
0.0AspXaa: 0.0 ± 0.0
Glu
6.551GluAla: 6.551 ± 0.846
0.312GluCys: 0.312 ± 0.146
5.537GluAsp: 5.537 ± 1.01
6.863GluGlu: 6.863 ± 0.727
3.353GluPhe: 3.353 ± 0.492
5.459GluGly: 5.459 ± 0.756
1.092GluHis: 1.092 ± 0.314
4.211GluIle: 4.211 ± 0.513
6.707GluLys: 6.707 ± 0.704
9.826GluLeu: 9.826 ± 0.767
2.34GluMet: 2.34 ± 0.379
4.367GluAsn: 4.367 ± 0.5
1.716GluPro: 1.716 ± 0.387
3.275GluGln: 3.275 ± 0.609
4.055GluArg: 4.055 ± 0.612
3.977GluSer: 3.977 ± 0.513
4.211GluThr: 4.211 ± 0.584
5.849GluVal: 5.849 ± 0.657
0.78GluTrp: 0.78 ± 0.291
2.729GluTyr: 2.729 ± 0.37
0.0GluXaa: 0.0 ± 0.0
Phe
3.197PheAla: 3.197 ± 0.775
0.624PheCys: 0.624 ± 0.209
2.885PheAsp: 2.885 ± 0.601
3.821PheGlu: 3.821 ± 0.795
1.482PhePhe: 1.482 ± 0.348
2.963PheGly: 2.963 ± 0.434
0.936PheHis: 0.936 ± 0.235
1.872PheIle: 1.872 ± 0.418
3.119PheLys: 3.119 ± 0.639
3.197PheLeu: 3.197 ± 0.579
0.78PheMet: 0.78 ± 0.231
1.404PheAsn: 1.404 ± 0.36
1.404PhePro: 1.404 ± 0.297
1.014PheGln: 1.014 ± 0.238
1.404PheArg: 1.404 ± 0.319
3.275PheSer: 3.275 ± 0.628
1.482PheThr: 1.482 ± 0.346
1.482PheVal: 1.482 ± 0.373
0.546PheTrp: 0.546 ± 0.225
1.872PheTyr: 1.872 ± 0.362
0.0PheXaa: 0.0 ± 0.0
Gly
5.381GlyAla: 5.381 ± 0.74
0.312GlyCys: 0.312 ± 0.143
3.119GlyAsp: 3.119 ± 0.497
5.225GlyGlu: 5.225 ± 0.554
2.34GlyPhe: 2.34 ± 0.381
4.055GlyGly: 4.055 ± 0.441
1.404GlyHis: 1.404 ± 0.396
3.743GlyIle: 3.743 ± 0.589
6.707GlyLys: 6.707 ± 0.655
6.005GlyLeu: 6.005 ± 0.733
2.496GlyMet: 2.496 ± 0.505
2.651GlyAsn: 2.651 ± 0.458
0.468GlyPro: 0.468 ± 0.16
2.028GlyGln: 2.028 ± 0.46
2.496GlyArg: 2.496 ± 0.553
3.899GlySer: 3.899 ± 0.644
5.225GlyThr: 5.225 ± 0.794
4.601GlyVal: 4.601 ± 0.772
0.78GlyTrp: 0.78 ± 0.292
2.418GlyTyr: 2.418 ± 0.404
0.0GlyXaa: 0.0 ± 0.0
His
1.326HisAla: 1.326 ± 0.346
0.0HisCys: 0.0 ± 0.0
1.248HisAsp: 1.248 ± 0.356
0.546HisGlu: 0.546 ± 0.221
0.936HisPhe: 0.936 ± 0.275
1.482HisGly: 1.482 ± 0.358
0.624HisHis: 0.624 ± 0.22
1.014HisIle: 1.014 ± 0.286
0.78HisLys: 0.78 ± 0.212
0.78HisLeu: 0.78 ± 0.273
0.546HisMet: 0.546 ± 0.226
0.312HisAsn: 0.312 ± 0.173
0.936HisPro: 0.936 ± 0.261
0.702HisGln: 0.702 ± 0.222
0.468HisArg: 0.468 ± 0.187
0.546HisSer: 0.546 ± 0.171
1.092HisThr: 1.092 ± 0.282
1.092HisVal: 1.092 ± 0.318
0.312HisTrp: 0.312 ± 0.148
0.468HisTyr: 0.468 ± 0.206
0.0HisXaa: 0.0 ± 0.0
Ile
4.991IleAla: 4.991 ± 0.743
0.702IleCys: 0.702 ± 0.284
5.069IleAsp: 5.069 ± 0.598
5.849IleGlu: 5.849 ± 0.806
1.716IlePhe: 1.716 ± 0.433
2.807IleGly: 2.807 ± 0.43
1.326IleHis: 1.326 ± 0.414
2.963IleIle: 2.963 ± 0.648
5.381IleLys: 5.381 ± 0.739
3.977IleLeu: 3.977 ± 0.491
0.858IleMet: 0.858 ± 0.313
2.729IleAsn: 2.729 ± 0.439
2.807IlePro: 2.807 ± 0.461
1.794IleGln: 1.794 ± 0.416
2.651IleArg: 2.651 ± 0.449
3.977IleSer: 3.977 ± 0.521
3.587IleThr: 3.587 ± 0.583
4.679IleVal: 4.679 ± 0.686
0.546IleTrp: 0.546 ± 0.219
2.496IleTyr: 2.496 ± 0.452
0.0IleXaa: 0.0 ± 0.0
Lys
8.032LysAla: 8.032 ± 0.77
0.39LysCys: 0.39 ± 0.137
4.991LysAsp: 4.991 ± 0.478
6.785LysGlu: 6.785 ± 0.894
2.418LysPhe: 2.418 ± 0.487
5.225LysGly: 5.225 ± 0.85
1.482LysHis: 1.482 ± 0.377
4.757LysIle: 4.757 ± 0.59
6.395LysLys: 6.395 ± 0.807
6.551LysLeu: 6.551 ± 0.568
2.418LysMet: 2.418 ± 0.436
4.289LysAsn: 4.289 ± 0.737
2.106LysPro: 2.106 ± 0.48
3.509LysGln: 3.509 ± 0.516
3.665LysArg: 3.665 ± 0.516
4.367LysSer: 4.367 ± 0.571
5.381LysThr: 5.381 ± 0.701
5.225LysVal: 5.225 ± 0.754
1.092LysTrp: 1.092 ± 0.291
3.197LysTyr: 3.197 ± 0.54
0.0LysXaa: 0.0 ± 0.0
Leu
7.019LeuAla: 7.019 ± 0.702
0.468LeuCys: 0.468 ± 0.141
7.487LeuAsp: 7.487 ± 0.711
9.046LeuGlu: 9.046 ± 0.831
2.885LeuPhe: 2.885 ± 0.503
6.239LeuGly: 6.239 ± 0.637
0.702LeuHis: 0.702 ± 0.288
4.523LeuIle: 4.523 ± 0.863
7.253LeuLys: 7.253 ± 0.682
6.707LeuLeu: 6.707 ± 0.883
2.729LeuMet: 2.729 ± 0.436
3.821LeuAsn: 3.821 ± 0.661
2.262LeuPro: 2.262 ± 0.486
2.729LeuGln: 2.729 ± 0.485
5.069LeuArg: 5.069 ± 0.862
4.913LeuSer: 4.913 ± 0.517
5.147LeuThr: 5.147 ± 0.481
5.459LeuVal: 5.459 ± 0.762
0.546LeuTrp: 0.546 ± 0.18
2.184LeuTyr: 2.184 ± 0.443
0.0LeuXaa: 0.0 ± 0.0
Met
2.651MetAla: 2.651 ± 0.43
0.078MetCys: 0.078 ± 0.077
1.56MetAsp: 1.56 ± 0.339
1.872MetGlu: 1.872 ± 0.395
1.248MetPhe: 1.248 ± 0.268
1.56MetGly: 1.56 ± 0.405
0.312MetHis: 0.312 ± 0.152
0.858MetIle: 0.858 ± 0.25
1.872MetLys: 1.872 ± 0.389
1.482MetLeu: 1.482 ± 0.291
0.702MetMet: 0.702 ± 0.256
1.326MetAsn: 1.326 ± 0.295
1.326MetPro: 1.326 ± 0.253
0.936MetGln: 0.936 ± 0.255
1.17MetArg: 1.17 ± 0.256
1.716MetSer: 1.716 ± 0.413
3.041MetThr: 3.041 ± 0.486
1.95MetVal: 1.95 ± 0.412
0.078MetTrp: 0.078 ± 0.075
0.936MetTyr: 0.936 ± 0.244
0.0MetXaa: 0.0 ± 0.0
Asn
4.055AsnAla: 4.055 ± 0.701
0.39AsnCys: 0.39 ± 0.178
2.496AsnAsp: 2.496 ± 0.5
2.729AsnGlu: 2.729 ± 0.42
1.872AsnPhe: 1.872 ± 0.385
3.587AsnGly: 3.587 ± 0.444
0.702AsnHis: 0.702 ± 0.228
2.807AsnIle: 2.807 ± 0.464
4.133AsnLys: 4.133 ± 0.679
3.821AsnLeu: 3.821 ± 0.397
1.014AsnMet: 1.014 ± 0.245
1.404AsnAsn: 1.404 ± 0.352
1.95AsnPro: 1.95 ± 0.432
1.872AsnGln: 1.872 ± 0.474
2.496AsnArg: 2.496 ± 0.435
3.041AsnSer: 3.041 ± 0.452
2.184AsnThr: 2.184 ± 0.42
2.574AsnVal: 2.574 ± 0.378
0.546AsnTrp: 0.546 ± 0.172
2.262AsnTyr: 2.262 ± 0.426
0.0AsnXaa: 0.0 ± 0.0
Pro
2.106ProAla: 2.106 ± 0.39
0.234ProCys: 0.234 ± 0.127
1.95ProAsp: 1.95 ± 0.502
2.963ProGlu: 2.963 ± 0.564
1.092ProPhe: 1.092 ± 0.351
2.106ProGly: 2.106 ± 0.422
0.234ProHis: 0.234 ± 0.135
1.794ProIle: 1.794 ± 0.428
2.184ProLys: 2.184 ± 0.541
2.574ProLeu: 2.574 ± 0.594
0.468ProMet: 0.468 ± 0.146
1.092ProAsn: 1.092 ± 0.274
0.936ProPro: 0.936 ± 0.251
0.468ProGln: 0.468 ± 0.184
1.17ProArg: 1.17 ± 0.292
1.794ProSer: 1.794 ± 0.356
1.794ProThr: 1.794 ± 0.447
1.95ProVal: 1.95 ± 0.351
0.624ProTrp: 0.624 ± 0.191
0.936ProTyr: 0.936 ± 0.299
0.0ProXaa: 0.0 ± 0.0
Gln
2.963GlnAla: 2.963 ± 0.317
0.234GlnCys: 0.234 ± 0.117
2.418GlnAsp: 2.418 ± 0.454
3.821GlnGlu: 3.821 ± 0.628
1.248GlnPhe: 1.248 ± 0.296
1.95GlnGly: 1.95 ± 0.456
0.39GlnHis: 0.39 ± 0.184
2.262GlnIle: 2.262 ± 0.397
2.262GlnLys: 2.262 ± 0.385
3.821GlnLeu: 3.821 ± 0.68
1.17GlnMet: 1.17 ± 0.289
1.56GlnAsn: 1.56 ± 0.406
0.936GlnPro: 0.936 ± 0.275
2.028GlnGln: 2.028 ± 0.498
2.651GlnArg: 2.651 ± 0.425
1.482GlnSer: 1.482 ± 0.354
1.716GlnThr: 1.716 ± 0.407
2.106GlnVal: 2.106 ± 0.344
0.234GlnTrp: 0.234 ± 0.161
1.248GlnTyr: 1.248 ± 0.261
0.0GlnXaa: 0.0 ± 0.0
Arg
3.119ArgAla: 3.119 ± 0.472
0.312ArgCys: 0.312 ± 0.155
2.651ArgAsp: 2.651 ± 0.379
4.055ArgGlu: 4.055 ± 0.639
1.716ArgPhe: 1.716 ± 0.389
2.418ArgGly: 2.418 ± 0.526
1.248ArgHis: 1.248 ± 0.293
4.055ArgIle: 4.055 ± 0.699
4.055ArgLys: 4.055 ± 0.619
4.757ArgLeu: 4.757 ± 0.559
1.326ArgMet: 1.326 ± 0.294
2.574ArgAsn: 2.574 ± 0.367
0.858ArgPro: 0.858 ± 0.255
1.95ArgGln: 1.95 ± 0.484
2.106ArgArg: 2.106 ± 0.488
1.482ArgSer: 1.482 ± 0.351
1.872ArgThr: 1.872 ± 0.452
3.119ArgVal: 3.119 ± 0.397
0.468ArgTrp: 0.468 ± 0.152
1.95ArgTyr: 1.95 ± 0.347
0.0ArgXaa: 0.0 ± 0.0
Ser
4.367SerAla: 4.367 ± 0.595
0.39SerCys: 0.39 ± 0.178
2.807SerAsp: 2.807 ± 0.382
4.133SerGlu: 4.133 ± 0.466
1.872SerPhe: 1.872 ± 0.345
3.353SerGly: 3.353 ± 0.628
0.858SerHis: 0.858 ± 0.301
4.055SerIle: 4.055 ± 0.684
5.069SerLys: 5.069 ± 0.684
4.367SerLeu: 4.367 ± 0.515
1.404SerMet: 1.404 ± 0.335
3.275SerAsn: 3.275 ± 0.43
1.95SerPro: 1.95 ± 0.437
3.353SerGln: 3.353 ± 0.593
2.028SerArg: 2.028 ± 0.443
3.197SerSer: 3.197 ± 0.566
2.651SerThr: 2.651 ± 0.511
3.197SerVal: 3.197 ± 0.548
0.624SerTrp: 0.624 ± 0.229
2.418SerTyr: 2.418 ± 0.562
0.0SerXaa: 0.0 ± 0.0
Thr
6.161ThrAla: 6.161 ± 1.079
0.312ThrCys: 0.312 ± 0.155
2.885ThrAsp: 2.885 ± 0.484
3.275ThrGlu: 3.275 ± 0.527
3.197ThrPhe: 3.197 ± 0.389
4.133ThrGly: 4.133 ± 0.536
0.702ThrHis: 0.702 ± 0.247
4.133ThrIle: 4.133 ± 0.585
5.069ThrLys: 5.069 ± 0.735
5.147ThrLeu: 5.147 ± 0.669
1.326ThrMet: 1.326 ± 0.259
2.574ThrAsn: 2.574 ± 0.507
1.248ThrPro: 1.248 ± 0.385
1.404ThrGln: 1.404 ± 0.318
2.418ThrArg: 2.418 ± 0.335
3.509ThrSer: 3.509 ± 0.512
4.133ThrThr: 4.133 ± 0.591
4.289ThrVal: 4.289 ± 0.735
1.014ThrTrp: 1.014 ± 0.288
2.028ThrTyr: 2.028 ± 0.419
0.0ThrXaa: 0.0 ± 0.0
Val
5.381ValAla: 5.381 ± 0.694
0.546ValCys: 0.546 ± 0.197
4.835ValAsp: 4.835 ± 0.593
6.551ValGlu: 6.551 ± 0.783
2.262ValPhe: 2.262 ± 0.37
4.289ValGly: 4.289 ± 0.708
0.858ValHis: 0.858 ± 0.22
3.509ValIle: 3.509 ± 0.633
4.913ValLys: 4.913 ± 0.539
5.381ValLeu: 5.381 ± 0.698
1.092ValMet: 1.092 ± 0.288
3.197ValAsn: 3.197 ± 0.54
2.184ValPro: 2.184 ± 0.498
2.418ValGln: 2.418 ± 0.46
3.119ValArg: 3.119 ± 0.62
3.977ValSer: 3.977 ± 0.523
4.055ValThr: 4.055 ± 0.629
4.835ValVal: 4.835 ± 0.783
0.546ValTrp: 0.546 ± 0.191
2.028ValTyr: 2.028 ± 0.383
0.0ValXaa: 0.0 ± 0.0
Trp
1.638TrpAla: 1.638 ± 0.405
0.156TrpCys: 0.156 ± 0.108
0.78TrpAsp: 0.78 ± 0.209
0.858TrpGlu: 0.858 ± 0.257
0.39TrpPhe: 0.39 ± 0.136
0.624TrpGly: 0.624 ± 0.215
0.312TrpHis: 0.312 ± 0.19
0.546TrpIle: 0.546 ± 0.224
0.858TrpLys: 0.858 ± 0.245
1.014TrpLeu: 1.014 ± 0.312
0.312TrpMet: 0.312 ± 0.141
0.156TrpAsn: 0.156 ± 0.097
0.39TrpPro: 0.39 ± 0.217
0.234TrpGln: 0.234 ± 0.11
0.78TrpArg: 0.78 ± 0.352
0.936TrpSer: 0.936 ± 0.266
0.312TrpThr: 0.312 ± 0.147
0.546TrpVal: 0.546 ± 0.204
0.078TrpTrp: 0.078 ± 0.063
0.546TrpTyr: 0.546 ± 0.19
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.496TyrAla: 2.496 ± 0.402
0.39TyrCys: 0.39 ± 0.168
2.418TyrAsp: 2.418 ± 0.382
2.106TyrGlu: 2.106 ± 0.452
1.56TyrPhe: 1.56 ± 0.368
2.885TyrGly: 2.885 ± 0.53
0.39TyrHis: 0.39 ± 0.219
1.638TyrIle: 1.638 ± 0.401
2.885TyrLys: 2.885 ± 0.533
3.197TyrLeu: 3.197 ± 0.655
1.17TyrMet: 1.17 ± 0.275
2.574TyrAsn: 2.574 ± 0.382
0.702TyrPro: 0.702 ± 0.234
2.184TyrGln: 2.184 ± 0.412
1.17TyrArg: 1.17 ± 0.37
2.496TyrSer: 2.496 ± 0.392
1.95TyrThr: 1.95 ± 0.447
2.574TyrVal: 2.574 ± 0.318
0.468TyrTrp: 0.468 ± 0.221
1.014TyrTyr: 1.014 ± 0.281
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (12824 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski