Amino acid dipepetide frequency for Listeria phage P35

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.82AlaAla: 0.82 ± 0.451
0.364AlaCys: 0.364 ± 0.166
3.371AlaAsp: 3.371 ± 0.48
5.011AlaGlu: 5.011 ± 0.833
2.46AlaPhe: 2.46 ± 0.409
2.46AlaGly: 2.46 ± 0.481
1.002AlaHis: 1.002 ± 0.215
4.555AlaIle: 4.555 ± 0.673
3.918AlaLys: 3.918 ± 0.713
5.649AlaLeu: 5.649 ± 0.643
2.095AlaMet: 2.095 ± 0.438
3.644AlaAsn: 3.644 ± 0.642
1.64AlaPro: 1.64 ± 0.328
2.733AlaGln: 2.733 ± 0.497
1.731AlaArg: 1.731 ± 0.402
2.915AlaSer: 2.915 ± 0.466
3.644AlaThr: 3.644 ± 0.732
4.1AlaVal: 4.1 ± 0.604
0.638AlaTrp: 0.638 ± 0.22
3.007AlaTyr: 3.007 ± 0.593
0.0AlaXaa: 0.0 ± 0.0
Cys
0.82CysAla: 0.82 ± 0.282
0.091CysCys: 0.091 ± 0.075
0.729CysAsp: 0.729 ± 0.249
0.456CysGlu: 0.456 ± 0.229
0.456CysPhe: 0.456 ± 0.247
0.638CysGly: 0.638 ± 0.245
0.0CysHis: 0.0 ± 0.0
0.273CysIle: 0.273 ± 0.153
0.456CysLys: 0.456 ± 0.184
0.364CysLeu: 0.364 ± 0.176
0.091CysMet: 0.091 ± 0.097
0.547CysAsn: 0.547 ± 0.253
0.273CysPro: 0.273 ± 0.17
0.091CysGln: 0.091 ± 0.073
0.547CysArg: 0.547 ± 0.268
0.182CysSer: 0.182 ± 0.121
0.456CysThr: 0.456 ± 0.21
0.364CysVal: 0.364 ± 0.238
0.091CysTrp: 0.091 ± 0.104
0.364CysTyr: 0.364 ± 0.167
0.0CysXaa: 0.0 ± 0.0
Asp
3.371AspAla: 3.371 ± 0.682
1.093AspCys: 1.093 ± 0.337
5.102AspAsp: 5.102 ± 0.761
4.92AspGlu: 4.92 ± 0.954
3.735AspPhe: 3.735 ± 0.634
5.74AspGly: 5.74 ± 0.749
0.911AspHis: 0.911 ± 0.309
5.193AspIle: 5.193 ± 0.645
4.647AspLys: 4.647 ± 0.564
6.378AspLeu: 6.378 ± 0.925
1.367AspMet: 1.367 ± 0.417
4.373AspAsn: 4.373 ± 0.552
2.46AspPro: 2.46 ± 0.587
2.369AspGln: 2.369 ± 0.487
2.095AspArg: 2.095 ± 0.458
3.462AspSer: 3.462 ± 0.383
4.647AspThr: 4.647 ± 0.673
5.922AspVal: 5.922 ± 0.816
1.002AspTrp: 1.002 ± 0.307
4.1AspTyr: 4.1 ± 0.564
0.0AspXaa: 0.0 ± 0.0
Glu
4.92GluAla: 4.92 ± 0.703
0.547GluCys: 0.547 ± 0.187
3.735GluAsp: 3.735 ± 0.502
4.191GluGlu: 4.191 ± 0.788
3.189GluPhe: 3.189 ± 0.525
3.735GluGly: 3.735 ± 0.677
0.729GluHis: 0.729 ± 0.24
4.555GluIle: 4.555 ± 0.798
4.829GluLys: 4.829 ± 0.907
5.558GluLeu: 5.558 ± 1.034
2.187GluMet: 2.187 ± 0.415
3.28GluAsn: 3.28 ± 0.67
2.278GluPro: 2.278 ± 0.588
3.28GluGln: 3.28 ± 0.692
3.007GluArg: 3.007 ± 0.695
3.735GluSer: 3.735 ± 0.643
3.827GluThr: 3.827 ± 0.582
5.558GluVal: 5.558 ± 0.614
1.184GluTrp: 1.184 ± 0.295
2.46GluTyr: 2.46 ± 0.479
0.0GluXaa: 0.0 ± 0.0
Phe
2.187PheAla: 2.187 ± 0.458
0.182PheCys: 0.182 ± 0.148
4.009PheAsp: 4.009 ± 0.424
3.371PheGlu: 3.371 ± 0.501
1.458PhePhe: 1.458 ± 0.318
3.28PheGly: 3.28 ± 0.791
0.456PheHis: 0.456 ± 0.187
3.371PheIle: 3.371 ± 0.623
3.918PheLys: 3.918 ± 0.541
3.28PheLeu: 3.28 ± 0.591
1.002PheMet: 1.002 ± 0.306
2.095PheAsn: 2.095 ± 0.386
1.276PhePro: 1.276 ± 0.355
2.095PheGln: 2.095 ± 0.39
1.913PheArg: 1.913 ± 0.437
2.187PheSer: 2.187 ± 0.492
3.827PheThr: 3.827 ± 0.614
2.095PheVal: 2.095 ± 0.429
0.364PheTrp: 0.364 ± 0.173
2.187PheTyr: 2.187 ± 0.462
0.0PheXaa: 0.0 ± 0.0
Gly
2.915GlyAla: 2.915 ± 0.526
0.091GlyCys: 0.091 ± 0.094
5.011GlyAsp: 5.011 ± 0.598
2.824GlyGlu: 2.824 ± 0.468
3.28GlyPhe: 3.28 ± 0.569
4.829GlyGly: 4.829 ± 0.768
1.731GlyHis: 1.731 ± 0.349
4.738GlyIle: 4.738 ± 0.864
3.735GlyLys: 3.735 ± 0.575
5.375GlyLeu: 5.375 ± 0.827
2.004GlyMet: 2.004 ± 0.54
3.28GlyAsn: 3.28 ± 0.577
0.182GlyPro: 0.182 ± 0.124
2.369GlyGln: 2.369 ± 0.459
2.551GlyArg: 2.551 ± 0.602
2.824GlySer: 2.824 ± 0.531
3.462GlyThr: 3.462 ± 0.716
6.195GlyVal: 6.195 ± 0.896
0.547GlyTrp: 0.547 ± 0.189
3.644GlyTyr: 3.644 ± 0.547
0.0GlyXaa: 0.0 ± 0.0
His
1.184HisAla: 1.184 ± 0.284
0.273HisCys: 0.273 ± 0.13
0.638HisAsp: 0.638 ± 0.26
1.093HisGlu: 1.093 ± 0.383
0.729HisPhe: 0.729 ± 0.317
1.276HisGly: 1.276 ± 0.318
0.638HisHis: 0.638 ± 0.243
1.367HisIle: 1.367 ± 0.307
0.911HisLys: 0.911 ± 0.285
1.731HisLeu: 1.731 ± 0.429
0.547HisMet: 0.547 ± 0.203
1.093HisAsn: 1.093 ± 0.344
0.547HisPro: 0.547 ± 0.206
0.638HisGln: 0.638 ± 0.208
0.911HisArg: 0.911 ± 0.367
0.82HisSer: 0.82 ± 0.305
1.458HisThr: 1.458 ± 0.295
1.731HisVal: 1.731 ± 0.418
0.273HisTrp: 0.273 ± 0.166
0.456HisTyr: 0.456 ± 0.237
0.0HisXaa: 0.0 ± 0.0
Ile
3.918IleAla: 3.918 ± 0.54
0.182IleCys: 0.182 ± 0.121
6.56IleAsp: 6.56 ± 0.712
4.191IleGlu: 4.191 ± 0.816
3.462IlePhe: 3.462 ± 0.676
3.553IleGly: 3.553 ± 0.613
0.729IleHis: 0.729 ± 0.339
4.373IleIle: 4.373 ± 0.658
6.469IleLys: 6.469 ± 0.702
4.191IleLeu: 4.191 ± 0.466
1.458IleMet: 1.458 ± 0.343
3.827IleAsn: 3.827 ± 0.59
4.1IlePro: 4.1 ± 0.575
2.187IleGln: 2.187 ± 0.436
3.462IleArg: 3.462 ± 0.583
3.462IleSer: 3.462 ± 0.486
4.282IleThr: 4.282 ± 0.693
5.284IleVal: 5.284 ± 0.657
1.002IleTrp: 1.002 ± 0.339
2.642IleTyr: 2.642 ± 0.554
0.0IleXaa: 0.0 ± 0.0
Lys
5.284LysAla: 5.284 ± 0.578
0.638LysCys: 0.638 ± 0.227
5.102LysAsp: 5.102 ± 0.581
5.649LysGlu: 5.649 ± 1.064
3.28LysPhe: 3.28 ± 0.537
5.375LysGly: 5.375 ± 1.091
1.64LysHis: 1.64 ± 0.343
3.553LysIle: 3.553 ± 0.572
6.469LysLys: 6.469 ± 1.057
5.922LysLeu: 5.922 ± 0.763
3.098LysMet: 3.098 ± 0.631
3.462LysAsn: 3.462 ± 0.439
2.915LysPro: 2.915 ± 0.469
2.915LysGln: 2.915 ± 0.677
4.009LysArg: 4.009 ± 0.615
4.191LysSer: 4.191 ± 0.648
4.373LysThr: 4.373 ± 0.65
5.558LysVal: 5.558 ± 0.673
0.729LysTrp: 0.729 ± 0.205
3.462LysTyr: 3.462 ± 0.58
0.0LysXaa: 0.0 ± 0.0
Leu
4.647LeuAla: 4.647 ± 0.788
0.547LeuCys: 0.547 ± 0.216
7.38LeuAsp: 7.38 ± 0.823
5.011LeuGlu: 5.011 ± 0.663
3.007LeuPhe: 3.007 ± 0.593
3.918LeuGly: 3.918 ± 0.678
1.184LeuHis: 1.184 ± 0.311
5.011LeuIle: 5.011 ± 0.748
6.742LeuLys: 6.742 ± 0.918
5.74LeuLeu: 5.74 ± 0.798
2.095LeuMet: 2.095 ± 0.503
5.011LeuAsn: 5.011 ± 0.596
2.46LeuPro: 2.46 ± 0.395
2.46LeuGln: 2.46 ± 0.469
3.735LeuArg: 3.735 ± 0.631
4.1LeuSer: 4.1 ± 0.411
6.013LeuThr: 6.013 ± 0.866
5.193LeuVal: 5.193 ± 0.639
1.002LeuTrp: 1.002 ± 0.295
2.369LeuTyr: 2.369 ± 0.478
0.0LeuXaa: 0.0 ± 0.0
Met
1.276MetAla: 1.276 ± 0.297
0.091MetCys: 0.091 ± 0.113
1.64MetAsp: 1.64 ± 0.34
2.095MetGlu: 2.095 ± 0.394
0.82MetPhe: 0.82 ± 0.304
2.187MetGly: 2.187 ± 0.459
0.273MetHis: 0.273 ± 0.164
2.278MetIle: 2.278 ± 0.434
2.004MetLys: 2.004 ± 0.431
2.46MetLeu: 2.46 ± 0.453
0.364MetMet: 0.364 ± 0.185
1.458MetAsn: 1.458 ± 0.332
1.276MetPro: 1.276 ± 0.323
0.911MetGln: 0.911 ± 0.251
1.549MetArg: 1.549 ± 0.411
1.367MetSer: 1.367 ± 0.354
3.098MetThr: 3.098 ± 0.548
1.822MetVal: 1.822 ± 0.476
0.273MetTrp: 0.273 ± 0.153
1.002MetTyr: 1.002 ± 0.291
0.0MetXaa: 0.0 ± 0.0
Asn
4.738AsnAla: 4.738 ± 0.604
0.638AsnCys: 0.638 ± 0.216
3.553AsnAsp: 3.553 ± 0.548
3.827AsnGlu: 3.827 ± 0.739
2.369AsnPhe: 2.369 ± 0.403
3.553AsnGly: 3.553 ± 0.643
1.184AsnHis: 1.184 ± 0.309
5.193AsnIle: 5.193 ± 0.714
3.007AsnLys: 3.007 ± 0.512
2.915AsnLeu: 2.915 ± 0.55
1.822AsnMet: 1.822 ± 0.335
3.007AsnAsn: 3.007 ± 0.577
2.551AsnPro: 2.551 ± 0.478
2.278AsnGln: 2.278 ± 0.51
2.551AsnArg: 2.551 ± 0.545
3.462AsnSer: 3.462 ± 0.563
2.824AsnThr: 2.824 ± 0.504
3.462AsnVal: 3.462 ± 0.625
0.82AsnTrp: 0.82 ± 0.322
2.095AsnTyr: 2.095 ± 0.429
0.0AsnXaa: 0.0 ± 0.0
Pro
1.731ProAla: 1.731 ± 0.364
0.091ProCys: 0.091 ± 0.083
2.551ProAsp: 2.551 ± 0.466
2.095ProGlu: 2.095 ± 0.435
1.549ProPhe: 1.549 ± 0.362
0.547ProGly: 0.547 ± 0.245
0.82ProHis: 0.82 ± 0.322
2.095ProIle: 2.095 ± 0.412
3.189ProLys: 3.189 ± 0.603
2.46ProLeu: 2.46 ± 0.417
0.638ProMet: 0.638 ± 0.211
2.824ProAsn: 2.824 ± 0.664
1.002ProPro: 1.002 ± 0.299
1.367ProGln: 1.367 ± 0.303
1.367ProArg: 1.367 ± 0.303
2.278ProSer: 2.278 ± 0.393
3.644ProThr: 3.644 ± 0.981
2.278ProVal: 2.278 ± 0.535
0.182ProTrp: 0.182 ± 0.147
1.64ProTyr: 1.64 ± 0.414
0.0ProXaa: 0.0 ± 0.0
Gln
3.098GlnAla: 3.098 ± 0.533
0.091GlnCys: 0.091 ± 0.1
2.642GlnAsp: 2.642 ± 0.51
2.824GlnGlu: 2.824 ± 0.61
1.276GlnPhe: 1.276 ± 0.427
3.007GlnGly: 3.007 ± 0.612
1.002GlnHis: 1.002 ± 0.314
2.187GlnIle: 2.187 ± 0.45
2.824GlnLys: 2.824 ± 0.683
3.644GlnLeu: 3.644 ± 0.724
1.002GlnMet: 1.002 ± 0.335
1.276GlnAsn: 1.276 ± 0.335
1.549GlnPro: 1.549 ± 0.385
1.64GlnGln: 1.64 ± 0.484
1.184GlnArg: 1.184 ± 0.356
2.642GlnSer: 2.642 ± 0.413
1.822GlnThr: 1.822 ± 0.47
2.46GlnVal: 2.46 ± 0.396
0.638GlnTrp: 0.638 ± 0.263
1.549GlnTyr: 1.549 ± 0.326
0.0GlnXaa: 0.0 ± 0.0
Arg
2.551ArgAla: 2.551 ± 0.565
0.182ArgCys: 0.182 ± 0.134
3.553ArgAsp: 3.553 ± 0.662
2.824ArgGlu: 2.824 ± 0.531
1.64ArgPhe: 1.64 ± 0.342
2.551ArgGly: 2.551 ± 0.567
1.549ArgHis: 1.549 ± 0.369
3.007ArgIle: 3.007 ± 0.458
3.827ArgLys: 3.827 ± 0.651
4.009ArgLeu: 4.009 ± 0.539
1.093ArgMet: 1.093 ± 0.316
2.095ArgAsn: 2.095 ± 0.538
1.276ArgPro: 1.276 ± 0.329
1.64ArgGln: 1.64 ± 0.391
1.913ArgArg: 1.913 ± 0.417
2.46ArgSer: 2.46 ± 0.448
1.184ArgThr: 1.184 ± 0.366
3.644ArgVal: 3.644 ± 0.659
0.547ArgTrp: 0.547 ± 0.227
2.278ArgTyr: 2.278 ± 0.604
0.0ArgXaa: 0.0 ± 0.0
Ser
2.915SerAla: 2.915 ± 0.517
0.273SerCys: 0.273 ± 0.143
4.555SerAsp: 4.555 ± 0.58
3.918SerGlu: 3.918 ± 0.673
2.642SerPhe: 2.642 ± 0.488
3.371SerGly: 3.371 ± 0.744
1.367SerHis: 1.367 ± 0.347
3.553SerIle: 3.553 ± 0.56
3.735SerLys: 3.735 ± 0.602
3.735SerLeu: 3.735 ± 0.534
1.731SerMet: 1.731 ± 0.358
3.098SerAsn: 3.098 ± 0.546
1.64SerPro: 1.64 ± 0.391
2.004SerGln: 2.004 ± 0.422
2.46SerArg: 2.46 ± 0.371
2.824SerSer: 2.824 ± 0.473
3.827SerThr: 3.827 ± 0.754
3.827SerVal: 3.827 ± 0.65
0.547SerTrp: 0.547 ± 0.22
2.824SerTyr: 2.824 ± 0.626
0.0SerXaa: 0.0 ± 0.0
Thr
3.371ThrAla: 3.371 ± 0.557
0.273ThrCys: 0.273 ± 0.188
4.92ThrAsp: 4.92 ± 0.604
3.827ThrGlu: 3.827 ± 0.752
3.28ThrPhe: 3.28 ± 0.582
4.738ThrGly: 4.738 ± 0.847
1.002ThrHis: 1.002 ± 0.345
5.375ThrIle: 5.375 ± 0.674
5.74ThrLys: 5.74 ± 0.692
4.829ThrLeu: 4.829 ± 0.731
1.458ThrMet: 1.458 ± 0.367
4.1ThrAsn: 4.1 ± 0.851
2.642ThrPro: 2.642 ± 0.581
3.371ThrGln: 3.371 ± 0.413
2.095ThrArg: 2.095 ± 0.379
4.555ThrSer: 4.555 ± 0.599
4.92ThrThr: 4.92 ± 0.667
4.009ThrVal: 4.009 ± 0.556
0.82ThrTrp: 0.82 ± 0.309
1.822ThrTyr: 1.822 ± 0.386
0.0ThrXaa: 0.0 ± 0.0
Val
3.098ValAla: 3.098 ± 0.437
0.729ValCys: 0.729 ± 0.226
3.827ValAsp: 3.827 ± 0.685
4.647ValGlu: 4.647 ± 0.619
3.553ValPhe: 3.553 ± 0.51
4.191ValGly: 4.191 ± 0.491
0.82ValHis: 0.82 ± 0.238
5.466ValIle: 5.466 ± 0.656
6.56ValLys: 6.56 ± 0.815
5.011ValLeu: 5.011 ± 0.653
2.095ValMet: 2.095 ± 0.485
3.918ValAsn: 3.918 ± 0.637
2.915ValPro: 2.915 ± 0.513
2.187ValGln: 2.187 ± 0.434
4.1ValArg: 4.1 ± 0.577
4.1ValSer: 4.1 ± 0.574
5.831ValThr: 5.831 ± 0.854
4.009ValVal: 4.009 ± 0.63
1.093ValTrp: 1.093 ± 0.354
2.004ValTyr: 2.004 ± 0.418
0.0ValXaa: 0.0 ± 0.0
Trp
0.911TrpAla: 0.911 ± 0.238
0.364TrpCys: 0.364 ± 0.188
1.276TrpAsp: 1.276 ± 0.344
0.547TrpGlu: 0.547 ± 0.192
0.456TrpPhe: 0.456 ± 0.191
0.547TrpGly: 0.547 ± 0.187
0.273TrpHis: 0.273 ± 0.156
0.638TrpIle: 0.638 ± 0.235
1.276TrpLys: 1.276 ± 0.421
1.002TrpLeu: 1.002 ± 0.523
0.273TrpMet: 0.273 ± 0.154
0.547TrpAsn: 0.547 ± 0.208
0.0TrpPro: 0.0 ± 0.0
0.547TrpGln: 0.547 ± 0.264
0.364TrpArg: 0.364 ± 0.242
0.911TrpSer: 0.911 ± 0.244
1.002TrpThr: 1.002 ± 0.336
0.456TrpVal: 0.456 ± 0.225
0.091TrpTrp: 0.091 ± 0.1
1.002TrpTyr: 1.002 ± 0.297
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.004TyrAla: 2.004 ± 0.462
0.547TyrCys: 0.547 ± 0.305
2.733TyrAsp: 2.733 ± 0.458
3.462TyrGlu: 3.462 ± 0.583
1.913TyrPhe: 1.913 ± 0.368
2.095TyrGly: 2.095 ± 0.424
1.002TyrHis: 1.002 ± 0.25
2.551TyrIle: 2.551 ± 0.464
3.371TyrLys: 3.371 ± 0.6
3.371TyrLeu: 3.371 ± 0.574
1.731TyrMet: 1.731 ± 0.376
2.915TyrAsn: 2.915 ± 0.488
1.367TyrPro: 1.367 ± 0.328
1.276TyrGln: 1.276 ± 0.376
2.278TyrArg: 2.278 ± 0.516
2.369TyrSer: 2.369 ± 0.431
3.007TyrThr: 3.007 ± 0.541
2.187TyrVal: 2.187 ± 0.564
0.638TyrTrp: 0.638 ± 0.233
2.187TyrTyr: 2.187 ± 0.47
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (10977 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski