Amino acid dipepetide frequency for Leuconostoc phage P793

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.34AlaAla: 1.34 ± 0.639
0.0AlaCys: 0.0 ± 0.0
5.482AlaAsp: 5.482 ± 0.723
1.584AlaGlu: 1.584 ± 0.337
3.289AlaPhe: 3.289 ± 0.894
4.995AlaGly: 4.995 ± 0.859
0.244AlaHis: 0.244 ± 0.175
6.944AlaIle: 6.944 ± 1.072
4.02AlaLys: 4.02 ± 0.578
5.117AlaLeu: 5.117 ± 0.878
1.218AlaMet: 1.218 ± 0.436
5.604AlaAsn: 5.604 ± 0.832
1.949AlaPro: 1.949 ± 0.485
3.411AlaGln: 3.411 ± 0.684
1.34AlaArg: 1.34 ± 0.328
4.873AlaSer: 4.873 ± 0.8
5.117AlaThr: 5.117 ± 0.897
4.386AlaVal: 4.386 ± 0.742
0.853AlaTrp: 0.853 ± 0.279
2.802AlaTyr: 2.802 ± 0.584
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.122CysAsp: 0.122 ± 0.119
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.365CysHis: 0.365 ± 0.254
0.365CysIle: 0.365 ± 0.258
0.0CysLys: 0.0 ± 0.0
0.122CysLeu: 0.122 ± 0.134
0.0CysMet: 0.0 ± 0.0
0.244CysAsn: 0.244 ± 0.163
0.0CysPro: 0.0 ± 0.0
0.122CysGln: 0.122 ± 0.11
0.122CysArg: 0.122 ± 0.106
0.244CysSer: 0.244 ± 0.161
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.949AspAla: 1.949 ± 0.455
0.365AspCys: 0.365 ± 0.217
4.751AspAsp: 4.751 ± 0.951
4.386AspGlu: 4.386 ± 1.016
3.777AspPhe: 3.777 ± 0.586
5.726AspGly: 5.726 ± 0.79
0.975AspHis: 0.975 ± 0.325
5.117AspIle: 5.117 ± 0.603
5.239AspLys: 5.239 ± 1.002
5.239AspLeu: 5.239 ± 0.974
2.071AspMet: 2.071 ± 0.426
4.873AspAsn: 4.873 ± 0.821
2.193AspPro: 2.193 ± 0.48
0.853AspGln: 0.853 ± 0.381
1.462AspArg: 1.462 ± 0.475
4.142AspSer: 4.142 ± 0.888
4.142AspThr: 4.142 ± 0.785
3.533AspVal: 3.533 ± 0.618
0.731AspTrp: 0.731 ± 0.249
3.777AspTyr: 3.777 ± 0.69
0.0AspXaa: 0.0 ± 0.0
Glu
1.949GluAla: 1.949 ± 0.503
0.122GluCys: 0.122 ± 0.11
1.827GluAsp: 1.827 ± 0.593
1.949GluGlu: 1.949 ± 0.566
2.558GluPhe: 2.558 ± 0.641
1.584GluGly: 1.584 ± 0.451
0.975GluHis: 0.975 ± 0.361
4.142GluIle: 4.142 ± 0.76
3.289GluLys: 3.289 ± 0.756
6.213GluLeu: 6.213 ± 0.855
0.975GluMet: 0.975 ± 0.352
4.63GluAsn: 4.63 ± 0.741
1.34GluPro: 1.34 ± 0.466
1.827GluGln: 1.827 ± 0.588
1.706GluArg: 1.706 ± 0.497
2.68GluSer: 2.68 ± 0.594
3.533GluThr: 3.533 ± 0.698
2.68GluVal: 2.68 ± 0.793
0.853GluTrp: 0.853 ± 0.352
2.315GluTyr: 2.315 ± 0.599
0.0GluXaa: 0.0 ± 0.0
Phe
2.68PheAla: 2.68 ± 0.574
0.244PheCys: 0.244 ± 0.156
3.655PheAsp: 3.655 ± 0.598
2.802PheGlu: 2.802 ± 0.535
1.34PhePhe: 1.34 ± 0.471
4.02PheGly: 4.02 ± 0.496
0.609PheHis: 0.609 ± 0.244
3.899PheIle: 3.899 ± 0.733
4.02PheLys: 4.02 ± 0.675
3.289PheLeu: 3.289 ± 0.699
1.34PheMet: 1.34 ± 0.468
3.289PheAsn: 3.289 ± 0.591
0.975PhePro: 0.975 ± 0.332
1.462PheGln: 1.462 ± 0.469
1.218PheArg: 1.218 ± 0.347
3.168PheSer: 3.168 ± 0.891
3.655PheThr: 3.655 ± 0.781
2.315PheVal: 2.315 ± 0.447
0.487PheTrp: 0.487 ± 0.301
1.949PheTyr: 1.949 ± 0.464
0.0PheXaa: 0.0 ± 0.0
Gly
4.63GlyAla: 4.63 ± 1.163
0.365GlyCys: 0.365 ± 0.236
3.777GlyAsp: 3.777 ± 0.576
2.193GlyGlu: 2.193 ± 0.512
4.63GlyPhe: 4.63 ± 0.96
3.411GlyGly: 3.411 ± 0.702
0.609GlyHis: 0.609 ± 0.309
5.97GlyIle: 5.97 ± 1.541
5.482GlyLys: 5.482 ± 0.841
5.239GlyLeu: 5.239 ± 0.806
1.218GlyMet: 1.218 ± 0.322
4.995GlyAsn: 4.995 ± 1.045
0.244GlyPro: 0.244 ± 0.153
2.924GlyGln: 2.924 ± 0.496
2.071GlyArg: 2.071 ± 0.466
5.848GlySer: 5.848 ± 1.251
5.604GlyThr: 5.604 ± 1.008
5.482GlyVal: 5.482 ± 1.407
0.365GlyTrp: 0.365 ± 0.187
3.168GlyTyr: 3.168 ± 0.832
0.0GlyXaa: 0.0 ± 0.0
His
0.975HisAla: 0.975 ± 0.291
0.122HisCys: 0.122 ± 0.134
0.975HisAsp: 0.975 ± 0.309
0.853HisGlu: 0.853 ± 0.444
0.244HisPhe: 0.244 ± 0.156
1.584HisGly: 1.584 ± 0.402
0.365HisHis: 0.365 ± 0.186
1.096HisIle: 1.096 ± 0.346
0.609HisLys: 0.609 ± 0.251
1.096HisLeu: 1.096 ± 0.401
0.487HisMet: 0.487 ± 0.267
0.975HisAsn: 0.975 ± 0.315
0.122HisPro: 0.122 ± 0.121
0.487HisGln: 0.487 ± 0.232
0.365HisArg: 0.365 ± 0.19
1.462HisSer: 1.462 ± 0.374
0.975HisThr: 0.975 ± 0.332
0.487HisVal: 0.487 ± 0.216
0.244HisTrp: 0.244 ± 0.162
1.218HisTyr: 1.218 ± 0.427
0.0HisXaa: 0.0 ± 0.0
Ile
4.873IleAla: 4.873 ± 0.78
0.0IleCys: 0.0 ± 0.0
5.117IleAsp: 5.117 ± 0.85
4.02IleGlu: 4.02 ± 0.745
3.046IlePhe: 3.046 ± 0.623
5.482IleGly: 5.482 ± 1.653
1.34IleHis: 1.34 ± 0.329
4.873IleIle: 4.873 ± 0.746
6.335IleLys: 6.335 ± 0.789
5.117IleLeu: 5.117 ± 0.714
1.462IleMet: 1.462 ± 0.637
4.02IleAsn: 4.02 ± 0.676
2.071IlePro: 2.071 ± 0.377
2.802IleGln: 2.802 ± 0.512
2.193IleArg: 2.193 ± 0.367
5.239IleSer: 5.239 ± 0.843
6.213IleThr: 6.213 ± 1.022
4.63IleVal: 4.63 ± 0.859
0.975IleTrp: 0.975 ± 0.338
3.289IleTyr: 3.289 ± 0.681
0.0IleXaa: 0.0 ± 0.0
Lys
5.239LysAla: 5.239 ± 0.652
0.0LysCys: 0.0 ± 0.0
3.168LysAsp: 3.168 ± 0.702
3.046LysGlu: 3.046 ± 0.604
3.046LysPhe: 3.046 ± 0.697
4.142LysGly: 4.142 ± 0.574
1.34LysHis: 1.34 ± 0.42
4.63LysIle: 4.63 ± 0.7
4.63LysLys: 4.63 ± 0.97
6.944LysLeu: 6.944 ± 0.928
2.315LysMet: 2.315 ± 0.663
4.386LysAsn: 4.386 ± 0.76
3.777LysPro: 3.777 ± 0.836
3.899LysGln: 3.899 ± 0.721
3.168LysArg: 3.168 ± 0.59
4.751LysSer: 4.751 ± 1.012
4.873LysThr: 4.873 ± 0.779
3.533LysVal: 3.533 ± 0.627
0.609LysTrp: 0.609 ± 0.264
3.411LysTyr: 3.411 ± 0.774
0.0LysXaa: 0.0 ± 0.0
Leu
8.041LeuAla: 8.041 ± 1.005
0.0LeuCys: 0.0 ± 0.0
6.579LeuAsp: 6.579 ± 0.908
4.751LeuGlu: 4.751 ± 0.735
3.046LeuPhe: 3.046 ± 0.615
6.335LeuGly: 6.335 ± 1.252
1.706LeuHis: 1.706 ± 0.505
4.142LeuIle: 4.142 ± 0.801
5.97LeuLys: 5.97 ± 0.777
4.873LeuLeu: 4.873 ± 0.838
2.071LeuMet: 2.071 ± 0.374
5.239LeuAsn: 5.239 ± 0.747
2.558LeuPro: 2.558 ± 0.528
3.289LeuGln: 3.289 ± 0.583
2.193LeuArg: 2.193 ± 0.509
5.361LeuSer: 5.361 ± 0.685
6.092LeuThr: 6.092 ± 1.292
5.361LeuVal: 5.361 ± 0.768
0.609LeuTrp: 0.609 ± 0.248
2.558LeuTyr: 2.558 ± 0.599
0.0LeuXaa: 0.0 ± 0.0
Met
2.558MetAla: 2.558 ± 0.463
0.0MetCys: 0.0 ± 0.0
0.975MetAsp: 0.975 ± 0.347
0.853MetGlu: 0.853 ± 0.261
0.731MetPhe: 0.731 ± 0.355
1.827MetGly: 1.827 ± 0.659
0.487MetHis: 0.487 ± 0.204
1.462MetIle: 1.462 ± 0.443
1.34MetLys: 1.34 ± 0.458
0.487MetLeu: 0.487 ± 0.245
0.365MetMet: 0.365 ± 0.187
1.34MetAsn: 1.34 ± 0.389
1.096MetPro: 1.096 ± 0.421
0.731MetGln: 0.731 ± 0.349
0.731MetArg: 0.731 ± 0.256
1.949MetSer: 1.949 ± 0.562
2.071MetThr: 2.071 ± 0.394
2.193MetVal: 2.193 ± 0.426
0.0MetTrp: 0.0 ± 0.0
0.975MetTyr: 0.975 ± 0.37
0.0MetXaa: 0.0 ± 0.0
Asn
5.604AsnAla: 5.604 ± 0.87
0.0AsnCys: 0.0 ± 0.0
3.655AsnAsp: 3.655 ± 0.638
3.777AsnGlu: 3.777 ± 0.809
2.558AsnPhe: 2.558 ± 0.705
6.457AsnGly: 6.457 ± 1.046
0.853AsnHis: 0.853 ± 0.323
4.142AsnIle: 4.142 ± 0.589
4.02AsnLys: 4.02 ± 0.836
4.751AsnLeu: 4.751 ± 0.643
1.096AsnMet: 1.096 ± 0.308
4.873AsnAsn: 4.873 ± 0.595
2.558AsnPro: 2.558 ± 0.581
3.899AsnGln: 3.899 ± 0.694
2.071AsnArg: 2.071 ± 0.467
3.168AsnSer: 3.168 ± 0.528
4.63AsnThr: 4.63 ± 0.779
5.239AsnVal: 5.239 ± 0.638
0.975AsnTrp: 0.975 ± 0.308
3.533AsnTyr: 3.533 ± 0.848
0.0AsnXaa: 0.0 ± 0.0
Pro
1.827ProAla: 1.827 ± 0.36
0.122ProCys: 0.122 ± 0.106
2.68ProAsp: 2.68 ± 0.593
1.218ProGlu: 1.218 ± 0.46
1.827ProPhe: 1.827 ± 0.471
0.122ProGly: 0.122 ± 0.114
0.609ProHis: 0.609 ± 0.335
2.924ProIle: 2.924 ± 0.517
2.68ProLys: 2.68 ± 0.59
2.802ProLeu: 2.802 ± 0.389
0.609ProMet: 0.609 ± 0.286
1.949ProAsn: 1.949 ± 0.627
0.244ProPro: 0.244 ± 0.177
1.462ProGln: 1.462 ± 0.482
0.975ProArg: 0.975 ± 0.452
2.924ProSer: 2.924 ± 0.617
2.558ProThr: 2.558 ± 0.563
2.071ProVal: 2.071 ± 0.456
0.0ProTrp: 0.0 ± 0.0
1.706ProTyr: 1.706 ± 0.477
0.0ProXaa: 0.0 ± 0.0
Gln
4.142GlnAla: 4.142 ± 1.154
0.122GlnCys: 0.122 ± 0.106
2.558GlnAsp: 2.558 ± 0.527
1.949GlnGlu: 1.949 ± 0.79
1.706GlnPhe: 1.706 ± 0.46
1.949GlnGly: 1.949 ± 0.382
0.0GlnHis: 0.0 ± 0.0
2.924GlnIle: 2.924 ± 0.572
2.924GlnLys: 2.924 ± 0.737
4.142GlnLeu: 4.142 ± 0.863
1.584GlnMet: 1.584 ± 0.393
2.437GlnAsn: 2.437 ± 0.451
1.827GlnPro: 1.827 ± 0.425
1.949GlnGln: 1.949 ± 0.433
1.827GlnArg: 1.827 ± 0.461
3.289GlnSer: 3.289 ± 0.47
3.411GlnThr: 3.411 ± 0.666
2.68GlnVal: 2.68 ± 0.533
0.609GlnTrp: 0.609 ± 0.244
1.827GlnTyr: 1.827 ± 0.514
0.0GlnXaa: 0.0 ± 0.0
Arg
1.949ArgAla: 1.949 ± 0.55
0.0ArgCys: 0.0 ± 0.0
3.046ArgAsp: 3.046 ± 0.709
1.462ArgGlu: 1.462 ± 0.375
1.218ArgPhe: 1.218 ± 0.427
1.827ArgGly: 1.827 ± 0.421
0.609ArgHis: 0.609 ± 0.292
1.949ArgIle: 1.949 ± 0.47
1.827ArgLys: 1.827 ± 0.436
3.411ArgLeu: 3.411 ± 0.66
0.487ArgMet: 0.487 ± 0.216
0.853ArgAsn: 0.853 ± 0.311
1.584ArgPro: 1.584 ± 0.464
2.071ArgGln: 2.071 ± 0.463
0.487ArgArg: 0.487 ± 0.253
1.34ArgSer: 1.34 ± 0.379
2.315ArgThr: 2.315 ± 0.631
2.193ArgVal: 2.193 ± 0.507
0.731ArgTrp: 0.731 ± 0.334
1.34ArgTyr: 1.34 ± 0.465
0.0ArgXaa: 0.0 ± 0.0
Ser
4.995SerAla: 4.995 ± 0.661
0.0SerCys: 0.0 ± 0.0
4.751SerAsp: 4.751 ± 0.649
3.289SerGlu: 3.289 ± 0.931
3.046SerPhe: 3.046 ± 0.873
6.579SerGly: 6.579 ± 1.815
1.34SerHis: 1.34 ± 0.418
5.117SerIle: 5.117 ± 0.808
4.751SerLys: 4.751 ± 0.631
5.239SerLeu: 5.239 ± 1.409
1.706SerMet: 1.706 ± 0.447
4.264SerAsn: 4.264 ± 0.833
1.827SerPro: 1.827 ± 0.476
4.873SerGln: 4.873 ± 0.727
2.193SerArg: 2.193 ± 0.372
6.457SerSer: 6.457 ± 1.06
5.726SerThr: 5.726 ± 0.97
6.823SerVal: 6.823 ± 1.561
0.365SerTrp: 0.365 ± 0.231
2.315SerTyr: 2.315 ± 0.494
0.0SerXaa: 0.0 ± 0.0
Thr
4.751ThrAla: 4.751 ± 0.714
0.122ThrCys: 0.122 ± 0.11
3.899ThrAsp: 3.899 ± 0.491
2.437ThrGlu: 2.437 ± 0.511
3.533ThrPhe: 3.533 ± 0.696
5.726ThrGly: 5.726 ± 0.706
0.853ThrHis: 0.853 ± 0.39
5.848ThrIle: 5.848 ± 0.918
5.482ThrLys: 5.482 ± 1.053
6.579ThrLeu: 6.579 ± 0.954
0.853ThrMet: 0.853 ± 0.362
5.848ThrAsn: 5.848 ± 0.712
2.437ThrPro: 2.437 ± 0.455
3.289ThrGln: 3.289 ± 0.718
3.289ThrArg: 3.289 ± 0.612
7.554ThrSer: 7.554 ± 1.573
4.995ThrThr: 4.995 ± 0.818
4.142ThrVal: 4.142 ± 0.545
0.975ThrTrp: 0.975 ± 0.402
2.558ThrTyr: 2.558 ± 0.643
0.0ThrXaa: 0.0 ± 0.0
Val
3.655ValAla: 3.655 ± 0.663
0.122ValCys: 0.122 ± 0.106
4.386ValAsp: 4.386 ± 0.752
2.924ValGlu: 2.924 ± 0.886
4.02ValPhe: 4.02 ± 0.667
3.289ValGly: 3.289 ± 1.126
0.244ValHis: 0.244 ± 0.158
4.142ValIle: 4.142 ± 0.866
5.239ValLys: 5.239 ± 0.926
4.142ValLeu: 4.142 ± 0.66
1.462ValMet: 1.462 ± 0.406
4.142ValAsn: 4.142 ± 1.056
2.68ValPro: 2.68 ± 0.545
2.558ValGln: 2.558 ± 0.522
1.584ValArg: 1.584 ± 0.483
6.579ValSer: 6.579 ± 0.976
5.361ValThr: 5.361 ± 0.935
3.655ValVal: 3.655 ± 0.524
0.365ValTrp: 0.365 ± 0.203
4.02ValTyr: 4.02 ± 0.746
0.0ValXaa: 0.0 ± 0.0
Trp
0.609TrpAla: 0.609 ± 0.258
0.0TrpCys: 0.0 ± 0.0
0.853TrpAsp: 0.853 ± 0.221
0.975TrpGlu: 0.975 ± 0.364
0.487TrpPhe: 0.487 ± 0.229
0.731TrpGly: 0.731 ± 0.286
0.365TrpHis: 0.365 ± 0.217
0.487TrpIle: 0.487 ± 0.23
0.244TrpLys: 0.244 ± 0.153
1.34TrpLeu: 1.34 ± 0.453
0.0TrpMet: 0.0 ± 0.0
0.731TrpAsn: 0.731 ± 0.352
0.0TrpPro: 0.0 ± 0.0
0.487TrpGln: 0.487 ± 0.293
0.365TrpArg: 0.365 ± 0.219
1.096TrpSer: 1.096 ± 0.409
0.609TrpThr: 0.609 ± 0.291
0.487TrpVal: 0.487 ± 0.268
0.244TrpTrp: 0.244 ± 0.155
0.487TrpTyr: 0.487 ± 0.267
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.802TyrAla: 2.802 ± 0.54
0.0TyrCys: 0.0 ± 0.0
3.046TyrAsp: 3.046 ± 0.879
2.68TyrGlu: 2.68 ± 0.808
2.315TyrPhe: 2.315 ± 0.573
2.437TyrGly: 2.437 ± 0.849
0.853TyrHis: 0.853 ± 0.418
2.924TyrIle: 2.924 ± 0.645
2.802TyrLys: 2.802 ± 0.57
4.751TyrLeu: 4.751 ± 0.723
0.731TyrMet: 0.731 ± 0.377
3.168TyrAsn: 3.168 ± 0.691
1.827TyrPro: 1.827 ± 0.434
1.462TyrGln: 1.462 ± 0.432
1.34TyrArg: 1.34 ± 0.497
3.533TyrSer: 3.533 ± 0.753
3.289TyrThr: 3.289 ± 0.635
2.68TyrVal: 2.68 ± 0.594
0.609TyrTrp: 0.609 ± 0.287
1.949TyrTyr: 1.949 ± 0.65
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 38 proteins (8209 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski