Amino acid dipeptide frequency for Leuconostoc phage LDG

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
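As a rough illustration of what such a calculation involves, the sketch below counts adjacent residue pairs in a list of protein sequences and reports them in per mille. The function name `dipeptide_permille`, the one-letter codes, and the toy sequences are hypothetical. The page does not state exactly how pairs are counted or normalized, so counting overlapping pairs within each protein and dividing by the total number of pairs are assumptions.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not stated on the page): overlapping adjacent pairs are
    counted inside each protein, no pair spans two proteins, and counts
    are normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input in one-letter code, not taken from the phage proteome.
freqs = dipeptide_permille(["MKLLA", "MKA"])
```

Here "MK" occurs twice among six pairs, so its frequency is one third of the total; the table on this page uses three-letter codes for the same kind of pair.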

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
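The expectation can be checked with a few lines of arithmetic. The sketch below computes the uniform expectation in per mille and labels a value as over- or underrepresented against it. The names `expected_permille` and `classify` are hypothetical, and comparing against the plain uniform expectation (ignoring the reported error) is an assumption; the page does not say exactly how the red and blue marks are assigned.

```python
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa

def expected_permille(n_residues=N_STANDARD):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residues ** 2)

def classify(observed_permille, n_residues=N_STANDARD):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(n_residues)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"

print(expected_permille())    # 1/400 expressed in per mille
print(expected_permille(21))  # 1/441 expressed in per mille
print(classify(7.279))        # LysLeu value from the table
print(classify(0.123))        # CysAsp value from the table
```

With 20 amino acids the expectation is 2.5‰ (the 0.25% quoted above); with Xaa included it drops to about 2.27‰.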

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
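A minimal sketch of that unit conversion, using one value from the table as input; the helper name `permille_to_fraction` is hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3

lys_leu = permille_to_fraction(7.279)  # LysLeu value from the table
lys_leu_percent = lys_leu * 100        # the same value as a percentage
```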

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.74 ± 0.468
AlaCys: 0.0 ± 0.0
AlaAsp: 4.442 ± 0.781
AlaGlu: 1.974 ± 0.468
AlaPhe: 2.961 ± 0.822
AlaGly: 5.429 ± 0.861
AlaHis: 0.247 ± 0.154
AlaIle: 6.169 ± 1.085
AlaLys: 3.455 ± 0.501
AlaLeu: 4.935 ± 0.89
AlaMet: 1.604 ± 0.446
AlaAsn: 5.182 ± 0.848
AlaPro: 1.727 ± 0.412
AlaGln: 3.208 ± 0.606
AlaArg: 1.604 ± 0.41
AlaSer: 4.688 ± 0.935
AlaThr: 5.059 ± 0.933
AlaVal: 4.935 ± 0.771
AlaTrp: 0.864 ± 0.33
AlaTyr: 2.961 ± 0.699
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.123 ± 0.146
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.247 ± 0.25
CysIle: 0.123 ± 0.129
CysLys: 0.0 ± 0.0
CysLeu: 0.123 ± 0.12
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.123 ± 0.125
CysArg: 0.0 ± 0.0
CysSer: 0.123 ± 0.141
CysThr: 0.123 ± 0.128
CysVal: 0.123 ± 0.111
CysTrp: 0.123 ± 0.119
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.727 ± 0.395
AspCys: 0.247 ± 0.186
AspAsp: 4.565 ± 0.905
AspGlu: 4.318 ± 1.006
AspPhe: 3.948 ± 0.71
AspGly: 5.305 ± 0.969
AspHis: 1.11 ± 0.324
AspIle: 5.059 ± 0.75
AspLys: 5.059 ± 1.133
AspLeu: 5.429 ± 0.886
AspMet: 2.221 ± 0.517
AspAsn: 4.565 ± 0.722
AspPro: 2.344 ± 0.497
AspGln: 0.864 ± 0.344
AspArg: 1.481 ± 0.475
AspSer: 3.331 ± 0.758
AspThr: 3.455 ± 0.631
AspVal: 3.825 ± 0.503
AspTrp: 1.234 ± 0.315
AspTyr: 3.208 ± 0.567
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.221 ± 0.606
GluCys: 0.247 ± 0.159
GluAsp: 2.221 ± 0.523
GluGlu: 1.974 ± 0.682
GluPhe: 3.208 ± 0.673
GluGly: 1.234 ± 0.391
GluHis: 1.11 ± 0.393
GluIle: 4.935 ± 0.708
GluLys: 4.318 ± 0.802
GluLeu: 6.046 ± 0.785
GluMet: 0.987 ± 0.329
GluAsn: 4.688 ± 0.921
GluPro: 1.11 ± 0.348
GluGln: 2.344 ± 0.542
GluArg: 2.344 ± 0.624
GluSer: 2.961 ± 0.603
GluThr: 3.208 ± 0.686
GluVal: 2.838 ± 0.714
GluTrp: 0.617 ± 0.293
GluTyr: 2.344 ± 0.597
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.714 ± 0.639
PheCys: 0.123 ± 0.146
PheAsp: 3.331 ± 0.633
PheGlu: 2.714 ± 0.697
PhePhe: 1.11 ± 0.434
PheGly: 3.948 ± 0.65
PheHis: 0.617 ± 0.232
PheIle: 3.948 ± 0.712
PheLys: 3.701 ± 0.562
PheLeu: 3.208 ± 0.72
PheMet: 1.481 ± 0.538
PheAsn: 3.208 ± 0.566
PhePro: 0.987 ± 0.328
PheGln: 1.357 ± 0.517
PheArg: 1.234 ± 0.343
PheSer: 3.455 ± 0.818
PheThr: 3.455 ± 0.576
PheVal: 2.591 ± 0.55
PheTrp: 0.74 ± 0.333
PheTyr: 1.851 ± 0.44
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.688 ± 1.022
GlyCys: 0.0 ± 0.0
GlyAsp: 3.455 ± 0.631
GlyGlu: 2.468 ± 0.553
GlyPhe: 4.935 ± 1.013
GlyGly: 3.825 ± 0.829
GlyHis: 0.617 ± 0.312
GlyIle: 5.305 ± 1.431
GlyLys: 5.676 ± 0.823
GlyLeu: 5.059 ± 0.837
GlyMet: 1.481 ± 0.487
GlyAsn: 3.701 ± 0.859
GlyPro: 0.123 ± 0.119
GlyGln: 3.208 ± 0.493
GlyArg: 2.344 ± 0.464
GlySer: 5.922 ± 1.01
GlyThr: 5.676 ± 0.843
GlyVal: 5.305 ± 1.178
GlyTrp: 0.37 ± 0.187
GlyTyr: 2.468 ± 0.601
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.11 ± 0.361
HisCys: 0.123 ± 0.12
HisAsp: 0.987 ± 0.298
HisGlu: 0.864 ± 0.514
HisPhe: 0.123 ± 0.132
HisGly: 1.357 ± 0.452
HisHis: 0.247 ± 0.189
HisIle: 1.11 ± 0.301
HisLys: 0.74 ± 0.309
HisLeu: 0.987 ± 0.389
HisMet: 0.37 ± 0.205
HisAsn: 1.234 ± 0.352
HisPro: 0.123 ± 0.118
HisGln: 0.617 ± 0.248
HisArg: 0.37 ± 0.196
HisSer: 1.11 ± 0.409
HisThr: 0.617 ± 0.316
HisVal: 0.494 ± 0.194
HisTrp: 0.0 ± 0.0
HisTyr: 1.11 ± 0.358
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.565 ± 0.659
IleCys: 0.37 ± 0.221
IleAsp: 4.565 ± 0.878
IleGlu: 3.085 ± 0.789
IlePhe: 2.961 ± 0.502
IleGly: 5.182 ± 1.275
IleHis: 1.357 ± 0.369
IleIle: 5.182 ± 0.817
IleLys: 6.786 ± 1.038
IleLeu: 5.059 ± 0.792
IleMet: 1.727 ± 0.653
IleAsn: 3.948 ± 0.657
IlePro: 1.727 ± 0.435
IleGln: 2.468 ± 0.544
IleArg: 1.974 ± 0.396
IleSer: 5.182 ± 0.713
IleThr: 6.539 ± 0.949
IleVal: 4.935 ± 0.857
IleTrp: 0.74 ± 0.3
IleTyr: 3.085 ± 0.557
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.182 ± 0.709
LysCys: 0.0 ± 0.0
LysAsp: 3.578 ± 0.695
LysGlu: 2.961 ± 0.563
LysPhe: 3.701 ± 0.89
LysGly: 4.195 ± 0.597
LysHis: 1.234 ± 0.419
LysIle: 4.565 ± 0.654
LysLys: 5.305 ± 1.006
LysLeu: 7.279 ± 0.795
LysMet: 2.468 ± 0.586
LysAsn: 5.059 ± 0.659
LysPro: 2.961 ± 0.613
LysGln: 3.701 ± 0.709
LysArg: 3.208 ± 0.713
LysSer: 5.182 ± 0.798
LysThr: 4.935 ± 0.812
LysVal: 3.331 ± 0.632
LysTrp: 0.864 ± 0.376
LysTyr: 3.455 ± 0.757
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.033 ± 0.927
LeuCys: 0.0 ± 0.0
LeuAsp: 6.046 ± 0.766
LeuGlu: 5.429 ± 0.962
LeuPhe: 3.085 ± 0.489
LeuGly: 6.292 ± 1.123
LeuHis: 1.851 ± 0.404
LeuIle: 3.948 ± 0.77
LeuLys: 6.046 ± 0.833
LeuLeu: 5.676 ± 0.967
LeuMet: 2.221 ± 0.391
LeuAsn: 4.935 ± 0.783
LeuPro: 2.591 ± 0.538
LeuGln: 3.701 ± 0.701
LeuArg: 1.974 ± 0.473
LeuSer: 5.922 ± 0.653
LeuThr: 6.663 ± 1.405
LeuVal: 5.799 ± 0.896
LeuTrp: 0.74 ± 0.236
LeuTyr: 3.208 ± 0.634
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.714 ± 0.419
MetCys: 0.0 ± 0.0
MetAsp: 0.987 ± 0.328
MetGlu: 0.74 ± 0.247
MetPhe: 0.74 ± 0.318
MetGly: 1.974 ± 0.653
MetHis: 0.247 ± 0.179
MetIle: 1.11 ± 0.333
MetLys: 1.357 ± 0.406
MetLeu: 0.987 ± 0.329
MetMet: 0.37 ± 0.223
MetAsn: 1.604 ± 0.447
MetPro: 1.234 ± 0.393
MetGln: 0.617 ± 0.265
MetArg: 0.74 ± 0.307
MetSer: 2.097 ± 0.494
MetThr: 1.974 ± 0.367
MetVal: 2.097 ± 0.485
MetTrp: 0.123 ± 0.135
MetTyr: 1.234 ± 0.443
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.429 ± 0.797
AsnCys: 0.0 ± 0.0
AsnAsp: 3.455 ± 0.642
AsnGlu: 3.331 ± 0.861
AsnPhe: 2.221 ± 0.583
AsnGly: 6.539 ± 0.878
AsnHis: 0.37 ± 0.202
AsnIle: 4.318 ± 0.656
AsnLys: 4.688 ± 0.688
AsnLeu: 4.688 ± 0.703
AsnMet: 1.481 ± 0.459
AsnAsn: 5.305 ± 0.709
AsnPro: 3.085 ± 0.657
AsnGln: 4.072 ± 0.964
AsnArg: 2.097 ± 0.549
AsnSer: 3.825 ± 0.683
AsnThr: 3.331 ± 0.836
AsnVal: 4.812 ± 0.775
AsnTrp: 0.864 ± 0.32
AsnTyr: 2.961 ± 0.737
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.604 ± 0.35
ProCys: 0.0 ± 0.0
ProAsp: 2.714 ± 0.671
ProGlu: 1.357 ± 0.452
ProPhe: 1.481 ± 0.497
ProGly: 0.247 ± 0.184
ProHis: 0.617 ± 0.343
ProIle: 2.591 ± 0.536
ProLys: 2.468 ± 0.579
ProLeu: 2.591 ± 0.409
ProMet: 0.37 ± 0.208
ProAsn: 1.727 ± 0.486
ProPro: 0.247 ± 0.152
ProGln: 1.604 ± 0.417
ProArg: 0.987 ± 0.436
ProSer: 2.714 ± 0.689
ProThr: 2.221 ± 0.434
ProVal: 1.727 ± 0.42
ProTrp: 0.0 ± 0.0
ProTyr: 1.851 ± 0.526
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.825 ± 1.086
GlnCys: 0.0 ± 0.0
GlnAsp: 2.838 ± 0.452
GlnGlu: 2.221 ± 0.649
GlnPhe: 1.604 ± 0.471
GlnGly: 1.851 ± 0.412
GlnHis: 0.123 ± 0.12
GlnIle: 3.208 ± 0.502
GlnLys: 2.714 ± 0.681
GlnLeu: 4.935 ± 0.974
GlnMet: 1.481 ± 0.411
GlnAsn: 2.468 ± 0.488
GlnPro: 1.357 ± 0.406
GlnGln: 2.468 ± 0.614
GlnArg: 2.097 ± 0.484
GlnSer: 3.331 ± 0.684
GlnThr: 3.455 ± 0.567
GlnVal: 2.961 ± 0.602
GlnTrp: 0.617 ± 0.257
GlnTyr: 2.221 ± 0.561
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.591 ± 0.488
ArgCys: 0.0 ± 0.0
ArgAsp: 2.221 ± 0.552
ArgGlu: 1.974 ± 0.481
ArgPhe: 0.987 ± 0.328
ArgGly: 1.604 ± 0.42
ArgHis: 0.494 ± 0.271
ArgIle: 2.344 ± 0.483
ArgLys: 1.974 ± 0.45
ArgLeu: 4.318 ± 0.735
ArgMet: 0.37 ± 0.213
ArgAsn: 0.987 ± 0.362
ArgPro: 1.357 ± 0.429
ArgGln: 2.221 ± 0.553
ArgArg: 0.494 ± 0.281
ArgSer: 1.481 ± 0.462
ArgThr: 2.097 ± 0.502
ArgVal: 2.468 ± 0.583
ArgTrp: 0.864 ± 0.313
ArgTyr: 1.357 ± 0.453
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.318 ± 0.579
SerCys: 0.0 ± 0.0
SerAsp: 4.812 ± 0.744
SerGlu: 4.195 ± 1.1
SerPhe: 2.591 ± 0.754
SerGly: 5.676 ± 1.424
SerHis: 1.234 ± 0.484
SerIle: 5.059 ± 0.714
SerLys: 5.552 ± 0.888
SerLeu: 5.922 ± 1.12
SerMet: 1.357 ± 0.452
SerAsn: 4.318 ± 0.947
SerPro: 1.357 ± 0.377
SerGln: 4.565 ± 0.679
SerArg: 2.468 ± 0.537
SerSer: 6.292 ± 1.23
SerThr: 5.429 ± 1.187
SerVal: 7.156 ± 1.158
SerTrp: 0.494 ± 0.254
SerTyr: 2.838 ± 0.499
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.195 ± 0.736
ThrCys: 0.0 ± 0.0
ThrAsp: 4.195 ± 0.751
ThrGlu: 3.455 ± 0.604
ThrPhe: 3.701 ± 0.649
ThrGly: 5.305 ± 0.733
ThrHis: 0.987 ± 0.421
ThrIle: 5.552 ± 0.864
ThrLys: 4.565 ± 0.779
ThrLeu: 5.305 ± 0.712
ThrMet: 1.11 ± 0.374
ThrAsn: 4.812 ± 0.745
ThrPro: 2.468 ± 0.46
ThrGln: 3.455 ± 0.708
ThrArg: 2.838 ± 0.669
ThrSer: 7.156 ± 1.579
ThrThr: 5.922 ± 0.944
ThrVal: 4.318 ± 0.638
ThrTrp: 0.987 ± 0.395
ThrTyr: 2.221 ± 0.556
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.195 ± 0.486
ValCys: 0.0 ± 0.0
ValAsp: 4.935 ± 0.734
ValGlu: 3.701 ± 0.823
ValPhe: 3.455 ± 0.7
ValGly: 3.578 ± 0.932
ValHis: 0.247 ± 0.179
ValIle: 3.701 ± 0.854
ValLys: 5.799 ± 0.827
ValLeu: 4.812 ± 0.867
ValMet: 1.357 ± 0.342
ValAsn: 4.935 ± 0.867
ValPro: 2.468 ± 0.518
ValGln: 2.838 ± 0.527
ValArg: 1.851 ± 0.448
ValSer: 6.292 ± 0.904
ValThr: 4.688 ± 0.784
ValVal: 4.195 ± 0.691
ValTrp: 0.37 ± 0.195
ValTyr: 3.701 ± 0.647
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.864 ± 0.298
TrpCys: 0.0 ± 0.0
TrpAsp: 0.74 ± 0.262
TrpGlu: 1.11 ± 0.341
TrpPhe: 0.494 ± 0.215
TrpGly: 0.864 ± 0.442
TrpHis: 0.247 ± 0.185
TrpIle: 0.617 ± 0.239
TrpLys: 0.247 ± 0.171
TrpLeu: 1.357 ± 0.439
TrpMet: 0.0 ± 0.0
TrpAsn: 0.74 ± 0.374
TrpPro: 0.0 ± 0.0
TrpGln: 0.494 ± 0.288
TrpArg: 0.617 ± 0.344
TrpSer: 1.234 ± 0.297
TrpThr: 0.617 ± 0.342
TrpVal: 0.494 ± 0.26
TrpTrp: 0.247 ± 0.158
TrpTyr: 0.617 ± 0.284
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.468 ± 0.473
TyrCys: 0.123 ± 0.125
TyrAsp: 3.085 ± 0.71
TyrGlu: 3.085 ± 0.851
TyrPhe: 2.468 ± 0.515
TyrGly: 2.221 ± 0.716
TyrHis: 0.494 ± 0.326
TyrIle: 2.591 ± 0.548
TyrLys: 2.714 ± 0.573
TyrLeu: 4.442 ± 0.562
TyrMet: 0.37 ± 0.214
TyrAsn: 3.455 ± 0.786
TyrPro: 1.851 ± 0.435
TyrGln: 1.974 ± 0.536
TyrArg: 1.481 ± 0.486
TyrSer: 3.331 ± 0.622
TyrThr: 3.085 ± 0.653
TyrVal: 2.714 ± 0.608
TyrTrp: 0.74 ± 0.259
TyrTyr: 1.974 ± 0.668
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 40 proteins (8106 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
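One way such an estimate could be obtained is sketched below: whole proteins are resampled with replacement 100 times, the frequency of a given dipeptide is recomputed for each resample, and the spread of those values is reported. The function names, the one-letter codes, the toy sequences, and the use of the sample standard deviation as the error are all assumptions; the page only states that bootstrapping (×100) was done at the protein level.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille (pairs counted per protein)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over resampled proteomes (assumed: std. dev.)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy input in one-letter code, not taken from the phage proteome.
toy = ["MKLLA", "MKA", "AAKL"]
err = bootstrap_error(toy, "MK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much a few unusual proteins can shift a proteome-wide frequency.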

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski