Amino acid dipeptide frequency for Lactococcus phage P4565

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
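As a rough illustration of how such frequencies could be computed, here is a minimal Python sketch. It assumes overlapping pairs are counted within each protein (never across protein boundaries), that any letter outside the 20 standard one-letter codes is mapped to X (Xaa), and that counts are normalised by the total number of dipeptides. The function name dipeptide_frequencies is hypothetical, and the exact counting and normalisation used for the table on this page are not stated here.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table columns (Ala, Cys, Asp, ...).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies for all 441 ordered pairs (20 residues + X)."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything that is not a standard residue becomes X (Xaa).
        cleaned = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = AMINO_ACIDS + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Toy input, not taken from the phage proteome.
freqs = dipeptide_frequencies(["MKKLA", "MKXA"])
print(round(freqs["MK"], 1))  # 2 of the 7 dipeptides are MK
```

The result always contains all 441 keys, so pairs that never occur show up with a frequency of 0.0, as in the Xaa rows of the table.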

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
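The arithmetic behind the expected frequency and the unit conversion can be sketched as follows. This is only an illustration: the helper names are hypothetical, and the simple greater-than/less-than comparison against the uniform expectation is an assumption, since the page does not state the exact rule used for the red and blue marking.

```python
def expected_uniform_permille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(freq_permille, expected_permille):
    """Assumed marking rule: compare directly against the uniform expectation."""
    if freq_permille > expected_permille:
        return "over"
    if freq_permille < expected_permille:
        return "under"
    return "as expected"


expected = expected_uniform_permille(20)   # 400 possible dipeptides
print(expected)                            # 2.5 per mille, i.e. 0.25%
print(classify(9.402, expected))           # GluLeu value from the table
print(classify(0.104, expected))           # CysHis value from the table
```

With Xaa included, expected_uniform_permille(21) gives the corresponding value for 441 possible dipeptides.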

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.731 ± 0.444
AlaCys: 0.313 ± 0.21
AlaAsp: 2.612 ± 0.541
AlaGlu: 4.492 ± 0.889
AlaPhe: 3.239 ± 0.773
AlaGly: 3.865 ± 0.598
AlaHis: 0.94 ± 0.396
AlaIle: 4.91 ± 0.967
AlaLys: 6.477 ± 0.831
AlaLeu: 5.955 ± 0.957
AlaMet: 2.089 ± 0.56
AlaAsn: 4.388 ± 0.845
AlaPro: 0.94 ± 0.354
AlaGln: 1.88 ± 0.569
AlaArg: 1.985 ± 0.428
AlaSer: 2.612 ± 0.635
AlaThr: 2.716 ± 0.69
AlaVal: 3.448 ± 0.69
AlaTrp: 1.985 ± 0.78
AlaTyr: 1.567 ± 0.44
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.209 ± 0.162
CysCys: 0.0 ± 0.0
CysAsp: 0.209 ± 0.158
CysGlu: 0.522 ± 0.268
CysPhe: 0.209 ± 0.223
CysGly: 0.731 ± 0.323
CysHis: 0.104 ± 0.105
CysIle: 0.627 ± 0.215
CysLys: 0.627 ± 0.323
CysLeu: 0.522 ± 0.275
CysMet: 0.104 ± 0.091
CysAsn: 0.313 ± 0.187
CysPro: 0.104 ± 0.115
CysGln: 0.418 ± 0.191
CysArg: 0.313 ± 0.191
CysSer: 0.209 ± 0.152
CysThr: 0.209 ± 0.149
CysVal: 0.418 ± 0.194
CysTrp: 0.0 ± 0.0
CysTyr: 0.209 ± 0.173
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.194 ± 0.777
AspCys: 0.313 ± 0.169
AspAsp: 3.552 ± 0.788
AspGlu: 3.656 ± 0.711
AspPhe: 3.552 ± 0.573
AspGly: 3.865 ± 0.608
AspHis: 0.627 ± 0.331
AspIle: 4.388 ± 0.811
AspLys: 5.955 ± 1.018
AspLeu: 6.164 ± 0.924
AspMet: 0.94 ± 0.258
AspAsn: 4.597 ± 0.941
AspPro: 1.672 ± 0.386
AspGln: 0.731 ± 0.268
AspArg: 2.194 ± 0.508
AspSer: 2.821 ± 0.576
AspThr: 4.492 ± 0.709
AspVal: 2.716 ± 0.633
AspTrp: 1.149 ± 0.343
AspTyr: 2.612 ± 0.527
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.074 ± 0.568
GluCys: 0.418 ± 0.209
GluAsp: 2.716 ± 0.528
GluGlu: 5.015 ± 1.061
GluPhe: 3.97 ± 0.468
GluGly: 2.403 ± 0.407
GluHis: 0.836 ± 0.314
GluIle: 6.059 ± 0.855
GluLys: 6.268 ± 1.103
GluLeu: 9.402 ± 1.349
GluMet: 2.821 ± 0.526
GluAsn: 5.015 ± 0.721
GluPro: 1.045 ± 0.427
GluGln: 3.134 ± 0.734
GluArg: 2.821 ± 0.422
GluSer: 2.925 ± 0.453
GluThr: 4.597 ± 0.669
GluVal: 5.328 ± 0.719
GluTrp: 1.254 ± 0.344
GluTyr: 3.239 ± 0.657
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.298 ± 0.513
PheCys: 0.209 ± 0.153
PheAsp: 3.761 ± 0.545
PheGlu: 2.925 ± 0.636
PhePhe: 2.089 ± 0.513
PheGly: 3.134 ± 0.568
PheHis: 0.418 ± 0.259
PheIle: 3.03 ± 0.589
PheLys: 3.552 ± 0.719
PheLeu: 2.716 ± 0.486
PheMet: 1.149 ± 0.335
PheAsn: 3.03 ± 0.595
PhePro: 0.627 ± 0.241
PheGln: 1.254 ± 0.481
PheArg: 1.358 ± 0.369
PheSer: 4.283 ± 0.636
PheThr: 2.612 ± 0.389
PheVal: 2.298 ± 0.323
PheTrp: 0.209 ± 0.147
PheTyr: 1.985 ± 0.515
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.761 ± 0.963
GlyCys: 0.209 ± 0.138
GlyAsp: 3.343 ± 0.619
GlyGlu: 3.656 ± 0.712
GlyPhe: 1.672 ± 0.412
GlyGly: 4.597 ± 0.957
GlyHis: 0.94 ± 0.29
GlyIle: 4.806 ± 1.338
GlyLys: 5.85 ± 0.673
GlyLeu: 6.059 ± 1.043
GlyMet: 1.254 ± 0.403
GlyAsn: 3.656 ± 0.542
GlyPro: 0.104 ± 0.097
GlyGln: 1.672 ± 0.456
GlyArg: 2.298 ± 0.417
GlySer: 5.224 ± 0.988
GlyThr: 4.074 ± 0.935
GlyVal: 5.015 ± 0.865
GlyTrp: 1.045 ± 0.333
GlyTyr: 3.239 ± 0.68
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.731 ± 0.271
HisCys: 0.209 ± 0.138
HisAsp: 0.94 ± 0.336
HisGlu: 0.418 ± 0.195
HisPhe: 0.418 ± 0.197
HisGly: 0.94 ± 0.383
HisHis: 0.104 ± 0.106
HisIle: 0.731 ± 0.31
HisLys: 0.627 ± 0.286
HisLeu: 1.254 ± 0.421
HisMet: 0.209 ± 0.211
HisAsn: 1.672 ± 0.503
HisPro: 0.313 ± 0.164
HisGln: 0.522 ± 0.245
HisArg: 0.209 ± 0.153
HisSer: 0.209 ± 0.156
HisThr: 0.731 ± 0.266
HisVal: 0.522 ± 0.261
HisTrp: 0.209 ± 0.218
HisTyr: 0.627 ± 0.221
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.179 ± 0.596
IleCys: 0.104 ± 0.105
IleAsp: 4.701 ± 0.561
IleGlu: 6.164 ± 1.073
IlePhe: 2.716 ± 0.515
IleGly: 3.134 ± 0.819
IleHis: 0.836 ± 0.284
IleIle: 5.537 ± 0.867
IleLys: 7.209 ± 0.658
IleLeu: 5.537 ± 0.771
IleMet: 1.045 ± 0.342
IleAsn: 5.641 ± 0.709
IlePro: 1.985 ± 0.656
IleGln: 2.194 ± 0.472
IleArg: 1.672 ± 0.423
IleSer: 5.955 ± 1.053
IleThr: 5.746 ± 0.726
IleVal: 4.91 ± 0.73
IleTrp: 1.045 ± 0.32
IleTyr: 2.298 ± 0.403
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.791 ± 0.949
LysCys: 0.418 ± 0.217
LysAsp: 4.597 ± 0.615
LysGlu: 8.044 ± 1.165
LysPhe: 2.089 ± 0.577
LysGly: 5.537 ± 0.898
LysHis: 1.045 ± 0.34
LysIle: 6.582 ± 0.944
LysLys: 9.402 ± 1.175
LysLeu: 7.313 ± 0.79
LysMet: 2.612 ± 0.347
LysAsn: 5.224 ± 0.667
LysPro: 1.672 ± 0.401
LysGln: 3.448 ± 0.592
LysArg: 4.074 ± 0.797
LysSer: 5.224 ± 0.989
LysThr: 5.537 ± 0.591
LysVal: 6.477 ± 0.807
LysTrp: 1.358 ± 0.374
LysTyr: 4.179 ± 0.705
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.91 ± 0.652
LeuCys: 0.418 ± 0.196
LeuAsp: 5.119 ± 0.719
LeuGlu: 6.268 ± 0.861
LeuPhe: 3.656 ± 0.623
LeuGly: 4.701 ± 1.022
LeuHis: 1.045 ± 0.357
LeuIle: 6.477 ± 0.778
LeuLys: 8.567 ± 1.046
LeuLeu: 6.164 ± 1.091
LeuMet: 2.194 ± 0.623
LeuAsn: 5.433 ± 0.819
LeuPro: 3.134 ± 0.467
LeuGln: 4.074 ± 1.0
LeuArg: 2.403 ± 0.572
LeuSer: 5.433 ± 0.682
LeuThr: 6.059 ± 0.849
LeuVal: 5.955 ± 0.742
LeuTrp: 1.254 ± 0.295
LeuTyr: 4.283 ± 0.755
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.88 ± 0.452
MetCys: 0.104 ± 0.113
MetAsp: 1.463 ± 0.409
MetGlu: 1.88 ± 0.512
MetPhe: 0.418 ± 0.206
MetGly: 0.94 ± 0.325
MetHis: 0.313 ± 0.182
MetIle: 1.985 ± 0.464
MetLys: 1.88 ± 0.484
MetLeu: 1.985 ± 0.609
MetMet: 0.313 ± 0.181
MetAsn: 1.985 ± 0.434
MetPro: 0.522 ± 0.232
MetGln: 1.776 ± 0.367
MetArg: 0.418 ± 0.265
MetSer: 1.776 ± 0.448
MetThr: 1.88 ± 0.452
MetVal: 1.567 ± 0.345
MetTrp: 0.0 ± 0.0
MetTyr: 1.254 ± 0.418
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.015 ± 1.03
AsnCys: 0.627 ± 0.39
AsnAsp: 4.283 ± 0.71
AsnGlu: 4.806 ± 0.698
AsnPhe: 2.507 ± 0.632
AsnGly: 7.0 ± 0.96
AsnHis: 0.522 ± 0.212
AsnIle: 4.492 ± 0.7
AsnLys: 7.209 ± 0.904
AsnLeu: 5.433 ± 0.77
AsnMet: 1.463 ± 0.372
AsnAsn: 3.97 ± 1.047
AsnPro: 1.88 ± 0.361
AsnGln: 2.507 ± 0.554
AsnArg: 1.672 ± 0.327
AsnSer: 5.119 ± 0.916
AsnThr: 4.179 ± 0.735
AsnVal: 3.448 ± 0.656
AsnTrp: 1.254 ± 0.353
AsnTyr: 2.298 ± 0.503
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.358 ± 0.363
ProCys: 0.104 ± 0.107
ProAsp: 1.776 ± 0.407
ProGlu: 1.463 ± 0.466
ProPhe: 1.045 ± 0.354
ProGly: 0.418 ± 0.202
ProHis: 0.104 ± 0.105
ProIle: 2.194 ± 0.525
ProLys: 1.672 ± 0.424
ProLeu: 1.88 ± 0.497
ProMet: 0.731 ± 0.262
ProAsn: 1.672 ± 0.578
ProPro: 0.627 ± 0.281
ProGln: 0.731 ± 0.333
ProArg: 0.418 ± 0.197
ProSer: 1.463 ± 0.49
ProThr: 2.194 ± 0.411
ProVal: 1.567 ± 0.398
ProTrp: 0.0 ± 0.0
ProTyr: 0.94 ± 0.346
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.716 ± 0.734
GlnCys: 0.104 ± 0.121
GlnAsp: 2.089 ± 0.438
GlnGlu: 2.507 ± 0.608
GlnPhe: 1.149 ± 0.345
GlnGly: 3.239 ± 0.506
GlnHis: 0.209 ± 0.151
GlnIle: 1.358 ± 0.326
GlnLys: 2.507 ± 0.56
GlnLeu: 4.492 ± 1.197
GlnMet: 0.836 ± 0.251
GlnAsn: 2.194 ± 0.413
GlnPro: 1.254 ± 0.405
GlnGln: 2.089 ± 0.819
GlnArg: 1.776 ± 0.453
GlnSer: 1.672 ± 0.451
GlnThr: 3.03 ± 0.521
GlnVal: 2.089 ± 0.433
GlnTrp: 0.627 ± 0.238
GlnTyr: 1.045 ± 0.295
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.089 ± 0.482
ArgCys: 0.313 ± 0.164
ArgAsp: 1.776 ± 0.453
ArgGlu: 2.507 ± 0.451
ArgPhe: 1.254 ± 0.337
ArgGly: 1.672 ± 0.366
ArgHis: 0.627 ± 0.256
ArgIle: 1.985 ± 0.42
ArgLys: 2.716 ± 0.574
ArgLeu: 3.656 ± 0.677
ArgMet: 0.522 ± 0.246
ArgAsn: 2.612 ± 0.559
ArgPro: 1.358 ± 0.349
ArgGln: 1.254 ± 0.346
ArgArg: 1.463 ± 0.397
ArgSer: 1.985 ± 0.388
ArgThr: 1.88 ± 0.324
ArgVal: 1.985 ± 0.491
ArgTrp: 0.313 ± 0.162
ArgTyr: 1.567 ± 0.368
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.074 ± 1.18
SerCys: 0.836 ± 0.376
SerAsp: 3.97 ± 0.53
SerGlu: 3.134 ± 0.511
SerPhe: 3.343 ± 0.687
SerGly: 5.433 ± 1.112
SerHis: 1.149 ± 0.313
SerIle: 4.492 ± 0.953
SerLys: 5.85 ± 0.891
SerLeu: 5.119 ± 0.665
SerMet: 1.463 ± 0.325
SerAsn: 3.865 ± 0.766
SerPro: 0.94 ± 0.27
SerGln: 2.194 ± 0.528
SerArg: 2.298 ± 0.37
SerSer: 4.597 ± 0.883
SerThr: 2.612 ± 0.431
SerVal: 4.179 ± 0.847
SerTrp: 0.94 ± 0.279
SerTyr: 3.343 ± 0.615
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.179 ± 0.573
ThrCys: 0.313 ± 0.186
ThrAsp: 3.552 ± 0.671
ThrGlu: 6.268 ± 0.68
ThrPhe: 2.612 ± 0.505
ThrGly: 4.283 ± 0.599
ThrHis: 0.104 ± 0.091
ThrIle: 4.388 ± 0.771
ThrLys: 4.597 ± 0.611
ThrLeu: 5.85 ± 0.746
ThrMet: 1.045 ± 0.325
ThrAsn: 5.641 ± 0.998
ThrPro: 1.463 ± 0.34
ThrGln: 2.716 ± 0.434
ThrArg: 1.776 ± 0.494
ThrSer: 4.179 ± 0.787
ThrThr: 4.701 ± 0.74
ThrVal: 4.283 ± 0.787
ThrTrp: 0.731 ± 0.308
ThrTyr: 2.821 ± 0.612
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.656 ± 0.556
ValCys: 0.313 ± 0.174
ValAsp: 4.701 ± 0.728
ValGlu: 4.701 ± 0.504
ValPhe: 3.343 ± 0.555
ValGly: 2.925 ± 0.515
ValHis: 0.627 ± 0.23
ValIle: 4.597 ± 0.589
ValLys: 6.477 ± 0.782
ValLeu: 2.925 ± 0.558
ValMet: 1.985 ± 0.409
ValAsn: 3.343 ± 0.701
ValPro: 1.358 ± 0.522
ValGln: 2.298 ± 0.571
ValArg: 2.925 ± 0.729
ValSer: 4.806 ± 1.207
ValThr: 4.806 ± 0.962
ValVal: 3.656 ± 0.906
ValTrp: 0.313 ± 0.211
ValTyr: 2.925 ± 0.492
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.836 ± 0.205
TrpCys: 0.209 ± 0.162
TrpAsp: 1.045 ± 0.584
TrpGlu: 0.731 ± 0.255
TrpPhe: 1.045 ± 0.492
TrpGly: 0.94 ± 0.33
TrpHis: 0.209 ± 0.179
TrpIle: 0.627 ± 0.245
TrpLys: 1.149 ± 0.362
TrpLeu: 1.567 ± 0.467
TrpMet: 0.209 ± 0.132
TrpAsn: 1.149 ± 0.32
TrpPro: 0.0 ± 0.0
TrpGln: 0.836 ± 0.268
TrpArg: 0.313 ± 0.241
TrpSer: 1.045 ± 0.27
TrpThr: 0.522 ± 0.246
TrpVal: 0.627 ± 0.26
TrpTrp: 0.209 ± 0.143
TrpTyr: 0.731 ± 0.291
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.463 ± 0.497
TyrCys: 0.522 ± 0.362
TyrAsp: 1.985 ± 0.534
TyrGlu: 4.179 ± 0.85
TyrPhe: 2.612 ± 0.531
TyrGly: 2.716 ± 0.558
TyrHis: 0.836 ± 0.305
TyrIle: 3.343 ± 0.723
TyrLys: 3.134 ± 0.637
TyrLeu: 3.552 ± 0.675
TyrMet: 1.254 ± 0.364
TyrAsn: 4.283 ± 0.643
TyrPro: 1.358 ± 0.423
TyrGln: 1.254 ± 0.4
TyrArg: 1.149 ± 0.383
TyrSer: 2.298 ± 0.495
TyrThr: 2.925 ± 0.602
TyrVal: 2.194 ± 0.475
TyrTrp: 0.104 ± 0.097
TyrTyr: 2.403 ± 0.616
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (9573 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
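Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is a minimal illustration, not the database's actual code. The function names, the fixed seed, the use of the sample standard deviation as the error, and the normalisation by the total number of dipeptides are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, pair):
    """Per-mille frequency of one dipeptide, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not taken from the phage proteome.
toy_proteome = ["MKKLAEL", "MSKKGEL", "MTELLKK", "MAAGS"]
print(bootstrap_error(toy_proteome, "KK"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.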

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski