Amino acid dipepetide frequency for Lactococcus phage P335

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.602AlaAla: 4.602 ± 1.016
0.294AlaCys: 0.294 ± 0.14
4.113AlaAsp: 4.113 ± 0.627
3.819AlaGlu: 3.819 ± 0.886
3.427AlaPhe: 3.427 ± 0.479
2.938AlaGly: 2.938 ± 0.826
0.685AlaHis: 0.685 ± 0.294
4.7AlaIle: 4.7 ± 1.051
4.505AlaLys: 4.505 ± 0.689
6.561AlaLeu: 6.561 ± 0.737
1.469AlaMet: 1.469 ± 0.375
4.896AlaAsn: 4.896 ± 0.609
1.567AlaPro: 1.567 ± 0.459
3.134AlaGln: 3.134 ± 0.572
2.644AlaArg: 2.644 ± 0.633
4.602AlaSer: 4.602 ± 0.739
3.721AlaThr: 3.721 ± 0.739
2.644AlaVal: 2.644 ± 0.639
1.273AlaTrp: 1.273 ± 0.334
1.763AlaTyr: 1.763 ± 0.413
0.0AlaXaa: 0.0 ± 0.0
Cys
0.098CysAla: 0.098 ± 0.101
0.0CysCys: 0.0 ± 0.0
0.49CysAsp: 0.49 ± 0.297
0.49CysGlu: 0.49 ± 0.166
0.294CysPhe: 0.294 ± 0.182
0.294CysGly: 0.294 ± 0.174
0.098CysHis: 0.098 ± 0.103
0.49CysIle: 0.49 ± 0.245
0.49CysLys: 0.49 ± 0.193
0.392CysLeu: 0.392 ± 0.211
0.0CysMet: 0.0 ± 0.0
0.294CysAsn: 0.294 ± 0.184
0.098CysPro: 0.098 ± 0.096
0.0CysGln: 0.0 ± 0.0
0.294CysArg: 0.294 ± 0.159
0.392CysSer: 0.392 ± 0.212
0.294CysThr: 0.294 ± 0.161
0.294CysVal: 0.294 ± 0.145
0.0CysTrp: 0.0 ± 0.0
0.392CysTyr: 0.392 ± 0.231
0.0CysXaa: 0.0 ± 0.0
Asp
2.84AspAla: 2.84 ± 0.582
0.49AspCys: 0.49 ± 0.2
4.798AspAsp: 4.798 ± 0.716
5.973AspGlu: 5.973 ± 1.128
2.252AspPhe: 2.252 ± 0.606
4.015AspGly: 4.015 ± 0.795
0.294AspHis: 0.294 ± 0.159
4.798AspIle: 4.798 ± 0.588
5.092AspLys: 5.092 ± 0.839
5.288AspLeu: 5.288 ± 0.724
1.175AspMet: 1.175 ± 0.32
4.113AspAsn: 4.113 ± 0.503
1.175AspPro: 1.175 ± 0.419
1.077AspGln: 1.077 ± 0.302
2.644AspArg: 2.644 ± 0.367
4.113AspSer: 4.113 ± 0.633
3.623AspThr: 3.623 ± 0.552
3.917AspVal: 3.917 ± 0.61
1.371AspTrp: 1.371 ± 0.536
2.84AspTyr: 2.84 ± 0.566
0.0AspXaa: 0.0 ± 0.0
Glu
4.113GluAla: 4.113 ± 0.724
0.196GluCys: 0.196 ± 0.121
2.938GluAsp: 2.938 ± 0.678
5.386GluGlu: 5.386 ± 1.197
3.721GluPhe: 3.721 ± 0.646
3.036GluGly: 3.036 ± 0.663
0.979GluHis: 0.979 ± 0.338
5.973GluIle: 5.973 ± 0.781
6.855GluLys: 6.855 ± 1.531
8.324GluLeu: 8.324 ± 1.122
2.252GluMet: 2.252 ± 0.529
4.015GluAsn: 4.015 ± 0.783
1.958GluPro: 1.958 ± 0.534
3.721GluGln: 3.721 ± 0.547
2.84GluArg: 2.84 ± 0.644
3.427GluSer: 3.427 ± 0.592
3.917GluThr: 3.917 ± 0.689
5.288GluVal: 5.288 ± 0.709
0.881GluTrp: 0.881 ± 0.284
3.427GluTyr: 3.427 ± 0.694
0.0GluXaa: 0.0 ± 0.0
Phe
2.448PheAla: 2.448 ± 0.552
0.392PheCys: 0.392 ± 0.163
4.015PheAsp: 4.015 ± 0.708
2.742PheGlu: 2.742 ± 0.501
1.273PhePhe: 1.273 ± 0.431
3.036PheGly: 3.036 ± 0.569
0.392PheHis: 0.392 ± 0.2
3.134PheIle: 3.134 ± 0.552
3.721PheLys: 3.721 ± 0.686
2.84PheLeu: 2.84 ± 0.567
1.469PheMet: 1.469 ± 0.431
2.546PheAsn: 2.546 ± 0.509
1.273PhePro: 1.273 ± 0.396
1.469PheGln: 1.469 ± 0.375
1.567PheArg: 1.567 ± 0.354
2.644PheSer: 2.644 ± 0.494
2.644PheThr: 2.644 ± 0.616
2.742PheVal: 2.742 ± 0.525
0.294PheTrp: 0.294 ± 0.161
1.273PheTyr: 1.273 ± 0.359
0.0PheXaa: 0.0 ± 0.0
Gly
3.819GlyAla: 3.819 ± 0.986
0.392GlyCys: 0.392 ± 0.232
4.015GlyAsp: 4.015 ± 0.617
3.231GlyGlu: 3.231 ± 0.607
3.036GlyPhe: 3.036 ± 0.758
5.19GlyGly: 5.19 ± 1.055
0.783GlyHis: 0.783 ± 0.261
5.582GlyIle: 5.582 ± 1.013
5.68GlyLys: 5.68 ± 0.685
4.113GlyLeu: 4.113 ± 0.636
1.665GlyMet: 1.665 ± 0.494
3.427GlyAsn: 3.427 ± 0.756
0.979GlyPro: 0.979 ± 0.276
3.036GlyGln: 3.036 ± 0.721
2.84GlyArg: 2.84 ± 0.631
3.819GlySer: 3.819 ± 0.596
4.113GlyThr: 4.113 ± 0.7
3.623GlyVal: 3.623 ± 0.881
0.685GlyTrp: 0.685 ± 0.234
3.819GlyTyr: 3.819 ± 0.615
0.0GlyXaa: 0.0 ± 0.0
His
0.979HisAla: 0.979 ± 0.381
0.196HisCys: 0.196 ± 0.132
0.881HisAsp: 0.881 ± 0.306
0.588HisGlu: 0.588 ± 0.28
0.49HisPhe: 0.49 ± 0.195
0.979HisGly: 0.979 ± 0.323
0.0HisHis: 0.0 ± 0.0
1.077HisIle: 1.077 ± 0.38
1.077HisLys: 1.077 ± 0.348
0.588HisLeu: 0.588 ± 0.283
0.098HisMet: 0.098 ± 0.088
0.294HisAsn: 0.294 ± 0.194
0.588HisPro: 0.588 ± 0.196
0.588HisGln: 0.588 ± 0.303
0.196HisArg: 0.196 ± 0.13
0.783HisSer: 0.783 ± 0.295
0.685HisThr: 0.685 ± 0.27
0.881HisVal: 0.881 ± 0.237
0.392HisTrp: 0.392 ± 0.221
0.685HisTyr: 0.685 ± 0.254
0.0HisXaa: 0.0 ± 0.0
Ile
3.917IleAla: 3.917 ± 0.652
0.294IleCys: 0.294 ± 0.171
3.231IleAsp: 3.231 ± 0.536
5.386IleGlu: 5.386 ± 0.834
2.252IlePhe: 2.252 ± 0.547
4.994IleGly: 4.994 ± 0.827
0.685IleHis: 0.685 ± 0.26
4.7IleIle: 4.7 ± 1.023
7.54IleLys: 7.54 ± 0.707
3.623IleLeu: 3.623 ± 0.466
1.958IleMet: 1.958 ± 0.382
5.875IleAsn: 5.875 ± 0.886
2.252IlePro: 2.252 ± 0.438
3.231IleGln: 3.231 ± 0.662
2.056IleArg: 2.056 ± 0.423
6.365IleSer: 6.365 ± 0.859
4.602IleThr: 4.602 ± 0.704
4.211IleVal: 4.211 ± 0.746
0.685IleTrp: 0.685 ± 0.253
2.84IleTyr: 2.84 ± 0.66
0.0IleXaa: 0.0 ± 0.0
Lys
6.855LysAla: 6.855 ± 1.054
0.392LysCys: 0.392 ± 0.251
5.288LysAsp: 5.288 ± 0.782
6.757LysGlu: 6.757 ± 1.244
3.231LysPhe: 3.231 ± 0.594
4.7LysGly: 4.7 ± 0.706
2.252LysHis: 2.252 ± 0.612
5.68LysIle: 5.68 ± 1.076
8.519LysLys: 8.519 ± 1.297
7.932LysLeu: 7.932 ± 1.244
2.056LysMet: 2.056 ± 0.537
7.736LysAsn: 7.736 ± 0.977
2.448LysPro: 2.448 ± 0.468
3.721LysGln: 3.721 ± 0.555
4.309LysArg: 4.309 ± 0.783
5.875LysSer: 5.875 ± 0.779
4.407LysThr: 4.407 ± 0.669
4.505LysVal: 4.505 ± 0.732
1.175LysTrp: 1.175 ± 0.36
3.231LysTyr: 3.231 ± 0.628
0.0LysXaa: 0.0 ± 0.0
Leu
5.484LeuAla: 5.484 ± 0.822
0.392LeuCys: 0.392 ± 0.245
5.582LeuAsp: 5.582 ± 0.697
6.365LeuGlu: 6.365 ± 1.058
1.958LeuPhe: 1.958 ± 0.364
4.211LeuGly: 4.211 ± 0.49
0.685LeuHis: 0.685 ± 0.233
5.092LeuIle: 5.092 ± 0.748
6.659LeuLys: 6.659 ± 0.856
6.267LeuLeu: 6.267 ± 0.987
3.036LeuMet: 3.036 ± 0.493
5.484LeuAsn: 5.484 ± 0.634
2.154LeuPro: 2.154 ± 0.309
3.819LeuGln: 3.819 ± 0.616
2.154LeuArg: 2.154 ± 0.439
6.561LeuSer: 6.561 ± 0.826
5.875LeuThr: 5.875 ± 0.886
2.546LeuVal: 2.546 ± 0.458
0.881LeuTrp: 0.881 ± 0.242
2.84LeuTyr: 2.84 ± 0.461
0.0LeuXaa: 0.0 ± 0.0
Met
1.763MetAla: 1.763 ± 0.389
0.0MetCys: 0.0 ± 0.0
1.469MetAsp: 1.469 ± 0.371
2.056MetGlu: 2.056 ± 0.453
0.783MetPhe: 0.783 ± 0.227
1.273MetGly: 1.273 ± 0.463
0.098MetHis: 0.098 ± 0.096
1.763MetIle: 1.763 ± 0.357
2.35MetLys: 2.35 ± 0.571
1.175MetLeu: 1.175 ± 0.323
0.196MetMet: 0.196 ± 0.148
1.175MetAsn: 1.175 ± 0.373
0.881MetPro: 0.881 ± 0.309
0.979MetGln: 0.979 ± 0.285
1.077MetArg: 1.077 ± 0.293
2.056MetSer: 2.056 ± 0.411
2.938MetThr: 2.938 ± 0.565
0.881MetVal: 0.881 ± 0.283
0.098MetTrp: 0.098 ± 0.088
1.077MetTyr: 1.077 ± 0.522
0.0MetXaa: 0.0 ± 0.0
Asn
4.211AsnAla: 4.211 ± 0.747
0.196AsnCys: 0.196 ± 0.199
3.819AsnAsp: 3.819 ± 0.663
3.819AsnGlu: 3.819 ± 0.646
2.252AsnPhe: 2.252 ± 0.574
6.855AsnGly: 6.855 ± 0.886
0.979AsnHis: 0.979 ± 0.456
4.407AsnIle: 4.407 ± 0.52
6.267AsnLys: 6.267 ± 0.757
5.484AsnLeu: 5.484 ± 0.638
1.273AsnMet: 1.273 ± 0.306
4.211AsnAsn: 4.211 ± 0.869
2.448AsnPro: 2.448 ± 0.397
2.448AsnGln: 2.448 ± 0.501
2.056AsnArg: 2.056 ± 0.429
3.525AsnSer: 3.525 ± 0.576
3.134AsnThr: 3.134 ± 0.515
3.819AsnVal: 3.819 ± 0.554
1.077AsnTrp: 1.077 ± 0.287
2.742AsnTyr: 2.742 ± 0.601
0.0AsnXaa: 0.0 ± 0.0
Pro
1.077ProAla: 1.077 ± 0.361
0.294ProCys: 0.294 ± 0.174
2.154ProAsp: 2.154 ± 0.574
2.056ProGlu: 2.056 ± 0.447
1.469ProPhe: 1.469 ± 0.399
0.685ProGly: 0.685 ± 0.237
0.685ProHis: 0.685 ± 0.227
2.448ProIle: 2.448 ± 0.526
3.329ProLys: 3.329 ± 0.44
1.665ProLeu: 1.665 ± 0.319
0.49ProMet: 0.49 ± 0.22
1.763ProAsn: 1.763 ± 0.406
0.685ProPro: 0.685 ± 0.216
0.783ProGln: 0.783 ± 0.257
0.881ProArg: 0.881 ± 0.319
1.861ProSer: 1.861 ± 0.372
1.861ProThr: 1.861 ± 0.324
1.958ProVal: 1.958 ± 0.372
0.0ProTrp: 0.0 ± 0.0
1.175ProTyr: 1.175 ± 0.287
0.0ProXaa: 0.0 ± 0.0
Gln
4.211GlnAla: 4.211 ± 0.577
0.196GlnCys: 0.196 ± 0.118
1.371GlnAsp: 1.371 ± 0.453
4.7GlnGlu: 4.7 ± 0.591
1.469GlnPhe: 1.469 ± 0.401
2.84GlnGly: 2.84 ± 0.501
0.294GlnHis: 0.294 ± 0.172
2.546GlnIle: 2.546 ± 0.421
3.427GlnLys: 3.427 ± 0.606
2.448GlnLeu: 2.448 ± 0.51
1.175GlnMet: 1.175 ± 0.375
1.861GlnAsn: 1.861 ± 0.487
1.371GlnPro: 1.371 ± 0.491
2.252GlnGln: 2.252 ± 0.497
0.979GlnArg: 0.979 ± 0.326
3.134GlnSer: 3.134 ± 0.519
2.644GlnThr: 2.644 ± 0.458
3.036GlnVal: 3.036 ± 0.698
0.881GlnTrp: 0.881 ± 0.288
1.665GlnTyr: 1.665 ± 0.443
0.0GlnXaa: 0.0 ± 0.0
Arg
1.469ArgAla: 1.469 ± 0.377
0.196ArgCys: 0.196 ± 0.136
2.154ArgAsp: 2.154 ± 0.459
2.252ArgGlu: 2.252 ± 0.518
2.35ArgPhe: 2.35 ± 0.393
1.665ArgGly: 1.665 ± 0.448
0.392ArgHis: 0.392 ± 0.188
2.938ArgIle: 2.938 ± 0.674
3.721ArgLys: 3.721 ± 0.67
4.015ArgLeu: 4.015 ± 0.896
0.685ArgMet: 0.685 ± 0.228
2.154ArgAsn: 2.154 ± 0.47
0.783ArgPro: 0.783 ± 0.385
0.881ArgGln: 0.881 ± 0.261
1.077ArgArg: 1.077 ± 0.404
1.861ArgSer: 1.861 ± 0.424
1.763ArgThr: 1.763 ± 0.324
2.644ArgVal: 2.644 ± 0.417
0.49ArgTrp: 0.49 ± 0.222
2.35ArgTyr: 2.35 ± 0.538
0.0ArgXaa: 0.0 ± 0.0
Ser
4.113SerAla: 4.113 ± 0.606
0.196SerCys: 0.196 ± 0.125
4.798SerAsp: 4.798 ± 0.599
4.505SerGlu: 4.505 ± 0.668
3.721SerPhe: 3.721 ± 0.576
5.582SerGly: 5.582 ± 1.022
0.783SerHis: 0.783 ± 0.253
3.819SerIle: 3.819 ± 0.485
5.68SerLys: 5.68 ± 0.787
4.505SerLeu: 4.505 ± 0.55
1.175SerMet: 1.175 ± 0.337
4.896SerAsn: 4.896 ± 0.683
1.371SerPro: 1.371 ± 0.514
3.329SerGln: 3.329 ± 0.555
1.861SerArg: 1.861 ± 0.419
4.505SerSer: 4.505 ± 0.761
4.309SerThr: 4.309 ± 0.594
4.798SerVal: 4.798 ± 0.587
0.979SerTrp: 0.979 ± 0.26
2.546SerTyr: 2.546 ± 0.442
0.0SerXaa: 0.0 ± 0.0
Thr
4.211ThrAla: 4.211 ± 0.849
0.098ThrCys: 0.098 ± 0.099
3.329ThrAsp: 3.329 ± 0.533
3.623ThrGlu: 3.623 ± 0.721
3.525ThrPhe: 3.525 ± 0.667
5.19ThrGly: 5.19 ± 0.602
0.685ThrHis: 0.685 ± 0.254
3.525ThrIle: 3.525 ± 0.487
5.288ThrLys: 5.288 ± 0.913
4.309ThrLeu: 4.309 ± 0.644
1.175ThrMet: 1.175 ± 0.425
3.721ThrAsn: 3.721 ± 0.534
1.763ThrPro: 1.763 ± 0.544
2.252ThrGln: 2.252 ± 0.372
2.644ThrArg: 2.644 ± 0.512
3.721ThrSer: 3.721 ± 0.525
3.917ThrThr: 3.917 ± 0.721
4.309ThrVal: 4.309 ± 0.635
0.979ThrTrp: 0.979 ± 0.261
3.036ThrTyr: 3.036 ± 0.618
0.0ThrXaa: 0.0 ± 0.0
Val
3.623ValAla: 3.623 ± 0.82
0.392ValCys: 0.392 ± 0.197
3.819ValAsp: 3.819 ± 0.791
5.582ValGlu: 5.582 ± 0.811
2.35ValPhe: 2.35 ± 0.495
3.036ValGly: 3.036 ± 0.546
0.49ValHis: 0.49 ± 0.27
4.211ValIle: 4.211 ± 0.642
5.973ValLys: 5.973 ± 0.625
4.896ValLeu: 4.896 ± 0.884
1.469ValMet: 1.469 ± 0.403
3.819ValAsn: 3.819 ± 0.667
1.469ValPro: 1.469 ± 0.384
2.546ValGln: 2.546 ± 0.676
1.077ValArg: 1.077 ± 0.356
4.015ValSer: 4.015 ± 0.563
3.819ValThr: 3.819 ± 0.625
3.819ValVal: 3.819 ± 0.55
0.881ValTrp: 0.881 ± 0.283
1.958ValTyr: 1.958 ± 0.542
0.0ValXaa: 0.0 ± 0.0
Trp
1.273TrpAla: 1.273 ± 0.395
0.0TrpCys: 0.0 ± 0.0
0.881TrpAsp: 0.881 ± 0.264
1.077TrpGlu: 1.077 ± 0.354
0.49TrpPhe: 0.49 ± 0.206
0.49TrpGly: 0.49 ± 0.225
0.294TrpHis: 0.294 ± 0.169
1.273TrpIle: 1.273 ± 0.275
1.175TrpLys: 1.175 ± 0.34
0.49TrpLeu: 0.49 ± 0.229
0.098TrpMet: 0.098 ± 0.104
0.685TrpAsn: 0.685 ± 0.221
0.196TrpPro: 0.196 ± 0.137
1.371TrpGln: 1.371 ± 0.383
0.783TrpArg: 0.783 ± 0.289
0.783TrpSer: 0.783 ± 0.249
0.685TrpThr: 0.685 ± 0.324
0.979TrpVal: 0.979 ± 0.315
0.294TrpTrp: 0.294 ± 0.184
0.685TrpTyr: 0.685 ± 0.343
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.35TyrAla: 2.35 ± 0.489
0.588TyrCys: 0.588 ± 0.289
2.742TyrAsp: 2.742 ± 0.568
2.84TyrGlu: 2.84 ± 0.585
1.567TyrPhe: 1.567 ± 0.388
2.84TyrGly: 2.84 ± 0.633
0.49TyrHis: 0.49 ± 0.221
2.546TyrIle: 2.546 ± 0.531
3.721TyrLys: 3.721 ± 0.678
3.134TyrLeu: 3.134 ± 0.621
1.077TyrMet: 1.077 ± 0.311
2.056TyrAsn: 2.056 ± 0.491
1.861TyrPro: 1.861 ± 0.391
1.861TyrGln: 1.861 ± 0.433
1.861TyrArg: 1.861 ± 0.543
3.427TyrSer: 3.427 ± 0.628
2.35TyrThr: 2.35 ± 0.495
2.35TyrVal: 2.35 ± 0.446
0.685TyrTrp: 0.685 ± 0.263
1.567TyrTyr: 1.567 ± 0.371
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (10213 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski