Amino acid dipeptide frequency for Lactococcus phage LP1502c

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
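As an illustration, dipeptide frequencies could be computed roughly as follows. This is a minimal sketch, not the database's actual pipeline: the function name `dipeptide_per_mille` is hypothetical, sequences are assumed to use one-letter codes, pairs are counted within each protein only, and the counts are normalised by the total number of pairs. The exact normalisation used for the table is not stated on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is mapped to X (Xaa).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein
    (never across protein boundaries) and divided by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            a = first if first in AMINO_ACIDS else "X"
            b = second if second in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome described here.
    for pair, value in sorted(dipeptide_per_mille(["MKLLEK", "MAKLE"]).items()):
        print(pair, round(value, 3))
```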

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
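The unit conversion and the over/under-representation rule can be sketched as below. The three example values are taken from the table (GluLeu, AlaAla, ProHis); the helper names are hypothetical, and the threshold is assumed to be the uniform expectation of 1/400 = 0.25% = 2.5‰, which may not be exactly the rule used to colour the original table.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille (assumed threshold).
UNIFORM_EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the expected value."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    examples = {"GluLeu": 9.456, "AlaAla": 0.88, "ProHis": 0.0}
    for name, value in examples.items():
        print(name, per_mille_to_fraction(value), classify(value))
```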

For more information, see the sequence space article on Wikipedia.

Rows: first residue of the dipeptide. Columns: second residue, in the order
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.88AlaAla: 0.88 ± 0.325
0.22AlaCys: 0.22 ± 0.133
3.299AlaAsp: 3.299 ± 0.608
3.958AlaGlu: 3.958 ± 0.756
2.639AlaPhe: 2.639 ± 0.526
4.178AlaGly: 4.178 ± 1.083
0.99AlaHis: 0.99 ± 0.232
4.508AlaIle: 4.508 ± 0.893
5.827AlaLys: 5.827 ± 0.78
6.157AlaLeu: 6.157 ± 1.049
2.529AlaMet: 2.529 ± 0.74
5.278AlaAsn: 5.278 ± 0.787
0.99AlaPro: 0.99 ± 0.388
2.529AlaGln: 2.529 ± 0.545
1.869AlaArg: 1.869 ± 0.419
3.518AlaSer: 3.518 ± 0.576
3.079AlaThr: 3.079 ± 0.679
5.278AlaVal: 5.278 ± 1.23
1.649AlaTrp: 1.649 ± 0.461
2.309AlaTyr: 2.309 ± 0.422
0.0AlaXaa: 0.0 ± 0.0
Cys
0.33CysAla: 0.33 ± 0.169
0.11CysCys: 0.11 ± 0.119
0.33CysAsp: 0.33 ± 0.214
0.22CysGlu: 0.22 ± 0.156
0.11CysPhe: 0.11 ± 0.121
0.66CysGly: 0.66 ± 0.287
0.22CysHis: 0.22 ± 0.162
0.55CysIle: 0.55 ± 0.281
0.66CysLys: 0.66 ± 0.303
0.55CysLeu: 0.55 ± 0.278
0.11CysMet: 0.11 ± 0.098
0.66CysAsn: 0.66 ± 0.346
0.11CysPro: 0.11 ± 0.11
0.33CysGln: 0.33 ± 0.199
0.55CysArg: 0.55 ± 0.272
0.22CysSer: 0.22 ± 0.172
0.33CysThr: 0.33 ± 0.207
0.44CysVal: 0.44 ± 0.215
0.22CysTrp: 0.22 ± 0.174
0.44CysTyr: 0.44 ± 0.27
0.0CysXaa: 0.0 ± 0.0
Asp
2.199AspAla: 2.199 ± 0.406
0.55AspCys: 0.55 ± 0.307
3.408AspAsp: 3.408 ± 0.742
3.958AspGlu: 3.958 ± 0.824
3.518AspPhe: 3.518 ± 0.566
3.738AspGly: 3.738 ± 0.504
0.66AspHis: 0.66 ± 0.266
3.848AspIle: 3.848 ± 0.53
5.168AspLys: 5.168 ± 0.762
6.047AspLeu: 6.047 ± 0.737
1.1AspMet: 1.1 ± 0.287
4.508AspAsn: 4.508 ± 0.658
1.1AspPro: 1.1 ± 0.447
0.55AspGln: 0.55 ± 0.267
1.979AspArg: 1.979 ± 0.502
3.518AspSer: 3.518 ± 0.652
5.388AspThr: 5.388 ± 0.739
3.958AspVal: 3.958 ± 0.609
0.88AspTrp: 0.88 ± 0.34
3.079AspTyr: 3.079 ± 0.589
0.0AspXaa: 0.0 ± 0.0
Glu
3.628GluAla: 3.628 ± 0.642
0.55GluCys: 0.55 ± 0.326
2.859GluAsp: 2.859 ± 0.553
5.498GluGlu: 5.498 ± 0.986
3.408GluPhe: 3.408 ± 0.515
2.199GluGly: 2.199 ± 0.396
0.99GluHis: 0.99 ± 0.444
5.388GluIle: 5.388 ± 0.651
5.937GluLys: 5.937 ± 1.226
9.456GluLeu: 9.456 ± 1.427
2.529GluMet: 2.529 ± 0.489
4.838GluAsn: 4.838 ± 0.709
1.649GluPro: 1.649 ± 0.462
4.178GluGln: 4.178 ± 0.804
2.529GluArg: 2.529 ± 0.588
3.958GluSer: 3.958 ± 0.58
4.508GluThr: 4.508 ± 0.81
4.398GluVal: 4.398 ± 0.56
0.77GluTrp: 0.77 ± 0.28
3.518GluTyr: 3.518 ± 0.665
0.0GluXaa: 0.0 ± 0.0
Phe
2.859PheAla: 2.859 ± 0.665
0.33PheCys: 0.33 ± 0.22
3.628PheAsp: 3.628 ± 0.582
2.749PheGlu: 2.749 ± 0.607
1.649PhePhe: 1.649 ± 0.636
2.419PheGly: 2.419 ± 0.53
0.22PheHis: 0.22 ± 0.132
2.859PheIle: 2.859 ± 0.507
3.738PheLys: 3.738 ± 0.616
1.759PheLeu: 1.759 ± 0.464
0.99PheMet: 0.99 ± 0.32
3.079PheAsn: 3.079 ± 0.891
1.1PhePro: 1.1 ± 0.301
0.88PheGln: 0.88 ± 0.292
0.99PheArg: 0.99 ± 0.282
4.068PheSer: 4.068 ± 0.741
2.639PheThr: 2.639 ± 0.428
2.859PheVal: 2.859 ± 0.402
0.22PheTrp: 0.22 ± 0.142
1.429PheTyr: 1.429 ± 0.369
0.0PheXaa: 0.0 ± 0.0
Gly
3.628GlyAla: 3.628 ± 1.386
0.44GlyCys: 0.44 ± 0.252
3.408GlyAsp: 3.408 ± 0.645
4.508GlyGlu: 4.508 ± 0.557
2.749GlyPhe: 2.749 ± 0.631
4.618GlyGly: 4.618 ± 0.946
0.44GlyHis: 0.44 ± 0.189
4.068GlyIle: 4.068 ± 1.178
6.377GlyLys: 6.377 ± 0.724
5.388GlyLeu: 5.388 ± 0.904
1.429GlyMet: 1.429 ± 0.38
2.859GlyAsn: 2.859 ± 0.624
0.22GlyPro: 0.22 ± 0.143
2.089GlyGln: 2.089 ± 0.463
1.759GlyArg: 1.759 ± 0.38
5.168GlySer: 5.168 ± 0.846
2.969GlyThr: 2.969 ± 0.668
6.157GlyVal: 6.157 ± 0.996
1.1GlyTrp: 1.1 ± 0.276
2.969GlyTyr: 2.969 ± 0.591
0.0GlyXaa: 0.0 ± 0.0
His
0.66HisAla: 0.66 ± 0.241
0.66HisCys: 0.66 ± 0.364
0.44HisAsp: 0.44 ± 0.237
0.55HisGlu: 0.55 ± 0.235
0.44HisPhe: 0.44 ± 0.272
1.319HisGly: 1.319 ± 0.393
0.11HisHis: 0.11 ± 0.095
0.55HisIle: 0.55 ± 0.269
0.77HisLys: 0.77 ± 0.311
1.1HisLeu: 1.1 ± 0.368
0.22HisMet: 0.22 ± 0.191
1.869HisAsn: 1.869 ± 0.558
0.22HisPro: 0.22 ± 0.151
0.22HisGln: 0.22 ± 0.148
0.11HisArg: 0.11 ± 0.094
0.44HisSer: 0.44 ± 0.216
0.88HisThr: 0.88 ± 0.282
0.77HisVal: 0.77 ± 0.379
0.22HisTrp: 0.22 ± 0.197
0.77HisTyr: 0.77 ± 0.308
0.0HisXaa: 0.0 ± 0.0
Ile
3.958IleAla: 3.958 ± 0.605
0.11IleCys: 0.11 ± 0.121
4.838IleAsp: 4.838 ± 0.506
6.267IleGlu: 6.267 ± 0.925
2.309IlePhe: 2.309 ± 0.604
3.848IleGly: 3.848 ± 0.935
0.77IleHis: 0.77 ± 0.293
3.848IleIle: 3.848 ± 0.47
7.147IleLys: 7.147 ± 0.801
4.508IleLeu: 4.508 ± 0.781
0.88IleMet: 0.88 ± 0.26
5.278IleAsn: 5.278 ± 0.684
1.429IlePro: 1.429 ± 0.331
2.529IleGln: 2.529 ± 0.467
1.649IleArg: 1.649 ± 0.389
3.738IleSer: 3.738 ± 0.622
4.838IleThr: 4.838 ± 0.612
4.178IleVal: 4.178 ± 0.683
1.759IleTrp: 1.759 ± 0.47
3.299IleTyr: 3.299 ± 0.559
0.0IleXaa: 0.0 ± 0.0
Lys
7.367LysAla: 7.367 ± 0.867
0.55LysCys: 0.55 ± 0.29
5.168LysAsp: 5.168 ± 0.653
7.257LysGlu: 7.257 ± 1.144
2.419LysPhe: 2.419 ± 0.488
5.388LysGly: 5.388 ± 0.938
1.1LysHis: 1.1 ± 0.378
4.728LysIle: 4.728 ± 0.672
7.257LysLys: 7.257 ± 0.965
7.806LysLeu: 7.806 ± 0.944
3.079LysMet: 3.079 ± 0.507
5.498LysAsn: 5.498 ± 0.797
1.319LysPro: 1.319 ± 0.379
3.958LysGln: 3.958 ± 0.692
4.068LysArg: 4.068 ± 0.829
4.838LysSer: 4.838 ± 0.685
5.717LysThr: 5.717 ± 0.653
5.937LysVal: 5.937 ± 0.775
1.759LysTrp: 1.759 ± 0.36
3.738LysTyr: 3.738 ± 0.684
0.0LysXaa: 0.0 ± 0.0
Leu
4.618LeuAla: 4.618 ± 0.666
0.33LeuCys: 0.33 ± 0.154
4.508LeuAsp: 4.508 ± 0.531
5.498LeuGlu: 5.498 ± 0.817
3.408LeuPhe: 3.408 ± 0.548
4.398LeuGly: 4.398 ± 0.621
0.99LeuHis: 0.99 ± 0.336
6.707LeuIle: 6.707 ± 1.015
8.686LeuLys: 8.686 ± 0.844
6.487LeuLeu: 6.487 ± 0.875
2.199LeuMet: 2.199 ± 0.527
5.058LeuAsn: 5.058 ± 0.716
3.299LeuPro: 3.299 ± 0.525
3.738LeuGln: 3.738 ± 0.549
2.199LeuArg: 2.199 ± 0.557
5.058LeuSer: 5.058 ± 0.713
6.047LeuThr: 6.047 ± 0.839
6.267LeuVal: 6.267 ± 0.595
1.539LeuTrp: 1.539 ± 0.394
4.178LeuTyr: 4.178 ± 0.765
0.0LeuXaa: 0.0 ± 0.0
Met
2.199MetAla: 2.199 ± 0.606
0.11MetCys: 0.11 ± 0.094
1.649MetAsp: 1.649 ± 0.52
1.1MetGlu: 1.1 ± 0.421
0.55MetPhe: 0.55 ± 0.194
1.209MetGly: 1.209 ± 0.298
0.33MetHis: 0.33 ± 0.189
2.309MetIle: 2.309 ± 0.559
2.969MetLys: 2.969 ± 0.651
1.979MetLeu: 1.979 ± 0.599
0.22MetMet: 0.22 ± 0.177
1.649MetAsn: 1.649 ± 0.404
0.55MetPro: 0.55 ± 0.252
1.649MetGln: 1.649 ± 0.424
0.55MetArg: 0.55 ± 0.312
1.539MetSer: 1.539 ± 0.386
2.309MetThr: 2.309 ± 0.63
1.429MetVal: 1.429 ± 0.4
0.11MetTrp: 0.11 ± 0.1
1.1MetTyr: 1.1 ± 0.377
0.0MetXaa: 0.0 ± 0.0
Asn
4.948AsnAla: 4.948 ± 1.123
0.22AsnCys: 0.22 ± 0.148
4.618AsnAsp: 4.618 ± 0.671
4.728AsnGlu: 4.728 ± 0.733
2.089AsnPhe: 2.089 ± 0.428
6.157AsnGly: 6.157 ± 0.855
0.88AsnHis: 0.88 ± 0.261
4.398AsnIle: 4.398 ± 0.749
7.037AsnLys: 7.037 ± 1.075
6.377AsnLeu: 6.377 ± 1.005
1.539AsnMet: 1.539 ± 0.36
2.749AsnAsn: 2.749 ± 0.479
2.199AsnPro: 2.199 ± 0.522
2.089AsnGln: 2.089 ± 0.453
1.759AsnArg: 1.759 ± 0.378
5.388AsnSer: 5.388 ± 0.614
4.618AsnThr: 4.618 ± 0.759
3.189AsnVal: 3.189 ± 0.741
0.77AsnTrp: 0.77 ± 0.358
2.529AsnTyr: 2.529 ± 0.63
0.0AsnXaa: 0.0 ± 0.0
Pro
1.209ProAla: 1.209 ± 0.39
0.11ProCys: 0.11 ± 0.114
1.429ProAsp: 1.429 ± 0.435
1.979ProGlu: 1.979 ± 0.556
1.1ProPhe: 1.1 ± 0.305
0.22ProGly: 0.22 ± 0.139
0.0ProHis: 0.0 ± 0.0
2.199ProIle: 2.199 ± 0.534
1.979ProLys: 1.979 ± 0.402
2.089ProLeu: 2.089 ± 0.528
0.88ProMet: 0.88 ± 0.291
2.419ProAsn: 2.419 ± 0.744
0.55ProPro: 0.55 ± 0.248
0.66ProGln: 0.66 ± 0.271
0.44ProArg: 0.44 ± 0.177
1.649ProSer: 1.649 ± 0.439
1.979ProThr: 1.979 ± 0.386
1.869ProVal: 1.869 ± 0.42
0.22ProTrp: 0.22 ± 0.135
0.55ProTyr: 0.55 ± 0.248
0.0ProXaa: 0.0 ± 0.0
Gln
3.628GlnAla: 3.628 ± 0.661
0.11GlnCys: 0.11 ± 0.119
2.199GlnAsp: 2.199 ± 0.618
2.419GlnGlu: 2.419 ± 0.739
1.539GlnPhe: 1.539 ± 0.419
2.969GlnGly: 2.969 ± 0.48
0.44GlnHis: 0.44 ± 0.19
2.309GlnIle: 2.309 ± 0.581
2.749GlnLys: 2.749 ± 0.641
2.969GlnLeu: 2.969 ± 0.473
1.209GlnMet: 1.209 ± 0.314
2.309GlnAsn: 2.309 ± 0.477
1.319GlnPro: 1.319 ± 0.37
1.869GlnGln: 1.869 ± 0.45
1.649GlnArg: 1.649 ± 0.425
2.199GlnSer: 2.199 ± 0.507
2.089GlnThr: 2.089 ± 0.408
2.419GlnVal: 2.419 ± 0.478
0.66GlnTrp: 0.66 ± 0.222
0.99GlnTyr: 0.99 ± 0.298
0.0GlnXaa: 0.0 ± 0.0
Arg
2.309ArgAla: 2.309 ± 0.656
0.33ArgCys: 0.33 ± 0.198
1.209ArgAsp: 1.209 ± 0.414
2.639ArgGlu: 2.639 ± 0.543
0.44ArgPhe: 0.44 ± 0.214
2.199ArgGly: 2.199 ± 0.422
0.77ArgHis: 0.77 ± 0.288
2.309ArgIle: 2.309 ± 0.497
3.848ArgLys: 3.848 ± 0.766
3.408ArgLeu: 3.408 ± 0.602
0.44ArgMet: 0.44 ± 0.244
2.089ArgAsn: 2.089 ± 0.489
0.88ArgPro: 0.88 ± 0.374
1.539ArgGln: 1.539 ± 0.39
1.539ArgArg: 1.539 ± 0.415
1.539ArgSer: 1.539 ± 0.452
1.869ArgThr: 1.869 ± 0.319
1.539ArgVal: 1.539 ± 0.401
0.22ArgTrp: 0.22 ± 0.133
1.979ArgTyr: 1.979 ± 0.468
0.0ArgXaa: 0.0 ± 0.0
Ser
5.607SerAla: 5.607 ± 1.356
0.88SerCys: 0.88 ± 0.39
4.398SerAsp: 4.398 ± 0.745
4.068SerGlu: 4.068 ± 0.804
3.299SerPhe: 3.299 ± 0.555
4.948SerGly: 4.948 ± 1.061
0.88SerHis: 0.88 ± 0.342
3.518SerIle: 3.518 ± 0.495
5.607SerLys: 5.607 ± 0.69
5.388SerLeu: 5.388 ± 0.882
1.539SerMet: 1.539 ± 0.361
3.738SerAsn: 3.738 ± 0.76
1.319SerPro: 1.319 ± 0.294
2.639SerGln: 2.639 ± 0.531
2.529SerArg: 2.529 ± 0.39
5.717SerSer: 5.717 ± 1.051
3.738SerThr: 3.738 ± 0.904
3.958SerVal: 3.958 ± 0.587
0.88SerTrp: 0.88 ± 0.263
1.539SerTyr: 1.539 ± 0.398
0.0SerXaa: 0.0 ± 0.0
Thr
5.278ThrAla: 5.278 ± 0.798
0.22ThrCys: 0.22 ± 0.14
3.518ThrAsp: 3.518 ± 0.767
5.498ThrGlu: 5.498 ± 0.624
2.309ThrPhe: 2.309 ± 0.455
4.068ThrGly: 4.068 ± 0.644
0.33ThrHis: 0.33 ± 0.21
4.398ThrIle: 4.398 ± 0.816
4.508ThrLys: 4.508 ± 0.619
5.937ThrLeu: 5.937 ± 0.774
1.429ThrMet: 1.429 ± 0.326
4.948ThrAsn: 4.948 ± 0.749
2.199ThrPro: 2.199 ± 0.419
2.529ThrGln: 2.529 ± 0.62
2.089ThrArg: 2.089 ± 0.412
5.278ThrSer: 5.278 ± 0.731
4.068ThrThr: 4.068 ± 0.578
4.068ThrVal: 4.068 ± 0.707
0.88ThrTrp: 0.88 ± 0.32
1.869ThrTyr: 1.869 ± 0.535
0.0ThrXaa: 0.0 ± 0.0
Val
4.068ValAla: 4.068 ± 0.785
0.66ValCys: 0.66 ± 0.336
4.838ValAsp: 4.838 ± 0.868
4.948ValGlu: 4.948 ± 0.566
3.299ValPhe: 3.299 ± 0.706
4.288ValGly: 4.288 ± 0.636
0.66ValHis: 0.66 ± 0.251
4.838ValIle: 4.838 ± 0.658
5.278ValLys: 5.278 ± 0.699
3.189ValLeu: 3.189 ± 0.605
1.869ValMet: 1.869 ± 0.431
3.299ValAsn: 3.299 ± 0.597
1.759ValPro: 1.759 ± 0.508
1.539ValGln: 1.539 ± 0.41
3.189ValArg: 3.189 ± 0.733
5.278ValSer: 5.278 ± 1.107
5.388ValThr: 5.388 ± 0.867
3.189ValVal: 3.189 ± 0.654
0.55ValTrp: 0.55 ± 0.228
2.749ValTyr: 2.749 ± 0.586
0.0ValXaa: 0.0 ± 0.0
Trp
0.55TrpAla: 0.55 ± 0.2
0.22TrpCys: 0.22 ± 0.14
1.1TrpAsp: 1.1 ± 0.4
1.1TrpGlu: 1.1 ± 0.3
0.88TrpPhe: 0.88 ± 0.347
0.55TrpGly: 0.55 ± 0.259
0.33TrpHis: 0.33 ± 0.188
0.77TrpIle: 0.77 ± 0.293
0.66TrpLys: 0.66 ± 0.226
1.319TrpLeu: 1.319 ± 0.34
0.44TrpMet: 0.44 ± 0.209
1.979TrpAsn: 1.979 ± 0.574
0.0TrpPro: 0.0 ± 0.0
0.99TrpGln: 0.99 ± 0.323
0.33TrpArg: 0.33 ± 0.262
1.319TrpSer: 1.319 ± 0.331
0.44TrpThr: 0.44 ± 0.256
0.66TrpVal: 0.66 ± 0.261
0.33TrpTrp: 0.33 ± 0.155
1.1TrpTyr: 1.1 ± 0.297
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.759TyrAla: 1.759 ± 0.425
0.44TyrCys: 0.44 ± 0.294
2.309TyrAsp: 2.309 ± 0.65
3.848TyrGlu: 3.848 ± 0.696
2.309TyrPhe: 2.309 ± 0.424
2.969TyrGly: 2.969 ± 0.554
1.209TyrHis: 1.209 ± 0.37
3.079TyrIle: 3.079 ± 0.675
2.859TyrLys: 2.859 ± 0.649
3.299TyrLeu: 3.299 ± 0.753
0.77TyrMet: 0.77 ± 0.303
4.178TyrAsn: 4.178 ± 0.582
1.319TyrPro: 1.319 ± 0.356
1.539TyrGln: 1.539 ± 0.496
1.319TyrArg: 1.319 ± 0.37
1.759TyrSer: 1.759 ± 0.383
2.529TyrThr: 2.529 ± 0.552
2.309TyrVal: 2.309 ± 0.518
0.33TyrTrp: 0.33 ± 0.164
2.639TyrTyr: 2.639 ± 0.563
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (9096 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
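A protein-level bootstrap of this kind might look like the sketch below. It is an illustration under stated assumptions rather than the database's implementation: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 replicates, and the error is taken to be the sample standard deviation of those replicate values. The function names and the choice of standard deviation as the error measure are assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: proteins are the resampling unit, and the error is the
    standard deviation across n_boot bootstrap replicates.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences in one-letter code, not taken from the proteome described here.
    toy = ["MKLLEK", "MAKLE", "MELKKL", "MLEKA"]
    print(bootstrap_error(toy, "KL"))
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, so the spread reflects variation between proteins.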

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski