Amino acid dipepetide frequency for Lactococcus phage CHPC52

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.347AlaAla: 0.347 ± 0.196
0.231AlaCys: 0.231 ± 0.169
3.353AlaAsp: 3.353 ± 0.554
4.624AlaGlu: 4.624 ± 0.818
3.121AlaPhe: 3.121 ± 0.813
4.162AlaGly: 4.162 ± 0.829
0.578AlaHis: 0.578 ± 0.281
4.393AlaIle: 4.393 ± 0.727
5.549AlaLys: 5.549 ± 1.023
7.168AlaLeu: 7.168 ± 1.083
2.197AlaMet: 2.197 ± 0.741
4.277AlaAsn: 4.277 ± 0.71
0.694AlaPro: 0.694 ± 0.271
2.775AlaGln: 2.775 ± 0.535
2.312AlaArg: 2.312 ± 0.473
3.468AlaSer: 3.468 ± 0.763
3.006AlaThr: 3.006 ± 0.758
4.74AlaVal: 4.74 ± 0.93
1.965AlaTrp: 1.965 ± 0.728
2.197AlaTyr: 2.197 ± 0.377
0.0AlaXaa: 0.0 ± 0.0
Cys
0.462CysAla: 0.462 ± 0.213
0.116CysCys: 0.116 ± 0.131
0.116CysAsp: 0.116 ± 0.126
0.578CysGlu: 0.578 ± 0.274
0.0CysPhe: 0.0 ± 0.0
0.925CysGly: 0.925 ± 0.465
0.231CysHis: 0.231 ± 0.148
0.347CysIle: 0.347 ± 0.234
0.809CysLys: 0.809 ± 0.437
0.462CysLeu: 0.462 ± 0.347
0.0CysMet: 0.0 ± 0.0
0.462CysAsn: 0.462 ± 0.26
0.116CysPro: 0.116 ± 0.112
0.231CysGln: 0.231 ± 0.156
0.347CysArg: 0.347 ± 0.209
0.347CysSer: 0.347 ± 0.314
0.116CysThr: 0.116 ± 0.141
0.347CysVal: 0.347 ± 0.175
0.0CysTrp: 0.0 ± 0.0
0.578CysTyr: 0.578 ± 0.419
0.0CysXaa: 0.0 ± 0.0
Asp
1.618AspAla: 1.618 ± 0.514
0.462AspCys: 0.462 ± 0.234
3.121AspAsp: 3.121 ± 0.645
3.121AspGlu: 3.121 ± 0.729
3.815AspPhe: 3.815 ± 0.57
3.815AspGly: 3.815 ± 0.889
0.809AspHis: 0.809 ± 0.35
4.277AspIle: 4.277 ± 0.761
5.202AspLys: 5.202 ± 0.853
6.012AspLeu: 6.012 ± 0.853
0.809AspMet: 0.809 ± 0.257
4.393AspAsn: 4.393 ± 0.599
1.272AspPro: 1.272 ± 0.46
0.462AspGln: 0.462 ± 0.279
1.503AspArg: 1.503 ± 0.5
2.659AspSer: 2.659 ± 0.517
4.971AspThr: 4.971 ± 0.823
3.468AspVal: 3.468 ± 0.618
0.809AspTrp: 0.809 ± 0.254
2.775AspTyr: 2.775 ± 0.534
0.0AspXaa: 0.0 ± 0.0
Glu
3.815GluAla: 3.815 ± 0.672
0.347GluCys: 0.347 ± 0.228
3.584GluAsp: 3.584 ± 0.696
5.087GluGlu: 5.087 ± 0.902
3.931GluPhe: 3.931 ± 0.714
2.312GluGly: 2.312 ± 0.433
0.809GluHis: 0.809 ± 0.383
5.896GluIle: 5.896 ± 0.774
5.434GluLys: 5.434 ± 1.155
10.289GluLeu: 10.289 ± 1.599
2.775GluMet: 2.775 ± 0.546
4.509GluAsn: 4.509 ± 0.827
1.04GluPro: 1.04 ± 0.352
3.584GluGln: 3.584 ± 0.638
2.312GluArg: 2.312 ± 0.524
4.046GluSer: 4.046 ± 0.603
4.624GluThr: 4.624 ± 0.652
3.353GluVal: 3.353 ± 0.545
1.04GluTrp: 1.04 ± 0.324
3.121GluTyr: 3.121 ± 0.739
0.0GluXaa: 0.0 ± 0.0
Phe
3.815PheAla: 3.815 ± 0.841
0.462PheCys: 0.462 ± 0.367
3.353PheAsp: 3.353 ± 0.556
2.428PheGlu: 2.428 ± 0.592
2.081PhePhe: 2.081 ± 0.569
2.197PheGly: 2.197 ± 0.545
0.347PheHis: 0.347 ± 0.199
3.121PheIle: 3.121 ± 0.595
4.74PheLys: 4.74 ± 0.787
2.659PheLeu: 2.659 ± 0.522
0.809PheMet: 0.809 ± 0.271
3.121PheAsn: 3.121 ± 0.953
0.925PhePro: 0.925 ± 0.349
1.156PheGln: 1.156 ± 0.359
1.272PheArg: 1.272 ± 0.309
4.046PheSer: 4.046 ± 0.773
3.237PheThr: 3.237 ± 0.553
3.006PheVal: 3.006 ± 0.448
0.231PheTrp: 0.231 ± 0.162
1.734PheTyr: 1.734 ± 0.385
0.0PheXaa: 0.0 ± 0.0
Gly
3.237GlyAla: 3.237 ± 0.883
0.231GlyCys: 0.231 ± 0.165
2.775GlyAsp: 2.775 ± 0.717
4.162GlyGlu: 4.162 ± 0.723
2.89GlyPhe: 2.89 ± 0.707
4.74GlyGly: 4.74 ± 1.014
1.04GlyHis: 1.04 ± 0.328
3.931GlyIle: 3.931 ± 1.271
6.243GlyLys: 6.243 ± 0.588
5.549GlyLeu: 5.549 ± 1.045
1.503GlyMet: 1.503 ± 0.512
4.624GlyAsn: 4.624 ± 0.691
0.116GlyPro: 0.116 ± 0.115
2.197GlyGln: 2.197 ± 0.5
2.197GlyArg: 2.197 ± 0.343
4.971GlySer: 4.971 ± 0.976
3.121GlyThr: 3.121 ± 0.615
6.012GlyVal: 6.012 ± 1.239
1.272GlyTrp: 1.272 ± 0.323
2.659GlyTyr: 2.659 ± 0.575
0.0GlyXaa: 0.0 ± 0.0
His
0.694HisAla: 0.694 ± 0.21
0.347HisCys: 0.347 ± 0.21
0.462HisAsp: 0.462 ± 0.21
0.462HisGlu: 0.462 ± 0.237
0.578HisPhe: 0.578 ± 0.278
1.04HisGly: 1.04 ± 0.326
0.116HisHis: 0.116 ± 0.116
1.387HisIle: 1.387 ± 0.417
0.809HisLys: 0.809 ± 0.293
0.925HisLeu: 0.925 ± 0.365
0.0HisMet: 0.0 ± 0.0
1.618HisAsn: 1.618 ± 0.576
0.116HisPro: 0.116 ± 0.104
0.347HisGln: 0.347 ± 0.176
0.231HisArg: 0.231 ± 0.177
0.462HisSer: 0.462 ± 0.211
0.809HisThr: 0.809 ± 0.349
0.809HisVal: 0.809 ± 0.332
0.116HisTrp: 0.116 ± 0.105
0.462HisTyr: 0.462 ± 0.252
0.0HisXaa: 0.0 ± 0.0
Ile
4.624IleAla: 4.624 ± 0.632
0.116IleCys: 0.116 ± 0.131
4.624IleAsp: 4.624 ± 0.631
5.896IleGlu: 5.896 ± 0.979
2.89IlePhe: 2.89 ± 0.746
3.931IleGly: 3.931 ± 0.924
0.809IleHis: 0.809 ± 0.267
4.855IleIle: 4.855 ± 0.672
6.936IleLys: 6.936 ± 0.852
5.78IleLeu: 5.78 ± 1.2
1.503IleMet: 1.503 ± 0.379
5.318IleAsn: 5.318 ± 0.531
1.734IlePro: 1.734 ± 0.398
2.197IleGln: 2.197 ± 0.398
1.618IleArg: 1.618 ± 0.483
3.584IleSer: 3.584 ± 0.666
5.434IleThr: 5.434 ± 0.825
4.277IleVal: 4.277 ± 0.67
1.04IleTrp: 1.04 ± 0.442
2.428IleTyr: 2.428 ± 0.432
0.0IleXaa: 0.0 ± 0.0
Lys
6.358LysAla: 6.358 ± 0.906
0.231LysCys: 0.231 ± 0.155
4.971LysAsp: 4.971 ± 0.872
7.63LysGlu: 7.63 ± 1.192
2.543LysPhe: 2.543 ± 0.545
5.665LysGly: 5.665 ± 0.863
1.04LysHis: 1.04 ± 0.429
6.243LysIle: 6.243 ± 0.845
9.133LysLys: 9.133 ± 1.242
7.168LysLeu: 7.168 ± 0.971
3.468LysMet: 3.468 ± 0.468
4.74LysAsn: 4.74 ± 0.765
1.503LysPro: 1.503 ± 0.475
4.046LysGln: 4.046 ± 0.781
4.046LysArg: 4.046 ± 0.768
5.665LysSer: 5.665 ± 0.862
4.971LysThr: 4.971 ± 0.909
6.59LysVal: 6.59 ± 0.78
1.503LysTrp: 1.503 ± 0.455
3.584LysTyr: 3.584 ± 0.621
0.0LysXaa: 0.0 ± 0.0
Leu
5.202LeuAla: 5.202 ± 0.738
0.462LeuCys: 0.462 ± 0.254
4.971LeuAsp: 4.971 ± 0.74
5.896LeuGlu: 5.896 ± 1.11
4.046LeuPhe: 4.046 ± 0.798
4.855LeuGly: 4.855 ± 0.868
1.387LeuHis: 1.387 ± 0.401
7.168LeuIle: 7.168 ± 1.128
8.092LeuLys: 8.092 ± 1.071
7.514LeuLeu: 7.514 ± 1.235
1.503LeuMet: 1.503 ± 0.526
5.202LeuAsn: 5.202 ± 0.929
3.237LeuPro: 3.237 ± 0.574
2.89LeuGln: 2.89 ± 0.427
2.89LeuArg: 2.89 ± 0.615
4.971LeuSer: 4.971 ± 0.765
6.936LeuThr: 6.936 ± 0.887
5.434LeuVal: 5.434 ± 0.718
1.503LeuTrp: 1.503 ± 0.452
4.277LeuTyr: 4.277 ± 0.868
0.0LeuXaa: 0.0 ± 0.0
Met
2.312MetAla: 2.312 ± 0.499
0.116MetCys: 0.116 ± 0.121
1.618MetAsp: 1.618 ± 0.439
2.197MetGlu: 2.197 ± 0.568
0.462MetPhe: 0.462 ± 0.231
1.156MetGly: 1.156 ± 0.334
0.231MetHis: 0.231 ± 0.195
2.081MetIle: 2.081 ± 0.599
2.543MetLys: 2.543 ± 0.536
1.04MetLeu: 1.04 ± 0.343
0.347MetMet: 0.347 ± 0.182
2.081MetAsn: 2.081 ± 0.56
0.462MetPro: 0.462 ± 0.222
1.85MetGln: 1.85 ± 0.404
0.347MetArg: 0.347 ± 0.245
1.156MetSer: 1.156 ± 0.301
1.965MetThr: 1.965 ± 0.527
1.503MetVal: 1.503 ± 0.344
0.0MetTrp: 0.0 ± 0.0
1.618MetTyr: 1.618 ± 0.44
0.0MetXaa: 0.0 ± 0.0
Asn
4.855AsnAla: 4.855 ± 1.041
0.462AsnCys: 0.462 ± 0.237
3.584AsnAsp: 3.584 ± 0.68
4.393AsnGlu: 4.393 ± 0.793
2.197AsnPhe: 2.197 ± 0.624
6.358AsnGly: 6.358 ± 0.801
0.925AsnHis: 0.925 ± 0.366
4.509AsnIle: 4.509 ± 0.698
6.243AsnLys: 6.243 ± 1.134
5.665AsnLeu: 5.665 ± 0.829
1.156AsnMet: 1.156 ± 0.41
3.584AsnAsn: 3.584 ± 0.619
1.85AsnPro: 1.85 ± 0.41
2.312AsnGln: 2.312 ± 0.51
1.85AsnArg: 1.85 ± 0.326
5.202AsnSer: 5.202 ± 0.742
4.74AsnThr: 4.74 ± 0.924
3.699AsnVal: 3.699 ± 0.764
1.04AsnTrp: 1.04 ± 0.389
2.428AsnTyr: 2.428 ± 0.7
0.0AsnXaa: 0.0 ± 0.0
Pro
1.734ProAla: 1.734 ± 0.41
0.116ProCys: 0.116 ± 0.126
1.503ProAsp: 1.503 ± 0.41
1.503ProGlu: 1.503 ± 0.474
1.04ProPhe: 1.04 ± 0.32
0.231ProGly: 0.231 ± 0.14
0.116ProHis: 0.116 ± 0.12
1.618ProIle: 1.618 ± 0.401
2.428ProLys: 2.428 ± 0.508
1.965ProLeu: 1.965 ± 0.477
0.578ProMet: 0.578 ± 0.274
2.081ProAsn: 2.081 ± 0.623
0.809ProPro: 0.809 ± 0.318
0.462ProGln: 0.462 ± 0.223
0.462ProArg: 0.462 ± 0.255
0.809ProSer: 0.809 ± 0.32
2.197ProThr: 2.197 ± 0.457
1.503ProVal: 1.503 ± 0.515
0.0ProTrp: 0.0 ± 0.0
0.462ProTyr: 0.462 ± 0.27
0.0ProXaa: 0.0 ± 0.0
Gln
3.353GlnAla: 3.353 ± 0.832
0.347GlnCys: 0.347 ± 0.227
1.85GlnAsp: 1.85 ± 0.579
2.197GlnGlu: 2.197 ± 0.538
1.156GlnPhe: 1.156 ± 0.379
3.006GlnGly: 3.006 ± 0.581
0.347GlnHis: 0.347 ± 0.239
1.503GlnIle: 1.503 ± 0.325
3.006GlnLys: 3.006 ± 0.668
3.006GlnLeu: 3.006 ± 0.522
1.156GlnMet: 1.156 ± 0.339
2.312GlnAsn: 2.312 ± 0.529
1.156GlnPro: 1.156 ± 0.39
1.272GlnGln: 1.272 ± 0.418
1.387GlnArg: 1.387 ± 0.444
2.428GlnSer: 2.428 ± 0.494
2.197GlnThr: 2.197 ± 0.417
2.312GlnVal: 2.312 ± 0.469
0.809GlnTrp: 0.809 ± 0.267
1.272GlnTyr: 1.272 ± 0.337
0.0GlnXaa: 0.0 ± 0.0
Arg
2.197ArgAla: 2.197 ± 0.656
0.231ArgCys: 0.231 ± 0.153
1.734ArgAsp: 1.734 ± 0.394
2.312ArgGlu: 2.312 ± 0.43
0.694ArgPhe: 0.694 ± 0.256
1.734ArgGly: 1.734 ± 0.357
0.694ArgHis: 0.694 ± 0.281
1.85ArgIle: 1.85 ± 0.451
3.468ArgLys: 3.468 ± 0.798
3.353ArgLeu: 3.353 ± 0.59
0.809ArgMet: 0.809 ± 0.331
2.428ArgAsn: 2.428 ± 0.47
0.578ArgPro: 0.578 ± 0.235
1.503ArgGln: 1.503 ± 0.38
1.85ArgArg: 1.85 ± 0.466
1.734ArgSer: 1.734 ± 0.484
1.965ArgThr: 1.965 ± 0.475
1.965ArgVal: 1.965 ± 0.694
0.347ArgTrp: 0.347 ± 0.163
1.618ArgTyr: 1.618 ± 0.434
0.0ArgXaa: 0.0 ± 0.0
Ser
5.434SerAla: 5.434 ± 1.541
0.694SerCys: 0.694 ± 0.348
4.162SerAsp: 4.162 ± 0.664
4.162SerGlu: 4.162 ± 0.763
3.353SerPhe: 3.353 ± 0.574
4.971SerGly: 4.971 ± 0.968
0.462SerHis: 0.462 ± 0.227
3.815SerIle: 3.815 ± 0.713
4.277SerLys: 4.277 ± 0.779
6.243SerLeu: 6.243 ± 0.969
1.734SerMet: 1.734 ± 0.383
3.584SerAsn: 3.584 ± 0.629
1.272SerPro: 1.272 ± 0.377
1.85SerGln: 1.85 ± 0.54
2.428SerArg: 2.428 ± 0.476
4.971SerSer: 4.971 ± 0.735
3.121SerThr: 3.121 ± 0.695
4.162SerVal: 4.162 ± 0.673
0.694SerTrp: 0.694 ± 0.243
1.734SerTyr: 1.734 ± 0.429
0.0SerXaa: 0.0 ± 0.0
Thr
5.087ThrAla: 5.087 ± 1.011
0.347ThrCys: 0.347 ± 0.226
3.353ThrAsp: 3.353 ± 0.729
5.78ThrGlu: 5.78 ± 0.655
3.006ThrPhe: 3.006 ± 0.53
4.509ThrGly: 4.509 ± 0.612
0.116ThrHis: 0.116 ± 0.12
4.162ThrIle: 4.162 ± 0.767
5.318ThrLys: 5.318 ± 0.655
5.78ThrLeu: 5.78 ± 0.898
1.04ThrMet: 1.04 ± 0.368
4.509ThrAsn: 4.509 ± 0.626
2.197ThrPro: 2.197 ± 0.378
2.428ThrGln: 2.428 ± 0.519
1.618ThrArg: 1.618 ± 0.594
4.74ThrSer: 4.74 ± 0.692
4.74ThrThr: 4.74 ± 0.664
5.318ThrVal: 5.318 ± 0.911
0.809ThrTrp: 0.809 ± 0.366
2.428ThrTyr: 2.428 ± 0.54
0.0ThrXaa: 0.0 ± 0.0
Val
3.584ValAla: 3.584 ± 0.581
0.347ValCys: 0.347 ± 0.256
3.699ValAsp: 3.699 ± 0.692
4.74ValGlu: 4.74 ± 0.691
3.237ValPhe: 3.237 ± 0.733
4.277ValGly: 4.277 ± 0.744
0.347ValHis: 0.347 ± 0.197
4.509ValIle: 4.509 ± 0.915
6.59ValLys: 6.59 ± 0.756
3.584ValLeu: 3.584 ± 0.616
2.312ValMet: 2.312 ± 0.472
3.353ValAsn: 3.353 ± 0.766
1.272ValPro: 1.272 ± 0.349
2.428ValGln: 2.428 ± 0.571
2.659ValArg: 2.659 ± 0.616
4.855ValSer: 4.855 ± 1.029
5.78ValThr: 5.78 ± 0.806
4.046ValVal: 4.046 ± 0.797
0.578ValTrp: 0.578 ± 0.216
3.699ValTyr: 3.699 ± 0.536
0.0ValXaa: 0.0 ± 0.0
Trp
0.462TrpAla: 0.462 ± 0.186
0.347TrpCys: 0.347 ± 0.233
0.694TrpAsp: 0.694 ± 0.26
0.694TrpGlu: 0.694 ± 0.276
0.925TrpPhe: 0.925 ± 0.368
0.694TrpGly: 0.694 ± 0.277
0.231TrpHis: 0.231 ± 0.173
0.694TrpIle: 0.694 ± 0.318
1.272TrpLys: 1.272 ± 0.325
1.387TrpLeu: 1.387 ± 0.473
0.231TrpMet: 0.231 ± 0.139
1.387TrpAsn: 1.387 ± 0.589
0.0TrpPro: 0.0 ± 0.0
0.694TrpGln: 0.694 ± 0.238
0.578TrpArg: 0.578 ± 0.275
1.272TrpSer: 1.272 ± 0.279
0.578TrpThr: 0.578 ± 0.27
0.809TrpVal: 0.809 ± 0.33
0.116TrpTrp: 0.116 ± 0.108
1.04TrpTyr: 1.04 ± 0.339
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.965TyrAla: 1.965 ± 0.526
0.578TyrCys: 0.578 ± 0.348
1.85TyrAsp: 1.85 ± 0.547
4.046TyrGlu: 4.046 ± 0.76
2.775TyrPhe: 2.775 ± 0.549
2.89TyrGly: 2.89 ± 0.669
1.156TyrHis: 1.156 ± 0.422
3.237TyrIle: 3.237 ± 0.568
3.006TyrLys: 3.006 ± 0.699
2.89TyrLeu: 2.89 ± 0.709
1.04TyrMet: 1.04 ± 0.409
3.468TyrAsn: 3.468 ± 0.6
1.387TyrPro: 1.387 ± 0.416
1.387TyrGln: 1.387 ± 0.462
1.156TyrArg: 1.156 ± 0.372
1.734TyrSer: 1.734 ± 0.409
2.775TyrThr: 2.775 ± 0.657
2.543TyrVal: 2.543 ± 0.465
0.231TyrTrp: 0.231 ± 0.159
2.428TyrTyr: 2.428 ± 0.622
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (8651 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski