Amino acid dipepetide frequency for Lactococcus phage CHPC964

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.564AlaAla: 0.564 ± 0.36
0.113AlaCys: 0.113 ± 0.141
3.158AlaAsp: 3.158 ± 0.634
4.512AlaGlu: 4.512 ± 0.992
3.61AlaPhe: 3.61 ± 0.928
4.512AlaGly: 4.512 ± 0.993
0.564AlaHis: 0.564 ± 0.218
5.302AlaIle: 5.302 ± 1.09
5.527AlaLys: 5.527 ± 0.753
6.204AlaLeu: 6.204 ± 0.948
2.369AlaMet: 2.369 ± 0.501
4.399AlaAsn: 4.399 ± 1.013
0.902AlaPro: 0.902 ± 0.342
2.82AlaGln: 2.82 ± 0.634
2.482AlaArg: 2.482 ± 0.423
4.399AlaSer: 4.399 ± 0.855
3.271AlaThr: 3.271 ± 0.739
3.723AlaVal: 3.723 ± 0.833
1.918AlaTrp: 1.918 ± 1.169
2.369AlaTyr: 2.369 ± 0.451
0.0AlaXaa: 0.0 ± 0.0
Cys
0.338CysAla: 0.338 ± 0.186
0.0CysCys: 0.0 ± 0.0
0.338CysAsp: 0.338 ± 0.202
0.451CysGlu: 0.451 ± 0.232
0.338CysPhe: 0.338 ± 0.2
0.677CysGly: 0.677 ± 0.312
0.226CysHis: 0.226 ± 0.164
0.338CysIle: 0.338 ± 0.212
0.902CysLys: 0.902 ± 0.383
0.226CysLeu: 0.226 ± 0.171
0.113CysMet: 0.113 ± 0.113
0.677CysAsn: 0.677 ± 0.246
0.113CysPro: 0.113 ± 0.112
0.338CysGln: 0.338 ± 0.202
0.677CysArg: 0.677 ± 0.281
0.338CysSer: 0.338 ± 0.192
0.226CysThr: 0.226 ± 0.162
0.451CysVal: 0.451 ± 0.206
0.226CysTrp: 0.226 ± 0.151
0.226CysTyr: 0.226 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
1.354AspAla: 1.354 ± 0.407
0.338AspCys: 0.338 ± 0.214
3.271AspAsp: 3.271 ± 0.685
3.61AspGlu: 3.61 ± 0.743
3.61AspPhe: 3.61 ± 0.661
3.723AspGly: 3.723 ± 0.732
0.677AspHis: 0.677 ± 0.301
3.948AspIle: 3.948 ± 0.682
5.527AspLys: 5.527 ± 0.751
6.091AspLeu: 6.091 ± 0.945
0.451AspMet: 0.451 ± 0.195
4.287AspAsn: 4.287 ± 0.68
1.354AspPro: 1.354 ± 0.448
0.677AspGln: 0.677 ± 0.291
1.692AspArg: 1.692 ± 0.427
2.82AspSer: 2.82 ± 0.526
3.158AspThr: 3.158 ± 0.705
3.497AspVal: 3.497 ± 0.499
0.564AspTrp: 0.564 ± 0.241
2.707AspTyr: 2.707 ± 0.563
0.0AspXaa: 0.0 ± 0.0
Glu
3.948GluAla: 3.948 ± 0.772
0.338GluCys: 0.338 ± 0.194
2.369GluAsp: 2.369 ± 0.556
4.851GluGlu: 4.851 ± 0.989
3.835GluPhe: 3.835 ± 0.709
1.692GluGly: 1.692 ± 0.348
1.128GluHis: 1.128 ± 0.398
5.527GluIle: 5.527 ± 0.859
6.204GluLys: 6.204 ± 1.24
8.911GluLeu: 8.911 ± 1.197
2.369GluMet: 2.369 ± 0.475
4.963GluAsn: 4.963 ± 0.759
1.015GluPro: 1.015 ± 0.348
4.174GluGln: 4.174 ± 0.785
3.158GluArg: 3.158 ± 0.637
4.625GluSer: 4.625 ± 0.678
5.302GluThr: 5.302 ± 0.862
4.512GluVal: 4.512 ± 0.725
0.677GluTrp: 0.677 ± 0.25
3.158GluTyr: 3.158 ± 0.808
0.0GluXaa: 0.0 ± 0.0
Phe
4.174PheAla: 4.174 ± 0.858
0.451PheCys: 0.451 ± 0.23
3.271PheAsp: 3.271 ± 0.576
2.369PheGlu: 2.369 ± 0.529
1.918PhePhe: 1.918 ± 0.819
1.579PheGly: 1.579 ± 0.452
0.451PheHis: 0.451 ± 0.298
3.384PheIle: 3.384 ± 0.676
3.835PheLys: 3.835 ± 0.711
2.369PheLeu: 2.369 ± 0.558
0.79PheMet: 0.79 ± 0.26
3.271PheAsn: 3.271 ± 0.918
0.902PhePro: 0.902 ± 0.317
1.354PheGln: 1.354 ± 0.519
1.466PheArg: 1.466 ± 0.32
3.835PheSer: 3.835 ± 1.175
3.61PheThr: 3.61 ± 0.631
2.369PheVal: 2.369 ± 0.464
0.451PheTrp: 0.451 ± 0.214
2.03PheTyr: 2.03 ± 0.432
0.0PheXaa: 0.0 ± 0.0
Gly
3.835GlyAla: 3.835 ± 1.293
0.338GlyCys: 0.338 ± 0.194
3.271GlyAsp: 3.271 ± 0.642
4.061GlyGlu: 4.061 ± 0.723
2.594GlyPhe: 2.594 ± 0.657
4.174GlyGly: 4.174 ± 1.092
1.015GlyHis: 1.015 ± 0.302
4.399GlyIle: 4.399 ± 1.712
5.753GlyLys: 5.753 ± 0.721
5.64GlyLeu: 5.64 ± 1.046
1.241GlyMet: 1.241 ± 0.371
3.158GlyAsn: 3.158 ± 0.787
0.338GlyPro: 0.338 ± 0.149
1.579GlyGln: 1.579 ± 0.357
1.579GlyArg: 1.579 ± 0.334
4.963GlySer: 4.963 ± 1.069
2.933GlyThr: 2.933 ± 0.644
6.091GlyVal: 6.091 ± 1.286
1.354GlyTrp: 1.354 ± 0.417
2.933GlyTyr: 2.933 ± 0.539
0.0GlyXaa: 0.0 ± 0.0
His
0.677HisAla: 0.677 ± 0.283
0.451HisCys: 0.451 ± 0.273
0.79HisAsp: 0.79 ± 0.276
0.451HisGlu: 0.451 ± 0.239
0.564HisPhe: 0.564 ± 0.267
1.466HisGly: 1.466 ± 0.375
0.226HisHis: 0.226 ± 0.175
1.015HisIle: 1.015 ± 0.32
0.564HisLys: 0.564 ± 0.235
1.015HisLeu: 1.015 ± 0.37
0.0HisMet: 0.0 ± 0.0
1.466HisAsn: 1.466 ± 0.492
0.113HisPro: 0.113 ± 0.112
0.451HisGln: 0.451 ± 0.299
0.113HisArg: 0.113 ± 0.115
0.338HisSer: 0.338 ± 0.18
1.128HisThr: 1.128 ± 0.365
0.79HisVal: 0.79 ± 0.314
0.113HisTrp: 0.113 ± 0.126
0.451HisTyr: 0.451 ± 0.22
0.0HisXaa: 0.0 ± 0.0
Ile
5.302IleAla: 5.302 ± 0.682
0.113IleCys: 0.113 ± 0.116
4.061IleAsp: 4.061 ± 0.568
5.64IleGlu: 5.64 ± 0.946
3.271IlePhe: 3.271 ± 0.743
3.723IleGly: 3.723 ± 0.83
0.677IleHis: 0.677 ± 0.275
5.189IleIle: 5.189 ± 0.801
7.558IleLys: 7.558 ± 0.967
5.753IleLeu: 5.753 ± 0.89
1.918IleMet: 1.918 ± 0.551
5.866IleAsn: 5.866 ± 0.895
2.03IlePro: 2.03 ± 0.427
2.03IleGln: 2.03 ± 0.418
1.466IleArg: 1.466 ± 0.38
4.061IleSer: 4.061 ± 0.906
5.189IleThr: 5.189 ± 0.663
4.399IleVal: 4.399 ± 0.777
1.241IleTrp: 1.241 ± 0.467
2.594IleTyr: 2.594 ± 0.511
0.0IleXaa: 0.0 ± 0.0
Lys
6.43LysAla: 6.43 ± 0.91
0.902LysCys: 0.902 ± 0.593
4.287LysAsp: 4.287 ± 0.664
7.558LysGlu: 7.558 ± 1.28
2.482LysPhe: 2.482 ± 0.525
4.512LysGly: 4.512 ± 0.834
1.354LysHis: 1.354 ± 0.469
5.302LysIle: 5.302 ± 0.737
7.332LysLys: 7.332 ± 1.131
7.445LysLeu: 7.445 ± 0.958
3.271LysMet: 3.271 ± 0.534
6.317LysAsn: 6.317 ± 0.8
1.692LysPro: 1.692 ± 0.483
4.174LysGln: 4.174 ± 0.709
4.287LysArg: 4.287 ± 0.778
4.851LysSer: 4.851 ± 0.696
4.963LysThr: 4.963 ± 0.77
6.655LysVal: 6.655 ± 0.84
1.466LysTrp: 1.466 ± 0.435
3.384LysTyr: 3.384 ± 0.538
0.0LysXaa: 0.0 ± 0.0
Leu
6.317LeuAla: 6.317 ± 0.746
0.451LeuCys: 0.451 ± 0.223
4.963LeuAsp: 4.963 ± 0.695
5.415LeuGlu: 5.415 ± 0.712
3.723LeuPhe: 3.723 ± 0.703
4.174LeuGly: 4.174 ± 0.964
1.579LeuHis: 1.579 ± 0.474
7.107LeuIle: 7.107 ± 0.872
8.347LeuLys: 8.347 ± 1.085
7.671LeuLeu: 7.671 ± 1.502
1.579LeuMet: 1.579 ± 0.426
5.527LeuAsn: 5.527 ± 0.872
2.594LeuPro: 2.594 ± 0.5
3.046LeuGln: 3.046 ± 0.663
2.594LeuArg: 2.594 ± 0.591
4.851LeuSer: 4.851 ± 0.596
5.302LeuThr: 5.302 ± 0.72
5.979LeuVal: 5.979 ± 0.781
1.354LeuTrp: 1.354 ± 0.331
3.948LeuTyr: 3.948 ± 0.778
0.0LeuXaa: 0.0 ± 0.0
Met
2.256MetAla: 2.256 ± 0.594
0.113MetCys: 0.113 ± 0.133
1.015MetAsp: 1.015 ± 0.298
1.805MetGlu: 1.805 ± 0.462
0.564MetPhe: 0.564 ± 0.24
1.015MetGly: 1.015 ± 0.387
0.226MetHis: 0.226 ± 0.164
2.256MetIle: 2.256 ± 0.621
2.482MetLys: 2.482 ± 0.479
1.128MetLeu: 1.128 ± 0.364
0.338MetMet: 0.338 ± 0.181
2.369MetAsn: 2.369 ± 0.602
0.564MetPro: 0.564 ± 0.242
1.579MetGln: 1.579 ± 0.353
0.451MetArg: 0.451 ± 0.271
1.692MetSer: 1.692 ± 0.342
1.354MetThr: 1.354 ± 0.364
1.241MetVal: 1.241 ± 0.336
0.226MetTrp: 0.226 ± 0.15
1.354MetTyr: 1.354 ± 0.397
0.0MetXaa: 0.0 ± 0.0
Asn
5.415AsnAla: 5.415 ± 1.22
0.338AsnCys: 0.338 ± 0.184
4.061AsnAsp: 4.061 ± 0.842
4.963AsnGlu: 4.963 ± 0.75
1.805AsnPhe: 1.805 ± 0.53
7.332AsnGly: 7.332 ± 1.04
0.564AsnHis: 0.564 ± 0.264
4.287AsnIle: 4.287 ± 0.642
6.543AsnLys: 6.543 ± 1.09
5.866AsnLeu: 5.866 ± 0.836
1.466AsnMet: 1.466 ± 0.384
4.174AsnAsn: 4.174 ± 0.871
2.256AsnPro: 2.256 ± 0.527
2.933AsnGln: 2.933 ± 0.524
1.692AsnArg: 1.692 ± 0.386
5.189AsnSer: 5.189 ± 0.731
4.174AsnThr: 4.174 ± 0.682
3.271AsnVal: 3.271 ± 0.682
1.128AsnTrp: 1.128 ± 0.346
2.369AsnTyr: 2.369 ± 0.66
0.0AsnXaa: 0.0 ± 0.0
Pro
1.579ProAla: 1.579 ± 0.338
0.113ProCys: 0.113 ± 0.125
1.354ProAsp: 1.354 ± 0.416
1.918ProGlu: 1.918 ± 0.51
0.902ProPhe: 0.902 ± 0.296
0.226ProGly: 0.226 ± 0.147
0.0ProHis: 0.0 ± 0.0
1.692ProIle: 1.692 ± 0.411
2.594ProLys: 2.594 ± 0.541
1.805ProLeu: 1.805 ± 0.384
0.451ProMet: 0.451 ± 0.198
2.482ProAsn: 2.482 ± 0.771
0.564ProPro: 0.564 ± 0.25
0.677ProGln: 0.677 ± 0.304
0.451ProArg: 0.451 ± 0.196
0.902ProSer: 0.902 ± 0.332
2.707ProThr: 2.707 ± 0.546
1.805ProVal: 1.805 ± 0.455
0.113ProTrp: 0.113 ± 0.112
0.677ProTyr: 0.677 ± 0.316
0.0ProXaa: 0.0 ± 0.0
Gln
2.707GlnAla: 2.707 ± 0.775
0.226GlnCys: 0.226 ± 0.158
1.692GlnAsp: 1.692 ± 0.512
2.82GlnGlu: 2.82 ± 0.496
1.128GlnPhe: 1.128 ± 0.343
2.82GlnGly: 2.82 ± 0.545
0.451GlnHis: 0.451 ± 0.208
1.466GlnIle: 1.466 ± 0.378
2.933GlnLys: 2.933 ± 0.659
3.158GlnLeu: 3.158 ± 0.619
0.677GlnMet: 0.677 ± 0.27
2.03GlnAsn: 2.03 ± 0.391
1.354GlnPro: 1.354 ± 0.289
1.579GlnGln: 1.579 ± 0.422
1.579GlnArg: 1.579 ± 0.488
2.707GlnSer: 2.707 ± 0.439
2.256GlnThr: 2.256 ± 0.592
2.707GlnVal: 2.707 ± 0.504
0.902GlnTrp: 0.902 ± 0.283
1.241GlnTyr: 1.241 ± 0.337
0.0GlnXaa: 0.0 ± 0.0
Arg
2.256ArgAla: 2.256 ± 0.637
0.338ArgCys: 0.338 ± 0.183
1.241ArgAsp: 1.241 ± 0.351
2.03ArgGlu: 2.03 ± 0.481
0.451ArgPhe: 0.451 ± 0.244
2.256ArgGly: 2.256 ± 0.536
0.451ArgHis: 0.451 ± 0.183
2.482ArgIle: 2.482 ± 0.579
4.061ArgLys: 4.061 ± 0.775
3.61ArgLeu: 3.61 ± 0.729
0.677ArgMet: 0.677 ± 0.285
2.369ArgAsn: 2.369 ± 0.475
0.79ArgPro: 0.79 ± 0.286
1.015ArgGln: 1.015 ± 0.375
1.692ArgArg: 1.692 ± 0.434
1.579ArgSer: 1.579 ± 0.393
2.256ArgThr: 2.256 ± 0.449
2.03ArgVal: 2.03 ± 0.663
0.451ArgTrp: 0.451 ± 0.24
1.466ArgTyr: 1.466 ± 0.322
0.0ArgXaa: 0.0 ± 0.0
Ser
5.076SerAla: 5.076 ± 1.487
1.128SerCys: 1.128 ± 0.376
3.497SerAsp: 3.497 ± 0.672
4.287SerGlu: 4.287 ± 0.706
3.497SerPhe: 3.497 ± 0.941
4.963SerGly: 4.963 ± 1.512
0.338SerHis: 0.338 ± 0.167
4.625SerIle: 4.625 ± 0.843
4.399SerLys: 4.399 ± 0.732
6.655SerLeu: 6.655 ± 1.132
1.692SerMet: 1.692 ± 0.345
3.948SerAsn: 3.948 ± 0.677
1.466SerPro: 1.466 ± 0.419
1.805SerGln: 1.805 ± 0.519
2.482SerArg: 2.482 ± 0.498
5.302SerSer: 5.302 ± 1.018
3.723SerThr: 3.723 ± 0.64
4.399SerVal: 4.399 ± 0.763
0.677SerTrp: 0.677 ± 0.281
1.579SerTyr: 1.579 ± 0.417
0.0SerXaa: 0.0 ± 0.0
Thr
4.625ThrAla: 4.625 ± 0.577
0.226ThrCys: 0.226 ± 0.155
3.835ThrAsp: 3.835 ± 0.801
5.979ThrGlu: 5.979 ± 0.782
2.369ThrPhe: 2.369 ± 0.479
4.399ThrGly: 4.399 ± 0.621
0.113ThrHis: 0.113 ± 0.112
4.738ThrIle: 4.738 ± 0.862
5.189ThrLys: 5.189 ± 0.614
4.738ThrLeu: 4.738 ± 0.748
1.015ThrMet: 1.015 ± 0.345
4.963ThrAsn: 4.963 ± 0.709
2.143ThrPro: 2.143 ± 0.348
2.03ThrGln: 2.03 ± 0.494
1.805ThrArg: 1.805 ± 0.508
4.625ThrSer: 4.625 ± 0.603
4.625ThrThr: 4.625 ± 0.892
5.189ThrVal: 5.189 ± 0.813
0.79ThrTrp: 0.79 ± 0.289
2.03ThrTyr: 2.03 ± 0.449
0.0ThrXaa: 0.0 ± 0.0
Val
3.271ValAla: 3.271 ± 0.678
0.451ValCys: 0.451 ± 0.224
3.835ValAsp: 3.835 ± 0.615
4.851ValGlu: 4.851 ± 0.73
2.933ValPhe: 2.933 ± 0.852
4.287ValGly: 4.287 ± 0.579
0.677ValHis: 0.677 ± 0.269
5.527ValIle: 5.527 ± 0.835
5.866ValLys: 5.866 ± 0.796
3.271ValLeu: 3.271 ± 0.621
2.256ValMet: 2.256 ± 0.426
3.61ValAsn: 3.61 ± 0.792
1.579ValPro: 1.579 ± 0.502
1.692ValGln: 1.692 ± 0.398
2.369ValArg: 2.369 ± 0.583
5.189ValSer: 5.189 ± 1.053
6.43ValThr: 6.43 ± 1.044
4.738ValVal: 4.738 ± 0.99
1.015ValTrp: 1.015 ± 0.345
2.594ValTyr: 2.594 ± 0.437
0.0ValXaa: 0.0 ± 0.0
Trp
0.79TrpAla: 0.79 ± 0.273
0.226TrpCys: 0.226 ± 0.193
1.015TrpAsp: 1.015 ± 0.471
0.902TrpGlu: 0.902 ± 0.297
1.466TrpPhe: 1.466 ± 0.524
0.902TrpGly: 0.902 ± 0.384
0.451TrpHis: 0.451 ± 0.274
0.451TrpIle: 0.451 ± 0.282
0.902TrpLys: 0.902 ± 0.32
2.03TrpLeu: 2.03 ± 0.611
0.338TrpMet: 0.338 ± 0.185
1.128TrpAsn: 1.128 ± 0.385
0.0TrpPro: 0.0 ± 0.0
0.902TrpGln: 0.902 ± 0.346
0.338TrpArg: 0.338 ± 0.235
1.579TrpSer: 1.579 ± 0.385
0.564TrpThr: 0.564 ± 0.229
0.451TrpVal: 0.451 ± 0.233
0.0TrpTrp: 0.0 ± 0.0
0.677TrpTyr: 0.677 ± 0.253
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.805TyrAla: 1.805 ± 0.506
0.564TyrCys: 0.564 ± 0.32
2.03TyrAsp: 2.03 ± 0.582
4.287TyrGlu: 4.287 ± 0.679
2.82TyrPhe: 2.82 ± 0.481
2.82TyrGly: 2.82 ± 0.553
1.015TyrHis: 1.015 ± 0.268
3.046TyrIle: 3.046 ± 0.605
2.256TyrLys: 2.256 ± 0.471
2.933TyrLeu: 2.933 ± 0.737
0.902TyrMet: 0.902 ± 0.397
2.82TyrAsn: 2.82 ± 0.497
1.241TyrPro: 1.241 ± 0.383
1.579TyrGln: 1.579 ± 0.506
1.128TyrArg: 1.128 ± 0.366
1.692TyrSer: 1.692 ± 0.391
2.256TyrThr: 2.256 ± 0.534
2.143TyrVal: 2.143 ± 0.547
0.451TyrTrp: 0.451 ± 0.231
2.256TyrTyr: 2.256 ± 0.564
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (8866 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski