Amino acid dipepetide frequency for Lactococcus phage 1358

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.413AlaAla: 13.413 ± 1.827
0.61AlaCys: 0.61 ± 0.197
6.184AlaAsp: 6.184 ± 0.708
8.449AlaGlu: 8.449 ± 1.397
3.223AlaPhe: 3.223 ± 0.578
6.184AlaGly: 6.184 ± 0.881
1.481AlaHis: 1.481 ± 0.342
5.923AlaIle: 5.923 ± 0.896
5.4AlaLys: 5.4 ± 0.83
7.055AlaLeu: 7.055 ± 0.897
2.265AlaMet: 2.265 ± 0.386
4.181AlaAsn: 4.181 ± 0.635
3.223AlaPro: 3.223 ± 0.497
3.745AlaGln: 3.745 ± 0.56
4.616AlaArg: 4.616 ± 0.772
4.442AlaSer: 4.442 ± 0.421
6.794AlaThr: 6.794 ± 0.911
5.662AlaVal: 5.662 ± 1.255
1.132AlaTrp: 1.132 ± 0.409
3.571AlaTyr: 3.571 ± 0.589
0.0AlaXaa: 0.0 ± 0.0
Cys
0.261CysAla: 0.261 ± 0.135
0.0CysCys: 0.0 ± 0.0
0.871CysAsp: 0.871 ± 0.321
0.958CysGlu: 0.958 ± 0.365
0.348CysPhe: 0.348 ± 0.197
0.523CysGly: 0.523 ± 0.294
0.087CysHis: 0.087 ± 0.093
0.174CysIle: 0.174 ± 0.121
0.61CysLys: 0.61 ± 0.213
0.523CysLeu: 0.523 ± 0.202
0.087CysMet: 0.087 ± 0.097
0.087CysAsn: 0.087 ± 0.092
0.436CysPro: 0.436 ± 0.188
0.0CysGln: 0.0 ± 0.0
0.523CysArg: 0.523 ± 0.216
0.261CysSer: 0.261 ± 0.138
0.348CysThr: 0.348 ± 0.193
0.523CysVal: 0.523 ± 0.254
0.087CysTrp: 0.087 ± 0.095
0.61CysTyr: 0.61 ± 0.243
0.0CysXaa: 0.0 ± 0.0
Asp
7.229AspAla: 7.229 ± 1.036
0.436AspCys: 0.436 ± 0.167
6.184AspAsp: 6.184 ± 1.495
7.229AspGlu: 7.229 ± 1.39
2.874AspPhe: 2.874 ± 0.5
5.923AspGly: 5.923 ± 0.815
1.132AspHis: 1.132 ± 0.334
4.703AspIle: 4.703 ± 0.729
5.313AspLys: 5.313 ± 0.701
4.181AspLeu: 4.181 ± 0.678
1.568AspMet: 1.568 ± 0.378
3.136AspAsn: 3.136 ± 0.578
2.874AspPro: 2.874 ± 0.503
0.784AspGln: 0.784 ± 0.247
2.613AspArg: 2.613 ± 0.592
3.571AspSer: 3.571 ± 0.682
3.484AspThr: 3.484 ± 0.628
4.965AspVal: 4.965 ± 0.894
0.871AspTrp: 0.871 ± 0.3
2.787AspTyr: 2.787 ± 0.447
0.0AspXaa: 0.0 ± 0.0
Glu
6.968GluAla: 6.968 ± 0.956
0.261GluCys: 0.261 ± 0.167
5.226GluAsp: 5.226 ± 1.541
5.749GluGlu: 5.749 ± 1.893
2.961GluPhe: 2.961 ± 0.524
3.484GluGly: 3.484 ± 0.748
2.003GluHis: 2.003 ± 0.397
5.574GluIle: 5.574 ± 0.789
3.92GluLys: 3.92 ± 0.561
5.139GluLeu: 5.139 ± 0.738
1.916GluMet: 1.916 ± 0.503
3.049GluAsn: 3.049 ± 0.494
2.265GluPro: 2.265 ± 0.495
3.571GluGln: 3.571 ± 0.503
2.874GluArg: 2.874 ± 0.562
3.397GluSer: 3.397 ± 0.759
2.003GluThr: 2.003 ± 0.481
3.832GluVal: 3.832 ± 0.561
0.871GluTrp: 0.871 ± 0.331
2.526GluTyr: 2.526 ± 0.546
0.0GluXaa: 0.0 ± 0.0
Phe
2.961PheAla: 2.961 ± 0.485
0.523PheCys: 0.523 ± 0.198
4.007PheAsp: 4.007 ± 0.554
2.613PheGlu: 2.613 ± 0.373
1.307PhePhe: 1.307 ± 0.307
2.613PheGly: 2.613 ± 0.456
1.045PheHis: 1.045 ± 0.273
2.526PheIle: 2.526 ± 0.471
3.136PheLys: 3.136 ± 0.483
1.829PheLeu: 1.829 ± 0.382
0.958PheMet: 0.958 ± 0.277
1.481PheAsn: 1.481 ± 0.345
1.132PhePro: 1.132 ± 0.254
1.045PheGln: 1.045 ± 0.399
0.958PheArg: 0.958 ± 0.319
2.265PheSer: 2.265 ± 0.522
1.742PheThr: 1.742 ± 0.402
2.439PheVal: 2.439 ± 0.484
0.436PheTrp: 0.436 ± 0.196
1.829PheTyr: 1.829 ± 0.352
0.0PheXaa: 0.0 ± 0.0
Gly
6.707GlyAla: 6.707 ± 0.853
0.348GlyCys: 0.348 ± 0.319
4.878GlyAsp: 4.878 ± 0.705
3.31GlyGlu: 3.31 ± 0.693
2.787GlyPhe: 2.787 ± 0.462
6.097GlyGly: 6.097 ± 0.83
1.655GlyHis: 1.655 ± 0.459
4.355GlyIle: 4.355 ± 0.779
5.226GlyLys: 5.226 ± 0.797
5.4GlyLeu: 5.4 ± 0.633
2.09GlyMet: 2.09 ± 0.459
3.832GlyAsn: 3.832 ± 0.714
0.0GlyPro: 0.0 ± 0.0
3.049GlyGln: 3.049 ± 0.525
3.484GlyArg: 3.484 ± 0.544
3.92GlySer: 3.92 ± 0.699
6.445GlyThr: 6.445 ± 1.421
5.662GlyVal: 5.662 ± 0.813
0.697GlyTrp: 0.697 ± 0.278
2.874GlyTyr: 2.874 ± 0.626
0.0GlyXaa: 0.0 ± 0.0
His
1.829HisAla: 1.829 ± 0.393
0.348HisCys: 0.348 ± 0.157
1.045HisAsp: 1.045 ± 0.332
1.219HisGlu: 1.219 ± 0.375
0.261HisPhe: 0.261 ± 0.13
1.307HisGly: 1.307 ± 0.429
0.348HisHis: 0.348 ± 0.191
1.568HisIle: 1.568 ± 0.358
1.219HisLys: 1.219 ± 0.313
1.481HisLeu: 1.481 ± 0.37
0.261HisMet: 0.261 ± 0.148
1.219HisAsn: 1.219 ± 0.274
0.523HisPro: 0.523 ± 0.191
0.348HisGln: 0.348 ± 0.171
0.697HisArg: 0.697 ± 0.364
0.261HisSer: 0.261 ± 0.134
1.568HisThr: 1.568 ± 0.345
1.481HisVal: 1.481 ± 0.39
0.261HisTrp: 0.261 ± 0.136
0.784HisTyr: 0.784 ± 0.214
0.0HisXaa: 0.0 ± 0.0
Ile
6.62IleAla: 6.62 ± 0.917
0.261IleCys: 0.261 ± 0.144
4.703IleAsp: 4.703 ± 0.644
4.529IleGlu: 4.529 ± 0.651
1.829IlePhe: 1.829 ± 0.38
3.484IleGly: 3.484 ± 0.628
0.784IleHis: 0.784 ± 0.265
3.658IleIle: 3.658 ± 0.631
6.445IleLys: 6.445 ± 0.769
3.745IleLeu: 3.745 ± 0.564
1.394IleMet: 1.394 ± 0.457
3.223IleAsn: 3.223 ± 0.498
1.394IlePro: 1.394 ± 0.353
2.787IleGln: 2.787 ± 0.716
2.613IleArg: 2.613 ± 0.533
3.136IleSer: 3.136 ± 0.332
4.007IleThr: 4.007 ± 0.659
4.007IleVal: 4.007 ± 0.387
0.261IleTrp: 0.261 ± 0.147
1.742IleTyr: 1.742 ± 0.284
0.0IleXaa: 0.0 ± 0.0
Lys
7.926LysAla: 7.926 ± 0.904
0.174LysCys: 0.174 ± 0.125
4.703LysAsp: 4.703 ± 0.68
3.397LysGlu: 3.397 ± 0.692
2.265LysPhe: 2.265 ± 0.495
6.184LysGly: 6.184 ± 0.738
1.829LysHis: 1.829 ± 0.527
4.007LysIle: 4.007 ± 0.625
5.4LysLys: 5.4 ± 0.959
5.487LysLeu: 5.487 ± 0.834
1.568LysMet: 1.568 ± 0.292
3.484LysAsn: 3.484 ± 0.484
2.7LysPro: 2.7 ± 0.524
2.003LysGln: 2.003 ± 0.418
4.616LysArg: 4.616 ± 0.745
3.745LysSer: 3.745 ± 0.584
4.094LysThr: 4.094 ± 0.552
4.791LysVal: 4.791 ± 0.62
1.219LysTrp: 1.219 ± 0.252
2.439LysTyr: 2.439 ± 0.45
0.0LysXaa: 0.0 ± 0.0
Leu
6.184LeuAla: 6.184 ± 0.812
0.61LeuCys: 0.61 ± 0.252
6.184LeuAsp: 6.184 ± 0.741
5.052LeuGlu: 5.052 ± 0.722
2.003LeuPhe: 2.003 ± 0.436
6.097LeuGly: 6.097 ± 0.88
1.481LeuHis: 1.481 ± 0.351
2.613LeuIle: 2.613 ± 0.509
5.923LeuLys: 5.923 ± 0.876
4.355LeuLeu: 4.355 ± 0.674
1.219LeuMet: 1.219 ± 0.351
3.745LeuAsn: 3.745 ± 0.526
3.31LeuPro: 3.31 ± 0.598
3.049LeuGln: 3.049 ± 0.549
3.484LeuArg: 3.484 ± 0.537
4.181LeuSer: 4.181 ± 0.676
4.442LeuThr: 4.442 ± 0.635
4.442LeuVal: 4.442 ± 0.44
1.307LeuTrp: 1.307 ± 0.353
1.742LeuTyr: 1.742 ± 0.762
0.0LeuXaa: 0.0 ± 0.0
Met
1.655MetAla: 1.655 ± 0.453
0.0MetCys: 0.0 ± 0.0
0.958MetAsp: 0.958 ± 0.233
0.436MetGlu: 0.436 ± 0.233
0.871MetPhe: 0.871 ± 0.347
2.178MetGly: 2.178 ± 0.423
0.348MetHis: 0.348 ± 0.183
1.132MetIle: 1.132 ± 0.459
2.7MetLys: 2.7 ± 0.462
1.132MetLeu: 1.132 ± 0.364
0.261MetMet: 0.261 ± 0.145
1.045MetAsn: 1.045 ± 0.355
1.307MetPro: 1.307 ± 0.287
0.958MetGln: 0.958 ± 0.288
1.132MetArg: 1.132 ± 0.257
1.568MetSer: 1.568 ± 0.404
2.09MetThr: 2.09 ± 0.337
1.132MetVal: 1.132 ± 0.398
0.0MetTrp: 0.0 ± 0.0
1.132MetTyr: 1.132 ± 0.345
0.0MetXaa: 0.0 ± 0.0
Asn
4.529AsnAla: 4.529 ± 0.668
0.348AsnCys: 0.348 ± 0.179
2.439AsnAsp: 2.439 ± 0.409
3.136AsnGlu: 3.136 ± 0.652
2.352AsnPhe: 2.352 ± 0.502
4.268AsnGly: 4.268 ± 0.754
0.61AsnHis: 0.61 ± 0.21
3.136AsnIle: 3.136 ± 0.604
2.439AsnLys: 2.439 ± 0.328
3.397AsnLeu: 3.397 ± 0.538
0.958AsnMet: 0.958 ± 0.33
3.136AsnAsn: 3.136 ± 0.626
2.874AsnPro: 2.874 ± 0.439
2.09AsnGln: 2.09 ± 0.55
2.526AsnArg: 2.526 ± 0.581
1.394AsnSer: 1.394 ± 0.478
3.223AsnThr: 3.223 ± 0.55
3.136AsnVal: 3.136 ± 0.59
0.436AsnTrp: 0.436 ± 0.205
1.568AsnTyr: 1.568 ± 0.382
0.0AsnXaa: 0.0 ± 0.0
Pro
3.484ProAla: 3.484 ± 0.564
0.087ProCys: 0.087 ± 0.081
2.265ProAsp: 2.265 ± 0.498
2.874ProGlu: 2.874 ± 0.812
0.697ProPhe: 0.697 ± 0.202
0.087ProGly: 0.087 ± 0.09
0.523ProHis: 0.523 ± 0.2
2.003ProIle: 2.003 ± 0.314
2.439ProLys: 2.439 ± 0.452
3.31ProLeu: 3.31 ± 0.505
0.261ProMet: 0.261 ± 0.158
1.916ProAsn: 1.916 ± 0.387
1.045ProPro: 1.045 ± 0.378
1.481ProGln: 1.481 ± 0.318
2.352ProArg: 2.352 ± 0.454
2.003ProSer: 2.003 ± 0.49
3.049ProThr: 3.049 ± 0.614
1.916ProVal: 1.916 ± 0.394
0.61ProTrp: 0.61 ± 0.307
1.568ProTyr: 1.568 ± 0.375
0.0ProXaa: 0.0 ± 0.0
Gln
4.355GlnAla: 4.355 ± 0.755
0.087GlnCys: 0.087 ± 0.097
2.178GlnAsp: 2.178 ± 0.657
1.568GlnGlu: 1.568 ± 0.36
1.742GlnPhe: 1.742 ± 0.325
3.049GlnGly: 3.049 ± 0.581
0.523GlnHis: 0.523 ± 0.176
2.352GlnIle: 2.352 ± 0.339
2.526GlnLys: 2.526 ± 0.501
4.616GlnLeu: 4.616 ± 0.65
1.132GlnMet: 1.132 ± 0.359
1.829GlnAsn: 1.829 ± 0.38
0.958GlnPro: 0.958 ± 0.267
1.655GlnGln: 1.655 ± 0.387
1.219GlnArg: 1.219 ± 0.345
2.439GlnSer: 2.439 ± 0.549
2.003GlnThr: 2.003 ± 0.407
2.09GlnVal: 2.09 ± 0.424
0.523GlnTrp: 0.523 ± 0.206
1.394GlnTyr: 1.394 ± 0.431
0.0GlnXaa: 0.0 ± 0.0
Arg
3.397ArgAla: 3.397 ± 0.589
0.871ArgCys: 0.871 ± 0.241
3.136ArgAsp: 3.136 ± 0.675
2.439ArgGlu: 2.439 ± 0.559
2.178ArgPhe: 2.178 ± 0.448
3.832ArgGly: 3.832 ± 0.689
0.697ArgHis: 0.697 ± 0.311
2.787ArgIle: 2.787 ± 0.553
3.92ArgLys: 3.92 ± 0.715
3.484ArgLeu: 3.484 ± 0.66
1.132ArgMet: 1.132 ± 0.272
1.916ArgAsn: 1.916 ± 0.472
1.394ArgPro: 1.394 ± 0.32
2.003ArgGln: 2.003 ± 0.329
3.397ArgArg: 3.397 ± 0.82
2.265ArgSer: 2.265 ± 0.706
2.787ArgThr: 2.787 ± 0.429
2.7ArgVal: 2.7 ± 0.455
0.871ArgTrp: 0.871 ± 0.268
2.874ArgTyr: 2.874 ± 0.486
0.0ArgXaa: 0.0 ± 0.0
Ser
4.791SerAla: 4.791 ± 0.647
0.261SerCys: 0.261 ± 0.154
4.181SerAsp: 4.181 ± 0.699
3.223SerGlu: 3.223 ± 0.605
1.568SerPhe: 1.568 ± 0.43
5.052SerGly: 5.052 ± 0.759
0.61SerHis: 0.61 ± 0.207
2.787SerIle: 2.787 ± 0.467
3.832SerLys: 3.832 ± 0.538
2.961SerLeu: 2.961 ± 0.554
1.394SerMet: 1.394 ± 0.366
2.352SerAsn: 2.352 ± 0.476
1.394SerPro: 1.394 ± 0.34
1.742SerGln: 1.742 ± 0.38
2.003SerArg: 2.003 ± 0.494
3.223SerSer: 3.223 ± 0.574
3.745SerThr: 3.745 ± 0.825
3.31SerVal: 3.31 ± 0.558
0.784SerTrp: 0.784 ± 0.379
2.09SerTyr: 2.09 ± 0.262
0.0SerXaa: 0.0 ± 0.0
Thr
5.574ThrAla: 5.574 ± 0.702
0.784ThrCys: 0.784 ± 0.309
4.007ThrAsp: 4.007 ± 0.567
3.484ThrGlu: 3.484 ± 0.915
2.265ThrPhe: 2.265 ± 0.451
5.662ThrGly: 5.662 ± 0.831
1.045ThrHis: 1.045 ± 0.29
4.268ThrIle: 4.268 ± 0.62
4.529ThrLys: 4.529 ± 0.803
3.571ThrLeu: 3.571 ± 0.462
0.871ThrMet: 0.871 ± 0.299
3.049ThrAsn: 3.049 ± 0.585
3.397ThrPro: 3.397 ± 0.94
2.787ThrGln: 2.787 ± 0.487
2.526ThrArg: 2.526 ± 0.537
4.007ThrSer: 4.007 ± 0.553
6.62ThrThr: 6.62 ± 2.654
5.487ThrVal: 5.487 ± 0.851
0.61ThrTrp: 0.61 ± 0.242
1.742ThrTyr: 1.742 ± 0.358
0.0ThrXaa: 0.0 ± 0.0
Val
5.052ValAla: 5.052 ± 0.604
0.61ValCys: 0.61 ± 0.233
5.226ValAsp: 5.226 ± 0.781
4.442ValGlu: 4.442 ± 0.588
3.136ValPhe: 3.136 ± 0.667
4.181ValGly: 4.181 ± 0.623
0.958ValHis: 0.958 ± 0.271
4.355ValIle: 4.355 ± 0.579
4.268ValLys: 4.268 ± 0.873
5.574ValLeu: 5.574 ± 0.681
1.655ValMet: 1.655 ± 0.421
3.397ValAsn: 3.397 ± 0.561
2.178ValPro: 2.178 ± 0.449
2.613ValGln: 2.613 ± 0.574
3.658ValArg: 3.658 ± 0.448
2.265ValSer: 2.265 ± 0.513
4.529ValThr: 4.529 ± 0.571
4.268ValVal: 4.268 ± 0.554
0.958ValTrp: 0.958 ± 0.238
2.178ValTyr: 2.178 ± 0.383
0.0ValXaa: 0.0 ± 0.0
Trp
1.219TrpAla: 1.219 ± 0.368
0.261TrpCys: 0.261 ± 0.143
0.348TrpAsp: 0.348 ± 0.196
0.958TrpGlu: 0.958 ± 0.265
0.436TrpPhe: 0.436 ± 0.153
1.045TrpGly: 1.045 ± 0.386
0.174TrpHis: 0.174 ± 0.12
0.523TrpIle: 0.523 ± 0.207
0.61TrpLys: 0.61 ± 0.207
1.132TrpLeu: 1.132 ± 0.452
0.087TrpMet: 0.087 ± 0.083
0.61TrpAsn: 0.61 ± 0.208
0.0TrpPro: 0.0 ± 0.0
0.871TrpGln: 0.871 ± 0.312
0.784TrpArg: 0.784 ± 0.375
0.61TrpSer: 0.61 ± 0.271
0.784TrpThr: 0.784 ± 0.232
0.871TrpVal: 0.871 ± 0.361
0.261TrpTrp: 0.261 ± 0.142
1.045TrpTyr: 1.045 ± 0.31
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.397TyrAla: 3.397 ± 0.631
0.436TyrCys: 0.436 ± 0.191
3.223TyrAsp: 3.223 ± 0.483
2.526TyrGlu: 2.526 ± 0.437
1.742TyrPhe: 1.742 ± 0.385
1.568TyrGly: 1.568 ± 0.66
0.697TyrHis: 0.697 ± 0.258
2.178TyrIle: 2.178 ± 0.433
2.178TyrLys: 2.178 ± 0.51
3.049TyrLeu: 3.049 ± 0.412
0.871TyrMet: 0.871 ± 0.365
1.307TyrAsn: 1.307 ± 0.355
1.568TyrPro: 1.568 ± 0.469
1.655TyrGln: 1.655 ± 0.4
2.003TyrArg: 2.003 ± 0.45
2.352TyrSer: 2.352 ± 0.414
2.526TyrThr: 2.526 ± 0.485
2.787TyrVal: 2.787 ± 0.461
0.436TyrTrp: 0.436 ± 0.209
1.394TyrTyr: 1.394 ± 0.356
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 43 proteins (11482 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski