Amino acid dipepetide frequency for Lactobacillus phage JNU_P11

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.73AlaAla: 3.73 ± 1.051
0.355AlaCys: 0.355 ± 0.15
4.352AlaAsp: 4.352 ± 0.553
3.819AlaGlu: 3.819 ± 0.622
2.575AlaPhe: 2.575 ± 0.447
5.24AlaGly: 5.24 ± 1.002
0.888AlaHis: 0.888 ± 0.254
6.306AlaIle: 6.306 ± 1.422
7.46AlaLys: 7.46 ± 0.722
5.151AlaLeu: 5.151 ± 0.641
2.22AlaMet: 2.22 ± 0.605
4.44AlaAsn: 4.44 ± 0.618
1.155AlaPro: 1.155 ± 0.322
4.352AlaGln: 4.352 ± 0.965
3.197AlaArg: 3.197 ± 0.572
3.464AlaSer: 3.464 ± 0.728
4.174AlaThr: 4.174 ± 0.669
5.062AlaVal: 5.062 ± 0.867
0.977AlaTrp: 0.977 ± 0.295
2.487AlaTyr: 2.487 ± 0.343
0.0AlaXaa: 0.0 ± 0.0
Cys
0.622CysAla: 0.622 ± 0.19
0.0CysCys: 0.0 ± 0.0
0.622CysAsp: 0.622 ± 0.247
0.355CysGlu: 0.355 ± 0.151
0.355CysPhe: 0.355 ± 0.201
0.622CysGly: 0.622 ± 0.242
0.089CysHis: 0.089 ± 0.077
0.089CysIle: 0.089 ± 0.077
0.266CysLys: 0.266 ± 0.183
0.444CysLeu: 0.444 ± 0.179
0.266CysMet: 0.266 ± 0.137
0.089CysAsn: 0.089 ± 0.082
0.089CysPro: 0.089 ± 0.082
0.178CysGln: 0.178 ± 0.107
0.178CysArg: 0.178 ± 0.108
0.71CysSer: 0.71 ± 0.271
0.266CysThr: 0.266 ± 0.149
0.533CysVal: 0.533 ± 0.267
0.178CysTrp: 0.178 ± 0.116
0.178CysTyr: 0.178 ± 0.131
0.0CysXaa: 0.0 ± 0.0
Asp
4.796AspAla: 4.796 ± 0.576
0.444AspCys: 0.444 ± 0.172
6.039AspAsp: 6.039 ± 0.909
4.885AspGlu: 4.885 ± 0.752
2.487AspPhe: 2.487 ± 0.364
5.062AspGly: 5.062 ± 0.766
0.799AspHis: 0.799 ± 0.267
4.707AspIle: 4.707 ± 0.555
6.039AspLys: 6.039 ± 0.867
5.684AspLeu: 5.684 ± 0.713
1.954AspMet: 1.954 ± 0.422
4.263AspAsn: 4.263 ± 0.691
2.664AspPro: 2.664 ± 0.453
2.842AspGln: 2.842 ± 0.583
2.043AspArg: 2.043 ± 0.42
4.174AspSer: 4.174 ± 0.525
4.352AspThr: 4.352 ± 0.728
3.286AspVal: 3.286 ± 0.495
1.776AspTrp: 1.776 ± 0.4
3.908AspTyr: 3.908 ± 0.722
0.0AspXaa: 0.0 ± 0.0
Glu
3.197GluAla: 3.197 ± 0.403
0.355GluCys: 0.355 ± 0.195
5.329GluAsp: 5.329 ± 0.758
3.552GluGlu: 3.552 ± 0.837
2.398GluPhe: 2.398 ± 0.539
2.753GluGly: 2.753 ± 0.518
0.977GluHis: 0.977 ± 0.311
4.529GluIle: 4.529 ± 0.705
5.506GluLys: 5.506 ± 0.973
5.062GluLeu: 5.062 ± 0.738
1.243GluMet: 1.243 ± 0.276
3.908GluAsn: 3.908 ± 0.692
1.421GluPro: 1.421 ± 0.396
2.398GluGln: 2.398 ± 0.357
2.309GluArg: 2.309 ± 0.474
2.398GluSer: 2.398 ± 0.478
2.398GluThr: 2.398 ± 0.419
4.352GluVal: 4.352 ± 0.821
0.622GluTrp: 0.622 ± 0.207
2.753GluTyr: 2.753 ± 0.513
0.0GluXaa: 0.0 ± 0.0
Phe
1.865PheAla: 1.865 ± 0.368
0.266PheCys: 0.266 ± 0.144
3.02PheAsp: 3.02 ± 0.554
3.197PheGlu: 3.197 ± 0.498
1.954PhePhe: 1.954 ± 0.596
2.487PheGly: 2.487 ± 0.404
0.444PheHis: 0.444 ± 0.199
1.865PheIle: 1.865 ± 0.339
3.286PheLys: 3.286 ± 0.593
2.842PheLeu: 2.842 ± 0.464
0.799PheMet: 0.799 ± 0.326
2.664PheAsn: 2.664 ± 0.53
1.421PhePro: 1.421 ± 0.359
0.888PheGln: 0.888 ± 0.341
1.51PheArg: 1.51 ± 0.42
2.842PheSer: 2.842 ± 0.796
2.309PheThr: 2.309 ± 0.443
2.22PheVal: 2.22 ± 0.447
0.266PheTrp: 0.266 ± 0.166
1.243PheTyr: 1.243 ± 0.291
0.0PheXaa: 0.0 ± 0.0
Gly
3.73GlyAla: 3.73 ± 0.743
0.0GlyCys: 0.0 ± 0.0
3.197GlyAsp: 3.197 ± 0.496
2.664GlyGlu: 2.664 ± 0.566
2.575GlyPhe: 2.575 ± 0.402
3.641GlyGly: 3.641 ± 0.659
1.421GlyHis: 1.421 ± 0.311
6.039GlyIle: 6.039 ± 1.265
4.529GlyLys: 4.529 ± 0.533
6.039GlyLeu: 6.039 ± 1.166
2.575GlyMet: 2.575 ± 0.471
3.819GlyAsn: 3.819 ± 0.852
0.71GlyPro: 0.71 ± 0.289
2.398GlyGln: 2.398 ± 0.336
1.776GlyArg: 1.776 ± 0.354
3.197GlySer: 3.197 ± 0.536
4.44GlyThr: 4.44 ± 0.681
4.263GlyVal: 4.263 ± 0.517
1.155GlyTrp: 1.155 ± 0.311
3.197GlyTyr: 3.197 ± 0.574
0.0GlyXaa: 0.0 ± 0.0
His
0.888HisAla: 0.888 ± 0.269
0.178HisCys: 0.178 ± 0.115
1.332HisAsp: 1.332 ± 0.33
0.799HisGlu: 0.799 ± 0.284
0.799HisPhe: 0.799 ± 0.246
1.155HisGly: 1.155 ± 0.281
0.178HisHis: 0.178 ± 0.13
0.977HisIle: 0.977 ± 0.283
1.421HisLys: 1.421 ± 0.36
1.51HisLeu: 1.51 ± 0.36
0.266HisMet: 0.266 ± 0.152
0.71HisAsn: 0.71 ± 0.257
0.622HisPro: 0.622 ± 0.285
0.0HisGln: 0.0 ± 0.0
0.178HisArg: 0.178 ± 0.132
1.155HisSer: 1.155 ± 0.415
0.799HisThr: 0.799 ± 0.271
0.799HisVal: 0.799 ± 0.269
0.178HisTrp: 0.178 ± 0.116
0.977HisTyr: 0.977 ± 0.277
0.0HisXaa: 0.0 ± 0.0
Ile
4.885IleAla: 4.885 ± 0.725
0.533IleCys: 0.533 ± 0.224
5.595IleAsp: 5.595 ± 0.789
3.996IleGlu: 3.996 ± 0.642
3.108IlePhe: 3.108 ± 0.592
4.263IleGly: 4.263 ± 0.777
0.444IleHis: 0.444 ± 0.283
3.641IleIle: 3.641 ± 0.588
7.194IleLys: 7.194 ± 0.953
4.796IleLeu: 4.796 ± 0.705
1.687IleMet: 1.687 ± 0.346
5.861IleAsn: 5.861 ± 0.558
2.043IlePro: 2.043 ± 0.402
1.954IleGln: 1.954 ± 0.417
3.108IleArg: 3.108 ± 0.53
3.908IleSer: 3.908 ± 0.553
3.73IleThr: 3.73 ± 0.51
5.24IleVal: 5.24 ± 0.994
1.687IleTrp: 1.687 ± 0.649
2.131IleTyr: 2.131 ± 0.513
0.0IleXaa: 0.0 ± 0.0
Lys
7.194LysAla: 7.194 ± 0.781
0.444LysCys: 0.444 ± 0.205
5.151LysAsp: 5.151 ± 0.645
6.572LysGlu: 6.572 ± 1.008
2.575LysPhe: 2.575 ± 0.414
3.819LysGly: 3.819 ± 0.398
1.776LysHis: 1.776 ± 0.429
6.039LysIle: 6.039 ± 0.708
8.881LysLys: 8.881 ± 1.218
6.039LysLeu: 6.039 ± 1.047
2.22LysMet: 2.22 ± 0.468
5.24LysAsn: 5.24 ± 0.627
3.641LysPro: 3.641 ± 0.677
3.908LysGln: 3.908 ± 0.681
3.641LysArg: 3.641 ± 0.788
5.95LysSer: 5.95 ± 0.84
5.684LysThr: 5.684 ± 0.83
4.529LysVal: 4.529 ± 0.562
1.599LysTrp: 1.599 ± 0.445
3.641LysTyr: 3.641 ± 0.564
0.0LysXaa: 0.0 ± 0.0
Leu
5.417LeuAla: 5.417 ± 0.599
0.71LeuCys: 0.71 ± 0.237
6.661LeuAsp: 6.661 ± 0.744
4.174LeuGlu: 4.174 ± 0.844
2.309LeuPhe: 2.309 ± 0.404
3.908LeuGly: 3.908 ± 0.492
0.888LeuHis: 0.888 ± 0.359
5.329LeuIle: 5.329 ± 1.007
8.437LeuLys: 8.437 ± 0.942
6.394LeuLeu: 6.394 ± 0.913
0.888LeuMet: 0.888 ± 0.271
5.595LeuAsn: 5.595 ± 0.617
2.487LeuPro: 2.487 ± 0.431
2.931LeuGln: 2.931 ± 0.441
2.753LeuArg: 2.753 ± 0.622
5.506LeuSer: 5.506 ± 0.624
5.684LeuThr: 5.684 ± 0.526
4.973LeuVal: 4.973 ± 0.679
1.243LeuTrp: 1.243 ± 0.466
2.931LeuTyr: 2.931 ± 0.519
0.0LeuXaa: 0.0 ± 0.0
Met
3.02MetAla: 3.02 ± 0.422
0.266MetCys: 0.266 ± 0.163
1.51MetAsp: 1.51 ± 0.317
1.243MetGlu: 1.243 ± 0.293
0.622MetPhe: 0.622 ± 0.202
1.332MetGly: 1.332 ± 0.295
0.266MetHis: 0.266 ± 0.199
1.421MetIle: 1.421 ± 0.313
2.309MetLys: 2.309 ± 0.485
1.243MetLeu: 1.243 ± 0.316
0.444MetMet: 0.444 ± 0.169
2.131MetAsn: 2.131 ± 0.43
0.71MetPro: 0.71 ± 0.29
1.066MetGln: 1.066 ± 0.328
0.977MetArg: 0.977 ± 0.316
1.243MetSer: 1.243 ± 0.262
1.687MetThr: 1.687 ± 0.364
1.687MetVal: 1.687 ± 0.258
0.178MetTrp: 0.178 ± 0.116
0.799MetTyr: 0.799 ± 0.273
0.0MetXaa: 0.0 ± 0.0
Asn
5.329AsnAla: 5.329 ± 0.994
0.444AsnCys: 0.444 ± 0.209
4.174AsnAsp: 4.174 ± 0.798
3.552AsnGlu: 3.552 ± 0.56
2.398AsnPhe: 2.398 ± 0.369
5.329AsnGly: 5.329 ± 0.731
1.155AsnHis: 1.155 ± 0.268
3.552AsnIle: 3.552 ± 0.669
4.973AsnLys: 4.973 ± 0.735
4.44AsnLeu: 4.44 ± 0.494
1.599AsnMet: 1.599 ± 0.378
3.02AsnAsn: 3.02 ± 0.526
2.22AsnPro: 2.22 ± 0.389
3.108AsnGln: 3.108 ± 0.512
2.131AsnArg: 2.131 ± 0.589
3.996AsnSer: 3.996 ± 0.444
2.664AsnThr: 2.664 ± 0.633
4.174AsnVal: 4.174 ± 0.591
1.332AsnTrp: 1.332 ± 0.284
3.286AsnTyr: 3.286 ± 0.558
0.0AsnXaa: 0.0 ± 0.0
Pro
2.22ProAla: 2.22 ± 0.501
0.0ProCys: 0.0 ± 0.0
2.487ProAsp: 2.487 ± 0.579
1.865ProGlu: 1.865 ± 0.462
1.155ProPhe: 1.155 ± 0.285
0.888ProGly: 0.888 ± 0.259
0.444ProHis: 0.444 ± 0.224
2.131ProIle: 2.131 ± 0.451
2.575ProLys: 2.575 ± 0.573
2.22ProLeu: 2.22 ± 0.44
0.266ProMet: 0.266 ± 0.133
1.776ProAsn: 1.776 ± 0.356
0.178ProPro: 0.178 ± 0.135
1.776ProGln: 1.776 ± 0.256
0.977ProArg: 0.977 ± 0.263
1.776ProSer: 1.776 ± 0.419
2.842ProThr: 2.842 ± 0.478
1.776ProVal: 1.776 ± 0.444
0.266ProTrp: 0.266 ± 0.158
1.421ProTyr: 1.421 ± 0.386
0.0ProXaa: 0.0 ± 0.0
Gln
4.263GlnAla: 4.263 ± 0.747
0.178GlnCys: 0.178 ± 0.112
2.575GlnAsp: 2.575 ± 0.427
2.753GlnGlu: 2.753 ± 0.457
1.954GlnPhe: 1.954 ± 0.309
3.108GlnGly: 3.108 ± 0.83
0.799GlnHis: 0.799 ± 0.244
3.286GlnIle: 3.286 ± 0.533
4.263GlnLys: 4.263 ± 0.637
4.618GlnLeu: 4.618 ± 0.688
1.421GlnMet: 1.421 ± 0.312
1.776GlnAsn: 1.776 ± 0.373
1.687GlnPro: 1.687 ± 0.45
2.664GlnGln: 2.664 ± 0.459
1.599GlnArg: 1.599 ± 0.39
2.043GlnSer: 2.043 ± 0.415
2.487GlnThr: 2.487 ± 0.454
2.575GlnVal: 2.575 ± 0.471
0.355GlnTrp: 0.355 ± 0.192
1.599GlnTyr: 1.599 ± 0.316
0.0GlnXaa: 0.0 ± 0.0
Arg
1.865ArgAla: 1.865 ± 0.331
0.355ArgCys: 0.355 ± 0.228
1.332ArgAsp: 1.332 ± 0.357
2.842ArgGlu: 2.842 ± 0.539
1.51ArgPhe: 1.51 ± 0.392
2.664ArgGly: 2.664 ± 0.499
0.71ArgHis: 0.71 ± 0.245
3.197ArgIle: 3.197 ± 0.614
3.02ArgLys: 3.02 ± 0.812
3.996ArgLeu: 3.996 ± 0.622
0.622ArgMet: 0.622 ± 0.244
2.753ArgAsn: 2.753 ± 0.384
0.977ArgPro: 0.977 ± 0.356
1.865ArgGln: 1.865 ± 0.475
1.155ArgArg: 1.155 ± 0.349
1.332ArgSer: 1.332 ± 0.354
1.776ArgThr: 1.776 ± 0.419
2.398ArgVal: 2.398 ± 0.467
0.622ArgTrp: 0.622 ± 0.193
1.599ArgTyr: 1.599 ± 0.454
0.0ArgXaa: 0.0 ± 0.0
Ser
3.908SerAla: 3.908 ± 0.694
0.266SerCys: 0.266 ± 0.165
4.174SerAsp: 4.174 ± 0.664
2.664SerGlu: 2.664 ± 0.347
3.108SerPhe: 3.108 ± 0.625
4.529SerGly: 4.529 ± 0.779
0.799SerHis: 0.799 ± 0.261
4.618SerIle: 4.618 ± 0.814
4.885SerLys: 4.885 ± 0.748
5.329SerLeu: 5.329 ± 0.631
1.243SerMet: 1.243 ± 0.302
4.529SerAsn: 4.529 ± 0.707
1.51SerPro: 1.51 ± 0.336
2.753SerGln: 2.753 ± 0.532
1.51SerArg: 1.51 ± 0.318
3.908SerSer: 3.908 ± 0.541
2.487SerThr: 2.487 ± 0.408
4.618SerVal: 4.618 ± 0.585
1.155SerTrp: 1.155 ± 0.367
2.575SerTyr: 2.575 ± 0.595
0.0SerXaa: 0.0 ± 0.0
Thr
4.529ThrAla: 4.529 ± 1.006
0.266ThrCys: 0.266 ± 0.149
5.062ThrAsp: 5.062 ± 0.723
2.487ThrGlu: 2.487 ± 0.538
2.131ThrPhe: 2.131 ± 0.432
3.73ThrGly: 3.73 ± 0.481
0.977ThrHis: 0.977 ± 0.354
4.529ThrIle: 4.529 ± 0.667
4.885ThrLys: 4.885 ± 0.58
3.819ThrLeu: 3.819 ± 0.537
0.888ThrMet: 0.888 ± 0.238
2.931ThrAsn: 2.931 ± 0.475
1.954ThrPro: 1.954 ± 0.437
2.664ThrGln: 2.664 ± 0.437
2.398ThrArg: 2.398 ± 0.508
4.529ThrSer: 4.529 ± 0.743
4.352ThrThr: 4.352 ± 0.631
5.151ThrVal: 5.151 ± 1.072
1.155ThrTrp: 1.155 ± 0.302
1.865ThrTyr: 1.865 ± 0.393
0.0ThrXaa: 0.0 ± 0.0
Val
6.217ValAla: 6.217 ± 1.111
0.533ValCys: 0.533 ± 0.198
4.885ValAsp: 4.885 ± 0.95
2.931ValGlu: 2.931 ± 0.528
1.51ValPhe: 1.51 ± 0.351
2.753ValGly: 2.753 ± 0.451
1.066ValHis: 1.066 ± 0.371
3.996ValIle: 3.996 ± 0.579
4.618ValLys: 4.618 ± 0.608
3.73ValLeu: 3.73 ± 0.559
2.131ValMet: 2.131 ± 0.471
4.174ValAsn: 4.174 ± 0.619
2.131ValPro: 2.131 ± 0.506
4.174ValGln: 4.174 ± 0.816
2.309ValArg: 2.309 ± 0.408
5.062ValSer: 5.062 ± 0.638
5.329ValThr: 5.329 ± 0.801
3.197ValVal: 3.197 ± 0.632
1.332ValTrp: 1.332 ± 0.552
1.954ValTyr: 1.954 ± 0.436
0.0ValXaa: 0.0 ± 0.0
Trp
1.243TrpAla: 1.243 ± 0.272
0.089TrpCys: 0.089 ± 0.099
1.243TrpAsp: 1.243 ± 0.27
0.444TrpGlu: 0.444 ± 0.183
0.178TrpPhe: 0.178 ± 0.11
1.066TrpGly: 1.066 ± 0.306
0.089TrpHis: 0.089 ± 0.092
1.066TrpIle: 1.066 ± 0.288
1.066TrpLys: 1.066 ± 0.286
1.687TrpLeu: 1.687 ± 0.302
0.089TrpMet: 0.089 ± 0.075
1.865TrpAsn: 1.865 ± 0.811
0.444TrpPro: 0.444 ± 0.171
1.687TrpGln: 1.687 ± 0.639
0.888TrpArg: 0.888 ± 0.263
0.977TrpSer: 0.977 ± 0.33
0.622TrpThr: 0.622 ± 0.197
0.977TrpVal: 0.977 ± 0.314
0.444TrpTrp: 0.444 ± 0.2
0.888TrpTyr: 0.888 ± 0.318
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.487TyrAla: 2.487 ± 0.421
0.444TyrCys: 0.444 ± 0.248
3.464TyrAsp: 3.464 ± 0.479
2.664TyrGlu: 2.664 ± 0.632
1.687TyrPhe: 1.687 ± 0.332
3.02TyrGly: 3.02 ± 0.649
0.71TyrHis: 0.71 ± 0.229
2.487TyrIle: 2.487 ± 0.458
3.02TyrLys: 3.02 ± 0.547
4.174TyrLeu: 4.174 ± 0.721
1.243TyrMet: 1.243 ± 0.359
1.332TyrAsn: 1.332 ± 0.347
0.977TyrPro: 0.977 ± 0.271
2.398TyrGln: 2.398 ± 0.43
1.954TyrArg: 1.954 ± 0.447
2.398TyrSer: 2.398 ± 0.512
2.22TyrThr: 2.22 ± 0.426
2.22TyrVal: 2.22 ± 0.457
0.533TyrTrp: 0.533 ± 0.176
1.51TyrTyr: 1.51 ± 0.363
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (11261 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski