Amino acid dipepetide frequency for Lactococcus phage 98104

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.151AlaAla: 3.151 ± 0.629
0.649AlaCys: 0.649 ± 0.246
3.799AlaAsp: 3.799 ± 0.63
3.985AlaGlu: 3.985 ± 0.775
2.965AlaPhe: 2.965 ± 0.504
3.336AlaGly: 3.336 ± 0.672
0.649AlaHis: 0.649 ± 0.233
5.282AlaIle: 5.282 ± 1.001
5.375AlaLys: 5.375 ± 0.622
6.116AlaLeu: 6.116 ± 0.673
1.575AlaMet: 1.575 ± 0.287
5.097AlaAsn: 5.097 ± 0.602
1.575AlaPro: 1.575 ± 0.448
3.429AlaGln: 3.429 ± 0.588
2.595AlaArg: 2.595 ± 0.54
3.521AlaSer: 3.521 ± 0.547
4.17AlaThr: 4.17 ± 0.51
4.077AlaVal: 4.077 ± 0.603
1.761AlaTrp: 1.761 ± 0.526
2.039AlaTyr: 2.039 ± 0.368
0.0AlaXaa: 0.0 ± 0.0
Cys
0.093CysAla: 0.093 ± 0.088
0.0CysCys: 0.0 ± 0.0
0.741CysAsp: 0.741 ± 0.253
0.649CysGlu: 0.649 ± 0.219
0.185CysPhe: 0.185 ± 0.124
0.463CysGly: 0.463 ± 0.237
0.371CysHis: 0.371 ± 0.261
0.185CysIle: 0.185 ± 0.121
0.834CysLys: 0.834 ± 0.26
0.371CysLeu: 0.371 ± 0.206
0.278CysMet: 0.278 ± 0.193
0.185CysAsn: 0.185 ± 0.145
0.278CysPro: 0.278 ± 0.156
0.0CysGln: 0.0 ± 0.0
0.185CysArg: 0.185 ± 0.118
0.834CysSer: 0.834 ± 0.32
0.278CysThr: 0.278 ± 0.162
0.185CysVal: 0.185 ± 0.111
0.093CysTrp: 0.093 ± 0.086
0.093CysTyr: 0.093 ± 0.082
0.0CysXaa: 0.0 ± 0.0
Asp
3.521AspAla: 3.521 ± 0.479
0.556AspCys: 0.556 ± 0.216
4.077AspAsp: 4.077 ± 0.61
6.116AspGlu: 6.116 ± 0.857
3.058AspPhe: 3.058 ± 0.511
5.375AspGly: 5.375 ± 0.715
0.463AspHis: 0.463 ± 0.217
5.004AspIle: 5.004 ± 0.708
5.004AspLys: 5.004 ± 0.497
4.17AspLeu: 4.17 ± 0.538
1.668AspMet: 1.668 ± 0.393
2.965AspAsn: 2.965 ± 0.49
1.112AspPro: 1.112 ± 0.415
1.297AspGln: 1.297 ± 0.307
2.409AspArg: 2.409 ± 0.393
5.097AspSer: 5.097 ± 0.615
3.892AspThr: 3.892 ± 0.626
3.799AspVal: 3.799 ± 0.47
1.205AspTrp: 1.205 ± 0.303
2.131AspTyr: 2.131 ± 0.369
0.0AspXaa: 0.0 ± 0.0
Glu
3.799GluAla: 3.799 ± 0.537
0.185GluCys: 0.185 ± 0.113
2.502GluAsp: 2.502 ± 0.46
6.765GluGlu: 6.765 ± 1.183
3.985GluPhe: 3.985 ± 0.51
2.965GluGly: 2.965 ± 0.531
1.112GluHis: 1.112 ± 0.339
4.726GluIle: 4.726 ± 0.529
7.692GluLys: 7.692 ± 1.171
7.97GluLeu: 7.97 ± 0.867
1.946GluMet: 1.946 ± 0.514
4.077GluAsn: 4.077 ± 0.677
2.317GluPro: 2.317 ± 0.541
3.336GluGln: 3.336 ± 0.573
3.336GluArg: 3.336 ± 0.653
3.336GluSer: 3.336 ± 0.473
4.355GluThr: 4.355 ± 0.654
5.375GluVal: 5.375 ± 0.839
1.297GluTrp: 1.297 ± 0.352
3.799GluTyr: 3.799 ± 0.603
0.0GluXaa: 0.0 ± 0.0
Phe
2.687PheAla: 2.687 ± 0.501
0.649PheCys: 0.649 ± 0.203
3.614PheAsp: 3.614 ± 0.429
3.058PheGlu: 3.058 ± 0.592
1.761PhePhe: 1.761 ± 0.408
3.058PheGly: 3.058 ± 0.548
0.556PheHis: 0.556 ± 0.235
2.687PheIle: 2.687 ± 0.508
4.633PheLys: 4.633 ± 0.651
2.502PheLeu: 2.502 ± 0.577
1.668PheMet: 1.668 ± 0.422
3.336PheAsn: 3.336 ± 0.612
0.649PhePro: 0.649 ± 0.296
1.668PheGln: 1.668 ± 0.398
0.927PheArg: 0.927 ± 0.285
3.243PheSer: 3.243 ± 0.541
2.873PheThr: 2.873 ± 0.636
2.409PheVal: 2.409 ± 0.522
0.278PheTrp: 0.278 ± 0.141
1.668PheTyr: 1.668 ± 0.381
0.0PheXaa: 0.0 ± 0.0
Gly
3.707GlyAla: 3.707 ± 0.629
0.463GlyCys: 0.463 ± 0.175
2.409GlyAsp: 2.409 ± 0.378
3.614GlyGlu: 3.614 ± 0.594
2.78GlyPhe: 2.78 ± 0.426
4.448GlyGly: 4.448 ± 0.701
0.649GlyHis: 0.649 ± 0.195
5.468GlyIle: 5.468 ± 0.601
5.931GlyLys: 5.931 ± 0.649
5.468GlyLeu: 5.468 ± 1.238
1.761GlyMet: 1.761 ± 0.518
3.429GlyAsn: 3.429 ± 0.729
0.741GlyPro: 0.741 ± 0.335
2.78GlyGln: 2.78 ± 0.591
2.317GlyArg: 2.317 ± 0.453
4.448GlySer: 4.448 ± 0.759
4.263GlyThr: 4.263 ± 0.651
3.892GlyVal: 3.892 ± 0.773
1.297GlyTrp: 1.297 ± 0.37
3.243GlyTyr: 3.243 ± 0.563
0.0GlyXaa: 0.0 ± 0.0
His
0.927HisAla: 0.927 ± 0.285
0.185HisCys: 0.185 ± 0.13
1.019HisAsp: 1.019 ± 0.255
1.761HisGlu: 1.761 ± 0.498
0.834HisPhe: 0.834 ± 0.285
1.019HisGly: 1.019 ± 0.298
0.371HisHis: 0.371 ± 0.148
0.556HisIle: 0.556 ± 0.226
0.556HisLys: 0.556 ± 0.176
0.834HisLeu: 0.834 ± 0.265
0.185HisMet: 0.185 ± 0.13
0.649HisAsn: 0.649 ± 0.218
0.371HisPro: 0.371 ± 0.163
0.556HisGln: 0.556 ± 0.225
0.463HisArg: 0.463 ± 0.203
0.927HisSer: 0.927 ± 0.34
0.278HisThr: 0.278 ± 0.178
0.556HisVal: 0.556 ± 0.235
0.185HisTrp: 0.185 ± 0.115
0.834HisTyr: 0.834 ± 0.228
0.0HisXaa: 0.0 ± 0.0
Ile
4.819IleAla: 4.819 ± 0.819
0.185IleCys: 0.185 ± 0.11
4.263IleAsp: 4.263 ± 0.571
5.468IleGlu: 5.468 ± 0.79
2.409IlePhe: 2.409 ± 0.531
3.521IleGly: 3.521 ± 0.636
0.834IleHis: 0.834 ± 0.44
3.429IleIle: 3.429 ± 0.592
7.136IleLys: 7.136 ± 0.803
4.355IleLeu: 4.355 ± 0.599
1.483IleMet: 1.483 ± 0.272
5.375IleAsn: 5.375 ± 0.852
1.761IlePro: 1.761 ± 0.365
2.965IleGln: 2.965 ± 0.523
2.409IleArg: 2.409 ± 0.349
5.097IleSer: 5.097 ± 0.688
4.541IleThr: 4.541 ± 0.551
3.707IleVal: 3.707 ± 0.861
0.556IleTrp: 0.556 ± 0.251
2.409IleTyr: 2.409 ± 0.484
0.0IleXaa: 0.0 ± 0.0
Lys
6.95LysAla: 6.95 ± 1.021
0.185LysCys: 0.185 ± 0.172
6.487LysAsp: 6.487 ± 0.711
6.302LysGlu: 6.302 ± 0.925
2.78LysPhe: 2.78 ± 0.501
6.302LysGly: 6.302 ± 0.866
1.853LysHis: 1.853 ± 0.437
6.209LysIle: 6.209 ± 0.676
8.989LysLys: 8.989 ± 1.152
6.858LysLeu: 6.858 ± 0.726
2.502LysMet: 2.502 ± 0.52
6.024LysAsn: 6.024 ± 0.846
2.317LysPro: 2.317 ± 0.43
5.375LysGln: 5.375 ± 0.706
3.892LysArg: 3.892 ± 0.755
5.19LysSer: 5.19 ± 0.626
4.819LysThr: 4.819 ± 0.742
4.17LysVal: 4.17 ± 0.732
0.834LysTrp: 0.834 ± 0.262
3.336LysTyr: 3.336 ± 0.537
0.0LysXaa: 0.0 ± 0.0
Leu
4.355LeuAla: 4.355 ± 0.687
0.834LeuCys: 0.834 ± 0.286
5.282LeuAsp: 5.282 ± 0.486
5.653LeuGlu: 5.653 ± 0.854
2.687LeuPhe: 2.687 ± 0.419
4.726LeuGly: 4.726 ± 0.631
0.649LeuHis: 0.649 ± 0.222
4.912LeuIle: 4.912 ± 0.702
7.321LeuLys: 7.321 ± 0.877
6.58LeuLeu: 6.58 ± 0.928
2.039LeuMet: 2.039 ± 0.453
6.024LeuAsn: 6.024 ± 0.748
2.78LeuPro: 2.78 ± 0.501
3.521LeuGln: 3.521 ± 0.629
2.039LeuArg: 2.039 ± 0.513
6.765LeuSer: 6.765 ± 0.689
5.468LeuThr: 5.468 ± 0.733
3.892LeuVal: 3.892 ± 0.634
1.297LeuTrp: 1.297 ± 0.643
2.502LeuTyr: 2.502 ± 0.407
0.0LeuXaa: 0.0 ± 0.0
Met
2.78MetAla: 2.78 ± 0.458
0.185MetCys: 0.185 ± 0.128
1.39MetAsp: 1.39 ± 0.352
1.853MetGlu: 1.853 ± 0.474
0.649MetPhe: 0.649 ± 0.183
1.297MetGly: 1.297 ± 0.321
0.371MetHis: 0.371 ± 0.183
1.39MetIle: 1.39 ± 0.376
1.946MetLys: 1.946 ± 0.435
2.039MetLeu: 2.039 ± 0.395
0.371MetMet: 0.371 ± 0.199
1.946MetAsn: 1.946 ± 0.374
0.556MetPro: 0.556 ± 0.245
1.205MetGln: 1.205 ± 0.283
1.112MetArg: 1.112 ± 0.405
1.761MetSer: 1.761 ± 0.444
3.336MetThr: 3.336 ± 0.578
0.834MetVal: 0.834 ± 0.244
0.278MetTrp: 0.278 ± 0.152
0.556MetTyr: 0.556 ± 0.238
0.0MetXaa: 0.0 ± 0.0
Asn
4.819AsnAla: 4.819 ± 0.767
0.278AsnCys: 0.278 ± 0.138
3.614AsnAsp: 3.614 ± 0.473
3.985AsnGlu: 3.985 ± 0.677
2.595AsnPhe: 2.595 ± 0.461
5.653AsnGly: 5.653 ± 0.967
0.834AsnHis: 0.834 ± 0.372
3.799AsnIle: 3.799 ± 0.543
5.282AsnLys: 5.282 ± 0.565
5.56AsnLeu: 5.56 ± 0.573
1.483AsnMet: 1.483 ± 0.445
4.17AsnAsn: 4.17 ± 0.498
1.946AsnPro: 1.946 ± 0.398
3.985AsnGln: 3.985 ± 0.764
1.853AsnArg: 1.853 ± 0.302
4.819AsnSer: 4.819 ± 0.755
2.687AsnThr: 2.687 ± 0.455
3.614AsnVal: 3.614 ± 0.569
0.649AsnTrp: 0.649 ± 0.227
2.131AsnTyr: 2.131 ± 0.421
0.0AsnXaa: 0.0 ± 0.0
Pro
1.112ProAla: 1.112 ± 0.339
0.093ProCys: 0.093 ± 0.094
2.131ProAsp: 2.131 ± 0.531
2.409ProGlu: 2.409 ± 0.381
1.112ProPhe: 1.112 ± 0.31
0.741ProGly: 0.741 ± 0.245
0.741ProHis: 0.741 ± 0.242
1.39ProIle: 1.39 ± 0.423
2.502ProLys: 2.502 ± 0.413
2.131ProLeu: 2.131 ± 0.433
0.463ProMet: 0.463 ± 0.177
1.668ProAsn: 1.668 ± 0.414
0.463ProPro: 0.463 ± 0.175
1.205ProGln: 1.205 ± 0.32
0.556ProArg: 0.556 ± 0.256
1.39ProSer: 1.39 ± 0.398
1.668ProThr: 1.668 ± 0.416
1.853ProVal: 1.853 ± 0.383
0.185ProTrp: 0.185 ± 0.132
0.927ProTyr: 0.927 ± 0.256
0.0ProXaa: 0.0 ± 0.0
Gln
4.541GlnAla: 4.541 ± 0.694
0.278GlnCys: 0.278 ± 0.146
1.297GlnAsp: 1.297 ± 0.454
3.985GlnGlu: 3.985 ± 0.45
1.39GlnPhe: 1.39 ± 0.391
2.409GlnGly: 2.409 ± 0.558
0.278GlnHis: 0.278 ± 0.15
2.965GlnIle: 2.965 ± 0.652
3.429GlnLys: 3.429 ± 0.617
3.614GlnLeu: 3.614 ± 0.762
1.112GlnMet: 1.112 ± 0.338
2.687GlnAsn: 2.687 ± 0.483
1.668GlnPro: 1.668 ± 0.386
2.873GlnGln: 2.873 ± 0.629
1.853GlnArg: 1.853 ± 0.475
1.668GlnSer: 1.668 ± 0.544
2.78GlnThr: 2.78 ± 0.428
3.243GlnVal: 3.243 ± 0.593
0.834GlnTrp: 0.834 ± 0.294
1.853GlnTyr: 1.853 ± 0.33
0.0GlnXaa: 0.0 ± 0.0
Arg
2.409ArgAla: 2.409 ± 0.484
0.185ArgCys: 0.185 ± 0.11
2.131ArgAsp: 2.131 ± 0.47
2.595ArgGlu: 2.595 ± 0.52
2.131ArgPhe: 2.131 ± 0.464
1.761ArgGly: 1.761 ± 0.383
0.093ArgHis: 0.093 ± 0.091
2.502ArgIle: 2.502 ± 0.422
4.263ArgLys: 4.263 ± 0.557
3.799ArgLeu: 3.799 ± 0.744
1.575ArgMet: 1.575 ± 0.334
2.039ArgAsn: 2.039 ± 0.374
0.834ArgPro: 0.834 ± 0.313
1.205ArgGln: 1.205 ± 0.302
1.39ArgArg: 1.39 ± 0.428
1.853ArgSer: 1.853 ± 0.363
1.761ArgThr: 1.761 ± 0.284
2.409ArgVal: 2.409 ± 0.509
0.371ArgTrp: 0.371 ± 0.195
1.112ArgTyr: 1.112 ± 0.432
0.0ArgXaa: 0.0 ± 0.0
Ser
4.263SerAla: 4.263 ± 0.885
0.463SerCys: 0.463 ± 0.186
6.024SerAsp: 6.024 ± 0.584
4.633SerGlu: 4.633 ± 0.654
3.799SerPhe: 3.799 ± 0.628
5.282SerGly: 5.282 ± 0.633
1.205SerHis: 1.205 ± 0.296
3.521SerIle: 3.521 ± 0.477
4.633SerLys: 4.633 ± 0.79
4.263SerLeu: 4.263 ± 0.532
1.483SerMet: 1.483 ± 0.41
4.355SerAsn: 4.355 ± 0.602
1.205SerPro: 1.205 ± 0.264
2.78SerGln: 2.78 ± 0.605
1.946SerArg: 1.946 ± 0.308
4.726SerSer: 4.726 ± 0.945
3.892SerThr: 3.892 ± 0.436
4.077SerVal: 4.077 ± 0.536
0.834SerTrp: 0.834 ± 0.214
2.687SerTyr: 2.687 ± 0.398
0.0SerXaa: 0.0 ± 0.0
Thr
4.912ThrAla: 4.912 ± 0.698
0.371ThrCys: 0.371 ± 0.207
4.077ThrAsp: 4.077 ± 0.594
4.448ThrGlu: 4.448 ± 0.833
3.151ThrPhe: 3.151 ± 0.498
4.726ThrGly: 4.726 ± 0.571
0.463ThrHis: 0.463 ± 0.22
5.097ThrIle: 5.097 ± 0.647
5.746ThrLys: 5.746 ± 0.63
4.541ThrLeu: 4.541 ± 0.614
1.205ThrMet: 1.205 ± 0.365
2.965ThrAsn: 2.965 ± 0.594
1.575ThrPro: 1.575 ± 0.319
1.668ThrGln: 1.668 ± 0.365
2.873ThrArg: 2.873 ± 0.487
2.873ThrSer: 2.873 ± 0.472
3.892ThrThr: 3.892 ± 0.489
4.448ThrVal: 4.448 ± 0.67
0.741ThrTrp: 0.741 ± 0.294
1.761ThrTyr: 1.761 ± 0.384
0.0ThrXaa: 0.0 ± 0.0
Val
3.521ValAla: 3.521 ± 0.577
0.278ValCys: 0.278 ± 0.146
4.726ValAsp: 4.726 ± 0.903
4.819ValGlu: 4.819 ± 0.735
2.595ValPhe: 2.595 ± 0.501
3.058ValGly: 3.058 ± 0.561
0.927ValHis: 0.927 ± 0.311
3.707ValIle: 3.707 ± 0.523
5.931ValLys: 5.931 ± 0.937
3.985ValLeu: 3.985 ± 0.499
1.297ValMet: 1.297 ± 0.308
3.614ValAsn: 3.614 ± 0.729
1.205ValPro: 1.205 ± 0.311
2.039ValGln: 2.039 ± 0.479
1.483ValArg: 1.483 ± 0.374
5.004ValSer: 5.004 ± 0.605
4.17ValThr: 4.17 ± 0.761
4.633ValVal: 4.633 ± 0.662
0.649ValTrp: 0.649 ± 0.198
1.761ValTyr: 1.761 ± 0.38
0.0ValXaa: 0.0 ± 0.0
Trp
1.205TrpAla: 1.205 ± 0.275
0.0TrpCys: 0.0 ± 0.0
0.927TrpAsp: 0.927 ± 0.431
0.927TrpGlu: 0.927 ± 0.303
0.834TrpPhe: 0.834 ± 0.22
0.371TrpGly: 0.371 ± 0.185
0.278TrpHis: 0.278 ± 0.148
1.668TrpIle: 1.668 ± 0.311
1.297TrpLys: 1.297 ± 0.345
0.649TrpLeu: 0.649 ± 0.256
0.185TrpMet: 0.185 ± 0.155
1.112TrpAsn: 1.112 ± 0.517
0.093TrpPro: 0.093 ± 0.082
1.112TrpGln: 1.112 ± 0.359
0.834TrpArg: 0.834 ± 0.254
0.649TrpSer: 0.649 ± 0.208
0.741TrpThr: 0.741 ± 0.25
0.649TrpVal: 0.649 ± 0.28
0.371TrpTrp: 0.371 ± 0.185
0.463TrpTyr: 0.463 ± 0.208
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.575TyrAla: 1.575 ± 0.355
0.278TyrCys: 0.278 ± 0.145
2.409TyrAsp: 2.409 ± 0.506
1.946TyrGlu: 1.946 ± 0.45
2.224TyrPhe: 2.224 ± 0.402
2.595TyrGly: 2.595 ± 0.546
0.371TyrHis: 0.371 ± 0.173
2.224TyrIle: 2.224 ± 0.465
2.873TyrLys: 2.873 ± 0.491
3.521TyrLeu: 3.521 ± 0.613
1.483TyrMet: 1.483 ± 0.379
2.039TyrAsn: 2.039 ± 0.515
1.297TyrPro: 1.297 ± 0.302
1.761TyrGln: 1.761 ± 0.424
1.946TyrArg: 1.946 ± 0.473
2.873TyrSer: 2.873 ± 0.498
1.668TyrThr: 1.668 ± 0.361
1.575TyrVal: 1.575 ± 0.379
0.649TyrTrp: 0.649 ± 0.225
1.019TyrTyr: 1.019 ± 0.301
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (10792 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski