Amino acid dipepetide frequency for Lactobacillus phage EV3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.521AlaAla: 3.521 ± 1.346
0.371AlaCys: 0.371 ± 0.147
3.521AlaAsp: 3.521 ± 0.453
3.614AlaGlu: 3.614 ± 0.852
3.243AlaPhe: 3.243 ± 0.599
3.892AlaGly: 3.892 ± 0.836
1.112AlaHis: 1.112 ± 0.317
4.726AlaIle: 4.726 ± 0.685
6.579AlaLys: 6.579 ± 0.755
4.818AlaLeu: 4.818 ± 0.772
1.39AlaMet: 1.39 ± 0.58
5.374AlaAsn: 5.374 ± 0.799
1.761AlaPro: 1.761 ± 0.372
2.502AlaGln: 2.502 ± 0.853
2.965AlaArg: 2.965 ± 0.706
3.799AlaSer: 3.799 ± 0.923
4.262AlaThr: 4.262 ± 0.899
3.243AlaVal: 3.243 ± 0.493
0.649AlaTrp: 0.649 ± 0.191
1.668AlaTyr: 1.668 ± 0.395
0.0AlaXaa: 0.0 ± 0.0
Cys
0.093CysAla: 0.093 ± 0.091
0.0CysCys: 0.0 ± 0.0
0.834CysAsp: 0.834 ± 0.289
0.371CysGlu: 0.371 ± 0.174
0.741CysPhe: 0.741 ± 0.258
0.463CysGly: 0.463 ± 0.199
0.0CysHis: 0.0 ± 0.0
0.463CysIle: 0.463 ± 0.214
0.463CysLys: 0.463 ± 0.202
0.649CysLeu: 0.649 ± 0.209
0.0CysMet: 0.0 ± 0.0
0.463CysAsn: 0.463 ± 0.236
0.185CysPro: 0.185 ± 0.119
0.093CysGln: 0.093 ± 0.101
0.185CysArg: 0.185 ± 0.108
0.093CysSer: 0.093 ± 0.08
0.093CysThr: 0.093 ± 0.091
0.463CysVal: 0.463 ± 0.183
0.0CysTrp: 0.0 ± 0.0
0.463CysTyr: 0.463 ± 0.189
0.0CysXaa: 0.0 ± 0.0
Asp
3.15AspAla: 3.15 ± 0.57
0.741AspCys: 0.741 ± 0.285
5.652AspAsp: 5.652 ± 0.871
4.355AspGlu: 4.355 ± 0.735
3.799AspPhe: 3.799 ± 0.773
5.096AspGly: 5.096 ± 0.734
0.834AspHis: 0.834 ± 0.253
5.096AspIle: 5.096 ± 0.668
5.93AspLys: 5.93 ± 0.801
5.56AspLeu: 5.56 ± 0.664
1.761AspMet: 1.761 ± 0.383
4.911AspAsn: 4.911 ± 0.907
2.687AspPro: 2.687 ± 0.714
1.946AspGln: 1.946 ± 0.449
1.483AspArg: 1.483 ± 0.27
5.096AspSer: 5.096 ± 0.816
2.687AspThr: 2.687 ± 0.517
5.282AspVal: 5.282 ± 0.74
1.575AspTrp: 1.575 ± 0.415
3.058AspTyr: 3.058 ± 0.525
0.0AspXaa: 0.0 ± 0.0
Glu
3.243GluAla: 3.243 ± 0.57
0.185GluCys: 0.185 ± 0.149
2.965GluAsp: 2.965 ± 0.496
2.131GluGlu: 2.131 ± 0.636
2.409GluPhe: 2.409 ± 0.502
2.687GluGly: 2.687 ± 0.491
1.297GluHis: 1.297 ± 0.286
4.077GluIle: 4.077 ± 0.686
4.818GluLys: 4.818 ± 0.861
4.355GluLeu: 4.355 ± 0.588
1.205GluMet: 1.205 ± 0.324
3.799GluAsn: 3.799 ± 0.596
1.668GluPro: 1.668 ± 0.385
2.039GluGln: 2.039 ± 0.509
2.317GluArg: 2.317 ± 0.454
3.521GluSer: 3.521 ± 0.588
2.595GluThr: 2.595 ± 0.484
3.243GluVal: 3.243 ± 0.521
0.463GluTrp: 0.463 ± 0.255
2.131GluTyr: 2.131 ± 0.449
0.0GluXaa: 0.0 ± 0.0
Phe
2.224PheAla: 2.224 ± 0.562
0.371PheCys: 0.371 ± 0.215
3.058PheAsp: 3.058 ± 0.682
2.039PheGlu: 2.039 ± 0.49
1.39PhePhe: 1.39 ± 0.291
2.872PheGly: 2.872 ± 0.44
1.019PheHis: 1.019 ± 0.315
3.243PheIle: 3.243 ± 0.656
4.262PheLys: 4.262 ± 0.579
3.336PheLeu: 3.336 ± 0.54
1.205PheMet: 1.205 ± 0.328
3.058PheAsn: 3.058 ± 0.564
1.946PhePro: 1.946 ± 0.431
1.853PheGln: 1.853 ± 0.424
1.668PheArg: 1.668 ± 0.356
2.78PheSer: 2.78 ± 0.629
1.946PheThr: 1.946 ± 0.605
2.131PheVal: 2.131 ± 0.387
0.278PheTrp: 0.278 ± 0.161
1.575PheTyr: 1.575 ± 0.277
0.0PheXaa: 0.0 ± 0.0
Gly
2.965GlyAla: 2.965 ± 0.567
0.278GlyCys: 0.278 ± 0.13
4.17GlyAsp: 4.17 ± 0.584
2.131GlyGlu: 2.131 ± 0.443
3.15GlyPhe: 3.15 ± 0.499
3.428GlyGly: 3.428 ± 0.917
1.297GlyHis: 1.297 ± 0.45
4.818GlyIle: 4.818 ± 0.632
5.93GlyLys: 5.93 ± 0.635
5.93GlyLeu: 5.93 ± 1.341
1.297GlyMet: 1.297 ± 0.378
5.467GlyAsn: 5.467 ± 0.796
0.834GlyPro: 0.834 ± 0.428
2.409GlyGln: 2.409 ± 0.46
3.243GlyArg: 3.243 ± 0.777
3.15GlySer: 3.15 ± 0.43
4.54GlyThr: 4.54 ± 0.713
4.077GlyVal: 4.077 ± 0.515
1.668GlyTrp: 1.668 ± 0.492
2.317GlyTyr: 2.317 ± 0.412
0.0GlyXaa: 0.0 ± 0.0
His
1.575HisAla: 1.575 ± 0.322
0.0HisCys: 0.0 ± 0.0
1.205HisAsp: 1.205 ± 0.346
0.927HisGlu: 0.927 ± 0.23
1.297HisPhe: 1.297 ± 0.402
1.483HisGly: 1.483 ± 0.328
0.463HisHis: 0.463 ± 0.278
1.112HisIle: 1.112 ± 0.329
0.927HisLys: 0.927 ± 0.326
1.483HisLeu: 1.483 ± 0.276
0.371HisMet: 0.371 ± 0.177
0.927HisAsn: 0.927 ± 0.25
0.278HisPro: 0.278 ± 0.192
1.019HisGln: 1.019 ± 0.302
0.556HisArg: 0.556 ± 0.212
1.668HisSer: 1.668 ± 0.414
0.834HisThr: 0.834 ± 0.263
1.39HisVal: 1.39 ± 0.353
0.371HisTrp: 0.371 ± 0.192
1.112HisTyr: 1.112 ± 0.382
0.0HisXaa: 0.0 ± 0.0
Ile
4.262IleAla: 4.262 ± 0.751
0.278IleCys: 0.278 ± 0.147
4.54IleAsp: 4.54 ± 0.744
3.336IleGlu: 3.336 ± 0.498
2.502IlePhe: 2.502 ± 0.534
3.428IleGly: 3.428 ± 0.644
1.205IleHis: 1.205 ± 0.254
3.15IleIle: 3.15 ± 0.465
6.672IleLys: 6.672 ± 0.69
4.262IleLeu: 4.262 ± 0.559
1.853IleMet: 1.853 ± 0.423
5.745IleAsn: 5.745 ± 0.643
3.243IlePro: 3.243 ± 0.539
2.872IleGln: 2.872 ± 0.561
2.131IleArg: 2.131 ± 0.336
4.17IleSer: 4.17 ± 0.468
2.965IleThr: 2.965 ± 0.61
3.706IleVal: 3.706 ± 0.54
0.741IleTrp: 0.741 ± 0.275
2.78IleTyr: 2.78 ± 0.528
0.0IleXaa: 0.0 ± 0.0
Lys
5.56LysAla: 5.56 ± 0.84
0.371LysCys: 0.371 ± 0.266
6.301LysAsp: 6.301 ± 0.782
5.93LysGlu: 5.93 ± 0.703
2.409LysPhe: 2.409 ± 0.481
4.726LysGly: 4.726 ± 0.574
1.483LysHis: 1.483 ± 0.362
5.096LysIle: 5.096 ± 0.586
6.857LysLys: 6.857 ± 0.824
7.042LysLeu: 7.042 ± 0.754
2.595LysMet: 2.595 ± 0.587
6.116LysAsn: 6.116 ± 0.82
3.706LysPro: 3.706 ± 0.521
5.096LysGln: 5.096 ± 1.011
4.448LysArg: 4.448 ± 0.813
5.374LysSer: 5.374 ± 0.768
5.004LysThr: 5.004 ± 0.797
4.355LysVal: 4.355 ± 0.811
1.112LysTrp: 1.112 ± 0.313
3.428LysTyr: 3.428 ± 0.843
0.0LysXaa: 0.0 ± 0.0
Leu
4.911LeuAla: 4.911 ± 0.811
0.556LeuCys: 0.556 ± 0.235
5.282LeuAsp: 5.282 ± 0.749
4.077LeuGlu: 4.077 ± 0.607
2.78LeuPhe: 2.78 ± 0.573
5.096LeuGly: 5.096 ± 0.636
1.019LeuHis: 1.019 ± 0.289
4.633LeuIle: 4.633 ± 0.631
7.042LeuLys: 7.042 ± 0.768
3.799LeuLeu: 3.799 ± 0.637
2.502LeuMet: 2.502 ± 0.601
6.764LeuAsn: 6.764 ± 0.872
2.78LeuPro: 2.78 ± 0.457
2.965LeuGln: 2.965 ± 0.497
1.946LeuArg: 1.946 ± 0.338
6.208LeuSer: 6.208 ± 0.788
7.598LeuThr: 7.598 ± 0.818
5.374LeuVal: 5.374 ± 0.512
0.463LeuTrp: 0.463 ± 0.183
2.502LeuTyr: 2.502 ± 0.463
0.0LeuXaa: 0.0 ± 0.0
Met
1.483MetAla: 1.483 ± 0.409
0.278MetCys: 0.278 ± 0.215
1.112MetAsp: 1.112 ± 0.395
0.649MetGlu: 0.649 ± 0.202
1.39MetPhe: 1.39 ± 0.385
0.556MetGly: 0.556 ± 0.194
0.185MetHis: 0.185 ± 0.127
1.946MetIle: 1.946 ± 0.329
1.946MetLys: 1.946 ± 0.373
1.39MetLeu: 1.39 ± 0.437
0.649MetMet: 0.649 ± 0.385
2.039MetAsn: 2.039 ± 0.364
0.927MetPro: 0.927 ± 0.239
1.019MetGln: 1.019 ± 0.249
0.834MetArg: 0.834 ± 0.253
2.224MetSer: 2.224 ± 0.524
2.409MetThr: 2.409 ± 0.569
1.575MetVal: 1.575 ± 0.438
0.463MetTrp: 0.463 ± 0.171
0.834MetTyr: 0.834 ± 0.291
0.0MetXaa: 0.0 ± 0.0
Asn
5.467AsnAla: 5.467 ± 0.503
0.834AsnCys: 0.834 ± 0.282
3.984AsnAsp: 3.984 ± 0.473
4.262AsnGlu: 4.262 ± 0.54
2.872AsnPhe: 2.872 ± 0.633
5.838AsnGly: 5.838 ± 0.648
1.112AsnHis: 1.112 ± 0.28
4.262AsnIle: 4.262 ± 0.636
5.93AsnLys: 5.93 ± 0.587
5.56AsnLeu: 5.56 ± 0.832
2.039AsnMet: 2.039 ± 0.406
5.096AsnAsn: 5.096 ± 0.65
2.872AsnPro: 2.872 ± 0.494
3.15AsnGln: 3.15 ± 0.47
2.595AsnArg: 2.595 ± 0.421
4.818AsnSer: 4.818 ± 0.64
3.892AsnThr: 3.892 ± 0.548
5.467AsnVal: 5.467 ± 0.529
1.39AsnTrp: 1.39 ± 0.287
3.243AsnTyr: 3.243 ± 0.678
0.0AsnXaa: 0.0 ± 0.0
Pro
2.131ProAla: 2.131 ± 0.68
0.185ProCys: 0.185 ± 0.122
2.595ProAsp: 2.595 ± 0.533
1.946ProGlu: 1.946 ± 0.368
1.668ProPhe: 1.668 ± 0.472
2.224ProGly: 2.224 ± 0.35
0.649ProHis: 0.649 ± 0.241
1.946ProIle: 1.946 ± 0.396
3.614ProLys: 3.614 ± 0.606
2.224ProLeu: 2.224 ± 0.417
0.371ProMet: 0.371 ± 0.148
2.502ProAsn: 2.502 ± 0.423
0.463ProPro: 0.463 ± 0.159
2.131ProGln: 2.131 ± 0.432
0.834ProArg: 0.834 ± 0.175
2.131ProSer: 2.131 ± 0.417
2.317ProThr: 2.317 ± 0.413
2.872ProVal: 2.872 ± 0.572
0.463ProTrp: 0.463 ± 0.258
1.39ProTyr: 1.39 ± 0.357
0.0ProXaa: 0.0 ± 0.0
Gln
3.614GlnAla: 3.614 ± 0.811
0.093GlnCys: 0.093 ± 0.091
3.243GlnAsp: 3.243 ± 0.497
1.39GlnGlu: 1.39 ± 0.274
1.575GlnPhe: 1.575 ± 0.418
2.595GlnGly: 2.595 ± 0.38
1.112GlnHis: 1.112 ± 0.309
3.15GlnIle: 3.15 ± 0.551
3.243GlnLys: 3.243 ± 0.534
3.799GlnLeu: 3.799 ± 0.566
0.927GlnMet: 0.927 ± 0.258
2.872GlnAsn: 2.872 ± 0.455
2.409GlnPro: 2.409 ± 0.856
2.687GlnGln: 2.687 ± 0.769
1.575GlnArg: 1.575 ± 0.493
3.892GlnSer: 3.892 ± 0.711
1.946GlnThr: 1.946 ± 0.437
2.872GlnVal: 2.872 ± 0.39
1.019GlnTrp: 1.019 ± 0.267
1.019GlnTyr: 1.019 ± 0.3
0.0GlnXaa: 0.0 ± 0.0
Arg
2.78ArgAla: 2.78 ± 0.829
0.0ArgCys: 0.0 ± 0.0
2.039ArgAsp: 2.039 ± 0.363
2.409ArgGlu: 2.409 ± 0.432
1.483ArgPhe: 1.483 ± 0.358
1.668ArgGly: 1.668 ± 0.539
1.297ArgHis: 1.297 ± 0.344
2.039ArgIle: 2.039 ± 0.347
3.243ArgLys: 3.243 ± 0.506
4.077ArgLeu: 4.077 ± 0.515
0.741ArgMet: 0.741 ± 0.277
2.595ArgAsn: 2.595 ± 0.512
1.112ArgPro: 1.112 ± 0.25
1.668ArgGln: 1.668 ± 0.248
1.575ArgArg: 1.575 ± 0.325
1.668ArgSer: 1.668 ± 0.317
2.502ArgThr: 2.502 ± 0.538
1.668ArgVal: 1.668 ± 0.37
0.834ArgTrp: 0.834 ± 0.272
1.019ArgTyr: 1.019 ± 0.32
0.0ArgXaa: 0.0 ± 0.0
Ser
4.077SerAla: 4.077 ± 1.005
0.278SerCys: 0.278 ± 0.14
5.93SerAsp: 5.93 ± 0.794
3.058SerGlu: 3.058 ± 0.33
2.409SerPhe: 2.409 ± 0.628
5.745SerGly: 5.745 ± 0.855
1.668SerHis: 1.668 ± 0.371
3.243SerIle: 3.243 ± 0.478
4.911SerLys: 4.911 ± 0.758
5.467SerLeu: 5.467 ± 0.704
1.112SerMet: 1.112 ± 0.311
4.818SerAsn: 4.818 ± 0.643
2.039SerPro: 2.039 ± 0.35
3.428SerGln: 3.428 ± 0.678
2.78SerArg: 2.78 ± 0.48
4.633SerSer: 4.633 ± 0.799
3.614SerThr: 3.614 ± 0.616
3.984SerVal: 3.984 ± 0.63
1.483SerTrp: 1.483 ± 0.328
2.687SerTyr: 2.687 ± 0.555
0.0SerXaa: 0.0 ± 0.0
Thr
3.892ThrAla: 3.892 ± 0.644
0.278ThrCys: 0.278 ± 0.198
5.189ThrAsp: 5.189 ± 0.652
2.502ThrGlu: 2.502 ± 0.442
2.039ThrPhe: 2.039 ± 0.368
4.818ThrGly: 4.818 ± 0.665
1.297ThrHis: 1.297 ± 0.445
4.262ThrIle: 4.262 ± 0.633
5.096ThrLys: 5.096 ± 0.464
4.54ThrLeu: 4.54 ± 0.582
1.112ThrMet: 1.112 ± 0.298
4.633ThrAsn: 4.633 ± 0.548
2.965ThrPro: 2.965 ± 0.638
1.668ThrGln: 1.668 ± 0.322
1.946ThrArg: 1.946 ± 0.323
3.892ThrSer: 3.892 ± 0.613
4.17ThrThr: 4.17 ± 0.78
2.595ThrVal: 2.595 ± 0.476
0.834ThrTrp: 0.834 ± 0.208
2.595ThrTyr: 2.595 ± 0.805
0.0ThrXaa: 0.0 ± 0.0
Val
5.096ValAla: 5.096 ± 0.988
0.371ValCys: 0.371 ± 0.231
4.911ValAsp: 4.911 ± 0.618
3.428ValGlu: 3.428 ± 0.446
2.224ValPhe: 2.224 ± 0.452
3.984ValGly: 3.984 ± 0.817
1.205ValHis: 1.205 ± 0.268
3.706ValIle: 3.706 ± 0.427
4.448ValLys: 4.448 ± 0.515
5.096ValLeu: 5.096 ± 0.694
1.483ValMet: 1.483 ± 0.283
4.448ValAsn: 4.448 ± 0.654
1.668ValPro: 1.668 ± 0.391
2.595ValGln: 2.595 ± 0.474
1.483ValArg: 1.483 ± 0.362
4.726ValSer: 4.726 ± 0.595
3.243ValThr: 3.243 ± 0.584
4.262ValVal: 4.262 ± 0.746
0.556ValTrp: 0.556 ± 0.254
1.946ValTyr: 1.946 ± 0.396
0.0ValXaa: 0.0 ± 0.0
Trp
0.556TrpAla: 0.556 ± 0.238
0.278TrpCys: 0.278 ± 0.153
0.834TrpAsp: 0.834 ± 0.303
0.927TrpGlu: 0.927 ± 0.302
1.112TrpPhe: 1.112 ± 0.269
0.556TrpGly: 0.556 ± 0.173
0.463TrpHis: 0.463 ± 0.155
0.834TrpIle: 0.834 ± 0.242
1.575TrpLys: 1.575 ± 0.527
1.297TrpLeu: 1.297 ± 0.41
0.463TrpMet: 0.463 ± 0.161
1.019TrpAsn: 1.019 ± 0.268
0.093TrpPro: 0.093 ± 0.101
1.205TrpGln: 1.205 ± 0.271
0.834TrpArg: 0.834 ± 0.278
0.649TrpSer: 0.649 ± 0.261
1.019TrpThr: 1.019 ± 0.35
0.649TrpVal: 0.649 ± 0.234
0.0TrpTrp: 0.0 ± 0.0
0.463TrpTyr: 0.463 ± 0.149
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.409TyrAla: 2.409 ± 0.408
0.371TyrCys: 0.371 ± 0.167
3.336TyrAsp: 3.336 ± 0.667
1.668TyrGlu: 1.668 ± 0.354
1.761TyrPhe: 1.761 ± 0.405
1.946TyrGly: 1.946 ± 0.455
0.278TyrHis: 0.278 ± 0.156
2.039TyrIle: 2.039 ± 0.389
3.614TyrLys: 3.614 ± 0.55
3.614TyrLeu: 3.614 ± 0.647
0.741TyrMet: 0.741 ± 0.263
2.039TyrAsn: 2.039 ± 0.502
1.019TyrPro: 1.019 ± 0.293
2.687TyrGln: 2.687 ± 0.394
1.019TyrArg: 1.019 ± 0.302
2.872TyrSer: 2.872 ± 0.491
2.687TyrThr: 2.687 ± 0.598
1.668TyrVal: 1.668 ± 0.415
0.463TyrTrp: 0.463 ± 0.218
1.205TyrTyr: 1.205 ± 0.355
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 43 proteins (10793 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski