Amino acid dipeptide frequency for Lactococcus phage Dub35A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would appear with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
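The counting described above can be sketched in Python. This is a minimal illustration, not the code used by Proteome-pI: the function names, the one-letter input alphabet, and the choice to normalize by the total number of overlapping pairs are all assumptions made here.

```python
from collections import Counter

# The 20 standard residues (one-letter codes); any other symbol is mapped to X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: frequencies are normalized by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def expected_uniform_permille(n_letters=20):
    """Frequency of each dipeptide if all n_letters**2 pairs were equally likely."""
    return 1000.0 / n_letters ** 2


def classify(value_permille, n_letters=20):
    """Label a value as over- or underrepresented relative to the uniform expectation."""
    expected = expected_uniform_permille(n_letters)
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"
```

With 20 letters the uniform expectation is 2.5‰, so under this simple rule a value such as AlaAla (3.195‰) would count as overrepresented and AlaCys (0.752‰) as underrepresented. Whether the page uses exactly this threshold for its colouring is not stated in the text.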

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
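As a small worked example of the unit conversion, the helpers below turn a per mille value into a plain fraction or a percentage. The function names are hypothetical and only serve this illustration.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a fraction (multiply by 10^-3)."""
    return value_permille / 1000.0


def permille_to_percent(value_permille):
    """Convert per mille to percent (1 percent = 10 per mille)."""
    return value_permille / 10.0
```

For instance, the AlaAla value of 3.195‰ corresponds to a fraction of about 0.003195, and the uniform expectation of 2.5‰ corresponds to 0.25%.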

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.195 ± 0.953
AlaCys: 0.752 ± 0.246
AlaAsp: 4.323 ± 0.727
AlaGlu: 3.195 ± 0.525
AlaPhe: 2.631 ± 0.441
AlaGly: 5.075 ± 1.102
AlaHis: 1.128 ± 0.408
AlaIle: 5.827 ± 1.058
AlaLys: 5.075 ± 0.514
AlaLeu: 5.545 ± 0.768
AlaMet: 1.786 ± 0.335
AlaAsn: 3.477 ± 0.587
AlaPro: 1.692 ± 0.34
AlaGln: 2.161 ± 0.427
AlaArg: 2.255 ± 0.529
AlaSer: 3.947 ± 0.756
AlaThr: 4.323 ± 0.598
AlaVal: 4.135 ± 0.711
AlaTrp: 1.504 ± 0.334
AlaTyr: 1.41 ± 0.449
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.282 ± 0.139
CysCys: 0.094 ± 0.087
CysAsp: 1.316 ± 0.375
CysGlu: 0.188 ± 0.135
CysPhe: 0.282 ± 0.198
CysGly: 0.658 ± 0.225
CysHis: 0.282 ± 0.17
CysIle: 0.188 ± 0.141
CysLys: 0.47 ± 0.197
CysLeu: 0.282 ± 0.172
CysMet: 0.0 ± 0.0
CysAsn: 0.188 ± 0.117
CysPro: 0.282 ± 0.155
CysGln: 0.564 ± 0.214
CysArg: 0.564 ± 0.292
CysSer: 0.376 ± 0.169
CysThr: 0.188 ± 0.124
CysVal: 0.094 ± 0.095
CysTrp: 0.0 ± 0.0
CysTyr: 0.282 ± 0.154
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.289 ± 0.593
AspCys: 0.282 ± 0.144
AspAsp: 3.571 ± 0.771
AspGlu: 5.451 ± 0.807
AspPhe: 3.665 ± 0.472
AspGly: 4.417 ± 0.665
AspHis: 0.94 ± 0.238
AspIle: 4.323 ± 0.635
AspLys: 4.041 ± 0.651
AspLeu: 4.605 ± 0.779
AspMet: 1.88 ± 0.376
AspAsn: 4.135 ± 0.567
AspPro: 1.692 ± 0.476
AspGln: 1.598 ± 0.41
AspArg: 2.725 ± 0.479
AspSer: 4.041 ± 0.519
AspThr: 3.289 ± 0.602
AspVal: 3.853 ± 0.703
AspTrp: 1.222 ± 0.305
AspTyr: 2.819 ± 0.603
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.417 ± 0.599
GluCys: 0.564 ± 0.24
GluAsp: 3.759 ± 0.903
GluGlu: 4.323 ± 0.861
GluPhe: 2.725 ± 0.546
GluGly: 2.725 ± 0.522
GluHis: 1.41 ± 0.421
GluIle: 5.451 ± 0.64
GluLys: 5.733 ± 1.033
GluLeu: 6.39 ± 0.932
GluMet: 1.598 ± 0.372
GluAsn: 3.477 ± 0.656
GluPro: 2.161 ± 0.459
GluGln: 2.913 ± 0.652
GluArg: 3.289 ± 0.671
GluSer: 3.853 ± 0.684
GluThr: 3.571 ± 0.607
GluVal: 4.323 ± 0.732
GluTrp: 1.41 ± 0.454
GluTyr: 1.973 ± 0.403
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.819 ± 0.517
PheCys: 0.094 ± 0.09
PheAsp: 3.007 ± 0.387
PheGlu: 3.571 ± 0.549
PhePhe: 2.067 ± 0.411
PheGly: 3.289 ± 0.517
PheHis: 0.658 ± 0.281
PheIle: 1.786 ± 0.421
PheLys: 4.041 ± 0.541
PheLeu: 3.195 ± 0.713
PheMet: 1.41 ± 0.376
PheAsn: 2.255 ± 0.466
PhePro: 1.316 ± 0.351
PheGln: 1.034 ± 0.261
PheArg: 2.067 ± 0.535
PheSer: 2.725 ± 0.805
PheThr: 3.383 ± 0.595
PheVal: 1.88 ± 0.523
PheTrp: 0.658 ± 0.283
PheTyr: 1.598 ± 0.357
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.195 ± 0.705
GlyCys: 0.376 ± 0.244
GlyAsp: 3.665 ± 0.576
GlyGlu: 3.289 ± 0.587
GlyPhe: 4.041 ± 0.578
GlyGly: 4.511 ± 0.88
GlyHis: 0.846 ± 0.288
GlyIle: 6.86 ± 1.353
GlyLys: 5.451 ± 0.758
GlyLeu: 4.699 ± 0.584
GlyMet: 1.41 ± 0.43
GlyAsn: 4.135 ± 0.638
GlyPro: 0.846 ± 0.372
GlyGln: 2.913 ± 0.665
GlyArg: 1.504 ± 0.452
GlySer: 4.041 ± 0.797
GlyThr: 5.451 ± 0.624
GlyVal: 3.759 ± 0.535
GlyTrp: 1.316 ± 0.407
GlyTyr: 3.571 ± 0.559
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.94 ± 0.259
HisCys: 0.0 ± 0.0
HisAsp: 0.94 ± 0.324
HisGlu: 0.94 ± 0.265
HisPhe: 0.376 ± 0.193
HisGly: 1.128 ± 0.428
HisHis: 0.282 ± 0.152
HisIle: 0.94 ± 0.308
HisLys: 1.316 ± 0.342
HisLeu: 0.94 ± 0.306
HisMet: 0.47 ± 0.19
HisAsn: 0.752 ± 0.229
HisPro: 0.47 ± 0.203
HisGln: 0.376 ± 0.2
HisArg: 0.658 ± 0.229
HisSer: 1.128 ± 0.383
HisThr: 0.658 ± 0.243
HisVal: 0.846 ± 0.325
HisTrp: 0.094 ± 0.09
HisTyr: 0.94 ± 0.298
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.981 ± 0.749
IleCys: 0.282 ± 0.163
IleAsp: 5.733 ± 0.534
IleGlu: 5.545 ± 0.963
IlePhe: 2.255 ± 0.481
IleGly: 4.135 ± 0.674
IleHis: 1.222 ± 0.45
IleIle: 5.263 ± 0.634
IleLys: 5.92 ± 0.77
IleLeu: 5.733 ± 0.692
IleMet: 1.973 ± 0.393
IleAsn: 3.947 ± 0.507
IlePro: 3.101 ± 0.398
IleGln: 3.195 ± 0.505
IleArg: 2.725 ± 0.451
IleSer: 7.612 ± 1.48
IleThr: 4.229 ± 0.639
IleVal: 3.477 ± 0.471
IleTrp: 0.282 ± 0.157
IleTyr: 3.007 ± 0.734
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.108 ± 0.887
LysCys: 0.47 ± 0.208
LysAsp: 3.665 ± 0.641
LysGlu: 5.169 ± 0.72
LysPhe: 2.443 ± 0.5
LysGly: 5.075 ± 0.641
LysHis: 0.752 ± 0.268
LysIle: 6.296 ± 0.812
LysLys: 8.458 ± 1.32
LysLeu: 7.706 ± 0.901
LysMet: 2.161 ± 0.429
LysAsn: 6.672 ± 0.777
LysPro: 3.195 ± 0.506
LysGln: 3.101 ± 0.451
LysArg: 3.289 ± 0.636
LysSer: 5.827 ± 0.611
LysThr: 5.639 ± 0.985
LysVal: 3.759 ± 0.783
LysTrp: 1.41 ± 0.422
LysTyr: 3.007 ± 0.637
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.639 ± 0.679
LeuCys: 0.47 ± 0.224
LeuAsp: 4.417 ± 0.521
LeuGlu: 5.639 ± 0.765
LeuPhe: 3.759 ± 0.597
LeuGly: 4.981 ± 0.741
LeuHis: 0.376 ± 0.208
LeuIle: 4.135 ± 0.554
LeuLys: 9.022 ± 0.991
LeuLeu: 5.263 ± 0.835
LeuMet: 1.316 ± 0.315
LeuAsn: 6.296 ± 0.834
LeuPro: 3.853 ± 0.529
LeuGln: 2.725 ± 0.601
LeuArg: 3.101 ± 0.6
LeuSer: 6.766 ± 0.815
LeuThr: 5.263 ± 0.636
LeuVal: 5.075 ± 0.635
LeuTrp: 1.41 ± 0.416
LeuTyr: 1.973 ± 0.377
LeuXaa: 0.094 ± 0.099
Met
MetAla: 1.786 ± 0.364
MetCys: 0.47 ± 0.233
MetAsp: 1.504 ± 0.301
MetGlu: 1.692 ± 0.355
MetPhe: 0.47 ± 0.23
MetGly: 1.222 ± 0.44
MetHis: 0.094 ± 0.106
MetIle: 1.504 ± 0.345
MetLys: 2.537 ± 0.527
MetLeu: 1.786 ± 0.428
MetMet: 0.658 ± 0.235
MetAsn: 1.41 ± 0.352
MetPro: 0.752 ± 0.334
MetGln: 0.752 ± 0.32
MetArg: 1.222 ± 0.305
MetSer: 2.161 ± 0.446
MetThr: 1.973 ± 0.415
MetVal: 0.752 ± 0.299
MetTrp: 0.188 ± 0.115
MetTyr: 0.658 ± 0.219
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.383 ± 0.612
AsnCys: 0.282 ± 0.166
AsnAsp: 3.195 ± 0.511
AsnGlu: 2.537 ± 0.46
AsnPhe: 2.631 ± 0.501
AsnGly: 5.357 ± 0.851
AsnHis: 0.752 ± 0.284
AsnIle: 4.605 ± 0.705
AsnLys: 3.853 ± 0.542
AsnLeu: 5.639 ± 0.89
AsnMet: 1.128 ± 0.258
AsnAsn: 2.819 ± 0.516
AsnPro: 3.007 ± 0.48
AsnGln: 3.007 ± 0.499
AsnArg: 1.973 ± 0.387
AsnSer: 4.699 ± 0.781
AsnThr: 2.349 ± 0.55
AsnVal: 5.075 ± 0.697
AsnTrp: 0.94 ± 0.288
AsnTyr: 2.537 ± 0.666
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.846 ± 0.289
ProCys: 0.094 ± 0.115
ProAsp: 2.067 ± 0.52
ProGlu: 2.819 ± 0.636
ProPhe: 1.598 ± 0.35
ProGly: 1.128 ± 0.304
ProHis: 0.376 ± 0.195
ProIle: 2.349 ± 0.37
ProLys: 2.819 ± 0.571
ProLeu: 3.571 ± 0.529
ProMet: 1.034 ± 0.289
ProAsn: 2.443 ± 0.539
ProPro: 0.846 ± 0.246
ProGln: 1.786 ± 0.392
ProArg: 1.128 ± 0.356
ProSer: 2.255 ± 0.399
ProThr: 1.973 ± 0.373
ProVal: 2.349 ± 0.525
ProTrp: 0.376 ± 0.177
ProTyr: 0.752 ± 0.221
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.417 ± 0.572
GlnCys: 0.094 ± 0.087
GlnAsp: 1.128 ± 0.328
GlnGlu: 3.289 ± 0.688
GlnPhe: 1.598 ± 0.433
GlnGly: 1.692 ± 0.458
GlnHis: 0.47 ± 0.212
GlnIle: 2.537 ± 0.439
GlnLys: 3.195 ± 0.573
GlnLeu: 3.101 ± 0.476
GlnMet: 0.47 ± 0.25
GlnAsn: 1.88 ± 0.342
GlnPro: 1.222 ± 0.3
GlnGln: 1.692 ± 0.406
GlnArg: 1.316 ± 0.336
GlnSer: 3.007 ± 0.549
GlnThr: 2.349 ± 0.529
GlnVal: 2.255 ± 0.399
GlnTrp: 0.752 ± 0.217
GlnTyr: 1.88 ± 0.44
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.349 ± 0.411
ArgCys: 0.752 ± 0.324
ArgAsp: 2.443 ± 0.564
ArgGlu: 1.88 ± 0.411
ArgPhe: 2.255 ± 0.52
ArgGly: 1.316 ± 0.344
ArgHis: 0.564 ± 0.235
ArgIle: 3.101 ± 0.615
ArgLys: 3.853 ± 0.604
ArgLeu: 3.195 ± 0.549
ArgMet: 0.658 ± 0.259
ArgAsn: 2.161 ± 0.478
ArgPro: 1.128 ± 0.373
ArgGln: 1.504 ± 0.41
ArgArg: 1.973 ± 0.439
ArgSer: 2.067 ± 0.558
ArgThr: 2.161 ± 0.514
ArgVal: 2.067 ± 0.422
ArgTrp: 0.282 ± 0.166
ArgTyr: 1.786 ± 0.493
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.357 ± 1.412
SerCys: 0.282 ± 0.137
SerAsp: 4.323 ± 0.72
SerGlu: 4.041 ± 0.601
SerPhe: 2.443 ± 0.471
SerGly: 6.672 ± 1.156
SerHis: 1.222 ± 0.323
SerIle: 5.451 ± 0.748
SerLys: 5.451 ± 0.608
SerLeu: 6.766 ± 0.892
SerMet: 2.067 ± 0.363
SerAsn: 4.981 ± 0.754
SerPro: 1.504 ± 0.396
SerGln: 1.88 ± 0.444
SerArg: 1.88 ± 0.406
SerSer: 5.827 ± 0.807
SerThr: 4.699 ± 0.713
SerVal: 4.323 ± 0.47
SerTrp: 0.846 ± 0.292
SerTyr: 2.443 ± 0.514
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.041 ± 0.655
ThrCys: 0.188 ± 0.141
ThrAsp: 3.853 ± 0.629
ThrGlu: 3.853 ± 0.59
ThrPhe: 2.067 ± 0.409
ThrGly: 5.639 ± 0.999
ThrHis: 1.222 ± 0.339
ThrIle: 5.263 ± 0.697
ThrLys: 4.511 ± 0.732
ThrLeu: 5.451 ± 0.76
ThrMet: 0.94 ± 0.329
ThrAsn: 2.349 ± 0.436
ThrPro: 2.255 ± 0.631
ThrGln: 2.913 ± 0.521
ThrArg: 1.128 ± 0.326
ThrSer: 4.417 ± 0.8
ThrThr: 4.135 ± 0.683
ThrVal: 5.639 ± 0.703
ThrTrp: 0.564 ± 0.205
ThrTyr: 1.973 ± 0.605
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.759 ± 0.744
ValCys: 0.47 ± 0.261
ValAsp: 4.323 ± 0.692
ValGlu: 4.793 ± 0.719
ValPhe: 2.913 ± 0.49
ValGly: 3.853 ± 0.566
ValHis: 1.034 ± 0.324
ValIle: 5.169 ± 0.743
ValLys: 4.417 ± 0.874
ValLeu: 4.041 ± 0.69
ValMet: 1.786 ± 0.308
ValAsn: 3.289 ± 0.558
ValPro: 1.786 ± 0.483
ValGln: 1.786 ± 0.338
ValArg: 2.349 ± 0.437
ValSer: 4.229 ± 0.722
ValThr: 3.947 ± 0.668
ValVal: 4.699 ± 0.719
ValTrp: 0.376 ± 0.182
ValTyr: 2.255 ± 0.457
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.658 ± 0.317
TrpCys: 0.376 ± 0.171
TrpAsp: 1.504 ± 0.373
TrpGlu: 1.034 ± 0.243
TrpPhe: 0.94 ± 0.349
TrpGly: 0.752 ± 0.368
TrpHis: 0.0 ± 0.0
TrpIle: 0.846 ± 0.291
TrpLys: 0.94 ± 0.354
TrpLeu: 1.504 ± 0.406
TrpMet: 0.188 ± 0.137
TrpAsn: 0.94 ± 0.366
TrpPro: 0.188 ± 0.143
TrpGln: 0.658 ± 0.198
TrpArg: 0.658 ± 0.204
TrpSer: 0.846 ± 0.235
TrpThr: 0.658 ± 0.227
TrpVal: 1.034 ± 0.263
TrpTrp: 0.188 ± 0.141
TrpTyr: 0.376 ± 0.182
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.973 ± 0.488
TyrCys: 0.188 ± 0.129
TyrAsp: 3.101 ± 0.603
TyrGlu: 2.819 ± 0.655
TyrPhe: 1.598 ± 0.351
TyrGly: 2.255 ± 0.498
TyrHis: 0.564 ± 0.225
TyrIle: 2.725 ± 0.439
TyrLys: 3.101 ± 0.507
TyrLeu: 2.255 ± 0.527
TyrMet: 0.564 ± 0.224
TyrAsn: 2.161 ± 0.529
TyrPro: 1.316 ± 0.41
TyrGln: 1.786 ± 0.389
TyrArg: 1.692 ± 0.404
TyrSer: 2.631 ± 0.492
TyrThr: 2.161 ± 0.532
TyrVal: 1.88 ± 0.325
TyrTrp: 0.47 ± 0.171
TyrTyr: 1.222 ± 0.333
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.094 ± 0.099
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (10642 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
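A protein-level bootstrap of this kind can be sketched as follows. This is an illustrative reconstruction, not the database's actual procedure: the function names, the fixed random seed, the use of the standard deviation across replicates as the error, and the normalization by the number of pairs in each resample are assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so every replicate
    contains as many proteins as the original set.
    """
    rng = random.Random(seed)
    # Pre-compute (hits, number of pairs) per protein once.
    per_protein = [(pair_counts(p)[dipeptide], max(len(p) - 1, 0)) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(n for _, n in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much a frequency depends on which proteins happen to be in the proteome.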

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski