Amino acid dipepetide frequency for Lactococcus phage BM13

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.761AlaAla: 4.761 ± 0.939
0.104AlaCys: 0.104 ± 0.101
4.244AlaAsp: 4.244 ± 0.653
3.312AlaGlu: 3.312 ± 0.647
3.312AlaPhe: 3.312 ± 0.494
3.519AlaGly: 3.519 ± 1.054
1.346AlaHis: 1.346 ± 0.347
5.9AlaIle: 5.9 ± 1.23
5.175AlaLys: 5.175 ± 0.691
4.968AlaLeu: 4.968 ± 0.597
2.174AlaMet: 2.174 ± 0.589
3.933AlaAsn: 3.933 ± 0.525
1.346AlaPro: 1.346 ± 0.451
2.795AlaGln: 2.795 ± 0.501
2.484AlaArg: 2.484 ± 0.477
4.865AlaSer: 4.865 ± 0.833
4.554AlaThr: 4.554 ± 1.257
3.83AlaVal: 3.83 ± 0.847
0.621AlaTrp: 0.621 ± 0.218
2.07AlaTyr: 2.07 ± 0.402
0.0AlaXaa: 0.0 ± 0.0
Cys
0.104CysAla: 0.104 ± 0.117
0.207CysCys: 0.207 ± 0.15
0.311CysAsp: 0.311 ± 0.181
0.828CysGlu: 0.828 ± 0.239
0.311CysPhe: 0.311 ± 0.185
0.518CysGly: 0.518 ± 0.198
0.0CysHis: 0.0 ± 0.0
0.104CysIle: 0.104 ± 0.115
0.311CysLys: 0.311 ± 0.18
0.414CysLeu: 0.414 ± 0.265
0.0CysMet: 0.0 ± 0.0
0.207CysAsn: 0.207 ± 0.151
0.104CysPro: 0.104 ± 0.104
0.0CysGln: 0.0 ± 0.0
0.207CysArg: 0.207 ± 0.135
0.311CysSer: 0.311 ± 0.186
0.311CysThr: 0.311 ± 0.163
0.207CysVal: 0.207 ± 0.133
0.0CysTrp: 0.0 ± 0.0
0.414CysTyr: 0.414 ± 0.18
0.0CysXaa: 0.0 ± 0.0
Asp
4.037AspAla: 4.037 ± 0.857
0.518AspCys: 0.518 ± 0.257
4.968AspAsp: 4.968 ± 1.225
4.865AspGlu: 4.865 ± 0.876
3.416AspPhe: 3.416 ± 0.507
4.451AspGly: 4.451 ± 0.774
0.518AspHis: 0.518 ± 0.217
3.83AspIle: 3.83 ± 0.631
6.625AspLys: 6.625 ± 1.233
5.589AspLeu: 5.589 ± 0.748
2.277AspMet: 2.277 ± 0.506
3.623AspAsn: 3.623 ± 0.63
1.553AspPro: 1.553 ± 0.385
1.035AspGln: 1.035 ± 0.36
2.07AspArg: 2.07 ± 0.506
3.416AspSer: 3.416 ± 0.652
2.381AspThr: 2.381 ± 0.449
2.795AspVal: 2.795 ± 0.501
1.139AspTrp: 1.139 ± 0.303
2.898AspTyr: 2.898 ± 0.61
0.0AspXaa: 0.0 ± 0.0
Glu
4.554GluAla: 4.554 ± 0.708
0.414GluCys: 0.414 ± 0.207
3.416GluAsp: 3.416 ± 0.804
5.382GluGlu: 5.382 ± 1.164
3.726GluPhe: 3.726 ± 0.591
2.381GluGly: 2.381 ± 0.374
1.035GluHis: 1.035 ± 0.364
6.004GluIle: 6.004 ± 0.885
6.107GluLys: 6.107 ± 1.044
7.556GluLeu: 7.556 ± 1.037
1.449GluMet: 1.449 ± 0.421
5.175GluAsn: 5.175 ± 0.744
1.346GluPro: 1.346 ± 0.459
3.83GluGln: 3.83 ± 0.764
2.588GluArg: 2.588 ± 0.661
3.209GluSer: 3.209 ± 0.655
4.658GluThr: 4.658 ± 0.642
6.211GluVal: 6.211 ± 0.8
1.035GluTrp: 1.035 ± 0.289
2.277GluTyr: 2.277 ± 0.454
0.0GluXaa: 0.0 ± 0.0
Phe
2.691PheAla: 2.691 ± 0.485
0.104PheCys: 0.104 ± 0.097
2.795PheAsp: 2.795 ± 0.654
3.519PheGlu: 3.519 ± 0.693
3.002PhePhe: 3.002 ± 0.792
3.623PheGly: 3.623 ± 0.567
0.414PheHis: 0.414 ± 0.204
2.898PheIle: 2.898 ± 0.582
3.726PheLys: 3.726 ± 0.704
3.002PheLeu: 3.002 ± 0.56
1.242PheMet: 1.242 ± 0.377
4.554PheAsn: 4.554 ± 0.729
1.346PhePro: 1.346 ± 0.357
1.863PheGln: 1.863 ± 0.434
1.553PheArg: 1.553 ± 0.386
3.312PheSer: 3.312 ± 0.671
3.002PheThr: 3.002 ± 0.61
2.484PheVal: 2.484 ± 0.487
0.207PheTrp: 0.207 ± 0.143
2.381PheTyr: 2.381 ± 0.5
0.0PheXaa: 0.0 ± 0.0
Gly
3.519GlyAla: 3.519 ± 0.721
0.207GlyCys: 0.207 ± 0.134
3.209GlyAsp: 3.209 ± 0.727
4.865GlyGlu: 4.865 ± 0.914
4.14GlyPhe: 4.14 ± 0.9
6.004GlyGly: 6.004 ± 0.975
0.414GlyHis: 0.414 ± 0.189
6.004GlyIle: 6.004 ± 1.074
6.728GlyLys: 6.728 ± 0.859
4.968GlyLeu: 4.968 ± 0.71
1.449GlyMet: 1.449 ± 0.346
3.519GlyAsn: 3.519 ± 0.894
0.725GlyPro: 0.725 ± 0.266
2.588GlyGln: 2.588 ± 0.624
2.898GlyArg: 2.898 ± 0.539
3.623GlySer: 3.623 ± 0.667
5.072GlyThr: 5.072 ± 0.734
3.105GlyVal: 3.105 ± 0.513
1.035GlyTrp: 1.035 ± 0.34
2.381GlyTyr: 2.381 ± 0.435
0.0GlyXaa: 0.0 ± 0.0
His
1.139HisAla: 1.139 ± 0.38
0.0HisCys: 0.0 ± 0.0
0.828HisAsp: 0.828 ± 0.3
1.553HisGlu: 1.553 ± 0.417
0.207HisPhe: 0.207 ± 0.124
0.828HisGly: 0.828 ± 0.394
0.311HisHis: 0.311 ± 0.282
0.932HisIle: 0.932 ± 0.304
1.242HisLys: 1.242 ± 0.359
0.828HisLeu: 0.828 ± 0.304
0.414HisMet: 0.414 ± 0.198
0.932HisAsn: 0.932 ± 0.29
0.414HisPro: 0.414 ± 0.201
0.311HisGln: 0.311 ± 0.216
0.311HisArg: 0.311 ± 0.163
0.932HisSer: 0.932 ± 0.325
0.414HisThr: 0.414 ± 0.202
0.621HisVal: 0.621 ± 0.241
0.0HisTrp: 0.0 ± 0.0
0.621HisTyr: 0.621 ± 0.272
0.0HisXaa: 0.0 ± 0.0
Ile
5.382IleAla: 5.382 ± 0.783
0.311IleCys: 0.311 ± 0.193
5.797IleAsp: 5.797 ± 0.738
5.693IleGlu: 5.693 ± 0.92
3.105IlePhe: 3.105 ± 0.657
5.175IleGly: 5.175 ± 1.343
1.346IleHis: 1.346 ± 0.529
4.658IleIle: 4.658 ± 0.781
6.107IleLys: 6.107 ± 0.926
3.312IleLeu: 3.312 ± 0.466
1.346IleMet: 1.346 ± 0.332
4.14IleAsn: 4.14 ± 0.65
2.795IlePro: 2.795 ± 0.524
3.002IleGln: 3.002 ± 0.498
2.484IleArg: 2.484 ± 0.536
5.175IleSer: 5.175 ± 0.803
5.175IleThr: 5.175 ± 0.751
3.726IleVal: 3.726 ± 0.626
0.621IleTrp: 0.621 ± 0.212
2.277IleTyr: 2.277 ± 0.418
0.0IleXaa: 0.0 ± 0.0
Lys
5.589LysAla: 5.589 ± 0.936
0.207LysCys: 0.207 ± 0.143
5.797LysAsp: 5.797 ± 0.793
7.556LysGlu: 7.556 ± 0.989
3.416LysPhe: 3.416 ± 0.659
5.589LysGly: 5.589 ± 0.667
2.07LysHis: 2.07 ± 0.572
4.658LysIle: 4.658 ± 0.654
9.419LysLys: 9.419 ± 1.166
7.97LysLeu: 7.97 ± 1.19
2.588LysMet: 2.588 ± 0.464
5.175LysAsn: 5.175 ± 0.737
1.967LysPro: 1.967 ± 0.45
3.209LysGln: 3.209 ± 0.702
3.83LysArg: 3.83 ± 0.738
4.658LysSer: 4.658 ± 0.637
5.072LysThr: 5.072 ± 0.636
5.693LysVal: 5.693 ± 0.988
1.139LysTrp: 1.139 ± 0.374
4.658LysTyr: 4.658 ± 0.67
0.0LysXaa: 0.0 ± 0.0
Leu
5.382LeuAla: 5.382 ± 0.604
0.311LeuCys: 0.311 ± 0.18
6.211LeuAsp: 6.211 ± 0.912
6.004LeuGlu: 6.004 ± 1.132
2.691LeuPhe: 2.691 ± 0.466
4.968LeuGly: 4.968 ± 0.674
0.414LeuHis: 0.414 ± 0.205
4.865LeuIle: 4.865 ± 0.461
7.556LeuLys: 7.556 ± 1.005
4.347LeuLeu: 4.347 ± 0.68
1.449LeuMet: 1.449 ± 0.404
6.004LeuAsn: 6.004 ± 0.793
2.277LeuPro: 2.277 ± 0.569
4.865LeuGln: 4.865 ± 0.645
2.795LeuArg: 2.795 ± 0.527
4.865LeuSer: 4.865 ± 0.763
4.761LeuThr: 4.761 ± 0.712
3.726LeuVal: 3.726 ± 0.626
0.828LeuTrp: 0.828 ± 0.323
1.863LeuTyr: 1.863 ± 0.56
0.0LeuXaa: 0.0 ± 0.0
Met
1.242MetAla: 1.242 ± 0.352
0.104MetCys: 0.104 ± 0.099
1.346MetAsp: 1.346 ± 0.365
1.76MetGlu: 1.76 ± 0.391
1.139MetPhe: 1.139 ± 0.414
0.828MetGly: 0.828 ± 0.306
0.104MetHis: 0.104 ± 0.097
1.553MetIle: 1.553 ± 0.338
2.277MetLys: 2.277 ± 0.484
1.449MetLeu: 1.449 ± 0.325
0.311MetMet: 0.311 ± 0.138
2.277MetAsn: 2.277 ± 0.567
1.035MetPro: 1.035 ± 0.306
1.035MetGln: 1.035 ± 0.283
1.139MetArg: 1.139 ± 0.312
0.932MetSer: 0.932 ± 0.301
3.002MetThr: 3.002 ± 0.606
1.346MetVal: 1.346 ± 0.417
0.207MetTrp: 0.207 ± 0.144
0.311MetTyr: 0.311 ± 0.187
0.0MetXaa: 0.0 ± 0.0
Asn
4.968AsnAla: 4.968 ± 1.254
0.518AsnCys: 0.518 ± 0.218
2.691AsnAsp: 2.691 ± 0.485
4.244AsnGlu: 4.244 ± 0.605
3.105AsnPhe: 3.105 ± 0.473
5.072AsnGly: 5.072 ± 0.733
0.828AsnHis: 0.828 ± 0.416
4.451AsnIle: 4.451 ± 0.639
5.175AsnLys: 5.175 ± 0.785
3.933AsnLeu: 3.933 ± 0.601
1.553AsnMet: 1.553 ± 0.302
5.382AsnAsn: 5.382 ± 0.733
2.277AsnPro: 2.277 ± 0.518
4.451AsnGln: 4.451 ± 0.953
2.174AsnArg: 2.174 ± 0.433
3.312AsnSer: 3.312 ± 0.469
2.588AsnThr: 2.588 ± 0.451
4.451AsnVal: 4.451 ± 0.618
1.139AsnTrp: 1.139 ± 0.289
2.898AsnTyr: 2.898 ± 0.911
0.0AsnXaa: 0.0 ± 0.0
Pro
1.346ProAla: 1.346 ± 0.329
0.104ProCys: 0.104 ± 0.093
2.484ProAsp: 2.484 ± 0.511
1.76ProGlu: 1.76 ± 0.465
1.449ProPhe: 1.449 ± 0.45
0.725ProGly: 0.725 ± 0.392
0.621ProHis: 0.621 ± 0.28
1.449ProIle: 1.449 ± 0.355
2.484ProLys: 2.484 ± 0.535
1.967ProLeu: 1.967 ± 0.441
0.621ProMet: 0.621 ± 0.215
1.553ProAsn: 1.553 ± 0.531
0.621ProPro: 0.621 ± 0.241
1.76ProGln: 1.76 ± 0.622
0.518ProArg: 0.518 ± 0.197
1.76ProSer: 1.76 ± 0.46
1.242ProThr: 1.242 ± 0.332
3.002ProVal: 3.002 ± 0.522
0.414ProTrp: 0.414 ± 0.221
0.725ProTyr: 0.725 ± 0.246
0.0ProXaa: 0.0 ± 0.0
Gln
3.933GlnAla: 3.933 ± 0.487
0.104GlnCys: 0.104 ± 0.118
1.035GlnAsp: 1.035 ± 0.332
2.795GlnGlu: 2.795 ± 0.606
2.588GlnPhe: 2.588 ± 0.379
2.691GlnGly: 2.691 ± 0.802
0.828GlnHis: 0.828 ± 0.257
2.795GlnIle: 2.795 ± 0.529
3.105GlnLys: 3.105 ± 0.484
4.554GlnLeu: 4.554 ± 0.662
1.449GlnMet: 1.449 ± 0.368
3.416GlnAsn: 3.416 ± 0.723
1.553GlnPro: 1.553 ± 0.442
4.554GlnGln: 4.554 ± 1.323
1.76GlnArg: 1.76 ± 0.583
2.484GlnSer: 2.484 ± 0.567
2.174GlnThr: 2.174 ± 0.458
3.312GlnVal: 3.312 ± 0.537
0.725GlnTrp: 0.725 ± 0.258
1.449GlnTyr: 1.449 ± 0.328
0.0GlnXaa: 0.0 ± 0.0
Arg
2.07ArgAla: 2.07 ± 0.489
0.414ArgCys: 0.414 ± 0.192
2.07ArgAsp: 2.07 ± 0.512
2.174ArgGlu: 2.174 ± 0.516
1.449ArgPhe: 1.449 ± 0.399
2.588ArgGly: 2.588 ± 0.45
0.0ArgHis: 0.0 ± 0.0
3.416ArgIle: 3.416 ± 0.614
2.795ArgLys: 2.795 ± 0.834
3.416ArgLeu: 3.416 ± 0.589
0.828ArgMet: 0.828 ± 0.329
2.898ArgAsn: 2.898 ± 0.6
0.725ArgPro: 0.725 ± 0.289
1.553ArgGln: 1.553 ± 0.505
1.967ArgArg: 1.967 ± 0.481
1.863ArgSer: 1.863 ± 0.359
2.588ArgThr: 2.588 ± 0.405
1.967ArgVal: 1.967 ± 0.377
0.725ArgTrp: 0.725 ± 0.283
2.174ArgTyr: 2.174 ± 0.407
0.0ArgXaa: 0.0 ± 0.0
Ser
2.898SerAla: 2.898 ± 0.649
0.0SerCys: 0.0 ± 0.0
4.244SerAsp: 4.244 ± 0.762
4.865SerGlu: 4.865 ± 0.733
3.519SerPhe: 3.519 ± 0.733
5.797SerGly: 5.797 ± 0.855
0.104SerHis: 0.104 ± 0.101
4.658SerIle: 4.658 ± 0.785
5.797SerLys: 5.797 ± 0.665
4.451SerLeu: 4.451 ± 0.453
1.035SerMet: 1.035 ± 0.355
3.623SerAsn: 3.623 ± 0.648
1.139SerPro: 1.139 ± 0.3
3.002SerGln: 3.002 ± 0.527
2.484SerArg: 2.484 ± 0.387
3.312SerSer: 3.312 ± 0.646
3.002SerThr: 3.002 ± 0.521
3.002SerVal: 3.002 ± 0.632
0.207SerTrp: 0.207 ± 0.131
2.484SerTyr: 2.484 ± 0.402
0.0SerXaa: 0.0 ± 0.0
Thr
4.244ThrAla: 4.244 ± 1.055
0.414ThrCys: 0.414 ± 0.2
3.726ThrAsp: 3.726 ± 0.692
3.933ThrGlu: 3.933 ± 0.643
2.381ThrPhe: 2.381 ± 0.52
4.865ThrGly: 4.865 ± 0.59
0.725ThrHis: 0.725 ± 0.256
4.968ThrIle: 4.968 ± 0.572
4.554ThrLys: 4.554 ± 0.824
4.865ThrLeu: 4.865 ± 0.693
0.621ThrMet: 0.621 ± 0.244
3.83ThrAsn: 3.83 ± 0.681
2.588ThrPro: 2.588 ± 0.639
2.381ThrGln: 2.381 ± 0.484
2.07ThrArg: 2.07 ± 0.456
3.416ThrSer: 3.416 ± 0.485
3.726ThrThr: 3.726 ± 0.521
3.623ThrVal: 3.623 ± 0.602
0.518ThrTrp: 0.518 ± 0.231
2.898ThrTyr: 2.898 ± 0.796
0.0ThrXaa: 0.0 ± 0.0
Val
4.554ValAla: 4.554 ± 0.811
0.414ValCys: 0.414 ± 0.184
3.623ValAsp: 3.623 ± 0.582
4.761ValGlu: 4.761 ± 0.575
2.277ValPhe: 2.277 ± 0.531
4.037ValGly: 4.037 ± 0.454
0.932ValHis: 0.932 ± 0.312
4.865ValIle: 4.865 ± 0.743
6.004ValLys: 6.004 ± 0.784
3.726ValLeu: 3.726 ± 0.746
1.139ValMet: 1.139 ± 0.514
2.381ValAsn: 2.381 ± 0.436
1.76ValPro: 1.76 ± 0.407
1.656ValGln: 1.656 ± 0.416
2.588ValArg: 2.588 ± 0.53
3.933ValSer: 3.933 ± 0.67
4.244ValThr: 4.244 ± 0.722
4.347ValVal: 4.347 ± 0.702
0.414ValTrp: 0.414 ± 0.228
1.863ValTyr: 1.863 ± 0.39
0.0ValXaa: 0.0 ± 0.0
Trp
0.414TrpAla: 0.414 ± 0.208
0.207TrpCys: 0.207 ± 0.142
0.828TrpAsp: 0.828 ± 0.309
0.518TrpGlu: 0.518 ± 0.197
0.518TrpPhe: 0.518 ± 0.201
0.414TrpGly: 0.414 ± 0.174
0.104TrpHis: 0.104 ± 0.109
1.035TrpIle: 1.035 ± 0.309
1.346TrpLys: 1.346 ± 0.502
1.449TrpLeu: 1.449 ± 0.427
0.311TrpMet: 0.311 ± 0.188
0.621TrpAsn: 0.621 ± 0.289
0.0TrpPro: 0.0 ± 0.0
0.932TrpGln: 0.932 ± 0.339
0.621TrpArg: 0.621 ± 0.249
1.242TrpSer: 1.242 ± 0.333
0.311TrpThr: 0.311 ± 0.163
0.518TrpVal: 0.518 ± 0.175
0.104TrpTrp: 0.104 ± 0.11
0.414TrpTyr: 0.414 ± 0.188
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.484TyrAla: 2.484 ± 0.469
0.207TyrCys: 0.207 ± 0.146
2.381TyrAsp: 2.381 ± 0.577
1.967TyrGlu: 1.967 ± 0.429
2.07TyrPhe: 2.07 ± 0.491
2.381TyrGly: 2.381 ± 0.594
0.725TyrHis: 0.725 ± 0.28
2.588TyrIle: 2.588 ± 0.493
3.726TyrLys: 3.726 ± 0.611
3.416TyrLeu: 3.416 ± 0.649
1.035TyrMet: 1.035 ± 0.31
1.967TyrAsn: 1.967 ± 0.585
1.035TyrPro: 1.035 ± 0.309
2.484TyrGln: 2.484 ± 0.611
1.139TyrArg: 1.139 ± 0.44
3.002TyrSer: 3.002 ± 0.567
2.07TyrThr: 2.07 ± 0.492
1.553TyrVal: 1.553 ± 0.444
0.828TyrTrp: 0.828 ± 0.345
1.76TyrTyr: 1.76 ± 0.451
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (9662 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski