Amino acid dipeptide frequency for Lactobacillus phage CL2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
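The counting step can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function names are hypothetical, sequences are assumed to be given in one-letter code, and normalising by the total number of overlapping dipeptide windows is an assumption, since the exact normalisation used by the database is not stated here.

```python
from collections import Counter


def dipeptide_counts(sequences):
    """Count overlapping adjacent residue pairs in each protein sequence."""
    counts = Counter()
    for seq in sequences:
        # A protein of length n contributes n - 1 overlapping dipeptides.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille (assumed normalisation:
    total number of dipeptide windows across all proteins)."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}
```

For example, `dipeptide_frequencies(["AAC", "AC"])` counts the pairs AA, AC and AC, so AC accounts for two thirds of all dipeptides. Pairs are never counted across protein boundaries.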

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
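The expected value and the red/blue marking can be sketched as below. The function names are hypothetical, and the classification rule (a plain comparison against the uniform expectation, ignoring the error estimate) is an assumption; the page does not state the exact rule used for colouring.

```python
def uniform_expected_per_mille(alphabet_size=20):
    """Expected frequency (per mille) of each dipeptide if all ordered pairs
    were equally likely: 1 / alphabet_size**2."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"    # would be marked in red
    if observed_per_mille < expected_per_mille:
        return "under"   # would be marked in blue
    return "as expected"


expected = uniform_expected_per_mille(20)   # 2.5 per mille, i.e. 0.25%
print(classify(7.93, expected))             # AlaAla value from the table
print(classify(0.171, expected))            # CysCys value from the table
```

With 21 symbols (including Xaa) the same formula gives 1000/441, roughly 2.27‰.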

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
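As a small worked example of the unit conversion, the helpers below (hypothetical names) turn a per mille value into a fraction or a percentage.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille / 1000.0


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value_per_mille / 10.0


# The uniform expectation of 2.5 per mille equals 0.25 percent.
print(per_mille_to_percent(2.5))
# The AlaAla entry of 7.93 per mille is a fraction of about 0.00793.
print(per_mille_to_fraction(7.93))
```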

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.93 ± 1.278
AlaCys: 0.426 ± 0.172
AlaAsp: 6.822 ± 0.786
AlaGlu: 5.202 ± 0.829
AlaPhe: 3.923 ± 0.661
AlaGly: 5.969 ± 0.987
AlaHis: 1.62 ± 0.401
AlaIle: 6.054 ± 0.955
AlaLys: 5.202 ± 0.675
AlaLeu: 5.287 ± 0.596
AlaMet: 2.473 ± 0.568
AlaAsn: 5.543 ± 0.964
AlaPro: 1.705 ± 0.47
AlaGln: 2.558 ± 0.449
AlaArg: 4.093 ± 0.745
AlaSer: 5.031 ± 0.79
AlaThr: 5.713 ± 0.84
AlaVal: 6.054 ± 1.139
AlaTrp: 1.364 ± 0.517
AlaTyr: 2.729 ± 0.601
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.341 ± 0.178
CysCys: 0.171 ± 0.129
CysAsp: 0.426 ± 0.165
CysGlu: 0.512 ± 0.234
CysPhe: 0.256 ± 0.146
CysGly: 0.682 ± 0.302
CysHis: 0.0 ± 0.0
CysIle: 0.426 ± 0.165
CysLys: 0.767 ± 0.261
CysLeu: 0.426 ± 0.231
CysMet: 0.085 ± 0.082
CysAsn: 0.0 ± 0.0
CysPro: 0.512 ± 0.235
CysGln: 0.0 ± 0.0
CysArg: 0.256 ± 0.137
CysSer: 0.341 ± 0.172
CysThr: 0.256 ± 0.133
CysVal: 0.512 ± 0.213
CysTrp: 0.085 ± 0.081
CysTyr: 0.256 ± 0.145
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.434 ± 0.628
AspCys: 0.085 ± 0.08
AspAsp: 5.628 ± 0.844
AspGlu: 4.178 ± 0.895
AspPhe: 2.814 ± 0.537
AspGly: 8.016 ± 1.029
AspHis: 1.45 ± 0.377
AspIle: 3.411 ± 0.555
AspLys: 4.861 ± 1.138
AspLeu: 5.372 ± 0.594
AspMet: 1.705 ± 0.4
AspAsn: 3.24 ± 0.613
AspPro: 2.558 ± 0.478
AspGln: 3.411 ± 0.594
AspArg: 3.24 ± 0.445
AspSer: 4.434 ± 0.719
AspThr: 2.985 ± 0.637
AspVal: 3.752 ± 0.649
AspTrp: 1.023 ± 0.248
AspTyr: 2.814 ± 0.563
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.752 ± 0.631
GluCys: 0.256 ± 0.133
GluAsp: 3.155 ± 0.564
GluGlu: 2.558 ± 0.61
GluPhe: 1.876 ± 0.452
GluGly: 3.667 ± 0.524
GluHis: 1.364 ± 0.287
GluIle: 3.411 ± 0.548
GluLys: 3.667 ± 0.692
GluLeu: 5.202 ± 0.666
GluMet: 1.62 ± 0.452
GluAsn: 2.473 ± 0.53
GluPro: 1.535 ± 0.538
GluGln: 1.791 ± 0.36
GluArg: 2.643 ± 0.408
GluSer: 2.473 ± 0.426
GluThr: 3.07 ± 0.43
GluVal: 3.496 ± 0.618
GluTrp: 0.938 ± 0.234
GluTyr: 2.729 ± 0.662
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.729 ± 0.392
PheCys: 0.341 ± 0.168
PheAsp: 2.814 ± 0.417
PheGlu: 1.62 ± 0.354
PhePhe: 1.876 ± 0.625
PheGly: 2.729 ± 0.528
PheHis: 0.426 ± 0.21
PheIle: 2.132 ± 0.329
PheLys: 2.814 ± 0.445
PheLeu: 2.899 ± 0.582
PheMet: 1.194 ± 0.269
PheAsn: 1.961 ± 0.378
PhePro: 1.279 ± 0.41
PheGln: 0.938 ± 0.265
PheArg: 1.023 ± 0.272
PheSer: 4.178 ± 0.918
PheThr: 3.07 ± 0.47
PheVal: 2.643 ± 0.473
PheTrp: 0.426 ± 0.196
PheTyr: 0.853 ± 0.289
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.543 ± 0.983
GlyCys: 0.341 ± 0.148
GlyAsp: 5.713 ± 1.177
GlyGlu: 2.729 ± 0.486
GlyPhe: 2.729 ± 0.436
GlyGly: 4.349 ± 0.491
GlyHis: 1.279 ± 0.389
GlyIle: 6.566 ± 1.342
GlyLys: 6.992 ± 1.006
GlyLeu: 5.116 ± 0.974
GlyMet: 2.302 ± 0.412
GlyAsn: 3.24 ± 0.571
GlyPro: 1.535 ± 0.274
GlyGln: 2.729 ± 0.452
GlyArg: 2.985 ± 0.519
GlySer: 5.372 ± 0.675
GlyThr: 5.287 ± 0.726
GlyVal: 4.519 ± 0.726
GlyTrp: 1.279 ± 0.328
GlyTyr: 3.667 ± 0.587
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.023 ± 0.346
HisCys: 0.341 ± 0.181
HisAsp: 1.279 ± 0.299
HisGlu: 0.938 ± 0.282
HisPhe: 0.682 ± 0.281
HisGly: 1.364 ± 0.306
HisHis: 0.512 ± 0.245
HisIle: 0.853 ± 0.306
HisLys: 1.535 ± 0.339
HisLeu: 1.535 ± 0.405
HisMet: 0.426 ± 0.184
HisAsn: 0.767 ± 0.24
HisPro: 0.512 ± 0.211
HisGln: 1.023 ± 0.247
HisArg: 0.767 ± 0.261
HisSer: 1.62 ± 0.49
HisThr: 1.023 ± 0.367
HisVal: 0.938 ± 0.315
HisTrp: 0.171 ± 0.104
HisTyr: 0.853 ± 0.295
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.31 ± 0.713
IleCys: 0.256 ± 0.148
IleAsp: 4.861 ± 0.709
IleGlu: 2.729 ± 0.555
IlePhe: 1.961 ± 0.445
IleGly: 3.923 ± 0.601
IleHis: 0.938 ± 0.265
IleIle: 3.326 ± 0.506
IleLys: 3.923 ± 0.605
IleLeu: 2.985 ± 0.504
IleMet: 1.876 ± 0.431
IleAsn: 3.411 ± 0.499
IlePro: 2.473 ± 0.476
IleGln: 2.132 ± 0.435
IleArg: 2.473 ± 0.583
IleSer: 4.69 ± 0.687
IleThr: 4.605 ± 0.584
IleVal: 4.861 ± 0.829
IleTrp: 1.791 ± 0.691
IleTyr: 2.473 ± 0.483
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.395 ± 0.832
LysCys: 0.597 ± 0.243
LysAsp: 3.667 ± 0.684
LysGlu: 3.667 ± 0.61
LysPhe: 2.217 ± 0.474
LysGly: 4.69 ± 1.198
LysHis: 1.194 ± 0.367
LysIle: 4.093 ± 0.422
LysLys: 3.581 ± 0.713
LysLeu: 5.202 ± 0.858
LysMet: 2.558 ± 0.538
LysAsn: 2.899 ± 0.615
LysPro: 3.155 ± 0.56
LysGln: 4.008 ± 0.68
LysArg: 3.923 ± 0.713
LysSer: 4.605 ± 0.666
LysThr: 5.372 ± 0.725
LysVal: 4.008 ± 0.567
LysTrp: 0.853 ± 0.272
LysTyr: 2.047 ± 0.317
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.737 ± 0.718
LeuCys: 0.341 ± 0.164
LeuAsp: 4.434 ± 0.643
LeuGlu: 3.581 ± 0.635
LeuPhe: 2.899 ± 0.462
LeuGly: 4.519 ± 0.681
LeuHis: 1.279 ± 0.351
LeuIle: 5.372 ± 0.675
LeuLys: 5.287 ± 0.829
LeuLeu: 5.202 ± 0.694
LeuMet: 1.364 ± 0.328
LeuAsn: 4.349 ± 0.685
LeuPro: 2.729 ± 0.508
LeuGln: 3.326 ± 0.533
LeuArg: 3.155 ± 0.505
LeuSer: 5.628 ± 0.751
LeuThr: 5.372 ± 0.55
LeuVal: 4.946 ± 0.742
LeuTrp: 1.279 ± 0.533
LeuTyr: 2.302 ± 0.415
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.302 ± 0.547
MetCys: 0.085 ± 0.077
MetAsp: 1.62 ± 0.377
MetGlu: 0.938 ± 0.262
MetPhe: 0.597 ± 0.258
MetGly: 1.535 ± 0.421
MetHis: 0.171 ± 0.115
MetIle: 1.364 ± 0.304
MetLys: 1.876 ± 0.408
MetLeu: 2.558 ± 0.465
MetMet: 0.682 ± 0.31
MetAsn: 1.961 ± 0.387
MetPro: 0.938 ± 0.289
MetGln: 1.535 ± 0.365
MetArg: 1.279 ± 0.333
MetSer: 1.194 ± 0.27
MetThr: 2.388 ± 0.347
MetVal: 1.62 ± 0.334
MetTrp: 0.256 ± 0.209
MetTyr: 1.279 ± 0.344
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.605 ± 0.552
AsnCys: 0.256 ± 0.149
AsnAsp: 4.434 ± 0.678
AsnGlu: 2.558 ± 0.54
AsnPhe: 1.109 ± 0.333
AsnGly: 5.969 ± 0.654
AsnHis: 0.938 ± 0.254
AsnIle: 2.047 ± 0.512
AsnLys: 2.473 ± 0.547
AsnLeu: 3.581 ± 0.453
AsnMet: 1.279 ± 0.419
AsnAsn: 2.132 ± 0.465
AsnPro: 2.047 ± 0.404
AsnGln: 2.388 ± 0.507
AsnArg: 2.558 ± 0.426
AsnSer: 2.558 ± 0.538
AsnThr: 2.132 ± 0.501
AsnVal: 2.899 ± 0.485
AsnTrp: 1.109 ± 0.312
AsnTyr: 2.217 ± 0.611
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.558 ± 0.528
ProCys: 0.085 ± 0.079
ProAsp: 2.558 ± 0.434
ProGlu: 3.155 ± 0.624
ProPhe: 1.279 ± 0.272
ProGly: 1.535 ± 0.319
ProHis: 0.512 ± 0.23
ProIle: 2.388 ± 0.366
ProLys: 3.155 ± 0.559
ProLeu: 1.961 ± 0.423
ProMet: 0.597 ± 0.219
ProAsn: 1.535 ± 0.371
ProPro: 0.682 ± 0.299
ProGln: 1.364 ± 0.349
ProArg: 1.023 ± 0.325
ProSer: 2.558 ± 0.524
ProThr: 2.047 ± 0.506
ProVal: 2.814 ± 0.439
ProTrp: 0.938 ± 0.229
ProTyr: 1.279 ± 0.283
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.031 ± 0.652
GlnCys: 0.341 ± 0.147
GlnAsp: 1.876 ± 0.501
GlnGlu: 2.217 ± 0.511
GlnPhe: 1.279 ± 0.373
GlnGly: 2.302 ± 0.482
GlnHis: 0.853 ± 0.307
GlnIle: 2.388 ± 0.513
GlnLys: 2.302 ± 0.518
GlnLeu: 3.667 ± 0.558
GlnMet: 0.938 ± 0.279
GlnAsn: 1.705 ± 0.382
GlnPro: 1.45 ± 0.34
GlnGln: 2.302 ± 0.438
GlnArg: 1.45 ± 0.372
GlnSer: 2.899 ± 0.393
GlnThr: 3.326 ± 0.542
GlnVal: 3.667 ± 0.492
GlnTrp: 1.279 ± 0.239
GlnTyr: 1.62 ± 0.387
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.814 ± 0.506
ArgCys: 0.682 ± 0.291
ArgAsp: 1.705 ± 0.358
ArgGlu: 2.473 ± 0.499
ArgPhe: 2.558 ± 0.554
ArgGly: 2.729 ± 0.527
ArgHis: 0.682 ± 0.31
ArgIle: 2.558 ± 0.583
ArgLys: 3.326 ± 0.601
ArgLeu: 4.605 ± 0.65
ArgMet: 1.109 ± 0.318
ArgAsn: 2.302 ± 0.347
ArgPro: 1.279 ± 0.361
ArgGln: 1.791 ± 0.419
ArgArg: 2.132 ± 0.475
ArgSer: 2.047 ± 0.341
ArgThr: 2.643 ± 0.498
ArgVal: 2.302 ± 0.493
ArgTrp: 0.426 ± 0.197
ArgTyr: 2.217 ± 0.554
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.287 ± 1.025
SerCys: 0.256 ± 0.2
SerAsp: 5.202 ± 0.765
SerGlu: 3.411 ± 0.761
SerPhe: 3.155 ± 0.609
SerGly: 6.992 ± 0.977
SerHis: 1.023 ± 0.304
SerIle: 3.752 ± 0.522
SerLys: 5.287 ± 0.8
SerLeu: 4.264 ± 0.57
SerMet: 1.791 ± 0.287
SerAsn: 3.24 ± 0.558
SerPro: 2.388 ± 0.536
SerGln: 3.24 ± 0.616
SerArg: 1.535 ± 0.35
SerSer: 5.713 ± 0.738
SerThr: 4.434 ± 0.697
SerVal: 4.69 ± 0.639
SerTrp: 1.194 ± 0.272
SerTyr: 2.388 ± 0.472
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.822 ± 0.79
ThrCys: 0.171 ± 0.114
ThrAsp: 4.861 ± 0.504
ThrGlu: 2.643 ± 0.384
ThrPhe: 2.473 ± 0.499
ThrGly: 5.116 ± 0.643
ThrHis: 1.279 ± 0.389
ThrIle: 4.178 ± 0.613
ThrLys: 3.581 ± 0.51
ThrLeu: 4.946 ± 0.722
ThrMet: 1.705 ± 0.272
ThrAsn: 2.132 ± 0.39
ThrPro: 3.155 ± 0.557
ThrGln: 2.473 ± 0.524
ThrArg: 2.132 ± 0.43
ThrSer: 4.264 ± 0.731
ThrThr: 4.093 ± 0.611
ThrVal: 5.713 ± 0.712
ThrTrp: 0.767 ± 0.26
ThrTyr: 2.643 ± 0.543
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.992 ± 0.988
ValCys: 0.682 ± 0.257
ValAsp: 5.031 ± 0.665
ValGlu: 3.24 ± 0.741
ValPhe: 2.047 ± 0.359
ValGly: 4.264 ± 0.737
ValHis: 1.023 ± 0.264
ValIle: 4.264 ± 0.59
ValLys: 4.69 ± 0.728
ValLeu: 4.861 ± 0.631
ValMet: 1.791 ± 0.301
ValAsn: 3.155 ± 0.507
ValPro: 2.388 ± 0.471
ValGln: 2.388 ± 0.627
ValArg: 2.814 ± 0.551
ValSer: 5.116 ± 0.789
ValThr: 4.264 ± 0.623
ValVal: 3.326 ± 0.54
ValTrp: 1.194 ± 0.486
ValTyr: 2.473 ± 0.524
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.853 ± 0.271
TrpCys: 0.256 ± 0.137
TrpAsp: 0.767 ± 0.205
TrpGlu: 0.938 ± 0.332
TrpPhe: 0.597 ± 0.245
TrpGly: 1.279 ± 0.343
TrpHis: 0.597 ± 0.253
TrpIle: 1.279 ± 0.276
TrpLys: 1.023 ± 0.33
TrpLeu: 1.535 ± 0.351
TrpMet: 0.171 ± 0.136
TrpAsn: 1.62 ± 0.942
TrpPro: 0.171 ± 0.121
TrpGln: 1.279 ± 0.318
TrpArg: 0.853 ± 0.26
TrpSer: 1.535 ± 0.652
TrpThr: 1.194 ± 0.299
TrpVal: 0.682 ± 0.229
TrpTrp: 0.171 ± 0.122
TrpTyr: 0.512 ± 0.231
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.24 ± 0.478
TyrCys: 0.341 ± 0.212
TyrAsp: 2.558 ± 0.416
TyrGlu: 2.388 ± 0.64
TyrPhe: 1.62 ± 0.377
TyrGly: 2.643 ± 0.547
TyrHis: 1.023 ± 0.267
TyrIle: 2.132 ± 0.391
TyrLys: 2.388 ± 0.396
TyrLeu: 2.899 ± 0.503
TyrMet: 0.426 ± 0.189
TyrAsn: 1.705 ± 0.342
TyrPro: 1.535 ± 0.287
TyrGln: 2.217 ± 0.446
TyrArg: 2.047 ± 0.361
TyrSer: 2.985 ± 0.675
TyrThr: 1.876 ± 0.448
TyrVal: 2.558 ± 0.431
TyrTrp: 0.767 ± 0.285
TyrTyr: 1.876 ± 0.517
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (11728 amino acids)

Note: The error was estimated by bootstrapping (100 iterations) at the protein level.
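A protein-level bootstrap can be sketched as follows. This is an illustration under stated assumptions, not the Proteome-pI procedure: the function names are hypothetical, whole proteins are resampled with replacement, sequences are assumed to be in one-letter code, and the error is taken to be the standard deviation of the resampled frequencies.

```python
import random
import statistics


def pair_frequency(sequences, pair):
    """Frequency (per mille) of one dipeptide among all overlapping
    dipeptide windows in the given sequences (assumed normalisation)."""
    total = sum(max(len(seq) - 1, 0) for seq in sequences)
    if total == 0:
        return 0.0
    hits = sum(
        1
        for seq in sequences
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    return 1000.0 * hits / total


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole
    proteins with replacement and taking the standard deviation."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Each replicate draws as many proteins as the original proteome.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects variation between proteins. A fixed `seed` makes the estimate reproducible.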

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski