Amino acid dipepetide frequency for Vibrio phage CKB-S2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.806AlaAla: 10.806 ± 1.342
0.681AlaCys: 0.681 ± 0.273
5.062AlaAsp: 5.062 ± 0.753
6.62AlaGlu: 6.62 ± 0.842
3.115AlaPhe: 3.115 ± 0.584
10.027AlaGly: 10.027 ± 1.421
0.876AlaHis: 0.876 ± 0.276
3.991AlaIle: 3.991 ± 0.659
6.62AlaLys: 6.62 ± 0.906
6.523AlaLeu: 6.523 ± 0.808
3.407AlaMet: 3.407 ± 0.796
4.576AlaAsn: 4.576 ± 0.928
2.921AlaPro: 2.921 ± 0.726
5.744AlaGln: 5.744 ± 1.077
4.381AlaArg: 4.381 ± 0.685
4.381AlaSer: 4.381 ± 1.015
5.452AlaThr: 5.452 ± 0.663
4.576AlaVal: 4.576 ± 0.697
0.779AlaTrp: 0.779 ± 0.304
3.018AlaTyr: 3.018 ± 0.549
0.0AlaXaa: 0.0 ± 0.0
Cys
0.584CysAla: 0.584 ± 0.234
0.195CysCys: 0.195 ± 0.15
0.487CysAsp: 0.487 ± 0.234
0.779CysGlu: 0.779 ± 0.328
0.487CysPhe: 0.487 ± 0.208
0.292CysGly: 0.292 ± 0.166
0.292CysHis: 0.292 ± 0.194
0.195CysIle: 0.195 ± 0.124
1.266CysLys: 1.266 ± 0.426
0.487CysLeu: 0.487 ± 0.191
0.097CysMet: 0.097 ± 0.091
0.389CysAsn: 0.389 ± 0.214
0.389CysPro: 0.389 ± 0.205
0.195CysGln: 0.195 ± 0.148
0.097CysArg: 0.097 ± 0.107
0.389CysSer: 0.389 ± 0.215
0.584CysThr: 0.584 ± 0.289
0.584CysVal: 0.584 ± 0.267
0.097CysTrp: 0.097 ± 0.1
0.292CysTyr: 0.292 ± 0.159
0.0CysXaa: 0.0 ± 0.0
Asp
4.868AspAla: 4.868 ± 0.774
0.292AspCys: 0.292 ± 0.204
2.823AspAsp: 2.823 ± 0.574
3.407AspGlu: 3.407 ± 0.562
2.434AspPhe: 2.434 ± 0.466
5.16AspGly: 5.16 ± 0.807
0.974AspHis: 0.974 ± 0.394
4.089AspIle: 4.089 ± 0.546
4.868AspLys: 4.868 ± 0.756
4.868AspLeu: 4.868 ± 0.559
1.85AspMet: 1.85 ± 0.608
2.629AspAsn: 2.629 ± 0.521
3.699AspPro: 3.699 ± 0.666
1.85AspGln: 1.85 ± 0.394
4.186AspArg: 4.186 ± 0.728
3.115AspSer: 3.115 ± 0.479
2.434AspThr: 2.434 ± 0.449
3.407AspVal: 3.407 ± 0.578
1.168AspTrp: 1.168 ± 0.41
2.726AspTyr: 2.726 ± 0.422
0.0AspXaa: 0.0 ± 0.0
Glu
6.62GluAla: 6.62 ± 0.797
0.292GluCys: 0.292 ± 0.201
3.31GluAsp: 3.31 ± 0.638
4.77GluGlu: 4.77 ± 0.782
2.531GluPhe: 2.531 ± 0.547
3.505GluGly: 3.505 ± 0.557
0.974GluHis: 0.974 ± 0.33
4.381GluIle: 4.381 ± 0.825
5.16GluLys: 5.16 ± 0.962
8.664GluLeu: 8.664 ± 1.126
1.558GluMet: 1.558 ± 0.39
2.531GluAsn: 2.531 ± 0.681
2.044GluPro: 2.044 ± 0.376
4.381GluGln: 4.381 ± 0.966
3.699GluArg: 3.699 ± 0.791
3.991GluSer: 3.991 ± 0.531
3.797GluThr: 3.797 ± 0.541
3.699GluVal: 3.699 ± 0.64
1.752GluTrp: 1.752 ± 0.371
2.336GluTyr: 2.336 ± 0.438
0.0GluXaa: 0.0 ± 0.0
Phe
2.823PheAla: 2.823 ± 0.69
0.195PheCys: 0.195 ± 0.143
2.921PheAsp: 2.921 ± 0.445
2.336PheGlu: 2.336 ± 0.516
1.071PhePhe: 1.071 ± 0.396
2.239PheGly: 2.239 ± 0.421
0.974PheHis: 0.974 ± 0.303
1.947PheIle: 1.947 ± 0.359
2.531PheLys: 2.531 ± 0.479
2.434PheLeu: 2.434 ± 0.458
0.779PheMet: 0.779 ± 0.382
1.947PheAsn: 1.947 ± 0.471
1.46PhePro: 1.46 ± 0.448
1.071PheGln: 1.071 ± 0.342
1.947PheArg: 1.947 ± 0.465
1.363PheSer: 1.363 ± 0.494
2.142PheThr: 2.142 ± 0.513
3.018PheVal: 3.018 ± 0.512
0.681PheTrp: 0.681 ± 0.266
1.46PheTyr: 1.46 ± 0.385
0.0PheXaa: 0.0 ± 0.0
Gly
8.762GlyAla: 8.762 ± 1.246
0.974GlyCys: 0.974 ± 0.374
4.868GlyAsp: 4.868 ± 0.735
4.965GlyGlu: 4.965 ± 0.806
2.629GlyPhe: 2.629 ± 0.591
7.204GlyGly: 7.204 ± 1.392
0.584GlyHis: 0.584 ± 0.212
3.602GlyIle: 3.602 ± 0.514
4.576GlyLys: 4.576 ± 0.762
5.062GlyLeu: 5.062 ± 0.645
1.655GlyMet: 1.655 ± 0.353
3.602GlyAsn: 3.602 ± 0.554
1.168GlyPro: 1.168 ± 0.288
2.726GlyGln: 2.726 ± 0.467
2.726GlyArg: 2.726 ± 0.506
4.283GlySer: 4.283 ± 0.949
3.699GlyThr: 3.699 ± 0.688
5.841GlyVal: 5.841 ± 0.752
1.558GlyTrp: 1.558 ± 0.366
3.213GlyTyr: 3.213 ± 0.66
0.0GlyXaa: 0.0 ± 0.0
His
1.46HisAla: 1.46 ± 0.466
0.097HisCys: 0.097 ± 0.078
1.46HisAsp: 1.46 ± 0.433
1.363HisGlu: 1.363 ± 0.363
0.876HisPhe: 0.876 ± 0.324
1.363HisGly: 1.363 ± 0.409
0.389HisHis: 0.389 ± 0.241
0.779HisIle: 0.779 ± 0.262
1.46HisLys: 1.46 ± 0.474
1.558HisLeu: 1.558 ± 0.477
0.389HisMet: 0.389 ± 0.219
1.168HisAsn: 1.168 ± 0.303
0.487HisPro: 0.487 ± 0.214
0.876HisGln: 0.876 ± 0.295
1.558HisArg: 1.558 ± 0.449
0.389HisSer: 0.389 ± 0.227
0.974HisThr: 0.974 ± 0.282
0.681HisVal: 0.681 ± 0.293
0.779HisTrp: 0.779 ± 0.313
0.292HisTyr: 0.292 ± 0.139
0.0HisXaa: 0.0 ± 0.0
Ile
4.77IleAla: 4.77 ± 0.59
0.097IleCys: 0.097 ± 0.078
5.257IleAsp: 5.257 ± 0.816
4.089IleGlu: 4.089 ± 0.687
1.266IlePhe: 1.266 ± 0.37
3.699IleGly: 3.699 ± 0.568
0.487IleHis: 0.487 ± 0.254
1.752IleIle: 1.752 ± 0.435
5.16IleLys: 5.16 ± 0.999
3.018IleLeu: 3.018 ± 0.482
1.655IleMet: 1.655 ± 0.406
3.018IleAsn: 3.018 ± 0.458
2.629IlePro: 2.629 ± 0.658
2.531IleGln: 2.531 ± 0.36
2.726IleArg: 2.726 ± 0.551
3.018IleSer: 3.018 ± 0.498
4.576IleThr: 4.576 ± 0.614
2.921IleVal: 2.921 ± 0.495
0.681IleTrp: 0.681 ± 0.269
1.655IleTyr: 1.655 ± 0.486
0.0IleXaa: 0.0 ± 0.0
Lys
8.275LysAla: 8.275 ± 0.961
0.876LysCys: 0.876 ± 0.328
4.186LysAsp: 4.186 ± 0.641
4.868LysGlu: 4.868 ± 0.863
2.142LysPhe: 2.142 ± 0.534
4.868LysGly: 4.868 ± 0.664
2.142LysHis: 2.142 ± 0.475
3.602LysIle: 3.602 ± 0.438
4.089LysLys: 4.089 ± 0.908
5.452LysLeu: 5.452 ± 0.711
2.531LysMet: 2.531 ± 0.461
2.726LysAsn: 2.726 ± 0.62
2.629LysPro: 2.629 ± 0.608
3.213LysGln: 3.213 ± 0.523
4.965LysArg: 4.965 ± 0.728
3.31LysSer: 3.31 ± 0.481
3.018LysThr: 3.018 ± 0.466
4.381LysVal: 4.381 ± 0.762
0.876LysTrp: 0.876 ± 0.254
1.655LysTyr: 1.655 ± 0.504
0.0LysXaa: 0.0 ± 0.0
Leu
6.717LeuAla: 6.717 ± 1.009
0.584LeuCys: 0.584 ± 0.286
4.77LeuAsp: 4.77 ± 0.721
6.523LeuGlu: 6.523 ± 0.768
2.921LeuPhe: 2.921 ± 0.367
4.673LeuGly: 4.673 ± 0.867
1.071LeuHis: 1.071 ± 0.291
4.576LeuIle: 4.576 ± 0.688
4.576LeuLys: 4.576 ± 0.684
5.257LeuLeu: 5.257 ± 0.839
1.85LeuMet: 1.85 ± 0.444
3.31LeuAsn: 3.31 ± 0.519
3.31LeuPro: 3.31 ± 0.615
2.921LeuGln: 2.921 ± 0.612
3.797LeuArg: 3.797 ± 0.705
5.549LeuSer: 5.549 ± 0.723
5.354LeuThr: 5.354 ± 0.613
5.646LeuVal: 5.646 ± 0.896
0.974LeuTrp: 0.974 ± 0.362
2.142LeuTyr: 2.142 ± 0.568
0.0LeuXaa: 0.0 ± 0.0
Met
3.018MetAla: 3.018 ± 0.532
0.292MetCys: 0.292 ± 0.174
1.363MetAsp: 1.363 ± 0.374
2.239MetGlu: 2.239 ± 0.525
1.168MetPhe: 1.168 ± 0.448
2.434MetGly: 2.434 ± 0.362
0.195MetHis: 0.195 ± 0.188
1.947MetIle: 1.947 ± 0.416
1.85MetLys: 1.85 ± 0.523
2.142MetLeu: 2.142 ± 0.522
1.46MetMet: 1.46 ± 0.362
1.266MetAsn: 1.266 ± 0.326
1.071MetPro: 1.071 ± 0.373
1.46MetGln: 1.46 ± 0.391
2.239MetArg: 2.239 ± 0.532
1.363MetSer: 1.363 ± 0.381
1.947MetThr: 1.947 ± 0.513
1.071MetVal: 1.071 ± 0.274
0.195MetTrp: 0.195 ± 0.125
0.584MetTyr: 0.584 ± 0.197
0.0MetXaa: 0.0 ± 0.0
Asn
4.381AsnAla: 4.381 ± 0.988
0.097AsnCys: 0.097 ± 0.091
2.142AsnAsp: 2.142 ± 0.491
3.505AsnGlu: 3.505 ± 0.513
1.85AsnPhe: 1.85 ± 0.415
4.576AsnGly: 4.576 ± 0.853
1.071AsnHis: 1.071 ± 0.358
3.018AsnIle: 3.018 ± 0.583
2.823AsnLys: 2.823 ± 0.617
2.823AsnLeu: 2.823 ± 0.532
1.558AsnMet: 1.558 ± 0.363
3.699AsnAsn: 3.699 ± 0.906
1.558AsnPro: 1.558 ± 0.378
2.434AsnGln: 2.434 ± 0.535
2.629AsnArg: 2.629 ± 0.442
2.629AsnSer: 2.629 ± 0.468
3.31AsnThr: 3.31 ± 0.686
2.434AsnVal: 2.434 ± 0.444
1.071AsnTrp: 1.071 ± 0.321
2.239AsnTyr: 2.239 ± 0.519
0.0AsnXaa: 0.0 ± 0.0
Pro
3.115ProAla: 3.115 ± 0.607
0.195ProCys: 0.195 ± 0.156
2.823ProAsp: 2.823 ± 0.422
3.018ProGlu: 3.018 ± 0.679
0.974ProPhe: 0.974 ± 0.292
0.097ProGly: 0.097 ± 0.104
0.779ProHis: 0.779 ± 0.266
2.239ProIle: 2.239 ± 0.431
2.531ProLys: 2.531 ± 0.536
2.726ProLeu: 2.726 ± 0.635
1.46ProMet: 1.46 ± 0.392
2.239ProAsn: 2.239 ± 0.49
1.46ProPro: 1.46 ± 0.57
1.752ProGln: 1.752 ± 0.345
1.558ProArg: 1.558 ± 0.518
2.434ProSer: 2.434 ± 0.569
2.726ProThr: 2.726 ± 0.58
2.531ProVal: 2.531 ± 0.529
0.292ProTrp: 0.292 ± 0.135
1.363ProTyr: 1.363 ± 0.225
0.0ProXaa: 0.0 ± 0.0
Gln
5.16GlnAla: 5.16 ± 0.888
0.097GlnCys: 0.097 ± 0.098
2.726GlnAsp: 2.726 ± 0.632
3.31GlnGlu: 3.31 ± 0.523
1.655GlnPhe: 1.655 ± 0.375
2.726GlnGly: 2.726 ± 0.602
1.071GlnHis: 1.071 ± 0.285
2.531GlnIle: 2.531 ± 0.506
2.726GlnLys: 2.726 ± 0.618
4.77GlnLeu: 4.77 ± 0.661
1.558GlnMet: 1.558 ± 0.331
2.142GlnAsn: 2.142 ± 0.469
2.142GlnPro: 2.142 ± 0.523
3.115GlnGln: 3.115 ± 0.768
2.434GlnArg: 2.434 ± 0.565
3.018GlnSer: 3.018 ± 0.634
2.336GlnThr: 2.336 ± 0.545
3.115GlnVal: 3.115 ± 0.762
0.779GlnTrp: 0.779 ± 0.256
1.655GlnTyr: 1.655 ± 0.367
0.0GlnXaa: 0.0 ± 0.0
Arg
5.354ArgAla: 5.354 ± 0.579
0.779ArgCys: 0.779 ± 0.315
2.726ArgAsp: 2.726 ± 0.467
3.505ArgGlu: 3.505 ± 0.729
1.85ArgPhe: 1.85 ± 0.368
3.505ArgGly: 3.505 ± 0.553
1.655ArgHis: 1.655 ± 0.455
3.797ArgIle: 3.797 ± 0.654
3.407ArgLys: 3.407 ± 0.765
3.991ArgLeu: 3.991 ± 0.515
1.655ArgMet: 1.655 ± 0.416
2.434ArgAsn: 2.434 ± 0.435
0.974ArgPro: 0.974 ± 0.323
3.31ArgGln: 3.31 ± 0.499
2.629ArgArg: 2.629 ± 0.636
0.876ArgSer: 0.876 ± 0.334
2.629ArgThr: 2.629 ± 0.58
3.213ArgVal: 3.213 ± 0.527
0.487ArgTrp: 0.487 ± 0.211
2.629ArgTyr: 2.629 ± 0.453
0.0ArgXaa: 0.0 ± 0.0
Ser
4.089SerAla: 4.089 ± 0.657
0.389SerCys: 0.389 ± 0.268
3.991SerAsp: 3.991 ± 0.63
3.602SerGlu: 3.602 ± 0.851
2.336SerPhe: 2.336 ± 0.414
5.062SerGly: 5.062 ± 0.855
1.071SerHis: 1.071 ± 0.377
2.142SerIle: 2.142 ± 0.359
3.407SerLys: 3.407 ± 0.578
4.576SerLeu: 4.576 ± 0.607
1.558SerMet: 1.558 ± 0.377
3.407SerAsn: 3.407 ± 0.508
2.726SerPro: 2.726 ± 0.51
2.921SerGln: 2.921 ± 0.609
2.239SerArg: 2.239 ± 0.586
1.947SerSer: 1.947 ± 0.459
2.921SerThr: 2.921 ± 0.466
3.894SerVal: 3.894 ± 0.545
0.584SerTrp: 0.584 ± 0.259
1.947SerTyr: 1.947 ± 0.403
0.0SerXaa: 0.0 ± 0.0
Thr
4.673ThrAla: 4.673 ± 0.629
0.195ThrCys: 0.195 ± 0.13
3.115ThrAsp: 3.115 ± 0.699
3.894ThrGlu: 3.894 ± 0.66
2.239ThrPhe: 2.239 ± 0.639
5.452ThrGly: 5.452 ± 0.828
1.558ThrHis: 1.558 ± 0.498
3.699ThrIle: 3.699 ± 0.751
4.868ThrLys: 4.868 ± 0.702
3.797ThrLeu: 3.797 ± 0.595
1.655ThrMet: 1.655 ± 0.315
2.434ThrAsn: 2.434 ± 0.543
2.921ThrPro: 2.921 ± 0.515
2.629ThrGln: 2.629 ± 0.546
2.726ThrArg: 2.726 ± 0.57
2.726ThrSer: 2.726 ± 0.485
2.921ThrThr: 2.921 ± 0.6
3.699ThrVal: 3.699 ± 0.984
0.389ThrTrp: 0.389 ± 0.199
1.168ThrTyr: 1.168 ± 0.318
0.0ThrXaa: 0.0 ± 0.0
Val
4.673ValAla: 4.673 ± 0.732
0.876ValCys: 0.876 ± 0.31
3.31ValAsp: 3.31 ± 0.513
4.381ValGlu: 4.381 ± 0.717
1.752ValPhe: 1.752 ± 0.367
4.186ValGly: 4.186 ± 0.881
1.168ValHis: 1.168 ± 0.265
4.381ValIle: 4.381 ± 0.62
4.186ValLys: 4.186 ± 0.718
4.77ValLeu: 4.77 ± 0.615
1.363ValMet: 1.363 ± 0.296
3.991ValAsn: 3.991 ± 0.597
1.947ValPro: 1.947 ± 0.42
2.239ValGln: 2.239 ± 0.45
2.239ValArg: 2.239 ± 0.378
5.841ValSer: 5.841 ± 0.74
3.699ValThr: 3.699 ± 0.754
4.478ValVal: 4.478 ± 0.847
0.779ValTrp: 0.779 ± 0.236
2.044ValTyr: 2.044 ± 0.465
0.0ValXaa: 0.0 ± 0.0
Trp
1.071TrpAla: 1.071 ± 0.409
0.389TrpCys: 0.389 ± 0.2
0.974TrpAsp: 0.974 ± 0.275
1.071TrpGlu: 1.071 ± 0.366
0.584TrpPhe: 0.584 ± 0.195
0.292TrpGly: 0.292 ± 0.161
0.389TrpHis: 0.389 ± 0.305
0.487TrpIle: 0.487 ± 0.226
0.779TrpLys: 0.779 ± 0.375
1.46TrpLeu: 1.46 ± 0.465
0.195TrpMet: 0.195 ± 0.141
1.071TrpAsn: 1.071 ± 0.353
0.195TrpPro: 0.195 ± 0.134
0.779TrpGln: 0.779 ± 0.305
0.779TrpArg: 0.779 ± 0.298
1.266TrpSer: 1.266 ± 0.428
0.974TrpThr: 0.974 ± 0.309
1.071TrpVal: 1.071 ± 0.29
0.097TrpTrp: 0.097 ± 0.086
0.487TrpTyr: 0.487 ± 0.241
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.336TyrAla: 2.336 ± 0.629
0.584TyrCys: 0.584 ± 0.268
2.531TyrAsp: 2.531 ± 0.62
1.752TyrGlu: 1.752 ± 0.406
1.46TyrPhe: 1.46 ± 0.289
2.336TyrGly: 2.336 ± 0.365
0.681TyrHis: 0.681 ± 0.242
1.85TyrIle: 1.85 ± 0.44
2.921TyrLys: 2.921 ± 0.579
2.142TyrLeu: 2.142 ± 0.512
0.974TyrMet: 0.974 ± 0.277
1.363TyrAsn: 1.363 ± 0.361
0.584TyrPro: 0.584 ± 0.227
2.921TyrGln: 2.921 ± 0.526
1.85TyrArg: 1.85 ± 0.43
2.921TyrSer: 2.921 ± 0.466
1.266TyrThr: 1.266 ± 0.496
2.044TyrVal: 2.044 ± 0.35
0.389TyrTrp: 0.389 ± 0.187
1.558TyrTyr: 1.558 ± 0.454
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (10273 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski