Amino acid dipeptide frequency for Vibrio phage VP5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
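
As a minimal sketch of such a calculation, the Python snippet below counts overlapping adjacent pairs within each protein and reports them in per mille. The one-letter alphabet, the function name and the toy sequences are illustrative assumptions, and so is the choice to normalise by the number of pairs: the page does not state the exact normalisation, and the tabulated values look like multiples of 1000/5479, which would suggest division by the residue count instead.

```python
from collections import Counter
from itertools import product

# One-letter codes in the column order of the table; "X" stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"

def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of all 441 ordered dipeptides.

    Overlapping pairs are counted within each protein; pairs never span
    two proteins. Any letter outside the alphabet is mapped to X.
    Assumption: frequencies are normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

# Hypothetical toy input, not the VP5 proteome.
freqs = dipeptide_frequencies(["MAAG", "MAG"])
print(freqs["MA"], freqs["AA"], freqs["AG"])
```

The toy input contains five pairs in total (MA twice, AG twice, AA once), so the three printed frequencies sum to 1000‰.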

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
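
The expectation and the over/under labelling can be sketched as follows. This is only an illustration: the function names are hypothetical, and the page does not say whether the colouring uses a plain comparison with the uniform expectation (as assumed here) or also takes the error estimate into account.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for one ordered dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)

def representation(observed, expected):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a plain comparison, ignoring the bootstrap error.
    """
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"

expected = expected_per_mille()  # 1000 / 400 = 2.5 per mille, i.e. 0.25%

# Two values copied from the table (per mille).
for name, observed in {"GlyGly": 15.152, "ProGly": 0.0}.items():
    print(name, representation(observed, expected))
```

With 21 residue types (Xaa included), the same function gives 1000/441, roughly 2.27‰.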

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
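
For illustration, the unit conversion can be written as two small helpers; the function names are hypothetical and the input is the AlaAla value from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala = 7.302  # AlaAla frequency from the table, in per mille
print(round(per_mille_to_fraction(ala_ala), 6))
print(round(per_mille_to_percent(ala_ala), 4))
```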

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each entry is frequency ± error, in ‰.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 7.302 ± 1.553 | 0.913 ± 0.397 | 5.842 ± 1.042 | 6.207 ± 1.19 | 2.008 ± 0.409 | 7.484 ± 1.039 | 1.278 ± 0.452 | 5.294 ± 0.753 | 4.564 ± 1.173 | 7.85 ± 1.122 | 3.468 ± 0.568 | 2.556 ± 0.493 | 2.373 ± 0.53 | 3.286 ± 1.008 | 4.199 ± 0.654 | 5.659 ± 0.955 | 5.659 ± 1.126 | 4.381 ± 0.871 | 1.46 ± 0.476 | 2.373 ± 0.779 | 0.0 ± 0.0 |
| Cys | 0.365 ± 0.287 | 0.365 ± 0.202 | 0.548 ± 0.353 | 0.548 ± 0.229 | 0.548 ± 0.225 | 1.278 ± 0.51 | 0.365 ± 0.259 | 1.46 ± 0.449 | 0.548 ± 0.292 | 0.913 ± 0.316 | 0.0 ± 0.0 | 0.365 ± 0.264 | 0.365 ± 0.203 | 0.365 ± 0.26 | 0.548 ± 0.337 | 0.0 ± 0.0 | 1.643 ± 0.687 | 0.183 ± 0.168 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 5.294 ± 0.684 | 1.278 ± 0.48 | 2.921 ± 0.917 | 5.842 ± 1.685 | 3.286 ± 0.736 | 4.564 ± 1.122 | 0.73 ± 0.44 | 4.746 ± 0.767 | 4.381 ± 0.947 | 5.659 ± 0.906 | 2.373 ± 0.738 | 1.825 ± 0.516 | 3.103 ± 0.722 | 0.913 ± 0.345 | 2.556 ± 0.95 | 5.659 ± 0.889 | 2.738 ± 0.617 | 3.834 ± 0.844 | 2.191 ± 0.582 | 1.278 ± 0.45 | 0.0 ± 0.0 |
| Glu | 4.929 ± 0.747 | 0.913 ± 0.377 | 3.834 ± 0.85 | 6.024 ± 1.423 | 2.373 ± 0.453 | 4.016 ± 0.886 | 2.008 ± 0.715 | 3.103 ± 0.947 | 2.738 ± 0.575 | 5.659 ± 0.932 | 2.008 ± 0.6 | 2.738 ± 0.953 | 1.643 ± 0.499 | 2.921 ± 0.908 | 4.381 ± 1.029 | 2.921 ± 0.745 | 2.921 ± 0.68 | 4.199 ± 0.743 | 1.643 ± 0.506 | 2.556 ± 0.677 | 0.0 ± 0.0 |
| Phe | 3.103 ± 0.896 | 0.183 ± 0.218 | 2.738 ± 0.658 | 2.008 ± 0.548 | 0.73 ± 0.589 | 4.929 ± 1.056 | 0.183 ± 0.162 | 2.008 ± 0.445 | 1.643 ± 0.924 | 2.556 ± 0.739 | 1.278 ± 0.611 | 2.738 ± 0.633 | 1.278 ± 0.342 | 1.643 ± 0.592 | 1.825 ± 0.92 | 2.191 ± 0.975 | 2.191 ± 0.588 | 2.373 ± 0.882 | 0.183 ± 0.225 | 1.095 ± 0.377 | 0.0 ± 0.0 |
| Gly | 6.207 ± 1.566 | 0.913 ± 0.508 | 5.294 ± 0.783 | 2.921 ± 0.553 | 2.738 ± 0.418 | 15.152 ± 8.495 | 2.191 ± 0.563 | 4.746 ± 0.769 | 6.389 ± 0.832 | 4.016 ± 0.515 | 2.373 ± 0.931 | 6.754 ± 2.606 | 3.103 ± 0.41 | 4.016 ± 0.574 | 4.199 ± 0.662 | 4.746 ± 1.317 | 5.111 ± 1.025 | 4.564 ± 1.048 | 1.46 ± 0.415 | 3.468 ± 0.411 | 0.0 ± 0.0 |
| His | 1.095 ± 0.535 | 0.365 ± 0.291 | 1.643 ± 0.451 | 1.095 ± 0.436 | 0.365 ± 0.246 | 1.825 ± 0.699 | 0.0 ± 0.0 | 2.008 ± 0.79 | 0.548 ± 0.461 | 1.095 ± 0.544 | 0.913 ± 0.567 | 0.913 ± 0.226 | 0.73 ± 0.467 | 0.548 ± 0.304 | 1.095 ± 0.453 | 1.095 ± 0.406 | 0.548 ± 0.274 | 1.278 ± 0.388 | 0.0 ± 0.0 | 0.913 ± 0.533 | 0.0 ± 0.0 |
| Ile | 6.937 ± 1.112 | 0.183 ± 0.167 | 5.842 ± 1.228 | 3.468 ± 0.75 | 0.913 ± 0.515 | 4.381 ± 0.895 | 0.365 ± 0.26 | 2.556 ± 0.551 | 2.921 ± 0.811 | 2.921 ± 0.702 | 1.825 ± 0.467 | 2.191 ± 0.68 | 3.103 ± 0.487 | 2.556 ± 0.622 | 4.199 ± 0.722 | 3.651 ± 1.069 | 3.286 ± 1.029 | 3.103 ± 0.68 | 1.46 ± 0.564 | 0.913 ± 0.335 | 0.0 ± 0.0 |
| Lys | 5.842 ± 0.927 | 0.183 ± 0.162 | 2.556 ± 0.717 | 2.738 ± 0.909 | 1.46 ± 0.483 | 4.929 ± 1.273 | 1.46 ± 0.387 | 2.921 ± 0.513 | 5.111 ± 1.092 | 4.199 ± 0.807 | 2.373 ± 0.872 | 3.103 ± 0.839 | 2.373 ± 0.712 | 2.191 ± 0.623 | 2.738 ± 0.88 | 3.651 ± 0.996 | 2.373 ± 0.726 | 3.468 ± 0.941 | 1.46 ± 0.452 | 3.286 ± 0.729 | 0.0 ± 0.0 |
| Leu | 5.659 ± 0.924 | 0.365 ± 0.268 | 4.381 ± 0.737 | 4.746 ± 0.952 | 1.643 ± 0.661 | 5.659 ± 0.924 | 2.008 ± 0.644 | 4.199 ± 0.962 | 4.016 ± 0.899 | 2.921 ± 0.456 | 3.651 ± 0.917 | 2.921 ± 0.786 | 3.103 ± 0.457 | 2.921 ± 0.717 | 4.746 ± 0.598 | 4.381 ± 0.943 | 4.746 ± 0.655 | 4.199 ± 0.852 | 1.46 ± 0.655 | 1.095 ± 0.388 | 0.0 ± 0.0 |
| Met | 3.286 ± 1.622 | 0.548 ± 0.293 | 2.191 ± 0.343 | 1.825 ± 0.676 | 1.095 ± 0.461 | 1.825 ± 0.668 | 0.548 ± 0.323 | 1.643 ± 0.395 | 1.825 ± 0.473 | 2.191 ± 0.583 | 1.278 ± 0.552 | 1.46 ± 0.525 | 2.373 ± 0.523 | 1.278 ± 0.544 | 1.825 ± 0.688 | 2.556 ± 0.731 | 1.46 ± 0.452 | 2.556 ± 0.685 | 0.183 ± 0.218 | 1.46 ± 0.588 | 0.0 ± 0.0 |
| Asn | 5.111 ± 0.919 | 0.365 ± 0.241 | 2.008 ± 0.727 | 2.373 ± 0.7 | 1.825 ± 0.329 | 5.294 ± 2.145 | 0.365 ± 0.236 | 2.738 ± 0.84 | 3.286 ± 0.617 | 2.191 ± 0.583 | 1.095 ± 0.482 | 2.008 ± 0.606 | 2.008 ± 0.677 | 2.373 ± 0.537 | 4.016 ± 0.846 | 2.008 ± 0.94 | 2.373 ± 0.555 | 2.373 ± 0.527 | 1.46 ± 0.59 | 1.095 ± 0.332 | 0.0 ± 0.0 |
| Pro | 3.651 ± 0.673 | 0.183 ± 0.161 | 4.564 ± 0.644 | 2.556 ± 0.444 | 1.825 ± 0.345 | 0.0 ± 0.0 | 0.913 ± 0.485 | 2.008 ± 0.533 | 2.008 ± 0.562 | 1.825 ± 0.741 | 1.825 ± 0.814 | 3.468 ± 0.789 | 1.278 ± 0.838 | 0.913 ± 0.25 | 2.373 ± 0.69 | 2.738 ± 0.501 | 3.834 ± 0.757 | 3.103 ± 1.205 | 0.73 ± 0.33 | 0.913 ± 0.3 | 0.0 ± 0.0 |
| Gln | 3.651 ± 0.777 | 0.183 ± 0.167 | 2.008 ± 0.6 | 2.921 ± 0.813 | 1.278 ± 0.404 | 2.738 ± 0.608 | 0.365 ± 0.324 | 2.556 ± 0.648 | 1.643 ± 0.426 | 3.286 ± 0.938 | 1.278 ± 0.558 | 1.278 ± 0.641 | 1.095 ± 0.444 | 1.643 ± 0.555 | 3.286 ± 0.951 | 2.191 ± 0.447 | 2.373 ± 0.593 | 1.643 ± 0.46 | 1.643 ± 0.365 | 1.095 ± 0.533 | 0.0 ± 0.0 |
| Arg | 5.111 ± 0.894 | 0.365 ± 0.286 | 3.286 ± 0.581 | 3.834 ± 0.575 | 3.651 ± 1.31 | 5.476 ± 1.295 | 0.73 ± 0.706 | 2.738 ± 0.533 | 4.381 ± 0.95 | 4.381 ± 0.804 | 2.556 ± 0.506 | 3.468 ± 1.013 | 2.373 ± 0.715 | 2.008 ± 0.548 | 3.468 ± 0.408 | 2.921 ± 0.837 | 2.008 ± 0.59 | 3.286 ± 0.721 | 1.643 ± 0.736 | 2.556 ± 0.561 | 0.0 ± 0.0 |
| Ser | 4.381 ± 1.276 | 0.365 ± 0.287 | 4.564 ± 0.891 | 3.651 ± 0.779 | 3.286 ± 0.638 | 5.111 ± 1.648 | 1.095 ± 0.391 | 3.286 ± 0.941 | 3.468 ± 1.217 | 4.016 ± 0.728 | 2.191 ± 0.562 | 2.921 ± 1.313 | 3.651 ± 0.823 | 2.008 ± 0.377 | 3.468 ± 0.582 | 3.286 ± 0.788 | 2.191 ± 0.777 | 4.199 ± 1.211 | 1.095 ± 0.393 | 2.373 ± 0.708 | 0.0 ± 0.0 |
| Thr | 4.016 ± 0.953 | 0.365 ± 0.272 | 2.738 ± 0.784 | 3.286 ± 0.591 | 3.103 ± 0.63 | 6.572 ± 1.256 | 0.548 ± 0.298 | 2.556 ± 0.425 | 2.921 ± 0.741 | 4.564 ± 0.74 | 0.73 ± 0.373 | 2.008 ± 0.739 | 2.556 ± 0.41 | 2.373 ± 0.583 | 3.286 ± 0.488 | 3.468 ± 0.775 | 2.738 ± 0.846 | 3.834 ± 1.094 | 0.548 ± 0.307 | 1.643 ± 0.457 | 0.0 ± 0.0 |
| Val | 4.016 ± 1.02 | 1.095 ± 0.384 | 3.651 ± 0.708 | 4.016 ± 0.593 | 2.738 ± 0.914 | 5.294 ± 1.017 | 2.373 ± 0.705 | 3.651 ± 0.827 | 2.556 ± 0.365 | 4.746 ± 0.904 | 1.643 ± 0.483 | 0.913 ± 0.339 | 2.373 ± 0.466 | 1.825 ± 0.598 | 4.016 ± 1.09 | 4.016 ± 1.794 | 2.556 ± 0.895 | 4.199 ± 0.713 | 1.46 ± 0.492 | 2.738 ± 0.837 | 0.0 ± 0.0 |
| Trp | 1.825 ± 0.292 | 0.548 ± 0.251 | 1.825 ± 0.693 | 1.643 ± 0.527 | 1.278 ± 0.819 | 0.913 ± 0.283 | 0.183 ± 0.215 | 0.913 ± 0.298 | 0.73 ± 0.345 | 1.643 ± 0.687 | 0.365 ± 0.257 | 0.365 ± 0.255 | 0.365 ± 0.324 | 0.73 ± 0.339 | 2.008 ± 0.634 | 1.46 ± 0.499 | 1.643 ± 0.432 | 1.095 ± 0.418 | 0.73 ± 0.31 | 0.913 ± 0.328 | 0.0 ± 0.0 |
| Tyr | 2.373 ± 0.596 | 0.73 ± 0.484 | 2.738 ± 0.522 | 1.825 ± 0.61 | 0.73 ± 0.34 | 2.921 ± 0.7 | 0.548 ± 0.243 | 1.643 ± 0.542 | 2.738 ± 0.829 | 2.191 ± 0.748 | 0.0 ± 0.0 | 2.373 ± 0.661 | 1.46 ± 0.507 | 1.643 ± 0.442 | 2.008 ± 0.46 | 2.008 ± 0.567 | 1.46 ± 0.426 | 2.008 ± 0.474 | 0.365 ± 0.257 | 1.278 ± 0.692 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 12 proteins (5479 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
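
A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. This is a minimal sketch under stated assumptions, not the database's actual procedure: the function names, the fixed seed, the use of the standard deviation as the error measure, and the normalisation by pair count are all illustrative choices that the page does not confirm.

```python
import random
import statistics

def dipeptide_per_mille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide (one-letter codes, e.g. "GG").

    Assumption: overlapping pairs, normalised by the total number of pairs.
    """
    pairs = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            pairs += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / pairs if pairs else 0.0

def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)

# Hypothetical toy proteome, not the VP5 sequences.
toy = ["MGGA", "MAAK", "MGGGK"]
print(dipeptide_per_mille(toy, "GG"), bootstrap_error(toy, "GG"))
```

A dipeptide that never occurs in any protein yields 0.0 in every replicate, which matches the 0.0 ± 0.0 entries in the table.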

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski