Amino acid dipepetide frequency for Vibrio phage VGJ

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.667AlaAla: 6.667 ± 1.969
0.0AlaCys: 0.0 ± 0.0
2.857AlaAsp: 2.857 ± 1.276
3.81AlaGlu: 3.81 ± 1.092
5.238AlaPhe: 5.238 ± 1.04
3.81AlaGly: 3.81 ± 1.389
0.952AlaHis: 0.952 ± 0.643
6.667AlaIle: 6.667 ± 2.336
4.286AlaLys: 4.286 ± 1.316
10.476AlaLeu: 10.476 ± 2.838
3.333AlaMet: 3.333 ± 1.797
1.429AlaAsn: 1.429 ± 0.837
3.333AlaPro: 3.333 ± 1.212
3.81AlaGln: 3.81 ± 0.926
2.381AlaArg: 2.381 ± 1.439
1.429AlaSer: 1.429 ± 0.867
1.905AlaThr: 1.905 ± 1.305
6.19AlaVal: 6.19 ± 1.884
0.952AlaTrp: 0.952 ± 0.679
3.333AlaTyr: 3.333 ± 1.455
0.0AlaXaa: 0.0 ± 0.0
Cys
1.905CysAla: 1.905 ± 0.956
0.476CysCys: 0.476 ± 0.543
2.381CysAsp: 2.381 ± 0.908
0.952CysGlu: 0.952 ± 0.606
2.381CysPhe: 2.381 ± 0.861
1.905CysGly: 1.905 ± 0.919
0.476CysHis: 0.476 ± 0.528
1.429CysIle: 1.429 ± 0.738
1.429CysLys: 1.429 ± 0.811
0.476CysLeu: 0.476 ± 0.544
0.476CysMet: 0.476 ± 0.381
0.0CysAsn: 0.0 ± 0.0
2.381CysPro: 2.381 ± 1.898
0.476CysGln: 0.476 ± 0.381
0.952CysArg: 0.952 ± 0.596
2.381CysSer: 2.381 ± 1.184
2.381CysThr: 2.381 ± 0.89
0.952CysVal: 0.952 ± 0.456
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.286AspAla: 4.286 ± 1.265
1.905AspCys: 1.905 ± 1.103
6.667AspAsp: 6.667 ± 1.976
2.381AspGlu: 2.381 ± 0.892
2.381AspPhe: 2.381 ± 1.04
2.857AspGly: 2.857 ± 1.618
1.429AspHis: 1.429 ± 0.551
4.762AspIle: 4.762 ± 1.74
2.381AspLys: 2.381 ± 1.387
4.762AspLeu: 4.762 ± 1.4
1.429AspMet: 1.429 ± 0.494
1.429AspAsn: 1.429 ± 0.972
5.714AspPro: 5.714 ± 2.379
0.952AspGln: 0.952 ± 0.456
2.381AspArg: 2.381 ± 1.064
3.333AspSer: 3.333 ± 1.378
3.333AspThr: 3.333 ± 1.333
4.762AspVal: 4.762 ± 2.221
0.952AspTrp: 0.952 ± 0.456
3.333AspTyr: 3.333 ± 1.149
0.0AspXaa: 0.0 ± 0.0
Glu
4.762GluAla: 4.762 ± 1.098
2.381GluCys: 2.381 ± 1.012
2.381GluAsp: 2.381 ± 0.852
2.381GluGlu: 2.381 ± 0.806
1.905GluPhe: 1.905 ± 0.917
1.429GluGly: 1.429 ± 0.881
1.905GluHis: 1.905 ± 0.79
1.905GluIle: 1.905 ± 0.84
2.857GluLys: 2.857 ± 0.755
5.238GluLeu: 5.238 ± 2.05
0.952GluMet: 0.952 ± 0.655
2.381GluAsn: 2.381 ± 0.91
5.238GluPro: 5.238 ± 1.559
1.429GluGln: 1.429 ± 0.809
0.0GluArg: 0.0 ± 0.0
3.81GluSer: 3.81 ± 1.756
3.81GluThr: 3.81 ± 1.751
2.381GluVal: 2.381 ± 0.849
0.476GluTrp: 0.476 ± 0.581
1.429GluTyr: 1.429 ± 0.678
0.0GluXaa: 0.0 ± 0.0
Phe
5.238PheAla: 5.238 ± 2.116
0.952PheCys: 0.952 ± 0.596
2.857PheAsp: 2.857 ± 0.987
2.381PheGlu: 2.381 ± 0.718
1.429PhePhe: 1.429 ± 1.055
5.714PheGly: 5.714 ± 0.983
0.476PheHis: 0.476 ± 0.717
3.333PheIle: 3.333 ± 1.091
2.381PheLys: 2.381 ± 1.495
2.857PheLeu: 2.857 ± 1.152
2.381PheMet: 2.381 ± 0.942
1.905PheAsn: 1.905 ± 1.118
0.952PhePro: 0.952 ± 0.768
0.952PheGln: 0.952 ± 0.804
1.905PheArg: 1.905 ± 1.348
6.667PheSer: 6.667 ± 1.661
1.905PheThr: 1.905 ± 0.962
3.333PheVal: 3.333 ± 0.744
0.952PheTrp: 0.952 ± 0.456
1.905PheTyr: 1.905 ± 0.76
0.0PheXaa: 0.0 ± 0.0
Gly
3.333GlyAla: 3.333 ± 1.518
1.429GlyCys: 1.429 ± 0.568
6.19GlyAsp: 6.19 ± 1.634
2.381GlyGlu: 2.381 ± 0.89
3.81GlyPhe: 3.81 ± 1.645
4.762GlyGly: 4.762 ± 1.567
0.952GlyHis: 0.952 ± 0.755
7.143GlyIle: 7.143 ± 1.765
1.905GlyLys: 1.905 ± 0.741
6.19GlyLeu: 6.19 ± 1.955
2.381GlyMet: 2.381 ± 0.957
1.905GlyAsn: 1.905 ± 0.966
0.476GlyPro: 0.476 ± 0.464
1.905GlyGln: 1.905 ± 1.005
1.905GlyArg: 1.905 ± 0.9
6.19GlySer: 6.19 ± 1.604
2.381GlyThr: 2.381 ± 0.882
3.81GlyVal: 3.81 ± 2.052
0.952GlyTrp: 0.952 ± 0.594
2.381GlyTyr: 2.381 ± 0.998
0.0GlyXaa: 0.0 ± 0.0
His
2.857HisAla: 2.857 ± 1.239
0.0HisCys: 0.0 ± 0.0
0.952HisAsp: 0.952 ± 0.965
0.0HisGlu: 0.0 ± 0.0
0.952HisPhe: 0.952 ± 0.762
0.476HisGly: 0.476 ± 0.464
0.0HisHis: 0.0 ± 0.0
0.952HisIle: 0.952 ± 0.804
0.952HisLys: 0.952 ± 0.929
0.476HisLeu: 0.476 ± 0.381
0.476HisMet: 0.476 ± 0.402
0.952HisAsn: 0.952 ± 0.606
0.476HisPro: 0.476 ± 0.402
0.476HisGln: 0.476 ± 0.546
0.952HisArg: 0.952 ± 0.807
0.952HisSer: 0.952 ± 0.655
0.476HisThr: 0.476 ± 0.402
0.952HisVal: 0.952 ± 0.804
0.476HisTrp: 0.476 ± 0.464
1.429HisTyr: 1.429 ± 0.974
0.0HisXaa: 0.0 ± 0.0
Ile
5.714IleAla: 5.714 ± 2.288
1.905IleCys: 1.905 ± 0.864
5.714IleAsp: 5.714 ± 1.634
3.81IleGlu: 3.81 ± 1.266
3.81IlePhe: 3.81 ± 1.493
3.333IleGly: 3.333 ± 1.25
0.952IleHis: 0.952 ± 0.755
5.714IleIle: 5.714 ± 1.533
3.333IleLys: 3.333 ± 1.691
2.857IleLeu: 2.857 ± 1.343
0.952IleMet: 0.952 ± 0.646
3.81IleAsn: 3.81 ± 1.424
5.714IlePro: 5.714 ± 1.586
3.333IleGln: 3.333 ± 1.249
1.905IleArg: 1.905 ± 0.962
6.667IleSer: 6.667 ± 2.249
6.667IleThr: 6.667 ± 1.987
4.762IleVal: 4.762 ± 1.406
1.905IleTrp: 1.905 ± 0.964
3.81IleTyr: 3.81 ± 1.408
0.0IleXaa: 0.0 ± 0.0
Lys
5.714LysAla: 5.714 ± 1.177
1.905LysCys: 1.905 ± 1.305
2.381LysAsp: 2.381 ± 1.242
1.429LysGlu: 1.429 ± 0.762
2.381LysPhe: 2.381 ± 0.83
3.81LysGly: 3.81 ± 2.263
0.952LysHis: 0.952 ± 0.57
3.333LysIle: 3.333 ± 0.863
5.714LysLys: 5.714 ± 2.167
4.286LysLeu: 4.286 ± 2.156
2.857LysMet: 2.857 ± 1.181
3.333LysAsn: 3.333 ± 0.775
2.381LysPro: 2.381 ± 0.903
3.81LysGln: 3.81 ± 1.41
4.286LysArg: 4.286 ± 1.702
3.81LysSer: 3.81 ± 1.203
3.333LysThr: 3.333 ± 0.681
3.333LysVal: 3.333 ± 1.551
0.0LysTrp: 0.0 ± 0.0
1.429LysTyr: 1.429 ± 0.74
0.0LysXaa: 0.0 ± 0.0
Leu
5.238LeuAla: 5.238 ± 1.837
2.857LeuCys: 2.857 ± 1.313
4.762LeuAsp: 4.762 ± 1.117
2.857LeuGlu: 2.857 ± 1.208
0.952LeuPhe: 0.952 ± 0.689
7.619LeuGly: 7.619 ± 1.523
2.381LeuHis: 2.381 ± 0.955
8.095LeuIle: 8.095 ± 2.106
4.286LeuLys: 4.286 ± 1.39
9.524LeuLeu: 9.524 ± 2.921
2.381LeuMet: 2.381 ± 1.358
4.762LeuAsn: 4.762 ± 1.702
3.333LeuPro: 3.333 ± 1.471
1.905LeuGln: 1.905 ± 0.876
3.333LeuArg: 3.333 ± 1.646
6.667LeuSer: 6.667 ± 1.952
4.762LeuThr: 4.762 ± 1.083
3.81LeuVal: 3.81 ± 1.403
0.952LeuTrp: 0.952 ± 0.679
3.81LeuTyr: 3.81 ± 1.267
0.0LeuXaa: 0.0 ± 0.0
Met
2.857MetAla: 2.857 ± 1.674
0.0MetCys: 0.0 ± 0.0
0.952MetAsp: 0.952 ± 0.665
0.952MetGlu: 0.952 ± 0.638
0.952MetPhe: 0.952 ± 0.456
0.952MetGly: 0.952 ± 0.755
0.0MetHis: 0.0 ± 0.0
2.381MetIle: 2.381 ± 1.052
0.476MetLys: 0.476 ± 0.381
2.857MetLeu: 2.857 ± 1.349
0.952MetMet: 0.952 ± 0.763
1.905MetAsn: 1.905 ± 0.843
0.476MetPro: 0.476 ± 0.565
0.0MetGln: 0.0 ± 0.0
2.381MetArg: 2.381 ± 0.784
1.905MetSer: 1.905 ± 1.12
2.381MetThr: 2.381 ± 1.242
3.333MetVal: 3.333 ± 0.868
0.0MetTrp: 0.0 ± 0.0
0.476MetTyr: 0.476 ± 0.528
0.0MetXaa: 0.0 ± 0.0
Asn
1.429AsnAla: 1.429 ± 0.849
0.476AsnCys: 0.476 ± 0.381
1.905AsnAsp: 1.905 ± 0.849
5.714AsnGlu: 5.714 ± 2.76
1.905AsnPhe: 1.905 ± 0.856
1.905AsnGly: 1.905 ± 1.133
0.0AsnHis: 0.0 ± 0.0
2.381AsnIle: 2.381 ± 0.994
4.762AsnLys: 4.762 ± 1.784
3.333AsnLeu: 3.333 ± 1.216
0.476AsnMet: 0.476 ± 0.402
1.905AsnAsn: 1.905 ± 0.673
4.286AsnPro: 4.286 ± 1.727
0.952AsnGln: 0.952 ± 0.456
2.381AsnArg: 2.381 ± 0.937
2.857AsnSer: 2.857 ± 1.168
2.381AsnThr: 2.381 ± 0.732
1.905AsnVal: 1.905 ± 1.309
0.476AsnTrp: 0.476 ± 0.717
0.952AsnTyr: 0.952 ± 0.881
0.0AsnXaa: 0.0 ± 0.0
Pro
1.905ProAla: 1.905 ± 0.578
0.476ProCys: 0.476 ± 0.381
4.762ProAsp: 4.762 ± 1.714
5.714ProGlu: 5.714 ± 2.226
3.81ProPhe: 3.81 ± 0.949
0.476ProGly: 0.476 ± 0.381
0.952ProHis: 0.952 ± 0.929
2.381ProIle: 2.381 ± 1.257
3.333ProLys: 3.333 ± 1.214
5.714ProLeu: 5.714 ± 2.208
1.429ProMet: 1.429 ± 1.029
2.857ProAsn: 2.857 ± 1.389
3.333ProPro: 3.333 ± 0.819
1.905ProGln: 1.905 ± 0.902
2.381ProArg: 2.381 ± 1.354
5.714ProSer: 5.714 ± 2.18
4.286ProThr: 4.286 ± 1.837
2.857ProVal: 2.857 ± 0.812
0.476ProTrp: 0.476 ± 0.464
0.476ProTyr: 0.476 ± 0.717
0.0ProXaa: 0.0 ± 0.0
Gln
1.429GlnAla: 1.429 ± 0.652
1.429GlnCys: 1.429 ± 0.746
2.381GlnAsp: 2.381 ± 0.925
0.952GlnGlu: 0.952 ± 0.762
0.952GlnPhe: 0.952 ± 0.643
2.857GlnGly: 2.857 ± 1.806
0.476GlnHis: 0.476 ± 0.381
2.857GlnIle: 2.857 ± 0.671
1.905GlnLys: 1.905 ± 0.788
5.238GlnLeu: 5.238 ± 1.339
0.0GlnMet: 0.0 ± 0.0
1.429GlnAsn: 1.429 ± 0.625
0.952GlnPro: 0.952 ± 0.654
1.905GlnGln: 1.905 ± 0.861
2.381GlnArg: 2.381 ± 0.829
2.857GlnSer: 2.857 ± 1.142
2.381GlnThr: 2.381 ± 0.888
1.429GlnVal: 1.429 ± 0.867
0.476GlnTrp: 0.476 ± 0.581
1.429GlnTyr: 1.429 ± 1.059
0.0GlnXaa: 0.0 ± 0.0
Arg
2.857ArgAla: 2.857 ± 1.385
1.429ArgCys: 1.429 ± 1.393
1.429ArgAsp: 1.429 ± 1.207
1.905ArgGlu: 1.905 ± 1.019
2.381ArgPhe: 2.381 ± 0.763
1.429ArgGly: 1.429 ± 1.393
0.0ArgHis: 0.0 ± 0.0
6.667ArgIle: 6.667 ± 1.431
4.286ArgLys: 4.286 ± 1.775
4.762ArgLeu: 4.762 ± 1.083
0.952ArgMet: 0.952 ± 0.865
2.381ArgAsn: 2.381 ± 1.527
2.857ArgPro: 2.857 ± 0.996
0.476ArgGln: 0.476 ± 0.464
1.429ArgArg: 1.429 ± 1.042
2.381ArgSer: 2.381 ± 1.22
2.857ArgThr: 2.857 ± 1.51
2.857ArgVal: 2.857 ± 0.904
0.952ArgTrp: 0.952 ± 0.655
1.429ArgTyr: 1.429 ± 0.625
0.0ArgXaa: 0.0 ± 0.0
Ser
5.714SerAla: 5.714 ± 1.71
1.429SerCys: 1.429 ± 0.648
3.333SerAsp: 3.333 ± 1.789
1.905SerGlu: 1.905 ± 1.148
5.714SerPhe: 5.714 ± 1.252
5.714SerGly: 5.714 ± 2.037
1.429SerHis: 1.429 ± 0.771
4.762SerIle: 4.762 ± 1.247
5.714SerLys: 5.714 ± 1.013
4.762SerLeu: 4.762 ± 1.243
3.333SerMet: 3.333 ± 1.369
1.905SerAsn: 1.905 ± 0.862
2.381SerPro: 2.381 ± 1.007
3.81SerGln: 3.81 ± 1.166
3.81SerArg: 3.81 ± 1.498
5.238SerSer: 5.238 ± 1.347
0.952SerThr: 0.952 ± 0.762
3.81SerVal: 3.81 ± 1.322
0.476SerTrp: 0.476 ± 0.464
2.857SerTyr: 2.857 ± 0.772
0.0SerXaa: 0.0 ± 0.0
Thr
3.81ThrAla: 3.81 ± 0.92
2.381ThrCys: 2.381 ± 0.975
3.333ThrAsp: 3.333 ± 1.338
2.381ThrGlu: 2.381 ± 0.86
1.905ThrPhe: 1.905 ± 0.776
5.714ThrGly: 5.714 ± 1.368
0.0ThrHis: 0.0 ± 0.0
2.381ThrIle: 2.381 ± 0.974
3.81ThrLys: 3.81 ± 1.532
1.905ThrLeu: 1.905 ± 0.901
0.476ThrMet: 0.476 ± 0.662
2.381ThrAsn: 2.381 ± 1.062
4.286ThrPro: 4.286 ± 1.587
2.857ThrGln: 2.857 ± 0.976
4.762ThrArg: 4.762 ± 1.18
2.381ThrSer: 2.381 ± 0.824
2.857ThrThr: 2.857 ± 1.161
4.286ThrVal: 4.286 ± 1.218
1.429ThrTrp: 1.429 ± 0.959
2.857ThrTyr: 2.857 ± 1.214
0.0ThrXaa: 0.0 ± 0.0
Val
4.286ValAla: 4.286 ± 1.49
1.905ValCys: 1.905 ± 0.685
3.333ValAsp: 3.333 ± 1.333
4.286ValGlu: 4.286 ± 1.569
4.286ValPhe: 4.286 ± 1.189
1.905ValGly: 1.905 ± 0.843
0.476ValHis: 0.476 ± 0.402
6.19ValIle: 6.19 ± 1.815
3.333ValLys: 3.333 ± 0.709
2.857ValLeu: 2.857 ± 0.911
0.476ValMet: 0.476 ± 0.381
3.333ValAsn: 3.333 ± 1.518
4.762ValPro: 4.762 ± 1.299
2.381ValGln: 2.381 ± 1.044
3.333ValArg: 3.333 ± 0.969
3.81ValSer: 3.81 ± 1.346
4.286ValThr: 4.286 ± 1.852
2.381ValVal: 2.381 ± 0.999
0.476ValTrp: 0.476 ± 0.717
2.381ValTyr: 2.381 ± 1.305
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.476TrpAsp: 0.476 ± 0.581
0.476TrpGlu: 0.476 ± 0.528
0.0TrpPhe: 0.0 ± 0.0
2.857TrpGly: 2.857 ± 1.118
0.476TrpHis: 0.476 ± 0.581
0.476TrpIle: 0.476 ± 0.381
0.952TrpLys: 0.952 ± 0.57
1.905TrpLeu: 1.905 ± 1.445
0.0TrpMet: 0.0 ± 0.0
0.952TrpAsn: 0.952 ± 0.483
0.952TrpPro: 0.952 ± 0.755
0.0TrpGln: 0.0 ± 0.0
0.952TrpArg: 0.952 ± 0.57
0.476TrpSer: 0.476 ± 0.464
0.476TrpThr: 0.476 ± 0.464
0.952TrpVal: 0.952 ± 0.679
0.952TrpTrp: 0.952 ± 0.638
0.476TrpTyr: 0.476 ± 0.464
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.857TyrAla: 2.857 ± 1.01
0.476TyrCys: 0.476 ± 0.402
2.381TyrAsp: 2.381 ± 1.216
2.381TyrGlu: 2.381 ± 0.695
3.333TyrPhe: 3.333 ± 1.123
2.857TyrGly: 2.857 ± 1.004
0.952TyrHis: 0.952 ± 0.804
2.857TyrIle: 2.857 ± 1.168
2.381TyrLys: 2.381 ± 1.327
3.81TyrLeu: 3.81 ± 1.513
0.0TyrMet: 0.0 ± 0.0
1.429TyrAsn: 1.429 ± 1.104
0.952TyrPro: 0.952 ± 0.762
2.381TyrGln: 2.381 ± 1.194
1.905TyrArg: 1.905 ± 0.843
0.0TyrSer: 0.0 ± 0.0
2.381TyrThr: 2.381 ± 1.244
2.381TyrVal: 2.381 ± 0.945
0.476TyrTrp: 0.476 ± 0.464
2.381TyrTyr: 2.381 ± 0.963
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (2101 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski