Amino acid dipepetide frequency for Vibrio phage PVA1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.715AlaAla: 7.715 ± 2.261
0.964AlaCys: 0.964 ± 0.465
5.625AlaAsp: 5.625 ± 0.763
5.625AlaGlu: 5.625 ± 0.845
2.25AlaPhe: 2.25 ± 0.527
6.107AlaGly: 6.107 ± 1.253
0.964AlaHis: 0.964 ± 0.501
4.982AlaIle: 4.982 ± 0.867
4.982AlaLys: 4.982 ± 1.155
6.59AlaLeu: 6.59 ± 1.415
2.893AlaMet: 2.893 ± 0.6
3.536AlaAsn: 3.536 ± 0.649
2.411AlaPro: 2.411 ± 0.746
4.822AlaGln: 4.822 ± 1.293
4.018AlaArg: 4.018 ± 1.152
6.268AlaSer: 6.268 ± 0.874
5.947AlaThr: 5.947 ± 1.06
6.59AlaVal: 6.59 ± 0.998
1.446AlaTrp: 1.446 ± 0.462
2.732AlaTyr: 2.732 ± 0.661
0.0AlaXaa: 0.0 ± 0.0
Cys
1.125CysAla: 1.125 ± 0.347
0.161CysCys: 0.161 ± 0.157
0.804CysAsp: 0.804 ± 0.441
1.125CysGlu: 1.125 ± 0.411
0.321CysPhe: 0.321 ± 0.239
1.125CysGly: 1.125 ± 0.465
0.321CysHis: 0.321 ± 0.301
0.482CysIle: 0.482 ± 0.36
0.482CysLys: 0.482 ± 0.271
0.482CysLeu: 0.482 ± 0.276
0.161CysMet: 0.161 ± 0.171
0.0CysAsn: 0.0 ± 0.0
0.482CysPro: 0.482 ± 0.265
0.161CysGln: 0.161 ± 0.157
0.161CysArg: 0.161 ± 0.169
0.482CysSer: 0.482 ± 0.283
0.804CysThr: 0.804 ± 0.28
0.804CysVal: 0.804 ± 0.321
0.0CysTrp: 0.0 ± 0.0
0.643CysTyr: 0.643 ± 0.273
0.0CysXaa: 0.0 ± 0.0
Asp
6.911AspAla: 6.911 ± 1.019
0.643AspCys: 0.643 ± 0.325
3.214AspAsp: 3.214 ± 0.551
5.786AspGlu: 5.786 ± 1.072
2.25AspPhe: 2.25 ± 0.621
5.464AspGly: 5.464 ± 1.462
0.482AspHis: 0.482 ± 0.31
3.375AspIle: 3.375 ± 0.812
4.018AspLys: 4.018 ± 1.101
6.107AspLeu: 6.107 ± 0.693
2.25AspMet: 2.25 ± 0.525
3.857AspAsn: 3.857 ± 0.681
2.893AspPro: 2.893 ± 0.714
3.214AspGln: 3.214 ± 1.24
3.536AspArg: 3.536 ± 0.647
4.339AspSer: 4.339 ± 0.616
3.214AspThr: 3.214 ± 1.036
4.339AspVal: 4.339 ± 0.878
1.125AspTrp: 1.125 ± 0.552
2.411AspTyr: 2.411 ± 0.716
0.0AspXaa: 0.0 ± 0.0
Glu
5.304GluAla: 5.304 ± 1.065
0.643GluCys: 0.643 ± 0.39
5.143GluAsp: 5.143 ± 0.956
4.982GluGlu: 4.982 ± 1.149
2.25GluPhe: 2.25 ± 0.704
4.822GluGly: 4.822 ± 0.821
0.482GluHis: 0.482 ± 0.385
5.143GluIle: 5.143 ± 0.696
5.304GluLys: 5.304 ± 1.162
7.875GluLeu: 7.875 ± 0.948
2.893GluMet: 2.893 ± 0.829
3.214GluAsn: 3.214 ± 0.77
1.446GluPro: 1.446 ± 0.722
3.054GluGln: 3.054 ± 0.623
2.893GluArg: 2.893 ± 0.674
4.661GluSer: 4.661 ± 1.068
3.054GluThr: 3.054 ± 0.647
3.857GluVal: 3.857 ± 0.536
1.446GluTrp: 1.446 ± 0.578
3.214GluTyr: 3.214 ± 0.742
0.0GluXaa: 0.0 ± 0.0
Phe
3.536PheAla: 3.536 ± 0.518
0.804PheCys: 0.804 ± 0.294
3.054PheAsp: 3.054 ± 0.671
2.411PheGlu: 2.411 ± 0.533
1.286PhePhe: 1.286 ± 0.37
2.732PheGly: 2.732 ± 0.577
0.321PheHis: 0.321 ± 0.228
1.125PheIle: 1.125 ± 0.616
2.572PheLys: 2.572 ± 0.531
2.732PheLeu: 2.732 ± 0.671
0.804PheMet: 0.804 ± 0.313
2.572PheAsn: 2.572 ± 0.743
0.804PhePro: 0.804 ± 0.401
0.964PheGln: 0.964 ± 0.391
1.286PheArg: 1.286 ± 0.378
1.286PheSer: 1.286 ± 0.307
2.411PheThr: 2.411 ± 0.779
1.286PheVal: 1.286 ± 0.408
0.804PheTrp: 0.804 ± 0.326
0.964PheTyr: 0.964 ± 0.583
0.0PheXaa: 0.0 ± 0.0
Gly
6.107GlyAla: 6.107 ± 1.043
0.161GlyCys: 0.161 ± 0.151
3.857GlyAsp: 3.857 ± 0.862
5.786GlyGlu: 5.786 ± 0.801
2.732GlyPhe: 2.732 ± 0.707
5.143GlyGly: 5.143 ± 1.136
1.286GlyHis: 1.286 ± 0.406
3.536GlyIle: 3.536 ± 0.759
4.5GlyLys: 4.5 ± 1.084
6.59GlyLeu: 6.59 ± 1.478
1.125GlyMet: 1.125 ± 0.422
3.054GlyAsn: 3.054 ± 0.643
1.607GlyPro: 1.607 ± 0.685
3.697GlyGln: 3.697 ± 1.181
3.697GlyArg: 3.697 ± 1.163
5.625GlySer: 5.625 ± 1.335
3.054GlyThr: 3.054 ± 0.658
4.339GlyVal: 4.339 ± 0.752
0.482GlyTrp: 0.482 ± 0.284
2.893GlyTyr: 2.893 ± 0.732
0.0GlyXaa: 0.0 ± 0.0
His
0.964HisAla: 0.964 ± 0.271
0.321HisCys: 0.321 ± 0.241
0.804HisAsp: 0.804 ± 0.396
2.25HisGlu: 2.25 ± 0.545
0.321HisPhe: 0.321 ± 0.234
0.482HisGly: 0.482 ± 0.32
0.0HisHis: 0.0 ± 0.0
0.482HisIle: 0.482 ± 0.349
0.804HisLys: 0.804 ± 0.35
0.964HisLeu: 0.964 ± 0.285
0.161HisMet: 0.161 ± 0.185
0.804HisAsn: 0.804 ± 0.261
0.321HisPro: 0.321 ± 0.219
0.964HisGln: 0.964 ± 0.347
0.482HisArg: 0.482 ± 0.385
0.964HisSer: 0.964 ± 0.442
0.643HisThr: 0.643 ± 0.368
0.964HisVal: 0.964 ± 0.367
0.161HisTrp: 0.161 ± 0.169
1.125HisTyr: 1.125 ± 0.391
0.0HisXaa: 0.0 ± 0.0
Ile
6.107IleAla: 6.107 ± 0.953
0.482IleCys: 0.482 ± 0.28
3.857IleAsp: 3.857 ± 0.941
5.143IleGlu: 5.143 ± 1.034
1.125IlePhe: 1.125 ± 0.5
3.214IleGly: 3.214 ± 0.591
0.321IleHis: 0.321 ± 0.232
3.054IleIle: 3.054 ± 0.471
5.143IleLys: 5.143 ± 0.46
1.125IleLeu: 1.125 ± 0.341
0.964IleMet: 0.964 ± 0.42
3.214IleAsn: 3.214 ± 0.864
2.572IlePro: 2.572 ± 0.819
2.411IleGln: 2.411 ± 0.625
3.214IleArg: 3.214 ± 0.663
3.054IleSer: 3.054 ± 0.936
3.536IleThr: 3.536 ± 0.687
2.572IleVal: 2.572 ± 0.649
0.321IleTrp: 0.321 ± 0.26
2.732IleTyr: 2.732 ± 0.598
0.0IleXaa: 0.0 ± 0.0
Lys
5.143LysAla: 5.143 ± 1.216
0.482LysCys: 0.482 ± 0.264
4.661LysAsp: 4.661 ± 1.203
4.822LysGlu: 4.822 ± 0.837
2.893LysPhe: 2.893 ± 0.604
4.339LysGly: 4.339 ± 0.981
0.804LysHis: 0.804 ± 0.302
2.732LysIle: 2.732 ± 0.893
5.625LysLys: 5.625 ± 1.315
7.072LysLeu: 7.072 ± 1.139
2.572LysMet: 2.572 ± 0.647
3.375LysAsn: 3.375 ± 0.919
2.089LysPro: 2.089 ± 0.67
4.339LysGln: 4.339 ± 1.021
3.536LysArg: 3.536 ± 0.827
5.304LysSer: 5.304 ± 0.961
3.214LysThr: 3.214 ± 0.761
4.339LysVal: 4.339 ± 0.726
1.125LysTrp: 1.125 ± 0.407
2.411LysTyr: 2.411 ± 0.682
0.0LysXaa: 0.0 ± 0.0
Leu
5.143LeuAla: 5.143 ± 0.824
0.964LeuCys: 0.964 ± 0.301
5.625LeuAsp: 5.625 ± 0.892
4.661LeuGlu: 4.661 ± 0.866
1.607LeuPhe: 1.607 ± 0.458
6.107LeuGly: 6.107 ± 0.936
1.446LeuHis: 1.446 ± 0.563
2.732LeuIle: 2.732 ± 0.801
6.429LeuLys: 6.429 ± 0.917
3.857LeuLeu: 3.857 ± 0.629
2.572LeuMet: 2.572 ± 0.763
4.5LeuAsn: 4.5 ± 0.692
4.5LeuPro: 4.5 ± 0.885
2.411LeuGln: 2.411 ± 0.55
3.536LeuArg: 3.536 ± 1.044
7.715LeuSer: 7.715 ± 1.526
5.786LeuThr: 5.786 ± 1.009
3.054LeuVal: 3.054 ± 0.897
0.482LeuTrp: 0.482 ± 0.247
1.929LeuTyr: 1.929 ± 0.67
0.0LeuXaa: 0.0 ± 0.0
Met
2.893MetAla: 2.893 ± 0.871
0.964MetCys: 0.964 ± 0.529
2.089MetAsp: 2.089 ± 0.546
1.446MetGlu: 1.446 ± 0.744
0.643MetPhe: 0.643 ± 0.49
1.446MetGly: 1.446 ± 0.551
0.964MetHis: 0.964 ± 0.42
1.929MetIle: 1.929 ± 0.408
1.286MetLys: 1.286 ± 0.455
2.25MetLeu: 2.25 ± 0.598
1.446MetMet: 1.446 ± 0.694
1.768MetAsn: 1.768 ± 0.565
0.643MetPro: 0.643 ± 0.357
2.089MetGln: 2.089 ± 0.736
1.286MetArg: 1.286 ± 0.435
1.929MetSer: 1.929 ± 0.588
2.25MetThr: 2.25 ± 0.616
1.607MetVal: 1.607 ± 0.518
0.321MetTrp: 0.321 ± 0.219
0.643MetTyr: 0.643 ± 0.312
0.0MetXaa: 0.0 ± 0.0
Asn
5.625AsnAla: 5.625 ± 0.631
0.482AsnCys: 0.482 ± 0.267
4.018AsnAsp: 4.018 ± 0.945
2.572AsnGlu: 2.572 ± 0.63
1.929AsnPhe: 1.929 ± 0.604
4.982AsnGly: 4.982 ± 0.855
0.964AsnHis: 0.964 ± 0.414
2.893AsnIle: 2.893 ± 0.687
3.536AsnLys: 3.536 ± 1.057
5.304AsnLeu: 5.304 ± 0.936
1.607AsnMet: 1.607 ± 0.464
1.929AsnAsn: 1.929 ± 0.463
4.179AsnPro: 4.179 ± 0.753
1.929AsnGln: 1.929 ± 0.735
1.929AsnArg: 1.929 ± 0.667
2.089AsnSer: 2.089 ± 0.789
2.572AsnThr: 2.572 ± 0.596
2.572AsnVal: 2.572 ± 0.734
0.964AsnTrp: 0.964 ± 0.388
2.25AsnTyr: 2.25 ± 0.667
0.0AsnXaa: 0.0 ± 0.0
Pro
2.572ProAla: 2.572 ± 0.452
0.161ProCys: 0.161 ± 0.171
3.214ProAsp: 3.214 ± 0.792
4.018ProGlu: 4.018 ± 1.045
1.607ProPhe: 1.607 ± 0.431
1.446ProGly: 1.446 ± 0.382
0.482ProHis: 0.482 ± 0.272
1.286ProIle: 1.286 ± 0.341
2.572ProLys: 2.572 ± 0.723
2.25ProLeu: 2.25 ± 0.538
0.964ProMet: 0.964 ± 0.383
2.411ProAsn: 2.411 ± 0.694
0.482ProPro: 0.482 ± 0.246
1.125ProGln: 1.125 ± 0.61
1.446ProArg: 1.446 ± 0.386
2.893ProSer: 2.893 ± 0.971
2.411ProThr: 2.411 ± 0.561
2.732ProVal: 2.732 ± 0.835
0.482ProTrp: 0.482 ± 0.275
1.125ProTyr: 1.125 ± 0.463
0.0ProXaa: 0.0 ± 0.0
Gln
4.5GlnAla: 4.5 ± 0.93
0.804GlnCys: 0.804 ± 0.295
1.768GlnAsp: 1.768 ± 0.535
3.214GlnGlu: 3.214 ± 0.922
2.089GlnPhe: 2.089 ± 0.553
2.572GlnGly: 2.572 ± 0.844
0.321GlnHis: 0.321 ± 0.239
1.768GlnIle: 1.768 ± 0.401
3.214GlnLys: 3.214 ± 0.955
3.536GlnLeu: 3.536 ± 0.784
0.964GlnMet: 0.964 ± 0.506
3.375GlnAsn: 3.375 ± 0.9
2.25GlnPro: 2.25 ± 0.75
2.089GlnGln: 2.089 ± 0.764
2.732GlnArg: 2.732 ± 0.646
3.054GlnSer: 3.054 ± 0.63
1.929GlnThr: 1.929 ± 0.539
3.375GlnVal: 3.375 ± 0.89
0.482GlnTrp: 0.482 ± 0.307
2.411GlnTyr: 2.411 ± 0.679
0.0GlnXaa: 0.0 ± 0.0
Arg
3.697ArgAla: 3.697 ± 0.892
0.643ArgCys: 0.643 ± 0.433
3.697ArgAsp: 3.697 ± 0.837
4.018ArgGlu: 4.018 ± 0.626
1.929ArgPhe: 1.929 ± 0.558
1.607ArgGly: 1.607 ± 0.606
0.482ArgHis: 0.482 ± 0.309
1.929ArgIle: 1.929 ± 0.595
4.179ArgLys: 4.179 ± 0.881
4.018ArgLeu: 4.018 ± 0.812
0.964ArgMet: 0.964 ± 0.345
3.375ArgAsn: 3.375 ± 0.5
0.804ArgPro: 0.804 ± 0.297
2.411ArgGln: 2.411 ± 0.673
1.607ArgArg: 1.607 ± 0.493
3.536ArgSer: 3.536 ± 0.711
1.607ArgThr: 1.607 ± 0.485
2.732ArgVal: 2.732 ± 0.604
0.482ArgTrp: 0.482 ± 0.246
1.446ArgTyr: 1.446 ± 0.491
0.0ArgXaa: 0.0 ± 0.0
Ser
7.232SerAla: 7.232 ± 1.541
0.161SerCys: 0.161 ± 0.197
4.822SerAsp: 4.822 ± 0.774
3.375SerGlu: 3.375 ± 0.653
2.893SerPhe: 2.893 ± 0.854
6.107SerGly: 6.107 ± 0.705
0.804SerHis: 0.804 ± 0.416
3.697SerIle: 3.697 ± 0.649
4.179SerLys: 4.179 ± 1.218
3.214SerLeu: 3.214 ± 0.7
1.607SerMet: 1.607 ± 0.533
4.661SerAsn: 4.661 ± 0.768
2.732SerPro: 2.732 ± 0.633
3.375SerGln: 3.375 ± 0.638
3.214SerArg: 3.214 ± 0.436
4.5SerSer: 4.5 ± 0.959
2.732SerThr: 2.732 ± 0.56
4.179SerVal: 4.179 ± 0.619
1.125SerTrp: 1.125 ± 0.354
2.893SerTyr: 2.893 ± 0.668
0.0SerXaa: 0.0 ± 0.0
Thr
3.054ThrAla: 3.054 ± 0.65
0.161ThrCys: 0.161 ± 0.162
4.982ThrAsp: 4.982 ± 1.013
2.572ThrGlu: 2.572 ± 0.731
2.732ThrPhe: 2.732 ± 0.874
5.304ThrGly: 5.304 ± 0.763
0.804ThrHis: 0.804 ± 0.49
4.5ThrIle: 4.5 ± 0.876
4.179ThrLys: 4.179 ± 0.94
3.857ThrLeu: 3.857 ± 0.888
1.768ThrMet: 1.768 ± 0.376
2.25ThrAsn: 2.25 ± 0.516
2.893ThrPro: 2.893 ± 0.649
2.893ThrGln: 2.893 ± 0.657
1.768ThrArg: 1.768 ± 0.506
3.214ThrSer: 3.214 ± 0.793
4.5ThrThr: 4.5 ± 0.786
4.018ThrVal: 4.018 ± 1.074
0.643ThrTrp: 0.643 ± 0.278
1.607ThrTyr: 1.607 ± 0.527
0.0ThrXaa: 0.0 ± 0.0
Val
4.339ValAla: 4.339 ± 0.885
0.964ValCys: 0.964 ± 0.344
4.822ValAsp: 4.822 ± 1.235
4.018ValGlu: 4.018 ± 0.874
1.446ValPhe: 1.446 ± 0.538
3.054ValGly: 3.054 ± 0.572
1.607ValHis: 1.607 ± 0.771
4.5ValIle: 4.5 ± 0.492
3.857ValLys: 3.857 ± 0.902
2.893ValLeu: 2.893 ± 0.442
2.089ValMet: 2.089 ± 0.686
4.339ValAsn: 4.339 ± 0.559
1.768ValPro: 1.768 ± 0.41
2.25ValGln: 2.25 ± 0.595
2.572ValArg: 2.572 ± 0.594
3.857ValSer: 3.857 ± 0.652
5.464ValThr: 5.464 ± 1.329
4.018ValVal: 4.018 ± 1.253
0.321ValTrp: 0.321 ± 0.208
1.125ValTyr: 1.125 ± 0.399
0.0ValXaa: 0.0 ± 0.0
Trp
1.446TrpAla: 1.446 ± 0.396
0.0TrpCys: 0.0 ± 0.0
0.804TrpAsp: 0.804 ± 0.406
0.804TrpGlu: 0.804 ± 0.346
0.482TrpPhe: 0.482 ± 0.508
0.321TrpGly: 0.321 ± 0.207
0.482TrpHis: 0.482 ± 0.29
1.286TrpIle: 1.286 ± 0.409
0.804TrpLys: 0.804 ± 0.38
0.804TrpLeu: 0.804 ± 0.323
0.643TrpMet: 0.643 ± 0.319
0.321TrpAsn: 0.321 ± 0.276
0.321TrpPro: 0.321 ± 0.219
0.482TrpGln: 0.482 ± 0.255
0.482TrpArg: 0.482 ± 0.215
1.125TrpSer: 1.125 ± 0.383
0.643TrpThr: 0.643 ± 0.306
0.804TrpVal: 0.804 ± 0.357
0.161TrpTrp: 0.161 ± 0.155
0.482TrpTyr: 0.482 ± 0.278
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.054TyrAla: 3.054 ± 0.629
0.161TyrCys: 0.161 ± 0.148
2.572TyrAsp: 2.572 ± 0.644
3.214TyrGlu: 3.214 ± 0.692
0.964TyrPhe: 0.964 ± 0.443
2.893TyrGly: 2.893 ± 0.774
0.643TyrHis: 0.643 ± 0.344
2.732TyrIle: 2.732 ± 0.761
3.214TyrLys: 3.214 ± 0.578
3.536TyrLeu: 3.536 ± 0.674
1.286TyrMet: 1.286 ± 0.464
1.768TyrAsn: 1.768 ± 0.555
0.482TyrPro: 0.482 ± 0.241
1.768TyrGln: 1.768 ± 0.535
1.768TyrArg: 1.768 ± 0.402
1.607TyrSer: 1.607 ± 0.563
1.929TyrThr: 1.929 ± 0.658
1.125TyrVal: 1.125 ± 0.423
0.321TyrTrp: 0.321 ± 0.208
1.446TyrTyr: 1.446 ± 0.483
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 21 proteins (6223 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski