Amino acid dipepetide frequency for Vibrio phage VSK

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.925AlaAla: 3.925 ± 1.663
0.0AlaCys: 0.0 ± 0.0
2.453AlaAsp: 2.453 ± 0.839
3.435AlaGlu: 3.435 ± 0.92
4.907AlaPhe: 4.907 ± 1.313
1.963AlaGly: 1.963 ± 0.889
1.472AlaHis: 1.472 ± 0.714
6.379AlaIle: 6.379 ± 2.027
4.907AlaLys: 4.907 ± 1.524
9.814AlaLeu: 9.814 ± 2.388
0.491AlaMet: 0.491 ± 0.409
2.944AlaAsn: 2.944 ± 1.41
1.472AlaPro: 1.472 ± 0.63
4.416AlaGln: 4.416 ± 0.954
1.963AlaArg: 1.963 ± 0.922
2.453AlaSer: 2.453 ± 1.313
1.472AlaThr: 1.472 ± 0.643
6.869AlaVal: 6.869 ± 2.049
0.981AlaTrp: 0.981 ± 0.684
3.435AlaTyr: 3.435 ± 0.904
0.0AlaXaa: 0.0 ± 0.0
Cys
0.981CysAla: 0.981 ± 0.652
0.491CysCys: 0.491 ± 0.537
0.981CysAsp: 0.981 ± 0.646
0.981CysGlu: 0.981 ± 0.841
1.963CysPhe: 1.963 ± 0.721
1.963CysGly: 1.963 ± 0.904
0.491CysHis: 0.491 ± 0.504
0.981CysIle: 0.981 ± 0.662
1.472CysLys: 1.472 ± 0.849
0.491CysLeu: 0.491 ± 0.613
0.981CysMet: 0.981 ± 0.681
0.491CysAsn: 0.491 ± 0.409
0.981CysPro: 0.981 ± 0.819
0.491CysGln: 0.491 ± 0.421
1.472CysArg: 1.472 ± 1.027
0.981CysSer: 0.981 ± 0.566
0.981CysThr: 0.981 ± 0.524
1.472CysVal: 1.472 ± 0.908
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.925AspAla: 3.925 ± 1.197
0.0AspCys: 0.0 ± 0.0
4.416AspAsp: 4.416 ± 1.619
1.963AspGlu: 1.963 ± 1.131
1.472AspPhe: 1.472 ± 0.794
3.925AspGly: 3.925 ± 1.276
1.472AspHis: 1.472 ± 0.599
4.416AspIle: 4.416 ± 1.658
1.472AspLys: 1.472 ± 0.78
5.397AspLeu: 5.397 ± 1.706
0.981AspMet: 0.981 ± 0.683
1.472AspAsn: 1.472 ± 0.906
5.888AspPro: 5.888 ± 1.686
0.981AspGln: 0.981 ± 0.566
0.981AspArg: 0.981 ± 0.685
1.472AspSer: 1.472 ± 0.843
2.944AspThr: 2.944 ± 0.936
3.925AspVal: 3.925 ± 2.259
1.472AspTrp: 1.472 ± 0.776
2.453AspTyr: 2.453 ± 1.038
0.0AspXaa: 0.0 ± 0.0
Glu
5.397GluAla: 5.397 ± 1.261
2.944GluCys: 2.944 ± 0.924
1.472GluAsp: 1.472 ± 0.854
1.963GluGlu: 1.963 ± 0.724
2.944GluPhe: 2.944 ± 0.876
0.491GluGly: 0.491 ± 0.409
1.472GluHis: 1.472 ± 0.643
0.981GluIle: 0.981 ± 0.666
3.435GluLys: 3.435 ± 0.931
2.944GluLeu: 2.944 ± 1.414
0.981GluMet: 0.981 ± 0.679
0.981GluAsn: 0.981 ± 0.6
4.416GluPro: 4.416 ± 1.481
2.944GluGln: 2.944 ± 1.551
0.0GluArg: 0.0 ± 0.0
4.907GluSer: 4.907 ± 1.846
3.435GluThr: 3.435 ± 1.395
2.453GluVal: 2.453 ± 1.0
0.491GluTrp: 0.491 ± 0.496
1.472GluTyr: 1.472 ± 0.836
0.0GluXaa: 0.0 ± 0.0
Phe
2.944PheAla: 2.944 ± 1.086
0.981PheCys: 0.981 ± 0.555
2.944PheAsp: 2.944 ± 0.936
1.963PheGlu: 1.963 ± 0.73
0.981PhePhe: 0.981 ± 0.566
4.907PheGly: 4.907 ± 1.275
0.981PheHis: 0.981 ± 0.524
2.944PheIle: 2.944 ± 1.016
0.981PheLys: 0.981 ± 0.689
4.416PheLeu: 4.416 ± 1.692
2.453PheMet: 2.453 ± 1.199
2.944PheAsn: 2.944 ± 1.46
0.981PhePro: 0.981 ± 0.629
0.981PheGln: 0.981 ± 0.596
2.944PheArg: 2.944 ± 1.514
6.379PheSer: 6.379 ± 1.338
2.944PheThr: 2.944 ± 0.953
2.453PheVal: 2.453 ± 0.987
1.472PheTrp: 1.472 ± 0.629
1.963PheTyr: 1.963 ± 0.59
0.0PheXaa: 0.0 ± 0.0
Gly
3.925GlyAla: 3.925 ± 1.559
0.981GlyCys: 0.981 ± 0.632
3.435GlyAsp: 3.435 ± 0.963
2.453GlyGlu: 2.453 ± 0.96
3.435GlyPhe: 3.435 ± 1.357
3.925GlyGly: 3.925 ± 1.219
1.472GlyHis: 1.472 ± 0.832
5.397GlyIle: 5.397 ± 1.822
1.472GlyLys: 1.472 ± 0.796
6.869GlyLeu: 6.869 ± 1.998
1.963GlyMet: 1.963 ± 0.917
1.963GlyAsn: 1.963 ± 0.899
0.981GlyPro: 0.981 ± 0.819
2.453GlyGln: 2.453 ± 1.335
1.472GlyArg: 1.472 ± 0.958
6.379GlySer: 6.379 ± 1.305
1.963GlyThr: 1.963 ± 0.968
2.944GlyVal: 2.944 ± 1.256
0.491GlyTrp: 0.491 ± 0.421
3.925GlyTyr: 3.925 ± 1.248
0.0GlyXaa: 0.0 ± 0.0
His
2.453HisAla: 2.453 ± 0.881
0.0HisCys: 0.0 ± 0.0
0.491HisAsp: 0.491 ± 0.409
0.491HisGlu: 0.491 ± 0.409
0.491HisPhe: 0.491 ± 0.417
1.472HisGly: 1.472 ± 0.851
0.981HisHis: 0.981 ± 0.819
1.472HisIle: 1.472 ± 0.775
1.963HisLys: 1.963 ± 0.931
1.472HisLeu: 1.472 ± 0.894
0.981HisMet: 0.981 ± 0.557
0.491HisAsn: 0.491 ± 0.496
0.981HisPro: 0.981 ± 0.819
0.0HisGln: 0.0 ± 0.0
1.472HisArg: 1.472 ± 1.228
0.491HisSer: 0.491 ± 0.54
0.981HisThr: 0.981 ± 0.666
0.981HisVal: 0.981 ± 0.979
0.0HisTrp: 0.0 ± 0.0
0.981HisTyr: 0.981 ± 0.964
0.0HisXaa: 0.0 ± 0.0
Ile
4.416IleAla: 4.416 ± 0.926
1.963IleCys: 1.963 ± 0.726
6.869IleAsp: 6.869 ± 2.865
5.397IleGlu: 5.397 ± 1.572
2.453IlePhe: 2.453 ± 0.906
1.963IleGly: 1.963 ± 0.975
0.491IleHis: 0.491 ± 0.482
3.925IleIle: 3.925 ± 1.561
2.453IleLys: 2.453 ± 0.935
3.435IleLeu: 3.435 ± 1.361
1.963IleMet: 1.963 ± 1.004
4.416IleAsn: 4.416 ± 1.195
5.888IlePro: 5.888 ± 2.338
2.453IleGln: 2.453 ± 1.129
1.963IleArg: 1.963 ± 1.26
7.36IleSer: 7.36 ± 2.495
4.907IleThr: 4.907 ± 1.407
3.925IleVal: 3.925 ± 0.881
1.472IleTrp: 1.472 ± 0.598
2.944IleTyr: 2.944 ± 0.736
0.0IleXaa: 0.0 ± 0.0
Lys
4.907LysAla: 4.907 ± 1.193
1.472LysCys: 1.472 ± 0.894
2.453LysAsp: 2.453 ± 1.012
0.981LysGlu: 0.981 ± 0.555
1.472LysPhe: 1.472 ± 0.715
1.963LysGly: 1.963 ± 1.284
0.981LysHis: 0.981 ± 0.611
4.907LysIle: 4.907 ± 1.085
5.397LysLys: 5.397 ± 1.551
4.907LysLeu: 4.907 ± 1.621
2.944LysMet: 2.944 ± 1.398
1.472LysAsn: 1.472 ± 0.673
3.435LysPro: 3.435 ± 1.376
3.435LysGln: 3.435 ± 1.348
4.907LysArg: 4.907 ± 1.776
2.944LysSer: 2.944 ± 0.838
6.869LysThr: 6.869 ± 2.14
5.397LysVal: 5.397 ± 1.724
0.0LysTrp: 0.0 ± 0.0
1.472LysTyr: 1.472 ± 0.78
0.0LysXaa: 0.0 ± 0.0
Leu
5.888LeuAla: 5.888 ± 2.181
2.453LeuCys: 2.453 ± 0.974
2.944LeuAsp: 2.944 ± 1.042
3.435LeuGlu: 3.435 ± 1.193
2.944LeuPhe: 2.944 ± 0.856
7.36LeuGly: 7.36 ± 1.067
1.472LeuHis: 1.472 ± 0.589
8.832LeuIle: 8.832 ± 2.733
4.416LeuLys: 4.416 ± 1.445
9.323LeuLeu: 9.323 ± 3.351
3.925LeuMet: 3.925 ± 1.284
4.416LeuAsn: 4.416 ± 1.803
5.888LeuPro: 5.888 ± 1.953
2.453LeuGln: 2.453 ± 0.885
3.435LeuArg: 3.435 ± 1.55
8.832LeuSer: 8.832 ± 2.853
4.907LeuThr: 4.907 ± 1.541
4.907LeuVal: 4.907 ± 1.337
0.981LeuTrp: 0.981 ± 0.684
3.925LeuTyr: 3.925 ± 1.514
0.0LeuXaa: 0.0 ± 0.0
Met
2.453MetAla: 2.453 ± 1.115
0.0MetCys: 0.0 ± 0.0
0.981MetAsp: 0.981 ± 1.225
0.981MetGlu: 0.981 ± 0.819
1.472MetPhe: 1.472 ± 0.742
1.963MetGly: 1.963 ± 0.659
0.491MetHis: 0.491 ± 0.482
1.963MetIle: 1.963 ± 1.219
0.981MetLys: 0.981 ± 0.578
2.453MetLeu: 2.453 ± 1.007
0.491MetMet: 0.491 ± 0.417
1.963MetAsn: 1.963 ± 1.031
2.453MetPro: 2.453 ± 1.164
0.491MetGln: 0.491 ± 0.504
2.453MetArg: 2.453 ± 0.773
2.453MetSer: 2.453 ± 0.792
1.472MetThr: 1.472 ± 0.902
2.944MetVal: 2.944 ± 0.825
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.944AsnAla: 2.944 ± 1.156
0.491AsnCys: 0.491 ± 0.537
1.963AsnAsp: 1.963 ± 1.192
3.435AsnGlu: 3.435 ± 1.615
1.472AsnPhe: 1.472 ± 0.689
0.491AsnGly: 0.491 ± 0.504
0.491AsnHis: 0.491 ± 0.409
1.963AsnIle: 1.963 ± 1.072
5.888AsnLys: 5.888 ± 1.318
3.925AsnLeu: 3.925 ± 0.805
0.0AsnMet: 0.0 ± 0.0
2.453AsnAsn: 2.453 ± 0.76
3.435AsnPro: 3.435 ± 1.094
1.472AsnGln: 1.472 ± 0.52
2.453AsnArg: 2.453 ± 1.097
3.925AsnSer: 3.925 ± 1.4
3.925AsnThr: 3.925 ± 1.089
1.963AsnVal: 1.963 ± 1.161
0.981AsnTrp: 0.981 ± 0.754
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
0.981ProAla: 0.981 ± 0.557
0.981ProCys: 0.981 ± 0.566
4.416ProAsp: 4.416 ± 2.134
3.435ProGlu: 3.435 ± 1.78
4.416ProPhe: 4.416 ± 1.079
0.981ProGly: 0.981 ± 0.524
1.472ProHis: 1.472 ± 1.228
1.472ProIle: 1.472 ± 1.088
3.925ProLys: 3.925 ± 1.189
6.869ProLeu: 6.869 ± 2.559
1.472ProMet: 1.472 ± 0.746
2.453ProAsn: 2.453 ± 1.093
2.453ProPro: 2.453 ± 0.867
2.944ProGln: 2.944 ± 1.306
2.944ProArg: 2.944 ± 0.835
6.869ProSer: 6.869 ± 1.695
5.397ProThr: 5.397 ± 1.858
3.925ProVal: 3.925 ± 1.203
0.491ProTrp: 0.491 ± 0.421
0.491ProTyr: 0.491 ± 0.409
0.0ProXaa: 0.0 ± 0.0
Gln
1.963GlnAla: 1.963 ± 0.744
0.981GlnCys: 0.981 ± 0.684
2.453GlnAsp: 2.453 ± 0.651
0.981GlnGlu: 0.981 ± 0.566
1.472GlnPhe: 1.472 ± 0.797
2.453GlnGly: 2.453 ± 1.687
0.0GlnHis: 0.0 ± 0.0
1.963GlnIle: 1.963 ± 0.616
2.944GlnLys: 2.944 ± 1.475
3.435GlnLeu: 3.435 ± 1.212
0.491GlnMet: 0.491 ± 0.409
1.472GlnAsn: 1.472 ± 0.731
1.963GlnPro: 1.963 ± 0.82
1.472GlnGln: 1.472 ± 1.006
2.944GlnArg: 2.944 ± 0.839
4.416GlnSer: 4.416 ± 1.729
1.472GlnThr: 1.472 ± 0.908
2.944GlnVal: 2.944 ± 1.123
0.491GlnTrp: 0.491 ± 0.56
0.491GlnTyr: 0.491 ± 0.54
0.0GlnXaa: 0.0 ± 0.0
Arg
2.453ArgAla: 2.453 ± 1.414
0.0ArgCys: 0.0 ± 0.0
1.472ArgAsp: 1.472 ± 0.742
2.453ArgGlu: 2.453 ± 1.365
3.435ArgPhe: 3.435 ± 1.306
2.944ArgGly: 2.944 ± 1.265
0.981ArgHis: 0.981 ± 0.819
6.379ArgIle: 6.379 ± 1.308
2.453ArgLys: 2.453 ± 1.046
5.397ArgLeu: 5.397 ± 1.466
0.981ArgMet: 0.981 ± 0.626
3.435ArgAsn: 3.435 ± 1.659
4.907ArgPro: 4.907 ± 1.616
0.491ArgGln: 0.491 ± 0.409
2.453ArgArg: 2.453 ± 1.722
2.944ArgSer: 2.944 ± 1.168
5.397ArgThr: 5.397 ± 2.333
1.472ArgVal: 1.472 ± 0.629
0.491ArgTrp: 0.491 ± 0.489
0.491ArgTyr: 0.491 ± 0.496
0.0ArgXaa: 0.0 ± 0.0
Ser
6.379SerAla: 6.379 ± 2.048
0.491SerCys: 0.491 ± 0.421
4.907SerAsp: 4.907 ± 1.986
2.453SerGlu: 2.453 ± 1.435
4.907SerPhe: 4.907 ± 0.851
5.397SerGly: 5.397 ± 1.215
0.981SerHis: 0.981 ± 0.557
3.435SerIle: 3.435 ± 0.93
8.342SerLys: 8.342 ± 1.837
7.851SerLeu: 7.851 ± 3.298
4.416SerMet: 4.416 ± 1.352
2.944SerAsn: 2.944 ± 0.911
2.453SerPro: 2.453 ± 0.855
2.453SerGln: 2.453 ± 1.374
4.907SerArg: 4.907 ± 1.724
5.397SerSer: 5.397 ± 1.356
4.907SerThr: 4.907 ± 2.117
3.925SerVal: 3.925 ± 1.657
0.0SerTrp: 0.0 ± 0.0
3.435SerTyr: 3.435 ± 1.09
0.0SerXaa: 0.0 ± 0.0
Thr
3.925ThrAla: 3.925 ± 1.123
1.963ThrCys: 1.963 ± 0.807
2.453ThrAsp: 2.453 ± 1.392
1.963ThrGlu: 1.963 ± 0.824
1.963ThrPhe: 1.963 ± 0.835
5.888ThrGly: 5.888 ± 1.412
0.491ThrHis: 0.491 ± 0.537
3.435ThrIle: 3.435 ± 1.234
4.416ThrLys: 4.416 ± 0.958
6.379ThrLeu: 6.379 ± 1.23
1.963ThrMet: 1.963 ± 1.035
2.453ThrAsn: 2.453 ± 1.397
4.416ThrPro: 4.416 ± 1.532
2.944ThrGln: 2.944 ± 0.864
4.907ThrArg: 4.907 ± 1.085
2.944ThrSer: 2.944 ± 1.596
4.907ThrThr: 4.907 ± 2.867
3.435ThrVal: 3.435 ± 0.829
0.981ThrTrp: 0.981 ± 0.688
1.472ThrTyr: 1.472 ± 0.75
0.0ThrXaa: 0.0 ± 0.0
Val
2.944ValAla: 2.944 ± 0.915
1.963ValCys: 1.963 ± 0.785
2.944ValAsp: 2.944 ± 0.926
4.416ValGlu: 4.416 ± 1.737
3.925ValPhe: 3.925 ± 0.993
3.925ValGly: 3.925 ± 1.06
0.491ValHis: 0.491 ± 0.409
6.869ValIle: 6.869 ± 1.949
2.944ValLys: 2.944 ± 1.021
2.944ValLeu: 2.944 ± 1.136
0.491ValMet: 0.491 ± 0.421
3.435ValAsn: 3.435 ± 1.243
3.925ValPro: 3.925 ± 1.398
1.472ValGln: 1.472 ± 0.906
3.925ValArg: 3.925 ± 1.22
7.36ValSer: 7.36 ± 1.842
3.435ValThr: 3.435 ± 1.654
1.963ValVal: 1.963 ± 0.855
0.0ValTrp: 0.0 ± 0.0
1.963ValTyr: 1.963 ± 0.97
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.491TrpGlu: 0.491 ± 0.504
0.0TrpPhe: 0.0 ± 0.0
1.963TrpGly: 1.963 ± 0.974
0.491TrpHis: 0.491 ± 0.56
0.491TrpIle: 0.491 ± 0.417
0.491TrpLys: 0.491 ± 0.489
1.472TrpLeu: 1.472 ± 1.011
0.0TrpMet: 0.0 ± 0.0
0.491TrpAsn: 0.491 ± 0.421
0.981TrpPro: 0.981 ± 0.716
0.491TrpGln: 0.491 ± 0.496
0.981TrpArg: 0.981 ± 0.557
0.981TrpSer: 0.981 ± 0.566
0.0TrpThr: 0.0 ± 0.0
1.472TrpVal: 1.472 ± 1.232
0.0TrpTrp: 0.0 ± 0.0
0.491TrpTyr: 0.491 ± 0.409
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.944TyrAla: 2.944 ± 1.269
0.491TyrCys: 0.491 ± 0.489
0.981TyrAsp: 0.981 ± 0.808
2.453TyrGlu: 2.453 ± 0.911
3.435TyrPhe: 3.435 ± 1.099
1.963TyrGly: 1.963 ± 1.279
1.472TyrHis: 1.472 ± 0.622
2.453TyrIle: 2.453 ± 1.002
1.963TyrLys: 1.963 ± 1.503
2.944TyrLeu: 2.944 ± 1.426
0.0TyrMet: 0.0 ± 0.0
0.981TyrAsn: 0.981 ± 0.819
0.491TyrPro: 0.491 ± 0.417
1.963TyrGln: 1.963 ± 1.113
2.453TyrArg: 2.453 ± 0.954
0.981TyrSer: 0.981 ± 0.671
0.981TyrThr: 0.981 ± 0.646
1.963TyrVal: 1.963 ± 1.195
0.491TyrTrp: 0.491 ± 0.409
0.981TyrTyr: 0.981 ± 0.596
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 14 proteins (2039 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski