Amino acid dipepetide frequency for Vibrio phage VALG_phi6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.693AlaAla: 3.693 ± 1.447
2.052AlaCys: 2.052 ± 0.99
2.052AlaAsp: 2.052 ± 0.88
2.872AlaGlu: 2.872 ± 0.733
4.103AlaPhe: 4.103 ± 1.259
2.462AlaGly: 2.462 ± 1.01
1.231AlaHis: 1.231 ± 0.722
2.872AlaIle: 2.872 ± 1.495
3.283AlaLys: 3.283 ± 0.778
9.438AlaLeu: 9.438 ± 1.631
1.641AlaMet: 1.641 ± 0.765
3.283AlaAsn: 3.283 ± 0.924
1.641AlaPro: 1.641 ± 0.977
3.283AlaGln: 3.283 ± 1.194
3.693AlaArg: 3.693 ± 1.215
6.155AlaSer: 6.155 ± 1.493
4.103AlaThr: 4.103 ± 1.276
5.745AlaVal: 5.745 ± 1.865
0.821AlaTrp: 0.821 ± 0.402
2.052AlaTyr: 2.052 ± 1.377
0.0AlaXaa: 0.0 ± 0.0
Cys
2.052CysAla: 2.052 ± 0.875
0.0CysCys: 0.0 ± 0.0
0.821CysAsp: 0.821 ± 0.569
2.462CysGlu: 2.462 ± 1.4
0.821CysPhe: 0.821 ± 0.685
1.641CysGly: 1.641 ± 0.614
0.41CysHis: 0.41 ± 0.342
0.821CysIle: 0.821 ± 0.507
1.641CysLys: 1.641 ± 1.001
1.231CysLeu: 1.231 ± 0.519
0.821CysMet: 0.821 ± 0.468
0.821CysAsn: 0.821 ± 0.46
0.41CysPro: 0.41 ± 0.356
1.641CysGln: 1.641 ± 0.812
0.41CysArg: 0.41 ± 0.342
0.821CysSer: 0.821 ± 0.436
0.821CysThr: 0.821 ± 0.436
0.41CysVal: 0.41 ± 0.555
0.41CysTrp: 0.41 ± 0.342
0.821CysTyr: 0.821 ± 0.471
0.0CysXaa: 0.0 ± 0.0
Asp
2.872AspAla: 2.872 ± 0.995
2.462AspCys: 2.462 ± 1.041
4.514AspAsp: 4.514 ± 1.612
4.514AspGlu: 4.514 ± 1.211
2.462AspPhe: 2.462 ± 0.746
4.103AspGly: 4.103 ± 1.736
1.231AspHis: 1.231 ± 0.832
5.745AspIle: 5.745 ± 0.944
4.103AspLys: 4.103 ± 1.676
5.334AspLeu: 5.334 ± 1.278
0.41AspMet: 0.41 ± 0.342
2.052AspAsn: 2.052 ± 0.676
2.872AspPro: 2.872 ± 1.761
0.821AspGln: 0.821 ± 0.942
1.231AspArg: 1.231 ± 0.861
3.283AspSer: 3.283 ± 1.286
3.693AspThr: 3.693 ± 0.93
4.924AspVal: 4.924 ± 1.012
0.821AspTrp: 0.821 ± 0.547
2.872AspTyr: 2.872 ± 1.207
0.0AspXaa: 0.0 ± 0.0
Glu
2.462GluAla: 2.462 ± 0.99
1.231GluCys: 1.231 ± 0.971
4.924GluAsp: 4.924 ± 1.774
2.052GluGlu: 2.052 ± 0.961
2.462GluPhe: 2.462 ± 0.818
2.872GluGly: 2.872 ± 1.317
0.821GluHis: 0.821 ± 0.684
4.924GluIle: 4.924 ± 1.532
2.462GluLys: 2.462 ± 1.003
8.617GluLeu: 8.617 ± 1.825
0.41GluMet: 0.41 ± 0.356
2.052GluAsn: 2.052 ± 0.711
2.462GluPro: 2.462 ± 1.064
4.924GluGln: 4.924 ± 1.217
2.462GluArg: 2.462 ± 0.745
4.514GluSer: 4.514 ± 1.135
2.052GluThr: 2.052 ± 0.771
2.462GluVal: 2.462 ± 0.914
1.231GluTrp: 1.231 ± 0.81
5.745GluTyr: 5.745 ± 2.476
0.0GluXaa: 0.0 ± 0.0
Phe
4.103PheAla: 4.103 ± 1.223
1.231PheCys: 1.231 ± 0.471
4.514PheAsp: 4.514 ± 1.163
2.872PheGlu: 2.872 ± 1.14
1.641PhePhe: 1.641 ± 1.036
2.872PheGly: 2.872 ± 1.333
1.231PheHis: 1.231 ± 0.471
1.231PheIle: 1.231 ± 0.52
1.641PheLys: 1.641 ± 0.574
5.745PheLeu: 5.745 ± 1.837
0.41PheMet: 0.41 ± 0.342
4.924PheAsn: 4.924 ± 1.112
2.052PhePro: 2.052 ± 0.771
1.641PheGln: 1.641 ± 0.823
2.052PheArg: 2.052 ± 0.911
1.641PheSer: 1.641 ± 1.368
2.872PheThr: 2.872 ± 1.139
2.462PheVal: 2.462 ± 1.136
0.821PheTrp: 0.821 ± 0.431
1.231PheTyr: 1.231 ± 0.566
0.0PheXaa: 0.0 ± 0.0
Gly
5.334GlyAla: 5.334 ± 1.261
1.641GlyCys: 1.641 ± 0.718
4.514GlyAsp: 4.514 ± 2.168
4.103GlyGlu: 4.103 ± 1.594
4.514GlyPhe: 4.514 ± 1.536
2.462GlyGly: 2.462 ± 1.039
0.821GlyHis: 0.821 ± 0.618
2.872GlyIle: 2.872 ± 0.824
3.693GlyLys: 3.693 ± 1.299
9.027GlyLeu: 9.027 ± 1.725
2.052GlyMet: 2.052 ± 1.131
1.641GlyAsn: 1.641 ± 0.592
0.821GlyPro: 0.821 ± 0.471
2.872GlyGln: 2.872 ± 1.052
2.872GlyArg: 2.872 ± 0.796
4.924GlySer: 4.924 ± 1.783
3.693GlyThr: 3.693 ± 1.75
5.745GlyVal: 5.745 ± 1.37
0.0GlyTrp: 0.0 ± 0.0
2.052GlyTyr: 2.052 ± 0.926
0.0GlyXaa: 0.0 ± 0.0
His
1.641HisAla: 1.641 ± 0.883
1.231HisCys: 1.231 ± 0.747
3.283HisAsp: 3.283 ± 0.771
2.462HisGlu: 2.462 ± 1.067
1.231HisPhe: 1.231 ± 0.55
1.231HisGly: 1.231 ± 0.813
0.0HisHis: 0.0 ± 0.0
1.231HisIle: 1.231 ± 0.747
2.462HisLys: 2.462 ± 0.972
3.283HisLeu: 3.283 ± 1.618
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.41HisPro: 0.41 ± 0.356
0.821HisGln: 0.821 ± 0.535
0.821HisArg: 0.821 ± 0.436
0.41HisSer: 0.41 ± 0.342
0.0HisThr: 0.0 ± 0.0
0.821HisVal: 0.821 ± 0.402
1.231HisTrp: 1.231 ± 0.635
0.41HisTyr: 0.41 ± 0.342
0.0HisXaa: 0.0 ± 0.0
Ile
4.103IleAla: 4.103 ± 2.077
2.462IleCys: 2.462 ± 0.892
3.283IleAsp: 3.283 ± 1.027
4.514IleGlu: 4.514 ± 0.909
4.514IlePhe: 4.514 ± 1.115
5.745IleGly: 5.745 ± 1.058
2.052IleHis: 2.052 ± 0.93
2.872IleIle: 2.872 ± 1.033
4.514IleLys: 4.514 ± 1.575
5.334IleLeu: 5.334 ± 1.595
0.41IleMet: 0.41 ± 0.415
4.103IleAsn: 4.103 ± 1.303
2.052IlePro: 2.052 ± 0.751
2.052IleGln: 2.052 ± 1.118
0.821IleArg: 0.821 ± 0.436
2.872IleSer: 2.872 ± 1.204
4.103IleThr: 4.103 ± 1.353
2.872IleVal: 2.872 ± 1.222
0.0IleTrp: 0.0 ± 0.0
4.924IleTyr: 4.924 ± 1.22
0.0IleXaa: 0.0 ± 0.0
Lys
4.103LysAla: 4.103 ± 1.078
0.821LysCys: 0.821 ± 0.431
3.283LysAsp: 3.283 ± 1.027
2.872LysGlu: 2.872 ± 1.205
4.103LysPhe: 4.103 ± 1.181
6.155LysGly: 6.155 ± 2.485
1.641LysHis: 1.641 ± 0.828
4.103LysIle: 4.103 ± 2.125
4.514LysLys: 4.514 ± 1.549
4.514LysLeu: 4.514 ± 0.953
3.283LysMet: 3.283 ± 0.934
3.283LysAsn: 3.283 ± 1.214
2.462LysPro: 2.462 ± 1.005
2.462LysGln: 2.462 ± 1.0
2.872LysArg: 2.872 ± 0.927
2.872LysSer: 2.872 ± 0.719
2.872LysThr: 2.872 ± 1.423
2.462LysVal: 2.462 ± 1.361
0.41LysTrp: 0.41 ± 0.342
1.231LysTyr: 1.231 ± 0.66
0.0LysXaa: 0.0 ± 0.0
Leu
5.334LeuAla: 5.334 ± 1.759
2.052LeuCys: 2.052 ± 1.184
4.924LeuAsp: 4.924 ± 1.428
6.565LeuGlu: 6.565 ± 1.753
4.924LeuPhe: 4.924 ± 1.48
7.386LeuGly: 7.386 ± 1.599
2.872LeuHis: 2.872 ± 0.986
8.207LeuIle: 8.207 ± 1.823
6.976LeuLys: 6.976 ± 1.452
9.848LeuLeu: 9.848 ± 2.493
2.462LeuMet: 2.462 ± 1.24
6.155LeuAsn: 6.155 ± 1.26
3.283LeuPro: 3.283 ± 0.963
2.462LeuGln: 2.462 ± 0.77
3.283LeuArg: 3.283 ± 1.114
5.745LeuSer: 5.745 ± 1.484
5.334LeuThr: 5.334 ± 1.564
9.027LeuVal: 9.027 ± 1.951
1.641LeuTrp: 1.641 ± 0.532
2.052LeuTyr: 2.052 ± 1.32
0.0LeuXaa: 0.0 ± 0.0
Met
2.872MetAla: 2.872 ± 1.204
0.0MetCys: 0.0 ± 0.0
1.641MetAsp: 1.641 ± 0.72
1.641MetGlu: 1.641 ± 0.768
0.821MetPhe: 0.821 ± 0.83
0.821MetGly: 0.821 ± 0.467
0.0MetHis: 0.0 ± 0.0
1.231MetIle: 1.231 ± 0.624
1.231MetLys: 1.231 ± 0.644
0.41MetLeu: 0.41 ± 0.434
0.0MetMet: 0.0 ± 0.0
2.462MetAsn: 2.462 ± 1.121
0.0MetPro: 0.0 ± 0.0
0.821MetGln: 0.821 ± 0.557
1.641MetArg: 1.641 ± 1.117
2.462MetSer: 2.462 ± 1.495
1.641MetThr: 1.641 ± 0.775
2.462MetVal: 2.462 ± 1.212
0.41MetTrp: 0.41 ± 0.468
0.41MetTyr: 0.41 ± 0.434
0.0MetXaa: 0.0 ± 0.0
Asn
3.283AsnAla: 3.283 ± 1.359
0.0AsnCys: 0.0 ± 0.0
2.462AsnAsp: 2.462 ± 0.882
3.283AsnGlu: 3.283 ± 1.234
1.641AsnPhe: 1.641 ± 0.716
2.462AsnGly: 2.462 ± 0.756
0.821AsnHis: 0.821 ± 0.83
4.103AsnIle: 4.103 ± 0.922
4.924AsnLys: 4.924 ± 1.063
2.052AsnLeu: 2.052 ± 0.691
1.641AsnMet: 1.641 ± 1.019
3.693AsnAsn: 3.693 ± 1.097
3.283AsnPro: 3.283 ± 1.156
2.052AsnGln: 2.052 ± 0.762
1.641AsnArg: 1.641 ± 0.54
3.283AsnSer: 3.283 ± 1.176
3.283AsnThr: 3.283 ± 1.837
3.693AsnVal: 3.693 ± 0.984
0.821AsnTrp: 0.821 ± 0.518
1.231AsnTyr: 1.231 ± 0.722
0.0AsnXaa: 0.0 ± 0.0
Pro
3.283ProAla: 3.283 ± 1.187
0.0ProCys: 0.0 ± 0.0
5.745ProAsp: 5.745 ± 3.043
2.872ProGlu: 2.872 ± 1.052
0.41ProPhe: 0.41 ± 0.342
1.641ProGly: 1.641 ± 0.635
0.41ProHis: 0.41 ± 0.342
2.052ProIle: 2.052 ± 0.592
1.641ProLys: 1.641 ± 0.662
3.693ProLeu: 3.693 ± 1.581
0.41ProMet: 0.41 ± 0.415
0.0ProAsn: 0.0 ± 0.0
1.231ProPro: 1.231 ± 0.578
1.231ProGln: 1.231 ± 0.945
2.052ProArg: 2.052 ± 0.719
2.052ProSer: 2.052 ± 0.639
3.283ProThr: 3.283 ± 1.024
3.283ProVal: 3.283 ± 1.285
0.0ProTrp: 0.0 ± 0.0
0.821ProTyr: 0.821 ± 0.467
0.0ProXaa: 0.0 ± 0.0
Gln
2.872GlnAla: 2.872 ± 0.855
1.231GlnCys: 1.231 ± 0.724
0.821GlnAsp: 0.821 ± 0.557
3.283GlnGlu: 3.283 ± 1.408
1.641GlnPhe: 1.641 ± 0.609
1.641GlnGly: 1.641 ± 0.778
0.41GlnHis: 0.41 ± 0.342
4.103GlnIle: 4.103 ± 1.099
0.821GlnLys: 0.821 ± 0.55
2.872GlnLeu: 2.872 ± 1.376
0.821GlnMet: 0.821 ± 0.658
0.41GlnAsn: 0.41 ± 0.434
1.231GlnPro: 1.231 ± 0.649
2.052GlnGln: 2.052 ± 0.846
2.462GlnArg: 2.462 ± 0.942
2.872GlnSer: 2.872 ± 0.924
2.052GlnThr: 2.052 ± 0.531
1.231GlnVal: 1.231 ± 0.691
0.41GlnTrp: 0.41 ± 0.471
2.462GlnTyr: 2.462 ± 0.738
0.0GlnXaa: 0.0 ± 0.0
Arg
1.231ArgAla: 1.231 ± 0.747
0.41ArgCys: 0.41 ± 0.356
1.641ArgAsp: 1.641 ± 0.996
2.462ArgGlu: 2.462 ± 1.407
2.462ArgPhe: 2.462 ± 1.068
2.462ArgGly: 2.462 ± 1.054
1.641ArgHis: 1.641 ± 0.698
4.103ArgIle: 4.103 ± 1.595
2.052ArgLys: 2.052 ± 0.869
3.693ArgLeu: 3.693 ± 1.413
1.231ArgMet: 1.231 ± 0.845
1.231ArgAsn: 1.231 ± 0.462
2.462ArgPro: 2.462 ± 0.62
1.231ArgGln: 1.231 ± 0.462
2.052ArgArg: 2.052 ± 1.309
0.821ArgSer: 0.821 ± 0.471
4.514ArgThr: 4.514 ± 1.085
2.052ArgVal: 2.052 ± 0.875
1.231ArgTrp: 1.231 ± 0.797
2.462ArgTyr: 2.462 ± 0.762
0.0ArgXaa: 0.0 ± 0.0
Ser
4.924SerAla: 4.924 ± 1.004
0.821SerCys: 0.821 ± 0.548
1.641SerAsp: 1.641 ± 0.664
3.693SerGlu: 3.693 ± 0.828
2.052SerPhe: 2.052 ± 0.815
5.745SerGly: 5.745 ± 1.698
1.641SerHis: 1.641 ± 0.65
5.745SerIle: 5.745 ± 0.837
2.052SerLys: 2.052 ± 0.859
8.617SerLeu: 8.617 ± 3.583
2.052SerMet: 2.052 ± 0.841
2.052SerAsn: 2.052 ± 1.031
3.283SerPro: 3.283 ± 0.912
1.641SerGln: 1.641 ± 0.702
4.924SerArg: 4.924 ± 1.345
3.693SerSer: 3.693 ± 1.268
3.283SerThr: 3.283 ± 1.271
4.103SerVal: 4.103 ± 1.002
0.41SerTrp: 0.41 ± 0.36
2.462SerTyr: 2.462 ± 0.845
0.0SerXaa: 0.0 ± 0.0
Thr
4.924ThrAla: 4.924 ± 1.696
0.821ThrCys: 0.821 ± 0.436
2.052ThrAsp: 2.052 ± 0.676
2.462ThrGlu: 2.462 ± 0.962
3.693ThrPhe: 3.693 ± 1.3
5.334ThrGly: 5.334 ± 1.937
2.052ThrHis: 2.052 ± 0.73
2.872ThrIle: 2.872 ± 1.397
4.514ThrLys: 4.514 ± 1.074
2.872ThrLeu: 2.872 ± 0.937
1.231ThrMet: 1.231 ± 0.697
4.514ThrAsn: 4.514 ± 1.338
2.872ThrPro: 2.872 ± 0.837
0.821ThrGln: 0.821 ± 0.436
1.231ThrArg: 1.231 ± 0.698
5.745ThrSer: 5.745 ± 1.713
2.872ThrThr: 2.872 ± 1.065
6.565ThrVal: 6.565 ± 1.524
0.821ThrTrp: 0.821 ± 0.549
1.231ThrTyr: 1.231 ± 0.566
0.0ThrXaa: 0.0 ± 0.0
Val
3.283ValAla: 3.283 ± 1.466
0.41ValCys: 0.41 ± 0.555
4.514ValAsp: 4.514 ± 1.982
4.103ValGlu: 4.103 ± 1.108
2.052ValPhe: 2.052 ± 0.92
4.103ValGly: 4.103 ± 1.159
1.641ValHis: 1.641 ± 0.805
2.872ValIle: 2.872 ± 1.178
3.283ValLys: 3.283 ± 0.652
8.617ValLeu: 8.617 ± 1.864
1.641ValMet: 1.641 ± 1.093
4.514ValAsn: 4.514 ± 1.476
2.462ValPro: 2.462 ± 0.865
1.641ValGln: 1.641 ± 0.599
2.872ValArg: 2.872 ± 1.034
6.976ValSer: 6.976 ± 2.406
5.745ValThr: 5.745 ± 1.506
5.745ValVal: 5.745 ± 2.652
1.641ValTrp: 1.641 ± 0.585
2.462ValTyr: 2.462 ± 0.933
0.0ValXaa: 0.0 ± 0.0
Trp
0.821TrpAla: 0.821 ± 0.455
0.0TrpCys: 0.0 ± 0.0
1.641TrpAsp: 1.641 ± 0.75
0.821TrpGlu: 0.821 ± 0.681
0.821TrpPhe: 0.821 ± 0.684
0.41TrpGly: 0.41 ± 0.356
1.231TrpHis: 1.231 ± 0.58
0.41TrpIle: 0.41 ± 0.468
0.41TrpLys: 0.41 ± 0.342
2.052TrpLeu: 2.052 ± 0.948
0.821TrpMet: 0.821 ± 0.538
0.821TrpAsn: 0.821 ± 0.549
0.0TrpPro: 0.0 ± 0.0
0.41TrpGln: 0.41 ± 0.434
0.821TrpArg: 0.821 ± 0.685
0.821TrpSer: 0.821 ± 0.498
0.0TrpThr: 0.0 ± 0.0
0.821TrpVal: 0.821 ± 0.619
0.0TrpTrp: 0.0 ± 0.0
0.41TrpTyr: 0.41 ± 0.342
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.283TyrAla: 3.283 ± 1.021
0.41TyrCys: 0.41 ± 0.36
2.052TyrAsp: 2.052 ± 0.924
1.231TyrGlu: 1.231 ± 0.574
0.821TyrPhe: 0.821 ± 0.589
4.103TyrGly: 4.103 ± 1.498
0.821TyrHis: 0.821 ± 0.436
1.231TyrIle: 1.231 ± 1.104
4.103TyrLys: 4.103 ± 1.157
3.283TyrLeu: 3.283 ± 1.051
0.821TyrMet: 0.821 ± 0.6
1.641TyrAsn: 1.641 ± 0.805
0.821TyrPro: 0.821 ± 0.436
0.821TyrGln: 0.821 ± 0.748
1.231TyrArg: 1.231 ± 0.747
2.872TyrSer: 2.872 ± 1.733
3.693TyrThr: 3.693 ± 1.107
3.693TyrVal: 3.693 ± 1.411
0.41TyrTrp: 0.41 ± 0.342
0.821TyrTyr: 0.821 ± 0.569
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (2438 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski