Amino acid dipeptide frequency for Vibrio phage VALG_phi8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
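The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names, the one-letter residue codes, the use of X for Xaa, and the normalization by the total number of adjacent pairs are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes

# Uniform expectation from the text: 1 in 400, i.e. 0.25 % or 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: frequencies are normalized by the total number of adjacent
    residue pairs; the source does not state the exact normalization.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"    # would be marked red
    if freq_permille < expected:
        return "under"   # would be marked blue
    return "expected"


# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_permille(toy)
print(freqs["KK"], classify(freqs["KK"]))
```

Under this threshold, the AlaAla value of 6.305‰ from the table would be classified as overrepresented and the AlaCys value of 0.485‰ as underrepresented.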

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
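As a small worked example of the unit conversion, the sketch below turns a per mille value into a plain fraction and into a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Per mille -> fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille -> percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 6.305  # AlaAla frequency from the table, in per mille
print(permille_to_fraction(ala_ala))  # roughly 0.0063
print(permille_to_percent(ala_ala))   # roughly 0.63 %
```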

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.305 ± 2.057
AlaCys: 0.485 ± 0.382
AlaAsp: 1.94 ± 1.032
AlaGlu: 3.395 ± 1.284
AlaPhe: 1.455 ± 0.842
AlaGly: 3.395 ± 0.91
AlaHis: 0.97 ± 0.508
AlaIle: 4.365 ± 1.514
AlaLys: 6.79 ± 1.59
AlaLeu: 7.274 ± 1.812
AlaMet: 2.91 ± 1.413
AlaAsn: 2.425 ± 1.129
AlaPro: 2.425 ± 1.276
AlaGln: 2.91 ± 1.747
AlaArg: 3.395 ± 1.086
AlaSer: 4.85 ± 1.957
AlaThr: 1.94 ± 1.174
AlaVal: 6.305 ± 2.095
AlaTrp: 1.455 ± 0.763
AlaTyr: 4.365 ± 1.425
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.94 ± 0.741
CysCys: 0.0 ± 0.0
CysAsp: 0.97 ± 0.813
CysGlu: 1.94 ± 1.119
CysPhe: 1.455 ± 0.82
CysGly: 1.94 ± 0.615
CysHis: 1.455 ± 0.634
CysIle: 0.485 ± 0.382
CysLys: 0.97 ± 0.489
CysLeu: 1.455 ± 0.975
CysMet: 0.485 ± 0.382
CysAsn: 0.97 ± 0.86
CysPro: 0.485 ± 0.382
CysGln: 0.485 ± 0.382
CysArg: 0.485 ± 0.413
CysSer: 0.485 ± 0.642
CysThr: 1.94 ± 1.126
CysVal: 0.485 ± 0.464
CysTrp: 0.0 ± 0.0
CysTyr: 0.97 ± 0.489
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.94 ± 0.716
AspCys: 1.94 ± 1.169
AspAsp: 6.305 ± 1.877
AspGlu: 2.91 ± 1.056
AspPhe: 2.425 ± 0.919
AspGly: 8.729 ± 1.87
AspHis: 0.97 ± 0.688
AspIle: 5.82 ± 1.54
AspLys: 1.455 ± 1.065
AspLeu: 6.79 ± 1.665
AspMet: 2.425 ± 0.761
AspAsn: 1.94 ± 1.167
AspPro: 4.365 ± 2.1
AspGln: 0.485 ± 0.613
AspArg: 0.485 ± 0.464
AspSer: 1.94 ± 1.055
AspThr: 6.305 ± 2.534
AspVal: 4.365 ± 1.526
AspTrp: 2.425 ± 0.761
AspTyr: 3.395 ± 1.627
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.85 ± 1.787
GluCys: 0.485 ± 0.558
GluAsp: 4.365 ± 1.371
GluGlu: 2.425 ± 0.859
GluPhe: 1.455 ± 0.634
GluGly: 2.425 ± 1.067
GluHis: 0.0 ± 0.0
GluIle: 2.91 ± 1.494
GluLys: 3.88 ± 1.626
GluLeu: 4.365 ± 1.557
GluMet: 0.485 ± 0.511
GluAsn: 1.455 ± 0.869
GluPro: 1.455 ± 0.791
GluGln: 5.82 ± 2.068
GluArg: 1.94 ± 0.543
GluSer: 4.365 ± 1.249
GluThr: 1.94 ± 1.015
GluVal: 2.91 ± 1.243
GluTrp: 1.94 ± 1.151
GluTyr: 2.91 ± 0.972
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.88 ± 1.664
PheCys: 0.485 ± 0.413
PheAsp: 4.365 ± 1.231
PheGlu: 3.395 ± 1.027
PhePhe: 1.94 ± 1.056
PheGly: 4.365 ± 1.761
PheHis: 1.455 ± 0.766
PheIle: 1.455 ± 0.869
PheLys: 1.455 ± 0.776
PheLeu: 1.455 ± 0.933
PheMet: 0.0 ± 0.0
PheAsn: 2.425 ± 1.016
PhePro: 1.94 ± 0.931
PheGln: 1.455 ± 0.82
PheArg: 1.455 ± 1.122
PheSer: 3.395 ± 1.277
PheThr: 4.365 ± 1.142
PheVal: 2.91 ± 0.811
PheTrp: 1.455 ± 0.722
PheTyr: 3.395 ± 1.666
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.88 ± 1.661
GlyCys: 2.425 ± 0.784
GlyAsp: 4.365 ± 1.649
GlyGlu: 3.395 ± 0.884
GlyPhe: 4.85 ± 1.566
GlyGly: 5.335 ± 1.383
GlyHis: 0.485 ± 0.382
GlyIle: 3.395 ± 1.248
GlyLys: 5.335 ± 1.46
GlyLeu: 5.82 ± 1.425
GlyMet: 1.455 ± 0.734
GlyAsn: 1.455 ± 0.774
GlyPro: 1.94 ± 0.818
GlyGln: 2.91 ± 0.981
GlyArg: 2.91 ± 1.769
GlySer: 5.82 ± 1.42
GlyThr: 4.365 ± 1.309
GlyVal: 6.305 ± 2.35
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.395 ± 1.405
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.485 ± 0.558
HisCys: 0.97 ± 0.827
HisAsp: 1.455 ± 0.82
HisGlu: 1.94 ± 1.271
HisPhe: 1.455 ± 0.948
HisGly: 0.485 ± 0.382
HisHis: 0.97 ± 0.693
HisIle: 2.425 ± 1.092
HisLys: 0.485 ± 0.413
HisLeu: 1.455 ± 0.983
HisMet: 0.97 ± 0.524
HisAsn: 0.485 ± 0.382
HisPro: 0.0 ± 0.0
HisGln: 0.485 ± 0.613
HisArg: 0.97 ± 0.929
HisSer: 0.97 ± 0.643
HisThr: 0.485 ± 0.382
HisVal: 0.485 ± 0.413
HisTrp: 0.485 ± 0.413
HisTyr: 1.94 ± 0.669
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.88 ± 1.778
IleCys: 0.97 ± 0.489
IleAsp: 4.365 ± 1.005
IleGlu: 4.85 ± 0.847
IlePhe: 1.94 ± 0.737
IleGly: 4.365 ± 1.046
IleHis: 1.455 ± 0.633
IleIle: 3.395 ± 1.303
IleLys: 4.365 ± 1.728
IleLeu: 5.335 ± 1.253
IleMet: 1.455 ± 0.727
IleAsn: 2.91 ± 1.391
IlePro: 3.395 ± 1.557
IleGln: 1.455 ± 0.869
IleArg: 2.91 ± 0.953
IleSer: 2.91 ± 1.029
IleThr: 4.85 ± 1.908
IleVal: 1.94 ± 1.0
IleTrp: 0.485 ± 0.464
IleTyr: 3.395 ± 1.233
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.82 ± 0.814
LysCys: 0.485 ± 0.511
LysAsp: 4.365 ± 1.123
LysGlu: 0.97 ± 0.667
LysPhe: 2.91 ± 1.453
LysGly: 1.94 ± 0.908
LysHis: 2.91 ± 1.029
LysIle: 3.88 ± 1.293
LysLys: 6.79 ± 1.895
LysLeu: 4.365 ± 1.604
LysMet: 0.97 ± 1.04
LysAsn: 1.94 ± 0.833
LysPro: 1.94 ± 1.015
LysGln: 2.425 ± 1.125
LysArg: 4.365 ± 1.521
LysSer: 6.79 ± 1.222
LysThr: 1.94 ± 0.615
LysVal: 3.395 ± 1.909
LysTrp: 0.97 ± 0.636
LysTyr: 0.97 ± 0.643
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.335 ± 1.641
LeuCys: 1.455 ± 0.774
LeuAsp: 3.395 ± 1.449
LeuGlu: 6.79 ± 1.36
LeuPhe: 2.91 ± 2.118
LeuGly: 6.305 ± 1.344
LeuHis: 1.455 ± 1.24
LeuIle: 5.335 ± 1.714
LeuLys: 5.82 ± 1.118
LeuLeu: 8.244 ± 3.977
LeuMet: 2.425 ± 1.87
LeuAsn: 5.335 ± 1.537
LeuPro: 4.365 ± 1.321
LeuGln: 2.91 ± 1.134
LeuArg: 3.88 ± 1.319
LeuSer: 7.759 ± 2.24
LeuThr: 5.82 ± 2.61
LeuVal: 8.244 ± 2.585
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.455 ± 0.87
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.91 ± 2.26
MetCys: 0.0 ± 0.0
MetAsp: 1.94 ± 1.016
MetGlu: 0.0 ± 0.0
MetPhe: 0.485 ± 0.511
MetGly: 0.485 ± 0.464
MetHis: 0.0 ± 0.0
MetIle: 1.455 ± 1.001
MetLys: 1.94 ± 1.103
MetLeu: 2.425 ± 1.827
MetMet: 0.0 ± 0.0
MetAsn: 2.425 ± 1.581
MetPro: 1.455 ± 0.846
MetGln: 0.485 ± 0.382
MetArg: 0.97 ± 0.489
MetSer: 1.94 ± 1.386
MetThr: 1.94 ± 1.017
MetVal: 1.94 ± 1.066
MetTrp: 0.0 ± 0.0
MetTyr: 0.485 ± 0.558
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.395 ± 1.915
AsnCys: 0.0 ± 0.0
AsnAsp: 2.425 ± 0.985
AsnGlu: 2.425 ± 1.256
AsnPhe: 0.97 ± 0.765
AsnGly: 2.425 ± 0.855
AsnHis: 0.485 ± 0.382
AsnIle: 3.395 ± 1.605
AsnLys: 4.85 ± 1.316
AsnLeu: 1.455 ± 1.095
AsnMet: 1.94 ± 1.05
AsnAsn: 0.485 ± 0.413
AsnPro: 2.91 ± 1.288
AsnGln: 3.88 ± 2.253
AsnArg: 0.485 ± 0.511
AsnSer: 2.425 ± 1.147
AsnThr: 6.305 ± 2.678
AsnVal: 1.455 ± 1.023
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.485 ± 0.413
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.94 ± 0.851
ProCys: 0.0 ± 0.0
ProAsp: 7.759 ± 2.685
ProGlu: 1.94 ± 0.947
ProPhe: 3.88 ± 1.363
ProGly: 0.97 ± 0.726
ProHis: 0.0 ± 0.0
ProIle: 2.425 ± 0.708
ProLys: 0.485 ± 0.413
ProLeu: 3.395 ± 1.924
ProMet: 0.0 ± 0.0
ProAsn: 0.485 ± 0.642
ProPro: 2.425 ± 0.841
ProGln: 1.455 ± 0.986
ProArg: 1.455 ± 0.834
ProSer: 3.395 ± 1.533
ProThr: 4.365 ± 1.749
ProVal: 5.335 ± 1.569
ProTrp: 0.0 ± 0.0
ProTyr: 1.455 ± 1.147
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.88 ± 1.466
GlnCys: 1.455 ± 0.925
GlnAsp: 0.485 ± 0.464
GlnGlu: 1.455 ± 0.846
GlnPhe: 3.395 ± 0.72
GlnGly: 1.455 ± 0.642
GlnHis: 0.485 ± 0.558
GlnIle: 4.365 ± 1.009
GlnLys: 1.455 ± 0.634
GlnLeu: 3.88 ± 2.077
GlnMet: 0.485 ± 0.382
GlnAsn: 1.94 ± 1.047
GlnPro: 1.455 ± 1.001
GlnGln: 2.425 ± 1.176
GlnArg: 1.94 ± 0.905
GlnSer: 2.91 ± 1.512
GlnThr: 0.97 ± 0.489
GlnVal: 1.455 ± 0.488
GlnTrp: 1.455 ± 0.758
GlnTyr: 2.91 ± 1.431
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.91 ± 1.273
ArgCys: 1.94 ± 1.126
ArgAsp: 2.425 ± 1.361
ArgGlu: 2.425 ± 1.482
ArgPhe: 1.455 ± 0.642
ArgGly: 3.395 ± 1.219
ArgHis: 1.455 ± 0.801
ArgIle: 3.395 ± 1.165
ArgLys: 1.455 ± 0.984
ArgLeu: 5.335 ± 2.325
ArgMet: 0.0 ± 0.0
ArgAsn: 0.97 ± 0.508
ArgPro: 2.91 ± 1.162
ArgGln: 0.485 ± 0.413
ArgArg: 4.365 ± 1.987
ArgSer: 2.425 ± 1.653
ArgThr: 2.91 ± 1.466
ArgVal: 1.94 ± 1.068
ArgTrp: 0.485 ± 0.464
ArgTyr: 0.97 ± 0.622
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.395 ± 1.498
SerCys: 1.455 ± 0.83
SerAsp: 2.91 ± 1.389
SerGlu: 2.91 ± 0.772
SerPhe: 2.91 ± 1.39
SerGly: 6.305 ± 1.605
SerHis: 1.455 ± 0.881
SerIle: 5.82 ± 1.585
SerLys: 0.0 ± 0.0
SerLeu: 9.214 ± 3.211
SerMet: 3.395 ± 1.655
SerAsn: 2.91 ± 1.443
SerPro: 1.455 ± 1.122
SerGln: 3.88 ± 1.266
SerArg: 3.395 ± 1.234
SerSer: 2.425 ± 0.954
SerThr: 4.85 ± 1.391
SerVal: 4.85 ± 2.069
SerTrp: 0.485 ± 0.795
SerTyr: 1.94 ± 1.286
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.88 ± 1.017
ThrCys: 2.91 ± 1.46
ThrAsp: 2.91 ± 1.454
ThrGlu: 3.88 ± 1.759
ThrPhe: 1.94 ± 0.737
ThrGly: 7.274 ± 2.397
ThrHis: 0.97 ± 0.667
ThrIle: 2.425 ± 0.773
ThrLys: 4.85 ± 1.474
ThrLeu: 5.335 ± 1.363
ThrMet: 0.97 ± 0.625
ThrAsn: 4.365 ± 1.231
ThrPro: 3.395 ± 1.301
ThrGln: 1.94 ± 0.793
ThrArg: 1.455 ± 0.722
ThrSer: 4.365 ± 1.264
ThrThr: 1.455 ± 0.801
ThrVal: 6.305 ± 2.055
ThrTrp: 1.94 ± 0.749
ThrTyr: 2.425 ± 0.799
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.91 ± 1.826
ValCys: 1.455 ± 0.83
ValAsp: 5.335 ± 1.617
ValGlu: 2.425 ± 1.052
ValPhe: 6.79 ± 1.617
ValGly: 4.365 ± 0.924
ValHis: 1.94 ± 1.015
ValIle: 2.425 ± 1.255
ValLys: 3.395 ± 1.59
ValLeu: 7.274 ± 0.995
ValMet: 1.455 ± 1.125
ValAsn: 5.335 ± 1.986
ValPro: 2.91 ± 1.331
ValGln: 2.425 ± 1.237
ValArg: 3.395 ± 1.185
ValSer: 2.91 ± 1.703
ValThr: 4.85 ± 0.878
ValVal: 2.91 ± 1.741
ValTrp: 0.485 ± 0.413
ValTyr: 1.455 ± 0.913
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.455 ± 1.064
TrpCys: 0.0 ± 0.0
TrpAsp: 0.97 ± 1.084
TrpGlu: 0.485 ± 0.464
TrpPhe: 0.97 ± 0.651
TrpGly: 0.485 ± 0.382
TrpHis: 0.0 ± 0.0
TrpIle: 0.485 ± 0.558
TrpLys: 0.485 ± 0.464
TrpLeu: 1.455 ± 0.794
TrpMet: 0.485 ± 0.413
TrpAsn: 0.97 ± 0.735
TrpPro: 0.97 ± 0.929
TrpGln: 0.0 ± 0.0
TrpArg: 1.455 ± 0.801
TrpSer: 0.97 ± 0.667
TrpThr: 0.485 ± 0.382
TrpVal: 1.455 ± 0.948
TrpTrp: 0.485 ± 0.464
TrpTyr: 0.97 ± 0.508
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.365 ± 1.577
TyrCys: 0.485 ± 0.795
TyrAsp: 4.365 ± 1.468
TyrGlu: 2.425 ± 1.239
TyrPhe: 1.455 ± 0.642
TyrGly: 3.395 ± 1.415
TyrHis: 0.485 ± 0.464
TyrIle: 0.97 ± 0.929
TyrLys: 3.88 ± 1.253
TyrLeu: 2.91 ± 0.989
TyrMet: 0.485 ± 0.73
TyrAsn: 0.97 ± 0.827
TyrPro: 0.97 ± 0.765
TyrGln: 1.94 ± 0.853
TyrArg: 1.94 ± 1.015
TyrSer: 3.395 ± 1.058
TyrThr: 2.91 ± 1.308
TyrVal: 1.455 ± 0.657
TyrTrp: 0.485 ± 0.413
TyrTyr: 0.97 ± 0.813
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (2063 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
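Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an illustrative sketch only. The function names, the fixed seed, and the use of the standard deviation as the error measure are assumptions; the source states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    count = 0
    total = 0
    for seq in proteins:
        n_pairs = max(len(seq) - 1, 0)
        total += n_pairs
        # Count overlapping occurrences of the pair within this protein.
        count += sum(1 for i in range(n_pairs) if seq[i:i + 2] == pair)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["AAAA", "CCCC", "AACC"]
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

A dipeptide that never occurs has a frequency of zero in every resample, so its estimated error is also zero, which matches the 0.0 ± 0.0 entries in the table.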

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski