Amino acid dipeptide frequency for Vibrio virus Vf33

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
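As a rough illustration of what such a frequency is, the sketch below counts overlapping adjacent pairs and reports them in per mille. It is not the database's own code: the function name `dipeptide_permille`, the toy sequences, the use of one-letter codes (the table uses three-letter codes), and the choice to count pairs only within each protein are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted with a sliding window of width 2 inside
    each protein, and never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the Vf33 proteome.
toy_proteins = ["MKKLA", "AKK"]
freqs = dipeptide_permille(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The frequencies of all observed pairs sum to 1000 per mille, which is a quick sanity check on any such table.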

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and underrepresented ones are marked in blue.
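The arithmetic behind the expected value can be sketched as follows. The three observed values are copied from the table on this page; the `classify` helper and its plain comparison against the uniform expectation are assumptions, since the page does not state the exact rule used for the red and blue marking.

```python
# Expected frequency if all 20 x 20 ordered pairs were equally likely.
N_AMINO_ACIDS = 20
expected_permille = 1000.0 / N_AMINO_ACIDS ** 2  # 2.5 per mille = 0.25 %


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Observed values (per mille) taken from the table on this page.
observed = {"AlaAla": 8.48, "CysCys": 0.0, "AspGly": 9.691}
labels = {pair: classify(value) for pair, value in observed.items()}
print(expected_permille, labels)
```

A stricter comparison would use the product of the two single amino acid frequencies as the expectation instead of a flat 2.5‰, but that requires the single-residue statistics, which are not shown here.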

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Residue order (rows: first residue; columns: second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry reads: dipeptide: frequency (‰) ± error

Ala
AlaAla: 8.48 ± 3.43
AlaCys: 1.211 ± 1.01
AlaAsp: 2.423 ± 1.182
AlaGlu: 3.634 ± 1.331
AlaPhe: 1.211 ± 0.635
AlaGly: 4.24 ± 0.76
AlaHis: 1.211 ± 0.586
AlaIle: 6.057 ± 1.734
AlaLys: 6.057 ± 1.964
AlaLeu: 7.268 ± 2.259
AlaMet: 3.028 ± 1.294
AlaAsn: 1.817 ± 0.892
AlaPro: 3.028 ± 1.48
AlaGln: 2.423 ± 0.984
AlaArg: 3.634 ± 1.444
AlaSer: 5.451 ± 1.038
AlaThr: 2.423 ± 1.2
AlaVal: 6.663 ± 1.649
AlaTrp: 1.817 ± 0.929
AlaTyr: 6.057 ± 1.153
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.817 ± 0.837
CysCys: 0.0 ± 0.0
CysAsp: 0.606 ± 0.505
CysGlu: 2.423 ± 1.36
CysPhe: 1.817 ± 0.829
CysGly: 1.817 ± 0.313
CysHis: 1.211 ± 0.502
CysIle: 0.606 ± 0.505
CysLys: 1.211 ± 0.502
CysLeu: 1.211 ± 0.863
CysMet: 0.606 ± 0.505
CysAsn: 0.0 ± 0.0
CysPro: 1.211 ± 0.502
CysGln: 0.606 ± 0.505
CysArg: 0.606 ± 0.468
CysSer: 1.211 ± 0.635
CysThr: 2.423 ± 1.316
CysVal: 0.606 ± 0.523
CysTrp: 0.0 ± 0.0
CysTyr: 1.211 ± 0.502
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.423 ± 1.005
AspCys: 2.423 ± 1.34
AspAsp: 4.846 ± 2.093
AspGlu: 3.028 ± 0.792
AspPhe: 3.634 ± 0.626
AspGly: 9.691 ± 1.659
AspHis: 1.211 ± 0.78
AspIle: 7.268 ± 1.317
AspLys: 0.606 ± 0.523
AspLeu: 6.057 ± 1.422
AspMet: 3.028 ± 0.689
AspAsn: 1.817 ± 1.515
AspPro: 6.057 ± 3.07
AspGln: 0.606 ± 0.726
AspArg: 0.606 ± 0.523
AspSer: 2.423 ± 1.519
AspThr: 6.663 ± 2.845
AspVal: 6.057 ± 2.178
AspTrp: 2.423 ± 0.602
AspTyr: 2.423 ± 1.172
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.24 ± 1.941
GluCys: 0.0 ± 0.0
GluAsp: 6.057 ± 0.812
GluGlu: 1.817 ± 0.313
GluPhe: 1.211 ± 0.502
GluGly: 1.817 ± 0.686
GluHis: 0.0 ± 0.0
GluIle: 2.423 ± 1.581
GluLys: 3.634 ± 1.158
GluLeu: 3.028 ± 0.759
GluMet: 0.0 ± 0.0
GluAsn: 1.211 ± 1.01
GluPro: 0.0 ± 0.0
GluGln: 4.24 ± 0.957
GluArg: 1.817 ± 0.891
GluSer: 3.634 ± 1.035
GluThr: 3.028 ± 0.792
GluVal: 3.028 ± 1.197
GluTrp: 1.817 ± 1.293
GluTyr: 3.634 ± 0.844
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.634 ± 1.733
PheCys: 0.606 ± 0.468
PheAsp: 4.24 ± 1.123
PheGlu: 2.423 ± 0.852
PhePhe: 1.817 ± 1.097
PheGly: 6.057 ± 2.224
PheHis: 0.606 ± 0.468
PheIle: 1.211 ± 1.01
PheLys: 1.211 ± 0.781
PheLeu: 1.817 ± 1.045
PheMet: 0.0 ± 0.0
PheAsn: 2.423 ± 1.144
PhePro: 1.817 ± 0.923
PheGln: 1.817 ± 0.829
PheArg: 0.606 ± 0.468
PheSer: 3.028 ± 1.222
PheThr: 4.24 ± 1.127
PheVal: 3.634 ± 0.747
PheTrp: 1.817 ± 0.793
PheTyr: 3.634 ± 1.709
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.451 ± 2.202
GlyCys: 2.423 ± 0.62
GlyAsp: 5.451 ± 2.889
GlyGlu: 3.634 ± 0.905
GlyPhe: 4.846 ± 1.152
GlyGly: 5.451 ± 1.442
GlyHis: 0.0 ± 0.0
GlyIle: 3.634 ± 1.328
GlyLys: 4.846 ± 1.389
GlyLeu: 7.874 ± 1.352
GlyMet: 1.211 ± 0.747
GlyAsn: 1.817 ± 0.892
GlyPro: 2.423 ± 0.701
GlyGln: 2.423 ± 0.999
GlyArg: 3.634 ± 1.709
GlySer: 4.24 ± 0.66
GlyThr: 4.846 ± 1.204
GlyVal: 4.24 ± 1.615
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.634 ± 1.675
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.0 ± 0.0
HisCys: 1.211 ± 0.936
HisAsp: 1.817 ± 0.829
HisGlu: 1.817 ± 1.404
HisPhe: 1.211 ± 0.929
HisGly: 0.606 ± 0.505
HisHis: 0.0 ± 0.0
HisIle: 2.423 ± 1.172
HisLys: 0.606 ± 0.468
HisLeu: 0.606 ± 0.523
HisMet: 0.606 ± 0.523
HisAsn: 0.606 ± 0.505
HisPro: 0.0 ± 0.0
HisGln: 0.606 ± 0.726
HisArg: 1.211 ± 1.047
HisSer: 0.606 ± 0.523
HisThr: 0.606 ± 0.505
HisVal: 0.606 ± 0.468
HisTrp: 0.606 ± 0.468
HisTyr: 1.211 ± 0.5
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.846 ± 1.869
IleCys: 0.606 ± 0.468
IleAsp: 5.451 ± 0.743
IleGlu: 4.24 ± 0.72
IlePhe: 2.423 ± 0.62
IleGly: 4.24 ± 1.365
IleHis: 1.211 ± 0.5
IleIle: 3.634 ± 1.297
IleLys: 4.24 ± 1.878
IleLeu: 4.846 ± 1.422
IleMet: 1.817 ± 0.808
IleAsn: 3.634 ± 1.777
IlePro: 3.028 ± 1.506
IleGln: 1.211 ± 1.01
IleArg: 2.423 ± 1.164
IleSer: 3.028 ± 1.062
IleThr: 4.846 ± 1.88
IleVal: 1.817 ± 1.026
IleTrp: 1.211 ± 0.586
IleTyr: 3.634 ± 1.097
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.451 ± 1.019
LysCys: 0.0 ± 0.0
LysAsp: 4.24 ± 1.289
LysGlu: 0.606 ± 0.468
LysPhe: 3.028 ± 1.786
LysGly: 1.817 ± 0.313
LysHis: 2.423 ± 1.129
LysIle: 4.24 ± 1.157
LysLys: 6.663 ± 1.82
LysLeu: 4.24 ± 1.127
LysMet: 0.606 ± 0.869
LysAsn: 1.817 ± 0.858
LysPro: 3.028 ± 0.792
LysGln: 2.423 ± 1.054
LysArg: 4.24 ± 1.408
LysSer: 4.846 ± 0.767
LysThr: 1.211 ± 0.5
LysVal: 3.028 ± 1.876
LysTrp: 0.606 ± 0.468
LysTyr: 0.606 ± 0.523
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.057 ± 1.688
LeuCys: 2.423 ± 1.36
LeuAsp: 4.24 ± 1.406
LeuGlu: 4.846 ± 1.7
LeuPhe: 3.028 ± 2.218
LeuGly: 6.057 ± 0.966
LeuHis: 1.817 ± 1.404
LeuIle: 4.846 ± 1.451
LeuLys: 4.24 ± 1.126
LeuLeu: 7.874 ± 3.248
LeuMet: 2.423 ± 1.618
LeuAsn: 6.663 ± 1.587
LeuPro: 3.028 ± 1.233
LeuGln: 2.423 ± 0.644
LeuArg: 3.634 ± 1.701
LeuSer: 6.057 ± 2.0
LeuThr: 3.634 ± 1.664
LeuVal: 7.874 ± 2.132
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.211 ± 0.926
LeuXaa: 0.0 ± 0.0

Met
MetAla: 4.24 ± 2.05
MetCys: 0.0 ± 0.0
MetAsp: 1.817 ± 1.097
MetGlu: 0.606 ± 0.767
MetPhe: 0.0 ± 0.0
MetGly: 0.606 ± 0.523
MetHis: 0.0 ± 0.0
MetIle: 1.817 ± 1.112
MetLys: 1.211 ± 0.759
MetLeu: 1.211 ± 1.535
MetMet: 0.0 ± 0.0
MetAsn: 1.211 ± 1.01
MetPro: 1.211 ± 0.876
MetGln: 0.606 ± 0.505
MetArg: 1.211 ± 0.502
MetSer: 2.423 ± 1.128
MetThr: 2.423 ± 0.973
MetVal: 1.817 ± 1.045
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.634 ± 2.024
AsnCys: 0.0 ± 0.0
AsnAsp: 2.423 ± 1.005
AsnGlu: 0.606 ± 0.523
AsnPhe: 1.211 ± 1.01
AsnGly: 1.817 ± 0.892
AsnHis: 0.606 ± 0.505
AsnIle: 3.634 ± 1.709
AsnLys: 4.846 ± 0.893
AsnLeu: 0.606 ± 0.767
AsnMet: 1.211 ± 0.926
AsnAsn: 1.211 ± 0.936
AsnPro: 3.634 ± 1.701
AsnGln: 2.423 ± 1.36
AsnArg: 0.0 ± 0.0
AsnSer: 3.028 ± 1.565
AsnThr: 7.268 ± 3.112
AsnVal: 2.423 ± 1.687
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.606 ± 0.468
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.423 ± 1.109
ProCys: 0.0 ± 0.0
ProAsp: 8.48 ± 3.585
ProGlu: 1.817 ± 0.642
ProPhe: 3.634 ± 1.618
ProGly: 1.211 ± 0.876
ProHis: 0.0 ± 0.0
ProIle: 2.423 ± 0.602
ProLys: 0.606 ± 0.468
ProLeu: 3.634 ± 2.225
ProMet: 0.0 ± 0.0
ProAsn: 0.606 ± 0.618
ProPro: 3.028 ± 0.895
ProGln: 1.817 ± 1.073
ProArg: 1.211 ± 0.929
ProSer: 4.24 ± 1.406
ProThr: 4.24 ± 1.548
ProVal: 6.663 ± 1.812
ProTrp: 0.0 ± 0.0
ProTyr: 1.817 ± 1.515
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.634 ± 1.442
GlnCys: 1.211 ± 0.8
GlnAsp: 0.606 ± 0.523
GlnGlu: 1.211 ± 0.876
GlnPhe: 2.423 ± 0.602
GlnGly: 1.211 ± 0.5
GlnHis: 0.0 ± 0.0
GlnIle: 4.24 ± 0.936
GlnLys: 1.211 ± 0.502
GlnLeu: 4.24 ± 1.959
GlnMet: 0.606 ± 0.505
GlnAsn: 2.423 ± 0.999
GlnPro: 1.817 ± 1.112
GlnGln: 1.817 ± 0.858
GlnArg: 1.211 ± 0.586
GlnSer: 1.211 ± 0.586
GlnThr: 2.423 ± 1.005
GlnVal: 1.817 ± 0.313
GlnTrp: 1.817 ± 1.097
GlnTyr: 2.423 ± 1.434
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.423 ± 0.941
ArgCys: 2.423 ± 1.316
ArgAsp: 3.028 ± 1.564
ArgGlu: 2.423 ± 1.56
ArgPhe: 1.211 ± 0.5
ArgGly: 3.634 ± 1.271
ArgHis: 1.817 ± 0.923
ArgIle: 3.634 ± 0.964
ArgLys: 1.211 ± 1.047
ArgLeu: 4.846 ± 2.091
ArgMet: 0.0 ± 0.0
ArgAsn: 1.211 ± 0.586
ArgPro: 4.24 ± 1.194
ArgGln: 0.606 ± 0.468
ArgArg: 3.634 ± 2.005
ArgSer: 0.606 ± 0.468
ArgThr: 2.423 ± 1.857
ArgVal: 1.817 ± 0.313
ArgTrp: 0.606 ± 0.523
ArgTyr: 0.606 ± 0.505
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.846 ± 1.833
SerCys: 1.211 ± 0.771
SerAsp: 4.846 ± 1.442
SerGlu: 3.028 ± 0.759
SerPhe: 3.634 ± 1.556
SerGly: 6.057 ± 1.352
SerHis: 1.211 ± 0.5
SerIle: 5.451 ± 1.979
SerLys: 0.0 ± 0.0
SerLeu: 6.057 ± 1.997
SerMet: 2.423 ± 0.934
SerAsn: 0.606 ± 0.523
SerPro: 1.211 ± 0.936
SerGln: 3.634 ± 1.342
SerArg: 2.423 ± 1.19
SerSer: 2.423 ± 1.316
SerThr: 4.846 ± 1.322
SerVal: 3.634 ± 1.395
SerTrp: 0.606 ± 0.468
SerTyr: 1.211 ± 0.876
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.24 ± 1.083
ThrCys: 3.028 ± 1.8
ThrAsp: 3.028 ± 1.8
ThrGlu: 2.423 ± 0.613
ThrPhe: 2.423 ± 0.62
ThrGly: 6.663 ± 1.972
ThrHis: 0.606 ± 0.468
ThrIle: 1.817 ± 0.851
ThrLys: 6.057 ± 1.375
ThrLeu: 6.057 ± 2.194
ThrMet: 1.211 ± 0.747
ThrAsn: 4.24 ± 1.123
ThrPro: 2.423 ± 1.07
ThrGln: 2.423 ± 1.07
ThrArg: 3.028 ± 0.895
ThrSer: 4.846 ± 0.852
ThrThr: 2.423 ± 0.602
ThrVal: 6.057 ± 1.741
ThrTrp: 1.817 ± 0.891
ThrTyr: 3.028 ± 0.926
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.634 ± 1.305
ValCys: 1.817 ± 1.148
ValAsp: 6.057 ± 2.035
ValGlu: 3.028 ± 1.329
ValPhe: 6.057 ± 1.354
ValGly: 4.24 ± 1.017
ValHis: 2.423 ± 1.172
ValIle: 1.817 ± 0.727
ValLys: 2.423 ± 1.495
ValLeu: 7.268 ± 1.214
ValMet: 1.211 ± 1.041
ValAsn: 5.451 ± 1.852
ValPro: 4.24 ± 1.167
ValGln: 3.028 ± 1.197
ValArg: 3.028 ± 0.798
ValSer: 3.028 ± 1.406
ValThr: 4.846 ± 0.871
ValVal: 3.028 ± 1.66
ValTrp: 0.606 ± 0.468
ValTyr: 1.817 ± 0.89
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.211 ± 0.926
TrpCys: 0.606 ± 0.468
TrpAsp: 0.606 ± 0.726
TrpGlu: 0.606 ± 0.523
TrpPhe: 0.606 ± 0.523
TrpGly: 1.211 ± 0.502
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.606 ± 0.523
TrpLeu: 1.211 ± 0.863
TrpMet: 0.606 ± 0.468
TrpAsn: 1.211 ± 0.929
TrpPro: 1.211 ± 1.047
TrpGln: 0.0 ± 0.0
TrpArg: 1.817 ± 0.923
TrpSer: 0.606 ± 0.468
TrpThr: 0.606 ± 0.505
TrpVal: 2.423 ± 1.508
TrpTrp: 0.606 ± 0.523
TrpTyr: 1.211 ± 0.586
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 5.451 ± 1.686
TyrCys: 0.0 ± 0.0
TyrAsp: 3.028 ± 1.259
TyrGlu: 2.423 ± 1.316
TyrPhe: 1.211 ± 0.5
TyrGly: 3.634 ± 0.99
TyrHis: 0.606 ± 0.523
TyrIle: 1.211 ± 1.047
TyrLys: 4.24 ± 1.319
TyrLeu: 2.423 ± 0.796
TyrMet: 1.211 ± 0.8
TyrAsn: 1.211 ± 0.936
TyrPro: 1.211 ± 1.01
TyrGln: 2.423 ± 0.867
TyrArg: 2.423 ± 1.172
TyrSer: 2.423 ± 0.701
TyrThr: 2.423 ± 0.975
TyrVal: 1.817 ± 0.837
TyrTrp: 0.606 ± 0.468
TyrTyr: 0.606 ± 0.505
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 7 proteins (1652 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
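A protein-level bootstrap can be sketched roughly as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is only an illustration under stated assumptions. The function names, the toy sequences, the one-letter codes, and the use of the sample standard deviation as the error are not taken from the database, whose exact procedure is not described on this page.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Per-mille frequency of each adjacent pair, counted within proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of a dipeptide's frequency over bootstrap resamples.

    Assumption: each resample draws len(proteins) whole proteins with
    replacement, so the protein (not the residue) is the resampling unit.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the Vf33 sequences.
toy_proteins = ["MKKLAAG", "AKKDG", "MDGLLV", "AAGDG"]
error_dg = bootstrap_error(toy_proteins, "DG")
error_ww = bootstrap_error(toy_proteins, "WW")  # pair never observed
print(f"DG error: {error_dg:.3f} per mille, WW error: {error_ww:.3f}")
```

With only a handful of proteins, as in the 7-protein proteome here, resampling at the protein level gives fairly wide error estimates, which is consistent with the large errors relative to the values in the table.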

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski