Amino acid dipepetide frequency for Vibrio phage VfO4K68

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.581AlaAla: 5.581 ± 2.866
2.537AlaCys: 2.537 ± 1.066
1.522AlaAsp: 1.522 ± 0.86
3.551AlaGlu: 3.551 ± 0.618
3.044AlaPhe: 3.044 ± 1.506
3.044AlaGly: 3.044 ± 1.516
0.507AlaHis: 0.507 ± 0.48
4.059AlaIle: 4.059 ± 1.961
2.029AlaLys: 2.029 ± 1.013
9.132AlaLeu: 9.132 ± 2.091
1.015AlaMet: 1.015 ± 0.713
4.566AlaAsn: 4.566 ± 1.002
2.537AlaPro: 2.537 ± 1.362
3.551AlaGln: 3.551 ± 0.845
2.537AlaArg: 2.537 ± 1.091
5.074AlaSer: 5.074 ± 1.028
4.566AlaThr: 4.566 ± 0.9
5.581AlaVal: 5.581 ± 1.257
1.015AlaTrp: 1.015 ± 0.494
3.044AlaTyr: 3.044 ± 1.724
0.0AlaXaa: 0.0 ± 0.0
Cys
1.015CysAla: 1.015 ± 0.494
0.0CysCys: 0.0 ± 0.0
0.507CysAsp: 0.507 ± 0.381
2.029CysGlu: 2.029 ± 1.025
1.015CysPhe: 1.015 ± 0.96
1.015CysGly: 1.015 ± 0.494
1.015CysHis: 1.015 ± 0.462
0.507CysIle: 0.507 ± 0.406
1.015CysLys: 1.015 ± 0.462
2.029CysLeu: 2.029 ± 0.734
0.507CysMet: 0.507 ± 0.381
0.0CysAsn: 0.0 ± 0.0
1.015CysPro: 1.015 ± 0.462
2.029CysGln: 2.029 ± 0.845
1.522CysArg: 1.522 ± 1.098
1.015CysSer: 1.015 ± 0.494
1.015CysThr: 1.015 ± 0.494
0.507CysVal: 0.507 ± 0.598
0.507CysTrp: 0.507 ± 0.406
2.537CysTyr: 2.537 ± 1.02
0.0CysXaa: 0.0 ± 0.0
Asp
3.044AspAla: 3.044 ± 0.96
3.551AspCys: 3.551 ± 1.125
6.596AspAsp: 6.596 ± 1.428
5.581AspGlu: 5.581 ± 1.381
3.551AspPhe: 3.551 ± 0.801
5.581AspGly: 5.581 ± 1.393
0.507AspHis: 0.507 ± 0.629
6.088AspIle: 6.088 ± 1.272
1.015AspLys: 1.015 ± 0.762
4.059AspLeu: 4.059 ± 1.125
1.015AspMet: 1.015 ± 0.684
2.029AspAsn: 2.029 ± 0.747
3.551AspPro: 3.551 ± 1.83
1.522AspGln: 1.522 ± 1.286
0.507AspArg: 0.507 ± 0.615
3.044AspSer: 3.044 ± 1.485
4.059AspThr: 4.059 ± 1.215
5.581AspVal: 5.581 ± 1.658
0.507AspTrp: 0.507 ± 0.48
2.537AspTyr: 2.537 ± 0.982
0.0AspXaa: 0.0 ± 0.0
Glu
1.015GluAla: 1.015 ± 0.745
0.507GluCys: 0.507 ± 0.406
5.074GluAsp: 5.074 ± 1.671
1.522GluGlu: 1.522 ± 0.802
1.522GluPhe: 1.522 ± 0.66
3.551GluGly: 3.551 ± 1.255
1.015GluHis: 1.015 ± 0.812
3.551GluIle: 3.551 ± 1.755
2.537GluLys: 2.537 ± 0.801
5.581GluLeu: 5.581 ± 1.499
0.0GluMet: 0.0 ± 0.0
2.029GluAsn: 2.029 ± 0.747
2.029GluPro: 2.029 ± 0.832
4.566GluGln: 4.566 ± 1.364
2.029GluArg: 2.029 ± 0.745
4.059GluSer: 4.059 ± 1.274
2.029GluThr: 2.029 ± 0.988
2.537GluVal: 2.537 ± 0.706
0.507GluTrp: 0.507 ± 0.615
7.61GluTyr: 7.61 ± 2.615
0.0GluXaa: 0.0 ± 0.0
Phe
4.566PheAla: 4.566 ± 1.487
1.015PheCys: 1.015 ± 0.494
4.059PheAsp: 4.059 ± 1.355
2.537PheGlu: 2.537 ± 1.407
2.029PhePhe: 2.029 ± 1.426
2.029PheGly: 2.029 ± 0.831
1.522PheHis: 1.522 ± 0.584
1.522PheIle: 1.522 ± 0.759
1.522PheLys: 1.522 ± 0.718
4.566PheLeu: 4.566 ± 1.471
0.507PheMet: 0.507 ± 0.406
4.566PheAsn: 4.566 ± 1.187
2.029PhePro: 2.029 ± 0.988
1.015PheGln: 1.015 ± 0.96
2.029PheArg: 2.029 ± 0.968
3.044PheSer: 3.044 ± 1.641
3.044PheThr: 3.044 ± 1.169
1.522PheVal: 1.522 ± 0.671
0.507PheTrp: 0.507 ± 0.381
1.522PheTyr: 1.522 ± 0.584
0.0PheXaa: 0.0 ± 0.0
Gly
5.581GlyAla: 5.581 ± 1.85
2.537GlyCys: 2.537 ± 0.959
5.074GlyAsp: 5.074 ± 1.441
4.059GlyGlu: 4.059 ± 1.274
5.581GlyPhe: 5.581 ± 1.613
2.537GlyGly: 2.537 ± 0.706
1.015GlyHis: 1.015 ± 0.636
3.551GlyIle: 3.551 ± 0.96
5.074GlyLys: 5.074 ± 1.873
7.61GlyLeu: 7.61 ± 1.436
3.044GlyMet: 3.044 ± 1.612
2.029GlyAsn: 2.029 ± 0.527
1.015GlyPro: 1.015 ± 0.462
3.044GlyGln: 3.044 ± 1.356
3.044GlyArg: 3.044 ± 0.679
6.088GlySer: 6.088 ± 1.823
6.088GlyThr: 6.088 ± 1.555
6.596GlyVal: 6.596 ± 1.531
0.507GlyTrp: 0.507 ± 0.5
2.029GlyTyr: 2.029 ± 0.747
0.0GlyXaa: 0.0 ± 0.0
His
1.015HisAla: 1.015 ± 0.734
1.015HisCys: 1.015 ± 0.96
3.044HisAsp: 3.044 ± 0.803
2.537HisGlu: 2.537 ± 1.55
1.522HisPhe: 1.522 ± 0.741
2.029HisGly: 2.029 ± 1.196
0.0HisHis: 0.0 ± 0.0
1.015HisIle: 1.015 ± 0.96
2.537HisLys: 2.537 ± 1.038
1.522HisLeu: 1.522 ± 0.793
0.507HisMet: 0.507 ± 0.5
0.0HisAsn: 0.0 ± 0.0
1.015HisPro: 1.015 ± 0.559
0.507HisGln: 0.507 ± 0.406
0.507HisArg: 0.507 ± 0.406
0.507HisSer: 0.507 ± 0.406
1.015HisThr: 1.015 ± 0.559
1.015HisVal: 1.015 ± 0.494
1.015HisTrp: 1.015 ± 0.738
0.507HisTyr: 0.507 ± 0.406
0.0HisXaa: 0.0 ± 0.0
Ile
3.551IleAla: 3.551 ± 2.36
2.029IleCys: 2.029 ± 0.607
4.059IleAsp: 4.059 ± 1.014
4.059IleGlu: 4.059 ± 0.601
3.551IlePhe: 3.551 ± 1.046
6.088IleGly: 6.088 ± 0.734
2.029IleHis: 2.029 ± 1.196
1.522IleIle: 1.522 ± 0.784
3.551IleLys: 3.551 ± 1.573
3.551IleLeu: 3.551 ± 1.254
0.507IleMet: 0.507 ± 0.381
4.059IleAsn: 4.059 ± 1.661
1.522IlePro: 1.522 ± 0.718
2.029IleGln: 2.029 ± 1.285
1.015IleArg: 1.015 ± 0.494
2.029IleSer: 2.029 ± 0.818
4.059IleThr: 4.059 ± 1.479
1.522IleVal: 1.522 ± 0.718
0.0IleTrp: 0.0 ± 0.0
4.566IleTyr: 4.566 ± 0.854
0.0IleXaa: 0.0 ± 0.0
Lys
4.059LysAla: 4.059 ± 1.125
0.0LysCys: 0.0 ± 0.0
2.537LysAsp: 2.537 ± 0.816
2.029LysGlu: 2.029 ± 0.829
4.059LysPhe: 4.059 ± 1.079
5.074LysGly: 5.074 ± 1.874
2.029LysHis: 2.029 ± 1.364
2.029LysIle: 2.029 ± 1.19
3.044LysLys: 3.044 ± 2.011
4.566LysLeu: 4.566 ± 1.325
1.522LysMet: 1.522 ± 0.823
2.029LysAsn: 2.029 ± 0.739
3.551LysPro: 3.551 ± 1.101
1.522LysGln: 1.522 ± 0.521
3.044LysArg: 3.044 ± 1.285
5.581LysSer: 5.581 ± 1.074
2.029LysThr: 2.029 ± 1.616
3.551LysVal: 3.551 ± 1.842
0.507LysTrp: 0.507 ± 0.48
2.029LysTyr: 2.029 ± 0.894
0.0LysXaa: 0.0 ± 0.0
Leu
4.566LeuAla: 4.566 ± 1.762
0.507LeuCys: 0.507 ± 0.48
4.566LeuAsp: 4.566 ± 1.544
5.074LeuGlu: 5.074 ± 1.989
5.074LeuPhe: 5.074 ± 2.255
6.088LeuGly: 6.088 ± 0.998
3.551LeuHis: 3.551 ± 1.87
6.088LeuIle: 6.088 ± 1.438
5.074LeuLys: 5.074 ± 1.043
9.132LeuLeu: 9.132 ± 1.055
2.029LeuMet: 2.029 ± 1.35
7.103LeuAsn: 7.103 ± 1.424
2.537LeuPro: 2.537 ± 0.894
2.029LeuGln: 2.029 ± 1.025
5.581LeuArg: 5.581 ± 1.246
4.059LeuSer: 4.059 ± 0.781
3.551LeuThr: 3.551 ± 0.69
7.103LeuVal: 7.103 ± 1.661
1.015LeuTrp: 1.015 ± 0.494
2.029LeuTyr: 2.029 ± 0.832
0.0LeuXaa: 0.0 ± 0.0
Met
3.044MetAla: 3.044 ± 1.896
0.507MetCys: 0.507 ± 0.381
1.015MetAsp: 1.015 ± 0.667
1.522MetGlu: 1.522 ± 0.984
0.0MetPhe: 0.0 ± 0.0
0.507MetGly: 0.507 ± 0.406
0.0MetHis: 0.0 ± 0.0
0.507MetIle: 0.507 ± 0.629
2.537MetLys: 2.537 ± 1.523
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.507MetAsn: 0.507 ± 0.381
0.0MetPro: 0.0 ± 0.0
0.507MetGln: 0.507 ± 0.406
1.015MetArg: 1.015 ± 0.681
2.029MetSer: 2.029 ± 1.453
1.522MetThr: 1.522 ± 1.07
1.522MetVal: 1.522 ± 0.983
0.507MetTrp: 0.507 ± 0.615
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.059AsnAla: 4.059 ± 1.934
0.0AsnCys: 0.0 ± 0.0
1.522AsnAsp: 1.522 ± 0.862
1.522AsnGlu: 1.522 ± 0.521
2.029AsnPhe: 2.029 ± 0.748
4.566AsnGly: 4.566 ± 1.523
0.507AsnHis: 0.507 ± 0.5
3.044AsnIle: 3.044 ± 1.079
4.566AsnLys: 4.566 ± 1.029
2.029AsnLeu: 2.029 ± 0.762
0.507AsnMet: 0.507 ± 0.598
3.551AsnAsn: 3.551 ± 1.244
4.566AsnPro: 4.566 ± 1.247
1.522AsnGln: 1.522 ± 0.669
1.015AsnArg: 1.015 ± 0.494
3.044AsnSer: 3.044 ± 1.4
4.059AsnThr: 4.059 ± 1.496
5.581AsnVal: 5.581 ± 1.807
1.522AsnTrp: 1.522 ± 0.749
1.015AsnTyr: 1.015 ± 0.462
0.0AsnXaa: 0.0 ± 0.0
Pro
4.059ProAla: 4.059 ± 0.761
0.0ProCys: 0.0 ± 0.0
8.118ProAsp: 8.118 ± 3.545
2.537ProGlu: 2.537 ± 1.283
0.507ProPhe: 0.507 ± 0.406
1.522ProGly: 1.522 ± 0.82
0.507ProHis: 0.507 ± 0.406
2.029ProIle: 2.029 ± 0.607
2.029ProLys: 2.029 ± 0.738
4.566ProLeu: 4.566 ± 1.828
0.0ProMet: 0.0 ± 0.0
0.0ProAsn: 0.0 ± 0.0
1.522ProPro: 1.522 ± 0.698
0.507ProGln: 0.507 ± 0.629
2.029ProArg: 2.029 ± 0.93
2.537ProSer: 2.537 ± 0.552
5.074ProThr: 5.074 ± 1.818
4.566ProVal: 4.566 ± 1.866
0.0ProTrp: 0.0 ± 0.0
0.507ProTyr: 0.507 ± 0.406
0.0ProXaa: 0.0 ± 0.0
Gln
2.537GlnAla: 2.537 ± 0.871
1.522GlnCys: 1.522 ± 1.143
1.015GlnAsp: 1.015 ± 0.524
2.537GlnGlu: 2.537 ± 1.316
1.015GlnPhe: 1.015 ± 0.462
2.029GlnGly: 2.029 ± 0.957
0.507GlnHis: 0.507 ± 0.406
3.044GlnIle: 3.044 ± 1.079
1.015GlnLys: 1.015 ± 0.684
2.537GlnLeu: 2.537 ± 1.504
0.0GlnMet: 0.0 ± 0.0
0.507GlnAsn: 0.507 ± 0.381
1.522GlnPro: 1.522 ± 0.718
2.029GlnGln: 2.029 ± 0.988
2.537GlnArg: 2.537 ± 1.023
2.537GlnSer: 2.537 ± 0.762
2.537GlnThr: 2.537 ± 0.763
1.015GlnVal: 1.015 ± 0.684
0.507GlnTrp: 0.507 ± 0.629
2.537GlnTyr: 2.537 ± 0.783
0.0GlnXaa: 0.0 ± 0.0
Arg
1.522ArgAla: 1.522 ± 0.862
0.507ArgCys: 0.507 ± 0.381
2.029ArgAsp: 2.029 ± 0.94
1.522ArgGlu: 1.522 ± 1.302
3.551ArgPhe: 3.551 ± 1.474
4.566ArgGly: 4.566 ± 2.21
2.029ArgHis: 2.029 ± 0.734
3.551ArgIle: 3.551 ± 1.702
2.029ArgLys: 2.029 ± 1.413
4.059ArgLeu: 4.059 ± 1.608
1.015ArgMet: 1.015 ± 0.913
1.522ArgAsn: 1.522 ± 0.584
3.044ArgPro: 3.044 ± 0.885
1.522ArgGln: 1.522 ± 0.401
3.551ArgArg: 3.551 ± 2.061
1.522ArgSer: 1.522 ± 0.607
3.551ArgThr: 3.551 ± 1.4
2.029ArgVal: 2.029 ± 0.836
1.015ArgTrp: 1.015 ± 0.812
1.522ArgTyr: 1.522 ± 0.749
0.0ArgXaa: 0.0 ± 0.0
Ser
7.103SerAla: 7.103 ± 1.219
3.044SerCys: 3.044 ± 1.863
2.537SerAsp: 2.537 ± 0.738
3.044SerGlu: 3.044 ± 1.095
2.029SerPhe: 2.029 ± 0.662
7.103SerGly: 7.103 ± 2.354
1.522SerHis: 1.522 ± 0.521
5.074SerIle: 5.074 ± 0.968
1.522SerLys: 1.522 ± 0.635
6.596SerLeu: 6.596 ± 1.507
1.015SerMet: 1.015 ± 0.751
2.029SerAsn: 2.029 ± 1.048
3.044SerPro: 3.044 ± 0.823
1.522SerGln: 1.522 ± 0.82
3.551SerArg: 3.551 ± 1.181
2.537SerSer: 2.537 ± 1.085
4.059SerThr: 4.059 ± 1.215
4.059SerVal: 4.059 ± 1.173
1.015SerTrp: 1.015 ± 0.462
2.537SerTyr: 2.537 ± 0.973
0.0SerXaa: 0.0 ± 0.0
Thr
5.074ThrAla: 5.074 ± 1.058
0.507ThrCys: 0.507 ± 0.406
2.537ThrAsp: 2.537 ± 1.066
1.522ThrGlu: 1.522 ± 0.841
1.522ThrPhe: 1.522 ± 1.005
6.088ThrGly: 6.088 ± 1.561
1.015ThrHis: 1.015 ± 0.462
5.074ThrIle: 5.074 ± 1.449
6.088ThrLys: 6.088 ± 1.397
3.044ThrLeu: 3.044 ± 1.108
1.522ThrMet: 1.522 ± 0.924
4.566ThrAsn: 4.566 ± 1.219
3.044ThrPro: 3.044 ± 1.204
1.015ThrGln: 1.015 ± 0.494
2.537ThrArg: 2.537 ± 0.559
6.088ThrSer: 6.088 ± 1.686
3.551ThrThr: 3.551 ± 0.849
7.61ThrVal: 7.61 ± 1.666
1.522ThrTrp: 1.522 ± 1.092
2.029ThrTyr: 2.029 ± 0.419
0.0ThrXaa: 0.0 ± 0.0
Val
3.044ValAla: 3.044 ± 1.296
0.507ValCys: 0.507 ± 0.598
6.088ValAsp: 6.088 ± 1.856
4.059ValGlu: 4.059 ± 1.056
2.029ValPhe: 2.029 ± 1.151
5.074ValGly: 5.074 ± 1.628
2.029ValHis: 2.029 ± 0.988
1.522ValIle: 1.522 ± 0.759
3.044ValLys: 3.044 ± 0.674
7.103ValLeu: 7.103 ± 1.828
1.015ValMet: 1.015 ± 0.713
5.581ValAsn: 5.581 ± 2.038
3.551ValPro: 3.551 ± 0.603
2.029ValGln: 2.029 ± 0.745
4.059ValArg: 4.059 ± 1.801
7.103ValSer: 7.103 ± 2.137
7.103ValThr: 7.103 ± 1.876
8.118ValVal: 8.118 ± 3.111
1.522ValTrp: 1.522 ± 0.666
2.029ValTyr: 2.029 ± 0.938
0.0ValXaa: 0.0 ± 0.0
Trp
0.507TrpAla: 0.507 ± 0.406
0.507TrpCys: 0.507 ± 0.48
1.015TrpAsp: 1.015 ± 0.73
0.0TrpGlu: 0.0 ± 0.0
1.015TrpPhe: 1.015 ± 0.812
1.015TrpGly: 1.015 ± 0.462
1.015TrpHis: 1.015 ± 0.494
0.507TrpIle: 0.507 ± 0.615
0.507TrpLys: 0.507 ± 0.406
2.029TrpLeu: 2.029 ± 1.183
1.015TrpMet: 1.015 ± 0.602
1.015TrpAsn: 1.015 ± 0.751
0.507TrpPro: 0.507 ± 0.381
0.0TrpGln: 0.0 ± 0.0
1.015TrpArg: 1.015 ± 0.96
1.522TrpSer: 1.522 ± 1.098
0.0TrpThr: 0.0 ± 0.0
1.015TrpVal: 1.015 ± 0.649
0.0TrpTrp: 0.0 ± 0.0
0.507TrpTyr: 0.507 ± 0.48
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.551TyrAla: 3.551 ± 1.182
0.0TyrCys: 0.0 ± 0.0
1.015TyrAsp: 1.015 ± 0.96
2.029TyrGlu: 2.029 ± 0.747
0.507TyrPhe: 0.507 ± 0.406
6.596TyrGly: 6.596 ± 1.93
0.507TyrHis: 0.507 ± 0.406
1.015TyrIle: 1.015 ± 1.231
4.059TyrLys: 4.059 ± 1.102
3.044TyrLeu: 3.044 ± 1.059
0.0TyrMet: 0.0 ± 0.0
2.537TyrAsn: 2.537 ± 0.816
1.015TyrPro: 1.015 ± 0.494
1.015TyrGln: 1.015 ± 0.92
2.537TyrArg: 2.537 ± 1.235
1.522TyrSer: 1.522 ± 0.802
3.551TyrThr: 3.551 ± 1.193
5.581TyrVal: 5.581 ± 1.999
1.015TyrTrp: 1.015 ± 0.462
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1972 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski