Amino acid dipepetide frequency for Burkholderia phage BEK

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
21.029AlaAla: 21.029 ± 2.107
1.388AlaCys: 1.388 ± 0.472
8.433AlaAsp: 8.433 ± 0.935
5.871AlaGlu: 5.871 ± 0.654
2.989AlaPhe: 2.989 ± 0.559
10.568AlaGly: 10.568 ± 0.919
2.242AlaHis: 2.242 ± 0.422
4.804AlaIle: 4.804 ± 0.713
4.27AlaLys: 4.27 ± 0.738
12.169AlaLeu: 12.169 ± 1.171
3.629AlaMet: 3.629 ± 0.531
3.096AlaAsn: 3.096 ± 0.535
6.191AlaPro: 6.191 ± 0.695
3.416AlaGln: 3.416 ± 0.592
10.461AlaArg: 10.461 ± 1.06
7.045AlaSer: 7.045 ± 0.913
6.618AlaThr: 6.618 ± 0.748
6.191AlaVal: 6.191 ± 0.719
2.669AlaTrp: 2.669 ± 0.692
3.523AlaTyr: 3.523 ± 0.627
0.0AlaXaa: 0.0 ± 0.0
Cys
0.854CysAla: 0.854 ± 0.258
0.32CysCys: 0.32 ± 0.184
0.534CysAsp: 0.534 ± 0.239
1.174CysGlu: 1.174 ± 0.32
0.213CysPhe: 0.213 ± 0.158
1.174CysGly: 1.174 ± 0.394
0.32CysHis: 0.32 ± 0.173
0.534CysIle: 0.534 ± 0.263
0.213CysLys: 0.213 ± 0.137
0.747CysLeu: 0.747 ± 0.3
0.534CysMet: 0.534 ± 0.224
0.213CysAsn: 0.213 ± 0.177
0.427CysPro: 0.427 ± 0.187
0.427CysGln: 0.427 ± 0.196
1.281CysArg: 1.281 ± 0.517
0.747CysSer: 0.747 ± 0.239
0.427CysThr: 0.427 ± 0.2
0.961CysVal: 0.961 ± 0.29
0.32CysTrp: 0.32 ± 0.201
0.427CysTyr: 0.427 ± 0.207
0.0CysXaa: 0.0 ± 0.0
Asp
7.899AspAla: 7.899 ± 0.785
0.534AspCys: 0.534 ± 0.201
3.736AspAsp: 3.736 ± 0.751
4.483AspGlu: 4.483 ± 0.717
2.562AspPhe: 2.562 ± 0.381
5.017AspGly: 5.017 ± 0.907
1.494AspHis: 1.494 ± 0.357
3.629AspIle: 3.629 ± 0.728
1.815AspLys: 1.815 ± 0.828
5.978AspLeu: 5.978 ± 0.879
1.601AspMet: 1.601 ± 0.373
1.067AspAsn: 1.067 ± 0.268
3.309AspPro: 3.309 ± 0.589
1.921AspGln: 1.921 ± 0.499
3.95AspArg: 3.95 ± 0.737
1.921AspSer: 1.921 ± 0.486
3.523AspThr: 3.523 ± 0.648
3.95AspVal: 3.95 ± 0.642
0.534AspTrp: 0.534 ± 0.244
2.348AspTyr: 2.348 ± 0.383
0.0AspXaa: 0.0 ± 0.0
Glu
4.59GluAla: 4.59 ± 0.969
0.64GluCys: 0.64 ± 0.245
1.708GluAsp: 1.708 ± 0.455
1.708GluGlu: 1.708 ± 0.515
2.775GluPhe: 2.775 ± 0.436
2.669GluGly: 2.669 ± 0.547
1.281GluHis: 1.281 ± 0.416
2.882GluIle: 2.882 ± 0.55
2.348GluLys: 2.348 ± 0.462
5.764GluLeu: 5.764 ± 0.714
0.747GluMet: 0.747 ± 0.293
1.815GluAsn: 1.815 ± 0.418
2.455GluPro: 2.455 ± 0.502
1.815GluGln: 1.815 ± 0.404
7.045GluArg: 7.045 ± 0.545
3.523GluSer: 3.523 ± 0.506
3.096GluThr: 3.096 ± 0.508
3.309GluVal: 3.309 ± 0.616
1.601GluTrp: 1.601 ± 0.374
1.601GluTyr: 1.601 ± 0.41
0.0GluXaa: 0.0 ± 0.0
Phe
6.191PheAla: 6.191 ± 0.783
0.32PheCys: 0.32 ± 0.17
2.455PheAsp: 2.455 ± 0.583
2.028PheGlu: 2.028 ± 0.452
1.708PhePhe: 1.708 ± 0.532
2.242PheGly: 2.242 ± 0.545
1.067PheHis: 1.067 ± 0.451
1.067PheIle: 1.067 ± 0.302
1.601PheLys: 1.601 ± 0.425
2.348PheLeu: 2.348 ± 0.397
0.534PheMet: 0.534 ± 0.224
0.961PheAsn: 0.961 ± 0.392
1.708PhePro: 1.708 ± 0.364
0.747PheGln: 0.747 ± 0.32
2.989PheArg: 2.989 ± 0.632
2.028PheSer: 2.028 ± 0.403
1.815PheThr: 1.815 ± 0.365
2.242PheVal: 2.242 ± 0.511
0.747PheTrp: 0.747 ± 0.257
0.534PheTyr: 0.534 ± 0.272
0.0PheXaa: 0.0 ± 0.0
Gly
8.54GlyAla: 8.54 ± 1.063
1.388GlyCys: 1.388 ± 0.433
3.309GlyAsp: 3.309 ± 0.578
3.523GlyGlu: 3.523 ± 0.644
2.775GlyPhe: 2.775 ± 0.524
5.978GlyGly: 5.978 ± 0.853
1.388GlyHis: 1.388 ± 0.521
2.135GlyIle: 2.135 ± 0.407
3.416GlyLys: 3.416 ± 0.658
6.618GlyLeu: 6.618 ± 0.735
2.669GlyMet: 2.669 ± 0.555
2.242GlyAsn: 2.242 ± 0.644
3.096GlyPro: 3.096 ± 0.609
1.921GlyGln: 1.921 ± 0.378
6.085GlyArg: 6.085 ± 0.66
3.309GlySer: 3.309 ± 0.647
5.017GlyThr: 5.017 ± 0.682
4.483GlyVal: 4.483 ± 0.764
2.242GlyTrp: 2.242 ± 0.405
2.348GlyTyr: 2.348 ± 0.504
0.0GlyXaa: 0.0 ± 0.0
His
4.27HisAla: 4.27 ± 0.701
0.427HisCys: 0.427 ± 0.221
1.494HisAsp: 1.494 ± 0.481
1.388HisGlu: 1.388 ± 0.317
0.427HisPhe: 0.427 ± 0.221
1.388HisGly: 1.388 ± 0.369
0.961HisHis: 0.961 ± 0.318
0.854HisIle: 0.854 ± 0.28
1.174HisLys: 1.174 ± 0.402
1.708HisLeu: 1.708 ± 0.362
0.534HisMet: 0.534 ± 0.228
0.854HisAsn: 0.854 ± 0.279
0.961HisPro: 0.961 ± 0.43
0.854HisGln: 0.854 ± 0.279
1.708HisArg: 1.708 ± 0.436
1.067HisSer: 1.067 ± 0.31
1.601HisThr: 1.601 ± 0.418
2.028HisVal: 2.028 ± 0.384
0.427HisTrp: 0.427 ± 0.189
0.747HisTyr: 0.747 ± 0.249
0.0HisXaa: 0.0 ± 0.0
Ile
5.871IleAla: 5.871 ± 0.65
0.534IleCys: 0.534 ± 0.229
5.337IleAsp: 5.337 ± 0.77
2.775IleGlu: 2.775 ± 0.719
0.747IlePhe: 0.747 ± 0.258
3.523IleGly: 3.523 ± 0.784
1.281IleHis: 1.281 ± 0.305
1.281IleIle: 1.281 ± 0.459
0.961IleLys: 0.961 ± 0.325
2.669IleLeu: 2.669 ± 0.456
0.961IleMet: 0.961 ± 0.328
1.281IleAsn: 1.281 ± 0.268
1.921IlePro: 1.921 ± 0.557
1.174IleGln: 1.174 ± 0.33
2.669IleArg: 2.669 ± 0.488
2.562IleSer: 2.562 ± 0.47
2.135IleThr: 2.135 ± 0.445
3.202IleVal: 3.202 ± 0.696
0.213IleTrp: 0.213 ± 0.119
0.747IleTyr: 0.747 ± 0.267
0.0IleXaa: 0.0 ± 0.0
Lys
5.337LysAla: 5.337 ± 0.706
0.32LysCys: 0.32 ± 0.15
1.601LysAsp: 1.601 ± 0.393
1.388LysGlu: 1.388 ± 0.406
0.854LysPhe: 0.854 ± 0.309
2.882LysGly: 2.882 ± 0.556
0.961LysHis: 0.961 ± 0.348
1.601LysIle: 1.601 ± 0.61
2.348LysLys: 2.348 ± 0.547
3.523LysLeu: 3.523 ± 0.788
1.067LysMet: 1.067 ± 0.282
1.281LysAsn: 1.281 ± 0.386
1.921LysPro: 1.921 ± 0.501
2.028LysGln: 2.028 ± 0.487
5.337LysArg: 5.337 ± 0.674
2.242LysSer: 2.242 ± 0.407
2.028LysThr: 2.028 ± 0.402
1.601LysVal: 1.601 ± 0.43
0.427LysTrp: 0.427 ± 0.182
1.281LysTyr: 1.281 ± 0.347
0.0LysXaa: 0.0 ± 0.0
Leu
10.781LeuAla: 10.781 ± 1.407
1.067LeuCys: 1.067 ± 0.356
5.978LeuAsp: 5.978 ± 0.684
5.551LeuGlu: 5.551 ± 0.691
2.882LeuPhe: 2.882 ± 0.786
6.832LeuGly: 6.832 ± 0.813
1.921LeuHis: 1.921 ± 0.489
3.416LeuIle: 3.416 ± 0.48
3.096LeuLys: 3.096 ± 0.554
7.472LeuLeu: 7.472 ± 0.724
1.601LeuMet: 1.601 ± 0.454
2.775LeuAsn: 2.775 ± 0.449
4.056LeuPro: 4.056 ± 0.72
3.309LeuGln: 3.309 ± 0.49
8.219LeuArg: 8.219 ± 1.136
5.978LeuSer: 5.978 ± 0.791
5.871LeuThr: 5.871 ± 0.589
6.512LeuVal: 6.512 ± 0.722
0.213LeuTrp: 0.213 ± 0.152
2.348LeuTyr: 2.348 ± 0.504
0.0LeuXaa: 0.0 ± 0.0
Met
2.775MetAla: 2.775 ± 0.534
0.213MetCys: 0.213 ± 0.143
1.921MetAsp: 1.921 ± 0.449
0.747MetGlu: 0.747 ± 0.278
0.427MetPhe: 0.427 ± 0.196
1.494MetGly: 1.494 ± 0.454
0.427MetHis: 0.427 ± 0.218
0.534MetIle: 0.534 ± 0.273
0.213MetLys: 0.213 ± 0.156
1.708MetLeu: 1.708 ± 0.382
0.107MetMet: 0.107 ± 0.107
0.961MetAsn: 0.961 ± 0.282
1.388MetPro: 1.388 ± 0.4
0.64MetGln: 0.64 ± 0.272
2.135MetArg: 2.135 ± 0.458
1.388MetSer: 1.388 ± 0.356
2.882MetThr: 2.882 ± 0.586
1.494MetVal: 1.494 ± 0.31
0.213MetTrp: 0.213 ± 0.143
0.32MetTyr: 0.32 ± 0.187
0.0MetXaa: 0.0 ± 0.0
Asn
2.669AsnAla: 2.669 ± 0.531
0.0AsnCys: 0.0 ± 0.0
2.562AsnAsp: 2.562 ± 0.407
1.708AsnGlu: 1.708 ± 0.352
0.854AsnPhe: 0.854 ± 0.25
3.523AsnGly: 3.523 ± 0.739
0.427AsnHis: 0.427 ± 0.204
1.494AsnIle: 1.494 ± 0.324
1.281AsnLys: 1.281 ± 0.443
2.242AsnLeu: 2.242 ± 0.492
0.747AsnMet: 0.747 ± 0.31
0.961AsnAsn: 0.961 ± 0.406
1.494AsnPro: 1.494 ± 0.371
1.601AsnGln: 1.601 ± 0.453
1.708AsnArg: 1.708 ± 0.382
1.067AsnSer: 1.067 ± 0.283
1.174AsnThr: 1.174 ± 0.308
2.562AsnVal: 2.562 ± 0.415
0.213AsnTrp: 0.213 ± 0.143
0.961AsnTyr: 0.961 ± 0.315
0.0AsnXaa: 0.0 ± 0.0
Pro
6.405ProAla: 6.405 ± 1.076
0.854ProCys: 0.854 ± 0.395
3.309ProAsp: 3.309 ± 0.578
2.882ProGlu: 2.882 ± 0.578
1.494ProPhe: 1.494 ± 0.33
2.348ProGly: 2.348 ± 0.478
1.174ProHis: 1.174 ± 0.374
1.921ProIle: 1.921 ± 0.498
3.096ProLys: 3.096 ± 0.566
3.95ProLeu: 3.95 ± 0.6
0.64ProMet: 0.64 ± 0.287
1.494ProAsn: 1.494 ± 0.489
3.416ProPro: 3.416 ± 0.938
1.174ProGln: 1.174 ± 0.272
4.056ProArg: 4.056 ± 0.545
2.455ProSer: 2.455 ± 0.504
2.348ProThr: 2.348 ± 0.481
3.629ProVal: 3.629 ± 0.657
1.281ProTrp: 1.281 ± 0.388
0.854ProTyr: 0.854 ± 0.297
0.0ProXaa: 0.0 ± 0.0
Gln
3.95GlnAla: 3.95 ± 0.643
0.427GlnCys: 0.427 ± 0.227
0.854GlnAsp: 0.854 ± 0.236
1.174GlnGlu: 1.174 ± 0.362
1.708GlnPhe: 1.708 ± 0.569
1.281GlnGly: 1.281 ± 0.305
0.854GlnHis: 0.854 ± 0.314
1.494GlnIle: 1.494 ± 0.53
1.174GlnLys: 1.174 ± 0.39
3.416GlnLeu: 3.416 ± 0.669
0.64GlnMet: 0.64 ± 0.299
0.854GlnAsn: 0.854 ± 0.367
1.281GlnPro: 1.281 ± 0.368
2.242GlnGln: 2.242 ± 0.603
3.843GlnArg: 3.843 ± 0.704
2.028GlnSer: 2.028 ± 0.483
2.348GlnThr: 2.348 ± 0.532
2.242GlnVal: 2.242 ± 0.455
0.32GlnTrp: 0.32 ± 0.204
0.747GlnTyr: 0.747 ± 0.242
0.0GlnXaa: 0.0 ± 0.0
Arg
9.18ArgAla: 9.18 ± 1.238
1.174ArgCys: 1.174 ± 0.36
4.056ArgAsp: 4.056 ± 0.524
7.045ArgGlu: 7.045 ± 0.89
2.989ArgPhe: 2.989 ± 0.65
5.978ArgGly: 5.978 ± 0.966
2.775ArgHis: 2.775 ± 0.501
5.124ArgIle: 5.124 ± 0.75
4.27ArgLys: 4.27 ± 0.793
7.686ArgLeu: 7.686 ± 0.965
1.388ArgMet: 1.388 ± 0.334
2.669ArgAsn: 2.669 ± 0.665
3.736ArgPro: 3.736 ± 0.71
3.416ArgGln: 3.416 ± 0.631
8.113ArgArg: 8.113 ± 1.078
3.736ArgSer: 3.736 ± 0.74
4.377ArgThr: 4.377 ± 0.65
6.725ArgVal: 6.725 ± 0.916
1.174ArgTrp: 1.174 ± 0.377
2.455ArgTyr: 2.455 ± 0.52
0.0ArgXaa: 0.0 ± 0.0
Ser
7.365SerAla: 7.365 ± 0.806
0.534SerCys: 0.534 ± 0.24
3.096SerAsp: 3.096 ± 0.567
1.708SerGlu: 1.708 ± 0.337
2.348SerPhe: 2.348 ± 0.592
4.804SerGly: 4.804 ± 0.751
1.494SerHis: 1.494 ± 0.527
2.348SerIle: 2.348 ± 0.435
2.135SerLys: 2.135 ± 0.419
5.444SerLeu: 5.444 ± 0.781
1.281SerMet: 1.281 ± 0.419
1.815SerAsn: 1.815 ± 0.523
2.562SerPro: 2.562 ± 0.583
0.854SerGln: 0.854 ± 0.27
4.483SerArg: 4.483 ± 0.686
3.736SerSer: 3.736 ± 0.722
3.096SerThr: 3.096 ± 0.548
2.989SerVal: 2.989 ± 0.613
0.747SerTrp: 0.747 ± 0.305
1.067SerTyr: 1.067 ± 0.261
0.0SerXaa: 0.0 ± 0.0
Thr
5.978ThrAla: 5.978 ± 0.935
0.64ThrCys: 0.64 ± 0.221
4.163ThrAsp: 4.163 ± 0.725
2.348ThrGlu: 2.348 ± 0.428
2.135ThrPhe: 2.135 ± 0.654
4.91ThrGly: 4.91 ± 0.687
2.028ThrHis: 2.028 ± 0.604
2.242ThrIle: 2.242 ± 0.637
2.348ThrLys: 2.348 ± 0.524
5.337ThrLeu: 5.337 ± 0.689
0.747ThrMet: 0.747 ± 0.228
1.601ThrAsn: 1.601 ± 0.375
4.697ThrPro: 4.697 ± 0.765
1.921ThrGln: 1.921 ± 0.441
3.95ThrArg: 3.95 ± 0.508
3.523ThrSer: 3.523 ± 0.726
3.736ThrThr: 3.736 ± 0.7
4.377ThrVal: 4.377 ± 0.58
0.961ThrTrp: 0.961 ± 0.37
0.64ThrTyr: 0.64 ± 0.343
0.0ThrXaa: 0.0 ± 0.0
Val
7.365ValAla: 7.365 ± 0.844
0.747ValCys: 0.747 ± 0.289
4.697ValAsp: 4.697 ± 0.635
3.843ValGlu: 3.843 ± 0.673
3.202ValPhe: 3.202 ± 0.732
4.163ValGly: 4.163 ± 0.702
1.174ValHis: 1.174 ± 0.325
2.775ValIle: 2.775 ± 0.553
2.989ValLys: 2.989 ± 0.584
5.764ValLeu: 5.764 ± 0.933
2.028ValMet: 2.028 ± 0.456
2.455ValAsn: 2.455 ± 0.437
3.096ValPro: 3.096 ± 0.533
1.494ValGln: 1.494 ± 0.356
5.551ValArg: 5.551 ± 0.715
4.056ValSer: 4.056 ± 0.792
3.523ValThr: 3.523 ± 0.601
5.444ValVal: 5.444 ± 0.796
1.174ValTrp: 1.174 ± 0.365
1.708ValTyr: 1.708 ± 0.371
0.0ValXaa: 0.0 ± 0.0
Trp
2.135TrpAla: 2.135 ± 0.461
0.213TrpCys: 0.213 ± 0.15
0.64TrpAsp: 0.64 ± 0.307
0.427TrpGlu: 0.427 ± 0.187
0.961TrpPhe: 0.961 ± 0.308
0.427TrpGly: 0.427 ± 0.172
0.534TrpHis: 0.534 ± 0.237
0.747TrpIle: 0.747 ± 0.312
0.534TrpLys: 0.534 ± 0.212
2.562TrpLeu: 2.562 ± 0.517
0.107TrpMet: 0.107 ± 0.105
0.32TrpAsn: 0.32 ± 0.141
0.64TrpPro: 0.64 ± 0.265
0.854TrpGln: 0.854 ± 0.285
1.708TrpArg: 1.708 ± 0.492
0.747TrpSer: 0.747 ± 0.287
1.067TrpThr: 1.067 ± 0.374
0.961TrpVal: 0.961 ± 0.279
0.32TrpTrp: 0.32 ± 0.164
0.427TrpTyr: 0.427 ± 0.245
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.416TyrAla: 3.416 ± 0.515
0.107TyrCys: 0.107 ± 0.117
1.388TyrAsp: 1.388 ± 0.387
1.601TyrGlu: 1.601 ± 0.405
1.281TyrPhe: 1.281 ± 0.533
1.281TyrGly: 1.281 ± 0.46
1.067TyrHis: 1.067 ± 0.297
0.961TyrIle: 0.961 ± 0.298
0.854TyrLys: 0.854 ± 0.288
2.775TyrLeu: 2.775 ± 0.738
0.32TyrMet: 0.32 ± 0.192
0.534TyrAsn: 0.534 ± 0.217
0.427TyrPro: 0.427 ± 0.195
1.067TyrGln: 1.067 ± 0.308
2.775TyrArg: 2.775 ± 0.62
0.747TyrSer: 0.747 ± 0.296
1.601TyrThr: 1.601 ± 0.388
2.348TyrVal: 2.348 ± 0.496
0.534TyrTrp: 0.534 ± 0.214
0.64TyrTyr: 0.64 ± 0.29
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (9369 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski