Amino acid dipeptide frequency for Enterococcus phage BC611

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
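The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build this page: it assumes one-letter amino acid codes (the table uses three-letter codes), overlapping pairs that never span two proteins, and normalisation by the total number of dipeptides counted. The names `dipeptide_permille`, `UNIFORM_EXPECTATION` and `label` are hypothetical, and the two toy sequences are made up.

```python
from collections import Counter

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping windows of length 2, counted within each protein,
    normalised by the total number of dipeptides found.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation for the 20 x 20 standard dipeptides: 1/400 = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400

def label(freq_permille, expected=UNIFORM_EXPECTATION):
    """Classify a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"

# Toy input (made-up sequences, not taken from this proteome).
toy = ["MKKLA", "AKKA"]
freqs = dipeptide_permille(toy)
```

Applied to values from the table, `label(7.053)` (AlaAla) gives "over" and `label(0.385)` (AlaCys) gives "under", which corresponds to the red and blue marking described above.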

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; for example, 7.053‰ corresponds to 0.007053.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.053 ± 1.389
AlaCys: 0.385 ± 0.156
AlaAsp: 4.36 ± 0.592
AlaGlu: 6.027 ± 0.715
AlaPhe: 2.757 ± 0.431
AlaGly: 4.167 ± 0.681
AlaHis: 0.769 ± 0.255
AlaIle: 5.77 ± 0.669
AlaLys: 5.193 ± 0.76
AlaLeu: 5.514 ± 0.701
AlaMet: 2.565 ± 0.457
AlaAsn: 3.911 ± 0.411
AlaPro: 2.244 ± 0.369
AlaGln: 2.18 ± 0.371
AlaArg: 3.847 ± 0.529
AlaSer: 4.937 ± 0.535
AlaThr: 4.103 ± 0.538
AlaVal: 4.616 ± 0.614
AlaTrp: 0.705 ± 0.219
AlaTyr: 3.27 ± 0.502
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.256 ± 0.108
CysCys: 0.256 ± 0.144
CysAsp: 0.449 ± 0.175
CysGlu: 0.641 ± 0.208
CysPhe: 0.321 ± 0.143
CysGly: 0.577 ± 0.214
CysHis: 0.064 ± 0.059
CysIle: 0.385 ± 0.166
CysLys: 0.513 ± 0.247
CysLeu: 0.641 ± 0.183
CysMet: 0.449 ± 0.181
CysAsn: 0.192 ± 0.1
CysPro: 0.128 ± 0.092
CysGln: 0.385 ± 0.169
CysArg: 0.256 ± 0.131
CysSer: 0.321 ± 0.145
CysThr: 0.256 ± 0.136
CysVal: 0.192 ± 0.112
CysTrp: 0.064 ± 0.06
CysTyr: 0.192 ± 0.124
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.039 ± 0.576
AspCys: 0.513 ± 0.23
AspAsp: 3.27 ± 0.67
AspGlu: 5.001 ± 0.527
AspPhe: 3.655 ± 0.496
AspGly: 3.526 ± 0.473
AspHis: 0.577 ± 0.174
AspIle: 4.103 ± 0.524
AspLys: 4.296 ± 0.495
AspLeu: 5.963 ± 0.61
AspMet: 1.731 ± 0.391
AspAsn: 3.27 ± 0.478
AspPro: 1.154 ± 0.284
AspGln: 1.282 ± 0.288
AspArg: 1.923 ± 0.317
AspSer: 3.526 ± 0.455
AspThr: 4.296 ± 0.775
AspVal: 4.36 ± 0.483
AspTrp: 0.705 ± 0.179
AspTyr: 3.655 ± 0.428
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.886 ± 0.801
GluCys: 0.705 ± 0.201
GluAsp: 5.514 ± 0.652
GluGlu: 9.361 ± 0.917
GluPhe: 3.462 ± 0.541
GluGly: 6.476 ± 0.597
GluHis: 0.513 ± 0.177
GluIle: 4.232 ± 0.512
GluLys: 5.129 ± 0.776
GluLeu: 7.373 ± 0.833
GluMet: 2.565 ± 0.366
GluAsn: 4.103 ± 0.558
GluPro: 1.859 ± 0.28
GluGln: 2.5 ± 0.474
GluArg: 4.616 ± 0.612
GluSer: 5.322 ± 0.581
GluThr: 4.36 ± 0.542
GluVal: 6.347 ± 0.64
GluTrp: 1.859 ± 0.37
GluTyr: 3.013 ± 0.476
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.949 ± 0.408
PheCys: 0.256 ± 0.135
PheAsp: 2.565 ± 0.46
PheGlu: 3.526 ± 0.516
PhePhe: 1.475 ± 0.291
PheGly: 3.013 ± 0.465
PheHis: 0.641 ± 0.199
PheIle: 3.078 ± 0.452
PheLys: 3.142 ± 0.393
PheLeu: 2.821 ± 0.43
PheMet: 0.898 ± 0.297
PheAsn: 2.244 ± 0.35
PhePro: 1.603 ± 0.317
PheGln: 1.154 ± 0.322
PheArg: 1.218 ± 0.253
PheSer: 2.629 ± 0.502
PheThr: 2.5 ± 0.376
PheVal: 2.629 ± 0.399
PheTrp: 0.641 ± 0.236
PheTyr: 1.731 ± 0.359
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.783 ± 0.537
GlyCys: 0.385 ± 0.158
GlyAsp: 3.013 ± 0.463
GlyGlu: 4.424 ± 0.497
GlyPhe: 3.206 ± 0.445
GlyGly: 4.745 ± 0.803
GlyHis: 1.475 ± 0.283
GlyIle: 4.039 ± 0.693
GlyLys: 5.834 ± 0.775
GlyLeu: 4.552 ± 0.626
GlyMet: 1.411 ± 0.301
GlyAsn: 2.949 ± 0.37
GlyPro: 0.0 ± 0.0
GlyGln: 2.18 ± 0.462
GlyArg: 2.757 ± 0.396
GlySer: 3.783 ± 0.469
GlyThr: 4.68 ± 0.865
GlyVal: 4.232 ± 0.729
GlyTrp: 0.833 ± 0.29
GlyTyr: 3.526 ± 0.539
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.769 ± 0.215
HisCys: 0.256 ± 0.12
HisAsp: 0.513 ± 0.191
HisGlu: 1.154 ± 0.269
HisPhe: 0.769 ± 0.192
HisGly: 0.833 ± 0.215
HisHis: 0.449 ± 0.151
HisIle: 1.795 ± 0.377
HisLys: 1.218 ± 0.303
HisLeu: 1.09 ± 0.253
HisMet: 0.128 ± 0.084
HisAsn: 1.026 ± 0.201
HisPro: 0.577 ± 0.167
HisGln: 0.705 ± 0.193
HisArg: 0.833 ± 0.195
HisSer: 0.705 ± 0.188
HisThr: 0.705 ± 0.23
HisVal: 0.898 ± 0.229
HisTrp: 0.064 ± 0.064
HisTyr: 0.641 ± 0.242
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.873 ± 0.51
IleCys: 0.321 ± 0.145
IleAsp: 4.039 ± 0.416
IleGlu: 5.642 ± 0.567
IlePhe: 1.795 ± 0.313
IleGly: 2.885 ± 0.42
IleHis: 1.731 ± 0.376
IleIle: 3.142 ± 0.457
IleLys: 5.45 ± 0.451
IleLeu: 3.975 ± 0.578
IleMet: 2.116 ± 0.387
IleAsn: 5.001 ± 0.551
IlePro: 1.795 ± 0.327
IleGln: 2.629 ± 0.587
IleArg: 2.949 ± 0.442
IleSer: 3.655 ± 0.694
IleThr: 4.167 ± 0.716
IleVal: 3.462 ± 0.472
IleTrp: 0.321 ± 0.172
IleTyr: 2.629 ± 0.432
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.732 ± 0.715
LysCys: 0.128 ± 0.089
LysAsp: 6.091 ± 0.471
LysGlu: 7.117 ± 0.785
LysPhe: 2.693 ± 0.433
LysGly: 4.616 ± 0.481
LysHis: 1.346 ± 0.314
LysIle: 3.398 ± 0.387
LysLys: 5.963 ± 0.862
LysLeu: 7.886 ± 0.614
LysMet: 2.18 ± 0.455
LysAsn: 3.398 ± 0.577
LysPro: 3.013 ± 0.511
LysGln: 2.949 ± 0.534
LysArg: 3.526 ± 0.586
LysSer: 4.937 ± 0.637
LysThr: 4.424 ± 0.487
LysVal: 5.45 ± 0.505
LysTrp: 0.769 ± 0.222
LysTyr: 2.5 ± 0.409
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.45 ± 0.57
LeuCys: 0.449 ± 0.15
LeuAsp: 5.001 ± 0.477
LeuGlu: 8.72 ± 0.925
LeuPhe: 2.885 ± 0.399
LeuGly: 4.873 ± 0.631
LeuHis: 1.282 ± 0.282
LeuIle: 5.45 ± 0.708
LeuLys: 6.989 ± 0.768
LeuLeu: 5.834 ± 0.714
LeuMet: 2.308 ± 0.378
LeuAsn: 4.039 ± 0.472
LeuPro: 2.757 ± 0.414
LeuGln: 4.103 ± 0.486
LeuArg: 3.334 ± 0.471
LeuSer: 5.578 ± 0.564
LeuThr: 6.732 ± 0.651
LeuVal: 5.129 ± 0.606
LeuTrp: 0.898 ± 0.265
LeuTyr: 2.821 ± 0.458
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.629 ± 0.5
MetCys: 0.128 ± 0.079
MetAsp: 1.859 ± 0.306
MetGlu: 2.757 ± 0.44
MetPhe: 1.026 ± 0.219
MetGly: 1.026 ± 0.315
MetHis: 0.192 ± 0.119
MetIle: 1.218 ± 0.266
MetLys: 2.116 ± 0.365
MetLeu: 2.693 ± 0.379
MetMet: 0.385 ± 0.153
MetAsn: 1.603 ± 0.25
MetPro: 0.898 ± 0.236
MetGln: 0.769 ± 0.192
MetArg: 1.346 ± 0.296
MetSer: 1.988 ± 0.361
MetThr: 1.411 ± 0.292
MetVal: 1.346 ± 0.221
MetTrp: 0.192 ± 0.109
MetTyr: 0.705 ± 0.215
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.206 ± 0.581
AsnCys: 0.256 ± 0.128
AsnAsp: 3.078 ± 0.397
AsnGlu: 3.398 ± 0.42
AsnPhe: 2.116 ± 0.341
AsnGly: 4.167 ± 0.514
AsnHis: 1.603 ± 0.382
AsnIle: 3.398 ± 0.392
AsnLys: 4.809 ± 0.747
AsnLeu: 4.36 ± 0.387
AsnMet: 1.026 ± 0.214
AsnAsn: 1.539 ± 0.281
AsnPro: 2.949 ± 0.564
AsnGln: 1.795 ± 0.38
AsnArg: 1.923 ± 0.289
AsnSer: 3.206 ± 0.393
AsnThr: 2.885 ± 0.536
AsnVal: 3.398 ± 0.46
AsnTrp: 0.898 ± 0.226
AsnTyr: 2.244 ± 0.492
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.18 ± 0.488
ProCys: 0.064 ± 0.069
ProAsp: 1.731 ± 0.297
ProGlu: 3.398 ± 0.511
ProPhe: 1.411 ± 0.237
ProGly: 0.128 ± 0.085
ProHis: 0.385 ± 0.131
ProIle: 2.372 ± 0.416
ProLys: 2.757 ± 0.406
ProLeu: 2.5 ± 0.461
ProMet: 0.898 ± 0.238
ProAsn: 1.859 ± 0.42
ProPro: 0.577 ± 0.186
ProGln: 0.449 ± 0.165
ProArg: 1.026 ± 0.275
ProSer: 2.5 ± 0.48
ProThr: 1.859 ± 0.321
ProVal: 1.795 ± 0.378
ProTrp: 0.256 ± 0.117
ProTyr: 1.346 ± 0.324
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.757 ± 0.457
GlnCys: 0.128 ± 0.097
GlnAsp: 1.667 ± 0.309
GlnGlu: 3.27 ± 0.464
GlnPhe: 0.641 ± 0.177
GlnGly: 2.757 ± 0.431
GlnHis: 0.385 ± 0.149
GlnIle: 1.988 ± 0.487
GlnLys: 2.18 ± 0.403
GlnLeu: 3.27 ± 0.393
GlnMet: 0.962 ± 0.32
GlnAsn: 1.282 ± 0.334
GlnPro: 0.833 ± 0.221
GlnGln: 1.988 ± 0.504
GlnArg: 1.795 ± 0.261
GlnSer: 2.5 ± 0.431
GlnThr: 2.5 ± 0.41
GlnVal: 2.436 ± 0.348
GlnTrp: 0.385 ± 0.142
GlnTyr: 1.859 ± 0.374
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.949 ± 0.47
ArgCys: 0.321 ± 0.116
ArgAsp: 2.693 ± 0.365
ArgGlu: 3.334 ± 0.361
ArgPhe: 2.308 ± 0.356
ArgGly: 2.18 ± 0.327
ArgHis: 0.705 ± 0.229
ArgIle: 2.885 ± 0.453
ArgLys: 3.783 ± 0.632
ArgLeu: 4.488 ± 0.532
ArgMet: 0.769 ± 0.222
ArgAsn: 1.859 ± 0.378
ArgPro: 1.603 ± 0.374
ArgGln: 1.667 ± 0.305
ArgArg: 2.18 ± 0.381
ArgSer: 1.859 ± 0.336
ArgThr: 2.116 ± 0.293
ArgVal: 2.565 ± 0.457
ArgTrp: 0.321 ± 0.143
ArgTyr: 1.988 ± 0.367
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.911 ± 0.562
SerCys: 0.513 ± 0.171
SerAsp: 3.526 ± 0.387
SerGlu: 4.232 ± 0.426
SerPhe: 2.629 ± 0.407
SerGly: 3.975 ± 0.564
SerHis: 0.449 ± 0.195
SerIle: 4.167 ± 0.487
SerLys: 5.77 ± 0.412
SerLeu: 5.706 ± 0.694
SerMet: 1.411 ± 0.267
SerAsn: 3.655 ± 0.468
SerPro: 1.731 ± 0.325
SerGln: 2.052 ± 0.335
SerArg: 2.821 ± 0.425
SerSer: 2.629 ± 0.423
SerThr: 3.783 ± 0.503
SerVal: 4.232 ± 0.473
SerTrp: 0.833 ± 0.23
SerTyr: 2.18 ± 0.396
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.001 ± 0.879
ThrCys: 0.449 ± 0.153
ThrAsp: 3.975 ± 0.493
ThrGlu: 3.975 ± 0.456
ThrPhe: 2.949 ± 0.386
ThrGly: 3.526 ± 0.558
ThrHis: 0.898 ± 0.186
ThrIle: 4.103 ± 0.572
ThrLys: 3.847 ± 0.418
ThrLeu: 6.027 ± 0.629
ThrMet: 1.411 ± 0.286
ThrAsn: 3.206 ± 0.466
ThrPro: 2.629 ± 0.477
ThrGln: 2.116 ± 0.343
ThrArg: 1.859 ± 0.347
ThrSer: 3.398 ± 0.601
ThrThr: 2.885 ± 0.429
ThrVal: 5.065 ± 0.692
ThrTrp: 0.641 ± 0.247
ThrTyr: 2.693 ± 0.389
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.937 ± 0.691
ValCys: 0.705 ± 0.218
ValAsp: 4.36 ± 0.472
ValGlu: 5.322 ± 0.575
ValPhe: 2.629 ± 0.376
ValGly: 4.552 ± 0.658
ValHis: 0.577 ± 0.185
ValIle: 4.552 ± 0.642
ValLys: 5.963 ± 0.653
ValLeu: 5.129 ± 0.616
ValMet: 1.411 ± 0.279
ValAsn: 3.462 ± 0.407
ValPro: 1.988 ± 0.383
ValGln: 2.436 ± 0.359
ValArg: 2.18 ± 0.378
ValSer: 3.526 ± 0.45
ValThr: 4.167 ± 0.553
ValVal: 4.103 ± 0.629
ValTrp: 1.346 ± 0.301
ValTyr: 2.436 ± 0.429
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.641 ± 0.213
TrpCys: 0.064 ± 0.057
TrpAsp: 0.769 ± 0.215
TrpGlu: 1.411 ± 0.29
TrpPhe: 0.577 ± 0.168
TrpGly: 1.154 ± 0.247
TrpHis: 0.192 ± 0.111
TrpIle: 0.898 ± 0.203
TrpLys: 0.769 ± 0.222
TrpLeu: 0.641 ± 0.233
TrpMet: 0.256 ± 0.11
TrpAsn: 1.09 ± 0.222
TrpPro: 0.064 ± 0.067
TrpGln: 0.577 ± 0.166
TrpArg: 0.256 ± 0.115
TrpSer: 0.705 ± 0.246
TrpThr: 0.769 ± 0.247
TrpVal: 0.641 ± 0.165
TrpTrp: 0.128 ± 0.089
TrpTyr: 0.641 ± 0.205
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.372 ± 0.42
TyrCys: 0.256 ± 0.178
TyrAsp: 2.372 ± 0.401
TyrGlu: 4.488 ± 0.667
TyrPhe: 1.411 ± 0.313
TyrGly: 2.629 ± 0.395
TyrHis: 0.898 ± 0.211
TyrIle: 1.603 ± 0.291
TyrLys: 3.398 ± 0.424
TyrLeu: 4.167 ± 0.494
TyrMet: 1.346 ± 0.296
TyrAsn: 2.565 ± 0.461
TyrPro: 1.09 ± 0.262
TyrGln: 1.667 ± 0.333
TyrArg: 1.923 ± 0.363
TyrSer: 2.5 ± 0.363
TyrThr: 2.052 ± 0.318
TyrVal: 2.885 ± 0.404
TyrTrp: 0.385 ± 0.16
TyrTyr: 1.988 ± 0.396
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 87 proteins (15598 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
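A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration rather than the procedure actually used for this page: it assumes that whole proteins are resampled with replacement, that the frequency of a dipeptide is recomputed for each of 100 resamples, and that the reported error is the standard deviation of those estimates. The function names `pair_permille` and `bootstrap_error`, the fixed seed and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the dipeptide frequency over protein-level resamples.

    Assumption: each replicate draws len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy input (made-up sequences, not taken from this proteome).
toy_proteins = ["MKKLA", "AKKA", "MLLAK", "KKKK"]
err = bootstrap_error(toy_proteins, "KK")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the spread of the estimates reflects protein-to-protein variation.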

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski