Amino acid dipeptide frequency for Propionibacterium phage B3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins uniformly at random, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
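As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides and compares them with the uniform expectation of 2.5‰. It is not the Proteome-pI implementation: the function name `dipeptide_permille`, the one-letter encoding, the choice to count pairs only within each protein, and the normalisation by the total number of dipeptides are all assumptions made for this example, and the toy sequences are invented.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is treated as Xaa ("X").
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumption: dipeptides are overlapping pairs of adjacent residues,
    counted within each protein only (never across protein boundaries).
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to "X" (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 21 x 21 = 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 1000 / 400 = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400

toy_proteins = ["MAAGL", "AAK"]  # made-up sequences, for illustration only
freqs = dipeptide_permille(toy_proteins)
over = sorted(dp for dp, f in freqs.items() if f > EXPECTED_PERMILLE)
```

With real data, the dipeptides in `over` would correspond to the ones marked in red, and those below `EXPECTED_PERMILLE` to the ones marked in blue.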

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 20.028 ± 2.809
AlaCys: 0.529 ± 0.211
AlaAsp: 8.117 ± 1.071
AlaGlu: 6.176 ± 0.727
AlaPhe: 2.47 ± 0.53
AlaGly: 11.735 ± 1.704
AlaHis: 2.294 ± 0.419
AlaIle: 4.588 ± 0.857
AlaLys: 4.676 ± 0.92
AlaLeu: 10.235 ± 1.234
AlaMet: 2.823 ± 0.458
AlaAsn: 3.529 ± 0.582
AlaPro: 5.117 ± 0.724
AlaGln: 5.117 ± 0.515
AlaArg: 7.852 ± 1.187
AlaSer: 8.823 ± 1.243
AlaThr: 7.764 ± 0.74
AlaVal: 9.088 ± 1.151
AlaTrp: 3.529 ± 0.716
AlaTyr: 1.323 ± 0.363
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.794 ± 0.275
CysCys: 0.088 ± 0.086
CysAsp: 0.441 ± 0.188
CysGlu: 0.353 ± 0.213
CysPhe: 0.265 ± 0.198
CysGly: 1.412 ± 0.464
CysHis: 0.265 ± 0.157
CysIle: 0.265 ± 0.149
CysLys: 0.529 ± 0.199
CysLeu: 0.529 ± 0.204
CysMet: 0.176 ± 0.126
CysAsn: 0.441 ± 0.191
CysPro: 0.794 ± 0.282
CysGln: 0.265 ± 0.159
CysArg: 0.794 ± 0.231
CysSer: 0.441 ± 0.216
CysThr: 0.441 ± 0.209
CysVal: 0.353 ± 0.206
CysTrp: 0.353 ± 0.156
CysTyr: 0.088 ± 0.086
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.852 ± 0.871
AspCys: 0.441 ± 0.204
AspAsp: 3.882 ± 0.927
AspGlu: 3.617 ± 0.566
AspPhe: 2.47 ± 0.628
AspGly: 6.264 ± 0.585
AspHis: 1.147 ± 0.385
AspIle: 2.735 ± 0.51
AspLys: 2.382 ± 0.455
AspLeu: 4.412 ± 0.608
AspMet: 1.676 ± 0.468
AspAsn: 0.882 ± 0.286
AspPro: 4.588 ± 0.596
AspGln: 2.118 ± 0.507
AspArg: 3.97 ± 0.672
AspSer: 3.529 ± 0.575
AspThr: 3.617 ± 0.675
AspVal: 5.647 ± 0.805
AspTrp: 2.029 ± 0.504
AspTyr: 1.323 ± 0.326
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.617 ± 0.657
GluCys: 0.882 ± 0.32
GluAsp: 2.647 ± 0.303
GluGlu: 2.294 ± 0.512
GluPhe: 1.412 ± 0.423
GluGly: 2.735 ± 0.454
GluHis: 1.147 ± 0.364
GluIle: 2.559 ± 0.459
GluLys: 1.5 ± 0.329
GluLeu: 4.676 ± 0.564
GluMet: 0.971 ± 0.273
GluAsn: 1.059 ± 0.273
GluPro: 2.47 ± 0.499
GluGln: 1.765 ± 0.351
GluArg: 3.706 ± 0.655
GluSer: 2.647 ± 0.463
GluThr: 2.647 ± 0.554
GluVal: 4.853 ± 0.947
GluTrp: 1.765 ± 0.417
GluTyr: 0.618 ± 0.264
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.823 ± 0.523
PheCys: 0.265 ± 0.156
PheAsp: 1.588 ± 0.364
PheGlu: 1.323 ± 0.381
PhePhe: 0.971 ± 0.298
PheGly: 2.206 ± 0.459
PheHis: 0.971 ± 0.268
PheIle: 0.882 ± 0.24
PheLys: 1.853 ± 0.467
PheLeu: 1.941 ± 0.408
PheMet: 0.794 ± 0.241
PheAsn: 0.706 ± 0.227
PhePro: 1.5 ± 0.45
PheGln: 1.5 ± 0.333
PheArg: 2.206 ± 0.544
PheSer: 1.941 ± 0.463
PheThr: 2.382 ± 0.459
PheVal: 2.47 ± 0.533
PheTrp: 0.618 ± 0.255
PheTyr: 0.353 ± 0.173
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.176 ± 1.15
GlyCys: 1.235 ± 0.416
GlyAsp: 4.941 ± 0.866
GlyGlu: 3.706 ± 0.566
GlyPhe: 2.912 ± 0.583
GlyGly: 8.999 ± 1.025
GlyHis: 1.5 ± 0.375
GlyIle: 6.0 ± 1.601
GlyLys: 2.823 ± 0.584
GlyLeu: 8.999 ± 1.321
GlyMet: 2.647 ± 0.564
GlyAsn: 1.765 ± 0.466
GlyPro: 4.676 ± 0.844
GlyGln: 2.647 ± 0.385
GlyArg: 6.529 ± 0.918
GlySer: 4.5 ± 0.6
GlyThr: 5.47 ± 0.531
GlyVal: 6.176 ± 0.944
GlyTrp: 2.647 ± 0.538
GlyTyr: 1.941 ± 0.525
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.5 ± 0.403
HisCys: 0.265 ± 0.142
HisAsp: 1.235 ± 0.334
HisGlu: 0.794 ± 0.209
HisPhe: 0.706 ± 0.348
HisGly: 2.029 ± 0.492
HisHis: 0.618 ± 0.282
HisIle: 0.441 ± 0.171
HisLys: 0.353 ± 0.18
HisLeu: 1.412 ± 0.393
HisMet: 0.618 ± 0.221
HisAsn: 0.529 ± 0.329
HisPro: 1.059 ± 0.38
HisGln: 0.529 ± 0.266
HisArg: 1.676 ± 0.404
HisSer: 0.971 ± 0.283
HisThr: 1.147 ± 0.377
HisVal: 1.765 ± 0.495
HisTrp: 0.529 ± 0.211
HisTyr: 0.529 ± 0.222
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.294 ± 0.68
IleCys: 0.353 ± 0.192
IleAsp: 3.0 ± 0.483
IleGlu: 3.265 ± 0.659
IlePhe: 1.323 ± 0.336
IleGly: 3.176 ± 0.65
IleHis: 0.618 ± 0.257
IleIle: 2.559 ± 0.786
IleLys: 2.559 ± 0.487
IleLeu: 3.265 ± 0.446
IleMet: 0.618 ± 0.225
IleAsn: 0.794 ± 0.311
IlePro: 2.647 ± 0.412
IleGln: 1.941 ± 0.803
IleArg: 2.912 ± 0.579
IleSer: 3.0 ± 0.46
IleThr: 2.647 ± 0.807
IleVal: 3.353 ± 0.58
IleTrp: 0.706 ± 0.208
IleTyr: 0.794 ± 0.231
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.823 ± 1.006
LysCys: 0.353 ± 0.163
LysAsp: 1.941 ± 0.475
LysGlu: 2.206 ± 0.461
LysPhe: 0.794 ± 0.275
LysGly: 3.529 ± 0.612
LysHis: 0.529 ± 0.205
LysIle: 1.323 ± 0.308
LysLys: 1.235 ± 0.413
LysLeu: 3.088 ± 0.456
LysMet: 1.235 ± 0.292
LysAsn: 1.059 ± 0.306
LysPro: 2.294 ± 0.529
LysGln: 0.794 ± 0.221
LysArg: 3.265 ± 0.726
LysSer: 2.118 ± 0.588
LysThr: 2.735 ± 0.517
LysVal: 4.412 ± 0.606
LysTrp: 0.353 ± 0.177
LysTyr: 0.088 ± 0.084
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.352 ± 0.926
LeuCys: 0.353 ± 0.176
LeuAsp: 5.382 ± 0.669
LeuGlu: 3.529 ± 0.683
LeuPhe: 1.853 ± 0.442
LeuGly: 6.882 ± 1.478
LeuHis: 1.235 ± 0.335
LeuIle: 3.706 ± 1.023
LeuLys: 3.088 ± 0.665
LeuLeu: 5.647 ± 0.808
LeuMet: 1.5 ± 0.293
LeuAsn: 1.5 ± 0.316
LeuPro: 4.941 ± 0.591
LeuGln: 3.176 ± 0.588
LeuArg: 6.353 ± 0.733
LeuSer: 5.206 ± 0.887
LeuThr: 6.264 ± 0.806
LeuVal: 5.558 ± 0.654
LeuTrp: 2.47 ± 0.5
LeuTyr: 0.353 ± 0.169
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.706 ± 0.633
MetCys: 0.176 ± 0.126
MetAsp: 0.706 ± 0.303
MetGlu: 0.794 ± 0.19
MetPhe: 0.618 ± 0.23
MetGly: 1.588 ± 0.364
MetHis: 0.353 ± 0.193
MetIle: 1.235 ± 0.423
MetLys: 1.323 ± 0.352
MetLeu: 1.588 ± 0.444
MetMet: 0.618 ± 0.263
MetAsn: 0.618 ± 0.287
MetPro: 1.412 ± 0.318
MetGln: 0.618 ± 0.195
MetArg: 1.323 ± 0.309
MetSer: 2.647 ± 0.413
MetThr: 1.853 ± 0.427
MetVal: 1.5 ± 0.369
MetTrp: 0.176 ± 0.13
MetTyr: 0.265 ± 0.174
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.176 ± 0.694
AsnCys: 0.176 ± 0.114
AsnAsp: 1.235 ± 0.299
AsnGlu: 0.706 ± 0.221
AsnPhe: 0.618 ± 0.214
AsnGly: 2.735 ± 0.64
AsnHis: 0.088 ± 0.082
AsnIle: 0.441 ± 0.216
AsnLys: 0.353 ± 0.201
AsnLeu: 2.029 ± 0.528
AsnMet: 0.176 ± 0.12
AsnAsn: 0.618 ± 0.275
AsnPro: 2.559 ± 0.54
AsnGln: 0.971 ± 0.251
AsnArg: 1.235 ± 0.344
AsnSer: 1.588 ± 0.485
AsnThr: 1.323 ± 0.281
AsnVal: 1.853 ± 0.379
AsnTrp: 0.353 ± 0.164
AsnTyr: 0.441 ± 0.19
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.264 ± 0.932
ProCys: 0.618 ± 0.291
ProAsp: 5.47 ± 0.977
ProGlu: 2.559 ± 0.424
ProPhe: 1.588 ± 0.323
ProGly: 5.382 ± 0.665
ProHis: 1.059 ± 0.358
ProIle: 2.118 ± 0.338
ProLys: 2.735 ± 0.581
ProLeu: 3.441 ± 0.557
ProMet: 1.059 ± 0.575
ProAsn: 0.971 ± 0.275
ProPro: 3.441 ± 0.748
ProGln: 3.265 ± 0.571
ProArg: 2.118 ± 0.451
ProSer: 4.676 ± 0.814
ProThr: 2.206 ± 0.557
ProVal: 4.412 ± 0.685
ProTrp: 1.5 ± 0.363
ProTyr: 0.706 ± 0.235
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.382 ± 0.667
GlnCys: 0.088 ± 0.095
GlnAsp: 2.206 ± 0.513
GlnGlu: 1.323 ± 0.309
GlnPhe: 0.971 ± 0.328
GlnGly: 3.088 ± 0.876
GlnHis: 0.353 ± 0.18
GlnIle: 2.118 ± 0.396
GlnLys: 1.147 ± 0.285
GlnLeu: 4.059 ± 0.844
GlnMet: 0.882 ± 0.247
GlnAsn: 1.235 ± 0.31
GlnPro: 1.5 ± 0.331
GlnGln: 1.323 ± 0.333
GlnArg: 2.823 ± 0.563
GlnSer: 2.029 ± 0.361
GlnThr: 2.206 ± 0.4
GlnVal: 3.0 ± 0.526
GlnTrp: 1.412 ± 0.353
GlnTyr: 0.882 ± 0.351
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.235 ± 0.895
ArgCys: 0.706 ± 0.269
ArgAsp: 4.412 ± 0.827
ArgGlu: 3.882 ± 0.664
ArgPhe: 2.559 ± 0.485
ArgGly: 4.588 ± 0.816
ArgHis: 1.853 ± 0.372
ArgIle: 3.353 ± 0.627
ArgLys: 2.823 ± 0.728
ArgLeu: 5.558 ± 0.816
ArgMet: 2.118 ± 0.397
ArgAsn: 1.412 ± 0.318
ArgPro: 3.882 ± 0.691
ArgGln: 2.47 ± 0.542
ArgArg: 6.882 ± 1.147
ArgSer: 3.617 ± 0.534
ArgThr: 3.882 ± 0.654
ArgVal: 4.676 ± 0.623
ArgTrp: 1.853 ± 0.475
ArgTyr: 1.412 ± 0.396
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.029 ± 1.59
SerCys: 0.353 ± 0.183
SerAsp: 5.029 ± 0.723
SerGlu: 2.559 ± 0.443
SerPhe: 2.294 ± 0.351
SerGly: 6.97 ± 1.049
SerHis: 1.323 ± 0.38
SerIle: 2.912 ± 0.643
SerLys: 2.647 ± 0.418
SerLeu: 3.706 ± 0.524
SerMet: 1.676 ± 0.497
SerAsn: 1.5 ± 0.355
SerPro: 2.47 ± 0.449
SerGln: 2.47 ± 0.466
SerArg: 3.794 ± 0.568
SerSer: 4.412 ± 0.921
SerThr: 4.059 ± 0.564
SerVal: 4.941 ± 0.552
SerTrp: 1.853 ± 0.489
SerTyr: 0.618 ± 0.28
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.264 ± 0.662
ThrCys: 1.147 ± 0.411
ThrAsp: 4.676 ± 0.784
ThrGlu: 2.559 ± 0.38
ThrPhe: 2.206 ± 0.414
ThrGly: 6.617 ± 0.964
ThrHis: 0.794 ± 0.296
ThrIle: 3.176 ± 0.531
ThrLys: 2.735 ± 0.386
ThrLeu: 4.941 ± 0.719
ThrMet: 0.794 ± 0.243
ThrAsn: 1.412 ± 0.364
ThrPro: 3.617 ± 0.667
ThrGln: 1.5 ± 0.331
ThrArg: 4.412 ± 0.702
ThrSer: 4.676 ± 0.532
ThrThr: 4.323 ± 0.786
ThrVal: 4.235 ± 0.602
ThrTrp: 1.5 ± 0.349
ThrTyr: 0.794 ± 0.304
ThrXaa: 0.0 ± 0.0
Val
ValAla: 12.088 ± 1.254
ValCys: 0.618 ± 0.214
ValAsp: 4.147 ± 0.626
ValGlu: 5.47 ± 0.754
ValPhe: 1.941 ± 0.394
ValGly: 6.0 ± 0.843
ValHis: 1.588 ± 0.398
ValIle: 2.912 ± 0.508
ValLys: 3.0 ± 0.515
ValLeu: 4.941 ± 0.695
ValMet: 1.676 ± 0.433
ValAsn: 1.765 ± 0.348
ValPro: 4.853 ± 0.794
ValGln: 3.0 ± 0.672
ValArg: 4.412 ± 0.883
ValSer: 4.853 ± 0.605
ValThr: 5.911 ± 0.671
ValVal: 6.0 ± 0.843
ValTrp: 1.676 ± 0.496
ValTyr: 0.529 ± 0.208
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.47 ± 0.602
TrpCys: 0.176 ± 0.13
TrpAsp: 2.47 ± 0.614
TrpGlu: 1.147 ± 0.317
TrpPhe: 0.618 ± 0.209
TrpGly: 2.47 ± 0.408
TrpHis: 0.706 ± 0.292
TrpIle: 0.971 ± 0.307
TrpLys: 0.971 ± 0.349
TrpLeu: 2.559 ± 0.669
TrpMet: 0.706 ± 0.256
TrpAsn: 0.618 ± 0.212
TrpPro: 1.059 ± 0.33
TrpGln: 1.588 ± 0.328
TrpArg: 1.765 ± 0.438
TrpSer: 1.412 ± 0.414
TrpThr: 1.323 ± 0.355
TrpVal: 2.029 ± 0.47
TrpTrp: 0.706 ± 0.253
TrpTyr: 0.353 ± 0.175
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.941 ± 0.435
TyrCys: 0.265 ± 0.158
TyrAsp: 1.059 ± 0.291
TyrGlu: 0.706 ± 0.319
TyrPhe: 0.706 ± 0.216
TyrGly: 0.882 ± 0.373
TyrHis: 0.265 ± 0.18
TyrIle: 0.529 ± 0.231
TyrLys: 0.441 ± 0.179
TyrLeu: 1.323 ± 0.364
TyrMet: 0.265 ± 0.157
TyrAsn: 0.265 ± 0.167
TyrPro: 0.706 ± 0.291
TyrGln: 0.971 ± 0.315
TyrArg: 1.147 ± 0.323
TyrSer: 0.529 ± 0.184
TyrThr: 0.176 ± 0.114
TyrVal: 1.059 ± 0.342
TyrTrp: 0.176 ± 0.125
TyrTyr: 0.441 ± 0.239
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (11335 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
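To make the note above more concrete, here is a minimal Python sketch of a protein-level bootstrap: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is only an illustration under stated assumptions, not the database's actual procedure: the helper names `dipeptide_permille` and `bootstrap_error`, the use of the population standard deviation as the error, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the reported error is the spread of n_boot resampled estimates.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.pstdev(estimates)


toy_proteins = ["MAAGL", "AAK", "MKLV", "GAAAL"]  # made-up sequences
point = dipeptide_permille(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the dipeptide context inside every protein is preserved in each replicate.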

You can download the dipeptide statistics above (along with other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski