Amino acid dipepetide frequency for Rhodococcus phage Gollum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.323AlaAla: 10.323 ± 1.284
0.641AlaCys: 0.641 ± 0.262
5.482AlaAsp: 5.482 ± 0.683
8.828AlaGlu: 8.828 ± 0.867
2.848AlaPhe: 2.848 ± 0.386
7.76AlaGly: 7.76 ± 0.857
1.21AlaHis: 1.21 ± 0.306
4.272AlaIle: 4.272 ± 0.769
4.058AlaLys: 4.058 ± 0.47
8.828AlaLeu: 8.828 ± 0.886
2.349AlaMet: 2.349 ± 0.332
2.99AlaAsn: 2.99 ± 0.545
4.272AlaPro: 4.272 ± 0.581
3.346AlaGln: 3.346 ± 0.555
6.052AlaArg: 6.052 ± 0.677
5.197AlaSer: 5.197 ± 0.594
4.699AlaThr: 4.699 ± 0.685
7.262AlaVal: 7.262 ± 0.664
1.637AlaTrp: 1.637 ± 0.35
2.634AlaTyr: 2.634 ± 0.461
0.0AlaXaa: 0.0 ± 0.0
Cys
0.498CysAla: 0.498 ± 0.176
0.0CysCys: 0.0 ± 0.0
0.498CysAsp: 0.498 ± 0.196
0.854CysGlu: 0.854 ± 0.294
0.142CysPhe: 0.142 ± 0.107
1.139CysGly: 1.139 ± 0.277
0.142CysHis: 0.142 ± 0.099
0.214CysIle: 0.214 ± 0.122
0.57CysLys: 0.57 ± 0.238
0.285CysLeu: 0.285 ± 0.14
0.071CysMet: 0.071 ± 0.071
0.285CysAsn: 0.285 ± 0.117
0.712CysPro: 0.712 ± 0.23
0.427CysGln: 0.427 ± 0.147
0.427CysArg: 0.427 ± 0.169
0.427CysSer: 0.427 ± 0.192
0.356CysThr: 0.356 ± 0.165
0.854CysVal: 0.854 ± 0.308
0.142CysTrp: 0.142 ± 0.097
0.427CysTyr: 0.427 ± 0.176
0.0CysXaa: 0.0 ± 0.0
Asp
5.98AspAla: 5.98 ± 0.703
0.712AspCys: 0.712 ± 0.231
3.061AspAsp: 3.061 ± 0.409
5.34AspGlu: 5.34 ± 0.724
1.993AspPhe: 1.993 ± 0.362
5.98AspGly: 5.98 ± 0.787
1.851AspHis: 1.851 ± 0.364
1.139AspIle: 1.139 ± 0.285
3.133AspLys: 3.133 ± 0.502
5.909AspLeu: 5.909 ± 0.683
2.065AspMet: 2.065 ± 0.343
2.207AspAsn: 2.207 ± 0.378
4.77AspPro: 4.77 ± 0.703
2.136AspGln: 2.136 ± 0.348
3.346AspArg: 3.346 ± 0.492
3.56AspSer: 3.56 ± 0.482
3.417AspThr: 3.417 ± 0.456
5.482AspVal: 5.482 ± 0.583
1.353AspTrp: 1.353 ± 0.315
2.563AspTyr: 2.563 ± 0.526
0.0AspXaa: 0.0 ± 0.0
Glu
7.191GluAla: 7.191 ± 0.858
0.57GluCys: 0.57 ± 0.185
5.197GluAsp: 5.197 ± 0.596
4.841GluGlu: 4.841 ± 0.798
2.634GluPhe: 2.634 ± 0.528
5.838GluGly: 5.838 ± 0.603
1.424GluHis: 1.424 ± 0.332
4.485GluIle: 4.485 ± 0.734
2.421GluLys: 2.421 ± 0.464
7.475GluLeu: 7.475 ± 0.89
1.21GluMet: 1.21 ± 0.315
2.705GluAsn: 2.705 ± 0.43
2.563GluPro: 2.563 ± 0.512
2.563GluGln: 2.563 ± 0.408
3.773GluArg: 3.773 ± 0.722
3.845GluSer: 3.845 ± 0.527
4.414GluThr: 4.414 ± 0.56
6.692GluVal: 6.692 ± 0.767
1.495GluTrp: 1.495 ± 0.368
1.851GluTyr: 1.851 ± 0.329
0.0GluXaa: 0.0 ± 0.0
Phe
3.417PheAla: 3.417 ± 0.559
0.214PheCys: 0.214 ± 0.117
2.634PheAsp: 2.634 ± 0.451
2.136PheGlu: 2.136 ± 0.366
0.783PhePhe: 0.783 ± 0.21
2.777PheGly: 2.777 ± 0.43
0.641PheHis: 0.641 ± 0.214
1.566PheIle: 1.566 ± 0.317
1.353PheLys: 1.353 ± 0.315
2.99PheLeu: 2.99 ± 0.532
0.641PheMet: 0.641 ± 0.182
1.282PheAsn: 1.282 ± 0.264
1.353PhePro: 1.353 ± 0.247
1.139PheGln: 1.139 ± 0.274
2.349PheArg: 2.349 ± 0.375
2.634PheSer: 2.634 ± 0.338
1.78PheThr: 1.78 ± 0.32
2.563PheVal: 2.563 ± 0.468
0.498PheTrp: 0.498 ± 0.164
0.997PheTyr: 0.997 ± 0.301
0.0PheXaa: 0.0 ± 0.0
Gly
6.835GlyAla: 6.835 ± 0.759
0.712GlyCys: 0.712 ± 0.258
6.55GlyAsp: 6.55 ± 0.633
4.984GlyGlu: 4.984 ± 0.718
4.343GlyPhe: 4.343 ± 0.706
6.052GlyGly: 6.052 ± 0.634
1.851GlyHis: 1.851 ± 0.344
4.2GlyIle: 4.2 ± 0.57
3.845GlyLys: 3.845 ± 0.447
5.838GlyLeu: 5.838 ± 0.797
2.207GlyMet: 2.207 ± 0.652
2.99GlyAsn: 2.99 ± 0.45
3.417GlyPro: 3.417 ± 0.417
3.417GlyGln: 3.417 ± 0.575
4.2GlyArg: 4.2 ± 0.638
5.411GlySer: 5.411 ± 0.785
5.411GlyThr: 5.411 ± 0.616
7.048GlyVal: 7.048 ± 0.861
2.207GlyTrp: 2.207 ± 0.409
3.204GlyTyr: 3.204 ± 0.611
0.0GlyXaa: 0.0 ± 0.0
His
1.353HisAla: 1.353 ± 0.302
0.071HisCys: 0.071 ± 0.065
1.424HisAsp: 1.424 ± 0.331
1.566HisGlu: 1.566 ± 0.416
0.783HisPhe: 0.783 ± 0.206
1.139HisGly: 1.139 ± 0.299
0.427HisHis: 0.427 ± 0.184
1.424HisIle: 1.424 ± 0.341
0.57HisLys: 0.57 ± 0.221
1.424HisLeu: 1.424 ± 0.366
0.57HisMet: 0.57 ± 0.248
0.57HisAsn: 0.57 ± 0.171
1.21HisPro: 1.21 ± 0.268
0.641HisGln: 0.641 ± 0.208
1.78HisArg: 1.78 ± 0.365
0.854HisSer: 0.854 ± 0.322
1.068HisThr: 1.068 ± 0.296
1.353HisVal: 1.353 ± 0.287
0.498HisTrp: 0.498 ± 0.209
0.854HisTyr: 0.854 ± 0.244
0.0HisXaa: 0.0 ± 0.0
Ile
5.34IleAla: 5.34 ± 0.749
0.356IleCys: 0.356 ± 0.127
3.56IleAsp: 3.56 ± 0.507
3.489IleGlu: 3.489 ± 0.557
0.854IlePhe: 0.854 ± 0.233
4.912IleGly: 4.912 ± 0.596
0.926IleHis: 0.926 ± 0.199
1.566IleIle: 1.566 ± 0.277
1.353IleLys: 1.353 ± 0.306
2.99IleLeu: 2.99 ± 0.495
0.712IleMet: 0.712 ± 0.225
1.353IleAsn: 1.353 ± 0.293
2.278IlePro: 2.278 ± 0.51
1.068IleGln: 1.068 ± 0.27
3.417IleArg: 3.417 ± 0.47
2.207IleSer: 2.207 ± 0.471
3.702IleThr: 3.702 ± 0.496
3.417IleVal: 3.417 ± 0.529
0.498IleTrp: 0.498 ± 0.21
1.637IleTyr: 1.637 ± 0.358
0.0IleXaa: 0.0 ± 0.0
Lys
4.058LysAla: 4.058 ± 0.491
0.142LysCys: 0.142 ± 0.097
2.634LysAsp: 2.634 ± 0.362
2.065LysGlu: 2.065 ± 0.341
1.139LysPhe: 1.139 ± 0.295
3.773LysGly: 3.773 ± 0.693
1.068LysHis: 1.068 ± 0.274
1.495LysIle: 1.495 ± 0.305
1.566LysLys: 1.566 ± 0.388
4.058LysLeu: 4.058 ± 0.603
1.068LysMet: 1.068 ± 0.321
1.353LysAsn: 1.353 ± 0.275
1.78LysPro: 1.78 ± 0.422
0.854LysGln: 0.854 ± 0.242
2.492LysArg: 2.492 ± 0.464
2.278LysSer: 2.278 ± 0.329
2.634LysThr: 2.634 ± 0.559
3.204LysVal: 3.204 ± 0.538
0.712LysTrp: 0.712 ± 0.235
0.997LysTyr: 0.997 ± 0.233
0.0LysXaa: 0.0 ± 0.0
Leu
9.469LeuAla: 9.469 ± 0.729
0.641LeuCys: 0.641 ± 0.242
5.482LeuAsp: 5.482 ± 0.745
6.123LeuGlu: 6.123 ± 0.763
2.349LeuPhe: 2.349 ± 0.401
6.408LeuGly: 6.408 ± 0.86
1.566LeuHis: 1.566 ± 0.446
3.275LeuIle: 3.275 ± 0.436
3.56LeuLys: 3.56 ± 0.497
5.34LeuLeu: 5.34 ± 0.63
1.709LeuMet: 1.709 ± 0.464
2.278LeuAsn: 2.278 ± 0.435
4.556LeuPro: 4.556 ± 0.513
2.919LeuGln: 2.919 ± 0.453
5.268LeuArg: 5.268 ± 0.716
5.411LeuSer: 5.411 ± 0.624
6.265LeuThr: 6.265 ± 0.683
6.052LeuVal: 6.052 ± 0.636
1.495LeuTrp: 1.495 ± 0.307
2.634LeuTyr: 2.634 ± 0.43
0.0LeuXaa: 0.0 ± 0.0
Met
2.065MetAla: 2.065 ± 0.386
0.0MetCys: 0.0 ± 0.0
1.21MetAsp: 1.21 ± 0.322
1.353MetGlu: 1.353 ± 0.278
0.854MetPhe: 0.854 ± 0.254
2.421MetGly: 2.421 ± 0.401
0.427MetHis: 0.427 ± 0.164
1.353MetIle: 1.353 ± 0.327
1.21MetLys: 1.21 ± 0.264
1.566MetLeu: 1.566 ± 0.289
0.356MetMet: 0.356 ± 0.155
0.783MetAsn: 0.783 ± 0.237
0.854MetPro: 0.854 ± 0.228
0.498MetGln: 0.498 ± 0.208
1.851MetArg: 1.851 ± 0.458
2.848MetSer: 2.848 ± 0.397
1.922MetThr: 1.922 ± 0.345
1.21MetVal: 1.21 ± 0.276
0.214MetTrp: 0.214 ± 0.114
0.57MetTyr: 0.57 ± 0.226
0.0MetXaa: 0.0 ± 0.0
Asn
3.061AsnAla: 3.061 ± 0.446
0.712AsnCys: 0.712 ± 0.284
2.065AsnAsp: 2.065 ± 0.362
1.993AsnGlu: 1.993 ± 0.409
1.068AsnPhe: 1.068 ± 0.252
3.631AsnGly: 3.631 ± 0.539
0.854AsnHis: 0.854 ± 0.217
1.21AsnIle: 1.21 ± 0.262
1.21AsnLys: 1.21 ± 0.275
3.061AsnLeu: 3.061 ± 0.392
0.997AsnMet: 0.997 ± 0.304
1.495AsnAsn: 1.495 ± 0.363
2.349AsnPro: 2.349 ± 0.385
0.783AsnGln: 0.783 ± 0.218
1.78AsnArg: 1.78 ± 0.409
2.207AsnSer: 2.207 ± 0.351
1.993AsnThr: 1.993 ± 0.415
2.421AsnVal: 2.421 ± 0.344
0.854AsnTrp: 0.854 ± 0.271
1.139AsnTyr: 1.139 ± 0.306
0.0AsnXaa: 0.0 ± 0.0
Pro
4.058ProAla: 4.058 ± 0.604
0.356ProCys: 0.356 ± 0.168
2.99ProAsp: 2.99 ± 0.446
3.275ProGlu: 3.275 ± 0.497
1.21ProPhe: 1.21 ± 0.231
4.2ProGly: 4.2 ± 0.611
0.783ProHis: 0.783 ± 0.219
2.634ProIle: 2.634 ± 0.455
1.566ProLys: 1.566 ± 0.294
3.845ProLeu: 3.845 ± 0.462
1.353ProMet: 1.353 ± 0.251
1.922ProAsn: 1.922 ± 0.4
1.282ProPro: 1.282 ± 0.275
1.637ProGln: 1.637 ± 0.262
2.777ProArg: 2.777 ± 0.484
2.421ProSer: 2.421 ± 0.365
3.916ProThr: 3.916 ± 0.528
4.058ProVal: 4.058 ± 0.571
1.139ProTrp: 1.139 ± 0.305
1.495ProTyr: 1.495 ± 0.312
0.0ProXaa: 0.0 ± 0.0
Gln
3.417GlnAla: 3.417 ± 0.501
0.214GlnCys: 0.214 ± 0.163
1.922GlnAsp: 1.922 ± 0.375
1.566GlnGlu: 1.566 ± 0.368
1.21GlnPhe: 1.21 ± 0.228
2.705GlnGly: 2.705 ± 0.55
0.641GlnHis: 0.641 ± 0.186
1.78GlnIle: 1.78 ± 0.285
0.641GlnLys: 0.641 ± 0.184
2.634GlnLeu: 2.634 ± 0.472
0.997GlnMet: 0.997 ± 0.26
0.783GlnAsn: 0.783 ± 0.21
1.78GlnPro: 1.78 ± 0.387
1.424GlnGln: 1.424 ± 0.313
3.061GlnArg: 3.061 ± 0.458
2.207GlnSer: 2.207 ± 0.356
1.637GlnThr: 1.637 ± 0.336
3.417GlnVal: 3.417 ± 0.505
0.641GlnTrp: 0.641 ± 0.201
0.926GlnTyr: 0.926 ± 0.269
0.0GlnXaa: 0.0 ± 0.0
Arg
5.268ArgAla: 5.268 ± 0.638
1.139ArgCys: 1.139 ± 0.288
4.2ArgAsp: 4.2 ± 0.523
4.699ArgGlu: 4.699 ± 0.782
2.492ArgPhe: 2.492 ± 0.431
3.916ArgGly: 3.916 ± 0.501
1.21ArgHis: 1.21 ± 0.376
2.777ArgIle: 2.777 ± 0.572
2.99ArgLys: 2.99 ± 0.473
5.411ArgLeu: 5.411 ± 0.67
0.997ArgMet: 0.997 ± 0.314
1.993ArgAsn: 1.993 ± 0.304
2.705ArgPro: 2.705 ± 0.403
2.349ArgGln: 2.349 ± 0.47
3.56ArgArg: 3.56 ± 0.454
3.631ArgSer: 3.631 ± 0.472
3.845ArgThr: 3.845 ± 0.575
4.77ArgVal: 4.77 ± 0.463
1.495ArgTrp: 1.495 ± 0.277
1.78ArgTyr: 1.78 ± 0.405
0.0ArgXaa: 0.0 ± 0.0
Ser
4.912SerAla: 4.912 ± 0.527
0.57SerCys: 0.57 ± 0.187
4.129SerAsp: 4.129 ± 0.492
4.628SerGlu: 4.628 ± 0.701
2.492SerPhe: 2.492 ± 0.462
5.909SerGly: 5.909 ± 0.804
1.139SerHis: 1.139 ± 0.239
2.278SerIle: 2.278 ± 0.381
2.278SerLys: 2.278 ± 0.426
5.624SerLeu: 5.624 ± 0.653
1.637SerMet: 1.637 ± 0.376
2.705SerAsn: 2.705 ± 0.4
2.136SerPro: 2.136 ± 0.374
1.851SerGln: 1.851 ± 0.377
3.631SerArg: 3.631 ± 0.445
3.56SerSer: 3.56 ± 0.625
3.56SerThr: 3.56 ± 0.401
4.912SerVal: 4.912 ± 0.538
1.21SerTrp: 1.21 ± 0.286
1.566SerTyr: 1.566 ± 0.345
0.0SerXaa: 0.0 ± 0.0
Thr
5.34ThrAla: 5.34 ± 0.746
0.356ThrCys: 0.356 ± 0.165
3.346ThrAsp: 3.346 ± 0.415
5.197ThrGlu: 5.197 ± 0.568
1.993ThrPhe: 1.993 ± 0.394
5.767ThrGly: 5.767 ± 0.751
0.997ThrHis: 0.997 ± 0.281
3.275ThrIle: 3.275 ± 0.559
2.634ThrLys: 2.634 ± 0.424
5.055ThrLeu: 5.055 ± 0.442
1.282ThrMet: 1.282 ± 0.27
1.78ThrAsn: 1.78 ± 0.398
3.845ThrPro: 3.845 ± 0.479
1.993ThrGln: 1.993 ± 0.342
3.346ThrArg: 3.346 ± 0.434
3.133ThrSer: 3.133 ± 0.418
3.489ThrThr: 3.489 ± 0.597
4.841ThrVal: 4.841 ± 0.561
1.068ThrTrp: 1.068 ± 0.217
2.634ThrTyr: 2.634 ± 0.403
0.0ThrXaa: 0.0 ± 0.0
Val
7.689ValAla: 7.689 ± 0.76
0.926ValCys: 0.926 ± 0.285
5.411ValAsp: 5.411 ± 0.556
6.336ValGlu: 6.336 ± 0.654
2.919ValPhe: 2.919 ± 0.41
5.767ValGly: 5.767 ± 0.769
1.353ValHis: 1.353 ± 0.342
4.2ValIle: 4.2 ± 0.547
3.061ValLys: 3.061 ± 0.443
6.621ValLeu: 6.621 ± 0.613
1.566ValMet: 1.566 ± 0.424
3.061ValAsn: 3.061 ± 0.408
3.702ValPro: 3.702 ± 0.406
2.492ValGln: 2.492 ± 0.377
4.699ValArg: 4.699 ± 0.552
5.34ValSer: 5.34 ± 0.612
4.699ValThr: 4.699 ± 0.63
6.336ValVal: 6.336 ± 0.843
1.139ValTrp: 1.139 ± 0.27
2.065ValTyr: 2.065 ± 0.439
0.0ValXaa: 0.0 ± 0.0
Trp
1.282TrpAla: 1.282 ± 0.275
0.0TrpCys: 0.0 ± 0.0
1.424TrpAsp: 1.424 ± 0.294
1.424TrpGlu: 1.424 ± 0.292
0.783TrpPhe: 0.783 ± 0.281
1.566TrpGly: 1.566 ± 0.397
0.57TrpHis: 0.57 ± 0.217
0.926TrpIle: 0.926 ± 0.211
0.783TrpLys: 0.783 ± 0.222
1.282TrpLeu: 1.282 ± 0.343
0.641TrpMet: 0.641 ± 0.197
0.997TrpAsn: 0.997 ± 0.307
0.712TrpPro: 0.712 ± 0.203
0.854TrpGln: 0.854 ± 0.217
0.57TrpArg: 0.57 ± 0.175
1.353TrpSer: 1.353 ± 0.345
1.353TrpThr: 1.353 ± 0.266
1.353TrpVal: 1.353 ± 0.279
0.356TrpTrp: 0.356 ± 0.197
0.641TrpTyr: 0.641 ± 0.223
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.133TyrAla: 3.133 ± 0.543
0.142TyrCys: 0.142 ± 0.105
2.777TyrAsp: 2.777 ± 0.443
2.492TyrGlu: 2.492 ± 0.499
0.854TyrPhe: 0.854 ± 0.235
3.061TyrGly: 3.061 ± 0.508
0.498TyrHis: 0.498 ± 0.169
1.566TyrIle: 1.566 ± 0.361
0.57TyrLys: 0.57 ± 0.182
2.563TyrLeu: 2.563 ± 0.421
1.068TyrMet: 1.068 ± 0.258
1.353TyrAsn: 1.353 ± 0.296
0.641TyrPro: 0.641 ± 0.229
1.21TyrGln: 1.21 ± 0.211
2.848TyrArg: 2.848 ± 0.528
2.207TyrSer: 2.207 ± 0.436
1.282TyrThr: 1.282 ± 0.333
2.136TyrVal: 2.136 ± 0.459
0.214TyrTrp: 0.214 ± 0.122
0.641TyrTyr: 0.641 ± 0.211
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (14047 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski