Amino acid dipeptide frequency for Bacillus phage PBC5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
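The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used to build the database: the function names are hypothetical, sequences are assumed to be given in one-letter code, any non-standard letter is mapped to X (Xaa), and frequencies are normalised by the total number of dipeptides counted, which may differ from the normalisation used for the table. The red/blue marking is assumed to compare each value with the uniform expectation of 2.5‰.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; everything else becomes "X" (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_UNIFORM = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Dipeptides are overlapping pairs of adjacent residues, counted within
    each protein only (never across protein boundaries).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq, expected=EXPECTED_UNIFORM):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"   # would be marked red
    if freq < expected:
        return "under"  # would be marked blue
    return "expected"


# Toy input (made-up sequences, not taken from the proteome).
toy = ["MKKLA", "AKKA"]
freqs = dipeptide_permille(toy)
```

With the values from the table, classify(6.328) labels AlaAla as overrepresented and classify(0.651) labels AlaCys as underrepresented.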

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.328 ± 0.773
AlaCys: 0.651 ± 0.208
AlaAsp: 3.489 ± 0.51
AlaGlu: 4.968 ± 0.624
AlaPhe: 3.371 ± 0.565
AlaGly: 4.672 ± 0.545
AlaHis: 0.828 ± 0.208
AlaIle: 6.269 ± 0.888
AlaLys: 6.624 ± 0.61
AlaLeu: 5.323 ± 0.633
AlaMet: 2.72 ± 0.388
AlaAsn: 3.962 ± 0.591
AlaPro: 2.247 ± 0.382
AlaGln: 3.371 ± 0.403
AlaArg: 3.312 ± 0.474
AlaSer: 3.194 ± 0.38
AlaThr: 4.79 ± 0.688
AlaVal: 5.086 ± 0.808
AlaTrp: 1.242 ± 0.564
AlaTyr: 3.194 ± 0.456
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.237 ± 0.13
CysCys: 0.118 ± 0.07
CysAsp: 0.414 ± 0.157
CysGlu: 1.124 ± 0.309
CysPhe: 0.355 ± 0.168
CysGly: 0.946 ± 0.304
CysHis: 0.237 ± 0.117
CysIle: 0.591 ± 0.264
CysLys: 0.651 ± 0.197
CysLeu: 0.591 ± 0.227
CysMet: 0.532 ± 0.198
CysAsn: 0.296 ± 0.144
CysPro: 0.177 ± 0.105
CysGln: 0.355 ± 0.156
CysArg: 0.355 ± 0.151
CysSer: 0.177 ± 0.098
CysThr: 0.296 ± 0.122
CysVal: 0.651 ± 0.144
CysTrp: 0.0 ± 0.0
CysTyr: 0.177 ± 0.082
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.968 ± 0.685
AspCys: 0.237 ± 0.122
AspAsp: 2.839 ± 0.434
AspGlu: 4.968 ± 0.855
AspPhe: 2.602 ± 0.362
AspGly: 5.737 ± 1.003
AspHis: 1.124 ± 0.248
AspIle: 3.785 ± 0.442
AspLys: 4.081 ± 0.43
AspLeu: 4.849 ± 0.614
AspMet: 1.892 ± 0.317
AspAsn: 2.78 ± 0.53
AspPro: 1.833 ± 0.339
AspGln: 1.892 ± 0.295
AspArg: 2.07 ± 0.343
AspSer: 2.839 ± 0.344
AspThr: 2.839 ± 0.438
AspVal: 4.14 ± 0.477
AspTrp: 0.887 ± 0.2
AspTyr: 1.774 ± 0.307
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.5 ± 0.519
GluCys: 0.71 ± 0.211
GluAsp: 3.608 ± 0.513
GluGlu: 6.151 ± 0.842
GluPhe: 2.366 ± 0.421
GluGly: 5.441 ± 0.676
GluHis: 1.419 ± 0.279
GluIle: 4.258 ± 0.479
GluLys: 7.57 ± 0.785
GluLeu: 6.86 ± 0.666
GluMet: 2.425 ± 0.343
GluAsn: 3.43 ± 0.507
GluPro: 2.543 ± 0.385
GluGln: 4.199 ± 0.768
GluArg: 4.022 ± 0.475
GluSer: 3.134 ± 0.519
GluThr: 4.495 ± 0.535
GluVal: 5.382 ± 0.675
GluTrp: 1.183 ± 0.251
GluTyr: 2.957 ± 0.609
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.129 ± 0.311
PheCys: 0.355 ± 0.143
PheAsp: 3.312 ± 0.452
PheGlu: 2.839 ± 0.389
PhePhe: 1.005 ± 0.222
PheGly: 3.253 ± 0.467
PheHis: 0.769 ± 0.212
PheIle: 2.07 ± 0.365
PheLys: 3.371 ± 0.427
PheLeu: 2.306 ± 0.433
PheMet: 0.887 ± 0.278
PheAsn: 2.011 ± 0.318
PhePro: 1.538 ± 0.269
PheGln: 1.774 ± 0.269
PheArg: 1.301 ± 0.263
PheSer: 2.247 ± 0.307
PheThr: 2.543 ± 0.514
PheVal: 2.661 ± 0.346
PheTrp: 0.414 ± 0.149
PheTyr: 2.366 ± 0.392
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.731 ± 0.833
GlyCys: 0.591 ± 0.157
GlyAsp: 3.194 ± 0.501
GlyGlu: 4.909 ± 0.522
GlyPhe: 3.548 ± 0.421
GlyGly: 5.263 ± 0.832
GlyHis: 0.887 ± 0.259
GlyIle: 4.731 ± 0.549
GlyLys: 5.796 ± 0.633
GlyLeu: 4.554 ± 0.634
GlyMet: 2.247 ± 0.417
GlyAsn: 4.14 ± 0.587
GlyPro: 0.177 ± 0.104
GlyGln: 2.839 ± 0.372
GlyArg: 3.016 ± 0.401
GlySer: 3.312 ± 0.522
GlyThr: 4.849 ± 0.808
GlyVal: 4.968 ± 0.481
GlyTrp: 1.301 ± 0.273
GlyTyr: 3.371 ± 0.521
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.242 ± 0.296
HisCys: 0.296 ± 0.125
HisAsp: 1.183 ± 0.266
HisGlu: 1.479 ± 0.319
HisPhe: 0.769 ± 0.242
HisGly: 1.656 ± 0.387
HisHis: 0.769 ± 0.29
HisIle: 1.36 ± 0.306
HisLys: 1.36 ± 0.322
HisLeu: 1.065 ± 0.251
HisMet: 0.946 ± 0.236
HisAsn: 0.887 ± 0.241
HisPro: 0.887 ± 0.243
HisGln: 0.473 ± 0.159
HisArg: 0.828 ± 0.224
HisSer: 0.651 ± 0.218
HisThr: 1.183 ± 0.24
HisVal: 1.183 ± 0.304
HisTrp: 0.296 ± 0.103
HisTyr: 0.828 ± 0.249
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.962 ± 0.442
IleCys: 0.591 ± 0.226
IleAsp: 5.263 ± 0.535
IleGlu: 5.5 ± 0.537
IlePhe: 2.247 ± 0.419
IleGly: 4.613 ± 0.68
IleHis: 1.065 ± 0.301
IleIle: 3.608 ± 0.451
IleLys: 5.441 ± 0.617
IleLeu: 3.844 ± 0.47
IleMet: 1.715 ± 0.336
IleAsn: 4.317 ± 0.545
IlePro: 2.306 ± 0.393
IleGln: 2.543 ± 0.44
IleArg: 3.016 ± 0.44
IleSer: 3.134 ± 0.421
IleThr: 3.844 ± 0.421
IleVal: 4.672 ± 0.611
IleTrp: 1.124 ± 0.589
IleTyr: 1.774 ± 0.261
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.747 ± 0.619
LysCys: 0.355 ± 0.153
LysAsp: 4.14 ± 0.489
LysGlu: 6.801 ± 0.915
LysPhe: 2.839 ± 0.357
LysGly: 5.559 ± 0.589
LysHis: 1.892 ± 0.35
LysIle: 4.258 ± 0.51
LysLys: 6.565 ± 0.77
LysLeu: 5.618 ± 0.629
LysMet: 3.134 ± 0.516
LysAsn: 3.667 ± 0.523
LysPro: 3.075 ± 0.425
LysGln: 3.489 ± 0.51
LysArg: 4.909 ± 0.748
LysSer: 3.489 ± 0.818
LysThr: 4.554 ± 0.945
LysVal: 4.909 ± 0.631
LysTrp: 1.065 ± 0.23
LysTyr: 3.134 ± 0.382
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.032 ± 0.576
LeuCys: 0.651 ± 0.244
LeuAsp: 4.081 ± 0.458
LeuGlu: 6.387 ± 0.93
LeuPhe: 2.129 ± 0.307
LeuGly: 4.199 ± 0.457
LeuHis: 1.301 ± 0.327
LeuIle: 3.962 ± 0.572
LeuLys: 6.683 ± 0.57
LeuLeu: 4.613 ± 0.608
LeuMet: 2.129 ± 0.373
LeuAsn: 2.839 ± 0.574
LeuPro: 1.656 ± 0.331
LeuGln: 3.903 ± 0.552
LeuArg: 3.667 ± 0.502
LeuSer: 4.79 ± 0.497
LeuThr: 4.199 ± 0.422
LeuVal: 4.14 ± 0.572
LeuTrp: 0.71 ± 0.199
LeuTyr: 3.134 ± 0.543
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.898 ± 0.493
MetCys: 0.118 ± 0.078
MetAsp: 1.538 ± 0.365
MetGlu: 2.366 ± 0.373
MetPhe: 0.651 ± 0.19
MetGly: 1.538 ± 0.402
MetHis: 0.591 ± 0.185
MetIle: 2.425 ± 0.401
MetLys: 3.134 ± 0.378
MetLeu: 1.952 ± 0.369
MetMet: 0.946 ± 0.239
MetAsn: 2.306 ± 0.36
MetPro: 1.065 ± 0.222
MetGln: 1.301 ± 0.261
MetArg: 1.597 ± 0.32
MetSer: 2.188 ± 0.346
MetThr: 1.656 ± 0.271
MetVal: 1.656 ± 0.459
MetTrp: 0.651 ± 0.175
MetTyr: 1.301 ± 0.243
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.081 ± 0.678
AsnCys: 0.769 ± 0.324
AsnAsp: 3.844 ± 0.45
AsnGlu: 3.489 ± 0.488
AsnPhe: 2.839 ± 0.537
AsnGly: 5.441 ± 0.53
AsnHis: 0.946 ± 0.221
AsnIle: 3.726 ± 0.434
AsnLys: 3.726 ± 0.411
AsnLeu: 4.849 ± 0.536
AsnMet: 1.597 ± 0.289
AsnAsn: 2.425 ± 0.455
AsnPro: 2.188 ± 0.424
AsnGln: 1.952 ± 0.38
AsnArg: 2.129 ± 0.369
AsnSer: 2.484 ± 0.445
AsnThr: 3.43 ± 0.593
AsnVal: 3.371 ± 0.524
AsnTrp: 0.71 ± 0.224
AsnTyr: 1.597 ± 0.301
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.833 ± 0.412
ProCys: 0.414 ± 0.149
ProAsp: 2.661 ± 0.394
ProGlu: 2.366 ± 0.329
ProPhe: 1.36 ± 0.311
ProGly: 0.0 ± 0.0
ProHis: 0.769 ± 0.195
ProIle: 2.366 ± 0.352
ProLys: 2.602 ± 0.41
ProLeu: 1.892 ± 0.368
ProMet: 0.769 ± 0.227
ProAsn: 1.774 ± 0.39
ProPro: 1.36 ± 0.419
ProGln: 1.479 ± 0.252
ProArg: 1.124 ± 0.255
ProSer: 2.247 ± 0.324
ProThr: 2.898 ± 0.462
ProVal: 1.892 ± 0.348
ProTrp: 0.355 ± 0.161
ProTyr: 1.183 ± 0.266
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.022 ± 0.406
GlnCys: 0.059 ± 0.051
GlnAsp: 2.188 ± 0.317
GlnGlu: 3.371 ± 0.478
GlnPhe: 1.538 ± 0.284
GlnGly: 2.898 ± 0.511
GlnHis: 0.71 ± 0.218
GlnIle: 2.425 ± 0.41
GlnLys: 2.78 ± 0.475
GlnLeu: 3.134 ± 0.385
GlnMet: 1.124 ± 0.211
GlnAsn: 2.425 ± 0.355
GlnPro: 1.242 ± 0.281
GlnGln: 2.366 ± 0.324
GlnArg: 2.543 ± 0.404
GlnSer: 2.011 ± 0.343
GlnThr: 2.957 ± 0.465
GlnVal: 2.425 ± 0.418
GlnTrp: 0.651 ± 0.215
GlnTyr: 1.774 ± 0.353
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.134 ± 0.475
ArgCys: 0.118 ± 0.08
ArgAsp: 2.07 ± 0.321
ArgGlu: 3.785 ± 0.602
ArgPhe: 2.129 ± 0.389
ArgGly: 2.188 ± 0.331
ArgHis: 0.828 ± 0.227
ArgIle: 3.075 ± 0.456
ArgLys: 3.726 ± 0.541
ArgLeu: 5.086 ± 0.585
ArgMet: 1.715 ± 0.29
ArgAsn: 2.957 ± 0.349
ArgPro: 1.242 ± 0.293
ArgGln: 1.892 ± 0.371
ArgArg: 2.306 ± 0.522
ArgSer: 1.833 ± 0.345
ArgThr: 2.602 ± 0.337
ArgVal: 2.898 ± 0.54
ArgTrp: 0.591 ± 0.228
ArgTyr: 2.011 ± 0.345
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.253 ± 0.668
SerCys: 0.237 ± 0.106
SerAsp: 2.306 ± 0.395
SerGlu: 3.371 ± 0.438
SerPhe: 2.366 ± 0.348
SerGly: 3.194 ± 0.474
SerHis: 0.828 ± 0.218
SerIle: 4.258 ± 0.675
SerLys: 3.548 ± 0.49
SerLeu: 3.844 ± 0.375
SerMet: 1.479 ± 0.283
SerAsn: 2.839 ± 0.592
SerPro: 1.242 ± 0.235
SerGln: 1.774 ± 0.269
SerArg: 2.011 ± 0.378
SerSer: 1.479 ± 0.496
SerThr: 3.548 ± 0.563
SerVal: 2.898 ± 0.405
SerTrp: 0.414 ± 0.146
SerTyr: 1.833 ± 0.415
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.323 ± 1.219
ThrCys: 0.71 ± 0.248
ThrAsp: 3.844 ± 0.608
ThrGlu: 3.253 ± 0.504
ThrPhe: 3.194 ± 0.437
ThrGly: 4.199 ± 0.54
ThrHis: 1.597 ± 0.374
ThrIle: 4.672 ± 0.586
ThrLys: 4.317 ± 0.478
ThrLeu: 4.317 ± 0.466
ThrMet: 2.129 ± 0.292
ThrAsn: 4.199 ± 0.775
ThrPro: 3.016 ± 0.422
ThrGln: 1.538 ± 0.327
ThrArg: 2.661 ± 0.567
ThrSer: 2.188 ± 0.378
ThrThr: 3.962 ± 0.583
ThrVal: 4.081 ± 0.415
ThrTrp: 1.005 ± 0.267
ThrTyr: 2.188 ± 0.403
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.027 ± 0.811
ValCys: 0.473 ± 0.178
ValAsp: 4.081 ± 0.443
ValGlu: 5.796 ± 0.674
ValPhe: 2.07 ± 0.397
ValGly: 4.258 ± 0.448
ValHis: 1.833 ± 0.367
ValIle: 3.844 ± 0.449
ValLys: 5.027 ± 0.575
ValLeu: 3.548 ± 0.516
ValMet: 2.011 ± 0.432
ValAsn: 4.436 ± 0.544
ValPro: 2.366 ± 0.405
ValGln: 3.016 ± 0.316
ValArg: 2.602 ± 0.314
ValSer: 3.194 ± 0.473
ValThr: 4.022 ± 0.531
ValVal: 4.495 ± 0.506
ValTrp: 0.71 ± 0.206
ValTyr: 2.72 ± 0.462
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.651 ± 0.182
TrpCys: 0.177 ± 0.123
TrpAsp: 0.887 ± 0.34
TrpGlu: 1.301 ± 0.582
TrpPhe: 0.473 ± 0.164
TrpGly: 0.71 ± 0.256
TrpHis: 0.414 ± 0.139
TrpIle: 0.828 ± 0.228
TrpLys: 1.065 ± 0.203
TrpLeu: 1.124 ± 0.269
TrpMet: 0.414 ± 0.143
TrpAsn: 1.419 ± 0.651
TrpPro: 0.0 ± 0.0
TrpGln: 0.769 ± 0.253
TrpArg: 0.769 ± 0.221
TrpSer: 0.651 ± 0.203
TrpThr: 0.887 ± 0.419
TrpVal: 0.946 ± 0.217
TrpTrp: 0.177 ± 0.1
TrpTyr: 0.532 ± 0.185
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.543 ± 0.387
TyrCys: 0.651 ± 0.208
TyrAsp: 3.194 ± 0.492
TyrGlu: 3.489 ± 0.652
TyrPhe: 1.301 ± 0.297
TyrGly: 2.661 ± 0.427
TyrHis: 0.532 ± 0.156
TyrIle: 2.07 ± 0.358
TyrLys: 3.134 ± 0.466
TyrLeu: 1.833 ± 0.339
TyrMet: 1.005 ± 0.229
TyrAsn: 2.484 ± 0.396
TyrPro: 1.183 ± 0.299
TyrGln: 1.715 ± 0.272
TyrArg: 1.952 ± 0.354
TyrSer: 1.479 ± 0.366
TyrThr: 2.839 ± 0.52
TyrVal: 3.075 ± 0.512
TyrTrp: 0.71 ± 0.198
TyrTyr: 2.07 ± 0.44
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (16910 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
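A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's own procedure: the function names are hypothetical, whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the reported error is taken to be the standard deviation across replicates (the page does not say which spread statistic is used).

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per-mille frequency.

    Each round draws len(proteins) proteins with replacement, so the
    resampling unit is the whole protein rather than the single dipeptide.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (made-up sequences, not taken from the proteome).
toy = ["MKKLA", "AKKA", "GGGG"]
mean_kk, err_kk = bootstrap_error(toy, "KK")
```

Resampling whole proteins keeps the dependence between neighbouring dipeptides within a protein intact, which is presumably why the error is estimated at the protein level. A dipeptide that never occurs, such as the Xaa pairs in the table, always yields 0.0 ± 0.0.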

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski