Amino acid dipepetide frequency for Bacillus phage phiS58

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.967AlaAla: 5.967 ± 1.363
0.144AlaCys: 0.144 ± 0.113
3.379AlaAsp: 3.379 ± 0.436
5.751AlaGlu: 5.751 ± 0.578
2.804AlaPhe: 2.804 ± 0.472
3.523AlaGly: 3.523 ± 0.612
1.006AlaHis: 1.006 ± 0.264
4.026AlaIle: 4.026 ± 0.583
5.176AlaLys: 5.176 ± 0.596
5.464AlaLeu: 5.464 ± 0.742
1.725AlaMet: 1.725 ± 0.314
3.163AlaAsn: 3.163 ± 0.546
1.222AlaPro: 1.222 ± 0.306
2.229AlaGln: 2.229 ± 0.45
2.876AlaArg: 2.876 ± 0.335
3.307AlaSer: 3.307 ± 0.457
2.876AlaThr: 2.876 ± 0.53
4.457AlaVal: 4.457 ± 0.718
1.222AlaTrp: 1.222 ± 0.288
2.085AlaTyr: 2.085 ± 0.353
0.0AlaXaa: 0.0 ± 0.0
Cys
0.431CysAla: 0.431 ± 0.174
0.144CysCys: 0.144 ± 0.098
1.006CysAsp: 1.006 ± 0.333
1.222CysGlu: 1.222 ± 0.325
0.503CysPhe: 0.503 ± 0.177
0.575CysGly: 0.575 ± 0.205
0.072CysHis: 0.072 ± 0.067
0.503CysIle: 0.503 ± 0.183
0.935CysLys: 0.935 ± 0.3
0.503CysLeu: 0.503 ± 0.21
0.216CysMet: 0.216 ± 0.113
0.935CysAsn: 0.935 ± 0.303
0.216CysPro: 0.216 ± 0.162
0.288CysGln: 0.288 ± 0.127
0.503CysArg: 0.503 ± 0.197
0.431CysSer: 0.431 ± 0.163
0.503CysThr: 0.503 ± 0.198
0.216CysVal: 0.216 ± 0.131
0.072CysTrp: 0.072 ± 0.066
0.575CysTyr: 0.575 ± 0.238
0.0CysXaa: 0.0 ± 0.0
Asp
2.66AspAla: 2.66 ± 0.537
0.288AspCys: 0.288 ± 0.151
3.523AspAsp: 3.523 ± 0.556
4.026AspGlu: 4.026 ± 0.629
2.516AspPhe: 2.516 ± 0.433
4.745AspGly: 4.745 ± 0.624
0.935AspHis: 0.935 ± 0.263
4.242AspIle: 4.242 ± 0.621
4.889AspLys: 4.889 ± 0.515
4.457AspLeu: 4.457 ± 0.599
2.372AspMet: 2.372 ± 0.379
2.804AspAsn: 2.804 ± 0.556
2.013AspPro: 2.013 ± 0.403
2.157AspGln: 2.157 ± 0.379
2.588AspArg: 2.588 ± 0.387
2.66AspSer: 2.66 ± 0.437
2.876AspThr: 2.876 ± 0.47
4.745AspVal: 4.745 ± 0.622
1.006AspTrp: 1.006 ± 0.262
2.588AspTyr: 2.588 ± 0.383
0.0AspXaa: 0.0 ± 0.0
Glu
5.464GluAla: 5.464 ± 0.664
0.503GluCys: 0.503 ± 0.164
4.242GluAsp: 4.242 ± 0.534
7.189GluGlu: 7.189 ± 1.033
3.307GluPhe: 3.307 ± 0.448
4.242GluGly: 4.242 ± 0.496
1.006GluHis: 1.006 ± 0.251
5.679GluIle: 5.679 ± 0.602
7.908GluLys: 7.908 ± 0.841
6.973GluLeu: 6.973 ± 0.788
2.372GluMet: 2.372 ± 0.5
4.313GluAsn: 4.313 ± 0.375
2.013GluPro: 2.013 ± 0.417
3.954GluGln: 3.954 ± 0.577
4.673GluArg: 4.673 ± 0.498
4.529GluSer: 4.529 ± 0.741
4.242GluThr: 4.242 ± 0.607
4.745GluVal: 4.745 ± 0.647
1.941GluTrp: 1.941 ± 0.386
2.732GluTyr: 2.732 ± 0.455
0.0GluXaa: 0.0 ± 0.0
Phe
2.444PheAla: 2.444 ± 0.402
0.359PheCys: 0.359 ± 0.151
2.732PheAsp: 2.732 ± 0.485
2.804PheGlu: 2.804 ± 0.414
1.366PhePhe: 1.366 ± 0.348
3.307PheGly: 3.307 ± 0.548
0.719PheHis: 0.719 ± 0.2
4.313PheIle: 4.313 ± 0.464
3.882PheLys: 3.882 ± 0.518
2.444PheLeu: 2.444 ± 0.443
1.15PheMet: 1.15 ± 0.317
2.444PheAsn: 2.444 ± 0.39
0.863PhePro: 0.863 ± 0.24
1.51PheGln: 1.51 ± 0.291
1.797PheArg: 1.797 ± 0.392
2.732PheSer: 2.732 ± 0.489
2.229PheThr: 2.229 ± 0.391
2.948PheVal: 2.948 ± 0.481
0.503PheTrp: 0.503 ± 0.196
1.51PheTyr: 1.51 ± 0.312
0.0PheXaa: 0.0 ± 0.0
Gly
4.313GlyAla: 4.313 ± 0.838
0.288GlyCys: 0.288 ± 0.193
3.954GlyAsp: 3.954 ± 0.593
4.313GlyGlu: 4.313 ± 0.617
2.229GlyPhe: 2.229 ± 0.486
3.882GlyGly: 3.882 ± 0.763
0.431GlyHis: 0.431 ± 0.159
4.313GlyIle: 4.313 ± 0.773
5.32GlyLys: 5.32 ± 0.677
4.96GlyLeu: 4.96 ± 0.633
2.085GlyMet: 2.085 ± 0.315
3.235GlyAsn: 3.235 ± 0.516
0.935GlyPro: 0.935 ± 0.295
1.941GlyGln: 1.941 ± 0.359
2.876GlyArg: 2.876 ± 0.423
3.595GlySer: 3.595 ± 0.527
3.451GlyThr: 3.451 ± 0.593
3.882GlyVal: 3.882 ± 0.77
1.941GlyTrp: 1.941 ± 0.359
3.019GlyTyr: 3.019 ± 0.548
0.0GlyXaa: 0.0 ± 0.0
His
0.791HisAla: 0.791 ± 0.248
0.288HisCys: 0.288 ± 0.139
0.791HisAsp: 0.791 ± 0.188
0.791HisGlu: 0.791 ± 0.27
1.006HisPhe: 1.006 ± 0.27
0.791HisGly: 0.791 ± 0.202
0.288HisHis: 0.288 ± 0.145
0.863HisIle: 0.863 ± 0.238
1.294HisLys: 1.294 ± 0.393
1.294HisLeu: 1.294 ± 0.293
0.216HisMet: 0.216 ± 0.141
0.431HisAsn: 0.431 ± 0.155
0.288HisPro: 0.288 ± 0.147
0.791HisGln: 0.791 ± 0.265
0.647HisArg: 0.647 ± 0.205
0.935HisSer: 0.935 ± 0.236
0.719HisThr: 0.719 ± 0.23
1.438HisVal: 1.438 ± 0.273
0.288HisTrp: 0.288 ± 0.124
0.719HisTyr: 0.719 ± 0.201
0.0HisXaa: 0.0 ± 0.0
Ile
5.392IleAla: 5.392 ± 0.68
1.366IleCys: 1.366 ± 0.306
4.026IleAsp: 4.026 ± 0.506
6.183IleGlu: 6.183 ± 0.559
3.019IlePhe: 3.019 ± 0.491
4.098IleGly: 4.098 ± 0.534
1.006IleHis: 1.006 ± 0.285
4.889IleIle: 4.889 ± 0.599
6.398IleLys: 6.398 ± 0.702
5.104IleLeu: 5.104 ± 0.666
1.438IleMet: 1.438 ± 0.337
5.32IleAsn: 5.32 ± 0.805
2.157IlePro: 2.157 ± 0.378
3.019IleGln: 3.019 ± 0.552
3.163IleArg: 3.163 ± 0.544
4.457IleSer: 4.457 ± 0.565
4.601IleThr: 4.601 ± 0.596
3.523IleVal: 3.523 ± 0.587
0.863IleTrp: 0.863 ± 0.216
2.157IleTyr: 2.157 ± 0.405
0.0IleXaa: 0.0 ± 0.0
Lys
5.751LysAla: 5.751 ± 0.665
0.719LysCys: 0.719 ± 0.273
5.895LysAsp: 5.895 ± 0.794
9.346LysGlu: 9.346 ± 0.909
2.732LysPhe: 2.732 ± 0.562
5.032LysGly: 5.032 ± 0.549
1.078LysHis: 1.078 ± 0.253
6.183LysIle: 6.183 ± 0.726
9.561LysLys: 9.561 ± 1.201
7.045LysLeu: 7.045 ± 0.744
2.804LysMet: 2.804 ± 0.409
4.673LysAsn: 4.673 ± 0.594
2.732LysPro: 2.732 ± 0.363
4.242LysGln: 4.242 ± 0.564
5.32LysArg: 5.32 ± 0.686
4.529LysSer: 4.529 ± 0.568
4.817LysThr: 4.817 ± 0.755
6.542LysVal: 6.542 ± 0.613
0.503LysTrp: 0.503 ± 0.165
3.091LysTyr: 3.091 ± 0.531
0.0LysXaa: 0.0 ± 0.0
Leu
5.248LeuAla: 5.248 ± 0.568
0.791LeuCys: 0.791 ± 0.275
5.104LeuAsp: 5.104 ± 0.633
6.758LeuGlu: 6.758 ± 0.614
3.019LeuPhe: 3.019 ± 0.524
4.96LeuGly: 4.96 ± 0.873
1.222LeuHis: 1.222 ± 0.3
5.104LeuIle: 5.104 ± 0.739
6.614LeuLys: 6.614 ± 0.732
5.248LeuLeu: 5.248 ± 0.724
2.588LeuMet: 2.588 ± 0.446
4.17LeuAsn: 4.17 ± 0.551
2.301LeuPro: 2.301 ± 0.342
3.451LeuGln: 3.451 ± 0.48
3.451LeuArg: 3.451 ± 0.445
5.607LeuSer: 5.607 ± 0.605
4.242LeuThr: 4.242 ± 0.504
5.536LeuVal: 5.536 ± 0.649
0.791LeuTrp: 0.791 ± 0.208
2.876LeuTyr: 2.876 ± 0.504
0.0LeuXaa: 0.0 ± 0.0
Met
1.797MetAla: 1.797 ± 0.434
0.288MetCys: 0.288 ± 0.139
1.51MetAsp: 1.51 ± 0.412
2.301MetGlu: 2.301 ± 0.316
1.438MetPhe: 1.438 ± 0.372
1.294MetGly: 1.294 ± 0.296
0.575MetHis: 0.575 ± 0.22
1.869MetIle: 1.869 ± 0.375
3.595MetLys: 3.595 ± 0.489
2.229MetLeu: 2.229 ± 0.39
0.935MetMet: 0.935 ± 0.236
2.301MetAsn: 2.301 ± 0.426
1.222MetPro: 1.222 ± 0.308
1.582MetGln: 1.582 ± 0.407
1.653MetArg: 1.653 ± 0.361
1.869MetSer: 1.869 ± 0.35
2.085MetThr: 2.085 ± 0.391
1.078MetVal: 1.078 ± 0.266
0.431MetTrp: 0.431 ± 0.181
0.863MetTyr: 0.863 ± 0.302
0.0MetXaa: 0.0 ± 0.0
Asn
3.235AsnAla: 3.235 ± 0.604
0.863AsnCys: 0.863 ± 0.31
3.595AsnAsp: 3.595 ± 0.61
4.817AsnGlu: 4.817 ± 0.591
1.725AsnPhe: 1.725 ± 0.365
4.745AsnGly: 4.745 ± 0.753
0.791AsnHis: 0.791 ± 0.254
4.242AsnIle: 4.242 ± 0.486
4.745AsnLys: 4.745 ± 0.605
5.607AsnLeu: 5.607 ± 0.541
1.725AsnMet: 1.725 ± 0.373
2.804AsnAsn: 2.804 ± 0.336
2.013AsnPro: 2.013 ± 0.354
1.582AsnGln: 1.582 ± 0.273
2.229AsnArg: 2.229 ± 0.406
2.372AsnSer: 2.372 ± 0.308
2.516AsnThr: 2.516 ± 0.444
3.235AsnVal: 3.235 ± 0.483
0.719AsnTrp: 0.719 ± 0.198
1.366AsnTyr: 1.366 ± 0.33
0.0AsnXaa: 0.0 ± 0.0
Pro
1.725ProAla: 1.725 ± 0.26
0.216ProCys: 0.216 ± 0.133
1.078ProAsp: 1.078 ± 0.298
1.797ProGlu: 1.797 ± 0.441
1.366ProPhe: 1.366 ± 0.354
1.366ProGly: 1.366 ± 0.333
0.647ProHis: 0.647 ± 0.178
1.582ProIle: 1.582 ± 0.356
2.516ProLys: 2.516 ± 0.386
2.229ProLeu: 2.229 ± 0.461
0.935ProMet: 0.935 ± 0.204
1.653ProAsn: 1.653 ± 0.325
0.935ProPro: 0.935 ± 0.289
1.15ProGln: 1.15 ± 0.257
0.791ProArg: 0.791 ± 0.285
1.366ProSer: 1.366 ± 0.288
2.085ProThr: 2.085 ± 0.375
2.948ProVal: 2.948 ± 0.467
0.288ProTrp: 0.288 ± 0.119
1.582ProTyr: 1.582 ± 0.317
0.0ProXaa: 0.0 ± 0.0
Gln
2.444GlnAla: 2.444 ± 0.378
0.359GlnCys: 0.359 ± 0.174
1.653GlnAsp: 1.653 ± 0.327
3.882GlnGlu: 3.882 ± 0.565
2.229GlnPhe: 2.229 ± 0.429
2.157GlnGly: 2.157 ± 0.342
0.431GlnHis: 0.431 ± 0.197
2.732GlnIle: 2.732 ± 0.443
3.666GlnLys: 3.666 ± 0.456
3.379GlnLeu: 3.379 ± 0.578
1.582GlnMet: 1.582 ± 0.356
2.085GlnAsn: 2.085 ± 0.437
1.366GlnPro: 1.366 ± 0.292
2.372GlnGln: 2.372 ± 0.492
2.013GlnArg: 2.013 ± 0.312
2.444GlnSer: 2.444 ± 0.404
1.941GlnThr: 1.941 ± 0.409
1.941GlnVal: 1.941 ± 0.39
0.431GlnTrp: 0.431 ± 0.182
2.085GlnTyr: 2.085 ± 0.295
0.0GlnXaa: 0.0 ± 0.0
Arg
2.516ArgAla: 2.516 ± 0.49
0.503ArgCys: 0.503 ± 0.219
1.941ArgAsp: 1.941 ± 0.309
2.948ArgGlu: 2.948 ± 0.41
2.372ArgPhe: 2.372 ± 0.465
2.66ArgGly: 2.66 ± 0.445
0.719ArgHis: 0.719 ± 0.239
4.385ArgIle: 4.385 ± 0.654
5.392ArgLys: 5.392 ± 0.725
4.026ArgLeu: 4.026 ± 0.568
1.366ArgMet: 1.366 ± 0.244
2.229ArgAsn: 2.229 ± 0.439
0.935ArgPro: 0.935 ± 0.236
1.366ArgGln: 1.366 ± 0.292
1.869ArgArg: 1.869 ± 0.484
2.301ArgSer: 2.301 ± 0.41
2.085ArgThr: 2.085 ± 0.385
2.948ArgVal: 2.948 ± 0.546
0.575ArgTrp: 0.575 ± 0.183
2.372ArgTyr: 2.372 ± 0.6
0.0ArgXaa: 0.0 ± 0.0
Ser
2.876SerAla: 2.876 ± 0.508
0.647SerCys: 0.647 ± 0.229
3.523SerAsp: 3.523 ± 0.441
4.457SerGlu: 4.457 ± 0.494
3.091SerPhe: 3.091 ± 0.378
2.732SerGly: 2.732 ± 0.492
0.503SerHis: 0.503 ± 0.201
4.817SerIle: 4.817 ± 0.632
5.536SerLys: 5.536 ± 0.515
4.313SerLeu: 4.313 ± 0.5
1.582SerMet: 1.582 ± 0.299
3.091SerAsn: 3.091 ± 0.444
1.15SerPro: 1.15 ± 0.225
2.444SerGln: 2.444 ± 0.402
2.301SerArg: 2.301 ± 0.414
2.516SerSer: 2.516 ± 0.552
3.019SerThr: 3.019 ± 0.548
3.81SerVal: 3.81 ± 0.618
0.431SerTrp: 0.431 ± 0.17
2.013SerTyr: 2.013 ± 0.426
0.0SerXaa: 0.0 ± 0.0
Thr
3.091ThrAla: 3.091 ± 0.388
0.503ThrCys: 0.503 ± 0.229
2.444ThrAsp: 2.444 ± 0.582
3.738ThrGlu: 3.738 ± 0.522
2.66ThrPhe: 2.66 ± 0.434
3.235ThrGly: 3.235 ± 0.556
1.294ThrHis: 1.294 ± 0.317
5.176ThrIle: 5.176 ± 0.574
5.967ThrLys: 5.967 ± 0.783
4.457ThrLeu: 4.457 ± 0.75
1.797ThrMet: 1.797 ± 0.341
2.948ThrAsn: 2.948 ± 0.386
2.301ThrPro: 2.301 ± 0.389
2.085ThrGln: 2.085 ± 0.353
1.725ThrArg: 1.725 ± 0.35
1.51ThrSer: 1.51 ± 0.343
2.804ThrThr: 2.804 ± 0.585
3.523ThrVal: 3.523 ± 0.514
0.503ThrTrp: 0.503 ± 0.207
1.797ThrTyr: 1.797 ± 0.334
0.0ThrXaa: 0.0 ± 0.0
Val
3.738ValAla: 3.738 ± 0.52
0.503ValCys: 0.503 ± 0.205
4.026ValAsp: 4.026 ± 0.46
5.679ValGlu: 5.679 ± 0.666
2.588ValPhe: 2.588 ± 0.463
4.457ValGly: 4.457 ± 0.65
1.078ValHis: 1.078 ± 0.251
4.098ValIle: 4.098 ± 0.483
5.104ValLys: 5.104 ± 0.59
5.248ValLeu: 5.248 ± 0.703
1.941ValMet: 1.941 ± 0.372
3.666ValAsn: 3.666 ± 0.497
2.157ValPro: 2.157 ± 0.319
2.804ValGln: 2.804 ± 0.372
2.66ValArg: 2.66 ± 0.453
4.673ValSer: 4.673 ± 0.546
3.882ValThr: 3.882 ± 0.571
3.595ValVal: 3.595 ± 0.526
0.647ValTrp: 0.647 ± 0.245
2.229ValTyr: 2.229 ± 0.369
0.0ValXaa: 0.0 ± 0.0
Trp
0.575TrpAla: 0.575 ± 0.228
0.503TrpCys: 0.503 ± 0.186
1.294TrpAsp: 1.294 ± 0.368
1.51TrpGlu: 1.51 ± 0.278
0.431TrpPhe: 0.431 ± 0.185
1.15TrpGly: 1.15 ± 0.269
0.144TrpHis: 0.144 ± 0.101
0.719TrpIle: 0.719 ± 0.244
1.15TrpLys: 1.15 ± 0.242
0.863TrpLeu: 0.863 ± 0.226
0.575TrpMet: 0.575 ± 0.206
0.863TrpAsn: 0.863 ± 0.213
0.216TrpPro: 0.216 ± 0.129
0.503TrpGln: 0.503 ± 0.189
0.647TrpArg: 0.647 ± 0.238
0.935TrpSer: 0.935 ± 0.253
0.575TrpThr: 0.575 ± 0.169
0.719TrpVal: 0.719 ± 0.227
0.144TrpTrp: 0.144 ± 0.092
0.431TrpTyr: 0.431 ± 0.135
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.941TyrAla: 1.941 ± 0.36
0.575TyrCys: 0.575 ± 0.194
2.372TyrAsp: 2.372 ± 0.433
2.516TyrGlu: 2.516 ± 0.395
1.653TyrPhe: 1.653 ± 0.375
2.013TyrGly: 2.013 ± 0.317
0.719TyrHis: 0.719 ± 0.279
2.444TyrIle: 2.444 ± 0.373
3.019TyrLys: 3.019 ± 0.512
3.235TyrLeu: 3.235 ± 0.588
1.438TyrMet: 1.438 ± 0.385
1.797TyrAsn: 1.797 ± 0.392
1.222TyrPro: 1.222 ± 0.297
1.725TyrGln: 1.725 ± 0.309
1.725TyrArg: 1.725 ± 0.314
2.085TyrSer: 2.085 ± 0.37
2.013TyrThr: 2.013 ± 0.335
2.948TyrVal: 2.948 ± 0.497
0.647TyrTrp: 0.647 ± 0.224
2.085TyrTyr: 2.085 ± 0.612
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (13911 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski