Amino acid dipepetide frequency for Escherichia phage Penshu1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.94AlaAla: 8.94 ± 1.039
0.87AlaCys: 0.87 ± 0.238
4.905AlaAsp: 4.905 ± 0.573
6.25AlaGlu: 6.25 ± 0.67
3.402AlaPhe: 3.402 ± 0.42
7.753AlaGly: 7.753 ± 0.884
1.028AlaHis: 1.028 ± 0.314
5.38AlaIle: 5.38 ± 0.611
5.854AlaLys: 5.854 ± 0.561
7.358AlaLeu: 7.358 ± 1.064
2.532AlaMet: 2.532 ± 0.435
3.639AlaAsn: 3.639 ± 0.418
2.69AlaPro: 2.69 ± 0.493
2.769AlaGln: 2.769 ± 0.45
3.56AlaArg: 3.56 ± 0.48
4.351AlaSer: 4.351 ± 0.407
4.114AlaThr: 4.114 ± 0.741
5.934AlaVal: 5.934 ± 0.892
1.741AlaTrp: 1.741 ± 0.439
2.769AlaTyr: 2.769 ± 0.531
0.0AlaXaa: 0.0 ± 0.0
Cys
0.712CysAla: 0.712 ± 0.212
0.158CysCys: 0.158 ± 0.16
0.87CysAsp: 0.87 ± 0.295
0.554CysGlu: 0.554 ± 0.197
0.791CysPhe: 0.791 ± 0.294
0.791CysGly: 0.791 ± 0.256
0.475CysHis: 0.475 ± 0.272
0.316CysIle: 0.316 ± 0.145
0.949CysLys: 0.949 ± 0.362
0.791CysLeu: 0.791 ± 0.312
0.475CysMet: 0.475 ± 0.228
0.554CysAsn: 0.554 ± 0.205
0.475CysPro: 0.475 ± 0.272
0.158CysGln: 0.158 ± 0.116
0.633CysArg: 0.633 ± 0.224
0.396CysSer: 0.396 ± 0.194
0.316CysThr: 0.316 ± 0.15
0.791CysVal: 0.791 ± 0.318
0.237CysTrp: 0.237 ± 0.132
0.237CysTyr: 0.237 ± 0.133
0.0CysXaa: 0.0 ± 0.0
Asp
6.25AspAla: 6.25 ± 0.559
1.028AspCys: 1.028 ± 0.4
4.193AspAsp: 4.193 ± 0.555
3.797AspGlu: 3.797 ± 0.448
2.057AspPhe: 2.057 ± 0.48
6.646AspGly: 6.646 ± 0.818
1.108AspHis: 1.108 ± 0.246
2.69AspIle: 2.69 ± 0.382
3.402AspLys: 3.402 ± 0.647
5.142AspLeu: 5.142 ± 0.532
2.136AspMet: 2.136 ± 0.386
2.453AspAsn: 2.453 ± 0.532
2.927AspPro: 2.927 ± 0.552
1.899AspGln: 1.899 ± 0.376
2.532AspArg: 2.532 ± 0.378
3.639AspSer: 3.639 ± 0.439
3.956AspThr: 3.956 ± 0.543
4.747AspVal: 4.747 ± 0.543
0.949AspTrp: 0.949 ± 0.276
2.532AspTyr: 2.532 ± 0.486
0.0AspXaa: 0.0 ± 0.0
Glu
7.041GluAla: 7.041 ± 0.941
0.633GluCys: 0.633 ± 0.219
4.589GluAsp: 4.589 ± 0.626
4.747GluGlu: 4.747 ± 0.775
2.136GluPhe: 2.136 ± 0.395
4.668GluGly: 4.668 ± 0.71
1.187GluHis: 1.187 ± 0.285
2.532GluIle: 2.532 ± 0.45
3.323GluLys: 3.323 ± 0.462
5.38GluLeu: 5.38 ± 0.671
2.453GluMet: 2.453 ± 0.436
2.215GluAsn: 2.215 ± 0.448
1.899GluPro: 1.899 ± 0.392
2.532GluGln: 2.532 ± 0.455
3.56GluArg: 3.56 ± 0.534
4.272GluSer: 4.272 ± 0.619
3.718GluThr: 3.718 ± 0.401
4.114GluVal: 4.114 ± 0.69
1.345GluTrp: 1.345 ± 0.291
2.69GluTyr: 2.69 ± 0.522
0.0GluXaa: 0.0 ± 0.0
Phe
2.532PheAla: 2.532 ± 0.377
0.316PheCys: 0.316 ± 0.189
2.769PheAsp: 2.769 ± 0.425
1.82PheGlu: 1.82 ± 0.382
1.108PhePhe: 1.108 ± 0.31
2.69PheGly: 2.69 ± 0.534
0.554PheHis: 0.554 ± 0.197
1.582PheIle: 1.582 ± 0.406
2.769PheLys: 2.769 ± 0.467
2.927PheLeu: 2.927 ± 0.385
1.187PheMet: 1.187 ± 0.416
1.978PheAsn: 1.978 ± 0.338
1.661PhePro: 1.661 ± 0.341
0.87PheGln: 0.87 ± 0.296
1.424PheArg: 1.424 ± 0.323
2.69PheSer: 2.69 ± 0.324
2.057PheThr: 2.057 ± 0.343
2.532PheVal: 2.532 ± 0.503
0.396PheTrp: 0.396 ± 0.138
1.345PheTyr: 1.345 ± 0.26
0.0PheXaa: 0.0 ± 0.0
Gly
6.329GlyAla: 6.329 ± 0.89
0.554GlyCys: 0.554 ± 0.202
5.301GlyAsp: 5.301 ± 0.845
5.301GlyGlu: 5.301 ± 0.522
2.373GlyPhe: 2.373 ± 0.432
5.459GlyGly: 5.459 ± 0.713
1.028GlyHis: 1.028 ± 0.274
4.272GlyIle: 4.272 ± 0.538
6.804GlyLys: 6.804 ± 0.843
6.487GlyLeu: 6.487 ± 0.738
2.057GlyMet: 2.057 ± 0.356
3.165GlyAsn: 3.165 ± 0.524
1.503GlyPro: 1.503 ± 0.336
2.769GlyGln: 2.769 ± 0.396
6.013GlyArg: 6.013 ± 0.625
6.013GlySer: 6.013 ± 0.718
4.43GlyThr: 4.43 ± 0.445
5.538GlyVal: 5.538 ± 0.843
1.187GlyTrp: 1.187 ± 0.305
3.481GlyTyr: 3.481 ± 0.45
0.0GlyXaa: 0.0 ± 0.0
His
0.712HisAla: 0.712 ± 0.223
0.237HisCys: 0.237 ± 0.148
1.503HisAsp: 1.503 ± 0.532
1.108HisGlu: 1.108 ± 0.315
0.475HisPhe: 0.475 ± 0.227
0.791HisGly: 0.791 ± 0.206
0.316HisHis: 0.316 ± 0.176
0.949HisIle: 0.949 ± 0.223
1.345HisLys: 1.345 ± 0.32
2.136HisLeu: 2.136 ± 0.386
0.475HisMet: 0.475 ± 0.188
0.554HisAsn: 0.554 ± 0.194
0.554HisPro: 0.554 ± 0.18
0.554HisGln: 0.554 ± 0.201
1.424HisArg: 1.424 ± 0.341
1.108HisSer: 1.108 ± 0.26
1.424HisThr: 1.424 ± 0.327
1.424HisVal: 1.424 ± 0.314
0.475HisTrp: 0.475 ± 0.179
0.712HisTyr: 0.712 ± 0.227
0.0HisXaa: 0.0 ± 0.0
Ile
4.193IleAla: 4.193 ± 0.546
0.87IleCys: 0.87 ± 0.254
2.848IleAsp: 2.848 ± 0.363
2.769IleGlu: 2.769 ± 0.37
1.187IlePhe: 1.187 ± 0.306
3.797IleGly: 3.797 ± 0.631
1.345IleHis: 1.345 ± 0.364
1.899IleIle: 1.899 ± 0.417
3.56IleLys: 3.56 ± 0.513
3.797IleLeu: 3.797 ± 0.471
0.791IleMet: 0.791 ± 0.209
2.294IleAsn: 2.294 ± 0.434
2.057IlePro: 2.057 ± 0.418
1.741IleGln: 1.741 ± 0.449
3.006IleArg: 3.006 ± 0.47
2.927IleSer: 2.927 ± 0.485
3.006IleThr: 3.006 ± 0.441
3.956IleVal: 3.956 ± 0.445
0.554IleTrp: 0.554 ± 0.203
1.187IleTyr: 1.187 ± 0.228
0.0IleXaa: 0.0 ± 0.0
Lys
7.12LysAla: 7.12 ± 0.679
0.87LysCys: 0.87 ± 0.322
4.272LysAsp: 4.272 ± 0.53
3.402LysGlu: 3.402 ± 0.435
2.532LysPhe: 2.532 ± 0.486
4.193LysGly: 4.193 ± 0.51
1.582LysHis: 1.582 ± 0.481
2.294LysIle: 2.294 ± 0.331
4.193LysLys: 4.193 ± 0.779
6.25LysLeu: 6.25 ± 0.711
1.899LysMet: 1.899 ± 0.371
2.373LysAsn: 2.373 ± 0.455
2.453LysPro: 2.453 ± 0.587
1.82LysGln: 1.82 ± 0.46
4.035LysArg: 4.035 ± 0.536
4.509LysSer: 4.509 ± 0.587
3.877LysThr: 3.877 ± 0.456
5.38LysVal: 5.38 ± 0.731
1.345LysTrp: 1.345 ± 0.318
2.453LysTyr: 2.453 ± 0.399
0.0LysXaa: 0.0 ± 0.0
Leu
6.883LeuAla: 6.883 ± 0.671
0.712LeuCys: 0.712 ± 0.321
4.351LeuAsp: 4.351 ± 0.542
5.775LeuGlu: 5.775 ± 0.655
2.215LeuPhe: 2.215 ± 0.325
5.222LeuGly: 5.222 ± 0.768
1.028LeuHis: 1.028 ± 0.26
3.877LeuIle: 3.877 ± 0.665
7.358LeuLys: 7.358 ± 0.69
5.38LeuLeu: 5.38 ± 0.663
3.165LeuMet: 3.165 ± 0.457
4.272LeuAsn: 4.272 ± 0.559
3.481LeuPro: 3.481 ± 0.434
3.639LeuGln: 3.639 ± 0.607
5.222LeuArg: 5.222 ± 0.424
5.222LeuSer: 5.222 ± 0.713
6.408LeuThr: 6.408 ± 0.732
4.509LeuVal: 4.509 ± 0.567
0.791LeuTrp: 0.791 ± 0.257
2.373LeuTyr: 2.373 ± 0.565
0.0LeuXaa: 0.0 ± 0.0
Met
2.848MetAla: 2.848 ± 0.442
0.475MetCys: 0.475 ± 0.17
1.345MetAsp: 1.345 ± 0.369
1.978MetGlu: 1.978 ± 0.309
1.266MetPhe: 1.266 ± 0.331
2.69MetGly: 2.69 ± 0.389
0.475MetHis: 0.475 ± 0.187
1.345MetIle: 1.345 ± 0.24
1.028MetLys: 1.028 ± 0.289
2.373MetLeu: 2.373 ± 0.419
0.475MetMet: 0.475 ± 0.183
1.266MetAsn: 1.266 ± 0.265
1.028MetPro: 1.028 ± 0.347
0.949MetGln: 0.949 ± 0.292
1.187MetArg: 1.187 ± 0.301
2.057MetSer: 2.057 ± 0.426
2.294MetThr: 2.294 ± 0.432
2.453MetVal: 2.453 ± 0.43
0.158MetTrp: 0.158 ± 0.112
0.949MetTyr: 0.949 ± 0.292
0.0MetXaa: 0.0 ± 0.0
Asn
4.272AsnAla: 4.272 ± 0.524
0.475AsnCys: 0.475 ± 0.212
2.453AsnAsp: 2.453 ± 0.509
2.373AsnGlu: 2.373 ± 0.377
1.503AsnPhe: 1.503 ± 0.246
4.351AsnGly: 4.351 ± 0.526
0.791AsnHis: 0.791 ± 0.292
2.69AsnIle: 2.69 ± 0.387
2.373AsnLys: 2.373 ± 0.421
3.244AsnLeu: 3.244 ± 0.519
1.108AsnMet: 1.108 ± 0.302
1.661AsnAsn: 1.661 ± 0.361
2.69AsnPro: 2.69 ± 0.424
1.741AsnGln: 1.741 ± 0.339
2.373AsnArg: 2.373 ± 0.552
2.215AsnSer: 2.215 ± 0.446
2.136AsnThr: 2.136 ± 0.417
3.006AsnVal: 3.006 ± 0.505
0.237AsnTrp: 0.237 ± 0.16
2.136AsnTyr: 2.136 ± 0.472
0.0AsnXaa: 0.0 ± 0.0
Pro
2.769ProAla: 2.769 ± 0.562
0.475ProCys: 0.475 ± 0.22
2.532ProAsp: 2.532 ± 0.395
3.323ProGlu: 3.323 ± 0.417
1.187ProPhe: 1.187 ± 0.232
1.899ProGly: 1.899 ± 0.43
0.633ProHis: 0.633 ± 0.175
2.215ProIle: 2.215 ± 0.331
2.769ProLys: 2.769 ± 0.526
1.978ProLeu: 1.978 ± 0.442
1.108ProMet: 1.108 ± 0.269
2.453ProAsn: 2.453 ± 0.374
0.87ProPro: 0.87 ± 0.202
1.741ProGln: 1.741 ± 0.355
1.741ProArg: 1.741 ± 0.342
2.769ProSer: 2.769 ± 0.405
2.927ProThr: 2.927 ± 0.416
2.927ProVal: 2.927 ± 0.312
0.712ProTrp: 0.712 ± 0.247
0.87ProTyr: 0.87 ± 0.214
0.0ProXaa: 0.0 ± 0.0
Gln
3.639GlnAla: 3.639 ± 0.542
0.237GlnCys: 0.237 ± 0.158
2.927GlnAsp: 2.927 ± 0.625
2.453GlnGlu: 2.453 ± 0.39
1.82GlnPhe: 1.82 ± 0.265
2.69GlnGly: 2.69 ± 0.466
0.633GlnHis: 0.633 ± 0.213
1.028GlnIle: 1.028 ± 0.261
2.215GlnLys: 2.215 ± 0.448
3.877GlnLeu: 3.877 ± 0.654
1.187GlnMet: 1.187 ± 0.307
1.266GlnAsn: 1.266 ± 0.355
1.266GlnPro: 1.266 ± 0.406
1.899GlnGln: 1.899 ± 0.503
2.294GlnArg: 2.294 ± 0.643
2.532GlnSer: 2.532 ± 0.379
1.978GlnThr: 1.978 ± 0.415
2.136GlnVal: 2.136 ± 0.327
0.475GlnTrp: 0.475 ± 0.191
0.87GlnTyr: 0.87 ± 0.282
0.0GlnXaa: 0.0 ± 0.0
Arg
3.797ArgAla: 3.797 ± 0.766
0.316ArgCys: 0.316 ± 0.139
4.43ArgAsp: 4.43 ± 0.437
3.718ArgGlu: 3.718 ± 0.557
2.769ArgPhe: 2.769 ± 0.356
4.351ArgGly: 4.351 ± 0.468
0.791ArgHis: 0.791 ± 0.333
3.085ArgIle: 3.085 ± 0.58
3.639ArgLys: 3.639 ± 0.582
5.934ArgLeu: 5.934 ± 0.742
0.949ArgMet: 0.949 ± 0.258
2.532ArgAsn: 2.532 ± 0.439
1.582ArgPro: 1.582 ± 0.301
2.769ArgGln: 2.769 ± 0.436
2.611ArgArg: 2.611 ± 0.363
3.797ArgSer: 3.797 ± 0.61
2.373ArgThr: 2.373 ± 0.46
2.848ArgVal: 2.848 ± 0.512
1.108ArgTrp: 1.108 ± 0.318
1.661ArgTyr: 1.661 ± 0.314
0.0ArgXaa: 0.0 ± 0.0
Ser
4.668SerAla: 4.668 ± 0.643
0.791SerCys: 0.791 ± 0.336
5.063SerAsp: 5.063 ± 0.491
3.56SerGlu: 3.56 ± 0.53
2.057SerPhe: 2.057 ± 0.365
6.329SerGly: 6.329 ± 0.802
2.294SerHis: 2.294 ± 0.349
3.006SerIle: 3.006 ± 0.513
3.639SerLys: 3.639 ± 0.546
3.956SerLeu: 3.956 ± 0.526
1.345SerMet: 1.345 ± 0.33
3.323SerAsn: 3.323 ± 0.587
2.927SerPro: 2.927 ± 0.455
1.899SerGln: 1.899 ± 0.392
3.639SerArg: 3.639 ± 0.51
3.877SerSer: 3.877 ± 0.49
2.848SerThr: 2.848 ± 0.349
4.589SerVal: 4.589 ± 0.532
0.791SerTrp: 0.791 ± 0.198
2.69SerTyr: 2.69 ± 0.594
0.0SerXaa: 0.0 ± 0.0
Thr
4.589ThrAla: 4.589 ± 0.774
0.554ThrCys: 0.554 ± 0.234
3.244ThrAsp: 3.244 ± 0.432
4.589ThrGlu: 4.589 ± 0.634
2.294ThrPhe: 2.294 ± 0.397
4.984ThrGly: 4.984 ± 0.558
0.712ThrHis: 0.712 ± 0.237
3.797ThrIle: 3.797 ± 0.549
3.165ThrLys: 3.165 ± 0.423
4.984ThrLeu: 4.984 ± 0.569
1.899ThrMet: 1.899 ± 0.39
1.582ThrAsn: 1.582 ± 0.354
3.323ThrPro: 3.323 ± 0.329
2.769ThrGln: 2.769 ± 0.441
2.769ThrArg: 2.769 ± 0.453
2.848ThrSer: 2.848 ± 0.522
3.085ThrThr: 3.085 ± 0.527
4.589ThrVal: 4.589 ± 0.558
0.633ThrTrp: 0.633 ± 0.166
1.661ThrTyr: 1.661 ± 0.251
0.0ThrXaa: 0.0 ± 0.0
Val
4.747ValAla: 4.747 ± 0.539
0.554ValCys: 0.554 ± 0.293
3.402ValAsp: 3.402 ± 0.561
4.826ValGlu: 4.826 ± 0.651
2.294ValPhe: 2.294 ± 0.426
6.566ValGly: 6.566 ± 0.903
0.949ValHis: 0.949 ± 0.451
3.085ValIle: 3.085 ± 0.55
5.222ValLys: 5.222 ± 0.559
5.301ValLeu: 5.301 ± 0.704
2.215ValMet: 2.215 ± 0.367
3.56ValAsn: 3.56 ± 0.546
2.848ValPro: 2.848 ± 0.457
2.532ValGln: 2.532 ± 0.434
4.035ValArg: 4.035 ± 0.493
4.747ValSer: 4.747 ± 0.554
4.193ValThr: 4.193 ± 0.543
5.617ValVal: 5.617 ± 0.852
0.712ValTrp: 0.712 ± 0.255
2.769ValTyr: 2.769 ± 0.483
0.0ValXaa: 0.0 ± 0.0
Trp
0.475TrpAla: 0.475 ± 0.147
0.237TrpCys: 0.237 ± 0.164
0.791TrpAsp: 0.791 ± 0.236
0.949TrpGlu: 0.949 ± 0.23
0.633TrpPhe: 0.633 ± 0.198
1.028TrpGly: 1.028 ± 0.303
0.396TrpHis: 0.396 ± 0.157
0.316TrpIle: 0.316 ± 0.198
1.345TrpLys: 1.345 ± 0.342
1.899TrpLeu: 1.899 ± 0.372
0.316TrpMet: 0.316 ± 0.144
1.187TrpAsn: 1.187 ± 0.32
0.237TrpPro: 0.237 ± 0.149
0.712TrpGln: 0.712 ± 0.27
0.791TrpArg: 0.791 ± 0.25
0.87TrpSer: 0.87 ± 0.376
0.633TrpThr: 0.633 ± 0.215
0.87TrpVal: 0.87 ± 0.291
0.158TrpTrp: 0.158 ± 0.118
0.712TrpTyr: 0.712 ± 0.218
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.323TyrAla: 3.323 ± 0.582
0.316TyrCys: 0.316 ± 0.143
2.294TyrAsp: 2.294 ± 0.38
1.741TyrGlu: 1.741 ± 0.479
1.108TyrPhe: 1.108 ± 0.217
3.165TyrGly: 3.165 ± 0.563
0.949TyrHis: 0.949 ± 0.283
1.424TyrIle: 1.424 ± 0.429
1.899TyrLys: 1.899 ± 0.426
2.769TyrLeu: 2.769 ± 0.366
0.791TyrMet: 0.791 ± 0.232
1.503TyrAsn: 1.503 ± 0.38
1.424TyrPro: 1.424 ± 0.329
1.661TyrGln: 1.661 ± 0.525
2.215TyrArg: 2.215 ± 0.432
2.453TyrSer: 2.453 ± 0.403
2.294TyrThr: 2.294 ± 0.326
2.215TyrVal: 2.215 ± 0.389
0.554TyrTrp: 0.554 ± 0.179
1.266TyrTyr: 1.266 ± 0.306
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (12641 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski