Amino acid dipeptide frequency for Escherichia phage phi191

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
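As an illustration of what a dipeptide frequency is, here is a minimal Python sketch. It assumes one-letter sequences, overlapping pairs counted within each protein (never across protein boundaries), and normalization by the total number of pairs. The function name dipeptide_permille and the toy sequences are hypothetical, and the exact counting and normalization used by the database are not stated on this page.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein
    and the result is normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical, not taken from the phage proteome).
toy = ["MKKA", "AKK"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The toy input yields five pairs in total (MK, KK, KA, AK, KK), so KK accounts for two fifths of them.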

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and equally often in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
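The uniform expectation and the per mille scaling can be checked with a short Python sketch. The helper names classify and permille_to_fraction are hypothetical, and using the uniform value of 2.5‰ as the threshold between "red" and "blue" is an assumption about how the table is coloured. The three sample values are copied from the table.

```python
# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / (20 ** 2)        # 2.5
# Same calculation if Xaa is counted as a 21st amino acid.
EXPECTED_WITH_XAA = 1000.0 / (21 ** 2)        # about 2.268


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the (assumed) uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Values taken from the table for this proteome.
samples = {"AlaAla": 9.618, "CysCys": 0.24, "IleIle": 2.525}
for name, value in samples.items():
    print(name, permille_to_fraction(value), classify(value))
```

With this threshold, AlaAla and IleIle fall above the expectation and CysCys falls below it.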

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.618 ± 0.97
AlaCys: 1.082 ± 0.281
AlaAsp: 5.531 ± 0.597
AlaGlu: 8.296 ± 0.756
AlaPhe: 3.787 ± 0.612
AlaGly: 9.017 ± 1.295
AlaHis: 1.503 ± 0.354
AlaIle: 4.448 ± 0.516
AlaLys: 4.989 ± 0.594
AlaLeu: 6.853 ± 0.635
AlaMet: 3.006 ± 0.435
AlaAsn: 2.946 ± 0.479
AlaPro: 3.667 ± 0.521
AlaGln: 4.509 ± 0.632
AlaArg: 5.05 ± 0.579
AlaSer: 5.831 ± 0.527
AlaThr: 5.531 ± 0.8
AlaVal: 6.733 ± 0.589
AlaTrp: 1.683 ± 0.327
AlaTyr: 2.705 ± 0.342
AlaXaa: 0.24 ± 0.13
Cys
CysAla: 1.022 ± 0.267
CysCys: 0.24 ± 0.138
CysAsp: 0.541 ± 0.231
CysGlu: 0.481 ± 0.171
CysPhe: 0.421 ± 0.158
CysGly: 0.842 ± 0.274
CysHis: 0.24 ± 0.119
CysIle: 0.361 ± 0.151
CysLys: 0.601 ± 0.206
CysLeu: 1.142 ± 0.245
CysMet: 0.301 ± 0.138
CysAsn: 0.361 ± 0.125
CysPro: 0.721 ± 0.234
CysGln: 0.301 ± 0.138
CysArg: 0.962 ± 0.28
CysSer: 1.142 ± 0.227
CysThr: 0.481 ± 0.14
CysVal: 0.721 ± 0.226
CysTrp: 0.12 ± 0.09
CysTyr: 0.361 ± 0.152
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.072 ± 0.655
AspCys: 0.481 ± 0.153
AspAsp: 3.487 ± 0.403
AspGlu: 3.787 ± 0.578
AspPhe: 1.924 ± 0.344
AspGly: 3.968 ± 0.535
AspHis: 0.781 ± 0.341
AspIle: 3.126 ± 0.469
AspLys: 3.907 ± 0.411
AspLeu: 4.088 ± 0.479
AspMet: 1.743 ± 0.372
AspAsn: 2.344 ± 0.304
AspPro: 2.585 ± 0.432
AspGln: 1.262 ± 0.335
AspArg: 3.667 ± 0.47
AspSer: 3.186 ± 0.364
AspThr: 2.585 ± 0.465
AspVal: 4.509 ± 0.459
AspTrp: 0.781 ± 0.22
AspTyr: 1.443 ± 0.291
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.552 ± 0.732
GluCys: 0.962 ± 0.249
GluAsp: 2.525 ± 0.306
GluGlu: 4.569 ± 0.527
GluPhe: 2.705 ± 0.506
GluGly: 3.607 ± 0.48
GluHis: 1.383 ± 0.268
GluIle: 3.787 ± 0.487
GluLys: 4.569 ± 0.548
GluLeu: 5.771 ± 0.636
GluMet: 1.864 ± 0.302
GluAsn: 2.705 ± 0.422
GluPro: 1.383 ± 0.256
GluGln: 3.066 ± 0.504
GluArg: 5.531 ± 0.61
GluSer: 3.366 ± 0.412
GluThr: 3.547 ± 0.565
GluVal: 3.968 ± 0.48
GluTrp: 0.721 ± 0.196
GluTyr: 2.405 ± 0.331
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.066 ± 0.407
PheCys: 0.661 ± 0.193
PheAsp: 1.924 ± 0.33
PheGlu: 1.443 ± 0.319
PhePhe: 0.902 ± 0.248
PheGly: 2.284 ± 0.345
PheHis: 0.541 ± 0.166
PheIle: 2.044 ± 0.428
PheLys: 1.623 ± 0.337
PheLeu: 2.224 ± 0.367
PheMet: 1.262 ± 0.296
PheAsn: 2.044 ± 0.419
PhePro: 1.323 ± 0.255
PheGln: 0.842 ± 0.259
PheArg: 2.825 ± 0.419
PheSer: 3.306 ± 0.503
PheThr: 2.164 ± 0.37
PheVal: 3.006 ± 0.427
PheTrp: 0.361 ± 0.171
PheTyr: 0.781 ± 0.265
PheXaa: 0.06 ± 0.063
Gly
GlyAla: 6.913 ± 0.959
GlyCys: 0.421 ± 0.195
GlyAsp: 4.689 ± 0.945
GlyGlu: 5.831 ± 1.137
GlyPhe: 2.946 ± 0.438
GlyGly: 5.17 ± 0.624
GlyHis: 0.902 ± 0.252
GlyIle: 4.088 ± 0.553
GlyLys: 5.47 ± 0.648
GlyLeu: 4.989 ± 0.532
GlyMet: 2.224 ± 0.375
GlyAsn: 2.825 ± 0.411
GlyPro: 4.629 ± 2.758
GlyGln: 3.186 ± 0.432
GlyArg: 4.569 ± 0.431
GlySer: 4.208 ± 0.613
GlyThr: 3.847 ± 0.417
GlyVal: 5.11 ± 0.522
GlyTrp: 0.781 ± 0.215
GlyTyr: 2.405 ± 0.316
GlyXaa: 0.06 ± 0.069
His
HisAla: 2.044 ± 0.29
HisCys: 0.12 ± 0.084
HisAsp: 0.721 ± 0.175
HisGlu: 0.842 ± 0.28
HisPhe: 0.721 ± 0.259
HisGly: 1.443 ± 0.293
HisHis: 0.421 ± 0.194
HisIle: 0.781 ± 0.251
HisLys: 0.661 ± 0.191
HisLeu: 1.803 ± 0.42
HisMet: 0.661 ± 0.226
HisAsn: 0.721 ± 0.191
HisPro: 0.902 ± 0.232
HisGln: 0.661 ± 0.181
HisArg: 1.022 ± 0.256
HisSer: 1.443 ± 0.285
HisThr: 0.842 ± 0.238
HisVal: 0.902 ± 0.273
HisTrp: 0.18 ± 0.101
HisTyr: 0.902 ± 0.227
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.929 ± 0.549
IleCys: 0.661 ± 0.2
IleAsp: 3.607 ± 0.466
IleGlu: 3.006 ± 0.407
IlePhe: 1.082 ± 0.323
IleGly: 2.525 ± 0.463
IleHis: 1.022 ± 0.248
IleIle: 2.525 ± 0.38
IleLys: 2.765 ± 0.387
IleLeu: 3.066 ± 0.501
IleMet: 1.082 ± 0.235
IleAsn: 3.126 ± 0.384
IlePro: 2.525 ± 0.407
IleGln: 1.803 ± 0.292
IleArg: 4.989 ± 0.547
IleSer: 3.907 ± 0.608
IleThr: 3.487 ± 0.551
IleVal: 2.044 ± 0.264
IleTrp: 0.18 ± 0.095
IleTyr: 1.262 ± 0.299
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.951 ± 0.56
LysCys: 0.601 ± 0.242
LysAsp: 3.006 ± 0.506
LysGlu: 3.366 ± 0.449
LysPhe: 1.383 ± 0.298
LysGly: 6.252 ± 1.353
LysHis: 1.082 ± 0.286
LysIle: 3.427 ± 0.442
LysLys: 3.547 ± 0.473
LysLeu: 4.749 ± 0.567
LysMet: 1.383 ± 0.264
LysAsn: 3.306 ± 0.528
LysPro: 2.645 ± 0.409
LysGln: 2.344 ± 0.444
LysArg: 2.585 ± 0.354
LysSer: 3.246 ± 0.433
LysThr: 3.667 ± 0.447
LysVal: 2.946 ± 0.383
LysTrp: 0.601 ± 0.192
LysTyr: 1.262 ± 0.239
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.957 ± 0.789
LeuCys: 1.022 ± 0.267
LeuAsp: 3.547 ± 0.446
LeuGlu: 4.268 ± 0.58
LeuPhe: 2.645 ± 0.357
LeuGly: 4.448 ± 0.547
LeuHis: 1.443 ± 0.325
LeuIle: 3.246 ± 0.477
LeuLys: 3.907 ± 0.474
LeuLeu: 6.372 ± 0.488
LeuMet: 1.864 ± 0.402
LeuAsn: 4.268 ± 0.523
LeuPro: 4.388 ± 0.49
LeuGln: 3.306 ± 0.569
LeuArg: 5.05 ± 0.554
LeuSer: 6.072 ± 0.665
LeuThr: 5.531 ± 0.527
LeuVal: 5.05 ± 0.553
LeuTrp: 0.601 ± 0.181
LeuTyr: 1.803 ± 0.306
LeuXaa: 0.24 ± 0.117
Met
MetAla: 3.066 ± 0.434
MetCys: 0.06 ± 0.057
MetAsp: 1.262 ± 0.239
MetGlu: 1.202 ± 0.245
MetPhe: 0.781 ± 0.215
MetGly: 1.803 ± 0.348
MetHis: 0.361 ± 0.158
MetIle: 0.962 ± 0.222
MetLys: 2.224 ± 0.373
MetLeu: 1.803 ± 0.318
MetMet: 0.962 ± 0.296
MetAsn: 1.984 ± 0.371
MetPro: 1.743 ± 0.323
MetGln: 1.082 ± 0.266
MetArg: 1.563 ± 0.315
MetSer: 2.104 ± 0.305
MetThr: 2.705 ± 0.373
MetVal: 1.443 ± 0.322
MetTrp: 0.24 ± 0.094
MetTyr: 0.541 ± 0.222
MetXaa: 0.06 ± 0.057
Asn
AsnAla: 5.11 ± 0.547
AsnCys: 0.661 ± 0.216
AsnAsp: 2.044 ± 0.322
AsnGlu: 2.525 ± 0.398
AsnPhe: 1.383 ± 0.308
AsnGly: 3.787 ± 0.453
AsnHis: 1.323 ± 0.261
AsnIle: 2.284 ± 0.35
AsnLys: 2.284 ± 0.394
AsnLeu: 3.487 ± 0.463
AsnMet: 1.082 ± 0.257
AsnAsn: 2.044 ± 0.462
AsnPro: 1.924 ± 0.334
AsnGln: 1.924 ± 0.333
AsnArg: 2.405 ± 0.392
AsnSer: 2.525 ± 0.462
AsnThr: 1.984 ± 0.384
AsnVal: 3.006 ± 0.437
AsnTrp: 0.781 ± 0.208
AsnTyr: 1.323 ± 0.285
AsnXaa: 0.06 ± 0.055
Pro
ProAla: 3.847 ± 0.853
ProCys: 0.481 ± 0.191
ProAsp: 3.968 ± 0.574
ProGlu: 4.809 ± 0.631
ProPhe: 1.623 ± 0.318
ProGly: 3.607 ± 0.823
ProHis: 0.361 ± 0.134
ProIle: 0.962 ± 0.259
ProLys: 2.825 ± 0.875
ProLeu: 3.126 ± 0.374
ProMet: 0.902 ± 0.229
ProAsn: 0.962 ± 0.254
ProPro: 1.262 ± 0.328
ProGln: 2.224 ± 0.659
ProArg: 2.705 ± 0.428
ProSer: 2.405 ± 0.397
ProThr: 2.224 ± 0.338
ProVal: 4.629 ± 0.603
ProTrp: 0.601 ± 0.186
ProTyr: 1.803 ± 0.348
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.667 ± 0.567
GlnCys: 0.902 ± 0.225
GlnAsp: 2.224 ± 0.382
GlnGlu: 2.164 ± 0.365
GlnPhe: 1.323 ± 0.278
GlnGly: 3.547 ± 1.031
GlnHis: 1.022 ± 0.244
GlnIle: 2.525 ± 0.438
GlnLys: 2.885 ± 0.513
GlnLeu: 3.547 ± 0.501
GlnMet: 1.082 ± 0.284
GlnAsn: 1.503 ± 0.292
GlnPro: 1.924 ± 0.432
GlnGln: 3.366 ± 0.725
GlnArg: 2.224 ± 0.414
GlnSer: 2.344 ± 0.335
GlnThr: 1.864 ± 0.323
GlnVal: 2.164 ± 0.482
GlnTrp: 0.541 ± 0.176
GlnTyr: 1.202 ± 0.272
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.028 ± 0.447
ArgCys: 0.541 ± 0.221
ArgAsp: 4.028 ± 0.553
ArgGlu: 5.11 ± 0.682
ArgPhe: 2.946 ± 0.482
ArgGly: 4.869 ± 0.817
ArgHis: 1.984 ± 0.293
ArgIle: 3.366 ± 0.457
ArgLys: 4.509 ± 0.645
ArgLeu: 5.35 ± 0.488
ArgMet: 1.864 ± 0.384
ArgAsn: 3.066 ± 0.452
ArgPro: 2.044 ± 0.383
ArgGln: 2.765 ± 0.408
ArgArg: 5.35 ± 0.705
ArgSer: 3.727 ± 0.419
ArgThr: 3.727 ± 0.544
ArgVal: 3.847 ± 0.58
ArgTrp: 1.202 ± 0.271
ArgTyr: 2.104 ± 0.373
ArgXaa: 0.18 ± 0.106
Ser
SerAla: 6.192 ± 0.604
SerCys: 0.661 ± 0.189
SerAsp: 3.907 ± 0.481
SerGlu: 3.968 ± 0.384
SerPhe: 1.924 ± 0.343
SerGly: 5.591 ± 0.636
SerHis: 1.262 ± 0.271
SerIle: 2.525 ± 0.363
SerLys: 2.765 ± 0.328
SerLeu: 6.432 ± 0.769
SerMet: 1.924 ± 0.369
SerAsn: 2.044 ± 0.374
SerPro: 3.186 ± 0.536
SerGln: 2.946 ± 0.47
SerArg: 4.689 ± 0.612
SerSer: 3.306 ± 0.575
SerThr: 3.246 ± 0.457
SerVal: 4.448 ± 0.534
SerTrp: 0.781 ± 0.198
SerTyr: 1.563 ± 0.43
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.831 ± 0.637
ThrCys: 0.421 ± 0.187
ThrAsp: 2.946 ± 0.418
ThrGlu: 3.907 ± 0.518
ThrPhe: 2.164 ± 0.396
ThrGly: 5.891 ± 0.711
ThrHis: 0.721 ± 0.232
ThrIle: 3.487 ± 0.396
ThrLys: 2.645 ± 0.389
ThrLeu: 5.23 ± 0.521
ThrMet: 1.082 ± 0.24
ThrAsn: 1.743 ± 0.285
ThrPro: 3.727 ± 0.483
ThrGln: 1.924 ± 0.361
ThrArg: 2.765 ± 0.467
ThrSer: 3.847 ± 0.432
ThrThr: 4.208 ± 0.563
ThrVal: 4.509 ± 0.592
ThrTrp: 1.202 ± 0.235
ThrTyr: 1.082 ± 0.302
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.673 ± 0.669
ValCys: 1.082 ± 0.273
ValAsp: 3.847 ± 0.561
ValGlu: 3.487 ± 0.425
ValPhe: 2.284 ± 0.369
ValGly: 3.186 ± 0.541
ValHis: 0.661 ± 0.208
ValIle: 3.487 ± 0.559
ValLys: 3.427 ± 0.424
ValLeu: 5.41 ± 0.517
ValMet: 2.164 ± 0.312
ValAsn: 3.787 ± 0.452
ValPro: 2.885 ± 0.452
ValGln: 1.924 ± 0.279
ValArg: 5.11 ± 0.874
ValSer: 5.17 ± 0.707
ValThr: 5.05 ± 0.494
ValVal: 4.509 ± 0.589
ValTrp: 0.842 ± 0.196
ValTyr: 1.743 ± 0.335
ValXaa: 0.06 ± 0.066
Trp
TrpAla: 0.842 ± 0.233
TrpCys: 0.24 ± 0.123
TrpAsp: 0.541 ± 0.219
TrpGlu: 0.601 ± 0.153
TrpPhe: 0.601 ± 0.182
TrpGly: 0.962 ± 0.224
TrpHis: 0.24 ± 0.122
TrpIle: 0.721 ± 0.226
TrpLys: 0.902 ± 0.25
TrpLeu: 0.842 ± 0.341
TrpMet: 0.781 ± 0.217
TrpAsn: 0.541 ± 0.168
TrpPro: 0.601 ± 0.197
TrpGln: 0.781 ± 0.193
TrpArg: 1.022 ± 0.19
TrpSer: 0.721 ± 0.2
TrpThr: 0.842 ± 0.209
TrpVal: 1.082 ± 0.238
TrpTrp: 0.18 ± 0.116
TrpTyr: 0.24 ± 0.126
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.645 ± 0.427
TyrCys: 0.06 ± 0.058
TyrAsp: 1.323 ± 0.288
TyrGlu: 1.142 ± 0.347
TyrPhe: 1.142 ± 0.267
TyrGly: 2.465 ± 0.475
TyrHis: 0.421 ± 0.179
TyrIle: 1.323 ± 0.226
TyrLys: 0.842 ± 0.238
TyrLeu: 1.623 ± 0.348
TyrMet: 0.661 ± 0.208
TyrAsn: 1.623 ± 0.315
TyrPro: 1.262 ± 0.294
TyrGln: 1.803 ± 0.263
TyrArg: 2.525 ± 0.457
TyrSer: 1.383 ± 0.265
TyrThr: 1.803 ± 0.417
TyrVal: 1.984 ± 0.416
TyrTrp: 0.902 ± 0.224
TyrTyr: 1.383 ± 0.291
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.06 ± 0.066
XaaGlu: 0.18 ± 0.101
XaaPhe: 0.0 ± 0.0
XaaGly: 0.06 ± 0.063
XaaHis: 0.12 ± 0.087
XaaIle: 0.12 ± 0.095
XaaLys: 0.0 ± 0.0
XaaLeu: 0.12 ± 0.096
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.12 ± 0.085
XaaThr: 0.0 ± 0.0
XaaVal: 0.18 ± 0.096
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.301 ± 0.196
Statistics based on 87 proteins (16636 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
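A protein-level bootstrap of this kind could look like the following Python sketch. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of 100 replicates, and that the reported error is the standard deviation across replicates. The function names, the seed, and the toy sequences are hypothetical, and the estimator used by the database may differ.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of a dipeptide's per mille frequency over
    n_boot resamples of whole proteins (drawn with replacement)."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy proteome (hypothetical, not the phage sequences).
toy = ["MKKA", "AKK", "MAAK", "KKKK"]
print(round(bootstrap_error(toy, "KK"), 3))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.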

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski