Amino acid dipepetide frequency for Enterobacter phage phiEap-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.379AlaAla: 9.379 ± 1.257
0.502AlaCys: 0.502 ± 0.186
5.946AlaAsp: 5.946 ± 0.59
5.025AlaGlu: 5.025 ± 0.44
2.847AlaPhe: 2.847 ± 0.435
7.118AlaGly: 7.118 ± 0.905
1.34AlaHis: 1.34 ± 0.273
4.857AlaIle: 4.857 ± 0.785
5.778AlaLys: 5.778 ± 0.723
8.123AlaLeu: 8.123 ± 0.86
3.601AlaMet: 3.601 ± 0.655
4.104AlaAsn: 4.104 ± 0.568
2.931AlaPro: 2.931 ± 0.543
3.685AlaGln: 3.685 ± 0.445
3.936AlaArg: 3.936 ± 0.499
5.695AlaSer: 5.695 ± 0.704
4.69AlaThr: 4.69 ± 0.723
4.941AlaVal: 4.941 ± 0.655
1.256AlaTrp: 1.256 ± 0.428
2.596AlaTyr: 2.596 ± 0.555
0.0AlaXaa: 0.0 ± 0.0
Cys
0.586CysAla: 0.586 ± 0.196
0.084CysCys: 0.084 ± 0.088
0.67CysAsp: 0.67 ± 0.327
0.67CysGlu: 0.67 ± 0.252
0.502CysPhe: 0.502 ± 0.233
0.67CysGly: 0.67 ± 0.214
0.251CysHis: 0.251 ± 0.137
0.586CysIle: 0.586 ± 0.18
0.084CysLys: 0.084 ± 0.082
0.837CysLeu: 0.837 ± 0.288
0.0CysMet: 0.0 ± 0.0
0.167CysAsn: 0.167 ± 0.113
0.419CysPro: 0.419 ± 0.169
0.502CysGln: 0.502 ± 0.24
0.586CysArg: 0.586 ± 0.263
0.419CysSer: 0.419 ± 0.187
0.502CysThr: 0.502 ± 0.23
0.335CysVal: 0.335 ± 0.153
0.084CysTrp: 0.084 ± 0.081
0.251CysTyr: 0.251 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
5.778AspAla: 5.778 ± 0.69
0.502AspCys: 0.502 ± 0.221
4.522AspAsp: 4.522 ± 0.535
3.685AspGlu: 3.685 ± 0.555
2.68AspPhe: 2.68 ± 0.399
5.946AspGly: 5.946 ± 0.737
1.005AspHis: 1.005 ± 0.261
2.764AspIle: 2.764 ± 0.358
4.271AspLys: 4.271 ± 0.608
4.857AspLeu: 4.857 ± 0.559
1.842AspMet: 1.842 ± 0.389
3.434AspAsn: 3.434 ± 0.404
2.596AspPro: 2.596 ± 0.5
2.429AspGln: 2.429 ± 0.567
2.345AspArg: 2.345 ± 0.49
2.764AspSer: 2.764 ± 0.411
4.104AspThr: 4.104 ± 0.559
4.271AspVal: 4.271 ± 0.56
0.754AspTrp: 0.754 ± 0.324
2.68AspTyr: 2.68 ± 0.616
0.0AspXaa: 0.0 ± 0.0
Glu
6.951GluAla: 6.951 ± 1.155
0.419GluCys: 0.419 ± 0.177
3.852GluAsp: 3.852 ± 0.645
4.941GluGlu: 4.941 ± 0.782
3.182GluPhe: 3.182 ± 0.451
4.606GluGly: 4.606 ± 0.738
1.759GluHis: 1.759 ± 0.483
3.601GluIle: 3.601 ± 0.536
3.099GluLys: 3.099 ± 0.596
6.03GluLeu: 6.03 ± 0.744
1.675GluMet: 1.675 ± 0.418
2.261GluAsn: 2.261 ± 0.335
2.345GluPro: 2.345 ± 0.633
3.434GluGln: 3.434 ± 0.88
4.606GluArg: 4.606 ± 0.407
3.769GluSer: 3.769 ± 0.566
2.931GluThr: 2.931 ± 0.5
4.69GluVal: 4.69 ± 0.653
1.089GluTrp: 1.089 ± 0.299
3.099GluTyr: 3.099 ± 0.396
0.0GluXaa: 0.0 ± 0.0
Phe
3.182PheAla: 3.182 ± 0.514
0.419PheCys: 0.419 ± 0.246
3.015PheAsp: 3.015 ± 0.486
1.424PheGlu: 1.424 ± 0.3
0.921PhePhe: 0.921 ± 0.327
3.35PheGly: 3.35 ± 0.529
0.921PheHis: 0.921 ± 0.349
1.591PheIle: 1.591 ± 0.445
1.926PheLys: 1.926 ± 0.282
3.182PheLeu: 3.182 ± 0.503
1.256PheMet: 1.256 ± 0.305
2.177PheAsn: 2.177 ± 0.402
1.759PhePro: 1.759 ± 0.528
1.34PheGln: 1.34 ± 0.443
1.591PheArg: 1.591 ± 0.396
2.345PheSer: 2.345 ± 0.525
2.68PheThr: 2.68 ± 0.518
2.68PheVal: 2.68 ± 0.572
0.335PheTrp: 0.335 ± 0.144
0.586PheTyr: 0.586 ± 0.205
0.0PheXaa: 0.0 ± 0.0
Gly
6.951GlyAla: 6.951 ± 0.651
0.837GlyCys: 0.837 ± 0.3
5.527GlyAsp: 5.527 ± 0.691
5.276GlyGlu: 5.276 ± 0.517
3.517GlyPhe: 3.517 ± 0.44
6.951GlyGly: 6.951 ± 0.923
0.67GlyHis: 0.67 ± 0.301
5.611GlyIle: 5.611 ± 0.678
5.108GlyLys: 5.108 ± 0.766
6.448GlyLeu: 6.448 ± 1.101
2.01GlyMet: 2.01 ± 0.438
4.438GlyAsn: 4.438 ± 0.786
1.591GlyPro: 1.591 ± 0.359
2.847GlyGln: 2.847 ± 0.557
3.769GlyArg: 3.769 ± 0.378
5.443GlySer: 5.443 ± 0.806
4.271GlyThr: 4.271 ± 0.654
5.443GlyVal: 5.443 ± 0.674
1.759GlyTrp: 1.759 ± 0.388
2.847GlyTyr: 2.847 ± 0.409
0.0GlyXaa: 0.0 ± 0.0
His
1.089HisAla: 1.089 ± 0.284
0.167HisCys: 0.167 ± 0.122
0.754HisAsp: 0.754 ± 0.251
1.591HisGlu: 1.591 ± 0.497
0.754HisPhe: 0.754 ± 0.259
1.759HisGly: 1.759 ± 0.422
0.502HisHis: 0.502 ± 0.184
0.921HisIle: 0.921 ± 0.263
0.921HisLys: 0.921 ± 0.216
1.591HisLeu: 1.591 ± 0.395
0.335HisMet: 0.335 ± 0.184
0.921HisAsn: 0.921 ± 0.296
0.586HisPro: 0.586 ± 0.217
0.419HisGln: 0.419 ± 0.193
0.754HisArg: 0.754 ± 0.237
1.005HisSer: 1.005 ± 0.285
0.586HisThr: 0.586 ± 0.179
1.591HisVal: 1.591 ± 0.468
0.251HisTrp: 0.251 ± 0.134
1.005HisTyr: 1.005 ± 0.209
0.0HisXaa: 0.0 ± 0.0
Ile
3.852IleAla: 3.852 ± 0.503
0.335IleCys: 0.335 ± 0.156
3.601IleAsp: 3.601 ± 0.517
4.941IleGlu: 4.941 ± 0.697
1.089IlePhe: 1.089 ± 0.279
4.104IleGly: 4.104 ± 0.613
1.005IleHis: 1.005 ± 0.29
2.68IleIle: 2.68 ± 0.513
3.517IleLys: 3.517 ± 0.493
4.02IleLeu: 4.02 ± 0.507
1.256IleMet: 1.256 ± 0.394
2.847IleAsn: 2.847 ± 0.514
2.01IlePro: 2.01 ± 0.418
1.759IleGln: 1.759 ± 0.334
3.015IleArg: 3.015 ± 0.614
3.266IleSer: 3.266 ± 0.468
2.931IleThr: 2.931 ± 0.341
3.517IleVal: 3.517 ± 0.581
0.502IleTrp: 0.502 ± 0.205
2.01IleTyr: 2.01 ± 0.28
0.0IleXaa: 0.0 ± 0.0
Lys
7.202LysAla: 7.202 ± 0.873
0.502LysCys: 0.502 ± 0.202
3.099LysAsp: 3.099 ± 0.432
3.517LysGlu: 3.517 ± 0.728
2.512LysPhe: 2.512 ± 0.44
5.778LysGly: 5.778 ± 0.893
1.675LysHis: 1.675 ± 0.345
2.177LysIle: 2.177 ± 0.376
3.517LysLys: 3.517 ± 0.685
5.527LysLeu: 5.527 ± 0.625
1.34LysMet: 1.34 ± 0.257
2.68LysAsn: 2.68 ± 0.411
2.261LysPro: 2.261 ± 0.452
2.429LysGln: 2.429 ± 0.36
3.015LysArg: 3.015 ± 0.479
4.187LysSer: 4.187 ± 0.651
3.434LysThr: 3.434 ± 0.391
5.025LysVal: 5.025 ± 0.708
1.172LysTrp: 1.172 ± 0.239
2.429LysTyr: 2.429 ± 0.437
0.0LysXaa: 0.0 ± 0.0
Leu
7.37LeuAla: 7.37 ± 1.158
0.251LeuCys: 0.251 ± 0.196
4.857LeuAsp: 4.857 ± 0.598
6.113LeuGlu: 6.113 ± 1.053
2.01LeuPhe: 2.01 ± 0.393
5.025LeuGly: 5.025 ± 0.778
1.172LeuHis: 1.172 ± 0.245
4.02LeuIle: 4.02 ± 0.438
7.202LeuLys: 7.202 ± 0.729
5.695LeuLeu: 5.695 ± 0.872
2.261LeuMet: 2.261 ± 0.364
4.606LeuAsn: 4.606 ± 0.561
3.434LeuPro: 3.434 ± 0.444
3.517LeuGln: 3.517 ± 0.585
5.36LeuArg: 5.36 ± 0.648
4.69LeuSer: 4.69 ± 0.603
5.276LeuThr: 5.276 ± 0.6
4.606LeuVal: 4.606 ± 0.634
1.34LeuTrp: 1.34 ± 0.404
2.512LeuTyr: 2.512 ± 0.494
0.0LeuXaa: 0.0 ± 0.0
Met
3.182MetAla: 3.182 ± 0.437
0.251MetCys: 0.251 ± 0.146
2.094MetAsp: 2.094 ± 0.429
1.256MetGlu: 1.256 ± 0.279
1.172MetPhe: 1.172 ± 0.365
1.842MetGly: 1.842 ± 0.326
0.251MetHis: 0.251 ± 0.133
1.172MetIle: 1.172 ± 0.275
1.172MetLys: 1.172 ± 0.279
2.345MetLeu: 2.345 ± 0.475
0.419MetMet: 0.419 ± 0.222
1.089MetAsn: 1.089 ± 0.29
1.089MetPro: 1.089 ± 0.26
1.591MetGln: 1.591 ± 0.392
1.424MetArg: 1.424 ± 0.296
1.926MetSer: 1.926 ± 0.458
2.177MetThr: 2.177 ± 0.398
1.842MetVal: 1.842 ± 0.373
0.084MetTrp: 0.084 ± 0.091
0.335MetTyr: 0.335 ± 0.167
0.0MetXaa: 0.0 ± 0.0
Asn
4.355AsnAla: 4.355 ± 0.656
0.502AsnCys: 0.502 ± 0.279
2.596AsnAsp: 2.596 ± 0.519
3.015AsnGlu: 3.015 ± 0.545
1.591AsnPhe: 1.591 ± 0.357
5.276AsnGly: 5.276 ± 0.652
0.67AsnHis: 0.67 ± 0.227
2.596AsnIle: 2.596 ± 0.364
2.847AsnLys: 2.847 ± 0.56
3.35AsnLeu: 3.35 ± 0.485
1.089AsnMet: 1.089 ± 0.28
2.177AsnAsn: 2.177 ± 0.422
2.512AsnPro: 2.512 ± 0.468
1.926AsnGln: 1.926 ± 0.357
2.68AsnArg: 2.68 ± 0.616
3.517AsnSer: 3.517 ± 0.618
2.177AsnThr: 2.177 ± 0.493
3.769AsnVal: 3.769 ± 0.642
0.502AsnTrp: 0.502 ± 0.208
1.34AsnTyr: 1.34 ± 0.521
0.0AsnXaa: 0.0 ± 0.0
Pro
3.015ProAla: 3.015 ± 0.496
0.419ProCys: 0.419 ± 0.205
2.429ProAsp: 2.429 ± 0.451
3.936ProGlu: 3.936 ± 0.659
1.34ProPhe: 1.34 ± 0.26
1.759ProGly: 1.759 ± 0.365
0.419ProHis: 0.419 ± 0.18
1.172ProIle: 1.172 ± 0.288
2.345ProLys: 2.345 ± 0.455
2.261ProLeu: 2.261 ± 0.405
1.256ProMet: 1.256 ± 0.24
1.926ProAsn: 1.926 ± 0.332
0.586ProPro: 0.586 ± 0.211
1.591ProGln: 1.591 ± 0.297
1.507ProArg: 1.507 ± 0.313
2.68ProSer: 2.68 ± 0.41
2.512ProThr: 2.512 ± 0.459
2.764ProVal: 2.764 ± 0.464
0.67ProTrp: 0.67 ± 0.234
2.094ProTyr: 2.094 ± 0.483
0.0ProXaa: 0.0 ± 0.0
Gln
3.434GlnAla: 3.434 ± 0.531
0.335GlnCys: 0.335 ± 0.167
2.345GlnAsp: 2.345 ± 0.44
2.764GlnGlu: 2.764 ± 0.457
1.926GlnPhe: 1.926 ± 0.313
2.847GlnGly: 2.847 ± 0.554
0.754GlnHis: 0.754 ± 0.238
2.094GlnIle: 2.094 ± 0.381
2.512GlnLys: 2.512 ± 0.721
4.355GlnLeu: 4.355 ± 0.478
1.256GlnMet: 1.256 ± 0.398
1.256GlnAsn: 1.256 ± 0.312
1.591GlnPro: 1.591 ± 0.287
2.512GlnGln: 2.512 ± 0.801
1.675GlnArg: 1.675 ± 0.436
2.177GlnSer: 2.177 ± 0.443
1.675GlnThr: 1.675 ± 0.339
2.68GlnVal: 2.68 ± 0.416
0.921GlnTrp: 0.921 ± 0.242
1.34GlnTyr: 1.34 ± 0.428
0.0GlnXaa: 0.0 ± 0.0
Arg
4.355ArgAla: 4.355 ± 0.467
0.67ArgCys: 0.67 ± 0.27
3.35ArgAsp: 3.35 ± 0.48
3.936ArgGlu: 3.936 ± 0.702
1.591ArgPhe: 1.591 ± 0.399
3.936ArgGly: 3.936 ± 0.63
0.67ArgHis: 0.67 ± 0.237
3.015ArgIle: 3.015 ± 0.508
3.517ArgLys: 3.517 ± 0.475
4.773ArgLeu: 4.773 ± 0.577
1.507ArgMet: 1.507 ± 0.289
2.68ArgAsn: 2.68 ± 0.488
1.256ArgPro: 1.256 ± 0.321
1.926ArgGln: 1.926 ± 0.397
2.512ArgArg: 2.512 ± 0.356
3.35ArgSer: 3.35 ± 0.645
2.345ArgThr: 2.345 ± 0.375
4.438ArgVal: 4.438 ± 0.692
0.502ArgTrp: 0.502 ± 0.214
1.005ArgTyr: 1.005 ± 0.267
0.0ArgXaa: 0.0 ± 0.0
Ser
5.108SerAla: 5.108 ± 0.56
0.502SerCys: 0.502 ± 0.207
4.522SerAsp: 4.522 ± 0.55
3.852SerGlu: 3.852 ± 0.489
3.182SerPhe: 3.182 ± 0.366
4.606SerGly: 4.606 ± 0.743
1.089SerHis: 1.089 ± 0.253
3.936SerIle: 3.936 ± 0.664
4.271SerLys: 4.271 ± 0.496
4.187SerLeu: 4.187 ± 0.698
1.089SerMet: 1.089 ± 0.284
2.177SerAsn: 2.177 ± 0.391
2.429SerPro: 2.429 ± 0.378
1.759SerGln: 1.759 ± 0.366
3.434SerArg: 3.434 ± 0.606
2.847SerSer: 2.847 ± 0.47
4.187SerThr: 4.187 ± 0.673
4.02SerVal: 4.02 ± 0.583
1.172SerTrp: 1.172 ± 0.22
1.759SerTyr: 1.759 ± 0.483
0.0SerXaa: 0.0 ± 0.0
Thr
4.02ThrAla: 4.02 ± 0.611
0.335ThrCys: 0.335 ± 0.21
3.099ThrAsp: 3.099 ± 0.381
4.02ThrGlu: 4.02 ± 0.548
2.177ThrPhe: 2.177 ± 0.358
6.448ThrGly: 6.448 ± 0.573
1.005ThrHis: 1.005 ± 0.249
3.852ThrIle: 3.852 ± 0.621
4.355ThrLys: 4.355 ± 0.695
5.276ThrLeu: 5.276 ± 0.761
1.507ThrMet: 1.507 ± 0.38
2.847ThrAsn: 2.847 ± 0.527
2.596ThrPro: 2.596 ± 0.402
1.591ThrGln: 1.591 ± 0.426
2.261ThrArg: 2.261 ± 0.437
4.02ThrSer: 4.02 ± 0.648
3.685ThrThr: 3.685 ± 0.834
3.517ThrVal: 3.517 ± 0.696
0.502ThrTrp: 0.502 ± 0.222
1.256ThrTyr: 1.256 ± 0.371
0.0ThrXaa: 0.0 ± 0.0
Val
5.276ValAla: 5.276 ± 0.633
0.419ValCys: 0.419 ± 0.181
3.852ValAsp: 3.852 ± 0.524
4.773ValGlu: 4.773 ± 0.816
2.512ValPhe: 2.512 ± 0.562
5.36ValGly: 5.36 ± 0.681
1.675ValHis: 1.675 ± 0.375
3.015ValIle: 3.015 ± 0.527
4.271ValLys: 4.271 ± 0.527
5.192ValLeu: 5.192 ± 0.575
1.675ValMet: 1.675 ± 0.315
3.35ValAsn: 3.35 ± 0.515
3.266ValPro: 3.266 ± 0.467
2.68ValGln: 2.68 ± 0.373
4.02ValArg: 4.02 ± 0.44
3.852ValSer: 3.852 ± 0.53
5.36ValThr: 5.36 ± 0.667
5.611ValVal: 5.611 ± 0.744
0.837ValTrp: 0.837 ± 0.284
2.764ValTyr: 2.764 ± 0.476
0.0ValXaa: 0.0 ± 0.0
Trp
0.754TrpAla: 0.754 ± 0.301
0.586TrpCys: 0.586 ± 0.215
0.67TrpAsp: 0.67 ± 0.269
1.172TrpGlu: 1.172 ± 0.364
0.335TrpPhe: 0.335 ± 0.18
0.67TrpGly: 0.67 ± 0.248
0.335TrpHis: 0.335 ± 0.204
0.837TrpIle: 0.837 ± 0.213
1.34TrpLys: 1.34 ± 0.348
1.34TrpLeu: 1.34 ± 0.425
0.335TrpMet: 0.335 ± 0.164
0.837TrpAsn: 0.837 ± 0.251
0.335TrpPro: 0.335 ± 0.161
0.586TrpGln: 0.586 ± 0.214
0.837TrpArg: 0.837 ± 0.276
0.67TrpSer: 0.67 ± 0.372
1.089TrpThr: 1.089 ± 0.242
1.256TrpVal: 1.256 ± 0.388
0.251TrpTrp: 0.251 ± 0.141
0.251TrpTyr: 0.251 ± 0.126
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.512TyrAla: 2.512 ± 0.552
0.167TyrCys: 0.167 ± 0.135
2.429TyrAsp: 2.429 ± 0.568
2.429TyrGlu: 2.429 ± 0.422
0.837TyrPhe: 0.837 ± 0.259
3.266TyrGly: 3.266 ± 0.499
0.335TyrHis: 0.335 ± 0.193
1.759TyrIle: 1.759 ± 0.476
1.507TyrLys: 1.507 ± 0.359
1.926TyrLeu: 1.926 ± 0.343
0.921TyrMet: 0.921 ± 0.309
2.512TyrAsn: 2.512 ± 0.403
1.089TyrPro: 1.089 ± 0.379
1.926TyrGln: 1.926 ± 0.476
2.01TyrArg: 2.01 ± 0.278
1.507TyrSer: 1.507 ± 0.352
1.759TyrThr: 1.759 ± 0.413
2.764TyrVal: 2.764 ± 0.528
0.502TyrTrp: 0.502 ± 0.241
0.754TyrTyr: 0.754 ± 0.326
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 42 proteins (11942 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski