Amino acid dipepetide frequency for Escherichia phage BA14

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.564AlaAla: 8.564 ± 1.215
0.816AlaCys: 0.816 ± 0.231
5.546AlaAsp: 5.546 ± 0.81
6.362AlaGlu: 6.362 ± 0.721
3.507AlaPhe: 3.507 ± 0.456
8.075AlaGly: 8.075 ± 1.045
1.55AlaHis: 1.55 ± 0.295
4.16AlaIle: 4.16 ± 0.563
7.341AlaLys: 7.341 ± 0.851
7.341AlaLeu: 7.341 ± 0.98
2.855AlaMet: 2.855 ± 0.597
4.812AlaAsn: 4.812 ± 0.797
2.773AlaPro: 2.773 ± 0.396
4.16AlaGln: 4.16 ± 0.461
4.976AlaArg: 4.976 ± 0.582
4.812AlaSer: 4.812 ± 0.673
3.263AlaThr: 3.263 ± 0.568
5.383AlaVal: 5.383 ± 0.695
1.305AlaTrp: 1.305 ± 0.279
3.018AlaTyr: 3.018 ± 0.511
0.0AlaXaa: 0.0 ± 0.0
Cys
0.897CysAla: 0.897 ± 0.301
0.163CysCys: 0.163 ± 0.162
0.571CysAsp: 0.571 ± 0.295
0.571CysGlu: 0.571 ± 0.218
0.653CysPhe: 0.653 ± 0.247
0.734CysGly: 0.734 ± 0.269
0.326CysHis: 0.326 ± 0.147
0.326CysIle: 0.326 ± 0.156
0.326CysLys: 0.326 ± 0.16
1.305CysLeu: 1.305 ± 0.324
0.163CysMet: 0.163 ± 0.128
0.489CysAsn: 0.489 ± 0.2
0.734CysPro: 0.734 ± 0.307
0.571CysGln: 0.571 ± 0.204
0.653CysArg: 0.653 ± 0.284
0.734CysSer: 0.734 ± 0.25
0.489CysThr: 0.489 ± 0.262
1.06CysVal: 1.06 ± 0.366
0.163CysTrp: 0.163 ± 0.131
0.408CysTyr: 0.408 ± 0.234
0.0CysXaa: 0.0 ± 0.0
Asp
5.22AspAla: 5.22 ± 0.626
0.816AspCys: 0.816 ± 0.28
4.16AspAsp: 4.16 ± 0.971
3.589AspGlu: 3.589 ± 0.587
2.529AspPhe: 2.529 ± 0.455
6.199AspGly: 6.199 ± 0.575
1.06AspHis: 1.06 ± 0.295
3.263AspIle: 3.263 ± 0.412
3.67AspLys: 3.67 ± 0.716
3.1AspLeu: 3.1 ± 0.588
2.936AspMet: 2.936 ± 0.507
2.365AspAsn: 2.365 ± 0.319
3.263AspPro: 3.263 ± 0.512
1.958AspGln: 1.958 ± 0.394
3.018AspArg: 3.018 ± 0.7
3.426AspSer: 3.426 ± 0.506
3.181AspThr: 3.181 ± 0.462
4.16AspVal: 4.16 ± 0.523
1.06AspTrp: 1.06 ± 0.419
1.631AspTyr: 1.631 ± 0.247
0.0AspXaa: 0.0 ± 0.0
Glu
7.83GluAla: 7.83 ± 0.914
0.734GluCys: 0.734 ± 0.271
3.67GluAsp: 3.67 ± 0.554
4.812GluGlu: 4.812 ± 1.14
2.447GluPhe: 2.447 ± 0.464
4.976GluGly: 4.976 ± 0.728
1.794GluHis: 1.794 ± 0.402
3.181GluIle: 3.181 ± 0.501
3.263GluLys: 3.263 ± 0.626
6.444GluLeu: 6.444 ± 0.645
2.529GluMet: 2.529 ± 0.412
3.263GluAsn: 3.263 ± 0.546
2.121GluPro: 2.121 ± 0.429
3.67GluGln: 3.67 ± 0.442
4.405GluArg: 4.405 ± 0.537
3.181GluSer: 3.181 ± 0.538
3.67GluThr: 3.67 ± 0.468
4.405GluVal: 4.405 ± 0.557
1.468GluTrp: 1.468 ± 0.354
2.855GluTyr: 2.855 ± 0.448
0.0GluXaa: 0.0 ± 0.0
Phe
2.773PheAla: 2.773 ± 0.599
0.489PheCys: 0.489 ± 0.216
2.039PheAsp: 2.039 ± 0.444
2.121PheGlu: 2.121 ± 0.357
1.142PhePhe: 1.142 ± 0.257
2.855PheGly: 2.855 ± 0.584
0.408PheHis: 0.408 ± 0.191
1.794PheIle: 1.794 ± 0.426
2.447PheLys: 2.447 ± 0.456
3.1PheLeu: 3.1 ± 0.462
1.142PheMet: 1.142 ± 0.346
1.55PheAsn: 1.55 ± 0.392
1.958PhePro: 1.958 ± 0.458
1.223PheGln: 1.223 ± 0.292
2.121PheArg: 2.121 ± 0.328
2.039PheSer: 2.039 ± 0.5
3.344PheThr: 3.344 ± 0.498
2.039PheVal: 2.039 ± 0.379
0.163PheTrp: 0.163 ± 0.108
1.142PheTyr: 1.142 ± 0.235
0.0PheXaa: 0.0 ± 0.0
Gly
7.015GlyAla: 7.015 ± 0.871
0.897GlyCys: 0.897 ± 0.329
5.791GlyAsp: 5.791 ± 0.714
5.628GlyGlu: 5.628 ± 0.81
2.61GlyPhe: 2.61 ± 0.517
5.71GlyGly: 5.71 ± 0.757
1.06GlyHis: 1.06 ± 0.299
3.507GlyIle: 3.507 ± 0.654
6.607GlyLys: 6.607 ± 1.03
6.77GlyLeu: 6.77 ± 1.012
2.202GlyMet: 2.202 ± 0.365
3.1GlyAsn: 3.1 ± 0.472
0.326GlyPro: 0.326 ± 0.169
3.018GlyGln: 3.018 ± 0.485
4.568GlyArg: 4.568 ± 0.535
5.22GlySer: 5.22 ± 0.598
3.507GlyThr: 3.507 ± 0.39
4.812GlyVal: 4.812 ± 0.675
2.121GlyTrp: 2.121 ± 0.46
2.692GlyTyr: 2.692 ± 0.465
0.0GlyXaa: 0.0 ± 0.0
His
1.387HisAla: 1.387 ± 0.386
0.408HisCys: 0.408 ± 0.175
1.305HisAsp: 1.305 ± 0.333
1.55HisGlu: 1.55 ± 0.383
1.06HisPhe: 1.06 ± 0.272
1.55HisGly: 1.55 ± 0.321
0.489HisHis: 0.489 ± 0.218
1.06HisIle: 1.06 ± 0.246
1.223HisLys: 1.223 ± 0.321
1.794HisLeu: 1.794 ± 0.414
0.489HisMet: 0.489 ± 0.185
1.06HisAsn: 1.06 ± 0.39
0.326HisPro: 0.326 ± 0.16
0.245HisGln: 0.245 ± 0.133
0.734HisArg: 0.734 ± 0.219
1.06HisSer: 1.06 ± 0.26
1.223HisThr: 1.223 ± 0.308
1.55HisVal: 1.55 ± 0.301
0.408HisTrp: 0.408 ± 0.17
0.571HisTyr: 0.571 ± 0.172
0.0HisXaa: 0.0 ± 0.0
Ile
4.078IleAla: 4.078 ± 0.546
0.571IleCys: 0.571 ± 0.235
3.834IleAsp: 3.834 ± 0.452
3.181IleGlu: 3.181 ± 0.476
0.816IlePhe: 0.816 ± 0.206
3.507IleGly: 3.507 ± 0.479
0.979IleHis: 0.979 ± 0.281
2.529IleIle: 2.529 ± 0.587
3.344IleLys: 3.344 ± 0.495
3.344IleLeu: 3.344 ± 0.572
1.142IleMet: 1.142 ± 0.33
2.202IleAsn: 2.202 ± 0.579
2.447IlePro: 2.447 ± 0.458
1.958IleGln: 1.958 ± 0.469
3.263IleArg: 3.263 ± 0.429
2.855IleSer: 2.855 ± 0.516
2.202IleThr: 2.202 ± 0.395
2.936IleVal: 2.936 ± 0.6
1.06IleTrp: 1.06 ± 0.291
1.468IleTyr: 1.468 ± 0.279
0.0IleXaa: 0.0 ± 0.0
Lys
7.749LysAla: 7.749 ± 0.849
0.734LysCys: 0.734 ± 0.288
3.834LysAsp: 3.834 ± 0.493
5.954LysGlu: 5.954 ± 0.625
2.039LysPhe: 2.039 ± 0.431
5.139LysGly: 5.139 ± 0.518
1.631LysHis: 1.631 ± 0.42
1.713LysIle: 1.713 ± 0.279
5.383LysLys: 5.383 ± 0.806
5.465LysLeu: 5.465 ± 0.736
2.284LysMet: 2.284 ± 0.422
2.365LysAsn: 2.365 ± 0.429
3.018LysPro: 3.018 ± 0.74
2.692LysGln: 2.692 ± 0.505
3.589LysArg: 3.589 ± 0.659
3.997LysSer: 3.997 ± 0.583
3.507LysThr: 3.507 ± 0.473
5.791LysVal: 5.791 ± 0.752
0.489LysTrp: 0.489 ± 0.181
2.202LysTyr: 2.202 ± 0.339
0.0LysXaa: 0.0 ± 0.0
Leu
8.564LeuAla: 8.564 ± 0.9
0.489LeuCys: 0.489 ± 0.221
4.241LeuAsp: 4.241 ± 0.607
5.873LeuGlu: 5.873 ± 0.984
2.365LeuPhe: 2.365 ± 0.426
3.915LeuGly: 3.915 ± 0.582
1.305LeuHis: 1.305 ± 0.285
3.997LeuIle: 3.997 ± 0.522
6.607LeuLys: 6.607 ± 0.814
5.465LeuLeu: 5.465 ± 0.889
2.692LeuMet: 2.692 ± 0.554
3.834LeuAsn: 3.834 ± 0.738
2.855LeuPro: 2.855 ± 0.501
3.1LeuGln: 3.1 ± 0.526
6.117LeuArg: 6.117 ± 0.618
4.731LeuSer: 4.731 ± 0.626
4.812LeuThr: 4.812 ± 0.737
5.71LeuVal: 5.71 ± 0.597
1.223LeuTrp: 1.223 ± 0.335
2.447LeuTyr: 2.447 ± 0.45
0.0LeuXaa: 0.0 ± 0.0
Met
2.692MetAla: 2.692 ± 0.472
0.163MetCys: 0.163 ± 0.114
1.55MetAsp: 1.55 ± 0.45
1.794MetGlu: 1.794 ± 0.415
1.468MetPhe: 1.468 ± 0.305
2.365MetGly: 2.365 ± 0.553
0.408MetHis: 0.408 ± 0.21
1.55MetIle: 1.55 ± 0.308
2.202MetLys: 2.202 ± 0.406
3.018MetLeu: 3.018 ± 0.501
0.653MetMet: 0.653 ± 0.218
1.55MetAsn: 1.55 ± 0.362
1.305MetPro: 1.305 ± 0.355
1.387MetGln: 1.387 ± 0.349
1.223MetArg: 1.223 ± 0.323
2.284MetSer: 2.284 ± 0.262
1.876MetThr: 1.876 ± 0.395
2.447MetVal: 2.447 ± 0.547
0.163MetTrp: 0.163 ± 0.103
0.979MetTyr: 0.979 ± 0.259
0.0MetXaa: 0.0 ± 0.0
Asn
3.67AsnAla: 3.67 ± 0.626
0.653AsnCys: 0.653 ± 0.232
2.284AsnAsp: 2.284 ± 0.365
3.018AsnGlu: 3.018 ± 0.477
1.958AsnPhe: 1.958 ± 0.277
4.731AsnGly: 4.731 ± 0.784
1.305AsnHis: 1.305 ± 0.333
2.121AsnIle: 2.121 ± 0.387
3.018AsnLys: 3.018 ± 0.515
3.589AsnLeu: 3.589 ± 0.474
1.142AsnMet: 1.142 ± 0.288
1.713AsnAsn: 1.713 ± 0.48
2.447AsnPro: 2.447 ± 0.339
1.55AsnGln: 1.55 ± 0.292
2.692AsnArg: 2.692 ± 0.752
2.447AsnSer: 2.447 ± 0.502
2.529AsnThr: 2.529 ± 0.453
2.855AsnVal: 2.855 ± 0.495
0.571AsnTrp: 0.571 ± 0.221
1.958AsnTyr: 1.958 ± 0.444
0.0AsnXaa: 0.0 ± 0.0
Pro
2.365ProAla: 2.365 ± 0.454
0.408ProCys: 0.408 ± 0.176
2.61ProAsp: 2.61 ± 0.472
3.67ProGlu: 3.67 ± 0.777
1.387ProPhe: 1.387 ± 0.348
1.305ProGly: 1.305 ± 0.277
0.816ProHis: 0.816 ± 0.266
1.713ProIle: 1.713 ± 0.348
2.855ProLys: 2.855 ± 0.428
2.284ProLeu: 2.284 ± 0.419
0.979ProMet: 0.979 ± 0.391
2.284ProAsn: 2.284 ± 0.465
0.816ProPro: 0.816 ± 0.209
0.653ProGln: 0.653 ± 0.205
1.305ProArg: 1.305 ± 0.346
2.202ProSer: 2.202 ± 0.39
2.365ProThr: 2.365 ± 0.41
2.773ProVal: 2.773 ± 0.43
0.734ProTrp: 0.734 ± 0.246
1.794ProTyr: 1.794 ± 0.482
0.0ProXaa: 0.0 ± 0.0
Gln
4.568GlnAla: 4.568 ± 0.537
0.326GlnCys: 0.326 ± 0.167
1.958GlnAsp: 1.958 ± 0.303
2.855GlnGlu: 2.855 ± 0.489
1.958GlnPhe: 1.958 ± 0.31
2.936GlnGly: 2.936 ± 0.628
0.897GlnHis: 0.897 ± 0.267
1.876GlnIle: 1.876 ± 0.374
1.468GlnLys: 1.468 ± 0.481
3.263GlnLeu: 3.263 ± 0.621
1.387GlnMet: 1.387 ± 0.332
0.979GlnAsn: 0.979 ± 0.227
1.387GlnPro: 1.387 ± 0.386
1.713GlnGln: 1.713 ± 0.336
1.55GlnArg: 1.55 ± 0.282
1.794GlnSer: 1.794 ± 0.429
2.202GlnThr: 2.202 ± 0.452
2.121GlnVal: 2.121 ± 0.363
0.816GlnTrp: 0.816 ± 0.205
1.468GlnTyr: 1.468 ± 0.323
0.0GlnXaa: 0.0 ± 0.0
Arg
4.405ArgAla: 4.405 ± 0.83
0.816ArgCys: 0.816 ± 0.295
3.263ArgAsp: 3.263 ± 0.574
3.752ArgGlu: 3.752 ± 0.619
2.121ArgPhe: 2.121 ± 0.349
4.16ArgGly: 4.16 ± 0.571
0.979ArgHis: 0.979 ± 0.286
2.61ArgIle: 2.61 ± 0.532
4.078ArgLys: 4.078 ± 0.644
4.894ArgLeu: 4.894 ± 0.709
1.713ArgMet: 1.713 ± 0.422
3.507ArgAsn: 3.507 ± 0.572
1.55ArgPro: 1.55 ± 0.324
1.631ArgGln: 1.631 ± 0.365
1.876ArgArg: 1.876 ± 0.294
4.16ArgSer: 4.16 ± 0.672
3.018ArgThr: 3.018 ± 0.479
4.16ArgVal: 4.16 ± 0.557
0.816ArgTrp: 0.816 ± 0.294
1.55ArgTyr: 1.55 ± 0.225
0.0ArgXaa: 0.0 ± 0.0
Ser
4.649SerAla: 4.649 ± 0.634
0.734SerCys: 0.734 ± 0.269
4.16SerAsp: 4.16 ± 0.469
3.507SerGlu: 3.507 ± 0.507
2.202SerPhe: 2.202 ± 0.354
4.976SerGly: 4.976 ± 0.589
1.631SerHis: 1.631 ± 0.397
3.018SerIle: 3.018 ± 0.465
3.834SerLys: 3.834 ± 0.712
3.589SerLeu: 3.589 ± 0.567
2.039SerMet: 2.039 ± 0.389
2.692SerAsn: 2.692 ± 0.406
1.958SerPro: 1.958 ± 0.259
2.039SerGln: 2.039 ± 0.406
3.344SerArg: 3.344 ± 0.546
2.447SerSer: 2.447 ± 0.434
3.997SerThr: 3.997 ± 0.57
4.241SerVal: 4.241 ± 0.518
0.489SerTrp: 0.489 ± 0.188
2.121SerTyr: 2.121 ± 0.589
0.0SerXaa: 0.0 ± 0.0
Thr
3.752ThrAla: 3.752 ± 0.648
0.897ThrCys: 0.897 ± 0.29
3.181ThrAsp: 3.181 ± 0.353
3.834ThrGlu: 3.834 ± 0.395
1.958ThrPhe: 1.958 ± 0.385
5.302ThrGly: 5.302 ± 0.713
1.06ThrHis: 1.06 ± 0.325
3.426ThrIle: 3.426 ± 0.565
3.997ThrLys: 3.997 ± 0.48
5.873ThrLeu: 5.873 ± 0.731
1.142ThrMet: 1.142 ± 0.306
1.55ThrAsn: 1.55 ± 0.42
2.202ThrPro: 2.202 ± 0.358
1.794ThrGln: 1.794 ± 0.332
2.855ThrArg: 2.855 ± 0.433
3.426ThrSer: 3.426 ± 0.574
2.936ThrThr: 2.936 ± 0.491
4.241ThrVal: 4.241 ± 0.586
1.223ThrTrp: 1.223 ± 0.391
1.305ThrTyr: 1.305 ± 0.348
0.0ThrXaa: 0.0 ± 0.0
Val
6.77ValAla: 6.77 ± 0.734
0.489ValCys: 0.489 ± 0.23
3.67ValAsp: 3.67 ± 0.452
5.465ValGlu: 5.465 ± 0.831
2.447ValPhe: 2.447 ± 0.489
4.568ValGly: 4.568 ± 0.677
1.142ValHis: 1.142 ± 0.289
3.507ValIle: 3.507 ± 0.634
4.568ValLys: 4.568 ± 0.788
5.546ValLeu: 5.546 ± 0.677
2.039ValMet: 2.039 ± 0.451
3.752ValAsn: 3.752 ± 0.762
2.365ValPro: 2.365 ± 0.417
2.202ValGln: 2.202 ± 0.505
4.323ValArg: 4.323 ± 0.661
4.405ValSer: 4.405 ± 0.63
4.568ValThr: 4.568 ± 0.541
4.405ValVal: 4.405 ± 0.801
0.816ValTrp: 0.816 ± 0.281
2.039ValTyr: 2.039 ± 0.45
0.0ValXaa: 0.0 ± 0.0
Trp
0.489TrpAla: 0.489 ± 0.205
0.326TrpCys: 0.326 ± 0.184
0.245TrpAsp: 0.245 ± 0.148
0.897TrpGlu: 0.897 ± 0.251
0.408TrpPhe: 0.408 ± 0.181
1.142TrpGly: 1.142 ± 0.323
0.245TrpHis: 0.245 ± 0.15
0.653TrpIle: 0.653 ± 0.242
1.794TrpLys: 1.794 ± 0.375
1.794TrpLeu: 1.794 ± 0.451
0.571TrpMet: 0.571 ± 0.18
1.223TrpAsn: 1.223 ± 0.31
0.326TrpPro: 0.326 ± 0.18
0.571TrpGln: 0.571 ± 0.198
0.653TrpArg: 0.653 ± 0.211
0.979TrpSer: 0.979 ± 0.344
1.55TrpThr: 1.55 ± 0.352
1.387TrpVal: 1.387 ± 0.33
0.245TrpTrp: 0.245 ± 0.136
0.326TrpTyr: 0.326 ± 0.157
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.181TyrAla: 3.181 ± 0.394
0.408TyrCys: 0.408 ± 0.189
2.692TyrAsp: 2.692 ± 0.455
2.284TyrGlu: 2.284 ± 0.477
0.816TyrPhe: 0.816 ± 0.252
3.018TyrGly: 3.018 ± 0.383
0.408TyrHis: 0.408 ± 0.171
1.713TyrIle: 1.713 ± 0.492
1.55TyrLys: 1.55 ± 0.362
2.365TyrLeu: 2.365 ± 0.42
0.897TyrMet: 0.897 ± 0.222
1.958TyrAsn: 1.958 ± 0.35
1.142TyrPro: 1.142 ± 0.297
1.387TyrGln: 1.387 ± 0.426
1.794TyrArg: 1.794 ± 0.312
1.468TyrSer: 1.468 ± 0.368
1.713TyrThr: 1.713 ± 0.463
2.61TyrVal: 2.61 ± 0.409
0.571TyrTrp: 0.571 ± 0.174
0.816TyrTyr: 0.816 ± 0.246
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (12261 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski