Amino acid dipepetide frequency for Escherichia phage TL-2011b

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.767AlaAla: 11.767 ± 1.312
0.322AlaCys: 0.322 ± 0.183
6.447AlaAsp: 6.447 ± 0.704
6.286AlaGlu: 6.286 ± 0.85
4.11AlaPhe: 4.11 ± 0.636
6.447AlaGly: 6.447 ± 1.0
1.209AlaHis: 1.209 ± 0.301
4.191AlaIle: 4.191 ± 0.775
4.11AlaLys: 4.11 ± 0.438
7.898AlaLeu: 7.898 ± 0.658
3.627AlaMet: 3.627 ± 0.615
3.868AlaAsn: 3.868 ± 0.656
2.176AlaPro: 2.176 ± 0.501
4.674AlaGln: 4.674 ± 1.002
5.48AlaArg: 5.48 ± 0.665
7.334AlaSer: 7.334 ± 0.689
4.513AlaThr: 4.513 ± 0.599
6.044AlaVal: 6.044 ± 0.808
1.451AlaTrp: 1.451 ± 0.377
2.418AlaTyr: 2.418 ± 0.497
0.0AlaXaa: 0.0 ± 0.0
Cys
0.725CysAla: 0.725 ± 0.282
0.242CysCys: 0.242 ± 0.174
0.645CysAsp: 0.645 ± 0.222
0.564CysGlu: 0.564 ± 0.198
0.645CysPhe: 0.645 ± 0.248
0.806CysGly: 0.806 ± 0.296
0.0CysHis: 0.0 ± 0.0
0.564CysIle: 0.564 ± 0.249
0.564CysLys: 0.564 ± 0.247
0.967CysLeu: 0.967 ± 0.312
0.645CysMet: 0.645 ± 0.228
0.322CysAsn: 0.322 ± 0.187
0.322CysPro: 0.322 ± 0.162
0.484CysGln: 0.484 ± 0.261
0.806CysArg: 0.806 ± 0.276
0.645CysSer: 0.645 ± 0.254
0.564CysThr: 0.564 ± 0.25
0.484CysVal: 0.484 ± 0.209
0.242CysTrp: 0.242 ± 0.135
0.403CysTyr: 0.403 ± 0.165
0.0CysXaa: 0.0 ± 0.0
Asp
6.367AspAla: 6.367 ± 0.854
0.564AspCys: 0.564 ± 0.212
3.466AspAsp: 3.466 ± 0.543
3.546AspGlu: 3.546 ± 0.628
1.451AspPhe: 1.451 ± 0.347
4.836AspGly: 4.836 ± 0.626
0.322AspHis: 0.322 ± 0.152
3.304AspIle: 3.304 ± 0.52
4.352AspLys: 4.352 ± 0.842
3.868AspLeu: 3.868 ± 0.545
1.773AspMet: 1.773 ± 0.467
2.176AspAsn: 2.176 ± 0.455
2.579AspPro: 2.579 ± 0.577
2.498AspGln: 2.498 ± 0.423
2.982AspArg: 2.982 ± 0.682
3.546AspSer: 3.546 ± 0.504
2.337AspThr: 2.337 ± 0.443
4.755AspVal: 4.755 ± 0.601
1.531AspTrp: 1.531 ± 0.297
2.337AspTyr: 2.337 ± 0.479
0.0AspXaa: 0.0 ± 0.0
Glu
6.286GluAla: 6.286 ± 0.656
0.806GluCys: 0.806 ± 0.215
3.949GluAsp: 3.949 ± 0.518
3.949GluGlu: 3.949 ± 0.799
2.74GluPhe: 2.74 ± 0.455
3.304GluGly: 3.304 ± 0.708
0.645GluHis: 0.645 ± 0.27
3.949GluIle: 3.949 ± 0.705
4.03GluLys: 4.03 ± 0.579
5.883GluLeu: 5.883 ± 0.899
2.337GluMet: 2.337 ± 0.468
2.579GluAsn: 2.579 ± 0.435
2.74GluPro: 2.74 ± 0.567
3.304GluGln: 3.304 ± 0.696
4.03GluArg: 4.03 ± 0.656
3.063GluSer: 3.063 ± 0.454
3.466GluThr: 3.466 ± 0.534
3.385GluVal: 3.385 ± 0.43
0.645GluTrp: 0.645 ± 0.23
2.579GluTyr: 2.579 ± 0.373
0.0GluXaa: 0.0 ± 0.0
Phe
2.74PheAla: 2.74 ± 0.496
0.403PheCys: 0.403 ± 0.159
2.095PheAsp: 2.095 ± 0.434
1.934PheGlu: 1.934 ± 0.395
1.773PhePhe: 1.773 ± 0.433
2.418PheGly: 2.418 ± 0.567
0.564PheHis: 0.564 ± 0.231
2.579PheIle: 2.579 ± 0.83
1.612PheLys: 1.612 ± 0.388
1.934PheLeu: 1.934 ± 0.438
1.048PheMet: 1.048 ± 0.271
2.095PheAsn: 2.095 ± 0.482
1.37PhePro: 1.37 ± 0.361
1.128PheGln: 1.128 ± 0.309
3.224PheArg: 3.224 ± 0.497
2.579PheSer: 2.579 ± 0.427
2.095PheThr: 2.095 ± 0.438
2.337PheVal: 2.337 ± 0.488
0.403PheTrp: 0.403 ± 0.182
1.612PheTyr: 1.612 ± 0.514
0.0PheXaa: 0.0 ± 0.0
Gly
4.997GlyAla: 4.997 ± 1.159
0.725GlyCys: 0.725 ± 0.332
3.949GlyAsp: 3.949 ± 0.763
4.836GlyGlu: 4.836 ± 0.583
3.224GlyPhe: 3.224 ± 0.537
5.722GlyGly: 5.722 ± 0.88
1.531GlyHis: 1.531 ± 0.358
3.868GlyIle: 3.868 ± 0.528
5.48GlyLys: 5.48 ± 0.803
4.433GlyLeu: 4.433 ± 0.666
2.74GlyMet: 2.74 ± 0.48
4.513GlyAsn: 4.513 ± 0.635
1.289GlyPro: 1.289 ± 0.357
2.901GlyGln: 2.901 ± 0.574
3.304GlyArg: 3.304 ± 0.604
3.949GlySer: 3.949 ± 0.663
4.191GlyThr: 4.191 ± 0.696
5.642GlyVal: 5.642 ± 0.619
1.692GlyTrp: 1.692 ± 0.399
2.418GlyTyr: 2.418 ± 0.404
0.0GlyXaa: 0.0 ± 0.0
His
1.37HisAla: 1.37 ± 0.372
0.322HisCys: 0.322 ± 0.186
1.048HisAsp: 1.048 ± 0.268
1.209HisGlu: 1.209 ± 0.387
0.322HisPhe: 0.322 ± 0.156
1.209HisGly: 1.209 ± 0.284
0.322HisHis: 0.322 ± 0.194
0.887HisIle: 0.887 ± 0.299
0.322HisLys: 0.322 ± 0.24
1.048HisLeu: 1.048 ± 0.344
0.161HisMet: 0.161 ± 0.111
1.209HisAsn: 1.209 ± 0.319
0.887HisPro: 0.887 ± 0.29
0.242HisGln: 0.242 ± 0.138
1.048HisArg: 1.048 ± 0.287
0.806HisSer: 0.806 ± 0.235
0.645HisThr: 0.645 ± 0.325
1.048HisVal: 1.048 ± 0.329
0.484HisTrp: 0.484 ± 0.232
0.484HisTyr: 0.484 ± 0.197
0.0HisXaa: 0.0 ± 0.0
Ile
5.158IleAla: 5.158 ± 0.617
0.645IleCys: 0.645 ± 0.227
4.03IleAsp: 4.03 ± 0.579
3.466IleGlu: 3.466 ± 0.449
2.015IlePhe: 2.015 ± 0.376
3.868IleGly: 3.868 ± 0.561
0.645IleHis: 0.645 ± 0.238
4.271IleIle: 4.271 ± 0.691
2.821IleLys: 2.821 ± 0.545
3.627IleLeu: 3.627 ± 0.776
0.725IleMet: 0.725 ± 0.249
4.271IleAsn: 4.271 ± 0.659
2.579IlePro: 2.579 ± 0.473
2.176IleGln: 2.176 ± 0.404
3.063IleArg: 3.063 ± 0.35
4.271IleSer: 4.271 ± 0.539
4.594IleThr: 4.594 ± 0.462
3.385IleVal: 3.385 ± 0.703
0.725IleTrp: 0.725 ± 0.218
1.451IleTyr: 1.451 ± 0.296
0.0IleXaa: 0.0 ± 0.0
Lys
5.077LysAla: 5.077 ± 0.71
1.048LysCys: 1.048 ± 0.333
1.692LysAsp: 1.692 ± 0.4
3.868LysGlu: 3.868 ± 0.534
1.37LysPhe: 1.37 ± 0.296
3.224LysGly: 3.224 ± 0.454
1.048LysHis: 1.048 ± 0.251
2.74LysIle: 2.74 ± 0.484
3.143LysLys: 3.143 ± 0.766
4.755LysLeu: 4.755 ± 0.595
1.531LysMet: 1.531 ± 0.375
2.821LysAsn: 2.821 ± 0.582
2.901LysPro: 2.901 ± 0.616
2.498LysGln: 2.498 ± 0.371
3.063LysArg: 3.063 ± 0.6
3.304LysSer: 3.304 ± 0.514
3.788LysThr: 3.788 ± 0.584
3.788LysVal: 3.788 ± 0.432
1.531LysTrp: 1.531 ± 0.317
2.337LysTyr: 2.337 ± 0.525
0.0LysXaa: 0.0 ± 0.0
Leu
7.495LeuAla: 7.495 ± 0.839
1.048LeuCys: 1.048 ± 0.322
4.271LeuAsp: 4.271 ± 0.489
4.594LeuGlu: 4.594 ± 0.667
2.901LeuPhe: 2.901 ± 0.395
4.433LeuGly: 4.433 ± 0.566
1.692LeuHis: 1.692 ± 0.294
4.755LeuIle: 4.755 ± 0.633
3.627LeuLys: 3.627 ± 0.671
5.4LeuLeu: 5.4 ± 0.705
2.337LeuMet: 2.337 ± 0.449
3.788LeuAsn: 3.788 ± 0.5
3.385LeuPro: 3.385 ± 0.477
3.224LeuGln: 3.224 ± 0.476
4.191LeuArg: 4.191 ± 0.639
5.883LeuSer: 5.883 ± 0.843
5.561LeuThr: 5.561 ± 0.527
4.674LeuVal: 4.674 ± 0.56
0.725LeuTrp: 0.725 ± 0.204
1.934LeuTyr: 1.934 ± 0.504
0.0LeuXaa: 0.0 ± 0.0
Met
3.707MetAla: 3.707 ± 0.716
0.242MetCys: 0.242 ± 0.153
1.128MetAsp: 1.128 ± 0.287
1.692MetGlu: 1.692 ± 0.338
1.37MetPhe: 1.37 ± 0.32
1.612MetGly: 1.612 ± 0.37
0.161MetHis: 0.161 ± 0.112
0.967MetIle: 0.967 ± 0.34
2.095MetLys: 2.095 ± 0.499
2.821MetLeu: 2.821 ± 0.516
0.967MetMet: 0.967 ± 0.393
1.773MetAsn: 1.773 ± 0.482
1.612MetPro: 1.612 ± 0.412
1.289MetGln: 1.289 ± 0.333
2.015MetArg: 2.015 ± 0.527
2.418MetSer: 2.418 ± 0.519
1.934MetThr: 1.934 ± 0.32
1.692MetVal: 1.692 ± 0.301
0.242MetTrp: 0.242 ± 0.131
0.725MetTyr: 0.725 ± 0.336
0.0MetXaa: 0.0 ± 0.0
Asn
4.271AsnAla: 4.271 ± 0.681
0.484AsnCys: 0.484 ± 0.23
3.224AsnAsp: 3.224 ± 0.462
3.063AsnGlu: 3.063 ± 0.555
1.531AsnPhe: 1.531 ± 0.32
5.48AsnGly: 5.48 ± 0.709
0.806AsnHis: 0.806 ± 0.256
2.498AsnIle: 2.498 ± 0.55
3.224AsnLys: 3.224 ± 0.554
2.66AsnLeu: 2.66 ± 0.485
1.289AsnMet: 1.289 ± 0.372
2.015AsnAsn: 2.015 ± 0.531
2.66AsnPro: 2.66 ± 0.486
2.095AsnGln: 2.095 ± 0.482
2.257AsnArg: 2.257 ± 0.504
2.74AsnSer: 2.74 ± 0.409
2.74AsnThr: 2.74 ± 0.479
2.821AsnVal: 2.821 ± 0.538
0.484AsnTrp: 0.484 ± 0.257
1.854AsnTyr: 1.854 ± 0.439
0.0AsnXaa: 0.0 ± 0.0
Pro
3.466ProAla: 3.466 ± 0.502
0.161ProCys: 0.161 ± 0.124
2.982ProAsp: 2.982 ± 0.55
4.11ProGlu: 4.11 ± 0.671
1.289ProPhe: 1.289 ± 0.297
3.707ProGly: 3.707 ± 0.645
0.484ProHis: 0.484 ± 0.18
1.934ProIle: 1.934 ± 0.49
1.289ProLys: 1.289 ± 0.287
2.982ProLeu: 2.982 ± 0.538
0.806ProMet: 0.806 ± 0.21
1.048ProAsn: 1.048 ± 0.336
1.692ProPro: 1.692 ± 0.437
2.176ProGln: 2.176 ± 0.341
1.451ProArg: 1.451 ± 0.38
4.11ProSer: 4.11 ± 0.713
1.531ProThr: 1.531 ± 0.412
3.788ProVal: 3.788 ± 0.503
0.645ProTrp: 0.645 ± 0.245
1.048ProTyr: 1.048 ± 0.264
0.0ProXaa: 0.0 ± 0.0
Gln
4.594GlnAla: 4.594 ± 0.707
0.322GlnCys: 0.322 ± 0.201
2.418GlnAsp: 2.418 ± 0.422
3.466GlnGlu: 3.466 ± 0.49
1.692GlnPhe: 1.692 ± 0.33
2.015GlnGly: 2.015 ± 0.413
0.564GlnHis: 0.564 ± 0.247
2.015GlnIle: 2.015 ± 0.369
2.418GlnLys: 2.418 ± 0.572
4.674GlnLeu: 4.674 ± 0.911
1.531GlnMet: 1.531 ± 0.38
1.531GlnAsn: 1.531 ± 0.323
1.37GlnPro: 1.37 ± 0.367
3.627GlnGln: 3.627 ± 0.644
3.063GlnArg: 3.063 ± 0.68
2.418GlnSer: 2.418 ± 0.469
2.821GlnThr: 2.821 ± 0.657
2.74GlnVal: 2.74 ± 0.564
1.128GlnTrp: 1.128 ± 0.34
2.015GlnTyr: 2.015 ± 0.536
0.0GlnXaa: 0.0 ± 0.0
Arg
4.674ArgAla: 4.674 ± 0.68
0.484ArgCys: 0.484 ± 0.257
3.707ArgAsp: 3.707 ± 0.655
3.868ArgGlu: 3.868 ± 0.587
1.854ArgPhe: 1.854 ± 0.365
2.901ArgGly: 2.901 ± 0.452
1.209ArgHis: 1.209 ± 0.365
4.03ArgIle: 4.03 ± 0.455
4.674ArgLys: 4.674 ± 0.671
4.755ArgLeu: 4.755 ± 0.481
1.692ArgMet: 1.692 ± 0.409
2.74ArgAsn: 2.74 ± 0.526
1.531ArgPro: 1.531 ± 0.304
3.063ArgGln: 3.063 ± 0.589
4.03ArgArg: 4.03 ± 0.619
3.224ArgSer: 3.224 ± 0.461
2.901ArgThr: 2.901 ± 0.638
3.868ArgVal: 3.868 ± 0.528
1.048ArgTrp: 1.048 ± 0.362
1.612ArgTyr: 1.612 ± 0.333
0.0ArgXaa: 0.0 ± 0.0
Ser
5.561SerAla: 5.561 ± 0.557
0.887SerCys: 0.887 ± 0.243
3.304SerAsp: 3.304 ± 0.575
3.627SerGlu: 3.627 ± 0.531
2.74SerPhe: 2.74 ± 0.767
5.964SerGly: 5.964 ± 0.713
0.887SerHis: 0.887 ± 0.302
3.546SerIle: 3.546 ± 0.663
3.304SerLys: 3.304 ± 0.332
5.4SerLeu: 5.4 ± 0.748
1.934SerMet: 1.934 ± 0.485
3.224SerAsn: 3.224 ± 0.562
3.304SerPro: 3.304 ± 0.528
2.901SerGln: 2.901 ± 0.392
3.868SerArg: 3.868 ± 0.753
4.03SerSer: 4.03 ± 0.713
3.949SerThr: 3.949 ± 0.572
4.836SerVal: 4.836 ± 0.851
1.37SerTrp: 1.37 ± 0.301
1.692SerTyr: 1.692 ± 0.386
0.0SerXaa: 0.0 ± 0.0
Thr
5.48ThrAla: 5.48 ± 0.914
0.564ThrCys: 0.564 ± 0.222
3.788ThrAsp: 3.788 ± 0.675
2.74ThrGlu: 2.74 ± 0.511
1.37ThrPhe: 1.37 ± 0.314
5.4ThrGly: 5.4 ± 0.847
0.967ThrHis: 0.967 ± 0.249
3.385ThrIle: 3.385 ± 0.636
2.74ThrLys: 2.74 ± 0.451
4.352ThrLeu: 4.352 ± 0.703
2.015ThrMet: 2.015 ± 0.423
1.692ThrAsn: 1.692 ± 0.286
3.627ThrPro: 3.627 ± 0.593
2.418ThrGln: 2.418 ± 0.547
2.901ThrArg: 2.901 ± 0.535
4.352ThrSer: 4.352 ± 0.652
3.063ThrThr: 3.063 ± 0.585
3.627ThrVal: 3.627 ± 0.664
0.725ThrTrp: 0.725 ± 0.288
1.612ThrTyr: 1.612 ± 0.366
0.0ThrXaa: 0.0 ± 0.0
Val
5.964ValAla: 5.964 ± 0.714
0.564ValCys: 0.564 ± 0.251
3.868ValAsp: 3.868 ± 0.63
3.788ValGlu: 3.788 ± 0.387
1.612ValPhe: 1.612 ± 0.422
3.707ValGly: 3.707 ± 0.797
0.967ValHis: 0.967 ± 0.304
4.433ValIle: 4.433 ± 0.682
3.143ValLys: 3.143 ± 0.473
5.158ValLeu: 5.158 ± 0.618
2.015ValMet: 2.015 ± 0.423
4.594ValAsn: 4.594 ± 0.704
2.337ValPro: 2.337 ± 0.352
3.143ValGln: 3.143 ± 0.441
4.11ValArg: 4.11 ± 0.459
4.836ValSer: 4.836 ± 0.571
4.11ValThr: 4.11 ± 0.647
4.594ValVal: 4.594 ± 0.814
0.806ValTrp: 0.806 ± 0.229
2.095ValTyr: 2.095 ± 0.414
0.0ValXaa: 0.0 ± 0.0
Trp
1.612TrpAla: 1.612 ± 0.346
0.242TrpCys: 0.242 ± 0.134
0.967TrpAsp: 0.967 ± 0.238
0.967TrpGlu: 0.967 ± 0.303
0.725TrpPhe: 0.725 ± 0.229
1.773TrpGly: 1.773 ± 0.391
0.322TrpHis: 0.322 ± 0.168
1.048TrpIle: 1.048 ± 0.296
0.967TrpLys: 0.967 ± 0.362
1.692TrpLeu: 1.692 ± 0.441
0.564TrpMet: 0.564 ± 0.232
0.645TrpAsn: 0.645 ± 0.204
0.564TrpPro: 0.564 ± 0.197
0.564TrpGln: 0.564 ± 0.246
1.048TrpArg: 1.048 ± 0.308
1.209TrpSer: 1.209 ± 0.305
0.564TrpThr: 0.564 ± 0.211
0.725TrpVal: 0.725 ± 0.233
0.242TrpTrp: 0.242 ± 0.138
0.242TrpTyr: 0.242 ± 0.14
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.063TyrAla: 3.063 ± 0.608
0.564TyrCys: 0.564 ± 0.233
1.612TyrAsp: 1.612 ± 0.32
2.095TyrGlu: 2.095 ± 0.321
1.209TyrPhe: 1.209 ± 0.326
2.579TyrGly: 2.579 ± 0.645
0.725TyrHis: 0.725 ± 0.357
2.821TyrIle: 2.821 ± 0.66
1.612TyrLys: 1.612 ± 0.257
1.612TyrLeu: 1.612 ± 0.266
0.645TyrMet: 0.645 ± 0.291
1.37TyrAsn: 1.37 ± 0.48
1.773TyrPro: 1.773 ± 0.378
2.015TyrGln: 2.015 ± 0.587
1.934TyrArg: 1.934 ± 0.44
1.612TyrSer: 1.612 ± 0.384
1.451TyrThr: 1.451 ± 0.359
1.612TyrVal: 1.612 ± 0.316
0.564TyrTrp: 0.564 ± 0.204
0.887TyrTyr: 0.887 ± 0.279
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (12409 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski