Amino acid dipepetide frequency for Escherichia phage myPSH2311

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.213AlaAla: 7.213 ± 1.087
0.892AlaCys: 0.892 ± 0.261
5.268AlaAsp: 5.268 ± 0.631
4.863AlaGlu: 4.863 ± 0.729
2.837AlaPhe: 2.837 ± 0.483
5.187AlaGly: 5.187 ± 1.053
1.135AlaHis: 1.135 ± 0.274
4.458AlaIle: 4.458 ± 0.65
5.106AlaLys: 5.106 ± 0.838
7.376AlaLeu: 7.376 ± 0.947
1.864AlaMet: 1.864 ± 0.367
3.809AlaAsn: 3.809 ± 0.647
2.269AlaPro: 2.269 ± 0.435
3.809AlaGln: 3.809 ± 0.786
3.647AlaArg: 3.647 ± 0.545
4.701AlaSer: 4.701 ± 0.667
3.647AlaThr: 3.647 ± 0.517
4.377AlaVal: 4.377 ± 0.511
1.135AlaTrp: 1.135 ± 0.281
1.864AlaTyr: 1.864 ± 0.395
0.0AlaXaa: 0.0 ± 0.0
Cys
0.648CysAla: 0.648 ± 0.28
0.0CysCys: 0.0 ± 0.0
0.243CysAsp: 0.243 ± 0.122
0.648CysGlu: 0.648 ± 0.282
0.243CysPhe: 0.243 ± 0.125
0.811CysGly: 0.811 ± 0.225
0.243CysHis: 0.243 ± 0.15
0.648CysIle: 0.648 ± 0.247
0.973CysLys: 0.973 ± 0.302
1.297CysLeu: 1.297 ± 0.31
0.405CysMet: 0.405 ± 0.196
0.486CysAsn: 0.486 ± 0.198
0.243CysPro: 0.243 ± 0.142
0.243CysGln: 0.243 ± 0.125
0.486CysArg: 0.486 ± 0.217
0.729CysSer: 0.729 ± 0.26
0.405CysThr: 0.405 ± 0.16
1.459CysVal: 1.459 ± 0.32
0.162CysTrp: 0.162 ± 0.101
0.567CysTyr: 0.567 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
4.62AspAla: 4.62 ± 0.54
0.567AspCys: 0.567 ± 0.198
3.242AspAsp: 3.242 ± 0.603
5.755AspGlu: 5.755 ± 0.856
2.107AspPhe: 2.107 ± 0.439
4.215AspGly: 4.215 ± 0.693
1.216AspHis: 1.216 ± 0.325
3.566AspIle: 3.566 ± 0.56
3.89AspLys: 3.89 ± 0.558
5.106AspLeu: 5.106 ± 0.683
2.188AspMet: 2.188 ± 0.428
3.08AspAsn: 3.08 ± 0.536
2.513AspPro: 2.513 ± 0.526
1.621AspGln: 1.621 ± 0.42
3.242AspArg: 3.242 ± 0.455
3.485AspSer: 3.485 ± 0.475
2.432AspThr: 2.432 ± 0.4
3.647AspVal: 3.647 ± 0.62
0.811AspTrp: 0.811 ± 0.276
2.675AspTyr: 2.675 ± 0.461
0.0AspXaa: 0.0 ± 0.0
Glu
5.836GluAla: 5.836 ± 0.793
1.054GluCys: 1.054 ± 0.313
5.268GluAsp: 5.268 ± 0.856
8.024GluGlu: 8.024 ± 1.338
2.594GluPhe: 2.594 ± 0.425
3.809GluGly: 3.809 ± 0.639
1.459GluHis: 1.459 ± 0.364
3.728GluIle: 3.728 ± 0.483
4.863GluLys: 4.863 ± 0.65
5.917GluLeu: 5.917 ± 0.803
3.08GluMet: 3.08 ± 0.401
3.809GluAsn: 3.809 ± 0.578
1.864GluPro: 1.864 ± 0.387
3.647GluGln: 3.647 ± 0.67
3.647GluArg: 3.647 ± 0.562
3.971GluSer: 3.971 ± 0.602
3.161GluThr: 3.161 ± 0.584
4.863GluVal: 4.863 ± 0.692
1.702GluTrp: 1.702 ± 0.346
2.513GluTyr: 2.513 ± 0.418
0.0GluXaa: 0.0 ± 0.0
Phe
2.269PheAla: 2.269 ± 0.455
0.729PheCys: 0.729 ± 0.222
3.242PheAsp: 3.242 ± 0.689
3.08PheGlu: 3.08 ± 0.555
1.783PhePhe: 1.783 ± 0.417
2.188PheGly: 2.188 ± 0.436
0.567PheHis: 0.567 ± 0.239
1.945PheIle: 1.945 ± 0.419
2.107PheLys: 2.107 ± 0.44
4.053PheLeu: 4.053 ± 0.728
0.567PheMet: 0.567 ± 0.196
3.08PheAsn: 3.08 ± 0.454
1.297PhePro: 1.297 ± 0.315
1.864PheGln: 1.864 ± 0.35
2.35PheArg: 2.35 ± 0.51
2.594PheSer: 2.594 ± 0.467
2.432PheThr: 2.432 ± 0.468
2.269PheVal: 2.269 ± 0.488
0.567PheTrp: 0.567 ± 0.22
1.945PheTyr: 1.945 ± 0.434
0.0PheXaa: 0.0 ± 0.0
Gly
4.944GlyAla: 4.944 ± 1.065
0.648GlyCys: 0.648 ± 0.226
4.296GlyAsp: 4.296 ± 0.68
3.971GlyGlu: 3.971 ± 0.703
2.918GlyPhe: 2.918 ± 0.517
5.187GlyGly: 5.187 ± 1.066
1.459GlyHis: 1.459 ± 0.344
3.728GlyIle: 3.728 ± 0.592
5.106GlyLys: 5.106 ± 0.735
5.917GlyLeu: 5.917 ± 0.824
2.756GlyMet: 2.756 ± 0.475
2.918GlyAsn: 2.918 ± 0.493
0.892GlyPro: 0.892 ± 0.31
2.918GlyGln: 2.918 ± 0.726
3.809GlyArg: 3.809 ± 0.574
6.403GlySer: 6.403 ± 1.02
2.594GlyThr: 2.594 ± 0.466
4.863GlyVal: 4.863 ± 0.65
1.54GlyTrp: 1.54 ± 0.327
2.675GlyTyr: 2.675 ± 0.431
0.0GlyXaa: 0.0 ± 0.0
His
1.135HisAla: 1.135 ± 0.318
0.162HisCys: 0.162 ± 0.104
1.297HisAsp: 1.297 ± 0.363
1.378HisGlu: 1.378 ± 0.417
0.567HisPhe: 0.567 ± 0.168
0.892HisGly: 0.892 ± 0.266
0.243HisHis: 0.243 ± 0.133
0.811HisIle: 0.811 ± 0.282
1.54HisLys: 1.54 ± 0.317
2.107HisLeu: 2.107 ± 0.472
0.567HisMet: 0.567 ± 0.215
1.135HisAsn: 1.135 ± 0.297
0.811HisPro: 0.811 ± 0.229
0.486HisGln: 0.486 ± 0.21
1.459HisArg: 1.459 ± 0.332
1.216HisSer: 1.216 ± 0.395
0.892HisThr: 0.892 ± 0.279
1.378HisVal: 1.378 ± 0.372
0.324HisTrp: 0.324 ± 0.164
0.973HisTyr: 0.973 ± 0.347
0.0HisXaa: 0.0 ± 0.0
Ile
3.242IleAla: 3.242 ± 0.427
0.729IleCys: 0.729 ± 0.287
3.161IleAsp: 3.161 ± 0.529
3.404IleGlu: 3.404 ± 0.493
1.945IlePhe: 1.945 ± 0.447
4.053IleGly: 4.053 ± 0.695
0.892IleHis: 0.892 ± 0.267
2.107IleIle: 2.107 ± 0.436
4.134IleLys: 4.134 ± 0.557
3.647IleLeu: 3.647 ± 0.48
1.297IleMet: 1.297 ± 0.353
3.728IleAsn: 3.728 ± 0.507
3.566IlePro: 3.566 ± 0.609
1.945IleGln: 1.945 ± 0.362
3.323IleArg: 3.323 ± 0.563
3.728IleSer: 3.728 ± 0.453
4.134IleThr: 4.134 ± 0.666
3.242IleVal: 3.242 ± 0.494
0.973IleTrp: 0.973 ± 0.325
2.269IleTyr: 2.269 ± 0.373
0.0IleXaa: 0.0 ± 0.0
Lys
5.025LysAla: 5.025 ± 0.664
0.486LysCys: 0.486 ± 0.239
4.215LysAsp: 4.215 ± 0.495
5.755LysGlu: 5.755 ± 0.853
2.35LysPhe: 2.35 ± 0.449
4.782LysGly: 4.782 ± 0.541
2.107LysHis: 2.107 ± 0.341
2.675LysIle: 2.675 ± 0.43
4.377LysLys: 4.377 ± 0.714
3.809LysLeu: 3.809 ± 0.567
2.269LysMet: 2.269 ± 0.474
3.404LysAsn: 3.404 ± 0.572
1.945LysPro: 1.945 ± 0.55
2.918LysGln: 2.918 ± 0.409
3.404LysArg: 3.404 ± 0.709
3.566LysSer: 3.566 ± 0.612
2.918LysThr: 2.918 ± 0.563
3.323LysVal: 3.323 ± 0.513
1.216LysTrp: 1.216 ± 0.337
2.675LysTyr: 2.675 ± 0.409
0.0LysXaa: 0.0 ± 0.0
Leu
6.241LeuAla: 6.241 ± 0.861
1.216LeuCys: 1.216 ± 0.354
3.242LeuAsp: 3.242 ± 0.549
6.484LeuGlu: 6.484 ± 0.762
3.404LeuPhe: 3.404 ± 0.496
6.403LeuGly: 6.403 ± 0.775
1.054LeuHis: 1.054 ± 0.265
3.971LeuIle: 3.971 ± 0.56
4.458LeuLys: 4.458 ± 0.702
4.377LeuLeu: 4.377 ± 0.63
2.432LeuMet: 2.432 ± 0.307
4.62LeuAsn: 4.62 ± 0.745
2.675LeuPro: 2.675 ± 0.376
3.404LeuGln: 3.404 ± 0.522
4.701LeuArg: 4.701 ± 0.561
5.755LeuSer: 5.755 ± 0.764
4.296LeuThr: 4.296 ± 0.696
5.998LeuVal: 5.998 ± 0.761
0.405LeuTrp: 0.405 ± 0.155
2.188LeuTyr: 2.188 ± 0.384
0.0LeuXaa: 0.0 ± 0.0
Met
1.945MetAla: 1.945 ± 0.335
0.405MetCys: 0.405 ± 0.199
1.54MetAsp: 1.54 ± 0.353
2.432MetGlu: 2.432 ± 0.588
1.702MetPhe: 1.702 ± 0.38
1.216MetGly: 1.216 ± 0.311
0.811MetHis: 0.811 ± 0.271
1.783MetIle: 1.783 ± 0.405
2.513MetLys: 2.513 ± 0.491
2.35MetLeu: 2.35 ± 0.399
0.892MetMet: 0.892 ± 0.265
1.864MetAsn: 1.864 ± 0.387
1.054MetPro: 1.054 ± 0.293
1.216MetGln: 1.216 ± 0.282
2.107MetArg: 2.107 ± 0.469
2.35MetSer: 2.35 ± 0.413
1.783MetThr: 1.783 ± 0.36
1.621MetVal: 1.621 ± 0.378
0.081MetTrp: 0.081 ± 0.075
0.811MetTyr: 0.811 ± 0.276
0.0MetXaa: 0.0 ± 0.0
Asn
4.701AsnAla: 4.701 ± 0.657
0.0AsnCys: 0.0 ± 0.0
2.837AsnAsp: 2.837 ± 0.452
2.675AsnGlu: 2.675 ± 0.548
2.837AsnPhe: 2.837 ± 0.525
4.863AsnGly: 4.863 ± 0.558
1.054AsnHis: 1.054 ± 0.295
3.566AsnIle: 3.566 ± 0.556
3.566AsnLys: 3.566 ± 0.551
4.782AsnLeu: 4.782 ± 0.72
1.621AsnMet: 1.621 ± 0.395
3.08AsnAsn: 3.08 ± 0.604
2.756AsnPro: 2.756 ± 0.407
2.918AsnGln: 2.918 ± 0.63
2.999AsnArg: 2.999 ± 0.394
4.053AsnSer: 4.053 ± 0.58
2.837AsnThr: 2.837 ± 0.458
3.647AsnVal: 3.647 ± 0.482
0.892AsnTrp: 0.892 ± 0.282
2.269AsnTyr: 2.269 ± 0.336
0.0AsnXaa: 0.0 ± 0.0
Pro
2.837ProAla: 2.837 ± 0.469
0.243ProCys: 0.243 ± 0.135
2.675ProAsp: 2.675 ± 0.556
2.432ProGlu: 2.432 ± 0.435
1.459ProPhe: 1.459 ± 0.36
1.054ProGly: 1.054 ± 0.283
0.811ProHis: 0.811 ± 0.291
1.54ProIle: 1.54 ± 0.372
2.188ProLys: 2.188 ± 0.492
2.756ProLeu: 2.756 ± 0.465
1.135ProMet: 1.135 ± 0.313
1.702ProAsn: 1.702 ± 0.375
1.864ProPro: 1.864 ± 0.582
1.054ProGln: 1.054 ± 0.342
1.459ProArg: 1.459 ± 0.405
2.594ProSer: 2.594 ± 0.517
2.026ProThr: 2.026 ± 0.523
2.756ProVal: 2.756 ± 0.567
0.648ProTrp: 0.648 ± 0.232
0.811ProTyr: 0.811 ± 0.28
0.0ProXaa: 0.0 ± 0.0
Gln
4.62GlnAla: 4.62 ± 0.847
0.973GlnCys: 0.973 ± 0.323
2.35GlnAsp: 2.35 ± 0.446
2.918GlnGlu: 2.918 ± 0.449
1.783GlnPhe: 1.783 ± 0.336
2.918GlnGly: 2.918 ± 0.755
0.324GlnHis: 0.324 ± 0.157
2.432GlnIle: 2.432 ± 0.473
2.513GlnLys: 2.513 ± 0.422
2.594GlnLeu: 2.594 ± 0.692
0.892GlnMet: 0.892 ± 0.316
3.323GlnAsn: 3.323 ± 0.785
0.729GlnPro: 0.729 ± 0.209
4.377GlnGln: 4.377 ± 1.655
2.432GlnArg: 2.432 ± 0.452
2.026GlnSer: 2.026 ± 0.435
1.54GlnThr: 1.54 ± 0.346
2.756GlnVal: 2.756 ± 0.399
0.811GlnTrp: 0.811 ± 0.23
1.378GlnTyr: 1.378 ± 0.38
0.0GlnXaa: 0.0 ± 0.0
Arg
4.053ArgAla: 4.053 ± 0.853
0.405ArgCys: 0.405 ± 0.146
3.89ArgAsp: 3.89 ± 0.599
3.971ArgGlu: 3.971 ± 0.721
2.513ArgPhe: 2.513 ± 0.472
3.485ArgGly: 3.485 ± 0.575
0.892ArgHis: 0.892 ± 0.23
2.513ArgIle: 2.513 ± 0.443
3.323ArgLys: 3.323 ± 0.616
3.404ArgLeu: 3.404 ± 0.486
1.54ArgMet: 1.54 ± 0.298
2.594ArgAsn: 2.594 ± 0.548
1.621ArgPro: 1.621 ± 0.337
2.513ArgGln: 2.513 ± 0.407
3.728ArgArg: 3.728 ± 0.645
3.728ArgSer: 3.728 ± 0.489
3.08ArgThr: 3.08 ± 0.497
4.458ArgVal: 4.458 ± 0.723
1.135ArgTrp: 1.135 ± 0.338
1.621ArgTyr: 1.621 ± 0.335
0.0ArgXaa: 0.0 ± 0.0
Ser
3.809SerAla: 3.809 ± 0.482
0.567SerCys: 0.567 ± 0.188
3.485SerAsp: 3.485 ± 0.531
4.215SerGlu: 4.215 ± 0.595
2.594SerPhe: 2.594 ± 0.494
6.646SerGly: 6.646 ± 0.961
1.783SerHis: 1.783 ± 0.36
3.566SerIle: 3.566 ± 0.583
3.728SerLys: 3.728 ± 0.575
5.268SerLeu: 5.268 ± 0.607
2.188SerMet: 2.188 ± 0.486
4.62SerAsn: 4.62 ± 0.778
2.513SerPro: 2.513 ± 0.43
1.864SerGln: 1.864 ± 0.541
2.999SerArg: 2.999 ± 0.478
3.485SerSer: 3.485 ± 0.564
3.404SerThr: 3.404 ± 0.457
3.242SerVal: 3.242 ± 0.433
0.892SerTrp: 0.892 ± 0.243
2.918SerTyr: 2.918 ± 0.528
0.0SerXaa: 0.0 ± 0.0
Thr
3.809ThrAla: 3.809 ± 0.514
0.486ThrCys: 0.486 ± 0.168
1.864ThrAsp: 1.864 ± 0.414
3.566ThrGlu: 3.566 ± 0.436
2.675ThrPhe: 2.675 ± 0.434
3.89ThrGly: 3.89 ± 0.543
1.216ThrHis: 1.216 ± 0.312
4.215ThrIle: 4.215 ± 0.539
2.594ThrLys: 2.594 ± 0.401
4.296ThrLeu: 4.296 ± 0.592
1.216ThrMet: 1.216 ± 0.365
2.999ThrAsn: 2.999 ± 0.457
2.107ThrPro: 2.107 ± 0.413
2.026ThrGln: 2.026 ± 0.546
1.783ThrArg: 1.783 ± 0.425
2.837ThrSer: 2.837 ± 0.426
1.945ThrThr: 1.945 ± 0.432
3.89ThrVal: 3.89 ± 0.656
0.973ThrTrp: 0.973 ± 0.322
2.594ThrTyr: 2.594 ± 0.515
0.0ThrXaa: 0.0 ± 0.0
Val
5.187ValAla: 5.187 ± 0.657
0.567ValCys: 0.567 ± 0.212
4.296ValAsp: 4.296 ± 0.521
5.755ValGlu: 5.755 ± 0.713
2.918ValPhe: 2.918 ± 0.509
4.944ValGly: 4.944 ± 0.57
1.216ValHis: 1.216 ± 0.312
4.62ValIle: 4.62 ± 0.542
3.485ValLys: 3.485 ± 0.445
4.458ValLeu: 4.458 ± 0.64
1.216ValMet: 1.216 ± 0.336
4.539ValAsn: 4.539 ± 0.624
1.783ValPro: 1.783 ± 0.514
2.594ValGln: 2.594 ± 0.443
3.161ValArg: 3.161 ± 0.494
2.756ValSer: 2.756 ± 0.392
4.296ValThr: 4.296 ± 0.775
4.782ValVal: 4.782 ± 0.686
0.973ValTrp: 0.973 ± 0.299
2.026ValTyr: 2.026 ± 0.486
0.0ValXaa: 0.0 ± 0.0
Trp
0.729TrpAla: 0.729 ± 0.264
0.243TrpCys: 0.243 ± 0.135
1.378TrpAsp: 1.378 ± 0.333
0.811TrpGlu: 0.811 ± 0.307
0.648TrpPhe: 0.648 ± 0.23
0.973TrpGly: 0.973 ± 0.303
0.243TrpHis: 0.243 ± 0.157
1.297TrpIle: 1.297 ± 0.26
0.648TrpLys: 0.648 ± 0.235
1.297TrpLeu: 1.297 ± 0.322
0.811TrpMet: 0.811 ± 0.238
1.297TrpAsn: 1.297 ± 0.33
0.243TrpPro: 0.243 ± 0.158
0.486TrpGln: 0.486 ± 0.213
1.054TrpArg: 1.054 ± 0.281
0.973TrpSer: 0.973 ± 0.269
1.054TrpThr: 1.054 ± 0.312
1.054TrpVal: 1.054 ± 0.292
0.162TrpTrp: 0.162 ± 0.113
0.324TrpTyr: 0.324 ± 0.178
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.594TyrAla: 2.594 ± 0.383
0.567TyrCys: 0.567 ± 0.208
2.188TyrAsp: 2.188 ± 0.457
2.756TyrGlu: 2.756 ± 0.591
1.054TyrPhe: 1.054 ± 0.294
2.026TyrGly: 2.026 ± 0.433
0.648TyrHis: 0.648 ± 0.264
2.35TyrIle: 2.35 ± 0.447
2.107TyrLys: 2.107 ± 0.443
2.513TyrLeu: 2.513 ± 0.477
1.459TyrMet: 1.459 ± 0.32
1.864TyrAsn: 1.864 ± 0.358
1.297TyrPro: 1.297 ± 0.414
1.783TyrGln: 1.783 ± 0.411
2.675TyrArg: 2.675 ± 0.624
2.837TyrSer: 2.837 ± 0.443
2.107TyrThr: 2.107 ± 0.417
1.945TyrVal: 1.945 ± 0.329
0.324TyrTrp: 0.324 ± 0.149
1.135TyrTyr: 1.135 ± 0.262
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 87 proteins (12339 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski