Amino acid dipeptide frequency for Pseudomonas phage HU1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
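As a rough illustration of what such a frequency is, the following Python sketch counts overlapping residue pairs within each protein and reports them in per mille. It assumes one-letter sequence codes and that pairs never span two proteins; the function name `dipeptide_per_mille` and the toy sequences are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along each protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not real phage proteins.
toy_proteome = ["MAAL", "MAL"]
freqs = dipeptide_per_mille(toy_proteome)
```

A protein of length n contributes n - 1 pairs, so a proteome of 69 proteins and 13394 residues would yield 13325 pairs under these assumptions.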

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
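The arithmetic behind this expectation can be sketched in a few lines of Python. The snippet assumes that "expected" means the uniform value 1/400 (or 1/441 with Xaa) and that a cell is coloured simply by comparing it with that value; the exact rule used to colour the table is not stated here, and the names `expected_uniform_per_mille` and `classify` are hypothetical.

```python
STANDARD_AMINO_ACIDS = 20


def expected_uniform_per_mille(alphabet_size=STANDARD_AMINO_ACIDS):
    """Frequency (in per mille) of each dipeptide if all pairs were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed colouring rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"   # would be marked red
    if observed_per_mille < expected_per_mille:
        return "under"  # would be marked blue
    return "expected"


expected = expected_uniform_per_mille()  # 20 x 20 = 400 dipeptides
# Two values taken from the table (in per mille).
observed = {"AlaAla": 11.797, "CysCys": 0.373}
labels = {name: classify(value, expected) for name, value in observed.items()}
```

With 21 symbols (Xaa included) the same function gives 1000/441, roughly 2.27‰.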

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 11.797 ± 1.84 | 0.747 ± 0.233 | 5.899 ± 0.669 | 5.899 ± 0.877 | 2.837 ± 0.453 | 6.72 ± 0.732 | 1.419 ± 0.36 | 6.048 ± 0.563 | 5.6 ± 0.863 | 8.587 ± 0.849 | 4.107 ± 0.475 | 4.181 ± 0.603 | 3.211 ± 0.541 | 5.973 ± 1.074 | 5.301 ± 0.782 | 5.749 ± 0.663 | 6.272 ± 0.652 | 6.347 ± 0.532 | 2.091 ± 0.374 | 2.539 ± 0.366 | 0.0 ± 0.0 |
| Cys | 1.045 ± 0.276 | 0.373 ± 0.172 | 0.523 ± 0.206 | 0.299 ± 0.174 | 0.523 ± 0.224 | 1.12 ± 0.386 | 0.523 ± 0.267 | 0.523 ± 0.198 | 0.523 ± 0.242 | 0.597 ± 0.21 | 0.149 ± 0.107 | 0.224 ± 0.159 | 0.373 ± 0.178 | 0.373 ± 0.15 | 0.821 ± 0.336 | 0.971 ± 0.302 | 0.597 ± 0.21 | 0.448 ± 0.195 | 0.373 ± 0.184 | 0.373 ± 0.173 | 0.0 ± 0.0 |
| Asp | 5.899 ± 0.846 | 1.12 ± 0.322 | 4.331 ± 0.545 | 3.584 ± 0.563 | 1.941 ± 0.335 | 5.824 ± 0.625 | 0.448 ± 0.186 | 1.941 ± 0.317 | 3.435 ± 0.507 | 5.973 ± 0.838 | 2.016 ± 0.381 | 2.464 ± 0.36 | 2.987 ± 0.651 | 2.837 ± 0.649 | 3.808 ± 0.603 | 2.763 ± 0.338 | 3.136 ± 0.45 | 3.435 ± 0.66 | 0.896 ± 0.242 | 1.867 ± 0.285 | 0.0 ± 0.0 |
| Glu | 5.973 ± 0.759 | 0.672 ± 0.307 | 2.539 ± 0.501 | 3.435 ± 0.515 | 2.912 ± 0.521 | 4.256 ± 0.556 | 1.269 ± 0.287 | 3.435 ± 0.621 | 3.211 ± 0.441 | 7.168 ± 0.7 | 2.763 ± 0.512 | 2.24 ± 0.367 | 2.539 ± 0.4 | 3.808 ± 0.805 | 4.181 ± 0.587 | 3.061 ± 0.589 | 3.061 ± 0.469 | 4.405 ± 0.601 | 0.747 ± 0.264 | 2.315 ± 0.411 | 0.0 ± 0.0 |
| Phe | 3.435 ± 0.562 | 0.224 ± 0.124 | 2.389 ± 0.485 | 2.763 ± 0.488 | 1.195 ± 0.416 | 3.136 ± 0.521 | 0.523 ± 0.246 | 1.867 ± 0.345 | 1.941 ± 0.394 | 1.941 ± 0.465 | 0.523 ± 0.187 | 1.568 ± 0.324 | 1.867 ± 0.435 | 1.045 ± 0.3 | 1.867 ± 0.409 | 2.091 ± 0.34 | 2.389 ± 0.475 | 2.091 ± 0.529 | 0.523 ± 0.156 | 1.045 ± 0.275 | 0.0 ± 0.0 |
| Gly | 7.317 ± 0.66 | 1.12 ± 0.367 | 3.957 ± 0.592 | 5.899 ± 0.72 | 2.987 ± 0.488 | 7.243 ± 0.881 | 1.867 ± 0.436 | 3.733 ± 0.586 | 5.899 ± 0.598 | 6.048 ± 0.646 | 2.912 ± 0.377 | 2.613 ± 0.375 | 2.837 ± 0.489 | 3.435 ± 0.396 | 3.808 ± 0.551 | 3.659 ± 0.573 | 4.555 ± 0.722 | 5.675 ± 0.604 | 1.269 ± 0.238 | 2.837 ± 0.495 | 0.0 ± 0.0 |
| His | 1.568 ± 0.388 | 0.149 ± 0.118 | 0.821 ± 0.258 | 1.269 ± 0.296 | 1.045 ± 0.29 | 1.045 ± 0.3 | 0.224 ± 0.114 | 0.672 ± 0.237 | 1.045 ± 0.311 | 1.419 ± 0.343 | 0.075 ± 0.071 | 0.523 ± 0.178 | 0.523 ± 0.217 | 0.597 ± 0.222 | 0.747 ± 0.263 | 0.747 ± 0.224 | 1.045 ± 0.225 | 1.12 ± 0.36 | 0.597 ± 0.241 | 0.224 ± 0.123 | 0.0 ± 0.0 |
| Ile | 5.525 ± 0.766 | 0.299 ± 0.15 | 4.331 ± 0.641 | 3.285 ± 0.557 | 0.896 ± 0.239 | 4.181 ± 0.53 | 0.672 ± 0.21 | 2.837 ± 0.429 | 2.912 ± 0.562 | 2.763 ± 0.481 | 1.419 ± 0.302 | 2.165 ± 0.524 | 3.285 ± 0.574 | 2.987 ± 0.431 | 2.389 ± 0.357 | 3.659 ± 0.464 | 2.987 ± 0.372 | 3.285 ± 0.558 | 1.045 ± 0.277 | 1.269 ± 0.327 | 0.0 ± 0.0 |
| Lys | 6.272 ± 0.883 | 0.373 ± 0.171 | 3.211 ± 0.685 | 2.912 ± 0.498 | 1.867 ± 0.386 | 4.107 ± 0.576 | 1.045 ± 0.314 | 3.36 ± 0.562 | 2.837 ± 0.616 | 4.853 ± 0.713 | 1.941 ± 0.358 | 2.165 ± 0.438 | 3.061 ± 0.55 | 2.165 ± 0.345 | 3.061 ± 0.488 | 3.211 ± 0.626 | 3.211 ± 0.476 | 3.285 ± 0.524 | 0.747 ± 0.236 | 1.493 ± 0.255 | 0.0 ± 0.0 |
| Leu | 10.603 ± 1.104 | 0.597 ± 0.22 | 5.227 ± 0.636 | 5.003 ± 0.559 | 1.867 ± 0.301 | 6.197 ± 0.702 | 1.045 ± 0.299 | 3.733 ± 0.524 | 4.704 ± 0.475 | 5.077 ± 0.591 | 2.763 ± 0.477 | 3.957 ± 0.527 | 3.509 ± 0.705 | 3.808 ± 0.458 | 5.152 ± 0.552 | 4.779 ± 0.565 | 5.6 ± 0.621 | 4.48 ± 0.569 | 0.672 ± 0.198 | 2.016 ± 0.41 | 0.0 ± 0.0 |
| Met | 2.688 ± 0.489 | 0.448 ± 0.183 | 2.165 ± 0.429 | 1.717 ± 0.406 | 1.195 ± 0.319 | 2.539 ± 0.389 | 0.299 ± 0.147 | 1.269 ± 0.278 | 1.867 ± 0.363 | 2.24 ± 0.436 | 0.224 ± 0.139 | 1.344 ± 0.284 | 1.643 ± 0.362 | 1.568 ± 0.329 | 2.315 ± 0.443 | 2.912 ± 0.413 | 2.763 ± 0.446 | 1.269 ± 0.195 | 0.075 ± 0.07 | 0.448 ± 0.211 | 0.0 ± 0.0 |
| Asn | 4.107 ± 0.91 | 0.299 ± 0.166 | 2.464 ± 0.507 | 2.912 ± 0.532 | 1.344 ± 0.374 | 3.509 ± 0.526 | 0.672 ± 0.225 | 2.539 ± 0.546 | 2.24 ± 0.394 | 2.613 ± 0.49 | 1.717 ± 0.392 | 1.717 ± 0.374 | 1.941 ± 0.425 | 1.867 ± 0.509 | 2.389 ± 0.417 | 2.091 ± 0.48 | 2.837 ± 0.522 | 2.091 ± 0.391 | 0.373 ± 0.212 | 1.643 ± 0.413 | 0.0 ± 0.0 |
| Pro | 3.36 ± 0.514 | 0.373 ± 0.209 | 3.136 ± 0.575 | 3.957 ± 0.542 | 2.091 ± 0.328 | 3.136 ± 0.456 | 0.299 ± 0.148 | 1.568 ± 0.326 | 2.016 ± 0.523 | 3.211 ± 0.542 | 1.419 ± 0.309 | 1.643 ± 0.478 | 1.941 ± 0.417 | 2.613 ± 0.372 | 1.717 ± 0.313 | 1.643 ± 0.436 | 2.539 ± 0.376 | 3.136 ± 0.61 | 0.747 ± 0.204 | 1.493 ± 0.279 | 0.0 ± 0.0 |
| Gln | 6.272 ± 0.944 | 0.299 ± 0.169 | 2.539 ± 0.474 | 2.688 ± 0.459 | 1.717 ± 0.374 | 4.107 ± 0.665 | 0.597 ± 0.174 | 2.24 ± 0.42 | 2.763 ± 0.521 | 4.032 ± 0.56 | 1.568 ± 0.393 | 2.24 ± 0.43 | 1.792 ± 0.424 | 3.211 ± 0.796 | 3.509 ± 0.585 | 2.912 ± 0.431 | 2.315 ± 0.458 | 3.211 ± 0.531 | 0.597 ± 0.276 | 1.195 ± 0.378 | 0.0 ± 0.0 |
| Arg | 4.629 ± 0.804 | 0.821 ± 0.279 | 3.957 ± 0.564 | 5.301 ± 0.632 | 2.464 ± 0.497 | 3.733 ± 0.454 | 0.896 ± 0.292 | 3.36 ± 0.394 | 3.36 ± 0.468 | 5.301 ± 0.681 | 1.045 ± 0.216 | 2.613 ± 0.38 | 1.568 ± 0.382 | 3.061 ± 0.543 | 2.837 ± 0.425 | 2.464 ± 0.474 | 2.987 ± 0.517 | 3.883 ± 0.58 | 0.672 ± 0.206 | 1.717 ± 0.4 | 0.0 ± 0.0 |
| Ser | 5.152 ± 0.593 | 0.597 ± 0.226 | 3.061 ± 0.49 | 3.136 ± 0.465 | 1.568 ± 0.355 | 5.6 ± 0.753 | 1.045 ± 0.254 | 3.733 ± 0.497 | 2.912 ± 0.475 | 4.48 ± 0.626 | 1.568 ± 0.328 | 2.763 ± 0.5 | 1.941 ± 0.415 | 3.061 ± 0.486 | 2.688 ± 0.474 | 3.808 ± 0.629 | 3.211 ± 0.465 | 4.256 ± 0.577 | 0.672 ± 0.21 | 1.419 ± 0.407 | 0.0 ± 0.0 |
| Thr | 5.749 ± 0.814 | 0.597 ± 0.264 | 2.912 ± 0.476 | 3.584 ± 0.433 | 2.016 ± 0.393 | 5.376 ± 0.717 | 0.971 ± 0.273 | 3.061 ± 0.403 | 2.987 ± 0.52 | 5.152 ± 0.744 | 1.568 ± 0.379 | 2.613 ± 0.441 | 2.613 ± 0.556 | 2.389 ± 0.445 | 3.211 ± 0.414 | 3.509 ± 0.533 | 3.808 ± 0.506 | 4.928 ± 0.742 | 1.344 ± 0.343 | 1.493 ± 0.423 | 0.0 ± 0.0 |
| Val | 6.123 ± 0.622 | 0.672 ± 0.203 | 4.853 ± 0.648 | 3.957 ± 0.651 | 2.613 ± 0.432 | 4.555 ± 0.708 | 0.747 ± 0.223 | 4.107 ± 0.555 | 3.211 ± 0.501 | 5.077 ± 0.551 | 1.867 ± 0.333 | 2.837 ± 0.593 | 2.389 ± 0.482 | 2.837 ± 0.478 | 3.733 ± 0.513 | 4.256 ± 0.584 | 3.509 ± 0.637 | 4.928 ± 0.954 | 0.672 ± 0.264 | 1.867 ± 0.505 | 0.0 ± 0.0 |
| Trp | 1.419 ± 0.379 | 0.373 ± 0.18 | 1.195 ± 0.242 | 0.747 ± 0.177 | 0.373 ± 0.181 | 0.597 ± 0.296 | 0.373 ± 0.158 | 0.821 ± 0.218 | 0.672 ± 0.222 | 1.568 ± 0.382 | 0.448 ± 0.184 | 0.299 ± 0.181 | 0.597 ± 0.2 | 0.672 ± 0.195 | 1.195 ± 0.353 | 0.747 ± 0.213 | 0.971 ± 0.285 | 1.045 ± 0.416 | 0.448 ± 0.158 | 0.448 ± 0.146 | 0.0 ± 0.0 |
| Tyr | 2.389 ± 0.49 | 0.597 ± 0.242 | 1.269 ± 0.283 | 1.643 ± 0.31 | 0.971 ± 0.259 | 2.688 ± 0.474 | 0.523 ± 0.181 | 1.344 ± 0.312 | 1.045 ± 0.235 | 2.688 ± 0.438 | 0.821 ± 0.24 | 1.195 ± 0.344 | 1.419 ± 0.287 | 1.493 ± 0.463 | 1.941 ± 0.387 | 1.643 ± 0.358 | 2.091 ± 0.456 | 1.493 ± 0.408 | 0.448 ± 0.165 | 0.597 ± 0.224 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 69 proteins (13394 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
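One plausible reading of this note is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is taken as the error. Whether Proteome-pI uses the standard deviation, this resampling scheme, or a fixed seed is an assumption; `pair_per_mille`, `bootstrap_error` and the toy sequences are hypothetical names and data.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not real phage proteins.
toy_proteome = ["MAALK", "MKAAG", "MGLLA", "MAAAA"]
error = bootstrap_error(toy_proteome, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.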

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski