Amino acid dipeptide frequency for Pseudomonas phage vB_Pae_PS44

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
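The counting itself can be sketched as follows. This is a minimal illustration, not the code behind this page: the function name `dipeptide_per_mille` is hypothetical, sequences are assumed to be given in one-letter code (the table uses three-letter names), and it is assumed that overlapping adjacent pairs are counted within each protein and never across protein boundaries, which the page does not specify.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy input, not data from this proteome.
    print(dipeptide_per_mille(["AAC", "CA"]))
```

With this convention a protein of length n contributes n - 1 dipeptides, so the frequencies are relative to the total number of pairs rather than the total number of amino acids.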

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
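Because the table is in per mille while the random expectation is quoted in percent, it can help to put both on the same scale: 0.25% equals 2.5‰. The sketch below converts a table value to a fraction and compares it with that uniform expectation. The names `per_mille_to_fraction` and `representation` are hypothetical, and comparing against exactly 2.5‰ is an assumption; the page does not say whether the colouring also takes the error into account.

```python
# Uniform expectation over 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value from the table to a plain fraction."""
    return value * 1e-3


def representation(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a value relative to the uniform expectation (assumed threshold)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    # Two values taken from the table on this page.
    for name, value in [("AlaAla", 11.809), ("CysCys", 0.198)]:
        print(name, per_mille_to_fraction(value), representation(value))
```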

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.809 ± 0.758
AlaCys: 0.893 ± 0.232
AlaAsp: 5.557 ± 0.529
AlaGlu: 7.988 ± 1.14
AlaPhe: 3.721 ± 0.403
AlaGly: 7.492 ± 0.69
AlaHis: 1.439 ± 0.374
AlaIle: 6.45 ± 0.539
AlaLys: 6.004 ± 0.636
AlaLeu: 6.748 ± 0.595
AlaMet: 2.282 ± 0.311
AlaAsn: 4.466 ± 0.654
AlaPro: 4.218 ± 0.625
AlaGln: 4.019 ± 0.333
AlaArg: 5.855 ± 0.61
AlaSer: 5.805 ± 0.487
AlaThr: 4.813 ± 0.647
AlaVal: 6.252 ± 0.645
AlaTrp: 1.24 ± 0.283
AlaTyr: 2.282 ± 0.356
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.992 ± 0.222
CysCys: 0.198 ± 0.107
CysAsp: 0.546 ± 0.154
CysGlu: 0.695 ± 0.199
CysPhe: 0.447 ± 0.169
CysGly: 0.447 ± 0.14
CysHis: 0.05 ± 0.053
CysIle: 0.397 ± 0.12
CysLys: 0.496 ± 0.168
CysLeu: 0.645 ± 0.187
CysMet: 0.149 ± 0.085
CysAsn: 0.447 ± 0.136
CysPro: 0.645 ± 0.174
CysGln: 0.05 ± 0.046
CysArg: 0.595 ± 0.187
CysSer: 0.397 ± 0.156
CysThr: 0.744 ± 0.228
CysVal: 0.695 ± 0.196
CysTrp: 0.149 ± 0.076
CysTyr: 0.447 ± 0.163
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.614 ± 0.453
AspCys: 0.546 ± 0.14
AspAsp: 3.87 ± 0.627
AspGlu: 4.863 ± 0.578
AspPhe: 3.126 ± 0.351
AspGly: 6.301 ± 0.478
AspHis: 1.389 ± 0.265
AspIle: 3.771 ± 0.558
AspLys: 2.878 ± 0.344
AspLeu: 4.912 ± 0.51
AspMet: 1.637 ± 0.257
AspAsn: 1.836 ± 0.327
AspPro: 4.069 ± 0.43
AspGln: 2.431 ± 0.374
AspArg: 3.126 ± 0.373
AspSer: 3.721 ± 0.496
AspThr: 2.679 ± 0.345
AspVal: 3.572 ± 0.419
AspTrp: 1.489 ± 0.257
AspTyr: 1.737 ± 0.315
AspXaa: 0.05 ± 0.046
Glu
GluAla: 7.592 ± 0.856
GluCys: 0.447 ± 0.156
GluAsp: 4.019 ± 0.53
GluGlu: 6.004 ± 0.956
GluPhe: 3.126 ± 0.395
GluGly: 4.118 ± 0.655
GluHis: 2.084 ± 0.407
GluIle: 4.515 ± 0.509
GluLys: 4.069 ± 0.874
GluLeu: 6.053 ± 0.578
GluMet: 2.382 ± 0.378
GluAsn: 2.134 ± 0.269
GluPro: 2.828 ± 0.424
GluGln: 2.977 ± 0.452
GluArg: 5.011 ± 0.689
GluSer: 3.076 ± 0.395
GluThr: 2.729 ± 0.399
GluVal: 4.366 ± 0.489
GluTrp: 0.992 ± 0.221
GluTyr: 2.134 ± 0.423
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.168 ± 0.506
PheCys: 0.298 ± 0.112
PheAsp: 3.87 ± 0.459
PheGlu: 3.076 ± 0.49
PhePhe: 1.538 ± 0.341
PheGly: 3.771 ± 0.485
PheHis: 0.744 ± 0.223
PheIle: 2.332 ± 0.256
PheLys: 1.737 ± 0.371
PheLeu: 2.927 ± 0.425
PheMet: 0.943 ± 0.231
PheAsn: 2.034 ± 0.381
PhePro: 1.439 ± 0.242
PheGln: 1.489 ± 0.276
PheArg: 2.431 ± 0.352
PheSer: 2.282 ± 0.363
PheThr: 1.885 ± 0.252
PheVal: 2.531 ± 0.443
PheTrp: 0.695 ± 0.207
PheTyr: 1.389 ± 0.232
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.103 ± 0.699
GlyCys: 0.546 ± 0.188
GlyAsp: 4.118 ± 0.505
GlyGlu: 5.408 ± 0.56
GlyPhe: 3.225 ± 0.337
GlyGly: 4.565 ± 0.673
GlyHis: 1.29 ± 0.262
GlyIle: 4.664 ± 0.454
GlyLys: 3.324 ± 0.422
GlyLeu: 4.714 ± 0.501
GlyMet: 1.935 ± 0.309
GlyAsn: 2.779 ± 0.672
GlyPro: 2.481 ± 0.452
GlyGln: 2.878 ± 0.412
GlyArg: 5.16 ± 0.459
GlySer: 5.061 ± 0.699
GlyThr: 4.317 ± 0.506
GlyVal: 5.607 ± 0.449
GlyTrp: 2.084 ± 0.288
GlyTyr: 2.034 ± 0.291
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.24 ± 0.265
HisCys: 0.198 ± 0.114
HisAsp: 0.844 ± 0.199
HisGlu: 1.141 ± 0.28
HisPhe: 0.844 ± 0.161
HisGly: 1.489 ± 0.248
HisHis: 0.447 ± 0.167
HisIle: 0.943 ± 0.25
HisLys: 0.794 ± 0.223
HisLeu: 1.439 ± 0.262
HisMet: 0.595 ± 0.148
HisAsn: 0.546 ± 0.141
HisPro: 0.844 ± 0.164
HisGln: 0.893 ± 0.224
HisArg: 1.588 ± 0.327
HisSer: 0.943 ± 0.226
HisThr: 0.695 ± 0.203
HisVal: 0.992 ± 0.193
HisTrp: 0.248 ± 0.11
HisTyr: 0.645 ± 0.179
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.359 ± 0.507
IleCys: 0.695 ± 0.206
IleAsp: 4.466 ± 0.481
IleGlu: 3.572 ± 0.425
IlePhe: 1.786 ± 0.323
IleGly: 4.118 ± 0.585
IleHis: 1.141 ± 0.212
IleIle: 3.473 ± 0.407
IleLys: 3.275 ± 0.492
IleLeu: 4.366 ± 0.544
IleMet: 2.332 ± 0.357
IleAsn: 3.076 ± 0.306
IlePro: 3.523 ± 0.384
IleGln: 2.828 ± 0.442
IleArg: 3.821 ± 0.439
IleSer: 4.416 ± 0.5
IleThr: 3.473 ± 0.393
IleVal: 2.977 ± 0.438
IleTrp: 0.645 ± 0.219
IleTyr: 1.737 ± 0.27
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.244 ± 0.97
LysCys: 0.447 ± 0.157
LysAsp: 3.027 ± 0.432
LysGlu: 3.771 ± 0.795
LysPhe: 2.431 ± 0.301
LysGly: 3.225 ± 0.394
LysHis: 0.645 ± 0.197
LysIle: 3.126 ± 0.381
LysLys: 2.58 ± 0.441
LysLeu: 4.466 ± 0.455
LysMet: 1.439 ± 0.265
LysAsn: 1.885 ± 0.234
LysPro: 2.63 ± 0.358
LysGln: 1.34 ± 0.274
LysArg: 3.275 ± 0.51
LysSer: 3.176 ± 0.383
LysThr: 2.927 ± 0.436
LysVal: 3.821 ± 0.41
LysTrp: 0.546 ± 0.128
LysTyr: 1.538 ± 0.326
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.641 ± 0.606
LeuCys: 0.695 ± 0.205
LeuAsp: 5.26 ± 0.479
LeuGlu: 5.26 ± 0.626
LeuPhe: 2.481 ± 0.278
LeuGly: 5.011 ± 0.445
LeuHis: 0.844 ± 0.209
LeuIle: 3.87 ± 0.507
LeuLys: 4.218 ± 0.534
LeuLeu: 4.912 ± 0.594
LeuMet: 1.489 ± 0.29
LeuAsn: 3.424 ± 0.417
LeuPro: 3.275 ± 0.496
LeuGln: 3.076 ± 0.404
LeuArg: 4.813 ± 0.506
LeuSer: 5.557 ± 0.452
LeuThr: 3.622 ± 0.322
LeuVal: 5.309 ± 0.458
LeuTrp: 1.34 ± 0.249
LeuTyr: 2.134 ± 0.356
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.382 ± 0.312
MetCys: 0.149 ± 0.078
MetAsp: 1.637 ± 0.281
MetGlu: 1.637 ± 0.273
MetPhe: 1.191 ± 0.229
MetGly: 1.141 ± 0.259
MetHis: 0.198 ± 0.09
MetIle: 1.687 ± 0.214
MetLys: 1.637 ± 0.299
MetLeu: 1.935 ± 0.242
MetMet: 0.347 ± 0.158
MetAsn: 1.29 ± 0.212
MetPro: 0.943 ± 0.202
MetGln: 0.645 ± 0.224
MetArg: 1.637 ± 0.307
MetSer: 2.382 ± 0.313
MetThr: 2.034 ± 0.335
MetVal: 1.637 ± 0.245
MetTrp: 0.149 ± 0.107
MetTyr: 0.794 ± 0.15
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.317 ± 0.646
AsnCys: 0.248 ± 0.138
AsnAsp: 2.481 ± 0.349
AsnGlu: 2.134 ± 0.301
AsnPhe: 1.687 ± 0.332
AsnGly: 3.523 ± 0.445
AsnHis: 0.943 ± 0.225
AsnIle: 2.531 ± 0.365
AsnLys: 1.042 ± 0.214
AsnLeu: 3.821 ± 0.512
AsnMet: 0.943 ± 0.206
AsnAsn: 1.786 ± 0.493
AsnPro: 2.828 ± 0.374
AsnGln: 1.637 ± 0.343
AsnArg: 2.332 ± 0.347
AsnSer: 3.126 ± 0.449
AsnThr: 2.084 ± 0.541
AsnVal: 3.126 ± 0.529
AsnTrp: 0.695 ± 0.161
AsnTyr: 1.24 ± 0.294
AsnXaa: 0.05 ± 0.049
Pro
ProAla: 4.714 ± 0.712
ProCys: 0.149 ± 0.086
ProAsp: 3.672 ± 0.445
ProGlu: 3.87 ± 0.456
ProPhe: 2.084 ± 0.352
ProGly: 3.672 ± 0.421
ProHis: 0.844 ± 0.223
ProIle: 1.985 ± 0.354
ProLys: 1.885 ± 0.411
ProLeu: 2.282 ± 0.347
ProMet: 1.042 ± 0.225
ProAsn: 2.034 ± 0.324
ProPro: 2.084 ± 0.453
ProGln: 1.786 ± 0.252
ProArg: 2.332 ± 0.39
ProSer: 3.126 ± 0.339
ProThr: 2.779 ± 0.423
ProVal: 3.027 ± 0.377
ProTrp: 0.794 ± 0.241
ProTyr: 1.141 ± 0.245
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.672 ± 0.35
GlnCys: 0.546 ± 0.166
GlnAsp: 2.183 ± 0.319
GlnGlu: 2.779 ± 0.415
GlnPhe: 1.885 ± 0.314
GlnGly: 2.481 ± 0.401
GlnHis: 0.595 ± 0.174
GlnIle: 2.431 ± 0.308
GlnLys: 2.382 ± 0.392
GlnLeu: 3.176 ± 0.334
GlnMet: 0.844 ± 0.178
GlnAsn: 2.382 ± 0.36
GlnPro: 1.538 ± 0.26
GlnGln: 1.34 ± 0.205
GlnArg: 2.332 ± 0.353
GlnSer: 2.084 ± 0.352
GlnThr: 2.034 ± 0.308
GlnVal: 2.282 ± 0.357
GlnTrp: 0.347 ± 0.128
GlnTyr: 1.042 ± 0.238
GlnXaa: 0.05 ± 0.052
Arg
ArgAla: 5.905 ± 0.613
ArgCys: 0.595 ± 0.143
ArgAsp: 3.225 ± 0.409
ArgGlu: 4.416 ± 0.697
ArgPhe: 2.58 ± 0.331
ArgGly: 3.523 ± 0.414
ArgHis: 1.24 ± 0.29
ArgIle: 4.565 ± 0.414
ArgLys: 4.317 ± 0.482
ArgLeu: 5.656 ± 0.578
ArgMet: 1.737 ± 0.361
ArgAsn: 2.679 ± 0.379
ArgPro: 2.382 ± 0.374
ArgGln: 2.63 ± 0.351
ArgArg: 4.168 ± 0.633
ArgSer: 3.225 ± 0.357
ArgThr: 2.63 ± 0.452
ArgVal: 3.771 ± 0.489
ArgTrp: 0.893 ± 0.189
ArgTyr: 2.084 ± 0.327
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.947 ± 0.675
SerCys: 0.496 ± 0.159
SerAsp: 3.721 ± 0.416
SerGlu: 3.275 ± 0.407
SerPhe: 2.431 ± 0.36
SerGly: 4.168 ± 0.603
SerHis: 0.844 ± 0.224
SerIle: 3.721 ± 0.492
SerLys: 3.672 ± 0.479
SerLeu: 4.466 ± 0.44
SerMet: 1.737 ± 0.349
SerAsn: 2.233 ± 0.397
SerPro: 3.176 ± 0.38
SerGln: 2.679 ± 0.351
SerArg: 4.218 ± 0.494
SerSer: 4.714 ± 0.59
SerThr: 3.225 ± 0.53
SerVal: 4.366 ± 0.424
SerTrp: 0.844 ± 0.224
SerTyr: 2.282 ± 0.317
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.664 ± 0.516
ThrCys: 0.645 ± 0.158
ThrAsp: 2.233 ± 0.307
ThrGlu: 2.679 ± 0.406
ThrPhe: 2.531 ± 0.305
ThrGly: 5.061 ± 0.697
ThrHis: 0.744 ± 0.195
ThrIle: 3.672 ± 0.456
ThrLys: 3.424 ± 0.476
ThrLeu: 3.821 ± 0.42
ThrMet: 0.893 ± 0.2
ThrAsn: 1.737 ± 0.324
ThrPro: 2.779 ± 0.397
ThrGln: 1.439 ± 0.273
ThrArg: 2.481 ± 0.352
ThrSer: 4.069 ± 0.482
ThrThr: 2.58 ± 0.48
ThrVal: 3.324 ± 0.371
ThrTrp: 0.893 ± 0.241
ThrTyr: 1.29 ± 0.187
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.599 ± 0.627
ValCys: 0.943 ± 0.212
ValAsp: 5.061 ± 0.534
ValGlu: 5.26 ± 0.715
ValPhe: 2.63 ± 0.385
ValGly: 4.218 ± 0.415
ValHis: 1.141 ± 0.244
ValIle: 4.019 ± 0.487
ValLys: 3.771 ± 0.474
ValLeu: 4.416 ± 0.47
ValMet: 1.588 ± 0.319
ValAsn: 2.927 ± 0.502
ValPro: 2.282 ± 0.333
ValGln: 2.729 ± 0.36
ValArg: 3.92 ± 0.544
ValSer: 3.424 ± 0.365
ValThr: 3.523 ± 0.416
ValVal: 4.317 ± 0.586
ValTrp: 0.992 ± 0.179
ValTyr: 1.687 ± 0.347
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.389 ± 0.22
TrpCys: 0.198 ± 0.116
TrpAsp: 0.992 ± 0.192
TrpGlu: 1.191 ± 0.276
TrpPhe: 0.595 ± 0.193
TrpGly: 1.191 ± 0.298
TrpHis: 0.248 ± 0.111
TrpIle: 0.943 ± 0.221
TrpLys: 0.943 ± 0.162
TrpLeu: 1.191 ± 0.257
TrpMet: 0.347 ± 0.135
TrpAsn: 1.24 ± 0.263
TrpPro: 0.645 ± 0.157
TrpGln: 0.347 ± 0.113
TrpArg: 1.092 ± 0.212
TrpSer: 0.893 ± 0.198
TrpThr: 0.744 ± 0.191
TrpVal: 1.092 ± 0.227
TrpTrp: 0.198 ± 0.084
TrpTyr: 0.347 ± 0.117
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.985 ± 0.335
TyrCys: 0.298 ± 0.128
TyrAsp: 1.836 ± 0.306
TyrGlu: 1.588 ± 0.271
TyrPhe: 1.24 ± 0.215
TyrGly: 2.779 ± 0.337
TyrHis: 0.496 ± 0.148
TyrIle: 1.985 ± 0.267
TyrLys: 1.092 ± 0.253
TyrLeu: 2.282 ± 0.399
TyrMet: 0.744 ± 0.202
TyrAsn: 1.637 ± 0.302
TyrPro: 0.695 ± 0.187
TyrGln: 1.191 ± 0.275
TyrArg: 2.034 ± 0.312
TyrSer: 1.737 ± 0.349
TyrThr: 1.34 ± 0.227
TyrVal: 2.382 ± 0.448
TyrTrp: 0.595 ± 0.167
TyrTyr: 0.595 ± 0.174
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.05 ± 0.052
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.099 ± 0.064
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 94 proteins (20155 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
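Bootstrapping at the protein level means that whole proteins, not individual residues, are resampled with replacement. A minimal sketch of the idea is given below, under several assumptions: the function names `pair_per_mille` and `bootstrap_error` are hypothetical, sequences are in one-letter code, and the reported error is taken to be the standard deviation over the 100 resampled proteomes. The exact procedure used for this page may differ.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the frequency over resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy proteome, not data from this page.
    toy = ["AAAC", "CCAA", "ACAC", "AAAA"]
    print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins keeps each protein's internal composition intact, so the spread reflects how much a dipeptide's frequency depends on which proteins happen to be in the proteome.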

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski