Amino acid dipeptide frequency for Pseudomonas phage vB_PaeS_PM105

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. the frequency of each ordered pair of adjacent residues.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the table marks overrepresented dipeptides in red and underrepresented ones in blue.
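
The counting described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the method used by Proteome-pI: the function names `dipeptide_permille` and `classify` are hypothetical, pairs are counted as overlapping windows inside each protein (never across protein boundaries), and any non-standard letter is mapped to `X` (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"      # 20 standard amino acids, one-letter codes
UNIFORM_PERMILLE = 1000.0 / 400        # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_permille(proteins, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X (Xaa).
        seq = "".join(c if c in alphabet else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"       # shown in red in the original table
    if freq_permille < expected:
        return "under"      # shown in blue in the original table
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["MKAAL", "AALX"])
    for pair, value in sorted(freqs.items()):
        print(pair, round(value, 3), classify(value))
```

With real proteomes the uniform 2.5‰ baseline is a coarse reference, since single amino acid frequencies are themselves far from uniform.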

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
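
As a quick worked example of this unit conversion, the snippet below converts the AlaAla value from the table (17.576‰) to a fraction and to a percentage. The helper name `permille_to_fraction` is hypothetical and used only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala = permille_to_fraction(17.576)   # AlaAla value taken from the table
print(f"{ala_ala:.6f}")                  # fraction: 0.017576
print(f"{ala_ala * 100:.4f}%")           # percentage: 1.7576%
```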

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 17.576 ± 2.515 | 1.025 ± 0.368 | 7.881 ± 0.568 | 9.852 ± 1.134 | 3.231 ± 0.55 | 9.852 ± 0.764 | 2.128 ± 0.363 | 6.305 ± 0.694 | 4.02 ± 0.595 | 10.64 ± 0.922 | 3.625 ± 0.597 | 3.31 ± 0.52 | 5.675 ± 0.691 | 6.069 ± 0.768 | 8.039 ± 0.778 | 6.699 ± 1.074 | 7.881 ± 0.646 | 7.33 ± 0.762 | 2.522 ± 0.327 | 3.231 ± 0.613 | 0.0 ± 0.0 |
| Cys | 0.946 ± 0.297 | 0.158 ± 0.11 | 0.631 ± 0.195 | 0.315 ± 0.148 | 0.158 ± 0.11 | 0.709 ± 0.21 | 0.236 ± 0.127 | 0.315 ± 0.148 | 0.158 ± 0.124 | 0.709 ± 0.225 | 0.158 ± 0.126 | 0.394 ± 0.158 | 0.473 ± 0.188 | 0.709 ± 0.263 | 0.552 ± 0.21 | 0.315 ± 0.21 | 0.315 ± 0.156 | 0.473 ± 0.193 | 0.0 ± 0.0 | 0.236 ± 0.128 | 0.0 ± 0.0 |
| Asp | 5.438 ± 0.478 | 0.473 ± 0.176 | 2.916 ± 0.467 | 4.965 ± 0.634 | 1.813 ± 0.403 | 5.281 ± 0.572 | 0.946 ± 0.275 | 2.522 ± 0.463 | 1.813 ± 0.38 | 5.911 ± 0.649 | 0.709 ± 0.278 | 1.655 ± 0.296 | 2.995 ± 0.447 | 3.941 ± 0.554 | 3.783 ± 0.578 | 3.31 ± 0.495 | 2.522 ± 0.438 | 3.625 ± 0.555 | 0.946 ± 0.278 | 1.734 ± 0.292 | 0.0 ± 0.0 |
| Glu | 7.172 ± 0.967 | 1.025 ± 0.248 | 2.68 ± 0.513 | 4.335 ± 0.606 | 1.892 ± 0.414 | 4.098 ± 0.608 | 1.103 ± 0.248 | 2.601 ± 0.309 | 2.364 ± 0.423 | 8.118 ± 0.94 | 1.97 ± 0.399 | 1.813 ± 0.321 | 2.68 ± 0.632 | 5.832 ± 0.777 | 5.123 ± 0.653 | 3.468 ± 0.547 | 2.443 ± 0.375 | 3.31 ± 0.486 | 1.34 ± 0.364 | 1.576 ± 0.329 | 0.0 ± 0.0 |
| Phe | 3.31 ± 0.624 | 0.158 ± 0.134 | 2.207 ± 0.384 | 1.497 ± 0.367 | 0.946 ± 0.23 | 3.31 ± 0.487 | 0.473 ± 0.168 | 0.788 ± 0.245 | 0.709 ± 0.225 | 2.049 ± 0.378 | 0.631 ± 0.179 | 1.34 ± 0.353 | 1.103 ± 0.274 | 1.025 ± 0.336 | 2.364 ± 0.447 | 2.049 ± 0.378 | 1.419 ± 0.365 | 1.576 ± 0.388 | 0.552 ± 0.229 | 0.709 ± 0.229 | 0.0 ± 0.0 |
| Gly | 8.433 ± 1.051 | 0.788 ± 0.213 | 4.729 ± 0.551 | 3.862 ± 0.434 | 2.995 ± 0.414 | 6.148 ± 1.057 | 0.788 ± 0.248 | 2.601 ± 0.347 | 3.625 ± 0.553 | 7.487 ± 0.871 | 1.892 ± 0.327 | 2.759 ± 0.47 | 2.364 ± 0.405 | 4.492 ± 0.663 | 6.936 ± 0.658 | 4.571 ± 0.787 | 4.965 ± 0.771 | 4.098 ± 0.536 | 2.128 ± 0.305 | 1.892 ± 0.472 | 0.0 ± 0.0 |
| His | 2.049 ± 0.362 | 0.0 ± 0.0 | 0.946 ± 0.283 | 0.867 ± 0.249 | 0.236 ± 0.128 | 1.892 ± 0.396 | 0.394 ± 0.149 | 1.025 ± 0.253 | 0.236 ± 0.143 | 1.576 ± 0.396 | 0.236 ± 0.158 | 0.709 ± 0.203 | 1.261 ± 0.337 | 1.025 ± 0.341 | 1.655 ± 0.375 | 0.867 ± 0.252 | 0.788 ± 0.276 | 0.788 ± 0.278 | 0.236 ± 0.129 | 0.788 ± 0.259 | 0.0 ± 0.0 |
| Ile | 5.675 ± 0.693 | 0.473 ± 0.213 | 2.995 ± 0.482 | 4.02 ± 0.596 | 0.394 ± 0.141 | 2.759 ± 0.558 | 0.867 ± 0.233 | 1.025 ± 0.295 | 1.025 ± 0.241 | 3.389 ± 0.404 | 0.552 ± 0.215 | 1.261 ± 0.283 | 1.97 ± 0.361 | 2.522 ± 0.415 | 3.468 ± 0.542 | 2.522 ± 0.425 | 3.625 ± 0.486 | 2.364 ± 0.426 | 0.709 ± 0.329 | 1.103 ± 0.333 | 0.0 ± 0.0 |
| Lys | 6.463 ± 0.747 | 0.158 ± 0.1 | 1.734 ± 0.466 | 1.576 ± 0.297 | 1.025 ± 0.31 | 2.207 ± 0.405 | 0.473 ± 0.167 | 1.419 ± 0.29 | 1.497 ± 0.265 | 3.862 ± 0.56 | 0.394 ± 0.194 | 1.103 ± 0.263 | 1.97 ± 0.46 | 1.97 ± 0.333 | 3.468 ± 0.715 | 1.734 ± 0.343 | 1.34 ± 0.349 | 1.655 ± 0.411 | 0.158 ± 0.103 | 0.552 ± 0.219 | 0.0 ± 0.0 |
| Leu | 12.926 ± 0.861 | 1.182 ± 0.364 | 6.778 ± 0.629 | 5.517 ± 0.566 | 1.97 ± 0.37 | 7.093 ± 0.767 | 1.734 ± 0.427 | 5.123 ± 0.602 | 2.995 ± 0.624 | 7.96 ± 0.732 | 1.892 ± 0.399 | 3.31 ± 0.464 | 5.675 ± 0.903 | 6.778 ± 0.884 | 7.172 ± 0.804 | 5.517 ± 0.44 | 5.359 ± 0.587 | 6.778 ± 0.841 | 0.315 ± 0.156 | 2.049 ± 0.434 | 0.0 ± 0.0 |
| Met | 3.389 ± 0.574 | 0.0 ± 0.0 | 1.497 ± 0.334 | 1.025 ± 0.339 | 0.631 ± 0.228 | 1.419 ± 0.338 | 0.158 ± 0.109 | 1.025 ± 0.294 | 1.182 ± 0.323 | 1.419 ± 0.402 | 0.079 ± 0.089 | 1.103 ± 0.314 | 1.261 ± 0.311 | 0.709 ± 0.205 | 0.867 ± 0.218 | 2.128 ± 0.383 | 1.813 ± 0.413 | 1.103 ± 0.283 | 0.236 ± 0.131 | 0.394 ± 0.274 | 0.0 ± 0.0 |
| Asn | 4.256 ± 0.508 | 0.315 ± 0.169 | 1.576 ± 0.35 | 1.576 ± 0.309 | 0.867 ± 0.188 | 2.68 ± 0.495 | 1.025 ± 0.406 | 1.576 ± 0.366 | 0.946 ± 0.223 | 2.207 ± 0.447 | 0.631 ± 0.226 | 1.497 ± 0.382 | 2.128 ± 0.438 | 2.522 ± 0.403 | 2.049 ± 0.274 | 1.34 ± 0.394 | 1.576 ± 0.411 | 1.576 ± 0.306 | 0.394 ± 0.162 | 1.34 ± 0.345 | 0.0 ± 0.0 |
| Pro | 6.936 ± 0.794 | 0.394 ± 0.203 | 2.759 ± 0.541 | 4.808 ± 0.739 | 2.207 ± 0.608 | 4.492 ± 0.549 | 0.946 ± 0.306 | 1.419 ± 0.31 | 1.892 ± 0.338 | 3.625 ± 0.579 | 1.103 ± 0.318 | 1.497 ± 0.4 | 1.97 ± 0.378 | 2.128 ± 0.429 | 2.286 ± 0.434 | 2.995 ± 0.406 | 1.813 ± 0.349 | 4.256 ± 0.671 | 0.473 ± 0.185 | 1.103 ± 0.293 | 0.0 ± 0.0 |
| Gln | 7.803 ± 0.739 | 0.236 ± 0.154 | 2.128 ± 0.405 | 3.231 ± 0.509 | 1.419 ± 0.279 | 2.916 ± 0.47 | 1.182 ± 0.33 | 2.522 ± 0.474 | 2.049 ± 0.434 | 8.039 ± 1.026 | 0.946 ± 0.224 | 0.788 ± 0.215 | 2.995 ± 0.684 | 3.704 ± 0.822 | 4.965 ± 0.564 | 2.128 ± 0.382 | 2.443 ± 0.465 | 4.02 ± 0.601 | 0.867 ± 0.289 | 1.34 ± 0.349 | 0.0 ± 0.0 |
| Arg | 7.724 ± 0.712 | 0.473 ± 0.194 | 4.256 ± 0.562 | 4.256 ± 0.527 | 1.813 ± 0.353 | 4.887 ± 0.569 | 1.576 ± 0.338 | 3.468 ± 0.658 | 3.074 ± 0.675 | 8.197 ± 0.51 | 1.892 ± 0.373 | 2.916 ± 0.508 | 3.074 ± 0.49 | 3.389 ± 0.569 | 5.517 ± 0.78 | 3.389 ± 0.488 | 3.153 ± 0.491 | 3.862 ± 0.666 | 1.655 ± 0.482 | 2.837 ± 0.445 | 0.0 ± 0.0 |
| Ser | 8.354 ± 1.336 | 0.236 ± 0.141 | 3.468 ± 0.513 | 2.68 ± 0.543 | 2.128 ± 0.377 | 4.571 ± 0.554 | 0.236 ± 0.139 | 2.207 ± 0.366 | 1.734 ± 0.395 | 5.99 ± 0.643 | 1.182 ± 0.307 | 1.813 ± 0.407 | 2.837 ± 0.535 | 2.364 ± 0.525 | 2.916 ± 0.428 | 3.625 ± 0.569 | 3.625 ± 0.499 | 4.02 ± 0.541 | 0.631 ± 0.241 | 1.576 ± 0.287 | 0.0 ± 0.0 |
| Thr | 6.778 ± 0.702 | 0.315 ± 0.152 | 2.916 ± 0.508 | 3.783 ± 0.425 | 1.34 ± 0.338 | 4.808 ± 0.687 | 1.34 ± 0.326 | 1.419 ± 0.311 | 1.34 ± 0.395 | 6.463 ± 0.627 | 0.788 ± 0.178 | 1.813 ± 0.32 | 3.625 ± 0.546 | 1.734 ± 0.473 | 3.31 ± 0.586 | 4.098 ± 0.628 | 2.759 ± 0.476 | 3.389 ± 0.513 | 0.552 ± 0.201 | 1.576 ± 0.303 | 0.0 ± 0.0 |
| Val | 6.62 ± 0.664 | 0.315 ± 0.133 | 2.995 ± 0.495 | 4.965 ± 0.712 | 1.813 ± 0.336 | 5.123 ± 0.528 | 0.867 ± 0.249 | 3.231 ± 0.537 | 2.68 ± 0.47 | 6.463 ± 0.918 | 1.261 ± 0.276 | 1.497 ± 0.293 | 2.837 ± 0.569 | 2.995 ± 0.553 | 3.074 ± 0.401 | 3.547 ± 0.367 | 4.098 ± 0.647 | 4.965 ± 0.63 | 1.261 ± 0.353 | 1.182 ± 0.341 | 0.0 ± 0.0 |
| Trp | 1.813 ± 0.471 | 0.158 ± 0.11 | 0.631 ± 0.21 | 0.236 ± 0.144 | 0.788 ± 0.225 | 1.025 ± 0.287 | 0.473 ± 0.181 | 0.946 ± 0.293 | 0.394 ± 0.161 | 2.128 ± 0.365 | 0.709 ± 0.215 | 0.473 ± 0.167 | 0.788 ± 0.391 | 0.946 ± 0.256 | 1.34 ± 0.246 | 0.709 ± 0.258 | 1.261 ± 0.296 | 0.631 ± 0.232 | 0.552 ± 0.214 | 0.236 ± 0.12 | 0.0 ± 0.0 |
| Tyr | 3.231 ± 0.447 | 0.079 ± 0.082 | 1.576 ± 0.318 | 1.576 ± 0.357 | 0.552 ± 0.208 | 2.286 ± 0.437 | 0.631 ± 0.291 | 1.182 ± 0.359 | 1.025 ± 0.264 | 1.892 ± 0.367 | 0.709 ± 0.306 | 1.103 ± 0.277 | 1.419 ± 0.373 | 0.867 ± 0.209 | 2.522 ± 0.47 | 1.182 ± 0.363 | 0.946 ± 0.272 | 1.97 ± 0.348 | 0.709 ± 0.19 | 0.394 ± 0.156 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 56 proteins (12689 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
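
A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the error is taken to be the sample standard deviation of those estimates (the page does not state which statistic is reported). The names `frequency_permille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics


def frequency_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide, counting overlapping pairs within each protein."""
    hits = 0
    total = 0
    for seq in proteins:
        n_pairs = max(len(seq) - 1, 0)
        total += n_pairs
        hits += sum(1 for i in range(n_pairs) if seq[i:i + 2] == dipeptide)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_permille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MAALK", "GAAV", "MKLLT", "AAAA"]
    mean = frequency_permille(toy_proteome, "AA")
    err = bootstrap_error(toy_proteome, "AA")
    print(f"AA: {mean:.3f} ± {err:.3f} ‰")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimated error reflects variation between proteins.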

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski