Amino acid dipepetide frequency for Pseudomonas phage YMC11/07/P54_PAE_BP

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.221AlaAla: 14.221 ± 1.587
1.257AlaCys: 1.257 ± 0.256
5.887AlaAsp: 5.887 ± 0.72
7.938AlaGlu: 7.938 ± 0.76
2.381AlaPhe: 2.381 ± 0.432
8.467AlaGly: 8.467 ± 0.844
2.381AlaHis: 2.381 ± 0.433
6.085AlaIle: 6.085 ± 0.649
4.63AlaLys: 4.63 ± 0.581
9.128AlaLeu: 9.128 ± 0.898
3.704AlaMet: 3.704 ± 0.531
3.572AlaAsn: 3.572 ± 0.586
4.3AlaPro: 4.3 ± 0.645
5.159AlaGln: 5.159 ± 0.794
6.416AlaArg: 6.416 ± 0.974
6.218AlaSer: 6.218 ± 0.697
6.747AlaThr: 6.747 ± 1.078
6.085AlaVal: 6.085 ± 0.746
2.646AlaTrp: 2.646 ± 0.392
3.175AlaTyr: 3.175 ± 0.551
0.0AlaXaa: 0.0 ± 0.0
Cys
0.728CysAla: 0.728 ± 0.209
0.066CysCys: 0.066 ± 0.064
0.86CysAsp: 0.86 ± 0.216
0.265CysGlu: 0.265 ± 0.141
0.132CysPhe: 0.132 ± 0.09
1.654CysGly: 1.654 ± 0.316
0.198CysHis: 0.198 ± 0.108
0.529CysIle: 0.529 ± 0.17
0.463CysLys: 0.463 ± 0.16
0.926CysLeu: 0.926 ± 0.21
0.198CysMet: 0.198 ± 0.113
0.661CysAsn: 0.661 ± 0.249
0.992CysPro: 0.992 ± 0.249
0.463CysGln: 0.463 ± 0.169
0.595CysArg: 0.595 ± 0.191
0.661CysSer: 0.661 ± 0.211
0.728CysThr: 0.728 ± 0.221
0.265CysVal: 0.265 ± 0.178
0.198CysTrp: 0.198 ± 0.109
0.132CysTyr: 0.132 ± 0.077
0.0CysXaa: 0.0 ± 0.0
Asp
6.482AspAla: 6.482 ± 0.662
0.728AspCys: 0.728 ± 0.236
3.506AspAsp: 3.506 ± 0.385
3.638AspGlu: 3.638 ± 0.529
1.984AspPhe: 1.984 ± 0.322
5.887AspGly: 5.887 ± 0.705
1.191AspHis: 1.191 ± 0.303
3.043AspIle: 3.043 ± 0.343
2.381AspLys: 2.381 ± 0.399
5.755AspLeu: 5.755 ± 0.579
1.588AspMet: 1.588 ± 0.321
1.521AspAsn: 1.521 ± 0.294
3.241AspPro: 3.241 ± 0.318
3.572AspGln: 3.572 ± 0.399
3.969AspArg: 3.969 ± 0.556
3.109AspSer: 3.109 ± 0.475
2.844AspThr: 2.844 ± 0.48
5.093AspVal: 5.093 ± 0.568
1.124AspTrp: 1.124 ± 0.243
1.389AspTyr: 1.389 ± 0.286
0.0AspXaa: 0.0 ± 0.0
Glu
6.35GluAla: 6.35 ± 0.837
0.992GluCys: 0.992 ± 0.265
3.44GluAsp: 3.44 ± 0.448
3.704GluGlu: 3.704 ± 0.581
2.58GluPhe: 2.58 ± 0.458
3.903GluGly: 3.903 ± 0.437
1.389GluHis: 1.389 ± 0.352
4.167GluIle: 4.167 ± 0.59
2.447GluLys: 2.447 ± 0.39
5.358GluLeu: 5.358 ± 0.609
1.588GluMet: 1.588 ± 0.301
2.117GluAsn: 2.117 ± 0.469
3.373GluPro: 3.373 ± 0.509
3.175GluGln: 3.175 ± 0.367
5.622GluArg: 5.622 ± 0.745
3.373GluSer: 3.373 ± 0.394
2.977GluThr: 2.977 ± 0.38
4.3GluVal: 4.3 ± 0.463
0.794GluTrp: 0.794 ± 0.244
1.786GluTyr: 1.786 ± 0.288
0.0GluXaa: 0.0 ± 0.0
Phe
2.315PheAla: 2.315 ± 0.347
0.265PheCys: 0.265 ± 0.126
2.844PheAsp: 2.844 ± 0.45
2.249PheGlu: 2.249 ± 0.437
0.463PhePhe: 0.463 ± 0.179
2.315PheGly: 2.315 ± 0.449
0.529PheHis: 0.529 ± 0.194
1.455PheIle: 1.455 ± 0.223
1.191PheLys: 1.191 ± 0.277
2.646PheLeu: 2.646 ± 0.427
0.595PheMet: 0.595 ± 0.215
1.257PheAsn: 1.257 ± 0.357
0.992PhePro: 0.992 ± 0.208
1.124PheGln: 1.124 ± 0.258
1.455PheArg: 1.455 ± 0.297
2.051PheSer: 2.051 ± 0.373
1.852PheThr: 1.852 ± 0.322
1.918PheVal: 1.918 ± 0.339
0.331PheTrp: 0.331 ± 0.122
0.794PheTyr: 0.794 ± 0.189
0.0PheXaa: 0.0 ± 0.0
Gly
8.533GlyAla: 8.533 ± 1.082
1.191GlyCys: 1.191 ± 0.353
4.498GlyAsp: 4.498 ± 0.518
4.763GlyGlu: 4.763 ± 0.503
2.315GlyPhe: 2.315 ± 0.432
6.548GlyGly: 6.548 ± 0.858
1.786GlyHis: 1.786 ± 0.291
3.175GlyIle: 3.175 ± 0.394
3.241GlyLys: 3.241 ± 0.48
8.467GlyLeu: 8.467 ± 0.654
2.381GlyMet: 2.381 ± 0.356
2.91GlyAsn: 2.91 ± 0.738
2.58GlyPro: 2.58 ± 0.454
3.572GlyGln: 3.572 ± 0.43
5.49GlyArg: 5.49 ± 0.621
3.638GlySer: 3.638 ± 0.734
4.432GlyThr: 4.432 ± 0.614
5.622GlyVal: 5.622 ± 0.58
1.124GlyTrp: 1.124 ± 0.264
3.307GlyTyr: 3.307 ± 0.418
0.0GlyXaa: 0.0 ± 0.0
His
2.381HisAla: 2.381 ± 0.398
0.331HisCys: 0.331 ± 0.129
1.455HisAsp: 1.455 ± 0.306
1.191HisGlu: 1.191 ± 0.322
0.661HisPhe: 0.661 ± 0.213
1.786HisGly: 1.786 ± 0.361
1.058HisHis: 1.058 ± 0.384
0.463HisIle: 0.463 ± 0.175
0.595HisLys: 0.595 ± 0.162
1.786HisLeu: 1.786 ± 0.377
0.529HisMet: 0.529 ± 0.182
0.529HisAsn: 0.529 ± 0.18
1.72HisPro: 1.72 ± 0.324
0.926HisGln: 0.926 ± 0.292
1.455HisArg: 1.455 ± 0.372
0.661HisSer: 0.661 ± 0.222
1.058HisThr: 1.058 ± 0.305
1.323HisVal: 1.323 ± 0.31
0.198HisTrp: 0.198 ± 0.129
0.265HisTyr: 0.265 ± 0.137
0.0HisXaa: 0.0 ± 0.0
Ile
5.424IleAla: 5.424 ± 0.522
0.397IleCys: 0.397 ± 0.169
4.3IleAsp: 4.3 ± 0.536
3.44IleGlu: 3.44 ± 0.436
1.058IlePhe: 1.058 ± 0.267
4.167IleGly: 4.167 ± 0.589
0.926IleHis: 0.926 ± 0.24
2.447IleIle: 2.447 ± 0.388
2.315IleLys: 2.315 ± 0.286
3.44IleLeu: 3.44 ± 0.527
0.86IleMet: 0.86 ± 0.23
2.051IleAsn: 2.051 ± 0.303
2.646IlePro: 2.646 ± 0.547
1.521IleGln: 1.521 ± 0.294
3.109IleArg: 3.109 ± 0.475
2.844IleSer: 2.844 ± 0.504
3.175IleThr: 3.175 ± 0.368
3.506IleVal: 3.506 ± 0.401
0.86IleTrp: 0.86 ± 0.196
1.455IleTyr: 1.455 ± 0.332
0.0IleXaa: 0.0 ± 0.0
Lys
3.77LysAla: 3.77 ± 0.62
0.198LysCys: 0.198 ± 0.127
2.117LysAsp: 2.117 ± 0.343
2.58LysGlu: 2.58 ± 0.474
0.86LysPhe: 0.86 ± 0.243
3.44LysGly: 3.44 ± 0.515
0.794LysHis: 0.794 ± 0.217
1.654LysIle: 1.654 ± 0.307
1.588LysLys: 1.588 ± 0.358
3.77LysLeu: 3.77 ± 0.714
0.595LysMet: 0.595 ± 0.175
1.191LysAsn: 1.191 ± 0.297
2.91LysPro: 2.91 ± 0.62
1.72LysGln: 1.72 ± 0.29
3.506LysArg: 3.506 ± 0.479
1.72LysSer: 1.72 ± 0.271
2.514LysThr: 2.514 ± 0.422
3.175LysVal: 3.175 ± 0.554
0.595LysTrp: 0.595 ± 0.183
0.992LysTyr: 0.992 ± 0.251
0.0LysXaa: 0.0 ± 0.0
Leu
8.599LeuAla: 8.599 ± 0.985
0.926LeuCys: 0.926 ± 0.279
6.019LeuAsp: 6.019 ± 0.643
4.763LeuGlu: 4.763 ± 0.528
2.315LeuPhe: 2.315 ± 0.408
6.35LeuGly: 6.35 ± 0.742
1.521LeuHis: 1.521 ± 0.335
4.167LeuIle: 4.167 ± 0.436
3.572LeuLys: 3.572 ± 0.421
6.615LeuLeu: 6.615 ± 0.618
1.654LeuMet: 1.654 ± 0.269
3.44LeuAsn: 3.44 ± 0.552
4.233LeuPro: 4.233 ± 0.455
3.373LeuGln: 3.373 ± 0.438
7.012LeuArg: 7.012 ± 0.732
5.556LeuSer: 5.556 ± 0.678
5.292LeuThr: 5.292 ± 0.536
5.556LeuVal: 5.556 ± 0.683
0.595LeuTrp: 0.595 ± 0.209
2.447LeuTyr: 2.447 ± 0.269
0.0LeuXaa: 0.0 ± 0.0
Met
3.506MetAla: 3.506 ± 0.491
0.132MetCys: 0.132 ± 0.09
1.389MetAsp: 1.389 ± 0.295
1.455MetGlu: 1.455 ± 0.344
0.794MetPhe: 0.794 ± 0.203
1.323MetGly: 1.323 ± 0.318
0.331MetHis: 0.331 ± 0.139
1.521MetIle: 1.521 ± 0.344
1.389MetLys: 1.389 ± 0.312
2.646MetLeu: 2.646 ± 0.341
0.198MetMet: 0.198 ± 0.093
1.058MetAsn: 1.058 ± 0.29
1.323MetPro: 1.323 ± 0.304
0.926MetGln: 0.926 ± 0.268
2.051MetArg: 2.051 ± 0.321
1.588MetSer: 1.588 ± 0.353
2.447MetThr: 2.447 ± 0.373
1.455MetVal: 1.455 ± 0.285
0.265MetTrp: 0.265 ± 0.14
0.529MetTyr: 0.529 ± 0.203
0.0MetXaa: 0.0 ± 0.0
Asn
3.373AsnAla: 3.373 ± 0.444
0.463AsnCys: 0.463 ± 0.181
2.249AsnAsp: 2.249 ± 0.355
2.117AsnGlu: 2.117 ± 0.554
1.257AsnPhe: 1.257 ± 0.295
3.836AsnGly: 3.836 ± 0.489
0.926AsnHis: 0.926 ± 0.243
1.323AsnIle: 1.323 ± 0.294
1.521AsnLys: 1.521 ± 0.431
2.91AsnLeu: 2.91 ± 0.464
0.463AsnMet: 0.463 ± 0.189
0.992AsnAsn: 0.992 ± 0.31
1.72AsnPro: 1.72 ± 0.382
1.058AsnGln: 1.058 ± 0.25
2.381AsnArg: 2.381 ± 0.385
2.844AsnSer: 2.844 ± 0.433
1.72AsnThr: 1.72 ± 0.364
1.852AsnVal: 1.852 ± 0.485
0.463AsnTrp: 0.463 ± 0.23
0.992AsnTyr: 0.992 ± 0.227
0.0AsnXaa: 0.0 ± 0.0
Pro
6.218ProAla: 6.218 ± 0.652
0.529ProCys: 0.529 ± 0.199
3.307ProAsp: 3.307 ± 0.471
4.035ProGlu: 4.035 ± 0.5
1.389ProPhe: 1.389 ± 0.373
4.101ProGly: 4.101 ± 0.567
0.595ProHis: 0.595 ± 0.179
2.051ProIle: 2.051 ± 0.326
1.455ProLys: 1.455 ± 0.273
4.564ProLeu: 4.564 ± 0.661
1.257ProMet: 1.257 ± 0.327
1.72ProAsn: 1.72 ± 0.307
1.786ProPro: 1.786 ± 0.307
1.72ProGln: 1.72 ± 0.301
2.712ProArg: 2.712 ± 0.469
2.91ProSer: 2.91 ± 0.451
3.506ProThr: 3.506 ± 0.563
3.307ProVal: 3.307 ± 0.492
0.86ProTrp: 0.86 ± 0.274
1.257ProTyr: 1.257 ± 0.242
0.0ProXaa: 0.0 ± 0.0
Gln
6.152GlnAla: 6.152 ± 0.961
0.529GlnCys: 0.529 ± 0.167
1.455GlnAsp: 1.455 ± 0.334
2.249GlnGlu: 2.249 ± 0.335
1.323GlnPhe: 1.323 ± 0.303
2.381GlnGly: 2.381 ± 0.368
0.661GlnHis: 0.661 ± 0.162
2.514GlnIle: 2.514 ± 0.439
1.191GlnLys: 1.191 ± 0.25
2.844GlnLeu: 2.844 ± 0.484
1.455GlnMet: 1.455 ± 0.274
1.058GlnAsn: 1.058 ± 0.249
2.447GlnPro: 2.447 ± 0.476
3.307GlnGln: 3.307 ± 0.529
3.704GlnArg: 3.704 ± 0.52
1.786GlnSer: 1.786 ± 0.277
2.58GlnThr: 2.58 ± 0.431
3.109GlnVal: 3.109 ± 0.557
0.661GlnTrp: 0.661 ± 0.209
1.323GlnTyr: 1.323 ± 0.294
0.0GlnXaa: 0.0 ± 0.0
Arg
8.202ArgAla: 8.202 ± 0.765
0.86ArgCys: 0.86 ± 0.225
4.696ArgAsp: 4.696 ± 0.528
4.763ArgGlu: 4.763 ± 0.705
2.249ArgPhe: 2.249 ± 0.336
4.696ArgGly: 4.696 ± 0.56
1.918ArgHis: 1.918 ± 0.321
4.167ArgIle: 4.167 ± 0.462
2.91ArgLys: 2.91 ± 0.432
6.019ArgLeu: 6.019 ± 0.8
2.514ArgMet: 2.514 ± 0.385
2.183ArgAsn: 2.183 ± 0.361
3.704ArgPro: 3.704 ± 0.572
2.514ArgGln: 2.514 ± 0.403
5.689ArgArg: 5.689 ± 0.701
3.175ArgSer: 3.175 ± 0.433
2.778ArgThr: 2.778 ± 0.365
4.3ArgVal: 4.3 ± 0.417
1.588ArgTrp: 1.588 ± 0.374
2.183ArgTyr: 2.183 ± 0.305
0.0ArgXaa: 0.0 ± 0.0
Ser
5.689SerAla: 5.689 ± 0.637
0.529SerCys: 0.529 ± 0.185
2.91SerAsp: 2.91 ± 0.351
3.969SerGlu: 3.969 ± 0.649
1.852SerPhe: 1.852 ± 0.42
5.159SerGly: 5.159 ± 0.638
0.86SerHis: 0.86 ± 0.23
2.91SerIle: 2.91 ± 0.451
2.183SerLys: 2.183 ± 0.396
5.027SerLeu: 5.027 ± 0.506
2.249SerMet: 2.249 ± 0.294
2.051SerAsn: 2.051 ± 0.349
3.704SerPro: 3.704 ± 0.688
2.646SerGln: 2.646 ± 0.351
3.175SerArg: 3.175 ± 0.364
4.035SerSer: 4.035 ± 0.732
3.307SerThr: 3.307 ± 0.559
3.572SerVal: 3.572 ± 0.492
0.794SerTrp: 0.794 ± 0.263
1.124SerTyr: 1.124 ± 0.284
0.0SerXaa: 0.0 ± 0.0
Thr
7.342ThrAla: 7.342 ± 0.859
0.397ThrCys: 0.397 ± 0.16
4.167ThrAsp: 4.167 ± 0.48
3.175ThrGlu: 3.175 ± 0.424
1.984ThrPhe: 1.984 ± 0.403
5.226ThrGly: 5.226 ± 0.614
0.794ThrHis: 0.794 ± 0.262
3.109ThrIle: 3.109 ± 0.521
1.852ThrLys: 1.852 ± 0.385
4.035ThrLeu: 4.035 ± 0.497
1.588ThrMet: 1.588 ± 0.304
1.918ThrAsn: 1.918 ± 0.35
2.778ThrPro: 2.778 ± 0.337
2.051ThrGln: 2.051 ± 0.385
3.506ThrArg: 3.506 ± 0.608
3.506ThrSer: 3.506 ± 0.66
3.241ThrThr: 3.241 ± 0.427
5.424ThrVal: 5.424 ± 0.818
0.728ThrTrp: 0.728 ± 0.199
1.521ThrTyr: 1.521 ± 0.338
0.0ThrXaa: 0.0 ± 0.0
Val
7.342ValAla: 7.342 ± 0.726
0.331ValCys: 0.331 ± 0.149
4.233ValAsp: 4.233 ± 0.591
4.961ValGlu: 4.961 ± 0.724
1.786ValPhe: 1.786 ± 0.3
5.226ValGly: 5.226 ± 0.572
1.323ValHis: 1.323 ± 0.279
3.307ValIle: 3.307 ± 0.487
3.704ValLys: 3.704 ± 0.483
4.432ValLeu: 4.432 ± 0.614
1.588ValMet: 1.588 ± 0.296
2.91ValAsn: 2.91 ± 0.419
3.043ValPro: 3.043 ± 0.452
2.249ValGln: 2.249 ± 0.389
4.696ValArg: 4.696 ± 0.659
4.763ValSer: 4.763 ± 0.685
4.763ValThr: 4.763 ± 0.645
4.895ValVal: 4.895 ± 0.771
1.058ValTrp: 1.058 ± 0.294
1.588ValTyr: 1.588 ± 0.257
0.0ValXaa: 0.0 ± 0.0
Trp
1.389TrpAla: 1.389 ± 0.301
0.132TrpCys: 0.132 ± 0.093
1.124TrpAsp: 1.124 ± 0.288
0.86TrpGlu: 0.86 ± 0.221
0.728TrpPhe: 0.728 ± 0.244
0.728TrpGly: 0.728 ± 0.219
0.529TrpHis: 0.529 ± 0.242
0.728TrpIle: 0.728 ± 0.215
0.661TrpLys: 0.661 ± 0.201
1.389TrpLeu: 1.389 ± 0.262
0.463TrpMet: 0.463 ± 0.17
0.331TrpAsn: 0.331 ± 0.142
0.728TrpPro: 0.728 ± 0.257
0.463TrpGln: 0.463 ± 0.217
1.323TrpArg: 1.323 ± 0.281
1.191TrpSer: 1.191 ± 0.289
0.661TrpThr: 0.661 ± 0.199
1.588TrpVal: 1.588 ± 0.435
0.132TrpTrp: 0.132 ± 0.097
0.529TrpTyr: 0.529 ± 0.186
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.381TyrAla: 2.381 ± 0.37
0.463TyrCys: 0.463 ± 0.205
1.588TyrAsp: 1.588 ± 0.3
1.389TyrGlu: 1.389 ± 0.282
0.529TyrPhe: 0.529 ± 0.185
2.381TyrGly: 2.381 ± 0.369
0.661TyrHis: 0.661 ± 0.245
1.124TyrIle: 1.124 ± 0.263
0.397TyrLys: 0.397 ± 0.148
2.249TyrLeu: 2.249 ± 0.377
0.794TyrMet: 0.794 ± 0.21
0.992TyrAsn: 0.992 ± 0.269
1.058TyrPro: 1.058 ± 0.317
1.191TyrGln: 1.191 ± 0.271
3.241TyrArg: 3.241 ± 0.481
2.117TyrSer: 2.117 ± 0.423
1.72TyrThr: 1.72 ± 0.317
1.852TyrVal: 1.852 ± 0.485
0.661TyrTrp: 0.661 ± 0.241
0.529TyrTyr: 0.529 ± 0.18
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (15119 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski