Amino acid dipepetide frequency for Pseudomonas phage vB_PaS_IME307

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.868AlaAla: 13.868 ± 1.412
1.694AlaCys: 1.694 ± 0.276
5.146AlaAsp: 5.146 ± 0.567
9.224AlaGlu: 9.224 ± 1.002
3.326AlaPhe: 3.326 ± 0.414
7.656AlaGly: 7.656 ± 0.687
1.067AlaHis: 1.067 ± 0.269
5.899AlaIle: 5.899 ± 0.647
5.961AlaLys: 5.961 ± 0.676
9.977AlaLeu: 9.977 ± 0.74
3.891AlaMet: 3.891 ± 0.564
2.698AlaAsn: 2.698 ± 0.412
3.577AlaPro: 3.577 ± 0.61
4.706AlaGln: 4.706 ± 0.585
5.961AlaArg: 5.961 ± 0.622
7.091AlaSer: 7.091 ± 0.648
4.079AlaThr: 4.079 ± 0.457
7.091AlaVal: 7.091 ± 0.659
2.134AlaTrp: 2.134 ± 0.383
2.761AlaTyr: 2.761 ± 0.38
0.0AlaXaa: 0.0 ± 0.0
Cys
1.255CysAla: 1.255 ± 0.302
0.314CysCys: 0.314 ± 0.155
1.506CysAsp: 1.506 ± 0.287
0.879CysGlu: 0.879 ± 0.192
0.314CysPhe: 0.314 ± 0.139
1.381CysGly: 1.381 ± 0.374
0.439CysHis: 0.439 ± 0.174
0.816CysIle: 0.816 ± 0.242
0.816CysLys: 0.816 ± 0.281
0.879CysLeu: 0.879 ± 0.249
0.377CysMet: 0.377 ± 0.159
0.377CysAsn: 0.377 ± 0.169
1.192CysPro: 1.192 ± 0.259
0.502CysGln: 0.502 ± 0.193
0.941CysArg: 0.941 ± 0.21
1.443CysSer: 1.443 ± 0.277
0.628CysThr: 0.628 ± 0.185
1.067CysVal: 1.067 ± 0.287
0.314CysTrp: 0.314 ± 0.143
0.063CysTyr: 0.063 ± 0.075
0.0CysXaa: 0.0 ± 0.0
Asp
6.15AspAla: 6.15 ± 0.642
1.004AspCys: 1.004 ± 0.28
3.64AspAsp: 3.64 ± 0.665
4.644AspGlu: 4.644 ± 0.6
2.887AspPhe: 2.887 ± 0.461
5.334AspGly: 5.334 ± 0.748
0.69AspHis: 0.69 ± 0.212
3.075AspIle: 3.075 ± 0.47
2.008AspLys: 2.008 ± 0.42
3.828AspLeu: 3.828 ± 0.541
1.632AspMet: 1.632 ± 0.327
1.381AspAsn: 1.381 ± 0.324
3.702AspPro: 3.702 ± 0.504
2.071AspGln: 2.071 ± 0.315
4.079AspArg: 4.079 ± 0.437
3.451AspSer: 3.451 ± 0.42
2.134AspThr: 2.134 ± 0.368
3.702AspVal: 3.702 ± 0.431
1.255AspTrp: 1.255 ± 0.271
2.008AspTyr: 2.008 ± 0.39
0.0AspXaa: 0.0 ± 0.0
Glu
7.279GluAla: 7.279 ± 0.874
1.004GluCys: 1.004 ± 0.276
3.2GluAsp: 3.2 ± 0.518
3.828GluGlu: 3.828 ± 0.612
3.012GluPhe: 3.012 ± 0.371
4.644GluGly: 4.644 ± 0.559
1.506GluHis: 1.506 ± 0.238
3.514GluIle: 3.514 ± 0.488
3.2GluLys: 3.2 ± 0.571
5.961GluLeu: 5.961 ± 0.621
2.071GluMet: 2.071 ± 0.375
1.883GluAsn: 1.883 ± 0.416
3.075GluPro: 3.075 ± 0.375
4.644GluGln: 4.644 ± 0.803
7.154GluArg: 7.154 ± 0.762
3.075GluSer: 3.075 ± 0.426
2.636GluThr: 2.636 ± 0.372
4.204GluVal: 4.204 ± 0.488
2.071GluTrp: 2.071 ± 0.404
2.322GluTyr: 2.322 ± 0.438
0.0GluXaa: 0.0 ± 0.0
Phe
3.138PheAla: 3.138 ± 0.435
0.628PheCys: 0.628 ± 0.188
2.447PheAsp: 2.447 ± 0.407
2.385PheGlu: 2.385 ± 0.415
1.318PhePhe: 1.318 ± 0.32
3.138PheGly: 3.138 ± 0.519
0.628PheHis: 0.628 ± 0.23
2.008PheIle: 2.008 ± 0.327
1.506PheLys: 1.506 ± 0.25
2.259PheLeu: 2.259 ± 0.401
0.753PheMet: 0.753 ± 0.266
1.067PheAsn: 1.067 ± 0.224
1.255PhePro: 1.255 ± 0.247
0.879PheGln: 0.879 ± 0.248
2.259PheArg: 2.259 ± 0.348
2.636PheSer: 2.636 ± 0.397
2.447PheThr: 2.447 ± 0.396
2.322PheVal: 2.322 ± 0.379
0.439PheTrp: 0.439 ± 0.174
0.879PheTyr: 0.879 ± 0.243
0.0PheXaa: 0.0 ± 0.0
Gly
6.463GlyAla: 6.463 ± 0.478
1.255GlyCys: 1.255 ± 0.303
3.702GlyAsp: 3.702 ± 0.495
5.459GlyGlu: 5.459 ± 0.578
3.326GlyPhe: 3.326 ± 0.514
6.212GlyGly: 6.212 ± 0.62
2.008GlyHis: 2.008 ± 0.356
5.083GlyIle: 5.083 ± 0.529
3.891GlyLys: 3.891 ± 0.407
5.773GlyLeu: 5.773 ± 0.651
2.887GlyMet: 2.887 ± 0.422
2.824GlyAsn: 2.824 ± 0.459
1.945GlyPro: 1.945 ± 0.339
2.51GlyGln: 2.51 ± 0.404
3.953GlyArg: 3.953 ± 0.428
4.204GlySer: 4.204 ± 0.511
3.577GlyThr: 3.577 ± 0.5
5.271GlyVal: 5.271 ± 0.48
2.008GlyTrp: 2.008 ± 0.33
2.259GlyTyr: 2.259 ± 0.407
0.0GlyXaa: 0.0 ± 0.0
His
1.255HisAla: 1.255 ± 0.311
0.314HisCys: 0.314 ± 0.144
1.13HisAsp: 1.13 ± 0.232
1.067HisGlu: 1.067 ± 0.242
0.502HisPhe: 0.502 ± 0.213
1.632HisGly: 1.632 ± 0.357
0.502HisHis: 0.502 ± 0.19
0.753HisIle: 0.753 ± 0.209
0.941HisLys: 0.941 ± 0.336
1.694HisLeu: 1.694 ± 0.318
0.251HisMet: 0.251 ± 0.128
0.565HisAsn: 0.565 ± 0.174
1.381HisPro: 1.381 ± 0.254
0.628HisGln: 0.628 ± 0.189
0.879HisArg: 0.879 ± 0.174
1.255HisSer: 1.255 ± 0.286
0.439HisThr: 0.439 ± 0.16
1.318HisVal: 1.318 ± 0.295
0.377HisTrp: 0.377 ± 0.164
0.565HisTyr: 0.565 ± 0.207
0.0HisXaa: 0.0 ± 0.0
Ile
5.773IleAla: 5.773 ± 0.681
0.69IleCys: 0.69 ± 0.217
3.702IleAsp: 3.702 ± 0.472
5.208IleGlu: 5.208 ± 0.644
1.82IlePhe: 1.82 ± 0.338
4.393IleGly: 4.393 ± 0.489
1.13IleHis: 1.13 ± 0.242
2.761IleIle: 2.761 ± 0.448
2.447IleLys: 2.447 ± 0.422
4.393IleLeu: 4.393 ± 0.498
0.816IleMet: 0.816 ± 0.217
2.385IleAsn: 2.385 ± 0.372
3.326IlePro: 3.326 ± 0.45
1.82IleGln: 1.82 ± 0.314
3.075IleArg: 3.075 ± 0.358
3.64IleSer: 3.64 ± 0.499
3.953IleThr: 3.953 ± 0.564
3.012IleVal: 3.012 ± 0.559
0.69IleTrp: 0.69 ± 0.207
1.381IleTyr: 1.381 ± 0.312
0.0IleXaa: 0.0 ± 0.0
Lys
6.087LysAla: 6.087 ± 0.784
1.13LysCys: 1.13 ± 0.324
2.447LysAsp: 2.447 ± 0.636
3.012LysGlu: 3.012 ± 0.481
0.879LysPhe: 0.879 ± 0.201
3.2LysGly: 3.2 ± 0.434
1.13LysHis: 1.13 ± 0.232
2.071LysIle: 2.071 ± 0.46
2.259LysLys: 2.259 ± 0.515
3.702LysLeu: 3.702 ± 0.519
1.13LysMet: 1.13 ± 0.255
1.82LysAsn: 1.82 ± 0.37
2.636LysPro: 2.636 ± 0.41
1.82LysGln: 1.82 ± 0.417
3.702LysArg: 3.702 ± 0.488
3.953LysSer: 3.953 ± 0.444
3.138LysThr: 3.138 ± 0.51
3.012LysVal: 3.012 ± 0.419
0.879LysTrp: 0.879 ± 0.208
0.941LysTyr: 0.941 ± 0.226
0.0LysXaa: 0.0 ± 0.0
Leu
7.593LeuAla: 7.593 ± 0.743
1.13LeuCys: 1.13 ± 0.311
5.899LeuAsp: 5.899 ± 0.641
5.208LeuGlu: 5.208 ± 0.522
2.134LeuPhe: 2.134 ± 0.377
6.15LeuGly: 6.15 ± 0.654
1.13LeuHis: 1.13 ± 0.324
4.832LeuIle: 4.832 ± 0.531
4.455LeuLys: 4.455 ± 0.446
5.836LeuLeu: 5.836 ± 0.741
2.134LeuMet: 2.134 ± 0.419
3.326LeuAsn: 3.326 ± 0.457
3.953LeuPro: 3.953 ± 0.621
2.949LeuGln: 2.949 ± 0.355
5.961LeuArg: 5.961 ± 0.56
5.208LeuSer: 5.208 ± 0.526
3.953LeuThr: 3.953 ± 0.553
4.581LeuVal: 4.581 ± 0.505
1.506LeuTrp: 1.506 ± 0.342
2.134LeuTyr: 2.134 ± 0.368
0.0LeuXaa: 0.0 ± 0.0
Met
3.138MetAla: 3.138 ± 0.385
0.126MetCys: 0.126 ± 0.085
1.255MetAsp: 1.255 ± 0.303
1.506MetGlu: 1.506 ± 0.379
1.004MetPhe: 1.004 ± 0.286
1.694MetGly: 1.694 ± 0.309
0.314MetHis: 0.314 ± 0.128
1.004MetIle: 1.004 ± 0.251
1.569MetLys: 1.569 ± 0.44
2.008MetLeu: 2.008 ± 0.365
0.565MetMet: 0.565 ± 0.193
1.318MetAsn: 1.318 ± 0.32
2.008MetPro: 2.008 ± 0.362
1.13MetGln: 1.13 ± 0.301
1.694MetArg: 1.694 ± 0.372
2.322MetSer: 2.322 ± 0.446
1.883MetThr: 1.883 ± 0.331
0.941MetVal: 0.941 ± 0.25
0.439MetTrp: 0.439 ± 0.176
0.753MetTyr: 0.753 ± 0.231
0.0MetXaa: 0.0 ± 0.0
Asn
4.079AsnAla: 4.079 ± 0.471
0.377AsnCys: 0.377 ± 0.149
1.694AsnAsp: 1.694 ± 0.302
2.134AsnGlu: 2.134 ± 0.343
1.067AsnPhe: 1.067 ± 0.328
2.698AsnGly: 2.698 ± 0.435
1.004AsnHis: 1.004 ± 0.238
1.381AsnIle: 1.381 ± 0.326
1.192AsnLys: 1.192 ± 0.254
2.259AsnLeu: 2.259 ± 0.377
0.628AsnMet: 0.628 ± 0.197
0.941AsnAsn: 0.941 ± 0.289
2.322AsnPro: 2.322 ± 0.342
1.381AsnGln: 1.381 ± 0.313
2.071AsnArg: 2.071 ± 0.333
2.447AsnSer: 2.447 ± 0.425
1.506AsnThr: 1.506 ± 0.433
2.322AsnVal: 2.322 ± 0.5
0.941AsnTrp: 0.941 ± 0.26
0.816AsnTyr: 0.816 ± 0.244
0.0AsnXaa: 0.0 ± 0.0
Pro
5.71ProAla: 5.71 ± 0.536
0.941ProCys: 0.941 ± 0.253
3.577ProAsp: 3.577 ± 0.518
3.577ProGlu: 3.577 ± 0.596
1.506ProPhe: 1.506 ± 0.398
3.514ProGly: 3.514 ± 0.496
0.565ProHis: 0.565 ± 0.175
1.82ProIle: 1.82 ± 0.306
2.322ProLys: 2.322 ± 0.485
3.765ProLeu: 3.765 ± 0.568
1.318ProMet: 1.318 ± 0.269
1.443ProAsn: 1.443 ± 0.273
1.694ProPro: 1.694 ± 0.314
1.381ProGln: 1.381 ± 0.312
1.694ProArg: 1.694 ± 0.33
3.451ProSer: 3.451 ± 0.463
2.071ProThr: 2.071 ± 0.351
4.016ProVal: 4.016 ± 0.571
0.439ProTrp: 0.439 ± 0.146
1.443ProTyr: 1.443 ± 0.364
0.0ProXaa: 0.0 ± 0.0
Gln
5.836GlnAla: 5.836 ± 0.638
0.816GlnCys: 0.816 ± 0.258
1.569GlnAsp: 1.569 ± 0.279
2.761GlnGlu: 2.761 ± 0.41
1.318GlnPhe: 1.318 ± 0.319
2.259GlnGly: 2.259 ± 0.398
0.753GlnHis: 0.753 ± 0.247
2.761GlnIle: 2.761 ± 0.427
2.008GlnLys: 2.008 ± 0.342
3.2GlnLeu: 3.2 ± 0.528
1.255GlnMet: 1.255 ± 0.285
1.443GlnAsn: 1.443 ± 0.272
1.318GlnPro: 1.318 ± 0.312
2.259GlnGln: 2.259 ± 0.581
2.573GlnArg: 2.573 ± 0.443
1.945GlnSer: 1.945 ± 0.324
1.443GlnThr: 1.443 ± 0.269
1.945GlnVal: 1.945 ± 0.3
0.879GlnTrp: 0.879 ± 0.183
1.757GlnTyr: 1.757 ± 0.374
0.0GlnXaa: 0.0 ± 0.0
Arg
6.401ArgAla: 6.401 ± 0.831
0.69ArgCys: 0.69 ± 0.224
4.142ArgAsp: 4.142 ± 0.554
4.895ArgGlu: 4.895 ± 0.57
2.51ArgPhe: 2.51 ± 0.443
4.267ArgGly: 4.267 ± 0.478
1.13ArgHis: 1.13 ± 0.214
4.581ArgIle: 4.581 ± 0.646
2.636ArgLys: 2.636 ± 0.348
5.648ArgLeu: 5.648 ± 0.635
1.883ArgMet: 1.883 ± 0.32
1.757ArgAsn: 1.757 ± 0.3
2.385ArgPro: 2.385 ± 0.366
2.698ArgGln: 2.698 ± 0.482
4.267ArgArg: 4.267 ± 0.61
3.2ArgSer: 3.2 ± 0.381
2.51ArgThr: 2.51 ± 0.385
3.953ArgVal: 3.953 ± 0.469
1.318ArgTrp: 1.318 ± 0.287
2.322ArgTyr: 2.322 ± 0.428
0.0ArgXaa: 0.0 ± 0.0
Ser
7.279SerAla: 7.279 ± 0.723
0.941SerCys: 0.941 ± 0.257
4.204SerAsp: 4.204 ± 0.493
3.64SerGlu: 3.64 ± 0.543
2.322SerPhe: 2.322 ± 0.348
5.961SerGly: 5.961 ± 0.783
0.753SerHis: 0.753 ± 0.297
3.702SerIle: 3.702 ± 0.531
3.326SerLys: 3.326 ± 0.521
5.146SerLeu: 5.146 ± 0.622
1.82SerMet: 1.82 ± 0.335
2.636SerAsn: 2.636 ± 0.454
2.259SerPro: 2.259 ± 0.377
1.757SerGln: 1.757 ± 0.281
3.577SerArg: 3.577 ± 0.529
3.451SerSer: 3.451 ± 0.482
3.577SerThr: 3.577 ± 0.579
4.33SerVal: 4.33 ± 0.639
0.941SerTrp: 0.941 ± 0.214
1.694SerTyr: 1.694 ± 0.375
0.0SerXaa: 0.0 ± 0.0
Thr
4.644ThrAla: 4.644 ± 0.487
0.816ThrCys: 0.816 ± 0.27
2.447ThrAsp: 2.447 ± 0.37
3.012ThrGlu: 3.012 ± 0.429
2.071ThrPhe: 2.071 ± 0.366
3.012ThrGly: 3.012 ± 0.489
1.13ThrHis: 1.13 ± 0.285
3.263ThrIle: 3.263 ± 0.432
2.573ThrLys: 2.573 ± 0.466
4.393ThrLeu: 4.393 ± 0.534
1.004ThrMet: 1.004 ± 0.223
1.694ThrAsn: 1.694 ± 0.362
2.447ThrPro: 2.447 ± 0.366
2.322ThrGln: 2.322 ± 0.375
1.883ThrArg: 1.883 ± 0.477
3.012ThrSer: 3.012 ± 0.654
3.075ThrThr: 3.075 ± 0.474
3.953ThrVal: 3.953 ± 0.481
0.753ThrTrp: 0.753 ± 0.219
1.757ThrTyr: 1.757 ± 0.335
0.0ThrXaa: 0.0 ± 0.0
Val
6.463ValAla: 6.463 ± 0.547
1.13ValCys: 1.13 ± 0.316
4.267ValAsp: 4.267 ± 0.547
5.02ValGlu: 5.02 ± 0.596
1.945ValPhe: 1.945 ± 0.426
4.33ValGly: 4.33 ± 0.447
0.628ValHis: 0.628 ± 0.214
4.455ValIle: 4.455 ± 0.433
3.765ValLys: 3.765 ± 0.591
4.706ValLeu: 4.706 ± 0.641
0.941ValMet: 0.941 ± 0.257
2.51ValAsn: 2.51 ± 0.486
3.012ValPro: 3.012 ± 0.443
2.447ValGln: 2.447 ± 0.379
3.891ValArg: 3.891 ± 0.474
4.706ValSer: 4.706 ± 0.52
3.64ValThr: 3.64 ± 0.465
4.706ValVal: 4.706 ± 0.658
1.004ValTrp: 1.004 ± 0.241
1.632ValTyr: 1.632 ± 0.393
0.0ValXaa: 0.0 ± 0.0
Trp
1.757TrpAla: 1.757 ± 0.326
0.314TrpCys: 0.314 ± 0.149
0.69TrpAsp: 0.69 ± 0.186
1.318TrpGlu: 1.318 ± 0.213
0.502TrpPhe: 0.502 ± 0.2
1.067TrpGly: 1.067 ± 0.237
0.565TrpHis: 0.565 ± 0.178
1.192TrpIle: 1.192 ± 0.244
1.13TrpLys: 1.13 ± 0.265
2.447TrpLeu: 2.447 ± 0.362
0.565TrpMet: 0.565 ± 0.202
0.69TrpAsn: 0.69 ± 0.196
1.067TrpPro: 1.067 ± 0.264
0.69TrpGln: 0.69 ± 0.286
1.443TrpArg: 1.443 ± 0.311
1.443TrpSer: 1.443 ± 0.315
1.067TrpThr: 1.067 ± 0.236
1.506TrpVal: 1.506 ± 0.265
0.565TrpTrp: 0.565 ± 0.219
0.188TrpTyr: 0.188 ± 0.099
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.64TyrAla: 3.64 ± 0.538
0.188TyrCys: 0.188 ± 0.106
2.008TyrAsp: 2.008 ± 0.336
1.506TyrGlu: 1.506 ± 0.294
0.628TyrPhe: 0.628 ± 0.22
2.071TyrGly: 2.071 ± 0.354
0.377TyrHis: 0.377 ± 0.136
1.443TyrIle: 1.443 ± 0.268
0.941TyrLys: 0.941 ± 0.248
2.196TyrLeu: 2.196 ± 0.357
0.69TyrMet: 0.69 ± 0.211
0.502TyrAsn: 0.502 ± 0.209
1.694TyrPro: 1.694 ± 0.441
1.506TyrGln: 1.506 ± 0.253
2.134TyrArg: 2.134 ± 0.399
1.443TyrSer: 1.443 ± 0.244
1.506TyrThr: 1.506 ± 0.462
1.945TyrVal: 1.945 ± 0.353
1.255TyrTrp: 1.255 ± 0.255
0.753TyrTyr: 0.753 ± 0.207
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 97 proteins (15937 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski