Amino acid dipeptide frequency for Pseudomonas phage vB_PsyM_KIL4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
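The calculation described above can be sketched roughly as follows. This is a minimal illustration, not the code used to build the table: the function names, the toy sequences, and the choice to normalise by the number of dipeptides counted are all assumptions, and the database may normalise differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands in for Xaa (assumed mapping)
UNIFORM_PERMILLE = 1000.0 / 400     # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard residues is treated as X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(ALPHABET, repeat=2)}


def classify(value_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"    # would be marked red in the table
    if value_permille < expected:
        return "under"   # would be marked blue in the table
    return "expected"


toy = ["AAAC", "ACA"]               # hypothetical sequences, not phage data
freqs = dipeptide_permille(toy)
print(freqs["AA"], classify(freqs["AA"]))
print(6.706 * 1e-3)                 # a per mille value converted to a fraction
```

With real data, `proteins` would be the list of protein sequences of the proteome; the values 6.706‰ (AlaAla) and 0.809‰ (AlaCys) from the table fall above and below the 2.5‰ uniform expectation, respectively.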

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.706 ± 0.703
AlaCys: 0.809 ± 0.171
AlaAsp: 4.625 ± 0.388
AlaGlu: 4.702 ± 0.393
AlaPhe: 3.045 ± 0.365
AlaGly: 5.049 ± 0.451
AlaHis: 1.272 ± 0.205
AlaIle: 4.625 ± 0.421
AlaLys: 4.741 ± 0.4
AlaLeu: 6.282 ± 0.786
AlaMet: 2.081 ± 0.303
AlaAsn: 3.315 ± 0.371
AlaPro: 2.467 ± 0.34
AlaGln: 3.661 ± 0.302
AlaArg: 4.162 ± 0.472
AlaSer: 4.856 ± 0.47
AlaThr: 5.242 ± 0.559
AlaVal: 5.126 ± 0.584
AlaTrp: 1.31 ± 0.222
AlaTyr: 2.274 ± 0.248
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.809 ± 0.191
CysCys: 0.27 ± 0.095
CysAsp: 0.694 ± 0.22
CysGlu: 0.771 ± 0.194
CysPhe: 0.578 ± 0.16
CysGly: 1.002 ± 0.199
CysHis: 0.424 ± 0.138
CysIle: 0.771 ± 0.165
CysLys: 0.925 ± 0.211
CysLeu: 0.886 ± 0.187
CysMet: 0.462 ± 0.137
CysAsn: 0.694 ± 0.183
CysPro: 0.655 ± 0.18
CysGln: 0.424 ± 0.12
CysArg: 0.617 ± 0.154
CysSer: 1.041 ± 0.178
CysThr: 0.848 ± 0.168
CysVal: 1.041 ± 0.186
CysTrp: 0.154 ± 0.076
CysTyr: 0.617 ± 0.169
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.931 ± 0.384
AspCys: 1.041 ± 0.202
AspAsp: 3.083 ± 0.36
AspGlu: 4.008 ± 0.453
AspPhe: 2.736 ± 0.325
AspGly: 5.126 ± 0.508
AspHis: 1.156 ± 0.203
AspIle: 2.775 ± 0.313
AspLys: 4.471 ± 0.491
AspLeu: 5.473 ± 0.48
AspMet: 1.85 ± 0.281
AspAsn: 2.428 ± 0.368
AspPro: 1.85 ± 0.311
AspGln: 1.773 ± 0.251
AspArg: 2.775 ± 0.329
AspSer: 3.392 ± 0.464
AspThr: 3.43 ± 0.357
AspVal: 3.816 ± 0.442
AspTrp: 1.503 ± 0.234
AspTyr: 2.39 ± 0.321
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.974 ± 0.648
GluCys: 1.041 ± 0.233
GluAsp: 3.97 ± 0.469
GluGlu: 5.511 ± 0.557
GluPhe: 3.469 ± 0.383
GluGly: 3.546 ± 0.446
GluHis: 1.503 ± 0.24
GluIle: 4.162 ± 0.363
GluLys: 3.931 ± 0.396
GluLeu: 5.897 ± 0.611
GluMet: 2.467 ± 0.319
GluAsn: 2.505 ± 0.278
GluPro: 1.734 ± 0.268
GluGln: 2.659 ± 0.344
GluArg: 3.739 ± 0.405
GluSer: 4.008 ± 0.387
GluThr: 3.083 ± 0.352
GluVal: 4.625 ± 0.362
GluTrp: 1.542 ± 0.23
GluTyr: 2.621 ± 0.276
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.428 ± 0.329
PheCys: 0.694 ± 0.164
PheAsp: 2.775 ± 0.394
PheGlu: 3.16 ± 0.351
PhePhe: 1.233 ± 0.229
PheGly: 2.891 ± 0.32
PheHis: 0.771 ± 0.151
PheIle: 2.004 ± 0.264
PheLys: 2.467 ± 0.329
PheLeu: 3.083 ± 0.36
PheMet: 1.079 ± 0.178
PheAsn: 2.081 ± 0.282
PhePro: 1.387 ± 0.217
PheGln: 1.465 ± 0.215
PheArg: 1.811 ± 0.29
PheSer: 3.083 ± 0.324
PheThr: 2.659 ± 0.32
PheVal: 2.621 ± 0.338
PheTrp: 0.27 ± 0.146
PheTyr: 1.002 ± 0.177
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.317 ± 0.506
GlyCys: 1.156 ± 0.186
GlyAsp: 3.7 ± 0.398
GlyGlu: 4.085 ± 0.37
GlyPhe: 3.083 ± 0.318
GlyGly: 5.396 ± 0.612
GlyHis: 1.31 ± 0.228
GlyIle: 3.931 ± 0.489
GlyLys: 5.897 ± 0.489
GlyLeu: 7.092 ± 0.492
GlyMet: 2.158 ± 0.251
GlyAsn: 3.546 ± 0.435
GlyPro: 1.118 ± 0.227
GlyGln: 1.966 ± 0.292
GlyArg: 3.507 ± 0.357
GlySer: 5.087 ± 0.569
GlyThr: 3.97 ± 0.507
GlyVal: 6.244 ± 0.426
GlyTrp: 1.465 ± 0.238
GlyTyr: 3.854 ± 0.419
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.118 ± 0.228
HisCys: 0.27 ± 0.089
HisAsp: 1.002 ± 0.176
HisGlu: 1.156 ± 0.194
HisPhe: 0.809 ± 0.187
HisGly: 1.58 ± 0.284
HisHis: 0.462 ± 0.163
HisIle: 1.041 ± 0.219
HisLys: 1.349 ± 0.206
HisLeu: 1.349 ± 0.242
HisMet: 0.732 ± 0.179
HisAsn: 1.156 ± 0.186
HisPro: 0.848 ± 0.17
HisGln: 0.54 ± 0.155
HisArg: 1.31 ± 0.203
HisSer: 1.503 ± 0.199
HisThr: 1.118 ± 0.188
HisVal: 1.079 ± 0.182
HisTrp: 0.385 ± 0.13
HisTyr: 0.694 ± 0.166
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.854 ± 0.384
IleCys: 0.848 ± 0.198
IleAsp: 3.816 ± 0.386
IleGlu: 4.432 ± 0.421
IlePhe: 1.657 ± 0.277
IleGly: 3.661 ± 0.433
IleHis: 1.272 ± 0.209
IleIle: 2.852 ± 0.328
IleLys: 4.085 ± 0.335
IleLeu: 4.471 ± 0.42
IleMet: 1.195 ± 0.188
IleAsn: 3.122 ± 0.329
IlePro: 2.235 ± 0.27
IleGln: 2.12 ± 0.31
IleArg: 3.276 ± 0.351
IleSer: 4.085 ± 0.488
IleThr: 4.124 ± 0.323
IleVal: 3.276 ± 0.374
IleTrp: 0.732 ± 0.164
IleTyr: 1.426 ± 0.24
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.051 ± 0.602
LysCys: 0.655 ± 0.162
LysAsp: 4.008 ± 0.469
LysGlu: 4.972 ± 0.534
LysPhe: 2.274 ± 0.271
LysGly: 4.471 ± 0.445
LysHis: 1.118 ± 0.217
LysIle: 3.816 ± 0.345
LysLys: 3.276 ± 0.363
LysLeu: 4.972 ± 0.442
LysMet: 2.621 ± 0.29
LysAsn: 2.929 ± 0.352
LysPro: 2.582 ± 0.392
LysGln: 2.621 ± 0.263
LysArg: 3.469 ± 0.347
LysSer: 4.047 ± 0.437
LysThr: 3.931 ± 0.425
LysVal: 5.627 ± 0.476
LysTrp: 1.002 ± 0.188
LysTyr: 2.467 ± 0.354
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.284 ± 0.457
LeuCys: 1.002 ± 0.205
LeuAsp: 5.319 ± 0.434
LeuGlu: 6.398 ± 0.54
LeuPhe: 3.083 ± 0.397
LeuGly: 6.167 ± 0.676
LeuHis: 1.118 ± 0.232
LeuIle: 4.278 ± 0.427
LeuLys: 5.434 ± 0.48
LeuLeu: 5.55 ± 0.535
LeuMet: 2.312 ± 0.249
LeuAsn: 4.047 ± 0.46
LeuPro: 3.777 ± 0.339
LeuGln: 3.739 ± 0.36
LeuArg: 3.739 ± 0.367
LeuSer: 5.511 ± 0.474
LeuThr: 4.471 ± 0.512
LeuVal: 4.625 ± 0.437
LeuTrp: 1.272 ± 0.22
LeuTyr: 2.467 ± 0.308
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.621 ± 0.328
MetCys: 0.308 ± 0.12
MetAsp: 1.041 ± 0.181
MetGlu: 1.889 ± 0.295
MetPhe: 1.002 ± 0.222
MetGly: 1.811 ± 0.296
MetHis: 0.655 ± 0.131
MetIle: 1.85 ± 0.283
MetLys: 2.235 ± 0.303
MetLeu: 2.12 ± 0.303
MetMet: 0.54 ± 0.158
MetAsn: 1.387 ± 0.225
MetPro: 1.002 ± 0.186
MetGln: 1.195 ± 0.152
MetArg: 1.002 ± 0.206
MetSer: 2.428 ± 0.259
MetThr: 1.927 ± 0.247
MetVal: 1.773 ± 0.321
MetTrp: 0.385 ± 0.101
MetTyr: 1.118 ± 0.201
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.199 ± 0.394
AsnCys: 0.694 ± 0.164
AsnAsp: 2.582 ± 0.293
AsnGlu: 1.85 ± 0.267
AsnPhe: 1.657 ± 0.265
AsnGly: 4.355 ± 0.36
AsnHis: 1.156 ± 0.188
AsnIle: 3.315 ± 0.334
AsnLys: 3.237 ± 0.337
AsnLeu: 4.432 ± 0.563
AsnMet: 0.886 ± 0.177
AsnAsn: 2.505 ± 0.377
AsnPro: 2.197 ± 0.336
AsnGln: 1.503 ± 0.242
AsnArg: 2.544 ± 0.331
AsnSer: 3.546 ± 0.401
AsnThr: 2.814 ± 0.341
AsnVal: 2.852 ± 0.374
AsnTrp: 0.694 ± 0.156
AsnTyr: 1.773 ± 0.217
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.852 ± 0.412
ProCys: 0.655 ± 0.149
ProAsp: 2.043 ± 0.281
ProGlu: 3.276 ± 0.326
ProPhe: 1.233 ± 0.19
ProGly: 1.889 ± 0.288
ProHis: 0.732 ± 0.162
ProIle: 1.734 ± 0.263
ProLys: 1.966 ± 0.323
ProLeu: 2.621 ± 0.316
ProMet: 1.079 ± 0.248
ProAsn: 1.696 ± 0.255
ProPro: 0.886 ± 0.194
ProGln: 1.387 ± 0.251
ProArg: 1.272 ± 0.24
ProSer: 2.351 ± 0.258
ProThr: 2.158 ± 0.266
ProVal: 2.505 ± 0.292
ProTrp: 0.54 ± 0.164
ProTyr: 1.503 ± 0.199
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.623 ± 0.434
GlnCys: 0.385 ± 0.145
GlnAsp: 2.312 ± 0.333
GlnGlu: 2.081 ± 0.227
GlnPhe: 1.58 ± 0.251
GlnGly: 2.852 ± 0.323
GlnHis: 0.694 ± 0.172
GlnIle: 2.544 ± 0.283
GlnLys: 2.544 ± 0.323
GlnLeu: 3.276 ± 0.426
GlnMet: 1.31 ± 0.238
GlnAsn: 1.503 ± 0.251
GlnPro: 0.848 ± 0.179
GlnGln: 1.426 ± 0.222
GlnArg: 2.081 ± 0.317
GlnSer: 2.544 ± 0.34
GlnThr: 1.773 ± 0.279
GlnVal: 2.698 ± 0.325
GlnTrp: 0.848 ± 0.196
GlnTyr: 1.387 ± 0.236
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.469 ± 0.361
ArgCys: 0.964 ± 0.212
ArgAsp: 2.467 ± 0.372
ArgGlu: 2.968 ± 0.344
ArgPhe: 1.657 ± 0.188
ArgGly: 3.276 ± 0.373
ArgHis: 1.002 ± 0.23
ArgIle: 2.891 ± 0.348
ArgLys: 3.739 ± 0.385
ArgLeu: 4.664 ± 0.388
ArgMet: 1.387 ± 0.209
ArgAsn: 2.312 ± 0.295
ArgPro: 1.657 ± 0.229
ArgGln: 2.12 ± 0.344
ArgArg: 2.698 ± 0.393
ArgSer: 2.814 ± 0.389
ArgThr: 2.467 ± 0.296
ArgVal: 3.43 ± 0.305
ArgTrp: 1.079 ± 0.208
ArgTyr: 1.773 ± 0.285
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.893 ± 0.473
SerCys: 0.809 ± 0.254
SerAsp: 3.623 ± 0.387
SerGlu: 3.893 ± 0.369
SerPhe: 2.698 ± 0.355
SerGly: 6.321 ± 0.717
SerHis: 1.118 ± 0.205
SerIle: 3.893 ± 0.38
SerLys: 4.394 ± 0.449
SerLeu: 5.203 ± 0.483
SerMet: 1.966 ± 0.275
SerAsn: 3.353 ± 0.393
SerPro: 2.505 ± 0.311
SerGln: 2.467 ± 0.257
SerArg: 3.16 ± 0.277
SerSer: 4.664 ± 0.723
SerThr: 4.162 ± 0.51
SerVal: 5.049 ± 0.471
SerTrp: 1.349 ± 0.19
SerTyr: 2.467 ± 0.267
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.24 ± 0.376
ThrCys: 0.54 ± 0.143
ThrAsp: 3.122 ± 0.322
ThrGlu: 3.893 ± 0.368
ThrPhe: 2.428 ± 0.291
ThrGly: 5.165 ± 0.439
ThrHis: 1.156 ± 0.217
ThrIle: 3.816 ± 0.402
ThrLys: 3.584 ± 0.437
ThrLeu: 4.895 ± 0.428
ThrMet: 1.657 ± 0.207
ThrAsn: 2.968 ± 0.38
ThrPro: 2.736 ± 0.344
ThrGln: 2.582 ± 0.343
ThrArg: 2.235 ± 0.32
ThrSer: 4.124 ± 0.503
ThrThr: 3.739 ± 0.483
ThrVal: 4.201 ± 0.456
ThrTrp: 0.578 ± 0.157
ThrTyr: 2.12 ± 0.32
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.321 ± 0.523
ValCys: 0.886 ± 0.192
ValAsp: 4.818 ± 0.4
ValGlu: 4.586 ± 0.487
ValPhe: 2.736 ± 0.356
ValGly: 4.664 ± 0.512
ValHis: 1.118 ± 0.226
ValIle: 3.392 ± 0.37
ValLys: 4.779 ± 0.423
ValLeu: 4.818 ± 0.431
ValMet: 1.619 ± 0.263
ValAsn: 3.276 ± 0.343
ValPro: 2.12 ± 0.291
ValGln: 2.698 ± 0.339
ValArg: 3.045 ± 0.325
ValSer: 4.162 ± 0.413
ValThr: 4.586 ± 0.444
ValVal: 4.586 ± 0.532
ValTrp: 0.886 ± 0.15
ValTyr: 3.199 ± 0.371
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.31 ± 0.221
TrpCys: 0.154 ± 0.083
TrpAsp: 1.349 ± 0.209
TrpGlu: 1.195 ± 0.243
TrpPhe: 0.54 ± 0.142
TrpGly: 0.771 ± 0.184
TrpHis: 0.655 ± 0.17
TrpIle: 1.041 ± 0.224
TrpLys: 1.349 ± 0.216
TrpLeu: 1.889 ± 0.238
TrpMet: 0.27 ± 0.102
TrpAsn: 0.771 ± 0.155
TrpPro: 0.54 ± 0.159
TrpGln: 0.771 ± 0.144
TrpArg: 0.848 ± 0.167
TrpSer: 0.694 ± 0.146
TrpThr: 0.578 ± 0.156
TrpVal: 1.349 ± 0.221
TrpTrp: 0.27 ± 0.087
TrpTyr: 0.655 ± 0.178
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.351 ± 0.315
TyrCys: 0.501 ± 0.159
TyrAsp: 2.698 ± 0.358
TyrGlu: 3.16 ± 0.382
TyrPhe: 1.465 ± 0.236
TyrGly: 2.891 ± 0.293
TyrHis: 0.809 ± 0.211
TyrIle: 1.811 ± 0.292
TyrLys: 2.505 ± 0.314
TyrLeu: 2.621 ± 0.341
TyrMet: 0.578 ± 0.149
TyrAsn: 2.197 ± 0.284
TyrPro: 1.233 ± 0.221
TyrGln: 1.195 ± 0.236
TyrArg: 1.542 ± 0.223
TyrSer: 3.045 ± 0.435
TyrThr: 2.698 ± 0.282
TyrVal: 1.773 ± 0.239
TyrTrp: 0.732 ± 0.161
TyrTyr: 1.118 ± 0.208
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 167 proteins (25947 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
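A protein-level bootstrap of this kind could look roughly like the sketch below. The source states only that the error comes from 100 bootstrap rounds at the protein level; the function names, the use of the standard deviation of the replicates as the reported error, and the normalisation by dipeptide count are assumptions made for illustration.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-residue string in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread (in per mille) of a dipeptide frequency over bootstrap replicates.

    Whole proteins are resampled with replacement, so each replicate
    contains as many proteins as the original set.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        total = sum(max(len(s) - 1, 0) for s in sample)
        hits = sum(count_pair(s, pair) for s in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumed error measure: sample standard deviation of the replicates.
    return statistics.stdev(estimates)


toy = ["AAAC", "ACAC", "CCCC"]      # hypothetical sequences, not phage data
print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the dipeptides of one protein together, so the error reflects variation between proteins.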

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski