Amino acid dipeptide frequency for Pseudomonas phage JBD5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
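As an illustration of the counting described above, here is a minimal Python sketch. It is not the code used to build this page: the function names, the toy sequences, and the choice to normalise by the total number of overlapping pairs within each protein are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes (Xaa / "X" is left out here).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for each of the 400 dipeptides: 1 / 400 = 0.0025 = 0.25%.
EXPECTED = 1 / len(AMINO_ACIDS) ** 2


def dipeptide_frequencies(proteins):
    """Return the fraction of every standard dipeptide among all counted pairs.

    Overlapping pairs are counted inside each protein only, never across
    protein boundaries. Pairs containing other letters still count towards
    the total but are not reported.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    pairs = ("".join(p) for p in product(AMINO_ACIDS, repeat=2))
    return {pair: counts[pair] / total for pair in pairs}


def classify(freq, expected=EXPECTED):
    """Label a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "as expected"


# Hypothetical toy input, not taken from the JBD5 proteome.
toy_proteins = ["MKAAL", "AAG"]
freqs = dipeptide_frequencies(toy_proteins)
print(len(freqs), freqs["AA"], classify(freqs["AA"]))
```

The toy input contains six pairs in total, two of which are AA, so AA comes out far above the uniform 0.25% expectation.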

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
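The unit conversion can be checked with a short Python sketch. The helper name is hypothetical; the AlaAla value is taken from the table on this page, and the uniform expectation is simply 1/400 expressed in per mille.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala_per_mille = 17.545  # AlaAla entry from the table, in per mille

fraction = per_mille_to_fraction(ala_ala_per_mille)
percent = fraction * 100

# Uniform expectation for 400 dipeptides, in per mille: 1000 / 400 = 2.5
uniform_per_mille = 1000 / 400

# How many times more frequent AlaAla is than the uniform expectation.
enrichment = ala_ala_per_mille / uniform_per_mille

print(f"fraction={fraction:.6f} percent={percent:.4f} enrichment={enrichment:.3f}")
```

Under these assumptions, AlaAla is roughly seven times more frequent than a uniform distribution of dipeptides would predict.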

For more information, see the sequence space article on Wikipedia.

Each group below is headed by the first residue; entries follow the second residue in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa. Each entry gives the frequency in ‰ ± the estimated error.
Ala
AlaAla: 17.545 ± 2.188
AlaCys: 1.49 ± 0.379
AlaAsp: 6.786 ± 0.821
AlaGlu: 8.855 ± 1.053
AlaPhe: 3.724 ± 0.479
AlaGly: 8.938 ± 1.118
AlaHis: 1.821 ± 0.415
AlaIle: 6.29 ± 0.586
AlaLys: 4.635 ± 0.758
AlaLeu: 12.083 ± 1.074
AlaMet: 4.552 ± 0.593
AlaAsn: 3.641 ± 0.578
AlaPro: 4.469 ± 0.715
AlaGln: 6.29 ± 0.777
AlaArg: 8.607 ± 0.978
AlaSer: 7.283 ± 1.128
AlaThr: 6.207 ± 0.679
AlaVal: 6.373 ± 0.813
AlaTrp: 2.152 ± 0.356
AlaTyr: 3.062 ± 0.498
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.91 ± 0.302
CysCys: 0.083 ± 0.106
CysAsp: 0.745 ± 0.273
CysGlu: 0.579 ± 0.299
CysPhe: 0.497 ± 0.248
CysGly: 0.662 ± 0.249
CysHis: 0.331 ± 0.172
CysIle: 0.083 ± 0.106
CysLys: 0.083 ± 0.081
CysLeu: 0.331 ± 0.202
CysMet: 0.414 ± 0.214
CysAsn: 0.248 ± 0.166
CysPro: 0.579 ± 0.257
CysGln: 0.166 ± 0.127
CysArg: 0.91 ± 0.32
CysSer: 0.497 ± 0.238
CysThr: 0.414 ± 0.236
CysVal: 0.248 ± 0.147
CysTrp: 0.248 ± 0.143
CysTyr: 0.166 ± 0.114
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.042 ± 0.783
AspCys: 0.083 ± 0.079
AspAsp: 3.393 ± 0.441
AspGlu: 3.724 ± 0.633
AspPhe: 1.821 ± 0.266
AspGly: 6.538 ± 0.863
AspHis: 1.241 ± 0.362
AspIle: 1.986 ± 0.402
AspLys: 1.159 ± 0.361
AspLeu: 5.048 ± 0.72
AspMet: 1.49 ± 0.446
AspAsn: 1.821 ± 0.357
AspPro: 2.814 ± 0.465
AspGln: 3.973 ± 0.761
AspArg: 3.89 ± 0.48
AspSer: 3.476 ± 0.527
AspThr: 3.393 ± 0.673
AspVal: 3.807 ± 0.614
AspTrp: 1.159 ± 0.267
AspTyr: 1.324 ± 0.332
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.531 ± 0.708
GluCys: 0.662 ± 0.287
GluAsp: 2.979 ± 0.616
GluGlu: 3.145 ± 0.555
GluPhe: 2.317 ± 0.443
GluGly: 3.145 ± 0.533
GluHis: 0.91 ± 0.321
GluIle: 2.731 ± 0.495
GluLys: 2.566 ± 0.465
GluLeu: 6.704 ± 0.696
GluMet: 1.407 ± 0.479
GluAsn: 1.159 ± 0.315
GluPro: 2.979 ± 0.474
GluGln: 3.641 ± 0.519
GluArg: 4.8 ± 0.705
GluSer: 3.145 ± 0.457
GluThr: 2.897 ± 0.527
GluVal: 4.635 ± 0.606
GluTrp: 0.91 ± 0.277
GluTyr: 1.572 ± 0.369
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.807 ± 0.55
PheCys: 0.414 ± 0.173
PheAsp: 2.069 ± 0.443
PheGlu: 1.407 ± 0.402
PhePhe: 0.91 ± 0.27
PheGly: 2.648 ± 0.413
PheHis: 0.497 ± 0.217
PheIle: 1.407 ± 0.3
PheLys: 0.993 ± 0.276
PheLeu: 1.821 ± 0.471
PheMet: 0.91 ± 0.304
PheAsn: 0.91 ± 0.283
PhePro: 1.49 ± 0.398
PheGln: 1.572 ± 0.344
PheArg: 2.069 ± 0.475
PheSer: 1.407 ± 0.303
PheThr: 1.655 ± 0.306
PheVal: 1.655 ± 0.28
PheTrp: 0.414 ± 0.184
PheTyr: 0.91 ± 0.283
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.862 ± 1.246
GlyCys: 0.331 ± 0.147
GlyAsp: 4.552 ± 0.798
GlyGlu: 4.469 ± 0.599
GlyPhe: 2.814 ± 0.428
GlyGly: 6.621 ± 0.678
GlyHis: 0.993 ± 0.352
GlyIle: 4.055 ± 0.581
GlyLys: 2.814 ± 0.524
GlyLeu: 7.697 ± 0.777
GlyMet: 1.655 ± 0.476
GlyAsn: 1.904 ± 0.373
GlyPro: 2.731 ± 0.464
GlyGln: 4.635 ± 0.739
GlyArg: 6.869 ± 0.808
GlySer: 5.297 ± 0.722
GlyThr: 4.386 ± 0.498
GlyVal: 5.048 ± 0.673
GlyTrp: 1.821 ± 0.363
GlyTyr: 2.235 ± 0.638
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.738 ± 0.357
HisCys: 0.083 ± 0.077
HisAsp: 0.579 ± 0.143
HisGlu: 0.745 ± 0.321
HisPhe: 0.497 ± 0.269
HisGly: 1.159 ± 0.273
HisHis: 0.331 ± 0.244
HisIle: 0.662 ± 0.223
HisLys: 0.248 ± 0.159
HisLeu: 1.821 ± 0.408
HisMet: 0.828 ± 0.273
HisAsn: 0.993 ± 0.298
HisPro: 1.241 ± 0.348
HisGln: 0.91 ± 0.309
HisArg: 0.745 ± 0.281
HisSer: 0.579 ± 0.271
HisThr: 0.745 ± 0.273
HisVal: 0.828 ± 0.252
HisTrp: 0.414 ± 0.188
HisTyr: 0.828 ± 0.277
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.297 ± 0.728
IleCys: 0.497 ± 0.227
IleAsp: 3.724 ± 0.507
IleGlu: 3.145 ± 0.449
IlePhe: 0.91 ± 0.256
IleGly: 3.559 ± 0.559
IleHis: 0.662 ± 0.209
IleIle: 1.655 ± 0.295
IleLys: 1.159 ± 0.26
IleLeu: 2.648 ± 0.454
IleMet: 0.414 ± 0.175
IleAsn: 1.159 ± 0.365
IlePro: 2.152 ± 0.431
IleGln: 2.235 ± 0.578
IleArg: 3.973 ± 0.551
IleSer: 2.317 ± 0.391
IleThr: 2.814 ± 0.518
IleVal: 2.814 ± 0.411
IleTrp: 0.579 ± 0.215
IleTyr: 0.828 ± 0.268
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.966 ± 0.855
LysCys: 0.0 ± 0.0
LysAsp: 1.241 ± 0.316
LysGlu: 1.738 ± 0.476
LysPhe: 0.579 ± 0.214
LysGly: 2.648 ± 0.335
LysHis: 0.497 ± 0.189
LysIle: 1.076 ± 0.338
LysLys: 1.49 ± 0.377
LysLeu: 3.062 ± 0.617
LysMet: 0.91 ± 0.347
LysAsn: 1.241 ± 0.336
LysPro: 2.483 ± 0.599
LysGln: 1.49 ± 0.403
LysArg: 2.979 ± 0.536
LysSer: 1.986 ± 0.43
LysThr: 1.821 ± 0.385
LysVal: 2.483 ± 0.484
LysTrp: 0.331 ± 0.172
LysTyr: 1.159 ± 0.246
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.745 ± 1.299
LeuCys: 0.579 ± 0.226
LeuAsp: 6.786 ± 0.771
LeuGlu: 6.455 ± 0.719
LeuPhe: 2.235 ± 0.538
LeuGly: 7.697 ± 0.689
LeuHis: 2.069 ± 0.456
LeuIle: 2.979 ± 0.529
LeuLys: 3.641 ± 0.852
LeuLeu: 8.442 ± 0.949
LeuMet: 1.986 ± 0.475
LeuAsn: 2.979 ± 0.501
LeuPro: 4.552 ± 0.578
LeuGln: 4.055 ± 0.723
LeuArg: 6.207 ± 0.638
LeuSer: 5.214 ± 0.728
LeuThr: 5.297 ± 0.848
LeuVal: 6.869 ± 0.76
LeuTrp: 1.159 ± 0.296
LeuTyr: 2.069 ± 0.486
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.973 ± 0.506
MetCys: 0.083 ± 0.084
MetAsp: 1.904 ± 0.333
MetGlu: 1.655 ± 0.442
MetPhe: 0.497 ± 0.199
MetGly: 1.49 ± 0.329
MetHis: 0.414 ± 0.2
MetIle: 0.414 ± 0.159
MetLys: 0.91 ± 0.264
MetLeu: 1.655 ± 0.326
MetMet: 0.414 ± 0.203
MetAsn: 0.662 ± 0.237
MetPro: 1.159 ± 0.422
MetGln: 1.49 ± 0.336
MetArg: 1.572 ± 0.293
MetSer: 1.986 ± 0.386
MetThr: 1.572 ± 0.322
MetVal: 0.993 ± 0.367
MetTrp: 0.414 ± 0.177
MetTyr: 0.331 ± 0.206
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.31 ± 0.69
AsnCys: 0.083 ± 0.071
AsnAsp: 1.241 ± 0.278
AsnGlu: 1.407 ± 0.319
AsnPhe: 0.579 ± 0.241
AsnGly: 3.062 ± 0.561
AsnHis: 0.745 ± 0.234
AsnIle: 1.241 ± 0.297
AsnLys: 1.241 ± 0.4
AsnLeu: 2.814 ± 0.577
AsnMet: 0.745 ± 0.24
AsnAsn: 1.904 ± 0.487
AsnPro: 2.4 ± 0.558
AsnGln: 1.241 ± 0.344
AsnArg: 2.648 ± 0.439
AsnSer: 1.324 ± 0.453
AsnThr: 1.324 ± 0.273
AsnVal: 1.241 ± 0.319
AsnTrp: 0.745 ± 0.216
AsnTyr: 1.324 ± 0.283
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.455 ± 0.782
ProCys: 0.745 ± 0.32
ProAsp: 3.807 ± 0.517
ProGlu: 2.731 ± 0.405
ProPhe: 1.572 ± 0.401
ProGly: 4.221 ± 0.597
ProHis: 0.414 ± 0.184
ProIle: 1.904 ± 0.445
ProLys: 1.821 ± 0.414
ProLeu: 4.138 ± 0.516
ProMet: 0.91 ± 0.295
ProAsn: 1.655 ± 0.317
ProPro: 2.4 ± 0.555
ProGln: 2.152 ± 0.483
ProArg: 3.393 ± 0.645
ProSer: 3.145 ± 0.457
ProThr: 2.731 ± 0.406
ProVal: 3.31 ± 0.762
ProTrp: 0.497 ± 0.206
ProTyr: 1.49 ± 0.376
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.207 ± 0.889
GlnCys: 0.248 ± 0.176
GlnAsp: 1.821 ± 0.355
GlnGlu: 1.986 ± 0.513
GlnPhe: 1.49 ± 0.425
GlnGly: 4.304 ± 0.656
GlnHis: 0.828 ± 0.271
GlnIle: 2.4 ± 0.509
GlnLys: 1.159 ± 0.394
GlnLeu: 6.373 ± 0.621
GlnMet: 1.324 ± 0.308
GlnAsn: 1.159 ± 0.22
GlnPro: 1.986 ± 0.403
GlnGln: 3.724 ± 0.985
GlnArg: 4.717 ± 0.756
GlnSer: 2.483 ± 0.32
GlnThr: 2.317 ± 0.468
GlnVal: 4.966 ± 0.563
GlnTrp: 0.497 ± 0.259
GlnTyr: 0.993 ± 0.329
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.276 ± 0.808
ArgCys: 0.497 ± 0.204
ArgAsp: 4.469 ± 0.572
ArgGlu: 5.048 ± 0.528
ArgPhe: 1.986 ± 0.412
ArgGly: 4.635 ± 0.627
ArgHis: 1.076 ± 0.237
ArgIle: 3.973 ± 0.617
ArgLys: 2.483 ± 0.463
ArgLeu: 7.531 ± 0.916
ArgMet: 1.159 ± 0.294
ArgAsn: 1.904 ± 0.392
ArgPro: 3.641 ± 0.88
ArgGln: 4.055 ± 0.692
ArgArg: 5.959 ± 0.887
ArgSer: 4.717 ± 0.599
ArgThr: 3.062 ± 0.473
ArgVal: 5.048 ± 0.817
ArgTrp: 1.738 ± 0.428
ArgTyr: 2.483 ± 0.514
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.531 ± 0.765
SerCys: 0.662 ± 0.242
SerAsp: 3.724 ± 0.738
SerGlu: 2.566 ± 0.408
SerPhe: 1.241 ± 0.263
SerGly: 5.214 ± 0.73
SerHis: 0.993 ± 0.354
SerIle: 2.731 ± 0.419
SerLys: 1.986 ± 0.482
SerLeu: 6.29 ± 0.666
SerMet: 0.993 ± 0.307
SerAsn: 2.152 ± 0.436
SerPro: 3.641 ± 0.636
SerGln: 2.566 ± 0.448
SerArg: 3.228 ± 0.576
SerSer: 4.221 ± 0.602
SerThr: 3.393 ± 0.711
SerVal: 3.228 ± 0.511
SerTrp: 1.821 ± 0.359
SerTyr: 1.986 ± 0.349
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.283 ± 0.94
ThrCys: 0.414 ± 0.273
ThrAsp: 3.145 ± 0.44
ThrGlu: 3.062 ± 0.506
ThrPhe: 1.655 ± 0.348
ThrGly: 4.221 ± 0.73
ThrHis: 0.497 ± 0.212
ThrIle: 2.152 ± 0.454
ThrLys: 1.821 ± 0.439
ThrLeu: 5.297 ± 0.794
ThrMet: 0.828 ± 0.227
ThrAsn: 1.655 ± 0.366
ThrPro: 2.897 ± 0.362
ThrGln: 1.324 ± 0.334
ThrArg: 2.979 ± 0.509
ThrSer: 3.807 ± 0.564
ThrThr: 3.145 ± 0.713
ThrVal: 5.545 ± 0.782
ThrTrp: 0.91 ± 0.274
ThrTyr: 1.159 ± 0.295
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.104 ± 0.864
ValCys: 0.579 ± 0.198
ValAsp: 3.145 ± 0.598
ValGlu: 4.717 ± 0.782
ValPhe: 1.904 ± 0.293
ValGly: 4.8 ± 0.716
ValHis: 0.828 ± 0.218
ValIle: 2.566 ± 0.474
ValLys: 2.483 ± 0.492
ValLeu: 6.124 ± 0.848
ValMet: 1.49 ± 0.325
ValAsn: 1.986 ± 0.348
ValPro: 3.393 ± 0.478
ValGln: 3.228 ± 0.549
ValArg: 4.717 ± 0.573
ValSer: 3.973 ± 0.629
ValThr: 4.469 ± 0.654
ValVal: 4.469 ± 0.537
ValTrp: 1.159 ± 0.267
ValTyr: 2.235 ± 0.557
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.324 ± 0.295
TrpCys: 0.331 ± 0.146
TrpAsp: 0.828 ± 0.229
TrpGlu: 0.745 ± 0.246
TrpPhe: 0.579 ± 0.153
TrpGly: 1.076 ± 0.393
TrpHis: 0.166 ± 0.114
TrpIle: 0.993 ± 0.247
TrpLys: 0.91 ± 0.234
TrpLeu: 1.904 ± 0.384
TrpMet: 0.745 ± 0.234
TrpAsn: 0.497 ± 0.175
TrpPro: 0.91 ± 0.324
TrpGln: 0.91 ± 0.257
TrpArg: 1.49 ± 0.374
TrpSer: 1.324 ± 0.366
TrpThr: 0.828 ± 0.259
TrpVal: 1.655 ± 0.379
TrpTrp: 0.579 ± 0.218
TrpTyr: 0.166 ± 0.114
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.897 ± 0.328
TyrCys: 0.331 ± 0.158
TyrAsp: 1.241 ± 0.344
TyrGlu: 1.738 ± 0.432
TyrPhe: 1.159 ± 0.242
TyrGly: 2.152 ± 0.498
TyrHis: 0.662 ± 0.282
TyrIle: 1.241 ± 0.317
TyrLys: 0.579 ± 0.268
TyrLeu: 2.235 ± 0.455
TyrMet: 0.331 ± 0.198
TyrAsn: 0.993 ± 0.302
TyrPro: 1.572 ± 0.514
TyrGln: 1.324 ± 0.313
TyrArg: 1.986 ± 0.407
TyrSer: 1.986 ± 0.425
TyrThr: 1.324 ± 0.462
TyrVal: 2.152 ± 0.362
TyrTrp: 0.414 ± 0.194
TyrTyr: 0.497 ± 0.192
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (12084 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
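A protein-level bootstrap of this kind could look roughly like the Python sketch below. This is an assumption-laden illustration, not the pipeline behind this page: the function names are hypothetical, whole proteins are resampled with replacement 100 times, and the error is taken to be the standard deviation of the resampled estimates, which the page does not state explicitly.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over all within-protein pairs."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return 1000 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement.

    Assumption: the reported error is the standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the JBD5 proteome.
toy_proteins = ["MKAAL", "AAG", "MLLV", "GAAK"]
value = dipeptide_per_mille(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.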

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski