Amino acid dipepetide frequency for Staphylococcus phage phi7401PVL

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.822AlaAla: 2.822 ± 0.586
0.235AlaCys: 0.235 ± 0.137
2.665AlaAsp: 2.665 ± 0.513
4.232AlaGlu: 4.232 ± 0.549
1.568AlaPhe: 1.568 ± 0.332
3.997AlaGly: 3.997 ± 0.694
1.097AlaHis: 1.097 ± 0.325
4.467AlaIle: 4.467 ± 0.843
6.662AlaLys: 6.662 ± 1.095
5.33AlaLeu: 5.33 ± 0.702
1.411AlaMet: 1.411 ± 0.29
4.311AlaAsn: 4.311 ± 1.026
1.489AlaPro: 1.489 ± 0.37
1.724AlaGln: 1.724 ± 0.337
2.743AlaArg: 2.743 ± 0.376
4.546AlaSer: 4.546 ± 0.794
3.605AlaThr: 3.605 ± 0.5
2.9AlaVal: 2.9 ± 0.48
1.019AlaTrp: 1.019 ± 0.346
2.665AlaTyr: 2.665 ± 0.513
0.0AlaXaa: 0.0 ± 0.0
Cys
0.157CysAla: 0.157 ± 0.112
0.0CysCys: 0.0 ± 0.0
0.157CysAsp: 0.157 ± 0.11
0.392CysGlu: 0.392 ± 0.243
0.314CysPhe: 0.314 ± 0.193
0.314CysGly: 0.314 ± 0.136
0.157CysHis: 0.157 ± 0.122
0.392CysIle: 0.392 ± 0.163
0.47CysLys: 0.47 ± 0.206
0.549CysLeu: 0.549 ± 0.229
0.078CysMet: 0.078 ± 0.083
0.157CysAsn: 0.157 ± 0.135
0.157CysPro: 0.157 ± 0.093
0.157CysGln: 0.157 ± 0.105
0.314CysArg: 0.314 ± 0.18
0.157CysSer: 0.157 ± 0.111
0.235CysThr: 0.235 ± 0.136
0.078CysVal: 0.078 ± 0.1
0.0CysTrp: 0.0 ± 0.0
0.314CysTyr: 0.314 ± 0.167
0.0CysXaa: 0.0 ± 0.0
Asp
2.978AspAla: 2.978 ± 0.673
0.157AspCys: 0.157 ± 0.103
3.997AspAsp: 3.997 ± 0.839
4.938AspGlu: 4.938 ± 0.81
3.684AspPhe: 3.684 ± 0.543
3.37AspGly: 3.37 ± 0.623
0.862AspHis: 0.862 ± 0.249
4.781AspIle: 4.781 ± 0.627
6.897AspLys: 6.897 ± 0.681
5.565AspLeu: 5.565 ± 0.625
1.959AspMet: 1.959 ± 0.411
2.822AspAsn: 2.822 ± 0.379
1.411AspPro: 1.411 ± 0.395
1.176AspGln: 1.176 ± 0.228
2.038AspArg: 2.038 ± 0.402
3.605AspSer: 3.605 ± 0.658
3.762AspThr: 3.762 ± 0.537
3.919AspVal: 3.919 ± 0.48
0.862AspTrp: 0.862 ± 0.22
3.527AspTyr: 3.527 ± 0.554
0.0AspXaa: 0.0 ± 0.0
Glu
4.703GluAla: 4.703 ± 0.498
0.47GluCys: 0.47 ± 0.224
4.546GluAsp: 4.546 ± 0.813
6.505GluGlu: 6.505 ± 1.232
2.9GluPhe: 2.9 ± 0.459
3.292GluGly: 3.292 ± 0.424
1.019GluHis: 1.019 ± 0.312
5.094GluIle: 5.094 ± 0.926
8.308GluLys: 8.308 ± 0.953
7.054GluLeu: 7.054 ± 0.823
2.351GluMet: 2.351 ± 0.447
5.33GluAsn: 5.33 ± 0.761
1.568GluPro: 1.568 ± 0.373
2.822GluGln: 2.822 ± 0.55
2.978GluArg: 2.978 ± 0.596
3.292GluSer: 3.292 ± 0.457
4.389GluThr: 4.389 ± 0.703
3.527GluVal: 3.527 ± 0.379
0.941GluTrp: 0.941 ± 0.218
2.665GluTyr: 2.665 ± 0.668
0.0GluXaa: 0.0 ± 0.0
Phe
1.803PheAla: 1.803 ± 0.381
0.392PheCys: 0.392 ± 0.168
3.057PheAsp: 3.057 ± 0.398
3.135PheGlu: 3.135 ± 0.506
0.705PhePhe: 0.705 ± 0.203
2.743PheGly: 2.743 ± 0.601
0.47PheHis: 0.47 ± 0.169
3.605PheIle: 3.605 ± 0.564
4.154PheLys: 4.154 ± 0.601
1.959PheLeu: 1.959 ± 0.379
1.019PheMet: 1.019 ± 0.234
3.605PheAsn: 3.605 ± 0.559
0.941PhePro: 0.941 ± 0.333
1.176PheGln: 1.176 ± 0.293
1.176PheArg: 1.176 ± 0.242
2.038PheSer: 2.038 ± 0.383
1.724PheThr: 1.724 ± 0.385
2.038PheVal: 2.038 ± 0.414
0.392PheTrp: 0.392 ± 0.161
1.646PheTyr: 1.646 ± 0.367
0.0PheXaa: 0.0 ± 0.0
Gly
4.311GlyAla: 4.311 ± 1.093
0.235GlyCys: 0.235 ± 0.136
3.997GlyAsp: 3.997 ± 0.441
3.762GlyGlu: 3.762 ± 0.491
2.43GlyPhe: 2.43 ± 0.366
5.8GlyGly: 5.8 ± 1.334
1.489GlyHis: 1.489 ± 0.382
4.076GlyIle: 4.076 ± 0.542
6.427GlyLys: 6.427 ± 0.718
5.643GlyLeu: 5.643 ± 0.95
1.411GlyMet: 1.411 ± 0.467
3.213GlyAsn: 3.213 ± 0.545
0.941GlyPro: 0.941 ± 0.18
1.724GlyGln: 1.724 ± 0.364
2.351GlyArg: 2.351 ± 0.529
3.997GlySer: 3.997 ± 0.54
3.684GlyThr: 3.684 ± 0.62
4.154GlyVal: 4.154 ± 0.675
1.019GlyTrp: 1.019 ± 0.288
2.743GlyTyr: 2.743 ± 0.52
0.0GlyXaa: 0.0 ± 0.0
His
1.019HisAla: 1.019 ± 0.256
0.078HisCys: 0.078 ± 0.066
0.784HisAsp: 0.784 ± 0.312
1.176HisGlu: 1.176 ± 0.326
0.784HisPhe: 0.784 ± 0.208
1.176HisGly: 1.176 ± 0.257
0.392HisHis: 0.392 ± 0.193
1.332HisIle: 1.332 ± 0.449
1.254HisLys: 1.254 ± 0.289
1.724HisLeu: 1.724 ± 0.35
0.392HisMet: 0.392 ± 0.175
0.941HisAsn: 0.941 ± 0.291
0.941HisPro: 0.941 ± 0.214
0.47HisGln: 0.47 ± 0.178
0.862HisArg: 0.862 ± 0.244
1.254HisSer: 1.254 ± 0.251
1.254HisThr: 1.254 ± 0.363
0.784HisVal: 0.784 ± 0.295
0.314HisTrp: 0.314 ± 0.17
0.941HisTyr: 0.941 ± 0.256
0.0HisXaa: 0.0 ± 0.0
Ile
4.546IleAla: 4.546 ± 0.634
0.392IleCys: 0.392 ± 0.283
4.781IleAsp: 4.781 ± 0.815
5.486IleGlu: 5.486 ± 0.599
2.508IlePhe: 2.508 ± 0.53
3.37IleGly: 3.37 ± 0.57
1.568IleHis: 1.568 ± 0.347
3.919IleIle: 3.919 ± 0.717
7.602IleLys: 7.602 ± 0.906
4.859IleLeu: 4.859 ± 0.872
1.568IleMet: 1.568 ± 0.348
4.703IleAsn: 4.703 ± 0.517
2.351IlePro: 2.351 ± 0.381
1.568IleGln: 1.568 ± 0.314
3.057IleArg: 3.057 ± 0.402
4.624IleSer: 4.624 ± 0.683
4.624IleThr: 4.624 ± 0.684
3.762IleVal: 3.762 ± 0.624
0.705IleTrp: 0.705 ± 0.277
2.743IleTyr: 2.743 ± 0.562
0.0IleXaa: 0.0 ± 0.0
Lys
7.916LysAla: 7.916 ± 1.121
0.235LysCys: 0.235 ± 0.121
5.016LysAsp: 5.016 ± 0.552
7.838LysGlu: 7.838 ± 0.848
2.038LysPhe: 2.038 ± 0.414
6.035LysGly: 6.035 ± 1.054
1.881LysHis: 1.881 ± 0.374
6.192LysIle: 6.192 ± 0.855
7.994LysLys: 7.994 ± 1.193
8.935LysLeu: 8.935 ± 0.991
2.978LysMet: 2.978 ± 0.542
5.957LysAsn: 5.957 ± 0.728
2.665LysPro: 2.665 ± 0.373
5.094LysGln: 5.094 ± 0.541
4.154LysArg: 4.154 ± 0.67
6.035LysSer: 6.035 ± 1.306
5.251LysThr: 5.251 ± 0.573
5.408LysVal: 5.408 ± 0.574
2.038LysTrp: 2.038 ± 0.537
4.938LysTyr: 4.938 ± 0.577
0.0LysXaa: 0.0 ± 0.0
Leu
4.311LeuAla: 4.311 ± 0.846
0.392LeuCys: 0.392 ± 0.183
5.173LeuAsp: 5.173 ± 0.973
6.505LeuGlu: 6.505 ± 0.661
2.351LeuPhe: 2.351 ± 0.32
5.094LeuGly: 5.094 ± 1.056
1.097LeuHis: 1.097 ± 0.353
5.173LeuIle: 5.173 ± 0.611
8.465LeuLys: 8.465 ± 1.304
6.505LeuLeu: 6.505 ± 0.777
1.803LeuMet: 1.803 ± 0.292
5.721LeuAsn: 5.721 ± 0.577
3.057LeuPro: 3.057 ± 0.577
2.9LeuGln: 2.9 ± 0.7
3.605LeuArg: 3.605 ± 0.47
5.878LeuSer: 5.878 ± 0.796
5.565LeuThr: 5.565 ± 0.754
4.076LeuVal: 4.076 ± 0.476
0.392LeuTrp: 0.392 ± 0.21
3.135LeuTyr: 3.135 ± 0.814
0.0LeuXaa: 0.0 ± 0.0
Met
1.097MetAla: 1.097 ± 0.252
0.157MetCys: 0.157 ± 0.105
1.411MetAsp: 1.411 ± 0.381
1.411MetGlu: 1.411 ± 0.317
0.941MetPhe: 0.941 ± 0.254
1.646MetGly: 1.646 ± 0.475
0.392MetHis: 0.392 ± 0.177
1.332MetIle: 1.332 ± 0.209
2.586MetLys: 2.586 ± 0.599
1.803MetLeu: 1.803 ± 0.465
0.157MetMet: 0.157 ± 0.106
1.568MetAsn: 1.568 ± 0.385
0.941MetPro: 0.941 ± 0.223
1.646MetGln: 1.646 ± 0.391
1.254MetArg: 1.254 ± 0.223
2.116MetSer: 2.116 ± 0.394
2.038MetThr: 2.038 ± 0.34
1.097MetVal: 1.097 ± 0.195
0.235MetTrp: 0.235 ± 0.127
0.941MetTyr: 0.941 ± 0.254
0.0MetXaa: 0.0 ± 0.0
Asn
3.919AsnAla: 3.919 ± 0.532
0.078AsnCys: 0.078 ± 0.066
3.84AsnAsp: 3.84 ± 0.505
4.311AsnGlu: 4.311 ± 0.583
2.116AsnPhe: 2.116 ± 0.453
4.311AsnGly: 4.311 ± 0.739
0.941AsnHis: 0.941 ± 0.314
4.624AsnIle: 4.624 ± 0.518
6.662AsnLys: 6.662 ± 0.724
4.781AsnLeu: 4.781 ± 0.523
0.941AsnMet: 0.941 ± 0.269
4.467AsnAsn: 4.467 ± 0.816
2.273AsnPro: 2.273 ± 0.359
2.822AsnGln: 2.822 ± 0.478
2.978AsnArg: 2.978 ± 0.503
4.938AsnSer: 4.938 ± 0.604
4.154AsnThr: 4.154 ± 0.386
3.292AsnVal: 3.292 ± 0.676
1.176AsnTrp: 1.176 ± 0.353
3.37AsnTyr: 3.37 ± 0.65
0.0AsnXaa: 0.0 ± 0.0
Pro
1.097ProAla: 1.097 ± 0.244
0.235ProCys: 0.235 ± 0.132
1.332ProAsp: 1.332 ± 0.364
2.43ProGlu: 2.43 ± 0.508
1.332ProPhe: 1.332 ± 0.319
1.959ProGly: 1.959 ± 0.392
0.392ProHis: 0.392 ± 0.177
1.881ProIle: 1.881 ± 0.339
2.351ProLys: 2.351 ± 0.525
2.586ProLeu: 2.586 ± 0.514
0.941ProMet: 0.941 ± 0.259
2.195ProAsn: 2.195 ± 0.404
0.627ProPro: 0.627 ± 0.207
1.176ProGln: 1.176 ± 0.284
1.176ProArg: 1.176 ± 0.311
2.195ProSer: 2.195 ± 0.389
1.332ProThr: 1.332 ± 0.295
1.332ProVal: 1.332 ± 0.418
0.392ProTrp: 0.392 ± 0.164
1.097ProTyr: 1.097 ± 0.287
0.0ProXaa: 0.0 ± 0.0
Gln
2.743GlnAla: 2.743 ± 0.419
0.235GlnCys: 0.235 ± 0.137
2.116GlnAsp: 2.116 ± 0.415
2.351GlnGlu: 2.351 ± 0.379
1.568GlnPhe: 1.568 ± 0.335
2.195GlnGly: 2.195 ± 0.518
0.627GlnHis: 0.627 ± 0.183
2.9GlnIle: 2.9 ± 0.588
3.213GlnLys: 3.213 ± 0.419
3.135GlnLeu: 3.135 ± 0.361
0.941GlnMet: 0.941 ± 0.308
2.743GlnAsn: 2.743 ± 0.546
1.019GlnPro: 1.019 ± 0.249
1.411GlnGln: 1.411 ± 0.395
1.959GlnArg: 1.959 ± 0.353
2.43GlnSer: 2.43 ± 0.312
1.332GlnThr: 1.332 ± 0.345
2.116GlnVal: 2.116 ± 0.443
0.47GlnTrp: 0.47 ± 0.181
1.332GlnTyr: 1.332 ± 0.298
0.0GlnXaa: 0.0 ± 0.0
Arg
2.743ArgAla: 2.743 ± 0.739
0.0ArgCys: 0.0 ± 0.0
3.135ArgAsp: 3.135 ± 0.391
2.43ArgGlu: 2.43 ± 0.523
2.038ArgPhe: 2.038 ± 0.409
2.116ArgGly: 2.116 ± 0.363
0.784ArgHis: 0.784 ± 0.225
3.37ArgIle: 3.37 ± 0.459
3.997ArgLys: 3.997 ± 0.511
3.605ArgLeu: 3.605 ± 0.833
0.862ArgMet: 0.862 ± 0.239
2.665ArgAsn: 2.665 ± 0.505
0.705ArgPro: 0.705 ± 0.372
1.489ArgGln: 1.489 ± 0.353
1.724ArgArg: 1.724 ± 0.316
1.724ArgSer: 1.724 ± 0.328
2.665ArgThr: 2.665 ± 0.411
2.351ArgVal: 2.351 ± 0.392
0.392ArgTrp: 0.392 ± 0.179
2.508ArgTyr: 2.508 ± 0.629
0.0ArgXaa: 0.0 ± 0.0
Ser
3.605SerAla: 3.605 ± 0.9
0.235SerCys: 0.235 ± 0.139
5.251SerAsp: 5.251 ± 0.649
4.703SerGlu: 4.703 ± 0.626
2.822SerPhe: 2.822 ± 0.508
4.311SerGly: 4.311 ± 0.516
0.705SerHis: 0.705 ± 0.196
3.997SerIle: 3.997 ± 0.552
6.897SerLys: 6.897 ± 1.317
3.997SerLeu: 3.997 ± 0.408
1.881SerMet: 1.881 ± 0.403
4.781SerAsn: 4.781 ± 0.622
1.646SerPro: 1.646 ± 0.436
2.9SerGln: 2.9 ± 0.498
2.508SerArg: 2.508 ± 0.431
3.919SerSer: 3.919 ± 0.781
3.449SerThr: 3.449 ± 0.547
4.154SerVal: 4.154 ± 0.651
0.941SerTrp: 0.941 ± 0.255
2.116SerTyr: 2.116 ± 0.431
0.0SerXaa: 0.0 ± 0.0
Thr
3.527ThrAla: 3.527 ± 0.568
0.235ThrCys: 0.235 ± 0.144
3.919ThrAsp: 3.919 ± 0.454
3.527ThrGlu: 3.527 ± 0.541
2.665ThrPhe: 2.665 ± 0.383
4.154ThrGly: 4.154 ± 0.568
1.803ThrHis: 1.803 ± 0.461
4.467ThrIle: 4.467 ± 0.565
5.408ThrLys: 5.408 ± 0.664
4.311ThrLeu: 4.311 ± 0.494
0.784ThrMet: 0.784 ± 0.235
3.605ThrAsn: 3.605 ± 0.509
2.586ThrPro: 2.586 ± 0.322
1.724ThrGln: 1.724 ± 0.285
1.881ThrArg: 1.881 ± 0.318
3.84ThrSer: 3.84 ± 0.546
3.213ThrThr: 3.213 ± 0.514
4.389ThrVal: 4.389 ± 0.556
0.392ThrTrp: 0.392 ± 0.205
2.43ThrTyr: 2.43 ± 0.447
0.0ThrXaa: 0.0 ± 0.0
Val
3.213ValAla: 3.213 ± 0.553
0.235ValCys: 0.235 ± 0.138
3.997ValAsp: 3.997 ± 0.646
5.173ValGlu: 5.173 ± 0.546
2.351ValPhe: 2.351 ± 0.427
3.527ValGly: 3.527 ± 0.532
0.941ValHis: 0.941 ± 0.216
3.605ValIle: 3.605 ± 0.438
4.546ValLys: 4.546 ± 0.698
4.467ValLeu: 4.467 ± 0.474
1.332ValMet: 1.332 ± 0.277
3.762ValAsn: 3.762 ± 0.505
1.568ValPro: 1.568 ± 0.354
2.273ValGln: 2.273 ± 0.359
2.116ValArg: 2.116 ± 0.281
4.232ValSer: 4.232 ± 0.654
3.135ValThr: 3.135 ± 0.711
2.978ValVal: 2.978 ± 0.539
0.549ValTrp: 0.549 ± 0.202
1.803ValTyr: 1.803 ± 0.473
0.0ValXaa: 0.0 ± 0.0
Trp
0.549TrpAla: 0.549 ± 0.199
0.0TrpCys: 0.0 ± 0.0
0.549TrpAsp: 0.549 ± 0.262
0.784TrpGlu: 0.784 ± 0.344
1.332TrpPhe: 1.332 ± 0.355
0.784TrpGly: 0.784 ± 0.254
0.0TrpHis: 0.0 ± 0.0
0.941TrpIle: 0.941 ± 0.31
0.705TrpLys: 0.705 ± 0.213
1.176TrpLeu: 1.176 ± 0.231
0.392TrpMet: 0.392 ± 0.181
1.019TrpAsn: 1.019 ± 0.281
0.392TrpPro: 0.392 ± 0.246
0.549TrpGln: 0.549 ± 0.182
0.47TrpArg: 0.47 ± 0.185
1.097TrpSer: 1.097 ± 0.324
0.705TrpThr: 0.705 ± 0.197
0.862TrpVal: 0.862 ± 0.214
0.157TrpTrp: 0.157 ± 0.131
0.47TrpTyr: 0.47 ± 0.177
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.43TyrAla: 2.43 ± 0.297
0.549TyrCys: 0.549 ± 0.234
2.586TyrAsp: 2.586 ± 0.661
2.665TyrGlu: 2.665 ± 0.533
1.332TyrPhe: 1.332 ± 0.379
3.213TyrGly: 3.213 ± 0.553
1.254TyrHis: 1.254 ± 0.377
2.586TyrIle: 2.586 ± 0.46
3.997TyrLys: 3.997 ± 0.521
3.213TyrLeu: 3.213 ± 0.536
1.489TyrMet: 1.489 ± 0.29
2.43TyrAsn: 2.43 ± 0.619
0.784TyrPro: 0.784 ± 0.237
2.038TyrGln: 2.038 ± 0.309
1.959TyrArg: 1.959 ± 0.47
2.978TyrSer: 2.978 ± 0.342
2.743TyrThr: 2.743 ± 0.529
2.586TyrVal: 2.586 ± 0.469
0.549TyrTrp: 0.549 ± 0.187
1.568TyrTyr: 1.568 ± 0.448
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (12760 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski