Amino acid dipeptide frequency for Staphylococcus phage phiPVL-CN125

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
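The calculation described above can be sketched in Python as follows. This is only an illustrative sketch, not the code used by the database. The function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to be given in one-letter code, and two choices are assumptions rather than documented behaviour: normalizing by the total number of dipeptides counted, and comparing against the uniform expectation of 2.5‰.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences.

    Assumption: frequencies are normalized by the total number of
    dipeptides counted; the database may use a different denominator.
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the expected value (hypothetical rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input: two short made-up sequences, not taken from the proteome.
freqs = dipeptide_permille(["MKKLA", "MKL"])
```

With the values from the table, `classify(8.3)` (LysLeu) falls above the uniform expectation and `classify(1.992)` (AlaAla) falls below it.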

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.992 ± 0.87
AlaCys: 0.166 ± 0.103
AlaAsp: 1.494 ± 0.314
AlaGlu: 2.49 ± 0.427
AlaPhe: 2.324 ± 0.668
AlaGly: 3.984 ± 0.805
AlaHis: 0.498 ± 0.171
AlaIle: 4.814 ± 0.602
AlaLys: 4.648 ± 0.73
AlaLeu: 4.316 ± 0.79
AlaMet: 1.909 ± 0.37
AlaAsn: 3.486 ± 0.575
AlaPro: 1.328 ± 0.289
AlaGln: 2.075 ± 0.493
AlaArg: 2.656 ± 0.455
AlaSer: 3.154 ± 0.666
AlaThr: 4.316 ± 0.591
AlaVal: 3.237 ± 0.479
AlaTrp: 0.166 ± 0.1
AlaTyr: 1.992 ± 0.494
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.249 ± 0.146
CysCys: 0.0 ± 0.0
CysAsp: 0.249 ± 0.175
CysGlu: 0.498 ± 0.234
CysPhe: 0.166 ± 0.126
CysGly: 0.332 ± 0.191
CysHis: 0.166 ± 0.116
CysIle: 0.747 ± 0.233
CysLys: 0.332 ± 0.146
CysLeu: 0.581 ± 0.19
CysMet: 0.166 ± 0.149
CysAsn: 0.415 ± 0.183
CysPro: 0.0 ± 0.0
CysGln: 0.166 ± 0.121
CysArg: 0.498 ± 0.24
CysSer: 0.664 ± 0.281
CysThr: 0.083 ± 0.071
CysVal: 0.498 ± 0.197
CysTrp: 0.0 ± 0.0
CysTyr: 0.415 ± 0.184
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.822 ± 0.473
AspCys: 0.664 ± 0.246
AspAsp: 3.403 ± 0.615
AspGlu: 5.312 ± 0.827
AspPhe: 2.905 ± 0.454
AspGly: 2.988 ± 0.408
AspHis: 0.913 ± 0.261
AspIle: 4.316 ± 0.659
AspLys: 5.976 ± 0.794
AspLeu: 5.893 ± 0.798
AspMet: 1.743 ± 0.343
AspAsn: 3.403 ± 0.539
AspPro: 1.494 ± 0.407
AspGln: 0.913 ± 0.242
AspArg: 2.158 ± 0.462
AspSer: 3.486 ± 0.556
AspThr: 4.316 ± 0.606
AspVal: 3.32 ± 0.55
AspTrp: 0.581 ± 0.234
AspTyr: 3.071 ± 0.441
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.237 ± 0.527
GluCys: 0.83 ± 0.247
GluAsp: 3.32 ± 0.595
GluGlu: 5.727 ± 1.003
GluPhe: 2.739 ± 0.401
GluGly: 3.154 ± 0.55
GluHis: 1.162 ± 0.287
GluIle: 6.806 ± 0.945
GluLys: 6.474 ± 0.871
GluLeu: 6.64 ± 0.732
GluMet: 2.988 ± 0.477
GluAsn: 4.731 ± 0.624
GluPro: 1.577 ± 0.309
GluGln: 3.735 ± 0.807
GluArg: 3.071 ± 0.521
GluSer: 3.735 ± 0.538
GluThr: 3.652 ± 0.574
GluVal: 4.067 ± 0.589
GluTrp: 0.83 ± 0.287
GluTyr: 3.569 ± 0.633
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.075 ± 0.392
PheCys: 0.166 ± 0.113
PheAsp: 2.49 ± 0.425
PheGlu: 2.324 ± 0.464
PhePhe: 1.079 ± 0.283
PheGly: 2.905 ± 0.515
PheHis: 1.162 ± 0.304
PheIle: 3.652 ± 0.64
PheLys: 3.486 ± 0.52
PheLeu: 2.822 ± 0.577
PheMet: 1.66 ± 0.361
PheAsn: 3.901 ± 0.555
PhePro: 0.913 ± 0.28
PheGln: 1.162 ± 0.268
PheArg: 1.577 ± 0.307
PheSer: 2.49 ± 0.524
PheThr: 1.909 ± 0.459
PheVal: 2.407 ± 0.49
PheTrp: 0.166 ± 0.105
PheTyr: 1.411 ± 0.31
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.071 ± 0.765
GlyCys: 0.249 ± 0.157
GlyAsp: 3.154 ± 0.423
GlyGlu: 3.403 ± 0.478
GlyPhe: 2.822 ± 0.504
GlyGly: 3.735 ± 0.687
GlyHis: 1.494 ± 0.37
GlyIle: 3.901 ± 0.915
GlyLys: 5.644 ± 0.954
GlyLeu: 4.98 ± 1.003
GlyMet: 1.245 ± 0.332
GlyAsn: 4.15 ± 0.659
GlyPro: 0.913 ± 0.341
GlyGln: 2.241 ± 0.476
GlyArg: 2.573 ± 0.512
GlySer: 3.569 ± 0.46
GlyThr: 3.237 ± 0.446
GlyVal: 4.15 ± 0.581
GlyTrp: 1.162 ± 0.299
GlyTyr: 3.652 ± 0.664
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.913 ± 0.291
HisCys: 0.166 ± 0.131
HisAsp: 1.079 ± 0.304
HisGlu: 1.577 ± 0.379
HisPhe: 0.747 ± 0.231
HisGly: 0.664 ± 0.309
HisHis: 0.166 ± 0.157
HisIle: 1.909 ± 0.42
HisLys: 1.245 ± 0.389
HisLeu: 1.245 ± 0.341
HisMet: 0.332 ± 0.179
HisAsn: 1.328 ± 0.319
HisPro: 0.249 ± 0.173
HisGln: 0.83 ± 0.22
HisArg: 0.747 ± 0.211
HisSer: 1.411 ± 0.325
HisThr: 1.411 ± 0.359
HisVal: 1.079 ± 0.405
HisTrp: 0.332 ± 0.166
HisTyr: 0.913 ± 0.27
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.565 ± 0.584
IleCys: 0.249 ± 0.137
IleAsp: 4.897 ± 0.574
IleGlu: 6.972 ± 0.803
IlePhe: 3.154 ± 0.56
IleGly: 3.984 ± 0.585
IleHis: 1.328 ± 0.335
IleIle: 5.312 ± 0.918
IleLys: 8.134 ± 0.77
IleLeu: 4.648 ± 0.638
IleMet: 1.66 ± 0.347
IleAsn: 6.64 ± 0.992
IlePro: 1.66 ± 0.27
IleGln: 3.071 ± 0.498
IleArg: 2.822 ± 0.657
IleSer: 5.561 ± 0.667
IleThr: 5.561 ± 0.496
IleVal: 4.648 ± 0.562
IleTrp: 1.079 ± 0.43
IleTyr: 3.071 ± 0.604
IleXaa: 0.083 ± 0.069
Lys
LysAla: 5.976 ± 0.72
LysCys: 0.415 ± 0.231
LysAsp: 5.727 ± 0.608
LysGlu: 6.308 ± 0.839
LysPhe: 4.399 ± 0.75
LysGly: 5.727 ± 0.882
LysHis: 1.577 ± 0.306
LysIle: 6.474 ± 0.818
LysLys: 7.802 ± 0.845
LysLeu: 8.3 ± 1.016
LysMet: 2.324 ± 0.539
LysAsn: 5.063 ± 0.878
LysPro: 3.071 ± 0.491
LysGln: 4.897 ± 0.8
LysArg: 4.233 ± 0.624
LysSer: 4.731 ± 0.729
LysThr: 6.557 ± 0.941
LysVal: 5.063 ± 0.693
LysTrp: 1.411 ± 0.332
LysTyr: 4.98 ± 0.662
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.818 ± 0.587
LeuCys: 0.498 ± 0.191
LeuAsp: 5.229 ± 0.737
LeuGlu: 6.391 ± 0.863
LeuPhe: 2.988 ± 0.523
LeuGly: 4.565 ± 0.868
LeuHis: 0.996 ± 0.222
LeuIle: 5.229 ± 0.64
LeuLys: 8.134 ± 0.881
LeuLeu: 5.644 ± 0.982
LeuMet: 1.079 ± 0.318
LeuAsn: 5.312 ± 0.657
LeuPro: 2.49 ± 0.361
LeuGln: 3.154 ± 0.436
LeuArg: 3.237 ± 0.537
LeuSer: 5.81 ± 0.667
LeuThr: 4.482 ± 0.686
LeuVal: 4.399 ± 0.628
LeuTrp: 0.83 ± 0.246
LeuTyr: 3.071 ± 0.566
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.079 ± 0.328
MetCys: 0.166 ± 0.122
MetAsp: 1.66 ± 0.311
MetGlu: 1.743 ± 0.359
MetPhe: 1.245 ± 0.257
MetGly: 0.747 ± 0.337
MetHis: 0.415 ± 0.183
MetIle: 1.826 ± 0.319
MetLys: 2.905 ± 0.5
MetLeu: 1.743 ± 0.426
MetMet: 0.747 ± 0.264
MetAsn: 2.324 ± 0.529
MetPro: 1.328 ± 0.362
MetGln: 1.245 ± 0.32
MetArg: 1.162 ± 0.391
MetSer: 2.573 ± 0.404
MetThr: 1.411 ± 0.304
MetVal: 1.245 ± 0.336
MetTrp: 0.332 ± 0.126
MetTyr: 0.996 ± 0.293
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.656 ± 0.574
AsnCys: 0.415 ± 0.187
AsnAsp: 3.403 ± 0.589
AsnGlu: 5.146 ± 0.804
AsnPhe: 2.075 ± 0.526
AsnGly: 5.644 ± 0.793
AsnHis: 0.996 ± 0.299
AsnIle: 5.727 ± 0.821
AsnLys: 6.142 ± 0.972
AsnLeu: 5.312 ± 0.952
AsnMet: 1.909 ± 0.308
AsnAsn: 4.98 ± 0.815
AsnPro: 2.822 ± 0.42
AsnGln: 3.569 ± 0.518
AsnArg: 2.739 ± 0.536
AsnSer: 3.569 ± 0.554
AsnThr: 3.071 ± 0.469
AsnVal: 4.067 ± 0.638
AsnTrp: 1.079 ± 0.386
AsnTyr: 3.071 ± 0.489
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.162 ± 0.283
ProCys: 0.0 ± 0.0
ProAsp: 1.411 ± 0.358
ProGlu: 1.494 ± 0.364
ProPhe: 0.996 ± 0.298
ProGly: 1.328 ± 0.324
ProHis: 0.581 ± 0.263
ProIle: 2.49 ± 0.393
ProLys: 2.573 ± 0.583
ProLeu: 2.49 ± 0.525
ProMet: 0.996 ± 0.293
ProAsn: 2.075 ± 0.371
ProPro: 0.83 ± 0.2
ProGln: 0.747 ± 0.236
ProArg: 0.913 ± 0.255
ProSer: 1.743 ± 0.363
ProThr: 1.66 ± 0.44
ProVal: 1.162 ± 0.264
ProTrp: 0.166 ± 0.093
ProTyr: 1.162 ± 0.309
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.237 ± 0.505
GlnCys: 0.415 ± 0.214
GlnAsp: 2.241 ± 0.516
GlnGlu: 2.905 ± 0.573
GlnPhe: 1.411 ± 0.295
GlnGly: 1.826 ± 0.372
GlnHis: 0.913 ± 0.277
GlnIle: 3.154 ± 0.606
GlnLys: 3.901 ± 0.66
GlnLeu: 2.656 ± 0.452
GlnMet: 0.996 ± 0.301
GlnAsn: 2.822 ± 0.517
GlnPro: 0.913 ± 0.299
GlnGln: 1.411 ± 0.341
GlnArg: 1.992 ± 0.397
GlnSer: 2.407 ± 0.524
GlnThr: 2.241 ± 0.384
GlnVal: 1.992 ± 0.463
GlnTrp: 0.415 ± 0.198
GlnTyr: 2.407 ± 0.464
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.158 ± 0.455
ArgCys: 0.249 ± 0.2
ArgAsp: 3.486 ± 0.447
ArgGlu: 3.901 ± 0.475
ArgPhe: 1.328 ± 0.333
ArgGly: 2.075 ± 0.351
ArgHis: 1.328 ± 0.315
ArgIle: 3.735 ± 0.57
ArgLys: 4.15 ± 0.54
ArgLeu: 2.905 ± 0.436
ArgMet: 0.913 ± 0.304
ArgAsn: 2.49 ± 0.403
ArgPro: 0.83 ± 0.233
ArgGln: 1.66 ± 0.415
ArgArg: 1.992 ± 0.436
ArgSer: 1.826 ± 0.361
ArgThr: 2.241 ± 0.435
ArgVal: 2.241 ± 0.413
ArgTrp: 0.498 ± 0.178
ArgTyr: 2.158 ± 0.401
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.237 ± 0.7
SerCys: 0.249 ± 0.123
SerAsp: 4.648 ± 0.806
SerGlu: 4.482 ± 0.619
SerPhe: 2.988 ± 0.547
SerGly: 4.15 ± 0.645
SerHis: 1.328 ± 0.32
SerIle: 5.81 ± 0.647
SerLys: 5.561 ± 0.723
SerLeu: 3.901 ± 0.517
SerMet: 1.328 ± 0.301
SerAsn: 4.731 ± 0.743
SerPro: 0.913 ± 0.272
SerGln: 2.573 ± 0.489
SerArg: 2.822 ± 0.469
SerSer: 4.233 ± 0.704
SerThr: 2.905 ± 0.358
SerVal: 3.071 ± 0.526
SerTrp: 0.498 ± 0.2
SerTyr: 3.237 ± 0.762
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.071 ± 0.463
ThrCys: 0.332 ± 0.177
ThrAsp: 4.316 ± 0.561
ThrGlu: 3.486 ± 0.622
ThrPhe: 1.909 ± 0.357
ThrGly: 4.233 ± 0.754
ThrHis: 1.494 ± 0.39
ThrIle: 5.312 ± 0.777
ThrLys: 5.727 ± 0.818
ThrLeu: 4.233 ± 0.549
ThrMet: 1.245 ± 0.316
ThrAsn: 2.905 ± 0.495
ThrPro: 1.909 ± 0.418
ThrGln: 1.66 ± 0.318
ThrArg: 2.573 ± 0.502
ThrSer: 5.063 ± 0.852
ThrThr: 3.486 ± 0.672
ThrVal: 3.901 ± 0.501
ThrTrp: 0.83 ± 0.232
ThrTyr: 3.071 ± 0.521
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.154 ± 0.539
ValCys: 0.498 ± 0.213
ValAsp: 3.818 ± 0.646
ValGlu: 3.901 ± 0.667
ValPhe: 1.992 ± 0.393
ValGly: 3.901 ± 0.792
ValHis: 0.498 ± 0.209
ValIle: 3.818 ± 0.621
ValLys: 6.806 ± 0.641
ValLeu: 4.648 ± 0.552
ValMet: 1.743 ± 0.4
ValAsn: 3.154 ± 0.474
ValPro: 1.494 ± 0.313
ValGln: 1.992 ± 0.39
ValArg: 1.826 ± 0.463
ValSer: 3.569 ± 0.524
ValThr: 5.229 ± 0.777
ValVal: 4.565 ± 0.603
ValTrp: 0.249 ± 0.126
ValTyr: 1.245 ± 0.329
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.332 ± 0.191
TrpCys: 0.0 ± 0.0
TrpAsp: 0.83 ± 0.279
TrpGlu: 0.664 ± 0.212
TrpPhe: 0.83 ± 0.24
TrpGly: 0.83 ± 0.255
TrpHis: 0.0 ± 0.0
TrpIle: 1.079 ± 0.247
TrpLys: 0.664 ± 0.222
TrpLeu: 0.83 ± 0.281
TrpMet: 0.664 ± 0.217
TrpAsn: 1.245 ± 0.397
TrpPro: 0.249 ± 0.118
TrpGln: 0.498 ± 0.176
TrpArg: 0.415 ± 0.158
TrpSer: 0.332 ± 0.148
TrpThr: 0.581 ± 0.159
TrpVal: 0.913 ± 0.286
TrpTrp: 0.249 ± 0.147
TrpTyr: 0.332 ± 0.144
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.241 ± 0.383
TyrCys: 0.498 ± 0.196
TyrAsp: 2.822 ± 0.58
TyrGlu: 3.32 ± 0.521
TyrPhe: 1.66 ± 0.403
TyrGly: 2.573 ± 0.522
TyrHis: 1.328 ± 0.3
TyrIle: 2.988 ± 0.527
TyrLys: 4.731 ± 0.601
TyrLeu: 3.569 ± 0.553
TyrMet: 1.162 ± 0.265
TyrAsn: 3.237 ± 0.581
TyrPro: 0.996 ± 0.237
TyrGln: 2.656 ± 0.441
TyrArg: 2.158 ± 0.482
TyrSer: 2.822 ± 0.61
TyrThr: 2.324 ± 0.437
TyrVal: 2.075 ± 0.314
TyrTrp: 0.664 ± 0.198
TyrTyr: 1.411 ± 0.389
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.083 ± 0.069
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (12049 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
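A protein-level bootstrap of the kind mentioned in the note can be sketched as below. This is a minimal illustration under stated assumptions, not the database's implementation. The names `pair_counts` and `bootstrap_error` are hypothetical, sequences are assumed to be in one-letter code, whole proteins are resampled with replacement, and the error is taken as the sample standard deviation of the resampled frequencies. The source does not state which spread measure it reports.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the error (per mille) of one dipeptide frequency.

    Assumptions: proteins are the resampling unit, and the error is the
    sample standard deviation over n_rounds resampled proteomes.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input: made-up sequences, not taken from the proteome.
toy = ["MKKLA", "MKL", "AAKK"]
err_kk = bootstrap_error(toy, "KK")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs of one protein together, so the estimate reflects variation between proteins.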

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski