Amino acid dipeptide frequency for Staphylococcus phage CNPx

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
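The counting and the uniform baseline described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function names (`dipeptide_permille`, `uniform_expectation_permille`, `classify`) are hypothetical, sequences are assumed to use one-letter codes, pairs are assumed to be counted as overlapping windows within each protein (never across protein boundaries), and the normalization by the total number of pairs is an assumption, since the page does not state how the frequencies were normalized.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: overlapping pairs are counted inside each protein and the
    result is normalized by the total number of pairs.
    """
    allowed = set(alphabet)
    counts = Counter()
    for seq in proteins:
        # Any residue outside the standard alphabet is mapped to X (Xaa).
        seq = "".join(c if c in allowed else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    symbols = alphabet + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(symbols, repeat=2)
    }


def uniform_expectation_permille(n_symbols):
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / n_symbols ** 2


def classify(value, expected):
    """Colour rule from the text: red above expectation, blue below.

    Returning None for an exact tie is an assumption of this sketch.
    """
    if value > expected:
        return "red"
    if value < expected:
        return "blue"
    return None
```

For example, `uniform_expectation_permille(20)` gives 2.5, the 0.25% baseline quoted above, and a table value such as 8.459 (LysGlu) would be classified as `"red"` against that baseline.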

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.115 ± 0.394
AlaCys: 0.219 ± 0.127
AlaAsp: 2.188 ± 0.418
AlaGlu: 3.938 ± 0.634
AlaPhe: 2.042 ± 0.614
AlaGly: 3.354 ± 0.508
AlaHis: 1.458 ± 0.35
AlaIle: 4.813 ± 0.748
AlaLys: 5.25 ± 0.781
AlaLeu: 4.813 ± 0.761
AlaMet: 1.969 ± 0.381
AlaAsn: 4.375 ± 0.489
AlaPro: 1.313 ± 0.289
AlaGln: 2.406 ± 0.475
AlaArg: 1.75 ± 0.265
AlaSer: 3.719 ± 0.794
AlaThr: 2.99 ± 0.458
AlaVal: 2.552 ± 0.545
AlaTrp: 0.656 ± 0.204
AlaTyr: 1.969 ± 0.323
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.292 ± 0.113
CysCys: 0.0 ± 0.0
CysAsp: 0.073 ± 0.071
CysGlu: 0.219 ± 0.118
CysPhe: 0.219 ± 0.123
CysGly: 0.729 ± 0.247
CysHis: 0.219 ± 0.127
CysIle: 0.365 ± 0.185
CysLys: 0.656 ± 0.264
CysLeu: 0.365 ± 0.152
CysMet: 0.0 ± 0.0
CysAsn: 0.729 ± 0.189
CysPro: 0.365 ± 0.166
CysGln: 0.073 ± 0.073
CysArg: 0.073 ± 0.079
CysSer: 0.438 ± 0.21
CysThr: 0.219 ± 0.144
CysVal: 0.438 ± 0.188
CysTrp: 0.073 ± 0.07
CysTyr: 0.365 ± 0.174
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.354 ± 0.754
AspCys: 0.292 ± 0.161
AspAsp: 4.375 ± 0.744
AspGlu: 5.761 ± 0.732
AspPhe: 2.844 ± 0.473
AspGly: 4.084 ± 0.567
AspHis: 0.583 ± 0.23
AspIle: 5.323 ± 0.667
AspLys: 5.542 ± 0.648
AspLeu: 5.761 ± 0.666
AspMet: 1.677 ± 0.483
AspAsn: 4.375 ± 0.815
AspPro: 1.313 ± 0.303
AspGln: 0.948 ± 0.236
AspArg: 1.823 ± 0.343
AspSer: 3.136 ± 0.4
AspThr: 4.011 ± 0.56
AspVal: 3.427 ± 0.473
AspTrp: 0.729 ± 0.22
AspTyr: 3.063 ± 0.513
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.698 ± 0.409
GluCys: 0.438 ± 0.171
GluAsp: 3.573 ± 0.62
GluGlu: 5.469 ± 0.845
GluPhe: 2.698 ± 0.474
GluGly: 2.042 ± 0.457
GluHis: 1.458 ± 0.408
GluIle: 5.469 ± 0.72
GluLys: 6.417 ± 0.768
GluLeu: 8.022 ± 0.94
GluMet: 2.042 ± 0.407
GluAsn: 5.542 ± 0.733
GluPro: 1.823 ± 0.416
GluGln: 4.23 ± 0.626
GluArg: 4.74 ± 0.656
GluSer: 3.282 ± 0.435
GluThr: 3.938 ± 0.606
GluVal: 5.469 ± 0.768
GluTrp: 1.24 ± 0.288
GluTyr: 3.938 ± 0.499
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.969 ± 0.372
PheCys: 0.146 ± 0.093
PheAsp: 2.771 ± 0.341
PheGlu: 2.188 ± 0.437
PhePhe: 1.458 ± 0.325
PheGly: 2.115 ± 0.69
PheHis: 0.729 ± 0.201
PheIle: 2.844 ± 0.461
PheLys: 3.136 ± 0.443
PheLeu: 2.042 ± 0.428
PheMet: 1.386 ± 0.323
PheAsn: 3.5 ± 0.41
PhePro: 0.365 ± 0.17
PheGln: 1.386 ± 0.388
PheArg: 1.531 ± 0.377
PheSer: 1.896 ± 0.355
PheThr: 2.698 ± 0.548
PheVal: 2.917 ± 0.47
PheTrp: 0.292 ± 0.182
PheTyr: 2.188 ± 0.461
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.865 ± 0.624
GlyCys: 0.729 ± 0.207
GlyAsp: 2.771 ± 0.567
GlyGlu: 2.844 ± 0.476
GlyPhe: 2.334 ± 0.432
GlyGly: 3.354 ± 0.601
GlyHis: 1.458 ± 0.447
GlyIle: 5.25 ± 0.742
GlyLys: 5.323 ± 0.627
GlyLeu: 4.302 ± 0.589
GlyMet: 1.75 ± 0.36
GlyAsn: 3.354 ± 0.615
GlyPro: 0.656 ± 0.231
GlyGln: 2.917 ± 0.385
GlyArg: 2.917 ± 0.42
GlySer: 2.917 ± 0.647
GlyThr: 3.792 ± 0.58
GlyVal: 4.886 ± 0.605
GlyTrp: 0.948 ± 0.32
GlyTyr: 3.5 ± 0.67
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.313 ± 0.356
HisCys: 0.073 ± 0.062
HisAsp: 1.021 ± 0.312
HisGlu: 1.021 ± 0.232
HisPhe: 0.583 ± 0.181
HisGly: 1.021 ± 0.278
HisHis: 0.729 ± 0.271
HisIle: 0.875 ± 0.247
HisLys: 1.386 ± 0.313
HisLeu: 1.386 ± 0.32
HisMet: 0.146 ± 0.093
HisAsn: 1.386 ± 0.338
HisPro: 0.802 ± 0.195
HisGln: 0.656 ± 0.236
HisArg: 0.51 ± 0.22
HisSer: 1.167 ± 0.277
HisThr: 1.531 ± 0.341
HisVal: 1.386 ± 0.278
HisTrp: 0.073 ± 0.073
HisTyr: 0.656 ± 0.259
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.542 ± 0.707
IleCys: 0.219 ± 0.14
IleAsp: 5.178 ± 0.634
IleGlu: 6.782 ± 0.826
IlePhe: 2.042 ± 0.42
IleGly: 5.688 ± 0.641
IleHis: 0.51 ± 0.181
IleIle: 4.521 ± 0.608
IleLys: 8.751 ± 0.77
IleLeu: 3.938 ± 0.605
IleMet: 1.531 ± 0.454
IleAsn: 4.594 ± 0.471
IlePro: 2.261 ± 0.289
IleGln: 2.479 ± 0.414
IleArg: 2.261 ± 0.322
IleSer: 4.74 ± 0.791
IleThr: 5.25 ± 0.569
IleVal: 3.573 ± 0.647
IleTrp: 0.948 ± 0.286
IleTyr: 2.406 ± 0.439
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.448 ± 0.632
LysCys: 0.365 ± 0.171
LysAsp: 6.198 ± 0.567
LysGlu: 8.459 ± 1.165
LysPhe: 3.209 ± 0.442
LysGly: 5.396 ± 0.549
LysHis: 2.115 ± 0.337
LysIle: 5.615 ± 0.74
LysLys: 6.855 ± 1.097
LysLeu: 7.584 ± 0.734
LysMet: 2.406 ± 0.439
LysAsn: 5.688 ± 0.699
LysPro: 2.99 ± 0.788
LysGln: 5.323 ± 0.609
LysArg: 5.032 ± 0.661
LysSer: 4.594 ± 0.536
LysThr: 6.49 ± 0.628
LysVal: 5.688 ± 0.593
LysTrp: 1.167 ± 0.25
LysTyr: 3.938 ± 0.67
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.157 ± 0.755
LeuCys: 0.292 ± 0.236
LeuAsp: 5.98 ± 0.791
LeuGlu: 5.834 ± 0.639
LeuPhe: 3.209 ± 0.511
LeuGly: 4.302 ± 0.747
LeuHis: 1.24 ± 0.364
LeuIle: 5.25 ± 0.762
LeuLys: 7.438 ± 0.723
LeuLeu: 5.542 ± 0.803
LeuMet: 1.677 ± 0.305
LeuAsn: 5.615 ± 0.448
LeuPro: 2.552 ± 0.4
LeuGln: 3.792 ± 0.553
LeuArg: 3.5 ± 0.525
LeuSer: 4.011 ± 0.535
LeuThr: 4.23 ± 0.542
LeuVal: 3.573 ± 0.426
LeuTrp: 1.167 ± 0.293
LeuTyr: 3.427 ± 0.631
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.458 ± 0.456
MetCys: 0.146 ± 0.106
MetAsp: 1.021 ± 0.282
MetGlu: 1.677 ± 0.308
MetPhe: 0.802 ± 0.245
MetGly: 0.948 ± 0.269
MetHis: 0.146 ± 0.106
MetIle: 1.531 ± 0.291
MetLys: 2.99 ± 0.451
MetLeu: 2.698 ± 0.458
MetMet: 0.656 ± 0.254
MetAsn: 1.969 ± 0.363
MetPro: 1.094 ± 0.251
MetGln: 0.729 ± 0.273
MetArg: 0.948 ± 0.24
MetSer: 1.75 ± 0.376
MetThr: 1.823 ± 0.417
MetVal: 0.656 ± 0.199
MetTrp: 0.51 ± 0.181
MetTyr: 1.458 ± 0.391
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.938 ± 0.57
AsnCys: 0.802 ± 0.326
AsnAsp: 5.178 ± 0.671
AsnGlu: 4.521 ± 0.5
AsnPhe: 2.552 ± 0.393
AsnGly: 5.105 ± 0.655
AsnHis: 1.167 ± 0.354
AsnIle: 4.886 ± 0.713
AsnLys: 6.344 ± 0.766
AsnLeu: 4.375 ± 0.659
AsnMet: 0.583 ± 0.212
AsnAsn: 5.323 ± 0.76
AsnPro: 2.479 ± 0.48
AsnGln: 3.063 ± 0.571
AsnArg: 3.5 ± 0.602
AsnSer: 4.375 ± 0.447
AsnThr: 3.282 ± 0.611
AsnVal: 5.178 ± 0.602
AsnTrp: 0.875 ± 0.272
AsnTyr: 2.552 ± 0.449
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.094 ± 0.275
ProCys: 0.146 ± 0.109
ProAsp: 1.896 ± 0.411
ProGlu: 2.261 ± 0.396
ProPhe: 1.677 ± 0.444
ProGly: 1.24 ± 0.344
ProHis: 0.438 ± 0.142
ProIle: 2.698 ± 0.516
ProLys: 3.136 ± 0.525
ProLeu: 1.386 ± 0.338
ProMet: 0.583 ± 0.214
ProAsn: 1.823 ± 0.432
ProPro: 1.094 ± 0.327
ProGln: 0.875 ± 0.278
ProArg: 1.75 ± 0.371
ProSer: 1.823 ± 0.431
ProThr: 1.313 ± 0.315
ProVal: 1.75 ± 0.382
ProTrp: 0.146 ± 0.099
ProTyr: 1.458 ± 0.282
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.552 ± 0.529
GlnCys: 0.365 ± 0.138
GlnAsp: 2.115 ± 0.406
GlnGlu: 2.771 ± 0.434
GlnPhe: 0.948 ± 0.234
GlnGly: 2.479 ± 0.414
GlnHis: 0.729 ± 0.258
GlnIle: 3.427 ± 0.512
GlnLys: 4.375 ± 0.723
GlnLeu: 2.99 ± 0.421
GlnMet: 1.167 ± 0.264
GlnAsn: 3.209 ± 0.527
GlnPro: 1.24 ± 0.253
GlnGln: 2.042 ± 0.403
GlnArg: 2.115 ± 0.281
GlnSer: 2.261 ± 0.343
GlnThr: 2.261 ± 0.537
GlnVal: 1.75 ± 0.406
GlnTrp: 0.438 ± 0.203
GlnTyr: 1.531 ± 0.339
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.334 ± 0.514
ArgCys: 0.365 ± 0.16
ArgAsp: 2.99 ± 0.426
ArgGlu: 3.209 ± 0.511
ArgPhe: 1.458 ± 0.292
ArgGly: 2.188 ± 0.424
ArgHis: 0.875 ± 0.242
ArgIle: 2.844 ± 0.425
ArgLys: 4.157 ± 0.686
ArgLeu: 3.865 ± 0.481
ArgMet: 1.677 ± 0.313
ArgAsn: 3.282 ± 0.473
ArgPro: 1.167 ± 0.301
ArgGln: 0.948 ± 0.235
ArgArg: 2.261 ± 0.492
ArgSer: 2.261 ± 0.465
ArgThr: 1.531 ± 0.35
ArgVal: 3.427 ± 0.485
ArgTrp: 0.875 ± 0.227
ArgTyr: 2.771 ± 0.471
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.282 ± 0.623
SerCys: 0.219 ± 0.129
SerAsp: 3.282 ± 0.471
SerGlu: 3.646 ± 0.464
SerPhe: 2.334 ± 0.412
SerGly: 4.448 ± 0.674
SerHis: 1.167 ± 0.297
SerIle: 3.5 ± 0.515
SerLys: 4.448 ± 0.692
SerLeu: 4.667 ± 0.576
SerMet: 1.386 ± 0.316
SerAsn: 4.157 ± 0.729
SerPro: 1.167 ± 0.273
SerGln: 2.406 ± 0.55
SerArg: 2.552 ± 0.428
SerSer: 2.625 ± 0.385
SerThr: 3.136 ± 0.423
SerVal: 3.719 ± 0.451
SerTrp: 1.167 ± 0.29
SerTyr: 2.552 ± 0.416
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.354 ± 0.598
ThrCys: 0.292 ± 0.142
ThrAsp: 3.865 ± 0.486
ThrGlu: 3.063 ± 0.495
ThrPhe: 3.063 ± 0.572
ThrGly: 3.5 ± 0.541
ThrHis: 1.021 ± 0.246
ThrIle: 5.542 ± 0.531
ThrLys: 6.782 ± 0.752
ThrLeu: 3.865 ± 0.496
ThrMet: 1.167 ± 0.261
ThrAsn: 3.719 ± 0.558
ThrPro: 2.844 ± 0.486
ThrGln: 2.406 ± 0.415
ThrArg: 2.042 ± 0.522
ThrSer: 3.5 ± 0.486
ThrThr: 4.302 ± 0.65
ThrVal: 3.427 ± 0.781
ThrTrp: 0.438 ± 0.169
ThrTyr: 1.969 ± 0.401
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.5 ± 0.711
ValCys: 0.219 ± 0.138
ValAsp: 5.032 ± 0.577
ValGlu: 5.178 ± 0.681
ValPhe: 2.115 ± 0.513
ValGly: 3.865 ± 0.41
ValHis: 0.438 ± 0.187
ValIle: 3.938 ± 0.569
ValLys: 5.542 ± 0.711
ValLeu: 4.375 ± 0.654
ValMet: 1.604 ± 0.31
ValAsn: 3.427 ± 0.47
ValPro: 1.823 ± 0.381
ValGln: 2.115 ± 0.316
ValArg: 2.771 ± 0.389
ValSer: 4.959 ± 0.571
ValThr: 3.719 ± 0.486
ValVal: 5.178 ± 0.565
ValTrp: 0.365 ± 0.233
ValTyr: 2.261 ± 0.459
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.729 ± 0.228
TrpCys: 0.219 ± 0.115
TrpAsp: 0.365 ± 0.206
TrpGlu: 1.531 ± 0.35
TrpPhe: 0.365 ± 0.163
TrpGly: 0.802 ± 0.229
TrpHis: 0.219 ± 0.118
TrpIle: 0.875 ± 0.274
TrpLys: 0.875 ± 0.239
TrpLeu: 1.167 ± 0.315
TrpMet: 0.0 ± 0.0
TrpAsn: 0.729 ± 0.315
TrpPro: 0.146 ± 0.082
TrpGln: 0.438 ± 0.182
TrpArg: 0.438 ± 0.15
TrpSer: 0.656 ± 0.22
TrpThr: 1.531 ± 0.273
TrpVal: 1.167 ± 0.307
TrpTrp: 0.219 ± 0.178
TrpTyr: 0.583 ± 0.198
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.896 ± 0.413
TyrCys: 0.365 ± 0.179
TyrAsp: 2.552 ± 0.365
TyrGlu: 4.157 ± 0.678
TyrPhe: 1.604 ± 0.336
TyrGly: 3.209 ± 0.826
TyrHis: 1.021 ± 0.273
TyrIle: 3.646 ± 0.561
TyrLys: 4.157 ± 0.596
TyrLeu: 3.646 ± 0.425
TyrMet: 1.531 ± 0.326
TyrAsn: 3.282 ± 0.551
TyrPro: 1.094 ± 0.3
TyrGln: 1.531 ± 0.332
TyrArg: 1.969 ± 0.457
TyrSer: 1.75 ± 0.32
TyrThr: 2.115 ± 0.329
TyrVal: 2.334 ± 0.277
TyrTrp: 0.729 ± 0.239
TyrTyr: 1.531 ± 0.34
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (13714 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
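A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration rather than the procedure actually used for the table: the function names are hypothetical, whole proteins are resampled with replacement, and the reported error is assumed here to be the standard deviation across resamples, which the note does not specify.

```python
import random
import statistics


def pair_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Assumption: the error is the sample standard deviation of the
    per-resample frequencies; the seed only makes the sketch repeatable.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, dipeptide))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much a frequency depends on which proteins happen to be in the proteome.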

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski