Amino acid dipepetide frequency for Staphylococcus phage P630

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.855AlaAla: 1.855 ± 0.603
0.403AlaCys: 0.403 ± 0.195
3.226AlaAsp: 3.226 ± 0.413
3.951AlaGlu: 3.951 ± 0.585
2.661AlaPhe: 2.661 ± 0.519
3.145AlaGly: 3.145 ± 0.43
0.564AlaHis: 0.564 ± 0.235
5.806AlaIle: 5.806 ± 0.634
4.758AlaLys: 4.758 ± 0.51
5.161AlaLeu: 5.161 ± 0.669
1.532AlaMet: 1.532 ± 0.369
3.064AlaAsn: 3.064 ± 0.43
1.693AlaPro: 1.693 ± 0.341
2.822AlaGln: 2.822 ± 0.623
2.661AlaArg: 2.661 ± 0.429
3.226AlaSer: 3.226 ± 0.496
3.226AlaThr: 3.226 ± 0.733
3.467AlaVal: 3.467 ± 0.614
0.564AlaTrp: 0.564 ± 0.236
2.661AlaTyr: 2.661 ± 0.427
0.0AlaXaa: 0.0 ± 0.0
Cys
0.403CysAla: 0.403 ± 0.202
0.0CysCys: 0.0 ± 0.0
0.081CysAsp: 0.081 ± 0.083
0.403CysGlu: 0.403 ± 0.222
0.242CysPhe: 0.242 ± 0.188
0.403CysGly: 0.403 ± 0.191
0.081CysHis: 0.081 ± 0.088
0.968CysIle: 0.968 ± 0.235
0.242CysLys: 0.242 ± 0.142
0.323CysLeu: 0.323 ± 0.158
0.081CysMet: 0.081 ± 0.083
0.242CysAsn: 0.242 ± 0.12
0.242CysPro: 0.242 ± 0.126
0.242CysGln: 0.242 ± 0.167
0.242CysArg: 0.242 ± 0.137
0.242CysSer: 0.242 ± 0.135
0.323CysThr: 0.323 ± 0.145
0.242CysVal: 0.242 ± 0.125
0.081CysTrp: 0.081 ± 0.084
0.323CysTyr: 0.323 ± 0.153
0.0CysXaa: 0.0 ± 0.0
Asp
2.742AspAla: 2.742 ± 0.456
0.403AspCys: 0.403 ± 0.193
4.193AspAsp: 4.193 ± 0.81
5.806AspGlu: 5.806 ± 0.978
4.193AspPhe: 4.193 ± 0.624
5.403AspGly: 5.403 ± 0.679
0.726AspHis: 0.726 ± 0.25
5.403AspIle: 5.403 ± 0.616
4.596AspLys: 4.596 ± 0.589
5.887AspLeu: 5.887 ± 0.75
1.451AspMet: 1.451 ± 0.357
3.951AspAsn: 3.951 ± 0.662
1.693AspPro: 1.693 ± 0.36
1.613AspGln: 1.613 ± 0.35
1.774AspArg: 1.774 ± 0.462
3.467AspSer: 3.467 ± 0.541
2.58AspThr: 2.58 ± 0.674
4.274AspVal: 4.274 ± 0.591
0.564AspTrp: 0.564 ± 0.212
3.306AspTyr: 3.306 ± 0.508
0.0AspXaa: 0.0 ± 0.0
Glu
5.242GluAla: 5.242 ± 0.876
0.645GluCys: 0.645 ± 0.208
3.629GluAsp: 3.629 ± 0.543
6.209GluGlu: 6.209 ± 0.819
2.984GluPhe: 2.984 ± 0.552
2.258GluGly: 2.258 ± 0.387
1.371GluHis: 1.371 ± 0.346
6.129GluIle: 6.129 ± 0.865
7.58GluLys: 7.58 ± 1.061
8.79GluLeu: 8.79 ± 1.059
2.5GluMet: 2.5 ± 0.45
4.919GluAsn: 4.919 ± 0.527
1.29GluPro: 1.29 ± 0.238
2.661GluGln: 2.661 ± 0.415
4.435GluArg: 4.435 ± 0.631
4.274GluSer: 4.274 ± 0.621
4.354GluThr: 4.354 ± 0.97
4.193GluVal: 4.193 ± 0.515
1.129GluTrp: 1.129 ± 0.28
4.032GluTyr: 4.032 ± 0.738
0.0GluXaa: 0.0 ± 0.0
Phe
2.339PheAla: 2.339 ± 0.483
0.161PheCys: 0.161 ± 0.117
2.822PheAsp: 2.822 ± 0.441
3.548PheGlu: 3.548 ± 0.614
1.048PhePhe: 1.048 ± 0.264
2.177PheGly: 2.177 ± 0.308
0.564PheHis: 0.564 ± 0.19
3.709PheIle: 3.709 ± 0.51
4.758PheLys: 4.758 ± 0.543
3.226PheLeu: 3.226 ± 0.545
1.371PheMet: 1.371 ± 0.391
3.226PheAsn: 3.226 ± 0.52
0.887PhePro: 0.887 ± 0.276
0.484PheGln: 0.484 ± 0.158
1.774PheArg: 1.774 ± 0.337
3.387PheSer: 3.387 ± 0.72
2.742PheThr: 2.742 ± 0.463
2.258PheVal: 2.258 ± 0.452
0.161PheTrp: 0.161 ± 0.109
1.532PheTyr: 1.532 ± 0.415
0.0PheXaa: 0.0 ± 0.0
Gly
3.306GlyAla: 3.306 ± 0.556
0.403GlyCys: 0.403 ± 0.202
3.548GlyAsp: 3.548 ± 0.686
3.629GlyGlu: 3.629 ± 0.767
2.742GlyPhe: 2.742 ± 0.54
3.951GlyGly: 3.951 ± 1.001
1.371GlyHis: 1.371 ± 0.326
4.032GlyIle: 4.032 ± 0.858
6.612GlyLys: 6.612 ± 0.685
5.242GlyLeu: 5.242 ± 0.818
1.048GlyMet: 1.048 ± 0.286
2.822GlyAsn: 2.822 ± 0.424
1.129GlyPro: 1.129 ± 0.344
1.451GlyGln: 1.451 ± 0.369
2.58GlyArg: 2.58 ± 0.57
2.258GlySer: 2.258 ± 0.451
3.145GlyThr: 3.145 ± 0.454
3.79GlyVal: 3.79 ± 0.656
0.887GlyTrp: 0.887 ± 0.303
2.339GlyTyr: 2.339 ± 0.432
0.0GlyXaa: 0.0 ± 0.0
His
1.29HisAla: 1.29 ± 0.327
0.0HisCys: 0.0 ± 0.0
1.048HisAsp: 1.048 ± 0.323
0.726HisGlu: 0.726 ± 0.213
1.048HisPhe: 1.048 ± 0.299
0.645HisGly: 0.645 ± 0.327
0.323HisHis: 0.323 ± 0.182
2.016HisIle: 2.016 ± 0.444
1.29HisLys: 1.29 ± 0.296
1.29HisLeu: 1.29 ± 0.273
0.161HisMet: 0.161 ± 0.106
1.21HisAsn: 1.21 ± 0.344
0.564HisPro: 0.564 ± 0.181
0.403HisGln: 0.403 ± 0.13
0.323HisArg: 0.323 ± 0.175
0.806HisSer: 0.806 ± 0.222
1.451HisThr: 1.451 ± 0.318
0.887HisVal: 0.887 ± 0.258
0.161HisTrp: 0.161 ± 0.136
0.968HisTyr: 0.968 ± 0.286
0.0HisXaa: 0.0 ± 0.0
Ile
5.483IleAla: 5.483 ± 0.807
0.323IleCys: 0.323 ± 0.17
6.209IleAsp: 6.209 ± 0.665
6.29IleGlu: 6.29 ± 0.696
3.306IlePhe: 3.306 ± 0.52
4.113IleGly: 4.113 ± 0.522
1.451IleHis: 1.451 ± 0.384
4.274IleIle: 4.274 ± 0.605
8.951IleLys: 8.951 ± 0.853
4.274IleLeu: 4.274 ± 0.679
1.451IleMet: 1.451 ± 0.332
6.29IleAsn: 6.29 ± 0.875
2.258IlePro: 2.258 ± 0.534
4.032IleGln: 4.032 ± 0.462
2.742IleArg: 2.742 ± 0.521
4.274IleSer: 4.274 ± 0.637
3.871IleThr: 3.871 ± 0.565
4.838IleVal: 4.838 ± 0.585
1.29IleTrp: 1.29 ± 0.445
2.903IleTyr: 2.903 ± 0.577
0.0IleXaa: 0.0 ± 0.0
Lys
5.242LysAla: 5.242 ± 0.585
0.484LysCys: 0.484 ± 0.199
6.532LysAsp: 6.532 ± 0.613
9.112LysGlu: 9.112 ± 0.97
3.387LysPhe: 3.387 ± 0.488
5.08LysGly: 5.08 ± 1.064
1.532LysHis: 1.532 ± 0.373
6.532LysIle: 6.532 ± 0.7
7.903LysLys: 7.903 ± 0.866
7.419LysLeu: 7.419 ± 0.798
2.177LysMet: 2.177 ± 0.495
5.403LysAsn: 5.403 ± 0.811
3.064LysPro: 3.064 ± 0.552
4.516LysGln: 4.516 ± 0.784
4.596LysArg: 4.596 ± 0.629
5.725LysSer: 5.725 ± 0.795
5.322LysThr: 5.322 ± 0.776
6.048LysVal: 6.048 ± 0.79
1.048LysTrp: 1.048 ± 0.33
4.354LysTyr: 4.354 ± 0.582
0.0LysXaa: 0.0 ± 0.0
Leu
3.951LeuAla: 3.951 ± 0.563
0.403LeuCys: 0.403 ± 0.171
5.08LeuAsp: 5.08 ± 0.55
7.257LeuGlu: 7.257 ± 0.896
3.387LeuPhe: 3.387 ± 0.465
3.951LeuGly: 3.951 ± 0.615
1.048LeuHis: 1.048 ± 0.241
6.129LeuIle: 6.129 ± 0.731
9.273LeuLys: 9.273 ± 0.905
7.016LeuLeu: 7.016 ± 0.894
1.855LeuMet: 1.855 ± 0.456
5.645LeuAsn: 5.645 ± 0.705
1.855LeuPro: 1.855 ± 0.454
3.226LeuGln: 3.226 ± 0.429
3.467LeuArg: 3.467 ± 0.529
6.935LeuSer: 6.935 ± 0.825
4.677LeuThr: 4.677 ± 0.513
4.354LeuVal: 4.354 ± 0.572
0.645LeuTrp: 0.645 ± 0.238
3.467LeuTyr: 3.467 ± 0.771
0.0LeuXaa: 0.0 ± 0.0
Met
1.129MetAla: 1.129 ± 0.367
0.161MetCys: 0.161 ± 0.114
1.371MetAsp: 1.371 ± 0.298
1.29MetGlu: 1.29 ± 0.329
0.887MetPhe: 0.887 ± 0.233
1.371MetGly: 1.371 ± 0.469
0.484MetHis: 0.484 ± 0.232
2.177MetIle: 2.177 ± 0.417
2.177MetLys: 2.177 ± 0.452
1.774MetLeu: 1.774 ± 0.314
0.887MetMet: 0.887 ± 0.27
1.855MetAsn: 1.855 ± 0.404
0.968MetPro: 0.968 ± 0.299
1.371MetGln: 1.371 ± 0.426
1.21MetArg: 1.21 ± 0.231
1.371MetSer: 1.371 ± 0.27
1.129MetThr: 1.129 ± 0.283
1.451MetVal: 1.451 ± 0.307
0.484MetTrp: 0.484 ± 0.162
0.806MetTyr: 0.806 ± 0.271
0.0MetXaa: 0.0 ± 0.0
Asn
3.709AsnAla: 3.709 ± 0.623
0.403AsnCys: 0.403 ± 0.251
3.467AsnAsp: 3.467 ± 0.502
4.516AsnGlu: 4.516 ± 0.563
1.451AsnPhe: 1.451 ± 0.469
5.322AsnGly: 5.322 ± 0.597
0.564AsnHis: 0.564 ± 0.248
4.435AsnIle: 4.435 ± 0.555
7.822AsnLys: 7.822 ± 1.047
4.758AsnLeu: 4.758 ± 0.538
1.693AsnMet: 1.693 ± 0.318
5.08AsnAsn: 5.08 ± 0.949
2.661AsnPro: 2.661 ± 0.423
2.661AsnGln: 2.661 ± 0.449
2.822AsnArg: 2.822 ± 0.53
3.548AsnSer: 3.548 ± 0.515
3.467AsnThr: 3.467 ± 0.51
3.548AsnVal: 3.548 ± 0.531
0.968AsnTrp: 0.968 ± 0.367
3.064AsnTyr: 3.064 ± 0.5
0.0AsnXaa: 0.0 ± 0.0
Pro
1.048ProAla: 1.048 ± 0.324
0.0ProCys: 0.0 ± 0.0
1.613ProAsp: 1.613 ± 0.275
2.177ProGlu: 2.177 ± 0.397
1.451ProPhe: 1.451 ± 0.327
0.968ProGly: 0.968 ± 0.218
0.645ProHis: 0.645 ± 0.212
2.016ProIle: 2.016 ± 0.367
3.064ProLys: 3.064 ± 0.558
2.097ProLeu: 2.097 ± 0.448
0.968ProMet: 0.968 ± 0.214
1.048ProAsn: 1.048 ± 0.245
0.726ProPro: 0.726 ± 0.186
0.887ProGln: 0.887 ± 0.276
1.048ProArg: 1.048 ± 0.255
1.855ProSer: 1.855 ± 0.398
1.21ProThr: 1.21 ± 0.334
1.371ProVal: 1.371 ± 0.343
0.161ProTrp: 0.161 ± 0.114
1.21ProTyr: 1.21 ± 0.267
0.0ProXaa: 0.0 ± 0.0
Gln
3.226GlnAla: 3.226 ± 0.402
0.403GlnCys: 0.403 ± 0.2
1.855GlnAsp: 1.855 ± 0.425
2.097GlnGlu: 2.097 ± 0.44
1.129GlnPhe: 1.129 ± 0.215
2.177GlnGly: 2.177 ± 0.396
0.806GlnHis: 0.806 ± 0.235
2.58GlnIle: 2.58 ± 0.355
3.387GlnLys: 3.387 ± 0.478
3.387GlnLeu: 3.387 ± 0.487
0.726GlnMet: 0.726 ± 0.193
3.226GlnAsn: 3.226 ± 0.5
0.887GlnPro: 0.887 ± 0.28
2.016GlnGln: 2.016 ± 0.49
2.339GlnArg: 2.339 ± 0.431
2.016GlnSer: 2.016 ± 0.389
2.177GlnThr: 2.177 ± 0.45
2.258GlnVal: 2.258 ± 0.436
0.484GlnTrp: 0.484 ± 0.237
1.855GlnTyr: 1.855 ± 0.36
0.0GlnXaa: 0.0 ± 0.0
Arg
2.5ArgAla: 2.5 ± 0.363
0.081ArgCys: 0.081 ± 0.081
2.661ArgAsp: 2.661 ± 0.419
4.113ArgGlu: 4.113 ± 0.632
2.177ArgPhe: 2.177 ± 0.378
1.935ArgGly: 1.935 ± 0.393
0.726ArgHis: 0.726 ± 0.246
3.467ArgIle: 3.467 ± 0.641
2.903ArgLys: 2.903 ± 0.455
4.274ArgLeu: 4.274 ± 0.474
1.29ArgMet: 1.29 ± 0.332
2.742ArgAsn: 2.742 ± 0.447
0.968ArgPro: 0.968 ± 0.287
1.774ArgGln: 1.774 ± 0.488
2.339ArgArg: 2.339 ± 0.431
1.29ArgSer: 1.29 ± 0.261
2.339ArgThr: 2.339 ± 0.48
2.742ArgVal: 2.742 ± 0.423
0.484ArgTrp: 0.484 ± 0.211
2.5ArgTyr: 2.5 ± 0.487
0.0ArgXaa: 0.0 ± 0.0
Ser
3.467SerAla: 3.467 ± 0.592
0.242SerCys: 0.242 ± 0.144
4.516SerAsp: 4.516 ± 0.702
5.645SerGlu: 5.645 ± 0.806
2.661SerPhe: 2.661 ± 0.524
3.306SerGly: 3.306 ± 0.821
1.451SerHis: 1.451 ± 0.268
5.0SerIle: 5.0 ± 0.851
4.435SerLys: 4.435 ± 0.636
4.193SerLeu: 4.193 ± 0.589
1.048SerMet: 1.048 ± 0.244
4.596SerAsn: 4.596 ± 0.584
0.806SerPro: 0.806 ± 0.23
1.613SerGln: 1.613 ± 0.384
1.855SerArg: 1.855 ± 0.351
2.661SerSer: 2.661 ± 0.374
3.467SerThr: 3.467 ± 0.561
2.661SerVal: 2.661 ± 0.516
0.403SerTrp: 0.403 ± 0.167
2.5SerTyr: 2.5 ± 0.535
0.0SerXaa: 0.0 ± 0.0
Thr
3.387ThrAla: 3.387 ± 0.562
0.323ThrCys: 0.323 ± 0.172
3.709ThrAsp: 3.709 ± 0.835
3.467ThrGlu: 3.467 ± 0.506
2.258ThrPhe: 2.258 ± 0.415
3.709ThrGly: 3.709 ± 0.62
1.129ThrHis: 1.129 ± 0.332
4.274ThrIle: 4.274 ± 0.628
4.596ThrLys: 4.596 ± 0.596
5.0ThrLeu: 5.0 ± 0.676
1.129ThrMet: 1.129 ± 0.232
2.903ThrAsn: 2.903 ± 0.551
2.258ThrPro: 2.258 ± 0.377
1.774ThrGln: 1.774 ± 0.381
2.419ThrArg: 2.419 ± 0.459
3.467ThrSer: 3.467 ± 0.748
3.064ThrThr: 3.064 ± 0.563
2.984ThrVal: 2.984 ± 0.577
0.564ThrTrp: 0.564 ± 0.2
2.339ThrTyr: 2.339 ± 0.454
0.0ThrXaa: 0.0 ± 0.0
Val
3.387ValAla: 3.387 ± 0.513
0.242ValCys: 0.242 ± 0.132
4.758ValAsp: 4.758 ± 0.686
3.79ValGlu: 3.79 ± 0.606
2.419ValPhe: 2.419 ± 0.558
3.306ValGly: 3.306 ± 0.573
0.887ValHis: 0.887 ± 0.3
5.08ValIle: 5.08 ± 0.661
5.887ValLys: 5.887 ± 0.732
4.354ValLeu: 4.354 ± 0.533
1.129ValMet: 1.129 ± 0.291
3.951ValAsn: 3.951 ± 0.478
0.806ValPro: 0.806 ± 0.298
2.5ValGln: 2.5 ± 0.553
2.177ValArg: 2.177 ± 0.355
2.984ValSer: 2.984 ± 0.584
3.467ValThr: 3.467 ± 0.536
3.79ValVal: 3.79 ± 0.721
0.645ValTrp: 0.645 ± 0.243
2.097ValTyr: 2.097 ± 0.369
0.0ValXaa: 0.0 ± 0.0
Trp
0.403TrpAla: 0.403 ± 0.201
0.081TrpCys: 0.081 ± 0.083
0.806TrpAsp: 0.806 ± 0.246
0.726TrpGlu: 0.726 ± 0.224
0.806TrpPhe: 0.806 ± 0.237
0.484TrpGly: 0.484 ± 0.201
0.161TrpHis: 0.161 ± 0.127
0.806TrpIle: 0.806 ± 0.223
1.048TrpLys: 1.048 ± 0.311
1.371TrpLeu: 1.371 ± 0.27
0.403TrpMet: 0.403 ± 0.175
0.806TrpAsn: 0.806 ± 0.286
0.081TrpPro: 0.081 ± 0.061
0.564TrpGln: 0.564 ± 0.213
0.484TrpArg: 0.484 ± 0.184
0.726TrpSer: 0.726 ± 0.297
0.403TrpThr: 0.403 ± 0.175
0.645TrpVal: 0.645 ± 0.191
0.081TrpTrp: 0.081 ± 0.077
0.564TrpTyr: 0.564 ± 0.206
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.339TyrAla: 2.339 ± 0.374
0.242TyrCys: 0.242 ± 0.136
2.903TyrAsp: 2.903 ± 0.525
3.871TyrGlu: 3.871 ± 0.65
1.935TyrPhe: 1.935 ± 0.439
2.58TyrGly: 2.58 ± 0.504
0.726TyrHis: 0.726 ± 0.265
3.79TyrIle: 3.79 ± 0.707
4.193TyrLys: 4.193 ± 0.566
3.79TyrLeu: 3.79 ± 0.634
1.371TyrMet: 1.371 ± 0.405
2.984TyrAsn: 2.984 ± 0.459
0.726TyrPro: 0.726 ± 0.202
2.419TyrGln: 2.419 ± 0.515
2.016TyrArg: 2.016 ± 0.478
2.016TyrSer: 2.016 ± 0.397
2.339TyrThr: 2.339 ± 0.403
1.935TyrVal: 1.935 ± 0.399
0.645TyrTrp: 0.645 ± 0.234
1.29TyrTyr: 1.29 ± 0.498
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (12402 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski