Amino acid dipepetide frequency for Staphylococcus phage phiBU01

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.492AlaAla: 1.492 ± 0.72
0.439AlaCys: 0.439 ± 0.185
2.633AlaAsp: 2.633 ± 0.492
3.423AlaGlu: 3.423 ± 0.692
3.072AlaPhe: 3.072 ± 0.625
3.599AlaGly: 3.599 ± 0.658
0.439AlaHis: 0.439 ± 0.184
5.267AlaIle: 5.267 ± 0.879
4.565AlaLys: 4.565 ± 0.471
5.179AlaLeu: 5.179 ± 0.76
1.404AlaMet: 1.404 ± 0.366
3.248AlaAsn: 3.248 ± 0.56
1.58AlaPro: 1.58 ± 0.404
3.072AlaGln: 3.072 ± 0.566
2.458AlaArg: 2.458 ± 0.468
4.038AlaSer: 4.038 ± 0.628
3.16AlaThr: 3.16 ± 0.547
2.721AlaVal: 2.721 ± 0.682
0.439AlaTrp: 0.439 ± 0.191
3.072AlaTyr: 3.072 ± 0.5
0.0AlaXaa: 0.0 ± 0.0
Cys
0.351CysAla: 0.351 ± 0.182
0.176CysCys: 0.176 ± 0.174
0.088CysAsp: 0.088 ± 0.077
0.088CysGlu: 0.088 ± 0.094
0.088CysPhe: 0.088 ± 0.077
0.176CysGly: 0.176 ± 0.105
0.088CysHis: 0.088 ± 0.087
0.527CysIle: 0.527 ± 0.214
0.351CysLys: 0.351 ± 0.176
0.351CysLeu: 0.351 ± 0.157
0.088CysMet: 0.088 ± 0.091
0.351CysAsn: 0.351 ± 0.149
0.176CysPro: 0.176 ± 0.136
0.088CysGln: 0.088 ± 0.087
0.351CysArg: 0.351 ± 0.204
0.614CysSer: 0.614 ± 0.297
0.614CysThr: 0.614 ± 0.259
0.263CysVal: 0.263 ± 0.154
0.088CysTrp: 0.088 ± 0.094
0.439CysTyr: 0.439 ± 0.175
0.0CysXaa: 0.0 ± 0.0
Asp
3.16AspAla: 3.16 ± 0.381
0.351AspCys: 0.351 ± 0.169
4.038AspAsp: 4.038 ± 0.734
5.091AspGlu: 5.091 ± 0.948
3.16AspPhe: 3.16 ± 0.527
5.53AspGly: 5.53 ± 0.711
0.79AspHis: 0.79 ± 0.236
4.74AspIle: 4.74 ± 0.74
4.916AspLys: 4.916 ± 0.866
5.091AspLeu: 5.091 ± 0.829
1.404AspMet: 1.404 ± 0.371
4.828AspAsn: 4.828 ± 0.692
1.492AspPro: 1.492 ± 0.427
1.053AspGln: 1.053 ± 0.369
2.195AspArg: 2.195 ± 0.588
3.336AspSer: 3.336 ± 0.492
3.16AspThr: 3.16 ± 0.554
3.511AspVal: 3.511 ± 0.668
0.439AspTrp: 0.439 ± 0.165
2.809AspTyr: 2.809 ± 0.619
0.0AspXaa: 0.0 ± 0.0
Glu
5.091GluAla: 5.091 ± 0.858
0.263GluCys: 0.263 ± 0.15
3.511GluAsp: 3.511 ± 0.572
7.022GluGlu: 7.022 ± 1.178
3.423GluPhe: 3.423 ± 0.547
2.37GluGly: 2.37 ± 0.493
1.229GluHis: 1.229 ± 0.315
6.671GluIle: 6.671 ± 0.866
7.461GluLys: 7.461 ± 1.09
7.9GluLeu: 7.9 ± 0.944
1.931GluMet: 1.931 ± 0.527
4.477GluAsn: 4.477 ± 0.62
2.107GluPro: 2.107 ± 0.496
3.072GluGln: 3.072 ± 0.432
4.038GluArg: 4.038 ± 0.658
4.565GluSer: 4.565 ± 0.657
3.862GluThr: 3.862 ± 0.964
4.038GluVal: 4.038 ± 0.558
0.878GluTrp: 0.878 ± 0.214
3.687GluTyr: 3.687 ± 0.609
0.0GluXaa: 0.0 ± 0.0
Phe
2.546PheAla: 2.546 ± 0.514
0.351PheCys: 0.351 ± 0.225
2.809PheAsp: 2.809 ± 0.494
3.775PheGlu: 3.775 ± 0.497
1.492PhePhe: 1.492 ± 0.314
2.458PheGly: 2.458 ± 0.442
0.527PheHis: 0.527 ± 0.196
3.775PheIle: 3.775 ± 0.627
4.565PheLys: 4.565 ± 0.699
2.985PheLeu: 2.985 ± 0.589
1.404PheMet: 1.404 ± 0.452
4.477PheAsn: 4.477 ± 0.631
1.229PhePro: 1.229 ± 0.365
0.614PheGln: 0.614 ± 0.152
1.58PheArg: 1.58 ± 0.368
2.107PheSer: 2.107 ± 0.534
2.546PheThr: 2.546 ± 0.594
2.37PheVal: 2.37 ± 0.432
0.263PheTrp: 0.263 ± 0.193
1.492PheTyr: 1.492 ± 0.393
0.0PheXaa: 0.0 ± 0.0
Gly
2.458GlyAla: 2.458 ± 0.521
0.614GlyCys: 0.614 ± 0.185
3.862GlyAsp: 3.862 ± 0.577
3.072GlyGlu: 3.072 ± 0.665
2.458GlyPhe: 2.458 ± 0.526
4.477GlyGly: 4.477 ± 1.079
1.404GlyHis: 1.404 ± 0.37
4.301GlyIle: 4.301 ± 0.808
7.198GlyLys: 7.198 ± 0.953
5.706GlyLeu: 5.706 ± 0.809
0.79GlyMet: 0.79 ± 0.343
2.633GlyAsn: 2.633 ± 0.443
1.404GlyPro: 1.404 ± 0.352
1.756GlyGln: 1.756 ± 0.317
2.37GlyArg: 2.37 ± 0.398
3.16GlySer: 3.16 ± 0.645
2.985GlyThr: 2.985 ± 0.605
3.862GlyVal: 3.862 ± 0.741
0.79GlyTrp: 0.79 ± 0.278
2.633GlyTyr: 2.633 ± 0.488
0.0GlyXaa: 0.0 ± 0.0
His
0.702HisAla: 0.702 ± 0.267
0.176HisCys: 0.176 ± 0.126
0.878HisAsp: 0.878 ± 0.285
1.141HisGlu: 1.141 ± 0.271
1.141HisPhe: 1.141 ± 0.28
0.527HisGly: 0.527 ± 0.206
0.439HisHis: 0.439 ± 0.201
1.931HisIle: 1.931 ± 0.391
1.58HisLys: 1.58 ± 0.413
1.229HisLeu: 1.229 ± 0.291
0.088HisMet: 0.088 ± 0.077
0.878HisAsn: 0.878 ± 0.287
0.614HisPro: 0.614 ± 0.251
0.702HisGln: 0.702 ± 0.295
0.351HisArg: 0.351 ± 0.183
0.79HisSer: 0.79 ± 0.176
0.878HisThr: 0.878 ± 0.257
0.966HisVal: 0.966 ± 0.272
0.088HisTrp: 0.088 ± 0.098
0.878HisTyr: 0.878 ± 0.27
0.0HisXaa: 0.0 ± 0.0
Ile
4.652IleAla: 4.652 ± 0.679
0.614IleCys: 0.614 ± 0.267
5.618IleAsp: 5.618 ± 0.85
6.935IleGlu: 6.935 ± 0.742
3.423IlePhe: 3.423 ± 0.549
3.775IleGly: 3.775 ± 0.551
1.141IleHis: 1.141 ± 0.314
4.916IleIle: 4.916 ± 0.747
8.515IleLys: 8.515 ± 0.634
4.828IleLeu: 4.828 ± 0.622
1.58IleMet: 1.58 ± 0.341
5.794IleAsn: 5.794 ± 1.065
1.843IlePro: 1.843 ± 0.503
3.16IleGln: 3.16 ± 0.485
3.072IleArg: 3.072 ± 0.633
5.794IleSer: 5.794 ± 0.851
4.652IleThr: 4.652 ± 0.53
5.179IleVal: 5.179 ± 0.633
1.317IleTrp: 1.317 ± 0.489
3.072IleTyr: 3.072 ± 0.691
0.0IleXaa: 0.0 ± 0.0
Lys
5.179LysAla: 5.179 ± 0.938
0.176LysCys: 0.176 ± 0.125
5.969LysAsp: 5.969 ± 0.569
9.217LysGlu: 9.217 ± 1.074
3.511LysPhe: 3.511 ± 0.535
6.32LysGly: 6.32 ± 0.928
1.141LysHis: 1.141 ± 0.4
7.374LysIle: 7.374 ± 0.81
8.515LysLys: 8.515 ± 1.084
6.496LysLeu: 6.496 ± 0.772
2.721LysMet: 2.721 ± 0.509
5.881LysAsn: 5.881 ± 0.709
2.985LysPro: 2.985 ± 0.571
3.862LysGln: 3.862 ± 0.727
4.301LysArg: 4.301 ± 0.683
6.057LysSer: 6.057 ± 0.816
6.057LysThr: 6.057 ± 0.854
5.881LysVal: 5.881 ± 0.615
1.229LysTrp: 1.229 ± 0.339
3.862LysTyr: 3.862 ± 0.672
0.0LysXaa: 0.0 ± 0.0
Leu
4.038LeuAla: 4.038 ± 0.774
0.176LeuCys: 0.176 ± 0.104
5.179LeuAsp: 5.179 ± 0.603
7.374LeuGlu: 7.374 ± 1.007
3.687LeuPhe: 3.687 ± 0.588
3.336LeuGly: 3.336 ± 0.735
0.966LeuHis: 0.966 ± 0.293
5.091LeuIle: 5.091 ± 0.73
8.339LeuLys: 8.339 ± 0.939
6.584LeuLeu: 6.584 ± 0.792
1.668LeuMet: 1.668 ± 0.369
6.057LeuAsn: 6.057 ± 0.759
2.37LeuPro: 2.37 ± 0.501
3.16LeuGln: 3.16 ± 0.594
3.687LeuArg: 3.687 ± 0.514
5.881LeuSer: 5.881 ± 0.588
4.565LeuThr: 4.565 ± 0.647
3.423LeuVal: 3.423 ± 0.607
0.878LeuTrp: 0.878 ± 0.266
3.248LeuTyr: 3.248 ± 0.594
0.0LeuXaa: 0.0 ± 0.0
Met
1.141MetAla: 1.141 ± 0.378
0.176MetCys: 0.176 ± 0.117
1.492MetAsp: 1.492 ± 0.343
1.317MetGlu: 1.317 ± 0.318
0.878MetPhe: 0.878 ± 0.26
1.492MetGly: 1.492 ± 0.456
0.351MetHis: 0.351 ± 0.152
2.546MetIle: 2.546 ± 0.423
1.668MetLys: 1.668 ± 0.394
1.931MetLeu: 1.931 ± 0.262
0.966MetMet: 0.966 ± 0.373
1.317MetAsn: 1.317 ± 0.321
0.702MetPro: 0.702 ± 0.26
0.614MetGln: 0.614 ± 0.193
1.053MetArg: 1.053 ± 0.296
1.492MetSer: 1.492 ± 0.296
1.141MetThr: 1.141 ± 0.299
1.58MetVal: 1.58 ± 0.301
0.351MetTrp: 0.351 ± 0.167
1.053MetTyr: 1.053 ± 0.286
0.0MetXaa: 0.0 ± 0.0
Asn
4.565AsnAla: 4.565 ± 0.61
0.088AsnCys: 0.088 ± 0.091
4.213AsnAsp: 4.213 ± 0.761
5.355AsnGlu: 5.355 ± 0.978
1.756AsnPhe: 1.756 ± 0.573
5.179AsnGly: 5.179 ± 0.56
1.317AsnHis: 1.317 ± 0.354
4.213AsnIle: 4.213 ± 0.542
7.022AsnLys: 7.022 ± 0.847
4.213AsnLeu: 4.213 ± 0.676
1.58AsnMet: 1.58 ± 0.397
4.565AsnAsn: 4.565 ± 0.717
2.458AsnPro: 2.458 ± 0.404
2.897AsnGln: 2.897 ± 0.589
3.072AsnArg: 3.072 ± 0.836
3.248AsnSer: 3.248 ± 0.482
4.038AsnThr: 4.038 ± 0.466
3.687AsnVal: 3.687 ± 0.554
1.053AsnTrp: 1.053 ± 0.471
3.336AsnTyr: 3.336 ± 0.478
0.0AsnXaa: 0.0 ± 0.0
Pro
1.317ProAla: 1.317 ± 0.4
0.088ProCys: 0.088 ± 0.093
1.317ProAsp: 1.317 ± 0.288
2.37ProGlu: 2.37 ± 0.482
1.404ProPhe: 1.404 ± 0.365
1.053ProGly: 1.053 ± 0.261
0.351ProHis: 0.351 ± 0.156
3.16ProIle: 3.16 ± 0.457
2.546ProLys: 2.546 ± 0.575
2.019ProLeu: 2.019 ± 0.379
0.527ProMet: 0.527 ± 0.208
1.229ProAsn: 1.229 ± 0.329
1.141ProPro: 1.141 ± 0.296
0.702ProGln: 0.702 ± 0.228
1.404ProArg: 1.404 ± 0.341
1.756ProSer: 1.756 ± 0.43
1.931ProThr: 1.931 ± 0.435
1.492ProVal: 1.492 ± 0.301
0.263ProTrp: 0.263 ± 0.129
0.79ProTyr: 0.79 ± 0.208
0.0ProXaa: 0.0 ± 0.0
Gln
2.985GlnAla: 2.985 ± 0.529
0.263GlnCys: 0.263 ± 0.155
2.019GlnAsp: 2.019 ± 0.493
1.931GlnGlu: 1.931 ± 0.397
0.966GlnPhe: 0.966 ± 0.285
2.282GlnGly: 2.282 ± 0.449
0.79GlnHis: 0.79 ± 0.362
2.897GlnIle: 2.897 ± 0.456
2.985GlnLys: 2.985 ± 0.469
3.336GlnLeu: 3.336 ± 0.549
0.878GlnMet: 0.878 ± 0.241
3.248GlnAsn: 3.248 ± 0.524
0.702GlnPro: 0.702 ± 0.239
1.756GlnGln: 1.756 ± 0.491
1.492GlnArg: 1.492 ± 0.341
2.107GlnSer: 2.107 ± 0.44
1.931GlnThr: 1.931 ± 0.396
1.931GlnVal: 1.931 ± 0.368
0.527GlnTrp: 0.527 ± 0.168
1.492GlnTyr: 1.492 ± 0.287
0.0GlnXaa: 0.0 ± 0.0
Arg
2.195ArgAla: 2.195 ± 0.474
0.176ArgCys: 0.176 ± 0.126
2.107ArgAsp: 2.107 ± 0.371
3.423ArgGlu: 3.423 ± 0.646
2.107ArgPhe: 2.107 ± 0.437
1.843ArgGly: 1.843 ± 0.332
0.702ArgHis: 0.702 ± 0.279
3.248ArgIle: 3.248 ± 0.695
4.038ArgLys: 4.038 ± 0.661
3.862ArgLeu: 3.862 ± 0.612
1.141ArgMet: 1.141 ± 0.267
2.985ArgAsn: 2.985 ± 0.436
0.439ArgPro: 0.439 ± 0.17
1.317ArgGln: 1.317 ± 0.318
2.458ArgArg: 2.458 ± 0.462
1.756ArgSer: 1.756 ± 0.388
2.897ArgThr: 2.897 ± 0.601
2.282ArgVal: 2.282 ± 0.363
0.527ArgTrp: 0.527 ± 0.236
2.721ArgTyr: 2.721 ± 0.562
0.0ArgXaa: 0.0 ± 0.0
Ser
3.95SerAla: 3.95 ± 0.596
0.614SerCys: 0.614 ± 0.258
4.477SerAsp: 4.477 ± 0.742
5.881SerGlu: 5.881 ± 0.719
2.985SerPhe: 2.985 ± 0.58
2.897SerGly: 2.897 ± 0.601
1.404SerHis: 1.404 ± 0.293
5.969SerIle: 5.969 ± 0.768
5.881SerLys: 5.881 ± 1.118
3.862SerLeu: 3.862 ± 0.582
1.404SerMet: 1.404 ± 0.367
4.389SerAsn: 4.389 ± 0.475
1.053SerPro: 1.053 ± 0.268
1.668SerGln: 1.668 ± 0.468
1.668SerArg: 1.668 ± 0.422
3.687SerSer: 3.687 ± 0.568
2.985SerThr: 2.985 ± 0.456
2.809SerVal: 2.809 ± 0.564
0.527SerTrp: 0.527 ± 0.3
2.282SerTyr: 2.282 ± 0.402
0.0SerXaa: 0.0 ± 0.0
Thr
3.775ThrAla: 3.775 ± 0.634
0.263ThrCys: 0.263 ± 0.17
3.511ThrAsp: 3.511 ± 0.578
3.423ThrGlu: 3.423 ± 0.515
2.107ThrPhe: 2.107 ± 0.344
3.599ThrGly: 3.599 ± 0.982
1.492ThrHis: 1.492 ± 0.299
4.301ThrIle: 4.301 ± 0.637
5.442ThrLys: 5.442 ± 0.63
4.916ThrLeu: 4.916 ± 0.572
0.79ThrMet: 0.79 ± 0.209
3.16ThrAsn: 3.16 ± 0.477
2.195ThrPro: 2.195 ± 0.424
1.931ThrGln: 1.931 ± 0.453
2.195ThrArg: 2.195 ± 0.553
3.862ThrSer: 3.862 ± 0.782
3.687ThrThr: 3.687 ± 0.584
3.862ThrVal: 3.862 ± 0.582
1.053ThrTrp: 1.053 ± 0.37
2.458ThrTyr: 2.458 ± 0.518
0.0ThrXaa: 0.0 ± 0.0
Val
2.985ValAla: 2.985 ± 0.658
0.176ValCys: 0.176 ± 0.174
3.862ValAsp: 3.862 ± 0.642
2.897ValGlu: 2.897 ± 0.583
2.809ValPhe: 2.809 ± 0.525
3.862ValGly: 3.862 ± 0.611
0.702ValHis: 0.702 ± 0.254
4.477ValIle: 4.477 ± 0.644
5.267ValLys: 5.267 ± 0.746
4.389ValLeu: 4.389 ± 0.643
1.317ValMet: 1.317 ± 0.336
4.301ValAsn: 4.301 ± 0.494
1.404ValPro: 1.404 ± 0.365
2.458ValGln: 2.458 ± 0.634
2.195ValArg: 2.195 ± 0.41
2.809ValSer: 2.809 ± 0.485
4.038ValThr: 4.038 ± 0.53
3.423ValVal: 3.423 ± 0.692
0.614ValTrp: 0.614 ± 0.254
2.546ValTyr: 2.546 ± 0.362
0.0ValXaa: 0.0 ± 0.0
Trp
0.351TrpAla: 0.351 ± 0.155
0.088TrpCys: 0.088 ± 0.079
0.614TrpAsp: 0.614 ± 0.253
1.053TrpGlu: 1.053 ± 0.276
0.79TrpPhe: 0.79 ± 0.249
0.614TrpGly: 0.614 ± 0.227
0.088TrpHis: 0.088 ± 0.079
1.141TrpIle: 1.141 ± 0.252
1.141TrpLys: 1.141 ± 0.28
1.492TrpLeu: 1.492 ± 0.414
0.263TrpMet: 0.263 ± 0.13
1.053TrpAsn: 1.053 ± 0.436
0.088TrpPro: 0.088 ± 0.072
0.439TrpGln: 0.439 ± 0.18
0.527TrpArg: 0.527 ± 0.184
0.527TrpSer: 0.527 ± 0.201
0.263TrpThr: 0.263 ± 0.13
0.702TrpVal: 0.702 ± 0.275
0.088TrpTrp: 0.088 ± 0.095
0.614TrpTyr: 0.614 ± 0.22
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.37TyrAla: 2.37 ± 0.355
0.088TyrCys: 0.088 ± 0.083
2.985TyrAsp: 2.985 ± 0.537
2.809TyrGlu: 2.809 ± 0.596
2.107TyrPhe: 2.107 ± 0.502
2.546TyrGly: 2.546 ± 0.489
0.614TyrHis: 0.614 ± 0.224
3.336TyrIle: 3.336 ± 0.713
4.74TyrLys: 4.74 ± 0.696
3.336TyrLeu: 3.336 ± 0.579
1.229TyrMet: 1.229 ± 0.369
2.985TyrAsn: 2.985 ± 0.49
0.878TyrPro: 0.878 ± 0.235
2.195TyrGln: 2.195 ± 0.442
1.756TyrArg: 1.756 ± 0.422
2.809TyrSer: 2.809 ± 0.531
2.633TyrThr: 2.633 ± 0.49
2.546TyrVal: 2.546 ± 0.411
0.527TyrTrp: 0.527 ± 0.209
0.966TyrTyr: 0.966 ± 0.382
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (11393 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski