Amino acid dipepetide frequency for Staphylococcus phage SA12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.438AlaAla: 1.438 ± 0.45
0.606AlaCys: 0.606 ± 0.206
2.801AlaAsp: 2.801 ± 0.428
4.012AlaGlu: 4.012 ± 0.539
2.801AlaPhe: 2.801 ± 0.559
3.104AlaGly: 3.104 ± 0.428
0.984AlaHis: 0.984 ± 0.208
4.618AlaIle: 4.618 ± 1.074
4.996AlaLys: 4.996 ± 0.714
4.088AlaLeu: 4.088 ± 0.632
1.968AlaMet: 1.968 ± 0.522
4.466AlaAsn: 4.466 ± 0.466
1.893AlaPro: 1.893 ± 0.354
2.422AlaGln: 2.422 ± 0.55
2.877AlaArg: 2.877 ± 0.551
4.164AlaSer: 4.164 ± 0.637
3.861AlaThr: 3.861 ± 0.691
3.255AlaVal: 3.255 ± 0.584
0.908AlaTrp: 0.908 ± 0.317
2.498AlaTyr: 2.498 ± 0.38
0.0AlaXaa: 0.0 ± 0.0
Cys
0.151CysAla: 0.151 ± 0.111
0.076CysCys: 0.076 ± 0.068
0.227CysAsp: 0.227 ± 0.161
0.227CysGlu: 0.227 ± 0.126
0.303CysPhe: 0.303 ± 0.166
0.227CysGly: 0.227 ± 0.112
0.0CysHis: 0.0 ± 0.0
0.227CysIle: 0.227 ± 0.134
0.454CysLys: 0.454 ± 0.162
0.303CysLeu: 0.303 ± 0.165
0.151CysMet: 0.151 ± 0.108
0.53CysAsn: 0.53 ± 0.23
0.303CysPro: 0.303 ± 0.202
0.303CysGln: 0.303 ± 0.124
0.151CysArg: 0.151 ± 0.114
0.606CysSer: 0.606 ± 0.216
0.379CysThr: 0.379 ± 0.178
0.076CysVal: 0.076 ± 0.083
0.151CysTrp: 0.151 ± 0.1
0.227CysTyr: 0.227 ± 0.141
0.0CysXaa: 0.0 ± 0.0
Asp
3.558AspAla: 3.558 ± 0.558
0.227AspCys: 0.227 ± 0.125
4.769AspAsp: 4.769 ± 0.89
6.132AspGlu: 6.132 ± 0.931
3.255AspPhe: 3.255 ± 0.481
4.012AspGly: 4.012 ± 0.538
0.454AspHis: 0.454 ± 0.186
4.088AspIle: 4.088 ± 0.584
4.921AspLys: 4.921 ± 0.814
4.921AspLeu: 4.921 ± 0.582
1.287AspMet: 1.287 ± 0.263
4.164AspAsn: 4.164 ± 0.596
1.287AspPro: 1.287 ± 0.375
1.211AspGln: 1.211 ± 0.335
2.498AspArg: 2.498 ± 0.437
4.088AspSer: 4.088 ± 0.606
4.088AspThr: 4.088 ± 0.626
4.315AspVal: 4.315 ± 0.587
0.606AspTrp: 0.606 ± 0.169
2.574AspTyr: 2.574 ± 0.53
0.0AspXaa: 0.0 ± 0.0
Glu
5.602GluAla: 5.602 ± 0.846
0.303GluCys: 0.303 ± 0.132
3.785GluAsp: 3.785 ± 0.559
6.435GluGlu: 6.435 ± 0.873
3.482GluPhe: 3.482 ± 0.522
2.725GluGly: 2.725 ± 0.505
1.136GluHis: 1.136 ± 0.323
6.359GluIle: 6.359 ± 0.905
5.45GluLys: 5.45 ± 0.824
6.964GluLeu: 6.964 ± 0.776
1.968GluMet: 1.968 ± 0.44
5.526GluAsn: 5.526 ± 0.728
2.044GluPro: 2.044 ± 0.414
4.466GluGln: 4.466 ± 0.733
3.709GluArg: 3.709 ± 0.653
3.861GluSer: 3.861 ± 0.705
3.179GluThr: 3.179 ± 0.43
5.223GluVal: 5.223 ± 0.543
0.833GluTrp: 0.833 ± 0.246
4.996GluTyr: 4.996 ± 0.634
0.0GluXaa: 0.0 ± 0.0
Phe
2.195PheAla: 2.195 ± 0.391
0.379PheCys: 0.379 ± 0.17
2.801PheAsp: 2.801 ± 0.489
3.785PheGlu: 3.785 ± 0.525
1.514PhePhe: 1.514 ± 0.289
2.498PheGly: 2.498 ± 0.484
0.379PheHis: 0.379 ± 0.204
3.331PheIle: 3.331 ± 0.46
4.618PheLys: 4.618 ± 0.504
2.725PheLeu: 2.725 ± 0.425
0.908PheMet: 0.908 ± 0.213
3.861PheAsn: 3.861 ± 0.494
0.833PhePro: 0.833 ± 0.271
1.59PheGln: 1.59 ± 0.435
1.06PheArg: 1.06 ± 0.314
2.347PheSer: 2.347 ± 0.488
2.65PheThr: 2.65 ± 0.454
2.347PheVal: 2.347 ± 0.608
0.227PheTrp: 0.227 ± 0.112
1.514PheTyr: 1.514 ± 0.374
0.0PheXaa: 0.0 ± 0.0
Gly
3.179GlyAla: 3.179 ± 0.44
0.227GlyCys: 0.227 ± 0.144
3.104GlyAsp: 3.104 ± 0.565
3.407GlyGlu: 3.407 ± 0.625
2.044GlyPhe: 2.044 ± 0.437
2.877GlyGly: 2.877 ± 0.482
1.211GlyHis: 1.211 ± 0.413
4.164GlyIle: 4.164 ± 0.48
4.088GlyLys: 4.088 ± 0.508
4.996GlyLeu: 4.996 ± 0.697
1.136GlyMet: 1.136 ± 0.302
3.104GlyAsn: 3.104 ± 0.497
0.606GlyPro: 0.606 ± 0.278
1.893GlyGln: 1.893 ± 0.373
2.422GlyArg: 2.422 ± 0.51
2.877GlySer: 2.877 ± 0.539
4.164GlyThr: 4.164 ± 0.597
4.012GlyVal: 4.012 ± 0.518
1.363GlyTrp: 1.363 ± 0.461
2.725GlyTyr: 2.725 ± 0.699
0.0GlyXaa: 0.0 ± 0.0
His
1.06HisAla: 1.06 ± 0.319
0.076HisCys: 0.076 ± 0.074
1.211HisAsp: 1.211 ± 0.257
1.363HisGlu: 1.363 ± 0.4
0.833HisPhe: 0.833 ± 0.249
1.211HisGly: 1.211 ± 0.358
0.379HisHis: 0.379 ± 0.166
1.06HisIle: 1.06 ± 0.274
0.757HisLys: 0.757 ± 0.227
1.136HisLeu: 1.136 ± 0.291
0.303HisMet: 0.303 ± 0.197
0.984HisAsn: 0.984 ± 0.254
0.681HisPro: 0.681 ± 0.223
0.53HisGln: 0.53 ± 0.203
0.606HisArg: 0.606 ± 0.187
1.06HisSer: 1.06 ± 0.24
1.06HisThr: 1.06 ± 0.27
1.363HisVal: 1.363 ± 0.337
0.0HisTrp: 0.0 ± 0.0
0.379HisTyr: 0.379 ± 0.248
0.0HisXaa: 0.0 ± 0.0
Ile
4.769IleAla: 4.769 ± 0.701
0.076IleCys: 0.076 ± 0.069
5.753IleAsp: 5.753 ± 0.656
6.586IleGlu: 6.586 ± 0.772
2.347IlePhe: 2.347 ± 0.417
3.709IleGly: 3.709 ± 0.534
1.59IleHis: 1.59 ± 0.299
4.239IleIle: 4.239 ± 0.414
8.1IleLys: 8.1 ± 0.824
4.542IleLeu: 4.542 ± 0.48
1.665IleMet: 1.665 ± 0.332
5.072IleAsn: 5.072 ± 1.021
1.741IlePro: 1.741 ± 0.316
2.12IleGln: 2.12 ± 0.491
3.255IleArg: 3.255 ± 0.597
4.315IleSer: 4.315 ± 0.56
5.223IleThr: 5.223 ± 0.64
4.088IleVal: 4.088 ± 0.664
1.287IleTrp: 1.287 ± 0.648
2.65IleTyr: 2.65 ± 0.386
0.0IleXaa: 0.0 ± 0.0
Lys
5.602LysAla: 5.602 ± 0.769
0.076LysCys: 0.076 ± 0.069
5.602LysAsp: 5.602 ± 0.739
7.646LysGlu: 7.646 ± 0.87
3.331LysPhe: 3.331 ± 0.494
4.921LysGly: 4.921 ± 0.679
1.211LysHis: 1.211 ± 0.27
6.132LysIle: 6.132 ± 0.896
9.387LysLys: 9.387 ± 0.82
7.343LysLeu: 7.343 ± 0.745
1.817LysMet: 1.817 ± 0.367
5.45LysAsn: 5.45 ± 0.793
2.422LysPro: 2.422 ± 0.486
5.072LysGln: 5.072 ± 0.561
4.164LysArg: 4.164 ± 0.479
5.148LysSer: 5.148 ± 0.666
5.678LysThr: 5.678 ± 0.636
6.435LysVal: 6.435 ± 0.688
0.681LysTrp: 0.681 ± 0.22
4.239LysTyr: 4.239 ± 0.709
0.0LysXaa: 0.0 ± 0.0
Leu
2.725LeuAla: 2.725 ± 0.419
0.379LeuCys: 0.379 ± 0.179
5.375LeuAsp: 5.375 ± 0.662
6.737LeuGlu: 6.737 ± 0.793
3.709LeuPhe: 3.709 ± 0.52
3.709LeuGly: 3.709 ± 0.46
1.211LeuHis: 1.211 ± 0.347
4.239LeuIle: 4.239 ± 0.448
7.04LeuLys: 7.04 ± 0.529
4.693LeuLeu: 4.693 ± 0.635
1.665LeuMet: 1.665 ± 0.357
6.359LeuAsn: 6.359 ± 0.594
2.725LeuPro: 2.725 ± 0.49
4.012LeuGln: 4.012 ± 0.568
3.407LeuArg: 3.407 ± 0.602
4.012LeuSer: 4.012 ± 0.463
5.829LeuThr: 5.829 ± 0.72
4.996LeuVal: 4.996 ± 0.844
0.606LeuTrp: 0.606 ± 0.222
2.65LeuTyr: 2.65 ± 0.467
0.0LeuXaa: 0.0 ± 0.0
Met
1.06MetAla: 1.06 ± 0.272
0.076MetCys: 0.076 ± 0.074
1.211MetAsp: 1.211 ± 0.277
1.893MetGlu: 1.893 ± 0.468
0.833MetPhe: 0.833 ± 0.231
0.984MetGly: 0.984 ± 0.289
0.379MetHis: 0.379 ± 0.22
1.514MetIle: 1.514 ± 0.305
1.665MetLys: 1.665 ± 0.285
2.422MetLeu: 2.422 ± 0.357
0.833MetMet: 0.833 ± 0.243
1.665MetAsn: 1.665 ± 0.457
0.908MetPro: 0.908 ± 0.251
1.438MetGln: 1.438 ± 0.377
0.833MetArg: 0.833 ± 0.302
1.211MetSer: 1.211 ± 0.307
2.725MetThr: 2.725 ± 0.56
1.211MetVal: 1.211 ± 0.294
0.454MetTrp: 0.454 ± 0.167
0.833MetTyr: 0.833 ± 0.293
0.0MetXaa: 0.0 ± 0.0
Asn
5.45AsnAla: 5.45 ± 0.675
0.227AsnCys: 0.227 ± 0.15
5.072AsnAsp: 5.072 ± 0.568
5.375AsnGlu: 5.375 ± 0.697
2.877AsnPhe: 2.877 ± 0.5
4.769AsnGly: 4.769 ± 0.61
1.136AsnHis: 1.136 ± 0.302
4.391AsnIle: 4.391 ± 0.498
7.57AsnLys: 7.57 ± 0.671
5.223AsnLeu: 5.223 ± 0.576
1.438AsnMet: 1.438 ± 0.309
4.164AsnAsn: 4.164 ± 0.674
2.725AsnPro: 2.725 ± 0.473
2.801AsnGln: 2.801 ± 0.466
2.422AsnArg: 2.422 ± 0.407
3.104AsnSer: 3.104 ± 0.593
4.466AsnThr: 4.466 ± 0.566
4.012AsnVal: 4.012 ± 0.538
0.757AsnTrp: 0.757 ± 0.21
2.952AsnTyr: 2.952 ± 0.487
0.0AsnXaa: 0.0 ± 0.0
Pro
1.363ProAla: 1.363 ± 0.262
0.227ProCys: 0.227 ± 0.135
1.211ProAsp: 1.211 ± 0.34
2.044ProGlu: 2.044 ± 0.332
1.211ProPhe: 1.211 ± 0.309
1.893ProGly: 1.893 ± 0.479
0.379ProHis: 0.379 ± 0.172
2.347ProIle: 2.347 ± 0.394
3.255ProLys: 3.255 ± 0.571
1.438ProLeu: 1.438 ± 0.345
0.984ProMet: 0.984 ± 0.32
2.271ProAsn: 2.271 ± 0.369
0.53ProPro: 0.53 ± 0.226
0.984ProGln: 0.984 ± 0.29
1.06ProArg: 1.06 ± 0.302
1.211ProSer: 1.211 ± 0.337
1.59ProThr: 1.59 ± 0.332
2.422ProVal: 2.422 ± 0.583
0.076ProTrp: 0.076 ± 0.073
1.136ProTyr: 1.136 ± 0.314
0.0ProXaa: 0.0 ± 0.0
Gln
2.65GlnAla: 2.65 ± 0.493
0.379GlnCys: 0.379 ± 0.185
2.422GlnAsp: 2.422 ± 0.495
2.801GlnGlu: 2.801 ± 0.611
1.741GlnPhe: 1.741 ± 0.415
2.044GlnGly: 2.044 ± 0.352
0.53GlnHis: 0.53 ± 0.174
2.801GlnIle: 2.801 ± 0.379
3.331GlnLys: 3.331 ± 0.515
3.709GlnLeu: 3.709 ± 0.658
1.211GlnMet: 1.211 ± 0.321
2.65GlnAsn: 2.65 ± 0.336
1.514GlnPro: 1.514 ± 0.448
2.271GlnGln: 2.271 ± 0.427
1.59GlnArg: 1.59 ± 0.417
2.725GlnSer: 2.725 ± 0.423
2.422GlnThr: 2.422 ± 0.411
2.347GlnVal: 2.347 ± 0.447
0.454GlnTrp: 0.454 ± 0.166
1.363GlnTyr: 1.363 ± 0.33
0.0GlnXaa: 0.0 ± 0.0
Arg
1.514ArgAla: 1.514 ± 0.339
0.303ArgCys: 0.303 ± 0.146
1.968ArgAsp: 1.968 ± 0.467
2.725ArgGlu: 2.725 ± 0.429
1.893ArgPhe: 1.893 ± 0.409
1.968ArgGly: 1.968 ± 0.376
0.908ArgHis: 0.908 ± 0.224
3.861ArgIle: 3.861 ± 0.647
4.315ArgLys: 4.315 ± 0.692
4.239ArgLeu: 4.239 ± 0.684
0.303ArgMet: 0.303 ± 0.147
3.407ArgAsn: 3.407 ± 0.527
0.681ArgPro: 0.681 ± 0.183
1.665ArgGln: 1.665 ± 0.418
1.817ArgArg: 1.817 ± 0.344
2.801ArgSer: 2.801 ± 0.383
1.741ArgThr: 1.741 ± 0.36
2.271ArgVal: 2.271 ± 0.5
0.379ArgTrp: 0.379 ± 0.2
2.347ArgTyr: 2.347 ± 0.518
0.0ArgXaa: 0.0 ± 0.0
Ser
4.618SerAla: 4.618 ± 0.652
0.379SerCys: 0.379 ± 0.219
4.239SerAsp: 4.239 ± 0.744
3.331SerGlu: 3.331 ± 0.429
2.12SerPhe: 2.12 ± 0.483
2.952SerGly: 2.952 ± 0.541
1.363SerHis: 1.363 ± 0.364
5.45SerIle: 5.45 ± 0.673
5.526SerLys: 5.526 ± 0.67
3.634SerLeu: 3.634 ± 0.436
1.968SerMet: 1.968 ± 0.392
4.239SerAsn: 4.239 ± 0.524
1.287SerPro: 1.287 ± 0.341
1.893SerGln: 1.893 ± 0.344
2.422SerArg: 2.422 ± 0.35
3.028SerSer: 3.028 ± 0.434
3.407SerThr: 3.407 ± 0.42
3.179SerVal: 3.179 ± 0.461
0.303SerTrp: 0.303 ± 0.135
2.195SerTyr: 2.195 ± 0.398
0.0SerXaa: 0.0 ± 0.0
Thr
4.239ThrAla: 4.239 ± 0.55
0.151ThrCys: 0.151 ± 0.103
3.558ThrAsp: 3.558 ± 0.757
4.315ThrGlu: 4.315 ± 0.624
2.347ThrPhe: 2.347 ± 0.426
3.861ThrGly: 3.861 ± 0.496
0.984ThrHis: 0.984 ± 0.252
6.056ThrIle: 6.056 ± 1.111
5.602ThrLys: 5.602 ± 0.646
5.148ThrLeu: 5.148 ± 0.6
0.833ThrMet: 0.833 ± 0.311
3.936ThrAsn: 3.936 ± 0.558
2.271ThrPro: 2.271 ± 0.431
3.179ThrGln: 3.179 ± 0.548
2.65ThrArg: 2.65 ± 0.327
4.088ThrSer: 4.088 ± 0.716
4.164ThrThr: 4.164 ± 0.831
4.466ThrVal: 4.466 ± 0.699
0.908ThrTrp: 0.908 ± 0.288
1.968ThrTyr: 1.968 ± 0.479
0.0ThrXaa: 0.0 ± 0.0
Val
3.407ValAla: 3.407 ± 0.854
0.454ValCys: 0.454 ± 0.172
4.845ValAsp: 4.845 ± 0.833
4.618ValGlu: 4.618 ± 0.615
2.725ValPhe: 2.725 ± 0.465
3.104ValGly: 3.104 ± 0.518
0.681ValHis: 0.681 ± 0.203
4.769ValIle: 4.769 ± 0.52
5.98ValLys: 5.98 ± 0.716
5.072ValLeu: 5.072 ± 0.828
2.195ValMet: 2.195 ± 0.44
5.223ValAsn: 5.223 ± 0.578
1.968ValPro: 1.968 ± 0.517
1.211ValGln: 1.211 ± 0.35
2.195ValArg: 2.195 ± 0.445
3.634ValSer: 3.634 ± 0.498
4.315ValThr: 4.315 ± 0.536
3.558ValVal: 3.558 ± 0.6
1.06ValTrp: 1.06 ± 0.344
2.347ValTyr: 2.347 ± 0.516
0.0ValXaa: 0.0 ± 0.0
Trp
0.984TrpAla: 0.984 ± 0.372
0.151TrpCys: 0.151 ± 0.095
0.227TrpAsp: 0.227 ± 0.108
0.833TrpGlu: 0.833 ± 0.236
0.303TrpPhe: 0.303 ± 0.181
0.606TrpGly: 0.606 ± 0.325
0.227TrpHis: 0.227 ± 0.133
0.53TrpIle: 0.53 ± 0.188
0.908TrpLys: 0.908 ± 0.268
0.757TrpLeu: 0.757 ± 0.261
0.151TrpMet: 0.151 ± 0.116
1.665TrpAsn: 1.665 ± 0.908
0.151TrpPro: 0.151 ± 0.097
0.606TrpGln: 0.606 ± 0.24
0.227TrpArg: 0.227 ± 0.147
0.833TrpSer: 0.833 ± 0.264
0.984TrpThr: 0.984 ± 0.301
0.833TrpVal: 0.833 ± 0.32
0.0TrpTrp: 0.0 ± 0.0
0.681TrpTyr: 0.681 ± 0.255
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.195TyrAla: 2.195 ± 0.304
0.303TyrCys: 0.303 ± 0.146
2.271TyrAsp: 2.271 ± 0.411
3.785TyrGlu: 3.785 ± 0.589
1.893TyrPhe: 1.893 ± 0.373
2.422TyrGly: 2.422 ± 0.701
0.908TyrHis: 0.908 ± 0.324
2.952TyrIle: 2.952 ± 0.526
4.391TyrLys: 4.391 ± 0.576
2.725TyrLeu: 2.725 ± 0.499
1.287TyrMet: 1.287 ± 0.314
2.195TyrAsn: 2.195 ± 0.362
1.06TyrPro: 1.06 ± 0.35
1.363TyrGln: 1.363 ± 0.303
1.741TyrArg: 1.741 ± 0.486
2.422TyrSer: 2.422 ± 0.54
2.801TyrThr: 2.801 ± 0.384
2.952TyrVal: 2.952 ± 0.522
0.606TyrTrp: 0.606 ± 0.196
1.59TyrTyr: 1.59 ± 0.337
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (13211 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski