Amino acid dipepetide frequency for Staphylococcus phage phi879

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.222AlaAla: 3.222 ± 0.694
0.393AlaCys: 0.393 ± 0.19
1.729AlaAsp: 1.729 ± 0.366
4.165AlaGlu: 4.165 ± 0.572
2.122AlaPhe: 2.122 ± 0.388
3.929AlaGly: 3.929 ± 0.655
0.471AlaHis: 0.471 ± 0.192
3.772AlaIle: 3.772 ± 0.653
4.558AlaLys: 4.558 ± 0.505
5.501AlaLeu: 5.501 ± 0.566
1.336AlaMet: 1.336 ± 0.359
3.3AlaAsn: 3.3 ± 0.589
1.336AlaPro: 1.336 ± 0.289
1.886AlaGln: 1.886 ± 0.338
2.279AlaArg: 2.279 ± 0.446
3.065AlaSer: 3.065 ± 0.482
3.457AlaThr: 3.457 ± 0.51
3.065AlaVal: 3.065 ± 0.555
0.786AlaTrp: 0.786 ± 0.268
1.886AlaTyr: 1.886 ± 0.324
0.0AlaXaa: 0.0 ± 0.0
Cys
0.314CysAla: 0.314 ± 0.167
0.314CysCys: 0.314 ± 0.17
0.471CysAsp: 0.471 ± 0.206
0.786CysGlu: 0.786 ± 0.239
0.236CysPhe: 0.236 ± 0.149
0.471CysGly: 0.471 ± 0.225
0.314CysHis: 0.314 ± 0.169
0.629CysIle: 0.629 ± 0.268
0.864CysLys: 0.864 ± 0.306
0.786CysLeu: 0.786 ± 0.3
0.157CysMet: 0.157 ± 0.109
0.55CysAsn: 0.55 ± 0.289
0.157CysPro: 0.157 ± 0.126
0.079CysGln: 0.079 ± 0.091
0.236CysArg: 0.236 ± 0.139
0.393CysSer: 0.393 ± 0.204
0.236CysThr: 0.236 ± 0.132
0.236CysVal: 0.236 ± 0.142
0.0CysTrp: 0.0 ± 0.0
0.157CysTyr: 0.157 ± 0.112
0.0CysXaa: 0.0 ± 0.0
Asp
2.515AspAla: 2.515 ± 0.421
0.236AspCys: 0.236 ± 0.131
3.536AspAsp: 3.536 ± 0.57
6.129AspGlu: 6.129 ± 0.988
2.672AspPhe: 2.672 ± 0.393
3.929AspGly: 3.929 ± 0.549
1.1AspHis: 1.1 ± 0.286
4.715AspIle: 4.715 ± 0.677
6.051AspLys: 6.051 ± 0.654
4.715AspLeu: 4.715 ± 0.626
1.022AspMet: 1.022 ± 0.225
3.929AspAsn: 3.929 ± 0.598
1.1AspPro: 1.1 ± 0.307
1.336AspGln: 1.336 ± 0.286
1.493AspArg: 1.493 ± 0.378
4.322AspSer: 4.322 ± 0.562
2.75AspThr: 2.75 ± 0.45
3.693AspVal: 3.693 ± 0.614
0.55AspTrp: 0.55 ± 0.161
4.086AspTyr: 4.086 ± 0.517
0.0AspXaa: 0.0 ± 0.0
Glu
4.008GluAla: 4.008 ± 0.485
0.707GluCys: 0.707 ± 0.248
4.715GluAsp: 4.715 ± 0.681
7.622GluGlu: 7.622 ± 1.498
3.379GluPhe: 3.379 ± 0.428
3.379GluGly: 3.379 ± 0.404
1.493GluHis: 1.493 ± 0.302
6.836GluIle: 6.836 ± 1.032
7.151GluLys: 7.151 ± 0.904
7.386GluLeu: 7.386 ± 1.256
2.515GluMet: 2.515 ± 0.454
5.029GluAsn: 5.029 ± 0.781
1.729GluPro: 1.729 ± 0.399
3.615GluGln: 3.615 ± 0.708
3.143GluArg: 3.143 ± 0.536
4.243GluSer: 4.243 ± 0.478
4.008GluThr: 4.008 ± 0.588
5.736GluVal: 5.736 ± 0.707
0.786GluTrp: 0.786 ± 0.246
3.065GluTyr: 3.065 ± 0.554
0.0GluXaa: 0.0 ± 0.0
Phe
2.907PheAla: 2.907 ± 0.61
0.393PheCys: 0.393 ± 0.174
2.75PheAsp: 2.75 ± 0.518
3.615PheGlu: 3.615 ± 0.653
1.257PhePhe: 1.257 ± 0.332
2.986PheGly: 2.986 ± 0.395
0.471PheHis: 0.471 ± 0.186
3.615PheIle: 3.615 ± 0.526
5.186PheLys: 5.186 ± 0.574
2.75PheLeu: 2.75 ± 0.567
1.336PheMet: 1.336 ± 0.311
3.457PheAsn: 3.457 ± 0.586
0.707PhePro: 0.707 ± 0.238
1.493PheGln: 1.493 ± 0.361
1.65PheArg: 1.65 ± 0.382
2.75PheSer: 2.75 ± 0.571
2.043PheThr: 2.043 ± 0.448
1.022PheVal: 1.022 ± 0.26
0.471PheTrp: 0.471 ± 0.174
1.65PheTyr: 1.65 ± 0.383
0.0PheXaa: 0.0 ± 0.0
Gly
2.122GlyAla: 2.122 ± 0.428
0.157GlyCys: 0.157 ± 0.117
4.008GlyAsp: 4.008 ± 0.657
2.672GlyGlu: 2.672 ± 0.446
3.615GlyPhe: 3.615 ± 0.517
3.772GlyGly: 3.772 ± 1.033
0.864GlyHis: 0.864 ± 0.281
5.108GlyIle: 5.108 ± 0.875
5.579GlyLys: 5.579 ± 0.739
4.558GlyLeu: 4.558 ± 0.606
1.257GlyMet: 1.257 ± 0.38
3.536GlyAsn: 3.536 ± 0.491
1.336GlyPro: 1.336 ± 0.516
1.964GlyGln: 1.964 ± 0.349
2.2GlyArg: 2.2 ± 0.331
3.772GlySer: 3.772 ± 0.667
4.322GlyThr: 4.322 ± 0.553
3.85GlyVal: 3.85 ± 0.67
0.786GlyTrp: 0.786 ± 0.228
2.986GlyTyr: 2.986 ± 0.495
0.0GlyXaa: 0.0 ± 0.0
His
0.943HisAla: 0.943 ± 0.234
0.393HisCys: 0.393 ± 0.215
0.943HisAsp: 0.943 ± 0.279
1.572HisGlu: 1.572 ± 0.31
0.786HisPhe: 0.786 ± 0.227
0.864HisGly: 0.864 ± 0.263
0.236HisHis: 0.236 ± 0.139
0.864HisIle: 0.864 ± 0.205
1.729HisLys: 1.729 ± 0.373
1.336HisLeu: 1.336 ± 0.371
0.314HisMet: 0.314 ± 0.16
0.629HisAsn: 0.629 ± 0.204
0.629HisPro: 0.629 ± 0.262
0.471HisGln: 0.471 ± 0.168
1.257HisArg: 1.257 ± 0.34
1.336HisSer: 1.336 ± 0.246
1.022HisThr: 1.022 ± 0.281
1.179HisVal: 1.179 ± 0.33
0.157HisTrp: 0.157 ± 0.085
0.943HisTyr: 0.943 ± 0.238
0.0HisXaa: 0.0 ± 0.0
Ile
4.558IleAla: 4.558 ± 0.499
0.393IleCys: 0.393 ± 0.177
5.186IleAsp: 5.186 ± 0.671
6.522IleGlu: 6.522 ± 0.799
2.672IlePhe: 2.672 ± 0.502
3.929IleGly: 3.929 ± 0.483
1.336IleHis: 1.336 ± 0.34
5.265IleIle: 5.265 ± 0.836
7.937IleLys: 7.937 ± 0.8
5.815IleLeu: 5.815 ± 0.647
1.336IleMet: 1.336 ± 0.348
4.479IleAsn: 4.479 ± 0.691
1.964IlePro: 1.964 ± 0.354
2.75IleGln: 2.75 ± 0.408
3.065IleArg: 3.065 ± 0.681
5.501IleSer: 5.501 ± 0.757
4.636IleThr: 4.636 ± 0.695
4.558IleVal: 4.558 ± 0.591
0.943IleTrp: 0.943 ± 0.369
3.065IleTyr: 3.065 ± 0.617
0.0IleXaa: 0.0 ± 0.0
Lys
4.322LysAla: 4.322 ± 0.682
0.314LysCys: 0.314 ± 0.196
6.286LysAsp: 6.286 ± 0.746
7.701LysGlu: 7.701 ± 0.938
4.558LysPhe: 4.558 ± 0.655
5.579LysGly: 5.579 ± 0.962
1.807LysHis: 1.807 ± 0.473
5.501LysIle: 5.501 ± 0.634
9.115LysLys: 9.115 ± 1.155
8.644LysLeu: 8.644 ± 0.887
2.672LysMet: 2.672 ± 0.482
6.051LysAsn: 6.051 ± 0.698
1.964LysPro: 1.964 ± 0.409
4.322LysGln: 4.322 ± 0.53
4.558LysArg: 4.558 ± 0.769
5.972LysSer: 5.972 ± 0.816
5.815LysThr: 5.815 ± 0.699
7.072LysVal: 7.072 ± 0.6
0.393LysTrp: 0.393 ± 0.159
4.558LysTyr: 4.558 ± 0.559
0.0LysXaa: 0.0 ± 0.0
Leu
3.772LeuAla: 3.772 ± 0.6
0.786LeuCys: 0.786 ± 0.334
4.479LeuAsp: 4.479 ± 0.683
6.129LeuGlu: 6.129 ± 0.738
3.379LeuPhe: 3.379 ± 0.483
4.4LeuGly: 4.4 ± 0.836
1.022LeuHis: 1.022 ± 0.314
5.265LeuIle: 5.265 ± 0.737
9.115LeuLys: 9.115 ± 0.955
6.994LeuLeu: 6.994 ± 0.72
1.886LeuMet: 1.886 ± 0.339
6.915LeuAsn: 6.915 ± 0.897
2.75LeuPro: 2.75 ± 0.431
2.986LeuGln: 2.986 ± 0.403
3.536LeuArg: 3.536 ± 0.452
5.972LeuSer: 5.972 ± 0.771
4.793LeuThr: 4.793 ± 0.657
4.793LeuVal: 4.793 ± 0.58
0.55LeuTrp: 0.55 ± 0.199
3.065LeuTyr: 3.065 ± 0.569
0.0LeuXaa: 0.0 ± 0.0
Met
1.572MetAla: 1.572 ± 0.467
0.314MetCys: 0.314 ± 0.167
1.414MetAsp: 1.414 ± 0.331
1.179MetGlu: 1.179 ± 0.333
0.629MetPhe: 0.629 ± 0.216
0.707MetGly: 0.707 ± 0.167
0.707MetHis: 0.707 ± 0.209
1.414MetIle: 1.414 ± 0.279
2.515MetLys: 2.515 ± 0.405
1.493MetLeu: 1.493 ± 0.302
0.314MetMet: 0.314 ± 0.135
1.807MetAsn: 1.807 ± 0.436
0.707MetPro: 0.707 ± 0.294
0.786MetGln: 0.786 ± 0.233
1.1MetArg: 1.1 ± 0.359
2.593MetSer: 2.593 ± 0.43
2.2MetThr: 2.2 ± 0.517
1.414MetVal: 1.414 ± 0.29
0.079MetTrp: 0.079 ± 0.056
1.1MetTyr: 1.1 ± 0.297
0.0MetXaa: 0.0 ± 0.0
Asn
3.379AsnAla: 3.379 ± 0.515
0.393AsnCys: 0.393 ± 0.185
3.3AsnAsp: 3.3 ± 0.587
6.365AsnGlu: 6.365 ± 0.771
2.279AsnPhe: 2.279 ± 0.362
5.893AsnGly: 5.893 ± 0.832
1.65AsnHis: 1.65 ± 0.324
5.658AsnIle: 5.658 ± 0.637
6.286AsnLys: 6.286 ± 0.864
5.579AsnLeu: 5.579 ± 0.654
1.65AsnMet: 1.65 ± 0.308
4.243AsnAsn: 4.243 ± 0.552
2.436AsnPro: 2.436 ± 0.319
2.436AsnGln: 2.436 ± 0.471
2.122AsnArg: 2.122 ± 0.446
3.457AsnSer: 3.457 ± 0.432
3.222AsnThr: 3.222 ± 0.407
2.672AsnVal: 2.672 ± 0.49
0.943AsnTrp: 0.943 ± 0.289
3.143AsnTyr: 3.143 ± 0.434
0.0AsnXaa: 0.0 ± 0.0
Pro
1.807ProAla: 1.807 ± 0.437
0.0ProCys: 0.0 ± 0.0
1.493ProAsp: 1.493 ± 0.42
2.279ProGlu: 2.279 ± 0.588
1.179ProPhe: 1.179 ± 0.422
1.572ProGly: 1.572 ± 0.399
0.786ProHis: 0.786 ± 0.22
2.436ProIle: 2.436 ± 0.381
1.65ProLys: 1.65 ± 0.314
2.043ProLeu: 2.043 ± 0.432
0.314ProMet: 0.314 ± 0.214
1.729ProAsn: 1.729 ± 0.31
0.707ProPro: 0.707 ± 0.238
0.864ProGln: 0.864 ± 0.229
0.864ProArg: 0.864 ± 0.277
1.65ProSer: 1.65 ± 0.327
1.414ProThr: 1.414 ± 0.386
1.729ProVal: 1.729 ± 0.365
0.236ProTrp: 0.236 ± 0.142
0.943ProTyr: 0.943 ± 0.222
0.0ProXaa: 0.0 ± 0.0
Gln
2.122GlnAla: 2.122 ± 0.414
0.079GlnCys: 0.079 ± 0.08
1.807GlnAsp: 1.807 ± 0.312
2.829GlnGlu: 2.829 ± 0.614
1.493GlnPhe: 1.493 ± 0.44
1.886GlnGly: 1.886 ± 0.37
0.55GlnHis: 0.55 ± 0.168
2.986GlnIle: 2.986 ± 0.474
3.143GlnLys: 3.143 ± 0.579
3.065GlnLeu: 3.065 ± 0.549
0.943GlnMet: 0.943 ± 0.283
2.672GlnAsn: 2.672 ± 0.495
0.707GlnPro: 0.707 ± 0.227
2.043GlnGln: 2.043 ± 0.436
2.436GlnArg: 2.436 ± 0.395
2.043GlnSer: 2.043 ± 0.438
1.807GlnThr: 1.807 ± 0.432
1.65GlnVal: 1.65 ± 0.325
0.55GlnTrp: 0.55 ± 0.157
2.043GlnTyr: 2.043 ± 0.386
0.0GlnXaa: 0.0 ± 0.0
Arg
1.886ArgAla: 1.886 ± 0.528
0.236ArgCys: 0.236 ± 0.184
2.515ArgAsp: 2.515 ± 0.495
2.829ArgGlu: 2.829 ± 0.493
2.122ArgPhe: 2.122 ± 0.366
1.964ArgGly: 1.964 ± 0.361
0.864ArgHis: 0.864 ± 0.181
3.693ArgIle: 3.693 ± 0.521
4.322ArgLys: 4.322 ± 0.501
3.065ArgLeu: 3.065 ± 0.356
1.572ArgMet: 1.572 ± 0.285
2.829ArgAsn: 2.829 ± 0.386
1.179ArgPro: 1.179 ± 0.295
1.336ArgGln: 1.336 ± 0.245
1.807ArgArg: 1.807 ± 0.397
2.357ArgSer: 2.357 ± 0.427
2.593ArgThr: 2.593 ± 0.492
1.807ArgVal: 1.807 ± 0.346
0.393ArgTrp: 0.393 ± 0.18
1.65ArgTyr: 1.65 ± 0.372
0.0ArgXaa: 0.0 ± 0.0
Ser
2.515SerAla: 2.515 ± 0.415
0.393SerCys: 0.393 ± 0.168
3.379SerAsp: 3.379 ± 0.471
5.972SerGlu: 5.972 ± 0.647
3.379SerPhe: 3.379 ± 0.559
3.85SerGly: 3.85 ± 0.857
1.179SerHis: 1.179 ± 0.301
5.029SerIle: 5.029 ± 0.667
6.208SerLys: 6.208 ± 0.825
5.029SerLeu: 5.029 ± 0.555
2.122SerMet: 2.122 ± 0.285
5.343SerAsn: 5.343 ± 0.554
1.179SerPro: 1.179 ± 0.253
2.593SerGln: 2.593 ± 0.618
2.2SerArg: 2.2 ± 0.329
4.243SerSer: 4.243 ± 0.623
3.143SerThr: 3.143 ± 0.442
2.75SerVal: 2.75 ± 0.377
0.55SerTrp: 0.55 ± 0.19
1.886SerTyr: 1.886 ± 0.394
0.0SerXaa: 0.0 ± 0.0
Thr
3.536ThrAla: 3.536 ± 0.488
0.629ThrCys: 0.629 ± 0.202
4.558ThrAsp: 4.558 ± 0.667
3.143ThrGlu: 3.143 ± 0.405
2.2ThrPhe: 2.2 ± 0.347
3.615ThrGly: 3.615 ± 0.787
1.1ThrHis: 1.1 ± 0.394
4.715ThrIle: 4.715 ± 0.694
5.343ThrLys: 5.343 ± 0.639
5.108ThrLeu: 5.108 ± 0.489
1.336ThrMet: 1.336 ± 0.267
3.222ThrAsn: 3.222 ± 0.429
2.2ThrPro: 2.2 ± 0.385
1.65ThrGln: 1.65 ± 0.301
2.515ThrArg: 2.515 ± 0.503
3.143ThrSer: 3.143 ± 0.551
4.322ThrThr: 4.322 ± 0.656
3.457ThrVal: 3.457 ± 0.562
0.707ThrTrp: 0.707 ± 0.225
1.65ThrTyr: 1.65 ± 0.439
0.0ThrXaa: 0.0 ± 0.0
Val
4.086ValAla: 4.086 ± 0.688
0.314ValCys: 0.314 ± 0.159
4.086ValAsp: 4.086 ± 0.588
4.243ValGlu: 4.243 ± 0.647
2.672ValPhe: 2.672 ± 0.493
3.3ValGly: 3.3 ± 0.763
0.471ValHis: 0.471 ± 0.176
4.243ValIle: 4.243 ± 0.584
5.108ValLys: 5.108 ± 0.757
4.322ValLeu: 4.322 ± 0.69
1.1ValMet: 1.1 ± 0.312
4.636ValAsn: 4.636 ± 0.56
1.493ValPro: 1.493 ± 0.403
2.2ValGln: 2.2 ± 0.374
2.829ValArg: 2.829 ± 0.717
3.143ValSer: 3.143 ± 0.524
3.693ValThr: 3.693 ± 0.509
4.243ValVal: 4.243 ± 0.746
0.471ValTrp: 0.471 ± 0.175
1.807ValTyr: 1.807 ± 0.393
0.0ValXaa: 0.0 ± 0.0
Trp
0.471TrpAla: 0.471 ± 0.314
0.079TrpCys: 0.079 ± 0.061
0.55TrpAsp: 0.55 ± 0.195
0.55TrpGlu: 0.55 ± 0.191
0.314TrpPhe: 0.314 ± 0.173
0.629TrpGly: 0.629 ± 0.23
0.393TrpHis: 0.393 ± 0.162
0.707TrpIle: 0.707 ± 0.252
1.022TrpLys: 1.022 ± 0.249
1.414TrpLeu: 1.414 ± 0.294
0.0TrpMet: 0.0 ± 0.0
0.393TrpAsn: 0.393 ± 0.227
0.079TrpPro: 0.079 ± 0.056
0.314TrpGln: 0.314 ± 0.203
0.471TrpArg: 0.471 ± 0.184
0.864TrpSer: 0.864 ± 0.339
0.786TrpThr: 0.786 ± 0.206
0.393TrpVal: 0.393 ± 0.173
0.236TrpTrp: 0.236 ± 0.13
0.314TrpTyr: 0.314 ± 0.138
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.2TyrAla: 2.2 ± 0.392
0.864TyrCys: 0.864 ± 0.347
2.672TyrAsp: 2.672 ± 0.349
4.322TyrGlu: 4.322 ± 0.558
1.572TyrPhe: 1.572 ± 0.387
1.807TyrGly: 1.807 ± 0.366
0.786TyrHis: 0.786 ± 0.233
3.457TyrIle: 3.457 ± 0.723
4.322TyrLys: 4.322 ± 0.523
2.986TyrLeu: 2.986 ± 0.538
0.786TyrMet: 0.786 ± 0.287
2.357TyrAsn: 2.357 ± 0.439
1.336TyrPro: 1.336 ± 0.237
1.729TyrGln: 1.729 ± 0.331
1.257TyrArg: 1.257 ± 0.285
2.122TyrSer: 2.122 ± 0.353
1.886TyrThr: 1.886 ± 0.375
3.143TyrVal: 3.143 ± 0.465
0.393TyrTrp: 0.393 ± 0.186
1.414TyrTyr: 1.414 ± 0.422
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (12727 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski