Amino acid dipepetide frequency for Streptococcus phage P9854

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.213AlaAla: 3.213 ± 0.787
0.26AlaCys: 0.26 ± 0.175
4.689AlaAsp: 4.689 ± 0.98
3.56AlaGlu: 3.56 ± 0.567
2.171AlaPhe: 2.171 ± 0.401
3.82AlaGly: 3.82 ± 0.64
0.608AlaHis: 0.608 ± 0.23
4.862AlaIle: 4.862 ± 0.749
5.731AlaLys: 5.731 ± 1.033
6.165AlaLeu: 6.165 ± 0.683
1.91AlaMet: 1.91 ± 0.383
4.168AlaAsn: 4.168 ± 0.68
1.65AlaPro: 1.65 ± 0.35
2.258AlaGln: 2.258 ± 0.572
2.431AlaArg: 2.431 ± 0.437
4.341AlaSer: 4.341 ± 0.559
5.036AlaThr: 5.036 ± 0.77
3.907AlaVal: 3.907 ± 0.648
1.042AlaTrp: 1.042 ± 0.262
2.865AlaTyr: 2.865 ± 0.549
0.0AlaXaa: 0.0 ± 0.0
Cys
0.087CysAla: 0.087 ± 0.075
0.0CysCys: 0.0 ± 0.0
0.781CysAsp: 0.781 ± 0.27
0.347CysGlu: 0.347 ± 0.19
0.26CysPhe: 0.26 ± 0.156
0.347CysGly: 0.347 ± 0.22
0.174CysHis: 0.174 ± 0.128
0.087CysIle: 0.087 ± 0.08
0.521CysLys: 0.521 ± 0.3
0.521CysLeu: 0.521 ± 0.248
0.087CysMet: 0.087 ± 0.079
0.695CysAsn: 0.695 ± 0.276
0.434CysPro: 0.434 ± 0.281
0.347CysGln: 0.347 ± 0.185
0.608CysArg: 0.608 ± 0.32
0.087CysSer: 0.087 ± 0.088
0.434CysThr: 0.434 ± 0.193
0.174CysVal: 0.174 ± 0.093
0.174CysTrp: 0.174 ± 0.131
0.347CysTyr: 0.347 ± 0.164
0.0CysXaa: 0.0 ± 0.0
Asp
3.56AspAla: 3.56 ± 0.505
0.26AspCys: 0.26 ± 0.153
4.081AspAsp: 4.081 ± 0.724
4.255AspGlu: 4.255 ± 0.693
4.081AspPhe: 4.081 ± 0.521
7.641AspGly: 7.641 ± 1.513
0.347AspHis: 0.347 ± 0.208
4.428AspIle: 4.428 ± 0.635
5.21AspLys: 5.21 ± 0.696
3.647AspLeu: 3.647 ± 0.691
2.084AspMet: 2.084 ± 0.422
3.386AspAsn: 3.386 ± 0.488
2.171AspPro: 2.171 ± 0.44
1.65AspGln: 1.65 ± 0.292
2.258AspArg: 2.258 ± 0.436
3.56AspSer: 3.56 ± 0.607
3.56AspThr: 3.56 ± 0.525
4.341AspVal: 4.341 ± 0.646
0.781AspTrp: 0.781 ± 0.253
2.865AspTyr: 2.865 ± 0.507
0.0AspXaa: 0.0 ± 0.0
Glu
3.82GluAla: 3.82 ± 0.552
0.434GluCys: 0.434 ± 0.168
2.779GluAsp: 2.779 ± 0.476
5.123GluGlu: 5.123 ± 1.049
2.779GluPhe: 2.779 ± 0.633
3.213GluGly: 3.213 ± 0.516
1.042GluHis: 1.042 ± 0.304
5.991GluIle: 5.991 ± 0.71
4.862GluLys: 4.862 ± 1.049
6.773GluLeu: 6.773 ± 0.724
2.692GluMet: 2.692 ± 0.551
3.907GluAsn: 3.907 ± 0.646
1.563GluPro: 1.563 ± 0.457
2.605GluGln: 2.605 ± 0.379
3.56GluArg: 3.56 ± 0.726
3.213GluSer: 3.213 ± 0.58
3.473GluThr: 3.473 ± 0.563
4.168GluVal: 4.168 ± 0.625
1.389GluTrp: 1.389 ± 0.271
3.126GluTyr: 3.126 ± 0.639
0.0GluXaa: 0.0 ± 0.0
Phe
3.213PheAla: 3.213 ± 0.559
0.347PheCys: 0.347 ± 0.195
3.213PheAsp: 3.213 ± 0.457
2.431PheGlu: 2.431 ± 0.489
1.91PhePhe: 1.91 ± 0.397
2.865PheGly: 2.865 ± 0.646
0.434PheHis: 0.434 ± 0.151
2.344PheIle: 2.344 ± 0.535
3.907PheLys: 3.907 ± 0.527
3.734PheLeu: 3.734 ± 0.635
0.695PheMet: 0.695 ± 0.211
3.473PheAsn: 3.473 ± 0.696
0.521PhePro: 0.521 ± 0.196
1.216PheGln: 1.216 ± 0.275
1.823PheArg: 1.823 ± 0.41
2.952PheSer: 2.952 ± 0.439
2.605PheThr: 2.605 ± 0.579
3.386PheVal: 3.386 ± 0.696
0.608PheTrp: 0.608 ± 0.215
1.91PheTyr: 1.91 ± 0.524
0.0PheXaa: 0.0 ± 0.0
Gly
3.039GlyAla: 3.039 ± 0.665
0.695GlyCys: 0.695 ± 0.278
4.428GlyAsp: 4.428 ± 0.492
3.299GlyGlu: 3.299 ± 0.488
3.56GlyPhe: 3.56 ± 0.694
4.341GlyGly: 4.341 ± 0.898
0.868GlyHis: 0.868 ± 0.246
6.338GlyIle: 6.338 ± 0.827
6.338GlyLys: 6.338 ± 0.697
6.078GlyLeu: 6.078 ± 0.711
1.563GlyMet: 1.563 ± 0.51
3.647GlyAsn: 3.647 ± 0.65
1.389GlyPro: 1.389 ± 0.493
3.039GlyGln: 3.039 ± 0.444
3.039GlyArg: 3.039 ± 0.583
4.776GlySer: 4.776 ± 0.7
4.602GlyThr: 4.602 ± 0.555
2.952GlyVal: 2.952 ± 0.757
1.302GlyTrp: 1.302 ± 0.317
2.952GlyTyr: 2.952 ± 0.446
0.0GlyXaa: 0.0 ± 0.0
His
0.521HisAla: 0.521 ± 0.176
0.174HisCys: 0.174 ± 0.091
0.781HisAsp: 0.781 ± 0.234
0.608HisGlu: 0.608 ± 0.235
0.608HisPhe: 0.608 ± 0.221
0.608HisGly: 0.608 ± 0.205
0.434HisHis: 0.434 ± 0.172
0.781HisIle: 0.781 ± 0.258
0.955HisLys: 0.955 ± 0.247
1.042HisLeu: 1.042 ± 0.259
0.26HisMet: 0.26 ± 0.151
0.955HisAsn: 0.955 ± 0.272
0.434HisPro: 0.434 ± 0.196
0.521HisGln: 0.521 ± 0.229
0.521HisArg: 0.521 ± 0.146
0.608HisSer: 0.608 ± 0.186
0.608HisThr: 0.608 ± 0.203
1.302HisVal: 1.302 ± 0.267
0.174HisTrp: 0.174 ± 0.101
0.955HisTyr: 0.955 ± 0.344
0.0HisXaa: 0.0 ± 0.0
Ile
4.862IleAla: 4.862 ± 0.59
0.26IleCys: 0.26 ± 0.16
4.428IleAsp: 4.428 ± 0.486
5.123IleGlu: 5.123 ± 0.964
1.823IlePhe: 1.823 ± 0.524
4.689IleGly: 4.689 ± 0.622
0.695IleHis: 0.695 ± 0.266
2.865IleIle: 2.865 ± 0.547
5.991IleLys: 5.991 ± 0.717
3.647IleLeu: 3.647 ± 0.717
1.563IleMet: 1.563 ± 0.519
3.907IleAsn: 3.907 ± 0.487
3.386IlePro: 3.386 ± 0.429
2.431IleGln: 2.431 ± 0.496
3.734IleArg: 3.734 ± 0.596
4.862IleSer: 4.862 ± 0.645
3.994IleThr: 3.994 ± 0.546
3.386IleVal: 3.386 ± 0.473
1.042IleTrp: 1.042 ± 0.243
2.431IleTyr: 2.431 ± 0.579
0.0IleXaa: 0.0 ± 0.0
Lys
5.991LysAla: 5.991 ± 0.631
0.521LysCys: 0.521 ± 0.26
4.428LysAsp: 4.428 ± 0.73
7.901LysGlu: 7.901 ± 1.007
3.56LysPhe: 3.56 ± 0.78
5.47LysGly: 5.47 ± 0.568
1.216LysHis: 1.216 ± 0.413
4.515LysIle: 4.515 ± 0.58
7.641LysLys: 7.641 ± 1.318
6.773LysLeu: 6.773 ± 0.805
2.084LysMet: 2.084 ± 0.402
5.123LysAsn: 5.123 ± 0.583
3.386LysPro: 3.386 ± 0.418
3.386LysGln: 3.386 ± 0.575
3.126LysArg: 3.126 ± 0.43
4.689LysSer: 4.689 ± 0.546
5.47LysThr: 5.47 ± 0.573
4.602LysVal: 4.602 ± 0.609
0.955LysTrp: 0.955 ± 0.26
3.386LysTyr: 3.386 ± 0.865
0.0LysXaa: 0.0 ± 0.0
Leu
6.425LeuAla: 6.425 ± 0.679
0.608LeuCys: 0.608 ± 0.26
5.904LeuAsp: 5.904 ± 0.79
6.165LeuGlu: 6.165 ± 0.925
2.692LeuPhe: 2.692 ± 0.395
4.602LeuGly: 4.602 ± 0.842
0.608LeuHis: 0.608 ± 0.274
4.515LeuIle: 4.515 ± 0.66
7.294LeuLys: 7.294 ± 0.737
5.123LeuLeu: 5.123 ± 0.633
2.171LeuMet: 2.171 ± 0.384
5.383LeuAsn: 5.383 ± 0.595
2.865LeuPro: 2.865 ± 0.427
3.039LeuGln: 3.039 ± 0.535
3.299LeuArg: 3.299 ± 0.698
5.904LeuSer: 5.904 ± 0.674
5.991LeuThr: 5.991 ± 0.888
4.689LeuVal: 4.689 ± 0.669
0.695LeuTrp: 0.695 ± 0.259
1.91LeuTyr: 1.91 ± 0.464
0.0LeuXaa: 0.0 ± 0.0
Met
2.171MetAla: 2.171 ± 0.513
0.174MetCys: 0.174 ± 0.118
1.302MetAsp: 1.302 ± 0.295
1.563MetGlu: 1.563 ± 0.479
0.955MetPhe: 0.955 ± 0.275
0.955MetGly: 0.955 ± 0.218
0.434MetHis: 0.434 ± 0.212
1.65MetIle: 1.65 ± 0.337
2.431MetLys: 2.431 ± 0.536
1.823MetLeu: 1.823 ± 0.308
0.521MetMet: 0.521 ± 0.159
1.563MetAsn: 1.563 ± 0.399
1.042MetPro: 1.042 ± 0.294
1.129MetGln: 1.129 ± 0.336
0.781MetArg: 0.781 ± 0.234
1.91MetSer: 1.91 ± 0.326
1.65MetThr: 1.65 ± 0.344
1.997MetVal: 1.997 ± 0.347
0.174MetTrp: 0.174 ± 0.126
1.476MetTyr: 1.476 ± 0.399
0.0MetXaa: 0.0 ± 0.0
Asn
4.515AsnAla: 4.515 ± 1.1
0.174AsnCys: 0.174 ± 0.112
3.647AsnAsp: 3.647 ± 0.46
3.734AsnGlu: 3.734 ± 0.553
2.344AsnPhe: 2.344 ± 0.544
6.425AsnGly: 6.425 ± 1.057
1.042AsnHis: 1.042 ± 0.248
3.994AsnIle: 3.994 ± 0.546
4.168AsnLys: 4.168 ± 0.62
4.255AsnLeu: 4.255 ± 0.557
1.389AsnMet: 1.389 ± 0.367
3.907AsnAsn: 3.907 ± 0.6
3.213AsnPro: 3.213 ± 0.644
2.171AsnGln: 2.171 ± 0.358
1.737AsnArg: 1.737 ± 0.464
4.602AsnSer: 4.602 ± 0.609
3.299AsnThr: 3.299 ± 0.905
3.82AsnVal: 3.82 ± 0.513
1.129AsnTrp: 1.129 ± 0.303
1.91AsnTyr: 1.91 ± 0.373
0.0AsnXaa: 0.0 ± 0.0
Pro
1.389ProAla: 1.389 ± 0.324
0.0ProCys: 0.0 ± 0.0
1.737ProAsp: 1.737 ± 0.418
2.344ProGlu: 2.344 ± 0.407
1.476ProPhe: 1.476 ± 0.38
1.65ProGly: 1.65 ± 0.47
0.347ProHis: 0.347 ± 0.179
1.476ProIle: 1.476 ± 0.393
3.734ProLys: 3.734 ± 0.461
2.605ProLeu: 2.605 ± 0.428
0.434ProMet: 0.434 ± 0.178
2.431ProAsn: 2.431 ± 0.397
0.695ProPro: 0.695 ± 0.367
1.476ProGln: 1.476 ± 0.334
1.302ProArg: 1.302 ± 0.331
2.865ProSer: 2.865 ± 0.564
2.344ProThr: 2.344 ± 0.413
1.91ProVal: 1.91 ± 0.544
0.434ProTrp: 0.434 ± 0.142
0.868ProTyr: 0.868 ± 0.288
0.0ProXaa: 0.0 ± 0.0
Gln
3.039GlnAla: 3.039 ± 0.636
0.174GlnCys: 0.174 ± 0.113
1.823GlnAsp: 1.823 ± 0.365
2.865GlnGlu: 2.865 ± 0.557
1.563GlnPhe: 1.563 ± 0.308
3.213GlnGly: 3.213 ± 0.857
0.434GlnHis: 0.434 ± 0.186
1.823GlnIle: 1.823 ± 0.387
3.213GlnLys: 3.213 ± 0.493
3.907GlnLeu: 3.907 ± 0.43
1.389GlnMet: 1.389 ± 0.398
2.605GlnAsn: 2.605 ± 0.486
0.521GlnPro: 0.521 ± 0.256
2.431GlnGln: 2.431 ± 0.587
1.216GlnArg: 1.216 ± 0.357
2.171GlnSer: 2.171 ± 0.391
2.779GlnThr: 2.779 ± 0.425
2.171GlnVal: 2.171 ± 0.466
0.608GlnTrp: 0.608 ± 0.221
2.431GlnTyr: 2.431 ± 0.406
0.0GlnXaa: 0.0 ± 0.0
Arg
2.431ArgAla: 2.431 ± 0.519
0.695ArgCys: 0.695 ± 0.466
2.692ArgAsp: 2.692 ± 0.437
2.779ArgGlu: 2.779 ± 0.499
1.997ArgPhe: 1.997 ± 0.396
2.605ArgGly: 2.605 ± 0.358
0.868ArgHis: 0.868 ± 0.291
2.518ArgIle: 2.518 ± 0.576
3.126ArgLys: 3.126 ± 0.618
3.82ArgLeu: 3.82 ± 0.783
1.129ArgMet: 1.129 ± 0.341
2.344ArgAsn: 2.344 ± 0.398
1.129ArgPro: 1.129 ± 0.251
1.823ArgGln: 1.823 ± 0.32
1.389ArgArg: 1.389 ± 0.384
1.91ArgSer: 1.91 ± 0.409
2.692ArgThr: 2.692 ± 0.663
1.91ArgVal: 1.91 ± 0.395
1.042ArgTrp: 1.042 ± 0.261
2.084ArgTyr: 2.084 ± 0.511
0.0ArgXaa: 0.0 ± 0.0
Ser
3.647SerAla: 3.647 ± 0.632
0.608SerCys: 0.608 ± 0.238
4.515SerAsp: 4.515 ± 0.781
4.255SerGlu: 4.255 ± 0.661
3.647SerPhe: 3.647 ± 0.725
4.255SerGly: 4.255 ± 0.421
0.521SerHis: 0.521 ± 0.209
4.168SerIle: 4.168 ± 0.515
5.817SerLys: 5.817 ± 0.902
4.341SerLeu: 4.341 ± 0.592
2.171SerMet: 2.171 ± 0.367
3.994SerAsn: 3.994 ± 0.501
2.171SerPro: 2.171 ± 0.473
3.039SerGln: 3.039 ± 0.536
2.258SerArg: 2.258 ± 0.437
3.647SerSer: 3.647 ± 0.729
4.341SerThr: 4.341 ± 0.641
5.297SerVal: 5.297 ± 0.704
0.868SerTrp: 0.868 ± 0.244
1.302SerTyr: 1.302 ± 0.267
0.0SerXaa: 0.0 ± 0.0
Thr
3.907ThrAla: 3.907 ± 0.707
0.26ThrCys: 0.26 ± 0.159
4.081ThrAsp: 4.081 ± 0.676
3.039ThrGlu: 3.039 ± 0.413
3.386ThrPhe: 3.386 ± 0.647
4.168ThrGly: 4.168 ± 0.654
1.216ThrHis: 1.216 ± 0.308
5.47ThrIle: 5.47 ± 0.93
4.168ThrLys: 4.168 ± 0.516
6.773ThrLeu: 6.773 ± 0.836
1.129ThrMet: 1.129 ± 0.252
4.081ThrAsn: 4.081 ± 0.626
1.91ThrPro: 1.91 ± 0.464
2.692ThrGln: 2.692 ± 0.479
1.997ThrArg: 1.997 ± 0.467
4.081ThrSer: 4.081 ± 0.491
3.473ThrThr: 3.473 ± 0.519
3.907ThrVal: 3.907 ± 0.492
1.042ThrTrp: 1.042 ± 0.309
3.039ThrTyr: 3.039 ± 0.58
0.0ThrXaa: 0.0 ± 0.0
Val
5.036ValAla: 5.036 ± 0.788
0.26ValCys: 0.26 ± 0.139
5.383ValAsp: 5.383 ± 0.716
3.473ValGlu: 3.473 ± 0.575
2.605ValPhe: 2.605 ± 0.359
4.255ValGly: 4.255 ± 0.612
0.521ValHis: 0.521 ± 0.207
4.081ValIle: 4.081 ± 0.617
5.991ValLys: 5.991 ± 0.706
3.907ValLeu: 3.907 ± 0.553
1.389ValMet: 1.389 ± 0.309
3.299ValAsn: 3.299 ± 0.526
1.65ValPro: 1.65 ± 0.408
1.65ValGln: 1.65 ± 0.413
2.518ValArg: 2.518 ± 0.608
4.602ValSer: 4.602 ± 0.724
4.168ValThr: 4.168 ± 0.641
3.299ValVal: 3.299 ± 0.49
0.781ValTrp: 0.781 ± 0.231
1.823ValTyr: 1.823 ± 0.324
0.0ValXaa: 0.0 ± 0.0
Trp
0.781TrpAla: 0.781 ± 0.22
0.174TrpCys: 0.174 ± 0.101
1.302TrpAsp: 1.302 ± 0.531
0.955TrpGlu: 0.955 ± 0.273
0.608TrpPhe: 0.608 ± 0.244
0.608TrpGly: 0.608 ± 0.215
0.26TrpHis: 0.26 ± 0.119
0.868TrpIle: 0.868 ± 0.214
0.781TrpLys: 0.781 ± 0.224
1.563TrpLeu: 1.563 ± 0.309
0.26TrpMet: 0.26 ± 0.127
1.042TrpAsn: 1.042 ± 0.305
0.174TrpPro: 0.174 ± 0.122
0.955TrpGln: 0.955 ± 0.258
0.695TrpArg: 0.695 ± 0.205
1.302TrpSer: 1.302 ± 0.387
0.868TrpThr: 0.868 ± 0.239
1.042TrpVal: 1.042 ± 0.273
0.26TrpTrp: 0.26 ± 0.175
0.26TrpTyr: 0.26 ± 0.126
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.039TyrAla: 3.039 ± 0.493
0.434TyrCys: 0.434 ± 0.289
2.171TyrAsp: 2.171 ± 0.373
2.692TyrGlu: 2.692 ± 0.529
1.563TyrPhe: 1.563 ± 0.299
2.518TyrGly: 2.518 ± 0.627
0.781TyrHis: 0.781 ± 0.248
2.431TyrIle: 2.431 ± 0.407
2.344TyrLys: 2.344 ± 0.441
3.386TyrLeu: 3.386 ± 0.561
0.695TyrMet: 0.695 ± 0.243
1.563TyrAsn: 1.563 ± 0.326
1.216TyrPro: 1.216 ± 0.36
2.518TyrGln: 2.518 ± 0.34
2.692TyrArg: 2.692 ± 0.609
2.779TyrSer: 2.779 ± 0.707
2.431TyrThr: 2.431 ± 0.726
2.431TyrVal: 2.431 ± 0.504
0.26TyrTrp: 0.26 ± 0.153
2.084TyrTyr: 2.084 ± 0.475
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (11518 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski