Amino acid dipepetide frequency for Streptococcus phage CHPC577

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.534AlaAla: 3.534 ± 0.604
0.478AlaCys: 0.478 ± 0.199
3.629AlaAsp: 3.629 ± 0.629
3.056AlaGlu: 3.056 ± 0.507
2.483AlaPhe: 2.483 ± 0.549
3.438AlaGly: 3.438 ± 0.751
0.86AlaHis: 0.86 ± 0.327
5.062AlaIle: 5.062 ± 1.239
5.635AlaLys: 5.635 ± 0.837
5.826AlaLeu: 5.826 ± 0.658
2.006AlaMet: 2.006 ± 0.382
4.68AlaAsn: 4.68 ± 0.71
1.91AlaPro: 1.91 ± 0.4
1.719AlaGln: 1.719 ± 0.361
2.292AlaArg: 2.292 ± 0.551
4.393AlaSer: 4.393 ± 0.629
3.725AlaThr: 3.725 ± 0.66
3.152AlaVal: 3.152 ± 0.632
1.624AlaTrp: 1.624 ± 0.417
2.483AlaTyr: 2.483 ± 0.491
0.0AlaXaa: 0.0 ± 0.0
Cys
0.191CysAla: 0.191 ± 0.149
0.096CysCys: 0.096 ± 0.098
0.86CysAsp: 0.86 ± 0.288
0.764CysGlu: 0.764 ± 0.297
0.096CysPhe: 0.096 ± 0.09
0.669CysGly: 0.669 ± 0.249
0.287CysHis: 0.287 ± 0.151
0.382CysIle: 0.382 ± 0.176
0.478CysLys: 0.478 ± 0.176
0.478CysLeu: 0.478 ± 0.186
0.096CysMet: 0.096 ± 0.086
0.478CysAsn: 0.478 ± 0.229
0.191CysPro: 0.191 ± 0.124
0.287CysGln: 0.287 ± 0.152
0.478CysArg: 0.478 ± 0.216
0.955CysSer: 0.955 ± 0.376
0.191CysThr: 0.191 ± 0.123
0.478CysVal: 0.478 ± 0.287
0.191CysTrp: 0.191 ± 0.152
0.382CysTyr: 0.382 ± 0.173
0.0CysXaa: 0.0 ± 0.0
Asp
2.865AspAla: 2.865 ± 0.428
0.573AspCys: 0.573 ± 0.248
4.298AspAsp: 4.298 ± 0.693
3.82AspGlu: 3.82 ± 0.762
3.056AspPhe: 3.056 ± 0.766
5.73AspGly: 5.73 ± 0.89
0.669AspHis: 0.669 ± 0.248
4.298AspIle: 4.298 ± 0.678
4.393AspLys: 4.393 ± 0.659
4.393AspLeu: 4.393 ± 0.574
1.433AspMet: 1.433 ± 0.387
3.725AspAsn: 3.725 ± 0.634
1.337AspPro: 1.337 ± 0.476
1.051AspGln: 1.051 ± 0.342
2.674AspArg: 2.674 ± 0.505
3.343AspSer: 3.343 ± 0.455
3.534AspThr: 3.534 ± 0.475
4.966AspVal: 4.966 ± 0.796
0.86AspTrp: 0.86 ± 0.28
2.77AspTyr: 2.77 ± 0.594
0.0AspXaa: 0.0 ± 0.0
Glu
3.916GluAla: 3.916 ± 0.669
0.669GluCys: 0.669 ± 0.259
3.343GluAsp: 3.343 ± 0.585
4.775GluGlu: 4.775 ± 0.904
2.961GluPhe: 2.961 ± 0.553
3.629GluGly: 3.629 ± 0.576
0.86GluHis: 0.86 ± 0.244
4.871GluIle: 4.871 ± 0.819
6.399GluLys: 6.399 ± 1.144
6.972GluLeu: 6.972 ± 1.034
1.528GluMet: 1.528 ± 0.398
4.584GluAsn: 4.584 ± 0.855
2.197GluPro: 2.197 ± 0.634
2.197GluGln: 2.197 ± 0.556
3.629GluArg: 3.629 ± 0.605
3.534GluSer: 3.534 ± 0.567
4.107GluThr: 4.107 ± 0.642
4.393GluVal: 4.393 ± 0.869
1.624GluTrp: 1.624 ± 0.303
3.438GluTyr: 3.438 ± 0.675
0.0GluXaa: 0.0 ± 0.0
Phe
2.961PheAla: 2.961 ± 0.452
0.573PheCys: 0.573 ± 0.249
3.82PheAsp: 3.82 ± 0.407
3.82PheGlu: 3.82 ± 0.662
1.719PhePhe: 1.719 ± 0.435
2.77PheGly: 2.77 ± 0.488
0.478PheHis: 0.478 ± 0.267
2.388PheIle: 2.388 ± 0.475
4.202PheLys: 4.202 ± 0.822
2.674PheLeu: 2.674 ± 0.484
1.433PheMet: 1.433 ± 0.332
3.247PheAsn: 3.247 ± 0.755
0.287PhePro: 0.287 ± 0.199
2.197PheGln: 2.197 ± 0.405
1.528PheArg: 1.528 ± 0.431
3.056PheSer: 3.056 ± 0.477
2.674PheThr: 2.674 ± 0.49
2.483PheVal: 2.483 ± 0.468
0.669PheTrp: 0.669 ± 0.329
1.528PheTyr: 1.528 ± 0.425
0.0PheXaa: 0.0 ± 0.0
Gly
4.202GlyAla: 4.202 ± 0.77
0.478GlyCys: 0.478 ± 0.193
3.916GlyAsp: 3.916 ± 0.643
2.197GlyGlu: 2.197 ± 0.47
2.961GlyPhe: 2.961 ± 0.559
3.438GlyGly: 3.438 ± 0.701
0.382GlyHis: 0.382 ± 0.169
6.303GlyIle: 6.303 ± 1.1
7.736GlyLys: 7.736 ± 0.874
5.062GlyLeu: 5.062 ± 1.182
1.528GlyMet: 1.528 ± 0.603
3.629GlyAsn: 3.629 ± 0.609
0.86GlyPro: 0.86 ± 0.364
1.91GlyGln: 1.91 ± 0.547
3.152GlyArg: 3.152 ± 0.555
4.584GlySer: 4.584 ± 0.594
4.393GlyThr: 4.393 ± 0.619
3.725GlyVal: 3.725 ± 0.681
0.764GlyTrp: 0.764 ± 0.272
3.152GlyTyr: 3.152 ± 0.649
0.0GlyXaa: 0.0 ± 0.0
His
0.382HisAla: 0.382 ± 0.165
0.096HisCys: 0.096 ± 0.086
0.573HisAsp: 0.573 ± 0.211
0.478HisGlu: 0.478 ± 0.175
0.382HisPhe: 0.382 ± 0.173
1.337HisGly: 1.337 ± 0.313
0.191HisHis: 0.191 ± 0.154
0.669HisIle: 0.669 ± 0.253
0.86HisLys: 0.86 ± 0.28
0.764HisLeu: 0.764 ± 0.242
0.096HisMet: 0.096 ± 0.114
0.382HisAsn: 0.382 ± 0.175
0.478HisPro: 0.478 ± 0.206
0.478HisGln: 0.478 ± 0.249
0.669HisArg: 0.669 ± 0.287
0.955HisSer: 0.955 ± 0.287
0.669HisThr: 0.669 ± 0.257
1.051HisVal: 1.051 ± 0.265
0.287HisTrp: 0.287 ± 0.154
0.382HisTyr: 0.382 ± 0.164
0.0HisXaa: 0.0 ± 0.0
Ile
4.393IleAla: 4.393 ± 0.772
0.764IleCys: 0.764 ± 0.315
4.298IleAsp: 4.298 ± 0.671
6.494IleGlu: 6.494 ± 0.889
2.197IlePhe: 2.197 ± 0.528
4.393IleGly: 4.393 ± 0.81
0.669IleHis: 0.669 ± 0.26
4.871IleIle: 4.871 ± 0.79
6.303IleLys: 6.303 ± 0.675
3.534IleLeu: 3.534 ± 0.41
1.719IleMet: 1.719 ± 0.42
5.635IleAsn: 5.635 ± 0.599
3.152IlePro: 3.152 ± 0.534
3.343IleGln: 3.343 ± 0.557
2.197IleArg: 2.197 ± 0.39
4.298IleSer: 4.298 ± 0.699
4.489IleThr: 4.489 ± 0.783
3.82IleVal: 3.82 ± 0.63
0.955IleTrp: 0.955 ± 0.334
2.292IleTyr: 2.292 ± 0.485
0.0IleXaa: 0.0 ± 0.0
Lys
6.494LysAla: 6.494 ± 1.018
0.287LysCys: 0.287 ± 0.163
4.584LysAsp: 4.584 ± 0.585
7.449LysGlu: 7.449 ± 0.895
2.961LysPhe: 2.961 ± 0.791
5.444LysGly: 5.444 ± 0.863
1.051LysHis: 1.051 ± 0.348
5.73LysIle: 5.73 ± 0.888
8.022LysLys: 8.022 ± 1.21
7.067LysLeu: 7.067 ± 0.911
2.674LysMet: 2.674 ± 0.679
6.781LysAsn: 6.781 ± 0.767
3.343LysPro: 3.343 ± 0.603
4.107LysGln: 4.107 ± 0.643
3.82LysArg: 3.82 ± 0.949
5.062LysSer: 5.062 ± 0.78
5.826LysThr: 5.826 ± 0.902
3.056LysVal: 3.056 ± 0.466
0.955LysTrp: 0.955 ± 0.28
3.629LysTyr: 3.629 ± 0.878
0.0LysXaa: 0.0 ± 0.0
Leu
4.871LeuAla: 4.871 ± 0.759
0.764LeuCys: 0.764 ± 0.276
4.966LeuAsp: 4.966 ± 0.532
5.444LeuGlu: 5.444 ± 1.026
3.343LeuPhe: 3.343 ± 0.592
4.298LeuGly: 4.298 ± 0.637
0.573LeuHis: 0.573 ± 0.228
4.68LeuIle: 4.68 ± 0.708
6.876LeuLys: 6.876 ± 0.881
5.348LeuLeu: 5.348 ± 0.98
1.528LeuMet: 1.528 ± 0.3
5.157LeuAsn: 5.157 ± 0.544
2.961LeuPro: 2.961 ± 0.408
3.152LeuGln: 3.152 ± 0.533
2.865LeuArg: 2.865 ± 0.631
5.826LeuSer: 5.826 ± 0.706
4.393LeuThr: 4.393 ± 0.84
4.202LeuVal: 4.202 ± 0.684
1.528LeuTrp: 1.528 ± 0.626
2.77LeuTyr: 2.77 ± 0.475
0.0LeuXaa: 0.0 ± 0.0
Met
1.719MetAla: 1.719 ± 0.41
0.096MetCys: 0.096 ± 0.09
1.051MetAsp: 1.051 ± 0.289
1.528MetGlu: 1.528 ± 0.391
0.955MetPhe: 0.955 ± 0.227
1.051MetGly: 1.051 ± 0.302
0.382MetHis: 0.382 ± 0.2
1.146MetIle: 1.146 ± 0.336
2.865MetLys: 2.865 ± 0.586
1.815MetLeu: 1.815 ± 0.382
0.382MetMet: 0.382 ± 0.222
1.528MetAsn: 1.528 ± 0.313
0.764MetPro: 0.764 ± 0.234
1.051MetGln: 1.051 ± 0.3
1.528MetArg: 1.528 ± 0.407
1.337MetSer: 1.337 ± 0.386
2.292MetThr: 2.292 ± 0.495
1.719MetVal: 1.719 ± 0.386
0.096MetTrp: 0.096 ± 0.111
0.382MetTyr: 0.382 ± 0.249
0.0MetXaa: 0.0 ± 0.0
Asn
5.444AsnAla: 5.444 ± 0.725
0.86AsnCys: 0.86 ± 0.292
3.534AsnAsp: 3.534 ± 0.568
4.202AsnGlu: 4.202 ± 0.854
2.961AsnPhe: 2.961 ± 0.563
6.876AsnGly: 6.876 ± 1.077
1.146AsnHis: 1.146 ± 0.369
4.489AsnIle: 4.489 ± 0.612
5.253AsnLys: 5.253 ± 0.617
4.871AsnLeu: 4.871 ± 0.572
1.91AsnMet: 1.91 ± 0.455
4.871AsnAsn: 4.871 ± 0.88
2.388AsnPro: 2.388 ± 0.409
2.101AsnGln: 2.101 ± 0.475
2.197AsnArg: 2.197 ± 0.454
3.438AsnSer: 3.438 ± 0.813
3.343AsnThr: 3.343 ± 0.444
4.107AsnVal: 4.107 ± 0.733
1.146AsnTrp: 1.146 ± 0.34
2.865AsnTyr: 2.865 ± 0.647
0.0AsnXaa: 0.0 ± 0.0
Pro
0.764ProAla: 0.764 ± 0.293
0.191ProCys: 0.191 ± 0.146
2.292ProAsp: 2.292 ± 0.455
2.101ProGlu: 2.101 ± 0.44
1.433ProPhe: 1.433 ± 0.328
0.955ProGly: 0.955 ± 0.281
0.287ProHis: 0.287 ± 0.156
2.197ProIle: 2.197 ± 0.496
2.388ProLys: 2.388 ± 0.637
1.91ProLeu: 1.91 ± 0.514
0.287ProMet: 0.287 ± 0.136
1.624ProAsn: 1.624 ± 0.418
1.146ProPro: 1.146 ± 0.243
1.051ProGln: 1.051 ± 0.372
1.337ProArg: 1.337 ± 0.404
2.101ProSer: 2.101 ± 0.482
2.674ProThr: 2.674 ± 0.486
2.101ProVal: 2.101 ± 0.524
0.478ProTrp: 0.478 ± 0.188
0.764ProTyr: 0.764 ± 0.268
0.0ProXaa: 0.0 ± 0.0
Gln
4.011GlnAla: 4.011 ± 0.68
0.191GlnCys: 0.191 ± 0.113
0.955GlnAsp: 0.955 ± 0.294
3.152GlnGlu: 3.152 ± 0.398
1.433GlnPhe: 1.433 ± 0.388
2.579GlnGly: 2.579 ± 0.702
0.478GlnHis: 0.478 ± 0.228
2.292GlnIle: 2.292 ± 0.587
2.579GlnLys: 2.579 ± 0.447
2.961GlnLeu: 2.961 ± 0.501
1.146GlnMet: 1.146 ± 0.267
1.624GlnAsn: 1.624 ± 0.269
1.242GlnPro: 1.242 ± 0.478
2.101GlnGln: 2.101 ± 0.41
1.337GlnArg: 1.337 ± 0.379
2.77GlnSer: 2.77 ± 0.464
2.197GlnThr: 2.197 ± 0.492
2.483GlnVal: 2.483 ± 0.615
0.669GlnTrp: 0.669 ± 0.3
1.433GlnTyr: 1.433 ± 0.26
0.0GlnXaa: 0.0 ± 0.0
Arg
1.719ArgAla: 1.719 ± 0.436
0.287ArgCys: 0.287 ± 0.16
2.388ArgAsp: 2.388 ± 0.434
3.056ArgGlu: 3.056 ± 0.68
2.388ArgPhe: 2.388 ± 0.462
2.388ArgGly: 2.388 ± 0.395
0.382ArgHis: 0.382 ± 0.168
3.152ArgIle: 3.152 ± 0.651
3.725ArgLys: 3.725 ± 0.543
4.871ArgLeu: 4.871 ± 0.758
1.242ArgMet: 1.242 ± 0.395
2.961ArgAsn: 2.961 ± 0.462
0.573ArgPro: 0.573 ± 0.285
1.146ArgGln: 1.146 ± 0.376
1.433ArgArg: 1.433 ± 0.42
2.77ArgSer: 2.77 ± 0.484
2.006ArgThr: 2.006 ± 0.506
2.865ArgVal: 2.865 ± 0.548
0.86ArgTrp: 0.86 ± 0.266
2.197ArgTyr: 2.197 ± 0.723
0.0ArgXaa: 0.0 ± 0.0
Ser
3.916SerAla: 3.916 ± 0.897
0.287SerCys: 0.287 ± 0.143
4.298SerAsp: 4.298 ± 0.58
5.348SerGlu: 5.348 ± 0.681
3.629SerPhe: 3.629 ± 0.75
4.202SerGly: 4.202 ± 0.637
0.478SerHis: 0.478 ± 0.186
4.775SerIle: 4.775 ± 0.894
4.298SerLys: 4.298 ± 0.834
4.584SerLeu: 4.584 ± 0.592
1.051SerMet: 1.051 ± 0.323
5.157SerAsn: 5.157 ± 1.035
1.242SerPro: 1.242 ± 0.364
2.865SerGln: 2.865 ± 0.521
2.483SerArg: 2.483 ± 0.42
4.871SerSer: 4.871 ± 1.273
3.916SerThr: 3.916 ± 0.789
4.871SerVal: 4.871 ± 0.698
0.955SerTrp: 0.955 ± 0.297
2.483SerTyr: 2.483 ± 0.434
0.0SerXaa: 0.0 ± 0.0
Thr
3.82ThrAla: 3.82 ± 0.645
0.382ThrCys: 0.382 ± 0.208
3.916ThrAsp: 3.916 ± 0.635
4.011ThrGlu: 4.011 ± 0.707
3.916ThrPhe: 3.916 ± 0.699
3.916ThrGly: 3.916 ± 0.638
0.573ThrHis: 0.573 ± 0.228
4.584ThrIle: 4.584 ± 0.853
5.921ThrLys: 5.921 ± 0.709
4.202ThrLeu: 4.202 ± 0.568
1.528ThrMet: 1.528 ± 0.47
2.579ThrAsn: 2.579 ± 0.579
1.051ThrPro: 1.051 ± 0.39
1.719ThrGln: 1.719 ± 0.333
2.674ThrArg: 2.674 ± 0.493
4.107ThrSer: 4.107 ± 0.718
4.298ThrThr: 4.298 ± 0.612
5.539ThrVal: 5.539 ± 0.86
0.669ThrTrp: 0.669 ± 0.391
2.961ThrTyr: 2.961 ± 0.557
0.0ThrXaa: 0.0 ± 0.0
Val
3.534ValAla: 3.534 ± 0.597
0.478ValCys: 0.478 ± 0.224
3.534ValAsp: 3.534 ± 0.863
4.489ValGlu: 4.489 ± 0.597
2.77ValPhe: 2.77 ± 0.584
3.629ValGly: 3.629 ± 0.575
0.86ValHis: 0.86 ± 0.433
4.202ValIle: 4.202 ± 0.691
6.303ValLys: 6.303 ± 0.958
4.966ValLeu: 4.966 ± 0.895
1.146ValMet: 1.146 ± 0.382
5.539ValAsn: 5.539 ± 0.881
1.242ValPro: 1.242 ± 0.331
2.006ValGln: 2.006 ± 0.548
2.006ValArg: 2.006 ± 0.495
4.68ValSer: 4.68 ± 0.653
4.584ValThr: 4.584 ± 0.902
4.298ValVal: 4.298 ± 0.798
1.051ValTrp: 1.051 ± 0.268
1.624ValTyr: 1.624 ± 0.401
0.0ValXaa: 0.0 ± 0.0
Trp
1.242TrpAla: 1.242 ± 0.358
0.191TrpCys: 0.191 ± 0.126
1.051TrpAsp: 1.051 ± 0.377
1.242TrpGlu: 1.242 ± 0.42
0.86TrpPhe: 0.86 ± 0.24
0.955TrpGly: 0.955 ± 0.342
0.191TrpHis: 0.191 ± 0.134
0.955TrpIle: 0.955 ± 0.272
1.146TrpLys: 1.146 ± 0.32
0.669TrpLeu: 0.669 ± 0.287
0.191TrpMet: 0.191 ± 0.144
1.528TrpAsn: 1.528 ± 0.629
0.191TrpPro: 0.191 ± 0.122
1.242TrpGln: 1.242 ± 0.383
1.146TrpArg: 1.146 ± 0.265
0.669TrpSer: 0.669 ± 0.293
0.86TrpThr: 0.86 ± 0.262
1.337TrpVal: 1.337 ± 0.34
0.096TrpTrp: 0.096 ± 0.098
0.287TrpTyr: 0.287 ± 0.177
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.006TyrAla: 2.006 ± 0.446
0.287TyrCys: 0.287 ± 0.185
2.579TyrAsp: 2.579 ± 0.514
2.197TyrGlu: 2.197 ± 0.466
2.006TyrPhe: 2.006 ± 0.468
2.674TyrGly: 2.674 ± 0.706
0.382TyrHis: 0.382 ± 0.184
2.77TyrIle: 2.77 ± 0.792
2.961TyrLys: 2.961 ± 0.67
2.579TyrLeu: 2.579 ± 0.518
0.669TyrMet: 0.669 ± 0.243
2.292TyrAsn: 2.292 ± 0.469
1.528TyrPro: 1.528 ± 0.45
1.91TyrGln: 1.91 ± 0.428
2.961TyrArg: 2.961 ± 0.7
2.961TyrSer: 2.961 ± 0.447
2.197TyrThr: 2.197 ± 0.418
2.292TyrVal: 2.292 ± 0.41
0.573TyrTrp: 0.573 ± 0.178
2.197TyrTyr: 2.197 ± 0.63
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (10472 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski