Amino acid dipeptide frequency for Streptococcus phage Str03

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be expected at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.
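The expected value quoted above follows from simple arithmetic, which can be sketched in Python. This is a minimal illustration, assuming a uniform model in which every ordered pair of residues is equally likely; the function names and the over/under rule are hypothetical and not taken from the Proteome-pI code.

```python
def expected_per_mille(alphabet_size: int) -> float:
    """Expected frequency (in per mille) of a single dipeptide when all
    ordered residue pairs are equally likely (uniform-model assumption)."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille: float, expected: float) -> str:
    """Hypothetical rule: compare an observed value with the expectation."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


exp20 = expected_per_mille(20)  # 20 standard amino acids -> 400 dipeptides
exp21 = expected_per_mille(21)  # with Xaa -> 441 dipeptides

# GluLeu (9.609) and AlaCys (0.297) are values from the table.
print(exp20, classify(9.609, exp20), classify(0.297, exp20))
```

With 20 residues the expectation is 2.5‰ (0.25%); with Xaa included it drops slightly, to roughly 2.27‰.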

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
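As an illustration of how such per mille values could be obtained, here is a minimal Python sketch. It assumes one-letter amino acid codes (the table uses three-letter codes), overlapping dipeptide windows counted within each protein, and any non-standard letter mapped to X (Xaa); all function names are hypothetical. The denominator is left configurable because the page does not state it: the smallest non-zero table value (0.099‰) appears consistent with one occurrence divided by the 10096 residues of the proteome, but that is an inference, not a documented fact.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein.
    Pairs are never formed across protein boundaries."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to 'X' (Xaa).
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    return counts


def per_mille(counts, total=None):
    """Convert counts to per mille. By default the denominator is the
    number of dipeptides; pass `total` to use another one (assumption)."""
    if total is None:
        total = sum(counts.values())
    if not total:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


toy = ["MKAAL", "AAK"]           # toy sequences, not real phage proteins
counts = dipeptide_counts(toy)   # MK, KA, AA, AL + AA, AK
freqs = per_mille(counts)
fraction_aa = freqs["AA"] * 1e-3  # per mille -> fraction
print(counts["AA"], round(freqs["AA"], 3), round(fraction_aa, 4))
```

Multiplying a per mille value by 10^-3 gives the fraction, so a table entry such as 9.609‰ corresponds to about 0.0096, or 0.96%.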

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.566 ± 0.692
AlaCys: 0.297 ± 0.202
AlaAsp: 4.557 ± 0.575
AlaGlu: 5.745 ± 1.1
AlaPhe: 2.476 ± 0.554
AlaGly: 3.665 ± 0.702
AlaHis: 0.594 ± 0.252
AlaIle: 5.944 ± 1.194
AlaLys: 5.844 ± 0.748
AlaLeu: 5.448 ± 0.76
AlaMet: 2.675 ± 0.52
AlaAsn: 3.368 ± 0.627
AlaPro: 1.189 ± 0.388
AlaGln: 2.972 ± 0.411
AlaArg: 3.467 ± 0.589
AlaSer: 4.656 ± 0.82
AlaThr: 3.863 ± 0.671
AlaVal: 5.349 ± 0.948
AlaTrp: 1.189 ± 0.366
AlaTyr: 2.774 ± 0.535
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.297 ± 0.153
CysCys: 0.0 ± 0.0
CysAsp: 0.297 ± 0.18
CysGlu: 0.297 ± 0.169
CysPhe: 0.198 ± 0.148
CysGly: 0.297 ± 0.255
CysHis: 0.495 ± 0.221
CysIle: 0.099 ± 0.102
CysLys: 0.594 ± 0.284
CysLeu: 0.693 ± 0.277
CysMet: 0.0 ± 0.0
CysAsn: 0.297 ± 0.153
CysPro: 0.099 ± 0.098
CysGln: 0.198 ± 0.15
CysArg: 0.198 ± 0.154
CysSer: 0.396 ± 0.249
CysThr: 0.198 ± 0.144
CysVal: 0.495 ± 0.245
CysTrp: 0.0 ± 0.0
CysTyr: 0.495 ± 0.223
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.873 ± 0.556
AspCys: 0.198 ± 0.135
AspAsp: 4.953 ± 0.945
AspGlu: 4.854 ± 0.678
AspPhe: 2.873 ± 0.539
AspGly: 4.061 ± 0.464
AspHis: 0.693 ± 0.246
AspIle: 5.944 ± 0.846
AspLys: 5.745 ± 0.739
AspLeu: 5.052 ± 0.733
AspMet: 2.08 ± 0.471
AspAsn: 3.566 ± 0.489
AspPro: 1.387 ± 0.407
AspGln: 1.486 ± 0.316
AspArg: 2.377 ± 0.443
AspSer: 4.16 ± 0.683
AspThr: 3.269 ± 0.501
AspVal: 2.972 ± 0.48
AspTrp: 1.486 ± 0.411
AspTyr: 2.675 ± 0.649
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.745 ± 0.633
GluCys: 0.396 ± 0.252
GluAsp: 3.467 ± 0.545
GluGlu: 6.934 ± 1.091
GluPhe: 2.675 ± 0.594
GluGly: 3.17 ± 0.548
GluHis: 1.09 ± 0.377
GluIle: 5.448 ± 0.688
GluLys: 6.34 ± 1.089
GluLeu: 9.609 ± 0.923
GluMet: 1.783 ± 0.4
GluAsn: 4.359 ± 0.903
GluPro: 1.684 ± 0.483
GluGln: 2.972 ± 0.523
GluArg: 3.269 ± 0.589
GluSer: 3.962 ± 0.613
GluThr: 6.538 ± 0.589
GluVal: 5.448 ± 0.607
GluTrp: 0.991 ± 0.319
GluTyr: 2.278 ± 0.552
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.585 ± 0.459
PheCys: 0.099 ± 0.107
PheAsp: 4.557 ± 0.843
PheGlu: 3.467 ± 0.612
PhePhe: 0.991 ± 0.352
PheGly: 2.08 ± 0.465
PheHis: 0.792 ± 0.274
PheIle: 2.576 ± 0.491
PheLys: 3.17 ± 0.609
PheLeu: 2.675 ± 0.642
PheMet: 1.387 ± 0.428
PheAsn: 2.873 ± 0.429
PhePro: 0.892 ± 0.391
PheGln: 1.486 ± 0.379
PheArg: 1.387 ± 0.324
PheSer: 2.675 ± 0.495
PheThr: 2.377 ± 0.556
PheVal: 1.486 ± 0.384
PheTrp: 0.198 ± 0.151
PheTyr: 1.684 ± 0.363
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.854 ± 1.126
GlyCys: 0.495 ± 0.214
GlyAsp: 2.972 ± 0.524
GlyGlu: 3.665 ± 0.497
GlyPhe: 2.576 ± 0.518
GlyGly: 3.764 ± 0.69
GlyHis: 1.288 ± 0.326
GlyIle: 4.557 ± 0.512
GlyLys: 5.349 ± 0.642
GlyLeu: 4.359 ± 0.886
GlyMet: 1.783 ± 0.543
GlyAsn: 3.269 ± 0.602
GlyPro: 0.297 ± 0.149
GlyGln: 2.576 ± 0.613
GlyArg: 2.377 ± 0.737
GlySer: 3.863 ± 0.703
GlyThr: 3.467 ± 0.802
GlyVal: 3.566 ± 0.679
GlyTrp: 0.991 ± 0.324
GlyTyr: 2.774 ± 0.629
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.594 ± 0.239
HisCys: 0.099 ± 0.101
HisAsp: 1.189 ± 0.405
HisGlu: 1.387 ± 0.308
HisPhe: 0.495 ± 0.194
HisGly: 0.892 ± 0.295
HisHis: 0.198 ± 0.139
HisIle: 1.783 ± 0.426
HisLys: 1.09 ± 0.375
HisLeu: 1.189 ± 0.377
HisMet: 0.198 ± 0.128
HisAsn: 0.792 ± 0.305
HisPro: 0.198 ± 0.146
HisGln: 0.495 ± 0.202
HisArg: 0.297 ± 0.183
HisSer: 1.288 ± 0.303
HisThr: 1.189 ± 0.41
HisVal: 0.892 ± 0.234
HisTrp: 0.198 ± 0.156
HisTyr: 1.585 ± 0.501
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.448 ± 0.879
IleCys: 0.594 ± 0.417
IleAsp: 6.043 ± 0.807
IleGlu: 5.547 ± 0.739
IlePhe: 2.377 ± 0.517
IleGly: 3.566 ± 0.735
IleHis: 1.09 ± 0.318
IleIle: 3.566 ± 0.725
IleLys: 6.439 ± 0.861
IleLeu: 5.151 ± 0.812
IleMet: 1.288 ± 0.368
IleAsn: 4.26 ± 0.641
IlePro: 1.882 ± 0.452
IleGln: 2.476 ± 0.483
IleArg: 2.972 ± 0.531
IleSer: 3.863 ± 0.499
IleThr: 5.448 ± 0.754
IleVal: 3.764 ± 0.791
IleTrp: 0.693 ± 0.281
IleTyr: 2.179 ± 0.423
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.934 ± 0.724
LysCys: 0.198 ± 0.138
LysAsp: 3.962 ± 0.677
LysGlu: 7.528 ± 0.864
LysPhe: 2.675 ± 0.397
LysGly: 4.557 ± 0.589
LysHis: 1.288 ± 0.327
LysIle: 5.448 ± 0.457
LysLys: 6.142 ± 0.885
LysLeu: 6.736 ± 0.968
LysMet: 2.576 ± 0.546
LysAsn: 5.151 ± 1.003
LysPro: 2.278 ± 0.45
LysGln: 3.467 ± 0.451
LysArg: 2.576 ± 0.584
LysSer: 5.745 ± 0.722
LysThr: 6.34 ± 0.847
LysVal: 4.656 ± 0.86
LysTrp: 0.693 ± 0.25
LysTyr: 2.972 ± 0.703
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.241 ± 0.675
LeuCys: 0.495 ± 0.263
LeuAsp: 5.844 ± 0.638
LeuGlu: 7.727 ± 0.914
LeuPhe: 4.16 ± 0.68
LeuGly: 5.25 ± 0.869
LeuHis: 1.387 ± 0.378
LeuIle: 5.944 ± 0.933
LeuLys: 8.222 ± 0.988
LeuLeu: 5.448 ± 0.722
LeuMet: 1.882 ± 0.495
LeuAsn: 4.656 ± 0.754
LeuPro: 2.476 ± 0.576
LeuGln: 2.278 ± 0.608
LeuArg: 4.755 ± 0.569
LeuSer: 6.736 ± 0.939
LeuThr: 5.745 ± 0.812
LeuVal: 4.16 ± 0.756
LeuTrp: 0.892 ± 0.469
LeuTyr: 3.17 ± 0.534
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.684 ± 0.352
MetCys: 0.396 ± 0.212
MetAsp: 1.684 ± 0.464
MetGlu: 1.882 ± 0.456
MetPhe: 0.495 ± 0.238
MetGly: 1.585 ± 0.316
MetHis: 0.495 ± 0.249
MetIle: 1.189 ± 0.383
MetLys: 2.873 ± 0.481
MetLeu: 1.981 ± 0.444
MetMet: 0.594 ± 0.262
MetAsn: 1.783 ± 0.424
MetPro: 1.189 ± 0.331
MetGln: 0.297 ± 0.186
MetArg: 0.892 ± 0.294
MetSer: 1.684 ± 0.511
MetThr: 2.377 ± 0.614
MetVal: 1.387 ± 0.392
MetTrp: 0.198 ± 0.159
MetTyr: 0.297 ± 0.188
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.458 ± 0.807
AsnCys: 0.594 ± 0.32
AsnAsp: 2.972 ± 0.695
AsnGlu: 4.458 ± 0.525
AsnPhe: 3.071 ± 0.63
AsnGly: 4.755 ± 0.925
AsnHis: 1.387 ± 0.412
AsnIle: 3.17 ± 0.683
AsnLys: 3.467 ± 0.544
AsnLeu: 6.043 ± 0.892
AsnMet: 1.387 ± 0.437
AsnAsn: 3.071 ± 0.509
AsnPro: 2.278 ± 0.445
AsnGln: 2.179 ± 0.495
AsnArg: 3.368 ± 0.571
AsnSer: 3.368 ± 0.625
AsnThr: 2.476 ± 0.537
AsnVal: 2.873 ± 0.636
AsnTrp: 0.792 ± 0.29
AsnTyr: 2.08 ± 0.423
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.09 ± 0.303
ProCys: 0.198 ± 0.153
ProAsp: 1.783 ± 0.414
ProGlu: 1.288 ± 0.388
ProPhe: 0.792 ± 0.286
ProGly: 1.09 ± 0.357
ProHis: 0.594 ± 0.219
ProIle: 2.278 ± 0.576
ProLys: 1.783 ± 0.477
ProLeu: 2.476 ± 0.502
ProMet: 0.396 ± 0.22
ProAsn: 1.684 ± 0.347
ProPro: 0.495 ± 0.26
ProGln: 1.09 ± 0.305
ProArg: 1.486 ± 0.429
ProSer: 1.783 ± 0.586
ProThr: 1.387 ± 0.342
ProVal: 1.783 ± 0.347
ProTrp: 0.396 ± 0.218
ProTyr: 0.892 ± 0.237
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.26 ± 0.607
GlnCys: 0.198 ± 0.144
GlnAsp: 1.288 ± 0.314
GlnGlu: 2.278 ± 0.46
GlnPhe: 1.189 ± 0.297
GlnGly: 2.08 ± 0.447
GlnHis: 0.198 ± 0.157
GlnIle: 2.278 ± 0.437
GlnLys: 3.17 ± 0.591
GlnLeu: 5.25 ± 0.837
GlnMet: 0.792 ± 0.251
GlnAsn: 1.486 ± 0.356
GlnPro: 1.09 ± 0.415
GlnGln: 1.684 ± 0.476
GlnArg: 2.576 ± 0.536
GlnSer: 2.08 ± 0.663
GlnThr: 1.585 ± 0.366
GlnVal: 2.179 ± 0.478
GlnTrp: 0.297 ± 0.151
GlnTyr: 1.387 ± 0.408
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.972 ± 0.63
ArgCys: 0.693 ± 0.284
ArgAsp: 2.377 ± 0.546
ArgGlu: 3.764 ± 0.84
ArgPhe: 1.585 ± 0.369
ArgGly: 2.08 ± 0.445
ArgHis: 0.991 ± 0.34
ArgIle: 3.071 ± 0.546
ArgLys: 3.467 ± 0.545
ArgLeu: 3.665 ± 0.547
ArgMet: 0.693 ± 0.205
ArgAsn: 2.675 ± 0.535
ArgPro: 1.882 ± 0.529
ArgGln: 2.08 ± 0.469
ArgArg: 2.278 ± 0.621
ArgSer: 2.576 ± 0.45
ArgThr: 2.774 ± 0.832
ArgVal: 3.467 ± 0.539
ArgTrp: 0.495 ± 0.208
ArgTyr: 1.981 ± 0.459
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.854 ± 0.806
SerCys: 0.099 ± 0.088
SerAsp: 3.863 ± 0.574
SerGlu: 3.863 ± 0.614
SerPhe: 3.764 ± 0.822
SerGly: 4.16 ± 1.02
SerHis: 0.792 ± 0.284
SerIle: 3.665 ± 0.652
SerLys: 4.26 ± 0.678
SerLeu: 6.34 ± 0.609
SerMet: 1.09 ± 0.344
SerAsn: 4.656 ± 0.713
SerPro: 1.387 ± 0.401
SerGln: 2.972 ± 0.484
SerArg: 3.17 ± 0.764
SerSer: 5.448 ± 1.153
SerThr: 4.458 ± 0.97
SerVal: 3.467 ± 0.616
SerTrp: 0.297 ± 0.184
SerTyr: 2.576 ± 0.719
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.854 ± 0.82
ThrCys: 0.297 ± 0.171
ThrAsp: 3.665 ± 0.468
ThrGlu: 4.854 ± 0.719
ThrPhe: 2.179 ± 0.362
ThrGly: 4.953 ± 0.837
ThrHis: 0.792 ± 0.259
ThrIle: 5.25 ± 0.742
ThrLys: 5.745 ± 0.534
ThrLeu: 5.349 ± 0.721
ThrMet: 1.09 ± 0.33
ThrAsn: 3.368 ± 0.671
ThrPro: 1.684 ± 0.51
ThrGln: 2.179 ± 0.422
ThrArg: 2.377 ± 0.521
ThrSer: 3.665 ± 1.031
ThrThr: 4.061 ± 0.775
ThrVal: 6.043 ± 0.745
ThrTrp: 0.991 ± 0.431
ThrTyr: 2.576 ± 0.551
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.863 ± 0.626
ValCys: 0.099 ± 0.107
ValAsp: 4.26 ± 0.674
ValGlu: 4.458 ± 0.743
ValPhe: 1.684 ± 0.683
ValGly: 3.863 ± 0.846
ValHis: 0.892 ± 0.322
ValIle: 3.764 ± 0.601
ValLys: 4.359 ± 0.601
ValLeu: 3.962 ± 0.632
ValMet: 1.882 ± 0.373
ValAsn: 3.764 ± 0.552
ValPro: 1.387 ± 0.426
ValGln: 1.783 ± 0.505
ValArg: 2.675 ± 0.545
ValSer: 4.26 ± 0.638
ValThr: 5.745 ± 0.79
ValVal: 3.665 ± 0.668
ValTrp: 1.189 ± 0.361
ValTyr: 1.882 ± 0.482
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.792 ± 0.369
TrpCys: 0.099 ± 0.109
TrpAsp: 0.396 ± 0.168
TrpGlu: 0.594 ± 0.26
TrpPhe: 0.594 ± 0.246
TrpGly: 1.09 ± 0.393
TrpHis: 0.297 ± 0.205
TrpIle: 0.792 ± 0.293
TrpLys: 0.594 ± 0.255
TrpLeu: 1.486 ± 0.357
TrpMet: 0.396 ± 0.243
TrpAsn: 0.892 ± 0.295
TrpPro: 0.099 ± 0.088
TrpGln: 0.495 ± 0.208
TrpArg: 0.792 ± 0.281
TrpSer: 1.288 ± 0.587
TrpThr: 0.991 ± 0.274
TrpVal: 0.396 ± 0.202
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.396 ± 0.182
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.675 ± 0.613
TyrCys: 0.099 ± 0.101
TyrAsp: 2.576 ± 0.483
TyrGlu: 3.566 ± 0.745
TyrPhe: 1.585 ± 0.47
TyrGly: 1.882 ± 0.441
TyrHis: 0.693 ± 0.212
TyrIle: 1.981 ± 0.49
TyrLys: 3.269 ± 0.766
TyrLeu: 4.557 ± 0.818
TyrMet: 0.892 ± 0.346
TyrAsn: 2.278 ± 0.657
TyrPro: 0.892 ± 0.307
TyrGln: 2.179 ± 0.412
TyrArg: 2.278 ± 0.597
TyrSer: 1.684 ± 0.424
TyrThr: 1.684 ± 0.429
TyrVal: 1.387 ± 0.374
TyrTrp: 0.495 ± 0.247
TyrTyr: 1.882 ± 0.498
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (10096 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
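Bootstrapping at the protein level can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: it assumes that each of the 100 rounds resamples whole proteins with replacement, recomputes the dipeptide frequency, and reports the standard deviation of the resampled values as the error. The function names, one-letter sequence input, and the choice of standard deviation are assumptions.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_per_mille(proteins, dipeptide):
    """Per mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the standard
    deviation of the recomputed frequencies (assumed error measure)."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return stdev(estimates)


toy = ["MKAAL", "AAK", "GGAAG", "MKK"]  # toy sequences, not real data
print(dipeptide_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects how much a frequency depends on a few proteins that happen to be rich in a given dipeptide.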

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski