Amino acid dipepetide frequency for Streptococcus phage SW21

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.112AlaAla: 3.112 ± 0.936
0.094AlaCys: 0.094 ± 0.1
4.81AlaAsp: 4.81 ± 0.814
3.112AlaGlu: 3.112 ± 0.544
1.886AlaPhe: 1.886 ± 0.528
4.433AlaGly: 4.433 ± 0.82
0.943AlaHis: 0.943 ± 0.329
4.621AlaIle: 4.621 ± 0.84
5.659AlaLys: 5.659 ± 0.932
6.225AlaLeu: 6.225 ± 0.692
1.32AlaMet: 1.32 ± 0.351
4.81AlaAsn: 4.81 ± 0.811
1.509AlaPro: 1.509 ± 0.337
2.641AlaGln: 2.641 ± 0.638
2.641AlaArg: 2.641 ± 0.431
4.716AlaSer: 4.716 ± 0.599
4.81AlaThr: 4.81 ± 0.818
3.867AlaVal: 3.867 ± 0.703
1.132AlaTrp: 1.132 ± 0.315
2.264AlaTyr: 2.264 ± 0.416
0.0AlaXaa: 0.0 ± 0.0
Cys
0.094CysAla: 0.094 ± 0.08
0.094CysCys: 0.094 ± 0.084
0.66CysAsp: 0.66 ± 0.311
0.189CysGlu: 0.189 ± 0.122
0.189CysPhe: 0.189 ± 0.143
0.189CysGly: 0.189 ± 0.125
0.189CysHis: 0.189 ± 0.131
0.094CysIle: 0.094 ± 0.084
0.283CysLys: 0.283 ± 0.196
0.66CysLeu: 0.66 ± 0.275
0.0CysMet: 0.0 ± 0.0
0.472CysAsn: 0.472 ± 0.237
0.189CysPro: 0.189 ± 0.142
0.189CysGln: 0.189 ± 0.143
0.377CysArg: 0.377 ± 0.267
0.189CysSer: 0.189 ± 0.137
0.283CysThr: 0.283 ± 0.164
0.189CysVal: 0.189 ± 0.094
0.189CysTrp: 0.189 ± 0.147
0.094CysTyr: 0.094 ± 0.084
0.0CysXaa: 0.0 ± 0.0
Asp
3.773AspAla: 3.773 ± 0.49
0.283AspCys: 0.283 ± 0.179
3.867AspAsp: 3.867 ± 0.553
4.527AspGlu: 4.527 ± 0.657
2.924AspPhe: 2.924 ± 0.553
7.073AspGly: 7.073 ± 1.038
1.32AspHis: 1.32 ± 0.333
4.904AspIle: 4.904 ± 0.854
5.376AspLys: 5.376 ± 0.597
3.961AspLeu: 3.961 ± 1.009
2.546AspMet: 2.546 ± 0.454
3.867AspAsn: 3.867 ± 0.677
2.264AspPro: 2.264 ± 0.516
1.415AspGln: 1.415 ± 0.304
1.792AspArg: 1.792 ± 0.408
3.867AspSer: 3.867 ± 0.508
4.527AspThr: 4.527 ± 0.492
3.961AspVal: 3.961 ± 0.7
1.226AspTrp: 1.226 ± 0.211
3.018AspTyr: 3.018 ± 0.589
0.0AspXaa: 0.0 ± 0.0
Glu
3.773GluAla: 3.773 ± 0.547
0.283GluCys: 0.283 ± 0.133
3.112GluAsp: 3.112 ± 0.499
4.244GluGlu: 4.244 ± 0.928
2.546GluPhe: 2.546 ± 0.588
3.301GluGly: 3.301 ± 0.389
1.037GluHis: 1.037 ± 0.313
5.659GluIle: 5.659 ± 0.885
3.584GluLys: 3.584 ± 0.643
6.413GluLeu: 6.413 ± 0.751
1.886GluMet: 1.886 ± 0.365
4.527GluAsn: 4.527 ± 0.699
2.169GluPro: 2.169 ± 0.609
2.641GluGln: 2.641 ± 0.495
3.207GluArg: 3.207 ± 0.657
3.018GluSer: 3.018 ± 0.432
3.584GluThr: 3.584 ± 0.676
4.244GluVal: 4.244 ± 0.489
1.132GluTrp: 1.132 ± 0.369
3.112GluTyr: 3.112 ± 0.645
0.0GluXaa: 0.0 ± 0.0
Phe
3.018PheAla: 3.018 ± 0.45
0.189PheCys: 0.189 ± 0.143
3.678PheAsp: 3.678 ± 0.649
2.075PheGlu: 2.075 ± 0.374
1.886PhePhe: 1.886 ± 0.432
3.018PheGly: 3.018 ± 0.535
0.472PheHis: 0.472 ± 0.15
2.829PheIle: 2.829 ± 0.618
3.961PheLys: 3.961 ± 0.676
2.735PheLeu: 2.735 ± 0.54
0.283PheMet: 0.283 ± 0.127
3.018PheAsn: 3.018 ± 0.809
0.283PhePro: 0.283 ± 0.146
1.415PheGln: 1.415 ± 0.345
1.132PheArg: 1.132 ± 0.286
3.112PheSer: 3.112 ± 0.599
3.112PheThr: 3.112 ± 0.62
2.452PheVal: 2.452 ± 0.416
0.377PheTrp: 0.377 ± 0.229
2.169PheTyr: 2.169 ± 0.44
0.0PheXaa: 0.0 ± 0.0
Gly
3.678GlyAla: 3.678 ± 0.758
0.189GlyCys: 0.189 ± 0.143
4.716GlyAsp: 4.716 ± 0.57
3.584GlyGlu: 3.584 ± 0.709
3.207GlyPhe: 3.207 ± 0.397
4.527GlyGly: 4.527 ± 0.897
0.755GlyHis: 0.755 ± 0.259
5.942GlyIle: 5.942 ± 0.665
6.696GlyLys: 6.696 ± 0.682
5.47GlyLeu: 5.47 ± 0.833
1.32GlyMet: 1.32 ± 0.365
3.773GlyAsn: 3.773 ± 0.767
0.849GlyPro: 0.849 ± 0.273
2.735GlyGln: 2.735 ± 0.575
3.018GlyArg: 3.018 ± 0.569
4.81GlySer: 4.81 ± 0.89
4.433GlyThr: 4.433 ± 0.713
3.867GlyVal: 3.867 ± 0.724
1.226GlyTrp: 1.226 ± 0.343
2.641GlyTyr: 2.641 ± 0.605
0.0GlyXaa: 0.0 ± 0.0
His
0.377HisAla: 0.377 ± 0.198
0.094HisCys: 0.094 ± 0.103
0.66HisAsp: 0.66 ± 0.19
0.472HisGlu: 0.472 ± 0.199
0.755HisPhe: 0.755 ± 0.25
0.66HisGly: 0.66 ± 0.21
0.566HisHis: 0.566 ± 0.166
1.132HisIle: 1.132 ± 0.326
1.037HisLys: 1.037 ± 0.327
0.755HisLeu: 0.755 ± 0.255
0.472HisMet: 0.472 ± 0.189
0.755HisAsn: 0.755 ± 0.342
0.66HisPro: 0.66 ± 0.244
0.755HisGln: 0.755 ± 0.267
0.755HisArg: 0.755 ± 0.214
0.755HisSer: 0.755 ± 0.236
0.66HisThr: 0.66 ± 0.217
1.603HisVal: 1.603 ± 0.27
0.189HisTrp: 0.189 ± 0.15
0.943HisTyr: 0.943 ± 0.302
0.0HisXaa: 0.0 ± 0.0
Ile
5.093IleAla: 5.093 ± 0.926
0.472IleCys: 0.472 ± 0.213
5.47IleAsp: 5.47 ± 0.659
3.961IleGlu: 3.961 ± 0.654
1.509IlePhe: 1.509 ± 0.428
4.244IleGly: 4.244 ± 0.478
0.283IleHis: 0.283 ± 0.153
3.678IleIle: 3.678 ± 0.768
7.262IleLys: 7.262 ± 0.702
3.961IleLeu: 3.961 ± 0.767
1.981IleMet: 1.981 ± 0.434
4.15IleAsn: 4.15 ± 0.552
3.112IlePro: 3.112 ± 0.542
2.829IleGln: 2.829 ± 0.429
3.112IleArg: 3.112 ± 0.595
4.904IleSer: 4.904 ± 0.696
4.15IleThr: 4.15 ± 0.669
3.301IleVal: 3.301 ± 0.657
0.943IleTrp: 0.943 ± 0.264
2.452IleTyr: 2.452 ± 0.557
0.0IleXaa: 0.0 ± 0.0
Lys
5.093LysAla: 5.093 ± 0.625
0.377LysCys: 0.377 ± 0.243
4.338LysAsp: 4.338 ± 0.555
6.885LysGlu: 6.885 ± 0.837
3.961LysPhe: 3.961 ± 0.902
5.847LysGly: 5.847 ± 0.709
1.603LysHis: 1.603 ± 0.614
4.527LysIle: 4.527 ± 0.528
6.791LysLys: 6.791 ± 1.101
6.508LysLeu: 6.508 ± 0.683
2.169LysMet: 2.169 ± 0.509
6.13LysAsn: 6.13 ± 0.76
3.207LysPro: 3.207 ± 0.435
3.49LysGln: 3.49 ± 0.576
3.018LysArg: 3.018 ± 0.445
4.621LysSer: 4.621 ± 0.843
5.564LysThr: 5.564 ± 0.742
4.338LysVal: 4.338 ± 0.709
1.132LysTrp: 1.132 ± 0.275
2.924LysTyr: 2.924 ± 0.703
0.0LysXaa: 0.0 ± 0.0
Leu
6.885LeuAla: 6.885 ± 0.639
0.377LeuCys: 0.377 ± 0.195
5.753LeuAsp: 5.753 ± 0.665
6.791LeuGlu: 6.791 ± 1.024
2.735LeuPhe: 2.735 ± 0.402
4.904LeuGly: 4.904 ± 1.058
0.755LeuHis: 0.755 ± 0.306
4.716LeuIle: 4.716 ± 0.73
6.508LeuLys: 6.508 ± 0.744
4.81LeuLeu: 4.81 ± 0.679
2.546LeuMet: 2.546 ± 0.33
5.47LeuAsn: 5.47 ± 0.734
3.112LeuPro: 3.112 ± 0.464
2.452LeuGln: 2.452 ± 0.487
3.395LeuArg: 3.395 ± 0.718
4.81LeuSer: 4.81 ± 0.768
6.225LeuThr: 6.225 ± 0.901
4.15LeuVal: 4.15 ± 0.529
0.849LeuTrp: 0.849 ± 0.298
2.075LeuTyr: 2.075 ± 0.546
0.0LeuXaa: 0.0 ± 0.0
Met
2.075MetAla: 2.075 ± 0.397
0.094MetCys: 0.094 ± 0.084
1.037MetAsp: 1.037 ± 0.335
1.32MetGlu: 1.32 ± 0.389
1.415MetPhe: 1.415 ± 0.275
0.755MetGly: 0.755 ± 0.242
0.283MetHis: 0.283 ± 0.148
1.603MetIle: 1.603 ± 0.389
2.924MetLys: 2.924 ± 0.524
1.792MetLeu: 1.792 ± 0.348
0.472MetMet: 0.472 ± 0.227
1.037MetAsn: 1.037 ± 0.332
1.037MetPro: 1.037 ± 0.219
0.755MetGln: 0.755 ± 0.229
0.943MetArg: 0.943 ± 0.26
1.981MetSer: 1.981 ± 0.38
1.698MetThr: 1.698 ± 0.455
1.981MetVal: 1.981 ± 0.406
0.283MetTrp: 0.283 ± 0.168
0.755MetTyr: 0.755 ± 0.218
0.0MetXaa: 0.0 ± 0.0
Asn
4.81AsnAla: 4.81 ± 1.143
0.377AsnCys: 0.377 ± 0.179
3.773AsnAsp: 3.773 ± 0.6
3.584AsnGlu: 3.584 ± 0.511
2.735AsnPhe: 2.735 ± 0.431
6.979AsnGly: 6.979 ± 1.149
1.132AsnHis: 1.132 ± 0.303
4.15AsnIle: 4.15 ± 0.594
3.395AsnLys: 3.395 ± 0.493
4.81AsnLeu: 4.81 ± 0.67
1.32AsnMet: 1.32 ± 0.327
4.433AsnAsn: 4.433 ± 0.848
2.924AsnPro: 2.924 ± 0.501
2.924AsnGln: 2.924 ± 0.418
2.264AsnArg: 2.264 ± 0.583
3.395AsnSer: 3.395 ± 0.588
3.395AsnThr: 3.395 ± 0.591
3.301AsnVal: 3.301 ± 0.416
1.792AsnTrp: 1.792 ± 0.377
2.264AsnTyr: 2.264 ± 0.432
0.0AsnXaa: 0.0 ± 0.0
Pro
1.981ProAla: 1.981 ± 0.395
0.0ProCys: 0.0 ± 0.0
2.075ProAsp: 2.075 ± 0.486
2.735ProGlu: 2.735 ± 0.612
1.509ProPhe: 1.509 ± 0.368
0.849ProGly: 0.849 ± 0.264
0.472ProHis: 0.472 ± 0.194
1.603ProIle: 1.603 ± 0.294
3.584ProLys: 3.584 ± 0.579
2.452ProLeu: 2.452 ± 0.529
0.377ProMet: 0.377 ± 0.207
2.641ProAsn: 2.641 ± 0.488
0.66ProPro: 0.66 ± 0.363
1.415ProGln: 1.415 ± 0.277
1.037ProArg: 1.037 ± 0.368
2.452ProSer: 2.452 ± 0.496
2.264ProThr: 2.264 ± 0.384
1.886ProVal: 1.886 ± 0.554
0.472ProTrp: 0.472 ± 0.156
0.66ProTyr: 0.66 ± 0.264
0.0ProXaa: 0.0 ± 0.0
Gln
3.584GlnAla: 3.584 ± 0.568
0.283GlnCys: 0.283 ± 0.14
1.981GlnAsp: 1.981 ± 0.355
2.641GlnGlu: 2.641 ± 0.546
1.226GlnPhe: 1.226 ± 0.421
3.112GlnGly: 3.112 ± 0.729
0.377GlnHis: 0.377 ± 0.187
2.546GlnIle: 2.546 ± 0.557
3.112GlnLys: 3.112 ± 0.487
3.584GlnLeu: 3.584 ± 0.455
1.132GlnMet: 1.132 ± 0.293
2.452GlnAsn: 2.452 ± 0.528
0.283GlnPro: 0.283 ± 0.142
3.112GlnGln: 3.112 ± 0.621
1.509GlnArg: 1.509 ± 0.356
3.018GlnSer: 3.018 ± 0.429
2.546GlnThr: 2.546 ± 0.488
1.886GlnVal: 1.886 ± 0.578
0.849GlnTrp: 0.849 ± 0.345
1.981GlnTyr: 1.981 ± 0.503
0.0GlnXaa: 0.0 ± 0.0
Arg
1.981ArgAla: 1.981 ± 0.365
0.094ArgCys: 0.094 ± 0.112
2.546ArgAsp: 2.546 ± 0.544
2.075ArgGlu: 2.075 ± 0.422
2.169ArgPhe: 2.169 ± 0.435
2.641ArgGly: 2.641 ± 0.63
0.849ArgHis: 0.849 ± 0.257
3.018ArgIle: 3.018 ± 0.587
2.735ArgLys: 2.735 ± 0.571
3.49ArgLeu: 3.49 ± 0.693
1.226ArgMet: 1.226 ± 0.342
2.641ArgAsn: 2.641 ± 0.368
1.226ArgPro: 1.226 ± 0.307
1.886ArgGln: 1.886 ± 0.39
1.415ArgArg: 1.415 ± 0.359
2.075ArgSer: 2.075 ± 0.382
2.829ArgThr: 2.829 ± 0.777
2.735ArgVal: 2.735 ± 0.466
0.755ArgTrp: 0.755 ± 0.21
2.169ArgTyr: 2.169 ± 0.584
0.0ArgXaa: 0.0 ± 0.0
Ser
2.924SerAla: 2.924 ± 0.617
0.566SerCys: 0.566 ± 0.282
4.527SerAsp: 4.527 ± 0.64
3.207SerGlu: 3.207 ± 0.501
3.112SerPhe: 3.112 ± 0.646
4.433SerGly: 4.433 ± 0.576
0.377SerHis: 0.377 ± 0.144
4.81SerIle: 4.81 ± 0.446
5.093SerLys: 5.093 ± 0.757
5.093SerLeu: 5.093 ± 0.751
1.792SerMet: 1.792 ± 0.425
4.244SerAsn: 4.244 ± 0.783
2.358SerPro: 2.358 ± 0.482
3.112SerGln: 3.112 ± 0.649
3.018SerArg: 3.018 ± 0.544
4.999SerSer: 4.999 ± 1.154
3.773SerThr: 3.773 ± 0.565
5.187SerVal: 5.187 ± 0.853
0.755SerTrp: 0.755 ± 0.357
1.981SerTyr: 1.981 ± 0.495
0.0SerXaa: 0.0 ± 0.0
Thr
4.716ThrAla: 4.716 ± 0.821
0.283ThrCys: 0.283 ± 0.154
4.527ThrAsp: 4.527 ± 0.665
3.867ThrGlu: 3.867 ± 0.544
3.018ThrPhe: 3.018 ± 0.566
3.678ThrGly: 3.678 ± 0.527
0.943ThrHis: 0.943 ± 0.309
4.244ThrIle: 4.244 ± 0.742
5.564ThrLys: 5.564 ± 0.711
7.073ThrLeu: 7.073 ± 1.024
1.226ThrMet: 1.226 ± 0.337
3.678ThrAsn: 3.678 ± 0.642
1.698ThrPro: 1.698 ± 0.451
2.452ThrGln: 2.452 ± 0.519
1.698ThrArg: 1.698 ± 0.376
3.584ThrSer: 3.584 ± 0.564
3.301ThrThr: 3.301 ± 0.554
4.716ThrVal: 4.716 ± 0.447
1.32ThrTrp: 1.32 ± 0.395
3.678ThrTyr: 3.678 ± 0.62
0.0ThrXaa: 0.0 ± 0.0
Val
4.244ValAla: 4.244 ± 0.79
0.189ValCys: 0.189 ± 0.109
5.564ValAsp: 5.564 ± 0.626
4.244ValGlu: 4.244 ± 0.683
1.792ValPhe: 1.792 ± 0.358
4.621ValGly: 4.621 ± 0.594
0.66ValHis: 0.66 ± 0.212
4.338ValIle: 4.338 ± 0.532
5.47ValLys: 5.47 ± 0.678
3.961ValLeu: 3.961 ± 0.817
1.037ValMet: 1.037 ± 0.287
3.584ValAsn: 3.584 ± 0.925
2.169ValPro: 2.169 ± 0.53
1.792ValGln: 1.792 ± 0.443
2.264ValArg: 2.264 ± 0.641
4.81ValSer: 4.81 ± 0.691
4.716ValThr: 4.716 ± 0.77
4.15ValVal: 4.15 ± 0.716
0.849ValTrp: 0.849 ± 0.222
1.792ValTyr: 1.792 ± 0.346
0.0ValXaa: 0.0 ± 0.0
Trp
0.66TrpAla: 0.66 ± 0.222
0.094TrpCys: 0.094 ± 0.097
1.509TrpAsp: 1.509 ± 0.552
0.943TrpGlu: 0.943 ± 0.212
0.849TrpPhe: 0.849 ± 0.289
0.377TrpGly: 0.377 ± 0.173
0.283TrpHis: 0.283 ± 0.145
0.66TrpIle: 0.66 ± 0.26
0.755TrpLys: 0.755 ± 0.234
1.886TrpLeu: 1.886 ± 0.395
0.189TrpMet: 0.189 ± 0.124
0.66TrpAsn: 0.66 ± 0.253
0.189TrpPro: 0.189 ± 0.143
0.755TrpGln: 0.755 ± 0.266
0.943TrpArg: 0.943 ± 0.258
1.792TrpSer: 1.792 ± 0.564
1.509TrpThr: 1.509 ± 0.477
1.32TrpVal: 1.32 ± 0.322
0.189TrpTrp: 0.189 ± 0.122
0.377TrpTyr: 0.377 ± 0.266
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.735TyrAla: 2.735 ± 0.314
0.377TyrCys: 0.377 ± 0.28
1.886TyrAsp: 1.886 ± 0.36
3.018TyrGlu: 3.018 ± 0.558
1.792TyrPhe: 1.792 ± 0.339
1.792TyrGly: 1.792 ± 0.594
0.755TyrHis: 0.755 ± 0.245
1.792TyrIle: 1.792 ± 0.345
2.735TyrLys: 2.735 ± 0.471
3.678TyrLeu: 3.678 ± 0.608
0.849TyrMet: 0.849 ± 0.261
1.415TyrAsn: 1.415 ± 0.379
1.32TyrPro: 1.32 ± 0.393
2.452TyrGln: 2.452 ± 0.321
3.018TyrArg: 3.018 ± 0.662
2.452TyrSer: 2.452 ± 0.672
1.886TyrThr: 1.886 ± 0.535
3.112TyrVal: 3.112 ± 0.459
0.283TyrTrp: 0.283 ± 0.163
2.169TyrTyr: 2.169 ± 0.746
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 39 proteins (10604 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski