Amino acid dipepetide frequency for Streptococcus phage CHPC1091

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.108AlaAla: 3.108 ± 1.009
0.178AlaCys: 0.178 ± 0.123
5.417AlaAsp: 5.417 ± 1.218
3.374AlaGlu: 3.374 ± 0.492
1.776AlaPhe: 1.776 ± 0.527
4.44AlaGly: 4.44 ± 0.798
1.154AlaHis: 1.154 ± 0.403
4.973AlaIle: 4.973 ± 0.98
5.683AlaLys: 5.683 ± 1.044
5.95AlaLeu: 5.95 ± 0.833
2.042AlaMet: 2.042 ± 0.415
4.44AlaAsn: 4.44 ± 0.781
1.776AlaPro: 1.776 ± 0.431
2.486AlaGln: 2.486 ± 0.556
2.753AlaArg: 2.753 ± 0.508
3.907AlaSer: 3.907 ± 0.705
4.351AlaThr: 4.351 ± 0.639
3.818AlaVal: 3.818 ± 0.695
1.066AlaTrp: 1.066 ± 0.277
3.019AlaTyr: 3.019 ± 0.575
0.0AlaXaa: 0.0 ± 0.0
Cys
0.178CysAla: 0.178 ± 0.121
0.0CysCys: 0.0 ± 0.0
0.622CysAsp: 0.622 ± 0.333
0.355CysGlu: 0.355 ± 0.199
0.178CysPhe: 0.178 ± 0.179
0.266CysGly: 0.266 ± 0.177
0.178CysHis: 0.178 ± 0.125
0.178CysIle: 0.178 ± 0.154
0.355CysLys: 0.355 ± 0.165
0.444CysLeu: 0.444 ± 0.263
0.089CysMet: 0.089 ± 0.09
0.266CysAsn: 0.266 ± 0.161
0.266CysPro: 0.266 ± 0.168
0.178CysGln: 0.178 ± 0.144
0.266CysArg: 0.266 ± 0.208
0.444CysSer: 0.444 ± 0.219
0.355CysThr: 0.355 ± 0.193
0.355CysVal: 0.355 ± 0.131
0.178CysTrp: 0.178 ± 0.126
0.089CysTyr: 0.089 ± 0.1
0.0CysXaa: 0.0 ± 0.0
Asp
3.197AspAla: 3.197 ± 0.507
0.355AspCys: 0.355 ± 0.236
4.795AspAsp: 4.795 ± 0.773
3.996AspGlu: 3.996 ± 0.683
4.44AspPhe: 4.44 ± 0.68
6.927AspGly: 6.927 ± 1.526
0.799AspHis: 0.799 ± 0.276
5.772AspIle: 5.772 ± 0.804
4.351AspLys: 4.351 ± 0.626
3.641AspLeu: 3.641 ± 0.792
2.309AspMet: 2.309 ± 0.368
4.351AspAsn: 4.351 ± 0.693
2.398AspPro: 2.398 ± 0.476
1.687AspGln: 1.687 ± 0.28
2.575AspArg: 2.575 ± 0.488
3.818AspSer: 3.818 ± 0.709
4.174AspThr: 4.174 ± 0.627
3.463AspVal: 3.463 ± 0.783
1.243AspTrp: 1.243 ± 0.3
2.842AspTyr: 2.842 ± 0.565
0.0AspXaa: 0.0 ± 0.0
Glu
3.818GluAla: 3.818 ± 0.741
0.266GluCys: 0.266 ± 0.141
2.93GluAsp: 2.93 ± 0.643
3.818GluGlu: 3.818 ± 0.734
2.486GluPhe: 2.486 ± 0.504
3.463GluGly: 3.463 ± 0.408
1.421GluHis: 1.421 ± 0.405
5.595GluIle: 5.595 ± 0.934
4.618GluLys: 4.618 ± 0.928
6.216GluLeu: 6.216 ± 0.996
2.309GluMet: 2.309 ± 0.461
3.818GluAsn: 3.818 ± 0.663
2.309GluPro: 2.309 ± 0.622
2.93GluGln: 2.93 ± 0.478
3.108GluArg: 3.108 ± 0.676
3.286GluSer: 3.286 ± 0.573
2.842GluThr: 2.842 ± 0.554
5.328GluVal: 5.328 ± 0.716
1.243GluTrp: 1.243 ± 0.347
3.374GluTyr: 3.374 ± 0.545
0.0GluXaa: 0.0 ± 0.0
Phe
3.019PheAla: 3.019 ± 0.558
0.266PheCys: 0.266 ± 0.152
3.197PheAsp: 3.197 ± 0.463
2.309PheGlu: 2.309 ± 0.552
1.687PhePhe: 1.687 ± 0.314
3.108PheGly: 3.108 ± 0.597
0.622PheHis: 0.622 ± 0.178
2.22PheIle: 2.22 ± 0.417
3.907PheLys: 3.907 ± 0.81
2.753PheLeu: 2.753 ± 0.562
0.533PheMet: 0.533 ± 0.208
3.907PheAsn: 3.907 ± 0.568
0.355PhePro: 0.355 ± 0.158
1.51PheGln: 1.51 ± 0.31
1.51PheArg: 1.51 ± 0.378
3.019PheSer: 3.019 ± 0.743
2.842PheThr: 2.842 ± 0.588
2.309PheVal: 2.309 ± 0.45
0.533PheTrp: 0.533 ± 0.274
2.309PheTyr: 2.309 ± 0.433
0.0PheXaa: 0.0 ± 0.0
Gly
3.996GlyAla: 3.996 ± 0.766
0.178GlyCys: 0.178 ± 0.134
4.884GlyAsp: 4.884 ± 0.651
3.286GlyGlu: 3.286 ± 0.623
2.93GlyPhe: 2.93 ± 0.479
4.529GlyGly: 4.529 ± 0.776
1.154GlyHis: 1.154 ± 0.28
5.328GlyIle: 5.328 ± 0.82
7.193GlyLys: 7.193 ± 0.82
5.861GlyLeu: 5.861 ± 0.797
1.687GlyMet: 1.687 ± 0.417
4.085GlyAsn: 4.085 ± 0.747
1.154GlyPro: 1.154 ± 0.64
2.842GlyGln: 2.842 ± 0.653
3.463GlyArg: 3.463 ± 0.669
5.506GlySer: 5.506 ± 0.769
4.262GlyThr: 4.262 ± 0.633
3.286GlyVal: 3.286 ± 0.642
1.243GlyTrp: 1.243 ± 0.436
2.842GlyTyr: 2.842 ± 0.495
0.0GlyXaa: 0.0 ± 0.0
His
0.355HisAla: 0.355 ± 0.137
0.089HisCys: 0.089 ± 0.091
1.066HisAsp: 1.066 ± 0.301
0.71HisGlu: 0.71 ± 0.258
0.622HisPhe: 0.622 ± 0.227
0.977HisGly: 0.977 ± 0.297
0.71HisHis: 0.71 ± 0.233
1.066HisIle: 1.066 ± 0.351
1.51HisLys: 1.51 ± 0.489
0.888HisLeu: 0.888 ± 0.273
0.622HisMet: 0.622 ± 0.24
0.533HisAsn: 0.533 ± 0.223
0.71HisPro: 0.71 ± 0.202
0.622HisGln: 0.622 ± 0.264
0.977HisArg: 0.977 ± 0.268
0.977HisSer: 0.977 ± 0.273
0.622HisThr: 0.622 ± 0.2
1.598HisVal: 1.598 ± 0.281
0.178HisTrp: 0.178 ± 0.123
0.799HisTyr: 0.799 ± 0.309
0.0HisXaa: 0.0 ± 0.0
Ile
5.062IleAla: 5.062 ± 0.982
0.355IleCys: 0.355 ± 0.2
5.683IleAsp: 5.683 ± 0.721
5.151IleGlu: 5.151 ± 0.847
1.776IlePhe: 1.776 ± 0.441
4.529IleGly: 4.529 ± 0.594
0.888IleHis: 0.888 ± 0.262
3.197IleIle: 3.197 ± 0.509
5.683IleLys: 5.683 ± 0.703
3.552IleLeu: 3.552 ± 0.713
1.776IleMet: 1.776 ± 0.566
4.529IleAsn: 4.529 ± 0.582
3.641IlePro: 3.641 ± 0.637
2.753IleGln: 2.753 ± 0.476
2.842IleArg: 2.842 ± 0.597
3.996IleSer: 3.996 ± 0.505
4.085IleThr: 4.085 ± 0.443
3.552IleVal: 3.552 ± 0.683
1.154IleTrp: 1.154 ± 0.275
2.398IleTyr: 2.398 ± 0.499
0.0IleXaa: 0.0 ± 0.0
Lys
5.772LysAla: 5.772 ± 0.556
0.266LysCys: 0.266 ± 0.153
4.795LysAsp: 4.795 ± 0.762
7.637LysGlu: 7.637 ± 1.047
3.552LysPhe: 3.552 ± 0.802
5.506LysGly: 5.506 ± 0.651
1.154LysHis: 1.154 ± 0.398
5.328LysIle: 5.328 ± 0.814
6.483LysLys: 6.483 ± 0.893
6.571LysLeu: 6.571 ± 0.914
2.042LysMet: 2.042 ± 0.522
4.707LysAsn: 4.707 ± 0.661
2.575LysPro: 2.575 ± 0.378
3.197LysGln: 3.197 ± 0.631
3.019LysArg: 3.019 ± 0.459
3.552LysSer: 3.552 ± 0.61
5.328LysThr: 5.328 ± 0.801
4.44LysVal: 4.44 ± 0.679
1.243LysTrp: 1.243 ± 0.257
2.93LysTyr: 2.93 ± 0.5
0.0LysXaa: 0.0 ± 0.0
Leu
6.394LeuAla: 6.394 ± 0.787
0.622LeuCys: 0.622 ± 0.239
5.239LeuAsp: 5.239 ± 0.763
6.66LeuGlu: 6.66 ± 1.032
2.842LeuPhe: 2.842 ± 0.473
5.506LeuGly: 5.506 ± 0.988
0.977LeuHis: 0.977 ± 0.286
4.529LeuIle: 4.529 ± 0.593
7.015LeuLys: 7.015 ± 0.69
4.529LeuLeu: 4.529 ± 0.737
2.486LeuMet: 2.486 ± 0.397
4.707LeuAsn: 4.707 ± 0.538
2.842LeuPro: 2.842 ± 0.449
2.753LeuGln: 2.753 ± 0.519
3.108LeuArg: 3.108 ± 0.55
4.618LeuSer: 4.618 ± 0.759
5.772LeuThr: 5.772 ± 0.766
4.085LeuVal: 4.085 ± 0.598
0.622LeuTrp: 0.622 ± 0.286
2.042LeuTyr: 2.042 ± 0.467
0.0LeuXaa: 0.0 ± 0.0
Met
2.042MetAla: 2.042 ± 0.462
0.089MetCys: 0.089 ± 0.1
0.799MetAsp: 0.799 ± 0.249
1.421MetGlu: 1.421 ± 0.455
1.332MetPhe: 1.332 ± 0.307
1.154MetGly: 1.154 ± 0.258
0.533MetHis: 0.533 ± 0.238
1.51MetIle: 1.51 ± 0.4
2.575MetLys: 2.575 ± 0.537
1.776MetLeu: 1.776 ± 0.285
0.444MetMet: 0.444 ± 0.233
1.51MetAsn: 1.51 ± 0.299
1.421MetPro: 1.421 ± 0.344
0.799MetGln: 0.799 ± 0.223
1.332MetArg: 1.332 ± 0.298
1.776MetSer: 1.776 ± 0.386
1.687MetThr: 1.687 ± 0.396
2.042MetVal: 2.042 ± 0.437
0.089MetTrp: 0.089 ± 0.069
0.888MetTyr: 0.888 ± 0.287
0.0MetXaa: 0.0 ± 0.0
Asn
5.151AsnAla: 5.151 ± 0.923
0.444AsnCys: 0.444 ± 0.234
3.907AsnAsp: 3.907 ± 0.573
3.552AsnGlu: 3.552 ± 0.615
2.131AsnPhe: 2.131 ± 0.524
6.838AsnGly: 6.838 ± 1.017
1.332AsnHis: 1.332 ± 0.305
3.463AsnIle: 3.463 ± 0.603
3.463AsnLys: 3.463 ± 0.439
5.506AsnLeu: 5.506 ± 0.52
0.977AsnMet: 0.977 ± 0.357
4.174AsnAsn: 4.174 ± 0.609
2.842AsnPro: 2.842 ± 0.537
2.753AsnGln: 2.753 ± 0.502
2.042AsnArg: 2.042 ± 0.451
3.907AsnSer: 3.907 ± 0.596
3.374AsnThr: 3.374 ± 0.613
2.842AsnVal: 2.842 ± 0.399
1.598AsnTrp: 1.598 ± 0.407
2.664AsnTyr: 2.664 ± 0.586
0.0AsnXaa: 0.0 ± 0.0
Pro
1.687ProAla: 1.687 ± 0.325
0.089ProCys: 0.089 ± 0.09
1.776ProAsp: 1.776 ± 0.388
2.398ProGlu: 2.398 ± 0.557
1.066ProPhe: 1.066 ± 0.289
1.243ProGly: 1.243 ± 0.392
0.355ProHis: 0.355 ± 0.159
1.51ProIle: 1.51 ± 0.343
3.463ProLys: 3.463 ± 0.521
2.22ProLeu: 2.22 ± 0.476
0.355ProMet: 0.355 ± 0.17
2.398ProAsn: 2.398 ± 0.408
0.888ProPro: 0.888 ± 0.391
1.598ProGln: 1.598 ± 0.373
1.154ProArg: 1.154 ± 0.408
2.753ProSer: 2.753 ± 0.504
2.842ProThr: 2.842 ± 0.431
1.776ProVal: 1.776 ± 0.532
0.799ProTrp: 0.799 ± 0.238
1.243ProTyr: 1.243 ± 0.521
0.0ProXaa: 0.0 ± 0.0
Gln
3.818GlnAla: 3.818 ± 0.428
0.178GlnCys: 0.178 ± 0.142
2.131GlnAsp: 2.131 ± 0.392
2.93GlnGlu: 2.93 ± 0.475
1.51GlnPhe: 1.51 ± 0.413
3.818GlnGly: 3.818 ± 0.83
0.622GlnHis: 0.622 ± 0.251
2.486GlnIle: 2.486 ± 0.603
2.753GlnLys: 2.753 ± 0.551
3.108GlnLeu: 3.108 ± 0.43
1.154GlnMet: 1.154 ± 0.27
2.309GlnAsn: 2.309 ± 0.392
0.266GlnPro: 0.266 ± 0.166
2.398GlnGln: 2.398 ± 0.441
1.687GlnArg: 1.687 ± 0.35
2.398GlnSer: 2.398 ± 0.427
2.842GlnThr: 2.842 ± 0.547
2.309GlnVal: 2.309 ± 0.657
0.71GlnTrp: 0.71 ± 0.356
2.131GlnTyr: 2.131 ± 0.442
0.0GlnXaa: 0.0 ± 0.0
Arg
1.865ArgAla: 1.865 ± 0.352
0.178ArgCys: 0.178 ± 0.126
2.486ArgAsp: 2.486 ± 0.443
2.93ArgGlu: 2.93 ± 0.688
2.042ArgPhe: 2.042 ± 0.415
2.753ArgGly: 2.753 ± 0.61
0.622ArgHis: 0.622 ± 0.268
3.463ArgIle: 3.463 ± 0.563
3.019ArgLys: 3.019 ± 0.51
3.73ArgLeu: 3.73 ± 0.761
1.243ArgMet: 1.243 ± 0.361
2.753ArgAsn: 2.753 ± 0.463
1.332ArgPro: 1.332 ± 0.366
1.687ArgGln: 1.687 ± 0.431
1.598ArgArg: 1.598 ± 0.484
1.421ArgSer: 1.421 ± 0.278
2.575ArgThr: 2.575 ± 0.73
2.753ArgVal: 2.753 ± 0.418
1.066ArgTrp: 1.066 ± 0.335
2.309ArgTyr: 2.309 ± 0.479
0.0ArgXaa: 0.0 ± 0.0
Ser
3.019SerAla: 3.019 ± 0.942
0.71SerCys: 0.71 ± 0.249
3.996SerAsp: 3.996 ± 0.65
3.641SerGlu: 3.641 ± 0.58
2.93SerPhe: 2.93 ± 0.479
4.174SerGly: 4.174 ± 0.62
0.444SerHis: 0.444 ± 0.177
4.795SerIle: 4.795 ± 0.57
4.174SerLys: 4.174 ± 0.713
4.085SerLeu: 4.085 ± 0.53
1.51SerMet: 1.51 ± 0.332
5.151SerAsn: 5.151 ± 0.627
2.042SerPro: 2.042 ± 0.404
2.93SerGln: 2.93 ± 0.486
2.486SerArg: 2.486 ± 0.684
3.197SerSer: 3.197 ± 0.525
3.552SerThr: 3.552 ± 0.494
4.884SerVal: 4.884 ± 0.781
0.799SerTrp: 0.799 ± 0.367
2.131SerTyr: 2.131 ± 0.547
0.0SerXaa: 0.0 ± 0.0
Thr
4.884ThrAla: 4.884 ± 0.635
0.266ThrCys: 0.266 ± 0.154
4.085ThrAsp: 4.085 ± 0.596
2.842ThrGlu: 2.842 ± 0.461
3.286ThrPhe: 3.286 ± 0.622
3.463ThrGly: 3.463 ± 0.524
1.243ThrHis: 1.243 ± 0.305
4.262ThrIle: 4.262 ± 0.796
4.973ThrLys: 4.973 ± 0.556
6.66ThrLeu: 6.66 ± 0.768
1.066ThrMet: 1.066 ± 0.302
3.907ThrAsn: 3.907 ± 0.678
1.51ThrPro: 1.51 ± 0.528
3.019ThrGln: 3.019 ± 0.476
2.042ThrArg: 2.042 ± 0.378
3.552ThrSer: 3.552 ± 0.659
2.93ThrThr: 2.93 ± 0.566
4.529ThrVal: 4.529 ± 0.498
1.154ThrTrp: 1.154 ± 0.337
3.197ThrTyr: 3.197 ± 0.59
0.0ThrXaa: 0.0 ± 0.0
Val
4.44ValAla: 4.44 ± 0.831
0.266ValCys: 0.266 ± 0.192
5.151ValAsp: 5.151 ± 0.583
4.262ValGlu: 4.262 ± 0.695
2.309ValPhe: 2.309 ± 0.444
4.529ValGly: 4.529 ± 0.621
0.533ValHis: 0.533 ± 0.167
3.818ValIle: 3.818 ± 0.539
5.239ValLys: 5.239 ± 0.774
4.973ValLeu: 4.973 ± 0.737
1.154ValMet: 1.154 ± 0.304
3.286ValAsn: 3.286 ± 0.702
1.776ValPro: 1.776 ± 0.391
1.776ValGln: 1.776 ± 0.484
1.954ValArg: 1.954 ± 0.357
4.351ValSer: 4.351 ± 0.722
4.884ValThr: 4.884 ± 0.813
3.374ValVal: 3.374 ± 0.405
0.71ValTrp: 0.71 ± 0.288
1.954ValTyr: 1.954 ± 0.467
0.0ValXaa: 0.0 ± 0.0
Trp
0.622TrpAla: 0.622 ± 0.213
0.089TrpCys: 0.089 ± 0.092
1.598TrpAsp: 1.598 ± 0.503
1.421TrpGlu: 1.421 ± 0.327
0.799TrpPhe: 0.799 ± 0.227
0.71TrpGly: 0.71 ± 0.307
0.266TrpHis: 0.266 ± 0.128
0.533TrpIle: 0.533 ± 0.172
0.799TrpLys: 0.799 ± 0.21
1.332TrpLeu: 1.332 ± 0.322
0.266TrpMet: 0.266 ± 0.135
0.444TrpAsn: 0.444 ± 0.279
0.089TrpPro: 0.089 ± 0.089
1.066TrpGln: 1.066 ± 0.273
0.977TrpArg: 0.977 ± 0.243
1.776TrpSer: 1.776 ± 0.742
1.154TrpThr: 1.154 ± 0.609
1.51TrpVal: 1.51 ± 0.305
0.266TrpTrp: 0.266 ± 0.177
0.444TrpTyr: 0.444 ± 0.179
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.286TyrAla: 3.286 ± 0.539
0.355TyrCys: 0.355 ± 0.24
2.575TyrAsp: 2.575 ± 0.429
2.486TyrGlu: 2.486 ± 0.539
2.22TyrPhe: 2.22 ± 0.461
1.776TyrGly: 1.776 ± 0.467
0.622TyrHis: 0.622 ± 0.205
2.575TyrIle: 2.575 ± 0.487
2.842TyrLys: 2.842 ± 0.541
3.552TyrLeu: 3.552 ± 0.489
1.066TyrMet: 1.066 ± 0.318
1.776TyrAsn: 1.776 ± 0.333
1.421TyrPro: 1.421 ± 0.452
2.575TyrGln: 2.575 ± 0.509
2.93TyrArg: 2.93 ± 0.585
2.398TyrSer: 2.398 ± 0.553
2.398TyrThr: 2.398 ± 0.645
2.486TyrVal: 2.486 ± 0.506
0.266TyrTrp: 0.266 ± 0.136
2.22TyrTyr: 2.22 ± 0.775
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (11262 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski