Amino acid dipeptide frequency for Streptococcus phage CHPC1027

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred uniformly at random in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
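As a minimal sketch of the calculation described above (not the actual Proteome-pI code), the Python below counts overlapping dipeptides within each protein, converts the counts to per mille, and labels a frequency relative to the uniform expectation of 1/400. The function names, the one-letter input sequences, the mapping of any non-standard letter to X (Xaa), and the normalization by the total number of dipeptides counted are all assumptions made for illustration.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Dipeptides are counted as overlapping pairs within each protein,
    never across protein boundaries. Non-standard letters become 'X'.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # corresponds to red in the table
    if freq_per_mille < expected:
        return "under"  # corresponds to blue in the table
    return "expected"

# Toy input (hypothetical sequences, not taken from the phage proteome).
freqs = dipeptide_per_mille(["MKKLA", "AKK"])
```

With the toy input, six dipeptides are counted in total (MK, KK, KL, LA, AK, KK), and no AA pair is produced because pairs never span two proteins.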

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.
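Because the expected frequency is quoted in percent while the table uses per mille, a small conversion helper can make the comparison explicit. This is only an illustrative sketch; the function names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (1% equals 10 per mille)."""
    return value / 10.0

gly_lys = 7.303  # GlyLys value from the table, in per mille
print(per_mille_to_fraction(gly_lys))  # fraction of all dipeptides
print(per_mille_to_percent(gly_lys))   # same value in percent
```

On this scale the uniform expectation of 0.25% corresponds to 2.5‰, which is the threshold to compare table values against.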

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.525 ± 1.067
AlaCys: 0.18 ± 0.145
AlaAsp: 4.328 ± 0.715
AlaGlu: 3.516 ± 0.435
AlaPhe: 2.074 ± 0.615
AlaGly: 3.787 ± 0.8
AlaHis: 0.811 ± 0.278
AlaIle: 4.688 ± 0.722
AlaLys: 6.221 ± 1.189
AlaLeu: 5.59 ± 0.818
AlaMet: 1.352 ± 0.337
AlaAsn: 4.418 ± 0.632
AlaPro: 2.074 ± 0.427
AlaGln: 2.434 ± 0.44
AlaArg: 2.705 ± 0.483
AlaSer: 4.688 ± 0.616
AlaThr: 4.148 ± 0.798
AlaVal: 3.426 ± 0.618
AlaTrp: 0.902 ± 0.257
AlaTyr: 2.615 ± 0.488
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.18 ± 0.105
CysCys: 0.09 ± 0.082
CysAsp: 0.811 ± 0.28
CysGlu: 0.27 ± 0.166
CysPhe: 0.361 ± 0.217
CysGly: 0.361 ± 0.203
CysHis: 0.18 ± 0.124
CysIle: 0.18 ± 0.151
CysLys: 0.361 ± 0.187
CysLeu: 0.451 ± 0.24
CysMet: 0.09 ± 0.088
CysAsn: 0.361 ± 0.213
CysPro: 0.27 ± 0.154
CysGln: 0.27 ± 0.16
CysArg: 0.541 ± 0.269
CysSer: 0.451 ± 0.276
CysThr: 0.451 ± 0.212
CysVal: 0.18 ± 0.103
CysTrp: 0.18 ± 0.128
CysTyr: 0.18 ± 0.119
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.516 ± 0.75
AspCys: 0.361 ± 0.168
AspAsp: 4.148 ± 0.556
AspGlu: 4.057 ± 0.651
AspPhe: 3.426 ± 0.643
AspGly: 5.951 ± 0.932
AspHis: 1.082 ± 0.344
AspIle: 4.418 ± 0.706
AspLys: 5.229 ± 0.709
AspLeu: 3.967 ± 0.781
AspMet: 2.254 ± 0.526
AspAsn: 4.598 ± 0.77
AspPro: 1.984 ± 0.432
AspGln: 1.352 ± 0.297
AspArg: 3.246 ± 0.444
AspSer: 3.697 ± 0.574
AspThr: 4.148 ± 0.639
AspVal: 4.508 ± 0.671
AspTrp: 1.082 ± 0.29
AspTyr: 3.246 ± 0.477
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.057 ± 0.543
GluCys: 0.361 ± 0.158
GluAsp: 3.697 ± 0.699
GluGlu: 3.877 ± 0.795
GluPhe: 2.795 ± 0.594
GluGly: 3.516 ± 0.479
GluHis: 1.352 ± 0.354
GluIle: 5.77 ± 0.693
GluLys: 3.877 ± 0.812
GluLeu: 6.311 ± 0.948
GluMet: 2.254 ± 0.509
GluAsn: 3.967 ± 0.693
GluPro: 1.984 ± 0.518
GluGln: 2.434 ± 0.569
GluArg: 3.066 ± 0.61
GluSer: 2.885 ± 0.49
GluThr: 2.975 ± 0.495
GluVal: 4.779 ± 0.649
GluTrp: 1.352 ± 0.322
GluTyr: 2.975 ± 0.681
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.246 ± 0.563
PheCys: 0.361 ± 0.224
PheAsp: 3.516 ± 0.581
PheGlu: 2.344 ± 0.613
PhePhe: 1.803 ± 0.373
PheGly: 3.156 ± 0.638
PheHis: 0.541 ± 0.144
PheIle: 2.525 ± 0.539
PheLys: 4.418 ± 0.48
PheLeu: 3.156 ± 0.481
PheMet: 0.451 ± 0.192
PheAsn: 3.787 ± 0.687
PhePro: 0.541 ± 0.199
PheGln: 1.082 ± 0.244
PheArg: 1.533 ± 0.326
PheSer: 2.885 ± 0.495
PheThr: 3.156 ± 0.717
PheVal: 2.525 ± 0.354
PheTrp: 0.811 ± 0.258
PheTyr: 1.713 ± 0.386
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.066 ± 0.667
GlyCys: 0.451 ± 0.215
GlyAsp: 3.877 ± 0.486
GlyGlu: 3.697 ± 0.561
GlyPhe: 3.336 ± 0.506
GlyGly: 4.328 ± 0.853
GlyHis: 0.902 ± 0.281
GlyIle: 4.869 ± 0.729
GlyLys: 7.303 ± 0.94
GlyLeu: 6.402 ± 0.904
GlyMet: 1.352 ± 0.301
GlyAsn: 4.418 ± 0.594
GlyPro: 1.262 ± 0.402
GlyGln: 2.795 ± 0.668
GlyArg: 2.975 ± 0.494
GlySer: 4.598 ± 0.698
GlyThr: 4.148 ± 0.649
GlyVal: 4.238 ± 0.63
GlyTrp: 1.262 ± 0.371
GlyTyr: 3.156 ± 0.517
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.361 ± 0.154
HisCys: 0.18 ± 0.148
HisAsp: 0.902 ± 0.282
HisGlu: 0.631 ± 0.238
HisPhe: 0.811 ± 0.251
HisGly: 0.992 ± 0.315
HisHis: 0.361 ± 0.175
HisIle: 1.443 ± 0.337
HisLys: 0.902 ± 0.328
HisLeu: 1.262 ± 0.277
HisMet: 0.361 ± 0.219
HisAsn: 0.451 ± 0.171
HisPro: 0.631 ± 0.193
HisGln: 0.721 ± 0.257
HisArg: 1.082 ± 0.313
HisSer: 0.992 ± 0.23
HisThr: 0.631 ± 0.208
HisVal: 1.352 ± 0.211
HisTrp: 0.09 ± 0.091
HisTyr: 0.811 ± 0.289
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.598 ± 0.892
IleCys: 0.451 ± 0.198
IleAsp: 5.229 ± 0.735
IleGlu: 4.418 ± 0.696
IlePhe: 2.074 ± 0.466
IleGly: 4.238 ± 0.736
IleHis: 0.631 ± 0.219
IleIle: 4.057 ± 0.748
IleLys: 6.762 ± 0.671
IleLeu: 3.877 ± 0.658
IleMet: 1.893 ± 0.464
IleAsn: 4.057 ± 0.504
IlePro: 3.426 ± 0.546
IleGln: 2.525 ± 0.4
IleArg: 2.615 ± 0.399
IleSer: 4.688 ± 0.54
IleThr: 3.336 ± 0.657
IleVal: 3.156 ± 0.561
IleTrp: 0.902 ± 0.24
IleTyr: 2.434 ± 0.607
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.41 ± 0.551
LysCys: 0.451 ± 0.295
LysAsp: 5.41 ± 0.798
LysGlu: 7.123 ± 0.97
LysPhe: 3.156 ± 0.804
LysGly: 6.402 ± 0.708
LysHis: 1.352 ± 0.453
LysIle: 5.59 ± 0.648
LysLys: 7.123 ± 1.141
LysLeu: 6.672 ± 0.773
LysMet: 1.803 ± 0.484
LysAsn: 4.688 ± 0.601
LysPro: 2.975 ± 0.487
LysGln: 3.787 ± 0.591
LysArg: 4.238 ± 0.583
LysSer: 4.238 ± 0.588
LysThr: 5.5 ± 0.758
LysVal: 4.688 ± 0.662
LysTrp: 1.172 ± 0.292
LysTyr: 3.697 ± 0.896
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.402 ± 0.883
LeuCys: 0.631 ± 0.231
LeuAsp: 5.59 ± 0.724
LeuGlu: 6.131 ± 0.939
LeuPhe: 3.066 ± 0.416
LeuGly: 5.77 ± 0.876
LeuHis: 0.992 ± 0.328
LeuIle: 3.516 ± 0.516
LeuLys: 6.943 ± 0.712
LeuLeu: 5.229 ± 0.671
LeuMet: 2.164 ± 0.454
LeuAsn: 5.41 ± 0.738
LeuPro: 2.705 ± 0.464
LeuGln: 2.885 ± 0.583
LeuArg: 3.697 ± 0.834
LeuSer: 4.598 ± 0.706
LeuThr: 6.041 ± 0.828
LeuVal: 4.148 ± 0.68
LeuTrp: 0.811 ± 0.289
LeuTyr: 1.893 ± 0.391
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.984 ± 0.355
MetCys: 0.0 ± 0.0
MetAsp: 0.721 ± 0.225
MetGlu: 1.623 ± 0.369
MetPhe: 1.082 ± 0.295
MetGly: 0.902 ± 0.291
MetHis: 0.27 ± 0.167
MetIle: 1.443 ± 0.376
MetLys: 2.795 ± 0.501
MetLeu: 1.533 ± 0.278
MetMet: 0.361 ± 0.186
MetAsn: 1.262 ± 0.328
MetPro: 0.992 ± 0.266
MetGln: 0.902 ± 0.232
MetArg: 0.721 ± 0.223
MetSer: 1.803 ± 0.506
MetThr: 1.082 ± 0.296
MetVal: 1.984 ± 0.395
MetTrp: 0.09 ± 0.076
MetTyr: 0.811 ± 0.222
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.688 ± 1.117
AsnCys: 0.361 ± 0.206
AsnAsp: 3.967 ± 0.495
AsnGlu: 3.877 ± 0.643
AsnPhe: 2.705 ± 0.523
AsnGly: 7.213 ± 1.116
AsnHis: 1.262 ± 0.33
AsnIle: 3.697 ± 0.451
AsnLys: 4.238 ± 0.531
AsnLeu: 5.32 ± 0.658
AsnMet: 0.992 ± 0.293
AsnAsn: 3.967 ± 0.767
AsnPro: 3.246 ± 0.535
AsnGln: 2.705 ± 0.349
AsnArg: 2.434 ± 0.544
AsnSer: 3.877 ± 0.599
AsnThr: 3.426 ± 0.533
AsnVal: 3.607 ± 0.49
AsnTrp: 1.352 ± 0.308
AsnTyr: 1.713 ± 0.371
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.533 ± 0.316
ProCys: 0.18 ± 0.163
ProAsp: 1.533 ± 0.44
ProGlu: 2.615 ± 0.489
ProPhe: 1.262 ± 0.242
ProGly: 1.352 ± 0.417
ProHis: 0.27 ± 0.126
ProIle: 1.713 ± 0.367
ProLys: 3.697 ± 0.525
ProLeu: 2.795 ± 0.382
ProMet: 0.27 ± 0.153
ProAsn: 2.525 ± 0.416
ProPro: 0.721 ± 0.282
ProGln: 1.443 ± 0.271
ProArg: 0.811 ± 0.343
ProSer: 2.795 ± 0.407
ProThr: 2.344 ± 0.427
ProVal: 1.262 ± 0.471
ProTrp: 0.721 ± 0.186
ProTyr: 1.262 ± 0.44
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.057 ± 0.593
GlnCys: 0.18 ± 0.113
GlnAsp: 2.074 ± 0.313
GlnGlu: 2.434 ± 0.458
GlnPhe: 1.262 ± 0.319
GlnGly: 3.066 ± 0.637
GlnHis: 0.451 ± 0.227
GlnIle: 2.344 ± 0.567
GlnLys: 3.156 ± 0.51
GlnLeu: 3.246 ± 0.484
GlnMet: 1.443 ± 0.339
GlnAsn: 2.615 ± 0.414
GlnPro: 0.541 ± 0.194
GlnGln: 2.344 ± 0.503
GlnArg: 1.623 ± 0.377
GlnSer: 2.434 ± 0.416
GlnThr: 2.885 ± 0.575
GlnVal: 2.164 ± 0.546
GlnTrp: 0.361 ± 0.194
GlnTyr: 2.254 ± 0.426
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.713 ± 0.401
ArgCys: 0.18 ± 0.121
ArgAsp: 2.795 ± 0.473
ArgGlu: 2.525 ± 0.423
ArgPhe: 2.344 ± 0.488
ArgGly: 2.885 ± 0.488
ArgHis: 0.721 ± 0.28
ArgIle: 3.607 ± 0.616
ArgLys: 3.156 ± 0.575
ArgLeu: 3.516 ± 0.629
ArgMet: 0.992 ± 0.277
ArgAsn: 3.066 ± 0.405
ArgPro: 1.082 ± 0.247
ArgGln: 2.074 ± 0.377
ArgArg: 1.352 ± 0.4
ArgSer: 1.533 ± 0.318
ArgThr: 2.705 ± 0.76
ArgVal: 3.246 ± 0.576
ArgTrp: 1.262 ± 0.272
ArgTyr: 2.344 ± 0.525
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.516 ± 0.529
SerCys: 0.451 ± 0.213
SerAsp: 4.598 ± 0.548
SerGlu: 3.877 ± 0.459
SerPhe: 2.885 ± 0.489
SerGly: 4.508 ± 0.723
SerHis: 0.541 ± 0.195
SerIle: 4.238 ± 0.584
SerLys: 5.139 ± 0.821
SerLeu: 4.598 ± 0.665
SerMet: 1.172 ± 0.285
SerAsn: 4.688 ± 0.695
SerPro: 1.713 ± 0.281
SerGln: 3.066 ± 0.61
SerArg: 2.615 ± 0.643
SerSer: 3.336 ± 0.488
SerThr: 3.877 ± 0.632
SerVal: 5.229 ± 0.717
SerTrp: 0.811 ± 0.356
SerTyr: 1.533 ± 0.34
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.418 ± 0.764
ThrCys: 0.361 ± 0.163
ThrAsp: 4.148 ± 0.585
ThrGlu: 3.336 ± 0.489
ThrPhe: 3.336 ± 0.615
ThrGly: 3.516 ± 0.527
ThrHis: 1.262 ± 0.307
ThrIle: 4.418 ± 0.732
ThrLys: 5.139 ± 0.676
ThrLeu: 6.311 ± 0.94
ThrMet: 0.902 ± 0.233
ThrAsn: 3.787 ± 0.608
ThrPro: 1.623 ± 0.415
ThrGln: 2.615 ± 0.506
ThrArg: 2.074 ± 0.441
ThrSer: 3.877 ± 0.545
ThrThr: 3.246 ± 0.62
ThrVal: 3.967 ± 0.556
ThrTrp: 0.721 ± 0.309
ThrTyr: 3.066 ± 0.597
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.418 ± 0.658
ValCys: 0.361 ± 0.18
ValAsp: 5.139 ± 0.543
ValGlu: 4.238 ± 0.65
ValPhe: 2.885 ± 0.588
ValGly: 4.238 ± 0.626
ValHis: 0.811 ± 0.262
ValIle: 3.787 ± 0.491
ValLys: 5.139 ± 0.761
ValLeu: 3.787 ± 0.759
ValMet: 1.172 ± 0.332
ValAsn: 3.697 ± 0.595
ValPro: 1.623 ± 0.323
ValGln: 2.254 ± 0.465
ValArg: 2.344 ± 0.598
ValSer: 4.869 ± 0.493
ValThr: 5.049 ± 0.834
ValVal: 3.516 ± 0.738
ValTrp: 1.082 ± 0.301
ValTyr: 1.803 ± 0.564
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.451 ± 0.23
TrpCys: 0.18 ± 0.145
TrpAsp: 0.992 ± 0.381
TrpGlu: 1.082 ± 0.276
TrpPhe: 0.721 ± 0.263
TrpGly: 0.631 ± 0.201
TrpHis: 0.361 ± 0.16
TrpIle: 0.631 ± 0.178
TrpLys: 0.992 ± 0.266
TrpLeu: 1.352 ± 0.345
TrpMet: 0.18 ± 0.112
TrpAsn: 1.082 ± 0.333
TrpPro: 0.18 ± 0.148
TrpGln: 0.992 ± 0.214
TrpArg: 0.811 ± 0.224
TrpSer: 1.893 ± 0.641
TrpThr: 0.811 ± 0.275
TrpVal: 1.533 ± 0.304
TrpTrp: 0.27 ± 0.174
TrpTyr: 0.361 ± 0.212
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.344 ± 0.31
TyrCys: 0.451 ± 0.226
TyrAsp: 2.795 ± 0.615
TyrGlu: 2.615 ± 0.468
TyrPhe: 2.164 ± 0.468
TyrGly: 1.623 ± 0.329
TyrHis: 0.811 ± 0.269
TyrIle: 2.434 ± 0.431
TyrLys: 2.885 ± 0.517
TyrLeu: 3.336 ± 0.488
TyrMet: 0.811 ± 0.252
TyrAsn: 1.803 ± 0.451
TyrPro: 1.533 ± 0.398
TyrGln: 2.254 ± 0.382
TyrArg: 2.615 ± 0.613
TyrSer: 2.254 ± 0.482
TyrThr: 2.164 ± 0.579
TyrVal: 2.705 ± 0.652
TyrTrp: 0.27 ± 0.132
TyrTyr: 2.074 ± 0.479
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (11092 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
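The note above gives only the outline of the procedure, so the following Python is a sketch under stated assumptions rather than the database's actual implementation: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the error is taken as the standard deviation of those estimates. The function names, the fixed seed, the use of the standard deviation, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_freq(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within one protein; never across proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, pair))
    return statistics.stdev(estimates)

proteins = ["MKKLAGK", "AKKGG", "MGKLL", "KKAAK"]  # hypothetical toy sequences
err = bootstrap_error(proteins, "KK")
```

Resampling at the protein level, rather than at the level of individual residues, keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.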

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski