Amino acid dipepetide frequency for Streptococcus phage CHPC663

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.999AlaAla: 3.999 ± 0.846
0.174AlaCys: 0.174 ± 0.148
4.869AlaAsp: 4.869 ± 0.706
4.521AlaGlu: 4.521 ± 0.63
2.087AlaPhe: 2.087 ± 0.482
3.825AlaGly: 3.825 ± 0.862
0.956AlaHis: 0.956 ± 0.321
4.521AlaIle: 4.521 ± 0.867
5.477AlaLys: 5.477 ± 0.844
5.39AlaLeu: 5.39 ± 0.606
2.0AlaMet: 2.0 ± 0.486
4.608AlaAsn: 4.608 ± 0.74
1.478AlaPro: 1.478 ± 0.385
2.26AlaGln: 2.26 ± 0.587
2.956AlaArg: 2.956 ± 0.412
4.26AlaSer: 4.26 ± 0.864
4.26AlaThr: 4.26 ± 0.883
3.391AlaVal: 3.391 ± 0.789
1.043AlaTrp: 1.043 ± 0.247
2.608AlaTyr: 2.608 ± 0.562
0.0AlaXaa: 0.0 ± 0.0
Cys
0.174CysAla: 0.174 ± 0.116
0.0CysCys: 0.0 ± 0.0
0.956CysAsp: 0.956 ± 0.273
0.348CysGlu: 0.348 ± 0.165
0.522CysPhe: 0.522 ± 0.268
0.261CysGly: 0.261 ± 0.155
0.174CysHis: 0.174 ± 0.113
0.0CysIle: 0.0 ± 0.0
0.348CysLys: 0.348 ± 0.177
0.435CysLeu: 0.435 ± 0.199
0.087CysMet: 0.087 ± 0.09
0.261CysAsn: 0.261 ± 0.157
0.174CysPro: 0.174 ± 0.132
0.261CysGln: 0.261 ± 0.142
0.261CysArg: 0.261 ± 0.197
0.522CysSer: 0.522 ± 0.298
0.435CysThr: 0.435 ± 0.206
0.261CysVal: 0.261 ± 0.152
0.174CysTrp: 0.174 ± 0.122
0.087CysTyr: 0.087 ± 0.079
0.0CysXaa: 0.0 ± 0.0
Asp
3.391AspAla: 3.391 ± 0.612
0.609AspCys: 0.609 ± 0.269
4.173AspAsp: 4.173 ± 0.749
3.912AspGlu: 3.912 ± 0.667
3.999AspPhe: 3.999 ± 0.495
7.303AspGly: 7.303 ± 1.531
1.217AspHis: 1.217 ± 0.317
4.782AspIle: 4.782 ± 0.592
4.869AspLys: 4.869 ± 0.525
4.608AspLeu: 4.608 ± 0.77
2.0AspMet: 2.0 ± 0.4
4.26AspAsn: 4.26 ± 0.612
1.826AspPro: 1.826 ± 0.418
1.739AspGln: 1.739 ± 0.365
2.782AspArg: 2.782 ± 0.672
4.26AspSer: 4.26 ± 0.536
3.652AspThr: 3.652 ± 0.528
3.217AspVal: 3.217 ± 0.593
1.13AspTrp: 1.13 ± 0.242
2.608AspTyr: 2.608 ± 0.513
0.0AspXaa: 0.0 ± 0.0
Glu
3.565GluAla: 3.565 ± 0.496
0.348GluCys: 0.348 ± 0.154
3.391GluAsp: 3.391 ± 0.589
4.521GluGlu: 4.521 ± 0.814
2.087GluPhe: 2.087 ± 0.371
2.782GluGly: 2.782 ± 0.346
0.869GluHis: 0.869 ± 0.27
4.869GluIle: 4.869 ± 0.686
4.26GluLys: 4.26 ± 0.816
5.912GluLeu: 5.912 ± 0.74
2.26GluMet: 2.26 ± 0.387
3.738GluAsn: 3.738 ± 0.66
1.652GluPro: 1.652 ± 0.55
3.217GluGln: 3.217 ± 0.725
2.956GluArg: 2.956 ± 0.535
3.738GluSer: 3.738 ± 0.537
3.912GluThr: 3.912 ± 0.689
4.608GluVal: 4.608 ± 0.869
1.13GluTrp: 1.13 ± 0.295
3.478GluTyr: 3.478 ± 0.676
0.0GluXaa: 0.0 ± 0.0
Phe
2.695PheAla: 2.695 ± 0.413
0.348PheCys: 0.348 ± 0.251
2.956PheAsp: 2.956 ± 0.479
2.608PheGlu: 2.608 ± 0.598
1.565PhePhe: 1.565 ± 0.33
3.825PheGly: 3.825 ± 0.569
0.435PheHis: 0.435 ± 0.19
2.087PheIle: 2.087 ± 0.448
4.347PheLys: 4.347 ± 0.673
2.869PheLeu: 2.869 ± 0.441
0.782PheMet: 0.782 ± 0.237
3.13PheAsn: 3.13 ± 0.817
0.782PhePro: 0.782 ± 0.204
1.304PheGln: 1.304 ± 0.342
1.826PheArg: 1.826 ± 0.349
3.478PheSer: 3.478 ± 0.573
2.087PheThr: 2.087 ± 0.488
2.087PheVal: 2.087 ± 0.42
0.609PheTrp: 0.609 ± 0.149
1.739PheTyr: 1.739 ± 0.418
0.0PheXaa: 0.0 ± 0.0
Gly
3.391GlyAla: 3.391 ± 0.618
0.261GlyCys: 0.261 ± 0.149
4.695GlyAsp: 4.695 ± 0.691
3.565GlyGlu: 3.565 ± 0.603
3.304GlyPhe: 3.304 ± 0.49
4.347GlyGly: 4.347 ± 0.783
1.652GlyHis: 1.652 ± 0.621
5.39GlyIle: 5.39 ± 0.563
7.477GlyLys: 7.477 ± 1.05
6.781GlyLeu: 6.781 ± 0.794
1.13GlyMet: 1.13 ± 0.313
3.738GlyAsn: 3.738 ± 0.576
2.0GlyPro: 2.0 ± 0.839
2.869GlyGln: 2.869 ± 0.603
3.043GlyArg: 3.043 ± 0.492
4.608GlySer: 4.608 ± 0.607
4.608GlyThr: 4.608 ± 0.631
3.738GlyVal: 3.738 ± 0.678
0.956GlyTrp: 0.956 ± 0.33
3.217GlyTyr: 3.217 ± 0.478
0.0GlyXaa: 0.0 ± 0.0
His
0.696HisAla: 0.696 ± 0.307
0.087HisCys: 0.087 ± 0.089
1.391HisAsp: 1.391 ± 0.371
0.522HisGlu: 0.522 ± 0.219
0.696HisPhe: 0.696 ± 0.2
0.782HisGly: 0.782 ± 0.248
0.435HisHis: 0.435 ± 0.183
1.043HisIle: 1.043 ± 0.37
0.696HisLys: 0.696 ± 0.263
1.304HisLeu: 1.304 ± 0.449
0.348HisMet: 0.348 ± 0.183
0.522HisAsn: 0.522 ± 0.275
0.435HisPro: 0.435 ± 0.165
0.609HisGln: 0.609 ± 0.274
0.435HisArg: 0.435 ± 0.151
0.869HisSer: 0.869 ± 0.274
1.391HisThr: 1.391 ± 0.271
1.391HisVal: 1.391 ± 0.34
0.261HisTrp: 0.261 ± 0.154
0.696HisTyr: 0.696 ± 0.283
0.0HisXaa: 0.0 ± 0.0
Ile
5.13IleAla: 5.13 ± 0.706
0.261IleCys: 0.261 ± 0.163
5.216IleAsp: 5.216 ± 0.821
4.086IleGlu: 4.086 ± 0.528
1.478IlePhe: 1.478 ± 0.432
5.043IleGly: 5.043 ± 0.541
0.782IleHis: 0.782 ± 0.262
2.608IleIle: 2.608 ± 0.366
6.434IleLys: 6.434 ± 0.676
4.347IleLeu: 4.347 ± 0.67
1.913IleMet: 1.913 ± 0.499
4.695IleAsn: 4.695 ± 0.637
2.956IlePro: 2.956 ± 0.444
3.043IleGln: 3.043 ± 0.426
1.826IleArg: 1.826 ± 0.352
4.956IleSer: 4.956 ± 0.716
3.565IleThr: 3.565 ± 0.714
3.13IleVal: 3.13 ± 0.506
0.782IleTrp: 0.782 ± 0.239
1.826IleTyr: 1.826 ± 0.404
0.0IleXaa: 0.0 ± 0.0
Lys
5.912LysAla: 5.912 ± 0.498
0.348LysCys: 0.348 ± 0.223
5.13LysAsp: 5.13 ± 0.767
6.608LysGlu: 6.608 ± 0.827
3.652LysPhe: 3.652 ± 0.794
6.173LysGly: 6.173 ± 0.932
1.304LysHis: 1.304 ± 0.335
5.651LysIle: 5.651 ± 0.677
6.173LysLys: 6.173 ± 1.041
5.999LysLeu: 5.999 ± 0.901
2.608LysMet: 2.608 ± 0.564
6.086LysAsn: 6.086 ± 0.745
2.956LysPro: 2.956 ± 0.43
3.652LysGln: 3.652 ± 0.467
3.738LysArg: 3.738 ± 0.631
4.26LysSer: 4.26 ± 0.667
5.39LysThr: 5.39 ± 0.849
3.738LysVal: 3.738 ± 0.606
1.478LysTrp: 1.478 ± 0.387
3.912LysTyr: 3.912 ± 0.835
0.0LysXaa: 0.0 ± 0.0
Leu
6.26LeuAla: 6.26 ± 0.773
0.435LeuCys: 0.435 ± 0.245
5.303LeuAsp: 5.303 ± 0.712
6.781LeuGlu: 6.781 ± 1.016
2.869LeuPhe: 2.869 ± 0.407
5.912LeuGly: 5.912 ± 0.704
1.043LeuHis: 1.043 ± 0.32
4.434LeuIle: 4.434 ± 0.575
7.042LeuLys: 7.042 ± 0.518
4.26LeuLeu: 4.26 ± 0.724
2.347LeuMet: 2.347 ± 0.398
4.434LeuAsn: 4.434 ± 0.577
2.434LeuPro: 2.434 ± 0.346
2.782LeuGln: 2.782 ± 0.519
2.869LeuArg: 2.869 ± 0.572
5.477LeuSer: 5.477 ± 0.791
5.825LeuThr: 5.825 ± 0.877
3.999LeuVal: 3.999 ± 0.568
0.609LeuTrp: 0.609 ± 0.199
2.174LeuTyr: 2.174 ± 0.555
0.0LeuXaa: 0.0 ± 0.0
Met
1.826MetAla: 1.826 ± 0.505
0.087MetCys: 0.087 ± 0.078
1.13MetAsp: 1.13 ± 0.348
1.217MetGlu: 1.217 ± 0.356
0.956MetPhe: 0.956 ± 0.234
0.869MetGly: 0.869 ± 0.263
0.348MetHis: 0.348 ± 0.197
2.26MetIle: 2.26 ± 0.376
3.13MetLys: 3.13 ± 0.661
2.0MetLeu: 2.0 ± 0.354
0.956MetMet: 0.956 ± 0.254
1.13MetAsn: 1.13 ± 0.256
0.869MetPro: 0.869 ± 0.266
1.043MetGln: 1.043 ± 0.242
0.782MetArg: 0.782 ± 0.224
2.0MetSer: 2.0 ± 0.437
2.174MetThr: 2.174 ± 0.351
1.478MetVal: 1.478 ± 0.435
0.087MetTrp: 0.087 ± 0.062
1.217MetTyr: 1.217 ± 0.404
0.0MetXaa: 0.0 ± 0.0
Asn
4.521AsnAla: 4.521 ± 1.08
0.348AsnCys: 0.348 ± 0.206
3.478AsnAsp: 3.478 ± 0.529
3.391AsnGlu: 3.391 ± 0.695
2.174AsnPhe: 2.174 ± 0.475
6.086AsnGly: 6.086 ± 0.779
0.696AsnHis: 0.696 ± 0.197
3.304AsnIle: 3.304 ± 0.603
4.956AsnLys: 4.956 ± 0.582
4.869AsnLeu: 4.869 ± 0.456
1.304AsnMet: 1.304 ± 0.379
3.738AsnAsn: 3.738 ± 0.526
3.13AsnPro: 3.13 ± 0.682
2.695AsnGln: 2.695 ± 0.45
2.434AsnArg: 2.434 ± 0.469
4.347AsnSer: 4.347 ± 0.617
3.478AsnThr: 3.478 ± 0.451
3.13AsnVal: 3.13 ± 0.569
1.217AsnTrp: 1.217 ± 0.288
2.26AsnTyr: 2.26 ± 0.416
0.0AsnXaa: 0.0 ± 0.0
Pro
2.174ProAla: 2.174 ± 0.369
0.174ProCys: 0.174 ± 0.175
1.391ProAsp: 1.391 ± 0.369
1.565ProGlu: 1.565 ± 0.452
1.391ProPhe: 1.391 ± 0.411
1.391ProGly: 1.391 ± 0.649
0.348ProHis: 0.348 ± 0.146
1.652ProIle: 1.652 ± 0.276
3.912ProLys: 3.912 ± 0.723
2.434ProLeu: 2.434 ± 0.409
0.174ProMet: 0.174 ± 0.113
2.174ProAsn: 2.174 ± 0.432
0.261ProPro: 0.261 ± 0.187
1.826ProGln: 1.826 ± 0.4
1.304ProArg: 1.304 ± 0.413
2.347ProSer: 2.347 ± 0.346
2.434ProThr: 2.434 ± 0.354
1.043ProVal: 1.043 ± 0.282
0.522ProTrp: 0.522 ± 0.136
1.304ProTyr: 1.304 ± 0.353
0.0ProXaa: 0.0 ± 0.0
Gln
3.043GlnAla: 3.043 ± 0.627
0.261GlnCys: 0.261 ± 0.155
2.521GlnAsp: 2.521 ± 0.486
2.869GlnGlu: 2.869 ± 0.497
1.739GlnPhe: 1.739 ± 0.412
3.304GlnGly: 3.304 ± 0.841
0.261GlnHis: 0.261 ± 0.134
2.695GlnIle: 2.695 ± 0.525
3.217GlnLys: 3.217 ± 0.542
3.652GlnLeu: 3.652 ± 0.479
1.478GlnMet: 1.478 ± 0.364
2.26GlnAsn: 2.26 ± 0.471
0.609GlnPro: 0.609 ± 0.221
3.13GlnGln: 3.13 ± 0.722
1.913GlnArg: 1.913 ± 0.332
1.913GlnSer: 1.913 ± 0.27
2.782GlnThr: 2.782 ± 0.321
2.174GlnVal: 2.174 ± 0.445
0.609GlnTrp: 0.609 ± 0.184
2.434GlnTyr: 2.434 ± 0.411
0.0GlnXaa: 0.0 ± 0.0
Arg
2.174ArgAla: 2.174 ± 0.419
0.174ArgCys: 0.174 ± 0.123
2.956ArgAsp: 2.956 ± 0.77
2.608ArgGlu: 2.608 ± 0.603
1.826ArgPhe: 1.826 ± 0.421
2.869ArgGly: 2.869 ± 0.579
0.782ArgHis: 0.782 ± 0.244
2.869ArgIle: 2.869 ± 0.671
3.13ArgLys: 3.13 ± 0.59
2.869ArgLeu: 2.869 ± 0.579
0.696ArgMet: 0.696 ± 0.307
2.608ArgAsn: 2.608 ± 0.554
1.043ArgPro: 1.043 ± 0.253
2.347ArgGln: 2.347 ± 0.536
1.478ArgArg: 1.478 ± 0.316
2.0ArgSer: 2.0 ± 0.386
2.521ArgThr: 2.521 ± 0.724
2.521ArgVal: 2.521 ± 0.461
1.304ArgTrp: 1.304 ± 0.335
2.434ArgTyr: 2.434 ± 0.538
0.0ArgXaa: 0.0 ± 0.0
Ser
3.565SerAla: 3.565 ± 0.583
0.522SerCys: 0.522 ± 0.188
4.782SerAsp: 4.782 ± 0.788
3.825SerGlu: 3.825 ± 0.491
3.13SerPhe: 3.13 ± 0.471
4.521SerGly: 4.521 ± 0.524
0.696SerHis: 0.696 ± 0.244
4.086SerIle: 4.086 ± 0.727
4.869SerLys: 4.869 ± 0.653
4.695SerLeu: 4.695 ± 0.528
2.174SerMet: 2.174 ± 0.323
4.608SerAsn: 4.608 ± 0.803
2.347SerPro: 2.347 ± 0.497
3.391SerGln: 3.391 ± 0.488
3.043SerArg: 3.043 ± 0.531
4.434SerSer: 4.434 ± 0.63
4.695SerThr: 4.695 ± 0.732
5.564SerVal: 5.564 ± 0.753
0.609SerTrp: 0.609 ± 0.26
2.347SerTyr: 2.347 ± 0.423
0.0SerXaa: 0.0 ± 0.0
Thr
4.347ThrAla: 4.347 ± 0.64
0.261ThrCys: 0.261 ± 0.14
4.434ThrAsp: 4.434 ± 0.858
4.26ThrGlu: 4.26 ± 0.572
2.956ThrPhe: 2.956 ± 0.529
3.999ThrGly: 3.999 ± 0.555
0.782ThrHis: 0.782 ± 0.334
4.434ThrIle: 4.434 ± 0.798
4.956ThrLys: 4.956 ± 0.523
6.868ThrLeu: 6.868 ± 0.8
0.956ThrMet: 0.956 ± 0.291
3.999ThrAsn: 3.999 ± 0.533
1.478ThrPro: 1.478 ± 0.376
2.434ThrGln: 2.434 ± 0.491
2.174ThrArg: 2.174 ± 0.419
3.391ThrSer: 3.391 ± 0.508
3.043ThrThr: 3.043 ± 0.67
4.086ThrVal: 4.086 ± 0.473
1.478ThrTrp: 1.478 ± 0.267
3.391ThrTyr: 3.391 ± 0.74
0.0ThrXaa: 0.0 ± 0.0
Val
4.434ValAla: 4.434 ± 0.906
0.348ValCys: 0.348 ± 0.148
3.825ValAsp: 3.825 ± 0.447
3.304ValGlu: 3.304 ± 0.52
2.087ValPhe: 2.087 ± 0.503
4.347ValGly: 4.347 ± 0.676
0.435ValHis: 0.435 ± 0.155
3.912ValIle: 3.912 ± 0.651
5.043ValLys: 5.043 ± 0.65
3.13ValLeu: 3.13 ± 0.624
0.869ValMet: 0.869 ± 0.22
3.13ValAsn: 3.13 ± 0.572
1.913ValPro: 1.913 ± 0.435
1.478ValGln: 1.478 ± 0.297
2.174ValArg: 2.174 ± 0.53
6.347ValSer: 6.347 ± 0.738
4.347ValThr: 4.347 ± 0.748
2.869ValVal: 2.869 ± 0.658
0.782ValTrp: 0.782 ± 0.246
2.0ValTyr: 2.0 ± 0.446
0.0ValXaa: 0.0 ± 0.0
Trp
0.782TrpAla: 0.782 ± 0.285
0.0TrpCys: 0.0 ± 0.0
0.869TrpAsp: 0.869 ± 0.346
0.956TrpGlu: 0.956 ± 0.276
0.956TrpPhe: 0.956 ± 0.23
0.609TrpGly: 0.609 ± 0.22
0.348TrpHis: 0.348 ± 0.168
0.956TrpIle: 0.956 ± 0.324
1.304TrpLys: 1.304 ± 0.26
1.304TrpLeu: 1.304 ± 0.299
0.087TrpMet: 0.087 ± 0.091
0.782TrpAsn: 0.782 ± 0.279
0.174TrpPro: 0.174 ± 0.135
0.609TrpGln: 0.609 ± 0.199
0.782TrpArg: 0.782 ± 0.239
1.739TrpSer: 1.739 ± 0.325
1.043TrpThr: 1.043 ± 0.4
1.043TrpVal: 1.043 ± 0.198
0.174TrpTrp: 0.174 ± 0.095
0.435TrpTyr: 0.435 ± 0.207
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.608TyrAla: 2.608 ± 0.45
0.696TyrCys: 0.696 ± 0.291
2.869TyrAsp: 2.869 ± 0.645
1.826TyrGlu: 1.826 ± 0.353
2.087TyrPhe: 2.087 ± 0.355
2.521TyrGly: 2.521 ± 0.495
1.043TyrHis: 1.043 ± 0.338
2.521TyrIle: 2.521 ± 0.589
3.13TyrLys: 3.13 ± 0.498
3.652TyrLeu: 3.652 ± 0.487
1.304TyrMet: 1.304 ± 0.399
1.652TyrAsn: 1.652 ± 0.336
1.304TyrPro: 1.304 ± 0.365
2.174TyrGln: 2.174 ± 0.383
2.434TyrArg: 2.434 ± 0.447
2.956TyrSer: 2.956 ± 0.584
2.087TyrThr: 2.087 ± 0.78
3.304TyrVal: 3.304 ± 0.518
0.0TyrTrp: 0.0 ± 0.0
2.347TyrTyr: 2.347 ± 0.587
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (11503 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski