Amino acid dipepetide frequency for Streptococcus phage IPP60

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.532AlaAla: 2.532 ± 0.813
0.307AlaCys: 0.307 ± 0.153
5.602AlaAsp: 5.602 ± 0.564
6.216AlaGlu: 6.216 ± 0.674
2.686AlaPhe: 2.686 ± 0.573
5.142AlaGly: 5.142 ± 0.938
1.074AlaHis: 1.074 ± 0.283
4.835AlaIle: 4.835 ± 0.874
6.753AlaLys: 6.753 ± 0.836
6.293AlaLeu: 6.293 ± 0.912
2.456AlaMet: 2.456 ± 0.387
4.297AlaAsn: 4.297 ± 0.877
2.072AlaPro: 2.072 ± 0.381
2.379AlaGln: 2.379 ± 0.427
2.916AlaArg: 2.916 ± 0.551
3.146AlaSer: 3.146 ± 0.744
4.297AlaThr: 4.297 ± 0.611
5.142AlaVal: 5.142 ± 0.563
0.767AlaTrp: 0.767 ± 0.235
1.612AlaTyr: 1.612 ± 0.297
0.0AlaXaa: 0.0 ± 0.0
Cys
0.23CysAla: 0.23 ± 0.132
0.0CysCys: 0.0 ± 0.0
0.384CysAsp: 0.384 ± 0.17
0.384CysGlu: 0.384 ± 0.164
0.46CysPhe: 0.46 ± 0.189
0.23CysGly: 0.23 ± 0.172
0.077CysHis: 0.077 ± 0.093
0.307CysIle: 0.307 ± 0.234
0.46CysLys: 0.46 ± 0.175
0.307CysLeu: 0.307 ± 0.135
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.307CysPro: 0.307 ± 0.159
0.23CysGln: 0.23 ± 0.117
0.384CysArg: 0.384 ± 0.134
0.23CysSer: 0.23 ± 0.156
0.077CysThr: 0.077 ± 0.097
0.153CysVal: 0.153 ± 0.121
0.153CysTrp: 0.153 ± 0.086
0.384CysTyr: 0.384 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
3.684AspAla: 3.684 ± 0.713
0.537AspCys: 0.537 ± 0.239
3.377AspAsp: 3.377 ± 0.806
4.067AspGlu: 4.067 ± 0.864
3.53AspPhe: 3.53 ± 0.477
4.681AspGly: 4.681 ± 0.69
0.537AspHis: 0.537 ± 0.232
5.909AspIle: 5.909 ± 0.679
4.911AspLys: 4.911 ± 0.616
5.142AspLeu: 5.142 ± 0.785
1.765AspMet: 1.765 ± 0.313
3.3AspAsn: 3.3 ± 0.441
1.535AspPro: 1.535 ± 0.347
1.535AspGln: 1.535 ± 0.328
2.916AspArg: 2.916 ± 0.386
3.3AspSer: 3.3 ± 0.515
3.607AspThr: 3.607 ± 0.47
2.839AspVal: 2.839 ± 0.348
1.765AspTrp: 1.765 ± 0.439
3.377AspTyr: 3.377 ± 0.515
0.0AspXaa: 0.0 ± 0.0
Glu
6.216GluAla: 6.216 ± 0.824
0.153GluCys: 0.153 ± 0.098
3.99GluAsp: 3.99 ± 0.566
5.986GluGlu: 5.986 ± 0.976
3.684GluPhe: 3.684 ± 0.576
3.684GluGly: 3.684 ± 0.561
1.535GluHis: 1.535 ± 0.359
5.756GluIle: 5.756 ± 0.498
8.211GluLys: 8.211 ± 1.282
8.288GluLeu: 8.288 ± 0.955
2.225GluMet: 2.225 ± 0.569
4.758GluAsn: 4.758 ± 0.508
1.842GluPro: 1.842 ± 0.45
3.3GluGln: 3.3 ± 0.711
3.914GluArg: 3.914 ± 0.534
4.681GluSer: 4.681 ± 0.55
4.221GluThr: 4.221 ± 0.637
4.835GluVal: 4.835 ± 0.596
0.767GluTrp: 0.767 ± 0.232
3.146GluTyr: 3.146 ± 0.493
0.0GluXaa: 0.0 ± 0.0
Phe
2.609PheAla: 2.609 ± 0.65
0.23PheCys: 0.23 ± 0.131
4.297PheAsp: 4.297 ± 0.613
4.297PheGlu: 4.297 ± 0.6
1.688PhePhe: 1.688 ± 0.365
2.609PheGly: 2.609 ± 0.643
0.384PheHis: 0.384 ± 0.196
1.765PheIle: 1.765 ± 0.295
3.377PheLys: 3.377 ± 0.546
2.686PheLeu: 2.686 ± 0.44
1.305PheMet: 1.305 ± 0.314
2.916PheAsn: 2.916 ± 0.595
0.767PhePro: 0.767 ± 0.302
1.305PheGln: 1.305 ± 0.291
1.688PheArg: 1.688 ± 0.344
2.916PheSer: 2.916 ± 0.616
2.456PheThr: 2.456 ± 0.435
1.995PheVal: 1.995 ± 0.385
0.767PheTrp: 0.767 ± 0.292
1.919PheTyr: 1.919 ± 0.355
0.0PheXaa: 0.0 ± 0.0
Gly
2.916GlyAla: 2.916 ± 0.396
0.153GlyCys: 0.153 ± 0.093
3.684GlyAsp: 3.684 ± 0.609
4.451GlyGlu: 4.451 ± 0.605
2.686GlyPhe: 2.686 ± 0.428
4.144GlyGly: 4.144 ± 1.23
0.767GlyHis: 0.767 ± 0.183
3.453GlyIle: 3.453 ± 0.596
4.835GlyLys: 4.835 ± 0.544
5.832GlyLeu: 5.832 ± 0.928
1.688GlyMet: 1.688 ± 0.33
3.914GlyAsn: 3.914 ± 0.51
0.998GlyPro: 0.998 ± 0.246
3.146GlyGln: 3.146 ± 0.491
3.53GlyArg: 3.53 ± 0.453
3.684GlySer: 3.684 ± 0.877
2.686GlyThr: 2.686 ± 0.504
4.604GlyVal: 4.604 ± 0.598
0.921GlyTrp: 0.921 ± 0.482
2.609GlyTyr: 2.609 ± 0.415
0.0GlyXaa: 0.0 ± 0.0
His
0.767HisAla: 0.767 ± 0.272
0.0HisCys: 0.0 ± 0.0
0.767HisAsp: 0.767 ± 0.283
1.151HisGlu: 1.151 ± 0.315
0.921HisPhe: 0.921 ± 0.258
0.998HisGly: 0.998 ± 0.299
0.307HisHis: 0.307 ± 0.162
0.614HisIle: 0.614 ± 0.265
0.691HisLys: 0.691 ± 0.27
1.228HisLeu: 1.228 ± 0.294
0.23HisMet: 0.23 ± 0.132
0.998HisAsn: 0.998 ± 0.217
0.691HisPro: 0.691 ± 0.234
0.691HisGln: 0.691 ± 0.243
0.537HisArg: 0.537 ± 0.209
1.688HisSer: 1.688 ± 0.539
0.767HisThr: 0.767 ± 0.221
0.844HisVal: 0.844 ± 0.258
0.077HisTrp: 0.077 ± 0.075
0.46HisTyr: 0.46 ± 0.207
0.0HisXaa: 0.0 ± 0.0
Ile
5.602IleAla: 5.602 ± 0.564
0.691IleCys: 0.691 ± 0.156
3.99IleAsp: 3.99 ± 0.622
6.676IleGlu: 6.676 ± 0.687
2.456IlePhe: 2.456 ± 0.477
3.76IleGly: 3.76 ± 0.684
0.23IleHis: 0.23 ± 0.143
2.916IleIle: 2.916 ± 0.485
5.986IleLys: 5.986 ± 0.638
4.374IleLeu: 4.374 ± 0.761
0.998IleMet: 0.998 ± 0.305
2.993IleAsn: 2.993 ± 0.396
1.919IlePro: 1.919 ± 0.351
2.532IleGln: 2.532 ± 0.29
2.456IleArg: 2.456 ± 0.527
5.142IleSer: 5.142 ± 0.784
3.99IleThr: 3.99 ± 0.409
3.377IleVal: 3.377 ± 0.514
0.614IleTrp: 0.614 ± 0.187
2.225IleTyr: 2.225 ± 0.513
0.0IleXaa: 0.0 ± 0.0
Lys
5.602LysAla: 5.602 ± 0.828
0.153LysCys: 0.153 ± 0.109
5.909LysAsp: 5.909 ± 0.673
6.523LysGlu: 6.523 ± 0.902
3.3LysPhe: 3.3 ± 0.548
4.221LysGly: 4.221 ± 0.607
1.612LysHis: 1.612 ± 0.297
5.602LysIle: 5.602 ± 0.662
7.827LysLys: 7.827 ± 1.065
7.214LysLeu: 7.214 ± 0.808
2.916LysMet: 2.916 ± 0.36
4.528LysAsn: 4.528 ± 0.524
2.686LysPro: 2.686 ± 0.673
3.3LysGln: 3.3 ± 0.544
3.837LysArg: 3.837 ± 0.476
4.604LysSer: 4.604 ± 0.562
6.293LysThr: 6.293 ± 0.604
6.139LysVal: 6.139 ± 0.589
1.074LysTrp: 1.074 ± 0.359
3.453LysTyr: 3.453 ± 0.422
0.0LysXaa: 0.0 ± 0.0
Leu
7.444LeuAla: 7.444 ± 0.839
0.384LeuCys: 0.384 ± 0.156
5.525LeuAsp: 5.525 ± 0.606
7.597LeuGlu: 7.597 ± 0.924
2.532LeuPhe: 2.532 ± 0.471
4.988LeuGly: 4.988 ± 1.164
1.074LeuHis: 1.074 ± 0.269
3.684LeuIle: 3.684 ± 0.477
7.137LeuLys: 7.137 ± 0.714
6.6LeuLeu: 6.6 ± 0.906
2.302LeuMet: 2.302 ± 0.406
3.453LeuAsn: 3.453 ± 0.702
2.763LeuPro: 2.763 ± 0.541
2.993LeuGln: 2.993 ± 0.547
4.144LeuArg: 4.144 ± 0.563
5.602LeuSer: 5.602 ± 0.868
5.525LeuThr: 5.525 ± 0.849
4.451LeuVal: 4.451 ± 0.517
0.691LeuTrp: 0.691 ± 0.178
2.379LeuTyr: 2.379 ± 0.352
0.0LeuXaa: 0.0 ± 0.0
Met
1.612MetAla: 1.612 ± 0.374
0.0MetCys: 0.0 ± 0.0
1.305MetAsp: 1.305 ± 0.222
2.149MetGlu: 2.149 ± 0.424
1.074MetPhe: 1.074 ± 0.235
0.998MetGly: 0.998 ± 0.386
0.307MetHis: 0.307 ± 0.173
1.842MetIle: 1.842 ± 0.381
2.225MetLys: 2.225 ± 0.521
1.688MetLeu: 1.688 ± 0.305
0.384MetMet: 0.384 ± 0.175
1.688MetAsn: 1.688 ± 0.476
1.228MetPro: 1.228 ± 0.333
0.998MetGln: 0.998 ± 0.313
1.151MetArg: 1.151 ± 0.321
1.381MetSer: 1.381 ± 0.348
1.688MetThr: 1.688 ± 0.384
1.842MetVal: 1.842 ± 0.341
0.153MetTrp: 0.153 ± 0.093
0.921MetTyr: 0.921 ± 0.206
0.0MetXaa: 0.0 ± 0.0
Asn
4.988AsnAla: 4.988 ± 0.649
0.23AsnCys: 0.23 ± 0.111
2.916AsnAsp: 2.916 ± 0.447
2.609AsnGlu: 2.609 ± 0.598
2.302AsnPhe: 2.302 ± 0.438
3.914AsnGly: 3.914 ± 0.586
0.844AsnHis: 0.844 ± 0.274
3.3AsnIle: 3.3 ± 0.488
4.374AsnLys: 4.374 ± 0.562
5.142AsnLeu: 5.142 ± 0.64
1.228AsnMet: 1.228 ± 0.303
2.916AsnAsn: 2.916 ± 0.481
1.688AsnPro: 1.688 ± 0.315
2.916AsnGln: 2.916 ± 0.549
2.609AsnArg: 2.609 ± 0.468
3.914AsnSer: 3.914 ± 0.694
3.07AsnThr: 3.07 ± 0.509
3.377AsnVal: 3.377 ± 0.443
0.921AsnTrp: 0.921 ± 0.248
2.149AsnTyr: 2.149 ± 0.256
0.0AsnXaa: 0.0 ± 0.0
Pro
2.609ProAla: 2.609 ± 0.42
0.23ProCys: 0.23 ± 0.16
1.612ProAsp: 1.612 ± 0.344
3.453ProGlu: 3.453 ± 0.449
0.767ProPhe: 0.767 ± 0.316
1.228ProGly: 1.228 ± 0.279
0.46ProHis: 0.46 ± 0.133
1.612ProIle: 1.612 ± 0.405
3.146ProLys: 3.146 ± 0.418
1.612ProLeu: 1.612 ± 0.363
0.614ProMet: 0.614 ± 0.233
1.612ProAsn: 1.612 ± 0.307
0.46ProPro: 0.46 ± 0.177
0.691ProGln: 0.691 ± 0.271
1.228ProArg: 1.228 ± 0.312
1.765ProSer: 1.765 ± 0.517
0.691ProThr: 0.691 ± 0.257
2.225ProVal: 2.225 ± 0.387
0.307ProTrp: 0.307 ± 0.142
1.535ProTyr: 1.535 ± 0.405
0.0ProXaa: 0.0 ± 0.0
Gln
3.607GlnAla: 3.607 ± 0.551
0.077GlnCys: 0.077 ± 0.078
1.842GlnAsp: 1.842 ± 0.32
3.837GlnGlu: 3.837 ± 0.582
1.535GlnPhe: 1.535 ± 0.311
1.612GlnGly: 1.612 ± 0.338
0.691GlnHis: 0.691 ± 0.217
2.763GlnIle: 2.763 ± 0.441
3.684GlnLys: 3.684 ± 0.509
2.993GlnLeu: 2.993 ± 0.41
0.844GlnMet: 0.844 ± 0.195
1.765GlnAsn: 1.765 ± 0.289
1.228GlnPro: 1.228 ± 0.288
1.612GlnGln: 1.612 ± 0.409
1.765GlnArg: 1.765 ± 0.486
2.225GlnSer: 2.225 ± 0.343
2.839GlnThr: 2.839 ± 0.508
3.914GlnVal: 3.914 ± 0.525
0.46GlnTrp: 0.46 ± 0.138
0.998GlnTyr: 0.998 ± 0.305
0.0GlnXaa: 0.0 ± 0.0
Arg
2.839ArgAla: 2.839 ± 0.486
0.384ArgCys: 0.384 ± 0.17
2.456ArgAsp: 2.456 ± 0.477
3.684ArgGlu: 3.684 ± 0.671
1.535ArgPhe: 1.535 ± 0.394
1.612ArgGly: 1.612 ± 0.341
0.537ArgHis: 0.537 ± 0.228
2.839ArgIle: 2.839 ± 0.666
3.607ArgLys: 3.607 ± 0.657
4.604ArgLeu: 4.604 ± 0.66
1.995ArgMet: 1.995 ± 0.404
2.609ArgAsn: 2.609 ± 0.586
0.921ArgPro: 0.921 ± 0.218
2.609ArgGln: 2.609 ± 0.497
2.686ArgArg: 2.686 ± 0.631
2.379ArgSer: 2.379 ± 0.387
2.993ArgThr: 2.993 ± 0.568
2.686ArgVal: 2.686 ± 0.482
0.537ArgTrp: 0.537 ± 0.234
1.842ArgTyr: 1.842 ± 0.391
0.0ArgXaa: 0.0 ± 0.0
Ser
3.99SerAla: 3.99 ± 0.885
0.23SerCys: 0.23 ± 0.129
3.607SerAsp: 3.607 ± 0.559
5.142SerGlu: 5.142 ± 0.623
2.379SerPhe: 2.379 ± 0.501
5.449SerGly: 5.449 ± 0.669
1.458SerHis: 1.458 ± 0.419
4.221SerIle: 4.221 ± 0.593
4.604SerLys: 4.604 ± 0.644
5.218SerLeu: 5.218 ± 0.722
1.151SerMet: 1.151 ± 0.356
3.53SerAsn: 3.53 ± 0.528
1.151SerPro: 1.151 ± 0.235
2.302SerGln: 2.302 ± 0.411
2.609SerArg: 2.609 ± 0.554
3.76SerSer: 3.76 ± 0.717
3.99SerThr: 3.99 ± 0.566
3.607SerVal: 3.607 ± 0.671
0.998SerTrp: 0.998 ± 0.43
2.456SerTyr: 2.456 ± 0.458
0.0SerXaa: 0.0 ± 0.0
Thr
5.065ThrAla: 5.065 ± 0.962
0.153ThrCys: 0.153 ± 0.105
3.607ThrAsp: 3.607 ± 0.613
4.451ThrGlu: 4.451 ± 0.633
3.453ThrPhe: 3.453 ± 0.844
4.604ThrGly: 4.604 ± 0.784
0.921ThrHis: 0.921 ± 0.293
4.681ThrIle: 4.681 ± 0.599
4.835ThrLys: 4.835 ± 0.702
3.76ThrLeu: 3.76 ± 0.514
0.691ThrMet: 0.691 ± 0.227
3.76ThrAsn: 3.76 ± 0.466
1.535ThrPro: 1.535 ± 0.385
2.916ThrGln: 2.916 ± 0.654
1.765ThrArg: 1.765 ± 0.38
3.914ThrSer: 3.914 ± 0.576
4.297ThrThr: 4.297 ± 0.694
3.914ThrVal: 3.914 ± 0.605
0.691ThrTrp: 0.691 ± 0.325
2.379ThrTyr: 2.379 ± 0.469
0.0ThrXaa: 0.0 ± 0.0
Val
5.218ValAla: 5.218 ± 0.619
0.23ValCys: 0.23 ± 0.147
4.297ValAsp: 4.297 ± 0.594
5.525ValGlu: 5.525 ± 0.672
2.072ValPhe: 2.072 ± 0.445
4.221ValGly: 4.221 ± 0.698
0.767ValHis: 0.767 ± 0.28
3.914ValIle: 3.914 ± 0.717
5.065ValLys: 5.065 ± 0.655
4.758ValLeu: 4.758 ± 0.746
0.844ValMet: 0.844 ± 0.31
3.914ValAsn: 3.914 ± 0.44
1.995ValPro: 1.995 ± 0.301
1.765ValGln: 1.765 ± 0.409
2.686ValArg: 2.686 ± 0.41
4.221ValSer: 4.221 ± 0.473
4.835ValThr: 4.835 ± 0.584
4.604ValVal: 4.604 ± 0.702
0.691ValTrp: 0.691 ± 0.241
2.839ValTyr: 2.839 ± 0.542
0.0ValXaa: 0.0 ± 0.0
Trp
0.767TrpAla: 0.767 ± 0.197
0.153TrpCys: 0.153 ± 0.103
0.691TrpAsp: 0.691 ± 0.254
0.767TrpGlu: 0.767 ± 0.259
1.228TrpPhe: 1.228 ± 0.512
0.767TrpGly: 0.767 ± 0.232
0.0TrpHis: 0.0 ± 0.0
0.537TrpIle: 0.537 ± 0.229
1.458TrpLys: 1.458 ± 0.303
0.537TrpLeu: 0.537 ± 0.184
0.384TrpMet: 0.384 ± 0.163
0.921TrpAsn: 0.921 ± 0.239
0.153TrpPro: 0.153 ± 0.114
0.998TrpGln: 0.998 ± 0.383
0.23TrpArg: 0.23 ± 0.132
0.384TrpSer: 0.384 ± 0.134
0.537TrpThr: 0.537 ± 0.197
1.381TrpVal: 1.381 ± 0.286
0.153TrpTrp: 0.153 ± 0.079
0.844TrpTyr: 0.844 ± 0.579
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.225TyrAla: 2.225 ± 0.378
0.46TyrCys: 0.46 ± 0.242
2.456TyrAsp: 2.456 ± 0.337
2.225TyrGlu: 2.225 ± 0.516
1.688TyrPhe: 1.688 ± 0.388
1.842TyrGly: 1.842 ± 0.331
0.844TyrHis: 0.844 ± 0.255
2.379TyrIle: 2.379 ± 0.442
3.607TyrLys: 3.607 ± 0.539
2.993TyrLeu: 2.993 ± 0.554
0.537TyrMet: 0.537 ± 0.283
1.535TyrAsn: 1.535 ± 0.355
1.919TyrPro: 1.919 ± 0.486
2.072TyrGln: 2.072 ± 0.389
2.379TyrArg: 2.379 ± 0.469
2.916TyrSer: 2.916 ± 0.629
2.532TyrThr: 2.532 ± 0.425
2.532TyrVal: 2.532 ± 0.545
0.384TyrTrp: 0.384 ± 0.225
1.458TyrTyr: 1.458 ± 0.47
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (13032 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski