Amino acid dipepetide frequency for Streptococcus phage IPP69

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.108AlaAla: 2.108 ± 0.761
0.312AlaCys: 0.312 ± 0.195
5.621AlaAsp: 5.621 ± 0.69
6.558AlaGlu: 6.558 ± 0.975
2.108AlaPhe: 2.108 ± 0.677
5.778AlaGly: 5.778 ± 1.018
0.547AlaHis: 0.547 ± 0.204
4.06AlaIle: 4.06 ± 0.605
5.778AlaLys: 5.778 ± 0.76
6.402AlaLeu: 6.402 ± 0.921
2.186AlaMet: 2.186 ± 0.39
3.67AlaAsn: 3.67 ± 0.661
1.952AlaPro: 1.952 ± 0.345
2.577AlaGln: 2.577 ± 0.452
3.123AlaArg: 3.123 ± 0.579
2.967AlaSer: 2.967 ± 1.045
4.685AlaThr: 4.685 ± 0.667
4.216AlaVal: 4.216 ± 0.625
1.093AlaTrp: 1.093 ± 0.196
1.483AlaTyr: 1.483 ± 0.305
0.0AlaXaa: 0.0 ± 0.0
Cys
0.39CysAla: 0.39 ± 0.198
0.234CysCys: 0.234 ± 0.151
0.468CysAsp: 0.468 ± 0.186
0.39CysGlu: 0.39 ± 0.141
0.156CysPhe: 0.156 ± 0.12
0.468CysGly: 0.468 ± 0.214
0.234CysHis: 0.234 ± 0.125
0.859CysIle: 0.859 ± 0.352
0.781CysLys: 0.781 ± 0.226
0.312CysLeu: 0.312 ± 0.139
0.078CysMet: 0.078 ± 0.084
0.156CysAsn: 0.156 ± 0.189
0.234CysPro: 0.234 ± 0.137
0.312CysGln: 0.312 ± 0.15
0.234CysArg: 0.234 ± 0.118
0.39CysSer: 0.39 ± 0.179
0.156CysThr: 0.156 ± 0.106
0.078CysVal: 0.078 ± 0.084
0.234CysTrp: 0.234 ± 0.131
0.312CysTyr: 0.312 ± 0.162
0.0CysXaa: 0.0 ± 0.0
Asp
3.357AspAla: 3.357 ± 0.624
0.468AspCys: 0.468 ± 0.227
3.045AspAsp: 3.045 ± 0.658
4.919AspGlu: 4.919 ± 1.246
2.811AspPhe: 2.811 ± 0.476
4.919AspGly: 4.919 ± 0.519
0.312AspHis: 0.312 ± 0.162
5.231AspIle: 5.231 ± 0.508
5.231AspLys: 5.231 ± 0.727
5.621AspLeu: 5.621 ± 0.494
1.718AspMet: 1.718 ± 0.377
3.045AspAsn: 3.045 ± 0.478
2.03AspPro: 2.03 ± 0.493
1.796AspGln: 1.796 ± 0.367
2.889AspArg: 2.889 ± 0.468
3.201AspSer: 3.201 ± 0.475
3.826AspThr: 3.826 ± 0.476
3.592AspVal: 3.592 ± 0.485
1.64AspTrp: 1.64 ± 0.406
2.811AspTyr: 2.811 ± 0.466
0.0AspXaa: 0.0 ± 0.0
Glu
6.636GluAla: 6.636 ± 1.003
0.547GluCys: 0.547 ± 0.206
4.06GluAsp: 4.06 ± 0.776
7.027GluGlu: 7.027 ± 1.164
3.904GluPhe: 3.904 ± 0.749
4.216GluGly: 4.216 ± 0.577
0.781GluHis: 0.781 ± 0.254
6.558GluIle: 6.558 ± 0.529
6.168GluLys: 6.168 ± 0.954
9.916GluLeu: 9.916 ± 1.011
2.186GluMet: 2.186 ± 0.556
4.06GluAsn: 4.06 ± 0.668
1.718GluPro: 1.718 ± 0.456
3.748GluGln: 3.748 ± 0.64
4.841GluArg: 4.841 ± 0.601
4.45GluSer: 4.45 ± 0.492
3.592GluThr: 3.592 ± 0.491
5.387GluVal: 5.387 ± 0.725
1.015GluTrp: 1.015 ± 0.31
3.123GluTyr: 3.123 ± 0.549
0.0GluXaa: 0.0 ± 0.0
Phe
2.186PheAla: 2.186 ± 0.532
0.312PheCys: 0.312 ± 0.168
4.685PheAsp: 4.685 ± 0.727
3.982PheGlu: 3.982 ± 0.695
1.093PhePhe: 1.093 ± 0.28
2.186PheGly: 2.186 ± 0.661
0.312PheHis: 0.312 ± 0.144
2.42PheIle: 2.42 ± 0.485
3.513PheLys: 3.513 ± 0.517
2.264PheLeu: 2.264 ± 0.362
1.249PheMet: 1.249 ± 0.347
3.201PheAsn: 3.201 ± 0.533
0.781PhePro: 0.781 ± 0.288
1.718PheGln: 1.718 ± 0.332
1.093PheArg: 1.093 ± 0.235
3.045PheSer: 3.045 ± 0.517
2.186PheThr: 2.186 ± 0.348
1.483PheVal: 1.483 ± 0.27
0.468PheTrp: 0.468 ± 0.165
1.796PheTyr: 1.796 ± 0.316
0.0PheXaa: 0.0 ± 0.0
Gly
3.279GlyAla: 3.279 ± 0.397
0.156GlyCys: 0.156 ± 0.09
3.748GlyAsp: 3.748 ± 0.666
3.982GlyGlu: 3.982 ± 0.64
2.342GlyPhe: 2.342 ± 0.404
4.45GlyGly: 4.45 ± 1.044
0.781GlyHis: 0.781 ± 0.206
4.216GlyIle: 4.216 ± 0.564
4.685GlyLys: 4.685 ± 0.484
5.231GlyLeu: 5.231 ± 0.75
1.64GlyMet: 1.64 ± 0.275
3.435GlyAsn: 3.435 ± 0.511
0.703GlyPro: 0.703 ± 0.312
3.67GlyGln: 3.67 ± 0.494
4.45GlyArg: 4.45 ± 0.58
4.138GlySer: 4.138 ± 0.739
3.201GlyThr: 3.201 ± 0.507
3.904GlyVal: 3.904 ± 0.493
0.937GlyTrp: 0.937 ± 0.344
3.826GlyTyr: 3.826 ± 0.606
0.0GlyXaa: 0.0 ± 0.0
His
0.703HisAla: 0.703 ± 0.263
0.156HisCys: 0.156 ± 0.117
0.468HisAsp: 0.468 ± 0.23
1.249HisGlu: 1.249 ± 0.308
0.625HisPhe: 0.625 ± 0.169
0.703HisGly: 0.703 ± 0.212
0.078HisHis: 0.078 ± 0.075
0.625HisIle: 0.625 ± 0.319
0.547HisLys: 0.547 ± 0.206
0.859HisLeu: 0.859 ± 0.239
0.312HisMet: 0.312 ± 0.164
0.937HisAsn: 0.937 ± 0.252
0.468HisPro: 0.468 ± 0.171
0.547HisGln: 0.547 ± 0.206
1.015HisArg: 1.015 ± 0.304
1.483HisSer: 1.483 ± 0.428
0.547HisThr: 0.547 ± 0.192
0.781HisVal: 0.781 ± 0.192
0.234HisTrp: 0.234 ± 0.145
0.703HisTyr: 0.703 ± 0.227
0.0HisXaa: 0.0 ± 0.0
Ile
4.997IleAla: 4.997 ± 0.678
0.547IleCys: 0.547 ± 0.184
3.592IleAsp: 3.592 ± 0.472
6.168IleGlu: 6.168 ± 0.628
2.811IlePhe: 2.811 ± 0.548
4.606IleGly: 4.606 ± 0.774
0.625IleHis: 0.625 ± 0.266
3.435IleIle: 3.435 ± 0.436
6.012IleLys: 6.012 ± 0.639
4.685IleLeu: 4.685 ± 0.631
1.483IleMet: 1.483 ± 0.378
3.513IleAsn: 3.513 ± 0.418
1.796IlePro: 1.796 ± 0.311
2.108IleGln: 2.108 ± 0.387
3.592IleArg: 3.592 ± 0.645
6.012IleSer: 6.012 ± 0.673
4.294IleThr: 4.294 ± 0.434
2.967IleVal: 2.967 ± 0.519
0.547IleTrp: 0.547 ± 0.197
2.03IleTyr: 2.03 ± 0.569
0.0IleXaa: 0.0 ± 0.0
Lys
4.528LysAla: 4.528 ± 0.785
0.625LysCys: 0.625 ± 0.287
6.012LysAsp: 6.012 ± 0.535
7.73LysGlu: 7.73 ± 0.962
3.045LysPhe: 3.045 ± 0.705
4.763LysGly: 4.763 ± 0.631
1.093LysHis: 1.093 ± 0.282
6.324LysIle: 6.324 ± 0.661
6.871LysLys: 6.871 ± 0.828
7.73LysLeu: 7.73 ± 0.8
2.577LysMet: 2.577 ± 0.49
4.606LysAsn: 4.606 ± 0.449
2.577LysPro: 2.577 ± 0.532
3.592LysGln: 3.592 ± 0.562
3.904LysArg: 3.904 ± 0.441
4.45LysSer: 4.45 ± 0.623
4.763LysThr: 4.763 ± 0.48
5.543LysVal: 5.543 ± 0.623
0.937LysTrp: 0.937 ± 0.3
3.045LysTyr: 3.045 ± 0.533
0.0LysXaa: 0.0 ± 0.0
Leu
6.715LeuAla: 6.715 ± 0.981
0.625LeuCys: 0.625 ± 0.291
5.075LeuAsp: 5.075 ± 0.541
7.886LeuGlu: 7.886 ± 0.919
3.201LeuPhe: 3.201 ± 0.521
5.075LeuGly: 5.075 ± 1.055
1.249LeuHis: 1.249 ± 0.263
4.528LeuIle: 4.528 ± 0.601
7.495LeuLys: 7.495 ± 0.808
7.886LeuLeu: 7.886 ± 0.971
2.03LeuMet: 2.03 ± 0.454
4.606LeuAsn: 4.606 ± 0.635
2.498LeuPro: 2.498 ± 0.601
3.045LeuGln: 3.045 ± 0.583
4.372LeuArg: 4.372 ± 0.557
5.309LeuSer: 5.309 ± 0.785
5.387LeuThr: 5.387 ± 0.691
4.216LeuVal: 4.216 ± 0.665
0.703LeuTrp: 0.703 ± 0.206
2.577LeuTyr: 2.577 ± 0.378
0.0LeuXaa: 0.0 ± 0.0
Met
1.952MetAla: 1.952 ± 0.471
0.0MetCys: 0.0 ± 0.0
1.562MetAsp: 1.562 ± 0.257
2.03MetGlu: 2.03 ± 0.493
1.249MetPhe: 1.249 ± 0.279
1.327MetGly: 1.327 ± 0.369
0.39MetHis: 0.39 ± 0.183
1.562MetIle: 1.562 ± 0.297
2.342MetLys: 2.342 ± 0.462
1.562MetLeu: 1.562 ± 0.371
0.39MetMet: 0.39 ± 0.2
1.718MetAsn: 1.718 ± 0.456
0.937MetPro: 0.937 ± 0.343
0.781MetGln: 0.781 ± 0.257
1.171MetArg: 1.171 ± 0.295
1.796MetSer: 1.796 ± 0.417
1.874MetThr: 1.874 ± 0.411
1.405MetVal: 1.405 ± 0.279
0.156MetTrp: 0.156 ± 0.119
0.703MetTyr: 0.703 ± 0.183
0.0MetXaa: 0.0 ± 0.0
Asn
3.826AsnAla: 3.826 ± 0.698
0.234AsnCys: 0.234 ± 0.116
3.357AsnAsp: 3.357 ± 0.421
3.435AsnGlu: 3.435 ± 0.631
2.186AsnPhe: 2.186 ± 0.499
3.982AsnGly: 3.982 ± 0.546
1.171AsnHis: 1.171 ± 0.274
2.733AsnIle: 2.733 ± 0.374
4.685AsnLys: 4.685 ± 0.51
4.528AsnLeu: 4.528 ± 0.558
1.327AsnMet: 1.327 ± 0.369
1.718AsnAsn: 1.718 ± 0.46
2.03AsnPro: 2.03 ± 0.299
2.811AsnGln: 2.811 ± 0.587
2.733AsnArg: 2.733 ± 0.515
3.123AsnSer: 3.123 ± 0.678
2.577AsnThr: 2.577 ± 0.415
3.279AsnVal: 3.279 ± 0.565
1.015AsnTrp: 1.015 ± 0.203
1.796AsnTyr: 1.796 ± 0.427
0.0AsnXaa: 0.0 ± 0.0
Pro
2.03ProAla: 2.03 ± 0.385
0.0ProCys: 0.0 ± 0.0
1.952ProAsp: 1.952 ± 0.453
3.201ProGlu: 3.201 ± 0.473
1.405ProPhe: 1.405 ± 0.481
0.625ProGly: 0.625 ± 0.188
0.39ProHis: 0.39 ± 0.147
1.483ProIle: 1.483 ± 0.462
3.592ProLys: 3.592 ± 0.421
1.405ProLeu: 1.405 ± 0.39
0.468ProMet: 0.468 ± 0.197
1.64ProAsn: 1.64 ± 0.34
0.703ProPro: 0.703 ± 0.205
1.015ProGln: 1.015 ± 0.386
1.483ProArg: 1.483 ± 0.446
1.171ProSer: 1.171 ± 0.339
1.093ProThr: 1.093 ± 0.289
1.874ProVal: 1.874 ± 0.323
0.468ProTrp: 0.468 ± 0.237
1.327ProTyr: 1.327 ± 0.381
0.0ProXaa: 0.0 ± 0.0
Gln
3.748GlnAla: 3.748 ± 0.558
0.312GlnCys: 0.312 ± 0.142
2.42GlnAsp: 2.42 ± 0.332
3.826GlnGlu: 3.826 ± 0.621
1.093GlnPhe: 1.093 ± 0.324
1.562GlnGly: 1.562 ± 0.373
0.312GlnHis: 0.312 ± 0.143
3.045GlnIle: 3.045 ± 0.438
3.748GlnLys: 3.748 ± 0.519
3.201GlnLeu: 3.201 ± 0.417
0.625GlnMet: 0.625 ± 0.165
1.718GlnAsn: 1.718 ± 0.286
0.859GlnPro: 0.859 ± 0.277
1.64GlnGln: 1.64 ± 0.431
2.264GlnArg: 2.264 ± 0.476
2.889GlnSer: 2.889 ± 0.307
2.342GlnThr: 2.342 ± 0.491
3.357GlnVal: 3.357 ± 0.444
0.39GlnTrp: 0.39 ± 0.157
1.249GlnTyr: 1.249 ± 0.348
0.0GlnXaa: 0.0 ± 0.0
Arg
3.279ArgAla: 3.279 ± 0.529
0.312ArgCys: 0.312 ± 0.133
2.498ArgAsp: 2.498 ± 0.457
3.435ArgGlu: 3.435 ± 0.585
1.718ArgPhe: 1.718 ± 0.42
2.108ArgGly: 2.108 ± 0.472
0.781ArgHis: 0.781 ± 0.26
3.513ArgIle: 3.513 ± 0.436
4.216ArgLys: 4.216 ± 0.768
4.919ArgLeu: 4.919 ± 0.727
1.952ArgMet: 1.952 ± 0.414
2.577ArgAsn: 2.577 ± 0.582
1.405ArgPro: 1.405 ± 0.334
2.577ArgGln: 2.577 ± 0.548
2.108ArgArg: 2.108 ± 0.491
2.264ArgSer: 2.264 ± 0.42
2.967ArgThr: 2.967 ± 0.684
3.201ArgVal: 3.201 ± 0.46
0.781ArgTrp: 0.781 ± 0.265
2.342ArgTyr: 2.342 ± 0.479
0.0ArgXaa: 0.0 ± 0.0
Ser
4.763SerAla: 4.763 ± 0.729
0.312SerCys: 0.312 ± 0.131
3.201SerAsp: 3.201 ± 0.581
4.685SerGlu: 4.685 ± 0.614
2.264SerPhe: 2.264 ± 0.447
5.153SerGly: 5.153 ± 0.701
1.171SerHis: 1.171 ± 0.327
3.904SerIle: 3.904 ± 0.521
4.372SerLys: 4.372 ± 0.665
5.387SerLeu: 5.387 ± 0.57
1.171SerMet: 1.171 ± 0.361
2.889SerAsn: 2.889 ± 0.501
1.405SerPro: 1.405 ± 0.292
2.186SerGln: 2.186 ± 0.354
3.592SerArg: 3.592 ± 0.727
3.592SerSer: 3.592 ± 0.67
4.294SerThr: 4.294 ± 0.45
3.435SerVal: 3.435 ± 0.67
1.015SerTrp: 1.015 ± 0.438
2.655SerTyr: 2.655 ± 0.53
0.0SerXaa: 0.0 ± 0.0
Thr
5.153ThrAla: 5.153 ± 0.997
0.156ThrCys: 0.156 ± 0.12
4.294ThrAsp: 4.294 ± 0.695
4.138ThrGlu: 4.138 ± 0.424
3.045ThrPhe: 3.045 ± 0.519
3.513ThrGly: 3.513 ± 0.587
0.781ThrHis: 0.781 ± 0.268
4.216ThrIle: 4.216 ± 0.614
4.294ThrLys: 4.294 ± 0.701
4.685ThrLeu: 4.685 ± 0.593
0.859ThrMet: 0.859 ± 0.263
2.733ThrAsn: 2.733 ± 0.523
1.093ThrPro: 1.093 ± 0.297
2.733ThrGln: 2.733 ± 0.753
1.405ThrArg: 1.405 ± 0.351
3.904ThrSer: 3.904 ± 0.568
4.528ThrThr: 4.528 ± 0.823
4.606ThrVal: 4.606 ± 0.64
1.093ThrTrp: 1.093 ± 0.311
2.186ThrTyr: 2.186 ± 0.483
0.0ThrXaa: 0.0 ± 0.0
Val
4.841ValAla: 4.841 ± 0.59
0.312ValCys: 0.312 ± 0.146
3.045ValAsp: 3.045 ± 0.497
5.309ValGlu: 5.309 ± 0.655
1.874ValPhe: 1.874 ± 0.376
4.138ValGly: 4.138 ± 0.612
1.015ValHis: 1.015 ± 0.309
3.435ValIle: 3.435 ± 0.557
5.621ValLys: 5.621 ± 0.515
3.982ValLeu: 3.982 ± 0.569
1.483ValMet: 1.483 ± 0.418
3.592ValAsn: 3.592 ± 0.53
2.264ValPro: 2.264 ± 0.368
1.562ValGln: 1.562 ± 0.383
2.577ValArg: 2.577 ± 0.325
4.372ValSer: 4.372 ± 0.458
4.216ValThr: 4.216 ± 0.707
3.592ValVal: 3.592 ± 0.524
1.015ValTrp: 1.015 ± 0.277
2.42ValTyr: 2.42 ± 0.54
0.0ValXaa: 0.0 ± 0.0
Trp
0.937TrpAla: 0.937 ± 0.303
0.156TrpCys: 0.156 ± 0.1
0.937TrpAsp: 0.937 ± 0.289
0.937TrpGlu: 0.937 ± 0.344
0.937TrpPhe: 0.937 ± 0.498
0.703TrpGly: 0.703 ± 0.217
0.156TrpHis: 0.156 ± 0.114
0.859TrpIle: 0.859 ± 0.228
1.171TrpLys: 1.171 ± 0.291
0.937TrpLeu: 0.937 ± 0.422
0.468TrpMet: 0.468 ± 0.202
1.171TrpAsn: 1.171 ± 0.249
0.078TrpPro: 0.078 ± 0.075
0.547TrpGln: 0.547 ± 0.233
0.859TrpArg: 0.859 ± 0.3
0.547TrpSer: 0.547 ± 0.154
0.859TrpThr: 0.859 ± 0.238
1.015TrpVal: 1.015 ± 0.292
0.234TrpTrp: 0.234 ± 0.131
0.781TrpTyr: 0.781 ± 0.412
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.796TyrAla: 1.796 ± 0.314
0.703TyrCys: 0.703 ± 0.24
2.42TyrAsp: 2.42 ± 0.497
3.045TyrGlu: 3.045 ± 0.566
2.108TyrPhe: 2.108 ± 0.489
2.498TyrGly: 2.498 ± 0.366
0.859TyrHis: 0.859 ± 0.223
2.811TyrIle: 2.811 ± 0.453
3.592TyrLys: 3.592 ± 0.597
2.967TyrLeu: 2.967 ± 0.515
0.625TyrMet: 0.625 ± 0.242
1.562TyrAsn: 1.562 ± 0.385
1.796TyrPro: 1.796 ± 0.394
1.64TyrGln: 1.64 ± 0.411
1.249TyrArg: 1.249 ± 0.305
2.42TyrSer: 2.42 ± 0.519
2.03TyrThr: 2.03 ± 0.364
2.733TyrVal: 2.733 ± 0.526
0.312TyrTrp: 0.312 ± 0.184
1.796TyrTyr: 1.796 ± 0.522
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (12809 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski