Amino acid dipeptide frequency for Streptococcus phage IPP61

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
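The counting behind such a table can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's actual pipeline: the function name `dipeptide_permille`, the toy sequences, the choice to count overlapping pairs within each protein, and the normalization by the total number of dipeptides are all assumptions made here, and the table below may use a different denominator.

```python
from collections import Counter

STANDARD_AA = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein strings.

    Hypothetical helper: normalizes by the total number of dipeptides counted.
    """
    counts = Counter()
    for seq in proteins:
        # Collapse anything outside the 20 standard residues into X (Xaa).
        seq = "".join(aa if aa in STANDARD_AA else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation quoted in the text: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400

toy_proteome = ["MKLLE", "MKKA"]  # made-up sequences, not from the phage
freqs = dipeptide_permille(toy_proteome)
for pair, value in sorted(freqs.items()):
    label = "over" if value > EXPECTED_PERMILLE else "under"
    print(f"{pair}: {value:.1f} ({label})")
```

With only seven dipeptides in the toy input every observed pair is far above 2.5‰; the comparison becomes meaningful only for a full proteome.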

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
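As a small worked example of the unit conversion, the sketch below turns two values from the table (GluLeu and CysCys) into plain fractions and compares them with the uniform expectation of 1/400. The helper names `permille_to_fraction` and `fold_over_uniform` are hypothetical, and using 400 rather than 441 as the number of possible dipeptides is an assumption taken from the text above.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value into a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_over_uniform(value_permille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation 1/n_dipeptides."""
    return permille_to_fraction(value_permille) * n_dipeptides


glu_leu = 8.175  # GluLeu value from the table, in per mille
cys_cys = 0.054  # CysCys value from the table, in per mille

for name, value in [("GluLeu", glu_leu), ("CysCys", cys_cys)]:
    fraction = permille_to_fraction(value)
    fold = fold_over_uniform(value)
    print(f"{name}: fraction={fraction:.6f}, fold over uniform={fold:.2f}")
```

Under these assumptions GluLeu occurs about 3.3 times more often than the uniform expectation, while CysCys is far below it.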

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.125 ± 0.659
AlaCys: 0.327 ± 0.13
AlaAsp: 5.286 ± 0.478
AlaGlu: 5.559 ± 0.643
AlaPhe: 2.452 ± 0.556
AlaGly: 5.177 ± 0.765
AlaHis: 0.872 ± 0.265
AlaIle: 5.014 ± 0.704
AlaLys: 5.45 ± 0.524
AlaLeu: 5.94 ± 0.669
AlaMet: 2.071 ± 0.413
AlaAsn: 3.488 ± 0.517
AlaPro: 1.853 ± 0.327
AlaGln: 2.289 ± 0.354
AlaArg: 2.507 ± 0.452
AlaSer: 3.161 ± 0.699
AlaThr: 4.305 ± 0.634
AlaVal: 4.305 ± 0.621
AlaTrp: 0.981 ± 0.265
AlaTyr: 2.234 ± 0.328
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.272 ± 0.102
CysCys: 0.054 ± 0.057
CysAsp: 0.272 ± 0.111
CysGlu: 0.381 ± 0.158
CysPhe: 0.327 ± 0.135
CysGly: 0.163 ± 0.103
CysHis: 0.054 ± 0.04
CysIle: 0.49 ± 0.181
CysLys: 0.654 ± 0.2
CysLeu: 0.599 ± 0.17
CysMet: 0.054 ± 0.053
CysAsn: 0.163 ± 0.114
CysPro: 0.218 ± 0.116
CysGln: 0.327 ± 0.126
CysArg: 0.436 ± 0.138
CysSer: 0.327 ± 0.139
CysThr: 0.109 ± 0.085
CysVal: 0.272 ± 0.125
CysTrp: 0.218 ± 0.108
CysTyr: 0.327 ± 0.133
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.779 ± 0.474
AspCys: 0.599 ± 0.214
AspAsp: 3.651 ± 0.543
AspGlu: 4.796 ± 0.789
AspPhe: 3.324 ± 0.431
AspGly: 4.196 ± 0.512
AspHis: 0.599 ± 0.182
AspIle: 4.85 ± 0.507
AspLys: 4.905 ± 0.584
AspLeu: 5.722 ± 0.666
AspMet: 1.58 ± 0.289
AspAsn: 2.452 ± 0.333
AspPro: 1.58 ± 0.312
AspGln: 1.853 ± 0.323
AspArg: 2.507 ± 0.464
AspSer: 3.815 ± 0.399
AspThr: 3.542 ± 0.378
AspVal: 3.488 ± 0.458
AspTrp: 1.199 ± 0.275
AspTyr: 2.452 ± 0.423
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.049 ± 0.786
GluCys: 0.218 ± 0.114
GluAsp: 3.815 ± 0.532
GluGlu: 5.831 ± 0.843
GluPhe: 3.542 ± 0.412
GluGly: 3.542 ± 0.46
GluHis: 1.253 ± 0.34
GluIle: 6.049 ± 0.617
GluLys: 7.139 ± 0.958
GluLeu: 8.175 ± 0.837
GluMet: 1.962 ± 0.372
GluAsn: 4.578 ± 0.55
GluPro: 1.199 ± 0.266
GluGln: 3.27 ± 0.51
GluArg: 3.869 ± 0.477
GluSer: 4.632 ± 0.539
GluThr: 3.76 ± 0.49
GluVal: 5.068 ± 0.598
GluTrp: 0.926 ± 0.233
GluTyr: 3.27 ± 0.45
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.125 ± 0.448
PheCys: 0.272 ± 0.146
PheAsp: 3.379 ± 0.518
PheGlu: 3.215 ± 0.393
PhePhe: 1.962 ± 0.311
PheGly: 2.452 ± 0.512
PheHis: 0.545 ± 0.23
PheIle: 2.234 ± 0.464
PheLys: 3.379 ± 0.34
PheLeu: 3.651 ± 0.489
PheMet: 0.981 ± 0.287
PheAsn: 2.289 ± 0.459
PhePro: 0.981 ± 0.258
PheGln: 1.362 ± 0.306
PheArg: 1.744 ± 0.232
PheSer: 3.161 ± 0.511
PheThr: 2.67 ± 0.284
PheVal: 2.234 ± 0.345
PheTrp: 0.599 ± 0.2
PheTyr: 2.18 ± 0.337
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.779 ± 0.337
GlyCys: 0.218 ± 0.131
GlyAsp: 3.379 ± 0.456
GlyGlu: 4.414 ± 0.584
GlyPhe: 3.161 ± 0.503
GlyGly: 4.142 ± 1.078
GlyHis: 0.654 ± 0.158
GlyIle: 4.087 ± 0.63
GlyLys: 5.395 ± 0.558
GlyLeu: 5.94 ± 0.787
GlyMet: 1.526 ± 0.247
GlyAsn: 3.324 ± 0.368
GlyPro: 0.763 ± 0.227
GlyGln: 2.834 ± 0.362
GlyArg: 3.815 ± 0.515
GlySer: 3.433 ± 0.551
GlyThr: 2.943 ± 0.509
GlyVal: 4.196 ± 0.442
GlyTrp: 0.872 ± 0.339
GlyTyr: 3.161 ± 0.358
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.09 ± 0.33
HisCys: 0.0 ± 0.0
HisAsp: 0.763 ± 0.219
HisGlu: 1.308 ± 0.257
HisPhe: 0.763 ± 0.23
HisGly: 0.708 ± 0.25
HisHis: 0.381 ± 0.163
HisIle: 0.817 ± 0.212
HisLys: 0.599 ± 0.227
HisLeu: 1.253 ± 0.247
HisMet: 0.272 ± 0.112
HisAsn: 0.926 ± 0.228
HisPro: 0.763 ± 0.229
HisGln: 0.599 ± 0.184
HisArg: 0.872 ± 0.233
HisSer: 1.308 ± 0.35
HisThr: 0.981 ± 0.228
HisVal: 0.872 ± 0.253
HisTrp: 0.054 ± 0.057
HisTyr: 0.763 ± 0.217
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.177 ± 0.649
IleCys: 0.545 ± 0.139
IleAsp: 3.869 ± 0.531
IleGlu: 6.104 ± 0.806
IlePhe: 2.616 ± 0.415
IleGly: 5.014 ± 0.803
IleHis: 1.09 ± 0.356
IleIle: 3.106 ± 0.457
IleLys: 5.995 ± 0.515
IleLeu: 5.014 ± 0.588
IleMet: 1.471 ± 0.265
IleAsn: 3.76 ± 0.401
IlePro: 2.016 ± 0.324
IleGln: 2.616 ± 0.334
IleArg: 2.997 ± 0.519
IleSer: 5.613 ± 0.615
IleThr: 4.305 ± 0.496
IleVal: 4.087 ± 0.543
IleTrp: 0.708 ± 0.193
IleTyr: 2.18 ± 0.419
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.014 ± 0.56
LysCys: 0.327 ± 0.122
LysAsp: 5.504 ± 0.468
LysGlu: 7.575 ± 0.771
LysPhe: 2.507 ± 0.4
LysGly: 4.033 ± 0.455
LysHis: 1.253 ± 0.277
LysIle: 5.94 ± 0.713
LysLys: 7.03 ± 0.764
LysLeu: 7.684 ± 0.605
LysMet: 2.234 ± 0.333
LysAsn: 4.251 ± 0.479
LysPro: 2.67 ± 0.384
LysGln: 4.033 ± 0.545
LysArg: 4.414 ± 0.526
LysSer: 3.924 ± 0.474
LysThr: 5.286 ± 0.532
LysVal: 5.559 ± 0.573
LysTrp: 0.763 ± 0.231
LysTyr: 3.433 ± 0.462
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.03 ± 0.723
LeuCys: 0.654 ± 0.211
LeuAsp: 5.232 ± 0.62
LeuGlu: 6.921 ± 0.802
LeuPhe: 2.67 ± 0.345
LeuGly: 5.613 ± 0.876
LeuHis: 1.253 ± 0.265
LeuIle: 4.523 ± 0.59
LeuLys: 8.175 ± 0.65
LeuLeu: 7.466 ± 0.874
LeuMet: 2.071 ± 0.309
LeuAsn: 3.597 ± 0.497
LeuPro: 3.488 ± 0.473
LeuGln: 3.542 ± 0.574
LeuArg: 3.542 ± 0.495
LeuSer: 5.995 ± 0.649
LeuThr: 5.94 ± 0.596
LeuVal: 5.177 ± 0.488
LeuTrp: 0.872 ± 0.209
LeuTyr: 2.997 ± 0.411
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.253 ± 0.262
MetCys: 0.054 ± 0.054
MetAsp: 1.308 ± 0.201
MetGlu: 2.125 ± 0.403
MetPhe: 1.035 ± 0.257
MetGly: 1.199 ± 0.348
MetHis: 0.272 ± 0.153
MetIle: 1.853 ± 0.359
MetLys: 2.779 ± 0.44
MetLeu: 2.016 ± 0.332
MetMet: 0.49 ± 0.173
MetAsn: 1.635 ± 0.318
MetPro: 0.654 ± 0.217
MetGln: 0.763 ± 0.225
MetArg: 1.253 ± 0.256
MetSer: 1.689 ± 0.317
MetThr: 1.58 ± 0.35
MetVal: 1.471 ± 0.288
MetTrp: 0.163 ± 0.101
MetTyr: 0.872 ± 0.18
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.978 ± 0.663
AsnCys: 0.272 ± 0.118
AsnAsp: 2.616 ± 0.372
AsnGlu: 3.324 ± 0.527
AsnPhe: 1.853 ± 0.368
AsnGly: 3.815 ± 0.511
AsnHis: 0.763 ± 0.186
AsnIle: 3.161 ± 0.515
AsnLys: 4.36 ± 0.518
AsnLeu: 3.869 ± 0.514
AsnMet: 1.308 ± 0.298
AsnAsn: 2.343 ± 0.371
AsnPro: 2.071 ± 0.368
AsnGln: 2.725 ± 0.457
AsnArg: 3.161 ± 0.442
AsnSer: 3.052 ± 0.49
AsnThr: 2.507 ± 0.337
AsnVal: 2.779 ± 0.353
AsnTrp: 0.981 ± 0.199
AsnTyr: 2.289 ± 0.387
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.071 ± 0.357
ProCys: 0.109 ± 0.086
ProAsp: 1.907 ± 0.342
ProGlu: 2.67 ± 0.33
ProPhe: 0.872 ± 0.26
ProGly: 0.872 ± 0.185
ProHis: 0.545 ± 0.145
ProIle: 2.452 ± 0.415
ProLys: 2.779 ± 0.434
ProLeu: 1.689 ± 0.408
ProMet: 0.654 ± 0.198
ProAsn: 1.744 ± 0.393
ProPro: 0.817 ± 0.295
ProGln: 0.817 ± 0.187
ProArg: 1.253 ± 0.211
ProSer: 1.798 ± 0.357
ProThr: 1.253 ± 0.285
ProVal: 1.907 ± 0.312
ProTrp: 0.327 ± 0.137
ProTyr: 1.58 ± 0.324
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.706 ± 0.438
GlnCys: 0.218 ± 0.111
GlnAsp: 1.526 ± 0.324
GlnGlu: 3.433 ± 0.553
GlnPhe: 1.471 ± 0.302
GlnGly: 1.689 ± 0.301
GlnHis: 0.654 ± 0.224
GlnIle: 3.161 ± 0.475
GlnLys: 3.161 ± 0.412
GlnLeu: 2.997 ± 0.376
GlnMet: 0.872 ± 0.189
GlnAsn: 1.798 ± 0.266
GlnPro: 0.926 ± 0.269
GlnGln: 2.016 ± 0.446
GlnArg: 2.18 ± 0.421
GlnSer: 3.052 ± 0.41
GlnThr: 3.106 ± 0.399
GlnVal: 3.597 ± 0.385
GlnTrp: 0.545 ± 0.153
GlnTyr: 1.199 ± 0.273
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.943 ± 0.421
ArgCys: 0.49 ± 0.142
ArgAsp: 2.234 ± 0.345
ArgGlu: 2.779 ± 0.442
ArgPhe: 1.798 ± 0.348
ArgGly: 2.18 ± 0.333
ArgHis: 0.817 ± 0.236
ArgIle: 3.215 ± 0.405
ArgLys: 3.978 ± 0.556
ArgLeu: 5.123 ± 0.573
ArgMet: 2.071 ± 0.325
ArgAsn: 2.234 ± 0.444
ArgPro: 1.526 ± 0.376
ArgGln: 2.18 ± 0.392
ArgArg: 2.507 ± 0.392
ArgSer: 3.161 ± 0.467
ArgThr: 2.67 ± 0.439
ArgVal: 2.452 ± 0.363
ArgTrp: 0.436 ± 0.141
ArgTyr: 2.125 ± 0.332
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.414 ± 0.79
SerCys: 0.218 ± 0.114
SerAsp: 3.869 ± 0.391
SerGlu: 4.578 ± 0.52
SerPhe: 3.106 ± 0.408
SerGly: 4.741 ± 0.584
SerHis: 1.09 ± 0.305
SerIle: 4.687 ± 0.571
SerLys: 4.087 ± 0.571
SerLeu: 6.049 ± 0.57
SerMet: 1.362 ± 0.385
SerAsn: 3.106 ± 0.533
SerPro: 1.471 ± 0.253
SerGln: 2.616 ± 0.414
SerArg: 2.67 ± 0.421
SerSer: 3.978 ± 0.511
SerThr: 4.251 ± 0.491
SerVal: 4.142 ± 0.522
SerTrp: 0.926 ± 0.304
SerTyr: 2.779 ± 0.425
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.414 ± 0.754
ThrCys: 0.218 ± 0.111
ThrAsp: 4.033 ± 0.423
ThrGlu: 4.142 ± 0.444
ThrPhe: 3.161 ± 0.462
ThrGly: 3.706 ± 0.537
ThrHis: 1.144 ± 0.328
ThrIle: 4.632 ± 0.53
ThrLys: 4.523 ± 0.628
ThrLeu: 4.687 ± 0.526
ThrMet: 0.763 ± 0.187
ThrAsn: 3.379 ± 0.429
ThrPro: 1.58 ± 0.36
ThrGln: 2.725 ± 0.521
ThrArg: 1.962 ± 0.284
ThrSer: 3.869 ± 0.463
ThrThr: 4.687 ± 0.714
ThrVal: 4.523 ± 0.581
ThrTrp: 0.49 ± 0.207
ThrTyr: 2.452 ± 0.484
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.959 ± 0.601
ValCys: 0.436 ± 0.165
ValAsp: 3.869 ± 0.464
ValGlu: 5.395 ± 0.416
ValPhe: 2.016 ± 0.279
ValGly: 4.523 ± 0.504
ValHis: 0.817 ± 0.202
ValIle: 4.578 ± 0.493
ValLys: 5.123 ± 0.557
ValLeu: 4.959 ± 0.549
ValMet: 1.362 ± 0.289
ValAsn: 3.488 ± 0.564
ValPro: 1.689 ± 0.282
ValGln: 1.962 ± 0.424
ValArg: 2.725 ± 0.345
ValSer: 4.796 ± 0.497
ValThr: 4.087 ± 0.506
ValVal: 3.924 ± 0.626
ValTrp: 0.545 ± 0.181
ValTyr: 2.67 ± 0.495
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.981 ± 0.277
TrpCys: 0.109 ± 0.08
TrpAsp: 0.545 ± 0.242
TrpGlu: 1.035 ± 0.264
TrpPhe: 0.981 ± 0.319
TrpGly: 0.436 ± 0.145
TrpHis: 0.054 ± 0.059
TrpIle: 0.436 ± 0.183
TrpLys: 1.09 ± 0.226
TrpLeu: 0.708 ± 0.265
TrpMet: 0.218 ± 0.106
TrpAsn: 0.817 ± 0.231
TrpPro: 0.054 ± 0.057
TrpGln: 0.817 ± 0.218
TrpArg: 0.327 ± 0.16
TrpSer: 0.545 ± 0.149
TrpThr: 0.981 ± 0.209
TrpVal: 1.035 ± 0.286
TrpTrp: 0.163 ± 0.09
TrpTyr: 0.654 ± 0.369
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.234 ± 0.373
TyrCys: 0.381 ± 0.123
TyrAsp: 2.834 ± 0.506
TyrGlu: 2.616 ± 0.471
TyrPhe: 1.798 ± 0.322
TyrGly: 2.834 ± 0.493
TyrHis: 0.981 ± 0.217
TyrIle: 2.997 ± 0.485
TyrLys: 2.834 ± 0.469
TyrLeu: 3.215 ± 0.417
TyrMet: 1.09 ± 0.29
TyrAsn: 1.853 ± 0.299
TyrPro: 1.798 ± 0.289
TyrGln: 1.962 ± 0.319
TyrArg: 2.18 ± 0.421
TyrSer: 2.834 ± 0.432
TyrThr: 2.125 ± 0.283
TyrVal: 2.834 ± 0.458
TyrTrp: 0.218 ± 0.094
TyrTyr: 1.962 ± 0.421
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (18350 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
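One way such a protein-level bootstrap could work is sketched below. This is a hedged illustration, not the procedure actually used by the database: the function names, the use of the sample standard deviation as the reported error, the fixed random seed, and the normalization by total dipeptide count are all assumptions. Only the idea of resampling whole proteins and the figure of 100 resamples come from the note above.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over a list of protein strings."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Hypothetical sketch: each resample draws len(proteins) whole proteins
    with replacement, so residues within a protein stay together.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


toy_proteome = ["MKLLE", "MKKA", "AALLK", "GLLKE"]  # made-up sequences
value = dipeptide_permille(toy_proteome, "LL")
error = bootstrap_error(toy_proteome, "LL")
print(f"LL: {value:.3f} ± {error:.3f}")
```

A dipeptide that never occurs has frequency 0 in every resample, so its estimated error is also 0, which matches the `0.0 ± 0.0` entries in the Xaa rows and columns.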

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski