Amino acid dipeptide frequency for Streptococcus phage IPP50

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
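A minimal sketch of how such frequencies could be computed is shown below. It assumes that overlapping pairs of adjacent residues are counted within each protein and that counts are normalised by the total number of pairs; the exact procedure used by Proteome-pI is not stated on this page, and the function name and example sequences are illustrative only.

```python
from collections import Counter

# Three-letter codes for the 20 standard amino acids; anything else maps to Xaa.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs never span two proteins, and frequencies are
    normalised by the total number of pairs in the whole set.
    """
    counts = Counter()
    for seq in proteins:
        codes = [THREE_LETTER.get(res, "Xaa") for res in seq.upper()]
        # Overlapping windows of two adjacent residues within one protein.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the phage proteome).
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
print(round(freqs["LysLys"], 3))
```

With the two toy sequences there are six pairs in total, two of which are LysLys, so that dipeptide accounts for one third of all pairs.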

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
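The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and comparing each value directly against the uniform expectation is an assumption about how the colouring is decided; the page does not give the exact threshold.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected frequency (in per mille) of each dipeptide if all
    alphabet_size**2 ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the expectation (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"   # would be marked red
    if observed_permille < expected_permille:
        return "under"  # would be marked blue
    return "as expected"


expected = uniform_expectation_permille()  # 400 dipeptides
print(expected)
# Two values taken from the table below.
print(classify(9.099, expected))  # LysLys
print(classify(0.078, expected))  # CysCys
```

With 20 amino acids the expectation is 1000/400 = 2.5‰; with Xaa included as a 21st symbol it drops to 1000/441, roughly 2.27‰.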

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Each entry below reads "dipeptide: frequency ± error" (in ‰). Entries are grouped by first residue; within each group the second residue follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 3.138 ± 0.785
AlaCys: 0.392 ± 0.185
AlaAsp: 5.413 ± 0.615
AlaGlu: 6.354 ± 0.668
AlaPhe: 2.275 ± 0.594
AlaGly: 4.55 ± 0.97
AlaHis: 0.549 ± 0.233
AlaIle: 4.55 ± 0.931
AlaLys: 6.275 ± 0.598
AlaLeu: 6.511 ± 0.979
AlaMet: 2.275 ± 0.487
AlaAsn: 4.001 ± 0.843
AlaPro: 1.961 ± 0.409
AlaGln: 2.432 ± 0.498
AlaArg: 2.589 ± 0.502
AlaSer: 2.196 ± 0.682
AlaThr: 4.079 ± 0.712
AlaVal: 5.256 ± 0.708
AlaTrp: 1.49 ± 0.412
AlaTyr: 1.726 ± 0.414
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.157 ± 0.094
CysCys: 0.078 ± 0.077
CysAsp: 0.471 ± 0.187
CysGlu: 0.235 ± 0.129
CysPhe: 0.392 ± 0.2
CysGly: 0.078 ± 0.082
CysHis: 0.078 ± 0.078
CysIle: 0.549 ± 0.269
CysLys: 0.784 ± 0.254
CysLeu: 0.471 ± 0.176
CysMet: 0.0 ± 0.0
CysAsn: 0.235 ± 0.171
CysPro: 0.392 ± 0.173
CysGln: 0.314 ± 0.151
CysArg: 0.314 ± 0.14
CysSer: 0.235 ± 0.147
CysThr: 0.078 ± 0.09
CysVal: 0.078 ± 0.085
CysTrp: 0.078 ± 0.077
CysTyr: 0.392 ± 0.159
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.844 ± 0.679
AspCys: 0.549 ± 0.219
AspAsp: 3.138 ± 0.628
AspGlu: 4.471 ± 1.017
AspPhe: 2.981 ± 0.539
AspGly: 4.314 ± 0.556
AspHis: 0.706 ± 0.253
AspIle: 5.491 ± 0.638
AspLys: 5.726 ± 0.789
AspLeu: 5.099 ± 0.654
AspMet: 1.647 ± 0.364
AspAsn: 2.746 ± 0.485
AspPro: 1.647 ± 0.41
AspGln: 1.647 ± 0.401
AspArg: 2.824 ± 0.555
AspSer: 3.922 ± 0.588
AspThr: 3.765 ± 0.491
AspVal: 3.295 ± 0.464
AspTrp: 1.647 ± 0.412
AspTyr: 2.667 ± 0.435
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.04 ± 0.994
GluCys: 0.157 ± 0.105
GluAsp: 4.236 ± 0.672
GluGlu: 6.197 ± 0.997
GluPhe: 3.687 ± 0.583
GluGly: 3.216 ± 0.461
GluHis: 1.334 ± 0.335
GluIle: 5.02 ± 0.514
GluLys: 8.158 ± 1.285
GluLeu: 8.629 ± 0.953
GluMet: 2.196 ± 0.519
GluAsn: 4.158 ± 0.437
GluPro: 1.647 ± 0.444
GluGln: 3.059 ± 0.579
GluArg: 4.393 ± 0.633
GluSer: 4.942 ± 0.717
GluThr: 3.844 ± 0.626
GluVal: 5.413 ± 0.678
GluTrp: 1.098 ± 0.272
GluTyr: 2.902 ± 0.496
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.589 ± 0.523
PheCys: 0.235 ± 0.144
PheAsp: 3.844 ± 0.546
PheGlu: 3.53 ± 0.531
PhePhe: 1.883 ± 0.375
PheGly: 2.353 ± 0.633
PheHis: 0.314 ± 0.155
PheIle: 1.804 ± 0.397
PheLys: 3.373 ± 0.527
PheLeu: 2.824 ± 0.403
PheMet: 1.098 ± 0.311
PheAsn: 2.746 ± 0.665
PhePro: 0.471 ± 0.188
PheGln: 1.177 ± 0.3
PheArg: 1.726 ± 0.333
PheSer: 3.138 ± 0.639
PheThr: 2.51 ± 0.369
PheVal: 1.726 ± 0.323
PheTrp: 0.706 ± 0.243
PheTyr: 1.883 ± 0.433
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.981 ± 0.588
GlyCys: 0.157 ± 0.14
GlyAsp: 3.608 ± 0.627
GlyGlu: 4.707 ± 0.719
GlyPhe: 2.51 ± 0.589
GlyGly: 4.314 ± 1.154
GlyHis: 0.941 ± 0.281
GlyIle: 3.452 ± 0.629
GlyLys: 5.334 ± 0.549
GlyLeu: 5.726 ± 1.201
GlyMet: 1.569 ± 0.303
GlyAsn: 3.373 ± 0.484
GlyPro: 1.02 ± 0.288
GlyGln: 3.216 ± 0.508
GlyArg: 3.373 ± 0.584
GlySer: 3.452 ± 0.822
GlyThr: 2.51 ± 0.455
GlyVal: 4.628 ± 0.797
GlyTrp: 1.02 ± 0.478
GlyTyr: 2.824 ± 0.43
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.549 ± 0.26
HisCys: 0.0 ± 0.0
HisAsp: 0.863 ± 0.293
HisGlu: 1.255 ± 0.282
HisPhe: 0.549 ± 0.166
HisGly: 0.863 ± 0.28
HisHis: 0.235 ± 0.131
HisIle: 0.941 ± 0.383
HisLys: 0.784 ± 0.356
HisLeu: 1.334 ± 0.36
HisMet: 0.078 ± 0.085
HisAsn: 1.02 ± 0.281
HisPro: 0.784 ± 0.244
HisGln: 0.549 ± 0.224
HisArg: 0.863 ± 0.263
HisSer: 1.334 ± 0.43
HisThr: 0.941 ± 0.287
HisVal: 0.784 ± 0.244
HisTrp: 0.157 ± 0.107
HisTyr: 0.706 ± 0.234
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.648 ± 0.688
IleCys: 0.863 ± 0.212
IleAsp: 3.844 ± 0.603
IleGlu: 6.354 ± 0.757
IlePhe: 2.432 ± 0.549
IleGly: 4.158 ± 0.854
IleHis: 0.392 ± 0.192
IleIle: 3.059 ± 0.538
IleLys: 6.432 ± 0.733
IleLeu: 3.844 ± 0.663
IleMet: 1.177 ± 0.377
IleAsn: 3.138 ± 0.54
IlePro: 1.647 ± 0.33
IleGln: 2.746 ± 0.313
IleArg: 2.432 ± 0.447
IleSer: 4.707 ± 0.734
IleThr: 4.707 ± 0.502
IleVal: 3.295 ± 0.6
IleTrp: 0.628 ± 0.223
IleTyr: 2.196 ± 0.555
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.334 ± 0.737
LysCys: 0.471 ± 0.219
LysAsp: 5.648 ± 0.536
LysGlu: 7.687 ± 0.906
LysPhe: 2.981 ± 0.526
LysGly: 4.471 ± 0.669
LysHis: 1.883 ± 0.407
LysIle: 6.275 ± 0.981
LysLys: 9.099 ± 1.087
LysLeu: 7.531 ± 0.55
LysMet: 3.373 ± 0.449
LysAsn: 4.393 ± 0.591
LysPro: 2.589 ± 0.563
LysGln: 3.922 ± 0.708
LysArg: 3.844 ± 0.543
LysSer: 4.393 ± 0.532
LysThr: 5.962 ± 0.68
LysVal: 5.883 ± 0.71
LysTrp: 1.255 ± 0.362
LysTyr: 4.55 ± 0.584
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.825 ± 0.908
LeuCys: 0.471 ± 0.263
LeuAsp: 6.511 ± 0.677
LeuGlu: 6.746 ± 0.886
LeuPhe: 3.295 ± 0.468
LeuGly: 5.726 ± 1.397
LeuHis: 1.334 ± 0.333
LeuIle: 4.158 ± 0.477
LeuLys: 7.609 ± 0.695
LeuLeu: 6.746 ± 0.962
LeuMet: 2.04 ± 0.354
LeuAsn: 3.138 ± 0.551
LeuPro: 2.981 ± 0.524
LeuGln: 3.138 ± 0.635
LeuArg: 3.373 ± 0.499
LeuSer: 5.177 ± 0.839
LeuThr: 5.726 ± 0.769
LeuVal: 4.628 ± 0.658
LeuTrp: 0.784 ± 0.212
LeuTyr: 2.667 ± 0.329
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.726 ± 0.372
MetCys: 0.078 ± 0.07
MetAsp: 1.569 ± 0.306
MetGlu: 2.196 ± 0.485
MetPhe: 1.02 ± 0.275
MetGly: 1.098 ± 0.438
MetHis: 0.314 ± 0.169
MetIle: 1.804 ± 0.423
MetLys: 2.667 ± 0.527
MetLeu: 1.647 ± 0.357
MetMet: 0.471 ± 0.253
MetAsn: 1.804 ± 0.391
MetPro: 0.863 ± 0.266
MetGln: 1.098 ± 0.371
MetArg: 1.49 ± 0.351
MetSer: 1.412 ± 0.378
MetThr: 1.569 ± 0.372
MetVal: 1.334 ± 0.28
MetTrp: 0.314 ± 0.179
MetTyr: 0.863 ± 0.243
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.942 ± 0.883
AsnCys: 0.314 ± 0.136
AsnAsp: 3.138 ± 0.525
AsnGlu: 3.295 ± 0.562
AsnPhe: 1.961 ± 0.439
AsnGly: 4.001 ± 0.626
AsnHis: 0.941 ± 0.288
AsnIle: 2.667 ± 0.447
AsnLys: 4.55 ± 0.648
AsnLeu: 5.02 ± 0.557
AsnMet: 1.177 ± 0.362
AsnAsn: 2.667 ± 0.521
AsnPro: 1.804 ± 0.394
AsnGln: 2.902 ± 0.555
AsnArg: 2.746 ± 0.547
AsnSer: 3.295 ± 0.615
AsnThr: 2.981 ± 0.63
AsnVal: 3.059 ± 0.452
AsnTrp: 0.863 ± 0.232
AsnTyr: 1.804 ± 0.389
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.883 ± 0.436
ProCys: 0.0 ± 0.0
ProAsp: 1.804 ± 0.453
ProGlu: 3.059 ± 0.393
ProPhe: 0.706 ± 0.28
ProGly: 1.255 ± 0.285
ProHis: 0.392 ± 0.144
ProIle: 2.275 ± 0.539
ProLys: 2.981 ± 0.479
ProLeu: 1.334 ± 0.348
ProMet: 0.471 ± 0.194
ProAsn: 1.647 ± 0.471
ProPro: 0.628 ± 0.28
ProGln: 0.941 ± 0.355
ProArg: 1.177 ± 0.275
ProSer: 2.04 ± 0.533
ProThr: 0.706 ± 0.229
ProVal: 1.726 ± 0.338
ProTrp: 0.392 ± 0.213
ProTyr: 1.961 ± 0.501
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.608 ± 0.507
GlnCys: 0.157 ± 0.102
GlnAsp: 1.255 ± 0.283
GlnGlu: 3.216 ± 0.657
GlnPhe: 1.726 ± 0.375
GlnGly: 1.883 ± 0.455
GlnHis: 0.392 ± 0.212
GlnIle: 3.295 ± 0.516
GlnLys: 3.844 ± 0.528
GlnLeu: 3.373 ± 0.497
GlnMet: 1.02 ± 0.234
GlnAsn: 1.569 ± 0.361
GlnPro: 1.177 ± 0.343
GlnGln: 1.49 ± 0.391
GlnArg: 1.726 ± 0.414
GlnSer: 2.353 ± 0.443
GlnThr: 2.824 ± 0.426
GlnVal: 3.53 ± 0.478
GlnTrp: 0.549 ± 0.199
GlnTyr: 0.941 ± 0.313
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.432 ± 0.482
ArgCys: 0.314 ± 0.177
ArgAsp: 2.04 ± 0.436
ArgGlu: 2.824 ± 0.468
ArgPhe: 1.883 ± 0.406
ArgGly: 1.804 ± 0.338
ArgHis: 0.628 ± 0.278
ArgIle: 2.981 ± 0.555
ArgLys: 3.765 ± 0.65
ArgLeu: 4.942 ± 0.748
ArgMet: 2.432 ± 0.47
ArgAsn: 2.981 ± 0.514
ArgPro: 0.941 ± 0.215
ArgGln: 2.04 ± 0.372
ArgArg: 2.275 ± 0.632
ArgSer: 2.746 ± 0.453
ArgThr: 2.824 ± 0.527
ArgVal: 2.589 ± 0.451
ArgTrp: 0.471 ± 0.191
ArgTyr: 1.647 ± 0.39
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.236 ± 0.947
SerCys: 0.235 ± 0.136
SerAsp: 3.687 ± 0.563
SerGlu: 4.236 ± 0.591
SerPhe: 1.883 ± 0.376
SerGly: 5.177 ± 0.694
SerHis: 1.098 ± 0.403
SerIle: 3.844 ± 0.665
SerLys: 4.55 ± 0.665
SerLeu: 5.334 ± 0.713
SerMet: 1.255 ± 0.362
SerAsn: 3.765 ± 0.7
SerPro: 1.412 ± 0.249
SerGln: 2.196 ± 0.41
SerArg: 2.824 ± 0.652
SerSer: 3.138 ± 0.552
SerThr: 4.001 ± 0.589
SerVal: 3.295 ± 0.731
SerTrp: 0.784 ± 0.365
SerTyr: 3.059 ± 0.499
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.393 ± 0.876
ThrCys: 0.235 ± 0.158
ThrAsp: 3.844 ± 0.47
ThrGlu: 4.707 ± 0.634
ThrPhe: 3.216 ± 0.615
ThrGly: 4.079 ± 0.861
ThrHis: 1.177 ± 0.331
ThrIle: 4.393 ± 0.603
ThrLys: 4.864 ± 0.678
ThrLeu: 3.765 ± 0.635
ThrMet: 0.863 ± 0.281
ThrAsn: 4.001 ± 0.504
ThrPro: 1.49 ± 0.52
ThrGln: 2.667 ± 0.573
ThrArg: 1.726 ± 0.358
ThrSer: 4.314 ± 0.486
ThrThr: 4.628 ± 0.931
ThrVal: 4.471 ± 0.793
ThrTrp: 0.706 ± 0.276
ThrTyr: 2.51 ± 0.549
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.942 ± 0.648
ValCys: 0.157 ± 0.1
ValAsp: 4.079 ± 0.593
ValGlu: 5.726 ± 0.724
ValPhe: 1.883 ± 0.403
ValGly: 4.785 ± 0.691
ValHis: 0.863 ± 0.311
ValIle: 3.922 ± 0.644
ValLys: 5.177 ± 0.651
ValLeu: 4.864 ± 0.6
ValMet: 1.02 ± 0.303
ValAsn: 4.158 ± 0.731
ValPro: 1.961 ± 0.342
ValGln: 1.255 ± 0.337
ValArg: 2.196 ± 0.322
ValSer: 4.707 ± 0.759
ValThr: 4.707 ± 0.679
ValVal: 4.785 ± 0.809
ValTrp: 0.314 ± 0.125
ValTyr: 2.118 ± 0.485
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.098 ± 0.354
TrpCys: 0.157 ± 0.106
TrpAsp: 0.863 ± 0.312
TrpGlu: 1.02 ± 0.357
TrpPhe: 1.02 ± 0.441
TrpGly: 0.706 ± 0.226
TrpHis: 0.078 ± 0.071
TrpIle: 0.549 ± 0.208
TrpLys: 1.412 ± 0.331
TrpLeu: 0.784 ± 0.338
TrpMet: 0.392 ± 0.199
TrpAsn: 0.863 ± 0.35
TrpPro: 0.078 ± 0.07
TrpGln: 0.863 ± 0.37
TrpArg: 0.392 ± 0.171
TrpSer: 0.314 ± 0.156
TrpThr: 1.02 ± 0.26
TrpVal: 1.412 ± 0.335
TrpTrp: 0.078 ± 0.069
TrpTyr: 0.706 ± 0.552
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.647 ± 0.294
TyrCys: 0.392 ± 0.158
TyrAsp: 2.275 ± 0.386
TyrGlu: 2.51 ± 0.514
TyrPhe: 1.647 ± 0.328
TyrGly: 1.883 ± 0.377
TyrHis: 0.941 ± 0.273
TyrIle: 2.667 ± 0.524
TyrLys: 4.236 ± 0.668
TyrLeu: 3.295 ± 0.609
TyrMet: 0.706 ± 0.319
TyrAsn: 1.726 ± 0.35
TyrPro: 1.883 ± 0.4
TyrGln: 2.275 ± 0.374
TyrArg: 2.275 ± 0.58
TyrSer: 2.196 ± 0.541
TyrThr: 2.667 ± 0.468
TyrVal: 2.432 ± 0.536
TyrTrp: 0.471 ± 0.221
TyrTyr: 1.647 ± 0.516
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 62 proteins (12749 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
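One way such a protein-level bootstrap could look is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. Using the standard deviation as the error, the fixed seed, and the function names are assumptions made for illustration; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping pairs of adjacent residues in one protein."""
    return Counter(zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille,
    estimated by resampling whole proteins with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences); the pair is given as one-letter codes.
mean, sd = bootstrap_error(["MKKLA", "AKKV", "MLLK"], ("K", "K"))
print(round(mean, 3), round(sd, 3))
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.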

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski