Amino acid dipepetide frequency for Streptococcus phage IPP24

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.704AlaAla: 3.704 ± 1.512
0.278AlaCys: 0.278 ± 0.173
4.352AlaAsp: 4.352 ± 0.598
6.112AlaGlu: 6.112 ± 0.694
2.408AlaPhe: 2.408 ± 0.646
5.278AlaGly: 5.278 ± 1.788
0.926AlaHis: 0.926 ± 0.306
5.186AlaIle: 5.186 ± 0.921
5.186AlaLys: 5.186 ± 0.681
6.482AlaLeu: 6.482 ± 1.379
2.222AlaMet: 2.222 ± 0.488
3.241AlaAsn: 3.241 ± 0.539
1.574AlaPro: 1.574 ± 0.324
3.519AlaGln: 3.519 ± 0.778
2.408AlaArg: 2.408 ± 0.62
4.167AlaSer: 4.167 ± 0.769
3.241AlaThr: 3.241 ± 0.812
4.26AlaVal: 4.26 ± 1.258
0.741AlaTrp: 0.741 ± 0.254
2.408AlaTyr: 2.408 ± 0.455
0.0AlaXaa: 0.0 ± 0.0
Cys
0.37CysAla: 0.37 ± 0.169
0.0CysCys: 0.0 ± 0.0
0.185CysAsp: 0.185 ± 0.145
0.463CysGlu: 0.463 ± 0.2
0.278CysPhe: 0.278 ± 0.218
0.741CysGly: 0.741 ± 0.414
0.185CysHis: 0.185 ± 0.115
0.278CysIle: 0.278 ± 0.158
0.556CysLys: 0.556 ± 0.227
0.648CysLeu: 0.648 ± 0.208
0.093CysMet: 0.093 ± 0.1
0.093CysAsn: 0.093 ± 0.088
0.37CysPro: 0.37 ± 0.204
0.278CysGln: 0.278 ± 0.189
0.37CysArg: 0.37 ± 0.242
0.833CysSer: 0.833 ± 0.431
0.185CysThr: 0.185 ± 0.131
0.278CysVal: 0.278 ± 0.163
0.0CysTrp: 0.0 ± 0.0
0.278CysTyr: 0.278 ± 0.162
0.0CysXaa: 0.0 ± 0.0
Asp
3.334AspAla: 3.334 ± 0.599
0.37AspCys: 0.37 ± 0.251
3.519AspAsp: 3.519 ± 0.582
5.556AspGlu: 5.556 ± 0.76
3.797AspPhe: 3.797 ± 0.662
5.741AspGly: 5.741 ± 1.084
0.741AspHis: 0.741 ± 0.251
3.982AspIle: 3.982 ± 0.636
4.537AspLys: 4.537 ± 0.66
4.723AspLeu: 4.723 ± 0.572
2.13AspMet: 2.13 ± 0.389
3.519AspAsn: 3.519 ± 0.548
1.759AspPro: 1.759 ± 0.377
0.741AspGln: 0.741 ± 0.272
3.148AspArg: 3.148 ± 0.581
3.056AspSer: 3.056 ± 0.481
3.519AspThr: 3.519 ± 0.432
3.797AspVal: 3.797 ± 0.583
0.463AspTrp: 0.463 ± 0.204
2.963AspTyr: 2.963 ± 0.497
0.0AspXaa: 0.0 ± 0.0
Glu
5.278GluAla: 5.278 ± 0.71
0.556GluCys: 0.556 ± 0.274
3.611GluAsp: 3.611 ± 0.528
7.038GluGlu: 7.038 ± 1.11
3.334GluPhe: 3.334 ± 0.76
3.797GluGly: 3.797 ± 0.549
1.759GluHis: 1.759 ± 0.417
6.667GluIle: 6.667 ± 1.217
7.408GluLys: 7.408 ± 1.244
7.501GluLeu: 7.501 ± 0.823
2.5GluMet: 2.5 ± 0.495
4.815GluAsn: 4.815 ± 0.748
1.852GluPro: 1.852 ± 0.496
3.889GluGln: 3.889 ± 0.703
4.074GluArg: 4.074 ± 0.631
3.704GluSer: 3.704 ± 0.811
3.148GluThr: 3.148 ± 0.559
5.926GluVal: 5.926 ± 0.896
0.741GluTrp: 0.741 ± 0.241
2.778GluTyr: 2.778 ± 0.567
0.0GluXaa: 0.0 ± 0.0
Phe
2.315PheAla: 2.315 ± 0.61
0.278PheCys: 0.278 ± 0.175
5.093PheAsp: 5.093 ± 0.701
4.815PheGlu: 4.815 ± 0.855
1.852PhePhe: 1.852 ± 0.468
3.056PheGly: 3.056 ± 0.61
0.278PheHis: 0.278 ± 0.161
3.704PheIle: 3.704 ± 0.713
2.778PheLys: 2.778 ± 0.494
2.685PheLeu: 2.685 ± 0.489
0.741PheMet: 0.741 ± 0.234
1.852PheAsn: 1.852 ± 0.518
0.37PhePro: 0.37 ± 0.149
1.574PheGln: 1.574 ± 0.36
1.852PheArg: 1.852 ± 0.432
3.148PheSer: 3.148 ± 0.596
1.296PheThr: 1.296 ± 0.443
2.871PheVal: 2.871 ± 0.454
0.185PheTrp: 0.185 ± 0.124
1.667PheTyr: 1.667 ± 0.361
0.0PheXaa: 0.0 ± 0.0
Gly
4.908GlyAla: 4.908 ± 2.006
0.556GlyCys: 0.556 ± 0.254
4.167GlyAsp: 4.167 ± 0.639
3.334GlyGlu: 3.334 ± 0.56
3.797GlyPhe: 3.797 ± 0.643
3.519GlyGly: 3.519 ± 0.601
1.296GlyHis: 1.296 ± 0.381
5.278GlyIle: 5.278 ± 0.832
5.093GlyLys: 5.093 ± 0.591
5.926GlyLeu: 5.926 ± 1.069
1.574GlyMet: 1.574 ± 0.464
3.889GlyAsn: 3.889 ± 0.691
1.019GlyPro: 1.019 ± 0.346
3.982GlyGln: 3.982 ± 0.645
3.148GlyArg: 3.148 ± 0.515
3.611GlySer: 3.611 ± 0.565
3.982GlyThr: 3.982 ± 0.733
4.167GlyVal: 4.167 ± 1.075
1.667GlyTrp: 1.667 ± 0.625
2.5GlyTyr: 2.5 ± 0.555
0.0GlyXaa: 0.0 ± 0.0
His
0.926HisAla: 0.926 ± 0.292
0.093HisCys: 0.093 ± 0.097
1.019HisAsp: 1.019 ± 0.306
1.482HisGlu: 1.482 ± 0.398
0.648HisPhe: 0.648 ± 0.244
0.833HisGly: 0.833 ± 0.297
0.185HisHis: 0.185 ± 0.131
1.111HisIle: 1.111 ± 0.356
0.741HisLys: 0.741 ± 0.253
1.019HisLeu: 1.019 ± 0.391
0.185HisMet: 0.185 ± 0.175
0.648HisAsn: 0.648 ± 0.374
0.833HisPro: 0.833 ± 0.26
0.741HisGln: 0.741 ± 0.254
0.648HisArg: 0.648 ± 0.225
1.204HisSer: 1.204 ± 0.508
0.648HisThr: 0.648 ± 0.299
0.926HisVal: 0.926 ± 0.281
0.37HisTrp: 0.37 ± 0.182
0.556HisTyr: 0.556 ± 0.22
0.0HisXaa: 0.0 ± 0.0
Ile
5.834IleAla: 5.834 ± 0.599
0.556IleCys: 0.556 ± 0.265
5.371IleAsp: 5.371 ± 0.837
6.945IleGlu: 6.945 ± 0.89
2.13IlePhe: 2.13 ± 0.539
4.352IleGly: 4.352 ± 0.738
0.37IleHis: 0.37 ± 0.175
4.723IleIle: 4.723 ± 0.604
6.297IleLys: 6.297 ± 0.818
4.908IleLeu: 4.908 ± 0.684
1.389IleMet: 1.389 ± 0.302
4.445IleAsn: 4.445 ± 0.779
2.222IlePro: 2.222 ± 0.437
3.704IleGln: 3.704 ± 0.743
3.611IleArg: 3.611 ± 0.607
5.093IleSer: 5.093 ± 1.028
3.982IleThr: 3.982 ± 0.694
4.167IleVal: 4.167 ± 0.567
0.648IleTrp: 0.648 ± 0.328
2.963IleTyr: 2.963 ± 0.512
0.0IleXaa: 0.0 ± 0.0
Lys
5.926LysAla: 5.926 ± 0.72
0.37LysCys: 0.37 ± 0.185
5.463LysAsp: 5.463 ± 0.718
6.852LysGlu: 6.852 ± 1.073
2.315LysPhe: 2.315 ± 0.528
4.63LysGly: 4.63 ± 0.587
1.482LysHis: 1.482 ± 0.392
6.945LysIle: 6.945 ± 0.904
7.408LysLys: 7.408 ± 1.094
5.741LysLeu: 5.741 ± 0.782
2.963LysMet: 2.963 ± 0.411
4.352LysAsn: 4.352 ± 0.629
2.963LysPro: 2.963 ± 0.644
3.334LysGln: 3.334 ± 0.614
4.074LysArg: 4.074 ± 0.668
4.908LysSer: 4.908 ± 0.712
4.815LysThr: 4.815 ± 0.698
5.0LysVal: 5.0 ± 0.817
0.833LysTrp: 0.833 ± 0.208
2.593LysTyr: 2.593 ± 0.502
0.0LysXaa: 0.0 ± 0.0
Leu
6.297LeuAla: 6.297 ± 0.8
0.37LeuCys: 0.37 ± 0.205
5.186LeuAsp: 5.186 ± 0.574
6.019LeuGlu: 6.019 ± 0.856
2.593LeuPhe: 2.593 ± 0.438
6.575LeuGly: 6.575 ± 1.317
1.111LeuHis: 1.111 ± 0.311
5.556LeuIle: 5.556 ± 0.549
8.149LeuLys: 8.149 ± 0.862
6.389LeuLeu: 6.389 ± 0.863
1.667LeuMet: 1.667 ± 0.423
3.519LeuAsn: 3.519 ± 0.622
2.778LeuPro: 2.778 ± 0.562
3.056LeuGln: 3.056 ± 0.459
3.334LeuArg: 3.334 ± 0.544
6.945LeuSer: 6.945 ± 0.798
4.26LeuThr: 4.26 ± 0.59
4.445LeuVal: 4.445 ± 0.941
0.278LeuTrp: 0.278 ± 0.163
2.222LeuTyr: 2.222 ± 0.513
0.0LeuXaa: 0.0 ± 0.0
Met
2.593MetAla: 2.593 ± 0.815
0.093MetCys: 0.093 ± 0.082
2.13MetAsp: 2.13 ± 0.478
1.945MetGlu: 1.945 ± 0.446
0.648MetPhe: 0.648 ± 0.212
1.111MetGly: 1.111 ± 0.331
0.278MetHis: 0.278 ± 0.153
1.204MetIle: 1.204 ± 0.373
2.222MetLys: 2.222 ± 0.499
2.037MetLeu: 2.037 ± 0.349
0.463MetMet: 0.463 ± 0.27
1.389MetAsn: 1.389 ± 0.416
0.741MetPro: 0.741 ± 0.234
1.296MetGln: 1.296 ± 0.344
0.926MetArg: 0.926 ± 0.291
1.945MetSer: 1.945 ± 0.403
1.852MetThr: 1.852 ± 0.397
2.408MetVal: 2.408 ± 0.689
0.463MetTrp: 0.463 ± 0.181
0.278MetTyr: 0.278 ± 0.146
0.0MetXaa: 0.0 ± 0.0
Asn
3.334AsnAla: 3.334 ± 0.466
0.37AsnCys: 0.37 ± 0.167
2.5AsnAsp: 2.5 ± 0.476
3.611AsnGlu: 3.611 ± 0.601
2.5AsnPhe: 2.5 ± 0.431
4.167AsnGly: 4.167 ± 0.757
1.019AsnHis: 1.019 ± 0.332
3.982AsnIle: 3.982 ± 0.649
3.889AsnLys: 3.889 ± 0.633
3.982AsnLeu: 3.982 ± 0.564
1.574AsnMet: 1.574 ± 0.311
2.5AsnAsn: 2.5 ± 0.537
1.667AsnPro: 1.667 ± 0.426
2.222AsnGln: 2.222 ± 0.528
2.315AsnArg: 2.315 ± 0.573
4.537AsnSer: 4.537 ± 0.674
2.778AsnThr: 2.778 ± 0.473
2.871AsnVal: 2.871 ± 0.557
1.111AsnTrp: 1.111 ± 0.328
1.296AsnTyr: 1.296 ± 0.313
0.0AsnXaa: 0.0 ± 0.0
Pro
1.482ProAla: 1.482 ± 0.435
0.278ProCys: 0.278 ± 0.149
1.852ProAsp: 1.852 ± 0.43
2.778ProGlu: 2.778 ± 0.641
1.759ProPhe: 1.759 ± 0.451
1.111ProGly: 1.111 ± 0.295
0.0ProHis: 0.0 ± 0.0
1.945ProIle: 1.945 ± 0.458
2.778ProLys: 2.778 ± 0.548
1.482ProLeu: 1.482 ± 0.432
0.556ProMet: 0.556 ± 0.215
1.482ProAsn: 1.482 ± 0.396
0.556ProPro: 0.556 ± 0.221
2.13ProGln: 2.13 ± 0.497
1.111ProArg: 1.111 ± 0.389
0.926ProSer: 0.926 ± 0.307
1.574ProThr: 1.574 ± 0.37
1.759ProVal: 1.759 ± 0.333
0.093ProTrp: 0.093 ± 0.087
1.574ProTyr: 1.574 ± 0.391
0.0ProXaa: 0.0 ± 0.0
Gln
3.982GlnAla: 3.982 ± 0.557
0.093GlnCys: 0.093 ± 0.103
1.482GlnAsp: 1.482 ± 0.426
4.074GlnGlu: 4.074 ± 0.683
1.296GlnPhe: 1.296 ± 0.275
2.871GlnGly: 2.871 ± 0.63
0.556GlnHis: 0.556 ± 0.253
3.334GlnIle: 3.334 ± 0.555
3.148GlnLys: 3.148 ± 0.635
4.074GlnLeu: 4.074 ± 0.629
1.019GlnMet: 1.019 ± 0.266
3.056GlnAsn: 3.056 ± 0.38
0.833GlnPro: 0.833 ± 0.32
2.315GlnGln: 2.315 ± 0.47
1.759GlnArg: 1.759 ± 0.483
3.704GlnSer: 3.704 ± 0.612
1.759GlnThr: 1.759 ± 0.345
2.593GlnVal: 2.593 ± 0.512
0.278GlnTrp: 0.278 ± 0.158
1.204GlnTyr: 1.204 ± 0.297
0.0GlnXaa: 0.0 ± 0.0
Arg
2.037ArgAla: 2.037 ± 0.505
0.278ArgCys: 0.278 ± 0.234
1.759ArgAsp: 1.759 ± 0.437
4.074ArgGlu: 4.074 ± 0.688
2.222ArgPhe: 2.222 ± 0.65
3.519ArgGly: 3.519 ± 0.501
0.741ArgHis: 0.741 ± 0.28
2.871ArgIle: 2.871 ± 0.636
4.63ArgLys: 4.63 ± 0.723
4.167ArgLeu: 4.167 ± 0.578
1.574ArgMet: 1.574 ± 0.459
2.871ArgAsn: 2.871 ± 0.589
1.389ArgPro: 1.389 ± 0.406
1.574ArgGln: 1.574 ± 0.384
2.13ArgArg: 2.13 ± 0.438
2.13ArgSer: 2.13 ± 0.451
2.13ArgThr: 2.13 ± 0.492
2.315ArgVal: 2.315 ± 0.565
0.37ArgTrp: 0.37 ± 0.171
2.5ArgTyr: 2.5 ± 0.715
0.0ArgXaa: 0.0 ± 0.0
Ser
4.63SerAla: 4.63 ± 0.878
0.741SerCys: 0.741 ± 0.315
4.908SerAsp: 4.908 ± 0.538
3.889SerGlu: 3.889 ± 0.498
3.334SerPhe: 3.334 ± 0.588
4.537SerGly: 4.537 ± 0.665
1.204SerHis: 1.204 ± 0.415
4.445SerIle: 4.445 ± 0.642
4.908SerLys: 4.908 ± 0.739
5.186SerLeu: 5.186 ± 0.958
1.667SerMet: 1.667 ± 0.409
4.074SerAsn: 4.074 ± 0.8
2.13SerPro: 2.13 ± 0.407
2.778SerGln: 2.778 ± 0.615
2.778SerArg: 2.778 ± 0.47
3.426SerSer: 3.426 ± 0.574
3.241SerThr: 3.241 ± 0.524
4.63SerVal: 4.63 ± 0.752
0.926SerTrp: 0.926 ± 0.339
1.759SerTyr: 1.759 ± 0.354
0.0SerXaa: 0.0 ± 0.0
Thr
3.889ThrAla: 3.889 ± 0.688
0.185ThrCys: 0.185 ± 0.138
2.685ThrAsp: 2.685 ± 0.478
3.241ThrGlu: 3.241 ± 0.609
2.778ThrPhe: 2.778 ± 0.403
4.723ThrGly: 4.723 ± 0.95
1.019ThrHis: 1.019 ± 0.294
4.537ThrIle: 4.537 ± 0.724
4.167ThrLys: 4.167 ± 0.582
3.982ThrLeu: 3.982 ± 0.655
1.296ThrMet: 1.296 ± 0.383
1.759ThrAsn: 1.759 ± 0.45
1.667ThrPro: 1.667 ± 0.409
2.13ThrGln: 2.13 ± 0.452
2.037ThrArg: 2.037 ± 0.58
2.685ThrSer: 2.685 ± 0.717
3.334ThrThr: 3.334 ± 0.597
3.611ThrVal: 3.611 ± 0.674
0.463ThrTrp: 0.463 ± 0.233
1.852ThrTyr: 1.852 ± 0.419
0.0ThrXaa: 0.0 ± 0.0
Val
4.352ValAla: 4.352 ± 1.042
0.37ValCys: 0.37 ± 0.181
3.148ValAsp: 3.148 ± 0.519
5.278ValGlu: 5.278 ± 0.842
2.685ValPhe: 2.685 ± 0.532
4.63ValGly: 4.63 ± 1.183
0.926ValHis: 0.926 ± 0.338
4.63ValIle: 4.63 ± 0.613
5.186ValLys: 5.186 ± 0.739
4.167ValLeu: 4.167 ± 0.702
1.296ValMet: 1.296 ± 0.284
2.871ValAsn: 2.871 ± 0.502
1.389ValPro: 1.389 ± 0.415
2.037ValGln: 2.037 ± 0.405
2.593ValArg: 2.593 ± 0.493
5.463ValSer: 5.463 ± 0.731
3.982ValThr: 3.982 ± 0.622
3.611ValVal: 3.611 ± 0.728
0.926ValTrp: 0.926 ± 0.321
3.148ValTyr: 3.148 ± 0.502
0.0ValXaa: 0.0 ± 0.0
Trp
0.741TrpAla: 0.741 ± 0.265
0.185TrpCys: 0.185 ± 0.129
0.926TrpAsp: 0.926 ± 0.276
0.648TrpGlu: 0.648 ± 0.205
0.278TrpPhe: 0.278 ± 0.155
0.463TrpGly: 0.463 ± 0.207
0.093TrpHis: 0.093 ± 0.102
0.833TrpIle: 0.833 ± 0.281
0.926TrpLys: 0.926 ± 0.306
1.204TrpLeu: 1.204 ± 0.303
0.093TrpMet: 0.093 ± 0.105
0.648TrpAsn: 0.648 ± 0.272
0.093TrpPro: 0.093 ± 0.084
0.648TrpGln: 0.648 ± 0.237
0.556TrpArg: 0.556 ± 0.302
0.37TrpSer: 0.37 ± 0.144
0.833TrpThr: 0.833 ± 0.343
0.648TrpVal: 0.648 ± 0.225
0.093TrpTrp: 0.093 ± 0.102
0.926TrpTyr: 0.926 ± 0.63
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.759TyrAla: 1.759 ± 0.421
0.463TyrCys: 0.463 ± 0.212
2.037TyrAsp: 2.037 ± 0.536
2.315TyrGlu: 2.315 ± 0.493
1.759TyrPhe: 1.759 ± 0.384
1.759TyrGly: 1.759 ± 0.394
0.648TyrHis: 0.648 ± 0.245
2.5TyrIle: 2.5 ± 0.483
2.963TyrLys: 2.963 ± 0.581
4.445TyrLeu: 4.445 ± 0.808
0.926TyrMet: 0.926 ± 0.278
1.204TyrAsn: 1.204 ± 0.442
1.019TyrPro: 1.019 ± 0.329
1.482TyrGln: 1.482 ± 0.377
2.408TyrArg: 2.408 ± 0.61
3.334TyrSer: 3.334 ± 0.52
1.482TyrThr: 1.482 ± 0.345
2.408TyrVal: 2.408 ± 0.52
0.556TyrTrp: 0.556 ± 0.252
1.204TyrTyr: 1.204 ± 0.504
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (10800 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski