Amino acid dipeptide frequency for Streptococcus phage phi891591

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the 400 standard dipeptides occurred in proteins uniformly at random, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
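A minimal sketch of how such frequencies could be computed, assuming overlapping residue pairs are counted within each protein and non-standard letters are mapped to `X` (Xaa). The function names, the one-letter alphabet and the toy sequences are illustrative assumptions, not the database's actual pipeline.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown residue is treated as Xaa.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Compare a frequency with the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "as expected"


# Hypothetical toy sequences, only to show the call.
toy = ["MKKLAA", "MAKK"]
freqs = dipeptide_permille(toy)
print(freqs["KK"], classify(freqs["KK"]))
```

With real proteomes the same comparison against 2.5‰ would separate overrepresented from underrepresented dipeptides; the exact threshold used for the red and blue marking on the page is not stated, so the strict comparison here is an assumption.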

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
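As a worked example of the unit conversion, the sketch below converts the AlaAla value from the table (3.635‰) into a fraction and a percentage and compares it with the uniform 1/400 expectation. The helper name is hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala_permille = 3.635          # AlaAla entry from the table
fraction = permille_to_fraction(ala_ala_permille)
percent = fraction * 100          # about 0.36%
uniform_percent = 100 / 400       # 0.25% if all 400 dipeptides were equally likely
ratio = percent / uniform_percent # how many times more frequent than uniform

print(f"{fraction:.6f} {percent:.4f}% ratio={ratio:.3f}")
```

Under these assumptions AlaAla comes out roughly 1.45 times more frequent than the uniform expectation.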

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 3.635 ± 0.695 | 0.373 ± 0.164 | 5.126 ± 0.824 | 6.431 ± 0.923 | 1.678 ± 0.369 | 6.151 ± 1.323 | 0.932 ± 0.411 | 6.431 ± 0.89 | 6.803 ± 0.91 | 6.058 ± 0.714 | 1.864 ± 0.582 | 4.38 ± 0.608 | 1.305 ± 0.335 | 2.982 ± 0.531 | 2.703 ± 0.54 | 4.101 ± 0.619 | 5.871 ± 0.797 | 4.473 ± 0.729 | 0.932 ± 0.329 | 2.423 ± 0.439 | 0.0 ± 0.0 |
| Cys | 0.28 ± 0.192 | 0.373 ± 0.213 | 0.466 ± 0.182 | 0.28 ± 0.172 | 0.28 ± 0.169 | 0.746 ± 0.282 | 0.28 ± 0.132 | 0.186 ± 0.186 | 0.373 ± 0.207 | 0.373 ± 0.199 | 0.186 ± 0.137 | 0.373 ± 0.215 | 0.28 ± 0.131 | 0.373 ± 0.192 | 0.186 ± 0.129 | 0.746 ± 0.265 | 0.093 ± 0.08 | 0.186 ± 0.142 | 0.0 ± 0.0 | 0.466 ± 0.224 | 0.0 ± 0.0 |
| Asp | 3.448 ± 0.636 | 0.373 ± 0.252 | 3.075 ± 0.616 | 3.821 ± 0.798 | 3.262 ± 0.486 | 5.592 ± 0.694 | 0.559 ± 0.218 | 4.287 ± 0.683 | 5.033 ± 0.701 | 5.592 ± 0.808 | 2.144 ± 0.484 | 3.262 ± 0.475 | 1.491 ± 0.556 | 1.491 ± 0.4 | 2.796 ± 0.447 | 3.821 ± 0.513 | 3.635 ± 0.668 | 5.871 ± 0.681 | 0.839 ± 0.285 | 1.678 ± 0.425 | 0.0 ± 0.0 |
| Glu | 6.337 ± 0.684 | 0.373 ± 0.16 | 2.889 ± 0.652 | 4.007 ± 0.96 | 2.796 ± 0.476 | 2.889 ± 0.451 | 1.212 ± 0.485 | 6.151 ± 0.765 | 5.219 ± 1.149 | 6.058 ± 1.095 | 2.05 ± 0.624 | 4.753 ± 0.642 | 2.144 ± 0.57 | 3.075 ± 0.599 | 3.355 ± 0.506 | 3.821 ± 0.501 | 3.262 ± 0.532 | 5.405 ± 0.66 | 0.932 ± 0.285 | 3.169 ± 0.806 | 0.0 ± 0.0 |
| Phe | 2.144 ± 0.315 | 0.28 ± 0.191 | 3.541 ± 0.629 | 4.38 ± 0.623 | 1.118 ± 0.258 | 2.889 ± 0.672 | 0.746 ± 0.233 | 2.33 ± 0.392 | 2.516 ± 0.378 | 2.05 ± 0.545 | 0.932 ± 0.293 | 2.05 ± 0.625 | 0.932 ± 0.344 | 1.212 ± 0.356 | 0.932 ± 0.285 | 1.771 ± 0.454 | 2.144 ± 0.445 | 2.423 ± 0.424 | 0.28 ± 0.126 | 1.398 ± 0.353 | 0.0 ± 0.0 |
| Gly | 4.939 ± 1.017 | 0.186 ± 0.174 | 4.007 ± 0.565 | 3.355 ± 0.514 | 2.423 ± 0.633 | 5.405 ± 0.777 | 1.118 ± 0.272 | 5.033 ± 0.825 | 4.846 ± 0.644 | 5.219 ± 0.581 | 1.957 ± 0.471 | 3.355 ± 0.716 | 1.118 ± 0.337 | 3.914 ± 0.461 | 3.448 ± 0.572 | 4.473 ± 0.851 | 4.007 ± 0.601 | 4.101 ± 0.942 | 1.212 ± 0.327 | 2.982 ± 0.631 | 0.0 ± 0.0 |
| His | 0.746 ± 0.262 | 0.093 ± 0.083 | 0.746 ± 0.246 | 1.398 ± 0.353 | 0.839 ± 0.521 | 0.839 ± 0.278 | 0.093 ± 0.086 | 0.839 ± 0.278 | 0.559 ± 0.263 | 1.584 ± 0.355 | 0.28 ± 0.175 | 0.466 ± 0.198 | 0.932 ± 0.266 | 0.932 ± 0.24 | 0.839 ± 0.311 | 1.212 ± 0.37 | 0.559 ± 0.227 | 0.932 ± 0.326 | 0.093 ± 0.086 | 0.466 ± 0.178 | 0.0 ± 0.0 |
| Ile | 6.151 ± 0.606 | 0.559 ± 0.276 | 5.405 ± 0.575 | 6.337 ± 1.068 | 1.864 ± 0.462 | 4.287 ± 0.683 | 0.746 ± 0.297 | 4.939 ± 0.809 | 6.337 ± 0.667 | 3.914 ± 0.568 | 1.678 ± 0.564 | 3.541 ± 0.721 | 2.237 ± 0.474 | 2.33 ± 0.583 | 2.796 ± 0.443 | 4.567 ± 0.708 | 4.846 ± 0.643 | 3.728 ± 0.643 | 0.746 ± 0.236 | 2.237 ± 0.451 | 0.0 ± 0.0 |
| Lys | 5.499 ± 0.803 | 0.373 ± 0.227 | 4.38 ± 0.582 | 4.101 ± 0.811 | 1.957 ± 0.425 | 5.126 ± 0.576 | 0.746 ± 0.24 | 5.405 ± 0.76 | 6.337 ± 1.017 | 6.151 ± 0.761 | 1.584 ± 0.354 | 4.007 ± 0.657 | 2.889 ± 0.653 | 4.66 ± 0.86 | 4.846 ± 0.737 | 4.846 ± 0.671 | 5.126 ± 1.007 | 5.499 ± 0.639 | 0.932 ± 0.248 | 2.516 ± 0.471 | 0.0 ± 0.0 |
| Leu | 7.922 ± 0.893 | 0.466 ± 0.182 | 4.753 ± 0.658 | 5.965 ± 1.096 | 2.982 ± 0.613 | 5.033 ± 1.109 | 1.398 ± 0.385 | 4.101 ± 0.563 | 6.524 ± 0.79 | 6.337 ± 0.914 | 1.584 ± 0.493 | 4.567 ± 0.865 | 2.423 ± 0.663 | 2.796 ± 0.528 | 4.007 ± 0.702 | 6.524 ± 0.971 | 4.939 ± 0.82 | 4.473 ± 0.538 | 0.466 ± 0.22 | 1.491 ± 0.359 | 0.0 ± 0.0 |
| Met | 2.423 ± 0.627 | 0.373 ± 0.192 | 1.398 ± 0.334 | 1.398 ± 0.327 | 0.652 ± 0.262 | 1.491 ± 0.644 | 0.373 ± 0.151 | 1.398 ± 0.504 | 1.305 ± 0.365 | 1.771 ± 0.468 | 0.28 ± 0.175 | 1.212 ± 0.317 | 1.118 ± 0.32 | 1.212 ± 0.328 | 0.652 ± 0.294 | 1.771 ± 0.455 | 2.423 ± 0.453 | 1.398 ± 0.373 | 0.093 ± 0.08 | 1.025 ± 0.453 | 0.0 ± 0.0 |
| Asn | 2.982 ± 0.582 | 0.652 ± 0.27 | 2.61 ± 0.477 | 3.262 ± 0.527 | 2.703 ± 0.555 | 4.473 ± 0.866 | 1.025 ± 0.316 | 3.169 ± 0.642 | 3.355 ± 0.532 | 4.846 ± 0.858 | 0.839 ± 0.225 | 2.423 ± 0.484 | 2.516 ± 0.614 | 3.169 ± 0.528 | 2.516 ± 0.492 | 3.169 ± 0.623 | 3.728 ± 0.632 | 3.355 ± 0.487 | 0.746 ± 0.25 | 2.796 ± 0.5 | 0.0 ± 0.0 |
| Pro | 1.678 ± 0.403 | 0.186 ± 0.107 | 2.516 ± 0.635 | 1.957 ± 0.388 | 1.957 ± 0.474 | 1.305 ± 0.33 | 0.373 ± 0.199 | 2.703 ± 0.737 | 2.05 ± 0.483 | 2.61 ± 0.49 | 1.025 ± 0.228 | 1.584 ± 0.437 | 0.559 ± 0.215 | 1.212 ± 0.371 | 1.305 ± 0.414 | 2.144 ± 0.514 | 2.05 ± 0.416 | 1.678 ± 0.323 | 0.466 ± 0.179 | 0.652 ± 0.236 | 0.0 ± 0.0 |
| Gln | 4.567 ± 0.618 | 0.373 ± 0.213 | 1.864 ± 0.324 | 3.262 ± 0.586 | 0.839 ± 0.251 | 1.305 ± 0.386 | 0.28 ± 0.142 | 2.423 ± 0.507 | 3.541 ± 0.797 | 3.541 ± 0.609 | 0.746 ± 0.27 | 3.169 ± 0.513 | 1.398 ± 0.348 | 2.05 ± 0.678 | 1.957 ± 0.484 | 4.194 ± 0.791 | 2.889 ± 0.751 | 4.753 ± 0.671 | 0.466 ± 0.208 | 1.305 ± 0.389 | 0.0 ± 0.0 |
| Arg | 2.61 ± 0.689 | 0.373 ± 0.183 | 2.889 ± 0.534 | 3.262 ± 0.758 | 1.584 ± 0.397 | 1.864 ± 0.356 | 0.839 ± 0.293 | 3.448 ± 0.527 | 3.448 ± 0.673 | 4.473 ± 0.574 | 0.746 ± 0.274 | 2.982 ± 0.544 | 1.491 ± 0.456 | 1.957 ± 0.433 | 2.144 ± 0.444 | 1.398 ± 0.458 | 3.355 ± 0.526 | 2.61 ± 0.532 | 0.466 ± 0.183 | 2.33 ± 0.488 | 0.0 ± 0.0 |
| Ser | 5.312 ± 1.152 | 0.373 ± 0.17 | 4.38 ± 0.652 | 4.753 ± 0.904 | 2.516 ± 0.61 | 5.126 ± 0.822 | 1.025 ± 0.363 | 3.914 ± 0.594 | 4.939 ± 0.794 | 5.685 ± 0.871 | 1.957 ± 0.381 | 2.982 ± 0.524 | 1.491 ± 0.316 | 3.075 ± 0.602 | 2.33 ± 0.548 | 3.821 ± 0.753 | 3.914 ± 0.801 | 4.567 ± 0.665 | 0.466 ± 0.183 | 2.982 ± 0.548 | 0.0 ± 0.0 |
| Thr | 5.499 ± 1.471 | 0.093 ± 0.101 | 4.101 ± 0.564 | 4.567 ± 0.795 | 2.05 ± 0.39 | 4.753 ± 0.682 | 0.652 ± 0.253 | 4.567 ± 0.687 | 4.939 ± 0.893 | 5.312 ± 0.595 | 0.839 ± 0.241 | 2.516 ± 0.555 | 1.398 ± 0.32 | 3.355 ± 1.033 | 2.05 ± 0.432 | 4.101 ± 0.464 | 3.914 ± 1.166 | 6.524 ± 0.956 | 0.466 ± 0.251 | 2.796 ± 0.496 | 0.0 ± 0.0 |
| Val | 5.126 ± 0.664 | 0.373 ± 0.185 | 4.753 ± 0.659 | 4.567 ± 0.711 | 2.61 ± 0.447 | 4.287 ± 0.631 | 1.305 ± 0.306 | 4.101 ± 0.611 | 5.685 ± 0.688 | 3.914 ± 0.765 | 1.584 ± 0.412 | 4.101 ± 0.61 | 2.61 ± 0.408 | 2.703 ± 0.733 | 2.982 ± 0.432 | 6.803 ± 0.755 | 4.753 ± 0.696 | 4.101 ± 0.67 | 0.559 ± 0.183 | 2.33 ± 0.466 | 0.0 ± 0.0 |
| Trp | 0.746 ± 0.314 | 0.093 ± 0.08 | 0.28 ± 0.154 | 0.839 ± 0.253 | 0.186 ± 0.141 | 0.839 ± 0.346 | 0.0 ± 0.0 | 0.932 ± 0.315 | 0.839 ± 0.224 | 0.746 ± 0.277 | 0.373 ± 0.189 | 1.025 ± 0.323 | 0.093 ± 0.086 | 0.559 ± 0.238 | 0.559 ± 0.28 | 0.559 ± 0.213 | 0.839 ± 0.239 | 1.025 ± 0.325 | 0.0 ± 0.0 | 0.559 ± 0.196 | 0.0 ± 0.0 |
| Tyr | 2.61 ± 0.487 | 0.186 ± 0.114 | 2.889 ± 0.55 | 1.957 ± 0.394 | 1.771 ± 0.456 | 2.703 ± 0.449 | 0.839 ± 0.273 | 2.889 ± 0.502 | 2.33 ± 0.424 | 2.516 ± 0.51 | 1.025 ± 0.344 | 1.584 ± 0.348 | 1.584 ± 0.493 | 1.864 ± 0.463 | 1.864 ± 0.44 | 1.771 ± 0.319 | 2.144 ± 0.369 | 2.144 ± 0.461 | 0.932 ± 0.275 | 0.932 ± 0.292 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 57 proteins (10731 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
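One way such an error could be obtained is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of the replicates is reported. Taking the standard deviation of the replicates is an assumption, as are the function names and the toy sequences; the page does not state the exact procedure.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    replicates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(sample, pair))
    return statistics.stdev(replicates)


# Hypothetical toy proteome, only to show the call.
toy = ["MKKLAA", "MAKK", "MLLKA", "MKAAL"]
point = pair_permille(toy, "KK")
err = bootstrap_error(toy, "KK")
print(f"KK: {point:.3f} ± {err:.3f}")
```

A dipeptide that never occurs gives zero in every resample, which is consistent with the `0.0 ± 0.0` entries in the table.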

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski