Amino acid dipepetide frequency for Streptococcus virus Sfi19

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.5AlaAla: 3.5 ± 1.172
0.179AlaCys: 0.179 ± 0.117
4.935AlaAsp: 4.935 ± 0.979
4.397AlaGlu: 4.397 ± 0.807
2.243AlaPhe: 2.243 ± 0.517
3.589AlaGly: 3.589 ± 0.667
0.808AlaHis: 0.808 ± 0.307
5.205AlaIle: 5.205 ± 0.832
5.833AlaLys: 5.833 ± 1.145
6.281AlaLeu: 6.281 ± 0.841
1.705AlaMet: 1.705 ± 0.413
4.397AlaAsn: 4.397 ± 0.758
1.795AlaPro: 1.795 ± 0.394
2.513AlaGln: 2.513 ± 0.666
3.23AlaArg: 3.23 ± 0.746
3.859AlaSer: 3.859 ± 0.476
4.487AlaThr: 4.487 ± 0.823
3.141AlaVal: 3.141 ± 0.778
1.884AlaTrp: 1.884 ± 0.56
3.141AlaTyr: 3.141 ± 0.605
0.0AlaXaa: 0.0 ± 0.0
Cys
0.09CysAla: 0.09 ± 0.077
0.09CysCys: 0.09 ± 0.087
0.808CysAsp: 0.808 ± 0.315
0.449CysGlu: 0.449 ± 0.231
0.269CysPhe: 0.269 ± 0.215
0.538CysGly: 0.538 ± 0.348
0.269CysHis: 0.269 ± 0.164
0.179CysIle: 0.179 ± 0.121
0.359CysLys: 0.359 ± 0.192
0.628CysLeu: 0.628 ± 0.306
0.179CysMet: 0.179 ± 0.132
0.449CysAsn: 0.449 ± 0.205
0.179CysPro: 0.179 ± 0.132
0.269CysGln: 0.269 ± 0.124
0.269CysArg: 0.269 ± 0.187
0.449CysSer: 0.449 ± 0.221
0.359CysThr: 0.359 ± 0.185
0.359CysVal: 0.359 ± 0.145
0.179CysTrp: 0.179 ± 0.141
0.179CysTyr: 0.179 ± 0.139
0.0CysXaa: 0.0 ± 0.0
Asp
3.948AspAla: 3.948 ± 0.784
0.449AspCys: 0.449 ± 0.195
4.307AspAsp: 4.307 ± 0.827
3.769AspGlu: 3.769 ± 0.497
3.679AspPhe: 3.679 ± 0.629
7.179AspGly: 7.179 ± 1.359
0.628AspHis: 0.628 ± 0.278
5.384AspIle: 5.384 ± 0.85
4.128AspLys: 4.128 ± 0.567
3.679AspLeu: 3.679 ± 0.821
2.333AspMet: 2.333 ± 0.426
4.935AspAsn: 4.935 ± 0.716
1.884AspPro: 1.884 ± 0.375
1.615AspGln: 1.615 ± 0.271
2.692AspArg: 2.692 ± 0.535
3.859AspSer: 3.859 ± 0.572
3.5AspThr: 3.5 ± 0.609
3.41AspVal: 3.41 ± 0.65
1.256AspTrp: 1.256 ± 0.313
2.602AspTyr: 2.602 ± 0.647
0.0AspXaa: 0.0 ± 0.0
Glu
4.576GluAla: 4.576 ± 0.834
0.269GluCys: 0.269 ± 0.128
2.692GluAsp: 2.692 ± 0.724
4.666GluGlu: 4.666 ± 0.815
2.602GluPhe: 2.602 ± 0.641
3.23GluGly: 3.23 ± 0.472
1.346GluHis: 1.346 ± 0.392
5.474GluIle: 5.474 ± 0.759
4.487GluLys: 4.487 ± 0.932
5.115GluLeu: 5.115 ± 0.707
1.795GluMet: 1.795 ± 0.474
3.859GluAsn: 3.859 ± 0.743
1.884GluPro: 1.884 ± 0.625
2.423GluGln: 2.423 ± 0.382
2.602GluArg: 2.602 ± 0.516
3.051GluSer: 3.051 ± 0.49
3.051GluThr: 3.051 ± 0.543
5.474GluVal: 5.474 ± 0.833
1.436GluTrp: 1.436 ± 0.345
3.679GluTyr: 3.679 ± 0.634
0.0GluXaa: 0.0 ± 0.0
Phe
2.692PheAla: 2.692 ± 0.539
0.179PheCys: 0.179 ± 0.14
3.32PheAsp: 3.32 ± 0.589
2.243PheGlu: 2.243 ± 0.474
1.615PhePhe: 1.615 ± 0.272
3.23PheGly: 3.23 ± 0.626
0.538PheHis: 0.538 ± 0.208
2.423PheIle: 2.423 ± 0.595
4.756PheLys: 4.756 ± 0.641
3.051PheLeu: 3.051 ± 0.566
0.718PheMet: 0.718 ± 0.274
3.41PheAsn: 3.41 ± 0.682
0.359PhePro: 0.359 ± 0.167
1.436PheGln: 1.436 ± 0.371
1.884PheArg: 1.884 ± 0.376
3.051PheSer: 3.051 ± 0.806
2.513PheThr: 2.513 ± 0.509
2.513PheVal: 2.513 ± 0.417
0.538PheTrp: 0.538 ± 0.213
1.615PheTyr: 1.615 ± 0.373
0.0PheXaa: 0.0 ± 0.0
Gly
3.859GlyAla: 3.859 ± 0.723
0.269GlyCys: 0.269 ± 0.161
4.756GlyAsp: 4.756 ± 0.71
3.23GlyGlu: 3.23 ± 0.51
2.872GlyPhe: 2.872 ± 0.499
4.397GlyGly: 4.397 ± 0.658
0.808GlyHis: 0.808 ± 0.252
5.115GlyIle: 5.115 ± 0.699
6.64GlyLys: 6.64 ± 0.959
6.281GlyLeu: 6.281 ± 0.768
1.346GlyMet: 1.346 ± 0.378
5.025GlyAsn: 5.025 ± 0.943
1.346GlyPro: 1.346 ± 0.613
2.961GlyGln: 2.961 ± 0.605
4.038GlyArg: 4.038 ± 0.587
6.102GlySer: 6.102 ± 1.583
4.397GlyThr: 4.397 ± 0.61
3.5GlyVal: 3.5 ± 0.664
1.346GlyTrp: 1.346 ± 0.38
2.961GlyTyr: 2.961 ± 0.503
0.0GlyXaa: 0.0 ± 0.0
His
0.449HisAla: 0.449 ± 0.193
0.0HisCys: 0.0 ± 0.0
0.987HisAsp: 0.987 ± 0.22
0.718HisGlu: 0.718 ± 0.229
0.538HisPhe: 0.538 ± 0.177
1.525HisGly: 1.525 ± 0.763
0.538HisHis: 0.538 ± 0.167
0.808HisIle: 0.808 ± 0.285
1.077HisLys: 1.077 ± 0.294
1.256HisLeu: 1.256 ± 0.398
0.628HisMet: 0.628 ± 0.276
0.449HisAsn: 0.449 ± 0.241
0.628HisPro: 0.628 ± 0.17
0.269HisGln: 0.269 ± 0.141
0.628HisArg: 0.628 ± 0.228
0.987HisSer: 0.987 ± 0.262
0.897HisThr: 0.897 ± 0.294
1.525HisVal: 1.525 ± 0.278
0.0HisTrp: 0.0 ± 0.0
1.077HisTyr: 1.077 ± 0.306
0.0HisXaa: 0.0 ± 0.0
Ile
5.025IleAla: 5.025 ± 0.869
0.269IleCys: 0.269 ± 0.162
5.294IleAsp: 5.294 ± 0.62
4.038IleGlu: 4.038 ± 0.753
1.705IlePhe: 1.705 ± 0.416
4.218IleGly: 4.218 ± 0.547
0.628IleHis: 0.628 ± 0.186
3.32IleIle: 3.32 ± 0.557
6.102IleLys: 6.102 ± 0.795
3.41IleLeu: 3.41 ± 0.681
1.795IleMet: 1.795 ± 0.434
4.935IleAsn: 4.935 ± 0.573
3.589IlePro: 3.589 ± 0.672
2.961IleGln: 2.961 ± 0.436
2.513IleArg: 2.513 ± 0.542
4.756IleSer: 4.756 ± 0.491
3.23IleThr: 3.23 ± 0.487
3.859IleVal: 3.859 ± 0.591
0.718IleTrp: 0.718 ± 0.18
2.782IleTyr: 2.782 ± 0.532
0.0IleXaa: 0.0 ± 0.0
Lys
6.102LysAla: 6.102 ± 0.764
0.359LysCys: 0.359 ± 0.143
4.935LysAsp: 4.935 ± 0.682
6.91LysGlu: 6.91 ± 1.033
3.589LysPhe: 3.589 ± 0.789
5.653LysGly: 5.653 ± 0.839
1.256LysHis: 1.256 ± 0.456
5.025LysIle: 5.025 ± 0.775
6.999LysLys: 6.999 ± 1.216
5.294LysLeu: 5.294 ± 0.678
1.795LysMet: 1.795 ± 0.548
5.653LysAsn: 5.653 ± 1.034
3.32LysPro: 3.32 ± 0.465
3.769LysGln: 3.769 ± 0.577
3.051LysArg: 3.051 ± 0.51
3.41LysSer: 3.41 ± 0.57
4.666LysThr: 4.666 ± 0.586
5.025LysVal: 5.025 ± 0.882
1.256LysTrp: 1.256 ± 0.261
3.051LysTyr: 3.051 ± 0.565
0.0LysXaa: 0.0 ± 0.0
Leu
5.474LeuAla: 5.474 ± 0.557
0.538LeuCys: 0.538 ± 0.258
5.474LeuAsp: 5.474 ± 0.59
6.102LeuGlu: 6.102 ± 0.997
3.5LeuPhe: 3.5 ± 0.48
5.922LeuGly: 5.922 ± 0.876
0.987LeuHis: 0.987 ± 0.305
4.218LeuIle: 4.218 ± 0.387
6.192LeuLys: 6.192 ± 0.586
4.397LeuLeu: 4.397 ± 0.635
2.154LeuMet: 2.154 ± 0.508
4.218LeuAsn: 4.218 ± 0.57
2.064LeuPro: 2.064 ± 0.432
2.872LeuGln: 2.872 ± 0.448
3.23LeuArg: 3.23 ± 0.81
4.756LeuSer: 4.756 ± 0.524
6.012LeuThr: 6.012 ± 0.786
4.487LeuVal: 4.487 ± 0.513
0.718LeuTrp: 0.718 ± 0.293
1.974LeuTyr: 1.974 ± 0.479
0.0LeuXaa: 0.0 ± 0.0
Met
2.243MetAla: 2.243 ± 0.551
0.179MetCys: 0.179 ± 0.121
0.808MetAsp: 0.808 ± 0.278
1.795MetGlu: 1.795 ± 0.546
0.987MetPhe: 0.987 ± 0.201
0.987MetGly: 0.987 ± 0.263
0.628MetHis: 0.628 ± 0.257
1.615MetIle: 1.615 ± 0.359
2.513MetLys: 2.513 ± 0.598
1.525MetLeu: 1.525 ± 0.303
0.449MetMet: 0.449 ± 0.243
0.987MetAsn: 0.987 ± 0.275
1.346MetPro: 1.346 ± 0.308
0.808MetGln: 0.808 ± 0.221
0.987MetArg: 0.987 ± 0.254
1.884MetSer: 1.884 ± 0.369
1.705MetThr: 1.705 ± 0.383
2.782MetVal: 2.782 ± 1.324
0.09MetTrp: 0.09 ± 0.072
0.987MetTyr: 0.987 ± 0.344
0.0MetXaa: 0.0 ± 0.0
Asn
6.102AsnAla: 6.102 ± 1.271
0.449AsnCys: 0.449 ± 0.217
3.679AsnAsp: 3.679 ± 0.609
3.859AsnGlu: 3.859 ± 0.547
2.602AsnPhe: 2.602 ± 0.523
6.461AsnGly: 6.461 ± 1.23
0.987AsnHis: 0.987 ± 0.263
3.141AsnIle: 3.141 ± 0.498
4.038AsnLys: 4.038 ± 0.503
5.205AsnLeu: 5.205 ± 0.486
0.987AsnMet: 0.987 ± 0.354
3.859AsnAsn: 3.859 ± 0.732
2.513AsnPro: 2.513 ± 0.515
2.513AsnGln: 2.513 ± 0.455
2.154AsnArg: 2.154 ± 0.44
3.589AsnSer: 3.589 ± 0.57
3.32AsnThr: 3.32 ± 0.494
3.23AsnVal: 3.23 ± 0.438
1.795AsnTrp: 1.795 ± 0.377
2.602AsnTyr: 2.602 ± 0.599
0.0AsnXaa: 0.0 ± 0.0
Pro
1.705ProAla: 1.705 ± 0.326
0.09ProCys: 0.09 ± 0.104
1.525ProAsp: 1.525 ± 0.338
2.333ProGlu: 2.333 ± 0.444
1.346ProPhe: 1.346 ± 0.318
1.525ProGly: 1.525 ± 0.43
0.449ProHis: 0.449 ± 0.158
1.525ProIle: 1.525 ± 0.438
3.051ProLys: 3.051 ± 0.519
2.333ProLeu: 2.333 ± 0.423
0.449ProMet: 0.449 ± 0.234
2.423ProAsn: 2.423 ± 0.393
0.628ProPro: 0.628 ± 0.317
1.705ProGln: 1.705 ± 0.441
0.897ProArg: 0.897 ± 0.324
2.243ProSer: 2.243 ± 0.467
1.974ProThr: 1.974 ± 0.509
1.974ProVal: 1.974 ± 0.505
0.538ProTrp: 0.538 ± 0.187
1.077ProTyr: 1.077 ± 0.483
0.0ProXaa: 0.0 ± 0.0
Gln
3.679GlnAla: 3.679 ± 0.425
0.359GlnCys: 0.359 ± 0.179
2.333GlnAsp: 2.333 ± 0.428
2.782GlnGlu: 2.782 ± 0.416
1.436GlnPhe: 1.436 ± 0.33
4.038GlnGly: 4.038 ± 1.004
0.628GlnHis: 0.628 ± 0.192
2.243GlnIle: 2.243 ± 0.538
2.872GlnLys: 2.872 ± 0.66
3.051GlnLeu: 3.051 ± 0.534
1.436GlnMet: 1.436 ± 0.388
2.423GlnAsn: 2.423 ± 0.365
0.09GlnPro: 0.09 ± 0.107
2.243GlnGln: 2.243 ± 0.442
1.346GlnArg: 1.346 ± 0.319
2.423GlnSer: 2.423 ± 0.561
2.602GlnThr: 2.602 ± 0.391
2.333GlnVal: 2.333 ± 0.599
0.808GlnTrp: 0.808 ± 0.339
2.154GlnTyr: 2.154 ± 0.422
0.0GlnXaa: 0.0 ± 0.0
Arg
1.705ArgAla: 1.705 ± 0.404
0.269ArgCys: 0.269 ± 0.198
2.872ArgAsp: 2.872 ± 0.46
2.513ArgGlu: 2.513 ± 0.52
1.795ArgPhe: 1.795 ± 0.398
2.872ArgGly: 2.872 ± 0.505
0.628ArgHis: 0.628 ± 0.228
2.961ArgIle: 2.961 ± 0.517
3.141ArgLys: 3.141 ± 0.541
3.41ArgLeu: 3.41 ± 0.662
0.987ArgMet: 0.987 ± 0.254
2.692ArgAsn: 2.692 ± 0.443
1.077ArgPro: 1.077 ± 0.325
1.795ArgGln: 1.795 ± 0.442
1.705ArgArg: 1.705 ± 0.417
1.795ArgSer: 1.795 ± 0.521
2.513ArgThr: 2.513 ± 0.584
3.141ArgVal: 3.141 ± 0.784
0.987ArgTrp: 0.987 ± 0.291
2.243ArgTyr: 2.243 ± 0.495
0.0ArgXaa: 0.0 ± 0.0
Ser
2.872SerAla: 2.872 ± 0.684
0.808SerCys: 0.808 ± 0.286
4.307SerAsp: 4.307 ± 0.593
2.782SerGlu: 2.782 ± 0.499
3.051SerPhe: 3.051 ± 0.616
4.307SerGly: 4.307 ± 0.368
0.718SerHis: 0.718 ± 0.227
4.218SerIle: 4.218 ± 0.602
4.487SerLys: 4.487 ± 0.731
4.307SerLeu: 4.307 ± 0.526
2.872SerMet: 2.872 ± 1.153
4.397SerAsn: 4.397 ± 0.604
1.974SerPro: 1.974 ± 0.372
2.961SerGln: 2.961 ± 0.451
2.423SerArg: 2.423 ± 0.422
3.41SerSer: 3.41 ± 0.528
3.948SerThr: 3.948 ± 0.553
5.474SerVal: 5.474 ± 0.893
0.808SerTrp: 0.808 ± 0.321
2.064SerTyr: 2.064 ± 0.548
0.0SerXaa: 0.0 ± 0.0
Thr
4.756ThrAla: 4.756 ± 0.687
0.359ThrCys: 0.359 ± 0.171
3.41ThrAsp: 3.41 ± 0.552
2.782ThrGlu: 2.782 ± 0.408
2.872ThrPhe: 2.872 ± 0.541
3.589ThrGly: 3.589 ± 0.534
1.795ThrHis: 1.795 ± 0.664
4.218ThrIle: 4.218 ± 0.749
5.384ThrLys: 5.384 ± 0.576
6.281ThrLeu: 6.281 ± 1.075
0.987ThrMet: 0.987 ± 0.272
3.769ThrAsn: 3.769 ± 0.591
1.705ThrPro: 1.705 ± 0.515
3.32ThrGln: 3.32 ± 0.903
2.243ThrArg: 2.243 ± 0.487
3.948ThrSer: 3.948 ± 0.518
3.141ThrThr: 3.141 ± 0.635
3.051ThrVal: 3.051 ± 0.654
0.718ThrTrp: 0.718 ± 0.283
3.23ThrTyr: 3.23 ± 0.511
0.0ThrXaa: 0.0 ± 0.0
Val
4.307ValAla: 4.307 ± 0.856
0.987ValCys: 0.987 ± 0.59
5.205ValAsp: 5.205 ± 0.655
3.859ValGlu: 3.859 ± 0.842
2.154ValPhe: 2.154 ± 0.359
4.576ValGly: 4.576 ± 0.587
0.449ValHis: 0.449 ± 0.175
4.128ValIle: 4.128 ± 0.615
5.564ValLys: 5.564 ± 0.672
4.487ValLeu: 4.487 ± 0.619
1.167ValMet: 1.167 ± 0.294
2.961ValAsn: 2.961 ± 0.625
2.154ValPro: 2.154 ± 0.347
1.705ValGln: 1.705 ± 0.406
1.795ValArg: 1.795 ± 0.385
4.397ValSer: 4.397 ± 0.656
6.192ValThr: 6.192 ± 1.828
3.32ValVal: 3.32 ± 0.671
0.897ValTrp: 0.897 ± 0.29
3.051ValTyr: 3.051 ± 0.891
0.0ValXaa: 0.0 ± 0.0
Trp
0.897TrpAla: 0.897 ± 0.27
0.09TrpCys: 0.09 ± 0.088
1.256TrpAsp: 1.256 ± 0.405
1.256TrpGlu: 1.256 ± 0.358
0.808TrpPhe: 0.808 ± 0.263
0.538TrpGly: 0.538 ± 0.285
0.269TrpHis: 0.269 ± 0.132
0.718TrpIle: 0.718 ± 0.212
0.897TrpLys: 0.897 ± 0.209
1.436TrpLeu: 1.436 ± 0.395
0.269TrpMet: 0.269 ± 0.136
0.718TrpAsn: 0.718 ± 0.303
0.09TrpPro: 0.09 ± 0.092
0.808TrpGln: 0.808 ± 0.278
0.987TrpArg: 0.987 ± 0.283
1.795TrpSer: 1.795 ± 0.511
1.167TrpThr: 1.167 ± 0.537
1.884TrpVal: 1.884 ± 0.548
0.359TrpTrp: 0.359 ± 0.187
0.449TrpTyr: 0.449 ± 0.279
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.141TyrAla: 3.141 ± 0.51
0.359TyrCys: 0.359 ± 0.241
2.333TyrAsp: 2.333 ± 0.469
3.23TyrGlu: 3.23 ± 0.747
2.423TyrPhe: 2.423 ± 0.483
2.872TyrGly: 2.872 ± 1.054
0.628TyrHis: 0.628 ± 0.213
3.23TyrIle: 3.23 ± 0.612
2.602TyrLys: 2.602 ± 0.441
3.859TyrLeu: 3.859 ± 0.613
1.167TyrMet: 1.167 ± 0.358
1.525TyrAsn: 1.525 ± 0.371
1.167TyrPro: 1.167 ± 0.399
2.333TyrGln: 2.333 ± 0.51
2.513TyrArg: 2.513 ± 0.548
2.423TyrSer: 2.423 ± 0.583
1.884TyrThr: 1.884 ± 0.587
2.872TyrVal: 2.872 ± 0.404
0.359TyrTrp: 0.359 ± 0.19
2.243TyrTyr: 2.243 ± 0.552
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 43 proteins (11145 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski