Amino acid dipeptide frequency for Streptococcus phage IPP38

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
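As a rough illustration of what a dipeptide frequency is, the following minimal Python sketch counts overlapping residue pairs in a set of protein sequences. The function name dipeptide_frequencies, the use of one-letter codes, the toy sequences, and the normalization by the total number of counted pairs are all assumptions made for this example; the exact procedure used to produce the table is not described on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille, pooled over all proteins.

    Hypothetical helper: normalizing by the total number of counted pairs
    is an assumption, not necessarily what the database does.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (made-up sequences in one-letter code, not phage proteins).
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Counting within each protein separately keeps the last residue of one protein from being paired with the first residue of the next.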

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins randomly with equal probability, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and all underrepresented ones are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
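The arithmetic behind the expected value and the per mille unit can be sketched as follows. The helper names uniform_expectation_permille and classify are hypothetical, and comparing each value against the uniform expectation is only an assumption about how the red/blue marking works; the three example values are taken from the table on this page.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected frequency (in per mille) if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the expectation (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


expected = uniform_expectation_permille()  # 1000 / 400 = 2.5 per mille = 0.25%

# Values copied from the table (per mille).
examples = {"GluLeu": 8.088, "AlaAla": 2.855, "CysCys": 0.079}
labels = {pair: classify(value, expected) for pair, value in examples.items()}

# Converting a per mille value to a plain fraction: multiply by 10^-3.
glu_leu_fraction = examples["GluLeu"] * 1e-3

print(expected, labels, glu_leu_fraction)
```

With 21 symbols (including Xaa) the same function gives 1000/441, roughly 2.27‰, instead of 2.5‰.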

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰). Rows: first amino acid; columns: second amino acid.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 2.855 ± 0.74 | 0.396 ± 0.201 | 5.868 ± 0.588 | 6.264 ± 0.645 | 2.379 ± 0.518 | 4.678 ± 0.902 | 0.793 ± 0.296 | 4.758 ± 1.005 | 6.264 ± 0.675 | 5.947 ± 0.811 | 2.141 ± 0.424 | 3.885 ± 0.694 | 2.062 ± 0.462 | 1.982 ± 0.443 | 2.696 ± 0.544 | 2.458 ± 0.656 | 4.678 ± 0.606 | 5.471 ± 0.706 | 1.189 ± 0.371 | 1.665 ± 0.34 | 0.0 ± 0.0 |
| Cys | 0.238 ± 0.114 | 0.079 ± 0.074 | 0.317 ± 0.135 | 0.396 ± 0.182 | 0.396 ± 0.199 | 0.159 ± 0.132 | 0.0 ± 0.0 | 0.317 ± 0.205 | 0.714 ± 0.222 | 0.555 ± 0.202 | 0.0 ± 0.0 | 0.159 ± 0.147 | 0.317 ± 0.153 | 0.159 ± 0.118 | 0.476 ± 0.156 | 0.238 ± 0.135 | 0.079 ± 0.088 | 0.238 ± 0.158 | 0.159 ± 0.113 | 0.396 ± 0.16 | 0.0 ± 0.0 |
| Asp | 3.41 ± 0.673 | 0.555 ± 0.232 | 3.648 ± 0.71 | 5.234 ± 1.015 | 2.855 ± 0.422 | 4.837 ± 0.668 | 0.634 ± 0.247 | 5.551 ± 0.614 | 5.234 ± 0.715 | 5.471 ± 0.566 | 1.427 ± 0.285 | 2.617 ± 0.391 | 1.745 ± 0.375 | 1.745 ± 0.344 | 2.775 ± 0.526 | 3.648 ± 0.5 | 3.093 ± 0.422 | 3.806 ± 0.5 | 1.427 ± 0.285 | 2.537 ± 0.408 | 0.0 ± 0.0 |
| Glu | 6.661 ± 0.935 | 0.079 ± 0.062 | 3.885 ± 0.617 | 6.185 ± 1.043 | 3.727 ± 0.569 | 4.044 ± 0.528 | 1.348 ± 0.37 | 6.502 ± 0.827 | 7.454 ± 1.052 | 8.088 ± 0.867 | 1.903 ± 0.452 | 4.044 ± 0.573 | 1.11 ± 0.255 | 3.093 ± 0.567 | 3.885 ± 0.62 | 5.234 ± 0.618 | 4.044 ± 0.581 | 5.551 ± 0.797 | 1.269 ± 0.268 | 2.696 ± 0.431 | 0.0 ± 0.0 |
| Phe | 2.3 ± 0.551 | 0.238 ± 0.148 | 3.885 ± 0.532 | 3.33 ± 0.496 | 1.824 ± 0.356 | 2.537 ± 0.673 | 0.238 ± 0.128 | 2.3 ± 0.372 | 3.33 ± 0.498 | 2.537 ± 0.313 | 0.872 ± 0.339 | 2.537 ± 0.58 | 0.714 ± 0.257 | 1.348 ± 0.304 | 1.665 ± 0.27 | 3.33 ± 0.647 | 2.458 ± 0.354 | 1.982 ± 0.392 | 0.793 ± 0.273 | 1.824 ± 0.423 | 0.0 ± 0.0 |
| Gly | 3.093 ± 0.47 | 0.159 ± 0.112 | 3.885 ± 0.631 | 4.52 ± 0.564 | 3.013 ± 0.526 | 4.758 ± 1.116 | 0.872 ± 0.209 | 3.568 ± 0.614 | 5.551 ± 0.492 | 5.947 ± 1.138 | 1.665 ± 0.321 | 3.885 ± 0.452 | 0.952 ± 0.302 | 3.41 ± 0.508 | 3.648 ± 0.574 | 3.568 ± 0.691 | 2.458 ± 0.485 | 4.203 ± 0.69 | 1.031 ± 0.449 | 2.855 ± 0.391 | 0.0 ± 0.0 |
| His | 0.872 ± 0.328 | 0.0 ± 0.0 | 0.793 ± 0.262 | 1.348 ± 0.292 | 0.634 ± 0.219 | 0.872 ± 0.288 | 0.396 ± 0.208 | 0.872 ± 0.317 | 0.634 ± 0.27 | 1.11 ± 0.292 | 0.159 ± 0.124 | 1.189 ± 0.28 | 0.872 ± 0.24 | 0.634 ± 0.225 | 0.872 ± 0.272 | 1.507 ± 0.413 | 0.793 ± 0.241 | 0.714 ± 0.245 | 0.159 ± 0.118 | 0.793 ± 0.295 | 0.0 ± 0.0 |
| Ile | 5.392 ± 0.678 | 0.476 ± 0.155 | 3.568 ± 0.573 | 6.819 ± 0.879 | 2.775 ± 0.615 | 5.075 ± 0.889 | 0.793 ± 0.377 | 2.934 ± 0.516 | 6.661 ± 0.582 | 4.599 ± 0.661 | 1.586 ± 0.34 | 3.806 ± 0.596 | 1.427 ± 0.331 | 2.22 ± 0.328 | 2.537 ± 0.443 | 5.392 ± 0.719 | 4.599 ± 0.545 | 3.965 ± 0.621 | 0.714 ± 0.213 | 2.537 ± 0.63 | 0.0 ± 0.0 |
| Lys | 5.471 ± 0.584 | 0.396 ± 0.2 | 5.709 ± 0.595 | 7.692 ± 1.004 | 3.093 ± 0.557 | 4.52 ± 0.595 | 1.586 ± 0.393 | 6.106 ± 0.818 | 7.612 ± 0.931 | 7.692 ± 0.646 | 2.617 ± 0.472 | 3.965 ± 0.474 | 2.379 ± 0.562 | 3.965 ± 0.586 | 4.203 ± 0.564 | 4.123 ± 0.493 | 5.868 ± 0.665 | 5.947 ± 0.737 | 0.952 ± 0.319 | 3.33 ± 0.372 | 0.0 ± 0.0 |
| Leu | 7.216 ± 0.899 | 0.555 ± 0.279 | 5.709 ± 0.588 | 7.295 ± 0.888 | 2.617 ± 0.481 | 5.313 ± 1.138 | 1.269 ± 0.29 | 4.044 ± 0.592 | 7.85 ± 0.862 | 6.582 ± 0.958 | 2.22 ± 0.385 | 3.41 ± 0.464 | 3.41 ± 0.562 | 3.489 ± 0.659 | 3.41 ± 0.528 | 4.599 ± 0.645 | 5.947 ± 0.827 | 4.361 ± 0.538 | 0.952 ± 0.273 | 2.379 ± 0.355 | 0.0 ± 0.0 |
| Met | 1.745 ± 0.352 | 0.0 ± 0.0 | 1.348 ± 0.259 | 1.903 ± 0.444 | 0.952 ± 0.248 | 1.269 ± 0.462 | 0.238 ± 0.139 | 1.903 ± 0.425 | 2.3 ± 0.369 | 1.586 ± 0.336 | 0.396 ± 0.187 | 1.665 ± 0.424 | 0.872 ± 0.301 | 0.793 ± 0.313 | 1.507 ± 0.358 | 1.586 ± 0.417 | 1.745 ± 0.377 | 1.586 ± 0.39 | 0.238 ± 0.132 | 0.872 ± 0.237 | 0.0 ± 0.0 |
| Asn | 4.441 ± 0.793 | 0.317 ± 0.135 | 2.696 ± 0.524 | 3.172 ± 0.553 | 1.903 ± 0.465 | 4.044 ± 0.68 | 0.952 ± 0.28 | 3.172 ± 0.565 | 4.678 ± 0.6 | 4.361 ± 0.54 | 1.348 ± 0.307 | 2.379 ± 0.493 | 1.824 ± 0.381 | 3.172 ± 0.602 | 2.696 ± 0.496 | 3.489 ± 0.515 | 2.379 ± 0.502 | 2.775 ± 0.433 | 0.793 ± 0.241 | 1.903 ± 0.399 | 0.0 ± 0.0 |
| Pro | 1.745 ± 0.459 | 0.079 ± 0.076 | 2.062 ± 0.491 | 3.33 ± 0.44 | 0.634 ± 0.241 | 1.189 ± 0.283 | 0.317 ± 0.128 | 2.3 ± 0.522 | 2.775 ± 0.457 | 1.348 ± 0.37 | 0.634 ± 0.217 | 1.982 ± 0.443 | 0.793 ± 0.369 | 0.634 ± 0.252 | 1.269 ± 0.289 | 1.665 ± 0.505 | 0.793 ± 0.287 | 1.745 ± 0.324 | 0.396 ± 0.199 | 1.745 ± 0.473 | 0.0 ± 0.0 |
| Gln | 3.965 ± 0.508 | 0.238 ± 0.138 | 1.427 ± 0.34 | 3.013 ± 0.571 | 1.348 ± 0.314 | 1.586 ± 0.335 | 0.634 ± 0.251 | 3.489 ± 0.494 | 3.806 ± 0.5 | 3.33 ± 0.528 | 0.872 ± 0.211 | 1.507 ± 0.336 | 1.031 ± 0.319 | 1.507 ± 0.328 | 1.586 ± 0.386 | 2.537 ± 0.469 | 2.934 ± 0.421 | 3.41 ± 0.495 | 0.555 ± 0.193 | 0.952 ± 0.335 | 0.0 ± 0.0 |
| Arg | 2.775 ± 0.423 | 0.396 ± 0.145 | 2.22 ± 0.448 | 3.251 ± 0.545 | 1.824 ± 0.34 | 1.903 ± 0.382 | 0.872 ± 0.279 | 3.172 ± 0.535 | 3.885 ± 0.72 | 4.996 ± 0.655 | 2.141 ± 0.411 | 2.775 ± 0.435 | 1.11 ± 0.277 | 1.745 ± 0.387 | 2.537 ± 0.515 | 2.617 ± 0.505 | 2.775 ± 0.534 | 2.3 ± 0.405 | 0.555 ± 0.177 | 1.745 ± 0.403 | 0.0 ± 0.0 |
| Ser | 4.282 ± 0.942 | 0.159 ± 0.107 | 3.727 ± 0.528 | 3.885 ± 0.541 | 2.141 ± 0.36 | 4.837 ± 0.705 | 1.269 ± 0.391 | 4.203 ± 0.685 | 4.52 ± 0.699 | 5.789 ± 0.796 | 1.348 ± 0.386 | 2.775 ± 0.598 | 1.348 ± 0.315 | 2.458 ± 0.404 | 3.013 ± 0.486 | 3.806 ± 0.598 | 4.441 ± 0.623 | 3.41 ± 0.613 | 0.952 ± 0.354 | 3.013 ± 0.545 | 0.0 ± 0.0 |
| Thr | 4.123 ± 0.759 | 0.238 ± 0.135 | 4.044 ± 0.457 | 4.282 ± 0.538 | 3.093 ± 0.474 | 3.727 ± 0.629 | 1.11 ± 0.376 | 5.075 ± 0.617 | 4.599 ± 0.752 | 4.361 ± 0.575 | 0.793 ± 0.304 | 3.806 ± 0.544 | 1.824 ± 0.497 | 2.537 ± 0.536 | 1.745 ± 0.33 | 3.885 ± 0.628 | 5.154 ± 0.926 | 3.727 ± 0.671 | 0.634 ± 0.246 | 2.537 ± 0.523 | 0.0 ± 0.0 |
| Val | 4.916 ± 0.722 | 0.476 ± 0.222 | 3.885 ± 0.588 | 5.313 ± 0.593 | 1.982 ± 0.394 | 4.758 ± 0.621 | 0.872 ± 0.308 | 4.441 ± 0.824 | 4.996 ± 0.55 | 4.916 ± 0.499 | 1.031 ± 0.322 | 3.727 ± 0.664 | 1.665 ± 0.325 | 1.745 ± 0.4 | 2.3 ± 0.369 | 4.678 ± 0.574 | 4.203 ± 0.618 | 4.361 ± 0.73 | 0.476 ± 0.183 | 2.062 ± 0.421 | 0.0 ± 0.0 |
| Trp | 1.11 ± 0.355 | 0.159 ± 0.098 | 0.714 ± 0.273 | 1.031 ± 0.329 | 1.11 ± 0.44 | 0.555 ± 0.188 | 0.079 ± 0.075 | 0.793 ± 0.248 | 1.189 ± 0.309 | 0.634 ± 0.288 | 0.317 ± 0.151 | 0.952 ± 0.283 | 0.079 ± 0.091 | 0.952 ± 0.316 | 0.396 ± 0.163 | 0.476 ± 0.148 | 0.872 ± 0.26 | 1.269 ± 0.312 | 0.159 ± 0.115 | 0.714 ± 0.551 | 0.0 ± 0.0 |
| Tyr | 1.824 ± 0.346 | 0.396 ± 0.165 | 2.855 ± 0.547 | 2.537 ± 0.516 | 1.745 ± 0.417 | 2.379 ± 0.412 | 0.952 ± 0.241 | 2.379 ± 0.523 | 3.172 ± 0.472 | 2.775 ± 0.495 | 0.952 ± 0.296 | 1.427 ± 0.316 | 1.903 ± 0.396 | 2.141 ± 0.429 | 2.458 ± 0.565 | 2.537 ± 0.445 | 1.982 ± 0.347 | 1.982 ± 0.417 | 0.159 ± 0.1 | 1.745 ± 0.493 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 63 proteins (12612 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
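One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement and the statistic is recomputed for each resample. The Python sketch below follows that reading under several assumptions: the function names pair_counts and bootstrap_error are hypothetical, the reported error is taken to be the standard deviation over 100 resamples, and the toy sequences are made up. It is not the database's actual code.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Mean and standard deviation (per mille) of one dipeptide's frequency.

    Whole proteins are resampled with replacement (assumed meaning of
    'at the protein level'); n_boot=100 mirrors the 'x100' in the note.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome (made-up one-letter sequences).
proteins = ["MKKLA", "AKK", "MKLAKKA"]
mean_kk, sd_kk = bootstrap_error(proteins, "KK")
print(round(mean_kk, 3), round(sd_kk, 3))
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects variation between proteins.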

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski