Amino acid dipepetide frequency for Streptococcus phage Str-PAP-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.267AlaAla: 1.267 ± 0.471
0.362AlaCys: 0.362 ± 0.191
4.161AlaAsp: 4.161 ± 0.56
4.252AlaGlu: 4.252 ± 0.61
3.166AlaPhe: 3.166 ± 0.703
3.89AlaGly: 3.89 ± 0.781
1.086AlaHis: 1.086 ± 0.342
4.976AlaIle: 4.976 ± 0.707
5.79AlaLys: 5.79 ± 0.787
4.523AlaLeu: 4.523 ± 0.558
1.719AlaMet: 1.719 ± 0.348
3.347AlaAsn: 3.347 ± 0.496
1.628AlaPro: 1.628 ± 0.531
1.538AlaGln: 1.538 ± 0.331
1.9AlaArg: 1.9 ± 0.391
3.166AlaSer: 3.166 ± 0.671
4.252AlaThr: 4.252 ± 0.59
3.8AlaVal: 3.8 ± 0.685
0.633AlaTrp: 0.633 ± 0.267
2.623AlaTyr: 2.623 ± 0.487
0.0AlaXaa: 0.0 ± 0.0
Cys
0.452CysAla: 0.452 ± 0.195
0.181CysCys: 0.181 ± 0.123
0.814CysAsp: 0.814 ± 0.301
0.452CysGlu: 0.452 ± 0.213
0.181CysPhe: 0.181 ± 0.116
0.905CysGly: 0.905 ± 0.469
0.09CysHis: 0.09 ± 0.104
0.271CysIle: 0.271 ± 0.148
0.905CysLys: 0.905 ± 0.399
0.452CysLeu: 0.452 ± 0.183
0.09CysMet: 0.09 ± 0.09
0.452CysAsn: 0.452 ± 0.313
0.271CysPro: 0.271 ± 0.188
0.181CysGln: 0.181 ± 0.123
0.724CysArg: 0.724 ± 0.267
0.633CysSer: 0.633 ± 0.316
0.09CysThr: 0.09 ± 0.1
0.543CysVal: 0.543 ± 0.198
0.09CysTrp: 0.09 ± 0.097
0.543CysTyr: 0.543 ± 0.245
0.0CysXaa: 0.0 ± 0.0
Asp
3.438AspAla: 3.438 ± 0.53
0.543AspCys: 0.543 ± 0.23
3.98AspAsp: 3.98 ± 0.741
3.076AspGlu: 3.076 ± 0.455
2.714AspPhe: 2.714 ± 0.536
6.061AspGly: 6.061 ± 0.694
0.995AspHis: 0.995 ± 0.272
4.433AspIle: 4.433 ± 0.65
5.337AspLys: 5.337 ± 0.903
5.518AspLeu: 5.518 ± 0.745
1.719AspMet: 1.719 ± 0.521
4.523AspAsn: 4.523 ± 0.537
1.267AspPro: 1.267 ± 0.331
0.814AspGln: 0.814 ± 0.223
2.623AspArg: 2.623 ± 0.347
3.709AspSer: 3.709 ± 0.635
3.166AspThr: 3.166 ± 0.562
3.528AspVal: 3.528 ± 0.546
1.176AspTrp: 1.176 ± 0.342
3.528AspTyr: 3.528 ± 0.628
0.0AspXaa: 0.0 ± 0.0
Glu
3.98GluAla: 3.98 ± 0.69
0.814GluCys: 0.814 ± 0.343
3.8GluAsp: 3.8 ± 0.587
5.066GluGlu: 5.066 ± 0.868
2.714GluPhe: 2.714 ± 0.637
3.257GluGly: 3.257 ± 0.437
1.719GluHis: 1.719 ± 0.387
6.333GluIle: 6.333 ± 0.744
5.066GluLys: 5.066 ± 0.937
7.69GluLeu: 7.69 ± 0.884
2.443GluMet: 2.443 ± 0.514
4.614GluAsn: 4.614 ± 0.587
1.357GluPro: 1.357 ± 0.37
2.985GluGln: 2.985 ± 0.428
2.985GluArg: 2.985 ± 0.499
3.257GluSer: 3.257 ± 0.608
3.89GluThr: 3.89 ± 0.573
4.523GluVal: 4.523 ± 0.695
1.086GluTrp: 1.086 ± 0.279
2.171GluTyr: 2.171 ± 0.57
0.0GluXaa: 0.0 ± 0.0
Phe
2.352PheAla: 2.352 ± 0.544
0.0PheCys: 0.0 ± 0.0
2.985PheAsp: 2.985 ± 0.567
2.714PheGlu: 2.714 ± 0.482
1.9PhePhe: 1.9 ± 0.41
2.623PheGly: 2.623 ± 0.472
0.633PheHis: 0.633 ± 0.227
2.714PheIle: 2.714 ± 0.507
3.619PheLys: 3.619 ± 0.558
2.895PheLeu: 2.895 ± 0.527
0.814PheMet: 0.814 ± 0.289
3.619PheAsn: 3.619 ± 0.708
0.905PhePro: 0.905 ± 0.269
0.995PheGln: 0.995 ± 0.279
1.628PheArg: 1.628 ± 0.398
2.895PheSer: 2.895 ± 0.631
3.166PheThr: 3.166 ± 0.541
1.9PheVal: 1.9 ± 0.432
0.362PheTrp: 0.362 ± 0.183
1.809PheTyr: 1.809 ± 0.464
0.0PheXaa: 0.0 ± 0.0
Gly
2.985GlyAla: 2.985 ± 0.534
0.271GlyCys: 0.271 ± 0.139
3.89GlyAsp: 3.89 ± 0.522
3.619GlyGlu: 3.619 ± 0.503
3.076GlyPhe: 3.076 ± 0.574
5.518GlyGly: 5.518 ± 1.008
0.995GlyHis: 0.995 ± 0.336
5.428GlyIle: 5.428 ± 0.954
6.333GlyLys: 6.333 ± 0.8
5.066GlyLeu: 5.066 ± 0.576
1.719GlyMet: 1.719 ± 0.395
4.071GlyAsn: 4.071 ± 0.593
0.814GlyPro: 0.814 ± 0.32
2.171GlyGln: 2.171 ± 0.488
2.262GlyArg: 2.262 ± 0.455
4.342GlySer: 4.342 ± 0.834
4.976GlyThr: 4.976 ± 0.894
3.709GlyVal: 3.709 ± 0.579
1.538GlyTrp: 1.538 ± 0.308
4.252GlyTyr: 4.252 ± 0.754
0.0GlyXaa: 0.0 ± 0.0
His
0.633HisAla: 0.633 ± 0.286
0.09HisCys: 0.09 ± 0.102
0.814HisAsp: 0.814 ± 0.277
0.543HisGlu: 0.543 ± 0.203
0.543HisPhe: 0.543 ± 0.195
1.809HisGly: 1.809 ± 0.445
0.181HisHis: 0.181 ± 0.136
1.267HisIle: 1.267 ± 0.361
1.267HisLys: 1.267 ± 0.333
1.086HisLeu: 1.086 ± 0.275
0.09HisMet: 0.09 ± 0.1
1.176HisAsn: 1.176 ± 0.347
0.452HisPro: 0.452 ± 0.194
0.633HisGln: 0.633 ± 0.18
1.176HisArg: 1.176 ± 0.31
0.724HisSer: 0.724 ± 0.328
1.086HisThr: 1.086 ± 0.339
1.086HisVal: 1.086 ± 0.407
0.362HisTrp: 0.362 ± 0.205
0.995HisTyr: 0.995 ± 0.303
0.0HisXaa: 0.0 ± 0.0
Ile
4.161IleAla: 4.161 ± 0.554
0.543IleCys: 0.543 ± 0.234
4.976IleAsp: 4.976 ± 0.645
6.333IleGlu: 6.333 ± 0.67
2.533IlePhe: 2.533 ± 0.351
5.157IleGly: 5.157 ± 0.739
1.267IleHis: 1.267 ± 0.309
4.342IleIle: 4.342 ± 0.934
7.328IleLys: 7.328 ± 0.879
5.157IleLeu: 5.157 ± 0.665
1.628IleMet: 1.628 ± 0.307
4.161IleAsn: 4.161 ± 0.555
3.076IlePro: 3.076 ± 0.571
2.623IleGln: 2.623 ± 0.519
1.99IleArg: 1.99 ± 0.422
5.88IleSer: 5.88 ± 1.005
4.161IleThr: 4.161 ± 0.526
4.342IleVal: 4.342 ± 0.67
0.995IleTrp: 0.995 ± 0.237
2.081IleTyr: 2.081 ± 0.49
0.0IleXaa: 0.0 ± 0.0
Lys
6.785LysAla: 6.785 ± 0.902
0.271LysCys: 0.271 ± 0.16
5.247LysAsp: 5.247 ± 0.717
7.237LysGlu: 7.237 ± 1.063
2.623LysPhe: 2.623 ± 0.628
5.609LysGly: 5.609 ± 0.725
1.267LysHis: 1.267 ± 0.435
6.423LysIle: 6.423 ± 0.79
7.328LysLys: 7.328 ± 0.884
7.147LysLeu: 7.147 ± 1.092
2.533LysMet: 2.533 ± 0.565
5.428LysAsn: 5.428 ± 0.886
2.533LysPro: 2.533 ± 0.446
3.619LysGln: 3.619 ± 0.752
3.528LysArg: 3.528 ± 0.526
4.885LysSer: 4.885 ± 0.576
4.885LysThr: 4.885 ± 0.702
5.428LysVal: 5.428 ± 0.571
1.267LysTrp: 1.267 ± 0.286
4.161LysTyr: 4.161 ± 0.588
0.0LysXaa: 0.0 ± 0.0
Leu
4.704LeuAla: 4.704 ± 0.729
0.995LeuCys: 0.995 ± 0.332
4.252LeuAsp: 4.252 ± 0.499
6.604LeuGlu: 6.604 ± 0.787
2.804LeuPhe: 2.804 ± 0.418
3.528LeuGly: 3.528 ± 0.556
1.086LeuHis: 1.086 ± 0.293
5.518LeuIle: 5.518 ± 0.826
8.594LeuLys: 8.594 ± 1.303
6.694LeuLeu: 6.694 ± 0.841
1.628LeuMet: 1.628 ± 0.341
5.88LeuAsn: 5.88 ± 0.62
3.076LeuPro: 3.076 ± 0.536
2.623LeuGln: 2.623 ± 0.565
3.076LeuArg: 3.076 ± 0.475
5.247LeuSer: 5.247 ± 0.82
5.88LeuThr: 5.88 ± 0.843
4.704LeuVal: 4.704 ± 0.62
0.905LeuTrp: 0.905 ± 0.306
2.533LeuTyr: 2.533 ± 0.445
0.0LeuXaa: 0.0 ± 0.0
Met
2.533MetAla: 2.533 ± 0.462
0.0MetCys: 0.0 ± 0.0
1.357MetAsp: 1.357 ± 0.372
1.176MetGlu: 1.176 ± 0.315
0.543MetPhe: 0.543 ± 0.236
1.267MetGly: 1.267 ± 0.413
0.452MetHis: 0.452 ± 0.191
1.538MetIle: 1.538 ± 0.403
2.352MetLys: 2.352 ± 0.482
1.628MetLeu: 1.628 ± 0.424
0.452MetMet: 0.452 ± 0.198
1.538MetAsn: 1.538 ± 0.364
0.814MetPro: 0.814 ± 0.275
1.267MetGln: 1.267 ± 0.505
0.543MetArg: 0.543 ± 0.235
1.628MetSer: 1.628 ± 0.341
1.628MetThr: 1.628 ± 0.395
1.267MetVal: 1.267 ± 0.333
0.09MetTrp: 0.09 ± 0.081
0.543MetTyr: 0.543 ± 0.22
0.0MetXaa: 0.0 ± 0.0
Asn
4.252AsnAla: 4.252 ± 0.747
0.814AsnCys: 0.814 ± 0.281
2.985AsnAsp: 2.985 ± 0.609
5.337AsnGlu: 5.337 ± 0.884
3.166AsnPhe: 3.166 ± 0.658
7.056AsnGly: 7.056 ± 0.925
0.905AsnHis: 0.905 ± 0.384
3.89AsnIle: 3.89 ± 0.489
4.885AsnLys: 4.885 ± 0.62
4.161AsnLeu: 4.161 ± 0.715
1.538AsnMet: 1.538 ± 0.317
3.347AsnAsn: 3.347 ± 0.53
2.171AsnPro: 2.171 ± 0.446
3.347AsnGln: 3.347 ± 0.531
3.166AsnArg: 3.166 ± 0.56
3.709AsnSer: 3.709 ± 0.618
3.076AsnThr: 3.076 ± 0.438
3.619AsnVal: 3.619 ± 0.46
1.538AsnTrp: 1.538 ± 0.433
3.076AsnTyr: 3.076 ± 0.539
0.0AsnXaa: 0.0 ± 0.0
Pro
1.086ProAla: 1.086 ± 0.319
0.09ProCys: 0.09 ± 0.1
1.628ProAsp: 1.628 ± 0.429
1.9ProGlu: 1.9 ± 0.459
1.086ProPhe: 1.086 ± 0.335
0.271ProGly: 0.271 ± 0.135
0.452ProHis: 0.452 ± 0.24
1.809ProIle: 1.809 ± 0.327
3.257ProLys: 3.257 ± 0.717
2.443ProLeu: 2.443 ± 0.512
0.633ProMet: 0.633 ± 0.209
1.719ProAsn: 1.719 ± 0.375
0.814ProPro: 0.814 ± 0.32
1.176ProGln: 1.176 ± 0.321
1.357ProArg: 1.357 ± 0.36
3.438ProSer: 3.438 ± 0.575
2.804ProThr: 2.804 ± 0.479
2.262ProVal: 2.262 ± 0.414
0.09ProTrp: 0.09 ± 0.077
1.447ProTyr: 1.447 ± 0.301
0.0ProXaa: 0.0 ± 0.0
Gln
2.352GlnAla: 2.352 ± 0.434
0.452GlnCys: 0.452 ± 0.236
1.447GlnAsp: 1.447 ± 0.327
2.081GlnGlu: 2.081 ± 0.5
1.628GlnPhe: 1.628 ± 0.355
1.719GlnGly: 1.719 ± 0.323
0.633GlnHis: 0.633 ± 0.232
2.443GlnIle: 2.443 ± 0.553
3.89GlnLys: 3.89 ± 0.541
3.438GlnLeu: 3.438 ± 0.868
0.271GlnMet: 0.271 ± 0.21
2.262GlnAsn: 2.262 ± 0.468
1.267GlnPro: 1.267 ± 0.322
0.995GlnGln: 0.995 ± 0.303
2.262GlnArg: 2.262 ± 0.581
2.804GlnSer: 2.804 ± 0.477
1.99GlnThr: 1.99 ± 0.38
1.357GlnVal: 1.357 ± 0.301
0.452GlnTrp: 0.452 ± 0.233
1.538GlnTyr: 1.538 ± 0.469
0.0GlnXaa: 0.0 ± 0.0
Arg
1.719ArgAla: 1.719 ± 0.397
0.271ArgCys: 0.271 ± 0.148
2.533ArgAsp: 2.533 ± 0.427
1.99ArgGlu: 1.99 ± 0.467
1.267ArgPhe: 1.267 ± 0.513
1.538ArgGly: 1.538 ± 0.329
0.543ArgHis: 0.543 ± 0.224
3.347ArgIle: 3.347 ± 0.472
3.619ArgLys: 3.619 ± 0.621
4.342ArgLeu: 4.342 ± 0.709
1.267ArgMet: 1.267 ± 0.383
2.895ArgAsn: 2.895 ± 0.687
1.719ArgPro: 1.719 ± 0.317
1.538ArgGln: 1.538 ± 0.404
1.357ArgArg: 1.357 ± 0.561
2.443ArgSer: 2.443 ± 0.46
2.714ArgThr: 2.714 ± 0.528
1.628ArgVal: 1.628 ± 0.381
0.995ArgTrp: 0.995 ± 0.312
1.809ArgTyr: 1.809 ± 0.439
0.0ArgXaa: 0.0 ± 0.0
Ser
4.071SerAla: 4.071 ± 0.686
0.362SerCys: 0.362 ± 0.168
3.8SerAsp: 3.8 ± 0.546
4.976SerGlu: 4.976 ± 0.609
2.895SerPhe: 2.895 ± 0.569
4.614SerGly: 4.614 ± 0.581
1.086SerHis: 1.086 ± 0.332
4.433SerIle: 4.433 ± 0.607
4.976SerLys: 4.976 ± 0.524
5.518SerLeu: 5.518 ± 0.668
1.357SerMet: 1.357 ± 0.318
5.066SerAsn: 5.066 ± 0.533
1.719SerPro: 1.719 ± 0.353
2.352SerGln: 2.352 ± 0.438
1.628SerArg: 1.628 ± 0.325
4.433SerSer: 4.433 ± 0.548
3.438SerThr: 3.438 ± 0.705
4.252SerVal: 4.252 ± 0.513
0.995SerTrp: 0.995 ± 0.237
1.99SerTyr: 1.99 ± 0.304
0.0SerXaa: 0.0 ± 0.0
Thr
3.619ThrAla: 3.619 ± 0.584
0.543ThrCys: 0.543 ± 0.262
4.252ThrAsp: 4.252 ± 0.568
3.8ThrGlu: 3.8 ± 0.422
3.166ThrPhe: 3.166 ± 0.525
4.704ThrGly: 4.704 ± 0.64
0.995ThrHis: 0.995 ± 0.262
4.795ThrIle: 4.795 ± 0.598
4.342ThrLys: 4.342 ± 0.514
4.342ThrLeu: 4.342 ± 0.677
0.633ThrMet: 0.633 ± 0.232
4.614ThrAsn: 4.614 ± 0.849
2.171ThrPro: 2.171 ± 0.359
1.719ThrGln: 1.719 ± 0.393
2.081ThrArg: 2.081 ± 0.521
2.804ThrSer: 2.804 ± 0.479
4.161ThrThr: 4.161 ± 0.515
5.428ThrVal: 5.428 ± 0.944
0.724ThrTrp: 0.724 ± 0.27
2.895ThrTyr: 2.895 ± 0.554
0.0ThrXaa: 0.0 ± 0.0
Val
3.8ValAla: 3.8 ± 0.596
0.452ValCys: 0.452 ± 0.175
5.428ValAsp: 5.428 ± 0.602
3.98ValGlu: 3.98 ± 0.652
2.262ValPhe: 2.262 ± 0.342
4.614ValGly: 4.614 ± 0.644
0.633ValHis: 0.633 ± 0.215
5.609ValIle: 5.609 ± 0.618
4.885ValLys: 4.885 ± 0.663
4.071ValLeu: 4.071 ± 0.695
1.176ValMet: 1.176 ± 0.313
4.433ValAsn: 4.433 ± 0.668
1.9ValPro: 1.9 ± 0.447
1.99ValGln: 1.99 ± 0.433
2.262ValArg: 2.262 ± 0.431
4.161ValSer: 4.161 ± 0.504
3.166ValThr: 3.166 ± 0.497
3.528ValVal: 3.528 ± 0.54
0.633ValTrp: 0.633 ± 0.218
1.99ValTyr: 1.99 ± 0.389
0.0ValXaa: 0.0 ± 0.0
Trp
0.452TrpAla: 0.452 ± 0.147
0.181TrpCys: 0.181 ± 0.13
0.543TrpAsp: 0.543 ± 0.211
1.447TrpGlu: 1.447 ± 0.322
0.724TrpPhe: 0.724 ± 0.255
1.086TrpGly: 1.086 ± 0.334
0.271TrpHis: 0.271 ± 0.169
0.905TrpIle: 0.905 ± 0.242
1.357TrpLys: 1.357 ± 0.513
1.357TrpLeu: 1.357 ± 0.369
0.09TrpMet: 0.09 ± 0.084
0.543TrpAsn: 0.543 ± 0.25
0.09TrpPro: 0.09 ± 0.081
0.362TrpGln: 0.362 ± 0.169
0.995TrpArg: 0.995 ± 0.309
0.814TrpSer: 0.814 ± 0.284
1.267TrpThr: 1.267 ± 0.362
0.995TrpVal: 0.995 ± 0.365
0.181TrpTrp: 0.181 ± 0.112
0.633TrpTyr: 0.633 ± 0.2
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.438TyrAla: 3.438 ± 0.591
1.086TyrCys: 1.086 ± 0.369
3.257TyrAsp: 3.257 ± 0.607
3.166TyrGlu: 3.166 ± 0.593
1.538TyrPhe: 1.538 ± 0.35
1.9TyrGly: 1.9 ± 0.357
0.724TyrHis: 0.724 ± 0.194
2.262TyrIle: 2.262 ± 0.585
3.076TyrLys: 3.076 ± 0.647
2.804TyrLeu: 2.804 ± 0.463
0.724TyrMet: 0.724 ± 0.231
2.352TyrAsn: 2.352 ± 0.447
1.719TyrPro: 1.719 ± 0.343
2.352TyrGln: 2.352 ± 0.32
1.99TyrArg: 1.99 ± 0.41
2.985TyrSer: 2.985 ± 0.564
1.99TyrThr: 1.99 ± 0.329
3.076TyrVal: 3.076 ± 0.459
0.181TyrTrp: 0.181 ± 0.117
1.99TyrTyr: 1.99 ± 0.557
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (11055 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski