Amino acid dipeptide frequency for Streptococcus phage PH10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
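The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual code: the function names (`dipeptide_per_mille`, `classify`), the use of one-letter codes with `X` standing for Xaa, the choice to count overlapping adjacent pairs within each protein, and the normalisation by the total number of counted pairs are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # 'X' plays the role of Xaa (assumption)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Pairs are counted as overlapping adjacent residues within each protein;
    no pair is formed across protein boundaries (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X'.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille, n_dipeptides=400):
    """Compare a frequency with the uniform expectation (2.5 per mille for 400)."""
    expected = 1000.0 / n_dipeptides
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["AAC", "ACA"])  # toy input, not real data
    print(freqs["AC"], classify(freqs["AC"]))
```

With real input, `proteins` would be the list of protein sequences of the proteome. The exact threshold the page uses for its red and blue marking is not stated, so `classify` simply compares against the uniform expectation.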

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain a fraction.
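As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical, and the value 9.409 is the LeuLys entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_per_mille / 10.0


if __name__ == "__main__":
    leu_lys = 9.409  # LeuLys frequency in per mille, taken from the table
    print(per_mille_to_fraction(leu_lys))
    print(per_mille_to_percent(leu_lys))
```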

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide and columns the second; each cell is the frequency ± error in ‰.

| First / Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 3.515 ± 0.569 | 0.62 ± 0.271 | 5.066 ± 0.709 | 5.066 ± 0.831 | 1.861 ± 0.371 | 3.619 ± 0.649 | 0.827 ± 0.292 | 6.927 ± 1.135 | 4.342 ± 0.715 | 7.237 ± 0.742 | 2.688 ± 0.458 | 3.412 ± 0.671 | 1.344 ± 0.402 | 2.792 ± 0.605 | 2.481 ± 0.523 | 3.619 ± 0.581 | 2.792 ± 0.531 | 3.825 ± 0.582 | 0.62 ± 0.3 | 2.585 ± 0.478 | 0.0 ± 0.0 |
| Cys | 0.103 ± 0.094 | 0.0 ± 0.0 | 0.207 ± 0.175 | 0.517 ± 0.21 | 0.31 ± 0.223 | 0.103 ± 0.087 | 0.31 ± 0.185 | 0.517 ± 0.225 | 0.414 ± 0.165 | 1.034 ± 0.404 | 0.207 ± 0.146 | 0.31 ± 0.187 | 0.0 ± 0.0 | 0.207 ± 0.136 | 0.103 ± 0.111 | 0.62 ± 0.353 | 0.31 ± 0.305 | 0.103 ± 0.095 | 0.103 ± 0.098 | 0.414 ± 0.191 | 0.0 ± 0.0 |
| Asp | 3.309 ± 0.587 | 0.517 ± 0.33 | 3.205 ± 0.544 | 5.273 ± 0.908 | 2.585 ± 0.462 | 4.653 ± 1.011 | 0.724 ± 0.245 | 6.514 ± 0.658 | 4.653 ± 0.608 | 4.859 ± 0.608 | 1.551 ± 0.418 | 3.102 ± 0.498 | 1.758 ± 0.476 | 1.241 ± 0.264 | 2.481 ± 0.45 | 3.722 ± 0.555 | 2.585 ± 0.491 | 3.619 ± 0.6 | 0.931 ± 0.315 | 3.102 ± 0.546 | 0.0 ± 0.0 |
| Glu | 4.963 ± 0.715 | 0.0 ± 0.0 | 3.929 ± 0.858 | 7.341 ± 1.0 | 3.309 ± 0.649 | 3.309 ± 0.555 | 0.414 ± 0.235 | 7.031 ± 0.91 | 6.203 ± 0.904 | 9.305 ± 1.29 | 2.792 ± 0.639 | 5.17 ± 0.602 | 1.758 ± 0.489 | 4.653 ± 0.855 | 3.309 ± 0.569 | 3.929 ± 0.64 | 4.239 ± 1.061 | 5.273 ± 0.756 | 1.344 ± 0.423 | 2.481 ± 0.372 | 0.0 ± 0.0 |
| Phe | 2.275 ± 0.412 | 0.207 ± 0.124 | 2.378 ± 0.378 | 2.688 ± 0.535 | 1.551 ± 0.471 | 3.825 ± 0.545 | 0.31 ± 0.17 | 4.653 ± 0.732 | 4.239 ± 0.599 | 2.792 ± 0.557 | 1.241 ± 0.362 | 2.792 ± 0.539 | 1.344 ± 0.425 | 1.137 ± 0.359 | 1.654 ± 0.331 | 2.585 ± 0.635 | 1.861 ± 0.462 | 2.481 ± 0.546 | 0.62 ± 0.249 | 1.137 ± 0.383 | 0.0 ± 0.0 |
| Gly | 4.342 ± 0.953 | 0.31 ± 0.23 | 4.549 ± 0.719 | 4.032 ± 0.511 | 1.964 ± 0.555 | 3.515 ± 0.637 | 0.724 ± 0.285 | 4.239 ± 0.751 | 5.066 ± 0.543 | 5.583 ± 0.891 | 1.034 ± 0.328 | 3.102 ± 0.603 | 0.724 ± 0.288 | 1.447 ± 0.385 | 2.068 ± 0.427 | 4.136 ± 0.727 | 3.929 ± 0.445 | 4.136 ± 0.64 | 1.344 ± 0.509 | 2.998 ± 0.565 | 0.0 ± 0.0 |
| His | 0.414 ± 0.185 | 0.103 ± 0.095 | 0.931 ± 0.316 | 0.931 ± 0.296 | 1.654 ± 0.331 | 0.414 ± 0.197 | 0.207 ± 0.136 | 1.447 ± 0.429 | 0.724 ± 0.289 | 1.137 ± 0.292 | 0.31 ± 0.172 | 1.034 ± 0.33 | 0.414 ± 0.237 | 0.62 ± 0.198 | 0.517 ± 0.215 | 1.034 ± 0.318 | 0.931 ± 0.299 | 1.137 ± 0.415 | 0.0 ± 0.0 | 0.31 ± 0.145 | 0.0 ± 0.0 |
| Ile | 6.41 ± 0.596 | 0.517 ± 0.31 | 5.687 ± 0.733 | 4.756 ± 0.705 | 2.275 ± 0.426 | 3.722 ± 0.55 | 0.931 ± 0.268 | 5.17 ± 0.943 | 7.444 ± 1.0 | 5.48 ± 0.713 | 1.344 ± 0.316 | 4.859 ± 0.62 | 2.481 ± 0.47 | 3.619 ± 0.555 | 2.998 ± 0.517 | 5.583 ± 0.943 | 5.17 ± 0.764 | 4.032 ± 0.763 | 0.0 ± 0.0 | 2.998 ± 0.694 | 0.0 ± 0.0 |
| Lys | 5.583 ± 0.999 | 0.103 ± 0.095 | 5.79 ± 0.656 | 8.375 ± 1.098 | 2.792 ± 0.438 | 3.619 ± 0.66 | 1.447 ± 0.424 | 4.342 ± 0.665 | 7.341 ± 0.952 | 5.79 ± 0.656 | 2.171 ± 0.407 | 5.583 ± 0.626 | 3.515 ± 0.61 | 3.929 ± 0.531 | 5.273 ± 1.034 | 4.653 ± 0.706 | 5.48 ± 0.78 | 4.032 ± 0.577 | 1.447 ± 0.447 | 2.481 ± 0.592 | 0.0 ± 0.0 |
| Leu | 5.893 ± 0.918 | 0.931 ± 0.443 | 5.376 ± 0.701 | 8.375 ± 1.472 | 4.239 ± 0.664 | 6.203 ± 0.775 | 1.137 ± 0.361 | 4.859 ± 0.698 | 9.409 ± 0.819 | 6.307 ± 0.904 | 2.171 ± 0.592 | 4.342 ± 0.505 | 2.275 ± 0.532 | 3.102 ± 0.537 | 4.239 ± 0.651 | 4.549 ± 0.617 | 5.583 ± 0.662 | 5.376 ± 0.84 | 0.931 ± 0.259 | 2.585 ± 0.456 | 0.0 ± 0.0 |
| Met | 1.447 ± 0.291 | 0.0 ± 0.0 | 1.447 ± 0.338 | 2.068 ± 0.472 | 1.137 ± 0.38 | 0.827 ± 0.241 | 0.103 ± 0.098 | 1.964 ± 0.431 | 1.758 ± 0.348 | 1.861 ± 0.407 | 0.517 ± 0.199 | 2.481 ± 0.522 | 0.414 ± 0.206 | 0.724 ± 0.241 | 1.241 ± 0.369 | 2.275 ± 0.384 | 1.964 ± 0.512 | 1.447 ± 0.391 | 0.31 ± 0.18 | 0.31 ± 0.165 | 0.0 ± 0.0 |
| Asn | 2.688 ± 0.707 | 0.31 ± 0.176 | 2.998 ± 0.581 | 4.342 ± 0.48 | 1.964 ± 0.384 | 5.687 ± 0.778 | 1.551 ± 0.553 | 2.998 ± 0.437 | 6.1 ± 0.689 | 5.79 ± 0.8 | 0.931 ± 0.438 | 2.895 ± 0.593 | 2.998 ± 0.685 | 3.309 ± 0.669 | 1.758 ± 0.45 | 2.895 ± 0.597 | 3.619 ± 0.39 | 3.929 ± 0.659 | 0.724 ± 0.264 | 1.344 ± 0.376 | 0.0 ± 0.0 |
| Pro | 1.758 ± 0.441 | 0.414 ± 0.217 | 1.758 ± 0.699 | 1.964 ± 0.567 | 0.827 ± 0.298 | 1.447 ± 0.389 | 0.724 ± 0.238 | 2.378 ± 0.481 | 1.861 ± 0.423 | 2.895 ± 0.44 | 0.31 ± 0.194 | 0.724 ± 0.278 | 0.517 ± 0.258 | 1.654 ± 0.371 | 0.62 ± 0.226 | 2.171 ± 0.486 | 2.275 ± 0.51 | 1.964 ± 0.362 | 0.414 ± 0.202 | 1.241 ± 0.394 | 0.0 ± 0.0 |
| Gln | 4.859 ± 0.628 | 0.103 ± 0.099 | 1.551 ± 0.422 | 4.446 ± 0.82 | 2.171 ± 0.506 | 1.447 ± 0.334 | 0.517 ± 0.208 | 2.585 ± 0.457 | 2.792 ± 0.368 | 4.136 ± 0.665 | 1.344 ± 0.441 | 2.792 ± 0.545 | 0.827 ± 0.274 | 2.068 ± 0.432 | 1.964 ± 0.478 | 2.585 ± 0.572 | 1.861 ± 0.444 | 2.585 ± 0.365 | 0.31 ± 0.149 | 1.034 ± 0.31 | 0.0 ± 0.0 |
| Arg | 2.481 ± 0.599 | 0.31 ± 0.229 | 2.171 ± 0.576 | 3.102 ± 0.692 | 1.447 ± 0.378 | 2.481 ± 0.575 | 0.517 ± 0.249 | 3.412 ± 0.764 | 4.342 ± 0.893 | 3.825 ± 0.653 | 0.724 ± 0.256 | 2.998 ± 0.608 | 1.137 ± 0.353 | 1.758 ± 0.448 | 2.275 ± 0.525 | 1.861 ± 0.4 | 1.758 ± 0.433 | 2.378 ± 0.408 | 0.827 ± 0.303 | 2.378 ± 0.663 | 0.0 ± 0.0 |
| Ser | 4.342 ± 0.829 | 0.31 ± 0.157 | 3.825 ± 0.823 | 4.963 ± 0.714 | 4.136 ± 0.614 | 3.825 ± 0.621 | 0.62 ± 0.205 | 3.412 ± 0.585 | 3.722 ± 0.391 | 4.136 ± 0.533 | 1.964 ± 0.536 | 3.619 ± 0.593 | 2.068 ± 0.418 | 2.998 ± 0.589 | 2.481 ± 0.432 | 3.929 ± 0.939 | 3.102 ± 0.625 | 4.239 ± 0.986 | 0.827 ± 0.244 | 2.792 ± 0.462 | 0.0 ± 0.0 |
| Thr | 4.549 ± 0.636 | 0.31 ± 0.191 | 3.102 ± 0.641 | 3.309 ± 0.597 | 2.585 ± 0.579 | 4.446 ± 0.552 | 1.034 ± 0.279 | 5.48 ± 0.907 | 4.136 ± 0.536 | 5.17 ± 0.752 | 0.931 ± 0.349 | 2.792 ± 0.38 | 0.931 ± 0.293 | 2.378 ± 0.632 | 2.481 ± 0.559 | 2.688 ± 0.57 | 3.722 ± 0.603 | 4.756 ± 0.841 | 0.62 ± 0.199 | 1.861 ± 0.434 | 0.0 ± 0.0 |
| Val | 3.722 ± 0.605 | 0.207 ± 0.136 | 3.825 ± 0.564 | 4.859 ± 0.836 | 2.068 ± 0.429 | 3.515 ± 0.51 | 1.344 ± 0.355 | 4.342 ± 0.646 | 5.376 ± 0.961 | 5.893 ± 0.922 | 1.034 ± 0.333 | 3.515 ± 0.662 | 1.344 ± 0.375 | 2.378 ± 0.508 | 2.068 ± 0.463 | 5.376 ± 0.806 | 4.653 ± 0.59 | 4.342 ± 0.682 | 0.724 ± 0.481 | 1.447 ± 0.379 | 0.0 ± 0.0 |
| Trp | 0.517 ± 0.223 | 0.207 ± 0.17 | 0.414 ± 0.181 | 1.034 ± 0.325 | 0.62 ± 0.202 | 0.414 ± 0.212 | 0.207 ± 0.13 | 0.827 ± 0.286 | 1.137 ± 0.33 | 1.034 ± 0.288 | 0.207 ± 0.139 | 1.447 ± 0.403 | 0.31 ± 0.176 | 0.62 ± 0.248 | 0.517 ± 0.234 | 0.827 ± 0.229 | 0.517 ± 0.181 | 0.827 ± 0.302 | 0.31 ± 0.196 | 0.724 ± 0.48 | 0.0 ± 0.0 |
| Tyr | 2.068 ± 0.485 | 0.414 ± 0.238 | 2.171 ± 0.457 | 3.309 ± 0.595 | 2.275 ± 0.58 | 2.171 ± 0.569 | 0.62 ± 0.244 | 2.378 ± 0.504 | 2.688 ± 0.604 | 3.619 ± 0.583 | 0.724 ± 0.252 | 1.964 ± 0.502 | 1.758 ± 0.484 | 1.344 ± 0.346 | 1.654 ± 0.329 | 2.275 ± 0.479 | 0.931 ± 0.27 | 1.551 ± 0.334 | 0.31 ± 0.195 | 2.585 ± 0.806 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 54 proteins (9673 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
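One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement and the frequency is recomputed for each resample. The sketch below follows that reading; it is an assumption, not the database's documented procedure. The function names, the fixed seed, and the use of the sample standard deviation of the resampled frequencies as the error are all choices made for this illustration.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins.

    Each resample draws len(proteins) proteins with replacement; the error is
    taken as the standard deviation of the resampled frequencies (assumption).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKLLEK", "MLKAAL", "MEELKK"]  # toy sequences, not real data
    print(dipeptide_freq(toy, "LK"), bootstrap_error(toy, "LK"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so a dipeptide concentrated in a few proteins produces a correspondingly larger error.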

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski