Amino acid dipepetide frequency for Streptococcus phage P7573

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.1AlaAla: 3.1 ± 0.88
0.172AlaCys: 0.172 ± 0.109
4.306AlaAsp: 4.306 ± 0.696
3.703AlaGlu: 3.703 ± 0.547
1.722AlaPhe: 1.722 ± 0.451
3.789AlaGly: 3.789 ± 0.654
0.775AlaHis: 0.775 ± 0.307
4.392AlaIle: 4.392 ± 0.772
5.512AlaLys: 5.512 ± 0.961
6.373AlaLeu: 6.373 ± 0.621
1.206AlaMet: 1.206 ± 0.303
4.134AlaAsn: 4.134 ± 0.969
1.808AlaPro: 1.808 ± 0.397
1.722AlaGln: 1.722 ± 0.452
2.411AlaArg: 2.411 ± 0.454
4.306AlaSer: 4.306 ± 0.688
4.306AlaThr: 4.306 ± 0.753
3.445AlaVal: 3.445 ± 0.603
0.947AlaTrp: 0.947 ± 0.251
2.411AlaTyr: 2.411 ± 0.471
0.0AlaXaa: 0.0 ± 0.0
Cys
0.172CysAla: 0.172 ± 0.104
0.172CysCys: 0.172 ± 0.129
0.689CysAsp: 0.689 ± 0.259
0.258CysGlu: 0.258 ± 0.136
0.086CysPhe: 0.086 ± 0.079
0.431CysGly: 0.431 ± 0.222
0.172CysHis: 0.172 ± 0.128
0.258CysIle: 0.258 ± 0.16
0.517CysLys: 0.517 ± 0.246
0.431CysLeu: 0.431 ± 0.177
0.0CysMet: 0.0 ± 0.0
0.517CysAsn: 0.517 ± 0.225
0.344CysPro: 0.344 ± 0.239
0.258CysGln: 0.258 ± 0.152
0.775CysArg: 0.775 ± 0.424
0.344CysSer: 0.344 ± 0.277
0.344CysThr: 0.344 ± 0.203
0.344CysVal: 0.344 ± 0.146
0.172CysTrp: 0.172 ± 0.117
0.172CysTyr: 0.172 ± 0.129
0.0CysXaa: 0.0 ± 0.0
Asp
3.272AspAla: 3.272 ± 0.501
0.258AspCys: 0.258 ± 0.166
4.564AspAsp: 4.564 ± 0.712
3.703AspGlu: 3.703 ± 0.702
3.875AspPhe: 3.875 ± 0.545
7.751AspGly: 7.751 ± 1.613
0.861AspHis: 0.861 ± 0.327
3.789AspIle: 3.789 ± 0.602
5.425AspLys: 5.425 ± 0.576
3.961AspLeu: 3.961 ± 0.706
2.153AspMet: 2.153 ± 0.484
3.531AspAsn: 3.531 ± 0.633
1.895AspPro: 1.895 ± 0.351
1.55AspGln: 1.55 ± 0.311
2.584AspArg: 2.584 ± 0.407
3.703AspSer: 3.703 ± 0.545
3.703AspThr: 3.703 ± 0.569
4.134AspVal: 4.134 ± 0.703
1.033AspTrp: 1.033 ± 0.261
2.928AspTyr: 2.928 ± 0.528
0.0AspXaa: 0.0 ± 0.0
Glu
4.306GluAla: 4.306 ± 0.577
0.258GluCys: 0.258 ± 0.145
2.497GluAsp: 2.497 ± 0.464
4.392GluGlu: 4.392 ± 0.749
2.239GluPhe: 2.239 ± 0.523
3.014GluGly: 3.014 ± 0.394
1.033GluHis: 1.033 ± 0.295
6.631GluIle: 6.631 ± 0.743
5.598GluLys: 5.598 ± 1.141
5.942GluLeu: 5.942 ± 0.842
2.411GluMet: 2.411 ± 0.424
4.478GluAsn: 4.478 ± 0.804
1.55GluPro: 1.55 ± 0.537
2.67GluGln: 2.67 ± 0.391
3.272GluArg: 3.272 ± 0.643
2.928GluSer: 2.928 ± 0.443
3.272GluThr: 3.272 ± 0.472
4.478GluVal: 4.478 ± 0.643
0.861GluTrp: 0.861 ± 0.253
3.617GluTyr: 3.617 ± 0.597
0.0GluXaa: 0.0 ± 0.0
Phe
2.928PheAla: 2.928 ± 0.608
0.603PheCys: 0.603 ± 0.242
3.445PheAsp: 3.445 ± 0.611
2.325PheGlu: 2.325 ± 0.485
1.722PhePhe: 1.722 ± 0.348
2.842PheGly: 2.842 ± 0.442
0.344PheHis: 0.344 ± 0.14
1.895PheIle: 1.895 ± 0.495
4.048PheLys: 4.048 ± 0.558
3.789PheLeu: 3.789 ± 0.685
0.517PheMet: 0.517 ± 0.215
3.617PheAsn: 3.617 ± 0.723
0.517PhePro: 0.517 ± 0.194
0.947PheGln: 0.947 ± 0.273
1.808PheArg: 1.808 ± 0.348
3.359PheSer: 3.359 ± 0.499
2.067PheThr: 2.067 ± 0.537
2.928PheVal: 2.928 ± 0.587
0.517PheTrp: 0.517 ± 0.222
1.378PheTyr: 1.378 ± 0.307
0.0PheXaa: 0.0 ± 0.0
Gly
3.617GlyAla: 3.617 ± 0.66
0.517GlyCys: 0.517 ± 0.229
4.392GlyAsp: 4.392 ± 0.584
3.445GlyGlu: 3.445 ± 0.594
2.928GlyPhe: 2.928 ± 0.492
3.961GlyGly: 3.961 ± 0.79
0.775GlyHis: 0.775 ± 0.271
5.77GlyIle: 5.77 ± 0.925
6.976GlyLys: 6.976 ± 0.717
6.2GlyLeu: 6.2 ± 0.762
1.55GlyMet: 1.55 ± 0.321
4.478GlyAsn: 4.478 ± 0.761
1.206GlyPro: 1.206 ± 0.487
2.928GlyGln: 2.928 ± 0.468
2.928GlyArg: 2.928 ± 0.543
4.65GlySer: 4.65 ± 0.638
4.564GlyThr: 4.564 ± 0.762
3.272GlyVal: 3.272 ± 0.67
1.378GlyTrp: 1.378 ± 0.352
3.186GlyTyr: 3.186 ± 0.589
0.0GlyXaa: 0.0 ± 0.0
His
0.258HisAla: 0.258 ± 0.138
0.0HisCys: 0.0 ± 0.0
1.033HisAsp: 1.033 ± 0.267
0.689HisGlu: 0.689 ± 0.272
0.517HisPhe: 0.517 ± 0.154
0.947HisGly: 0.947 ± 0.273
0.431HisHis: 0.431 ± 0.172
1.206HisIle: 1.206 ± 0.333
0.947HisLys: 0.947 ± 0.298
0.947HisLeu: 0.947 ± 0.267
0.258HisMet: 0.258 ± 0.128
0.517HisAsn: 0.517 ± 0.228
0.775HisPro: 0.775 ± 0.22
0.861HisGln: 0.861 ± 0.301
0.775HisArg: 0.775 ± 0.233
0.775HisSer: 0.775 ± 0.241
0.603HisThr: 0.603 ± 0.185
1.808HisVal: 1.808 ± 0.297
0.258HisTrp: 0.258 ± 0.14
0.775HisTyr: 0.775 ± 0.324
0.0HisXaa: 0.0 ± 0.0
Ile
4.823IleAla: 4.823 ± 0.807
0.344IleCys: 0.344 ± 0.18
5.253IleAsp: 5.253 ± 0.677
5.253IleGlu: 5.253 ± 0.768
1.808IlePhe: 1.808 ± 0.432
4.306IleGly: 4.306 ± 0.595
0.947IleHis: 0.947 ± 0.276
3.875IleIle: 3.875 ± 0.82
6.2IleLys: 6.2 ± 0.619
3.617IleLeu: 3.617 ± 0.62
1.808IleMet: 1.808 ± 0.528
4.22IleAsn: 4.22 ± 0.535
3.445IlePro: 3.445 ± 0.536
2.497IleGln: 2.497 ± 0.364
3.703IleArg: 3.703 ± 0.545
4.564IleSer: 4.564 ± 0.616
3.1IleThr: 3.1 ± 0.582
3.531IleVal: 3.531 ± 0.501
1.033IleTrp: 1.033 ± 0.252
2.239IleTyr: 2.239 ± 0.475
0.0IleXaa: 0.0 ± 0.0
Lys
5.598LysAla: 5.598 ± 0.488
0.431LysCys: 0.431 ± 0.221
4.736LysAsp: 4.736 ± 0.582
7.148LysGlu: 7.148 ± 0.896
3.445LysPhe: 3.445 ± 0.741
6.287LysGly: 6.287 ± 0.793
1.206LysHis: 1.206 ± 0.373
5.598LysIle: 5.598 ± 0.788
7.406LysLys: 7.406 ± 1.319
6.717LysLeu: 6.717 ± 0.801
2.153LysMet: 2.153 ± 0.429
4.995LysAsn: 4.995 ± 0.569
3.1LysPro: 3.1 ± 0.436
3.961LysGln: 3.961 ± 0.57
3.617LysArg: 3.617 ± 0.622
4.048LysSer: 4.048 ± 0.57
5.339LysThr: 5.339 ± 0.703
4.736LysVal: 4.736 ± 0.705
1.206LysTrp: 1.206 ± 0.258
3.703LysTyr: 3.703 ± 0.837
0.0LysXaa: 0.0 ± 0.0
Leu
6.2LeuAla: 6.2 ± 0.717
0.431LeuCys: 0.431 ± 0.188
5.081LeuAsp: 5.081 ± 0.839
6.631LeuGlu: 6.631 ± 0.921
3.014LeuPhe: 3.014 ± 0.498
5.081LeuGly: 5.081 ± 0.811
0.947LeuHis: 0.947 ± 0.281
4.478LeuIle: 4.478 ± 0.591
7.406LeuLys: 7.406 ± 0.784
5.512LeuLeu: 5.512 ± 0.688
2.325LeuMet: 2.325 ± 0.385
5.339LeuAsn: 5.339 ± 0.733
2.756LeuPro: 2.756 ± 0.437
3.014LeuGln: 3.014 ± 0.51
3.359LeuArg: 3.359 ± 0.805
5.339LeuSer: 5.339 ± 0.761
5.684LeuThr: 5.684 ± 0.695
4.048LeuVal: 4.048 ± 0.57
0.603LeuTrp: 0.603 ± 0.299
2.239LeuTyr: 2.239 ± 0.392
0.0LeuXaa: 0.0 ± 0.0
Met
2.325MetAla: 2.325 ± 0.432
0.172MetCys: 0.172 ± 0.116
1.12MetAsp: 1.12 ± 0.223
1.033MetGlu: 1.033 ± 0.424
1.292MetPhe: 1.292 ± 0.268
1.12MetGly: 1.12 ± 0.292
0.172MetHis: 0.172 ± 0.143
1.464MetIle: 1.464 ± 0.314
2.325MetLys: 2.325 ± 0.546
1.722MetLeu: 1.722 ± 0.29
0.689MetMet: 0.689 ± 0.188
2.067MetAsn: 2.067 ± 0.381
0.861MetPro: 0.861 ± 0.239
0.947MetGln: 0.947 ± 0.244
0.947MetArg: 0.947 ± 0.258
2.239MetSer: 2.239 ± 0.435
1.464MetThr: 1.464 ± 0.352
1.808MetVal: 1.808 ± 0.353
0.086MetTrp: 0.086 ± 0.068
1.206MetTyr: 1.206 ± 0.324
0.0MetXaa: 0.0 ± 0.0
Asn
4.22AsnAla: 4.22 ± 1.034
0.431AsnCys: 0.431 ± 0.219
4.134AsnAsp: 4.134 ± 0.357
3.875AsnGlu: 3.875 ± 0.642
1.808AsnPhe: 1.808 ± 0.508
6.459AsnGly: 6.459 ± 1.108
1.292AsnHis: 1.292 ± 0.273
3.703AsnIle: 3.703 ± 0.445
4.65AsnLys: 4.65 ± 0.655
4.995AsnLeu: 4.995 ± 0.535
1.292AsnMet: 1.292 ± 0.372
3.875AsnAsn: 3.875 ± 0.661
2.756AsnPro: 2.756 ± 0.499
2.756AsnGln: 2.756 ± 0.443
2.325AsnArg: 2.325 ± 0.442
4.048AsnSer: 4.048 ± 0.475
3.703AsnThr: 3.703 ± 0.678
3.531AsnVal: 3.531 ± 0.348
1.464AsnTrp: 1.464 ± 0.306
2.497AsnTyr: 2.497 ± 0.51
0.0AsnXaa: 0.0 ± 0.0
Pro
1.378ProAla: 1.378 ± 0.296
0.172ProCys: 0.172 ± 0.181
1.378ProAsp: 1.378 ± 0.4
2.497ProGlu: 2.497 ± 0.517
1.464ProPhe: 1.464 ± 0.346
1.378ProGly: 1.378 ± 0.475
0.344ProHis: 0.344 ± 0.157
1.808ProIle: 1.808 ± 0.352
3.359ProLys: 3.359 ± 0.56
2.411ProLeu: 2.411 ± 0.419
0.431ProMet: 0.431 ± 0.191
2.842ProAsn: 2.842 ± 0.479
0.689ProPro: 0.689 ± 0.35
1.464ProGln: 1.464 ± 0.307
1.12ProArg: 1.12 ± 0.401
2.497ProSer: 2.497 ± 0.457
2.239ProThr: 2.239 ± 0.328
1.378ProVal: 1.378 ± 0.486
0.517ProTrp: 0.517 ± 0.186
0.861ProTyr: 0.861 ± 0.297
0.0ProXaa: 0.0 ± 0.0
Gln
2.67GlnAla: 2.67 ± 0.557
0.172GlnCys: 0.172 ± 0.129
1.981GlnAsp: 1.981 ± 0.448
3.1GlnGlu: 3.1 ± 0.616
1.292GlnPhe: 1.292 ± 0.392
3.445GlnGly: 3.445 ± 0.816
0.689GlnHis: 0.689 ± 0.245
2.497GlnIle: 2.497 ± 0.586
3.186GlnLys: 3.186 ± 0.5
3.359GlnLeu: 3.359 ± 0.475
1.464GlnMet: 1.464 ± 0.369
2.153GlnAsn: 2.153 ± 0.498
0.258GlnPro: 0.258 ± 0.125
2.584GlnGln: 2.584 ± 0.467
1.464GlnArg: 1.464 ± 0.349
2.584GlnSer: 2.584 ± 0.52
2.497GlnThr: 2.497 ± 0.473
2.067GlnVal: 2.067 ± 0.499
0.431GlnTrp: 0.431 ± 0.192
2.411GlnTyr: 2.411 ± 0.495
0.0GlnXaa: 0.0 ± 0.0
Arg
1.722ArgAla: 1.722 ± 0.303
0.775ArgCys: 0.775 ± 0.431
2.928ArgAsp: 2.928 ± 0.358
2.928ArgGlu: 2.928 ± 0.565
1.895ArgPhe: 1.895 ± 0.4
2.153ArgGly: 2.153 ± 0.335
0.861ArgHis: 0.861 ± 0.287
3.272ArgIle: 3.272 ± 0.702
3.186ArgLys: 3.186 ± 0.579
3.875ArgLeu: 3.875 ± 0.552
1.292ArgMet: 1.292 ± 0.333
3.014ArgAsn: 3.014 ± 0.47
1.12ArgPro: 1.12 ± 0.26
1.636ArgGln: 1.636 ± 0.384
1.895ArgArg: 1.895 ± 0.525
1.895ArgSer: 1.895 ± 0.44
3.014ArgThr: 3.014 ± 0.716
3.014ArgVal: 3.014 ± 0.55
1.206ArgTrp: 1.206 ± 0.269
2.325ArgTyr: 2.325 ± 0.563
0.0ArgXaa: 0.0 ± 0.0
Ser
2.928SerAla: 2.928 ± 0.508
0.517SerCys: 0.517 ± 0.202
4.306SerAsp: 4.306 ± 0.693
3.789SerGlu: 3.789 ± 0.673
3.789SerPhe: 3.789 ± 0.692
4.736SerGly: 4.736 ± 0.645
0.861SerHis: 0.861 ± 0.275
4.134SerIle: 4.134 ± 0.617
5.339SerLys: 5.339 ± 0.798
3.703SerLeu: 3.703 ± 0.53
2.067SerMet: 2.067 ± 0.374
4.736SerAsn: 4.736 ± 0.646
1.895SerPro: 1.895 ± 0.474
3.014SerGln: 3.014 ± 0.575
2.928SerArg: 2.928 ± 0.528
3.703SerSer: 3.703 ± 0.555
4.306SerThr: 4.306 ± 0.606
5.339SerVal: 5.339 ± 0.691
0.861SerTrp: 0.861 ± 0.286
1.55SerTyr: 1.55 ± 0.412
0.0SerXaa: 0.0 ± 0.0
Thr
3.531ThrAla: 3.531 ± 0.561
0.344ThrCys: 0.344 ± 0.18
3.789ThrAsp: 3.789 ± 0.616
3.445ThrGlu: 3.445 ± 0.434
3.359ThrPhe: 3.359 ± 0.58
4.306ThrGly: 4.306 ± 0.777
0.947ThrHis: 0.947 ± 0.274
4.736ThrIle: 4.736 ± 0.902
4.564ThrLys: 4.564 ± 0.627
6.373ThrLeu: 6.373 ± 0.768
1.12ThrMet: 1.12 ± 0.331
3.875ThrAsn: 3.875 ± 0.718
2.153ThrPro: 2.153 ± 0.52
2.67ThrGln: 2.67 ± 0.489
1.808ThrArg: 1.808 ± 0.37
3.445ThrSer: 3.445 ± 0.486
3.531ThrThr: 3.531 ± 0.591
4.736ThrVal: 4.736 ± 0.75
0.775ThrTrp: 0.775 ± 0.236
2.756ThrTyr: 2.756 ± 0.503
0.0ThrXaa: 0.0 ± 0.0
Val
3.875ValAla: 3.875 ± 0.718
0.258ValCys: 0.258 ± 0.13
5.512ValAsp: 5.512 ± 0.72
3.617ValGlu: 3.617 ± 0.609
2.756ValPhe: 2.756 ± 0.54
4.134ValGly: 4.134 ± 0.457
0.603ValHis: 0.603 ± 0.18
3.961ValIle: 3.961 ± 0.661
4.995ValLys: 4.995 ± 0.645
4.564ValLeu: 4.564 ± 0.713
1.033ValMet: 1.033 ± 0.276
3.1ValAsn: 3.1 ± 0.544
1.895ValPro: 1.895 ± 0.335
1.808ValGln: 1.808 ± 0.348
2.842ValArg: 2.842 ± 0.542
5.512ValSer: 5.512 ± 0.907
5.081ValThr: 5.081 ± 0.727
3.703ValVal: 3.703 ± 0.523
0.775ValTrp: 0.775 ± 0.244
1.636ValTyr: 1.636 ± 0.346
0.0ValXaa: 0.0 ± 0.0
Trp
0.861TrpAla: 0.861 ± 0.18
0.172TrpCys: 0.172 ± 0.118
1.206TrpAsp: 1.206 ± 0.487
0.947TrpGlu: 0.947 ± 0.214
0.775TrpPhe: 0.775 ± 0.242
0.689TrpGly: 0.689 ± 0.265
0.258TrpHis: 0.258 ± 0.127
0.689TrpIle: 0.689 ± 0.244
0.861TrpLys: 0.861 ± 0.212
1.55TrpLeu: 1.55 ± 0.321
0.344TrpMet: 0.344 ± 0.147
0.775TrpAsn: 0.775 ± 0.3
0.086TrpPro: 0.086 ± 0.105
0.689TrpGln: 0.689 ± 0.255
0.861TrpArg: 0.861 ± 0.206
1.378TrpSer: 1.378 ± 0.511
1.033TrpThr: 1.033 ± 0.358
0.861TrpVal: 0.861 ± 0.217
0.258TrpTrp: 0.258 ± 0.156
0.258TrpTyr: 0.258 ± 0.152
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.756TyrAla: 2.756 ± 0.383
0.258TyrCys: 0.258 ± 0.24
2.325TyrAsp: 2.325 ± 0.366
2.67TyrGlu: 2.67 ± 0.499
1.808TyrPhe: 1.808 ± 0.364
1.895TyrGly: 1.895 ± 0.468
0.861TyrHis: 0.861 ± 0.257
2.497TyrIle: 2.497 ± 0.602
2.928TyrLys: 2.928 ± 0.477
3.445TyrLeu: 3.445 ± 0.546
0.861TyrMet: 0.861 ± 0.235
1.55TyrAsn: 1.55 ± 0.425
1.12TyrPro: 1.12 ± 0.387
2.325TyrGln: 2.325 ± 0.341
2.584TyrArg: 2.584 ± 0.65
3.186TyrSer: 3.186 ± 0.71
2.411TyrThr: 2.411 ± 0.387
2.497TyrVal: 2.497 ± 0.449
0.172TyrTrp: 0.172 ± 0.125
2.153TyrTyr: 2.153 ± 0.495
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (11613 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski