Amino acid dipepetide frequency for Streptococcus phage P738

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.0AlaAla: 2.0 ± 0.677
0.286AlaCys: 0.286 ± 0.153
3.524AlaAsp: 3.524 ± 0.53
5.429AlaGlu: 5.429 ± 1.11
2.762AlaPhe: 2.762 ± 0.465
3.905AlaGly: 3.905 ± 0.805
0.571AlaHis: 0.571 ± 0.184
4.476AlaIle: 4.476 ± 0.88
5.619AlaLys: 5.619 ± 1.217
5.143AlaLeu: 5.143 ± 0.766
1.81AlaMet: 1.81 ± 0.523
4.952AlaAsn: 4.952 ± 0.689
1.524AlaPro: 1.524 ± 0.387
1.333AlaGln: 1.333 ± 0.268
2.952AlaArg: 2.952 ± 0.468
3.905AlaSer: 3.905 ± 0.707
4.762AlaThr: 4.762 ± 0.819
4.286AlaVal: 4.286 ± 0.765
0.476AlaTrp: 0.476 ± 0.186
2.571AlaTyr: 2.571 ± 0.428
0.0AlaXaa: 0.0 ± 0.0
Cys
0.095CysAla: 0.095 ± 0.107
0.0CysCys: 0.0 ± 0.0
0.571CysAsp: 0.571 ± 0.202
0.286CysGlu: 0.286 ± 0.165
0.095CysPhe: 0.095 ± 0.1
0.19CysGly: 0.19 ± 0.152
0.19CysHis: 0.19 ± 0.136
0.286CysIle: 0.286 ± 0.167
0.19CysLys: 0.19 ± 0.152
0.381CysLeu: 0.381 ± 0.194
0.0CysMet: 0.0 ± 0.0
0.286CysAsn: 0.286 ± 0.156
0.095CysPro: 0.095 ± 0.099
0.0CysGln: 0.0 ± 0.0
0.19CysArg: 0.19 ± 0.181
0.286CysSer: 0.286 ± 0.161
0.095CysThr: 0.095 ± 0.106
0.476CysVal: 0.476 ± 0.256
0.095CysTrp: 0.095 ± 0.092
0.571CysTyr: 0.571 ± 0.354
0.0CysXaa: 0.0 ± 0.0
Asp
2.286AspAla: 2.286 ± 0.505
0.476AspCys: 0.476 ± 0.259
2.762AspAsp: 2.762 ± 0.441
4.286AspGlu: 4.286 ± 0.812
3.714AspPhe: 3.714 ± 0.539
5.333AspGly: 5.333 ± 0.898
0.381AspHis: 0.381 ± 0.147
5.619AspIle: 5.619 ± 0.663
5.619AspLys: 5.619 ± 0.752
3.81AspLeu: 3.81 ± 0.565
1.238AspMet: 1.238 ± 0.37
5.429AspAsn: 5.429 ± 0.814
1.048AspPro: 1.048 ± 0.268
0.857AspGln: 0.857 ± 0.268
2.476AspArg: 2.476 ± 0.407
4.476AspSer: 4.476 ± 0.546
3.333AspThr: 3.333 ± 0.616
2.857AspVal: 2.857 ± 0.499
0.476AspTrp: 0.476 ± 0.228
3.143AspTyr: 3.143 ± 0.651
0.0AspXaa: 0.0 ± 0.0
Glu
6.19GluAla: 6.19 ± 0.958
0.19GluCys: 0.19 ± 0.136
5.238GluAsp: 5.238 ± 0.762
9.714GluGlu: 9.714 ± 1.324
3.714GluPhe: 3.714 ± 0.591
4.476GluGly: 4.476 ± 0.614
1.238GluHis: 1.238 ± 0.429
5.143GluIle: 5.143 ± 0.76
6.667GluLys: 6.667 ± 1.157
7.429GluLeu: 7.429 ± 0.98
2.667GluMet: 2.667 ± 0.636
3.905GluAsn: 3.905 ± 0.599
0.667GluPro: 0.667 ± 0.225
2.19GluGln: 2.19 ± 0.525
3.714GluArg: 3.714 ± 0.662
3.048GluSer: 3.048 ± 0.446
4.381GluThr: 4.381 ± 0.581
5.81GluVal: 5.81 ± 0.791
1.238GluTrp: 1.238 ± 0.383
4.19GluTyr: 4.19 ± 0.686
0.0GluXaa: 0.0 ± 0.0
Phe
3.143PheAla: 3.143 ± 0.719
0.286PheCys: 0.286 ± 0.143
3.238PheAsp: 3.238 ± 0.526
4.381PheGlu: 4.381 ± 0.721
1.905PhePhe: 1.905 ± 0.395
3.143PheGly: 3.143 ± 0.632
0.571PheHis: 0.571 ± 0.194
2.19PheIle: 2.19 ± 0.402
4.952PheLys: 4.952 ± 0.605
2.0PheLeu: 2.0 ± 0.411
1.238PheMet: 1.238 ± 0.416
3.143PheAsn: 3.143 ± 0.553
1.048PhePro: 1.048 ± 0.239
1.333PheGln: 1.333 ± 0.277
1.524PheArg: 1.524 ± 0.418
2.286PheSer: 2.286 ± 0.439
3.714PheThr: 3.714 ± 0.752
1.81PheVal: 1.81 ± 0.494
0.476PheTrp: 0.476 ± 0.187
2.571PheTyr: 2.571 ± 0.556
0.0PheXaa: 0.0 ± 0.0
Gly
3.714GlyAla: 3.714 ± 0.999
0.095GlyCys: 0.095 ± 0.083
2.476GlyAsp: 2.476 ± 0.416
4.952GlyGlu: 4.952 ± 0.912
3.714GlyPhe: 3.714 ± 0.574
4.381GlyGly: 4.381 ± 1.134
0.762GlyHis: 0.762 ± 0.286
4.286GlyIle: 4.286 ± 0.664
4.762GlyLys: 4.762 ± 0.646
4.476GlyLeu: 4.476 ± 0.625
1.81GlyMet: 1.81 ± 0.35
4.667GlyAsn: 4.667 ± 0.924
0.571GlyPro: 0.571 ± 0.213
2.0GlyGln: 2.0 ± 0.477
2.19GlyArg: 2.19 ± 0.535
5.048GlySer: 5.048 ± 1.078
4.476GlyThr: 4.476 ± 0.887
5.333GlyVal: 5.333 ± 0.972
1.333GlyTrp: 1.333 ± 0.265
5.238GlyTyr: 5.238 ± 0.712
0.0GlyXaa: 0.0 ± 0.0
His
0.857HisAla: 0.857 ± 0.317
0.19HisCys: 0.19 ± 0.108
0.667HisAsp: 0.667 ± 0.275
0.571HisGlu: 0.571 ± 0.225
0.857HisPhe: 0.857 ± 0.234
0.952HisGly: 0.952 ± 0.348
0.286HisHis: 0.286 ± 0.177
0.857HisIle: 0.857 ± 0.229
1.81HisLys: 1.81 ± 0.525
0.381HisLeu: 0.381 ± 0.171
0.19HisMet: 0.19 ± 0.133
0.762HisAsn: 0.762 ± 0.242
0.286HisPro: 0.286 ± 0.129
0.381HisGln: 0.381 ± 0.283
0.571HisArg: 0.571 ± 0.23
1.143HisSer: 1.143 ± 0.401
0.286HisThr: 0.286 ± 0.175
0.381HisVal: 0.381 ± 0.221
0.095HisTrp: 0.095 ± 0.084
1.048HisTyr: 1.048 ± 0.284
0.0HisXaa: 0.0 ± 0.0
Ile
6.0IleAla: 6.0 ± 0.94
0.381IleCys: 0.381 ± 0.186
4.095IleAsp: 4.095 ± 0.794
6.762IleGlu: 6.762 ± 0.821
1.714IlePhe: 1.714 ± 0.363
4.381IleGly: 4.381 ± 0.629
0.667IleHis: 0.667 ± 0.271
4.095IleIle: 4.095 ± 0.78
4.667IleLys: 4.667 ± 0.902
4.571IleLeu: 4.571 ± 0.539
1.714IleMet: 1.714 ± 0.4
4.381IleAsn: 4.381 ± 0.632
2.381IlePro: 2.381 ± 0.5
2.0IleGln: 2.0 ± 0.52
2.857IleArg: 2.857 ± 0.565
4.952IleSer: 4.952 ± 0.811
5.238IleThr: 5.238 ± 0.767
4.0IleVal: 4.0 ± 0.788
0.476IleTrp: 0.476 ± 0.26
2.0IleTyr: 2.0 ± 0.404
0.0IleXaa: 0.0 ± 0.0
Lys
6.381LysAla: 6.381 ± 1.113
0.095LysCys: 0.095 ± 0.106
5.048LysAsp: 5.048 ± 0.668
9.238LysGlu: 9.238 ± 1.474
3.048LysPhe: 3.048 ± 0.535
4.857LysGly: 4.857 ± 0.81
1.429LysHis: 1.429 ± 0.345
4.952LysIle: 4.952 ± 0.777
6.762LysLys: 6.762 ± 1.245
6.381LysLeu: 6.381 ± 0.867
2.286LysMet: 2.286 ± 0.496
4.667LysAsn: 4.667 ± 0.564
2.571LysPro: 2.571 ± 0.621
2.667LysGln: 2.667 ± 0.514
3.048LysArg: 3.048 ± 0.664
4.0LysSer: 4.0 ± 0.573
6.0LysThr: 6.0 ± 0.993
5.333LysVal: 5.333 ± 0.824
0.952LysTrp: 0.952 ± 0.371
3.524LysTyr: 3.524 ± 0.535
0.0LysXaa: 0.0 ± 0.0
Leu
4.476LeuAla: 4.476 ± 0.692
0.19LeuCys: 0.19 ± 0.116
4.0LeuAsp: 4.0 ± 0.613
7.048LeuGlu: 7.048 ± 0.918
2.286LeuPhe: 2.286 ± 0.44
4.381LeuGly: 4.381 ± 0.579
0.857LeuHis: 0.857 ± 0.334
6.381LeuIle: 6.381 ± 0.714
5.905LeuLys: 5.905 ± 0.588
4.476LeuLeu: 4.476 ± 0.6
1.048LeuMet: 1.048 ± 0.25
3.714LeuAsn: 3.714 ± 0.57
2.0LeuPro: 2.0 ± 0.33
2.762LeuGln: 2.762 ± 0.504
3.429LeuArg: 3.429 ± 0.549
5.714LeuSer: 5.714 ± 0.663
5.238LeuThr: 5.238 ± 1.093
4.667LeuVal: 4.667 ± 0.735
1.333LeuTrp: 1.333 ± 0.266
2.857LeuTyr: 2.857 ± 0.49
0.0LeuXaa: 0.0 ± 0.0
Met
1.619MetAla: 1.619 ± 0.455
0.0MetCys: 0.0 ± 0.0
0.952MetAsp: 0.952 ± 0.351
2.19MetGlu: 2.19 ± 0.526
1.333MetPhe: 1.333 ± 0.311
0.667MetGly: 0.667 ± 0.217
0.476MetHis: 0.476 ± 0.172
1.524MetIle: 1.524 ± 0.425
2.095MetLys: 2.095 ± 0.409
1.81MetLeu: 1.81 ± 0.417
0.19MetMet: 0.19 ± 0.156
1.714MetAsn: 1.714 ± 0.399
0.381MetPro: 0.381 ± 0.194
0.857MetGln: 0.857 ± 0.283
0.667MetArg: 0.667 ± 0.304
1.524MetSer: 1.524 ± 0.456
2.286MetThr: 2.286 ± 0.547
1.524MetVal: 1.524 ± 0.365
0.286MetTrp: 0.286 ± 0.167
1.048MetTyr: 1.048 ± 0.31
0.0MetXaa: 0.0 ± 0.0
Asn
4.381AsnAla: 4.381 ± 0.65
0.381AsnCys: 0.381 ± 0.183
3.524AsnAsp: 3.524 ± 0.667
4.857AsnGlu: 4.857 ± 0.698
3.238AsnPhe: 3.238 ± 0.661
6.19AsnGly: 6.19 ± 0.878
0.762AsnHis: 0.762 ± 0.223
4.0AsnIle: 4.0 ± 0.577
5.619AsnLys: 5.619 ± 0.72
3.714AsnLeu: 3.714 ± 0.535
1.524AsnMet: 1.524 ± 0.368
4.0AsnAsn: 4.0 ± 0.802
2.381AsnPro: 2.381 ± 0.477
2.762AsnGln: 2.762 ± 0.455
2.0AsnArg: 2.0 ± 0.407
3.714AsnSer: 3.714 ± 0.68
3.81AsnThr: 3.81 ± 0.775
5.333AsnVal: 5.333 ± 0.794
1.429AsnTrp: 1.429 ± 0.355
3.333AsnTyr: 3.333 ± 0.746
0.0AsnXaa: 0.0 ± 0.0
Pro
0.667ProAla: 0.667 ± 0.315
0.0ProCys: 0.0 ± 0.0
1.238ProAsp: 1.238 ± 0.405
1.714ProGlu: 1.714 ± 0.411
1.333ProPhe: 1.333 ± 0.371
0.19ProGly: 0.19 ± 0.121
0.381ProHis: 0.381 ± 0.146
1.905ProIle: 1.905 ± 0.639
1.524ProLys: 1.524 ± 0.536
1.81ProLeu: 1.81 ± 0.562
0.762ProMet: 0.762 ± 0.206
2.857ProAsn: 2.857 ± 0.633
0.667ProPro: 0.667 ± 0.299
1.143ProGln: 1.143 ± 0.419
0.667ProArg: 0.667 ± 0.312
2.286ProSer: 2.286 ± 0.456
1.619ProThr: 1.619 ± 0.505
2.286ProVal: 2.286 ± 0.423
0.286ProTrp: 0.286 ± 0.175
0.762ProTyr: 0.762 ± 0.241
0.0ProXaa: 0.0 ± 0.0
Gln
1.524GlnAla: 1.524 ± 0.419
0.286GlnCys: 0.286 ± 0.153
1.524GlnAsp: 1.524 ± 0.302
2.286GlnGlu: 2.286 ± 0.469
0.762GlnPhe: 0.762 ± 0.288
1.714GlnGly: 1.714 ± 0.424
0.667GlnHis: 0.667 ± 0.253
2.095GlnIle: 2.095 ± 0.438
3.619GlnLys: 3.619 ± 0.585
3.619GlnLeu: 3.619 ± 0.665
0.667GlnMet: 0.667 ± 0.197
2.667GlnAsn: 2.667 ± 0.488
0.476GlnPro: 0.476 ± 0.174
1.143GlnGln: 1.143 ± 0.292
1.81GlnArg: 1.81 ± 0.465
2.667GlnSer: 2.667 ± 0.622
1.905GlnThr: 1.905 ± 0.433
2.0GlnVal: 2.0 ± 0.533
0.095GlnTrp: 0.095 ± 0.101
1.429GlnTyr: 1.429 ± 0.322
0.0GlnXaa: 0.0 ± 0.0
Arg
2.19ArgAla: 2.19 ± 0.515
0.19ArgCys: 0.19 ± 0.123
1.81ArgAsp: 1.81 ± 0.415
1.714ArgGlu: 1.714 ± 0.424
1.81ArgPhe: 1.81 ± 0.356
2.286ArgGly: 2.286 ± 0.55
0.381ArgHis: 0.381 ± 0.196
2.667ArgIle: 2.667 ± 0.482
4.19ArgLys: 4.19 ± 0.859
2.857ArgLeu: 2.857 ± 0.552
1.048ArgMet: 1.048 ± 0.34
2.857ArgAsn: 2.857 ± 0.406
1.429ArgPro: 1.429 ± 0.384
1.714ArgGln: 1.714 ± 0.464
1.905ArgArg: 1.905 ± 0.507
1.619ArgSer: 1.619 ± 0.418
2.667ArgThr: 2.667 ± 0.488
2.286ArgVal: 2.286 ± 0.613
0.19ArgTrp: 0.19 ± 0.174
2.762ArgTyr: 2.762 ± 0.454
0.0ArgXaa: 0.0 ± 0.0
Ser
4.19SerAla: 4.19 ± 0.828
0.19SerCys: 0.19 ± 0.127
6.0SerAsp: 6.0 ± 1.133
3.81SerGlu: 3.81 ± 0.729
3.905SerPhe: 3.905 ± 0.482
5.238SerGly: 5.238 ± 0.802
0.476SerHis: 0.476 ± 0.245
3.048SerIle: 3.048 ± 0.601
4.667SerLys: 4.667 ± 0.708
5.429SerLeu: 5.429 ± 0.868
1.619SerMet: 1.619 ± 0.42
4.667SerAsn: 4.667 ± 0.762
0.952SerPro: 0.952 ± 0.292
2.19SerGln: 2.19 ± 0.456
1.81SerArg: 1.81 ± 0.459
5.238SerSer: 5.238 ± 0.986
3.905SerThr: 3.905 ± 0.907
4.19SerVal: 4.19 ± 0.681
0.476SerTrp: 0.476 ± 0.163
2.571SerTyr: 2.571 ± 0.509
0.0SerXaa: 0.0 ± 0.0
Thr
4.286ThrAla: 4.286 ± 0.743
0.0ThrCys: 0.0 ± 0.0
4.667ThrAsp: 4.667 ± 0.769
3.238ThrGlu: 3.238 ± 0.667
3.429ThrPhe: 3.429 ± 0.596
5.524ThrGly: 5.524 ± 0.896
0.762ThrHis: 0.762 ± 0.221
5.524ThrIle: 5.524 ± 1.018
5.143ThrLys: 5.143 ± 0.662
5.143ThrLeu: 5.143 ± 0.838
0.952ThrMet: 0.952 ± 0.33
3.524ThrAsn: 3.524 ± 0.578
2.095ThrPro: 2.095 ± 0.438
2.571ThrGln: 2.571 ± 0.501
2.0ThrArg: 2.0 ± 0.468
4.286ThrSer: 4.286 ± 0.673
3.143ThrThr: 3.143 ± 0.702
5.714ThrVal: 5.714 ± 0.892
0.857ThrTrp: 0.857 ± 0.252
3.524ThrTyr: 3.524 ± 0.621
0.0ThrXaa: 0.0 ± 0.0
Val
5.238ValAla: 5.238 ± 0.871
0.381ValCys: 0.381 ± 0.207
4.476ValAsp: 4.476 ± 0.603
4.667ValGlu: 4.667 ± 0.655
3.238ValPhe: 3.238 ± 0.699
4.667ValGly: 4.667 ± 0.809
0.857ValHis: 0.857 ± 0.287
4.095ValIle: 4.095 ± 0.674
5.238ValLys: 5.238 ± 0.58
4.571ValLeu: 4.571 ± 0.754
0.952ValMet: 0.952 ± 0.339
4.571ValAsn: 4.571 ± 0.649
2.0ValPro: 2.0 ± 0.332
2.286ValGln: 2.286 ± 0.5
2.571ValArg: 2.571 ± 0.395
4.286ValSer: 4.286 ± 0.755
5.238ValThr: 5.238 ± 0.821
5.81ValVal: 5.81 ± 0.713
0.571ValTrp: 0.571 ± 0.236
2.0ValTyr: 2.0 ± 0.494
0.0ValXaa: 0.0 ± 0.0
Trp
0.667TrpAla: 0.667 ± 0.206
0.095TrpCys: 0.095 ± 0.089
0.952TrpAsp: 0.952 ± 0.26
0.952TrpGlu: 0.952 ± 0.37
0.381TrpPhe: 0.381 ± 0.205
0.952TrpGly: 0.952 ± 0.231
0.286TrpHis: 0.286 ± 0.149
0.762TrpIle: 0.762 ± 0.235
1.143TrpLys: 1.143 ± 0.293
0.857TrpLeu: 0.857 ± 0.271
0.0TrpMet: 0.0 ± 0.0
0.19TrpAsn: 0.19 ± 0.127
0.19TrpPro: 0.19 ± 0.125
0.571TrpGln: 0.571 ± 0.276
0.19TrpArg: 0.19 ± 0.132
1.048TrpSer: 1.048 ± 0.329
0.952TrpThr: 0.952 ± 0.361
1.048TrpVal: 1.048 ± 0.296
0.0TrpTrp: 0.0 ± 0.0
0.762TrpTyr: 0.762 ± 0.313
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.381TyrAla: 2.381 ± 0.408
0.667TyrCys: 0.667 ± 0.239
2.857TyrAsp: 2.857 ± 0.506
3.714TyrGlu: 3.714 ± 0.649
1.714TyrPhe: 1.714 ± 0.326
2.857TyrGly: 2.857 ± 0.532
0.571TyrHis: 0.571 ± 0.268
3.429TyrIle: 3.429 ± 0.565
3.143TyrLys: 3.143 ± 0.809
3.905TyrLeu: 3.905 ± 0.576
1.333TyrMet: 1.333 ± 0.3
4.0TyrAsn: 4.0 ± 0.568
1.333TyrPro: 1.333 ± 0.367
2.19TyrGln: 2.19 ± 0.391
2.0TyrArg: 2.0 ± 0.494
3.048TyrSer: 3.048 ± 0.487
3.333TyrThr: 3.333 ± 0.63
2.571TyrVal: 2.571 ± 0.504
0.857TyrTrp: 0.857 ± 0.226
2.476TyrTyr: 2.476 ± 0.535
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (10501 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski