Amino acid dipeptide frequency for Streptococcus phage P0091

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
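As an illustration of the idea, the following is a minimal sketch of how dipeptide frequencies could be computed from a set of protein sequences. The function name `dipeptide_per_mille`, the use of one-letter codes, and the normalization by the total number of dipeptides are assumptions made for this example; the exact procedure used by the database is not described on this page.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted with overlap inside each protein and are
    never formed across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every adjacent (first, second) residue pair
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome)
freqs = dipeptide_per_mille(["MKKA", "AKK"])
print(freqs["KK"])  # 2 of the 5 dipeptides are KK
```

Counting within each protein separately avoids creating artificial dipeptides from the last residue of one protein and the first residue of the next.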

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
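The uniform expectation and the red/blue marking described above can be sketched as follows. The function names and the strict comparison against the uniform baseline are assumptions for illustration; the page does not state the exact threshold used for colouring.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed, expected):
    """Label a dipeptide relative to the expected frequency (both in per mille)."""
    if observed > expected:
        return "over"   # would be marked red
    if observed < expected:
        return "under"  # would be marked blue
    return "as expected"


expected = uniform_expectation_per_mille(20)
print(expected)                   # 2.5 (i.e. 0.25%)
print(classify(3.64, expected))   # AlaAla value from the table
print(classify(0.178, expected))  # AlaCys value from the table
```

With 21 symbols (including Xaa), the same function gives 1000/441, roughly 2.27‰.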

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. For example, 3.64‰ corresponds to a fraction of 0.00364 (0.364%).

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error in ‰.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.64 ± 1.058 | 0.178 ± 0.133 | 4.173 ± 0.503 | 4.794 ± 0.873 | 2.575 ± 0.6 | 4.262 ± 0.762 | 0.533 ± 0.266 | 4.439 ± 0.675 | 5.86 ± 1.038 | 6.126 ± 0.557 | 1.243 ± 0.308 | 4.972 ± 1.005 | 1.687 ± 0.415 | 3.285 ± 0.635 | 2.486 ± 0.497 | 3.907 ± 0.664 | 3.64 ± 0.773 | 4.439 ± 0.593 | 0.977 ± 0.271 | 2.397 ± 0.451 | 0.0 ± 0.0
Cys | 0.266 ± 0.148 | 0.0 ± 0.0 | 0.71 ± 0.266 | 0.622 ± 0.275 | 0.444 ± 0.226 | 0.178 ± 0.112 | 0.089 ± 0.096 | 0.0 ± 0.0 | 0.444 ± 0.224 | 0.533 ± 0.231 | 0.0 ± 0.0 | 0.178 ± 0.106 | 0.089 ± 0.087 | 0.266 ± 0.164 | 0.266 ± 0.222 | 0.444 ± 0.188 | 0.355 ± 0.201 | 0.266 ± 0.139 | 0.266 ± 0.161 | 0.089 ± 0.07 | 0.0 ± 0.0
Asp | 3.729 ± 0.614 | 0.533 ± 0.204 | 4.084 ± 0.658 | 4.794 ± 0.767 | 4.084 ± 0.622 | 6.837 ± 1.405 | 1.065 ± 0.278 | 4.262 ± 0.607 | 5.327 ± 0.608 | 3.907 ± 0.842 | 1.865 ± 0.577 | 3.551 ± 0.769 | 2.042 ± 0.443 | 1.865 ± 0.387 | 3.285 ± 0.825 | 2.664 ± 0.628 | 3.551 ± 0.574 | 3.374 ± 0.742 | 0.799 ± 0.268 | 2.397 ± 0.412 | 0.0 ± 0.0
Glu | 3.729 ± 0.605 | 0.444 ± 0.192 | 3.285 ± 0.516 | 5.505 ± 1.281 | 1.865 ± 0.398 | 3.196 ± 0.564 | 1.243 ± 0.426 | 6.925 ± 1.139 | 3.995 ± 0.887 | 6.659 ± 0.951 | 2.131 ± 0.531 | 4.439 ± 0.83 | 1.687 ± 0.48 | 4.262 ± 0.783 | 3.196 ± 0.568 | 3.64 ± 0.575 | 4.262 ± 0.703 | 4.706 ± 0.845 | 0.977 ± 0.247 | 3.374 ± 0.532 | 0.0 ± 0.0
Phe | 3.019 ± 0.546 | 0.266 ± 0.204 | 3.196 ± 0.574 | 2.397 ± 0.522 | 1.598 ± 0.288 | 2.841 ± 0.615 | 0.533 ± 0.221 | 2.308 ± 0.495 | 4.439 ± 0.557 | 3.108 ± 0.483 | 0.533 ± 0.281 | 3.64 ± 0.659 | 0.977 ± 0.317 | 1.065 ± 0.305 | 1.332 ± 0.326 | 3.463 ± 0.518 | 2.22 ± 0.501 | 2.664 ± 0.392 | 0.444 ± 0.183 | 1.776 ± 0.314 | 0.0 ± 0.0
Gly | 2.752 ± 0.571 | 0.355 ± 0.191 | 3.551 ± 0.667 | 3.907 ± 0.55 | 3.551 ± 0.673 | 3.995 ± 0.794 | 0.799 ± 0.38 | 4.528 ± 0.58 | 6.304 ± 0.971 | 6.748 ± 0.876 | 1.421 ± 0.283 | 4.084 ± 0.963 | 1.776 ± 0.599 | 3.108 ± 0.584 | 2.752 ± 0.525 | 4.883 ± 0.83 | 4.972 ± 0.768 | 3.463 ± 0.609 | 1.154 ± 0.426 | 3.019 ± 0.505 | 0.0 ± 0.0
His | 0.533 ± 0.209 | 0.089 ± 0.096 | 1.065 ± 0.407 | 0.533 ± 0.263 | 0.444 ± 0.183 | 0.533 ± 0.184 | 0.533 ± 0.21 | 0.622 ± 0.245 | 1.332 ± 0.37 | 1.865 ± 0.33 | 0.266 ± 0.153 | 0.977 ± 0.275 | 0.444 ± 0.211 | 0.71 ± 0.264 | 0.71 ± 0.215 | 0.799 ± 0.276 | 0.71 ± 0.209 | 1.065 ± 0.306 | 0.089 ± 0.088 | 0.888 ± 0.301 | 0.0 ± 0.0
Ile | 5.86 ± 0.754 | 0.355 ± 0.221 | 4.706 ± 0.715 | 5.061 ± 0.858 | 1.776 ± 0.307 | 4.173 ± 0.57 | 0.799 ± 0.238 | 3.729 ± 0.678 | 7.014 ± 0.709 | 3.995 ± 0.812 | 1.598 ± 0.428 | 4.439 ± 0.58 | 2.93 ± 0.441 | 2.93 ± 0.429 | 2.397 ± 0.465 | 3.907 ± 0.724 | 3.64 ± 0.61 | 3.196 ± 0.598 | 1.243 ± 0.29 | 2.042 ± 0.394 | 0.0 ± 0.0
Lys | 6.126 ± 0.686 | 0.444 ± 0.236 | 5.061 ± 0.689 | 7.103 ± 1.03 | 3.64 ± 0.742 | 5.416 ± 0.853 | 1.509 ± 0.406 | 5.15 ± 0.55 | 5.86 ± 1.102 | 6.393 ± 0.917 | 2.22 ± 0.489 | 5.594 ± 0.818 | 3.019 ± 0.451 | 4.084 ± 0.653 | 3.818 ± 0.639 | 4.528 ± 0.67 | 4.972 ± 0.768 | 3.907 ± 0.642 | 0.799 ± 0.256 | 3.551 ± 0.816 | 0.0 ± 0.0
Leu | 6.304 ± 0.825 | 0.355 ± 0.18 | 6.037 ± 0.662 | 6.748 ± 0.976 | 2.752 ± 0.458 | 5.416 ± 1.03 | 0.799 ± 0.29 | 4.262 ± 0.542 | 7.192 ± 0.772 | 4.794 ± 0.667 | 2.752 ± 0.455 | 5.327 ± 0.576 | 2.664 ± 0.446 | 2.841 ± 0.436 | 3.196 ± 0.715 | 5.949 ± 0.851 | 5.505 ± 0.706 | 4.351 ± 0.616 | 0.71 ± 0.196 | 2.131 ± 0.498 | 0.0 ± 0.0
Met | 1.509 ± 0.414 | 0.0 ± 0.0 | 1.243 ± 0.322 | 1.332 ± 0.413 | 0.888 ± 0.23 | 0.977 ± 0.299 | 0.355 ± 0.186 | 1.687 ± 0.329 | 2.752 ± 0.567 | 2.042 ± 0.339 | 0.888 ± 0.271 | 1.154 ± 0.293 | 0.622 ± 0.194 | 0.888 ± 0.286 | 0.977 ± 0.265 | 1.865 ± 0.419 | 1.509 ± 0.4 | 1.421 ± 0.368 | 0.089 ± 0.08 | 1.065 ± 0.32 | 0.0 ± 0.0
Asn | 4.173 ± 1.214 | 0.178 ± 0.131 | 3.64 ± 0.582 | 3.64 ± 0.667 | 2.752 ± 0.498 | 6.748 ± 1.22 | 0.977 ± 0.226 | 4.262 ± 0.566 | 3.729 ± 0.493 | 5.238 ± 0.674 | 0.977 ± 0.274 | 3.196 ± 0.505 | 3.374 ± 0.597 | 2.308 ± 0.469 | 2.22 ± 0.488 | 4.617 ± 0.643 | 3.551 ± 0.79 | 3.729 ± 0.659 | 1.332 ± 0.3 | 2.042 ± 0.412 | 0.0 ± 0.0
Pro | 2.308 ± 0.442 | 0.266 ± 0.207 | 1.509 ± 0.473 | 1.953 ± 0.585 | 1.332 ± 0.35 | 1.154 ± 0.393 | 0.355 ± 0.157 | 1.865 ± 0.338 | 3.818 ± 0.562 | 2.042 ± 0.432 | 0.355 ± 0.168 | 2.486 ± 0.477 | 0.444 ± 0.242 | 1.865 ± 0.276 | 1.065 ± 0.315 | 2.308 ± 0.436 | 2.042 ± 0.368 | 1.687 ± 0.394 | 0.355 ± 0.146 | 1.243 ± 0.37 | 0.0 ± 0.0
Gln | 3.907 ± 0.631 | 0.266 ± 0.135 | 2.575 ± 0.555 | 3.108 ± 0.619 | 1.243 ± 0.284 | 3.64 ± 0.745 | 0.444 ± 0.212 | 2.486 ± 0.557 | 3.196 ± 0.479 | 4.084 ± 0.614 | 1.421 ± 0.334 | 2.575 ± 0.454 | 0.799 ± 0.271 | 2.93 ± 0.596 | 2.22 ± 0.428 | 2.752 ± 0.427 | 3.108 ± 0.48 | 2.397 ± 0.464 | 0.533 ± 0.199 | 1.865 ± 0.338 | 0.0 ± 0.0
Arg | 2.22 ± 0.357 | 0.089 ± 0.102 | 2.752 ± 0.742 | 3.019 ± 0.509 | 2.397 ± 0.472 | 2.042 ± 0.35 | 0.799 ± 0.241 | 2.752 ± 0.516 | 3.019 ± 0.639 | 3.729 ± 0.619 | 0.799 ± 0.25 | 2.397 ± 0.347 | 1.065 ± 0.313 | 2.22 ± 0.409 | 1.509 ± 0.319 | 2.131 ± 0.401 | 2.752 ± 0.722 | 2.575 ± 0.312 | 1.154 ± 0.325 | 2.042 ± 0.445 | 0.0 ± 0.0
Ser | 3.995 ± 0.479 | 0.444 ± 0.252 | 5.061 ± 0.646 | 3.64 ± 0.581 | 3.374 ± 0.773 | 4.706 ± 0.544 | 0.622 ± 0.209 | 4.706 ± 0.742 | 4.883 ± 0.851 | 5.15 ± 0.685 | 1.776 ± 0.323 | 4.262 ± 0.707 | 2.131 ± 0.416 | 3.285 ± 0.617 | 2.575 ± 0.681 | 4.351 ± 0.64 | 4.262 ± 0.684 | 4.084 ± 0.559 | 0.622 ± 0.241 | 1.865 ± 0.482 | 0.0 ± 0.0
Thr | 4.262 ± 0.737 | 0.266 ± 0.157 | 3.818 ± 0.64 | 3.551 ± 0.601 | 2.575 ± 0.64 | 4.084 ± 0.743 | 1.154 ± 0.294 | 5.238 ± 0.91 | 5.327 ± 0.829 | 5.771 ± 0.74 | 0.799 ± 0.289 | 3.463 ± 0.579 | 1.687 ± 0.481 | 2.308 ± 0.563 | 2.308 ± 0.384 | 3.64 ± 0.534 | 3.374 ± 0.687 | 4.617 ± 0.621 | 1.243 ± 0.376 | 2.93 ± 0.715 | 0.0 ± 0.0
Val | 4.173 ± 0.927 | 0.266 ± 0.144 | 4.617 ± 0.639 | 3.551 ± 0.627 | 2.397 ± 0.368 | 3.818 ± 0.496 | 0.622 ± 0.235 | 3.907 ± 0.554 | 4.972 ± 0.714 | 3.285 ± 0.601 | 1.065 ± 0.347 | 3.729 ± 0.613 | 1.865 ± 0.445 | 2.131 ± 0.434 | 2.131 ± 0.572 | 5.061 ± 0.701 | 4.883 ± 0.716 | 2.93 ± 0.532 | 0.71 ± 0.241 | 2.131 ± 0.454 | 0.0 ± 0.0
Trp | 0.888 ± 0.255 | 0.089 ± 0.086 | 0.888 ± 0.48 | 0.977 ± 0.265 | 0.533 ± 0.2 | 0.622 ± 0.188 | 0.266 ± 0.156 | 0.71 ± 0.262 | 1.065 ± 0.532 | 1.332 ± 0.345 | 0.178 ± 0.125 | 0.622 ± 0.233 | 0.178 ± 0.132 | 0.799 ± 0.277 | 0.622 ± 0.213 | 1.509 ± 0.523 | 0.888 ± 0.357 | 0.977 ± 0.205 | 0.355 ± 0.217 | 0.444 ± 0.167 | 0.0 ± 0.0
Tyr | 2.22 ± 0.462 | 0.622 ± 0.32 | 2.042 ± 0.443 | 3.374 ± 0.7 | 1.598 ± 0.393 | 2.486 ± 0.519 | 0.71 ± 0.227 | 1.953 ± 0.478 | 2.664 ± 0.433 | 3.285 ± 0.397 | 0.888 ± 0.34 | 1.598 ± 0.375 | 0.977 ± 0.292 | 2.397 ± 0.468 | 2.397 ± 0.387 | 3.374 ± 0.716 | 2.22 ± 0.456 | 2.397 ± 0.573 | 0.089 ± 0.093 | 2.042 ± 0.616 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 44 proteins (11264 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
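Protein-level bootstrapping of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of those values is reported. This is only an illustration under stated assumptions. The function names, the fixed random seed, the normalization by the total number of dipeptides, and the use of the standard deviation as the reported error are all assumptions, since the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Per-mille frequency of each dipeptide, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    # Assumption: the error is the standard deviation across replicates
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome)
toy = ["MKKA", "AKK", "GAVL"]
print(bootstrap_error(toy, "KK"))
```

A dipeptide that never occurs in any protein has a frequency of zero in every replicate, so its estimated error is also zero, which is consistent with the 0.0 ± 0.0 entries in the table.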

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in UniProt
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski