Amino acid dipeptide frequency for Staphylococcus phage StB20-like

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
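The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names, the use of overlapping windows, the normalisation by the total number of dipeptides, and the mapping of non-standard letters to X are all assumptions made for this example, and the exact procedure used by Proteome-pI is not stated here.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
EXPECTED_PERMILLE = 1000.0 / 400   # uniform expectation: 2.5 per mille = 0.25 %


def dipeptide_permille(proteins):
    """Return overlapping dipeptide frequencies in per mille (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues is treated as X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count every pair of adjacent residues within one protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

With the values from the table, `classify(3.813)` (AlaAla) gives "over" and `classify(0.649)` (AlaCys) gives "under", which corresponds to the red and blue marking described above.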

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
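Because the expected frequency is quoted in percent while the table uses per mille, a small conversion helper can make the comparison explicit. The function names below are hypothetical and serve only as an illustration of the arithmetic.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


# AlaAla is listed as 3.813 per mille in the table.
print(f"{permille_to_fraction(3.813):.6f}")   # fraction of all dipeptides
print(f"{permille_to_percent(3.813):.4f} %")  # same value in percent
```

For example, the 0.25% uniform expectation corresponds to 2.5‰, so a table value of 3.813‰ (0.3813%) lies above it.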

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue (Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa)

Ala
AlaAla: 3.813 ± 1.22
AlaCys: 0.649 ± 0.263
AlaAsp: 3.083 ± 0.397
AlaGlu: 3.651 ± 0.563
AlaPhe: 2.272 ± 0.445
AlaGly: 3.083 ± 0.635
AlaHis: 0.974 ± 0.451
AlaIle: 5.436 ± 1.181
AlaLys: 5.922 ± 0.609
AlaLeu: 4.3 ± 0.88
AlaMet: 2.109 ± 0.622
AlaAsn: 4.219 ± 0.654
AlaPro: 1.866 ± 0.396
AlaGln: 2.19 ± 0.414
AlaArg: 2.109 ± 0.382
AlaSer: 2.353 ± 0.596
AlaThr: 3.245 ± 0.882
AlaVal: 4.138 ± 0.873
AlaTrp: 0.811 ± 0.274
AlaTyr: 2.353 ± 0.507
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.243 ± 0.171
CysCys: 0.243 ± 0.133
CysAsp: 0.162 ± 0.134
CysGlu: 0.243 ± 0.142
CysPhe: 0.162 ± 0.117
CysGly: 0.649 ± 0.254
CysHis: 0.243 ± 0.155
CysIle: 0.325 ± 0.155
CysLys: 0.406 ± 0.196
CysLeu: 0.325 ± 0.175
CysMet: 0.162 ± 0.115
CysAsn: 0.243 ± 0.141
CysPro: 0.162 ± 0.166
CysGln: 0.162 ± 0.137
CysArg: 0.0 ± 0.0
CysSer: 0.406 ± 0.182
CysThr: 0.487 ± 0.177
CysVal: 0.325 ± 0.175
CysTrp: 0.0 ± 0.0
CysTyr: 0.406 ± 0.185
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.57 ± 0.543
AspCys: 0.081 ± 0.099
AspAsp: 4.381 ± 0.992
AspGlu: 4.706 ± 0.748
AspPhe: 3.083 ± 0.444
AspGly: 2.921 ± 0.507
AspHis: 0.73 ± 0.256
AspIle: 5.111 ± 0.931
AspLys: 5.436 ± 0.801
AspLeu: 5.841 ± 0.806
AspMet: 1.623 ± 0.339
AspAsn: 3.894 ± 0.557
AspPro: 1.785 ± 0.318
AspGln: 1.217 ± 0.31
AspArg: 1.623 ± 0.429
AspSer: 2.758 ± 0.428
AspThr: 3.894 ± 0.5
AspVal: 3.245 ± 0.427
AspTrp: 0.649 ± 0.254
AspTyr: 3.245 ± 0.425
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.489 ± 0.511
GluCys: 0.162 ± 0.135
GluAsp: 4.381 ± 0.819
GluGlu: 5.436 ± 1.275
GluPhe: 3.975 ± 0.742
GluGly: 2.596 ± 0.626
GluHis: 1.46 ± 0.412
GluIle: 5.273 ± 0.732
GluLys: 6.166 ± 1.076
GluLeu: 6.815 ± 0.9
GluMet: 2.028 ± 0.327
GluAsn: 4.706 ± 0.731
GluPro: 2.109 ± 0.516
GluGln: 4.056 ± 0.926
GluArg: 3.002 ± 0.464
GluSer: 4.624 ± 0.756
GluThr: 4.219 ± 0.908
GluVal: 5.111 ± 0.781
GluTrp: 1.217 ± 0.434
GluTyr: 2.921 ± 0.538
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.19 ± 0.325
PheCys: 0.081 ± 0.087
PheAsp: 2.84 ± 0.443
PheGlu: 3.326 ± 0.791
PhePhe: 1.704 ± 0.44
PheGly: 2.84 ± 0.669
PheHis: 0.406 ± 0.187
PheIle: 3.407 ± 0.581
PheLys: 4.138 ± 0.571
PheLeu: 3.326 ± 0.57
PheMet: 1.379 ± 0.342
PheAsn: 3.894 ± 0.6
PhePro: 0.73 ± 0.257
PheGln: 1.136 ± 0.3
PheArg: 1.541 ± 0.347
PheSer: 3.002 ± 0.531
PheThr: 2.19 ± 0.423
PheVal: 1.704 ± 0.414
PheTrp: 0.0 ± 0.0
PheTyr: 1.379 ± 0.452
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.407 ± 0.72
GlyCys: 0.162 ± 0.128
GlyAsp: 3.083 ± 0.541
GlyGlu: 2.596 ± 0.527
GlyPhe: 2.921 ± 0.397
GlyGly: 4.543 ± 1.284
GlyHis: 1.055 ± 0.291
GlyIle: 4.3 ± 0.885
GlyLys: 5.03 ± 0.625
GlyLeu: 5.192 ± 0.703
GlyMet: 1.46 ± 0.436
GlyAsn: 3.57 ± 0.54
GlyPro: 1.298 ± 0.381
GlyGln: 2.677 ± 0.431
GlyArg: 2.028 ± 0.457
GlySer: 3.164 ± 0.872
GlyThr: 3.326 ± 0.631
GlyVal: 4.949 ± 0.807
GlyTrp: 0.73 ± 0.347
GlyTyr: 2.596 ± 0.535
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.055 ± 0.268
HisCys: 0.081 ± 0.078
HisAsp: 1.379 ± 0.389
HisGlu: 0.811 ± 0.237
HisPhe: 1.379 ± 0.346
HisGly: 1.298 ± 0.344
HisHis: 0.649 ± 0.256
HisIle: 1.055 ± 0.456
HisLys: 1.46 ± 0.431
HisLeu: 1.947 ± 0.355
HisMet: 0.243 ± 0.143
HisAsn: 1.46 ± 0.366
HisPro: 0.974 ± 0.253
HisGln: 0.406 ± 0.192
HisArg: 0.649 ± 0.216
HisSer: 1.298 ± 0.382
HisThr: 0.406 ± 0.178
HisVal: 0.811 ± 0.28
HisTrp: 0.325 ± 0.212
HisTyr: 1.136 ± 0.287
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.381 ± 0.657
IleCys: 0.892 ± 0.285
IleAsp: 4.787 ± 0.586
IleGlu: 7.221 ± 1.083
IlePhe: 2.921 ± 0.548
IleGly: 4.624 ± 0.771
IleHis: 2.028 ± 0.509
IleIle: 5.03 ± 0.917
IleLys: 8.194 ± 1.335
IleLeu: 5.03 ± 0.702
IleMet: 1.298 ± 0.382
IleAsn: 7.788 ± 0.889
IlePro: 3.164 ± 0.544
IleGln: 2.434 ± 0.534
IleArg: 2.353 ± 0.405
IleSer: 4.624 ± 0.51
IleThr: 4.219 ± 0.585
IleVal: 3.083 ± 0.508
IleTrp: 1.055 ± 0.312
IleTyr: 3.164 ± 0.651
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.436 ± 0.96
LysCys: 0.325 ± 0.21
LysAsp: 6.409 ± 0.765
LysGlu: 7.626 ± 1.166
LysPhe: 3.245 ± 0.456
LysGly: 6.977 ± 1.085
LysHis: 1.136 ± 0.349
LysIle: 6.977 ± 1.174
LysLys: 8.843 ± 1.129
LysLeu: 6.409 ± 0.597
LysMet: 1.785 ± 0.371
LysAsn: 6.815 ± 0.784
LysPro: 3.407 ± 0.596
LysGln: 4.381 ± 0.957
LysArg: 4.949 ± 0.651
LysSer: 5.76 ± 1.001
LysThr: 5.273 ± 0.68
LysVal: 5.273 ± 0.673
LysTrp: 0.892 ± 0.279
LysTyr: 3.894 ± 0.716
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.03 ± 0.972
LeuCys: 0.243 ± 0.148
LeuAsp: 4.868 ± 0.979
LeuGlu: 5.355 ± 0.995
LeuPhe: 2.921 ± 0.55
LeuGly: 3.894 ± 0.626
LeuHis: 1.704 ± 0.349
LeuIle: 6.328 ± 0.922
LeuLys: 8.924 ± 0.887
LeuLeu: 5.922 ± 0.755
LeuMet: 1.947 ± 0.495
LeuAsn: 5.192 ± 0.562
LeuPro: 2.677 ± 0.443
LeuGln: 3.245 ± 0.455
LeuArg: 3.651 ± 0.597
LeuSer: 4.624 ± 0.836
LeuThr: 4.462 ± 0.696
LeuVal: 4.3 ± 0.658
LeuTrp: 0.649 ± 0.243
LeuTyr: 2.109 ± 0.364
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.866 ± 0.368
MetCys: 0.162 ± 0.143
MetAsp: 1.055 ± 0.327
MetGlu: 1.785 ± 0.314
MetPhe: 0.892 ± 0.246
MetGly: 1.217 ± 0.431
MetHis: 0.243 ± 0.144
MetIle: 1.785 ± 0.371
MetLys: 3.245 ± 0.589
MetLeu: 1.541 ± 0.313
MetMet: 0.73 ± 0.269
MetAsn: 1.866 ± 0.314
MetPro: 0.892 ± 0.266
MetGln: 1.217 ± 0.312
MetArg: 1.298 ± 0.493
MetSer: 1.46 ± 0.389
MetThr: 1.46 ± 0.41
MetVal: 1.217 ± 0.356
MetTrp: 0.406 ± 0.219
MetTyr: 1.217 ± 0.293
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 5.273 ± 1.131
AsnCys: 0.325 ± 0.18
AsnAsp: 4.462 ± 0.732
AsnGlu: 5.679 ± 0.812
AsnPhe: 2.596 ± 0.419
AsnGly: 5.598 ± 0.86
AsnHis: 1.217 ± 0.445
AsnIle: 3.813 ± 0.642
AsnLys: 6.004 ± 1.04
AsnLeu: 5.922 ± 0.58
AsnMet: 1.136 ± 0.258
AsnAsn: 5.598 ± 0.818
AsnPro: 2.272 ± 0.453
AsnGln: 3.489 ± 0.42
AsnArg: 2.596 ± 0.42
AsnSer: 3.002 ± 0.641
AsnThr: 4.543 ± 0.446
AsnVal: 4.868 ± 0.946
AsnTrp: 1.055 ± 0.34
AsnTyr: 3.245 ± 0.59
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.136 ± 0.372
ProCys: 0.0 ± 0.0
ProAsp: 0.974 ± 0.357
ProGlu: 2.84 ± 0.694
ProPhe: 1.623 ± 0.37
ProGly: 1.46 ± 0.339
ProHis: 0.487 ± 0.159
ProIle: 2.596 ± 0.439
ProLys: 3.326 ± 0.603
ProLeu: 2.84 ± 0.506
ProMet: 0.649 ± 0.193
ProAsn: 2.434 ± 0.475
ProPro: 0.811 ± 0.292
ProGln: 0.974 ± 0.311
ProArg: 0.73 ± 0.246
ProSer: 1.866 ± 0.48
ProThr: 2.109 ± 0.394
ProVal: 1.704 ± 0.372
ProTrp: 0.081 ± 0.088
ProTyr: 1.217 ± 0.317
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.677 ± 0.432
GlnCys: 0.325 ± 0.188
GlnAsp: 2.19 ± 0.583
GlnGlu: 3.732 ± 0.69
GlnPhe: 1.217 ± 0.346
GlnGly: 1.947 ± 0.486
GlnHis: 0.325 ± 0.195
GlnIle: 3.57 ± 0.499
GlnLys: 3.002 ± 0.507
GlnLeu: 3.407 ± 0.471
GlnMet: 1.541 ± 0.421
GlnAsn: 2.596 ± 0.526
GlnPro: 1.217 ± 0.293
GlnGln: 2.109 ± 0.491
GlnArg: 1.947 ± 0.538
GlnSer: 2.434 ± 0.494
GlnThr: 2.028 ± 0.387
GlnVal: 1.866 ± 0.354
GlnTrp: 0.243 ± 0.155
GlnTyr: 1.541 ± 0.412
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.704 ± 0.335
ArgCys: 0.081 ± 0.08
ArgAsp: 2.677 ± 0.494
ArgGlu: 2.677 ± 0.426
ArgPhe: 1.623 ± 0.377
ArgGly: 1.541 ± 0.282
ArgHis: 0.73 ± 0.238
ArgIle: 3.651 ± 0.581
ArgLys: 3.083 ± 0.498
ArgLeu: 3.164 ± 0.44
ArgMet: 1.217 ± 0.257
ArgAsn: 3.002 ± 0.643
ArgPro: 0.487 ± 0.185
ArgGln: 1.785 ± 0.45
ArgArg: 0.974 ± 0.265
ArgSer: 2.515 ± 0.345
ArgThr: 1.947 ± 0.41
ArgVal: 2.028 ± 0.401
ArgTrp: 0.325 ± 0.224
ArgTyr: 2.028 ± 0.361
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.758 ± 0.637
SerCys: 0.487 ± 0.203
SerAsp: 3.732 ± 0.658
SerGlu: 4.949 ± 0.484
SerPhe: 2.19 ± 0.395
SerGly: 3.813 ± 0.968
SerHis: 1.217 ± 0.34
SerIle: 4.949 ± 0.721
SerLys: 5.03 ± 0.809
SerLeu: 3.813 ± 0.58
SerMet: 2.109 ± 0.426
SerAsn: 4.381 ± 0.484
SerPro: 1.541 ± 0.396
SerGln: 2.677 ± 0.471
SerArg: 2.434 ± 0.448
SerSer: 3.57 ± 0.486
SerThr: 3.407 ± 0.535
SerVal: 3.651 ± 0.644
SerTrp: 0.406 ± 0.19
SerTyr: 2.515 ± 0.458
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.138 ± 0.725
ThrCys: 0.243 ± 0.131
ThrAsp: 3.083 ± 0.588
ThrGlu: 2.84 ± 0.592
ThrPhe: 2.272 ± 0.53
ThrGly: 2.353 ± 0.562
ThrHis: 2.109 ± 0.462
ThrIle: 5.517 ± 0.666
ThrLys: 6.166 ± 0.663
ThrLeu: 4.219 ± 0.576
ThrMet: 1.136 ± 0.324
ThrAsn: 3.651 ± 0.496
ThrPro: 1.623 ± 0.401
ThrGln: 2.272 ± 0.508
ThrArg: 0.974 ± 0.314
ThrSer: 4.462 ± 0.717
ThrThr: 3.57 ± 0.817
ThrVal: 3.894 ± 0.646
ThrTrp: 0.811 ± 0.26
ThrTyr: 1.623 ± 0.377
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.894 ± 0.67
ValCys: 0.243 ± 0.174
ValAsp: 3.326 ± 0.589
ValGlu: 4.219 ± 0.771
ValPhe: 1.785 ± 0.445
ValGly: 4.056 ± 0.766
ValHis: 0.811 ± 0.313
ValIle: 4.624 ± 0.634
ValLys: 6.247 ± 0.853
ValLeu: 4.624 ± 0.753
ValMet: 2.028 ± 0.436
ValAsn: 3.326 ± 0.546
ValPro: 1.46 ± 0.403
ValGln: 1.704 ± 0.372
ValArg: 1.866 ± 0.32
ValSer: 4.462 ± 0.393
ValThr: 3.651 ± 0.625
ValVal: 3.164 ± 0.604
ValTrp: 0.811 ± 0.261
ValTyr: 2.515 ± 0.4
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.243 ± 0.138
TrpCys: 0.0 ± 0.0
TrpAsp: 0.487 ± 0.186
TrpGlu: 1.055 ± 0.338
TrpPhe: 0.325 ± 0.148
TrpGly: 0.73 ± 0.292
TrpHis: 0.325 ± 0.15
TrpIle: 0.811 ± 0.29
TrpLys: 0.811 ± 0.214
TrpLeu: 0.892 ± 0.293
TrpMet: 0.325 ± 0.187
TrpAsn: 1.379 ± 0.603
TrpPro: 0.081 ± 0.077
TrpGln: 0.568 ± 0.197
TrpArg: 0.406 ± 0.155
TrpSer: 0.568 ± 0.205
TrpThr: 0.892 ± 0.403
TrpVal: 0.811 ± 0.259
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.081 ± 0.077
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.272 ± 0.408
TyrCys: 0.568 ± 0.231
TyrAsp: 2.434 ± 0.552
TyrGlu: 3.002 ± 0.71
TyrPhe: 2.109 ± 0.586
TyrGly: 1.947 ± 0.45
TyrHis: 0.974 ± 0.287
TyrIle: 3.732 ± 0.591
TyrLys: 4.381 ± 0.872
TyrLeu: 2.596 ± 0.665
TyrMet: 0.568 ± 0.168
TyrAsn: 2.677 ± 0.437
TyrPro: 1.298 ± 0.411
TyrGln: 1.217 ± 0.323
TyrArg: 2.109 ± 0.433
TyrSer: 2.677 ± 0.485
TyrThr: 1.623 ± 0.423
TyrVal: 2.677 ± 0.518
TyrTrp: 0.325 ± 0.153
TyrTyr: 1.623 ± 0.413
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 59 proteins (12327 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
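As an illustration of what protein-level bootstrapping can look like, the sketch below resamples whole proteins with replacement 100 times and reports the standard deviation of the resulting frequencies. It is a sketch under stated assumptions, not the procedure used by the database: the function names, the fixed random seed, the normalisation by the number of dipeptides, and the choice of the standard deviation as the error measure are all assumptions.

```python
import random
import statistics


def dipeptide_permille(proteins, dipeptide):
    """Per mille frequency of one dipeptide among all overlapping dipeptides."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    replicates = []
    for _ in range(n_rounds):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_permille(sample, dipeptide))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(replicates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the frequency depends on which proteins happen to be in the proteome.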

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski