Amino acid dipeptide frequency for Staphylococcus phage StB27

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
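As a minimal sketch of how such frequencies can be obtained, the Python snippet below counts overlapping adjacent residue pairs within each protein and normalizes the counts. The function name `dipeptide_frequencies`, the one-letter input format, and the choice to normalize by the total number of dipeptides are assumptions made for illustration; they are not taken from the Proteome-pI pipeline, which may normalize differently.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Count overlapping pairs; pairs never span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not StB27 data; 'U' is treated as a non-standard residue.
freqs = dipeptide_frequencies(["MKKLA", "MKUA"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Because pairs are counted per protein, the last residue of one protein and the first residue of the next never form a dipeptide.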

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each of the 400 would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
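To make the unit handling concrete, the following Python sketch converts a tabulated per mille value to a plain fraction and compares it with the uniform expectation (1/400 = 0.25%, i.e. 2.5 per mille). The helper names and the simple above/below rule are illustrative assumptions; the exact criterion used for the red and blue marking is not stated here.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

def uniform_expectation_per_mille(alphabet_size=20):
    """Per mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2

def representation(value, expected):
    """Label a per mille frequency relative to the expected value."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"

expected = uniform_expectation_per_mille()  # 20 standard amino acids
leu_lys = 8.345  # LeuLys value from the table, in per mille
cys_val = 0.08   # CysVal value from the table, in per mille

print(per_mille_to_fraction(leu_lys), representation(leu_lys, expected))
print(per_mille_to_fraction(cys_val), representation(cys_val, expected))
```

Passing `alphabet_size=21` gives the corresponding expectation when Xaa is counted as a 21st amino acid.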

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.728 ± 0.572
AlaCys: 0.481 ± 0.245
AlaAsp: 3.611 ± 0.478
AlaGlu: 4.574 ± 0.798
AlaPhe: 2.488 ± 0.563
AlaGly: 2.889 ± 0.653
AlaHis: 0.802 ± 0.241
AlaIle: 5.376 ± 1.381
AlaLys: 6.5 ± 0.926
AlaLeu: 4.654 ± 1.041
AlaMet: 1.685 ± 0.424
AlaAsn: 4.413 ± 0.579
AlaPro: 1.525 ± 0.363
AlaGln: 2.086 ± 0.577
AlaArg: 2.086 ± 0.339
AlaSer: 2.648 ± 1.021
AlaThr: 3.771 ± 0.672
AlaVal: 3.049 ± 0.6
AlaTrp: 0.642 ± 0.215
AlaTyr: 2.167 ± 0.398
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.16 ± 0.12
CysCys: 0.241 ± 0.146
CysAsp: 0.321 ± 0.208
CysGlu: 0.16 ± 0.124
CysPhe: 0.241 ± 0.143
CysGly: 0.321 ± 0.146
CysHis: 0.321 ± 0.203
CysIle: 0.401 ± 0.176
CysLys: 0.241 ± 0.124
CysLeu: 0.241 ± 0.134
CysMet: 0.16 ± 0.119
CysAsn: 0.241 ± 0.133
CysPro: 0.16 ± 0.12
CysGln: 0.16 ± 0.121
CysArg: 0.401 ± 0.173
CysSer: 0.722 ± 0.269
CysThr: 0.241 ± 0.149
CysVal: 0.08 ± 0.074
CysTrp: 0.241 ± 0.166
CysTyr: 0.16 ± 0.113
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.574 ± 0.658
AspCys: 0.16 ± 0.107
AspAsp: 4.654 ± 0.725
AspGlu: 5.617 ± 0.892
AspPhe: 3.531 ± 0.462
AspGly: 4.494 ± 0.539
AspHis: 0.562 ± 0.202
AspIle: 6.5 ± 1.006
AspLys: 4.975 ± 0.674
AspLeu: 5.938 ± 0.596
AspMet: 2.407 ± 0.432
AspAsn: 4.173 ± 0.659
AspPro: 1.364 ± 0.3
AspGln: 1.444 ± 0.29
AspArg: 2.488 ± 0.579
AspSer: 2.969 ± 0.705
AspThr: 3.45 ± 0.625
AspVal: 4.253 ± 0.524
AspTrp: 0.562 ± 0.231
AspTyr: 2.407 ± 0.467
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.611 ± 0.569
GluCys: 0.241 ± 0.14
GluAsp: 4.975 ± 0.844
GluGlu: 5.697 ± 1.002
GluPhe: 3.049 ± 0.539
GluGly: 4.012 ± 0.6
GluHis: 1.605 ± 0.415
GluIle: 6.018 ± 0.726
GluLys: 6.018 ± 0.807
GluLeu: 7.623 ± 0.873
GluMet: 2.327 ± 0.325
GluAsn: 4.253 ± 0.867
GluPro: 2.006 ± 0.34
GluGln: 4.173 ± 0.607
GluArg: 3.691 ± 0.708
GluSer: 4.574 ± 0.619
GluThr: 3.852 ± 0.468
GluVal: 5.216 ± 0.55
GluTrp: 0.963 ± 0.365
GluTyr: 3.049 ± 0.467
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.247 ± 0.345
PheCys: 0.241 ± 0.128
PheAsp: 3.13 ± 0.527
PheGlu: 3.531 ± 0.585
PhePhe: 1.123 ± 0.379
PheGly: 2.006 ± 0.435
PheHis: 0.241 ± 0.123
PheIle: 3.29 ± 0.633
PheLys: 5.055 ± 0.649
PheLeu: 2.006 ± 0.467
PheMet: 0.722 ± 0.246
PheAsn: 2.728 ± 0.548
PhePro: 0.642 ± 0.25
PheGln: 1.204 ± 0.311
PheArg: 1.364 ± 0.382
PheSer: 2.728 ± 0.552
PheThr: 2.407 ± 0.328
PheVal: 2.969 ± 0.492
PheTrp: 0.08 ± 0.09
PheTyr: 2.247 ± 0.577
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.611 ± 1.078
GlyCys: 0.16 ± 0.111
GlyAsp: 2.488 ± 0.484
GlyGlu: 3.37 ± 0.772
GlyPhe: 2.167 ± 0.5
GlyGly: 1.846 ± 0.349
GlyHis: 0.722 ± 0.277
GlyIle: 4.173 ± 0.607
GlyLys: 5.136 ± 0.783
GlyLeu: 3.852 ± 0.754
GlyMet: 1.926 ± 0.45
GlyAsn: 3.932 ± 0.632
GlyPro: 1.284 ± 0.728
GlyGln: 3.37 ± 0.515
GlyArg: 2.728 ± 0.402
GlySer: 3.049 ± 0.574
GlyThr: 3.21 ± 0.495
GlyVal: 3.771 ± 0.652
GlyTrp: 0.883 ± 0.329
GlyTyr: 2.969 ± 0.558
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.722 ± 0.267
HisCys: 0.08 ± 0.087
HisAsp: 0.883 ± 0.299
HisGlu: 1.043 ± 0.247
HisPhe: 0.802 ± 0.25
HisGly: 1.284 ± 0.282
HisHis: 0.481 ± 0.198
HisIle: 1.364 ± 0.351
HisLys: 1.364 ± 0.295
HisLeu: 1.284 ± 0.318
HisMet: 0.241 ± 0.136
HisAsn: 1.123 ± 0.318
HisPro: 0.562 ± 0.217
HisGln: 0.722 ± 0.22
HisArg: 0.802 ± 0.345
HisSer: 1.123 ± 0.268
HisThr: 1.043 ± 0.254
HisVal: 0.963 ± 0.385
HisTrp: 0.321 ± 0.146
HisTyr: 1.204 ± 0.223
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.852 ± 0.49
IleCys: 0.481 ± 0.233
IleAsp: 7.623 ± 1.03
IleGlu: 5.938 ± 0.798
IlePhe: 2.728 ± 0.668
IleGly: 2.728 ± 0.672
IleHis: 1.364 ± 0.366
IleIle: 4.012 ± 0.628
IleLys: 6.821 ± 0.918
IleLeu: 4.654 ± 0.65
IleMet: 1.605 ± 0.329
IleAsn: 6.018 ± 0.886
IlePro: 2.407 ± 0.4
IleGln: 2.728 ± 0.554
IleArg: 2.568 ± 0.473
IleSer: 4.012 ± 0.825
IleThr: 5.537 ± 0.668
IleVal: 4.092 ± 0.635
IleTrp: 0.722 ± 0.272
IleTyr: 2.086 ± 0.391
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.778 ± 0.63
LysCys: 0.0 ± 0.0
LysAsp: 6.339 ± 0.814
LysGlu: 7.543 ± 0.968
LysPhe: 4.173 ± 0.69
LysGly: 4.654 ± 0.727
LysHis: 1.765 ± 0.368
LysIle: 4.574 ± 0.532
LysLys: 7.463 ± 0.844
LysLeu: 7.222 ± 0.878
LysMet: 2.407 ± 0.508
LysAsn: 4.654 ± 0.803
LysPro: 2.568 ± 0.667
LysGln: 4.574 ± 0.653
LysArg: 3.852 ± 0.637
LysSer: 4.815 ± 0.602
LysThr: 5.858 ± 0.772
LysVal: 6.5 ± 0.789
LysTrp: 0.722 ± 0.228
LysTyr: 3.771 ± 0.603
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.413 ± 0.788
LeuCys: 0.08 ± 0.08
LeuAsp: 4.895 ± 0.563
LeuGlu: 6.018 ± 0.866
LeuPhe: 2.648 ± 0.457
LeuGly: 4.975 ± 0.756
LeuHis: 1.284 ± 0.34
LeuIle: 5.697 ± 0.746
LeuLys: 8.345 ± 0.954
LeuLeu: 5.376 ± 0.537
LeuMet: 1.765 ± 0.363
LeuAsn: 5.938 ± 0.75
LeuPro: 2.728 ± 0.432
LeuGln: 2.648 ± 0.477
LeuArg: 2.407 ± 0.453
LeuSer: 5.055 ± 0.685
LeuThr: 4.815 ± 0.657
LeuVal: 3.771 ± 0.516
LeuTrp: 0.642 ± 0.211
LeuTyr: 2.648 ± 0.439
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.086 ± 0.426
MetCys: 0.16 ± 0.118
MetAsp: 1.926 ± 0.363
MetGlu: 1.605 ± 0.382
MetPhe: 0.802 ± 0.288
MetGly: 1.364 ± 0.339
MetHis: 0.481 ± 0.149
MetIle: 0.883 ± 0.267
MetLys: 2.086 ± 0.388
MetLeu: 2.247 ± 0.345
MetMet: 0.722 ± 0.252
MetAsn: 2.086 ± 0.409
MetPro: 0.642 ± 0.236
MetGln: 0.401 ± 0.154
MetArg: 1.204 ± 0.307
MetSer: 0.802 ± 0.215
MetThr: 2.167 ± 0.355
MetVal: 1.043 ± 0.242
MetTrp: 0.642 ± 0.244
MetTyr: 0.802 ± 0.287
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.413 ± 0.695
AsnCys: 0.241 ± 0.147
AsnAsp: 4.734 ± 1.067
AsnGlu: 5.617 ± 0.962
AsnPhe: 2.247 ± 0.411
AsnGly: 4.895 ± 0.748
AsnHis: 1.605 ± 0.325
AsnIle: 4.012 ± 0.57
AsnLys: 5.617 ± 0.809
AsnLeu: 4.333 ± 0.593
AsnMet: 1.364 ± 0.257
AsnAsn: 4.815 ± 0.94
AsnPro: 2.006 ± 0.501
AsnGln: 3.21 ± 0.532
AsnArg: 2.247 ± 0.475
AsnSer: 4.333 ± 0.701
AsnThr: 4.494 ± 0.765
AsnVal: 4.895 ± 0.685
AsnTrp: 0.883 ± 0.206
AsnTyr: 2.568 ± 0.675
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.284 ± 0.333
ProCys: 0.08 ± 0.077
ProAsp: 0.802 ± 0.327
ProGlu: 2.407 ± 0.497
ProPhe: 1.685 ± 0.389
ProGly: 1.123 ± 0.28
ProHis: 0.562 ± 0.228
ProIle: 2.648 ± 0.387
ProLys: 3.531 ± 0.575
ProLeu: 1.685 ± 0.332
ProMet: 0.321 ± 0.174
ProAsn: 2.327 ± 0.521
ProPro: 1.605 ± 0.338
ProGln: 1.204 ± 0.324
ProArg: 1.043 ± 0.31
ProSer: 2.006 ± 0.522
ProThr: 2.006 ± 0.385
ProVal: 1.765 ± 0.51
ProTrp: 0.16 ± 0.11
ProTyr: 0.722 ± 0.257
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.247 ± 0.504
GlnCys: 0.321 ± 0.155
GlnAsp: 2.648 ± 0.491
GlnGlu: 2.728 ± 0.626
GlnPhe: 2.086 ± 0.405
GlnGly: 2.327 ± 0.492
GlnHis: 0.722 ± 0.197
GlnIle: 3.049 ± 0.577
GlnLys: 3.13 ± 0.618
GlnLeu: 3.531 ± 0.537
GlnMet: 1.204 ± 0.296
GlnAsn: 2.488 ± 0.537
GlnPro: 1.364 ± 0.308
GlnGln: 2.969 ± 0.54
GlnArg: 2.247 ± 0.349
GlnSer: 2.809 ± 0.484
GlnThr: 2.407 ± 0.4
GlnVal: 2.167 ± 0.556
GlnTrp: 0.642 ± 0.256
GlnTyr: 1.765 ± 0.362
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.444 ± 0.335
ArgCys: 0.241 ± 0.154
ArgAsp: 1.605 ± 0.35
ArgGlu: 2.889 ± 0.544
ArgPhe: 2.086 ± 0.433
ArgGly: 1.846 ± 0.373
ArgHis: 0.883 ± 0.268
ArgIle: 3.771 ± 0.595
ArgLys: 3.531 ± 0.594
ArgLeu: 3.932 ± 0.554
ArgMet: 0.963 ± 0.26
ArgAsn: 2.568 ± 0.493
ArgPro: 0.481 ± 0.177
ArgGln: 1.846 ± 0.352
ArgArg: 1.765 ± 0.407
ArgSer: 2.167 ± 0.4
ArgThr: 2.086 ± 0.431
ArgVal: 3.049 ± 0.533
ArgTrp: 0.401 ± 0.225
ArgTyr: 2.889 ± 0.459
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.889 ± 1.008
SerCys: 0.321 ± 0.15
SerAsp: 4.895 ± 0.64
SerGlu: 4.413 ± 0.781
SerPhe: 1.444 ± 0.286
SerGly: 3.852 ± 0.739
SerHis: 1.204 ± 0.307
SerIle: 4.975 ± 0.605
SerLys: 4.815 ± 0.708
SerLeu: 4.092 ± 0.482
SerMet: 0.562 ± 0.23
SerAsn: 4.574 ± 0.833
SerPro: 1.204 ± 0.266
SerGln: 2.247 ± 0.554
SerArg: 2.407 ± 0.473
SerSer: 4.815 ± 1.073
SerThr: 4.173 ± 0.653
SerVal: 4.173 ± 0.74
SerTrp: 0.562 ± 0.244
SerTyr: 2.648 ± 0.472
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.815 ± 0.771
ThrCys: 0.321 ± 0.143
ThrAsp: 3.531 ± 0.628
ThrGlu: 5.136 ± 0.539
ThrPhe: 2.889 ± 0.503
ThrGly: 3.13 ± 0.49
ThrHis: 1.204 ± 0.294
ThrIle: 3.771 ± 0.452
ThrLys: 5.617 ± 0.687
ThrLeu: 4.815 ± 0.531
ThrMet: 1.204 ± 0.309
ThrAsn: 4.092 ± 0.544
ThrPro: 2.327 ± 0.45
ThrGln: 2.809 ± 0.524
ThrArg: 1.926 ± 0.314
ThrSer: 4.173 ± 0.634
ThrThr: 4.413 ± 0.618
ThrVal: 4.173 ± 0.52
ThrTrp: 0.642 ± 0.256
ThrTyr: 2.006 ± 0.44
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.494 ± 1.018
ValCys: 1.043 ± 0.333
ValAsp: 4.815 ± 0.553
ValGlu: 4.494 ± 0.54
ValPhe: 1.926 ± 0.317
ValGly: 3.771 ± 0.565
ValHis: 0.562 ± 0.22
ValIle: 3.932 ± 0.717
ValLys: 4.895 ± 0.622
ValLeu: 4.413 ± 0.732
ValMet: 1.284 ± 0.326
ValAsn: 4.413 ± 0.543
ValPro: 2.568 ± 0.391
ValGln: 2.327 ± 0.33
ValArg: 3.21 ± 0.529
ValSer: 3.45 ± 0.685
ValThr: 4.253 ± 0.504
ValVal: 4.173 ± 0.541
ValTrp: 0.401 ± 0.218
ValTyr: 2.889 ± 0.635
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.043 ± 0.347
TrpCys: 0.0 ± 0.0
TrpAsp: 0.401 ± 0.158
TrpGlu: 0.883 ± 0.306
TrpPhe: 0.562 ± 0.243
TrpGly: 0.722 ± 0.208
TrpHis: 0.16 ± 0.179
TrpIle: 0.642 ± 0.223
TrpLys: 0.401 ± 0.176
TrpLeu: 1.043 ± 0.307
TrpMet: 0.08 ± 0.077
TrpAsn: 0.883 ± 0.259
TrpPro: 0.08 ± 0.081
TrpGln: 0.401 ± 0.172
TrpArg: 0.401 ± 0.192
TrpSer: 1.284 ± 0.357
TrpThr: 0.481 ± 0.237
TrpVal: 0.642 ± 0.234
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.562 ± 0.296
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.765 ± 0.404
TyrCys: 0.321 ± 0.217
TyrAsp: 2.247 ± 0.28
TyrGlu: 3.29 ± 0.649
TyrPhe: 1.284 ± 0.409
TyrGly: 2.407 ± 0.731
TyrHis: 0.883 ± 0.223
TyrIle: 2.889 ± 0.556
TyrLys: 3.691 ± 0.666
TyrLeu: 3.21 ± 0.621
TyrMet: 1.123 ± 0.406
TyrAsn: 2.809 ± 0.535
TyrPro: 1.444 ± 0.389
TyrGln: 2.327 ± 0.494
TyrArg: 1.605 ± 0.462
TyrSer: 2.728 ± 0.491
TyrThr: 2.327 ± 0.492
TyrVal: 2.568 ± 0.437
TyrTrp: 0.562 ± 0.291
TyrTyr: 2.086 ± 0.482
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (12463 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
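One way to read this note is sketched below in Python: proteins are resampled with replacement, the statistic is recomputed for each resample, and the spread of the recomputed values serves as the error estimate. The function names, the use of the standard deviation as the error, the fixed random seed, and the normalization by the number of dipeptides are assumptions for illustration; the source only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within a single protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the statistic over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(rounds):
        # Draw whole proteins with replacement, same number as the input.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences, not StB27 data.
proteins = ["MKKLAEEL", "MKLLKK", "MAEELLKA", "MKKAEL"]
point = dipeptide_per_mille(proteins, "KK")
error = bootstrap_error(proteins, "KK")
print(f"KK: {point:.3f} +/- {error:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.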

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski