Amino acid dipepetide frequency for Staphylococcus phage B236

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.516AlaAla: 0.516 ± 0.232
0.295AlaCys: 0.295 ± 0.148
2.949AlaAsp: 2.949 ± 0.47
3.833AlaGlu: 3.833 ± 0.432
2.949AlaPhe: 2.949 ± 0.666
3.465AlaGly: 3.465 ± 0.549
1.327AlaHis: 1.327 ± 0.281
5.086AlaIle: 5.086 ± 0.6
6.266AlaLys: 6.266 ± 0.66
4.939AlaLeu: 4.939 ± 0.69
1.253AlaMet: 1.253 ± 0.372
3.686AlaAsn: 3.686 ± 0.331
1.769AlaPro: 1.769 ± 0.286
2.506AlaGln: 2.506 ± 0.431
2.654AlaArg: 2.654 ± 0.431
4.128AlaSer: 4.128 ± 0.65
3.686AlaThr: 3.686 ± 0.48
3.686AlaVal: 3.686 ± 0.66
0.811AlaTrp: 0.811 ± 0.275
2.064AlaTyr: 2.064 ± 0.449
0.0AlaXaa: 0.0 ± 0.0
Cys
0.147CysAla: 0.147 ± 0.1
0.0CysCys: 0.0 ± 0.0
0.295CysAsp: 0.295 ± 0.144
0.147CysGlu: 0.147 ± 0.087
0.369CysPhe: 0.369 ± 0.149
0.295CysGly: 0.295 ± 0.144
0.0CysHis: 0.0 ± 0.0
0.147CysIle: 0.147 ± 0.106
0.442CysLys: 0.442 ± 0.174
0.221CysLeu: 0.221 ± 0.132
0.147CysMet: 0.147 ± 0.103
0.221CysAsn: 0.221 ± 0.114
0.147CysPro: 0.147 ± 0.093
0.221CysGln: 0.221 ± 0.109
0.295CysArg: 0.295 ± 0.15
0.221CysSer: 0.221 ± 0.167
0.147CysThr: 0.147 ± 0.112
0.295CysVal: 0.295 ± 0.157
0.074CysTrp: 0.074 ± 0.071
0.221CysTyr: 0.221 ± 0.117
0.0CysXaa: 0.0 ± 0.0
Asp
3.833AspAla: 3.833 ± 0.512
0.147AspCys: 0.147 ± 0.101
5.455AspAsp: 5.455 ± 0.848
5.823AspGlu: 5.823 ± 0.787
3.317AspPhe: 3.317 ± 0.525
4.202AspGly: 4.202 ± 0.525
0.442AspHis: 0.442 ± 0.177
4.791AspIle: 4.791 ± 0.662
5.529AspLys: 5.529 ± 0.688
4.791AspLeu: 4.791 ± 0.458
1.843AspMet: 1.843 ± 0.376
3.17AspAsn: 3.17 ± 0.551
1.548AspPro: 1.548 ± 0.293
1.401AspGln: 1.401 ± 0.296
2.138AspArg: 2.138 ± 0.389
4.202AspSer: 4.202 ± 0.473
3.391AspThr: 3.391 ± 0.433
3.981AspVal: 3.981 ± 0.64
0.737AspTrp: 0.737 ± 0.248
3.096AspTyr: 3.096 ± 0.444
0.0AspXaa: 0.0 ± 0.0
Glu
4.497GluAla: 4.497 ± 0.695
0.442GluCys: 0.442 ± 0.185
3.907GluAsp: 3.907 ± 0.589
6.855GluGlu: 6.855 ± 1.071
2.875GluPhe: 2.875 ± 0.478
2.727GluGly: 2.727 ± 0.355
1.99GluHis: 1.99 ± 0.394
5.455GluIle: 5.455 ± 0.759
6.192GluLys: 6.192 ± 0.711
8.109GluLeu: 8.109 ± 0.969
2.064GluMet: 2.064 ± 0.397
4.644GluAsn: 4.644 ± 0.491
2.064GluPro: 2.064 ± 0.384
3.833GluGln: 3.833 ± 0.607
3.759GluArg: 3.759 ± 0.546
3.243GluSer: 3.243 ± 0.492
3.981GluThr: 3.981 ± 0.45
5.529GluVal: 5.529 ± 0.664
1.327GluTrp: 1.327 ± 0.273
3.833GluTyr: 3.833 ± 0.551
0.0GluXaa: 0.0 ± 0.0
Phe
1.769PheAla: 1.769 ± 0.403
0.516PheCys: 0.516 ± 0.186
4.128PheAsp: 4.128 ± 0.439
3.096PheGlu: 3.096 ± 0.555
1.253PhePhe: 1.253 ± 0.334
2.285PheGly: 2.285 ± 0.613
0.59PheHis: 0.59 ± 0.196
3.391PheIle: 3.391 ± 0.58
4.423PheLys: 4.423 ± 0.577
3.096PheLeu: 3.096 ± 0.447
0.811PheMet: 0.811 ± 0.274
2.875PheAsn: 2.875 ± 0.404
1.032PhePro: 1.032 ± 0.319
0.737PheGln: 0.737 ± 0.22
1.253PheArg: 1.253 ± 0.247
2.285PheSer: 2.285 ± 0.45
3.022PheThr: 3.022 ± 0.444
2.875PheVal: 2.875 ± 0.505
0.369PheTrp: 0.369 ± 0.188
1.843PheTyr: 1.843 ± 0.455
0.0PheXaa: 0.0 ± 0.0
Gly
4.349GlyAla: 4.349 ± 0.656
0.295GlyCys: 0.295 ± 0.14
3.538GlyAsp: 3.538 ± 0.583
3.022GlyGlu: 3.022 ± 0.526
2.801GlyPhe: 2.801 ± 0.505
3.243GlyGly: 3.243 ± 0.508
1.474GlyHis: 1.474 ± 0.377
4.054GlyIle: 4.054 ± 0.634
4.865GlyLys: 4.865 ± 0.47
4.054GlyLeu: 4.054 ± 0.719
1.769GlyMet: 1.769 ± 0.337
3.317GlyAsn: 3.317 ± 0.489
0.295GlyPro: 0.295 ± 0.162
2.949GlyGln: 2.949 ± 0.374
2.58GlyArg: 2.58 ± 0.423
2.727GlySer: 2.727 ± 0.506
3.981GlyThr: 3.981 ± 0.455
5.455GlyVal: 5.455 ± 0.694
0.811GlyTrp: 0.811 ± 0.213
2.875GlyTyr: 2.875 ± 0.435
0.0GlyXaa: 0.0 ± 0.0
His
1.622HisAla: 1.622 ± 0.295
0.0HisCys: 0.0 ± 0.0
0.59HisAsp: 0.59 ± 0.229
1.474HisGlu: 1.474 ± 0.329
0.811HisPhe: 0.811 ± 0.181
1.327HisGly: 1.327 ± 0.318
0.442HisHis: 0.442 ± 0.159
1.401HisIle: 1.401 ± 0.295
1.401HisLys: 1.401 ± 0.365
1.253HisLeu: 1.253 ± 0.289
0.369HisMet: 0.369 ± 0.145
0.811HisAsn: 0.811 ± 0.206
0.663HisPro: 0.663 ± 0.247
1.179HisGln: 1.179 ± 0.304
0.516HisArg: 0.516 ± 0.21
1.253HisSer: 1.253 ± 0.324
1.179HisThr: 1.179 ± 0.284
0.958HisVal: 0.958 ± 0.283
0.0HisTrp: 0.0 ± 0.0
1.106HisTyr: 1.106 ± 0.366
0.0HisXaa: 0.0 ± 0.0
Ile
5.16IleAla: 5.16 ± 0.542
0.369IleCys: 0.369 ± 0.152
5.455IleAsp: 5.455 ± 0.703
6.413IleGlu: 6.413 ± 0.796
2.949IlePhe: 2.949 ± 0.506
5.16IleGly: 5.16 ± 0.946
0.737IleHis: 0.737 ± 0.239
4.128IleIle: 4.128 ± 0.574
8.33IleLys: 8.33 ± 0.908
3.243IleLeu: 3.243 ± 0.488
2.064IleMet: 2.064 ± 0.349
4.865IleAsn: 4.865 ± 0.664
1.99IlePro: 1.99 ± 0.399
2.949IleGln: 2.949 ± 0.49
3.17IleArg: 3.17 ± 0.627
4.497IleSer: 4.497 ± 0.467
5.381IleThr: 5.381 ± 0.603
3.759IleVal: 3.759 ± 0.475
0.885IleTrp: 0.885 ± 0.312
2.801IleTyr: 2.801 ± 0.529
0.0IleXaa: 0.0 ± 0.0
Lys
5.234LysAla: 5.234 ± 0.437
0.221LysCys: 0.221 ± 0.126
5.529LysAsp: 5.529 ± 0.647
8.477LysGlu: 8.477 ± 0.903
3.391LysPhe: 3.391 ± 0.47
5.234LysGly: 5.234 ± 0.565
2.064LysHis: 2.064 ± 0.417
7.445LysIle: 7.445 ± 0.791
7.298LysLys: 7.298 ± 0.911
6.561LysLeu: 6.561 ± 0.791
2.949LysMet: 2.949 ± 0.454
5.602LysAsn: 5.602 ± 0.573
2.875LysPro: 2.875 ± 0.445
3.981LysGln: 3.981 ± 0.576
4.349LysArg: 4.349 ± 0.58
4.791LysSer: 4.791 ± 0.705
5.823LysThr: 5.823 ± 0.702
5.602LysVal: 5.602 ± 0.644
0.811LysTrp: 0.811 ± 0.228
3.833LysTyr: 3.833 ± 0.527
0.0LysXaa: 0.0 ± 0.0
Leu
4.57LeuAla: 4.57 ± 0.673
0.147LeuCys: 0.147 ± 0.099
5.234LeuAsp: 5.234 ± 0.555
6.339LeuGlu: 6.339 ± 0.904
3.243LeuPhe: 3.243 ± 0.467
3.465LeuGly: 3.465 ± 0.484
1.548LeuHis: 1.548 ± 0.3
4.497LeuIle: 4.497 ± 0.451
6.118LeuLys: 6.118 ± 0.527
4.57LeuLeu: 4.57 ± 0.51
1.695LeuMet: 1.695 ± 0.325
5.897LeuAsn: 5.897 ± 0.697
2.285LeuPro: 2.285 ± 0.384
3.391LeuGln: 3.391 ± 0.498
2.654LeuArg: 2.654 ± 0.546
4.202LeuSer: 4.202 ± 0.554
5.823LeuThr: 5.823 ± 0.668
3.686LeuVal: 3.686 ± 0.481
0.442LeuTrp: 0.442 ± 0.23
3.907LeuTyr: 3.907 ± 0.574
0.0LeuXaa: 0.0 ± 0.0
Met
1.401MetAla: 1.401 ± 0.475
0.0MetCys: 0.0 ± 0.0
1.548MetAsp: 1.548 ± 0.344
1.769MetGlu: 1.769 ± 0.318
1.401MetPhe: 1.401 ± 0.305
1.032MetGly: 1.032 ± 0.291
0.295MetHis: 0.295 ± 0.175
2.211MetIle: 2.211 ± 0.403
1.548MetLys: 1.548 ± 0.322
2.433MetLeu: 2.433 ± 0.396
1.032MetMet: 1.032 ± 0.341
1.99MetAsn: 1.99 ± 0.38
1.401MetPro: 1.401 ± 0.307
1.474MetGln: 1.474 ± 0.367
1.179MetArg: 1.179 ± 0.295
1.548MetSer: 1.548 ± 0.39
1.695MetThr: 1.695 ± 0.39
0.811MetVal: 0.811 ± 0.253
0.369MetTrp: 0.369 ± 0.149
0.958MetTyr: 0.958 ± 0.28
0.0MetXaa: 0.0 ± 0.0
Asn
4.054AsnAla: 4.054 ± 0.59
0.369AsnCys: 0.369 ± 0.195
4.718AsnAsp: 4.718 ± 0.691
4.644AsnGlu: 4.644 ± 0.633
2.654AsnPhe: 2.654 ± 0.503
4.57AsnGly: 4.57 ± 0.667
1.032AsnHis: 1.032 ± 0.313
4.497AsnIle: 4.497 ± 0.56
6.339AsnLys: 6.339 ± 0.687
3.538AsnLeu: 3.538 ± 0.51
1.695AsnMet: 1.695 ± 0.312
4.718AsnAsn: 4.718 ± 0.704
2.506AsnPro: 2.506 ± 0.471
2.506AsnGln: 2.506 ± 0.368
2.654AsnArg: 2.654 ± 0.446
3.686AsnSer: 3.686 ± 0.447
3.465AsnThr: 3.465 ± 0.426
3.465AsnVal: 3.465 ± 0.544
0.885AsnTrp: 0.885 ± 0.245
2.875AsnTyr: 2.875 ± 0.499
0.0AsnXaa: 0.0 ± 0.0
Pro
1.327ProAla: 1.327 ± 0.245
0.0ProCys: 0.0 ± 0.0
1.769ProAsp: 1.769 ± 0.322
1.99ProGlu: 1.99 ± 0.339
1.548ProPhe: 1.548 ± 0.305
1.769ProGly: 1.769 ± 0.452
0.442ProHis: 0.442 ± 0.186
2.211ProIle: 2.211 ± 0.47
3.465ProLys: 3.465 ± 0.583
1.474ProLeu: 1.474 ± 0.319
0.885ProMet: 0.885 ± 0.258
1.769ProAsn: 1.769 ± 0.382
0.442ProPro: 0.442 ± 0.155
0.885ProGln: 0.885 ± 0.288
0.958ProArg: 0.958 ± 0.233
2.138ProSer: 2.138 ± 0.39
1.548ProThr: 1.548 ± 0.32
1.769ProVal: 1.769 ± 0.338
0.074ProTrp: 0.074 ± 0.075
1.695ProTyr: 1.695 ± 0.412
0.0ProXaa: 0.0 ± 0.0
Gln
3.317GlnAla: 3.317 ± 0.483
0.442GlnCys: 0.442 ± 0.181
1.843GlnAsp: 1.843 ± 0.391
2.654GlnGlu: 2.654 ± 0.493
2.064GlnPhe: 2.064 ± 0.416
2.359GlnGly: 2.359 ± 0.455
0.958GlnHis: 0.958 ± 0.242
3.096GlnIle: 3.096 ± 0.403
3.317GlnLys: 3.317 ± 0.603
3.022GlnLeu: 3.022 ± 0.444
1.401GlnMet: 1.401 ± 0.369
2.727GlnAsn: 2.727 ± 0.384
1.253GlnPro: 1.253 ± 0.306
2.211GlnGln: 2.211 ± 0.573
2.359GlnArg: 2.359 ± 0.395
1.843GlnSer: 1.843 ± 0.377
1.99GlnThr: 1.99 ± 0.458
2.064GlnVal: 2.064 ± 0.468
0.369GlnTrp: 0.369 ± 0.18
1.253GlnTyr: 1.253 ± 0.391
0.0GlnXaa: 0.0 ± 0.0
Arg
1.474ArgAla: 1.474 ± 0.282
0.369ArgCys: 0.369 ± 0.146
2.727ArgAsp: 2.727 ± 0.386
3.538ArgGlu: 3.538 ± 0.515
1.917ArgPhe: 1.917 ± 0.428
2.064ArgGly: 2.064 ± 0.429
1.401ArgHis: 1.401 ± 0.313
3.17ArgIle: 3.17 ± 0.44
3.833ArgLys: 3.833 ± 0.619
3.465ArgLeu: 3.465 ± 0.517
0.663ArgMet: 0.663 ± 0.224
2.727ArgAsn: 2.727 ± 0.405
1.179ArgPro: 1.179 ± 0.298
2.211ArgGln: 2.211 ± 0.447
1.548ArgArg: 1.548 ± 0.304
1.843ArgSer: 1.843 ± 0.347
2.211ArgThr: 2.211 ± 0.501
1.917ArgVal: 1.917 ± 0.351
0.516ArgTrp: 0.516 ± 0.195
2.359ArgTyr: 2.359 ± 0.359
0.0ArgXaa: 0.0 ± 0.0
Ser
4.054SerAla: 4.054 ± 0.604
0.0SerCys: 0.0 ± 0.0
4.423SerAsp: 4.423 ± 0.555
3.022SerGlu: 3.022 ± 0.491
2.211SerPhe: 2.211 ± 0.332
4.128SerGly: 4.128 ± 0.7
0.811SerHis: 0.811 ± 0.261
5.013SerIle: 5.013 ± 0.659
5.897SerLys: 5.897 ± 0.829
3.981SerLeu: 3.981 ± 0.475
2.064SerMet: 2.064 ± 0.436
4.054SerAsn: 4.054 ± 0.606
1.179SerPro: 1.179 ± 0.275
2.285SerGln: 2.285 ± 0.463
1.99SerArg: 1.99 ± 0.34
3.17SerSer: 3.17 ± 0.599
3.538SerThr: 3.538 ± 0.479
3.243SerVal: 3.243 ± 0.574
0.811SerTrp: 0.811 ± 0.243
2.433SerTyr: 2.433 ± 0.387
0.0SerXaa: 0.0 ± 0.0
Thr
3.686ThrAla: 3.686 ± 0.522
0.0ThrCys: 0.0 ± 0.0
3.022ThrAsp: 3.022 ± 0.447
4.57ThrGlu: 4.57 ± 0.62
2.211ThrPhe: 2.211 ± 0.454
4.423ThrGly: 4.423 ± 0.703
1.327ThrHis: 1.327 ± 0.274
5.16ThrIle: 5.16 ± 0.673
5.16ThrLys: 5.16 ± 0.688
5.013ThrLeu: 5.013 ± 0.475
1.253ThrMet: 1.253 ± 0.302
3.981ThrAsn: 3.981 ± 0.594
2.211ThrPro: 2.211 ± 0.379
2.875ThrGln: 2.875 ± 0.454
2.359ThrArg: 2.359 ± 0.408
4.497ThrSer: 4.497 ± 0.85
3.833ThrThr: 3.833 ± 0.511
3.391ThrVal: 3.391 ± 0.489
0.885ThrTrp: 0.885 ± 0.295
2.727ThrTyr: 2.727 ± 0.523
0.0ThrXaa: 0.0 ± 0.0
Val
4.349ValAla: 4.349 ± 0.743
0.147ValCys: 0.147 ± 0.092
4.791ValAsp: 4.791 ± 0.641
4.939ValGlu: 4.939 ± 0.671
1.769ValPhe: 1.769 ± 0.338
3.465ValGly: 3.465 ± 0.556
0.295ValHis: 0.295 ± 0.153
4.423ValIle: 4.423 ± 0.565
6.487ValLys: 6.487 ± 0.73
5.455ValLeu: 5.455 ± 0.576
1.327ValMet: 1.327 ± 0.321
3.317ValAsn: 3.317 ± 0.455
1.99ValPro: 1.99 ± 0.427
0.811ValGln: 0.811 ± 0.294
2.359ValArg: 2.359 ± 0.352
3.759ValSer: 3.759 ± 0.627
3.907ValThr: 3.907 ± 0.548
3.907ValVal: 3.907 ± 0.562
0.811ValTrp: 0.811 ± 0.231
2.064ValTyr: 2.064 ± 0.45
0.0ValXaa: 0.0 ± 0.0
Trp
0.59TrpAla: 0.59 ± 0.209
0.147TrpCys: 0.147 ± 0.09
0.295TrpAsp: 0.295 ± 0.14
0.958TrpGlu: 0.958 ± 0.274
0.442TrpPhe: 0.442 ± 0.152
0.663TrpGly: 0.663 ± 0.253
0.295TrpHis: 0.295 ± 0.149
0.885TrpIle: 0.885 ± 0.234
1.032TrpLys: 1.032 ± 0.274
1.106TrpLeu: 1.106 ± 0.307
0.147TrpMet: 0.147 ± 0.094
1.179TrpAsn: 1.179 ± 0.379
0.074TrpPro: 0.074 ± 0.065
0.442TrpGln: 0.442 ± 0.199
0.295TrpArg: 0.295 ± 0.159
0.885TrpSer: 0.885 ± 0.252
1.179TrpThr: 1.179 ± 0.215
0.885TrpVal: 0.885 ± 0.255
0.147TrpTrp: 0.147 ± 0.092
0.442TrpTyr: 0.442 ± 0.2
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.917TyrAla: 1.917 ± 0.462
0.147TyrCys: 0.147 ± 0.103
1.622TyrAsp: 1.622 ± 0.379
3.759TyrGlu: 3.759 ± 0.501
1.474TyrPhe: 1.474 ± 0.428
2.58TyrGly: 2.58 ± 0.507
0.885TyrHis: 0.885 ± 0.293
3.243TyrIle: 3.243 ± 0.577
4.128TyrLys: 4.128 ± 0.549
3.391TyrLeu: 3.391 ± 0.501
0.59TyrMet: 0.59 ± 0.172
3.317TyrAsn: 3.317 ± 0.537
1.253TyrPro: 1.253 ± 0.34
1.695TyrGln: 1.695 ± 0.343
2.064TyrArg: 2.064 ± 0.439
3.391TyrSer: 3.391 ± 0.508
2.801TyrThr: 2.801 ± 0.421
3.17TyrVal: 3.17 ± 0.597
0.958TyrTrp: 0.958 ± 0.367
1.843TyrTyr: 1.843 ± 0.383
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (13567 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski