Amino acid dipeptide frequency for Staphylococcus phage vB_SauS-SAP27

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
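The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to be given as one-letter amino acid strings, any letter outside the 20 standard ones is mapped to `X` (Xaa), and frequencies are normalised by the total number of dipeptides counted (the database may normalise differently).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids

# Uniform expectation from the text: 1 of 400 dipeptides = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein strings."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard or unknown letter is treated as Xaa ('X').
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Compare a frequency (per mille) with the uniform expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"   # would be marked in red
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"  # would be marked in blue
    return "as expected"
```

For example, `classify(8.398)` (the IleLys value in the table) returns `"over"`, while `classify(0.595)` (AlaAla) returns `"under"`.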

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
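As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the input value 8.398 is the IleLys entry from the table.

```python
def per_mille_to_fraction(value):
    """Per mille -> fraction: multiply by 10^-3."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Per mille -> percent: 1 per mille is 0.1 %."""
    return value * 0.1


ile_lys = 8.398  # IleLys frequency in per mille, taken from the table
print(f"{per_mille_to_fraction(ile_lys):.6f}")  # 0.008398
print(f"{per_mille_to_percent(ile_lys):.4f}")   # 0.8398
```

On the same scale, the uniform expectation of 0.25% corresponds to 2.5‰.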

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency ± estimated error in ‰.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 0.595 ± 0.291 | 0.223 ± 0.104 | 2.007 ± 0.313 | 3.864 ± 0.509 | 3.047 ± 0.579 | 4.013 ± 0.697 | 1.189 ± 0.318 | 4.385 ± 0.637 | 6.168 ± 0.727 | 4.31 ± 0.703 | 1.263 ± 0.411 | 3.047 ± 0.447 | 1.858 ± 0.38 | 2.378 ± 0.464 | 2.675 ± 0.415 | 4.385 ± 0.676 | 3.27 ± 0.467 | 3.939 ± 0.869 | 0.966 ± 0.344 | 2.675 ± 0.504 | 0.0 ± 0.0 |
| Cys | 0.149 ± 0.085 | 0.074 ± 0.068 | 0.446 ± 0.186 | 0.297 ± 0.17 | 0.297 ± 0.156 | 0.223 ± 0.112 | 0.074 ± 0.075 | 0.372 ± 0.17 | 0.446 ± 0.169 | 0.223 ± 0.125 | 0.074 ± 0.069 | 0.223 ± 0.118 | 0.297 ± 0.185 | 0.149 ± 0.103 | 0.297 ± 0.158 | 0.52 ± 0.199 | 0.149 ± 0.099 | 0.446 ± 0.169 | 0.074 ± 0.075 | 0.446 ± 0.166 | 0.0 ± 0.0 |
| Asp | 3.939 ± 0.577 | 0.52 ± 0.192 | 4.756 ± 0.787 | 6.02 ± 0.821 | 3.344 ± 0.615 | 4.236 ± 0.618 | 0.223 ± 0.133 | 4.831 ± 0.586 | 5.276 ± 0.688 | 5.425 ± 0.652 | 2.304 ± 0.421 | 3.419 ± 0.512 | 0.892 ± 0.263 | 1.04 ± 0.277 | 1.932 ± 0.413 | 3.567 ± 0.475 | 3.344 ± 0.491 | 4.087 ± 0.578 | 0.595 ± 0.243 | 2.378 ± 0.389 | 0.0 ± 0.0 |
| Glu | 3.641 ± 0.518 | 0.669 ± 0.196 | 3.641 ± 0.608 | 5.797 ± 0.834 | 3.641 ± 0.563 | 3.196 ± 0.419 | 1.263 ± 0.333 | 6.763 ± 0.862 | 6.317 ± 0.837 | 7.952 ± 0.99 | 2.081 ± 0.319 | 4.831 ± 0.686 | 2.007 ± 0.363 | 3.864 ± 0.59 | 2.973 ± 0.394 | 3.344 ± 0.473 | 3.716 ± 0.486 | 5.276 ± 0.621 | 1.263 ± 0.305 | 4.236 ± 0.619 | 0.0 ± 0.0 |
| Phe | 1.784 ± 0.395 | 0.297 ± 0.131 | 4.385 ± 0.498 | 3.567 ± 0.52 | 0.966 ± 0.249 | 2.452 ± 0.568 | 0.669 ± 0.196 | 3.79 ± 0.606 | 4.459 ± 0.556 | 2.973 ± 0.441 | 0.817 ± 0.234 | 3.196 ± 0.427 | 0.743 ± 0.308 | 0.892 ± 0.312 | 1.486 ± 0.281 | 3.196 ± 0.623 | 3.27 ± 0.565 | 2.898 ± 0.526 | 0.372 ± 0.175 | 2.007 ± 0.417 | 0.0 ± 0.0 |
| Gly | 4.236 ± 0.698 | 0.297 ± 0.162 | 3.419 ± 0.521 | 2.824 ± 0.475 | 2.824 ± 0.525 | 3.121 ± 0.564 | 1.486 ± 0.34 | 4.236 ± 0.587 | 5.648 ± 0.564 | 4.608 ± 0.725 | 1.561 ± 0.345 | 3.27 ± 0.541 | 0.52 ± 0.211 | 2.378 ± 0.411 | 2.229 ± 0.45 | 3.344 ± 0.482 | 3.641 ± 0.521 | 4.31 ± 0.797 | 1.115 ± 0.411 | 3.047 ± 0.607 | 0.0 ± 0.0 |
| His | 1.263 ± 0.297 | 0.0 ± 0.0 | 0.669 ± 0.226 | 1.263 ± 0.328 | 1.189 ± 0.288 | 0.966 ± 0.195 | 0.669 ± 0.246 | 1.338 ± 0.281 | 1.04 ± 0.279 | 1.561 ± 0.371 | 0.223 ± 0.119 | 1.263 ± 0.281 | 1.189 ± 0.25 | 0.669 ± 0.237 | 0.52 ± 0.231 | 1.189 ± 0.252 | 1.709 ± 0.303 | 0.892 ± 0.257 | 0.074 ± 0.071 | 0.892 ± 0.305 | 0.0 ± 0.0 |
| Ile | 4.385 ± 0.67 | 0.149 ± 0.114 | 5.425 ± 0.631 | 6.911 ± 0.865 | 2.675 ± 0.526 | 5.276 ± 0.788 | 0.966 ± 0.26 | 4.087 ± 0.603 | 8.398 ± 0.853 | 4.608 ± 0.579 | 2.155 ± 0.373 | 5.276 ± 0.815 | 2.527 ± 0.357 | 2.898 ± 0.485 | 3.344 ± 0.54 | 4.162 ± 0.649 | 4.682 ± 0.814 | 4.162 ± 0.565 | 0.892 ± 0.334 | 3.121 ± 0.622 | 0.0 ± 0.0 |
| Lys | 5.499 ± 0.548 | 0.297 ± 0.177 | 5.499 ± 0.706 | 8.026 ± 0.784 | 4.087 ± 0.601 | 5.871 ± 0.706 | 2.304 ± 0.408 | 7.134 ± 0.789 | 7.952 ± 0.839 | 6.911 ± 0.777 | 2.304 ± 0.349 | 4.905 ± 0.653 | 2.75 ± 0.481 | 3.79 ± 0.555 | 4.979 ± 0.547 | 5.202 ± 0.617 | 5.574 ± 0.697 | 5.128 ± 0.609 | 0.966 ± 0.265 | 3.939 ± 0.596 | 0.0 ± 0.0 |
| Leu | 4.162 ± 0.584 | 0.372 ± 0.192 | 4.756 ± 0.619 | 5.722 ± 0.988 | 3.716 ± 0.588 | 3.419 ± 0.538 | 1.561 ± 0.354 | 5.722 ± 0.535 | 7.58 ± 0.648 | 5.425 ± 0.67 | 1.412 ± 0.424 | 5.276 ± 0.579 | 2.081 ± 0.415 | 3.344 ± 0.397 | 3.419 ± 0.571 | 4.756 ± 0.638 | 4.979 ± 0.778 | 3.641 ± 0.52 | 0.595 ± 0.25 | 3.419 ± 0.612 | 0.0 ± 0.0 |
| Met | 1.338 ± 0.465 | 0.149 ± 0.113 | 1.115 ± 0.269 | 1.561 ± 0.311 | 1.115 ± 0.263 | 0.892 ± 0.284 | 0.669 ± 0.217 | 1.486 ± 0.303 | 2.155 ± 0.393 | 2.155 ± 0.361 | 0.743 ± 0.233 | 2.081 ± 0.452 | 0.817 ± 0.229 | 1.486 ± 0.381 | 0.966 ± 0.254 | 1.635 ± 0.381 | 2.007 ± 0.363 | 1.115 ± 0.269 | 0.595 ± 0.176 | 1.189 ± 0.313 | 0.0 ± 0.0 |
| Asn | 3.716 ± 0.608 | 0.297 ± 0.146 | 5.128 ± 0.594 | 4.831 ± 0.602 | 2.601 ± 0.498 | 4.756 ± 0.764 | 1.115 ± 0.356 | 4.385 ± 0.574 | 6.317 ± 0.794 | 3.493 ± 0.683 | 1.486 ± 0.342 | 5.202 ± 0.858 | 2.898 ± 0.456 | 2.229 ± 0.385 | 2.229 ± 0.413 | 3.641 ± 0.516 | 4.087 ± 0.506 | 3.196 ± 0.533 | 0.817 ± 0.225 | 3.121 ± 0.486 | 0.0 ± 0.0 |
| Pro | 1.338 ± 0.292 | 0.149 ± 0.116 | 1.412 ± 0.324 | 1.784 ± 0.378 | 1.709 ± 0.31 | 1.486 ± 0.49 | 0.52 ± 0.208 | 2.229 ± 0.5 | 3.047 ± 0.545 | 1.338 ± 0.264 | 0.743 ± 0.237 | 1.932 ± 0.453 | 0.372 ± 0.149 | 0.966 ± 0.272 | 0.966 ± 0.249 | 2.007 ± 0.412 | 2.081 ± 0.334 | 2.007 ± 0.355 | 0.223 ± 0.135 | 1.412 ± 0.343 | 0.0 ± 0.0 |
| Gln | 3.419 ± 0.454 | 0.372 ± 0.169 | 1.561 ± 0.365 | 3.047 ± 0.564 | 2.007 ± 0.359 | 2.304 ± 0.382 | 1.04 ± 0.242 | 2.75 ± 0.364 | 2.304 ± 0.438 | 2.675 ± 0.417 | 0.892 ± 0.253 | 2.229 ± 0.415 | 1.709 ± 0.403 | 1.635 ± 0.407 | 2.007 ± 0.359 | 2.229 ± 0.373 | 1.412 ± 0.271 | 2.229 ± 0.546 | 0.297 ± 0.155 | 1.412 ± 0.394 | 0.0 ± 0.0 |
| Arg | 1.635 ± 0.326 | 0.372 ± 0.175 | 2.824 ± 0.482 | 3.641 ± 0.544 | 1.932 ± 0.344 | 1.858 ± 0.401 | 1.189 ± 0.323 | 3.344 ± 0.458 | 3.567 ± 0.483 | 4.087 ± 0.608 | 1.04 ± 0.289 | 2.824 ± 0.45 | 0.966 ± 0.279 | 1.709 ± 0.343 | 1.115 ± 0.288 | 1.486 ± 0.327 | 1.486 ± 0.353 | 2.007 ± 0.35 | 0.297 ± 0.134 | 2.304 ± 0.434 | 0.0 ± 0.0 |
| Ser | 3.641 ± 0.489 | 0.223 ± 0.151 | 4.087 ± 0.556 | 3.121 ± 0.492 | 2.378 ± 0.509 | 4.162 ± 0.616 | 0.966 ± 0.238 | 5.276 ± 0.645 | 5.797 ± 0.641 | 3.641 ± 0.487 | 2.452 ± 0.361 | 4.905 ± 0.716 | 1.04 ± 0.302 | 2.229 ± 0.508 | 1.858 ± 0.331 | 3.493 ± 0.453 | 4.162 ± 0.518 | 3.716 ± 0.6 | 0.743 ± 0.243 | 2.081 ± 0.416 | 0.0 ± 0.0 |
| Thr | 3.419 ± 0.634 | 0.074 ± 0.071 | 3.641 ± 0.499 | 4.459 ± 0.475 | 2.824 ± 0.55 | 4.013 ± 0.668 | 0.966 ± 0.221 | 4.608 ± 0.719 | 4.831 ± 0.743 | 4.905 ± 0.521 | 0.817 ± 0.256 | 4.385 ± 0.705 | 1.709 ± 0.348 | 2.007 ± 0.441 | 2.304 ± 0.441 | 4.905 ± 0.851 | 4.013 ± 0.658 | 3.864 ± 0.525 | 0.892 ± 0.307 | 2.304 ± 0.431 | 0.0 ± 0.0 |
| Val | 4.979 ± 0.836 | 0.297 ± 0.157 | 4.608 ± 0.748 | 5.425 ± 0.654 | 1.858 ± 0.351 | 2.675 ± 0.581 | 0.669 ± 0.196 | 4.608 ± 0.469 | 6.168 ± 0.57 | 4.533 ± 0.557 | 1.784 ± 0.33 | 3.196 ± 0.398 | 2.304 ± 0.406 | 1.189 ± 0.338 | 2.081 ± 0.326 | 3.641 ± 0.757 | 3.79 ± 0.599 | 3.567 ± 0.538 | 0.817 ± 0.249 | 2.452 ± 0.49 | 0.0 ± 0.0 |
| Trp | 0.966 ± 0.22 | 0.074 ± 0.075 | 0.669 ± 0.191 | 1.04 ± 0.211 | 0.372 ± 0.139 | 0.966 ± 0.339 | 0.223 ± 0.122 | 0.743 ± 0.239 | 1.115 ± 0.289 | 0.966 ± 0.286 | 0.223 ± 0.139 | 0.892 ± 0.255 | 0.074 ± 0.074 | 0.743 ± 0.211 | 0.372 ± 0.174 | 0.595 ± 0.24 | 0.892 ± 0.219 | 0.817 ± 0.243 | 0.0 ± 0.0 | 0.446 ± 0.19 | 0.0 ± 0.0 |
| Tyr | 2.155 ± 0.433 | 0.372 ± 0.154 | 2.081 ± 0.435 | 3.27 ± 0.549 | 1.635 ± 0.397 | 2.229 ± 0.472 | 0.817 ± 0.281 | 4.013 ± 0.571 | 4.533 ± 0.782 | 3.567 ± 0.523 | 0.817 ± 0.244 | 3.196 ± 0.564 | 0.966 ± 0.335 | 1.932 ± 0.316 | 1.932 ± 0.574 | 2.75 ± 0.501 | 2.675 ± 0.494 | 3.344 ± 0.502 | 0.595 ± 0.205 | 2.007 ± 0.517 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 67 proteins (13457 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
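The exact bootstrap procedure is not described on this page, so the following is only a minimal sketch of what bootstrapping at the protein level could look like. It assumes that whole proteins are resampled with replacement, that the frequency of one dipeptide is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates. The names `pair_counts` and `bootstrap_error`, the `seed` parameter, and the normalisation by the number of dipeptides are all hypothetical choices made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation (per mille) of one dipeptide frequency over
    n_boot protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    # Pre-compute (occurrences of `pair`, number of dipeptides) per protein.
    per_protein = [(pair_counts(s)[pair], max(len(s) - 1, 0)) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the frequency depends on which proteins happen to be in the proteome.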

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski