Amino acid dipeptide frequency for Staphylococcus phage SH-St 15644

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
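As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides in a set of protein sequences and labels each one relative to the uniform expectation of 1/400. The function names, the toy sequences, the use of overlapping windows, and the strict greater/less comparison are assumptions made for this example; the page does not state how Proteome-pI computes or colors its values.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); anything else is mapped to X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency} over all overlapping pairs, as fractions."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {a + b: (counts[a + b] / total if total else 0.0)
            for a, b in product(alphabet, repeat=2)}


def classify(freq, expected=1 / 400):
    """Label a frequency relative to the uniform expectation (0.25%)."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MKKLAE", "MKEL"]
freqs = dipeptide_frequencies(toy)
print(len(freqs))                    # 441 possible dipeptides including Xaa
print(round(freqs["MK"] * 1000, 1))  # MetLys in per mille: 250.0
print(classify(freqs["MK"]))         # over
```

With only eight dipeptides in the toy input, almost every entry is far from 0.25%; on a real proteome the labels become more informative.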

All values are presented in per mille (‰); to obtain fractions, multiply them by 10⁻³.
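The unit conversion can be checked with a few lines of Python. This is only a small sketch; the helper name per_mille_to_fraction is made up for the example, and the AlaAla value is the one listed in the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction by multiplying by 10^-3."""
    return value * 1e-3


# AlaAla is listed as 2.655 per mille in the table.
ala_ala = per_mille_to_fraction(2.655)
print(f"{ala_ala:.6f}")          # 0.002655
print(f"{ala_ala * 100:.4f} %")  # 0.2655 %

# The uniform expectation of 1/400 (0.25%) expressed in per mille.
uniform_per_mille = 1000 / 400
print(uniform_per_mille)         # 2.5
```

In the table's units the uniform expectation is therefore 2.5 ‰, so a value such as 2.655 ‰ lies slightly above it.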

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.655 ± 0.759
AlaCys: 0.359 ± 0.162
AlaAsp: 2.87 ± 0.466
AlaGlu: 4.162 ± 0.514
AlaPhe: 1.65 ± 0.278
AlaGly: 3.373 ± 0.68
AlaHis: 1.076 ± 0.271
AlaIle: 4.664 ± 0.702
AlaLys: 6.602 ± 1.113
AlaLeu: 5.238 ± 0.701
AlaMet: 1.148 ± 0.285
AlaAsn: 4.234 ± 0.816
AlaPro: 1.363 ± 0.372
AlaGln: 1.722 ± 0.414
AlaArg: 2.655 ± 0.372
AlaSer: 4.808 ± 0.762
AlaThr: 3.373 ± 0.48
AlaVal: 2.727 ± 0.436
AlaTrp: 1.292 ± 0.361
AlaTyr: 2.942 ± 0.501
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.144 ± 0.086
CysCys: 0.0 ± 0.0
CysAsp: 0.072 ± 0.086
CysGlu: 0.502 ± 0.181
CysPhe: 0.287 ± 0.15
CysGly: 0.359 ± 0.164
CysHis: 0.144 ± 0.099
CysIle: 0.431 ± 0.154
CysLys: 0.574 ± 0.201
CysLeu: 0.646 ± 0.244
CysMet: 0.144 ± 0.104
CysAsn: 0.287 ± 0.12
CysPro: 0.144 ± 0.107
CysGln: 0.144 ± 0.103
CysArg: 0.287 ± 0.183
CysSer: 0.215 ± 0.135
CysThr: 0.287 ± 0.136
CysVal: 0.072 ± 0.071
CysTrp: 0.0 ± 0.0
CysTyr: 0.359 ± 0.171
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.511 ± 0.546
AspCys: 0.431 ± 0.179
AspAsp: 4.018 ± 0.578
AspGlu: 5.023 ± 0.719
AspPhe: 3.444 ± 0.532
AspGly: 3.588 ± 0.662
AspHis: 0.646 ± 0.175
AspIle: 5.238 ± 0.505
AspLys: 7.104 ± 0.749
AspLeu: 5.956 ± 0.553
AspMet: 2.368 ± 0.437
AspAsn: 2.655 ± 0.361
AspPro: 1.076 ± 0.329
AspGln: 1.507 ± 0.256
AspArg: 2.153 ± 0.375
AspSer: 3.444 ± 0.598
AspThr: 3.588 ± 0.525
AspVal: 3.947 ± 0.426
AspTrp: 0.933 ± 0.256
AspTyr: 3.229 ± 0.568
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.879 ± 0.534
GluCys: 0.431 ± 0.195
GluAsp: 4.592 ± 0.624
GluGlu: 7.463 ± 1.054
GluPhe: 3.157 ± 0.63
GluGly: 3.66 ± 0.68
GluHis: 0.789 ± 0.23
GluIle: 5.956 ± 0.877
GluLys: 8.539 ± 0.978
GluLeu: 7.606 ± 0.805
GluMet: 2.368 ± 0.439
GluAsn: 5.525 ± 0.609
GluPro: 1.22 ± 0.312
GluGln: 3.014 ± 0.475
GluArg: 3.516 ± 0.635
GluSer: 3.444 ± 0.519
GluThr: 4.09 ± 0.557
GluVal: 3.875 ± 0.427
GluTrp: 0.933 ± 0.183
GluTyr: 2.87 ± 0.582
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.866 ± 0.405
PheCys: 0.431 ± 0.177
PheAsp: 3.157 ± 0.436
PheGlu: 3.373 ± 0.523
PhePhe: 1.076 ± 0.278
PheGly: 3.301 ± 0.549
PheHis: 0.574 ± 0.199
PheIle: 3.373 ± 0.582
PheLys: 4.449 ± 0.642
PheLeu: 1.937 ± 0.363
PheMet: 1.148 ± 0.258
PheAsn: 3.731 ± 0.556
PhePro: 0.861 ± 0.311
PheGln: 0.933 ± 0.226
PheArg: 1.22 ± 0.274
PheSer: 2.009 ± 0.38
PheThr: 1.794 ± 0.348
PheVal: 1.937 ± 0.385
PheTrp: 0.287 ± 0.127
PheTyr: 1.794 ± 0.381
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.305 ± 0.942
GlyCys: 0.144 ± 0.111
GlyAsp: 3.803 ± 0.44
GlyGlu: 3.66 ± 0.43
GlyPhe: 2.583 ± 0.39
GlyGly: 4.879 ± 1.075
GlyHis: 1.363 ± 0.384
GlyIle: 3.875 ± 0.576
GlyLys: 6.099 ± 0.639
GlyLeu: 5.525 ± 0.95
GlyMet: 1.363 ± 0.365
GlyAsn: 2.942 ± 0.571
GlyPro: 0.861 ± 0.203
GlyGln: 1.65 ± 0.398
GlyArg: 2.153 ± 0.484
GlySer: 3.516 ± 0.581
GlyThr: 4.018 ± 0.574
GlyVal: 4.592 ± 0.691
GlyTrp: 1.076 ± 0.328
GlyTyr: 2.44 ± 0.502
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.933 ± 0.246
HisCys: 0.072 ± 0.081
HisAsp: 0.718 ± 0.254
HisGlu: 0.933 ± 0.213
HisPhe: 0.861 ± 0.181
HisGly: 1.148 ± 0.276
HisHis: 0.431 ± 0.205
HisIle: 1.507 ± 0.429
HisLys: 1.292 ± 0.234
HisLeu: 1.363 ± 0.262
HisMet: 0.215 ± 0.104
HisAsn: 0.718 ± 0.206
HisPro: 0.933 ± 0.178
HisGln: 0.718 ± 0.223
HisArg: 0.933 ± 0.246
HisSer: 1.076 ± 0.238
HisThr: 1.076 ± 0.217
HisVal: 0.861 ± 0.31
HisTrp: 0.287 ± 0.161
HisTyr: 1.005 ± 0.269
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.592 ± 0.586
IleCys: 0.287 ± 0.18
IleAsp: 5.382 ± 0.773
IleGlu: 5.956 ± 0.678
IlePhe: 2.727 ± 0.554
IleGly: 3.229 ± 0.572
IleHis: 1.722 ± 0.304
IleIle: 4.521 ± 0.671
IleLys: 7.678 ± 0.753
IleLeu: 4.736 ± 0.679
IleMet: 1.579 ± 0.299
IleAsn: 4.951 ± 0.561
IlePro: 2.511 ± 0.364
IleGln: 2.009 ± 0.315
IleArg: 3.588 ± 0.461
IleSer: 4.664 ± 0.495
IleThr: 4.09 ± 0.561
IleVal: 3.731 ± 0.682
IleTrp: 0.574 ± 0.228
IleTyr: 2.511 ± 0.427
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.965 ± 1.08
LysCys: 0.287 ± 0.128
LysAsp: 5.382 ± 0.604
LysGlu: 9.257 ± 0.835
LysPhe: 2.655 ± 0.401
LysGly: 5.31 ± 0.9
LysHis: 1.866 ± 0.376
LysIle: 6.099 ± 0.649
LysLys: 7.965 ± 0.992
LysLeu: 9.328 ± 0.998
LysMet: 2.727 ± 0.413
LysAsn: 5.884 ± 0.619
LysPro: 2.511 ± 0.486
LysGln: 4.951 ± 0.608
LysArg: 4.018 ± 0.663
LysSer: 5.956 ± 1.18
LysThr: 4.879 ± 0.591
LysVal: 5.669 ± 0.696
LysTrp: 1.722 ± 0.404
LysTyr: 4.879 ± 0.666
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.305 ± 0.853
LeuCys: 0.502 ± 0.194
LeuAsp: 5.669 ± 0.788
LeuGlu: 6.673 ± 0.769
LeuPhe: 2.942 ± 0.431
LeuGly: 4.305 ± 0.849
LeuHis: 1.148 ± 0.348
LeuIle: 5.31 ± 0.594
LeuLys: 8.539 ± 1.248
LeuLeu: 6.745 ± 0.84
LeuMet: 2.009 ± 0.34
LeuAsn: 6.099 ± 0.616
LeuPro: 3.014 ± 0.473
LeuGln: 3.086 ± 0.641
LeuArg: 3.731 ± 0.505
LeuSer: 5.741 ± 0.778
LeuThr: 5.454 ± 0.731
LeuVal: 3.731 ± 0.427
LeuTrp: 0.431 ± 0.191
LeuTyr: 3.444 ± 0.788
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.292 ± 0.28
MetCys: 0.144 ± 0.103
MetAsp: 1.005 ± 0.303
MetGlu: 1.579 ± 0.324
MetPhe: 0.933 ± 0.209
MetGly: 1.507 ± 0.53
MetHis: 0.431 ± 0.206
MetIle: 1.22 ± 0.257
MetLys: 2.87 ± 0.569
MetLeu: 2.009 ± 0.504
MetMet: 0.359 ± 0.142
MetAsn: 1.794 ± 0.425
MetPro: 0.789 ± 0.176
MetGln: 1.507 ± 0.393
MetArg: 0.933 ± 0.23
MetSer: 2.009 ± 0.367
MetThr: 2.583 ± 0.34
MetVal: 0.933 ± 0.208
MetTrp: 0.215 ± 0.096
MetTyr: 0.933 ± 0.258
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.588 ± 0.512
AsnCys: 0.144 ± 0.104
AsnAsp: 4.305 ± 0.486
AsnGlu: 3.947 ± 0.55
AsnPhe: 1.722 ± 0.286
AsnGly: 4.305 ± 0.59
AsnHis: 0.861 ± 0.323
AsnIle: 4.09 ± 0.491
AsnLys: 7.176 ± 0.747
AsnLeu: 5.31 ± 0.525
AsnMet: 0.861 ± 0.209
AsnAsn: 4.162 ± 0.692
AsnPro: 2.583 ± 0.338
AsnGln: 2.727 ± 0.466
AsnArg: 2.799 ± 0.461
AsnSer: 4.521 ± 0.55
AsnThr: 4.018 ± 0.396
AsnVal: 3.301 ± 0.515
AsnTrp: 1.148 ± 0.277
AsnTyr: 2.511 ± 0.498
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.148 ± 0.241
ProCys: 0.215 ± 0.118
ProAsp: 1.363 ± 0.332
ProGlu: 2.153 ± 0.403
ProPhe: 1.22 ± 0.301
ProGly: 2.009 ± 0.38
ProHis: 0.287 ± 0.15
ProIle: 1.866 ± 0.345
ProLys: 2.081 ± 0.456
ProLeu: 2.583 ± 0.475
ProMet: 0.718 ± 0.253
ProAsn: 1.866 ± 0.36
ProPro: 0.646 ± 0.238
ProGln: 1.292 ± 0.338
ProArg: 1.148 ± 0.25
ProSer: 2.153 ± 0.292
ProThr: 1.722 ± 0.3
ProVal: 1.22 ± 0.347
ProTrp: 0.359 ± 0.152
ProTyr: 1.005 ± 0.273
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.583 ± 0.358
GlnCys: 0.215 ± 0.128
GlnAsp: 2.44 ± 0.373
GlnGlu: 2.44 ± 0.462
GlnPhe: 1.507 ± 0.308
GlnGly: 2.224 ± 0.407
GlnHis: 0.861 ± 0.229
GlnIle: 3.014 ± 0.645
GlnLys: 3.373 ± 0.394
GlnLeu: 3.588 ± 0.503
GlnMet: 0.861 ± 0.34
GlnAsn: 1.937 ± 0.42
GlnPro: 1.076 ± 0.238
GlnGln: 1.005 ± 0.377
GlnArg: 1.866 ± 0.418
GlnSer: 2.224 ± 0.362
GlnThr: 1.22 ± 0.339
GlnVal: 2.081 ± 0.401
GlnTrp: 0.359 ± 0.201
GlnTyr: 1.722 ± 0.337
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.44 ± 0.482
ArgCys: 0.0 ± 0.0
ArgAsp: 3.014 ± 0.326
ArgGlu: 2.799 ± 0.458
ArgPhe: 2.081 ± 0.376
ArgGly: 2.296 ± 0.425
ArgHis: 0.789 ± 0.22
ArgIle: 3.588 ± 0.549
ArgLys: 4.162 ± 0.589
ArgLeu: 3.301 ± 0.631
ArgMet: 1.005 ± 0.247
ArgAsn: 2.655 ± 0.42
ArgPro: 0.718 ± 0.323
ArgGln: 1.435 ± 0.281
ArgArg: 1.579 ± 0.365
ArgSer: 1.794 ± 0.352
ArgThr: 2.296 ± 0.403
ArgVal: 2.511 ± 0.413
ArgTrp: 0.431 ± 0.189
ArgTyr: 2.583 ± 0.544
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.875 ± 0.709
SerCys: 0.431 ± 0.176
SerAsp: 5.023 ± 0.533
SerGlu: 4.664 ± 0.583
SerPhe: 2.583 ± 0.5
SerGly: 4.449 ± 0.762
SerHis: 0.646 ± 0.17
SerIle: 4.018 ± 0.52
SerLys: 6.458 ± 1.11
SerLeu: 3.803 ± 0.466
SerMet: 2.153 ± 0.439
SerAsn: 4.592 ± 0.571
SerPro: 1.579 ± 0.351
SerGln: 2.655 ± 0.431
SerArg: 2.224 ± 0.364
SerSer: 4.09 ± 0.602
SerThr: 3.373 ± 0.534
SerVal: 3.803 ± 0.576
SerTrp: 0.789 ± 0.231
SerTyr: 1.937 ± 0.384
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.588 ± 0.545
ThrCys: 0.215 ± 0.124
ThrAsp: 3.373 ± 0.513
ThrGlu: 4.018 ± 0.436
ThrPhe: 2.87 ± 0.377
ThrGly: 4.449 ± 0.575
ThrHis: 1.507 ± 0.384
ThrIle: 4.736 ± 0.57
ThrLys: 5.382 ± 0.659
ThrLeu: 4.449 ± 0.533
ThrMet: 0.789 ± 0.252
ThrAsn: 3.588 ± 0.565
ThrPro: 2.296 ± 0.361
ThrGln: 1.507 ± 0.281
ThrArg: 1.794 ± 0.339
ThrSer: 3.157 ± 0.468
ThrThr: 2.727 ± 0.513
ThrVal: 3.947 ± 0.499
ThrTrp: 0.431 ± 0.212
ThrTyr: 2.009 ± 0.409
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.086 ± 0.482
ValCys: 0.287 ± 0.141
ValAsp: 3.731 ± 0.502
ValGlu: 5.525 ± 0.595
ValPhe: 2.368 ± 0.482
ValGly: 3.516 ± 0.537
ValHis: 1.005 ± 0.23
ValIle: 4.09 ± 0.493
ValLys: 4.664 ± 0.675
ValLeu: 3.875 ± 0.488
ValMet: 1.435 ± 0.258
ValAsn: 3.229 ± 0.486
ValPro: 1.435 ± 0.352
ValGln: 2.224 ± 0.386
ValArg: 2.44 ± 0.378
ValSer: 4.09 ± 0.484
ValThr: 3.086 ± 0.571
ValVal: 3.086 ± 0.45
ValTrp: 0.502 ± 0.187
ValTyr: 2.081 ± 0.481
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.718 ± 0.191
TrpCys: 0.0 ± 0.0
TrpAsp: 0.574 ± 0.217
TrpGlu: 0.789 ± 0.316
TrpPhe: 1.22 ± 0.268
TrpGly: 0.359 ± 0.154
TrpHis: 0.0 ± 0.0
TrpIle: 0.789 ± 0.236
TrpLys: 0.861 ± 0.265
TrpLeu: 1.22 ± 0.266
TrpMet: 0.359 ± 0.214
TrpAsn: 0.861 ± 0.241
TrpPro: 0.287 ± 0.232
TrpGln: 0.646 ± 0.188
TrpArg: 0.502 ± 0.187
TrpSer: 1.076 ± 0.39
TrpThr: 0.646 ± 0.191
TrpVal: 0.718 ± 0.187
TrpTrp: 0.144 ± 0.11
TrpTyr: 0.718 ± 0.211
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.44 ± 0.291
TyrCys: 0.502 ± 0.198
TyrAsp: 2.368 ± 0.522
TyrGlu: 3.014 ± 0.542
TyrPhe: 1.363 ± 0.343
TyrGly: 2.583 ± 0.594
TyrHis: 0.933 ± 0.323
TyrIle: 2.655 ± 0.492
TyrLys: 3.444 ± 0.453
TyrLeu: 3.66 ± 0.568
TyrMet: 1.363 ± 0.254
TyrAsn: 2.368 ± 0.483
TyrPro: 1.148 ± 0.275
TyrGln: 1.937 ± 0.379
TyrArg: 1.937 ± 0.52
TyrSer: 3.157 ± 0.415
TyrThr: 2.511 ± 0.473
TyrVal: 3.014 ± 0.522
TyrTrp: 0.574 ± 0.166
TyrTyr: 1.363 ± 0.417
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (13937 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
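Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is a minimal sketch under stated assumptions, not the Proteome-pI implementation: the function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are all choices made for the example.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the 64 phage proteins.
toy = ["MKKLAE", "MKEL", "MAKKE", "MLEKK"]
print(dipeptide_per_mille(toy, "KK"))  # 187.5
print(bootstrap_error(toy, "KK"))      # value depends on the seed
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins produce a correspondingly larger error.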

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski