Amino acid dipepetide frequency for Staphylococcus phage vB_SepS_SEP9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.153AlaAla: 2.153 ± 0.687
0.371AlaCys: 0.371 ± 0.136
2.635AlaAsp: 2.635 ± 0.292
3.563AlaGlu: 3.563 ± 0.406
2.041AlaPhe: 2.041 ± 0.27
2.858AlaGly: 2.858 ± 0.633
0.742AlaHis: 0.742 ± 0.212
3.785AlaIle: 3.785 ± 0.473
3.785AlaLys: 3.785 ± 0.4
3.748AlaLeu: 3.748 ± 0.421
1.707AlaMet: 1.707 ± 0.303
2.301AlaAsn: 2.301 ± 0.379
0.891AlaPro: 0.891 ± 0.195
1.41AlaGln: 1.41 ± 0.417
2.301AlaArg: 2.301 ± 0.293
2.487AlaSer: 2.487 ± 0.393
2.635AlaThr: 2.635 ± 0.44
2.858AlaVal: 2.858 ± 0.415
0.52AlaTrp: 0.52 ± 0.161
1.819AlaTyr: 1.819 ± 0.243
0.0AlaXaa: 0.0 ± 0.0
Cys
0.26CysAla: 0.26 ± 0.084
0.037CysCys: 0.037 ± 0.035
0.482CysAsp: 0.482 ± 0.138
0.371CysGlu: 0.371 ± 0.11
0.148CysPhe: 0.148 ± 0.076
0.557CysGly: 0.557 ± 0.146
0.223CysHis: 0.223 ± 0.085
0.779CysIle: 0.779 ± 0.195
0.816CysLys: 0.816 ± 0.191
0.482CysLeu: 0.482 ± 0.15
0.297CysMet: 0.297 ± 0.11
0.482CysAsn: 0.482 ± 0.141
0.26CysPro: 0.26 ± 0.12
0.223CysGln: 0.223 ± 0.098
0.111CysArg: 0.111 ± 0.066
0.445CysSer: 0.445 ± 0.133
0.371CysThr: 0.371 ± 0.127
0.594CysVal: 0.594 ± 0.164
0.223CysTrp: 0.223 ± 0.102
0.742CysTyr: 0.742 ± 0.196
0.0CysXaa: 0.0 ± 0.0
Asp
2.524AspAla: 2.524 ± 0.227
0.445AspCys: 0.445 ± 0.129
4.416AspAsp: 4.416 ± 0.521
5.418AspGlu: 5.418 ± 0.561
3.006AspPhe: 3.006 ± 0.405
4.862AspGly: 4.862 ± 0.53
0.408AspHis: 0.408 ± 0.095
6.717AspIle: 6.717 ± 0.513
6.977AspLys: 6.977 ± 0.645
5.567AspLeu: 5.567 ± 0.486
2.115AspMet: 2.115 ± 0.274
4.825AspAsn: 4.825 ± 0.482
1.188AspPro: 1.188 ± 0.219
0.854AspGln: 0.854 ± 0.167
2.524AspArg: 2.524 ± 0.394
4.379AspSer: 4.379 ± 0.44
3.303AspThr: 3.303 ± 0.411
3.674AspVal: 3.674 ± 0.39
0.816AspTrp: 0.816 ± 0.166
4.231AspTyr: 4.231 ± 0.499
0.0AspXaa: 0.0 ± 0.0
Glu
3.229GluAla: 3.229 ± 0.37
0.705GluCys: 0.705 ± 0.191
5.233GluAsp: 5.233 ± 0.54
6.606GluGlu: 6.606 ± 0.836
3.043GluPhe: 3.043 ± 0.453
3.08GluGly: 3.08 ± 0.353
1.076GluHis: 1.076 ± 0.229
6.086GluIle: 6.086 ± 0.661
8.202GluLys: 8.202 ± 0.461
8.239GluLeu: 8.239 ± 0.835
2.264GluMet: 2.264 ± 0.342
5.307GluAsn: 5.307 ± 0.63
1.67GluPro: 1.67 ± 0.243
3.823GluGln: 3.823 ± 0.354
3.043GluArg: 3.043 ± 0.348
3.674GluSer: 3.674 ± 0.34
3.934GluThr: 3.934 ± 0.371
3.971GluVal: 3.971 ± 0.471
1.225GluTrp: 1.225 ± 0.224
3.971GluTyr: 3.971 ± 0.475
0.0GluXaa: 0.0 ± 0.0
Phe
2.264PheAla: 2.264 ± 0.325
0.297PheCys: 0.297 ± 0.099
3.117PheAsp: 3.117 ± 0.46
2.524PheGlu: 2.524 ± 0.344
1.819PhePhe: 1.819 ± 0.273
2.338PheGly: 2.338 ± 0.557
0.371PheHis: 0.371 ± 0.112
4.008PheIle: 4.008 ± 0.461
3.934PheLys: 3.934 ± 0.348
3.155PheLeu: 3.155 ± 0.518
0.928PheMet: 0.928 ± 0.214
3.229PheAsn: 3.229 ± 0.337
0.445PhePro: 0.445 ± 0.114
0.891PheGln: 0.891 ± 0.182
1.113PheArg: 1.113 ± 0.258
2.561PheSer: 2.561 ± 0.304
2.115PheThr: 2.115 ± 0.26
2.19PheVal: 2.19 ± 0.239
0.334PheTrp: 0.334 ± 0.108
2.412PheTyr: 2.412 ± 0.346
0.0PheXaa: 0.0 ± 0.0
Gly
2.895GlyAla: 2.895 ± 0.597
0.334GlyCys: 0.334 ± 0.111
3.6GlyAsp: 3.6 ± 0.397
3.043GlyGlu: 3.043 ± 0.348
2.746GlyPhe: 2.746 ± 0.289
3.377GlyGly: 3.377 ± 0.568
0.965GlyHis: 0.965 ± 0.22
4.713GlyIle: 4.713 ± 0.557
5.678GlyLys: 5.678 ± 0.612
5.233GlyLeu: 5.233 ± 0.488
1.67GlyMet: 1.67 ± 0.311
3.971GlyAsn: 3.971 ± 0.399
0.631GlyPro: 0.631 ± 0.185
1.856GlyGln: 1.856 ± 0.387
2.301GlyArg: 2.301 ± 0.307
3.934GlySer: 3.934 ± 0.444
3.823GlyThr: 3.823 ± 0.523
3.971GlyVal: 3.971 ± 0.548
1.076GlyTrp: 1.076 ± 0.364
3.155GlyTyr: 3.155 ± 0.424
0.0GlyXaa: 0.0 ± 0.0
His
0.705HisAla: 0.705 ± 0.174
0.334HisCys: 0.334 ± 0.14
1.076HisAsp: 1.076 ± 0.227
1.262HisGlu: 1.262 ± 0.239
0.779HisPhe: 0.779 ± 0.206
1.188HisGly: 1.188 ± 0.221
0.52HisHis: 0.52 ± 0.141
1.522HisIle: 1.522 ± 0.24
1.299HisLys: 1.299 ± 0.189
1.039HisLeu: 1.039 ± 0.188
0.445HisMet: 0.445 ± 0.134
1.447HisAsn: 1.447 ± 0.204
0.297HisPro: 0.297 ± 0.087
0.297HisGln: 0.297 ± 0.101
0.965HisArg: 0.965 ± 0.166
1.039HisSer: 1.039 ± 0.212
0.854HisThr: 0.854 ± 0.19
0.742HisVal: 0.742 ± 0.156
0.074HisTrp: 0.074 ± 0.062
0.779HisTyr: 0.779 ± 0.183
0.0HisXaa: 0.0 ± 0.0
Ile
3.08IleAla: 3.08 ± 0.425
0.631IleCys: 0.631 ± 0.151
6.124IleAsp: 6.124 ± 0.47
6.012IleGlu: 6.012 ± 0.603
2.746IlePhe: 2.746 ± 0.34
3.934IleGly: 3.934 ± 0.479
1.113IleHis: 1.113 ± 0.214
5.752IleIle: 5.752 ± 0.732
9.352IleLys: 9.352 ± 0.629
6.42IleLeu: 6.42 ± 0.639
1.967IleMet: 1.967 ± 0.28
5.752IleAsn: 5.752 ± 0.529
2.746IlePro: 2.746 ± 0.337
2.227IleGln: 2.227 ± 0.343
3.266IleArg: 3.266 ± 0.393
4.862IleSer: 4.862 ± 0.355
4.528IleThr: 4.528 ± 0.415
5.27IleVal: 5.27 ± 0.477
0.816IleTrp: 0.816 ± 0.301
3.192IleTyr: 3.192 ± 0.478
0.0IleXaa: 0.0 ± 0.0
Lys
4.008LysAla: 4.008 ± 0.578
0.557LysCys: 0.557 ± 0.134
6.754LysAsp: 6.754 ± 0.449
9.724LysGlu: 9.724 ± 0.652
2.598LysPhe: 2.598 ± 0.364
6.903LysGly: 6.903 ± 0.851
2.524LysHis: 2.524 ± 0.326
6.309LysIle: 6.309 ± 0.614
9.278LysLys: 9.278 ± 0.718
5.79LysLeu: 5.79 ± 0.445
3.006LysMet: 3.006 ± 0.338
7.311LysAsn: 7.311 ± 0.627
2.115LysPro: 2.115 ± 0.265
4.528LysGln: 4.528 ± 0.486
4.008LysArg: 4.008 ± 0.349
5.715LysSer: 5.715 ± 0.636
5.307LysThr: 5.307 ± 0.412
5.938LysVal: 5.938 ± 0.444
0.965LysTrp: 0.965 ± 0.17
4.713LysTyr: 4.713 ± 0.39
0.0LysXaa: 0.0 ± 0.0
Leu
3.563LeuAla: 3.563 ± 0.428
0.705LeuCys: 0.705 ± 0.183
6.458LeuAsp: 6.458 ± 0.625
7.311LeuGlu: 7.311 ± 0.789
2.524LeuPhe: 2.524 ± 0.359
4.825LeuGly: 4.825 ± 0.665
1.373LeuHis: 1.373 ± 0.246
5.678LeuIle: 5.678 ± 0.548
8.016LeuLys: 8.016 ± 0.626
6.124LeuLeu: 6.124 ± 0.668
1.893LeuMet: 1.893 ± 0.316
5.938LeuAsn: 5.938 ± 0.528
2.19LeuPro: 2.19 ± 0.289
2.746LeuGln: 2.746 ± 0.326
2.449LeuArg: 2.449 ± 0.349
5.01LeuSer: 5.01 ± 0.412
4.342LeuThr: 4.342 ± 0.407
4.416LeuVal: 4.416 ± 0.367
0.965LeuTrp: 0.965 ± 0.242
3.785LeuTyr: 3.785 ± 0.44
0.0LeuXaa: 0.0 ± 0.0
Met
1.93MetAla: 1.93 ± 0.333
0.26MetCys: 0.26 ± 0.098
1.373MetAsp: 1.373 ± 0.237
2.004MetGlu: 2.004 ± 0.274
1.113MetPhe: 1.113 ± 0.2
1.188MetGly: 1.188 ± 0.289
0.371MetHis: 0.371 ± 0.097
1.707MetIle: 1.707 ± 0.231
2.858MetLys: 2.858 ± 0.334
1.707MetLeu: 1.707 ± 0.308
0.668MetMet: 0.668 ± 0.157
2.078MetAsn: 2.078 ± 0.285
0.705MetPro: 0.705 ± 0.183
0.668MetGln: 0.668 ± 0.18
0.891MetArg: 0.891 ± 0.217
2.19MetSer: 2.19 ± 0.326
1.373MetThr: 1.373 ± 0.226
1.373MetVal: 1.373 ± 0.2
0.408MetTrp: 0.408 ± 0.125
1.076MetTyr: 1.076 ± 0.227
0.0MetXaa: 0.0 ± 0.0
Asn
2.561AsnAla: 2.561 ± 0.338
0.594AsnCys: 0.594 ± 0.145
4.565AsnAsp: 4.565 ± 0.506
5.27AsnGlu: 5.27 ± 0.622
2.672AsnPhe: 2.672 ± 0.291
5.456AsnGly: 5.456 ± 0.544
1.336AsnHis: 1.336 ± 0.188
6.495AsnIle: 6.495 ± 0.666
7.831AsnLys: 7.831 ± 0.653
4.565AsnLeu: 4.565 ± 0.469
1.893AsnMet: 1.893 ± 0.219
5.344AsnAsn: 5.344 ± 0.547
1.893AsnPro: 1.893 ± 0.251
1.893AsnGln: 1.893 ± 0.286
3.303AsnArg: 3.303 ± 0.349
4.194AsnSer: 4.194 ± 0.351
3.6AsnThr: 3.6 ± 0.413
4.157AsnVal: 4.157 ± 0.379
0.742AsnTrp: 0.742 ± 0.18
3.155AsnTyr: 3.155 ± 0.406
0.0AsnXaa: 0.0 ± 0.0
Pro
0.965ProAla: 0.965 ± 0.219
0.148ProCys: 0.148 ± 0.079
1.188ProAsp: 1.188 ± 0.249
2.338ProGlu: 2.338 ± 0.313
0.816ProPhe: 0.816 ± 0.174
0.816ProGly: 0.816 ± 0.194
0.408ProHis: 0.408 ± 0.103
1.893ProIle: 1.893 ± 0.258
1.633ProLys: 1.633 ± 0.26
1.819ProLeu: 1.819 ± 0.27
0.594ProMet: 0.594 ± 0.141
1.781ProAsn: 1.781 ± 0.278
0.52ProPro: 0.52 ± 0.157
0.779ProGln: 0.779 ± 0.168
0.779ProArg: 0.779 ± 0.157
1.485ProSer: 1.485 ± 0.217
1.41ProThr: 1.41 ± 0.263
1.67ProVal: 1.67 ± 0.234
0.074ProTrp: 0.074 ± 0.056
1.336ProTyr: 1.336 ± 0.208
0.0ProXaa: 0.0 ± 0.0
Gln
1.596GlnAla: 1.596 ± 0.366
0.223GlnCys: 0.223 ± 0.09
1.67GlnAsp: 1.67 ± 0.264
3.526GlnGlu: 3.526 ± 0.331
1.447GlnPhe: 1.447 ± 0.25
1.633GlnGly: 1.633 ± 0.32
0.482GlnHis: 0.482 ± 0.139
2.449GlnIle: 2.449 ± 0.224
2.487GlnLys: 2.487 ± 0.331
3.006GlnLeu: 3.006 ± 0.447
1.113GlnMet: 1.113 ± 0.31
2.153GlnAsn: 2.153 ± 0.245
0.482GlnPro: 0.482 ± 0.155
1.41GlnGln: 1.41 ± 0.245
1.67GlnArg: 1.67 ± 0.299
2.041GlnSer: 2.041 ± 0.259
1.41GlnThr: 1.41 ± 0.26
1.633GlnVal: 1.633 ± 0.274
0.779GlnTrp: 0.779 ± 0.246
1.299GlnTyr: 1.299 ± 0.22
0.0GlnXaa: 0.0 ± 0.0
Arg
1.596ArgAla: 1.596 ± 0.269
0.408ArgCys: 0.408 ± 0.149
2.487ArgAsp: 2.487 ± 0.309
2.783ArgGlu: 2.783 ± 0.368
1.707ArgPhe: 1.707 ± 0.187
2.301ArgGly: 2.301 ± 0.288
0.854ArgHis: 0.854 ± 0.189
3.043ArgIle: 3.043 ± 0.324
3.748ArgLys: 3.748 ± 0.402
2.746ArgLeu: 2.746 ± 0.364
0.705ArgMet: 0.705 ± 0.192
3.117ArgAsn: 3.117 ± 0.389
0.779ArgPro: 0.779 ± 0.154
1.373ArgGln: 1.373 ± 0.258
1.596ArgArg: 1.596 ± 0.241
2.153ArgSer: 2.153 ± 0.393
1.707ArgThr: 1.707 ± 0.198
2.338ArgVal: 2.338 ± 0.246
0.668ArgTrp: 0.668 ± 0.173
2.153ArgTyr: 2.153 ± 0.239
0.0ArgXaa: 0.0 ± 0.0
Ser
2.895SerAla: 2.895 ± 0.309
0.482SerCys: 0.482 ± 0.156
4.491SerAsp: 4.491 ± 0.412
4.157SerGlu: 4.157 ± 0.47
3.155SerPhe: 3.155 ± 0.294
3.637SerGly: 3.637 ± 0.477
0.965SerHis: 0.965 ± 0.215
4.528SerIle: 4.528 ± 0.475
5.567SerLys: 5.567 ± 0.502
5.456SerLeu: 5.456 ± 0.38
1.447SerMet: 1.447 ± 0.349
3.897SerAsn: 3.897 ± 0.393
1.336SerPro: 1.336 ± 0.211
2.301SerGln: 2.301 ± 0.289
1.819SerArg: 1.819 ± 0.301
4.231SerSer: 4.231 ± 0.484
3.711SerThr: 3.711 ± 0.399
4.157SerVal: 4.157 ± 0.459
0.631SerTrp: 0.631 ± 0.144
2.895SerTyr: 2.895 ± 0.396
0.0SerXaa: 0.0 ± 0.0
Thr
2.746ThrAla: 2.746 ± 0.469
0.371ThrCys: 0.371 ± 0.107
3.748ThrAsp: 3.748 ± 0.392
4.12ThrGlu: 4.12 ± 0.433
2.524ThrPhe: 2.524 ± 0.308
3.414ThrGly: 3.414 ± 0.727
1.039ThrHis: 1.039 ± 0.206
4.602ThrIle: 4.602 ± 0.418
4.194ThrLys: 4.194 ± 0.405
4.936ThrLeu: 4.936 ± 0.435
0.854ThrMet: 0.854 ± 0.224
3.006ThrAsn: 3.006 ± 0.424
1.819ThrPro: 1.819 ± 0.244
2.078ThrGln: 2.078 ± 0.27
1.744ThrArg: 1.744 ± 0.248
2.783ThrSer: 2.783 ± 0.39
3.303ThrThr: 3.303 ± 0.343
3.414ThrVal: 3.414 ± 0.566
0.52ThrTrp: 0.52 ± 0.121
2.524ThrTyr: 2.524 ± 0.299
0.0ThrXaa: 0.0 ± 0.0
Val
3.229ValAla: 3.229 ± 0.465
0.445ValCys: 0.445 ± 0.152
4.194ValAsp: 4.194 ± 0.355
4.082ValGlu: 4.082 ± 0.457
2.375ValPhe: 2.375 ± 0.25
2.858ValGly: 2.858 ± 0.345
0.891ValHis: 0.891 ± 0.161
5.122ValIle: 5.122 ± 0.435
5.938ValLys: 5.938 ± 0.591
4.788ValLeu: 4.788 ± 0.428
1.299ValMet: 1.299 ± 0.229
4.75ValAsn: 4.75 ± 0.49
1.41ValPro: 1.41 ± 0.263
1.67ValGln: 1.67 ± 0.27
2.004ValArg: 2.004 ± 0.295
4.305ValSer: 4.305 ± 0.402
3.192ValThr: 3.192 ± 0.303
3.823ValVal: 3.823 ± 0.374
0.742ValTrp: 0.742 ± 0.196
2.449ValTyr: 2.449 ± 0.303
0.0ValXaa: 0.0 ± 0.0
Trp
0.705TrpAla: 0.705 ± 0.183
0.148TrpCys: 0.148 ± 0.072
0.668TrpAsp: 0.668 ± 0.218
0.928TrpGlu: 0.928 ± 0.196
0.742TrpPhe: 0.742 ± 0.255
0.631TrpGly: 0.631 ± 0.156
0.223TrpHis: 0.223 ± 0.087
0.965TrpIle: 0.965 ± 0.231
1.336TrpLys: 1.336 ± 0.39
1.076TrpLeu: 1.076 ± 0.235
0.111TrpMet: 0.111 ± 0.066
0.928TrpAsn: 0.928 ± 0.221
0.0TrpPro: 0.0 ± 0.0
0.408TrpGln: 0.408 ± 0.109
0.408TrpArg: 0.408 ± 0.108
0.779TrpSer: 0.779 ± 0.213
0.594TrpThr: 0.594 ± 0.204
0.742TrpVal: 0.742 ± 0.179
0.186TrpTrp: 0.186 ± 0.087
0.668TrpTyr: 0.668 ± 0.135
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.707TyrAla: 1.707 ± 0.244
0.408TyrCys: 0.408 ± 0.129
3.971TyrAsp: 3.971 ± 0.372
3.34TyrGlu: 3.34 ± 0.414
2.041TyrPhe: 2.041 ± 0.312
3.117TyrGly: 3.117 ± 0.466
0.779TyrHis: 0.779 ± 0.145
3.637TyrIle: 3.637 ± 0.462
5.122TyrLys: 5.122 ± 0.433
4.416TyrLeu: 4.416 ± 0.526
0.928TyrMet: 0.928 ± 0.189
3.897TyrAsn: 3.897 ± 0.455
1.076TyrPro: 1.076 ± 0.212
1.076TyrGln: 1.076 ± 0.208
1.967TyrArg: 1.967 ± 0.249
3.489TyrSer: 3.489 ± 0.395
2.227TyrThr: 2.227 ± 0.295
2.672TyrVal: 2.672 ± 0.329
0.482TyrTrp: 0.482 ± 0.199
2.672TyrTyr: 2.672 ± 0.469
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 129 proteins (26946 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski