Amino acid dipepetide frequency for Staphylococcus phage Maine

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.047AlaAla: 0.047 ± 0.028
0.187AlaCys: 0.187 ± 0.06
2.172AlaAsp: 2.172 ± 0.247
2.99AlaGlu: 2.99 ± 0.359
1.869AlaPhe: 1.869 ± 0.194
2.336AlaGly: 2.336 ± 0.325
0.864AlaHis: 0.864 ± 0.175
3.644AlaIle: 3.644 ± 0.28
4.111AlaLys: 4.111 ± 0.384
3.48AlaLeu: 3.48 ± 0.31
1.168AlaMet: 1.168 ± 0.241
2.429AlaAsn: 2.429 ± 0.245
1.145AlaPro: 1.145 ± 0.206
1.892AlaGln: 1.892 ± 0.233
1.705AlaArg: 1.705 ± 0.185
3.364AlaSer: 3.364 ± 0.363
3.013AlaThr: 3.013 ± 0.3
2.78AlaVal: 2.78 ± 0.242
0.444AlaTrp: 0.444 ± 0.114
2.289AlaTyr: 2.289 ± 0.193
0.0AlaXaa: 0.0 ± 0.0
Cys
0.187CysAla: 0.187 ± 0.072
0.07CysCys: 0.07 ± 0.035
0.234CysAsp: 0.234 ± 0.071
0.257CysGlu: 0.257 ± 0.082
0.21CysPhe: 0.21 ± 0.084
0.327CysGly: 0.327 ± 0.08
0.117CysHis: 0.117 ± 0.059
0.444CysIle: 0.444 ± 0.1
0.514CysLys: 0.514 ± 0.135
0.397CysLeu: 0.397 ± 0.096
0.07CysMet: 0.07 ± 0.041
0.117CysAsn: 0.117 ± 0.069
0.21CysPro: 0.21 ± 0.087
0.21CysGln: 0.21 ± 0.086
0.164CysArg: 0.164 ± 0.076
0.374CysSer: 0.374 ± 0.1
0.397CysThr: 0.397 ± 0.094
0.257CysVal: 0.257 ± 0.068
0.093CysTrp: 0.093 ± 0.048
0.257CysTyr: 0.257 ± 0.073
0.0CysXaa: 0.0 ± 0.0
Asp
2.943AspAla: 2.943 ± 0.27
0.257AspCys: 0.257 ± 0.073
4.391AspAsp: 4.391 ± 0.439
5.232AspGlu: 5.232 ± 0.396
3.48AspPhe: 3.48 ± 0.338
3.34AspGly: 3.34 ± 0.265
0.467AspHis: 0.467 ± 0.102
6.33AspIle: 6.33 ± 0.429
6.937AspLys: 6.937 ± 0.385
5.746AspLeu: 5.746 ± 0.335
2.196AspMet: 2.196 ± 0.267
5.349AspAsn: 5.349 ± 0.4
1.588AspPro: 1.588 ± 0.183
1.191AspGln: 1.191 ± 0.214
2.383AspArg: 2.383 ± 0.225
4.181AspSer: 4.181 ± 0.358
3.948AspThr: 3.948 ± 0.261
4.812AspVal: 4.812 ± 0.339
0.584AspTrp: 0.584 ± 0.121
3.948AspTyr: 3.948 ± 0.304
0.0AspXaa: 0.0 ± 0.0
Glu
3.854GluAla: 3.854 ± 0.279
0.257GluCys: 0.257 ± 0.081
6.891GluAsp: 6.891 ± 0.524
8.97GluGlu: 8.97 ± 0.746
3.13GluPhe: 3.13 ± 0.256
4.461GluGly: 4.461 ± 0.295
1.518GluHis: 1.518 ± 0.18
5.746GluIle: 5.746 ± 0.443
7.311GluLys: 7.311 ± 0.54
7.311GluLeu: 7.311 ± 0.453
2.359GluMet: 2.359 ± 0.268
4.602GluAsn: 4.602 ± 0.364
2.009GluPro: 2.009 ± 0.407
3.901GluGln: 3.901 ± 0.341
3.06GluArg: 3.06 ± 0.282
4.672GluSer: 4.672 ± 0.323
3.714GluThr: 3.714 ± 0.259
5.232GluVal: 5.232 ± 0.369
0.701GluTrp: 0.701 ± 0.112
4.134GluTyr: 4.134 ± 0.3
0.0GluXaa: 0.0 ± 0.0
Phe
1.425PheAla: 1.425 ± 0.176
0.257PheCys: 0.257 ± 0.073
2.453PheAsp: 2.453 ± 0.283
2.64PheGlu: 2.64 ± 0.258
1.378PhePhe: 1.378 ± 0.161
2.056PheGly: 2.056 ± 0.245
0.584PheHis: 0.584 ± 0.13
3.714PheIle: 3.714 ± 0.352
3.831PheLys: 3.831 ± 0.291
3.2PheLeu: 3.2 ± 0.301
0.958PheMet: 0.958 ± 0.16
3.223PheAsn: 3.223 ± 0.29
1.121PhePro: 1.121 ± 0.15
1.331PheGln: 1.331 ± 0.17
1.238PheArg: 1.238 ± 0.153
2.896PheSer: 2.896 ± 0.288
2.289PheThr: 2.289 ± 0.281
2.733PheVal: 2.733 ± 0.249
0.35PheTrp: 0.35 ± 0.095
2.266PheTyr: 2.266 ± 0.281
0.0PheXaa: 0.0 ± 0.0
Gly
2.663GlyAla: 2.663 ± 0.5
0.257GlyCys: 0.257 ± 0.073
3.667GlyAsp: 3.667 ± 0.327
4.111GlyGlu: 4.111 ± 0.3
2.266GlyPhe: 2.266 ± 0.249
4.134GlyGly: 4.134 ± 0.908
1.098GlyHis: 1.098 ± 0.162
4.064GlyIle: 4.064 ± 0.354
5.746GlyLys: 5.746 ± 0.445
4.695GlyLeu: 4.695 ± 0.339
1.518GlyMet: 1.518 ± 0.199
3.644GlyAsn: 3.644 ± 0.347
0.0GlyPro: 0.0 ± 0.0
1.985GlyGln: 1.985 ± 0.297
2.079GlyArg: 2.079 ± 0.244
4.321GlySer: 4.321 ± 0.406
3.924GlyThr: 3.924 ± 0.421
3.737GlyVal: 3.737 ± 0.291
0.607GlyTrp: 0.607 ± 0.15
3.27GlyTyr: 3.27 ± 0.255
0.0GlyXaa: 0.0 ± 0.0
His
0.584HisAla: 0.584 ± 0.121
0.257HisCys: 0.257 ± 0.091
0.888HisAsp: 0.888 ± 0.15
1.168HisGlu: 1.168 ± 0.169
0.771HisPhe: 0.771 ± 0.125
0.981HisGly: 0.981 ± 0.137
0.234HisHis: 0.234 ± 0.082
1.822HisIle: 1.822 ± 0.252
1.098HisLys: 1.098 ± 0.152
1.285HisLeu: 1.285 ± 0.2
0.28HisMet: 0.28 ± 0.083
1.051HisAsn: 1.051 ± 0.166
0.561HisPro: 0.561 ± 0.106
0.467HisGln: 0.467 ± 0.085
0.654HisArg: 0.654 ± 0.137
0.934HisSer: 0.934 ± 0.138
0.958HisThr: 0.958 ± 0.135
0.934HisVal: 0.934 ± 0.146
0.187HisTrp: 0.187 ± 0.07
0.771HisTyr: 0.771 ± 0.125
0.0HisXaa: 0.0 ± 0.0
Ile
3.247IleAla: 3.247 ± 0.324
0.42IleCys: 0.42 ± 0.121
5.653IleAsp: 5.653 ± 0.37
6.12IleGlu: 6.12 ± 0.427
2.593IlePhe: 2.593 ± 0.228
4.158IleGly: 4.158 ± 0.367
1.051IleHis: 1.051 ± 0.174
6.073IleIle: 6.073 ± 0.487
6.984IleLys: 6.984 ± 0.329
5.956IleLeu: 5.956 ± 0.447
1.892IleMet: 1.892 ± 0.227
5.396IleAsn: 5.396 ± 0.331
2.336IlePro: 2.336 ± 0.213
2.336IleGln: 2.336 ± 0.249
2.85IleArg: 2.85 ± 0.271
4.508IleSer: 4.508 ± 0.308
4.999IleThr: 4.999 ± 0.382
4.905IleVal: 4.905 ± 0.396
0.561IleTrp: 0.561 ± 0.106
3.013IleTyr: 3.013 ± 0.277
0.0IleXaa: 0.0 ± 0.0
Lys
3.807LysAla: 3.807 ± 0.398
0.374LysCys: 0.374 ± 0.098
7.288LysAsp: 7.288 ± 0.397
10.558LysGlu: 10.558 ± 0.748
3.107LysPhe: 3.107 ± 0.241
6.026LysGly: 6.026 ± 0.516
1.892LysHis: 1.892 ± 0.236
4.415LysIle: 4.415 ± 0.305
8.059LysLys: 8.059 ± 0.584
6.984LysLeu: 6.984 ± 0.423
2.593LysMet: 2.593 ± 0.295
5.396LysAsn: 5.396 ± 0.384
2.85LysPro: 2.85 ± 0.34
3.807LysGln: 3.807 ± 0.322
2.92LysArg: 2.92 ± 0.283
4.765LysSer: 4.765 ± 0.339
4.485LysThr: 4.485 ± 0.308
7.054LysVal: 7.054 ± 0.399
0.701LysTrp: 0.701 ± 0.132
4.321LysTyr: 4.321 ± 0.346
0.0LysXaa: 0.0 ± 0.0
Leu
3.41LeuAla: 3.41 ± 0.308
0.327LeuCys: 0.327 ± 0.088
5.793LeuAsp: 5.793 ± 0.367
7.311LeuGlu: 7.311 ± 0.464
2.873LeuPhe: 2.873 ± 0.239
4.765LeuGly: 4.765 ± 0.472
1.051LeuHis: 1.051 ± 0.152
5.863LeuIle: 5.863 ± 0.445
7.241LeuLys: 7.241 ± 0.421
6.307LeuLeu: 6.307 ± 0.458
2.056LeuMet: 2.056 ± 0.232
5.139LeuAsn: 5.139 ± 0.369
2.616LeuPro: 2.616 ± 0.221
3.083LeuGln: 3.083 ± 0.303
3.107LeuArg: 3.107 ± 0.252
6.143LeuSer: 6.143 ± 0.332
5.676LeuThr: 5.676 ± 0.378
4.718LeuVal: 4.718 ± 0.414
0.654LeuTrp: 0.654 ± 0.109
3.34LeuTyr: 3.34 ± 0.313
0.0LeuXaa: 0.0 ± 0.0
Met
1.752MetAla: 1.752 ± 0.241
0.187MetCys: 0.187 ± 0.072
1.892MetAsp: 1.892 ± 0.238
2.126MetGlu: 2.126 ± 0.234
1.238MetPhe: 1.238 ± 0.178
1.261MetGly: 1.261 ± 0.203
0.28MetHis: 0.28 ± 0.084
1.939MetIle: 1.939 ± 0.235
2.149MetLys: 2.149 ± 0.271
1.682MetLeu: 1.682 ± 0.203
0.677MetMet: 0.677 ± 0.141
1.752MetAsn: 1.752 ± 0.211
0.561MetPro: 0.561 ± 0.122
0.747MetGln: 0.747 ± 0.123
1.215MetArg: 1.215 ± 0.198
1.939MetSer: 1.939 ± 0.206
1.752MetThr: 1.752 ± 0.181
1.261MetVal: 1.261 ± 0.168
0.117MetTrp: 0.117 ± 0.054
1.051MetTyr: 1.051 ± 0.162
0.0MetXaa: 0.0 ± 0.0
Asn
3.06AsnAla: 3.06 ± 0.284
0.327AsnCys: 0.327 ± 0.087
4.158AsnAsp: 4.158 ± 0.331
4.461AsnGlu: 4.461 ± 0.362
2.453AsnPhe: 2.453 ± 0.245
3.994AsnGly: 3.994 ± 0.272
1.238AsnHis: 1.238 ± 0.188
4.882AsnIle: 4.882 ± 0.36
6.821AsnLys: 6.821 ± 0.43
5.256AsnLeu: 5.256 ± 0.375
1.705AsnMet: 1.705 ± 0.19
4.835AsnAsn: 4.835 ± 0.352
2.546AsnPro: 2.546 ± 0.308
2.056AsnGln: 2.056 ± 0.221
2.453AsnArg: 2.453 ± 0.204
3.878AsnSer: 3.878 ± 0.275
3.457AsnThr: 3.457 ± 0.309
3.948AsnVal: 3.948 ± 0.283
0.397AsnTrp: 0.397 ± 0.101
3.387AsnTyr: 3.387 ± 0.313
0.0AsnXaa: 0.0 ± 0.0
Pro
1.168ProAla: 1.168 ± 0.163
0.093ProCys: 0.093 ± 0.045
1.565ProAsp: 1.565 ± 0.221
2.429ProGlu: 2.429 ± 0.283
1.121ProPhe: 1.121 ± 0.174
1.145ProGly: 1.145 ± 0.195
0.444ProHis: 0.444 ± 0.106
2.056ProIle: 2.056 ± 0.194
2.499ProLys: 2.499 ± 0.248
2.009ProLeu: 2.009 ± 0.212
0.724ProMet: 0.724 ± 0.116
1.915ProAsn: 1.915 ± 0.255
0.677ProPro: 0.677 ± 0.19
1.191ProGln: 1.191 ± 0.194
0.888ProArg: 0.888 ± 0.148
2.406ProSer: 2.406 ± 0.285
2.032ProThr: 2.032 ± 0.261
1.588ProVal: 1.588 ± 0.178
0.14ProTrp: 0.14 ± 0.052
1.588ProTyr: 1.588 ± 0.201
0.0ProXaa: 0.0 ± 0.0
Gln
2.126GlnAla: 2.126 ± 0.218
0.117GlnCys: 0.117 ± 0.047
2.359GlnAsp: 2.359 ± 0.236
3.434GlnGlu: 3.434 ± 0.324
1.472GlnPhe: 1.472 ± 0.218
2.476GlnGly: 2.476 ± 0.319
0.537GlnHis: 0.537 ± 0.102
2.406GlnIle: 2.406 ± 0.242
2.196GlnLys: 2.196 ± 0.215
3.364GlnLeu: 3.364 ± 0.273
0.864GlnMet: 0.864 ± 0.145
1.658GlnAsn: 1.658 ± 0.228
1.191GlnPro: 1.191 ± 0.248
1.845GlnGln: 1.845 ± 0.314
0.981GlnArg: 0.981 ± 0.147
2.616GlnSer: 2.616 ± 0.264
1.822GlnThr: 1.822 ± 0.209
1.915GlnVal: 1.915 ± 0.199
0.327GlnTrp: 0.327 ± 0.088
1.799GlnTyr: 1.799 ± 0.218
0.0GlnXaa: 0.0 ± 0.0
Arg
1.472ArgAla: 1.472 ± 0.192
0.28ArgCys: 0.28 ± 0.081
2.476ArgAsp: 2.476 ± 0.214
3.13ArgGlu: 3.13 ± 0.284
1.682ArgPhe: 1.682 ± 0.203
2.056ArgGly: 2.056 ± 0.183
0.42ArgHis: 0.42 ± 0.099
2.733ArgIle: 2.733 ± 0.229
3.434ArgLys: 3.434 ± 0.331
2.99ArgLeu: 2.99 ± 0.226
0.911ArgMet: 0.911 ± 0.137
1.962ArgAsn: 1.962 ± 0.204
1.098ArgPro: 1.098 ± 0.148
1.355ArgGln: 1.355 ± 0.167
1.635ArgArg: 1.635 ± 0.196
1.729ArgSer: 1.729 ± 0.212
2.079ArgThr: 2.079 ± 0.288
2.476ArgVal: 2.476 ± 0.218
0.257ArgTrp: 0.257 ± 0.074
1.658ArgTyr: 1.658 ± 0.205
0.0ArgXaa: 0.0 ± 0.0
Ser
2.896SerAla: 2.896 ± 0.244
0.164SerCys: 0.164 ± 0.059
4.158SerAsp: 4.158 ± 0.365
4.321SerGlu: 4.321 ± 0.282
3.083SerPhe: 3.083 ± 0.283
3.854SerGly: 3.854 ± 0.407
1.051SerHis: 1.051 ± 0.138
5.116SerIle: 5.116 ± 0.325
6.167SerLys: 6.167 ± 0.433
5.326SerLeu: 5.326 ± 0.349
1.565SerMet: 1.565 ± 0.204
4.345SerAsn: 4.345 ± 0.354
2.009SerPro: 2.009 ± 0.245
1.775SerGln: 1.775 ± 0.191
2.149SerArg: 2.149 ± 0.236
4.648SerSer: 4.648 ± 0.401
4.158SerThr: 4.158 ± 0.341
4.181SerVal: 4.181 ± 0.345
0.607SerTrp: 0.607 ± 0.127
3.901SerTyr: 3.901 ± 0.264
0.0SerXaa: 0.0 ± 0.0
Thr
2.499ThrAla: 2.499 ± 0.288
0.14ThrCys: 0.14 ± 0.058
3.807ThrAsp: 3.807 ± 0.336
4.929ThrGlu: 4.929 ± 0.409
2.336ThrPhe: 2.336 ± 0.27
4.088ThrGly: 4.088 ± 0.401
0.911ThrHis: 0.911 ± 0.14
4.929ThrIle: 4.929 ± 0.366
4.952ThrLys: 4.952 ± 0.372
5.069ThrLeu: 5.069 ± 0.387
1.004ThrMet: 1.004 ± 0.142
3.2ThrAsn: 3.2 ± 0.26
2.242ThrPro: 2.242 ± 0.285
2.196ThrGln: 2.196 ± 0.268
2.312ThrArg: 2.312 ± 0.275
3.854ThrSer: 3.854 ± 0.331
2.99ThrThr: 2.99 ± 0.352
4.391ThrVal: 4.391 ± 0.386
0.654ThrTrp: 0.654 ± 0.127
2.71ThrTyr: 2.71 ± 0.241
0.0ThrXaa: 0.0 ± 0.0
Val
2.499ValAla: 2.499 ± 0.261
0.514ValCys: 0.514 ± 0.115
4.765ValAsp: 4.765 ± 0.3
5.443ValGlu: 5.443 ± 0.495
2.546ValPhe: 2.546 ± 0.222
3.107ValGly: 3.107 ± 0.242
1.028ValHis: 1.028 ± 0.157
4.859ValIle: 4.859 ± 0.367
6.003ValLys: 6.003 ± 0.433
5.139ValLeu: 5.139 ± 0.364
1.518ValMet: 1.518 ± 0.162
4.602ValAsn: 4.602 ± 0.347
1.658ValPro: 1.658 ± 0.188
1.845ValGln: 1.845 ± 0.236
2.149ValArg: 2.149 ± 0.198
4.718ValSer: 4.718 ± 0.335
4.088ValThr: 4.088 ± 0.399
3.878ValVal: 3.878 ± 0.35
0.42ValTrp: 0.42 ± 0.104
3.504ValTyr: 3.504 ± 0.312
0.0ValXaa: 0.0 ± 0.0
Trp
0.35TrpAla: 0.35 ± 0.077
0.07TrpCys: 0.07 ± 0.032
0.677TrpAsp: 0.677 ± 0.11
0.771TrpGlu: 0.771 ± 0.14
0.304TrpPhe: 0.304 ± 0.082
0.584TrpGly: 0.584 ± 0.135
0.14TrpHis: 0.14 ± 0.049
0.654TrpIle: 0.654 ± 0.131
0.841TrpLys: 0.841 ± 0.163
0.607TrpLeu: 0.607 ± 0.121
0.14TrpMet: 0.14 ± 0.064
0.584TrpAsn: 0.584 ± 0.12
0.0TrpPro: 0.0 ± 0.0
0.327TrpGln: 0.327 ± 0.078
0.187TrpArg: 0.187 ± 0.059
0.607TrpSer: 0.607 ± 0.132
0.42TrpThr: 0.42 ± 0.109
0.491TrpVal: 0.491 ± 0.108
0.164TrpTrp: 0.164 ± 0.073
0.607TrpTyr: 0.607 ± 0.111
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.939TyrAla: 1.939 ± 0.228
0.35TyrCys: 0.35 ± 0.09
4.018TyrAsp: 4.018 ± 0.294
3.434TyrGlu: 3.434 ± 0.313
2.032TyrPhe: 2.032 ± 0.208
2.64TyrGly: 2.64 ± 0.265
0.958TyrHis: 0.958 ± 0.17
3.364TyrIle: 3.364 ± 0.274
4.508TyrLys: 4.508 ± 0.32
4.672TyrLeu: 4.672 ± 0.386
1.355TyrMet: 1.355 ± 0.181
4.111TyrAsn: 4.111 ± 0.323
1.261TyrPro: 1.261 ± 0.179
1.962TyrGln: 1.962 ± 0.2
1.705TyrArg: 1.705 ± 0.172
2.92TyrSer: 2.92 ± 0.244
3.013TyrThr: 3.013 ± 0.387
2.99TyrVal: 2.99 ± 0.275
0.584TyrTrp: 0.584 ± 0.133
3.27TyrTyr: 3.27 ± 0.313
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 219 proteins (42812 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski