Amino acid dipeptide frequency for Lactobacillus phage SAC12B

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
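As an illustration of the calculation, here is a minimal Python sketch that counts overlapping adjacent residue pairs and reports them in per mille. It is not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter codes (with X standing in for Xaa), the choice not to count pairs across protein boundaries, and the normalization by the total number of counted pairs are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X (assumption of this sketch).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MKKLA", "MKU"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

With real data, the sequences would come from the proteome FASTA file, and the 400 (or 441) possible pairs that never occur would simply be absent from the returned dictionary.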

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
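The arithmetic behind the expected value, and the red/blue split, can be sketched in Python as follows. The page does not say whether the colouring uses the plain uniform expectation or also takes the error estimate into account, so the simple threshold comparison below is an assumption; the function names are hypothetical, and the three observed values are copied from the table.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation: 1000 per mille shared equally by all ordered pairs."""
    return 1000.0 / alphabet_size ** 2

def classify(value, expected):
    """Label a per-mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"   # would be shown in red
    if value < expected:
        return "under"  # would be shown in blue
    return "equal"

expected = expected_per_mille()  # 20 * 20 = 400 dipeptides
observed = {"AlaAla": 1.454, "AspLys": 7.683, "CysCys": 0.024}
labels = {pair: classify(value, expected) for pair, value in observed.items()}
print(expected, labels)
```

Passing `alphabet_size=21` gives the expectation for the 441-pair case that includes Xaa.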

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
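For readers who want the values in other units, the conversion is a single multiplication. The helper names below are hypothetical, and the sample value is the AspLys entry from the table.

```python
import math

def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

asp_lys = 7.683  # AspLys frequency in per mille, from the table
fraction = per_mille_to_fraction(asp_lys)
percent = per_mille_to_percent(asp_lys)
print(fraction, percent)
```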

For more information, see the sequence space article on Wikipedia.

(rows: first residue; columns: second residue; cells: frequency in ‰ ± error)

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 1.454 ± 0.254 | 0.194 ± 0.068 | 3.199 ± 0.295 | 2.618 ± 0.274 | 1.794 ± 0.251 | 2.909 ± 0.433 | 0.533 ± 0.118 | 4.145 ± 0.301 | 4.436 ± 0.445 | 3.805 ± 0.332 | 1.309 ± 0.184 | 4.023 ± 0.355 | 1.794 ± 0.346 | 1.527 ± 0.206 | 1.236 ± 0.275 | 4.096 ± 0.393 | 3.539 ± 0.361 | 2.521 ± 0.288 | 0.703 ± 0.141 | 2.763 ± 0.28 | 0.0 ± 0.0
Cys | 0.097 ± 0.058 | 0.024 ± 0.024 | 0.461 ± 0.11 | 0.194 ± 0.081 | 0.315 ± 0.093 | 0.388 ± 0.102 | 0.048 ± 0.032 | 0.412 ± 0.099 | 0.388 ± 0.098 | 0.679 ± 0.126 | 0.145 ± 0.063 | 0.388 ± 0.085 | 0.17 ± 0.081 | 0.218 ± 0.073 | 0.097 ± 0.052 | 0.364 ± 0.089 | 0.436 ± 0.099 | 0.388 ± 0.089 | 0.121 ± 0.053 | 0.388 ± 0.106 | 0.0 ± 0.0
Asp | 3.369 ± 0.31 | 0.388 ± 0.097 | 4.654 ± 0.428 | 3.927 ± 0.383 | 3.49 ± 0.346 | 3.611 ± 0.296 | 0.751 ± 0.138 | 7.126 ± 0.369 | 7.683 ± 0.405 | 6.399 ± 0.425 | 1.745 ± 0.208 | 5.502 ± 0.36 | 2.084 ± 0.22 | 1.26 ± 0.203 | 2.133 ± 0.217 | 6.471 ± 0.357 | 4.096 ± 0.297 | 3.393 ± 0.272 | 0.751 ± 0.159 | 4.339 ± 0.38 | 0.0 ± 0.0
Glu | 3.005 ± 0.311 | 0.388 ± 0.099 | 5.211 ± 0.448 | 5.284 ± 0.528 | 2.375 ± 0.242 | 2.763 ± 0.243 | 0.824 ± 0.159 | 4.314 ± 0.322 | 6.059 ± 0.407 | 5.55 ± 0.44 | 1.6 ± 0.192 | 4.508 ± 0.423 | 1.285 ± 0.211 | 1.818 ± 0.213 | 1.43 ± 0.184 | 2.981 ± 0.253 | 2.933 ± 0.388 | 3.975 ± 0.393 | 0.485 ± 0.109 | 2.787 ± 0.289 | 0.0 ± 0.0
Phe | 1.624 ± 0.218 | 0.218 ± 0.095 | 3.151 ± 0.27 | 2.012 ± 0.232 | 1.333 ± 0.198 | 2.23 ± 0.241 | 0.533 ± 0.123 | 3.175 ± 0.249 | 3.539 ± 0.348 | 3.03 ± 0.263 | 0.897 ± 0.157 | 3.733 ± 0.307 | 1.551 ± 0.185 | 0.921 ± 0.152 | 1.285 ± 0.169 | 4.145 ± 0.347 | 2.254 ± 0.229 | 2.036 ± 0.253 | 0.388 ± 0.116 | 2.109 ± 0.203 | 0.0 ± 0.0
Gly | 2.133 ± 0.297 | 0.291 ± 0.08 | 3.902 ± 0.338 | 2.545 ± 0.221 | 2.254 ± 0.191 | 3.127 ± 0.579 | 0.848 ± 0.162 | 5.962 ± 0.401 | 5.066 ± 0.393 | 4.339 ± 0.325 | 1.236 ± 0.18 | 4.775 ± 0.399 | 0.412 ± 0.099 | 1.139 ± 0.179 | 1.672 ± 0.22 | 4.678 ± 0.455 | 3.418 ± 0.356 | 3.781 ± 0.313 | 0.848 ± 0.223 | 4.193 ± 0.28 | 0.0 ± 0.0
His | 0.485 ± 0.107 | 0.194 ± 0.069 | 0.848 ± 0.172 | 0.533 ± 0.113 | 0.606 ± 0.127 | 0.873 ± 0.165 | 0.509 ± 0.123 | 0.897 ± 0.173 | 1.091 ± 0.172 | 1.139 ± 0.145 | 0.218 ± 0.079 | 0.824 ± 0.161 | 0.557 ± 0.13 | 0.509 ± 0.103 | 0.582 ± 0.119 | 0.776 ± 0.133 | 0.897 ± 0.142 | 0.848 ± 0.145 | 0.242 ± 0.093 | 0.824 ± 0.162 | 0.0 ± 0.0
Ile | 3.418 ± 0.301 | 0.364 ± 0.096 | 6.059 ± 0.441 | 4.023 ± 0.324 | 2.181 ± 0.27 | 4.217 ± 0.352 | 1.139 ± 0.163 | 5.138 ± 0.38 | 7.732 ± 0.446 | 5.769 ± 0.45 | 1.575 ± 0.175 | 7.489 ± 0.42 | 2.933 ± 0.265 | 1.939 ± 0.221 | 2.254 ± 0.255 | 7.465 ± 0.599 | 5.381 ± 0.361 | 4.169 ± 0.337 | 0.436 ± 0.142 | 3.248 ± 0.273 | 0.0 ± 0.0
Lys | 4.532 ± 0.381 | 0.388 ± 0.09 | 7.32 ± 0.487 | 8.047 ± 0.695 | 2.909 ± 0.301 | 4.993 ± 0.379 | 1.333 ± 0.198 | 4.896 ± 0.383 | 6.035 ± 0.414 | 6.859 ± 0.411 | 1.769 ± 0.185 | 5.938 ± 0.375 | 2.254 ± 0.25 | 2.715 ± 0.239 | 3.005 ± 0.318 | 6.229 ± 0.484 | 4.46 ± 0.288 | 6.108 ± 0.421 | 0.873 ± 0.131 | 4.92 ± 0.353 | 0.0 ± 0.0
Leu | 4.678 ± 0.386 | 0.485 ± 0.113 | 6.496 ± 0.495 | 5.526 ± 0.472 | 3.49 ± 0.286 | 4.872 ± 0.295 | 0.8 ± 0.159 | 5.429 ± 0.432 | 6.835 ± 0.508 | 7.368 ± 0.555 | 1.309 ± 0.191 | 6.35 ± 0.387 | 3.127 ± 0.318 | 2.181 ± 0.229 | 2.424 ± 0.273 | 7.732 ± 0.459 | 4.823 ± 0.371 | 4.726 ± 0.416 | 0.654 ± 0.137 | 3.733 ± 0.339 | 0.0 ± 0.0
Met | 1.406 ± 0.223 | 0.097 ± 0.052 | 1.818 ± 0.192 | 1.406 ± 0.187 | 0.654 ± 0.129 | 1.26 ± 0.183 | 0.17 ± 0.063 | 1.309 ± 0.177 | 1.915 ± 0.228 | 2.181 ± 0.243 | 0.485 ± 0.131 | 1.624 ± 0.228 | 0.703 ± 0.143 | 0.703 ± 0.128 | 0.703 ± 0.138 | 1.769 ± 0.209 | 1.236 ± 0.158 | 1.115 ± 0.17 | 0.218 ± 0.08 | 0.897 ± 0.153 | 0.0 ± 0.0
Asn | 3.636 ± 0.333 | 0.485 ± 0.121 | 4.314 ± 0.296 | 3.781 ± 0.305 | 2.981 ± 0.264 | 4.145 ± 0.391 | 1.309 ± 0.194 | 6.253 ± 0.498 | 7.901 ± 0.478 | 5.744 ± 0.395 | 2.133 ± 0.215 | 6.714 ± 0.581 | 2.569 ± 0.251 | 2.739 ± 0.309 | 3.03 ± 0.3 | 6.908 ± 0.599 | 4.896 ± 0.389 | 3.66 ± 0.312 | 0.509 ± 0.105 | 4.387 ± 0.318 | 0.0 ± 0.0
Pro | 1.551 ± 0.205 | 0.097 ± 0.04 | 2.521 ± 0.247 | 1.939 ± 0.214 | 1.188 ± 0.222 | 1.43 ± 0.187 | 0.315 ± 0.09 | 2.303 ± 0.226 | 2.036 ± 0.252 | 2.69 ± 0.257 | 0.63 ± 0.142 | 2.327 ± 0.249 | 0.436 ± 0.11 | 1.163 ± 0.234 | 0.654 ± 0.13 | 2.157 ± 0.218 | 2.521 ± 0.426 | 2.327 ± 0.27 | 0.291 ± 0.085 | 1.794 ± 0.198 | 0.0 ± 0.0
Gln | 2.351 ± 0.273 | 0.073 ± 0.04 | 2.448 ± 0.253 | 2.133 ± 0.273 | 0.921 ± 0.158 | 1.794 ± 0.237 | 0.485 ± 0.094 | 1.6 ± 0.222 | 1.794 ± 0.226 | 3.127 ± 0.279 | 0.582 ± 0.117 | 1.769 ± 0.224 | 0.97 ± 0.183 | 1.188 ± 0.237 | 0.97 ± 0.198 | 2.206 ± 0.264 | 1.212 ± 0.186 | 1.939 ± 0.188 | 0.291 ± 0.089 | 1.939 ± 0.215 | 0.0 ± 0.0
Arg | 1.236 ± 0.183 | 0.17 ± 0.077 | 1.891 ± 0.225 | 1.987 ± 0.237 | 1.357 ± 0.172 | 1.479 ± 0.204 | 0.242 ± 0.089 | 2.472 ± 0.232 | 2.545 ± 0.261 | 2.884 ± 0.289 | 0.8 ± 0.134 | 1.866 ± 0.222 | 0.897 ± 0.184 | 1.26 ± 0.194 | 1.066 ± 0.202 | 1.769 ± 0.218 | 1.479 ± 0.229 | 2.521 ± 0.274 | 0.388 ± 0.086 | 1.697 ± 0.211 | 0.0 ± 0.0
Ser | 3.83 ± 0.428 | 0.412 ± 0.1 | 6.059 ± 0.359 | 4.581 ± 0.376 | 3.539 ± 0.298 | 5.841 ± 0.537 | 0.994 ± 0.19 | 7.223 ± 0.613 | 6.568 ± 0.442 | 6.738 ± 0.394 | 1.6 ± 0.233 | 6.98 ± 0.444 | 2.23 ± 0.277 | 2.812 ± 0.266 | 2.327 ± 0.292 | 7.441 ± 0.71 | 4.023 ± 0.403 | 4.096 ± 0.283 | 0.945 ± 0.154 | 4.751 ± 0.445 | 0.0 ± 0.0
Thr | 3.272 ± 0.372 | 0.412 ± 0.109 | 3.902 ± 0.283 | 3.03 ± 0.243 | 2.739 ± 0.257 | 3.902 ± 0.295 | 0.824 ± 0.162 | 5.066 ± 0.353 | 3.975 ± 0.341 | 4.387 ± 0.323 | 0.727 ± 0.123 | 4.193 ± 0.393 | 2.787 ± 0.25 | 1.866 ± 0.284 | 1.721 ± 0.212 | 4.848 ± 0.485 | 4.581 ± 0.722 | 4.726 ± 0.676 | 0.727 ± 0.159 | 2.739 ± 0.243 | 0.0 ± 0.0
Val | 3.563 ± 0.35 | 0.436 ± 0.112 | 4.387 ± 0.333 | 3.102 ± 0.456 | 3.175 ± 0.274 | 3.199 ± 0.308 | 0.606 ± 0.13 | 4.145 ± 0.318 | 5.187 ± 0.404 | 4.339 ± 0.33 | 1.285 ± 0.172 | 4.387 ± 0.355 | 1.648 ± 0.187 | 1.43 ± 0.205 | 1.333 ± 0.167 | 5.866 ± 0.422 | 4.629 ± 0.349 | 2.884 ± 0.259 | 0.776 ± 0.234 | 2.981 ± 0.257 | 0.0 ± 0.0
Trp | 0.461 ± 0.108 | 0.121 ± 0.053 | 0.97 ± 0.183 | 0.582 ± 0.121 | 0.533 ± 0.118 | 0.873 ± 0.163 | 0.097 ± 0.048 | 0.63 ± 0.099 | 0.606 ± 0.128 | 1.163 ± 0.225 | 0.291 ± 0.08 | 0.703 ± 0.134 | 0.145 ± 0.054 | 0.557 ± 0.116 | 0.194 ± 0.072 | 0.436 ± 0.125 | 0.412 ± 0.093 | 0.921 ± 0.159 | 0.194 ± 0.062 | 0.509 ± 0.118 | 0.0 ± 0.0
Tyr | 2.496 ± 0.283 | 0.485 ± 0.111 | 3.563 ± 0.288 | 2.351 ± 0.245 | 2.327 ± 0.258 | 2.715 ± 0.251 | 1.091 ± 0.157 | 4.436 ± 0.37 | 4.193 ± 0.366 | 4.848 ± 0.354 | 1.309 ± 0.195 | 3.999 ± 0.311 | 1.818 ± 0.189 | 1.842 ± 0.22 | 1.891 ± 0.226 | 4.654 ± 0.359 | 3.199 ± 0.332 | 3.224 ± 0.347 | 0.557 ± 0.102 | 3.054 ± 0.302 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 191 proteins (41259 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
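One way such a protein-level bootstrap could work is sketched below in Python: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. The page does not give the exact procedure, so reporting the standard deviation, keeping the resample size equal to the number of proteins, and the function names are all assumptions of this sketch.

```python
import random
import statistics

def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide (one-letter codes) in a protein set."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the set size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MKKLA", "MKDLS", "MSSKK", "MLLDK"]
print(pair_frequency(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins happen to be in the set.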

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski