Amino acid dipeptide frequency for Cellulophaga phage phi13:2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.
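As a rough illustration of how such frequencies can be obtained, the following Python sketch counts adjacent residue pairs and compares each frequency with the uniform expectation of 2.5‰. The function names (`dipeptide_per_mille`, `label`) are hypothetical, the input is assumed to contain only the 20 standard one-letter codes, and normalizing by the total number of counted pairs is an assumption; the database may normalize differently.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 400 standard pairs.

    Assumption: frequencies are normalized by the total number of adjacent
    pairs counted; the source page does not state its normalization.
    """
    counts = Counter()
    for seq in proteins:
        # Pairs are counted within each protein, never across protein boundaries.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    result = {}
    for a, b in product(AMINO_ACIDS, repeat=2):
        pair = a + b
        result[pair] = 1000.0 * counts[pair] / total if total else 0.0
    return result


def label(freq_per_mille, expected=1000.0 / 400):
    """Classify a frequency against the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"   # would be marked red in the table
    if freq_per_mille < expected:
        return "under"  # would be marked blue in the table
    return "expected"


# Toy input, not real proteome data.
toy = dipeptide_per_mille(["AAA", "AC"])
print(toy["AA"], label(toy["AA"]))
```

With the values from the table, `label(3.243)` (AlaAla) falls in the overrepresented group and `label(0.533)` (AlaCys) in the underrepresented one.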

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
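The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical; the LysGlu value is taken from the table, and the two uniform expectations follow from the 400 and 441 possible dipeptides mentioned earlier.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value_per_mille / 10.0


lys_glu = 10.218            # LysGlu frequency from the table, in per mille
uniform_400 = 1000.0 / 400  # uniform expectation for 400 dipeptides, in per mille
uniform_441 = 1000.0 / 441  # uniform expectation if Xaa is included, in per mille

print(f"LysGlu fraction: {per_mille_to_fraction(lys_glu):.6f}")
print(f"LysGlu percent:  {per_mille_to_percent(lys_glu):.4f}")
print(f"Uniform expectation: {uniform_400:.3f} or {uniform_441:.3f} per mille")
```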

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰) ± error, grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 3.243 ± 0.497
AlaCys: 0.533 ± 0.161
AlaAsp: 2.932 ± 0.363
AlaGlu: 3.554 ± 0.443
AlaPhe: 2.177 ± 0.32
AlaGly: 2.132 ± 0.376
AlaHis: 0.444 ± 0.144
AlaIle: 3.998 ± 0.435
AlaLys: 4.753 ± 0.551
AlaLeu: 4.709 ± 0.625
AlaMet: 1.51 ± 0.251
AlaAsn: 3.776 ± 0.41
AlaPro: 0.933 ± 0.252
AlaGln: 1.955 ± 0.281
AlaArg: 1.644 ± 0.325
AlaSer: 3.465 ± 0.478
AlaThr: 3.598 ± 0.426
AlaVal: 2.888 ± 0.326
AlaTrp: 0.444 ± 0.139
AlaTyr: 1.955 ± 0.323
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.178 ± 0.095
CysCys: 0.089 ± 0.061
CysAsp: 0.711 ± 0.167
CysGlu: 1.066 ± 0.276
CysPhe: 0.489 ± 0.144
CysGly: 0.888 ± 0.223
CysHis: 0.089 ± 0.064
CysIle: 0.755 ± 0.178
CysLys: 1.333 ± 0.282
CysLeu: 0.533 ± 0.157
CysMet: 0.089 ± 0.064
CysAsn: 0.311 ± 0.12
CysPro: 0.533 ± 0.176
CysGln: 0.089 ± 0.081
CysArg: 0.4 ± 0.129
CysSer: 0.666 ± 0.199
CysThr: 0.4 ± 0.154
CysVal: 0.489 ± 0.152
CysTrp: 0.044 ± 0.046
CysTyr: 0.444 ± 0.134
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.554 ± 0.484
AspCys: 0.755 ± 0.191
AspAsp: 3.154 ± 0.456
AspGlu: 4.798 ± 0.536
AspPhe: 4.442 ± 0.501
AspGly: 4.665 ± 0.482
AspHis: 0.8 ± 0.202
AspIle: 4.665 ± 0.437
AspLys: 5.42 ± 0.495
AspLeu: 6.486 ± 0.479
AspMet: 1.51 ± 0.298
AspAsn: 4.131 ± 0.48
AspPro: 2.132 ± 0.375
AspGln: 1.377 ± 0.221
AspArg: 2.488 ± 0.296
AspSer: 4.176 ± 0.453
AspThr: 3.243 ± 0.439
AspVal: 3.421 ± 0.399
AspTrp: 0.933 ± 0.207
AspTyr: 2.976 ± 0.374
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.531 ± 0.543
GluCys: 0.755 ± 0.186
GluAsp: 5.064 ± 0.577
GluGlu: 7.152 ± 0.711
GluPhe: 3.065 ± 0.353
GluGly: 3.643 ± 0.374
GluHis: 0.844 ± 0.224
GluIle: 6.708 ± 0.735
GluLys: 6.575 ± 0.749
GluLeu: 7.819 ± 0.661
GluMet: 1.688 ± 0.264
GluAsn: 5.731 ± 0.462
GluPro: 1.022 ± 0.186
GluGln: 2.665 ± 0.415
GluArg: 2.31 ± 0.31
GluSer: 5.42 ± 0.553
GluThr: 6.042 ± 0.596
GluVal: 4.798 ± 0.566
GluTrp: 0.8 ± 0.177
GluTyr: 3.821 ± 0.508
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.443 ± 0.382
PheCys: 0.622 ± 0.172
PheAsp: 3.332 ± 0.371
PheGlu: 3.287 ± 0.354
PhePhe: 2.044 ± 0.318
PheGly: 2.266 ± 0.312
PheHis: 0.4 ± 0.152
PheIle: 3.687 ± 0.375
PheLys: 4.176 ± 0.405
PheLeu: 3.199 ± 0.413
PheMet: 0.977 ± 0.165
PheAsn: 3.421 ± 0.486
PhePro: 1.066 ± 0.206
PheGln: 0.8 ± 0.224
PheArg: 1.644 ± 0.274
PheSer: 3.154 ± 0.342
PheThr: 2.888 ± 0.367
PheVal: 2.532 ± 0.341
PheTrp: 0.4 ± 0.134
PheTyr: 1.555 ± 0.237
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.355 ± 0.309
GlyCys: 0.489 ± 0.132
GlyAsp: 3.909 ± 0.468
GlyGlu: 4.22 ± 0.44
GlyPhe: 2.754 ± 0.437
GlyGly: 3.154 ± 0.461
GlyHis: 0.489 ± 0.171
GlyIle: 4.709 ± 0.486
GlyLys: 4.398 ± 0.406
GlyLeu: 4.265 ± 0.485
GlyMet: 1.199 ± 0.235
GlyAsn: 3.732 ± 0.511
GlyPro: 0.933 ± 0.196
GlyGln: 1.466 ± 0.258
GlyArg: 2.221 ± 0.362
GlySer: 4.665 ± 0.592
GlyThr: 4.354 ± 0.615
GlyVal: 4.354 ± 0.476
GlyTrp: 0.933 ± 0.197
GlyTyr: 2.221 ± 0.386
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.4 ± 0.148
HisCys: 0.4 ± 0.139
HisAsp: 0.578 ± 0.171
HisGlu: 1.111 ± 0.267
HisPhe: 0.888 ± 0.194
HisGly: 0.755 ± 0.239
HisHis: 0.178 ± 0.139
HisIle: 1.155 ± 0.281
HisLys: 0.755 ± 0.22
HisLeu: 0.933 ± 0.241
HisMet: 0.4 ± 0.135
HisAsn: 1.022 ± 0.206
HisPro: 0.444 ± 0.145
HisGln: 0.489 ± 0.139
HisArg: 0.666 ± 0.182
HisSer: 0.622 ± 0.162
HisThr: 0.933 ± 0.175
HisVal: 0.533 ± 0.181
HisTrp: 0.355 ± 0.116
HisTyr: 0.8 ± 0.184
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.398 ± 0.539
IleCys: 1.111 ± 0.224
IleAsp: 6.708 ± 0.463
IleGlu: 6.575 ± 0.568
IlePhe: 2.355 ± 0.448
IleGly: 4.531 ± 0.429
IleHis: 0.977 ± 0.287
IleIle: 5.42 ± 0.624
IleLys: 7.419 ± 0.653
IleLeu: 6.131 ± 0.635
IleMet: 1.599 ± 0.332
IleAsn: 5.775 ± 0.44
IlePro: 2.532 ± 0.333
IleGln: 2.843 ± 0.413
IleArg: 2.31 ± 0.381
IleSer: 5.331 ± 0.569
IleThr: 5.82 ± 0.693
IleVal: 3.998 ± 0.409
IleTrp: 0.711 ± 0.164
IleTyr: 2.443 ± 0.32
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.798 ± 0.479
LysCys: 0.8 ± 0.19
LysAsp: 5.908 ± 0.607
LysGlu: 10.218 ± 0.974
LysPhe: 3.376 ± 0.42
LysGly: 4.398 ± 0.542
LysHis: 1.155 ± 0.238
LysIle: 6.264 ± 0.526
LysLys: 8.618 ± 0.835
LysLeu: 7.641 ± 0.623
LysMet: 2.843 ± 0.304
LysAsn: 5.775 ± 0.576
LysPro: 2.488 ± 0.335
LysGln: 3.199 ± 0.351
LysArg: 3.909 ± 0.527
LysSer: 5.908 ± 0.549
LysThr: 5.82 ± 0.604
LysVal: 5.064 ± 0.461
LysTrp: 0.888 ± 0.165
LysTyr: 3.554 ± 0.359
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 3.865 ± 0.352
LeuCys: 0.622 ± 0.188
LeuAsp: 5.731 ± 0.51
LeuGlu: 6.219 ± 0.615
LeuPhe: 3.51 ± 0.379
LeuGly: 4.887 ± 0.541
LeuHis: 1.244 ± 0.274
LeuIle: 6.131 ± 0.575
LeuLys: 8.574 ± 0.847
LeuLeu: 5.686 ± 0.632
LeuMet: 1.555 ± 0.228
LeuAsn: 7.064 ± 0.604
LeuPro: 1.821 ± 0.302
LeuGln: 2.577 ± 0.374
LeuArg: 3.332 ± 0.448
LeuSer: 6.175 ± 0.534
LeuThr: 4.665 ± 0.551
LeuVal: 3.598 ± 0.393
LeuTrp: 0.622 ± 0.167
LeuTyr: 3.11 ± 0.315
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.422 ± 0.248
MetCys: 0.222 ± 0.113
MetAsp: 1.422 ± 0.259
MetGlu: 1.644 ± 0.355
MetPhe: 0.755 ± 0.191
MetGly: 1.288 ± 0.226
MetHis: 0.178 ± 0.091
MetIle: 1.644 ± 0.308
MetLys: 3.332 ± 0.388
MetLeu: 1.51 ± 0.258
MetMet: 0.355 ± 0.115
MetAsn: 1.688 ± 0.246
MetPro: 0.666 ± 0.175
MetGln: 1.111 ± 0.233
MetArg: 0.622 ± 0.183
MetSer: 1.733 ± 0.27
MetThr: 1.377 ± 0.253
MetVal: 1.155 ± 0.235
MetTrp: 0.089 ± 0.062
MetTyr: 0.578 ± 0.176
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.909 ± 0.513
AsnCys: 0.533 ± 0.171
AsnAsp: 4.176 ± 0.462
AsnGlu: 4.753 ± 0.561
AsnPhe: 2.888 ± 0.354
AsnGly: 3.954 ± 0.427
AsnHis: 1.599 ± 0.254
AsnIle: 5.953 ± 0.616
AsnLys: 6.442 ± 0.619
AsnLeu: 5.553 ± 0.574
AsnMet: 2.177 ± 0.335
AsnAsn: 3.954 ± 0.528
AsnPro: 2.088 ± 0.294
AsnGln: 1.999 ± 0.254
AsnArg: 2.932 ± 0.36
AsnSer: 4.887 ± 0.523
AsnThr: 4.309 ± 0.392
AsnVal: 3.287 ± 0.353
AsnTrp: 0.622 ± 0.179
AsnTyr: 2.71 ± 0.269
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.155 ± 0.214
ProCys: 0.089 ± 0.058
ProAsp: 2.044 ± 0.281
ProGlu: 2.443 ± 0.408
ProPhe: 1.422 ± 0.284
ProGly: 0.711 ± 0.172
ProHis: 0.267 ± 0.104
ProIle: 2.177 ± 0.328
ProLys: 2.754 ± 0.335
ProLeu: 2.044 ± 0.336
ProMet: 0.355 ± 0.12
ProAsn: 1.377 ± 0.263
ProPro: 0.666 ± 0.155
ProGln: 0.844 ± 0.28
ProArg: 0.578 ± 0.161
ProSer: 1.777 ± 0.334
ProThr: 2.532 ± 0.353
ProVal: 1.999 ± 0.267
ProTrp: 0.089 ± 0.057
ProTyr: 1.155 ± 0.174
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.422 ± 0.249
GlnCys: 0.4 ± 0.155
GlnAsp: 1.91 ± 0.37
GlnGlu: 3.021 ± 0.552
GlnPhe: 1.288 ± 0.254
GlnGly: 2.088 ± 0.365
GlnHis: 0.622 ± 0.18
GlnIle: 2.044 ± 0.263
GlnLys: 2.976 ± 0.464
GlnLeu: 2.132 ± 0.308
GlnMet: 0.888 ± 0.216
GlnAsn: 1.91 ± 0.261
GlnPro: 1.199 ± 0.256
GlnGln: 1.599 ± 0.296
GlnArg: 1.155 ± 0.242
GlnSer: 1.866 ± 0.299
GlnThr: 1.91 ± 0.26
GlnVal: 1.555 ± 0.258
GlnTrp: 0.311 ± 0.104
GlnTyr: 1.288 ± 0.206
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.288 ± 0.282
ArgCys: 0.222 ± 0.101
ArgAsp: 2.177 ± 0.309
ArgGlu: 3.243 ± 0.465
ArgPhe: 1.866 ± 0.244
ArgGly: 1.51 ± 0.355
ArgHis: 0.711 ± 0.216
ArgIle: 3.465 ± 0.426
ArgLys: 2.754 ± 0.36
ArgLeu: 2.799 ± 0.39
ArgMet: 1.244 ± 0.227
ArgAsn: 2.488 ± 0.357
ArgPro: 0.977 ± 0.204
ArgGln: 1.422 ± 0.215
ArgArg: 1.466 ± 0.238
ArgSer: 1.91 ± 0.294
ArgThr: 2.221 ± 0.245
ArgVal: 2.31 ± 0.288
ArgTrp: 0.311 ± 0.119
ArgTyr: 1.599 ± 0.271
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.421 ± 0.42
SerCys: 0.622 ± 0.148
SerAsp: 4.753 ± 0.472
SerGlu: 5.064 ± 0.458
SerPhe: 3.065 ± 0.349
SerGly: 5.908 ± 0.774
SerHis: 0.755 ± 0.182
SerIle: 6.264 ± 0.547
SerLys: 6.664 ± 0.84
SerLeu: 5.553 ± 0.623
SerMet: 1.022 ± 0.234
SerAsn: 4.531 ± 0.596
SerPro: 1.555 ± 0.279
SerGln: 2.221 ± 0.303
SerArg: 2.621 ± 0.338
SerSer: 4.309 ± 0.518
SerThr: 4.531 ± 0.53
SerVal: 3.065 ± 0.37
SerTrp: 0.755 ± 0.196
SerTyr: 3.154 ± 0.343
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.51 ± 0.457
ThrCys: 0.267 ± 0.115
ThrAsp: 4.265 ± 0.431
ThrGlu: 4.798 ± 0.479
ThrPhe: 2.577 ± 0.417
ThrGly: 4.043 ± 0.604
ThrHis: 0.844 ± 0.17
ThrIle: 6.353 ± 0.584
ThrLys: 5.109 ± 0.429
ThrLeu: 4.753 ± 0.481
ThrMet: 1.066 ± 0.211
ThrAsn: 3.776 ± 0.397
ThrPro: 2.488 ± 0.348
ThrGln: 1.955 ± 0.305
ThrArg: 2.443 ± 0.247
ThrSer: 4.531 ± 0.511
ThrThr: 4.087 ± 0.361
ThrVal: 4.309 ± 0.614
ThrTrp: 0.8 ± 0.193
ThrTyr: 2.443 ± 0.344
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.399 ± 0.329
ValCys: 0.489 ± 0.181
ValAsp: 3.554 ± 0.385
ValGlu: 3.954 ± 0.331
ValPhe: 2.577 ± 0.305
ValGly: 2.976 ± 0.321
ValHis: 0.888 ± 0.253
ValIle: 3.554 ± 0.45
ValLys: 5.953 ± 0.572
ValLeu: 4.798 ± 0.525
ValMet: 1.111 ± 0.227
ValAsn: 3.643 ± 0.431
ValPro: 1.688 ± 0.281
ValGln: 1.555 ± 0.274
ValArg: 1.51 ± 0.24
ValSer: 5.331 ± 0.427
ValThr: 3.243 ± 0.521
ValVal: 4.176 ± 0.561
ValTrp: 0.711 ± 0.196
ValTyr: 2.355 ± 0.286
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.533 ± 0.143
TrpCys: 0.044 ± 0.06
TrpAsp: 0.755 ± 0.168
TrpGlu: 0.844 ± 0.188
TrpPhe: 0.311 ± 0.116
TrpGly: 0.666 ± 0.166
TrpHis: 0.267 ± 0.114
TrpIle: 0.888 ± 0.179
TrpLys: 0.666 ± 0.18
TrpLeu: 0.844 ± 0.199
TrpMet: 0.222 ± 0.124
TrpAsn: 1.022 ± 0.233
TrpPro: 0.089 ± 0.069
TrpGln: 0.355 ± 0.121
TrpArg: 0.489 ± 0.17
TrpSer: 0.844 ± 0.184
TrpThr: 0.622 ± 0.143
TrpVal: 0.533 ± 0.149
TrpTrp: 0.355 ± 0.115
TrpTyr: 0.533 ± 0.149
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.821 ± 0.319
TyrCys: 0.533 ± 0.144
TyrAsp: 2.044 ± 0.32
TyrGlu: 2.488 ± 0.39
TyrPhe: 2.044 ± 0.35
TyrGly: 2.355 ± 0.27
TyrHis: 0.666 ± 0.171
TyrIle: 3.154 ± 0.354
TyrLys: 3.998 ± 0.46
TyrLeu: 3.598 ± 0.379
TyrMet: 0.933 ± 0.175
TyrAsn: 3.598 ± 0.472
TyrPro: 1.155 ± 0.242
TyrGln: 1.066 ± 0.215
TyrArg: 1.244 ± 0.228
TyrSer: 3.065 ± 0.348
TyrThr: 1.821 ± 0.349
TyrVal: 2.399 ± 0.365
TyrTrp: 0.711 ± 0.18
TyrTyr: 1.599 ± 0.247
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 127 proteins (22511 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
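A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the reported error is taken to be the standard deviation across replicates (the page does not say which statistic is used), and the function names and toy sequences are hypothetical.

```python
import random
import statistics


def per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (in per mille) among all adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates.

    Assumption: the error is the sample standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the sample size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not real proteome data.
toy_proteins = ["AKEKE", "KEAA", "MKELL"]
print(per_mille(toy_proteins, "KE"), bootstrap_error(toy_proteins, "KE"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins in the proteome.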

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski