Amino acid dipepetide frequency for Clostridium phage phiCDHM19

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.15AlaAla: 1.15 ± 0.32
0.639AlaCys: 0.639 ± 0.228
2.684AlaAsp: 2.684 ± 0.428
3.323AlaGlu: 3.323 ± 0.44
1.278AlaPhe: 1.278 ± 0.277
3.259AlaGly: 3.259 ± 0.728
0.32AlaHis: 0.32 ± 0.128
4.154AlaIle: 4.154 ± 0.475
5.113AlaLys: 5.113 ± 0.494
4.41AlaLeu: 4.41 ± 0.544
1.406AlaMet: 1.406 ± 0.295
2.876AlaAsn: 2.876 ± 0.438
0.511AlaPro: 0.511 ± 0.156
1.47AlaGln: 1.47 ± 0.513
1.981AlaArg: 1.981 ± 0.354
3.196AlaSer: 3.196 ± 0.473
3.515AlaThr: 3.515 ± 0.451
2.492AlaVal: 2.492 ± 0.362
0.703AlaTrp: 0.703 ± 0.21
1.534AlaTyr: 1.534 ± 0.328
0.0AlaXaa: 0.0 ± 0.0
Cys
0.447CysAla: 0.447 ± 0.153
0.32CysCys: 0.32 ± 0.145
0.895CysAsp: 0.895 ± 0.288
1.15CysGlu: 1.15 ± 0.289
0.703CysPhe: 0.703 ± 0.228
0.767CysGly: 0.767 ± 0.188
0.128CysHis: 0.128 ± 0.088
1.214CysIle: 1.214 ± 0.262
1.406CysLys: 1.406 ± 0.34
1.47CysLeu: 1.47 ± 0.259
0.511CysMet: 0.511 ± 0.156
1.023CysAsn: 1.023 ± 0.291
0.447CysPro: 0.447 ± 0.173
0.192CysGln: 0.192 ± 0.114
0.831CysArg: 0.831 ± 0.227
0.767CysSer: 0.767 ± 0.197
0.639CysThr: 0.639 ± 0.156
0.447CysVal: 0.447 ± 0.208
0.256CysTrp: 0.256 ± 0.108
0.511CysTyr: 0.511 ± 0.15
0.0CysXaa: 0.0 ± 0.0
Asp
2.429AspAla: 2.429 ± 0.365
0.895AspCys: 0.895 ± 0.281
3.004AspAsp: 3.004 ± 0.471
4.793AspGlu: 4.793 ± 0.522
2.94AspPhe: 2.94 ± 0.454
3.004AspGly: 3.004 ± 0.387
0.256AspHis: 0.256 ± 0.118
7.541AspIle: 7.541 ± 0.688
6.774AspLys: 6.774 ± 0.742
4.985AspLeu: 4.985 ± 0.421
1.023AspMet: 1.023 ± 0.218
3.643AspAsn: 3.643 ± 0.464
0.575AspPro: 0.575 ± 0.174
0.447AspGln: 0.447 ± 0.179
2.173AspArg: 2.173 ± 0.339
3.323AspSer: 3.323 ± 0.396
3.835AspThr: 3.835 ± 0.469
3.451AspVal: 3.451 ± 0.408
0.639AspTrp: 0.639 ± 0.199
2.237AspTyr: 2.237 ± 0.348
0.0AspXaa: 0.0 ± 0.0
Glu
4.282GluAla: 4.282 ± 0.6
1.086GluCys: 1.086 ± 0.276
5.241GluAsp: 5.241 ± 0.579
8.628GluGlu: 8.628 ± 0.854
3.196GluPhe: 3.196 ± 0.471
4.218GluGly: 4.218 ± 0.702
1.023GluHis: 1.023 ± 0.28
7.989GluIle: 7.989 ± 0.846
9.395GluLys: 9.395 ± 0.782
8.82GluLeu: 8.82 ± 0.781
1.917GluMet: 1.917 ± 0.462
6.902GluAsn: 6.902 ± 0.656
0.959GluPro: 0.959 ± 0.259
2.876GluGln: 2.876 ± 0.461
2.876GluArg: 2.876 ± 0.533
4.474GluSer: 4.474 ± 0.452
3.771GluThr: 3.771 ± 0.59
4.729GluVal: 4.729 ± 0.592
0.575GluTrp: 0.575 ± 0.202
4.09GluTyr: 4.09 ± 0.5
0.0GluXaa: 0.0 ± 0.0
Phe
1.342PheAla: 1.342 ± 0.281
0.639PheCys: 0.639 ± 0.176
2.94PheAsp: 2.94 ± 0.466
3.835PheGlu: 3.835 ± 0.493
1.726PhePhe: 1.726 ± 0.268
2.812PheGly: 2.812 ± 0.341
0.511PheHis: 0.511 ± 0.161
4.154PheIle: 4.154 ± 0.452
4.346PheLys: 4.346 ± 0.434
2.684PheLeu: 2.684 ± 0.407
1.023PheMet: 1.023 ± 0.211
2.684PheAsn: 2.684 ± 0.419
1.214PhePro: 1.214 ± 0.347
0.959PheGln: 0.959 ± 0.199
1.534PheArg: 1.534 ± 0.357
2.556PheSer: 2.556 ± 0.406
2.62PheThr: 2.62 ± 0.397
2.045PheVal: 2.045 ± 0.279
0.256PheTrp: 0.256 ± 0.11
1.789PheTyr: 1.789 ± 0.361
0.0PheXaa: 0.0 ± 0.0
Gly
2.684GlyAla: 2.684 ± 0.517
0.959GlyCys: 0.959 ± 0.26
2.429GlyAsp: 2.429 ± 0.508
4.09GlyGlu: 4.09 ± 0.414
2.94GlyPhe: 2.94 ± 0.49
4.857GlyGly: 4.857 ± 1.691
0.767GlyHis: 0.767 ± 0.308
4.346GlyIle: 4.346 ± 0.619
4.857GlyLys: 4.857 ± 0.53
3.962GlyLeu: 3.962 ± 0.361
0.895GlyMet: 0.895 ± 0.208
3.068GlyAsn: 3.068 ± 0.448
0.511GlyPro: 0.511 ± 0.178
1.214GlyGln: 1.214 ± 0.257
1.726GlyArg: 1.726 ± 0.329
2.492GlySer: 2.492 ± 0.415
2.876GlyThr: 2.876 ± 0.506
3.835GlyVal: 3.835 ± 0.522
0.639GlyTrp: 0.639 ± 0.197
3.196GlyTyr: 3.196 ± 0.602
0.0GlyXaa: 0.0 ± 0.0
His
0.128HisAla: 0.128 ± 0.087
0.256HisCys: 0.256 ± 0.127
0.575HisAsp: 0.575 ± 0.19
1.15HisGlu: 1.15 ± 0.295
0.383HisPhe: 0.383 ± 0.137
0.383HisGly: 0.383 ± 0.21
0.192HisHis: 0.192 ± 0.093
0.639HisIle: 0.639 ± 0.205
1.086HisLys: 1.086 ± 0.261
1.214HisLeu: 1.214 ± 0.279
0.256HisMet: 0.256 ± 0.118
0.703HisAsn: 0.703 ± 0.251
0.383HisPro: 0.383 ± 0.147
0.32HisGln: 0.32 ± 0.166
0.383HisArg: 0.383 ± 0.144
0.639HisSer: 0.639 ± 0.169
0.767HisThr: 0.767 ± 0.274
0.831HisVal: 0.831 ± 0.293
0.192HisTrp: 0.192 ± 0.102
0.575HisTyr: 0.575 ± 0.201
0.0HisXaa: 0.0 ± 0.0
Ile
5.049IleAla: 5.049 ± 0.648
1.214IleCys: 1.214 ± 0.314
6.391IleAsp: 6.391 ± 0.685
8.82IleGlu: 8.82 ± 0.802
2.812IlePhe: 2.812 ± 0.442
3.579IleGly: 3.579 ± 0.526
0.895IleHis: 0.895 ± 0.277
7.35IleIle: 7.35 ± 0.729
10.417IleLys: 10.417 ± 0.788
9.075IleLeu: 9.075 ± 0.951
1.789IleMet: 1.789 ± 0.322
6.647IleAsn: 6.647 ± 0.558
2.684IlePro: 2.684 ± 0.367
2.556IleGln: 2.556 ± 0.412
3.579IleArg: 3.579 ± 0.467
5.305IleSer: 5.305 ± 0.502
4.218IleThr: 4.218 ± 0.54
4.218IleVal: 4.218 ± 0.557
0.703IleTrp: 0.703 ± 0.255
3.899IleTyr: 3.899 ± 0.528
0.0IleXaa: 0.0 ± 0.0
Lys
5.432LysAla: 5.432 ± 0.588
0.959LysCys: 0.959 ± 0.231
6.774LysAsp: 6.774 ± 0.688
11.248LysGlu: 11.248 ± 0.878
3.579LysPhe: 3.579 ± 0.436
5.241LysGly: 5.241 ± 0.502
1.981LysHis: 1.981 ± 0.362
10.29LysIle: 10.29 ± 0.712
9.778LysLys: 9.778 ± 1.024
8.692LysLeu: 8.692 ± 0.793
2.94LysMet: 2.94 ± 0.436
8.5LysAsn: 8.5 ± 0.672
1.662LysPro: 1.662 ± 0.342
3.132LysGln: 3.132 ± 0.397
3.196LysArg: 3.196 ± 0.478
5.752LysSer: 5.752 ± 0.59
5.496LysThr: 5.496 ± 0.495
7.158LysVal: 7.158 ± 0.598
0.575LysTrp: 0.575 ± 0.168
4.026LysTyr: 4.026 ± 0.398
0.0LysXaa: 0.0 ± 0.0
Leu
4.41LeuAla: 4.41 ± 0.528
0.831LeuCys: 0.831 ± 0.2
5.113LeuAsp: 5.113 ± 0.64
8.628LeuGlu: 8.628 ± 0.876
3.004LeuPhe: 3.004 ± 0.44
3.899LeuGly: 3.899 ± 0.51
1.278LeuHis: 1.278 ± 0.295
6.519LeuIle: 6.519 ± 0.659
10.162LeuLys: 10.162 ± 0.713
6.135LeuLeu: 6.135 ± 0.713
2.109LeuMet: 2.109 ± 0.303
6.902LeuAsn: 6.902 ± 0.56
2.045LeuPro: 2.045 ± 0.454
2.301LeuGln: 2.301 ± 0.377
2.94LeuArg: 2.94 ± 0.635
5.752LeuSer: 5.752 ± 0.495
4.985LeuThr: 4.985 ± 0.522
5.049LeuVal: 5.049 ± 0.528
0.895LeuTrp: 0.895 ± 0.292
3.323LeuTyr: 3.323 ± 0.493
0.0LeuXaa: 0.0 ± 0.0
Met
1.214MetAla: 1.214 ± 0.27
0.383MetCys: 0.383 ± 0.145
1.406MetAsp: 1.406 ± 0.298
1.981MetGlu: 1.981 ± 0.274
1.15MetPhe: 1.15 ± 0.239
0.831MetGly: 0.831 ± 0.202
0.192MetHis: 0.192 ± 0.106
1.278MetIle: 1.278 ± 0.27
2.556MetLys: 2.556 ± 0.417
2.109MetLeu: 2.109 ± 0.348
0.447MetMet: 0.447 ± 0.194
1.789MetAsn: 1.789 ± 0.282
0.639MetPro: 0.639 ± 0.18
0.639MetGln: 0.639 ± 0.187
0.575MetArg: 0.575 ± 0.181
1.534MetSer: 1.534 ± 0.298
1.662MetThr: 1.662 ± 0.341
1.086MetVal: 1.086 ± 0.22
0.192MetTrp: 0.192 ± 0.081
0.895MetTyr: 0.895 ± 0.218
0.0MetXaa: 0.0 ± 0.0
Asn
3.387AsnAla: 3.387 ± 0.5
0.831AsnCys: 0.831 ± 0.229
3.132AsnAsp: 3.132 ± 0.509
5.88AsnGlu: 5.88 ± 0.706
2.876AsnPhe: 2.876 ± 0.422
3.835AsnGly: 3.835 ± 0.477
0.767AsnHis: 0.767 ± 0.205
7.861AsnIle: 7.861 ± 0.715
7.797AsnLys: 7.797 ± 0.745
5.88AsnLeu: 5.88 ± 0.5
1.662AsnMet: 1.662 ± 0.295
6.455AsnAsn: 6.455 ± 0.67
1.214AsnPro: 1.214 ± 0.277
1.598AsnGln: 1.598 ± 0.327
3.259AsnArg: 3.259 ± 0.439
5.305AsnSer: 5.305 ± 0.449
3.962AsnThr: 3.962 ± 0.69
4.729AsnVal: 4.729 ± 0.553
0.703AsnTrp: 0.703 ± 0.198
1.917AsnTyr: 1.917 ± 0.332
0.0AsnXaa: 0.0 ± 0.0
Pro
0.767ProAla: 0.767 ± 0.216
0.447ProCys: 0.447 ± 0.163
1.15ProAsp: 1.15 ± 0.233
1.086ProGlu: 1.086 ± 0.253
0.895ProPhe: 0.895 ± 0.225
1.023ProGly: 1.023 ± 0.259
0.128ProHis: 0.128 ± 0.086
1.789ProIle: 1.789 ± 0.31
1.917ProLys: 1.917 ± 0.33
1.342ProLeu: 1.342 ± 0.31
0.192ProMet: 0.192 ± 0.098
1.406ProAsn: 1.406 ± 0.265
0.192ProPro: 0.192 ± 0.114
0.639ProGln: 0.639 ± 0.169
0.895ProArg: 0.895 ± 0.247
1.662ProSer: 1.662 ± 0.251
1.662ProThr: 1.662 ± 0.362
0.959ProVal: 0.959 ± 0.227
0.192ProTrp: 0.192 ± 0.101
0.767ProTyr: 0.767 ± 0.214
0.0ProXaa: 0.0 ± 0.0
Gln
1.726GlnAla: 1.726 ± 0.42
0.192GlnCys: 0.192 ± 0.106
1.726GlnAsp: 1.726 ± 0.306
2.301GlnGlu: 2.301 ± 0.396
0.959GlnPhe: 0.959 ± 0.206
1.023GlnGly: 1.023 ± 0.227
0.256GlnHis: 0.256 ± 0.114
2.876GlnIle: 2.876 ± 0.605
2.237GlnLys: 2.237 ± 0.32
1.981GlnLeu: 1.981 ± 0.37
0.511GlnMet: 0.511 ± 0.167
2.237GlnAsn: 2.237 ± 0.291
0.192GlnPro: 0.192 ± 0.096
0.895GlnGln: 0.895 ± 0.26
1.214GlnArg: 1.214 ± 0.259
1.342GlnSer: 1.342 ± 0.308
2.045GlnThr: 2.045 ± 0.355
1.534GlnVal: 1.534 ± 0.311
0.32GlnTrp: 0.32 ± 0.14
1.534GlnTyr: 1.534 ± 0.28
0.0GlnXaa: 0.0 ± 0.0
Arg
1.342ArgAla: 1.342 ± 0.329
0.959ArgCys: 0.959 ± 0.295
1.917ArgAsp: 1.917 ± 0.395
3.579ArgGlu: 3.579 ± 0.525
1.789ArgPhe: 1.789 ± 0.294
1.342ArgGly: 1.342 ± 0.291
0.383ArgHis: 0.383 ± 0.215
3.515ArgIle: 3.515 ± 0.505
4.026ArgLys: 4.026 ± 0.446
3.643ArgLeu: 3.643 ± 0.547
1.15ArgMet: 1.15 ± 0.277
1.981ArgAsn: 1.981 ± 0.34
0.767ArgPro: 0.767 ± 0.23
0.639ArgGln: 0.639 ± 0.187
1.534ArgArg: 1.534 ± 0.252
2.045ArgSer: 2.045 ± 0.34
1.342ArgThr: 1.342 ± 0.313
2.492ArgVal: 2.492 ± 0.306
0.575ArgTrp: 0.575 ± 0.207
1.598ArgTyr: 1.598 ± 0.27
0.0ArgXaa: 0.0 ± 0.0
Ser
2.556SerAla: 2.556 ± 0.437
0.959SerCys: 0.959 ± 0.223
2.812SerAsp: 2.812 ± 0.362
4.09SerGlu: 4.09 ± 0.446
4.026SerPhe: 4.026 ± 0.536
3.771SerGly: 3.771 ± 0.519
0.575SerHis: 0.575 ± 0.174
4.857SerIle: 4.857 ± 0.548
7.669SerLys: 7.669 ± 0.707
4.282SerLeu: 4.282 ± 0.506
1.214SerMet: 1.214 ± 0.298
4.602SerAsn: 4.602 ± 0.495
1.406SerPro: 1.406 ± 0.297
1.598SerGln: 1.598 ± 0.299
2.045SerArg: 2.045 ± 0.315
3.707SerSer: 3.707 ± 0.544
3.196SerThr: 3.196 ± 0.39
3.579SerVal: 3.579 ± 0.416
0.767SerTrp: 0.767 ± 0.189
3.132SerTyr: 3.132 ± 0.393
0.0SerXaa: 0.0 ± 0.0
Thr
3.132ThrAla: 3.132 ± 0.425
0.767ThrCys: 0.767 ± 0.236
3.387ThrAsp: 3.387 ± 0.407
4.218ThrGlu: 4.218 ± 0.668
2.301ThrPhe: 2.301 ± 0.34
3.515ThrGly: 3.515 ± 0.624
0.575ThrHis: 0.575 ± 0.21
5.241ThrIle: 5.241 ± 0.435
5.432ThrLys: 5.432 ± 0.55
5.56ThrLeu: 5.56 ± 0.551
1.023ThrMet: 1.023 ± 0.273
3.771ThrAsn: 3.771 ± 0.488
1.342ThrPro: 1.342 ± 0.233
2.109ThrGln: 2.109 ± 0.397
2.045ThrArg: 2.045 ± 0.314
3.004ThrSer: 3.004 ± 0.387
3.515ThrThr: 3.515 ± 0.556
2.876ThrVal: 2.876 ± 0.482
0.447ThrTrp: 0.447 ± 0.19
2.812ThrTyr: 2.812 ± 0.427
0.0ThrXaa: 0.0 ± 0.0
Val
2.748ValAla: 2.748 ± 0.514
0.767ValCys: 0.767 ± 0.234
3.515ValAsp: 3.515 ± 0.483
5.113ValGlu: 5.113 ± 0.596
2.556ValPhe: 2.556 ± 0.44
2.684ValGly: 2.684 ± 0.438
0.32ValHis: 0.32 ± 0.121
4.538ValIle: 4.538 ± 0.541
5.305ValLys: 5.305 ± 0.555
4.857ValLeu: 4.857 ± 0.5
1.342ValMet: 1.342 ± 0.253
4.154ValAsn: 4.154 ± 0.588
1.534ValPro: 1.534 ± 0.229
1.981ValGln: 1.981 ± 0.365
1.853ValArg: 1.853 ± 0.367
4.154ValSer: 4.154 ± 0.532
3.899ValThr: 3.899 ± 0.601
3.835ValVal: 3.835 ± 0.636
0.639ValTrp: 0.639 ± 0.246
2.237ValTyr: 2.237 ± 0.36
0.0ValXaa: 0.0 ± 0.0
Trp
0.192TrpAla: 0.192 ± 0.106
0.256TrpCys: 0.256 ± 0.113
0.511TrpAsp: 0.511 ± 0.186
0.575TrpGlu: 0.575 ± 0.191
0.511TrpPhe: 0.511 ± 0.185
0.639TrpGly: 0.639 ± 0.22
0.064TrpHis: 0.064 ± 0.064
0.959TrpIle: 0.959 ± 0.21
1.278TrpLys: 1.278 ± 0.269
1.023TrpLeu: 1.023 ± 0.278
0.32TrpMet: 0.32 ± 0.136
0.767TrpAsn: 0.767 ± 0.357
0.0TrpPro: 0.0 ± 0.0
0.383TrpGln: 0.383 ± 0.16
0.256TrpArg: 0.256 ± 0.119
0.447TrpSer: 0.447 ± 0.23
0.32TrpThr: 0.32 ± 0.152
0.831TrpVal: 0.831 ± 0.221
0.0TrpTrp: 0.0 ± 0.0
0.256TrpTyr: 0.256 ± 0.114
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.598TyrAla: 1.598 ± 0.286
0.767TyrCys: 0.767 ± 0.203
2.365TyrAsp: 2.365 ± 0.447
2.556TyrGlu: 2.556 ± 0.503
2.045TyrPhe: 2.045 ± 0.302
1.662TyrGly: 1.662 ± 0.318
0.447TyrHis: 0.447 ± 0.16
4.09TyrIle: 4.09 ± 0.512
4.921TyrLys: 4.921 ± 0.558
4.154TyrLeu: 4.154 ± 0.58
0.767TyrMet: 0.767 ± 0.181
2.94TyrAsn: 2.94 ± 0.566
0.831TyrPro: 0.831 ± 0.211
1.15TyrGln: 1.15 ± 0.296
1.726TyrArg: 1.726 ± 0.33
3.387TyrSer: 3.387 ± 0.488
2.748TyrThr: 2.748 ± 0.475
1.853TyrVal: 1.853 ± 0.353
0.32TyrTrp: 0.32 ± 0.132
1.598TyrTyr: 1.598 ± 0.359
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 88 proteins (15648 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski