Amino acid dipeptide frequency for Clostridium phage CPD4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
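
As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a list of protein sequences and reports them in per mille. The function name, the one-letter input sequences, and the normalisation by the total number of pairs are assumptions made for this example; the page does not state how Proteome-pI counts or normalises, so its values may be computed differently.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: pairs overlap (positions 0-1, 1-2, ...) and never span
    two different proteins. Normalisation is by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the phage proteome.
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The toy input yields six pairs in total, two of which are KK, so KK accounts for one third of all pairs.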

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
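
The arithmetic behind the expected value can be sketched as follows. The `classify` helper and its labels are hypothetical, and the sketch assumes that the colour threshold is simply the uniform expectation of 1/400; the page does not say whether any tolerance is applied.

```python
N_STANDARD = 20                      # standard amino acids
N_PAIRS = N_STANDARD ** 2            # 400 ordered dipeptides
N_PAIRS_WITH_XAA = (N_STANDARD + 1) ** 2  # 441 when Xaa is included

expected_percent = 100 / N_PAIRS     # 0.25 %
expected_permille = 1000 / N_PAIRS   # 2.5 per mille


def classify(freq_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"    # would be shown in red
    if freq_permille < expected:
        return "under"   # would be shown in blue
    return "expected"


# Two values from the table: GluGlu and CysCys.
print(classify(9.649), classify(0.184))
```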

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.
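
A minimal sketch of the unit conversion, using the AlaAla value from the table as input. The helper names are made up for this example.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 3.319  # AlaAla frequency in per mille, from the table
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
print(f"{ala_ala} per mille = {fraction:.6f} = {percent:.4f} %")
```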

Residue order (first and second position): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Entries are grouped by the first residue and listed as dipeptide: frequency ± error, in ‰.

Ala
AlaAla: 3.319 ± 0.671
AlaCys: 0.43 ± 0.24
AlaAsp: 2.643 ± 0.483
AlaGlu: 3.442 ± 0.44
AlaPhe: 2.028 ± 0.506
AlaGly: 2.95 ± 0.562
AlaHis: 1.045 ± 0.262
AlaIle: 3.503 ± 0.523
AlaLys: 6.084 ± 0.818
AlaLeu: 4.732 ± 0.769
AlaMet: 1.475 ± 0.304
AlaAsn: 2.766 ± 0.358
AlaPro: 1.598 ± 0.359
AlaGln: 2.151 ± 0.285
AlaArg: 2.766 ± 0.431
AlaSer: 2.09 ± 0.422
AlaThr: 3.319 ± 0.448
AlaVal: 3.995 ± 0.672
AlaTrp: 0.492 ± 0.157
AlaTyr: 1.967 ± 0.25
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.43 ± 0.187
CysCys: 0.184 ± 0.111
CysAsp: 0.492 ± 0.147
CysGlu: 1.291 ± 0.274
CysPhe: 0.43 ± 0.162
CysGly: 0.738 ± 0.216
CysHis: 0.492 ± 0.197
CysIle: 0.676 ± 0.196
CysLys: 1.168 ± 0.368
CysLeu: 1.352 ± 0.294
CysMet: 0.184 ± 0.129
CysAsn: 1.291 ± 0.265
CysPro: 0.676 ± 0.307
CysGln: 0.615 ± 0.155
CysArg: 0.246 ± 0.091
CysSer: 0.615 ± 0.182
CysThr: 0.307 ± 0.124
CysVal: 0.553 ± 0.163
CysTrp: 0.061 ± 0.059
CysTyr: 1.106 ± 0.22
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.229 ± 0.3
AspCys: 0.615 ± 0.177
AspAsp: 4.179 ± 0.749
AspGlu: 6.392 ± 1.139
AspPhe: 2.704 ± 0.338
AspGly: 3.38 ± 0.552
AspHis: 0.553 ± 0.192
AspIle: 7.068 ± 0.608
AspLys: 5.777 ± 0.56
AspLeu: 7.191 ± 0.726
AspMet: 0.922 ± 0.183
AspAsn: 3.81 ± 0.562
AspPro: 2.766 ± 0.533
AspGln: 1.106 ± 0.246
AspArg: 2.458 ± 0.464
AspSer: 3.319 ± 0.395
AspThr: 3.134 ± 0.436
AspVal: 3.073 ± 0.406
AspTrp: 1.106 ± 0.249
AspTyr: 3.995 ± 0.542
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.47 ± 0.691
GluCys: 0.738 ± 0.221
GluAsp: 6.699 ± 0.83
GluGlu: 9.649 ± 1.458
GluPhe: 2.581 ± 0.361
GluGly: 5.593 ± 0.598
GluHis: 0.983 ± 0.21
GluIle: 6.146 ± 0.713
GluLys: 6.392 ± 0.691
GluLeu: 8.236 ± 0.688
GluMet: 2.581 ± 0.456
GluAsn: 3.442 ± 0.42
GluPro: 2.213 ± 0.37
GluGln: 1.844 ± 0.29
GluArg: 2.458 ± 0.526
GluSer: 3.38 ± 0.371
GluThr: 2.95 ± 0.439
GluVal: 5.962 ± 0.513
GluTrp: 0.86 ± 0.207
GluTyr: 2.95 ± 0.462
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.291 ± 0.231
PheCys: 0.184 ± 0.101
PheAsp: 2.397 ± 0.386
PheGlu: 2.827 ± 0.47
PhePhe: 0.86 ± 0.196
PheGly: 2.151 ± 0.479
PheHis: 0.553 ± 0.2
PheIle: 3.257 ± 0.449
PheLys: 3.503 ± 0.575
PheLeu: 2.335 ± 0.342
PheMet: 0.922 ± 0.253
PheAsn: 3.011 ± 0.475
PhePro: 1.045 ± 0.277
PheGln: 1.045 ± 0.262
PheArg: 1.414 ± 0.253
PheSer: 2.766 ± 0.412
PheThr: 3.073 ± 0.538
PheVal: 1.659 ± 0.336
PheTrp: 0.492 ± 0.172
PheTyr: 0.983 ± 0.313
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.827 ± 0.668
GlyCys: 0.922 ± 0.223
GlyAsp: 3.995 ± 0.441
GlyGlu: 4.241 ± 0.555
GlyPhe: 2.213 ± 0.326
GlyGly: 5.347 ± 0.736
GlyHis: 1.045 ± 0.254
GlyIle: 4.917 ± 0.571
GlyLys: 7.006 ± 0.759
GlyLeu: 5.531 ± 0.467
GlyMet: 1.536 ± 0.223
GlyAsn: 2.766 ± 0.406
GlyPro: 1.536 ± 0.332
GlyGln: 2.028 ± 0.335
GlyArg: 1.598 ± 0.394
GlySer: 4.241 ± 0.421
GlyThr: 3.81 ± 0.451
GlyVal: 4.671 ± 0.502
GlyTrp: 0.86 ± 0.21
GlyTyr: 3.196 ± 0.474
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.369 ± 0.134
HisCys: 0.369 ± 0.147
HisAsp: 0.799 ± 0.193
HisGlu: 1.352 ± 0.247
HisPhe: 0.676 ± 0.165
HisGly: 0.738 ± 0.175
HisHis: 0.676 ± 0.179
HisIle: 1.045 ± 0.242
HisLys: 1.352 ± 0.315
HisLeu: 1.782 ± 0.303
HisMet: 0.184 ± 0.113
HisAsn: 1.291 ± 0.273
HisPro: 0.86 ± 0.218
HisGln: 0.615 ± 0.193
HisArg: 0.676 ± 0.211
HisSer: 1.106 ± 0.311
HisThr: 0.983 ± 0.334
HisVal: 0.492 ± 0.182
HisTrp: 0.369 ± 0.165
HisTyr: 0.799 ± 0.194
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.995 ± 0.618
IleCys: 1.352 ± 0.275
IleAsp: 5.101 ± 0.565
IleGlu: 6.576 ± 0.755
IlePhe: 2.335 ± 0.355
IleGly: 3.749 ± 0.523
IleHis: 1.352 ± 0.301
IleIle: 5.224 ± 0.801
IleLys: 6.638 ± 0.587
IleLeu: 6.945 ± 0.515
IleMet: 1.721 ± 0.324
IleAsn: 5.408 ± 0.527
IlePro: 3.257 ± 0.43
IleGln: 3.011 ± 0.419
IleArg: 2.09 ± 0.327
IleSer: 4.425 ± 0.474
IleThr: 4.241 ± 0.562
IleVal: 4.609 ± 0.612
IleTrp: 0.922 ± 0.213
IleTyr: 3.073 ± 0.452
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.682 ± 0.745
LysCys: 0.983 ± 0.281
LysAsp: 5.777 ± 0.63
LysGlu: 8.727 ± 0.955
LysPhe: 3.196 ± 0.5
LysGly: 5.593 ± 0.49
LysHis: 1.905 ± 0.326
LysIle: 6.392 ± 0.639
LysLys: 9.157 ± 1.182
LysLeu: 8.42 ± 0.817
LysMet: 2.95 ± 0.362
LysAsn: 5.04 ± 0.712
LysPro: 3.38 ± 0.571
LysGln: 3.565 ± 0.481
LysArg: 3.81 ± 0.563
LysSer: 4.917 ± 0.735
LysThr: 4.179 ± 0.559
LysVal: 6.023 ± 0.499
LysTrp: 0.983 ± 0.256
LysTyr: 4.425 ± 0.573
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.917 ± 0.754
LeuCys: 1.598 ± 0.307
LeuAsp: 5.47 ± 0.512
LeuGlu: 7.129 ± 0.81
LeuPhe: 2.397 ± 0.46
LeuGly: 5.654 ± 0.615
LeuHis: 1.536 ± 0.329
LeuIle: 6.392 ± 0.793
LeuLys: 9.157 ± 0.908
LeuLeu: 6.269 ± 0.605
LeuMet: 1.721 ± 0.429
LeuAsn: 5.04 ± 0.556
LeuPro: 2.52 ± 0.402
LeuGln: 3.073 ± 0.452
LeuArg: 3.872 ± 0.563
LeuSer: 5.163 ± 0.521
LeuThr: 5.101 ± 0.581
LeuVal: 4.241 ± 0.445
LeuTrp: 0.983 ± 0.239
LeuTyr: 3.626 ± 0.577
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.536 ± 0.333
MetCys: 0.184 ± 0.113
MetAsp: 1.721 ± 0.34
MetGlu: 1.721 ± 0.327
MetPhe: 0.983 ± 0.302
MetGly: 1.352 ± 0.29
MetHis: 0.184 ± 0.091
MetIle: 1.721 ± 0.283
MetLys: 2.397 ± 0.409
MetLeu: 2.274 ± 0.402
MetMet: 0.307 ± 0.112
MetAsn: 1.659 ± 0.357
MetPro: 0.983 ± 0.252
MetGln: 0.615 ± 0.169
MetArg: 1.229 ± 0.253
MetSer: 1.229 ± 0.309
MetThr: 1.291 ± 0.351
MetVal: 0.983 ± 0.209
MetTrp: 0.123 ± 0.102
MetTyr: 0.799 ± 0.228
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.581 ± 0.379
AsnCys: 0.738 ± 0.203
AsnAsp: 2.827 ± 0.467
AsnGlu: 3.442 ± 0.509
AsnPhe: 2.028 ± 0.316
AsnGly: 3.565 ± 0.524
AsnHis: 0.983 ± 0.213
AsnIle: 4.671 ± 0.566
AsnLys: 6.453 ± 0.676
AsnLeu: 4.855 ± 0.565
AsnMet: 0.983 ± 0.264
AsnAsn: 4.179 ± 0.484
AsnPro: 2.028 ± 0.46
AsnGln: 2.028 ± 0.435
AsnArg: 2.889 ± 0.479
AsnSer: 4.609 ± 0.51
AsnThr: 2.889 ± 0.359
AsnVal: 2.889 ± 0.419
AsnTrp: 0.738 ± 0.23
AsnTyr: 2.335 ± 0.405
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.274 ± 0.488
ProCys: 0.553 ± 0.202
ProAsp: 2.52 ± 0.328
ProGlu: 3.073 ± 0.414
ProPhe: 0.738 ± 0.202
ProGly: 2.643 ± 0.473
ProHis: 0.492 ± 0.16
ProIle: 2.704 ± 0.528
ProLys: 3.503 ± 0.499
ProLeu: 1.782 ± 0.317
ProMet: 0.738 ± 0.222
ProAsn: 1.905 ± 0.306
ProPro: 0.492 ± 0.198
ProGln: 1.291 ± 0.274
ProArg: 1.475 ± 0.281
ProSer: 1.844 ± 0.351
ProThr: 2.397 ± 0.519
ProVal: 2.09 ± 0.379
ProTrp: 0.676 ± 0.218
ProTyr: 1.229 ± 0.268
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.458 ± 0.404
GlnCys: 0.553 ± 0.162
GlnAsp: 1.598 ± 0.255
GlnGlu: 2.766 ± 0.427
GlnPhe: 1.168 ± 0.249
GlnGly: 3.011 ± 0.349
GlnHis: 0.246 ± 0.109
GlnIle: 2.274 ± 0.362
GlnLys: 3.011 ± 0.458
GlnLeu: 2.95 ± 0.642
GlnMet: 0.738 ± 0.202
GlnAsn: 1.475 ± 0.283
GlnPro: 1.045 ± 0.217
GlnGln: 1.229 ± 0.262
GlnArg: 0.983 ± 0.221
GlnSer: 1.536 ± 0.295
GlnThr: 1.659 ± 0.282
GlnVal: 2.827 ± 0.46
GlnTrp: 0.553 ± 0.15
GlnTyr: 1.536 ± 0.313
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.397 ± 0.569
ArgCys: 0.369 ± 0.125
ArgAsp: 2.827 ± 0.339
ArgGlu: 3.073 ± 0.542
ArgPhe: 1.782 ± 0.296
ArgGly: 2.458 ± 0.384
ArgHis: 0.246 ± 0.13
ArgIle: 2.95 ± 0.434
ArgLys: 4.241 ± 0.647
ArgLeu: 3.503 ± 0.558
ArgMet: 0.983 ± 0.242
ArgAsn: 1.782 ± 0.336
ArgPro: 1.352 ± 0.303
ArgGln: 0.922 ± 0.25
ArgArg: 2.397 ± 0.487
ArgSer: 2.028 ± 0.346
ArgThr: 1.905 ± 0.3
ArgVal: 1.905 ± 0.384
ArgTrp: 0.43 ± 0.165
ArgTyr: 1.905 ± 0.401
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.766 ± 0.634
SerCys: 0.86 ± 0.23
SerAsp: 3.503 ± 0.34
SerGlu: 3.011 ± 0.46
SerPhe: 2.643 ± 0.435
SerGly: 4.978 ± 0.67
SerHis: 1.229 ± 0.271
SerIle: 3.995 ± 0.509
SerLys: 5.777 ± 0.492
SerLeu: 4.671 ± 0.531
SerMet: 1.106 ± 0.209
SerAsn: 3.257 ± 0.432
SerPro: 1.844 ± 0.353
SerGln: 2.581 ± 0.337
SerArg: 1.844 ± 0.386
SerSer: 2.704 ± 0.474
SerThr: 3.011 ± 0.542
SerVal: 3.319 ± 0.502
SerTrp: 0.738 ± 0.231
SerTyr: 2.397 ± 0.383
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.274 ± 0.395
ThrCys: 0.492 ± 0.193
ThrAsp: 3.565 ± 0.468
ThrGlu: 3.565 ± 0.458
ThrPhe: 2.09 ± 0.399
ThrGly: 4.056 ± 0.505
ThrHis: 1.106 ± 0.228
ThrIle: 4.425 ± 0.479
ThrLys: 4.364 ± 0.507
ThrLeu: 4.118 ± 0.431
ThrMet: 1.352 ± 0.327
ThrAsn: 2.213 ± 0.367
ThrPro: 2.889 ± 0.581
ThrGln: 1.905 ± 0.276
ThrArg: 2.643 ± 0.485
ThrSer: 3.319 ± 0.524
ThrThr: 2.274 ± 0.423
ThrVal: 3.073 ± 0.431
ThrTrp: 0.615 ± 0.212
ThrTyr: 2.151 ± 0.385
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.581 ± 0.405
ValCys: 0.983 ± 0.301
ValAsp: 4.487 ± 0.535
ValGlu: 4.794 ± 0.494
ValPhe: 2.95 ± 0.446
ValGly: 2.95 ± 0.421
ValHis: 0.738 ± 0.183
ValIle: 4.855 ± 0.544
ValLys: 6.576 ± 0.711
ValLeu: 4.425 ± 0.483
ValMet: 1.598 ± 0.255
ValAsn: 3.38 ± 0.496
ValPro: 2.151 ± 0.361
ValGln: 2.028 ± 0.366
ValArg: 2.581 ± 0.398
ValSer: 3.073 ± 0.434
ValThr: 2.827 ± 0.41
ValVal: 3.995 ± 0.596
ValTrp: 0.799 ± 0.234
ValTyr: 2.151 ± 0.37
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.983 ± 0.265
TrpCys: 0.184 ± 0.105
TrpAsp: 0.983 ± 0.197
TrpGlu: 1.229 ± 0.294
TrpPhe: 0.43 ± 0.166
TrpGly: 0.799 ± 0.173
TrpHis: 0.246 ± 0.114
TrpIle: 0.553 ± 0.163
TrpLys: 0.922 ± 0.223
TrpLeu: 1.045 ± 0.254
TrpMet: 0.123 ± 0.094
TrpAsn: 0.799 ± 0.221
TrpPro: 0.246 ± 0.116
TrpGln: 0.43 ± 0.138
TrpArg: 0.492 ± 0.197
TrpSer: 1.045 ± 0.239
TrpThr: 0.307 ± 0.161
TrpVal: 1.106 ± 0.238
TrpTrp: 0.184 ± 0.098
TrpTyr: 0.492 ± 0.187
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.414 ± 0.244
TyrCys: 0.615 ± 0.159
TyrAsp: 3.196 ± 0.421
TyrGlu: 2.827 ± 0.47
TyrPhe: 1.536 ± 0.413
TyrGly: 2.581 ± 0.439
TyrHis: 0.922 ± 0.218
TyrIle: 3.257 ± 0.441
TyrLys: 3.626 ± 0.593
TyrLeu: 3.442 ± 0.53
TyrMet: 1.106 ± 0.272
TyrAsn: 3.196 ± 0.498
TyrPro: 1.536 ± 0.283
TyrGln: 1.536 ± 0.237
TyrArg: 1.536 ± 0.299
TyrSer: 2.827 ± 0.48
TyrThr: 2.827 ± 0.405
TyrVal: 2.458 ± 0.41
TyrTrp: 0.615 ± 0.18
TyrTyr: 2.028 ± 0.302
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (16272 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
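
One way such a protein-level bootstrap could be carried out is sketched below. The page only states that 100 bootstrap replicates were used at the protein level, so the details here are assumptions: whole proteins are resampled with replacement, the frequency is recomputed per replicate, and the sample standard deviation of the replicates is reported as the error. The function names and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates.

    Assumption: each replicate draws len(proteins) whole proteins
    with replacement, so residues within a protein stay together.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not the real proteome.
toy = ["MKKLAEE", "AKKEEL", "MEELKK", "MLLKA"]
print(f"KK: {pair_frequency(toy, 'KK'):.3f} ± {bootstrap_error(toy, 'KK'):.3f}")
```

Resampling whole proteins rather than individual residues keeps the pairs inside each protein intact, which is presumably why the error is described as being estimated at the protein level.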

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski